Category Archives: Apache Spark

Four Really Real Meanings of Real-Time

Here are four different things that I believe real-time really means, and how to determine which meaning you’re using. Sub-Second Response Generally, when engineers say “real-time”, they are usually referring to sub-second response time. In this kind of real-time data processing, nanoseconds count. Extreme levels of performance are key to success…. Read more »

Apache Spark resources for self-study

#apachespark pic.twitter.com/KZsiNeBpSf — Trieu Nguyen (@tantrieuf31) November 11, 2015 https://www.mapr.com/ebooks/spark/01-what-is-apache-spark.html https://dzone.com/articles/using-apache-spark-and-mysql-for-data-analysis http://www.infoq.com/articles/apache-spark-introduction https://www.edx.org/course/introduction-big-data-apache-spark-uc-berkeleyx-cs100-1x https://www.edx.org/course/scalable-machine-learning-uc-berkeleyx-cs190-1x