SF Data Weekly - Netflix High Availability, Spark vs. Hadoop, Kafka Adoption Journey

May 7 · Issue #66
Our Pick
Tips for High Availability | Netflix Technology Blog
Data Pipelines
Copy Data to Amazon Redshift Using AWS Data Pipeline
Event Stream Processing Architecture on Azure with Apache Kafka and Spark
Kafka and Spark for event ingestion and stream processing.
How to schedule a BigQuery ETL job with Dataprep
Data Storage
What Keeps Apache Kafka from Eating the World?
Confluent's vision of Apache Kafka.
Hadoop 3: Comparison with Hadoop 2 and Spark
An excerpt of the comparison between the three platforms.
Using SQL to Query Kafka, MongoDB, MySQL, PostgreSQL and Redis with Presto
Data Analysis
Benchmarking Apache Spark on a Single Node Machine
PySpark vs Pandas performance test: calculating max value of large datasets.
Data Visualization
10 Visualizations to Try in Amazon QuickSight with Sample Data
A typical Amazon QuickSight workflow in 8 steps.
Data-driven Products
Weather Data Set: Open data on AWS | AWS Big Data Blog
The HSDS service responds to the request volume by elastically scaling resources.
Data Engineering Jobs
Data Engineer - Centro
Big Data Engineer - Avenue Code
At the end of each SF Data Weekly issue you can find job postings that are relevant to all members of our community. 🎉
If you want to post a job for your company, you can do it here.