View profile

SF Data Weekly - Microservices in Kafka, Airflow Tutorial, Google Colaboratory, Spark Summit 2017

November 13 · Issue #41 · View online
SF Data Weekly
Our Pick
How To Prepare For A Data Engineering Job In Silicon Valley
Data Pipelines
Building a Microservices Ecosystem with Kafka Streams and KSQL
A top-level view of a microservices system build with Kafka Streams.
Airflow Tutorial for Data Pipelines
An view of directed acyclic graphs in Airflow.
The Data Pipeline – Analytics at the Speed of Business
Data Storage
Introducing Vectorized UDFs for PySpark
Vectorized UDFs perform much better than row-at-a-time UDFs across the board.
Spark Summit EU 2017 Recap and Reflections
Cassandra NoSQL Data Model Design
Data Analysis
Google Colaboratory — Simplifying Data Science Workflow
Data Visualization
10 Years of Data Science Visualizations
A timeline view of preferred visualization tools in the past 10 years.
Data-driven Products
10 Companies Using Machine Learning in Cool Ways
A simplified diagram illustrating the key stages of a NLP system in Baidu.
Did you enjoy this issue?
If you don't want these updates anymore, please unsubscribe here
If you were forwarded this newsletter and you like it, you can subscribe here
Powered by Revue
650 California St., San Francisco, CA 94108