View profile

SF Data Weekly - ETL vs ELT, Warehouse vs Data Lake, Kafka Batch to Real-time, Airflow, Databot Pipes

September 4 · Issue #83 · View online
SF Data Weekly
Our Pick
Data-Driven? Think Again
Data Pipelines
Airflow, Meta Data Engineering, and a Data Platform for the World's Largest Democracy
How to Use Apache Kafka to Transform a Batch Pipeline into a Real-time One
The architecture of a real-time pipeline, with micro services in different colors.
Databot: High Performance Python Data Driven Programming Framework for Web Crawler, ETL, Data Pipeline Work
Data Storage
ETL vs ELT or Data Warehouse vs Data Lake
From raw data to insights and analytics.
Hooking up Spark and Scylla: Part 1
A typical Spark application.
Google Cloud Platform for AWS Professionals
Data Analysis
A Business Stakeholder’s Quick Start Guide to Useful Analytics
Detecting Image Similarity Using Spark, LSH and TensorFlow
Data Visualization
How to Visualize and Understand Your MongoDB Data
Using Compass to visualize MongoDB data.
Data-driven Products
MusicVAE: Creating a Palette for Musical Scores with Machine Learning.
Data Engineering Jobs
Data Engineer - PlanGrid
Senior Data Engineer - Instacart
If you want to post a job for your company, you can do it here.
Did you enjoy this issue?
If you don't want these updates anymore, please unsubscribe here
If you were forwarded this newsletter and you like it, you can subscribe here
Powered by Revue
650 California St., San Francisco, CA 94108