View profile

SF Data Weekly - Data Engineering at Unruly, Airflow at Plaid, Python Stream Processing, Hadoop Migrations

August 7 · Issue #79 · View online
SF Data Weekly
Our Pick
Data Engineering Tech at Unruly | Unruly Engineering
The latest data architecture at Unruly.
Data Pipelines
Getting Ramped-Up on Airflow with MySQL → S3 → Redshift
An overview of the workflow used for regular data movement.
Hey, Data Teams - We're Working on a Tool Just for You | GitLab
Decoupling Systems with Apache Kafka, Schema Registry and Avro | Confluent
Data Storage
Oracle vs. Hadoop
Migrating Hulu’s Hadoop Clusters to a New Data Center — Part Two: Creating a Mirrored Hadoop Instance
DCValidator Architecture - tool used to compare the contents of two clusters.
Data Analysis
MLflow: A Platform for Managing the Machine Learning Lifecycle
Faust - Python Stream Processing
How F1 and Others are Moving Beyond Descriptive Analytics
Data Visualization
Your Friendly Guide to Colors in Data Visualisation | Chartable
Color selection is different for continuous and categorical data.
Data-driven Products
A Custom WordPress Dashboard with MongoDB Atlas, Microsoft Azure, & Serverless Functions
A peak at the sales reporting section in the dashboard.
Data Engineering Jobs
Data Engineer - Robinhood
Data Engineer - Automattic
If you want to post a job for your company, you can do it here.
Did you enjoy this issue?
If you don't want these updates anymore, please unsubscribe here
If you were forwarded this newsletter and you like it, you can subscribe here
Powered by Revue
650 California St., San Francisco, CA 94108