SF Data Weekly - Pipeline Overhead - Airbnb, People.ai's Data Lake, Redshift Automatic WLM, Simplifying Data Pipelines

This week's issue is packed - PACKED! - with content on data pipelines, data lakes, and file formats.
 
October 22 · Issue #137
SF Data Weekly
My favorite piece this week is Airbnb's post on managing overhead in the data pipelines and operational workloads they run for the business. Pipeline overhead is the silent killer when it comes to meeting your SLAs.
Happy reading!

Lars

Our Pick
Airbnb: Scaling a Mature Data Pipeline — Managing Overhead
Data Pipelines
Simplifying the Data Pipeline - Dremio
ETL Using Python’s Petl
Data Storage
Redshift's Automatic WLM with Query Priority: A First Look at Performance - intermix.io
Data Lake: an asset or a liability?
Building a Data Lake in AWS - People.ai Engineering
Big Data File Formats - Clairvoyant Blog
Data Analysis
Unit testing Apache Spark Applications
Benchmarking Transformers: PyTorch and TensorFlow
Data Visualization
Dashboard Design Best Practices - 4 Key Principles | Sisense
Data-driven Products
[Podcast] LinkedIn Kafka with Nacho Solis - Software Engineering Daily
Data Engineering Jobs