View profile

SF Data Weekly - Presto at Pinterest, Redshift at Intuit, Facebook Open-sources DLRM, NoSQL and DynamoDB, Benchmarking Spark Performance

July 29 · Issue #124 · View online
SF Data Weekly
Our Pick
Data Science for Startups: Data Pipelines
Components in a managed Analytics Architecture (GCP).
Data Pipelines
How to Amortize Cloud Spend and Reserved Instances with Amazon Redshift - Intuit's Transformational Use Case
Amazon Redshift Concurrency Scaling - A Guide and Our Test Results
Database performance: cluster scaling.
Orchestrate an ETL process using AWS Step Functions for Amazon Redshift | AWS
A workflow using Step Functions adding fresh data to the Redshift warehouse.
Microservices, Apache Kafka, and Domain-Driven Design | Confluent
An example architecture of an event streaming system.
Building a Real-Time Anomaly Detection Experiment With Kafka and Cassandra - DZone AI
Anomaly detection application design.
Data Storage
Data Modeling in AWS DynamoDB - The Startup
Data Analysis
Presto at Pinterest - Pinterest Engineering
Spark UDF — Deep Insights in Performance | QuantumBlack
Presto deployment at Pinterest.
Data Visualization
Data Visualization for Dashboards and Reports - Prototypr
Data-driven Products
Facebook Open-sources Deep Learning Recommendation Model (DLRM)
Butterfly shuffle for the all-to-all (personalized) communication.
Data Engineering Jobs
Did you enjoy this issue?
If you don't want these updates anymore, please unsubscribe here
If you were forwarded this newsletter and you like it, you can subscribe here
Powered by Revue
650 California St., San Francisco, CA 94108