SF Data Weekly - TensorFlow at Twitter, Mixpanel's Customer Cost, ETL with AWS Lambda, Stream Processing

June 18 · Issue #72
SF Data Weekly
Our Pick
Making a Metric
The resulting chart of the ratio of CEO pay to median employee pay.
Data Pipelines
Do We Need Distributed Stream Processing?
Cluster throughput for the Yahoo Streaming Benchmark.
Orchestrate Multiple ETL Jobs Using AWS Step Functions and AWS Lambda
ETL orchestration architecture and the main flow of events.
Oracle Data to Google BigQuery Using Google Cloud Dataflow and Dataprep
The data pipeline used to load and process data from Oracle to BigQuery.
Data Storage
How to Create a Fast and Globally Available User Profiling System by Using Amazon DynamoDB Global Tables
A diagram of a possible use of DynamoDB global tables.
Microsoft's Newest Data Center Is a Giant Metal Can at the Bottom of the Sea
A bus-sized metal cylinder will be used to carry 12 racks of servers underwater.
Data Analysis
Twitter Meets TensorFlow
TensorBoard is one of the reasons Twitter chose TensorFlow instead of PyTorch.
How We Track Customer Costs in Mixpanel
Per-user infrastructure costs over time.
Data Visualization
Introducing Semiotic for Data Visualization
Combinatorial charts and custom effects using sketchy rendering.
Data-driven Products
Will Your Money Last If You Retire Early? Visualizing Longevity Risk
Longevity risk shown through an interactive chart.
Data Engineering Jobs
Data Engineer - Automatic
Big Data Engineer - Braintree
Data Engineering Intern - Castlight Health
If you want to post a job for your company, you can do it here.
650 California St., San Francisco, CA 94108