View profile

SF Data Weekly - Spark at Facebook, Azure Pipelines, Ebay Data Governance, AWS Deep Learning

October 16 · Issue #89 · View online
SF Data Weekly
Our Pick
How Companies Turn Your Data Into Money
Rank of companies in tracking web site visits.
Data Pipelines
A Fast, Serverless, Big Data Pipeline Powered by a Single Azure Function | Microsoft Azure
Azure Function pipeline architecture
Build a Real-time Data Pipeline During the Weekend in Go
Architectural design of the implemented pipeline.
How is AWS Redshift so fast? - DEV Community 👩‍💻👨‍💻
Using Amazon CloudFront and AWS Media Services
Delivering video services through AWS cloud.
Data Storage
Tips for Migrating to Apache HBase on Amazon S3 from HDFS | AWS Big Data Blog
Three options of migrating HDFS data to HBase on S3.
Processing Petabytes of Data in Seconds with Databricks Delta | The Databricks Blog
Big Data Governance: Hive Metastore Listener for Apache Atlas Use Cases
Data Analysis
Using Apache Spark for Large-scale Language Model Training | Facebook Code
Performance comparison of Spark vs Hive language model training.
Get Started with Deep Learning Using the AWS Deep Learning AMI | AWS Machine Learning Blog
A/B Testing: The Definitive Guide to Improving Your Product
Comparing two versions of a flow using A/B testing.
Data Visualization
The Differences in How CNN, MSNBC, and FOX Cover the News
A triangle of words distribution over channels CNN, FOX and MSNBC.
Data-driven Products
INRIX Global Traffic Scorecard
Ranking of cities based on average time spent in traffic.
Data Engineering Jobs
Data Engineer (Machine Learning Focused) - Toyota Research Institute
If you want to post a job for your company, you can do it here.
Did you enjoy this issue?
If you don't want these updates anymore, please unsubscribe here
If you were forwarded this newsletter and you like it, you can subscribe here
Powered by Revue
650 California St., San Francisco, CA 94108