SF Data Weekly - Mozilla's Data Pipeline, DB Migrations to Kafka, NLP in Spark, Cloud Warehouses

April 2 · Issue #61
SF Data Weekly
Our Pick
Getting started with Data Engineering
Data Pipelines
Overview of Mozilla's Data Pipeline
Migrating from an In-House Deployment Agent to AWS CodeDeploy and AWS CodePipeline
Number of HTTP 500 errors, before and after moving to AWS CodeDeploy.
Data Storage
How to Integrate your Databases with Apache Kafka and CDC
The Kafka Connect API is a core component of Apache Kafka.
How to Know if Apache Kafka is Right for You
Data Analysis
Analysing 1.4 Billion Rows with Python
Introducing the Natural Language Processing Library for Apache Spark
High-level structure of Spark's ML and NLP libraries.
Data Visualization
The Architecture of a Data Visualization
A portion of Accurat's design methodology.
Data-driven Products
Separation of Metadata and Data: Building a Cloud Native Warehouse, Part 3
Data Engineering Jobs
Data Engineer - Scribd
Data Engineer - Bosch
At the end of each SF Data Weekly issue you can find job postings that are relevant to all members of our community. 🎉
If you want to post a job for your company, you can do it here.