View profile

SF Data Weekly - Mozilla's Data Pipeline, DB Migrations to Kafka, NLP in Spark, Cloud Warehouses

April 2 · Issue #61 · View online
SF Data Weekly
Our Pick
Getting started with Data Engineering
Data Pipelines
Overview of Mozilla's Data Pipeline
Migrating from an In-House Deployment Agent to AWS CodeDeploy and AWS CodePipeline
Number of HTTP 500 errors, before and after moving to AWS CodeDeploy.
Data Storage
How to Integrate your Databases with Apache Kafka and CDC
The Kafka Connect API is a core component of Apache Kafka.
How to Know if Apache Kafka is Right for You
Data Analysis
Analysing 1.4 Billion Rows with Python
Introducing the Natural Language Processing Library for Apache Spark
High-level structure of Spark's ML and NLP libraries.
Data Visualization
The Architecture of a Data Visualization
A portion of Accurat's design methodology.
Data-driven Products
Separation of Metadata and Data: Building a Cloud Native Warehouse, Part 3
Data Engineering Jobs
Data Engineer - Scribd
Data Engineer - Bosch
At the end of each SF Data Weekly issue you can find job postings that are relevant to all members of our community. 🎉
If you want to post a job for your company, you can do it here.
Did you enjoy this issue?
If you don't want these updates anymore, please unsubscribe here
If you were forwarded this newsletter and you like it, you can subscribe here
Powered by Revue
650 California St., San Francisco, CA 94108