View profile

SF Data Weekly - Visualizing a Year of Pandemic, Hive and Vaex

April 16 · Issue #185 · View online
SF Data Weekly
This week’s pick is a great Instacart review of how the year of pandemic shopping went, and how they identified groups like the “locked down and lovelorn.”
We’ve also got a piece on how Hive is still relevant and useful, and how to use Vaex to turbocharge your Python DataFrames. Stay Healthy!

Our Pick
Beyond the Cart: A Year of Essential Insights | by Instacart | Apr, 2021 | Instacart News
Data Pipelines
Build Slowly Changing Dimensions Type 2 (SCD2) with Apache Spark and Apache Hudi on Amazon EMR | Amazon Web Services
Xplenty | Simplified ETL & ELT for Flat File Integration
Data Storage
Old but gold: implementing a Hive Metastore Infrastructure | by Rafael Ribaldo | QuintoAndar Tech Blog | Apr, 2021 | Medium
Role of Foreign Data Wrappers in Migrations to PostgreSQL
Data Analysis
How to process a DataFrame with millions of rows in seconds | by Roman Orac | Apr, 2021 | Towards Data Science
What People Don’t Get About That “50% of Small Businesses Fail in 5 Years” Statistic | by Rachel Presser | Apr, 2021 | Medium
Data Visualization
COVID-19 VACCINATION: How Far Have We Gone? | by Ibukun Ogunnaike | Apr, 2021 | Medium
COVID-19 Dashboard Tracker Data Visualization | by AnyChart | Apr, 2021 | Medium
Data-driven Products
Our Journey Towards Cloud Efficiency | by Jennifer Rice | Airbnb Engineering & Data Science | Apr, 2021 | Medium
Seven lessons from the 2020 ArcGIS StoryMaps competition winners
Should You Get a Data Analyst Job? | Better Programming
Did you enjoy this issue?
If you don't want these updates anymore, please unsubscribe here.
If you were forwarded this newsletter and you like it, you can subscribe here.
Powered by Revue
650 California St., San Francisco, CA 94108