|
March 19 · Issue #181 · View online |
|
This week’s pick is a data analytics approach to lottery picks that worked. We also have tips for better PrestoDB performance on AWS and some cool visualizations of floods in Madagascar. Stay Healthy!
|
|
|
I Used Data Analytics to Play the Free Lottery (and Won) | by John Bica | Mar, 2021 | Towards AI
Can you win the lottery using data analytics? Using python, OCR, and data analysis to create a program to pick numbers for the Yotta Savings lottery.
|
|
How to build a DAG Factory on Airflow | by Axel Furlan | Mar, 2021 | Towards Data Science
Isn’t it weird the amount of boilerplate code necessary in order to execute 2 simple python scripts on Airflow? If your DAGs share similar values, it’s probably better to write a Factory class to build your DAG programmatically. This piece explains how.
|
Get a Complete View of Salesforce Data with MongoDB
For our Salesforce users with Mongo databases, Xplenty is having a live demo on April 8th, @11am PT. Learn how to combine multiple data types to get a single, unified view of customer data. [Sponsored Content]
|
|
Top 9 performance tuning tips for PrestoDB on Amazon EMR | Amazon Web Services
Presto is a popular distributed SQL query engine for interactive data analytics. With its massively parallel processing (MPP) architecture, it’s capable of directly querying large datasets without the need of time-consuming and costly ETL processes. This article will help you run Presto faster on AWS EMR.
|
High Performance MySQL. Having a healthy relational database… | by Ferhat Can | Star Gazers | Mar, 2021 | Medium
MySQL isn’t usually the first pick for big data, but sometimes you have to use the tools you have. If you’re in that position, this is the first in a series of articles that dives pretty deep into MySQL tuning.
|
|
Digging into the Texas Freeze using Python and ERA5 zarr data | by Eneli Toodu | Planet OS (by Intertrust) | Mar, 2021 | Medium
This year’s February cold snap in Texas, analyzed and including a Jupyter notebook if you want to take it further.
|
A Simple Way to Trace Code in Python | by Edward Krueger | Mar, 2021 | Towards Data Science
A reusable way to trace functions in Python, something every analyst who’s using Python needs to hunt down those last few troublesome bugs.
|
|
Flood Detection and Monitoring using Satellite Imagery with Python | by Syam Kakarla | Mar, 2021 | Towards Data Science
Using USGS Earth Explorer and Sentinel hub images to visualize floods, with the 2020 Madagascar floods as an example. This has some really cool visualizations plus the code that created them.
|
How and Why We Sketch When Visualizing Data | by Dee Williams | Nightingale | Mar, 2021 | Medium
Two data researchers who say they can’t draw explain how they use simple stick figures and other drawings to sketch out their data stories.
|
|
Data Engineers of Netflix — Interview with Dhevi Rajendran | by Netflix Technology Blog | Mar, 2021 | Medium
Dhevi is a backend software engineer who made the transition to data engineering at Netflix. She explains why she made the switch, and why Netflix.
|
Migrating a Data Warehouse is Hard. Here’s How Zocdoc Pulled It Off. | by Zocdoc Engineering | Zocdoc Engineering Blog | Mar, 2021 | Medium
Zocdoc is a medical appointment booking service that had a data warehouse which quadrupled in size (and cost) in less than two years. How they made a choice that reduced cost and increased performance.
|
|
|
Did you enjoy this issue?
|
|
|
|
If you don't want these updates anymore, please unsubscribe here.
If you were forwarded this newsletter and you like it, you can subscribe here.
|
|
650 California St., San Francisco, CA 94108
|