|
February 12 · Issue #176 · View online |
|
Our pick this week is the ultimate dogfooding exercise - using the techniques and tools of data science to find a new house. We also have some great examples of the power of AWS Athena, and a look at a python library called Sweetviz. Stay healthy!
|
|
|
The Perks of Data Science: How I Found My New Home in Dublin | by Andrea Ialenti | Feb, 2021 | Towards Data Science
Using Data Science to find a new home in a high demand market, featuring a number of tools in the Google Cloud Platform, with lots of detail and code snippets.
|
|
How EMX reduced data pipeline costs by 85% with Amazon Athena | Amazon Web Services
EMX’s pipeline is dumping data into S3, then using Athena SQL to aggregate the data and push it into Redshift. This post explains their pipeline and shows how it saved them significant costs over their traditional data warehouse.
|
Are You Still Using Pandas to Process Big Data in 2021? | by Roman Orac | Feb, 2021 | Towards Data Science
Benchmarks for two python big data libraries that handle datasets larger than those that can fit in-memory when using Pandas.
|
Xplenty | Simplified ETL & ELT to BigQuery, Snowflake, Redshift & Azure
Rapid data preparation and transformation for ever-evolving and changing data requirements. Secure & compliant data pipelines. Get started today. [Sponsored Content]
|
|
Seven Tips for Forecasting Cloud Costs (with FB’s Prophet) | by Gad Benram | Feb, 2021 | DoiT International
Using Facebook’s Prophet, a time-series forecasting tool, to forecast future usage of cloud computing resources.
|
Data Partitioning. Key benefits after reading this blog | by Suyash Namdeo | Enjoy Algorithms | Feb, 2021 | Medium
A basic introduction to partitioning, and the benefits and drawbacks of different partitioning approaches.
|
|
Time series decomposition — ETS model using Python | by Jayashree domala | Artificial Intelligence in Plain English | Feb, 2021 | Medium
Error-Trend-Seasonality (ETS) is a model used for time series decomposition to understand, as the name implies, trend and seasonality in a dataset. This piece uses pandas and matplotlib to perform ETS analysis.
|
Automating Exploratory Data Analysis- Part 1 | by Himanshu Sharma | The Startup | Feb, 2021 | Medium
Using Sweetviz, an open-source python library that has a HTML interface, for EDA. A nice introduction to a powerful library.
|
|
Step-Up Your Visualization: Bar Chart Race | by Francis Adrian Viernes | The Startup | Feb, 2021 | Medium
Bar chart races are a popular animated data visualization technique. This piece shows how to make them quick and easy using a python package.
|
Building AWS Data Lake visualizations with Amazon Athena and Tableau | Amazon Web Services
Athena runs SQL queries against data stored in AWS S3 data lakes. This post shows how to use Athena and Tableau Desktop to create data visualizations quickly and economically.
|
|
Supercharging Apache Superset | by Airbnb | Airbnb Engineering & Data Science
How Airbnb customized Apache Superset for business intelligence at scale - 2K users, 50K SQL queries, 6K dashboard views and 125K graph views per week.
|
We increased the data limits of our Free Plan by 100x and re-thought our online pricing strategy. Here’s why. - Mixpanel
Mixpanel, a data visualization tool, launched an all new Free Plan with 100x the data volume of their old one, and made changes to their pricing curve to give startups more runway for growth. This post explains why.
|
|
|
Did you enjoy this issue?
|
|
|
|
If you don't want these updates anymore, please unsubscribe here.
If you were forwarded this newsletter and you like it, you can subscribe here.
|
|
650 California St., San Francisco, CA 94108
|