|
December 11 · Issue #167 · View online |
|
Holidays are coming, and if you’re looking for a good read while you’re relaxing and regrouping, our pick is a good long read about data storytelling. There’s also a nice introduction to the Pomegranate statistical library, as well as a interesting way to visualize our plastic footprint. Stay healthy!
|
|
|
The Art of Data Storytelling in Six Big Ideas | by Kristi Pelzel | The Innovation | Dec, 2020 | Medium
A seven-part series (part 6 is not a big idea, just good examples) is a good read for your holiday downtime, to learn new skills or refresh existing ones.
|
|
Preparing data for ML models using AWS Glue DataBrew in a Jupyter notebook | Amazon Web Services
AWS Glue DataBrew is a new visual data preparation tool that makes it easy for data analysts and data scientists to clean and normalize data to prepare it for analytics and machine learning (ML). This post examines a sample ML use case and describes a DataBrew pipeline for that case.
|
Xplenty | Simplified ETL & ELT to BigQuery, Snowflake, Redshift & Azure
Rapid data preparation and transformation for ever-evolving and changing data requirements. Secure & compliant data pipelines. Get started today. [Sponsored Content]
|
|
AWS announces AQUA for Amazon Redshift (preview)
AQUA (Advanced Query Accelerator) for Amazon Redshift is available in preview. AQUA provides a new distributed and hardware accelerated cache that brings compute to the storage layer for Amazon Redshift and delivers up to 10x faster query performance.
|
Controlling data lake access across multiple AWS accounts using AWS Lake Formation | Amazon Web Services
It’s not uncommon for organizations to have data stored in data lakes owned by different AWS accounts. This post shows how to control those accounts using AWS Lake Formation.
|
|
Statistical modeling with “Pomegranate” —fast and intuitive | by Tirthajyoti Sarkar | Dec, 2020 | Towards Data Science
Pomegranate is a delicious fruit. It can also be a super useful Python library for statistical analysis. This article shows how.
|
A practical guide for better-looking python code | by Oleg Polivin | Dec, 2020 | Medium
How to set up a CI/CD pipeline using GitHub, inspired by two sources of good Python coding practices.
|
|
My Plastic Footprint: a Physical Data Visualisation Project | by Kat Greenbrook | Nightingale | Dec, 2020 | Medium
Trying to add empathy to data visualization, using our plastic consumption as an example.
|
Amazon QuickSight: 2020 in review | Amazon Web Services
2020 is arguably the year that QuickSight became a real competitor to other data visualization tools, and this piece is a good round-up of the major features added this year.
|
|
Financial Times Data Platform: From zero to hero | by Mihail Petkov | FT Product & Technology | Dec, 2020 | Medium
How the 130 year-old Financial Times built a data platform to analyze how readers interact with the publication.
|
Toward a Better Quality Metric for the Video Community | by Netflix Technology Blog | Dec, 2020 | Netflix TechBlog
VMAF is an open-source video quality metric that Netflix jointly developed with university collaborators. This piece highlights updates and a new API.
|
|
|
Did you enjoy this issue?
|
|
|
|
If you don't want these updates anymore, please unsubscribe here.
If you were forwarded this newsletter and you like it, you can subscribe here.
|
|
650 California St., San Francisco, CA 94108
|