|
June 24 · Issue #245 · View online |
|
This week’s pick shows how you can map anything to a vector, using Reddit as an example. We also have the first in a series evaluating the use of different Python libraries to build Choropleths, as well as a remembrance of the found of Exploratory Data Analysis. Stay Healthy!
|
|
|
Anything2Vec: Mapping Reddit into Vector Spaces 💥 | by Cameron Raymond | Towards Data Science
Using Word2Vec to map Reddit communities to a vector in a way that is tied back to the real world.
|
|
Build an Apache Iceberg data lake using Amazon Athena, Amazon EMR, and AWS Glue | Amazon Web Services
Iceberg is an open-source table format for storing data in data lakes. This step-by-step loads an example dataset into S3 buckets in the Iceberg format.
|
Overview: NetSuite Ecommerce Integration | Integrate.io Blog
The NetSuite platform includes built-in analytics, however, these metrics are limited. That’s why successful companies move their data from NetSuite to an external platform to make it easier to generate actionable insights. [Sponsored]
|
|
Inconsistent thoughts on database consistency
Understanding the different concepts of consistency as applied to distributed databases, as well as some issues with the conversation of consistency.
|
Phases of Database Growth and Cost
When it comes to database performance, many think they are solving technical problems, but are solving a money problem. There are 2 phases to database-cost management, and understanding which phase you are in will help you understand your options for growth.
|
|
How To Productize ML Faster With MLOps Automation | by yaron haviv | Towards Data Science
In MLOps live survey the #1 data science challenge raised was “bringing machine learning to production.“ This post suggest a solution using automation.
|
Remembrances of Things EDA | Nightingale
Published for Tukey Day, June 16, the 107th anniversary of John Tukey’s birth, this article recounts his influence on Exploratory Data Analysis.
|
|
The Battle of Choropleths — Part 1 | by Francis Adrian Viernes | Jun, 2022 | Towards Data Science
The first installation of the battle of the Choropleths (really, the battle of Choropeth tools) uses Geopandas to create a test Choropleth.
|
Bridging the Gap between Devs and D3.js with this One Tool | by Antonio Jesus Ayala | Jun, 2022 | Medium
D3.js is a visualization library that lets React developers create visualizations, but it can be difficult. This piece introduces ad3lie, a tool to help create D3 visualizations interactively in a GUI.
|
|
How We Built Infrastructure to Run User Forecasts at Spotify - Spotify Engineering : Spotify Engineering
A description of Spotify’s internal system to allow user forecasts to run automatically or on demand.
|
How GE Proficy Manufacturing Data Cloud replatformed to improve TCO, data SLA, and performance | Amazon Web Services
How GE moved its MDC system from Predix to Redshift, Glue and Airflow hosted on AWS.
|
|
|
Did you enjoy this issue?
|
|
|
|
In order to unsubscribe, click here.
If you were forwarded this newsletter and you like it, you can subscribe here.
|
|
650 California St., San Francisco, CA 94108
|