Would you like to see your article featured in SFData? Email us at submissions@sfdata.io for consider
|
September 9 · Issue #131 · View online |
|
Would you like to see your article featured in SFData? Email us at submissions@sfdata.io for consideration in next weeks issue.
|
|
|
An Engineer’s Perspective on Engineering and Data Science Collaboration for Data Products
At Coursera, engineers and data scientists have built many data products. They’ve learned that building a data product is a team sport. This post outlines three themes that worked well in their pursuit of this goal from an engineering perspective.
|
|
Transform Your AWS Data Lake using Databricks Delta and the AWS Glue Data Catalog Service
Delta Lake is an open source storage layer that brings reliability to data lakes. In this blog post Databricks shows how to reliably and efficiently transform your AWS Data Lake into a Delta Lake seamlessly using the AWS Glue Data Catalog service.
|
A Real Showcase of Kafka at Wirecard Brazil
Wirecard is a Brazilian company building payment solutions for businesses. This is their journey of the development of their platform, with all the ups and downs, and how Kafka solves many of their problems.
|
Kafka implemented architecture
|
|
Should I Enable Amazon Redshift's Automatic WLM? - intermix.io
Workload Management (WLM) configuration allows you to maximize your query throughput on Amazon Redshift. This post explains when / why you should use Redshift’s AutoWLM along with real world examples of the feature in action.
|
Result of implementing Auto WLM
|
Explaining SQL and NoSQL, to Grandma
Over the last 15 years, many new databases have come to the market as part of the No-SQL movement. These include key-value stores such as Redis and Amazon DynamoDB, wide-column stores such as Cassandra and HBase, document stores such as MongoDB and Couchbase, and graph databases and search engines such as Elasticsearch and Solr. This is a high-level overview of SQL and NoSQL.
|
Building a Distributed Time-series Database on PostgreSQL
TimescaleDB, a time-series database on PostgreSQL, has been production-ready for over two years, with millions of downloads and production deployments worldwide. This month, for the first time, Timescale publicly shares their design, plans, and benchmarks for the distributed version of TimescaleDB.
|
|
A/B Testing Design & Execution
This series of articles was designed to explain how to use Python in a simplistic way to fuel your company’s growth by applying the predictive approach to all your actions.
|
Evaluate the Top BI Tools by Capability & Price including Looker, Mode, Tableau and more.
Cloud warehouses like Amazon Redshift, Google BigQuery and Snowflake have brought down the cost and complexity to build a data platform, in a shift away from Hadoop, with BI tools as the catalyst to make data exploration and visualization available to a much wider audience. This post evaluates the top business intelligence tools (BI tools) by capability & price, featuring Tableau, Chartio, Looker, Mode, Metabase, PowerBI, Qlik and Domo.
|
|
Data Studio Showdown: Dashboards vs Reports
Do your clients need automated dashboards or narrative reporting? Learn how to create (and deliver) each using Google’s free Data Studio.
|
|
Business Analytics and Insights: A Time of Transformation
Blendo can be the link between your data sources and your BI platform, enabling powerful business analytics and actionable insights to power every team in their interactions with customers.
|
|
|
Did you enjoy this issue?
|
|
|
|
If you don't want these updates anymore, please unsubscribe here.
If you were forwarded this newsletter and you like it, you can subscribe here.
|
|
650 California St., San Francisco, CA 94108
|