All the data you need.

Tag: ETL

Data Platforms – A journey. The Yesteryears, Today, and What Lies Ahead
In this contributed article, Darshan Rawal, Founder and CEO of Isima, explains how the data ecosystem has exploded in the last decade to deal with multi-structured data sources. But the fundamental architecture of using queues, caches, and batches to support Enterprise Data Warehousing and BI hasn't. This article looks at …
Announcing the Launch of SQL Analytics
Today, we announced the new SQL Analytics service to provide Databricks customers with a first-class experience for performing BI and SQL workloads directly on the data lake. This launch brings to life a new experience within Databricks that data analysts and data engineers are going to love. The service provides …
Why Cloud Centric Data Lake is the future of EDW
In this first of two blogs, we want to talk about WHY an organization might want to look at a... The post Why Cloud Centric Data Lake is the future of EDW appeared first on Databricks.
How Automation Helps You Exploit the Value in Big Data
In this sponsored post, Simon Shah spearheads marketing at Redwood Software to support continued market growth and innovation for their cloud-based IT and business process automation solutions. He believes that by using automation to collect and manage your big data processes, you will truly exploit its value for the business.
Top 5 Reasons to Convert Your Cloud Data Lake to a Delta Lake
If you examine the agenda for any of the Spark Summits in the past five years, you will notice that there is no shortage of talks on how best to architect a data lake in the cloud using Apache Spark™ as the ETL and query engine and Apache Parquet as …
Retail and Consumer Goods Sessions You Don’t Want to Miss at Spark + AI Summit 2020
The current economic environment is having a significant impact on the Retail and Consumer Goods sector. Rapid changes in how consumers shop is forcing companies to rethink their sales, marketing, and supply chain strategies. Companies can still reduce costs and win market share to drive stronger growth, but this requires …
Monitor Your Databricks Workspace with Audit Logs
Cloud computing has fundamentally changed how companies operate – users are no longer subject to the restrictions of on-premises hardware deployments such as physical limits of resources and onerous environment upgrade processes. With the convenience and flexibility of cloud services comes challenges on how to properly monitor how your users …
Matillion Launches Matillion ETL for Azure Synapse Empowering Users with Data Transformation Capabilities for Rapid Access to Insights
Matillion, a leading provider of data transformation software for cloud data warehouses (CDWs), announced the availability of Matillion ETL for Azure Synapse to enable data transformations in complex IT environments, at scale. Empowering enterprises to achieve faster time to insights by loading, transforming, and joining together data, the release extends …
Building a Modern Clinical Health Data Lake with Delta Lake
The healthcare industry is one of the biggest producers of data. In fact, the average healthcare organization is sitting on nearly 9 petabytes of medical data. The rise of electronic health records (EHR), digital medical imagery, and wearables are contributing to this data explosion. For example, an EHR system at …
New Data Ingestion Network for Databricks: The Partner Ecosystem for Applications, Database, and Big Data Integrations into Delta Lake
Organizations have a wealth of information siloed in various sources, and pulling this data together for BI, reporting and machine learning applications is one of the biggest obstacles to realizing business value from data. The data sources vary from operational databases such as Oracle, MySQL, etc. to SaaS applications like …
Do You Actually Need a Data Lake?
In this contributed article, Eran Levy, Director of Marketing at Upsolver, sets out to formally define "data lake" and then goes on to ask whether your organization needs a data lake by examining 5 key indicators.
Simplify Advertising Analytics Click Prediction with Databricks Unified Analytics Platform
Advertising teams want to analyze their immense stores and varieties of data requiring a scalable, extensible, and elastic platform. Advanced analytics, including but not limited to classification, clustering, recognition, prediction, and recommendations allow these organizations to gain deeper insights from their data and drive business outcomes. As data of various …
Analyze Games from European Soccer Leagues with Apache Spark and Databricks
Introduction The global sports market is huge, comprised of players, teams, leagues, fan clubs, sponsors, etc., and all of these entities interact in myriad ways generating an enormous amount of data. Some of that data is used internally to help make better decisions, and there are a number of use …