All the data you need.

Tag: SQL

Schema Evolution in Merge Operations and Operational Metrics in Delta Lake
Try this notebook to reproduce the steps outlined below We recently announced the release of Delta Lake 0.6.0, which introduces schema evolution and performance improvements in merge and operational metrics in table history. The key features in this release are: Support for schema evolution in merge operations (#170) – You …
Faster SQL Queries on Delta Lake with Dynamic File Pruning
There are two time-honored optimization techniques for making queries run faster in data systems: process data at a faster rate or simply process less data by skipping non-relevant data. This blog post introduces Dynamic File Pruning (DFP), a new data-skipping technique enabled by default in Databricks Runtime 6.1, which can …
Glow 0.3.0 Introduces New Large-Scale Genomic Analysis Features
In October of last year, Databricks and the Regeneron Genetics Center® partnered together to introduce Project Glow, an open-source analysis tool aimed at empowering genetics researchers to work on genomics projects at the scale of millions of samples. Since we introduced Glow, we have been busy at work adding new …
SQL current date — How to get the current date, time, month or year in postgreSQL?
Working with current dates and times in data science projects is quite common. In this episode of my SQL tutorial series I’ll show you the best functions that... The post SQL current date — How to get the current date, time, month or year in postgreSQL? appeared first on Data36.
SQL TRUNCATE TABLE and DROP TABLE (tutorial)
In this episode of the SQL tutorial series you’ll learn two simple but important commands that you’ll use frequently when working in SQL: TRUNCATE TABLE and DROP TABLE.... The post SQL TRUNCATE TABLE and DROP TABLE (tutorial) appeared first on Data36.
Cloud Data Science 9
Lots of announcements this week, so without delay, let’s get right to Cloud Data Science 9. News Google Announces Cloud SQL for Microsoft SQL ServerGoogle’s Cloud SQL now supports SQL … The post Cloud Data Science 9 appeared first on Data Science 101.
SQL Fundamentals Tutorial: Start Learning SQL Today!
Learn the fundamentals of SQL and start writing SQL queries to answer business questions in this free SQL fundamentals tutorial and interactive course. The post SQL Fundamentals Tutorial: Start Learning SQL Today! appeared first on Dataquest.
Beginner SQL Tutorial: Learn SQL Basics While Analyzing Bike-Sharing
Learn the basics of SQL and databases while analyzing a data set on bike rentals in this free beginner SQL tutorial. Then try our interactive courses! The post Beginner SQL Tutorial: Learn SQL Basics While Analyzing Bike-Sharing appeared first on Dataquest.
SQL Joins Tutorial: Working with Databases
When first learning SQL, it’s common to work with data in a single table. In the real world, databases generally have data in more than one table. If we want to be able to work with that data, we’ll have to combine multiple tables within a query. In this SQL …
Why You should Attend SQLSaturday – An Interview with John Byrnes
An interview with John Byrnes. Where has SQL taken his career? What does the SQLSaturday community mean to him? Why should someone attend a SQLSaturday event?
SQL Insert Tutorial: Inserting Records and DataFrames Into a Database
Master the art of the SQL Insert to add data to SQL and MySQL databases using SQL queries, as well as from within Python, and when using pandas. The post SQL Insert Tutorial: Inserting Records and DataFrames Into a Database appeared first on Dataquest.
NuoDB 4.0 Expands Cloud-native and Cloud-agnostic Capabilities of Distributed SQL Database
NuoDB, the distributed SQL database company, unveiled NuoDB 4.0, featuring expanded cloud-native and cloud-agnostic capabilities with support for Kubernetes Operators and Google Cloud and Azure public clouds. This includes the recently announced Kubernetes Operator to simplify and automate database deployments in Red Hat OpenShift.
YugaByte Commits to 100 Percent Open Source with Apache 2.0 License
YugaByte, a leader in open source distributed SQL databases, announced that YugaByte DB is now 100 percent open source under the Apache 2.0 license, bringing previously commercial features into the open source core. The move, in addition to other updates available now through YugaByte DB 1.3, allows users to more …
Themes and Conferences per Pacoid, Episode 11
Paco Nathan‘s latest article covers program synthesis, AutoPandas, model-driven data queries, and more. Introduction Welcome back to our monthly burst of themespotting and conference summaries. BTW, videos for Rev2 are up: https://rev.dominodatalab.com/rev-2019/ On deck this time ’round the Moon: program synthesis. In other words, using metadata about data science work …
New Course: SQL Intermediate for R Users
Learn more advanced SQL skills like Joins, Table Relations, and Advanced Queries, and learn to integrate SQL in our new interactive SQL course. The post New Course: SQL Intermediate for R Users appeared first on Dataquest.
Want a Job in Data? Learn SQL.
Learning SQL might not be as "sexy" as learning Python or R, but it's a fundamental skill for almost every data scientist and data analyst job. Here's why. The post Want a Job in Data? Learn SQL. appeared first on Dataquest.
Real Talk with A Data Scientist: The Future of Data Wrangling
At Springboard, we recently sat down with Michael Beaumier, a data scientist at Google, to discuss his transition into the field, what the interview process is like, the future of data wrangling, and the advice he has for aspiring data professionals.
New Course: SQL Fundamentals for R Users
Add a crucial data analysis and data science skill to your toolset by mastering the basics of SQL and learning to write queries using R programming skills. The post New Course: SQL Fundamentals for R Users appeared first on Dataquest.