All the data you need.

Tag: Model Context

On Being Model-driven: Metrics and Monitoring
This article covers a couple of key Machine Learning (ML) vital signs to consider when tracking ML models in production to ensure model reliability, consistency and performance in the future. Many thanks to Don Miner for collaborating with Domino on this article. For additional vital signs and insight beyond what …
Understanding Causal Inference
This article covers causal relationships and includes a chapter excerpt from the book Machine Learning in Production: Developing and Optimizing Data Science Workflows and Applications by Andrew Kelleher and Adam Kelleher. A complementary Domino project is available. Introduction As data science work is experimental and probabilistic in nature, data scientists …
Towards Predictive Accuracy: Tuning Hyperparameters and Pipelines
This article provides an excerpt of “Tuning Hyperparameters and Pipelines” from the book Machine Learning with Python for Everyone by Mark E. Fenner. The excerpt and complementary Domino project evaluate hyperparameters using GridSearch and RandomizedSearch and build an automated ML workflow. Introduction Data scientists, machine learning (ML) researchers, …
Themes and Conferences per Pacoid, Episode 11
Paco Nathan’s latest article covers program synthesis, AutoPandas, model-driven data queries, and more. Introduction Welcome back to our monthly burst of themespotting and conference summaries. BTW, videos for Rev2 are up: https://rev.dominodatalab.com/rev-2019/ On deck this time ’round the Moon: program synthesis. In other words, using metadata about data science work …
Make Machine Learning Interpretability More Rigorous
This Domino Data Science Field Note covers a proposed definition of machine learning interpretability, why interpretability matters, and the arguments for considering a rigorous evaluation of interpretability. Insights are drawn from Finale Doshi-Velez’s talk, “A Roadmap for the Rigorous Science of Interpretability” as well as the paper, “Towards a Rigorous …
Feature Engineering: A Framework and Techniques
This Domino Field Note provides highlights and excerpted slides from Amanda Casari’s “Feature Engineering for Machine Learning” talk at QCon Sao Paulo. Casari is the Principal Product Manager + Data Scientist at Concur Labs. Casari is also the co-author of the book, Feature Engineering for Machine Learning: Principles and Techniques …
Model Management and the Era of the Model-Driven Business
Over the past few years, we’ve seen a new community of data science leaders emerge. Regardless of their industry, we have heard three themes emerge over and over: 1) Companies are recognizing that data science is a competitive differentiator. 2) People are worried their companies are falling behind — that …
Put Models at the Core of Business Processes
At Rev, Nick Elprin, Domino’s CEO, continued to provide insights on managing data science based upon years of candid discussions with customers. He also delved into how data science leaders can utilize model management and help their companies become successful model-driven organizations. This blog post provides a distilled summary of …
Model Evaluation
This Domino Data Science Field Note provides some highlights of Alice Zheng’s report, “Evaluating Machine Learning Models”, including evaluation metrics for supervised learning models and offline evaluation mechanisms. The full in-depth report, which also covers offline vs. online evaluation mechanisms, hyperparameter tuning, and potential A/B testing pitfalls, is available …
Data Science Models Build on Each Other
Alex Leeds presented “Building Up Local Models of Customers” at a Domino Data Science Popup. Leeds discussed how the Squarespace data science team built models to address a key business challenge and used a complex organizational structure to accelerate data science work. This Domino Data Science Field Note …
On Ingesting Kate Crawford’s “The Trouble with Bias”
Kate Crawford discussed bias at a recent SF-based City Arts and Lectures talk, and a recording of the discussion will be broadcast on May 6th on KQED and local affiliates. Members of Domino were in the live audience for the City Arts talk. This Domino Data Science Field Note provides insights …
The Machine Learning Reproducibility Crisis
Pete Warden is the Technical Lead on the TensorFlow Mobile Embedded Team at Google doing Deep Learning. He is formerly the CTO of Jetpac, which was acquired by Google. He is also an Apple alumnus and blogs at petewarden.com. This post candidly discusses some of the real-world reproducibility challenges …