All the data you need.

Tag: Machine Learning

Improving Customer Experience With Transaction Enrichment
The retail banking landscape has dramatically changed over the past five years with the accessibility of open banking applications, mainstream adoption of Neobanks and the recent introduction of tech giants into the financial services industry. According to a recent Forbes article, millennials now represent 75% of the global workforce, and …
AI Gets the Glory but ML is Quietly Making Fortunes
In this contributed article, technologist Bernard Brode looks at the differences between AI and ML, look at why AI gets all the attention, and why we shouldn’t overlook the everyday revolution that ML is already creating.
Building Forward-Looking Intelligence With External Data
This post was written in collaboration with the Foursquare data team. We thank co-author Javier Soliz, sales engineer specializing in data engineering and geospatial analysis at Foursquare, for his contribution. “In an interlocked global economy, triggering events can quickly set off a chain reaction,” wrote Boston Consulting Group in early …
Rise of the Lakehouse
With the fast-moving evolution of the data lake, Billy Bosworth and Ali Ghodsi share their mutual thoughts on the top 5 common questions they get asked about data warehouses, data lakes, and lakehouses. Coming from different backgrounds, they each provide unique and valuable insights into this market. Ali has spent …
Investment in Lending Club - Loan Default & Investor ROI Prediction
Introduction In financial industry, banks have historically handled most consumer and small business lending to a great extent. However, banks have some key limitations like interest rates are not individualized, loan decisions can take months, regulation process is strong and the costs of underwriting loans are high. Peer-to-peer (P2P) lending …
Data-driven Software: Towards the Future of Programming in Data Science
This is a guest authored post by Tim Hunter, data scientist, and Rocío Ventura Abreu, data scientist, of ABN AMRO Bank N.V. Data science is now placed at the center of business decision making thanks to the tremendous success of data-driven analytics. However, more stringent expectations around data quality control, …
Predicting House Prices with XGBoost
LinkedIn | GitHub | Email | Data | Web App | Notebook Introduction “Location, location, location.” The likelihood that you will hear that phrase if you are looking into purchasing a house, apartment, condo, or timeshare, is .9999999999. (Yes, I performed that study myself.) However, there are many other factors …
House Price Predictions in Ames, Iowa
Github Repo | LinkedIn Introduction and Objective Statement In this project I will be analyzing house prices for the town of Ames, Iowa, using a dataset provided for a Kaggle competition. This dataset contains 1,460 observations and 80 features that capture many aspects involved in predicting the market value of …
TOP 10 insideBIGDATA Articles for April 2021
In this continuing regular feature, we give all our valued readers a monthly heads-up for the top 10 most viewed articles appearing on insideBIGDATA. Over the past several months, we’ve heard from many of our followers that this feature will enable them to catch up with important news and features …
“Above the Trend Line” – Your Industry Rumor Central for 4/30/2021
Above the Trend Line: your industry rumor central is a recurring feature of insideBIGDATA. In this column, we present a variety of short time-critical news items grouped by category such as M&A activity, people movements, funding news, financial results, industry alignments, customer wins, rumors and general scuttlebutt floating around the …
Automated Background Removal in E-commerce Fashion Image Processing Using PyTorch on Databricks
This is a guest blog from Simona Stolnicu, a data scientist and machine learning engineer at Wehkamp, an e-commerce company, where her team builds data pipelines for machine learning tasks at scale. Wehkamp is one of the biggest e-commerce companies in the Netherlands, with more than 500,000 daily visitors on …
Using Artificial Intelligence Tools to Run Proactive “Health Check” Investigations
In this contributed article David Carns, Chief Revenue Officer of Casepoint, discusses how businesses can use AI to run “health check” investigations preemptively. If your organization is already using an eDiscovery platform with built-in AI tools, it might make sense to explore how you can use those tools for broader …
insideBIGDATA Latest News – 4/27/2021
In this regular column, we’ll bring you all the latest industry news centered around our main topics of focus: big data, data science, machine learning, AI, and deep learning. Our industry is constantly accelerating with new products and services being announced everyday. Fortunately, we’re in close touch with vendors from …
Reproduce Anything: Machine Learning Meets Lakehouse
Machine learning has proved to add unprecedented value to organization and projects – whether that’s for accelerating innovation, personalization, demand forecasting and countless other use cases. However, machine learning (ML) leverages data from a myriad of sources with an ever-changing ecosystem of tools and dependencies, making these solutions constantly in …
How Yum! Brands Uses Location Data from Foursquare to Make Smarter Decisions
Join this virtual event with a compelling panel of technology leaders to discuss to discover how Yum! Brands and other organizations are leveraging location-based data to boost in-app location accuracy, increase in-store foot traffic, and expand e-commerce business.
KDD 2021 Data Science Conference Will Convene Aug. 14 – 18, 2021
The Association for Computing Machinery (ACM) Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD) announced KDD 2021, the group's flagship conference, will take place virtually Aug. 14-18. The premier interdisciplinary data science conference, KDD 2021 will bring together researchers and practitioners from data science, machine learning, big data …
JADBio Provides AutoML for BioMed Data
JADBio is an AI startup company working with BioMed data. This remarkable team, headed by Prof. Ioannis Tsamardinos, has created an automated machine learning (AutoML) platform designed for life scientists. No Coding. No Statistics. No Math. No Problem ... just add data.
ALT Highlights – An Interview with Joelle Pineau
Welcome to ALT Highlights, a series of blog posts spotlighting various happenings at the recent conference ALT 2021, including plenary talks, tutorials, trends in learning theory, and more! To reach a broad audience, the series will be disseminated as guest posts on different blogs in machine learning and theoretical computer …