After playing with GPT for some time, testing GenAI vendor solutions, designing my own, and reading feedback from other users, I uncovered a number of problems. Here I share some of the most common issues, and how to address them. They affect LLMs and synthetic data generation the most, including …
In the last two years, I published 5 machine learning and AI books, including one on synthetic data by Elsevier. This represents over 800 pages of compact, state-of-the-art material. The new addition features my most recent advances: the problems that I encountered with generative adversarial networks, and how I overcome …
Working as a data scientist is the dream of many IT professionals these days. It is no secret that data science is a skyrocketing field attracting young professionals and inspiring many to switch careers to data science. On one front are young professionals who study their courses in colleges to …
Sept. 14, 2023, 3:10 p.m.
Since starting my own AI / machine learning research lab over a year ago, I published 24 technical papers and 4 books, in addition to my articles on Data Science Central. Here I list the most popular ones in random order, with a short summary. The number attached to each …
April 30, 2023, 9:06 p.m.
Given the current economy, with large companies laying off machine learning employees in droves, one may wonder if spending 4 years and over $80k in education is worth it. How long will it take to get a job when competing with hundreds of candidates for the few listed positions? What …
April 25, 2023, 4:54 p.m.
Like many managers in the corporate world, until recently I thought you should not use these tools. The common view is that they are for small projects or classroom problems, not for the real world. Then, in the process of designing a new course, I had to work with notebooks. Because …
March 31, 2023, 6:35 p.m.
In less than 100 pages, the book covers all important topics about discrete chaotic dynamical systems. It also includes related time series and stochastic processes, ranging from introductory to advanced, in one and two dimensions. The author discusses state-of-the-art methods and new results in simple English. Yet, some mathematical …
March 22, 2023, 1:42 a.m.
Data is very valuable to organizations. Actionable insights that give an organization competitive advantage and help it run more efficiently can be extracted from the organization’s data. Therefore, data must be collected and stored. Databases are an organized way to store and query data. There are two main types of …
Getting into Data Science and landing your first job can be trickier than it looks. There are many tools, skill-sets, and subareas that you can work with when starting to work with data, and if you’re not familiar with them, choosing the right one for you can be confusing. In …
Irrational numbers such as π may have been the first ones used to create perfect randomness and strong cryptographic systems. They were also among the first ones to be dismissed, long ago. Since then, they have never been revisited and have been completely abandoned. Binary digits of numbers such as π are …
We’ve compiled a list of data engineering programs and courses that can take you from beginner to pro in no time. This guide will help you find the best data engineering course, based on your existing knowledge and comfort with programming. We analyzed the top courses available and came up …
Choosing the right data science course is essential to achieving your goals. It can do the following: Thankfully, we’ve done the research. We compared multiple courses and rated them on a 14-point scale. That way, you can make an informed decision without wasting time or money while preparing to launch …
Although it’s a crucial skill for any data practitioner, visualization is often taken for granted in many newcomers’ data science learning paths. It may seem trivial to plot a simple chart to show, for instance, that the revenue increased last month. Compared with other tasks, data visualization might seem overly …
If you’re an aspiring (or working) data professional, then you know that the cloud is the future of data management. One of the most powerful choices for managing data in the cloud is Microsoft Azure. Azure is a cloud computing platform and infrastructure created by Microsoft for building, deploying, and …
Synthetic data is used more and more to augment real-life datasets. It enriches them and allows black-box systems to correctly classify observations or predict values that are well outside of training and validation sets. In addition, it helps understand decisions from obscure systems such as deep neural networks. Thus, it …
Choosing the right Python course is crucial to your success. It can help you do the following: We all have different goals and learning styles. There’s no one-size-fits-all, “best” way to learn Python. This guide will help you find the best Python course for your situation. We analyzed dozens of …