All the data you need.

Tag: Software Development

What are the major differences between Python and R for data science?
Both Python and R have vast software ecosystems and communities, so either language is suitable for almost any data science task. That said, there are some areas in which one is stronger than the other.
Expressiveness
Programmers like highly expressive programming languages, but programming managers do not. I wrote about this on Twitter a few months ago. Q: Why do people like Lisp so much? A: Because Lisp is so expressive. Q: Why don’t teams use Lisp much? A: Because Lisp is so expressive. Q: Why …
From shell to system
Routine computer tasks and system programming require different tools, though I’m not entirely sure why. Many people have thought about how inconsistent shells and system programming languages are and tried to unite them. Wouldn’t it be nice to use one language for everything? But attempts to bring system languages down …
Software analysis and synthesis
People who haven’t written large programs think that writing software is easy. All you have to do is break a big problem into smaller problems until you have something so small that it’s easy to program. The problem is putting the pieces back together. If you’ve only written small programs, …
Randomization audit
“How would you go about drawing a random sample?” I thought that was kind of a silly question. I was in my first probability class in college, and the professor started the course with this. You just take a sample, right? As with many things in life, it gets more …
Pretending OOP never happened
I ran across someone recently who says the way to move past object oriented programming (OOP) is to go back to simply telling the computer what to do, to clear OOP from your mind like it never happened. I don’t think that’s a good idea, but I also don’t think …
Short essays on programming languages
I saw a link to So You Think You Know C? by Oleksandr Kaleniuk on Hacker News and was pleasantly surprised. I expected a few comments about tricky parts of C, and found them, but there’s much more. The subtitle of the free book is And Ten More Short Essays …
A wrinkle in Clojure
Bob Martin recently posted a nice pair of articles, A Little Clojure and A Little More Clojure. In the first article he talks about how spare and elegant Clojure is. In the second article he shows how to write a program to list primes using map and filter rather than …
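As a rough illustration of the map-and-filter style the excerpt mentions, here is a minimal Python sketch. It is not the Clojure code from the articles; the names `is_prime` and `primes_up_to` are hypothetical and chosen only for this example.

```python
from math import isqrt


def is_prime(n):
    """Return True if n is prime, using trial division."""
    if n < 2:
        return False
    # map produces one boolean per candidate divisor d in [2, sqrt(n)];
    # n is prime only if none of them divides it evenly.
    return all(map(lambda d: n % d != 0, range(2, isqrt(n) + 1)))


def primes_up_to(limit):
    """List the primes in [2, limit] by filtering a range, with no explicit loop."""
    return list(filter(is_prime, range(2, limit + 1)))
```

Calling `primes_up_to(30)` would list the primes up to 30. Trial division is used here only because it is short; it is an assumption of this sketch, not a claim about the approach taken in the articles.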
Software metric outliers
Goodhart’s law says “When a measure becomes a target, it ceases to be a good measure.” That is, when people are rewarded on the basis of some metric, they’ll learn how to improve that metric, but not necessarily in a way that increases what you’re after. Here are three examples …
Computational survivalist
Some programmers and systems engineers try to do everything they can with basic command line tools on the grounds that someday they may be in an environment where that’s all they have. I think of this as a sort of computational survivalism. I’m not much of a computational survivalist, but …
New Software Development Initiatives Lead To Second Stage Of Big Data
The big data market is expected to be worth $189 billion by the end of this year. This is over a 50% increase in just four years. A number of factors are driving growth in big data. Demand for big data is part of the reason for the growth, but …
The hopeless task of the Unicode Consortium
Randall Munroe, author of xkcd, discussing Unicode on the Triangulation podcast: I am endlessly delighted by the hopeless task that the Unicode Consortium has created for themselves. … They started out just trying to unify a couple different character sets. And before they quite realized what was happening, they were …
Regular expressions and special characters
Special characters make text processing more complicated because you have to pay close attention to context. If you’re looking at Python code containing a regular expression, you have to think about what you see, what Python sees, and what the regular expression engine sees. A character may be special to …
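The three layers described above (what you see, what Python sees, what the regular expression engine sees) can be illustrated with a small sketch. The pattern and sample strings are made up for this example and do not come from the post.

```python
import re

# What you see in the source: a raw string containing a backslash and a period.
pattern = r"\."

# What Python sees: a two-character string (backslash, period).
pattern_length = len(pattern)

# What the regex engine sees: an escaped period, meaning a literal ".".
literal_matches = re.findall(pattern, "a.b.c")

# Without the backslash, the period is special to the regex engine
# and matches any character except a newline.
wildcard_matches = re.findall(".", "a.b")

# Without the r prefix, the backslash is special to Python itself,
# so it must be doubled to hand the same two characters to the engine.
same_pattern = "\\." == r"\."
```

Using raw strings for patterns keeps the Python layer out of the way, so only the regular expression engine's rules about special characters need attention.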
Contributing to open source projects
David Heinemeier Hansson presents a very gracious view of open source software in his keynote address at RailsConf 2019. And by gracious, I mean gracious in the theological sense. He says at one point “If I were a Christian …” implying that he is not, but his philosophy of software …
What is the real difference between Data Science and Software Engineering Teams?
Although there are many similarities between Software Development and Data Science, they also have three main differences: processes, tooling and behavior. Find out. In my previous article, I talked about model governance and holistic model management. I received a great response, along with some questions about the differences between …
Traditional Vs Machine Learning For Software Development Paradigms
Most practitioners agree that neural nets represent a shift in how software is written, as the industry's primary interest in artificial intelligence is currently focused on data-driven AI. Is machine learning helpful to speed up traditional software development? We all know how AI is becoming …
3 Important Things To Know About Cloud Automation
DevOps has become the go-to methodology that technology companies use to streamline the process of developing and releasing their apps and services. Old methodologies like the waterfall approach don’t deploy fast enough and at the scale you need these days. But to take full advantage of the speed, agility, convenience, …
Introducing a simple and intuitive Python API for UCI machine learning repository
Introducing a simple and intuitive API for the UCI machine learning portal, where users can easily look up a data set description, search for a particular data set they are interested in, and even download data sets categorized by size or machine learning task.