My Production Data Science Workflow

Hello World,

So I’ve spent a while now looking at 3 competing languages and I did my best to give each one a fair shake. Those 3 languages were F#, Python and R. I have to say it was really close for a while because each language has its strengths and weaknesses. That said, I am moving forward with 2 languages and a very specific way I use each one. I wanted to outline this, because for me it has taken a very long time to learn all of the languages to the level that I have to discover this and I would hate for others to go through the same exercise.

Continue reading

Merging Data Sets in Python

Hello World,

So this article is inspired by a customer doing financial analysis who can only grab a certain amount of data at a time from the data steward’s stores in chunks based on time windows. As time is constantly moving, what happens is that occasionally you get duplicate data in each request. If you attempt to grab exactly on the edges, you have a chance of missing something, so its best to have a bit of an overlap and just deal with that overlap.
Continue reading

Getting Started with Linear Algebra in Python

Hello World!

So here I am after trying for a long time to not learn Python learning Python.  It just seems like I might get a hit or two more on my blog with some Python content.  Well whats the first thing I need to figure out aside from getting it up and running in my environment and installing some libraries… Thats right, find a numerical computing library and see how it ticks.

Lets just start with my environment, because I painstakingly chose one.

Continue reading

Battle of the Programming Languages

Hello World!

So this article is to help provide some guidance around which programming language to use.  Note that this article is specifically geared towards delivering code in which intelligence and information is the soul of the product.  In this day and age, that should be every product.

I want to preface this article with a few things

  1. This is an excerpt from a paper I wrote for internal use of my own volition.  As this is the case, I was able to remove all confidential information and publish my findings.
  2. I only analyzed F#, C#, R and Python.  I know there are more, but I picked the top dogs, but F# had some special circumstances that I felt it belonged.

Continue reading