I’ve been performing data science before there was a field called “data science
“, so I’ve had the opportunity to work with and hire a lot of great people. But if you’re trying to hire a data scientist, how do you know what to look for, and what should you consider in the interview process?
I’ve been doing what is now called “data science” since the early 1990s and have helped to hire numerous scientists and engineers over the years. The teams I’ve had the opportunity to work with are some of the best in the world, tackling some of the most challenging problems facing our country. These folks are also some of the smartest people I’ve ever had the opportunity to work with.
That said, not everyone is a good fit, and the discipline of data science requires important key elements. Hiring someone into your team is incredibly important to your business, especially if you’re a small startup or building a critical internal data science team; mistakes can be expensive in both time and money. This can be even more intimidating if you don’t have the background or experience in hiring scientists, especially someone responsible for this new discipline of working with data.
Science and business seem like two very different disciplines, but is the best approach to learning any different in these two fields? These areas of life seem so unique, and the people in them can be quite varying (one with the nerdy pocket protector and the other dressed in the well-tailored suit). However, both science and business require learning, and the best approach to learning in each is really the same.
The best approach to learning is generally through failure. For example, Thomas Edison failed an astounding number of times
before he invented a working lightbulb, and there are likely thousand of stories about how successes came as a result of many tries and many failures.
I read a couple of items in this month’s Fortune magazine that I thought it was worth passing along.
The first was a small article by Brian Dumaine about the work being done at Applied Proteomics to identify cancer before it develops. At Applied Proteomics, they use mass spectroscopy to capture and catalog 360,000 different pieces of protein found in blood plasma, and then let supercomputers crunch on the data to identify anomalies associated with cancer. The company has raised $57 million in venture capital and is backed by Microsoft co-founder Paul Allen. You can read the first bit of the article here.
The second is from the Word Check callout, showing how access to information is making the word a better place:
wasa: Pronounced [wah-SUH]
(noun) Arabic slang: A display of partiality toward a favored person or group without regard for their qualifications. A system that drives much of life in the Middle East — from getting into a good school to landing a good job.
But on the Internet, there is no wasa.
- Adapted from Startup Rising: The Entrepreneurial Revolution Remaking the Middle East by Christopher M. Schroeder
I found this set of business wisdoms in the August 2013 issue of Entrepreneur magazine. While not perfect mantras by which to guide a business, I thought there were pretty fun.
Chris Hardwick didn’t rely on just his nerdy instincts in founding his company; he also took inspiration from his heroes. Super-power your business with these lessons from some epic nerd properties.
Ever wonder what your own personal network looks like? You are likely connected to many different groups (family, friends, community, work), but do you know how they are connected? Or are they connected at all? Are you the glue that connects these various groups?
This is a great age we’re living in, and I’m glad to be involved with developing lots of really advanced technologies. One of the technology areas
that I’m really fascinated with has been pushed forward by Stephen Wolfram
. He created the industry standard computing environment Mathematica, which now serves as the engine behind his company’s newest creation, Wolfram|Alpha
. (I’ve written a few posts on Wolfram|Alpha in the past, and you can read them here
This is a technical post about what I’ve discovered in creating my own custom URL shortener. Hopefully, you can learn to do the same things I did, and my experience will save you some headaches if it’s something you’re interesting in trying.
On my website, I focus a lot about decisions and discovery. I love finding out how the world works and then applying what I’ve learned to make better decisions, and I also try to share what I can along the way. I hope that it helps others.
It’s a complex world, and we are constantly making decisions. Just imagine the number of decisions we make about breakfast: How big a breakfast should I have? Should I have coffee? If so, how much? Should I have toast? Should I use butter? Should I have one piece or two? Should I cut the toast? If so, should they be cut into rectangles or triangles? Should I keep the crust? Should I have juice? Should it be apple juice or orange juice? How about milk? I haven’t even gotten to the pancakes, waffles, syrup, sausage, cereal, bacon… (mmm, bacon…)
And these aren’t the really important ones! How do we know we’re making good decisions, and can we make better ones?
You might think that it’s a bit odd, treating yourself like a science experiement. However, the best way to achieve your goals may be to do just that – be committed to collecting data on yourself.
In science, we’re always collecting data and analyzing it to find out more about the world. However, collecting data isn’t only for people with pocket protectors (although we don’t all wear those!). It is something that any of us can use to help us achieve any goal we set for ourselves.
Imagine a guy with glasses who used to model baseball stats and play online poker nailing the outcome of the 2012 elections. And when I say “nailing”, I mean that he correctly predicted the U.S. Presidential contest in every one of the 50 states (and nearly every U.S. Senate race, too). He even performed better than some of the most widely-used polling firms. Now imagine that he gives his thoughts on making these types of predictions. That’s exactly what Nate Silver does in his new book The Signal and the Noise.
I’ve worked in what’s now being called “data science
” for nearly twenty years. The title of Silver’s book – The Signal and the Noise – presents an important and sometimes overlooked part of this science. The “signal” is what we’re looking for in the data, and the “noise” is all the stuff in the data that gets in the way of what we’re looking for.
60 Minutes aired a piece last night about scientific fraud at Duke University, where data was fabricated in order to support alleged discoveries in individualized cancer therapies. As a result of these investigations, a number of previously published scientific articles have been retracted.
Less than a week ago, I highlighted an infographic from Jen Rhee about the alarming statistics in science fraud. I’m really disheartened that such a highly visible example came up so quickly…