So, what is data science? There is an increasing buzz about it lately – A Fortune article recently dubbed the “data scientist” as “The Hot New Gig in Tech”.
This report is a good primer for the new field of data science. In some ways (as with many fields), it’s not new at all. In fact, I think my company has been involved with raising the bar of data science within the defense and intelligence worlds for over 35 years – I’ve personally been involved for nearly 20 of those. However, with the volumes of data being generated, and the ease of being able to process this data, there is now a recognition of the value of data science.
Another recent post by DJ Patil, chief scientist at LinkedIn, discusses how he built the data science team there, and he has a lengthy discussion on what the roles of data scientists are and what to look for in building your teams.
In fact, in the O’Reilly data science report, Patil was referenced as believing that the best data scientists:
“tend to be “hard scientists”, particularly physicists, rather than computer science majors. Physicists have a strong mathematical background, computing skills, and come from a discipline in which survival depends on getting the most from the data. They have to think about the big picture, the big problem.”
That has been my experience as well – the best data scientists that we find are ones that have a strong math background, whether they are majors in physics or in computer science. Strict coders find data science a bit challenging if they don’t come armed with the strong math skills needed to pull information from the data…
Well, here’s my definition. “Data science” is the general analysis of the creation of data. This means the comprehensive understanding of where data comes from, what data represents, and how to turn data into actionable information (something upon which we can base decisions). This encompasses statistics, hypothesis testing, predictive modeling, and understanding the effects of performing computations on data, among other things. Science in general has been armed with many of these tools, but data science pools the necessary tools together to provide a scientific discipline to the analysis and productizing of data.
A complementary term that is being used currently is “analytics”, and I found an interesting and very appropriate definition on Wikipedia, which is “the process of obtaining an optimal or realistic decision based on existing data.” Analytics are then useful metrics derived from data upon which decisions are made.
Combining data science and analytics together gives a foundation upon which new computing paradigms and data products can be generated. Cool stuff!…






Facebook
LinkedIn
Twitter
I like the article and find it very relevent and interesting pertaining to my own career (current position working with biostatisticians who make a living doing data crunching ). I have been considering some sort of refresher training in biostat since I am a biochemist not a biostatstician and can benefit from polishing my math skills. any suggestions will be appreciated.
[...] Science &… | Everything… From A Science Angle Everything… From A Science Angle Skip to content HomeAboutPrivacy PolicyDig Deeper ← Data Science Defined [...]
Thanks for the comment! Off the top of my head, there are two books that are pretty good when it comes to understanding fundamentals of data science, but since the field itself is new, they don’t scream out “data science” just yet. The first one teaches the background how you make decisions from data – it is “Detection, Estimation, and Modulation Theory” by Van Trees. The other goes through the fundamentals of probability – it is “Probability Theory: The Logic of Science” by Jaynes.
Both books are really textbooks, so they are a little pricey, and quite technical in nature. However, they are very good books on the subject. As I come across other resources that can provide a quicker jumpstart (and less costly), I’ll be sure to post them. Thanks!
[...] would say that this is true to a point, and it got me thinking about what data science and the development of analytics are really about in my experience. Ultimately, I see data [...]
[...] Data science is becoming a real discipline, one where an engineering approach will become commonplace. This will eventually take over from the ad-hoc approaches that currently tackle the easy problems, and leave the hard and nearly impossible problems unaffected (I wrote about the three different types of data science problems here…). [...]
[...] but the important part is how to turn data into information that’s reliable. This is what data science is really all [...]
[...] WP Greet Box WordPress PluginDan Woods of Forbes is continuing to interview people in the field of data science to gain their understanding of this new [...]
[...] to the RSS feed for updates on this topic.Powered by WP Greet Box WordPress PluginIt seems that data science is permeating every industry these days, and new industry reports by Forrester Research describe [...]
[...] of Engineering and Applied Sciences (SEAS) is making a big push to become a leader in the new data science field, planning for a huge expansion which include 1,000,000 square feet of additional space that would [...]
[...] This weekend, he posted an article about the use of data science in the winemaking business (yes, data science is getting to be [...]
[...] is lengthy, but if you’re interesting in social and political dynamics (like me) and how data science plays a role in understanding these behaviors (like me!), then you might find Jay’s post [...]
[...] continues the pattern of heavy entry level funding into analytics and big data startups – data science applications are becoming the next big technology [...]
[...] subscribe to the RSS feed for updates on this topic.Powered by WP Greet Box WordPress PluginAs the data science and big data technology booms start accelerating, it’s worth noting how these technologies [...]
[...] is a really good insight from Dumbill and how data science applies to business: Why is the scientific method applicable to business and [...]
[...] to the RSS feed for updates on this topic.Powered by WP Greet Box WordPress PluginYou know that data science is truly becoming a recognized scientific discipline when billions of university dollars will be [...]
[...] or Data Science” panel at Techonomy 2011 in Tucson, AZ this week, the panel explored how data science has taken its place next to computer science as a fundamental element of information technology. [...]
[...] definition of data science is still being defined, but I’m convinced it will have huge impact in the next five years. [...]
[...] is about finding the truth – no matter what it is – and as more businesses start using data science in order to drive business outcomes, we need to make sure that science is about being honest [...]