It’s an absolute myth that you can send an algorithm over raw data and have insights pop up. … the predicament of data wrangling [is] big data’s “iceberg” issue, meaning attention is focused on the result that is seen rather than all the unseen toil beneath.

“For Big-Data Scientists, ‘Janitor Work’ Is Key Hurdle to Insights” via NYTimes

A great article that covers the inherent issues of dealing with unstructured data. “Data wrangling” is as important as the actual magic delivered by “data science.”

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s