Over a million developers have joined DZone.

Big Data, Python, and R: The Language of Web Science

· Big Data Zone

Hortonworks DataFlow is an integrated platform that makes data ingestion fast, easy, and secure. Download the white paper now.  Brought to you in partnership with Hortonworks

This recent article from Karissa McKelvey discusses the emerging field of web science, and the increasing popularity of Python as the ideal language for data analysis over previous standards, such as STATA and R. There's been quite a bit of attention on Python's Big Data applications lately - Mikio Braun's recent history of Python in data science, for example - and McKelvey's article takes a similar stance, though her focus on web science directs the discussion toward academia and the slow pace of change.

Data sets in web science, McKelvey says, are massive enough that STATA and R won't cut it for them, yet the problem faced by many social scientists and others is that those are still the languages being taught and the only languages they have learned. It's an interesting dilemma: technological needs outpacing academia.

Check out McKelvey's full article for all the details and her predictions for the future.

Hortonworks Sandbox is a personal, portable Apache Hadoop® environment that comes with dozens of interactive Hadoop and it's ecosystem tutorials and the most exciting developments from the latest HDP distribution, brought to you in partnership with Hortonworks.


The best of DZone straight to your inbox.

Please provide a valid email address.

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}