Over a million developers have joined DZone.

Big Data, Python, and R: The Language of Web Science

DZone's Guide to

Big Data, Python, and R: The Language of Web Science

· Big Data Zone ·
Free Resource

Hortonworks Sandbox for HDP and HDF is your chance to get started on learning, developing, testing and trying out new features. Each download comes preconfigured with interactive tutorials, sample data and developments from the Apache community.

This recent article from Karissa McKelvey discusses the emerging field of web science, and the increasing popularity of Python as the ideal language for data analysis over previous standards, such as STATA and R. There's been quite a bit of attention on Python's Big Data applications lately - Mikio Braun's recent history of Python in data science, for example - and McKelvey's article takes a similar stance, though her focus on web science directs the discussion toward academia and the slow pace of change.

Data sets in web science, McKelvey says, are massive enough that STATA and R won't cut it for them, yet the problem faced by many social scientists and others is that those are still the languages being taught and the only languages they have learned. It's an interesting dilemma: technological needs outpacing academia.

Check out McKelvey's full article for all the details and her predictions for the future.

Hortonworks Community Connection (HCC) is an online collaboration destination for developers, DevOps, customers and partners to get answers to questions, collaborate on technical articles and share code examples from GitHub.  Join the discussion.


Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}