DZone
Big Data Zone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
  • Refcardz
  • Trend Reports
  • Webinars
  • Zones
  • |
    • Agile
    • AI
    • Big Data
    • Cloud
    • Database
    • DevOps
    • Integration
    • IoT
    • Java
    • Microservices
    • Open Source
    • Performance
    • Security
    • Web Dev
DZone > Big Data Zone > Using Python for Big Data Workloads (Part 1)

Using Python for Big Data Workloads (Part 1)

Get several resources on using Python for Big Data workloads, and learn about various programming SDKs, APIs, and libraries.

Tim Spann user avatar by
Tim Spann
CORE ·
May. 13, 17 · Big Data Zone · Tutorial
Like (13)
Save
Tweet
20.80K Views

Join the DZone community and get the full member experience.

Join For Free

In Part 2, we will look at Python for Spark (PySpark), Machine Learning, and deep learning in depth. In this first part, we'll go over the basics, some examples, and some tutorials to get you started.

Get the latest Python for your environment — Linux, OSX, and even Windows are supported.   There's a debate whether to finally move to Python 3.x; try it and see if it works for all your tools. Since my Hadoop installation has Python 2.7, I am going to use that for my work.

Python is great. I can run it for Machine Learning, websites, from NiFi, deep learning, and stitching together a lot of jobs. Using Apache Zeppelin, I can run Python and PySpark without installing it and tons of modules on my developer workstation.

Python Resources

  • Python Against Phoenix

  • Python Against Hive (GitHub)

  • Here is a simple example:

    from pyhive
    import hive
    conn = hive.connect('myhiveserverisawesome.tim.com').cursor()
    conn.execute('SELECT * FROM amwatertweetshive LIMIT 10')
    print conn.fetchone()
  • Using HDFS

  • Python With MQTT for IOT (Option 2)

  • HBase 101

  • Spark and Python

  • Python Development With Zeppelin 

  • Introduction to Machine Learning With Apache Spark and Zeppelin

  • Getting Started With Apache Zeppelin

  • Sentiment Analysis With Python

  • Sentiment Analysis and CoreNLP With REST API

  • Machine Learning With Apache Hive and Apache Hivemall

  • Apache HiveMall With Spark, Hive, and Pig

  • Sentiment Analysis and NLP With Python

  • Python Deep Learning

  • Example Zeppelin Notebooks

  • Enabling Apache Zeppelin and Spark for Data Science in the Enterprise

  • Introduction to Data Science

  • Apache Spark Crash Course

  • Hadoop Crashcourse

  • Apache Spark Using Python Class

Python (language) Big data Data science

Opinions expressed by DZone contributors are their own.

Popular on DZone

  • Migrating Legacy Applications and Services to Low Code
  • Progressive Web Apps vs Native Apps: Differences and Similarities
  • Java’s Encapsulation - When the Getter and Setter Became Your Enemy
  • When Writing Code Isn't Enough: Citizen Development and the Developer Experience

Comments

Big Data Partner Resources

X

ABOUT US

  • About DZone
  • Send feedback
  • Careers
  • Sitemap

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • MVB Program
  • Become a Contributor
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 600 Park Offices Drive
  • Suite 300
  • Durham, NC 27709
  • support@dzone.com
  • +1 (919) 678-0300

Let's be friends:

DZone.com is powered by 

AnswerHub logo