Over a million developers have joined DZone.

Natural Language Processing With Apache Spark

DZone's Guide to

Natural Language Processing With Apache Spark

This article focuses on natural language processing (text munging and machine learning) on the Apache Spark platform.

· Big Data Zone
Free Resource

Need to build an application around your data? Learn more about dataflow programming for rapid development and greater creativity. 

There are a lot of exciting things going on in Natural Language Processing (NLP) in the Apache Spark world.   There's a ton of libraries and new work going on in OpenNLP and StanfordNLP.   

There's a lot of interesting applications that can be done using NLP and large sources of text, a few notes, presentations and code examples that follow will be helpful.

The last few Apache Spark Summit's have produced some great talks on NLP.

From the Advanced Apache Spark Meetup in San Francisco, there were a ton of great documents and source code:

Check out the Exaptive data application Studio. Technology agnostic. No glue code. Use what you know and rely on the community for what you don't. Try the community version.

apache spark ,big data ,hadoop ,machine learning ,nlp

Opinions expressed by DZone contributors are their own.


Dev Resources & Solutions Straight to Your Inbox

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.


{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}