Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Using IBM Watson Discovery to Query Unstructured Data

DZone's Guide to

Using IBM Watson Discovery to Query Unstructured Data

Using IBM's Watson Discovery Service can help you make sense of and identify patterns in large amounts of unstructured data.

· Big Data Zone ·
Free Resource

Hortonworks Sandbox for HDP and HDF is your chance to get started on learning, developing, testing and trying out new features. Each download comes preconfigured with interactive tutorials, sample data and developments from the Apache community.

In one of my previous blog entries, I described how to use Watson Knowledge Studio to create models to identify information in unstructured data. These models can be used by the Watson services and offerings Watson Discovery, Watson Explorer, and Watson Natural Language Understanding. Below is a quick intro how to use Watson Discovery to query unstructured data.

Watson Discovery is a service to extract value from unstructured data by converting, normalizing, and enriching it. In order to use it, you first need to upload your own content. In the next step, you need to deploy your model created by Knowledge Studio into the service.

This deployment process has two steps. First, the model is deployed from Knowledge Studio into a specific Discovery service instance. Next, Discovery service is configured to actually use it.

After this, you can query your data. Discovery service provides an API to invoke different types of queries. You can run full-text search-like queries that return documents ranked by relevance. You can also use filters to run SQL-like queries. Additionally, combined queries can be defined that do both. There is also the option to aggregate data and only return specific fields rather than the full documents.

This screenshot shows the Discovery query builder user interface with a simple sample that returns car incident reports that include Honda cars.

Image title

How cool is that?

Hortonworks Community Connection (HCC) is an online collaboration destination for developers, DevOps, customers and partners to get answers to questions, collaborate on technical articles and share code examples from GitHub.  Join the discussion.

Topics:
big data ,data analytics ,queries ,ibm watson

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}