Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Using IBM Watson Discovery to Query Unstructured Data

DZone's Guide to

Using IBM Watson Discovery to Query Unstructured Data

Using IBM's Watson Discovery Service can help you make sense of and identify patterns in large amounts of unstructured data.

· Big Data Zone ·
Free Resource

The open source HPCC Systems platform is a proven, easy to use solution for managing data at scale. Visit our Easy Guide to learn more about this completely free platform, test drive some code in the online Playground, and get started today.

In one of my previous blog entries, I described how to use Watson Knowledge Studio to create models to identify information in unstructured data. These models can be used by the Watson services and offerings Watson Discovery, Watson Explorer, and Watson Natural Language Understanding. Below is a quick intro how to use Watson Discovery to query unstructured data.

Watson Discovery is a service to extract value from unstructured data by converting, normalizing, and enriching it. In order to use it, you first need to upload your own content. In the next step, you need to deploy your model created by Knowledge Studio into the service.

This deployment process has two steps. First, the model is deployed from Knowledge Studio into a specific Discovery service instance. Next, Discovery service is configured to actually use it.

After this, you can query your data. Discovery service provides an API to invoke different types of queries. You can run full-text search-like queries that return documents ranked by relevance. You can also use filters to run SQL-like queries. Additionally, combined queries can be defined that do both. There is also the option to aggregate data and only return specific fields rather than the full documents.

This screenshot shows the Discovery query builder user interface with a simple sample that returns car incident reports that include Honda cars.

Image title

How cool is that?

Managing data at scale doesn’t have to be hard. Find out how the completely free, open source HPCC Systems platform makes it easier to update, easier to program, easier to integrate data, and easier to manage clusters. Download and get started today.

Topics:
big data ,data analytics ,queries ,ibm watson

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}