Infrastructure at your Service

In a previous blog, we described how to install Hitachi Content Intelligence the solution of Hitachi Ventara for data indexing and search. In this blog post, we will see how we can use Hitachi Content Intelligence to perform the basic search on personal information (PII).

Data Connections

HCI allows you to connect to multiple data source using default data connectors. The first step is to create a data connection. By default, multiples data connectors are available:

HCI_data_connectors

For our example, we will simply use the Local File System as the data repository. Note that, the directory must be within the HCI install directory

Below the data connection configuration for our PII demo.

HCI_Data_Connection

Click on Test after adding the information and click on Create.

A new data connection will appear in your dashboard.

Processing Pipelines

After creating the data connection, will build a processing pipeline for our PII example
Click on Processing Pipelines > Create a Pipeline. Enter a name for your pipeline (optionally a description) and click on Create.
Click on Add Stages, and create your desired pipeline. For PII search we will use the following pipeline.
HCI_PII_Pipeline
After building your pipeline, you can test it by clicking on the Test Pipeline button at the top right of your page.

Index Collections

We should now, create an index collection to specify how you want to index your data set.
First, click on Create Index inside the Index Collections button. Create an HCI Index and use the schemaless option.
HCI_Index

Content Classes

Then you should create your content classes to extract your desired information from your data set. For our PII example, we will create 3 content classes for American Express and Visa credit card and for Security Social Number.
HCI_content_classes
For America Express credit card, your should add the following pattern.
HCI_AMEX
Pattern for Visa credit card.
HCI_VISA
Pattern for Social Security Number.
HCI_SSN

Start your workflow

When all steps are completed you can start your workflow and wait until it finish.
HCI_Workflow_Start

HCI Search

Use the HCI Search application to visualize the results.
https://<hci_instance_ip>:8888
Select your index name in the search field, and naviguate through the results.
You can also display the results in charts and graphics
HCI_Search_Graph
This demo is also available in the Hitachi Ventara Community website: https://community.hitachivantara.com/thread/13804-pii-workflows

Leave a Reply

Mehdi Bada
Mehdi Bada

Consultant