Comma-separated values (CSV) files are supported in several different capacities. Which of the following is NOT correct?
A. You can import a CSV file to create or add words to a custom user dictionary.
B. You can export documents as CSV files.
C. You can export query statistics to a CSV file.
D. You can import CSV files and add them to a search collection.
In regard to Crawler plug-ins, which of the following statements is NOT true?
A. Crawler plug-ins come in two flavors: Web and Non-web.
B. Non-web crawler plug-ins are of two types: prefetch or postparse.
C. Crawler plug-ins must be written in Java.
D. Data source crawler plug-ins can change the content or metadata of crawled documents.
What is the simple mechanical process of breaking up whitespace-delimited text into words called?
A. Tokenization.
B. Segmentation.
C. Shallow parsing.
D. Regular expressions.
All of the following statements about annotators are true, EXCEPT:
A. They are software components that perform specific linguistic analysis tasks and produce and record annotations.
B. They are always independent of (i.e., do not rely on) other annotators in the document processing pipeline.
C. They do the real work of extracting structured information from unstructured data.
D. They typically read from and write to the common analysis structure.
E. They may be collection specific, and are administered via the Configure document processing pipeline link on the Parse and Index tab.
Users are being discouraged from using ________________ because they may not be supported in future releases.
A. Real-time NLP API.
B. SIAPI Administration APIs.
C. Crawler Plug-in APIs.
D. Identity management component APIs.
Which ICA component generates Facets?
A. Crawlers.
B. Search Servers.
C. Indexer.
D. All of the above.
Crawler plug-ins for archive files support which of the following crawler(s)?
A. Agent for Windows file systems.
B. FileNet P8.
C. SharePoint.
D. All of the above.
Integrating the IBM Classification Module in the ICA document processing pipeline allows one to:
A. Reduce indexing time.
B. Improve search quality.
C. Categorize documents.
D. B and C only.
Custom rule files are implemented by the ______________ annotator in the document processing pipeline.
A. Dictionary lookup.
B. Linguistic analysis.
C. Named entity recognition.
D. Pattern matcher.
Assuming you want to export documents to IBM Content Collector, which one of the following options to export crawled or analyzed documents is NOT useful?
A. Enable the CAS as XMI format option.
B. Specify whether you want to export information about documents that were deleted from the crawl space since the crawler last checked for new, changed, and deleted documents.
C. Specify whether you want to use the field name or facet path for mapping metadata when the documents are exported.
D. Specify URI patterns to identify the documents that you want to export.