Analysis in Ocient

Text Analysis in Database Queries

the {{ocienthyperscaledatawarehouse}} enables search and analysis of textual data you can run queries efficiently using indexes in the ocient hyperscale data warehouse to perform text analysis the database supports different types of indexes the n gram index is a type of secondary index that enables text analysis cluster key indexes define cluster key (ck) indexes when you create the table index type number of columns column types filters primary ck index (always exists) multiple fixed length or gdc columns, fixed length tuple columns equality like, range additional ck indexes subset of ck in any order secondary indexes you can create or drop secondary indexes at any time index type number of columns column types filters inverted indexes single fixed length or gdc column, array, or tuple component equality like, range hash indexes variable length column, array, or tuple component equality like n gram indexes varchar column, array, or tuple component like, equality like n gram indexes support text analysis by providing an efficient search of textual data this type of index works by tokenizing the pattern in the string, and then pruning and transforming the tokens you can create n gram indexes at any time for details about creating an n gram index, see database, tables, views, and indexes docid\ a4jvhkvg31tapexr9zpcq there are other ways to perform text analysis for details, see the syntax for general sql syntax docid\ ijazgahmd u0x4gsmz8e1 related links general sql syntax docid\ ijazgahmd u0x4gsmz8e1 secondary indexes docid\ efbuyqjp 9ole py4jpst