Release Notes
Ocient Hyperscale Data Warehouse Release Notes
25 1 system catalog table changes — the record number column of the sys pipeline errors system catalog table is now the index, which starts at 1, of the processed record relative to the source file (or {{kafka}} partition) where the record was extracted the record offset column of the sys pipeline errors is now the offset of 1 for the processed record relative to the start of the source file for files with contiguous records, 1 for files with non contiguous records (i e , {{parquet}} ), and the kafka offset for kafka loads 25 0 release highlights load data into the {{ocient}} system — the new way to load data into the ocient system is to use data pipeline functionality this functionality supports loading data from multiple formats such as csv, json, and {{parquet}} data pipelines support data transformation during the load you can load multiple data formats, including geospatial data this functionality supports these new syntaxes create pipeline sql statement to create a data pipeline drop pipeline statement to remove a data pipeline preview pipeline statement to view the results of a data load before creating the data pipeline start pipeline statement to start the execution of the data load stop pipeline statement to stop the execution of the data load alter pipeline rename statement to rename a data pipeline export pipeline statement to return the create pipeline sql statement used to create the specified data pipeline create or replace pipeline function statement to define your own data pipeline function for loading data drop pipeline function statement to remove the data pipeline function for details, see load data docid\ qxg07ea5hv1vxat6neyg and data pipelines docid\ tqsshgl ifitlhjt78mbd features {{java}} runtime environment — the ocient system in stallation requires {{java}} 21, where the openjdk 21 jre headless package is recommended persistent data — data persists in the system storage space of the ocient system in system catalog tables such as the sys completed queries table for details, see core elements of an ocient system docid\ mw cl16ivsflmrdqgwtk6 default compression scheme — the default compression scheme for variable length columns is none set more granular privileges for database and system objects for details, see object type level privileges management docid\ xc0kiho2 m6cvsc4t4zgq and data control language (dcl) statement reference docid 7jq6fizrghsmuumsa4qiq the show system tables sql statement displays the system catalog tables system catalog table changes — information schema tables has a column table type with two new types system table for system catalog tables and system view for information schema tables base table and view still exist for user defined tables and views sys system tables has new columns schema , type , and description sys segment groups has the new depth and visibility columns sys stored segments has the new abnormal placement column sys completed queries contains persistent data the system experiences a small loading delay between completing the execution of a sql query and when the corresponding results appear in the table the definition for the sys queries table has been updated to remove these columns time start time optimization start time execution start time first byte sent the definition for the sys queries table has been updated to change the data type of these columns from long to timestamp timestamp start timestamp optimization start timestamp execution start timestamp first byte sent the definition for the sys locks table has been updated for these columns renamed createtime to create time renamed lastrefreshtime to last refresh time renamed priortyid to priority id updated column descriptions for these tables sys segment directories sys addendum directories sys segment groups sys segment parts sys segments sys stored segments sys service role status updated the data type for the segment group ids column in the sys segment group transfers table from array(long) to array(char) because segment group identifiers are unsigned renamed the service roles id column in the sys service role channel endpoint table to service role id to align with the naming in the sys service role status table version compatibility sys built in views has been removed the built in views have been removed from sys built in views , sys views , and information schema views the view information now appears in the sys system tables system catalog table information schema views includes user defined views and only information schema views sys views no longer has a view type column 24 0 release highlights better sql errors improved error messages for poorly constructed sql queries enhance build metadata updated {{ocient}} package file names and build information for clarity improved tracing enabled using the trace keyword in sql queries to profile query performance for details, see the general sql syntax docid\ cb6h0pxd4vqlizfr6jr0l keyword large blobstore support enabled more than 2tib of data to spill per disk for high density drive situations large drive support added support for drives up to 15 36tb in size per node rebalance system added the rebalance task, which enables optimization of query efficiency by transferring data around the system until nodes are roughly balanced in terms of data volume per node for details, see expand and rebalance system docid 2tnktuwb1mudckqw7sou4 system storage space added support for multiple storage spaces and enabled the creation of a system storage space for internal ocient system data workload management usability added support for assigning service classes to queries based on query text for details, see the users, groups, and service classes docid\ iwkirup8638xpra30vnl3 sql statement added support for changing the priority for queries for details, see the users, groups, and service classes docid\ iwkirup8638xpra30vnl3 sql statement features \[db 19266] network configuration — all nodes must now belong to a connectivity pool manage connectivity pools using these new sql statements for details, see cluster and node management docid\ bnlgs0qq1wre7ndja8q0x create connectivity pool to create a connectivity pool drop connectivity pool to drop a connectivity pool alter connectivity pool set to set the metadata of a connectivity pool alter connectivity pool rename to to rename a connectivity pool alter connectivity pool add participants to add nodes to a connectivity pool alter connectivity pool drop participants to remove nodes from a connectivity pool the alter node set address sql statement changes the internal ip address for a node for details, see cluster and node management docid\ bnlgs0qq1wre7ndja8q0x you can now configure the network of an ocient system for details, see manage the network configuration of an ocient system docid\ x6ap klapnhg3uvaqxwdm redirects now occur only within connectivity pools when you upgrade an ocient system, you must first configure a connectivity pool \[db 28986] system catalog table updates — renamed client version to protocol version in the sys queries and sys completed queries system catalog tables added driver version column to the sys queries and sys completed queries tables \[db 29788] regular expression functions — added new functions that use regular expression search patterns the new functions are character and binary functions docid\ t4v9mfehdtbx8w 6ipulp character and binary functions docid\ t4v9mfehdtbx8w 6ipulp character and binary functions docid\ t4v9mfehdtbx8w 6ipulp character and binary functions docid\ t4v9mfehdtbx8w 6ipulp version compatibility the bootstrap conf no longer supports highspeedaddress as an advanced system configuration option for other options, see node bootstrapping reference docid\ xi qqejsaauzkunlnq9yy the default behavior for the unnest function no longer uses the null input clause for a multi item select list if you would like to utilize the default behavior from version 23 0 and prior, you may do so by changing alter system alter config set sql unnestlegacyselectlistbehavior = 'true' feature removal removed compression lz4 from the compression options this compression scheme remains in use as part of compression dynamic for variable length columns removed the table valued function replacement join for creating compressed lookup tables, see global dictionary compression docid\ fen6iivd8dyogczsgl4zn particle swarm optimization functionality has been removed from the ocient system odbc connection has been removed from the ocient system 23 0 release highlights all machine learning functionality is available to use for details, see machine learning model functions docid\ jsgwuw5og56fzrve5h10g and machine learning in ocient docid\ arh0cufpzx bd gw0248n to get started delete syntax enabled the deletion of individual rows in the database integrations added drivers and support f or the following third party applications dbeaver {{tableau}} features \[db 13607] delete syntax — added the sql delete statement syntax that enables the deletion of individual rows in the database for details, see database, tables, views, and indexes docid\ ejutg6wjnk5eg55kizq8d \[db 18020] large geospatial types — increased the size of linestring and polygon geospatial data types to 512 mb for details, see lat data types in loading docid\ de5hb ki2fylva6wtelgt \[db 19048] geospatial index — added the spatial index type for indexing geospatial data for details, see secondary indexes docid\ pvmvtea poabqrd9q vkc \[db 18280] connectors refresh — added integration with dbeaver and {{tableau}} for details, see dbeaver integration docid hshvqd89sresrjjdv0ln and tableau integration docid\ fpsnmste3fc1yul4htpkk \[db 20412] multi cluster loading and cluster of clusters — added support for loading and working with multiple clusters for details, see multiple storage clusters for loading data docid\ hg8wacdzy s2zap9rio \[db 21609] machine learning model updates — added the clustering and dimension reduction models docid\ affc3g4 myb9ilf7xsv2i added the other models docid 3yrttxa4mtacybwk7auxf added the clustering and dimension reduction models docid\ affc3g4 myb9ilf7xsv2i version compatibility large geospatial types are not backwards compatible with earlier releases for details, see version compatibility docid\ zp3jyidhcqu5kfidb4kpy the database data control language (dcl) denotes user role privileges to remove data using the delete keyword instead of truncate 22 0 release highlights hyperloglog (hll) added hll sketch functionality information schema added the information schema schema that shows system metadata integrations added drivers and support for the following third party applications {{metabase}} sqlalchemy {{apache}} superset ® features \[db 13603] information schema — added the information schema schema that shows system metadata in an accessible format \[db 20484] superset integration — merged sqlalchemy ocient driver into superset repository, allowing superset to support {{ocient}} database connections \[db 20484] sqlalchemy integration — published sqlalchemy ocient driver to pypi \[db 21011] except clause — added except clause so that select queries can explicitly omit columns from results \[db 21769] metabase integration — added ocient as a metabase partner driver, allowing metabase to access ocient databases out of the box \[db 23030] jdbc packaging — removed openjump dependency from ocient jdbc4 \[db 23175] time zone adjustment support — added various improvements to time zone functionality, including support for daylight savings adjustment based on time zone added time zone functions convert utc timestamp to local and convert local timestamp to utc for more information, see time zone functions docid\ f8elldnuxgrzirili8c4p enhanced performance for time zone conversion \[db 23177] push down aggregation to the i/o layer — under certain conditions, the system pushes aggregation to the i/o operator for better efficiency and performance \[db 23299] hll sketch functionality — added support for variable log2k hll sketch algorithm and associated functions for details, see the hyperloglog functions docid\ lsrp6umq ruz2rnmfx6 7 \[db 23745] implement evacuate node — evacuate node is a tool to move all segments off of a node in a system that is overprovisioned to the other nodes in the cluster this tool is useful when you replace drives or a node \[db 19888] machine learning model updates — the ocient system scopes machine learning models to schemas the system assigns the pre v22 mlmodel schema to any model you created prior to version 22 0 the sys multiple linear regression slopes system catalog table has been removed rename machine learning models using alter mlmodel new ddl commands create or replace refresh export for details about the new syntaxes, see machine learning model functions docid\ jsgwuw5og56fzrve5h10g lat features \[lat 1469] manual configuration of lat endpoints — enabled manual configuration of lat endpoints for oauth with {{okta}} \[lat 1475] enablement of stopping load processing during error condition — added default behavior to stop processing during file loading in the event of an unrecoverable error when the system extracts records from a file for details, see lat pipeline configuration docid\ aczjrpa6a8wlrrdqa swc \[lat 1476] enablement of lat service in installation — enabled lat service in systemd by default upon installation completion this update reflects a change in the default behavior during installation \[lat 1477] lat version for metrics — exposed lat version in the metrics \[lat 1557] support for loading multiple s3 buckets — added lat functionality to load data from multiple s3 buckets simultaneously within the same pipeline version compatibility information schema — views created prior to version 22 0 do not have column data appearing in the information schema you can drop and recreate these views to populate column data lat — version 3 0 0 and greater is only compatible with version 22 0 and greater of the ocient system for details, see version compatibility docid\ zp3jyidhcqu5kfidb4kpy 21 0 release highlights the ocient system now supports the following operating systems ubuntu® 20 04 debian 11 rhel 8 other highlights include whole column compression added zstandard (zstd) compression for fixed and variable length columns check system configuration added precheck and postcheck commands to check system configuration before and after installation workload management dynamic priority enabled the adjustment of the query priority dynamically at the session, service class, and query levels ability to quiesce node added process for graceful node shutdown features \[db 18636] zstd compression added a new whole column compression scheme (zstd) that can be enabled for fixed and variable length columns \[db 18990] improved stats storage and usage various improvements have been added to speed up the fetching of statistics by the optimizer and ensure it gets up to date statistics these changes primarily center around probability density functions being stored as pre aggregated stats files instead of on a per segment basis \[db 20190] distributed tasks added check disk task type and new vtables sys subtasks , sys tasks , and sys rebuild tasks for monitoring tasks remove check data command \[db 19117] metadata added participating nodes to the sys queries and sys completed queries virtual tables \[db 18633] graceful node shutdown added quiesce process for graceful node shutdown \[db 18061] lck deprecation added new disk data format that is smaller and also improves performance of some index based queries \[db 19414] range query improvement improved performance of range queries by utilizing the inverted secondary index \[db 20168] geospatial function expansion added these geospatial scalar functions measurement functions st angle st distancesphere st distancespheroid st length2d st hausdorffdistance analytic and property functions st dimension st geohash st srid st ispolygoncw st ispolygonccw to string and binary functions st aswkt st aswkb st asewkt geography simplification function st simplify constructor functions st pointfromgeohash st geogpoint st makepolygonoriented st point fromewkt st linestring fromewkt st polygon fromewkt st makeenvelope additionally, you can construct st polygon types directly from a point\[] without going through an intermediate st linestring keywords added these new keywords as reserved words in the ocient system analysis autoregression bayes cancel component decision disable disable stats file updates enable feedforward insert kmeans knn logistic machine move naive network nonlinear principal replace source support tree vector zstd 20 0 release highlights create table as select sql statement extract, load, and transform (elt) workflow functionality to extract data and load it into a new database table by using the query results from a select sql statement the tables you create using the create table as select sql statement have some indexing limitations in version 20 0 for details, see the "about create table as select (ctas)" section of the ocient user documentation insert into sql statement elt workflow functionality to extract data and insert it into an existing database table using the insert into sql statement n gram indexes full index on varchar, varchar arrays, and varchar tuple components for efficient queries using the like sql statement large varchar \[db 16142] support varchar columns up to 1gb in size ocient simulator an instance of the ocient system for data loading and functional testing single sign on (sso) authenticate access to ocient through an external sso server and assign sso users to groups in ocient feature removal the alter role ddl command has been removed you can make all changes using the alter config sql statement to alter a role, prefix the key with the role name followed by a dot the following system tables have been added average bb sizes linear combination regression models node config node status sso connections storage device status the following system tables have been removed hugepage configurations memory module models node memory modules oidc integrations oidc sessions polynomial regression models security integrations sessions 19 0 features \[db 14527] adaptive water mark feature indexer node dynamically increase and reduce batch size without manual tuning \[db 14656] added a rest endpoint to expose a node’s configuration parameters ( 9090/v1/configparams) \[db 15123] expose cluster total storage space and storage usage through virtual tables \[db 15515] add support for expr dtype cast notation \[db 16289] remove the web ui and yaml service role configuration \[db 16904] allow any predicate type to be used in conjunction with the values in arrays \[db 17889] improve ability to continue data loading when a foundation node is down \[db 18393] leverage hyperthreading in query execution in v19 the service role configuration previously set through the web ui has been replaced by the alter … alter role/config … ddl command the web ui is still available in v19, but will be removed in a subsequent release the alter … alter role/config … command should be used to change system configuration, rather than the web ui please reference the upgrade ocient software section of the user documentation for details \[db 12747] add support for lateral joins \[db 13924] add support for multi column subqueries \[db 14990] add support for native right joins \[db 15996] add support for array to string function \[db 16231] improve gis function performance and introduce expanded support for gis functions please refer to the user documentation for details \[db 17037] add new scalar functions and operators added for gis types (point, linestring, and polygon) please refer to the user documentation for details \[db 17892] add support for right lateral joins \[db 16061] secondary indexes can now be created on varchar and varchar\[] columns please refer to the user documentation for details 18 0 features \[db 17635] remove query log properties timestamp optimizationcomplete and time optimizationcomplete and add new properties timestamp optimizationstart and time optimizationstart \[db 16567] make error messages more clear for queries with groupby missing \[db 17316] change array length(empty array) to return 0 \[db 16417] allow for integral types for integer field is gis functions \[db 16200] make explains more convenient for the user \[db 16092] distributed result set caching \[db 15623] add support for rebuilding individual nodes via ddl \[db 15375] alter cluster add participants ddl \[db 14720] provide a way to kill long running optimizations \[db 14017] support for cli command history across sessions 17 0 features \[db 12888] add support for array values larger than 128 kb the new maximum value of an array is 512 mb 16 0 features \[db 10329] add support for full disk encryption of opal drives disk encryption will be automatically enabled when opal support is detected 14 0 features \[db 14159] default hex values for binary or varbinary columns must contain a leading 0x 13 0 features \[db 14330] remove last dependencies on postgresql from the database 12 0 features \[db 13334] add support for zip unnest, which unnests multiple arrays in parallel 11 0 features \[db 12887] add support for the array of tuples users can create array columns containing tuple sql types please refer to the user documentation for the latest information on supported data types \[db 12885] add support for unnest(), which expands array elements from input array columns out to individual output rows 10 0 features \[db 12394] added support for running on centos 8 9 0 features \[db 13162] added support for tasks to the system catalog \[db 10332] implemented access controls on system and database level objects improved users, groups, and added new roles within ocient \[db 12829] optionally enforce encrypted connections for jdbc and odbc 8 0 features \[db 10330] external network security ssl/tls support in odbc and jdbc, ssl support for the web interface 7 0 features \[db 10921] adds support for multi dimensional arrays and the ability to do joins, windows, sorts and aggregations that involve arrays \[db 10927] adds support for global dictionary compression (gdc) on varchar array columns and the ability to do replacement joins \[db 10282] adds support for drop column ddl to remove columns from a table \[db 11472] adds support for skipping failed rows for csv loading up to some specified threshold 6 0 features \[db 9707] scriptable bulk load essentials allows users to create translations and launch bulk load tasks via ddl \[db 10479] adds support for tableau through ocient’s jdbc custom connector users can find ocient’s connector and the installation instructions on tableau’s extension gallery please refer to tableau for more inforamation 5 0 features \[db 9477] adds support for the array data type users can create single dimensional array columns from any other supported data type please refer to the user documentation for the latest information on supported data types \[db 9656] add column support the engine now supports the add column ddl statement with the ability to add columns to an existing table existing data that was loaded without the new column uses the configured default values when queried please refer to user documentation for information on the ddl syntax and default values 4 0 features \[db 6386] availability of the storage engine, allowing queries to run with a node or drive failure \[db 6588] bulk loading of csv files from hdfs or an s3 endpoint \[db 7221] delta compression in the tkt engine for timestamp columns \[db 7623] virtual tables to retrieve information from the storage cluster state \[db 7247] os upgrade functionality \[db 6125] aws initial support \[db 6383] data definition language (ddl) operations \[db 6940] all system configuration in the system catalog \[db 6362] stats virtual tables \[db 7098] external window operator support \[db 7497] list running tasks page \[db 7097] segment group deletion \[db 7139] cancel query and cancel task support numerous stability and performance improvements