Who should attend this Apache ORC Training Course?
The Apache Optimised Row Columnar (ORC) Training is a specialised course aimed to provide Engineers, Architects, and Developers with an in-depth understanding of high-performance columnar storage format used in the Hadoop ecosystem. The following are some professionals who can benefit from this course:
- Data Engineers
- Big Data Developers
- Database Administrators
- Data Scientists
- Hadoop Administrators
- Cloud Engineers
- ETL Developers
Prerequisites of the Apache ORC Training Course
There are no formal prerequisites for this Apache ORC Training Course. However, a basic understanding of Hadoop would be useful.
Apache ORC Training Course Overview
Apache is a non-profit organisation that helps those open-source software projects that are released under the license of Apache. Apache ORC is a self-describing columnar file format enabling efficient querying and storage of data on Hadoop. It uses multi-version concurrency control for supporting ACID transactions. This Apache ORC Training is designed to equip delegates with a detailed knowledge of Apache ORC.
The Knowledge Academy’s Apache OCR Training will introduce delegates to ORC adapters and types. Delegates will gain knowledge of Apache ORC’s three levels of indexes. In addition, delegates will learn how to build Apache ORC. Delegates will get familiarised with hive DDL and configuration, including table and configuration properties.
During this 1-day course, delegates will learn how to read and write ORC files. Delegates will get an understanding of how to send OrcStruct, OrcList, OrcMap through the shuffle. This Apache ORC Training will fully prepare delegates on how to use Apache ORC tools – C++ and Java tools. Post completion of this training, delegates will be able to use Java meta, data, scan, convert, and JSON Schema.