Rishu Shrivastava edited this page Oct 4, 2020 · 9 revisions

Welcome to the GitHub repository: Hive Custom UDFs

Apache Hive is data warehouse software that facilitates reading, writing, and managing large datasets that reside in distributed storage, all through SQL syntax. Built on top of Apache Hadoop, Hive provides easy access to data via SQL, enabling data warehousing tasks such as extract/transform/load (ETL), reporting, and data analysis.

Apache Hive provides many built-in functions for manipulating and processing data. Even so, a particular business use case sometimes calls for a function that is not available out of the box. In such cases, Hive lets you write your own user-defined functions (UDFs) by extending the org.apache.hadoop.hive.ql.exec.UDF class.
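As a minimal sketch of what such a function can look like, the class below holds the `evaluate` method that Hive looks up by reflection on a UDF subclass. The class name `NormalizeText` and its trim-and-lowercase behavior are hypothetical and not taken from this repository. To keep the sketch compilable without Hive on the classpath, the `extends UDF` clause is left as a comment; a real UDF would restore it, declare the class `public`, and build against the hive-exec library.

```java
import java.util.Locale;

// Hypothetical example, not one of this repository's plugins.
// To run inside Hive, add: import org.apache.hadoop.hive.ql.exec.UDF;
// and declare the class as: public class NormalizeText extends UDF
class NormalizeText {

    // Hive calls evaluate() once per row; a NULL column value arrives as null.
    public String evaluate(String input) {
        if (input == null) {
            return null; // propagate SQL NULL unchanged
        }
        // Strip surrounding whitespace and lowercase in a locale-independent way.
        return input.trim().toLowerCase(Locale.ROOT);
    }
}
```

Once packaged into a jar, a class like this would typically be loaded with `ADD JAR` and registered with `CREATE TEMPORARY FUNCTION normalize_text AS 'NormalizeText';`, where the function name `normalize_text` is again only an illustration.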

This repository provides a single, unified binary of custom user-defined functions, covering an array of plugins and functions. The idea is to replace workaround solutions for Apache Hive UDFs and to maintain one repository for all the custom UDFs.

List of Available Plugins:
