Hive User-Defined Functions
There are a lot of useful functions available to add to Hive. You can also write your own in Java, Scala, and Python. The main 3rd party open source collections are referenced in this article.
Join the DZone community and get the full member experience.Join For Free
When you start using Hive you may miss some of the functions you are used to from Oracle, MySQL or elsewhere. Or you might just want a profanity filter. Whatever the case you can browse our list below for a large selection of UDF libraries. You can also use the pointers listed to write your own.
There's also a number of smaller UDF collections for various purposes to add to Hive:
UDF Collection for Various String distances, Text classification and other Text Mining.
UDF for anonymizing data with Apache Pig.
Hive UDF for various functions like array count
Curve Computing UDF
Ngram Functions UDF
Hive UDFs Similar to Oracle Funcitons
A collection of UDFs for GeocodeIP, Haversine Distance, DecodeURL UDFs
Hive Funnel Analysis UDF by Yahoo (tracking user conversion rates across actions)
Hive UDF Collection by LivingSocial for Min and Max Date, MySQL Style Like, and more.
Hive UDF to Count Business Days.
User Agent String Parser Hive UDF
Date Range Generator Hive UDF
Roll Your Own
If you want to write something a bit different from Hive UDF for your functions.
There's also a database independent hybrid procedural SQL language supported on Hive in Hive 2.0. This works with Hadoop, NoSQL database and SQL databases like MySQL and is mostly compatible with Oracle PL/SQL. This looks pretty interesting.
Opinions expressed by DZone contributors are their own.