1. Set up the UDF development environment

Add the Maven dependencies:

<dependencies>
    <dependency>
        <groupId>jdk.tools</groupId>
        <artifactId>jdk.tools</artifactId>
        <version>1.8</version>
        <scope>system</scope>
        <systemPath>${JAVA_HOME}/lib/tools.jar</systemPath>
    </dependency>
    <!-- https://mvnrepository.com/artifact/org.apache.hadoop/hadoop-common -->
    <dependency>
        <groupId>org.apache.hadoop</groupId>
        <artifactId>hadoop-common</artifactId>
        <version>3.1.3</version>
    </dependency>
    <dependency>
        <groupId>org.apache.hive</groupId>
        <artifactId>hive-exec</artifactId>
        <version>3.1.2</version>
    </dependency>
</dependencies>

2. Write the UDF: extend UDF and implement an evaluate method

package bigdata.hive;

import org.apache.hadoop.hive.ql.exec.UDF;

public class UdfDemo extends UDF {

    public String evaluate(String ip) {
        if (ip == null) {
            return null; // pass NULL input through
        }
        String[] datas = ip.split("\\."); // split on ".", escaped for the regex
        StringBuilder sb = new StringBuilder();
        // left-pad each octet with zeros to three digits
        for (String s : datas) {
            s = "000" + s;
            s = s.substring(s.length() - 3);
            sb.append(s).append(".");
        }
        return sb.substring(0, sb.length() - 1);
    }

    // local test
    public static void main(String[] args) {
        UdfDemo udfDemo = new UdfDemo();
        String evaluate = udfDemo.evaluate("2.3.5.3");
        System.out.println(evaluate); // 002.003.005.003
    }
}
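The same left-padding can also be written with `String.format`; a minimal standalone sketch (the `PadDemo` class and `pad` helper are hypothetical names for illustration, not part of the UDF above):

```java
// Demonstrates the same padding as UdfDemo.evaluate, using
// String.format("%03d", ...) instead of the "000" + substring trick.
public class PadDemo {

    // hypothetical helper, equivalent to the loop in evaluate
    static String pad(String ip) {
        StringBuilder sb = new StringBuilder();
        for (String octet : ip.split("\\.")) {
            // %03d zero-pads the parsed octet to width 3
            sb.append(String.format("%03d", Integer.parseInt(octet))).append(".");
        }
        return sb.substring(0, sb.length() - 1);
    }

    public static void main(String[] args) {
        System.out.println(pad("2.3.5.3"));      // 002.003.005.003
        System.out.println(pad("192.168.1.10")); // 192.168.001.010
    }
}
```

Note that `Integer.parseInt` will throw on non-numeric octets, whereas the substring version silently truncates them; either behavior may be acceptable depending on how dirty the input data is.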

3. Package the UDF

Run the Maven `install` goal (`mvn install`) to package the project; a successful build ends with BUILD SUCCESS.


4. Upload the UDF jar to HDFS

hdfs dfs -put bigdata-0.0.1-SNAPSHOT.jar /user/hive/hiveFunction/

5. Start Hive and register the UDF as a function

create function UdfDemo as 'bigdata.hive.UdfDemo' using jar 'hdfs://master:9000/user/hive/hiveFunction/bigdata-0.0.1-SNAPSHOT.jar';


6. Use the UDF

Test the UDF:

select UdfDemo("2.3.5.3");
