Oozie Installation
1. Go to the Oozie website:
http://oozie.apache.org/
2. Click Downloads.

3. Click the link circled in red.

4. Click the link circled in red.

5. Click the link circled in red.

6. Upload the archive to the Linux machine and extract it under /opt/module.
7. cd into the bin directory and build the distribution with: ./mkdistro.sh -Phadoop-2 -DskipTests
8. Ping repository.cloudera.com to get its IP address, then add that address to /etc/hosts.
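The hosts-file edit in step 8 can be sketched as a small shell helper. This is only an illustration: add_host_entry is a hypothetical function name, not part of Oozie, and the commented example assumes getent is available to resolve the address.

```shell
# Append "IP hostname" to a hosts file unless the hostname is already listed.
# add_host_entry is a hypothetical helper, not part of Oozie.
add_host_entry() {
    local ip="$1" host="$2" hosts_file="${3:-/etc/hosts}"
    # -F treats the hostname as a literal string, -w matches it as a whole word
    if grep -qwF -- "$host" "$hosts_file" 2>/dev/null; then
        echo "$host already present in $hosts_file"
    else
        printf '%s %s\n' "$ip" "$host" >> "$hosts_file"
        echo "added $ip $host to $hosts_file"
    fi
}

# Example (run as root; resolves the address instead of reading it off ping):
#   ip=$(getent hosts repository.cloudera.com | awk '{print $1; exit}')
#   add_host_entry "$ip" repository.cloudera.com /etc/hosts
```

Running the helper twice leaves a single entry, so the step can be repeated safely if the build has to be restarted.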
9. In /opt/oozie-4.3.0/distro/target, find the tarball oozie-4.3.0-distro.tar.gz.
Extracting it gives /opt/oozie/oozie-4.3.0.
10. Set the environment variables.
Open /etc/profile (vim /etc/profile), add the OOZIE_HOME environment variable, append $OOZIE_HOME/bin to PATH, save and exit, then run source /etc/profile.
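As a rough sketch, the two lines added to /etc/profile could be written by a helper like the one below. append_oozie_env is a hypothetical name, and the install path in the example is the one from step 9; adjust both to your setup.

```shell
# Append OOZIE_HOME and PATH exports to a profile file, only once.
# append_oozie_env is a hypothetical helper, not part of Oozie.
append_oozie_env() {
    local oozie_home="$1" profile="${2:-/etc/profile}"
    if grep -q '^export OOZIE_HOME=' "$profile" 2>/dev/null; then
        echo "OOZIE_HOME already set in $profile"
        return 0
    fi
    {
        printf 'export OOZIE_HOME=%s\n' "$oozie_home"
        # Single quotes keep $PATH and $OOZIE_HOME unexpanded until the profile is sourced
        printf '%s\n' 'export PATH=$PATH:$OOZIE_HOME/bin'
    } >> "$profile"
}

# Example (run as root):
#   append_oozie_env /opt/oozie/oozie-4.3.0 /etc/profile
#   source /etc/profile
```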
Edit the configuration file under oozie-4.3.0/conf (oozie-site.xml) and add the following properties:
<property>
    <name>oozie.service.JPAService.jdbc.driver</name>
    <value>com.mysql.jdbc.Driver</value>
    <description>JDBC driver class.</description>
</property>
<property>
    <name>oozie.service.JPAService.jdbc.url</name>
    <value>jdbc:mysql://192.168.127.11:3306/oozie</value>
    <description>JDBC URL.</description>
</property>
<property>
    <name>oozie.service.JPAService.jdbc.username</name>
    <value>root</value>
    <description>DB user name.</description>
</property>
<property>
    <name>oozie.service.JPAService.jdbc.password</name>
    <value>123456</value>
    <description>
        DB user password.
        IMPORTANT: if the password is empty, leave a 1 space string; the service trims the value,
        and if it is empty Configuration assumes it is NULL.
    </description>
</property>
<property>
    <name>oozie.service.HadoopAccessorService.hadoop.configurations</name>
    <value>*=/opt/hadoop-2.6.0/etc/hadoop</value>
    <description>
        Comma separated AUTHORITY=HADOOP_CONF_DIR, where AUTHORITY is the HOST:PORT of
        the Hadoop service (JobTracker, YARN, HDFS). The wildcard '*' configuration is
        used when there is no exact match for an authority. The HADOOP_CONF_DIR contains
        the relevant Hadoop *-site.xml files. If the path is relative, it is looked up within
        the Oozie configuration directory, though the path can be absolute (i.e. to point
        to Hadoop client conf/ directories in the local filesystem).
    </description>
</property>
<property>
    <name>oozie.service.SparkConfigurationService.spark.configurations</name>
    <value>*=/opt/spark-1.4.0-bin-hadoop2.6-hive/conf</value>
    <description>
        Comma separated AUTHORITY=SPARK_CONF_DIR, where AUTHORITY is the HOST:PORT of
        the ResourceManager of a YARN cluster. The wildcard '*' configuration is
        used when there is no exact match for an authority. The SPARK_CONF_DIR contains
        the relevant spark-defaults.conf properties file. If the path is relative, it is looked up within
        the Oozie configuration directory, though the path can be absolute. This is only used
        when the Spark master is set to either "yarn-client" or "yarn-cluster".
    </description>
</property>
<property>
    <name>oozie.service.WorkflowAppService.system.libpath</name>
    <value>hdfs://ns1/user/${user.name}/share/lib</value>
    <description>
        System library path to use for workflow applications.
        This path is added to a workflow application if its job properties set
        the property 'oozie.use.system.libpath' to true.
    </description>
</property>
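11. In $HADOOP_HOME/etc/hadoop/, open core-site.xml and add the following configuration:
<property>
    <name>hadoop.proxyuser.root.hosts</name>
    <value>centos01</value>
</property>
<property>
    <name>hadoop.proxyuser.root.groups</name>
    <value>root</value>
</property>
Note: both values can also be set to *.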
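A quick way to confirm that both proxy-user entries made it into core-site.xml is a check like the sketch below. has_proxyuser_config is a hypothetical helper, and it assumes each property name sits in a single-line <name> element with no extra whitespace.

```shell
# Return 0 if both proxyuser properties for the given user appear in the file.
# has_proxyuser_config is a hypothetical helper; it only looks for the
# <name> elements and does not validate the XML or the values.
has_proxyuser_config() {
    local user="$1" core_site="$2"
    grep -qF "<name>hadoop.proxyuser.${user}.hosts</name>" "$core_site" &&
        grep -qF "<name>hadoop.proxyuser.${user}.groups</name>" "$core_site"
}

# Example:
#   if has_proxyuser_config root "$HADOOP_HOME/etc/hadoop/core-site.xml"; then
#       echo "proxyuser settings for root found"
#   fi
```

Hadoop typically needs to be restarted (or have its proxy-user settings refreshed) before a change to core-site.xml takes effect.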
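12. In /opt/oozie/oozie-4.3.0, create a new directory named libext. Then cd to $HADOOP_HOME/ and run the following command to copy all the Hadoop jars into that directory:
find . -name '*.jar' | xargs -t -I {} cp {} /opt/oozie/oozie-4.3.0/libext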
13. Note: delete hsqldb-2.0.0.jar from libext, otherwise it causes a jar conflict.
Also put the downloaded ext-2.2.zip and the MySQL driver jar into the libext directory.
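Steps 12 and 13 can be combined into one helper, sketched below. collect_hadoop_jars is a hypothetical name; the sketch assumes the destination directory is not inside the source tree, and it removes any hsqldb-*.jar rather than only version 2.0.0.

```shell
# Copy every .jar under src into dest, then drop any hsqldb jar.
# collect_hadoop_jars is a hypothetical helper, not part of Oozie or Hadoop.
collect_hadoop_jars() {
    local src="$1" dest="$2"
    mkdir -p "$dest"
    # Quoting '*.jar' stops the shell from expanding it before find sees it
    find "$src" -name '*.jar' -exec cp {} "$dest"/ \;
    # Remove hsqldb jars (hsqldb-2.0.0.jar in the text) to avoid the jar conflict
    rm -f "$dest"/hsqldb-*.jar
}

# Example:
#   collect_hadoop_jars "$HADOOP_HOME" /opt/oozie/oozie-4.3.0/libext
```

ext-2.2.zip and the MySQL driver jar still have to be copied into libext by hand afterwards.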
14. Install the Oozie sharelib.
The oozie-4.3.0 directory contains oozie-sharelib-4.3.0.tar.gz. Extract it with tar -zxvf oozie-sharelib-4.3.0.tar.gz; this produces a share directory.
Go into share/lib/sqoop and remove sqoop-1.4.6-hadoop100.jar (I renamed it to sqoop-1.4.6-hadoop100.jar.bak). This jar exists in Oozie 4.2 but not in 4.3, so the step is not needed for 4.3.
Copy mysql-connector-java-5.1.38.jar into /opt/oozie/oozie-4.3.0/share/lib/sqoop. Without this jar, data cannot be imported from or exported to MySQL.
Upload the share directory to HDFS:
hdfs dfs -put /opt/oozie/oozie-4.3.0/share /user/root
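15. Start Oozie:
bin/oozie-start.sh
16. A process named Bootstrap should now be visible (for example in the output of jps). Check the server status with:
oozie admin -oozie http://localhost:11000/oozie -status
Then open the Oozie web page:
http://192.168.137.100:11000/oozie/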
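Because the server can take a little while to come up, the status check can be wrapped in a retry loop. The following is a minimal sketch: wait_for_oozie and the OOZIE_CMD override are hypothetical, and it assumes a healthy server reports a line containing NORMAL (Oozie 4.x prints "System mode: NORMAL").

```shell
# Poll "oozie admin -status" until the server reports NORMAL mode.
# wait_for_oozie is a hypothetical helper; OOZIE_CMD lets you point it
# at a different oozie client binary (an assumption of this sketch).
wait_for_oozie() {
    local url="$1" attempts="${2:-10}" delay="${3:-3}"
    local oozie_cmd="${OOZIE_CMD:-oozie}"
    local i=1
    while [ "$i" -le "$attempts" ]; do
        if "$oozie_cmd" admin -oozie "$url" -status 2>/dev/null | grep -q 'NORMAL'; then
            echo "Oozie is up at $url"
            return 0
        fi
        sleep "$delay"
        i=$((i + 1))
    done
    echo "Oozie did not report NORMAL after $attempts attempts" >&2
    return 1
}

# Example:
#   wait_for_oozie http://localhost:11000/oozie 20 5
```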