Flume: Collecting Logs into Hive
Published: 2019-05-26


Option 1: collect the logs into HDFS.

Option 2: insert into an existing Hive table. When Flume writes into Hive, the table data must be stored in ORC format.

source: network logs
channel: local disk + memory. Memory is used first; once it is full, the local disk serves as the overflow buffer.
sink: hive

a1.sources = s1
a1.channels = c1
a1.sinks = k1

# TCP protocol
a1.sources.s1.type = syslogtcp
a1.sources.s1.port = 5140
a1.sources.s1.host = wangfutai
a1.sources.s1.channels = c1

a1.channels.c1.type = SPILLABLEMEMORY
a1.channels.c1.memoryCapacity = 10000
a1.channels.c1.overflowCapacity = 1000000
a1.channels.c1.byteCapacity = 800000
a1.channels.c1.checkpointDir = /home/wangfutai/a/flume/checkPoint
a1.channels.c1.dataDirs = /home/wangfutai/a/flume/data

a1.sinks.k1.type = hive
a1.sinks.k1.channel = c1
a1.sinks.k1.hive.metastore = thrift://wangfutai:9083
a1.sinks.k1.hive.database = hive
a1.sinks.k1.hive.table = flume
#a1.sinks.k1.hive.partition = asia,%{country},%y-%m-%d-%H-%M
#a1.sinks.k1.useLocalTimeStamp = false
a1.sinks.k1.round = true
a1.sinks.k1.roundValue = 10
a1.sinks.k1.roundUnit = minute
a1.sinks.k1.serializer = DELIMITED
a1.sinks.k1.serializer.delimiter = ","
a1.sinks.k1.serializer.serdeSeparator = '\t'
a1.sinks.k1.serializer.fieldnames = id,name,age

Starting the agent with this configuration failed because the Hive streaming classes were not on Flume's classpath:

19/01/16 22:24:59 ERROR node.PollingPropertiesFileConfigurationProvider: Failed to start agent because dependencies were not found in classpath. Error follows.
java.lang.NoClassDefFoundError: org/apache/hive/hcatalog/streaming/RecordWriter
	at org.apache.flume.sink.hive.HiveSink.createSerializer(HiveSink.java:219)
	at org.apache.flume.sink.hive.HiveSink.configure(HiveSink.java:202)

The steps below fix the error and prepare Hive for the sink.

1. Copy all jars under /home/wangfutai/module/hive-1.1.0-cdh5.15.0/hcatalog/share/hcatalog into /home/wangfutai/module/apache-flume-1.6.0-cdh5.15.0-bin/lib.

2. In .bash_profile, add:

export HADOOP_CLASSPATH=$HADOOP_CLASSPATH:$HIVE_HOME/lib/*

3. In hive-site.xml, set:
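
<property>
  <name>hive.support.concurrency</name>
  <value>true</value>
</property>
<property>
  <name>hive.enforce.bucketing</name>
  <value>true</value>
</property>
<property>
  <name>hive.txn.manager</name>
  <value>org.apache.hadoop.hive.ql.lockmgr.DbTxnManager</value>
</property>
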
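4. The table must be bucketed, stored as ORC, and transactional:

create table hive.flume2 (
  id int,
  name string,
  age int
)
clustered by (id) into 2 buckets
stored as orc
tblproperties("transactional"='true');

The sink configuration above writes to hive.flume; to write to this table instead, set a1.sinks.k1.hive.table = flume2.

5. Copy hive-site.xml and hive-env.sh into apache-flume-1.6.0-cdh5.15.0-bin/conf.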
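
Because the DELIMITED serializer maps input fields to table columns by name, a typo in serializer.fieldnames or a sink wired to the wrong channel tends to surface only when the agent runs. The following is a minimal sketch of a pre-flight check, not part of Flume: parse_properties and check_hive_sink are hypothetical helper names, and the column list is assumed to be typed in by hand from the create table statement rather than read from the metastore.

```python
def parse_properties(text):
    """Parse Flume-style 'key = value' lines, skipping blanks and # comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props


def check_hive_sink(props, table_columns, agent="a1", sink="k1"):
    """Return a list of problems found; an empty list means none were found.

    The agent and sink names default to those used in this post.
    """
    problems = []
    prefix = f"{agent}.sinks.{sink}."

    if props.get(prefix + "type") != "hive":
        problems.append(f"{prefix}type is not 'hive'")

    # The sink's channel has to be one of the channels the agent declares.
    channel = props.get(prefix + "channel")
    declared = props.get(f"{agent}.channels", "").split()
    if channel not in declared:
        problems.append(f"channel {channel!r} is not listed in {agent}.channels")

    # fieldnames is a comma-separated list without spaces; an empty entry
    # means "skip this input field", so only non-empty names are checked.
    fieldnames = props.get(prefix + "serializer.fieldnames", "")
    if " " in fieldnames:
        problems.append("serializer.fieldnames must not contain spaces")
    for name in fieldnames.split(","):
        if name and name not in table_columns:
            problems.append(f"field {name!r} is not a column of the table")

    return problems
```

Calling check_hive_sink(parse_properties(open(path).read()), ["id", "name", "age"]) on the agent's properties file (path is a placeholder for wherever the file lives) returns an empty list when the wiring and the field names line up. The comparison is case-sensitive, which is stricter than Hive itself.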

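Once the agent starts cleanly, the pipeline can be exercised by pushing a few comma-delimited records at the syslogtcp source. The sketch below builds RFC 3164-style lines and sends them over TCP. It assumes the source strips the syslog header and hands only the message part (for example 1,tom,20) to the sink as the event body; that behaviour depends on the Flume version and source settings, so check the rows that actually arrive in Hive. syslog_line and send_records are hypothetical helper names, and the default address is the host and port from the configuration in this post.

```python
import socket
from datetime import datetime

MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]


def syslog_line(fields, host="client", when=None, pri=14):
    """Build one RFC 3164-style line whose message is a comma-delimited record.

    pri=14 is facility 'user' (1 * 8) plus severity 'info' (6).
    """
    when = when or datetime.now()
    # RFC 3164 pads single-digit days with a space, not a zero.
    stamp = f"{MONTHS[when.month - 1]} {when.day:>2} {when:%H:%M:%S}"
    body = ",".join(str(f) for f in fields)
    return f"<{pri}>{stamp} {host} {body}\n"


def send_records(records, addr=("wangfutai", 5140)):
    """Send each record as one newline-terminated syslog line over TCP."""
    with socket.create_connection(addr, timeout=5) as sock:
        for rec in records:
            sock.sendall(syslog_line(rec).encode("utf-8"))
```

For example, send_records([(1, "tom", 20), (2, "amy", 31)]) sends two made-up rows in the id,name,age order given by serializer.fieldnames.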
 

Reposted from: http://mgjxi.baihongyu.com/
