1.先在hive-site.xml中设置小文件的标准.
<property> <name>hive.merge.smallfiles.avgsize</name> <value>536870912</value> <description>When the average output file size of a job is less than this number, Hive will start an additional map-reduce job to merge the output files into bigger files. This is only done for map-only jobs if hive.merge.mapfiles is true, and for map-reduce jobs if hive.merge.mapredfiles is true.</description> </property>
2.为只有map的mapreduce的输出并合并小文件.
<property> <name>hive.merge.mapfiles</name> <value>true</value> <description>Merge small files at the end of a map-only job</description> </property>
3.为含有reduce的mapreduce的输出并合并小文件.
<property> <name>hive.merge.mapredfiles</name> <value>true</value> <description>Merge small files at the end of a map-reduce job</description> </property>
更多相关教程请访问 MySQL视频教程
Copyright © 2019- aiwanbo.com 版权所有 赣ICP备2024042808号-3
违法及侵权请联系:TEL:199 18 7713 E-MAIL:2724546146@qq.com
本站由北京市万商天勤律师事务所王兴未律师提供法律服务