Hadoop及HIVE学习宝典收集



Hive常用命令
https://cwiki.apache.org/confluence/display/Hive/GettingStarted
http://richardxu.com/hiveql-common-operations/
http://www.cnblogs.com/ggjucheng/archive/2013/01/03/2843448.html

 

hadoop mapreduce核心功能描述
http://www.open-open.com/lib/view/open1337349822015.html

 

HADOOP测试常见问题和测试方法
http://www.verydemo.com/demo_c290_i1937.html
https://www.opensciencegrid.org/bin/view/Storage/HadoopOperations#Hadoop_Filesystem

 

hadoop namenode启动过程详细剖析及瓶颈分析
http://blog.csdn.net/wh62592855/article/details/6557051

 

namenode节点元数据大小所需java堆内存计算方法
http://www.mail-archive.com/core-user@hadoop.apache.org/msg02835.html

也可以在这几个属性中添加调试端口
mapreduce.map.java.opts
mapreduce.reduce.java.opts
mapred.child.java.opts
http://blog.javachen.com/hadoop/2013/08/01/remote-debug-hadoop/

in current version, if zk CONNECTION_LOSS, the namenode will shut down?
https://cwiki.apache.org/confluence/display/ZOOKEEPER/FAQ

java.lang.OutOfMemoryError: GC overhead limit exceeded
http://stackoverflow.com/questions/10109572/gc-overhead-limit-exceeded-on-hadoop-20-datanode

out.write(s)并不是立即往hdfs中写数据
http://blog.csdn.net/dandingyy/article/details/7434092


Mapreduce测试框架mockito
http://code.google.com/p/mockito/
http://caffebig.wordpress.com/2012/10/16/unit-testing-hadoop-map-reduce-jobs/

wordcount实例详解
http://developer.51cto.com/art/201206/345334_1.htm


hive 三种模式
http://www.linuxboy.net/Linux/2011-08/41451.htm
http://blog.csdn.net/cindy9902/article/details/6215769
https://cwiki.apache.org/confluence/display/Hive/AdminManual+MetastoreAdmin
https://cwiki.apache.org/confluence/display/Hive/GettingStarted
https://cwiki.apache.org/confluence/display/Hive/Home

OpenTSDB介绍
http://richardxu.com/hiveql-common-operations/


mysql linux安装方式
http://blog.csdn.net/houqingdong2012/article/details/8197234
解决方法:ERROR 1018 (HY000): Can't read dir of './dbname/' (errno: 13)
http://www.cnblogs.com/win-and-first/archive/2013/04/21/mysql.html
/etc/init.d/mysql restart

 

Nmon文件过大解析失败问题解决方法
http://blog.csdn.net/yixiayizi/article/details/8093572

Job OOM问题
https://issues.apache.org/jira/browse/MAPREDUCE-5429
http://lucene.472066.n3.nabble.com/Run-mr-example-wordcount-error-on-hadoop-2-0-1-alpha-HA-td4007743.html

Log4j详细配置
http://www.jb51.net/article/16394.htm

HDFS读写文件流程
http://blog.csdn.net/hguisu/article/details/7259716

wordcount详解
http://f.dataguru.cn/thread-86484-1-1.html

DataNode本地数据存储和管理
http://www.2cto.com/kf/201303/192586.html


Mapreduce之间的参数传递
http://blog.csdn.net/guoery/article/details/8525473

配置项调优
http://developer.yahoo.com/hadoop/tutorial/module7.html
https://sites.google.com/site/hadoopandhive/system/app/pages/sitemap/hierarchy


如何让linux的history命令显示时间记录
http://www.9php.net/article/20121105090348.html
# export HISTTIMEFORMAT='%F %T '
# history | more

首先检测系统是否安装了mysql
rpm -qa|grep -i mysql
如果有则删除
rpm --nodpes -e mysql*;
相关链接http://blog.csdn.net/zengmuansha/article/details/9081281

Hadoop集群报错Caused by: java.net.NoRouteToHostException: No route to host
http://zhaohe162.blog.163.com/blog/static/382167972013030114815803/
hbase启动时间要求集群时间差30秒内。
关闭防火墙命令: service iptables stop


yarn.app.mapreduce.am.staging-dir应该和服务端一致
https://issues.apache.org/jira/i#browse/MAPREDUCE-4119

检查操作系统语言设置
解决方法:参考https://wiki.archlinux.org/index.php/Locale

hbase常用命令
http://blog.csdn.net/scutshuxue/article/details/6988348

 

Hadoop源码分析和相关配置项解释
http://blog.csdn.net/woelegant/article/details/8870399

Hbase put数据过程中客户端报SocktTimeout
http://blog.csdn.net/yangbutao/article/details/8608146
http://www.codesky.net/article/201206/171896.html


Hbase数据导入导出命令
http://koven2049.iteye.com/blog/1162904

hbase默认配置项说明
http://hi.baidu.com/qingchunranzhi/item/a87007b692c27eadebba93cc

【Hadoop】中map与reduce的个数问题
http://blog.csdn.net/zwan0518/article/details/9409361


Linux下的IO监控与分析
http://www.cnblogs.com/quixotic/p/3258730.html
http://xjsunjie.blog.51cto.com/999372/640932

CPU消耗很高
http://bbs.csdn.net/topics/390413349

 

相关内容

    暂无相关文章