Notes on development and operations for Cloudera CDH/CDP, the Hadoop ecosystem, Semantic IoT, and related technologies. Send questions to gooper@gooper.com.

Misc: Installing hadoop 2.0.5 on ubuntu

Site administrator 2013.12.16 22:09 Views: 4459

Source: http://www.spikyjohn.com/cribsheets/20130609_hadoopinstall.html

Just the command lines to get Hadoop 2 installed on Ubuntu. These are all cribbed from the following source notes, and I am preserving them here for my own benefit so I can quickly repeat what I did. Note that many of these instructions are also in the main Hadoop docs from Apache.

Source material
Use Michael Noll's guide for version 1 & ssh:
    http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-single-node-cluster/
    http://hadoop.apache.org/docs/r1.1.2/single_node_setup.html
Or this one for Hadoop 2:
    http://jugnu-life.blogspot.com/2012/05/hadoop-20-install-tutorial-023x.html
    http://hadoop.apache.org/docs/r2.0.5-alpha/

Create the hadoop user and ssh
    sudo apt-get install openssh-server openssh-client
    sudo addgroup hadoop
    sudo adduser --ingroup hadoop hduser
    su - hduser
If you cannot ssh to localhost without a passphrase, execute the following commands:
    ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa
    cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys
Testing your SSH:
    ssh localhost
Say yes when asked to accept the host key, then leave the session:
    exit
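
The key-append step above adds a duplicate line each time it is repeated, and sshd (with its default StrictModes setting) ignores authorized_keys when the file or its directory is writable by others. A minimal sketch of a repeatable version follows; add_authorized_key is a hypothetical helper name, not an OpenSSH command, and it assumes the public key file holds a single line.

```shell
# add_authorized_key PUBKEY_FILE AUTHORIZED_KEYS_FILE
# Hypothetical helper (not part of OpenSSH): appends the public key only
# if that exact line is not already present, then tightens permissions.
add_authorized_key() {
    pub_file="$1"
    auth_file="$2"
    auth_dir="$(dirname "$auth_file")"
    mkdir -p "$auth_dir"
    touch "$auth_file"
    # -q quiet, -x whole-line match, -F fixed string, -f patterns from file
    if ! grep -qxF -f "$pub_file" "$auth_file"; then
        cat "$pub_file" >> "$auth_file"
    fi
    # Restrict access to the owner only
    chmod 700 "$auth_dir"
    chmod 600 "$auth_file"
}

# Usage as hduser (left commented so that nothing runs on paste):
# add_authorized_key ~/.ssh/id_dsa.pub ~/.ssh/authorized_keys
```

Running it twice leaves a single copy of the key, which is handy when repeating these notes on the same machine.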

Get hadoop all set up
As the hduser, after downloading the tar:
    tar -xvf hadoop-2.0.5-alpha.tar.gz
    ln -s hadoop-2.0.5-alpha hadoop
Edit .bashrc and add:
    export JAVA_HOME=/usr/lib/jvm/jdk1.7.0_21/
    export HADOOP_PREFIX="/home/hduser/hadoop"
    export PATH=$PATH:$HADOOP_PREFIX/bin
    export PATH=$PATH:$HADOOP_PREFIX/sbin
    export HADOOP_MAPRED_HOME=${HADOOP_PREFIX}
    export HADOOP_COMMON_HOME=${HADOOP_PREFIX}
    export HADOOP_HDFS_HOME=${HADOOP_PREFIX}
    export YARN_HOME=${HADOOP_PREFIX}
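
Since the later steps depend on these exports being visible in the login shell, a quick check after logging in again can save some confusion. The sketch below is only an illustration; check_hadoop_env is a hypothetical helper name, and it tests only that each variable is non-empty, not that the paths exist.

```shell
# check_hadoop_env: hypothetical helper that lists which of the variables
# exported in .bashrc are missing from the current shell.
# Returns 0 when all are set, 1 otherwise.
check_hadoop_env() {
    missing=0
    for name in JAVA_HOME HADOOP_PREFIX HADOOP_MAPRED_HOME \
                HADOOP_COMMON_HOME HADOOP_HDFS_HOME YARN_HOME; do
        # Indirect lookup: read the value of the variable named by $name
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "not set: $name"
            missing=1
        fi
    done
    return "$missing"
}

# Usage after logging in again as hduser:
# check_hadoop_env && echo "environment looks complete"
```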

Stolen entirely from JJ, but with path changed for my Ubuntu
Stolen from http://jugnu-life.blogspot.com/2012/05/hadoop-20-install-tutorial-023x.html Please click on his blog.

Log in again so bash has the paths above.

In Hadoop 2.x, etc/hadoop under the install directory (here $HADOOP_PREFIX/etc/hadoop) is the default conf directory. We need to modify or create the following property files in that directory.

Create the local storage directories first:
    cd ~
    mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/dfs/name
    mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/dfs/data
    mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/mapred/system
    mkdir -p /home/hduser/workspace/hadoop_space/hadoop23/mapred/local

Edit core-site.xml with the following contents:
    <configuration>
      <property>
        <name>fs.default.name</name>
        <value>hdfs://localhost:8020</value>
        <description>The name of the default file system. Either the literal string "local" or a host:port for NDFS.</description>
        <final>true</final>
      </property>
    </configuration>

Edit hdfs-site.xml with the following contents:
    <configuration>
      <property>
        <name>dfs.namenode.name.dir</name>
        <value>file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/name</value>
        <description>Determines where on the local filesystem the DFS name node should store the name table. If this is a comma-delimited list of directories then the name table is replicated in all of the directories, for redundancy.</description>
        <final>true</final>
      </property>
      <property>
        <name>dfs.datanode.data.dir</name>
        <value>file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/data</value>
        <description>Determines where on the local filesystem a DFS data node should store its blocks. If this is a comma-delimited list of directories, then data will be stored in all named directories, typically on different devices. Directories that do not exist are ignored.</description>
        <final>true</final>
      </property>
      <property>
        <name>dfs.replication</name>
        <value>1</value>
      </property>
      <property>
        <name>dfs.permissions</name>
        <value>false</value>
      </property>
    </configuration>

The paths file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/name and file:/home/hduser/workspace/hadoop_space/hadoop23/dfs/data are folders on your computer that provide space for the name table (including its edit files) and the data blocks. Each path should be specified as a URI.

Create a file mapred-site.xml in the same conf directory with the following contents:
    <configuration>
      <property>
        <name>mapreduce.framework.name</name>
        <value>yarn</value>
      </property>
      <property>
        <name>mapred.system.dir</name>
        <value>file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/system</value>
        <final>true</final>
      </property>
      <property>
        <name>mapred.local.dir</name>
        <value>file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/local</value>
        <final>true</final>
      </property>
    </configuration>

The paths file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/system and file:/home/hduser/workspace/hadoop_space/hadoop23/mapred/local are folders on your computer that provide space to store data. Each path should be specified as a URI.

Edit yarn-site.xml with the following contents:
    <configuration>
      <property>
        <name>yarn.nodemanager.aux-services</name>
        <value>mapreduce.shuffle</value>
      </property>
      <property>
        <name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
        <value>org.apache.hadoop.mapred.ShuffleHandler</value>
      </property>
    </configuration>

Format the namenode:
    hdfs namenode -format
Say Yes and let it complete the format.

Time to start the daemons:
    hadoop-daemon.sh start namenode
    hadoop-daemon.sh start datanode
You can also start both of them together with:
    start-dfs.sh

Start the Yarn daemons:
    yarn-daemon.sh start resourcemanager
    yarn-daemon.sh start nodemanager
You can also start all yarn daemons together with:
    start-yarn.sh

Time to check if the daemons have started. Enter the command:
    jps
The output should list all four daemons (process IDs will differ):
    2539 NameNode
    2744 NodeManager
    3075 Jps
    3030 DataNode
    2691 ResourceManager

Time to launch the UI. Open http://localhost:8088 to see the Resource Manager page.

Done :) Happy Hadooping :)
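
Reading the jps listing by eye works for a single node, but when repeating the install it can be easier to let a small function say which daemon is absent. This is a rough sketch only; missing_daemons is a hypothetical helper name, and the list of four daemons is taken from the sample output above.

```shell
# missing_daemons: hypothetical helper that reads `jps` output on stdin
# and prints each expected daemon that does not appear in it.
missing_daemons() {
    jps_output="$(cat)"
    for daemon in NameNode DataNode ResourceManager NodeManager; do
        # jps prints "<pid> <name>"; compare the second field exactly so
        # that e.g. SecondaryNameNode is not mistaken for NameNode
        if ! printf '%s\n' "$jps_output" |
            awk -v d="$daemon" '$2 == d { found = 1 } END { exit !found }'; then
            echo "$daemon"
        fi
    done
}

# Usage on the node itself:
# jps | missing_daemons
```

An empty result means all four daemons were listed; any name printed points at the log file to look at next.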


