2013-09-23 2 views
0

내가 두 레드햇 6.4 리눅스 시스템에 구성된 하둡에서 프로그램을 작성 TestDFSIO 하둡을 실행하고 레드햇 리눅스에서 제대로 작업을 줄일 수 있지만 프로그램이 응답하지하둡 TestDFSIO 프로그램 완료

100 %의지도 감소 16 % 후

나는 SECON에 다시 한 실행을 위해 잘 작동 네임 노드를 포맷 한 후

hadoop jar hadoop-test-1.2.1.jar -write -nrFiles 960 -fileSize 1024 . 

로 쓰기 TestDFSIO 워크로드를 실행하지만, 지도 작업을 마친 후에 매달려서 그런 식으로 실패했습니다.

100 %지도 16 % 감소. 내가 읽기 워크로드

hadoop jar hadoop-test-1.2.1.jar -read -nrFiles 960 -fileSize 1024 

을 실행할 때

hadoop jar hadoop-test-1.2.1.jar -write -nrFiles 960 -fileSize 1024 

하여 데이터를 작성하는 하나 개의 실행을 완료 할 수있는 네임 노드를 포맷하지만 후에는 최종 단계 이후에 붙어 다음과 같습니다. -

100% map 16% reduce done. 

왜 줄일 수 있습니까? 제대로 끝내지 못하니? 마스터 노드 쇼에 TaskTracker의

로그 (시간과 클래스 명 단축) : -

 
...0:15,541 INFO ....JvmManager: JVM : jvm_201309241959_0001_m_226512462 exited with exit code 0. Number of tasks it ran: 1 
...0:15,814 INFO ....TaskTracker: attempt_201309241959_0001_m_000958_0 0.0% reading [email protected]/1073741824 ::host = 9.122.227.170 
...0:16,768 INFO ....TaskTracker: Received KillTaskAction for task: attempt_201309241959_0001_m_000957_1 
...0:16,768 INFO ....TaskTracker: About to purge task: attempt_201309241959_0001_m_000957_1 
...0:16,768 INFO ....IndexCache: Map ID attempt_201309241959_0001_m_000957_1 not found in cache 
...0:17,559 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16597223% reduce > copy (478 of 960 at 0.00 MB/s) > 
...0:18,355 INFO ....TaskTracker: attempt_201309241959_0001_m_000958_0 1.0% finished test_io_8 ::host = 9.122.227.170 
...0:18,355 INFO ....TaskTracker: Task attempt_201309241959_0001_m_000958_0 is done. 
...0:18,355 INFO ....TaskTracker: reported output size for attempt_201309241959_0001_m_000958_0 was 93 
...0:18,356 INFO ....TaskTracker: addFreeSlot : current free slots : 2 
...0:18,498 INFO ....JvmManager: JVM : jvm_201309241959_0001_m_832308806 exited with exit code 0. Number of tasks it ran: 1 
...0:20,584 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16597223% reduce > copy (478 of 960 at 0.00 MB/s) > 
...0:21,697 INFO ....TaskTracker.clienttrace: src: 9.122.227.170:50060, dest: 9.122.227.170:48771, bytes: 93, op: MAPRED_SHUFFLE, cliID: attempt_201309241959_0001_m_000958_0, duration: 6041257 
...0:26,608 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...0:32,632 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...0:35,655 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...0:41,679 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...0:47,700 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...0:50,721 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...0:56,744 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...0:59,766 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:05,789 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:11,812 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:14,835 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:20,859 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:26,885 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:29,908 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:35,931 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:41,955 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:44,978 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:51,002 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...1:57,025 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...2:00,048 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...2:06,072 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...2:12,096 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...2:15,119 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...2:21,143 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...2:27,167 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) > 
...2:30,190 INFO ....TaskTracker: attempt_201309241959_0001_r_000000_0 0.16631946% reduce > copy (479 of 960 at 0.00 MB/s) >

The Screenshot of the terminal where the hadoop process is stucked at the reduce jobs.

스크린 샷은 하둡 프로세스가 작업의 감소 단계에 갇혀 있음을 보여줍니다.

+1

jobtracker 및 감소 작업에서 로그를 게시 할 수 있습니까? –

+0

나는 tasktracker의 로그를 붙여 넣었다. 스크린 샷은 걸려있는 터미널입니다. –

답변

0

문제는 DNS 이름 확인과 관련하여/etc/hosts 파일을 편집하고 localhost 항목을 제거하고 RedHat Linux 6.4에서 저에게 실제 호스트 이름을 추가해야한다는 문제를 해결했습니다. hadoop 클러스터가 hadoop 마스터 및 hadoop 슬레이브와 식별하도록합니다.