Attempt timeout threshold for jobs running in Hadoop
2014-01-03 02:19
The attempt timeout threshold is a useful feature for catching tasks that hang or run too slowly. However, a task sometimes needs to run a script by forking a new sub-process, and it may not be able to observe that process's progress. In Hadoop, if a task attempt does not report progress within a time interval (600 s by default), the framework considers the attempt timed out, kills the attempt's process, and starts a new attempt.
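With streaming, you can set mapred.task.timeout as high as you need on the command line that launches your job (newer Hadoop releases name the property mapreduce.task.timeout and prefer the generic -D option over -jobconf), e.g.,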
hadoop jar test.jar -jobconf mapred.task.timeout=3600000
When developing a jar, you can set it in the job configuration like this:
long milliSeconds = 1000L*60*60; // 1 hour, computed as a long
conf.setLong("mapred.task.timeout", milliSeconds);
NOTE: The timeout property is specified in milliseconds.
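Because the property is in milliseconds, hand-written products such as 1000*60*60 are easy to get wrong, and in Java they are evaluated in int arithmetic, which overflows for timeouts longer than about 24.8 days. A minimal sketch of one way to avoid both problems, using the JDK's TimeUnit; the class name TaskTimeouts and the helper hoursToMillis are hypothetical names for illustration, not part of Hadoop:

```java
import java.util.concurrent.TimeUnit;

// Hypothetical helper (not a Hadoop class): builds timeout values in
// milliseconds without hand-written multiplication.
final class TaskTimeouts {
    // Property name used in this post.
    static final String TIMEOUT_KEY = "mapred.task.timeout";

    // Converts whole hours to milliseconds using long arithmetic.
    static long hoursToMillis(long hours) {
        return TimeUnit.HOURS.toMillis(hours);
    }

    public static void main(String[] args) {
        long milliSeconds = hoursToMillis(1);
        // In a real driver this value would be passed to
        // conf.setLong(TIMEOUT_KEY, milliSeconds).
        System.out.println(TIMEOUT_KEY + "=" + milliSeconds);
        // prints "mapred.task.timeout=3600000"
    }
}
```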
Alternatively, if you can, report progress to the job from within your task to avoid the timeout, e.g.,
context.progress();
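When the long-running work is a forked sub-process, the task thread is blocked and cannot call context.progress() itself. One possible approach, sketched here under assumptions, is to report progress from a background thread while the work runs. The class ProgressHeartbeat and the method runWithHeartbeat are hypothetical names; the callback is a plain Runnable so the sketch compiles without Hadoop on the classpath.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Hypothetical helper (not a Hadoop class): keeps calling a progress
// callback at a fixed interval while a blocking piece of work runs.
final class ProgressHeartbeat {

    // Runs `work` on the calling thread and invokes `reportProgress`
    // every `intervalMillis` milliseconds from a daemon thread.
    static void runWithHeartbeat(Runnable work, Runnable reportProgress, long intervalMillis) {
        ScheduledExecutorService scheduler =
            Executors.newSingleThreadScheduledExecutor(r -> {
                Thread t = new Thread(r, "progress-heartbeat");
                t.setDaemon(true); // never keeps the JVM alive on its own
                return t;
            });
        // First report fires immediately, then once per interval.
        scheduler.scheduleAtFixedRate(reportProgress, 0, intervalMillis, TimeUnit.MILLISECONDS);
        try {
            work.run(); // e.g. start the sub-process and wait for it
        } finally {
            scheduler.shutdownNow(); // stop reporting once the work ends or fails
        }
    }
}
```

Inside a mapper this might be called as ProgressHeartbeat.runWithHeartbeat(() -> runScript(), context::progress, 60_000L), where runScript() is a hypothetical method that launches the sub-process and waits for it. The interval should be well below the configured timeout. Note that an unconditional heartbeat also hides a sub-process that has genuinely hung, so it trades away part of the protection the timeout provides.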
This article comes from the "maxwell" blog; please retain this source link: http://drmaxwell.blog.51cto.com/394635/1347904