Configuring the Hadoop Capacity Scheduler

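Introduction to the Capacity Scheduler

The Capacity Scheduler supports the following features:

(1) Capacity guarantees. Multiple queues are supported, and each job is submitted to one of them. Every queue is configured with a percentage of the cluster's computing resources, and all jobs submitted to a queue share that queue's resources.

(2) Elasticity. Idle resources are handed to queues that have not yet reached their resource limit: when a queue running below its limit needs resources, it receives them as soon as they become free.

(3) Priority support. Queues support priority-based job scheduling (the default is FIFO).

(4) Multi-tenancy. Several limits are applied together to keep a single job, user, or queue from monopolizing the resources of a queue or of the cluster.

(5) Resource-based scheduling. Resource-intensive jobs are supported: a job may use more resources than the default amount, so jobs with different resource requirements can be accommodated. At present, however, only memory is supported as a schedulable resource.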

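Configuration steps:

1. Copy $HADOOP_HOME/contrib/capacity-scheduler/hadoop-capacity-scheduler.jar into the $HADOOP_HOME/lib directory.

2. Edit conf/mapred-site.xml on the node that runs the JobTracker (the NameNode host, if the two daemons share a machine):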

  <property>
    <name>mapred.jobtracker.taskScheduler</name>
    <value>org.apache.hadoop.mapred.CapacityTaskScheduler</value>
  </property>
  <property>
    <name>mapred.queue.names</name>
    <value>default,hadoop,hive</value>
  </property>

3. Edit the conf/capacity-scheduler.xml configuration file:

<?xml version="1.0"?>

<!-- This is the configuration file for the resource manager in Hadoop. -->
<!-- You can configure various scheduling parameters related to queues. -->
<!-- The properties for a queue follow a naming convention, such as -->
<!-- mapred.capacity-scheduler.queue.<queue-name>.property-name. -->

<configuration>
  <!-- Capacity scheduler Job Initialization configuration parameters -->
  <property>
    <name>mapred.capacity-scheduler.init-poll-interval</name>
    <value>5000</value>
    <description>The amount of time in milliseconds which is used to poll the job queues for jobs to initialize.
    </description>
  </property>
  <property>
    <name>mapred.capacity-scheduler.init-worker-threads</name>
    <value>5</value>
    <description>Number of worker threads used by the
    initialization poller to initialize jobs in a set of queues.
    If this number equals the number of job queues, a single
    thread initializes the jobs of each queue. If it is smaller,
    each thread is assigned a set of queues. If it is larger,
    the number of threads is set to the number of job queues.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.maximum-system-jobs</name>
    <value>30</value>
    <description>Maximum number of jobs in the system which can be initialized,
    concurrently, by the Capacity Scheduler.
    </description>
  </property>

<!--hadoop queue-->
  <property>
    <name>mapred.capacity-scheduler.queue.hadoop.capacity</name>
    <value>30</value>
    <description>Percentage of the number of slots in the cluster that are to be available for jobs in this queue.
    </description>    
  </property>
  
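  <property>
    <name>mapred.capacity-scheduler.queue.hadoop.maximum-capacity</name>
    <value>-1</value>
    <description>Limit beyond which the queue cannot use the capacity of the cluster. -1 means no limit.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.hadoop.supports-priority</name>
    <value>true</value>
    <description>If true, priorities of jobs will be taken into account in scheduling decisions.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.hadoop.minimum-user-limit-percent</name>
    <value>100</value>
    <description>Limit on the percentage of the queue's resources allocated to a single user when there is competition for them. A value of 100 means no user limit is imposed.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.hadoop.user-limit-factor</name>
    <value>3</value>
    <description>The multiple of the queue capacity which can be configured to allow a single user to acquire more slots.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.hadoop.maximum-initialized-active-tasks</name>
    <value>200000</value>
    <description>The maximum number of tasks, across all jobs in the queue, which can be initialized concurrently.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.hadoop.maximum-initialized-active-tasks-per-user</name>
    <value>100000</value>
    <description>The maximum number of tasks per user, across all of the user's jobs in the queue, which can be initialized concurrently.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.hadoop.init-accept-jobs-factor</name>
    <value>10</value>
    <description>The multiple of (maximum-system-jobs * queue-capacity) used to determine the number of jobs which are accepted by the scheduler.
    </description>
  </property>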

  <property>
    <name>mapred.capacity-scheduler.default-maximum-initialized-jobs-per-user</name>
    <value>5</value>
    <description>The maximum number of jobs to be pre-initialized for a user
    of the job queue.
    </description>
  </property>
  
<!-- hive -->
  <property>
    <name>mapred.capacity-scheduler.queue.hive.capacity</name>
    <value>30</value>
    <description></description>    
  </property>
  
  <property>
    <name>mapred.capacity-scheduler.queue.hive.maximum-capacity</name>
    <value>-1</value>
    <description></description>    
  </property>
  
  <property>
    <name>mapred.capacity-scheduler.queue.hive.supports-priority</name>
    <value>true</value>
    <description>If true, priorities of jobs will be taken into account in scheduling decisions.
    </description>
  </property>
  
  <property>
    <name>mapred.capacity-scheduler.queue.hive.minimum-user-limit-percent</name>
    <value>100</value>
    <description></description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.hive.user-limit-factor</name>
    <value>4</value>
    <description>The multiple of the queue capacity which can be configured to allow a single user to acquire more slots.
    </description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.hive.maximum-initialized-active-tasks</name>
    <value>200000</value>
    <description></description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.hive.maximum-initialized-active-tasks-per-user</name>
    <value>100000</value>
    <description></description>
  </property>
  
  <property>
    <name>mapred.capacity-scheduler.queue.hive.init-accept-jobs-factor</name>
    <value>10</value>
    <description></description>
  </property>

<!-- default --> 
  <property>
    <name>mapred.capacity-scheduler.queue.default.capacity</name>
    <value>40</value>
    <description></description>    
  </property>
  
  <property>
    <name>mapred.capacity-scheduler.queue.default.maximum-capacity</name>
    <value>-1</value>
    <description></description>    
  </property>
  
  <property>
    <name>mapred.capacity-scheduler.queue.default.supports-priority</name>
    <value>true</value>
    <description></description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.minimum-user-limit-percent</name>
    <value>100</value>
    <description></description>
  </property>
  
  <property>
    <name>mapred.capacity-scheduler.queue.default.user-limit-factor</name>
    <value>4</value>
    <description></description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.maximum-initialized-active-tasks</name>
    <value>200000</value>
    <description></description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.maximum-initialized-active-tasks-per-user</name>
    <value>100000</value>
    <description></description>
  </property>

  <property>
    <name>mapred.capacity-scheduler.queue.default.init-accept-jobs-factor</name>
    <value>10</value>
    <description></description>
  </property>

</configuration>
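
Before restarting, it can help to sanity-check the file. The sketch below is only an illustration, not part of Hadoop: it assumes that every queue listed in mapred.queue.names should have a capacity property and that the capacities should not add up to more than 100. The function names (read_properties, check_capacities) and the trimmed SAMPLE document are made up for this example.

```python
import xml.etree.ElementTree as ET

QUEUE_PREFIX = "mapred.capacity-scheduler.queue."


def read_properties(xml_text):
    """Return {name: value} for every <property> in a Hadoop config document."""
    props = {}
    for prop in ET.fromstring(xml_text).iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def check_capacities(queue_names, props):
    """Return a list of problems; an empty list means the check passed."""
    problems = []
    total = 0.0
    for queue in queue_names:
        key = QUEUE_PREFIX + queue + ".capacity"
        if key not in props:
            problems.append("queue '%s' has no capacity property" % queue)
            continue
        total += float(props[key])
    # Assumption: the capacities of all queues must not exceed 100 percent.
    if total > 100:
        problems.append("capacities add up to %g, more than 100" % total)
    return problems


# Trimmed-down copy of the file above: only the three capacity properties.
SAMPLE = """
<configuration>
  <property>
    <name>mapred.capacity-scheduler.queue.default.capacity</name>
    <value>40</value>
  </property>
  <property>
    <name>mapred.capacity-scheduler.queue.hadoop.capacity</name>
    <value>30</value>
  </property>
  <property>
    <name>mapred.capacity-scheduler.queue.hive.capacity</name>
    <value>30</value>
  </property>
</configuration>
"""

if __name__ == "__main__":
    queues = "default,hadoop,hive".split(",")  # value of mapred.queue.names
    problems = check_capacities(queues, read_properties(SAMPLE))
    for line in problems or ["no problems found"]:
        print(line)
```

To check the real file instead of SAMPLE, read conf/capacity-scheduler.xml from disk and pass its text to read_properties.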

After saving the file, restart the JobTracker.

From then on, whenever capacity-scheduler.xml is modified, running hadoop mradmin -refreshQueues is enough to reload the settings.
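
To make the numbers in the file more concrete, here is a rough worked example. It is a sketch under stated assumptions, not the scheduler's actual code: the 200-slot cluster is hypothetical, the function queue_limits is made up for illustration, rounding is ignored (the values used here divide evenly), and the real scheduler applies further conditions such as competition between users.

```python
MAXIMUM_SYSTEM_JOBS = 30  # mapred.capacity-scheduler.maximum-system-jobs

# Per queue, copied from capacity-scheduler.xml above:
# capacity (%), user-limit-factor, init-accept-jobs-factor.
QUEUES = {
    "default": {"capacity": 40, "user_limit_factor": 4, "accept_factor": 10},
    "hadoop": {"capacity": 30, "user_limit_factor": 3, "accept_factor": 10},
    "hive": {"capacity": 30, "user_limit_factor": 4, "accept_factor": 10},
}


def queue_limits(queue, total_slots):
    """Estimate the limits of one queue for a cluster with total_slots slots."""
    cfg = QUEUES[queue]
    # Slots the queue is guaranteed: its capacity percentage of the cluster.
    guaranteed = total_slots * cfg["capacity"] // 100
    # A single user may grow to user-limit-factor times the queue capacity,
    # but never beyond the cluster itself (maximum-capacity is -1, no limit).
    single_user_cap = min(guaranteed * cfg["user_limit_factor"], total_slots)
    # The queue's share of maximum-system-jobs is proportional to its capacity.
    initialized_jobs = MAXIMUM_SYSTEM_JOBS * cfg["capacity"] // 100
    # Jobs accepted = init-accept-jobs-factor * (maximum-system-jobs * capacity).
    accepted_jobs = initialized_jobs * cfg["accept_factor"]
    return {
        "guaranteed_slots": guaranteed,
        "single_user_cap": single_user_cap,
        "initialized_jobs": initialized_jobs,
        "accepted_jobs": accepted_jobs,
    }


if __name__ == "__main__":
    for name in sorted(QUEUES):
        print(name, queue_limits(name, total_slots=200))
```

Under these assumptions the hadoop queue is guaranteed 60 of the 200 slots, a single user can take at most 180 slots when the rest of the cluster is idle, 9 of the queue's jobs can be initialized at once, and up to 90 jobs are accepted before new submissions are rejected.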

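4. Finally, how to use the queues:
MapReduce: in the job's code, set the queue the job belongs to on its JobConf, for example hive:
conf.setQueueName("hive");
Hive: before running a Hive job, set the queue it belongs to, for example hive:
set mapred.job.queue.name=hive;

To set the job name: set mapred.job.name=hadooptest;

To set the job priority: set mapred.job.priority=HIGH;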

Copyright notice: this article was contributed by a site user and reflects only the author's own views. To repost it, contact the author and cite the source: https://www.elefans.com/xitong/1728622119a1166485.html
