2、OpenMP的任务调度schedule(static|dynamic|guided|runtime[size])|电子爱好者

admin管理员组
文章数量:1565367

基本思想：对于for的任务分担 schedule(static|dynamic|guided|runtime[size])

（1）for的任务分担

#pragma omp parallel
{
#pragma omp for
for(int i=0;i<num/2;i++)//num此为偶数
{
 .....
}
#pragma omp for
for(int i=num/2;i<num;i++) 
{
.......
}
}

测试代码

#include <iostream>
#include <omp.h>
#include<chrono>
#include<vector>
#include<thread>
using namespace std;
using namespace chrono;

void sequentialProgram(int num)
{

    for(int i=0;i<num;i++)
    {
       // std::cout<<"hello world"<<std::endl;
        printf("%s the current thread id: %d\n","hello world",omp_get_thread_num());
    }
}

void  parallelProgram(int num)
{


#pragma omp parallel
    {
#pragma omp for
    for(int i=0;i<num/2;i++)
    {
        //std::cout<<"hello world"<<"the current thread id: "<<omp_get_thread_num()<<std::endl;
        printf("%s the current thread id: %d\n","A hello world",omp_get_thread_num());
}
#pragma omp for
    for(int i=num/2;i<num;i++) {
        //std::cout<<"hello world"<<"the current thread id: "<<omp_get_thread_num()<<std::endl;
        printf("%s the current thread id: %d\n","B hello world",omp_get_thread_num());
    }
    }
}

int main() {


    int num=omp_get_num_procs()*2;
    auto start_time=std::chrono::steady_clock::now();
    sequentialProgram(num);
    auto end_time=std::chrono::steady_clock::now();
    std::cout<<"sequentialProgram elapse time: "<<std::chrono::duration<double>(end_time-start_time).count()<<" seconds"<<std::endl;

    start_time=std::chrono::steady_clock::now();
    parallelProgram(num);
    end_time=std::chrono::steady_clock::now();
    std::cout<<"parallelProgram elapse time: "<<std::chrono::duration<double>(end_time-start_time).count()<<" seconds"<<std::endl;
    return 0;
}

测试结果，在一个并行域中，对多个for进行制导指令处理，可以使用调度指令简化完成这一操作

F:\OpenMP\cmake-build-debug\OpenMP.exe
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
hello world the current thread id: 0
sequentialProgram elapse time: 0.0776085 seconds
A hello world the current thread id: 1
A hello world the current thread id: 0
A hello world the current thread id: 3
A hello world the current thread id: 5
A hello world the current thread id: 7
A hello world the current thread id: 10
A hello world the current thread id: 9
A hello world the current thread id: 8
A hello world the current thread id: 2
A hello world the current thread id: 4
A hello world the current thread id: 6
A hello world the current thread id: 11
B hello world the current thread id: 1
B hello world the current thread id: 0
B hello world the current thread id: 7
B hello world the current thread id: 9
B hello world the current thread id: 2
B hello world the current thread id: 6
B hello world the current thread id: 4
B hello world the current thread id: 10
B hello world the current thread id: 3
B hello world the current thread id: 8
B hello world the current thread id: 5
B hello world the current thread id: 11
parallelProgram elapse time: 0.0527985 seconds

Process finished with exit code 0

（2）使用for的调度指令schedule

#pragma omp parallel for schedule(static|dynamic}guided|runtime[size])
 for (int i = 0; i < num; i++) 
{
       .......
    }

当写成

#pragma omp parallel for
等价
#pragma omp parallel for schedule(static)
等价
#pragma omp parallel for schedule(static,num/omp_get_num_procs()) //  num=omp_get_num_procs()*2;

其中static 设置为多少线程来处理迭代计算任务

其中size 为可选项，当不设置size参数时，默认for循环的线程以num/omp_get_num_procs()来分配

测试代码

#include <iostream>
#include <omp.h>
#include<chrono>
#include<vector>
#include<thread>
using namespace std;
using namespace chrono;

void sequentialProgram(int num)
{

    for(int i=0;i<num;i++)
    {
       // std::cout<<"hello world"<<std::endl;
        printf("i=%d the current thread id: %d\n",i,omp_get_thread_num());
    }
}

void  parallelProgram(int num) {

//#pragma omp parallel for   
//#pragma omp parallel for schedule(static)
#pragma omp parallel for schedule(static,2) // 上述三种预处理指令是一样的效果 注意设置的num循环测试
    for (int i = 0; i < num; i++) {
        //std::cout<<"hello world"<<"the current thread id: "<<omp_get_thread_num()<<std::endl;
        printf("i=%d the current thread id: %d\n", i, omp_get_thread_num());
    }
}

int main() {


    int num=omp_get_num_procs()*2;
    auto start_time=std::chrono::steady_clock::now();
    sequentialProgram(num);
    auto end_time=std::chrono::steady_clock::now();
    std::cout<<"sequentialProgram elapse time: "<<std::chrono::duration<double>(end_time-start_time).count()<<" seconds"<<std::endl;

    start_time=std::chrono::steady_clock::now();
    parallelProgram(num);
    end_time=std::chrono::steady_clock::now();
    std::cout<<"parallelProgram elapse time: "<<std::chrono::duration<double>(end_time-start_time).count()<<" seconds"<<std::endl;
    return 0;
}

测试结果是相同的

F:\OpenMP\cmake-build-debug\OpenMP.exe
i=0 the current thread id: 0
i=1 the current thread id: 0
i=2 the current thread id: 0
i=3 the current thread id: 0
i=4 the current thread id: 0
i=5 the current thread id: 0
i=6 the current thread id: 0
i=7 the current thread id: 0
i=8 the current thread id: 0
i=9 the current thread id: 0
i=10 the current thread id: 0
i=11 the current thread id: 0
i=12 the current thread id: 0
i=13 the current thread id: 0
i=14 the current thread id: 0
i=15 the current thread id: 0
i=16 the current thread id: 0
i=17 the current thread id: 0
i=18 the current thread id: 0
i=19 the current thread id: 0
i=20 the current thread id: 0
i=21 the current thread id: 0
i=22 the current thread id: 0
i=23 the current thread id: 0
sequentialProgram elapse time: 0.0422739 seconds
i=0 the current thread id: 0
i=1 the current thread id: 0
i=4 the current thread id: 2
i=5 the current thread id: 2
i=14 the current thread id: 7
i=15 the current thread id: 7
i=18 the current thread id: 9
i=19 the current thread id: 9
i=16 the current thread id: 8
i=17 the current thread id: 8
i=12 the current thread id: 6
i=13 the current thread id: 6
i=2 the current thread id: 1
i=3 the current thread id: 1
i=10 the current thread id: 5
i=11 the current thread id: 5
i=6 the current thread id: 3
i=7 the current thread id: 3
i=8 the current thread id: 4
i=9 the current thread id: 4
i=22 the current thread id: 11
i=23 the current thread id: 11
i=20 the current thread id: 10
i=21 the current thread id: 10
parallelProgram elapse time: 0.0412098 seconds

Process finished with exit code 0

(3)虽然参数static均衡的分担任务，但是存在某些线程处理速度上的差异，因此引入dynamic

#pragma omp parallel for schedule(dynamic) 
    for (int i = 0; i < num; i++) {
      ......
    }
}

测试代码

#include <iostream>
#include <omp.h>
#include<chrono>
#include<vector>
#include<thread>
using namespace std;
using namespace chrono;

void sequentialProgram(int num)
{

    for(int i=0;i<num;i++)
    {
       // std::cout<<"hello world"<<std::endl;
        printf("i=%d the current thread id: %d\n",i,omp_get_thread_num());
    }
}

void  parallelProgram(int num) {


#pragma omp parallel for schedule(dynamic) 
    for (int i = 0; i < num; i++) {
        //std::cout<<"hello world"<<"the current thread id: "<<omp_get_thread_num()<<std::endl;
        printf("i=%d the current thread id: %d\n", i, omp_get_thread_num());
    }
}

int main() {


    int num=omp_get_num_procs()*2;
    auto start_time=std::chrono::steady_clock::now();
    sequentialProgram(num);
    auto end_time=std::chrono::steady_clock::now();
    std::cout<<"sequentialProgram elapse time: "<<std::chrono::duration<double>(end_time-start_time).count()<<" seconds"<<std::endl;

    start_time=std::chrono::steady_clock::now();
    parallelProgram(num);
    end_time=std::chrono::steady_clock::now();
    std::cout<<"parallelProgram elapse time: "<<std::chrono::duration<double>(end_time-start_time).count()<<" seconds"<<std::endl;
    return 0;
}

测试结果可以看出，线程id=9处理速度较快，因此承担了更多的任务,当然也可以使用size进行限制线程处理任务的数量~

F:\OpenMP\cmake-build-debug\OpenMP.exe
i=0 the current thread id: 0
i=1 the current thread id: 0
i=2 the current thread id: 0
i=3 the current thread id: 0
i=4 the current thread id: 0
i=5 the current thread id: 0
i=6 the current thread id: 0
i=7 the current thread id: 0
i=8 the current thread id: 0
i=9 the current thread id: 0
i=10 the current thread id: 0
i=11 the current thread id: 0
i=12 the current thread id: 0
i=13 the current thread id: 0
i=14 the current thread id: 0
i=15 the current thread id: 0
i=16 the current thread id: 0
i=17 the current thread id: 0
i=18 the current thread id: 0
i=19 the current thread id: 0
i=20 the current thread id: 0
i=21 the current thread id: 0
i=22 the current thread id: 0
i=23 the current thread id: 0
sequentialProgram elapse time: 0.041236 seconds
i=0 the current thread id: 2
i=6 the current thread id: 9
i=13 the current thread id: 9
i=14 the current thread id: 9
i=15 the current thread id: 9
i=16 the current thread id: 9
i=17 the current thread id: 9
i=18 the current thread id: 9
i=19 the current thread id: 9
i=20 the current thread id: 9
i=21 the current thread id: 9
i=22 the current thread id: 9
i=23 the current thread id: 9
i=5 the current thread id: 11
i=3 the current thread id: 1
i=4 the current thread id: 8
i=7 the current thread id: 4
i=1 the current thread id: 10
i=2 the current thread id: 3
i=8 the current thread id: 0
i=9 the current thread id: 6
i=10 the current thread id: 7
i=11 the current thread id: 5
i=12 the current thread id: 2
parallelProgram elapse time: 0.0399313 seconds

Process finished with exit code 0

（4）guided 采用启发式调度算法，开始分配较大的块，然后逐渐变小，最后分配给每个线程的任务为size数量，如果没设置size，将在最后分配给每个任务量为1

#pragma omp parallel for schedule(guided) 
    for (int i = 0; i < num; i++) {
      .....
    }
}

测试代码

#include <iostream>
#include <omp.h>
#include<chrono>
#include<vector>
#include<thread>
using namespace std;
using namespace chrono;

void sequentialProgram(int num)
{

    for(int i=0;i<num;i++)
    {
       // std::cout<<"hello world"<<std::endl;
        printf("i=%d the current thread id: %d\n",i,omp_get_thread_num());
    }
}

void  parallelProgram(int num) {


#pragma omp parallel for schedule(guided) 
    for (int i = 0; i < num; i++) {
        //std::cout<<"hello world"<<"the current thread id: "<<omp_get_thread_num()<<std::endl;
        printf("i=%d the current thread id: %d\n", i, omp_get_thread_num());
    }
}

int main() {


    int num=omp_get_num_procs()*2-5;
    auto start_time=std::chrono::steady_clock::now();
    sequentialProgram(num);
    auto end_time=std::chrono::steady_clock::now();
    std::cout<<"sequentialProgram elapse time: "<<std::chrono::duration<double>(end_time-start_time).count()<<" seconds"<<std::endl;

    start_time=std::chrono::steady_clock::now();
    parallelProgram(num);
    end_time=std::chrono::steady_clock::now();
    std::cout<<"parallelProgram elapse time: "<<std::chrono::duration<double>(end_time-start_time).count()<<" seconds"<<std::endl;
    return 0;
}

测试结果，第一次先为每个线程分配两个任务，然后最后变成每个线程只能承担一个任务执行

F:\OpenMP\cmake-build-debug\OpenMP.exe
i=0 the current thread id: 0
i=1 the current thread id: 0
i=2 the current thread id: 0
i=3 the current thread id: 0
i=4 the current thread id: 0
i=5 the current thread id: 0
i=6 the current thread id: 0
i=7 the current thread id: 0
i=8 the current thread id: 0
i=9 the current thread id: 0
i=10 the current thread id: 0
i=11 the current thread id: 0
i=12 the current thread id: 0
i=13 the current thread id: 0
i=14 the current thread id: 0
i=15 the current thread id: 0
i=16 the current thread id: 0
i=17 the current thread id: 0
i=18 the current thread id: 0
sequentialProgram elapse time: 0.033042 seconds
i=0 the current thread id: 0
i=1 the current thread id: 0
i=16 the current thread id: 0
i=17 the current thread id: 0
i=18 the current thread id: 0
i=6 the current thread id: 5
i=7 the current thread id: 5
i=2 the current thread id: 3
i=3 the current thread id: 3
i=13 the current thread id: 6
i=14 the current thread id: 2
i=15 the current thread id: 11
i=10 the current thread id: 8
i=12 the current thread id: 9
i=8 the current thread id: 7
i=9 the current thread id: 1
i=11 the current thread id: 10
i=4 the current thread id: 4
i=5 the current thread id: 4
parallelProgram elapse time: 0.0334159 seconds

Process finished with exit code 0

（5）runtime 设置之后，将获取系统的任务属性来来调用上述三种中的一种方法,我测试一下，好像每次都是以dynamic 的方式调用~~

#pragma omp parallel for schedule(runtime)
    for (int i = 0; i < num; i++) {
       ......
    }
}

测试代码

#include <iostream>
#include <omp.h>
#include<chrono>
#include<vector>
#include<thread>
using namespace std;
using namespace chrono;

void sequentialProgram(int num)
{

    for(int i=0;i<num;i++)
    {
       // std::cout<<"hello world"<<std::endl;
        printf("i=%d the current thread id: %d\n",i,omp_get_thread_num());
    }
}

void  parallelProgram(int num) {


#pragma omp parallel for schedule(runtime)
    for (int i = 0; i < num; i++) {
        //std::cout<<"hello world"<<"the current thread id: "<<omp_get_thread_num()<<std::endl;
        printf("i=%d the current thread id: %d\n", i, omp_get_thread_num());
    }
}

int main() {


    int num=omp_get_num_procs()*2;
    auto start_time=std::chrono::steady_clock::now();
    sequentialProgram(num);
    auto end_time=std::chrono::steady_clock::now();
    std::cout<<"sequentialProgram elapse time: "<<std::chrono::duration<double>(end_time-start_time).count()<<" seconds"<<std::endl;

    start_time=std::chrono::steady_clock::now();
    parallelProgram(num);
    end_time=std::chrono::steady_clock::now();
    std::cout<<"parallelProgram elapse time: "<<std::chrono::duration<double>(end_time-start_time).count()<<" seconds"<<std::endl;
    return 0;
}

测试结果

F:\OpenMP\cmake-build-debug\OpenMP.exe
i=0 the current thread id: 0
i=1 the current thread id: 0
i=2 the current thread id: 0
i=3 the current thread id: 0
i=4 the current thread id: 0
i=5 the current thread id: 0
i=6 the current thread id: 0
i=7 the current thread id: 0
i=8 the current thread id: 0
i=9 the current thread id: 0
i=10 the current thread id: 0
i=11 the current thread id: 0
i=12 the current thread id: 0
i=13 the current thread id: 0
i=14 the current thread id: 0
i=15 the current thread id: 0
i=16 the current thread id: 0
i=17 the current thread id: 0
i=18 the current thread id: 0
i=19 the current thread id: 0
i=20 the current thread id: 0
i=21 the current thread id: 0
i=22 the current thread id: 0
i=23 the current thread id: 0
sequentialProgram elapse time: 0.0410057 seconds
i=0 the current thread id: 1
i=8 the current thread id: 9
i=13 the current thread id: 9
i=14 the current thread id: 9
i=15 the current thread id: 9
i=16 the current thread id: 9
i=17 the current thread id: 9
i=18 the current thread id: 9
i=19 the current thread id: 9
i=20 the current thread id: 9
i=21 the current thread id: 9
i=22 the current thread id: 9
i=23 the current thread id: 9
i=6 the current thread id: 2
i=5 the current thread id: 8
i=7 the current thread id: 11
i=3 the current thread id: 10
i=4 the current thread id: 3
i=2 the current thread id: 4
i=1 the current thread id: 7
i=9 the current thread id: 0
i=10 the current thread id: 6
i=11 the current thread id: 5
i=12 the current thread id: 1
parallelProgram elapse time: 0.042588 seconds

Process finished with exit code 0

本文标签： static schedule OpenMP dynamic size

版权声明：本文标题：2、OpenMP的任务调度schedule(static|dynamic|guided|runtime[size]) 内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：https://www.elefans.com/dianzi/1725780473a1042182.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

电子爱好者 - 最新技术资讯及电子产品介绍！

2、OpenMP的任务调度schedule(static|dynamic|guided|runtime[size])

更多相关文章

python-schedule模块基本用法

Python定时模块--schedule

import schedule ImportError: No module named schedule

时间自动过期解决方案之node-schedule

【python】RuntimeError: Set changed size during iteration 问题解决

OpenMP(三）#pragma omp critical

【OpenMP】#pragma omp critical 子句

OpenMP critical

OpenMP critical Lock() atomic 3种锁的比较

设置rem或font-size小于12px在浏览器中无效

【解决报错】ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 96 from

numpy.ndarray size changed, may indicate binary incompatibil

ValueError: numpy.ndarray size changed, may indicate binary incompatibility.

RuntimeWarning: numpy.ufunc size changed, may indicate binary incompatibility. Expected 192 from C h

解决ValueError: numpy.ufunc size changed, may indicate binary incompatibility.

解决ValueError: numpy.ufunc size changed, may indicate binary incompatibility. Expected 216 from C h

RuntimeWarning: numpy.dtype size changed, may indicate binary incompatibility.

已解决ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 96 from C h

完美解决ValueError: numpy.ndarray size changed, may indicate binary incompatibility. Expected 96

...Anaconda3libimportlib_bootstrap.py:219: RuntimeWarning: numpy.ufunc size changed, may indica

发表评论

推荐文章

Cognitive Services 主要有哪些应用？可以用来作什么？

[Paper Reading]Towards a New Generation of Cognitive Diagnosis

Vue3 - [兼容PC和手机H5] 详细监听浏览器刷新关闭前进后退事件，用户点击关闭和刷新页面前 “拦截“ 操作并弹出提示框（实时监听用户关闭或刷新网页，触发时文字提醒并执行自定义操作）

ubantu下谷歌浏览器安装包

计算机换用户无法启动软件吗,电脑软件无法启动常见的三种原因以及解决方法...

热门文章

七种有效将msvcp140.dll丢失的解决方法，快速修复msvcp140.dll错误

手机浏览器打开微信app的方法

手机浏览器不能显示轮播图

Android之讯飞语音-文字转语音（不用另外安装语音合成包apk）遇到的问题

1-1.Win10系统利用Pycharm社区版安装Django搭建一个简单Python Web项目的步骤之一

打印机SMB设置

用 Python 破解 WIFI 密码，走到哪里都能连 WIFI

Mac输入法设置

【例0855】create red points 在CAD程序中创建红色点

DXF 格式详解

最新文章

C#扫雷外挂辅助工具

MathType最新破解版7.4官方汉化安装包一键安装包下载

PDF电子书如何一键添加书签

win7系统禁止运行指定软件

录屏软件Camtasia2024安装激活图文教程

保姆式安装CodeFormer人脸修复工具

C#+API实现指定窗体激活

PhotoZoom ProClassic 9.0.2激活版安装激活图文教程

win更新管理工具有用吗_7个非常有用的在线业务管理工具

联想工程师专用小工具（共计204款）

MathType7.4中文破解版附MathType软件完整激活教程+激活补丁

在虚拟机VirtualBox7.0.6+openEuler20.03TSL上安装部署openGauss3.1.1数据库快速（一键）安装指导手册

高效实用工具库，简化操作没难度 | 开源专题 No.77

一键安装CDR2024手把手教学下载安装coreldraw2024软件

Guitar Pro8.2中文版简谱制作工具及安装图文激活Guitar Pro8教程

小米手机肿么还原时钟

15000流明是多少瓦

一般普通投影机功率多大?

苹果绿联转换器有些投影机不能用

坚果V9投影机具体参数?

有关九年级作文850字精选

80后90后_高一作文

中级卫生专业资格中医全科学主治医师中级模拟题2021年(9)案与解析

(精品)师范大学招考硕士研究生课程八六0试卷

ZXMVC8900(V3

【模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313】模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313 官方免费下载

【生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD】生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD 官方免费下载

【模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311】模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311 官方免费下载

【模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311】模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311 官方免费下载

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改 官方免费下载

如何实现高效的treenode搜索算法

treenode与链表有何本质区别

在哪些场景下应优先考虑使用treenode

treenode在树形结构中的角色是什么

如何通过treenode实现二叉树

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改官方免费下载