APP Profiler

AMD APP Profiler has been succeeded by AMD’s CodeXL. With the exception of critical bug fixes, there will be no new versions of APP Profiler. For new features, continuous support and a unified developer tools suite, please visit AMD’s CodeXL web page and check out the CodeXL Forum.

The AMD APP Profiler is a performance analysis tool that gathers data from the OpenCL™ run-time and AMD Radeon™ GPUs during the execution of an OpenCL™ application. This information can then be used to discover bottlenecks in the application and find ways to optimize its performance on AMD platforms.


OpenCL and the OpenCL logo are trademarks of Apple Inc. used by permission by Khronos.

New in Version 2.5

  • Support for AMD APP SDK v2.7.
  • Support for OpenCL™ 1.2.
  • Support for collecting performance counters on APU devices.
  • Full support for profiling with AMD Radeon™ HD7000 series GPUs based on Graphics Core Next/Southern Islands:
    • Added support for kernel occupancy analysis.
    • Added support for collecting performance counters for DirectCompute (DirectX 11) applications.
    • Addition of SALUBusy counter.
    • Fixed value reported for VALUBusy counter.
    • The values reported for LDSFetchInsts and LDSWriteInsts counters were inaccurate on AMD Radeon™ HD7000 series GPUs; thus for those GPUs, those two counters have been replaced by a single LDSInsts counter.
    • Fixed display of kernel ISA.
  • Improved OpenCL™ analysis module:
    • Added detection of deprecated OpenCL™ APIs.
  • Added support for showing source and destination location, as well as zero-copy status for memory transfers initiated using clEnqueueMapBuffer or clEnqueueMapImage. This information is displayed in the API Trace view.
  • Added support for Microsoft® Visual Studio® projects that use User-defined Macros in the project settings.
  • Fixed the --workingdirectory (-w) command line switch (set current directory) on Linux.
  • Fixed some problems with importing previously-generated profile results into Microsoft® Visual Studio®.
  • Changed the default installation directory on Windows to %PROGRAMFILES(X86)%\AMD\AMD APP Profiler to make it more consistent with other AMD tools, such as gDEBugger and CodeAnalyst.

Getting Started

Requirements

  • Microsoft Windows Vista or 7 (32 bit/64 bit) or Linux 32-bit/64-bit
  • [Optional] Microsoft Visual Studio 2008 and 2010 (Standard/Professional/Team System Edition)
  • To profile OpenCL™ applications:
    • AMD APP SDK v2.7 or later
    • [GPU device] AMD Catalyst with OpenCL™ GPU support (12.4 or newer)
    • [GPU device] AMD Radeon™ HD 4000 series or newer
  • To profile DirectCompute applications:
    • Microsoft DirectX run-time (June 2010 or later)
    • AMD Radeon™ HD 5000 series or newer

Features

  • Collect OpenCL™ Application Trace

    • View and debug the input parameters and output results for all OpenCL™ API calls
    • Search the API calls
    • Navigate to the source code that called an OpenCL™ API
    • Specify which OpenCL™ APIs will be traced
  • Collect GPU Performance Counters of AMD Radeon™ graphics cards
    • Show kernel resource usage
    • Show the number of instructions executed by the GPU
    • Show the GPU utilization
    • Show the GPU memory access characteristics
    • Measure kernel execution time
  • OpenCL™ Timeline visualization
    • Visualize the application high level structure
    • Visualize kernel execution and data transfer operations
    • Visualize host code execution
    • Annotate host code in the timeline with performance markers using the included CLPerfMarkerAMD library
  • OpenCL™ Application Summary pages
    • Find incorrect or inefficient usage of the OpenCL™ API using the OpenCL™ analysis module
    • Find the API hotspots
    • Find the bottleneck between kernel execution and data transfer operations
    • Find the top 10 data transfer and kernel execution operations
  • OpenCL™ Kernel Occupancy Viewer
    • Calculate and display a kernel occupancy number, which estimates the number of in-flight wavefronts on a compute unit as a percentage of the theoretical maximum number of wavefronts that the compute unit can support
    • Find out which kernel resource (GPR usage, LDS size, or work-group size) is currently limiting the number of in-flight wavefronts
    • Display graphs showing how kernel occupancy would be affected by changes in each kernel resource
  • Display the AMD IL and ISA (hardware disassembly) code for OpenCL™ kernels and the DXASM code for DirectCompute kernels.
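
The kernel occupancy number described above is a simple ratio, and a small worked example may make it more concrete. The sketch below is not the profiler's implementation: the function name `occupancy_percent`, the per-resource wavefront limits, and the maximum of 40 wavefronts per compute unit are all hypothetical values chosen for illustration. It assumes that each kernel resource (GPR usage, LDS size, work-group size) independently caps the number of in-flight wavefronts, and that the smallest cap determines occupancy.

```python
def occupancy_percent(limits, max_wavefronts):
    """Estimate kernel occupancy from per-resource wavefront limits.

    limits: dict mapping a resource name to the number of in-flight
            wavefronts that resource would allow on one compute unit.
    max_wavefronts: theoretical maximum wavefronts per compute unit.
    Returns (limiting_resource, occupancy as a percentage).
    """
    # The resource that allows the fewest wavefronts is the limiting one.
    limiting_resource = min(limits, key=limits.get)
    # In-flight wavefronts can never exceed the hardware maximum.
    in_flight = min(limits[limiting_resource], max_wavefronts)
    return limiting_resource, 100.0 * in_flight / max_wavefronts


# Hypothetical figures for illustration only, not taken from any real GPU.
limits = {"GPR usage": 24, "LDS size": 16, "Work-group size": 40}
resource, pct = occupancy_percent(limits, max_wavefronts=40)
print(f"{pct:.0f}% occupancy, limited by {resource}")
```

With these made-up numbers, LDS size is the limiting resource, so reducing the kernel's LDS allocation would be the first change to try when raising occupancy.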

Support

  • AMD APP Profiler Knowledge Base
  • GPU Developer Tool Forum

Resources

  • AMD Accelerated Parallel Processing OpenCL Programming Guide
  • AMD APP Profiler’s article at Siggraph 2010
  • AMD OpenCL™ Zone

Download

| File Name | Version | Size | Launch Date | OS | Bitness | Description |
|---|---|---|---|---|---|---|
| AMDAPPProfiler-v2.5.1804-lnx.tgz | 2.5 | 3 MB | 05/16/2012 | Linux® | 32-bit/64-bit | AMD APP Profiler for OpenCL on Linux platforms |
| AMDAPPProfiler-v2.5.1777.msi | 2.5 | 8 MB | 05/16/2012 | Windows® | 32-bit/64-bit | AMD APP Profiler for OpenCL on Windows platforms |
