Tool GDB

Examining Memory (data or in machine instructions)

You can use the command x (for “examine”) to examine memory in any of several formats, independently of your program's data types.

x/nfu addr

x addr

x

n, the repeat count

The repeat count is a decimal integer; the default is 1. It specifies how much memory (counting by units u) to display.

f, the display format

The display format is one of the formats used by print (`x', `d', `u', `o', `t', `a', `c', `f', `s'), and in addition `i' (for machine instructions). The default is `x' (hexadecimal) initially. The default changes each time you use either x or print.

x -- Print as integer in hexadecimal.

d -- Print as integer in signed decimal.

u -- Print as integer in unsigned decimal.

o -- Print as integer in octal.

t -- Print as integer in binary. The letter `t' stands for “two”. 1

a -- Print as an address, both absolute in hexadecimal and as an offset from the nearest preceding symbol. You can use this format used to discover where (in what function) an unknown address is located:

(gdb) p/a 0x54320

$3 = 0x54320 <_initialize_vx+396>

c -- Regard as an integer and print it as a character constant.

f -- Regard as a floating point number and print using typical floating point syntax.

s --Regard as a string

u, the unit size  -- The unit size is any of

b -- Bytes.

h -- Halfwords (two bytes).

w -- Words (four bytes). This is the initial default.

g -- Giant words (eight bytes).

Each time you specify a unit size with x, that size becomes the default unit the next time you use x. For the `i' format, the unit size is ignored and is normally not written. For the `s' format (string), the unit size defaults to `b', unless it is explicitly given. Use x /hs to display 16-bit char strings and x /ws to display 32-bit strings. The next use of x /s will again display 8-bit strings. Note that the results depend on the programming language of the current compilation unit. If the language is C, the `s' modifier will use the UTF-16 encoding while `w' will use UTF-32. The encoding is set by the programming language and cannot be altered.

Example:

x/4xw $sp -- print the four words (`w') of memory above the stack pointer  in hexadecimal (`x').

x/3uh 0x54320 -- display three halfwords (h) of memory, formatted as unsigned decimal integers (`u'), starting at address 0x54320.

x/5i $pc-6 -- examines machine instructions.

Registers

The names of registers are different for each machine; use info registers to see the names used on your machine.
info registers– check register names
gdb four “standard” register:
$pc -- the program counter register, $RIP
$sp -- the stack pointer register, $RSP
$fp -- a pointer to the current stack frame, “base pointer” register, $RBP
$ps -- a register that contains the processor status
Example:
p/x $pc
x/i $pc

GDB conversation

If displaying variable, which is a address, gdb will try to display its content(value) together. Format: 
Address:  content(value) 
Example:
(gdb) x/x $pc  -- $pc is a address
0x40071e <_Z3addii+4>:  0x10ec8348
(gdb) x/i $pc
0x40071e <_Z3addii+4>:  sub    $0x10,%rsp
(gdb) x/xg $pc
0x40071e <_Z3addii+4>:  0x89fc7d8910ec8348
(gdb) p $pc – print value of $pc only
$9 = (void (*)()) 0x40071e <add(int, int)+4>
(gdb) p/x $pc
$10 = 0x40071e 

Advance guide for debugging a program

Introduce

Any program only includes data and machine instructions.
Machine instructions: user code(Text), static lib code and shared lib code. We can disassemble program at any program memory. 
Data: initialized data, uninitialized data (bss), Heap(malloc arena), stack and registers. 
Commonly, the code(machine instruction) won’t change when running a program. It means if we track data, we can know the all program status according to machine instruction. This means gdb x command can be a general tool for program debug.

Virtual memory

Modern operating systems and architectures provide virtual memory. Through hardware support and additional code in the operating system, virtual memory allows each user process to act as though it is the only thing running on the computer. It gives each process a completely separate address space.

Through pmap, user can check process' memory map, such as:

$ pmap 23814  # If you don't have pmap installed, use 'cat /proc/23814/maps'

Process memory layout

It is for 64-bit systems. On 32-bit systems the shared libraries are usually found at the lowest address, followed by the text segment, then everything else.

The text, or code, segment contains the actual program and any statically linked libraries. On 64-bit systems it generally starts at 0x400000 (32-bit systems like to place it at 0x8047000).

The data and BSS segments come next. The data segment contains all initialized global variables as well as static strings (eg, those used in printf). The BSS, or "block started by segment" region holds all uninitialized global variables (those which by C convention are initialized automatically to 0).

After that comes the heap, where all memory obtained via malloc() is located. The heap grows upwards as more memory is requested.

Then we have any shared libraries such as the loader, libc, malloc, etc.

Finally we have the stack, which it should be noted grows downward as it expands.

Downward-growing stack

Programs use the stack to store temporary, or automatic variables, arguments passed during a function call, and information needed to return to a previous point in the program when a function call is made.

There are also three registers that are important at this point - RBP, RSP, and RIP. RSP is the stack pointer. The stack works just like a LIFO queue with push() and pop() operations. RSP tracks the next available position in this queue.

The stack frame is essentially the region of the stack that is presently active, or that corresponds to the presently executing function. It is pointed to by RBP, the "base pointer," and is used in combination with an offset to reference all local variables. Every time a function is called, RBP is updated to point to its stack frame.  

RIP is the instruction pointer. It holds the address of the instruction that the CPU just loaded and is presently executing.

The diagram above shows a snapshot of the stack for a program that is presently in func1(), which was called from main(). In order for the stack to look this way, some things must have happened when func1() was called. These steps are defined by the C calling convention. [2]

1. The arguments to func1() were pushed onto the stack.
2. func1() was called.
3. RBP was pushed onto the stack.
4. RSP was moved to RBP.
5. Space for the local variables was allocated on the stack.
6. Local variables were set to initial values (if provided).

Steps 3 through 6 are called the function prelude.

图3 带有基指针的栈帧

System calls

The only way to enter supervisor mode is to go through predefined entry points in the kernel. One of these points is called a system call.

How system calls work on x86_64 Linux by taking a look at the kernel source, specifically arch/x86_64/kernel/entry.S where we see the following comment...

/*

 * System call entry. Upto 6 arguments in registers are supported.

 *

 * SYSCALL does not save anything on the stack and does not change the

 * stack pointer.

 */

                              

/*

 * Register setup: 

 * rax  system call number

 * rdi  arg0

 * rcx  return address for syscall/sysret, C arg3

 * rsi  arg1

 * rdx  arg2        

 * r10  arg3        (--> moved to rcx for C)

 * r8   arg4

 * r9   arg5

 * r11  eflags for syscall/sysret, temporary for C

 * r12-r15,rbp,rbx saved by C code, not touched.                          

 *

 * Interrupts are off on entry.

 * Only called from user space.

 *

 * XXX   if we had a free scratch register we could save the RSP into the stack frame

 *      and report it properly in ps. Unfortunately we haven't.

 */ 

So to make a system call, you first store the syscall number in RAX, any parameters in RDI, RSI, RDX, etc, and then execute the "syscall" instruction.

图4 64位模式下通用寄存器的大小和名称

图5 在传统和兼容模式下的通用寄存器大小和名称

通用寄存器在编程时通常用于不同的用途,说明如表1所示。

表1 通用寄存器的用途

寄存器名

用途

EAX

累加器

EBX

基址寄存器

ECX

计数器

EDX

数据寄存器

ESI

源地址指针寄存器

EDI

目的地址指针寄存器

EBP

基址指针寄存器

ESP

堆栈指针寄存器

待续

参考文献:Buffer Overflows and You (上)GDB munualX86-64构架概述X86-64处理器构架的应用程序二进制接口

作者:zhenjing.chen 
出处:http://www.cnblogs.com/zhenjing/ 
未注明转载的文章,版权归作者所有,欢迎转载,但未经作者同意必须保留此段声明,且在文章页面明显位置给出原文连接,否则保留追究法律责任的权利。

转载于:https://www.cnblogs.com/xumaojun/p/8533130.html

[Advance] How to debug a program (上)相关推荐

  1. Idea在debug时打上断点没有用 Skipped breakpoint at ... because it happened inside debugger evaluation

    问题: 最近项目上调试多线程bug时发现奇怪问题:用Idea打上条件断点debug时没有用.直接跳过并提示:Skipped breakpoint at ... because it happened ...

  2. Debug:Program received signal SIGSEGV,Segmentation fault.

    错误描述 最近在用Dev-C++编写程序时,程序能够编译运行,奈何本该接收键盘输入数据的程序,运行之后直接显示"请按任意键继续-"字样结束运行了.调试后发现了报错信息"P ...

  3. NLog文章系列——入门教程(上)

    作者:Jarosław Kowalski <jaak@jkowalski.net> 翻译:Dflying Chen:http://dflying.cnblogs.com/ 原文:http: ...

  4. VC++调试程序、快捷键以及Debug版本与Release版本

    1.如何在Release状态下进行调试 Project->Setting=>ProjectSetting对话框,选择Release状态.C/C++标签中的Category选General, ...

  5. 全志A10平板上的ubuntu终极安装版,支持HDMI和平板本机LCD

    转自 http://hi.baidu.com/lang_byebye/item/429f3507922eb1c4905718d7 这个ubuntu系统仍然是安装在miniSD卡中.与原andorid系 ...

  6. DEBUG命令大全及实例(有改动)

    Debug   启动 Debug,它是可用于测试和调试 MS-DOS 可执行文件的程序.   Debug [[drive:][path] filename [parameters]]   参数   [ ...

  7. Debug-PC之开山老祖

    Debug 原意是杀虫子.这里是机器调试工具. 其实,Debug的由来,还有一则趣闻,在早期美国的一计算机房中,科学家正在紧张的工作.同时,许多台大型的计算机也在不停的运行着.大概是由于,机器过热,引 ...

  8. Debug-PC之开山老祖[转]

    http://tb.blog.csdn.net/TrackBack.aspx?PostId=496060 Debug 原意是杀虫子.这里是机器调试工具. 其实,Debug的由来,还有一则趣闻,在早期美 ...

  9. [转贴]改好DEBUG七处缺点的comexe实现报告

    改好DEBUG七处缺点的comexe实现报告 首先声明,本文所指DEBUG,系DOS 6.22,win 98及2k三者DEBUG.EXE. (1) DEBUG命令T及P的2处缺点 跟踪命令T,建立在8 ...

最新文章

  1. 蓝凌ekp开发_蓝凌EKP8.0产品白皮书(简).doc
  2. 日常工作,怎么结合工具设计有效的时间管理?
  3. AIX系统日志学习笔记之三
  4. 计算机上开发和运行应用程序与操作系统无关,计算机期中测验理论部分和答案-C.doc...
  5. 成绩表格模板MySQL_Mysql数据统计脚本模板
  6. storm消息可靠机制(ack)的原理和使用
  7. S7503E V7 snmpv3典型组网配置案例(与IMC联动)
  8. Battery Health 3 for Mac(电池健康管理工具)
  9. oracle pl/sql发送邮件多个收件人问题
  10. lwip协议栈源码分析之pbuf
  11. 【乌拉喵.教程】PCtoLCD2002作为LCD5110字模提取软件的使用方法
  12. 如何在HBuilder里面运行php文件
  13. cobalt strik启动
  14. 青云诀2显示登录服务器超时,青云诀2游戏突然显示数据包损坏怎么办 解决方案分享...
  15. 通过css实现图片过渡放大的效果
  16. vmware虚拟机中ubuntu如何连接校园网netkeeper
  17. SPSSModeler的下载与安装
  18. oracle数据库管理员账户锁定,Oracle数据库账号被锁定解决方法
  19. 关于Microsofe Visio,如果设置图层大小与绘图适应?
  20. 工业工程运用计算机,工业工程如何面对挑战-精选.doc

热门文章

  1. docker :open /var/lib/docker/tmp/GetImageBlob318829910: no such file or directory异常解决
  2. 2022-2028年中国自主可控行业深度调研及投资前景预测报告(全卷)
  3. 非本地类型不能定义方法 cannot define new methods on non-local type time.Duration
  4. 2022-2028年中国软件测试行业市场研究及前瞻分析报告
  5. 如何优雅的在python中暂停死循环?
  6. 认清自己,就能活出更好的自己
  7. RuntimeError: Expected object of device type cuda but got device type cpu for argument pytorch数据位置
  8. pycharm设置编写的脚本页面长行实现自动换行(windows版)
  9. PyTorch官方中文文档:torch.optim 优化器参数
  10. LeetCode中等题之最简分数