Gdb: Difference between revisions
(119 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
== References == |
== References == |
||
* [https://www.gnu.org/software/gdb/documentation/ '''GDB: The GNU Project Debugger -- Documentation'''] |
|||
* [http://www.unknownroad.com/rtfm/gdbtut/gdbsegfault.html 7.2 Example Debugging Session: Segmentation Fault Example] (unkownroad.com) |
* [http://www.unknownroad.com/rtfm/gdbtut/gdbsegfault.html 7.2 Example Debugging Session: Segmentation Fault Example] (unkownroad.com) |
||
* [http://www.delorie.com/gnu/docs/gdb/gdb_toc.html Debugging with GDB] (delorie.com) |
* [http://www.delorie.com/gnu/docs/gdb/gdb_toc.html Debugging with GDB] (delorie.com) |
||
Line 5: | Line 6: | ||
* [http://wwwcsif.cs.ucdavis.edu/~davis/30/gdb_Tutorial.htm#Sect4_2 An Interactive Guide to Faster, Less Frustrating Debugging] |
* [http://wwwcsif.cs.ucdavis.edu/~davis/30/gdb_Tutorial.htm#Sect4_2 An Interactive Guide to Faster, Less Frustrating Debugging] |
||
* [http://heather.cs.ucdavis.edu/~matloff/debug.html Norm Matloff's Debugging Tutorial] (also guide to fast editing and gui debuggers DDD, GVD) |
* [http://heather.cs.ucdavis.edu/~matloff/debug.html Norm Matloff's Debugging Tutorial] (also guide to fast editing and gui debuggers DDD, GVD) |
||
* [https://blogs.oracle.com/ksplice/entry/8_gdb_tricks_you_should 8 gdb tricks you should know] (oracle.com) |
|||
* [https://news.ycombinator.com/item?id=12101347 "Give me 15 minutes and I'll change your view of GDB" (video) (undo.io)] (Hacker News) |
|||
:* [https://www.gnu.org/software/gdb/news/reversible.html GDB news - reversible debug]. |
|||
== GDB front-ends == |
=== GDB front-ends === |
||
There are several types of front-end for GDB: |
|||
Reference list: |
|||
* GDB built-in front-end (using TUI, Text User Interface) |
|||
* Front-end that customizes heavily {{file|.gdbinit}} file. |
|||
* Front-end using the newer MI2 interface. |
|||
* Front-end using the older MI interface. |
|||
More front-ends: |
|||
* [https://sourceware.org/gdb/wiki/GDB%20Front%20Ends GDB Front Ends] |
* [https://sourceware.org/gdb/wiki/GDB%20Front%20Ends GDB Front Ends] |
||
* [http://stackoverflow.com/questions/209534/how-to-highlight-and-color-gdb-output-during-interactive-debugging How to highlight and color gdb output during interactive debugging?] |
* [http://stackoverflow.com/questions/209534/how-to-highlight-and-color-gdb-output-during-interactive-debugging How to highlight and color gdb output during interactive debugging?] |
||
* See [https://news.ycombinator.com/item?id=12101347 HN post]. |
|||
* See list on [https://github.com/mozilla/rr/wiki/Using-rr-in-an-IDE Mozilla rr] |
|||
;GDB built-in |
|||
Ideally front-ends must use the GDB/MI2 interface. |
|||
* '''GDB TUI''' — GDB own Text User Interface ({{kb|C-x C-a}}: http://davis.lbl.gov/Manuals/GDB/gdb_21.html) |
|||
;gdbinit front-end |
|||
=== Good candidates === |
|||
* '''{{red|NEW}}''' '''[https://github.com/hugsy/gef Gef]''' |
|||
Mainly from SO post above: |
|||
* '''[https://github.com/cyrus-and/gdb-dashboard GDB Dashboard]''' ([https://github.com/xeyownt/gdb-dashboard my fork]) |
|||
* [http://reverse.put.as/gdbinit/ .gdbinit] |
|||
* [https://github.com/ |
* '''[https://github.com/snare/voltron voltron]''' |
||
* [https://github.com/zachriggle/pwndbg pwndbg] |
* '''[https://github.com/zachriggle/pwndbg pwndbg]''' |
||
* '''[http://reverse.put.as/gdbinit/ .gdbinit]''' |
|||
* [https://github.com/snare/voltron voltron] |
|||
; IDE with gdb integration |
|||
Vim: |
|||
* https:// |
* '''{{red|HOT}} VSCode''' https://code.visualstudio.com/docs/cpp/cpp-debug |
||
* '''{{red|HOT}} Emacs''' https://www.gnu.org/software/emacs/manual/html_node/emacs/Debuggers.html |
|||
: Emacs is frequently touted as having the best GDB integration (better than DDD for instance) [https://news.ycombinator.com/item?id=12101347]. Must check for some tutorials [https://undo.io/resources/gdb-watchpoint/using-gdb-emacs/]. |
|||
* '''CLion''' https://www.jetbrains.com/clion/ |
|||
* '''QtCreator''' http://doc.qt.io/qtcreator/ |
|||
* '''Eclipse''' |
|||
;MI2 front-ends |
|||
* '''{{red|NEW}} gdbgui''' https://www.gdbgui.com/ |
|||
: Seen on [https://github.com/mozilla/rr/wiki/Using-rr-in-an-IDE Mozilla rr]. |
|||
* (Neovim) '''[https://github.com/critiqjo/lldb.nvim lldb.nvim]''' |
|||
: Only for LLDB. |
|||
* (Vim) '''[https://github.com/vim-scripts/Conque-GDB Conque-GDB]''' |
|||
:Use old shell implementation, not the new neovim terminal. |
:Use old shell implementation, not the new neovim terminal. |
||
* http://pyclewn.sourceforge.net/ |
* (Vim) '''[http://pyclewn.sourceforge.net/ pyclewn]''' |
||
:Read some comment it was not convenient, and would require extra plugin like https://github.com/notEvil/vim-debug. |
:Read some comment it was not convenient, and would require extra plugin like https://github.com/notEvil/vim-debug. |
||
; MI front-end (old interface) |
|||
Neovim: |
|||
* '''{{red|HOT}}''' '''[http://cgdb.github.io/ cgdb]''' (text mode with vi keymap) |
|||
* https://github.com/critiqjo/lldb.nvim |
|||
* '''[http://www.gnu.org/software/ddd/ DDD]''' |
|||
: Only for LLDB. |
|||
* '''[http://www.kdbg.org/index.php KDBG]''' |
|||
=== GDBINIT Hacks === |
|||
=== GDBInit hacks === |
|||
* Using [https://github.com/nojhan/colout/blob/master/colout/example.gdbinit colout] one may color almost any gdb output. |
* Using [https://github.com/nojhan/colout/blob/master/colout/example.gdbinit colout] one may color almost any gdb output. |
||
=== |
=== Tools and libraries === |
||
* [https://www.capstone-engine.org/ '''Capstone'''], the ultimate disassembly engine. |
|||
These are using the old "annotation" mechanisms: |
|||
* [http://www.gnu.org/software/ddd/ DDD] |
|||
=== Tutorials === |
|||
* [http://www.kdbg.org/index.php KDBG] |
|||
* [https://www.youtube.com/watch?v=PorfLSr3DDI CppCon 2015: Greg Law " Give me 15 minutes & I'll change your view of GDB"] |
|||
* [http://cgdb.github.io/ cgdb] (text mode with vi keymap) |
|||
: GDB TUI mode, python, reverse debugging (with <code>record</code>). |
|||
* {{red|TO WATCH}} [https://www.youtube.com/watch?v=-n9Fkq1e6sg CppCon 2016: Greg Law “GDB - A Lot More Than You Knew"] |
|||
== Alternatives to GDB == |
|||
=== [[rr]] === |
|||
[[rr]] is the '''Record and Replay Framework''', developped by '''Mozilla''', and is a replacement for and powerful enhancement of gdb. |
|||
See [[rr]] for more information. |
|||
=== [https://lldb.llvm.org/ LLDB] === |
|||
{{todo|Have a look at LLDB}} |
|||
The debugger from Apple, mostly compatible with GDB with more powerful feature, but for LLVM/CLang only. |
|||
=== No debugger === |
|||
Some people don't use / like debuggers: |
|||
* [https://lwn.net/2000/0914/a/lt-debugger.php3 Linus Torvalds doesn't like debuggers] |
|||
: ''I don't like debuggers. Never have, probably never will. I use gdb all the time, but I tend to use it not as a debugger, but as a disassembler on steroids that you can program.'' |
|||
* [https://lemire.me/blog/2016/06/21/i-do-not-use-a-debugger/ Daniel Lemire - I do not use a debugger] |
|||
* [https://blog.jgc.org/2007/01/tao-of-debugging.html John Graham-Cumming - The Tao of Debugging] |
|||
: ''The most effective debugging tools are: your brain, a unit test, and the print statement.'' |
|||
: ''Any programmer not writing unit tests for their code in 2007 should be considered a pariah (*).'' |
|||
== GDB configuration == |
== GDB configuration == |
||
Line 54: | Line 104: | ||
set height 0 |
set height 0 |
||
</source> |
</source> |
||
=== Install Python modules for GDB === |
|||
From pwndbg [https://github.com/pwndbg/pwndbg.git]: |
|||
<source lang=bash> |
|||
# Find the Python version used by GDB. |
|||
PYVER=$(gdb -batch -q --nx -ex 'pi import platform; print(".".join(platform.python_version_tuple()[:2]))') |
|||
PYTHON=$(gdb -batch -q --nx -ex 'pi import sys; print(sys.executable)') |
|||
PYTHON="${PYTHON}${PYVER}" |
|||
# Find the Python site-packages that we need to use so that |
|||
# GDB can find the files once we've installed them. |
|||
SITE_PACKAGES=$(gdb -batch -q --nx -ex 'pi import site; print(site.getsitepackages()[0])') |
|||
# or to install in user mode: |
|||
SITE_PACKAGES=$(gdb -batch -q --nx -ex 'pi import site; print(site.getusersitepackages())') |
|||
# Install Python dependencies |
|||
sudo ${PYTHON} -m pip install --target ${SITE_PACKAGES} -Ur requirements.txt |
|||
</source> |
|||
Example of file {{file|requirements.txt}} |
|||
pip |
|||
pycparser |
|||
psutil>=3.1.0 |
|||
== Front-ends == |
|||
=== Gef === |
|||
* [https://github.com/hugsy/gef Gef on GitHub] |
|||
* [https://gef.readthedocs.io/en/master/ Gef Docs] |
|||
Simple install script [https://github.com/hugsy/gef]: |
|||
<source lang="bash"> |
|||
# via the install script |
|||
$ wget -q -O- https://github.com/hugsy/gef/raw/master/scripts/gef.sh | sh |
|||
# manually |
|||
$ wget -O ~/.gdbinit-gef.py -q https://github.com/hugsy/gef/raw/master/gef.py |
|||
$ echo source ~/.gdbinit-gef.py >> ~/.gdbinit |
|||
</source> |
|||
'''{{red|Dependencies}}''': |
|||
* Install '''keystone-engine''', NOT keystone (fix error <code>AttributeError: 'module' object has no attribute 'KS_ARCH_X86'</code>): |
|||
<source lang="bash"> |
|||
pip3 install --no-binary keystone-engine keystone-engine |
|||
</source> |
|||
'''{{red|Tips}}''': |
|||
* Gef may work with a second pane in <code>tmux</code>. This uses the <code>tmux-setup</code> / <code>tmux-start</code> commands, but the most reliable is to identify the PTS device with <code>tmux list-panes -a -F '#D #{pane_tty}'</code>, and then assign the appropriate device to variable <code>redirect</code> in {{file|~/.gef.rc}}. |
|||
<source lang="bash"> |
|||
tmux list-panes -F '#D #{pane_tty}' |
|||
# %0 /dev/pts/9 |
|||
# %1 /dev/pts/10 |
|||
</source> |
|||
* Use <code>context</code> (or alias <code>ctx</code>) to re-print gef context. |
|||
* Use <code>db &myvar</code> to dump (byte) at some variable location. |
|||
=== GDB dashboard === |
=== GDB dashboard === |
||
Line 80: | Line 187: | ||
Alternative styles: |
Alternative styles: |
||
* [https://github.com/daveyarwood/gruvbox-pygments gruvbox] |
* [https://github.com/daveyarwood/gruvbox-pygments gruvbox] |
||
;Issues |
|||
* <code>Cannot write the dashboard: [Errno 10] No child process</code> [https://github.com/cyrus-and/gdb-dashboard/issues/56 issue #56] |
|||
:This is a bug in the gdb vendor. Fix file {{file|platform.py}}: |
|||
<source lang="diff"> |
|||
@@ -1009,7 +1009,10 @@ |
|||
except (AttributeError,os.error): |
|||
return default |
|||
output = string.strip(f.read()) |
|||
- rc = f.close() |
|||
+ try: |
|||
+ rc = f.close() |
|||
+ except: |
|||
+ rc = 0 |
|||
if not output or rc: |
|||
return default |
|||
else: |
|||
</source> |
|||
* <code>Cannot write the dashboard: Invalid character '?' in expression.</code> |
|||
: Likely due to incorrect character in <code>info reg</code>. The register is evaluated via <code>parse_and_eval</code> |
|||
=== Voltron === |
|||
See [[Voltron]]. |
|||
=== GDB TUI === |
|||
Some example inspired from [https://www.youtube.com/watch?v=PorfLSr3DDI Greg Law video on Youtube]: |
|||
* Start TUI mode with {{kb|Ctrl-X A}}, or <code>gdb --tui</code>, or <code>tui enable</code>. |
|||
* {{kb|Ctrl-L}} repaint the screen. |
|||
* {{kb|Ctrl-X 2}} to split the layout (repeat to cycle through layout), or <code>layout split</code>. |
|||
* {{kb|Ctrl-X 1}} for back to source-only or disas-only layout (depending on the split-layout we left). |
|||
* <code>tui reg float</code> for instance to change the register view. |
|||
Note that while in TUI mode, up and down keys will scroll the source code: |
|||
* {{kb|Up}} / {{kb|Down}} scroll the source code. |
|||
* {{kb|Ctrl-P}} / {{kb|Ctrl-N}} Previous or Next command in command history. |
|||
* {{kb|Ctrl-O}} switch focus ({{red|does not work for me}}). |
|||
=== CGDB === |
|||
Probably '''one of the best lightweight front-end''' out there, with vim-like bindings. |
|||
* {{kb|Esc}} or {{kb|i}} to switch between modes. |
|||
* {{kb|Ctrl-W}} to switch between vertical / horizontal layout (requires latest cgdb from source). |
|||
* {{kb|j}}, {{kb|k}}, {{kb|Ctrl-D}}, {{kb|Ctrl-U}} to navigate (and more vim-like bindings as well) |
|||
* {{kb|Space}} to set a breakpoint, {{kb|t}} for a temporary breakpoint |
|||
* {{kb|/}} and {{kb|n}} for search and next |
|||
* <code>:help</code> for help (when in cgdb mode) |
|||
* <code>:set dis</code> or <code>:set nodis</code> to view disassembly. |
|||
cgdb works nicely with '''[[rr]]''': |
|||
<source lang="bash"> |
|||
rr replay -d cgdb |
|||
</source> |
|||
See [[rr]] page for more options. |
|||
== Prepare debug session == |
== Prepare debug session == |
||
Line 102: | Line 263: | ||
== GDB commands == |
== GDB commands == |
||
Reference |
=== Reference === |
||
* GDB manual |
* GDB manual |
||
* https://beej.us/guide/bggdb/ |
* https://beej.us/guide/bggdb/ |
||
Line 113: | Line 274: | ||
|width=150pt| |
|width=150pt| |
||
<code>b [+-][NUMBER]</code><br/> |
<code>b [+-][NUMBER]</code><br/> |
||
<code>break [+-][NUMBER]</code> |
<code>break [+-][NUMBER]</code><br/> |
||
<code>b LOCATION</code><br/> |
|||
<code>break LOCATION</code> |
|||
| |
| |
||
Set a breakpoint at current line, at given line ''NUMBER'' or ''NUMBER'' lines after/before current line. |
Set a breakpoint at current line, at given line ''NUMBER'' or ''NUMBER'' lines after/before current line. |
||
|- |
|||
Set breakpoint at ''LOCATION''. |
|||
| |
|||
<code>b LOCATION</code><br/> |
|||
<code>break LOCATION</code><br/> |
|||
|Set breakpoint at ''LOCATION''. |
|||
:<code>b main</code> sets a breakpoint at beginning of function <code>main()</code>. |
:<code>b main</code> sets a breakpoint at beginning of function <code>main()</code>. |
||
:<code>b foo.c:42</code> sets a breakpoint at file {{file|foo.c}}, line 42. |
:<code>b foo.c:42</code> sets a breakpoint at file {{file|foo.c}}, line 42. |
||
:<code>b foo.c:loop</code> sets a breakpoint at file {{file|foo.c}}, label <code>loop:</code>. |
|||
:<code>b *0x400120</code> sets a breakpoint at address 0x400120. |
|||
|- |
|||
|width=150pt| |
|||
<code>hb [+-][NUMBER]</code><br/> |
|||
<code>hb LOCATION</code><br/> |
|||
| |
|||
Set a HARDWARE breakpoint at current line, at given line ''NUMBER'' or ''NUMBER'' lines after/before current line. |
|||
Set a HARDWARE breakpoint at ''LOCATION''. |
|||
|- |
|||
|width=150pt| |
|||
<code>b LOC if COND</code> |
|||
| |
|||
Set a conditional breakpoint. |
|||
:<code>b main.c:24 if a == 2</code> Set a breakpoint at <code>main.c:24</code> when var <code>a</code> is 2. |
|||
|- |
|||
|width=150pt| |
|||
<code>tb [+-][NUMBER]</code><br/> |
|||
<code>tbreak [+-][NUMBER]</code><br/> |
|||
<code>tb LOCATION</code><br/> |
|||
<code>tbreak LOCATION</code> |
|||
| |
|||
Idem, but ''temporary'' breakpoint. |
|||
|- |
|||
| |
|||
<code>ignore N COUNT</code> |
|||
|Set ignore-count of breakpoint N to COUNT |
|||
|- |
|||
| |
|||
<code>command [BKPIP]</code><br/> |
|||
<code>...</code><br/> |
|||
<code>end</code><br/> |
|||
|Set commands to execute when a breakpoint is hit |
|||
|- |
|- |
||
| |
| |
||
Line 139: | Line 333: | ||
|- |
|- |
||
| |
| |
||
<code>cl LOCATION</code><br/> |
<code>cl [LOCATION]</code><br/> |
||
<code>clear LOCATION</code> |
<code>clear [LOCATION]</code> |
||
|Clear breakpoint by ''LOCATION'' |
|Clear breakpoint by ''LOCATION'', or current line |
||
|- |
|- |
||
| |
| |
||
Line 156: | Line 350: | ||
<code>dis NUMBER</code><br/> |
<code>dis NUMBER</code><br/> |
||
<code>disable NUMBER</code> |
<code>disable NUMBER</code> |
||
|Disable breakpoint by ''NUMBER'' (as listed by <code>i b</code>) |
|Disable breakpoint by ''NUMBER'' (as listed by <code>i b</code>). |
||
<code>dis $bpnum</code> to disable last bkp (useful for {{file|.gdbinit}}). |
|||
|- |
|- |
||
| |
| |
||
Line 162: | Line 357: | ||
<code>save breakpoints FILE</code> |
<code>save breakpoints FILE</code> |
||
|Save current breakpoints as script FILE. Use <code>source</code> to reload. |
|Save current breakpoints as script FILE. Use <code>source</code> to reload. |
||
|- |
|||
!colspan=2|Registers |
|||
|- |
|||
|width=150pt| |
|||
<code>i r</code><br/> |
|||
<code>info reg</code> |
|||
| |
|||
List integer registers |
|||
|} |
|} |
||
Line 173: | Line 376: | ||
| |
| |
||
Start (or restart) program. Arguments may include wildcards (*) and redirections (<, <<...) |
Start (or restart) program. Arguments may include wildcards (*) and redirections (<, <<...) |
||
|- |
|||
| |
|||
<code>start [ARGS]</code><br/> |
|||
|Start and break on <code>main</code> |
|||
|- |
|- |
||
| |
| |
||
Line 187: | Line 394: | ||
<code>step [NUMBER]</code> |
<code>step [NUMBER]</code> |
||
|Step (''into'') current line, or ''NUMBER'' lines. |
|Step (''into'') current line, or ''NUMBER'' lines. |
||
|- |
|||
| |
|||
<code>si [NUMBER]</code><br/> |
|||
<code>stepi [NUMBER]</code> |
|||
|Step one (assembly) '''instruction''' exactly (''N'' times) |
|||
|- |
|- |
||
| |
| |
||
Line 192: | Line 404: | ||
<code>next</code> |
<code>next</code> |
||
|Run to next line (''over'' current line) |
|Run to next line (''over'' current line) |
||
|- |
|||
| |
|||
<code>ni [NUMBER]</code><br/> |
|||
<code>nexti [NUMBER]</code> |
|||
|Step one (assembly) '''instruction''' exactly (''N'' times), but over subroutine calls. |
|||
|- |
|- |
||
| |
| |
||
Line 199: | Line 416: | ||
|- |
|- |
||
| |
| |
||
<code>adv LOCATION</code> |
|||
<code>advance LOCATION</code> |
<code>advance LOCATION</code> |
||
|Run until temporary breakpoint set at ''LOCATION''. |
|Run until temporary breakpoint set at ''LOCATION''. |
||
|- |
|- |
||
| |
| |
||
<code>u [NUMBER]</code><br/> |
|||
<code>until [NUMBER]</code> |
<code>until [NUMBER]</code> |
||
|Execute until the program reaches a source line greater than current. |
|Execute until the program reaches a source line greater than current ({{red|very handy}}). |
||
|} |
|} |
||
Line 212: | Line 431: | ||
|- |
|- |
||
|width=150pt| |
|width=150pt| |
||
<code>bt</code><br/> |
|||
<code>bt [COUNT]</code><br/> |
<code>bt [COUNT]</code><br/> |
||
<code>backtrace [COUNT]</code><br/> |
<code>backtrace [COUNT]</code><br/> |
||
Line 236: | Line 454: | ||
{| width=100% |
{| width=100% |
||
|- |
|- |
||
!colspan=2|View memory |
!colspan=2|View / Change memory |
||
|- |
|- |
||
|width=150pt| |
|width=150pt| |
||
<code>disp/FMT EXPR</code><br/> |
|||
<code>disp EXPR</code><br/> |
<code>disp EXPR</code><br/> |
||
<code>display EXPR</code> |
<code>display EXPR</code> |
||
| |
| |
||
Display EXPR at each prompt (if within scope). |
Display ''EXPR'' at each prompt (if within scope), using format FMT. If omitted, '''reuse last format FMT''' (of last <code>x</code>, <code>p</code> or <code>disp</code>). |
||
Use <code>*ptr</code> to display pointed data, and <code>*ptr@LEN</code> to display array of LEN elements. |
|||
|- |
|||
|width=150pt| |
|||
<code>x[/nfu] ADDR</code><br/> |
|||
<code>x/8zb ADDR</code><br/> |
|||
<code>x/16i ADDR</code><br/> |
|||
<code>x/16zw ADDR</code> |
|||
| |
|||
Examine ''n'' memory locations at ADDR, format as ''n'' (<code>xduotacfszr</code>) and unit size ''u'' (<code>bhwg</code>). If omitted, '''reuse last format FMT'''. |
|||
'''Pro tip: Use always format 'z', not 'x''''. |
|||
* <code>/i</code> for a single assembly instruction. <code>/16i</code> for 16 instructions |
|||
* <code>/8zb</code> for 8 bytes with '''leading zeroes'''. |
|||
* <code>/16zw</code> for 16 32-bit words with '''leading zeroes'''. |
|||
* <code>x/16i $PC</code> use current PC addr |
|||
|- |
|- |
||
| |
| |
||
<code>i |
'''<code>i lo</code>'''<br/> |
||
<code>i args</code><br/> |
<code>i args</code><br/> |
||
<code>info locals</code><br/> |
<code>info locals</code><br/> |
||
Line 252: | Line 485: | ||
|- |
|- |
||
| |
| |
||
<code>p/FMT EXPR</code><br/> |
|||
<code>p EXPR</code><br/> |
|||
<code>print EXPR</code> |
<code>print EXPR</code> |
||
|print ''EXPR'', using format FMT. If omitted, '''reuse last format FMT''' (of last <code>x</code>, <code>p</code> or <code>disp</code>). |
|||
|print ''EXPR''. |
|||
Use <code>*ptr</code> to display pointed data, and <code>*ptr@LEN</code> to display array of LEN elements. '''Pro tip: Use always format 'z', not 'x'''' |
|||
|- |
|- |
||
| |
| |
||
<code>undisplay NUMBER</code> |
<code>undisplay NUMBER</code> |
||
|Undisplay expression by ''NUMBER''. |
|Undisplay expression by ''NUMBER''. |
||
|- |
|||
| |
|||
<code>set VARIABLE=VALUE</code> |
|||
|Write VALUE in VARIABLE (this modifies memory). |
|||
|} |
|} |
||
Line 271: | Line 511: | ||
|- |
|- |
||
| |
| |
||
<code> |
<code>disas /m</code><br/> |
||
<code> |
<code>disas /s</code><br/> |
||
<code> |
<code>disas 'foo.c'::bar</code><br/> |
||
<code>disas main,+40</code><br/> |
|||
<code>disas $pc,+40</code><br/> |
|||
<code>disassemble</code> |
|||
|Disassemble a specified section of memory |
|Disassemble a specified section of memory |
||
|} |
|} |
||
Line 298: | Line 541: | ||
|{{kb|RETURN}} |
|{{kb|RETURN}} |
||
|Repeat last command. |
|Repeat last command. |
||
|- |
|||
| |
|||
<code>add-symbol-file FILE ADDR</code> |
|||
|Load symbols from ''FILE'' at given ''ADDR''. |
|||
|} |
|} |
||
</div> |
</div> |
||
;Keyboard shortcuts |
|||
* {{kb|C-L}} — Clear the screen (useful when pgm writes to the screen). |
|||
=== Dump / restore === |
|||
For instance, to load an Intel hex file: |
|||
<source lang=text> |
|||
restore firmware.hex |
|||
</source> |
|||
To generate the same file: |
|||
<source lang=text> |
|||
dump ihex memory firmware.hex 0x00400000 0x0040FFFF |
|||
</source> |
|||
Note that <code>dump</code> can also save the result of an expression: |
|||
<code>dump [format] memory ''filename'' ''start_addr'' ''end_addr''</code> |
|||
<code>dump [format] value ''filename'' ''expr''</code> |
|||
=== Reverse debugging === |
|||
;References |
|||
* [https://sourceware.org/gdb/current/onlinedocs/gdb/Reverse-Execution.html GDB Manual - Reverse execution] |
|||
* [https://sourceware.org/gdb/current/onlinedocs/gdb/Process-Record-and-Replay.html#Process-Record-and-Replay GDB Manual - Recording Inferior’s Execution and Replaying It] |
|||
* [https://news.ycombinator.com/item?id=12101347 "Give me 15 minutes and I'll change your view of GDB" (video) (undo.io)] (Hacker News) |
|||
:* [https://www.gnu.org/software/gdb/news/reversible.html GDB news - reversible debug]. |
|||
* [http://shanekirk.com/2017/10/gdb-tips-and-tricks-4-reverse-debugging/ GDB Tips and Tricks #4: Reverse Debugging] |
|||
* [https://jayconrod.com/posts/28/tutorial-reverse-debugging-with-gdb-7 Tutorial: Reverse debugging with GDB 7] |
|||
;Example session |
|||
Say we have the following program: |
|||
<source lang="c"> |
|||
void error(void) |
|||
{ |
|||
// ... |
|||
} |
|||
void foo(void) |
|||
{ |
|||
// ... |
|||
bar(); // Calls error() if something goes wrong |
|||
baz(); |
|||
// ... |
|||
} |
|||
</source> |
|||
We want to debug function <code>bar</code>. We can use gdb reverse debugging as follows: |
|||
<source lang="bash"> |
|||
gdb myprog |
|||
</source> |
|||
<source lang="python"> |
|||
# Start recording when entering 'bar' |
|||
b bar |
|||
commands |
|||
record |
|||
continue |
|||
end |
|||
# Break if error |
|||
b error |
|||
# Stop recording when we exit bar |
|||
tbp baz |
|||
record stop |
|||
continue |
|||
end |
|||
</source> |
|||
If an error occurs, we can use the following commands to walk backward (see reference for more): |
|||
* The usual step or next, but in reverse: <code>reverse-next</code> or <code>rn</code>, <code>reverse-nexti</code> or <code>rni</code>, <code>reverse-step</code> or <code>rs</code>, <code>reverse-stepi</code> or <code>rsi</code>. |
|||
* Reverse continue (until next breakpoint, or record start): <code>reverse-continue</code> or <code>rc</code>. |
|||
* Reverse finish: <code>reverse-finish</code>. |
|||
Recording may be very slow. A faster alternative is to use [[rr]]. |
|||
;Troubleshooting |
|||
* When getting the following messages: |
|||
<source lang="text"> |
|||
Process record does not support instruction 0xfae64 (xsave) |
|||
Process record does not support instruction 0xc5 at address |
|||
... |
|||
</source> |
|||
:This mostly happens when gdb encounters an instruction it cannot handle yet, such as AVX instructions. Usually it occurs when a library is loaded to detect which CPU binary to use [https://sourceware.org/legacy-ml/gdb-prs/2019-q3/msg00385.html], [https://sourceware.org/legacy-ml/gdb/2016-08/msg00029.html], [https://bugs.launchpad.net/ubuntu/+source/gdb/+bug/1573786] |
|||
:* The fix is either to run gdb with <tt>LD_BIND_NOW=1</tt> (<code>LD_BIND_NOW=1 gdb a.out</code>) to avoid exotic instruction due to <tt>ld</tt>. |
|||
:* Or compile with <tt>-static</tt> (<code>clang -g -static hello.c</code>). |
|||
* gdb prints the following message and exits: |
|||
<source lang="bash"> |
|||
# [1]+ Stopped LD_BIND_NOW=1 gdb a.out |
|||
fg |
|||
# LD_BIND_NOW=1 gdb a.out |
|||
# Process record: failed to record execution log. |
|||
# |
|||
# Program stopped |
|||
</source> |
|||
:This is simply indicative that program probably segfaulted. Do <code>fg</code> to resume gdb and start debug backward. |
|||
:This does not seem to happen when using '''cgdb'''. |
|||
* Watch point ignored when reverse executing |
|||
:When setting a watchpoint, gdb goes up to program start and prints <code>No more reverse-execution history</code>. |
|||
<source lang="bash"> |
|||
watch *(long **)0x7fffffffffd4a8 |
|||
# Hardware watch point 4: *(long **)0x7fffffffffd4a8 |
|||
rc |
|||
# Continuing |
|||
# |
|||
# No more reverse-execution history. |
|||
# main () at bubble_sort.c:34 |
|||
# 34 unsigned i = 0; |
|||
</source> |
|||
: There is a bug in gdb that makes it ignore hardware watchpoint in reverse execution... The fix [https://jayconrod.com/posts/28/tutorial-reverse-debugging-with-gdb-7], [http://shanekirk.com/2017/10/gdb-tips-and-tricks-4-reverse-debugging/] |
|||
<source lang="bash"> |
|||
set can-use-hw-watchpoints 0 |
|||
watch *(long **)0x7fffffffffd4a8 |
|||
</source> |
|||
== Python == |
|||
See [[gdb python]]. |
|||
== Tips == |
== Tips == |
||
=== Use RETURN to repeat last command === |
|||
Pressing {{kb|RETURN}} repeats all last command: |
|||
* Commands like <code>s</code> (step) or <code>n</code> (next). Very handy to step in the code. |
|||
* Command like <code>x/16w 0x10100000</code> or <code>x/16i 0x10100000</code>, each time printing 16 new words. |
|||
=== Define a custom label for breakpoint in C/C++ === |
=== Define a custom label for breakpoint in C/C++ === |
||
Say we want to set a breakpoint at a specified location in source file, but this position may move over time. The easiest is to use an <code>asm</code> statement to define the label [http://stackoverflow.com/questions/14268325/exposing-goto-labels-to-symbol-table]: |
Say we want to set a breakpoint at a specified location in source file, but this position may move over time. The easiest is to use an <code>asm</code> statement to define the label [http://stackoverflow.com/questions/14268325/exposing-goto-labels-to-symbol-table]: |
||
Line 327: | Line 695: | ||
readelf -s a.out | grep RET |
readelf -s a.out | grep RET |
||
# 41: 0804858a 0 NOTYPE LOCAL DEFAULT 13 RET |
# 41: 0804858a 0 NOTYPE LOCAL DEFAULT 13 RET |
||
</source> |
|||
=== Use <code>help</code> to test a command abbreviation === |
|||
Don't know if <code>f</code> stands for <code>finish</code> or <code>frame</code>? Just use <code>help f</code>: |
|||
<source lang=python> |
|||
help f |
|||
# Select and print a stack frame. |
|||
# ... |
|||
</source> |
|||
=== 8 gdb tricks you should know === |
|||
From https://blogs.oracle.com/ksplice/entry/8_gdb_tricks_you_should: |
|||
* Use <code>break WHERE if COND</code> |
|||
: For instance <code>break context_switch if next == init_task</code>. |
|||
* Use <code>command</code> |
|||
: This sets commands to be executed when a breakpoint is hit. For instance |
|||
<source lang=python> |
|||
b do_mmap_pgoff |
|||
# Breakpoint 1 at 0xffffffff8111a441: file mm/mmap.c, line 940. |
|||
command 1 |
|||
# Type commands for when breakpoint 1 is hit, one per line. |
|||
# End with a line saying just "end". |
|||
>print addr |
|||
>print len |
|||
>print prot |
|||
>end |
|||
</source> |
|||
* Use <code>gdb --args</code> to specify runtime arguments |
|||
<source lang=bash> |
|||
gdb --args pizzamaker --deep-dish --toppings=pepperoni |
|||
# ... |
|||
show args |
|||
# Argument list to give program being debugged when it is started is |
|||
# " --deep-dish --toppings=pepperoni". |
|||
b main |
|||
# ... |
|||
</source> |
|||
* Finding source files |
|||
: Use <code>directory</code> to add directory to search for source files. |
|||
<source lang=python> |
|||
list main |
|||
# 1192 ls.c: No such file or directory. |
|||
# in ls.c |
|||
directory ~/src/coreutils-7.4/src/ |
|||
# Source directories searched: /home/nelhage/src/coreutils-7.4:$cdir:$cwd |
|||
list main |
|||
</source> |
|||
: Use <code>set substitute-path</code> to fix absolute paths |
|||
<source lang=python> |
|||
list schedule |
|||
# 5519 /build/buildd/linux-2.6.32/kernel/sched.c: No such file or directory. |
|||
# in /build/buildd/linux-2.6.32/kernel/sched.c |
|||
set substitute-path /build/buildd/linux-2.6.32 /home/nelhage/src/linux-2.6.32/ |
|||
list schedule |
|||
</source> |
|||
* Use gcc <code>-ggdb3</code> to add MACRO's symbol as gdb macro |
|||
* Use gdb's variable like <code>$1</code> but also define your own |
|||
<source lang=python> |
|||
set $foo = 4 |
|||
p $foo |
|||
# $3 = 4 |
|||
</source> |
|||
* Use register variables |
|||
: All CPU registers are available as variables like <code>$R0</code>, <code>$R1</code>, but gdb also defines cross-architectures ones like <code>$sp</code> and <code>$pc</code>. For instance, if <code>$rsi</code> register is used to pass the 1st parameter, |
|||
<source lang=python> |
|||
break write if $rsi == 2 |
|||
</source> |
|||
* The <code>x</code> command |
|||
<source lang=python> |
|||
x/s 0xffffffff81946000 |
|||
# ffffffff81946000 <>: "root=/dev/sda1 quiet" |
|||
# Use x/i as a quick way to disassemble memory |
|||
x/5i schedule |
|||
# 0xffffffff8154804a <schedule>: push %rbp |
|||
# 0xffffffff8154804b <schedule+1>: mov $0x11ac0,%rdx |
|||
# 0xffffffff81548052 <schedule+8>: mov %gs:0xb588,%rax |
|||
# 0xffffffff8154805b <schedule+17>: mov %rsp,%rbp |
|||
# 0xffffffff8154805e <schedule+20>: push %r15 |
|||
# Print code surround current pc |
|||
x/20i $ip-40 |
|||
</source> |
|||
* Use the <code>@</code> symbol |
|||
:The following works if source code is something like <tt>int a[ 10 ] = { 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 };</tt>: |
|||
<source lang=python> |
|||
p a |
|||
# $1 = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10} |
|||
</source> |
|||
:But if the array is much bigger, or array is a C++ vector, we can use <code>@</code>: |
|||
<source lang=python> |
|||
p *&a[0]@10 |
|||
# $1 = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10} |
|||
p *&a[550]@4 |
|||
# $1 = {551, 552, 553, 554} |
|||
</source> |
|||
=== Use value history variables <tt>$</tt> === |
|||
See summary list below. For more details, check GDB manual on ''Value History''. |
|||
<source lang=python> |
|||
$1 # History value 1 |
|||
$2 # History value 2 |
|||
$ # MOST RECENT value in the history |
|||
$$ # ... and value before that |
|||
$$2 # ... and value before that |
|||
$$0 # Same as $ |
|||
$$1 # Same as $$ |
|||
</source> |
|||
For instance: |
|||
<source lang=python> |
|||
p *$ # Dereference last value |
|||
p *$.next # ... follow the chain |
|||
<RETURN> # ... again |
|||
<RETURN> # ... again |
|||
</source> |
|||
<source lang=python> |
|||
show values # Print the last 10 values. |
|||
</source> |
|||
=== Disassemble at an arbitrary address === |
|||
<code>disas FUNCTION</code> command only works for defined <code>FUNCTION</code>. To disassemble at an arbitrary address: |
|||
* <code>x/i 0x10100000</code> one instruction at address <code>0x10100000</code>, |
|||
* <code>x/16i 0x10100000</code> (16 instruction) at address <code>Ox10100000</code>, |
|||
* <code>x/16i *0x10100000</code> (16 instruction) at address contained at <code>Ox10100000</code>, |
|||
* <code>x/16i $PC</code> disassemble 16 instructions at PC. |
|||
=== Redirect STDIN === |
|||
Redirect STDIN with the <code>run</code> command: |
|||
<source lang="bash"> |
|||
gdb ./myprog |
|||
run <input.txt # Run with stdin redirected |
|||
</source> |
|||
One can also pass ''parameters'' with the <code>run</code> command (or use gdb <code>--args</code> command-line parameter). |
|||
=== Exit gdb if run fails === |
|||
Use python script |
|||
<source lang="python"> |
|||
def tryRun(): |
|||
try: |
|||
gdb.execute("run") |
|||
except: |
|||
print("Exception occured during gdb command: run") |
|||
gdb.execute("quit 1") |
|||
</source> |
|||
=== Define custom gdb command === |
|||
Define custom gdb command using <code>define</code>: |
|||
<source lang="python"> |
|||
# define in .gdbinit |
|||
define sf |
|||
where # find out where the program is |
|||
info args # show arguments |
|||
info locals # show local variables |
|||
end |
|||
</source> |
|||
See also [[Gdb python]] to define more powerful custom commands. |
|||
=== Load symbols from another ELF file === |
|||
This is useful in embedded systems where some part of the code are already loaded or stored in a ROM. When the debugger stops in a function for which it doesn't have the symbols, it cannot decode the stack and provide a correct backtrace. The fix is to load the symbols for that function using the command <code>add-symbol-file</code>. |
|||
<source lang=bash> |
|||
add-symbol-file a.out 0 # Add symbol from file a.out, at offset 0 |
|||
</source> |
|||
Typically the offset to give is the lowest address in the ELF file. |
|||
=== Modify memory content === |
|||
Use <code>set</code> [https://stackoverflow.com/questions/3305164/how-to-modify-memory-contents-using-gdb]. Simplest is to modify a variable: |
|||
<source lang="bash"> |
|||
set variable i = 10 |
|||
</source> |
|||
But GDB can write in any memory location: |
|||
<source lang="bash"> |
|||
set {int}0x83040 = 4 |
|||
set *((int *)0x83040) = 4 |
|||
</source> |
|||
=== Remote debug with <code>gdbserver</code> === |
|||
Start the gdb server on the remote machine: |
|||
<source lang="bash"> |
|||
gdbserver --multi 192.168.20.1:6666 # CLIENT_ADDR:REMOTE_PORT |
|||
</source> |
|||
This will listen on port <code>6666</code>, waiting for connection from client <code>192.168.20.1</code> |
|||
On the client, start the GDB client: |
|||
<source lang="bash"> |
|||
gdb-multiarch ./my_exec # multiarch necessary if client/server are different arch |
|||
</source> |
|||
Then, in GDB: |
|||
<source lang="text"> |
|||
target extended-remote {REMOTE_ADDR}:{REMOTE_PORT} |
|||
set remote exec-file /path/on/remote/to/my_exec |
|||
</source> |
|||
If the remote executable uses dynamic libraries, these must be sent to the client. |
|||
To accelerate the loading, copy first the libraries locally on the client, then tell GDB where to find them [https://www.fayewilliams.com/2013/01/31/gdb-unable-to-find-dynamic-linker-breakpoint-function/]: |
|||
<source lang="text"> |
|||
set sysroot /d/st/courses/training_obfu/formation_reverse/dynamic/tp/qmessage/pi_root |
|||
set solib-absolute-prefix /d/st/courses/training_obfu/formation_reverse/dynamic/tp/qmessage/pi_root |
|||
set solib-search-path /d/st/courses/training_obfu/formation_reverse/dynamic/tp/qmessage/pi_root |
|||
</source> |
|||
=== Reverse debugging === |
|||
From [https://www.youtube.com/watch?v=PorfLSr3DDI Greg Law's incredible video]. |
|||
Assume we have an application that crashes at random. We want to debug it when it crashes. |
|||
First we setup some breakpoint and start recording automatically: |
|||
<source lang="bash"> |
|||
gdb a.out |
|||
start |
|||
break main |
|||
command |
|||
record |
|||
continue |
|||
end |
|||
break _exit.c:32 # Hack. to break on exit |
|||
command |
|||
run |
|||
end |
|||
</source> |
|||
We also make sure pagination is disabled: |
|||
<source lang="bash"> |
|||
set pagination off |
|||
</source> |
|||
Now we <code>cont</code> until the code crashes. We start to '''reverse debug''' the program: |
|||
<source lang="bash"> |
|||
p $pc |
|||
# 1 = (void (*)()) 0x5e0c5d00 |
|||
x 0x5e0c5d00 |
|||
# 0x5e0c5d00 Cannot access memory at address 0x5e0c5d00 |
|||
bt |
|||
# ... the stack is corrupted. Let's step back... |
|||
reverse-stepi |
|||
# 0x00000000004806ac in main () at bubble_sort.:43 |
|||
# 43 } |
|||
# ... we're back in normal land |
|||
bt |
|||
# 0x00000000004806ac in main () at bubble_sort.:43 |
|||
disas |
|||
# 0x00000000004806ac <+135>: retq |
|||
# ... so indeed we tried to return, but likely the stack is corrupted |
|||
</source> |
|||
Let's look at our stack: |
|||
<source lang="bash"> |
|||
print $sp |
|||
# 2 = (void *) 0x7fffffffdc98 |
|||
print *(long **) 0x7fffffffdc98 |
|||
# 3 = (long *) 0x5e4c5d00 |
|||
# ... this the address we would jump to if RETQ |
|||
watch *(long **) 0x7fffffffdc98 |
|||
# Hardware watchpoint 4: *(long **) 0x7fffffffdc98 |
|||
</source> |
|||
We have '''set our hardware watchpoint, and will reverse continue!''' This will bring us to the point where the corruption occured: |
|||
<source lang="bash"> |
|||
reverse-continue |
|||
# old value = (long *) 0x5e4c5d00 |
|||
# new value = (long *) 0xffff7a36ec5 <__libc_start_main+245> |
|||
# 0x00000000004806a8 in main () at bubble_sort.:37 |
|||
print i |
|||
4 = 35 |
|||
</source> |
|||
We were indeed overwriting the stack because of array overflow! |
|||
=== Use Catch to break on syscall === |
|||
This was used by HN to debug a complex bugs in some Python profiler [https://news.ycombinator.com/item?id=22031219]: |
|||
<source lang="bash"> |
|||
# This looks for "lseek" followed by "close" without intervening "read". |
|||
set height 0 |
|||
catch syscall close |
|||
catch syscall read |
|||
catch syscall lseek |
|||
disable 1 2 |
|||
commands 2 |
|||
disable 1 2 |
|||
continue |
|||
end |
|||
commands 3 |
|||
if $rdi == 31 |
|||
enable 1 2 |
|||
continue |
|||
else |
|||
continue |
|||
end |
|||
end |
|||
</source> |
|||
=== Printing and Examing memory === |
|||
; Getting rid of octal |
|||
* By default, gdb uses '''octal''' to ''print''' char string, which is a major PITA. Use <code>p/x</code> or <code>p/a</code> to work-around that, or use ''examine'' (<code>x</code>) |
|||
; Dealing with examine default format / unit size |
|||
* The complete '''examine''' syntax is <code>x/nfu addr</code>, where ''n'' is the '''repeat count''', ''f'' is the '''format''' (same as for ''print'', ie. one of <code>x</code>, <code>d</code>, <code>u</code>, <code>o</code>, <code>t</code>, <code>a</code>, <code>c</code>, <code>f</code>, <code>s</code>), and ''u'' is the '''unit size''' (one of <code>b</code>, <code>h</code>, <code>w</code>, <code>g</code>). |
|||
* If not specified, ''examine'' uses a default value for the ''format'' (<code>x</code> initially) and ''unit size'' (<code>w</code> for ''x'' format, <code>g</code> for ''a'' format). But the '''default changes''' every time you use ''print'' or ''examine''! |
|||
=== Printing arrays and pointers (to arrays) === |
|||
Given the C code: |
|||
<source lang="c"> |
|||
uint8_t arr[8] = {0x12, 0x34, 0x56, 0x78, 0x11, 0x22, 0x33, 0x44}; |
|||
uint8_t *p = arr; |
|||
</source> |
|||
Using '''print''' or '''p''': |
|||
<source lang="python"> |
|||
# -- print: arr and *p |
|||
p arr |
|||
p *p@8 |
|||
# $1 = "\022\064Vx\021\"3D" |
|||
p/a arr |
|||
p/x arr |
|||
p/a *p@8 |
|||
p/x *p@8 |
|||
# $2 = {0x12, 0x34, 0x56, 0x78, 0x11, 0x22, 0x33, 0x44} |
|||
p *p |
|||
# $3 = 0x12 |
|||
# -- print: &arr, p and &p |
|||
p &arr |
|||
# $4 = (uint8_t (*)[8]) 0x404030 <arr> |
|||
p p |
|||
# $4 = (uint8_t *) 0x404030 <arr> "\022\064Vx\021\"3D" |
|||
p/a &arr |
|||
p/a p |
|||
# $5 = 0x404030 <arr> |
|||
p/x &arr |
|||
p/x p |
|||
# $3 = 0x404030 |
|||
p &p |
|||
# $1 = (uint8_t **) 0x404040 <p> |
|||
p/a &p |
|||
# $2 = 0x404040 <p> |
|||
p/x &p |
|||
# $3 = 0x404040 |
|||
# -- print: More cases with @ |
|||
p arr@2 |
|||
# $8 = {"\022\064Vx\021\"3D", "\000\000\000\000\001\000\000"} |
|||
p/a arr@2 |
|||
p/x arr@2 |
|||
# $9 = {{0x12, 0x34, 0x56, 0x78, 0x11, 0x22, 0x33, 0x44}, {0xa2, 0xa1, 0xb2, 0xb1, 0x0, 0x0, 0x0, 0x0}} |
|||
p p@8 |
|||
# $7 = {0x404030 <arr> "\022\064Vx\021\"3D", 0x100000001 <error: Cannot access memory at address 0x100000001>, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0} |
|||
p/a p@8 |
|||
# $8 = {0x404030 <arr>, 0x100000001, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0} |
|||
p/x p@8 |
|||
# $9 = {0x404030, 0x100000001, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0} |
|||
</source> |
|||
Using '''examine''' or '''x''': |
|||
<source lang="python"> |
|||
# -- examine: arr, &arr, p (all points to same address) |
|||
# ... We force FORMAT and UNIT SIZE to avoid relying on previous default setting. |
|||
x/xw arr |
|||
x/xw &arr |
|||
x/xw p |
|||
# 0x404030 <arr>: 0x78563412 |
|||
# ... From now on, all examine will use hexadecimal, and unit size for hex is 'word'. |
|||
x/xg arr |
|||
x/xg &arr |
|||
x/xg p |
|||
# 0x404030 <arr>: 0x4433221178563412 |
|||
# ... From now on, all examine will use hexadecimal, and unit size for hex is 'giant'. |
|||
x/xb arr |
|||
x/xb &arr |
|||
x/xb p |
|||
# 0x404030 <arr>: 0x12 |
|||
# ... From now on, all examine will use hexadecimal, and unit size for hex is 'byte'. |
|||
x/8b arr |
|||
x/8b &arr |
|||
x/8b p |
|||
# 0x404030 <arr>: 0x12 0x34 0x56 0x78 0x11 0x22 0x33 0x44 |
|||
# -- examine: address 'a' format is always SIGNED, which may gives strange results: |
|||
# ... same as 'x' if positive: |
|||
x/xw arr |
|||
x/aw arr |
|||
# 0x404030 <arr>: 0x78563412 |
|||
# ... but different is negative. This is because 'a' takes the value as an address (negative=stack). |
|||
set arr[3]=0x80 |
|||
x/aw arr |
|||
# 0x404030 <arr>: 0xffffffff80563412 |
|||
x/xw arr |
|||
# 0x404030 <arr>: 0x80563412 |
|||
# -- examine: &p (address of pointer p itself) |
|||
x/aw &p |
|||
# 0x404050 <p>: 0x404030 <arr> |
|||
# ... note again difference with 'hexadecimal' format: |
|||
x/xw &p |
|||
# 0x404050 <p>: 0x00404030 |
|||
# -- examine: bad cases (usually) |
|||
x *p |
|||
# 0x1: Cannot access memory at address 0x12 |
|||
x &p@8 |
|||
# Only values in memory can be extended with '@'. |
|||
x/xb arr |
|||
x/xb arr@8 |
|||
# 0x404030 <arr>: 0x12 |
|||
# ... @8 has no effect. One must use x/8xb |
|||
</source> |
|||
;Notes |
|||
* We can use both <code>p/x</code> and <code>p/a</code> to remove octal format. The small advantage for <code>p/a</code> is that it will also prints to nearest preceding symbol. |
|||
<source lang=python> |
|||
p/a some_struct # Address mode... but strangely it removes octal. WTF? |
|||
</source> |
|||
* <code>p</code> and <code>x</code> always default to '''last used format''', unless one is specified. The default for <code>x</code> is '''x''', and changes '''each time you use <code>x</code> or <code>print</code>''' [https://sourceware.org/gdb/onlinedocs/gdb/Memory.html#Memory]: |
|||
<source lang="python"> |
|||
</source> |
|||
=== Printing struct and pointers (to struct) === |
|||
Given the C code: |
|||
<source lang="c"> |
|||
struct { |
|||
uint16_t a; |
|||
uint16_t b; |
|||
} S = {0xa1a2,0xb1b2}, *s = &S; |
|||
</source> |
|||
Using '''print''' or '''p''': |
|||
<source lang="python"> |
|||
# -- print: S and *s |
|||
p S |
|||
p/a S |
|||
p/x S |
|||
p *s |
|||
p/a *s |
|||
p/x *s |
|||
# $9 = { |
|||
# a = 0xa1a2, |
|||
# b = 0xb1b2 |
|||
# } |
|||
# -- print: &S, s and &s |
|||
p &S |
|||
p s |
|||
# $1 = (struct {...} *) 0x404038 <S> |
|||
p/a &S |
|||
p/a s |
|||
# $2 = 0x404038 <S> |
|||
p/x &S |
|||
p/x s |
|||
# $3 = 0x404038 |
|||
p &s |
|||
# $4 = (struct {...} **) 0x404040 <s> |
|||
p/a &s |
|||
# $5 = 0x404040 <s> |
|||
p/x &s |
|||
# $6 = 0x404040 |
|||
# -- print: More cases with @ |
|||
p S@2 |
|||
p/a S@2 |
|||
p/x S@2 |
|||
# $2 = {{ |
|||
# a = 0xa1a2, |
|||
# b = 0xb1b2 |
|||
# }, { |
|||
# a = 0x0, |
|||
# b = 0x0 |
|||
# }} |
|||
p s@2 |
|||
p/a s@2 |
|||
# $2 = {0x404038 <S>, 0x0} |
|||
p/x s@2 |
|||
# $2 = {0x404038, 0x0} |
|||
</source> |
|||
Using '''examine''' or '''x''': |
|||
<source lang="python"> |
|||
# -- examine: &S, s, in hexadecimal format |
|||
x/xw &S |
|||
x/xw s |
|||
# 0x404038 <S>: 0xb1b2a1a2 |
|||
x/xg &S |
|||
x/xg s |
|||
# 0x404038 <S>: 0x00000000b1b2a1a2 |
|||
x/xb &S |
|||
x/xb s |
|||
# 0x404038 <S>: 0xa2 |
|||
# -- examine: &S, s, in address format |
|||
x/aw &S |
|||
x/aw s |
|||
# 0x404038 <S>: 0xffffffffb1b2a1a2 |
|||
x/ab &S |
|||
x/ab s |
|||
# 0x404038 <S>: 0xffffffffffffffa2 |
|||
x/ag &S |
|||
# 0x404038 <S>: 0xb1b2a1a2 |
|||
# ... This is because address is 0x00000000b1b2a1a2, and leading zeroes are removed. |
|||
# -- examine: bad cases (usually) |
|||
x/xb S |
|||
x/xb *s |
|||
# Value can't be converted to integer. |
|||
x/xb &S@8 |
|||
# Only values in memory can be extended with '@'. |
|||
x/xw s |
|||
x/xw s@4 |
|||
# 0x404040 <s>: 0x00404038 |
|||
# ... @8 has no effect. One must use x/4xw |
|||
</source> |
|||
=== Hook on quit === |
|||
From [https://stackoverflow.com/questions/4355978/get-rid-of-quit-anyway-prompt-using-gdb-just-kill-the-process-and-quit Stackoverflow]: |
|||
<source lang=python> |
|||
define hook-quit |
|||
set confirm off |
|||
end |
|||
</source> |
|||
=== Hook on stop === |
|||
Say you have a custom <code>cmd</code> that you'll like to execute whenever GDB breaks: |
|||
<source lang=python> |
|||
define hook-stop |
|||
cmd |
|||
end |
|||
</source> |
|||
=== Enable core dump and load in gdb === |
|||
Enable core dump file with ulimit: |
|||
<source lang="bash"> |
|||
ulimit -c unlimited |
|||
</source> |
|||
Check they are well enabled: |
|||
<source lang="bash"> |
|||
ulimit -a |
|||
# core file size (blocks, -c) unlimited |
|||
# data seg size (kbytes, -d) unlimited |
|||
# ... |
|||
</source> |
|||
Use gdb <code>-c core</code> to load the core dump: |
|||
<source lang="bash"> |
|||
./a.out |
|||
# Segmentation fault (core dumped) |
|||
l core* |
|||
# -rw-------+ 1 user user 240K Jun 16 16:35 core |
|||
gdb -c core |
|||
</source> |
|||
Or use any other front-end: |
|||
<source lang="bash"> |
|||
cgdb -c core |
|||
</source> |
|||
Check {{file|/proc/sys/kernel/core_pattern}} for the naming pattern for core files. |
|||
=== Send SIGINT automatically after some time === |
|||
Use this <code>timeout</code>: |
|||
<source lang="bash"> |
|||
#! /bin/bash |
|||
# Source: https://github.com/dscho/msys/blob/master/msys/packages/bash/2.04/examples/scripts/timeout |
|||
#Newsgroups: comp.unix.admin,comp.unix.solaris,comp.unix.shell |
|||
#From: gwc@root.co.uk (Geoff Clare) |
|||
#Subject: Re: timeout -t <sec> <unix command> (Re: How to give rsh a shorter timeout?) |
|||
#Message-ID: <EoBxrs.223@root.co.uk> |
|||
#Date: Fri, 13 Feb 1998 18:23:52 GMT |
|||
# |
|||
# Conversion to bash v2 syntax done by Chet Ramey <chet@po.cwru.edu |
|||
# UNTESTED |
|||
# |
|||
prog=${0##*/} |
|||
usage="usage: $prog [-signal] [timeout] [:interval] [+delay] [--] <command>" |
|||
SIG=-TERM # default signal sent to the process when the timer expires |
|||
timeout=60 # default timeout |
|||
interval=15 # default interval between checks if the process is still alive |
|||
delay=2 # default delay between posting the given signal and |
|||
# destroying the process (kill -KILL) |
|||
while : |
|||
do |
|||
case $1 in |
|||
--) shift; break ;; |
|||
-*) SIG=$1 ;; |
|||
[0-9]*) timeout=$1 ;; |
|||
:*) EXPR='..\(.*\)' ; interval=`expr x"$1" : "$EXPR"` ;; |
|||
+*) EXPR='..\(.*\)' ; delay=`expr x"$1" : "$EXPR"` ;; |
|||
*) break ;; |
|||
esac |
|||
shift |
|||
done |
|||
case $# in |
|||
0) echo "$prog: $usage" >&2 ; exit 2 ;; |
|||
esac |
|||
( |
|||
for t in $timeout $delay |
|||
do |
|||
while (( $t > $interval )) |
|||
do |
|||
sleep $interval |
|||
kill -0 $$ || exit |
|||
t=$(( $t - $interval )) |
|||
done |
|||
sleep $t |
|||
kill $SIG $$ && kill -0 $$ || exit |
|||
SIG=-KILL |
|||
done |
|||
) 2> /dev/null & |
|||
exec "$@" |
|||
</source> |
|||
Example of use: |
|||
<source lang="bash"> |
|||
timeout -INT 300 :5 -- gdb $(APP) |
|||
</source> |
|||
=== Write into memory === |
|||
Use <code>set VARIABLE = VALUE</code>. |
|||
;Modify program variables |
|||
For instance, given a program |
|||
<source lang="c"> |
|||
int main(void){ |
|||
char[] person = "Bob"; |
|||
char[] p2 = "Alice"; |
|||
printf("Hello %s\n"); |
|||
} |
|||
</source> |
|||
We can do |
|||
<source lang="python"> |
|||
set main::person = { 'S', 'a', 'm', 0x00 } |
|||
set main::person = "Sam" |
|||
</source> |
|||
;Write at memory address |
|||
<source lang="python"> |
|||
set {char}0x43f800 = 0x01 |
|||
set {int}0x43f800 = 0x04030201 |
|||
set {char [4]}0x43f800 = {0x01, 0x02, 0x03, 0x04} |
|||
</source> |
|||
;Call functions with side effect |
|||
Alternatively we can use <code>strcpy</code> [https://stackoverflow.com/questions/19503057/in-gdb-how-can-i-write-a-string-to-memory]: |
|||
<source lang="python"> |
|||
p malloc(20) |
|||
# $3 = (void *) 0x6ce81808 |
|||
p strcpy($3, "my string") |
|||
# $4 = 1827149832 |
|||
x/s $3 |
|||
# 0x6ce81808: "my string" |
|||
</source> |
|||
=== Detect current architecture === |
|||
<source lang="python"> |
|||
show architecture |
|||
# The target architecture is set automatically (currently i386) |
|||
</source> |
|||
See also gdb python. |
|||
=== Source file on specific architecture === |
|||
For instance, to source gdb dashboard only on i386: |
|||
<source lang="python"> |
|||
arch=gdb.execute("show architecture", to_string=True).strip() |
|||
if arch.find('i386')>=0: |
|||
gdb.execute("source ~/.gdbinit-gef.py") |
|||
end |
|||
</source> |
|||
=== Breakpoint tips === |
|||
Source: https://nullprogram.com/blog/2024/01/28/ |
|||
;Continuable assertions |
|||
* On X86, we can trigger an INT 3 followed by a NOP so that GDB breaks at the point of the assert |
|||
<source lang="c"> |
|||
#define breakpoint() asm('int3; nop') |
|||
</source> |
|||
;Named position |
|||
* Use a label in a function <code>example</code> (eg. <code>loop</code>), then use <code>break example:loop</code> to set a symbol break that will adapt at each rebuild. |
|||
<source lang="c"> |
|||
// gdb: b example:loop |
|||
void example(double *nums, int n, ...) |
|||
{ |
|||
for (int i = 0; i < n; i++) { |
|||
loop: // named position at the start of the loop |
|||
// ... |
|||
} |
|||
} |
|||
</source> |
|||
* Use an ASM label (one by module allowed). If defined in several module, all will be covered by a single bkp. |
|||
<source lang="c"> |
|||
#//gdb: b b |
|||
for (int i = 0; i < n; i++) { |
|||
asm ("b:"); |
|||
// ... |
|||
} |
|||
</source> |
</source> |
||
Line 378: | Line 1,477: | ||
c |
c |
||
</source> |
</source> |
||
|} |
|||
=== More GDB stuff === |
=== More GDB stuff === |
||
<source lang=gdb> |
<source lang=gdb> |
||
Line 393: | Line 1,494: | ||
da m w 0x2000 0x10 # Add a second zone at 0x2000 |
da m w 0x2000 0x10 # Add a second zone at 0x2000 |
||
</source> |
</source> |
||
=== Frequently used commands === |
|||
<source lang=gdb> |
|||
info reg |
|||
x/i $pc |
|||
</source> |
|||
== GDB script == |
|||
=== Breakpoint command lists === |
|||
See <code>help commands</code> (or [https://sourceware.org/gdb/current/onlinedocs/gdb/Break-Commands.html#Break-Commands here]) |
|||
Example script: |
|||
<source lang="text"> |
|||
break foo if x>0 |
|||
commands |
|||
silent |
|||
printf "x is %d\n",x |
|||
cont |
|||
end |
|||
</source> |
|||
* Use <code>silent</code> to make the breakpoint silent. |
|||
* Use condition (here <code>if x>0</code>) on breakpoint to break conditionally. |
|||
== Troubleshooting == |
|||
=== Python GDB === |
|||
* [https://stackoverflow.com/questions/33482870/gdb-hangs-when-multi-threaded-python-extension-is-used-to-debug-multi-threaded-p Deadlocks in multi-threaded Python extensions] |
Latest revision as of 12:45, 29 January 2024
References
- GDB: The GNU Project Debugger -- Documentation
- 7.2 Example Debugging Session: Segmentation Fault Example (unkownroad.com)
- Debugging with GDB (delorie.com)
- GDB commands (tutorialspoint.com)
- An Interactive Guide to Faster, Less Frustrating Debugging
- Norm Matloff's Debugging Tutorial (also guide to fast editing and gui debuggers DDD, GVD)
- 8 gdb tricks you should know (oracle.com)
- "Give me 15 minutes and I'll change your view of GDB" (video) (undo.io) (Hacker News)
GDB front-ends
There are several types of front-end for GDB:
- GDB built-in front-end (using TUI, Text User Interface)
- Front-end that customizes heavily .gdbinit file.
- Front-end using the newer MI2 interface.
- Front-end using the older MI interface.
More front-ends:
- GDB Front Ends
- How to highlight and color gdb output during interactive debugging?
- See HN post.
- See list on Mozilla rr
- GDB built-in
- GDB TUI — GDB own Text User Interface (C-x C-a: http://davis.lbl.gov/Manuals/GDB/gdb_21.html)
- gdbinit front-end
- IDE with gdb integration
- HOT VSCode https://code.visualstudio.com/docs/cpp/cpp-debug
- HOT Emacs https://www.gnu.org/software/emacs/manual/html_node/emacs/Debuggers.html
- Emacs is frequently touted as having the best GDB integration (better than DDD for instance) [1]. Must check for some tutorials [2].
- CLion https://www.jetbrains.com/clion/
- QtCreator http://doc.qt.io/qtcreator/
- Eclipse
- MI2 front-ends
- NEW gdbgui https://www.gdbgui.com/
- Seen on Mozilla rr.
- (Neovim) lldb.nvim
- Only for LLDB.
- (Vim) Conque-GDB
- Use old shell implementation, not the new neovim terminal.
- (Vim) pyclewn
- Read some comment it was not convenient, and would require extra plugin like https://github.com/notEvil/vim-debug.
- MI front-end (old interface)
GDBINIT Hacks
- Using colout one may color almost any gdb output.
Tools and libraries
- Capstone, the ultimate disassembly engine.
Tutorials
- GDB TUI mode, python, reverse debugging (with
record
).
Alternatives to GDB
rr
rr is the Record and Replay Framework, developped by Mozilla, and is a replacement for and powerful enhancement of gdb.
See rr for more information.
LLDB
☒ | TODO: Have a look at LLDB |
The debugger from Apple, mostly compatible with GDB with more powerful feature, but for LLVM/CLang only.
No debugger
Some people don't use / like debuggers:
- I don't like debuggers. Never have, probably never will. I use gdb all the time, but I tend to use it not as a debugger, but as a disassembler on steroids that you can program.
- The most effective debugging tools are: your brain, a unit test, and the print statement.
- Any programmer not writing unit tests for their code in 2007 should be considered a pariah (*).
GDB configuration
GDB reads file ~/.gdbinit at start.
Some references:
Bare minimum configuration
From StackOverflow [3]:
set history save on
set print pretty
set output-radix 16
set height 0
Install Python modules for GDB
From pwndbg [4]:
# Find the Python version used by GDB.
PYVER=$(gdb -batch -q --nx -ex 'pi import platform; print(".".join(platform.python_version_tuple()[:2]))')
PYTHON=$(gdb -batch -q --nx -ex 'pi import sys; print(sys.executable)')
PYTHON="${PYTHON}${PYVER}"
# Find the Python site-packages that we need to use so that
# GDB can find the files once we've installed them.
SITE_PACKAGES=$(gdb -batch -q --nx -ex 'pi import site; print(site.getsitepackages()[0])')
# or to install in user mode:
SITE_PACKAGES=$(gdb -batch -q --nx -ex 'pi import site; print(site.getusersitepackages())')
# Install Python dependencies
sudo ${PYTHON} -m pip install --target ${SITE_PACKAGES} -Ur requirements.txt
Example of file requirements.txt
pip pycparser psutil>=3.1.0
Front-ends
Gef
Simple install script [5]:
# via the install script
$ wget -q -O- https://github.com/hugsy/gef/raw/master/scripts/gef.sh | sh
# manually
$ wget -O ~/.gdbinit-gef.py -q https://github.com/hugsy/gef/raw/master/gef.py
$ echo source ~/.gdbinit-gef.py >> ~/.gdbinit
Dependencies:
- Install keystone-engine, NOT keystone (fix error
AttributeError: 'module' object has no attribute 'KS_ARCH_X86'
):
pip3 install --no-binary keystone-engine keystone-engine
Tips:
- Gef may work with a second pane in
tmux
. This uses thetmux-setup
/tmux-start
commands, but the most reliable is to identify the PTS device withtmux list-panes -a -F '#D #{pane_tty}'
, and then assign the appropriate device to variableredirect
in ~/.gef.rc.
tmux list-panes -F '#D #{pane_tty}'
# %0 /dev/pts/9
# %1 /dev/pts/10
- Use
context
(or aliasctx
) to re-print gef context. - Use
db &myvar
to dump (byte) at some variable location.
GDB dashboard
GDB dashboard is a modular visual interface for GDB in Python.
To install simply copy .gdbinit as ~/.gdbinit
cp gdb-dashboard/.gdbinit ~/.gdbinit
Alternatively, source it from ~/.gdbinit:
source ~/.gdbinit-dashboard
- Install pygments
Install pygments to get source highlighting
sudo pip install Pygments # Globally pip install Pygments # Locally
If GDB uses python3 (ldd $(which gdb))
), you'll need to install with pip3
:
sudo pip3 install Pygments # Globally pip3 install Pygments # Locally
To get the list of available styles:
python from pygments.styles import get_all_styles as styles python for s in styles(): print(s)
Alternative styles:
- Issues
Cannot write the dashboard: [Errno 10] No child process
issue #56
- This is a bug in the gdb vendor. Fix file platform.py:
@@ -1009,7 +1009,10 @@
except (AttributeError,os.error):
return default
output = string.strip(f.read())
- rc = f.close()
+ try:
+ rc = f.close()
+ except:
+ rc = 0
if not output or rc:
return default
else:
Cannot write the dashboard: Invalid character '?' in expression.
- Likely due to incorrect character in
info reg
. The register is evaluated viaparse_and_eval
Voltron
See Voltron.
GDB TUI
Some example inspired from Greg Law video on Youtube:
- Start TUI mode with Ctrl-X A, or
gdb --tui
, ortui enable
. - Ctrl-L repaint the screen.
- Ctrl-X 2 to split the layout (repeat to cycle through layout), or
layout split
. - Ctrl-X 1 for back to source-only or disas-only layout (depending on the split-layout we left).
tui reg float
for instance to change the register view.
Note that while in TUI mode, up and down keys will scroll the source code:
- Up / Down scroll the source code.
- Ctrl-P / Ctrl-N Previous or Next command in command history.
- Ctrl-O switch focus (does not work for me).
CGDB
Probably one of the best lightweight front-end out there, with vim-like bindings.
- Esc or i to switch between modes.
- Ctrl-W to switch between vertical / horizontal layout (requires latest cgdb from source).
- j, k, Ctrl-D, Ctrl-U to navigate (and more vim-like bindings as well)
- Space to set a breakpoint, t for a temporary breakpoint
- / and n for search and next
:help
for help (when in cgdb mode):set dis
or:set nodis
to view disassembly.
cgdb works nicely with rr:
rr replay -d cgdb
See rr page for more options.
Prepare debug session
- Compile with debug symbols, use option -g:
gcc -g program.c # -g : debug symbols
gcc -g -O0 program.c # ... -O0: disable optimization
- Force core dumps (see bash help ulimit):
ulimit -c unlimited
./a.out
# Segmentation fault (core dumped)
GDB invocation
gdb a.out
gdb a.out core.1234 # If coredump available
GDB commands
Reference
- GDB manual
- https://beej.us/guide/bggdb/
Break points and watch points | |
---|---|
|
Set a breakpoint at current line, at given line NUMBER or NUMBER lines after/before current line. Set breakpoint at LOCATION.
|
|
Set a HARDWARE breakpoint at current line, at given line NUMBER or NUMBER lines after/before current line. Set a HARDWARE breakpoint at LOCATION. |
|
Set a conditional breakpoint.
|
|
Idem, but temporary breakpoint. |
|
Set ignore-count of breakpoint N to COUNT |
|
Set commands to execute when a breakpoint is hit |
|
Stop execution when EXPR changes |
|
Stop execution when EXPR is accessed |
|
list breakpoints |
|
Clear breakpoint by LOCATION, or current line |
|
Delete all breakpoints |
|
Clear breakpoint by NUMBER (as listed by i b )
|
|
Disable breakpoint by NUMBER (as listed by i b ).
|
|
Save current breakpoints as script FILE. Use source to reload.
|
Registers | |
|
List integer registers |
Execute program | |
---|---|
|
Start (or restart) program. Arguments may include wildcards (*) and redirections (<, <<...) |
|
Start and break on main
|
|
Kill current program. |
|
Continue and interrupted program. |
|
Step (into) current line, or NUMBER lines. |
|
Step one (assembly) instruction exactly (N times) |
|
Run to next line (over current line) |
|
Step one (assembly) instruction exactly (N times), but over subroutine calls. |
|
Execute till returning from current selected frame. |
|
Run until temporary breakpoint set at LOCATION. |
|
Execute until the program reaches a source line greater than current (very handy). |
View stack | |
---|---|
|
Print backtrace of all stack frames, or innermost (outermost) COUNT frames if COUNT>0 (COUNT<0) |
|
Select frame FRAME and print stack frame |
|
Go up a level in the stack (frame calling current frame). |
|
Go down a level in the stack (frame called by current frame). |
View / Change memory | |
---|---|
|
Display EXPR at each prompt (if within scope), using format FMT. If omitted, reuse last format FMT (of last |
|
Examine n memory locations at ADDR, format as n (
|
|
Print information on local variables / function arguments in the current frame |
|
print EXPR, using format FMT. If omitted, reuse last format FMT (of last x , p or disp ).
Use |
|
Undisplay expression by NUMBER. |
|
Write VALUE in VARIABLE (this modifies memory). |
View code | |
---|---|
|
List (10 by default) lines of current frame |
|
Disassemble a specified section of memory |
Miscellaneous | |
---|---|
|
Quit gdb. |
|
Get help on COMMAND, or search commands related to WORD. |
|
Source script FILE. |
RETURN | Repeat last command. |
|
Load symbols from FILE at given ADDR. |
- Keyboard shortcuts
- C-L — Clear the screen (useful when pgm writes to the screen).
Dump / restore
For instance, to load an Intel hex file:
restore firmware.hex
To generate the same file:
dump ihex memory firmware.hex 0x00400000 0x0040FFFF
Note that dump
can also save the result of an expression:
dump [format] memory filename start_addr end_addr
dump [format] value filename expr
Reverse debugging
- References
- GDB Manual - Reverse execution
- GDB Manual - Recording Inferior’s Execution and Replaying It
- "Give me 15 minutes and I'll change your view of GDB" (video) (undo.io) (Hacker News)
- Example session
Say we have the following program:
void error(void)
{
// ...
}
void foo(void)
{
// ...
bar(); // Calls error() if something goes wrong
baz();
// ...
}
We want to debug function bar
. We can use gdb reverse debugging as follows:
gdb myprog
# Start recording when entering 'bar'
b bar
commands
record
continue
end
# Break if error
b error
# Stop recording when we exit bar
tbp baz
record stop
continue
end
If an error occurs, we can use the following commands to walk backward (see reference for more):
- The usual step or next, but in reverse:
reverse-next
orrn
,reverse-nexti
orrni
,reverse-step
orrs
,reverse-stepi
orrsi
. - Reverse continue (until next breakpoint, or record start):
reverse-continue
orrc
. - Reverse finish:
reverse-finish
.
Recording may be very slow. A faster alternative is to use rr.
- Troubleshooting
- When getting the following messages:
Process record does not support instruction 0xfae64 (xsave)
Process record does not support instruction 0xc5 at address
...
- This mostly happens when gdb encounters an instruction it cannot handle yet, such as AVX instructions. Usually it occurs when a library is loaded to detect which CPU binary to use [6], [7], [8]
- The fix is either to run gdb with LD_BIND_NOW=1 (
LD_BIND_NOW=1 gdb a.out
) to avoid exotic instruction due to ld. - Or compile with -static (
clang -g -static hello.c
).
- The fix is either to run gdb with LD_BIND_NOW=1 (
- gdb prints the following message and exits:
# [1]+ Stopped LD_BIND_NOW=1 gdb a.out
fg
# LD_BIND_NOW=1 gdb a.out
# Process record: failed to record execution log.
#
# Program stopped
- This is simply indicative that program probably segfaulted. Do
fg
to resume gdb and start debug backward. - This does not seem to happen when using cgdb.
- Watch point ignored when reverse executing
- When setting a watchpoint, gdb goes up to program start and prints
No more reverse-execution history
.
watch *(long **)0x7fffffffffd4a8
# Hardware watch point 4: *(long **)0x7fffffffffd4a8
rc
# Continuing
#
# No more reverse-execution history.
# main () at bubble_sort.c:34
# 34 unsigned i = 0;
- There is a bug in gdb that makes it ignore hardware watchpoint in reverse execution... The fix [9], [10]
set can-use-hw-watchpoints 0
watch *(long **)0x7fffffffffd4a8
Python
See gdb python.
Tips
Use RETURN to repeat last command
Pressing RETURN repeats all last command:
- Commands like
s
(step) orn
(next). Very handy to step in the code. - Command like
x/16w 0x10100000
orx/16i 0x10100000
, each time printing 16 new words.
Define a custom label for breakpoint in C/C++
Say we want to set a breakpoint at a specified location in source file, but this position may move over time. The easiest is to use an asm
statement to define the label [11]:
#include <stdio.h>
int main () {
void *ret_p = &&ret;
printf("ret: %p\n", ret_p);
goto *ret_p;
return 1;
ret:
asm("RET:")
return 0;
}
This will add a symbol table entry as follows.
gcc -Wl,--export-dynamic t.c -ldl
readelf -s a.out | grep RET
# 41: 0804858a 0 NOTYPE LOCAL DEFAULT 13 RET
Use help
to test a command abbreviation
Don't know if f
stands for finish
or frame
? Just use help f
:
help f
# Select and print a stack frame.
# ...
8 gdb tricks you should know
From https://blogs.oracle.com/ksplice/entry/8_gdb_tricks_you_should:
- Use
break WHERE if COND
- For instance
break context_switch if next == init_task
.
- Use
command
- This sets commands to be executed when a breakpoint is hit. For instance
b do_mmap_pgoff
# Breakpoint 1 at 0xffffffff8111a441: file mm/mmap.c, line 940.
command 1
# Type commands for when breakpoint 1 is hit, one per line.
# End with a line saying just "end".
>print addr
>print len
>print prot
>end
- Use
gdb --args
to specify runtime arguments
gdb --args pizzamaker --deep-dish --toppings=pepperoni
# ...
show args
# Argument list to give program being debugged when it is started is
# " --deep-dish --toppings=pepperoni".
b main
# ...
- Finding source files
- Use
directory
to add directory to search for source files.
list main
# 1192 ls.c: No such file or directory.
# in ls.c
directory ~/src/coreutils-7.4/src/
# Source directories searched: /home/nelhage/src/coreutils-7.4:$cdir:$cwd
list main
- Use
set substitute-path
to fix absolute paths
list schedule
# 5519 /build/buildd/linux-2.6.32/kernel/sched.c: No such file or directory.
# in /build/buildd/linux-2.6.32/kernel/sched.c
set substitute-path /build/buildd/linux-2.6.32 /home/nelhage/src/linux-2.6.32/
list schedule
- Use gcc
-ggdb3
to add MACRO's symbol as gdb macro
- Use gdb's variable like
$1
but also define your own
set $foo = 4
p $foo
# $3 = 4
- Use register variables
- All CPU registers are available as variables like
$R0
,$R1
, but gdb also defines cross-architectures ones like$sp
and$pc
. For instance, if$rsi
register is used to pass the 1st parameter,
break write if $rsi == 2
- The
x
command
x/s 0xffffffff81946000
# ffffffff81946000 <>: "root=/dev/sda1 quiet"
# Use x/i as a quick way to disassemble memory
x/5i schedule
# 0xffffffff8154804a <schedule>: push %rbp
# 0xffffffff8154804b <schedule+1>: mov $0x11ac0,%rdx
# 0xffffffff81548052 <schedule+8>: mov %gs:0xb588,%rax
# 0xffffffff8154805b <schedule+17>: mov %rsp,%rbp
# 0xffffffff8154805e <schedule+20>: push %r15
# Print code surround current pc
x/20i $ip-40
- Use the
@
symbol
- The following works if source code is something like int a[ 10 ] = { 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 };:
p a
# $1 = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
- But if the array is much bigger, or array is a C++ vector, we can use
@
:
p *&a[0]@10
# $1 = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
p *&a[550]@4
# $1 = {551, 552, 553, 554}
Use value history variables $
See summary list below. For more details, check GDB manual on Value History.
$1 # History value 1
$2 # History value 2
$ # MOST RECENT value in the history
$$ # ... and value before that
$$2 # ... and value before that
$$0 # Same as $
$$1 # Same as $$
For instance:
p *$ # Dereference last value
p *$.next # ... follow the chain
<RETURN> # ... again
<RETURN> # ... again
show values # Print the last 10 values.
Disassemble at an arbitrary address
disas FUNCTION
command only works for defined FUNCTION
. To disassemble at an arbitrary address:
x/i 0x10100000
one instruction at address0x10100000
,x/16i 0x10100000
(16 instruction) at addressOx10100000
,x/16i *0x10100000
(16 instruction) at address contained atOx10100000
,x/16i $PC
disassemble 16 instructions at PC.
Redirect STDIN
Redirect STDIN with the run
command:
gdb ./myprog
run <input.txt # Run with stdin redirected
One can also pass parameters with the run
command (or use gdb --args
command-line parameter).
Exit gdb if run fails
Use python script
def tryRun():
try:
gdb.execute("run")
except:
print("Exception occured during gdb command: run")
gdb.execute("quit 1")
Define custom gdb command
Define custom gdb command using define
:
# define in .gdbinit
define sf
where # find out where the program is
info args # show arguments
info locals # show local variables
end
See also Gdb python to define more powerful custom commands.
Load symbols from another ELF file
This is useful in embedded systems where some part of the code are already loaded or stored in a ROM. When the debugger stops in a function for which it doesn't have the symbols, it cannot decode the stack and provide a correct backtrace. The fix is to load the symbols for that function using the command add-symbol-file
.
add-symbol-file a.out 0 # Add symbol from file a.out, at offset 0
Typically the offset to give is the lowest address in the ELF file.
Modify memory content
Use set
[12]. Simplest is to modify a variable:
set variable i = 10
But GDB can write in any memory location:
set {int}0x83040 = 4
set *((int *)0x83040) = 4
Remote debug with gdbserver
Start the gdb server on the remote machine:
gdbserver --multi 192.168.20.1:6666 # CLIENT_ADDR:REMOTE_PORT
This will listen on port 6666
, waiting for connection from client 192.168.20.1
On the client, start the GDB client:
gdb-multiarch ./my_exec # multiarch necessary if client/server are different arch
Then, in GDB:
target extended-remote {REMOTE_ADDR}:{REMOTE_PORT}
set remote exec-file /path/on/remote/to/my_exec
If the remote executable uses dynamic libraries, these must be sent to the client. To accelerate the loading, copy first the libraries locally on the client, then tell GDB where to find them [13]:
set sysroot /d/st/courses/training_obfu/formation_reverse/dynamic/tp/qmessage/pi_root
set solib-absolute-prefix /d/st/courses/training_obfu/formation_reverse/dynamic/tp/qmessage/pi_root
set solib-search-path /d/st/courses/training_obfu/formation_reverse/dynamic/tp/qmessage/pi_root
Reverse debugging
From Greg Law's incredible video.
Assume we have an application that crashes at random. We want to debug it when it crashes.
First we setup some breakpoint and start recording automatically:
gdb a.out
start
break main
command
record
continue
end
break _exit.c:32 # Hack. to break on exit
command
run
end
We also make sure pagination is disabled:
set pagination off
Now we cont
until the code crashes. We start to reverse debug the program:
p $pc
# 1 = (void (*)()) 0x5e0c5d00
x 0x5e0c5d00
# 0x5e0c5d00 Cannot access memory at address 0x5e0c5d00
bt
# ... the stack is corrupted. Let's step back...
reverse-stepi
# 0x00000000004806ac in main () at bubble_sort.:43
# 43 }
# ... we're back in normal land
bt
# 0x00000000004806ac in main () at bubble_sort.:43
disas
# 0x00000000004806ac <+135>: retq
# ... so indeed we tried to return, but likely the stack is corrupted
Let's look at our stack:
print $sp
# 2 = (void *) 0x7fffffffdc98
print *(long **) 0x7fffffffdc98
# 3 = (long *) 0x5e4c5d00
# ... this the address we would jump to if RETQ
watch *(long **) 0x7fffffffdc98
# Hardware watchpoint 4: *(long **) 0x7fffffffdc98
We have set our hardware watchpoint, and will reverse continue! This will bring us to the point where the corruption occured:
reverse-continue
# old value = (long *) 0x5e4c5d00
# new value = (long *) 0xffff7a36ec5 <__libc_start_main+245>
# 0x00000000004806a8 in main () at bubble_sort.:37
print i
4 = 35
We were indeed overwriting the stack because of array overflow!
Use Catch to break on syscall
This was used by HN to debug a complex bugs in some Python profiler [14]:
# This looks for "lseek" followed by "close" without intervening "read".
set height 0
catch syscall close
catch syscall read
catch syscall lseek
disable 1 2
commands 2
disable 1 2
continue
end
commands 3
if $rdi == 31
enable 1 2
continue
else
continue
end
end
Printing and Examing memory
- Getting rid of octal
- By default, gdb uses octal' to print char string, which is a major PITA. Use
p/x
orp/a
to work-around that, or use examine (x
)
- Dealing with examine default format / unit size
- The complete examine syntax is
x/nfu addr
, where n is the repeat count, f is the format (same as for print, ie. one ofx
,d
,u
,o
,t
,a
,c
,f
,s
), and u is the unit size (one ofb
,h
,w
,g
). - If not specified, examine uses a default value for the format (
x
initially) and unit size (w
for x format,g
for a format). But the default changes every time you use print or examine!
Printing arrays and pointers (to arrays)
Given the C code:
uint8_t arr[8] = {0x12, 0x34, 0x56, 0x78, 0x11, 0x22, 0x33, 0x44};
uint8_t *p = arr;
Using print or p:
# -- print: arr and *p
p arr
p *p@8
# $1 = "\022\064Vx\021\"3D"
p/a arr
p/x arr
p/a *p@8
p/x *p@8
# $2 = {0x12, 0x34, 0x56, 0x78, 0x11, 0x22, 0x33, 0x44}
p *p
# $3 = 0x12
# -- print: &arr, p and &p
p &arr
# $4 = (uint8_t (*)[8]) 0x404030 <arr>
p p
# $4 = (uint8_t *) 0x404030 <arr> "\022\064Vx\021\"3D"
p/a &arr
p/a p
# $5 = 0x404030 <arr>
p/x &arr
p/x p
# $3 = 0x404030
p &p
# $1 = (uint8_t **) 0x404040 <p>
p/a &p
# $2 = 0x404040 <p>
p/x &p
# $3 = 0x404040
# -- print: More cases with @
p arr@2
# $8 = {"\022\064Vx\021\"3D", "\000\000\000\000\001\000\000"}
p/a arr@2
p/x arr@2
# $9 = {{0x12, 0x34, 0x56, 0x78, 0x11, 0x22, 0x33, 0x44}, {0xa2, 0xa1, 0xb2, 0xb1, 0x0, 0x0, 0x0, 0x0}}
p p@8
# $7 = {0x404030 <arr> "\022\064Vx\021\"3D", 0x100000001 <error: Cannot access memory at address 0x100000001>, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0}
p/a p@8
# $8 = {0x404030 <arr>, 0x100000001, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0}
p/x p@8
# $9 = {0x404030, 0x100000001, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0}
Using examine or x:
# -- examine: arr, &arr, p (all points to same address)
# ... We force FORMAT and UNIT SIZE to avoid relying on previous default setting.
x/xw arr
x/xw &arr
x/xw p
# 0x404030 <arr>: 0x78563412
# ... From now on, all examine will use hexadecimal, and unit size for hex is 'word'.
x/xg arr
x/xg &arr
x/xg p
# 0x404030 <arr>: 0x4433221178563412
# ... From now on, all examine will use hexadecimal, and unit size for hex is 'giant'.
x/xb arr
x/xb &arr
x/xb p
# 0x404030 <arr>: 0x12
# ... From now on, all examine will use hexadecimal, and unit size for hex is 'byte'.
x/8b arr
x/8b &arr
x/8b p
# 0x404030 <arr>: 0x12 0x34 0x56 0x78 0x11 0x22 0x33 0x44
# -- examine: address 'a' format is always SIGNED, which may gives strange results:
# ... same as 'x' if positive:
x/xw arr
x/aw arr
# 0x404030 <arr>: 0x78563412
# ... but different is negative. This is because 'a' takes the value as an address (negative=stack).
set arr[3]=0x80
x/aw arr
# 0x404030 <arr>: 0xffffffff80563412
x/xw arr
# 0x404030 <arr>: 0x80563412
# -- examine: &p (address of pointer p itself)
x/aw &p
# 0x404050 <p>: 0x404030 <arr>
# ... note again difference with 'hexadecimal' format:
x/xw &p
# 0x404050 <p>: 0x00404030
# -- examine: bad cases (usually)
x *p
# 0x1: Cannot access memory at address 0x12
x &p@8
# Only values in memory can be extended with '@'.
x/xb arr
x/xb arr@8
# 0x404030 <arr>: 0x12
# ... @8 has no effect. One must use x/8xb
- Notes
- We can use both
p/x
andp/a
to remove octal format. The small advantage forp/a
is that it will also prints to nearest preceding symbol.
p/a some_struct # Address mode... but strangely it removes octal. WTF?
p
andx
always default to last used format, unless one is specified. The default forx
is x, and changes each time you usex
orprint
[15]:
Printing struct and pointers (to struct)
Given the C code:
struct {
uint16_t a;
uint16_t b;
} S = {0xa1a2,0xb1b2}, *s = &S;
Using print or p:
# -- print: S and *s
p S
p/a S
p/x S
p *s
p/a *s
p/x *s
# $9 = {
# a = 0xa1a2,
# b = 0xb1b2
# }
# -- print: &S, s and &s
p &S
p s
# $1 = (struct {...} *) 0x404038 <S>
p/a &S
p/a s
# $2 = 0x404038 <S>
p/x &S
p/x s
# $3 = 0x404038
p &s
# $4 = (struct {...} **) 0x404040 <s>
p/a &s
# $5 = 0x404040 <s>
p/x &s
# $6 = 0x404040
# -- print: More cases with @
p S@2
p/a S@2
p/x S@2
# $2 = {{
# a = 0xa1a2,
# b = 0xb1b2
# }, {
# a = 0x0,
# b = 0x0
# }}
p s@2
p/a s@2
# $2 = {0x404038 <S>, 0x0}
p/x s@2
# $2 = {0x404038, 0x0}
Using examine or x:
# -- examine: &S, s, in hexadecimal format
x/xw &S
x/xw s
# 0x404038 <S>: 0xb1b2a1a2
x/xg &S
x/xg s
# 0x404038 <S>: 0x00000000b1b2a1a2
x/xb &S
x/xb s
# 0x404038 <S>: 0xa2
# -- examine: &S, s, in address format
x/aw &S
x/aw s
# 0x404038 <S>: 0xffffffffb1b2a1a2
x/ab &S
x/ab s
# 0x404038 <S>: 0xffffffffffffffa2
x/ag &S
# 0x404038 <S>: 0xb1b2a1a2
# ... This is because address is 0x00000000b1b2a1a2, and leading zeroes are removed.
# -- examine: bad cases (usually)
x/xb S
x/xb *s
# Value can't be converted to integer.
x/xb &S@8
# Only values in memory can be extended with '@'.
x/xw s
x/xw s@4
# 0x404040 <s>: 0x00404038
# ... @8 has no effect. One must use x/4xw
Hook on quit
From Stackoverflow:
define hook-quit
set confirm off
end
Hook on stop
Say you have a custom cmd
that you'll like to execute whenever GDB breaks:
define hook-stop
cmd
end
Enable core dump and load in gdb
Enable core dump file with ulimit:
ulimit -c unlimited
Check they are well enabled:
ulimit -a
# core file size (blocks, -c) unlimited
# data seg size (kbytes, -d) unlimited
# ...
Use gdb -c core
to load the core dump:
./a.out
# Segmentation fault (core dumped)
l core*
# -rw-------+ 1 user user 240K Jun 16 16:35 core
gdb -c core
Or use any other front-end:
cgdb -c core
Check /proc/sys/kernel/core_pattern for the naming pattern for core files.
Send SIGINT automatically after some time
Use this timeout
:
#! /bin/bash
# Source: https://github.com/dscho/msys/blob/master/msys/packages/bash/2.04/examples/scripts/timeout
#Newsgroups: comp.unix.admin,comp.unix.solaris,comp.unix.shell
#From: gwc@root.co.uk (Geoff Clare)
#Subject: Re: timeout -t <sec> <unix command> (Re: How to give rsh a shorter timeout?)
#Message-ID: <EoBxrs.223@root.co.uk>
#Date: Fri, 13 Feb 1998 18:23:52 GMT
#
# Conversion to bash v2 syntax done by Chet Ramey <chet@po.cwru.edu
# UNTESTED
#
prog=${0##*/}
usage="usage: $prog [-signal] [timeout] [:interval] [+delay] [--] <command>"
SIG=-TERM # default signal sent to the process when the timer expires
timeout=60 # default timeout
interval=15 # default interval between checks if the process is still alive
delay=2 # default delay between posting the given signal and
# destroying the process (kill -KILL)
while :
do
case $1 in
--) shift; break ;;
-*) SIG=$1 ;;
[0-9]*) timeout=$1 ;;
:*) EXPR='..\(.*\)' ; interval=`expr x"$1" : "$EXPR"` ;;
+*) EXPR='..\(.*\)' ; delay=`expr x"$1" : "$EXPR"` ;;
*) break ;;
esac
shift
done
case $# in
0) echo "$prog: $usage" >&2 ; exit 2 ;;
esac
(
for t in $timeout $delay
do
while (( $t > $interval ))
do
sleep $interval
kill -0 $$ || exit
t=$(( $t - $interval ))
done
sleep $t
kill $SIG $$ && kill -0 $$ || exit
SIG=-KILL
done
) 2> /dev/null &
exec "$@"
Example of use:
timeout -INT 300 :5 -- gdb $(APP)
Write into memory
Use set VARIABLE = VALUE
.
- Modify program variables
For instance, given a program
int main(void){
char[] person = "Bob";
char[] p2 = "Alice";
printf("Hello %s\n");
}
We can do
set main::person = { 'S', 'a', 'm', 0x00 }
set main::person = "Sam"
- Write at memory address
set {char}0x43f800 = 0x01
set {int}0x43f800 = 0x04030201
set {char [4]}0x43f800 = {0x01, 0x02, 0x03, 0x04}
- Call functions with side effect
Alternatively we can use strcpy
[16]:
p malloc(20)
# $3 = (void *) 0x6ce81808
p strcpy($3, "my string")
# $4 = 1827149832
x/s $3
# 0x6ce81808: "my string"
Detect current architecture
show architecture
# The target architecture is set automatically (currently i386)
See also gdb python.
Source file on specific architecture
For instance, to source gdb dashboard only on i386:
arch=gdb.execute("show architecture", to_string=True).strip()
if arch.find('i386')>=0:
gdb.execute("source ~/.gdbinit-gef.py")
end
Breakpoint tips
Source: https://nullprogram.com/blog/2024/01/28/
- Continuable assertions
- On X86, we can trigger an INT 3 followed by a NOP so that GDB breaks at the point of the assert
#define breakpoint() asm('int3; nop')
- Named position
- Use a label in a function
example
(eg.loop
), then usebreak example:loop
to set a symbol break that will adapt at each rebuild.
// gdb: b example:loop
void example(double *nums, int n, ...)
{
for (int i = 0; i < n; i++) {
loop: // named position at the start of the loop
// ...
}
}
- Use an ASM label (one by module allowed). If defined in several module, all will be covered by a single bkp.
#//gdb: b b
for (int i = 0; i < n; i++) {
asm ("b:");
// ...
}
Examples
Simple Segmentation Fault Example
(From [17])
Example program segfault.c:
#include <stdio.h>
#include <stdlib.h>
int main(int argc, char **argv)
{
char *buf;
buf = malloc(1<<31);
fgets(buf, 1024, stdin);
printf("%s\n", buf);
return 1;
}
|
Compile and launch gdb:
gcc -g segfault.c
gdb a.out
The debug session run
backtrace
frame 3
print buf
kill
break segfault.c:8
run
print buf
next
print buf Fix the bug, then start again, watching now watch buf
# Start again, answer 'y' when asked to start from beginning
run
# Break at watch point, let's _c_ontinue
c
|
More GDB stuff
display i # Display i at each stop
display/4i $pc # Display the 4 next instruction at each stop
u # Continue until we reach an higher pc address
Using gdb-dashboard
da # Print dashboard again
da reg # Remove register
da reg # Add it back
da m w 0x1000 0x10 # Watch memory zone at 0x1000 for 16 bytes
da m w 0x2000 0x10 # Add a second zone at 0x2000
Frequently used commands
info reg
x/i $pc
GDB script
Breakpoint command lists
See help commands
(or here)
Example script:
break foo if x>0
commands
silent
printf "x is %d\n",x
cont
end
- Use
silent
to make the breakpoint silent. - Use condition (here
if x>0
) on breakpoint to break conditionally.