api/docs/intro.dox - external/github.com/DynamoRIO/dynamorio - Git at Google

 /* ******************************************************************************
  * Copyright (c) 2010-2025 Google, Inc.  All rights reserved.
  * Copyright (c) 2011 Massachusetts Institute of Technology  All rights reserved.
  * Copyright (c) 2007-2010 VMware, Inc.  All rights reserved.
  * ******************************************************************************/

 /*
  * Redistribution and use in source and binary forms, with or without
  * modification, are permitted provided that the following conditions are met:
  *
  * * Redistributions of source code must retain the above copyright notice,
  *   this list of conditions and the following disclaimer.
  *
  * * Redistributions in binary form must reproduce the above copyright notice,
  *   this list of conditions and the following disclaimer in the documentation
  *   and/or other materials provided with the distribution.
  *
  * * Neither the name of VMware, Inc. nor the names of its contributors may be
  *   used to endorse or promote products derived from this software without
  *   specific prior written permission.
  *
  * THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS"
  * AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
  * IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
  * ARE DISCLAIMED. IN NO EVENT SHALL VMWARE, INC. OR CONTRIBUTORS BE LIABLE
  * FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
  * DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
  * SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
  * CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
  * LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
  * OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH
  * DAMAGE.
  */

 /* The name "page_user_docs" is hardcoded in CMake_rundoxygen.cmake. */

 /**
  ****************************************************************************
  ****************************************************************************
  ****************************************************************************
 \page page_user_docs Build Your Own Tool

 DynamoRIO is a <em>runtime code manipulation system</em> that supports
 code transformations on any part of a program, <em>while it executes</em>.
 DynamoRIO gives complete control over the runtime code stream and does not
 limit transformations to trampoline insertion.  DynamoRIO exports an
 interface for building dynamic tools for a wide variety of uses: program
 analysis and understanding, profiling, instrumentation, optimization,
 translation, etc. DynamoRIO provides efficient, transparent, and
 comprehensive manipulation of an unmodified application running on a stock
 operating system (Windows, Linux, or Android) and commodity IA-32, AMD64,
 ARM, and AArch64 hardware.  See \ref sec_limit_platforms for details of
 which platform combinations are fully supported.

 This document describes the DynamoRIO system and the various APIs that it
 exports for building custom tools.  It is divided into the following
 sections:

 - \subpage page_tutorials
   <br>Gives a few short tutorials on using DynamoRIO and includes slides
   from prior in-person tutorial presentations.

 - \subpage API_samples
   <br>Shows some sample use cases and reference implementations.

 - \subpage page_build_client
   <br>How to build a tool or "client" of DynamoRIO.

 - \subpage page_deploy
   <br>How to run DynamoRIO.

 - \subpage using
   <br>The top-level interfaces provided to a tool.

 - \subpage API_BT
   <br>DynamoRIO's full runtime code manipulation interface.

 - \ref page_ext
   <br>DynamoRIO's API is augmented by a collection of extension libraries.

 - \subpage page_standalone
   <br>DynamoRIO can be used as a standalone library for IA-32/AMD64/ARM/AArch64
   disassembly, decoding, encoding, and general instruction manipulation.
   A separate static library is provided for this purpose.

 - \subpage overview
   <br>A description of the implementation of the DynamoRIO system.

 - \subpage release_notes
   <br>Release notes for this release, including changes since prior
   releases and plans for future releases.

 <br>

 ***************************************************************************
 ***************************************************************************
 \page overview DynamoRIO System Overview

 DynamoRIO is a system for runtime code manipulation that is efficient,
 transparent, and comprehensive, able to observe and manipulate every
 executed instruction in an unmodified application running on a stock
 operating system and commodity hardware.

 ***************************************************************************
 \section sec_intro Introduction

 DynamoRIO operates in user mode on a target process.  It acts as a
 <em>process virtual machine</em>, interposing between the application and
 the operating system.  It has a complete view of the application code
 stream and acts as a runtime control point, allowing custom tools to be
 embedded inside it:

   \image html interpose.png
   \image latex interpose.eps "Flow chart" width=10cm

 The application itself, along with the underlying operating system and
 hardware, remain unchanged.  DynamoRIO operates in native (non-virtual)
 environments as well as inside guest operating systems running on virtual
 machines.  Tools created on top of DynamoRIO will operate without change
 whether the underlying operating system is native or a virtual machine
 guest.

 ***************************************************************************
 \section sec_system System Operation

 DynamoRIO operates by shifting an application's execution from its original
 instructions to a <em>code cache</em>, where the instructions can be freely
 modified. DynamoRIO occupies the address space with the application and has
 full control over execution, taking over whenever control leaves the code
 cache or when the operating system directly transfers control to the
 application (<em>kernel-mediated control transfers</em>):

   \image html flow-highlevel.png
   \image latex flow-highlevel.eps "Flow chart" width=10cm

 DynamoRIO copies the application code one <em>dynamic basic block</em> at a
 time into its basic block code cache. A block that directly targets another
 block already resident in the cache is linked to that block to avoid the
 cost of returning to the DynamoRIO dispatcher.

 Frequently executed sequences of basic blocks are combined into
 <em>traces</em>, which are placed in a separate code cache. DynamoRIO makes
 these traces available via its interface for convenient access to hot
 application code streams.

 The following figure shows the flow of control between the components of
 DynamoRIO and its code caches:

   \image html flow.png
   \image latex flow.eps "Flow chart" width=15cm

 The context switch is between DynamoRIO's operational state and the machine
 state of the application: both are still within the same process.

 Indirect branches require dynamic resolution of their targets, which is
 performed via an inlined table lookup or a compare to a known target
 inlined into a trace.

 \section sec_sys_transp Transparency

 Transparency is an important requirement for DynamoRIO and its clients.
 The subject is fully covered in \subpage transparency.

 ***************************************************************************
 \section sec_refs References

 The canonical reference for DynamoRIO is:

 - Derek Bruening.<br>
   <a href="http://www.burningcutlery.com/derek/phd.html">
   Efficient, Transparent, and Comprehensive Runtime Code Manipulation</a>.<br>
   Ph.D. Thesis, MIT, September 2004.

 See \ref page_publications for other publications involving DynamoRIO.

 ****************************************************************************
 ****************************************************************************
 */
 /* It's good to use separate C comments: we've hit some sort of doxygen
  * internal buffering error before if one comment gets too long.
  */
 /**
 ***************************************************************************
 ***************************************************************************
 \page using Tool Event Model and API

 This section gives an overview of how to use DynamoRIO, divided into the
 following sub-sections:

  - \ref sec_events
  - \ref sec_utils
  - \ref sec_extlibs
  - \ref sec_extensions
  - \ref sec_comm
  - \ref sec_annotations
  - \ref sec_64bit_reach
  - \ref sec_utf8

 DynamoRIO exports a rich Application Programming Interface (API) to the
 user for building a DynamoRIO <em>client</em>. A DynamoRIO client is a
 library that is coupled with DynamoRIO in order to jointly operate on an
 input program binary:

   \image html client.png
   \image latex client.eps "DynamoRIO client" width=10cm

 To interact with the client, DynamoRIO provides specific events that a
 client can intercept.  Event interception functions, if supplied by a user
 client, are called by DynamoRIO at appropriate times.

 DynamoRIO can alternatively be used as a third-party disassembly library
 (see \ref page_standalone).

 ***************************************************************************
 \section sec_events Common Events

 A client's primary interaction with the DynamoRIO system is via a
 set of event callbacks.  These events include the following:

   - Basic block and trace creation or deletion
     (dr_register_bb_event(), dr_register_trace_event(), dr_register_delete_event())
   - Process initialization and exit
     (dr_client_main(), dr_register_post_attach_event(), dr_register_exit_event())
   - Thread initialization and exit
     (dr_register_thread_init_event(), dr_register_thread_exit_event())
   - Fork child initialization (Linux-only); meant to be used for
     re-initialization of data structures and creation of new log files
     (dr_register_fork_init_event())
   - Application library load and unload
     (dr_register_module_load_event(), dr_register_module_unload_event())
   - Application fault or exception (signal on Linux)
     (dr_register_exception_event(), dr_register_signal_event())
   - Kernel-mediated control transfers (dr_register_kernel_xfer_event()):
     - Application APC (Asynchronous Procedure Call), callback, or exception
       dispatcher execution (Windows)
     - Application signal delivery (Linux)
     - System call that changes the context
   - System call interception: pre-system call, post-system call, and system
     call filtering by number
     (dr_register_pre_syscall_event(), dr_register_post_syscall_event(),
     dr_register_filter_syscall_event())
   - Signal interception (Linux-only)
     (dr_register_signal_event())
   - Nudge received - see \ref sec_comm
     (dr_register_nudge_event())

 Typically, a client will register for the desired events at
 initialization in its dr_client_main() routine.  DynamoRIO then calls the
 registered functions at the appropriate times.  Each event has a
 specific registration routine (e.g., dr_register_thread_init_event(): see
 the names in parentheses in the list above)
 and an associated unregistration routine.  The header file dr_events.h
 contains the declarations for all registration and unregistration
 routines.

 When attaching to an already running process, dr_client_main() is called
 in the initially taken over thread before the other threads are taken over.
 Thus, it is not the best place to take a snapshot of global state such
 as the address space.  The post-attach event (dr_register_post_attach_event())
 is provided as a point where all the other threads have been suspended but
 have not yet started executing under instrumentation: the best point to
 take a snapshot and avoid a gap between it and monitored events post-takeover.

 Note that clients are allowed to register multiple callbacks for the
 same event.  DynamoRIO also supports mutiple clients, each of which
 can register for the same event.  In this case, DynamoRIO sequences
 event callbacks in reverse order of when they were registered.  In
 other words, the first registered callback receives event notification
 last.  This scheme gives priority to a callback registered earlier,
 since it can override or modify the actions of clients registered
 later.  Note that DynamoRIO calls each client's dr_client_main() routine
 according to the client's priority (see \ref multi_client and
 dr_register_client() in the deployment API).

 Systems registering multiple callbacks for a single event should be
 aware that client modifications are visible in subsequent callbacks.
 DynamoRIO makes no attempt to mitigate interference among callback
 functions.  It is the responsibility of a client to ensure
 compatibility among its callback functions and the callback functions
 of other clients.

 Clients can also unregister a callback using the appropriate
 unregister routine (see dr_events.h).  While unusual, it is possible for
 one callback routine to unregister another.  In this case, DynamoRIO
 still calls routines that were registered before the event.
 Unregistration takes effect before the next event.

 On Linux, an exec (SYS_execve) does NOT result in an exit event, but it
 WILL result in the client library being reloaded and its dr_client_main() routine
 being called again.  The system call events can be used for notification of
 SYS_execve.

 ***************************************************************************
 \section sec_utils Common Utilities

 DynamoRIO provides clients with a powerful library of utilities for
 custom runtime code transformations.  The interface includes explicit
 support for creating \e transparent clients.  See the section on
 \ref transparency for a full discussion of the importance of remaining
 transparent when operating in the same process as the application.
 DynamoRIO provides common resources clients can use to avoid reliance on
 shared libraries that may be in use by the application.  The client should
 only use external resources through DynamoRIO's own API, through
 DynamoRIO Extensions (see \ref sec_extensions), through direct
 system calls, or via an external agent in a separate process that
 communicates with the client (see \ref sec_comm).  Third-party libraries
 can be used if they are linked statically or loaded privately and there is
 no possibility of global resource conflicts (e.g., a third-party library's
 memory allocation must be wrapped): see \ref sec_extlibs for more details.

 DynamoRIO's API provides:

  - Memory allocation: both thread-private (faster as it incurs no
    synchronization costs) and thread-shared
  - Thread-local storage
  - Thread-local stack separate from the application stack
  - Simple mutexes
  - File creation, reading, and writing
  - Address space querying
  - Application module iterator
  - Processor feature identification
  - Extra thread creation
  - Symbol lookup (currently Windows-only)
  - Auxiliary library loading

 See dr_tools.h and dr_proc.h for specifics of each routine.

 Another class of utilities provided by DynamoRIO are structures and
 routines for decoding, encoding, and manipulating IA-32, AMD64, ARM, and AArch64
 instructions. These are described in \ref sec_IR.

 \anchor subsec_forwards
 In addition, on Windows, DynamoRIO provides a number
 of utility functions that it fowards to a core Windows system library
 that we believe to be safe for clients to use:

  - wcstoul
  - wcstombs
  - wcstol
  - wcsstr
  - wcsspn
  - wcsrchr
  - wcspbrk
  - wcsncpy
  - wcsncmp
  - wcsncat
  - wcslen
  - wcscspn
  - wcscpy
  - wcscmp
  - wcschr
  - wcscat
  - towupper
  - towlower
  - toupper
  - tolower
  - tan
  - strtoul
  - strtol
  - strstr
  - strspn
  - strrchr
  - strpbrk
  - strncpy
  - strncmp
  - strncat
  - strlen
  - strcspn
  - strcmp
  - strchr
  - sscanf
  - sqrt
  - sprintf
  - sin
  - qsort
  - pow
  - memset
  - memmove
  - memcpy
  - memcmp
  - memchr
  - mbstowcs
  - log
  - labs
  - isxdigit
  - iswxdigit
  - iswspace
  - iswlower
  - iswdigit
  - iswctype
  - iswalpha
  - isupper
  - isspace
  - ispunct
  - isprint
  - islower
  - isgraph
  - isdigit
  - iscntrl
  - isalpha
  - isalnum
  - floor
  - fabs
  - cos
  - ceil
  - atol
  - atoi
  - atan
  - abs
  - _wtol
  - _wtoi64
  - _wtoi
  - _wcsupr
  - _wcsnicmp
  - _wcslwr
  - _wcsicmp
  - _vsnprintf
  - _ultow
  - _ultoa
  - _ui64toa
  - _toupper
  - _tolower
  - _strupr
  - _strnicmp
  - _strlwr
  - _stricmp
  - _strcmpi
  - _snwprintf
  - _snprintf
  - _memicmp
  - _memccpy
  - _ltow
  - _ltoa
  - _itow
  - _itoa
  - _i64tow
  - _i64toa
  - _ftol
  - _fltused
  - _chkstk
  - _aullshr
  - _aullrem
  - _aulldiv
  - _atoi64
  - _allshr
  - _allshl
  - _allrem
  - _allmul
  - _alldiv
  - __toascii
  - __iscsymf
  - __iscsym
  - __isascii

 In general, these routines match their standard C library counterparts.  However, be
 warned that some of these may be more limited.  In particular, _vsnprintf
 and _snprintf do not support floating-point values.  DynamoRIO provides
 its own dr_snprintf() that does support floating-point values, but does
 not support printing wide characters.  When printing floating-point values
 be sure to \ref sec_trans_floating_point
 "save the application's floating point state"
 so as to avoid corrupting it.

 ***************************************************************************
 \section sec_64bit_reach 64-Bit Reachability

 To simplify reachability in a 64-bit address space, DynamoRIO guarantees
 that all of its code caches are located within a single 2GB memory region.
 It also places all client memory allocated through dr_thread_alloc(),
 dr_global_alloc(), dr_nonheap_alloc(), or dr_custom_alloc() with
 #DR_ALLOC_CACHE_REACHABLE in the same region.

 DynamoRIO loads client libraries and Extensions (but not copies of system
 libraries) within 32-bit reachability of its code caches.  Typically, the
 code cache region is located in the low 4GB of the address space; thus, to
 avoid relocations at client library load time, it is recommended to set a
 preferred client library base in the low 4GB.  The \ref op_reachable_client
 "-reachable_client runtime option" can be used to remove this guarantee.
 The option is automatically turned off when DynamoRIO and clients are
 statically linked with the application: for such static usage, 32-bit
 reachability to the code cache by clients is not guaranteed. It is also
 disabled on macOS, where the lack of a private loader means we cannot
 control where the client library gets loaded.

 The net result is that any static data or code in a client library, or any
 data allocated using DynamoRIO's API routines (except dr_raw_mem_alloc() or
 dr_custom_alloc()), is guaranteed to be directly reachable from code cache
 code.  However, memory allocated through system libraries (including
 malloc, operator new, and HeapAlloc), as well as DynamoRIO's own
 internally-used heap memory, is *not* guaranteed to be reachable: only
 memory directly allocated via DynamoRIO's API.  The \ref op_reachable_heap
 "-reachable_heap runtime option" can be used to guarantee that all memory
 is reachable, at the risk of running out of memory due to the smaller space
 of available memory.

 To make more space available for the code caches when running larger
 applications, or for clients that use a lot of heap memory that is not
 directly referenced from the cache, we recommend that dr_custom_alloc() be
 called to obtain memory that is not guaranteed to be reachable from the
 code cache (by not passing #DR_ALLOC_CACHE_REACHABLE).  This frees up space
 in the reachable region.

 When inserting calls, dr_insert_call() and dr_insert_clean_call() assume
 that the call is destined for encoding into the code cache-reachable memory
 region, when determining whether a direct or indirect call is needed.
 An indirect call will clobber r11.  Use dr_insert_clean_call_ex()
 with #DR_CLEANCALL_INDIRECT to ensure reachability when encoding to
 a location other than DR's regular code region, or when a clean call is not
 needed, dr_insert_call_ex() takes in a target encode location for more
 flexible determination of direct versus indirect.

 DynamoRIO does not guarantee that any of its memory is allocated in the
 lower 4GB of the address space.  However, it provides several features to
 make it easier to reference addresses absolutely:

  - For directly referencing a global variable \p var in a client library, the
    client can create an operand with the address \p &var and it will
    auto-magically turn into a pc-relative addressing mode.
    OPND_CREATE_ABSMEM() directly creates a pc-relative operand, while
    opnd_create_abs_addr() will convert to a pc-relative operand when an
    absolute reference will not encode.  An opnd_create_rel_addr() operand
    will also convert to an absolute reference when that will reach but a
    pc-relative reference will not.

  - When using an address as an immediate, use the routines
    instrlist_insert_mov_immed_ptrsz() or
    instrlist_insert_push_immed_ptrsz() to conveniently insert either one or
    two instructions depending on whether the address is in the lower 4GB or
    not.

  - When using an #instr_t pointer as an immediate, use the routines
    instrlist_insert_mov_instr_addr() or instrlist_insert_push_instr_addr()
    to conveniently insert either one or two instructions depending on
    whether the resulting #instr_t encoded address is in the lower 4GB or
    not.

 ***************************************************************************
 \section sec_utf8 String Encoding

 All strings in the DynamoRIO API, whether input or output parameters, are
 encoded as UTF-8.  DynamoRIO will internally convert to UTF-16 when
 interacting with the Windows kernel.  A client can use #dr_snprintf() or
 #dr_snwprintf() with the \p S format code to convert between UTF-8 and
 UTF-16 on its own.  (The _snprintf() function forwarded to ntdll does not
 perform that conversion.)


 ***************************************************************************
 \section sec_extensions DynamoRIO Extension Libraries

 DynamoRIO supports extending the API presented to clients through
 separate libraries called DynamoRIO Extensions.  Extensions are meant to
 include features that may be too costly to make available by default or
 features contributed by third parties whose licensing requires using a
 separate library.  Extensions can be either static libraries linked with
 clients at build time or dynamic libraries loaded at runtime.  A private
 loader is used to load dynamic Extensions.

 Current Extensions provide symbol access and container data structures.
 Each Extension has its own documentation and has its functions and data
 structures documented separately from the main API.
 See the full list of Extensions here: \ref page_ext.

 Be aware that some of the DynamoRIO Extensions have LGPL licenses instead
 of the BSD license of the rest of DynamoRIO.  Such Extensions are built as
 shared libraries, have their own license.txt files, and clearly identify
 their license in their documentation.  (We also provide static versions of
 such libraries, but take care in using them that their LGPL licenses match
 your requirements.)

 ***************************************************************************
 \section sec_extlibs Using External Libraries

 Clients are free to use external libraries as long as those libraries do
 not use any global user-mode resources that would interfere with the
 running application, and as long as no alertable system calls are invoked
 on Windows (see \ref sec_alertable).  While most non-graphical
 non-alertable Windows API routines are supported, native threading libraries
 such as \p libpthread.so on Linux are known to sometimes cause problems.

 Currently we provide a private loader for both Windows and Linux which
 supplies support for client external library use.
 Clients must either link statically to all libraries or load them using
 our private loader, which will happen automatically for shared libraries
 loaded in a typical manner.
 With private loading, the client
 uses a separate copy of each library from any copy used by the application.
 This helps to prevent re-entrancy problems (see \ref sec_trans_resource).
 Even with this separation, if these libraries use global resources there
 can still be conflicts.  Our private loader redirects heap
 allocation in the main process heap to instead use DynamoRIO's internal
 heap.  The loader also attempts to isolate other global resource usage and
 global callbacks.  Please file reports on any transparency problems
 observed when using the private loader.

 If DynamoRIO and clients are statically linked into the application,
 the private loader is not available and third-party library usage is
 generally no longer safe, unless that usage only happens during
 process-wide initialization (see \ref sec_static_DR).

 By default, all Windows clients link with libc.  To instead
 use the libc subset of routines forwarded from the DynamoRIO library to \p
 ntdll.dll (which keeps clients more lightweight and is usually sufficient
 for most C code), set this variable prior to invoking
 configure_DynamoRIO_client():

 \code
   set(DynamoRIO_USE_LIBC OFF)
 \endcode

 C++ clients and standalone clients link with libc by default.

 **************************************************
 \subsection sec_alertable Avoid Alertable System Calls

 On Windows, DynamoRIO does not support a client (or a library used by a
 client) making alertable system calls.  These are system calls that can be
 interrupted for delivery of callbacks or asynchronous procedure calls.  At
 the Windows API layer, they include many graphical routines, any Wait
 function invoked with \p alertable=TRUE (e.g., WaitForSingleObjectEx or
 WaitForMultipleObjectsEx), any Windows message queue function (GetMessage,
 SendMessage, ReplyMessage), and asynchronous i/o.  In general, avoiding
 graphical, windowing, or asynchronous i/o library or system calls is
 advisable.  DynamoRIO does not guarantee correct execution when a callback
 arrives during client code execution.

 **************************************************
 \subsection sec_rpath DynamoRIO Library Search Paths

 DynamoRIO's loader searches for libraries in approximately the same manner
 as the system loader.  It also has support for automatically locating
 Extension libraries that are packaged in the usual place in the DynamoRIO
 file hierarchy.

 DynamoRIO supports setting DT_RPATH for ELF clients, via setting the
 DynamoRIO_RPATH variable to ON prior to invoking
 configure_DynamoRIO_client().  On Windows, setting that variable will
 create a "<client_basename>.drpath" text file that contains a list of
 paths.  At runtime, DynamoRIO's loader will parse this file and add each
 newline-separated path to its list of search paths.  This file is honored
 on Linux as well, though it is not automatically created there.  This
 allows clients a cross-platform mechanism to use third-party libraries in
 locations of their choosing.

 **************************************************
 \subsection subsec_avoid_redir Deliberately Invoking Application Routines

 Sometimes, a client wishes to invoke system library routines with the
 application context, rather than having them redirected and isolated by
 DynamoRIO.  This can be accomplished using dynamic binding rather than
 static: dynamically looking up each desired library routine via DR's own
 routines (such as dr_get_proc_address()).  (Using \p GetProcAddress will not
 work for this purpose as the result will be redirected.)

 **************************************************
 \subsection subsec_no_loader When Private Loader is Disabled

 On Linux, if the private loader is deliberately disabled, ld provides the -wrap
 option, which allows us to override the C library's memory heap allocation
 routines with our own.  For convenience, DynamoRIO exports
 __wrap_malloc(), __wrap_realloc(), and __wrap_free() for this purpose.
 These routines behave like their C library counterparts, but operate on
 DynamoRIO's global memory pool.  Use the -Xlinker flag with gcc to replace
 the libc routines with DynamoRIO's _wrap routines, e.g.,

 \code
   gcc -Xlinker -wrap=malloc -Xlinker -wrap=realloc -Xlinker -wrap=free -Xlinker -wrap=strdup ...
 \endcode

 The ability to override the memory allocation routines makes it
 convenient to develop C++ clients that use the \em new and \em delete
 operators (as long as those operators are implemented using malloc and
 free).  In particular, heap allocation is required to use the C++
 Standard Template Library containers. When developing a C++ client,
 we recommend linking statically to the C++ runtime library if not using
 the provided private loader.

 On Linux, this is most easily accomplished by specifying the path to the
 static version of the library on the gcc command line.  gcc's
 -print-file-name option is useful for discovering this path, e.g.,

 \code
   g++ -print-file-name=libstdc++.a
 \endcode

 A full gcc command line for building a C++ client when disabling the
 private loader (which is not the default) might look something like this
 (note that this requires static versions of the standard libraries that
 were built PIC, which is not the case in modern binary distributions and
 often requires building from source):

 \code
 g++ -o my-client.so -I<header dir> \
   -fPIC -shared -nodefaultlibs \
   -Xlinker -wrap=malloc -Xlinker -wrap=realloc -Xlinker -wrap=free \
   `g++ -print-file-name=libstdc++.a` \
   `g++ -print-file-name=libgcc.a` \
   `g++ -print-file-name=libgcc_eh.a` \
   my-client.cpp
 \endcode

 **************************************************
 \subsection subsec_cpp C++ Clients

 The 3.0 version of DynamoRIO added experimental full support for C++
 clients using the STL and other libraries.

 On Windows, when using the Microsoft Visual C++ compiler, we recommend
 using the \p /MT compiler flag to request a static C library.  The client
 will still use the \p kernel32.dll library but our private loader will load
 a separate copy of that library and redirect heap allocation automatically.
 Our private loader does not yet support locating SxS (side-by-side)
 libraries, so using \p /MD will most likely not work unless using a
 version of the Visual Studio compiler other than 2005 or 2008.

 We do not recommend that a client or its libraries invoke their own system
 calls as this bypasses DynamoRIO's monitoring of changes to the process
 address space and changes to threads or control flow.  Such system calls
 will also not work properly on Linux when using sysenter on some systems.
 If you see an assert to that effect in debug build on Linux, try the \ref
 op_sysenter "-sysenter_is_int80" option.

 ***************************************************************************
 \section sec_comm Communication

 Due to transparency limitations (see \ref transparency),
 DynamoRIO can only support certain communication channels in and out of the
 target application process.  These include:

  - DynamoRIO deployment control and runtime options: see \ref page_deploy
    and \ref sec_options.  In particular, the deployment API allows users to
    pass up-front runtime information to the client.
  - Nudges: Since polling requires extra threads, and DynamoRIO tries not
    to create permanent extra threads (see \ref sec_trans_thread
    "Thread Transparency"), a mechanism called \e nudges are the preferred mechanism
    for pushing data into the process. Nudges are used to notify DynamoRIO
    that it needs to re-read its options, or perform some other action.
    DynamoRIO also provides a custom nudge event that can be used by
    clients. See dr_nudge_process() and dr_register_nudge_event().
  - Files can be used to send data out.  An external process can wait on
    the file.
  - Shared memory can be used for bi-directional communication.  For an
    example of this on Windows, see the stats sample (see \ref sec_drstats).

 ***************************************************************************
 \section sec_annotations Annotations

 DynamoRIO provides a binary annotation mechanism which allows the target application to
 communicate directly with the DynamoRIO client, or with DynamoRIO itself. A binary
 annotation is generated from a macro that can be manually inserted into the source code of
 the target program. When compiled, the resulting sequence of assembly instructions has no
 effect on native execution (i.e., it is a nop, or resolves to a static default value),
 but during execution under DynamoRIO, each annotation is detected and transformed into a
 function call to a set of registered handlers. Currently DynamoRIO provides 2 simple
 annotations:

  - <b>DYNAMORIO_ANNOTATE_RUNNING_ON_DYNAMORIO()</b>
    Indicates by its return value whether the target app is running under DynamoRIO,

  - <b>DYNAMORIO_ANNOTATE_LOG(format, ...)</b>
    Writes a message to the DynamoRIO log, when the target app is running under DynamoRIO
    and logging is enabled.

 An annotation may be declared void, as in DYNAMORIO_ANNOTATE_LOG(), or it may have a
 return value, as in the boolean DYNAMORIO_ANNOTATE_RUNNING_ON_DYNAMORIO(). The return
 value can be used in a branch predicate, such that some of the target app's code only
 executes under DynamoRIO, or it can be used for in-process communication to obtain data
 from DynamoRIO or its client that is only available during binary translation.

 \subsection subsec_annotate_app Annotating an Application

 Adding an annotation to a target application is primarily a simple matter of invoking the
 annotation macro at the desired program location. The macros are declared in the C header
 file <b>include/annotations/dr_annotations.h</b>, and each macro operates syntactically
 like a function call. An annotation having a return value can be used as an expression.
 DynamoRIO provides a module which defines the annotations, and clients may also provide
 modules containing custom annotations. Each compilation unit that uses annotations must be
 statically linked with the corresponding annotation module(s). For projects using cmake, a
 convenience function \b use_DynamoRIO_annotations(target, srcs) will configure the
 specified \b srcs to be linked with the annotation module.

 \subsection subsec_instr_annotations Instrumenting Annotations

 When DynamoRIO encounters an annotation in the target app, it instruments that program
 location in one of two ways:

  - Return value substitution: DynamoRIO replaces the annotation with a constant return
    value (this instrumentation is only valid for annotations having a return value),

  - Handler invocation: DynamoRIO replaces the annotation with a call to each handler that
    is currently registered for the annotation. For annotations having a return value, the
    return value of the last registered handler will be the value to take effect at the
    target program location.

 Annotation handlers are registered using API functions \b dr_annotation_register_call()
 and \b dr_annotation_register_return(). Note that changes to handler registration will
 have no effect on annotations that have already been translated by DynamoRIO into the
 code cache (until the annotated basic blocks are removed from the cache and retranslated).
 The annotation instrumentation invokes the handlers using a separate clean call for each
 handler. The return value for an annotation can be set within a handler function using
 the API function \b dr_annotation_set_return_value().

 \subsection subsec_create_annotations Creating Custom Annotations

 DynamoRIO client developers may wish to create new annotations to facilitate
 client-specific communication with the target application. For example, a client that
 inspects memory usage may have false positives for variables in the target app that are
 never referenced after initialization. The \ref API_tutorial_annotation1 walks
 through the process of creating a new annotation that allows target application
 developers to explicitly mark any variable as defined.

 ****************************************************************************
 ****************************************************************************
 */
 /* It's good to use separate C comments: we've hit some sort of doxygen
  * internal buffering error before if one comment gets too long.
  */
 /* The name "Disassembly Library" is hardcoded in CMake_rundoxygen.cmake. */
 /**
 ***************************************************************************
 ***************************************************************************
 \page page_standalone Disassembly Library

 DynamoRIO can be used as a standalone library for IA-32/AMD64/ARM/AArch64
 disassembly, decoding, encoding, and general instruction manipulation,
 independently of controlling a target application.  When used in this way,
 all aspects of DynamoRIO's API routines that apply to instrumentation or
 application control are not applicable; however, the full, rich instruction
 set API is enabled.  For further information on the instruction set API see
 the following sections of the Code Manipulation API:

  - \ref sec_IR
  - \ref sec_decode

 \section sec_standalone Using DynamoRIO as a Standalone Library

 DynamoRIO can be used as a regular third-party library for a standalone
 application (instead of a client that operates on a target program).  Two
 options are provided: using the regular DynamoRIO shared library, or using
 a special static library \p drdecode.  The shared library provides not only
 decoding routines but also cross-platform resources such as file
 manipulation.

 When using the DynamoRIO shared library, this initialization routine must
 be called prior to using any API routines:

 \code dr_standalone_init() \endcode

 This routine returns a dummy context that can be passed to API routines.

 When using \p drdecode, the special context \p GLOBAL_DCONTEXT should be
 used whenever a context is required.  The \p drdecode library does not
 require initialization.

 Neither the context returned by dr_standalone_init() nor \p
 GLOBAL_DCONTEXT can be used as the drcontext for a thread running under
 DynamoRIO control!  It is only for standalone programs that wish to use
 DynamoRIO as a library of routines for instruction manipulation or
 other purposes.

 In standalone mode, the dr_set_isa_mode() routine operates globally rather
 than per-thread.

 Runtime options are ignored in standalone mode.  Disassembly style can be
 controlled via disassemble_set_syntax().  The processor to use will not be
 automatically set and will be assumed to be \p VENDOR_INTEL.  Use
 proc_set_vendor() to set to \p VENDOR_AMD instead.

 In standalone mode, there are no 32-bit-displacement reachability
 guarantees regarding DynamoRIO's heap.

 Some DynamoRIO API routines are not supported in standalone mode.  These
 include all event registration routines, module iteration,
 dr_memory_protect(), dr_messagebox(), dr_get_current_drcontext(),
 dr_get_thread_id(), tls fields, dr_thread_yield(), dr_sleep(), client
 threads, suspending threads, itimers, register spilling and restoring,
 dr_redirect_execution(), try/except, and code cache routines (e.g.,
 dr_delete_fragment() or flush routines).

 When using the \p drdecode library, no API routines other than those
 involving decoding, encoding, disassembling, instruction lists,
 instructions, or operands are supported.  The various compute_address
 routines can be used by manually filling in \p dr_mcontext_t, although far
 memory references will have their segment base ignored.  Other API routines
 are simply not present in the static library.  There is no separate set of
 headers for use with \p drdecode.

 When using DynamoRIO's CMake support, use the configure_DynamoRIO_decoder()
 function to set up include directories and to link with \p drdecode.  The
 next section describes how to link with the DynamoRIO shared library.


 \section sec_standalone_shared DynamoRIO Shared Library Issues

 Since the DynamoRIO library on Windows includes or forwards
 implementations of certain C library routines (see
 \ref subsec_forwards "C library utilities"), standalone applications
 linking to both DynamoRIO and the C library may experience linker errors
 when building and floating point problems when running.  To avoid these
 problems, explicitly list the C runtime library on the command line:

 \code /link /nodefaultlib libcmt.lib dynamorio.lib \endcode

 DynamoRIO writes to stderr and stdout using raw system calls, which can
 interfere with the buffering of library routines.  When mixing use of
 printf or fprintf with DynamoRIO output (including not only dr_printf()
 and dr_fprintf() but also passing STDOUT or STDERR to routines like
 disassemble()), you may need to flush between library printing and
 DynamoRIO printing (e.g., using fflush(stdout)) to ensure that the library
 output is visible.

 The binary tracedump reader (\ref sec_ex8) is an example of use of
 DynamoRIO as a standalone library.

 When building an application that uses DynamoRIO as a standalone library,
 follow the steps for \ref page_build_client to include the header files and link
 with the DynamoRIO library, but omit the linker flags requesting no
 standard libraries or startup files.  DynamoRIO's CMake support does this
 automatically via the configure_DynamoRIO_standalone() function.

 */