|
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292 |
- <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
- <html>
- <!-- Copyright (C) 1988-2020 Free Software Foundation, Inc.
-
- Permission is granted to copy, distribute and/or modify this document
- under the terms of the GNU Free Documentation License, Version 1.3 or
- any later version published by the Free Software Foundation; with the
- Invariant Sections being "Funding Free Software", the Front-Cover
- Texts being (a) (see below), and with the Back-Cover Texts being (b)
- (see below). A copy of the license is included in the section entitled
- "GNU Free Documentation License".
-
- (a) The FSF's Front-Cover Text is:
-
- A GNU Manual
-
- (b) The FSF's Back-Cover Text is:
-
- You have freedom to copy and modify this GNU Manual, like GNU
- software. Copies published by the Free Software Foundation raise
- funds for GNU development. -->
- <!-- Created by GNU Texinfo 6.5, http://www.gnu.org/software/texinfo/ -->
- <head>
- <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
- <title>IPA (GNU Compiler Collection (GCC) Internals)</title>
-
- <meta name="description" content="IPA (GNU Compiler Collection (GCC) Internals)">
- <meta name="keywords" content="IPA (GNU Compiler Collection (GCC) Internals)">
- <meta name="resource-type" content="document">
- <meta name="distribution" content="global">
- <meta name="Generator" content="makeinfo">
- <link href="index.html#Top" rel="start" title="Top">
- <link href="Option-Index.html#Option-Index" rel="index" title="Option Index">
- <link href="index.html#SEC_Contents" rel="contents" title="Table of Contents">
- <link href="LTO.html#LTO" rel="up" title="LTO">
- <link href="WHOPR.html#WHOPR" rel="next" title="WHOPR">
- <link href="LTO-object-file-layout.html#LTO-object-file-layout" rel="prev" title="LTO object file layout">
- <style type="text/css">
- <!--
- a.summary-letter {text-decoration: none}
- blockquote.indentedblock {margin-right: 0em}
- blockquote.smallindentedblock {margin-right: 0em; font-size: smaller}
- blockquote.smallquotation {font-size: smaller}
- div.display {margin-left: 3.2em}
- div.example {margin-left: 3.2em}
- div.lisp {margin-left: 3.2em}
- div.smalldisplay {margin-left: 3.2em}
- div.smallexample {margin-left: 3.2em}
- div.smalllisp {margin-left: 3.2em}
- kbd {font-style: oblique}
- pre.display {font-family: inherit}
- pre.format {font-family: inherit}
- pre.menu-comment {font-family: serif}
- pre.menu-preformatted {font-family: serif}
- pre.smalldisplay {font-family: inherit; font-size: smaller}
- pre.smallexample {font-size: smaller}
- pre.smallformat {font-family: inherit; font-size: smaller}
- pre.smalllisp {font-size: smaller}
- span.nolinebreak {white-space: nowrap}
- span.roman {font-family: initial; font-weight: normal}
- span.sansserif {font-family: sans-serif; font-weight: normal}
- ul.no-bullet {list-style: none}
- -->
- </style>
-
-
- </head>
-
- <body lang="en">
- <a name="IPA"></a>
- <div class="header">
- <p>
- Next: <a href="WHOPR.html#WHOPR" accesskey="n" rel="next">WHOPR</a>, Previous: <a href="LTO-object-file-layout.html#LTO-object-file-layout" accesskey="p" rel="prev">LTO object file layout</a>, Up: <a href="LTO.html#LTO" accesskey="u" rel="up">LTO</a> [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Option-Index.html#Option-Index" title="Index" rel="index">Index</a>]</p>
- </div>
- <hr>
- <a name="Using-summary-information-in-IPA-passes"></a>
- <h3 class="section">25.3 Using summary information in IPA passes</h3>
-
- <p>Programs are represented internally as a <em>callgraph</em> (a
- multi-graph where nodes are functions and edges are call sites)
- and a <em>varpool</em> (a list of static and external variables in
- the program).
- </p>
- <p>The inter-procedural optimization is organized as a sequence of
- individual passes, which operate on the callgraph and the
- varpool. To make the implementation of WHOPR possible, every
- inter-procedural optimization pass is split into several stages
- that are executed at different times during WHOPR compilation:
- </p>
- <ul>
- <li> LGEN time
- <ol>
- <li> <em>Generate summary</em> (<code>generate_summary</code> in
- <code>struct ipa_opt_pass_d</code>). This stage analyzes every function
- body and variable initializer is examined and stores relevant
- information into a pass-specific data structure.
-
- </li><li> <em>Write summary</em> (<code>write_summary</code> in
- <code>struct ipa_opt_pass_d</code>). This stage writes all the
- pass-specific information generated by <code>generate_summary</code>.
- Summaries go into their own <code>LTO_section_*</code> sections that
- have to be declared in <samp>lto-streamer.h</samp>:<code>enum
- lto_section_type</code>. A new section is created by calling
- <code>create_output_block</code> and data can be written using the
- <code>lto_output_*</code> routines.
- </li></ol>
-
- </li><li> WPA time
- <ol>
- <li> <em>Read summary</em> (<code>read_summary</code> in
- <code>struct ipa_opt_pass_d</code>). This stage reads all the
- pass-specific information in exactly the same order that it was
- written by <code>write_summary</code>.
-
- </li><li> <em>Execute</em> (<code>execute</code> in <code>struct
- opt_pass</code>). This performs inter-procedural propagation. This
- must be done without actual access to the individual function
- bodies or variable initializers. Typically, this results in a
- transitive closure operation over the summary information of all
- the nodes in the callgraph.
-
- </li><li> <em>Write optimization summary</em>
- (<code>write_optimization_summary</code> in <code>struct
- ipa_opt_pass_d</code>). This writes the result of the inter-procedural
- propagation into the object file. This can use the same data
- structures and helper routines used in <code>write_summary</code>.
- </li></ol>
-
- </li><li> LTRANS time
- <ol>
- <li> <em>Read optimization summary</em>
- (<code>read_optimization_summary</code> in <code>struct
- ipa_opt_pass_d</code>). The counterpart to
- <code>write_optimization_summary</code>. This reads the interprocedural
- optimization decisions in exactly the same format emitted by
- <code>write_optimization_summary</code>.
-
- </li><li> <em>Transform</em> (<code>function_transform</code> and
- <code>variable_transform</code> in <code>struct ipa_opt_pass_d</code>).
- The actual function bodies and variable initializers are updated
- based on the information passed down from the <em>Execute</em> stage.
- </li></ol>
- </li></ul>
-
- <p>The implementation of the inter-procedural passes are shared
- between LTO, WHOPR and classic non-LTO compilation.
- </p>
- <ul>
- <li> During the traditional file-by-file mode every pass executes its
- own <em>Generate summary</em>, <em>Execute</em>, and <em>Transform</em>
- stages within the single execution context of the compiler.
-
- </li><li> In LTO compilation mode, every pass uses <em>Generate
- summary</em> and <em>Write summary</em> stages at compilation time,
- while the <em>Read summary</em>, <em>Execute</em>, and
- <em>Transform</em> stages are executed at link time.
-
- </li><li> In WHOPR mode all stages are used.
- </li></ul>
-
- <p>To simplify development, the GCC pass manager differentiates
- between normal inter-procedural passes (see <a href="Regular-IPA-passes.html#Regular-IPA-passes">Regular IPA passes</a>),
- small inter-procedural passes (see <a href="Small-IPA-passes.html#Small-IPA-passes">Small IPA passes</a>)
- and late inter-procedural passes (see <a href="Late-IPA-passes.html#Late-IPA-passes">Late IPA passes</a>).
- A small or late IPA pass (<code>SIMPLE_IPA_PASS</code>) does
- everything at once and thus cannot be executed during WPA in
- WHOPR mode. It defines only the <em>Execute</em> stage and during
- this stage it accesses and modifies the function bodies. Such
- passes are useful for optimization at LGEN or LTRANS time and are
- used, for example, to implement early optimization before writing
- object files. The simple inter-procedural passes can also be used
- for easier prototyping and development of a new inter-procedural
- pass.
- </p>
-
- <a name="Virtual-clones"></a>
- <h4 class="subsection">25.3.1 Virtual clones</h4>
-
- <p>One of the main challenges of introducing the WHOPR compilation
- mode was addressing the interactions between optimization passes.
- In LTO compilation mode, the passes are executed in a sequence,
- each of which consists of analysis (or <em>Generate summary</em>),
- propagation (or <em>Execute</em>) and <em>Transform</em> stages.
- Once the work of one pass is finished, the next pass sees the
- updated program representation and can execute. This makes the
- individual passes dependent on each other.
- </p>
- <p>In WHOPR mode all passes first execute their <em>Generate
- summary</em> stage. Then summary writing marks the end of the LGEN
- stage. At WPA time,
- the summaries are read back into memory and all passes run the
- <em>Execute</em> stage. Optimization summaries are streamed and
- sent to LTRANS, where all the passes execute the <em>Transform</em>
- stage.
- </p>
- <p>Most optimization passes split naturally into analysis,
- propagation and transformation stages. But some do not. The
- main problem arises when one pass performs changes and the
- following pass gets confused by seeing different callgraphs
- between the <em>Transform</em> stage and the <em>Generate summary</em>
- or <em>Execute</em> stage. This means that the passes are required
- to communicate their decisions with each other.
- </p>
- <p>To facilitate this communication, the GCC callgraph
- infrastructure implements <em>virtual clones</em>, a method of
- representing the changes performed by the optimization passes in
- the callgraph without needing to update function bodies.
- </p>
- <p>A <em>virtual clone</em> in the callgraph is a function that has no
- associated body, just a description of how to create its body based
- on a different function (which itself may be a virtual clone).
- </p>
- <p>The description of function modifications includes adjustments to
- the function’s signature (which allows, for example, removing or
- adding function arguments), substitutions to perform on the
- function body, and, for inlined functions, a pointer to the
- function that it will be inlined into.
- </p>
- <p>It is also possible to redirect any edge of the callgraph from a
- function to its virtual clone. This implies updating of the call
- site to adjust for the new function signature.
- </p>
- <p>Most of the transformations performed by inter-procedural
- optimizations can be represented via virtual clones. For
- instance, a constant propagation pass can produce a virtual clone
- of the function which replaces one of its arguments by a
- constant. The inliner can represent its decisions by producing a
- clone of a function whose body will be later integrated into
- a given function.
- </p>
- <p>Using <em>virtual clones</em>, the program can be easily updated
- during the <em>Execute</em> stage, solving most of pass interactions
- problems that would otherwise occur during <em>Transform</em>.
- </p>
- <p>Virtual clones are later materialized in the LTRANS stage and
- turned into real functions. Passes executed after the virtual
- clone were introduced also perform their <em>Transform</em> stage
- on new functions, so for a pass there is no significant
- difference between operating on a real function or a virtual
- clone introduced before its <em>Execute</em> stage.
- </p>
- <p>Optimization passes then work on virtual clones introduced before
- their <em>Execute</em> stage as if they were real functions. The
- only difference is that clones are not visible during the
- <em>Generate Summary</em> stage.
- </p>
- <p>To keep function summaries updated, the callgraph interface
- allows an optimizer to register a callback that is called every
- time a new clone is introduced as well as when the actual
- function or variable is generated or when a function or variable
- is removed. These hooks are registered in the <em>Generate
- summary</em> stage and allow the pass to keep its information intact
- until the <em>Execute</em> stage. The same hooks can also be
- registered during the <em>Execute</em> stage to keep the
- optimization summaries updated for the <em>Transform</em> stage.
- </p>
- <a name="IPA-references"></a>
- <h4 class="subsection">25.3.2 IPA references</h4>
-
- <p>GCC represents IPA references in the callgraph. For a function
- or variable <code>A</code>, the <em>IPA reference</em> is a list of all
- locations where the address of <code>A</code> is taken and, when
- <code>A</code> is a variable, a list of all direct stores and reads
- to/from <code>A</code>. References represent an oriented multi-graph on
- the union of nodes of the callgraph and the varpool. See
- <samp>ipa-reference.c</samp>:<code>ipa_reference_write_optimization_summary</code>
- and
- <samp>ipa-reference.c</samp>:<code>ipa_reference_read_optimization_summary</code>
- for details.
- </p>
- <a name="Jump-functions"></a>
- <h4 class="subsection">25.3.3 Jump functions</h4>
- <p>Suppose that an optimization pass sees a function <code>A</code> and it
- knows the values of (some of) its arguments. The <em>jump
- function</em> describes the value of a parameter of a given function
- call in function <code>A</code> based on this knowledge.
- </p>
- <p>Jump functions are used by several optimizations, such as the
- inter-procedural constant propagation pass and the
- devirtualization pass. The inliner also uses jump functions to
- perform inlining of callbacks.
- </p>
- <hr>
- <div class="header">
- <p>
- Next: <a href="WHOPR.html#WHOPR" accesskey="n" rel="next">WHOPR</a>, Previous: <a href="LTO-object-file-layout.html#LTO-object-file-layout" accesskey="p" rel="prev">LTO object file layout</a>, Up: <a href="LTO.html#LTO" accesskey="u" rel="up">LTO</a> [<a href="index.html#SEC_Contents" title="Table of contents" rel="contents">Contents</a>][<a href="Option-Index.html#Option-Index" title="Index" rel="index">Index</a>]</p>
- </div>
-
-
-
- </body>
- </html>
|