rust/src/librustc
Ariel Ben-Yehuda b74219964c improve the tyencode abbrev format
3% win on libcore

528828 liballoc-bb943c5a.rlib
1425126 liballoc_jemalloc-bb943c5a.rlib
10090 liballoc_system-bb943c5a.rlib
144904 libarena-bb943c5a.rlib
3773896 libcollections-bb943c5a.rlib
17075242 libcore-bb943c5a.rlib
195770 libflate-bb943c5a.rlib
234702 libfmt_macros-bb943c5a.rlib
536342 libgetopts-bb943c5a.rlib
212028 libgraphviz-bb943c5a.rlib
397068 liblibc-bb943c5a.rlib
185038 liblog-bb943c5a.rlib
680782 librand-bb943c5a.rlib
577574 librbml-bb943c5a.rlib
1381992 librustc_back-bb943c5a.rlib
37554736 librustc-bb943c5a.rlib
12826 librustc_bitflags-bb943c5a.rlib
2257392 librustc_borrowck-bb943c5a.rlib
533858 librustc_data_structures-bb943c5a.rlib
9338878 librustc_driver-bb943c5a.rlib
8960016 librustc_front-bb943c5a.rlib
1594212 librustc_lint-bb943c5a.rlib
79159342 librustc_llvm-bb943c5a.rlib
4590656 librustc_mir-bb943c5a.rlib
3529292 librustc_platform_intrinsics-bb943c5a.rlib
590688 librustc_privacy-bb943c5a.rlib
3084134 librustc_resolve-bb943c5a.rlib
14032890 librustc_trans-bb943c5a.rlib
11833852 librustc_typeck-bb943c5a.rlib
1641496 librustc_unicode-bb943c5a.rlib
15611582 librustdoc-bb943c5a.rlib
2693764 libserialize-bb943c5a.rlib
8266920 libstd-bb943c5a.rlib
29573790 libsyntax-bb943c5a.rlib
895484 libterm-bb943c5a.rlib
2015-10-01 22:10:09 +03:00
..
front/map Fill in some missing parts in the default HIR visitor 2015-09-29 00:23:54 +03:00
lint Fill in some missing parts in the default HIR visitor 2015-09-29 00:23:54 +03:00
metadata improve the tyencode abbrev format 2015-10-01 22:10:09 +03:00
middle improve the tyencode abbrev format 2015-10-01 22:10:09 +03:00
plugin Change to a multi-trait approach 2015-09-17 12:16:46 +12:00
session Auto merge of #28632 - alexcrichton:update-match-indices, r=Kimundi 2015-09-26 20:06:51 +00:00
util Use ast attributes every where (remove HIR attributes). 2015-09-16 10:57:06 +12:00
diagnostics.rs store the rustc version in metadata and check it 2015-09-29 21:27:08 +03:00
lib.rs move middle::ty and related modules to middle/ty/ 2015-09-14 10:56:13 +03:00
macros.rs std: Remove deprecated/unstable num functionality 2015-04-21 11:37:43 -07:00
README.md Use https URLs to refer to rust-lang.org where appropriate. 2015-08-09 14:28:46 -07:00

An informal guide to reading and working on the rustc compiler.

If you wish to expand on this document, or have a more experienced Rust contributor add anything else to it, please get in touch:

or file a bug:

https://github.com/rust-lang/rust/issues

Your concerns are probably the same as someone else's.

The crates of rustc

Rustc consists of a number of crates, including libsyntax, librustc, librustc_back, librustc_trans, and librustc_driver (the names and divisions are not set in stone and may change; in general, a finer-grained division of crates is preferable):

  • libsyntax contains those things concerned purely with syntax that is, the AST, parser, pretty-printer, lexer, macro expander, and utilities for traversing ASTs are in a separate crate called "syntax", whose files are in ./../libsyntax, where . is the current directory (that is, the parent directory of front/, middle/, back/, and so on).

  • librustc (the current directory) contains the high-level analysis passes, such as the type checker, borrow checker, and so forth. It is the heart of the compiler.

  • librustc_back contains some very low-level details that are specific to different LLVM targets and so forth.

  • librustc_trans contains the code to convert from Rust IR into LLVM IR, and then from LLVM IR into machine code, as well as the main driver that orchestrates all the other passes and various other bits of miscellany. In general it contains code that runs towards the end of the compilation process.

  • librustc_driver invokes the compiler from libsyntax, then the analysis phases from librustc, and finally the lowering and codegen passes from librustc_trans.

Roughly speaking the "order" of the three crates is as follows:

libsyntax -> librustc -> librustc_trans
|                                     |
+-----------------+-------------------+
                  |
          librustc_driver

Modules in the rustc crate

The rustc crate itself consists of the following submodules (mostly, but not entirely, in their own directories):

  • session: options and data that pertain to the compilation session as a whole
  • middle: middle-end: name resolution, typechecking, LLVM code generation
  • metadata: encoder and decoder for data required by separate compilation
  • plugin: infrastructure for compiler plugins
  • lint: infrastructure for compiler warnings
  • util: ubiquitous types and helper functions
  • lib: bindings to LLVM

The entry-point for the compiler is main() in the librustc_driver crate.

The 3 central data structures:

  1. ./../libsyntax/ast.rs defines the AST. The AST is treated as immutable after parsing, but it depends on mutable context data structures (mainly hash maps) to give it meaning.

    • Many though not all nodes within this data structure are wrapped in the type spanned<T>, meaning that the front-end has marked the input coordinates of that node. The member node is the data itself, the member span is the input location (file, line, column; both low and high).

    • Many other nodes within this data structure carry a def_id. These nodes represent the 'target' of some name reference elsewhere in the tree. When the AST is resolved, by middle/resolve.rs, all names wind up acquiring a def that they point to. So anything that can be pointed-to by a name winds up with a def_id.

  2. middle/ty.rs defines the datatype sty. This is the type that represents types after they have been resolved and normalized by the middle-end. The typeck phase converts every ast type to a ty::sty, and the latter is used to drive later phases of compilation. Most variants in the ast::ty tag have a corresponding variant in the ty::sty tag.

  3. ./../librustc_llvm/lib.rs defines the exported types ValueRef, TypeRef, BasicBlockRef, and several others. Each of these is an opaque pointer to an LLVM type, manipulated through the lib::llvm interface.

Control and information flow within the compiler:

  • main() in lib.rs assumes control on startup. Options are parsed, platform is detected, etc.

  • ./../libsyntax/parse/parser.rs parses the input files and produces an AST that represents the input crate.

  • Multiple middle-end passes (middle/resolve.rs, middle/typeck.rs) analyze the semantics of the resulting AST. Each pass generates new information about the AST and stores it in various environment data structures. The driver passes environments to each compiler pass that needs to refer to them.

  • Finally, the trans module in librustc_trans translates the Rust AST to LLVM bitcode in a type-directed way. When it's finished synthesizing LLVM values, rustc asks LLVM to write them out in some form (.bc, .o) and possibly run the system linker.