1//! Partitioning Codegen Units for Incremental Compilation
2//! ======================================================
3//!
4//! The task of this module is to take the complete set of monomorphizations of
5//! a crate and produce a set of codegen units from it, where a codegen unit
6//! is a named set of (mono-item, linkage) pairs. That is, this module
7//! decides which monomorphization appears in which codegen units with which
8//! linkage. The following paragraphs describe some of the background on the
9//! partitioning scheme.
10//!
11//! The most important opportunity for saving on compilation time with
12//! incremental compilation is to avoid re-codegenning and re-optimizing code.
13//! Since the unit of codegen and optimization for LLVM is "modules" or, how
14//! we call them "codegen units", the particulars of how much time can be saved
15//! by incremental compilation are tightly linked to how the output program is
16//! partitioned into these codegen units prior to passing it to LLVM --
17//! especially because we have to treat codegen units as opaque entities once
18//! they are created: There is no way for us to incrementally update an existing
19//! LLVM module and so we have to build any such module from scratch if it was
20//! affected by some change in the source code.
21//!
22//! From that point of view it would make sense to maximize the number of
23//! codegen units by, for example, putting each function into its own module.
24//! That way only those modules would have to be re-compiled that were actually
25//! affected by some change, minimizing the number of functions that could have
26//! been re-used but just happened to be located in a module that is
27//! re-compiled.
28//!
29//! However, since LLVM optimization does not work across module boundaries,
30//! using such a highly granular partitioning would lead to very slow runtime
31//! code since it would effectively prohibit inlining and other inter-procedure
32//! optimizations. We want to avoid that as much as possible.
33//!
34//! Thus we end up with a trade-off: The bigger the codegen units, the better
35//! LLVM's optimizer can do its work, but also the smaller the compilation time
36//! reduction we get from incremental compilation.
37//!
38//! Ideally, we would create a partitioning such that there are few big codegen
39//! units with few interdependencies between them. For now though, we use the
40//! following heuristic to determine the partitioning:
41//!
42//! - There are two codegen units for every source-level module:
43//! - One for "stable", that is non-generic, code
44//! - One for more "volatile" code, i.e., monomorphized instances of functions
45//! defined in that module
46//!
47//! In order to see why this heuristic makes sense, let's take a look at when a
48//! codegen unit can get invalidated:
49//!
50//! 1. The most straightforward case is when the BODY of a function or global
51//! changes. Then any codegen unit containing the code for that item has to be
52//! re-compiled. Note that this includes all codegen units where the function
53//! has been inlined.
54//!
55//! 2. The next case is when the SIGNATURE of a function or global changes. In
56//! this case, all codegen units containing a REFERENCE to that item have to be
57//! re-compiled. This is a superset of case 1.
58//!
59//! 3. The final and most subtle case is when a REFERENCE to a generic function
60//! is added or removed somewhere. Even though the definition of the function
61//! might be unchanged, a new REFERENCE might introduce a new monomorphized
62//! instance of this function which has to be placed and compiled somewhere.
63//! Conversely, when removing a REFERENCE, it might have been the last one with
64//! that particular set of generic arguments and thus we have to remove it.
65//!
66//! From the above we see that just using one codegen unit per source-level
67//! module is not such a good idea, since just adding a REFERENCE to some
68//! generic item somewhere else would invalidate everything within the module
69//! containing the generic item. The heuristic above reduces this detrimental
70//! side-effect of references a little by at least not touching the non-generic
71//! code of the module.
72//!
73//! A Note on Inlining
74//! ------------------
75//! As briefly mentioned above, in order for LLVM to be able to inline a
76//! function call, the body of the function has to be available in the LLVM
77//! module where the call is made. This has a few consequences for partitioning:
78//!
79//! - The partitioning algorithm has to take care of placing functions into all
80//! codegen units where they should be available for inlining. It also has to
81//! decide on the correct linkage for these functions.
82//!
83//! - The partitioning algorithm has to know which functions are likely to get
84//! inlined, so it can distribute function instantiations accordingly. Since
85//! there is no way of knowing for sure which functions LLVM will decide to
86//! inline in the end, we apply a heuristic here: Only functions marked with
87//! `#[inline]` are considered for inlining by the partitioner. The current
88//! implementation will not try to determine if a function is likely to be
89//! inlined by looking at the functions definition.
90//!
91//! Note though that as a side-effect of creating a codegen units per
92//! source-level module, functions from the same module will be available for
93//! inlining, even when they are not marked `#[inline]`.
9495use std::cmp;
96use std::collections::hash_map::Entry;
97use std::fs::{self, File};
98use std::io::Write;
99use std::path::{Path, PathBuf};
100101use rustc_data_structures::fx::{FxIndexMap, FxIndexSet};
102use rustc_data_structures::sync::par_join;
103use rustc_data_structures::unord::{UnordMap, UnordSet};
104use rustc_hir::LangItem;
105use rustc_hir::attrs::{InlineAttr, Linkage};
106use rustc_hir::def::DefKind;
107use rustc_hir::def_id::{DefId, DefIdSet, LOCAL_CRATE};
108use rustc_hir::definitions::DefPathDataName;
109use rustc_middle::bug;
110use rustc_middle::middle::codegen_fn_attrs::CodegenFnAttrFlags;
111use rustc_middle::middle::exported_symbols::{SymbolExportInfo, SymbolExportLevel};
112use rustc_middle::mir::StatementKind;
113use rustc_middle::mono::{
114CodegenUnit, CodegenUnitNameBuilder, InstantiationMode, MonoItem, MonoItemData,
115MonoItemPartitions, Visibility,
116};
117use rustc_middle::ty::print::{characteristic_def_id_of_type, with_no_trimmed_paths};
118use rustc_middle::ty::{self, InstanceKind, TyCtxt};
119use rustc_middle::util::Providers;
120use rustc_session::CodegenUnits;
121use rustc_session::config::{DumpMonoStatsFormat, SwitchWithOptPath};
122use rustc_span::Symbol;
123use rustc_target::spec::SymbolVisibility;
124use tracing::debug;
125126use crate::collector::{self, MonoItemCollectionStrategy, UsageMap};
127use crate::errors::{CouldntDumpMonoStats, SymbolAlreadyDefined};
128use crate::graph_checks::target_specific_checks;
129130struct PartitioningCx<'a, 'tcx> {
131 tcx: TyCtxt<'tcx>,
132 usage_map: &'a UsageMap<'tcx>,
133}
134135struct PlacedMonoItems<'tcx> {
136/// The codegen units, sorted by name to make things deterministic.
137codegen_units: Vec<CodegenUnit<'tcx>>,
138139 internalization_candidates: UnordSet<MonoItem<'tcx>>,
140}
141142// The output CGUs are sorted by name.
143fn partition<'tcx, I>(
144 tcx: TyCtxt<'tcx>,
145 mono_items: I,
146 usage_map: &UsageMap<'tcx>,
147) -> Vec<CodegenUnit<'tcx>>
148where
149I: Iterator<Item = MonoItem<'tcx>>,
150{
151let _prof_timer = tcx.prof.generic_activity("cgu_partitioning");
152153let cx = &PartitioningCx { tcx, usage_map };
154155// Place all mono items into a codegen unit. `place_mono_items` is
156 // responsible for initializing the CGU size estimates.
157let PlacedMonoItems { mut codegen_units, internalization_candidates } = {
158let _prof_timer = tcx.prof.generic_activity("cgu_partitioning_place_items");
159let placed = place_mono_items(cx, mono_items);
160161debug_dump(tcx, "PLACE", &placed.codegen_units);
162163placed164 };
165166// Merge until we don't exceed the max CGU count.
167 // `merge_codegen_units` is responsible for updating the CGU size
168 // estimates.
169{
170let _prof_timer = tcx.prof.generic_activity("cgu_partitioning_merge_cgus");
171merge_codegen_units(cx, &mut codegen_units);
172debug_dump(tcx, "MERGE", &codegen_units);
173 }
174175// Make as many symbols "internal" as possible, so LLVM has more freedom to
176 // optimize.
177if !tcx.sess.link_dead_code() {
178let _prof_timer = tcx.prof.generic_activity("cgu_partitioning_internalize_symbols");
179internalize_symbols(cx, &mut codegen_units, internalization_candidates);
180181debug_dump(tcx, "INTERNALIZE", &codegen_units);
182 }
183184// Mark one CGU for dead code, if necessary.
185if tcx.sess.instrument_coverage() {
186mark_code_coverage_dead_code_cgu(&mut codegen_units);
187 }
188189// Ensure CGUs are sorted by name, so that we get deterministic results.
190if !codegen_units.is_sorted_by(|a, b| a.name().as_str() <= b.name().as_str()) {
191let mut names = String::new();
192for cgu in codegen_units.iter() {
193 names += &::alloc::__export::must_use({
::alloc::fmt::format(format_args!("- {0}\n", cgu.name()))
})format!("- {}\n", cgu.name());
194 }
195::rustc_middle::util::bug::bug_fmt(format_args!("unsorted CGUs:\n{0}",
names));bug!("unsorted CGUs:\n{names}");
196 }
197198codegen_units199}
200201fn place_mono_items<'tcx, I>(cx: &PartitioningCx<'_, 'tcx>, mono_items: I) -> PlacedMonoItems<'tcx>
202where
203I: Iterator<Item = MonoItem<'tcx>>,
204{
205let mut codegen_units = UnordMap::default();
206let is_incremental_build = cx.tcx.sess.opts.incremental.is_some();
207let mut internalization_candidates = UnordSet::default();
208209// Determine if monomorphizations instantiated in this crate will be made
210 // available to downstream crates. This depends on whether we are in
211 // share-generics mode and whether the current crate can even have
212 // downstream crates.
213let can_export_generics = cx.tcx.local_crate_exports_generics();
214let always_export_generics = can_export_generics && cx.tcx.sess.opts.share_generics();
215216let cgu_name_builder = &mut CodegenUnitNameBuilder::new(cx.tcx);
217let cgu_name_cache = &mut UnordMap::default();
218219for mono_item in mono_items {
220// Handle only root (GloballyShared) items directly here. Inlined (LocalCopy) items
221 // are handled at the bottom of the loop based on reachability, with one exception.
222 // The #[lang = "start"] item is the program entrypoint, so there are no calls to it in MIR.
223 // So even if its mode is LocalCopy, we need to treat it like a root.
224match mono_item.instantiation_mode(cx.tcx) {
225 InstantiationMode::GloballyShared { .. } => {}
226 InstantiationMode::LocalCopy => continue,
227 }
228229let characteristic_def_id = characteristic_def_id_of_mono_item(cx.tcx, mono_item);
230let is_volatile = is_incremental_build && mono_item.is_generic_fn();
231232let cgu_name = match characteristic_def_id {
233Some(def_id) => compute_codegen_unit_name(
234 cx.tcx,
235 cgu_name_builder,
236 def_id,
237 is_volatile,
238 cgu_name_cache,
239 ),
240None => fallback_cgu_name(cgu_name_builder),
241 };
242243let cgu = codegen_units.entry(cgu_name).or_insert_with(|| CodegenUnit::new(cgu_name));
244245let mut can_be_internalized = true;
246let (linkage, visibility) = mono_item_linkage_and_visibility(
247 cx.tcx,
248&mono_item,
249&mut can_be_internalized,
250 can_export_generics,
251 always_export_generics,
252 );
253254if visibility == Visibility::Hidden && can_be_internalized {
255 internalization_candidates.insert(mono_item);
256 }
257let size_estimate = mono_item.size_estimate(cx.tcx);
258259 cgu.items_mut()
260 .insert(mono_item, MonoItemData { inlined: false, linkage, visibility, size_estimate });
261262// Get all inlined items that are reachable from `mono_item` without
263 // going via another root item. This includes drop-glue, functions from
264 // external crates, and local functions the definition of which is
265 // marked with `#[inline]`.
266let mut reachable_inlined_items = FxIndexSet::default();
267 get_reachable_inlined_items(cx.tcx, mono_item, cx.usage_map, &mut reachable_inlined_items);
268269// Add those inlined items. It's possible an inlined item is reachable
270 // from multiple root items within a CGU, which is fine, it just means
271 // the `insert` will be a no-op.
272for inlined_item in reachable_inlined_items {
273// This is a CGU-private copy.
274cgu.items_mut().entry(inlined_item).or_insert_with(|| MonoItemData {
275 inlined: true,
276 linkage: Linkage::Internal,
277 visibility: Visibility::Default,
278 size_estimate: inlined_item.size_estimate(cx.tcx),
279 });
280 }
281 }
282283// Always ensure we have at least one CGU; otherwise, if we have a
284 // crate with just types (for example), we could wind up with no CGU.
285if codegen_units.is_empty() {
286let cgu_name = fallback_cgu_name(cgu_name_builder);
287codegen_units.insert(cgu_name, CodegenUnit::new(cgu_name));
288 }
289290let mut codegen_units: Vec<_> = cx.tcx.with_stable_hashing_context(|mut hcx| {
291codegen_units.into_items().map(|(_, cgu)| cgu).collect_sorted(&mut hcx, true)
292 });
293294for cgu in codegen_units.iter_mut() {
295 cgu.compute_size_estimate();
296 }
297298return PlacedMonoItems { codegen_units, internalization_candidates };
299300fn get_reachable_inlined_items<'tcx>(
301 tcx: TyCtxt<'tcx>,
302 item: MonoItem<'tcx>,
303 usage_map: &UsageMap<'tcx>,
304 visited: &mut FxIndexSet<MonoItem<'tcx>>,
305 ) {
306usage_map.for_each_inlined_used_item(tcx, item, |inlined_item| {
307let is_new = visited.insert(inlined_item);
308if is_new {
309get_reachable_inlined_items(tcx, inlined_item, usage_map, visited);
310 }
311 });
312 }
313}
314315// This function requires the CGUs to be sorted by name on input, and ensures
316// they are sorted by name on return, for deterministic behaviour.
317fn merge_codegen_units<'tcx>(
318 cx: &PartitioningCx<'_, 'tcx>,
319 codegen_units: &mut Vec<CodegenUnit<'tcx>>,
320) {
321if !(cx.tcx.sess.codegen_units().as_usize() >= 1) {
::core::panicking::panic("assertion failed: cx.tcx.sess.codegen_units().as_usize() >= 1")
};assert!(cx.tcx.sess.codegen_units().as_usize() >= 1);
322323// A sorted order here ensures merging is deterministic.
324if !codegen_units.is_sorted_by(|a, b| a.name().as_str() <= b.name().as_str())
{
::core::panicking::panic("assertion failed: codegen_units.is_sorted_by(|a, b| a.name().as_str() <= b.name().as_str())")
};assert!(codegen_units.is_sorted_by(|a, b| a.name().as_str() <= b.name().as_str()));
325326// This map keeps track of what got merged into what.
327let mut cgu_contents: UnordMap<Symbol, Vec<Symbol>> =
328codegen_units.iter().map(|cgu| (cgu.name(), ::alloc::boxed::box_assume_init_into_vec_unsafe(::alloc::intrinsics::write_box_via_move(::alloc::boxed::Box::new_uninit(),
[cgu.name()]))vec![cgu.name()])).collect();
329330// If N is the maximum number of CGUs, and the CGUs are sorted from largest
331 // to smallest, we repeatedly find which CGU in codegen_units[N..] has the
332 // greatest overlap of inlined items with codegen_units[N-1], merge that
333 // CGU into codegen_units[N-1], then re-sort by size and repeat.
334 //
335 // We use inlined item overlap to guide this merging because it minimizes
336 // duplication of inlined items, which makes LLVM be faster and generate
337 // better and smaller machine code.
338 //
339 // Why merge into codegen_units[N-1]? We want CGUs to have similar sizes,
340 // which means we don't want codegen_units[0..N] (the already big ones)
341 // getting any bigger, if we can avoid it. When we have more than N CGUs
342 // then at least one of the biggest N will have to grow. codegen_units[N-1]
343 // is the smallest of those, and so has the most room to grow.
344let max_codegen_units = cx.tcx.sess.codegen_units().as_usize();
345while codegen_units.len() > max_codegen_units {
346// Sort small CGUs to the back.
347codegen_units.sort_by_key(|cgu| cmp::Reverse(cgu.size_estimate()));
348349let cgu_dst = &codegen_units[max_codegen_units - 1];
350351// Find the CGU that overlaps the most with `cgu_dst`. In the case of a
352 // tie, favour the earlier (bigger) CGU.
353let mut max_overlap = 0;
354let mut max_overlap_i = max_codegen_units;
355for (i, cgu_src) in codegen_units.iter().enumerate().skip(max_codegen_units) {
356if cgu_src.size_estimate() <= max_overlap {
357// None of the remaining overlaps can exceed `max_overlap`, so
358 // stop looking.
359break;
360 }
361362let overlap = compute_inlined_overlap(cgu_dst, cgu_src);
363if overlap > max_overlap {
364 max_overlap = overlap;
365 max_overlap_i = i;
366 }
367 }
368369let mut cgu_src = codegen_units.swap_remove(max_overlap_i);
370let cgu_dst = &mut codegen_units[max_codegen_units - 1];
371372// Move the items from `cgu_src` to `cgu_dst`. Some of them may be
373 // duplicate inlined items, in which case the destination CGU is
374 // unaffected. Recalculate size estimates afterwards.
375cgu_dst.items_mut().append(cgu_src.items_mut());
376 cgu_dst.compute_size_estimate();
377378// Record that `cgu_dst` now contains all the stuff that was in
379 // `cgu_src` before.
380let mut consumed_cgu_names = cgu_contents.remove(&cgu_src.name()).unwrap();
381 cgu_contents.get_mut(&cgu_dst.name()).unwrap().append(&mut consumed_cgu_names);
382 }
383384// Having multiple CGUs can drastically speed up compilation. But for
385 // non-incremental builds, tiny CGUs slow down compilation *and* result in
386 // worse generated code. So we don't allow CGUs smaller than this (unless
387 // there is just one CGU, of course). Note that CGU sizes of 100,000+ are
388 // common in larger programs, so this isn't all that large.
389const NON_INCR_MIN_CGU_SIZE: usize = 1800;
390391// Repeatedly merge the two smallest codegen units as long as: it's a
392 // non-incremental build, and the user didn't specify a CGU count, and
393 // there are multiple CGUs, and some are below the minimum size.
394 //
395 // The "didn't specify a CGU count" condition is because when an explicit
396 // count is requested we observe it as closely as possible. For example,
397 // the `compiler_builtins` crate sets `codegen-units = 10000` and it's
398 // critical they aren't merged. Also, some tests use explicit small values
399 // and likewise won't work if small CGUs are merged.
400while cx.tcx.sess.opts.incremental.is_none()
401 && #[allow(non_exhaustive_omitted_patterns)] match cx.tcx.sess.codegen_units() {
CodegenUnits::Default(_) => true,
_ => false,
}matches!(cx.tcx.sess.codegen_units(), CodegenUnits::Default(_))402 && codegen_units.len() > 1
403&& codegen_units.iter().any(|cgu| cgu.size_estimate() < NON_INCR_MIN_CGU_SIZE)
404 {
405// Sort small cgus to the back.
406codegen_units.sort_by_key(|cgu| cmp::Reverse(cgu.size_estimate()));
407408let mut smallest = codegen_units.pop().unwrap();
409let second_smallest = codegen_units.last_mut().unwrap();
410411// Move the items from `smallest` to `second_smallest`. Some of them
412 // may be duplicate inlined items, in which case the destination CGU is
413 // unaffected. Recalculate size estimates afterwards.
414second_smallest.items_mut().append(smallest.items_mut());
415 second_smallest.compute_size_estimate();
416417// Don't update `cgu_contents`, that's only for incremental builds.
418}
419420let cgu_name_builder = &mut CodegenUnitNameBuilder::new(cx.tcx);
421422// Rename the newly merged CGUs.
423if cx.tcx.sess.opts.incremental.is_some() {
424// If we are doing incremental compilation, we want CGU names to
425 // reflect the path of the source level module they correspond to.
426 // For CGUs that contain the code of multiple modules because of the
427 // merging done above, we use a concatenation of the names of all
428 // contained CGUs.
429let new_cgu_names = UnordMap::from(
430cgu_contents431 .items()
432// This `filter` makes sure we only update the name of CGUs that
433 // were actually modified by merging.
434.filter(|(_, cgu_contents)| cgu_contents.len() > 1)
435 .map(|(current_cgu_name, cgu_contents)| {
436let mut cgu_contents: Vec<&str> =
437cgu_contents.iter().map(|s| s.as_str()).collect();
438439// Sort the names, so things are deterministic and easy to
440 // predict. We are sorting primitive `&str`s here so we can
441 // use unstable sort.
442cgu_contents.sort_unstable();
443444 (*current_cgu_name, cgu_contents.join("--"))
445 }),
446 );
447448for cgu in codegen_units.iter_mut() {
449if let Some(new_cgu_name) = new_cgu_names.get(&cgu.name()) {
450let new_cgu_name = if cx.tcx.sess.opts.unstable_opts.human_readable_cgu_names {
451 Symbol::intern(&CodegenUnit::shorten_name(new_cgu_name))
452 } else {
453// If we don't require CGU names to be human-readable,
454 // we use a fixed length hash of the composite CGU name
455 // instead.
456Symbol::intern(&CodegenUnit::mangle_name(new_cgu_name))
457 };
458 cgu.set_name(new_cgu_name);
459 }
460 }
461462// A sorted order here ensures what follows can be deterministic.
463codegen_units.sort_by(|a, b| a.name().as_str().cmp(b.name().as_str()));
464 } else {
465// When compiling non-incrementally, we rename the CGUS so they have
466 // identical names except for the numeric suffix, something like
467 // `regex.f10ba03eb5ec7975-cgu.N`, where `N` varies.
468 //
469 // It is useful for debugging and profiling purposes if the resulting
470 // CGUs are sorted by name *and* reverse sorted by size. (CGU 0 is the
471 // biggest, CGU 1 is the second biggest, etc.)
472 //
473 // So first we reverse sort by size. Then we generate the names with
474 // zero-padded suffixes, which means they are automatically sorted by
475 // names. The numeric suffix width depends on the number of CGUs, which
476 // is always greater than zero:
477 // - [1,9] CGUs: `0`, `1`, `2`, ...
478 // - [10,99] CGUs: `00`, `01`, `02`, ...
479 // - [100,999] CGUs: `000`, `001`, `002`, ...
480 // - etc.
481 //
482 // If we didn't zero-pad the sorted-by-name order would be `XYZ-cgu.0`,
483 // `XYZ-cgu.1`, `XYZ-cgu.10`, `XYZ-cgu.11`, ..., `XYZ-cgu.2`, etc.
484codegen_units.sort_by_key(|cgu| cmp::Reverse(cgu.size_estimate()));
485let num_digits = codegen_units.len().ilog10() as usize + 1;
486for (index, cgu) in codegen_units.iter_mut().enumerate() {
487// Note: `WorkItem::short_description` depends on this name ending
488 // with `-cgu.` followed by a numeric suffix. Please keep it in
489 // sync with this code.
490let suffix = ::alloc::__export::must_use({
::alloc::fmt::format(format_args!("{0:01$}", index, num_digits))
})format!("{index:0num_digits$}");
491let numbered_codegen_unit_name =
492 cgu_name_builder.build_cgu_name_no_mangle(LOCAL_CRATE, &["cgu"], Some(suffix));
493 cgu.set_name(numbered_codegen_unit_name);
494 }
495 }
496}
497498/// Compute the combined size of all inlined items that appear in both `cgu1`
499/// and `cgu2`.
500fn compute_inlined_overlap<'tcx>(cgu1: &CodegenUnit<'tcx>, cgu2: &CodegenUnit<'tcx>) -> usize {
501// Either order works. We pick the one that involves iterating over fewer
502 // items.
503let (src_cgu, dst_cgu) =
504if cgu1.items().len() <= cgu2.items().len() { (cgu1, cgu2) } else { (cgu2, cgu1) };
505506let mut overlap = 0;
507for (item, data) in src_cgu.items().iter() {
508if data.inlined && dst_cgu.items().contains_key(item) {
509 overlap += data.size_estimate;
510 }
511 }
512overlap513}
514515fn internalize_symbols<'tcx>(
516 cx: &PartitioningCx<'_, 'tcx>,
517 codegen_units: &mut [CodegenUnit<'tcx>],
518 internalization_candidates: UnordSet<MonoItem<'tcx>>,
519) {
520/// For symbol internalization, we need to know whether a symbol/mono-item
521 /// is used from outside the codegen unit it is defined in. This type is
522 /// used to keep track of that.
523#[derive(#[automatically_derived]
impl ::core::clone::Clone for MonoItemPlacement {
#[inline]
fn clone(&self) -> MonoItemPlacement {
match self {
MonoItemPlacement::SingleCgu(__self_0) =>
MonoItemPlacement::SingleCgu(::core::clone::Clone::clone(__self_0)),
MonoItemPlacement::MultipleCgus =>
MonoItemPlacement::MultipleCgus,
}
}
}Clone, #[automatically_derived]
impl ::core::cmp::PartialEq for MonoItemPlacement {
#[inline]
fn eq(&self, other: &MonoItemPlacement) -> bool {
let __self_discr = ::core::intrinsics::discriminant_value(self);
let __arg1_discr = ::core::intrinsics::discriminant_value(other);
__self_discr == __arg1_discr &&
match (self, other) {
(MonoItemPlacement::SingleCgu(__self_0),
MonoItemPlacement::SingleCgu(__arg1_0)) =>
__self_0 == __arg1_0,
_ => true,
}
}
}PartialEq, #[automatically_derived]
impl ::core::cmp::Eq for MonoItemPlacement {
#[inline]
#[doc(hidden)]
#[coverage(off)]
fn assert_fields_are_eq(&self) {
let _: ::core::cmp::AssertParamIsEq<Symbol>;
}
}Eq, #[automatically_derived]
impl ::core::fmt::Debug for MonoItemPlacement {
#[inline]
fn fmt(&self, f: &mut ::core::fmt::Formatter) -> ::core::fmt::Result {
match self {
MonoItemPlacement::SingleCgu(__self_0) =>
::core::fmt::Formatter::debug_tuple_field1_finish(f,
"SingleCgu", &__self_0),
MonoItemPlacement::MultipleCgus =>
::core::fmt::Formatter::write_str(f, "MultipleCgus"),
}
}
}Debug)]
524enum MonoItemPlacement {
525 SingleCgu(Symbol),
526 MultipleCgus,
527 }
528529let mut mono_item_placements = UnordMap::default();
530let single_codegen_unit = codegen_units.len() == 1;
531532if !single_codegen_unit {
533for cgu in codegen_units.iter() {
534for item in cgu.items().keys() {
535// If there is more than one codegen unit, we need to keep track
536 // in which codegen units each monomorphization is placed.
537match mono_item_placements.entry(*item) {
538 Entry::Occupied(e) => {
539let placement = e.into_mut();
540if true {
if !match *placement {
MonoItemPlacement::SingleCgu(cgu_name) =>
cgu_name != cgu.name(),
MonoItemPlacement::MultipleCgus => true,
} {
::core::panicking::panic("assertion failed: match *placement {\n MonoItemPlacement::SingleCgu(cgu_name) => cgu_name != cgu.name(),\n MonoItemPlacement::MultipleCgus => true,\n}")
};
};debug_assert!(match *placement {
541 MonoItemPlacement::SingleCgu(cgu_name) => cgu_name != cgu.name(),
542 MonoItemPlacement::MultipleCgus => true,
543 });
544*placement = MonoItemPlacement::MultipleCgus;
545 }
546 Entry::Vacant(e) => {
547 e.insert(MonoItemPlacement::SingleCgu(cgu.name()));
548 }
549 }
550 }
551 }
552 }
553554// For each internalization candidates in each codegen unit, check if it is
555 // used from outside its defining codegen unit.
556for cgu in codegen_units {
557let home_cgu = MonoItemPlacement::SingleCgu(cgu.name());
558559for (item, data) in cgu.items_mut() {
560if !internalization_candidates.contains(item) {
561// This item is no candidate for internalizing, so skip it.
562continue;
563 }
564565if !single_codegen_unit {
566if true {
match (&mono_item_placements[item], &home_cgu) {
(left_val, right_val) => {
if !(*left_val == *right_val) {
let kind = ::core::panicking::AssertKind::Eq;
::core::panicking::assert_failed(kind, &*left_val,
&*right_val, ::core::option::Option::None);
}
}
};
};debug_assert_eq!(mono_item_placements[item], home_cgu);
567568if cx
569 .usage_map
570 .get_user_items(*item)
571 .iter()
572 .filter_map(|user_item| {
573// Some user mono items might not have been
574 // instantiated. We can safely ignore those.
575mono_item_placements.get(user_item)
576 })
577 .any(|placement| *placement != home_cgu)
578 {
579// Found a user from another CGU, so skip to the next item
580 // without marking this one as internal.
581continue;
582 }
583 }
584585// When LTO inlines the caller of a naked function, it will attempt but fail to make the
586 // naked function symbol visible. To ensure that LTO works correctly, do not default
587 // naked functions to internal linkage and default visibility.
588if let MonoItem::Fn(instance) = item {
589let flags = cx.tcx.codegen_instance_attrs(instance.def).flags;
590if flags.contains(CodegenFnAttrFlags::NAKED) {
591continue;
592 }
593 }
594595// If we got here, we did not find any uses from other CGUs, so
596 // it's fine to make this monomorphization internal.
597data.linkage = Linkage::Internal;
598 data.visibility = Visibility::Default;
599 }
600 }
601}
602603fn mark_code_coverage_dead_code_cgu<'tcx>(codegen_units: &mut [CodegenUnit<'tcx>]) {
604if !!codegen_units.is_empty() {
::core::panicking::panic("assertion failed: !codegen_units.is_empty()")
};assert!(!codegen_units.is_empty());
605606// Find the smallest CGU that has exported symbols and put the dead
607 // function stubs in that CGU. We look for exported symbols to increase
608 // the likelihood the linker won't throw away the dead functions.
609 // FIXME(#92165): In order to truly resolve this, we need to make sure
610 // the object file (CGU) containing the dead function stubs is included
611 // in the final binary. This will probably require forcing these
612 // function symbols to be included via `-u` or `/include` linker args.
613let dead_code_cgu = codegen_units614 .iter_mut()
615 .filter(|cgu| cgu.items().iter().any(|(_, data)| data.linkage == Linkage::External))
616 .min_by_key(|cgu| cgu.size_estimate());
617618// If there are no CGUs that have externally linked items, then we just
619 // pick the first CGU as a fallback.
620let dead_code_cgu = if let Some(cgu) = dead_code_cgu { cgu } else { &mut codegen_units[0] };
621622dead_code_cgu.make_code_coverage_dead_code_cgu();
623}
624625fn characteristic_def_id_of_mono_item<'tcx>(
626 tcx: TyCtxt<'tcx>,
627 mono_item: MonoItem<'tcx>,
628) -> Option<DefId> {
629match mono_item {
630 MonoItem::Fn(instance) => {
631let def_id = match instance.def {
632 ty::InstanceKind::Item(def) => def,
633 ty::InstanceKind::VTableShim(..)
634 | ty::InstanceKind::ReifyShim(..)
635 | ty::InstanceKind::FnPtrShim(..)
636 | ty::InstanceKind::ClosureOnceShim { .. }
637 | ty::InstanceKind::ConstructCoroutineInClosureShim { .. }
638 | ty::InstanceKind::Intrinsic(..)
639 | ty::InstanceKind::DropGlue(..)
640 | ty::InstanceKind::Virtual(..)
641 | ty::InstanceKind::CloneShim(..)
642 | ty::InstanceKind::ThreadLocalShim(..)
643 | ty::InstanceKind::FnPtrAddrShim(..)
644 | ty::InstanceKind::FutureDropPollShim(..)
645 | ty::InstanceKind::AsyncDropGlue(..)
646 | ty::InstanceKind::AsyncDropGlueCtorShim(..) => return None,
647 };
648649// If this is a method, we want to put it into the same module as
650 // its self-type. If the self-type does not provide a characteristic
651 // DefId, we use the location of the impl after all.
652653let assoc_parent = tcx.assoc_parent(def_id);
654655if let Some((_, DefKind::Trait)) = assoc_parent {
656let self_ty = instance.args.type_at(0);
657// This is a default implementation of a trait method.
658return characteristic_def_id_of_type(self_ty).or(Some(def_id));
659 }
660661if let Some((impl_def_id, DefKind::Impl { of_trait })) = assoc_parent {
662if of_trait663 && tcx.sess.opts.incremental.is_some()
664 && tcx.is_lang_item(tcx.impl_trait_id(impl_def_id), LangItem::Drop)
665 {
666// Put `Drop::drop` into the same cgu as `drop_in_place`
667 // since `drop_in_place` is the only thing that can
668 // call it.
669return None;
670 }
671672// This is a method within an impl, find out what the self-type is:
673let impl_self_ty = tcx.instantiate_and_normalize_erasing_regions(
674instance.args,
675 ty::TypingEnv::fully_monomorphized(),
676tcx.type_of(impl_def_id),
677 );
678if let Some(def_id) = characteristic_def_id_of_type(impl_self_ty) {
679return Some(def_id);
680 }
681 }
682683Some(def_id)
684 }
685 MonoItem::Static(def_id) => Some(def_id),
686 MonoItem::GlobalAsm(item_id) => Some(item_id.owner_id.to_def_id()),
687 }
688}
689690fn compute_codegen_unit_name(
691 tcx: TyCtxt<'_>,
692 name_builder: &mut CodegenUnitNameBuilder<'_>,
693 def_id: DefId,
694 volatile: bool,
695 cache: &mut CguNameCache,
696) -> Symbol {
697// Find the innermost module that is not nested within a function.
698let mut current_def_id = def_id;
699let mut cgu_def_id = None;
700// Walk backwards from the item we want to find the module for.
701loop {
702if current_def_id.is_crate_root() {
703if cgu_def_id.is_none() {
704// If we have not found a module yet, take the crate root.
705cgu_def_id = Some(def_id.krate.as_def_id());
706 }
707break;
708 } else if tcx.def_kind(current_def_id) == DefKind::Mod {
709if cgu_def_id.is_none() {
710cgu_def_id = Some(current_def_id);
711 }
712 } else {
713// If we encounter something that is not a module, throw away
714 // any module that we've found so far because we now know that
715 // it is nested within something else.
716cgu_def_id = None;
717 }
718719current_def_id = tcx.parent(current_def_id);
720 }
721722let cgu_def_id = cgu_def_id.unwrap();
723724*cache.entry((cgu_def_id, volatile)).or_insert_with(|| {
725let def_path = tcx.def_path(cgu_def_id);
726727let components = def_path.data.iter().map(|part| match part.data.name() {
728 DefPathDataName::Named(name) => name,
729 DefPathDataName::Anon { .. } => ::core::panicking::panic("internal error: entered unreachable code")unreachable!(),
730 });
731732let volatile_suffix = volatile.then_some("volatile");
733734name_builder.build_cgu_name(def_path.krate, components, volatile_suffix)
735 })
736}
737738// Anything we can't find a proper codegen unit for goes into this.
739fn fallback_cgu_name(name_builder: &mut CodegenUnitNameBuilder<'_>) -> Symbol {
740name_builder.build_cgu_name(LOCAL_CRATE, &["fallback"], Some("cgu"))
741}
742743fn mono_item_linkage_and_visibility<'tcx>(
744 tcx: TyCtxt<'tcx>,
745 mono_item: &MonoItem<'tcx>,
746 can_be_internalized: &mut bool,
747 can_export_generics: bool,
748 always_export_generics: bool,
749) -> (Linkage, Visibility) {
750if let Some(explicit_linkage) = mono_item.explicit_linkage(tcx) {
751return (explicit_linkage, Visibility::Default);
752 }
753let vis = mono_item_visibility(
754tcx,
755mono_item,
756can_be_internalized,
757can_export_generics,
758always_export_generics,
759 );
760 (Linkage::External, vis)
761}
762763type CguNameCache = UnordMap<(DefId, bool), Symbol>;
764765fn static_visibility<'tcx>(
766 tcx: TyCtxt<'tcx>,
767 can_be_internalized: &mut bool,
768 def_id: DefId,
769) -> Visibility {
770if tcx.is_reachable_non_generic(def_id) {
771*can_be_internalized = false;
772default_visibility(tcx, def_id, false)
773 } else {
774 Visibility::Hidden775 }
776}
777778fn mono_item_visibility<'tcx>(
779 tcx: TyCtxt<'tcx>,
780 mono_item: &MonoItem<'tcx>,
781 can_be_internalized: &mut bool,
782 can_export_generics: bool,
783 always_export_generics: bool,
784) -> Visibility {
785let instance = match mono_item {
786// This is pretty complicated; see below.
787MonoItem::Fn(instance) => instance,
788789// Misc handling for generics and such, but otherwise:
790MonoItem::Static(def_id) => return static_visibility(tcx, can_be_internalized, *def_id),
791 MonoItem::GlobalAsm(item_id) => {
792return static_visibility(tcx, can_be_internalized, item_id.owner_id.to_def_id());
793 }
794 };
795796let def_id = match instance.def {
797 InstanceKind::Item(def_id)
798 | InstanceKind::DropGlue(def_id, Some(_))
799 | InstanceKind::FutureDropPollShim(def_id, _, _)
800 | InstanceKind::AsyncDropGlue(def_id, _)
801 | InstanceKind::AsyncDropGlueCtorShim(def_id, _) => def_id,
802803// We match the visibility of statics here
804InstanceKind::ThreadLocalShim(def_id) => {
805return static_visibility(tcx, can_be_internalized, def_id);
806 }
807808// These are all compiler glue and such, never exported, always hidden.
809InstanceKind::VTableShim(..)
810 | InstanceKind::ReifyShim(..)
811 | InstanceKind::FnPtrShim(..)
812 | InstanceKind::Virtual(..)
813 | InstanceKind::Intrinsic(..)
814 | InstanceKind::ClosureOnceShim { .. }
815 | InstanceKind::ConstructCoroutineInClosureShim { .. }
816 | InstanceKind::DropGlue(..)
817 | InstanceKind::CloneShim(..)
818 | InstanceKind::FnPtrAddrShim(..) => return Visibility::Hidden,
819 };
820821// Both the `start_fn` lang item and `main` itself should not be exported,
822 // so we give them with `Hidden` visibility but these symbols are
823 // only referenced from the actual `main` symbol which we unfortunately
824 // don't know anything about during partitioning/collection. As a result we
825 // forcibly keep this symbol out of the `internalization_candidates` set.
826 //
827 // FIXME: eventually we don't want to always force this symbol to have
828 // hidden visibility, it should indeed be a candidate for
829 // internalization, but we have to understand that it's referenced
830 // from the `main` symbol we'll generate later.
831 //
832 // This may be fixable with a new `InstanceKind` perhaps? Unsure!
833if tcx.is_entrypoint(def_id) {
834*can_be_internalized = false;
835return Visibility::Hidden;
836 }
837838let is_generic = instance.args.non_erasable_generics().next().is_some();
839840// Upstream `DefId` instances get different handling than local ones.
841let Some(def_id) = def_id.as_local() else {
842return if is_generic843 && (always_export_generics844 || (can_export_generics845 && tcx.codegen_fn_attrs(def_id).inline == InlineAttr::Never))
846 {
847// If it is an upstream monomorphization and we export generics, we must make
848 // it available to downstream crates.
849*can_be_internalized = false;
850default_visibility(tcx, def_id, true)
851 } else {
852 Visibility::Hidden853 };
854 };
855856if is_generic {
857if always_export_generics858 || (can_export_generics && tcx.codegen_fn_attrs(def_id).inline == InlineAttr::Never)
859 {
860if tcx.is_unreachable_local_definition(def_id) {
861// This instance cannot be used from another crate.
862Visibility::Hidden863 } else {
864// This instance might be useful in a downstream crate.
865*can_be_internalized = false;
866default_visibility(tcx, def_id.to_def_id(), true)
867 }
868 } else {
869// We are not exporting generics or the definition is not reachable
870 // for downstream crates, we can internalize its instantiations.
871Visibility::Hidden872 }
873 } else {
874// If this isn't a generic function then we mark this a `Default` if
875 // this is a reachable item, meaning that it's a symbol other crates may
876 // use when they link to us.
877if tcx.is_reachable_non_generic(def_id.to_def_id()) {
878*can_be_internalized = false;
879if true {
if !!is_generic {
::core::panicking::panic("assertion failed: !is_generic")
};
};debug_assert!(!is_generic);
880return default_visibility(tcx, def_id.to_def_id(), false);
881 }
882883// If this isn't reachable then we're gonna tag this with `Hidden`
884 // visibility. In some situations though we'll want to prevent this
885 // symbol from being internalized.
886 //
887 // There's three categories of items here:
888 //
889 // * First is weak lang items. These are basically mechanisms for
890 // libcore to forward-reference symbols defined later in crates like
891 // the standard library or `#[panic_handler]` definitions. The
892 // definition of these weak lang items needs to be referenceable by
893 // libcore, so we're no longer a candidate for internalization.
894 // Removal of these functions can't be done by LLVM but rather must be
895 // done by the linker as it's a non-local decision.
896 //
897 // * Second is "std internal symbols". Currently this is primarily used
898 // for allocator symbols. Allocators are a little weird in their
899 // implementation, but the idea is that the compiler, at the last
900 // minute, defines an allocator with an injected object file. The
901 // `alloc` crate references these symbols (`__rust_alloc`) and the
902 // definition doesn't get hooked up until a linked crate artifact is
903 // generated.
904 //
905 // The symbols synthesized by the compiler (`__rust_alloc`) are thin
906 // veneers around the actual implementation, some other symbol which
907 // implements the same ABI. These symbols (things like `__rg_alloc`,
908 // `__rdl_alloc`, `__rde_alloc`, etc), are all tagged with "std
909 // internal symbols".
910 //
911 // The std-internal symbols here **should not show up in a dll as an
912 // exported interface**, so they return `false` from
913 // `is_reachable_non_generic` above and we'll give them `Hidden`
914 // visibility below. Like the weak lang items, though, we can't let
915 // LLVM internalize them as this decision is left up to the linker to
916 // omit them, so prevent them from being internalized.
917 //
918 // * Externally implementable items. They work (in this case) pretty much the same as
919 // RUSTC_STD_INTERNAL_SYMBOL in that their implementation is also chosen later in
920 // the compilation process and we can't let them be internalized and they can't
921 // show up as an external interface.
922let attrs = tcx.codegen_fn_attrs(def_id);
923if attrs.flags.intersects(
924CodegenFnAttrFlags::RUSTC_STD_INTERNAL_SYMBOL925 | CodegenFnAttrFlags::EXTERNALLY_IMPLEMENTABLE_ITEM,
926 ) {
927*can_be_internalized = false;
928 }
929930 Visibility::Hidden931 }
932}
933934fn default_visibility(tcx: TyCtxt<'_>, id: DefId, is_generic: bool) -> Visibility {
935// Fast-path to avoid expensive query call below
936if tcx.sess.default_visibility() == SymbolVisibility::Interposable {
937return Visibility::Default;
938 }
939940let export_level = if is_generic {
941// Generic functions never have export-level C.
942SymbolExportLevel::Rust943 } else {
944match tcx.reachable_non_generics(id.krate).get(&id) {
945Some(SymbolExportInfo { level: SymbolExportLevel::C, .. }) => SymbolExportLevel::C,
946_ => SymbolExportLevel::Rust,
947 }
948 };
949950match export_level {
951// C-export level items remain at `Default` to allow C code to
952 // access and interpose them.
953SymbolExportLevel::C => Visibility::Default,
954955// For all other symbols, `default_visibility` determines which visibility to use.
956SymbolExportLevel::Rust => tcx.sess.default_visibility().into(),
957 }
958}
959960fn debug_dump<'a, 'tcx: 'a>(tcx: TyCtxt<'tcx>, label: &str, cgus: &[CodegenUnit<'tcx>]) {
961let dump = move || {
962use std::fmt::Write;
963964let mut num_cgus = 0;
965let mut all_cgu_sizes = Vec::new();
966967// Note: every unique root item is placed exactly once, so the number
968 // of unique root items always equals the number of placed root items.
969 //
970 // Also, unreached inlined items won't be counted here. This is fine.
971972let mut inlined_items = UnordSet::default();
973974let mut root_items = 0;
975let mut unique_inlined_items = 0;
976let mut placed_inlined_items = 0;
977978let mut root_size = 0;
979let mut unique_inlined_size = 0;
980let mut placed_inlined_size = 0;
981982for cgu in cgus.iter() {
983 num_cgus += 1;
984 all_cgu_sizes.push(cgu.size_estimate());
985986for (item, data) in cgu.items() {
987if !data.inlined {
988 root_items += 1;
989 root_size += data.size_estimate;
990 } else {
991if inlined_items.insert(item) {
992 unique_inlined_items += 1;
993 unique_inlined_size += data.size_estimate;
994 }
995 placed_inlined_items += 1;
996 placed_inlined_size += data.size_estimate;
997 }
998 }
999 }
10001001all_cgu_sizes.sort_unstable_by_key(|&n| cmp::Reverse(n));
10021003let unique_items = root_items + unique_inlined_items;
1004let placed_items = root_items + placed_inlined_items;
1005let items_ratio = placed_itemsas f64 / unique_itemsas f64;
10061007let unique_size = root_size + unique_inlined_size;
1008let placed_size = root_size + placed_inlined_size;
1009let size_ratio = placed_sizeas f64 / unique_sizeas f64;
10101011let mean_cgu_size = placed_sizeas f64 / num_cgusas f64;
10121013match (&placed_size, &all_cgu_sizes.iter().sum::<usize>()) {
(left_val, right_val) => {
if !(*left_val == *right_val) {
let kind = ::core::panicking::AssertKind::Eq;
::core::panicking::assert_failed(kind, &*left_val, &*right_val,
::core::option::Option::None);
}
}
};assert_eq!(placed_size, all_cgu_sizes.iter().sum::<usize>());
10141015let s = &mut String::new();
1016let _ = s.write_fmt(format_args!("{0}\n", label))writeln!(s, "{label}");
1017let _ = s.write_fmt(format_args!("- unique items: {1} ({2} root + {3} inlined), unique size: {4} ({5} root + {6} inlined)\n- placed items: {7} ({2} root + {8} inlined), placed size: {9} ({5} root + {10} inlined)\n- placed/unique items ratio: {11:.2}, placed/unique size ratio: {12:.2}\n- CGUs: {13}, mean size: {14:.1}, sizes: {0}\n",
list(&all_cgu_sizes), unique_items, root_items, unique_inlined_items,
unique_size, root_size, unique_inlined_size, placed_items,
placed_inlined_items, placed_size, placed_inlined_size, items_ratio,
size_ratio, num_cgus, mean_cgu_size))writeln!(
1018s,
1019"- unique items: {unique_items} ({root_items} root + {unique_inlined_items} inlined), \
1020 unique size: {unique_size} ({root_size} root + {unique_inlined_size} inlined)\n\
1021 - placed items: {placed_items} ({root_items} root + {placed_inlined_items} inlined), \
1022 placed size: {placed_size} ({root_size} root + {placed_inlined_size} inlined)\n\
1023 - placed/unique items ratio: {items_ratio:.2}, \
1024 placed/unique size ratio: {size_ratio:.2}\n\
1025 - CGUs: {num_cgus}, mean size: {mean_cgu_size:.1}, sizes: {}",
1026 list(&all_cgu_sizes),
1027 );
1028let _ = s.write_fmt(format_args!("\n"))writeln!(s);
10291030for (i, cgu) in cgus.iter().enumerate() {
1031let name = cgu.name();
1032let size = cgu.size_estimate();
1033let num_items = cgu.items().len();
1034let mean_size = size as f64 / num_items as f64;
10351036let mut placed_item_sizes: Vec<_> =
1037 cgu.items().values().map(|data| data.size_estimate).collect();
1038 placed_item_sizes.sort_unstable_by_key(|&n| cmp::Reverse(n));
1039let sizes = list(&placed_item_sizes);
10401041let _ = s.write_fmt(format_args!("- CGU[{0}]\n", i))writeln!(s, "- CGU[{i}]");
1042let _ = s.write_fmt(format_args!(" - {0}, size: {1}\n", name, size))writeln!(s, " - {name}, size: {size}");
1043let _ =
1044s.write_fmt(format_args!(" - items: {0}, mean size: {1:.1}, sizes: {2}\n",
num_items, mean_size, sizes))writeln!(s, " - items: {num_items}, mean size: {mean_size:.1}, sizes: {sizes}",);
10451046for (item, data) in cgu.items_in_deterministic_order(tcx) {
1047let linkage = data.linkage;
1048let symbol_name = item.symbol_name(tcx).name;
1049let symbol_hash_start = symbol_name.rfind('h');
1050let symbol_hash = symbol_hash_start.map_or("<no hash>", |i| &symbol_name[i..]);
1051let kind = if !data.inlined { "root" } else { "inlined" };
1052let size = data.size_estimate;
1053let _ = {
let _guard = NoTrimmedGuard::new();
s.write_fmt(format_args!(" - {0} [{1:?}] [{2}] ({3}, size: {4})\n", item,
linkage, symbol_hash, kind, size))
}with_no_trimmed_paths!(writeln!(
1054 s,
1055" - {item} [{linkage:?}] [{symbol_hash}] ({kind}, size: {size})"
1056));
1057 }
10581059let _ = s.write_fmt(format_args!("\n"))writeln!(s);
1060 }
10611062return std::mem::take(s);
10631064// Converts a slice to a string, capturing repetitions to save space.
1065 // E.g. `[4, 4, 4, 3, 2, 1, 1, 1, 1, 1]` -> "[4 (x3), 3, 2, 1 (x5)]".
1066fn list(ns: &[usize]) -> String {
1067let mut v = Vec::new();
1068if ns.is_empty() {
1069return "[]".to_string();
1070 }
10711072let mut elem = |curr, curr_count| {
1073if curr_count == 1 {
1074v.push(::alloc::__export::must_use({
::alloc::fmt::format(format_args!("{0}", curr))
})format!("{curr}"));
1075 } else {
1076v.push(::alloc::__export::must_use({
::alloc::fmt::format(format_args!("{0} (x{1})", curr, curr_count))
})format!("{curr} (x{curr_count})"));
1077 }
1078 };
10791080let mut curr = ns[0];
1081let mut curr_count = 1;
10821083for &n in &ns[1..] {
1084if n != curr {
1085 elem(curr, curr_count);
1086 curr = n;
1087 curr_count = 1;
1088 } else {
1089 curr_count += 1;
1090 }
1091 }
1092elem(curr, curr_count);
10931094::alloc::__export::must_use({
::alloc::fmt::format(format_args!("[{0}]", v.join(", ")))
})format!("[{}]", v.join(", "))1095 }
1096 };
10971098{
use ::tracing::__macro_support::Callsite as _;
static __CALLSITE: ::tracing::callsite::DefaultCallsite =
{
static META: ::tracing::Metadata<'static> =
{
::tracing_core::metadata::Metadata::new("event compiler/rustc_monomorphize/src/partitioning.rs:1098",
"rustc_monomorphize::partitioning", ::tracing::Level::DEBUG,
::tracing_core::__macro_support::Option::Some("compiler/rustc_monomorphize/src/partitioning.rs"),
::tracing_core::__macro_support::Option::Some(1098u32),
::tracing_core::__macro_support::Option::Some("rustc_monomorphize::partitioning"),
::tracing_core::field::FieldSet::new(&["message"],
::tracing_core::callsite::Identifier(&__CALLSITE)),
::tracing::metadata::Kind::EVENT)
};
::tracing::callsite::DefaultCallsite::new(&META)
};
let enabled =
::tracing::Level::DEBUG <= ::tracing::level_filters::STATIC_MAX_LEVEL
&&
::tracing::Level::DEBUG <=
::tracing::level_filters::LevelFilter::current() &&
{
let interest = __CALLSITE.interest();
!interest.is_never() &&
::tracing::__macro_support::__is_enabled(__CALLSITE.metadata(),
interest)
};
if enabled {
(|value_set: ::tracing::field::ValueSet|
{
let meta = __CALLSITE.metadata();
::tracing::Event::dispatch(meta, &value_set);
;
})({
#[allow(unused_imports)]
use ::tracing::field::{debug, display, Value};
let mut iter = __CALLSITE.metadata().fields().iter();
__CALLSITE.metadata().fields().value_set(&[(&::tracing::__macro_support::Iterator::next(&mut iter).expect("FieldSet corrupted (this is a bug)"),
::tracing::__macro_support::Option::Some(&format_args!("{0}",
dump()) as &dyn Value))])
});
} else { ; }
};debug!("{}", dump());
1099}
11001101#[inline(never)] // give this a place in the profiler
1102fn assert_symbols_are_distinct<'a, 'tcx, I>(tcx: TyCtxt<'tcx>, mono_items: I)
1103where
1104I: Iterator<Item = &'a MonoItem<'tcx>>,
1105'tcx: 'a,
1106{
1107let _prof_timer = tcx.prof.generic_activity("assert_symbols_are_distinct");
11081109let mut symbols: Vec<_> =
1110mono_items.map(|mono_item| (mono_item, mono_item.symbol_name(tcx))).collect();
11111112symbols.sort_by_key(|sym| sym.1);
11131114for &[(mono_item1, ref sym1), (mono_item2, ref sym2)] in symbols.array_windows() {
1115if sym1 == sym2 {
1116let span1 = mono_item1.local_span(tcx);
1117let span2 = mono_item2.local_span(tcx);
11181119// Deterministically select one of the spans for error reporting
1120let span = match (span1, span2) {
1121 (Some(span1), Some(span2)) => {
1122Some(if span1.lo().0 > span2.lo().0 { span1 } else { span2 })
1123 }
1124 (span1, span2) => span1.or(span2),
1125 };
11261127 tcx.dcx().emit_fatal(SymbolAlreadyDefined { span, symbol: sym1.to_string() });
1128 }
1129 }
1130}
11311132fn collect_and_partition_mono_items(tcx: TyCtxt<'_>, (): ()) -> MonoItemPartitions<'_> {
1133let collection_strategy = if tcx.sess.link_dead_code() {
1134 MonoItemCollectionStrategy::Eager1135 } else {
1136 MonoItemCollectionStrategy::Lazy1137 };
11381139let (items, usage_map) = collector::collect_crate_mono_items(tcx, collection_strategy);
1140// Perform checks that need to operate on the entire mono item graph
1141target_specific_checks(tcx, &items, &usage_map);
11421143// If there was an error during collection (e.g. from one of the constants we evaluated),
1144 // then we stop here. This way codegen does not have to worry about failing constants.
1145 // (codegen relies on this and ICEs will happen if this is violated.)
1146tcx.dcx().abort_if_errors();
11471148let (codegen_units, _) = tcx.sess.time("partition_and_assert_distinct_symbols", || {
1149par_join(
1150 || {
1151let mut codegen_units = partition(tcx, items.iter().copied(), &usage_map);
1152codegen_units[0].make_primary();
1153&*tcx.arena.alloc_from_iter(codegen_units)
1154 },
1155 || assert_symbols_are_distinct(tcx, items.iter()),
1156 )
1157 });
11581159if tcx.prof.enabled() {
1160// Record CGU size estimates for self-profiling.
1161for cgu in codegen_units {
1162 tcx.prof.artifact_size(
1163"codegen_unit_size_estimate",
1164 cgu.name().as_str(),
1165 cgu.size_estimate() as u64,
1166 );
1167 }
1168 }
11691170let mono_items: DefIdSet = items1171 .iter()
1172 .filter_map(|mono_item| match *mono_item {
1173 MonoItem::Fn(ref instance) => Some(instance.def_id()),
1174 MonoItem::Static(def_id) => Some(def_id),
1175_ => None,
1176 })
1177 .collect();
11781179// Output monomorphization stats per def_id
1180if let SwitchWithOptPath::Enabled(ref path) = tcx.sess.opts.unstable_opts.dump_mono_stats
1181 && let Err(err) =
1182dump_mono_items_stats(tcx, codegen_units, path, tcx.crate_name(LOCAL_CRATE))
1183 {
1184tcx.dcx().emit_fatal(CouldntDumpMonoStats { error: err.to_string() });
1185 }
11861187if tcx.sess.opts.unstable_opts.print_mono_items {
1188let mut item_to_cgus: UnordMap<_, Vec<_>> = Default::default();
11891190for cgu in codegen_units {
1191for (&mono_item, &data) in cgu.items() {
1192 item_to_cgus.entry(mono_item).or_default().push((cgu.name(), data.linkage));
1193 }
1194 }
11951196let mut item_keys: Vec<_> = items1197 .iter()
1198 .map(|i| {
1199let mut output = { let _guard = NoTrimmedGuard::new(); i.to_string() }with_no_trimmed_paths!(i.to_string());
1200output.push_str(" @@");
1201let mut empty = Vec::new();
1202let cgus = item_to_cgus.get_mut(i).unwrap_or(&mut empty);
1203cgus.sort_by_key(|(name, _)| *name);
1204cgus.dedup();
1205for &(ref cgu_name, linkage) in cgus.iter() {
1206 output.push(' ');
1207 output.push_str(cgu_name.as_str());
12081209let linkage_abbrev = match linkage {
1210 Linkage::External => "External",
1211 Linkage::AvailableExternally => "Available",
1212 Linkage::LinkOnceAny => "OnceAny",
1213 Linkage::LinkOnceODR => "OnceODR",
1214 Linkage::WeakAny => "WeakAny",
1215 Linkage::WeakODR => "WeakODR",
1216 Linkage::Internal => "Internal",
1217 Linkage::ExternalWeak => "ExternalWeak",
1218 Linkage::Common => "Common",
1219 };
12201221 output.push('[');
1222 output.push_str(linkage_abbrev);
1223 output.push(']');
1224 }
1225output1226 })
1227 .collect();
12281229item_keys.sort();
12301231for item in item_keys {
1232{ ::std::io::_print(format_args!("MONO_ITEM {0}\n", item)); };println!("MONO_ITEM {item}");
1233 }
1234 }
12351236MonoItemPartitions { all_mono_items: tcx.arena.alloc(mono_items), codegen_units }
1237}
12381239/// Outputs stats about instantiation counts and estimated size, per `MonoItem`'s
1240/// def, to a file in the given output directory.
1241fn dump_mono_items_stats<'tcx>(
1242 tcx: TyCtxt<'tcx>,
1243 codegen_units: &[CodegenUnit<'tcx>],
1244 output_directory: &Option<PathBuf>,
1245 crate_name: Symbol,
1246) -> Result<(), Box<dyn std::error::Error>> {
1247let output_directory = if let Some(directory) = output_directory {
1248 fs::create_dir_all(directory)?;
1249directory1250 } else {
1251Path::new(".")
1252 };
12531254let format = tcx.sess.opts.unstable_opts.dump_mono_stats_format;
1255let ext = format.extension();
1256let filename = ::alloc::__export::must_use({
::alloc::fmt::format(format_args!("{0}.mono_items.{1}", crate_name,
ext))
})format!("{crate_name}.mono_items.{ext}");
1257let output_path = output_directory.join(&filename);
1258let mut file = File::create_buffered(&output_path)?;
12591260// Gather instantiated mono items grouped by def_id
1261let mut items_per_def_id: FxIndexMap<_, Vec<_>> = Default::default();
1262for cgu in codegen_units {
1263 cgu.items()
1264 .keys()
1265// Avoid variable-sized compiler-generated shims
1266.filter(|mono_item| mono_item.is_user_defined())
1267 .for_each(|mono_item| {
1268 items_per_def_id.entry(mono_item.def_id()).or_default().push(mono_item);
1269 });
1270 }
12711272#[derive(#[doc(hidden)]
#[allow(non_upper_case_globals, unused_attributes, unused_qualifications,
clippy :: absolute_paths,)]
const _: () =
{
#[allow(unused_extern_crates, clippy :: useless_attribute)]
extern crate serde as _serde;
;
#[automatically_derived]
impl _serde::Serialize for MonoItem {
fn serialize<__S>(&self, __serializer: __S)
-> _serde::__private228::Result<__S::Ok, __S::Error> where
__S: _serde::Serializer {
let mut __serde_state =
_serde::Serializer::serialize_struct(__serializer,
"MonoItem", false as usize + 1 + 1 + 1 + 1)?;
_serde::ser::SerializeStruct::serialize_field(&mut __serde_state,
"name", &self.name)?;
_serde::ser::SerializeStruct::serialize_field(&mut __serde_state,
"instantiation_count", &self.instantiation_count)?;
_serde::ser::SerializeStruct::serialize_field(&mut __serde_state,
"size_estimate", &self.size_estimate)?;
_serde::ser::SerializeStruct::serialize_field(&mut __serde_state,
"total_estimate", &self.total_estimate)?;
_serde::ser::SerializeStruct::end(__serde_state)
}
}
};serde::Serialize)]
1273struct MonoItem {
1274 name: String,
1275 instantiation_count: usize,
1276 size_estimate: usize,
1277 total_estimate: usize,
1278 }
12791280// Output stats sorted by total instantiated size, from heaviest to lightest
1281let mut stats: Vec<_> = items_per_def_id1282 .into_iter()
1283 .map(|(def_id, items)| {
1284let name = { let _guard = NoTrimmedGuard::new(); tcx.def_path_str(def_id) }with_no_trimmed_paths!(tcx.def_path_str(def_id));
1285let instantiation_count = items.len();
1286let size_estimate = items[0].size_estimate(tcx);
1287let total_estimate = instantiation_count * size_estimate;
1288MonoItem { name, instantiation_count, size_estimate, total_estimate }
1289 })
1290 .collect();
1291stats.sort_unstable_by_key(|item| cmp::Reverse(item.total_estimate));
12921293if !stats.is_empty() {
1294match format {
1295 DumpMonoStatsFormat::Json => serde_json::to_writer(file, &stats)?,
1296 DumpMonoStatsFormat::Markdown => {
1297file.write_fmt(format_args!("| Item | Instantiation count | Estimated Cost Per Instantiation | Total Estimated Cost |\n"))writeln!(
1298file,
1299"| Item | Instantiation count | Estimated Cost Per Instantiation | Total Estimated Cost |"
1300)?;
1301file.write_fmt(format_args!("| --- | ---: | ---: | ---: |\n"))writeln!(file, "| --- | ---: | ---: | ---: |")?;
13021303for MonoItem { name, instantiation_count, size_estimate, total_estimate } in stats {
1304file.write_fmt(format_args!("| `{0}` | {1} | {2} | {3} |\n", name,
instantiation_count, size_estimate, total_estimate))writeln!(
1305 file,
1306"| `{name}` | {instantiation_count} | {size_estimate} | {total_estimate} |"
1307)?;
1308 }
1309 }
1310 }
1311 }
13121313Ok(())
1314}
13151316pub(crate) fn provide(providers: &mut Providers) {
1317providers.queries.collect_and_partition_mono_items = collect_and_partition_mono_items;
13181319providers.queries.is_codegened_item =
1320 |tcx, def_id| tcx.collect_and_partition_mono_items(()).all_mono_items.contains(&def_id);
13211322providers.queries.codegen_unit = |tcx, name| {
1323tcx.collect_and_partition_mono_items(())
1324 .codegen_units
1325 .iter()
1326 .find(|cgu| cgu.name() == name)
1327 .unwrap_or_else(|| {
::core::panicking::panic_fmt(format_args!("failed to find cgu with name {0:?}",
name));
}panic!("failed to find cgu with name {name:?}"))
1328 };
13291330providers.queries.size_estimate = |tcx, instance| {
1331match instance.def {
1332// "Normal" functions size estimate: the number of
1333 // statements, plus one for the terminator.
1334InstanceKind::Item(..)
1335 | InstanceKind::DropGlue(..)
1336 | InstanceKind::AsyncDropGlueCtorShim(..) => {
1337let mir = tcx.instance_mir(instance.def);
1338mir.basic_blocks
1339 .iter()
1340 .map(|bb| {
1341bb.statements
1342 .iter()
1343 .filter_map(|stmt| match stmt.kind {
1344 StatementKind::StorageLive(_) | StatementKind::StorageDead(_) => {
1345None1346 }
1347_ => Some(stmt),
1348 })
1349 .count()
1350 + 1
1351})
1352 .sum()
1353 }
1354// Other compiler-generated shims size estimate: 1
1355_ => 1,
1356 }
1357 };
13581359 collector::provide(providers);
1360}