-------------------------------------------------------------------------------- Fujitsu PRIMEPOWER flags/tunables decription (Nov.05 2002) -------------------------------------------------------------------------------- Fujitsu Parallelnavi 1.0.2 compiler flag description Compiler options Remark -------------------------------------------------------------------------------- -Am Required if a source file contains modules which will be referenced by USE statements in other source files or if a source file contains USE statements that reference modules in another source file. -dy/-dn Specifies dynamic(-dy) or static(-dn) linkage of libraries. -dy is the default unless -Kfast_GP=n (n>=3) is specified and -Klargepage is not specified, in that case -dn is the default. -f omitmsg Set the level of diagnostic messages output and inhibit specific messages. omitmsg is one of i, w, or s, and/or a list of msgnum. i : All messages are output, this is the default. w : i level messages are not output. s : i and w level messages are not output. msgnum : Message number msgnum is inhibited. msgnum must be an i or w level message. -Fixed Specifies that Fortran source programs are written in fixed source form. -fs Do not print any warnings or diagnostic messages other than fatal errors. -Kbcopy Convert memory copy loop to memmove or memcpy function. -Kcfunc This uses high speed mathematical functions and library functions (malloc,calloc,realloc,free) prepared by this compilation system. -Kcommonpad[=N] Insert padding elements in common blocks for efficient use of cache. N can be specified from 4 to 4096. -Kcond This generates conditional-move instruction. This option is effective only if -Kcrossfile option is also specified. -Kcrossfile This option specifies crossfile optimization. If program consists of several files, the compiler refers these files at one time, and analyzes data dependency and control relation across these files. -Keval This option specifies optimization with changing of expression evaluation order. -Kfast_GP[={0|1|2|3|4|5}] This performs optimization for SPARC64 GP series. 0: This performs optimization suitable for SPARC64 GP. 1: This generates multiply and add instruction in addition to -Kfast_GP=0. (default) 2: This performs reordering of expression evaluation in addition to -Kfast_GP=1. 3:(C and C++ compiler) This generates crossfile optimization, and inter- procedural optimization in addition to -Kfast_GP=2. (Fortran compiler) This performs loop restructuring optimization in addition to -Kfast_GP=2. 4:(C and C++ compiler only) This generates advanced branch optimization in addition to -Kfast_GP=3. 5:(C and C++ compiler only) This generates global instruction scheduling optimization suit for scientific application in addition to -Kfast_GP=3. -Kfconst This option specifies to treat a floating-point type constant without an extension as float type. -Kfrecipro This option specifies to convert a floating point division into multiplication by the reciprocal. -Kfuncalign=n Adjust entry of functional alignment at n-byte boundary. -Kfuse Fuses neighboring loops. -KGREG The global registers g2 through g7 are subject to register allocation in the compile stage. -Kgs Performs global instruction scheduling. -Kilfunc This option replaces several and double precision mathematical functions,sin,cos,log10,log and exp with complier builtin functions. -Klargepage Specifies to generate executable program which utilizes Parallelnavi largepage facility. -Kmemalias In case of indirect memory access through pointer, when the accessing types are different, no memory aliase is assumed. -KNOFLTLD[=N] This option generates non-faulting load instruction. When non-faulting load instruction is used, signal does not occur even if given address is invalid. Memory access can be done before the memory address validation by using this option. N is maximum number of non-faulting load instruction for each loop or branch instruction. The compiler determine the default value of N when omitted N can be a number from 1 to 255. -Knogs This option invalidates global instruction scheduling. -Knoiopt This option prevents interprocedural optimization. -Knoprefetch Suppresses use the prefetch instruction. -Knounroll Prevents loop unrolling optimizations. -Konefile Generate only one temporary file (.s file or .o file) when multiple source files are compiled in one compilation. -Kpg Generates instructions to produce a profile file for subsequent optimization (global instruction scheduling etc.). -Kpopt This option specifies the optimization data pointed to by pointers using a limited interpretation that the areas referred by pointers are only referred by pointers. -Kpreex This option specifies the optimization by moving the evaluation of invariant expressions beyond branch. -Kpreload Moves load instructions across branches. -Kpu[=file] This performs optimization (global instruction scheduling, etc.) using program runtime profile information obtained by specifying -Kpg option. -Kprefetch[={1|2|3|4}] Generate prefetch instruction correspond to each prefetch level. 1: Basic level prefetch for array elements only inner-most loop. 2: In addition to the -Kprefetch=1, generates the prefetch instruction for array elements within the loop pre-header which access the first iteration in the loop. 3: In addition to the -Kprefetch=2, when the stride of access for array elements are larger than cache line size, compiler generates prefetch instruction for each cache line size access. 4: Maximum level for generating prefetch instruction. In addition to -Kprefetch=3, compiler generates the prefetch instruction for array elements which access in the outer loop. -Kprefetch_iteration=N Generate the prefetch instruction of the data which is referred after N iterations. -Krestp This option specifies arguments optimization based on ANSI (C Standard). -Kstaticclump This option specifies to gather all static and global variables except large array to access with same base address and index. -Kswitchopt This option specifies to change case organization by execution ratio of each case statements. -Kunroll[=N] Performs loop unrolling. N means upper limit of unrolling expansion number, whose value should be from 2 to 9999. -Kuse_rodata This option specifies whether string constant, floating point constant and initialization value of aggregate type local storage variable are allocated to read-only data section. -KV9FMADD Indicates that SPARC V9 instructions and multiply add/subtract instructions are generated. -Kxi=N Inline expansion, instead of function calls, is performed. Expanded function is selected by result of profiler. N is the percentage that means increased object size. -lssl2mtfma This option specifies libraries of SSL II, SSL II Thread- Parallel Capabilities and BLAS/LAPACK to be linked. -O[level] Specifies the optimization level. 0: No optimization. 1: Basic optimization. 2: Loop unrolling in addition to -O1. 3: Global instruction scheduling and restructuring of nested loop in addition to -O2. 4: Enhanced optimization of loop restructuring rather than -O3. -x- Inline expansion, instead of function calls, is performed for all functions defined in the C source code. -Xa The C language specifications used at compile time is based on the specifications of the C language standard (ANSI standard). This is the default mode -x dir=directory_name Performs inline expansion of procedures defined in the files under the directory specified and in the file currently being compiled. -x stm_no Applying optimization for inline expansion of user-defined external procedure having fewer than specified number of execution statements in the stm_no arguments. -------------------------------------------------------------------------------- Fujitsu PRIMEPOWER flags/tunables decription (Nov.05 2002) Fujitsu Parallelnavi 2.0.1 compiler flag description -------------------------------------------------------------------------------- -Am Required if a source file contains modules which will be referenced by USE statements in other source files or if a source file contains USE statements that reference modules in another source file. -dy/-dn Specifies dynamic(-dy) or static(-dn) linkage of libraries. -dy is the default unless -Kfast_GP=n (n>=3) is specified and -Klargepage is not specified, in that case -dn is the default. -f omitmsg Set the level of diagnostic messages output and inhibit specific messages. omitmsg is one of i, w, or s, and/or a list of msgnum. i : All messages are output, this is the default. w : i level messages are not output. s : i and w level messages are not output. msgnum : Message number msgnum is inhibited. msgnum must be an i or w level message. -Fixed Specifies that Fortran source programs are written in fixed source form. -fs Do not print any warnings or diagnostic messages other than fatal errors. -Kalignc[=N] Adjust entry of global data alignment at n-byte boundary. N can be specified from 1 to 32768. -Kalignl[=N] Adjust entry of local data alignment at n-byte boundary. N can be specified from 1 to 32768. -Kauto Local variables (without an initial value or the SAVE attribute) are allocated on the runtime stack. Their values are lost when the procedure ends. -Kcfunc This uses high speed mathematical functions and library functions (malloc,calloc,realloc,free) prepared by this compilation system. -Kcommonpad[=N] Insert padding elements in common blocks for efficient use of cache. N can be specified from 4 to 4096. -Karraypad_const[=N] Insert padding elements after each row of an array whose size is declared with constants for efficient use of cache. -Kcrossfile This option specifies crossfile optimization. If program consists of several files, the compiler refers these files at one time, and analyzes data dependency and control relation across these files. -Kfast_GP2[={0|1|2}] This performs optimization for SPARC64 GP2 series. 0: This performs optimization suitable for SPARC64 GP2. 1: This generates multiply and add instruction in addition to -Kfast_GP2=0. (default) 2: This performs reordering of expression evaluation in addition to -Kfast_GP2=1. -KFMADD This option specifies use of the combined multiply- add/subtract floating-point instructions. -Kfrecipro This option specifies to convert a floating point division into multiplication by the reciprocal. -Kfuse Fuses neighboring loops. -KGREG The global registers g2 through g7 are subject to register allocation in the compile stage. -KGREG_SYSTEM The global registers g5 through g7 are subject to register allocation in the compile stage. -Kgs Performs global instruction scheduling. -Kilfunc This option replaces several and double precision mathematical functions,sin,cos,log10,log and exp with complier builtin functions. -Klargepage Specifies to generate executable program which utilizes Parallelnavi largepage facility. -KNOFMADD This option suppresses use of the combined multiply- add/subtract floating-point instructions. -Knoprefetch Suppresses use the prefetch instruction. -Knounroll Prevents loop unrolling optimizations. -Knovfunc Suppresses to change the intrinsic function (including power operation) to a multi-operation function. -Kpreex This option specifies the optimization by moving the evaluation of invariant expressions beyond branch. -Kpopt This option specifies the optimization data pointed to by pointers using a limited interpretation that the areas referred by pointers are only referred by pointers. -Kpg Generates instructions to produce a profile file for subsequent optimization (global instruction scheduling etc.). -Kprefetch[={1|2|3|4}] Generate prefetch instruction correspond to each prefetch level. 1: Basic level prefetch for array elements only inner-most loop. 2: In addition to the -Kprefetch=1, generates the prefetch instruction for array elements within the loop pre-header which access the first iteration in the loop. 3: In addition to the -Kprefetch=2, when the stride of access for array elements are larger than cache line size, compiler generates prefetch instruction for each cache line size access. 4: Maximum level for generating prefetch instruction. In addition to -Kprefetch=3, compiler generates the prefetch instruction for array elements which access in the outer loop. -Kprefetch_cache_level=[1/2/3] LEVEL-1: Generate prefetch instructions in order for the data to reside in the primary cache. LEVEL-2: Generate prefetch instructions in order for the data to reside only in the secondary cache. LEVEL-3: Generate both level-1 and -2 prefetch instructions. -Kprefetch_infer This option is specified to be omitted of address calculation for prefetch while is prefetch by distance is guessed from existing information -Kprefetch_iteration=N Generate the prefetch instruction of the data which is referred after N iterations. -Kprefetch_line=N Generate the prefetch instruction to get the data located N times bytes in a line of the primary data cache ahead of the address a neighboring load or store instruction points. -Kprefetch_line_L2=N Same to -Kprefetch_line=N, except the data reside only in the secondary cache. -Kpreload Moves load instructions across branches. -Kpreschedule_length[=N] Specifies the schedule length of pre-pass scheduler. This is effective only when -O5 option is used. -Kpu[=file] This performs optimization (global instruction scheduling, etc.) using program runtime profile information obtained by specifying -Kpg option. -Kunroll[=N] Performs loop unrolling. N means upper limit of unrolling expansion number, whose value should be from 2 to 9999. -Kvect Vectorize the body of an innermost loop before the optimization, then unvectorize. -O[level] Specifies the optimization level. 0: No optimization. 1: Basic optimization. 2: Loop unrolling in addition to -O1. 3: Global instruction scheduling and restructuring of nested loop in addition to -O2. 4: Enhanced optimization of loop restructuring rather than -O3. 5: Creates an object program by applying further optimizations of register allocation in addition to -O4. -SSL2 The whole set of routines from SSL II, SSL II Thread- Parallel Capabilities and BLAS/LAPACK becomes part of link-edit libraries. -x- Inline expansion, instead of function calls, is performed for all functions defined in the C source code. -x stm_no Applying optimization for inline expansion of user-defined external procedure having fewer than specified number of execution statements in the stm_no arguments. -x dir=directory_name Performs inline expansion of procedures defined in the files under the directory specified and in the file currently being compiled. -------------------------------------------------------------------------------- Sun C, C++ and Fortran Sun ONE Studio 7 Last updated: 10/02/2002 Compiler options Description -------------------------------------------------------------------------------- -D Set definition for preprocessor. -dalign Assume data is naturally aligned. -dn Specify static binding. -e Accept extended (132 character) input source lines (FORTRAN). -fast This is a convenience option for selecting a set of optimizations for performance. In all three of Fortran, C, and C++, -fast implies: * -dalign, align memory objects on their natural boundaries (also spelt as -xmemalign=8s, meaning that doubles (8 byte) items are correctly aligned, and if an incorrect item is found, a signal should be raised). * -fsimple=2 permitting reordering of floating point operations. * -fns disable gradual underflow * -libmil use inline expansion for math library, libm (also spelt -xlibmil) * Optimization level 5 (spelt either as -O5 or -xO5) * -xtarget=native, the native best machine characteristics option (also sometimes spelt -native). This option implies all three of -xarch=native, -xchip=native, -xcache=native In addition, the -fast option sets the following options for the compilers noted: * -depend (Fortran and C): analyze loops for data dependendencies (also spelt -xdepend) * -D__MATHERR_ERRNO_DONTCARE (C): errno may not be set for math functions. * -fsingle (C): evaluate float expressions as single precision * -ftrap=%none (f77, C, C++) turn off IEEE 754 trapping * -ftrap=common (f90) limit IEEE 754 trapping to common cases * -xlibmopt (Fortran and C++): select the optimized math library * -pad=local (Fortran): pad local variables * -xalias_level=basic (C) assume ISO C9X aliasing rules for basic types only. * -xbuiltin=%all (C and C++): substitute intrinsic functions or inline system functions where profitable for performance. * -xprefetch=yes (Fortran and C): enable prefetching * -xprefetch_level=2 (Fortran): set the level of searching that the prefetcher will do (valid values are 1, 2, or 3, where higher values mean more searching) * -xvector=yes (Fortran): permits the compiler to transform math library calls within DO loops into single calls to the equivalent vector math routines when such transformations are possible. -fixed Accept fixed-format input source files (FORTRAN). -fns Disable gradual underflow, allowing tiny results to be flushed to zero rather than producing subnormal numbers (non-standard according to IEEE 754) -fsimple=0 Permits no simplifying assumptions. Preserves strict IEEE 754 conformance. -fsimple=1 With -fsimple=1, the optimizer can assume the following: * The IEEE 754 default rounding/trapping modes do not change after process initialization. * Computations producing no visible result other than potential floating-point exceptions may be deleted. * Computations with Infinity or NaNs as operands need not propagate NaNs to their results. For example, x*0 may be replaced by 0. * Computations do not depend on sign of zero. -fsimple=2 Permits more aggressive floating point optimizations that may cause programs to produce different numeric results due to changes in rounding. Even with -fsimple=2, the optimizer still is not permitted to introduce a floating point exception in a program that otherwise produces none. -ftrap=t Sets the IEEE 754 trapping mode in effect at startup. t is a comma-separated list that consists of one or more of the following: %all, %none, common, [no%]invalid, [no%]overflow, [no%]underflow, [no%]division, [no%]inexact. The default is -ftrap=%none for f77, C, and C++; for f90 the default is common. This option sets the IEEE 754 trapping modes that are established at program initialization. Processing is left-to-right. The common exceptions, by definition, are invalid, division by zero, and overflow. %none turns off all trapping modes. common traps on invalid, division by zero, and overflow. Do not use this option for programs that depend on IEEE standard exception handling; you can get different numerical results, premature program termination, or unexpected SIGFPE signals. -libmil Use inline expansion templates for libm. -library=iostream Use "classic" (pre 1998 C++ standard) iostream library Prior to the C++ standard (1998), there was one iostream library, what is now often called "classic" iostreams. The C++ standard defines a different, but similar, iostream library, which we call "standard" iostreams. To get classic iostreams in standard (default) mode, you use the option "-library=iostream". -ll2amm Library containing chip specific memory routines. -lm Link with math library -lmopt This chooses the math library that is optimized for speed -lprism32 Library to enable Intimate Shared Memory (ISM) (4MB page) usage. -lsunperf Link with the Sun Performance Library (netlib and Society for Industrial and Applied Mathematics (SIAM) routines) -native Select native machine characteristics for optimization. Equivalent to setting all three of -xarch=native, -xchip=native, -xcache=native -Qicache-chbab=1 See -Wc,-Qicache-chbab=1 -Qoption <phase> Pass flags along to compiler phase: <flags> f90comp Fortran first pass iropt Global optimizer cg Code generator -Qoption cg See -Wc,<flags> below. (The code generation phase is <flags> addressed via -Qoption cg in Fortran and C++; and via -Wc in C.) -Qoption f90comp Enable padding of f90 arrays by n. -array_pad_rows, <n> -Qoption f90comp Enable f90 array expansion. -expansion -qoption f90comp This reduces the optimization level of the f90 -O3 front/middle end to O3. The effect is to turn off loop cloning and unrolling (note that it has no effect on cg's loop unrolling). -Qoption iropt See also -W2,<flags> below. (The optimizer can be <flags> addressed either via Qoption iropt in Fortran and C++, or via -W2 in C.) -Qoption iropt enable optimizations based on data access patterns -Adata_access -Qoption iropt Set memory store operation weight for loop -Addint:sf=9 interchange to 9 -Qoption iropt See -W2,-Amemopt -Amemopt -Qoption iropt See -W2,-Ma<n> -Ma<n> -Qoption iropt See -W2,-Mm<n> -Mm<n> -Qoption iropt Do not inline calls when parameters are arrays and -MR actual array dimensions and formal array dimensions are mismatched -Qoption iropt See -W2,-Mr<n> -Mr<n> -Qoption iropt disable scalar replacement optimization -Rscalarrep -Qoption iropt Same as -Qoption iropt -Rscalarrep plus -Qoption -Rscalarrep,-MR iropt -MR -Qoption iropt See -W2,-whole -whole -stackvar Allocate routine local variables on stack (FORTRAN). -W<phase>,<flags> Pass flags along to compiler phase (2=optimizer, c=code generator) -W2,-Abopt Enable aggressive optimizations of all branches. -W2,-Adata_access Enable optimizations based on data access patterns. -W2,-Aheap Allows the compiler to recognize malloc-like memory allocation functions. -W2,-Ainline Perform Inter-Procedural Analysis (IPA) -based inlining. -W2,-Aivel: More aggresive strength reduction by replicating duplicate_loops loops. -W2,-Amemopt Memory access optimization. This does whole-program mode inter-procedural memory access analysis, merges memory allocations, and performs cache conscious data layout program transformations. -W2,-Amemopt: Reconstruct array subscripts during memory arrayloc allocation merging and data layout program transformation -W2,-Ashort_ldst Convert multiple short memory operations into single long memory operations. -W2,-Aunroll Enables outer-loop unrolling. -W2,-crit Enable optimization of critical control paths -W2,-Ma<n> Enable inlining of routines with frame size up to n. -W2,-Mm<n> Maximum module increase limit for inlining. -W2,-Mp<n> Procedures with entry counts equal or greater than n become candidates for inlining. -W2,-Mr<n> Maximum code increase due to inlining is limited to n triples. -W2,-Ms<n> Maximum level of recursive inlining. -W2,-Mt<n> The maximum size of a routine body eligible for inlining is limited to n triples. -W2, Assume that different global pointer variables point -O4+restrict_g to their own memory locations. -W2,-reroll=1 Turns on loop rerolling. -W2,-whole Do whole program optimizations. -Wc, Enable early cross-call instruction scheduling. -Qdepgraph-early_ cross_call=1 -Wc,-Qeps: Allow generating speculative load during EPS. do_spec_load=1 -Wc,-Qeps: Use enhanced pipeline scheduling(EPS) and selective enabled=1 scheduling algorithms for instruction scheduling. -Wc,-Qeps: Turn off register pressure heuristic in EPS. rp_filtering_ margin=100 -Wc,-Qgsched-T4 Sets the aggressiveness of the trace formation. -Wc, Turns on the late trace scheduler. -Qgsched-trace_ late=1 -Wc, Turns on the conversion of loads to non-faulting -Qgsched-trace_ loads inside the trace. spec_load=1 -Wc, Turn on optimization to reduce branch after branch -Qicache-chbab=1 penalty -Wc, Inline calls to memcpy with n bytes or fewer being -Qinline_memcpy=<n> copied -Wc,-Qipa: Use profile feedback data to predict values and valueprediction attempt to generate faster code along these control paths, even at the expense of possibly slower code along paths leading to different values. Correct code is generated for both paths. -Wc, Do function entry alignment at n-byte boundaries. -Qiselect- funcalign=<n> -Wc, Peels the most frequent test branches/cases off a -Qiselect-sw_pf_ switch until the branch probability reaches less tbl_th=<n> than 1/n. This is effective only when profile feedback is used -Wc,-Qlp=1-av=<nav> Control irregular loop prefetching. -t=<nt>-fa=1-fl=1 lp=1 turns on the module (default is on for F90; off for C/C++) av Sets the prefetch look ahead to <nav>. t Make <nt> attempts at prefetching. fa=1 forces user settings to override internally computed values. fl=1 forces the optimization to be turned on for all languages. -Wc,-Qms_pipe+ Use fp divide for signed integer division intdivusefp -Wc,-Qms_pipe-pref Turn off prefetching within modulo scheduling -Wc,-Qpeep-Sh0 Disables the max live base registers algorithm for sethi hoisting. -Xa Assume ANSI C conformance, allow K & R extensions. (default mode) -xalias_level=<a> Allows compiler to perform type-based alias analysis (C compiler) at the given alias level. basic assume ISO C9X aliasing rules for basic types only. std assume ISO C9X aliasing rules. strong assume all pointers are type safe (strongly typed). -xalias_level=<a> Allows compiler to perform type-based alias analysis (C++ compiler) at the given alias level. any assume any type may alias any other type. simple assume fundamental types are not aliased. compatible assume layout-incompatible types are not aliased. -xarch=<a> Limit the set of instructions the compiler may use to generic, generic64, native, native64, v7, v8a, v8, v8plus, v8plusa, v8plusb, v9, v9a, v9b. Typical settings include: UltraSPARC-II, 32-bit mode: v8plusa UltraSPARC-II, 64-bit mode: v9a UltraSPARC-III, 32-bit mode: v8plusb UltraSPARC-III, 64-bit mode: v9b For more information, see the Fortran User's Guide at docs.sun.com -Xc Assume strict ANSI C conformance. -xcache=<c> Defines the cache properties for use by the optimizer. c must be one of the following: * native (set parameters for the host environment) * s1/l1/a1 * s1/l1/a1:s2/l2/a2 * s1/l1/a1:s2/l2/a2:s3/l3/a3 The si/li/ai are defined as follows: si The size of the data cache at level i, in kilobytes. li The line size of the data cache at level i, in bytes. ai The associativity of the data cache at level i. -xchip=<c> Specifies the target processor for use by the optimizer. c must be one of: generic, generic64, native, native64, old, super, super2, micro, micro2, hyper, hyper2, powerup, ultra, ultra2, ultra2i, ultra3, 386, 486, pentium, pentium_pro, 603, 604. -xcrossfile Enable cross-file inlining. -xdepend Analyze loops for data dependencies. -xF Allow function reordering by the WorkShop Performance Analyzer -xinline= Turn off inlining -xipo=n Performs optimizations across all object files in the link step: 0=off, 1=on, 2=performs whole-program detection and analysis. The default is 0. -xlibmopt This chooses the math library that is optimized for speed. -xlic_lib=sunperf Link with Sun supplied licensed sunperf library. -xO1 Does basic local optimization (peephole). -xO2 xO1 and more local and global optimizations. -xO3 Besides what xO2 does, it optimizes references or definitions for external variables. Loop unrolling and software pipelining are also performed. -xO4 xO3 plus function inlining. -xO5 Besides what -xO4 does, it enables speculative code motion. -xpad=common[:<n>] Pad common block variables, for better use of cache. n specifies the amount of padding to apply. If no parameter is specified then the compiler selects one automatically. -xpad=local[:<n>] Pad local variables only, for better use of cache. n specifies the amount of padding to apply. If no parameter is specified then the compiler selects one automatically. -xparallel Use parallel processing to improve performance. -xprefetch Enable prefetch instructions on those architectures that support prefetch, such as UltraSPARC II (-xarch=v8plus, v8plusa, v9plusb, v9, v9a, or v9b) * auto Enable automatic generation of prefetch instructions * no%auto Disable automatic generation of prefetch instructions * explicit Enable explicit prefetch macros * no%explicit Disable explicit prefetch macros * yes the same as -xprefetch=auto,explicit * no the same as -xprefetch=no%auto,no%explicit Defaults: If -xprefetch is not specified, -xprefetch=no%auto,explicit is assumed. If only -xprefetch is specified, -xprefetch=auto,explicit is assumed. latx:factor Adjust the compiler's assumed prefetch-to-load and prefetch-to-store latencies by the specified factor. -xprefetch_level=n The compiler searches for prefetch opportunities when -xprefetch=auto is enabled. You can control the level of searching that the compiler does by setting n to 1, 2, or 3, where higher numbers mean do more searching. The default is 2. -xprofile=collect Collect profile data for feedback directed optimizations. -xprofile=use Use data collected for profile feedback. -xreduction Parallelize loops containing reductions. -xregs=syst Allows use of the system reserved registers %g6 and %g7, and %g5 if not already allowed by -xarch value. -xrestrict[=f1,..., Treat pointer-valued function parameters as f2,%all,%none] restricted pointers. The default is %none. Specifying -xrestrict is equivalent to specifying -xrestrict=%all. -xsafe=mem Enables the use of non-faulting loads when used in conjunction with -xarch=v8plus. Assumes that no memory based traps will occur. -xsfpconst Represents unsuffixed floating-point constants as single precision -Xt Assume K & R conformance, allow ANSI C. -xtarget=native Same as -native -xunroll=n Specifies whether or not the compiler optimizes (unrolls) loops. n is a positive integer. When n is 1, it is a command and the compiler unrolls no loops. When n is greater than 1, -xunroll=n merely suggests to the compiler that it unroll loops n times. -------------------------------------------------------------------------------- Environment Variables Flag Remark -------------------------------------------------------------------------------- LD_LIBRARY_PATH=<p> Specify the locations to resolve dynamic link dependencies PRISM_HEAP=<n> Set the heap size limit for large pages PRISM_MODE=2 Large page mode: Attempt to put text, data and heap all into large pages. -------------------------------------------------------------------------------- /etc/system (system configuration information file) description System Tunables Remark -------------------------------------------------------------------------------- consistent_coloring Controls the page coloring policy. It can be set to one of the following: 0: (default) dynamic (uses various vaddr bits) 1: static (virtual=paddr) tune_t_fsflushr The number of seconds between fsflush invocations for checking dirty memory. autoup The frequency of file system sync operations. shmsys:shminfo_shmmin Minimum size of system V shared memory segment that can be created. shmsys:shminfo_shmmax Maximum size of system V shared memory segment that can be created. shmsys:shminfo_shmmni System wide limit on number of shared memory segments that can be created. shmsys:shminfo_shmseg Limit on the number of shared memory segments that any one process can create. -------------------------------------------------------------------------------- -------------------------------------------------------------------------------- /etc/opt/FJSVpnrm/lpg.conf (Large page management information file) description Tunables Remark -------------------------------------------------------------------------------- TSS=size[unit] Size of total memory, to be used for large page segments. At start of the system, this amount of memory is reserved and initialized. "unit" can be M for mega-byte and G for giga-byte. SHMSEGSIZE=size[unit] Size of large page segment. "unit" can be M for mega-byte and G for giga-byte. JOB=size [unit] Specify the large page memory resource size for in-job processes. "unit" can be T (terra byte), G (giga byte), or M (mega byte) after size. LIMITPOLICY=job | proc Define the memory allocation/limitation type. job : Limits for each job in Node. proc: Limits for each process resource set. Default is job. --------------------------------------------------------------------------------