gas/doc/c-xtensa.texi

   1 @c Copyright (C) 2002-2021 Free Software Foundation, Inc.
   2 @c This is part of the GAS manual.
   3 @c For copying conditions, see the file as.texinfo.
   4 @c
   5 @c man end
   6 @ifset GENERIC
   7 @page
   8 @node Xtensa-Dependent
   9 @chapter Xtensa Dependent Features
  10 @end ifset
  11 @ifclear GENERIC
  12 @node Machine Dependencies
  13 @chapter Xtensa Dependent Features
  14 @end ifclear
  15
  16 @cindex Xtensa architecture
  17 This chapter covers features of the @sc{gnu} assembler that are specific
  18 to the Xtensa architecture.  For details about the Xtensa instruction
  19 set, please consult the @cite{Xtensa Instruction Set Architecture (ISA)
  20 Reference Manual}.
  21
  22 @menu
  23 * Xtensa Options::              Command-line Options.
  24 * Xtensa Syntax::               Assembler Syntax for Xtensa Processors.
  25 * Xtensa Optimizations::        Assembler Optimizations.
  26 * Xtensa Relaxation::           Other Automatic Transformations.
  27 * Xtensa Directives::           Directives for Xtensa Processors.
  28 @end menu
  29
  30 @node Xtensa Options
  31 @section Command-line Options
  32
  33 @c man begin OPTIONS
  34 @table @gcctabopt
  35
  36 @item --text-section-literals | --no-text-section-literals
  37 @kindex --text-section-literals
  38 @kindex --no-text-section-literals
  39 Control the treatment of literal pools.  The default is
  40 @samp{--no-@-text-@-section-@-literals}, which places literals in
  41 separate sections in the output file.  This allows the literal pool to be
  42 placed in a data RAM/ROM.  With @samp{--text-@-section-@-literals}, the
  43 literals are interspersed in the text section in order to keep them as
  44 close as possible to their references.  This may be necessary for large
  45 assembly files, where the literals would otherwise be out of range of the
  46 @code{L32R} instructions in the text section.  Literals are grouped into
  47 pools following @code{.literal_position} directives or preceding
  48 @code{ENTRY} instructions.  These options only affect literals referenced
  49 via PC-relative @code{L32R} instructions; literals for absolute mode
  50 @code{L32R} instructions are handled separately.
  51 @xref{Literal Directive, ,literal}.
  52
  53 @item --auto-litpools | --no-auto-litpools
  54 @kindex --auto-litpools
  55 @kindex --no-auto-litpools
  56 Control the treatment of literal pools.  The default is
  57 @samp{--no-@-auto-@-litpools}, which in the absence of
  58 @samp{--text-@-section-@-literals} places literals in separate sections
  59 in the output file.  This allows the literal pool to be placed in a data
  60 RAM/ROM.  With @samp{--auto-@-litpools}, the literals are interspersed
  61 in the text section in order to keep them as close as possible to their
  62 references, explicit @code{.literal_position} directives are not
  63 required.  This may be necessary for very large functions, where single
  64 literal pool at the beginning of the function may not be reachable by
  65 @code{L32R} instructions at the end.  These options only affect
  66 literals referenced via PC-relative @code{L32R} instructions; literals
  67 for absolute mode @code{L32R} instructions are handled separately.
  68 When used together with @samp{--text-@-section-@-literals},
  69 @samp{--auto-@-litpools} takes precedence.
  70 @xref{Literal Directive, ,literal}.
  71
  72 @item --absolute-literals | --no-absolute-literals
  73 @kindex --absolute-literals
  74 @kindex --no-absolute-literals
  75 Indicate to the assembler whether @code{L32R} instructions use absolute
  76 or PC-relative addressing.  If the processor includes the absolute
  77 addressing option, the default is to use absolute @code{L32R}
  78 relocations.  Otherwise, only the PC-relative @code{L32R} relocations
  79 can be used.
  80
  81 @item --target-align | --no-target-align
  82 @kindex --target-align
  83 @kindex --no-target-align
  84 Enable or disable automatic alignment to reduce branch penalties at some
  85 expense in code size.  @xref{Xtensa Automatic Alignment, ,Automatic
  86 Instruction Alignment}.  This optimization is enabled by default.  Note
  87 that the assembler will always align instructions like @code{LOOP} that
  88 have fixed alignment requirements.
  89
  90 @item --longcalls | --no-longcalls
  91 @kindex --longcalls
  92 @kindex --no-longcalls
  93 Enable or disable transformation of call instructions to allow calls
  94 across a greater range of addresses.  @xref{Xtensa Call Relaxation,
  95 ,Function Call Relaxation}.  This option should be used when call
  96 targets can potentially be out of range.  It may degrade both code size
  97 and performance, but the linker can generally optimize away the
  98 unnecessary overhead when a call ends up within range.  The default is
  99 @samp{--no-@-longcalls}.
 100
 101 @item --transform | --no-transform
 102 @kindex --transform
 103 @kindex --no-transform
 104 Enable or disable all assembler transformations of Xtensa instructions,
 105 including both relaxation and optimization.  The default is
 106 @samp{--transform}; @samp{--no-transform} should only be used in the
 107 rare cases when the instructions must be exactly as specified in the
 108 assembly source.  Using @samp{--no-transform} causes out of range
 109 instruction operands to be errors.
 110
 111 @item --rename-section @var{oldname}=@var{newname}
 112 @kindex --rename-section
 113 Rename the @var{oldname} section to @var{newname}.  This option can be used
 114 multiple times to rename multiple sections.
 115
 116 @item --trampolines | --no-trampolines
 117 @kindex --trampolines
 118 @kindex --no-trampolines
 119 Enable or disable transformation of jump instructions to allow jumps
 120 across a greater range of addresses.  @xref{Xtensa Jump Relaxation,
 121 ,Jump Trampolines}.  This option should be used when jump targets can
 122 potentially be out of range.  In the absence of such jumps this option
 123 does not affect code size or performance.  The default is
 124 @samp{--trampolines}.
 125
 126 @item --abi-windowed | --abi-call0
 127 @kindex --abi-windowed
 128 @kindex --abi-call0
 129 Choose ABI tag written to the @code{.xtensa.info} section.  ABI tag
 130 indicates ABI of the assembly code.  A warning is issued by the linker
 131 on an attempt to link object files with inconsistent ABI tags.
 132 Default ABI is chosen by the Xtensa core configuration.
 133 @end table
 134
 135 @c man end
 136
 137 @node Xtensa Syntax
 138 @section Assembler Syntax
 139 @cindex syntax, Xtensa assembler
 140 @cindex Xtensa assembler syntax
 141 @cindex FLIX syntax
 142
 143 Block comments are delimited by @samp{/*} and @samp{*/}.  End of line
 144 comments may be introduced with either @samp{#} or @samp{//}.
 145
 146 If a @samp{#} appears as the first character of a line then the whole
 147 line is treated as a comment, but in this case the line could also be
 148 a logical line number directive (@pxref{Comments}) or a preprocessor
 149 control command (@pxref{Preprocessing}).
 150
 151 Instructions consist of a leading opcode or macro name followed by
 152 whitespace and an optional comma-separated list of operands:
 153
 154 @smallexample
 155 @var{opcode} [@var{operand}, @dots{}]
 156 @end smallexample
 157
 158 Instructions must be separated by a newline or semicolon (@samp{;}).
 159
 160 FLIX instructions, which bundle multiple opcodes together in a single
 161 instruction, are specified by enclosing the bundled opcodes inside
 162 braces:
 163
 164 @smallexample
 165 @group
 166 @{
 167 [@var{format}]
 168 @var{opcode0} [@var{operands}]
 169 @end group
 170 @var{opcode1} [@var{operands}]
 171 @group
 172 @var{opcode2} [@var{operands}]
 173 @dots{}
 174 @}
 175 @end group
 176 @end smallexample
 177
 178 The opcodes in a FLIX instruction are listed in the same order as the
 179 corresponding instruction slots in the TIE format declaration.
 180 Directives and labels are not allowed inside the braces of a FLIX
 181 instruction.  A particular TIE format name can optionally be specified
 182 immediately after the opening brace, but this is usually unnecessary.
 183 The assembler will automatically search for a format that can encode the
 184 specified opcodes, so the format name need only be specified in rare
 185 cases where there is more than one applicable format and where it
 186 matters which of those formats is used.  A FLIX instruction can also be
 187 specified on a single line by separating the opcodes with semicolons:
 188
 189 @smallexample
 190 @{ [@var{format};] @var{opcode0} [@var{operands}]; @var{opcode1} [@var{operands}]; @var{opcode2} [@var{operands}]; @dots{} @}
 191 @end smallexample
 192
 193 If an opcode can only be encoded in a FLIX instruction but is not
 194 specified as part of a FLIX bundle, the assembler will choose the
 195 smallest format where the opcode can be encoded and
 196 will fill unused instruction slots with no-ops.
 197
 198 @menu
 199 * Xtensa Opcodes::              Opcode Naming Conventions.
 200 * Xtensa Registers::            Register Naming.
 201 @end menu
 202
 203 @node Xtensa Opcodes
 204 @subsection Opcode Names
 205 @cindex Xtensa opcode names
 206 @cindex opcode names, Xtensa
 207
 208 See the @cite{Xtensa Instruction Set Architecture (ISA) Reference
 209 Manual} for a complete list of opcodes and descriptions of their
 210 semantics.
 211
 212 @cindex _ opcode prefix
 213 If an opcode name is prefixed with an underscore character (@samp{_}),
 214 @command{@value{AS}} will not transform that instruction in any way.  The
 215 underscore prefix disables both optimization (@pxref{Xtensa
 216 Optimizations, ,Xtensa Optimizations}) and relaxation (@pxref{Xtensa
 217 Relaxation, ,Xtensa Relaxation}) for that particular instruction.  Only
 218 use the underscore prefix when it is essential to select the exact
 219 opcode produced by the assembler.  Using this feature unnecessarily
 220 makes the code less efficient by disabling assembler optimization and
 221 less flexible by disabling relaxation.
 222
 223 Note that this special handling of underscore prefixes only applies to
 224 Xtensa opcodes, not to either built-in macros or user-defined macros.
 225 When an underscore prefix is used with a macro (e.g., @code{_MOV}), it
 226 refers to a different macro.  The assembler generally provides built-in
 227 macros both with and without the underscore prefix, where the underscore
 228 versions behave as if the underscore carries through to the instructions
 229 in the macros.  For example, @code{_MOV} may expand to @code{_MOV.N}@.
 230
 231 The underscore prefix only applies to individual instructions, not to
 232 series of instructions.  For example, if a series of instructions have
 233 underscore prefixes, the assembler will not transform the individual
 234 instructions, but it may insert other instructions between them (e.g.,
 235 to align a @code{LOOP} instruction).  To prevent the assembler from
 236 modifying a series of instructions as a whole, use the
 237 @code{no-transform} directive.  @xref{Transform Directive, ,transform}.
 238
 239 @node Xtensa Registers
 240 @subsection Register Names
 241 @cindex Xtensa register names
 242 @cindex register names, Xtensa
 243 @cindex sp register
 244
 245 The assembly syntax for a register file entry is the ``short'' name for
 246 a TIE register file followed by the index into that register file.  For
 247 example, the general-purpose @code{AR} register file has a short name of
 248 @code{a}, so these registers are named @code{a0}@dots{}@code{a15}.
 249 As a special feature, @code{sp} is also supported as a synonym for
 250 @code{a1}.  Additional registers may be added by processor configuration
 251 options and by designer-defined TIE extensions.  An initial @samp{$}
 252 character is optional in all register names.
 253
 254 @node Xtensa Optimizations
 255 @section Xtensa Optimizations
 256 @cindex optimizations
 257
 258 The optimizations currently supported by @command{@value{AS}} are
 259 generation of density instructions where appropriate and automatic
 260 branch target alignment.
 261
 262 @menu
 263 * Density Instructions::        Using Density Instructions.
 264 * Xtensa Automatic Alignment::  Automatic Instruction Alignment.
 265 @end menu
 266
 267 @node Density Instructions
 268 @subsection Using Density Instructions
 269 @cindex density instructions
 270
 271 The Xtensa instruction set has a code density option that provides
 272 16-bit versions of some of the most commonly used opcodes.  Use of these
 273 opcodes can significantly reduce code size.  When possible, the
 274 assembler automatically translates instructions from the core
 275 Xtensa instruction set into equivalent instructions from the Xtensa code
 276 density option.  This translation can be disabled by using underscore
 277 prefixes (@pxref{Xtensa Opcodes, ,Opcode Names}), by using the
 278 @samp{--no-transform} command-line option (@pxref{Xtensa Options, ,Command
 279 Line Options}), or by using the @code{no-transform} directive
 280 (@pxref{Transform Directive, ,transform}).
 281
 282 It is a good idea @emph{not} to use the density instructions directly.
 283 The assembler will automatically select dense instructions where
 284 possible.  If you later need to use an Xtensa processor without the code
 285 density option, the same assembly code will then work without modification.
 286
 287 @node Xtensa Automatic Alignment
 288 @subsection Automatic Instruction Alignment
 289 @cindex alignment of @code{LOOP} instructions
 290 @cindex alignment of branch targets
 291 @cindex @code{LOOP} instructions, alignment
 292 @cindex branch target alignment
 293
 294 The Xtensa assembler will automatically align certain instructions, both
 295 to optimize performance and to satisfy architectural requirements.
 296
 297 As an optimization to improve performance, the assembler attempts to
 298 align branch targets so they do not cross instruction fetch boundaries.
 299 (Xtensa processors can be configured with either 32-bit or 64-bit
 300 instruction fetch widths.)  An
 301 instruction immediately following a call is treated as a branch target
 302 in this context, because it will be the target of a return from the
 303 call.  This alignment has the potential to reduce branch penalties at
 304 some expense in code size.
 305 This optimization is enabled by default.  You can disable it with the
 306 @samp{--no-target-@-align} command-line option (@pxref{Xtensa Options,
 307 ,Command-line Options}).
 308
 309 The target alignment optimization is done without adding instructions
 310 that could increase the execution time of the program.  If there are
 311 density instructions in the code preceding a target, the assembler can
 312 change the target alignment by widening some of those instructions to
 313 the equivalent 24-bit instructions.  Extra bytes of padding can be
 314 inserted immediately following unconditional jump and return
 315 instructions.
 316 This approach is usually successful in aligning many, but not all,
 317 branch targets.
 318
 319 The @code{LOOP} family of instructions must be aligned such that the
 320 first instruction in the loop body does not cross an instruction fetch
 321 boundary (e.g., with a 32-bit fetch width, a @code{LOOP} instruction
 322 must be on either a 1 or 2 mod 4 byte boundary).  The assembler knows
 323 about this restriction and inserts the minimal number of 2 or 3 byte
 324 no-op instructions to satisfy it.  When no-op instructions are added,
 325 any label immediately preceding the original loop will be moved in order
 326 to refer to the loop instruction, not the newly generated no-op
 327 instruction.  To preserve binary compatibility across processors with
 328 different fetch widths, the assembler conservatively assumes a 32-bit
 329 fetch width when aligning @code{LOOP} instructions (except if the first
 330 instruction in the loop is a 64-bit instruction).
 331
 332 Previous versions of the assembler automatically aligned @code{ENTRY}
 333 instructions to 4-byte boundaries, but that alignment is now the
 334 programmer's responsibility.
 335
 336 @node Xtensa Relaxation
 337 @section Xtensa Relaxation
 338 @cindex relaxation
 339
 340 When an instruction operand is outside the range allowed for that
 341 particular instruction field, @command{@value{AS}} can transform the code
 342 to use a functionally-equivalent instruction or sequence of
 343 instructions.  This process is known as @dfn{relaxation}.  This is
 344 typically done for branch instructions because the distance of the
 345 branch targets is not known until assembly-time.  The Xtensa assembler
 346 offers branch relaxation and also extends this concept to function
 347 calls, @code{MOVI} instructions and other instructions with immediate
 348 fields.
 349
 350 @menu
 351 * Xtensa Branch Relaxation::        Relaxation of Branches.
 352 * Xtensa Call Relaxation::          Relaxation of Function Calls.
 353 * Xtensa Jump Relaxation::          Relaxation of Jumps.
 354 * Xtensa Immediate Relaxation::     Relaxation of other Immediate Fields.
 355 @end menu
 356
 357 @node Xtensa Branch Relaxation
 358 @subsection Conditional Branch Relaxation
 359 @cindex relaxation of branch instructions
 360 @cindex branch instructions, relaxation
 361
 362 When the target of a branch is too far away from the branch itself,
 363 i.e., when the offset from the branch to the target is too large to fit
 364 in the immediate field of the branch instruction, it may be necessary to
 365 replace the branch with a branch around a jump.  For example,
 366
 367 @smallexample
 368     beqz    a2, L
 369 @end smallexample
 370
 371 may result in:
 372
 373 @smallexample
 374 @group
 375     bnez.n  a2, M
 376     j L
 377 M:
 378 @end group
 379 @end smallexample
 380
 381 (The @code{BNEZ.N} instruction would be used in this example only if the
 382 density option is available.  Otherwise, @code{BNEZ} would be used.)
 383
 384 This relaxation works well because the unconditional jump instruction
 385 has a much larger offset range than the various conditional branches.
 386 However, an error will occur if a branch target is beyond the range of a
 387 jump instruction.  @command{@value{AS}} cannot relax unconditional jumps.
 388 Similarly, an error will occur if the original input contains an
 389 unconditional jump to a target that is out of range.
 390
 391 Branch relaxation is enabled by default.  It can be disabled by using
 392 underscore prefixes (@pxref{Xtensa Opcodes, ,Opcode Names}), the
 393 @samp{--no-transform} command-line option (@pxref{Xtensa Options,
 394 ,Command-line Options}), or the @code{no-transform} directive
 395 (@pxref{Transform Directive, ,transform}).
 396
 397 @node Xtensa Call Relaxation
 398 @subsection Function Call Relaxation
 399 @cindex relaxation of call instructions
 400 @cindex call instructions, relaxation
 401
 402 Function calls may require relaxation because the Xtensa immediate call
 403 instructions (@code{CALL0}, @code{CALL4}, @code{CALL8} and
 404 @code{CALL12}) provide a PC-relative offset of only 512 Kbytes in either
 405 direction.  For larger programs, it may be necessary to use indirect
 406 calls (@code{CALLX0}, @code{CALLX4}, @code{CALLX8} and @code{CALLX12})
 407 where the target address is specified in a register.  The Xtensa
 408 assembler can automatically relax immediate call instructions into
 409 indirect call instructions.  This relaxation is done by loading the
 410 address of the called function into the callee's return address register
 411 and then using a @code{CALLX} instruction.  So, for example:
 412
 413 @smallexample
 414     call8 func
 415 @end smallexample
 416
 417 might be relaxed to:
 418
 419 @smallexample
 420 @group
 421     .literal .L1, func
 422     l32r    a8, .L1
 423     callx8  a8
 424 @end group
 425 @end smallexample
 426
 427 Because the addresses of targets of function calls are not generally
 428 known until link-time, the assembler must assume the worst and relax all
 429 the calls to functions in other source files, not just those that really
 430 will be out of range.  The linker can recognize calls that were
 431 unnecessarily relaxed, and it will remove the overhead introduced by the
 432 assembler for those cases where direct calls are sufficient.
 433
 434 Call relaxation is disabled by default because it can have a negative
 435 effect on both code size and performance, although the linker can
 436 usually eliminate the unnecessary overhead.  If a program is too large
 437 and some of the calls are out of range, function call relaxation can be
 438 enabled using the @samp{--longcalls} command-line option or the
 439 @code{longcalls} directive (@pxref{Longcalls Directive, ,longcalls}).
 440
 441 @node Xtensa Jump Relaxation
 442 @subsection Jump Relaxation
 443 @cindex relaxation of jump instructions
 444 @cindex jump instructions, relaxation
 445
 446 Jump instruction may require relaxation because the Xtensa jump instruction
 447 (@code{J}) provide a PC-relative offset of only 128 Kbytes in either
 448 direction.  One option is to use jump long (@code{J.L}) instruction, which
 449 depending on jump distance may be assembled as jump (@code{J}) or indirect
 450 jump (@code{JX}).  However it needs a free register.  When there's no spare
 451 register it is possible to plant intermediate jump sites (trampolines)
 452 between the jump instruction and its target.  These sites may be located in
 453 areas unreachable by normal code execution flow, in that case they only
 454 contain intermediate jumps, or they may be inserted in the middle of code
 455 block, in which case there's an additional jump from the beginning of the
 456 trampoline to the instruction past its end.  So, for example:
 457
 458 @smallexample
 459 @group
 460     j 1f
 461     ...
 462     retw
 463     ...
 464     mov a10, a2
 465     call8 func
 466     ...
 467 1:
 468     ...
 469 @end group
 470 @end smallexample
 471
 472 might be relaxed to:
 473
 474 @smallexample
 475 @group
 476     j .L0_TR_1
 477     ...
 478     retw
 479 .L0_TR_1:
 480     j 1f
 481     ...
 482     mov a10, a2
 483     call8 func
 484     ...
 485 1:
 486     ...
 487 @end group
 488 @end smallexample
 489
 490 or to:
 491
 492 @smallexample
 493 @group
 494     j .L0_TR_1
 495     ...
 496     retw
 497     ...
 498     mov a10, a2
 499     j .L0_TR_0
 500 .L0_TR_1:
 501     j 1f
 502 .L0_TR_0:
 503     call8 func
 504     ...
 505 1:
 506     ...
 507 @end group
 508 @end smallexample
 509
 510 The Xtensa assembler uses trampolines with jump around only when it cannot
 511 find suitable unreachable trampoline.  There may be multiple trampolines
 512 between the jump instruction and its target.
 513
 514 This relaxation does not apply to jumps to undefined symbols, assuming they
 515 will reach their targets once resolved.
 516
 517 Jump relaxation is enabled by default because it does not affect code size
 518 or performance while the code itself is small.  This relaxation may be
 519 disabled completely with @samp{--no-trampolines} or @samp{--no-transform}
 520 command-line options (@pxref{Xtensa Options, ,Command-line Options}).
 521
 522 @node Xtensa Immediate Relaxation
 523 @subsection Other Immediate Field Relaxation
 524 @cindex immediate fields, relaxation
 525 @cindex relaxation of immediate fields
 526
 527 The assembler normally performs the following other relaxations.  They
 528 can be disabled by using underscore prefixes (@pxref{Xtensa Opcodes,
 529 ,Opcode Names}), the @samp{--no-transform} command-line option
 530 (@pxref{Xtensa Options, ,Command-line Options}), or the
 531 @code{no-transform} directive (@pxref{Transform Directive, ,transform}).
 532
 533 @cindex @code{MOVI} instructions, relaxation
 534 @cindex relaxation of @code{MOVI} instructions
 535 The @code{MOVI} machine instruction can only materialize values in the
 536 range from -2048 to 2047.  Values outside this range are best
 537 materialized with @code{L32R} instructions.  Thus:
 538
 539 @smallexample
 540     movi a0, 100000
 541 @end smallexample
 542
 543 is assembled into the following machine code:
 544
 545 @smallexample
 546 @group
 547     .literal .L1, 100000
 548     l32r a0, .L1
 549 @end group
 550 @end smallexample
 551
 552 @cindex @code{L8UI} instructions, relaxation
 553 @cindex @code{L16SI} instructions, relaxation
 554 @cindex @code{L16UI} instructions, relaxation
 555 @cindex @code{L32I} instructions, relaxation
 556 @cindex relaxation of @code{L8UI} instructions
 557 @cindex relaxation of @code{L16SI} instructions
 558 @cindex relaxation of @code{L16UI} instructions
 559 @cindex relaxation of @code{L32I} instructions
 560 The @code{L8UI} machine instruction can only be used with immediate
 561 offsets in the range from 0 to 255. The @code{L16SI} and @code{L16UI}
 562 machine instructions can only be used with offsets from 0 to 510.  The
 563 @code{L32I} machine instruction can only be used with offsets from 0 to
 564 1020.  A load offset outside these ranges can be materialized with
 565 an @code{L32R} instruction if the destination register of the load
 566 is different than the source address register.  For example:
 567
 568 @smallexample
 569     l32i a1, a0, 2040
 570 @end smallexample
 571
 572 is translated to:
 573
 574 @smallexample
 575 @group
 576     .literal .L1, 2040
 577     l32r a1, .L1
 578 @end group
 579 @group
 580     add a1, a0, a1
 581     l32i a1, a1, 0
 582 @end group
 583 @end smallexample
 584
 585 @noindent
 586 If the load destination and source address register are the same, an
 587 out-of-range offset causes an error.
 588
 589 @cindex @code{ADDI} instructions, relaxation
 590 @cindex relaxation of @code{ADDI} instructions
 591 The Xtensa @code{ADDI} instruction only allows immediate operands in the
 592 range from -128 to 127.  There are a number of alternate instruction
 593 sequences for the @code{ADDI} operation.  First, if the
 594 immediate is 0, the @code{ADDI} will be turned into a @code{MOV.N}
 595 instruction (or the equivalent @code{OR} instruction if the code density
 596 option is not available).  If the @code{ADDI} immediate is outside of
 597 the range -128 to 127, but inside the range -32896 to 32639, an
 598 @code{ADDMI} instruction or @code{ADDMI}/@code{ADDI} sequence will be
 599 used.  Finally, if the immediate is outside of this range and a free
 600 register is available, an @code{L32R}/@code{ADD} sequence will be used
 601 with a literal allocated from the literal pool.
 602
 603 For example:
 604
 605 @smallexample
 606 @group
 607     addi    a5, a6, 0
 608     addi    a5, a6, 512
 609 @end group
 610 @group
 611     addi    a5, a6, 513
 612     addi    a5, a6, 50000
 613 @end group
 614 @end smallexample
 615
 616 is assembled into the following:
 617
 618 @smallexample
 619 @group
 620     .literal .L1, 50000
 621     mov.n   a5, a6
 622 @end group
 623     addmi   a5, a6, 0x200
 624     addmi   a5, a6, 0x200
 625     addi    a5, a5, 1
 626 @group
 627     l32r    a5, .L1
 628     add     a5, a6, a5
 629 @end group
 630 @end smallexample
 631
 632 @node Xtensa Directives
 633 @section Directives
 634 @cindex Xtensa directives
 635 @cindex directives, Xtensa
 636
 637 The Xtensa assembler supports a region-based directive syntax:
 638
 639 @smallexample
 640 @group
 641     .begin @var{directive} [@var{options}]
 642     @dots{}
 643     .end @var{directive}
 644 @end group
 645 @end smallexample
 646
 647 All the Xtensa-specific directives that apply to a region of code use
 648 this syntax.
 649
 650 The directive applies to code between the @code{.begin} and the
 651 @code{.end}.  The state of the option after the @code{.end} reverts to
 652 what it was before the @code{.begin}.
 653 A nested @code{.begin}/@code{.end} region can further
 654 change the state of the directive without having to be aware of its
 655 outer state.  For example, consider:
 656
 657 @smallexample
 658 @group
 659     .begin no-transform
 660 L:  add a0, a1, a2
 661 @end group
 662     .begin transform
 663 M:  add a0, a1, a2
 664     .end transform
 665 @group
 666 N:  add a0, a1, a2
 667     .end no-transform
 668 @end group
 669 @end smallexample
 670
 671 The @code{ADD} opcodes at @code{L} and @code{N} in the outer
 672 @code{no-transform} region both result in @code{ADD} machine instructions,
 673 but the assembler selects an @code{ADD.N} instruction for the
 674 @code{ADD} at @code{M} in the inner @code{transform} region.
 675
 676 The advantage of this style is that it works well inside macros which can
 677 preserve the context of their callers.
 678
 679 The following directives are available:
 680 @menu
 681 * Schedule Directive::         Enable instruction scheduling.
 682 * Longcalls Directive::        Use Indirect Calls for Greater Range.
 683 * Transform Directive::        Disable All Assembler Transformations.
 684 * Literal Directive::          Intermix Literals with Instructions.
 685 * Literal Position Directive:: Specify Inline Literal Pool Locations.
 686 * Literal Prefix Directive::   Specify Literal Section Name Prefix.
 687 * Absolute Literals Directive:: Control PC-Relative vs. Absolute Literals.
 688 @end menu
 689
 690 @node Schedule Directive
 691 @subsection schedule
 692 @cindex @code{schedule} directive
 693 @cindex @code{no-schedule} directive
 694
 695 The @code{schedule} directive is recognized only for compatibility with
 696 Tensilica's assembler.
 697
 698 @smallexample
 699 @group
 700     .begin [no-]schedule
 701     .end [no-]schedule
 702 @end group
 703 @end smallexample
 704
 705 This directive is ignored and has no effect on @command{@value{AS}}.
 706
 707 @node Longcalls Directive
 708 @subsection longcalls
 709 @cindex @code{longcalls} directive
 710 @cindex @code{no-longcalls} directive
 711
 712 The @code{longcalls} directive enables or disables function call
 713 relaxation.  @xref{Xtensa Call Relaxation, ,Function Call Relaxation}.
 714
 715 @smallexample
 716 @group
 717     .begin [no-]longcalls
 718     .end [no-]longcalls
 719 @end group
 720 @end smallexample
 721
 722 Call relaxation is disabled by default unless the @samp{--longcalls}
 723 command-line option is specified.  The @code{longcalls} directive
 724 overrides the default determined by the command-line options.
 725
 726 @node Transform Directive
 727 @subsection transform
 728 @cindex @code{transform} directive
 729 @cindex @code{no-transform} directive
 730
 731 This directive enables or disables all assembler transformation,
 732 including relaxation (@pxref{Xtensa Relaxation, ,Xtensa Relaxation}) and
 733 optimization (@pxref{Xtensa Optimizations, ,Xtensa Optimizations}).
 734
 735 @smallexample
 736 @group
 737     .begin [no-]transform
 738     .end [no-]transform
 739 @end group
 740 @end smallexample
 741
 742 Transformations are enabled by default unless the @samp{--no-transform}
 743 option is used.  The @code{transform} directive overrides the default
 744 determined by the command-line options.  An underscore opcode prefix,
 745 disabling transformation of that opcode, always takes precedence over
 746 both directives and command-line flags.
 747
 748 @node Literal Directive
 749 @subsection literal
 750 @cindex @code{literal} directive
 751
 752 The @code{.literal} directive is used to define literal pool data, i.e.,
 753 read-only 32-bit data accessed via @code{L32R} instructions.
 754
 755 @smallexample
 756     .literal @var{label}, @var{value}[, @var{value}@dots{}]
 757 @end smallexample
 758
 759 This directive is similar to the standard @code{.word} directive, except
 760 that the actual location of the literal data is determined by the
 761 assembler and linker, not by the position of the @code{.literal}
 762 directive.  Using this directive gives the assembler freedom to locate
 763 the literal data in the most appropriate place and possibly to combine
 764 identical literals.  For example, the code:
 765
 766 @smallexample
 767 @group
 768     entry sp, 40
 769     .literal .L1, sym
 770     l32r    a4, .L1
 771 @end group
 772 @end smallexample
 773
 774 can be used to load a pointer to the symbol @code{sym} into register
 775 @code{a4}.  The value of @code{sym} will not be placed between the
 776 @code{ENTRY} and @code{L32R} instructions; instead, the assembler puts
 777 the data in a literal pool.
 778
 779 Literal pools are placed by default in separate literal sections;
 780 however, when using the @samp{--text-@-section-@-literals}
 781 option (@pxref{Xtensa Options, ,Command-line Options}), the literal
 782 pools for PC-relative mode @code{L32R} instructions
 783 are placed in the current section.@footnote{Literals for the
 784 @code{.init} and @code{.fini} sections are always placed in separate
 785 sections, even when @samp{--text-@-section-@-literals} is enabled.}
 786 These text section literal
 787 pools are created automatically before @code{ENTRY} instructions and
 788 manually after @samp{.literal_position} directives (@pxref{Literal
 789 Position Directive, ,literal_position}).  If there are no preceding
 790 @code{ENTRY} instructions, explicit @code{.literal_position} directives
 791 must be used to place the text section literal pools; otherwise,
 792 @command{@value{AS}} will report an error.
 793
 794 When literals are placed in separate sections, the literal section names
 795 are derived from the names of the sections where the literals are
 796 defined.  The base literal section names are @code{.literal} for
 797 PC-relative mode @code{L32R} instructions and @code{.lit4} for absolute
 798 mode @code{L32R} instructions (@pxref{Absolute Literals Directive,
 799 ,absolute-literals}).  These base names are used for literals defined in
 800 the default @code{.text} section.  For literals defined in other
 801 sections or within the scope of a @code{literal_prefix} directive
 802 (@pxref{Literal Prefix Directive, ,literal_prefix}), the following rules
 803 determine the literal section name:
 804
 805 @enumerate
 806 @item
 807 If the current section is a member of a section group, the literal
 808 section name includes the group name as a suffix to the base
 809 @code{.literal} or @code{.lit4} name, with a period to separate the base
 810 name and group name.  The literal section is also made a member of the
 811 group.
 812
 813 @item
 814 If the current section name (or @code{literal_prefix} value) begins with
 815 ``@code{.gnu.linkonce.@var{kind}.}'', the literal section name is formed
 816 by replacing ``@code{.@var{kind}}'' with the base @code{.literal} or
 817 @code{.lit4} name.  For example, for literals defined in a section named
 818 @code{.gnu.linkonce.t.func}, the literal section will be
 819 @code{.gnu.linkonce.literal.func} or @code{.gnu.linkonce.lit4.func}.
 820
 821 @item
 822 If the current section name (or @code{literal_prefix} value) ends with
 823 @code{.text}, the literal section name is formed by replacing that
 824 suffix with the base @code{.literal} or @code{.lit4} name.  For example,
 825 for literals defined in a section named @code{.iram0.text}, the literal
 826 section will be @code{.iram0.literal} or @code{.iram0.lit4}.
 827
 828 @item
 829 If none of the preceding conditions apply, the literal section name is
 830 formed by adding the base @code{.literal} or @code{.lit4} name as a
 831 suffix to the current section name (or @code{literal_prefix} value).
 832 @end enumerate
 833
 834 @node Literal Position Directive
 835 @subsection literal_position
 836 @cindex @code{literal_position} directive
 837
 838 When using @samp{--text-@-section-@-literals} to place literals inline
 839 in the section being assembled, the @code{.literal_position} directive
 840 can be used to mark a potential location for a literal pool.
 841
 842 @smallexample
 843     .literal_position
 844 @end smallexample
 845
 846 The @code{.literal_position} directive is ignored when the
 847 @samp{--text-@-section-@-literals} option is not used or when
 848 @code{L32R} instructions use the absolute addressing mode.
 849
 850 The assembler will automatically place text section literal pools
 851 before @code{ENTRY} instructions, so the @code{.literal_position}
 852 directive is only needed to specify some other location for a literal
 853 pool.  You may need to add an explicit jump instruction to skip over an
 854 inline literal pool.
 855
 856 For example, an interrupt vector does not begin with an @code{ENTRY}
 857 instruction so the assembler will be unable to automatically find a good
 858 place to put a literal pool.  Moreover, the code for the interrupt
 859 vector must be at a specific starting address, so the literal pool
 860 cannot come before the start of the code.  The literal pool for the
 861 vector must be explicitly positioned in the middle of the vector (before
 862 any uses of the literals, due to the negative offsets used by
 863 PC-relative @code{L32R} instructions).  The @code{.literal_position}
 864 directive can be used to do this.  In the following code, the literal
 865 for @samp{M} will automatically be aligned correctly and is placed after
 866 the unconditional jump.
 867
 868 @smallexample
 869 @group
 870     .global M
 871 code_start:
 872 @end group
 873     j continue
 874     .literal_position
 875     .align 4
 876 @group
 877 continue:
 878     movi    a4, M
 879 @end group
 880 @end smallexample
 881
 882 @node Literal Prefix Directive
 883 @subsection literal_prefix
 884 @cindex @code{literal_prefix} directive
 885
 886 The @code{literal_prefix} directive allows you to override the default
 887 literal section names, which are derived from the names of the sections
 888 where the literals are defined.
 889
 890 @smallexample
 891 @group
 892     .begin literal_prefix [@var{name}]
 893     .end literal_prefix
 894 @end group
 895 @end smallexample
 896
 897 For literals defined within the delimited region, the literal section
 898 names are derived from the @var{name} argument instead of the name of
 899 the current section.  The rules used to derive the literal section names
 900 do not change.  @xref{Literal Directive, ,literal}.  If the @var{name}
 901 argument is omitted, the literal sections revert to the defaults.  This
 902 directive has no effect when using the
 903 @samp{--text-@-section-@-literals} option (@pxref{Xtensa Options,
 904 ,Command-line Options}).
 905
 906 @node Absolute Literals Directive
 907 @subsection absolute-literals
 908 @cindex @code{absolute-literals} directive
 909 @cindex @code{no-absolute-literals} directive
 910
 911 The @code{absolute-@-literals} and @code{no-@-absolute-@-literals}
 912 directives control the absolute vs.@: PC-relative mode for @code{L32R}
 913 instructions.  These are relevant only for Xtensa configurations that
 914 include the absolute addressing option for @code{L32R} instructions.
 915
 916 @smallexample
 917 @group
 918     .begin [no-]absolute-literals
 919     .end [no-]absolute-literals
 920 @end group
 921 @end smallexample
 922
 923 These directives do not change the @code{L32R} mode---they only cause
 924 the assembler to emit the appropriate kind of relocation for @code{L32R}
 925 instructions and to place the literal values in the appropriate section.
 926 To change the @code{L32R} mode, the program must write the
 927 @code{LITBASE} special register.  It is the programmer's responsibility
 928 to keep track of the mode and indicate to the assembler which mode is
 929 used in each region of code.
 930
 931 If the Xtensa configuration includes the absolute @code{L32R} addressing
 932 option, the default is to assume absolute @code{L32R} addressing unless
 933 the @samp{--no-@-absolute-@-literals} command-line option is specified.
 934 Otherwise, the default is to assume PC-relative @code{L32R} addressing.
 935 The @code{absolute-@-literals} directive can then be used to override
 936 the default determined by the command-line options.
 937
 938 @c Local Variables:
 939 @c fill-column: 72
 940 @c End: