[thirdparty/gcc.git] / libstdc++-v3 / doc / xml / manual / concurrency_extensions.xml

<chapter xmlns="http://docbook.org/ns/docbook" version="5.0" 
	 xml:id="manual.ext.concurrency" xreflabel="Concurrency Extensions">
<?dbhtml filename="ext_concurrency.html"?>

<info><title>Concurrency</title>
  <keywordset>
    <keyword>
      ISO C++
    </keyword>
    <keyword>
      library
    </keyword>
  </keywordset>
</info>


<section xml:id="manual.ext.concurrency.design" xreflabel="Design"><info><title>Design</title></info>
  

  <section xml:id="manual.ext.concurrency.design.threads" xreflabel="Threads API"><info><title>Interface to Locks and Mutexes</title></info>
    

<para>The file &lt;ext/concurrence.h&gt; contains all the higher-level
constructs for playing with threads. In contrast to the atomics layer,
the concurrence layer consists largely of types. All types are defined within <code>namespace __gnu_cxx</code>.
</para>

<para>
These types can be used in a portable manner, regardless of the
specific environment. They are carefully designed to provide optimum
efficiency and speed, abstracting out underlying thread calls and
accesses when compiling for single-threaded situations (even on hosts
that support multiple threads.)
</para>

<para>The enumerated type <code>_Lock_policy</code> details the set of
available locking
policies: <code>_S_single</code>, <code>_S_mutex</code>,
and <code>_S_atomic</code>.
</para>

<itemizedlist>
<listitem><para><code>_S_single</code></para>
<para>Indicates single-threaded code that does not need locking.
</para>

</listitem>
<listitem><para><code>_S_mutex</code></para>
<para>Indicates multi-threaded code using thread-layer abstractions.
</para>
</listitem>
<listitem><para><code>_S_atomic</code></para>
<para>Indicates multi-threaded code using atomic operations.
</para>
</listitem>
</itemizedlist>

<para>The compile-time constant <code>__default_lock_policy</code> is set
to one of the three values above, depending on characteristics of the
host environment and the current compilation flags.
</para>

<para>Two more datatypes make up the rest of the
interface: <code>__mutex</code>, and <code>__scoped_lock</code>.
</para>

<para>
</para>

<para>The scoped lock idiom is well-discussed within the C++
community. This version takes a <code>__mutex</code> reference, and
locks it during construction of <code>__scoped_locke</code> and
unlocks it during destruction. This is an efficient way of locking
critical sections, while retaining exception-safety.
</para>
  </section>

  <section xml:id="manual.ext.concurrency.design.atomics" xreflabel="Atomic API"><info><title>Interface to Atomic Functions</title></info>
    

<para>
Two functions and one type form the base of atomic support.
</para>


<para>The type <code>_Atomic_word</code> is a signed integral type
supporting atomic operations.
</para>

<para>
The two functions functions are:
</para>

<programlisting>
_Atomic_word
__exchange_and_add_dispatch(volatile _Atomic_word*, int);

void
__atomic_add_dispatch(volatile _Atomic_word*, int);
</programlisting>

<para>Both of these functions are declared in the header file
&lt;ext/atomicity.h&gt;, and are in <code>namespace __gnu_cxx</code>.
</para>

<itemizedlist>
<listitem><para>
<code>
__exchange_and_add_dispatch
</code>
</para>
<para>Adds the second argument's value to the first argument. Returns the old value.
</para>
</listitem>
<listitem><para>
<code>
__atomic_add_dispatch
</code>
</para>
<para>Adds the second argument's value to the first argument. Has no return value.
</para>
</listitem>
</itemizedlist>

<para>
These functions forward to one of several specialized helper
functions, depending on the circumstances. For instance,
</para>

<para>
<code>
__exchange_and_add_dispatch
</code>
</para>

<para>
Calls through to either of:
</para>

<itemizedlist>
<listitem><para><code>__exchange_and_add</code>
</para>
<para>Multi-thread version. Inlined if compiler-generated builtin atomics
can be used, otherwise resolved at link time to a non-builtin code
sequence.
</para>
</listitem>

<listitem><para><code>__exchange_and_add_single</code>
</para>
<para>Single threaded version. Inlined.</para>
</listitem>
</itemizedlist>

<para>However, only <code>__exchange_and_add_dispatch</code>
and <code>__atomic_add_dispatch</code> should be used. These functions
can be used in a portable manner, regardless of the specific
environment. They are carefully designed to provide optimum efficiency
and speed, abstracting out atomic accesses when they are not required
(even on hosts that support compiler intrinsics for atomic
operations.)
</para>

<para>
In addition, there are two macros
</para>

<para>
<code>
_GLIBCXX_READ_MEM_BARRIER
</code>
</para>
<para>
<code>
_GLIBCXX_WRITE_MEM_BARRIER
</code>
</para>

<para>
Which expand to the appropriate write and read barrier required by the
host hardware and operating system.
</para>
  </section>

</section>


<section xml:id="manual.ext.concurrency.impl" xreflabel="Implementation"><info><title>Implementation</title></info>
  
  <section xml:id="manual.ext.concurrency.impl.atomic_fallbacks" xreflabel="Atomic F"><info><title>Using Builtin Atomic Functions</title></info>
    

<para>The functions for atomic operations described above are either
implemented via compiler intrinsics (if the underlying host is
capable) or by library fallbacks.</para>

<para>Compiler intrinsics (builtins) are always preferred.  However, as
the compiler builtins for atomics are not universally implemented,
using them directly is problematic, and can result in undefined
function calls. (An example of an undefined symbol from the use
of <code>__sync_fetch_and_add</code> on an unsupported host is a
missing reference to <code>__sync_fetch_and_add_4</code>.)
</para>

<para>In addition, on some hosts the compiler intrinsics are enabled
conditionally, via the <code>-march</code> command line flag. This makes
usage vary depending on the target hardware and the flags used during
compile.
</para>


<para>
<remark>
Incomplete/inconsistent. This is only C++11.
</remark>
</para>

<para>
If builtins are possible for bool-sized integral types,
<code>ATOMIC_BOOL_LOCK_FREE</code> will be defined.
If builtins are possible for int-sized integral types,
<code>ATOMIC_INT_LOCK_FREE</code> will be defined.
</para>


<para>For the following hosts, intrinsics are enabled by default.
</para>

<itemizedlist>
  <listitem><para>alpha</para></listitem>
  <listitem><para>ia64</para></listitem>
  <listitem><para>powerpc</para></listitem>
  <listitem><para>s390</para></listitem>
</itemizedlist>

<para>For others, some form of <code>-march</code> may work. On
non-ancient x86 hardware, <code>-march=native</code> usually does the
trick.</para>

<para> For hosts without compiler intrinsics, but with capable
hardware, hand-crafted assembly is selected. This is the case for the following hosts:
</para>

<itemizedlist>
  <listitem><para>cris</para></listitem>
  <listitem><para>hppa</para></listitem>
  <listitem><para>i386</para></listitem>
  <listitem><para>i486</para></listitem>
  <listitem><para>m48k</para></listitem>
  <listitem><para>mips</para></listitem>
  <listitem><para>sparc</para></listitem>
</itemizedlist>

<para>And for the rest, a simulated atomic lock via pthreads.
</para>

<para> Detailed information about compiler intrinsics for atomic operations can be found in the GCC <link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://gcc.gnu.org/onlinedocs/gcc/Atomic-Builtins.html"> documentation</link>.
</para>

<para> More details on the library fallbacks from the porting <link linkend="internals.thread_safety">section</link>.
</para>


  </section>
  <section xml:id="manual.ext.concurrency.impl.thread" xreflabel="Pthread"><info><title>Thread Abstraction</title></info>
    

<para>A thin layer above IEEE 1003.1 (i.e. pthreads) is used to abstract
the thread interface for GCC. This layer is called "gthread," and is
comprised of one header file that wraps the host's default thread layer with
a POSIX-like interface.
</para>

<para> The file &lt;gthr-default.h&gt; points to the deduced wrapper for
the current host. In libstdc++ implementation files,
&lt;bits/gthr.h&gt; is used to select the proper gthreads file.
</para>

<para>Within libstdc++ sources, all calls to underlying thread functionality
use this layer. More detail as to the specific interface can be found in the source <link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://gcc.gnu.org/onlinedocs/libstdc++/latest-doxygen/a00883_source.html">documentation</link>.
</para>

<para>By design, the gthread layer is interoperable with the types,
functions, and usage found in the usual &lt;pthread.h&gt; file,
including <code>pthread_t</code>, <code>pthread_once_t</code>, <code>pthread_create</code>,
etc.
</para>

  </section>
</section>

<section xml:id="manual.ext.concurrency.use" xreflabel="Use"><info><title>Use</title></info>

  
<para>Typical usage of the last two constructs is demonstrated as follows:
</para>

<programlisting>
#include &lt;ext/concurrence.h&gt;

namespace
{
  __gnu_cxx::__mutex safe_base_mutex;
} // anonymous namespace

namespace other
{
  void
  foo()
  {
    __gnu_cxx::__scoped_lock sentry(safe_base_mutex);
    for (int i = 0; i &lt; max;  ++i)
      {
	_Safe_iterator_base* __old = __iter;
	__iter = __iter-&lt;_M_next;
	__old-&lt;_M_detach_single();
      }
}
</programlisting>

<para>In this sample code, an anonymous namespace is used to keep
the <code>__mutex</code> private to the compilation unit,
and <code>__scoped_lock</code> is used to guard access to the critical
section within the for loop, locking the mutex on creation and freeing
the mutex as control moves out of this block.
</para>

<para>Several exception classes are used to keep track of
concurrence-related errors. These classes
are: <code>__concurrence_lock_error</code>, <code>__concurrence_unlock_error</code>, <code>__concurrence_wait_error</code>,
and <code>__concurrence_broadcast_error</code>.
</para>


</section>

</chapter>
Commit	Line	Data
c6a21af2 BK	1	<chapter xmlns="http://docbook.org/ns/docbook" version="5.0"
c6a21af2 BK	2	xml:id="manual.ext.concurrency" xreflabel="Concurrency Extensions">
5e623d0b	3	<?dbhtml filename="ext_concurrency.html"?>
8a165db0	4
c6a21af2	5	<info><title>Concurrency</title>
8a165db0 BK	6	<keywordset>
	7	<keyword>
	8	ISO C++
	9	</keyword>
	10	<keyword>
	11	library
	12	</keyword>
	13	</keywordset>
c6a21af2 BK	14	</info>
c6a21af2 BK	15
8a165db0	16
8a165db0	17
c6a21af2 BK	18	<section xml:id="manual.ext.concurrency.design" xreflabel="Design"><info><title>Design</title></info>
c6a21af2 BK	19
8a165db0	20
c6a21af2 BK	21	<section xml:id="manual.ext.concurrency.design.threads" xreflabel="Threads API"><info><title>Interface to Locks and Mutexes</title></info>
c6a21af2 BK	22
8a165db0 BK	23
	24	<para>The file <ext/concurrence.h> contains all the higher-level
	25	constructs for playing with threads. In contrast to the atomics layer,
	26	the concurrence layer consists largely of types. All types are defined within <code>namespace __gnu_cxx</code>.
	27	</para>
	28
	29	<para>
	30	These types can be used in a portable manner, regardless of the
	31	specific environment. They are carefully designed to provide optimum
	32	efficiency and speed, abstracting out underlying thread calls and
	33	accesses when compiling for single-threaded situations (even on hosts
	34	that support multiple threads.)
	35	</para>
	36
	37	<para>The enumerated type <code>_Lock_policy</code> details the set of
	38	available locking
	39	policies: <code>_S_single</code>, <code>_S_mutex</code>,
	40	and <code>_S_atomic</code>.
	41	</para>
	42
	43	<itemizedlist>
	44	<listitem><para><code>_S_single</code></para>
	45	<para>Indicates single-threaded code that does not need locking.
	46	</para>
	47
	48	</listitem>
	49	<listitem><para><code>_S_mutex</code></para>
	50	<para>Indicates multi-threaded code using thread-layer abstractions.
	51	</para>
	52	</listitem>
	53	<listitem><para><code>_S_atomic</code></para>
	54	<para>Indicates multi-threaded code using atomic operations.
	55	</para>
	56	</listitem>
	57	</itemizedlist>
	58
	59	<para>The compile-time constant <code>__default_lock_policy</code> is set
	60	to one of the three values above, depending on characteristics of the
	61	host environment and the current compilation flags.
	62	</para>
	63
	64	<para>Two more datatypes make up the rest of the
	65	interface: <code>__mutex</code>, and <code>__scoped_lock</code>.
	66	</para>
	67
	68	<para>
	69	</para>
	70
	71	<para>The scoped lock idiom is well-discussed within the C++
	72	community. This version takes a <code>__mutex</code> reference, and
	73	locks it during construction of <code>__scoped_locke</code> and
	74	unlocks it during destruction. This is an efficient way of locking
	75	critical sections, while retaining exception-safety.
	76	</para>
c6a21af2	77	</section>
8a165db0	78
c6a21af2 BK	79	<section xml:id="manual.ext.concurrency.design.atomics" xreflabel="Atomic API"><info><title>Interface to Atomic Functions</title></info>
c6a21af2 BK	80
8a165db0 BK	81
	82
	83	<para>
03a32789	84	Two functions and one type form the base of atomic support.
8a165db0 BK	85	</para>
	86
	87
	88	<para>The type <code>_Atomic_word</code> is a signed integral type
	89	supporting atomic operations.
	90	</para>
	91
	92	<para>
	93	The two functions functions are:
	94	</para>
	95
	96	<programlisting>
	97	_Atomic_word
	98	__exchange_and_add_dispatch(volatile _Atomic_word*, int);
	99
	100	void
	101	__atomic_add_dispatch(volatile _Atomic_word*, int);
	102	</programlisting>
	103
	104	<para>Both of these functions are declared in the header file
	105	<ext/atomicity.h>, and are in <code>namespace __gnu_cxx</code>.
	106	</para>
	107
	108	<itemizedlist>
	109	<listitem><para>
	110	<code>
	111	__exchange_and_add_dispatch
	112	</code>
	113	</para>
	114	<para>Adds the second argument's value to the first argument. Returns the old value.
	115	</para>
	116	</listitem>
	117	<listitem><para>
	118	<code>
	119	__atomic_add_dispatch
	120	</code>
	121	</para>
	122	<para>Adds the second argument's value to the first argument. Has no return value.
	123	</para>
	124	</listitem>
	125	</itemizedlist>
	126
	127	<para>
	128	These functions forward to one of several specialized helper
03a32789	129	functions, depending on the circumstances. For instance,
8a165db0 BK	130	</para>
	131
	132	<para>
	133	<code>
	134	__exchange_and_add_dispatch
	135	</code>
	136	</para>
	137
	138	<para>
	139	Calls through to either of:
	140	</para>
	141
	142	<itemizedlist>
	143	<listitem><para><code>__exchange_and_add</code>
	144	</para>
	145	<para>Multi-thread version. Inlined if compiler-generated builtin atomics
	146	can be used, otherwise resolved at link time to a non-builtin code
	147	sequence.
	148	</para>
	149	</listitem>
	150
03a32789	151	<listitem><para><code>__exchange_and_add_single</code>
8a165db0 BK	152	</para>
	153	<para>Single threaded version. Inlined.</para>
	154	</listitem>
	155	</itemizedlist>
	156
	157	<para>However, only <code>__exchange_and_add_dispatch</code>
	158	and <code>__atomic_add_dispatch</code> should be used. These functions
	159	can be used in a portable manner, regardless of the specific
	160	environment. They are carefully designed to provide optimum efficiency
	161	and speed, abstracting out atomic accesses when they are not required
	162	(even on hosts that support compiler intrinsics for atomic
	163	operations.)
	164	</para>
	165
	166	<para>
	167	In addition, there are two macros
	168	</para>
	169
	170	<para>
	171	<code>
03a32789	172	_GLIBCXX_READ_MEM_BARRIER
8a165db0 BK	173	</code>
	174	</para>
	175	<para>
	176	<code>
03a32789	177	_GLIBCXX_WRITE_MEM_BARRIER
8a165db0 BK	178	</code>
	179	</para>
	180
	181	<para>
	182	Which expand to the appropriate write and read barrier required by the
	183	host hardware and operating system.
	184	</para>
c6a21af2	185	</section>
8a165db0	186
c6a21af2	187	</section>
8a165db0 BK	188
8a165db0 BK	189
c6a21af2 BK	190	<section xml:id="manual.ext.concurrency.impl" xreflabel="Implementation"><info><title>Implementation</title></info>
	191
	192	<section xml:id="manual.ext.concurrency.impl.atomic_fallbacks" xreflabel="Atomic F"><info><title>Using Builtin Atomic Functions</title></info>
	193
03a32789	194
8a165db0 BK	195	<para>The functions for atomic operations described above are either
	196	implemented via compiler intrinsics (if the underlying host is
	197	capable) or by library fallbacks.</para>
	198
	199	<para>Compiler intrinsics (builtins) are always preferred. However, as
	200	the compiler builtins for atomics are not universally implemented,
	201	using them directly is problematic, and can result in undefined
	202	function calls. (An example of an undefined symbol from the use
	203	of <code>__sync_fetch_and_add</code> on an unsupported host is a
	204	missing reference to <code>__sync_fetch_and_add_4</code>.)
	205	</para>
	206
	207	<para>In addition, on some hosts the compiler intrinsics are enabled
	208	conditionally, via the <code>-march</code> command line flag. This makes
	209	usage vary depending on the target hardware and the flags used during
	210	compile.
	211	</para>
	212
a152e96f BK	213
	214
	215	<para>
	216	<remark>
	217	Incomplete/inconsistent. This is only C++11.
	218	</remark>
	219	</para>
	220
03a32789	221	<para>
35648b45	222	If builtins are possible for bool-sized integral types,
a152e96f	223	<code>ATOMIC_BOOL_LOCK_FREE</code> will be defined.
35648b45	224	If builtins are possible for int-sized integral types,
a152e96f	225	<code>ATOMIC_INT_LOCK_FREE</code> will be defined.
8a165db0 BK	226	</para>
	227
	228
	229	<para>For the following hosts, intrinsics are enabled by default.
	230	</para>
	231
	232	<itemizedlist>
	233	<listitem><para>alpha</para></listitem>
	234	<listitem><para>ia64</para></listitem>
	235	<listitem><para>powerpc</para></listitem>
	236	<listitem><para>s390</para></listitem>
	237	</itemizedlist>
	238
	239	<para>For others, some form of <code>-march</code> may work. On
	240	non-ancient x86 hardware, <code>-march=native</code> usually does the
	241	trick.</para>
	242
	243	<para> For hosts without compiler intrinsics, but with capable
	244	hardware, hand-crafted assembly is selected. This is the case for the following hosts:
	245	</para>
	246
	247	<itemizedlist>
	248	<listitem><para>cris</para></listitem>
	249	<listitem><para>hppa</para></listitem>
	250	<listitem><para>i386</para></listitem>
	251	<listitem><para>i486</para></listitem>
	252	<listitem><para>m48k</para></listitem>
	253	<listitem><para>mips</para></listitem>
	254	<listitem><para>sparc</para></listitem>
	255	</itemizedlist>
	256
	257	<para>And for the rest, a simulated atomic lock via pthreads.
	258	</para>
	259
c6a21af2	260	<para> Detailed information about compiler intrinsics for atomic operations can be found in the GCC <link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://gcc.gnu.org/onlinedocs/gcc/Atomic-Builtins.html"> documentation</link>.
8a165db0 BK	261	</para>
8a165db0 BK	262
50fbf0c3	263	<para> More details on the library fallbacks from the porting <link linkend="internals.thread_safety">section</link>.
8a165db0 BK	264	</para>
	265
	266
c6a21af2 BK	267	</section>
	268	<section xml:id="manual.ext.concurrency.impl.thread" xreflabel="Pthread"><info><title>Thread Abstraction</title></info>
	269
8a165db0	270
4394b61e	271	<para>A thin layer above IEEE 1003.1 (i.e. pthreads) is used to abstract
8a165db0 BK	272	the thread interface for GCC. This layer is called "gthread," and is
	273	comprised of one header file that wraps the host's default thread layer with
	274	a POSIX-like interface.
	275	</para>
	276
	277	<para> The file <gthr-default.h> points to the deduced wrapper for
	278	the current host. In libstdc++ implementation files,
	279	<bits/gthr.h> is used to select the proper gthreads file.
	280	</para>
	281
	282	<para>Within libstdc++ sources, all calls to underlying thread functionality
c6a21af2	283	use this layer. More detail as to the specific interface can be found in the source <link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://gcc.gnu.org/onlinedocs/libstdc++/latest-doxygen/a00883_source.html">documentation</link>.
8a165db0 BK	284	</para>
	285
	286	<para>By design, the gthread layer is interoperable with the types,
	287	functions, and usage found in the usual <pthread.h> file,
	288	including <code>pthread_t</code>, <code>pthread_once_t</code>, <code>pthread_create</code>,
	289	etc.
	290	</para>
	291
c6a21af2 BK	292	</section>
c6a21af2 BK	293	</section>
8a165db0	294
c6a21af2	295	<section xml:id="manual.ext.concurrency.use" xreflabel="Use"><info><title>Use</title></info>
8a165db0	296
c6a21af2	297
8a165db0 BK	298
	299	<para>Typical usage of the last two constructs is demonstrated as follows:
	300	</para>
	301
	302	<programlisting>
	303	#include <ext/concurrence.h>
	304
	305	namespace
	306	{
	307	__gnu_cxx::__mutex safe_base_mutex;
	308	} // anonymous namespace
	309
	310	namespace other
	311	{
	312	void
	313	foo()
	314	{
	315	__gnu_cxx::__scoped_lock sentry(safe_base_mutex);
	316	for (int i = 0; i < max; ++i)
	317	{
	318	_Safe_iterator_base* __old = __iter;
	319	__iter = __iter-<_M_next;
	320	__old-<_M_detach_single();
	321	}
	322	}
	323	</programlisting>
	324
	325	<para>In this sample code, an anonymous namespace is used to keep
	326	the <code>__mutex</code> private to the compilation unit,
	327	and <code>__scoped_lock</code> is used to guard access to the critical
	328	section within the for loop, locking the mutex on creation and freeing
	329	the mutex as control moves out of this block.
	330	</para>
	331
	332	<para>Several exception classes are used to keep track of
	333	concurrence-related errors. These classes
	334	are: <code>__concurrence_lock_error</code>, <code>__concurrence_unlock_error</code>, <code>__concurrence_wait_error</code>,
	335	and <code>__concurrence_broadcast_error</code>.
	336	</para>
	337
	338
c6a21af2	339	</section>
8a165db0 BK	340
8a165db0 BK	341	</chapter>