| 
pthreads(5)							   pthreads(5)
      pthreads -	introduction to	POSIX thread characteristics
      This man page is intended as an overview of the POSIX thread model	as
     implemented in IRIX; it is	not an introduction to thread programming or a
     detailed description of the IRIX implementation.
     Thread programming	is motivated by	two concerns:
     Application Performance    [Toc]    [Back]
	  o   On a multiprocessor multiple threads may run at the same time.
	  o   Operations that cause the	caller to wait do not prevent other
	      threads (in the same process) from making	progress.
     Programming Model    [Toc]    [Back]
	  o   Complex applications can be structured more elegantly by
	      partitioning with	threads.
	  o   Threads share many resources implicitly which can	simplify cooperative
	algorithms.
     In	the past a UNIX	process	has been thought of as having a	single thread.
     The POSIX thread programming model	(known as pthreads) introduces the
     concept of	multiple threads within	a single process.  A POSIX thread is
     an	executable entity that belongs to a process.  It contains sufficient
     state to enable it	to run on a processor independently of its fellows.
     In	most other respects threads share state	with their host	process.  The
     vast majority of interfaces and features work the same way	in a multithreaded
 process as an unthreaded one.
     The POSIX threads API provides a set of interfaces	and semantics for
     creating and controlling threads.	Because	the interfaces are defined by
     the ISO standards organisation an application using them may enjoy	the
     benefits of parallel programming without sacrificing portability.
   Applications    [Toc]    [Back]
     To	create a pthread process an application	must be	linked with the
     pthread run-time library.	It is also recommended that the	thread safe
     options be	enabled	at compile time	using the feature test macro,
     _POSIX_C_SOURCE.
	  cc -D_POSIX_C_SOURCE=199506L app.c -llib0 -llib1 ... -lpthread
     See intro(3), section 3P for further details.
   Components    [Toc]    [Back]
     POSIX threads introduces a	number of thread components.
     Attribute    [Toc]    [Back]
	  An attribute is a characteristic of something	that defines its
	  behaviour.  Most of the objects introduced by	pthreads have
	  characteristics which	are set	when it	is created; indeed some
									Page 1
pthreads(5)							   pthreads(5)
	  characteristics may only be set at that time.	 A set of
	  characteristics may be specified in an attribute object which	is
	  passed to the	creation interface.  An	analogy	is a mould or template
	  that is used to shape	some artifact.	Every attribute	has a default
	  value	and the	attribute object itself	is optional.
     Thread    [Toc]    [Back]
	  A thread executes code.  It is created using pthread_create()	and
	  either inherits characteristics from its creator or has them
	  specified via	creation attributes.  The pthread_attr_* interfaces
	  may be used to create	a thread attributes object.
	  Many features	of the process are shared by its threads, including
	  process and user ids (PID, UID), file	descriptors, memory (including
	  text and data) and signal handlers.
	  Each thread has a unique identity which is used by a number of
	  pthread interfaces.  Thread ids may be compared using
	  pthread_equal().
	  Threads terminate with an exit status	[see pthread_exit()].  By
	  default the thread identity and exit status persist after the	thread
	  has terminated until the status is retrieved using pthread_join().
	  An application should	either retrieve	this status or arrange for it
	  to be	automatically discarded	on thread termination using
	  pthread_detach() or setting the detach thread	attribute.
     Mutex Lock    [Toc]    [Back]
	  A mutex lock facilitates exclusive access by a thread	to a resource.
	  Only one thread may own a mutex at a time and	is thus	guaranteed
	  exclusive access.
	  Mutex	interfaces are described on the	pthread_mutex_*	man pages.
	  The pthread_mutexattr_* interfaces may be used to create a mutex
	  attributes object.  Attributes include error checking, recursion and
	  owner	priority.  Mutexes are intended	to be lightweight and only
	  owned	for brief periods of time.
     Condition Variable    [Toc]    [Back]
	  A condition variable synchronises threads with an event of interest.
	  Condition variables allow a thread to	wait for an event and be woken
	  up when the event occurs; the	nature of the event is determined by
	  the application.
	  Condition variable interfaces	are described on the pthread_cond_*
	  man pages.  The pthread_condattr_* interfaces	may be used to create
	  a condition variable attributes object.
     Read-Write	Lock
	  A read-write lock facilitates	shared access for read and exclusive
	  access for write by threads to a resource.  They are used in
	  conjunction with resources which are frequently read but
	  infrequently changed (written).
	  Read-write lock interfaces are described on the pthread_rwlock_* man
	  pages.  The pthread_rwlockattr_* interfaces may be used to create a
	  read-write lock attributes object.
									Page 2
pthreads(5)							   pthreads(5)
     Semaphore    [Toc]    [Back]
	  A semaphore synchronises threads with	a counter.  Semaphores are not
	  part of pthreads per se and do not have an associated	attributes
	  object.  However anonymous semaphores	provide	a pshared flag (for
	  semaphores which are private to the process) which allows pthreads
	  to optimise access [see sem_init(3C)].  Semaphore interfaces are
	  described on the sem_* man pages.
   Semantics    [Toc]    [Back]
     POSIX threads introduces the following special semantics.
     Cancellation    [Toc]    [Back]
	  A cancellation request is a request for a thread to terminate.  Each
	  thread specifies whether and when to act on cancellation requests
	  using	its cancelability state	and type.  A request is	made with
	  pthread_cancel() and the state and type are set with
	  pthread_setcancelstate() and pthread_setcanceltype().	 Termination
	  handlers may be established to execute code when a thread terminates
	  for some reason including acting on a	cancellation request.
     Signals    [Toc]    [Back]
	  In the pthread signal	model all threads share	the signal disposition
	  [see sigaction(2)] but each thread has its own signal	mask of
	  blocked signals.  Signals may	be sent	to individual threads using
	  the pthread_kill() interface.	 Each thread may change	its own	mask
	  using	pthread_sigmask().  In IRIX, sigprocmask(2) is equivalent to
	  pthread_sigmask() but	a portable pthread application should only use
	  the latter.
	  A signal is delivered	to at most one thread; it is not broadcast to
	  all threads.
	  o   When a signal is directed	at a process the first eligible	thread
	      is chosen	to receive it.	An eligible thread is one that does
	      not have the signal blocked in its signal	mask or	is waiting in
	      sigwait(3).  If there are	no eligible threads then the signal
	      remains pending on the process until a thread becomes eligible.
	      This is called asynchronous signal delivery; kill(2) causes this
	      type of delivery.
	  o   When a signal is directed	at a thread then that thread will
	      receive it.  If the signal is blocked then it will remain
	      pending on the target thread; it will not	be delivered to	a
	      different	thread.	 This is called	synchronous signal delivery;
	      exceptions and program faults cause this type of delivery.
	  o   If the action of the signal is to	stop, continue or terminate
	      the recipient then it will act on	the process as a whole.	 It is
	      not possible to stop, continue or	terminate a single thread with
	      a	signal.
									Page 3
pthreads(5)							   pthreads(5)
     Notifications    [Toc]    [Back]
	  An extension to signal handling for threads is the addition of
	  SIGEV_THREAD which creates a thread instead of sending a signal as a
	  means	of handling the	occurrence of some event [see mq_notify(3C),
	  aio_read(3) and timer_create(3C)].  Creation of the thread takes
	  place	when the event occurs so care should be	taken to avoid any
	  errors or else the notification may be lost.
     Scheduling    [Toc]    [Back]
	  When a processor executes instructions on behalf of a	user it	does
	  so according to a set	of scheduling attributes which are part	of a
	  kernel execution vehicle.  Every thread that executes	on a processor
	  or waits in the kernel needs such a vehicle.
	  POSIX	threads	makes a	distinction between system scope (kernel)
	  scheduling and process scope (run-time) scheduling. The SGI
	  implementation adds an additional scheduling scope, bound scope.
	  These	bound scope threads are	scheduled exactly the same as POSIX
	  process scope	threads.  For the rest of this man page, the
	  discussion of	process	scope scheduling is applicable for bound scope
	  threads.  In both scopes the individual thread scheduling policy and
	  priority values are set using	the interfaces
	  pthread_attr_setschedpolicy(), pthread_attr_setschedparam(), and
	  pthread_setschedparam().
	  Scope	is defined when	a thread is created using
	  pthread_attr_setscope():
	  system scope threads are scheduled by	the kernel; the	scheduling
	  attributes of	the thread and the kernel execution vehicle are	the
	  same.	 The kernel includes all system	scope threads in its
	  scheduling decisions.	 These threads run at realtime policy and
	  priority and may only	be created by privileged users.
	  process scope	threads	are scheduled by the pthread run-time; the
	  scheduling attributes	of the thread and the kernel execution vehicle
	  may be different.  The run-time makes	scheduling decisions based
	  only on the process scope threads in the host	process.
	  An advantage of system scope is that a thread	can get	high
	  performance and deterministic	response.  A disadvantage is that
	  kernel resources must	be allocated to	each thread.
	  In contrast, process scope threads only require kernel state when
	  they are executing on	a processor or waiting in the kernel.  The
	  run-time scheduler multiplexes process scope threads onto a smaller
	  number of kernel execution vehicles.	This can produce faster
	  scheduling because no	kernel state is	involved.
	  The number of	execution vehicles used	for process scope threads
	  depends on application behaviour and system configuration.  By
	  default, the run-time	adjusts	this number dynamically	but the
									Page 4
pthreads(5)							   pthreads(5)
	  pthread_setconcurrency() interface gives a strong hint as to the
	  desired value.
	  The execution	vehicles used for process scope	threads	share a	set of
	  kernel scheduling attributes which can be changed using the
	  sched_setscheduler() and sched_setparam() interfaces.	 These
	  interfaces do	not affect system scope	thread scheduling.  As with
	  system scope threads changing	these scheduling attributes is a
	  privileged operation.
     Low-overhead Locking    [Toc]    [Back]
	  In order to protect updates to internal pthread data structures, a
	  low-overhead locking mechanism is required.  This locking interface
	  is not user-callable and is contained	entirely within	the pthread
	  library.  When pthreads are present, this locking mechanism is also
	  used to ensure that some routines in libc can	be safely called from
	  multiple threads.  Some examples of this are the stdio(3S) routines
	  and the malloc(3C) routines.
	  By default, these locks spin/sleep for process scope and bound scope
	  threads on multiprocessor systems, and immediately block for system
	  scope	threads	and on single processor	systems.  In the spin/sleep
	  lock path, the lock will be tried 1000 times,	and then nanosleep(2)
	  will be called.  This	process	will be	repeated until the lock	can be
	  obtained.
	  This process can be tuned for	an individual application with the
	  PT_SPINS environment variable.  This determines how many times the
	  lock is tried	before sleeping.  This environment variable is checked
	  once at program startup time.	 Different spin	values may be used to
	  improve application throughput; however, higher values will probably
	  increase user	time while lower values	will probably increase system
	  time.	 In general, applications with more pthreads than processors
	  will probably	benefit	from setting PT_SPINS to a lower value,	while
	  applications with fewer pthreads may benefit from setting the
	  environment variable to a higher value.  If the environment variable
	  is set to 0, these locks will	all become blocking locks.
	  realtime(5) applications and applications that create	only system
	  scope	threads	may benefit from setting PT_SPINS to 0.	 This forces
	  the locks to block immediately instead of having the lock routine do
	  a run-time check to determine	if the caller is a system scope
	  thread.
     Thread Data    [Toc]    [Back]
	  Pthreads share the address space; stacks, text and data are
	  accessible by	all threads in the process.  This means	that access to
	  shared data is simpler than where multiple processes use a shared
	  memory region	which each must	map.  The cost is the potential	for
	  accidental corruption.  Individual thread data is provided using the
	  notion of a shared set of keys and unique thread values bound	to
	  each key.  The pthread_key_create() interface	creates	a new, shared
	  key and pthread_setspecific()	allows a thread	to bind	its own	value
									Page 5
pthreads(5)							   pthreads(5)
	  to a key.  Conceptually, the key is an index to an array of values.
	  The same key used by different threads may retrieve a	different
	  values because each thread has its own array.
     Process Management    [Toc]    [Back]
	  Some basic UNIX interfaces have particular semantics when called
	  from a POSIX threads process.
	  fork(2) creates a new	pthread	process	and follows the	usual rules
	  for inherited	state.	The new	process	has a single pthread (the one
	  which	made the call);	all other pthreads active prior	to fork() are
	  quietly destroyed and	their resources	(stacks	etc.) reclaimed	by the
	  pthread run-time.  A number of issues	arise because fork() causes
	  data to be copied from the parent to the child with no
	  synchronisation with threads that may	be modifying that data.	 To
	  allow	the application	to perform its own synchronisation the
	  pthread_atfork() interface can register handlers to be processed
	  when fork() is used.	In general operations which are	safe to
	  perform after	a fork() are the same as those which are safe to
	  perform in a signal handler.
	  exec(2) overlays a new process image on the calling process; all
	  threads from the old image are destroyed.  Thread termination
	  handlers and thread data destruction	[see pthread_cleanup_push()
	  and pthread_key_create()] are	not performed.	The new	process	will
	  only be a pthread process if the new image is	of a pthread process.
	  exit(2) performs all the usual process clean up operations and then
	  destroys all threads in the process.	Thread termination handlers
	  and thread data destruction  [see pthread_cleanup_push() and
	  pthread_key_create()]	are not	performed.
   Limitations    [Toc]    [Back]
     With POSIX	threads	there are some limitations and practices to avoid.
     Not enough	pthreads
	  Each pthread process has a resource limit called RLIMIT_PTHREAD on
	  the number of	threads	it can create.	The value is inherited by
	  child	processes and may be set using setrlimit(2) or the shell limit
	  and ulimit commands.	The default limits can be changed using	the
	  systune(1M) command on the rlimit_pthread_cur	and rlimit_pthread_max
	  variables.
     Fatal exceptions    [Toc]    [Back]
	  When a thread	executes code in the pthread run-time scheduler	it
	  masks	signals	so that	signal handlers	always run in a	consistent
	  environment.	A side effect of this is that if the thread raises an
	  exception (for example due to	memory corruption) the kernel will
	  terminate the	process.  As an	aid to debugging such problems the
	  environment variable PT_CORE should be set prior to starting the
	  application so that a	core file will be generated.
									Page 6
pthreads(5)							   pthreads(5)
     Stack overrun    [Toc]    [Back]
	  Unlike the default stack in an unthreaded IRIX process, pthread
	  stacks are limited [see pthread_attr_setstacksize()] and a thread
	  may overrun its stack.  By default pthread stacks have a protected
	  region at the	end of the stack [see pthread_attr_setguardsize()]
	  that can turn	some of	these overruns into a protection faults	which
	  is generally preferable to overwriting data.
     Dynamic loading    [Toc]    [Back]
	  The pthread library can be made available to a running process. The
	  pthread DSO can be dynamically loaded	using dlopen(3C) or
	  sgidladd(3C).	 Any pthread calls made	before pthread library is
	  loaded will all return ENOSYS.
     Using sprocs    [Toc]    [Back]
	  The sproc(2) model of	threading is incompatible with POSIX threads.
	  Attempts by an sproc process to create pthreads and vice-versa will
	  be rejected.
     Priority    [Toc]    [Back]
	  Attempts to create a pthread process at a priority outside of	the
	  valid	range will be rejected [see sched_get_priority_min(2) and
	  sched_get_priority_max(2)].  Specifically, weightless	is not a valid
	  pthread priority.
     MAP_LOCAL memory    [Toc]    [Back]
	  The POSIX thread memory model	requires that memory be	available to
	  all threads.	The MAP_LOCAL option to	mmap(2)	is useful only to
	  sproc	processes and should not be used with pthreads.
     Dynamic memory allocation    [Toc]    [Back]
	  The sbrk(2) call is not thread-safe.	It is used by the C run-time
	  memory allocator, malloc(3C),	which is always	used by	the pthread
	  run-time.  Safe use of sbrk(2) outside of the	run-time (for example
	  by a third party memory allocator) is	not therefore possible.
   Documentation    [Toc]    [Back]
     Additional	on-line	documentation about pthreads is	available in:
	  Topics in IRIX Programming: Chapter 13
	  SpeedShop User's Guide: Chapter 6
     pthread_*(3P)
									PPPPaaaaggggeeee 7777[ Back ] |