Statistiques
| Révision :

root / man / man3 / HPL_pdlaswp01T.3

Historique | Voir | Annoter | Télécharger (2,28 ko)

1
.TH HPL_pdlaswp01T 3 "September 10, 2008" "HPL 2.0" "HPL Library Functions"
2
.SH NAME
3
HPL_pdlaswp01T \- Broadcast a column panel L and swap the row panel U.
4
.SH SYNOPSIS
5
\fB\&#include "hpl.h"\fR
6
 
7
\fB\&void\fR
8
\fB\&HPL_pdlaswp01T(\fR
9
\fB\&HPL_T_panel *\fR
10
\fI\&PBCST\fR,
11
\fB\&int *\fR
12
\fI\&IFLAG\fR,
13
\fB\&HPL_T_panel *\fR
14
\fI\&PANEL\fR,
15
\fB\&const int\fR
16
\fI\&NN\fR
17
\fB\&);\fR
18
.SH DESCRIPTION
19
\fB\&HPL_pdlaswp01T\fR
20
applies the  NB  row interchanges to  NN columns of the
21
trailing submatrix and broadcast a column panel.
22
 
23
A "Spread then roll" algorithm performs  the swap :: broadcast  of the
24
row panel U at once,  resulting in a minimal communication volume  and
25
a "very good"  use of the connectivity if available.  With  P  process
26
rows  and  assuming  bi-directional links,  the  running time  of this
27
function can be approximated by:
28
 
29
   (log_2(P)+(P-1)) * lat +   K * NB * LocQ(N) / bdwth
30
 
31
where  NB  is the number of rows of the row panel U,  N is the global
32
number of columns being updated,  lat and bdwth  are the latency  and
33
bandwidth  of  the  network  for  double  precision real words.  K is
34
a constant in (2,3] that depends on the achieved bandwidth  during  a
35
simultaneous  message exchange  between two processes.  An  empirical
36
optimistic value of K is typically 2.4.
37
.SH ARGUMENTS
38
.TP 8
39
PBCST   (local input/output)    HPL_T_panel *
40
On entry,  PBCST  points to the data structure containing the
41
panel (to be broadcast) information.
42
.TP 8
43
IFLAG   (local input/output)    int *
44
On entry, IFLAG  indicates  whether or not  the broadcast has
45
already been completed.  If not,  probing will occur, and the
46
outcome will be contained in IFLAG on exit.
47
.TP 8
48
PANEL   (local input/output)    HPL_T_panel *
49
On entry,  PANEL  points to the data structure containing the
50
panel information.
51
.TP 8
52
NN      (local input)           const int
53
On entry, NN specifies  the  local  number  of columns of the
54
trailing  submatrix  to  be swapped and broadcast starting at
55
the current position. NN must be at least zero.
56
.SH SEE ALSO
57
.BR HPL_pdgesv \ (3),
58
.BR HPL_pdgesvK2 \ (3),
59
.BR HPL_pdupdateNT \ (3),
60
.BR HPL_pdupdateTT \ (3),
61
.BR HPL_pipid \ (3),
62
.BR HPL_plindx1 \ (3),
63
.BR HPL_plindx10 \ (3),
64
.BR HPL_spreadT \ (3),
65
.BR HPL_equil \ (3),
66
.BR HPL_rollT \ (3),
67
.BR HPL_dlaswp10N \ (3),
68
.BR HPL_dlaswp01T \ (3),
69
.BR HPL_dlaswp06T \ (3).