.TH HPL_pdlaswp01N 3 "September 10, 2008" "HPL 2.0" "HPL Library Functions"
.SH NAME
HPL_pdlaswp01N \- Broadcast a column panel L and swap the row panel U.
.SH SYNOPSIS
\fB\&#include "hpl.h"\fR

\fB\&void\fR
\fB\&HPL_pdlaswp01N(\fR
\fB\&HPL_T_panel *\fR
\fI\&PBCST\fR,
\fB\&int *\fR
\fI\&IFLAG\fR,
\fB\&HPL_T_panel *\fR
\fI\&PANEL\fR,
\fB\&const int\fR
\fI\&NN\fR
\fB\&);\fR
.SH DESCRIPTION
\fB\&HPL_pdlaswp01N\fR
applies the NB row interchanges to NN columns of the
trailing submatrix and broadcasts a column panel.
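
As a purely serial illustration of what applying row interchanges to NN
columns means, the sketch below swaps rows in the leading columns of a small
column-major matrix. The helper name apply_row_swaps, the pivot array ipiv
and the storage layout are assumptions made for this example; the routine
documented here works on distributed data described by HPL_T_panel and
combines the swap with communication, which this sketch does not attempt.

```c
#include <assert.h>

/* Hypothetical helper, not part of HPL: for i = 0 .. nb-1, swap row i with
 * row ipiv[i] in the first nn columns of the column-major matrix a, whose
 * leading dimension is lda. Columns nn and beyond are left untouched. */
static void apply_row_swaps(double *a, int lda, int nn, int nb,
                            const int *ipiv)
{
    int i, j;

    assert(nn >= 0 && nb >= 0);       /* mirrors "NN must be at least zero" */
    for (i = 0; i < nb; i++) {
        int p = ipiv[i];
        assert(p >= 0 && p < lda);    /* pivot row must lie inside a column */
        if (p == i)
            continue;                 /* nothing to exchange */
        for (j = 0; j < nn; j++) {
            double t = a[i + j * lda];
            a[i + j * lda] = a[p + j * lda];
            a[p + j * lda] = t;
        }
    }
}
```

With a 4 x 3 matrix, lda = 4, nn = 2, nb = 2 and ipiv = {2, 3}, rows 0 and 2
and then rows 1 and 3 are exchanged in the first two columns only.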

A "spread then roll" algorithm performs the swap and the broadcast of
the row panel U in a single combined operation, resulting in a minimal
communication volume and very good use of the network connectivity
where it is available. With P process rows and assuming
bi-directional links, the running time of this function can be
approximated by:

   (log_2(P)+(P-1)) * lat + K * NB * LocQ(N) / bdwth

where NB is the number of rows of the row panel U, N is the global
number of columns being updated, and lat and bdwth are the latency and
bandwidth of the network for double precision real words. K is
a constant in (2,3] that depends on the bandwidth achieved during a
simultaneous message exchange between two processes. An empirical,
optimistic value of K is typically 2.4.
.SH ARGUMENTS
.TP 8
PBCST (local input/output) HPL_T_panel *
On entry, PBCST points to the data structure containing the
panel (to be broadcast) information.
.TP 8
IFLAG (local input/output) int *
On entry, IFLAG indicates whether or not the broadcast has
already been completed. If not, probing will occur, and the
outcome will be contained in IFLAG on exit.
.TP 8
PANEL (local input/output) HPL_T_panel *
On entry, PANEL points to the data structure containing the
panel information.
.TP 8
NN (local input) const int
On entry, NN specifies the local number of columns of the
trailing submatrix to be swapped and broadcast starting at
the current position. NN must be at least zero.
.SH SEE ALSO
.BR HPL_pdgesv \ (3),
.BR HPL_pdgesvK2 \ (3),
.BR HPL_pdupdateNN \ (3),
.BR HPL_pdupdateTN \ (3),
.BR HPL_pipid \ (3),
.BR HPL_plindx1 \ (3),
.BR HPL_plindx10 \ (3),
.BR HPL_spreadN \ (3),
.BR HPL_equil \ (3),
.BR HPL_rollN \ (3),
.BR HPL_dlaswp00N \ (3),
.BR HPL_dlaswp01N \ (3),
.BR HPL_dlaswp06N \ (3).