timing.txt
上传用户:lyxiangda
上传日期:2007-01-12
资源大小:3042k
文件大小:11k
- MPI Library Timing Tests
- Hardware/OS
- (A) SGI O2 1 x MIPS R10000 250MHz IRIX 6.5.3
- (B) IBM RS/6000 43P-240 1 x PowerPC 603e 223MHz AIX 4.3
- (C) Dell GX1/L+ 1 x Pentium III 550MHz Linux 2.2.12-20
- (D) PowerBook G3 1 x PowerPC 750 266MHz LinuxPPC 2.2.6-15apmac
- (E) PowerBook G3 1 x PowerPC 750 266MHz MacOS 8.5.1
- (F) PowerBook G3 1 x PowerPC 750 400MHz MacOS 9.0.2
- Compiler
- (1) MIPSpro C 7.2.1 -O3 optimizations
- (2) GCC 2.95.1 -O3 optimizations
- (3) IBM AIX xlc -O3 optimizations (version unknown)
- (4) EGCS 2.91.66 -O3 optimizations
- (5) Metrowerks CodeWarrior 5.0 C, all optimizations
- (6) MIPSpro C 7.30 -O3 optimizations
- (7) same as (6), with optimized libmalloc.so
- Timings are given in seconds, computed using the C library's clock()
- function. The first column gives the hardware and compiler
- configuration used for the test. The second column indicates the
- number of tests that were aggregated to get the statistics for that
- size. These were compiled using 16 bit digits.
- Source data were generated randomly using a fixed seed, so they should
- be internally consistent, but may vary on different systems depending
- on the C library. Also, since the resolution of the timer accessed by
- clock() varies, there may be some variance in the precision of these
- measurements.
- Prime Generation (primegen)
- 128 bits:
- A1 200 min=0.03, avg=0.19, max=0.72, sum=38.46
- A2 200 min=0.02, avg=0.16, max=0.62, sum=32.55
- B3 200 min=0.01, avg=0.07, max=0.22, sum=13.29
- C4 200 min=0.00, avg=0.03, max=0.20, sum=6.14
- D4 200 min=0.00, avg=0.05, max=0.33, sum=9.70
- A6 200 min=0.01, avg=0.09, max=0.36, sum=17.48
- A7 200 min=0.00, avg=0.05, max=0.24, sum=10.07
- 192 bits:
- A1 200 min=0.05, avg=0.45, max=3.13, sum=89.96
- A2 200 min=0.04, avg=0.39, max=2.61, sum=77.55
- B3 200 min=0.02, avg=0.18, max=1.25, sum=36.97
- C4 200 min=0.01, avg=0.09, max=0.33, sum=18.24
- D4 200 min=0.02, avg=0.15, max=0.54, sum=29.63
- A6 200 min=0.02, avg=0.24, max=1.70, sum=47.84
- A7 200 min=0.01, avg=0.15, max=1.05, sum=30.88
- 256 bits:
- A1 200 min=0.08, avg=0.92, max=6.13, sum=184.79
- A2 200 min=0.06, avg=0.76, max=5.03, sum=151.11
- B3 200 min=0.04, avg=0.41, max=2.68, sum=82.35
- C4 200 min=0.02, avg=0.19, max=0.69, sum=37.91
- D4 200 min=0.03, avg=0.31, max=1.15, sum=63.00
- A6 200 min=0.04, avg=0.48, max=3.13, sum=95.46
- A7 200 min=0.03, avg=0.37, max=2.36, sum=73.60
- 320 bits:
- A1 200 min=0.11, avg=1.59, max=6.14, sum=318.81
- A2 200 min=0.09, avg=1.27, max=4.93, sum=254.03
- B3 200 min=0.07, avg=0.82, max=3.13, sum=163.80
- C4 200 min=0.04, avg=0.44, max=1.91, sum=87.59
- D4 200 min=0.06, avg=0.73, max=3.22, sum=146.73
- A6 200 min=0.07, avg=0.93, max=3.50, sum=185.01
- A7 200 min=0.05, avg=0.76, max=2.94, sum=151.78
- 384 bits:
- A1 200 min=0.16, avg=2.69, max=11.41, sum=537.89
- A2 200 min=0.13, avg=2.15, max=9.03, sum=429.14
- B3 200 min=0.11, avg=1.54, max=6.49, sum=307.78
- C4 200 min=0.06, avg=0.81, max=4.84, sum=161.13
- D4 200 min=0.10, avg=1.38, max=8.31, sum=276.81
- A6 200 min=0.11, avg=1.73, max=7.36, sum=345.55
- A7 200 min=0.09, avg=1.46, max=6.12, sum=292.02
- 448 bits:
- A1 200 min=0.23, avg=3.36, max=15.92, sum=672.63
- A2 200 min=0.17, avg=2.61, max=12.25, sum=522.86
- B3 200 min=0.16, avg=2.10, max=9.83, sum=420.86
- C4 200 min=0.09, avg=1.44, max=7.64, sum=288.36
- D4 200 min=0.16, avg=2.50, max=13.29, sum=500.17
- A6 200 min=0.15, avg=2.31, max=10.81, sum=461.58
- A7 200 min=0.14, avg=2.03, max=9.53, sum=405.16
- 512 bits:
- A1 200 min=0.30, avg=6.12, max=22.18, sum=1223.35
- A2 200 min=0.25, avg=4.67, max=16.90, sum=933.18
- B3 200 min=0.23, avg=4.13, max=14.94, sum=825.45
- C4 200 min=0.13, avg=2.08, max=9.75, sum=415.22
- D4 200 min=0.24, avg=4.04, max=20.18, sum=808.11
- A6 200 min=0.22, avg=4.47, max=16.19, sum=893.83
- A7 200 min=0.20, avg=4.03, max=14.65, sum=806.02
- Modular Exponentation (metime)
- The following results are aggregated from 200 pseudo-randomly
- generated tests, based on a fixed seed.
- base, exponent, and modulus size (bits)
- P/C 128 192 256 320 384 448 512 640 768 896 1024
- ------- -----------------------------------------------------------------
- A1 0.015 0.027 0.047 0.069 0.098 0.133 0.176 0.294 0.458 0.680 1.040
- A2 0.013 0.024 0.037 0.053 0.077 0.102 0.133 0.214 0.326 0.476 0.668
- B3 0.005 0.011 0.021 0.036 0.056 0.084 0.121 0.222 0.370 0.573 0.840
- C4 0.002 0.006 0.011 0.020 0.032 0.048 0.069 0.129 0.223 0.344 0.507
- D4 0.004 0.010 0.019 0.034 0.056 0.085 0.123 0.232 0.390 0.609 0.899
- E5 0.007 0.015 0.031 0.055 0.088 0.133 0.183 0.342 0.574 0.893 1.317
- A6 0.008 0.016 0.038 0.042 0.064 0.093 0.133 0.239 0.393 0.604 0.880
- A7 0.005 0.011 0.020 0.036 0.056 0.083 0.121 0.223 0.374 0.583 0.855
- Multiplication and Squaring tests, (mulsqr)
- The following results are aggregated from 500000 pseudo-randomly
- generated tests, based on a per-run wall-clock seed. Times are given
- in seconds, except where indicated in microseconds (us).
- (A1)
- bits multiply square ad percent time/mult time/square
- 64 9.33 9.15 > 1.9 18.7us 18.3us
- 128 10.88 10.44 > 4.0 21.8us 20.9us
- 192 13.30 11.89 > 10.6 26.7us 23.8us
- 256 14.88 12.64 > 15.1 29.8us 25.3us
- 320 18.64 15.01 > 19.5 37.3us 30.0us
- 384 23.11 17.70 > 23.4 46.2us 35.4us
- 448 28.28 20.88 > 26.2 56.6us 41.8us
- 512 34.09 24.51 > 28.1 68.2us 49.0us
- 640 47.86 33.25 > 30.5 95.7us 66.5us
- 768 64.91 43.54 > 32.9 129.8us 87.1us
- 896 84.49 55.48 > 34.3 169.0us 111.0us
- 1024 107.25 69.21 > 35.5 214.5us 138.4us
- 1536 227.97 141.91 > 37.8 456.0us 283.8us
- 2048 394.05 242.15 > 38.5 788.1us 484.3us
- (A2)
- bits multiply square ad percent time/mult time/square
- 64 7.87 7.95 < 1.0 15.7us 15.9us
- 128 9.40 9.19 > 2.2 18.8us 18.4us
- 192 11.15 10.59 > 5.0 22.3us 21.2us
- 256 12.02 11.16 > 7.2 24.0us 22.3us
- 320 14.62 13.43 > 8.1 29.2us 26.9us
- 384 17.72 15.80 > 10.8 35.4us 31.6us
- 448 21.24 18.51 > 12.9 42.5us 37.0us
- 512 25.36 21.78 > 14.1 50.7us 43.6us
- 640 34.57 29.00 > 16.1 69.1us 58.0us
- 768 46.10 37.60 > 18.4 92.2us 75.2us
- 896 58.94 47.72 > 19.0 117.9us 95.4us
- 1024 73.76 59.12 > 19.8 147.5us 118.2us
- 1536 152.00 118.80 > 21.8 304.0us 237.6us
- 2048 259.41 199.57 > 23.1 518.8us 399.1us
- (B3)
- bits multiply square ad percent time/mult time/square
- 64 2.60 2.47 > 5.0 5.20us 4.94us
- 128 4.43 4.06 > 8.4 8.86us 8.12us
- 192 7.03 6.10 > 13.2 14.1us 12.2us
- 256 10.44 8.59 > 17.7 20.9us 17.2us
- 320 14.44 11.64 > 19.4 28.9us 23.3us
- 384 19.12 15.08 > 21.1 38.2us 30.2us
- 448 24.55 19.09 > 22.2 49.1us 38.2us
- 512 31.03 23.53 > 24.2 62.1us 47.1us
- 640 45.05 33.80 > 25.0 90.1us 67.6us
- 768 63.02 46.05 > 26.9 126.0us 92.1us
- 896 83.74 60.29 > 28.0 167.5us 120.6us
- 1024 106.73 76.65 > 28.2 213.5us 153.3us
- 1536 228.94 160.98 > 29.7 457.9us 322.0us
- 2048 398.08 275.93 > 30.7 796.2us 551.9us
- (C4)
- bits multiply square ad percent time/mult time/square
- 64 1.34 1.28 > 4.5 2.68us 2.56us
- 128 2.76 2.59 > 6.2 5.52us 5.18us
- 192 4.52 4.16 > 8.0 9.04us 8.32us
- 256 6.64 5.99 > 9.8 13.3us 12.0us
- 320 9.20 8.13 > 11.6 18.4us 16.3us
- 384 12.01 10.58 > 11.9 24.0us 21.2us
- 448 15.24 13.33 > 12.5 30.5us 26.7us
- 512 19.02 16.46 > 13.5 38.0us 32.9us
- 640 27.56 23.54 > 14.6 55.1us 47.1us
- 768 37.89 31.78 > 16.1 75.8us 63.6us
- 896 49.24 41.42 > 15.9 98.5us 82.8us
- 1024 62.59 52.18 > 16.6 125.2us 104.3us
- 1536 131.66 107.72 > 18.2 263.3us 215.4us
- 2048 226.45 182.95 > 19.2 453.0us 365.9us
- (A7)
- bits multiply square ad percent time/mult time/square
- 64 1.74 1.71 > 1.7 3.48us 3.42us
- 128 3.48 2.96 > 14.9 6.96us 5.92us
- 192 5.74 4.60 > 19.9 11.5us 9.20us
- 256 8.75 6.61 > 24.5 17.5us 13.2us
- 320 12.5 8.99 > 28.1 25.0us 18.0us
- 384 16.9 11.9 > 29.6 33.8us 23.8us
- 448 22.2 15.2 > 31.7 44.4us 30.4us
- 512 28.3 19.0 > 32.7 56.6us 38.0us
- 640 42.4 28.0 > 34.0 84.8us 56.0us
- 768 59.4 38.5 > 35.2 118.8us 77.0us
- 896 79.5 51.2 > 35.6 159.0us 102.4us
- 1024 102.6 65.5 > 36.2 205.2us 131.0us
- 1536 224.3 140.6 > 37.3 448.6us 281.2us
- 2048 393.4 244.3 > 37.9 786.8us 488.6us
- ------------------------------------------------------------------
- The contents of this file are subject to the Mozilla Public
- License Version 1.1 (the "License"); you may not use this file
- except in compliance with the License. You may obtain a copy of
- the License at http://www.mozilla.org/MPL/
- Software distributed under the License is distributed on an "AS
- IS" basis, WITHOUT WARRANTY OF ANY KIND, either express or
- implied. See the License for the specific language governing
- rights and limitations under the License.
- The Original Code is the MPI Arbitrary Precision Integer Arithmetic
- library.
- The Initial Developer of the Original Code is
- Michael J. Fromberger <sting@linguist.dartmouth.edu>
- Portions created by Michael J. Fromberger are
- Copyright (C) 1998, 2000 Michael J. Fromberger. All Rights Reserved.
- Contributor(s):
- Alternatively, the contents of this file may be used under the
- terms of the GNU General Public License Version 2 or later (the
- "GPL"), in which case the provisions of the GPL are applicable
- instead of those above. If you wish to allow use of your
- version of this file only under the terms of the GPL and not to
- allow others to use your version of this file under the MPL,
- indicate your decision by deleting the provisions above and
- replace them with the notice and other provisions required by
- the GPL. If you do not delete the provisions above, a recipient
- may use your version of this file under either the MPL or the GPL.
- $Id: timing.txt,v 1.1 2000/07/14 00:44:38 nelsonb%netscape.com Exp $