Sponsored By
Efika 5200B Project
Review and Support, Application optimization

in category Other
proposed by DJBase on 4th June 2006 (accepted on 7th October 2006)
[View Full Project]
  Stream Benchmarks
posted by DJBase on 10th April 2007


My next step was to do some benchmarks under Linux to check the RAM performance. I used the Stream test against Pegasos and Sam440ep board.

EFIKA Linux 2.6.19

-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 2 microseconds.
Each test below will take on the order of 183141 microseconds.
(= 91570 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function----Rate (MB/s)---Avg time----Min time----Max time
Copy:--------192.4569------0.1978------0.1663------0.3896
Scale:--------187.5988------0.1855------0.1706------0.2254
Add:----------195.1688------0.2569------0.2459------0.2920
Triad:---------188.3092------0.3100------0.2549------0.4959
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------

Pegasos II G4/1GHz Linux 2.6.17

-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 90864 microseconds.
(= 90864 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function----Rate (MB/s)---Avg time----Min time----Max time
Copy:--------376.0548------0.0869------0.0851------0.0907
Scale:--------361.1246------0.0907------0.0886------0.0929
Add:----------359.3918------0.1359------0.1336------0.1376
Triad:---------361.1680------0.1357------0.1329------0.1394
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------

Pegasos II G3/600MHz

-------------------------------------------------------------
STREAM version $Revision: 5.6 $
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 2 microseconds.
Each test below will take on the order of 251755 microseconds.
(= 125877 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function----Rate (MB/s)---Avg time---Min time---Max time
Copy:--------132.6981------0.2416-----0.2411-----0.2421
Scale:-------112.7054------0.2849-----0.2839-----0.2859
Add:---------118.6606------0.4054-----0.4045-----0.4063
Triad:-------118.7202------0.4056-----0.4043-----0.4065
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------


AMCC 440EP 533 Mhz Evaluation Board Linux 2.6.18

-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 112887 microseconds.
(= 112887 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function----Rate (MB/s)---Avg time----Min time----Max time
Copy:--------262.6503------0.1221------0.1218------0.1223
Scale:--------245.8928------0.1302------0.1301------0.1303
Add:----------253.5979------0.1893------0.1893------0.1893
Triad:---------251.2734------0.1911------0.1910------0.1912
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------

For further tests, Application optimizations and guides I am waiting now for a MorphOS Release.
Genesi Network: Genesi - Main Site Power2People PowerDeveloper