chessforyou Bettina&Terry77
chessforyou Bettina&Terry77
chessforyou Bettina&Terry77
Would you like to react to this message? Create an account in a few clicks or log in to continue.

chessforyou Bettina&Terry77


 
HomeLatest imagesRegisterLog in
WELCOME TO FORUM OF Angels77 * named in memory of Bettina & Terry
Search
 
 

Display results as :
 
Rechercher Advanced Search
Search
 
 

Display results as :
 
Rechercher Advanced Search
Latest topics
Latest topics
Navigation
 Portal
 Index
 Memberlist
 Profile
 FAQ
 Search
Navigation
 Portal
 Index
 Memberlist
 Profile
 FAQ
 Search
Forum
Forum
Affiliates
free forum
 


Affiliates
free forum
 



 

 SPCC: Testrun of Stockfish 201214 finished

Go down 
2 posters
AuthorMessage
LondonFrau
Admin
Admin
LondonFrau


Female Posts : 1293
Reputation : 3534
Join date : 2010-02-27
Location : ???

SPCC: Testrun of Stockfish 201214 finished Empty
PostSubject: SPCC: Testrun of Stockfish 201214 finished   SPCC: Testrun of Stockfish 201214 finished EmptyWed Dec 16, 2020 4:45 pm

Stockfish testing
 
Playing conditions:
 
Hardware: Since 20/07/21 AMD Ryzen 3900 12-core (24 threads) notebook with 32GB RAM. Now, 20 games are played simultaneously (!), so from now, each testrun will have 6000 or 7000 games (instead of 5000 before) and will take only 2 days, not 6-7 days as before! From now, all engine-binaries are popcount/avx2, of course, because bmi2-compiles are extremly slow on AMD. To keep the rating-list engine-names consistent, the "bmi2"- or "pext"-extension in the engine-name is still in use for older engines - otherwise ORDO will not calculate all played games by this engine as one engine...
Speed: (singlethread, TurboBoost-mode switched off, chess starting position) Stockfish: 1.3 mn/s, Komodo: 1.1 mn/s
Hash: 256MB per engine
GUI: Cutechess-cli (GUI ends game, when a 5-piece endgame is on the board)
Tablebases: None for engines, 5 Syzygy for cutechess-cli
Openings: HERT_500 testset (by Thomas Zipproth) (download the file at the "Download & Links"-section or [You must be registered and logged in to see this link.])
Ponder, Large Memory Pages & learning: Off
Thinking time: 180''+1000ms (= 3'+1'') per game/engine (average game-duration: around  7.5 minutes). One 7000 games-testrun takes about 2 days.The version-numbers of the Stockfish engines are the date of the latest patch, which was included in the Stockfish sourcecode, not the release-date of the engine-file, written backwards (year,month,day))(example: 200807 = August, 7, 2020). The used SF compile is the AVX2-compile, which is the fastest on my AMD Ryzen CPU. SF binaries are taken from [You must be registered and logged in to see this link.] (except the official SF-release versions, which are taken form the official Stockfish website).
Download BrainFish (and the Cerebellum-Libraries)[You must be registered and logged in to see this link.]
 
To avoid distortions in the Ordo Elo-calculation, from now, only 3x Stockfish (latest official release + the latest 2 dev-versions) and 1x Brainfish are stored in the gamebase (all older engine-versions games will be deleted, every time, when a new version was tested). Stockfish and BrainFish older Elo-results can still be seen in the Elo-diagrams below. BrainFish plays always with the latest Cerebellum-Libraries of course, because otherwise BrainFish = Stockfish.
 
Latest update: 2020/12/16: Stockfish 201214 avx2 (+2 Elo to Stockfish 201205)
 
(Ordo-calculation fixed to Stockfish 12 = 3684 Elo)
 
See the individual statistics of engine-results [You must be registered and logged in to see this link.]
Download the current gamebase [You must be registered and logged in to see this link.]
 
     Program                      Elo    +    -   Games   Score   Av.Op.  Draws
   1 CFish 12 3xCerebellum      : 3726    8    8  7000    86.1 %   3389   27.3 %
   2 Stockfish 201214 avx2      : 3725    8    8  7000    78.4 %   3472   41.2 %
   3 Stockfish 201205 avx2      : 3723    8    8  7000    78.2 %   3472   41.5 %
   4 CFish 12 avx2              : 3704    8    8  7000    84.6 %   3389   29.1 %
   5 Stockfish 12 200902        : 3684    5    5 22000    78.0 %   3436   39.9 %
   6 KomodoDragon 1.0 avx2      : 3653    6    6 10000    70.7 %   3471   45.7 %
   7 SF 200910 miniNNue avx2    : 3616    7    7  7000    72.1 %   3437   43.2 %
   8 Stockfish 200731 popc      : 3602    8    8  7000    80.5 %   3345   36.2 %
   9 Stockfish 11 200118        : 3565    5    5 17000    69.5 %   3403   41.6 %
  10 Stockfish 10 181129        : 3525    5    5 15000    78.5 %   3288   37.7 %
  11 KomodoDragon 1.0 MCTS      : 3479    6    6  7000    57.7 %   3424   56.9 %
  12 Stockfish 9 180201         : 3475    8    8  5000    74.9 %   3273   41.7 %
  13 Komodo 14.1 x64            : 3454    6    6  8000    56.3 %   3411   55.6 %
  14 Komodo 14 bmi2             : 3445    4    4 20000    52.1 %   3433   51.4 %
  15 Houdini 6 pext             : 3440    2    2 48000    57.2 %   3388   46.8 %
  16 Nemorino 6.00 avx2         : 3440    4    4 17000    47.3 %   3467   50.7 %
  17 Komodo 13.3 bmi2           : 3439    6    6  8000    62.8 %   3343   49.9 %
  18 Komodo 13.1 bmi2           : 3425    5    5 11000    62.0 %   3334   48.8 %
  19 Komodo 12.3 bmi2           : 3413    7    7  7000    62.7 %   3314   49.4 %
  20 Ethereal 12.75 avx2        : 3399    4    4 16000    41.5 %   3474   49.9 %
  21 Ethereal 12.62 avx2        : 3390    6    6  8000    49.1 %   3402   54.6 %
  22 Slow Chess 2.4 popc        : 3371    5    5 12000    35.9 %   3493   46.8 %
  23 Ethereal 12.50 popc        : 3356    6    6  8000    46.9 %   3387   55.5 %
  24 Slow Chess 2.3 popc        : 3345    4    4 15000    43.2 %   3402   52.4 %
  25 Komodo 14 MCTS             : 3340    8    8  5000    44.4 %   3386   53.4 %
  26 Ethereal 12.25 pext        : 3338    5    5 12000    35.2 %   3471   46.4 %
  27 Slow Chess 2.2 popc        : 3329    5    5 11000    32.9 %   3483   42.7 %
  28 RubiChess 1.9dev nnue      : 3320    6    6 10000    27.2 %   3522   40.0 %
  29 Ethereal 12.00 pext        : 3317    6    6  9000    43.1 %   3371   50.8 %
  30 Igel 2.8.0 popavx2         : 3315    6    6  8000    37.1 %   3419   48.6 %
  31 Ethereal 11.75 pext        : 3309    6    6  9000    39.3 %   3392   53.2 %
  32 Xiphos 0.6 bmi2            : 3304    3    3 32000    36.7 %   3418   49.0 %
  33 Fire 7.1 popc              : 3301    2    2 42000    42.0 %   3371   50.7 %
  34 Xiphos 0.5.6 bmi2          : 3289    7    7  7000    41.2 %   3356   54.6 %
  35 Minic 2.51 nasc_nutr       : 3284    6    6  7000    31.2 %   3437   45.0 %
  36 Ethereal 11.53 pext        : 3281    6    6  7000    42.2 %   3343   53.4 %
  37 Komodo 12.3 MCTS           : 3276    7    7  7000    42.7 %   3334   46.3 %
  38 Ethereal 11.25 pext        : 3272    7    7  6000    38.4 %   3362   51.0 %
  39 rofChade 2.3 bmi2          : 3258    5    5 11000    33.8 %   3388   47.5 %
  40 Booot 6.4 popc             : 3245    7    7  6000    31.1 %   3394   46.5 %
  41 Schooner 2.2 popc          : 3242    7    7  6000    31.3 %   3391   50.3 %
  42 Laser 1.7 bmi2             : 3219    8    8  6000    30.8 %   3371   45.8 %
  43 Fizbo 2 bmi2               : 3214    8    8  5000    36.0 %   3325   39.0 %
  44 Fritz 17                   : 3213    8    8  6000    29.4 %   3377   44.2 %
  45 Shredder 13 x64            : 3210    8    8  6000    31.9 %   3359   42.6 %
  46 RubiChess 1.8 popc         : 3208    6    6  7000    32.0 %   3345   46.1 %
  47 Defenchess 2.2 popc        : 3206    8    8  5000    26.6 %   3395   41.8 %
  48 Booot 6.3.1 popc           : 3200    8    8  5000    34.0 %   3328   44.1 %
  49 Andscacs 0.95 popc         : 3168    9    9  5000    23.1 %   3391   35.4 %
The version-numbers (180622 for example) of the engines are the date of the latest patch, which was included in the Stockfish sourcecode, not the release-date of the engine-file. Especially the asmFish-engines are often released much later!!
Below you find a diagram of the progress of Stockfish in my tests since August 2020
And below that diagram, the older diagrams.
 
You can save the diagrams (as a JPG-picture (in originial size)) on your PC with mouseclick (right button) and then choose "save image"...
The Elo-ratings of older Stockfish dev-versions in the Ordo-calculation can be a little different to the Elo-"dots" in the diagram, because the results/games of new Stockfish dev-versions - when getting part of the Ordo-calculation - can change the Elo-ratings of the opponent engines and that can change the Elo-ratings of older Stockfish dev-versions (in the Ordo-calculation / ratinglist, but not in the diagram, where all Elo-"dots" are the rating of one Stockfish dev-version at the moment, when the testrun of that Stockfish dev-version was finished).
SPCC: Testrun of Stockfish 201214 finished Stockfishaktuell
 
 
 
SPCC: Testrun of Stockfish 201214 finished 41390979_1000_13_optimized

_________________
Bettina...............The greatest happiness of life is the conviction that we are loved.

Victor Hugo

"B", Doppelganger, theropodes, Belladonna, When D Green ? and vv56 like this post

Back to top Go down
theropodes
V.I.P.MEMBER
V.I.P.MEMBER
theropodes


Posts : 187
Reputation : 584
Join date : 2010-05-27
Location : Inside the Dark

SPCC: Testrun of Stockfish 201214 finished Empty
PostSubject: Re: SPCC: Testrun of Stockfish 201214 finished   SPCC: Testrun of Stockfish 201214 finished EmptyWed Dec 16, 2020 8:05 pm

Whaou, congrat for the job and many thanks for sharing.
I like so you benchmark elo that include cfish. This not the case with [You must be registered and logged in to see this link.]

"B", Doppelganger, LondonFrau, Belladonna and When D Green ? like this post

Back to top Go down
 
SPCC: Testrun of Stockfish 201214 finished
Back to top 
Page 1 of 1
 Similar topics
-
» SPCC: Testrun of Stockfish 170526 finished
»  SPCC: Testrun of Stockfish 170423 finished ....Stefan Pohl
» SPCC: Testrun of Komodo 10 finished
» SPCC testrun of KomodoDragon 2.5 MCTS finished
» SPCC: Testrun of BrainFish 180613 finished

Permissions in this forum:You cannot reply to topics in this forum
chessforyou Bettina&Terry77 :: ENGlNES :: tests and tournements-
Jump to: