It is currently Thu Sep 04, 2025 9:57 am

All times are UTC - 8 hours [ DST ]




Post new topic Reply to topic  [ 216 posts ]  Go to page Previous  1 ... 5, 6, 7, 8, 9, 10, 11  Next
Author Message
Offline
 Post subject: Re: KataGo V1.3
Post #141 Posted: Mon Mar 02, 2020 2:34 am 
Dies in gote

Posts: 63
Liked others: 8
Was liked: 4
jann wrote:
...whichever side wins at 1000 visits will likely also win at 10000 visits.
How do you reach that conclusion?

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #142 Posted: Mon Mar 02, 2020 2:39 am 
Dies in gote

Posts: 63
Liked others: 8
Was liked: 4
jann wrote:
...since the test was not about raw strength but scalability itself.
I don't think we are talking about raw strength here, are we?

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #143 Posted: Mon Mar 02, 2020 4:36 am 
Lives in gote

Posts: 652
Location: Adelaide, South Australia
Liked others: 219
Was liked: 281
Rank: Australian 3 dan
GD Posts: 200
Thanks Uberdude for the very clear and detailed explanation! In the light of that, the other comments are making a lot more sense now :-) To summarise:
  • "Visits" is often used to mean total number of visits to the root node.
  • If analysing a single position in isolation, visits (in this sense) and playouts are the same thing.
  • When playing a game, the convention is that "visits" includes tree reuse from previous moves but "playouts" doesn't.
  • Limiting an engine to "n visits per move" means that if the visit count (including tree reuse) is less than n then you keep adding playouts until you get to n.
  • Limiting an engine to "n playouts per move" means that you do n playouts every move in addition to any tree reuse.
  • With tree reuse, visits/second is a higher number than playouts/second because you're counting the re-used visits again.

jann wrote:
Another example is when you find an otherwise weaker side ahead, because of higher extent of tree reuse (thus effectively more but weaker search).

This one I'm still finding hard to imagine. Tree reuse happens when the opponent plays a move that you've already explored, so you can reuse that part of the tree. Tree reuse is maximised when the opponent plays the most explored move, which is often the move that you assess as best. So if the "weaker" net is getting a lot of tree reuse, that means the "stronger" net is consistently choosing the same move that the "weaker" net would have picked. It looks to me like the so-called "stronger" net in this scenario isn't actually that much stronger?

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #144 Posted: Mon Mar 02, 2020 6:31 am 
Lives in gote

Posts: 445
Liked others: 0
Was liked: 37
xela wrote:
jann wrote:
Another example is when you find an otherwise weaker side ahead, because of higher extent of tree reuse (thus effectively more but weaker search).

This one I'm still finding hard to imagine. Tree reuse happens when the opponent plays a move that you've already explored, so you can reuse that part of the tree. Tree reuse is maximised when the opponent plays the most explored move, which is often the move that you assess as best.

The point here that the less of a branching factor a policy has, the narrower tree it builds in its memory, the more the potential for reuse. So if the weaker net only looks at 2 moves everywhere (vs, say, 3 for the stronger one), it may be weaker with blind spots, but it will benefit from tree reuse more.

As mentioned earlier, this may account for a, say, 1.5x search speed advantage. Then if you don't know this specifically, and only have the result of a test match at 1000 playouts, you are less likely to correctly predict the result of 10000 playouts match (your actual use case). Basically you are in same situation like if you did time based test on unknown hardware - there is an unclear speed related factor that affects your results in unknown ways and extent, and not necessarily the same way during test than during later usage (speed vs strength works very differently at low search than at high search).

The wider your test is, the more factors you allow to affect its result, the more tests you need to perform to get the same knowledge/confidence (because first you need to guess each individual factor from the results). Again this is for the case where your use case / conditon is significantly different and you cannot test on it directly (otherwise you don't need to know individual factors and are fine with a single test there, since you can be sure all factors will work the same way during test than during later usage).

Limeztone wrote:
jann wrote:
...whichever side wins at 1000 visits will likely also win at 10000 visits.
How do you reach that conclusion?

Look above where we talked about the linked scalability graph. Same winner at 1000 visits as at 10000 visits = curves don't cross the 1.0 line. Unknown bonus from tree reuse = different winner at 1000 vs 2000 visits than at 10000 vs 20000 visits = some curves cross the 2.0 line etc.

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #145 Posted: Sat Mar 07, 2020 12:47 am 
Lives with ko

Posts: 150
Liked others: 200
Was liked: 30
Rank: 25 kyu
Will future versions have a way to set minimum playouts per move?

new 20b 800v against elf2 800v on cgos: 30-1 wow

http://www.yss-aya.com/cgos/19x19/cross ... 3v800.html

s191 had 75% win rate

_________________
set katago to play at your level https://docdro.id/sHZU1ti or experiment with gtp4zen ( https://rb.gy/kx2ilb )

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #146 Posted: Sat Mar 07, 2020 9:48 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
50 game test
KataGo 1.3.3 (g170e-b20c256x2-s2430231552-d525879064) v. LZ017 (#268)
1600 visits for both (~ time parity)
Katago wins 37-13 (74%)
twogtp 1.5.1, no error, no duplicate game, all games by resignation.


Stats : (Katago always shows as W, because of the command -alternate)
Attachment:
1.jpg
1.jpg [ 346.39 KiB | Viewed 10234 times ]


This post by Vargo was liked by: go4thewin
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #147 Posted: Sun Mar 08, 2020 2:17 pm 
Gosei
User avatar

Posts: 1464
Liked others: 212
Was liked: 215
Vargo games lost from the ladder, do you consider? are there any statistics, with the promotion LZ the number of games lost due to the ladder decreases? and how does playout affect it?

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #148 Posted: Mon Mar 09, 2020 8:11 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
And wrote:
the ladder, do you consider?
No, I haven't looked at the games. If you want, you can see for yourself. KataGo is W in every even-numbered game, and the 13 games lost by KataGo are n° 1,12,15,16,20,21,23,24,25,32,34,41,47.
Attachment:
kata133_LZ268.zip [43.52 KiB]
Downloaded 455 times

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #149 Posted: Mon Mar 09, 2020 5:40 pm 
Dies with sente

Posts: 101
Liked others: 2
Was liked: 16
Rank: KGS 2 D
When a bot knows ladder, many interesting things happen, like this one.


Attachments:
kata-265.sgf [2.2 KiB]
Downloaded 417 times
Untitled-go.png
Untitled-go.png [ 583.98 KiB | Viewed 10036 times ]

This post by splee99 was liked by: go4thewin
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #150 Posted: Tue Mar 10, 2020 9:30 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
Another 50 game test with the exact same commands
KataGo 1.3.3 (g170e-b20c256x2-s2430231552-d525879064) v. LZ017 (#268)
1600 visits for both (~ time parity)
Katago wins 34-16 = 68% ( Last time it was 37-13 = 74%)
twogtp 1.5.1, no error, no duplicate game, all games by resignation.


Stats : (Katago always shows as W, because of the command -alternate, so, B+R means Katago lost)
Attachment:
1.jpg
1.jpg [ 277.11 KiB | Viewed 9971 times ]
Attachment:
2.jpg
2.jpg [ 656.27 KiB | Viewed 9971 times ]
The games :
Attachment:
kata133_LZ268b.zip [45.16 KiB]
Downloaded 410 times


This post by Vargo was liked by: go4thewin
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #151 Posted: Sat Mar 14, 2020 7:04 am 
Lives with ko

Posts: 150
Liked others: 200
Was liked: 30
Rank: 25 kyu
with the ladder code version of leela zero, it's much more even.

https://github.com/yssaya/leela-zero-ladder/releases

katago s243 vs lz 270 ladder version, both bots 1 thread 512 playouts:

lz wins 3 games kata wins 1. Katago lost the first three but won the last easily

_________________
set katago to play at your level https://docdro.id/sHZU1ti or experiment with gtp4zen ( https://rb.gy/kx2ilb )

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #152 Posted: Sat Mar 14, 2020 9:48 am 
Lives in sente

Posts: 759
Liked others: 114
Was liked: 916
Rank: maybe 2d
Some new nets have been uploaded:

Roughly 50 Elo gain for the 20-block extended training net, and slightly more than 100 Elo gain for the 30 and 40-block nets. Enjoy!

https://github.com/lightvector/KataGo/releases


This post by lightvector was liked by 4 people: And, ez4u, go4thewin, Vargo
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #153 Posted: Sat Mar 14, 2020 10:18 pm 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
A last 50 game test for the 20b net : g170e-b20c256x2-s2430231552-d525879064
KataGo 1.3.3 g170e-b20c256x2-s2430231552-d525879064 v. LZ017#268 LADDER VERSION
Katago wins 35-15 = 70% (against the "regular" LZ017#268, KG won 74% and 68%)
Twogtp 1.5.1, 1600 visits for both (~ time parity), no error, no duplicate game, all games by resignation.

Details : (KG always appears as W, because of the command -alternate)
Attachment:
k1.jpg
k1.jpg [ 299.82 KiB | Viewed 9724 times ]
Attachment:
k2.jpg
k2.jpg [ 993.34 KiB | Viewed 9724 times ]

LADDER VERSION : https://github.com/yssaya/leela-zero-ladder/releases


This post by Vargo was liked by: go4thewin
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #154 Posted: Sat Mar 14, 2020 11:29 pm 
Lives with ko

Posts: 132
Liked others: 0
Was liked: 7
lightvector wrote:
Some new nets have been uploaded:

Roughly 50 Elo gain for the 20-block extended training net, and slightly more than 100 Elo gain for the 30 and 40-block nets. Enjoy!

https://github.com/lightvector/KataGo/releases


Thx for the big nets :-)

b30c320x2
b40c256x2

From what I have seen so far, if the second number is bigger (320), then the net has a very good chance to be stronger and if the first number is bigger (40), then the net has a medium chance to be stronger.

Some ideas which could be tried:

30 and 320 vs 30 and 384
30 and 320 vs 30 and 512

40 and 256 vs 40 and 320
40 and 256 vs 40 and 384
40 and 256 vs 40 and 512

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #155 Posted: Sun Mar 15, 2020 11:32 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
50 game test between the two newest 20b nets : g170e-b20c256x2-s2430231552-d525879064 v. g170e-b20c256x2-s2971705856-d633407024
The newest , g170e-b20c256x2-s2971705856-d633407024 wins 27-23 (54%)
Twogtp 1.5.1, 1600 visits for both (~ time parity), no error, no duplicate game, all games by resignation.

Details : (s2971 always appears as W, because of the command -alternate)
Attachment:
k3.jpg
k3.jpg [ 361.42 KiB | Viewed 9662 times ]
Attachment:
k4.jpg
k4.jpg [ 889.57 KiB | Viewed 9662 times ]


This post by Vargo was liked by: go4thewin
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #156 Posted: Sun Mar 15, 2020 11:36 am 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
50 game test KG 1.3.3 g170e-b20c256x2-s2971705856-d633407024 v. LZ#270
KG wins 30-20 (60%, "only" , maybe 50 games is not enough, or maybe s2971 doesn't like #270 ;-) )
Again, Twogtp 1.5.1, 1600 visits for both (~ time parity), no error, no duplicate game, all games by resignation.

Details :
Attachment:
k1.jpg
k1.jpg [ 362.11 KiB | Viewed 9662 times ]
Attachment:
k2.jpg
k2.jpg [ 793.06 KiB | Viewed 9662 times ]


This post by Vargo was liked by: go4thewin
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #157 Posted: Sun Mar 15, 2020 1:20 pm 
Gosei
User avatar

Posts: 1464
Liked others: 212
Was liked: 215
Vargo in this case, maybe another 50?

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #158 Posted: Sun Mar 15, 2020 3:01 pm 
Lives with ko

Posts: 150
Liked others: 200
Was liked: 30
Rank: 25 kyu
katago 1.3.3 20b s297 1 thread 12 playouts vs gtp4zen7 9d setting; 4-0

Wow I can't beieve it is possibe to have a bot this strong and fast on so few playouts.


Attachments:
File comment: sgfs
sgf.zip [3.81 KiB]
Downloaded 440 times

_________________
set katago to play at your level https://docdro.id/sHZU1ti or experiment with gtp4zen ( https://rb.gy/kx2ilb )
Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #159 Posted: Mon Mar 16, 2020 12:17 pm 
Lives in gote

Posts: 337
Liked others: 22
Was liked: 97
50 more games KG 1.3.3 g170e-b20c256x2-s2971705856-d633407024 v. LZ#270
KataGo wins 37-13 = 74% (last test, it was 60%)

details : (KG always appears as W, because of the command -alternate)
Attachment:
kg1.jpg
kg1.jpg [ 320.81 KiB | Viewed 11610 times ]
Attachment:
kg2.jpg
kg2.jpg [ 829.08 KiB | Viewed 11610 times ]

Top
 Profile  
 
Offline
 Post subject: Re: KataGo V1.3
Post #160 Posted: Mon Mar 16, 2020 12:24 pm 
Gosei
User avatar

Posts: 1464
Liked others: 212
Was liked: 215
LZ will never catch up with the KataGo. Does anyone see the point in the further development of LZ? what are its advantages? different game style?

Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 216 posts ]  Go to page Previous  1 ... 5, 6, 7, 8, 9, 10, 11  Next

All times are UTC - 8 hours [ DST ]


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group