TOP ACTIVATIONS
MAX = 16.856

# 13 On September 20 2013 07 : 44 Se eker
# 14 On September 20 2013 09 : 37 Dark ho
On September 20 2013 07 : 44 Se eker
two linguistic communities ," d ' Ent rem ont states in
4 Comments Print This Post P CC
harsh glare of public life Mrs . Obama asked
Obama said with damp eyes Mrs . Obama and
't act right away ' T ori was eventually
hope , never fear ' Mrs . Obama ,
is the Governor - General .[ 200 ] The
[ V iet - C aj un ] craw fish on
ook her at one point Mrs . Obama urged
language . - Katherine d ' Ent rem ont , commissioner
favorite favorite - September 26 , 2009 Subject :
only 240 m ( 790 ft ) from the asteroid .
banned by officials in Madrid Dri ving slowly along
' OR RIGHT ( path , 5 ) = '. cl
un Bo il | C aj un Kitchen <|endoftext|> Recently ,
eb a Sidd iqu i ; Editing by Mark Potter )
life of this country ' The first lady 's

MIN ACTIVATIONS
MIN = -16.788

Arg ument Parser () ap . add _ argument (" -
we can work on piece by piece ." She
args [" images "] ): # load the image , resize
GR 2 G RAY ) # smooth the image using a
scrap ed by from deal to deal , drilling hundreds of
ED NET CALL UPDATE : 05 38 z SEM INAL MA
ED NET CALL UPDATE : 05 43 z ST OW MA
= c v 2 . get Struct uring Element ( cv
ED NET CALL UPDATE : 05 41 z BLACK FO OT
correct s trials in short or long training blocks respectively )
parse . Arg ument Parser () ap . add _ argument
said one attacker was Kh ur am Sh az ad Butt
even all the spoil thereof , shalt thou take unto thy
=" path to images directory ") args = v ars (
object system . draw ing . bit map ($ SR .
on Division Street . Block after block is filled with tents
parse . Arg ument Parser () ap . add _ argument
= v ars ( ap . parse _ args ()) #
images " ] ) : # load the image , resize
, whether granted with soft or tough conditions , appears to

INTERVAL 8.445 - 16.856
CONTAINS 0.064%

Israelis Jewish , Arab and Dru ze who came
Ruk mini Temple Here 6 . B ala Han
separated with white space or ' : ', where each part
nominated Ken y atta as its President , although the government
only 240 m ( 790 ft ) from the asteroid .

INTERVAL 0.034 - 8.445
CONTAINS 49.036%

Hz ( Tr ace Free = 80 ) 27
the temple include 1 ) Shrine for Cup id
idea is at the heart of her hiring philosophy .
two areas that hurt the Broncos in 2016 as they missed
asked Libertarian 2016 contender Gary Johnson on MSNBC this morning

INTERVAL -8.377 - 0.034
CONTAINS 50.829%

it 's not what you think it is , which I
't what anyone would expect . " I 've
in Britain before a set date . Although its
distortion , even at high volumes . It is not too
it will remove photos , videos and links with explicit content

INTERVAL -16.788 - -8.377
CONTAINS 0.071%

fossil . As soon as he popped his slide
view . Case in point : Gen . Martin
To grunt and sweat under a weary life . .
the arguments ap = arg parse . Arg ument Parser ()
NET CALL UPDATE : 05 50 z VOL UN TE ER
Token #
Token ID1303
Feature activation-7.30e+00

Pos logit contributions
シャ+2.611
アル+2.449
+2.176
+2.059
habi+2.035
Neg logit contributions
rint-3.749
agents-2.945
iment-2.921
 tamp-2.886
 Scher-2.881
Token13
Token ID1485
Feature activation+3.34e+00

Pos logit contributions
rint+4.765
agents+3.364
iment+3.356
 Scher+3.268
 tamp+3.194
Neg logit contributions
シャ-6.691
アル-6.498
-5.905
ンジ-5.745
-5.721
Token On
Token ID1550
Feature activation+4.92e+00

Pos logit contributions
シャ+2.468
アル+2.337
+2.099
+2.046
 Shepard+2.032
Neg logit contributions
rint-2.806
agents-2.148
iment-2.103
 tamp-2.042
 Scher-2.024
Token September
Token ID2693
Feature activation+3.54e-01

Pos logit contributions
シャ+2.841
アル+2.645
+2.284
+2.255
habi+2.181
Neg logit contributions
rint-4.856
agents-3.894
 tamp-3.789
iment-3.753
 Scher-3.737
Token 20
Token ID1160
Feature activation+7.70e-01

Pos logit contributions
シャ+0.298
アル+0.286
+0.261
+0.250
habi+0.248
Neg logit contributions
rint-0.268
agents-0.194
iment-0.194
 Scher-0.191
 tamp-0.190
Token 2013
Token ID2211
Feature activation+1.69e+01

Pos logit contributions
シャ+0.519
アル+0.491
+0.442
+0.419
habi+0.409
Neg logit contributions
rint-0.716
agents-0.551
iment-0.542
 tamp-0.537
 Scher-0.531
Token 07
Token ID8753
Feature activation+1.04e+00

Pos logit contributions
シャ+5.407
 Shepard+5.303
 Hubble+4.834
 Hun+4.716
 Daddy+4.658
Neg logit contributions
rint-17.302
ibaba-14.450
agents-14.334
ventures-14.245
 pse-14.202
Token:
Token ID25
Feature activation+5.58e+00

Pos logit contributions
シャ+0.818
アル+0.786
+0.716
+0.689
habi+0.676
Neg logit contributions
rint-0.838
agents-0.609
 tamp-0.601
iment-0.599
 advis-0.597
Token44
Token ID2598
Feature activation+7.48e+00

Pos logit contributions
シャ+3.781
アル+3.625
+3.218
NS+3.130
+3.108
Neg logit contributions
rint-5.060
 pse-3.902
 tamp-3.851
 advis-3.837
 fortun-3.796
Token Se
Token ID1001
Feature activation+9.63e-01

Pos logit contributions
シャ+5.493
アル+5.194
+4.625
+4.589
 Shepard+4.565
Neg logit contributions
rint-6.114
agents-4.734
iment-4.633
 tamp-4.502
 advis-4.449
Tokeneker
Token ID28233
Feature activation+6.68e+00

Pos logit contributions
シャ+0.731
アル+0.693
+0.628
habi+0.600
+0.598
Neg logit contributions
rint-0.808
agents-0.609
 Scher-0.602
 tamp-0.601
iment-0.601
Token #
Token ID1303
Feature activation-4.22e+00

Pos logit contributions
シャ+2.669
アル+2.509
+2.227
+2.110
habi+2.083
Neg logit contributions
rint-3.799
agents-2.977
iment-2.955
 tamp-2.921
 Scher-2.918
Token14
Token ID1415
Feature activation+3.38e+00

Pos logit contributions
rint+3.288
agents+2.420
iment+2.399
 Scher+2.366
 tamp+2.343
Neg logit contributions
シャ-3.466
アル-3.318
-3.020
-2.892
habi-2.863
Token On
Token ID1550
Feature activation+3.98e+00

Pos logit contributions
シャ+2.509
アル+2.381
+2.143
+2.078
 Shepard+2.061
Neg logit contributions
rint-2.829
agents-2.152
iment-2.122
 tamp-2.053
 Scher-2.040
Token September
Token ID2693
Feature activation+1.46e+00

Pos logit contributions
シャ+2.380
アル+2.225
+1.917
+1.915
habi+1.841
Neg logit contributions
rint-3.894
agents-3.097
 tamp-3.014
iment-2.999
 Scher-2.986
Token 20
Token ID1160
Feature activation+1.26e+00

Pos logit contributions
シャ+1.230
アル+1.182
+1.073
+1.032
habi+1.024
Neg logit contributions
rint-1.111
iment-0.803
agents-0.802
 Scher-0.790
 tamp-0.786
Token 2013
Token ID2211
Feature activation+1.51e+01

Pos logit contributions
シャ+0.850
アル+0.802
+0.725
+0.687
habi+0.669
Neg logit contributions
rint-1.178
agents-0.906
iment-0.889
 tamp-0.883
 Scher-0.874
Token 09
Token ID7769
Feature activation+5.58e-01

Pos logit contributions
シャ+5.081
 Shepard+4.877
 Hubble+4.479
アル+4.350
 Daddy+4.324
Neg logit contributions
rint-15.746
ibaba-13.032
agents-13.027
ventures-12.863
 pse-12.817
Token:
Token ID25
Feature activation+2.45e+00

Pos logit contributions
シャ+0.452
アル+0.434
+0.396
+0.382
habi+0.375
Neg logit contributions
rint-0.441
agents-0.319
iment-0.315
 tamp-0.314
 Scher-0.310
Token37
Token ID2718
Feature activation+7.30e+00

Pos logit contributions
シャ+1.862
アル+1.782
+1.616
+1.558
NS+1.540
Neg logit contributions
rint-2.045
 tamp-1.501
agents-1.500
iment-1.493
 advis-1.488
Token Dark
Token ID3801
Feature activation+5.90e+00

Pos logit contributions
シャ+5.425
アル+5.167
+4.654
 Shepard+4.527
+4.497
Neg logit contributions
rint-5.939
agents-4.520
iment-4.445
 tamp-4.334
 advis-4.179
Tokenho
Token ID8873
Feature activation+1.12e+00

Pos logit contributions
シャ+3.667
アル+3.513
+2.990
+2.938
nikov+2.906
Neg logit contributions
rint-5.526
 tamp-4.396
iment-4.393
agents-4.342
 fortun-4.308
Token
Token ID198
Feature activation+3.19e-01

Pos logit contributions
シャ+2.477
アル+2.331
+2.118
+2.041
NS+1.995
Neg logit contributions
rint-2.888
agents-2.155
iment-2.149
 advis-2.124
 tamp-2.117
Token
Token ID198
Feature activation-3.67e-01

Pos logit contributions
シャ+0.248
アル+0.236
+0.215
+0.207
habi+0.202
Neg logit contributions
rint-0.265
agents-0.195
iment-0.194
 tamp-0.192
 advis-0.189
TokenOn
Token ID2202
Feature activation+4.25e+00

Pos logit contributions
rint+0.266
agents+0.189
iment+0.188
 tamp+0.185
 Scher+0.185
Neg logit contributions
シャ-0.323
アル-0.309
-0.284
-0.273
habi-0.270
Token September
Token ID2693
Feature activation+1.23e+00

Pos logit contributions
シャ+2.507
アル+2.347
+2.028
+2.027
habi+1.923
Neg logit contributions
rint-4.193
agents-3.311
iment-3.256
 tamp-3.252
 Scher-3.204
Token 20
Token ID1160
Feature activation+5.94e-01

Pos logit contributions
シャ+0.999
アル+0.954
+0.871
+0.834
habi+0.825
Neg logit contributions
rint-0.963
agents-0.710
iment-0.706
 Scher-0.694
 tamp-0.693
Token 2013
Token ID2211
Feature activation+1.33e+01

Pos logit contributions
シャ+0.405
アル+0.383
+0.346
+0.328
habi+0.320
Neg logit contributions
rint-0.547
agents-0.420
iment-0.414
 tamp-0.409
 Scher-0.407
Token 07
Token ID8753
Feature activation-1.51e+00

Pos logit contributions
シャ+4.801
アル+4.279
 Shepard+4.216
+3.992
 Hubble+3.971
Neg logit contributions
rint-14.191
agents-11.659
ibaba-11.638
ventures-11.484
iment-11.386
Token:
Token ID25
Feature activation+2.49e+00

Pos logit contributions
rint+1.140
agents+0.819
iment+0.813
 tamp+0.801
 Scher+0.801
Neg logit contributions
シャ-1.283
アル-1.232
-1.126
-1.083
habi-1.071
Token44
Token ID2598
Feature activation-9.01e-01

Pos logit contributions
シャ+2.144
アル+2.055
+1.875
+1.820
NS+1.772
Neg logit contributions
rint-1.867
 tamp-1.321
agents-1.315
iment-1.304
 advis-1.288
Token Se
Token ID1001
Feature activation+6.84e+00

Pos logit contributions
rint+0.639
agents+0.451
iment+0.449
 Scher+0.441
 tamp+0.439
Neg logit contributions
シャ-0.804
アル-0.773
-0.708
-0.682
habi-0.676
Tokeneker
Token ID28233
Feature activation+1.85e+00

Pos logit contributions
シャ+2.593
アル+2.331
+2.031
habi+1.885
nikov+1.679
Neg logit contributions
rint-7.888
 tamp-6.744
 fortun-6.701
 advis-6.609
 Scher-6.564
Token two
Token ID734
Feature activation-8.52e-01

Pos logit contributions
rint+0.522
agents+0.395
iment+0.394
 Scher+0.389
 tamp+0.387
Neg logit contributions
シャ-0.442
アル-0.423
-0.380
-0.362
habi-0.357
Token linguistic
Token ID29929
Feature activation-3.36e+00

Pos logit contributions
rint+0.697
agents+0.519
iment+0.518
 Scher+0.511
 tamp+0.509
Neg logit contributions
シャ-0.667
アル-0.638
-0.576
-0.551
habi-0.546
Token communities
Token ID5348
Feature activation-7.85e-01

Pos logit contributions
rint+2.703
agents+2.025
iment+2.008
 tamp+1.988
 Scher+1.987
Neg logit contributions
シャ-2.670
アル-2.539
-2.296
-2.204
habi-2.194
Token,"
Token ID553
Feature activation+2.16e+00

Pos logit contributions
rint+0.634
agents+0.470
iment+0.469
 Scher+0.462
 tamp+0.460
Neg logit contributions
シャ-0.623
アル-0.596
-0.539
-0.516
habi-0.511
Token d
Token ID288
Feature activation+1.79e+00

Pos logit contributions
シャ+1.535
アル+1.472
+1.303
+1.249
habi+1.226
Neg logit contributions
rint-1.918
iment-1.457
agents-1.456
 Scher-1.423
 tamp-1.416
Token'
Token ID6
Feature activation+1.27e+01

Pos logit contributions
シャ+1.074
アル+1.012
+0.875
habi+0.855
+0.842
Neg logit contributions
rint-1.751
agents-1.385
 tamp-1.374
 Scher-1.358
iment-1.344
TokenEnt
Token ID14539
Feature activation+9.01e-01

Pos logit contributions
シャ+4.544
アル+4.497
HK+3.391
+3.309
NS+3.292
Neg logit contributions
rint-14.269
 tamp-11.855
 pse-11.825
 fortun-11.658
 advis-11.623
Tokenrem
Token ID2787
Feature activation+1.64e+00

Pos logit contributions
シャ+0.644
アル+0.620
+0.548
habi+0.544
+0.534
Neg logit contributions
rint-0.770
 tamp-0.589
 Scher-0.585
agents-0.581
iment-0.578
Tokenont
Token ID756
Feature activation-7.00e-01

Pos logit contributions
シャ+1.010
アル+0.957
+0.834
habi+0.797
nikov+0.786
Neg logit contributions
rint-1.575
agents-1.247
 Scher-1.239
 tamp-1.237
 advis-1.211
Token states
Token ID2585
Feature activation+8.29e-01

Pos logit contributions
rint+0.548
agents+0.402
iment+0.401
 Scher+0.395
 tamp+0.393
Neg logit contributions
シャ-0.572
アル-0.549
-0.498
-0.478
habi-0.473
Token in
Token ID287
Feature activation+7.61e-01

Pos logit contributions
シャ+0.632
アル+0.605
+0.546
+0.520
habi+0.518
Neg logit contributions
rint-0.691
agents-0.518
iment-0.516
 Scher-0.508
 tamp-0.507
Token
Token ID198
Feature activation+3.64e-01

Pos logit contributions
シャ+0.104
アル+0.099
+0.090
+0.086
habi+0.084
Neg logit contributions
rint-0.113
agents-0.085
iment-0.085
 Scher-0.083
 tamp-0.083
Token
Token ID198
Feature activation+3.48e+00

Pos logit contributions
シャ+0.287
アル+0.274
+0.249
+0.239
habi+0.235
Neg logit contributions
rint-0.296
agents-0.218
iment-0.216
 tamp-0.215
 Scher-0.213
Token4
Token ID19
Feature activation+1.23e+00

Pos logit contributions
シャ+2.593
アル+2.472
+2.224
+2.133
NS+2.105
Neg logit contributions
rint-2.939
 tamp-2.212
iment-2.200
agents-2.179
 Scher-2.178
Token Comments
Token ID19502
Feature activation+1.47e+00

Pos logit contributions
シャ+0.939
アル+0.893
+0.813
+0.770
habi+0.760
Neg logit contributions
rint-1.031
agents-0.770
iment-0.770
 Scher-0.759
 tamp-0.754
Token Print
Token ID12578
Feature activation+4.13e+00

Pos logit contributions
シャ+1.122
アル+1.075
+0.968
+0.933
habi+0.922
Neg logit contributions
rint-1.210
agents-0.907
iment-0.900
 tamp-0.882
 Scher-0.879
Token This
Token ID770
Feature activation+1.21e+01

Pos logit contributions
シャ+2.905
アル+2.781
+2.508
+2.387
habi+2.334
Neg logit contributions
rint-3.580
agents-2.755
iment-2.706
 tamp-2.691
 Scher-2.666
Token Post
Token ID2947
Feature activation+1.72e+00

Pos logit contributions
シャ+7.159
アル+6.939
 Shepard+6.111
+6.106
 Nikon+6.066
Neg logit contributions
rint-10.339
agents-8.269
luaj-7.795
paralle-7.742
iment-7.678
Token
Token ID198
Feature activation+3.17e+00

Pos logit contributions
シャ+1.307
アル+1.254
+1.141
+1.085
habi+1.068
Neg logit contributions
rint-1.421
agents-1.072
iment-1.062
 tamp-1.043
 Scher-1.040
Token
Token ID198
Feature activation+6.38e+00

Pos logit contributions
シャ+2.261
アル+2.148
+1.942
+1.861
NS+1.846
Neg logit contributions
rint-2.787
 pse-2.097
 fortun-2.081
agents-2.069
 tamp-2.066
TokenP
Token ID47
Feature activation-1.50e+00

Pos logit contributions
シャ+4.626
アル+4.441
+3.953
NS+3.858
+3.827
Neg logit contributions
rint-5.326
 tamp-4.099
iment-4.029
 Scher-3.985
 advis-3.964
TokenCC
Token ID4093
Feature activation+1.17e+00

Pos logit contributions
rint+1.039
iment+0.727
agents+0.715
 Scher+0.703
 tamp+0.697
Neg logit contributions
シャ-1.378
アル-1.328
-1.218
-1.178
habi-1.165
Token harsh
Token ID11859
Feature activation+1.14e+00

Pos logit contributions
rint+1.745
agents+1.357
iment+1.356
 Scher+1.342
 tamp+1.336
Neg logit contributions
シャ-1.229
アル-1.166
-1.032
-0.978
habi-0.966
Token glare
Token ID36785
Feature activation+5.34e+00

Pos logit contributions
シャ+0.887
アル+0.844
+0.758
habi+0.734
+0.733
Neg logit contributions
rint-0.945
agents-0.702
iment-0.688
 Scher-0.677
 tamp-0.675
Token of
Token ID286
Feature activation-1.26e+00

Pos logit contributions
シャ+4.000
アル+3.789
+3.390
+3.265
ンジ+3.226
Neg logit contributions
rint-4.491
iment-3.333
agents-3.333
 Scher-3.269
 tamp-3.244
Token public
Token ID1171
Feature activation-3.74e+00

Pos logit contributions
rint+1.042
agents+0.779
iment+0.777
 Scher+0.763
 tamp+0.759
Neg logit contributions
シャ-0.978
アル-0.934
-0.841
-0.806
habi-0.798
Token life
Token ID1204
Feature activation-7.41e-01

Pos logit contributions
rint+3.287
iment+2.512
agents+2.512
 Scher+2.479
 tamp+2.478
Neg logit contributions
シャ-2.687
アル-2.562
-2.295
-2.188
habi-2.165
Token
Token ID198
Feature activation+1.20e+01

Pos logit contributions
rint+0.610
agents+0.455
iment+0.454
 Scher+0.447
 tamp+0.445
Neg logit contributions
シャ-0.577
アル-0.552
-0.498
-0.476
habi-0.472
Token
Token ID198
Feature activation+5.78e+00

Pos logit contributions
シャ+5.975
アル+5.607
NS+5.064
 Shepard+5.055
+4.903
Neg logit contributions
rint-11.547
 pse-10.155
 fortun-9.873
 ende-9.447
 therap-9.354
TokenMrs
Token ID27034
Feature activation+1.39e+00

Pos logit contributions
シャ+3.721
アル+3.533
+3.082
NS+3.004
+2.986
Neg logit contributions
rint-5.456
 tamp-4.193
iment-4.141
 Scher-4.095
agents-4.094
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
シャ+0.975
アル+0.930
+0.832
+0.790
 Shepard+0.783
Neg logit contributions
rint-1.246
agents-0.953
iment-0.933
 tamp-0.927
 advis-0.915
Token Obama
Token ID2486
Feature activation+1.16e+00

Pos logit contributions
シャ+0.876
アル+0.822
+0.712
habi+0.668
+0.668
Neg logit contributions
rint-1.500
agents-1.191
iment-1.176
 tamp-1.162
 Scher-1.155
Token asked
Token ID1965
Feature activation+8.79e-01

Pos logit contributions
シャ+0.882
アル+0.842
+0.755
+0.726
habi+0.724
Neg logit contributions
rint-0.978
agents-0.738
iment-0.728
 Scher-0.710
 tamp-0.704
Token Obama
Token ID2486
Feature activation+1.33e+00

Pos logit contributions
シャ+1.487
アル+1.383
+1.181
 Shepard+1.150
+1.106
Neg logit contributions
rint-2.907
agents-2.328
iment-2.270
 tamp-2.251
 advis-2.249
Token said
Token ID531
Feature activation+3.45e+00

Pos logit contributions
シャ+1.124
アル+1.081
+0.982
+0.944
habi+0.940
Neg logit contributions
rint-1.003
agents-0.729
iment-0.719
 Scher-0.705
 tamp-0.702
Token with
Token ID351
Feature activation+2.32e+00

Pos logit contributions
シャ+2.527
アル+2.415
+2.163
habi+2.047
+2.028
Neg logit contributions
rint-2.964
agents-2.236
iment-2.219
 tamp-2.173
 Scher-2.155
Token damp
Token ID21151
Feature activation-3.06e+00

Pos logit contributions
シャ+1.742
アル+1.677
+1.490
+1.430
 Shepard+1.413
Neg logit contributions
rint-1.956
agents-1.473
iment-1.445
 Scher-1.415
 tamp-1.403
Token eyes
Token ID2951
Feature activation+4.79e+00

Pos logit contributions
rint+2.516
iment+1.888
 tamp+1.870
agents+1.868
 Scher+1.862
Neg logit contributions
シャ-2.367
アル-2.262
-2.046
-1.953
habi-1.938
Token
Token ID198
Feature activation+1.19e+01

Pos logit contributions
シャ+3.419
アル+3.268
+2.945
+2.773
 Shepard+2.755
Neg logit contributions
rint-4.144
agents-3.142
iment-3.112
 Scher-3.035
 tamp-2.988
Token
Token ID198
Feature activation+5.25e+00

Pos logit contributions
シャ+5.935
アル+5.542
NS+5.027
 Shepard+4.991
+4.862
Neg logit contributions
rint-11.518
 pse-10.215
 fortun-9.894
 ende-9.496
 therap-9.417
TokenMrs
Token ID27034
Feature activation+1.29e+00

Pos logit contributions
シャ+3.413
アル+3.239
+2.825
NS+2.725
+2.719
Neg logit contributions
rint-4.944
 tamp-3.787
iment-3.748
agents-3.722
 Scher-3.708
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
シャ+0.908
アル+0.866
+0.775
+0.736
 Shepard+0.729
Neg logit contributions
rint-1.159
agents-0.887
iment-0.868
 tamp-0.861
 advis-0.853
Token Obama
Token ID2486
Feature activation+6.23e-02

Pos logit contributions
シャ+0.808
アル+0.757
+0.657
habi+0.615
+0.615
Neg logit contributions
rint-1.382
agents-1.100
iment-1.083
 tamp-1.068
 Scher-1.064
Token and
Token ID290
Feature activation+1.69e+00

Pos logit contributions
シャ+0.048
アル+0.046
+0.041
+0.040
habi+0.040
Neg logit contributions
rint-0.051
agents-0.039
iment-0.038
 Scher-0.037
 tamp-0.037
Token't
Token ID470
Feature activation+1.07e+00

Pos logit contributions
シャ+1.432
アル+1.369
+1.239
+1.206
nikov+1.169
Neg logit contributions
rint-1.542
agents-1.151
 advis-1.130
 fortun-1.130
 tamp-1.128
Token act
Token ID719
Feature activation+1.30e+00

Pos logit contributions
シャ+0.772
アル+0.734
+0.659
habi+0.637
+0.634
Neg logit contributions
rint-0.919
agents-0.707
iment-0.689
 Scher-0.682
 tamp-0.660
Token right
Token ID826
Feature activation+5.49e+00

Pos logit contributions
シャ+0.908
アル+0.863
+0.772
habi+0.739
+0.735
Neg logit contributions
rint-1.159
agents-0.884
iment-0.871
 Scher-0.867
 tamp-0.852
Token away
Token ID1497
Feature activation+6.07e+00

Pos logit contributions
シャ+3.447
アル+3.214
+2.883
habi+2.805
 Shepard+2.737
Neg logit contributions
rint-5.115
agents-3.978
iment-3.900
 Scher-3.864
 tamp-3.842
Token'
Token ID6
Feature activation+1.89e+00

Pos logit contributions
シャ+3.470
アル+3.309
+2.858
habi+2.731
 Shepard+2.703
Neg logit contributions
rint-5.941
agents-4.656
iment-4.632
 Scher-4.598
 tamp-4.573
Token
Token ID198
Feature activation+1.18e+01

Pos logit contributions
シャ+1.485
アル+1.419
+1.284
+1.231
habi+1.219
Neg logit contributions
rint-1.536
iment-1.132
agents-1.125
 Scher-1.111
 tamp-1.097
Token
Token ID198
Feature activation+3.31e+00

Pos logit contributions
シャ+5.924
アル+5.538
NS+5.077
 Shepard+4.918
+4.849
Neg logit contributions
rint-11.424
 pse-9.982
 fortun-9.689
 ende-9.309
 therap-9.269
TokenT
Token ID51
Feature activation+2.39e+00

Pos logit contributions
シャ+2.336
アル+2.222
+1.980
+1.923
NS+1.880
Neg logit contributions
rint-2.987
iment-2.250
 tamp-2.245
 Scher-2.230
agents-2.209
Tokenori
Token ID10145
Feature activation-1.70e+00

Pos logit contributions
シャ+1.761
アル+1.677
+1.481
habi+1.421
+1.420
Neg logit contributions
rint-2.057
agents-1.571
 Scher-1.549
iment-1.548
 tamp-1.540
Token was
Token ID373
Feature activation-5.24e-01

Pos logit contributions
rint+1.395
agents+1.040
iment+1.038
 Scher+1.024
 tamp+1.020
Neg logit contributions
シャ-1.323
アル-1.265
-1.142
-1.093
habi-1.082
Token eventually
Token ID4191
Feature activation-1.65e+00

Pos logit contributions
rint+0.476
agents+0.366
iment+0.364
 Scher+0.359
 tamp+0.357
Neg logit contributions
シャ-0.362
アル-0.344
-0.306
-0.292
habi-0.290
Token hope
Token ID2911
Feature activation+3.58e-01

Pos logit contributions
シャ+0.697
アル+0.663
+0.597
habi+0.568
+0.567
Neg logit contributions
rint-0.817
agents-0.616
iment-0.610
 Scher-0.597
 tamp-0.593
Token,
Token ID11
Feature activation-4.47e+00

Pos logit contributions
シャ+0.266
アル+0.254
+0.229
+0.218
habi+0.216
Neg logit contributions
rint-0.306
agents-0.231
iment-0.230
 Scher-0.227
 tamp-0.226
Token never
Token ID1239
Feature activation-1.39e+00

Pos logit contributions
rint+3.730
iment+2.838
agents+2.811
 tamp+2.787
 Scher+2.779
Neg logit contributions
シャ-3.389
アル-3.233
-2.920
-2.804
habi-2.752
Token fear
Token ID3252
Feature activation-4.32e-01

Pos logit contributions
rint+1.207
agents+0.915
iment+0.912
 Scher+0.902
 tamp+0.896
Neg logit contributions
シャ-1.018
アル-0.971
-0.870
-0.828
habi-0.823
Token'
Token ID6
Feature activation+1.79e-01

Pos logit contributions
rint+0.355
agents+0.264
iment+0.263
 Scher+0.260
 tamp+0.259
Neg logit contributions
シャ-0.337
アル-0.322
-0.291
-0.278
habi-0.276
Token
Token ID198
Feature activation+1.17e+01

Pos logit contributions
シャ+0.142
アル+0.136
+0.123
+0.118
habi+0.117
Neg logit contributions
rint-0.143
agents-0.106
iment-0.105
 Scher-0.104
 tamp-0.103
Token
Token ID198
Feature activation+5.07e+00

Pos logit contributions
シャ+5.957
アル+5.648
 Shepard+5.092
NS+5.084
+4.942
Neg logit contributions
rint-11.278
 pse-9.867
 fortun-9.549
 ende-9.183
 therap-9.053
TokenMrs
Token ID27034
Feature activation+1.11e+00

Pos logit contributions
シャ+3.334
アル+3.169
+2.771
NS+2.683
+2.670
Neg logit contributions
rint-4.751
 tamp-3.641
iment-3.600
agents-3.572
 advis-3.564
Token.
Token ID13
Feature activation+1.74e+00

Pos logit contributions
シャ+0.786
アル+0.750
+0.672
+0.637
 Shepard+0.630
Neg logit contributions
rint-0.985
agents-0.753
iment-0.739
 tamp-0.732
 advis-0.722
Token Obama
Token ID2486
Feature activation-8.64e-01

Pos logit contributions
シャ+1.051
アル+0.990
+0.862
+0.809
habi+0.806
Neg logit contributions
rint-1.736
agents-1.374
iment-1.362
 tamp-1.346
 Scher-1.345
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
rint+0.707
agents+0.527
iment+0.524
 Scher+0.516
 tamp+0.513
Neg logit contributions
シャ-0.676
アル-0.647
-0.584
-0.560
habi-0.556
Token is
Token ID318
Feature activation-4.77e-01

Pos logit contributions
rint+1.999
agents+1.499
iment+1.497
 Scher+1.480
 tamp+1.471
Neg logit contributions
シャ-1.876
アル-1.789
-1.611
-1.549
habi-1.533
Token the
Token ID262
Feature activation-5.68e-01

Pos logit contributions
rint+0.395
agents+0.294
iment+0.293
 tamp+0.287
 Scher+0.287
Neg logit contributions
シャ-0.369
アル-0.355
-0.320
-0.304
habi-0.300
Token Governor
Token ID10807
Feature activation+1.10e+00

Pos logit contributions
rint+0.492
agents+0.373
iment+0.371
 Scher+0.362
 tamp+0.362
Neg logit contributions
シャ-0.416
アル-0.398
-0.359
-0.340
habi-0.335
Token-
Token ID12
Feature activation-4.90e+00

Pos logit contributions
シャ+0.825
アル+0.785
+0.709
+0.676
habi+0.669
Neg logit contributions
rint-0.941
agents-0.709
iment-0.705
 Scher-0.696
 tamp-0.692
TokenGeneral
Token ID12218
Feature activation-1.39e-01

Pos logit contributions
rint+4.245
agents+3.290
iment+3.264
 Scher+3.207
 tamp+3.179
Neg logit contributions
シャ-3.514
アル-3.343
-2.982
-2.841
habi-2.835
Token.[
Token ID3693
Feature activation+1.15e+01

Pos logit contributions
rint+0.115
agents+0.086
iment+0.086
 Scher+0.085
 tamp+0.084
Neg logit contributions
シャ-0.107
アル-0.102
-0.092
-0.088
habi-0.087
Token200
Token ID2167
Feature activation+2.67e+00

Pos logit contributions
シャ+6.815
アル+6.391
+5.595
NS+5.586
+5.443
Neg logit contributions
rint-10.046
 Scher-8.268
 tamp-8.151
 advis-7.989
agents-7.859
Token]
Token ID60
Feature activation+1.72e+00

Pos logit contributions
シャ+1.882
アル+1.789
+1.596
+1.520
habi+1.502
Neg logit contributions
rint-2.390
agents-1.836
iment-1.829
 Scher-1.809
 tamp-1.799
Token
Token ID198
Feature activation-4.91e-01

Pos logit contributions
シャ+1.299
アル+1.251
+1.141
+1.076
 Shepard+1.064
Neg logit contributions
rint-1.456
agents-1.071
iment-1.071
 tamp-1.047
 advis-1.032
Token
Token ID198
Feature activation+2.30e+00

Pos logit contributions
rint+0.397
agents+0.291
iment+0.289
 tamp+0.286
 Scher+0.284
Neg logit contributions
シャ-0.391
アル-0.373
-0.339
-0.325
habi-0.321
TokenThe
Token ID464
Feature activation+1.74e+00

Pos logit contributions
シャ+1.667
アル+1.607
+1.445
+1.377
NS+1.373
Neg logit contributions
rint-2.025
 tamp-1.528
iment-1.513
 Scher-1.512
agents-1.496
Token [
Token ID685
Feature activation+1.57e+00

Pos logit contributions
rint+2.715
iment+2.063
 tamp+2.023
 Scher+2.006
agents+2.001
Neg logit contributions
シャ-2.635
アル-2.535
-2.270
-2.173
habi-2.147
TokenV
Token ID53
Feature activation+1.25e+00

Pos logit contributions
シャ+1.179
アル+1.128
+1.013
+0.974
habi+0.965
Neg logit contributions
rint-1.322
iment-0.991
 tamp-0.987
 Scher-0.984
agents-0.977
Tokeniet
Token ID1155
Feature activation-2.23e-02

Pos logit contributions
シャ+0.838
アル+0.792
+0.699
habi+0.668
+0.666
Neg logit contributions
rint-1.156
agents-0.894
 Scher-0.886
iment-0.882
 tamp-0.880
Token-
Token ID12
Feature activation+2.12e+00

Pos logit contributions
rint+0.019
iment+0.014
agents+0.014
 Scher+0.014
 tamp+0.014
Neg logit contributions
シャ-0.017
アル-0.016
-0.015
-0.014
habi-0.014
TokenC
Token ID34
Feature activation-4.47e-01

Pos logit contributions
シャ+1.409
アル+1.337
+1.184
+1.123
habi+1.110
Neg logit contributions
rint-1.985
agents-1.540
iment-1.538
 Scher-1.522
 tamp-1.517
Tokenaj
Token ID1228
Feature activation+1.14e+01

Pos logit contributions
rint+0.333
agents+0.240
iment+0.238
 Scher+0.236
 tamp+0.236
Neg logit contributions
シャ-0.381
アル-0.367
-0.332
-0.321
habi-0.320
Tokenun
Token ID403
Feature activation+1.00e+00

Pos logit contributions
シャ+5.684
アル+5.007
habi+4.935
+4.640
nikov+4.607
Neg logit contributions
rint-10.586
 pse-9.342
 fortun-9.119
 tamp-8.961
 Scher-8.895
Token]
Token ID60
Feature activation-2.03e+00

Pos logit contributions
シャ+0.706
アル+0.672
+0.601
+0.572
habi+0.565
Neg logit contributions
rint-0.897
agents-0.687
iment-0.684
 Scher-0.677
 tamp-0.674
Token craw
Token ID27784
Feature activation-1.73e+00

Pos logit contributions
rint+1.630
iment+1.211
agents+1.208
 Scher+1.192
 tamp+1.191
Neg logit contributions
シャ-1.611
アル-1.549
-1.394
-1.337
habi-1.321
Tokenfish
Token ID11084
Feature activation-2.62e+00

Pos logit contributions
rint+1.446
agents+1.086
iment+1.083
 Scher+1.069
 tamp+1.065
Neg logit contributions
シャ-1.319
アル-1.260
-1.135
-1.085
habi-1.075
Token on
Token ID319
Feature activation+3.96e+00

Pos logit contributions
rint+2.048
iment+1.500
agents+1.489
 Scher+1.471
 tamp+1.471
Neg logit contributions
シャ-2.147
アル-2.063
-1.859
-1.791
habi-1.767
Tokenook
Token ID566
Feature activation-6.46e-01

Pos logit contributions
rint+1.201
agents+0.849
iment+0.844
 Scher+0.832
 tamp+0.826
Neg logit contributions
シャ-1.523
アル-1.454
-1.337
-1.288
habi-1.274
Token her
Token ID607
Feature activation+1.70e+00

Pos logit contributions
rint+0.559
agents+0.423
iment+0.421
 Scher+0.413
 tamp+0.412
Neg logit contributions
シャ-0.476
アル-0.454
-0.407
-0.388
habi-0.384
Token at
Token ID379
Feature activation+9.99e-01

Pos logit contributions
シャ+1.250
アル+1.192
+1.080
+1.018
 Shepard+1.017
Neg logit contributions
rint-1.455
agents-1.100
iment-1.079
 Scher-1.049
 tamp-1.046
Token one
Token ID530
Feature activation+1.86e+00

Pos logit contributions
シャ+0.764
アル+0.726
+0.653
+0.619
habi+0.616
Neg logit contributions
rint-0.836
agents-0.628
iment-0.626
 tamp-0.607
 Scher-0.606
Token point
Token ID966
Feature activation+4.67e-01

Pos logit contributions
シャ+1.575
アル+1.496
+1.375
+1.317
habi+1.295
Neg logit contributions
rint-1.400
agents-1.019
iment-0.983
 Scher-0.966
 tamp-0.960
Token
Token ID198
Feature activation+1.14e+01

Pos logit contributions
シャ+0.358
アル+0.342
+0.309
+0.295
habi+0.292
Neg logit contributions
rint-0.389
agents-0.292
iment-0.291
 Scher-0.285
 tamp-0.284
Token
Token ID198
Feature activation+5.12e+00

Pos logit contributions
シャ+6.041
アル+5.699
 Shepard+5.121
NS+5.104
+4.991
Neg logit contributions
rint-10.821
 pse-9.243
 fortun-9.041
 ende-8.624
 advis-8.507
TokenMrs
Token ID27034
Feature activation+1.94e+00

Pos logit contributions
シャ+3.402
アル+3.242
+2.844
NS+2.753
+2.744
Neg logit contributions
rint-4.750
 tamp-3.623
iment-3.592
agents-3.580
 advis-3.551
Token.
Token ID13
Feature activation+1.57e+00

Pos logit contributions
シャ+1.397
アル+1.330
+1.191
+1.135
habi+1.123
Neg logit contributions
rint-1.703
agents-1.298
iment-1.297
 Scher-1.280
 tamp-1.275
Token Obama
Token ID2486
Feature activation+1.38e+00

Pos logit contributions
シャ+0.969
アル+0.914
+0.801
+0.752
habi+0.747
Neg logit contributions
rint-1.540
agents-1.214
iment-1.206
 Scher-1.192
 tamp-1.191
Token urged
Token ID11643
Feature activation+2.73e-02

Pos logit contributions
シャ+1.041
アル+0.995
+0.890
+0.855
habi+0.853
Neg logit contributions
rint-1.162
agents-0.878
iment-0.866
 Scher-0.843
 tamp-0.837
Token language
Token ID3303
Feature activation-1.25e+00

Pos logit contributions
rint+0.578
agents+0.415
iment+0.410
 Scher+0.405
 tamp+0.403
Neg logit contributions
シャ-0.673
アル-0.647
-0.591
-0.570
habi-0.565
Token.
Token ID13
Feature activation+5.02e+00

Pos logit contributions
rint+0.971
agents+0.716
iment+0.713
 Scher+0.706
 tamp+0.699
Neg logit contributions
シャ-1.032
アル-0.979
-0.885
-0.861
habi-0.852
Token -
Token ID532
Feature activation+2.09e+00

Pos logit contributions
シャ+3.386
アル+3.332
+2.925
+2.784
 Shepard+2.755
Neg logit contributions
rint-4.469
agents-3.472
iment-3.393
 Scher-3.320
 tamp-3.303
Token Katherine
Token ID32719
Feature activation-2.65e+00

Pos logit contributions
シャ+1.362
アル+1.307
+1.153
+1.086
habi+1.069
Neg logit contributions
rint-1.963
agents-1.532
iment-1.527
 Scher-1.506
 tamp-1.495
Token d
Token ID288
Feature activation+3.12e+00

Pos logit contributions
rint+2.225
 tamp+1.674
iment+1.667
agents+1.667
 Scher+1.667
Neg logit contributions
シャ-2.017
アル-1.895
-1.729
-1.651
habi-1.631
Token'
Token ID6
Feature activation+1.13e+01

Pos logit contributions
シャ+1.794
アル+1.700
+1.456
+1.417
habi+1.411
Neg logit contributions
rint-3.121
agents-2.473
 tamp-2.442
 advis-2.400
 fortun-2.395
TokenEnt
Token ID14539
Feature activation+1.05e+00

Pos logit contributions
アル+4.209
シャ+4.198
+3.149
+3.059
NS+3.028
Neg logit contributions
rint-12.787
 tamp-10.568
 pse-10.558
 fortun-10.463
 advis-10.444
Tokenrem
Token ID2787
Feature activation+1.99e+00

Pos logit contributions
シャ+0.756
アル+0.728
+0.642
habi+0.635
+0.625
Neg logit contributions
rint-0.898
 tamp-0.686
 Scher-0.680
agents-0.678
iment-0.672
Tokenont
Token ID756
Feature activation-2.25e+00

Pos logit contributions
シャ+1.208
アル+1.152
+0.994
habi+0.953
+0.937
Neg logit contributions
rint-1.916
agents-1.519
 tamp-1.509
 Scher-1.508
 advis-1.483
Token,
Token ID11
Feature activation-1.15e-01

Pos logit contributions
rint+1.727
agents+1.260
iment+1.259
 Scher+1.246
 tamp+1.239
Neg logit contributions
シャ-1.881
アル-1.791
-1.629
-1.576
habi-1.550
Token commissioner
Token ID16248
Feature activation+7.85e-01

Pos logit contributions
rint+0.094
agents+0.070
iment+0.069
 Scher+0.068
 tamp+0.068
Neg logit contributions
シャ-0.090
アル-0.086
-0.078
-0.074
habi-0.074
Token favorite
Token ID4004
Feature activation+5.91e+00

Pos logit contributions
シャ+2.040
アル+1.863
+1.607
habi+1.462
+1.460
Neg logit contributions
rint-4.506
agents-3.713
iment-3.606
 Scher-3.560
 tamp-3.543
Token favorite
Token ID4004
Feature activation+8.30e+00

Pos logit contributions
シャ+3.020
アル+2.791
+2.424
+2.183
habi+2.180
Neg logit contributions
rint-6.299
agents-5.171
iment-5.003
 Scher-4.981
 tamp-4.954
Token -
Token ID532
Feature activation+5.67e+00

Pos logit contributions
シャ+4.409
アル+4.143
+3.633
+3.264
ンジ+3.256
Neg logit contributions
rint-8.511
agents-6.928
iment-6.703
 Scher-6.678
 tamp-6.654
Token September
Token ID2693
Feature activation-6.64e-01

Pos logit contributions
シャ+3.304
アル+3.012
+2.694
 Shepard+2.617
+2.569
Neg logit contributions
rint-5.669
agents-4.508
iment-4.391
inion-4.276
 advis-4.260
Token 26
Token ID2608
Feature activation+8.75e-01

Pos logit contributions
rint+0.521
agents+0.381
iment+0.379
 tamp+0.372
 Scher+0.371
Neg logit contributions
シャ-0.543
アル-0.520
-0.474
-0.455
habi-0.449
Token,
Token ID11
Feature activation+1.13e+01

Pos logit contributions
シャ+0.672
アル+0.642
+0.585
+0.559
habi+0.549
Neg logit contributions
rint-0.728
agents-0.534
 tamp-0.532
iment-0.531
 Scher-0.522
Token 2009
Token ID3717
Feature activation+8.23e+00

Pos logit contributions
シャ+5.413
アル+4.737
+4.113
NS+3.916
ンジ+3.845
Neg logit contributions
rint-11.900
 tamp-9.624
agents-9.526
ventures-9.467
Token-9.390
Token
Token ID198
Feature activation+8.09e+00

Pos logit contributions
シャ+4.904
アル+4.658
+4.214
NS+4.070
 Shepard+4.034
Neg logit contributions
rint-7.695
 pse-6.179
 fortun-6.110
 tamp-5.855
ibaba-5.822
Token
Token ID198
Feature activation+6.12e+00

Pos logit contributions
シャ+4.714
アル+4.356
+3.966
NS+3.896
 Shepard+3.841
Neg logit contributions
rint-7.749
 pse-6.468
 fortun-6.329
 ende-5.994
 advis-5.980
TokenSubject
Token ID19776
Feature activation+3.86e+00

Pos logit contributions
シャ+2.641
アル+2.276
+1.890
NS+1.802
+1.642
Neg logit contributions
rint-7.323
 pse-6.039
 tamp-5.991
 fortun-5.948
 Scher-5.887
Token:
Token ID25
Feature activation+4.55e+00

Pos logit contributions
シャ+2.851
アル+2.704
+2.502
+2.390
NS+2.339
Neg logit contributions
rint-3.255
 fortun-2.429
 advis-2.405
agents-2.404
 tamp-2.387
Token only
Token ID691
Feature activation+3.36e-01

Pos logit contributions
シャ+0.378
アル+0.362
+0.327
+0.312
habi+0.310
Neg logit contributions
rint-0.395
iment-0.294
agents-0.294
 tamp-0.285
 Scher-0.285
Token 240
Token ID14956
Feature activation-1.50e-02

Pos logit contributions
シャ+0.265
アル+0.254
+0.229
+0.220
habi+0.218
Neg logit contributions
rint-0.272
agents-0.202
iment-0.201
 tamp-0.196
 Scher-0.196
Token m
Token ID285
Feature activation+6.11e+00

Pos logit contributions
rint+0.014
agents+0.011
iment+0.011
 Scher+0.011
 tamp+0.011
Neg logit contributions
シャ-0.010
アル-0.009
-0.008
-0.008
habi-0.008
Token (
Token ID357
Feature activation-1.37e-01

Pos logit contributions
シャ+4.319
アル+4.025
+3.692
habi+3.590
+3.539
Neg logit contributions
rint-5.217
agents-4.061
iment-3.964
 tamp-3.889
 Scher-3.886
Token790
Token ID37750
Feature activation+2.46e+00

Pos logit contributions
rint+0.108
agents+0.079
iment+0.079
 tamp+0.078
 Scher+0.078
Neg logit contributions
シャ-0.111
アル-0.106
-0.097
-0.093
habi-0.092
Token ft
Token ID10117
Feature activation+1.12e+01

Pos logit contributions
シャ+1.754
アル+1.667
+1.504
+1.424
habi+1.417
Neg logit contributions
rint-2.156
agents-1.663
iment-1.642
 Scher-1.630
 tamp-1.579
Token)
Token ID8
Feature activation+4.42e+00

Pos logit contributions
シャ+7.161
アル+6.672
+6.240
+6.035
 Shepard+5.913
Neg logit contributions
rint-9.419
agents-7.503
iment-7.240
 Scher-7.095
ertodd-7.002
Token from
Token ID422
Feature activation+1.98e+00

Pos logit contributions
シャ+3.610
アル+3.425
+3.137
+3.044
habi+3.040
Neg logit contributions
rint-3.333
agents-2.484
iment-2.417
 Scher-2.344
 tamp-2.315
Token the
Token ID262
Feature activation+2.92e+00

Pos logit contributions
シャ+1.463
アル+1.388
+1.254
+1.192
 Shepard+1.177
Neg logit contributions
rint-1.703
agents-1.291
iment-1.278
 tamp-1.249
 Scher-1.232
Token asteroid
Token ID27460
Feature activation+8.48e-01

Pos logit contributions
シャ+2.290
アル+2.161
+1.983
+1.892
 Shepard+1.883
Neg logit contributions
rint-2.350
agents-1.757
iment-1.724
 tamp-1.680
 Scher-1.662
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
シャ+0.644
アル+0.612
+0.551
+0.529
habi+0.523
Neg logit contributions
rint-0.713
agents-0.535
iment-0.526
 tamp-0.522
 Scher-0.516
Token banned
Token ID9301
Feature activation+1.08e-01

Pos logit contributions
rint+3.281
iment+2.508
 tamp+2.482
agents+2.478
 Scher+2.460
Neg logit contributions
シャ-2.882
アル-2.740
-2.467
-2.340
habi-2.317
Token by
Token ID416
Feature activation-1.76e+00

Pos logit contributions
シャ+0.079
アル+0.076
+0.068
+0.065
habi+0.064
Neg logit contributions
rint-0.094
agents-0.072
iment-0.071
 Scher-0.071
 tamp-0.070
Token officials
Token ID2828
Feature activation-2.21e+00

Pos logit contributions
rint+1.214
agents+0.862
iment+0.850
 Scher+0.836
 tamp+0.834
Neg logit contributions
シャ-1.598
アル-1.530
-1.405
-1.351
habi-1.345
Token in
Token ID287
Feature activation+3.40e+00

Pos logit contributions
rint+1.595
agents+1.151
iment+1.144
 Scher+1.120
 tamp+1.110
Neg logit contributions
シャ-1.947
アル-1.860
-1.702
-1.643
habi-1.620
Token Madrid
Token ID14708
Feature activation+3.04e+00

Pos logit contributions
シャ+2.050
アル+1.934
+1.690
+1.584
habi+1.559
Neg logit contributions
rint-3.378
iment-2.658
agents-2.650
 tamp-2.622
 Scher-2.621
Token
Token ID198
Feature activation+1.11e+01

Pos logit contributions
シャ+2.321
アル+2.219
+2.016
+1.930
habi+1.922
Neg logit contributions
rint-2.508
iment-1.866
agents-1.850
 Scher-1.838
 tamp-1.832
Token
Token ID198
Feature activation+3.57e+00

Pos logit contributions
シャ+5.781
アル+5.470
NS+4.925
 Shepard+4.855
+4.768
Neg logit contributions
rint-10.725
 pse-9.319
 fortun-9.060
 ende-8.646
 therap-8.556
TokenDri
Token ID20564
Feature activation-8.59e-01

Pos logit contributions
シャ+2.572
アル+2.476
+2.205
+2.143
NS+2.095
Neg logit contributions
rint-3.128
 tamp-2.356
iment-2.328
 Scher-2.319
 advis-2.302
Tokenving
Token ID1075
Feature activation-2.86e-01

Pos logit contributions
rint+0.754
agents+0.573
iment+0.570
 Scher+0.566
 tamp+0.565
Neg logit contributions
シャ-0.621
アル-0.590
-0.530
-0.504
habi-0.502
Token slowly
Token ID6364
Feature activation+1.57e+00

Pos logit contributions
rint+0.249
agents+0.190
iment+0.188
 Scher+0.185
 tamp+0.184
Neg logit contributions
シャ-0.207
アル-0.199
-0.178
-0.170
habi-0.168
Token along
Token ID1863
Feature activation-2.70e+00

Pos logit contributions
シャ+1.333
アル+1.267
+1.173
+1.117
habi+1.115
Neg logit contributions
rint-1.174
agents-0.851
iment-0.829
 Scher-0.811
 tamp-0.806
Token'
Token ID6
Feature activation-4.48e-01

Pos logit contributions
シャ+1.128
アル+1.079
+0.956
+0.904
habi+0.897
Neg logit contributions
rint-1.583
agents-1.237
iment-1.230
 Scher-1.218
 tamp-1.215
Token OR
Token ID6375
Feature activation+1.05e+00

Pos logit contributions
rint+0.370
agents+0.277
iment+0.276
 Scher+0.272
 tamp+0.269
Neg logit contributions
シャ-0.344
アル-0.330
-0.297
-0.285
habi-0.282
Token RIGHT
Token ID33621
Feature activation+2.68e+00

Pos logit contributions
シャ+0.833
アル+0.806
+0.727
+0.700
habi+0.688
Neg logit contributions
rint-0.829
agents-0.625
iment-0.617
 Scher-0.603
 tamp-0.597
Token (
Token ID357
Feature activation-4.22e+00

Pos logit contributions
シャ+1.644
アル+1.567
+1.384
+1.322
NS+1.280
Neg logit contributions
rint-2.605
agents-2.071
iment-2.045
 advis-2.002
 tamp-1.997
Token path
Token ID3108
Feature activation+8.91e+00

Pos logit contributions
rint+2.884
 tamp+1.995
iment+1.988
agents+1.961
 Scher+1.943
Neg logit contributions
シャ-3.860
アル-3.669
-3.391
-3.258
habi-3.247
Token,
Token ID837
Feature activation+1.10e+01

Pos logit contributions
シャ+4.314
アル+4.034
+3.472
NS+3.269
+3.224
Neg logit contributions
rint-9.436
 fortun-7.817
 pse-7.599
agents-7.560
 advis-7.557
Token 5
Token ID642
Feature activation-4.04e-01

Pos logit contributions
アル+6.241
シャ+6.066
+5.048
+5.033
ンジ+4.985
Neg logit contributions
rint-10.097
agents-8.276
iment-8.143
 tamp-7.809
ventures-7.797
Token )
Token ID1267
Feature activation-5.46e+00

Pos logit contributions
rint+0.347
iment+0.262
agents+0.261
 Scher+0.258
 tamp+0.257
Neg logit contributions
シャ-0.302
アル-0.287
-0.257
-0.246
habi-0.245
Token =
Token ID796
Feature activation+6.86e+00

Pos logit contributions
rint+2.911
iment+1.799
 Scher+1.757
 tamp+1.736
agents+1.718
Neg logit contributions
シャ-5.749
アル-5.524
-5.078
habi-4.997
-4.968
Token '.
Token ID45302
Feature activation+2.01e+00

Pos logit contributions
シャ+4.011
アル+3.936
+3.356
+3.178
ンジ+3.132
Neg logit contributions
rint-6.501
agents-5.294
iment-5.199
 Scher-5.009
 tamp-5.001
Tokencl
Token ID565
Feature activation+3.91e+00

Pos logit contributions
シャ+1.282
アル+1.224
+1.088
habi+1.021
+1.018
Neg logit contributions
rint-1.878
iment-1.479
 Scher-1.479
agents-1.472
 tamp-1.471
Tokenun
Token ID403
Feature activation+6.53e-01

Pos logit contributions
シャ+5.347
アル+4.899
habi+4.604
+4.467
nikov+4.321
Neg logit contributions
rint-8.498
 pse-7.072
 advis-6.905
agents-6.859
 tamp-6.830
Token Bo
Token ID3248
Feature activation+2.22e-01

Pos logit contributions
シャ+0.497
アル+0.476
+0.430
+0.410
habi+0.407
Neg logit contributions
rint-0.544
agents-0.410
iment-0.407
 Scher-0.400
 tamp-0.399
Tokenil
Token ID346
Feature activation+2.29e+00

Pos logit contributions
シャ+0.159
アル+0.151
+0.137
+0.130
habi+0.129
Neg logit contributions
rint-0.193
agents-0.148
iment-0.147
 Scher-0.146
 tamp-0.145
Token |
Token ID930
Feature activation+5.20e+00

Pos logit contributions
シャ+1.703
アル+1.633
+1.474
+1.403
habi+1.388
Neg logit contributions
rint-1.931
agents-1.465
iment-1.457
 Scher-1.427
 tamp-1.418
Token C
Token ID327
Feature activation-3.25e+00

Pos logit contributions
シャ+3.380
アル+3.224
+2.879
+2.758
 Shepard+2.720
Neg logit contributions
rint-4.712
agents-3.763
iment-3.714
 advis-3.597
 tamp-3.573
Tokenaj
Token ID1228
Feature activation+1.10e+01

Pos logit contributions
rint+2.299
iment+1.643
agents+1.623
 Scher+1.582
 tamp+1.566
Neg logit contributions
シャ-2.902
アル-2.779
-2.573
-2.446
ンジ-2.426
Tokenun
Token ID403
Feature activation+7.51e-01

Pos logit contributions
シャ+5.264
アル+4.617
habi+4.605
+4.428
NS+4.274
Neg logit contributions
rint-10.414
 pse-9.611
 fortun-9.211
 advis-8.932
 tamp-8.906
Token Kitchen
Token ID22541
Feature activation+2.18e+00

Pos logit contributions
シャ+0.576
アル+0.550
+0.495
+0.474
habi+0.469
Neg logit contributions
rint-0.627
iment-0.471
agents-0.470
 Scher-0.464
 tamp-0.461
Token<|endoftext|>
Token ID50256
Feature activation-8.22e-01

Pos logit contributions
シャ+1.682
アル+1.615
+1.460
+1.394
habi+1.377
Neg logit contributions
rint-1.773
agents-1.337
iment-1.331
 Scher-1.302
 tamp-1.289
TokenRecently
Token ID24661
Feature activation-1.84e+00

Pos logit contributions
rint+0.703
iment+0.531
agents+0.530
 Scher+0.524
 tamp+0.523
Neg logit contributions
シャ-0.614
アル-0.586
-0.526
-0.503
habi-0.496
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
rint+1.467
agents+1.083
iment+1.079
 Scher+1.066
 tamp+1.061
Neg logit contributions
シャ-1.475
アル-1.414
-1.279
-1.226
habi-1.217
Tokeneb
Token ID1765
Feature activation+2.15e+00

Pos logit contributions
シャ+3.718
アル+3.520
habi+3.206
+3.184
NS+3.107
Neg logit contributions
rint-3.237
agents-2.384
 tamp-2.375
 Scher-2.339
iment-2.301
Tokena
Token ID64
Feature activation+1.95e+00

Pos logit contributions
シャ+1.652
アル+1.581
+1.438
habi+1.392
+1.367
Neg logit contributions
rint-1.753
agents-1.313
 tamp-1.300
iment-1.291
 advis-1.272
Token Sidd
Token ID44487
Feature activation+2.54e-01

Pos logit contributions
シャ+1.438
アル+1.374
+1.237
habi+1.190
+1.175
Neg logit contributions
rint-1.665
agents-1.254
iment-1.243
 tamp-1.238
 Scher-1.217
Tokeniqu
Token ID1557
Feature activation+1.85e+00

Pos logit contributions
シャ+0.272
アル+0.264
+0.246
habi+0.239
+0.238
Neg logit contributions
rint-0.132
agents-0.079
iment-0.078
 tamp-0.077
 Scher-0.077
Tokeni
Token ID72
Feature activation+6.34e+00

Pos logit contributions
シャ+1.487
アル+1.421
+1.293
habi+1.289
nikov+1.263
Neg logit contributions
rint-1.405
 tamp-1.054
agents-1.040
 Scher-1.025
 advis-1.003
Token;
Token ID26
Feature activation+1.10e+01

Pos logit contributions
シャ+4.726
アル+4.447
+4.031
habi+4.009
nikov+3.945
Neg logit contributions
rint-5.063
agents-3.824
 tamp-3.814
iment-3.794
 Scher-3.761
Token Editing
Token ID39883
Feature activation+6.11e-01

Pos logit contributions
シャ+6.202
アル+5.733
 Nikon+5.162
NS+5.099
+5.054
Neg logit contributions
rint-9.647
 Scher-7.723
iment-7.697
agents-7.672
aldi-7.512
Token by
Token ID416
Feature activation+5.29e+00

Pos logit contributions
シャ+0.416
アル+0.395
+0.354
+0.335
habi+0.331
Neg logit contributions
rint-0.559
agents-0.433
iment-0.431
 tamp-0.423
 Scher-0.421
Token Mark
Token ID2940
Feature activation-5.43e-01

Pos logit contributions
シャ+3.578
アル+3.388
+3.094
 Shepard+3.045
 Nikon+2.950
Neg logit contributions
rint-4.640
agents-3.573
iment-3.526
 advis-3.457
 tamp-3.447
Token Potter
Token ID14179
Feature activation+2.31e+00

Pos logit contributions
rint+0.507
 Scher+0.425
 Meadows+0.400
 Coun+0.390
agents+0.390
Neg logit contributions
シャ-0.375
アル-0.350
-0.300
ンジ-0.286
-0.283
Token)
Token ID8
Feature activation+4.18e-01

Pos logit contributions
シャ+1.794
アル+1.707
+1.557
+1.480
habi+1.480
Neg logit contributions
rint-1.891
agents-1.414
iment-1.411
 tamp-1.388
 Scher-1.380
Token life
Token ID1204
Feature activation-6.58e-01

Pos logit contributions
rint+0.567
agents+0.430
iment+0.429
 Scher+0.422
 tamp+0.418
Neg logit contributions
シャ-0.473
アル-0.451
-0.404
-0.385
habi-0.382
Token of
Token ID286
Feature activation-1.43e+00

Pos logit contributions
rint+0.511
agents+0.375
iment+0.373
 Scher+0.369
 tamp+0.367
Neg logit contributions
シャ-0.543
アル-0.518
-0.473
-0.455
habi-0.449
Token this
Token ID428
Feature activation-1.01e-01

Pos logit contributions
rint+1.138
agents+0.838
iment+0.838
 Scher+0.824
 tamp+0.819
Neg logit contributions
シャ-1.143
アル-1.095
-0.992
-0.950
habi-0.942
Token country
Token ID1499
Feature activation+1.48e+00

Pos logit contributions
rint+0.085
agents+0.064
iment+0.064
 Scher+0.063
 tamp+0.063
Neg logit contributions
シャ-0.076
アル-0.073
-0.066
-0.063
habi-0.062
Token'
Token ID6
Feature activation+2.28e+00

Pos logit contributions
シャ+1.065
アル+1.022
+0.918
+0.872
habi+0.868
Neg logit contributions
rint-1.291
agents-0.978
iment-0.971
 Scher-0.960
 tamp-0.959
Token
Token ID198
Feature activation+1.10e+01

Pos logit contributions
シャ+1.773
アル+1.707
+1.531
+1.469
habi+1.469
Neg logit contributions
rint-1.856
iment-1.374
agents-1.372
 Scher-1.343
 tamp-1.341
Token
Token ID198
Feature activation+4.15e+00

Pos logit contributions
シャ+5.721
アル+5.378
NS+4.857
 Shepard+4.792
+4.707
Neg logit contributions
rint-10.545
 pse-9.169
 fortun-8.917
 ende-8.509
 therap-8.395
TokenThe
Token ID464
Feature activation+7.68e-01

Pos logit contributions
シャ+2.696
アル+2.559
+2.235
+2.150
NS+2.136
Neg logit contributions
rint-3.946
 tamp-3.032
iment-3.011
agents-2.991
 Scher-2.976
Token first
Token ID717
Feature activation+3.39e+00

Pos logit contributions
シャ+0.569
アル+0.540
+0.485
+0.461
habi+0.457
Neg logit contributions
rint-0.660
agents-0.500
iment-0.492
 Scher-0.483
 tamp-0.481
Token lady
Token ID10846
Feature activation-2.10e+00

Pos logit contributions
シャ+3.024
アル+2.877
+2.615
habi+2.516
+2.513
Neg logit contributions
rint-2.410
agents-1.693
iment-1.658
 Scher-1.655
 tamp-1.608
Token's
Token ID338
Feature activation-1.09e+00

Pos logit contributions
rint+1.685
iment+1.248
agents+1.245
 Scher+1.233
 tamp+1.228
Neg logit contributions
シャ-1.673
アル-1.602
-1.452
-1.389
habi-1.373
TokenArg
Token ID28100
Feature activation-7.53e+00

Pos logit contributions
rint+7.104
Token+5.545
agents+5.523
iment+5.458
(*+5.261
Neg logit contributions
シャ-8.404
アル-8.074
habi-7.324
-7.319
 Shepard-7.131
Tokenument
Token ID1713
Feature activation-5.56e+00

Pos logit contributions
rint+5.643
iment+4.441
agents+4.230
 Scher+4.015
Token+3.966
Neg logit contributions
シャ-6.137
アル-5.951
-5.399
ンジ-5.206
habi-5.069
TokenParser
Token ID46677
Feature activation-2.98e+00

Pos logit contributions
rint+3.576
agents+2.517
iment+2.508
 Scher+2.493
Token+2.411
Neg logit contributions
シャ-5.162
アル-4.927
-4.536
habi-4.454
ンジ-4.382
Token()
Token ID3419
Feature activation-1.32e+01

Pos logit contributions
rint+2.027
iment+1.432
agents+1.424
 Scher+1.406
 tamp+1.375
Neg logit contributions
シャ-2.720
アル-2.612
-2.402
habi-2.303
-2.292
Token ap
Token ID2471
Feature activation-1.11e+01

Pos logit contributions
rint+6.806
 tamp+4.241
 advis+4.141
iment+4.138
agents+4.108
Neg logit contributions
シャ-13.600
アル-13.248
-12.500
ンジ-11.939
-11.756
Token.
Token ID13
Feature activation-1.68e+01

Pos logit contributions
rint+7.093
iment+5.222
 Scher+5.046
(*+4.955
agents+4.941
Neg logit contributions
シャ-9.843
アル-9.426
-8.725
ンジ-8.527
habi-8.525
Tokenadd
Token ID2860
Feature activation-7.75e+00

Pos logit contributions
rint+7.657
agents+4.701
iment+4.348
(*+4.169
modules+4.158
Neg logit contributions
アル-17.685
シャ-17.672
-15.967
ンジ-15.781
ーティ-15.737
Token_
Token ID62
Feature activation-5.12e+00

Pos logit contributions
rint+5.846
iment+4.446
agents+4.426
(*+4.220
 Scher+4.196
Neg logit contributions
シャ-6.240
アル-5.957
-5.358
ンジ-5.177
habi-5.137
Tokenargument
Token ID49140
Feature activation-1.03e+01

Pos logit contributions
rint+3.084
agents+2.081
iment+2.073
 Scher+1.921
 tamp+1.900
Neg logit contributions
シャ-5.101
アル-4.936
-4.551
-4.371
ンジ-4.368
Token("
Token ID7203
Feature activation+1.33e+00

Pos logit contributions
rint+4.692
(*+2.927
iment+2.916
agents+2.713
 Scher+2.614
Neg logit contributions
シャ-10.898
アル-10.599
-9.916
ンジ-9.645
habi-9.597
Token-
Token ID12
Feature activation-3.04e+00

Pos logit contributions
シャ+0.938
アル+0.884
+0.794
+0.762
habi+0.753
Neg logit contributions
rint-1.176
 Scher-0.900
 tamp-0.898
iment-0.893
agents-0.891
Token we
Token ID356
Feature activation+2.07e+00

Pos logit contributions
シャ+2.258
アル+2.144
+1.947
 Shepard+1.832
habi+1.820
Neg logit contributions
rint-3.092
agents-2.349
iment-2.344
 Scher-2.292
 tamp-2.254
Token can
Token ID460
Feature activation+2.62e+00

Pos logit contributions
シャ+1.360
アル+1.305
+1.164
habi+1.099
+1.094
Neg logit contributions
rint-1.926
agents-1.477
iment-1.458
 Scher-1.450
 tamp-1.416
Token work
Token ID670
Feature activation+1.48e+00

Pos logit contributions
シャ+1.961
アル+1.867
+1.685
nikov+1.622
habi+1.620
Neg logit contributions
rint-2.196
agents-1.658
 Scher-1.602
iment-1.594
 advis-1.535
Token on
Token ID319
Feature activation+9.20e-01

Pos logit contributions
シャ+1.115
アル+1.071
+0.960
habi+0.931
+0.913
Neg logit contributions
rint-1.237
agents-0.921
iment-0.908
 Scher-0.905
 tamp-0.890
Token piece
Token ID3704
Feature activation+7.45e+00

Pos logit contributions
シャ+0.675
アル+0.649
+0.587
habi+0.560
+0.553
Neg logit contributions
rint-0.783
agents-0.589
iment-0.580
 Scher-0.577
 tamp-0.568
Token by
Token ID416
Feature activation-1.56e+01

Pos logit contributions
シャ+4.213
アル+3.830
+3.557
habi+3.246
NS+3.197
Neg logit contributions
rint-7.358
agents-5.875
 Scher-5.789
iment-5.783
 tamp-5.684
Token piece
Token ID3704
Feature activation+2.53e+00

Pos logit contributions
rint+8.736
 tamp+6.783
iment+6.478
 Scher+6.179
 fault+6.041
Neg logit contributions
シャ-14.704
アル-14.193
-12.775
ンジ-12.633
habi-12.422
Token."
Token ID526
Feature activation+8.89e-01

Pos logit contributions
シャ+1.836
アル+1.772
+1.599
habi+1.540
 Shepard+1.501
Neg logit contributions
rint-2.162
agents-1.618
iment-1.588
 Scher-1.587
 tamp-1.569
Token
Token ID198
Feature activation-4.14e-01

Pos logit contributions
シャ+0.751
アル+0.722
+0.657
+0.631
habi+0.625
Neg logit contributions
rint-0.673
agents-0.489
iment-0.487
 Scher-0.480
 tamp-0.477
Token
Token ID198
Feature activation+4.95e-01

Pos logit contributions
rint+0.336
agents+0.247
iment+0.245
 tamp+0.243
 Scher+0.241
Neg logit contributions
シャ-0.328
アル-0.313
-0.284
-0.273
habi-0.270
TokenShe
Token ID3347
Feature activation-7.37e-01

Pos logit contributions
シャ+0.413
アル+0.396
+0.360
+0.348
habi+0.343
Neg logit contributions
rint-0.379
iment-0.274
 tamp-0.273
agents-0.273
 Scher-0.271
Tokenargs
Token ID22046
Feature activation-2.22e+00

Pos logit contributions
rint+1.717
iment+1.277
agents+1.272
 Scher+1.261
 tamp+1.257
Neg logit contributions
シャ-1.652
アル-1.582
-1.431
-1.370
habi-1.363
Token["
Token ID14692
Feature activation-2.47e+00

Pos logit contributions
rint+1.540
agents+1.071
iment+1.066
 tamp+1.050
 Scher+1.049
Neg logit contributions
シャ-2.012
アル-1.940
-1.776
-1.718
habi-1.702
Tokenimages
Token ID17566
Feature activation-1.85e+00

Pos logit contributions
rint+1.835
iment+1.317
agents+1.315
 Scher+1.299
 tamp+1.295
Neg logit contributions
シャ-2.114
アル-2.032
-1.853
-1.784
habi-1.770
Token"]
Token ID8973
Feature activation-4.75e+00

Pos logit contributions
rint+1.801
agents+1.418
iment+1.416
 Scher+1.399
 tamp+1.394
Neg logit contributions
シャ-1.152
アル-1.089
-0.956
-0.901
habi-0.890
Token):
Token ID2599
Feature activation-5.68e+00

Pos logit contributions
rint+2.625
iment+1.703
agents+1.699
 Scher+1.649
 tamp+1.608
Neg logit contributions
シャ-4.916
アル-4.749
-4.411
-4.242
habi-4.241
Token #
Token ID1303
Feature activation-1.56e+01

Pos logit contributions
rint+4.889
 tamp+3.648
iment+3.646
agents+3.627
 Scher+3.622
Neg logit contributions
シャ-4.250
アル-4.058
-3.689
-3.449
habi-3.407
Token load
Token ID3440
Feature activation-5.34e+00

Pos logit contributions
rint+6.788
 tamp+4.082
(*+3.739
 advis+3.484
================================================================+3.386
Neg logit contributions
シャ-16.262
アル-16.145
-14.900
-14.716
ンジ-14.528
Token the
Token ID262
Feature activation-4.81e+00

Pos logit contributions
rint+3.932
iment+2.853
 Scher+2.851
agents+2.851
 tamp+2.789
Neg logit contributions
シャ-4.514
アル-4.348
-3.958
habi-3.818
-3.797
Token image
Token ID2939
Feature activation-3.68e+00

Pos logit contributions
rint+2.870
iment+1.894
agents+1.853
 Scher+1.837
 tamp+1.817
Neg logit contributions
シャ-4.829
アル-4.677
-4.353
-4.152
habi-4.143
Token,
Token ID11
Feature activation-7.44e+00

Pos logit contributions
rint+2.741
iment+1.984
agents+1.977
 Scher+1.951
 tamp+1.944
Neg logit contributions
シャ-3.137
アル-3.018
-2.743
-2.639
habi-2.616
Token resize
Token ID47558
Feature activation-6.02e+00

Pos logit contributions
rint+4.576
 Scher+3.056
 tamp+3.032
iment+2.999
agents+2.985
Neg logit contributions
シャ-7.210
アル-6.876
-6.486
-6.206
habi-6.172
TokenGR
Token ID10761
Feature activation+5.11e+00

Pos logit contributions
シャ+3.715
アル+3.395
NS+3.057
HK+2.858
+2.772
Neg logit contributions
rint-8.236
 tamp-6.813
agents-6.724
iment-6.690
 fortun-6.679
Token2
Token ID17
Feature activation+4.50e+00

Pos logit contributions
シャ+3.322
アル+3.124
+2.787
+2.727
NS+2.696
Neg logit contributions
rint-4.809
agents-3.809
 advis-3.753
 tamp-3.725
iment-3.721
TokenG
Token ID38
Feature activation+4.78e+00

Pos logit contributions
シャ+2.627
アル+2.472
+2.123
NS+2.088
HK+2.081
Neg logit contributions
rint-4.561
agents-3.621
iment-3.604
 tamp-3.544
 advis-3.526
TokenRAY
Token ID30631
Feature activation+8.45e+00

Pos logit contributions
シャ+2.306
アル+2.105
NS+1.728
+1.702
HK+1.648
Neg logit contributions
rint-5.219
agents-4.315
iment-4.255
 tamp-4.253
 advis-4.251
Token )
Token ID1267
Feature activation-2.72e+00

Pos logit contributions
シャ+4.385
アル+3.999
+3.570
+3.405
NS+3.365
Neg logit contributions
rint-8.780
 fortun-7.078
 pse-7.027
agents-6.946
 advis-6.918
Token #
Token ID1303
Feature activation-1.47e+01

Pos logit contributions
rint+2.374
iment+1.774
 tamp+1.769
agents+1.766
 Scher+1.752
Neg logit contributions
シャ-2.014
アル-1.942
-1.760
-1.652
habi-1.617
Token smooth
Token ID7209
Feature activation-2.18e+00

Pos logit contributions
rint+7.331
 tamp+4.641
ooth+4.432
 Scher+4.392
================================================================+4.160
Neg logit contributions
シャ-14.619
アル-14.457
-13.055
habi-12.928
-12.782
Token the
Token ID262
Feature activation-9.28e-01

Pos logit contributions
rint+1.719
agents+1.264
iment+1.263
 Scher+1.250
 tamp+1.240
Neg logit contributions
シャ-1.764
アル-1.691
-1.534
-1.467
habi-1.459
Token image
Token ID2939
Feature activation-3.02e-01

Pos logit contributions
rint+0.619
agents+0.428
iment+0.424
 Scher+0.416
 tamp+0.414
Neg logit contributions
シャ-0.862
アル-0.829
-0.761
-0.739
habi-0.731
Token using
Token ID1262
Feature activation-1.87e+00

Pos logit contributions
rint+0.240
agents+0.178
iment+0.177
 Scher+0.174
 tamp+0.173
Neg logit contributions
シャ-0.241
アル-0.230
-0.208
-0.201
habi-0.198
Token a
Token ID257
Feature activation-1.27e+00

Pos logit contributions
rint+1.458
agents+1.068
iment+1.066
 Scher+1.049
 tamp+1.044
Neg logit contributions
シャ-1.533
アル-1.469
-1.334
-1.280
habi-1.270
Token scrap
Token ID15881
Feature activation-3.33e+00

Pos logit contributions
シャ+1.102
アル+1.042
+0.941
+0.901
habi+0.895
Neg logit contributions
rint-1.271
agents-0.971
iment-0.957
 Scher-0.933
 tamp-0.922
Tokened
Token ID276
Feature activation-2.12e+00

Pos logit contributions
rint+3.367
iment+2.655
agents+2.653
 Scher+2.642
 tamp+2.617
Neg logit contributions
シャ-1.950
アル-1.838
-1.602
-1.512
habi-1.480
Token by
Token ID416
Feature activation+1.07e+00

Pos logit contributions
rint+1.802
iment+1.359
agents+1.349
 Scher+1.348
 tamp+1.336
Neg logit contributions
シャ-1.590
アル-1.527
-1.364
-1.306
habi-1.286
Token from
Token ID422
Feature activation-1.96e+00

Pos logit contributions
シャ+0.825
アル+0.786
+0.716
+0.685
habi+0.683
Neg logit contributions
rint-0.873
agents-0.653
iment-0.645
 tamp-0.631
 Scher-0.630
Token deal
Token ID1730
Feature activation+2.30e+00

Pos logit contributions
rint+1.633
agents+1.224
iment+1.222
 Scher+1.207
 tamp+1.201
Neg logit contributions
シャ-1.503
アル-1.437
-1.296
-1.238
habi-1.226
Token to
Token ID284
Feature activation-1.40e+01

Pos logit contributions
シャ+1.680
アル+1.598
+1.440
habi+1.360
+1.357
Neg logit contributions
rint-1.988
agents-1.499
iment-1.495
 Scher-1.474
 tamp-1.471
Token deal
Token ID1730
Feature activation-2.45e+00

Pos logit contributions
rint+8.138
 tamp+6.129
 Scher+6.045
iment+5.739
 advis+5.682
Neg logit contributions
シャ-12.346
アル-12.316
-11.095
-10.571
nikov-10.458
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
rint+1.968
iment+1.461
agents+1.456
 Scher+1.445
 tamp+1.437
Neg logit contributions
シャ-1.950
アル-1.866
-1.685
-1.615
habi-1.592
Token drilling
Token ID19747
Feature activation+5.89e-01

Pos logit contributions
シャ+0.820
アル+0.778
+0.701
+0.673
habi+0.670
Neg logit contributions
rint-0.938
agents-0.709
iment-0.702
 tamp-0.685
 Scher-0.685
Token hundreds
Token ID5179
Feature activation+1.34e+00

Pos logit contributions
シャ+0.442
アル+0.419
+0.380
+0.365
habi+0.362
Neg logit contributions
rint-0.495
agents-0.376
iment-0.369
 Scher-0.361
 tamp-0.360
Token of
Token ID286
Feature activation-2.90e+00

Pos logit contributions
シャ+1.093
アル+1.051
+0.953
+0.912
habi+0.904
Neg logit contributions
rint-1.046
iment-0.768
agents-0.767
 Scher-0.754
 tamp-0.752
TokenED
Token ID1961
Feature activation-1.56e+00

Pos logit contributions
シャ+2.419
アル+2.326
+2.086
+2.020
NS+1.999
Neg logit contributions
rint-2.190
agents-1.584
 Scher-1.558
iment-1.533
 tamp-1.518
Token NET
Token ID30502
Feature activation+1.77e+00

Pos logit contributions
rint+1.309
agents+0.983
iment+0.982
 Scher+0.969
 tamp+0.965
Neg logit contributions
シャ-1.192
アル-1.138
-1.026
-0.980
habi-0.972
Token CALL
Token ID42815
Feature activation+1.72e+00

Pos logit contributions
シャ+1.285
アル+1.227
+1.105
+1.054
NS+1.027
Neg logit contributions
rint-1.544
agents-1.175
iment-1.168
 tamp-1.146
 Scher-1.143
Token UPDATE
Token ID35717
Feature activation+4.03e+00

Pos logit contributions
シャ+1.270
アル+1.217
+1.092
+1.043
NS+1.021
Neg logit contributions
rint-1.468
agents-1.112
iment-1.100
 Scher-1.077
 tamp-1.076
Token:
Token ID25
Feature activation+7.47e+00

Pos logit contributions
シャ+3.080
アル+2.968
+2.700
+2.562
 Shepard+2.541
Neg logit contributions
rint-3.298
agents-2.457
iment-2.419
 Scher-2.368
 tamp-2.355
Token 05
Token ID8870
Feature activation-1.39e+01

Pos logit contributions
シャ+3.923
アル+3.840
+3.269
 Shepard+3.099
+3.008
Neg logit contributions
rint-7.685
agents-6.150
iment-6.082
 tamp-5.898
 advis-5.836
Token38
Token ID2548
Feature activation-6.11e+00

Pos logit contributions
rint+5.986
(*+4.228
Token+4.193
====+4.161
iment+4.155
Neg logit contributions
シャ-13.680
アル-13.367
habi-12.282
-12.168
ンジ-11.791
Tokenz
Token ID89
Feature activation+3.33e+00

Pos logit contributions
rint+4.129
iment+2.862
agents+2.822
 tamp+2.815
 Scher+2.802
Neg logit contributions
シャ-5.621
アル-5.394
-4.969
-4.761
habi-4.710
Token SEM
Token ID48603
Feature activation+3.30e+00

Pos logit contributions
シャ+2.432
アル+2.310
+2.072
+2.001
 Shepard+1.963
Neg logit contributions
rint-2.870
agents-2.187
iment-2.161
 tamp-2.090
 Scher-2.086
TokenINAL
Token ID17961
Feature activation+1.19e+00

Pos logit contributions
シャ+2.380
アル+2.292
+2.071
NS+1.983
+1.962
Neg logit contributions
rint-2.872
agents-2.181
 tamp-2.111
iment-2.108
 Scher-2.076
Token MA
Token ID8779
Feature activation+4.10e+00

Pos logit contributions
シャ+0.875
アル+0.835
+0.751
+0.717
ンジ+0.698
Neg logit contributions
rint-1.030
agents-0.784
iment-0.777
 Scher-0.763
 tamp-0.761
TokenED
Token ID1961
Feature activation+1.29e+00

Pos logit contributions
シャ+3.526
アル+3.311
+2.927
NS+2.890
+2.874
Neg logit contributions
rint-4.975
agents-3.858
 tamp-3.821
 pse-3.803
 Scher-3.797
Token NET
Token ID30502
Feature activation+1.69e+00

Pos logit contributions
シャ+1.047
アル+1.003
+0.908
+0.877
ンジ+0.857
Neg logit contributions
rint-1.021
agents-0.753
iment-0.747
 Scher-0.731
 tamp-0.729
Token CALL
Token ID42815
Feature activation+5.93e+00

Pos logit contributions
シャ+1.168
アル+1.110
+0.993
+0.962
 Shepard+0.925
Neg logit contributions
rint-1.540
agents-1.186
iment-1.164
 tamp-1.149
 Scher-1.132
Token UPDATE
Token ID35717
Feature activation+4.72e+00

Pos logit contributions
シャ+3.150
アル+2.991
+2.564
+2.416
NS+2.340
Neg logit contributions
rint-6.185
agents-4.995
iment-4.937
 Scher-4.836
 tamp-4.817
Token:
Token ID25
Feature activation+7.36e+00

Pos logit contributions
シャ+3.501
アル+3.342
+3.058
+2.942
 Shepard+2.915
Neg logit contributions
rint-3.978
agents-2.958
iment-2.887
 fortun-2.845
 advis-2.821
Token 05
Token ID8870
Feature activation-1.35e+01

Pos logit contributions
シャ+3.431
アル+3.372
+2.777
 Shepard+2.576
+2.492
Neg logit contributions
rint-8.053
agents-6.532
iment-6.509
 tamp-6.346
 Scher-6.301
Token43
Token ID3559
Feature activation-6.81e+00

Pos logit contributions
rint+5.616
Token+3.861
 Scher+3.641
iment+3.625
(*+3.601
Neg logit contributions
シャ-13.862
アル-13.288
habi-12.336
-12.267
ンジ-12.028
Tokenz
Token ID89
Feature activation+1.38e+00

Pos logit contributions
rint+4.492
iment+3.124
agents+3.075
 Scher+3.010
 tamp+3.006
Neg logit contributions
シャ-6.370
アル-6.120
-5.649
-5.393
ンジ-5.345
Token ST
Token ID3563
Feature activation-3.76e+00

Pos logit contributions
シャ+1.001
アル+0.952
+0.852
+0.820
ンジ+0.799
Neg logit contributions
rint-1.202
agents-0.915
iment-0.909
 tamp-0.886
 Scher-0.883
TokenOW
Token ID3913
Feature activation-1.05e+00

Pos logit contributions
rint+2.506
iment+1.700
 tamp+1.682
 Scher+1.680
agents+1.676
Neg logit contributions
シャ-3.507
アル-3.374
-3.124
habi-2.998
-2.987
Token MA
Token ID8779
Feature activation+3.03e+00

Pos logit contributions
rint+0.872
agents+0.653
iment+0.652
 Scher+0.643
 tamp+0.640
Neg logit contributions
シャ-0.808
アル-0.772
-0.697
-0.666
habi-0.660
Token =
Token ID796
Feature activation+2.95e+00

Pos logit contributions
rint+2.210
iment+1.455
agents+1.452
 Scher+1.424
 tamp+1.393
Neg logit contributions
シャ-3.631
アル-3.507
-3.243
-3.125
habi-3.120
Token c
Token ID269
Feature activation-6.90e-02

Pos logit contributions
シャ+1.004
アル+0.908
+0.736
+0.666
habi+0.622
Neg logit contributions
rint-3.603
agents-3.028
iment-3.019
 Scher-2.956
 tamp-2.948
Tokenv
Token ID85
Feature activation-4.03e-01

Pos logit contributions
rint+0.059
agents+0.045
 tamp+0.045
 Scher+0.044
iment+0.044
Neg logit contributions
シャ-0.050
アル-0.048
-0.043
-0.042
habi-0.042
Token2
Token ID17
Feature activation-5.80e+00

Pos logit contributions
rint+0.352
agents+0.267
iment+0.266
 tamp+0.266
 Scher+0.263
Neg logit contributions
シャ-0.293
アル-0.279
-0.249
-0.241
habi-0.235
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
rint+4.418
iment+3.295
agents+3.253
 Scher+3.250
 tamp+3.134
Neg logit contributions
シャ-4.776
アル-4.562
-4.161
habi-3.967
ンジ-3.960
Tokenget
Token ID1136
Feature activation-1.34e+01

Pos logit contributions
rint+0.975
iment+0.710
agents+0.708
 tamp+0.703
 Scher+0.703
Neg logit contributions
シャ-1.052
アル-1.012
-0.919
-0.885
habi-0.878
TokenStruct
Token ID44909
Feature activation-2.82e+00

Pos logit contributions
rint+8.738
Seg+6.903
Token+6.707
iment+6.705
(*+6.689
Neg logit contributions
シャ-10.447
アル-9.951
-9.489
habi-9.344
nikov-9.194
Tokenuring
Token ID870
Feature activation-6.49e+00

Pos logit contributions
rint+2.353
agents+1.764
iment+1.760
 Scher+1.737
 tamp+1.730
Neg logit contributions
シャ-2.160
アル-2.065
-1.861
-1.779
habi-1.762
TokenElement
Token ID20180
Feature activation-2.31e+00

Pos logit contributions
rint+4.477
iment+3.356
agents+3.260
 Scher+3.201
Token+3.188
Neg logit contributions
シャ-5.582
アル-5.292
-4.958
habi-4.804
ンジ-4.691
Token(
Token ID7
Feature activation-1.86e+00

Pos logit contributions
rint+1.480
agents+1.001
iment+1.000
 Scher+0.979
 tamp+0.969
Neg logit contributions
シャ-2.211
アル-2.131
-1.967
-1.895
habi-1.885
Tokencv
Token ID33967
Feature activation-3.27e+00

Pos logit contributions
rint+1.726
iment+1.340
agents+1.337
 tamp+1.327
 Scher+1.325
Neg logit contributions
シャ-1.241
アル-1.180
-1.046
-0.999
habi-0.987
TokenED
Token ID1961
Feature activation+1.38e+00

Pos logit contributions
シャ+3.398
アル+3.184
+2.813
NS+2.759
+2.759
Neg logit contributions
rint-4.755
agents-3.668
 Scher-3.645
 tamp-3.616
 pse-3.600
Token NET
Token ID30502
Feature activation+1.08e+00

Pos logit contributions
シャ+1.028
アル+0.981
+0.882
+0.848
NS+0.825
Neg logit contributions
rint-1.171
agents-0.887
iment-0.878
 Scher-0.857
 tamp-0.854
Token CALL
Token ID42815
Feature activation+4.05e+00

Pos logit contributions
シャ+0.803
アル+0.767
+0.690
+0.659
habi+0.648
Neg logit contributions
rint-0.927
agents-0.703
iment-0.700
 Scher-0.689
 tamp-0.686
Token UPDATE
Token ID35717
Feature activation+3.78e+00

Pos logit contributions
シャ+2.739
アル+2.627
+2.329
+2.222
NS+2.181
Neg logit contributions
rint-3.677
agents-2.856
iment-2.817
 Scher-2.741
 tamp-2.738
Token:
Token ID25
Feature activation+7.21e+00

Pos logit contributions
シャ+3.117
アル+3.004
+2.737
+2.614
habi+2.580
Neg logit contributions
rint-2.894
agents-2.116
iment-2.093
 Scher-2.054
 tamp-2.031
Token 05
Token ID8870
Feature activation-1.33e+01

Pos logit contributions
シャ+3.480
アル+3.419
+2.840
 Shepard+2.667
+2.575
Neg logit contributions
rint-7.752
agents-6.262
iment-6.225
 tamp-6.071
 Scher-5.997
Token41
Token ID3901
Feature activation-7.42e+00

Pos logit contributions
rint+5.716
Token+3.871
iment+3.853
(*+3.693
 Scher+3.686
Neg logit contributions
シャ-13.408
アル-13.217
habi-12.140
-12.034
ンジ-11.672
Tokenz
Token ID89
Feature activation+1.97e+00

Pos logit contributions
rint+4.809
iment+3.319
 Scher+3.232
agents+3.227
 tamp+3.226
Neg logit contributions
シャ-7.001
アル-6.700
-6.189
-5.906
ンジ-5.851
Token BLACK
Token ID31963
Feature activation-2.15e+00

Pos logit contributions
シャ+1.457
アル+1.387
+1.246
+1.201
 Shepard+1.168
Neg logit contributions
rint-1.685
agents-1.275
iment-1.265
 tamp-1.230
 Scher-1.224
TokenFO
Token ID6080
Feature activation+1.07e+00

Pos logit contributions
rint+1.630
agents+1.185
iment+1.182
 Scher+1.164
 tamp+1.156
Neg logit contributions
シャ-1.809
アル-1.741
-1.576
-1.517
habi-1.506
TokenOT
Token ID2394
Feature activation-1.82e-01

Pos logit contributions
シャ+0.831
アル+0.790
+0.719
+0.695
NS+0.671
Neg logit contributions
rint-0.881
agents-0.661
iment-0.654
 Scher-0.642
 tamp-0.640
Token correct
Token ID3376
Feature activation-1.59e+00

Pos logit contributions
rint+1.396
agents+1.011
iment+1.010
 Scher+0.993
 tamp+0.989
Neg logit contributions
シャ-1.555
アル-1.492
-1.358
-1.305
habi-1.295
Tokens
Token ID82
Feature activation-4.92e+00

Pos logit contributions
rint+1.257
agents+0.923
iment+0.921
 Scher+0.908
 tamp+0.903
Neg logit contributions
シャ-1.296
アル-1.241
-1.127
-1.080
habi-1.070
Token trials
Token ID9867
Feature activation+1.76e-01

Pos logit contributions
rint+3.769
agents+2.767
iment+2.762
 Scher+2.707
 tamp+2.695
Neg logit contributions
シャ-4.075
アル-3.899
-3.531
-3.409
habi-3.384
Token in
Token ID287
Feature activation-2.20e+00

Pos logit contributions
シャ+0.137
アル+0.132
+0.120
+0.114
habi+0.113
Neg logit contributions
rint-0.143
agents-0.106
iment-0.105
 Scher-0.104
 tamp-0.104
Token short
Token ID1790
Feature activation-6.42e+00

Pos logit contributions
rint+1.759
agents+1.299
iment+1.296
 Scher+1.278
 tamp+1.272
Neg logit contributions
シャ-1.766
アル-1.691
-1.532
-1.468
habi-1.455
Token or
Token ID393
Feature activation-1.32e+01

Pos logit contributions
rint+5.079
agents+3.784
iment+3.744
 Scher+3.721
 tamp+3.720
Neg logit contributions
シャ-5.078
アル-4.862
-4.414
-4.218
habi-4.216
Token long
Token ID890
Feature activation-4.53e+00

Pos logit contributions
rint+9.705
 tamp+7.518
iment+7.435
 Scher+7.289
 advis+7.268
Neg logit contributions
アル-9.758
シャ-9.725
-8.449
-8.413
habi-8.257
Token training
Token ID3047
Feature activation-1.49e+00

Pos logit contributions
rint+3.570
iment+2.636
agents+2.635
 tamp+2.608
 Scher+2.599
Neg logit contributions
シャ-3.645
アル-3.498
-3.170
-3.039
habi-3.029
Token blocks
Token ID7021
Feature activation-6.35e-01

Pos logit contributions
rint+1.234
iment+0.919
agents+0.919
 Scher+0.908
 tamp+0.898
Neg logit contributions
シャ-1.159
アル-1.103
-1.003
-0.954
habi-0.945
Token respectively
Token ID8148
Feature activation-2.98e+00

Pos logit contributions
rint+0.508
agents+0.374
iment+0.373
 Scher+0.368
 tamp+0.366
Neg logit contributions
シャ-0.509
アル-0.487
-0.442
-0.423
habi-0.420
Token)
Token ID8
Feature activation-3.07e+00

Pos logit contributions
rint+2.267
iment+1.648
agents+1.646
 Scher+1.620
 tamp+1.612
Neg logit contributions
シャ-2.500
アル-2.394
-2.180
-2.097
habi-2.079
Tokenparse
Token ID29572
Feature activation-9.69e+00

Pos logit contributions
rint+3.590
agents+2.369
iment+2.302
 Scher+2.252
 tamp+2.230
Neg logit contributions
シャ-5.715
アル-5.534
-5.083
-4.928
ンジ-4.890
Token.
Token ID13
Feature activation-1.04e+01

Pos logit contributions
rint+6.356
iment+4.681
 Scher+4.676
agents+4.545
(*+4.479
Neg logit contributions
シャ-8.462
アル-8.172
-7.470
ンジ-7.308
habi-7.276
TokenArg
Token ID28100
Feature activation-7.53e+00

Pos logit contributions
rint+7.104
Token+5.545
agents+5.523
iment+5.458
(*+5.261
Neg logit contributions
シャ-8.404
アル-8.074
habi-7.324
-7.319
 Shepard-7.131
Tokenument
Token ID1713
Feature activation-5.56e+00

Pos logit contributions
rint+5.643
iment+4.441
agents+4.230
 Scher+4.015
Token+3.966
Neg logit contributions
シャ-6.137
アル-5.951
-5.399
ンジ-5.206
habi-5.069
TokenParser
Token ID46677
Feature activation-2.98e+00

Pos logit contributions
rint+3.576
agents+2.517
iment+2.508
 Scher+2.493
Token+2.411
Neg logit contributions
シャ-5.162
アル-4.927
-4.536
habi-4.454
ンジ-4.382
Token()
Token ID3419
Feature activation-1.32e+01

Pos logit contributions
rint+2.027
iment+1.432
agents+1.424
 Scher+1.406
 tamp+1.375
Neg logit contributions
シャ-2.720
アル-2.612
-2.402
habi-2.303
-2.292
Token ap
Token ID2471
Feature activation-1.11e+01

Pos logit contributions
rint+6.806
 tamp+4.241
 advis+4.141
iment+4.138
agents+4.108
Neg logit contributions
シャ-13.600
アル-13.248
-12.500
ンジ-11.939
-11.756
Token.
Token ID13
Feature activation-1.68e+01

Pos logit contributions
rint+7.093
iment+5.222
 Scher+5.046
(*+4.955
agents+4.941
Neg logit contributions
シャ-9.843
アル-9.426
-8.725
ンジ-8.527
habi-8.525
Tokenadd
Token ID2860
Feature activation-7.75e+00

Pos logit contributions
rint+7.657
agents+4.701
iment+4.348
(*+4.169
modules+4.158
Neg logit contributions
アル-17.685
シャ-17.672
-15.967
ンジ-15.781
ーティ-15.737
Token_
Token ID62
Feature activation-5.12e+00

Pos logit contributions
rint+5.846
iment+4.446
agents+4.426
(*+4.220
 Scher+4.196
Neg logit contributions
シャ-6.240
アル-5.957
-5.358
ンジ-5.177
habi-5.137
Tokenargument
Token ID49140
Feature activation-1.03e+01

Pos logit contributions
rint+3.084
agents+2.081
iment+2.073
 Scher+1.921
 tamp+1.900
Neg logit contributions
シャ-5.101
アル-4.936
-4.551
-4.371
ンジ-4.368
Token said
Token ID531
Feature activation-1.14e+00

Pos logit contributions
rint+0.526
agents+0.389
iment+0.389
 Scher+0.384
 tamp+0.382
Neg logit contributions
シャ-0.502
アル-0.479
-0.434
-0.416
habi-0.411
Token one
Token ID530
Feature activation+1.14e+00

Pos logit contributions
rint+0.933
iment+0.693
agents+0.693
 Scher+0.682
 tamp+0.678
Neg logit contributions
シャ-0.886
アル-0.848
-0.765
-0.732
habi-0.725
Token attacker
Token ID15250
Feature activation-1.91e+00

Pos logit contributions
シャ+0.874
アル+0.826
+0.742
+0.715
habi+0.707
Neg logit contributions
rint-0.950
agents-0.712
iment-0.706
 Scher-0.694
 tamp-0.686
Token was
Token ID373
Feature activation-2.06e+00

Pos logit contributions
rint+1.569
agents+1.172
iment+1.169
 Scher+1.158
 tamp+1.154
Neg logit contributions
シャ-1.489
アル-1.426
-1.289
-1.230
habi-1.217
Token Kh
Token ID5311
Feature activation-4.44e+00

Pos logit contributions
rint+1.736
agents+1.305
iment+1.303
 Scher+1.286
 tamp+1.281
Neg logit contributions
シャ-1.567
アル-1.497
-1.349
-1.287
habi-1.274
Tokenur
Token ID333
Feature activation-1.29e+01

Pos logit contributions
rint+3.054
agents+2.116
iment+2.111
 Scher+2.055
 tamp+2.029
Neg logit contributions
シャ-4.099
アル-3.914
-3.623
-3.480
ンジ-3.408
Tokenam
Token ID321
Feature activation-4.62e+00

Pos logit contributions
rint+10.091
iment+7.485
agents+7.286
cycle+7.157
 Scher+7.057
Neg logit contributions
シャ-9.993
アル-9.796
-9.068
-8.325
ンジ-8.286
Token Sh
Token ID911
Feature activation+1.32e+00

Pos logit contributions
rint+4.106
 Scher+3.161
agents+3.157
iment+3.136
 tamp+3.100
Neg logit contributions
シャ-3.240
アル-3.109
-2.772
-2.618
habi-2.554
Tokenaz
Token ID1031
Feature activation-1.57e+00

Pos logit contributions
シャ+1.195
アル+1.140
+1.041
habi+1.029
nikov+1.011
Neg logit contributions
rint-0.904
 Scher-0.637
 tamp-0.626
agents-0.625
iment-0.624
Tokenad
Token ID324
Feature activation+1.43e+00

Pos logit contributions
rint+1.452
agents+1.122
iment+1.122
 Scher+1.106
 tamp+1.101
Neg logit contributions
シャ-1.070
アル-1.018
-0.903
-0.857
habi-0.844
Token Butt
Token ID36041
Feature activation-1.40e+00

Pos logit contributions
シャ+0.905
アル+0.854
+0.754
+0.715
habi+0.714
Neg logit contributions
rint-1.377
iment-1.078
agents-1.077
 Scher-1.057
 tamp-1.056
Token even
Token ID772
Feature activation-5.79e+00

Pos logit contributions
rint+7.812
 tamp+5.927
 Scher+5.787
agents+5.719
iment+5.576
Neg logit contributions
シャ-9.695
アル-9.418
-8.470
ンジ-8.198
habi-8.082
Token all
Token ID477
Feature activation-5.10e+00

Pos logit contributions
rint+4.442
 tamp+3.285
agents+3.283
 Scher+3.274
iment+3.272
Neg logit contributions
シャ-4.747
アル-4.540
-4.082
-3.912
habi-3.899
Token the
Token ID262
Feature activation-3.54e+00

Pos logit contributions
rint+3.974
agents+2.947
iment+2.941
 tamp+2.913
 Scher+2.907
Neg logit contributions
シャ-4.108
アル-3.936
-3.573
habi-3.393
-3.392
Token spoil
Token ID20851
Feature activation-6.10e+00

Pos logit contributions
rint+2.954
agents+2.234
iment+2.227
 Scher+2.204
 tamp+2.201
Neg logit contributions
シャ-2.691
アル-2.579
-2.312
-2.211
habi-2.198
Token thereof
Token ID15370
Feature activation-9.11e+00

Pos logit contributions
rint+4.631
iment+3.427
agents+3.414
 tamp+3.368
 Scher+3.356
Neg logit contributions
シャ-5.013
アル-4.850
-4.370
habi-4.164
-4.144
Token,
Token ID11
Feature activation-1.28e+01

Pos logit contributions
rint+6.159
 tamp+4.479
iment+4.428
 Scher+4.360
agents+4.313
Neg logit contributions
シャ-7.939
アル-7.700
-6.927
ンジ-6.688
-6.596
Token shalt
Token ID36258
Feature activation-5.67e+00

Pos logit contributions
rint+8.577
 tamp+6.534
 Scher+6.394
agents+6.278
 advis+6.104
Neg logit contributions
シャ-10.556
アル-10.273
-9.211
ンジ-8.935
habi-8.846
Token thou
Token ID14210
Feature activation-9.43e+00

Pos logit contributions
rint+4.463
 tamp+3.369
iment+3.334
 Scher+3.305
agents+3.262
Neg logit contributions
シャ-4.514
アル-4.351
-3.889
-3.716
habi-3.690
Token take
Token ID1011
Feature activation-6.61e+00

Pos logit contributions
rint+6.719
 tamp+5.187
 Scher+4.900
iment+4.891
 advis+4.804
Neg logit contributions
シャ-7.959
アル-7.670
-6.946
habi-6.609
-6.586
Token unto
Token ID12722
Feature activation-5.10e+00

Pos logit contributions
rint+5.016
iment+3.700
 tamp+3.673
agents+3.654
 Scher+3.628
Neg logit contributions
シャ-5.441
アル-5.203
-4.736
habi-4.504
-4.499
Token thy
Token ID11906
Feature activation-4.74e+00

Pos logit contributions
rint+3.912
iment+2.889
 tamp+2.888
agents+2.885
 Scher+2.864
Neg logit contributions
シャ-4.151
アル-3.991
-3.594
habi-3.428
ンジ-3.397
Token="
Token ID2625
Feature activation+1.97e+00

Pos logit contributions
rint+3.908
agents+2.785
iment+2.753
 Scher+2.651
 tamp+2.644
Neg logit contributions
シャ-5.003
アル-4.761
-4.389
-4.189
habi-4.183
Tokenpath
Token ID6978
Feature activation+7.58e-01

Pos logit contributions
シャ+1.534
アル+1.487
+1.347
+1.286
habi+1.276
Neg logit contributions
rint-1.557
iment-1.163
 tamp-1.163
 Scher-1.159
agents-1.138
Token to
Token ID284
Feature activation-7.46e-02

Pos logit contributions
シャ+0.555
アル+0.531
+0.477
+0.454
habi+0.451
Neg logit contributions
rint-0.649
agents-0.495
iment-0.491
 Scher-0.486
 tamp-0.484
Token images
Token ID4263
Feature activation-3.55e+00

Pos logit contributions
rint+0.057
iment+0.042
agents+0.042
 Scher+0.041
 tamp+0.041
Neg logit contributions
シャ-0.062
アル-0.059
-0.054
-0.052
habi-0.051
Token directory
Token ID8619
Feature activation+7.58e-01

Pos logit contributions
rint+2.787
iment+2.039
agents+2.039
 Scher+2.008
 tamp+1.999
Neg logit contributions
シャ-2.902
アル-2.776
-2.519
-2.416
habi-2.391
Token")
Token ID4943
Feature activation-1.28e+01

Pos logit contributions
シャ+0.595
アル+0.574
+0.521
+0.498
habi+0.494
Neg logit contributions
rint-0.602
agents-0.452
iment-0.448
 Scher-0.440
 tamp-0.437
Token args
Token ID26498
Feature activation-4.73e+00

Pos logit contributions
rint+9.623
 tamp+6.984
agents+6.843
 advis+6.765
(*+6.681
Neg logit contributions
シャ-10.438
アル-9.949
-9.120
ンジ-8.655
-8.614
Token =
Token ID796
Feature activation-7.69e-01

Pos logit contributions
rint+2.989
agents+1.990
iment+1.981
 tamp+1.916
 Scher+1.915
Neg logit contributions
シャ-4.589
アル-4.416
-4.067
-3.932
ンジ-3.898
Token v
Token ID410
Feature activation-2.92e+00

Pos logit contributions
rint+0.568
iment+0.408
agents+0.408
 Scher+0.400
 tamp+0.395
Neg logit contributions
シャ-0.659
アル-0.635
-0.580
-0.558
habi-0.552
Tokenars
Token ID945
Feature activation-4.15e+00

Pos logit contributions
rint+2.678
agents+2.075
iment+2.070
 Scher+2.057
 tamp+2.051
Neg logit contributions
シャ-1.972
アル-1.872
-1.663
-1.577
habi-1.572
Token(
Token ID7
Feature activation-4.33e+00

Pos logit contributions
rint+3.016
iment+2.148
agents+2.138
 tamp+2.093
 Scher+2.091
Neg logit contributions
シャ-3.631
アル-3.485
-3.185
-3.061
habi-3.027
Tokenobject
Token ID15252
Feature activation+5.54e-01

Pos logit contributions
rint+1.255
agents+0.948
 tamp+0.946
 Scher+0.945
iment+0.943
Neg logit contributions
シャ-1.071
アル-1.015
-0.916
-0.877
habi-0.869
Token system
Token ID1080
Feature activation+1.68e+00

Pos logit contributions
シャ+0.383
アル+0.368
+0.325
+0.314
habi+0.306
Neg logit contributions
rint-0.499
agents-0.386
iment-0.382
 Scher-0.376
 tamp-0.375
Token.
Token ID13
Feature activation-7.99e+00

Pos logit contributions
シャ+1.135
アル+1.085
+0.962
+0.928
NS+0.897
Neg logit contributions
rint-1.536
agents-1.180
 tamp-1.175
 advis-1.165
iment-1.164
Tokendraw
Token ID19334
Feature activation-1.17e+00

Pos logit contributions
rint+5.413
agents+3.720
iment+3.660
 Scher+3.442
cycle+3.436
Neg logit contributions
シャ-7.227
アル-6.934
-6.363
 Shepard-6.126
nikov-6.099
Tokening
Token ID278
Feature activation-4.05e+00

Pos logit contributions
rint+0.993
agents+0.745
 Scher+0.738
iment+0.737
 tamp+0.737
Neg logit contributions
シャ-0.878
アル-0.837
-0.757
-0.727
habi-0.719
Token.
Token ID13
Feature activation-1.25e+01

Pos logit contributions
rint+3.308
iment+2.470
agents+2.466
 Scher+2.433
 tamp+2.403
Neg logit contributions
シャ-3.171
アル-3.035
-2.737
-2.610
habi-2.598
Tokenbit
Token ID2545
Feature activation-5.78e+00

Pos logit contributions
rint+8.740
agents+6.112
iment+6.063
oct+5.895
Token+5.802
Neg logit contributions
シャ-10.385
アル-9.971
-9.030
 Shepard-8.813
nikov-8.617
Tokenmap
Token ID8899
Feature activation-1.89e+00

Pos logit contributions
rint+3.020
iment+1.778
agents+1.761
 Scher+1.676
 tamp+1.642
Neg logit contributions
シャ-6.327
アル-6.118
-5.686
-5.483
ンジ-5.434
Token($
Token ID16763
Feature activation-2.34e+00

Pos logit contributions
rint+1.785
agents+1.390
iment+1.388
 Scher+1.373
 tamp+1.367
Neg logit contributions
シャ-1.235
アル-1.171
-1.034
-0.979
habi-0.970
TokenSR
Token ID12562
Feature activation-4.86e+00

Pos logit contributions
rint+1.846
agents+1.356
iment+1.352
 Scher+1.330
 tamp+1.323
Neg logit contributions
シャ-1.898
アル-1.815
-1.650
-1.577
habi-1.565
Token.
Token ID13
Feature activation-6.03e+00

Pos logit contributions
rint+3.656
iment+2.627
agents+2.604
 Scher+2.526
 tamp+2.525
Neg logit contributions
シャ-4.117
アル-3.874
-3.580
-3.434
habi-3.416
Token on
Token ID319
Feature activation+6.87e-01

Pos logit contributions
シャ+2.491
アル+2.387
+2.161
+2.062
habi+2.045
Neg logit contributions
rint-2.916
agents-2.218
iment-2.159
 Scher-2.121
 tamp-2.119
Token Division
Token ID7458
Feature activation+2.71e+00

Pos logit contributions
シャ+0.511
アル+0.479
+0.433
+0.419
 Shepard+0.409
Neg logit contributions
rint-0.582
agents-0.441
iment-0.439
 Scher-0.429
 tamp-0.424
Token Street
Token ID3530
Feature activation+6.69e-02

Pos logit contributions
シャ+1.889
アル+1.782
+1.647
+1.559
 Shepard+1.545
Neg logit contributions
rint-2.364
agents-1.828
iment-1.806
 tamp-1.769
 Scher-1.756
Token.
Token ID13
Feature activation-1.38e-01

Pos logit contributions
シャ+0.051
アル+0.049
+0.044
+0.042
habi+0.042
Neg logit contributions
rint-0.056
agents-0.042
iment-0.041
 tamp-0.041
 Scher-0.041
Token Block
Token ID9726
Feature activation+3.17e+00

Pos logit contributions
rint+0.113
agents+0.084
iment+0.083
 tamp+0.082
 Scher+0.082
Neg logit contributions
シャ-0.108
アル-0.103
-0.093
-0.089
habi-0.088
Token after
Token ID706
Feature activation-1.25e+01

Pos logit contributions
シャ+2.040
アル+1.907
+1.698
+1.634
habi+1.598
Neg logit contributions
rint-3.004
iment-2.356
agents-2.342
 Scher-2.285
 tamp-2.275
Token block
Token ID2512
Feature activation-1.68e+00

Pos logit contributions
rint+8.237
 tamp+6.412
iment+6.042
 Coun+5.976
 scaff+5.961
Neg logit contributions
シャ-10.145
アル-9.907
-9.066
habi-8.456
-8.362
Token is
Token ID318
Feature activation-1.33e+00

Pos logit contributions
rint+1.340
agents+0.989
iment+0.987
 Scher+0.972
 tamp+0.968
Neg logit contributions
シャ-1.353
アル-1.295
-1.174
-1.125
habi-1.114
Token filled
Token ID5901
Feature activation-8.11e-01

Pos logit contributions
rint+1.087
agents+0.810
iment+0.804
 Scher+0.792
 tamp+0.787
Neg logit contributions
シャ-1.049
アル-1.000
-0.906
-0.869
habi-0.859
Token with
Token ID351
Feature activation+2.79e+00

Pos logit contributions
rint+0.624
agents+0.455
iment+0.454
 Scher+0.447
 tamp+0.445
Neg logit contributions
シャ-0.674
アル-0.647
-0.588
-0.565
habi-0.560
Token tents
Token ID29804
Feature activation+1.70e-01

Pos logit contributions
シャ+2.027
アル+1.891
+1.706
+1.656
ンジ+1.627
Neg logit contributions
rint-2.405
iment-1.815
agents-1.811
 Scher-1.759
inion-1.749
Tokenparse
Token ID29572
Feature activation-3.91e+00

Pos logit contributions
rint+2.391
agents+1.621
iment+1.598
 Scher+1.568
 tamp+1.558
Neg logit contributions
シャ-3.548
アル-3.416
-3.147
-3.038
ンジ-3.004
Token.
Token ID13
Feature activation-5.84e+00

Pos logit contributions
rint+3.262
iment+2.439
agents+2.433
 tamp+2.389
 Scher+2.387
Neg logit contributions
シャ-3.006
アル-2.875
-2.581
-2.457
ンジ-2.453
TokenArg
Token ID28100
Feature activation-3.86e+00

Pos logit contributions
rint+4.815
agents+3.632
iment+3.561
 Scher+3.542
 tamp+3.502
Neg logit contributions
シャ-4.439
アル-4.262
-3.807
-3.632
habi-3.617
Tokenument
Token ID1713
Feature activation-3.53e+00

Pos logit contributions
rint+3.126
agents+2.327
iment+2.320
 Scher+2.282
 tamp+2.243
Neg logit contributions
シャ-3.037
アル-2.895
-2.639
-2.473
ンジ-2.462
TokenParser
Token ID46677
Feature activation-6.09e+00

Pos logit contributions
rint+2.471
iment+1.742
agents+1.731
 Scher+1.707
 tamp+1.688
Neg logit contributions
シャ-3.168
アル-3.057
-2.805
-2.686
habi-2.670
Token()
Token ID3419
Feature activation-1.24e+01

Pos logit contributions
rint+3.890
iment+2.681
agents+2.603
 Scher+2.598
 tamp+2.570
Neg logit contributions
シャ-5.736
アル-5.552
-5.115
ンジ-4.938
-4.891
Token ap
Token ID2471
Feature activation-8.36e+00

Pos logit contributions
rint+9.581
 tamp+7.071
agents+6.880
 advis+6.828
iment+6.728
Neg logit contributions
シャ-9.826
アル-9.480
-8.634
ンジ-8.124
-8.063
Token.
Token ID13
Feature activation-9.04e+00

Pos logit contributions
rint+6.459
iment+4.641
agents+4.600
 Scher+4.402
(*+4.398
Neg logit contributions
シャ-6.890
アル-6.537
-6.003
ンジ-5.739
habi-5.703
Tokenadd
Token ID2860
Feature activation-4.21e+00

Pos logit contributions
rint+6.706
agents+4.879
iment+4.738
 Scher+4.521
 tamp+4.500
Neg logit contributions
シャ-7.588
アル-7.312
-6.593
-6.398
 Shepard-6.324
Token_
Token ID62
Feature activation+7.77e-01

Pos logit contributions
rint+3.488
agents+2.607
iment+2.594
 tamp+2.519
 Scher+2.517
Neg logit contributions
シャ-3.255
アル-3.097
-2.778
-2.666
habi-2.640
Tokenargument
Token ID49140
Feature activation-9.38e-01

Pos logit contributions
シャ+0.702
アル+0.675
+0.621
+0.598
habi+0.596
Neg logit contributions
rint-0.530
 Scher-0.372
iment-0.370
 tamp-0.370
agents-0.367
Token =
Token ID796
Feature activation-3.77e+00

Pos logit contributions
rint+3.293
iment+2.128
agents+2.101
 Scher+2.061
 tamp+1.976
Neg logit contributions
シャ-5.989
アル-5.787
-5.351
habi-5.172
-5.152
Token v
Token ID410
Feature activation-5.42e+00

Pos logit contributions
rint+3.293
agents+2.489
 tamp+2.485
iment+2.482
 Scher+2.481
Neg logit contributions
シャ-2.764
アル-2.623
-2.354
-2.223
habi-2.205
Tokenars
Token ID945
Feature activation-6.69e+00

Pos logit contributions
rint+4.828
agents+3.729
iment+3.674
 Scher+3.578
 tamp+3.530
Neg logit contributions
シャ-3.896
アル-3.744
-3.319
-3.191
ンジ-3.142
Token(
Token ID7
Feature activation-7.88e+00

Pos logit contributions
rint+4.059
iment+2.735
 Scher+2.711
agents+2.696
(*+2.692
Neg logit contributions
シャ-6.460
アル-6.230
-5.808
habi-5.529
-5.528
Tokenap
Token ID499
Feature activation-8.10e+00

Pos logit contributions
rint+5.884
agents+4.373
iment+4.294
 Scher+4.050
 tamp+4.024
Neg logit contributions
シャ-6.624
アル-6.398
-5.795
ンジ-5.603
-5.543
Token.
Token ID13
Feature activation-1.24e+01

Pos logit contributions
rint+5.781
iment+4.272
agents+4.194
 Scher+4.152
(*+4.061
Neg logit contributions
シャ-6.889
アル-6.569
-6.029
habi-5.814
ンジ-5.810
Tokenparse
Token ID29572
Feature activation-5.13e+00

Pos logit contributions
rint+6.666
agents+4.025
oct+3.632
(*+3.631
cycle+3.615
Neg logit contributions
シャ-12.903
アル-12.621
-11.502
 Shepard-11.321
-11.218
Token_
Token ID62
Feature activation-4.11e+00

Pos logit contributions
rint+4.115
agents+3.107
iment+3.091
 Scher+3.026
 tamp+2.951
Neg logit contributions
シャ-4.009
アル-3.820
-3.453
habi-3.301
ンジ-3.268
Tokenargs
Token ID22046
Feature activation-3.51e+00

Pos logit contributions
rint+3.206
agents+2.386
iment+2.336
 Scher+2.278
 tamp+2.256
Neg logit contributions
シャ-3.399
アル-3.248
-2.945
-2.821
habi-2.787
Token())
Token ID28955
Feature activation-6.31e+00

Pos logit contributions
rint+1.911
agents+1.196
iment+1.196
 Scher+1.147
 tamp+1.126
Neg logit contributions
シャ-3.697
アル-3.568
-3.311
-3.200
habi-3.188
Token #
Token ID1303
Feature activation-8.54e+00

Pos logit contributions
rint+5.132
 tamp+3.759
agents+3.686
iment+3.679
 Scher+3.645
Neg logit contributions
シャ-5.052
アル-4.760
-4.462
-4.139
habi-4.022
Tokenimages
Token ID17566
Feature activation+8.93e-01

Pos logit contributions
シャ+1.084
アル+1.040
+0.942
+0.919
habi+0.914
Neg logit contributions
rint-1.085
iment-0.808
 Scher-0.807
 tamp-0.802
agents-0.770
Token"
Token ID1
Feature activation+5.11e+00

Pos logit contributions
シャ+0.700
アル+0.668
+0.604
+0.584
habi+0.577
Neg logit contributions
rint-0.725
agents-0.534
iment-0.529
 Scher-0.528
 tamp-0.526
Token ]
Token ID2361
Feature activation-3.89e+00

Pos logit contributions
シャ+2.581
アル+2.389
+2.007
+1.910
 Shepard+1.866
Neg logit contributions
rint-5.439
iment-4.355
 Scher-4.333
 fortun-4.323
agents-4.320
Token )
Token ID1267
Feature activation-8.09e+00

Pos logit contributions
rint+3.134
iment+2.355
agents+2.346
 tamp+2.307
 Scher+2.290
Neg logit contributions
シャ-3.073
アル-2.943
-2.688
-2.537
ンジ-2.521
Token :
Token ID1058
Feature activation-4.93e+00

Pos logit contributions
rint+6.690
 tamp+5.029
agents+4.946
iment+4.908
 Scher+4.897
Neg logit contributions
シャ-6.281
アル-6.045
-5.605
-5.194
ンジ-5.062
Token #
Token ID1303
Feature activation-1.23e+01

Pos logit contributions
rint+4.204
 tamp+3.083
iment+3.059
agents+3.040
 Scher+3.031
Neg logit contributions
シャ-3.787
アル-3.643
-3.360
-3.132
habi-3.055
Token load
Token ID3440
Feature activation-4.32e+00

Pos logit contributions
rint+6.043
 tamp+3.681
(*+3.373
 advis+3.260
agents+3.220
Neg logit contributions
シャ-12.499
アル-12.337
-11.448
-11.433
ンジ-10.983
Token the
Token ID262
Feature activation-2.91e+00

Pos logit contributions
rint+3.256
agents+2.378
iment+2.370
 Scher+2.368
 tamp+2.328
Neg logit contributions
シャ-3.599
アル-3.463
-3.148
habi-3.027
-3.024
Token image
Token ID2939
Feature activation-3.14e+00

Pos logit contributions
rint+1.836
iment+1.230
agents+1.226
 Scher+1.203
 tamp+1.194
Neg logit contributions
シャ-2.824
アル-2.726
-2.521
-2.423
habi-2.412
Token,
Token ID11
Feature activation-7.84e+00

Pos logit contributions
rint+2.359
iment+1.705
agents+1.701
 Scher+1.678
 tamp+1.671
Neg logit contributions
シャ-2.664
アル-2.562
-2.329
-2.241
habi-2.222
Token resize
Token ID47558
Feature activation-5.88e+00

Pos logit contributions
rint+4.731
 Scher+3.147
 tamp+3.100
agents+3.060
iment+3.030
Neg logit contributions
シャ-7.665
アル-7.297
-6.947
-6.618
habi-6.596
Token,
Token ID11
Feature activation+1.12e-01

Pos logit contributions
シャ+1.829
アル+1.737
+1.563
+1.504
habi+1.485
Neg logit contributions
rint-2.270
agents-1.738
iment-1.712
 Scher-1.675
 tamp-1.651
Token whether
Token ID1771
Feature activation-1.47e+00

Pos logit contributions
シャ+0.084
アル+0.080
+0.071
+0.069
habi+0.068
Neg logit contributions
rint-0.096
agents-0.072
iment-0.071
 tamp-0.070
 Scher-0.070
Token granted
Token ID7520
Feature activation-2.20e+00

Pos logit contributions
rint+1.201
agents+0.893
iment+0.889
 Scher+0.876
 tamp+0.873
Neg logit contributions
シャ-1.159
アル-1.108
-1.003
-0.960
habi-0.951
Token with
Token ID351
Feature activation-1.65e+00

Pos logit contributions
rint+1.767
agents+1.308
iment+1.306
 Scher+1.287
 tamp+1.283
Neg logit contributions
シャ-1.757
アル-1.680
-1.520
-1.456
habi-1.443
Token soft
Token ID2705
Feature activation-5.17e+00

Pos logit contributions
rint+1.345
agents+1.000
iment+0.998
 Scher+0.983
 tamp+0.979
Neg logit contributions
シャ-1.295
アル-1.239
-1.120
-1.072
habi-1.062
Token or
Token ID393
Feature activation-1.22e+01

Pos logit contributions
rint+3.963
agents+2.926
iment+2.920
 tamp+2.889
 Scher+2.861
Neg logit contributions
シャ-4.205
アル-4.059
-3.671
-3.568
habi-3.516
Token tough
Token ID5802
Feature activation-4.63e+00

Pos logit contributions
rint+9.190
 tamp+7.332
iment+7.253
 advis+7.139
 Scher+7.021
Neg logit contributions
シャ-8.775
アル-8.516
-7.397
-7.317
ンジ-7.213
Token conditions
Token ID3403
Feature activation+1.60e+00

Pos logit contributions
rint+3.416
iment+2.511
agents+2.483
 tamp+2.483
 Scher+2.462
Neg logit contributions
シャ-3.888
アル-3.731
-3.405
-3.285
habi-3.249
Token,
Token ID11
Feature activation+4.32e-01

Pos logit contributions
シャ+1.277
アル+1.222
+1.110
+1.066
habi+1.057
Neg logit contributions
rint-1.283
agents-0.946
iment-0.941
 Scher-0.925
 tamp-0.919
Token appears
Token ID3568
Feature activation+3.52e+00

Pos logit contributions
シャ+0.316
アル+0.302
+0.270
+0.258
habi+0.257
Neg logit contributions
rint-0.375
agents-0.285
iment-0.283
 Scher-0.277
 tamp-0.276
Token to
Token ID284
Feature activation+4.19e+00

Pos logit contributions
シャ+2.511
アル+2.402
+2.144
+2.080
habi+2.076
Neg logit contributions
rint-3.032
agents-2.294
iment-2.227
 Scher-2.203
 tamp-2.174
Token Israelis
Token ID21137
Feature activation+1.79e+00

Pos logit contributions
rint+0.990
agents+0.742
iment+0.741
 Scher+0.729
 tamp+0.726
Neg logit contributions
シャ-0.911
アル-0.870
-0.784
-0.750
habi-0.743
Token
Token ID784
Feature activation+3.31e-01

Pos logit contributions
シャ+1.322
アル+1.261
+1.121
+1.086
habi+1.077
Neg logit contributions
rint-1.531
agents-1.153
iment-1.152
 Scher-1.129
 tamp-1.117
Token Jewish
Token ID5582
Feature activation+4.11e+00

Pos logit contributions
シャ+0.254
アル+0.242
+0.218
+0.209
habi+0.207
Neg logit contributions
rint-0.276
iment-0.206
agents-0.205
 Scher-0.203
 tamp-0.202
Token,
Token ID11
Feature activation+6.01e+00

Pos logit contributions
シャ+3.088
アル+2.941
+2.651
+2.545
ンジ+2.519
Neg logit contributions
rint-3.479
iment-2.606
agents-2.555
 Scher-2.512
 tamp-2.503
Token Arab
Token ID4498
Feature activation+3.74e+00

Pos logit contributions
シャ+4.293
アル+4.028
+3.665
+3.551
habi+3.492
Neg logit contributions
rint-5.171
agents-3.920
iment-3.827
 Scher-3.720
 tamp-3.700
Token and
Token ID290
Feature activation+8.68e+00

Pos logit contributions
シャ+2.849
アル+2.731
+2.471
+2.345
habi+2.331
Neg logit contributions
rint-3.122
iment-2.314
agents-2.303
 Scher-2.283
 tamp-2.273
Token Dru
Token ID21295
Feature activation+3.46e+00

Pos logit contributions
シャ+5.816
アル+5.645
+5.069
+4.885
 Shepard+4.874
Neg logit contributions
rint-7.342
agents-5.701
iment-5.501
ventures-5.283
 advis-5.273
Tokenze
Token ID2736
Feature activation-4.51e-01

Pos logit contributions
シャ+2.420
アル+2.267
+2.074
habi+2.038
nikov+2.007
Neg logit contributions
rint-3.038
agents-2.384
iment-2.307
 tamp-2.281
 advis-2.251
Token
Token ID784
Feature activation-3.78e-01

Pos logit contributions
rint+0.350
agents+0.256
iment+0.255
 Scher+0.251
 tamp+0.250
Neg logit contributions
シャ-0.371
アル-0.356
-0.324
-0.311
habi-0.308
Token who
Token ID508
Feature activation+1.94e+00

Pos logit contributions
rint+0.317
agents+0.238
iment+0.238
 Scher+0.234
 tamp+0.233
Neg logit contributions
シャ-0.288
アル-0.275
-0.247
-0.236
habi-0.235
Token came
Token ID1625
Feature activation+1.90e+00

Pos logit contributions
シャ+1.409
アル+1.342
+1.194
+1.144
habi+1.141
Neg logit contributions
rint-1.683
agents-1.270
iment-1.264
 Scher-1.245
 tamp-1.218
Token Ruk
Token ID49037
Feature activation-1.94e+00

Pos logit contributions
rint+0.537
agents+0.410
iment+0.409
 tamp+0.402
 Scher+0.401
Neg logit contributions
シャ-0.441
アル-0.421
-0.377
-0.360
habi-0.355
Tokenmini
Token ID45313
Feature activation-4.14e+00

Pos logit contributions
rint+1.353
agents+0.948
iment+0.946
 Scher+0.929
 tamp+0.925
Neg logit contributions
シャ-1.751
アル-1.685
-1.546
-1.489
habi-1.478
Token Temple
Token ID10857
Feature activation+7.75e-01

Pos logit contributions
rint+2.758
iment+1.888
agents+1.880
 Scher+1.880
 tamp+1.845
Neg logit contributions
シャ-3.836
アル-3.714
-3.403
-3.269
habi-3.260
Token Here
Token ID3423
Feature activation+3.85e+00

Pos logit contributions
シャ+0.503
アル+0.477
+0.423
+0.402
habi+0.398
Neg logit contributions
rint-0.729
agents-0.572
iment-0.567
 Scher-0.556
 tamp-0.555
Token
Token ID198
Feature activation+2.63e+00

Pos logit contributions
シャ+3.053
アル+2.929
+2.655
+2.542
habi+2.531
Neg logit contributions
rint-3.052
agents-2.266
iment-2.256
 Scher-2.219
 tamp-2.211
Token
Token ID198
Feature activation+8.82e+00

Pos logit contributions
シャ+1.892
アル+1.795
+1.627
+1.567
NS+1.534
Neg logit contributions
rint-2.296
 pse-1.721
 fortun-1.714
agents-1.707
 tamp-1.697
Token6
Token ID21
Feature activation-1.35e+00

Pos logit contributions
シャ+5.405
アル+5.317
+4.410
+4.382
NS+4.269
Neg logit contributions
rint-8.155
 tamp-6.560
iment-6.502
agents-6.391
 Scher-6.379
Token.
Token ID13
Feature activation-1.47e+00

Pos logit contributions
rint+1.151
agents+0.868
iment+0.866
 Scher+0.852
 tamp+0.852
Neg logit contributions
シャ-1.009
アル-0.962
-0.867
-0.828
habi-0.817
Token B
Token ID347
Feature activation+1.60e-02

Pos logit contributions
rint+1.281
agents+0.974
iment+0.972
 Scher+0.961
 tamp+0.956
Neg logit contributions
シャ-1.074
アル-1.024
-0.917
-0.874
habi-0.866
Tokenala
Token ID6081
Feature activation+2.88e+00

Pos logit contributions
シャ+0.012
アル+0.011
+0.010
+0.010
habi+0.010
Neg logit contributions
rint-0.014
agents-0.010
iment-0.010
 Scher-0.010
 tamp-0.010
Token Han
Token ID9530
Feature activation-2.65e+00

Pos logit contributions
シャ+2.282
アル+2.176
+1.993
habi+1.911
+1.909
Neg logit contributions
rint-2.281
agents-1.686
iment-1.647
 tamp-1.629
 advis-1.623
Token separated
Token ID11266
Feature activation+3.47e+00

Pos logit contributions
シャ+2.240
アル+2.148
+1.927
+1.862
habi+1.822
Neg logit contributions
rint-2.495
iment-1.894
agents-1.875
 Scher-1.832
 tamp-1.813
Token with
Token ID351
Feature activation+6.46e+00

Pos logit contributions
シャ+2.277
アル+2.169
+1.923
+1.832
ンジ+1.803
Neg logit contributions
rint-3.207
agents-2.516
iment-2.486
 Scher-2.463
 tamp-2.433
Token white
Token ID2330
Feature activation+2.80e+00

Pos logit contributions
シャ+4.632
アル+4.336
+3.932
+3.806
 Shepard+3.763
Neg logit contributions
rint-5.322
agents-4.179
iment-4.117
 Scher-3.972
luaj-3.911
Token space
Token ID2272
Feature activation+2.29e+00

Pos logit contributions
シャ+2.204
アル+2.088
+1.917
+1.836
habi+1.787
Neg logit contributions
rint-2.240
agents-1.674
iment-1.670
 Scher-1.639
 tamp-1.607
Token or
Token ID393
Feature activation+3.62e+00

Pos logit contributions
シャ+1.694
アル+1.604
+1.455
+1.390
habi+1.358
Neg logit contributions
rint-1.938
agents-1.475
iment-1.467
 Scher-1.444
 tamp-1.431
Token '
Token ID705
Feature activation+8.72e+00

Pos logit contributions
シャ+2.630
アル+2.518
+2.267
+2.181
habi+2.116
Neg logit contributions
rint-3.060
agents-2.349
iment-2.336
 Scher-2.285
 tamp-2.241
Token:
Token ID25
Feature activation+9.56e+00

Pos logit contributions
シャ+6.003
アル+5.834
+5.193
+5.134
NS+5.080
Neg logit contributions
rint-7.046
iment-5.385
agents-5.320
 Scher-5.308
 tamp-5.275
Token',
Token ID3256
Feature activation+1.42e+00

Pos logit contributions
シャ+5.997
アル+5.916
+5.236
NS+5.087
ンジ+5.077
Neg logit contributions
rint-8.070
iment-6.393
agents-6.297
 tamp-6.215
 Scher-6.162
Token where
Token ID810
Feature activation+3.60e+00

Pos logit contributions
シャ+1.078
アル+1.034
+0.931
+0.895
habi+0.881
Neg logit contributions
rint-1.185
agents-0.893
iment-0.891
 Scher-0.872
 tamp-0.867
Token each
Token ID1123
Feature activation+6.32e-01

Pos logit contributions
シャ+2.495
アル+2.424
+2.124
+2.033
 Shepard+2.010
Neg logit contributions
rint-3.177
iment-2.461
agents-2.431
 Scher-2.376
 tamp-2.366
Token part
Token ID636
Feature activation-7.70e-01

Pos logit contributions
シャ+0.460
アル+0.440
+0.392
+0.374
habi+0.367
Neg logit contributions
rint-0.550
agents-0.421
iment-0.418
 Scher-0.412
 tamp-0.408
Token nominated
Token ID19332
Feature activation+3.43e+00

Pos logit contributions
シャ+0.037
アル+0.036
+0.032
+0.031
habi+0.030
Neg logit contributions
rint-0.041
agents-0.031
iment-0.031
 Scher-0.031
 tamp-0.030
Token Ken
Token ID7148
Feature activation-2.75e+00

Pos logit contributions
シャ+2.446
アル+2.356
+2.133
+2.014
 Shepard+2.012
Neg logit contributions
rint-2.936
iment-2.228
agents-2.220
 tamp-2.167
 Scher-2.130
Tokeny
Token ID88
Feature activation-2.46e+00

Pos logit contributions
rint+2.507
iment+1.944
agents+1.939
 Scher+1.918
 tamp+1.909
Neg logit contributions
シャ-1.895
アル-1.800
-1.596
-1.516
ンジ-1.481
Tokenatta
Token ID25014
Feature activation-9.87e-02

Pos logit contributions
rint+1.926
agents+1.415
iment+1.411
 Scher+1.390
 tamp+1.386
Neg logit contributions
シャ-2.003
アル-1.918
-1.742
-1.672
habi-1.659
Token as
Token ID355
Feature activation+6.01e+00

Pos logit contributions
rint+0.081
agents+0.061
iment+0.060
 Scher+0.059
 tamp+0.059
Neg logit contributions
シャ-0.077
アル-0.073
-0.066
-0.063
habi-0.063
Token its
Token ID663
Feature activation+9.37e+00

Pos logit contributions
シャ+4.076
アル+3.931
+3.526
+3.380
 Shepard+3.316
Neg logit contributions
rint-5.274
agents-4.074
iment-4.000
 Scher-3.890
 tamp-3.872
Token President
Token ID1992
Feature activation+4.55e+00

Pos logit contributions
シャ+5.692
アル+5.411
+4.992
 Shepard+4.704
+4.697
Neg logit contributions
rint-8.354
agents-6.570
iment-6.373
ertodd-6.328
 Scher-6.207
Token,
Token ID11
Feature activation+2.98e-01

Pos logit contributions
シャ+3.284
アル+3.118
+2.835
habi+2.719
+2.707
Neg logit contributions
rint-3.871
agents-2.977
iment-2.920
 tamp-2.905
 Scher-2.877
Token although
Token ID3584
Feature activation+3.27e+00

Pos logit contributions
シャ+0.233
アル+0.223
+0.202
+0.193
habi+0.192
Neg logit contributions
rint-0.244
agents-0.182
iment-0.181
 Scher-0.177
 tamp-0.177
Token the
Token ID262
Feature activation+2.47e+00

Pos logit contributions
シャ+2.257
アル+2.155
+1.944
+1.839
 Shepard+1.834
Neg logit contributions
rint-2.904
agents-2.224
iment-2.202
 tamp-2.149
 Scher-2.132
Token government
Token ID1230
Feature activation-1.81e+00

Pos logit contributions
シャ+1.713
アル+1.611
+1.453
+1.373
habi+1.355
Neg logit contributions
rint-2.221
agents-1.710
iment-1.680
 Scher-1.662
 tamp-1.653
Token only
Token ID691
Feature activation+3.36e-01

Pos logit contributions
シャ+0.378
アル+0.362
+0.327
+0.312
habi+0.310
Neg logit contributions
rint-0.395
iment-0.294
agents-0.294
 tamp-0.285
 Scher-0.285
Token 240
Token ID14956
Feature activation-1.50e-02

Pos logit contributions
シャ+0.265
アル+0.254
+0.229
+0.220
habi+0.218
Neg logit contributions
rint-0.272
agents-0.202
iment-0.201
 tamp-0.196
 Scher-0.196
Token m
Token ID285
Feature activation+6.11e+00

Pos logit contributions
rint+0.014
agents+0.011
iment+0.011
 Scher+0.011
 tamp+0.011
Neg logit contributions
シャ-0.010
アル-0.009
-0.008
-0.008
habi-0.008
Token (
Token ID357
Feature activation-1.37e-01

Pos logit contributions
シャ+4.319
アル+4.025
+3.692
habi+3.590
+3.539
Neg logit contributions
rint-5.217
agents-4.061
iment-3.964
 tamp-3.889
 Scher-3.886
Token790
Token ID37750
Feature activation+2.46e+00

Pos logit contributions
rint+0.108
agents+0.079
iment+0.079
 tamp+0.078
 Scher+0.078
Neg logit contributions
シャ-0.111
アル-0.106
-0.097
-0.093
habi-0.092
Token ft
Token ID10117
Feature activation+1.12e+01

Pos logit contributions
シャ+1.754
アル+1.667
+1.504
+1.424
habi+1.417
Neg logit contributions
rint-2.156
agents-1.663
iment-1.642
 Scher-1.630
 tamp-1.579
Token)
Token ID8
Feature activation+4.42e+00

Pos logit contributions
シャ+7.161
アル+6.672
+6.240
+6.035
 Shepard+5.913
Neg logit contributions
rint-9.419
agents-7.503
iment-7.240
 Scher-7.095
ertodd-7.002
Token from
Token ID422
Feature activation+1.98e+00

Pos logit contributions
シャ+3.610
アル+3.425
+3.137
+3.044
habi+3.040
Neg logit contributions
rint-3.333
agents-2.484
iment-2.417
 Scher-2.344
 tamp-2.315
Token the
Token ID262
Feature activation+2.92e+00

Pos logit contributions
シャ+1.463
アル+1.388
+1.254
+1.192
 Shepard+1.177
Neg logit contributions
rint-1.703
agents-1.291
iment-1.278
 tamp-1.249
 Scher-1.232
Token asteroid
Token ID27460
Feature activation+8.48e-01

Pos logit contributions
シャ+2.290
アル+2.161
+1.983
+1.892
 Shepard+1.883
Neg logit contributions
rint-2.350
agents-1.757
iment-1.724
 tamp-1.680
 Scher-1.662
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
シャ+0.644
アル+0.612
+0.551
+0.529
habi+0.523
Neg logit contributions
rint-0.713
agents-0.535
iment-0.526
 tamp-0.522
 Scher-0.516
TokenHz
Token ID7399
Feature activation+1.50e+00

Pos logit contributions
rint+1.811
agents+1.172
iment+1.167
 Scher+1.141
 tamp+1.134
Neg logit contributions
シャ-3.100
アル-2.997
-2.776
-2.684
habi-2.667
Token (
Token ID357
Feature activation-4.21e+00

Pos logit contributions
シャ+1.036
アル+0.985
+0.877
+0.833
habi+0.823
Neg logit contributions
rint-1.370
agents-1.056
iment-1.055
 Scher-1.041
 tamp-1.037
TokenTr
Token ID2898
Feature activation-3.96e+00

Pos logit contributions
rint+3.287
agents+2.415
iment+2.392
 Scher+2.352
 tamp+2.337
Neg logit contributions
シャ-3.410
アル-3.261
-2.971
habi-2.835
nikov-2.805
Tokenace
Token ID558
Feature activation-1.98e-02

Pos logit contributions
rint+2.671
iment+1.846
agents+1.824
 Scher+1.784
 tamp+1.773
Neg logit contributions
シャ-3.706
アル-3.549
-3.285
-3.152
ンジ-3.126
Token Free
Token ID3232
Feature activation-4.01e+00

Pos logit contributions
rint+0.016
agents+0.012
iment+0.012
 tamp+0.012
 Scher+0.011
Neg logit contributions
シャ-0.016
アル-0.015
-0.014
-0.013
habi-0.013
Token =
Token ID796
Feature activation+2.59e+00

Pos logit contributions
rint+2.425
iment+1.598
agents+1.589
 Scher+1.566
 tamp+1.553
Neg logit contributions
シャ-3.977
アル-3.845
-3.542
habi-3.426
-3.419
Token 80
Token ID4019
Feature activation+6.88e-01

Pos logit contributions
シャ+2.185
アル+2.091
+1.911
+1.842
habi+1.798
Neg logit contributions
rint-1.957
agents-1.411
iment-1.404
 Scher-1.378
 tamp-1.370
Token)
Token ID8
Feature activation+1.20e+00

Pos logit contributions
シャ+0.566
アル+0.542
+0.492
+0.472
habi+0.468
Neg logit contributions
rint-0.535
agents-0.392
iment-0.392
 Scher-0.386
 tamp-0.384
Token
Token ID198
Feature activation-1.77e+00

Pos logit contributions
シャ+0.942
アル+0.898
+0.815
+0.781
habi+0.763
Neg logit contributions
rint-0.986
agents-0.733
iment-0.732
 tamp-0.713
 Scher-0.709
Token
Token ID198
Feature activation+3.98e+00

Pos logit contributions
rint+1.372
agents+1.001
iment+0.998
 Scher+0.982
 tamp+0.980
Neg logit contributions
シャ-1.460
アル-1.399
-1.272
-1.221
habi-1.209
Token27
Token ID1983
Feature activation-3.14e+00

Pos logit contributions
シャ+3.121
アル+2.936
+2.689
+2.604
NS+2.534
Neg logit contributions
rint-3.231
iment-2.411
 tamp-2.403
agents-2.395
 Scher-2.342
Token the
Token ID262
Feature activation-6.72e-01

Pos logit contributions
rint+1.005
agents+0.753
iment+0.751
 Scher+0.737
 tamp+0.736
Neg logit contributions
シャ-0.940
アル-0.897
-0.812
-0.776
habi-0.769
Token temple
Token ID12505
Feature activation-1.19e+00

Pos logit contributions
rint+0.495
agents+0.358
iment+0.355
 tamp+0.345
 Scher+0.344
Neg logit contributions
シャ-0.581
アル-0.554
-0.510
-0.489
habi-0.486
Token include
Token ID2291
Feature activation-1.72e+00

Pos logit contributions
rint+1.075
agents+0.826
iment+0.824
 Scher+0.812
 tamp+0.810
Neg logit contributions
シャ-0.833
アル-0.793
-0.708
-0.672
habi-0.666
Token
Token ID198
Feature activation-2.49e+00

Pos logit contributions
rint+1.404
agents+1.046
iment+1.043
 Scher+1.027
 tamp+1.023
Neg logit contributions
シャ-1.347
アル-1.288
-1.165
-1.115
habi-1.104
Token
Token ID198
Feature activation-2.49e+00

Pos logit contributions
rint+1.899
agents+1.380
iment+1.378
 Scher+1.357
 tamp+1.349
Neg logit contributions
シャ-2.087
アル-2.003
-1.822
-1.749
habi-1.735
Token1
Token ID16
Feature activation+9.31e-01

Pos logit contributions
rint+2.120
agents+1.602
iment+1.597
 Scher+1.576
 tamp+1.567
Neg logit contributions
シャ-1.860
アル-1.775
-1.594
-1.522
habi-1.508
Token)
Token ID8
Feature activation-1.93e+00

Pos logit contributions
シャ+0.698
アル+0.667
+0.603
+0.575
habi+0.567
Neg logit contributions
rint-0.788
agents-0.597
iment-0.595
 tamp-0.582
 Scher-0.582
Token Shrine
Token ID30682
Feature activation-1.33e+00

Pos logit contributions
rint+1.631
agents+1.228
iment+1.225
 Scher+1.207
 tamp+1.203
Neg logit contributions
シャ-1.464
アル-1.399
-1.260
-1.203
habi-1.191
Token for
Token ID329
Feature activation-2.31e+00

Pos logit contributions
rint+1.088
agents+0.811
iment+0.808
 Scher+0.795
 tamp+0.793
Neg logit contributions
シャ-1.043
アル-0.996
-0.902
-0.863
habi-0.853
Token Cup
Token ID5454
Feature activation+3.10e-03

Pos logit contributions
rint+2.035
agents+1.551
iment+1.548
 Scher+1.530
 tamp+1.524
Neg logit contributions
シャ-1.670
アル-1.592
-1.424
-1.357
habi-1.343
Tokenid
Token ID312
Feature activation-7.77e+00

Pos logit contributions
シャ+0.002
アル+0.002
+0.002
+0.002
habi+0.002
Neg logit contributions
rint-0.003
agents-0.002
iment-0.002
 Scher-0.002
 tamp-0.002
Token idea
Token ID2126
Feature activation-1.94e+00

Pos logit contributions
rint+0.752
agents+0.562
iment+0.559
 Scher+0.550
 tamp+0.545
Neg logit contributions
シャ-0.702
アル-0.671
-0.605
-0.580
habi-0.574
Token is
Token ID318
Feature activation-1.44e+00

Pos logit contributions
rint+1.568
iment+1.165
agents+1.163
 Scher+1.150
 tamp+1.145
Neg logit contributions
シャ-1.532
アル-1.466
-1.325
-1.267
habi-1.257
Token at
Token ID379
Feature activation-6.63e-01

Pos logit contributions
rint+1.211
agents+0.912
iment+0.902
 Scher+0.891
 tamp+0.885
Neg logit contributions
シャ-1.094
アル-1.044
-0.943
-0.902
habi-0.893
Token the
Token ID262
Feature activation+2.60e-01

Pos logit contributions
rint+0.591
agents+0.452
iment+0.451
 Scher+0.444
 tamp+0.442
Neg logit contributions
シャ-0.471
アル-0.448
-0.401
-0.380
habi-0.377
Token heart
Token ID2612
Feature activation-7.77e-01

Pos logit contributions
シャ+0.213
アル+0.204
+0.185
+0.178
habi+0.176
Neg logit contributions
rint-0.203
agents-0.149
iment-0.148
 Scher-0.144
 tamp-0.144
Token of
Token ID286
Feature activation+1.28e+00

Pos logit contributions
rint+0.588
iment+0.433
agents+0.427
 Scher+0.424
 tamp+0.424
Neg logit contributions
シャ-0.648
アル-0.623
-0.567
-0.546
habi-0.542
Token her
Token ID607
Feature activation-2.36e+00

Pos logit contributions
シャ+0.951
アル+0.911
+0.814
+0.786
habi+0.772
Neg logit contributions
rint-1.094
iment-0.825
agents-0.823
 Scher-0.800
 tamp-0.798
Token hiring
Token ID12965
Feature activation+3.98e+00

Pos logit contributions
rint+1.915
agents+1.422
iment+1.421
 Scher+1.403
 tamp+1.398
Neg logit contributions
シャ-1.866
アル-1.786
-1.614
-1.544
habi-1.531
Token philosophy
Token ID8876
Feature activation-2.74e+00

Pos logit contributions
シャ+2.891
アル+2.763
+2.526
+2.416
habi+2.342
Neg logit contributions
rint-3.409
iment-2.552
agents-2.552
 tamp-2.476
 Scher-2.461
Token.
Token ID13
Feature activation-3.45e+00

Pos logit contributions
rint+2.146
iment+1.577
agents+1.574
 Scher+1.562
 tamp+1.552
Neg logit contributions
シャ-2.228
アル-2.130
-1.929
-1.851
habi-1.832
Token
Token ID564
Feature activation-1.73e+00

Pos logit contributions
rint+2.815
agents+2.120
 Scher+2.120
iment+2.120
 tamp+2.085
Neg logit contributions
シャ-2.666
アル-2.539
-2.300
habi-2.196
-2.192
Token two
Token ID734
Feature activation+7.47e-01

Pos logit contributions
シャ+1.555
アル+1.487
+1.348
+1.300
habi+1.299
Neg logit contributions
rint-1.698
iment-1.268
agents-1.259
 tamp-1.219
 Scher-1.204
Token areas
Token ID3006
Feature activation-7.89e-02

Pos logit contributions
シャ+0.567
アル+0.537
+0.492
+0.467
habi+0.459
Neg logit contributions
rint-0.626
iment-0.468
agents-0.467
 Scher-0.454
 tamp-0.453
Token that
Token ID326
Feature activation-6.01e-01

Pos logit contributions
rint+0.068
agents+0.051
iment+0.051
 tamp+0.050
 Scher+0.050
Neg logit contributions
シャ-0.058
アル-0.056
-0.050
-0.048
habi-0.047
Token hurt
Token ID5938
Feature activation+6.61e-01

Pos logit contributions
rint+0.514
iment+0.387
agents+0.387
 Scher+0.379
 tamp+0.378
Neg logit contributions
シャ-0.448
アル-0.427
-0.385
-0.368
habi-0.363
Token the
Token ID262
Feature activation-1.46e+00

Pos logit contributions
シャ+0.478
アル+0.455
+0.411
+0.391
habi+0.387
Neg logit contributions
rint-0.574
agents-0.435
iment-0.434
 tamp-0.426
 Scher-0.424
Token Broncos
Token ID16582
Feature activation+7.81e-02

Pos logit contributions
rint+1.364
agents+1.061
iment+1.059
 Scher+1.047
 tamp+1.043
Neg logit contributions
シャ-0.968
アル-0.919
-0.813
-0.771
habi-0.762
Token in
Token ID287
Feature activation+2.40e-02

Pos logit contributions
シャ+0.059
アル+0.056
+0.051
+0.049
habi+0.048
Neg logit contributions
rint-0.066
agents-0.050
iment-0.049
 Scher-0.048
 tamp-0.048
Token 2016
Token ID1584
Feature activation+2.87e+00

Pos logit contributions
シャ+0.018
アル+0.017
+0.016
+0.015
habi+0.015
Neg logit contributions
rint-0.020
agents-0.015
iment-0.015
 Scher-0.015
 tamp-0.015
Token as
Token ID355
Feature activation-7.06e-01

Pos logit contributions
シャ+2.065
アル+1.970
+1.788
+1.709
habi+1.679
Neg logit contributions
rint-2.471
agents-1.899
iment-1.891
 tamp-1.854
 Scher-1.844
Token they
Token ID484
Feature activation-1.03e+00

Pos logit contributions
rint+0.596
iment+0.448
agents+0.448
 Scher+0.440
 tamp+0.439
Neg logit contributions
シャ-0.534
アル-0.510
-0.459
-0.439
habi-0.434
Token missed
Token ID6825
Feature activation+5.75e-02

Pos logit contributions
rint+0.856
agents+0.642
iment+0.639
 Scher+0.629
 tamp+0.624
Neg logit contributions
シャ-0.792
アル-0.757
-0.683
-0.654
habi-0.648
Token
Token ID251
Feature activation+7.00e-01

Pos logit contributions
シャ+0.620
アル+0.589
+0.528
+0.502
habi+0.495
Neg logit contributions
rint-0.769
agents-0.592
iment-0.587
 Scher-0.583
 tamp-0.578
Token asked
Token ID1965
Feature activation+2.22e+00

Pos logit contributions
シャ+0.526
アル+0.504
+0.453
+0.434
habi+0.427
Neg logit contributions
rint-0.593
agents-0.449
iment-0.444
 Scher-0.436
 tamp-0.436
Token Libertarian
Token ID30226
Feature activation-2.03e+00

Pos logit contributions
シャ+1.634
アル+1.575
+1.415
+1.362
 Shepard+1.355
Neg logit contributions
rint-1.865
agents-1.435
iment-1.414
 tamp-1.364
 Scher-1.361
Token 2016
Token ID1584
Feature activation+2.97e-01

Pos logit contributions
rint+1.727
agents+1.307
iment+1.305
 Scher+1.285
 tamp+1.280
Neg logit contributions
シャ-1.520
アル-1.453
-1.308
-1.254
habi-1.234
Token contender
Token ID24190
Feature activation-1.87e+00

Pos logit contributions
シャ+0.230
アル+0.221
+0.201
+0.194
habi+0.189
Neg logit contributions
rint-0.243
agents-0.184
iment-0.182
 Scher-0.179
 tamp-0.178
Token Gary
Token ID10936
Feature activation+3.74e-01

Pos logit contributions
rint+1.648
agents+1.259
iment+1.257
 Scher+1.241
 tamp+1.236
Neg logit contributions
シャ-1.338
アル-1.274
-1.139
-1.085
habi-1.074
Token Johnson
Token ID5030
Feature activation+3.12e-01

Pos logit contributions
シャ+0.276
アル+0.264
+0.241
+0.228
 Shepard+0.223
Neg logit contributions
rint-0.318
agents-0.242
iment-0.238
 tamp-0.236
 advis-0.232
Token on
Token ID319
Feature activation+3.07e-01

Pos logit contributions
シャ+0.241
アル+0.230
+0.209
+0.200
habi+0.197
Neg logit contributions
rint-0.258
agents-0.194
iment-0.193
 Scher-0.189
 tamp-0.189
Token MSNBC
Token ID22128
Feature activation+9.10e-01

Pos logit contributions
シャ+0.236
アル+0.226
+0.204
+0.196
habi+0.193
Neg logit contributions
rint-0.254
agents-0.192
iment-0.190
 Scher-0.186
 tamp-0.186
Token this
Token ID428
Feature activation+3.13e-01

Pos logit contributions
シャ+0.676
アル+0.645
+0.582
+0.559
habi+0.549
Neg logit contributions
rint-0.777
agents-0.591
iment-0.587
 Scher-0.577
 tamp-0.577
Token morning
Token ID3329
Feature activation-9.05e-01

Pos logit contributions
シャ+0.154
アル+0.142
+0.121
+0.112
habi+0.110
Neg logit contributions
rint-0.347
agents-0.283
iment-0.281
 Scher-0.278
 tamp-0.277
Token it
Token ID340
Feature activation-2.59e-01

Pos logit contributions
rint+3.066
iment+2.224
 Scher+2.186
agents+2.173
 tamp+2.166
Neg logit contributions
シャ-3.608
アル-3.494
-3.163
-3.027
habi-3.024
Token's
Token ID338
Feature activation-1.64e+00

Pos logit contributions
rint+0.212
agents+0.158
iment+0.156
 tamp+0.154
 Scher+0.154
Neg logit contributions
シャ-0.203
アル-0.194
-0.176
-0.170
habi-0.166
Token not
Token ID407
Feature activation-7.23e-01

Pos logit contributions
rint+1.272
agents+0.929
iment+0.927
 Scher+0.913
 tamp+0.909
Neg logit contributions
シャ-1.356
アル-1.300
-1.182
-1.134
habi-1.124
Token what
Token ID644
Feature activation+1.55e+00

Pos logit contributions
rint+0.638
agents+0.492
iment+0.484
 Scher+0.477
 tamp+0.477
Neg logit contributions
シャ-0.518
アル-0.490
-0.443
-0.421
habi-0.416
Token you
Token ID345
Feature activation-8.04e-01

Pos logit contributions
シャ+1.048
アル+1.001
+0.899
+0.857
 Shepard+0.835
Neg logit contributions
rint-1.418
agents-1.084
iment-1.082
 tamp-1.064
 Scher-1.051
Token think
Token ID892
Feature activation-7.07e-01

Pos logit contributions
rint+0.633
agents+0.463
iment+0.460
 Scher+0.455
 tamp+0.453
Neg logit contributions
シャ-0.655
アル-0.626
-0.571
-0.546
habi-0.539
Token it
Token ID340
Feature activation+3.65e-01

Pos logit contributions
rint+0.568
agents+0.420
iment+0.414
 Scher+0.408
 tamp+0.408
Neg logit contributions
シャ-0.563
アル-0.537
-0.489
-0.469
habi-0.463
Token is
Token ID318
Feature activation+3.70e+00

Pos logit contributions
シャ+0.258
アル+0.243
+0.220
+0.210
habi+0.207
Neg logit contributions
rint-0.325
agents-0.249
iment-0.247
 tamp-0.242
 Scher-0.242
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
シャ+2.740
アル+2.617
+2.353
+2.236
habi+2.197
Neg logit contributions
rint-3.140
agents-2.401
iment-2.380
 tamp-2.353
 Scher-2.336
Token which
Token ID543
Feature activation-3.20e+00

Pos logit contributions
シャ+1.067
アル+1.020
+0.912
+0.877
habi+0.868
Neg logit contributions
rint-1.194
agents-0.904
iment-0.892
 tamp-0.870
 Scher-0.870
Token I
Token ID314
Feature activation-1.28e+00

Pos logit contributions
rint+2.512
iment+1.883
 Scher+1.863
agents+1.852
 tamp+1.841
Neg logit contributions
シャ-2.565
アル-2.460
-2.196
habi-2.139
-2.136
Token't
Token ID470
Feature activation-8.47e-01

Pos logit contributions
シャ+2.438
アル+2.338
+2.104
+2.055
NS+1.971
Neg logit contributions
rint-2.801
agents-2.132
 fortun-2.118
 tamp-2.084
 advis-2.081
Token what
Token ID644
Feature activation+7.41e-01

Pos logit contributions
rint+0.690
agents+0.518
iment+0.508
 Scher+0.500
 tamp+0.499
Neg logit contributions
シャ-0.661
アル-0.634
-0.575
-0.549
habi-0.542
Token anyone
Token ID2687
Feature activation-5.92e-01

Pos logit contributions
シャ+0.533
アル+0.510
+0.458
+0.433
 Shepard+0.427
Neg logit contributions
rint-0.651
iment-0.494
agents-0.493
 tamp-0.483
 Scher-0.479
Token would
Token ID561
Feature activation-4.03e+00

Pos logit contributions
rint+0.461
agents+0.339
iment+0.335
 Scher+0.330
 tamp+0.326
Neg logit contributions
シャ-0.486
アル-0.464
-0.421
-0.405
habi-0.402
Token expect
Token ID1607
Feature activation-2.70e+00

Pos logit contributions
rint+3.256
iment+2.425
 tamp+2.415
agents+2.389
 Scher+2.364
Neg logit contributions
シャ-3.198
アル-3.071
-2.805
-2.643
ンジ-2.615
Token.
Token ID13
Feature activation-1.57e+00

Pos logit contributions
rint+2.156
iment+1.595
agents+1.590
 Scher+1.572
 tamp+1.560
Neg logit contributions
シャ-2.164
アル-2.074
-1.872
-1.797
habi-1.782
Token
Token ID198
Feature activation-9.64e-01

Pos logit contributions
rint+1.205
agents+0.883
iment+0.883
 Scher+0.877
 tamp+0.861
Neg logit contributions
シャ-1.305
アル-1.247
-1.135
-1.089
habi-1.088
Token
Token ID198
Feature activation+5.89e-02

Pos logit contributions
rint+0.769
agents+0.564
iment+0.561
 tamp+0.554
 Scher+0.552
Neg logit contributions
シャ-0.776
アル-0.743
-0.674
-0.647
habi-0.640
Token"
Token ID1
Feature activation-7.68e-01

Pos logit contributions
シャ+0.047
アル+0.045
+0.041
+0.039
habi+0.038
Neg logit contributions
rint-0.048
iment-0.035
 tamp-0.035
agents-0.035
 Scher-0.035
TokenI
Token ID40
Feature activation-8.10e-01

Pos logit contributions
rint+0.666
iment+0.502
agents+0.502
 tamp+0.498
 Scher+0.496
Neg logit contributions
シャ-0.566
アル-0.541
-0.485
-0.463
habi-0.454
Token've
Token ID1053
Feature activation-2.06e+00

Pos logit contributions
rint+0.683
agents+0.516
iment+0.510
 Scher+0.503
 tamp+0.500
Neg logit contributions
シャ-0.610
アル-0.583
-0.524
-0.503
habi-0.498
Token in
Token ID287
Feature activation-1.44e+00

Pos logit contributions
rint+1.259
agents+0.935
iment+0.933
 Scher+0.920
 tamp+0.916
Neg logit contributions
シャ-1.224
アル-1.171
-1.059
-1.014
habi-1.005
Token Britain
Token ID5491
Feature activation-1.86e+00

Pos logit contributions
rint+1.193
agents+0.891
iment+0.889
 Scher+0.877
 tamp+0.874
Neg logit contributions
シャ-1.119
アル-1.070
-0.966
-0.923
habi-0.915
Token before
Token ID878
Feature activation+1.66e+00

Pos logit contributions
rint+1.521
agents+1.133
iment+1.131
 Scher+1.115
 tamp+1.110
Neg logit contributions
シャ-1.452
アル-1.389
-1.255
-1.201
habi-1.190
Token a
Token ID257
Feature activation-2.05e+00

Pos logit contributions
シャ+1.243
アル+1.202
+1.082
+1.026
habi+1.021
Neg logit contributions
rint-1.396
iment-1.044
agents-1.042
 Scher-1.015
 tamp-0.997
Token set
Token ID900
Feature activation+2.39e-02

Pos logit contributions
rint+1.629
agents+1.201
iment+1.199
 Scher+1.182
 tamp+1.178
Neg logit contributions
シャ-1.646
アル-1.576
-1.428
-1.369
habi-1.357
Token date
Token ID3128
Feature activation-1.75e+00

Pos logit contributions
シャ+0.018
アル+0.018
+0.016
+0.015
habi+0.015
Neg logit contributions
rint-0.020
agents-0.015
iment-0.015
 Scher-0.015
 tamp-0.015
Token.
Token ID13
Feature activation-1.65e-01

Pos logit contributions
rint+1.410
agents+1.046
iment+1.044
 Scher+1.029
 tamp+1.026
Neg logit contributions
シャ-1.385
アル-1.325
-1.199
-1.148
habi-1.138
Token
Token ID198
Feature activation-2.04e+00

Pos logit contributions
rint+0.134
agents+0.099
iment+0.099
 tamp+0.097
 Scher+0.097
Neg logit contributions
シャ-0.130
アル-0.125
-0.113
-0.109
habi-0.107
Token
Token ID198
Feature activation-1.13e+00

Pos logit contributions
rint+1.572
agents+1.145
iment+1.143
 Scher+1.125
 tamp+1.121
Neg logit contributions
シャ-1.695
アル-1.625
-1.478
-1.419
habi-1.406
TokenAlthough
Token ID7003
Feature activation+1.13e+00

Pos logit contributions
rint+0.961
iment+0.723
agents+0.722
 Scher+0.713
 tamp+0.712
Neg logit contributions
シャ-0.842
アル-0.806
-0.724
-0.692
habi-0.682
Token its
Token ID663
Feature activation-7.77e-02

Pos logit contributions
シャ+0.872
アル+0.840
+0.759
+0.722
habi+0.715
Neg logit contributions
rint-0.927
iment-0.685
agents-0.682
 Scher-0.669
 tamp-0.666
Token distortion
Token ID25100
Feature activation-3.26e-01

Pos logit contributions
シャ+0.568
アル+0.534
+0.484
+0.460
habi+0.455
Neg logit contributions
rint-0.675
agents-0.522
iment-0.510
 Scher-0.500
 tamp-0.491
Token,
Token ID11
Feature activation-3.45e+00

Pos logit contributions
rint+0.267
agents+0.200
iment+0.197
 Scher+0.195
 tamp+0.192
Neg logit contributions
シャ-0.254
アル-0.241
-0.221
-0.209
habi-0.207
Token even
Token ID772
Feature activation-2.08e+00

Pos logit contributions
rint+2.768
iment+2.054
agents+2.037
 tamp+2.034
 Scher+2.030
Neg logit contributions
シャ-2.735
アル-2.630
-2.378
-2.274
habi-2.248
Token at
Token ID379
Feature activation-7.77e-01

Pos logit contributions
rint+1.707
agents+1.274
iment+1.272
 Scher+1.254
 tamp+1.249
Neg logit contributions
シャ-1.618
アル-1.547
-1.397
-1.336
habi-1.324
Token high
Token ID1029
Feature activation-3.40e-01

Pos logit contributions
rint+0.624
agents+0.462
iment+0.461
 Scher+0.453
 tamp+0.451
Neg logit contributions
シャ-0.620
アル-0.593
-0.537
-0.514
habi-0.510
Token volumes
Token ID15343
Feature activation-1.57e+00

Pos logit contributions
rint+0.308
agents+0.237
iment+0.236
 Scher+0.232
 tamp+0.230
Neg logit contributions
シャ-0.237
アル-0.226
-0.201
-0.191
habi-0.189
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
rint+1.322
agents+0.993
iment+0.991
 Scher+0.978
 tamp+0.974
Neg logit contributions
シャ-1.199
アル-1.145
-1.032
-0.986
habi-0.976
Token It
Token ID632
Feature activation-1.09e+00

Pos logit contributions
rint+1.140
agents+0.855
iment+0.851
 Scher+0.837
 tamp+0.837
Neg logit contributions
シャ-1.041
アル-0.995
-0.898
-0.857
habi-0.846
Token is
Token ID318
Feature activation+5.97e-01

Pos logit contributions
rint+0.921
agents+0.693
iment+0.691
 Scher+0.681
 tamp+0.678
Neg logit contributions
シャ-0.831
アル-0.793
-0.714
-0.682
habi-0.676
Token not
Token ID407
Feature activation+4.07e-01

Pos logit contributions
シャ+0.464
アル+0.441
+0.400
+0.383
habi+0.379
Neg logit contributions
rint-0.490
agents-0.370
iment-0.364
 Scher-0.356
 tamp-0.353
Token too
Token ID1165
Feature activation-2.03e+00

Pos logit contributions
シャ+0.313
アル+0.298
+0.270
+0.258
habi+0.256
Neg logit contributions
rint-0.337
agents-0.255
iment-0.251
 Scher-0.246
 tamp-0.243
Token it
Token ID340
Feature activation-6.26e-01

Pos logit contributions
rint+1.668
agents+1.233
iment+1.230
 Scher+1.212
 tamp+1.207
Neg logit contributions
シャ-1.666
アル-1.596
-1.445
-1.385
habi-1.372
Token will
Token ID481
Feature activation-7.02e-01

Pos logit contributions
rint+0.540
agents+0.408
iment+0.408
 Scher+0.402
 tamp+0.398
Neg logit contributions
シャ-0.462
アル-0.440
-0.396
-0.378
habi-0.374
Token remove
Token ID4781
Feature activation-3.80e+00

Pos logit contributions
rint+0.577
agents+0.432
iment+0.429
 Scher+0.424
 tamp+0.419
Neg logit contributions
シャ-0.545
アル-0.520
-0.470
-0.450
habi-0.446
Token photos
Token ID5205
Feature activation-1.51e+00

Pos logit contributions
rint+3.039
agents+2.261
 tamp+2.247
iment+2.240
 Scher+2.222
Neg logit contributions
シャ-3.045
アル-2.913
-2.624
-2.508
habi-2.485
Token,
Token ID11
Feature activation-2.71e-01

Pos logit contributions
rint+1.219
agents+0.903
iment+0.902
 Scher+0.889
 tamp+0.886
Neg logit contributions
シャ-1.204
アル-1.152
-1.043
-0.998
habi-0.989
Token videos
Token ID5861
Feature activation-2.08e+00

Pos logit contributions
rint+0.198
agents+0.141
iment+0.141
 Scher+0.138
 tamp+0.137
Neg logit contributions
シャ-0.235
アル-0.226
-0.207
-0.199
habi-0.197
Token and
Token ID290
Feature activation-1.43e-01

Pos logit contributions
rint+1.579
iment+1.159
agents+1.158
 tamp+1.140
 Scher+1.140
Neg logit contributions
シャ-1.744
アル-1.681
-1.516
-1.456
ンジ-1.439
Token links
Token ID6117
Feature activation+1.28e+00

Pos logit contributions
rint+0.117
agents+0.087
iment+0.087
 Scher+0.085
 tamp+0.084
Neg logit contributions
シャ-0.112
アル-0.107
-0.097
-0.093
habi-0.092
Token with
Token ID351
Feature activation+2.85e+00

Pos logit contributions
シャ+0.955
アル+0.913
+0.829
+0.794
habi+0.792
Neg logit contributions
rint-1.079
agents-0.813
iment-0.802
 Scher-0.792
 tamp-0.783
Token explicit
Token ID7952
Feature activation+1.02e+00

Pos logit contributions
シャ+2.035
アル+1.941
+1.747
+1.668
 Shepard+1.661
Neg logit contributions
rint-2.469
iment-1.878
agents-1.867
 Scher-1.821
 tamp-1.786
Token content
Token ID2695
Feature activation+1.20e+00

Pos logit contributions
シャ+0.797
アル+0.761
+0.690
+0.661
habi+0.659
Neg logit contributions
rint-0.824
agents-0.604
iment-0.602
 Scher-0.588
 tamp-0.577
Token fossil
Token ID12584
Feature activation-3.59e+00

Pos logit contributions
rint+0.882
agents+0.649
iment+0.639
 Scher+0.632
 tamp+0.627
Neg logit contributions
シャ-0.934
アル-0.893
-0.811
-0.781
habi-0.773
Token.
Token ID13
Feature activation-1.69e+00

Pos logit contributions
rint+2.935
iment+2.205
agents+2.179
 Scher+2.160
 tamp+2.157
Neg logit contributions
シャ-2.805
アル-2.685
-2.434
-2.322
habi-2.299
Token
Token ID198
Feature activation-1.67e+00

Pos logit contributions
rint+1.354
iment+1.006
agents+1.005
 Scher+0.994
 tamp+0.984
Neg logit contributions
シャ-1.341
アル-1.282
-1.161
-1.110
habi-1.105
Token
Token ID198
Feature activation-1.29e+00

Pos logit contributions
rint+1.294
agents+0.944
iment+0.941
 Scher+0.927
 tamp+0.925
Neg logit contributions
シャ-1.373
アル-1.315
-1.196
-1.148
habi-1.137
TokenAs
Token ID1722
Feature activation-2.78e+00

Pos logit contributions
rint+1.111
agents+0.839
iment+0.837
 Scher+0.827
 tamp+0.826
Neg logit contributions
シャ-0.956
アル-0.913
-0.819
-0.784
habi-0.772
Token soon
Token ID2582
Feature activation-1.11e+01

Pos logit contributions
rint+2.275
iment+1.697
agents+1.691
 Scher+1.670
 tamp+1.661
Neg logit contributions
シャ-2.182
アル-2.088
-1.888
-1.800
habi-1.790
Token as
Token ID355
Feature activation-3.11e+00

Pos logit contributions
rint+6.342
agents+4.401
iment+4.378
 Scher+4.351
 tamp+4.314
Neg logit contributions
シャ-10.583
アル-10.167
-9.307
habi-9.265
-9.182
Token he
Token ID339
Feature activation-3.95e+00

Pos logit contributions
rint+2.551
iment+1.914
agents+1.904
 Scher+1.892
 tamp+1.883
Neg logit contributions
シャ-2.417
アル-2.314
-2.094
-1.997
habi-1.987
Token popped
Token ID22928
Feature activation-3.46e+00

Pos logit contributions
rint+3.211
iment+2.400
agents+2.391
 tamp+2.378
 Scher+2.370
Neg logit contributions
シャ-3.104
アル-2.982
-2.696
-2.568
habi-2.536
Token his
Token ID465
Feature activation+2.14e+00

Pos logit contributions
rint+2.702
iment+1.995
agents+1.978
 Scher+1.962
 tamp+1.956
Neg logit contributions
シャ-2.829
アル-2.715
-2.456
-2.351
habi-2.341
Token slide
Token ID10649
Feature activation+1.31e+00

Pos logit contributions
シャ+1.443
アル+1.364
+1.224
+1.165
habi+1.150
Neg logit contributions
rint-1.940
agents-1.503
iment-1.486
 Scher-1.464
 tamp-1.429
Token view
Token ID1570
Feature activation-1.20e-03

Pos logit contributions
シャ+0.694
アル+0.666
+0.593
+0.572
 Shepard+0.569
Neg logit contributions
rint-0.720
iment-0.537
agents-0.536
 Scher-0.526
 tamp-0.519
Token.
Token ID13
Feature activation-2.50e+00

Pos logit contributions
rint+0.001
agents+0.001
iment+0.001
 Scher+0.001
 tamp+0.001
Neg logit contributions
シャ-0.001
アル-0.001
-0.001
-0.001
habi-0.001
Token
Token ID198
Feature activation+1.04e+00

Pos logit contributions
rint+2.190
agents+1.668
iment+1.665
 Scher+1.641
 tamp+1.633
Neg logit contributions
シャ-1.815
アル-1.731
-1.552
-1.478
habi-1.463
Token
Token ID198
Feature activation-1.43e+00

Pos logit contributions
シャ+0.789
アル+0.754
+0.683
+0.654
habi+0.640
Neg logit contributions
rint-0.873
agents-0.645
 tamp-0.640
iment-0.639
 advis-0.629
TokenCase
Token ID20448
Feature activation+4.70e+00

Pos logit contributions
rint+1.223
iment+0.925
agents+0.924
 Scher+0.918
 tamp+0.915
Neg logit contributions
シャ-1.063
アル-1.018
-0.914
-0.874
habi-0.864
Token in
Token ID287
Feature activation-9.18e+00

Pos logit contributions
シャ+3.497
アル+3.429
+3.108
nikov+2.960
+2.949
Neg logit contributions
rint-3.723
agents-2.899
iment-2.771
 tamp-2.735
 Scher-2.707
Token point
Token ID966
Feature activation+1.29e+00

Pos logit contributions
rint+4.880
 Scher+3.341
 tamp+3.315
 advis+3.240
iment+3.147
Neg logit contributions
シャ-9.466
アル-9.199
-8.450
ンジ-8.318
-8.294
Token:
Token ID25
Feature activation-7.59e-01

Pos logit contributions
シャ+1.106
アル+1.062
+0.970
+0.931
habi+0.924
Neg logit contributions
rint-0.960
agents-0.690
iment-0.689
 Scher-0.677
 tamp-0.674
Token Gen
Token ID5215
Feature activation+7.14e-01

Pos logit contributions
rint+0.642
iment+0.486
agents+0.485
 Scher+0.473
 tamp+0.472
Neg logit contributions
シャ-0.568
アル-0.542
-0.488
-0.466
habi-0.464
Token.
Token ID13
Feature activation-1.80e+00

Pos logit contributions
シャ+0.540
アル+0.516
+0.464
+0.443
habi+0.439
Neg logit contributions
rint-0.602
agents-0.453
iment-0.452
 Scher-0.446
 tamp-0.444
Token Martin
Token ID5780
Feature activation-2.80e-01

Pos logit contributions
rint+1.640
agents+1.266
iment+1.265
 Scher+1.257
 tamp+1.244
Neg logit contributions
シャ-1.238
アル-1.176
-1.046
-0.996
habi-0.985
Token
Token ID198
Feature activation-4.33e+00

Pos logit contributions
rint+1.428
agents+1.043
iment+1.039
 tamp+1.022
 Scher+1.021
Neg logit contributions
シャ-1.504
アル-1.441
-1.310
-1.258
habi-1.244
TokenTo
Token ID2514
Feature activation-6.25e-01

Pos logit contributions
rint+3.652
agents+2.767
iment+2.761
 Scher+2.728
 tamp+2.697
Neg logit contributions
シャ-3.227
アル-3.081
-2.773
habi-2.639
-2.632
Token grunt
Token ID45836
Feature activation-4.62e+00

Pos logit contributions
rint+0.525
agents+0.395
iment+0.391
 Scher+0.386
 tamp+0.381
Neg logit contributions
シャ-0.477
アル-0.454
-0.410
-0.393
habi-0.388
Token and
Token ID290
Feature activation-5.64e+00

Pos logit contributions
rint+3.664
iment+2.741
agents+2.701
 tamp+2.686
 Scher+2.684
Neg logit contributions
シャ-3.684
アル-3.546
-3.204
-3.054
habi-3.033
Token sweat
Token ID15488
Feature activation+3.24e-01

Pos logit contributions
rint+4.646
 tamp+3.549
iment+3.523
 Scher+3.498
agents+3.435
Neg logit contributions
シャ-4.304
アル-4.127
-3.733
-3.506
habi-3.477
Token under
Token ID739
Feature activation-1.04e+01

Pos logit contributions
シャ+0.247
アル+0.235
+0.213
+0.204
habi+0.202
Neg logit contributions
rint-0.270
agents-0.203
iment-0.200
 Scher-0.198
 tamp-0.196
Token a
Token ID257
Feature activation-6.12e+00

Pos logit contributions
rint+7.784
iment+5.921
 tamp+5.908
 advis+5.851
 Scher+5.781
Neg logit contributions
シャ-8.075
アル-7.829
-7.115
-6.627
ンジ-6.617
Token weary
Token ID34730
Feature activation-5.84e+00

Pos logit contributions
rint+5.079
 tamp+3.863
iment+3.854
 Scher+3.797
agents+3.753
Neg logit contributions
シャ-4.594
アル-4.398
-3.965
-3.733
habi-3.701
Token life
Token ID1204
Feature activation-6.03e+00

Pos logit contributions
rint+4.855
iment+3.718
 tamp+3.677
 Scher+3.646
agents+3.615
Neg logit contributions
シャ-4.360
アル-4.197
-3.770
-3.569
ンジ-3.519
Token.
Token ID764
Feature activation+8.20e-01

Pos logit contributions
rint+5.048
iment+3.882
 tamp+3.826
agents+3.804
 Scher+3.795
Neg logit contributions
シャ-4.484
アル-4.329
-3.877
-3.642
ンジ-3.638
Token.
Token ID764
Feature activation+1.58e+00

Pos logit contributions
シャ+0.545
アル+0.517
+0.459
+0.436
habi+0.424
Neg logit contributions
rint-0.770
agents-0.596
iment-0.592
 tamp-0.583
 Scher-0.581
Token the
Token ID262
Feature activation+8.25e-01

Pos logit contributions
rint+2.086
agents+1.536
iment+1.532
 Scher+1.514
 tamp+1.503
Neg logit contributions
シャ-2.174
アル-2.085
-1.892
-1.814
habi-1.799
Token arguments
Token ID7159
Feature activation-9.71e+00

Pos logit contributions
シャ+0.678
アル+0.650
+0.595
+0.570
habi+0.569
Neg logit contributions
rint-0.634
iment-0.459
agents-0.453
 Scher-0.451
 tamp-0.450
Token ap
Token ID2471
Feature activation-7.70e+00

Pos logit contributions
rint+5.326
 tamp+3.419
iment+3.336
 advis+3.308
agents+3.294
Neg logit contributions
シャ-9.905
アル-9.481
-8.913
-8.633
ンジ-8.578
Token =
Token ID796
Feature activation-6.50e+00

Pos logit contributions
rint+4.105
iment+2.584
agents+2.490
 Scher+2.446
 tamp+2.312
Neg logit contributions
シャ-8.062
アル-7.737
-7.263
habi-6.951
ンジ-6.941
Token arg
Token ID1822
Feature activation-5.79e+00

Pos logit contributions
rint+3.316
iment+1.998
agents+1.996
 Scher+1.953
 tamp+1.950
Neg logit contributions
シャ-7.045
アル-6.801
-6.333
-6.121
ンジ-6.071
Tokenparse
Token ID29572
Feature activation-9.69e+00

Pos logit contributions
rint+3.590
agents+2.369
iment+2.302
 Scher+2.252
 tamp+2.230
Neg logit contributions
シャ-5.715
アル-5.534
-5.083
-4.928
ンジ-4.890
Token.
Token ID13
Feature activation-1.04e+01

Pos logit contributions
rint+6.356
iment+4.681
 Scher+4.676
agents+4.545
(*+4.479
Neg logit contributions
シャ-8.462
アル-8.172
-7.470
ンジ-7.308
habi-7.276
TokenArg
Token ID28100
Feature activation-7.53e+00

Pos logit contributions
rint+7.104
Token+5.545
agents+5.523
iment+5.458
(*+5.261
Neg logit contributions
シャ-8.404
アル-8.074
habi-7.324
-7.319
 Shepard-7.131
Tokenument
Token ID1713
Feature activation-5.56e+00

Pos logit contributions
rint+5.643
iment+4.441
agents+4.230
 Scher+4.015
Token+3.966
Neg logit contributions
シャ-6.137
アル-5.951
-5.399
ンジ-5.206
habi-5.069
TokenParser
Token ID46677
Feature activation-2.98e+00

Pos logit contributions
rint+3.576
agents+2.517
iment+2.508
 Scher+2.493
Token+2.411
Neg logit contributions
シャ-5.162
アル-4.927
-4.536
habi-4.454
ンジ-4.382
Token()
Token ID3419
Feature activation-1.32e+01

Pos logit contributions
rint+2.027
iment+1.432
agents+1.424
 Scher+1.406
 tamp+1.375
Neg logit contributions
シャ-2.720
アル-2.612
-2.402
habi-2.303
-2.292
Token NET
Token ID30502
Feature activation+1.32e+00

Pos logit contributions
シャ+1.185
アル+1.139
+1.033
+0.999
ンジ+0.978
Neg logit contributions
rint-1.089
agents-0.793
iment-0.787
 Scher-0.771
 tamp-0.768
Token CALL
Token ID42815
Feature activation+6.85e+00

Pos logit contributions
シャ+0.926
アル+0.882
+0.789
+0.762
ンジ+0.733
Neg logit contributions
rint-1.195
agents-0.916
iment-0.902
 tamp-0.891
 Scher-0.880
Token UPDATE
Token ID35717
Feature activation+4.89e+00

Pos logit contributions
シャ+3.512
アル+3.338
+2.842
+2.681
NS+2.615
Neg logit contributions
rint-7.232
agents-5.856
iment-5.786
 Scher-5.671
 tamp-5.647
Token:
Token ID25
Feature activation+8.48e+00

Pos logit contributions
シャ+3.609
アル+3.440
+3.147
+3.034
 Shepard+2.979
Neg logit contributions
rint-4.176
agents-3.098
 fortun-3.029
iment-3.020
 advis-2.980
Token 05
Token ID8870
Feature activation-9.57e+00

Pos logit contributions
シャ+3.875
アル+3.813
+3.126
 Shepard+2.906
+2.772
Neg logit contributions
rint-9.286
agents-7.552
iment-7.530
 tamp-7.342
 Scher-7.296
Token50
Token ID1120
Feature activation-8.42e+00

Pos logit contributions
rint+4.795
 Scher+3.224
iment+3.082
Token+3.051
(*+3.035
Neg logit contributions
シャ-9.698
アル-9.220
-8.657
habi-8.499
-8.333
Tokenz
Token ID89
Feature activation+2.09e+00

Pos logit contributions
rint+5.313
iment+3.643
agents+3.546
 Scher+3.520
 tamp+3.472
Neg logit contributions
シャ-8.005
アル-7.689
-7.096
-6.791
ンジ-6.724
Token VOL
Token ID38570
Feature activation-1.03e+00

Pos logit contributions
シャ+1.504
アル+1.427
+1.276
+1.232
 Shepard+1.205
Neg logit contributions
rint-1.835
agents-1.398
iment-1.390
 tamp-1.346
 Scher-1.340
TokenUN
Token ID4944
Feature activation+1.99e+00

Pos logit contributions
rint+0.892
agents+0.677
iment+0.675
 Scher+0.666
 tamp+0.664
Neg logit contributions
シャ-0.760
アル-0.725
-0.650
-0.621
habi-0.614
TokenTE
Token ID9328
Feature activation-3.20e+00

Pos logit contributions
シャ+1.586
アル+1.496
+1.360
+1.325
nikov+1.291
Neg logit contributions
rint-1.606
agents-1.205
iment-1.191
 Scher-1.163
 tamp-1.149
TokenER
Token ID1137
Feature activation-1.30e+00

Pos logit contributions
rint+2.800
iment+2.144
agents+2.135
 Scher+2.126
 tamp+2.116
Neg logit contributions
シャ-2.286
アル-2.191
-1.960
-1.857
habi-1.854