TOP ACTIVATIONS
MAX = 2.446

were scoring those two f ift ies against West Indies in
got a couple of f ift ies in that game and
149 ) D ance in front of a turret
Railroad Company F acts [ edit ]
th Season : Ghost Island TBA TBA The Complete Thirty -
lett Foundation F ounded in 1966 by William Hew
rey f us Ve ep HBO Jim Parsons The Big Bang
in today 's Counter - Strike are caused by two magical
Season : Ghost Island TBA TBA The Complete Thirty - Se
ared Anton Ferdinand 's deep cross , Lor ik Can a
emotions . F acts could not change the minds
1 ) of the Criminal Code Act 1995 , which carries
ih al - B uk hari , 5 : 58 :
camps ites . WATCH OUT FOR R AT TL
credit for your aw es omen ess . If
Con aug hey True Detective HBO Matthew Rh ys The Americans
an already - f ant astic creation . Scott
t seem f azed . While he
ometer circuit . WATCH : The attempt will be
credit for the hack , Crowd strike stood by its analysis

MIN ACTIVATIONS
MIN = -2.040

major Oscars cont ender , potentially picking up nominations
was the shin ig ami art of high - speed movement
are in a nond esc ript pre war building overlooking Herald
has contradicted inc ont ro vert ible reality .
ern i ère ment m ais pas mo ind re ment
seen as a racial epit het that targeted African - Americans
and the Stars re - sign Lang en br un ner
AND STEP H EN M atur in first meet in April
playoff round or re - sign s Lang en br un
Meanwhile , on foreign policy -- another Carter weak
. <|endoftext|> B rock Les nar has been flagged by the
So more OFFIC IAL announcements soon . Be careful
and stre ng then faculty tenure systems .
last mile connectivity and public transport , automation and computer ization
officer , and Stephen M atur in , a penn iless
Ted H agg ard is prof iled in the new G
Paul Ryan On Aug . 12 , 2012 ,
is saying
of the 5 PM Monday deadline for all NHL teams to

INTERVAL 1.325 - 2.446
CONTAINS 0.100%

side our earlier requirements ) leaves us with a good prototype
credit for your aw es omen ess . If
their impartial ity . For instance , a key aspect of
Lab / Method / F raction ( b , c ,
ers vs . Hust lers TBA TBA The Complete Thirty -

INTERVAL 0.203 - 1.325
CONTAINS 21.784%

s seventh consecutive lunar mission failure . NASA Headquarters formed
Fit an Object Al igned B ounding Box around the space
, DC : US Department of Transportation , National Highway Traffic
got a gun in your house , I m
eder to go back on 24 - hour alcohol monitoring and

INTERVAL -0.918 - 0.203
CONTAINS 77.315%

gay . In this context , the gay man is just
humans . The Australian and French research shows that
2 + 1 is , but they are capable of mathematical
are the sweeping back stories that are attached to the characters
has also offered the president praise . Last week , he

INTERVAL -2.040 - -0.918
CONTAINS 0.801%

other OD IN based boot load ers , so when the
He has been photographed receiving a business award from
the entire time we were together . I was broken .
DC : US Department of Transportation , National Highway Traffic Safety
, California . Hide Caption 10 of 15 Photos : Carly
Token were
Token ID547
Feature activation+1.42e-01

Pos logit contributions
ignt+0.447
GN+0.439
wic+0.434
aughs+0.429
stories+0.417
Neg logit contributions
 mole-0.503
 ],-0.457
ゼウス-0.447
 ):-0.403
 ].-0.394
Token scoring
Token ID9689
Feature activation+3.08e-01

Pos logit contributions
ignt+0.094
GN+0.092
wic+0.091
aughs+0.090
stories+0.087
Neg logit contributions
 mole-0.114
 ],-0.104
ゼウス-0.101
 ):-0.092
 ].-0.090
Token those
Token ID883
Feature activation-5.51e-02

Pos logit contributions
ignt+0.189
GN+0.185
wic+0.182
aughs+0.180
stories+0.174
Neg logit contributions
 mole-0.263
 ],-0.241
ゼウス-0.236
 ):-0.215
 ].-0.211
Token two
Token ID734
Feature activation+3.69e-01

Pos logit contributions
 mole+0.048
 ],+0.044
ゼウス+0.043
 ):+0.039
 ].+0.038
Neg logit contributions
ignt-0.033
GN-0.032
wic-0.032
aughs-0.031
stories-0.030
Token f
Token ID277
Feature activation+3.48e-01

Pos logit contributions
ignt+0.223
GN+0.219
wic+0.215
aughs+0.213
stories+0.205
Neg logit contributions
 mole-0.318
 ],-0.292
ゼウス-0.284
 ):-0.261
 ].-0.255
Tokenift
Token ID2135
Feature activation+2.45e+00

Pos logit contributions
ignt+0.222
GN+0.218
wic+0.215
aughs+0.212
stories+0.206
Neg logit contributions
 mole-0.286
 ],-0.263
ゼウス-0.256
 ):-0.234
 ].-0.228
Tokenies
Token ID444
Feature activation+6.21e-01

Pos logit contributions
ignt+1.100
GN+1.056
wic+1.042
aughs+1.020
stories+0.970
Neg logit contributions
 mole-2.492
 ],-2.314
ゼウス-2.277
 ):-2.107
 ].-2.077
Token against
Token ID1028
Feature activation-1.68e-02

Pos logit contributions
ignt+0.385
GN+0.377
wic+0.372
aughs+0.367
stories+0.356
Neg logit contributions
 mole-0.524
 ],-0.480
ゼウス-0.471
 ):-0.428
 ].-0.420
Token West
Token ID2688
Feature activation+1.12e+00

Pos logit contributions
 mole+0.014
 ],+0.013
ゼウス+0.013
 ):+0.011
 ].+0.011
Neg logit contributions
ignt-0.011
GN-0.010
wic-0.010
aughs-0.010
stories-0.010
Token Indies
Token ID42556
Feature activation+4.41e-01

Pos logit contributions
ignt+0.815
GN+0.801
wic+0.790
aughs+0.784
stories+0.761
Neg logit contributions
 mole-0.831
 ],-0.752
ゼウス-0.732
 ):-0.658
 ].-0.641
Token in
Token ID287
Feature activation+9.04e-02

Pos logit contributions
ignt+0.273
GN+0.267
wic+0.263
aughs+0.261
stories+0.251
Neg logit contributions
 mole-0.373
 ],-0.343
ゼウス-0.333
 ):-0.306
 ].-0.299
Token got
Token ID1392
Feature activation-3.12e-01

Pos logit contributions
 mole+0.035
 ],+0.032
ゼウス+0.032
 ):+0.028
 ].+0.027
Neg logit contributions
ignt-0.029
GN-0.029
wic-0.028
aughs-0.028
stories-0.027
Token a
Token ID257
Feature activation-2.60e-02

Pos logit contributions
 mole+0.253
 ],+0.231
ゼウス+0.226
 ):+0.205
 ].+0.201
Neg logit contributions
ignt-0.204
GN-0.200
wic-0.197
aughs-0.194
stories-0.189
Token couple
Token ID3155
Feature activation-3.72e-01

Pos logit contributions
 mole+0.022
 ],+0.020
ゼウス+0.019
 ):+0.018
 ].+0.017
Neg logit contributions
ignt-0.017
GN-0.016
wic-0.016
aughs-0.016
stories-0.015
Token of
Token ID286
Feature activation+3.25e-01

Pos logit contributions
 mole+0.307
 ],+0.279
ゼウス+0.270
 ):+0.250
 ].+0.242
Neg logit contributions
ignt-0.239
GN-0.236
wic-0.231
aughs-0.230
stories-0.220
Token f
Token ID277
Feature activation+1.56e-01

Pos logit contributions
ignt+0.200
GN+0.195
wic+0.192
aughs+0.190
stories+0.183
Neg logit contributions
 mole-0.278
 ],-0.255
ゼウス-0.249
 ):-0.227
 ].-0.223
Tokenift
Token ID2135
Feature activation+2.33e+00

Pos logit contributions
ignt+0.101
GN+0.100
wic+0.098
aughs+0.097
stories+0.094
Neg logit contributions
 mole-0.126
 ],-0.116
ゼウス-0.112
 ):-0.103
 ].-0.100
Tokenies
Token ID444
Feature activation+3.72e-01

Pos logit contributions
ignt+1.075
GN+1.042
wic+1.026
aughs+1.010
stories+0.963
Neg logit contributions
 mole-2.344
 ],-2.179
ゼウス-2.145
 ):-1.979
 ].-1.950
Token in
Token ID287
Feature activation+1.24e-01

Pos logit contributions
ignt+0.235
GN+0.231
wic+0.227
aughs+0.224
stories+0.218
Neg logit contributions
 mole-0.309
 ],-0.283
ゼウス-0.279
 ):-0.251
 ].-0.247
Token that
Token ID326
Feature activation+6.09e-02

Pos logit contributions
ignt+0.086
GN+0.084
wic+0.083
aughs+0.082
stories+0.080
Neg logit contributions
 mole-0.095
 ],-0.086
ゼウス-0.085
 ):-0.076
 ].-0.074
Token game
Token ID983
Feature activation-1.62e-01

Pos logit contributions
ignt+0.041
GN+0.040
wic+0.039
aughs+0.039
stories+0.038
Neg logit contributions
 mole-0.049
 ],-0.044
ゼウス-0.043
 ):-0.039
 ].-0.038
Token and
Token ID290
Feature activation-1.11e-01

Pos logit contributions
 mole+0.137
 ],+0.125
ゼウス+0.122
 ):+0.112
 ].+0.109
Neg logit contributions
ignt-0.101
GN-0.099
wic-0.098
aughs-0.097
stories-0.093
Token149
Token ID19442
Feature activation+1.70e-01

Pos logit contributions
 mole+0.129
 ],+0.117
ゼウス+0.114
 ):+0.104
 ].+0.102
Neg logit contributions
ignt-0.097
GN-0.097
wic-0.095
aughs-0.093
stories-0.091
Token)
Token ID8
Feature activation-2.65e-01

Pos logit contributions
ignt+0.095
GN+0.091
wic+0.090
aughs+0.090
stories+0.085
Neg logit contributions
 mole-0.156
 ],-0.142
ゼウス-0.139
 ):-0.131
 ])-0.125
Token
Token ID198
Feature activation-1.17e-01

Pos logit contributions
 mole+0.243
 ],+0.224
ゼウス+0.216
 ):+0.202
 ].+0.197
Neg logit contributions
ignt-0.149
GN-0.144
wic-0.142
aughs-0.138
stories-0.134
Token
Token ID198
Feature activation-1.33e-01

Pos logit contributions
 mole+0.116
 ],+0.107
ゼウス+0.106
 ):+0.097
 ].+0.095
Neg logit contributions
ignt-0.055
GN-0.054
wic-0.053
aughs-0.052
stories-0.050
TokenD
Token ID35
Feature activation+4.59e-01

Pos logit contributions
 mole+0.109
 ],+0.100
ゼウス+0.098
 ):+0.089
 ].+0.087
Neg logit contributions
ignt-0.086
GN-0.084
wic-0.083
aughs-0.082
stories-0.079
Tokenance
Token ID590
Feature activation+2.29e+00

Pos logit contributions
ignt+0.294
GN+0.287
wic+0.284
aughs+0.281
stories+0.272
Neg logit contributions
 mole-0.379
 ],-0.346
ゼウス-0.337
 ):-0.308
 ].-0.301
Token in
Token ID287
Feature activation+1.24e-01

Pos logit contributions
ignt+1.424
GN+1.404
wic+1.381
aughs+1.348
stories+1.317
Neg logit contributions
 mole-1.887
 ],-1.744
ゼウス-1.725
 ):-1.553
 ].-1.521
Token front
Token ID2166
Feature activation+3.75e-01

Pos logit contributions
ignt+0.083
GN+0.081
wic+0.080
aughs+0.079
stories+0.077
Neg logit contributions
 mole-0.098
 ],-0.089
ゼウス-0.087
 ):-0.079
 ].-0.077
Token of
Token ID286
Feature activation-6.06e-02

Pos logit contributions
ignt+0.218
GN+0.214
wic+0.211
aughs+0.204
stories+0.202
Neg logit contributions
 mole-0.327
 ],-0.302
ゼウス-0.299
 ):-0.271
 ].-0.266
Token a
Token ID257
Feature activation+1.69e-01

Pos logit contributions
 mole+0.048
 ],+0.043
ゼウス+0.042
 ):+0.038
 ].+0.037
Neg logit contributions
ignt-0.042
GN-0.041
wic-0.040
aughs-0.040
stories-0.039
Token turret
Token ID28101
Feature activation-6.64e-02

Pos logit contributions
ignt+0.103
GN+0.101
wic+0.099
aughs+0.099
stories+0.095
Neg logit contributions
 mole-0.145
 ],-0.132
ゼウス-0.129
 ):-0.118
 ].-0.116
Token Railroad
Token ID36289
Feature activation+3.12e-01

Pos logit contributions
ignt+0.178
GN+0.175
wic+0.173
aughs+0.172
stories+0.168
Neg logit contributions
 mole-0.132
 ],-0.118
ゼウス-0.113
 ):-0.100
 ].-0.097
Token Company
Token ID5834
Feature activation+2.10e-01

Pos logit contributions
ignt+0.235
GN+0.231
wic+0.227
aughs+0.226
stories+0.220
Neg logit contributions
 mole-0.223
 ],-0.202
ゼウス-0.194
 ):-0.176
 ].-0.172
Token
Token ID198
Feature activation+6.32e-01

Pos logit contributions
ignt+0.119
GN+0.116
wic+0.114
aughs+0.112
stories+0.108
Neg logit contributions
 mole-0.190
 ],-0.175
ゼウス-0.172
 ):-0.157
 ].-0.154
Token
Token ID198
Feature activation+5.53e-01

Pos logit contributions
GN+0.286
ignt+0.285
wic+0.278
aughs+0.267
stories+0.263
Neg logit contributions
 mole-0.635
ゼウス-0.585
 ],-0.583
 ):-0.530
 ].-0.521
TokenF
Token ID37
Feature activation-2.24e-02

Pos logit contributions
ignt+0.352
GN+0.346
wic+0.341
aughs+0.336
stories+0.326
Neg logit contributions
 mole-0.458
 ],-0.418
ゼウス-0.409
 ):-0.372
 ].-0.364
Tokenacts
Token ID8656
Feature activation+2.10e+00

Pos logit contributions
 mole+0.018
 ],+0.016
ゼウス+0.016
 ):+0.014
 ].+0.014
Neg logit contributions
ignt-0.015
GN-0.015
wic-0.015
aughs-0.015
stories-0.014
Token [
Token ID685
Feature activation+2.63e-01

Pos logit contributions
ignt+1.334
GN+1.307
wic+1.289
aughs+1.265
stories+1.233
Neg logit contributions
 mole-1.740
 ],-1.586
ゼウス-1.565
 ):-1.413
 ].-1.380
Token edit
Token ID4370
Feature activation+5.48e-01

Pos logit contributions
ignt+0.157
GN+0.154
wic+0.151
aughs+0.150
stories+0.144
Neg logit contributions
 mole-0.228
 ],-0.210
ゼウス-0.205
 ):-0.188
 ].-0.184
Token ]
Token ID2361
Feature activation+1.40e-01

Pos logit contributions
GN+0.628
ignt+0.627
wic+0.620
aughs+0.619
stories+0.612
Neg logit contributions
 mole-0.162
ゼウス-0.126
 ],-0.109
 ):-0.072
skirts-0.069
Token
Token ID198
Feature activation+1.38e-01

Pos logit contributions
ignt+0.066
GN+0.065
wic+0.064
aughs+0.062
stories+0.060
Neg logit contributions
 mole-0.138
 ],-0.127
ゼウス-0.127
 ):-0.116
 ].-0.114
Token
Token ID198
Feature activation+1.75e-01

Pos logit contributions
ignt+0.064
GN+0.064
wic+0.062
aughs+0.060
stories+0.059
Neg logit contributions
 mole-0.137
 ],-0.127
ゼウス-0.126
 ):-0.115
 ].-0.113
Tokenth
Token ID400
Feature activation+5.80e-01

Pos logit contributions
ignt+0.421
GN+0.412
wic+0.407
aughs+0.402
stories+0.390
Neg logit contributions
 mole-0.486
 ],-0.443
ゼウス-0.432
 ):-0.392
 ].-0.382
Token Season
Token ID7369
Feature activation+4.58e-01

Pos logit contributions
ignt+0.465
GN+0.457
wic+0.452
aughs+0.448
stories+0.436
Neg logit contributions
 mole-0.385
 ],-0.345
ゼウス-0.334
 ):-0.297
 ].-0.288
Token:
Token ID25
Feature activation+4.31e-01

Pos logit contributions
ignt+0.300
GN+0.293
wic+0.289
aughs+0.286
stories+0.277
Neg logit contributions
 mole-0.372
 ],-0.340
ゼウス-0.331
 ):-0.302
 ].-0.295
Token Ghost
Token ID9897
Feature activation-2.27e-01

Pos logit contributions
ignt+0.300
GN+0.293
wic+0.289
aughs+0.286
stories+0.278
Neg logit contributions
 mole-0.333
 ],-0.303
ゼウス-0.294
 ):-0.266
 ].-0.260
Token Island
Token ID5451
Feature activation+4.08e-01

Pos logit contributions
 mole+0.182
 ],+0.167
ゼウス+0.161
 ):+0.148
 ].+0.145
Neg logit contributions
ignt-0.151
GN-0.148
wic-0.146
aughs-0.143
stories-0.138
Token TBA
Token ID48591
Feature activation+2.08e+00

Pos logit contributions
ignt+0.270
GN+0.265
wic+0.261
aughs+0.258
stories+0.250
Neg logit contributions
 mole-0.328
 ],-0.300
ゼウス-0.292
 ):-0.266
 ].-0.260
Token TBA
Token ID48591
Feature activation+2.01e+00

Pos logit contributions
ignt+1.361
GN+1.344
wic+1.319
aughs+1.307
stories+1.266
Neg logit contributions
 mole-1.675
 ],-1.528
ゼウス-1.497
 ):-1.354
 ].-1.327
Token The
Token ID383
Feature activation+8.59e-01

Pos logit contributions
ignt+1.365
GN+1.351
wic+1.326
aughs+1.314
stories+1.276
Neg logit contributions
 mole-1.570
 ],-1.427
ゼウス-1.398
 ):-1.260
 ].-1.232
Token Complete
Token ID13248
Feature activation+1.25e+00

Pos logit contributions
ignt+0.627
GN+0.618
wic+0.609
aughs+0.604
stories+0.587
Neg logit contributions
 mole-0.630
 ],-0.570
ゼウス-0.557
 ):-0.498
 ].-0.486
Token Thirty
Token ID29948
Feature activation+1.03e+00

Pos logit contributions
ignt+0.917
GN+0.903
wic+0.890
aughs+0.883
stories+0.858
Neg logit contributions
 mole-0.915
 ],-0.827
ゼウス-0.808
 ):-0.722
 ].-0.704
Token-
Token ID12
Feature activation+6.40e-01

Pos logit contributions
ignt+0.479
GN+0.468
wic+0.458
aughs+0.451
stories+0.433
Neg logit contributions
 mole-1.023
 ],-0.951
ゼウス-0.937
 ):-0.864
 ].-0.849
Tokenlett
Token ID15503
Feature activation+5.42e-01

Pos logit contributions
ignt+0.164
GN+0.162
wic+0.159
aughs+0.158
stories+0.154
Neg logit contributions
 mole-0.142
 ],-0.128
ゼウス-0.123
 ):-0.110
 ].-0.107
Token Foundation
Token ID5693
Feature activation-2.53e-01

Pos logit contributions
ignt+0.350
GN+0.342
wic+0.337
aughs+0.334
stories+0.323
Neg logit contributions
 mole-0.445
 ],-0.408
ゼウス-0.397
 ):-0.362
 ].-0.354
Token
Token ID198
Feature activation+2.37e-01

Pos logit contributions
 mole+0.206
 ],+0.188
ゼウス+0.183
 ):+0.167
 ].+0.163
Neg logit contributions
ignt-0.166
GN-0.162
wic-0.160
aughs-0.158
stories-0.153
Token
Token ID198
Feature activation+1.64e-01

Pos logit contributions
GN+0.109
ignt+0.108
wic+0.106
aughs+0.102
stories+0.100
Neg logit contributions
 mole-0.237
ゼウス-0.219
 ],-0.218
 ):-0.198
 ].-0.195
TokenF
Token ID37
Feature activation+7.33e-02

Pos logit contributions
ignt+0.100
GN+0.098
wic+0.096
aughs+0.095
stories+0.092
Neg logit contributions
 mole-0.141
 ],-0.129
ゼウス-0.126
 ):-0.116
 ].-0.113
Tokenounded
Token ID6302
Feature activation+2.06e+00

Pos logit contributions
ignt+0.051
GN+0.050
wic+0.050
aughs+0.049
stories+0.048
Neg logit contributions
 mole-0.056
 ],-0.051
ゼウス-0.050
 ):-0.045
 ].-0.044
Token in
Token ID287
Feature activation-6.04e-02

Pos logit contributions
ignt+1.328
GN+1.305
wic+1.284
aughs+1.267
stories+1.226
Neg logit contributions
 mole-1.687
 ],-1.534
ゼウス-1.519
 ):-1.364
 ].-1.335
Token 1966
Token ID19322
Feature activation-5.02e-01

Pos logit contributions
 mole+0.054
 ],+0.050
ゼウス+0.049
 ):+0.045
 ].+0.044
Neg logit contributions
ignt-0.035
GN-0.034
wic-0.033
aughs-0.033
stories-0.032
Token by
Token ID416
Feature activation+6.32e-02

Pos logit contributions
 mole+0.407
 ],+0.375
ゼウス+0.362
 ):+0.331
 ].+0.323
Neg logit contributions
ignt-0.329
GN-0.322
wic-0.316
aughs-0.315
stories-0.303
Token William
Token ID3977
Feature activation+3.75e-01

Pos logit contributions
ignt+0.043
GN+0.042
wic+0.041
aughs+0.041
stories+0.040
Neg logit contributions
 mole-0.050
 ],-0.045
ゼウス-0.044
 ):-0.040
 ].-0.039
Token Hew
Token ID30446
Feature activation+6.49e-01

Pos logit contributions
ignt+0.251
GN+0.246
wic+0.243
aughs+0.240
stories+0.233
Neg logit contributions
 mole-0.298
 ],-0.272
ゼウス-0.265
 ):-0.240
 ].-0.235
Tokenrey
Token ID4364
Feature activation+4.51e-01

Pos logit contributions
ignt+0.081
GN+0.078
wic+0.077
aughs+0.076
stories+0.074
Neg logit contributions
 mole-0.105
 ],-0.097
ゼウス-0.094
 ):-0.087
 ].-0.085
Tokenf
Token ID69
Feature activation-1.46e-02

Pos logit contributions
ignt+0.305
GN+0.298
wic+0.294
aughs+0.292
stories+0.283
Neg logit contributions
 mole-0.355
 ],-0.324
ゼウス-0.314
 ):-0.287
 ].-0.280
Tokenus
Token ID385
Feature activation+3.48e-01

Pos logit contributions
 mole+0.009
 ],+0.008
ゼウス+0.008
 ):+0.007
 ].+0.007
Neg logit contributions
ignt-0.012
GN-0.012
wic-0.011
aughs-0.011
stories-0.011
Token Ve
Token ID8016
Feature activation+1.33e-01

Pos logit contributions
ignt+0.236
GN+0.229
wic+0.226
aughs+0.225
stories+0.217
Neg logit contributions
 mole-0.276
 ],-0.251
ゼウス-0.244
 ):-0.222
 ].-0.217
Tokenep
Token ID538
Feature activation-2.76e-02

Pos logit contributions
ignt+0.071
GN+0.068
wic+0.068
aughs+0.067
stories+0.065
Neg logit contributions
 mole-0.124
 ],-0.115
ゼウス-0.112
 ):-0.104
 ].-0.102
Token HBO
Token ID18804
Feature activation+2.03e+00

Pos logit contributions
 mole+0.023
 ],+0.021
ゼウス+0.020
 ):+0.018
 ].+0.018
Neg logit contributions
ignt-0.018
GN-0.017
wic-0.017
aughs-0.017
stories-0.016
Token Jim
Token ID5395
Feature activation+2.52e-01

Pos logit contributions
ignt+1.288
GN+1.266
wic+1.245
aughs+1.231
stories+1.191
Neg logit contributions
 mole-1.675
 ],-1.534
ゼウス-1.502
 ):-1.364
 ].-1.335
Token Parsons
Token ID35925
Feature activation-1.33e-01

Pos logit contributions
ignt+0.209
GN+0.203
aughs+0.201
wic+0.201
stories+0.195
Neg logit contributions
 mole-0.161
 ],-0.145
ゼウス-0.138
 ):-0.125
 ].-0.121
Token The
Token ID383
Feature activation+3.15e-01

Pos logit contributions
 mole+0.104
 ],+0.095
ゼウス+0.092
 ):+0.084
 ].+0.082
Neg logit contributions
ignt-0.091
GN-0.088
wic-0.088
aughs-0.087
stories-0.084
Token Big
Token ID4403
Feature activation+1.12e-01

Pos logit contributions
ignt+0.205
GN+0.202
wic+0.199
aughs+0.198
stories+0.191
Neg logit contributions
 mole-0.254
 ],-0.232
ゼウス-0.228
 ):-0.206
 ].-0.202
Token Bang
Token ID9801
Feature activation+3.99e-02

Pos logit contributions
ignt+0.034
GN+0.032
wic+0.031
aughs+0.030
stories+0.027
Neg logit contributions
 mole-0.130
 ],-0.123
ゼウス-0.119
 ):-0.113
 ].-0.112
Token in
Token ID287
Feature activation-8.04e-03

Pos logit contributions
ignt+0.036
GN+0.035
wic+0.034
aughs+0.034
stories+0.033
Neg logit contributions
 mole-0.039
 ],-0.035
ゼウス-0.035
 ):-0.031
 ].-0.030
Token today
Token ID1909
Feature activation+3.81e-01

Pos logit contributions
 mole+0.007
 ],+0.006
ゼウス+0.006
 ):+0.005
 ].+0.005
Neg logit contributions
ignt-0.005
GN-0.005
wic-0.005
aughs-0.005
stories-0.005
Token's
Token ID338
Feature activation+1.53e-01

Pos logit contributions
ignt+0.285
GN+0.280
wic+0.276
aughs+0.274
stories+0.266
Neg logit contributions
 mole-0.274
 ],-0.247
ゼウス-0.240
 ):-0.216
 ].-0.210
Token Counter
Token ID15034
Feature activation+9.08e-01

Pos logit contributions
ignt+0.086
GN+0.083
wic+0.082
aughs+0.081
stories+0.078
Neg logit contributions
 mole-0.139
 ],-0.128
ゼウス-0.125
 ):-0.116
 ].-0.113
Token-
Token ID12
Feature activation-6.88e-03

Pos logit contributions
ignt+0.481
aughs+0.456
GN+0.456
wic+0.455
stories+0.430
Neg logit contributions
 mole-0.864
 ],-0.793
ゼウス-0.771
 ):-0.719
 ].-0.703
TokenStrike
Token ID31584
Feature activation+2.01e+00

Pos logit contributions
 mole+0.007
 ],+0.006
ゼウス+0.006
 ):+0.006
 ].+0.005
Neg logit contributions
ignt-0.004
wic-0.003
aughs-0.003
GN-0.003
stories-0.003
Token are
Token ID389
Feature activation+2.08e-01

Pos logit contributions
ignt+1.284
GN+1.269
wic+1.241
aughs+1.228
stories+1.195
Neg logit contributions
 mole-1.654
 ],-1.514
ゼウス-1.490
 ):-1.343
 ].-1.315
Token caused
Token ID4073
Feature activation+2.04e-01

Pos logit contributions
ignt+0.152
GN+0.149
wic+0.147
aughs+0.146
stories+0.142
Neg logit contributions
 mole-0.153
 ],-0.138
ゼウス-0.137
 ):-0.121
 ].-0.118
Token by
Token ID416
Feature activation+1.47e-01

Pos logit contributions
ignt+0.155
GN+0.152
wic+0.150
aughs+0.150
stories+0.145
Neg logit contributions
 mole-0.145
 ],-0.131
ゼウス-0.126
 ):-0.114
 ].-0.111
Token two
Token ID734
Feature activation-5.37e-04

Pos logit contributions
ignt+0.094
GN+0.092
wic+0.090
aughs+0.089
stories+0.086
Neg logit contributions
 mole-0.122
 ],-0.112
ゼウス-0.109
 ):-0.099
 ].-0.097
Token magical
Token ID10883
Feature activation+5.17e-01

Pos logit contributions
 mole+0.000
 ],+0.000
ゼウス+0.000
 ):+0.000
 ].+0.000
Neg logit contributions
ignt-0.000
GN-0.000
wic-0.000
aughs-0.000
stories-0.000
Token Season
Token ID7369
Feature activation+4.58e-01

Pos logit contributions
ignt+0.465
GN+0.457
wic+0.452
aughs+0.448
stories+0.436
Neg logit contributions
 mole-0.385
 ],-0.345
ゼウス-0.334
 ):-0.297
 ].-0.288
Token:
Token ID25
Feature activation+4.31e-01

Pos logit contributions
ignt+0.300
GN+0.293
wic+0.289
aughs+0.286
stories+0.277
Neg logit contributions
 mole-0.372
 ],-0.340
ゼウス-0.331
 ):-0.302
 ].-0.295
Token Ghost
Token ID9897
Feature activation-2.27e-01

Pos logit contributions
ignt+0.300
GN+0.293
wic+0.289
aughs+0.286
stories+0.278
Neg logit contributions
 mole-0.333
 ],-0.303
ゼウス-0.294
 ):-0.266
 ].-0.260
Token Island
Token ID5451
Feature activation+4.08e-01

Pos logit contributions
 mole+0.182
 ],+0.167
ゼウス+0.161
 ):+0.148
 ].+0.145
Neg logit contributions
ignt-0.151
GN-0.148
wic-0.146
aughs-0.143
stories-0.138
Token TBA
Token ID48591
Feature activation+2.08e+00

Pos logit contributions
ignt+0.270
GN+0.265
wic+0.261
aughs+0.258
stories+0.250
Neg logit contributions
 mole-0.328
 ],-0.300
ゼウス-0.292
 ):-0.266
 ].-0.260
Token TBA
Token ID48591
Feature activation+2.01e+00

Pos logit contributions
ignt+1.361
GN+1.344
wic+1.319
aughs+1.307
stories+1.266
Neg logit contributions
 mole-1.675
 ],-1.528
ゼウス-1.497
 ):-1.354
 ].-1.327
Token The
Token ID383
Feature activation+8.59e-01

Pos logit contributions
ignt+1.365
GN+1.351
wic+1.326
aughs+1.314
stories+1.276
Neg logit contributions
 mole-1.570
 ],-1.427
ゼウス-1.398
 ):-1.260
 ].-1.232
Token Complete
Token ID13248
Feature activation+1.25e+00

Pos logit contributions
ignt+0.627
GN+0.618
wic+0.609
aughs+0.604
stories+0.587
Neg logit contributions
 mole-0.630
 ],-0.570
ゼウス-0.557
 ):-0.498
 ].-0.486
Token Thirty
Token ID29948
Feature activation+1.03e+00

Pos logit contributions
ignt+0.917
GN+0.903
wic+0.890
aughs+0.883
stories+0.858
Neg logit contributions
 mole-0.915
 ],-0.827
ゼウス-0.808
 ):-0.722
 ].-0.704
Token-
Token ID12
Feature activation+6.40e-01

Pos logit contributions
ignt+0.479
GN+0.468
wic+0.458
aughs+0.451
stories+0.433
Neg logit contributions
 mole-1.023
 ],-0.951
ゼウス-0.937
 ):-0.864
 ].-0.849
TokenSe
Token ID4653
Feature activation+2.12e-01

Pos logit contributions
ignt+0.281
GN+0.271
wic+0.266
aughs+0.262
stories+0.247
Neg logit contributions
 mole-0.659
 ],-0.614
ゼウス-0.602
 ):-0.560
 ].-0.551
Tokenared
Token ID1144
Feature activation-1.30e+00

Pos logit contributions
ignt+0.294
GN+0.289
wic+0.286
aughs+0.284
stories+0.277
Neg logit contributions
 mole-0.191
 ],-0.168
ゼウス-0.163
 ):-0.140
 ].-0.135
Token Anton
Token ID9261
Feature activation+2.52e-01

Pos logit contributions
 mole+1.042
 ],+0.951
ゼウス+0.926
 ):+0.843
 ].+0.823
Neg logit contributions
ignt-0.862
GN-0.845
wic-0.832
aughs-0.825
stories-0.798
Token Ferdinand
Token ID44312
Feature activation+3.82e-02

Pos logit contributions
ignt+0.182
GN+0.179
wic+0.176
aughs+0.175
stories+0.170
Neg logit contributions
 mole-0.187
 ],-0.170
ゼウス-0.164
 ):-0.149
 ].-0.145
Token's
Token ID338
Feature activation+2.87e-01

Pos logit contributions
ignt+0.026
GN+0.026
wic+0.026
aughs+0.025
stories+0.024
Neg logit contributions
 mole-0.029
 ],-0.027
ゼウス-0.026
 ):-0.024
 ].-0.023
Token deep
Token ID2769
Feature activation+7.12e-01

Pos logit contributions
ignt+0.190
GN+0.186
wic+0.183
aughs+0.181
stories+0.176
Neg logit contributions
 mole-0.231
 ],-0.211
ゼウス-0.206
 ):-0.187
 ].-0.183
Token cross
Token ID3272
Feature activation+2.00e+00

Pos logit contributions
ignt+0.457
GN+0.447
wic+0.441
aughs+0.435
stories+0.421
Neg logit contributions
 mole-0.585
 ],-0.536
ゼウス-0.525
 ):-0.476
 ].-0.466
Token,
Token ID11
Feature activation-2.66e-02

Pos logit contributions
ignt+1.281
GN+1.250
wic+1.247
aughs+1.204
stories+1.177
Neg logit contributions
 mole-1.631
 ],-1.490
ゼウス-1.479
 ):-1.326
 ].-1.301
Token Lor
Token ID26068
Feature activation-4.81e-01

Pos logit contributions
 mole+0.021
 ],+0.020
ゼウス+0.019
 ):+0.017
 ].+0.017
Neg logit contributions
ignt-0.017
GN-0.017
wic-0.017
aughs-0.017
stories-0.016
Tokenik
Token ID1134
Feature activation-3.33e-01

Pos logit contributions
 mole+0.335
 ],+0.303
ゼウス+0.290
 ):+0.263
 ].+0.256
Neg logit contributions
ignt-0.366
GN-0.362
wic-0.353
aughs-0.351
stories-0.343
Token Can
Token ID1680
Feature activation+7.69e-01

Pos logit contributions
 mole+0.262
 ],+0.238
ゼウス+0.233
 ):+0.210
 ].+0.205
Neg logit contributions
ignt-0.227
GN-0.222
wic-0.219
aughs-0.217
stories-0.210
Tokena
Token ID64
Feature activation+1.15e-01

Pos logit contributions
ignt+0.481
GN+0.472
wic+0.470
aughs+0.461
stories+0.443
Neg logit contributions
 mole-0.644
 ],-0.590
ゼウス-0.583
 ):-0.525
 ].-0.515
Token emotions
Token ID10825
Feature activation+1.96e-02

Pos logit contributions
ignt+0.011
GN+0.011
wic+0.011
aughs+0.010
stories+0.010
Neg logit contributions
 mole-0.014
 ],-0.013
ゼウス-0.012
 ):-0.011
 ].-0.011
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
ignt+0.012
GN+0.012
wic+0.011
aughs+0.011
stories+0.011
Neg logit contributions
 mole-0.017
 ],-0.016
ゼウス-0.015
 ):-0.014
 ].-0.014
Token
Token ID198
Feature activation+2.45e-02

Pos logit contributions
ignt+0.087
GN+0.086
wic+0.084
aughs+0.083
stories+0.081
Neg logit contributions
 mole-0.098
 ],-0.089
ゼウス-0.087
 ):-0.078
 ].-0.076
Token
Token ID198
Feature activation+5.61e-02

Pos logit contributions
ignt+0.012
GN+0.011
wic+0.011
aughs+0.011
stories+0.011
Neg logit contributions
 mole-0.024
 ],-0.022
ゼウス-0.022
 ):-0.020
 ].-0.020
TokenF
Token ID37
Feature activation-7.28e-03

Pos logit contributions
ignt+0.034
GN+0.033
wic+0.032
aughs+0.032
stories+0.031
Neg logit contributions
 mole-0.049
 ],-0.045
ゼウス-0.044
 ):-0.040
 ].-0.039
Tokenacts
Token ID8656
Feature activation+2.00e+00

Pos logit contributions
 mole+0.005
 ],+0.005
ゼウス+0.005
 ):+0.004
 ].+0.004
Neg logit contributions
ignt-0.005
GN-0.005
wic-0.005
aughs-0.005
stories-0.005
Token could
Token ID714
Feature activation-4.68e-02

Pos logit contributions
ignt+1.320
GN+1.297
wic+1.281
aughs+1.259
stories+1.226
Neg logit contributions
 mole-1.591
 ],-1.452
ゼウス-1.430
 ):-1.283
 ].-1.255
Token not
Token ID407
Feature activation+4.69e-01

Pos logit contributions
 mole+0.039
 ],+0.036
ゼウス+0.035
 ):+0.032
 ].+0.031
Neg logit contributions
ignt-0.029
GN-0.029
wic-0.028
aughs-0.028
stories-0.027
Token change
Token ID1487
Feature activation+1.41e-01

Pos logit contributions
ignt+0.277
GN+0.272
wic+0.266
aughs+0.264
stories+0.254
Neg logit contributions
 mole-0.412
 ],-0.379
ゼウス-0.369
 ):-0.340
 ].-0.333
Token the
Token ID262
Feature activation+3.66e-02

Pos logit contributions
ignt+0.091
GN+0.089
wic+0.088
aughs+0.087
stories+0.084
Neg logit contributions
 mole-0.116
 ],-0.106
ゼウス-0.104
 ):-0.094
 ].-0.092
Token minds
Token ID9017
Feature activation-4.51e-01

Pos logit contributions
ignt+0.023
GN+0.022
wic+0.022
aughs+0.022
stories+0.021
Neg logit contributions
 mole-0.031
 ],-0.028
ゼウス-0.027
 ):-0.025
 ].-0.024
Token1
Token ID16
Feature activation+4.56e-02

Pos logit contributions
 mole+0.023
 ],+0.021
ゼウス+0.020
 ):+0.018
 ].+0.018
Neg logit contributions
ignt-0.019
GN-0.019
wic-0.019
aughs-0.019
stories-0.018
Token)
Token ID8
Feature activation-4.22e-01

Pos logit contributions
ignt+0.027
GN+0.026
wic+0.026
aughs+0.026
stories+0.024
Neg logit contributions
 mole-0.040
 ],-0.037
ゼウス-0.036
 ):-0.034
 ].-0.033
Token of
Token ID286
Feature activation+1.27e-02

Pos logit contributions
 mole+0.351
 ],+0.324
ゼウス+0.313
 ):+0.288
 ].+0.282
Neg logit contributions
ignt-0.267
GN-0.260
wic-0.258
aughs-0.255
stories-0.247
Token the
Token ID262
Feature activation+1.01e-01

Pos logit contributions
ignt+0.009
GN+0.009
wic+0.009
aughs+0.009
stories+0.009
Neg logit contributions
 mole-0.009
 ],-0.009
ゼウス-0.008
 ):-0.008
 ].-0.007
Token Criminal
Token ID15531
Feature activation+1.36e+00

Pos logit contributions
ignt+0.065
GN+0.064
wic+0.063
aughs+0.062
stories+0.060
Neg logit contributions
 mole-0.083
 ],-0.076
ゼウス-0.074
 ):-0.067
 ].-0.066
Token Code
Token ID6127
Feature activation+2.00e+00

Pos logit contributions
ignt+1.073
GN+1.056
wic+1.041
aughs+1.030
stories+1.004
Neg logit contributions
 mole-0.913
 ],-0.817
ゼウス-0.798
 ):-0.703
 ].-0.683
Token Act
Token ID2191
Feature activation+8.39e-01

Pos logit contributions
ignt+1.165
GN+1.149
wic+1.117
aughs+1.104
stories+1.065
Neg logit contributions
 mole-1.750
 ],-1.601
ゼウス-1.585
 ):-1.435
 ])-1.405
Token 1995
Token ID8735
Feature activation-2.81e-01

Pos logit contributions
ignt+0.456
GN+0.445
wic+0.437
aughs+0.432
stories+0.415
Neg logit contributions
 mole-0.773
 ],-0.714
ゼウス-0.700
 ):-0.644
 ].-0.631
Token,
Token ID11
Feature activation+2.66e-01

Pos logit contributions
 mole+0.241
 ],+0.223
ゼウス+0.216
 ):+0.199
 ].+0.195
Neg logit contributions
ignt-0.171
GN-0.167
wic-0.165
aughs-0.163
stories-0.158
Token which
Token ID543
Feature activation+4.40e-02

Pos logit contributions
ignt+0.174
GN+0.170
wic+0.168
aughs+0.166
stories+0.161
Neg logit contributions
 mole-0.216
 ],-0.198
ゼウス-0.192
 ):-0.176
 ].-0.172
Token carries
Token ID10732
Feature activation+4.19e-01

Pos logit contributions
ignt+0.032
GN+0.031
wic+0.031
aughs+0.031
stories+0.030
Neg logit contributions
 mole-0.032
 ],-0.029
ゼウス-0.029
 ):-0.026
 ].-0.025
Tokenih
Token ID4449
Feature activation+1.32e-01

Pos logit contributions
 mole+0.593
 ],+0.556
ゼウス+0.540
 ):+0.509
 ].+0.501
Neg logit contributions
ignt-0.236
GN-0.230
aughs-0.222
wic-0.221
stories-0.206
Token al
Token ID435
Feature activation-1.83e-01

Pos logit contributions
ignt+0.093
GN+0.091
wic+0.090
aughs+0.090
stories+0.086
Neg logit contributions
 mole-0.100
 ],-0.091
ゼウス-0.088
 ):-0.080
 ].-0.078
Token-
Token ID12
Feature activation+1.72e-01

Pos logit contributions
 mole+0.173
 ],+0.162
ゼウス+0.156
 ):+0.146
 ].+0.144
Neg logit contributions
ignt-0.094
GN-0.092
wic-0.091
aughs-0.090
stories-0.086
TokenB
Token ID33
Feature activation+4.81e-01

Pos logit contributions
ignt+0.101
GN+0.098
wic+0.097
aughs+0.097
stories+0.092
Neg logit contributions
 mole-0.150
 ],-0.139
ゼウス-0.135
 ):-0.124
 ].-0.122
Tokenuk
Token ID2724
Feature activation+2.60e-01

Pos logit contributions
ignt+0.272
GN+0.265
wic+0.261
aughs+0.258
stories+0.248
Neg logit contributions
 mole-0.432
 ],-0.399
ゼウス-0.390
 ):-0.359
 ].-0.352
Tokenhari
Token ID49573
Feature activation+1.98e+00

Pos logit contributions
ignt+0.116
GN+0.113
wic+0.109
aughs+0.109
stories+0.101
Neg logit contributions
 mole-0.265
 ],-0.248
ゼウス-0.241
 ):-0.226
 ].-0.223
Token,
Token ID11
Feature activation+3.71e-02

Pos logit contributions
ignt+1.201
GN+1.175
wic+1.155
aughs+1.142
stories+1.115
Neg logit contributions
 mole-1.679
 ],-1.525
ゼウス-1.518
 ):-1.365
 ])-1.331
Token 5
Token ID642
Feature activation+2.22e-01

Pos logit contributions
ignt+0.026
GN+0.025
wic+0.025
aughs+0.025
stories+0.024
Neg logit contributions
 mole-0.029
 ],-0.026
ゼウス-0.025
 ):-0.023
 ].-0.022
Token:
Token ID25
Feature activation+2.86e-01

Pos logit contributions
ignt+0.155
GN+0.151
wic+0.150
aughs+0.149
stories+0.143
Neg logit contributions
 mole-0.171
 ],-0.157
ゼウス-0.152
 ):-0.138
 ].-0.135
Token58
Token ID3365
Feature activation+1.93e-01

Pos logit contributions
ignt+0.180
GN+0.177
wic+0.174
aughs+0.171
stories+0.167
Neg logit contributions
 mole-0.238
 ],-0.217
ゼウス-0.213
 ):-0.193
 ].-0.189
Token:
Token ID25
Feature activation+2.47e-01

Pos logit contributions
ignt+0.115
GN+0.111
wic+0.110
aughs+0.109
stories+0.104
Neg logit contributions
 mole-0.169
 ],-0.157
ゼウス-0.152
 ):-0.141
 ].-0.139
Token camps
Token ID11665
Feature activation-3.89e-01

Pos logit contributions
ignt+0.052
GN+0.051
wic+0.051
aughs+0.050
stories+0.049
Neg logit contributions
 mole-0.063
 ],-0.057
ゼウス-0.056
 ):-0.051
 ].-0.050
Tokenites
Token ID2737
Feature activation-9.46e-02

Pos logit contributions
 mole+0.205
 ],+0.178
ゼウス+0.169
 ):+0.146
 ].+0.140
Neg logit contributions
ignt-0.364
GN-0.360
wic-0.355
aughs-0.355
stories-0.345
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
 mole+0.083
 ],+0.076
ゼウス+0.075
 ):+0.068
 ].+0.067
Neg logit contributions
ignt-0.056
GN-0.054
wic-0.054
aughs-0.053
stories-0.051
Token
Token ID198
Feature activation+4.76e-01

Pos logit contributions
GN+0.183
ignt+0.183
wic+0.180
aughs+0.177
stories+0.172
Neg logit contributions
 mole-0.202
 ],-0.182
ゼウス-0.180
 ):-0.159
 ].-0.156
Token
Token ID198
Feature activation+3.12e-01

Pos logit contributions
GN+0.218
ignt+0.218
wic+0.212
aughs+0.204
stories+0.201
Neg logit contributions
 mole-0.475
 ],-0.438
ゼウス-0.437
 ):-0.397
 ].-0.391
TokenWATCH
Token ID35192
Feature activation+1.96e+00

Pos logit contributions
ignt+0.186
GN+0.185
wic+0.181
aughs+0.177
stories+0.173
Neg logit contributions
 mole-0.270
 ],-0.247
ゼウス-0.242
 ):-0.221
 ].-0.216
Token OUT
Token ID16289
Feature activation+3.20e-02

Pos logit contributions
GN+1.370
ignt+1.370
wic+1.335
aughs+1.318
stories+1.292
Neg logit contributions
 mole-1.491
 ],-1.344
ゼウス-1.338
 ):-1.174
 ].-1.149
Token FOR
Token ID7473
Feature activation+1.32e-01

Pos logit contributions
ignt+0.024
GN+0.023
wic+0.023
aughs+0.023
stories+0.022
Neg logit contributions
 mole-0.023
 ],-0.021
ゼウス-0.021
 ):-0.019
 ].-0.018
Token R
Token ID371
Feature activation-2.45e-01

Pos logit contributions
ignt+0.090
GN+0.088
wic+0.087
aughs+0.086
stories+0.083
Neg logit contributions
 mole-0.103
 ],-0.094
ゼウス-0.092
 ):-0.083
 ].-0.081
TokenAT
Token ID1404
Feature activation-7.69e-01

Pos logit contributions
 mole+0.174
 ],+0.157
ゼウス+0.154
 ):+0.136
 ].+0.133
Neg logit contributions
ignt-0.184
GN-0.182
wic-0.179
aughs-0.178
stories-0.172
TokenTL
Token ID14990
Feature activation+3.75e-01

Pos logit contributions
 mole+0.605
 ],+0.554
ゼウス+0.532
 ):+0.489
 ].+0.477
Neg logit contributions
ignt-0.527
GN-0.505
wic-0.504
aughs-0.501
stories-0.488
Token credit
Token ID3884
Feature activation-6.74e-01

Pos logit contributions
ignt+0.076
GN+0.075
wic+0.074
aughs+0.073
stories+0.071
Neg logit contributions
 mole-0.082
 ],-0.074
ゼウス-0.072
 ):-0.065
 ].-0.063
Token for
Token ID329
Feature activation+1.56e-01

Pos logit contributions
 mole+0.544
 ],+0.496
ゼウス+0.480
 ):+0.445
 ].+0.431
Neg logit contributions
ignt-0.445
GN-0.434
wic-0.428
aughs-0.428
stories-0.407
Token your
Token ID534
Feature activation+4.11e-02

Pos logit contributions
ignt+0.101
GN+0.099
wic+0.098
aughs+0.096
stories+0.094
Neg logit contributions
 mole-0.127
 ],-0.117
ゼウス-0.114
 ):-0.103
 ].-0.101
Token aw
Token ID3253
Feature activation+3.55e-01

Pos logit contributions
ignt+0.026
GN+0.026
wic+0.025
aughs+0.025
stories+0.024
Neg logit contributions
 mole-0.034
 ],-0.031
ゼウス-0.030
 ):-0.027
 ].-0.027
Tokenes
Token ID274
Feature activation+5.82e-01

Pos logit contributions
ignt+0.227
GN+0.222
wic+0.218
aughs+0.216
stories+0.209
Neg logit contributions
 mole-0.294
 ],-0.269
ゼウス-0.263
 ):-0.240
 ].-0.235
Tokenomen
Token ID3674
Feature activation+1.95e+00

Pos logit contributions
ignt+0.496
GN+0.492
wic+0.486
aughs+0.481
stories+0.471
Neg logit contributions
 mole-0.354
 ],-0.311
ゼウス-0.301
 ):-0.264
 ].-0.254
Tokeness
Token ID408
Feature activation+5.24e-01

Pos logit contributions
ignt+1.347
GN+1.310
wic+1.290
aughs+1.278
stories+1.247
Neg logit contributions
 mole-1.529
 ],-1.382
ゼウス-1.368
 ):-1.217
 ].-1.190
Token.
Token ID13
Feature activation+8.25e-02

Pos logit contributions
ignt+0.309
GN+0.302
wic+0.297
aughs+0.294
stories+0.283
Neg logit contributions
 mole-0.459
 ],-0.422
ゼウス-0.413
 ):-0.379
 ].-0.371
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
ignt+0.053
GN+0.051
wic+0.051
aughs+0.050
stories+0.048
Neg logit contributions
 mole-0.068
 ],-0.063
ゼウス-0.061
 ):-0.056
 ].-0.055
Token
Token ID198
Feature activation-3.84e-03

Pos logit contributions
ignt+0.069
GN+0.069
wic+0.067
aughs+0.065
stories+0.063
Neg logit contributions
 mole-0.148
 ],-0.136
ゼウス-0.136
 ):-0.124
 ].-0.122
TokenIf
Token ID1532
Feature activation-3.82e-02

Pos logit contributions
 mole+0.003
 ],+0.003
ゼウス+0.003
 ):+0.003
 ].+0.003
Neg logit contributions
ignt-0.002
GN-0.002
wic-0.002
aughs-0.002
stories-0.002
TokenCon
Token ID3103
Feature activation+3.01e-01

Pos logit contributions
ignt+0.301
GN+0.293
wic+0.287
aughs+0.283
stories+0.270
Neg logit contributions
 mole-0.626
 ],-0.581
ゼウス-0.570
 ):-0.528
 ].-0.519
Tokenaug
Token ID7493
Feature activation-8.67e-01

Pos logit contributions
ignt+0.180
GN+0.178
wic+0.173
aughs+0.171
stories+0.166
Neg logit contributions
 mole-0.262
 ],-0.241
ゼウス-0.236
 ):-0.213
 ].-0.211
Tokenhey
Token ID20342
Feature activation-6.75e-01

Pos logit contributions
 mole+0.644
 ],+0.589
ゼウス+0.569
 ):+0.519
 ].+0.504
Neg logit contributions
ignt-0.620
GN-0.608
wic-0.599
aughs-0.596
stories-0.574
Token True
Token ID6407
Feature activation+6.87e-01

Pos logit contributions
 mole+0.522
 ],+0.476
ゼウス+0.456
 ):+0.421
 ].+0.409
Neg logit contributions
ignt-0.474
GN-0.454
wic-0.453
aughs-0.451
stories-0.434
Token Detective
Token ID20220
Feature activation+4.83e-01

Pos logit contributions
ignt+0.600
GN+0.593
wic+0.585
aughs+0.580
stories+0.567
Neg logit contributions
 mole-0.406
 ],-0.358
ゼウス-0.348
 ):-0.300
 ].-0.290
Token HBO
Token ID18804
Feature activation+1.95e+00

Pos logit contributions
ignt+0.291
GN+0.280
wic+0.278
aughs+0.275
stories+0.265
Neg logit contributions
 mole-0.420
 ],-0.387
ゼウス-0.374
 ):-0.346
 ].-0.339
Token Matthew
Token ID9308
Feature activation+4.58e-01

Pos logit contributions
ignt+1.290
GN+1.264
wic+1.246
aughs+1.233
stories+1.194
Neg logit contributions
 mole-1.561
 ],-1.425
ゼウス-1.390
 ):-1.262
 ].-1.234
Token Rh
Token ID10323
Feature activation+1.83e-01

Pos logit contributions
ignt+0.232
GN+0.226
wic+0.222
aughs+0.219
stories+0.209
Neg logit contributions
 mole-0.439
 ],-0.407
ゼウス-0.400
 ):-0.368
 ].-0.362
Tokenys
Token ID893
Feature activation-8.39e-01

Pos logit contributions
ignt+0.059
GN+0.056
wic+0.055
aughs+0.054
stories+0.050
Neg logit contributions
 mole-0.210
 ],-0.197
ゼウス-0.194
 ):-0.181
 ].-0.179
Token The
Token ID383
Feature activation+1.71e-01

Pos logit contributions
 mole+0.680
 ],+0.623
ゼウス+0.601
 ):+0.555
 ].+0.541
Neg logit contributions
ignt-0.553
GN-0.532
wic-0.526
aughs-0.525
stories-0.503
Token Americans
Token ID3399
Feature activation+3.06e-01

Pos logit contributions
ignt+0.115
GN+0.113
wic+0.112
aughs+0.111
stories+0.108
Neg logit contributions
 mole-0.134
 ],-0.122
ゼウス-0.120
 ):-0.108
 ].-0.105
Token an
Token ID281
Feature activation-1.58e-01

Pos logit contributions
ignt+0.158
GN+0.155
wic+0.154
aughs+0.153
stories+0.146
Neg logit contributions
 mole-0.164
 ],-0.148
ゼウス-0.144
 ):-0.130
 ].-0.127
Token already
Token ID1541
Feature activation+7.42e-01

Pos logit contributions
 mole+0.135
 ],+0.124
ゼウス+0.121
 ):+0.111
 ].+0.109
Neg logit contributions
ignt-0.097
GN-0.094
wic-0.093
aughs-0.092
stories-0.089
Token-
Token ID12
Feature activation+4.93e-02

Pos logit contributions
ignt+0.430
GN+0.420
wic+0.414
aughs+0.409
stories+0.394
Neg logit contributions
 mole-0.658
 ],-0.606
ゼウス-0.591
 ):-0.544
 ].-0.533
Tokenf
Token ID69
Feature activation+1.18e-01

Pos logit contributions
ignt+0.030
GN+0.029
wic+0.029
aughs+0.028
stories+0.027
Neg logit contributions
 mole-0.042
 ],-0.039
ゼウス-0.038
 ):-0.035
 ].-0.034
Tokenant
Token ID415
Feature activation+1.02e+00

Pos logit contributions
ignt+0.079
GN+0.077
wic+0.076
aughs+0.075
stories+0.073
Neg logit contributions
 mole-0.094
 ],-0.086
ゼウス-0.084
 ):-0.076
 ].-0.074
Tokenastic
Token ID3477
Feature activation+1.94e+00

Pos logit contributions
ignt+0.645
GN+0.633
wic+0.625
aughs+0.620
stories+0.597
Neg logit contributions
 mole-0.842
 ],-0.773
ゼウス-0.751
 ):-0.688
 ].-0.674
Token creation
Token ID6282
Feature activation-2.26e-01

Pos logit contributions
ignt+1.405
GN+1.385
wic+1.367
aughs+1.341
stories+1.308
Neg logit contributions
 mole-1.418
 ],-1.289
ゼウス-1.262
 ):-1.128
 ].-1.096
Token.
Token ID13
Feature activation+1.09e-01

Pos logit contributions
 mole+0.195
 ],+0.178
ゼウス+0.173
 ):+0.159
 ].+0.157
Neg logit contributions
ignt-0.138
GN-0.134
aughs-0.133
wic-0.131
stories-0.127
Token
Token ID198
Feature activation+3.67e-01

Pos logit contributions
ignt+0.070
GN+0.069
wic+0.068
aughs+0.067
stories+0.065
Neg logit contributions
 mole-0.089
 ],-0.082
ゼウス-0.080
 ):-0.073
 ].-0.071
Token
Token ID198
Feature activation+2.59e-01

Pos logit contributions
GN+0.169
ignt+0.168
wic+0.164
aughs+0.158
stories+0.155
Neg logit contributions
 mole-0.366
 ],-0.337
ゼウス-0.336
 ):-0.306
 ].-0.301
TokenScott
Token ID19040
Feature activation+3.86e-01

Pos logit contributions
ignt+0.162
GN+0.159
wic+0.157
aughs+0.155
stories+0.150
Neg logit contributions
 mole-0.217
 ],-0.199
ゼウス-0.194
 ):-0.177
 ].-0.173
Token
Token ID447
Feature activation+1.73e-01

Pos logit contributions
 mole+0.044
 ],+0.040
ゼウス+0.039
 ):+0.035
 ].+0.034
Neg logit contributions
GN-0.037
ignt-0.037
wic-0.036
aughs-0.035
stories-0.035
Token
Token ID247
Feature activation+3.89e-01

Pos logit contributions
ignt+0.158
GN+0.156
wic+0.154
aughs+0.153
stories+0.150
Neg logit contributions
 mole-0.096
 ],-0.083
ゼウス-0.080
 ):-0.069
 ].-0.066
Tokent
Token ID83
Feature activation+2.06e-01

Pos logit contributions
ignt+0.225
GN+0.220
wic+0.217
aughs+0.214
stories+0.206
Neg logit contributions
 mole-0.345
 ],-0.318
ゼウス-0.310
 ):-0.285
 ].-0.279
Token seem
Token ID1283
Feature activation-7.71e-02

Pos logit contributions
ignt+0.123
GN+0.120
wic+0.118
aughs+0.117
stories+0.113
Neg logit contributions
 mole-0.179
 ],-0.164
ゼウス-0.161
 ):-0.147
 ].-0.144
Token f
Token ID277
Feature activation-7.59e-03

Pos logit contributions
 mole+0.063
 ],+0.057
ゼウス+0.056
 ):+0.051
 ].+0.050
Neg logit contributions
ignt-0.050
GN-0.049
wic-0.049
aughs-0.048
stories-0.046
Tokenazed
Token ID13865
Feature activation+1.92e+00

Pos logit contributions
 mole+0.007
 ],+0.006
ゼウス+0.006
 ):+0.006
 ].+0.006
Neg logit contributions
ignt-0.004
GN-0.004
wic-0.004
aughs-0.004
stories-0.004
Token.
Token ID13
Feature activation+3.23e-01

Pos logit contributions
ignt+1.237
GN+1.212
wic+1.193
aughs+1.181
stories+1.142
Neg logit contributions
 mole-1.581
 ],-1.446
ゼウス-1.414
 ):-1.283
 ].-1.257
Token
Token ID198
Feature activation+2.19e-01

Pos logit contributions
ignt+0.215
GN+0.211
wic+0.208
aughs+0.206
stories+0.199
Neg logit contributions
 mole-0.258
 ],-0.235
ゼウス-0.230
 ):-0.209
 ].-0.204
Token
Token ID198
Feature activation+9.06e-02

Pos logit contributions
ignt+0.101
GN+0.101
wic+0.099
aughs+0.095
stories+0.093
Neg logit contributions
 mole-0.218
 ],-0.201
ゼウス-0.200
 ):-0.182
 ].-0.180
TokenWhile
Token ID3633
Feature activation+6.86e-02

Pos logit contributions
ignt+0.065
GN+0.063
wic+0.063
aughs+0.062
stories+0.060
Neg logit contributions
 mole-0.068
 ],-0.062
ゼウス-0.060
 ):-0.055
 ].-0.053
Token he
Token ID339
Feature activation-1.81e-02

Pos logit contributions
ignt+0.047
GN+0.046
wic+0.045
aughs+0.045
stories+0.043
Neg logit contributions
 mole-0.054
 ],-0.049
ゼウス-0.048
 ):-0.043
 ].-0.042
Tokenometer
Token ID15635
Feature activation+7.60e-01

Pos logit contributions
 mole+0.332
 ],+0.297
ゼウス+0.288
 ):+0.256
 ].+0.250
Neg logit contributions
ignt-0.381
GN-0.374
wic-0.369
aughs-0.367
stories-0.357
Token circuit
Token ID10349
Feature activation+5.30e-01

Pos logit contributions
ignt+0.504
GN+0.494
wic+0.487
aughs+0.481
stories+0.466
Neg logit contributions
 mole-0.609
 ],-0.556
ゼウス-0.543
 ):-0.492
 ].-0.481
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
ignt+0.325
GN+0.318
wic+0.313
aughs+0.310
stories+0.299
Neg logit contributions
 mole-0.451
 ],-0.414
ゼウス-0.405
 ):-0.370
 ].-0.362
Token
Token ID198
Feature activation+3.71e-01

Pos logit contributions
ignt+0.183
GN+0.180
wic+0.177
aughs+0.175
stories+0.169
Neg logit contributions
 mole-0.237
 ],-0.217
ゼウス-0.212
 ):-0.193
 ].-0.189
Token
Token ID198
Feature activation+2.71e-01

Pos logit contributions
GN+0.169
ignt+0.168
wic+0.165
aughs+0.158
stories+0.156
Neg logit contributions
 mole-0.373
ゼウス-0.343
 ],-0.342
 ):-0.311
 ].-0.306
TokenWATCH
Token ID35192
Feature activation+1.92e+00

Pos logit contributions
ignt+0.167
GN+0.164
wic+0.161
aughs+0.159
stories+0.154
Neg logit contributions
 mole-0.230
 ],-0.211
ゼウス-0.206
 ):-0.188
 ].-0.184
Token:
Token ID25
Feature activation+3.21e-01

Pos logit contributions
ignt+1.344
GN+1.332
wic+1.305
aughs+1.293
stories+1.258
Neg logit contributions
 mole-1.462
 ],-1.323
ゼウス-1.304
 ):-1.158
 ].-1.136
Token The
Token ID383
Feature activation+3.43e-02

Pos logit contributions
ignt+0.217
GN+0.214
wic+0.211
aughs+0.208
stories+0.202
Neg logit contributions
 mole-0.253
 ],-0.230
ゼウス-0.225
 ):-0.203
 ].-0.199
Token attempt
Token ID2230
Feature activation+6.25e-01

Pos logit contributions
ignt+0.022
GN+0.021
wic+0.021
aughs+0.021
stories+0.020
Neg logit contributions
 mole-0.028
 ],-0.026
ゼウス-0.026
 ):-0.023
 ].-0.023
Token will
Token ID481
Feature activation-1.53e-01

Pos logit contributions
ignt+0.457
GN+0.449
wic+0.443
aughs+0.439
stories+0.427
Neg logit contributions
 mole-0.457
 ],-0.413
ゼウス-0.403
 ):-0.359
 ].-0.352
Token be
Token ID307
Feature activation+2.19e-01

Pos logit contributions
 mole+0.135
 ],+0.124
ゼウス+0.121
 ):+0.111
 ].+0.109
Neg logit contributions
ignt-0.090
GN-0.088
wic-0.087
aughs-0.086
stories-0.083
Token credit
Token ID3884
Feature activation-8.38e-01

Pos logit contributions
 mole+0.038
 ],+0.035
ゼウス+0.033
 ):+0.030
 ].+0.030
Neg logit contributions
ignt-0.034
GN-0.034
wic-0.033
aughs-0.033
stories-0.032
Token for
Token ID329
Feature activation+6.98e-02

Pos logit contributions
 mole+0.589
 ],+0.530
ゼウス+0.507
 ):+0.463
 ].+0.447
Neg logit contributions
ignt-0.642
GN-0.627
aughs-0.621
wic-0.620
stories-0.593
Token the
Token ID262
Feature activation-6.12e-03

Pos logit contributions
ignt+0.049
GN+0.048
wic+0.048
aughs+0.047
stories+0.046
Neg logit contributions
 mole-0.053
 ],-0.048
ゼウス-0.047
 ):-0.042
 ].-0.041
Token hack
Token ID8156
Feature activation+4.61e-01

Pos logit contributions
 mole+0.005
 ],+0.004
ゼウス+0.004
 ):+0.004
 ].+0.004
Neg logit contributions
ignt-0.004
GN-0.004
wic-0.004
aughs-0.004
stories-0.004
Token,
Token ID11
Feature activation-1.70e-01

Pos logit contributions
ignt+0.291
GN+0.284
wic+0.280
aughs+0.278
stories+0.268
Neg logit contributions
 mole-0.386
 ],-0.354
ゼウス-0.345
 ):-0.315
 ].-0.308
Token Crowd
Token ID29452
Feature activation+1.91e+00

Pos logit contributions
 mole+0.136
 ],+0.125
ゼウス+0.122
 ):+0.110
 ].+0.108
Neg logit contributions
ignt-0.113
GN-0.111
wic-0.109
aughs-0.108
stories-0.105
Tokenstrike
Token ID33069
Feature activation+4.84e-01

Pos logit contributions
ignt+1.169
GN+1.149
wic+1.127
stories+1.087
aughs+1.077
Neg logit contributions
 mole-1.626
 ],-1.489
ゼウス-1.477
 ):-1.335
 ].-1.308
Token stood
Token ID6204
Feature activation-4.01e-01

Pos logit contributions
ignt+0.359
GN+0.355
wic+0.349
aughs+0.345
stories+0.337
Neg logit contributions
 mole-0.344
 ],-0.311
ゼウス-0.309
 ):-0.271
 ].-0.264
Token by
Token ID416
Feature activation+1.56e-01

Pos logit contributions
 mole+0.301
 ],+0.273
ゼウス+0.265
 ):+0.239
 ].+0.233
Neg logit contributions
ignt-0.287
GN-0.282
wic-0.278
aughs-0.275
stories-0.267
Token its
Token ID663
Feature activation+2.74e-01

Pos logit contributions
ignt+0.104
GN+0.102
wic+0.100
aughs+0.099
stories+0.096
Neg logit contributions
 mole-0.124
 ],-0.114
ゼウス-0.111
 ):-0.101
 ].-0.098
Token analysis
Token ID3781
Feature activation+2.13e-01

Pos logit contributions
ignt+0.162
GN+0.158
wic+0.155
aughs+0.154
stories+0.148
Neg logit contributions
 mole-0.240
 ],-0.221
ゼウス-0.217
 ):-0.198
 ].-0.194
Token major
Token ID1688
Feature activation+5.15e-01

Pos logit contributions
ignt+0.032
GN+0.031
wic+0.031
aughs+0.031
stories+0.030
Neg logit contributions
 mole-0.041
 ],-0.037
ゼウス-0.036
 ):-0.033
 ].-0.032
Token Oscars
Token ID46485
Feature activation-7.25e-01

Pos logit contributions
ignt+0.329
GN+0.322
wic+0.317
aughs+0.314
stories+0.304
Neg logit contributions
 mole-0.425
 ],-0.389
ゼウス-0.381
 ):-0.346
 ].-0.339
Token
Token ID198
Feature activation-1.82e-01

Pos logit contributions
 mole+0.582
 ],+0.530
ゼウス+0.514
 ):+0.470
 ].+0.458
Neg logit contributions
ignt-0.483
GN-0.472
wic-0.467
aughs-0.461
stories-0.448
Token
Token ID198
Feature activation-2.67e-01

Pos logit contributions
 mole+0.179
 ],+0.166
ゼウス+0.164
 ):+0.150
 ].+0.148
Neg logit contributions
ignt-0.086
GN-0.085
wic-0.083
aughs-0.080
stories-0.078
Tokencont
Token ID3642
Feature activation+1.70e-01

Pos logit contributions
 mole+0.220
 ],+0.202
ゼウス+0.197
 ):+0.180
 ].+0.176
Neg logit contributions
ignt-0.171
GN-0.166
wic-0.164
aughs-0.163
stories-0.157
Tokenender
Token ID2194
Feature activation-2.04e+00

Pos logit contributions
ignt+0.110
GN+0.107
wic+0.106
aughs+0.104
stories+0.101
Neg logit contributions
 mole-0.140
 ],-0.128
ゼウス-0.125
 ):-0.113
 ].-0.111
Token,
Token ID11
Feature activation-1.01e-01

Pos logit contributions
 mole+1.683
 ],+1.541
ゼウス+1.491
 ):+1.380
 ].+1.341
Neg logit contributions
ignt-1.302
GN-1.267
aughs-1.245
wic-1.244
stories-1.191
Token potentially
Token ID6196
Feature activation-5.47e-01

Pos logit contributions
 mole+0.082
 ],+0.074
ゼウス+0.073
 ):+0.066
 ].+0.064
Neg logit contributions
ignt-0.067
GN-0.066
wic-0.065
aughs-0.064
stories-0.062
Token picking
Token ID10868
Feature activation-4.82e-01

Pos logit contributions
 mole+0.426
 ],+0.388
ゼウス+0.378
 ):+0.342
 ].+0.334
Neg logit contributions
ignt-0.376
GN-0.369
wic-0.364
aughs-0.361
stories-0.349
Token up
Token ID510
Feature activation+2.34e-01

Pos logit contributions
 mole+0.418
 ],+0.384
ゼウス+0.376
 ):+0.344
 ].+0.337
Neg logit contributions
ignt-0.289
GN-0.282
wic-0.278
aughs-0.275
stories-0.265
Token nominations
Token ID30994
Feature activation-1.70e+00

Pos logit contributions
ignt+0.159
GN+0.156
wic+0.154
aughs+0.153
stories+0.148
Neg logit contributions
 mole-0.184
 ],-0.167
ゼウス-0.163
 ):-0.147
 ].-0.144
Token was
Token ID373
Feature activation-6.95e-01

Pos logit contributions
 mole+0.227
 ],+0.206
ゼウス+0.201
 ):+0.181
 ].+0.177
Neg logit contributions
ignt-0.209
GN-0.205
wic-0.203
aughs-0.201
stories-0.195
Token the
Token ID262
Feature activation-5.56e-01

Pos logit contributions
 mole+0.544
 ],+0.496
ゼウス+0.483
 ):+0.437
 ].+0.427
Neg logit contributions
ignt-0.474
GN-0.465
wic-0.458
aughs-0.454
stories-0.440
Token shin
Token ID40465
Feature activation-1.40e+00

Pos logit contributions
 mole+0.465
 ],+0.426
ゼウス+0.416
 ):+0.380
 ].+0.372
Neg logit contributions
ignt-0.349
GN-0.342
wic-0.337
aughs-0.333
stories-0.322
Tokenig
Token ID328
Feature activation-3.42e-02

Pos logit contributions
 mole+1.286
 ],+1.184
ゼウス+1.139
 ):+1.068
 ].+1.045
Neg logit contributions
ignt-0.772
GN-0.756
aughs-0.744
wic-0.743
stories-0.707
Tokenami
Token ID6277
Feature activation-9.01e-01

Pos logit contributions
 mole+0.023
 ],+0.021
ゼウス+0.020
 ):+0.018
 ].+0.018
Neg logit contributions
ignt-0.027
GN-0.026
wic-0.026
aughs-0.026
stories-0.025
Token art
Token ID1242
Feature activation-1.94e+00

Pos logit contributions
 mole+0.734
 ],+0.671
ゼウス+0.656
 ):+0.596
 ].+0.583
Neg logit contributions
ignt-0.585
GN-0.574
wic-0.565
aughs-0.559
stories-0.541
Token of
Token ID286
Feature activation-4.06e-01

Pos logit contributions
 mole+1.605
 ],+1.470
ゼウス+1.428
 ):+1.307
 ].+1.280
Neg logit contributions
ignt-1.237
GN-1.211
wic-1.190
aughs-1.184
stories-1.142
Token high
Token ID1029
Feature activation-6.14e-01

Pos logit contributions
 mole+0.318
 ],+0.289
ゼウス+0.282
 ):+0.255
 ].+0.250
Neg logit contributions
ignt-0.277
GN-0.271
wic-0.268
aughs-0.265
stories-0.257
Token-
Token ID12
Feature activation-3.91e-01

Pos logit contributions
 mole+0.531
 ],+0.489
ゼウス+0.476
 ):+0.437
 ].+0.428
Neg logit contributions
ignt-0.370
GN-0.361
wic-0.355
aughs-0.352
stories-0.338
Tokenspeed
Token ID12287
Feature activation-7.20e-01

Pos logit contributions
 mole+0.309
 ],+0.282
ゼウス+0.275
 ):+0.249
 ].+0.243
Neg logit contributions
ignt-0.264
GN-0.259
wic-0.256
aughs-0.253
stories-0.245
Token movement
Token ID3356
Feature activation+1.40e-02

Pos logit contributions
 mole+0.577
 ],+0.525
ゼウス+0.508
 ):+0.466
 ].+0.455
Neg logit contributions
ignt-0.482
GN-0.472
wic-0.465
aughs-0.462
stories-0.445
Token
Token ID30266
Feature activation-5.42e-01

Pos logit contributions
ignt+0.009
GN+0.008
wic+0.008
aughs+0.008
stories+0.008
Neg logit contributions
 mole-0.011
 ],-0.010
ゼウス-0.009
 ):-0.009
 ].-0.008
Token
Token ID109
Feature activation-6.80e-01

Pos logit contributions
 mole+0.322
 ],+0.284
ゼウス+0.275
 ):+0.239
 ].+0.231
Neg logit contributions
ignt-0.473
GN-0.465
wic-0.460
aughs-0.457
stories-0.446
Token
Token ID43095
Feature activation-9.74e-01

Pos logit contributions
 mole+0.340
 ],+0.292
ゼウス+0.281
 ):+0.236
 ].+0.226
Neg logit contributions
ignt-0.658
GN-0.648
wic-0.642
aughs-0.638
stories-0.624
Token
Token ID23877
Feature activation-2.87e-01

Pos logit contributions
 mole+0.625
 ],+0.558
ゼウス+0.542
 ):+0.478
 ].+0.462
Neg logit contributions
ignt-0.804
GN-0.788
wic-0.782
aughs-0.777
stories-0.755
Token
Token ID229
Feature activation-3.39e-01

Pos logit contributions
 mole+0.195
 ],+0.175
ゼウス+0.169
 ):+0.150
 ].+0.146
Neg logit contributions
ignt-0.226
GN-0.222
wic-0.220
aughs-0.218
stories-0.212
Token
Token ID162
Feature activation-1.92e+00

Pos logit contributions
 mole+0.223
 ],+0.199
ゼウス+0.193
 ):+0.171
 ].+0.166
Neg logit contributions
ignt-0.275
GN-0.270
wic-0.267
aughs-0.265
stories-0.258
Token
Token ID252
Feature activation-8.63e-01

Pos logit contributions
 mole+1.149
 ],+1.016
ゼウス+0.982
 ):+0.857
 ].+0.829
Neg logit contributions
ignt-1.661
GN-1.635
wic-1.620
aughs-1.612
stories-1.567
Token
Token ID250
Feature activation+1.02e-01

Pos logit contributions
 mole+0.620
 ],+0.559
ゼウス+0.544
 ):+0.487
 ].+0.474
Neg logit contributions
ignt-0.644
GN-0.634
wic-0.625
aughs-0.619
stories-0.601
Token
Token ID40367
Feature activation-1.02e+00

Pos logit contributions
ignt+0.079
GN+0.078
wic+0.077
aughs+0.077
stories+0.074
Neg logit contributions
 mole-0.070
 ],-0.063
ゼウス-0.061
 ):-0.054
 ].-0.053
Token
Token ID253
Feature activation+3.36e-01

Pos logit contributions
 mole+0.642
 ],+0.570
ゼウス+0.553
 ):+0.485
 ].+0.470
Neg logit contributions
ignt-0.856
GN-0.842
wic-0.833
aughs-0.827
stories-0.806
Token
Token ID161
Feature activation-7.72e-01

Pos logit contributions
ignt+0.263
GN+0.259
wic+0.256
aughs+0.253
stories+0.247
Neg logit contributions
 mole-0.229
 ],-0.206
ゼウス-0.200
 ):-0.177
 ].-0.173
Token are
Token ID389
Feature activation+3.57e-01

Pos logit contributions
 mole+0.121
 ],+0.109
ゼウス+0.106
 ):+0.097
 ].+0.094
Neg logit contributions
ignt-0.111
GN-0.109
wic-0.107
aughs-0.107
stories-0.103
Token in
Token ID287
Feature activation-1.45e-01

Pos logit contributions
ignt+0.238
GN+0.234
wic+0.230
aughs+0.229
stories+0.220
Neg logit contributions
 mole-0.287
 ],-0.261
ゼウス-0.254
 ):-0.231
 ].-0.226
Token a
Token ID257
Feature activation+7.18e-02

Pos logit contributions
 mole+0.110
 ],+0.100
ゼウス+0.097
 ):+0.088
 ].+0.086
Neg logit contributions
ignt-0.103
GN-0.101
wic-0.099
aughs-0.098
stories-0.095
Token nond
Token ID30745
Feature activation-1.75e-01

Pos logit contributions
ignt+0.048
GN+0.047
wic+0.046
aughs+0.045
stories+0.044
Neg logit contributions
 mole-0.058
 ],-0.053
ゼウス-0.051
 ):-0.047
 ].-0.046
Tokenesc
Token ID3798
Feature activation+4.14e-01

Pos logit contributions
 mole+0.116
 ],+0.103
ゼウス+0.100
 ):+0.089
 ].+0.086
Neg logit contributions
ignt-0.141
GN-0.139
wic-0.137
aughs-0.136
stories-0.133
Tokenript
Token ID1968
Feature activation-1.91e+00

Pos logit contributions
ignt+0.292
GN+0.287
wic+0.283
aughs+0.281
stories+0.272
Neg logit contributions
 mole-0.313
 ],-0.284
ゼウス-0.277
 ):-0.250
 ].-0.244
Token pre
Token ID662
Feature activation-2.93e-01

Pos logit contributions
 mole+1.459
 ],+1.314
ゼウス+1.264
 ):+1.154
 ].+1.126
Neg logit contributions
ignt-1.350
GN-1.324
aughs-1.298
wic-1.298
stories-1.253
Tokenwar
Token ID5767
Feature activation+1.25e-01

Pos logit contributions
 mole+0.254
 ],+0.234
ゼウス+0.225
 ):+0.210
 ].+0.205
Neg logit contributions
ignt-0.173
aughs-0.169
GN-0.166
wic-0.166
stories-0.157
Token building
Token ID2615
Feature activation-4.17e-01

Pos logit contributions
ignt+0.086
GN+0.084
wic+0.083
aughs+0.082
stories+0.080
Neg logit contributions
 mole-0.097
 ],-0.088
ゼウス-0.086
 ):-0.078
 ].-0.076
Token overlooking
Token ID34603
Feature activation-1.06e-02

Pos logit contributions
 mole+0.335
 ],+0.306
ゼウス+0.297
 ):+0.271
 ].+0.264
Neg logit contributions
ignt-0.277
GN-0.272
wic-0.267
aughs-0.267
stories-0.257
Token Herald
Token ID18277
Feature activation+1.05e+00

Pos logit contributions
 mole+0.008
 ],+0.007
ゼウス+0.007
 ):+0.006
 ].+0.006
Neg logit contributions
ignt-0.007
GN-0.007
wic-0.007
aughs-0.007
stories-0.007
Token has
Token ID468
Feature activation+4.62e-02

Pos logit contributions
ignt+0.158
GN+0.156
wic+0.154
aughs+0.152
stories+0.148
Neg logit contributions
 mole-0.155
 ],-0.140
ゼウス-0.136
 ):-0.121
 ].-0.119
Token contradicted
Token ID44853
Feature activation+3.75e-03

Pos logit contributions
ignt+0.032
GN+0.031
wic+0.031
aughs+0.030
stories+0.030
Neg logit contributions
 mole-0.036
 ],-0.033
ゼウス-0.032
 ):-0.029
 ].-0.028
Token inc
Token ID753
Feature activation-8.51e-01

Pos logit contributions
ignt+0.003
GN+0.002
wic+0.002
aughs+0.002
stories+0.002
Neg logit contributions
 mole-0.003
 ],-0.003
ゼウス-0.003
 ):-0.002
 ].-0.002
Tokenont
Token ID756
Feature activation-9.35e-01

Pos logit contributions
 mole+0.732
 ],+0.678
ゼウス+0.656
 ):+0.606
 ].+0.594
Neg logit contributions
ignt-0.510
wic-0.495
GN-0.495
aughs-0.488
stories-0.468
Tokenro
Token ID305
Feature activation+3.10e-01

Pos logit contributions
 mole+0.540
 ],+0.477
ゼウス+0.450
 ):+0.408
 ])+0.388
Neg logit contributions
ignt-0.824
GN-0.813
wic-0.801
aughs-0.795
stories-0.774
Tokenvert
Token ID1851
Feature activation-1.91e+00

Pos logit contributions
ignt+0.170
GN+0.166
wic+0.163
aughs+0.161
stories+0.155
Neg logit contributions
 mole-0.284
 ],-0.261
ゼウス-0.257
 ):-0.235
 ].-0.231
Tokenible
Token ID856
Feature activation-8.18e-01

Pos logit contributions
 mole+1.476
 ],+1.351
ゼウス+1.298
 ):+1.195
 ].+1.163
Neg logit contributions
ignt-1.302
GN-1.278
wic-1.258
aughs-1.251
stories-1.207
Token reality
Token ID3950
Feature activation+6.52e-02

Pos logit contributions
 mole+0.684
 ],+0.626
ゼウス+0.606
 ):+0.560
 ].+0.546
Neg logit contributions
ignt-0.519
GN-0.504
wic-0.500
aughs-0.497
stories-0.473
Token.
Token ID13
Feature activation+2.25e-02

Pos logit contributions
ignt+0.039
GN+0.038
wic+0.038
aughs+0.037
stories+0.036
Neg logit contributions
 mole-0.056
 ],-0.052
ゼウス-0.051
 ):-0.046
 ].-0.045
Token
Token ID198
Feature activation+5.21e-01

Pos logit contributions
ignt+0.013
GN+0.013
wic+0.013
aughs+0.013
stories+0.012
Neg logit contributions
 mole-0.020
 ],-0.018
ゼウス-0.018
 ):-0.016
 ].-0.016
Token
Token ID198
Feature activation+3.42e-01

Pos logit contributions
GN+0.238
ignt+0.236
wic+0.230
aughs+0.222
stories+0.218
Neg logit contributions
 mole-0.522
ゼウス-0.481
 ],-0.480
 ):-0.436
 ].-0.429
Tokenern
Token ID1142
Feature activation-5.17e-01

Pos logit contributions
 mole+0.178
 ],+0.163
ゼウス+0.159
 ):+0.146
 ].+0.142
Neg logit contributions
ignt-0.134
GN-0.131
wic-0.129
aughs-0.127
stories-0.123
Tokeni
Token ID72
Feature activation-4.91e-01

Pos logit contributions
 mole+0.435
 ],+0.398
ゼウス+0.386
 ):+0.355
 ].+0.347
Neg logit contributions
ignt-0.322
GN-0.317
wic-0.311
aughs-0.308
stories-0.299
Tokenère
Token ID35979
Feature activation-1.66e-01

Pos logit contributions
 mole+0.387
 ],+0.351
ゼウス+0.339
 ):+0.311
 ].+0.304
Neg logit contributions
ignt-0.333
GN-0.326
wic-0.323
aughs-0.320
stories-0.311
Tokenment
Token ID434
Feature activation-3.95e-01

Pos logit contributions
 mole+0.138
 ],+0.127
ゼウス+0.123
 ):+0.113
 ].+0.110
Neg logit contributions
ignt-0.106
GN-0.103
wic-0.102
aughs-0.101
stories-0.097
Token m
Token ID285
Feature activation+7.36e-02

Pos logit contributions
 mole+0.328
 ],+0.300
ゼウス+0.292
 ):+0.267
 ].+0.262
Neg logit contributions
ignt-0.251
GN-0.245
wic-0.242
aughs-0.240
stories-0.232
Tokenais
Token ID15152
Feature activation-1.90e+00

Pos logit contributions
ignt+0.063
GN+0.062
wic+0.062
aughs+0.061
stories+0.060
Neg logit contributions
 mole-0.044
 ],-0.039
ゼウス-0.037
 ):-0.033
 ].-0.032
Token pas
Token ID38836
Feature activation-5.66e-01

Pos logit contributions
 mole+1.590
 ],+1.451
ゼウス+1.390
 ):+1.295
 ].+1.269
Neg logit contributions
ignt-1.205
GN-1.181
wic-1.169
aughs-1.161
stories-1.124
Token mo
Token ID6941
Feature activation+3.08e-01

Pos logit contributions
 mole+0.466
 ],+0.425
ゼウス+0.413
 ):+0.378
 ].+0.370
Neg logit contributions
ignt-0.364
GN-0.357
wic-0.352
aughs-0.349
stories-0.338
Tokenind
Token ID521
Feature activation-7.34e-01

Pos logit contributions
ignt+0.170
GN+0.167
wic+0.165
aughs+0.163
stories+0.157
Neg logit contributions
 mole-0.279
 ],-0.258
ゼウス-0.252
 ):-0.232
 ].-0.227
Tokenre
Token ID260
Feature activation-5.26e-01

Pos logit contributions
 mole+0.648
 ],+0.595
ゼウス+0.580
 ):+0.535
 ].+0.523
Neg logit contributions
ignt-0.426
GN-0.419
wic-0.409
aughs-0.405
stories-0.391
Tokenment
Token ID434
Feature activation-1.88e-01

Pos logit contributions
 mole+0.427
 ],+0.390
ゼウス+0.379
 ):+0.346
 ].+0.339
Neg logit contributions
ignt-0.343
GN-0.337
wic-0.332
aughs-0.329
stories-0.319
Token seen
Token ID1775
Feature activation-2.97e-01

Pos logit contributions
 mole+0.096
 ],+0.087
ゼウス+0.086
 ):+0.077
 ].+0.075
Neg logit contributions
ignt-0.091
GN-0.090
wic-0.088
aughs-0.087
stories-0.085
Token as
Token ID355
Feature activation+2.59e-02

Pos logit contributions
 mole+0.267
 ],+0.246
ゼウス+0.240
 ):+0.221
 ].+0.217
Neg logit contributions
ignt-0.168
GN-0.164
wic-0.161
aughs-0.160
stories-0.153
Token a
Token ID257
Feature activation-1.39e-01

Pos logit contributions
ignt+0.018
GN+0.018
wic+0.018
aughs+0.018
stories+0.017
Neg logit contributions
 mole-0.020
 ],-0.018
ゼウス-0.018
 ):-0.016
 ].-0.015
Token racial
Token ID8425
Feature activation+1.23e-01

Pos logit contributions
 mole+0.111
 ],+0.102
ゼウス+0.100
 ):+0.090
 ].+0.088
Neg logit contributions
ignt-0.092
GN-0.091
wic-0.089
aughs-0.088
stories-0.085
Token epit
Token ID21240
Feature activation+1.56e-02

Pos logit contributions
ignt+0.098
GN+0.097
wic+0.096
aughs+0.095
stories+0.092
Neg logit contributions
 mole-0.082
 ],-0.074
ゼウス-0.072
 ):-0.063
 ].-0.062
Tokenhet
Token ID3202
Feature activation-1.85e+00

Pos logit contributions
ignt+0.009
GN+0.009
wic+0.008
aughs+0.008
stories+0.008
Neg logit contributions
 mole-0.014
 ],-0.013
ゼウス-0.013
 ):-0.012
 ].-0.012
Token that
Token ID326
Feature activation-1.32e-01

Pos logit contributions
 mole+1.540
 ],+1.409
ゼウス+1.367
 ):+1.252
 ].+1.226
Neg logit contributions
ignt-1.178
GN-1.150
wic-1.134
aughs-1.129
stories-1.088
Token targeted
Token ID7977
Feature activation-1.03e-01

Pos logit contributions
 mole+0.097
 ],+0.088
ゼウス+0.086
 ):+0.077
 ].+0.075
Neg logit contributions
ignt-0.096
GN-0.094
wic-0.093
aughs-0.092
stories-0.090
Token African
Token ID5510
Feature activation-3.77e-02

Pos logit contributions
 mole+0.081
 ],+0.073
ゼウス+0.072
 ):+0.065
 ].+0.063
Neg logit contributions
ignt-0.069
GN-0.068
wic-0.067
aughs-0.066
stories-0.064
Token-
Token ID12
Feature activation+8.85e-02

Pos logit contributions
 mole+0.029
 ],+0.027
ゼウス+0.026
 ):+0.023
 ].+0.023
Neg logit contributions
ignt-0.026
GN-0.026
wic-0.025
aughs-0.025
stories-0.024
TokenAmericans
Token ID17636
Feature activation-1.38e-01

Pos logit contributions
ignt+0.075
GN+0.074
wic+0.073
aughs+0.072
stories+0.071
Neg logit contributions
 mole-0.055
 ],-0.048
ゼウス-0.047
 ):-0.041
 ].-0.040
Token and
Token ID290
Feature activation+1.35e-01

Pos logit contributions
 mole+0.391
 ],+0.361
ゼウス+0.350
 ):+0.321
 ].+0.313
Neg logit contributions
ignt-0.297
GN-0.289
wic-0.286
aughs-0.284
stories-0.273
Token the
Token ID262
Feature activation+6.58e-02

Pos logit contributions
ignt+0.098
GN+0.096
wic+0.094
aughs+0.094
stories+0.091
Neg logit contributions
 mole-0.100
 ],-0.091
ゼウス-0.089
 ):-0.079
 ].-0.077
Token Stars
Token ID10271
Feature activation+1.37e-01

Pos logit contributions
ignt+0.042
GN+0.041
wic+0.040
aughs+0.040
stories+0.039
Neg logit contributions
 mole-0.054
 ],-0.050
ゼウス-0.049
 ):-0.044
 ].-0.043
Token re
Token ID302
Feature activation+4.12e-01

Pos logit contributions
ignt+0.083
GN+0.082
wic+0.080
aughs+0.079
stories+0.077
Neg logit contributions
 mole-0.117
 ],-0.108
ゼウス-0.106
 ):-0.096
 ].-0.094
Token-
Token ID12
Feature activation-5.33e-01

Pos logit contributions
ignt+0.208
GN+0.200
wic+0.199
aughs+0.196
stories+0.189
Neg logit contributions
 mole-0.395
 ],-0.368
ゼウス-0.358
 ):-0.333
 ].-0.327
Tokensign
Token ID12683
Feature activation-1.84e+00

Pos logit contributions
 mole+0.332
 ],+0.293
ゼウス+0.286
 ):+0.249
 ].+0.241
Neg logit contributions
ignt-0.450
GN-0.444
wic-0.438
aughs-0.434
stories-0.424
Token Lang
Token ID16332
Feature activation-1.81e-01

Pos logit contributions
 mole+1.337
 ],+1.213
ゼウス+1.176
 ):+1.058
 ].+1.032
Neg logit contributions
ignt-1.352
GN-1.324
wic-1.309
aughs-1.302
stories-1.261
Tokenen
Token ID268
Feature activation-1.74e-01

Pos logit contributions
 mole+0.188
 ],+0.173
ゼウス+0.172
 ):+0.158
 ].+0.155
Neg logit contributions
ignt-0.079
GN-0.077
wic-0.076
aughs-0.073
stories-0.071
Tokenbr
Token ID1671
Feature activation-2.19e-01

Pos logit contributions
 mole+0.184
 ],+0.171
ゼウス+0.169
 ):+0.156
 ].+0.154
Neg logit contributions
ignt-0.071
GN-0.069
wic-0.067
aughs-0.066
stories-0.063
Tokenun
Token ID403
Feature activation+3.50e-01

Pos logit contributions
 mole+0.221
 ],+0.203
ゼウス+0.202
 ):+0.185
 ].+0.182
Neg logit contributions
ignt-0.101
GN-0.098
wic-0.097
aughs-0.093
stories-0.091
Tokenner
Token ID1008
Feature activation-9.20e-01

Pos logit contributions
ignt+0.256
GN+0.253
wic+0.252
aughs+0.244
stories+0.243
Neg logit contributions
 mole-0.257
ゼウス-0.229
 ],-0.229
 ):-0.200
 ].-0.195
Token AND
Token ID5357
Feature activation-4.30e-01

Pos logit contributions
 mole+0.145
 ],+0.133
ゼウス+0.130
 ):+0.118
 ].+0.116
Neg logit contributions
ignt-0.112
GN-0.110
wic-0.108
aughs-0.107
stories-0.103
Token STEP
Token ID49154
Feature activation-6.53e-01

Pos logit contributions
 mole+0.336
 ],+0.307
ゼウス+0.298
 ):+0.272
 ].+0.265
Neg logit contributions
ignt-0.296
GN-0.287
wic-0.285
aughs-0.282
stories-0.273
TokenH
Token ID39
Feature activation-3.57e-01

Pos logit contributions
 mole+0.557
 ],+0.512
ゼウス+0.497
 ):+0.459
 ].+0.447
Neg logit contributions
ignt-0.403
aughs-0.386
wic-0.383
GN-0.382
stories-0.369
TokenEN
Token ID1677
Feature activation-1.96e-01

Pos logit contributions
 mole+0.266
 ],+0.242
ゼウス+0.235
 ):+0.212
 ].+0.206
Neg logit contributions
ignt-0.257
GN-0.252
wic-0.249
aughs-0.246
stories-0.239
Token M
Token ID337
Feature activation-1.67e-01

Pos logit contributions
 mole+0.142
 ],+0.128
ゼウス+0.124
 ):+0.112
 ].+0.109
Neg logit contributions
ignt-0.146
GN-0.142
wic-0.141
aughs-0.140
stories-0.136
Tokenatur
Token ID2541
Feature activation-1.84e+00

Pos logit contributions
 mole+0.118
 ],+0.107
ゼウス+0.103
 ):+0.093
 ].+0.090
Neg logit contributions
ignt-0.126
GN-0.123
wic-0.122
aughs-0.121
stories-0.118
Tokenin
Token ID259
Feature activation-1.11e-01

Pos logit contributions
 mole+1.732
 ],+1.606
ゼウス+1.572
 ):+1.453
 ].+1.426
Neg logit contributions
ignt-0.956
GN-0.931
wic-0.913
aughs-0.904
stories-0.864
Token first
Token ID717
Feature activation-1.31e-01

Pos logit contributions
 mole+0.089
 ],+0.082
ゼウス+0.080
 ):+0.072
 ].+0.071
Neg logit contributions
ignt-0.073
GN-0.071
wic-0.070
aughs-0.069
stories-0.067
Token meet
Token ID1826
Feature activation-4.30e-01

Pos logit contributions
 mole+0.088
 ],+0.079
ゼウス+0.078
 ):+0.068
 ].+0.066
Neg logit contributions
ignt-0.103
GN-0.102
wic-0.101
aughs-0.100
stories-0.097
Token in
Token ID287
Feature activation-2.25e-01

Pos logit contributions
 mole+0.342
 ],+0.312
ゼウス+0.303
 ):+0.276
 ].+0.270
Neg logit contributions
ignt-0.289
GN-0.283
wic-0.278
aughs-0.276
stories-0.267
Token April
Token ID3035
Feature activation+2.00e-01

Pos logit contributions
 mole+0.183
 ],+0.167
ゼウス+0.163
 ):+0.148
 ].+0.145
Neg logit contributions
ignt-0.147
GN-0.144
wic-0.142
aughs-0.140
stories-0.136
Token playoff
Token ID13119
Feature activation-7.35e-01

Pos logit contributions
 mole+0.026
 ],+0.024
ゼウス+0.024
 ):+0.022
 ].+0.021
Neg logit contributions
ignt-0.015
GN-0.015
wic-0.015
aughs-0.014
stories-0.014
Token round
Token ID2835
Feature activation-8.07e-01

Pos logit contributions
 mole+0.652
 ],+0.601
ゼウス+0.587
 ):+0.539
 ].+0.528
Neg logit contributions
ignt-0.426
GN-0.415
wic-0.409
aughs-0.405
stories-0.389
Token or
Token ID393
Feature activation+1.35e-01

Pos logit contributions
 mole+0.679
 ],+0.624
ゼウス+0.605
 ):+0.558
 ].+0.544
Neg logit contributions
ignt-0.505
GN-0.490
wic-0.486
aughs-0.483
stories-0.465
Token re
Token ID302
Feature activation+5.69e-01

Pos logit contributions
ignt+0.088
GN+0.087
wic+0.085
aughs+0.084
stories+0.081
Neg logit contributions
 mole-0.109
 ],-0.100
ゼウス-0.098
 ):-0.088
 ].-0.086
Token-
Token ID12
Feature activation-3.75e-01

Pos logit contributions
ignt+0.300
GN+0.294
wic+0.287
aughs+0.284
stories+0.272
Neg logit contributions
 mole-0.536
 ],-0.494
ゼウス-0.488
 ):-0.445
 ].-0.438
Tokensign
Token ID12683
Feature activation-1.83e+00

Pos logit contributions
 mole+0.243
 ],+0.214
ゼウス+0.211
 ):+0.182
 ].+0.177
Neg logit contributions
ignt-0.308
GN-0.305
wic-0.300
aughs-0.297
stories-0.292
Tokens
Token ID82
Feature activation-9.79e-01

Pos logit contributions
 mole+1.535
 ],+1.407
ゼウス+1.375
 ):+1.256
 ].+1.227
Neg logit contributions
ignt-1.146
GN-1.119
wic-1.104
aughs-1.098
stories-1.057
Token Lang
Token ID16332
Feature activation+1.04e-01

Pos logit contributions
 mole+0.769
 ],+0.701
ゼウス+0.683
 ):+0.619
 ].+0.605
Neg logit contributions
ignt-0.666
GN-0.653
wic-0.644
aughs-0.637
stories-0.618
Tokenen
Token ID268
Feature activation+6.49e-02

Pos logit contributions
ignt+0.045
GN+0.044
wic+0.043
aughs+0.041
stories+0.040
Neg logit contributions
 mole-0.108
 ],-0.099
ゼウス-0.099
 ):-0.090
 ].-0.089
Tokenbr
Token ID1671
Feature activation+1.00e-01

Pos logit contributions
ignt+0.026
GN+0.025
wic+0.025
aughs+0.024
stories+0.023
Neg logit contributions
 mole-0.069
 ],-0.064
ゼウス-0.063
 ):-0.059
 ].-0.058
Tokenun
Token ID403
Feature activation+4.16e-01

Pos logit contributions
ignt+0.045
GN+0.044
wic+0.044
aughs+0.042
stories+0.041
Neg logit contributions
 mole-0.102
 ],-0.093
ゼウス-0.093
 ):-0.085
 ].-0.084
Token
Token ID198
Feature activation+1.82e-02

Pos logit contributions
ignt+0.021
GN+0.020
wic+0.020
aughs+0.020
stories+0.019
Neg logit contributions
 mole-0.029
 ],-0.027
ゼウス-0.026
 ):-0.024
 ].-0.023
Token
Token ID198
Feature activation-1.26e-01

Pos logit contributions
GN+0.008
ignt+0.008
wic+0.008
aughs+0.008
stories+0.008
Neg logit contributions
 mole-0.018
ゼウス-0.017
 ],-0.017
 ):-0.015
 ].-0.015
TokenMeanwhile
Token ID10294
Feature activation+1.19e+00

Pos logit contributions
 mole+0.106
 ],+0.097
ゼウス+0.095
 ):+0.087
 ].+0.085
Neg logit contributions
ignt-0.078
GN-0.077
wic-0.075
aughs-0.074
stories-0.072
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
ignt+0.750
GN+0.736
wic+0.723
aughs+0.714
stories+0.690
Neg logit contributions
 mole-1.007
 ],-0.923
ゼウス-0.895
 ):-0.822
 ].-0.803
Token on
Token ID319
Feature activation-1.42e-01

Pos logit contributions
 mole+0.105
 ],+0.096
ゼウス+0.094
 ):+0.085
 ].+0.083
Neg logit contributions
ignt-0.086
GN-0.084
wic-0.083
aughs-0.082
stories-0.080
Token foreign
Token ID3215
Feature activation-1.82e+00

Pos logit contributions
 mole+0.111
 ],+0.101
ゼウス+0.099
 ):+0.089
 ].+0.087
Neg logit contributions
ignt-0.098
GN-0.096
wic-0.095
aughs-0.094
stories-0.091
Token policy
Token ID2450
Feature activation-5.51e-01

Pos logit contributions
 mole+1.566
 ],+1.432
ゼウス+1.381
 ):+1.284
 ].+1.251
Neg logit contributions
ignt-1.107
GN-1.084
wic-1.066
aughs-1.051
stories-1.007
Token --
Token ID1377
Feature activation-2.37e-01

Pos logit contributions
 mole+0.466
 ],+0.427
ゼウス+0.416
 ):+0.381
 ].+0.373
Neg logit contributions
ignt-0.342
GN-0.335
wic-0.330
aughs-0.326
stories-0.315
Token another
Token ID1194
Feature activation-4.84e-01

Pos logit contributions
 mole+0.189
 ],+0.172
ゼウス+0.169
 ):+0.152
 ].+0.149
Neg logit contributions
ignt-0.158
GN-0.155
wic-0.153
aughs-0.151
stories-0.147
Token Carter
Token ID10831
Feature activation+3.26e-02

Pos logit contributions
 mole+0.393
 ],+0.359
ゼウス+0.349
 ):+0.318
 ].+0.311
Neg logit contributions
ignt-0.317
GN-0.311
wic-0.306
aughs-0.303
stories-0.294
Token weak
Token ID4939
Feature activation-2.73e-01

Pos logit contributions
ignt+0.020
GN+0.020
wic+0.020
aughs+0.019
stories+0.019
Neg logit contributions
 mole-0.027
 ],-0.025
ゼウス-0.025
 ):-0.022
 ].-0.022
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
ignt+0.381
GN+0.373
wic+0.367
aughs+0.362
stories+0.350
Neg logit contributions
 mole-0.575
 ],-0.529
ゼウス-0.519
 ):-0.474
 ].-0.465
Token<|endoftext|>
Token ID50256
Feature activation-4.78e-01

Pos logit contributions
ignt+0.115
GN+0.114
wic+0.112
aughs+0.111
stories+0.108
Neg logit contributions
 mole-0.132
 ],-0.119
ゼウス-0.117
 ):-0.105
 ].-0.103
TokenB
Token ID33
Feature activation+1.28e-01

Pos logit contributions
 mole+0.399
 ],+0.368
ゼウス+0.358
 ):+0.328
 ].+0.321
Neg logit contributions
ignt-0.301
GN-0.289
wic-0.288
aughs-0.287
stories-0.274
Tokenrock
Token ID10823
Feature activation+8.53e-01

Pos logit contributions
ignt+0.086
GN+0.084
wic+0.083
aughs+0.082
stories+0.080
Neg logit contributions
 mole-0.101
 ],-0.093
ゼウス-0.090
 ):-0.082
 ].-0.080
Token Les
Token ID11732
Feature activation-6.85e-01

Pos logit contributions
ignt+0.588
GN+0.577
wic+0.569
aughs+0.563
stories+0.546
Neg logit contributions
 mole-0.662
 ],-0.602
ゼウス-0.588
 ):-0.530
 ].-0.518
Tokennar
Token ID23955
Feature activation-1.81e+00

Pos logit contributions
 mole+0.633
 ],+0.588
ゼウス+0.564
 ):+0.532
 ].+0.521
Neg logit contributions
ignt-0.365
GN-0.353
wic-0.349
aughs-0.348
stories-0.326
Token has
Token ID468
Feature activation+2.54e-02

Pos logit contributions
 mole+1.338
 ],+1.213
ゼウス+1.167
 ):+1.061
 ].+1.034
Neg logit contributions
ignt-1.316
GN-1.290
wic-1.268
aughs-1.259
stories-1.224
Token been
Token ID587
Feature activation-4.21e-02

Pos logit contributions
ignt+0.018
GN+0.018
wic+0.018
aughs+0.017
stories+0.017
Neg logit contributions
 mole-0.019
 ],-0.017
ゼウス-0.017
 ):-0.015
 ].-0.015
Token flagged
Token ID34060
Feature activation-3.74e-02

Pos logit contributions
 mole+0.031
 ],+0.028
ゼウス+0.027
 ):+0.024
 ].+0.024
Neg logit contributions
ignt-0.031
GN-0.030
wic-0.030
aughs-0.030
stories-0.029
Token by
Token ID416
Feature activation+3.33e-01

Pos logit contributions
 mole+0.029
 ],+0.027
ゼウス+0.026
 ):+0.024
 ].+0.023
Neg logit contributions
ignt-0.025
GN-0.025
wic-0.025
aughs-0.024
stories-0.024
Token the
Token ID262
Feature activation-2.26e-02

Pos logit contributions
ignt+0.206
GN+0.203
wic+0.199
aughs+0.197
stories+0.191
Neg logit contributions
 mole-0.279
 ],-0.256
ゼウス-0.252
 ):-0.228
 ].-0.224
Token
Token ID198
Feature activation+2.51e-01

Pos logit contributions
ignt+0.032
GN+0.031
wic+0.031
aughs+0.030
stories+0.029
Neg logit contributions
 mole-0.044
 ],-0.041
ゼウス-0.040
 ):-0.036
 ].-0.035
Token
Token ID198
Feature activation+6.97e-02

Pos logit contributions
ignt+0.116
GN+0.116
wic+0.112
aughs+0.108
stories+0.106
Neg logit contributions
 mole-0.250
 ],-0.230
ゼウス-0.230
 ):-0.209
 ].-0.206
TokenSo
Token ID2396
Feature activation-3.35e-01

Pos logit contributions
ignt+0.042
GN+0.041
wic+0.041
aughs+0.040
stories+0.039
Neg logit contributions
 mole-0.060
 ],-0.055
ゼウス-0.054
 ):-0.049
 ].-0.048
Token more
Token ID517
Feature activation-3.45e-02

Pos logit contributions
 mole+0.281
 ],+0.258
ゼウス+0.252
 ):+0.230
 ].+0.225
Neg logit contributions
ignt-0.209
GN-0.205
wic-0.201
aughs-0.199
stories-0.193
Token OFFIC
Token ID47601
Feature activation-4.17e-01

Pos logit contributions
 mole+0.030
 ],+0.028
ゼウス+0.027
 ):+0.025
 ].+0.024
Neg logit contributions
ignt-0.020
GN-0.020
wic-0.020
aughs-0.019
stories-0.019
TokenIAL
Token ID12576
Feature activation-1.81e+00

Pos logit contributions
 mole+0.293
 ],+0.266
ゼウス+0.254
 ):+0.230
 ].+0.224
Neg logit contributions
ignt-0.321
wic-0.309
GN-0.307
aughs-0.306
stories-0.296
Token announcements
Token ID24009
Feature activation+2.94e-01

Pos logit contributions
 mole+1.595
 ],+1.471
ゼウス+1.428
 ):+1.318
 ].+1.295
Neg logit contributions
ignt-1.057
GN-1.017
wic-1.014
aughs-1.001
stories-0.956
Token soon
Token ID2582
Feature activation+1.25e-01

Pos logit contributions
ignt+0.189
GN+0.186
wic+0.183
aughs+0.180
stories+0.175
Neg logit contributions
 mole-0.240
 ],-0.220
ゼウス-0.215
 ):-0.195
 ].-0.191
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
ignt+0.075
GN+0.074
wic+0.072
aughs+0.072
stories+0.069
Neg logit contributions
 mole-0.107
 ],-0.099
ゼウス-0.097
 ):-0.088
 ].-0.086
Token Be
Token ID1355
Feature activation+1.27e-01

Pos logit contributions
 mole+0.189
 ],+0.174
ゼウス+0.169
 ):+0.155
 ].+0.151
Neg logit contributions
ignt-0.144
GN-0.140
wic-0.138
aughs-0.137
stories-0.132
Token careful
Token ID8161
Feature activation+2.44e-01

Pos logit contributions
ignt+0.107
GN+0.105
wic+0.104
aughs+0.103
stories+0.100
Neg logit contributions
 mole-0.080
 ],-0.071
ゼウス-0.069
 ):-0.060
 ].-0.058
Token
Token ID251
Feature activation-3.15e-01

Pos logit contributions
 mole+0.224
 ],+0.196
ゼウス+0.189
 ):+0.160
 ].+0.154
Neg logit contributions
ignt-0.406
GN-0.402
wic-0.400
aughs-0.395
stories-0.385
Token and
Token ID290
Feature activation-9.07e-02

Pos logit contributions
 mole+0.257
 ],+0.235
ゼウス+0.230
 ):+0.209
 ].+0.204
Neg logit contributions
ignt-0.204
GN-0.200
wic-0.198
aughs-0.195
stories-0.189
Token
Token ID564
Feature activation-2.40e-02

Pos logit contributions
 mole+0.076
 ],+0.069
ゼウス+0.068
 ):+0.061
 ].+0.060
Neg logit contributions
ignt-0.057
GN-0.056
wic-0.055
aughs-0.055
stories-0.053
Token
Token ID250
Feature activation-1.70e-01

Pos logit contributions
 mole+0.019
 ],+0.018
ゼウス+0.017
 ):+0.016
 ].+0.015
Neg logit contributions
ignt-0.016
GN-0.016
wic-0.015
aughs-0.015
stories-0.015
Tokenstre
Token ID22853
Feature activation+5.84e-02

Pos logit contributions
 mole+0.144
 ],+0.132
ゼウス+0.130
 ):+0.118
 ].+0.116
Neg logit contributions
ignt-0.105
GN-0.103
wic-0.101
aughs-0.100
stories-0.097
Tokenng
Token ID782
Feature activation-1.78e+00

Pos logit contributions
ignt+0.034
GN+0.033
wic+0.033
aughs+0.032
stories+0.031
Neg logit contributions
 mole-0.052
 ],-0.047
ゼウス-0.047
 ):-0.042
 ].-0.042
Tokenthen
Token ID8524
Feature activation-7.96e-01

Pos logit contributions
 mole+1.506
 ],+1.392
ゼウス+1.355
 ):+1.244
 ].+1.221
Neg logit contributions
ignt-1.095
GN-1.074
wic-1.057
aughs-1.051
stories-1.007
Token faculty
Token ID12829
Feature activation+3.97e-01

Pos logit contributions
 mole+0.646
 ],+0.590
ゼウス+0.577
 ):+0.523
 ].+0.512
Neg logit contributions
ignt-0.520
GN-0.510
wic-0.502
aughs-0.497
stories-0.481
Token tenure
Token ID17081
Feature activation+2.16e-01

Pos logit contributions
ignt+0.268
GN+0.262
wic+0.259
aughs+0.256
stories+0.248
Neg logit contributions
 mole-0.314
 ],-0.286
ゼウス-0.279
 ):-0.252
 ].-0.247
Token systems
Token ID3341
Feature activation-1.55e-01

Pos logit contributions
ignt+0.136
GN+0.133
wic+0.131
aughs+0.129
stories+0.125
Neg logit contributions
 mole-0.180
 ],-0.165
ゼウス-0.161
 ):-0.147
 ].-0.144
Token.
Token ID13
Feature activation-2.37e-01

Pos logit contributions
 mole+0.132
 ],+0.122
ゼウス+0.119
 ):+0.109
 ].+0.106
Neg logit contributions
ignt-0.095
GN-0.092
wic-0.091
aughs-0.090
stories-0.087
Token last
Token ID938
Feature activation-2.48e-01

Pos logit contributions
ignt+0.043
GN+0.043
wic+0.042
aughs+0.041
stories+0.040
Neg logit contributions
 mole-0.053
 ],-0.048
ゼウス-0.048
 ):-0.043
 ].-0.042
Token mile
Token ID10591
Feature activation-5.27e-01

Pos logit contributions
 mole+0.207
 ],+0.190
ゼウス+0.185
 ):+0.169
 ].+0.165
Neg logit contributions
ignt-0.156
GN-0.153
wic-0.151
aughs-0.150
stories-0.144
Token connectivity
Token ID19843
Feature activation-1.14e-01

Pos logit contributions
 mole+0.427
 ],+0.390
ゼウス+0.380
 ):+0.346
 ].+0.338
Neg logit contributions
ignt-0.346
GN-0.338
wic-0.334
aughs-0.331
stories-0.320
Token and
Token ID290
Feature activation-3.88e-02

Pos logit contributions
 mole+0.095
 ],+0.087
ゼウス+0.085
 ):+0.078
 ].+0.076
Neg logit contributions
ignt-0.072
GN-0.070
wic-0.069
aughs-0.069
stories-0.066
Token public
Token ID1171
Feature activation+8.02e-02

Pos logit contributions
 mole+0.031
 ],+0.029
ゼウス+0.028
 ):+0.025
 ].+0.025
Neg logit contributions
ignt-0.025
GN-0.025
wic-0.025
aughs-0.024
stories-0.024
Token transport
Token ID4839
Feature activation-1.77e+00

Pos logit contributions
ignt+0.052
GN+0.050
wic+0.050
aughs+0.049
stories+0.048
Neg logit contributions
 mole-0.066
 ],-0.060
ゼウス-0.059
 ):-0.054
 ].-0.053
Token,
Token ID11
Feature activation-1.02e-01

Pos logit contributions
 mole+1.403
 ],+1.280
ゼウス+1.243
 ):+1.133
 ].+1.107
Neg logit contributions
ignt-1.199
GN-1.168
wic-1.153
aughs-1.149
stories-1.103
Token automation
Token ID22771
Feature activation-6.94e-01

Pos logit contributions
 mole+0.080
 ],+0.073
ゼウス+0.072
 ):+0.065
 ].+0.063
Neg logit contributions
ignt-0.069
GN-0.068
wic-0.067
aughs-0.066
stories-0.064
Token and
Token ID290
Feature activation-8.27e-02

Pos logit contributions
 mole+0.574
 ],+0.527
ゼウス+0.513
 ):+0.469
 ].+0.459
Neg logit contributions
ignt-0.442
GN-0.432
wic-0.428
aughs-0.424
stories-0.407
Token computer
Token ID3644
Feature activation-5.71e-01

Pos logit contributions
 mole+0.068
 ],+0.062
ゼウス+0.061
 ):+0.055
 ].+0.054
Neg logit contributions
ignt-0.053
GN-0.052
wic-0.051
aughs-0.051
stories-0.049
Tokenization
Token ID1634
Feature activation+1.07e-01

Pos logit contributions
 mole+0.471
 ],+0.429
ゼウス+0.419
 ):+0.383
 ].+0.375
Neg logit contributions
ignt-0.368
wic-0.359
GN-0.358
aughs-0.354
stories-0.341
Token officer
Token ID3818
Feature activation+2.90e-01

Pos logit contributions
ignt+0.538
GN+0.527
wic+0.519
aughs+0.513
stories+0.497
Neg logit contributions
 mole-0.674
 ],-0.618
ゼウス-0.606
 ):-0.549
 ].-0.537
Token,
Token ID11
Feature activation-1.27e-01

Pos logit contributions
ignt+0.179
GN+0.175
wic+0.172
aughs+0.171
stories+0.165
Neg logit contributions
 mole-0.247
 ],-0.226
ゼウス-0.220
 ):-0.202
 ].-0.197
Token and
Token ID290
Feature activation-7.97e-02

Pos logit contributions
 mole+0.105
 ],+0.096
ゼウス+0.093
 ):+0.085
 ].+0.083
Neg logit contributions
ignt-0.081
GN-0.080
wic-0.078
aughs-0.077
stories-0.075
Token Stephen
Token ID7970
Feature activation+1.09e-01

Pos logit contributions
 mole+0.061
 ],+0.056
ゼウス+0.054
 ):+0.049
 ].+0.048
Neg logit contributions
ignt-0.056
GN-0.055
wic-0.054
aughs-0.053
stories-0.052
Token M
Token ID337
Feature activation+1.67e-01

Pos logit contributions
ignt+0.076
GN+0.075
wic+0.074
aughs+0.073
stories+0.071
Neg logit contributions
 mole-0.084
 ],-0.076
ゼウス-0.074
 ):-0.067
 ].-0.065
Tokenatur
Token ID2541
Feature activation-1.77e+00

Pos logit contributions
ignt+0.106
GN+0.104
wic+0.102
aughs+0.101
stories+0.098
Neg logit contributions
 mole-0.138
 ],-0.127
ゼウス-0.122
 ):-0.113
 ].-0.110
Tokenin
Token ID259
Feature activation+8.85e-02

Pos logit contributions
 mole+1.567
 ],+1.457
ゼウス+1.394
 ):+1.308
 ].+1.278
Neg logit contributions
ignt-1.009
GN-0.997
aughs-0.952
wic-0.950
stories-0.916
Token,
Token ID11
Feature activation-1.76e-01

Pos logit contributions
ignt+0.056
GN+0.055
wic+0.054
aughs+0.053
stories+0.052
Neg logit contributions
 mole-0.074
 ],-0.067
ゼウス-0.065
 ):-0.060
 ].-0.058
Token a
Token ID257
Feature activation-7.21e-02

Pos logit contributions
 mole+0.133
 ],+0.121
ゼウス+0.117
 ):+0.106
 ].+0.103
Neg logit contributions
ignt-0.125
GN-0.123
wic-0.121
aughs-0.120
stories-0.116
Token penn
Token ID22429
Feature activation-2.88e-01

Pos logit contributions
 mole+0.055
 ],+0.050
ゼウス+0.049
 ):+0.044
 ].+0.043
Neg logit contributions
ignt-0.050
GN-0.049
wic-0.049
aughs-0.048
stories-0.047
Tokeniless
Token ID30608
Feature activation-5.41e-01

Pos logit contributions
 mole+0.188
 ],+0.168
ゼウス+0.162
 ):+0.144
 ].+0.139
Neg logit contributions
ignt-0.234
GN-0.231
wic-0.228
aughs-0.227
stories-0.221
Token Ted
Token ID11396
Feature activation-2.99e-01

Pos logit contributions
ignt+0.035
GN+0.035
wic+0.034
aughs+0.034
stories+0.033
Neg logit contributions
 mole-0.042
 ],-0.038
ゼウス-0.037
 ):-0.033
 ].-0.033
Token H
Token ID367
Feature activation+3.44e-02

Pos logit contributions
 mole+0.224
 ],+0.203
ゼウス+0.198
 ):+0.178
 ].+0.174
Neg logit contributions
ignt-0.213
GN-0.209
wic-0.207
aughs-0.205
stories-0.199
Tokenagg
Token ID9460
Feature activation-6.40e-01

Pos logit contributions
ignt+0.023
GN+0.023
wic+0.022
aughs+0.022
stories+0.021
Neg logit contributions
 mole-0.027
 ],-0.025
ゼウス-0.024
 ):-0.022
 ].-0.021
Tokenard
Token ID446
Feature activation+2.07e-01

Pos logit contributions
 mole+0.455
 ],+0.411
ゼウス+0.399
 ):+0.357
 ].+0.348
Neg logit contributions
ignt-0.482
GN-0.474
wic-0.467
aughs-0.463
stories-0.451
Token is
Token ID318
Feature activation-1.81e-01

Pos logit contributions
ignt+0.148
GN+0.145
wic+0.144
aughs+0.142
stories+0.138
Neg logit contributions
 mole-0.154
 ],-0.140
ゼウス-0.137
 ):-0.122
 ].-0.119
Token prof
Token ID1534
Feature activation-1.77e+00

Pos logit contributions
 mole+0.139
 ],+0.126
ゼウス+0.123
 ):+0.111
 ].+0.108
Neg logit contributions
ignt-0.127
GN-0.124
wic-0.123
aughs-0.121
stories-0.118
Tokeniled
Token ID3902
Feature activation-1.16e+00

Pos logit contributions
 mole+1.571
 ],+1.461
ゼウス+1.413
 ):+1.314
 ].+1.287
Neg logit contributions
ignt-0.999
GN-0.979
wic-0.961
aughs-0.956
stories-0.923
Token in
Token ID287
Feature activation-1.67e-01

Pos logit contributions
 mole+0.924
 ],+0.844
ゼウス+0.818
 ):+0.746
 ].+0.730
Neg logit contributions
ignt-0.779
GN-0.763
wic-0.751
aughs-0.746
stories-0.718
Token the
Token ID262
Feature activation-1.59e-01

Pos logit contributions
 mole+0.128
 ],+0.116
ゼウス+0.113
 ):+0.102
 ].+0.100
Neg logit contributions
ignt-0.117
GN-0.114
wic-0.113
aughs-0.112
stories-0.108
Token new
Token ID649
Feature activation-2.77e-01

Pos logit contributions
 mole+0.137
 ],+0.126
ゼウス+0.122
 ):+0.112
 ].+0.110
Neg logit contributions
ignt-0.096
GN-0.094
wic-0.093
aughs-0.092
stories-0.088
Token G
Token ID402
Feature activation+2.07e-02

Pos logit contributions
 mole+0.244
 ],+0.224
ゼウス+0.218
 ):+0.201
 ].+0.197
Neg logit contributions
ignt-0.163
GN-0.158
wic-0.156
aughs-0.154
stories-0.148
TokenPaul
Token ID12041
Feature activation-5.10e-02

Pos logit contributions
ignt+0.122
GN+0.120
wic+0.118
aughs+0.117
stories+0.113
Neg logit contributions
 mole-0.160
 ],-0.147
ゼウス-0.143
 ):-0.130
 ].-0.128
Token Ryan
Token ID6047
Feature activation-4.53e-01

Pos logit contributions
 mole+0.042
 ],+0.038
ゼウス+0.037
 ):+0.034
 ].+0.033
Neg logit contributions
ignt-0.033
GN-0.033
wic-0.032
aughs-0.032
stories-0.031
Token
Token ID198
Feature activation+1.44e-01

Pos logit contributions
 mole+0.406
 ],+0.377
ゼウス+0.360
 ):+0.340
 ].+0.331
Neg logit contributions
ignt-0.260
GN-0.252
wic-0.246
aughs-0.244
stories-0.234
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
ignt+0.067
GN+0.066
wic+0.065
aughs+0.063
stories+0.061
Neg logit contributions
 mole-0.143
 ],-0.132
ゼウス-0.131
 ):-0.120
 ].-0.118
TokenOn
Token ID2202
Feature activation+1.30e-01

Pos logit contributions
ignt+0.070
GN+0.068
wic+0.067
aughs+0.067
stories+0.064
Neg logit contributions
 mole-0.084
 ],-0.077
ゼウス-0.075
 ):-0.068
 ].-0.066
Token Aug
Token ID2447
Feature activation-1.76e+00

Pos logit contributions
ignt+0.077
GN+0.076
wic+0.074
aughs+0.073
stories+0.071
Neg logit contributions
 mole-0.113
 ],-0.104
ゼウス-0.102
 ):-0.093
 ].-0.091
Token.
Token ID13
Feature activation+3.99e-02

Pos logit contributions
 mole+1.577
 ],+1.457
ゼウス+1.422
 ):+1.310
 ].+1.285
Neg logit contributions
ignt-1.006
GN-0.980
wic-0.966
aughs-0.957
stories-0.917
Token 12
Token ID1105
Feature activation-3.96e-01

Pos logit contributions
GN+0.022
ignt+0.022
wic+0.021
aughs+0.020
stories+0.020
Neg logit contributions
 mole-0.036
ゼウス-0.033
 ],-0.033
 ):-0.030
 ].-0.029
Token,
Token ID11
Feature activation+1.51e-01

Pos logit contributions
 mole+0.322
 ],+0.298
ゼウス+0.282
 ):+0.265
 ].+0.258
Neg logit contributions
ignt-0.259
GN-0.251
wic-0.247
aughs-0.246
stories-0.235
Token 2012
Token ID2321
Feature activation-2.34e-01

Pos logit contributions
ignt+0.071
GN+0.069
wic+0.067
aughs+0.066
stories+0.063
Neg logit contributions
 mole-0.151
 ],-0.140
ゼウス-0.138
 ):-0.128
 ].-0.125
Token,
Token ID11
Feature activation+5.25e-02

Pos logit contributions
 mole+0.190
 ],+0.175
ゼウス+0.167
 ):+0.155
 ].+0.150
Neg logit contributions
ignt-0.155
GN-0.150
wic-0.148
aughs-0.147
stories-0.141
Token is
Token ID318
Feature activation-4.42e-02

Pos logit contributions
ignt+0.102
GN+0.100
wic+0.099
aughs+0.098
stories+0.095
Neg logit contributions
 mole-0.113
 ],-0.103
ゼウス-0.100
 ):-0.091
 ].-0.089
Token saying
Token ID2282
Feature activation+1.08e-01

Pos logit contributions
 mole+0.035
 ],+0.032
ゼウス+0.031
 ):+0.028
 ].+0.028
Neg logit contributions
ignt-0.030
GN-0.029
wic-0.029
aughs-0.028
stories-0.027
Token
Token ID10545
Feature activation-1.63e+00

Pos logit contributions
ignt+0.074
GN+0.073
wic+0.072
aughs+0.071
stories+0.069
Neg logit contributions
 mole-0.084
 ],-0.076
ゼウス-0.075
 ):-0.067
 ].-0.066
Token
Token ID230
Feature activation-4.04e-01

Pos logit contributions
 mole+0.974
 ],+0.862
ゼウス+0.835
 ):+0.727
 ].+0.704
Neg logit contributions
ignt-1.409
GN-1.384
wic-1.371
aughs-1.368
stories-1.329
Token
Token ID239
Feature activation-8.70e-01

Pos logit contributions
 mole+0.264
 ],+0.237
ゼウス+0.230
 ):+0.203
 ].+0.197
Neg logit contributions
ignt-0.327
GN-0.320
wic-0.318
aughs-0.316
stories-0.308
Token
Token ID46349
Feature activation-1.76e+00

Pos logit contributions
 mole+0.581
 ],+0.520
ゼウス+0.505
 ):+0.447
 ].+0.435
Neg logit contributions
ignt-0.694
GN-0.682
wic-0.674
aughs-0.669
stories-0.651
Token
Token ID111
Feature activation-1.61e+00

Pos logit contributions
 mole+0.904
 ],+0.780
ゼウス+0.753
 ):+0.634
 ].+0.610
Neg logit contributions
ignt-1.666
GN-1.641
wic-1.629
aughs-1.617
stories-1.582
Token
Token ID17358
Feature activation-1.32e+00

Pos logit contributions
 mole+1.102
 ],+0.990
ゼウス+0.964
 ):+0.858
 ].+0.833
Neg logit contributions
ignt-1.255
GN-1.228
wic-1.217
aughs-1.213
stories-1.176
Token
Token ID223
Feature activation-7.80e-01

Pos logit contributions
 mole+0.671
 ],+0.580
ゼウス+0.556
 ):+0.469
 ].+0.450
Neg logit contributions
ignt-1.264
GN-1.246
wic-1.234
aughs-1.226
stories-1.200
Token
Token ID31660
Feature activation-1.29e+00

Pos logit contributions
 mole+0.533
 ],+0.479
ゼウス+0.465
 ):+0.414
 ].+0.403
Neg logit contributions
ignt-0.610
GN-0.599
wic-0.592
aughs-0.588
stories-0.571
Token
Token ID25001
Feature activation-9.03e-01

Pos logit contributions
 mole+0.846
 ],+0.756
ゼウス+0.734
 ):+0.650
 ].+0.630
Neg logit contributions
ignt-1.049
GN-1.028
wic-1.018
aughs-1.014
stories-0.985
Token of
Token ID286
Feature activation-2.49e-01

Pos logit contributions
ignt+0.404
GN+0.395
wic+0.392
aughs+0.390
stories+0.375
Neg logit contributions
 mole-0.511
 ],-0.471
ゼウス-0.454
 ):-0.418
 ].-0.409
Token the
Token ID262
Feature activation-1.55e-01

Pos logit contributions
 mole+0.188
 ],+0.170
ゼウス+0.166
 ):+0.149
 ].+0.146
Neg logit contributions
ignt-0.177
GN-0.174
wic-0.171
aughs-0.170
stories-0.165
Token 5
Token ID642
Feature activation-1.09e-01

Pos logit contributions
 mole+0.135
 ],+0.124
ゼウス+0.121
 ):+0.111
 ].+0.109
Neg logit contributions
ignt-0.092
GN-0.090
wic-0.088
aughs-0.087
stories-0.084
Token PM
Token ID3122
Feature activation-5.77e-03

Pos logit contributions
 mole+0.094
 ],+0.086
ゼウス+0.084
 ):+0.077
 ].+0.075
Neg logit contributions
ignt-0.066
GN-0.064
wic-0.063
aughs-0.062
stories-0.060
Token Monday
Token ID3321
Feature activation-2.52e-01

Pos logit contributions
 mole+0.005
 ],+0.005
ゼウス+0.005
 ):+0.004
 ].+0.004
Neg logit contributions
ignt-0.003
GN-0.003
wic-0.003
aughs-0.003
stories-0.003
Token deadline
Token ID12508
Feature activation-1.75e+00

Pos logit contributions
 mole+0.191
 ],+0.174
ゼウス+0.169
 ):+0.153
 ].+0.149
Neg logit contributions
ignt-0.177
GN-0.174
wic-0.172
aughs-0.170
stories-0.165
Token for
Token ID329
Feature activation-2.94e-01

Pos logit contributions
 mole+1.487
 ],+1.371
ゼウス+1.327
 ):+1.222
 ].+1.202
Neg logit contributions
ignt-1.076
GN-1.055
wic-1.038
aughs-1.033
stories-0.999
Token all
Token ID477
Feature activation-1.38e-01

Pos logit contributions
 mole+0.240
 ],+0.219
ゼウス+0.215
 ):+0.194
 ].+0.190
Neg logit contributions
ignt-0.191
GN-0.187
wic-0.184
aughs-0.182
stories-0.176
Token NHL
Token ID9481
Feature activation+5.91e-01

Pos logit contributions
 mole+0.119
 ],+0.109
ゼウス+0.106
 ):+0.098
 ].+0.096
Neg logit contributions
ignt-0.083
GN-0.082
wic-0.080
aughs-0.079
stories-0.077
Token teams
Token ID3466
Feature activation-3.84e-01

Pos logit contributions
ignt+0.327
GN+0.320
wic+0.313
aughs+0.309
stories+0.297
Neg logit contributions
 mole-0.538
 ],-0.496
ゼウス-0.487
 ):-0.447
 ].-0.438
Token to
Token ID284
Feature activation+8.64e-02

Pos logit contributions
 mole+0.333
 ],+0.307
ゼウス+0.299
 ):+0.275
 ].+0.270
Neg logit contributions
ignt-0.229
GN-0.223
wic-0.220
aughs-0.220
stories-0.211
Tokenside
Token ID1589
Feature activation+7.71e-01

Pos logit contributions
ignt+0.626
GN+0.617
aughs+0.598
wic+0.598
stories+0.575
Neg logit contributions
 mole-0.754
 ],-0.689
ゼウス-0.659
 ):-0.613
 ].-0.596
Token our
Token ID674
Feature activation-1.79e-02

Pos logit contributions
ignt+0.524
GN+0.514
wic+0.506
aughs+0.501
stories+0.486
Neg logit contributions
 mole-0.604
 ],-0.551
ゼウス-0.540
 ):-0.485
 ].-0.475
Token earlier
Token ID2961
Feature activation-1.30e-01

Pos logit contributions
 mole+0.015
 ],+0.014
ゼウス+0.013
 ):+0.012
 ].+0.012
Neg logit contributions
ignt-0.011
GN-0.011
wic-0.011
aughs-0.011
stories-0.011
Token requirements
Token ID5359
Feature activation-1.05e-01

Pos logit contributions
 mole+0.106
 ],+0.097
ゼウス+0.094
 ):+0.086
 ].+0.084
Neg logit contributions
ignt-0.085
GN-0.083
wic-0.082
aughs-0.081
stories-0.078
Token)
Token ID8
Feature activation+4.13e-01

Pos logit contributions
 mole+0.086
 ],+0.079
ゼウス+0.077
 ):+0.071
 ].+0.069
Neg logit contributions
ignt-0.068
GN-0.066
wic-0.065
aughs-0.065
stories-0.063
Token leaves
Token ID5667
Feature activation+1.35e+00

Pos logit contributions
ignt+0.237
GN+0.232
wic+0.228
aughs+0.225
stories+0.217
Neg logit contributions
 mole-0.368
 ],-0.339
ゼウス-0.332
 ):-0.304
 ].-0.298
Token us
Token ID514
Feature activation+2.22e-01

Pos logit contributions
ignt+0.856
GN+0.842
wic+0.831
aughs+0.812
stories+0.792
Neg logit contributions
 mole-1.113
 ],-1.022
ゼウス-1.009
 ):-0.909
 ].-0.889
Token with
Token ID351
Feature activation+2.07e-01

Pos logit contributions
ignt+0.145
GN+0.142
wic+0.140
aughs+0.139
stories+0.134
Neg logit contributions
 mole-0.180
 ],-0.165
ゼウス-0.160
 ):-0.146
 ].-0.143
Token a
Token ID257
Feature activation+3.15e-01

Pos logit contributions
ignt+0.141
GN+0.139
wic+0.137
aughs+0.135
stories+0.131
Neg logit contributions
 mole-0.161
 ],-0.147
ゼウス-0.143
 ):-0.129
 ].-0.126
Token good
Token ID922
Feature activation-2.23e-01

Pos logit contributions
ignt+0.197
GN+0.193
wic+0.191
aughs+0.187
stories+0.181
Neg logit contributions
 mole-0.263
 ],-0.242
ゼウス-0.238
 ):-0.215
 ].-0.211
Token prototype
Token ID14879
Feature activation-1.72e-01

Pos logit contributions
 mole+0.192
 ],+0.177
ゼウス+0.173
 ):+0.158
 ].+0.155
Neg logit contributions
ignt-0.134
GN-0.132
wic-0.130
aughs-0.128
stories-0.124
Token credit
Token ID3884
Feature activation-6.74e-01

Pos logit contributions
ignt+0.076
GN+0.075
wic+0.074
aughs+0.073
stories+0.071
Neg logit contributions
 mole-0.082
 ],-0.074
ゼウス-0.072
 ):-0.065
 ].-0.063
Token for
Token ID329
Feature activation+1.56e-01

Pos logit contributions
 mole+0.544
 ],+0.496
ゼウス+0.480
 ):+0.445
 ].+0.431
Neg logit contributions
ignt-0.445
GN-0.434
wic-0.428
aughs-0.428
stories-0.407
Token your
Token ID534
Feature activation+4.11e-02

Pos logit contributions
ignt+0.101
GN+0.099
wic+0.098
aughs+0.096
stories+0.094
Neg logit contributions
 mole-0.127
 ],-0.117
ゼウス-0.114
 ):-0.103
 ].-0.101
Token aw
Token ID3253
Feature activation+3.55e-01

Pos logit contributions
ignt+0.026
GN+0.026
wic+0.025
aughs+0.025
stories+0.024
Neg logit contributions
 mole-0.034
 ],-0.031
ゼウス-0.030
 ):-0.027
 ].-0.027
Tokenes
Token ID274
Feature activation+5.82e-01

Pos logit contributions
ignt+0.227
GN+0.222
wic+0.218
aughs+0.216
stories+0.209
Neg logit contributions
 mole-0.294
 ],-0.269
ゼウス-0.263
 ):-0.240
 ].-0.235
Tokenomen
Token ID3674
Feature activation+1.95e+00

Pos logit contributions
ignt+0.496
GN+0.492
wic+0.486
aughs+0.481
stories+0.471
Neg logit contributions
 mole-0.354
 ],-0.311
ゼウス-0.301
 ):-0.264
 ].-0.254
Tokeness
Token ID408
Feature activation+5.24e-01

Pos logit contributions
ignt+1.347
GN+1.310
wic+1.290
aughs+1.278
stories+1.247
Neg logit contributions
 mole-1.529
 ],-1.382
ゼウス-1.368
 ):-1.217
 ].-1.190
Token.
Token ID13
Feature activation+8.25e-02

Pos logit contributions
ignt+0.309
GN+0.302
wic+0.297
aughs+0.294
stories+0.283
Neg logit contributions
 mole-0.459
 ],-0.422
ゼウス-0.413
 ):-0.379
 ].-0.371
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
ignt+0.053
GN+0.051
wic+0.051
aughs+0.050
stories+0.048
Neg logit contributions
 mole-0.068
 ],-0.063
ゼウス-0.061
 ):-0.056
 ].-0.055
Token
Token ID198
Feature activation-3.84e-03

Pos logit contributions
ignt+0.069
GN+0.069
wic+0.067
aughs+0.065
stories+0.063
Neg logit contributions
 mole-0.148
 ],-0.136
ゼウス-0.136
 ):-0.124
 ].-0.122
TokenIf
Token ID1532
Feature activation-3.82e-02

Pos logit contributions
 mole+0.003
 ],+0.003
ゼウス+0.003
 ):+0.003
 ].+0.003
Neg logit contributions
ignt-0.002
GN-0.002
wic-0.002
aughs-0.002
stories-0.002
Token their
Token ID511
Feature activation+1.02e-01

Pos logit contributions
ignt+0.057
GN+0.056
wic+0.055
aughs+0.055
stories+0.053
Neg logit contributions
 mole-0.065
 ],-0.059
ゼウス-0.059
 ):-0.052
 ].-0.051
Token impartial
Token ID32521
Feature activation+2.39e-01

Pos logit contributions
ignt+0.067
GN+0.065
wic+0.064
aughs+0.063
stories+0.062
Neg logit contributions
 mole-0.082
 ],-0.075
ゼウス-0.074
 ):-0.066
 ].-0.065
Tokenity
Token ID414
Feature activation+3.06e-01

Pos logit contributions
ignt+0.146
GN+0.143
wic+0.141
aughs+0.139
stories+0.135
Neg logit contributions
 mole-0.205
 ],-0.188
ゼウス-0.184
 ):-0.168
 ].-0.164
Token.
Token ID13
Feature activation+4.02e-02

Pos logit contributions
ignt+0.180
GN+0.177
wic+0.173
aughs+0.172
stories+0.165
Neg logit contributions
 mole-0.268
 ],-0.247
ゼウス-0.241
 ):-0.222
 ].-0.217
Token For
Token ID1114
Feature activation+2.28e-01

Pos logit contributions
ignt+0.022
GN+0.021
wic+0.021
aughs+0.021
stories+0.020
Neg logit contributions
 mole-0.037
 ],-0.034
ゼウス-0.033
 ):-0.031
 ].-0.030
Token instance
Token ID4554
Feature activation+1.37e+00

Pos logit contributions
ignt+0.152
GN+0.149
wic+0.147
aughs+0.146
stories+0.141
Neg logit contributions
 mole-0.182
 ],-0.166
ゼウス-0.164
 ):-0.147
 ].-0.144
Token,
Token ID11
Feature activation+2.74e-01

Pos logit contributions
ignt+0.784
GN+0.763
wic+0.752
aughs+0.734
stories+0.713
Neg logit contributions
 mole-1.219
 ],-1.118
ゼウス-1.106
 ):-1.002
 ])-0.989
Token a
Token ID257
Feature activation+2.50e-01

Pos logit contributions
ignt+0.182
GN+0.178
wic+0.176
aughs+0.173
stories+0.169
Neg logit contributions
 mole-0.219
 ],-0.199
ゼウス-0.197
 ):-0.177
 ].-0.172
Token key
Token ID1994
Feature activation+1.25e-01

Pos logit contributions
ignt+0.156
GN+0.153
wic+0.151
aughs+0.148
stories+0.144
Neg logit contributions
 mole-0.209
 ],-0.192
ゼウス-0.188
 ):-0.171
 ].-0.167
Token aspect
Token ID4843
Feature activation+6.18e-01

Pos logit contributions
ignt+0.081
GN+0.080
wic+0.079
aughs+0.078
stories+0.075
Neg logit contributions
 mole-0.101
 ],-0.093
ゼウス-0.091
 ):-0.082
 ].-0.080
Token of
Token ID286
Feature activation+1.13e-02

Pos logit contributions
ignt+0.394
GN+0.386
wic+0.379
aughs+0.375
stories+0.362
Neg logit contributions
 mole-0.513
 ],-0.470
ゼウス-0.458
 ):-0.419
 ].-0.409
Token Lab
Token ID3498
Feature activation+1.29e+00

Pos logit contributions
 mole+0.047
 ],+0.043
ゼウス+0.042
 ):+0.038
 ].+0.037
Neg logit contributions
ignt-0.043
GN-0.042
wic-0.042
aughs-0.041
stories-0.040
Token/
Token ID14
Feature activation+4.18e-01

Pos logit contributions
ignt+0.886
GN+0.870
wic+0.857
aughs+0.849
stories+0.823
Neg logit contributions
 mole-0.997
 ],-0.907
ゼウス-0.886
 ):-0.799
 ].-0.781
TokenMethod
Token ID17410
Feature activation+2.64e-01

Pos logit contributions
ignt+0.250
GN+0.244
wic+0.241
aughs+0.239
stories+0.229
Neg logit contributions
 mole-0.363
 ],-0.334
ゼウス-0.326
 ):-0.299
 ].-0.293
Token/
Token ID14
Feature activation+2.08e-01

Pos logit contributions
ignt+0.177
GN+0.174
wic+0.172
aughs+0.170
stories+0.165
Neg logit contributions
 mole-0.209
 ],-0.191
ゼウス-0.186
 ):-0.168
 ].-0.165
TokenF
Token ID37
Feature activation+2.26e-01

Pos logit contributions
ignt+0.128
GN+0.126
wic+0.124
aughs+0.122
stories+0.119
Neg logit contributions
 mole-0.176
 ],-0.162
ゼウス-0.158
 ):-0.144
 ].-0.141
Tokenraction
Token ID7861
Feature activation+1.39e+00

Pos logit contributions
ignt+0.163
GN+0.160
wic+0.158
aughs+0.156
stories+0.152
Neg logit contributions
 mole-0.168
 ],-0.152
ゼウス-0.148
 ):-0.133
 ].-0.130
Token (
Token ID357
Feature activation+2.71e-01

Pos logit contributions
ignt+0.916
GN+0.903
wic+0.890
aughs+0.876
stories+0.851
Neg logit contributions
 mole-1.102
 ],-1.007
ゼウス-0.986
 ):-0.890
 ].-0.871
Tokenb
Token ID65
Feature activation+1.71e-01

Pos logit contributions
ignt+0.167
GN+0.164
wic+0.161
aughs+0.159
stories+0.154
Neg logit contributions
 mole-0.230
 ],-0.211
ゼウス-0.206
 ):-0.188
 ].-0.185
Token,
Token ID11
Feature activation+4.16e-01

Pos logit contributions
ignt+0.093
GN+0.089
wic+0.089
aughs+0.088
stories+0.084
Neg logit contributions
 mole-0.159
 ],-0.146
ゼウス-0.143
 ):-0.135
 ])-0.129
Tokenc
Token ID66
Feature activation+4.68e-02

Pos logit contributions
ignt+0.241
GN+0.236
wic+0.232
aughs+0.229
stories+0.221
Neg logit contributions
 mole-0.369
 ],-0.340
ゼウス-0.332
 ):-0.305
 ].-0.299
Token,
Token ID11
Feature activation+4.29e-01

Pos logit contributions
ignt+0.026
GN+0.025
wic+0.025
aughs+0.024
stories+0.023
Neg logit contributions
 mole-0.043
 ],-0.040
ゼウス-0.039
 ):-0.036
 ])-0.035
Tokeners
Token ID364
Feature activation+1.88e-01

Pos logit contributions
 mole+0.074
 ],+0.068
ゼウス+0.068
 ):+0.061
 ].+0.060
Neg logit contributions
ignt-0.049
GN-0.048
wic-0.048
aughs-0.046
stories-0.045
Token vs
Token ID3691
Feature activation-8.46e-01

Pos logit contributions
ignt+0.121
GN+0.119
wic+0.117
aughs+0.116
stories+0.112
Neg logit contributions
 mole-0.154
 ],-0.141
ゼウス-0.138
 ):-0.125
 ].-0.123
Token.
Token ID13
Feature activation+3.45e-01

Pos logit contributions
 mole+0.718
 ],+0.661
ゼウス+0.633
 ):+0.590
 ].+0.578
Neg logit contributions
ignt-0.525
GN-0.507
wic-0.504
aughs-0.501
stories-0.484
Token Hust
Token ID42941
Feature activation-6.62e-01

Pos logit contributions
ignt+0.232
GN+0.227
wic+0.224
aughs+0.222
stories+0.215
Neg logit contributions
 mole-0.275
 ],-0.251
ゼウス-0.243
 ):-0.222
 ].-0.217
Tokenlers
Token ID8116
Feature activation+1.65e-01

Pos logit contributions
 mole+0.555
 ],+0.511
ゼウス+0.491
 ):+0.455
 ].+0.446
Neg logit contributions
ignt-0.410
GN-0.404
wic-0.393
aughs-0.393
stories-0.380
Token TBA
Token ID48591
Feature activation+1.36e+00

Pos logit contributions
ignt+0.103
GN+0.101
wic+0.099
aughs+0.098
stories+0.095
Neg logit contributions
 mole-0.139
 ],-0.128
ゼウス-0.124
 ):-0.114
 ].-0.112
Token TBA
Token ID48591
Feature activation+1.53e+00

Pos logit contributions
ignt+0.821
GN+0.804
wic+0.791
aughs+0.782
stories+0.754
Neg logit contributions
 mole-1.169
 ],-1.074
ゼウス-1.051
 ):-0.960
 ].-0.941
Token The
Token ID383
Feature activation+5.55e-01

Pos logit contributions
ignt+0.961
GN+0.942
wic+0.927
aughs+0.917
stories+0.887
Neg logit contributions
 mole-1.274
 ],-1.167
ゼウス-1.140
 ):-1.039
 ].-1.017
Token Complete
Token ID13248
Feature activation+9.35e-01

Pos logit contributions
ignt+0.373
GN+0.365
wic+0.360
aughs+0.356
stories+0.345
Neg logit contributions
 mole-0.440
 ],-0.401
ゼウス-0.392
 ):-0.355
 ].-0.347
Token Thirty
Token ID29948
Feature activation+3.97e-01

Pos logit contributions
ignt+0.646
GN+0.634
wic+0.625
aughs+0.619
stories+0.600
Neg logit contributions
 mole-0.724
 ],-0.658
ゼウス-0.642
 ):-0.580
 ].-0.566
Token-
Token ID12
Feature activation+6.81e-01

Pos logit contributions
ignt+0.251
GN+0.245
wic+0.241
aughs+0.238
stories+0.230
Neg logit contributions
 mole-0.332
 ],-0.304
ゼウス-0.297
 ):-0.271
 ].-0.265
Token
Token ID247
Feature activation-6.62e-02

Pos logit contributions
 mole+0.078
 ],+0.068
ゼウス+0.065
 ):+0.056
 ].+0.053
Neg logit contributions
ignt-0.125
GN-0.124
wic-0.120
aughs-0.119
stories-0.118
Tokens
Token ID82
Feature activation-1.68e-01

Pos logit contributions
 mole+0.060
 ],+0.054
ゼウス+0.054
 ):+0.049
 ])+0.048
Neg logit contributions
ignt-0.038
GN-0.037
wic-0.036
aughs-0.035
stories-0.035
Token seventh
Token ID14024
Feature activation+4.40e-01

Pos logit contributions
 mole+0.131
 ],+0.119
ゼウス+0.116
 ):+0.105
 ].+0.103
Neg logit contributions
ignt-0.116
GN-0.114
wic-0.112
aughs-0.111
stories-0.108
Token consecutive
Token ID12785
Feature activation+6.92e-01

Pos logit contributions
ignt+0.253
GN+0.248
wic+0.245
aughs+0.236
stories+0.230
Neg logit contributions
 mole-0.391
 ],-0.360
ゼウス-0.357
 ):-0.322
 ].-0.316
Token lunar
Token ID25572
Feature activation+1.79e-01

Pos logit contributions
GN+0.563
ignt+0.557
wic+0.545
aughs+0.528
stories+0.527
Neg logit contributions
 mole-0.434
ゼウス-0.395
 ],-0.394
 ):-0.333
skirts-0.331
Token mission
Token ID4365
Feature activation+2.06e-01

Pos logit contributions
GN+0.106
ignt+0.105
wic+0.103
stories+0.100
 static+0.098
Neg logit contributions
 mole-0.150
 ],-0.140
ゼウス-0.140
 ):-0.124
skirts-0.123
Token failure
Token ID5287
Feature activation-1.96e-01

Pos logit contributions
ignt+0.126
GN+0.124
wic+0.121
aughs+0.117
stories+0.115
Neg logit contributions
 mole-0.173
 ],-0.161
ゼウス-0.160
 ):-0.142
skirts-0.141
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
 mole+0.172
 ],+0.159
ゼウス+0.155
 ):+0.142
 ].+0.140
Neg logit contributions
ignt-0.115
GN-0.112
wic-0.111
aughs-0.110
stories-0.106
Token NASA
Token ID8884
Feature activation+4.26e-01

Pos logit contributions
 mole+0.157
 ],+0.141
ゼウス+0.139
 ):+0.123
 ].+0.121
Neg logit contributions
GN-0.148
ignt-0.148
wic-0.144
aughs-0.142
stories-0.139
Token Headquarters
Token ID32193
Feature activation-2.91e-01

Pos logit contributions
ignt+0.307
GN+0.303
wic+0.298
aughs+0.294
stories+0.286
Neg logit contributions
 mole-0.316
 ],-0.287
ゼウス-0.279
 ):-0.250
 ].-0.245
Token formed
Token ID7042
Feature activation+3.51e-01

Pos logit contributions
 mole+0.228
 ],+0.208
ゼウス+0.204
 ):+0.183
 ].+0.179
Neg logit contributions
ignt-0.198
GN-0.195
wic-0.191
aughs-0.189
stories-0.184
Token Fit
Token ID25048
Feature activation+1.47e+00

Pos logit contributions
 mole+0.048
 ],+0.043
ゼウス+0.043
 ):+0.039
 ].+0.038
Neg logit contributions
ignt-0.040
GN-0.039
wic-0.039
aughs-0.038
stories-0.037
Token an
Token ID281
Feature activation-4.97e-01

Pos logit contributions
ignt+0.958
GN+0.942
wic+0.937
aughs+0.908
stories+0.893
Neg logit contributions
 mole-1.181
 ],-1.074
ゼウス-1.060
 ):-0.955
 ].-0.932
Token Object
Token ID9515
Feature activation+5.77e-01

Pos logit contributions
 mole+0.395
 ],+0.360
ゼウス+0.353
 ):+0.319
 ].+0.312
Neg logit contributions
ignt-0.333
GN-0.326
wic-0.322
aughs-0.318
stories-0.308
Token Al
Token ID978
Feature activation+3.10e-02

Pos logit contributions
ignt+0.375
GN+0.368
wic+0.362
aughs+0.355
stories+0.345
Neg logit contributions
 mole-0.467
 ],-0.426
ゼウス-0.419
 ):-0.378
 ].-0.370
Tokenigned
Token ID3916
Feature activation+2.91e-01

Pos logit contributions
ignt+0.019
GN+0.019
wic+0.019
aughs+0.018
stories+0.018
Neg logit contributions
 mole-0.026
 ],-0.024
ゼウス-0.023
 ):-0.021
 ].-0.021
Token B
Token ID347
Feature activation+2.73e-01

Pos logit contributions
ignt+0.193
GN+0.190
wic+0.187
aughs+0.183
stories+0.177
Neg logit contributions
 mole-0.231
 ],-0.210
ゼウス-0.207
 ):-0.185
 ].-0.182
Tokenounding
Token ID9969
Feature activation+9.39e-01

Pos logit contributions
ignt+0.166
GN+0.166
wic+0.161
aughs+0.158
stories+0.151
Neg logit contributions
 mole-0.233
 ],-0.212
ゼウス-0.210
 ):-0.189
 ].-0.186
Token Box
Token ID8315
Feature activation+1.09e-03

Pos logit contributions
ignt+0.513
GN+0.500
wic+0.492
aughs+0.486
stories+0.467
Neg logit contributions
 mole-0.865
 ],-0.799
ゼウス-0.782
 ):-0.720
 ].-0.706
Token around
Token ID1088
Feature activation-1.08e-01

Pos logit contributions
ignt+0.001
GN+0.001
wic+0.001
aughs+0.001
stories+0.001
Neg logit contributions
 mole-0.001
 ],-0.001
ゼウス-0.001
 ):-0.001
 ].-0.001
Token the
Token ID262
Feature activation-5.14e-02

Pos logit contributions
 mole+0.084
 ],+0.076
ゼウス+0.074
 ):+0.067
 ].+0.065
Neg logit contributions
ignt-0.075
GN-0.074
wic-0.073
aughs-0.072
stories-0.070
Token space
Token ID2272
Feature activation-1.96e-01

Pos logit contributions
 mole+0.039
 ],+0.035
ゼウス+0.035
 ):+0.031
 ].+0.030
Neg logit contributions
ignt-0.036
GN-0.035
wic-0.035
aughs-0.034
stories-0.033
Token,
Token ID11
Feature activation+5.79e-01

Pos logit contributions
 mole+0.194
 ],+0.177
ゼウス+0.169
 ):+0.160
 ].+0.154
Neg logit contributions
ignt-0.166
aughs-0.163
GN-0.163
wic-0.159
stories-0.155
Token DC
Token ID6257
Feature activation+6.47e-01

Pos logit contributions
ignt+0.402
aughs+0.392
GN+0.391
wic+0.386
stories+0.372
Neg logit contributions
 mole-0.449
 ],-0.410
ゼウス-0.393
 ):-0.364
 ].-0.354
Token:
Token ID25
Feature activation+6.13e-01

Pos logit contributions
ignt+0.505
GN+0.495
aughs+0.489
wic+0.488
stories+0.473
Neg logit contributions
 mole-0.446
 ],-0.395
ゼウス-0.384
 ):-0.350
 ].-0.333
Token US
Token ID1294
Feature activation+2.96e-01

Pos logit contributions
ignt+0.467
GN+0.456
wic+0.451
aughs+0.449
stories+0.434
Neg logit contributions
 mole-0.433
 ],-0.392
ゼウス-0.378
 ):-0.341
 ].-0.332
Token Department
Token ID2732
Feature activation+3.53e-01

Pos logit contributions
ignt+0.164
GN+0.159
wic+0.157
aughs+0.156
stories+0.149
Neg logit contributions
 mole-0.270
 ],-0.250
ゼウス-0.244
 ):-0.225
 ].-0.221
Token of
Token ID286
Feature activation+2.15e-01

Pos logit contributions
ignt+0.206
GN+0.205
wic+0.201
aughs+0.195
stories+0.192
Neg logit contributions
 mole-0.309
 ],-0.282
ゼウス-0.279
 ):-0.252
 ].-0.247
Token Transportation
Token ID15198
Feature activation-9.86e-01

Pos logit contributions
ignt+0.127
GN+0.124
wic+0.121
aughs+0.121
stories+0.116
Neg logit contributions
 mole-0.187
 ],-0.174
ゼウス-0.168
 ):-0.156
 ].-0.153
Token,
Token ID11
Feature activation+3.16e-01

Pos logit contributions
 mole+0.830
 ],+0.764
ゼウス+0.732
 ):+0.681
 ].+0.670
Neg logit contributions
ignt-0.614
GN-0.593
aughs-0.588
wic-0.579
stories-0.561
Token National
Token ID2351
Feature activation-1.07e+00

Pos logit contributions
ignt+0.212
GN+0.208
wic+0.205
aughs+0.203
stories+0.197
Neg logit contributions
 mole-0.250
 ],-0.229
ゼウス-0.223
 ):-0.202
 ].-0.198
Token Highway
Token ID14914
Feature activation-2.58e-01

Pos logit contributions
 mole+0.733
 ],+0.658
ゼウス+0.635
 ):+0.571
 ].+0.556
Neg logit contributions
ignt-0.831
GN-0.817
wic-0.804
aughs-0.801
stories-0.776
Token Traffic
Token ID23624
Feature activation-6.54e-01

Pos logit contributions
 mole+0.184
 ],+0.168
ゼウス+0.160
 ):+0.146
 ].+0.143
Neg logit contributions
ignt-0.193
GN-0.190
aughs-0.186
wic-0.186
stories-0.180
Token got
Token ID1392
Feature activation-2.66e-01

Pos logit contributions
ignt+0.115
GN+0.113
wic+0.112
aughs+0.111
stories+0.107
Neg logit contributions
 mole-0.121
 ],-0.109
ゼウス-0.106
 ):-0.096
 ].-0.094
Token a
Token ID257
Feature activation-2.97e-02

Pos logit contributions
 mole+0.202
 ],+0.182
ゼウス+0.176
 ):+0.161
 ].+0.156
Neg logit contributions
ignt-0.189
GN-0.186
wic-0.184
aughs-0.183
stories-0.176
Token gun
Token ID2485
Feature activation-2.30e-01

Pos logit contributions
 mole+0.024
 ],+0.022
ゼウス+0.022
 ):+0.020
 ].+0.019
Neg logit contributions
ignt-0.019
GN-0.019
wic-0.019
aughs-0.018
stories-0.018
Token in
Token ID287
Feature activation-1.52e-01

Pos logit contributions
 mole+0.198
 ],+0.181
ゼウス+0.177
 ):+0.162
 ].+0.159
Neg logit contributions
ignt-0.139
GN-0.136
wic-0.134
aughs-0.133
stories-0.128
Token your
Token ID534
Feature activation-1.93e-01

Pos logit contributions
 mole+0.136
 ],+0.125
ゼウス+0.123
 ):+0.113
 ].+0.111
Neg logit contributions
ignt-0.087
GN-0.085
wic-0.083
aughs-0.082
stories-0.079
Token house
Token ID2156
Feature activation+2.55e-01

Pos logit contributions
 mole+0.166
 ],+0.152
ゼウス+0.149
 ):+0.136
 ].+0.134
Neg logit contributions
ignt-0.117
GN-0.114
wic-0.112
aughs-0.111
stories-0.107
Token,
Token ID11
Feature activation+5.47e-02

Pos logit contributions
ignt+0.158
GN+0.154
aughs+0.152
wic+0.152
stories+0.144
Neg logit contributions
 mole-0.216
 ],-0.198
ゼウス-0.192
 ):-0.177
 ].-0.173
Token I
Token ID314
Feature activation-6.39e-02

Pos logit contributions
ignt+0.033
GN+0.032
wic+0.032
aughs+0.031
stories+0.030
Neg logit contributions
 mole-0.047
 ],-0.044
ゼウス-0.042
 ):-0.039
 ].-0.038
Token
Token ID447
Feature activation+1.98e-01

Pos logit contributions
 mole+0.051
 ],+0.047
ゼウス+0.046
 ):+0.041
 ].+0.041
Neg logit contributions
ignt-0.042
GN-0.041
wic-0.041
aughs-0.040
stories-0.039
Token
Token ID247
Feature activation+6.27e-02

Pos logit contributions
ignt+0.179
GN+0.177
wic+0.173
aughs+0.170
stories+0.168
Neg logit contributions
 mole-0.111
 ],-0.098
ゼウス-0.093
 ):-0.081
 ].-0.078
Tokenm
Token ID76
Feature activation-3.95e-02

Pos logit contributions
ignt+0.036
GN+0.035
wic+0.034
aughs+0.034
stories+0.033
Neg logit contributions
 mole-0.056
 ],-0.052
ゼウス-0.051
 ):-0.046
 ].-0.046
Tokeneder
Token ID5702
Feature activation-2.72e-01

Pos logit contributions
 mole+0.176
 ],+0.161
ゼウス+0.159
 ):+0.145
 ].+0.142
Neg logit contributions
ignt-0.101
GN-0.100
wic-0.098
aughs-0.094
stories-0.092
Token to
Token ID284
Feature activation+9.87e-02

Pos logit contributions
 mole+0.222
 ],+0.203
ゼウス+0.196
 ):+0.180
 ].+0.176
Neg logit contributions
ignt-0.178
GN-0.173
wic-0.172
aughs-0.171
stories-0.165
Token go
Token ID467
Feature activation+2.11e-01

Pos logit contributions
ignt+0.060
GN+0.059
wic+0.058
aughs+0.058
stories+0.056
Neg logit contributions
 mole-0.084
 ],-0.078
ゼウス-0.075
 ):-0.070
 ].-0.068
Token back
Token ID736
Feature activation+1.78e-01

Pos logit contributions
ignt+0.127
GN+0.124
wic+0.123
aughs+0.122
stories+0.117
Neg logit contributions
 mole-0.181
 ],-0.167
ゼウス-0.162
 ):-0.149
 ].-0.147
Token on
Token ID319
Feature activation-1.00e-01

Pos logit contributions
ignt+0.104
GN+0.102
wic+0.101
aughs+0.100
stories+0.096
Neg logit contributions
 mole-0.156
 ],-0.145
ゼウス-0.141
 ):-0.129
 ].-0.127
Token 24
Token ID1987
Feature activation+2.16e-01

Pos logit contributions
 mole+0.073
 ],+0.066
ゼウス+0.065
 ):+0.058
 ].+0.057
Neg logit contributions
ignt-0.073
GN-0.072
wic-0.071
aughs-0.070
stories-0.068
Token-
Token ID12
Feature activation+1.83e-01

Pos logit contributions
ignt+0.125
GN+0.120
wic+0.119
aughs+0.118
stories+0.113
Neg logit contributions
 mole-0.193
 ],-0.179
ゼウス-0.173
 ):-0.160
 ].-0.157
Tokenhour
Token ID9769
Feature activation-5.96e-01

Pos logit contributions
ignt+0.105
GN+0.100
wic+0.099
aughs+0.099
stories+0.093
Neg logit contributions
 mole-0.164
 ],-0.152
ゼウス-0.146
 ):-0.137
 ].-0.134
Token alcohol
Token ID5548
Feature activation+1.09e-01

Pos logit contributions
 mole+0.428
 ],+0.386
ゼウス+0.374
 ):+0.336
 ].+0.328
Neg logit contributions
ignt-0.447
GN-0.438
wic-0.434
aughs-0.429
stories-0.418
Token monitoring
Token ID9904
Feature activation-1.33e-01

Pos logit contributions
ignt+0.076
GN+0.075
wic+0.074
aughs+0.073
stories+0.071
Neg logit contributions
 mole-0.083
 ],-0.076
ゼウス-0.074
 ):-0.067
 ].-0.065
Token and
Token ID290
Feature activation-2.00e-02

Pos logit contributions
 mole+0.114
 ],+0.105
ゼウス+0.103
 ):+0.094
 ].+0.092
Neg logit contributions
ignt-0.081
GN-0.079
wic-0.078
aughs-0.077
stories-0.074
Token gay
Token ID5650
Feature activation-1.84e-01

Pos logit contributions
 mole+0.065
 ],+0.059
ゼウス+0.058
 ):+0.052
 ].+0.051
Neg logit contributions
ignt-0.055
GN-0.054
wic-0.053
aughs-0.052
stories-0.051
Token.
Token ID13
Feature activation+7.09e-02

Pos logit contributions
 mole+0.158
 ],+0.145
ゼウス+0.142
 ):+0.130
 ].+0.127
Neg logit contributions
ignt-0.112
GN-0.109
wic-0.107
aughs-0.106
stories-0.102
Token In
Token ID554
Feature activation+3.76e-01

Pos logit contributions
ignt+0.048
GN+0.048
wic+0.047
aughs+0.047
stories+0.045
Neg logit contributions
 mole-0.055
 ],-0.050
ゼウス-0.049
 ):-0.044
 ].-0.043
Token this
Token ID428
Feature activation+2.16e-02

Pos logit contributions
ignt+0.242
GN+0.238
wic+0.234
aughs+0.231
stories+0.225
Neg logit contributions
 mole-0.308
 ],-0.282
ゼウス-0.277
 ):-0.251
 ].-0.245
Token context
Token ID4732
Feature activation+1.58e+00

Pos logit contributions
ignt+0.012
GN+0.012
wic+0.012
aughs+0.012
stories+0.011
Neg logit contributions
 mole-0.020
 ],-0.018
ゼウス-0.018
 ):-0.016
 ].-0.016
Token,
Token ID11
Feature activation-6.13e-02

Pos logit contributions
ignt+0.973
GN+0.952
wic+0.937
aughs+0.927
stories+0.895
Neg logit contributions
 mole-1.337
 ],-1.227
ゼウス-1.200
 ):-1.095
 ].-1.072
Token the
Token ID262
Feature activation+3.25e-02

Pos logit contributions
 mole+0.049
 ],+0.045
ゼウス+0.044
 ):+0.040
 ].+0.039
Neg logit contributions
ignt-0.040
GN-0.040
wic-0.039
aughs-0.039
stories-0.038
Token gay
Token ID5650
Feature activation-7.47e-02

Pos logit contributions
ignt+0.022
GN+0.021
wic+0.021
aughs+0.021
stories+0.020
Neg logit contributions
 mole-0.026
 ],-0.024
ゼウス-0.023
 ):-0.021
 ].-0.020
Token man
Token ID582
Feature activation+3.30e-03

Pos logit contributions
 mole+0.058
 ],+0.052
ゼウス+0.051
 ):+0.046
 ].+0.045
Neg logit contributions
ignt-0.052
GN-0.051
wic-0.050
aughs-0.050
stories-0.048
Token is
Token ID318
Feature activation-1.20e-01

Pos logit contributions
ignt+0.002
GN+0.002
wic+0.002
aughs+0.002
stories+0.002
Neg logit contributions
 mole-0.002
 ],-0.002
ゼウス-0.002
 ):-0.002
 ].-0.002
Token just
Token ID655
Feature activation-4.82e-01

Pos logit contributions
 mole+0.091
 ],+0.083
ゼウス+0.082
 ):+0.073
 ].+0.072
Neg logit contributions
ignt-0.083
GN-0.082
wic-0.081
aughs-0.080
stories-0.077
Token humans
Token ID5384
Feature activation-1.15e-01

Pos logit contributions
ignt+0.245
GN+0.240
wic+0.237
aughs+0.234
stories+0.227
Neg logit contributions
 mole-0.306
 ],-0.280
ゼウス-0.272
 ):-0.248
 ].-0.243
Token.
Token ID13
Feature activation-1.10e-01

Pos logit contributions
 mole+0.098
 ],+0.090
ゼウス+0.087
 ):+0.080
 ].+0.078
Neg logit contributions
ignt-0.071
GN-0.069
wic-0.068
aughs-0.067
stories-0.065
Token
Token ID198
Feature activation-2.25e-01

Pos logit contributions
 mole+0.096
 ],+0.089
ゼウス+0.087
 ):+0.079
 ].+0.078
Neg logit contributions
ignt-0.065
GN-0.063
wic-0.062
aughs-0.061
stories-0.059
Token
Token ID198
Feature activation-1.05e-01

Pos logit contributions
 mole+0.222
 ],+0.205
ゼウス+0.203
 ):+0.186
 ].+0.183
Neg logit contributions
ignt-0.107
GN-0.106
wic-0.104
aughs-0.101
stories-0.098
TokenThe
Token ID464
Feature activation-8.66e-02

Pos logit contributions
 mole+0.088
 ],+0.081
ゼウス+0.079
 ):+0.072
 ].+0.070
Neg logit contributions
ignt-0.067
GN-0.064
wic-0.064
aughs-0.063
stories-0.061
Token Australian
Token ID6638
Feature activation-1.59e-01

Pos logit contributions
 mole+0.074
 ],+0.068
ゼウス+0.066
 ):+0.060
 ].+0.059
Neg logit contributions
ignt-0.053
GN-0.052
wic-0.051
aughs-0.051
stories-0.049
Token and
Token ID290
Feature activation+3.35e-01

Pos logit contributions
 mole+0.142
 ],+0.130
ゼウス+0.127
 ):+0.117
 ].+0.115
Neg logit contributions
ignt-0.092
GN-0.090
wic-0.089
aughs-0.088
stories-0.084
Token French
Token ID4141
Feature activation+5.98e-01

Pos logit contributions
ignt+0.235
GN+0.230
wic+0.227
aughs+0.225
stories+0.218
Neg logit contributions
 mole-0.256
 ],-0.233
ゼウス-0.226
 ):-0.205
 ].-0.200
Token research
Token ID2267
Feature activation-2.05e-01

Pos logit contributions
ignt+0.278
GN+0.270
wic+0.265
aughs+0.261
stories+0.249
Neg logit contributions
 mole-0.598
 ],-0.556
ゼウス-0.546
 ):-0.506
 ].-0.497
Token shows
Token ID2523
Feature activation+1.99e-01

Pos logit contributions
 mole+0.189
 ],+0.175
ゼウス+0.171
 ):+0.158
 ].+0.155
Neg logit contributions
ignt-0.112
GN-0.109
wic-0.107
aughs-0.106
stories-0.102
Token that
Token ID326
Feature activation+2.75e-03

Pos logit contributions
ignt+0.135
GN+0.132
wic+0.130
aughs+0.129
stories+0.125
Neg logit contributions
 mole-0.157
 ],-0.143
ゼウス-0.140
 ):-0.127
 ].-0.124
Token 2
Token ID362
Feature activation+1.35e-01

Pos logit contributions
ignt+0.058
GN+0.057
wic+0.056
aughs+0.056
stories+0.054
Neg logit contributions
 mole-0.064
 ],-0.059
ゼウス-0.057
 ):-0.052
 ].-0.050
Token+
Token ID10
Feature activation-2.51e-01

Pos logit contributions
ignt+0.086
GN+0.084
wic+0.083
aughs+0.082
stories+0.079
Neg logit contributions
 mole-0.112
 ],-0.103
ゼウス-0.100
 ):-0.091
 ].-0.089
Token1
Token ID16
Feature activation+1.88e-01

Pos logit contributions
 mole+0.197
 ],+0.180
ゼウス+0.176
 ):+0.159
 ].+0.155
Neg logit contributions
ignt-0.170
GN-0.167
wic-0.164
aughs-0.162
stories-0.158
Token is
Token ID318
Feature activation+4.18e-02

Pos logit contributions
ignt+0.130
GN+0.128
wic+0.125
aughs+0.124
stories+0.121
Neg logit contributions
 mole-0.145
 ],-0.133
ゼウス-0.129
 ):-0.117
 ].-0.114
Token,
Token ID11
Feature activation-1.00e-02

Pos logit contributions
ignt+0.027
GN+0.026
wic+0.026
aughs+0.026
stories+0.025
Neg logit contributions
 mole-0.034
 ],-0.031
ゼウス-0.031
 ):-0.028
 ].-0.027
Token but
Token ID475
Feature activation-8.94e-02

Pos logit contributions
 mole+0.008
 ],+0.008
ゼウス+0.008
 ):+0.007
 ].+0.007
Neg logit contributions
ignt-0.006
GN-0.006
wic-0.006
aughs-0.006
stories-0.006
Token they
Token ID484
Feature activation-8.74e-02

Pos logit contributions
 mole+0.070
 ],+0.064
ゼウス+0.063
 ):+0.056
 ].+0.055
Neg logit contributions
ignt-0.061
GN-0.060
wic-0.059
aughs-0.058
stories-0.056
Token are
Token ID389
Feature activation+6.23e-02

Pos logit contributions
 mole+0.068
 ],+0.062
ゼウス+0.062
 ):+0.055
 ].+0.054
Neg logit contributions
ignt-0.059
GN-0.058
wic-0.057
aughs-0.057
stories-0.055
Token capable
Token ID6007
Feature activation-5.07e-02

Pos logit contributions
ignt+0.041
GN+0.040
wic+0.039
aughs+0.039
stories+0.037
Neg logit contributions
 mole-0.051
 ],-0.046
ゼウス-0.045
 ):-0.041
 ].-0.040
Token of
Token ID286
Feature activation+1.53e-01

Pos logit contributions
 mole+0.041
 ],+0.037
ゼウス+0.036
 ):+0.033
 ].+0.033
Neg logit contributions
ignt-0.033
wic-0.033
GN-0.032
aughs-0.032
stories-0.031
Token mathematical
Token ID18069
Feature activation+3.60e-01

Pos logit contributions
ignt+0.101
GN+0.100
wic+0.098
aughs+0.097
stories+0.094
Neg logit contributions
 mole-0.122
 ],-0.112
ゼウス-0.110
 ):-0.099
 ].-0.097
Token are
Token ID389
Feature activation+1.99e-02

Pos logit contributions
ignt+0.066
GN+0.064
wic+0.063
aughs+0.063
stories+0.061
Neg logit contributions
 mole-0.081
 ],-0.074
ゼウス-0.072
 ):-0.066
 ].-0.064
Token the
Token ID262
Feature activation-2.96e-02

Pos logit contributions
ignt+0.014
GN+0.014
wic+0.013
aughs+0.013
stories+0.013
Neg logit contributions
 mole-0.015
 ],-0.014
ゼウス-0.014
 ):-0.012
 ].-0.012
Token sweeping
Token ID18404
Feature activation+6.61e-02

Pos logit contributions
 mole+0.025
 ],+0.023
ゼウス+0.022
 ):+0.020
 ].+0.020
Neg logit contributions
ignt-0.019
GN-0.018
wic-0.018
aughs-0.018
stories-0.017
Token back
Token ID736
Feature activation+1.62e-01

Pos logit contributions
ignt+0.041
GN+0.040
wic+0.040
aughs+0.039
stories+0.038
Neg logit contributions
 mole-0.056
 ],-0.051
ゼウス-0.050
 ):-0.045
 ].-0.045
Token stories
Token ID3923
Feature activation-3.68e-01

Pos logit contributions
ignt+0.117
GN+0.115
wic+0.114
aughs+0.113
stories+0.108
Neg logit contributions
 mole-0.121
 ],-0.109
ゼウス-0.106
 ):-0.096
 ].-0.093
Token that
Token ID326
Feature activation+1.60e-01

Pos logit contributions
 mole+0.301
 ],+0.275
ゼウス+0.266
 ):+0.243
 ].+0.238
Neg logit contributions
ignt-0.240
GN-0.234
aughs-0.231
wic-0.230
stories-0.220
Token are
Token ID389
Feature activation+3.29e-01

Pos logit contributions
ignt+0.104
GN+0.102
wic+0.101
aughs+0.099
stories+0.096
Neg logit contributions
 mole-0.130
 ],-0.119
ゼウス-0.116
 ):-0.105
 ].-0.103
Token attached
Token ID7223
Feature activation-8.27e-02

Pos logit contributions
ignt+0.223
GN+0.219
wic+0.216
aughs+0.213
stories+0.207
Neg logit contributions
 mole-0.258
 ],-0.236
ゼウス-0.231
 ):-0.208
 ].-0.203
Token to
Token ID284
Feature activation+2.11e-01

Pos logit contributions
 mole+0.072
 ],+0.066
ゼウス+0.064
 ):+0.059
 ].+0.058
Neg logit contributions
ignt-0.049
GN-0.048
aughs-0.047
wic-0.047
stories-0.045
Token the
Token ID262
Feature activation+1.45e-01

Pos logit contributions
ignt+0.143
GN+0.140
wic+0.138
aughs+0.136
stories+0.132
Neg logit contributions
 mole-0.166
 ],-0.152
ゼウス-0.148
 ):-0.134
 ].-0.131
Token characters
Token ID3435
Feature activation-4.47e-01

Pos logit contributions
ignt+0.090
GN+0.088
wic+0.087
aughs+0.086
stories+0.083
Neg logit contributions
 mole-0.123
 ],-0.113
ゼウス-0.110
 ):-0.100
 ].-0.098
Token has
Token ID468
Feature activation+2.01e-01

Pos logit contributions
ignt+0.049
GN+0.048
wic+0.048
aughs+0.047
stories+0.046
Neg logit contributions
 mole-0.047
 ],-0.043
ゼウス-0.042
 ):-0.037
 ].-0.036
Token also
Token ID635
Feature activation-6.41e-02

Pos logit contributions
ignt+0.141
GN+0.138
wic+0.137
aughs+0.135
stories+0.131
Neg logit contributions
 mole-0.152
 ],-0.138
ゼウス-0.137
 ):-0.121
 ].-0.119
Token offered
Token ID4438
Feature activation+9.70e-02

Pos logit contributions
 mole+0.047
 ],+0.043
ゼウス+0.042
 ):+0.037
 ].+0.037
Neg logit contributions
ignt-0.047
GN-0.046
wic-0.045
aughs-0.045
stories-0.043
Token the
Token ID262
Feature activation-1.19e-03

Pos logit contributions
ignt+0.065
GN+0.063
wic+0.062
aughs+0.062
stories+0.060
Neg logit contributions
 mole-0.078
 ],-0.071
ゼウス-0.069
 ):-0.063
 ].-0.061
Token president
Token ID1893
Feature activation-1.21e-01

Pos logit contributions
 mole+0.001
 ],+0.001
ゼウス+0.001
 ):+0.001
 ].+0.001
Neg logit contributions
ignt-0.001
GN-0.001
wic-0.001
aughs-0.001
stories-0.001
Token praise
Token ID13463
Feature activation-6.82e-01

Pos logit contributions
 mole+0.097
 ],+0.088
ゼウス+0.086
 ):+0.078
 ].+0.076
Neg logit contributions
ignt-0.081
GN-0.080
wic-0.078
aughs-0.078
stories-0.075
Token.
Token ID13
Feature activation-2.46e-02

Pos logit contributions
 mole+0.515
 ],+0.468
ゼウス+0.454
 ):+0.413
 ].+0.402
Neg logit contributions
ignt-0.483
GN-0.474
wic-0.467
aughs-0.465
stories-0.449
Token Last
Token ID4586
Feature activation-2.05e-01

Pos logit contributions
 mole+0.022
 ],+0.020
ゼウス+0.020
 ):+0.018
 ].+0.018
Neg logit contributions
ignt-0.014
GN-0.014
wic-0.014
aughs-0.014
stories-0.013
Token week
Token ID1285
Feature activation-3.48e-01

Pos logit contributions
 mole+0.180
 ],+0.166
ゼウス+0.162
 ):+0.148
 ].+0.145
Neg logit contributions
ignt-0.121
GN-0.118
wic-0.116
aughs-0.115
stories-0.111
Token,
Token ID11
Feature activation-9.82e-02

Pos logit contributions
 mole+0.290
 ],+0.266
ゼウス+0.258
 ):+0.237
 ].+0.232
Neg logit contributions
ignt-0.221
GN-0.215
wic-0.212
aughs-0.210
stories-0.203
Token he
Token ID339
Feature activation+3.43e-02

Pos logit contributions
 mole+0.069
 ],+0.063
ゼウス+0.061
 ):+0.054
 ].+0.053
Neg logit contributions
ignt-0.075
GN-0.073
wic-0.072
aughs-0.072
stories-0.070
Token other
Token ID584
Feature activation-3.31e-02

Pos logit contributions
ignt+0.082
GN+0.080
wic+0.079
aughs+0.078
stories+0.076
Neg logit contributions
 mole-0.089
 ],-0.081
ゼウス-0.078
 ):-0.071
 ].-0.069
Token OD
Token ID31245
Feature activation+3.53e-01

Pos logit contributions
 mole+0.027
 ],+0.025
ゼウス+0.024
 ):+0.022
 ].+0.021
Neg logit contributions
ignt-0.022
GN-0.021
wic-0.021
aughs-0.021
stories-0.020
TokenIN
Token ID1268
Feature activation-1.74e-01

Pos logit contributions
ignt+0.257
GN+0.248
wic+0.248
aughs+0.247
stories+0.239
Neg logit contributions
 mole-0.263
 ],-0.237
ゼウス-0.230
 ):-0.208
 ].-0.203
Token based
Token ID1912
Feature activation+3.79e-01

Pos logit contributions
 mole+0.144
 ],+0.132
ゼウス+0.129
 ):+0.118
 ].+0.115
Neg logit contributions
ignt-0.111
GN-0.109
wic-0.107
aughs-0.106
stories-0.103
Token boot
Token ID6297
Feature activation-2.20e-01

Pos logit contributions
ignt+0.251
GN+0.247
wic+0.243
aughs+0.240
stories+0.233
Neg logit contributions
 mole-0.302
 ],-0.277
ゼウス-0.271
 ):-0.245
 ].-0.239
Tokenload
Token ID2220
Feature activation-1.10e+00

Pos logit contributions
 mole+0.205
 ],+0.189
ゼウス+0.185
 ):+0.171
 ].+0.168
Neg logit contributions
ignt-0.118
GN-0.115
wic-0.113
aughs-0.112
stories-0.107
Tokeners
Token ID364
Feature activation-2.76e-01

Pos logit contributions
 mole+0.972
 ],+0.894
ゼウス+0.866
 ):+0.805
 ].+0.789
Neg logit contributions
ignt-0.639
GN-0.633
wic-0.619
aughs-0.609
stories-0.586
Token,
Token ID11
Feature activation-4.42e-02

Pos logit contributions
 mole+0.237
 ],+0.217
ゼウス+0.212
 ):+0.194
 ].+0.190
Neg logit contributions
ignt-0.168
GN-0.164
wic-0.162
aughs-0.160
stories-0.155
Token so
Token ID523
Feature activation-9.41e-02

Pos logit contributions
 mole+0.035
 ],+0.032
ゼウス+0.032
 ):+0.028
 ].+0.028
Neg logit contributions
ignt-0.030
GN-0.029
wic-0.029
aughs-0.028
stories-0.027
Token when
Token ID618
Feature activation+1.86e-01

Pos logit contributions
 mole+0.075
 ],+0.069
ゼウス+0.067
 ):+0.061
 ].+0.059
Neg logit contributions
ignt-0.063
GN-0.061
wic-0.061
aughs-0.060
stories-0.058
Token the
Token ID262
Feature activation+1.74e-01

Pos logit contributions
ignt+0.122
GN+0.120
wic+0.118
aughs+0.117
stories+0.113
Neg logit contributions
 mole-0.149
 ],-0.137
ゼウス-0.134
 ):-0.121
 ].-0.118
Token
Token ID198
Feature activation-2.27e-01

Pos logit contributions
ignt+0.015
GN+0.015
wic+0.015
aughs+0.014
stories+0.014
Neg logit contributions
 mole-0.025
 ],-0.023
ゼウス-0.023
 ):-0.021
 ].-0.021
Token
Token ID198
Feature activation-2.89e-01

Pos logit contributions
 mole+0.224
 ],+0.207
ゼウス+0.205
 ):+0.188
 ].+0.185
Neg logit contributions
ignt-0.108
GN-0.107
wic-0.104
aughs-0.101
stories-0.098
TokenHe
Token ID1544
Feature activation-6.40e-01

Pos logit contributions
 mole+0.246
 ],+0.226
ゼウス+0.221
 ):+0.202
 ].+0.198
Neg logit contributions
ignt-0.178
GN-0.173
wic-0.171
aughs-0.170
stories-0.163
Token has
Token ID468
Feature activation-1.64e-01

Pos logit contributions
 mole+0.450
 ],+0.405
ゼウス+0.395
 ):+0.352
 ].+0.342
Neg logit contributions
ignt-0.488
GN-0.479
wic-0.474
aughs-0.470
stories-0.456
Token been
Token ID587
Feature activation-3.33e-01

Pos logit contributions
 mole+0.121
 ],+0.109
ゼウス+0.107
 ):+0.096
 ].+0.093
Neg logit contributions
ignt-0.119
GN-0.117
wic-0.115
aughs-0.114
stories-0.111
Token photographed
Token ID25880
Feature activation-9.54e-01

Pos logit contributions
 mole+0.242
 ],+0.219
ゼウス+0.215
 ):+0.192
 ].+0.186
Neg logit contributions
ignt-0.245
GN-0.240
wic-0.237
aughs-0.235
stories-0.228
Token receiving
Token ID6464
Feature activation+1.65e-01

Pos logit contributions
 mole+0.760
 ],+0.693
ゼウス+0.672
 ):+0.612
 ].+0.599
Neg logit contributions
ignt-0.639
GN-0.627
wic-0.618
aughs-0.613
stories-0.593
Token a
Token ID257
Feature activation+9.57e-02

Pos logit contributions
ignt+0.109
GN+0.107
wic+0.106
aughs+0.105
stories+0.101
Neg logit contributions
 mole-0.133
 ],-0.121
ゼウス-0.118
 ):-0.107
 ].-0.105
Token business
Token ID1597
Feature activation-3.74e-01

Pos logit contributions
ignt+0.063
GN+0.061
wic+0.060
aughs+0.060
stories+0.058
Neg logit contributions
 mole-0.078
 ],-0.071
ゼウス-0.069
 ):-0.063
 ].-0.062
Token award
Token ID5764
Feature activation-2.36e-01

Pos logit contributions
 mole+0.304
 ],+0.278
ゼウス+0.269
 ):+0.246
 ].+0.241
Neg logit contributions
ignt-0.245
GN-0.240
wic-0.236
aughs-0.235
stories-0.227
Token from
Token ID422
Feature activation+1.76e-01

Pos logit contributions
 mole+0.183
 ],+0.166
ゼウス+0.163
 ):+0.146
 ].+0.143
Neg logit contributions
ignt-0.163
GN-0.161
wic-0.158
aughs-0.156
stories-0.152
Token the
Token ID262
Feature activation-1.16e-01

Pos logit contributions
 mole+0.186
 ],+0.170
ゼウス+0.166
 ):+0.151
 ].+0.148
Neg logit contributions
ignt-0.142
GN-0.139
wic-0.137
aughs-0.135
stories-0.131
Token entire
Token ID2104
Feature activation-1.87e-01

Pos logit contributions
 mole+0.101
 ],+0.093
ゼウス+0.091
 ):+0.084
 ].+0.082
Neg logit contributions
ignt-0.069
GN-0.068
wic-0.067
aughs-0.066
stories-0.063
Token time
Token ID640
Feature activation-5.77e-01

Pos logit contributions
 mole+0.147
 ],+0.134
ゼウス+0.131
 ):+0.119
 ].+0.116
Neg logit contributions
ignt-0.128
GN-0.125
wic-0.123
aughs-0.122
stories-0.118
Token we
Token ID356
Feature activation-4.27e-01

Pos logit contributions
 mole+0.492
 ],+0.452
ゼウス+0.434
 ):+0.403
 ].+0.396
Neg logit contributions
ignt-0.356
GN-0.345
wic-0.341
aughs-0.340
stories-0.326
Token were
Token ID547
Feature activation-4.61e-01

Pos logit contributions
 mole+0.329
 ],+0.299
ゼウス+0.290
 ):+0.263
 ].+0.257
Neg logit contributions
ignt-0.298
GN-0.291
wic-0.287
aughs-0.285
stories-0.276
Token together
Token ID1978
Feature activation-1.27e+00

Pos logit contributions
 mole+0.372
 ],+0.340
ゼウス+0.329
 ):+0.301
 ].+0.294
Neg logit contributions
ignt-0.304
GN-0.297
wic-0.293
aughs-0.291
stories-0.281
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
 mole+1.069
 ],+0.985
ゼウス+0.941
 ):+0.872
 ].+0.865
Neg logit contributions
ignt-0.793
GN-0.769
wic-0.762
aughs-0.758
stories-0.724
Token I
Token ID314
Feature activation-5.21e-02

Pos logit contributions
 mole+0.086
 ],+0.079
ゼウス+0.078
 ):+0.070
 ].+0.069
Neg logit contributions
ignt-0.064
GN-0.063
wic-0.062
aughs-0.062
stories-0.060
Token was
Token ID373
Feature activation-3.69e-01

Pos logit contributions
 mole+0.039
ゼウス+0.036
 ],+0.035
 ):+0.031
 ].+0.030
Neg logit contributions
ignt-0.036
GN-0.036
wic-0.036
aughs-0.035
stories-0.034
Token broken
Token ID5445
Feature activation+8.20e-01

Pos logit contributions
 mole+0.289
 ],+0.264
ゼウス+0.260
 ):+0.233
 ].+0.228
Neg logit contributions
ignt-0.250
GN-0.245
wic-0.241
aughs-0.238
stories-0.232
Token.
Token ID13
Feature activation-5.82e-02

Pos logit contributions
ignt+0.496
GN+0.486
wic+0.478
aughs+0.472
stories+0.456
Neg logit contributions
 mole-0.706
 ],-0.648
ゼウス-0.635
 ):-0.579
 ].-0.567
Token DC
Token ID6257
Feature activation+6.47e-01

Pos logit contributions
ignt+0.402
aughs+0.392
GN+0.391
wic+0.386
stories+0.372
Neg logit contributions
 mole-0.449
 ],-0.410
ゼウス-0.393
 ):-0.364
 ].-0.354
Token:
Token ID25
Feature activation+6.13e-01

Pos logit contributions
ignt+0.505
GN+0.495
aughs+0.489
wic+0.488
stories+0.473
Neg logit contributions
 mole-0.446
 ],-0.395
ゼウス-0.384
 ):-0.350
 ].-0.333
Token US
Token ID1294
Feature activation+2.96e-01

Pos logit contributions
ignt+0.467
GN+0.456
wic+0.451
aughs+0.449
stories+0.434
Neg logit contributions
 mole-0.433
 ],-0.392
ゼウス-0.378
 ):-0.341
 ].-0.332
Token Department
Token ID2732
Feature activation+3.53e-01

Pos logit contributions
ignt+0.164
GN+0.159
wic+0.157
aughs+0.156
stories+0.149
Neg logit contributions
 mole-0.270
 ],-0.250
ゼウス-0.244
 ):-0.225
 ].-0.221
Token of
Token ID286
Feature activation+2.15e-01

Pos logit contributions
ignt+0.206
GN+0.205
wic+0.201
aughs+0.195
stories+0.192
Neg logit contributions
 mole-0.309
 ],-0.282
ゼウス-0.279
 ):-0.252
 ].-0.247
Token Transportation
Token ID15198
Feature activation-9.86e-01

Pos logit contributions
ignt+0.127
GN+0.124
wic+0.121
aughs+0.121
stories+0.116
Neg logit contributions
 mole-0.187
 ],-0.174
ゼウス-0.168
 ):-0.156
 ].-0.153
Token,
Token ID11
Feature activation+3.16e-01

Pos logit contributions
 mole+0.830
 ],+0.764
ゼウス+0.732
 ):+0.681
 ].+0.670
Neg logit contributions
ignt-0.614
GN-0.593
aughs-0.588
wic-0.579
stories-0.561
Token National
Token ID2351
Feature activation-1.07e+00

Pos logit contributions
ignt+0.212
GN+0.208
wic+0.205
aughs+0.203
stories+0.197
Neg logit contributions
 mole-0.250
 ],-0.229
ゼウス-0.223
 ):-0.202
 ].-0.198
Token Highway
Token ID14914
Feature activation-2.58e-01

Pos logit contributions
 mole+0.733
 ],+0.658
ゼウス+0.635
 ):+0.571
 ].+0.556
Neg logit contributions
ignt-0.831
GN-0.817
wic-0.804
aughs-0.801
stories-0.776
Token Traffic
Token ID23624
Feature activation-6.54e-01

Pos logit contributions
 mole+0.184
 ],+0.168
ゼウス+0.160
 ):+0.146
 ].+0.143
Neg logit contributions
ignt-0.193
GN-0.190
aughs-0.186
wic-0.186
stories-0.180
Token Safety
Token ID11233
Feature activation-7.98e-01

Pos logit contributions
 mole+0.496
 ],+0.452
ゼウス+0.435
 ):+0.396
 ].+0.387
Neg logit contributions
ignt-0.462
GN-0.455
aughs-0.444
wic-0.444
stories-0.430
Token,
Token ID11
Feature activation-1.22e-01

Pos logit contributions
 mole+0.703
 ],+0.648
ゼウス+0.625
 ):+0.577
 ].+0.569
Neg logit contributions
ignt-0.518
GN-0.507
wic-0.501
aughs-0.497
stories-0.479
Token California
Token ID3442
Feature activation-2.86e-01

Pos logit contributions
 mole+0.084
 ],+0.076
ゼウス+0.073
 ):+0.066
 ].+0.064
Neg logit contributions
ignt-0.095
GN-0.093
wic-0.092
aughs-0.092
stories-0.088
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
 mole+0.241
 ],+0.224
ゼウス+0.218
 ):+0.200
 ].+0.198
Neg logit contributions
ignt-0.175
GN-0.171
wic-0.169
aughs-0.166
stories-0.160
Token Hide
Token ID10415
Feature activation+1.52e-01

Pos logit contributions
ignt+0.010
GN+0.009
wic+0.009
aughs+0.009
stories+0.009
Neg logit contributions
 mole-0.008
 ],-0.007
ゼウス-0.007
 ):-0.006
 ].-0.006
Token Caption
Token ID11260
Feature activation-7.97e-01

Pos logit contributions
GN+0.180
ignt+0.180
wic+0.179
aughs+0.176
stories+0.175
Neg logit contributions
 mole-0.041
ゼウス-0.029
 ],-0.028
 ):-0.016
 ].-0.013
Token 10
Token ID838
Feature activation-1.38e+00

Pos logit contributions
 mole+0.704
 ],+0.645
ゼウス+0.643
 ):+0.576
skirts+0.569
Neg logit contributions
GN-0.448
wic-0.444
ignt-0.443
aughs-0.428
stories-0.423
Token of
Token ID286
Feature activation-1.53e-01

Pos logit contributions
 mole+1.205
 ],+1.104
ゼウス+1.092
 ):+0.988
 ].+0.967
Neg logit contributions
ignt-0.805
GN-0.801
wic-0.786
aughs-0.766
stories-0.753
Token 15
Token ID1315
Feature activation-5.55e-01

Pos logit contributions
 mole+0.149
ゼウス+0.137
 ],+0.137
 ):+0.124
 ].+0.122
Neg logit contributions
ignt-0.074
GN-0.074
wic-0.072
aughs-0.070
stories-0.068
Token Photos
Token ID9434
Feature activation+2.56e-01

Pos logit contributions
 mole+0.499
ゼウス+0.451
 ],+0.451
 ):+0.405
 ].+0.397
Neg logit contributions
GN-0.310
ignt-0.305
wic-0.305
aughs-0.296
stories-0.293
Token:
Token ID25
Feature activation-3.23e-01

Pos logit contributions
GN+0.168
wic+0.164
ignt+0.163
stories+0.158
aughs+0.158
Neg logit contributions
 mole-0.205
ゼウス-0.186
 ],-0.184
 ):-0.161
skirts-0.160
Token Carly
Token ID35982
Feature activation-7.94e-01

Pos logit contributions
 mole+0.244
 ],+0.219
ゼウス+0.217
 ):+0.191
 ].+0.187
Neg logit contributions
ignt-0.226
GN-0.225
wic-0.222
aughs-0.218
stories-0.214