TOP ACTIVATIONS
MAX = 3.038

such boundaries . The Lac anian do jo .
in a time when Lac anian clinical practice is not even
an que in particular to recapt ure their early - season
to be a practicing Lac anian psych oan alyst to appreciate
it means to practice Lac anian psycho analysis . This is
scope , and practice Lac anian psycho analysis every day .
few that do practice Lac anian psycho analysis in such settings
scrap . Article Continued Below The T
, La R oche said . Self
My eyes are still wide open . I wasn 't expecting
last . Article Continued Below T od
game a ' Ter rible ' score of 2 . 0
oor , S ., J aff ee , S . R
mess seriously Article Continued Below
browser is ' holding back ' the web A Downing Street
report by the Senate Permanent Subcommittee on Investigations found , the
of the ' b anned ' Catalan independence referendum this Sunday
was left alone in the corridor , wondering ( do captains
ard , a post doctoral fellow in Chang s
a ' skin - deep ' attribute that was not rooted

MIN ACTIVATIONS
MIN = -3.277

( old ) website . Nevertheless we worked hard to cover
fired electricity generation produces more lif ecycle greenhouse gas emissions than
submarines would have extended service lif etimes , from 25 to
have been astonished by the lif el ike movements .
hard to get a very lif el ike , fluid motion
. pdf © Copyright 2018 Red s
if the Kernel cannot correctly mount it , it will panic
million ). Please click here for more Dolphins
happens in Greater Cincinnati . Please try again soon , or
anywhere . Copyright © 2019 The Washington Times ,
ine . Copyright © 2016 Else vier B .
MB 8 192 MB Texture Units 144 64 104 80 120
attacks and the French supermarket chain Car ref our took Danish
our work is appreciated . Please click below to become an
On 10 October 1941 those fronts were merged into the Western
Continue reading the main story Please verify you 're not a
Photos courtesy of © K elli ( fl ickr
( © Copyright 2010 The Associated Press .
Continue reading the main story Please verify you 're not a
Continue reading the main story Please verify you 're not a

INTERVAL 1.459 - 3.038
CONTAINS 0.426%

Mr . Ahmad ine jad said that confrontation with Israel
, while Alex ei Em elin , who often played on
been on the endangered species list for decades -- threatened by
f d | + -------------------- --+ | [ address of system
was signed without any formal resolution passed by the Board of

INTERVAL -0.120 - 1.459
CONTAINS 63.985%

eff . org / d ee pl inks / 2016 /
for al - Sh ab ab fighters . Al
, just maybe , when they have issues that no one
why don 't you figure that out while I go talk
the optical bench says you could . But could

INTERVAL -1.699 - -0.120
CONTAINS 35.453%

ty to order prosecutors to give them access to all the
, as iceberg s fall into the sea and glaciers retreat
Crist ina Fern á nd ez de Kir ch ner
Dallas , Palm Beach , Florida , Moreno Valley , California
a series which begins with the low - kick specialist ,

INTERVAL -3.277 - -1.699
CONTAINS 0.136%

, that used Sar in nerve gas during a recent incident
you to leave this here store , so a good Christian
son , told the Tampa Bay Times he " could n
School Should Start Later For Te ens ' Health
you could text message a store instead of calling them ?
Token such
Token ID884
Feature activation+2.32e-01

Pos logit contributions
 Advertisement+0.230
 Kitt+0.222
bryce+0.218
Reviewer+0.195
cars+0.190
Neg logit contributions
 apologizing-0.195
 recomp-0.185
 challeng-0.185
aign-0.181
 forgiven-0.179
Token boundaries
Token ID13215
Feature activation+3.32e-01

Pos logit contributions
 apologizing+0.154
 recomp+0.146
 challeng+0.146
aign+0.143
 forgiven+0.141
Neg logit contributions
 Advertisement-0.194
 Kitt-0.187
bryce-0.184
Reviewer-0.165
cars-0.161
Token.
Token ID13
Feature activation+1.96e-01

Pos logit contributions
 apologizing+0.242
 recomp+0.230
 challeng+0.229
aign+0.226
 forgiven+0.223
Neg logit contributions
 Advertisement-0.256
 Kitt-0.246
bryce-0.242
Reviewer-0.215
cars-0.209
Token The
Token ID383
Feature activation+2.09e-01

Pos logit contributions
 apologizing+0.133
 recomp+0.126
 challeng+0.125
aign+0.124
 forgiven+0.122
Neg logit contributions
 Advertisement-0.160
 Kitt-0.154
bryce-0.153
Reviewer-0.137
cars-0.133
Token Lac
Token ID23115
Feature activation+6.04e-01

Pos logit contributions
 apologizing+0.141
 recomp+0.134
 challeng+0.133
aign+0.130
 forgiven+0.129
Neg logit contributions
 Advertisement-0.173
 Kitt-0.167
bryce-0.165
Reviewer-0.147
cars-0.144
Tokenanian
Token ID38336
Feature activation+3.04e+00

Pos logit contributions
 apologizing+0.349
 recomp+0.329
 challeng+0.327
aign+0.318
 forgiven+0.315
Neg logit contributions
 Advertisement-0.556
 Kitt-0.538
bryce-0.531
Reviewer-0.482
cars-0.472
Token do
Token ID466
Feature activation+1.43e-01

Pos logit contributions
 apologizing+2.018
 recomp+1.923
 challeng+1.915
aign+1.860
 forgiven+1.845
Neg logit contributions
 Advertisement-2.537
 Kitt-2.429
bryce-2.418
Reviewer-2.167
cars-2.114
Tokenjo
Token ID7639
Feature activation-1.16e-01

Pos logit contributions
 apologizing+0.085
 recomp+0.080
 challeng+0.080
aign+0.078
 forgiven+0.077
Neg logit contributions
 Advertisement-0.129
 Kitt-0.125
bryce-0.123
Reviewer-0.112
cars-0.109
Token.
Token ID13
Feature activation+6.71e-02

Pos logit contributions
 Advertisement+0.080
 Kitt+0.076
bryce+0.075
Reviewer+0.066
cars+0.064
Neg logit contributions
 apologizing-0.094
 recomp-0.091
 challeng-0.090
aign-0.089
 forgiven-0.088
Token
Token ID198
Feature activation+1.67e-01

Pos logit contributions
 apologizing+0.046
 recomp+0.044
 challeng+0.043
aign+0.043
 forgiven+0.042
Neg logit contributions
 Advertisement-0.055
 Kitt-0.053
bryce-0.052
Reviewer-0.046
cars-0.045
Token
Token ID198
Feature activation+2.48e-02

Pos logit contributions
 apologizing+0.094
 recomp+0.088
aign+0.086
 forgiven+0.085
 reload+0.083
Neg logit contributions
 Advertisement-0.155
 Kitt-0.151
bryce-0.150
Reviewer-0.136
 Carbuncle-0.133
Token in
Token ID287
Feature activation+1.71e-03

Pos logit contributions
 Advertisement+0.414
 Kitt+0.396
bryce+0.389
Reviewer+0.341
cars+0.331
Neg logit contributions
 apologizing-0.467
 recomp-0.447
 challeng-0.446
aign-0.439
 forgiven-0.434
Token a
Token ID257
Feature activation-9.67e-02

Pos logit contributions
 apologizing+0.001
 recomp+0.001
 challeng+0.001
aign+0.001
 forgiven+0.001
Neg logit contributions
 Advertisement-0.001
 Kitt-0.001
bryce-0.001
Reviewer-0.001
cars-0.001
Token time
Token ID640
Feature activation+1.99e-01

Pos logit contributions
 Advertisement+0.074
 Kitt+0.072
bryce+0.070
Reviewer+0.062
cars+0.061
Neg logit contributions
 apologizing-0.070
 recomp-0.067
 challeng-0.067
aign-0.066
 forgiven-0.065
Token when
Token ID618
Feature activation+1.78e-01

Pos logit contributions
 apologizing+0.150
 recomp+0.144
 challeng+0.143
aign+0.141
 forgiven+0.139
Neg logit contributions
 Advertisement-0.149
 Kitt-0.143
bryce-0.140
Reviewer-0.124
cars-0.121
Token Lac
Token ID23115
Feature activation+4.79e-01

Pos logit contributions
 apologizing+0.137
 recomp+0.131
 challeng+0.130
aign+0.128
 forgiven+0.127
Neg logit contributions
 Advertisement-0.130
 Kitt-0.124
bryce-0.123
Reviewer-0.108
cars-0.105
Tokenanian
Token ID38336
Feature activation+2.97e+00

Pos logit contributions
 apologizing+0.297
 recomp+0.279
aign+0.278
 challeng+0.272
 forgiven+0.271
Neg logit contributions
 Advertisement-0.422
 Kitt-0.409
bryce-0.402
Reviewer-0.365
 Carbuncle-0.354
Token clinical
Token ID8668
Feature activation-6.12e-01

Pos logit contributions
 apologizing+2.187
 recomp+2.089
 challeng+2.080
aign+2.031
 forgiven+2.008
Neg logit contributions
 Advertisement-2.265
 Kitt-2.161
bryce-2.158
Reviewer-1.910
 Carbuncle-1.856
Token practice
Token ID3357
Feature activation+2.82e-01

Pos logit contributions
 Advertisement+0.551
 Kitt+0.533
bryce+0.524
Reviewer+0.474
cars+0.465
Neg logit contributions
 apologizing-0.367
 recomp-0.346
 challeng-0.345
aign-0.337
 forgiven-0.333
Token is
Token ID318
Feature activation-1.45e-01

Pos logit contributions
 apologizing+0.243
 recomp+0.234
 challeng+0.232
aign+0.229
 forgiven+0.227
Neg logit contributions
 Advertisement-0.179
 Kitt-0.171
bryce-0.168
Reviewer-0.147
 Carbuncle-0.140
Token not
Token ID407
Feature activation-3.31e-02

Pos logit contributions
 Advertisement+0.118
 Kitt+0.114
bryce+0.112
Reviewer+0.100
cars+0.098
Neg logit contributions
 apologizing-0.099
 recomp-0.094
 challeng-0.094
aign-0.092
 forgiven-0.091
Token even
Token ID772
Feature activation-3.78e-01

Pos logit contributions
 Advertisement+0.027
 Kitt+0.026
bryce+0.026
Reviewer+0.023
cars+0.023
Neg logit contributions
 apologizing-0.022
 recomp-0.021
 challeng-0.021
aign-0.021
 forgiven-0.020
Tokenan
Token ID272
Feature activation-9.03e-02

Pos logit contributions
 apologizing+0.223
aign+0.217
 recomp+0.217
 challeng+0.212
 forgiven+0.210
Neg logit contributions
 Advertisement-0.175
 Kitt-0.165
bryce-0.165
Reviewer-0.147
 Carbuncle-0.137
Tokenque
Token ID4188
Feature activation+2.12e-01

Pos logit contributions
 Advertisement+0.068
 Kitt+0.065
bryce+0.064
Reviewer+0.057
cars+0.055
Neg logit contributions
 apologizing-0.068
 recomp-0.064
 challeng-0.064
aign-0.063
 forgiven-0.062
Token in
Token ID287
Feature activation+2.16e-01

Pos logit contributions
 apologizing+0.168
 recomp+0.161
 challeng+0.160
aign+0.158
 forgiven+0.156
Neg logit contributions
 Advertisement-0.149
 Kitt-0.143
bryce-0.140
Reviewer-0.123
cars-0.120
Token particular
Token ID1948
Feature activation+9.52e-01

Pos logit contributions
 apologizing+0.160
 recomp+0.153
 challeng+0.153
aign+0.150
 forgiven+0.148
Neg logit contributions
 Advertisement-0.163
 Kitt-0.157
bryce-0.154
Reviewer-0.136
cars-0.133
Token to
Token ID284
Feature activation+4.67e-02

Pos logit contributions
 apologizing+0.733
 recomp+0.700
 challeng+0.696
aign+0.684
 forgiven+0.680
Neg logit contributions
 Advertisement-0.696
 Kitt-0.668
bryce-0.657
Reviewer-0.580
cars-0.563
Token recapt
Token ID36791
Feature activation+2.95e+00

Pos logit contributions
 apologizing+0.030
 challeng+0.028
 recomp+0.028
aign+0.028
 helicop+0.027
Neg logit contributions
 Advertisement-0.040
 Kitt-0.039
bryce-0.038
Reviewer-0.034
cars-0.034
Tokenure
Token ID495
Feature activation+9.72e-01

Pos logit contributions
 apologizing+1.414
 recomp+1.363
aign+1.360
 reload+1.298
 pardon+1.293
Neg logit contributions
 Advertisement-2.936
 Kitt-2.816
bryce-2.814
Reviewer-2.649
cars-2.531
Token their
Token ID511
Feature activation+3.63e-01

Pos logit contributions
 apologizing+0.760
 recomp+0.730
 challeng+0.725
aign+0.714
 forgiven+0.708
Neg logit contributions
 Advertisement-0.698
 Kitt-0.665
bryce-0.658
Reviewer-0.581
cars-0.561
Token early
Token ID1903
Feature activation+1.34e-01

Pos logit contributions
 apologizing+0.244
 challeng+0.232
 recomp+0.232
aign+0.227
 helicop+0.224
Neg logit contributions
 Advertisement-0.299
 Kitt-0.289
bryce-0.283
Reviewer-0.253
cars-0.248
Token-
Token ID12
Feature activation+3.61e-01

Pos logit contributions
 apologizing+0.093
 challeng+0.089
 recomp+0.088
aign+0.087
 helicop+0.086
Neg logit contributions
 Advertisement-0.107
 Kitt-0.104
bryce-0.101
Reviewer-0.090
cars-0.089
Tokenseason
Token ID6230
Feature activation-1.30e-01

Pos logit contributions
 apologizing+0.226
 recomp+0.213
 challeng+0.213
aign+0.208
 helicop+0.205
Neg logit contributions
 Advertisement-0.316
 Kitt-0.305
bryce-0.301
Reviewer-0.271
cars-0.266
Token to
Token ID284
Feature activation-2.17e-01

Pos logit contributions
 Advertisement+0.230
 Kitt+0.220
bryce+0.216
Reviewer+0.190
cars+0.185
Neg logit contributions
 apologizing-0.250
 recomp-0.239
 challeng-0.239
aign-0.235
 forgiven-0.232
Token be
Token ID307
Feature activation-8.04e-02

Pos logit contributions
 Advertisement+0.185
 Kitt+0.179
bryce+0.176
Reviewer+0.158
cars+0.155
Neg logit contributions
 apologizing-0.139
 challeng-0.132
 recomp-0.132
aign-0.129
 helicop-0.127
Token a
Token ID257
Feature activation-1.66e-01

Pos logit contributions
 Advertisement+0.064
 Kitt+0.062
bryce+0.061
Reviewer+0.054
cars+0.053
Neg logit contributions
 apologizing-0.056
 recomp-0.053
 challeng-0.053
aign-0.052
 helicop-0.051
Token practicing
Token ID18207
Feature activation-1.64e-01

Pos logit contributions
 Advertisement+0.136
 Kitt+0.132
bryce+0.129
Reviewer+0.116
cars+0.113
Neg logit contributions
 apologizing-0.112
 recomp-0.107
 challeng-0.106
aign-0.105
 forgiven-0.103
Token Lac
Token ID23115
Feature activation+2.86e-01

Pos logit contributions
 Advertisement+0.135
 Kitt+0.130
bryce+0.128
Reviewer+0.115
cars+0.112
Neg logit contributions
 apologizing-0.111
 recomp-0.105
 challeng-0.105
aign-0.103
 forgiven-0.102
Tokenanian
Token ID38336
Feature activation+2.95e+00

Pos logit contributions
 apologizing+0.171
 recomp+0.160
aign+0.159
 challeng+0.158
 forgiven+0.155
Neg logit contributions
 Advertisement-0.259
 Kitt-0.251
bryce-0.247
Reviewer-0.224
cars-0.219
Token psych
Token ID3795
Feature activation-1.41e+00

Pos logit contributions
 apologizing+2.240
 recomp+2.127
 challeng+2.115
aign+2.092
 forgiven+2.054
Neg logit contributions
 Advertisement-2.179
 Kitt-2.085
bryce-2.068
Reviewer-1.825
cars-1.785
Tokenoan
Token ID24611
Feature activation-5.33e-01

Pos logit contributions
 Advertisement+0.908
 Kitt+0.865
bryce+0.848
Reviewer+0.734
cars+0.709
Neg logit contributions
 apologizing-1.202
 recomp-1.155
 challeng-1.153
aign-1.134
 forgiven-1.124
Tokenalyst
Token ID21470
Feature activation-9.13e-01

Pos logit contributions
 Advertisement+0.617
 Kitt+0.603
bryce+0.595
Reviewer+0.553
cars+0.543
Neg logit contributions
 apologizing-0.183
 recomp-0.163
 challeng-0.161
aign-0.157
 forgiven-0.153
Token to
Token ID284
Feature activation-1.05e-01

Pos logit contributions
 Advertisement+0.645
 Kitt+0.616
bryce+0.605
Reviewer+0.530
cars+0.515
Neg logit contributions
 apologizing-0.723
 recomp-0.693
 challeng-0.692
aign-0.679
 forgiven-0.673
Token appreciate
Token ID9144
Feature activation-1.13e+00

Pos logit contributions
 Advertisement+0.088
 Kitt+0.085
bryce+0.084
Reviewer+0.075
cars+0.073
Neg logit contributions
 apologizing-0.069
 recomp-0.066
 challeng-0.066
aign-0.064
 forgiven-0.064
Token it
Token ID340
Feature activation-2.22e-01

Pos logit contributions
 Advertisement+0.342
 Kitt+0.326
bryce+0.322
Reviewer+0.283
cars+0.274
Neg logit contributions
 apologizing-0.385
 recomp-0.369
 challeng-0.368
aign-0.361
 forgiven-0.358
Token means
Token ID1724
Feature activation-6.93e-02

Pos logit contributions
 Advertisement+0.147
 Kitt+0.141
bryce+0.137
Reviewer+0.119
cars+0.116
Neg logit contributions
 apologizing-0.186
 recomp-0.178
 challeng-0.178
aign-0.175
 forgiven-0.173
Token to
Token ID284
Feature activation-2.49e-01

Pos logit contributions
 Advertisement+0.049
 Kitt+0.047
bryce+0.046
Reviewer+0.041
cars+0.039
Neg logit contributions
 apologizing-0.055
 recomp-0.052
 challeng-0.052
aign-0.051
 forgiven-0.051
Token practice
Token ID3357
Feature activation-1.55e-02

Pos logit contributions
 Advertisement+0.216
 Kitt+0.208
bryce+0.205
Reviewer+0.185
cars+0.181
Neg logit contributions
 apologizing-0.157
 recomp-0.148
 challeng-0.148
aign-0.145
 forgiven-0.143
Token Lac
Token ID23115
Feature activation+3.63e-01

Pos logit contributions
 Advertisement+0.011
 Kitt+0.011
bryce+0.011
Reviewer+0.009
cars+0.009
Neg logit contributions
 apologizing-0.012
 recomp-0.012
 challeng-0.012
aign-0.011
 forgiven-0.011
Tokenanian
Token ID38336
Feature activation+2.88e+00

Pos logit contributions
 apologizing+0.220
 recomp+0.207
 challeng+0.207
aign+0.203
 forgiven+0.200
Neg logit contributions
 Advertisement-0.325
 Kitt-0.314
bryce-0.310
Reviewer-0.280
cars-0.274
Token psycho
Token ID30731
Feature activation-1.48e+00

Pos logit contributions
 apologizing+2.073
 recomp+1.968
 challeng+1.961
aign+1.916
 forgiven+1.898
Neg logit contributions
 Advertisement-2.244
 Kitt-2.156
bryce-2.154
Reviewer-1.906
 Carbuncle-1.855
Tokenanalysis
Token ID20930
Feature activation+2.58e-01

Pos logit contributions
 Advertisement+1.599
 Kitt+1.550
bryce+1.531
Reviewer+1.416
cars+1.395
Neg logit contributions
 apologizing-0.608
 recomp-0.566
 challeng-0.560
aign-0.540
 forgiven-0.535
Token.
Token ID13
Feature activation-8.25e-02

Pos logit contributions
 apologizing+0.188
 recomp+0.179
 challeng+0.179
aign+0.176
 forgiven+0.174
Neg logit contributions
 Advertisement-0.198
 Kitt-0.190
bryce-0.187
Reviewer-0.166
cars-0.162
Token This
Token ID770
Feature activation-3.28e-01

Pos logit contributions
 Advertisement+0.063
 Kitt+0.060
bryce+0.060
Reviewer+0.053
cars+0.051
Neg logit contributions
 apologizing-0.061
 recomp-0.058
 challeng-0.057
aign-0.057
 forgiven-0.056
Token is
Token ID318
Feature activation+2.04e-01

Pos logit contributions
 Advertisement+0.216
 Kitt+0.206
bryce+0.201
Reviewer+0.175
cars+0.169
Neg logit contributions
 apologizing-0.275
 recomp-0.264
 challeng-0.264
aign-0.260
 forgiven-0.257
Token scope
Token ID8354
Feature activation+3.12e-01

Pos logit contributions
 apologizing+0.098
 recomp+0.092
 challeng+0.092
aign+0.091
 forgiven+0.089
Neg logit contributions
 Advertisement-0.132
 Kitt-0.128
bryce-0.125
Reviewer-0.113
cars-0.111
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
 apologizing+0.228
 recomp+0.218
 challeng+0.217
aign+0.214
 forgiven+0.212
Neg logit contributions
 Advertisement-0.240
 Kitt-0.230
bryce-0.227
Reviewer-0.201
cars-0.195
Token and
Token ID290
Feature activation-9.46e-02

Pos logit contributions
 Advertisement+0.093
 Kitt+0.090
bryce+0.088
Reviewer+0.079
cars+0.077
Neg logit contributions
 apologizing-0.080
 recomp-0.076
 challeng-0.076
aign-0.075
 helicop-0.074
Token practice
Token ID3357
Feature activation+1.88e-01

Pos logit contributions
 Advertisement+0.078
 Kitt+0.075
bryce+0.074
Reviewer+0.066
cars+0.065
Neg logit contributions
 apologizing-0.064
 recomp-0.061
 challeng-0.061
aign-0.059
 forgiven-0.059
Token Lac
Token ID23115
Feature activation+6.17e-01

Pos logit contributions
 apologizing+0.144
 recomp+0.138
 challeng+0.137
aign+0.134
 forgiven+0.133
Neg logit contributions
 Advertisement-0.139
 Kitt-0.133
bryce-0.132
Reviewer-0.117
cars-0.113
Tokenanian
Token ID38336
Feature activation+2.87e+00

Pos logit contributions
 apologizing+0.337
 recomp+0.315
 challeng+0.311
aign+0.309
 forgiven+0.303
Neg logit contributions
 Advertisement-0.589
 Kitt-0.572
bryce-0.564
Reviewer-0.513
cars-0.503
Token psycho
Token ID30731
Feature activation-1.09e+00

Pos logit contributions
 apologizing+2.289
 recomp+2.193
 challeng+2.189
aign+2.124
 forgiven+2.119
Neg logit contributions
 Advertisement-2.013
 Kitt-1.913
bryce-1.907
Reviewer-1.671
 Carbuncle-1.613
Tokenanalysis
Token ID20930
Feature activation+3.50e-01

Pos logit contributions
 Advertisement+1.196
 Kitt+1.160
bryce+1.148
Reviewer+1.060
cars+1.044
Neg logit contributions
 apologizing-0.438
 recomp-0.403
 challeng-0.403
aign-0.385
 helicop-0.378
Token every
Token ID790
Feature activation+4.37e-01

Pos logit contributions
 apologizing+0.254
 recomp+0.243
 challeng+0.242
aign+0.236
 forgiven+0.235
Neg logit contributions
 Advertisement-0.271
 Kitt-0.261
bryce-0.259
Reviewer-0.230
 Carbuncle-0.223
Token day
Token ID1110
Feature activation-4.83e-02

Pos logit contributions
 apologizing+0.309
 recomp+0.294
 challeng+0.293
aign+0.288
 forgiven+0.284
Neg logit contributions
 Advertisement-0.347
 Kitt-0.333
bryce-0.327
Reviewer-0.292
cars-0.285
Token.
Token ID13
Feature activation+5.03e-02

Pos logit contributions
 Advertisement+0.037
 Kitt+0.035
bryce+0.034
Reviewer+0.030
cars+0.030
Neg logit contributions
 apologizing-0.036
 recomp-0.034
 challeng-0.034
aign-0.034
 forgiven-0.033
Token few
Token ID1178
Feature activation-5.32e-01

Pos logit contributions
 Advertisement+1.163
 Kitt+1.122
bryce+1.097
Reviewer+0.971
cars+0.952
Neg logit contributions
 apologizing-1.129
 challeng-1.079
 recomp-1.076
aign-1.060
 forgiven-1.047
Token that
Token ID326
Feature activation+5.59e-02

Pos logit contributions
 Advertisement+0.392
 Kitt+0.375
bryce+0.369
Reviewer+0.326
cars+0.317
Neg logit contributions
 apologizing-0.407
 recomp-0.389
 challeng-0.388
aign-0.381
 forgiven-0.377
Token do
Token ID466
Feature activation-5.53e-02

Pos logit contributions
 apologizing+0.040
 recomp+0.038
 challeng+0.038
aign+0.037
 helicop+0.037
Neg logit contributions
 Advertisement-0.044
 Kitt-0.042
bryce-0.042
Reviewer-0.037
cars-0.036
Token practice
Token ID3357
Feature activation-1.64e-01

Pos logit contributions
 Advertisement+0.046
 Kitt+0.044
bryce+0.043
Reviewer+0.039
cars+0.038
Neg logit contributions
 apologizing-0.037
 recomp-0.035
 challeng-0.035
aign-0.034
 forgiven-0.034
Token Lac
Token ID23115
Feature activation+4.76e-01

Pos logit contributions
 Advertisement+0.126
 Kitt+0.120
bryce+0.119
Reviewer+0.106
cars+0.103
Neg logit contributions
 apologizing-0.120
 recomp-0.115
 challeng-0.115
aign-0.112
 forgiven-0.111
Tokenanian
Token ID38336
Feature activation+2.85e+00

Pos logit contributions
 apologizing+0.262
 recomp+0.245
 challeng+0.243
aign+0.240
 forgiven+0.235
Neg logit contributions
 Advertisement-0.452
 Kitt-0.438
bryce-0.432
Reviewer-0.393
cars-0.385
Token psycho
Token ID30731
Feature activation-1.33e+00

Pos logit contributions
 apologizing+2.188
 recomp+2.089
 challeng+2.088
aign+2.035
 forgiven+2.021
Neg logit contributions
 Advertisement-2.083
 Kitt-1.990
bryce-1.983
Reviewer-1.754
 Carbuncle-1.688
Tokenanalysis
Token ID20930
Feature activation+3.82e-01

Pos logit contributions
 Advertisement+1.462
 Kitt+1.419
bryce+1.403
Reviewer+1.297
cars+1.276
Neg logit contributions
 apologizing-0.536
 recomp-0.494
 challeng-0.491
aign-0.472
 forgiven-0.464
Token in
Token ID287
Feature activation+2.35e-02

Pos logit contributions
 apologizing+0.288
 recomp+0.275
 challeng+0.273
aign+0.268
 forgiven+0.267
Neg logit contributions
 Advertisement-0.287
 Kitt-0.276
bryce-0.272
Reviewer-0.243
cars-0.233
Token such
Token ID884
Feature activation+1.71e-01

Pos logit contributions
 apologizing+0.018
 recomp+0.017
 challeng+0.017
aign+0.017
 forgiven+0.017
Neg logit contributions
 Advertisement-0.017
 Kitt-0.016
bryce-0.016
Reviewer-0.014
cars-0.014
Token settings
Token ID6460
Feature activation+1.98e-02

Pos logit contributions
 apologizing+0.132
 recomp+0.126
 challeng+0.126
aign+0.124
 forgiven+0.123
Neg logit contributions
 Advertisement-0.124
 Kitt-0.119
bryce-0.117
Reviewer-0.103
cars-0.100
Token scrap
Token ID15881
Feature activation-4.57e-01

Pos logit contributions
 Advertisement+0.054
 Kitt+0.052
bryce+0.051
Reviewer+0.045
cars+0.044
Neg logit contributions
 apologizing-0.060
 recomp-0.058
 challeng-0.058
aign-0.057
 forgiven-0.056
Token.
Token ID13
Feature activation+9.98e-02

Pos logit contributions
 Advertisement+0.346
 Kitt+0.333
bryce+0.326
Reviewer+0.290
 Carbuncle+0.280
Neg logit contributions
 apologizing-0.340
 recomp-0.326
 challeng-0.322
aign-0.318
 forgiven-0.316
Token
Token ID198
Feature activation-5.68e-02

Pos logit contributions
 apologizing+0.071
 recomp+0.068
 challeng+0.067
aign+0.066
 forgiven+0.065
Neg logit contributions
 Advertisement-0.078
 Kitt-0.075
bryce-0.074
Reviewer-0.066
cars-0.065
Token
Token ID198
Feature activation-1.51e-01

Pos logit contributions
 Advertisement+0.053
 Kitt+0.051
bryce+0.051
Reviewer+0.046
 Carbuncle+0.045
Neg logit contributions
 apologizing-0.032
 recomp-0.030
aign-0.030
 forgiven-0.029
 pardon-0.028
TokenArticle
Token ID14906
Feature activation+2.78e-01

Pos logit contributions
 Advertisement+0.103
 Kitt+0.099
bryce+0.097
Reviewer+0.085
 Carbuncle+0.083
Neg logit contributions
 apologizing-0.122
 recomp-0.117
 challeng-0.116
aign-0.115
 forgiven-0.114
Token Continued
Token ID22168
Feature activation+2.78e+00

Pos logit contributions
 apologizing+0.159
 recomp+0.149
 challeng+0.145
aign+0.145
 forgiven+0.143
Neg logit contributions
 Advertisement-0.254
 Kitt-0.247
bryce-0.246
Reviewer-0.227
 Carbuncle-0.217
Token Below
Token ID10383
Feature activation+1.94e-01

Pos logit contributions
 apologizing+1.398
 recomp+1.267
aign+1.262
 forgiven+1.260
 reload+1.227
Neg logit contributions
bryce-2.655
 Kitt-2.619
 Advertisement-2.588
Reviewer-2.498
 Carbuncle-2.349
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
 apologizing+0.108
 recomp+0.101
aign+0.100
 forgiven+0.098
 reload+0.096
Neg logit contributions
 Advertisement-0.178
 Kitt-0.175
bryce-0.174
Reviewer-0.158
 Carbuncle-0.154
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
 apologizing+0.085
 recomp+0.079
aign+0.078
 forgiven+0.077
 reload+0.075
Neg logit contributions
 Advertisement-0.141
 Kitt-0.137
bryce-0.136
Reviewer-0.124
 Carbuncle-0.121
TokenThe
Token ID464
Feature activation-1.82e-01

Pos logit contributions
 apologizing+0.085
 recomp+0.082
 challeng+0.081
aign+0.080
 forgiven+0.079
Neg logit contributions
 Advertisement-0.076
 Kitt-0.073
bryce-0.071
Reviewer-0.062
 Carbuncle-0.061
Token T
Token ID309
Feature activation-2.07e-01

Pos logit contributions
 Advertisement+0.144
 Kitt+0.139
bryce+0.137
Reviewer+0.122
cars+0.119
Neg logit contributions
 apologizing-0.128
 recomp-0.122
 challeng-0.122
aign-0.119
 forgiven-0.118
Token,
Token ID11
Feature activation-1.01e-01

Pos logit contributions
 apologizing+1.044
 recomp+0.998
 challeng+0.987
aign+0.974
 forgiven+0.964
Neg logit contributions
 Advertisement-1.034
 Kitt-0.995
bryce-0.986
Reviewer-0.871
cars-0.843
Token
Token ID447
Feature activation+2.33e-01

Pos logit contributions
 Advertisement+0.081
 Kitt+0.078
bryce+0.076
Reviewer+0.068
cars+0.066
Neg logit contributions
 apologizing-0.071
 recomp-0.068
 challeng-0.068
aign-0.067
 helicop-0.066
Token
Token ID251
Feature activation-1.03e-02

Pos logit contributions
 apologizing+0.168
 recomp+0.160
 challeng+0.159
aign+0.157
 forgiven+0.155
Neg logit contributions
 Advertisement-0.181
 Kitt-0.174
bryce-0.171
Reviewer-0.152
cars-0.149
Token La
Token ID4689
Feature activation-2.67e-01

Pos logit contributions
 Advertisement+0.009
 Kitt+0.008
bryce+0.008
Reviewer+0.007
cars+0.007
Neg logit contributions
 apologizing-0.007
 recomp-0.006
 challeng-0.006
aign-0.006
 helicop-0.006
TokenR
Token ID49
Feature activation+6.57e-01

Pos logit contributions
 Advertisement+0.188
 Kitt+0.181
bryce+0.178
Reviewer+0.156
 Carbuncle+0.152
Neg logit contributions
 apologizing-0.211
 recomp-0.202
aign-0.199
 challeng-0.198
 forgiven-0.196
Tokenoche
Token ID30848
Feature activation+2.69e+00

Pos logit contributions
 apologizing+0.185
 recomp+0.163
aign+0.159
 forgiven+0.151
 challeng+0.150
Neg logit contributions
 Advertisement-0.799
 Kitt-0.781
bryce-0.769
Reviewer-0.720
 Carbuncle-0.712
Token said
Token ID531
Feature activation-1.41e-01

Pos logit contributions
 apologizing+1.829
 recomp+1.727
 challeng+1.718
aign+1.688
 forgiven+1.681
Neg logit contributions
 Advertisement-2.203
 Kitt-2.126
bryce-2.104
Reviewer-1.888
cars-1.841
Token.
Token ID13
Feature activation+2.64e-02

Pos logit contributions
 Advertisement+0.108
 Kitt+0.104
bryce+0.102
Reviewer+0.090
cars+0.088
Neg logit contributions
 apologizing-0.103
 recomp-0.098
 challeng-0.098
aign-0.097
 forgiven-0.095
Token
Token ID564
Feature activation+1.43e-01

Pos logit contributions
 apologizing+0.019
 recomp+0.018
 challeng+0.018
aign+0.018
 forgiven+0.018
Neg logit contributions
 Advertisement-0.020
 Kitt-0.019
bryce-0.019
Reviewer-0.017
cars-0.017
Token
Token ID250
Feature activation-6.86e-02

Pos logit contributions
 apologizing+0.093
 recomp+0.089
 challeng+0.087
aign+0.087
 forgiven+0.086
Neg logit contributions
 Advertisement-0.120
 Kitt-0.116
bryce-0.115
Reviewer-0.103
cars-0.101
TokenSelf
Token ID24704
Feature activation-9.27e-01

Pos logit contributions
 Advertisement+0.052
 Kitt+0.050
bryce+0.049
Reviewer+0.043
cars+0.042
Neg logit contributions
 apologizing-0.051
 recomp-0.049
 challeng-0.049
aign-0.048
 forgiven-0.047
Token My
Token ID2011
Feature activation+4.57e-01

Pos logit contributions
 Advertisement+0.310
 Kitt+0.298
bryce+0.289
Reviewer+0.260
cars+0.255
Neg logit contributions
 apologizing-0.235
 challeng-0.229
 recomp-0.226
aign-0.221
 helicop-0.221
Token eyes
Token ID2951
Feature activation+1.65e+00

Pos logit contributions
 apologizing+0.308
 recomp+0.294
 challeng+0.294
aign+0.289
 helicop+0.285
Neg logit contributions
 Advertisement-0.375
 Kitt-0.362
bryce-0.355
Reviewer-0.317
cars-0.311
Token are
Token ID389
Feature activation+8.46e-01

Pos logit contributions
 apologizing+1.062
 recomp+1.009
 challeng+0.999
aign+0.975
 forgiven+0.971
Neg logit contributions
 Advertisement-1.421
 Kitt-1.370
bryce-1.357
Reviewer-1.222
cars-1.184
Token still
Token ID991
Feature activation+8.20e-01

Pos logit contributions
 apologizing+0.499
 recomp+0.470
 challeng+0.470
aign+0.462
 helicop+0.452
Neg logit contributions
 Advertisement-0.766
 Kitt-0.742
bryce-0.731
Reviewer-0.661
cars-0.648
Token wide
Token ID3094
Feature activation+2.11e+00

Pos logit contributions
 apologizing+0.479
 recomp+0.451
 challeng+0.450
aign+0.441
 helicop+0.433
Neg logit contributions
 Advertisement-0.748
 Kitt-0.725
bryce-0.714
Reviewer-0.647
cars-0.635
Token open
Token ID1280
Feature activation+2.69e+00

Pos logit contributions
 apologizing+1.332
 recomp+1.258
 challeng+1.255
aign+1.216
 forgiven+1.213
Neg logit contributions
 Advertisement-1.848
 Kitt-1.778
bryce-1.762
Reviewer-1.590
 Carbuncle-1.546
Token.
Token ID13
Feature activation+7.69e-01

Pos logit contributions
 apologizing+1.963
 recomp+1.856
 challeng+1.852
 forgiven+1.800
aign+1.797
Neg logit contributions
 Advertisement-2.098
 Kitt-1.999
bryce-1.989
Reviewer-1.772
cars-1.711
Token I
Token ID314
Feature activation+1.02e+00

Pos logit contributions
 apologizing+0.547
 recomp+0.520
 challeng+0.517
aign+0.509
 forgiven+0.503
Neg logit contributions
 Advertisement-0.605
 Kitt-0.582
bryce-0.574
Reviewer-0.511
cars-0.498
Token wasn
Token ID2492
Feature activation+1.18e+00

Pos logit contributions
 apologizing+0.671
 recomp+0.637
 challeng+0.630
aign+0.615
 forgiven+0.614
Neg logit contributions
 Advertisement-0.860
 Kitt-0.829
bryce-0.819
Reviewer-0.739
cars-0.716
Token't
Token ID470
Feature activation+2.57e-01

Pos logit contributions
 apologizing+0.893
 recomp+0.851
aign+0.839
 forgiven+0.834
 reload+0.820
Neg logit contributions
 Advertisement-0.866
 Kitt-0.834
bryce-0.822
Reviewer-0.728
 Carbuncle-0.714
Token expecting
Token ID12451
Feature activation+8.24e-01

Pos logit contributions
 apologizing+0.138
 recomp+0.129
 challeng+0.129
aign+0.126
 forgiven+0.124
Neg logit contributions
 Advertisement-0.247
 Kitt-0.240
bryce-0.236
Reviewer-0.216
cars-0.211
Token last
Token ID938
Feature activation-3.03e-01

Pos logit contributions
 Advertisement+0.232
 Kitt+0.223
bryce+0.218
Reviewer+0.193
cars+0.189
Neg logit contributions
 apologizing-0.224
 challeng-0.215
 recomp-0.214
aign-0.212
 helicop-0.208
Token.
Token ID13
Feature activation+2.12e-01

Pos logit contributions
 Advertisement+0.238
 Kitt+0.229
bryce+0.223
Reviewer+0.199
cars+0.196
Neg logit contributions
 apologizing-0.214
 challeng-0.204
 recomp-0.204
aign-0.202
 helicop-0.198
Token
Token ID198
Feature activation+1.10e-01

Pos logit contributions
 apologizing+0.146
 recomp+0.138
 challeng+0.136
aign+0.136
 forgiven+0.134
Neg logit contributions
 Advertisement-0.171
 Kitt-0.165
bryce-0.164
Reviewer-0.146
cars-0.143
Token
Token ID198
Feature activation-4.32e-02

Pos logit contributions
 apologizing+0.062
 recomp+0.058
aign+0.057
 forgiven+0.056
 pardon+0.055
Neg logit contributions
 Advertisement-0.102
 Kitt-0.099
bryce-0.098
Reviewer-0.089
 Carbuncle-0.087
TokenArticle
Token ID14906
Feature activation-8.96e-01

Pos logit contributions
 Advertisement+0.033
 Kitt+0.032
bryce+0.031
Reviewer+0.028
cars+0.027
Neg logit contributions
 apologizing-0.031
 recomp-0.030
 challeng-0.030
aign-0.029
 forgiven-0.029
Token Continued
Token ID22168
Feature activation+2.69e+00

Pos logit contributions
 Advertisement+0.733
 Kitt+0.706
bryce+0.695
Reviewer+0.623
cars+0.607
Neg logit contributions
 apologizing-0.610
 recomp-0.580
 challeng-0.578
aign-0.567
 forgiven-0.560
Token Below
Token ID10383
Feature activation+2.57e-01

Pos logit contributions
 apologizing+1.384
 recomp+1.246
 forgiven+1.231
aign+1.230
 reload+1.212
Neg logit contributions
bryce-2.538
 Kitt-2.518
 Advertisement-2.501
Reviewer-2.390
 Carbuncle-2.254
Token
Token ID198
Feature activation+1.98e-01

Pos logit contributions
 apologizing+0.143
 recomp+0.133
aign+0.132
 forgiven+0.129
 reload+0.126
Neg logit contributions
 Advertisement-0.237
 Kitt-0.232
bryce-0.231
Reviewer-0.211
 Carbuncle-0.205
Token
Token ID198
Feature activation+1.66e-01

Pos logit contributions
 apologizing+0.111
 recomp+0.104
aign+0.102
 forgiven+0.100
 apologized+0.098
Neg logit contributions
 Advertisement-0.184
 Kitt-0.179
bryce-0.178
Reviewer-0.162
 Carbuncle-0.158
TokenT
Token ID51
Feature activation-5.67e-01

Pos logit contributions
 apologizing+0.129
 recomp+0.123
 challeng+0.121
aign+0.121
 forgiven+0.119
Neg logit contributions
 Advertisement-0.119
 Kitt-0.115
bryce-0.112
Reviewer-0.098
 Carbuncle-0.097
Tokenod
Token ID375
Feature activation-1.62e-01

Pos logit contributions
 Advertisement+0.353
 Kitt+0.335
bryce+0.328
Reviewer+0.282
cars+0.273
Neg logit contributions
 apologizing-0.497
 recomp-0.479
 challeng-0.478
aign-0.469
 forgiven-0.466
Token game
Token ID983
Feature activation+1.27e-01

Pos logit contributions
 apologizing+0.013
 challeng+0.013
 recomp+0.013
aign+0.013
 helicop+0.012
Neg logit contributions
 Advertisement-0.018
 Kitt-0.017
bryce-0.017
Reviewer-0.015
cars-0.015
Token a
Token ID257
Feature activation+2.69e-02

Pos logit contributions
 apologizing+0.100
 recomp+0.096
 challeng+0.095
aign+0.094
 forgiven+0.093
Neg logit contributions
 Advertisement-0.091
 Kitt-0.087
bryce-0.085
Reviewer-0.075
cars-0.073
Token '
Token ID705
Feature activation+1.22e+00

Pos logit contributions
 apologizing+0.018
 challeng+0.017
 recomp+0.017
aign+0.017
 helicop+0.017
Neg logit contributions
 Advertisement-0.022
 Kitt-0.021
bryce-0.021
Reviewer-0.019
 Carbuncle-0.018
TokenTer
Token ID15156
Feature activation-5.44e-01

Pos logit contributions
 apologizing+0.949
 recomp+0.907
 challeng+0.904
aign+0.890
 forgiven+0.881
Neg logit contributions
 Advertisement-0.882
 Kitt-0.845
bryce-0.829
Reviewer-0.727
cars-0.710
Tokenrible
Token ID5547
Feature activation+1.64e-01

Pos logit contributions
 Advertisement+0.470
 Kitt+0.453
bryce+0.447
Reviewer+0.402
cars+0.393
Neg logit contributions
 apologizing-0.347
 recomp-0.328
 challeng-0.328
aign-0.319
 helicop-0.316
Token'
Token ID6
Feature activation+2.66e+00

Pos logit contributions
 apologizing+0.122
 challeng+0.117
 recomp+0.117
aign+0.115
 helicop+0.114
Neg logit contributions
 Advertisement-0.124
 Kitt-0.118
bryce-0.115
Reviewer-0.103
cars-0.100
Token score
Token ID4776
Feature activation-1.75e+00

Pos logit contributions
 apologizing+1.611
 recomp+1.524
 challeng+1.497
aign+1.465
 forgiven+1.453
Neg logit contributions
 Advertisement-2.357
 Kitt-2.292
bryce-2.282
Reviewer-2.038
cars-2.021
Token of
Token ID286
Feature activation+7.98e-02

Pos logit contributions
 Advertisement+1.234
 Kitt+1.170
bryce+1.140
Reviewer+1.012
cars+0.980
Neg logit contributions
 apologizing-1.377
 challeng-1.340
 recomp-1.323
aign-1.304
 helicop-1.298
Token 2
Token ID362
Feature activation-1.08e-01

Pos logit contributions
 apologizing+0.055
 recomp+0.053
 challeng+0.052
aign+0.051
 forgiven+0.051
Neg logit contributions
 Advertisement-0.064
 Kitt-0.062
bryce-0.061
Reviewer-0.055
cars-0.053
Token.
Token ID13
Feature activation-2.01e-01

Pos logit contributions
 Advertisement+0.080
 Kitt+0.077
bryce+0.076
Reviewer+0.067
cars+0.065
Neg logit contributions
 apologizing-0.082
 recomp-0.078
 challeng-0.078
aign-0.077
 forgiven-0.076
Token0
Token ID15
Feature activation+3.00e-01

Pos logit contributions
 Advertisement+0.150
 Kitt+0.144
bryce+0.141
Reviewer+0.125
cars+0.121
Neg logit contributions
 apologizing-0.152
 recomp-0.145
 challeng-0.145
aign-0.142
 forgiven-0.141
Tokenoor
Token ID2675
Feature activation+9.98e-01

Pos logit contributions
 apologizing+0.317
 recomp+0.303
 challeng+0.302
aign+0.297
 forgiven+0.293
Neg logit contributions
 Advertisement-0.313
 Kitt-0.300
bryce-0.296
Reviewer-0.262
cars-0.254
Token,
Token ID11
Feature activation+7.63e-01

Pos logit contributions
 apologizing+0.857
 recomp+0.823
 challeng+0.819
aign+0.815
 forgiven+0.801
Neg logit contributions
 Advertisement-0.634
 Kitt-0.602
bryce-0.593
Reviewer-0.514
 Carbuncle-0.494
Token S
Token ID311
Feature activation+1.33e-01

Pos logit contributions
 apologizing+0.586
 recomp+0.561
 challeng+0.549
aign+0.548
 forgiven+0.543
Neg logit contributions
 Advertisement-0.555
 Kitt-0.528
bryce-0.528
Reviewer-0.467
cars-0.450
Token.,
Token ID1539
Feature activation+1.72e+00

Pos logit contributions
 apologizing+0.087
 recomp+0.082
 challeng+0.082
aign+0.080
 forgiven+0.080
Neg logit contributions
 Advertisement-0.112
 Kitt-0.109
bryce-0.107
Reviewer-0.096
cars-0.094
Token J
Token ID449
Feature activation+9.22e-01

Pos logit contributions
 apologizing+1.331
 recomp+1.276
aign+1.258
 challeng+1.250
 forgiven+1.235
Neg logit contributions
 Advertisement-1.230
bryce-1.174
 Kitt-1.157
Reviewer-1.046
cars-0.999
Tokenaff
Token ID2001
Feature activation+2.65e+00

Pos logit contributions
 apologizing+0.689
 recomp+0.658
 challeng+0.654
aign+0.652
 forgiven+0.639
Neg logit contributions
 Advertisement-0.690
 Kitt-0.660
bryce-0.653
Reviewer-0.579
cars-0.559
Tokenee
Token ID1453
Feature activation+1.49e+00

Pos logit contributions
 apologizing+1.906
 recomp+1.849
aign+1.839
 challeng+1.797
 forgiven+1.777
Neg logit contributions
 Advertisement-2.034
 Kitt-1.954
bryce-1.923
Reviewer-1.713
 Carbuncle-1.659
Token,
Token ID11
Feature activation+8.82e-01

Pos logit contributions
 apologizing+1.274
 recomp+1.227
 challeng+1.216
aign+1.213
 forgiven+1.190
Neg logit contributions
 Advertisement-0.956
 Kitt-0.909
bryce-0.893
Reviewer-0.776
cars-0.745
Token S
Token ID311
Feature activation+3.51e-01

Pos logit contributions
 apologizing+0.673
 recomp+0.644
aign+0.628
 challeng+0.626
 forgiven+0.624
Neg logit contributions
 Advertisement-0.645
bryce-0.615
 Kitt-0.613
Reviewer-0.546
cars-0.524
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
 apologizing+0.223
 recomp+0.211
 challeng+0.211
aign+0.205
 forgiven+0.204
Neg logit contributions
 Advertisement-0.303
 Kitt-0.293
bryce-0.288
Reviewer-0.259
cars-0.254
Token R
Token ID371
Feature activation+1.40e+00

Pos logit contributions
 apologizing+0.814
 recomp+0.779
aign+0.761
 challeng+0.754
 forgiven+0.753
Neg logit contributions
 Advertisement-0.776
bryce-0.740
 Kitt-0.738
Reviewer-0.660
cars-0.631
Token mess
Token ID2085
Feature activation-3.51e-01

Pos logit contributions
 apologizing+0.773
 recomp+0.736
 challeng+0.731
aign+0.721
 forgiven+0.710
Neg logit contributions
 Advertisement-0.825
 Kitt-0.797
bryce-0.787
Reviewer-0.699
 Carbuncle-0.678
Token seriously
Token ID6411
Feature activation-1.01e-01

Pos logit contributions
 Advertisement+0.314
 Kitt+0.303
bryce+0.299
Reviewer+0.270
cars+0.265
Neg logit contributions
 apologizing-0.213
 recomp-0.201
 challeng-0.201
aign-0.195
 forgiven-0.193
Token
Token ID198
Feature activation-5.39e-01

Pos logit contributions
 Advertisement+0.080
 Kitt+0.077
bryce+0.076
Reviewer+0.067
cars+0.066
Neg logit contributions
 apologizing-0.072
 recomp-0.069
 challeng-0.069
aign-0.067
 forgiven-0.067
Token
Token ID198
Feature activation-5.82e-01

Pos logit contributions
 Advertisement+0.497
 Kitt+0.482
bryce+0.477
Reviewer+0.433
 Carbuncle+0.423
Neg logit contributions
 apologizing-0.310
 recomp-0.290
aign-0.285
 forgiven-0.280
 challeng-0.277
TokenArticle
Token ID14906
Feature activation-9.76e-02

Pos logit contributions
 Advertisement+0.343
 Kitt+0.327
bryce+0.319
Reviewer+0.270
 Carbuncle+0.264
Neg logit contributions
 apologizing-0.528
 recomp-0.507
 challeng-0.504
aign-0.501
 forgiven-0.495
Token Continued
Token ID22168
Feature activation+2.65e+00

Pos logit contributions
 Advertisement+0.091
bryce+0.089
 Kitt+0.089
Reviewer+0.082
 Carbuncle+0.078
Neg logit contributions
 apologizing-0.054
 recomp-0.050
aign-0.049
 forgiven-0.048
 apologized-0.048
Token Below
Token ID10383
Feature activation+1.58e-01

Pos logit contributions
 apologizing+1.314
 recomp+1.186
 forgiven+1.179
aign+1.177
 reload+1.153
Neg logit contributions
bryce-2.550
 Kitt-2.506
 Advertisement-2.464
Reviewer-2.383
 Carbuncle-2.253
Token
Token ID198
Feature activation-1.34e-01

Pos logit contributions
 apologizing+0.088
 recomp+0.082
aign+0.081
 forgiven+0.079
 reload+0.078
Neg logit contributions
 Advertisement-0.145
 Kitt-0.142
bryce-0.142
Reviewer-0.129
 Carbuncle-0.126
Token
Token ID198
Feature activation-1.25e-01

Pos logit contributions
 Advertisement+0.124
 Kitt+0.121
bryce+0.120
Reviewer+0.109
 Carbuncle+0.106
Neg logit contributions
 apologizing-0.075
 recomp-0.071
aign-0.069
 forgiven-0.068
 apologized-0.067
Token
Token ID447
Feature activation-6.33e-01

Pos logit contributions
 Advertisement+0.088
 Kitt+0.085
bryce+0.083
Reviewer+0.073
 Carbuncle+0.071
Neg logit contributions
 apologizing-0.099
 recomp-0.095
 challeng-0.094
aign-0.094
 forgiven-0.092
Token
Token ID250
Feature activation-2.73e-01

Pos logit contributions
 Advertisement+0.497
 Kitt+0.477
bryce+0.469
Reviewer+0.418
cars+0.407
Neg logit contributions
 apologizing-0.454
 recomp-0.432
 challeng-0.431
aign-0.423
 forgiven-0.418
Token browser
Token ID6444
Feature activation+1.43e+00

Pos logit contributions
 apologizing+0.245
 recomp+0.233
 challeng+0.233
aign+0.228
 forgiven+0.225
Neg logit contributions
 Advertisement-0.289
 Kitt-0.278
bryce-0.274
Reviewer-0.245
cars-0.238
Token is
Token ID318
Feature activation+4.98e-01

Pos logit contributions
 apologizing+1.145
 recomp+1.106
 challeng+1.083
aign+1.065
 forgiven+1.064
Neg logit contributions
 Advertisement-0.991
bryce-0.947
 Kitt-0.947
Reviewer-0.837
cars-0.800
Token '
Token ID705
Feature activation+1.66e+00

Pos logit contributions
 apologizing+0.340
 recomp+0.323
 challeng+0.322
aign+0.316
 forgiven+0.312
Neg logit contributions
 Advertisement-0.407
 Kitt-0.392
bryce-0.386
Reviewer-0.346
cars-0.337
Tokenholding
Token ID19216
Feature activation-1.80e-01

Pos logit contributions
 apologizing+1.155
 recomp+1.097
 challeng+1.093
aign+1.074
 forgiven+1.062
Neg logit contributions
 Advertisement-1.330
 Kitt-1.281
bryce-1.257
Reviewer-1.123
 Carbuncle-1.101
Token back
Token ID736
Feature activation+5.64e-01

Pos logit contributions
 Advertisement+0.147
 Kitt+0.142
bryce+0.139
Reviewer+0.124
cars+0.122
Neg logit contributions
 apologizing-0.123
 recomp-0.117
 challeng-0.117
aign-0.114
 forgiven-0.113
Token'
Token ID6
Feature activation+2.64e+00

Pos logit contributions
 apologizing+0.432
 recomp+0.413
 challeng+0.412
aign+0.406
 forgiven+0.401
Neg logit contributions
 Advertisement-0.414
 Kitt-0.397
bryce-0.389
Reviewer-0.342
cars-0.334
Token the
Token ID262
Feature activation+2.91e-01

Pos logit contributions
 apologizing+1.876
 recomp+1.797
 challeng+1.775
 forgiven+1.731
aign+1.730
Neg logit contributions
 Advertisement-2.058
bryce-1.980
 Kitt-1.980
Reviewer-1.769
 Carbuncle-1.702
Token web
Token ID3992
Feature activation-3.73e-01

Pos logit contributions
 apologizing+0.195
 recomp+0.186
 challeng+0.185
aign+0.181
 forgiven+0.179
Neg logit contributions
 Advertisement-0.241
 Kitt-0.232
bryce-0.230
Reviewer-0.206
cars-0.201
Token A
Token ID317
Feature activation+2.52e-01

Pos logit contributions
 Advertisement+0.291
 Kitt+0.280
bryce+0.275
Reviewer+0.245
cars+0.238
Neg logit contributions
 apologizing-0.269
 recomp-0.256
 challeng-0.255
aign-0.251
 forgiven-0.248
Token Downing
Token ID31355
Feature activation-2.55e-01

Pos logit contributions
 apologizing+0.172
 recomp+0.163
 challeng+0.162
aign+0.160
 forgiven+0.158
Neg logit contributions
 Advertisement-0.207
 Kitt-0.200
bryce-0.196
Reviewer-0.175
cars-0.171
Token Street
Token ID3530
Feature activation+9.11e-01

Pos logit contributions
 Advertisement+0.186
 Kitt+0.178
bryce+0.175
Reviewer+0.155
cars+0.150
Neg logit contributions
 apologizing-0.196
 recomp-0.187
 challeng-0.187
aign-0.184
 forgiven-0.182
Token report
Token ID989
Feature activation+6.89e-01

Pos logit contributions
 apologizing+0.028
 recomp+0.027
 challeng+0.027
aign+0.026
 forgiven+0.026
Neg logit contributions
 Advertisement-0.034
 Kitt-0.033
bryce-0.032
Reviewer-0.029
cars-0.028
Token by
Token ID416
Feature activation+1.11e-01

Pos logit contributions
 apologizing+0.510
 recomp+0.484
 challeng+0.480
aign+0.474
 forgiven+0.469
Neg logit contributions
 Advertisement-0.524
bryce-0.502
 Kitt-0.501
Reviewer-0.438
cars-0.430
Token the
Token ID262
Feature activation-5.87e-02

Pos logit contributions
 apologizing+0.086
 recomp+0.083
 challeng+0.082
aign+0.081
 forgiven+0.080
Neg logit contributions
 Advertisement-0.079
 Kitt-0.076
bryce-0.075
Reviewer-0.066
cars-0.064
Token Senate
Token ID3845
Feature activation-2.73e-02

Pos logit contributions
 Advertisement+0.042
 Kitt+0.040
bryce+0.040
Reviewer+0.035
cars+0.034
Neg logit contributions
 apologizing-0.046
 recomp-0.044
 challeng-0.044
aign-0.043
 forgiven-0.043
Token Permanent
Token ID36814
Feature activation-1.34e+00

Pos logit contributions
 Advertisement+0.020
 Kitt+0.019
bryce+0.019
Reviewer+0.017
cars+0.016
Neg logit contributions
 apologizing-0.021
 recomp-0.020
 challeng-0.020
aign-0.019
 forgiven-0.019
Token Subcommittee
Token ID42187
Feature activation+2.62e+00

Pos logit contributions
 Advertisement+1.269
 Kitt+1.231
bryce+1.204
Reviewer+1.101
cars+1.081
Neg logit contributions
 apologizing-0.733
 recomp-0.686
 challeng-0.685
aign-0.671
 helicop-0.659
Token on
Token ID319
Feature activation+1.68e-01

Pos logit contributions
 apologizing+1.834
 recomp+1.743
aign+1.721
 challeng+1.719
 helicop+1.695
Neg logit contributions
 Advertisement-2.079
 Kitt-2.008
bryce-2.005
Reviewer-1.766
cars-1.731
Token Investigations
Token ID38111
Feature activation+3.51e-01

Pos logit contributions
 apologizing+0.094
 recomp+0.088
 challeng+0.087
aign+0.086
 forgiven+0.085
Neg logit contributions
 Advertisement-0.157
bryce-0.152
 Kitt-0.151
Reviewer-0.137
cars-0.133
Token found
Token ID1043
Feature activation+1.72e-01

Pos logit contributions
 apologizing+0.235
 recomp+0.223
 challeng+0.221
aign+0.217
 forgiven+0.216
Neg logit contributions
 Advertisement-0.293
 Kitt-0.282
bryce-0.280
Reviewer-0.250
cars-0.244
Token,
Token ID11
Feature activation+1.91e-01

Pos logit contributions
 apologizing+0.139
 recomp+0.134
 challeng+0.133
aign+0.130
 forgiven+0.130
Neg logit contributions
 Advertisement-0.118
 Kitt-0.113
bryce-0.111
Reviewer-0.097
cars-0.094
Token the
Token ID262
Feature activation+4.47e-02

Pos logit contributions
 apologizing+0.145
 recomp+0.138
 challeng+0.138
aign+0.135
 forgiven+0.134
Neg logit contributions
 Advertisement-0.140
 Kitt-0.135
bryce-0.134
Reviewer-0.118
cars-0.115
Token of
Token ID286
Feature activation+8.41e-02

Pos logit contributions
 Advertisement+1.341
 Kitt+1.287
bryce+1.265
Reviewer+1.122
cars+1.093
Neg logit contributions
 apologizing-1.299
 recomp-1.239
 challeng-1.238
aign-1.214
 forgiven-1.200
Token the
Token ID262
Feature activation+1.81e-01

Pos logit contributions
 apologizing+0.066
 recomp+0.063
 challeng+0.063
aign+0.061
 forgiven+0.061
Neg logit contributions
 Advertisement-0.061
 Kitt-0.058
bryce-0.057
Reviewer-0.051
cars-0.049
Token '
Token ID705
Feature activation+1.62e+00

Pos logit contributions
 apologizing+0.130
 recomp+0.124
 challeng+0.123
aign+0.121
 forgiven+0.120
Neg logit contributions
 Advertisement-0.142
 Kitt-0.136
bryce-0.136
Reviewer-0.121
cars-0.116
Tokenb
Token ID65
Feature activation-4.04e-01

Pos logit contributions
 apologizing+1.281
 recomp+1.221
 challeng+1.221
aign+1.213
 forgiven+1.193
Neg logit contributions
 Advertisement-1.132
 Kitt-1.078
bryce-1.062
Reviewer-0.917
 Carbuncle-0.912
Tokenanned
Token ID3577
Feature activation+6.17e-01

Pos logit contributions
 Advertisement+0.299
 Kitt+0.287
bryce+0.282
Reviewer+0.249
cars+0.243
Neg logit contributions
 apologizing-0.307
 recomp-0.293
 challeng-0.293
aign-0.287
 forgiven-0.284
Token'
Token ID6
Feature activation+2.59e+00

Pos logit contributions
 apologizing+0.470
 recomp+0.449
 challeng+0.448
aign+0.440
 forgiven+0.435
Neg logit contributions
 Advertisement-0.456
 Kitt-0.437
bryce-0.430
Reviewer-0.380
cars-0.369
Token Catalan
Token ID31066
Feature activation+3.97e-01

Pos logit contributions
 apologizing+1.800
 recomp+1.724
 challeng+1.695
aign+1.661
 pardon+1.661
Neg logit contributions
 Advertisement-2.067
bryce-1.997
 Kitt-1.969
Reviewer-1.780
 Carbuncle-1.700
Token independence
Token ID10404
Feature activation+7.30e-01

Pos logit contributions
 apologizing+0.302
 recomp+0.288
 challeng+0.287
aign+0.283
 forgiven+0.279
Neg logit contributions
 Advertisement-0.293
 Kitt-0.281
bryce-0.278
Reviewer-0.245
cars-0.237
Token referendum
Token ID11467
Feature activation+6.65e-01

Pos logit contributions
 apologizing+0.506
 recomp+0.482
 challeng+0.480
aign+0.471
 forgiven+0.465
Neg logit contributions
 Advertisement-0.588
 Kitt-0.566
bryce-0.558
Reviewer-0.498
cars-0.485
Token this
Token ID428
Feature activation+2.97e-01

Pos logit contributions
 apologizing+0.483
 recomp+0.459
 challeng+0.457
aign+0.450
 forgiven+0.445
Neg logit contributions
 Advertisement-0.513
 Kitt-0.494
bryce-0.492
Reviewer-0.436
 Carbuncle-0.420
Token Sunday
Token ID3502
Feature activation+9.17e-02

Pos logit contributions
 apologizing+0.197
 recomp+0.187
 challeng+0.187
aign+0.183
 forgiven+0.181
Neg logit contributions
 Advertisement-0.249
 Kitt-0.240
bryce-0.236
Reviewer-0.212
cars-0.207
Token was
Token ID373
Feature activation+9.69e-01

Pos logit contributions
 apologizing+0.851
 recomp+0.809
 challeng+0.807
aign+0.788
 forgiven+0.783
Neg logit contributions
 Advertisement-0.985
 Kitt-0.945
bryce-0.935
Reviewer-0.838
cars-0.813
Token left
Token ID1364
Feature activation+2.97e-01

Pos logit contributions
 apologizing+0.627
 recomp+0.594
 challeng+0.592
aign+0.581
 forgiven+0.573
Neg logit contributions
 Advertisement-0.826
 Kitt-0.797
bryce-0.785
Reviewer-0.706
cars-0.690
Token alone
Token ID3436
Feature activation+1.84e-01

Pos logit contributions
 apologizing+0.206
 recomp+0.196
 challeng+0.196
aign+0.194
 forgiven+0.190
Neg logit contributions
 Advertisement-0.238
 Kitt-0.229
bryce-0.225
Reviewer-0.200
cars-0.196
Token in
Token ID287
Feature activation+1.95e-01

Pos logit contributions
 apologizing+0.141
 recomp+0.135
 challeng+0.135
aign+0.132
 forgiven+0.131
Neg logit contributions
 Advertisement-0.134
 Kitt-0.129
bryce-0.126
Reviewer-0.111
cars-0.108
Token the
Token ID262
Feature activation+3.06e-01

Pos logit contributions
 apologizing+0.154
 recomp+0.147
 challeng+0.147
aign+0.145
 forgiven+0.143
Neg logit contributions
 Advertisement-0.138
 Kitt-0.132
bryce-0.130
Reviewer-0.114
cars-0.111
Token corridor
Token ID20749
Feature activation+2.58e+00

Pos logit contributions
 apologizing+0.196
 challeng+0.186
 recomp+0.186
aign+0.183
 helicop+0.180
Neg logit contributions
 Advertisement-0.261
 Kitt-0.254
bryce-0.247
Reviewer-0.223
 Carbuncle-0.218
Token,
Token ID11
Feature activation+4.48e-01

Pos logit contributions
 apologizing+1.996
 recomp+1.891
 challeng+1.882
 forgiven+1.836
aign+1.834
Neg logit contributions
 Advertisement-1.900
 Kitt-1.809
bryce-1.796
Reviewer-1.599
 Carbuncle-1.533
Token wondering
Token ID11263
Feature activation-9.07e-02

Pos logit contributions
 apologizing+0.297
 recomp+0.282
 challeng+0.282
aign+0.278
 helicop+0.273
Neg logit contributions
 Advertisement-0.373
 Kitt-0.361
bryce-0.354
Reviewer-0.316
cars-0.310
Token (
Token ID357
Feature activation+2.54e-01

Pos logit contributions
 Advertisement+0.071
 Kitt+0.068
bryce+0.067
Reviewer+0.060
cars+0.058
Neg logit contributions
 apologizing-0.065
 recomp-0.062
 challeng-0.062
aign-0.061
 forgiven-0.060
Tokendo
Token ID4598
Feature activation+8.87e-02

Pos logit contributions
 apologizing+0.189
 recomp+0.181
 challeng+0.180
aign+0.177
 forgiven+0.175
Neg logit contributions
 Advertisement-0.191
 Kitt-0.183
bryce-0.180
Reviewer-0.160
cars-0.155
Token captains
Token ID37551
Feature activation+8.05e-01

Pos logit contributions
 apologizing+0.071
 recomp+0.068
 challeng+0.068
aign+0.067
 forgiven+0.066
Neg logit contributions
 Advertisement-0.062
 Kitt-0.060
bryce-0.059
Reviewer-0.051
cars-0.050
Tokenard
Token ID446
Feature activation-2.41e-01

Pos logit contributions
 Advertisement+0.145
 Kitt+0.138
bryce+0.135
Reviewer+0.115
cars+0.111
Neg logit contributions
 apologizing-0.225
 recomp-0.216
 challeng-0.216
aign-0.213
 forgiven-0.211
Token,
Token ID11
Feature activation+2.61e-01

Pos logit contributions
 Advertisement+0.156
 Kitt+0.149
bryce+0.145
Reviewer+0.125
cars+0.122
Neg logit contributions
 apologizing-0.205
 recomp-0.197
 challeng-0.196
aign-0.193
 forgiven-0.192
Token a
Token ID257
Feature activation+3.62e-02

Pos logit contributions
 apologizing+0.206
 recomp+0.197
 challeng+0.197
aign+0.194
 forgiven+0.192
Neg logit contributions
 Advertisement-0.185
 Kitt-0.178
bryce-0.174
Reviewer-0.153
cars-0.149
Token post
Token ID1281
Feature activation+1.15e+00

Pos logit contributions
 apologizing+0.021
 recomp+0.020
 challeng+0.020
aign+0.020
 forgiven+0.019
Neg logit contributions
 Advertisement-0.033
 Kitt-0.032
bryce-0.031
Reviewer-0.028
cars-0.028
Tokendoctoral
Token ID44064
Feature activation+1.42e-01

Pos logit contributions
 apologizing+0.919
 recomp+0.879
 challeng+0.877
aign+0.864
 forgiven+0.854
Neg logit contributions
 Advertisement-0.804
 Kitt-0.769
bryce-0.754
Reviewer-0.662
cars-0.642
Token fellow
Token ID5891
Feature activation+2.56e+00

Pos logit contributions
 apologizing+0.107
 recomp+0.102
 challeng+0.102
aign+0.100
 helicop+0.099
Neg logit contributions
 Advertisement-0.106
 Kitt-0.102
bryce-0.100
Reviewer-0.088
 Carbuncle-0.086
Token in
Token ID287
Feature activation+1.86e-01

Pos logit contributions
 apologizing+2.012
 recomp+1.917
 challeng+1.903
aign+1.880
 forgiven+1.857
Neg logit contributions
 Advertisement-1.835
 Kitt-1.738
bryce-1.719
Reviewer-1.527
cars-1.484
Token Chang
Token ID22597
Feature activation+4.59e-01

Pos logit contributions
 apologizing+0.132
 recomp+0.126
 challeng+0.125
aign+0.123
 forgiven+0.121
Neg logit contributions
 Advertisement-0.146
 Kitt-0.140
bryce-0.138
Reviewer-0.124
cars-0.120
Token
Token ID447
Feature activation+2.15e-01

Pos logit contributions
 apologizing+0.332
 recomp+0.316
 challeng+0.315
aign+0.310
 forgiven+0.306
Neg logit contributions
 Advertisement-0.357
 Kitt-0.343
bryce-0.337
Reviewer-0.300
cars-0.292
Token
Token ID247
Feature activation+2.01e-02

Pos logit contributions
 apologizing+0.148
 recomp+0.140
aign+0.138
 challeng+0.138
 forgiven+0.136
Neg logit contributions
 Advertisement-0.175
 Kitt-0.168
bryce-0.166
Reviewer-0.149
cars-0.145
Tokens
Token ID82
Feature activation+1.66e-01

Pos logit contributions
 apologizing+0.017
 recomp+0.016
 challeng+0.016
aign+0.016
 forgiven+0.016
Neg logit contributions
 Advertisement-0.014
 Kitt-0.013
bryce-0.013
Reviewer-0.011
cars-0.011
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 apologizing+0.300
 recomp+0.285
 challeng+0.284
aign+0.279
 forgiven+0.276
Neg logit contributions
 Advertisement-0.368
 Kitt-0.354
bryce-0.349
Reviewer-0.313
cars-0.305
Token '
Token ID705
Feature activation+1.32e+00

Pos logit contributions
 apologizing+0.066
 challeng+0.062
 recomp+0.062
aign+0.061
 helicop+0.060
Neg logit contributions
 Advertisement-0.087
 Kitt-0.083
bryce-0.082
Reviewer-0.074
 Carbuncle-0.072
Tokenskin
Token ID20407
Feature activation-9.95e-01

Pos logit contributions
 apologizing+0.938
 recomp+0.894
 challeng+0.891
aign+0.876
 forgiven+0.867
Neg logit contributions
 Advertisement-1.045
 Kitt-1.006
bryce-0.986
Reviewer-0.878
 Carbuncle-0.859
Token-
Token ID12
Feature activation-3.22e-02

Pos logit contributions
 Advertisement+0.693
 Kitt+0.663
bryce+0.644
Reviewer+0.571
cars+0.556
Neg logit contributions
 apologizing-0.795
 challeng-0.760
 recomp-0.760
aign-0.750
 helicop-0.741
Tokendeep
Token ID22089
Feature activation-3.04e-01

Pos logit contributions
 Advertisement+0.024
 Kitt+0.023
bryce+0.023
Reviewer+0.020
cars+0.019
Neg logit contributions
 apologizing-0.024
 recomp-0.023
 challeng-0.023
aign-0.023
 forgiven-0.023
Token'
Token ID6
Feature activation+2.53e+00

Pos logit contributions
 Advertisement+0.226
 Kitt+0.216
bryce+0.211
Reviewer+0.186
 Carbuncle+0.182
Neg logit contributions
 apologizing-0.230
 challeng-0.220
 recomp-0.219
aign-0.217
 helicop-0.214
Token attribute
Token ID11688
Feature activation-7.28e-01

Pos logit contributions
 apologizing+1.618
 recomp+1.538
 challeng+1.525
aign+1.485
 forgiven+1.473
Neg logit contributions
 Advertisement-2.172
 Kitt-2.099
bryce-2.074
Reviewer-1.869
cars-1.828
Token that
Token ID326
Feature activation+1.40e-01

Pos logit contributions
 Advertisement+0.543
 Kitt+0.522
bryce+0.511
Reviewer+0.452
cars+0.441
Neg logit contributions
 apologizing-0.546
 challeng-0.522
 recomp-0.521
aign-0.514
 helicop-0.507
Token was
Token ID373
Feature activation-5.22e-02

Pos logit contributions
 apologizing+0.108
 recomp+0.104
 challeng+0.103
aign+0.101
 forgiven+0.101
Neg logit contributions
 Advertisement-0.102
 Kitt-0.097
bryce-0.096
Reviewer-0.085
cars-0.082
Token not
Token ID407
Feature activation+7.33e-03

Pos logit contributions
 Advertisement+0.044
 Kitt+0.042
bryce+0.042
Reviewer+0.038
cars+0.037
Neg logit contributions
 apologizing-0.034
 recomp-0.033
 challeng-0.032
aign-0.032
 forgiven-0.031
Token rooted
Token ID19459
Feature activation+1.45e+00

Pos logit contributions
 apologizing+0.005
 recomp+0.005
 challeng+0.004
aign+0.004
 forgiven+0.004
Neg logit contributions
 Advertisement-0.006
 Kitt-0.006
bryce-0.006
Reviewer-0.005
cars-0.005
Token (
Token ID357
Feature activation-8.48e-02

Pos logit contributions
 Advertisement+0.417
 Kitt+0.402
bryce+0.394
Reviewer+0.356
 Carbuncle+0.349
Neg logit contributions
 apologizing-0.293
 challeng-0.277
 recomp-0.276
aign-0.271
 helicop-0.268
Tokenold
Token ID727
Feature activation-1.15e+00

Pos logit contributions
 Advertisement+0.066
 Kitt+0.063
bryce+0.062
Reviewer+0.055
cars+0.054
Neg logit contributions
 apologizing-0.062
 challeng-0.059
 recomp-0.059
aign-0.057
 helicop-0.057
Token)
Token ID8
Feature activation-1.23e-01

Pos logit contributions
 Advertisement+0.764
 Kitt+0.729
bryce+0.702
Reviewer+0.612
 Carbuncle+0.601
Neg logit contributions
 apologizing-0.964
 challeng-0.931
 recomp-0.927
aign-0.910
 helicop-0.909
Token website
Token ID3052
Feature activation+1.95e-01

Pos logit contributions
 Advertisement+0.111
 Kitt+0.107
bryce+0.105
Reviewer+0.095
 Carbuncle+0.093
Neg logit contributions
 apologizing-0.074
 challeng-0.070
 recomp-0.070
aign-0.068
 helicop-0.067
Token.
Token ID13
Feature activation-3.08e-01

Pos logit contributions
 apologizing+0.140
 challeng+0.133
 recomp+0.133
aign+0.131
 helicop+0.129
Neg logit contributions
 Advertisement-0.153
 Kitt-0.147
bryce-0.144
Reviewer-0.129
cars-0.125
Token Nevertheless
Token ID15933
Feature activation-3.28e+00

Pos logit contributions
 Advertisement+0.253
 Kitt+0.243
bryce+0.237
Reviewer+0.214
cars+0.208
Neg logit contributions
 apologizing-0.210
 challeng-0.202
 recomp-0.200
aign-0.196
 helicop-0.195
Token we
Token ID356
Feature activation-6.64e-03

Pos logit contributions
 Advertisement+1.957
 Kitt+1.845
bryce+1.773
Reviewer+1.564
cars+1.475
Neg logit contributions
 apologizing-2.933
 challeng-2.851
 recomp-2.842
aign-2.805
 helicop-2.797
Token worked
Token ID3111
Feature activation+1.39e-01

Pos logit contributions
 Advertisement+0.005
 Kitt+0.005
bryce+0.005
Reviewer+0.004
cars+0.004
Neg logit contributions
 apologizing-0.005
 recomp-0.005
 challeng-0.005
aign-0.005
 forgiven-0.005
Token hard
Token ID1327
Feature activation-3.65e-01

Pos logit contributions
 apologizing+0.082
 recomp+0.077
 challeng+0.077
aign+0.076
 forgiven+0.074
Neg logit contributions
 Advertisement-0.126
 Kitt-0.122
bryce-0.120
Reviewer-0.109
cars-0.106
Token to
Token ID284
Feature activation+1.02e-02

Pos logit contributions
 Advertisement+0.266
 Kitt+0.255
bryce+0.251
Reviewer+0.221
cars+0.215
Neg logit contributions
 apologizing-0.281
 recomp-0.268
 challeng-0.267
aign-0.263
 forgiven-0.260
Token cover
Token ID3002
Feature activation-6.87e-01

Pos logit contributions
 apologizing+0.007
 recomp+0.006
 challeng+0.006
aign+0.006
 forgiven+0.006
Neg logit contributions
 Advertisement-0.009
 Kitt-0.008
bryce-0.008
Reviewer-0.007
cars-0.007
Tokenfired
Token ID26803
Feature activation-9.01e-01

Pos logit contributions
 apologizing+0.128
 recomp+0.119
 challeng+0.119
aign+0.116
 forgiven+0.114
Neg logit contributions
 Advertisement-0.249
 Kitt-0.241
bryce-0.238
Reviewer-0.217
cars-0.214
Token electricity
Token ID8744
Feature activation-2.10e-01

Pos logit contributions
 Advertisement+0.647
 Kitt+0.621
bryce+0.607
Reviewer+0.534
cars+0.521
Neg logit contributions
 apologizing-0.704
 challeng-0.672
 recomp-0.671
aign-0.661
 forgiven-0.653
Token generation
Token ID5270
Feature activation-2.84e-02

Pos logit contributions
 Advertisement+0.150
 Kitt+0.144
bryce+0.141
Reviewer+0.124
 Carbuncle+0.120
Neg logit contributions
 apologizing-0.165
 recomp-0.159
 challeng-0.157
aign-0.155
 forgiven-0.154
Token produces
Token ID11073
Feature activation-3.87e-01

Pos logit contributions
 Advertisement+0.017
 Kitt+0.016
bryce+0.016
Reviewer+0.014
 Carbuncle+0.013
Neg logit contributions
 apologizing-0.025
 recomp-0.024
 challeng-0.024
aign-0.024
 forgiven-0.024
Token more
Token ID517
Feature activation-3.30e-01

Pos logit contributions
 Advertisement+0.298
 Kitt+0.287
bryce+0.282
Reviewer+0.251
cars+0.244
Neg logit contributions
 apologizing-0.281
 recomp-0.268
 challeng-0.267
aign-0.262
 forgiven-0.260
Token lif
Token ID3868
Feature activation-3.23e+00

Pos logit contributions
 Advertisement+0.263
 Kitt+0.253
bryce+0.249
Reviewer+0.222
cars+0.216
Neg logit contributions
 apologizing-0.232
 recomp-0.221
 challeng-0.220
aign-0.216
 forgiven-0.214
Tokenecycle
Token ID47510
Feature activation-4.55e-01

Pos logit contributions
 Advertisement+2.610
 Kitt+2.495
bryce+2.459
Reviewer+2.206
cars+2.174
Neg logit contributions
 apologizing-2.236
 challeng-2.130
 recomp-2.104
aign-2.071
 helicop-2.054
Token greenhouse
Token ID16325
Feature activation-7.26e-01

Pos logit contributions
 Advertisement+0.356
 Kitt+0.342
bryce+0.337
Reviewer+0.300
cars+0.292
Neg logit contributions
 apologizing-0.327
 recomp-0.311
 challeng-0.310
aign-0.305
 forgiven-0.301
Token gas
Token ID3623
Feature activation-2.95e-01

Pos logit contributions
 Advertisement+0.679
 Kitt+0.655
bryce+0.645
Reviewer+0.587
cars+0.577
Neg logit contributions
 apologizing-0.411
 challeng-0.386
 recomp-0.384
aign-0.377
 helicop-0.371
Token emissions
Token ID8971
Feature activation-5.98e-01

Pos logit contributions
 Advertisement+0.213
 Kitt+0.204
bryce+0.201
Reviewer+0.177
cars+0.171
Neg logit contributions
 apologizing-0.229
 recomp-0.219
 challeng-0.218
aign-0.214
 forgiven-0.212
Token than
Token ID621
Feature activation-2.22e-01

Pos logit contributions
 Advertisement+0.469
 Kitt+0.452
bryce+0.445
Reviewer+0.396
 Carbuncle+0.386
Neg logit contributions
 apologizing-0.428
 recomp-0.408
 challeng-0.405
aign-0.398
 forgiven-0.395
Token submarines
Token ID34031
Feature activation+9.04e-02

Pos logit contributions
 Advertisement+0.339
 Kitt+0.326
bryce+0.318
Reviewer+0.280
cars+0.275
Neg logit contributions
 apologizing-0.355
 recomp-0.338
 challeng-0.337
aign-0.335
 helicop-0.329
Token would
Token ID561
Feature activation-2.56e-01

Pos logit contributions
 apologizing+0.077
 recomp+0.074
 challeng+0.074
aign+0.072
 forgiven+0.072
Neg logit contributions
 Advertisement-0.059
 Kitt-0.055
bryce-0.055
Reviewer-0.048
cars-0.046
Token have
Token ID423
Feature activation-6.43e-02

Pos logit contributions
 Advertisement+0.229
 Kitt+0.221
bryce+0.218
Reviewer+0.197
cars+0.193
Neg logit contributions
 apologizing-0.155
 recomp-0.146
 challeng-0.145
aign-0.142
 forgiven-0.140
Token extended
Token ID7083
Feature activation+1.44e-01

Pos logit contributions
 Advertisement+0.049
 Kitt+0.047
bryce+0.046
Reviewer+0.041
cars+0.040
Neg logit contributions
 apologizing-0.047
 recomp-0.045
 challeng-0.045
aign-0.044
 forgiven-0.044
Token service
Token ID2139
Feature activation-1.62e-01

Pos logit contributions
 apologizing+0.101
 recomp+0.096
 challeng+0.096
aign+0.094
 forgiven+0.093
Neg logit contributions
 Advertisement-0.116
 Kitt-0.111
bryce-0.110
Reviewer-0.098
cars-0.095
Token lif
Token ID3868
Feature activation-3.19e+00

Pos logit contributions
 Advertisement+0.130
 Kitt+0.125
bryce+0.123
Reviewer+0.110
cars+0.107
Neg logit contributions
 apologizing-0.113
 recomp-0.108
 challeng-0.108
aign-0.106
 forgiven-0.104
Tokenetimes
Token ID46874
Feature activation+5.73e-01

Pos logit contributions
 Advertisement+2.057
 Kitt+1.956
bryce+1.915
Reviewer+1.667
cars+1.630
Neg logit contributions
 apologizing-2.721
 challeng-2.621
 recomp-2.585
 helicop-2.545
aign-2.540
Token,
Token ID11
Feature activation-9.57e-02

Pos logit contributions
 apologizing+0.429
 recomp+0.410
 challeng+0.408
aign+0.399
 forgiven+0.398
Neg logit contributions
 Advertisement-0.430
 Kitt-0.413
bryce-0.408
Reviewer-0.361
cars-0.350
Token from
Token ID422
Feature activation-2.24e-01

Pos logit contributions
 Advertisement+0.077
 Kitt+0.075
bryce+0.073
Reviewer+0.066
cars+0.064
Neg logit contributions
 apologizing-0.066
 recomp-0.063
 challeng-0.063
aign-0.061
 forgiven-0.061
Token 25
Token ID1679
Feature activation-2.41e-02

Pos logit contributions
 Advertisement+0.170
 Kitt+0.163
bryce+0.160
Reviewer+0.142
cars+0.138
Neg logit contributions
 apologizing-0.167
 recomp-0.159
 challeng-0.159
aign-0.156
 forgiven-0.154
Token to
Token ID284
Feature activation-3.18e-02

Pos logit contributions
 Advertisement+0.018
 Kitt+0.017
bryce+0.017
Reviewer+0.015
 Carbuncle+0.014
Neg logit contributions
 apologizing-0.019
 recomp-0.018
 challeng-0.018
aign-0.017
 forgiven-0.017
Token have
Token ID423
Feature activation-4.07e-01

Pos logit contributions
 Advertisement+0.294
 Kitt+0.283
bryce+0.279
Reviewer+0.250
cars+0.244
Neg logit contributions
 apologizing-0.236
 recomp-0.224
 challeng-0.224
aign-0.219
 forgiven-0.216
Token been
Token ID587
Feature activation-5.66e-01

Pos logit contributions
 Advertisement+0.344
 Kitt+0.331
bryce+0.326
Reviewer+0.293
cars+0.287
Neg logit contributions
 apologizing-0.267
 recomp-0.252
 challeng-0.252
aign-0.248
 helicop-0.243
Token astonished
Token ID40962
Feature activation-3.36e-01

Pos logit contributions
 Advertisement+0.504
 Kitt+0.486
bryce+0.479
Reviewer+0.433
cars+0.425
Neg logit contributions
 apologizing-0.342
 challeng-0.324
 recomp-0.322
aign-0.318
 helicop-0.309
Token by
Token ID416
Feature activation-7.72e-02

Pos logit contributions
 Advertisement+0.238
 Kitt+0.228
bryce+0.224
Reviewer+0.196
cars+0.190
Neg logit contributions
 apologizing-0.267
 recomp-0.255
 challeng-0.253
aign-0.249
 forgiven-0.248
Token the
Token ID262
Feature activation-3.72e-01

Pos logit contributions
 Advertisement+0.055
 Kitt+0.052
bryce+0.052
Reviewer+0.045
 Carbuncle+0.044
Neg logit contributions
 apologizing-0.061
 recomp-0.058
 challeng-0.058
aign-0.057
 forgiven-0.057
Token lif
Token ID3868
Feature activation-3.18e+00

Pos logit contributions
 Advertisement+0.316
 Kitt+0.305
bryce+0.300
Reviewer+0.270
cars+0.263
Neg logit contributions
 apologizing-0.243
 recomp-0.230
 challeng-0.229
aign-0.225
 forgiven-0.222
Tokenel
Token ID417
Feature activation+5.49e-01

Pos logit contributions
 Advertisement+2.325
 Kitt+2.215
bryce+2.174
Reviewer+1.936
cars+1.910
Neg logit contributions
 apologizing-2.451
 challeng-2.332
 recomp-2.306
aign-2.268
 helicop-2.259
Tokenike
Token ID522
Feature activation+7.91e-01

Pos logit contributions
 apologizing+0.226
 recomp+0.208
 challeng+0.207
aign+0.199
 forgiven+0.195
Neg logit contributions
 Advertisement-0.597
 Kitt-0.580
bryce-0.573
Reviewer-0.529
cars-0.520
Token movements
Token ID8650
Feature activation+1.47e+00

Pos logit contributions
 apologizing+0.533
 recomp+0.509
 challeng+0.504
aign+0.495
 forgiven+0.489
Neg logit contributions
 Advertisement-0.651
 Kitt-0.627
bryce-0.620
Reviewer-0.555
 Carbuncle-0.539
Token.
Token ID13
Feature activation-2.62e-01

Pos logit contributions
 apologizing+1.129
 recomp+1.077
 challeng+1.072
aign+1.055
 forgiven+1.051
Neg logit contributions
 Advertisement-1.072
 Kitt-1.028
bryce-1.018
Reviewer-0.894
 Carbuncle-0.869
Token
Token ID198
Feature activation-3.80e-01

Pos logit contributions
 Advertisement+0.212
 Kitt+0.205
bryce+0.202
Reviewer+0.181
cars+0.176
Neg logit contributions
 apologizing-0.179
 recomp-0.170
 challeng-0.168
aign-0.167
 forgiven-0.164
Token hard
Token ID1327
Feature activation-6.54e-01

Pos logit contributions
 Advertisement+0.154
 Kitt+0.148
bryce+0.146
Reviewer+0.130
cars+0.128
Neg logit contributions
 apologizing-0.130
 recomp-0.123
 challeng-0.123
aign-0.121
 helicop-0.119
Token to
Token ID284
Feature activation-9.55e-03

Pos logit contributions
 Advertisement+0.466
 Kitt+0.445
bryce+0.438
Reviewer+0.386
cars+0.375
Neg logit contributions
 apologizing-0.514
 recomp-0.492
 challeng-0.491
aign-0.483
 forgiven-0.478
Token get
Token ID651
Feature activation+3.48e-01

Pos logit contributions
 Advertisement+0.008
 Kitt+0.008
bryce+0.008
Reviewer+0.007
cars+0.007
Neg logit contributions
 apologizing-0.006
 recomp-0.006
 challeng-0.006
aign-0.006
 forgiven-0.006
Token a
Token ID257
Feature activation-1.10e-01

Pos logit contributions
 apologizing+0.263
 recomp+0.251
 challeng+0.250
aign+0.246
 forgiven+0.244
Neg logit contributions
 Advertisement-0.259
 Kitt-0.249
bryce-0.245
Reviewer-0.217
cars-0.210
Token very
Token ID845
Feature activation-2.27e-01

Pos logit contributions
 Advertisement+0.095
 Kitt+0.092
bryce+0.091
Reviewer+0.082
cars+0.080
Neg logit contributions
 apologizing-0.070
 recomp-0.066
 challeng-0.066
aign-0.064
 forgiven-0.064
Token lif
Token ID3868
Feature activation-3.08e+00

Pos logit contributions
 Advertisement+0.186
 Kitt+0.179
bryce+0.176
Reviewer+0.157
cars+0.154
Neg logit contributions
 apologizing-0.155
 recomp-0.147
 challeng-0.147
aign-0.144
 forgiven-0.142
Tokenel
Token ID417
Feature activation+4.47e-01

Pos logit contributions
 Advertisement+2.244
 Kitt+2.134
bryce+2.083
Reviewer+1.847
cars+1.838
Neg logit contributions
 apologizing-2.368
 challeng-2.279
 recomp-2.238
 helicop-2.203
aign-2.197
Tokenike
Token ID522
Feature activation+8.38e-01

Pos logit contributions
 apologizing+0.187
 challeng+0.171
 recomp+0.171
aign+0.165
 forgiven+0.162
Neg logit contributions
 Advertisement-0.484
 Kitt-0.470
bryce-0.465
Reviewer-0.428
cars-0.421
Token,
Token ID11
Feature activation-6.64e-02

Pos logit contributions
 apologizing+0.539
 recomp+0.514
 challeng+0.505
aign+0.497
 forgiven+0.491
Neg logit contributions
 Advertisement-0.716
 Kitt-0.692
bryce-0.688
Reviewer-0.617
 Carbuncle-0.602
Token fluid
Token ID11711
Feature activation+1.01e+00

Pos logit contributions
 Advertisement+0.055
 Kitt+0.053
bryce+0.053
Reviewer+0.047
cars+0.046
Neg logit contributions
 apologizing-0.044
 recomp-0.042
 challeng-0.042
aign-0.041
 forgiven-0.040
Token motion
Token ID6268
Feature activation+1.18e+00

Pos logit contributions
 apologizing+0.719
 recomp+0.688
 challeng+0.679
aign+0.670
 forgiven+0.661
Neg logit contributions
 Advertisement-0.795
 Kitt-0.768
bryce-0.756
Reviewer-0.671
 Carbuncle-0.655
Token.
Token ID13
Feature activation-5.89e-01

Pos logit contributions
 Advertisement+0.742
 Kitt+0.713
bryce+0.700
Reviewer+0.624
cars+0.609
Neg logit contributions
 apologizing-0.674
 recomp-0.643
 challeng-0.642
aign-0.627
 forgiven-0.622
Tokenpdf
Token ID12315
Feature activation-5.90e-01

Pos logit contributions
 Advertisement+0.499
 Kitt+0.482
bryce+0.474
Reviewer+0.426
cars+0.417
Neg logit contributions
 apologizing-0.384
 recomp-0.364
 challeng-0.363
aign-0.356
 forgiven-0.351
Token
Token ID198
Feature activation-1.45e-01

Pos logit contributions
 Advertisement+0.489
 Kitt+0.471
bryce+0.463
Reviewer+0.415
cars+0.405
Neg logit contributions
 apologizing-0.396
 recomp-0.376
 challeng-0.375
aign-0.368
 forgiven-0.363
Token
Token ID198
Feature activation-2.23e-01

Pos logit contributions
 Advertisement+0.134
 Kitt+0.131
bryce+0.129
Reviewer+0.117
 Carbuncle+0.115
Neg logit contributions
 apologizing-0.082
 recomp-0.077
aign-0.076
 forgiven-0.074
 pardon-0.073
Token©
Token ID16224
Feature activation-2.24e+00

Pos logit contributions
 Advertisement+0.152
 Kitt+0.147
bryce+0.143
Reviewer+0.124
 Carbuncle+0.122
Neg logit contributions
 apologizing-0.181
 recomp-0.173
 challeng-0.172
aign-0.171
 forgiven-0.169
Token Copyright
Token ID15069
Feature activation-2.79e+00

Pos logit contributions
 Advertisement+1.730
 Kitt+1.661
bryce+1.627
Reviewer+1.443
 Carbuncle+1.411
Neg logit contributions
 apologizing-1.632
 recomp-1.557
 challeng-1.553
aign-1.521
 forgiven-1.503
Token 2018
Token ID2864
Feature activation-4.72e-01

Pos logit contributions
 Advertisement+2.043
 Kitt+1.961
bryce+1.917
Reviewer+1.684
 Carbuncle+1.651
Neg logit contributions
 apologizing-2.140
 recomp-2.052
 challeng-2.045
aign-2.000
 helicop-1.984
Token Red
Token ID2297
Feature activation+1.75e-02

Pos logit contributions
 Advertisement+0.340
 Kitt+0.326
bryce+0.320
Reviewer+0.282
cars+0.274
Neg logit contributions
 apologizing-0.368
 recomp-0.352
 challeng-0.351
aign-0.345
 forgiven-0.341
Token
Token ID447
Feature activation-6.98e-01

Pos logit contributions
 apologizing+0.015
 recomp+0.014
 challeng+0.014
aign+0.014
 forgiven+0.014
Neg logit contributions
 Advertisement-0.012
 Kitt-0.011
bryce-0.011
Reviewer-0.009
cars-0.009
Token
Token ID247
Feature activation-2.02e-01

Pos logit contributions
 Advertisement+0.574
 Kitt+0.553
bryce+0.544
Reviewer+0.487
cars+0.475
Neg logit contributions
 apologizing-0.473
 recomp-0.450
 challeng-0.449
aign-0.440
 forgiven-0.434
Tokens
Token ID82
Feature activation-4.74e-01

Pos logit contributions
 Advertisement+0.145
 Kitt+0.139
bryce+0.136
Reviewer+0.120
cars+0.117
Neg logit contributions
 apologizing-0.158
 recomp-0.151
 challeng-0.150
aign-0.148
 forgiven-0.147
Token if
Token ID611
Feature activation+6.85e-02

Pos logit contributions
 apologizing+0.006
 challeng+0.006
 recomp+0.006
aign+0.006
 helicop+0.006
Neg logit contributions
 Advertisement-0.005
 Kitt-0.005
bryce-0.005
Reviewer-0.004
cars-0.004
Token the
Token ID262
Feature activation-1.06e-01

Pos logit contributions
 apologizing+0.059
 recomp+0.056
 challeng+0.056
aign+0.055
 forgiven+0.055
Neg logit contributions
 Advertisement-0.044
 Kitt-0.042
bryce-0.041
Reviewer-0.036
cars-0.034
Token Kernel
Token ID32169
Feature activation+6.24e-01

Pos logit contributions
 Advertisement+0.098
 Kitt+0.094
bryce+0.092
Reviewer+0.084
cars+0.082
Neg logit contributions
 apologizing-0.061
 challeng-0.057
 recomp-0.057
aign-0.056
 helicop-0.055
Token cannot
Token ID2314
Feature activation-1.08e-01

Pos logit contributions
 apologizing+0.463
 recomp+0.445
 challeng+0.437
aign+0.431
 forgiven+0.427
Neg logit contributions
 Advertisement-0.469
 Kitt-0.454
bryce-0.451
Reviewer-0.399
cars-0.387
Token correctly
Token ID9380
Feature activation+2.83e-01

Pos logit contributions
 Advertisement+0.097
 Kitt+0.094
bryce+0.092
Reviewer+0.083
cars+0.082
Neg logit contributions
 apologizing-0.064
 challeng-0.060
 recomp-0.060
aign-0.059
 helicop-0.058
Token mount
Token ID3817
Feature activation-2.74e+00

Pos logit contributions
 apologizing+0.180
 challeng+0.170
 recomp+0.168
aign+0.167
 helicop+0.164
Neg logit contributions
 Advertisement-0.245
 Kitt-0.236
bryce-0.231
Reviewer-0.209
cars-0.204
Token it
Token ID340
Feature activation+1.36e-01

Pos logit contributions
 Advertisement+1.822
 Kitt+1.743
bryce+1.670
Reviewer+1.462
cars+1.438
Neg logit contributions
 apologizing-2.268
 challeng-2.187
 recomp-2.141
aign-2.140
 helicop-2.138
Token,
Token ID11
Feature activation-3.68e-01

Pos logit contributions
 apologizing+0.110
 challeng+0.105
 recomp+0.105
aign+0.104
 helicop+0.102
Neg logit contributions
 Advertisement-0.094
 Kitt-0.090
bryce-0.088
Reviewer-0.077
cars-0.075
Token it
Token ID340
Feature activation+1.27e-01

Pos logit contributions
 Advertisement+0.258
 Kitt+0.247
bryce+0.241
Reviewer+0.212
cars+0.206
Neg logit contributions
 apologizing-0.293
 challeng-0.281
 recomp-0.279
aign-0.277
 helicop-0.274
Token will
Token ID481
Feature activation-9.38e-02

Pos logit contributions
 apologizing+0.120
 recomp+0.116
 challeng+0.115
aign+0.114
 forgiven+0.113
Neg logit contributions
 Advertisement-0.071
 Kitt-0.067
bryce-0.066
Reviewer-0.056
cars-0.053
Token panic
Token ID13619
Feature activation-1.62e+00

Pos logit contributions
 Advertisement+0.087
 Kitt+0.084
bryce+0.083
Reviewer+0.075
 Carbuncle+0.074
Neg logit contributions
 apologizing-0.053
 challeng-0.050
 recomp-0.050
aign-0.049
 helicop-0.048
Token million
Token ID1510
Feature activation-1.50e-02

Pos logit contributions
 apologizing+0.117
 recomp+0.111
 challeng+0.111
aign+0.108
 helicop+0.107
Neg logit contributions
 Advertisement-0.154
 Kitt-0.148
bryce-0.146
Reviewer-0.131
cars-0.129
Token).
Token ID737
Feature activation-3.03e-01

Pos logit contributions
 Advertisement+0.011
 Kitt+0.011
bryce+0.011
Reviewer+0.010
cars+0.009
Neg logit contributions
 apologizing-0.011
 recomp-0.011
 challeng-0.011
aign-0.010
 forgiven-0.010
Token
Token ID198
Feature activation-4.88e-01

Pos logit contributions
 Advertisement+0.255
 Kitt+0.246
bryce+0.241
Reviewer+0.217
cars+0.212
Neg logit contributions
 apologizing-0.200
 challeng-0.191
 recomp-0.190
aign-0.185
 helicop-0.184
Token
Token ID198
Feature activation-4.32e-01

Pos logit contributions
 Advertisement+0.447
 Kitt+0.434
bryce+0.429
Reviewer+0.389
 Carbuncle+0.380
Neg logit contributions
 apologizing-0.282
 recomp-0.266
aign-0.260
 forgiven-0.255
 challeng-0.255
Token
Token ID3581
Feature activation-4.86e-01

Pos logit contributions
 Advertisement+0.280
 Kitt+0.267
bryce+0.262
Reviewer+0.227
cars+0.219
Neg logit contributions
 apologizing-0.368
 recomp-0.353
 challeng-0.352
aign-0.347
 forgiven-0.343
Token Please
Token ID4222
Feature activation-2.73e+00

Pos logit contributions
 Advertisement+0.368
 Kitt+0.352
bryce+0.344
Reviewer+0.306
cars+0.298
Neg logit contributions
 apologizing-0.363
 challeng-0.348
 recomp-0.347
aign-0.339
 helicop-0.336
Token click
Token ID3904
Feature activation-2.15e-01

Pos logit contributions
 Advertisement+2.177
 Kitt+2.070
bryce+2.024
Reviewer+1.819
 Carbuncle+1.774
Neg logit contributions
 apologizing-1.908
 challeng-1.835
aign-1.802
 recomp-1.786
 helicop-1.770
Token here
Token ID994
Feature activation+3.32e-01

Pos logit contributions
 Advertisement+0.185
 Kitt+0.178
bryce+0.175
Reviewer+0.158
cars+0.154
Neg logit contributions
 apologizing-0.137
 challeng-0.130
 recomp-0.130
aign-0.127
 helicop-0.126
Token for
Token ID329
Feature activation+1.85e-01

Pos logit contributions
 apologizing+0.252
 recomp+0.241
 challeng+0.240
aign+0.236
 forgiven+0.233
Neg logit contributions
 Advertisement-0.247
 Kitt-0.237
bryce-0.232
Reviewer-0.206
cars-0.200
Token more
Token ID517
Feature activation-1.98e-01

Pos logit contributions
 apologizing+0.148
 recomp+0.141
 challeng+0.141
aign+0.139
 forgiven+0.137
Neg logit contributions
 Advertisement-0.130
 Kitt-0.124
bryce-0.121
Reviewer-0.107
cars-0.104
Token Dolphins
Token ID21668
Feature activation-1.93e-01

Pos logit contributions
 Advertisement+0.156
 Kitt+0.149
bryce+0.146
Reviewer+0.130
cars+0.128
Neg logit contributions
 apologizing-0.141
 recomp-0.135
 challeng-0.134
aign-0.132
 forgiven-0.130
Token happens
Token ID4325
Feature activation-4.82e-01

Pos logit contributions
 Advertisement+0.059
 Kitt+0.055
bryce+0.054
Reviewer+0.050
 Carbuncle+0.046
Neg logit contributions
 apologizing-0.062
 recomp-0.061
 challeng-0.059
aign-0.058
 reload-0.058
Token in
Token ID287
Feature activation-1.66e-01

Pos logit contributions
 Advertisement+0.381
 Kitt+0.366
bryce+0.361
Reviewer+0.322
cars+0.313
Neg logit contributions
 apologizing-0.342
 recomp-0.325
 challeng-0.324
aign-0.319
 forgiven-0.315
Token Greater
Token ID18169
Feature activation+1.80e-01

Pos logit contributions
 Advertisement+0.128
 Kitt+0.123
bryce+0.121
Reviewer+0.108
cars+0.105
Neg logit contributions
 apologizing-0.121
 recomp-0.115
 challeng-0.115
aign-0.113
 forgiven-0.112
Token Cincinnati
Token ID16137
Feature activation-3.55e-01

Pos logit contributions
 apologizing+0.128
 recomp+0.121
aign+0.120
 challeng+0.119
 forgiven+0.118
Neg logit contributions
 Advertisement-0.141
bryce-0.136
 Kitt-0.134
Reviewer-0.121
cars-0.116
Token.
Token ID13
Feature activation+7.05e-02

Pos logit contributions
 Advertisement+0.285
 Kitt+0.274
bryce+0.274
Reviewer+0.245
 Carbuncle+0.237
Neg logit contributions
 apologizing-0.245
 recomp-0.232
 challeng-0.230
aign-0.228
 forgiven-0.225
Token Please
Token ID4222
Feature activation-2.66e+00

Pos logit contributions
 apologizing+0.053
 recomp+0.050
 challeng+0.050
aign+0.050
 forgiven+0.049
Neg logit contributions
 Advertisement-0.053
 Kitt-0.050
bryce-0.050
Reviewer-0.044
cars-0.043
Token try
Token ID1949
Feature activation+6.06e-01

Pos logit contributions
 Advertisement+2.019
 Kitt+1.940
bryce+1.883
Reviewer+1.673
cars+1.623
Neg logit contributions
 apologizing-1.963
 challeng-1.885
 recomp-1.875
aign-1.858
 forgiven-1.834
Token again
Token ID757
Feature activation-1.26e-01

Pos logit contributions
 apologizing+0.319
 recomp+0.300
 challeng+0.300
aign+0.290
 helicop+0.287
Neg logit contributions
 Advertisement-0.589
 Kitt-0.571
bryce-0.562
Reviewer-0.513
cars-0.503
Token soon
Token ID2582
Feature activation+6.31e-01

Pos logit contributions
 Advertisement+0.110
 Kitt+0.106
bryce+0.105
Reviewer+0.095
cars+0.093
Neg logit contributions
 apologizing-0.078
 recomp-0.074
 challeng-0.074
aign-0.072
 helicop-0.071
Token,
Token ID11
Feature activation-1.09e-01

Pos logit contributions
 apologizing+0.430
 recomp+0.406
aign+0.399
 challeng+0.397
 forgiven+0.393
Neg logit contributions
 Advertisement-0.510
 Kitt-0.495
bryce-0.492
Reviewer-0.440
cars-0.427
Token or
Token ID393
Feature activation-9.00e-03

Pos logit contributions
 Advertisement+0.093
 Kitt+0.090
bryce+0.089
Reviewer+0.081
cars+0.078
Neg logit contributions
 apologizing-0.071
 recomp-0.066
 challeng-0.066
aign-0.065
 forgiven-0.064
Token anywhere
Token ID6609
Feature activation-1.04e+00

Pos logit contributions
 apologizing+0.346
 recomp+0.331
 challeng+0.330
aign+0.324
 helicop+0.320
Neg logit contributions
 Advertisement-0.352
 Kitt-0.338
bryce-0.331
Reviewer-0.294
cars-0.286
Token.
Token ID13
Feature activation-9.95e-02

Pos logit contributions
 Advertisement+0.828
 Kitt+0.795
bryce+0.775
Reviewer+0.690
cars+0.683
Neg logit contributions
 apologizing-0.726
 challeng-0.696
 recomp-0.695
aign-0.679
 helicop-0.673
Token
Token ID198
Feature activation-2.57e-01

Pos logit contributions
 Advertisement+0.080
 Kitt+0.076
bryce+0.075
Reviewer+0.067
cars+0.065
Neg logit contributions
 apologizing-0.070
 challeng-0.067
 recomp-0.067
aign-0.065
 helicop-0.065
Token
Token ID198
Feature activation-3.15e-01

Pos logit contributions
 Advertisement+0.237
 Kitt+0.230
bryce+0.228
Reviewer+0.206
 Carbuncle+0.202
Neg logit contributions
 apologizing-0.146
 recomp-0.138
aign-0.135
 forgiven-0.133
 challeng-0.130
TokenCopyright
Token ID15269
Feature activation-2.00e+00

Pos logit contributions
 Advertisement+0.230
 Kitt+0.220
bryce+0.216
Reviewer+0.191
cars+0.185
Neg logit contributions
 apologizing-0.244
 recomp-0.233
 challeng-0.233
aign-0.228
 forgiven-0.226
Token ©
Token ID10673
Feature activation-2.65e+00

Pos logit contributions
 Advertisement+1.567
 Kitt+1.503
bryce+1.468
Reviewer+1.303
cars+1.272
Neg logit contributions
 apologizing-1.434
 challeng-1.374
 recomp-1.368
aign-1.340
 helicop-1.337
Token 2019
Token ID13130
Feature activation-4.18e-01

Pos logit contributions
 Advertisement+2.459
 Kitt+2.377
bryce+2.303
Reviewer+2.078
cars+2.057
Neg logit contributions
 apologizing-1.533
 challeng-1.485
 recomp-1.447
 helicop-1.433
aign-1.401
Token The
Token ID383
Feature activation-1.19e-01

Pos logit contributions
 Advertisement+0.302
bryce+0.292
 Kitt+0.289
Reviewer+0.257
cars+0.246
Neg logit contributions
 apologizing-0.322
 recomp-0.307
aign-0.302
 challeng-0.301
 forgiven-0.297
Token Washington
Token ID2669
Feature activation+1.70e-01

Pos logit contributions
 Advertisement+0.093
 Kitt+0.090
bryce+0.090
Reviewer+0.080
cars+0.078
Neg logit contributions
 apologizing-0.083
 recomp-0.078
aign-0.078
 forgiven-0.076
 challeng-0.075
Token Times
Token ID3782
Feature activation-9.96e-01

Pos logit contributions
 apologizing+0.107
 recomp+0.099
aign+0.098
 forgiven+0.097
 pardon+0.096
Neg logit contributions
 Advertisement-0.144
bryce-0.140
 Kitt-0.139
Reviewer-0.127
cars-0.124
Token,
Token ID11
Feature activation-7.27e-02

Pos logit contributions
 Advertisement+0.647
bryce+0.626
 Kitt+0.617
Reviewer+0.554
cars+0.527
Neg logit contributions
 apologizing-0.831
 recomp-0.789
aign-0.781
 pardon-0.768
 forgiven-0.767
Tokenine
Token ID500
Feature activation-1.02e-01

Pos logit contributions
 Advertisement+0.036
 Kitt+0.035
bryce+0.034
Reviewer+0.030
cars+0.030
Neg logit contributions
 apologizing-0.034
 challeng-0.032
 recomp-0.032
 forgiven-0.031
 helicop-0.031
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
 Advertisement+0.080
 Kitt+0.077
bryce+0.076
Reviewer+0.068
cars+0.066
Neg logit contributions
 apologizing-0.071
 challeng-0.068
 recomp-0.068
aign-0.067
 forgiven-0.066
Token
Token ID198
Feature activation+8.17e-02

Pos logit contributions
 apologizing+0.117
 recomp+0.110
 challeng+0.109
aign+0.108
 forgiven+0.106
Neg logit contributions
 Advertisement-0.168
 Kitt-0.163
bryce-0.162
Reviewer-0.145
cars-0.142
Token
Token ID198
Feature activation+2.89e-02

Pos logit contributions
 apologizing+0.045
 recomp+0.042
aign+0.042
 forgiven+0.041
 reload+0.040
Neg logit contributions
 Advertisement-0.076
 Kitt-0.075
bryce-0.074
Reviewer-0.067
 Carbuncle-0.066
TokenCopyright
Token ID15269
Feature activation-1.77e+00

Pos logit contributions
 apologizing+0.023
 recomp+0.022
aign+0.022
 challeng+0.022
 forgiven+0.021
Neg logit contributions
 Advertisement-0.020
 Kitt-0.019
bryce-0.019
 Carbuncle-0.016
Reviewer-0.016
Token ©
Token ID10673
Feature activation-2.62e+00

Pos logit contributions
 Advertisement+1.356
 Kitt+1.305
bryce+1.275
Reviewer+1.129
cars+1.102
Neg logit contributions
 apologizing-1.298
 recomp-1.238
 challeng-1.237
aign-1.212
 helicop-1.208
Token 2016
Token ID1584
Feature activation-1.84e-01

Pos logit contributions
 Advertisement+2.326
 Kitt+2.261
bryce+2.197
Reviewer+1.955
 Carbuncle+1.949
Neg logit contributions
 apologizing-1.601
 challeng-1.535
 recomp-1.512
 helicop-1.485
 forgiven-1.464
Token Else
Token ID25974
Feature activation-3.42e-01

Pos logit contributions
 Advertisement+0.128
 Kitt+0.123
bryce+0.120
Reviewer+0.105
cars+0.102
Neg logit contributions
 apologizing-0.148
 recomp-0.141
 challeng-0.141
aign-0.139
 forgiven-0.137
Tokenvier
Token ID49663
Feature activation+6.68e-01

Pos logit contributions
 Advertisement+0.227
 Kitt+0.216
bryce+0.213
Reviewer+0.186
cars+0.179
Neg logit contributions
 apologizing-0.284
 recomp-0.273
 challeng-0.271
aign-0.269
 forgiven-0.266
Token B
Token ID347
Feature activation-2.68e-01

Pos logit contributions
 apologizing+0.560
 recomp+0.537
 challeng+0.532
aign+0.529
 forgiven+0.519
Neg logit contributions
 Advertisement-0.434
 Kitt-0.413
bryce-0.413
Reviewer-0.359
cars-0.347
Token.
Token ID13
Feature activation+4.57e-02

Pos logit contributions
 Advertisement+0.210
 Kitt+0.203
bryce+0.198
Reviewer+0.175
cars+0.173
Neg logit contributions
 apologizing-0.192
 recomp-0.183
 challeng-0.182
aign-0.178
 helicop-0.178
Token MB
Token ID10771
Feature activation+4.08e-01

Pos logit contributions
 Advertisement+0.220
bryce+0.216
 Kitt+0.214
Reviewer+0.194
cars+0.187
Neg logit contributions
 apologizing-0.179
 recomp-0.172
 forgiven-0.164
aign-0.164
 apologized-0.162
Token 8
Token ID807
Feature activation-2.48e-01

Pos logit contributions
 apologizing+0.276
 recomp+0.264
 challeng+0.261
aign+0.257
 forgiven+0.254
Neg logit contributions
 Advertisement-0.333
 Kitt-0.322
bryce-0.318
Reviewer-0.285
cars-0.277
Token192
Token ID17477
Feature activation-3.67e-01

Pos logit contributions
 Advertisement+0.189
 Kitt+0.182
bryce+0.179
Reviewer+0.159
cars+0.154
Neg logit contributions
 apologizing-0.183
 recomp-0.175
 challeng-0.173
aign-0.171
 forgiven-0.169
Token MB
Token ID10771
Feature activation+3.40e-01

Pos logit contributions
 Advertisement+0.300
bryce+0.294
 Kitt+0.290
Reviewer+0.263
cars+0.254
Neg logit contributions
 apologizing-0.244
 recomp-0.234
aign-0.224
 forgiven-0.224
 challeng-0.223
Token Texture
Token ID44409
Feature activation-7.50e-01

Pos logit contributions
 apologizing+0.228
 recomp+0.217
 challeng+0.215
aign+0.211
 forgiven+0.209
Neg logit contributions
 Advertisement-0.280
 Kitt-0.271
bryce-0.268
Reviewer-0.241
cars-0.234
Token Units
Token ID27719
Feature activation-2.56e+00

Pos logit contributions
 Advertisement+0.590
 Kitt+0.566
bryce+0.555
Reviewer+0.496
cars+0.484
Neg logit contributions
 apologizing-0.537
 challeng-0.510
 recomp-0.510
aign-0.501
 forgiven-0.495
Token 144
Token ID20224
Feature activation-3.17e-01

Pos logit contributions
 Advertisement+1.917
 Kitt+1.811
bryce+1.729
cars+1.507
 Carbuncle+1.503
Neg logit contributions
 apologizing-1.948
 challeng-1.882
 recomp-1.854
aign-1.835
 helicop-1.802
Token 64
Token ID5598
Feature activation-7.14e-01

Pos logit contributions
 Advertisement+0.254
 Kitt+0.245
bryce+0.240
Reviewer+0.215
cars+0.210
Neg logit contributions
 apologizing-0.222
 recomp-0.211
 challeng-0.210
aign-0.206
 forgiven-0.204
Token 104
Token ID14436
Feature activation-4.19e-01

Pos logit contributions
 Advertisement+0.586
 Kitt+0.564
bryce+0.556
Reviewer+0.498
cars+0.486
Neg logit contributions
 apologizing-0.485
 recomp-0.461
 challeng-0.459
aign-0.451
 forgiven-0.445
Token 80
Token ID4019
Feature activation-2.36e-01

Pos logit contributions
 Advertisement+0.354
 Kitt+0.341
bryce+0.337
Reviewer+0.303
cars+0.295
Neg logit contributions
 apologizing-0.274
 recomp-0.260
 challeng-0.258
aign-0.253
 forgiven-0.250
Token 120
Token ID7982
Feature activation+1.75e-01

Pos logit contributions
 Advertisement+0.196
 Kitt+0.188
bryce+0.185
Reviewer+0.166
cars+0.162
Neg logit contributions
 apologizing-0.158
 recomp-0.150
 challeng-0.150
aign-0.147
 forgiven-0.145
Token attacks
Token ID3434
Feature activation-2.74e-01

Pos logit contributions
 apologizing+0.307
 recomp+0.291
 challeng+0.291
aign+0.285
 forgiven+0.281
Neg logit contributions
 Advertisement-0.372
 Kitt-0.358
bryce-0.352
Reviewer-0.316
cars-0.308
Token and
Token ID290
Feature activation-4.09e-01

Pos logit contributions
 Advertisement+0.211
 Kitt+0.203
bryce+0.199
Reviewer+0.177
cars+0.172
Neg logit contributions
 apologizing-0.200
 recomp-0.191
 challeng-0.190
aign-0.187
 forgiven-0.185
Token the
Token ID262
Feature activation-2.35e-01

Pos logit contributions
 Advertisement+0.313
 Kitt+0.300
bryce+0.295
Reviewer+0.262
cars+0.255
Neg logit contributions
 apologizing-0.301
 recomp-0.287
 challeng-0.286
aign-0.281
 forgiven-0.278
Token French
Token ID4141
Feature activation-9.21e-02

Pos logit contributions
 Advertisement+0.192
 Kitt+0.185
bryce+0.182
Reviewer+0.163
cars+0.159
Neg logit contributions
 apologizing-0.161
 recomp-0.153
 challeng-0.153
aign-0.150
 forgiven-0.148
Token supermarket
Token ID26454
Feature activation-1.60e-01

Pos logit contributions
 Advertisement+0.070
 Kitt+0.067
bryce+0.066
Reviewer+0.059
cars+0.057
Neg logit contributions
 apologizing-0.068
 recomp-0.065
 challeng-0.065
aign-0.064
 forgiven-0.063
Token chain
Token ID6333
Feature activation-2.55e+00

Pos logit contributions
 Advertisement+0.115
 Kitt+0.110
bryce+0.108
Reviewer+0.096
cars+0.092
Neg logit contributions
 apologizing-0.125
 recomp-0.120
 challeng-0.119
aign-0.118
 forgiven-0.116
Token Car
Token ID1879
Feature activation+1.78e-01

Pos logit contributions
 Advertisement+1.929
 Kitt+1.859
bryce+1.797
Reviewer+1.591
 Carbuncle+1.569
Neg logit contributions
 apologizing-1.878
 challeng-1.804
 recomp-1.798
aign-1.759
 helicop-1.741
Tokenref
Token ID5420
Feature activation-9.96e-01

Pos logit contributions
 apologizing+0.121
 recomp+0.115
 challeng+0.114
aign+0.113
 forgiven+0.111
Neg logit contributions
 Advertisement-0.146
 Kitt-0.141
bryce-0.139
Reviewer-0.125
 Carbuncle-0.121
Tokenour
Token ID454
Feature activation-5.83e-01

Pos logit contributions
 Advertisement+0.956
 Kitt+0.926
bryce+0.913
Reviewer+0.831
cars+0.816
Neg logit contributions
 apologizing-0.538
 challeng-0.504
 recomp-0.504
aign-0.489
 forgiven-0.483
Token took
Token ID1718
Feature activation-1.10e+00

Pos logit contributions
 Advertisement+0.451
 Kitt+0.433
bryce+0.426
Reviewer+0.379
cars+0.369
Neg logit contributions
 apologizing-0.423
 recomp-0.403
 challeng-0.402
aign-0.395
 forgiven-0.390
Token Danish
Token ID20849
Feature activation+2.09e-01

Pos logit contributions
 Advertisement+0.832
 Kitt+0.798
bryce+0.785
Reviewer+0.695
cars+0.677
Neg logit contributions
 apologizing-0.823
 recomp-0.786
 challeng-0.783
aign-0.770
 forgiven-0.761
Token our
Token ID674
Feature activation-3.72e-01

Pos logit contributions
 apologizing+0.130
 recomp+0.123
 challeng+0.120
aign+0.120
 forgiven+0.119
Neg logit contributions
 Advertisement-0.152
 Kitt-0.145
bryce-0.145
Reviewer-0.128
cars-0.126
Token work
Token ID670
Feature activation+9.15e-01

Pos logit contributions
 Advertisement+0.321
 Kitt+0.310
bryce+0.305
Reviewer+0.274
 Carbuncle+0.268
Neg logit contributions
 apologizing-0.238
 challeng-0.225
 recomp-0.225
aign-0.220
 helicop-0.218
Token is
Token ID318
Feature activation-8.57e-02

Pos logit contributions
 apologizing+0.747
 recomp+0.722
 forgiven+0.699
 challeng+0.696
aign+0.695
Neg logit contributions
 Advertisement-0.602
bryce-0.595
 Kitt-0.592
Reviewer-0.516
 Carbuncle-0.508
Token appreciated
Token ID16373
Feature activation-1.05e+00

Pos logit contributions
 Advertisement+0.081
 Kitt+0.080
bryce+0.079
 Carbuncle+0.071
cars+0.071
Neg logit contributions
 apologizing-0.047
 recomp-0.045
 forgiven-0.043
 challeng-0.043
 pardon-0.042
Token.
Token ID13
Feature activation+9.13e-02

Pos logit contributions
bryce+0.880
 Advertisement+0.879
 Kitt+0.858
cars+0.776
Reviewer+0.765
Neg logit contributions
 apologizing-0.676
 recomp-0.634
 pardon-0.612
 understatement-0.610
aign-0.609
Token Please
Token ID4222
Feature activation-2.54e+00

Pos logit contributions
 apologizing+0.055
 recomp+0.053
aign+0.051
 challeng+0.050
 forgiven+0.050
Neg logit contributions
bryce-0.078
 Advertisement-0.078
 Kitt-0.078
Reviewer-0.069
cars-0.069
Token click
Token ID3904
Feature activation-5.21e-01

Pos logit contributions
 Advertisement+1.996
 Kitt+1.894
bryce+1.826
Reviewer+1.649
 Carbuncle+1.629
Neg logit contributions
 apologizing-1.823
 challeng-1.750
aign-1.728
 recomp-1.707
 helicop-1.699
Token below
Token ID2174
Feature activation-7.27e-01

Pos logit contributions
 Advertisement+0.436
 Kitt+0.428
bryce+0.422
Reviewer+0.378
cars+0.370
Neg logit contributions
 apologizing-0.342
 recomp-0.320
 challeng-0.312
aign-0.311
 forgiven-0.311
Token to
Token ID284
Feature activation+2.28e-01

Pos logit contributions
 Advertisement+0.548
 Kitt+0.544
bryce+0.539
Reviewer+0.478
cars+0.470
Neg logit contributions
 apologizing-0.526
 recomp-0.497
aign-0.491
 pardon-0.488
 forgiven-0.481
Token become
Token ID1716
Feature activation-3.74e-02

Pos logit contributions
 apologizing+0.130
 recomp+0.125
 reload+0.121
 pardon+0.119
 forgiven+0.118
Neg logit contributions
 Advertisement-0.209
bryce-0.203
 Kitt-0.203
Reviewer-0.186
cars-0.181
Token an
Token ID281
Feature activation-9.88e-02

Pos logit contributions
bryce+0.030
 Advertisement+0.029
 Kitt+0.029
Reviewer+0.027
cars+0.026
Neg logit contributions
 apologizing-0.025
 recomp-0.025
 forgiven-0.024
 reload-0.023
 understatement-0.023
Token On
Token ID1550
Feature activation+3.35e-01

Pos logit contributions
 apologizing+0.155
 challeng+0.148
 recomp+0.148
aign+0.144
 forgiven+0.143
Neg logit contributions
 Advertisement-0.166
 Kitt-0.159
bryce-0.156
Reviewer-0.139
cars-0.135
Token 10
Token ID838
Feature activation+4.66e-01

Pos logit contributions
 apologizing+0.247
 recomp+0.236
 challeng+0.233
aign+0.230
 forgiven+0.228
Neg logit contributions
 Advertisement-0.255
 Kitt-0.244
bryce-0.242
Reviewer-0.215
cars-0.209
Token October
Token ID3267
Feature activation-5.00e-01

Pos logit contributions
 apologizing+0.368
 recomp+0.352
 challeng+0.350
aign+0.343
 forgiven+0.340
Neg logit contributions
 Advertisement-0.331
 Kitt-0.316
bryce-0.311
Reviewer-0.276
cars-0.266
Token 1941
Token ID23234
Feature activation+5.61e-01

Pos logit contributions
 Advertisement+0.336
 Kitt+0.321
bryce+0.314
Reviewer+0.273
cars+0.265
Neg logit contributions
 apologizing-0.414
 recomp-0.397
 challeng-0.396
aign-0.390
 forgiven-0.387
Token those
Token ID883
Feature activation-2.23e-01

Pos logit contributions
 apologizing+0.380
 recomp+0.361
 challeng+0.360
aign+0.353
 forgiven+0.348
Neg logit contributions
 Advertisement-0.461
 Kitt-0.443
bryce-0.439
Reviewer-0.393
cars-0.383
Token fronts
Token ID29324
Feature activation-2.53e+00

Pos logit contributions
 Advertisement+0.162
 Kitt+0.155
bryce+0.152
Reviewer+0.134
cars+0.131
Neg logit contributions
 apologizing-0.173
 recomp-0.165
 challeng-0.165
aign-0.162
 forgiven-0.161
Token were
Token ID547
Feature activation-3.10e-01

Pos logit contributions
 Advertisement+1.814
 Kitt+1.754
bryce+1.696
Reviewer+1.487
cars+1.460
Neg logit contributions
 apologizing-1.962
 recomp-1.875
 challeng-1.874
aign-1.850
 forgiven-1.817
Token merged
Token ID23791
Feature activation-1.73e-01

Pos logit contributions
 Advertisement+0.265
 Kitt+0.256
bryce+0.251
Reviewer+0.226
cars+0.221
Neg logit contributions
 apologizing-0.200
 recomp-0.190
 challeng-0.189
aign-0.186
 forgiven-0.183
Token into
Token ID656
Feature activation-2.97e-01

Pos logit contributions
 Advertisement+0.132
 Kitt+0.126
bryce+0.125
Reviewer+0.111
cars+0.108
Neg logit contributions
 apologizing-0.127
 recomp-0.121
 challeng-0.121
aign-0.119
 forgiven-0.117
Token the
Token ID262
Feature activation-4.40e-02

Pos logit contributions
 Advertisement+0.208
 Kitt+0.198
bryce+0.196
Reviewer+0.171
cars+0.166
Neg logit contributions
 apologizing-0.238
 recomp-0.228
 challeng-0.227
aign-0.223
 forgiven-0.221
Token Western
Token ID4885
Feature activation+4.04e-01

Pos logit contributions
 Advertisement+0.032
 Kitt+0.031
bryce+0.030
Reviewer+0.027
cars+0.026
Neg logit contributions
 apologizing-0.034
 recomp-0.032
 challeng-0.032
aign-0.032
 forgiven-0.031
Token Continue
Token ID10054
Feature activation-1.58e-01

Pos logit contributions
 Advertisement+0.610
 Kitt+0.586
bryce+0.577
Reviewer+0.508
cars+0.493
Neg logit contributions
 apologizing-0.678
 recomp-0.647
 challeng-0.640
aign-0.636
 forgiven-0.629
Token reading
Token ID3555
Feature activation-8.92e-01

Pos logit contributions
 Advertisement+0.132
 Kitt+0.129
bryce+0.127
Reviewer+0.114
cars+0.111
Neg logit contributions
 apologizing-0.105
 recomp-0.098
 forgiven-0.095
aign-0.095
 challeng-0.094
Token the
Token ID262
Feature activation-1.97e-01

Pos logit contributions
 Advertisement+0.593
 Kitt+0.566
bryce+0.554
Reviewer+0.482
cars+0.467
Neg logit contributions
 apologizing-0.745
 challeng-0.715
 recomp-0.715
aign-0.702
 helicop-0.696
Token main
Token ID1388
Feature activation+4.19e-01

Pos logit contributions
 Advertisement+0.180
 Kitt+0.175
bryce+0.172
Reviewer+0.157
cars+0.153
Neg logit contributions
 apologizing-0.115
 recomp-0.108
 challeng-0.106
aign-0.105
 forgiven-0.104
Token story
Token ID1621
Feature activation-5.14e-01

Pos logit contributions
 apologizing+0.253
 recomp+0.239
 challeng+0.237
aign+0.233
 forgiven+0.230
Neg logit contributions
 Advertisement-0.375
 Kitt-0.363
bryce-0.358
Reviewer-0.324
cars-0.317
Token Please
Token ID4222
Feature activation-2.53e+00

Pos logit contributions
 Advertisement+0.429
 Kitt+0.415
bryce+0.409
Reviewer+0.367
cars+0.358
Neg logit contributions
 apologizing-0.340
 recomp-0.323
 challeng-0.321
aign-0.315
 forgiven-0.312
Token verify
Token ID11767
Feature activation-9.77e-01

Pos logit contributions
 Advertisement+1.657
 Kitt+1.560
bryce+1.522
Reviewer+1.317
 Carbuncle+1.285
Neg logit contributions
 apologizing-2.114
 challeng-2.088
 recomp-2.042
aign-2.041
 helicop-2.026
Token you
Token ID345
Feature activation-4.15e-01

Pos logit contributions
 Advertisement+0.598
 Kitt+0.571
bryce+0.558
Reviewer+0.480
cars+0.462
Neg logit contributions
 apologizing-0.868
 recomp-0.837
 challeng-0.822
aign-0.820
 forgiven-0.814
Token're
Token ID821
Feature activation-4.51e-01

Pos logit contributions
 Advertisement+0.389
 Kitt+0.379
bryce+0.373
Reviewer+0.342
cars+0.332
Neg logit contributions
 apologizing-0.231
 recomp-0.219
aign-0.211
 forgiven-0.210
 challeng-0.209
Token not
Token ID407
Feature activation-3.70e-01

Pos logit contributions
 Advertisement+0.404
 Kitt+0.390
bryce+0.385
Reviewer+0.349
cars+0.341
Neg logit contributions
 apologizing-0.273
 recomp-0.258
 challeng-0.255
aign-0.251
 forgiven-0.248
Token a
Token ID257
Feature activation-1.58e-01

Pos logit contributions
 Advertisement+0.251
 Kitt+0.240
bryce+0.237
Reviewer+0.208
cars+0.200
Neg logit contributions
 apologizing-0.304
 recomp-0.290
aign-0.285
 forgiven-0.283
 challeng-0.283
Token
Token ID198
Feature activation+1.27e-01

Pos logit contributions
 apologizing+0.308
 recomp+0.293
 challeng+0.292
aign+0.286
 forgiven+0.283
Neg logit contributions
 Advertisement-0.361
 Kitt-0.348
bryce-0.343
Reviewer-0.306
cars-0.299
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
 apologizing+0.071
 recomp+0.067
aign+0.066
 forgiven+0.064
 reload+0.063
Neg logit contributions
 Advertisement-0.118
 Kitt-0.115
bryce-0.114
Reviewer-0.104
 Carbuncle-0.102
TokenPhotos
Token ID21197
Feature activation+7.40e-01

Pos logit contributions
 apologizing+0.008
 recomp+0.008
 challeng+0.008
aign+0.007
 forgiven+0.007
Neg logit contributions
 Advertisement-0.008
 Kitt-0.007
bryce-0.007
Reviewer-0.006
cars-0.006
Token courtesy
Token ID12537
Feature activation-8.84e-01

Pos logit contributions
 apologizing+0.531
 recomp+0.506
 challeng+0.506
aign+0.496
 forgiven+0.490
Neg logit contributions
 Advertisement-0.579
 Kitt-0.557
bryce-0.546
Reviewer-0.487
cars-0.474
Token of
Token ID286
Feature activation-1.68e-02

Pos logit contributions
 Advertisement+0.660
 Kitt+0.636
bryce+0.620
Reviewer+0.549
cars+0.533
Neg logit contributions
 apologizing-0.667
 recomp-0.638
 challeng-0.637
aign-0.625
 forgiven-0.618
Token ©
Token ID10673
Feature activation-2.53e+00

Pos logit contributions
 Advertisement+0.012
 Kitt+0.012
bryce+0.012
Reviewer+0.010
cars+0.010
Neg logit contributions
 apologizing-0.013
 recomp-0.012
 challeng-0.012
aign-0.012
 forgiven-0.012
TokenK
Token ID42
Feature activation-2.46e-01

Pos logit contributions
 Advertisement+1.899
 Kitt+1.832
bryce+1.788
Reviewer+1.575
cars+1.540
Neg logit contributions
 apologizing-1.896
 challeng-1.812
 recomp-1.808
aign-1.767
 forgiven-1.750
Tokenelli
Token ID23225
Feature activation-5.21e-01

Pos logit contributions
 Advertisement+0.174
 Kitt+0.166
bryce+0.163
Reviewer+0.143
cars+0.139
Neg logit contributions
 apologizing-0.195
 recomp-0.187
 challeng-0.187
aign-0.184
 forgiven-0.182
Token (
Token ID357
Feature activation-4.27e-01

Pos logit contributions
 Advertisement+0.382
 Kitt+0.368
bryce+0.360
Reviewer+0.317
cars+0.309
Neg logit contributions
 apologizing-0.400
 recomp-0.383
 challeng-0.382
aign-0.375
 forgiven-0.371
Tokenfl
Token ID2704
Feature activation-6.11e-01

Pos logit contributions
 Advertisement+0.301
 Kitt+0.288
bryce+0.283
Reviewer+0.248
cars+0.241
Neg logit contributions
 apologizing-0.340
 recomp-0.325
 challeng-0.324
aign-0.319
 forgiven-0.316
Tokenickr
Token ID18994
Feature activation-7.77e-01

Pos logit contributions
 Advertisement+0.357
 Kitt+0.339
bryce+0.331
Reviewer+0.282
cars+0.271
Neg logit contributions
 apologizing-0.559
 recomp-0.539
 challeng-0.537
aign-0.530
 forgiven-0.525
Token
Token ID251
Feature activation-1.61e-01

Pos logit contributions
 apologizing+0.018
 recomp+0.017
 challeng+0.017
aign+0.017
 forgiven+0.017
Neg logit contributions
 Advertisement-0.018
 Kitt-0.017
bryce-0.017
Reviewer-0.015
cars-0.015
Token
Token ID198
Feature activation+3.66e-01

Pos logit contributions
 Advertisement+0.141
 Kitt+0.136
bryce+0.133
Reviewer+0.120
cars+0.118
Neg logit contributions
 apologizing-0.102
 challeng-0.097
 recomp-0.097
aign-0.094
 helicop-0.093
Token
Token ID198
Feature activation+2.23e-01

Pos logit contributions
 apologizing+0.204
 recomp+0.192
aign+0.189
 forgiven+0.185
 pardon+0.181
Neg logit contributions
 Advertisement-0.341
 Kitt-0.332
bryce-0.329
Reviewer-0.299
 Carbuncle-0.293
Token(
Token ID7
Feature activation-1.32e+00

Pos logit contributions
 apologizing+0.172
 recomp+0.165
 challeng+0.164
aign+0.162
 forgiven+0.160
Neg logit contributions
 Advertisement-0.162
 Kitt-0.156
bryce-0.153
Reviewer-0.134
 Carbuncle-0.131
Token©
Token ID16224
Feature activation-1.89e+00

Pos logit contributions
 Advertisement+0.939
 Kitt+0.896
bryce+0.876
Reviewer+0.774
cars+0.751
Neg logit contributions
 apologizing-1.045
 challeng-1.007
 recomp-1.006
aign-0.981
 helicop-0.977
Token Copyright
Token ID15069
Feature activation-2.52e+00

Pos logit contributions
 Advertisement+1.424
 Kitt+1.373
bryce+1.341
Reviewer+1.169
cars+1.151
Neg logit contributions
 apologizing-1.422
 challeng-1.367
 recomp-1.358
 helicop-1.326
aign-1.324
Token 2010
Token ID3050
Feature activation-1.06e-01

Pos logit contributions
 Advertisement+2.312
 Kitt+2.245
bryce+2.200
Reviewer+1.983
cars+1.956
Neg logit contributions
 apologizing-1.466
 challeng-1.392
 recomp-1.388
 helicop-1.339
aign-1.339
Token The
Token ID383
Feature activation-8.51e-02

Pos logit contributions
 Advertisement+0.078
 Kitt+0.074
bryce+0.074
Reviewer+0.065
cars+0.063
Neg logit contributions
 apologizing-0.080
 recomp-0.077
 challeng-0.076
aign-0.075
 forgiven-0.074
Token Associated
Token ID10575
Feature activation-2.89e-01

Pos logit contributions
 Advertisement+0.064
 Kitt+0.061
bryce+0.061
Reviewer+0.054
cars+0.052
Neg logit contributions
 apologizing-0.063
 recomp-0.060
 challeng-0.060
aign-0.059
 forgiven-0.058
Token Press
Token ID4332
Feature activation-1.15e+00

Pos logit contributions
 Advertisement+0.259
bryce+0.252
 Kitt+0.251
Reviewer+0.230
cars+0.221
Neg logit contributions
 apologizing-0.170
 recomp-0.160
aign-0.156
 forgiven-0.154
 pardon-0.152
Token.
Token ID13
Feature activation-2.08e-01

Pos logit contributions
 Advertisement+0.937
 Kitt+0.905
bryce+0.898
Reviewer+0.808
cars+0.780
Neg logit contributions
 apologizing-0.784
 recomp-0.745
 challeng-0.734
aign-0.729
 forgiven-0.718
Token Continue
Token ID10054
Feature activation-2.96e-01

Pos logit contributions
 Advertisement+0.500
 Kitt+0.484
bryce+0.479
Reviewer+0.423
cars+0.409
Neg logit contributions
 apologizing-0.542
 recomp-0.517
aign-0.509
 challeng-0.506
 forgiven-0.502
Token reading
Token ID3555
Feature activation-9.27e-01

Pos logit contributions
 Advertisement+0.247
 Kitt+0.241
bryce+0.237
Reviewer+0.213
cars+0.208
Neg logit contributions
 apologizing-0.198
 recomp-0.185
aign-0.180
 forgiven-0.180
 challeng-0.179
Token the
Token ID262
Feature activation-5.01e-02

Pos logit contributions
 Advertisement+0.620
 Kitt+0.592
bryce+0.581
Reviewer+0.506
cars+0.490
Neg logit contributions
 apologizing-0.770
 recomp-0.739
 challeng-0.736
aign-0.726
 forgiven-0.719
Token main
Token ID1388
Feature activation+5.77e-01

Pos logit contributions
 Advertisement+0.046
 Kitt+0.045
bryce+0.045
Reviewer+0.041
cars+0.039
Neg logit contributions
 apologizing-0.029
 recomp-0.027
 challeng-0.026
aign-0.026
 forgiven-0.026
Token story
Token ID1621
Feature activation-2.73e-01

Pos logit contributions
 apologizing+0.337
 recomp+0.317
aign+0.309
 challeng+0.306
 forgiven+0.304
Neg logit contributions
 Advertisement-0.526
 Kitt-0.513
bryce-0.508
Reviewer-0.461
 Carbuncle-0.450
Token Please
Token ID4222
Feature activation-2.51e+00

Pos logit contributions
 Advertisement+0.230
 Kitt+0.223
bryce+0.221
Reviewer+0.198
cars+0.193
Neg logit contributions
 apologizing-0.179
 recomp-0.169
 challeng-0.167
aign-0.165
 forgiven-0.163
Token verify
Token ID11767
Feature activation-8.22e-01

Pos logit contributions
 Advertisement+1.657
 Kitt+1.568
bryce+1.526
Reviewer+1.324
 Carbuncle+1.286
Neg logit contributions
 apologizing-2.089
 challeng-2.056
 recomp-2.017
aign-2.010
 helicop-1.998
Token you
Token ID345
Feature activation-4.54e-01

Pos logit contributions
 Advertisement+0.512
 Kitt+0.491
bryce+0.479
Reviewer+0.413
cars+0.399
Neg logit contributions
 apologizing-0.721
 recomp-0.696
aign-0.680
 forgiven-0.677
 challeng-0.675
Token're
Token ID821
Feature activation-3.76e-01

Pos logit contributions
 Advertisement+0.431
 Kitt+0.420
bryce+0.414
Reviewer+0.382
cars+0.370
Neg logit contributions
 apologizing-0.247
 recomp-0.235
aign-0.224
 forgiven-0.224
 apologized-0.221
Token not
Token ID407
Feature activation-2.69e-01

Pos logit contributions
 Advertisement+0.341
 Kitt+0.329
bryce+0.326
Reviewer+0.296
cars+0.288
Neg logit contributions
 apologizing-0.223
 recomp-0.211
 challeng-0.205
aign-0.205
 forgiven-0.203
Token a
Token ID257
Feature activation-1.23e-01

Pos logit contributions
 Advertisement+0.185
 Kitt+0.177
bryce+0.175
Reviewer+0.154
 Carbuncle+0.148
Neg logit contributions
 apologizing-0.219
 recomp-0.209
aign-0.205
 forgiven-0.204
 reload-0.202
Token Continue
Token ID10054
Feature activation-2.25e-01

Pos logit contributions
 Advertisement+0.475
 Kitt+0.457
bryce+0.451
Reviewer+0.397
cars+0.385
Neg logit contributions
 apologizing-0.524
 recomp-0.500
 challeng-0.492
aign-0.492
 forgiven-0.486
Token reading
Token ID3555
Feature activation-9.88e-01

Pos logit contributions
 Advertisement+0.187
 Kitt+0.182
bryce+0.179
Reviewer+0.161
cars+0.157
Neg logit contributions
 apologizing-0.151
 recomp-0.141
 challeng-0.138
aign-0.137
 forgiven-0.137
Token the
Token ID262
Feature activation-7.17e-02

Pos logit contributions
 Advertisement+0.659
 Kitt+0.628
bryce+0.616
Reviewer+0.536
cars+0.520
Neg logit contributions
 apologizing-0.823
 recomp-0.789
 challeng-0.788
aign-0.775
 forgiven-0.768
Token main
Token ID1388
Feature activation+6.83e-01

Pos logit contributions
 Advertisement+0.066
 Kitt+0.064
bryce+0.064
Reviewer+0.058
cars+0.056
Neg logit contributions
 apologizing-0.041
 recomp-0.039
 challeng-0.038
aign-0.038
 forgiven-0.037
Token story
Token ID1621
Feature activation-3.72e-01

Pos logit contributions
 apologizing+0.404
 recomp+0.381
 challeng+0.372
aign+0.372
 forgiven+0.366
Neg logit contributions
 Advertisement-0.618
 Kitt-0.602
bryce-0.594
Reviewer-0.539
 Carbuncle-0.525
Token Please
Token ID4222
Feature activation-2.49e+00

Pos logit contributions
 Advertisement+0.312
 Kitt+0.303
bryce+0.300
Reviewer+0.269
cars+0.262
Neg logit contributions
 apologizing-0.244
 recomp-0.231
 challeng-0.229
aign-0.226
 forgiven-0.223
Token verify
Token ID11767
Feature activation-9.05e-01

Pos logit contributions
 Advertisement+1.619
 Kitt+1.525
bryce+1.483
Reviewer+1.283
 Carbuncle+1.253
Neg logit contributions
 apologizing-2.085
 challeng-2.067
aign-2.019
 recomp-2.018
 helicop-2.006
Token you
Token ID345
Feature activation-4.57e-01

Pos logit contributions
 Advertisement+0.561
 Kitt+0.537
bryce+0.524
Reviewer+0.451
cars+0.435
Neg logit contributions
 apologizing-0.796
 recomp-0.768
aign-0.752
 challeng-0.749
 forgiven-0.747
Token're
Token ID821
Feature activation-4.72e-01

Pos logit contributions
 Advertisement+0.430
 Kitt+0.419
bryce+0.413
Reviewer+0.379
cars+0.368
Neg logit contributions
 apologizing-0.254
 recomp-0.240
aign-0.230
 forgiven-0.230
 challeng-0.228
Token not
Token ID407
Feature activation-2.07e-01

Pos logit contributions
 Advertisement+0.426
 Kitt+0.411
bryce+0.407
Reviewer+0.369
cars+0.360
Neg logit contributions
 apologizing-0.282
 recomp-0.266
 challeng-0.261
aign-0.259
 forgiven-0.256
Token a
Token ID257
Feature activation-8.99e-02

Pos logit contributions
 Advertisement+0.143
 Kitt+0.136
bryce+0.135
Reviewer+0.119
 Carbuncle+0.114
Neg logit contributions
 apologizing-0.168
 recomp-0.161
aign-0.158
 forgiven-0.157
 reload-0.155
Token
Token ID198
Feature activation-1.35e-01

Pos logit contributions
 Advertisement+0.034
 Kitt+0.034
bryce+0.033
Reviewer+0.030
 Carbuncle+0.030
Neg logit contributions
 apologizing-0.021
 recomp-0.019
aign-0.019
 forgiven-0.019
 apologized-0.018
TokenMr
Token ID5246
Feature activation-8.39e-02

Pos logit contributions
 Advertisement+0.086
 Kitt+0.082
bryce+0.081
Reviewer+0.070
cars+0.067
Neg logit contributions
 apologizing-0.116
 recomp-0.112
 challeng-0.111
aign-0.110
 forgiven-0.109
Token.
Token ID13
Feature activation+2.28e-01

Pos logit contributions
 Advertisement+0.068
 Kitt+0.065
bryce+0.065
Reviewer+0.058
cars+0.056
Neg logit contributions
 apologizing-0.058
 recomp-0.055
aign-0.054
 challeng-0.054
 forgiven-0.053
Token Ahmad
Token ID24152
Feature activation-5.03e-01

Pos logit contributions
 apologizing+0.146
 recomp+0.138
 challeng+0.138
aign+0.135
 forgiven+0.133
Neg logit contributions
 Advertisement-0.197
 Kitt-0.190
bryce-0.187
Reviewer-0.168
cars-0.165
Tokenine
Token ID500
Feature activation-1.51e+00

Pos logit contributions
 Advertisement+0.414
 Kitt+0.399
bryce+0.393
Reviewer+0.352
cars+0.343
Neg logit contributions
 apologizing-0.340
 recomp-0.323
 challeng-0.320
aign-0.317
 forgiven-0.312
Tokenjad
Token ID38442
Feature activation+1.67e+00

Pos logit contributions
 Advertisement+1.451
 Kitt+1.408
bryce+1.387
Reviewer+1.267
 Carbuncle+1.238
Neg logit contributions
 apologizing-0.806
 recomp-0.755
aign-0.737
 challeng-0.732
 forgiven-0.725
Token said
Token ID531
Feature activation-3.05e-01

Pos logit contributions
 apologizing+1.225
 recomp+1.167
 challeng+1.160
aign+1.139
 forgiven+1.133
Neg logit contributions
 Advertisement-1.281
 Kitt-1.230
bryce-1.214
Reviewer-1.084
cars-1.047
Token that
Token ID326
Feature activation-1.80e-01

Pos logit contributions
 Advertisement+0.220
 Kitt+0.211
bryce+0.207
Reviewer+0.181
cars+0.177
Neg logit contributions
 apologizing-0.235
 recomp-0.226
 challeng-0.226
aign-0.222
 helicop-0.219
Token confrontation
Token ID19207
Feature activation-5.85e-01

Pos logit contributions
 Advertisement+0.136
 Kitt+0.130
bryce+0.128
Reviewer+0.113
cars+0.110
Neg logit contributions
 apologizing-0.134
 recomp-0.128
 challeng-0.128
aign-0.126
 forgiven-0.124
Token with
Token ID351
Feature activation+1.81e-01

Pos logit contributions
 Advertisement+0.423
 Kitt+0.405
bryce+0.398
Reviewer+0.354
 Carbuncle+0.340
Neg logit contributions
 apologizing-0.459
 recomp-0.438
 challeng-0.435
aign-0.426
 forgiven-0.426
Token Israel
Token ID2692
Feature activation-2.01e-01

Pos logit contributions
 apologizing+0.137
 recomp+0.131
 challeng+0.130
aign+0.128
 forgiven+0.127
Neg logit contributions
 Advertisement-0.134
 Kitt-0.129
bryce-0.126
Reviewer-0.112
cars-0.109
Token,
Token ID11
Feature activation+1.20e-01

Pos logit contributions
 Advertisement+0.169
 Kitt+0.162
bryce+0.159
Reviewer+0.141
cars+0.137
Neg logit contributions
 apologizing-0.171
 recomp-0.163
 challeng-0.163
aign-0.160
 forgiven-0.158
Token while
Token ID981
Feature activation-7.76e-01

Pos logit contributions
 apologizing+0.088
 recomp+0.084
 challeng+0.083
aign+0.082
 forgiven+0.081
Neg logit contributions
 Advertisement-0.092
 Kitt-0.089
bryce-0.087
Reviewer-0.078
cars-0.075
Token Alex
Token ID4422
Feature activation+7.63e-01

Pos logit contributions
 Advertisement+0.575
 Kitt+0.554
bryce+0.539
Reviewer+0.475
cars+0.465
Neg logit contributions
 apologizing-0.585
 recomp-0.561
 challeng-0.560
aign-0.550
 forgiven-0.544
Tokenei
Token ID20295
Feature activation-9.75e-01

Pos logit contributions
 apologizing+0.589
 recomp+0.562
 challeng+0.557
aign+0.554
 forgiven+0.545
Neg logit contributions
 Advertisement-0.550
bryce-0.523
 Kitt-0.517
Reviewer-0.462
 Carbuncle-0.442
Token Em
Token ID2295
Feature activation-5.71e-01

Pos logit contributions
 Advertisement+0.674
 Kitt+0.635
bryce+0.635
Reviewer+0.557
cars+0.536
Neg logit contributions
 apologizing-0.792
 recomp-0.760
 challeng-0.754
aign-0.741
 forgiven-0.739
Tokenelin
Token ID27176
Feature activation+1.53e+00

Pos logit contributions
 Advertisement+0.598
 Kitt+0.581
bryce+0.574
Reviewer+0.526
cars+0.518
Neg logit contributions
 apologizing-0.259
 recomp-0.240
 challeng-0.240
aign-0.231
 helicop-0.227
Token,
Token ID11
Feature activation+4.50e-02

Pos logit contributions
 apologizing+0.938
 recomp+0.891
 challeng+0.880
aign+0.866
 helicop+0.855
Neg logit contributions
 Advertisement-1.343
 Kitt-1.297
bryce-1.289
Reviewer-1.172
cars-1.136
Token who
Token ID508
Feature activation+2.08e-01

Pos logit contributions
 apologizing+0.034
 recomp+0.032
 challeng+0.032
aign+0.031
 forgiven+0.031
Neg logit contributions
 Advertisement-0.034
 Kitt-0.032
bryce-0.032
Reviewer-0.028
cars-0.028
Token often
Token ID1690
Feature activation-2.59e-01

Pos logit contributions
 apologizing+0.164
 recomp+0.157
 challeng+0.156
aign+0.154
 forgiven+0.152
Neg logit contributions
 Advertisement-0.149
 Kitt-0.142
bryce-0.140
Reviewer-0.124
cars-0.119
Token played
Token ID2826
Feature activation-1.72e-01

Pos logit contributions
 Advertisement+0.186
 Kitt+0.178
bryce+0.175
Reviewer+0.155
cars+0.149
Neg logit contributions
 apologizing-0.203
 recomp-0.194
 challeng-0.193
aign-0.190
 helicop-0.189
Token on
Token ID319
Feature activation-2.49e-01

Pos logit contributions
 Advertisement+0.133
 Kitt+0.128
bryce+0.125
Reviewer+0.111
cars+0.109
Neg logit contributions
 apologizing-0.125
 recomp-0.119
 challeng-0.118
aign-0.116
 forgiven-0.115
Token been
Token ID587
Feature activation-8.93e-02

Pos logit contributions
 Advertisement+0.011
 Kitt+0.010
bryce+0.010
Reviewer+0.009
cars+0.009
Neg logit contributions
 apologizing-0.008
 recomp-0.007
 challeng-0.007
aign-0.007
 forgiven-0.007
Token on
Token ID319
Feature activation+1.50e-01

Pos logit contributions
 Advertisement+0.067
 Kitt+0.065
bryce+0.064
Reviewer+0.056
cars+0.055
Neg logit contributions
 apologizing-0.067
 recomp-0.064
 challeng-0.064
aign-0.062
 forgiven-0.062
Token the
Token ID262
Feature activation+1.67e-01

Pos logit contributions
 apologizing+0.120
 recomp+0.115
 challeng+0.115
aign+0.113
 forgiven+0.112
Neg logit contributions
 Advertisement-0.104
 Kitt-0.099
bryce-0.098
Reviewer-0.085
cars-0.083
Token endangered
Token ID22700
Feature activation-5.89e-02

Pos logit contributions
 apologizing+0.110
 recomp+0.104
 challeng+0.103
aign+0.101
 forgiven+0.100
Neg logit contributions
 Advertisement-0.141
 Kitt-0.136
bryce-0.134
Reviewer-0.120
cars-0.118
Token species
Token ID4693
Feature activation-4.92e-01

Pos logit contributions
 Advertisement+0.040
 Kitt+0.038
bryce+0.037
Reviewer+0.033
cars+0.031
Neg logit contributions
 apologizing-0.049
 recomp-0.047
 challeng-0.046
aign-0.046
 forgiven-0.045
Token list
Token ID1351
Feature activation+1.62e+00

Pos logit contributions
 Advertisement+0.325
 Kitt+0.310
bryce+0.304
Reviewer+0.264
cars+0.256
Neg logit contributions
 apologizing-0.413
 recomp-0.397
 challeng-0.395
aign-0.389
 forgiven-0.386
Token for
Token ID329
Feature activation-2.76e-02

Pos logit contributions
 apologizing+1.059
 recomp+1.001
 challeng+0.991
aign+0.973
 forgiven+0.973
Neg logit contributions
 Advertisement-1.365
 Kitt-1.315
bryce-1.304
Reviewer-1.174
 Carbuncle-1.138
Token decades
Token ID4647
Feature activation+2.33e-01

Pos logit contributions
 Advertisement+0.020
 Kitt+0.019
bryce+0.019
Reviewer+0.017
cars+0.016
Neg logit contributions
 apologizing-0.021
 recomp-0.020
 challeng-0.020
aign-0.020
 forgiven-0.020
Token --
Token ID1377
Feature activation+1.57e-01

Pos logit contributions
 apologizing+0.181
 recomp+0.172
 challeng+0.171
aign+0.168
 forgiven+0.167
Neg logit contributions
 Advertisement-0.169
 Kitt-0.163
bryce-0.160
Reviewer-0.141
 Carbuncle-0.137
Token threatened
Token ID8556
Feature activation+2.63e-01

Pos logit contributions
 apologizing+0.115
 recomp+0.109
 challeng+0.109
 forgiven+0.106
aign+0.106
Neg logit contributions
 Advertisement-0.120
 Kitt-0.115
bryce-0.115
Reviewer-0.101
 Carbuncle-0.098
Token by
Token ID416
Feature activation+2.57e-01

Pos logit contributions
 apologizing+0.204
 recomp+0.195
 challeng+0.194
aign+0.191
 forgiven+0.189
Neg logit contributions
 Advertisement-0.190
 Kitt-0.181
bryce-0.179
Reviewer-0.157
cars-0.153
Token f
Token ID277
Feature activation-5.90e-01

Pos logit contributions
 Advertisement+0.655
 Kitt+0.628
bryce+0.618
Reviewer+0.547
cars+0.532
Neg logit contributions
 apologizing-0.662
 recomp-0.634
 challeng-0.630
aign-0.620
 forgiven-0.613
Tokend
Token ID67
Feature activation+5.52e-01

Pos logit contributions
 Advertisement+0.427
 Kitt+0.409
bryce+0.402
Reviewer+0.355
cars+0.344
Neg logit contributions
 apologizing-0.458
 recomp-0.439
 challeng-0.437
aign-0.430
 forgiven-0.425
Token |
Token ID930
Feature activation-4.29e-01

Pos logit contributions
 apologizing+0.451
 recomp+0.434
 challeng+0.431
aign+0.426
 forgiven+0.421
Neg logit contributions
 Advertisement-0.375
 Kitt-0.358
bryce-0.353
Reviewer-0.308
 Carbuncle-0.298
Token +
Token ID1343
Feature activation+1.17e+00

Pos logit contributions
 Advertisement+0.355
 Kitt+0.341
bryce+0.334
Reviewer+0.299
cars+0.293
Neg logit contributions
 apologizing-0.290
 challeng-0.275
 recomp-0.274
aign-0.270
 helicop-0.267
Token--------------------
Token ID19351
Feature activation+1.81e+00

Pos logit contributions
 apologizing+0.583
 recomp+0.548
aign+0.532
 forgiven+0.516
 challeng+0.513
Neg logit contributions
 Advertisement-1.160
 Kitt-1.130
bryce-1.113
Reviewer-1.021
 Carbuncle-1.005
Token--+
Token ID44785
Feature activation+1.48e+00

Pos logit contributions
 apologizing+0.692
 recomp+0.641
aign+0.609
 forgiven+0.587
 pardon+0.581
Neg logit contributions
 Advertisement-1.995
 Kitt-1.955
bryce-1.937
Reviewer-1.797
cars-1.750
Token |
Token ID930
Feature activation-4.20e-01

Pos logit contributions
 apologizing+1.240
 recomp+1.192
 challeng+1.187
aign+1.170
 forgiven+1.157
Neg logit contributions
 Advertisement-0.982
 Kitt-0.938
bryce-0.923
Reviewer-0.802
cars-0.776
Token [
Token ID685
Feature activation+2.87e-01

Pos logit contributions
 Advertisement+0.340
 Kitt+0.327
bryce+0.321
Reviewer+0.287
cars+0.280
Neg logit contributions
 apologizing-0.289
 challeng-0.275
 recomp-0.275
aign-0.269
 forgiven-0.266
Tokenaddress
Token ID21975
Feature activation+6.11e-01

Pos logit contributions
 apologizing+0.194
 recomp+0.185
 challeng+0.184
aign+0.180
 forgiven+0.178
Neg logit contributions
 Advertisement-0.236
 Kitt-0.227
bryce-0.223
Reviewer-0.200
cars-0.195
Token of
Token ID286
Feature activation+7.61e-02

Pos logit contributions
 apologizing+0.498
 recomp+0.480
 challeng+0.474
aign+0.468
 forgiven+0.463
Neg logit contributions
 Advertisement-0.417
 Kitt-0.400
bryce-0.394
Reviewer-0.344
 Carbuncle-0.332
Token system
Token ID1080
Feature activation+9.71e-01

Pos logit contributions
 apologizing+0.051
 recomp+0.049
 challeng+0.048
aign+0.047
 forgiven+0.047
Neg logit contributions
 Advertisement-0.063
 Kitt-0.061
bryce-0.060
Reviewer-0.054
cars-0.052
Token was
Token ID373
Feature activation-1.65e-01

Pos logit contributions
 apologizing+0.639
 recomp+0.614
 challeng+0.608
aign+0.596
 forgiven+0.594
Neg logit contributions
 Advertisement-0.609
 Kitt-0.581
bryce-0.576
Reviewer-0.510
cars-0.493
Token signed
Token ID4488
Feature activation+4.49e-01

Pos logit contributions
 Advertisement+0.133
 Kitt+0.128
bryce+0.127
Reviewer+0.113
cars+0.110
Neg logit contributions
 apologizing-0.115
 recomp-0.110
 challeng-0.109
aign-0.107
 forgiven-0.106
Token without
Token ID1231
Feature activation+3.93e-01

Pos logit contributions
 apologizing+0.333
 recomp+0.318
 challeng+0.315
aign+0.308
 forgiven+0.307
Neg logit contributions
 Advertisement-0.340
 Kitt-0.326
bryce-0.323
Reviewer-0.287
cars-0.278
Token any
Token ID597
Feature activation+1.76e-01

Pos logit contributions
 apologizing+0.283
 recomp+0.271
 challeng+0.268
aign+0.263
 forgiven+0.259
Neg logit contributions
 Advertisement-0.307
 Kitt-0.294
bryce-0.293
Reviewer-0.261
cars-0.253
Token formal
Token ID8766
Feature activation-7.35e-01

Pos logit contributions
 apologizing+0.123
 recomp+0.118
 challeng+0.116
aign+0.114
 forgiven+0.113
Neg logit contributions
 Advertisement-0.141
 Kitt-0.136
bryce-0.135
Reviewer-0.120
cars-0.117
Token resolution
Token ID6323
Feature activation+1.58e+00

Pos logit contributions
 Advertisement+0.579
 Kitt+0.557
bryce+0.551
Reviewer+0.489
cars+0.478
Neg logit contributions
 apologizing-0.524
 recomp-0.501
 challeng-0.496
aign-0.487
 forgiven-0.481
Token passed
Token ID3804
Feature activation+8.10e-01

Pos logit contributions
 apologizing+1.175
 recomp+1.114
 challeng+1.104
aign+1.086
 forgiven+1.074
Neg logit contributions
 Advertisement-1.195
 Kitt-1.143
bryce-1.139
Reviewer-1.013
cars-0.977
Token by
Token ID416
Feature activation+1.23e-01

Pos logit contributions
 apologizing+0.628
 recomp+0.601
 challeng+0.597
aign+0.588
 forgiven+0.579
Neg logit contributions
 Advertisement-0.586
 Kitt-0.562
bryce-0.555
Reviewer-0.488
cars-0.474
Token the
Token ID262
Feature activation-1.86e-02

Pos logit contributions
 apologizing+0.097
 recomp+0.094
 challeng+0.093
aign+0.092
 forgiven+0.090
Neg logit contributions
 Advertisement-0.086
bryce-0.083
 Kitt-0.082
Reviewer-0.073
cars-0.070
Token Board
Token ID5926
Feature activation+8.66e-01

Pos logit contributions
 Advertisement+0.013
bryce+0.012
 Kitt+0.012
Reviewer+0.011
cars+0.010
Neg logit contributions
 apologizing-0.015
 recomp-0.015
 challeng-0.014
aign-0.014
 forgiven-0.014
Token of
Token ID286
Feature activation+1.69e-01

Pos logit contributions
 apologizing+0.641
 recomp+0.612
 challeng+0.609
aign+0.599
 forgiven+0.592
Neg logit contributions
 Advertisement-0.657
 Kitt-0.631
bryce-0.622
Reviewer-0.551
cars-0.537
Tokeneff
Token ID14822
Feature activation+4.34e-01

Pos logit contributions
 Advertisement+0.224
 Kitt+0.214
bryce+0.210
Reviewer+0.183
cars+0.177
Neg logit contributions
 apologizing-0.280
 recomp-0.268
 challeng-0.268
aign-0.264
 forgiven-0.261
Token.
Token ID13
Feature activation-3.69e-01

Pos logit contributions
 apologizing+0.295
 recomp+0.281
aign+0.276
 challeng+0.274
 forgiven+0.270
Neg logit contributions
 Advertisement-0.354
 Kitt-0.343
bryce-0.337
Reviewer-0.302
 Carbuncle-0.295
Tokenorg
Token ID2398
Feature activation-6.78e-01

Pos logit contributions
 Advertisement+0.296
 Kitt+0.286
bryce+0.280
Reviewer+0.251
 Carbuncle+0.246
Neg logit contributions
 apologizing-0.257
 recomp-0.243
aign-0.243
 forgiven-0.237
 challeng-0.234
Token/
Token ID14
Feature activation-5.80e-01

Pos logit contributions
 Advertisement+0.462
 Kitt+0.445
bryce+0.436
Reviewer+0.382
 Carbuncle+0.369
Neg logit contributions
 apologizing-0.551
 recomp-0.527
aign-0.519
 challeng-0.516
 forgiven-0.512
Tokend
Token ID67
Feature activation+2.09e-01

Pos logit contributions
 Advertisement+0.443
 Kitt+0.424
bryce+0.419
Reviewer+0.369
cars+0.364
Neg logit contributions
 apologizing-0.429
 recomp-0.409
 challeng-0.408
aign-0.400
 forgiven-0.397
Tokenee
Token ID1453
Feature activation+9.98e-01

Pos logit contributions
 apologizing+0.166
 recomp+0.159
 challeng+0.158
aign+0.157
 forgiven+0.154
Neg logit contributions
 Advertisement-0.147
 Kitt-0.141
bryce-0.137
Reviewer-0.121
 Carbuncle-0.118
Tokenpl
Token ID489
Feature activation+6.91e-02

Pos logit contributions
 apologizing+0.622
 recomp+0.588
 challeng+0.584
aign+0.573
 helicop+0.563
Neg logit contributions
 Advertisement-0.874
 Kitt-0.846
bryce-0.832
Reviewer-0.752
cars-0.734
Tokeninks
Token ID2973
Feature activation-8.01e-01

Pos logit contributions
 apologizing+0.072
 challeng+0.070
 recomp+0.070
aign+0.069
 forgiven+0.068
Neg logit contributions
 Advertisement-0.031
 Kitt-0.029
bryce-0.028
Reviewer-0.023
cars-0.022
Token/
Token ID14
Feature activation-5.03e-01

Pos logit contributions
 Advertisement+0.537
 Kitt+0.507
bryce+0.502
Reviewer+0.432
cars+0.425
Neg logit contributions
 apologizing-0.665
 recomp-0.639
 challeng-0.638
 forgiven-0.624
 helicop-0.623
Token2016
Token ID5304
Feature activation-8.68e-02

Pos logit contributions
 Advertisement+0.394
 Kitt+0.379
bryce+0.374
Reviewer+0.332
cars+0.325
Neg logit contributions
 apologizing-0.360
 recomp-0.343
 challeng-0.342
aign-0.335
 forgiven-0.332
Token/
Token ID14
Feature activation-1.12e-01

Pos logit contributions
 Advertisement+0.057
 Kitt+0.054
bryce+0.053
Reviewer+0.046
cars+0.045
Neg logit contributions
 apologizing-0.073
 recomp-0.070
 challeng-0.070
aign-0.069
 forgiven-0.069
Token for
Token ID329
Feature activation-1.76e-02

Pos logit contributions
 apologizing+1.078
 recomp+1.035
 challeng+1.017
 forgiven+1.005
 helicop+0.997
Neg logit contributions
 Advertisement-1.058
 Kitt-1.029
bryce-1.015
Reviewer-0.904
 Carbuncle-0.868
Token al
Token ID435
Feature activation-3.32e-02

Pos logit contributions
 Advertisement+0.014
 Kitt+0.013
bryce+0.013
Reviewer+0.012
cars+0.011
Neg logit contributions
 apologizing-0.013
 recomp-0.012
 challeng-0.012
aign-0.012
 forgiven-0.012
Token-
Token ID12
Feature activation-1.94e-01

Pos logit contributions
 Advertisement+0.025
 Kitt+0.024
bryce+0.024
Reviewer+0.021
cars+0.020
Neg logit contributions
 apologizing-0.025
 recomp-0.024
 challeng-0.024
aign-0.023
 forgiven-0.023
TokenSh
Token ID2484
Feature activation-1.69e-01

Pos logit contributions
 Advertisement+0.132
 Kitt+0.126
bryce+0.124
Reviewer+0.108
cars+0.105
Neg logit contributions
 apologizing-0.159
 recomp-0.153
 challeng-0.152
aign-0.150
 forgiven-0.148
Tokenab
Token ID397
Feature activation+4.96e-01

Pos logit contributions
 Advertisement+0.124
 Kitt+0.119
bryce+0.117
Reviewer+0.103
 Carbuncle+0.100
Neg logit contributions
 apologizing-0.130
 recomp-0.124
 challeng-0.123
aign-0.122
 forgiven-0.121
Tokenab
Token ID397
Feature activation+5.11e-01

Pos logit contributions
 apologizing+0.435
 recomp+0.418
 challeng+0.416
aign+0.413
 forgiven+0.409
Neg logit contributions
 Advertisement-0.308
 Kitt-0.292
bryce-0.284
Reviewer-0.249
 Carbuncle-0.239
Token fighters
Token ID8486
Feature activation+4.39e-02

Pos logit contributions
 apologizing+0.376
 recomp+0.359
 challeng+0.357
aign+0.351
 forgiven+0.347
Neg logit contributions
 Advertisement-0.390
 Kitt-0.374
bryce-0.368
Reviewer-0.327
cars-0.318
Token.
Token ID13
Feature activation-2.28e-01

Pos logit contributions
 apologizing+0.033
 recomp+0.031
 challeng+0.031
aign+0.031
 forgiven+0.030
Neg logit contributions
 Advertisement-0.033
 Kitt-0.032
bryce-0.031
Reviewer-0.028
cars-0.027
Token
Token ID198
Feature activation-2.32e-01

Pos logit contributions
 Advertisement+0.203
 Kitt+0.196
bryce+0.191
Reviewer+0.173
cars+0.170
Neg logit contributions
 apologizing-0.140
 challeng-0.135
 recomp-0.133
 helicop-0.129
aign-0.128
Token
Token ID198
Feature activation-2.87e-01

Pos logit contributions
 Advertisement+0.215
 Kitt+0.209
bryce+0.207
Reviewer+0.188
 Carbuncle+0.184
Neg logit contributions
 apologizing-0.131
 recomp-0.123
aign-0.121
 forgiven-0.118
 reload-0.116
TokenAl
Token ID2348
Feature activation+4.20e-01

Pos logit contributions
 Advertisement+0.200
 Kitt+0.192
bryce+0.188
Reviewer+0.165
 Carbuncle+0.161
Neg logit contributions
 apologizing-0.229
 recomp-0.219
 challeng-0.218
aign-0.216
 forgiven-0.213
Token,
Token ID11
Feature activation-1.01e-01

Pos logit contributions
 Advertisement+0.181
 Kitt+0.173
bryce+0.171
Reviewer+0.150
cars+0.145
Neg logit contributions
 apologizing-0.206
 recomp-0.197
 challeng-0.196
aign-0.193
 forgiven-0.191
Token just
Token ID655
Feature activation+1.59e-01

Pos logit contributions
 Advertisement+0.078
 Kitt+0.075
bryce+0.074
Reviewer+0.066
cars+0.064
Neg logit contributions
 apologizing-0.073
 recomp-0.070
 challeng-0.069
aign-0.068
 forgiven-0.067
Token maybe
Token ID3863
Feature activation-3.27e-01

Pos logit contributions
 apologizing+0.126
 recomp+0.121
 challeng+0.120
aign+0.118
 forgiven+0.117
Neg logit contributions
 Advertisement-0.112
 Kitt-0.107
bryce-0.106
Reviewer-0.093
cars-0.089
Token,
Token ID11
Feature activation-1.00e-01

Pos logit contributions
 Advertisement+0.207
 Kitt+0.197
bryce+0.193
Reviewer+0.167
cars+0.161
Neg logit contributions
 apologizing-0.283
 recomp-0.273
 challeng-0.271
aign-0.267
 forgiven-0.265
Token when
Token ID618
Feature activation+1.00e-01

Pos logit contributions
 Advertisement+0.070
 Kitt+0.067
bryce+0.066
Reviewer+0.058
cars+0.056
Neg logit contributions
 apologizing-0.080
 recomp-0.077
 challeng-0.076
aign-0.075
 forgiven-0.074
Token they
Token ID484
Feature activation-2.35e-02

Pos logit contributions
 apologizing+0.081
 recomp+0.078
 challeng+0.077
aign+0.076
 forgiven+0.075
Neg logit contributions
 Advertisement-0.069
 Kitt-0.066
bryce-0.066
Reviewer-0.058
cars-0.056
Token have
Token ID423
Feature activation-2.06e-01

Pos logit contributions
 Advertisement+0.019
 Kitt+0.018
bryce+0.018
Reviewer+0.016
cars+0.016
Neg logit contributions
 apologizing-0.016
 recomp-0.015
 challeng-0.015
aign-0.015
 forgiven-0.015
Token issues
Token ID2428
Feature activation-2.29e-01

Pos logit contributions
 Advertisement+0.157
 Kitt+0.150
bryce+0.148
Reviewer+0.131
cars+0.128
Neg logit contributions
 apologizing-0.153
 recomp-0.146
 challeng-0.146
aign-0.143
 forgiven-0.142
Token that
Token ID326
Feature activation+1.39e-01

Pos logit contributions
 Advertisement+0.172
 Kitt+0.165
bryce+0.162
Reviewer+0.144
cars+0.140
Neg logit contributions
 apologizing-0.171
 recomp-0.163
 challeng-0.163
aign-0.160
 forgiven-0.158
Token no
Token ID645
Feature activation-1.42e-01

Pos logit contributions
 apologizing+0.102
 recomp+0.098
 challeng+0.097
aign+0.095
 forgiven+0.094
Neg logit contributions
 Advertisement-0.107
 Kitt-0.102
bryce-0.102
Reviewer-0.091
cars-0.088
Token one
Token ID530
Feature activation+2.67e-01

Pos logit contributions
 Advertisement+0.104
 Kitt+0.100
bryce+0.098
Reviewer+0.087
cars+0.084
Neg logit contributions
 apologizing-0.109
 recomp-0.104
 challeng-0.103
aign-0.102
 forgiven-0.101
Token why
Token ID1521
Feature activation+5.43e-01

Pos logit contributions
 Advertisement+0.150
 Kitt+0.144
bryce+0.142
Reviewer+0.126
cars+0.123
Neg logit contributions
 apologizing-0.137
 recomp-0.131
 challeng-0.131
aign-0.128
 forgiven-0.127
Token don
Token ID836
Feature activation-4.07e-01

Pos logit contributions
 apologizing+0.393
 recomp+0.376
 challeng+0.373
aign+0.365
 forgiven+0.363
Neg logit contributions
 Advertisement-0.422
 Kitt-0.404
bryce-0.401
Reviewer-0.357
cars-0.346
Token't
Token ID470
Feature activation-6.90e-01

Pos logit contributions
 Advertisement+0.293
 Kitt+0.283
bryce+0.276
Reviewer+0.245
 Carbuncle+0.238
Neg logit contributions
 apologizing-0.315
 recomp-0.301
aign-0.299
 forgiven-0.293
 reload-0.290
Token you
Token ID345
Feature activation+5.69e-02

Pos logit contributions
 Advertisement+0.451
 Kitt+0.430
bryce+0.422
Reviewer+0.366
cars+0.354
Neg logit contributions
 apologizing-0.584
 recomp-0.561
 challeng-0.559
aign-0.551
 forgiven-0.545
Token figure
Token ID3785
Feature activation-7.86e-01

Pos logit contributions
 apologizing+0.036
 recomp+0.035
 challeng+0.034
aign+0.034
 forgiven+0.033
Neg logit contributions
 Advertisement-0.049
 Kitt-0.047
bryce-0.047
Reviewer-0.042
cars-0.041
Token that
Token ID326
Feature activation-1.95e-02

Pos logit contributions
 Advertisement+0.526
 Kitt+0.503
bryce+0.493
Reviewer+0.429
cars+0.416
Neg logit contributions
 apologizing-0.652
 recomp-0.626
 challeng-0.624
aign-0.614
 forgiven-0.608
Token out
Token ID503
Feature activation-1.61e-01

Pos logit contributions
 Advertisement+0.013
 Kitt+0.013
bryce+0.013
Reviewer+0.011
cars+0.011
Neg logit contributions
 apologizing-0.016
 recomp-0.015
 challeng-0.015
aign-0.015
 forgiven-0.015
Token while
Token ID981
Feature activation-1.12e+00

Pos logit contributions
 Advertisement+0.118
 Kitt+0.113
bryce+0.111
Reviewer+0.098
cars+0.095
Neg logit contributions
 apologizing-0.123
 recomp-0.117
 challeng-0.117
aign-0.115
 forgiven-0.114
Token I
Token ID314
Feature activation-3.42e-02

Pos logit contributions
 Advertisement+0.779
 Kitt+0.747
bryce+0.730
Reviewer+0.638
cars+0.621
Neg logit contributions
 apologizing-0.896
 challeng-0.861
 recomp-0.861
aign-0.849
 helicop-0.838
Token go
Token ID467
Feature activation+9.65e-02

Pos logit contributions
 Advertisement+0.029
 Kitt+0.028
bryce+0.027
Reviewer+0.025
cars+0.024
Neg logit contributions
 apologizing-0.023
 recomp-0.021
 challeng-0.021
aign-0.021
 forgiven-0.021
Token talk
Token ID1561
Feature activation-9.42e-01

Pos logit contributions
 apologizing+0.069
 recomp+0.066
 challeng+0.066
aign+0.064
 forgiven+0.064
Neg logit contributions
 Advertisement-0.076
 Kitt-0.073
bryce-0.071
Reviewer-0.064
cars-0.062
Token the
Token ID262
Feature activation-6.27e-02

Pos logit contributions
 apologizing+0.183
 recomp+0.176
 challeng+0.174
aign+0.171
 forgiven+0.170
Neg logit contributions
 Advertisement-0.170
 Kitt-0.162
bryce-0.161
Reviewer-0.141
cars-0.137
Token optical
Token ID18480
Feature activation-5.41e-01

Pos logit contributions
 Advertisement+0.052
 Kitt+0.050
bryce+0.049
Reviewer+0.044
cars+0.043
Neg logit contributions
 apologizing-0.042
 recomp-0.040
 challeng-0.040
aign-0.039
 forgiven-0.039
Token bench
Token ID7624
Feature activation-1.05e+00

Pos logit contributions
 Advertisement+0.594
 Kitt+0.579
bryce+0.570
Reviewer+0.527
cars+0.518
Neg logit contributions
 apologizing-0.217
 challeng-0.197
 recomp-0.196
aign-0.192
 helicop-0.187
Token says
Token ID1139
Feature activation+1.88e-02

Pos logit contributions
 Advertisement+0.730
 Kitt+0.698
bryce+0.687
Reviewer+0.601
cars+0.582
Neg logit contributions
 apologizing-0.851
 recomp-0.816
 challeng-0.813
aign-0.800
 forgiven-0.792
Token you
Token ID345
Feature activation-1.01e-01

Pos logit contributions
 apologizing+0.014
 recomp+0.013
 challeng+0.013
aign+0.013
 forgiven+0.013
Neg logit contributions
 Advertisement-0.014
 Kitt-0.014
bryce-0.013
Reviewer-0.012
 Carbuncle-0.012
Token could
Token ID714
Feature activation+3.06e-02

Pos logit contributions
 Advertisement+0.072
 Kitt+0.069
bryce+0.068
Reviewer+0.059
cars+0.057
Neg logit contributions
 apologizing-0.080
 recomp-0.077
 challeng-0.076
aign-0.075
 forgiven-0.074
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
 apologizing+0.025
 recomp+0.024
 challeng+0.024
aign+0.023
 forgiven+0.023
Neg logit contributions
 Advertisement-0.021
 Kitt-0.020
bryce-0.020
Reviewer-0.017
cars-0.017
Token
Token ID198
Feature activation+1.42e-01

Pos logit contributions
 Advertisement+0.081
 Kitt+0.078
bryce+0.076
Reviewer+0.068
cars+0.066
Neg logit contributions
 apologizing-0.071
 challeng-0.068
 recomp-0.067
aign-0.066
 helicop-0.065
Token
Token ID198
Feature activation-4.85e-02

Pos logit contributions
 apologizing+0.080
 recomp+0.075
aign+0.074
 forgiven+0.072
 pardon+0.071
Neg logit contributions
 Advertisement-0.132
 Kitt-0.128
bryce-0.127
Reviewer-0.115
 Carbuncle-0.113
TokenBut
Token ID1537
Feature activation+4.03e-02

Pos logit contributions
 Advertisement+0.036
 Kitt+0.034
bryce+0.034
Reviewer+0.030
cars+0.029
Neg logit contributions
 apologizing-0.037
 recomp-0.035
 challeng-0.035
aign-0.035
 forgiven-0.034
Token could
Token ID714
Feature activation+2.75e-01

Pos logit contributions
 apologizing+0.030
 recomp+0.029
 challeng+0.029
aign+0.028
 forgiven+0.028
Neg logit contributions
 Advertisement-0.030
 Kitt-0.029
bryce-0.029
Reviewer-0.025
cars-0.025
Tokenty
Token ID774
Feature activation+4.43e-01

Pos logit contributions
 apologizing+0.307
 recomp+0.287
aign+0.280
 challeng+0.278
 forgiven+0.274
Neg logit contributions
 Advertisement-0.601
 Kitt-0.582
bryce-0.574
Reviewer-0.524
 Carbuncle-0.517
Token to
Token ID284
Feature activation+2.19e-02

Pos logit contributions
 apologizing+0.342
 recomp+0.328
 challeng+0.327
aign+0.321
 forgiven+0.318
Neg logit contributions
 Advertisement-0.321
 Kitt-0.308
bryce-0.302
Reviewer-0.266
cars-0.259
Token order
Token ID1502
Feature activation+5.90e-02

Pos logit contributions
 apologizing+0.014
 challeng+0.013
 recomp+0.013
aign+0.013
 helicop+0.013
Neg logit contributions
 Advertisement-0.019
 Kitt-0.018
bryce-0.018
Reviewer-0.016
 Carbuncle-0.016
Token prosecutors
Token ID11947
Feature activation+4.45e-01

Pos logit contributions
 apologizing+0.045
 recomp+0.043
 challeng+0.043
aign+0.043
 forgiven+0.042
Neg logit contributions
 Advertisement-0.043
 Kitt-0.041
bryce-0.041
Reviewer-0.036
cars-0.035
Token to
Token ID284
Feature activation-2.11e-01

Pos logit contributions
 apologizing+0.340
 recomp+0.325
 challeng+0.324
aign+0.318
 forgiven+0.315
Neg logit contributions
 Advertisement-0.327
 Kitt-0.314
bryce-0.309
Reviewer-0.273
cars-0.265
Token give
Token ID1577
Feature activation-6.88e-01

Pos logit contributions
 Advertisement+0.183
 Kitt+0.177
bryce+0.174
Reviewer+0.156
cars+0.153
Neg logit contributions
 apologizing-0.133
 challeng-0.126
 recomp-0.126
aign-0.123
 forgiven-0.121
Token them
Token ID606
Feature activation+1.45e-01

Pos logit contributions
 Advertisement+0.580
 Kitt+0.560
bryce+0.548
Reviewer+0.493
cars+0.482
Neg logit contributions
 apologizing-0.450
 challeng-0.427
 recomp-0.425
aign-0.418
 helicop-0.412
Token access
Token ID1895
Feature activation+5.41e-01

Pos logit contributions
 apologizing+0.110
 recomp+0.105
 challeng+0.105
aign+0.103
 forgiven+0.102
Neg logit contributions
 Advertisement-0.108
 Kitt-0.104
bryce-0.102
Reviewer-0.090
cars-0.088
Token to
Token ID284
Feature activation+6.64e-02

Pos logit contributions
 apologizing+0.425
 recomp+0.406
 challeng+0.405
aign+0.399
 forgiven+0.394
Neg logit contributions
 Advertisement-0.386
 Kitt-0.370
bryce-0.363
Reviewer-0.319
cars-0.310
Token all
Token ID477
Feature activation-3.36e-01

Pos logit contributions
 apologizing+0.051
 recomp+0.049
 challeng+0.049
aign+0.048
 forgiven+0.047
Neg logit contributions
 Advertisement-0.048
 Kitt-0.046
bryce-0.046
Reviewer-0.040
cars-0.039
Token the
Token ID262
Feature activation+2.43e-02

Pos logit contributions
 Advertisement+0.248
 Kitt+0.238
bryce+0.233
Reviewer+0.206
cars+0.201
Neg logit contributions
 apologizing-0.256
 recomp-0.245
 challeng-0.244
aign-0.240
 forgiven-0.237
Token,
Token ID11
Feature activation-2.38e-01

Pos logit contributions
 apologizing+0.535
 recomp+0.511
 challeng+0.506
aign+0.500
 forgiven+0.495
Neg logit contributions
 Advertisement-0.539
 Kitt-0.520
bryce-0.512
Reviewer-0.455
 Carbuncle-0.440
Token as
Token ID355
Feature activation-1.04e-01

Pos logit contributions
 Advertisement+0.176
 Kitt+0.169
bryce+0.167
Reviewer+0.148
cars+0.143
Neg logit contributions
 apologizing-0.180
 recomp-0.172
 challeng-0.171
aign-0.168
 forgiven-0.167
Token iceberg
Token ID34849
Feature activation-1.25e+00

Pos logit contributions
 Advertisement+0.080
bryce+0.077
 Kitt+0.076
Reviewer+0.067
cars+0.066
Neg logit contributions
 apologizing-0.075
 recomp-0.072
 challeng-0.072
 forgiven-0.070
aign-0.070
Tokens
Token ID82
Feature activation-3.74e-02

Pos logit contributions
 Advertisement+0.877
 Kitt+0.837
bryce+0.824
Reviewer+0.724
cars+0.701
Neg logit contributions
 apologizing-1.002
 recomp-0.963
 challeng-0.957
aign-0.941
 forgiven-0.933
Token fall
Token ID2121
Feature activation-4.01e-01

Pos logit contributions
 Advertisement+0.031
 Kitt+0.029
bryce+0.029
Reviewer+0.026
cars+0.025
Neg logit contributions
 apologizing-0.026
 recomp-0.025
 challeng-0.024
aign-0.024
 forgiven-0.024
Token into
Token ID656
Feature activation-2.07e-01

Pos logit contributions
 Advertisement+0.313
 Kitt+0.300
bryce+0.298
Reviewer+0.266
cars+0.257
Neg logit contributions
 apologizing-0.289
 recomp-0.276
 challeng-0.274
aign-0.267
 forgiven-0.266
Token the
Token ID262
Feature activation-9.92e-02

Pos logit contributions
 Advertisement+0.145
 Kitt+0.139
bryce+0.137
Reviewer+0.120
cars+0.117
Neg logit contributions
 apologizing-0.165
 recomp-0.158
 challeng-0.157
aign-0.154
 forgiven-0.154
Token sea
Token ID5417
Feature activation-7.95e-01

Pos logit contributions
 Advertisement+0.074
 Kitt+0.071
bryce+0.069
Reviewer+0.061
cars+0.060
Neg logit contributions
 apologizing-0.075
 recomp-0.072
 challeng-0.072
aign-0.071
 forgiven-0.070
Token and
Token ID290
Feature activation-9.99e-02

Pos logit contributions
 Advertisement+0.599
 Kitt+0.574
bryce+0.564
Reviewer+0.499
cars+0.487
Neg logit contributions
 apologizing-0.593
 recomp-0.566
 challeng-0.565
aign-0.555
 forgiven-0.549
Token glaciers
Token ID40509
Feature activation+5.37e-03

Pos logit contributions
 Advertisement+0.076
 Kitt+0.072
bryce+0.071
Reviewer+0.064
cars+0.061
Neg logit contributions
 apologizing-0.074
 recomp-0.072
 challeng-0.071
aign-0.069
 forgiven-0.069
Token retreat
Token ID13703
Feature activation-1.81e+00

Pos logit contributions
 apologizing+0.004
 recomp+0.003
 challeng+0.003
aign+0.003
 forgiven+0.003
Neg logit contributions
 Advertisement-0.004
 Kitt-0.004
bryce-0.004
Reviewer-0.004
cars-0.004
Token Crist
Token ID24568
Feature activation+1.61e+00

Pos logit contributions
 apologizing+0.052
 recomp+0.050
 challeng+0.050
aign+0.049
 forgiven+0.049
Neg logit contributions
 Advertisement-0.041
 Kitt-0.039
bryce-0.038
Reviewer-0.033
cars-0.032
Tokenina
Token ID1437
Feature activation-3.35e-02

Pos logit contributions
 apologizing+1.176
 recomp+1.113
aign+1.112
 challeng+1.103
 helicop+1.082
Neg logit contributions
 Advertisement-1.219
 Kitt-1.177
bryce-1.172
Reviewer-1.057
 Carbuncle-1.019
Token Fern
Token ID38982
Feature activation-6.09e-01

Pos logit contributions
 Advertisement+0.020
 Kitt+0.019
bryce+0.019
Reviewer+0.016
cars+0.016
Neg logit contributions
 apologizing-0.030
 recomp-0.029
 challeng-0.029
aign-0.028
 forgiven-0.028
Tokená
Token ID6557
Feature activation-6.70e-01

Pos logit contributions
 Advertisement+0.407
 Kitt+0.391
bryce+0.377
Reviewer+0.331
cars+0.319
Neg logit contributions
 apologizing-0.504
 challeng-0.491
 recomp-0.485
 helicop-0.473
 forgiven-0.473
Tokennd
Token ID358
Feature activation+6.37e-01

Pos logit contributions
 Advertisement+0.487
 Kitt+0.467
bryce+0.459
Reviewer+0.404
cars+0.393
Neg logit contributions
 apologizing-0.518
 recomp-0.495
 challeng-0.494
aign-0.486
 forgiven-0.480
Tokenez
Token ID8471
Feature activation-3.15e-01

Pos logit contributions
 apologizing+0.399
 challeng+0.380
 recomp+0.378
aign+0.367
 helicop+0.364
Neg logit contributions
 Advertisement-0.557
 Kitt-0.537
bryce-0.527
Reviewer-0.476
cars-0.466
Token de
Token ID390
Feature activation+4.94e-01

Pos logit contributions
 Advertisement+0.267
 Kitt+0.258
bryce+0.253
Reviewer+0.227
 Carbuncle+0.222
Neg logit contributions
 apologizing-0.205
 challeng-0.196
 recomp-0.196
aign-0.191
 helicop-0.188
Token Kir
Token ID7385
Feature activation-6.15e-01

Pos logit contributions
 apologizing+0.452
 challeng+0.437
 recomp+0.437
aign+0.430
 helicop+0.425
Neg logit contributions
 Advertisement-0.289
 Kitt-0.274
bryce-0.266
Reviewer-0.226
cars-0.218
Tokench
Token ID354
Feature activation+1.59e+00

Pos logit contributions
 Advertisement+0.446
 Kitt+0.425
bryce+0.418
Reviewer+0.372
 Carbuncle+0.358
Neg logit contributions
 apologizing-0.477
 recomp-0.458
 challeng-0.450
aign-0.450
 forgiven-0.444
Tokenner
Token ID1008
Feature activation+4.21e-01

Pos logit contributions
 apologizing+0.862
 recomp+0.816
aign+0.803
 forgiven+0.774
 apologized+0.765
Neg logit contributions
 Advertisement-1.529
 Kitt-1.471
bryce-1.456
Reviewer-1.347
 Carbuncle-1.306
Token
Token ID447
Feature activation-2.85e-02

Pos logit contributions
 apologizing+0.329
 recomp+0.315
 challeng+0.314
aign+0.309
 helicop+0.306
Neg logit contributions
 Advertisement-0.303
 Kitt-0.289
bryce-0.284
Reviewer-0.250
cars-0.243
Token Dallas
Token ID8533
Feature activation-4.50e-01

Pos logit contributions
 Advertisement+0.025
 Kitt+0.024
bryce+0.024
Reviewer+0.021
cars+0.020
Neg logit contributions
 apologizing-0.023
 recomp-0.021
 challeng-0.021
aign-0.021
 forgiven-0.021
Token,
Token ID11
Feature activation+8.68e-02

Pos logit contributions
 Advertisement+0.279
 Kitt+0.266
bryce+0.260
Reviewer+0.224
cars+0.216
Neg logit contributions
 apologizing-0.396
 recomp-0.380
 challeng-0.379
aign-0.374
 forgiven-0.370
Token Palm
Token ID18358
Feature activation+7.61e-01

Pos logit contributions
 apologizing+0.061
 recomp+0.058
 challeng+0.057
aign+0.056
 forgiven+0.055
Neg logit contributions
 Advertisement-0.069
 Kitt-0.066
bryce-0.066
Reviewer-0.059
cars-0.057
Token Beach
Token ID8511
Feature activation-3.48e-01

Pos logit contributions
 apologizing+0.538
 recomp+0.513
 challeng+0.507
aign+0.502
 forgiven+0.499
Neg logit contributions
 Advertisement-0.599
 Kitt-0.577
bryce-0.574
Reviewer-0.515
cars-0.496
Token,
Token ID11
Feature activation+1.08e-01

Pos logit contributions
 Advertisement+0.222
 Kitt+0.212
bryce+0.208
Reviewer+0.179
cars+0.173
Neg logit contributions
 apologizing-0.300
 recomp-0.288
 challeng-0.287
aign-0.283
 forgiven-0.280
Token Florida
Token ID4744
Feature activation-4.51e-01

Pos logit contributions
 apologizing+0.076
 recomp+0.072
 challeng+0.072
aign+0.071
 forgiven+0.070
Neg logit contributions
 Advertisement-0.086
 Kitt-0.082
bryce-0.082
Reviewer-0.073
cars-0.071
Token,
Token ID11
Feature activation+9.62e-02

Pos logit contributions
 Advertisement+0.282
 Kitt+0.269
bryce+0.263
Reviewer+0.226
cars+0.219
Neg logit contributions
 apologizing-0.394
 recomp-0.379
 challeng-0.378
aign-0.373
 forgiven-0.369
Token Moreno
Token ID43269
Feature activation-7.55e-01

Pos logit contributions
 apologizing+0.068
 recomp+0.065
 challeng+0.064
aign+0.063
 forgiven+0.062
Neg logit contributions
 Advertisement-0.076
 Kitt-0.073
bryce-0.073
Reviewer-0.065
cars-0.063
Token Valley
Token ID6916
Feature activation+3.68e-01

Pos logit contributions
 Advertisement+0.463
 Kitt+0.439
bryce+0.437
Reviewer+0.377
cars+0.359
Neg logit contributions
 apologizing-0.668
 recomp-0.642
 challeng-0.639
aign-0.632
 forgiven-0.627
Token,
Token ID11
Feature activation+6.38e-02

Pos logit contributions
 apologizing+0.319
 recomp+0.307
 challeng+0.305
aign+0.302
 forgiven+0.299
Neg logit contributions
 Advertisement-0.232
 Kitt-0.221
bryce-0.218
Reviewer-0.188
 Carbuncle-0.181
Token California
Token ID3442
Feature activation-6.40e-01

Pos logit contributions
 apologizing+0.048
 recomp+0.046
 challeng+0.046
aign+0.045
 forgiven+0.045
Neg logit contributions
 Advertisement-0.047
 Kitt-0.045
bryce-0.045
Reviewer-0.040
cars-0.039
Token a
Token ID257
Feature activation-1.15e-01

Pos logit contributions
 Advertisement+0.046
 Kitt+0.044
bryce+0.044
Reviewer+0.038
cars+0.037
Neg logit contributions
 apologizing-0.048
 recomp-0.046
 challeng-0.046
aign-0.045
 forgiven-0.045
Token series
Token ID2168
Feature activation+1.05e-01

Pos logit contributions
 Advertisement+0.099
 Kitt+0.096
bryce+0.094
Reviewer+0.085
cars+0.083
Neg logit contributions
 apologizing-0.073
 recomp-0.069
 challeng-0.069
aign-0.068
 forgiven-0.067
Token which
Token ID543
Feature activation+8.60e-02

Pos logit contributions
 apologizing+0.078
 recomp+0.074
 challeng+0.074
aign+0.073
 forgiven+0.072
Neg logit contributions
 Advertisement-0.080
 Kitt-0.077
bryce-0.076
Reviewer-0.067
cars-0.065
Token begins
Token ID6140
Feature activation+1.10e+00

Pos logit contributions
 apologizing+0.071
 recomp+0.068
 challeng+0.068
aign+0.066
 forgiven+0.066
Neg logit contributions
 Advertisement-0.058
 Kitt-0.055
bryce-0.055
Reviewer-0.048
cars-0.046
Token with
Token ID351
Feature activation-4.73e-02

Pos logit contributions
 apologizing+0.790
 recomp+0.750
 challeng+0.748
aign+0.732
 forgiven+0.724
Neg logit contributions
 Advertisement-0.859
 Kitt-0.825
bryce-0.817
Reviewer-0.727
cars-0.706
Token the
Token ID262
Feature activation-2.05e-01

Pos logit contributions
 Advertisement+0.035
 Kitt+0.033
bryce+0.033
Reviewer+0.029
cars+0.028
Neg logit contributions
 apologizing-0.036
 recomp-0.035
 challeng-0.034
aign-0.034
 forgiven-0.034
Token low
Token ID1877
Feature activation-9.03e-01

Pos logit contributions
 Advertisement+0.169
 Kitt+0.163
bryce+0.159
Reviewer+0.143
cars+0.139
Neg logit contributions
 apologizing-0.139
 recomp-0.132
 challeng-0.132
aign-0.130
 forgiven-0.127
Token-
Token ID12
Feature activation+3.65e-01

Pos logit contributions
 Advertisement+0.791
 Kitt+0.762
bryce+0.746
Reviewer+0.677
cars+0.665
Neg logit contributions
 apologizing-0.562
 challeng-0.531
 recomp-0.530
aign-0.520
 forgiven-0.514
Tokenkick
Token ID24585
Feature activation-6.34e-01

Pos logit contributions
 apologizing+0.246
 recomp+0.234
 challeng+0.233
aign+0.228
 forgiven+0.226
Neg logit contributions
 Advertisement-0.301
 Kitt-0.290
bryce-0.286
Reviewer-0.256
cars-0.250
Token specialist
Token ID15670
Feature activation-5.68e-01

Pos logit contributions
 Advertisement+0.507
 Kitt+0.488
bryce+0.477
Reviewer+0.426
cars+0.418
Neg logit contributions
 apologizing-0.443
 recomp-0.421
 challeng-0.420
aign-0.412
 forgiven-0.409
Token,
Token ID11
Feature activation-2.89e-01

Pos logit contributions
 Advertisement+0.426
 Kitt+0.410
bryce+0.400
Reviewer+0.352
 Carbuncle+0.345
Neg logit contributions
 apologizing-0.426
 recomp-0.407
 challeng-0.405
aign-0.400
 forgiven-0.395
Token,
Token ID11
Feature activation+1.91e-01

Pos logit contributions
 apologizing+0.862
 recomp+0.827
 challeng+0.825
aign+0.812
 forgiven+0.805
Neg logit contributions
 Advertisement-0.628
 Kitt-0.598
bryce-0.587
Reviewer-0.506
cars-0.488
Token that
Token ID326
Feature activation+1.73e-01

Pos logit contributions
 apologizing+0.157
 recomp+0.150
 challeng+0.150
aign+0.147
 forgiven+0.146
Neg logit contributions
 Advertisement-0.130
 Kitt-0.124
bryce-0.122
Reviewer-0.107
cars-0.103
Token used
Token ID973
Feature activation+6.25e-01

Pos logit contributions
 apologizing+0.124
 recomp+0.118
 challeng+0.118
aign+0.115
 forgiven+0.115
Neg logit contributions
 Advertisement-0.135
 Kitt-0.130
bryce-0.129
Reviewer-0.115
cars-0.111
Token Sar
Token ID6866
Feature activation-2.66e-02

Pos logit contributions
 apologizing+0.389
 recomp+0.368
 challeng+0.365
aign+0.357
 forgiven+0.354
Neg logit contributions
 Advertisement-0.548
 Kitt-0.529
bryce-0.523
Reviewer-0.472
cars-0.460
Tokenin
Token ID259
Feature activation-2.98e-01

Pos logit contributions
 Advertisement+0.016
 Kitt+0.015
bryce+0.015
Reviewer+0.013
cars+0.012
Neg logit contributions
 apologizing-0.024
 recomp-0.023
 challeng-0.023
aign-0.022
 forgiven-0.022
Token nerve
Token ID16384
Feature activation-2.00e+00

Pos logit contributions
 Advertisement+0.233
 Kitt+0.223
bryce+0.218
Reviewer+0.194
cars+0.191
Neg logit contributions
 apologizing-0.215
 challeng-0.205
 recomp-0.204
aign-0.201
 forgiven-0.199
Token gas
Token ID3623
Feature activation+1.60e-01

Pos logit contributions
 Advertisement+1.813
 Kitt+1.737
bryce+1.693
Reviewer+1.551
cars+1.535
Neg logit contributions
 apologizing-1.180
 challeng-1.131
 recomp-1.113
aign-1.110
 helicop-1.079
Token during
Token ID1141
Feature activation+7.90e-01

Pos logit contributions
 apologizing+0.120
 recomp+0.114
 challeng+0.114
aign+0.112
 forgiven+0.111
Neg logit contributions
 Advertisement-0.120
 Kitt-0.115
bryce-0.112
Reviewer-0.099
cars-0.097
Token a
Token ID257
Feature activation+1.18e-01

Pos logit contributions
 apologizing+0.624
 recomp+0.597
 challeng+0.595
aign+0.586
 forgiven+0.580
Neg logit contributions
 Advertisement-0.561
 Kitt-0.537
bryce-0.528
Reviewer-0.464
cars-0.450
Token recent
Token ID2274
Feature activation+3.92e-01

Pos logit contributions
 apologizing+0.078
 challeng+0.074
 recomp+0.074
aign+0.072
 forgiven+0.071
Neg logit contributions
 Advertisement-0.099
 Kitt-0.095
bryce-0.093
Reviewer-0.083
cars-0.082
Token incident
Token ID4519
Feature activation-1.85e-01

Pos logit contributions
 apologizing+0.232
 challeng+0.220
 recomp+0.219
aign+0.214
 forgiven+0.211
Neg logit contributions
 Advertisement-0.354
 Kitt-0.343
bryce-0.336
Reviewer-0.304
cars-0.299
Token you
Token ID345
Feature activation+6.40e-03

Pos logit contributions
 apologizing+0.623
 recomp+0.596
 challeng+0.589
aign+0.579
 forgiven+0.577
Neg logit contributions
 Advertisement-0.560
 Kitt-0.537
bryce-0.531
Reviewer-0.468
 Carbuncle-0.451
Token to
Token ID284
Feature activation-3.14e-01

Pos logit contributions
 apologizing+0.005
 challeng+0.005
 recomp+0.005
aign+0.004
 forgiven+0.004
Neg logit contributions
 Advertisement-0.005
 Kitt-0.005
bryce-0.005
Reviewer-0.004
cars-0.004
Token leave
Token ID2666
Feature activation+3.45e-01

Pos logit contributions
 Advertisement+0.263
 Kitt+0.254
bryce+0.249
Reviewer+0.223
cars+0.219
Neg logit contributions
 apologizing-0.207
 challeng-0.197
 recomp-0.197
aign-0.194
 forgiven-0.190
Token this
Token ID428
Feature activation-3.95e-02

Pos logit contributions
 apologizing+0.254
 challeng+0.243
 recomp+0.243
aign+0.239
 forgiven+0.236
Neg logit contributions
 Advertisement-0.262
 Kitt-0.252
bryce-0.246
Reviewer-0.218
cars-0.213
Token here
Token ID994
Feature activation+2.18e-01

Pos logit contributions
 Advertisement+0.030
 Kitt+0.029
bryce+0.028
Reviewer+0.025
cars+0.025
Neg logit contributions
 apologizing-0.029
 recomp-0.028
 challeng-0.028
aign-0.027
 helicop-0.027
Token store
Token ID3650
Feature activation-1.76e+00

Pos logit contributions
 apologizing+0.162
 challeng+0.156
 recomp+0.156
aign+0.152
 forgiven+0.151
Neg logit contributions
 Advertisement-0.164
 Kitt-0.157
bryce-0.154
Reviewer-0.136
cars-0.133
Token,
Token ID11
Feature activation-1.57e-01

Pos logit contributions
 Advertisement+1.331
 Kitt+1.267
bryce+1.238
Reviewer+1.087
cars+1.084
Neg logit contributions
 apologizing-1.298
 challeng-1.262
 recomp-1.254
aign-1.225
 helicop-1.216
Token so
Token ID523
Feature activation-7.86e-02

Pos logit contributions
 Advertisement+0.129
 Kitt+0.124
bryce+0.121
Reviewer+0.108
cars+0.107
Neg logit contributions
 apologizing-0.107
 challeng-0.102
 recomp-0.102
aign-0.100
 helicop-0.098
Token a
Token ID257
Feature activation-1.37e-01

Pos logit contributions
 Advertisement+0.064
 Kitt+0.062
bryce+0.061
Reviewer+0.054
cars+0.053
Neg logit contributions
 apologizing-0.054
 recomp-0.051
 challeng-0.051
aign-0.050
 forgiven-0.049
Token good
Token ID922
Feature activation-3.08e-02

Pos logit contributions
 Advertisement+0.107
 Kitt+0.103
bryce+0.101
Reviewer+0.090
cars+0.088
Neg logit contributions
 apologizing-0.098
 challeng-0.093
 recomp-0.093
aign-0.092
 forgiven-0.091
Token Christian
Token ID4302
Feature activation+1.26e+00

Pos logit contributions
 Advertisement+0.025
 Kitt+0.024
bryce+0.023
Reviewer+0.021
cars+0.020
Neg logit contributions
 apologizing-0.021
 recomp-0.020
 challeng-0.020
aign-0.020
 forgiven-0.020
Tokenson
Token ID1559
Feature activation-4.89e-01

Pos logit contributions
 Advertisement+0.187
 Kitt+0.180
bryce+0.178
Reviewer+0.159
 Carbuncle+0.155
Neg logit contributions
 apologizing-0.161
 recomp-0.153
 challeng-0.152
aign-0.150
 forgiven-0.148
Token,
Token ID11
Feature activation-4.61e-02

Pos logit contributions
 Advertisement+0.313
 Kitt+0.298
bryce+0.293
Reviewer+0.253
cars+0.244
Neg logit contributions
 apologizing-0.421
 recomp-0.404
 challeng-0.403
aign-0.397
 forgiven-0.393
Token told
Token ID1297
Feature activation+3.88e-01

Pos logit contributions
 Advertisement+0.036
 Kitt+0.035
bryce+0.034
Reviewer+0.030
cars+0.030
Neg logit contributions
 apologizing-0.033
 recomp-0.032
 challeng-0.031
aign-0.031
 forgiven-0.031
Token the
Token ID262
Feature activation-7.47e-02

Pos logit contributions
 apologizing+0.289
 recomp+0.276
 challeng+0.275
aign+0.270
 forgiven+0.267
Neg logit contributions
 Advertisement-0.292
 Kitt-0.280
bryce-0.276
Reviewer-0.244
cars-0.238
Token Tampa
Token ID15528
Feature activation+5.62e-02

Pos logit contributions
 Advertisement+0.063
 Kitt+0.061
bryce+0.060
Reviewer+0.054
cars+0.052
Neg logit contributions
 apologizing-0.049
 recomp-0.047
 challeng-0.047
aign-0.046
 forgiven-0.045
Token Bay
Token ID4696
Feature activation-1.79e+00

Pos logit contributions
 apologizing+0.042
 recomp+0.040
 challeng+0.040
aign+0.039
 forgiven+0.039
Neg logit contributions
 Advertisement-0.043
 Kitt-0.041
bryce-0.040
Reviewer-0.035
cars-0.034
Token Times
Token ID3782
Feature activation-6.30e-01

Pos logit contributions
 Advertisement+1.350
 Kitt+1.305
bryce+1.251
Reviewer+1.110
 Carbuncle+1.090
Neg logit contributions
 apologizing-1.340
 recomp-1.280
 challeng-1.268
aign-1.249
 helicop-1.233
Token he
Token ID339
Feature activation-9.34e-02

Pos logit contributions
 Advertisement+0.464
 Kitt+0.445
bryce+0.436
Reviewer+0.384
cars+0.374
Neg logit contributions
 apologizing-0.481
 recomp-0.461
 challeng-0.461
aign-0.451
 forgiven-0.447
Token "
Token ID366
Feature activation-7.74e-02

Pos logit contributions
 Advertisement+0.068
 Kitt+0.065
bryce+0.064
Reviewer+0.057
cars+0.055
Neg logit contributions
 apologizing-0.073
 recomp-0.069
 challeng-0.069
aign-0.068
 forgiven-0.067
Tokencould
Token ID24089
Feature activation+2.26e-02

Pos logit contributions
 Advertisement+0.058
 Kitt+0.056
bryce+0.055
Reviewer+0.049
 Carbuncle+0.047
Neg logit contributions
 apologizing-0.058
 recomp-0.055
 challeng-0.055
aign-0.054
 forgiven-0.053
Tokenn
Token ID77
Feature activation+2.85e-01

Pos logit contributions
 apologizing+0.018
 recomp+0.017
 challeng+0.017
aign+0.017
 forgiven+0.017
Neg logit contributions
 Advertisement-0.016
 Kitt-0.015
bryce-0.015
Reviewer-0.013
cars-0.013
Token School
Token ID3961
Feature activation+9.39e-01

Pos logit contributions
 apologizing+0.523
 recomp+0.498
 challeng+0.494
aign+0.489
 forgiven+0.484
Neg logit contributions
 Advertisement-0.502
 Kitt-0.481
bryce-0.479
Reviewer-0.424
 Carbuncle-0.408
Token Should
Token ID10358
Feature activation+4.60e-01

Pos logit contributions
 apologizing+0.724
 recomp+0.691
 challeng+0.685
aign+0.682
 forgiven+0.672
Neg logit contributions
 Advertisement-0.676
 Kitt-0.647
bryce-0.641
Reviewer-0.569
 Carbuncle-0.547
Token Start
Token ID7253
Feature activation+3.37e-01

Pos logit contributions
 apologizing+0.350
 recomp+0.335
 challeng+0.334
aign+0.328
 forgiven+0.324
Neg logit contributions
 Advertisement-0.340
 Kitt-0.326
bryce-0.320
Reviewer-0.283
cars-0.275
Token Later
Token ID11450
Feature activation-1.72e-01

Pos logit contributions
 apologizing+0.249
 recomp+0.236
 challeng+0.235
aign+0.231
 forgiven+0.229
Neg logit contributions
 Advertisement-0.256
 Kitt-0.246
bryce-0.243
Reviewer-0.217
cars-0.210
Token For
Token ID1114
Feature activation+3.16e-01

Pos logit contributions
 Advertisement+0.139
 Kitt+0.134
bryce+0.132
Reviewer+0.118
cars+0.115
Neg logit contributions
 apologizing-0.119
 recomp-0.113
 challeng-0.112
aign-0.111
 forgiven-0.109
Token Te
Token ID1665
Feature activation-1.70e+00

Pos logit contributions
 apologizing+0.237
 recomp+0.226
 challeng+0.225
aign+0.222
 forgiven+0.219
Neg logit contributions
 Advertisement-0.236
 Kitt-0.227
bryce-0.224
Reviewer-0.199
cars-0.192
Tokenens
Token ID641
Feature activation-3.40e-02

Pos logit contributions
 Advertisement+1.188
 Kitt+1.137
bryce+1.113
Reviewer+0.968
cars+0.951
Neg logit contributions
 apologizing-1.364
 recomp-1.307
 challeng-1.306
aign-1.275
 helicop-1.269
Token'
Token ID6
Feature activation+1.32e+00

Pos logit contributions
 Advertisement+0.027
 Kitt+0.026
bryce+0.026
Reviewer+0.023
cars+0.023
Neg logit contributions
 apologizing-0.024
 recomp-0.022
 challeng-0.022
aign-0.022
 forgiven-0.022
Token Health
Token ID3893
Feature activation+3.56e-01

Pos logit contributions
 apologizing+0.995
 recomp+0.948
 challeng+0.938
aign+0.932
 forgiven+0.924
Neg logit contributions
 Advertisement-0.961
bryce-0.926
 Kitt-0.922
Reviewer-0.821
cars-0.788
Token
Token ID198
Feature activation+2.44e-02

Pos logit contributions
 apologizing+0.230
 recomp+0.218
 challeng+0.217
aign+0.214
 forgiven+0.210
Neg logit contributions
 Advertisement-0.301
 Kitt-0.291
bryce-0.288
Reviewer-0.258
cars-0.252
Token
Token ID198
Feature activation-5.91e-02

Pos logit contributions
 apologizing+0.014
 recomp+0.013
aign+0.013
 forgiven+0.012
 apologized+0.012
Neg logit contributions
 Advertisement-0.022
 Kitt-0.022
bryce-0.022
Reviewer-0.020
 Carbuncle-0.019
Token you
Token ID345
Feature activation-1.40e-01

Pos logit contributions
 apologizing+0.244
 recomp+0.234
 challeng+0.232
aign+0.228
 forgiven+0.227
Neg logit contributions
 Advertisement-0.198
 Kitt-0.189
bryce-0.188
Reviewer-0.163
cars-0.157
Token could
Token ID714
Feature activation+3.03e-01

Pos logit contributions
 Advertisement+0.108
 Kitt+0.104
bryce+0.102
Reviewer+0.090
cars+0.088
Neg logit contributions
 apologizing-0.102
 challeng-0.097
 recomp-0.097
aign-0.095
 forgiven-0.094
Token text
Token ID2420
Feature activation-8.75e-01

Pos logit contributions
 apologizing+0.194
 recomp+0.185
 challeng+0.183
aign+0.178
 forgiven+0.177
Neg logit contributions
 Advertisement-0.261
 Kitt-0.252
bryce-0.249
Reviewer-0.224
cars-0.218
Token message
Token ID3275
Feature activation-4.07e-01

Pos logit contributions
 Advertisement+0.610
 Kitt+0.584
bryce+0.574
Reviewer+0.503
cars+0.487
Neg logit contributions
 apologizing-0.702
 recomp-0.672
 challeng-0.670
aign-0.658
 forgiven-0.653
Token a
Token ID257
Feature activation+4.75e-02

Pos logit contributions
 Advertisement+0.294
 Kitt+0.281
bryce+0.278
Reviewer+0.245
cars+0.237
Neg logit contributions
 apologizing-0.316
 recomp-0.302
 challeng-0.301
aign-0.296
 forgiven-0.293
Token store
Token ID3650
Feature activation-1.70e+00

Pos logit contributions
 apologizing+0.034
 recomp+0.032
 challeng+0.032
aign+0.032
 forgiven+0.031
Neg logit contributions
 Advertisement-0.037
 Kitt-0.036
bryce-0.035
Reviewer-0.031
cars-0.031
Token instead
Token ID2427
Feature activation-2.98e-01

Pos logit contributions
 Advertisement+1.246
 Kitt+1.191
bryce+1.167
Reviewer+1.028
cars+1.003
Neg logit contributions
 apologizing-1.299
 recomp-1.248
 challeng-1.246
aign-1.225
 forgiven-1.211
Token of
Token ID286
Feature activation-9.48e-03

Pos logit contributions
 Advertisement+0.213
 Kitt+0.203
bryce+0.199
Reviewer+0.175
cars+0.170
Neg logit contributions
 apologizing-0.234
 challeng-0.225
 recomp-0.225
aign-0.221
 forgiven-0.218
Token calling
Token ID4585
Feature activation-5.73e-01

Pos logit contributions
 Advertisement+0.008
 Kitt+0.008
bryce+0.008
Reviewer+0.007
cars+0.007
Neg logit contributions
 apologizing-0.006
 recomp-0.006
 challeng-0.006
aign-0.006
 forgiven-0.006
Token them
Token ID606
Feature activation+3.83e-02

Pos logit contributions
 Advertisement+0.420
 Kitt+0.403
bryce+0.396
Reviewer+0.349
cars+0.340
Neg logit contributions
 apologizing-0.439
 recomp-0.419
 challeng-0.418
aign-0.411
 forgiven-0.407
Token?
Token ID30
Feature activation-9.36e-02

Pos logit contributions
 apologizing+0.029
 recomp+0.027
 challeng+0.027
aign+0.027
 forgiven+0.026
Neg logit contributions
 Advertisement-0.029
 Kitt-0.028
bryce-0.027
Reviewer-0.024
cars-0.024