TOP ACTIVATIONS
MAX = 0.330

a 3 - year - old boy who had a new
was three - years - old . She had a lot
a 3 - year - old little boy . He had
her three - year - old daughter went to the park
a three - year - old who found a chair .
The 3 - year - old thought hard and smiled .
are angry . They sc old Lily and Ben . They
was three - years - old and she loved to explore
little 3 - year - old boy . He asked the
tiny three - year - old girl came across the letter
was three - years - old . She wanted to go
little three - year - old daughter was cl apping and
a three - year - old boy . He liked to
from him . They sc old him . They tell him
a three - year - old boy decided to try it
The three - year - old girl was fascinated by the
a three - year - old girl . She had been
average three - year - old boy . He loved playing
her 3 - year - old daughter decided to take a
but she didn 't sc old her again . She simply

MIN ACTIVATIONS
MIN = -0.276

he waved one of his antenna e . The
pale yellow bug with long antenna e . Lucy smiled and
The butterfly thought the waving antenna e was so funny that
ground . He waved his antenna e and made some special
and waved one of its antenna e . Bob bie waved
had wings and legs and antenna e and eyes . I
see its eyes , its antenna e , and its scales
Bob w igg led his antenna e and flew away .
sweet tune , a mis chie vous fox appeared . "
happy . It waved its antenna e . "
But then , a mis chie vous rabbit came and started
her to grab onto her antenna e . The different butterfly
The little boy smiled mis chie v ously and said ,
<|endoftext|> Peter was a mis chie vous 3 year old who
on the sidewalk and its antenna e were waving in the
and replied with a mis chie vous gl int in his
he especially liked the or nam ents he kept around .
who liked to collect or nam ents . Everywhere she went
never played with the or nam ents again . They still
always acted silly and mis chie vous . One

INTERVAL 0.179 - 0.330
CONTAINS 0.120%

their mom and to be more careful and respectful next time
getting wet and we need more hay ." So
going home and to learn more about the village . <|endoftext|>
him . " Here , take this . It
a three - year - old boy was very excited .

INTERVAL 0.027 - 0.179
CONTAINS 25.555%

. He wanted to go home to the ocean .
and said , " I love you too , mom ."
Lily on the swing and pushed her gently . Lily smiled
went in for dinner . He had had a really lovely
had created a lake of chocolate inside her chamber . All

INTERVAL -0.124 - 0.027
CONTAINS 72.794%

right thing , even if it 's hard . And remember
lived on a small farm . His farm had lots of
The lawyer was still angry and said , "
happy to see Jill that she smiled and asked her to
his work . It was a very big blanket !

INTERVAL -0.276 - -0.124
CONTAINS 1.531%

From then on , Splash lived in his
and they ran away happily ever after . <|endoftext|> One day
a pretty rainbow . He had never seen anything so beautiful
friends . He wished he had gone to her party .
spinning top . As she tw irled faster and faster ,
Token a
Token ID257
Feature activation+4.29e-02

Pos logit contributions
old+0.025
 safer+0.024
 better+0.023
 Doctor+0.023
 cleaner+0.023
Neg logit contributions
 antenna-0.033
rys-0.029
 stub-0.029
ho-0.028
yt-0.028
Token 3
Token ID513
Feature activation-4.35e-02

Pos logit contributions
old+0.060
 Doctor+0.059
 better+0.056
 safer+0.056
 doctor+0.054
Neg logit contributions
 antenna-0.085
rys-0.080
chie-0.076
 stub-0.075
rac-0.075
Token-
Token ID12
Feature activation-2.19e-02

Pos logit contributions
 antenna+0.071
rys+0.066
 stub+0.062
 magn+0.060
ifying+0.060
Neg logit contributions
old-0.084
 Doctor-0.076
 better-0.071
 doctor-0.070
 safer-0.070
Tokenyear
Token ID1941
Feature activation-1.39e-02

Pos logit contributions
 antenna+0.037
rys+0.034
 stub+0.033
chie+0.033
 magn+0.031
Neg logit contributions
old-0.037
 Doctor-0.036
 safer-0.034
 better-0.034
 doctor-0.033
Token-
Token ID12
Feature activation-3.54e-02

Pos logit contributions
 antenna+0.027
rys+0.025
 stub+0.024
 magn+0.023
chie+0.023
Neg logit contributions
old-0.022
 Doctor-0.020
 safer-0.019
 better-0.019
 doctor-0.018
Tokenold
Token ID727
Feature activation+2.55e-01

Pos logit contributions
 antenna+0.091
chie+0.086
rys+0.084
 stub+0.083
Lou+0.082
Neg logit contributions
 Doctor-0.026
 safer-0.026
 better-0.023
old-0.023
 doctor-0.021
Token boy
Token ID2933
Feature activation+8.12e-02

Pos logit contributions
 safer+0.351
 Doctor+0.338
old+0.330
Here+0.324
 better+0.322
Neg logit contributions
 antenna-0.501
chie-0.449
 stub-0.442
rys-0.438
 magn-0.425
Token who
Token ID508
Feature activation+6.67e-02

Pos logit contributions
 Doctor+0.127
 safer+0.126
old+0.124
Here+0.121
 cleaner+0.118
Neg logit contributions
 antenna-0.137
 stub-0.123
rys-0.120
chie-0.119
 magn-0.117
Token had
Token ID550
Feature activation-1.49e-01

Pos logit contributions
old+0.112
 Doctor+0.109
 safer+0.104
 better+0.101
 doctor+0.099
Neg logit contributions
 antenna-0.114
rys-0.104
 stub-0.101
chie-0.098
 magn-0.097
Token a
Token ID257
Feature activation-1.81e-02

Pos logit contributions
 antenna+0.318
rys+0.301
 stub+0.287
chie+0.282
 magn+0.279
Neg logit contributions
old-0.193
 Doctor-0.182
 safer-0.170
 better-0.169
 doctor-0.163
Token new
Token ID649
Feature activation+1.54e-01

Pos logit contributions
 antenna+0.036
rys+0.034
 stub+0.032
chie+0.032
 magn+0.031
Neg logit contributions
old-0.026
 Doctor-0.025
 safer-0.023
 better-0.023
 doctor-0.022
Token was
Token ID373
Feature activation+7.04e-02

Pos logit contributions
 antenna+0.029
 magn+0.026
rys+0.026
 nowhere+0.026
yt+0.025
Neg logit contributions
old-0.021
 Doctor-0.020
Here-0.018
 doctor-0.017
 safer-0.017
Token three
Token ID1115
Feature activation-4.06e-02

Pos logit contributions
old+0.099
 Doctor+0.094
 safer+0.090
 better+0.090
 doctor+0.085
Neg logit contributions
 antenna-0.142
rys-0.134
 stub-0.126
chie-0.126
 magn-0.124
Token-
Token ID12
Feature activation-2.42e-02

Pos logit contributions
 antenna+0.060
rys+0.056
 stub+0.055
chie+0.051
 magn+0.050
Neg logit contributions
old-0.081
 Doctor-0.076
 better-0.073
 safer-0.073
 doctor-0.070
Tokenyears
Token ID19002
Feature activation-1.32e-01

Pos logit contributions
 antenna+0.032
rys+0.029
 stub+0.027
chie+0.027
 magn+0.026
Neg logit contributions
old-0.050
 Doctor-0.049
 safer-0.047
 better-0.047
 doctor-0.046
Token-
Token ID12
Feature activation-3.71e-02

Pos logit contributions
 antenna+0.250
rys+0.231
chie+0.220
 stub+0.218
 magn+0.211
Neg logit contributions
old-0.198
 Doctor-0.194
 safer-0.185
 better-0.181
 doctor-0.176
Tokenold
Token ID727
Feature activation+2.55e-01

Pos logit contributions
 antenna+0.096
chie+0.092
rys+0.089
 stub+0.087
ho+0.087
Neg logit contributions
 Doctor-0.027
 safer-0.027
 better-0.025
 cleaner-0.022
 doctor-0.022
Token.
Token ID13
Feature activation+2.37e-02

Pos logit contributions
 safer+0.345
 Doctor+0.343
old+0.341
 better+0.322
Here+0.322
Neg logit contributions
 antenna-0.507
 stub-0.445
chie-0.444
rys-0.442
ho-0.432
Token She
Token ID1375
Feature activation+1.59e-03

Pos logit contributions
old+0.029
 Doctor+0.027
 safer+0.027
 better+0.026
 cleaner+0.024
Neg logit contributions
 antenna-0.051
rys-0.048
 stub-0.046
chie-0.046
 magn-0.045
Token had
Token ID550
Feature activation-1.47e-01

Pos logit contributions
old+0.002
 Doctor+0.002
 better+0.002
 him+0.002
 doctor+0.002
Neg logit contributions
rys-0.003
 antenna-0.003
chie-0.003
 magn-0.003
 stub-0.003
Token a
Token ID257
Feature activation-1.59e-02

Pos logit contributions
 antenna+0.308
rys+0.307
ho+0.282
yt+0.281
chie+0.278
Neg logit contributions
old-0.194
 Doctor-0.177
 better-0.175
 safer-0.167
 him-0.166
Token lot
Token ID1256
Feature activation-3.41e-02

Pos logit contributions
 antenna+0.031
rys+0.030
 stub+0.028
chie+0.028
rac+0.027
Neg logit contributions
old-0.023
 Doctor-0.022
 better-0.021
 safer-0.021
 doctor-0.020
Token a
Token ID257
Feature activation+1.84e-02

Pos logit contributions
old+0.077
 Doctor+0.072
 safer+0.067
 better+0.066
 doctor+0.064
Neg logit contributions
 antenna-0.133
rys-0.125
 stub-0.120
chie-0.118
 magn-0.116
Token 3
Token ID513
Feature activation-5.28e-02

Pos logit contributions
old+0.024
 Doctor+0.024
 better+0.023
 safer+0.023
 doctor+0.022
Neg logit contributions
 antenna-0.038
rys-0.036
chie-0.034
 stub-0.034
rac-0.034
Token-
Token ID12
Feature activation-2.80e-02

Pos logit contributions
 antenna+0.091
rys+0.084
 stub+0.080
ifying+0.079
nam+0.078
Neg logit contributions
old-0.098
 Doctor-0.088
 better-0.082
 doctor-0.081
 choice-0.080
Tokenyear
Token ID1941
Feature activation-2.01e-02

Pos logit contributions
 antenna+0.051
rys+0.047
chie+0.045
 stub+0.045
rac+0.043
Neg logit contributions
old-0.043
 Doctor-0.043
 safer-0.041
 better-0.040
 doctor-0.039
Token-
Token ID12
Feature activation-3.90e-02

Pos logit contributions
 antenna+0.039
rys+0.037
 stub+0.035
 magn+0.034
chie+0.034
Neg logit contributions
old-0.031
 Doctor-0.028
 safer-0.026
 better-0.026
 doctor-0.026
Tokenold
Token ID727
Feature activation+2.52e-01

Pos logit contributions
 antenna+0.111
chie+0.105
rys+0.104
 stub+0.102
Lou+0.100
Neg logit contributions
 Doctor-0.018
 safer-0.017
 better-0.015
old-0.014
 doctor-0.013
Token little
Token ID1310
Feature activation+2.49e-02

Pos logit contributions
 safer+0.347
 Doctor+0.339
old+0.334
 better+0.323
Here+0.322
Neg logit contributions
 antenna-0.493
chie-0.442
 stub-0.438
rys-0.436
 magn-0.418
Token boy
Token ID2933
Feature activation+7.47e-02

Pos logit contributions
old+0.030
 Doctor+0.029
 safer+0.027
Here+0.025
 better+0.025
Neg logit contributions
 antenna-0.055
rys-0.051
chie-0.050
 stub-0.049
 magn-0.048
Token.
Token ID13
Feature activation+2.16e-02

Pos logit contributions
 Doctor+0.119
old+0.119
 safer+0.116
Here+0.111
 better+0.110
Neg logit contributions
 antenna-0.128
 stub-0.114
rys-0.113
chie-0.110
 magn-0.108
Token He
Token ID679
Feature activation+4.68e-02

Pos logit contributions
old+0.025
 safer+0.024
 better+0.023
 Doctor+0.023
 cleaner+0.022
Neg logit contributions
 antenna-0.047
rys-0.044
 stub-0.042
chie-0.042
 magn-0.041
Token had
Token ID550
Feature activation-1.49e-01

Pos logit contributions
old+0.067
 Doctor+0.060
 better+0.058
 doctor+0.056
 him+0.056
Neg logit contributions
 antenna-0.095
rys-0.095
chie-0.087
 stub-0.086
 magn-0.085
Token her
Token ID607
Feature activation-3.79e-03

Pos logit contributions
old+0.076
 Doctor+0.072
 safer+0.067
 better+0.066
 doctor+0.064
Neg logit contributions
 antenna-0.117
rys-0.110
 stub-0.105
chie-0.104
 magn-0.102
Token three
Token ID1115
Feature activation-4.25e-02

Pos logit contributions
 antenna+0.008
rys+0.007
 stub+0.007
chie+0.007
 magn+0.007
Neg logit contributions
old-0.005
 Doctor-0.005
 safer-0.005
 better-0.005
 doctor-0.004
Token-
Token ID12
Feature activation-3.35e-02

Pos logit contributions
 antenna+0.075
rys+0.073
 stub+0.070
 magn+0.067
chie+0.065
Neg logit contributions
old-0.072
 Doctor-0.064
 better-0.063
 safer-0.062
 him-0.060
Tokenyear
Token ID1941
Feature activation-2.26e-02

Pos logit contributions
 antenna+0.065
chie+0.059
rys+0.059
 stub+0.057
ho+0.055
Neg logit contributions
 Doctor-0.047
 safer-0.045
old-0.044
 better-0.043
 doctor-0.043
Token-
Token ID12
Feature activation-4.14e-02

Pos logit contributions
 antenna+0.040
rys+0.037
chie+0.035
 stub+0.034
rac+0.033
Neg logit contributions
old-0.036
 Doctor-0.036
 safer-0.034
 better-0.033
Here-0.033
Tokenold
Token ID727
Feature activation+2.49e-01

Pos logit contributions
 antenna+0.120
chie+0.116
rys+0.111
Lou+0.109
ho+0.109
Neg logit contributions
 Doctor-0.017
 safer-0.015
 better-0.012
 doctor-0.011
 cleaner-0.009
Token daughter
Token ID4957
Feature activation+6.44e-02

Pos logit contributions
 safer+0.318
 Doctor+0.317
old+0.311
Here+0.298
 better+0.291
Neg logit contributions
 antenna-0.513
chie-0.452
rys-0.450
 stub-0.450
 magn-0.434
Token went
Token ID1816
Feature activation+3.00e-03

Pos logit contributions
old+0.102
 Doctor+0.098
 safer+0.093
 better+0.091
 doctor+0.089
Neg logit contributions
 antenna-0.118
rys-0.108
 stub-0.104
chie-0.101
 magn-0.100
Token to
Token ID284
Feature activation-2.18e-02

Pos logit contributions
old+0.004
 Doctor+0.004
 safer+0.004
 better+0.004
 doctor+0.004
Neg logit contributions
 antenna-0.006
rys-0.006
 stub-0.005
chie-0.005
 magn-0.005
Token the
Token ID262
Feature activation+3.62e-02

Pos logit contributions
 antenna+0.046
rys+0.042
 stub+0.041
 magn+0.041
chie+0.039
Neg logit contributions
old-0.028
 Doctor-0.026
 safer-0.026
 cleaner-0.024
Here-0.024
Token park
Token ID3952
Feature activation+1.13e-01

Pos logit contributions
old+0.045
 Doctor+0.039
 safer+0.037
 him+0.036
 better+0.035
Neg logit contributions
 antenna-0.077
rys-0.072
 stub-0.070
chie-0.070
 magn-0.070
Token a
Token ID257
Feature activation-1.21e-02

Pos logit contributions
old+0.029
 Doctor+0.028
 safer+0.026
 doctor+0.025
 choice+0.025
Neg logit contributions
 antenna-0.041
rys-0.038
 stub-0.037
 nowhere-0.036
 magn-0.035
Token three
Token ID1115
Feature activation-5.95e-02

Pos logit contributions
 antenna+0.023
rys+0.022
 stub+0.022
 magn+0.021
yt+0.021
Neg logit contributions
old-0.018
 Doctor-0.016
 safer-0.016
 him-0.015
 better-0.015
Token-
Token ID12
Feature activation-4.43e-02

Pos logit contributions
 antenna+0.111
rys+0.104
 stub+0.099
chie+0.097
 magn+0.095
Neg logit contributions
old-0.095
 Doctor-0.090
 better-0.083
 safer-0.083
 doctor-0.083
Tokenyear
Token ID1941
Feature activation-2.55e-02

Pos logit contributions
 antenna+0.076
rys+0.069
 stub+0.067
chie+0.066
 nowhere+0.063
Neg logit contributions
 Doctor-0.072
old-0.071
 safer-0.069
 better-0.068
 doctor-0.065
Token-
Token ID12
Feature activation-4.85e-02

Pos logit contributions
 antenna+0.050
rys+0.047
 stub+0.046
 magn+0.044
chie+0.044
Neg logit contributions
old-0.039
 Doctor-0.035
 safer-0.032
 better-0.032
 doctor-0.032
Tokenold
Token ID727
Feature activation+2.48e-01

Pos logit contributions
 antenna+0.143
chie+0.135
rys+0.135
 stub+0.134
ho+0.131
Neg logit contributions
 Doctor-0.017
 safer-0.015
 better-0.013
 cleaner-0.010
 doctor-0.008
Token who
Token ID508
Feature activation+6.27e-02

Pos logit contributions
 safer+0.310
old+0.308
 Doctor+0.306
 better+0.290
Here+0.287
Neg logit contributions
 antenna-0.521
 stub-0.466
rys-0.466
chie-0.458
 magn-0.450
Token found
Token ID1043
Feature activation+4.90e-03

Pos logit contributions
old+0.104
 Doctor+0.100
 safer+0.095
 better+0.094
 doctor+0.091
Neg logit contributions
 antenna-0.111
rys-0.103
 stub-0.097
chie-0.095
 magn-0.094
Token a
Token ID257
Feature activation-2.10e-02

Pos logit contributions
old+0.007
 Doctor+0.006
 safer+0.006
 better+0.006
 choice+0.006
Neg logit contributions
 antenna-0.010
 nowhere-0.009
rys-0.009
 stub-0.009
chie-0.008
Token chair
Token ID5118
Feature activation-9.25e-02

Pos logit contributions
 antenna+0.043
rys+0.040
chie+0.038
 stub+0.038
rac+0.037
Neg logit contributions
old-0.029
 Doctor-0.029
 better-0.027
 safer-0.027
 doctor-0.026
Token.
Token ID13
Feature activation+9.80e-03

Pos logit contributions
 antenna+0.210
rys+0.199
 stub+0.189
chie+0.189
 magn+0.185
Neg logit contributions
old-0.109
 Doctor-0.102
 safer-0.094
 better-0.093
 doctor-0.092
TokenThe
Token ID464
Feature activation+8.78e-03

Pos logit contributions
old+0.023
 Doctor+0.021
 safer+0.020
 better+0.020
 cleaner+0.019
Neg logit contributions
 antenna-0.046
rys-0.044
 stub-0.042
chie-0.041
 magn-0.041
Token 3
Token ID513
Feature activation-6.24e-02

Pos logit contributions
old+0.011
 Doctor+0.011
 safer+0.010
 better+0.010
 doctor+0.009
Neg logit contributions
 antenna-0.019
rys-0.018
 stub-0.017
chie-0.017
 magn-0.016
Token-
Token ID12
Feature activation-4.08e-02

Pos logit contributions
 antenna+0.104
rys+0.100
 stub+0.093
 magn+0.092
nam+0.090
Neg logit contributions
old-0.117
 Doctor-0.105
 doctor-0.098
 better-0.097
 choice-0.096
Tokenyear
Token ID1941
Feature activation-2.42e-02

Pos logit contributions
 antenna+0.087
rys+0.082
 stub+0.078
chie+0.077
 magn+0.076
Neg logit contributions
old-0.053
 Doctor-0.050
 safer-0.047
 better-0.046
 doctor-0.045
Token-
Token ID12
Feature activation-4.58e-02

Pos logit contributions
 antenna+0.048
rys+0.045
 stub+0.044
 magn+0.043
chie+0.041
Neg logit contributions
old-0.037
 Doctor-0.034
 better-0.031
 safer-0.031
 doctor-0.031
Tokenold
Token ID727
Feature activation+2.47e-01

Pos logit contributions
 antenna+0.144
rys+0.137
chie+0.135
 stub+0.135
rac+0.131
Neg logit contributions
 Doctor-0.009
old-0.008
 safer-0.006
 better-0.005
 doctor-0.002
Token thought
Token ID1807
Feature activation-5.24e-02

Pos logit contributions
old+0.385
 Doctor+0.373
 safer+0.357
 better+0.349
 doctor+0.337
Neg logit contributions
 antenna-0.455
rys-0.417
 stub-0.403
chie-0.395
 magn-0.385
Token hard
Token ID1327
Feature activation-5.45e-02

Pos logit contributions
 antenna+0.114
rys+0.103
 stub+0.102
chie+0.101
 nowhere+0.100
Neg logit contributions
old-0.062
 Doctor-0.060
Here-0.055
 safer-0.055
 better-0.051
Token and
Token ID290
Feature activation+1.44e-02

Pos logit contributions
 antenna+0.115
rys+0.106
 stub+0.102
chie+0.101
 magn+0.100
Neg logit contributions
old-0.071
 Doctor-0.068
 safer-0.065
 better-0.061
 doctor-0.060
Token smiled
Token ID13541
Feature activation-4.77e-02

Pos logit contributions
old+0.022
 Doctor+0.021
 safer+0.019
 better+0.019
 doctor+0.019
Neg logit contributions
 antenna-0.028
rys-0.025
 stub-0.025
chie-0.024
 magn-0.023
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
 antenna+0.103
rys+0.096
 stub+0.092
chie+0.091
 magn+0.089
Neg logit contributions
old-0.062
 Doctor-0.058
 safer-0.054
 better-0.052
 doctor-0.051
Token are
Token ID389
Feature activation+5.21e-02

Pos logit contributions
 Doctor+0.008
old+0.008
 safer+0.008
Here+0.007
 doctor+0.007
Neg logit contributions
 antenna-0.008
 stub-0.007
ho-0.006
 leapt-0.006
 magn-0.006
Token angry
Token ID7954
Feature activation+2.56e-02

Pos logit contributions
old+0.080
 Doctor+0.075
 safer+0.071
Here+0.071
 doctor+0.068
Neg logit contributions
 antenna-0.098
rys-0.089
 stub-0.089
chie-0.085
 nowhere-0.084
Token.
Token ID13
Feature activation+2.58e-02

Pos logit contributions
old+0.036
 safer+0.036
 Doctor+0.036
Here+0.035
 better+0.033
Neg logit contributions
 antenna-0.049
rys-0.044
 stub-0.042
yt-0.042
ho-0.042
Token They
Token ID1119
Feature activation+3.57e-02

Pos logit contributions
old+0.027
 Doctor+0.026
 safer+0.023
 better+0.023
 doctor+0.022
Neg logit contributions
 antenna-0.061
rys-0.058
 stub-0.056
chie-0.055
 magn-0.054
Token sc
Token ID629
Feature activation-3.67e-02

Pos logit contributions
old+0.043
 Doctor+0.041
 safer+0.038
 better+0.036
 doctor+0.036
Neg logit contributions
 antenna-0.079
rys-0.073
 stub-0.071
chie-0.069
 magn-0.069
Tokenold
Token ID727
Feature activation+2.46e-01

Pos logit contributions
 antenna+0.074
chie+0.069
rys+0.069
rac+0.067
 stub+0.065
Neg logit contributions
 Doctor-0.046
Here-0.043
 safer-0.043
 better-0.043
 cleaner-0.042
Token Lily
Token ID20037
Feature activation+5.88e-02

Pos logit contributions
 safer+0.322
Here+0.305
old+0.303
 Doctor+0.288
 cleaner+0.286
Neg logit contributions
 antenna-0.476
ho-0.439
 stub-0.428
 nowhere-0.426
chie-0.419
Token and
Token ID290
Feature activation+1.53e-02

Pos logit contributions
old+0.080
 Doctor+0.077
 safer+0.072
 better+0.069
Here+0.068
Neg logit contributions
 antenna-0.120
rys-0.111
 stub-0.109
chie-0.104
 magn-0.104
Token Ben
Token ID3932
Feature activation-4.44e-03

Pos logit contributions
old+0.016
 Doctor+0.014
 safer+0.014
 better+0.013
 doctor+0.012
Neg logit contributions
 antenna-0.037
rys-0.035
 stub-0.033
 magn-0.033
chie-0.032
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
 antenna+0.009
rys+0.009
 stub+0.008
chie+0.008
 magn+0.008
Neg logit contributions
old-0.006
 Doctor-0.006
 safer-0.005
 better-0.005
 doctor-0.005
Token They
Token ID1119
Feature activation+2.73e-02

Pos logit contributions
old+0.019
 Doctor+0.017
 safer+0.016
 better+0.016
 doctor+0.015
Neg logit contributions
 antenna-0.037
rys-0.035
 stub-0.034
 magn-0.033
chie-0.032
Token was
Token ID373
Feature activation+2.83e-02

Pos logit contributions
old+0.040
 Doctor+0.036
 better+0.035
 doctor+0.034
 safer+0.034
Neg logit contributions
 antenna-0.048
rys-0.047
chie-0.044
 stub-0.042
 magn-0.042
Token three
Token ID1115
Feature activation-5.32e-02

Pos logit contributions
old+0.044
 Doctor+0.041
 safer+0.039
 better+0.038
 doctor+0.038
Neg logit contributions
 antenna-0.053
rys-0.049
 stub-0.048
chie-0.046
 magn-0.045
Token-
Token ID12
Feature activation-3.98e-02

Pos logit contributions
 antenna+0.068
rys+0.062
 stub+0.058
chie+0.055
 magn+0.054
Neg logit contributions
old-0.116
 Doctor-0.111
 safer-0.107
 better-0.106
 doctor-0.103
Tokenyears
Token ID19002
Feature activation-1.46e-01

Pos logit contributions
 antenna+0.035
rys+0.030
chie+0.028
 stub+0.027
ho+0.024
Neg logit contributions
 Doctor-0.098
old-0.097
 safer-0.095
 better-0.094
 doctor-0.093
Token-
Token ID12
Feature activation-4.49e-02

Pos logit contributions
 antenna+0.271
rys+0.250
 stub+0.237
chie+0.236
 magn+0.229
Neg logit contributions
old-0.228
 Doctor-0.221
 safer-0.210
 better-0.207
 doctor-0.201
Tokenold
Token ID727
Feature activation+2.46e-01

Pos logit contributions
 antenna+0.126
chie+0.122
rys+0.119
ho+0.118
Lou+0.117
Neg logit contributions
 safer-0.021
 Doctor-0.020
 better-0.018
 cleaner-0.016
 doctor-0.014
Token and
Token ID290
Feature activation+1.91e-02

Pos logit contributions
old+0.330
 Doctor+0.322
 safer+0.314
Here+0.302
 better+0.297
Neg logit contributions
 antenna-0.500
rys-0.442
 stub-0.436
chie-0.436
ho-0.430
Token she
Token ID673
Feature activation-3.82e-02

Pos logit contributions
old+0.028
 Doctor+0.027
 safer+0.025
 better+0.025
 doctor+0.024
Neg logit contributions
 antenna-0.038
rys-0.035
 stub-0.034
chie-0.033
 magn-0.032
Token loved
Token ID6151
Feature activation+1.35e-02

Pos logit contributions
 antenna+0.075
rys+0.074
 stub+0.066
chie+0.066
 magn+0.066
Neg logit contributions
old-0.058
 Doctor-0.052
 better-0.050
 safer-0.049
 doctor-0.049
Token to
Token ID284
Feature activation-2.33e-02

Pos logit contributions
old+0.020
 Doctor+0.019
 safer+0.018
 better+0.018
 doctor+0.017
Neg logit contributions
 antenna-0.026
rys-0.024
 stub-0.023
chie-0.022
 magn-0.022
Token explore
Token ID7301
Feature activation-1.92e-02

Pos logit contributions
 antenna+0.050
rys+0.047
 stub+0.045
 magn+0.044
chie+0.044
Neg logit contributions
old-0.030
 Doctor-0.028
 safer-0.027
 better-0.026
Here-0.025
Token little
Token ID1310
Feature activation+2.26e-02

Pos logit contributions
 antenna+0.026
rys+0.025
chie+0.023
 stub+0.023
rac+0.023
Neg logit contributions
 Doctor-0.018
old-0.018
 better-0.017
 safer-0.017
 doctor-0.017
Token 3
Token ID513
Feature activation-7.27e-02

Pos logit contributions
old+0.027
 Doctor+0.025
 safer+0.023
 better+0.023
 doctor+0.022
Neg logit contributions
 antenna-0.050
rys-0.047
 stub-0.046
chie-0.045
 magn-0.044
Token-
Token ID12
Feature activation-4.39e-02

Pos logit contributions
 antenna+0.124
rys+0.114
ifying+0.107
 stub+0.107
rac+0.105
Neg logit contributions
old-0.137
 Doctor-0.124
 doctor-0.115
 better-0.114
 ice-0.113
Tokenyear
Token ID1941
Feature activation-3.06e-02

Pos logit contributions
 antenna+0.070
rys+0.064
chie+0.063
 stub+0.063
ho+0.060
Neg logit contributions
 Doctor-0.075
 safer-0.073
old-0.072
 better-0.071
 doctor-0.068
Token-
Token ID12
Feature activation-4.73e-02

Pos logit contributions
 antenna+0.060
rys+0.056
 stub+0.055
 magn+0.053
chie+0.052
Neg logit contributions
old-0.047
 Doctor-0.043
 safer-0.040
 better-0.040
 doctor-0.039
Tokenold
Token ID727
Feature activation+2.44e-01

Pos logit contributions
 antenna+0.136
chie+0.131
ho+0.127
 stub+0.127
Lou+0.126
Neg logit contributions
 Doctor-0.019
 safer-0.018
 better-0.015
 choice-0.012
 cleaner-0.011
Token boy
Token ID2933
Feature activation+6.94e-02

Pos logit contributions
 safer+0.304
 Doctor+0.293
Here+0.284
old+0.284
 better+0.275
Neg logit contributions
 antenna-0.514
 stub-0.463
rys-0.457
chie-0.456
 magn-0.446
Token.
Token ID13
Feature activation+1.75e-02

Pos logit contributions
 Doctor+0.124
old+0.122
 safer+0.120
 better+0.115
 cleaner+0.115
Neg logit contributions
 antenna-0.106
 stub-0.096
rys-0.096
 magn-0.091
 nowhere-0.090
Token He
Token ID679
Feature activation+4.30e-02

Pos logit contributions
old+0.020
 safer+0.018
 Doctor+0.018
 better+0.018
 cleaner+0.017
Neg logit contributions
 antenna-0.039
rys-0.036
 stub-0.035
 nowhere-0.034
 magn-0.034
Token asked
Token ID1965
Feature activation+5.38e-02

Pos logit contributions
old+0.059
 better+0.052
 Doctor+0.052
 him+0.051
 doctor+0.049
Neg logit contributions
rys-0.092
 antenna-0.089
chie-0.085
 magn-0.083
yt-0.081
Token the
Token ID262
Feature activation+3.02e-02

Pos logit contributions
old+0.063
 safer+0.062
Here+0.061
 Doctor+0.058
 cleaner+0.057
Neg logit contributions
 antenna-0.113
 nowhere-0.100
nam-0.098
rac-0.097
 magn-0.097
Token tiny
Token ID7009
Feature activation-5.53e-02

Pos logit contributions
 antenna+0.033
rys+0.031
 stub+0.030
chie+0.029
 magn+0.029
Neg logit contributions
old-0.022
 Doctor-0.021
 safer-0.019
 better-0.019
 doctor-0.018
Token three
Token ID1115
Feature activation-6.13e-02

Pos logit contributions
 antenna+0.112
rys+0.105
 stub+0.100
chie+0.099
rac+0.099
Neg logit contributions
old-0.076
 Doctor-0.074
 better-0.068
 doctor-0.068
 safer-0.068
Token-
Token ID12
Feature activation-4.64e-02

Pos logit contributions
 antenna+0.120
rys+0.112
ifying+0.112
 stub+0.110
nam+0.108
Neg logit contributions
old-0.098
 Doctor-0.084
 doctor-0.079
 ice-0.079
 choice-0.077
Tokenyear
Token ID1941
Feature activation-2.98e-02

Pos logit contributions
 antenna+0.085
rys+0.079
 stub+0.076
chie+0.075
 magn+0.073
Neg logit contributions
old-0.072
 Doctor-0.071
 safer-0.067
 better-0.066
 doctor-0.064
Token-
Token ID12
Feature activation-4.91e-02

Pos logit contributions
 antenna+0.063
rys+0.060
 stub+0.060
ifying+0.060
 magn+0.059
Neg logit contributions
old-0.042
 Doctor-0.035
 doctor-0.032
 ice-0.032
 better-0.032
Tokenold
Token ID727
Feature activation+2.43e-01

Pos logit contributions
 antenna+0.156
rys+0.148
chie+0.147
 stub+0.145
ho+0.143
Neg logit contributions
 Doctor-0.009
 safer-0.006
 better-0.004
old-0.002
 doctor-0.000
Token girl
Token ID2576
Feature activation+1.50e-02

Pos logit contributions
old+0.323
 Doctor+0.319
 safer+0.317
 better+0.300
Here+0.290
Neg logit contributions
 antenna-0.494
rys-0.448
 stub-0.443
chie-0.434
 magn-0.427
Token came
Token ID1625
Feature activation+4.48e-02

Pos logit contributions
old+0.025
 Doctor+0.023
 better+0.022
 safer+0.022
 doctor+0.022
Neg logit contributions
 antenna-0.027
rys-0.025
chie-0.024
 stub-0.023
rac-0.023
Token across
Token ID1973
Feature activation-1.00e-01

Pos logit contributions
old+0.060
 Doctor+0.054
 better+0.053
 him+0.051
 safer+0.050
Neg logit contributions
 antenna-0.095
rys-0.091
 stub-0.087
rac-0.084
chie-0.084
Token the
Token ID262
Feature activation+3.20e-02

Pos logit contributions
 antenna+0.230
rys+0.217
 stub+0.209
chie+0.206
 magn+0.203
Neg logit contributions
old-0.115
 Doctor-0.108
 safer-0.099
 better-0.098
 doctor-0.094
Token letter
Token ID3850
Feature activation-2.91e-02

Pos logit contributions
old+0.043
 Doctor+0.043
 better+0.040
 doctor+0.040
 safer+0.039
Neg logit contributions
 antenna-0.066
rys-0.062
chie-0.058
 stub-0.058
rac-0.058
Token was
Token ID373
Feature activation+1.24e-02

Pos logit contributions
 antenna+0.055
rys+0.053
chie+0.049
 stub+0.048
 magn+0.048
Neg logit contributions
old-0.043
 Doctor-0.039
 better-0.038
 safer-0.037
 doctor-0.037
Token three
Token ID1115
Feature activation-5.88e-02

Pos logit contributions
old+0.019
 Doctor+0.018
 safer+0.017
 better+0.017
 doctor+0.016
Neg logit contributions
 antenna-0.024
rys-0.022
 stub-0.021
chie-0.021
 magn-0.021
Token-
Token ID12
Feature activation-4.85e-02

Pos logit contributions
 antenna+0.081
rys+0.074
 stub+0.070
chie+0.067
 magn+0.066
Neg logit contributions
old-0.124
 Doctor-0.116
 safer-0.112
 better-0.112
 doctor-0.108
Tokenyears
Token ID19002
Feature activation-1.48e-01

Pos logit contributions
 antenna+0.043
rys+0.037
 stub+0.033
chie+0.033
 magn+0.029
Neg logit contributions
old-0.121
 Doctor-0.120
 safer-0.116
 better-0.115
 doctor-0.113
Token-
Token ID12
Feature activation-5.11e-02

Pos logit contributions
 antenna+0.280
rys+0.260
 stub+0.247
chie+0.244
 magn+0.239
Neg logit contributions
old-0.225
 Doctor-0.217
 safer-0.204
 better-0.202
 doctor-0.197
Tokenold
Token ID727
Feature activation+2.42e-01

Pos logit contributions
 antenna+0.148
chie+0.142
rys+0.140
 stub+0.137
ho+0.137
Neg logit contributions
 Doctor-0.022
 safer-0.021
 better-0.017
 cleaner-0.016
 doctor-0.014
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
 Doctor+0.309
old+0.309
 safer+0.297
Here+0.282
 better+0.279
Neg logit contributions
 antenna-0.506
rys-0.453
 stub-0.446
chie-0.444
 magn-0.432
Token She
Token ID1375
Feature activation-4.45e-03

Pos logit contributions
old+0.019
 Doctor+0.018
 safer+0.018
 better+0.017
 cleaner+0.017
Neg logit contributions
 antenna-0.026
rys-0.024
 stub-0.024
 nowhere-0.023
chie-0.023
Token wanted
Token ID2227
Feature activation-9.38e-03

Pos logit contributions
rys+0.009
 antenna+0.009
chie+0.008
 magn+0.008
 stub+0.008
Neg logit contributions
old-0.006
 Doctor-0.006
 better-0.005
 doctor-0.005
 him-0.005
Token to
Token ID284
Feature activation-2.49e-02

Pos logit contributions
 antenna+0.018
rys+0.016
 stub+0.016
chie+0.015
 magn+0.015
Neg logit contributions
old-0.014
 Doctor-0.013
 safer-0.013
 better-0.012
 doctor-0.012
Token go
Token ID467
Feature activation+7.93e-02

Pos logit contributions
 antenna+0.056
rys+0.053
 stub+0.051
chie+0.050
 magn+0.049
Neg logit contributions
old-0.029
 Doctor-0.028
 better-0.025
 safer-0.025
 doctor-0.025
Token little
Token ID1310
Feature activation+2.50e-02

Pos logit contributions
old+0.068
 Doctor+0.064
 safer+0.060
 better+0.059
 doctor+0.057
Neg logit contributions
 antenna-0.111
rys-0.104
 stub-0.100
chie-0.098
 magn-0.097
Token three
Token ID1115
Feature activation-5.76e-02

Pos logit contributions
old+0.034
 Doctor+0.032
 safer+0.030
 better+0.030
 doctor+0.029
Neg logit contributions
 antenna-0.052
rys-0.049
 stub-0.046
chie-0.046
 magn-0.045
Token-
Token ID12
Feature activation-4.44e-02

Pos logit contributions
rys+0.111
 antenna+0.109
 stub+0.104
 magn+0.103
ifying+0.099
Neg logit contributions
old-0.091
 better-0.079
 Doctor-0.077
 safer-0.075
 choice-0.073
Tokenyear
Token ID1941
Feature activation-3.11e-02

Pos logit contributions
 antenna+0.091
rys+0.084
chie+0.084
 stub+0.081
ho+0.079
Neg logit contributions
 Doctor-0.057
 safer-0.054
old-0.053
 better-0.052
 doctor-0.050
Token-
Token ID12
Feature activation-4.77e-02

Pos logit contributions
 antenna+0.059
rys+0.055
 stub+0.053
chie+0.051
 magn+0.051
Neg logit contributions
old-0.048
 Doctor-0.046
 safer-0.043
 better-0.043
 doctor-0.041
Tokenold
Token ID727
Feature activation+2.42e-01

Pos logit contributions
 antenna+0.145
chie+0.138
rys+0.135
Lou+0.134
 stub+0.133
Neg logit contributions
 Doctor-0.013
 safer-0.011
 better-0.007
 cleaner-0.005
 doctor-0.005
Token daughter
Token ID4957
Feature activation+5.94e-02

Pos logit contributions
 safer+0.362
 Doctor+0.362
old+0.356
Here+0.344
 better+0.338
Neg logit contributions
 antenna-0.446
 stub-0.392
rys-0.382
chie-0.381
 magn-0.368
Token was
Token ID373
Feature activation+6.18e-03

Pos logit contributions
old+0.096
 Doctor+0.092
 better+0.086
 safer+0.086
 doctor+0.084
Neg logit contributions
 antenna-0.107
rys-0.101
 stub-0.095
chie-0.094
 magn-0.092
Token cl
Token ID537
Feature activation-6.14e-02

Pos logit contributions
old+0.009
 Doctor+0.009
 better+0.008
 safer+0.008
 doctor+0.008
Neg logit contributions
 antenna-0.012
rys-0.011
 stub-0.011
chie-0.011
 magn-0.011
Tokenapping
Token ID5912
Feature activation-7.65e-02

Pos logit contributions
 antenna+0.092
rys+0.082
 stub+0.079
 magn+0.077
chie+0.074
Neg logit contributions
old-0.123
 Doctor-0.116
 better-0.110
 safer-0.109
 doctor-0.108
Token and
Token ID290
Feature activation+1.10e-02

Pos logit contributions
 antenna+0.171
rys+0.167
 stub+0.159
chie+0.155
 magn+0.154
Neg logit contributions
old-0.089
 Doctor-0.081
 better-0.080
 him-0.078
 safer-0.076
Token a
Token ID257
Feature activation-1.59e-02

Pos logit contributions
old+0.023
 Doctor+0.022
 safer+0.020
 better+0.020
 doctor+0.020
Neg logit contributions
 antenna-0.032
rys-0.030
chie-0.028
 stub-0.028
 magn-0.028
Token three
Token ID1115
Feature activation-5.93e-02

Pos logit contributions
 antenna+0.032
rys+0.029
 stub+0.028
chie+0.028
 magn+0.027
Neg logit contributions
old-0.023
 Doctor-0.022
 safer-0.021
 better-0.021
 doctor-0.020
Token-
Token ID12
Feature activation-4.84e-02

Pos logit contributions
 antenna+0.107
rys+0.100
 stub+0.096
chie+0.093
 magn+0.092
Neg logit contributions
old-0.098
 Doctor-0.093
 better-0.087
 safer-0.087
 doctor-0.086
Tokenyear
Token ID1941
Feature activation-3.23e-02

Pos logit contributions
 antenna+0.081
rys+0.075
 stub+0.072
chie+0.071
 magn+0.067
Neg logit contributions
old-0.082
 Doctor-0.081
 safer-0.077
 better-0.076
 doctor-0.073
Token-
Token ID12
Feature activation-5.12e-02

Pos logit contributions
 antenna+0.065
rys+0.061
 stub+0.059
 magn+0.058
 nowhere+0.056
Neg logit contributions
old-0.048
 Doctor-0.043
 safer-0.040
 better-0.040
 doctor-0.040
Tokenold
Token ID727
Feature activation+2.41e-01

Pos logit contributions
 antenna+0.153
chie+0.147
rys+0.147
 stub+0.146
ho+0.142
Neg logit contributions
 Doctor-0.016
 safer-0.012
 better-0.010
old-0.007
 cleaner-0.005
Token boy
Token ID2933
Feature activation+6.78e-02

Pos logit contributions
old+0.271
 Doctor+0.264
 safer+0.260
 better+0.241
Here+0.241
Neg logit contributions
 antenna-0.539
rys-0.492
 stub-0.489
chie-0.482
 magn-0.469
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
old+0.091
 Doctor+0.089
 safer+0.084
 better+0.080
Here+0.079
Neg logit contributions
 antenna-0.137
rys-0.128
 stub-0.126
chie-0.120
 magn-0.120
Token He
Token ID679
Feature activation+4.23e-02

Pos logit contributions
old+0.013
 safer+0.012
 Doctor+0.012
 better+0.012
 cleaner+0.011
Neg logit contributions
 antenna-0.025
rys-0.023
chie-0.023
 stub-0.023
ho-0.022
Token liked
Token ID8288
Feature activation-2.61e-03

Pos logit contributions
old+0.072
 Doctor+0.068
 better+0.065
 safer+0.065
 doctor+0.064
Neg logit contributions
 antenna-0.074
rys-0.070
chie-0.064
 stub-0.064
 magn-0.062
Token to
Token ID284
Feature activation-2.63e-02

Pos logit contributions
 antenna+0.005
rys+0.005
 stub+0.005
chie+0.005
 magn+0.005
Neg logit contributions
old-0.004
 Doctor-0.003
 safer-0.003
 better-0.003
 doctor-0.003
Token from
Token ID422
Feature activation-3.37e-02

Pos logit contributions
 better+0.035
old+0.034
 safer+0.033
 Doctor+0.032
 doctor+0.031
Neg logit contributions
 antenna-0.081
rys-0.079
 stub-0.076
chie-0.075
rac-0.073
Token him
Token ID683
Feature activation+1.33e-01

Pos logit contributions
 antenna+0.079
rys+0.072
 nowhere+0.070
 magn+0.069
 stub+0.069
Neg logit contributions
old-0.039
 Doctor-0.033
 safer-0.031
 cleaner-0.030
 better-0.030
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
old+0.156
 Doctor+0.147
 safer+0.136
 better+0.133
 doctor+0.129
Neg logit contributions
 antenna-0.300
rys-0.281
 stub-0.271
chie-0.266
 magn-0.264
Token They
Token ID1119
Feature activation+3.11e-02

Pos logit contributions
old+0.021
 Doctor+0.018
 choice+0.018
 safer+0.018
 cleaner+0.018
Neg logit contributions
 antenna-0.031
rys-0.028
 stub-0.028
 nowhere-0.027
rac-0.027
Token sc
Token ID629
Feature activation-4.00e-02

Pos logit contributions
old+0.040
 Doctor+0.038
 better+0.036
 safer+0.036
 doctor+0.034
Neg logit contributions
 antenna-0.066
rys-0.063
 stub-0.060
chie-0.059
 magn-0.057
Tokenold
Token ID727
Feature activation+2.41e-01

Pos logit contributions
 antenna+0.072
rys+0.065
chie+0.063
rac+0.061
ho+0.061
Neg logit contributions
 Doctor-0.064
 doctor-0.059
 better-0.058
old-0.058
 safer-0.058
Token him
Token ID683
Feature activation+1.27e-01

Pos logit contributions
 safer+0.297
Here+0.294
old+0.294
 Doctor+0.281
 cleaner+0.277
Neg logit contributions
 antenna-0.465
 nowhere-0.426
nam-0.416
 reapp-0.410
 stub-0.408
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
old+0.199
 Doctor+0.189
 safer+0.183
 better+0.183
 doctor+0.174
Neg logit contributions
 antenna-0.231
rys-0.222
 stub-0.209
chie-0.208
rac-0.198
Token They
Token ID1119
Feature activation+2.71e-02

Pos logit contributions
old+0.017
 Doctor+0.015
 choice+0.014
 safer+0.014
 better+0.014
Neg logit contributions
 antenna-0.025
rys-0.022
 stub-0.022
rac-0.021
 nowhere-0.021
Token tell
Token ID1560
Feature activation+5.57e-03

Pos logit contributions
old+0.034
 Doctor+0.032
 safer+0.030
 better+0.030
 doctor+0.029
Neg logit contributions
 antenna-0.059
rys-0.055
 stub-0.053
chie-0.052
 magn-0.052
Token him
Token ID683
Feature activation+1.25e-01

Pos logit contributions
old+0.005
 safer+0.005
Here+0.005
 cleaner+0.004
 Doctor+0.004
Neg logit contributions
 antenna-0.013
rys-0.012
 nowhere-0.012
 stub-0.012
rac-0.011
Token a
Token ID257
Feature activation-2.05e-02

Pos logit contributions
 antenna+0.033
rys+0.031
 stub+0.030
 magn+0.030
chie+0.029
Neg logit contributions
old-0.019
 Doctor-0.017
 safer-0.016
 better-0.016
 doctor-0.015
Token three
Token ID1115
Feature activation-6.23e-02

Pos logit contributions
 antenna+0.043
rys+0.040
 stub+0.038
chie+0.038
 magn+0.037
Neg logit contributions
old-0.027
 Doctor-0.026
 safer-0.025
 better-0.024
 doctor-0.024
Token-
Token ID12
Feature activation-4.85e-02

Pos logit contributions
 antenna+0.115
rys+0.105
 stub+0.104
ifying+0.103
nam+0.100
Neg logit contributions
old-0.107
 Doctor-0.095
 doctor-0.090
 ice-0.088
 choice-0.087
Tokenyear
Token ID1941
Feature activation-3.23e-02

Pos logit contributions
 antenna+0.086
rys+0.079
 stub+0.076
chie+0.076
 magn+0.072
Neg logit contributions
 Doctor-0.077
old-0.077
 safer-0.073
 better-0.072
 doctor-0.070
Token-
Token ID12
Feature activation-5.03e-02

Pos logit contributions
 antenna+0.068
rys+0.064
 stub+0.064
 magn+0.062
ifying+0.062
Neg logit contributions
old-0.046
 Doctor-0.039
 doctor-0.036
 better-0.036
 ice-0.035
Tokenold
Token ID727
Feature activation+2.40e-01

Pos logit contributions
 antenna+0.154
rys+0.146
chie+0.146
 stub+0.144
ho+0.141
Neg logit contributions
 Doctor-0.014
 safer-0.010
 better-0.008
old-0.007
 doctor-0.005
Token boy
Token ID2933
Feature activation+6.62e-02

Pos logit contributions
 safer+0.293
 Doctor+0.293
old+0.291
 better+0.273
Here+0.265
Neg logit contributions
 antenna-0.507
rys-0.465
 stub-0.461
chie-0.454
 magn-0.444
Token decided
Token ID3066
Feature activation+2.15e-02

Pos logit contributions
 Doctor+0.117
old+0.117
 safer+0.114
 better+0.110
Here+0.107
Neg logit contributions
 antenna-0.104
rys-0.094
 stub-0.093
 magn-0.088
chie-0.087
Token to
Token ID284
Feature activation-2.51e-02

Pos logit contributions
old+0.030
 Doctor+0.028
 safer+0.027
 better+0.026
 doctor+0.025
Neg logit contributions
 antenna-0.044
rys-0.041
 stub-0.039
chie-0.038
 magn-0.038
Token try
Token ID1949
Feature activation+6.75e-02

Pos logit contributions
 antenna+0.054
rys+0.050
 stub+0.048
chie+0.048
 magn+0.047
Neg logit contributions
old-0.032
 Doctor-0.030
 safer-0.029
 better-0.028
Here-0.026
Token it
Token ID340
Feature activation+1.04e-02

Pos logit contributions
old+0.080
 Doctor+0.075
 safer+0.070
 better+0.069
 doctor+0.066
Neg logit contributions
 antenna-0.152
rys-0.143
 stub-0.138
chie-0.136
 magn-0.134
TokenThe
Token ID464
Feature activation-1.07e-02

Pos logit contributions
 antenna+0.020
rys+0.019
 stub+0.017
 magn+0.017
chie+0.017
Neg logit contributions
old-0.018
 Doctor-0.017
 safer-0.015
Here-0.015
 doctor-0.015
Token three
Token ID1115
Feature activation-6.29e-02

Pos logit contributions
 antenna+0.022
rys+0.021
 stub+0.020
chie+0.020
rac+0.019
Neg logit contributions
old-0.014
 Doctor-0.014
 better-0.013
 safer-0.013
 doctor-0.013
Token-
Token ID12
Feature activation-4.94e-02

Pos logit contributions
 antenna+0.112
rys+0.108
 stub+0.106
 magn+0.101
ifying+0.099
Neg logit contributions
old-0.113
 Doctor-0.096
 customers-0.092
 ice-0.091
 better-0.090
Tokenyear
Token ID1941
Feature activation-3.23e-02

Pos logit contributions
 antenna+0.092
rys+0.085
chie+0.081
 stub+0.081
 magn+0.078
Neg logit contributions
old-0.075
 Doctor-0.074
 safer-0.071
 better-0.070
 doctor-0.067
Token-
Token ID12
Feature activation-5.15e-02

Pos logit contributions
 antenna+0.066
rys+0.063
 stub+0.062
 magn+0.060
ifying+0.059
Neg logit contributions
old-0.047
 Doctor-0.041
 better-0.038
 safer-0.038
 doctor-0.037
Tokenold
Token ID727
Feature activation+2.40e-01

Pos logit contributions
 antenna+0.160
chie+0.152
rys+0.151
 stub+0.148
ho+0.147
Neg logit contributions
 Doctor-0.012
 safer-0.009
 better-0.007
old-0.004
 doctor-0.004
Token girl
Token ID2576
Feature activation+1.55e-02

Pos logit contributions
old+0.344
 Doctor+0.340
 safer+0.328
 better+0.316
Here+0.307
Neg logit contributions
 antenna-0.468
rys-0.422
 stub-0.413
chie-0.409
 magn-0.397
Token was
Token ID373
Feature activation+6.28e-03

Pos logit contributions
old+0.025
 Doctor+0.023
 better+0.022
 safer+0.021
 doctor+0.021
Neg logit contributions
 antenna-0.028
rys-0.028
 stub-0.025
chie-0.025
 magn-0.025
Token fascinated
Token ID30103
Feature activation-6.07e-02

Pos logit contributions
old+0.009
 Doctor+0.009
 better+0.008
 safer+0.008
 doctor+0.008
Neg logit contributions
 antenna-0.012
rys-0.012
 stub-0.011
 magn-0.011
chie-0.011
Token by
Token ID416
Feature activation+2.68e-02

Pos logit contributions
 antenna+0.122
rys+0.113
chie+0.108
 nowhere+0.108
yt+0.107
Neg logit contributions
old-0.083
 Doctor-0.081
 safer-0.077
Here-0.074
 better-0.072
Token the
Token ID262
Feature activation+2.90e-02

Pos logit contributions
old+0.028
 Doctor+0.027
 safer+0.024
 better+0.024
 doctor+0.023
Neg logit contributions
 antenna-0.063
rys-0.060
 stub-0.059
chie-0.057
 magn-0.056
Token a
Token ID257
Feature activation-2.05e-02

Pos logit contributions
old+0.016
 Doctor+0.015
 better+0.015
 safer+0.014
 doctor+0.014
Neg logit contributions
 antenna-0.021
rys-0.021
yt-0.020
 magn-0.020
ho-0.020
Token three
Token ID1115
Feature activation-6.16e-02

Pos logit contributions
 antenna+0.035
rac+0.034
rys+0.034
chie+0.031
yt+0.031
Neg logit contributions
 Doctor-0.034
 better-0.033
 doctor-0.033
 safer-0.032
old-0.031
Token-
Token ID12
Feature activation-5.49e-02

Pos logit contributions
 antenna+0.114
rys+0.109
rac+0.103
chie+0.102
ifying+0.102
Neg logit contributions
old-0.101
 Doctor-0.093
 doctor-0.086
 better-0.086
 safer-0.083
Tokenyear
Token ID1941
Feature activation-3.27e-02

Pos logit contributions
 antenna+0.092
rys+0.085
 stub+0.082
chie+0.082
ho+0.078
Neg logit contributions
 Doctor-0.091
old-0.090
 safer-0.087
 better-0.085
 doctor-0.083
Token-
Token ID12
Feature activation-5.55e-02

Pos logit contributions
 antenna+0.065
rys+0.061
 stub+0.060
 magn+0.059
 nowhere+0.057
Neg logit contributions
old-0.049
 Doctor-0.044
 better-0.041
 safer-0.041
 doctor-0.040
Tokenold
Token ID727
Feature activation+2.40e-01

Pos logit contributions
 antenna+0.162
chie+0.156
rys+0.153
 stub+0.153
ho+0.152
Neg logit contributions
 Doctor-0.019
 safer-0.017
 better-0.014
 cleaner-0.011
 choice-0.010
Token girl
Token ID2576
Feature activation+1.37e-02

Pos logit contributions
old+0.282
 Doctor+0.279
 safer+0.273
Here+0.259
 better+0.255
Neg logit contributions
 antenna-0.529
rys-0.477
 stub-0.473
chie-0.464
 magn-0.456
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
old+0.019
 Doctor+0.018
 safer+0.017
 better+0.017
 doctor+0.016
Neg logit contributions
 antenna-0.028
rys-0.026
 stub-0.025
chie-0.025
 magn-0.024
Token She
Token ID1375
Feature activation-5.27e-03

Pos logit contributions
old+0.016
 Doctor+0.015
 safer+0.013
 better+0.013
 doctor+0.013
Neg logit contributions
 antenna-0.027
rys-0.025
 stub-0.024
chie-0.024
 magn-0.023
Token had
Token ID550
Feature activation-1.51e-01

Pos logit contributions
rys+0.012
 antenna+0.011
chie+0.011
ifying+0.010
 magn+0.010
Neg logit contributions
old-0.007
 Doctor-0.006
 better-0.006
 doctor-0.006
 him-0.006
Token been
Token ID587
Feature activation-2.14e-02

Pos logit contributions
rys+0.311
 antenna+0.310
ho+0.289
 magn+0.286
rac+0.284
Neg logit contributions
old-0.198
 Doctor-0.185
 better-0.182
 safer-0.178
 him-0.175
Token average
Token ID2811
Feature activation+6.11e-02

Pos logit contributions
 antenna+0.180
rys+0.178
chie+0.169
 stub+0.159
nam+0.158
Neg logit contributions
old-0.120
Here-0.117
 safer-0.115
 Doctor-0.115
 better-0.110
Token three
Token ID1115
Feature activation-6.38e-02

Pos logit contributions
old+0.086
 Doctor+0.080
 safer+0.077
 better+0.074
Here+0.073
Neg logit contributions
 antenna-0.125
rys-0.116
 stub-0.111
chie-0.110
 magn-0.109
Token-
Token ID12
Feature activation-5.25e-02

Pos logit contributions
 antenna+0.110
rys+0.101
 stub+0.098
chie+0.093
 magn+0.092
Neg logit contributions
old-0.113
 Doctor-0.107
 better-0.099
 safer-0.099
 doctor-0.099
Tokenyear
Token ID1941
Feature activation-3.53e-02

Pos logit contributions
 antenna+0.086
rys+0.079
 stub+0.076
chie+0.076
ho+0.071
Neg logit contributions
 Doctor-0.089
old-0.089
 safer-0.086
 better-0.084
 doctor-0.081
Token-
Token ID12
Feature activation-5.44e-02

Pos logit contributions
 antenna+0.069
rys+0.065
 stub+0.063
 magn+0.061
chie+0.060
Neg logit contributions
old-0.054
 Doctor-0.049
 safer-0.046
 better-0.045
 doctor-0.045
Tokenold
Token ID727
Feature activation+2.39e-01

Pos logit contributions
 antenna+0.160
chie+0.155
rys+0.154
 stub+0.152
ho+0.149
Neg logit contributions
 Doctor-0.018
 safer-0.015
 better-0.013
 doctor-0.009
 cleaner-0.008
Token boy
Token ID2933
Feature activation+6.62e-02

Pos logit contributions
old+0.298
 Doctor+0.293
 safer+0.290
 better+0.272
Here+0.271
Neg logit contributions
 antenna-0.504
rys-0.455
 stub-0.452
chie-0.447
 magn-0.436
Token.
Token ID13
Feature activation+1.02e-02

Pos logit contributions
old+0.087
 Doctor+0.086
 safer+0.081
 better+0.078
Here+0.076
Neg logit contributions
 antenna-0.135
rys-0.125
 stub-0.123
 magn-0.118
chie-0.118
Token He
Token ID679
Feature activation+3.96e-02

Pos logit contributions
old+0.011
 safer+0.011
 Doctor+0.010
 better+0.010
 cleaner+0.009
Neg logit contributions
 antenna-0.023
rys-0.021
chie-0.021
 stub-0.021
yt-0.020
Token loved
Token ID6151
Feature activation+7.63e-03

Pos logit contributions
old+0.060
 Doctor+0.055
 better+0.053
 safer+0.052
 doctor+0.051
Neg logit contributions
 antenna-0.077
rys-0.074
chie-0.069
 stub-0.068
 magn-0.067
Token playing
Token ID2712
Feature activation+7.27e-02

Pos logit contributions
old+0.011
 Doctor+0.011
 safer+0.010
 better+0.010
 doctor+0.010
Neg logit contributions
 antenna-0.015
rys-0.014
 stub-0.013
chie-0.013
 magn-0.013
Token her
Token ID607
Feature activation-3.31e-02

Pos logit contributions
old+0.014
 Doctor+0.013
 safer+0.012
 better+0.012
 doctor+0.012
Neg logit contributions
 antenna-0.024
rys-0.022
 stub-0.022
chie-0.021
 magn-0.021
Token 3
Token ID513
Feature activation-7.84e-02

Pos logit contributions
 antenna+0.061
 magn+0.053
 stub+0.052
rys+0.050
 eight+0.049
Neg logit contributions
old-0.055
 safer-0.053
 Doctor-0.051
 healthier-0.050
 better-0.048
Token-
Token ID12
Feature activation-5.26e-02

Pos logit contributions
 antenna+0.128
rys+0.119
 stub+0.111
 magn+0.109
chie+0.105
Neg logit contributions
old-0.150
 Doctor-0.137
 doctor-0.128
 better-0.128
 safer-0.126
Tokenyear
Token ID1941
Feature activation-3.61e-02

Pos logit contributions
 antenna+0.079
chie+0.075
rys+0.071
 stub+0.071
nam+0.069
Neg logit contributions
 Doctor-0.093
 safer-0.091
 better-0.086
old-0.085
 doctor-0.084
Token-
Token ID12
Feature activation-5.39e-02

Pos logit contributions
 antenna+0.071
rys+0.066
 stub+0.064
 magn+0.062
chie+0.062
Neg logit contributions
old-0.055
 Doctor-0.051
 safer-0.047
 better-0.047
 doctor-0.046
Tokenold
Token ID727
Feature activation+2.39e-01

Pos logit contributions
chie+0.148
 antenna+0.147
nam+0.140
ho+0.139
 stub+0.139
Neg logit contributions
 Doctor-0.025
 safer-0.022
 better-0.017
 cleaner-0.014
 choice-0.014
Token daughter
Token ID4957
Feature activation+6.03e-02

Pos logit contributions
 safer+0.341
 Doctor+0.326
old+0.314
Here+0.314
 cleaner+0.298
Neg logit contributions
 antenna-0.455
 stub-0.409
chie-0.408
rys-0.391
ho-0.390
Token decided
Token ID3066
Feature activation+1.82e-02

Pos logit contributions
old+0.100
 Doctor+0.097
 safer+0.093
 better+0.090
 doctor+0.088
Neg logit contributions
 antenna-0.105
rys-0.096
 stub-0.093
chie-0.089
 magn-0.089
Token to
Token ID284
Feature activation-2.66e-02

Pos logit contributions
old+0.024
 Doctor+0.023
 better+0.022
 safer+0.022
 doctor+0.021
Neg logit contributions
 antenna-0.038
rys-0.036
 stub-0.034
chie-0.034
 magn-0.033
Token take
Token ID1011
Feature activation-2.14e-02

Pos logit contributions
 antenna+0.058
rys+0.055
 stub+0.053
chie+0.052
 magn+0.052
Neg logit contributions
old-0.032
 Doctor-0.030
 safer-0.029
 better-0.027
Here-0.026
Token a
Token ID257
Feature activation-2.29e-02

Pos logit contributions
 antenna+0.041
rys+0.039
 stub+0.037
 magn+0.037
chie+0.036
Neg logit contributions
old-0.029
 safer-0.028
 Doctor-0.027
 doctor-0.025
Here-0.025
Token but
Token ID475
Feature activation+7.62e-03

Pos logit contributions
old+0.034
 Doctor+0.031
Here+0.030
 safer+0.030
 customers+0.028
Neg logit contributions
 antenna-0.045
rys-0.041
 stub-0.040
 nowhere-0.039
yt-0.038
Token she
Token ID673
Feature activation-3.86e-02

Pos logit contributions
old+0.013
 Doctor+0.013
Here+0.012
 safer+0.012
 choice+0.012
Neg logit contributions
 antenna-0.012
 stub-0.011
rys-0.011
 nowhere-0.011
 beating-0.010
Token didn
Token ID1422
Feature activation-4.05e-02

Pos logit contributions
 antenna+0.065
rys+0.060
 stub+0.058
chie+0.055
 magn+0.055
Neg logit contributions
old-0.067
 Doctor-0.065
 safer-0.061
 better-0.060
 doctor-0.059
Token't
Token ID470
Feature activation-9.63e-02

Pos logit contributions
 antenna+0.056
rys+0.051
 stub+0.048
chie+0.047
 magn+0.045
Neg logit contributions
old-0.082
 Doctor-0.079
 safer-0.076
 better-0.075
 doctor-0.073
Token sc
Token ID629
Feature activation-4.42e-02

Pos logit contributions
 antenna+0.209
rys+0.199
 stub+0.194
chie+0.190
 magn+0.182
Neg logit contributions
old-0.117
 Doctor-0.112
Here-0.104
 safer-0.102
 doctor-0.095
Tokenold
Token ID727
Feature activation+2.38e-01

Pos logit contributions
 antenna+0.071
rys+0.068
chie+0.068
rac+0.066
 stub+0.063
Neg logit contributions
Here-0.072
 Doctor-0.070
 cleaner-0.066
 waiter-0.065
 safer-0.065
Token her
Token ID607
Feature activation-3.37e-02

Pos logit contributions
Here+0.367
old+0.358
 Doctor+0.353
 Enjoy+0.350
 safer+0.349
Neg logit contributions
 antenna-0.392
 stub-0.386
 nowhere-0.370
rys-0.367
nam-0.340
Token again
Token ID757
Feature activation+6.74e-02

Pos logit contributions
 antenna+0.061
 magn+0.054
 stub+0.054
 fists+0.053
 thumb+0.053
Neg logit contributions
Here-0.052
old-0.049
 safer-0.045
 Doctor-0.044
 Enjoy-0.042
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
old+0.086
 Doctor+0.081
 safer+0.075
Here+0.074
 better+0.071
Neg logit contributions
 antenna-0.144
rys-0.134
 stub-0.130
 magn-0.126
chie-0.125
Token She
Token ID1375
Feature activation+3.63e-03

Pos logit contributions
old+0.021
 Doctor+0.020
 safer+0.019
 better+0.019
 doctor+0.018
Neg logit contributions
 antenna-0.036
rys-0.034
 stub-0.033
chie-0.032
 magn-0.031
Token simply
Token ID2391
Feature activation+7.10e-03

Pos logit contributions
old+0.006
 Doctor+0.006
 safer+0.005
 better+0.005
 doctor+0.005
Neg logit contributions
 antenna-0.007
rys-0.006
 stub-0.006
chie-0.006
 magn-0.006
Token he
Token ID339
Feature activation+3.37e-03

Pos logit contributions
 antenna+0.035
rys+0.034
 stub+0.033
chie+0.032
 magn+0.032
Neg logit contributions
old-0.008
 Doctor-0.008
 better-0.007
 safer-0.007
 doctor-0.006
Token waved
Token ID26834
Feature activation-3.48e-02

Pos logit contributions
 better+0.005
old+0.005
 Doctor+0.005
 doctor+0.005
 him+0.005
Neg logit contributions
rys-0.006
 antenna-0.005
ifying-0.005
nam-0.005
 magn-0.005
Token one
Token ID530
Feature activation+2.75e-03

Pos logit contributions
 antenna+0.075
rys+0.070
 stub+0.067
chie+0.066
 magn+0.065
Neg logit contributions
old-0.045
 Doctor-0.043
 safer-0.040
 better-0.039
 doctor-0.038
Token of
Token ID286
Feature activation+9.65e-02

Pos logit contributions
old+0.004
 Doctor+0.004
 safer+0.003
 better+0.003
 doctor+0.003
Neg logit contributions
 antenna-0.006
rys-0.005
 stub-0.005
chie-0.005
 magn-0.005
Token his
Token ID465
Feature activation-1.62e-03

Pos logit contributions
old+0.096
 Doctor+0.090
 safer+0.082
 better+0.081
 doctor+0.077
Neg logit contributions
 antenna-0.235
rys-0.223
 stub-0.216
chie-0.212
 magn-0.210
Token antenna
Token ID20509
Feature activation-2.76e-01

Pos logit contributions
 stub+0.002
rys+0.002
yt+0.002
nam+0.001
 antenna+0.001
Neg logit contributions
old-0.003
 better-0.003
 safer-0.003
 new-0.003
 Doctor-0.003
Tokene
Token ID68
Feature activation-1.05e-01

Pos logit contributions
 antenna+0.409
rys+0.379
 stub+0.354
chie+0.346
 magn+0.341
Neg logit contributions
old-0.543
 Doctor-0.499
 better-0.496
 him-0.495
 safer-0.484
Token.
Token ID13
Feature activation+8.75e-03

Pos logit contributions
rys+0.234
 antenna+0.232
nam+0.226
chie+0.221
 stub+0.221
Neg logit contributions
old-0.103
 him-0.102
 better-0.093
 Doctor-0.091
 doctor-0.087
Token
Token ID198
Feature activation-1.89e-02

Pos logit contributions
old+0.012
 safer+0.011
 Doctor+0.011
 better+0.010
 choice+0.010
Neg logit contributions
 antenna-0.018
rys-0.016
 stub-0.016
Lou-0.016
rac-0.015
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
 antenna+0.040
rys+0.037
 stub+0.037
Lou+0.036
chie+0.036
Neg logit contributions
old-0.023
 Doctor-0.022
 safer-0.021
 cleaner-0.021
 better-0.021
TokenThe
Token ID464
Feature activation-1.73e-02

Pos logit contributions
 antenna+0.036
rys+0.034
 stub+0.032
 magn+0.031
chie+0.031
Neg logit contributions
old-0.028
 Doctor-0.027
 safer-0.025
 better-0.025
 doctor-0.024
Token pale
Token ID14005
Feature activation-1.25e-02

Pos logit contributions
 antenna+0.047
rac+0.046
rys+0.044
chie+0.044
 tentacles+0.040
Neg logit contributions
 Doctor-0.041
 better-0.041
 doctor-0.039
 safer-0.039
 ice-0.037
Token yellow
Token ID7872
Feature activation-4.21e-02

Pos logit contributions
 antenna+0.021
rac+0.020
chie+0.019
rys+0.019
nam+0.018
Neg logit contributions
 Doctor-0.020
old-0.020
 better-0.020
 doctor-0.020
 ice-0.019
Token bug
Token ID5434
Feature activation-1.02e-01

Pos logit contributions
 antenna+0.075
chie+0.072
rac+0.071
rys+0.070
nam+0.069
Neg logit contributions
 doctor-0.061
 Doctor-0.061
 better-0.060
 ice-0.060
old-0.058
Token with
Token ID351
Feature activation-5.86e-02

Pos logit contributions
 antenna+0.247
chie+0.244
rac+0.242
rys+0.237
nam+0.228
Neg logit contributions
old-0.092
 better-0.091
 Doctor-0.087
 doctor-0.084
 safer-0.083
Token long
Token ID890
Feature activation-1.79e-01

Pos logit contributions
rys+0.117
 antenna+0.113
yt+0.113
rac+0.112
chie+0.112
Neg logit contributions
 him-0.071
old-0.069
 better-0.069
 ice-0.068
 Doctor-0.064
Token antenna
Token ID20509
Feature activation-2.76e-01

Pos logit contributions
chie+0.256
nam+0.254
rac+0.243
rys+0.233
spe+0.228
Neg logit contributions
 ice-0.317
 better-0.302
 chocolate-0.300
 him-0.293
 less-0.292
Tokene
Token ID68
Feature activation-1.03e-01

Pos logit contributions
 antenna+0.540
rys+0.523
chie+0.512
rac+0.508
 stub+0.495
Neg logit contributions
old-0.388
 him-0.368
 better-0.352
 Doctor-0.322
 safer-0.322
Token.
Token ID13
Feature activation+3.08e-03

Pos logit contributions
rys+0.228
nam+0.226
chie+0.225
 antenna+0.214
rac+0.211
Neg logit contributions
 him-0.103
old-0.097
 ice-0.096
 Tom-0.092
 Doctor-0.091
Token Lucy
Token ID22162
Feature activation+1.04e-02

Pos logit contributions
old+0.004
 Doctor+0.004
 safer+0.004
 better+0.003
 doctor+0.003
Neg logit contributions
 antenna-0.007
rys-0.006
 stub-0.006
 nowhere-0.006
chie-0.006
Token smiled
Token ID13541
Feature activation-4.97e-02

Pos logit contributions
old+0.014
 better+0.011
 him+0.011
 Doctor+0.011
 ice+0.010
Neg logit contributions
rys-0.022
chie-0.021
ifying-0.021
 antenna-0.020
nam-0.020
Token and
Token ID290
Feature activation+8.00e-03

Pos logit contributions
 antenna+0.109
rys+0.103
 stub+0.099
chie+0.097
 magn+0.096
Neg logit contributions
old-0.062
 Doctor-0.058
 safer-0.054
 better-0.053
 doctor-0.051
TokenThe
Token ID464
Feature activation-1.73e-02

Pos logit contributions
 antenna+0.036
rys+0.034
 stub+0.032
 magn+0.031
chie+0.031
Neg logit contributions
old-0.028
 Doctor-0.027
 safer-0.025
 better-0.025
 doctor-0.024
Token butterfly
Token ID35113
Feature activation-1.19e-01

Pos logit contributions
 antenna+0.028
chie+0.027
rys+0.027
ifying+0.027
yt+0.027
Neg logit contributions
 better-0.028
 Doctor-0.027
 doctor-0.027
 safer-0.026
old-0.026
Token thought
Token ID1807
Feature activation-5.65e-02

Pos logit contributions
rys+0.188
nam+0.180
 antenna+0.179
ifying+0.175
chie+0.174
Neg logit contributions
 better-0.210
old-0.206
 safer-0.193
 doctor-0.190
 less-0.187
Token the
Token ID262
Feature activation+3.14e-02

Pos logit contributions
 antenna+0.130
rys+0.119
 stub+0.115
 magn+0.113
chie+0.113
Neg logit contributions
old-0.063
 Doctor-0.062
Here-0.056
 safer-0.055
 doctor-0.053
Token waving
Token ID25849
Feature activation+7.06e-03

Pos logit contributions
 better+0.053
 Doctor+0.050
 safer+0.050
old+0.050
 doctor+0.049
Neg logit contributions
 antenna-0.051
rys-0.048
chie-0.047
ifying-0.046
 stub-0.046
Token antenna
Token ID20509
Feature activation-2.74e-01

Pos logit contributions
old+0.010
 better+0.010
 him+0.010
 doctor+0.009
 Doctor+0.009
Neg logit contributions
 antenna-0.012
nam-0.012
rys-0.012
yt-0.012
 stub-0.012
Tokene
Token ID68
Feature activation-1.04e-01

Pos logit contributions
 antenna+0.388
rys+0.357
 stub+0.336
chie+0.324
 magn+0.317
Neg logit contributions
old-0.554
 Doctor-0.522
 better-0.510
 safer-0.506
 him-0.501
Token was
Token ID373
Feature activation+7.92e-03

Pos logit contributions
rys+0.205
 antenna+0.200
 stub+0.194
nam+0.192
chie+0.189
Neg logit contributions
old-0.136
 him-0.131
 better-0.129
 Doctor-0.122
 ice-0.119
Token so
Token ID523
Feature activation-1.22e-02

Pos logit contributions
old+0.012
 Doctor+0.011
 better+0.011
 safer+0.011
 doctor+0.010
Neg logit contributions
 antenna-0.015
rys-0.014
 stub-0.014
chie-0.014
 magn-0.013
Token funny
Token ID8258
Feature activation+3.48e-02

Pos logit contributions
 antenna+0.024
rys+0.022
 stub+0.021
chie+0.021
 magn+0.020
Neg logit contributions
old-0.018
 Doctor-0.018
 doctor-0.016
 safer-0.016
Here-0.015
Token that
Token ID326
Feature activation+3.16e-02

Pos logit contributions
old+0.050
 Doctor+0.048
Here+0.045
 safer+0.044
 doctor+0.042
Neg logit contributions
 antenna-0.071
rys-0.063
 tentacles-0.060
 magn-0.060
yt-0.060
Token ground
Token ID2323
Feature activation-3.51e-02

Pos logit contributions
 Doctor+0.002
old+0.002
 better+0.002
 safer+0.002
 doctor+0.002
Neg logit contributions
 antenna-0.001
rys-0.001
chie-0.001
 stub-0.001
rac-0.001
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
 antenna+0.082
rys+0.081
chie+0.078
 stub+0.076
rac+0.075
Neg logit contributions
 better-0.036
 safer-0.035
old-0.035
 Doctor-0.033
 doctor-0.031
Token He
Token ID679
Feature activation+4.29e-02

Pos logit contributions
old+0.014
 safer+0.012
 choice+0.012
 better+0.012
 Doctor+0.011
Neg logit contributions
 antenna-0.026
rys-0.023
 stub-0.023
 tentacles-0.022
 nowhere-0.022
Token waved
Token ID26834
Feature activation-3.25e-02

Pos logit contributions
old+0.069
 Doctor+0.066
 better+0.065
 safer+0.064
 doctor+0.063
Neg logit contributions
rys-0.077
 antenna-0.074
chie-0.072
ifying-0.067
yt-0.067
Token his
Token ID465
Feature activation-1.74e-03

Pos logit contributions
 antenna+0.070
rys+0.064
 stub+0.062
chie+0.061
 magn+0.060
Neg logit contributions
old-0.043
 Doctor-0.041
 safer-0.037
 better-0.036
 doctor-0.035
Token antenna
Token ID20509
Feature activation-2.73e-01

Pos logit contributions
rys+0.002
 stub+0.002
yt+0.002
chie+0.002
ho+0.002
Neg logit contributions
 better-0.003
 safer-0.003
 ice-0.003
 Doctor-0.003
 doctor-0.003
Tokene
Token ID68
Feature activation-1.01e-01

Pos logit contributions
 antenna+0.418
rys+0.388
 stub+0.366
chie+0.355
 magn+0.348
Neg logit contributions
old-0.525
 Doctor-0.493
 better-0.480
 safer-0.477
 him-0.469
Token and
Token ID290
Feature activation+9.56e-03

Pos logit contributions
rys+0.233
 antenna+0.229
chie+0.221
 stub+0.219
nam+0.217
Neg logit contributions
old-0.099
 him-0.094
 Doctor-0.092
 better-0.090
 doctor-0.085
Token made
Token ID925
Feature activation-2.44e-02

Pos logit contributions
 better+0.014
 Doctor+0.014
 safer+0.014
old+0.014
 him+0.014
Neg logit contributions
rys-0.017
chie-0.017
 antenna-0.016
ifying-0.016
 stub-0.015
Token some
Token ID617
Feature activation-8.17e-04

Pos logit contributions
 antenna+0.047
 wings+0.040
 tentacles+0.039
 stub+0.038
Lou+0.037
Neg logit contributions
old-0.034
Here-0.032
 Doctor-0.031
orting-0.031
Deal-0.030
Token special
Token ID2041
Feature activation+2.16e-02

Pos logit contributions
 antenna+0.002
rys+0.001
 magn+0.001
 wings+0.001
 tentacles+0.001
Neg logit contributions
old-0.001
 Doctor-0.001
Here-0.001
 Enjoy-0.001
 Tommy-0.001
Token and
Token ID290
Feature activation+1.54e-02

Pos logit contributions
 antenna+0.227
rys+0.208
 stub+0.200
chie+0.194
 magn+0.194
Neg logit contributions
old-0.195
 Doctor-0.189
 safer-0.176
 better-0.171
 doctor-0.170
Token waved
Token ID26834
Feature activation-3.14e-02

Pos logit contributions
old+0.019
 Doctor+0.018
 better+0.017
 safer+0.016
 him+0.016
Neg logit contributions
 antenna-0.033
rys-0.033
chie-0.031
 stub-0.030
rac-0.030
Token one
Token ID530
Feature activation+3.98e-03

Pos logit contributions
 antenna+0.063
rys+0.058
 stub+0.056
chie+0.055
 magn+0.054
Neg logit contributions
old-0.046
 Doctor-0.044
 safer-0.041
 better-0.040
 doctor-0.039
Token of
Token ID286
Feature activation+9.93e-02

Pos logit contributions
old+0.005
 Doctor+0.005
 safer+0.004
 doctor+0.004
Here+0.004
Neg logit contributions
 antenna-0.009
rys-0.008
 stub-0.008
 magn-0.008
chie-0.008
Token its
Token ID663
Feature activation-9.51e-02

Pos logit contributions
old+0.124
 Doctor+0.116
 safer+0.105
 doctor+0.104
 better+0.101
Neg logit contributions
 antenna-0.223
rys-0.209
 nowhere-0.193
 stub-0.193
 magn-0.192
Token antenna
Token ID20509
Feature activation-2.72e-01

Pos logit contributions
rys+0.147
 stub+0.142
 antenna+0.140
rac+0.138
nam+0.134
Neg logit contributions
 better-0.153
old-0.149
 safer-0.148
 him-0.144
 new-0.142
Tokene
Token ID68
Feature activation-1.04e-01

Pos logit contributions
 antenna+0.421
rys+0.389
 stub+0.367
chie+0.359
 magn+0.350
Neg logit contributions
old-0.515
 Doctor-0.481
 better-0.469
 safer-0.466
 him-0.459
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 antenna+0.238
rys+0.237
chie+0.225
 stub+0.223
rac+0.220
Neg logit contributions
old-0.104
 him-0.096
 Doctor-0.092
 better-0.091
 doctor-0.086
Token Bob
Token ID5811
Feature activation+3.08e-02

Pos logit contributions
old+0.016
 safer+0.014
 better+0.014
 Doctor+0.013
 cleaner+0.013
Neg logit contributions
 antenna-0.026
rys-0.022
ho-0.022
 tentacles-0.022
 eight-0.022
Tokenbie
Token ID12590
Feature activation-1.23e-01

Pos logit contributions
old+0.057
 him+0.050
 Doctor+0.049
 better+0.049
 ice+0.047
Neg logit contributions
 antenna-0.048
rys-0.047
 stub-0.043
nam-0.043
 magn-0.042
Token waved
Token ID26834
Feature activation-3.10e-02

Pos logit contributions
rys+0.237
 antenna+0.227
nam+0.213
ifying+0.211
 magn+0.211
Neg logit contributions
old-0.193
 better-0.171
 him-0.163
 Doctor-0.160
 doctor-0.160
Token had
Token ID550
Feature activation-1.50e-01

Pos logit contributions
 antenna+0.143
rys+0.135
rac+0.126
chie+0.126
 stub+0.125
Neg logit contributions
old-0.098
 better-0.090
 Doctor-0.089
 safer-0.087
 doctor-0.083
Token wings
Token ID12098
Feature activation-1.82e-01

Pos logit contributions
rys+0.331
 antenna+0.320
yt+0.317
ho+0.315
chie+0.313
Neg logit contributions
 better-0.174
 safer-0.166
old-0.161
 less-0.153
 Doctor-0.153
Token and
Token ID290
Feature activation+1.20e-02

Pos logit contributions
 antenna+0.447
rys+0.433
 stub+0.412
chie+0.408
 magn+0.407
Neg logit contributions
old-0.172
 better-0.155
 Doctor-0.153
 safer-0.151
 him-0.139
Token legs
Token ID7405
Feature activation-2.26e-01

Pos logit contributions
old+0.018
 better+0.017
 safer+0.017
 Doctor+0.016
 him+0.016
Neg logit contributions
rys-0.021
chie-0.021
 antenna-0.020
yt-0.020
rac-0.019
Token and
Token ID290
Feature activation+1.22e-02

Pos logit contributions
 antenna+0.524
rys+0.510
 stub+0.476
 magn+0.473
chie+0.469
Neg logit contributions
old-0.250
 better-0.227
 Doctor-0.223
 safer-0.217
 him-0.209
Token antenna
Token ID20509
Feature activation-2.71e-01

Pos logit contributions
 better+0.018
 safer+0.017
old+0.017
 him+0.017
 Doctor+0.017
Neg logit contributions
rys-0.021
chie-0.021
 antenna-0.020
yt-0.020
 stub-0.019
Tokene
Token ID68
Feature activation-1.03e-01

Pos logit contributions
 antenna+0.520
rys+0.518
 stub+0.478
yt+0.475
 magn+0.470
Neg logit contributions
old-0.386
 him-0.384
 better-0.378
 safer-0.343
 Doctor-0.333
Token and
Token ID290
Feature activation+1.32e-02

Pos logit contributions
rys+0.207
 antenna+0.207
 stub+0.194
chie+0.192
 magn+0.190
Neg logit contributions
old-0.133
 Doctor-0.125
 him-0.122
 better-0.120
 doctor-0.115
Token eyes
Token ID2951
Feature activation-1.01e-01

Pos logit contributions
 better+0.019
 safer+0.018
 Doctor+0.018
old+0.018
 him+0.017
Neg logit contributions
rys-0.024
chie-0.024
 antenna-0.023
yt-0.023
 stub-0.022
Token.
Token ID13
Feature activation+1.16e-02

Pos logit contributions
 antenna+0.235
rys+0.226
 stub+0.214
chie+0.213
 magn+0.210
Neg logit contributions
old-0.108
 Doctor-0.101
 better-0.097
 safer-0.095
 him-0.089
Token I
Token ID314
Feature activation-6.88e-02

Pos logit contributions
old+0.017
 Doctor+0.016
 safer+0.016
 better+0.015
 cleaner+0.015
Neg logit contributions
 antenna-0.022
rys-0.021
 stub-0.020
 magn-0.019
Lou-0.019
Token see
Token ID766
Feature activation+6.41e-02

Pos logit contributions
 antenna+0.012
rys+0.012
chie+0.012
 stub+0.012
 magn+0.011
Neg logit contributions
old-0.005
 Doctor-0.005
 better-0.005
 him-0.005
 safer-0.005
Token its
Token ID663
Feature activation-9.40e-02

Pos logit contributions
old+0.080
 Doctor+0.076
 safer+0.068
 doctor+0.067
 better+0.065
Neg logit contributions
 antenna-0.142
rys-0.131
 stub-0.125
 magn-0.124
 nowhere-0.121
Token eyes
Token ID2951
Feature activation-1.02e-01

Pos logit contributions
rys+0.187
 antenna+0.186
chie+0.186
 stub+0.183
rac+0.177
Neg logit contributions
 better-0.112
 safer-0.111
old-0.105
 Doctor-0.103
 choice-0.102
Token,
Token ID11
Feature activation-1.17e-02

Pos logit contributions
 antenna+0.209
rys+0.202
chie+0.195
 stub+0.191
rac+0.185
Neg logit contributions
 better-0.135
 safer-0.128
old-0.126
 Doctor-0.121
 less-0.118
Token its
Token ID663
Feature activation-9.37e-02

Pos logit contributions
 antenna+0.025
rys+0.024
chie+0.023
 stub+0.023
rac+0.022
Neg logit contributions
old-0.014
 Doctor-0.014
 better-0.013
 safer-0.013
 him-0.013
Token antenna
Token ID20509
Feature activation-2.70e-01

Pos logit contributions
rys+0.141
chie+0.137
 stub+0.136
Ol+0.133
yt+0.130
Neg logit contributions
 better-0.154
 new-0.154
 safer-0.151
 welcome-0.145
 less-0.144
Tokene
Token ID68
Feature activation-1.03e-01

Pos logit contributions
 antenna+0.460
rys+0.445
 stub+0.419
chie+0.413
 magn+0.397
Neg logit contributions
old-0.453
 better-0.423
 safer-0.414
 him-0.410
 Doctor-0.405
Token,
Token ID11
Feature activation-1.13e-02

Pos logit contributions
rys+0.188
 antenna+0.179
chie+0.177
 stub+0.175
nam+0.174
Neg logit contributions
old-0.142
 him-0.140
 better-0.139
 Doctor-0.139
 ice-0.136
Token and
Token ID290
Feature activation+1.61e-02

Pos logit contributions
rys+0.025
chie+0.025
 antenna+0.025
 stub+0.024
yt+0.023
Neg logit contributions
 better-0.012
 Doctor-0.011
 safer-0.011
old-0.011
 him-0.011
Token its
Token ID663
Feature activation-9.33e-02

Pos logit contributions
 better+0.018
 safer+0.017
 Doctor+0.017
old+0.017
 him+0.016
Neg logit contributions
chie-0.035
rys-0.035
 antenna-0.034
 stub-0.033
yt-0.033
Token scales
Token ID16252
Feature activation-7.21e-02

Pos logit contributions
rys+0.135
chie+0.132
 stub+0.130
ifying+0.128
yt+0.128
Neg logit contributions
 better-0.159
 new-0.157
 safer-0.155
 choice-0.150
 welcome-0.149
Token Bob
Token ID5811
Feature activation+3.21e-02

Pos logit contributions
 antenna+0.021
 nowhere+0.018
ho+0.018
rys+0.018
 magn+0.018
Neg logit contributions
old-0.013
 Doctor-0.011
 safer-0.010
 doctor-0.010
Here-0.010
Token w
Token ID266
Feature activation-6.74e-02

Pos logit contributions
old+0.051
 better+0.048
 Doctor+0.047
 doctor+0.045
 safer+0.045
Neg logit contributions
 antenna-0.059
rys-0.058
chie-0.054
nam-0.054
rac-0.051
Tokenigg
Token ID6950
Feature activation-6.75e-02

Pos logit contributions
 antenna+0.124
 stub+0.113
rys+0.113
nam+0.113
yt+0.110
Neg logit contributions
old-0.110
 Doctor-0.097
 safer-0.092
 better-0.091
 doctor-0.090
Tokenled
Token ID992
Feature activation-7.91e-02

Pos logit contributions
 antenna+0.104
rys+0.095
 stub+0.090
 magn+0.088
chie+0.087
Neg logit contributions
old-0.128
 Doctor-0.120
 better-0.117
 safer-0.116
 him-0.112
Token his
Token ID465
Feature activation-3.44e-03

Pos logit contributions
 antenna+0.184
rys+0.176
 stub+0.169
chie+0.167
 magn+0.163
Neg logit contributions
old-0.085
 Doctor-0.081
 better-0.075
 safer-0.075
 him-0.071
Token antenna
Token ID20509
Feature activation-2.69e-01

Pos logit contributions
 antenna+0.006
rys+0.005
 stub+0.005
chie+0.005
yt+0.005
Neg logit contributions
old-0.006
 Doctor-0.006
 safer-0.006
 better-0.006
 doctor-0.005
Tokene
Token ID68
Feature activation-1.03e-01

Pos logit contributions
 antenna+0.396
rys+0.373
 stub+0.341
chie+0.336
 magn+0.329
Neg logit contributions
old-0.532
 Doctor-0.494
 better-0.487
 him-0.480
 safer-0.479
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
rys+0.202
 antenna+0.196
nam+0.194
chie+0.192
 stub+0.183
Neg logit contributions
old-0.132
 him-0.125
 Doctor-0.125
 doctor-0.122
 better-0.122
Token flew
Token ID13112
Feature activation-6.85e-02

Pos logit contributions
 better+0.027
 Doctor+0.026
 safer+0.025
old+0.025
 him+0.025
Neg logit contributions
rys-0.031
chie-0.030
 antenna-0.029
ifying-0.029
rac-0.027
Token away
Token ID1497
Feature activation-5.01e-02

Pos logit contributions
 antenna+0.132
rys+0.121
 stub+0.116
chie+0.114
 magn+0.112
Neg logit contributions
old-0.106
 Doctor-0.100
 safer-0.093
 better-0.091
 doctor-0.090
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
 antenna+0.114
rys+0.111
 stub+0.106
rac+0.106
chie+0.105
Neg logit contributions
 better-0.053
old-0.052
 safer-0.051
 Doctor-0.050
 him-0.046
Token sweet
Token ID6029
Feature activation-1.55e-02

Pos logit contributions
 antenna+0.044
rys+0.041
 stub+0.039
chie+0.038
 magn+0.038
Neg logit contributions
old-0.038
 Doctor-0.036
 safer-0.034
 better-0.034
 doctor-0.033
Token tune
Token ID14009
Feature activation-1.00e-01

Pos logit contributions
 antenna+0.028
rys+0.026
 stub+0.025
 magn+0.024
yt+0.023
Neg logit contributions
old-0.025
 Doctor-0.024
 safer-0.024
Here-0.022
 better-0.022
Token,
Token ID11
Feature activation-1.67e-02

Pos logit contributions
 antenna+0.215
rys+0.204
 stub+0.195
chie+0.193
 magn+0.189
Neg logit contributions
old-0.127
 Doctor-0.121
 better-0.113
 safer-0.111
 doctor-0.109
Token a
Token ID257
Feature activation-2.43e-02

Pos logit contributions
 antenna+0.037
rys+0.035
 stub+0.034
chie+0.033
 magn+0.033
Neg logit contributions
old-0.020
 Doctor-0.019
 safer-0.017
 better-0.017
 doctor-0.017
Token mis
Token ID2984
Feature activation-7.87e-02

Pos logit contributions
 antenna+0.048
rys+0.046
rac+0.043
chie+0.042
 stub+0.042
Neg logit contributions
 Doctor-0.034
old-0.033
 doctor-0.032
 better-0.032
 safer-0.031
Tokenchie
Token ID3043
Feature activation-2.68e-01

Pos logit contributions
 antenna+0.053
 magn+0.043
 stub+0.040
rys+0.040
 nowhere+0.034
Neg logit contributions
old-0.228
 Doctor-0.200
 better-0.199
 safer-0.198
 cleaner-0.198
Tokenvous
Token ID31222
Feature activation+6.32e-02

Pos logit contributions
 antenna+0.607
rys+0.567
 magn+0.542
 stub+0.540
chie+0.526
Neg logit contributions
old-0.333
 Doctor-0.294
 better-0.274
 safer-0.271
 doctor-0.257
Token fox
Token ID21831
Feature activation-7.10e-02

Pos logit contributions
old+0.098
 Doctor+0.094
 safer+0.088
 better+0.088
 doctor+0.086
Neg logit contributions
 antenna-0.118
rys-0.110
 stub-0.105
chie-0.103
 magn-0.102
Token appeared
Token ID4120
Feature activation+4.38e-02

Pos logit contributions
 antenna+0.114
rys+0.110
chie+0.103
rac+0.103
nam+0.100
Neg logit contributions
old-0.127
 Doctor-0.119
 him-0.117
 better-0.116
 doctor-0.115
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
old+0.049
 better+0.047
 Doctor+0.047
 him+0.045
 safer+0.044
Neg logit contributions
 antenna-0.097
rys-0.094
 stub-0.091
chie-0.089
rac-0.088
Token "
Token ID366
Feature activation-7.14e-02

Pos logit contributions
old+0.014
 Doctor+0.012
 safer+0.012
 better+0.012
 cleaner+0.011
Neg logit contributions
 antenna-0.023
rys-0.021
 stub-0.021
chie-0.020
 magn-0.020
Token happy
Token ID3772
Feature activation+3.75e-02

Pos logit contributions
old+0.022
 Doctor+0.020
 doctor+0.018
Here+0.018
 customers+0.017
Neg logit contributions
 antenna-0.033
rys-0.031
 stub-0.030
 nowhere-0.029
 tentacles-0.029
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
old+0.056
 Doctor+0.052
Here+0.049
 safer+0.047
 better+0.045
Neg logit contributions
 antenna-0.075
rys-0.067
 nowhere-0.066
 stub-0.065
 tentacles-0.064
Token It
Token ID632
Feature activation+5.46e-02

Pos logit contributions
old+0.025
 Doctor+0.023
 cleaner+0.022
 groceries+0.022
 better+0.021
Neg logit contributions
 antenna-0.033
rys-0.030
rac-0.029
 nowhere-0.028
 stub-0.028
Token waved
Token ID26834
Feature activation-2.81e-02

Pos logit contributions
old+0.088
 better+0.086
 Doctor+0.085
 safer+0.084
 doctor+0.081
Neg logit contributions
 antenna-0.094
rys-0.091
chie-0.088
rac-0.083
 stub-0.083
Token its
Token ID663
Feature activation-9.20e-02

Pos logit contributions
 antenna+0.054
rys+0.050
 stub+0.047
chie+0.046
 magn+0.046
Neg logit contributions
old-0.043
 Doctor-0.042
 safer-0.039
 better-0.038
 doctor-0.037
Token antenna
Token ID20509
Feature activation-2.68e-01

Pos logit contributions
rys+0.129
 stub+0.125
yt+0.125
nam+0.123
spe+0.120
Neg logit contributions
 better-0.164
 welcome-0.156
 new-0.153
 safer-0.153
 less-0.148
Tokene
Token ID68
Feature activation-1.01e-01

Pos logit contributions
 antenna+0.375
rys+0.340
 stub+0.319
chie+0.310
 magn+0.303
Neg logit contributions
old-0.546
 Doctor-0.524
 safer-0.505
 better-0.502
 doctor-0.489
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
rys+0.237
 antenna+0.235
chie+0.229
 stub+0.224
nam+0.221
Neg logit contributions
 better-0.089
 him-0.087
old-0.087
 ice-0.081
 Doctor-0.080
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
old+0.026
 Doctor+0.023
 choice+0.021
 groceries+0.021
 cleaner+0.021
Neg logit contributions
 antenna-0.034
rys-0.030
rac-0.030
 stub-0.029
 nowhere-0.029
Token
Token ID198
Feature activation-1.23e-02

Pos logit contributions
 antenna+0.024
 stub+0.021
rys+0.021
rac+0.021
 nowhere+0.021
Neg logit contributions
 Doctor-0.017
old-0.017
 cleaner-0.017
 safer-0.016
 customers-0.016
Token"
Token ID1
Feature activation-7.73e-02

Pos logit contributions
 antenna+0.026
rys+0.024
 stub+0.023
chie+0.023
rac+0.023
Neg logit contributions
old-0.016
 Doctor-0.016
 safer-0.014
 better-0.014
 doctor-0.014
TokenBut
Token ID1537
Feature activation-2.30e-02

Pos logit contributions
 antenna+0.049
rys+0.045
 stub+0.043
 magn+0.043
yt+0.042
Neg logit contributions
old-0.033
 Doctor-0.031
Here-0.029
 safer-0.029
 better-0.029
Token then
Token ID788
Feature activation+1.01e-01

Pos logit contributions
 antenna+0.053
rys+0.050
 stub+0.048
chie+0.048
 magn+0.047
Neg logit contributions
old-0.026
 Doctor-0.025
 safer-0.023
 better-0.023
 doctor-0.022
Token,
Token ID11
Feature activation-1.86e-02

Pos logit contributions
old+0.120
 Doctor+0.116
 better+0.107
 safer+0.105
 doctor+0.103
Neg logit contributions
 antenna-0.226
rys-0.214
 stub-0.205
chie-0.204
 magn-0.197
Token a
Token ID257
Feature activation-2.62e-02

Pos logit contributions
 antenna+0.041
rys+0.038
 stub+0.037
 magn+0.036
chie+0.036
Neg logit contributions
old-0.023
 Doctor-0.021
 safer-0.020
 better-0.019
Here-0.018
Token mis
Token ID2984
Feature activation-7.88e-02

Pos logit contributions
 antenna+0.052
rys+0.049
rac+0.047
 stub+0.046
chie+0.046
Neg logit contributions
 Doctor-0.037
old-0.036
 better-0.034
 doctor-0.034
 safer-0.034
Tokenchie
Token ID3043
Feature activation-2.68e-01

Pos logit contributions
 antenna+0.055
 magn+0.045
 stub+0.043
rys+0.038
 tentacles+0.036
Neg logit contributions
old-0.231
 Doctor-0.202
 safer-0.201
 better-0.201
 cleaner-0.199
Tokenvous
Token ID31222
Feature activation+6.25e-02

Pos logit contributions
 antenna+0.619
rys+0.577
 magn+0.557
 stub+0.550
 tentacles+0.536
Neg logit contributions
old-0.328
 Doctor-0.284
 better-0.266
 safer-0.261
 doctor-0.247
Token rabbit
Token ID22746
Feature activation-8.50e-02

Pos logit contributions
old+0.093
 Doctor+0.087
 safer+0.084
 better+0.081
Here+0.080
Neg logit contributions
 antenna-0.122
rys-0.114
 stub-0.109
chie-0.106
 magn-0.105
Token came
Token ID1625
Feature activation+4.04e-02

Pos logit contributions
 antenna+0.144
rys+0.135
chie+0.129
rac+0.128
nam+0.125
Neg logit contributions
old-0.146
 Doctor-0.140
 better-0.134
 doctor-0.133
 him-0.132
Token and
Token ID290
Feature activation+9.11e-03

Pos logit contributions
old+0.052
 better+0.048
 Doctor+0.047
 him+0.046
 safer+0.044
Neg logit contributions
 antenna-0.086
rys-0.082
 stub-0.079
rac-0.078
nam-0.078
Token started
Token ID2067
Feature activation+5.33e-02

Pos logit contributions
 Doctor+0.014
old+0.014
 better+0.014
 him+0.013
 safer+0.013
Neg logit contributions
 antenna-0.017
rys-0.016
chie-0.015
rac-0.015
yt-0.015
Token her
Token ID607
Feature activation-3.50e-02

Pos logit contributions
old+0.017
 Doctor+0.016
 safer+0.015
 better+0.015
 doctor+0.015
Neg logit contributions
 antenna-0.027
rys-0.025
 stub-0.024
chie-0.024
 magn-0.023
Token to
Token ID284
Feature activation-1.97e-02

Pos logit contributions
 antenna+0.082
rys+0.078
chie+0.075
 stub+0.075
rac+0.073
Neg logit contributions
old-0.039
 Doctor-0.036
 better-0.034
 safer-0.033
 doctor-0.033
Token grab
Token ID5552
Feature activation+7.74e-03

Pos logit contributions
 antenna+0.043
rys+0.041
 stub+0.039
chie+0.039
 magn+0.038
Neg logit contributions
old-0.023
 Doctor-0.022
 better-0.022
 safer-0.021
 doctor-0.021
Token onto
Token ID4291
Feature activation-6.96e-02

Pos logit contributions
old+0.009
 Doctor+0.009
 safer+0.008
 better+0.008
 doctor+0.008
Neg logit contributions
 antenna-0.017
rys-0.016
 stub-0.016
chie-0.015
 magn-0.015
Token her
Token ID607
Feature activation-3.49e-02

Pos logit contributions
 antenna+0.160
rys+0.151
 stub+0.145
chie+0.142
 magn+0.141
Neg logit contributions
old-0.080
 Doctor-0.074
 safer-0.068
 better-0.067
 doctor-0.064
Token antenna
Token ID20509
Feature activation-2.67e-01

Pos logit contributions
rys+0.051
 antenna+0.050
ho+0.049
 stub+0.049
chie+0.049
Neg logit contributions
 better-0.061
 safer-0.059
 ice-0.059
 new-0.057
 doctor-0.056
Tokene
Token ID68
Feature activation-1.00e-01

Pos logit contributions
 antenna+0.414
rys+0.378
 stub+0.356
chie+0.355
 magn+0.342
Neg logit contributions
old-0.521
 Doctor-0.472
 better-0.472
 him-0.469
 safer-0.462
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
 antenna+0.245
rys+0.244
nam+0.236
chie+0.234
 stub+0.231
Neg logit contributions
old-0.080
 him-0.072
 Doctor-0.070
 doctor-0.068
 better-0.067
Token The
Token ID383
Feature activation+2.70e-02

Pos logit contributions
old+0.019
 Doctor+0.017
 safer+0.017
 better+0.016
 doctor+0.015
Neg logit contributions
 antenna-0.034
rys-0.032
 stub-0.031
chie-0.030
 magn-0.030
Token different
Token ID1180
Feature activation+7.23e-02

Pos logit contributions
 better+0.043
old+0.041
 safer+0.040
 Doctor+0.040
 doctor+0.040
Neg logit contributions
 antenna-0.046
chie-0.042
ifying-0.042
rys-0.042
 tentacles-0.041
Token butterfly
Token ID35113
Feature activation-1.11e-01

Pos logit contributions
old+0.094
 better+0.092
 new+0.087
 welcome+0.084
 doctor+0.083
Neg logit contributions
ifying-0.140
 antenna-0.134
rys-0.133
 stub-0.130
chie-0.130
Token The
Token ID383
Feature activation+1.82e-02

Pos logit contributions
old+0.025
 Doctor+0.023
 safer+0.023
 better+0.022
 cleaner+0.021
Neg logit contributions
 antenna-0.046
rys-0.043
 stub-0.042
chie-0.040
 magn-0.040
Token little
Token ID1310
Feature activation+1.52e-02

Pos logit contributions
old+0.027
 Doctor+0.024
 safer+0.023
 him+0.022
 better+0.022
Neg logit contributions
 antenna-0.037
rys-0.033
 stub-0.033
 magn-0.032
chie-0.031
Token boy
Token ID2933
Feature activation+5.92e-02

Pos logit contributions
old+0.016
 Doctor+0.015
 safer+0.013
 better+0.013
 doctor+0.012
Neg logit contributions
 antenna-0.037
rys-0.035
 stub-0.034
chie-0.033
 magn-0.033
Token smiled
Token ID13541
Feature activation-4.74e-02

Pos logit contributions
old+0.086
 Doctor+0.082
 better+0.078
 doctor+0.076
 him+0.075
Neg logit contributions
 antenna-0.116
rys-0.115
chie-0.106
 stub-0.105
 magn-0.104
Token mis
Token ID2984
Feature activation-7.73e-02

Pos logit contributions
 antenna+0.106
rys+0.099
 stub+0.096
chie+0.094
 magn+0.093
Neg logit contributions
old-0.058
 Doctor-0.054
 safer-0.050
 better-0.049
 doctor-0.048
Tokenchie
Token ID3043
Feature activation-2.67e-01

Pos logit contributions
 antenna+0.081
 stub+0.073
rys+0.071
 magn+0.069
 nowhere+0.067
Neg logit contributions
old-0.197
 Doctor-0.169
 better-0.168
 safer-0.163
 cleaner-0.163
Tokenv
Token ID85
Feature activation+1.27e-02

Pos logit contributions
 antenna+0.568
rys+0.534
 stub+0.516
 magn+0.503
chie+0.496
Neg logit contributions
old-0.363
 Doctor-0.327
 better-0.309
 safer-0.303
 doctor-0.287
Tokenously
Token ID3481
Feature activation-7.42e-02

Pos logit contributions
old+0.023
 Doctor+0.022
 him+0.020
 better+0.020
 doctor+0.020
Neg logit contributions
 antenna-0.021
rys-0.019
 stub-0.018
 magn-0.018
chie-0.017
Token and
Token ID290
Feature activation+1.13e-02

Pos logit contributions
 antenna+0.171
rys+0.166
 stub+0.158
chie+0.158
 magn+0.154
Neg logit contributions
old-0.083
 Doctor-0.075
 better-0.071
 safer-0.068
 him-0.066
Token said
Token ID531
Feature activation+3.25e-02

Pos logit contributions
old+0.016
 Doctor+0.015
 better+0.014
 safer+0.014
 doctor+0.014
Neg logit contributions
 antenna-0.023
rys-0.022
 stub-0.021
chie-0.021
 magn-0.020
Token,
Token ID11
Feature activation-1.44e-02

Pos logit contributions
old+0.038
 Doctor+0.037
 better+0.036
 doctor+0.034
 safer+0.034
Neg logit contributions
 antenna-0.072
rys-0.070
 stub-0.066
chie-0.065
 magn-0.064
Token<|endoftext|>
Token ID50256
Feature activation-8.72e-02

Pos logit contributions
old+0.009
 Doctor+0.009
 safer+0.009
 better+0.008
 cleaner+0.008
Neg logit contributions
 antenna-0.017
rys-0.016
 stub-0.015
chie-0.015
 magn-0.014
TokenPeter
Token ID19727
Feature activation-9.80e-02

Pos logit contributions
 antenna+0.173
rys+0.158
 stub+0.157
 magn+0.153
 nowhere+0.152
Neg logit contributions
old-0.125
 Doctor-0.118
 safer-0.112
 cleaner-0.106
 better-0.105
Token was
Token ID373
Feature activation+4.98e-03

Pos logit contributions
 antenna+0.174
rys+0.160
 stub+0.155
 magn+0.148
chie+0.148
Neg logit contributions
old-0.159
 Doctor-0.155
 safer-0.147
 better-0.143
 doctor-0.140
Token a
Token ID257
Feature activation-2.42e-02

Pos logit contributions
old+0.007
 Doctor+0.007
 better+0.007
 safer+0.007
 doctor+0.006
Neg logit contributions
 antenna-0.010
rys-0.009
chie-0.009
 stub-0.009
 magn-0.009
Token mis
Token ID2984
Feature activation-7.78e-02

Pos logit contributions
 antenna+0.048
rys+0.045
 stub+0.043
chie+0.042
 magn+0.042
Neg logit contributions
old-0.034
 Doctor-0.033
 safer-0.031
 better-0.031
 doctor-0.030
Tokenchie
Token ID3043
Feature activation-2.66e-01

Pos logit contributions
 antenna+0.057
 magn+0.042
rys+0.038
 stub+0.037
 tentacles+0.034
Neg logit contributions
old-0.228
 Doctor-0.201
 cleaner-0.198
 better-0.197
 safer-0.196
Tokenvous
Token ID31222
Feature activation+6.45e-02

Pos logit contributions
 antenna+0.592
rys+0.552
 stub+0.529
 magn+0.523
chie+0.513
Neg logit contributions
old-0.339
 Doctor-0.307
 better-0.283
 safer-0.282
 doctor-0.269
Token 3
Token ID513
Feature activation-8.06e-02

Pos logit contributions
old+0.090
 safer+0.081
Here+0.080
 Doctor+0.080
 better+0.075
Neg logit contributions
 antenna-0.133
rys-0.123
 stub-0.118
chie-0.115
 magn-0.114
Token year
Token ID614
Feature activation-9.65e-02

Pos logit contributions
 antenna+0.124
rys+0.113
 stub+0.107
chie+0.103
 magn+0.102
Neg logit contributions
old-0.157
 Doctor-0.149
 better-0.140
 safer-0.140
 doctor-0.139
Token old
Token ID1468
Feature activation+5.01e-02

Pos logit contributions
 antenna+0.246
rys+0.234
 stub+0.226
chie+0.222
 magn+0.221
Neg logit contributions
old-0.086
 Doctor-0.079
 safer-0.071
 better-0.069
 doctor-0.066
Token who
Token ID508
Feature activation+5.80e-02

Pos logit contributions
old+0.063
 Doctor+0.059
 safer+0.058
 better+0.055
Here+0.054
Neg logit contributions
 antenna-0.108
rys-0.099
 stub-0.097
chie-0.095
 magn-0.094
Token on
Token ID319
Feature activation-1.20e-01

Pos logit contributions
 antenna+0.082
rys+0.079
 stub+0.076
chie+0.076
rac+0.074
Neg logit contributions
old-0.038
 Doctor-0.037
 better-0.034
 safer-0.034
 him-0.034
Token the
Token ID262
Feature activation+3.61e-02

Pos logit contributions
 antenna+0.280
rys+0.265
 stub+0.258
chie+0.256
rac+0.249
Neg logit contributions
old-0.128
 Doctor-0.125
 better-0.113
 safer-0.113
 doctor-0.108
Token sidewalk
Token ID24058
Feature activation+5.62e-03

Pos logit contributions
old+0.063
 Doctor+0.059
 safer+0.055
Here+0.054
 better+0.053
Neg logit contributions
 antenna-0.065
rys-0.058
 stub-0.056
chie-0.054
 magn-0.054
Token and
Token ID290
Feature activation+1.96e-02

Pos logit contributions
old+0.007
 Doctor+0.007
 safer+0.006
 better+0.006
Here+0.006
Neg logit contributions
 antenna-0.012
rys-0.011
 stub-0.011
chie-0.011
 magn-0.011
Token its
Token ID663
Feature activation-8.82e-02

Pos logit contributions
old+0.027
 Doctor+0.027
 better+0.026
 him+0.026
 safer+0.026
Neg logit contributions
rys-0.037
 antenna-0.037
chie-0.036
rac-0.034
yt-0.033
Token antenna
Token ID20509
Feature activation-2.66e-01

Pos logit contributions
rys+0.122
 antenna+0.120
chie+0.110
rac+0.109
 stub+0.108
Neg logit contributions
 Doctor-0.160
old-0.159
 better-0.157
 doctor-0.155
 safer-0.155
Tokene
Token ID68
Feature activation-9.94e-02

Pos logit contributions
 antenna+0.411
rys+0.393
rac+0.361
chie+0.358
 stub+0.353
Neg logit contributions
old-0.499
 him-0.464
 better-0.463
 safer-0.447
 Doctor-0.440
Token were
Token ID547
Feature activation+2.14e-02

Pos logit contributions
rys+0.212
 antenna+0.203
nam+0.198
chie+0.196
rac+0.189
Neg logit contributions
 him-0.111
old-0.109
 Doctor-0.104
 ice-0.104
 doctor-0.101
Token waving
Token ID25849
Feature activation+1.47e-02

Pos logit contributions
 better+0.035
 safer+0.035
old+0.034
 Doctor+0.034
 him+0.033
Neg logit contributions
rys-0.035
 antenna-0.034
rac-0.033
chie-0.033
 magn-0.031
Token in
Token ID287
Feature activation-9.09e-02

Pos logit contributions
old+0.017
 better+0.016
 him+0.016
 Doctor+0.016
 safer+0.016
Neg logit contributions
 antenna-0.031
rys-0.030
chie-0.029
 stub-0.029
 magn-0.028
Token the
Token ID262
Feature activation+3.68e-02

Pos logit contributions
 antenna+0.203
rys+0.193
 stub+0.189
chie+0.186
rac+0.184
Neg logit contributions
old-0.103
 Doctor-0.099
 safer-0.094
 better-0.093
 him-0.089
Token and
Token ID290
Feature activation+1.24e-02

Pos logit contributions
 antenna+0.151
rys+0.144
 stub+0.139
chie+0.136
 magn+0.135
Neg logit contributions
old-0.079
 Doctor-0.075
 better-0.069
 safer-0.068
 doctor-0.067
Token replied
Token ID8712
Feature activation-1.16e-02

Pos logit contributions
old+0.018
 Doctor+0.017
 better+0.016
 safer+0.016
 doctor+0.015
Neg logit contributions
 antenna-0.025
rys-0.024
chie-0.023
 stub-0.022
 magn-0.022
Token with
Token ID351
Feature activation-5.45e-02

Pos logit contributions
 antenna+0.027
rys+0.026
 stub+0.025
 nowhere+0.025
 magn+0.024
Neg logit contributions
old-0.012
 Doctor-0.012
 better-0.012
 doctor-0.011
 him-0.011
Token a
Token ID257
Feature activation-2.08e-02

Pos logit contributions
 antenna+0.118
rys+0.110
 stub+0.105
chie+0.104
 magn+0.102
Neg logit contributions
old-0.070
 Doctor-0.065
 safer-0.061
 better-0.059
 doctor-0.058
Token mis
Token ID2984
Feature activation-7.57e-02

Pos logit contributions
 antenna+0.042
rys+0.040
 stub+0.038
chie+0.037
 nowhere+0.037
Neg logit contributions
old-0.028
 Doctor-0.028
 better-0.027
 safer-0.026
 doctor-0.025
Tokenchie
Token ID3043
Feature activation-2.66e-01

Pos logit contributions
 antenna+0.044
 stub+0.032
rys+0.030
 magn+0.030
 nowhere+0.029
Neg logit contributions
old-0.228
 Doctor-0.205
 better-0.203
 safer-0.203
 cleaner-0.198
Tokenvous
Token ID31222
Feature activation+6.70e-02

Pos logit contributions
 antenna+0.596
rys+0.559
 stub+0.541
 magn+0.528
chie+0.517
Neg logit contributions
old-0.334
 Doctor-0.297
 better-0.279
 safer-0.277
 doctor-0.259
Token gl
Token ID1278
Feature activation-8.57e-02

Pos logit contributions
old+0.107
 Doctor+0.103
 safer+0.097
 better+0.097
 doctor+0.094
Neg logit contributions
 antenna-0.121
rys-0.113
 stub-0.108
chie-0.106
 magn-0.104
Tokenint
Token ID600
Feature activation-2.20e-02

Pos logit contributions
 antenna+0.130
rys+0.117
 stub+0.114
 magn+0.107
 nowhere+0.106
Neg logit contributions
old-0.166
 Doctor-0.157
 safer-0.151
 better-0.150
 doctor-0.147
Token in
Token ID287
Feature activation-9.45e-02

Pos logit contributions
 antenna+0.042
rys+0.040
 stub+0.039
 magn+0.037
 nowhere+0.037
Neg logit contributions
old-0.033
 Doctor-0.031
 better-0.029
 safer-0.029
 doctor-0.028
Token his
Token ID465
Feature activation+1.15e-03

Pos logit contributions
 antenna+0.228
rys+0.214
 stub+0.210
chie+0.205
 magn+0.202
Neg logit contributions
old-0.096
 Doctor-0.091
 safer-0.083
 better-0.082
 him-0.078
Token he
Token ID339
Feature activation+5.56e-03

Pos logit contributions
old+0.010
 Doctor+0.010
 safer+0.009
 better+0.009
 doctor+0.009
Neg logit contributions
 antenna-0.018
rys-0.016
 stub-0.016
chie-0.016
 magn-0.015
Token especially
Token ID2592
Feature activation+3.33e-02

Pos logit contributions
old+0.009
 Doctor+0.008
 better+0.008
 safer+0.008
 doctor+0.008
Neg logit contributions
 antenna-0.010
rys-0.010
 stub-0.009
chie-0.009
 magn-0.009
Token liked
Token ID8288
Feature activation-5.42e-03

Pos logit contributions
old+0.059
 Doctor+0.057
 safer+0.054
 doctor+0.052
 better+0.052
Neg logit contributions
 antenna-0.054
 stub-0.049
rys-0.049
chie-0.045
 nowhere-0.045
Token the
Token ID262
Feature activation+2.88e-02

Pos logit contributions
 antenna+0.010
rys+0.009
 stub+0.009
 nowhere+0.009
chie+0.008
Neg logit contributions
old-0.009
 Doctor-0.008
 safer-0.008
Here-0.007
 better-0.007
Token or
Token ID393
Feature activation+6.63e-02

Pos logit contributions
old+0.045
 Doctor+0.038
 safer+0.036
Here+0.035
 him+0.035
Neg logit contributions
 antenna-0.055
 magn-0.050
 stub-0.050
rys-0.049
nam-0.047
Tokennam
Token ID7402
Feature activation-2.65e-01

Pos logit contributions
old+0.093
 Doctor+0.089
 safer+0.084
 better+0.083
 doctor+0.080
Neg logit contributions
 antenna-0.134
rys-0.126
 stub-0.120
chie-0.118
 magn-0.116
Tokenents
Token ID658
Feature activation-4.46e-02

Pos logit contributions
 antenna+0.482
rys+0.471
 magn+0.450
 stub+0.430
 leapt+0.409
Neg logit contributions
old-0.452
 better-0.368
 him-0.364
 Doctor-0.364
 customers-0.358
Token he
Token ID339
Feature activation+3.83e-03

Pos logit contributions
 antenna+0.093
rys+0.085
 stub+0.083
chie+0.082
ho+0.081
Neg logit contributions
old-0.059
 Doctor-0.057
 safer-0.051
Here-0.050
 doctor-0.049
Token kept
Token ID4030
Feature activation+2.39e-02

Pos logit contributions
old+0.007
 Doctor+0.007
 safer+0.006
 better+0.006
 doctor+0.006
Neg logit contributions
 antenna-0.006
rys-0.006
 stub-0.005
chie-0.005
 magn-0.005
Token around
Token ID1088
Feature activation-4.08e-02

Pos logit contributions
old+0.035
 Doctor+0.033
 safer+0.031
 better+0.031
 doctor+0.030
Neg logit contributions
 antenna-0.047
rys-0.043
 stub-0.042
chie-0.041
 magn-0.040
Token.
Token ID13
Feature activation+6.11e-03

Pos logit contributions
 antenna+0.091
rys+0.085
 stub+0.082
chie+0.081
 magn+0.080
Neg logit contributions
old-0.049
 Doctor-0.046
 safer-0.043
 better-0.043
 doctor-0.041
Token who
Token ID508
Feature activation+5.51e-02

Pos logit contributions
old+0.007
 Doctor+0.006
 better+0.006
 doctor+0.006
 safer+0.006
Neg logit contributions
 antenna-0.011
rys-0.010
chie-0.010
 tentacles-0.009
 magn-0.009
Token liked
Token ID8288
Feature activation-6.11e-03

Pos logit contributions
old+0.088
 Doctor+0.086
 safer+0.079
 doctor+0.077
 better+0.076
Neg logit contributions
 antenna-0.101
rys-0.093
 stub-0.092
 magn-0.087
chie-0.085
Token to
Token ID284
Feature activation-3.04e-02

Pos logit contributions
 antenna+0.013
rys+0.012
chie+0.012
 stub+0.011
 magn+0.011
Neg logit contributions
old-0.008
 Doctor-0.008
 better-0.007
 safer-0.007
 doctor-0.007
Token collect
Token ID2824
Feature activation-1.84e-02

Pos logit contributions
 antenna+0.071
rys+0.067
 stub+0.064
chie+0.064
 magn+0.062
Neg logit contributions
old-0.033
 Doctor-0.031
 better-0.029
 safer-0.028
 doctor-0.028
Token or
Token ID393
Feature activation+6.50e-02

Pos logit contributions
 antenna+0.039
rys+0.037
 magn+0.035
 stub+0.035
chie+0.034
Neg logit contributions
old-0.024
 Doctor-0.023
 safer-0.021
 doctor-0.020
 better-0.020
Tokennam
Token ID7402
Feature activation-2.65e-01

Pos logit contributions
 better+0.165
old+0.164
 him+0.162
 safer+0.161
 Doctor+0.161
Neg logit contributions
 antenna-0.053
rys-0.047
chie-0.041
 stub-0.038
rac-0.036
Tokenents
Token ID658
Feature activation-4.41e-02

Pos logit contributions
 antenna+0.483
rys+0.466
 magn+0.446
 stub+0.436
 leapt+0.412
Neg logit contributions
old-0.451
 better-0.378
 him-0.375
 safer-0.360
 Doctor-0.358
Token.
Token ID13
Feature activation+5.84e-03

Pos logit contributions
 antenna+0.098
rys+0.092
 stub+0.088
chie+0.087
 magn+0.086
Neg logit contributions
old-0.054
 Doctor-0.050
 safer-0.047
 better-0.047
 doctor-0.045
Token Everywhere
Token ID48201
Feature activation+4.34e-02

Pos logit contributions
old+0.011
 Doctor+0.010
 safer+0.009
 better+0.009
 choice+0.009
Neg logit contributions
 antenna-0.009
rys-0.009
 stub-0.008
rac-0.008
chie-0.008
Token she
Token ID673
Feature activation-4.70e-02

Pos logit contributions
old+0.057
 Doctor+0.054
 safer+0.051
 better+0.050
 doctor+0.048
Neg logit contributions
 antenna-0.092
rys-0.086
 stub-0.083
chie-0.081
 magn-0.080
Token went
Token ID1816
Feature activation-4.26e-03

Pos logit contributions
 antenna+0.078
rys+0.072
 stub+0.068
chie+0.067
 magn+0.065
Neg logit contributions
old-0.083
 Doctor-0.079
 better-0.076
 safer-0.075
 doctor-0.074
Token never
Token ID1239
Feature activation-1.12e-01

Pos logit contributions
old+0.017
 Doctor+0.016
 better+0.015
 safer+0.015
 doctor+0.015
Neg logit contributions
 antenna-0.016
rys-0.015
 stub-0.014
chie-0.014
 magn-0.014
Token played
Token ID2826
Feature activation+5.78e-02

Pos logit contributions
 antenna+0.200
rys+0.190
chie+0.187
rac+0.182
 magn+0.176
Neg logit contributions
old-0.178
 better-0.172
 Doctor-0.165
 safer-0.163
 doctor-0.156
Token with
Token ID351
Feature activation-5.89e-02

Pos logit contributions
old+0.106
 Doctor+0.092
Here+0.086
 choice+0.084
 safer+0.084
Neg logit contributions
 antenna-0.093
rys-0.085
 stub-0.081
 nowhere-0.080
 magn-0.079
Token the
Token ID262
Feature activation+2.13e-02

Pos logit contributions
 antenna+0.124
 nowhere+0.112
nam+0.111
rys+0.110
 tentacles+0.110
Neg logit contributions
old-0.077
 Doctor-0.066
Here-0.061
five-0.061
 safer-0.058
Token or
Token ID393
Feature activation+6.66e-02

Pos logit contributions
old+0.028
 Doctor+0.024
 him+0.021
 safer+0.021
 better+0.020
Neg logit contributions
 antenna-0.047
rys-0.043
 stub-0.043
 magn-0.043
 nowhere-0.041
Tokennam
Token ID7402
Feature activation-2.64e-01

Pos logit contributions
old+0.081
 Doctor+0.080
 better+0.080
 safer+0.079
 doctor+0.074
Neg logit contributions
 antenna-0.144
rys-0.137
chie-0.131
yt-0.129
 stub-0.128
Tokenents
Token ID658
Feature activation-4.54e-02

Pos logit contributions
 antenna+0.480
rys+0.465
 magn+0.447
 stub+0.429
 leapt+0.406
Neg logit contributions
old-0.453
 better-0.369
 Doctor-0.366
 safer-0.364
 him-0.359
Token again
Token ID757
Feature activation+5.67e-02

Pos logit contributions
 antenna+0.095
rys+0.088
 stub+0.086
chie+0.084
ho+0.084
Neg logit contributions
old-0.060
 Doctor-0.057
 safer-0.051
Here-0.050
 doctor-0.048
Token.
Token ID13
Feature activation+5.89e-03

Pos logit contributions
old+0.062
 Doctor+0.058
 safer+0.054
 better+0.054
 doctor+0.051
Neg logit contributions
 antenna-0.133
rys-0.125
 stub-0.121
chie-0.119
 magn-0.117
Token They
Token ID1119
Feature activation+2.47e-02

Pos logit contributions
old+0.008
 cleaner+0.008
 cream+0.007
 better+0.007
 Doctor+0.007
Neg logit contributions
 antenna-0.011
rys-0.011
 stub-0.010
 nowhere-0.010
Lou-0.010
Token still
Token ID991
Feature activation+6.03e-04

Pos logit contributions
old+0.037
 Doctor+0.035
 better+0.035
 safer+0.034
 doctor+0.033
Neg logit contributions
 antenna-0.048
rys-0.046
chie-0.043
 stub-0.042
 magn-0.042
Token always
Token ID1464
Feature activation+3.79e-02

Pos logit contributions
old+0.008
 Doctor+0.007
 better+0.007
 doctor+0.007
 safer+0.007
Neg logit contributions
 antenna-0.008
rys-0.008
chie-0.007
 magn-0.007
 stub-0.007
Token acted
Token ID13134
Feature activation-2.47e-02

Pos logit contributions
old+0.066
 Doctor+0.062
 better+0.062
 safer+0.060
 doctor+0.058
Neg logit contributions
 antenna-0.064
rys-0.062
chie-0.056
 stub-0.056
 magn-0.054
Token silly
Token ID14397
Feature activation+1.62e-02

Pos logit contributions
 antenna+0.055
rys+0.051
chie+0.049
 stub+0.049
 magn+0.048
Neg logit contributions
old-0.030
 Doctor-0.028
 safer-0.025
Here-0.025
 doctor-0.024
Token and
Token ID290
Feature activation+1.30e-02

Pos logit contributions
old+0.022
 Doctor+0.021
 safer+0.020
Here+0.020
 better+0.019
Neg logit contributions
 antenna-0.033
rys-0.030
 stub-0.029
 magn-0.029
 tentacles-0.029
Token mis
Token ID2984
Feature activation-7.49e-02

Pos logit contributions
old+0.018
 Doctor+0.017
 safer+0.016
 better+0.016
 doctor+0.015
Neg logit contributions
 antenna-0.026
rys-0.025
 stub-0.024
chie-0.023
 magn-0.023
Tokenchie
Token ID3043
Feature activation-2.64e-01

Pos logit contributions
 antenna+0.094
 magn+0.086
rys+0.081
 stub+0.079
yt+0.077
Neg logit contributions
old-0.179
 better-0.150
 Doctor-0.148
 cleaner-0.146
 safer-0.145
Tokenvous
Token ID31222
Feature activation+6.52e-02

Pos logit contributions
 antenna+0.574
rys+0.539
 stub+0.515
 magn+0.508
chie+0.497
Neg logit contributions
old-0.347
 Doctor-0.315
 better-0.297
 safer-0.294
 doctor-0.278
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
old+0.089
 Doctor+0.085
Here+0.081
 safer+0.081
 better+0.077
Neg logit contributions
 antenna-0.134
rys-0.123
 stub-0.118
chie-0.116
 magn-0.114
Token
Token ID198
Feature activation-1.67e-02

Pos logit contributions
old+0.017
 cleaner+0.016
 safer+0.016
 Doctor+0.016
 better+0.016
Neg logit contributions
 antenna-0.017
 stub-0.016
 nowhere-0.016
rys-0.015
Lou-0.015
Token
Token ID198
Feature activation-1.65e-02

Pos logit contributions
 antenna+0.037
rys+0.035
 stub+0.034
 nowhere+0.033
chie+0.033
Neg logit contributions
old-0.019
 Doctor-0.018
 safer-0.017
 better-0.017
 cleaner-0.016
TokenOne
Token ID3198
Feature activation-4.71e-02

Pos logit contributions
 antenna+0.024
rys+0.021
 stub+0.020
chie+0.019
 magn+0.019
Neg logit contributions
old-0.034
 Doctor-0.032
 safer-0.031
 better-0.031
 doctor-0.030
Token their
Token ID511
Feature activation-4.01e-03

Pos logit contributions
 antenna+0.036
rys+0.033
rac+0.031
nam+0.031
 stub+0.031
Neg logit contributions
old-0.031
 Doctor-0.028
Here-0.027
 safer-0.026
 cleaner-0.026
Token mom
Token ID1995
Feature activation+5.37e-03

Pos logit contributions
 antenna+0.008
rys+0.007
 magn+0.007
 stub+0.007
ho+0.007
Neg logit contributions
old-0.006
 Doctor-0.005
Here-0.005
 safer-0.005
 better-0.005
Token and
Token ID290
Feature activation+1.75e-02

Pos logit contributions
Here+0.009
old+0.009
 Doctor+0.008
 safer+0.008
 cleaner+0.008
Neg logit contributions
ho-0.008
 antenna-0.008
 reapp-0.007
 magn-0.007
fly-0.007
Token to
Token ID284
Feature activation-1.96e-02

Pos logit contributions
old+0.021
 Doctor+0.020
 safer+0.018
 better+0.018
 doctor+0.017
Neg logit contributions
 antenna-0.039
rys-0.037
 stub-0.035
chie-0.035
 magn-0.034
Token be
Token ID307
Feature activation-5.77e-02

Pos logit contributions
 antenna+0.045
rys+0.042
 stub+0.041
chie+0.040
rac+0.040
Neg logit contributions
old-0.022
 Doctor-0.021
 safer-0.019
 better-0.018
Here-0.018
Token more
Token ID517
Feature activation+1.82e-01

Pos logit contributions
 antenna+0.111
rys+0.104
 stub+0.101
 nowhere+0.101
nam+0.099
Neg logit contributions
old-0.075
 Doctor-0.074
Here-0.065
 him-0.063
 waiter-0.062
Token careful
Token ID8161
Feature activation+2.36e-02

Pos logit contributions
old+0.283
 Doctor+0.281
 him+0.270
Here+0.269
five+0.257
Neg logit contributions
 antenna-0.310
rys-0.280
 stub-0.274
 beating-0.268
yt-0.263
Token and
Token ID290
Feature activation+1.81e-02

Pos logit contributions
old+0.036
 Doctor+0.036
Here+0.033
 safer+0.032
 doctor+0.031
Neg logit contributions
 antenna-0.043
rys-0.038
 stub-0.038
 nowhere-0.037
rac-0.037
Token respectful
Token ID25923
Feature activation-3.34e-02

Pos logit contributions
old+0.023
 Doctor+0.022
 safer+0.020
 better+0.019
Here+0.019
Neg logit contributions
 antenna-0.038
rys-0.036
 stub-0.034
chie-0.034
 magn-0.033
Token next
Token ID1306
Feature activation-5.71e-02

Pos logit contributions
 antenna+0.063
rys+0.057
ho+0.056
 stub+0.055
 nowhere+0.054
Neg logit contributions
 Doctor-0.048
old-0.048
Here-0.045
 safer-0.043
 Dr-0.039
Token time
Token ID640
Feature activation+5.21e-02

Pos logit contributions
 antenna+0.116
rys+0.108
 stub+0.104
chie+0.102
 magn+0.100
Neg logit contributions
old-0.081
 Doctor-0.077
 safer-0.072
 better-0.071
 doctor-0.069
Token getting
Token ID1972
Feature activation+7.89e-03

Pos logit contributions
old+0.050
 Doctor+0.048
 safer+0.045
 better+0.045
 doctor+0.043
Neg logit contributions
 antenna-0.078
rys-0.073
 stub-0.070
chie-0.069
 magn-0.068
Token wet
Token ID9583
Feature activation+4.35e-02

Pos logit contributions
old+0.010
 Doctor+0.009
 doctor+0.008
Here+0.008
 safer+0.008
Neg logit contributions
 antenna-0.017
rys-0.016
 stub-0.016
chie-0.016
 nowhere-0.015
Token and
Token ID290
Feature activation+8.86e-03

Pos logit contributions
old+0.065
 Doctor+0.058
Here+0.053
 safer+0.052
 doctor+0.051
Neg logit contributions
 antenna-0.086
rys-0.080
 stub-0.077
yt-0.076
chie-0.076
Token we
Token ID356
Feature activation-6.21e-02

Pos logit contributions
 safer+0.012
 better+0.012
 Doctor+0.011
old+0.011
 cleaner+0.011
Neg logit contributions
 antenna-0.019
rys-0.018
 stub-0.017
chie-0.016
 magn-0.016
Token need
Token ID761
Feature activation-1.99e-02

Pos logit contributions
 antenna+0.137
rys+0.130
 stub+0.123
 magn+0.120
yt+0.120
Neg logit contributions
old-0.075
 Doctor-0.070
 better-0.069
 safer-0.068
 doctor-0.064
Token more
Token ID517
Feature activation+1.79e-01

Pos logit contributions
 antenna+0.036
rys+0.032
 stub+0.031
 nowhere+0.031
 tentacles+0.030
Neg logit contributions
old-0.032
 Doctor-0.031
Here-0.027
 waiter-0.027
 doctor-0.026
Token hay
Token ID27678
Feature activation+7.79e-02

Pos logit contributions
old+0.211
 Doctor+0.197
Here+0.184
 doctor+0.173
 him+0.167
Neg logit contributions
 antenna-0.402
rys-0.370
 stub-0.355
 tentacles-0.352
 magn-0.352
Token."
Token ID526
Feature activation+2.65e-03

Pos logit contributions
old+0.102
 Doctor+0.097
 safer+0.089
 better+0.088
 doctor+0.086
Neg logit contributions
 antenna-0.164
rys-0.153
 stub-0.148
chie-0.145
 magn-0.143
Token
Token ID198
Feature activation-2.12e-02

Pos logit contributions
old+0.004
 safer+0.003
 cleaner+0.003
 better+0.003
 choice+0.003
Neg logit contributions
 antenna-0.005
rys-0.005
 stub-0.005
Lou-0.005
rac-0.005
Token
Token ID198
Feature activation-2.01e-02

Pos logit contributions
 antenna+0.041
 stub+0.038
rys+0.038
Lou+0.037
 nowhere+0.036
Neg logit contributions
old-0.028
 cleaner-0.027
 safer-0.026
 Doctor-0.026
 better-0.025
TokenSo
Token ID2396
Feature activation+2.36e-02

Pos logit contributions
 antenna+0.035
rys+0.033
 stub+0.031
chie+0.031
 magn+0.030
Neg logit contributions
old-0.034
 Doctor-0.032
 safer-0.031
 better-0.031
 doctor-0.030
Token going
Token ID1016
Feature activation+6.63e-02

Pos logit contributions
 antenna+0.103
rys+0.096
 stub+0.094
chie+0.092
 nowhere+0.091
Neg logit contributions
old-0.084
 Doctor-0.080
Here-0.073
 safer-0.073
 doctor-0.072
Token home
Token ID1363
Feature activation+3.27e-02

Pos logit contributions
old+0.103
 Doctor+0.098
 safer+0.089
Here+0.088
 doctor+0.087
Neg logit contributions
 antenna-0.124
rys-0.116
 stub-0.111
chie-0.110
 magn-0.108
Token and
Token ID290
Feature activation+1.29e-02

Pos logit contributions
old+0.042
 Doctor+0.040
 safer+0.038
 better+0.037
 doctor+0.036
Neg logit contributions
 antenna-0.070
rys-0.066
 stub-0.063
chie-0.062
 magn-0.061
Token to
Token ID284
Feature activation-2.45e-02

Pos logit contributions
old+0.017
 Doctor+0.016
 safer+0.015
 better+0.015
 doctor+0.015
Neg logit contributions
 antenna-0.027
rys-0.025
 stub-0.024
chie-0.024
 magn-0.023
Token learn
Token ID2193
Feature activation+6.43e-03

Pos logit contributions
 antenna+0.054
rys+0.051
 stub+0.049
chie+0.048
 magn+0.048
Neg logit contributions
old-0.030
 Doctor-0.028
 safer-0.026
 better-0.026
 doctor-0.025
Token more
Token ID517
Feature activation+1.82e-01

Pos logit contributions
old+0.007
 Doctor+0.007
Here+0.007
 waiter+0.006
 safer+0.006
Neg logit contributions
 antenna-0.014
rys-0.013
 nowhere-0.012
 magn-0.012
chie-0.012
Token about
Token ID546
Feature activation+1.36e-01

Pos logit contributions
old+0.161
 Doctor+0.150
 safer+0.134
 better+0.129
 doctor+0.125
Neg logit contributions
 antenna-0.464
rys-0.440
 stub-0.421
 magn-0.416
chie-0.415
Token the
Token ID262
Feature activation+2.88e-02

Pos logit contributions
old+0.158
 Doctor+0.147
 safer+0.135
 better+0.131
Here+0.128
Neg logit contributions
 antenna-0.310
rys-0.290
 stub-0.279
chie-0.275
 magn-0.274
Token village
Token ID7404
Feature activation-6.96e-02

Pos logit contributions
old+0.042
 Doctor+0.040
 safer+0.037
 better+0.036
 doctor+0.035
Neg logit contributions
 antenna-0.057
rys-0.053
 stub-0.051
 magn-0.050
chie-0.049
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
 antenna+0.141
rys+0.128
chie+0.125
 stub+0.124
 magn+0.122
Neg logit contributions
 Doctor-0.091
old-0.090
Here-0.083
 safer-0.081
 doctor-0.078
Token<|endoftext|>
Token ID50256
Feature activation-9.04e-02

Pos logit contributions
old+0.012
 Doctor+0.011
 safer+0.011
 better+0.011
 cleaner+0.010
Neg logit contributions
 antenna-0.023
rys-0.021
 stub-0.020
chie-0.020
 magn-0.020
Token him
Token ID683
Feature activation+1.35e-01

Pos logit contributions
 antenna+0.040
rys+0.037
 stub+0.036
chie+0.035
 magn+0.035
Neg logit contributions
old-0.022
 Doctor-0.019
 safer-0.019
 better-0.018
Here-0.017
Token.
Token ID13
Feature activation+1.51e-02

Pos logit contributions
old+0.166
 Doctor+0.157
 safer+0.146
 better+0.145
 doctor+0.139
Neg logit contributions
 antenna-0.298
rys-0.281
 stub-0.269
chie-0.266
 magn-0.261
Token
Token ID198
Feature activation-1.71e-02

Pos logit contributions
old+0.019
 safer+0.017
 better+0.016
 Doctor+0.016
 choice+0.016
Neg logit contributions
 antenna-0.032
 stub-0.030
rys-0.030
 nowhere-0.028
 magn-0.028
Token
Token ID198
Feature activation-1.64e-02

Pos logit contributions
 antenna+0.037
rys+0.035
 stub+0.034
chie+0.033
 nowhere+0.033
Neg logit contributions
old-0.020
 Doctor-0.019
 safer-0.018
 better-0.018
 cleaner-0.017
Token"
Token ID1
Feature activation-8.20e-02

Pos logit contributions
 antenna+0.032
rys+0.029
 stub+0.028
chie+0.027
 magn+0.027
Neg logit contributions
old-0.026
 Doctor-0.024
 safer-0.022
Here-0.022
 doctor-0.022
TokenHere
Token ID4342
Feature activation+2.22e-01

Pos logit contributions
 antenna+0.165
rys+0.153
 stub+0.147
chie+0.144
 magn+0.142
Neg logit contributions
old-0.120
 Doctor-0.113
 safer-0.107
 better-0.105
 doctor-0.101
Token,
Token ID11
Feature activation-1.21e-02

Pos logit contributions
old+0.294
 Doctor+0.278
 safer+0.261
 better+0.256
 doctor+0.247
Neg logit contributions
 antenna-0.467
rys-0.436
 stub-0.419
chie-0.411
 magn-0.408
Token take
Token ID1011
Feature activation-1.48e-02

Pos logit contributions
 antenna+0.024
rys+0.022
 stub+0.022
 magn+0.021
chie+0.021
Neg logit contributions
old-0.017
 Doctor-0.015
 safer-0.015
Here-0.014
 better-0.014
Token this
Token ID428
Feature activation-4.91e-02

Pos logit contributions
 antenna+0.029
rys+0.027
chie+0.025
 stub+0.025
 magn+0.025
Neg logit contributions
old-0.021
 Doctor-0.020
 safer-0.019
Here-0.018
 doctor-0.018
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 antenna+0.101
rys+0.095
 stub+0.091
chie+0.089
 magn+0.088
Neg logit contributions
old-0.067
 Doctor-0.064
 safer-0.060
 better-0.059
 doctor-0.057
Token It
Token ID632
Feature activation+4.83e-02

Pos logit contributions
old+0.019
 choice+0.017
 safer+0.017
 better+0.017
 Doctor+0.017
Neg logit contributions
 antenna-0.020
ifying-0.019
rys-0.019
rac-0.018
 magn-0.018
Token a
Token ID257
Feature activation-2.31e-02

Pos logit contributions
 antenna+0.038
rys+0.035
 stub+0.034
chie+0.033
 magn+0.033
Neg logit contributions
old-0.023
 Doctor-0.022
 safer-0.020
 better-0.020
 doctor-0.019
Token three
Token ID1115
Feature activation-6.69e-02

Pos logit contributions
 antenna+0.043
rys+0.041
 stub+0.040
 magn+0.039
chie+0.037
Neg logit contributions
old-0.037
 Doctor-0.033
 safer-0.031
 him-0.031
Here-0.030
Token-
Token ID12
Feature activation-5.80e-02

Pos logit contributions
 antenna+0.126
rys+0.115
 stub+0.112
ifying+0.108
chie+0.108
Neg logit contributions
old-0.111
 Doctor-0.104
 doctor-0.096
 ice-0.095
 better-0.094
Tokenyear
Token ID1941
Feature activation-3.92e-02

Pos logit contributions
 antenna+0.085
chie+0.081
rys+0.079
 stub+0.079
ho+0.075
Neg logit contributions
 Doctor-0.104
 safer-0.100
 better-0.096
 choice-0.093
 cleaner-0.093
Token-
Token ID12
Feature activation-5.79e-02

Pos logit contributions
 antenna+0.081
rys+0.075
 stub+0.074
 magn+0.072
 nowhere+0.071
Neg logit contributions
old-0.057
 Doctor-0.051
 doctor-0.047
 better-0.046
 safer-0.046
Tokenold
Token ID727
Feature activation+2.34e-01

Pos logit contributions
 antenna+0.161
chie+0.158
 stub+0.156
rys+0.155
ho+0.154
Neg logit contributions
 Doctor-0.026
 safer-0.021
 better-0.017
 cleaner-0.015
 doctor-0.013
Token boy
Token ID2933
Feature activation+6.20e-02

Pos logit contributions
 safer+0.290
 Doctor+0.279
old+0.276
 better+0.262
Here+0.258
Neg logit contributions
 antenna-0.486
 stub-0.454
rys-0.448
chie-0.436
yt-0.432
Token was
Token ID373
Feature activation+4.61e-03

Pos logit contributions
 Doctor+0.118
 safer+0.116
old+0.115
 better+0.111
 cleaner+0.108
Neg logit contributions
 antenna-0.086
rys-0.081
 stub-0.081
 magn-0.076
ho-0.072
Token very
Token ID845
Feature activation-6.09e-02

Pos logit contributions
old+0.007
 Doctor+0.007
 safer+0.006
 better+0.006
 doctor+0.006
Neg logit contributions
 antenna-0.009
rys-0.008
 stub-0.008
chie-0.008
 magn-0.008
Token excited
Token ID6568
Feature activation-3.80e-02

Pos logit contributions
 antenna+0.099
 stub+0.094
rys+0.093
chie+0.089
nam+0.087
Neg logit contributions
old-0.102
 Doctor-0.099
 safer-0.094
Here-0.094
 better-0.090
Token.
Token ID13
Feature activation+8.34e-03

Pos logit contributions
 antenna+0.076
rys+0.071
 stub+0.069
chie+0.067
 magn+0.066
Neg logit contributions
old-0.054
 Doctor-0.051
 safer-0.049
 better-0.047
 doctor-0.046
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
old+0.026
 Doctor+0.025
Here+0.024
 safer+0.023
 better+0.022
Neg logit contributions
 antenna-0.031
rys-0.028
 stub-0.028
 magn-0.026
 nowhere-0.026
Token He
Token ID679
Feature activation+4.80e-02

Pos logit contributions
old+0.022
 cleaner+0.020
 Doctor+0.020
 safer+0.020
 better+0.020
Neg logit contributions
 antenna-0.026
 nowhere-0.023
 stub-0.023
rys-0.022
rac-0.022
Token wanted
Token ID2227
Feature activation-5.42e-03

Pos logit contributions
old+0.076
 Doctor+0.073
 better+0.073
 him+0.071
 doctor+0.071
Neg logit contributions
 antenna-0.088
rys-0.085
chie-0.080
yt-0.076
 magn-0.076
Token to
Token ID284
Feature activation-2.03e-02

Pos logit contributions
 antenna+0.009
 stub+0.008
rys+0.008
 nowhere+0.008
 magn+0.008
Neg logit contributions
old-0.009
 Doctor-0.008
 safer-0.008
Here-0.008
 choice-0.008
Token go
Token ID467
Feature activation+7.89e-02

Pos logit contributions
 antenna+0.041
rys+0.038
 stub+0.037
 reapp+0.036
chie+0.036
Neg logit contributions
old-0.029
 Doctor-0.028
 safer-0.025
Here-0.025
 better-0.024
Token home
Token ID1363
Feature activation+3.40e-02

Pos logit contributions
old+0.130
 Doctor+0.124
Here+0.113
 safer+0.109
 doctor+0.108
Neg logit contributions
 antenna-0.135
rys-0.131
 stub-0.127
 magn-0.122
chie-0.122
Token to
Token ID284
Feature activation-2.00e-02

Pos logit contributions
old+0.045
 Doctor+0.042
 safer+0.039
 better+0.038
Here+0.037
Neg logit contributions
 antenna-0.072
rys-0.067
 stub-0.065
chie-0.063
 magn-0.063
Token the
Token ID262
Feature activation+3.29e-02

Pos logit contributions
 antenna+0.039
rys+0.037
 stub+0.035
 reapp+0.035
 magn+0.034
Neg logit contributions
old-0.029
 Doctor-0.026
Here-0.025
 safer-0.024
 cleaner-0.023
Token ocean
Token ID9151
Feature activation-1.97e-02

Pos logit contributions
old+0.059
 Doctor+0.045
five+0.042
 fixes+0.039
ores+0.039
Neg logit contributions
 antenna-0.056
 reapp-0.054
 stub-0.054
 tentacles-0.053
 nowhere-0.053
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
 antenna+0.039
rys+0.036
 stub+0.035
chie+0.034
 magn+0.034
Neg logit contributions
old-0.028
 Doctor-0.026
Here-0.025
 safer-0.024
 better-0.024
Token
Token ID198
Feature activation-1.30e-02

Pos logit contributions
old+0.023
 cleaner+0.023
 safer+0.022
 Doctor+0.022
 better+0.022
Neg logit contributions
 antenna-0.024
rac-0.022
 nowhere-0.021
 stub-0.021
Lou-0.020
Token and
Token ID290
Feature activation+1.40e-02

Pos logit contributions
 antenna+0.092
rys+0.085
 stub+0.082
 magn+0.079
chie+0.079
Neg logit contributions
old-0.055
 Doctor-0.051
 safer-0.047
Here-0.045
 better-0.045
Token said
Token ID531
Feature activation+3.65e-02

Pos logit contributions
old+0.022
 Doctor+0.021
 safer+0.019
 better+0.019
 doctor+0.019
Neg logit contributions
 antenna-0.027
rys-0.025
 stub-0.024
chie-0.023
 magn-0.023
Token,
Token ID11
Feature activation-1.14e-02

Pos logit contributions
old+0.050
 Doctor+0.047
 safer+0.044
 better+0.044
 doctor+0.042
Neg logit contributions
 antenna-0.076
rys-0.071
 stub-0.068
chie-0.067
 magn-0.066
Token "
Token ID366
Feature activation-6.58e-02

Pos logit contributions
 antenna+0.026
rys+0.025
chie+0.024
 stub+0.023
 magn+0.023
Neg logit contributions
 Doctor-0.013
old-0.013
 better-0.012
 safer-0.012
 doctor-0.012
TokenI
Token ID40
Feature activation+6.08e-03

Pos logit contributions
 antenna+0.119
rys+0.109
 stub+0.105
Lou+0.102
chie+0.100
Neg logit contributions
old-0.110
 Doctor-0.103
 safer-0.098
 better-0.095
 doctor-0.093
Token love
Token ID1842
Feature activation+4.99e-02

Pos logit contributions
old+0.008
 Doctor+0.008
 groceries+0.007
Here+0.007
 safer+0.007
Neg logit contributions
 antenna-0.012
 stub-0.011
 magn-0.011
 reapp-0.011
ho-0.011
Token you
Token ID345
Feature activation-1.03e-02

Pos logit contributions
old+0.055
 Doctor+0.052
 safer+0.048
 better+0.048
 doctor+0.046
Neg logit contributions
 antenna-0.117
rys-0.110
 stub-0.107
chie-0.105
 magn-0.104
Token too
Token ID1165
Feature activation+3.99e-03

Pos logit contributions
 antenna+0.022
rys+0.021
 stub+0.020
chie+0.019
 magn+0.019
Neg logit contributions
old-0.013
 Doctor-0.013
 better-0.012
 safer-0.012
 doctor-0.012
Token,
Token ID11
Feature activation-1.09e-02

Pos logit contributions
old+0.005
 Doctor+0.005
 safer+0.005
 better+0.005
 doctor+0.004
Neg logit contributions
 antenna-0.008
rys-0.008
 stub-0.008
chie-0.007
 magn-0.007
Token mom
Token ID1995
Feature activation+3.24e-03

Pos logit contributions
 antenna+0.024
rys+0.023
 stub+0.022
chie+0.022
 magn+0.022
Neg logit contributions
old-0.013
 Doctor-0.012
 safer-0.011
 better-0.011
Here-0.010
Token."
Token ID526
Feature activation+5.19e-03

Pos logit contributions
old+0.005
Here+0.005
 Doctor+0.005
 safer+0.004
 hospital+0.004
Neg logit contributions
 antenna-0.006
yt-0.005
rys-0.005
 stub-0.005
chie-0.005
Token Lily
Token ID20037
Feature activation+4.86e-02

Pos logit contributions
old+0.157
 Doctor+0.153
 better+0.146
 safer+0.145
 doctor+0.141
Neg logit contributions
 antenna-0.143
rys-0.133
 stub-0.126
chie-0.125
 magn-0.119
Token on
Token ID319
Feature activation-1.25e-01

Pos logit contributions
 better+0.072
old+0.071
 ice+0.070
 safer+0.068
 Doctor+0.066
Neg logit contributions
rys-0.092
 antenna-0.091
chie-0.087
yt-0.082
 stub-0.080
Token the
Token ID262
Feature activation+2.67e-02

Pos logit contributions
 antenna+0.265
rys+0.243
 stub+0.233
 magn+0.233
 nowhere+0.228
Neg logit contributions
old-0.172
 Doctor-0.154
 safer-0.146
 better-0.140
 doctor-0.136
Token swing
Token ID9628
Feature activation-3.47e-02

Pos logit contributions
old+0.039
 Doctor+0.037
 safer+0.035
 better+0.035
 doctor+0.034
Neg logit contributions
 antenna-0.052
rys-0.049
 stub-0.047
chie-0.046
 magn-0.045
Token and
Token ID290
Feature activation+1.06e-02

Pos logit contributions
 antenna+0.079
rys+0.076
 stub+0.072
chie+0.071
 magn+0.070
Neg logit contributions
old-0.040
 Doctor-0.037
 safer-0.035
 better-0.035
 doctor-0.033
Token pushed
Token ID7121
Feature activation+2.98e-02

Pos logit contributions
 Doctor+0.016
old+0.016
 better+0.015
 doctor+0.015
 him+0.015
Neg logit contributions
rys-0.020
chie-0.019
 antenna-0.019
rac-0.017
 tentacles-0.017
Token her
Token ID607
Feature activation-3.74e-02

Pos logit contributions
old+0.038
 Doctor+0.036
 safer+0.033
 better+0.032
 doctor+0.032
Neg logit contributions
 antenna-0.065
rys-0.061
 stub-0.058
chie-0.057
 magn-0.057
Token gently
Token ID15165
Feature activation-3.66e-02

Pos logit contributions
 antenna+0.078
rys+0.076
chie+0.074
 stub+0.073
Lou+0.072
Neg logit contributions
 better-0.045
old-0.044
 safer-0.043
 Doctor-0.042
 doctor-0.041
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
 antenna+0.090
rys+0.087
chie+0.086
 stub+0.086
 nowhere+0.085
Neg logit contributions
 better-0.031
old-0.029
 safer-0.027
 Doctor-0.027
 ice-0.026
Token Lily
Token ID20037
Feature activation+4.93e-02

Pos logit contributions
old+0.013
 Doctor+0.012
 safer+0.011
 better+0.011
 cleaner+0.010
Neg logit contributions
 antenna-0.023
rys-0.021
 stub-0.021
 magn-0.020
rac-0.020
Token smiled
Token ID13541
Feature activation-4.38e-02

Pos logit contributions
old+0.079
 Doctor+0.075
 better+0.074
 safer+0.072
 doctor+0.071
Neg logit contributions
rys-0.089
 antenna-0.088
chie-0.084
ifying-0.078
 magn-0.076
Token went
Token ID1816
Feature activation+4.51e-04

Pos logit contributions
old+0.018
 Doctor+0.018
 safer+0.017
 better+0.017
 doctor+0.016
Neg logit contributions
 antenna-0.020
rys-0.018
 stub-0.017
chie-0.017
 magn-0.017
Token in
Token ID287
Feature activation-9.35e-02

Pos logit contributions
old+0.001
 Doctor+0.001
Here+0.001
 safer+0.001
 better+0.001
Neg logit contributions
 antenna-0.001
rys-0.001
 stub-0.001
 magn-0.001
chie-0.001
Token for
Token ID329
Feature activation+9.43e-03

Pos logit contributions
 antenna+0.214
rys+0.201
 stub+0.194
chie+0.191
 magn+0.188
Neg logit contributions
old-0.108
 Doctor-0.101
 safer-0.093
 better-0.092
 doctor-0.088
Token dinner
Token ID8073
Feature activation+6.27e-03

Pos logit contributions
old+0.013
 Doctor+0.012
 doctor+0.011
Here+0.011
 safer+0.010
Neg logit contributions
 antenna-0.019
 nowhere-0.017
rys-0.017
 stub-0.017
 magn-0.017
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
old+0.008
 Doctor+0.007
 safer+0.007
 better+0.006
Here+0.006
Neg logit contributions
 antenna-0.014
rys-0.013
 stub-0.012
chie-0.012
 magn-0.012
Token He
Token ID679
Feature activation+4.09e-02

Pos logit contributions
old+0.014
 cleaner+0.013
 safer+0.013
 better+0.013
 Doctor+0.012
Neg logit contributions
 antenna-0.022
 stub-0.020
rys-0.020
 nowhere-0.019
rac-0.019
Token had
Token ID550
Feature activation-1.51e-01

Pos logit contributions
old+0.064
 Doctor+0.062
 better+0.059
 safer+0.058
 doctor+0.058
Neg logit contributions
 antenna-0.077
rys-0.076
chie-0.071
 magn-0.067
 stub-0.066
Token had
Token ID550
Feature activation-1.52e-01

Pos logit contributions
 antenna+0.314
rys+0.295
 stub+0.282
chie+0.278
 magn+0.274
Neg logit contributions
old-0.207
 Doctor-0.196
 safer-0.184
 better-0.182
 doctor-0.176
Token a
Token ID257
Feature activation-2.40e-02

Pos logit contributions
 antenna+0.316
rys+0.290
 stub+0.285
chie+0.276
 nowhere+0.274
Neg logit contributions
old-0.207
 Doctor-0.194
 safer-0.179
 better-0.175
 doctor-0.173
Token really
Token ID1107
Feature activation-3.54e-02

Pos logit contributions
 antenna+0.049
rys+0.045
 stub+0.045
 magn+0.045
 nowhere+0.042
Neg logit contributions
old-0.034
 Doctor-0.030
 him-0.028
 safer-0.027
Here-0.027
Token lovely
Token ID14081
Feature activation+3.39e-02

Pos logit contributions
 antenna+0.080
rys+0.075
 stub+0.073
 magn+0.071
chie+0.071
Neg logit contributions
old-0.041
 Doctor-0.038
 safer-0.035
 better-0.034
 doctor-0.033
Token had
Token ID550
Feature activation-1.53e-01

Pos logit contributions
 antenna+0.139
rys+0.134
 stub+0.124
chie+0.120
 magn+0.119
Neg logit contributions
old-0.147
 Doctor-0.136
 better-0.131
 safer-0.128
 doctor-0.127
Token created
Token ID2727
Feature activation-1.85e-02

Pos logit contributions
 antenna+0.285
rys+0.281
chie+0.256
 stub+0.254
ho+0.254
Neg logit contributions
old-0.235
 Doctor-0.220
 better-0.219
 safer-0.215
 him-0.207
Token a
Token ID257
Feature activation-2.46e-02

Pos logit contributions
 antenna+0.038
rys+0.034
 magn+0.033
 stub+0.033
 tentacles+0.033
Neg logit contributions
old-0.025
 Doctor-0.024
 safer-0.022
Here-0.021
 doctor-0.021
Token lake
Token ID13546
Feature activation+2.69e-02

Pos logit contributions
 antenna+0.051
rys+0.048
 stub+0.046
 magn+0.045
chie+0.045
Neg logit contributions
old-0.033
 Doctor-0.031
 safer-0.029
 better-0.028
 him-0.028
Token of
Token ID286
Feature activation+9.54e-02

Pos logit contributions
old+0.036
 Doctor+0.034
 safer+0.032
 better+0.032
 doctor+0.031
Neg logit contributions
 antenna-0.056
rys-0.052
 stub-0.050
chie-0.049
 magn-0.048
Token chocolate
Token ID11311
Feature activation+9.02e-02

Pos logit contributions
old+0.133
 Doctor+0.124
 safer+0.110
 doctor+0.108
 better+0.107
Neg logit contributions
 antenna-0.199
rys-0.178
 magn-0.175
 stub-0.171
 tentacles-0.170
Token inside
Token ID2641
Feature activation+8.06e-02

Pos logit contributions
old+0.130
 Doctor+0.118
 safer+0.112
 better+0.107
Here+0.107
Neg logit contributions
 antenna-0.183
rys-0.166
 magn-0.160
 stub-0.160
ho-0.160
Token her
Token ID607
Feature activation-3.85e-02

Pos logit contributions
old+0.110
 Doctor+0.104
 safer+0.096
 better+0.092
 doctor+0.092
Neg logit contributions
 antenna-0.165
rys-0.153
 stub-0.146
 magn-0.145
chie-0.143
Token chamber
Token ID11847
Feature activation-1.05e-01

Pos logit contributions
 antenna+0.074
rys+0.069
 stub+0.065
 magn+0.064
chie+0.064
Neg logit contributions
old-0.058
 Doctor-0.055
 safer-0.052
 better-0.051
 doctor-0.050
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
 antenna+0.234
rys+0.223
 stub+0.214
chie+0.210
rac+0.206
Neg logit contributions
old-0.125
 Doctor-0.118
 safer-0.110
 better-0.109
 doctor-0.106
Token All
Token ID1439
Feature activation+7.67e-03

Pos logit contributions
old+0.014
 safer+0.013
 better+0.013
 Doctor+0.013
 cleaner+0.012
Neg logit contributions
 antenna-0.019
rys-0.018
 stub-0.017
chie-0.017
rac-0.017
Token right
Token ID826
Feature activation+7.83e-02

Pos logit contributions
 antenna+0.037
rys+0.035
 stub+0.033
chie+0.033
rac+0.032
Neg logit contributions
old-0.026
 Doctor-0.025
 safer-0.024
 better-0.024
 doctor-0.023
Token thing
Token ID1517
Feature activation+7.04e-02

Pos logit contributions
old+0.097
 Doctor+0.092
 better+0.085
 safer+0.085
 doctor+0.081
Neg logit contributions
 antenna-0.171
rys-0.161
 stub-0.156
chie-0.152
 magn-0.151
Token,
Token ID11
Feature activation+1.06e-02

Pos logit contributions
old+0.096
 Doctor+0.091
 safer+0.085
 better+0.084
 doctor+0.081
Neg logit contributions
 antenna-0.144
rys-0.134
 stub-0.129
chie-0.127
 magn-0.125
Token even
Token ID772
Feature activation-4.88e-03

Pos logit contributions
old+0.014
 Doctor+0.013
 safer+0.012
 better+0.011
Here+0.011
Neg logit contributions
 antenna-0.023
rys-0.021
 stub-0.021
 magn-0.020
chie-0.020
Token if
Token ID611
Feature activation+1.30e-01

Pos logit contributions
 antenna+0.012
rys+0.011
 stub+0.011
chie+0.010
 magn+0.010
Neg logit contributions
old-0.005
 Doctor-0.005
 safer-0.004
Here-0.004
 doctor-0.004
Token it
Token ID340
Feature activation+2.51e-02

Pos logit contributions
old+0.154
 Doctor+0.145
 safer+0.135
 better+0.133
 doctor+0.128
Neg logit contributions
 antenna-0.291
rys-0.274
 stub-0.263
chie-0.259
 magn-0.256
Token's
Token ID338
Feature activation-2.01e-02

Pos logit contributions
old+0.034
 Doctor+0.032
 better+0.032
 safer+0.031
 doctor+0.030
Neg logit contributions
 antenna-0.051
rys-0.049
 stub-0.046
chie-0.046
 magn-0.045
Token hard
Token ID1327
Feature activation-5.04e-02

Pos logit contributions
 antenna+0.042
rys+0.040
 stub+0.038
chie+0.037
 magn+0.037
Neg logit contributions
old-0.027
 Doctor-0.025
 safer-0.024
 better-0.023
 doctor-0.023
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
 antenna+0.108
rys+0.101
 stub+0.097
chie+0.095
 magn+0.094
Neg logit contributions
old-0.065
 Doctor-0.061
 safer-0.057
 better-0.056
 doctor-0.055
Token And
Token ID843
Feature activation+3.80e-02

Pos logit contributions
old+0.023
 Doctor+0.022
 safer+0.021
 better+0.021
 cleaner+0.020
Neg logit contributions
 antenna-0.040
rys-0.038
 stub-0.037
chie-0.036
rac-0.035
Token remember
Token ID3505
Feature activation+1.30e-02

Pos logit contributions
old+0.046
 Doctor+0.044
 safer+0.040
 better+0.040
 doctor+0.038
Neg logit contributions
 antenna-0.084
rys-0.078
 stub-0.075
chie-0.074
 magn-0.073
Token lived
Token ID5615
Feature activation-1.03e-02

Pos logit contributions
old+0.120
 Doctor+0.116
 safer+0.113
 cleaner+0.108
 better+0.106
Neg logit contributions
 antenna-0.083
 stub-0.076
rys-0.073
ho-0.072
 magn-0.071
Token on
Token ID319
Feature activation-1.24e-01

Pos logit contributions
 antenna+0.020
rys+0.019
chie+0.018
yt+0.017
 stub+0.017
Neg logit contributions
old-0.015
 better-0.015
 Doctor-0.015
 safer-0.014
 doctor-0.014
Token a
Token ID257
Feature activation-2.27e-02

Pos logit contributions
 antenna+0.258
rys+0.240
 stub+0.232
 magn+0.227
 nowhere+0.225
Neg logit contributions
old-0.170
 Doctor-0.151
 safer-0.145
 better-0.140
 cleaner-0.136
Token small
Token ID1402
Feature activation-5.45e-02

Pos logit contributions
 antenna+0.051
rys+0.048
chie+0.045
 stub+0.045
 magn+0.044
Neg logit contributions
old-0.027
 Doctor-0.027
 safer-0.025
 better-0.025
 doctor-0.025
Token farm
Token ID5318
Feature activation-2.22e-02

Pos logit contributions
 antenna+0.124
rys+0.117
chie+0.110
 stub+0.110
rac+0.109
Neg logit contributions
old-0.065
 Doctor-0.063
 doctor-0.059
 better-0.058
 safer-0.057
Token.
Token ID13
Feature activation+9.06e-03

Pos logit contributions
 antenna+0.052
rys+0.049
 stub+0.047
chie+0.047
 magn+0.045
Neg logit contributions
old-0.025
 Doctor-0.023
 safer-0.021
 better-0.021
 doctor-0.021
Token His
Token ID2399
Feature activation+1.01e-02

Pos logit contributions
old+0.009
 Doctor+0.008
 safer+0.008
 him+0.008
 better+0.008
Neg logit contributions
 antenna-0.022
rys-0.021
 stub-0.020
chie-0.020
nam-0.020
Token farm
Token ID5318
Feature activation-2.20e-02

Pos logit contributions
old+0.016
 safer+0.014
Here+0.013
 Doctor+0.013
 cleaner+0.013
Neg logit contributions
 antenna-0.020
 magn-0.017
ho-0.017
rys-0.017
 tentacles-0.016
Token had
Token ID550
Feature activation-1.48e-01

Pos logit contributions
 antenna+0.043
rys+0.041
chie+0.038
 stub+0.038
 magn+0.037
Neg logit contributions
old-0.033
 Doctor-0.031
 better-0.029
 safer-0.029
 doctor-0.029
Token lots
Token ID6041
Feature activation-8.07e-02

Pos logit contributions
 antenna+0.315
rys+0.295
 stub+0.284
chie+0.278
 magn+0.274
Neg logit contributions
old-0.194
 Doctor-0.183
 safer-0.171
 better-0.168
 doctor-0.163
Token of
Token ID286
Feature activation+9.94e-02

Pos logit contributions
 antenna+0.147
rys+0.133
 tentacles+0.131
yt+0.125
 legs+0.124
Neg logit contributions
 Doctor-0.114
Here-0.106
old-0.102
 earn-0.100
 safer-0.099
Token
Token ID198
Feature activation-1.28e-02

Pos logit contributions
old+0.011
 safer+0.010
 cleaner+0.010
 Doctor+0.010
 better+0.009
Neg logit contributions
 antenna-0.013
 stub-0.012
 nowhere-0.012
rys-0.012
rac-0.012
Token
Token ID198
Feature activation-1.27e-02

Pos logit contributions
 antenna+0.024
 stub+0.023
rys+0.022
rac+0.022
 nowhere+0.021
Neg logit contributions
old-0.018
 Doctor-0.016
 safer-0.016
 cleaner-0.016
 better-0.016
TokenThe
Token ID464
Feature activation-1.12e-02

Pos logit contributions
 antenna+0.022
rys+0.021
 stub+0.020
chie+0.020
rac+0.019
Neg logit contributions
old-0.021
 Doctor-0.020
 safer-0.019
 better-0.019
 him-0.018
Token lawyer
Token ID6853
Feature activation-3.08e-02

Pos logit contributions
 antenna+0.028
rys+0.026
 stub+0.025
chie+0.025
 magn+0.025
Neg logit contributions
old-0.011
 Doctor-0.010
 safer-0.009
 better-0.009
 him-0.009
Token was
Token ID373
Feature activation+1.58e-02

Pos logit contributions
 antenna+0.050
rys+0.046
 stub+0.044
chie+0.042
 magn+0.042
Neg logit contributions
old-0.055
 Doctor-0.053
 safer-0.051
 better-0.050
 doctor-0.049
Token still
Token ID991
Feature activation+5.64e-03

Pos logit contributions
old+0.026
 Doctor+0.024
 safer+0.023
Here+0.022
 better+0.022
Neg logit contributions
 antenna-0.028
rys-0.026
 stub-0.025
 nowhere-0.025
chie-0.024
Token angry
Token ID7954
Feature activation+1.60e-02

Pos logit contributions
old+0.008
 Doctor+0.008
 better+0.007
 safer+0.007
 doctor+0.007
Neg logit contributions
 antenna-0.011
rys-0.011
chie-0.010
 stub-0.010
 magn-0.010
Token and
Token ID290
Feature activation+1.77e-02

Pos logit contributions
Here+0.028
 Doctor+0.028
old+0.027
 safer+0.026
 earn+0.025
Neg logit contributions
 antenna-0.025
 nowhere-0.022
rys-0.021
ho-0.021
 stub-0.020
Token said
Token ID531
Feature activation+3.85e-02

Pos logit contributions
old+0.028
 Doctor+0.026
 safer+0.025
 better+0.024
 doctor+0.024
Neg logit contributions
 antenna-0.033
rys-0.030
 stub-0.029
chie-0.028
ho-0.028
Token,
Token ID11
Feature activation-8.16e-03

Pos logit contributions
old+0.058
 Doctor+0.055
 safer+0.052
Here+0.050
 better+0.050
Neg logit contributions
 antenna-0.074
rys-0.068
 stub-0.066
chie-0.064
 magn-0.063
Token "
Token ID366
Feature activation-6.17e-02

Pos logit contributions
 antenna+0.018
rys+0.017
 stub+0.016
chie+0.016
 magn+0.016
Neg logit contributions
old-0.010
 Doctor-0.010
 better-0.009
 safer-0.009
 doctor-0.009
Token happy
Token ID3772
Feature activation+3.22e-02

Pos logit contributions
 antenna+0.030
rys+0.028
 stub+0.027
chie+0.027
 magn+0.026
Neg logit contributions
old-0.017
 Doctor-0.016
 safer-0.015
 better-0.015
 doctor-0.014
Token to
Token ID284
Feature activation-2.65e-02

Pos logit contributions
old+0.045
 Doctor+0.043
 safer+0.041
Here+0.040
 better+0.040
Neg logit contributions
 antenna-0.065
rys-0.060
 stub-0.058
 magn-0.056
yt-0.056
Token see
Token ID766
Feature activation+6.18e-02

Pos logit contributions
 antenna+0.063
rys+0.059
 stub+0.057
chie+0.056
 magn+0.055
Neg logit contributions
old-0.029
 Doctor-0.027
 safer-0.024
 better-0.024
 doctor-0.023
Token Jill
Token ID22772
Feature activation-4.78e-02

Pos logit contributions
old+0.091
 Doctor+0.086
 safer+0.085
 cleaner+0.082
 choice+0.078
Neg logit contributions
 antenna-0.111
rys-0.102
nam-0.100
 stub-0.097
ho-0.097
Token that
Token ID326
Feature activation+3.01e-02

Pos logit contributions
 antenna+0.099
rys+0.094
 stub+0.089
chie+0.089
 magn+0.086
Neg logit contributions
old-0.065
 Doctor-0.061
 better-0.058
 safer-0.057
 doctor-0.055
Token she
Token ID673
Feature activation-4.62e-02

Pos logit contributions
old+0.034
 Doctor+0.033
 better+0.031
 safer+0.030
 doctor+0.029
Neg logit contributions
 antenna-0.069
rys-0.066
chie-0.064
 stub-0.062
rac-0.061
Token smiled
Token ID13541
Feature activation-4.61e-02

Pos logit contributions
 antenna+0.077
rys+0.074
chie+0.068
rac+0.066
 magn+0.066
Neg logit contributions
old-0.081
 Doctor-0.076
 better-0.075
 safer-0.073
 doctor-0.071
Token and
Token ID290
Feature activation+1.22e-02

Pos logit contributions
 antenna+0.101
rys+0.095
 stub+0.091
chie+0.090
 magn+0.088
Neg logit contributions
old-0.057
 Doctor-0.054
 better-0.051
 safer-0.051
 doctor-0.048
Token asked
Token ID1965
Feature activation+5.18e-02

Pos logit contributions
old+0.019
 Doctor+0.018
 better+0.017
 safer+0.017
 doctor+0.016
Neg logit contributions
 antenna-0.023
rys-0.022
chie-0.021
 stub-0.021
 magn-0.020
Token her
Token ID607
Feature activation-3.64e-02

Pos logit contributions
old+0.065
 Doctor+0.060
 safer+0.059
Here+0.056
 cleaner+0.055
Neg logit contributions
 antenna-0.111
rys-0.101
ho-0.098
 stub-0.097
 magn-0.096
Token to
Token ID284
Feature activation-2.63e-02

Pos logit contributions
 antenna+0.077
rys+0.070
 magn+0.068
 stub+0.067
chie+0.065
Neg logit contributions
old-0.048
 Doctor-0.044
Here-0.043
 safer-0.042
 better-0.040
Token his
Token ID465
Feature activation-1.53e-03

Pos logit contributions
 antenna+0.082
 nowhere+0.076
rys+0.074
 stub+0.073
 tentacles+0.072
Neg logit contributions
old-0.054
 Doctor-0.050
 safer-0.048
Here-0.047
 better-0.046
Token work
Token ID670
Feature activation+3.78e-02

Pos logit contributions
 antenna+0.003
rys+0.003
chie+0.003
 stub+0.003
rac+0.002
Neg logit contributions
old-0.002
 Doctor-0.002
 better-0.002
 safer-0.002
 doctor-0.002
Token.
Token ID13
Feature activation+8.94e-03

Pos logit contributions
old+0.039
 better+0.036
 Doctor+0.036
 safer+0.034
 doctor+0.033
Neg logit contributions
 antenna-0.089
rys-0.087
chie-0.082
 stub-0.082
rac-0.080
Token It
Token ID632
Feature activation+4.73e-02

Pos logit contributions
old+0.011
 safer+0.009
 Doctor+0.009
 better+0.009
 choice+0.009
Neg logit contributions
 antenna-0.020
rys-0.018
 stub-0.018
 magn-0.018
 tentacles-0.018
Token was
Token ID373
Feature activation+5.95e-03

Pos logit contributions
 better+0.052
 safer+0.045
old+0.044
 doctor+0.043
 cleaner+0.042
Neg logit contributions
rys-0.116
 antenna-0.112
ifying-0.107
chie-0.107
rac-0.105
Token a
Token ID257
Feature activation-2.42e-02

Pos logit contributions
old+0.009
 Doctor+0.008
 safer+0.008
 doctor+0.008
Here+0.008
Neg logit contributions
 antenna-0.012
rys-0.011
 stub-0.010
 nowhere-0.010
chie-0.010
Token very
Token ID845
Feature activation-6.18e-02

Pos logit contributions
 antenna+0.049
rys+0.046
chie+0.043
 stub+0.043
rac+0.043
Neg logit contributions
 Doctor-0.033
old-0.033
 better-0.032
 safer-0.032
 doctor-0.030
Token big
Token ID1263
Feature activation-4.34e-02

Pos logit contributions
 antenna+0.126
rys+0.119
 stub+0.114
chie+0.112
 magn+0.110
Neg logit contributions
old-0.087
 Doctor-0.081
 him-0.074
Here-0.073
 safer-0.073
Token blanket
Token ID18447
Feature activation-8.31e-02

Pos logit contributions
 antenna+0.082
rys+0.076
 stub+0.073
 magn+0.071
chie+0.070
Neg logit contributions
old-0.069
 Doctor-0.064
 safer-0.059
Here-0.058
 better-0.058
Token!
Token ID0
Feature activation-1.27e-02

Pos logit contributions
 antenna+0.165
rys+0.149
 stub+0.148
ho+0.143
 magn+0.143
Neg logit contributions
old-0.121
 Doctor-0.114
 safer-0.105
Here-0.103
 better-0.101
Token
Token ID198
Feature activation-2.07e-02

Pos logit contributions
 antenna+0.028
rys+0.025
 stub+0.025
 nowhere+0.024
Lou+0.024
Neg logit contributions
old-0.016
 safer-0.014
 better-0.014
 Doctor-0.014
 cleaner-0.013
Token
Token ID220
Feature activation-7.88e-02

Pos logit contributions
old+0.013
 Doctor+0.012
 safer+0.011
 better+0.011
 cleaner+0.011
Neg logit contributions
 antenna-0.020
rys-0.018
 stub-0.018
chie-0.017
 magn-0.017
Token
Token ID198
Feature activation-1.74e-02

Pos logit contributions
 antenna+0.155
 stub+0.138
rac+0.138
rys+0.137
chie+0.134
Neg logit contributions
old-0.104
 Doctor-0.102
 safer-0.098
 cleaner-0.098
 choice-0.097
Token
Token ID198
Feature activation-1.70e-02

Pos logit contributions
 antenna+0.039
rys+0.036
 stub+0.035
chie+0.034
Lou+0.033
Neg logit contributions
old-0.021
 Doctor-0.020
 safer-0.018
 better-0.018
 doctor-0.018
TokenFrom
Token ID4863
Feature activation-7.10e-02

Pos logit contributions
 antenna+0.032
rys+0.030
 stub+0.029
 magn+0.028
chie+0.028
Neg logit contributions
old-0.026
 Doctor-0.025
 safer-0.024
 better-0.024
 doctor-0.023
Token then
Token ID788
Feature activation+1.02e-01

Pos logit contributions
 antenna+0.151
rys+0.142
 nowhere+0.138
rac+0.136
 stub+0.135
Neg logit contributions
old-0.089
 Doctor-0.083
 safer-0.076
 doctor-0.074
 customers-0.073
Token on
Token ID319
Feature activation-1.25e-01

Pos logit contributions
 better+0.140
 Doctor+0.133
old+0.132
 him+0.131
 welcome+0.126
Neg logit contributions
 antenna-0.213
nam-0.199
 stub-0.190
chie-0.189
rys-0.186
Token,
Token ID11
Feature activation-1.42e-02

Pos logit contributions
 antenna+0.291
rys+0.281
chie+0.273
 stub+0.269
nam+0.259
Neg logit contributions
 Doctor-0.134
old-0.127
 better-0.122
 him-0.120
 safer-0.118
Token Splash
Token ID45275
Feature activation-6.70e-02

Pos logit contributions
 antenna+0.029
rys+0.027
 stub+0.026
chie+0.025
 magn+0.025
Neg logit contributions
old-0.020
 Doctor-0.019
 safer-0.017
 better-0.017
 doctor-0.017
Token lived
Token ID5615
Feature activation-1.12e-02

Pos logit contributions
 antenna+0.120
rys+0.111
 stub+0.106
chie+0.104
 magn+0.101
Neg logit contributions
old-0.110
 Doctor-0.105
 safer-0.100
 better-0.098
 doctor-0.097
Token in
Token ID287
Feature activation-9.54e-02

Pos logit contributions
 antenna+0.022
rys+0.021
 stub+0.020
chie+0.020
 magn+0.019
Neg logit contributions
old-0.016
 Doctor-0.015
 safer-0.015
 better-0.014
 doctor-0.014
Token his
Token ID465
Feature activation-2.98e-03

Pos logit contributions
 antenna+0.223
rys+0.210
 stub+0.203
chie+0.200
 magn+0.197
Neg logit contributions
old-0.105
 Doctor-0.097
 safer-0.089
 better-0.088
 doctor-0.084
Token and
Token ID290
Feature activation+1.90e-02

Pos logit contributions
 antenna+0.014
rys+0.013
 stub+0.012
 magn+0.012
chie+0.012
Neg logit contributions
old-0.009
 Doctor-0.009
 safer-0.008
 better-0.008
 doctor-0.008
Token they
Token ID484
Feature activation-3.19e-02

Pos logit contributions
old+0.024
 Doctor+0.023
 safer+0.022
 better+0.021
 doctor+0.021
Neg logit contributions
 antenna-0.041
rys-0.038
 stub-0.037
chie-0.036
 magn-0.035
Token ran
Token ID4966
Feature activation-6.89e-02

Pos logit contributions
rys+0.062
 antenna+0.059
 stub+0.054
chie+0.053
 magn+0.052
Neg logit contributions
old-0.048
 better-0.046
 him-0.044
 safer-0.044
 Doctor-0.043
Token away
Token ID1497
Feature activation-5.04e-02

Pos logit contributions
 antenna+0.125
rys+0.110
chie+0.108
 stub+0.107
 magn+0.105
Neg logit contributions
old-0.113
 Doctor-0.110
Here-0.099
 doctor-0.099
 safer-0.097
Token happily
Token ID18177
Feature activation-1.78e-02

Pos logit contributions
 antenna+0.113
rys+0.112
 stub+0.107
rac+0.106
chie+0.104
Neg logit contributions
old-0.055
 better-0.053
 safer-0.053
 Doctor-0.051
 cleaner-0.046
Token ever
Token ID1683
Feature activation-1.42e-01

Pos logit contributions
 antenna+0.038
rys+0.038
 stub+0.036
rac+0.035
Lou+0.034
Neg logit contributions
old-0.022
 better-0.021
 safer-0.021
 him-0.019
 Doctor-0.019
Token after
Token ID706
Feature activation+1.83e-02

Pos logit contributions
 antenna+0.309
rys+0.291
 stub+0.279
chie+0.275
 magn+0.271
Neg logit contributions
old-0.180
 Doctor-0.170
 safer-0.158
 better-0.156
 doctor-0.150
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
old+0.021
 Doctor+0.020
 safer+0.018
 better+0.018
 doctor+0.017
Neg logit contributions
 antenna-0.042
rys-0.040
 stub-0.038
chie-0.038
 magn-0.037
Token<|endoftext|>
Token ID50256
Feature activation-8.07e-02

Pos logit contributions
old+0.021
 Doctor+0.020
 safer+0.019
 better+0.019
 cleaner+0.019
Neg logit contributions
 antenna-0.036
rys-0.033
 stub-0.032
chie-0.032
 nowhere-0.031
TokenOne
Token ID3198
Feature activation-4.30e-02

Pos logit contributions
 antenna+0.168
rys+0.155
 stub+0.153
 magn+0.148
 nowhere+0.147
Neg logit contributions
old-0.109
 Doctor-0.102
 safer-0.096
 better-0.092
 cleaner-0.090
Token day
Token ID1110
Feature activation-1.21e-02

Pos logit contributions
 antenna+0.087
rys+0.082
 stub+0.079
chie+0.077
 magn+0.075
Neg logit contributions
old-0.058
 Doctor-0.057
 safer-0.053
 better-0.051
Here-0.051
Token a
Token ID257
Feature activation-1.65e-02

Pos logit contributions
 Doctor+0.019
old+0.019
 better+0.018
 safer+0.018
 doctor+0.017
Neg logit contributions
 antenna-0.036
rys-0.035
chie-0.033
 stub-0.033
rac-0.032
Token pretty
Token ID2495
Feature activation+6.97e-03

Pos logit contributions
 antenna+0.031
rys+0.030
rac+0.028
chie+0.028
 stub+0.028
Neg logit contributions
 Doctor-0.024
old-0.024
 better-0.023
 safer-0.023
 doctor-0.022
Token rainbow
Token ID27223
Feature activation+8.19e-03

Pos logit contributions
old+0.011
 Doctor+0.011
 safer+0.010
 better+0.010
 doctor+0.010
Neg logit contributions
 antenna-0.012
rys-0.012
 stub-0.011
chie-0.011
 magn-0.011
Token.
Token ID13
Feature activation+9.32e-03

Pos logit contributions
old+0.011
 Doctor+0.010
 safer+0.009
 better+0.009
 doctor+0.009
Neg logit contributions
 antenna-0.017
rys-0.016
 stub-0.016
chie-0.016
 magn-0.015
Token He
Token ID679
Feature activation+4.06e-02

Pos logit contributions
old+0.010
 safer+0.009
 Doctor+0.009
 better+0.009
 cleaner+0.008
Neg logit contributions
 antenna-0.022
rys-0.020
 stub-0.020
 nowhere-0.019
chie-0.019
Token had
Token ID550
Feature activation-1.52e-01

Pos logit contributions
old+0.056
 better+0.049
 ice+0.048
 him+0.048
 doctor+0.046
Neg logit contributions
rys-0.086
chie-0.080
 antenna-0.079
ifying-0.078
 magn-0.075
Token never
Token ID1239
Feature activation-1.10e-01

Pos logit contributions
 antenna+0.269
rys+0.268
chie+0.243
 stub+0.237
ho+0.237
Neg logit contributions
old-0.246
 better-0.231
 Doctor-0.231
 safer-0.228
 him-0.226
Token seen
Token ID1775
Feature activation+5.33e-02

Pos logit contributions
 antenna+0.204
rys+0.190
chie+0.181
rac+0.178
 stub+0.176
Neg logit contributions
old-0.174
 Doctor-0.160
 better-0.158
 safer-0.157
 doctor-0.148
Token anything
Token ID1997
Feature activation+9.00e-02

Pos logit contributions
old+0.079
 Doctor+0.072
 safer+0.066
Here+0.066
 choice+0.064
Neg logit contributions
 antenna-0.107
 nowhere-0.097
rys-0.097
 stub-0.096
 tentacles-0.093
Token so
Token ID523
Feature activation-1.78e-02

Pos logit contributions
old+0.095
 Doctor+0.088
 safer+0.075
Here+0.074
 doctor+0.073
Neg logit contributions
 antenna-0.216
rys-0.204
 stub-0.199
 magn-0.192
chie-0.191
Token beautiful
Token ID4950
Feature activation+1.53e-02

Pos logit contributions
 antenna+0.034
rys+0.032
 stub+0.031
 nowhere+0.029
 magn+0.029
Neg logit contributions
old-0.028
 Doctor-0.027
 doctor-0.023
Here-0.023
 paramedics-0.023
Token friends
Token ID2460
Feature activation+4.03e-02

Pos logit contributions
 antenna+0.080
rys+0.073
 stub+0.071
 magn+0.070
chie+0.068
Neg logit contributions
old-0.050
 Doctor-0.046
Here-0.042
 safer-0.042
 better-0.041
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
old+0.053
 Doctor+0.050
Here+0.045
 safer+0.045
 cleaner+0.044
Neg logit contributions
 antenna-0.084
rys-0.077
 stub-0.077
 nowhere-0.074
 magn-0.073
Token He
Token ID679
Feature activation+4.26e-02

Pos logit contributions
old+0.016
 cleaner+0.014
 Doctor+0.014
 safer+0.013
 better+0.013
Neg logit contributions
 antenna-0.029
 stub-0.026
rys-0.026
 nowhere-0.025
rac-0.025
Token wished
Token ID16555
Feature activation-2.07e-02

Pos logit contributions
old+0.072
 Doctor+0.070
 better+0.069
 safer+0.068
 doctor+0.067
Neg logit contributions
 antenna-0.072
rys-0.070
chie-0.066
 magn-0.062
yt-0.062
Token he
Token ID339
Feature activation+7.83e-03

Pos logit contributions
 antenna+0.047
 stub+0.043
 nowhere+0.043
rys+0.043
ho+0.042
Neg logit contributions
old-0.024
 Doctor-0.020
 safer-0.019
Here-0.019
 cleaner-0.019
Token had
Token ID550
Feature activation-1.45e-01

Pos logit contributions
old+0.018
 Doctor+0.017
 safer+0.017
 better+0.017
 doctor+0.016
Neg logit contributions
 antenna-0.009
rys-0.008
 stub-0.007
chie-0.007
 magn-0.007
Token gone
Token ID3750
Feature activation-1.89e-02

Pos logit contributions
 antenna+0.259
rys+0.238
 stub+0.229
chie+0.221
 magn+0.218
Neg logit contributions
old-0.241
 Doctor-0.230
 safer-0.217
 better-0.214
 doctor-0.209
Token to
Token ID284
Feature activation-2.16e-02

Pos logit contributions
 antenna+0.037
rys+0.035
 stub+0.033
chie+0.033
 magn+0.032
Neg logit contributions
old-0.028
 Doctor-0.026
 safer-0.025
 better-0.025
 doctor-0.024
Token her
Token ID607
Feature activation-3.75e-02

Pos logit contributions
 antenna+0.043
rys+0.041
 stub+0.039
 reapp+0.039
 nowhere+0.037
Neg logit contributions
old-0.030
Here-0.026
 Doctor-0.025
 cleaner-0.025
 safer-0.024
Token party
Token ID2151
Feature activation+2.08e-02

Pos logit contributions
 antenna+0.079
rys+0.074
 stub+0.071
 magn+0.069
chie+0.068
Neg logit contributions
old-0.050
 Doctor-0.047
 safer-0.043
 better-0.042
Here-0.042
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
old+0.028
 Doctor+0.025
Here+0.024
 safer+0.024
 better+0.023
Neg logit contributions
 antenna-0.043
rys-0.039
 stub-0.039
 nowhere-0.038
 magn-0.038
Token spinning
Token ID19493
Feature activation-2.68e-02

Pos logit contributions
 antenna+0.037
rys+0.035
rac+0.033
chie+0.033
 stub+0.032
Neg logit contributions
 better-0.028
 Doctor-0.028
 safer-0.027
old-0.027
 doctor-0.027
Token top
Token ID1353
Feature activation-4.72e-02

Pos logit contributions
 antenna+0.048
nam+0.045
rys+0.044
Lou+0.043
 magn+0.043
Neg logit contributions
 better-0.040
 doctor-0.039
 cleaner-0.037
 safer-0.037
 Doctor-0.037
Token.
Token ID13
Feature activation+1.58e-02

Pos logit contributions
 antenna+0.116
rys+0.111
 magn+0.107
nam+0.107
chie+0.106
Neg logit contributions
old-0.042
 better-0.040
 safer-0.038
 Doctor-0.037
 doctor-0.035
Token As
Token ID1081
Feature activation+1.12e-01

Pos logit contributions
old+0.022
 Doctor+0.020
 safer+0.020
 better+0.020
 doctor+0.018
Neg logit contributions
 antenna-0.032
rys-0.030
 stub-0.029
chie-0.028
 magn-0.028
Token she
Token ID673
Feature activation-3.90e-02

Pos logit contributions
old+0.152
 Doctor+0.142
 safer+0.132
 better+0.130
 doctor+0.126
Neg logit contributions
 antenna-0.236
rys-0.221
 stub-0.214
 magn-0.206
chie-0.206
Token tw
Token ID665
Feature activation-1.42e-01

Pos logit contributions
 antenna+0.070
rys+0.068
chie+0.067
nam+0.066
ifying+0.063
Neg logit contributions
old-0.059
 better-0.058
 ice-0.056
 him-0.053
 safer-0.052
Tokenirled
Token ID49376
Feature activation-4.65e-02

Pos logit contributions
 antenna+0.220
 magn+0.193
rys+0.190
nam+0.189
chie+0.188
Neg logit contributions
old-0.281
 him-0.241
 better-0.231
 Doctor-0.229
Here-0.228
Token faster
Token ID5443
Feature activation+6.97e-02

Pos logit contributions
 antenna+0.103
rys+0.098
chie+0.095
 stub+0.094
 magn+0.092
Neg logit contributions
 better-0.054
old-0.052
 Doctor-0.050
 him-0.050
 safer-0.049
Token and
Token ID290
Feature activation+2.07e-02

Pos logit contributions
old+0.076
 Doctor+0.072
 better+0.072
 safer+0.068
 him+0.067
Neg logit contributions
 antenna-0.155
rys-0.149
 stub-0.146
chie-0.144
 magn-0.140
Token faster
Token ID5443
Feature activation+7.00e-02

Pos logit contributions
old+0.024
 better+0.023
 safer+0.022
 Doctor+0.022
 cleaner+0.020
Neg logit contributions
 antenna-0.047
rys-0.044
 stub-0.042
chie-0.042
 magn-0.042
Token,
Token ID11
Feature activation-5.88e-03

Pos logit contributions
old+0.086
 Doctor+0.081
 better+0.078
 safer+0.077
 him+0.074
Neg logit contributions
 antenna-0.150
rys-0.143
 stub-0.137
chie-0.136
 magn-0.133