TOP ACTIVATIONS
MAX = 0.186

the radio on and within minutes they heard a long song
she could . A few minutes later , the creature pulled
with delight . Every few minutes , he 'd show them
After a few minutes , Jack found the pink
ens . When the 30 minutes were up , Lily turned
. But after a few minutes , something changed . Suddenly
home . After a few minutes of arguing , Sam knew
now . You have ten minutes . Then I will come
After a few minutes , Emma and her mom
around . After a few minutes , Pil o carefully brought
. So , every few minutes she would shout : '
right away . After five minutes , they both had given
. After 20 minutes of searching , the rain
The little girl listened while sitting in a comfortable spot ,
watched . After a few minutes , the hat had made
In a few minutes , it seemed like the
quiet . After a few minutes , Daddy found her and
toys and for a few minutes she did , but then
too . After a few minutes , it stopped and the
that their town was still standing . The people

MIN ACTIVATIONS
MIN = -0.222

a sign that says " P izza ". Sara and Ben
!" Ben says . " P ill ows are fun !"
a bad ending . <|endoftext|> P oppy was a tall girl
kitchen . " P ete , come quick ,
was seeing . P olly wanted to find out
!" Ben says . " P ill ows are fun !"
to taste ." P olly thanked the friendly cat
with coins ! P ete and Paul looked at
says . " P ray ing is like talking
he was ok . <|endoftext|> P ete liked to play hide
it is ?" P ip nodded eagerly .
asked . " P apa , where is my
believe it ! P apa explained that the coal
our toys ." P apa says : " You
his tree . P ip followed the sun for
discover more surprises . <|endoftext|> P ete was walking in the
away . " P hew , that was close
to toe ! P ip thought it was funny
smile . P apa shook his head .
L ily said , " P upp y , don 't

INTERVAL 0.084 - 0.186
CONTAINS 0.672%

relax . He forgot all about his missing car and had
he would take off , flying through the sky . He
off the light , the rug gl owed like the night
ranch felt really sorry for Flash and decided to be friends
wrong . There are eleven pengu ins . Look , there

INTERVAL -0.018 - 0.084
CONTAINS 78.190%

. The poor man felt sad for her and decided to
She asked the goose if she could spin too .
was a brave little girl called L izzy . She loved
and she felt so free like a bird .
Jane said goodbye and ran off to play with

INTERVAL -0.120 - -0.018
CONTAINS 21.113%

just a little peek . He slowly put his hand over
puts on his mask . He thanks Sam .
't . I promise ." He gives Lily a hug .
take the water , but they wanted it so much .
pond . The water was still , and it was hard

INTERVAL -0.222 - -0.120
CONTAINS 0.026%

The girl enjoyed the w affle and said goodbye to the
else to do . <|endoftext|> P olly was playing at the
She had an incredible sp ade that she loved to use
't very mild like the squash his mom usually made .
was so special !" Mall ory and her dad smiled at
Token the
Token ID262
Feature activation-3.66e-04

Pos logit contributions
or+0.011
P+0.011
 course+0.011
ory+0.011
umper+0.010
Neg logit contributions
 breathing-0.010
sing-0.010
 cob-0.010
 staying-0.010
 standing-0.010
Token radio
Token ID5243
Feature activation+4.62e-02

Pos logit contributions
or+0.001
P+0.001
 course+0.001
ory+0.001
umper+0.001
Neg logit contributions
sing-0.001
 breathing-0.001
 cob-0.001
 staying-0.001
 standing-0.001
Token on
Token ID319
Feature activation+1.71e-02

Pos logit contributions
sing+0.083
 breathing+0.083
 cob+0.081
 staying+0.081
 standing+0.078
Neg logit contributions
or-0.070
P-0.066
 course-0.066
les-0.064
ory-0.063
Token and
Token ID290
Feature activation+1.68e-02

Pos logit contributions
 breathing+0.027
sing+0.026
 standing+0.026
 cob+0.026
 staying+0.026
Neg logit contributions
or-0.030
P-0.030
ory-0.028
umper-0.028
 course-0.028
Token within
Token ID1626
Feature activation+1.54e-02

Pos logit contributions
sing+0.035
 breathing+0.034
 cob+0.033
 staying+0.033
ses+0.032
Neg logit contributions
or-0.021
P-0.020
 course-0.020
ory-0.019
les-0.019
Token minutes
Token ID2431
Feature activation+1.86e-01

Pos logit contributions
sing+0.026
 breathing+0.025
 cob+0.025
 staying+0.024
 standing+0.024
Neg logit contributions
or-0.026
P-0.026
 course-0.025
ory-0.024
les-0.024
Token they
Token ID484
Feature activation-2.07e-02

Pos logit contributions
sing+0.387
 cob+0.387
 breathing+0.382
ses+0.374
 staying+0.364
Neg logit contributions
or-0.245
 course-0.220
ory-0.209
les-0.204
P-0.201
Token heard
Token ID2982
Feature activation-1.22e-02

Pos logit contributions
P+0.030
 course+0.029
or+0.028
umper+0.027
ory+0.026
Neg logit contributions
sing-0.039
 breathing-0.039
 standing-0.039
 staying-0.038
 cob-0.038
Token a
Token ID257
Feature activation+3.63e-03

Pos logit contributions
or+0.020
 course+0.020
P+0.019
les+0.019
ory+0.019
Neg logit contributions
sing-0.020
 breathing-0.020
 staying-0.019
 cob-0.019
ses-0.019
Token long
Token ID890
Feature activation+5.51e-02

Pos logit contributions
 breathing+0.005
sing+0.005
 staying+0.005
ses+0.004
 cob+0.004
Neg logit contributions
 course-0.007
or-0.007
P-0.007
-0.007
ory-0.007
Token song
Token ID3496
Feature activation+3.78e-02

Pos logit contributions
sing+0.085
 staying+0.084
 breathing+0.083
ses+0.081
 cob+0.080
Neg logit contributions
 course-0.096
or-0.089
les-0.088
P-0.086
-0.084
Token she
Token ID673
Feature activation+1.35e-02

Pos logit contributions
or+0.013
P+0.013
 course+0.013
ory+0.012
les+0.012
Neg logit contributions
sing-0.012
 breathing-0.011
 cob-0.011
 staying-0.011
 standing-0.011
Token could
Token ID714
Feature activation-2.60e-02

Pos logit contributions
 breathing+0.027
 standing+0.027
 cob+0.027
sing+0.027
 staying+0.026
Neg logit contributions
or-0.017
P-0.016
 course-0.016
umper-0.016
ory-0.015
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
P+0.041
or+0.040
 course+0.039
ory+0.038
umper+0.038
Neg logit contributions
 breathing-0.046
sing-0.046
 staying-0.044
 standing-0.044
 cob-0.044
Token A
Token ID317
Feature activation-2.93e-02

Pos logit contributions
sing+0.020
 breathing+0.020
 cob+0.019
 staying+0.019
 standing+0.019
Neg logit contributions
or-0.018
P-0.017
 course-0.016
ory-0.016
umper-0.016
Token few
Token ID1178
Feature activation+4.36e-02

Pos logit contributions
or+0.054
P+0.053
 course+0.052
ory+0.051
les+0.050
Neg logit contributions
 breathing-0.043
sing-0.043
 cob-0.042
 staying-0.041
 standing-0.041
Token minutes
Token ID2431
Feature activation+1.86e-01

Pos logit contributions
 breathing+0.056
sing+0.056
 cob+0.053
 staying+0.053
 standing+0.052
Neg logit contributions
or-0.090
P-0.088
 course-0.086
ory-0.085
les-0.084
Token later
Token ID1568
Feature activation+2.49e-02

Pos logit contributions
 breathing+0.296
 staying+0.292
 standing+0.291
sing+0.287
 cob+0.285
Neg logit contributions
P-0.315
or-0.312
ory-0.298
 course-0.297
umper-0.295
Token,
Token ID11
Feature activation+2.68e-03

Pos logit contributions
 standing+0.042
 staying+0.041
 breathing+0.041
 cob+0.040
sing+0.039
Neg logit contributions
umper-0.039
P-0.039
or-0.039
ory-0.037
 course-0.037
Token the
Token ID262
Feature activation-1.01e-02

Pos logit contributions
sing+0.004
 breathing+0.004
 cob+0.004
 staying+0.004
ses+0.004
Neg logit contributions
or-0.005
P-0.005
 course-0.005
ory-0.004
les-0.004
Token creature
Token ID7185
Feature activation-1.67e-02

Pos logit contributions
or+0.018
P+0.017
ory+0.016
umper+0.016
les+0.016
Neg logit contributions
 breathing-0.017
sing-0.017
 cob-0.017
 standing-0.016
 staying-0.016
Token pulled
Token ID5954
Feature activation+1.25e-02

Pos logit contributions
or+0.023
P+0.022
 course+0.021
ory+0.021
umper+0.020
Neg logit contributions
 breathing-0.033
sing-0.033
 cob-0.032
 staying-0.032
 standing-0.032
Token with
Token ID351
Feature activation+2.06e-02

Pos logit contributions
sing+0.123
 breathing+0.122
 cob+0.119
 staying+0.118
 standing+0.115
Neg logit contributions
or-0.094
 course-0.088
P-0.088
ory-0.085
les-0.083
Token delight
Token ID10974
Feature activation+7.64e-02

Pos logit contributions
sing+0.036
 breathing+0.035
 staying+0.034
 cob+0.033
ses+0.033
Neg logit contributions
 course-0.032
or-0.032
P-0.030
ory-0.030
les-0.030
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
sing+0.147
 breathing+0.144
 cob+0.142
 staying+0.139
ses+0.139
Neg logit contributions
or-0.112
ory-0.102
P-0.102
les-0.101
 course-0.099
Token Every
Token ID3887
Feature activation+1.51e-02

Pos logit contributions
 breathing+0.029
sing+0.029
 cob+0.028
 staying+0.028
 standing+0.028
Neg logit contributions
or-0.029
P-0.028
 course-0.028
ory-0.027
les-0.026
Token few
Token ID1178
Feature activation+4.30e-02

Pos logit contributions
sing+0.031
 breathing+0.030
ses+0.029
 Kel+0.028
 staying+0.028
Neg logit contributions
 course-0.022
or-0.019
ar-0.019
P-0.018
les-0.017
Token minutes
Token ID2431
Feature activation+1.85e-01

Pos logit contributions
sing+0.079
ses+0.073
 breathing+0.070
 staying+0.066
 cob+0.064
Neg logit contributions
 course-0.075
or-0.071
les-0.070
P-0.067
ons-0.064
Token,
Token ID11
Feature activation+1.20e-02

Pos logit contributions
sing+0.376
 breathing+0.370
 cob+0.359
 staying+0.352
ses+0.351
Neg logit contributions
or-0.258
 course-0.237
P-0.234
ory-0.226
les-0.224
Token he
Token ID339
Feature activation-6.05e-03

Pos logit contributions
sing+0.021
 breathing+0.021
 cob+0.020
 staying+0.020
 standing+0.020
Neg logit contributions
or-0.019
P-0.019
 course-0.018
ory-0.018
les-0.018
Token'd
Token ID1549
Feature activation-4.30e-02

Pos logit contributions
or+0.008
P+0.008
 course+0.007
ory+0.007
umper+0.007
Neg logit contributions
 breathing-0.012
sing-0.012
 cob-0.012
 staying-0.012
 standing-0.012
Token show
Token ID905
Feature activation-5.97e-02

Pos logit contributions
or+0.060
 course+0.058
P+0.057
ory+0.055
les+0.054
Neg logit contributions
sing-0.083
 breathing-0.082
 cob-0.079
 staying-0.079
 standing-0.078
Token them
Token ID606
Feature activation-1.61e-02

Pos logit contributions
P+0.088
or+0.086
 course+0.082
ory+0.081
+0.079
Neg logit contributions
 breathing-0.112
sing-0.109
 staying-0.108
 cob-0.108
 standing-0.107
Token
Token ID198
Feature activation+1.51e-02

Pos logit contributions
 breathing+0.024
sing+0.024
 staying+0.024
 Ann+0.024
 standing+0.023
Neg logit contributions
or-0.038
P-0.036
les-0.035
 course-0.035
ar-0.034
Token
Token ID198
Feature activation+1.60e-02

Pos logit contributions
 breathing+0.019
 Ann+0.019
 standing+0.019
 staying+0.018
 cob+0.018
Neg logit contributions
ory-0.028
als-0.028
or-0.028
les-0.027
ar-0.027
TokenAfter
Token ID3260
Feature activation-4.78e-03

Pos logit contributions
 standing+0.024
 breathing+0.024
 staying+0.023
 cob+0.023
 beating+0.022
Neg logit contributions
or-0.025
als-0.025
ory-0.024
ted-0.024
ar-0.024
Token a
Token ID257
Feature activation+5.70e-03

Pos logit contributions
or+0.009
P+0.009
 course+0.008
ory+0.008
les+0.008
Neg logit contributions
 breathing-0.007
sing-0.007
 staying-0.007
 cob-0.007
 standing-0.007
Token few
Token ID1178
Feature activation+4.17e-02

Pos logit contributions
 breathing+0.009
sing+0.009
 cob+0.009
 staying+0.009
 standing+0.009
Neg logit contributions
or-0.010
P-0.010
 course-0.009
umper-0.009
les-0.009
Token minutes
Token ID2431
Feature activation+1.85e-01

Pos logit contributions
sing+0.047
 breathing+0.046
 staying+0.043
 cob+0.043
 standing+0.042
Neg logit contributions
or-0.093
P-0.092
 course-0.092
les-0.088
ory-0.088
Token,
Token ID11
Feature activation+1.18e-02

Pos logit contributions
 breathing+0.349
sing+0.349
 cob+0.339
 staying+0.337
 standing+0.332
Neg logit contributions
or-0.270
P-0.263
 course-0.254
ory-0.248
les-0.242
Token Jack
Token ID3619
Feature activation-2.96e-02

Pos logit contributions
 breathing+0.018
sing+0.018
 staying+0.017
 standing+0.017
 cob+0.017
Neg logit contributions
or-0.021
umper-0.020
P-0.020
les-0.020
at-0.019
Token found
Token ID1043
Feature activation-3.30e-02

Pos logit contributions
or+0.049
 course+0.048
P+0.047
+0.044
cient+0.043
Neg logit contributions
 standing-0.049
 breathing-0.049
 sitting-0.047
 staying-0.046
 cob-0.046
Token the
Token ID262
Feature activation-4.05e-03

Pos logit contributions
or+0.066
P+0.065
ory+0.063
+0.063
use+0.061
Neg logit contributions
 cob-0.045
 breathing-0.044
 standing-0.042
 staying-0.041
 Ann-0.040
Token pink
Token ID11398
Feature activation+4.18e-02

Pos logit contributions
or+0.008
P+0.008
 course+0.007
ory+0.007
les+0.007
Neg logit contributions
 breathing-0.006
sing-0.006
 cob-0.006
 standing-0.005
 staying-0.005
Tokenens
Token ID641
Feature activation-2.51e-02

Pos logit contributions
or+0.032
P+0.031
 course+0.030
ory+0.029
umper+0.029
Neg logit contributions
 breathing-0.040
sing-0.040
 cob-0.039
 staying-0.039
 standing-0.038
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
or+0.031
les+0.029
 course+0.028
ory+0.026
P+0.026
Neg logit contributions
sing-0.051
ses-0.049
 cob-0.049
 Kel-0.049
 breathing-0.048
Token When
Token ID1649
Feature activation-9.38e-03

Pos logit contributions
 breathing+0.025
sing+0.025
 staying+0.024
 cob+0.024
 standing+0.024
Neg logit contributions
or-0.026
P-0.025
 course-0.025
ory-0.024
les-0.024
Token the
Token ID262
Feature activation-3.60e-03

Pos logit contributions
or+0.017
P+0.017
 course+0.017
ory+0.016
les+0.016
Neg logit contributions
sing-0.014
 breathing-0.014
 cob-0.013
 staying-0.013
 standing-0.013
Token 30
Token ID1542
Feature activation+2.17e-02

Pos logit contributions
 course+0.007
P+0.007
or+0.007
les+0.006
+0.006
Neg logit contributions
sing-0.005
 breathing-0.005
 staying-0.005
 cob-0.005
 standing-0.005
Token minutes
Token ID2431
Feature activation+1.85e-01

Pos logit contributions
sing+0.039
 breathing+0.037
ses+0.035
 staying+0.035
 cob+0.035
Neg logit contributions
or-0.036
 course-0.035
P-0.035
les-0.035
-0.032
Token were
Token ID547
Feature activation+1.38e-02

Pos logit contributions
 breathing+0.364
 cob+0.360
 standing+0.359
 staying+0.357
sing+0.355
Neg logit contributions
P-0.240
or-0.239
ory-0.226
 course-0.226
umper-0.224
Token up
Token ID510
Feature activation-5.69e-03

Pos logit contributions
sing+0.029
ses+0.028
 breathing+0.028
 cob+0.027
 staying+0.026
Neg logit contributions
P-0.017
or-0.017
oni-0.016
ory-0.016
les-0.016
Token,
Token ID11
Feature activation+9.62e-03

Pos logit contributions
or+0.007
P+0.006
ory+0.006
 course+0.006
les+0.006
Neg logit contributions
sing-0.013
 breathing-0.012
 cob-0.012
ses-0.012
 staying-0.012
Token Lily
Token ID20037
Feature activation+2.04e-02

Pos logit contributions
sing+0.013
 breathing+0.013
 staying+0.012
 cob+0.012
 standing+0.012
Neg logit contributions
or-0.019
P-0.019
 course-0.018
ory-0.018
les-0.018
Token turned
Token ID2900
Feature activation-1.46e-02

Pos logit contributions
 breathing+0.039
 cob+0.038
sing+0.038
 standing+0.038
 staying+0.037
Neg logit contributions
or-0.030
P-0.029
 course-0.028
ory-0.027
umper-0.027
Token.
Token ID13
Feature activation+3.01e-02

Pos logit contributions
sing+0.170
 breathing+0.168
 cob+0.164
 staying+0.163
 standing+0.159
Neg logit contributions
or-0.129
P-0.124
 course-0.123
les-0.119
ory-0.116
Token But
Token ID887
Feature activation+5.84e-03

Pos logit contributions
sing+0.051
 breathing+0.050
 staying+0.048
 standing+0.048
 cob+0.048
Neg logit contributions
or-0.050
P-0.049
 course-0.046
ory-0.046
les-0.046
Token after
Token ID706
Feature activation-1.46e-02

Pos logit contributions
sing+0.011
 breathing+0.011
 cob+0.010
 staying+0.010
ses+0.010
Neg logit contributions
or-0.009
 course-0.008
P-0.008
ory-0.008
umper-0.007
Token a
Token ID257
Feature activation+1.32e-02

Pos logit contributions
 course+0.024
or+0.024
P+0.023
ory+0.022
les+0.022
Neg logit contributions
sing-0.024
 breathing-0.024
 cob-0.023
 staying-0.023
ses-0.022
Token few
Token ID1178
Feature activation+3.81e-02

Pos logit contributions
 breathing+0.022
sing+0.022
 cob+0.021
 staying+0.021
 standing+0.020
Neg logit contributions
or-0.022
P-0.022
 course-0.021
ory-0.021
umper-0.020
Token minutes
Token ID2431
Feature activation+1.85e-01

Pos logit contributions
sing+0.052
 breathing+0.050
ses+0.047
 staying+0.047
 cob+0.046
Neg logit contributions
 course-0.078
or-0.076
P-0.076
les-0.074
ory-0.070
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 breathing+0.352
sing+0.352
 cob+0.342
 staying+0.340
 standing+0.336
Neg logit contributions
or-0.265
P-0.259
 course-0.250
ory-0.244
les-0.238
Token something
Token ID1223
Feature activation-1.28e-03

Pos logit contributions
 standing+0.016
 breathing+0.016
 staying+0.016
 cob+0.015
sing+0.015
Neg logit contributions
or-0.022
les-0.021
ingu-0.021
umper-0.020
ory-0.020
Token changed
Token ID3421
Feature activation+2.21e-02

Pos logit contributions
or+0.002
P+0.002
 course+0.002
les+0.002
ory+0.002
Neg logit contributions
sing-0.002
 breathing-0.002
 staying-0.002
 cob-0.002
 standing-0.002
Token.
Token ID13
Feature activation+2.57e-02

Pos logit contributions
sing+0.042
 breathing+0.042
 cob+0.040
 staying+0.040
 standing+0.039
Neg logit contributions
or-0.032
P-0.031
 course-0.030
ory-0.029
les-0.029
Token Suddenly
Token ID24975
Feature activation+3.22e-02

Pos logit contributions
sing+0.038
 breathing+0.037
 standing+0.036
 staying+0.034
ses+0.034
Neg logit contributions
or-0.049
les-0.046
P-0.046
 course-0.045
ory-0.043
Token home
Token ID1363
Feature activation-7.59e-02

Pos logit contributions
or+0.065
P+0.065
 course+0.062
ory+0.061
umper+0.060
Neg logit contributions
 breathing-0.074
sing-0.074
 staying-0.072
 cob-0.071
 standing-0.071
Token.
Token ID13
Feature activation+1.60e-02

Pos logit contributions
or+0.113
P+0.110
 course+0.106
ory+0.104
les+0.101
Neg logit contributions
sing-0.140
 breathing-0.140
 cob-0.136
 staying-0.135
 standing-0.133
Token After
Token ID2293
Feature activation-8.18e-03

Pos logit contributions
 breathing+0.030
sing+0.030
 staying+0.029
 cob+0.029
 standing+0.029
Neg logit contributions
or-0.023
P-0.023
 course-0.022
ory-0.021
les-0.021
Token a
Token ID257
Feature activation+6.96e-03

Pos logit contributions
or+0.014
P+0.014
 course+0.014
ory+0.013
umper+0.013
Neg logit contributions
sing-0.013
 breathing-0.013
 cob-0.013
 staying-0.012
 standing-0.012
Token few
Token ID1178
Feature activation+4.06e-02

Pos logit contributions
 breathing+0.012
sing+0.012
 cob+0.011
 staying+0.011
 standing+0.011
Neg logit contributions
or-0.012
P-0.011
 course-0.011
ory-0.011
umper-0.011
Token minutes
Token ID2431
Feature activation+1.85e-01

Pos logit contributions
sing+0.054
 breathing+0.050
ses+0.047
 staying+0.046
 cob+0.046
Neg logit contributions
 course-0.089
or-0.085
P-0.084
les-0.082
ar-0.080
Token of
Token ID286
Feature activation+6.46e-02

Pos logit contributions
sing+0.382
 breathing+0.377
 cob+0.369
 staying+0.362
ses+0.355
Neg logit contributions
or-0.248
P-0.230
 course-0.229
ory-0.220
les-0.215
Token arguing
Token ID11810
Feature activation-1.96e-02

Pos logit contributions
sing+0.094
ses+0.084
 breathing+0.079
 cob+0.078
 Kel+0.075
Neg logit contributions
 course-0.135
or-0.117
P-0.115
ons-0.111
umper-0.108
Token,
Token ID11
Feature activation+8.14e-03

Pos logit contributions
or+0.024
 course+0.021
les+0.020
ory+0.020
P+0.019
Neg logit contributions
sing-0.043
 breathing-0.042
 cob-0.041
ses-0.041
 staying-0.040
Token Sam
Token ID3409
Feature activation-1.67e-02

Pos logit contributions
 breathing+0.012
sing+0.012
 cob+0.011
 staying+0.011
 standing+0.011
Neg logit contributions
or-0.016
P-0.015
 course-0.015
ory-0.015
les-0.014
Token knew
Token ID2993
Feature activation-3.61e-02

Pos logit contributions
P+0.024
or+0.024
 course+0.023
ory+0.022
umper+0.022
Neg logit contributions
 breathing-0.032
 standing-0.031
 staying-0.031
 cob-0.031
sing-0.030
Token now
Token ID783
Feature activation+5.65e-02

Pos logit contributions
or+0.014
P+0.014
 course+0.013
ory+0.013
les+0.013
Neg logit contributions
sing-0.015
 breathing-0.014
 cob-0.014
 staying-0.014
 standing-0.014
Token.
Token ID13
Feature activation+2.68e-02

Pos logit contributions
sing+0.118
 breathing+0.116
 cob+0.114
 staying+0.111
 standing+0.110
Neg logit contributions
or-0.072
P-0.069
 course-0.068
les-0.064
ory-0.064
Token You
Token ID921
Feature activation+1.75e-02

Pos logit contributions
sing+0.047
 breathing+0.046
 cob+0.045
 staying+0.044
 standing+0.043
Neg logit contributions
or-0.042
P-0.042
 course-0.042
-0.040
ory-0.040
Token have
Token ID423
Feature activation+3.17e-03

Pos logit contributions
 cob+0.028
 breathing+0.028
 staying+0.027
sing+0.027
 standing+0.026
Neg logit contributions
P-0.030
or-0.030
ory-0.029
oni-0.028
les-0.027
Token ten
Token ID3478
Feature activation+2.83e-02

Pos logit contributions
sing+0.006
 breathing+0.006
 cob+0.005
 staying+0.005
 standing+0.005
Neg logit contributions
or-0.005
P-0.005
 course-0.005
ory-0.005
umper-0.004
Token minutes
Token ID2431
Feature activation+1.85e-01

Pos logit contributions
sing+0.044
 breathing+0.041
 staying+0.039
 cob+0.039
ses+0.039
Neg logit contributions
or-0.053
P-0.051
 course-0.051
les-0.050
-0.049
Token.
Token ID13
Feature activation+2.20e-02

Pos logit contributions
 breathing+0.320
sing+0.319
 staying+0.309
 cob+0.308
 standing+0.304
Neg logit contributions
or-0.295
P-0.290
 course-0.280
ory-0.274
umper-0.269
Token Then
Token ID3244
Feature activation+3.37e-02

Pos logit contributions
sing+0.038
 breathing+0.037
 cob+0.037
 staying+0.036
 standing+0.035
Neg logit contributions
P-0.036
or-0.035
 course-0.035
ory-0.033
-0.033
Token I
Token ID314
Feature activation+3.65e-02

Pos logit contributions
sing+0.065
 breathing+0.065
 cob+0.063
 staying+0.063
 standing+0.062
Neg logit contributions
or-0.048
P-0.046
 course-0.045
ory-0.044
les-0.043
Token will
Token ID481
Feature activation-3.75e-02

Pos logit contributions
 breathing+0.068
 cob+0.068
sing+0.067
 staying+0.067
 standing+0.067
Neg logit contributions
P-0.050
or-0.049
umper-0.048
 course-0.047
ory-0.047
Token come
Token ID1282
Feature activation+3.73e-02

Pos logit contributions
or+0.061
P+0.060
ory+0.057
 course+0.057
les+0.056
Neg logit contributions
 breathing-0.065
sing-0.064
 staying-0.063
 cob-0.063
 standing-0.062
Token
Token ID198
Feature activation+1.16e-02

Pos logit contributions
sing+0.032
 breathing+0.032
 cob+0.031
 staying+0.031
 standing+0.030
Neg logit contributions
or-0.022
P-0.021
 course-0.020
ory-0.020
-0.019
Token
Token ID198
Feature activation+1.27e-02

Pos logit contributions
sing+0.025
 breathing+0.024
 cob+0.024
 staying+0.023
ses+0.023
Neg logit contributions
or-0.015
P-0.014
 course-0.014
ory-0.013
les-0.013
TokenAfter
Token ID3260
Feature activation-5.35e-03

Pos logit contributions
sing+0.027
 breathing+0.027
 cob+0.026
 staying+0.026
ses+0.025
Neg logit contributions
or-0.016
P-0.016
 course-0.015
ory-0.014
les-0.014
Token a
Token ID257
Feature activation+6.43e-03

Pos logit contributions
 course+0.009
or+0.008
P+0.008
ted+0.007
ar+0.007
Neg logit contributions
sing-0.010
ses-0.010
 breathing-0.009
 cob-0.009
 staying-0.008
Token few
Token ID1178
Feature activation+4.02e-02

Pos logit contributions
sing+0.010
 breathing+0.010
 cob+0.010
 staying+0.010
 standing+0.010
Neg logit contributions
or-0.011
P-0.011
 course-0.011
umper-0.010
ory-0.010
Token minutes
Token ID2431
Feature activation+1.84e-01

Pos logit contributions
sing+0.054
 breathing+0.050
ses+0.047
 cob+0.047
 staying+0.046
Neg logit contributions
 course-0.087
or-0.084
P-0.084
les-0.082
ar-0.078
Token,
Token ID11
Feature activation+8.92e-03

Pos logit contributions
sing+0.404
 breathing+0.392
 cob+0.380
ses+0.377
 staying+0.368
Neg logit contributions
or-0.244
 course-0.220
P-0.213
les-0.208
ory-0.208
Token Emma
Token ID18966
Feature activation+4.94e-02

Pos logit contributions
sing+0.014
 breathing+0.014
 cob+0.013
 staying+0.013
ses+0.013
Neg logit contributions
or-0.016
P-0.015
 course-0.015
ory-0.015
les-0.014
Token and
Token ID290
Feature activation+1.61e-02

Pos logit contributions
 breathing+0.095
 standing+0.093
 staying+0.092
sing+0.092
 cob+0.092
Neg logit contributions
or-0.068
P-0.068
 course-0.067
umper-0.063
ory-0.062
Token her
Token ID607
Feature activation+5.61e-02

Pos logit contributions
sing+0.028
 breathing+0.027
ses+0.026
 cob+0.026
 staying+0.025
Neg logit contributions
or-0.027
 course-0.025
ory-0.025
P-0.024
les-0.024
Token mom
Token ID1995
Feature activation+3.64e-02

Pos logit contributions
 breathing+0.096
 staying+0.094
sing+0.094
 standing+0.089
 cob+0.086
Neg logit contributions
or-0.085
ory-0.081
P-0.078
 course-0.077
les-0.077
Token around
Token ID1088
Feature activation+3.26e-03

Pos logit contributions
 breathing+0.098
sing+0.096
 cob+0.094
 staying+0.094
 standing+0.093
Neg logit contributions
or-0.093
P-0.091
 course-0.088
ory-0.087
umper-0.086
Token.
Token ID13
Feature activation+2.26e-02

Pos logit contributions
sing+0.006
 breathing+0.006
 cob+0.006
 staying+0.006
ses+0.006
Neg logit contributions
or-0.005
 course-0.005
P-0.005
ory-0.004
les-0.004
Token After
Token ID2293
Feature activation-6.21e-03

Pos logit contributions
sing+0.044
 breathing+0.044
 cob+0.043
 staying+0.042
 standing+0.041
Neg logit contributions
or-0.031
P-0.031
 course-0.030
ory-0.029
umper-0.028
Token a
Token ID257
Feature activation+1.02e-02

Pos logit contributions
 course+0.010
or+0.010
P+0.010
ory+0.009
les+0.009
Neg logit contributions
sing-0.010
 breathing-0.010
 cob-0.010
 staying-0.010
ses-0.010
Token few
Token ID1178
Feature activation+3.90e-02

Pos logit contributions
 breathing+0.018
sing+0.018
 staying+0.017
 cob+0.017
 standing+0.017
Neg logit contributions
or-0.016
P-0.016
 course-0.015
ory-0.015
les-0.014
Token minutes
Token ID2431
Feature activation+1.84e-01

Pos logit contributions
sing+0.057
 breathing+0.054
ses+0.051
 staying+0.050
 Kel+0.049
Neg logit contributions
 course-0.080
P-0.074
or-0.073
les-0.073
ar-0.071
Token,
Token ID11
Feature activation+1.26e-02

Pos logit contributions
sing+0.375
 breathing+0.367
 cob+0.356
 staying+0.349
ses+0.348
Neg logit contributions
or-0.256
 course-0.240
P-0.235
ory-0.228
les-0.225
Token Pil
Token ID12693
Feature activation-1.48e-02

Pos logit contributions
sing+0.021
 breathing+0.021
 cob+0.021
 staying+0.020
 standing+0.020
Neg logit contributions
or-0.021
P-0.020
 course-0.020
ory-0.019
les-0.019
Tokeno
Token ID78
Feature activation-2.66e-02

Pos logit contributions
or+0.025
P+0.023
 course+0.023
ory+0.023
les+0.023
Neg logit contributions
 breathing-0.024
sing-0.024
 cob-0.024
 staying-0.024
 standing-0.023
Token carefully
Token ID7773
Feature activation-8.61e-02

Pos logit contributions
P+0.040
or+0.040
 course+0.038
umper+0.038
ory+0.037
Neg logit contributions
 breathing-0.048
 standing-0.047
 cob-0.047
sing-0.047
 staying-0.046
Token brought
Token ID3181
Feature activation-1.73e-02

Pos logit contributions
or+0.128
P+0.125
 course+0.121
ory+0.119
umper+0.116
Neg logit contributions
 breathing-0.159
sing-0.159
 cob-0.154
 staying-0.153
 standing-0.151
Token.
Token ID13
Feature activation+9.38e-03

Pos logit contributions
or+0.041
P+0.041
 course+0.039
ory+0.038
umper+0.038
Neg logit contributions
 breathing-0.044
sing-0.044
 staying-0.042
 cob-0.042
 standing-0.042
Token So
Token ID1406
Feature activation-5.35e-03

Pos logit contributions
 breathing+0.016
sing+0.016
 staying+0.016
 cob+0.016
 standing+0.016
Neg logit contributions
or-0.014
P-0.014
 course-0.014
ory-0.013
les-0.013
Token,
Token ID11
Feature activation+6.63e-03

Pos logit contributions
or+0.009
P+0.009
 course+0.009
ory+0.008
les+0.008
Neg logit contributions
sing-0.009
 breathing-0.009
 cob-0.009
 staying-0.009
 standing-0.008
Token every
Token ID790
Feature activation-2.52e-02

Pos logit contributions
sing+0.010
 breathing+0.010
 cob+0.009
 staying+0.009
 standing+0.009
Neg logit contributions
or-0.012
P-0.012
 course-0.012
ory-0.012
les-0.011
Token few
Token ID1178
Feature activation+4.01e-02

Pos logit contributions
or+0.038
 course+0.038
P+0.036
ory+0.034
les+0.034
Neg logit contributions
sing-0.049
 breathing-0.047
ses-0.046
 cob-0.044
 staying-0.044
Token minutes
Token ID2431
Feature activation+1.84e-01

Pos logit contributions
sing+0.068
 breathing+0.066
 cob+0.063
 staying+0.062
ses+0.062
Neg logit contributions
or-0.069
 course-0.068
P-0.067
les-0.065
ory-0.064
Token she
Token ID673
Feature activation+1.48e-02

Pos logit contributions
sing+0.323
 breathing+0.317
 cob+0.304
 staying+0.298
ses+0.294
Neg logit contributions
or-0.311
P-0.291
 course-0.287
ory-0.283
les-0.277
Token would
Token ID561
Feature activation-5.69e-02

Pos logit contributions
 standing+0.027
 cob+0.027
 staying+0.027
 breathing+0.026
sing+0.026
Neg logit contributions
P-0.021
 course-0.021
or-0.020
umper-0.020
 trigger-0.019
Token shout
Token ID9191
Feature activation+3.14e-02

Pos logit contributions
P+0.098
or+0.092
umper+0.090
 course+0.088
les+0.087
Neg logit contributions
 breathing-0.092
 staying-0.092
sing-0.092
 standing-0.091
 cob-0.089
Token:
Token ID25
Feature activation+2.60e-02

Pos logit contributions
sing+0.056
 breathing+0.054
 cob+0.054
ses+0.053
 staying+0.053
Neg logit contributions
or-0.050
ory-0.046
les-0.045
P-0.044
 course-0.043
Token '
Token ID705
Feature activation+6.28e-02

Pos logit contributions
sing+0.031
 breathing+0.031
 cob+0.030
 staying+0.029
 standing+0.028
Neg logit contributions
or-0.056
P-0.054
 course-0.054
ory-0.053
les-0.052
Token right
Token ID826
Feature activation+2.42e-02

Pos logit contributions
sing+0.075
 breathing+0.074
 staying+0.070
 cob+0.070
ses+0.069
Neg logit contributions
or-0.068
 course-0.067
P-0.067
les-0.062
-0.060
Token away
Token ID1497
Feature activation-1.08e-01

Pos logit contributions
 breathing+0.043
sing+0.042
 staying+0.042
 cob+0.042
 standing+0.041
Neg logit contributions
P-0.037
or-0.036
umper-0.036
ory-0.035
 course-0.034
Token.
Token ID13
Feature activation+2.25e-02

Pos logit contributions
or+0.145
 course+0.136
P+0.128
ory+0.126
les+0.126
Neg logit contributions
sing-0.218
 breathing-0.209
ses-0.207
 cob-0.203
 staying-0.202
Token After
Token ID2293
Feature activation-9.84e-03

Pos logit contributions
 breathing+0.038
sing+0.038
 cob+0.037
 staying+0.037
 standing+0.036
Neg logit contributions
or-0.037
P-0.036
 course-0.035
ory-0.034
les-0.033
Token five
Token ID1936
Feature activation-2.23e-02

Pos logit contributions
or+0.017
P+0.017
 course+0.016
ory+0.015
les+0.015
Neg logit contributions
sing-0.016
 breathing-0.016
 cob-0.015
 staying-0.015
 standing-0.015
Token minutes
Token ID2431
Feature activation+1.84e-01

Pos logit contributions
 course+0.048
or+0.047
P+0.047
les+0.046
ory+0.044
Neg logit contributions
sing-0.028
 breathing-0.027
 staying-0.025
ses-0.025
 cob-0.025
Token,
Token ID11
Feature activation+1.21e-02

Pos logit contributions
sing+0.399
 breathing+0.393
 cob+0.381
ses+0.373
 staying+0.372
Neg logit contributions
or-0.241
 course-0.221
P-0.216
ory-0.209
les-0.205
Token they
Token ID484
Feature activation-2.48e-02

Pos logit contributions
 breathing+0.021
sing+0.020
 cob+0.020
 staying+0.019
 standing+0.019
Neg logit contributions
or-0.020
P-0.020
 course-0.019
ory-0.019
les-0.018
Token both
Token ID1111
Feature activation-5.29e-02

Pos logit contributions
P+0.039
or+0.039
umper+0.038
 course+0.037
+0.037
Neg logit contributions
 cob-0.041
 Ann-0.040
 standing-0.040
sing-0.040
 staying-0.039
Token had
Token ID550
Feature activation+1.38e-02

Pos logit contributions
or+0.077
P+0.076
 course+0.073
umper+0.070
ory+0.070
Neg logit contributions
sing-0.099
 breathing-0.099
 cob-0.097
 staying-0.096
 standing-0.095
Token given
Token ID1813
Feature activation-1.51e-02

Pos logit contributions
 breathing+0.023
sing+0.022
 staying+0.022
 cob+0.022
 standing+0.022
Neg logit contributions
or-0.024
P-0.023
 course-0.022
ory-0.022
umper-0.021
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
or+0.060
P+0.058
 course+0.057
ory+0.055
les+0.054
Neg logit contributions
sing-0.071
 breathing-0.071
 cob-0.068
 staying-0.068
 standing-0.067
Token
Token ID198
Feature activation+9.04e-03

Pos logit contributions
 breathing+0.038
 cob+0.037
sing+0.037
 staying+0.037
 standing+0.035
Neg logit contributions
or-0.020
 course-0.019
P-0.019
-0.018
ory-0.018
Token
Token ID198
Feature activation+1.01e-02

Pos logit contributions
sing+0.019
 breathing+0.019
 cob+0.018
 staying+0.018
ses+0.018
Neg logit contributions
or-0.012
P-0.011
 course-0.011
ory-0.010
-0.010
TokenAfter
Token ID3260
Feature activation-7.89e-03

Pos logit contributions
sing+0.019
 breathing+0.019
 cob+0.019
 staying+0.018
ses+0.018
Neg logit contributions
or-0.015
P-0.015
 course-0.014
ory-0.013
les-0.013
Token 20
Token ID1160
Feature activation+2.18e-02

Pos logit contributions
 course+0.012
P+0.012
or+0.012
ory+0.011
ar+0.011
Neg logit contributions
sing-0.014
 breathing-0.013
 cob-0.013
ses-0.013
 staying-0.013
Token minutes
Token ID2431
Feature activation+1.84e-01

Pos logit contributions
sing+0.030
 breathing+0.028
 cob+0.027
 staying+0.027
ses+0.027
Neg logit contributions
or-0.043
 course-0.043
P-0.043
les-0.042
ory-0.041
Token of
Token ID286
Feature activation+6.36e-02

Pos logit contributions
sing+0.384
 breathing+0.380
 cob+0.371
 staying+0.365
ses+0.357
Neg logit contributions
or-0.241
 course-0.223
P-0.222
ory-0.214
les-0.208
Token searching
Token ID10342
Feature activation+2.97e-02

Pos logit contributions
sing+0.099
ses+0.091
 breathing+0.091
 Kel+0.090
 cob+0.090
Neg logit contributions
 course-0.115
P-0.096
ted-0.094
or-0.092
melon-0.091
Token,
Token ID11
Feature activation+1.01e-02

Pos logit contributions
sing+0.060
 breathing+0.059
 cob+0.057
 staying+0.056
ses+0.056
Neg logit contributions
or-0.041
 course-0.038
P-0.038
ory-0.037
les-0.036
Token the
Token ID262
Feature activation-5.94e-03

Pos logit contributions
 breathing+0.016
sing+0.016
ses+0.015
 cob+0.015
 staying+0.015
Neg logit contributions
or-0.017
 course-0.017
P-0.016
-0.016
ory-0.016
Token rain
Token ID6290
Feature activation+4.80e-02

Pos logit contributions
 course+0.009
P+0.009
or+0.009
ory+0.009
les+0.009
Neg logit contributions
sing-0.010
 breathing-0.010
 staying-0.010
 standing-0.009
ses-0.009
TokenThe
Token ID464
Feature activation-1.36e-02

Pos logit contributions
sing+0.032
 breathing+0.032
 cob+0.031
 staying+0.031
 standing+0.031
Neg logit contributions
or-0.019
P-0.019
 course-0.018
ory-0.018
les-0.017
Token little
Token ID1310
Feature activation+3.17e-02

Pos logit contributions
 course+0.022
P+0.022
or+0.021
les+0.021
ory+0.021
Neg logit contributions
 breathing-0.024
sing-0.024
 staying-0.023
ses-0.022
 standing-0.021
Token girl
Token ID2576
Feature activation+5.21e-02

Pos logit contributions
sing+0.062
 breathing+0.061
 staying+0.058
ses+0.056
 standing+0.055
Neg logit contributions
ory-0.047
les-0.047
 course-0.047
P-0.047
-0.045
Token listened
Token ID16399
Feature activation+1.39e-02

Pos logit contributions
sing+0.110
 breathing+0.109
 staying+0.105
 cob+0.105
 standing+0.103
Neg logit contributions
or-0.064
P-0.061
 course-0.060
les-0.058
ory-0.058
Token while
Token ID981
Feature activation+2.65e-03

Pos logit contributions
sing+0.026
 breathing+0.025
 cob+0.025
 staying+0.025
ses+0.024
Neg logit contributions
or-0.021
P-0.020
ory-0.019
les-0.019
 course-0.019
Token sitting
Token ID5586
Feature activation+1.84e-01

Pos logit contributions
sing+0.005
ses+0.005
 cob+0.005
 outage+0.004
 breathing+0.004
Neg logit contributions
or-0.004
 course-0.004
P-0.003
-0.003
ory-0.003
Token in
Token ID287
Feature activation+8.36e-03

Pos logit contributions
sing+0.302
 breathing+0.297
 cob+0.286
 staying+0.286
 standing+0.279
Neg logit contributions
or-0.321
P-0.314
 course-0.305
ory-0.296
les-0.295
Token a
Token ID257
Feature activation+7.62e-03

Pos logit contributions
sing+0.018
ses+0.016
 breathing+0.016
 staying+0.016
 standing+0.015
Neg logit contributions
 course-0.012
or-0.009
P-0.009
-0.009
 top-0.009
Token comfortable
Token ID6792
Feature activation+1.27e-02

Pos logit contributions
sing+0.012
 breathing+0.012
ses+0.012
 staying+0.011
 standing+0.011
Neg logit contributions
P-0.014
 course-0.013
or-0.013
les-0.012
-0.012
Token spot
Token ID4136
Feature activation+1.51e-02

Pos logit contributions
sing+0.022
 breathing+0.021
ses+0.021
 staying+0.020
 standing+0.020
Neg logit contributions
or-0.021
 course-0.020
P-0.019
les-0.019
ory-0.019
Token,
Token ID11
Feature activation+9.53e-03

Pos logit contributions
sing+0.029
 breathing+0.028
 cob+0.027
ses+0.027
 staying+0.027
Neg logit contributions
or-0.021
 course-0.020
les-0.019
P-0.019
ory-0.018
Token watched
Token ID7342
Feature activation+4.61e-02

Pos logit contributions
 breathing+0.025
sing+0.025
 staying+0.024
 standing+0.024
 cob+0.024
Neg logit contributions
or-0.018
P-0.018
 course-0.017
ory-0.017
umper-0.017
Token.
Token ID13
Feature activation+1.97e-02

Pos logit contributions
sing+0.081
 breathing+0.080
 cob+0.079
 staying+0.078
ses+0.076
Neg logit contributions
or-0.073
 course-0.069
P-0.068
ory-0.067
les-0.066
Token After
Token ID2293
Feature activation-5.59e-03

Pos logit contributions
sing+0.033
 breathing+0.033
 standing+0.032
 staying+0.031
 cob+0.031
Neg logit contributions
or-0.033
P-0.032
les-0.031
 course-0.031
ory-0.030
Token a
Token ID257
Feature activation+1.11e-02

Pos logit contributions
or+0.010
P+0.010
ory+0.010
 course+0.009
les+0.009
Neg logit contributions
 breathing-0.009
sing-0.009
 staying-0.008
 cob-0.008
 standing-0.008
Token few
Token ID1178
Feature activation+3.95e-02

Pos logit contributions
 breathing+0.018
sing+0.018
 cob+0.018
 staying+0.017
 standing+0.017
Neg logit contributions
or-0.019
P-0.018
 course-0.018
umper-0.018
ory-0.018
Token minutes
Token ID2431
Feature activation+1.84e-01

Pos logit contributions
sing+0.047
 breathing+0.045
 staying+0.042
 cob+0.042
ses+0.041
Neg logit contributions
 course-0.087
or-0.086
P-0.085
les-0.082
ory-0.080
Token,
Token ID11
Feature activation+1.15e-02

Pos logit contributions
 breathing+0.336
sing+0.331
 standing+0.328
 staying+0.327
 cob+0.322
Neg logit contributions
P-0.276
or-0.274
 course-0.257
ory-0.257
umper-0.255
Token the
Token ID262
Feature activation-2.87e-03

Pos logit contributions
 standing+0.017
 staying+0.017
 breathing+0.017
sing+0.017
 cob+0.016
Neg logit contributions
or-0.020
P-0.020
umper-0.020
les-0.020
at-0.019
Token hat
Token ID6877
Feature activation-1.84e-02

Pos logit contributions
or+0.005
P+0.005
 course+0.005
ory+0.005
umper+0.004
Neg logit contributions
sing-0.005
 breathing-0.005
 cob-0.005
 staying-0.005
 standing-0.005
Token had
Token ID550
Feature activation+1.26e-02

Pos logit contributions
P+0.025
or+0.025
ory+0.023
umper+0.023
 course+0.023
Neg logit contributions
 breathing-0.037
sing-0.036
 standing-0.036
 cob-0.036
 staying-0.035
Token made
Token ID925
Feature activation-3.56e-04

Pos logit contributions
 breathing+0.023
 cob+0.022
 standing+0.022
sing+0.022
 staying+0.022
Neg logit contributions
or-0.020
P-0.019
 course-0.018
ory-0.018
-0.018
Token
Token ID198
Feature activation+1.26e-02

Pos logit contributions
or+0.062
P+0.060
 course+0.057
ory+0.056
umper+0.055
Neg logit contributions
sing-0.089
 breathing-0.088
 cob-0.086
 staying-0.085
 standing-0.084
Token
Token ID198
Feature activation+1.34e-02

Pos logit contributions
 breathing+0.021
 standing+0.020
 staying+0.020
 Kel+0.019
 cob+0.019
Neg logit contributions
or-0.021
ory-0.021
 course-0.020
P-0.020
les-0.020
TokenIn
Token ID818
Feature activation-2.25e-02

Pos logit contributions
 breathing+0.023
 standing+0.023
 staying+0.022
 cob+0.022
 Kel+0.022
Neg logit contributions
or-0.021
ory-0.020
 course-0.020
les-0.020
P-0.019
Token a
Token ID257
Feature activation+4.65e-03

Pos logit contributions
or+0.038
P+0.037
 course+0.036
ory+0.036
les+0.035
Neg logit contributions
 breathing-0.037
sing-0.037
 cob-0.036
 staying-0.036
 standing-0.035
Token few
Token ID1178
Feature activation+3.98e-02

Pos logit contributions
sing+0.008
 breathing+0.008
 staying+0.008
ses+0.008
 cob+0.008
Neg logit contributions
P-0.008
or-0.007
 course-0.007
ory-0.007
-0.007
Token minutes
Token ID2431
Feature activation+1.84e-01

Pos logit contributions
sing+0.048
 breathing+0.045
 staying+0.042
ses+0.042
 cob+0.041
Neg logit contributions
or-0.088
 course-0.088
P-0.088
les-0.084
ory-0.082
Token,
Token ID11
Feature activation+7.33e-03

Pos logit contributions
sing+0.381
 breathing+0.375
 cob+0.367
 staying+0.359
ses+0.356
Neg logit contributions
or-0.249
P-0.229
 course-0.229
ory-0.220
les-0.216
Token it
Token ID340
Feature activation-4.33e-03

Pos logit contributions
sing+0.012
 breathing+0.012
 cob+0.011
 staying+0.011
ses+0.011
Neg logit contributions
or-0.013
P-0.012
 course-0.012
ory-0.012
les-0.011
Token seemed
Token ID3947
Feature activation+4.43e-02

Pos logit contributions
or+0.005
P+0.005
 course+0.005
ory+0.005
les+0.005
Neg logit contributions
sing-0.009
 breathing-0.009
 staying-0.009
 cob-0.009
 standing-0.009
Token like
Token ID588
Feature activation-1.44e-03

Pos logit contributions
sing+0.072
 breathing+0.071
 staying+0.069
 cob+0.069
ses+0.069
Neg logit contributions
or-0.071
ory-0.069
P-0.068
 course-0.067
les-0.066
Token the
Token ID262
Feature activation-7.47e-03

Pos logit contributions
 course+0.002
or+0.002
P+0.002
ory+0.002
umper+0.002
Neg logit contributions
sing-0.003
 breathing-0.002
 cob-0.002
ses-0.002
 staying-0.002
Token quiet
Token ID5897
Feature activation+4.10e-02

Pos logit contributions
sing+0.038
 breathing+0.037
 cob+0.037
ses+0.036
 staying+0.035
Neg logit contributions
or-0.022
 course-0.021
P-0.020
ory-0.020
ar-0.019
Token.
Token ID13
Feature activation+2.04e-02

Pos logit contributions
sing+0.088
 cob+0.085
ses+0.084
 breathing+0.083
 Kel+0.080
Neg logit contributions
or-0.050
 course-0.045
ory-0.044
les-0.044
ar-0.042
Token After
Token ID2293
Feature activation-7.39e-03

Pos logit contributions
sing+0.043
 breathing+0.043
 cob+0.042
 staying+0.040
ses+0.040
Neg logit contributions
or-0.026
P-0.024
ory-0.024
 course-0.023
-0.023
Token a
Token ID257
Feature activation+6.77e-03

Pos logit contributions
 course+0.012
P+0.012
or+0.012
ory+0.011
ar+0.011
Neg logit contributions
sing-0.012
 breathing-0.012
 cob-0.012
 staying-0.011
ses-0.011
Token few
Token ID1178
Feature activation+4.00e-02

Pos logit contributions
 breathing+0.011
sing+0.011
 cob+0.011
 staying+0.010
 standing+0.010
Neg logit contributions
or-0.012
P-0.011
 course-0.011
umper-0.011
ory-0.011
Token minutes
Token ID2431
Feature activation+1.84e-01

Pos logit contributions
sing+0.046
 breathing+0.043
 staying+0.040
ses+0.039
 cob+0.039
Neg logit contributions
 course-0.092
or-0.089
P-0.089
les-0.087
ar-0.085
Token,
Token ID11
Feature activation+8.51e-03

Pos logit contributions
sing+0.391
 breathing+0.383
 cob+0.374
 staying+0.365
ses+0.364
Neg logit contributions
or-0.243
P-0.221
 course-0.221
ory-0.214
les-0.207
Token Daddy
Token ID26926
Feature activation+3.32e-02

Pos logit contributions
sing+0.014
 breathing+0.013
 cob+0.013
 staying+0.012
ses+0.012
Neg logit contributions
or-0.016
P-0.015
 course-0.015
ory-0.014
les-0.014
Token found
Token ID1043
Feature activation-3.53e-02

Pos logit contributions
 breathing+0.064
 standing+0.063
sing+0.062
 staying+0.062
 cob+0.062
Neg logit contributions
or-0.046
P-0.046
 course-0.045
ory-0.042
umper-0.041
Token her
Token ID607
Feature activation+5.81e-02

Pos logit contributions
or+0.072
P+0.071
ory+0.068
 course+0.068
umper+0.067
Neg logit contributions
 breathing-0.046
 cob-0.045
 staying-0.044
sing-0.044
 standing-0.044
Token and
Token ID290
Feature activation+1.60e-02

Pos logit contributions
sing+0.093
 breathing+0.089
 Kel+0.087
 staying+0.086
ses+0.086
Neg logit contributions
or-0.093
 course-0.090
ory-0.088
les-0.086
P-0.083
Token toys
Token ID14958
Feature activation-1.08e-02

Pos logit contributions
sing+0.096
ses+0.090
 breathing+0.087
 Kel+0.086
 staying+0.085
Neg logit contributions
or-0.102
les-0.097
 course-0.092
P-0.091
ory-0.091
Token and
Token ID290
Feature activation+1.80e-02

Pos logit contributions
P+0.018
or+0.018
 course+0.017
ory+0.017
umper+0.017
Neg logit contributions
 breathing-0.018
sing-0.018
 staying-0.017
 cob-0.017
 standing-0.017
Token for
Token ID329
Feature activation-5.24e-03

Pos logit contributions
sing+0.036
 breathing+0.035
ses+0.034
 cob+0.033
 staying+0.032
Neg logit contributions
or-0.025
 course-0.024
P-0.023
ory-0.022
-0.022
Token a
Token ID257
Feature activation+4.13e-03

Pos logit contributions
or+0.008
 course+0.008
P+0.008
ory+0.008
les+0.008
Neg logit contributions
sing-0.009
 breathing-0.009
 cob-0.009
 staying-0.008
ses-0.008
Token few
Token ID1178
Feature activation+4.00e-02

Pos logit contributions
sing+0.007
 breathing+0.007
ses+0.007
 staying+0.007
 cob+0.007
Neg logit contributions
or-0.006
P-0.006
 course-0.006
ory-0.006
-0.006
Token minutes
Token ID2431
Feature activation+1.84e-01

Pos logit contributions
sing+0.045
 breathing+0.040
ses+0.039
 staying+0.037
 cob+0.037
Neg logit contributions
 course-0.093
or-0.093
P-0.091
les-0.090
ory-0.086
Token she
Token ID673
Feature activation+1.30e-02

Pos logit contributions
sing+0.372
 cob+0.364
 breathing+0.360
ses+0.357
 staying+0.339
Neg logit contributions
or-0.260
 course-0.235
ory-0.227
P-0.224
les-0.221
Token did
Token ID750
Feature activation-1.62e-02

Pos logit contributions
 breathing+0.026
sing+0.026
 cob+0.025
 standing+0.025
 staying+0.025
Neg logit contributions
or-0.017
P-0.017
 course-0.016
umper-0.016
ory-0.016
Token,
Token ID11
Feature activation+5.63e-03

Pos logit contributions
or+0.024
P+0.024
 course+0.023
ory+0.022
les+0.022
Neg logit contributions
sing-0.030
 breathing-0.030
 cob-0.029
 staying-0.029
 standing-0.028
Token but
Token ID475
Feature activation+2.93e-02

Pos logit contributions
sing+0.008
ses+0.008
 cob+0.008
 breathing+0.008
 Kel+0.007
Neg logit contributions
 course-0.010
or-0.009
ory-0.009
-0.009
les-0.009
Token then
Token ID788
Feature activation+3.44e-02

Pos logit contributions
sing+0.047
 breathing+0.046
ses+0.044
 cob+0.042
 staying+0.040
Neg logit contributions
or-0.053
 course-0.050
ory-0.049
P-0.047
ar-0.047
Token too
Token ID1165
Feature activation+2.51e-02

Pos logit contributions
sing+0.100
 breathing+0.093
ses+0.092
 staying+0.091
 Kel+0.089
Neg logit contributions
 course-0.058
les-0.057
or-0.056
als-0.050
ar-0.050
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
sing+0.048
 breathing+0.047
 staying+0.046
 cob+0.046
 standing+0.045
Neg logit contributions
or-0.036
P-0.035
 course-0.034
ory-0.033
les-0.033
Token After
Token ID2293
Feature activation-7.62e-03

Pos logit contributions
 breathing+0.039
sing+0.039
 cob+0.038
 staying+0.037
 standing+0.037
Neg logit contributions
or-0.033
P-0.033
 course-0.032
ory-0.031
umper-0.030
Token a
Token ID257
Feature activation+8.37e-03

Pos logit contributions
 course+0.012
or+0.012
P+0.012
ory+0.011
les+0.011
Neg logit contributions
sing-0.013
 breathing-0.012
 cob-0.012
ses-0.012
 staying-0.012
Token few
Token ID1178
Feature activation+3.91e-02

Pos logit contributions
 breathing+0.014
sing+0.014
 staying+0.014
 cob+0.014
 standing+0.013
Neg logit contributions
or-0.014
P-0.014
 course-0.013
ory-0.013
les-0.013
Token minutes
Token ID2431
Feature activation+1.84e-01

Pos logit contributions
sing+0.048
 breathing+0.044
 staying+0.042
ses+0.041
 Kel+0.041
Neg logit contributions
 course-0.089
P-0.083
or-0.083
les-0.082
ar-0.079
Token,
Token ID11
Feature activation+1.20e-02

Pos logit contributions
sing+0.377
 breathing+0.373
 cob+0.364
 staying+0.359
 standing+0.350
Neg logit contributions
or-0.246
P-0.231
 course-0.230
ory-0.220
les-0.216
Token it
Token ID340
Feature activation-2.39e-03

Pos logit contributions
sing+0.022
 breathing+0.022
 staying+0.021
 cob+0.021
 standing+0.020
Neg logit contributions
or-0.018
P-0.018
 course-0.018
ory-0.017
les-0.016
Token stopped
Token ID5025
Feature activation+5.30e-02

Pos logit contributions
or+0.003
P+0.003
 course+0.003
ory+0.003
les+0.003
Neg logit contributions
 breathing-0.005
sing-0.005
 cob-0.005
 staying-0.005
 standing-0.005
Token and
Token ID290
Feature activation+1.98e-02

Pos logit contributions
ses+0.101
 cob+0.100
 Kel+0.099
sing+0.099
 Ann+0.097
Neg logit contributions
or-0.065
P-0.064
ory-0.063
 course-0.061
umper-0.061
Token the
Token ID262
Feature activation-2.21e-03

Pos logit contributions
sing+0.037
 breathing+0.037
 staying+0.036
 cob+0.035
 standing+0.035
Neg logit contributions
or-0.029
P-0.028
 course-0.028
ory-0.027
les-0.026
Token that
Token ID326
Feature activation+5.15e-03

Pos logit contributions
sing+0.012
 breathing+0.012
ses+0.011
 staying+0.011
 cob+0.011
Neg logit contributions
 course-0.010
or-0.009
les-0.009
P-0.009
ory-0.008
Token their
Token ID511
Feature activation+2.55e-02

Pos logit contributions
sing+0.011
 breathing+0.011
ses+0.011
 Kel+0.010
 staying+0.010
Neg logit contributions
 course-0.006
or-0.005
-0.005
 may-0.005
les-0.005
Token town
Token ID3240
Feature activation-2.16e-02

Pos logit contributions
 breathing+0.041
sing+0.040
 staying+0.039
 Kel+0.038
 standing+0.038
Neg logit contributions
or-0.040
 course-0.040
les-0.039
ory-0.038
-0.038
Token was
Token ID373
Feature activation+6.30e-03

Pos logit contributions
or+0.025
 course+0.023
les+0.023
P+0.022
ory+0.021
Neg logit contributions
sing-0.047
 breathing-0.046
 cob-0.045
 staying-0.044
ses-0.044
Token still
Token ID991
Feature activation-1.29e-02

Pos logit contributions
ses+0.011
sing+0.010
 cob+0.010
 breathing+0.010
 Kel+0.010
Neg logit contributions
 course-0.010
P-0.009
or-0.009
les-0.009
-0.009
Token standing
Token ID5055
Feature activation+1.84e-01

Pos logit contributions
 course+0.019
P+0.018
les+0.018
ory+0.018
or+0.017
Neg logit contributions
ses-0.023
sing-0.022
 Kel-0.021
 cob-0.021
 Ann-0.020
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
 breathing+0.270
 standing+0.269
 staying+0.257
sing+0.257
 cob+0.250
Neg logit contributions
or-0.341
 course-0.335
P-0.334
umper-0.325
ory-0.322
Token
Token ID198
Feature activation+7.80e-03

Pos logit contributions
 breathing+0.025
sing+0.025
 cob+0.024
 staying+0.024
 standing+0.024
Neg logit contributions
or-0.024
P-0.024
 course-0.023
ory-0.022
umper-0.022
Token
Token ID198
Feature activation+8.83e-03

Pos logit contributions
sing+0.013
 breathing+0.013
 cob+0.012
 staying+0.012
 standing+0.012
Neg logit contributions
or-0.013
P-0.013
 course-0.013
ory-0.012
les-0.012
TokenThe
Token ID464
Feature activation-1.65e-02

Pos logit contributions
 breathing+0.014
 cob+0.014
 staying+0.014
 standing+0.013
 beating+0.013
Neg logit contributions
ory-0.016
or-0.015
les-0.015
umper-0.014
als-0.014
Token people
Token ID661
Feature activation+1.60e-02

Pos logit contributions
or+0.028
P+0.027
 course+0.027
ory+0.026
les+0.026
Neg logit contributions
 breathing-0.027
sing-0.027
 cob-0.026
 staying-0.026
 standing-0.026
Token a
Token ID257
Feature activation-3.45e-02

Pos logit contributions
 breathing+0.021
 cob+0.021
 staying+0.021
 standing+0.020
sing+0.020
Neg logit contributions
or-0.025
P-0.024
ory-0.024
umper-0.023
 course-0.023
Token sign
Token ID1051
Feature activation-6.81e-02

Pos logit contributions
P+0.059
 course+0.059
or+0.058
ory+0.056
+0.055
Neg logit contributions
 breathing-0.055
sing-0.054
 staying-0.053
 Kel-0.051
 cob-0.051
Token that
Token ID326
Feature activation-1.39e-02

Pos logit contributions
or+0.111
P+0.108
 course+0.104
ory+0.103
umper+0.101
Neg logit contributions
 breathing-0.116
sing-0.116
 cob-0.112
 staying-0.112
 standing-0.111
Token says
Token ID1139
Feature activation-3.15e-03

Pos logit contributions
or+0.020
P+0.020
ory+0.019
umper+0.019
 course+0.018
Neg logit contributions
 breathing-0.026
sing-0.026
 cob-0.026
 staying-0.025
 standing-0.025
Token "
Token ID366
Feature activation+4.43e-02

Pos logit contributions
or+0.005
P+0.005
 course+0.005
ory+0.005
umper+0.005
Neg logit contributions
 breathing-0.005
sing-0.005
 cob-0.005
 staying-0.005
 standing-0.005
TokenP
Token ID47
Feature activation-2.18e-01

Pos logit contributions
 breathing+0.078
 staying+0.076
sing+0.076
 standing+0.075
 cob+0.072
Neg logit contributions
or-0.063
 course-0.061
les-0.061
ory-0.061
P-0.061
Tokenizza
Token ID9990
Feature activation-4.95e-02

Pos logit contributions
P+0.338
ted+0.329
les+0.325
 trigger+0.323
 course+0.321
Neg logit contributions
sing-0.337
 Ann-0.330
 standing-0.325
run-0.321
ses-0.318
Token".
Token ID1911
Feature activation-4.17e-02

Pos logit contributions
P+0.074
or+0.073
umper+0.072
 course+0.069
ory+0.068
Neg logit contributions
 standing-0.087
 staying-0.085
 breathing-0.085
 sitting-0.084
sing-0.083
Token Sara
Token ID24799
Feature activation+4.87e-03

Pos logit contributions
or+0.079
P+0.077
les+0.076
 course+0.075
umper+0.072
Neg logit contributions
 breathing-0.058
sing-0.057
 Ann-0.057
 standing-0.057
 staying-0.056
Token and
Token ID290
Feature activation-9.60e-04

Pos logit contributions
 breathing+0.007
sing+0.007
 cob+0.007
 standing+0.007
 staying+0.007
Neg logit contributions
or-0.009
P-0.009
umper-0.009
 course-0.008
ory-0.008
Token Ben
Token ID3932
Feature activation-2.48e-02

Pos logit contributions
 course+0.002
or+0.002
ory+0.002
P+0.002
umper+0.002
Neg logit contributions
 breathing-0.001
sing-0.001
 cob-0.001
 staying-0.001
 standing-0.001
Token!"
Token ID2474
Feature activation-3.93e-02

Pos logit contributions
 breathing+0.011
sing+0.011
 staying+0.010
 cob+0.010
 standing+0.010
Neg logit contributions
or-0.009
P-0.008
 course-0.008
ory-0.008
umper-0.008
Token Ben
Token ID3932
Feature activation-2.74e-02

Pos logit contributions
or+0.064
P+0.064
 course+0.062
ory+0.060
umper+0.059
Neg logit contributions
sing-0.067
 breathing-0.067
 cob-0.065
 staying-0.064
 standing-0.063
Token says
Token ID1139
Feature activation+1.53e-03

Pos logit contributions
or+0.052
P+0.049
umper+0.047
 course+0.046
ory+0.045
Neg logit contributions
 breathing-0.041
sing-0.041
 standing-0.040
 Ann-0.040
 cob-0.040
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
sing+0.003
 breathing+0.003
 cob+0.003
 staying+0.003
 standing+0.003
Neg logit contributions
or-0.002
P-0.002
 course-0.002
ory-0.002
les-0.002
Token "
Token ID366
Feature activation+5.00e-02

Pos logit contributions
P+0.016
or+0.016
 course+0.015
ory+0.015
+0.014
Neg logit contributions
sing-0.021
 breathing-0.021
 cob-0.020
 staying-0.020
 standing-0.020
TokenP
Token ID47
Feature activation-2.14e-01

Pos logit contributions
sing+0.100
 breathing+0.100
 cob+0.098
 staying+0.098
 standing+0.097
Neg logit contributions
P-0.069
or-0.067
 course-0.064
ory-0.061
-0.060
Tokenill
Token ID359
Feature activation-4.44e-02

Pos logit contributions
P+0.304
or+0.302
les+0.290
umper+0.287
uses+0.283
Neg logit contributions
sing-0.375
 Ann-0.362
ses-0.361
 cob-0.360
 standing-0.358
Tokenows
Token ID1666
Feature activation+4.71e-02

Pos logit contributions
or+0.069
P+0.068
ory+0.065
 course+0.063
umper+0.063
Neg logit contributions
 breathing-0.080
sing-0.078
 cob-0.078
 standing-0.077
 staying-0.077
Token are
Token ID389
Feature activation-1.94e-02

Pos logit contributions
 breathing+0.086
sing+0.084
 cob+0.084
 staying+0.083
 standing+0.081
Neg logit contributions
or-0.073
P-0.072
ory-0.069
umper-0.068
 course-0.066
Token fun
Token ID1257
Feature activation+4.58e-02

Pos logit contributions
P+0.030
 course+0.029
oni+0.027
ons+0.025
ade+0.025
Neg logit contributions
sing-0.032
ses-0.028
 Kel-0.028
 breathing-0.027
 apartment-0.026
Token!"
Token ID2474
Feature activation-3.98e-02

Pos logit contributions
sing+0.110
 Kel+0.105
ses+0.102
 breathing+0.101
 cob+0.101
Neg logit contributions
 course-0.044
or-0.041
P-0.041
les-0.040
-0.037
Token a
Token ID257
Feature activation-5.91e-03

Pos logit contributions
sing+0.013
 breathing+0.013
 cob+0.012
 staying+0.012
 standing+0.012
Neg logit contributions
or-0.016
P-0.016
 course-0.015
ory-0.015
les-0.014
Token bad
Token ID2089
Feature activation-2.16e-02

Pos logit contributions
or+0.010
P+0.010
 course+0.010
ory+0.009
+0.009
Neg logit contributions
 breathing-0.010
sing-0.010
 staying-0.009
 cob-0.009
 standing-0.009
Token ending
Token ID7464
Feature activation+4.52e-02

Pos logit contributions
or+0.038
 course+0.038
P+0.037
ory+0.036
les+0.035
Neg logit contributions
 breathing-0.033
 staying-0.032
 standing-0.032
 Kel-0.032
sing-0.031
Token.
Token ID13
Feature activation+4.48e-05

Pos logit contributions
sing+0.097
 cob+0.093
 breathing+0.093
 Kel+0.092
ses+0.090
Neg logit contributions
or-0.055
 course-0.051
P-0.049
ory-0.048
les-0.048
Token<|endoftext|>
Token ID50256
Feature activation-1.00e-02

Pos logit contributions
sing+0.000
 cob+0.000
 breathing+0.000
 staying+0.000
 standing+0.000
Neg logit contributions
P-0.000
or-0.000
-0.000
 course-0.000
ory-0.000
TokenP
Token ID47
Feature activation-2.08e-01

Pos logit contributions
or+0.013
ory+0.013
umper+0.012
ingu+0.012
 course+0.012
Neg logit contributions
 breathing-0.020
 standing-0.020
 sitting-0.019
 staying-0.019
 cob-0.019
Tokenoppy
Token ID26696
Feature activation-9.40e-02

Pos logit contributions
 trigger+0.331
les+0.322
ted+0.309
uses+0.308
Hey+0.308
Neg logit contributions
 Ann-0.314
 standing-0.312
 Kim-0.308
sing-0.304
 sitting-0.302
Token was
Token ID373
Feature activation-1.06e-02

Pos logit contributions
or+0.181
P+0.173
umper+0.172
 course+0.169
 trigger+0.167
Neg logit contributions
 breathing-0.134
 cob-0.131
 staying-0.130
 standing-0.127
 sitting-0.123
Token a
Token ID257
Feature activation-1.42e-02

Pos logit contributions
P+0.018
or+0.016
 course+0.016
umper+0.015
ingu+0.015
Neg logit contributions
 Kel-0.017
 cob-0.017
 breathing-0.017
sing-0.017
ses-0.016
Token tall
Token ID7331
Feature activation-2.28e-02

Pos logit contributions
or+0.028
P+0.028
 course+0.027
ory+0.027
les+0.026
Neg logit contributions
 breathing-0.019
sing-0.019
 staying-0.018
 cob-0.018
 standing-0.018
Token girl
Token ID2576
Feature activation+3.17e-02

Pos logit contributions
or+0.038
P+0.037
 course+0.035
ory+0.035
umper+0.034
Neg logit contributions
sing-0.038
 breathing-0.038
 cob-0.037
 staying-0.037
 standing-0.036
Token kitchen
Token ID9592
Feature activation+1.74e-02

Pos logit contributions
or+0.032
P+0.031
 course+0.031
ory+0.029
les+0.029
Neg logit contributions
 breathing-0.030
sing-0.030
 staying-0.029
 cob-0.029
 standing-0.029
Token.
Token ID13
Feature activation+7.70e-03

Pos logit contributions
 breathing+0.031
sing+0.031
 cob+0.030
 staying+0.030
 standing+0.030
Neg logit contributions
or-0.027
P-0.027
 course-0.026
ory-0.025
umper-0.025
Token
Token ID198
Feature activation-1.66e-03

Pos logit contributions
sing+0.014
 breathing+0.014
 cob+0.014
 staying+0.014
 standing+0.013
Neg logit contributions
or-0.011
P-0.011
 course-0.011
ory-0.011
umper-0.010
Token
Token ID198
Feature activation-9.47e-04

Pos logit contributions
or+0.003
P+0.002
ory+0.002
 course+0.002
les+0.002
Neg logit contributions
 breathing-0.003
sing-0.003
 standing-0.003
 staying-0.003
 cob-0.003
Token"
Token ID1
Feature activation-1.49e-02

Pos logit contributions
or+0.001
ory+0.001
les+0.001
 course+0.001
P+0.001
Neg logit contributions
 breathing-0.002
sing-0.002
 staying-0.002
 standing-0.002
 cob-0.002
TokenP
Token ID47
Feature activation-2.08e-01

Pos logit contributions
P+0.022
or+0.022
 course+0.021
ory+0.020
umper+0.020
Neg logit contributions
sing-0.029
 breathing-0.028
 cob-0.028
 staying-0.027
 standing-0.027
Tokenete
Token ID14471
Feature activation-9.92e-02

Pos logit contributions
P+0.296
les+0.289
 course+0.286
ted+0.281
 trigger+0.275
Neg logit contributions
sing-0.358
 standing-0.345
 Ann-0.336
 staying-0.335
 breathing-0.335
Token,
Token ID11
Feature activation+1.96e-03

Pos logit contributions
or+0.163
P+0.163
umper+0.153
ory+0.150
 course+0.148
Neg logit contributions
 breathing-0.166
 standing-0.165
 cob-0.163
sing-0.163
 staying-0.161
Token come
Token ID1282
Feature activation+2.72e-02

Pos logit contributions
sing+0.003
 breathing+0.003
 cob+0.003
 staying+0.003
 standing+0.003
Neg logit contributions
or-0.003
P-0.003
 course-0.003
ory-0.003
les-0.003
Token quick
Token ID2068
Feature activation-3.02e-02

Pos logit contributions
 breathing+0.034
sing+0.033
 standing+0.032
 staying+0.032
 cob+0.031
Neg logit contributions
or-0.058
P-0.057
 course-0.056
les-0.055
ory-0.055
Token,
Token ID11
Feature activation+2.51e-03

Pos logit contributions
or+0.040
P+0.039
 course+0.038
ory+0.037
les+0.036
Neg logit contributions
sing-0.061
 breathing-0.061
 cob-0.059
 staying-0.059
 standing-0.058
Token was
Token ID373
Feature activation-4.92e-03

Pos logit contributions
or+0.072
P+0.070
 course+0.068
ory+0.066
les+0.065
Neg logit contributions
 breathing-0.076
sing-0.075
 staying-0.072
 cob-0.072
 standing-0.071
Token seeing
Token ID4379
Feature activation+4.79e-02

Pos logit contributions
 course+0.008
P+0.008
or+0.007
ory+0.007
les+0.007
Neg logit contributions
sing-0.008
 breathing-0.008
 Kel-0.008
 cob-0.008
ses-0.007
Token.
Token ID13
Feature activation+8.46e-03

Pos logit contributions
sing+0.084
 breathing+0.082
 cob+0.079
 staying+0.078
ses+0.077
Neg logit contributions
or-0.075
 course-0.075
P-0.072
les-0.069
ory-0.069
Token
Token ID198
Feature activation-5.34e-03

Pos logit contributions
 breathing+0.014
sing+0.014
 cob+0.014
 staying+0.014
 standing+0.014
Neg logit contributions
or-0.014
P-0.013
 course-0.013
ory-0.013
les-0.012
Token
Token ID198
Feature activation-3.93e-03

Pos logit contributions
or+0.008
P+0.008
 course+0.008
ory+0.008
les+0.007
Neg logit contributions
 breathing-0.010
sing-0.010
 cob-0.009
 staying-0.009
 standing-0.009
TokenP
Token ID47
Feature activation-2.07e-01

Pos logit contributions
or+0.005
P+0.005
 course+0.005
ory+0.005
les+0.005
Neg logit contributions
 breathing-0.008
sing-0.008
 cob-0.007
 staying-0.007
 standing-0.007
Tokenolly
Token ID5098
Feature activation-6.80e-02

Pos logit contributions
 trigger+0.264
les+0.243
ted+0.236
uses+0.227
+0.219
Neg logit contributions
sing-0.386
 Ann-0.374
 standing-0.374
run-0.368
ses-0.354
Token wanted
Token ID2227
Feature activation+1.54e-02

Pos logit contributions
or+0.100
P+0.097
 course+0.093
umper+0.091
ory+0.090
Neg logit contributions
sing-0.126
 breathing-0.126
 cob-0.124
 standing-0.123
 staying-0.122
Token to
Token ID284
Feature activation-7.73e-03

Pos logit contributions
sing+0.028
 breathing+0.027
 cob+0.026
 staying+0.026
 standing+0.026
Neg logit contributions
or-0.024
P-0.023
 course-0.023
ory-0.022
les-0.021
Token find
Token ID1064
Feature activation-8.09e-03

Pos logit contributions
or+0.014
P+0.014
les+0.014
ory+0.014
umper+0.013
Neg logit contributions
 cob-0.012
 Ann-0.011
sing-0.011
 breathing-0.011
 staying-0.011
Token out
Token ID503
Feature activation-1.09e-02

Pos logit contributions
or+0.013
 course+0.013
P+0.013
les+0.012
ory+0.012
Neg logit contributions
sing-0.014
 breathing-0.014
 staying-0.013
 standing-0.013
 cob-0.013
Token!"
Token ID2474
Feature activation-4.78e-02

Pos logit contributions
sing+0.013
 breathing+0.012
 cob+0.012
 staying+0.012
 standing+0.012
Neg logit contributions
or-0.006
P-0.006
 course-0.006
-0.005
les-0.005
Token Ben
Token ID3932
Feature activation-2.17e-02

Pos logit contributions
P+0.074
 course+0.071
ory+0.071
+0.071
or+0.070
Neg logit contributions
sing-0.084
 breathing-0.084
 cob-0.082
 staying-0.082
 standing-0.080
Token says
Token ID1139
Feature activation+6.78e-03

Pos logit contributions
or+0.039
P+0.038
 course+0.036
ory+0.036
umper+0.035
Neg logit contributions
 breathing-0.034
sing-0.034
 cob-0.033
 staying-0.033
 standing-0.033
Token.
Token ID13
Feature activation-7.04e-03

Pos logit contributions
 cob+0.016
sing+0.015
 Kel+0.015
 staying+0.015
ses+0.015
Neg logit contributions
or-0.007
les-0.006
ory-0.006
-0.006
oni-0.006
Token "
Token ID366
Feature activation+5.56e-02

Pos logit contributions
P+0.013
or+0.013
 course+0.012
ory+0.012
les+0.012
Neg logit contributions
sing-0.011
 breathing-0.010
 cob-0.010
 staying-0.010
 standing-0.010
TokenP
Token ID47
Feature activation-2.06e-01

Pos logit contributions
sing+0.106
 breathing+0.106
 cob+0.103
 staying+0.102
 standing+0.101
Neg logit contributions
or-0.080
P-0.079
 course-0.076
ory-0.074
les-0.072
Tokenill
Token ID359
Feature activation-3.90e-02

Pos logit contributions
or+0.332
P+0.328
 course+0.323
umper+0.311
ory+0.309
Neg logit contributions
sing-0.343
 breathing-0.338
 cob-0.333
 staying-0.327
 standing-0.323
Tokenows
Token ID1666
Feature activation+5.37e-02

Pos logit contributions
or+0.062
P+0.060
ory+0.058
umper+0.056
 course+0.055
Neg logit contributions
 breathing-0.070
 cob-0.068
sing-0.068
 staying-0.067
 standing-0.067
Token are
Token ID389
Feature activation-1.66e-02

Pos logit contributions
 breathing+0.096
sing+0.094
 cob+0.094
 staying+0.093
 standing+0.091
Neg logit contributions
or-0.085
P-0.083
ory-0.079
umper-0.079
 course-0.075
Token fun
Token ID1257
Feature activation+4.76e-02

Pos logit contributions
P+0.022
 course+0.022
oni+0.022
ingu+0.019
 Jolly+0.019
Neg logit contributions
 Kel-0.028
sing-0.028
ses-0.025
 Lu-0.025
 apartment-0.025
Token!"
Token ID2474
Feature activation-4.55e-02

Pos logit contributions
sing+0.117
 Kel+0.114
 breathing+0.109
ses+0.109
 cob+0.109
Neg logit contributions
 course-0.042
les-0.040
or-0.038
P-0.037
-0.036
Token to
Token ID284
Feature activation-1.29e-02

Pos logit contributions
or+0.034
P+0.033
 course+0.032
ory+0.032
les+0.031
Neg logit contributions
 breathing-0.038
sing-0.038
 staying-0.036
 cob-0.036
 standing-0.036
Token taste
Token ID6938
Feature activation-7.66e-02

Pos logit contributions
or+0.023
P+0.022
 course+0.021
les+0.021
ory+0.021
Neg logit contributions
 breathing-0.021
sing-0.021
 staying-0.020
 cob-0.020
 standing-0.020
Token."
Token ID526
Feature activation-3.27e-02

Pos logit contributions
or+0.142
P+0.138
 course+0.132
ory+0.131
umper+0.131
Neg logit contributions
 breathing-0.117
 cob-0.114
 staying-0.113
 standing-0.110
sing-0.110
Token
Token ID198
Feature activation-1.51e-03

Pos logit contributions
or+0.053
P+0.052
 course+0.050
ory+0.049
les+0.048
Neg logit contributions
 breathing-0.056
sing-0.056
 staying-0.054
 cob-0.054
 standing-0.054
Token
Token ID198
Feature activation-7.27e-04

Pos logit contributions
or+0.002
P+0.002
 course+0.002
ory+0.002
les+0.002
Neg logit contributions
 breathing-0.003
sing-0.003
 cob-0.003
 staying-0.003
 standing-0.003
TokenP
Token ID47
Feature activation-2.06e-01

Pos logit contributions
or+0.001
ory+0.001
les+0.001
 course+0.001
P+0.001
Neg logit contributions
 breathing-0.001
sing-0.001
 cob-0.001
 staying-0.001
 standing-0.001
Tokenolly
Token ID5098
Feature activation-7.21e-02

Pos logit contributions
 trigger+0.257
les+0.256
uses+0.251
or+0.243
ted+0.241
Neg logit contributions
sing-0.379
 Ann-0.363
ses-0.358
 cob-0.354
run-0.352
Token thanked
Token ID26280
Feature activation-1.96e-02

Pos logit contributions
or+0.113
P+0.111
umper+0.107
 course+0.105
ory+0.103
Neg logit contributions
sing-0.126
 breathing-0.125
 standing-0.124
 cob-0.124
 staying-0.121
Token the
Token ID262
Feature activation-8.76e-03

Pos logit contributions
or+0.039
P+0.038
les+0.036
ory+0.036
umper+0.035
Neg logit contributions
 breathing-0.027
 staying-0.026
sing-0.026
 cob-0.026
 standing-0.025
Token friendly
Token ID8030
Feature activation-9.12e-03

Pos logit contributions
 course+0.016
P+0.015
+0.014
or+0.014
les+0.014
Neg logit contributions
 breathing-0.013
sing-0.013
 staying-0.013
 standing-0.012
 Kel-0.012
Token cat
Token ID3797
Feature activation+2.43e-02

Pos logit contributions
or+0.016
 course+0.016
les+0.016
P+0.016
ory+0.016
Neg logit contributions
sing-0.014
 breathing-0.014
 staying-0.013
 standing-0.013
 cob-0.012
Token with
Token ID351
Feature activation+1.30e-02

Pos logit contributions
or+0.013
umper+0.013
P+0.013
ory+0.013
 course+0.013
Neg logit contributions
 breathing-0.008
 staying-0.008
 standing-0.008
sing-0.008
 cob-0.008
Token coins
Token ID10796
Feature activation-3.69e-02

Pos logit contributions
sing+0.022
 breathing+0.022
 staying+0.021
ses+0.020
 standing+0.020
Neg logit contributions
 course-0.021
P-0.020
or-0.020
les-0.020
ory-0.019
Token!
Token ID0
Feature activation-1.38e-02

Pos logit contributions
P+0.064
or+0.062
 course+0.061
umper+0.059
ory+0.057
Neg logit contributions
 breathing-0.061
sing-0.059
 standing-0.059
 staying-0.059
 cob-0.058
Token
Token ID198
Feature activation+3.25e-03

Pos logit contributions
or+0.024
P+0.023
 course+0.023
ory+0.022
les+0.022
Neg logit contributions
 breathing-0.022
sing-0.022
 staying-0.021
 cob-0.021
 standing-0.021
Token
Token ID198
Feature activation+4.98e-03

Pos logit contributions
 breathing+0.006
sing+0.006
 cob+0.006
 staying+0.006
 standing+0.006
Neg logit contributions
or-0.005
P-0.005
 course-0.005
ory-0.004
les-0.004
TokenP
Token ID47
Feature activation-2.05e-01

Pos logit contributions
 breathing+0.009
sing+0.009
 cob+0.009
 staying+0.009
 standing+0.009
Neg logit contributions
or-0.007
P-0.007
 course-0.007
ory-0.007
les-0.006
Tokenete
Token ID14471
Feature activation-9.10e-02

Pos logit contributions
 trigger+0.219
les+0.198
P+0.192
ted+0.191
+0.190
Neg logit contributions
sing-0.424
 standing-0.410
 Ann-0.409
run-0.405
 staying-0.404
Token and
Token ID290
Feature activation+1.81e-02

Pos logit contributions
or+0.143
P+0.141
 course+0.135
umper+0.133
ory+0.128
Neg logit contributions
 breathing-0.159
 standing-0.158
sing-0.156
 cob-0.156
 staying-0.154
Token Paul
Token ID3362
Feature activation+3.66e-02

Pos logit contributions
 breathing+0.027
sing+0.027
 staying+0.026
 cob+0.026
 standing+0.026
Neg logit contributions
or-0.033
P-0.032
 course-0.031
les-0.031
ory-0.030
Token looked
Token ID3114
Feature activation-1.52e-02

Pos logit contributions
 standing+0.064
sing+0.062
 cob+0.062
 staying+0.061
 breathing+0.061
Neg logit contributions
P-0.057
or-0.053
 course-0.051
umper-0.050
-0.049
Token at
Token ID379
Feature activation+1.77e-02

Pos logit contributions
P+0.032
or+0.032
 course+0.031
umper+0.031
ory+0.029
Neg logit contributions
sing-0.019
 standing-0.018
 breathing-0.018
 cob-0.018
 staying-0.018
Token says
Token ID1139
Feature activation+1.04e-02

Pos logit contributions
or+0.028
P+0.027
 course+0.026
ory+0.026
umper+0.025
Neg logit contributions
 breathing-0.026
sing-0.026
 cob-0.025
 staying-0.025
 standing-0.024
Token.
Token ID13
Feature activation-1.13e-03

Pos logit contributions
 cob+0.025
sing+0.024
 Kel+0.023
ses+0.023
 staying+0.023
Neg logit contributions
or-0.011
les-0.009
ory-0.009
-0.009
P-0.008
Token
Token ID198
Feature activation-1.17e-02

Pos logit contributions
P+0.001
+0.001
or+0.001
 course+0.001
ory+0.001
Neg logit contributions
sing-0.003
 cob-0.003
 staying-0.003
 breathing-0.003
 standing-0.003
Token
Token ID198
Feature activation-1.04e-02

Pos logit contributions
P+0.013
or+0.013
+0.012
 course+0.012
ory+0.011
Neg logit contributions
sing-0.027
 breathing-0.026
 cob-0.026
 staying-0.026
ses-0.025
Token"
Token ID1
Feature activation-1.60e-02

Pos logit contributions
P+0.012
or+0.012
 course+0.011
ory+0.011
umper+0.010
Neg logit contributions
sing-0.023
 breathing-0.023
 cob-0.023
 staying-0.022
 standing-0.022
TokenP
Token ID47
Feature activation-2.05e-01

Pos logit contributions
P+0.023
or+0.022
 course+0.021
ory+0.020
+0.020
Neg logit contributions
sing-0.032
 breathing-0.032
 cob-0.031
 staying-0.031
 standing-0.030
Tokenray
Token ID2433
Feature activation-3.43e-02

Pos logit contributions
P+0.291
 course+0.281
or+0.279
umper+0.277
les+0.273
Neg logit contributions
sing-0.369
 breathing-0.353
 cob-0.350
 standing-0.349
 Ann-0.347
Tokening
Token ID278
Feature activation+3.75e-02

Pos logit contributions
or+0.047
P+0.045
 course+0.044
ory+0.043
les+0.043
Neg logit contributions
sing-0.068
 breathing-0.067
 cob-0.065
 staying-0.065
 standing-0.064
Token is
Token ID318
Feature activation+3.48e-03

Pos logit contributions
 breathing+0.067
sing+0.067
 cob+0.065
 staying+0.065
 standing+0.064
Neg logit contributions
or-0.058
P-0.057
ory-0.054
 course-0.054
umper-0.053
Token like
Token ID588
Feature activation-1.05e-02

Pos logit contributions
 Kel+0.005
sing+0.005
ses+0.005
 breathing+0.005
 cob+0.005
Neg logit contributions
-0.005
oni-0.005
 course-0.005
P-0.005
les-0.005
Token talking
Token ID3375
Feature activation+1.28e-02

Pos logit contributions
 course+0.016
P+0.015
+0.014
oni+0.013
or+0.013
Neg logit contributions
sing-0.019
 Kel-0.018
 breathing-0.018
ses-0.017
 staying-0.017
Token he
Token ID339
Feature activation-1.01e-02

Pos logit contributions
sing+0.065
 cob+0.061
 Kel+0.060
 breathing+0.060
ses+0.060
Neg logit contributions
 course-0.038
or-0.036
P-0.035
ory-0.034
les-0.034
Token was
Token ID373
Feature activation+7.59e-03

Pos logit contributions
or+0.017
ory+0.017
P+0.016
umper+0.016
ingu+0.016
Neg logit contributions
 breathing-0.017
 staying-0.016
 standing-0.016
 cob-0.015
 sitting-0.015
Token ok
Token ID12876
Feature activation+7.27e-02

Pos logit contributions
sing+0.012
 cob+0.012
ses+0.011
 breathing+0.011
 Kel+0.011
Neg logit contributions
P-0.013
or-0.013
 course-0.013
umper-0.012
ory-0.012
Token.
Token ID13
Feature activation+8.93e-03

Pos logit contributions
sing+0.156
 breathing+0.148
 cob+0.146
ses+0.145
 staying+0.144
Neg logit contributions
or-0.089
P-0.085
 course-0.084
ory-0.083
les-0.080
Token<|endoftext|>
Token ID50256
Feature activation-5.13e-03

Pos logit contributions
 breathing+0.016
sing+0.016
 cob+0.016
 staying+0.016
 standing+0.016
Neg logit contributions
or-0.013
P-0.013
 course-0.013
ory-0.012
les-0.012
TokenP
Token ID47
Feature activation-2.04e-01

Pos logit contributions
or+0.008
ory+0.008
umper+0.007
 course+0.007
P+0.007
Neg logit contributions
 standing-0.009
 breathing-0.009
 sitting-0.009
 staying-0.008
 Ann-0.008
Tokenete
Token ID14471
Feature activation-1.02e-01

Pos logit contributions
 trigger+0.333
les+0.316
uses+0.306
ted+0.301
Hey+0.294
Neg logit contributions
 Ann-0.305
 standing-0.301
 Kim-0.295
 cob-0.295
 sitting-0.295
Token liked
Token ID8288
Feature activation-4.46e-02

Pos logit contributions
or+0.203
 trigger+0.193
P+0.191
 course+0.186
umper+0.185
Neg logit contributions
 cob-0.135
 breathing-0.134
 staying-0.131
 standing-0.131
 sitting-0.128
Token to
Token ID284
Feature activation-1.12e-02

Pos logit contributions
or+0.063
P+0.062
 course+0.062
ory+0.059
umper+0.056
Neg logit contributions
sing-0.086
 breathing-0.083
 cob-0.081
ses-0.080
 staying-0.078
Token play
Token ID711
Feature activation-9.33e-03

Pos logit contributions
or+0.019
P+0.019
ory+0.018
les+0.018
umper+0.018
Neg logit contributions
sing-0.018
 breathing-0.018
 cob-0.018
 staying-0.018
 standing-0.017
Token hide
Token ID7808
Feature activation+2.38e-02

Pos logit contributions
or+0.020
ory+0.020
umper+0.019
use+0.019
oni+0.019
Neg logit contributions
 cob-0.010
 standing-0.010
 breathing-0.010
 staying-0.010
 sitting-0.010
Token it
Token ID340
Feature activation-1.23e-02

Pos logit contributions
sing+0.086
 breathing+0.081
ses+0.078
 Kel+0.076
 staying+0.075
Neg logit contributions
 course-0.048
P-0.046
or-0.044
-0.043
ons-0.041
Token is
Token ID318
Feature activation+3.82e-03

Pos logit contributions
or+0.019
P+0.018
 course+0.017
ory+0.017
umper+0.017
Neg logit contributions
 breathing-0.023
sing-0.022
 cob-0.022
 staying-0.022
 standing-0.022
Token?"
Token ID1701
Feature activation-3.12e-02

Pos logit contributions
 breathing+0.009
sing+0.009
 staying+0.008
 cob+0.008
 standing+0.008
Neg logit contributions
or-0.004
P-0.004
 course-0.004
ory-0.004
les-0.004
Token
Token ID198
Feature activation-6.60e-03

Pos logit contributions
or+0.055
P+0.053
 course+0.051
les+0.051
ory+0.050
Neg logit contributions
 breathing-0.049
sing-0.049
 staying-0.048
 standing-0.047
 cob-0.047
Token
Token ID198
Feature activation-4.43e-03

Pos logit contributions
or+0.011
P+0.010
ory+0.010
 course+0.010
les+0.010
Neg logit contributions
 breathing-0.011
 cob-0.011
sing-0.011
 standing-0.011
 staying-0.011
TokenP
Token ID47
Feature activation-2.04e-01

Pos logit contributions
or+0.007
les+0.007
ory+0.007
 course+0.007
P+0.006
Neg logit contributions
 breathing-0.007
 cob-0.007
sing-0.007
 standing-0.007
 staying-0.007
Tokenip
Token ID541
Feature activation-7.85e-02

Pos logit contributions
 trigger+0.259
les+0.251
uses+0.246
P+0.241
umper+0.234
Neg logit contributions
sing-0.377
 Ann-0.373
ses-0.359
 standing-0.356
 cob-0.355
Token nodded
Token ID14464
Feature activation+3.43e-03

Pos logit contributions
 trigger+0.136
P+0.135
or+0.132
glers+0.124
ory+0.123
Neg logit contributions
 Ann-0.106
 breathing-0.105
 standing-0.104
 cob-0.104
ses-0.104
Token eagerly
Token ID30130
Feature activation-5.35e-02

Pos logit contributions
sing+0.006
 breathing+0.006
 cob+0.006
 staying+0.006
 standing+0.006
Neg logit contributions
or-0.006
P-0.005
 course-0.005
ory-0.005
les-0.005
Token.
Token ID13
Feature activation+8.98e-03

Pos logit contributions
P+0.095
or+0.093
 course+0.091
umper+0.090
ory+0.087
Neg logit contributions
 breathing-0.085
sing-0.083
 standing-0.082
 staying-0.080
 cob-0.077
Token
Token ID220
Feature activation-5.44e-02

Pos logit contributions
sing+0.016
 breathing+0.016
 cob+0.015
 staying+0.015
 standing+0.014
Neg logit contributions
or-0.014
P-0.014
ory-0.014
 course-0.014
umper-0.013
Token asked
Token ID1965
Feature activation+1.12e-02

Pos logit contributions
sing+0.019
 breathing+0.018
 cob+0.018
 staying+0.018
ses+0.018
Neg logit contributions
P-0.010
or-0.009
ory-0.009
les-0.009
umper-0.009
Token.
Token ID13
Feature activation+1.61e-03

Pos logit contributions
sing+0.023
 cob+0.023
 breathing+0.023
 staying+0.022
ses+0.022
Neg logit contributions
or-0.014
P-0.014
ory-0.013
 course-0.013
umper-0.012
Token
Token ID198
Feature activation-3.45e-03

Pos logit contributions
sing+0.003
 breathing+0.003
 staying+0.003
 cob+0.003
 standing+0.003
Neg logit contributions
P-0.002
or-0.002
-0.002
umper-0.002
ory-0.002
Token
Token ID198
Feature activation-2.92e-03

Pos logit contributions
or+0.005
P+0.005
 course+0.004
ory+0.004
umper+0.004
Neg logit contributions
sing-0.007
 breathing-0.007
 cob-0.007
 staying-0.007
 standing-0.007
Token"
Token ID1
Feature activation-1.35e-02

Pos logit contributions
or+0.003
P+0.003
 course+0.003
ory+0.003
les+0.003
Neg logit contributions
 breathing-0.006
sing-0.006
 cob-0.006
 staying-0.006
 standing-0.006
TokenP
Token ID47
Feature activation-2.03e-01

Pos logit contributions
P+0.018
or+0.018
 course+0.017
ory+0.016
umper+0.016
Neg logit contributions
sing-0.028
 breathing-0.028
 cob-0.027
 staying-0.027
 standing-0.026
Tokenapa
Token ID32678
Feature activation-2.62e-02

Pos logit contributions
 course+0.306
P+0.306
umper+0.292
or+0.291
les+0.285
Neg logit contributions
sing-0.331
 Ann-0.324
 standing-0.324
 cob-0.323
 staying-0.320
Token,
Token ID11
Feature activation+8.84e-04

Pos logit contributions
P+0.043
or+0.043
umper+0.039
 course+0.038
ory+0.038
Neg logit contributions
 breathing-0.046
 standing-0.045
sing-0.044
 staying-0.044
 cob-0.044
Token where
Token ID810
Feature activation+5.06e-02

Pos logit contributions
sing+0.002
 breathing+0.002
 cob+0.002
ses+0.002
 staying+0.002
Neg logit contributions
 course-0.001
or-0.001
P-0.001
ory-0.001
umper-0.001
Token is
Token ID318
Feature activation+4.05e-03

Pos logit contributions
 breathing+0.094
 cob+0.093
 staying+0.093
sing+0.092
 standing+0.091
Neg logit contributions
or-0.075
P-0.073
 course-0.069
ory-0.069
umper-0.068
Token my
Token ID616
Feature activation+2.15e-02

Pos logit contributions
 breathing+0.005
 staying+0.005
 cob+0.005
 standing+0.005
sing+0.005
Neg logit contributions
or-0.008
P-0.008
 course-0.008
ory-0.008
les-0.008
Token believe
Token ID1975
Feature activation+3.64e-02

Pos logit contributions
or+0.092
P+0.089
 course+0.087
ory+0.085
les+0.083
Neg logit contributions
 breathing-0.097
sing-0.097
 cob-0.094
 staying-0.094
 standing-0.092
Token it
Token ID340
Feature activation-6.23e-03

Pos logit contributions
sing+0.060
 breathing+0.059
 cob+0.058
 staying+0.056
 standing+0.056
Neg logit contributions
or-0.062
P-0.060
 course-0.060
ory-0.058
les-0.057
Token!
Token ID0
Feature activation-1.21e-02

Pos logit contributions
or+0.009
P+0.009
 course+0.008
ory+0.008
umper+0.008
Neg logit contributions
 breathing-0.012
sing-0.012
 cob-0.011
 staying-0.011
 standing-0.011
Token
Token ID198
Feature activation+8.47e-03

Pos logit contributions
or+0.020
P+0.019
 course+0.019
ory+0.019
les+0.018
Neg logit contributions
 breathing-0.020
sing-0.020
 staying-0.019
 cob-0.019
 standing-0.019
Token
Token ID198
Feature activation+9.86e-03

Pos logit contributions
sing+0.016
 breathing+0.016
 cob+0.015
 staying+0.015
 standing+0.015
Neg logit contributions
or-0.013
P-0.012
 course-0.012
ory-0.012
umper-0.011
TokenP
Token ID47
Feature activation-2.03e-01

Pos logit contributions
 breathing+0.018
sing+0.018
 staying+0.018
 cob+0.018
 standing+0.018
Neg logit contributions
or-0.014
P-0.014
 course-0.013
ory-0.013
les-0.013
Tokenapa
Token ID32678
Feature activation-1.88e-02

Pos logit contributions
 trigger+0.234
les+0.215
ted+0.209
+0.200
uses+0.198
Neg logit contributions
sing-0.401
 standing-0.396
 Ann-0.392
run-0.385
 staying-0.385
Token explained
Token ID4893
Feature activation-1.90e-02

Pos logit contributions
or+0.030
P+0.029
 course+0.027
ory+0.026
umper+0.026
Neg logit contributions
 standing-0.033
 breathing-0.032
 staying-0.031
 sitting-0.031
sing-0.031
Token that
Token ID326
Feature activation+3.69e-03

Pos logit contributions
P+0.031
or+0.031
 course+0.030
ory+0.029
umper+0.029
Neg logit contributions
 breathing-0.032
sing-0.032
 cob-0.031
 staying-0.031
 standing-0.031
Token the
Token ID262
Feature activation-9.06e-03

Pos logit contributions
sing+0.007
 breathing+0.007
ses+0.007
 cob+0.007
 staying+0.007
Neg logit contributions
or-0.005
 course-0.005
P-0.005
ory-0.005
umper-0.004
Token coal
Token ID5655
Feature activation-2.43e-02

Pos logit contributions
or+0.016
P+0.015
 course+0.015
ory+0.014
les+0.014
Neg logit contributions
 breathing-0.015
sing-0.015
 cob-0.014
 staying-0.014
 standing-0.014
Token our
Token ID674
Feature activation-3.66e-03

Pos logit contributions
or+0.027
P+0.026
 course+0.025
ory+0.025
umper+0.025
Neg logit contributions
 breathing-0.021
sing-0.020
 cob-0.020
 staying-0.020
 standing-0.020
Token toys
Token ID14958
Feature activation-1.32e-02

Pos logit contributions
or+0.007
P+0.006
 course+0.006
ory+0.006
les+0.006
Neg logit contributions
 breathing-0.005
sing-0.005
 staying-0.005
 standing-0.005
 Kel-0.005
Token."
Token ID526
Feature activation-2.53e-02

Pos logit contributions
or+0.021
P+0.020
 course+0.020
ory+0.019
umper+0.019
Neg logit contributions
 breathing-0.023
sing-0.023
 staying-0.023
 cob-0.022
 standing-0.022
Token
Token ID198
Feature activation+1.75e-03

Pos logit contributions
or+0.039
P+0.038
 course+0.037
ory+0.036
les+0.036
Neg logit contributions
 breathing-0.045
sing-0.045
 cob-0.044
 staying-0.044
 standing-0.043
Token
Token ID198
Feature activation+2.57e-03

Pos logit contributions
sing+0.003
 breathing+0.003
 cob+0.003
 staying+0.003
 standing+0.003
Neg logit contributions
or-0.002
P-0.002
 course-0.002
ory-0.002
les-0.002
TokenP
Token ID47
Feature activation-2.03e-01

Pos logit contributions
 breathing+0.005
sing+0.005
 cob+0.005
 staying+0.005
 standing+0.005
Neg logit contributions
or-0.004
P-0.003
 course-0.003
ory-0.003
les-0.003
Tokenapa
Token ID32678
Feature activation-2.32e-02

Pos logit contributions
 trigger+0.293
 course+0.274
P+0.273
ted+0.270
Hey+0.266
Neg logit contributions
sing-0.331
ses-0.321
 staying-0.321
 Ann-0.315
 standing-0.315
Token says
Token ID1139
Feature activation+1.96e-02

Pos logit contributions
or+0.042
P+0.041
umper+0.038
 course+0.038
ory+0.037
Neg logit contributions
 breathing-0.036
 standing-0.035
sing-0.034
 staying-0.034
 cob-0.034
Token:
Token ID25
Feature activation+2.69e-02

Pos logit contributions
 breathing+0.037
sing+0.037
 staying+0.036
 cob+0.036
 standing+0.035
Neg logit contributions
or-0.029
P-0.028
 course-0.027
ory-0.027
umper-0.026
Token "
Token ID366
Feature activation+6.69e-02

Pos logit contributions
 breathing+0.031
sing+0.030
 standing+0.030
 staying+0.029
 sitting+0.027
Neg logit contributions
or-0.060
les-0.058
P-0.058
 course-0.057
ory-0.056
TokenYou
Token ID1639
Feature activation-2.50e-02

Pos logit contributions
 breathing+0.127
sing+0.125
 staying+0.122
 cob+0.121
 standing+0.120
Neg logit contributions
or-0.098
P-0.091
 course-0.091
ory-0.090
les-0.088
Token his
Token ID465
Feature activation+8.31e-03

Pos logit contributions
sing+0.068
 breathing+0.067
 cob+0.065
 staying+0.064
 standing+0.063
Neg logit contributions
 course-0.054
or-0.051
P-0.050
les-0.048
ory-0.048
Token tree
Token ID5509
Feature activation-6.52e-03

Pos logit contributions
 breathing+0.013
 Kel+0.013
 staying+0.013
sing+0.012
 standing+0.012
Neg logit contributions
 course-0.013
or-0.013
-0.013
ory-0.013
P-0.013
Token.
Token ID13
Feature activation+7.58e-03

Pos logit contributions
or+0.009
 course+0.008
P+0.008
ory+0.008
les+0.007
Neg logit contributions
sing-0.013
 breathing-0.013
 cob-0.013
 staying-0.012
ses-0.012
Token
Token ID198
Feature activation-7.72e-04

Pos logit contributions
 breathing+0.014
sing+0.014
 staying+0.013
 cob+0.013
 standing+0.013
Neg logit contributions
or-0.012
P-0.011
 course-0.011
ory-0.011
les-0.011
Token
Token ID198
Feature activation+1.32e-03

Pos logit contributions
or+0.001
P+0.001
ory+0.001
 course+0.001
les+0.001
Neg logit contributions
 breathing-0.001
sing-0.001
 staying-0.001
 standing-0.001
 cob-0.001
TokenP
Token ID47
Feature activation-2.03e-01

Pos logit contributions
 breathing+0.002
 staying+0.002
 standing+0.002
sing+0.002
 cob+0.002
Neg logit contributions
or-0.002
ory-0.002
les-0.002
 course-0.002
P-0.002
Tokenip
Token ID541
Feature activation-7.89e-02

Pos logit contributions
 trigger+0.223
les+0.221
ted+0.214
uses+0.210
P+0.208
Neg logit contributions
sing-0.408
 standing-0.393
 Ann-0.389
 staying-0.387
 cob-0.384
Token followed
Token ID3940
Feature activation+7.32e-03

Pos logit contributions
ory+0.125
P+0.123
or+0.123
umper+0.119
 trigger+0.119
Neg logit contributions
 cob-0.126
 standing-0.125
 breathing-0.124
 staying-0.123
 sitting-0.122
Token the
Token ID262
Feature activation-1.03e-02

Pos logit contributions
 breathing+0.011
sing+0.011
 cob+0.011
 staying+0.011
 standing+0.010
Neg logit contributions
P-0.013
 course-0.013
or-0.013
ory-0.012
umper-0.012
Token sun
Token ID4252
Feature activation+3.79e-03

Pos logit contributions
 course+0.015
P+0.014
or+0.014
les+0.014
+0.014
Neg logit contributions
sing-0.018
 breathing-0.018
 staying-0.018
 standing-0.017
ses-0.017
Token for
Token ID329
Feature activation-1.12e-02

Pos logit contributions
 cob+0.007
sing+0.007
 breathing+0.007
ses+0.007
 Kel+0.007
Neg logit contributions
or-0.005
les-0.005
als-0.005
 course-0.005
ted-0.005
Token discover
Token ID7073
Feature activation-2.04e-02

Pos logit contributions
sing+0.030
 breathing+0.029
 cob+0.029
ses+0.028
 standing+0.028
Neg logit contributions
or-0.020
P-0.019
 course-0.019
-0.019
ory-0.019
Token more
Token ID517
Feature activation+2.39e-02

Pos logit contributions
 course+0.031
or+0.029
P+0.028
les+0.027
+0.026
Neg logit contributions
sing-0.041
ses-0.039
 breathing-0.038
 cob-0.035
 staying-0.035
Token surprises
Token ID24072
Feature activation+6.17e-02

Pos logit contributions
sing+0.043
ses+0.040
 breathing+0.037
 Kel+0.035
 staying+0.034
Neg logit contributions
 course-0.040
or-0.040
les-0.039
P-0.038
ade-0.037
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
sing+0.124
 breathing+0.119
 cob+0.118
ses+0.117
 staying+0.111
Neg logit contributions
or-0.088
 course-0.082
P-0.080
ory-0.078
les-0.077
Token<|endoftext|>
Token ID50256
Feature activation-4.69e-03

Pos logit contributions
sing+0.034
 breathing+0.034
 cob+0.033
 staying+0.033
 standing+0.033
Neg logit contributions
or-0.024
P-0.023
 course-0.022
ory-0.022
umper-0.021
TokenP
Token ID47
Feature activation-2.02e-01

Pos logit contributions
or+0.008
ory+0.007
P+0.007
 course+0.007
umper+0.007
Neg logit contributions
 breathing-0.008
 standing-0.008
 staying-0.008
 cob-0.007
 Ann-0.007
Tokenete
Token ID14471
Feature activation-9.39e-02

Pos logit contributions
 trigger+0.296
ted+0.274
les+0.272
als+0.264
uses+0.262
Neg logit contributions
 standing-0.344
 Ann-0.335
 sitting-0.332
 cob-0.327
 staying-0.327
Token was
Token ID373
Feature activation-3.02e-03

Pos logit contributions
or+0.185
P+0.181
 trigger+0.178
umper+0.175
 course+0.174
Neg logit contributions
 breathing-0.122
 cob-0.122
 standing-0.121
 staying-0.120
 sitting-0.116
Token walking
Token ID6155
Feature activation+7.11e-02

Pos logit contributions
P+0.006
 course+0.005
umper+0.005
or+0.005
ingu+0.005
Neg logit contributions
 cob-0.004
sing-0.004
 breathing-0.004
 Kel-0.004
 beating-0.004
Token in
Token ID287
Feature activation-4.27e-03

Pos logit contributions
sing+0.115
 breathing+0.115
 cob+0.112
 staying+0.110
 standing+0.106
Neg logit contributions
or-0.127
P-0.121
 course-0.117
ory-0.116
les-0.114
Token the
Token ID262
Feature activation-1.06e-02

Pos logit contributions
 course+0.005
or+0.004
P+0.003
 top+0.003
+0.003
Neg logit contributions
sing-0.010
ses-0.010
 breathing-0.009
 beating-0.009
 cob-0.009
Token away
Token ID1497
Feature activation-1.14e-01

Pos logit contributions
or+0.027
P+0.027
 course+0.026
ory+0.026
umper+0.025
Neg logit contributions
 breathing-0.029
sing-0.029
 staying-0.028
 cob-0.028
 standing-0.028
Token.
Token ID13
Feature activation+1.24e-03

Pos logit contributions
or+0.099
ar+0.087
ory+0.084
les+0.083
+0.082
Neg logit contributions
sing-0.268
 cob-0.260
ses-0.255
 staying-0.252
 Kel-0.247
Token
Token ID198
Feature activation-4.48e-03

Pos logit contributions
sing+0.002
 breathing+0.002
 cob+0.002
 staying+0.002
 standing+0.002
Neg logit contributions
or-0.002
P-0.002
 course-0.002
ory-0.002
les-0.002
Token
Token ID198
Feature activation-4.03e-03

Pos logit contributions
or+0.006
P+0.006
 course+0.005
ory+0.005
umper+0.005
Neg logit contributions
sing-0.009
 breathing-0.009
 cob-0.009
 staying-0.009
 standing-0.009
Token"
Token ID1
Feature activation-1.45e-02

Pos logit contributions
or+0.006
 course+0.006
ory+0.006
les+0.006
umper+0.005
Neg logit contributions
 breathing-0.007
 standing-0.007
 cob-0.007
 staying-0.007
sing-0.007
TokenP
Token ID47
Feature activation-2.02e-01

Pos logit contributions
P+0.024
or+0.024
 course+0.023
ory+0.022
les+0.022
Neg logit contributions
sing-0.025
 breathing-0.025
 cob-0.024
 staying-0.024
 standing-0.023
Tokenhew
Token ID6391
Feature activation+2.41e-03

Pos logit contributions
 course+0.303
P+0.296
or+0.290
les+0.288
umper+0.286
Neg logit contributions
sing-0.339
ses-0.325
 breathing-0.324
 standing-0.322
 Ann-0.319
Token,
Token ID11
Feature activation+2.21e-03

Pos logit contributions
sing+0.005
 breathing+0.005
 cob+0.005
 staying+0.005
 standing+0.005
Neg logit contributions
or-0.003
P-0.003
 course-0.003
ory-0.003
les-0.003
Token that
Token ID326
Feature activation-2.27e-03

Pos logit contributions
sing+0.005
 breathing+0.005
 beating+0.004
 staying+0.004
 cob+0.004
Neg logit contributions
 course-0.003
P-0.002
or-0.002
ar-0.002
 may-0.002
Token was
Token ID373
Feature activation-1.76e-03

Pos logit contributions
or+0.003
P+0.003
 course+0.003
les+0.003
ory+0.003
Neg logit contributions
sing-0.005
 breathing-0.005
 cob-0.005
 staying-0.005
 standing-0.004
Token close
Token ID1969
Feature activation-5.93e-02

Pos logit contributions
 course+0.003
P+0.003
oni+0.002
ar+0.002
+0.002
Neg logit contributions
sing-0.003
 cob-0.003
 breathing-0.003
 Kel-0.003
ses-0.003
Token to
Token ID284
Feature activation-1.42e-02

Pos logit contributions
sing+0.004
 breathing+0.004
 cob+0.004
 staying+0.004
 standing+0.004
Neg logit contributions
or-0.003
P-0.003
 course-0.003
ory-0.003
les-0.003
Token toe
Token ID21189
Feature activation+3.99e-02

Pos logit contributions
or+0.020
 course+0.020
P+0.020
ory+0.019
les+0.019
Neg logit contributions
sing-0.026
 breathing-0.026
 cob-0.025
 staying-0.025
 standing-0.025
Token!
Token ID0
Feature activation-1.15e-02

Pos logit contributions
sing+0.078
 breathing+0.076
 cob+0.074
 staying+0.074
ses+0.072
Neg logit contributions
or-0.056
P-0.053
 course-0.052
les-0.052
ory-0.051
Token
Token ID198
Feature activation+3.43e-04

Pos logit contributions
or+0.016
P+0.016
 course+0.015
ory+0.015
umper+0.015
Neg logit contributions
sing-0.022
 breathing-0.022
 cob-0.022
 staying-0.021
 standing-0.021
Token
Token ID198
Feature activation+1.32e-03

Pos logit contributions
sing+0.001
 breathing+0.001
 cob+0.001
 staying+0.001
 standing+0.001
Neg logit contributions
or-0.000
P-0.000
 course-0.000
ory-0.000
umper-0.000
TokenP
Token ID47
Feature activation-2.02e-01

Pos logit contributions
sing+0.003
 breathing+0.003
 cob+0.003
 staying+0.003
 standing+0.002
Neg logit contributions
or-0.002
P-0.002
 course-0.002
ory-0.002
umper-0.002
Tokenip
Token ID541
Feature activation-7.61e-02

Pos logit contributions
 trigger+0.212
P+0.211
les+0.209
or+0.205
+0.201
Neg logit contributions
sing-0.426
 standing-0.403
 breathing-0.400
 staying-0.399
 cob-0.399
Token thought
Token ID1807
Feature activation-3.61e-02

Pos logit contributions
P+0.115
or+0.115
ory+0.111
 trigger+0.107
umper+0.106
Neg logit contributions
 breathing-0.131
 cob-0.129
 standing-0.127
 staying-0.126
 sitting-0.122
Token it
Token ID340
Feature activation-8.56e-03

Pos logit contributions
or+0.059
 course+0.053
les+0.052
ar+0.049
 may+0.048
Neg logit contributions
sing-0.066
ses-0.063
 breathing-0.062
 staying-0.060
 cob-0.059
Token was
Token ID373
Feature activation-1.62e-03

Pos logit contributions
or+0.008
 course+0.008
les+0.008
P+0.007
+0.007
Neg logit contributions
sing-0.020
 breathing-0.019
 staying-0.019
ses-0.019
 cob-0.019
Token funny
Token ID8258
Feature activation-2.26e-03

Pos logit contributions
 course+0.002
P+0.002
oni+0.002
umper+0.002
ory+0.002
Neg logit contributions
sing-0.003
 breathing-0.003
ses-0.003
 staying-0.003
 Kel-0.003
Token smile
Token ID8212
Feature activation-4.02e-02

Pos logit contributions
 course+0.031
or+0.029
P+0.029
+0.028
ory+0.028
Neg logit contributions
 breathing-0.039
sing-0.037
 staying-0.037
 Kel-0.036
 cob-0.036
Token.
Token ID13
Feature activation+4.21e-03

Pos logit contributions
or+0.062
P+0.061
 course+0.059
ory+0.058
les+0.057
Neg logit contributions
sing-0.072
 breathing-0.072
 cob-0.070
 staying-0.069
 standing-0.068
Token
Token ID220
Feature activation-6.12e-02

Pos logit contributions
sing+0.008
 breathing+0.008
 cob+0.008
 staying+0.007
 standing+0.007
Neg logit contributions
or-0.006
P-0.006
 course-0.006
ory-0.006
umper-0.006
Token
Token ID198
Feature activation+1.78e-03

Pos logit contributions
or+0.086
P+0.083
 course+0.081
ory+0.079
les+0.077
Neg logit contributions
sing-0.118
 breathing-0.118
 cob-0.115
 staying-0.114
 standing-0.112
Token
Token ID198
Feature activation+4.14e-03

Pos logit contributions
 breathing+0.003
sing+0.003
 cob+0.003
 staying+0.003
 standing+0.003
Neg logit contributions
or-0.003
P-0.003
 course-0.002
ory-0.002
les-0.002
TokenP
Token ID47
Feature activation-2.02e-01

Pos logit contributions
 breathing+0.008
sing+0.008
 cob+0.008
 staying+0.008
 standing+0.008
Neg logit contributions
or-0.006
P-0.006
 course-0.005
ory-0.005
les-0.005
Tokenapa
Token ID32678
Feature activation-2.15e-02

Pos logit contributions
 trigger+0.262
les+0.236
uses+0.229
ted+0.217
umper+0.214
Neg logit contributions
sing-0.371
 Ann-0.369
 standing-0.362
run-0.360
 staying-0.350
Token shook
Token ID14682
Feature activation-6.74e-03

Pos logit contributions
or+0.035
P+0.034
 course+0.031
umper+0.030
ory+0.030
Neg logit contributions
 standing-0.037
 breathing-0.037
 staying-0.036
sing-0.036
 cob-0.036
Token his
Token ID465
Feature activation+5.60e-03

Pos logit contributions
or+0.008
 course+0.007
ory+0.007
P+0.007
les+0.007
Neg logit contributions
sing-0.014
 breathing-0.014
 cob-0.014
ses-0.014
 staying-0.014
Token head
Token ID1182
Feature activation+7.91e-03

Pos logit contributions
 Kel+0.010
 staying+0.010
 breathing+0.010
 cob+0.010
 Ann+0.010
Neg logit contributions
ory-0.007
or-0.007
-0.007
les-0.007
 course-0.007
Token.
Token ID13
Feature activation+2.00e-02

Pos logit contributions
sing+0.014
 breathing+0.014
 cob+0.014
 staying+0.014
 standing+0.013
Neg logit contributions
or-0.012
P-0.012
 course-0.011
ory-0.011
les-0.011
TokenL
Token ID43
Feature activation-4.18e-03

Pos logit contributions
 breathing+0.007
sing+0.007
 cob+0.007
 staying+0.006
 standing+0.006
Neg logit contributions
or-0.005
P-0.005
 course-0.004
ory-0.004
les-0.004
Tokenily
Token ID813
Feature activation+2.65e-02

Pos logit contributions
or+0.008
P+0.008
 course+0.008
+0.008
ory+0.008
Neg logit contributions
 breathing-0.006
 cob-0.006
 staying-0.006
sing-0.006
 standing-0.006
Token said
Token ID531
Feature activation+1.56e-02

Pos logit contributions
 breathing+0.046
 cob+0.046
sing+0.046
 standing+0.046
 staying+0.045
Neg logit contributions
or-0.041
P-0.040
 course-0.038
ory-0.038
umper-0.037
Token,
Token ID11
Feature activation+6.30e-03

Pos logit contributions
sing+0.032
 breathing+0.032
 cob+0.031
 staying+0.030
 standing+0.030
Neg logit contributions
or-0.021
P-0.020
 course-0.019
ory-0.019
les-0.018
Token "
Token ID366
Feature activation+6.35e-02

Pos logit contributions
sing+0.009
 cob+0.009
 breathing+0.008
 staying+0.008
ses+0.008
Neg logit contributions
or-0.012
P-0.011
 course-0.011
ory-0.011
-0.011
TokenP
Token ID47
Feature activation-2.02e-01

Pos logit contributions
 breathing+0.117
sing+0.117
 cob+0.113
 staying+0.113
 standing+0.111
Neg logit contributions
or-0.096
P-0.092
 course-0.090
ory-0.088
les-0.087
Tokenupp
Token ID7211
Feature activation+2.84e-02

Pos logit contributions
P+0.276
les+0.271
or+0.269
 course+0.267
 trigger+0.266
Neg logit contributions
sing-0.349
 cob-0.330
ses-0.327
 standing-0.327
 breathing-0.323
Tokeny
Token ID88
Feature activation-5.69e-03

Pos logit contributions
sing+0.044
 breathing+0.044
 staying+0.041
 standing+0.039
 sitting+0.038
Neg logit contributions
P-0.052
 course-0.052
-0.049
or-0.049
-0.047
Token,
Token ID11
Feature activation+6.92e-03

Pos logit contributions
or+0.009
P+0.009
umper+0.008
 course+0.008
ory+0.008
Neg logit contributions
 breathing-0.010
sing-0.010
 cob-0.010
 staying-0.010
 standing-0.010
Token don
Token ID836
Feature activation+5.97e-03

Pos logit contributions
sing+0.012
 breathing+0.012
 cob+0.012
 staying+0.011
ses+0.011
Neg logit contributions
or-0.011
 course-0.011
P-0.010
ory-0.010
umper-0.010
Token't
Token ID470
Feature activation-5.12e-02

Pos logit contributions
 breathing+0.014
 staying+0.014
 standing+0.014
 beating+0.013
 Kel+0.013
Neg logit contributions
ory-0.006
or-0.006
P-0.006
les-0.005
umper-0.005
Token relax
Token ID8960
Feature activation+5.19e-02

Pos logit contributions
or+0.007
 course+0.007
P+0.006
+0.006
umper+0.006
Neg logit contributions
sing-0.007
 breathing-0.007
ses-0.006
 cob-0.006
 staying-0.006
Token.
Token ID13
Feature activation+2.11e-02

Pos logit contributions
sing+0.109
ses+0.105
 cob+0.105
 breathing+0.104
 staying+0.102
Neg logit contributions
or-0.064
P-0.061
 course-0.060
les-0.057
ory-0.057
Token He
Token ID679
Feature activation-1.25e-02

Pos logit contributions
 breathing+0.041
sing+0.041
 cob+0.040
 staying+0.040
 standing+0.040
Neg logit contributions
or-0.029
P-0.027
 course-0.027
ory-0.026
les-0.026
Token forgot
Token ID16453
Feature activation-3.10e-02

Pos logit contributions
or+0.017
P+0.016
 course+0.016
ory+0.015
les+0.015
Neg logit contributions
sing-0.025
 breathing-0.025
 cob-0.024
 staying-0.024
 standing-0.024
Token all
Token ID477
Feature activation-4.93e-02

Pos logit contributions
or+0.058
 course+0.055
P+0.054
les+0.053
ory+0.053
Neg logit contributions
sing-0.047
 breathing-0.045
ses-0.043
 cob-0.043
 staying-0.042
Token about
Token ID546
Feature activation+1.15e-01

Pos logit contributions
 course+0.110
les+0.106
or+0.102
ons+0.094
ade+0.094
Neg logit contributions
sing-0.068
ses-0.062
 breathing-0.048
sed-0.048
 beating-0.047
Token his
Token ID465
Feature activation+1.15e-02

Pos logit contributions
sing+0.219
 cob+0.204
ses+0.204
 breathing+0.204
 standing+0.192
Neg logit contributions
 course-0.166
or-0.161
P-0.147
umper-0.147
ory-0.146
Token missing
Token ID4814
Feature activation-7.74e-03

Pos logit contributions
 breathing+0.018
 cob+0.018
sing+0.018
 staying+0.018
 standing+0.017
Neg logit contributions
or-0.020
 course-0.019
les-0.018
P-0.018
ory-0.018
Token car
Token ID1097
Feature activation+4.91e-02

Pos logit contributions
P+0.015
or+0.014
 course+0.014
ory+0.014
umper+0.014
Neg logit contributions
 breathing-0.011
sing-0.011
 cob-0.011
 staying-0.011
 standing-0.011
Token and
Token ID290
Feature activation+1.32e-02

Pos logit contributions
 breathing+0.084
sing+0.084
 staying+0.081
 cob+0.081
 standing+0.080
Neg logit contributions
or-0.080
P-0.080
 course-0.075
ory-0.075
umper-0.073
Token had
Token ID550
Feature activation+1.55e-02

Pos logit contributions
sing+0.028
 breathing+0.027
 cob+0.026
 staying+0.026
 standing+0.026
Neg logit contributions
or-0.016
 course-0.015
P-0.015
ory-0.015
les-0.015
Token he
Token ID339
Feature activation-1.29e-02

Pos logit contributions
sing+0.005
 breathing+0.005
 staying+0.005
 cob+0.005
ses+0.005
Neg logit contributions
or-0.003
 course-0.003
P-0.003
ory-0.003
les-0.003
Token would
Token ID561
Feature activation-5.93e-02

Pos logit contributions
P+0.019
or+0.019
ory+0.018
umper+0.018
 course+0.017
Neg logit contributions
 breathing-0.024
 standing-0.024
 cob-0.024
sing-0.023
 staying-0.023
Token take
Token ID1011
Feature activation-1.33e-02

Pos logit contributions
P+0.105
or+0.103
ory+0.099
umper+0.097
les+0.095
Neg logit contributions
sing-0.096
 breathing-0.095
 staying-0.093
 cob-0.093
 standing-0.091
Token off
Token ID572
Feature activation-2.09e-02

Pos logit contributions
P+0.024
or+0.024
ory+0.024
umper+0.023
 course+0.023
Neg logit contributions
 breathing-0.020
 cob-0.020
 standing-0.019
 staying-0.019
sing-0.019
Token,
Token ID11
Feature activation+4.17e-03

Pos logit contributions
or+0.032
P+0.031
 course+0.030
ory+0.029
les+0.029
Neg logit contributions
sing-0.038
 breathing-0.038
 cob-0.037
 staying-0.036
 standing-0.036
Token flying
Token ID7348
Feature activation+9.01e-02

Pos logit contributions
sing+0.007
 breathing+0.007
 cob+0.006
ses+0.006
 staying+0.006
Neg logit contributions
 course-0.007
or-0.007
P-0.006
les-0.006
-0.006
Token through
Token ID832
Feature activation-2.99e-03

Pos logit contributions
sing+0.157
 breathing+0.157
 staying+0.151
 cob+0.151
 standing+0.148
Neg logit contributions
or-0.144
P-0.139
 course-0.137
ory-0.132
les-0.131
Token the
Token ID262
Feature activation-1.11e-02

Pos logit contributions
 course+0.004
or+0.004
P+0.004
les+0.004
ory+0.004
Neg logit contributions
sing-0.006
 breathing-0.005
 staying-0.005
ses-0.005
 cob-0.005
Token sky
Token ID6766
Feature activation+7.03e-03

Pos logit contributions
or+0.019
ory+0.018
P+0.018
umper+0.017
 course+0.017
Neg logit contributions
 cob-0.019
 breathing-0.019
sing-0.018
 standing-0.018
 staying-0.018
Token.
Token ID13
Feature activation+1.00e-02

Pos logit contributions
sing+0.013
 breathing+0.013
 cob+0.013
 staying+0.013
ses+0.013
Neg logit contributions
or-0.010
 course-0.009
P-0.009
les-0.009
ory-0.009
Token He
Token ID679
Feature activation-1.97e-02

Pos logit contributions
sing+0.020
 breathing+0.020
 cob+0.019
 staying+0.019
 standing+0.019
Neg logit contributions
or-0.014
P-0.013
 course-0.013
ory-0.012
umper-0.012
Token off
Token ID572
Feature activation-2.31e-02

Pos logit contributions
P+0.045
or+0.044
 course+0.043
ory+0.042
umper+0.042
Neg logit contributions
 breathing-0.041
 cob-0.040
 standing-0.040
sing-0.040
 staying-0.039
Token the
Token ID262
Feature activation-1.17e-02

Pos logit contributions
or+0.041
P+0.041
umper+0.039
ory+0.039
 course+0.039
Neg logit contributions
 breathing-0.035
 cob-0.035
sing-0.035
 staying-0.034
 standing-0.034
Token light
Token ID1657
Feature activation+1.93e-02

Pos logit contributions
or+0.020
 course+0.020
P+0.020
les+0.019
ory+0.019
Neg logit contributions
sing-0.018
 breathing-0.018
 staying-0.018
 standing-0.017
 cob-0.017
Token,
Token ID11
Feature activation+1.55e-03

Pos logit contributions
sing+0.035
 breathing+0.035
 cob+0.034
 staying+0.034
 standing+0.034
Neg logit contributions
or-0.029
P-0.028
 course-0.027
ory-0.027
les-0.026
Token the
Token ID262
Feature activation-1.31e-02

Pos logit contributions
sing+0.003
 breathing+0.003
ses+0.003
 staying+0.003
 cob+0.003
Neg logit contributions
or-0.002
 course-0.002
P-0.002
-0.002
ory-0.002
Token rug
Token ID14477
Feature activation+8.44e-02

Pos logit contributions
 course+0.020
les+0.019
or+0.019
P+0.019
+0.019
Neg logit contributions
 staying-0.022
sing-0.021
 breathing-0.021
 Kel-0.021
 standing-0.020
Token gl
Token ID1278
Feature activation+5.58e-02

Pos logit contributions
sing+0.174
 breathing+0.174
 cob+0.171
 standing+0.169
 staying+0.168
Neg logit contributions
or-0.108
P-0.106
ory-0.099
 course-0.099
umper-0.099
Tokenowed
Token ID6972
Feature activation+4.70e-02

Pos logit contributions
sing+0.067
 breathing+0.065
 standing+0.060
 cob+0.060
 staying+0.060
Neg logit contributions
 course-0.121
P-0.121
or-0.117
umper-0.114
ory-0.113
Token like
Token ID588
Feature activation-8.42e-03

Pos logit contributions
sing+0.084
 breathing+0.083
 staying+0.082
ses+0.081
 cob+0.080
Neg logit contributions
or-0.072
P-0.069
les-0.069
ory-0.068
 course-0.067
Token the
Token ID262
Feature activation-1.36e-02

Pos logit contributions
 course+0.014
P+0.013
or+0.013
ory+0.013
les+0.012
Neg logit contributions
sing-0.014
 breathing-0.014
 staying-0.013
 cob-0.013
 standing-0.013
Token night
Token ID1755
Feature activation-1.05e-02

Pos logit contributions
 course+0.022
P+0.021
or+0.021
les+0.020
+0.020
Neg logit contributions
sing-0.022
 breathing-0.022
 staying-0.022
 Kel-0.021
 standing-0.021
Token ranch
Token ID28752
Feature activation-5.22e-02

Pos logit contributions
or+0.011
P+0.010
umper+0.010
les+0.010
ory+0.010
Neg logit contributions
 cob-0.010
sing-0.010
 breathing-0.010
 staying-0.010
 standing-0.010
Token felt
Token ID2936
Feature activation-1.99e-02

Pos logit contributions
P+0.080
or+0.078
umper+0.074
ory+0.074
 course+0.072
Neg logit contributions
sing-0.096
 breathing-0.095
 staying-0.094
 standing-0.093
 cob-0.092
Token really
Token ID1107
Feature activation-8.82e-02

Pos logit contributions
or+0.046
P+0.045
ory+0.043
 course+0.043
umper+0.043
Neg logit contributions
 breathing-0.022
 staying-0.021
sing-0.021
 cob-0.020
 standing-0.020
Token sorry
Token ID7926
Feature activation+3.79e-02

Pos logit contributions
or+0.197
P+0.194
 course+0.190
ory+0.188
les+0.186
Neg logit contributions
sing-0.097
 breathing-0.095
 cob-0.092
 staying-0.089
 standing-0.087
Token for
Token ID329
Feature activation-7.25e-03

Pos logit contributions
sing+0.061
 breathing+0.058
 cob+0.057
ses+0.056
 staying+0.055
Neg logit contributions
or-0.066
 course-0.063
les-0.062
ory-0.060
P-0.059
Token Flash
Token ID9973
Feature activation+1.13e-01

Pos logit contributions
 course+0.011
or+0.010
Friend+0.009
P+0.009
ted+0.009
Neg logit contributions
sing-0.014
ses-0.013
 breathing-0.013
 cob-0.012
 raining-0.012
Token and
Token ID290
Feature activation+1.30e-02

Pos logit contributions
sing+0.216
 breathing+0.213
 cob+0.210
 staying+0.207
 standing+0.204
Neg logit contributions
or-0.161
P-0.151
 course-0.150
ory-0.148
les-0.145
Token decided
Token ID3066
Feature activation-4.65e-02

Pos logit contributions
sing+0.024
 breathing+0.023
 cob+0.022
 standing+0.022
 staying+0.022
Neg logit contributions
 course-0.018
or-0.018
P-0.018
ory-0.017
les-0.017
Token to
Token ID284
Feature activation-3.28e-03

Pos logit contributions
or+0.074
P+0.071
 course+0.070
ory+0.068
les+0.066
Neg logit contributions
sing-0.082
 breathing-0.081
 cob-0.079
 staying-0.077
 standing-0.076
Token be
Token ID307
Feature activation+7.53e-02

Pos logit contributions
P+0.006
or+0.006
les+0.006
ory+0.006
boards+0.006
Neg logit contributions
 staying-0.005
 breathing-0.005
sing-0.005
 cob-0.005
 standing-0.005
Token friends
Token ID2460
Feature activation-6.72e-02

Pos logit contributions
sing+0.123
 breathing+0.114
ses+0.113
 cob+0.112
 staying+0.107
Neg logit contributions
P-0.138
or-0.131
 course-0.130
ory-0.123
umper-0.121
Token wrong
Token ID2642
Feature activation-3.29e-03

Pos logit contributions
P+0.038
 course+0.038
or+0.037
ory+0.036
umper+0.035
Neg logit contributions
sing-0.037
 breathing-0.036
 cob-0.034
 staying-0.034
ses-0.033
Token.
Token ID13
Feature activation-4.38e-03

Pos logit contributions
or+0.003
P+0.003
ory+0.003
+0.003
les+0.003
Neg logit contributions
sing-0.008
 Kel-0.007
 cob-0.007
ses-0.007
 breathing-0.007
Token There
Token ID1318
Feature activation+8.23e-02

Pos logit contributions
or+0.008
P+0.008
 course+0.007
ory+0.007
umper+0.007
Neg logit contributions
 breathing-0.007
sing-0.007
 cob-0.007
 staying-0.007
 standing-0.006
Token are
Token ID389
Feature activation-2.20e-02

Pos logit contributions
 breathing+0.163
sing+0.163
 cob+0.161
 staying+0.159
 standing+0.157
Neg logit contributions
or-0.110
P-0.108
 course-0.102
ory-0.102
umper-0.100
Token eleven
Token ID22216
Feature activation+5.59e-02

Pos logit contributions
or+0.037
P+0.036
 course+0.036
ory+0.034
les+0.033
Neg logit contributions
sing-0.036
 breathing-0.036
 staying-0.034
 cob-0.034
 standing-0.034
Token pengu
Token ID38373
Feature activation+8.95e-02

Pos logit contributions
sing+0.102
 breathing+0.102
 cob+0.099
 staying+0.098
 standing+0.096
Neg logit contributions
or-0.085
P-0.082
 course-0.080
ory-0.078
les-0.077
Tokenins
Token ID1040
Feature activation-4.17e-02

Pos logit contributions
 breathing+0.150
 staying+0.148
 standing+0.148
 sitting+0.147
 cob+0.142
Neg logit contributions
or-0.132
-0.130
ory-0.130
umper-0.130
iane-0.128
Token.
Token ID13
Feature activation-5.07e-03

Pos logit contributions
or+0.069
P+0.068
 course+0.066
ory+0.065
umper+0.064
Neg logit contributions
 breathing-0.071
 staying-0.069
sing-0.068
 standing-0.067
 cob-0.067
Token Look
Token ID6803
Feature activation+4.23e-02

Pos logit contributions
or+0.010
P+0.009
 course+0.009
les+0.009
ory+0.009
Neg logit contributions
sing-0.007
 breathing-0.007
 staying-0.007
 cob-0.007
 standing-0.007
Token,
Token ID11
Feature activation-7.59e-03

Pos logit contributions
 breathing+0.076
sing+0.076
 cob+0.074
 staying+0.074
 standing+0.073
Neg logit contributions
or-0.065
P-0.063
 course-0.061
ory-0.060
umper-0.058
Token there
Token ID612
Feature activation+3.87e-02

Pos logit contributions
or+0.012
P+0.012
 course+0.012
ory+0.012
les+0.011
Neg logit contributions
sing-0.013
 breathing-0.013
 cob-0.012
 staying-0.012
 standing-0.012
Token.
Token ID13
Feature activation+5.20e-03

Pos logit contributions
or+0.068
P+0.067
 course+0.064
umper+0.063
ory+0.062
Neg logit contributions
 breathing-0.066
sing-0.064
 cob-0.064
 staying-0.063
 standing-0.063
Token The
Token ID383
Feature activation-3.57e-02

Pos logit contributions
 breathing+0.008
sing+0.008
 cob+0.008
 staying+0.008
 standing+0.008
Neg logit contributions
or-0.009
P-0.009
 course-0.009
ory-0.009
les-0.009
Token poor
Token ID3595
Feature activation-1.29e-02

Pos logit contributions
or+0.078
P+0.072
umper+0.070
 course+0.070
ar+0.069
Neg logit contributions
 breathing-0.046
 cob-0.045
sing-0.045
 standing-0.043
 sitting-0.042
Token man
Token ID582
Feature activation-2.62e-02

Pos logit contributions
or+0.027
P+0.026
 course+0.026
umper+0.024
ar+0.024
Neg logit contributions
 breathing-0.017
 cob-0.016
sing-0.016
 standing-0.016
 staying-0.016
Token felt
Token ID2936
Feature activation-2.20e-02

Pos logit contributions
or+0.043
P+0.042
 course+0.039
ory+0.038
umper+0.038
Neg logit contributions
 breathing-0.047
 standing-0.046
sing-0.045
 staying-0.044
 sitting-0.044
Token sad
Token ID6507
Feature activation+9.37e-03

Pos logit contributions
or+0.051
P+0.049
ory+0.048
 course+0.048
les+0.047
Neg logit contributions
 breathing-0.025
 cob-0.023
sing-0.023
 staying-0.023
 standing-0.022
Token for
Token ID329
Feature activation-1.02e-02

Pos logit contributions
sing+0.018
 Kel+0.017
 cob+0.017
 breathing+0.017
 staying+0.017
Neg logit contributions
or-0.014
 course-0.013
les-0.013
ory-0.013
P-0.012
Token her
Token ID607
Feature activation+5.22e-02

Pos logit contributions
 course+0.016
or+0.015
P+0.015
ory+0.015
umper+0.014
Neg logit contributions
sing-0.018
 breathing-0.017
ses-0.017
 cob-0.017
 staying-0.016
Token and
Token ID290
Feature activation+3.82e-03

Pos logit contributions
sing+0.094
 breathing+0.094
 staying+0.093
 cob+0.092
 Kel+0.092
Neg logit contributions
or-0.075
ory-0.073
 course-0.072
les-0.069
oni-0.067
Token decided
Token ID3066
Feature activation-5.04e-02

Pos logit contributions
 breathing+0.007
sing+0.007
 cob+0.007
 staying+0.007
 standing+0.007
Neg logit contributions
or-0.006
P-0.006
 course-0.005
ory-0.005
umper-0.005
Token to
Token ID284
Feature activation-6.59e-03

Pos logit contributions
or+0.089
P+0.087
 course+0.085
ory+0.083
les+0.083
Neg logit contributions
 breathing-0.081
sing-0.079
 staying-0.078
 cob-0.077
 standing-0.077
Token She
Token ID1375
Feature activation+1.54e-03

Pos logit contributions
 breathing+0.027
sing+0.027
 cob+0.026
 staying+0.026
 standing+0.025
Neg logit contributions
or-0.024
P-0.023
 course-0.023
ory-0.022
les-0.022
Token asked
Token ID1965
Feature activation+1.69e-02

Pos logit contributions
 breathing+0.003
sing+0.003
 cob+0.003
 standing+0.003
 staying+0.003
Neg logit contributions
or-0.002
P-0.002
 course-0.002
ory-0.002
umper-0.002
Token the
Token ID262
Feature activation-5.35e-03

Pos logit contributions
sing+0.027
 breathing+0.027
 cob+0.026
 staying+0.025
 standing+0.025
Neg logit contributions
or-0.030
P-0.029
 course-0.029
ory-0.028
les-0.027
Token goose
Token ID37246
Feature activation-3.37e-02

Pos logit contributions
or+0.010
P+0.009
umper+0.009
 course+0.009
ory+0.009
Neg logit contributions
sing-0.008
 breathing-0.008
 cob-0.008
 standing-0.008
 staying-0.008
Token if
Token ID611
Feature activation+3.30e-02

Pos logit contributions
P+0.061
or+0.061
 course+0.058
ory+0.057
umper+0.056
Neg logit contributions
 breathing-0.051
sing-0.051
 staying-0.050
 cob-0.049
 standing-0.049
Token she
Token ID673
Feature activation+9.06e-03

Pos logit contributions
 breathing+0.048
 staying+0.047
sing+0.047
 cob+0.047
 standing+0.046
Neg logit contributions
or-0.061
P-0.060
 course-0.058
ory-0.058
les-0.057
Token could
Token ID714
Feature activation-3.79e-02

Pos logit contributions
 breathing+0.019
sing+0.019
 cob+0.019
 staying+0.019
 standing+0.019
Neg logit contributions
or-0.011
P-0.011
 course-0.010
ory-0.010
umper-0.010
Token spin
Token ID7906
Feature activation+4.61e-02

Pos logit contributions
P+0.069
or+0.066
ory+0.065
 course+0.064
umper+0.063
Neg logit contributions
 breathing-0.058
sing-0.058
 staying-0.057
 standing-0.056
 cob-0.055
Token too
Token ID1165
Feature activation+1.53e-02

Pos logit contributions
sing+0.076
 breathing+0.075
 cob+0.072
 staying+0.072
 standing+0.071
Neg logit contributions
or-0.079
P-0.077
 course-0.075
ory-0.073
les-0.072
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
 breathing+0.027
sing+0.026
 cob+0.026
 staying+0.026
 standing+0.025
Neg logit contributions
or-0.024
P-0.024
 course-0.023
ory-0.023
umper-0.022
Token
Token ID220
Feature activation-5.22e-02

Pos logit contributions
sing+0.021
 breathing+0.021
 staying+0.020
 standing+0.020
 Ann+0.020
Neg logit contributions
or-0.024
P-0.023
les-0.023
 course-0.022
umper-0.022
Token was
Token ID373
Feature activation-8.50e-03

Pos logit contributions
sing+0.085
 breathing+0.084
 staying+0.080
 cob+0.079
ses+0.079
Neg logit contributions
or-0.037
 course-0.036
P-0.034
ory-0.031
les-0.031
Token a
Token ID257
Feature activation-9.03e-03

Pos logit contributions
P+0.013
 course+0.011
ory+0.010
+0.010
umper+0.010
Neg logit contributions
sing-0.017
 breathing-0.017
 staying-0.016
 cob-0.016
 beating-0.016
Token brave
Token ID14802
Feature activation-3.37e-02

Pos logit contributions
P+0.016
ory+0.015
or+0.015
 course+0.015
+0.015
Neg logit contributions
 breathing-0.014
sing-0.014
 staying-0.014
 standing-0.013
 cob-0.013
Token little
Token ID1310
Feature activation+1.26e-02

Pos logit contributions
or+0.056
P+0.055
 course+0.054
ory+0.054
les+0.052
Neg logit contributions
 breathing-0.056
sing-0.056
 staying-0.054
 cob-0.053
 standing-0.053
Token girl
Token ID2576
Feature activation+3.38e-02

Pos logit contributions
 breathing+0.020
sing+0.020
 staying+0.020
 standing+0.019
 cob+0.019
Neg logit contributions
P-0.022
or-0.022
ory-0.021
 course-0.021
les-0.020
Token called
Token ID1444
Feature activation-1.34e-02

Pos logit contributions
sing+0.071
 breathing+0.070
 staying+0.068
 cob+0.067
ses+0.066
Neg logit contributions
or-0.043
P-0.043
 course-0.040
ory-0.038
les-0.037
Token L
Token ID406
Feature activation+1.78e-02

Pos logit contributions
or+0.028
P+0.027
 course+0.026
ory+0.026
ar+0.025
Neg logit contributions
sing-0.018
 breathing-0.017
 staying-0.015
 beating-0.015
 standing-0.015
Tokenizzy
Token ID40593
Feature activation-2.38e-02

Pos logit contributions
 cob+0.027
 breathing+0.027
 standing+0.027
 staying+0.026
sing+0.026
Neg logit contributions
 course-0.029
-0.028
P-0.028
or-0.028
 trigger-0.027
Token.
Token ID13
Feature activation+1.85e-04

Pos logit contributions
P+0.040
or+0.040
 course+0.039
umper+0.037
ory+0.037
Neg logit contributions
 breathing-0.040
sing-0.039
 cob-0.038
 staying-0.038
 standing-0.037
Token She
Token ID1375
Feature activation-7.60e-03

Pos logit contributions
sing+0.000
 breathing+0.000
 staying+0.000
 standing+0.000
 cob+0.000
Neg logit contributions
or-0.000
P-0.000
 course-0.000
ory-0.000
les-0.000
Token loved
Token ID6151
Feature activation-4.23e-02

Pos logit contributions
or+0.016
ory+0.015
 course+0.015
umper+0.015
P+0.014
Neg logit contributions
 cob-0.010
 breathing-0.010
 staying-0.009
sing-0.009
 standing-0.009
Token and
Token ID290
Feature activation+1.01e-02

Pos logit contributions
sing+0.037
 breathing+0.036
 cob+0.035
 staying+0.035
 standing+0.034
Neg logit contributions
or-0.026
 course-0.026
P-0.025
ory-0.025
les-0.024
Token she
Token ID673
Feature activation+1.31e-02

Pos logit contributions
sing+0.019
ses+0.019
 breathing+0.018
 staying+0.017
 standing+0.016
Neg logit contributions
 course-0.015
 tasted-0.013
P-0.013
als-0.013
 may-0.013
Token felt
Token ID2936
Feature activation-2.00e-02

Pos logit contributions
 breathing+0.026
sing+0.026
 cob+0.025
 staying+0.025
 standing+0.024
Neg logit contributions
or-0.018
P-0.018
 course-0.017
umper-0.017
ory-0.017
Token so
Token ID523
Feature activation+1.05e-02

Pos logit contributions
or+0.036
P+0.036
 course+0.035
ory+0.034
les+0.034
Neg logit contributions
sing-0.030
 breathing-0.029
 cob-0.028
 staying-0.028
 standing-0.028
Token free
Token ID1479
Feature activation-4.88e-02

Pos logit contributions
sing+0.013
 cob+0.012
 breathing+0.012
ses+0.012
 standing+0.012
Neg logit contributions
or-0.021
P-0.021
ory-0.021
 course-0.020
les-0.020
Token like
Token ID588
Feature activation-3.57e-03

Pos logit contributions
or+0.059
 course+0.056
P+0.055
les+0.053
ory+0.052
Neg logit contributions
sing-0.106
 breathing-0.101
 cob-0.100
ses-0.098
 staying-0.098
Token a
Token ID257
Feature activation+3.74e-04

Pos logit contributions
 course+0.006
or+0.006
P+0.005
ory+0.005
les+0.005
Neg logit contributions
sing-0.006
 breathing-0.006
ses-0.006
 staying-0.006
 cob-0.006
Token bird
Token ID6512
Feature activation+2.89e-02

Pos logit contributions
 breathing+0.001
sing+0.001
 cob+0.001
 staying+0.001
 standing+0.001
Neg logit contributions
or-0.001
P-0.001
 course-0.001
ory-0.001
les-0.001
Token.
Token ID13
Feature activation+3.60e-03

Pos logit contributions
sing+0.058
 breathing+0.056
 cob+0.055
 staying+0.055
ses+0.054
Neg logit contributions
or-0.039
 course-0.037
P-0.037
les-0.035
ory-0.035
Token
Token ID198
Feature activation+4.69e-04

Pos logit contributions
 breathing+0.006
sing+0.006
 staying+0.006
 cob+0.006
 standing+0.006
Neg logit contributions
or-0.006
P-0.006
 course-0.006
ory-0.006
les-0.006
Token
Token ID198
Feature activation+2.03e-03

Pos logit contributions
 breathing+0.001
 standing+0.001
 staying+0.001
 cob+0.001
sing+0.001
Neg logit contributions
or-0.001
ory-0.001
P-0.001
 course-0.001
les-0.001
Token
Token ID198
Feature activation-2.40e-03

Pos logit contributions
sing+0.009
 breathing+0.009
 cob+0.009
 staying+0.009
 standing+0.009
Neg logit contributions
or-0.007
P-0.006
 course-0.006
ory-0.006
les-0.006
Token
Token ID198
Feature activation-1.03e-03

Pos logit contributions
or+0.004
P+0.004
 course+0.004
ory+0.004
umper+0.004
Neg logit contributions
 breathing-0.004
sing-0.004
 staying-0.004
 standing-0.004
 cob-0.004
TokenJane
Token ID41083
Feature activation+1.32e-02

Pos logit contributions
or+0.002
P+0.001
umper+0.001
 course+0.001
ory+0.001
Neg logit contributions
 breathing-0.002
sing-0.002
 cob-0.002
 standing-0.002
 staying-0.002
Token said
Token ID531
Feature activation+1.35e-02

Pos logit contributions
 breathing+0.024
sing+0.024
 cob+0.023
 standing+0.023
 staying+0.023
Neg logit contributions
or-0.020
P-0.020
 course-0.019
ory-0.018
umper-0.018
Token goodbye
Token ID24829
Feature activation+2.03e-02

Pos logit contributions
sing+0.031
 breathing+0.030
 cob+0.030
 staying+0.029
ses+0.029
Neg logit contributions
or-0.016
P-0.014
ory-0.013
ar-0.013
 course-0.013
Token and
Token ID290
Feature activation+9.48e-03

Pos logit contributions
 cob+0.040
 breathing+0.039
 staying+0.038
sing+0.038
 standing+0.037
Neg logit contributions
or-0.028
ory-0.026
P-0.025
-0.024
ar-0.024
Token ran
Token ID4966
Feature activation+2.05e-02

Pos logit contributions
sing+0.021
 breathing+0.021
 cob+0.020
 staying+0.020
ses+0.020
Neg logit contributions
or-0.011
P-0.010
 course-0.010
ory-0.010
les-0.009
Token off
Token ID572
Feature activation-1.97e-02

Pos logit contributions
 breathing+0.034
sing+0.033
 staying+0.032
 cob+0.032
 standing+0.032
Neg logit contributions
or-0.034
P-0.034
 course-0.034
umper-0.032
ory-0.032
Token to
Token ID284
Feature activation-2.65e-03

Pos logit contributions
or+0.029
P+0.027
ory+0.026
 course+0.025
les+0.025
Neg logit contributions
sing-0.037
 cob-0.037
 breathing-0.036
ses-0.035
 staying-0.035
Token play
Token ID711
Feature activation-9.56e-03

Pos logit contributions
or+0.003
 course+0.003
P+0.003
ory+0.003
+0.003
Neg logit contributions
sing-0.006
 breathing-0.005
 cob-0.005
ses-0.005
 staying-0.005
Token with
Token ID351
Feature activation+1.67e-02

Pos logit contributions
or+0.014
P+0.014
 course+0.013
les+0.013
ory+0.012
Neg logit contributions
sing-0.019
 breathing-0.018
 cob-0.018
 staying-0.018
ses-0.017
Token just
Token ID655
Feature activation-2.56e-02

Pos logit contributions
P+0.023
or+0.023
ory+0.022
umper+0.022
 course+0.021
Neg logit contributions
 breathing-0.019
 cob-0.018
 staying-0.018
 standing-0.017
 Ann-0.016
Token a
Token ID257
Feature activation+1.14e-03

Pos logit contributions
or+0.045
P+0.045
 course+0.043
ory+0.043
umper+0.042
Neg logit contributions
 breathing-0.041
sing-0.040
 staying-0.039
 cob-0.039
 standing-0.038
Token little
Token ID1310
Feature activation+2.46e-02

Pos logit contributions
 breathing+0.002
sing+0.002
 cob+0.002
 staying+0.002
 standing+0.002
Neg logit contributions
or-0.002
P-0.002
 course-0.002
umper-0.002
ory-0.002
Token peek
Token ID27185
Feature activation+1.37e-02

Pos logit contributions
 breathing+0.048
sing+0.048
 staying+0.047
 cob+0.047
 standing+0.046
Neg logit contributions
or-0.033
P-0.032
 course-0.031
ory-0.031
les-0.030
Token.
Token ID13
Feature activation+4.19e-03

Pos logit contributions
 breathing+0.023
sing+0.023
 standing+0.022
 staying+0.022
 cob+0.022
Neg logit contributions
P-0.022
or-0.022
 course-0.022
ory-0.021
umper-0.020
Token He
Token ID679
Feature activation-1.94e-02

Pos logit contributions
 breathing+0.006
sing+0.006
 Ann+0.006
 standing+0.006
 staying+0.006
Neg logit contributions
or-0.007
P-0.007
les-0.007
 course-0.007
umper-0.007
Token slowly
Token ID6364
Feature activation-3.58e-02

Pos logit contributions
or+0.029
P+0.028
 course+0.027
umper+0.027
ory+0.026
Neg logit contributions
 cob-0.036
 breathing-0.035
 standing-0.035
sing-0.035
 staying-0.034
Token put
Token ID1234
Feature activation+7.11e-03

Pos logit contributions
or+0.051
P+0.050
 course+0.048
ory+0.047
umper+0.046
Neg logit contributions
 breathing-0.068
sing-0.067
 cob-0.067
 standing-0.066
 staying-0.066
Token his
Token ID465
Feature activation+9.12e-03

Pos logit contributions
 breathing+0.008
 standing+0.008
 cob+0.007
 sitting+0.007
 Kim+0.007
Neg logit contributions
or-0.015
ory-0.015
P-0.015
opus-0.014
umper-0.014
Token hand
Token ID1021
Feature activation+3.37e-02

Pos logit contributions
sing+0.011
 breathing+0.011
 cob+0.011
 staying+0.010
 standing+0.010
Neg logit contributions
or-0.020
P-0.019
 course-0.019
umper-0.019
ory-0.018
Token over
Token ID625
Feature activation+1.61e-02

Pos logit contributions
sing+0.060
 breathing+0.058
 cob+0.058
 staying+0.057
ses+0.056
Neg logit contributions
or-0.054
P-0.051
 course-0.049
les-0.048
ory-0.048
Token puts
Token ID7584
Feature activation+4.24e-02

Pos logit contributions
or+0.042
P+0.040
ory+0.038
umper+0.038
 course+0.037
Neg logit contributions
 cob-0.032
 breathing-0.031
sing-0.030
 standing-0.030
 staying-0.030
Token on
Token ID319
Feature activation+1.66e-02

Pos logit contributions
 breathing+0.057
 standing+0.055
 cob+0.054
sing+0.054
 Kim+0.052
Neg logit contributions
or-0.083
P-0.081
ory-0.081
umper-0.076
 course-0.075
Token his
Token ID465
Feature activation+5.96e-03

Pos logit contributions
 breathing+0.025
 cob+0.024
sing+0.023
 standing+0.023
 Ann+0.022
Neg logit contributions
or-0.032
P-0.031
ory-0.030
umper-0.030
 course-0.029
Token mask
Token ID9335
Feature activation-4.59e-02

Pos logit contributions
 breathing+0.011
sing+0.011
 cob+0.011
 staying+0.010
 standing+0.010
Neg logit contributions
or-0.009
P-0.009
ory-0.009
 course-0.008
umper-0.008
Token.
Token ID13
Feature activation+6.61e-03

Pos logit contributions
P+0.085
or+0.084
ory+0.083
umper+0.082
 course+0.081
Neg logit contributions
 breathing-0.067
sing-0.065
 standing-0.063
 cob-0.062
 staying-0.062
Token He
Token ID679
Feature activation-2.09e-02

Pos logit contributions
 breathing+0.013
sing+0.013
 cob+0.012
 staying+0.012
 standing+0.012
Neg logit contributions
or-0.010
P-0.009
 course-0.009
ory-0.009
les-0.009
Token thanks
Token ID5176
Feature activation-2.81e-02

Pos logit contributions
or+0.040
P+0.039
ory+0.038
umper+0.037
 course+0.036
Neg logit contributions
 cob-0.031
 breathing-0.030
sing-0.029
 staying-0.029
 standing-0.029
Token Sam
Token ID3409
Feature activation-2.40e-02

Pos logit contributions
or+0.055
P+0.055
 course+0.052
ory+0.052
les+0.051
Neg logit contributions
 breathing-0.039
sing-0.039
 cob-0.038
 staying-0.037
 standing-0.037
Token.
Token ID13
Feature activation+6.12e-03

Pos logit contributions
or+0.039
P+0.037
 course+0.037
ory+0.036
les+0.035
Neg logit contributions
sing-0.042
 breathing-0.041
 cob-0.040
 staying-0.040
 standing-0.039
Token
Token ID198
Feature activation-2.75e-03

Pos logit contributions
 breathing+0.011
sing+0.011
 cob+0.011
 staying+0.011
 standing+0.011
Neg logit contributions
or-0.009
P-0.009
 course-0.009
ory-0.009
les-0.008
Token
Token ID198
Feature activation-2.18e-03

Pos logit contributions
or+0.004
P+0.004
 course+0.004
ory+0.004
les+0.004
Neg logit contributions
 breathing-0.005
sing-0.005
 cob-0.005
 staying-0.005
 standing-0.005
Token't
Token ID470
Feature activation-4.63e-02

Pos logit contributions
ory+0.024
or+0.024
P+0.022
les+0.021
 course+0.021
Neg logit contributions
 breathing-0.056
 standing-0.055
 staying-0.054
 cob-0.052
 sitting-0.052
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
or+0.089
P+0.085
ted+0.082
ory+0.082
boards+0.081
Neg logit contributions
 breathing-0.068
 cob-0.066
sing-0.064
 staying-0.063
 standing-0.062
Token I
Token ID314
Feature activation+3.29e-02

Pos logit contributions
 breathing+0.017
sing+0.017
 staying+0.016
 cob+0.016
 standing+0.016
Neg logit contributions
or-0.017
P-0.016
 course-0.016
ory-0.016
-0.015
Token promise
Token ID6991
Feature activation-3.34e-02

Pos logit contributions
sing+0.054
 breathing+0.053
 cob+0.052
 staying+0.050
 standing+0.049
Neg logit contributions
P-0.058
or-0.055
umper-0.055
 course-0.054
les-0.051
Token."
Token ID526
Feature activation-3.28e-02

Pos logit contributions
or+0.050
P+0.049
 course+0.047
ory+0.046
les+0.045
Neg logit contributions
 breathing-0.062
sing-0.062
 staying-0.060
 cob-0.060
 standing-0.059
Token He
Token ID679
Feature activation-1.88e-02

Pos logit contributions
or+0.058
P+0.056
 course+0.055
ory+0.053
umper+0.053
Neg logit contributions
 breathing-0.053
sing-0.053
 cob-0.051
 staying-0.050
 standing-0.049
Token gives
Token ID3607
Feature activation-1.36e-02

Pos logit contributions
or+0.036
P+0.034
ory+0.033
umper+0.033
 course+0.032
Neg logit contributions
 cob-0.028
 breathing-0.027
sing-0.026
ses-0.026
 standing-0.026
Token Lily
Token ID20037
Feature activation+1.04e-02

Pos logit contributions
or+0.033
P+0.032
ory+0.032
+0.031
+0.031
Neg logit contributions
 Ann-0.011
 cob-0.011
 breathing-0.011
 Kim-0.011
 staying-0.010
Token a
Token ID257
Feature activation-3.78e-03

Pos logit contributions
 breathing+0.017
sing+0.017
 cob+0.016
 staying+0.016
 standing+0.016
Neg logit contributions
or-0.018
P-0.017
 course-0.017
ory-0.016
les-0.016
Token hug
Token ID16225
Feature activation-6.52e-02

Pos logit contributions
or+0.006
P+0.006
 course+0.006
ory+0.006
umper+0.005
Neg logit contributions
 breathing-0.007
sing-0.007
 cob-0.007
 staying-0.006
 standing-0.006
Token.
Token ID13
Feature activation+8.63e-03

Pos logit contributions
P+0.129
or+0.125
ory+0.122
umper+0.118
 course+0.118
Neg logit contributions
 breathing-0.094
 standing-0.087
 staying-0.086
sing-0.085
 sitting-0.084
Token take
Token ID1011
Feature activation-1.89e-02

Pos logit contributions
 breathing+0.036
sing+0.036
 staying+0.035
 cob+0.034
 standing+0.034
Neg logit contributions
or-0.037
P-0.037
 course-0.036
les-0.035
ory-0.035
Token the
Token ID262
Feature activation-1.32e-02

Pos logit contributions
or+0.029
P+0.029
 course+0.028
ory+0.027
les+0.027
Neg logit contributions
sing-0.035
 breathing-0.034
 cob-0.032
 staying-0.032
 standing-0.032
Token water
Token ID1660
Feature activation-7.73e-02

Pos logit contributions
or+0.021
P+0.020
 course+0.019
umper+0.019
ory+0.019
Neg logit contributions
 cob-0.024
 breathing-0.024
sing-0.024
 staying-0.023
 standing-0.023
Token,
Token ID11
Feature activation-3.94e-03

Pos logit contributions
P+0.144
or+0.138
 course+0.135
umper+0.133
+0.130
Neg logit contributions
 breathing-0.116
sing-0.113
 staying-0.112
 cob-0.112
 standing-0.110
Token but
Token ID475
Feature activation+1.58e-02

Pos logit contributions
or+0.007
 course+0.007
P+0.007
ory+0.007
umper+0.007
Neg logit contributions
sing-0.006
 breathing-0.006
 cob-0.005
 standing-0.005
 staying-0.005
Token they
Token ID484
Feature activation-4.16e-02

Pos logit contributions
sing+0.032
 breathing+0.031
 cob+0.030
ses+0.030
 beating+0.029
Neg logit contributions
or-0.018
 course-0.018
ory-0.018
uses-0.017
 may-0.017
Token wanted
Token ID2227
Feature activation+1.44e-02

Pos logit contributions
P+0.073
or+0.071
umper+0.065
les+0.064
ory+0.063
Neg logit contributions
 cob-0.068
sing-0.068
 staying-0.067
 breathing-0.066
 standing-0.063
Token it
Token ID340
Feature activation-1.19e-02

Pos logit contributions
sing+0.025
 breathing+0.025
 cob+0.024
 staying+0.024
 standing+0.024
Neg logit contributions
or-0.023
P-0.022
 course-0.022
ory-0.021
les-0.021
Token so
Token ID523
Feature activation+2.35e-03

Pos logit contributions
or+0.018
P+0.018
 course+0.017
ory+0.017
les+0.016
Neg logit contributions
sing-0.021
 breathing-0.021
 cob-0.021
 staying-0.020
 standing-0.020
Token much
Token ID881
Feature activation+2.96e-02

Pos logit contributions
sing+0.003
 breathing+0.003
 staying+0.003
 standing+0.003
 cob+0.003
Neg logit contributions
or-0.004
P-0.004
 course-0.004
ory-0.004
-0.004
Token.
Token ID13
Feature activation-4.69e-03

Pos logit contributions
sing+0.059
 breathing+0.058
 cob+0.057
 staying+0.056
 standing+0.056
Neg logit contributions
or-0.040
 course-0.039
P-0.039
ory-0.037
les-0.036
Token pond
Token ID16723
Feature activation-4.63e-02

Pos logit contributions
sing+0.020
 breathing+0.020
 staying+0.019
 standing+0.018
 cob+0.018
Neg logit contributions
or-0.018
P-0.017
 course-0.017
ory-0.017
les-0.016
Token.
Token ID13
Feature activation-2.65e-03

Pos logit contributions
or+0.075
P+0.073
 course+0.070
ory+0.069
umper+0.069
Neg logit contributions
 breathing-0.080
sing-0.079
 staying-0.077
 standing-0.077
 cob-0.077
Token The
Token ID383
Feature activation-3.73e-02

Pos logit contributions
or+0.004
P+0.004
 course+0.004
ory+0.004
umper+0.004
Neg logit contributions
sing-0.005
 breathing-0.005
 cob-0.005
 staying-0.004
 standing-0.004
Token water
Token ID1660
Feature activation-7.44e-02

Pos logit contributions
or+0.057
P+0.056
 course+0.053
ory+0.052
umper+0.052
Neg logit contributions
sing-0.068
 breathing-0.067
 cob-0.066
 staying-0.065
 standing-0.064
Token was
Token ID373
Feature activation+1.66e-03

Pos logit contributions
or+0.109
P+0.108
 course+0.101
umper+0.100
ory+0.100
Neg logit contributions
sing-0.140
 breathing-0.140
 cob-0.136
 staying-0.136
 standing-0.135
Token still
Token ID991
Feature activation-1.89e-02

Pos logit contributions
sing+0.003
ses+0.003
 cob+0.002
 Ann+0.002
 breathing+0.002
Neg logit contributions
 course-0.003
P-0.003
ory-0.002
or-0.002
oni-0.002
Token,
Token ID11
Feature activation+7.66e-03

Pos logit contributions
P+0.030
les+0.029
ory+0.029
oni+0.029
 course+0.028
Neg logit contributions
 Ann-0.028
sing-0.028
 Kel-0.028
ses-0.028
 cob-0.027
Token and
Token ID290
Feature activation+8.70e-03

Pos logit contributions
sing+0.012
ses+0.012
 breathing+0.012
 cob+0.012
 Kel+0.011
Neg logit contributions
 course-0.013
or-0.011
les-0.011
uses-0.011
P-0.011
Token it
Token ID340
Feature activation-6.83e-03

Pos logit contributions
sing+0.017
 breathing+0.017
ses+0.016
 staying+0.016
 cob+0.016
Neg logit contributions
 course-0.012
or-0.011
P-0.011
les-0.011
ory-0.010
Token was
Token ID373
Feature activation+4.59e-03

Pos logit contributions
or+0.009
P+0.009
 course+0.008
ory+0.008
les+0.008
Neg logit contributions
sing-0.014
 breathing-0.014
 cob-0.013
 staying-0.013
 standing-0.013
Token hard
Token ID1327
Feature activation+1.81e-02

Pos logit contributions
sing+0.007
ses+0.007
 Kel+0.007
 Ann+0.007
 cob+0.007
Neg logit contributions
 course-0.007
oni-0.006
P-0.006
ory-0.006
les-0.006
TokenThe
Token ID464
Feature activation-2.37e-02

Pos logit contributions
 cob+0.010
 breathing+0.010
 staying+0.010
 standing+0.009
 beating+0.009
Neg logit contributions
ory-0.011
or-0.011
les-0.010
umper-0.010
ingu-0.010
Token girl
Token ID2576
Feature activation+4.37e-02

Pos logit contributions
or+0.049
P+0.044
 course+0.044
umper+0.044
ory+0.043
Neg logit contributions
 breathing-0.031
 cob-0.031
 staying-0.030
 standing-0.030
sing-0.029
Token enjoyed
Token ID8359
Feature activation-2.99e-02

Pos logit contributions
 breathing+0.074
 staying+0.074
 cob+0.074
 standing+0.073
sing+0.073
Neg logit contributions
or-0.070
P-0.069
umper-0.065
 course-0.064
ory-0.062
Token the
Token ID262
Feature activation-1.34e-02

Pos logit contributions
P+0.055
or+0.055
 course+0.055
ory+0.052
umper+0.051
Neg logit contributions
sing-0.045
 breathing-0.043
 cob-0.042
ses-0.040
 staying-0.040
Token w
Token ID266
Feature activation-3.06e-02

Pos logit contributions
or+0.023
P+0.022
umper+0.021
 course+0.020
ory+0.020
Neg logit contributions
 breathing-0.022
 cob-0.021
 staying-0.021
sing-0.021
 standing-0.021
Tokenaffle
Token ID30697
Feature activation-1.23e-01

Pos logit contributions
P+0.036
 course+0.035
or+0.034
+0.034
als+0.033
Neg logit contributions
 breathing-0.064
 staying-0.062
sing-0.061
 cob-0.060
 standing-0.059
Token and
Token ID290
Feature activation+8.33e-03

Pos logit contributions
or+0.219
P+0.213
umper+0.208
 course+0.205
ory+0.205
Neg logit contributions
 breathing-0.195
 staying-0.187
sing-0.186
 cob-0.185
 standing-0.183
Token said
Token ID531
Feature activation+1.63e-02

Pos logit contributions
sing+0.016
 breathing+0.016
 cob+0.016
 staying+0.016
 standing+0.015
Neg logit contributions
or-0.012
P-0.011
 course-0.011
ory-0.011
les-0.010
Token goodbye
Token ID24829
Feature activation+1.78e-02

Pos logit contributions
 breathing+0.026
sing+0.026
 cob+0.026
 staying+0.025
 standing+0.025
Neg logit contributions
or-0.028
P-0.027
umper-0.026
 course-0.026
ory-0.026
Token to
Token ID284
Feature activation-6.05e-03

Pos logit contributions
 breathing+0.032
sing+0.032
 cob+0.032
 staying+0.031
 standing+0.031
Neg logit contributions
or-0.027
P-0.026
ory-0.025
 course-0.025
les-0.024
Token the
Token ID262
Feature activation-1.42e-02

Pos logit contributions
or+0.011
P+0.011
 course+0.010
ory+0.010
les+0.010
Neg logit contributions
 breathing-0.009
sing-0.009
 cob-0.009
 staying-0.009
 standing-0.009
Token else
Token ID2073
Feature activation+1.22e-02

Pos logit contributions
or+0.005
 course+0.005
P+0.004
ory+0.004
ade+0.004
Neg logit contributions
sing-0.005
ses-0.005
 breathing-0.005
 cob-0.005
 staying-0.005
Token to
Token ID284
Feature activation-4.73e-03

Pos logit contributions
sing+0.024
 breathing+0.023
 cob+0.023
ses+0.022
 Kel+0.022
Neg logit contributions
or-0.017
 course-0.016
ory-0.016
P-0.016
les-0.015
Token do
Token ID466
Feature activation+4.15e-02

Pos logit contributions
or+0.008
P+0.008
umper+0.008
ory+0.008
les+0.007
Neg logit contributions
sing-0.008
 breathing-0.008
 cob-0.008
 staying-0.008
 standing-0.007
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
 breathing+0.067
 staying+0.066
 standing+0.066
sing+0.062
 cob+0.061
Neg logit contributions
P-0.070
umper-0.070
or-0.069
ory-0.069
 course-0.066
Token<|endoftext|>
Token ID50256
Feature activation-8.43e-03

Pos logit contributions
sing+0.024
 breathing+0.024
 cob+0.023
 staying+0.023
 standing+0.023
Neg logit contributions
or-0.019
P-0.018
 course-0.018
ory-0.017
umper-0.017
TokenP
Token ID47
Feature activation-2.01e-01

Pos logit contributions
or+0.014
ory+0.013
P+0.013
 course+0.013
umper+0.013
Neg logit contributions
 standing-0.014
 breathing-0.014
 staying-0.013
 Ann-0.013
 sitting-0.013
Tokenolly
Token ID5098
Feature activation-7.62e-02

Pos logit contributions
les+0.284
 trigger+0.278
uses+0.263
ted+0.261
or+0.259
Neg logit contributions
 standing-0.339
 Ann-0.338
sing-0.336
 cob-0.333
 staying-0.329
Token was
Token ID373
Feature activation-1.16e-02

Pos logit contributions
or+0.135
P+0.132
 course+0.126
umper+0.125
ory+0.125
Neg logit contributions
 breathing-0.118
 cob-0.116
 standing-0.116
 staying-0.115
sing-0.113
Token playing
Token ID2712
Feature activation+3.82e-02

Pos logit contributions
P+0.019
 course+0.015
umper+0.014
or+0.014
+0.014
Neg logit contributions
sing-0.020
ses-0.020
 cob-0.019
 breathing-0.019
 outage-0.019
Token at
Token ID379
Feature activation+1.02e-02

Pos logit contributions
sing+0.057
 breathing+0.056
 cob+0.054
 staying+0.054
 standing+0.053
Neg logit contributions
or-0.071
P-0.070
 course-0.068
ory-0.066
les-0.066
Token the
Token ID262
Feature activation-2.25e-02

Pos logit contributions
sing+0.021
 breathing+0.020
ses+0.019
 staying+0.019
 standing+0.019
Neg logit contributions
 course-0.014
P-0.014
or-0.014
umper-0.012
ory-0.012
Token She
Token ID1375
Feature activation-1.19e-02

Pos logit contributions
P+0.001
 course+0.001
or+0.001
les+0.001
als+0.001
Neg logit contributions
 breathing-0.001
sing-0.001
 staying-0.001
 standing-0.001
 cob-0.001
Token had
Token ID550
Feature activation-5.36e-03

Pos logit contributions
or+0.021
P+0.020
 course+0.019
ory+0.019
umper+0.019
Neg logit contributions
 breathing-0.020
sing-0.019
 cob-0.019
 staying-0.019
 standing-0.019
Token an
Token ID281
Feature activation-2.03e-02

Pos logit contributions
P+0.007
 course+0.006
Friend+0.005
+0.005
ons+0.005
Neg logit contributions
sing-0.012
 breathing-0.011
 staying-0.011
ses-0.011
 Kel-0.010
Token incredible
Token ID8082
Feature activation-8.36e-03

Pos logit contributions
P+0.041
 course+0.041
or+0.040
ory+0.039
umper+0.038
Neg logit contributions
 breathing-0.027
sing-0.025
 staying-0.025
 standing-0.024
 Kel-0.023
Token sp
Token ID599
Feature activation-8.06e-02

Pos logit contributions
 course+0.014
P+0.014
or+0.013
ory+0.013
+0.013
Neg logit contributions
 breathing-0.014
 staying-0.013
sing-0.013
 Kel-0.013
 standing-0.013
Tokenade
Token ID671
Feature activation-1.39e-01

Pos logit contributions
 course+0.082
umper+0.077
use+0.076
P+0.075
+0.075
Neg logit contributions
 cob-0.185
sing-0.183
 breathing-0.178
 standing-0.173
 staying-0.170
Token that
Token ID326
Feature activation-1.38e-02

Pos logit contributions
or+0.245
ory+0.236
P+0.236
umper+0.236
+0.231
Neg logit contributions
 cob-0.213
 breathing-0.213
sing-0.207
 staying-0.206
 standing-0.206
Token she
Token ID673
Feature activation-2.35e-03

Pos logit contributions
or+0.024
P+0.024
 course+0.023
ory+0.022
les+0.022
Neg logit contributions
sing-0.022
 breathing-0.022
 staying-0.021
 cob-0.021
 standing-0.021
Token loved
Token ID6151
Feature activation-4.54e-02

Pos logit contributions
or+0.005
 course+0.005
ory+0.005
umper+0.005
 trigger+0.005
Neg logit contributions
 cob-0.003
 breathing-0.003
 staying-0.003
 standing-0.002
sing-0.002
Token to
Token ID284
Feature activation-1.41e-02

Pos logit contributions
or+0.076
P+0.074
 course+0.072
ory+0.071
les+0.069
Neg logit contributions
sing-0.076
 breathing-0.076
 cob-0.073
 staying-0.073
 standing-0.072
Token use
Token ID779
Feature activation-4.11e-02

Pos logit contributions
or+0.026
P+0.025
umper+0.025
Captain+0.024
ory+0.024
Neg logit contributions
 cob-0.021
sing-0.021
 breathing-0.021
 staying-0.019
 standing-0.019
Token't
Token ID470
Feature activation-4.06e-02

Pos logit contributions
 breathing+0.071
sing+0.071
 staying+0.070
 standing+0.070
 cob+0.069
Neg logit contributions
or-0.024
P-0.022
ory-0.021
les-0.021
 course-0.020
Token very
Token ID845
Feature activation-2.94e-02

Pos logit contributions
P+0.063
ory+0.061
oni+0.060
 course+0.059
umper+0.058
Neg logit contributions
sing-0.071
ses-0.068
 Kel-0.066
 breathing-0.064
 cob-0.063
Token mild
Token ID11607
Feature activation-7.58e-03

Pos logit contributions
or+0.058
P+0.057
 course+0.055
ory+0.055
les+0.053
Neg logit contributions
 breathing-0.040
sing-0.040
 cob-0.039
 staying-0.038
 standing-0.038
Token like
Token ID588
Feature activation-1.25e-03

Pos logit contributions
or+0.011
 course+0.011
ory+0.011
les+0.010
P+0.010
Neg logit contributions
sing-0.014
 breathing-0.014
 cob-0.014
 staying-0.013
ses-0.013
Token the
Token ID262
Feature activation-4.27e-03

Pos logit contributions
 course+0.002
or+0.002
P+0.002
ory+0.002
les+0.002
Neg logit contributions
sing-0.002
 breathing-0.002
 cob-0.002
 staying-0.002
 standing-0.002
Token squash
Token ID34613
Feature activation-1.42e-01

Pos logit contributions
or+0.007
P+0.007
 course+0.007
ory+0.007
les+0.007
Neg logit contributions
 breathing-0.007
sing-0.007
 cob-0.007
 staying-0.007
 standing-0.007
Token his
Token ID465
Feature activation+1.11e-02

Pos logit contributions
or+0.209
P+0.205
 course+0.193
ory+0.192
umper+0.189
Neg logit contributions
 breathing-0.265
sing-0.263
 cob-0.258
 staying-0.256
 standing-0.253
Token mom
Token ID1995
Feature activation+3.32e-02

Pos logit contributions
 breathing+0.015
 staying+0.015
 cob+0.015
 Kel+0.014
 standing+0.014
Neg logit contributions
ory-0.021
or-0.021
 course-0.020
P-0.020
-0.019
Token usually
Token ID3221
Feature activation+1.17e-02

Pos logit contributions
sing+0.069
 breathing+0.068
 cob+0.067
 staying+0.066
 standing+0.065
Neg logit contributions
or-0.042
P-0.040
 course-0.040
ory-0.039
les-0.038
Token made
Token ID925
Feature activation+4.26e-03

Pos logit contributions
 breathing+0.023
sing+0.022
 staying+0.022
 cob+0.022
 standing+0.022
Neg logit contributions
or-0.017
P-0.016
 course-0.015
umper-0.015
ory-0.015
Token.
Token ID13
Feature activation+1.98e-02

Pos logit contributions
sing+0.008
 breathing+0.008
 cob+0.008
 staying+0.008
 standing+0.007
Neg logit contributions
or-0.006
P-0.006
 course-0.006
ory-0.006
les-0.006
Token was
Token ID373
Feature activation+1.89e-03

Pos logit contributions
or+0.065
P+0.064
ory+0.062
umper+0.061
 course+0.057
Neg logit contributions
 breathing-0.080
 staying-0.078
 cob-0.078
 standing-0.077
sing-0.076
Token so
Token ID523
Feature activation+1.02e-02

Pos logit contributions
sing+0.003
 breathing+0.003
 cob+0.003
 staying+0.003
ses+0.003
Neg logit contributions
or-0.003
P-0.003
 course-0.003
ory-0.003
les-0.003
Token special
Token ID2041
Feature activation-2.69e-02

Pos logit contributions
 breathing+0.012
sing+0.012
 cob+0.011
 staying+0.011
 standing+0.011
Neg logit contributions
or-0.022
P-0.022
 course-0.022
ory-0.021
umper-0.021
Token!"
Token ID2474
Feature activation-2.79e-02

Pos logit contributions
or+0.044
P+0.044
 course+0.042
ory+0.041
umper+0.041
Neg logit contributions
 breathing-0.046
 staying-0.044
sing-0.044
 cob-0.044
 standing-0.043
Token Mall
Token ID16618
Feature activation-5.39e-02

Pos logit contributions
or+0.047
 course+0.045
ede+0.045
use+0.045
umper+0.044
Neg logit contributions
 breathing-0.040
 standing-0.040
 staying-0.040
 Ann-0.039
 Mir-0.038
Tokenory
Token ID652
Feature activation-1.83e-01

Pos logit contributions
 course+0.045
 trigger+0.039
ede+0.033
+0.031
achers+0.031
Neg logit contributions
 staying-0.123
 standing-0.123
 sitting-0.121
 breathing-0.121
 kneeling-0.118
Token and
Token ID290
Feature activation+7.89e-03

Pos logit contributions
or+0.304
P+0.296
umper+0.293
 course+0.280
use+0.270
Neg logit contributions
sing-0.300
 breathing-0.296
 cob-0.296
 standing-0.291
 staying-0.284
Token her
Token ID607
Feature activation+4.94e-02

Pos logit contributions
 breathing+0.010
sing+0.010
 staying+0.010
 standing+0.010
 cob+0.010
Neg logit contributions
or-0.016
P-0.015
 course-0.015
les-0.015
umper-0.015
Token dad
Token ID9955
Feature activation+2.52e-03

Pos logit contributions
sing+0.050
 breathing+0.049
 staying+0.047
 cob+0.046
 standing+0.046
Neg logit contributions
or-0.117
P-0.115
 course-0.112
umper-0.110
les-0.109
Token smiled
Token ID13541
Feature activation-7.25e-03

Pos logit contributions
sing+0.005
 standing+0.004
 breathing+0.004
 cob+0.004
 staying+0.004
Neg logit contributions
P-0.004
or-0.004
umper-0.004
ory-0.003
-0.003
Token at
Token ID379
Feature activation+1.66e-02

Pos logit contributions
ory+0.009
or+0.009
les+0.009
P+0.009
 course+0.009
Neg logit contributions
 cob-0.015
sing-0.014
 breathing-0.014
 staying-0.014
 Kel-0.014