TOP ACTIVATIONS
MAX = 0.080

You sing so pretty ," said R ino . " Thank
" Hello there ," said Blue to a little mouse
's so y ummy ," replied Jerry . Twe
Hi , T una ," said Crab by . " What
to jump and play ," replied Joey . Twe
you for your help ," said the small bird . "
help you find it ," said Fox y .
asked , " Can you whistle ?" Lily replied , "
have an adventurous journey ," replied the little bird . "
" Please help me ," said the snail . " I
The kind pigeon and the new bird went to
" You 're welcome ," said the big bird . "
, a dove with a shiny be ak !" Sam said
make a snow man ," said Blue . Little Pink smiled
, I am Blue !" said the bird . " What
a happy bird named Raven . Raven had shiny black feathers
sunrise is so pretty ," said Blue . Blue
colors . It was a special flag , and everyone liked
. " Hi ," said Buzz . The cater
your pretty wings ." The butterfly replied , " Thank you

MIN ACTIVATIONS
MIN = -0.080

is very messy . There are clothes , papers , and
was a little girl called Fel icia . She loved to
Jake and his dog , F uzz y , went on
dog called Bos co . Bos co liked to go outside
. Suddenly she realized that F uzz y must have come
one day . Suddenly , F uzz y saw something up
walking in the park , L izzy saw something spark ly
day . One day , Fel icity saw that the chickens
friends . One day , L izzy was in her room
is very messy . There are toys everywhere . Mom says
very dirty and sm elly . I threw her away ."
izzy . One day , L izzy decided to take a
umbrella was too heavy for F ido . Suddenly
you cannot hide from me . You need a bath .
the wind . B izzy woke up and couldn
big pile of garbage on the street . He thought it
As they were walking , L izzy saw something black in
was a little spider named Sp ide y . He lived
J ill looked around for F uzz y , but he
saw a big factory near their home . They wanted to

INTERVAL 0.040 - 0.080
CONTAINS 0.487%

" Yes , really ," said Mom . " But first
Buzz y lived in a big yellow hive with lots of
. The sun 's warm rays made the rose feel happy
, hot red balloon ?" The hot red balloon was so
and felt free . The bird knew it could always depend

INTERVAL -0.000 - 0.040
CONTAINS 51.194%

I will help you . You are my friend ."
enjoyed their time on the island forever . <|endoftext|> One day
ily smiled and said , " Sure ! Let 's go
he loved her too . He said he would be more
I wanted to catch the butterfly , but it flew away

INTERVAL -0.040 - -0.000
CONTAINS 47.550%

, I 'm tired . Can we stop playing ?" Lily
unes . They do not see Mom and Dad . Mom
it . Lily tried first , but she could not catch
the big one . L ily feels sad and
she had been brave enough to reverse the lock on the

INTERVAL -0.080 - -0.040
CONTAINS 0.770%

and Sue liked to play in the woods . They liked
and his friends went to the tall closet room every day
it was a passage to somewhere new and exciting . Harry
together and stuck them over the crack .
and the sauce spilled on the floor . Lily felt bad
Token You
Token ID921
Feature activation+1.45e-02

Pos logit contributions
 desks+0.020
keeping+0.020
 Maybe+0.019
 generation+0.019
 computers+0.019
Neg logit contributions
 course-0.031
 thank-0.030
agging-0.029
br-0.028
 winners-0.028
Token sing
Token ID1702
Feature activation+4.26e-02

Pos logit contributions
agging+0.025
 course+0.023
is+0.022
aid+0.022
ier+0.022
Neg logit contributions
 Maybe-0.023
 generation-0.020
 desks-0.019
keeping-0.019
grand-0.019
Token so
Token ID523
Feature activation+1.51e-02

Pos logit contributions
agging+0.074
 course+0.073
ig+0.067
ier+0.067
 winners+0.067
Neg logit contributions
 Maybe-0.067
and-0.058
 generation-0.057
keeping-0.056
 desks-0.056
Token pretty
Token ID2495
Feature activation+3.61e-02

Pos logit contributions
agging+0.025
 course+0.024
 winners+0.022
br+0.022
ier+0.022
Neg logit contributions
 Maybe-0.025
 generation-0.022
and-0.022
 desks-0.021
grand-0.020
Token,"
Token ID553
Feature activation+4.44e-02

Pos logit contributions
agging+0.056
 course+0.055
 thank+0.051
 winners+0.050
br+0.049
Neg logit contributions
 Maybe-0.062
keeping-0.057
 desks-0.056
 generation-0.055
and-0.055
Token said
Token ID531
Feature activation+7.80e-02

Pos logit contributions
agging+0.087
ier+0.080
 winners+0.079
ags+0.074
br+0.074
Neg logit contributions
 Maybe-0.071
grand-0.064
 generation-0.060
 Grand-0.057
 alley-0.055
Token R
Token ID371
Feature activation-2.14e-02

Pos logit contributions
agging+0.210
 winners+0.189
ier+0.187
is+0.186
ig+0.184
Neg logit contributions
 Maybe-0.067
 Grand-0.062
 because-0.052
 Tim-0.051
grand-0.050
Tokenino
Token ID2879
Feature activation+4.93e-03

Pos logit contributions
 Maybe+0.042
 generation+0.038
keeping+0.038
and+0.038
grand+0.038
Neg logit contributions
 course-0.027
agging-0.027
 thank-0.025
 winners-0.025
 soaring-0.024
Token.
Token ID13
Feature activation+3.05e-02

Pos logit contributions
 course+0.010
agging+0.010
 thank+0.009
br+0.009
 winners+0.009
Neg logit contributions
 Maybe-0.007
keeping-0.006
 generation-0.006
 desks-0.006
and-0.005
Token "
Token ID366
Feature activation-6.96e-04

Pos logit contributions
agging+0.059
 course+0.058
 winners+0.055
ier+0.054
br+0.054
Neg logit contributions
 Maybe-0.043
keeping-0.036
 generation-0.036
 desks-0.034
and-0.034
TokenThank
Token ID10449
Feature activation+3.43e-02

Pos logit contributions
 Maybe+0.001
keeping+0.001
 desks+0.001
 generation+0.001
and+0.001
Neg logit contributions
 course-0.001
agging-0.001
 thank-0.001
br-0.001
 winners-0.001
Token
Token ID198
Feature activation+3.08e-02

Pos logit contributions
 course+0.013
agging+0.012
 thank+0.012
br+0.012
aid+0.011
Neg logit contributions
 Maybe-0.013
 desks-0.012
keeping-0.012
 generation-0.012
 computers-0.012
Token"
Token ID1
Feature activation+8.24e-04

Pos logit contributions
agging+0.043
 course+0.043
ier+0.041
 winners+0.041
 luck+0.041
Neg logit contributions
 Maybe-0.058
 generation-0.052
and-0.051
grand-0.050
keeping-0.050
TokenHello
Token ID15496
Feature activation+2.69e-02

Pos logit contributions
 course+0.001
agging+0.001
br+0.001
 thank+0.001
 winners+0.001
Neg logit contributions
 Maybe-0.001
 desks-0.001
keeping-0.001
 generation-0.001
and-0.001
Token there
Token ID612
Feature activation+2.56e-02

Pos logit contributions
agging+0.046
 course+0.041
aid+0.040
ier+0.039
ags+0.038
Neg logit contributions
 Maybe-0.049
 tonight-0.038
 Grand-0.038
 because-0.038
packs-0.037
Token,"
Token ID553
Feature activation+4.37e-02

Pos logit contributions
agging+0.040
aid+0.037
 course+0.036
ier+0.034
 winners+0.034
Neg logit contributions
 Maybe-0.046
 desks-0.039
 alley-0.038
keeping-0.038
and-0.038
Token said
Token ID531
Feature activation+7.57e-02

Pos logit contributions
agging+0.083
ier+0.078
oming+0.074
ig+0.073
 winners+0.073
Neg logit contributions
 Maybe-0.073
grand-0.060
 generation-0.058
 They-0.057
 cigarettes-0.056
Token Blue
Token ID4518
Feature activation-7.09e-03

Pos logit contributions
agging+0.174
is+0.167
ig+0.162
oming+0.157
bling+0.156
Neg logit contributions
 Maybe-0.098
 They-0.083
 Grand-0.081
 generation-0.077
 Who-0.075
Token to
Token ID284
Feature activation+2.49e-02

Pos logit contributions
 Maybe+0.008
keeping+0.007
and+0.007
 generation+0.007
 desks+0.006
Neg logit contributions
 course-0.015
agging-0.015
 winners-0.014
br-0.014
 thank-0.014
Token a
Token ID257
Feature activation+1.11e-02

Pos logit contributions
agging+0.047
ig+0.045
 course+0.045
br+0.044
ier+0.044
Neg logit contributions
 Maybe-0.036
 generation-0.033
keeping-0.031
 Who-0.030
grand-0.028
Token little
Token ID1310
Feature activation+3.64e-03

Pos logit contributions
agging+0.020
ier+0.018
 course+0.018
bled+0.018
agged+0.018
Neg logit contributions
 Maybe-0.018
 alley-0.016
 topic-0.016
 hallway-0.015
 Who-0.015
Token mouse
Token ID10211
Feature activation+6.61e-03

Pos logit contributions
agging+0.005
 course+0.004
 winners+0.004
br+0.004
 thank+0.004
Neg logit contributions
 Maybe-0.007
 generation-0.007
keeping-0.007
 desks-0.007
and-0.007
Token's
Token ID338
Feature activation+8.80e-03

Pos logit contributions
agging+0.023
 course+0.022
 winners+0.021
 thank+0.021
br+0.020
Neg logit contributions
 Maybe-0.012
 generation-0.010
keeping-0.010
 desks-0.010
grand-0.010
Token so
Token ID523
Feature activation+2.03e-03

Pos logit contributions
agging+0.015
 course+0.014
 winners+0.013
aid+0.013
br+0.013
Neg logit contributions
 Maybe-0.014
 generation-0.013
 cigarettes-0.013
 tonight-0.012
 because-0.012
Token y
Token ID331
Feature activation-8.51e-04

Pos logit contributions
agging+0.003
 course+0.003
 winners+0.002
 soaring+0.002
ier+0.002
Neg logit contributions
 Maybe-0.004
grand-0.004
and-0.004
 desks-0.003
 generation-0.003
Tokenummy
Token ID13513
Feature activation+5.82e-03

Pos logit contributions
 Maybe+0.002
 generation+0.001
keeping+0.001
 desks+0.001
and+0.001
Neg logit contributions
 course-0.001
agging-0.001
 thank-0.001
br-0.001
 winners-0.001
Token,"
Token ID553
Feature activation+5.80e-02

Pos logit contributions
 course+0.008
agging+0.007
 thank+0.007
br+0.007
 winners+0.007
Neg logit contributions
 Maybe-0.011
 generation-0.011
 desks-0.010
keeping-0.010
and-0.010
Token replied
Token ID8712
Feature activation+7.55e-02

Pos logit contributions
agging+0.095
ier+0.090
 winners+0.086
oming+0.085
ig+0.084
Neg logit contributions
 Maybe-0.108
grand-0.099
 generation-0.095
 Grand-0.090
 alley-0.089
Token Jerry
Token ID13075
Feature activation-6.13e-03

Pos logit contributions
agging+0.169
ig+0.161
br+0.160
ier+0.157
is+0.156
Neg logit contributions
 Maybe-0.082
 generation-0.072
 Grand-0.067
 Tim-0.067
 because-0.065
Token.
Token ID13
Feature activation+2.77e-02

Pos logit contributions
 Maybe+0.007
keeping+0.006
and+0.006
 generation+0.006
 desks+0.006
Neg logit contributions
 course-0.013
agging-0.013
br-0.012
 winners-0.012
 thank-0.012
Token
Token ID198
Feature activation+2.14e-02

Pos logit contributions
agging+0.050
 course+0.050
 winners+0.046
br+0.046
 thank+0.046
Neg logit contributions
 Maybe-0.041
 generation-0.035
keeping-0.035
and-0.035
 desks-0.034
Token
Token ID198
Feature activation+3.57e-02

Pos logit contributions
agging+0.044
 course+0.044
 winners+0.042
 thank+0.042
br+0.041
Neg logit contributions
 Maybe-0.026
keeping-0.022
 generation-0.022
and-0.021
 desks-0.021
TokenTwe
Token ID32665
Feature activation-2.83e-02

Pos logit contributions
agging+0.063
 luck+0.062
ier+0.061
ig+0.060
 winners+0.060
Neg logit contributions
 Maybe-0.057
grand-0.048
and-0.048
 generation-0.047
 alley-0.047
TokenHi
Token ID17250
Feature activation+3.70e-02

Pos logit contributions
agging+0.032
 winners+0.029
ier+0.029
ig+0.029
 thank+0.029
Neg logit contributions
 Maybe-0.024
 generation-0.021
keeping-0.020
grand-0.019
and-0.019
Token,
Token ID11
Feature activation+1.85e-02

Pos logit contributions
agging+0.058
 course+0.054
aid+0.051
 winners+0.050
ier+0.050
Neg logit contributions
 Maybe-0.068
and-0.056
 desks-0.056
keeping-0.056
 generation-0.055
Token T
Token ID309
Feature activation-7.01e-03

Pos logit contributions
agging+0.035
 course+0.032
aid+0.032
ier+0.032
br+0.031
Neg logit contributions
 Maybe-0.028
 Who-0.023
and-0.022
grand-0.021
 Grand-0.021
Tokenuna
Token ID9613
Feature activation+5.50e-03

Pos logit contributions
 Maybe+0.014
 desks+0.013
 generation+0.013
keeping+0.013
and+0.012
Neg logit contributions
agging-0.009
 course-0.009
 thank-0.009
br-0.008
 medal-0.008
Token,"
Token ID553
Feature activation+4.86e-02

Pos logit contributions
agging+0.009
 course+0.009
 winners+0.008
br+0.008
 thank+0.008
Neg logit contributions
 Maybe-0.009
keeping-0.008
 generation-0.008
grand-0.008
and-0.008
Token said
Token ID531
Feature activation+7.55e-02

Pos logit contributions
agging+0.097
ier+0.086
 winners+0.081
br+0.081
ags+0.080
Neg logit contributions
 Maybe-0.083
grand-0.072
 Grand-0.069
 They-0.064
 Who-0.064
Token Crab
Token ID32379
Feature activation+8.41e-03

Pos logit contributions
agging+0.186
ier+0.166
 winners+0.165
 course+0.163
br+0.162
Neg logit contributions
 Maybe-0.081
 Grand-0.077
 Jake-0.062
 M-0.062
 generation-0.062
Tokenby
Token ID1525
Feature activation+2.11e-02

Pos logit contributions
 course+0.018
agging+0.018
 winners+0.018
 soaring+0.017
br+0.017
Neg logit contributions
 Maybe-0.010
and-0.008
grand-0.008
bie-0.007
jo-0.007
Token.
Token ID13
Feature activation+3.64e-02

Pos logit contributions
 course+0.048
agging+0.047
br+0.045
 winners+0.045
 medal+0.044
Neg logit contributions
 Maybe-0.022
keeping-0.018
and-0.017
 generation-0.017
grand-0.016
Token "
Token ID366
Feature activation-1.44e-02

Pos logit contributions
agging+0.080
 course+0.079
 winners+0.074
ier+0.074
br+0.073
Neg logit contributions
 Maybe-0.047
 generation-0.033
and-0.033
 Who-0.033
keeping-0.033
TokenWhat
Token ID2061
Feature activation-9.81e-03

Pos logit contributions
 desks+0.023
 Maybe+0.021
 computers+0.021
packs+0.021
keeping+0.021
Neg logit contributions
 course-0.026
agging-0.023
 soaring-0.021
 thank-0.021
br-0.021
Token to
Token ID284
Feature activation+2.20e-02

Pos logit contributions
agging+0.019
 course+0.019
ier+0.018
 winners+0.017
 thank+0.017
Neg logit contributions
 Maybe-0.014
keeping-0.012
 generation-0.012
 desks-0.012
and-0.012
Token jump
Token ID4391
Feature activation+3.20e-02

Pos logit contributions
agging+0.040
 course+0.038
ier+0.038
 winners+0.037
br+0.036
Neg logit contributions
 Maybe-0.031
 generation-0.028
 desks-0.026
keeping-0.026
grand-0.026
Token and
Token ID290
Feature activation+2.63e-03

Pos logit contributions
agging+0.056
 course+0.054
 winners+0.052
 thank+0.051
ier+0.050
Neg logit contributions
 Maybe-0.049
and-0.042
keeping-0.042
 desks-0.042
 generation-0.041
Token play
Token ID711
Feature activation+4.05e-02

Pos logit contributions
agging+0.005
ier+0.005
 course+0.005
br+0.005
 winners+0.005
Neg logit contributions
 Maybe-0.004
 generation-0.003
keeping-0.003
 desks-0.003
and-0.003
Token,"
Token ID553
Feature activation+5.61e-02

Pos logit contributions
agging+0.074
ier+0.070
oming+0.067
 winners+0.067
aid+0.067
Neg logit contributions
 Maybe-0.059
and-0.051
 generation-0.050
keeping-0.049
 tonight-0.048
Token replied
Token ID8712
Feature activation+7.49e-02

Pos logit contributions
agging+0.087
br+0.084
 winners+0.084
ier+0.082
 winner+0.077
Neg logit contributions
 Maybe-0.105
grand-0.095
 generation-0.092
 because-0.091
 Grand-0.091
Token Joey
Token ID26154
Feature activation+3.90e-03

Pos logit contributions
agging+0.177
 winners+0.175
br+0.172
 winner+0.164
ier+0.163
Neg logit contributions
 Maybe-0.066
 Grand-0.060
 M-0.058
 because-0.058
 Daddy-0.055
Token.
Token ID13
Feature activation+3.30e-02

Pos logit contributions
 course+0.009
agging+0.009
br+0.009
 winners+0.009
 medal+0.008
Neg logit contributions
 Maybe-0.004
and-0.003
keeping-0.003
 generation-0.003
grand-0.003
Token
Token ID198
Feature activation+2.67e-02

Pos logit contributions
agging+0.060
 course+0.060
 winners+0.056
br+0.056
 thank+0.056
Neg logit contributions
 Maybe-0.048
 generation-0.041
keeping-0.041
and-0.040
 desks-0.040
Token
Token ID198
Feature activation+3.50e-02

Pos logit contributions
agging+0.056
 course+0.056
 winners+0.054
 thank+0.053
br+0.053
Neg logit contributions
 Maybe-0.031
and-0.026
 generation-0.026
keeping-0.026
 desks-0.025
TokenTwe
Token ID32665
Feature activation-2.30e-02

Pos logit contributions
agging+0.056
 course+0.054
 winners+0.053
 thank+0.052
ier+0.051
Neg logit contributions
 Maybe-0.060
and-0.053
 generation-0.053
keeping-0.052
grand-0.051
Token you
Token ID345
Feature activation+2.75e-02

Pos logit contributions
agging+0.131
ier+0.127
oming+0.123
ig+0.119
 winners+0.118
Neg logit contributions
 Maybe-0.043
and-0.036
 generation-0.033
keeping-0.032
grand-0.030
Token for
Token ID329
Feature activation+3.45e-02

Pos logit contributions
agging+0.044
 course+0.042
br+0.039
 soaring+0.039
 winners+0.038
Neg logit contributions
 Maybe-0.046
and-0.041
keeping-0.040
 desks-0.040
 Jamie-0.040
Token your
Token ID534
Feature activation+1.53e-02

Pos logit contributions
ement+0.077
oming+0.072
ier+0.072
agging+0.070
br+0.070
Neg logit contributions
keeping-0.034
 generation-0.033
 where-0.031
 tonight-0.031
 Help-0.029
Token help
Token ID1037
Feature activation+2.65e-02

Pos logit contributions
agging+0.023
ier+0.022
agged+0.022
bled+0.021
rob+0.021
Neg logit contributions
 Maybe-0.025
 generation-0.024
and-0.022
 topic-0.022
 computers-0.022
Token,"
Token ID553
Feature activation+5.14e-02

Pos logit contributions
agging+0.046
 course+0.043
br+0.042
 winners+0.041
oming+0.040
Neg logit contributions
 Maybe-0.040
keeping-0.038
and-0.036
 generation-0.036
grand-0.035
Token said
Token ID531
Feature activation+7.48e-02

Pos logit contributions
agging+0.115
ier+0.097
ags+0.095
br+0.094
 winners+0.094
Neg logit contributions
 Maybe-0.072
grand-0.059
 Grand-0.058
 They-0.058
 generation-0.057
Token the
Token ID262
Feature activation+3.99e-02

Pos logit contributions
agging+0.187
is+0.159
 winners+0.156
br+0.156
bling+0.154
Neg logit contributions
 Grand-0.077
 Maybe-0.075
 Jamie-0.067
 generation-0.067
 Jake-0.066
Token small
Token ID1402
Feature activation+3.74e-02

Pos logit contributions
ier+0.081
ises+0.079
agging+0.075
bly+0.073
bled+0.072
Neg logit contributions
 Maybe-0.045
 aid-0.039
 alley-0.038
 hall-0.037
 generation-0.036
Token bird
Token ID6512
Feature activation+4.24e-02

Pos logit contributions
bled+0.068
agging+0.067
bling+0.066
bly+0.065
ier+0.064
Neg logit contributions
 Maybe-0.054
 alley-0.048
 generation-0.047
 aid-0.047
 cigarettes-0.043
Token.
Token ID13
Feature activation+4.23e-02

Pos logit contributions
agging+0.106
 course+0.106
 winners+0.102
br+0.099
ags+0.098
Neg logit contributions
 Maybe-0.034
and-0.028
grand-0.026
 because-0.025
keeping-0.024
Token "
Token ID366
Feature activation+1.68e-02

Pos logit contributions
agging+0.078
 course+0.076
br+0.071
 winners+0.071
ier+0.071
Neg logit contributions
 Maybe-0.064
 generation-0.053
keeping-0.053
and-0.053
 desks-0.051
Token help
Token ID1037
Feature activation+6.59e-03

Pos logit contributions
 course+0.010
agging+0.010
 winners+0.009
br+0.009
 thank+0.009
Neg logit contributions
 Maybe-0.014
 generation-0.013
keeping-0.013
 desks-0.012
and-0.012
Token you
Token ID345
Feature activation-1.82e-02

Pos logit contributions
 course+0.011
agging+0.011
 thank+0.010
br+0.010
 winners+0.010
Neg logit contributions
 Maybe-0.011
keeping-0.010
 generation-0.010
 desks-0.010
and-0.009
Token find
Token ID1064
Feature activation+1.40e-02

Pos logit contributions
 Maybe+0.023
keeping+0.020
 generation+0.020
 desks+0.020
and+0.019
Neg logit contributions
 course-0.036
agging-0.036
 thank-0.034
br-0.033
 winners-0.033
Token it
Token ID340
Feature activation+8.15e-03

Pos logit contributions
agging+0.024
 course+0.023
 winners+0.022
br+0.022
 thank+0.021
Neg logit contributions
 Maybe-0.022
 generation-0.020
keeping-0.019
 desks-0.019
and-0.018
Token,"
Token ID553
Feature activation+4.99e-02

Pos logit contributions
 course+0.012
agging+0.012
 thank+0.011
 winners+0.011
br+0.011
Neg logit contributions
 Maybe-0.015
keeping-0.013
 generation-0.013
and-0.013
 desks-0.013
Token said
Token ID531
Feature activation+7.27e-02

Pos logit contributions
agging+0.114
ier+0.101
 winners+0.098
ags+0.094
oming+0.093
Neg logit contributions
 Maybe-0.073
grand-0.061
 They-0.060
 Who-0.058
 generation-0.056
Token Fox
Token ID5426
Feature activation-2.49e-02

Pos logit contributions
agging+0.181
 winners+0.167
is+0.161
ig+0.159
ier+0.158
Neg logit contributions
 Maybe-0.073
 generation-0.063
 Jones-0.060
 Grand-0.059
 Who-0.057
Tokeny
Token ID88
Feature activation+2.12e-03

Pos logit contributions
 Maybe+0.024
and+0.021
keeping+0.020
 generation+0.019
places+0.018
Neg logit contributions
 course-0.058
agging-0.056
 thank-0.055
 winners-0.055
br-0.054
Token.
Token ID13
Feature activation+2.54e-02

Pos logit contributions
 course+0.005
agging+0.005
 winners+0.004
br+0.004
 thank+0.004
Neg logit contributions
 Maybe-0.002
and-0.002
keeping-0.002
 generation-0.002
 desks-0.002
Token
Token ID198
Feature activation+2.45e-02

Pos logit contributions
agging+0.057
 course+0.056
 winners+0.053
ier+0.052
 medal+0.052
Neg logit contributions
 Maybe-0.031
 generation-0.023
and-0.023
keeping-0.022
places-0.021
Token
Token ID198
Feature activation+3.15e-02

Pos logit contributions
agging+0.057
 course+0.056
 winners+0.055
ier+0.054
 luck+0.054
Neg logit contributions
 Maybe-0.025
and-0.020
 generation-0.019
keeping-0.018
grand-0.018
Token asked
Token ID1965
Feature activation+2.54e-02

Pos logit contributions
agging+0.094
 winners+0.094
ags+0.092
 luck+0.087
ier+0.087
Neg logit contributions
 Maybe-0.048
 because-0.039
 wanted-0.039
grand-0.038
 generation-0.036
Token,
Token ID11
Feature activation+2.74e-02

Pos logit contributions
agging+0.043
 course+0.041
 winners+0.039
br+0.039
 thank+0.038
Neg logit contributions
 Maybe-0.042
 generation-0.037
keeping-0.036
 desks-0.036
and-0.036
Token "
Token ID366
Feature activation+1.56e-02

Pos logit contributions
agging+0.062
 course+0.061
 winners+0.058
br+0.057
ig+0.057
Neg logit contributions
 Maybe-0.032
and-0.023
 Who-0.022
 generation-0.022
 because-0.021
TokenCan
Token ID6090
Feature activation+8.83e-03

Pos logit contributions
agging+0.035
 winners+0.032
 thank+0.032
ier+0.032
 luck+0.031
Neg logit contributions
 Maybe-0.018
 generation-0.015
and-0.014
keeping-0.014
grand-0.014
Token you
Token ID345
Feature activation+2.54e-02

Pos logit contributions
agging+0.018
 course+0.017
ier+0.017
 winners+0.016
br+0.016
Neg logit contributions
 Maybe-0.012
 generation-0.010
grand-0.010
keeping-0.009
 desks-0.009
Token whistle
Token ID16121
Feature activation+7.27e-02

Pos logit contributions
agging+0.046
ier+0.044
bled+0.043
oming+0.042
ig+0.042
Neg logit contributions
 Maybe-0.037
 sneak-0.033
 generation-0.033
 because-0.031
grand-0.031
Token?"
Token ID1701
Feature activation+4.31e-02

Pos logit contributions
agging+0.142
ier+0.136
oming+0.133
 course+0.133
ig+0.132
Neg logit contributions
 Maybe-0.094
 because-0.083
 tonight-0.078
and-0.074
 School-0.067
Token Lily
Token ID20037
Feature activation+2.62e-02

Pos logit contributions
agging+0.087
 course+0.086
 winners+0.081
br+0.081
 thank+0.080
Neg logit contributions
 Maybe-0.055
keeping-0.047
 generation-0.047
 desks-0.045
and-0.045
Token replied
Token ID8712
Feature activation+5.36e-02

Pos logit contributions
agging+0.047
 winners+0.044
br+0.042
ier+0.042
 course+0.042
Neg logit contributions
 Maybe-0.044
 generation-0.035
 Jamie-0.035
grand-0.035
 because-0.034
Token,
Token ID11
Feature activation+3.69e-02

Pos logit contributions
 course+0.096
agging+0.095
br+0.095
aid+0.095
 winners+0.095
Neg logit contributions
 Maybe-0.073
 because-0.070
and-0.064
care-0.058
 Jamie-0.058
Token "
Token ID366
Feature activation+1.72e-02

Pos logit contributions
agging+0.082
 course+0.081
br+0.079
 winners+0.079
aid+0.078
Neg logit contributions
 Maybe-0.041
 generation-0.030
and-0.029
 because-0.029
 Jamie-0.028
Token have
Token ID423
Feature activation+2.52e-02

Pos logit contributions
agging+0.008
 course+0.007
ier+0.007
br+0.007
aid+0.007
Neg logit contributions
 Maybe-0.006
 generation-0.005
keeping-0.005
 desks-0.005
 Who-0.005
Token an
Token ID281
Feature activation+2.00e-02

Pos logit contributions
agging+0.041
ier+0.038
 course+0.037
aid+0.036
oming+0.035
Neg logit contributions
 Maybe-0.042
 generation-0.038
keeping-0.037
and-0.037
 desks-0.037
Token adventurous
Token ID39770
Feature activation+3.98e-02

Pos logit contributions
br+0.029
agging+0.028
 course+0.028
 winners+0.027
ier+0.027
Neg logit contributions
 Maybe-0.034
 generation-0.033
grand-0.031
 alley-0.031
 desks-0.030
Token journey
Token ID7002
Feature activation+2.56e-02

Pos logit contributions
agging+0.071
ier+0.069
 course+0.067
bled+0.067
aid+0.066
Neg logit contributions
 Maybe-0.057
 generation-0.055
 alley-0.050
keeping-0.049
 hallway-0.049
Token,"
Token ID553
Feature activation+4.94e-02

Pos logit contributions
 course+0.045
agging+0.045
 thank+0.041
br+0.040
 winners+0.040
Neg logit contributions
 Maybe-0.039
keeping-0.035
 generation-0.034
and-0.034
 desks-0.034
Token replied
Token ID8712
Feature activation+7.27e-02

Pos logit contributions
agging+0.100
ier+0.093
 winners+0.087
br+0.086
oming+0.084
Neg logit contributions
 Maybe-0.079
grand-0.066
 They-0.064
 generation-0.064
 Grand-0.063
Token the
Token ID262
Feature activation+3.15e-02

Pos logit contributions
agging+0.181
ier+0.165
 winners+0.159
is+0.159
br+0.158
Neg logit contributions
 Maybe-0.072
 Grand-0.068
 M-0.065
 Jake-0.061
 Tim-0.058
Token little
Token ID1310
Feature activation+3.57e-02

Pos logit contributions
agging+0.043
ier+0.039
aid+0.038
 course+0.038
ises+0.038
Neg logit contributions
 Maybe-0.059
 generation-0.056
 alley-0.053
 desks-0.050
keeping-0.050
Token bird
Token ID6512
Feature activation+1.52e-02

Pos logit contributions
agging+0.058
bling+0.051
bled+0.051
ier+0.051
aid+0.050
Neg logit contributions
 Maybe-0.064
 generation-0.055
 alley-0.054
keeping-0.051
 desks-0.050
Token.
Token ID13
Feature activation+4.54e-02

Pos logit contributions
 course+0.033
agging+0.032
 winners+0.031
br+0.030
 thank+0.030
Neg logit contributions
 Maybe-0.018
and-0.015
keeping-0.015
 generation-0.015
grand-0.014
Token "
Token ID366
Feature activation-6.44e-03

Pos logit contributions
agging+0.093
 course+0.092
 winners+0.086
br+0.086
 thank+0.085
Neg logit contributions
 Maybe-0.059
 generation-0.047
keeping-0.047
and-0.047
 desks-0.046
Token"
Token ID1
Feature activation-1.12e-02

Pos logit contributions
agging+0.036
 course+0.035
 winners+0.034
ier+0.034
 luck+0.033
Neg logit contributions
 Maybe-0.050
 generation-0.046
grand-0.044
 alley-0.043
keeping-0.043
TokenPlease
Token ID5492
Feature activation+7.87e-03

Pos logit contributions
 Maybe+0.017
 desks+0.016
keeping+0.016
and+0.016
 generation+0.015
Neg logit contributions
 course-0.020
agging-0.018
br-0.017
 thank-0.017
aid-0.017
Token help
Token ID1037
Feature activation+1.27e-02

Pos logit contributions
agging+0.011
 course+0.011
 winners+0.010
br+0.010
ier+0.010
Neg logit contributions
 Maybe-0.015
 generation-0.013
keeping-0.013
 desks-0.013
grand-0.013
Token me
Token ID502
Feature activation-2.07e-03

Pos logit contributions
agging+0.020
 course+0.019
 winners+0.018
br+0.018
ier+0.018
Neg logit contributions
 Maybe-0.022
keeping-0.019
 desks-0.019
and-0.019
 generation-0.019
Token,"
Token ID553
Feature activation+3.95e-02

Pos logit contributions
 Maybe+0.003
keeping+0.003
and+0.003
 because+0.003
 generation+0.003
Neg logit contributions
agging-0.003
 course-0.003
 winners-0.003
br-0.003
agged-0.003
Token said
Token ID531
Feature activation+7.16e-02

Pos logit contributions
agging+0.077
ier+0.071
ig+0.068
 winners+0.067
br+0.067
Neg logit contributions
 Maybe-0.064
grand-0.052
 generation-0.051
 They-0.050
 cigarettes-0.049
Token the
Token ID262
Feature activation+1.61e-02

Pos logit contributions
agging+0.160
ig+0.151
is+0.145
inking+0.142
ier+0.141
Neg logit contributions
 Maybe-0.092
 Grand-0.080
 generation-0.079
 They-0.078
 Who-0.074
Token snail
Token ID47374
Feature activation-6.44e-03

Pos logit contributions
 course+0.027
agging+0.025
 seventeen+0.024
ier+0.024
 nineteen+0.024
Neg logit contributions
 Maybe-0.026
 alley-0.023
keeping-0.022
 topic-0.022
 desks-0.022
Token.
Token ID13
Feature activation+2.43e-02

Pos logit contributions
 Maybe+0.009
 generation+0.008
keeping+0.008
 desks+0.008
and+0.008
Neg logit contributions
 course-0.012
agging-0.012
 thank-0.011
 winners-0.011
br-0.011
Token "
Token ID366
Feature activation-3.56e-02

Pos logit contributions
agging+0.050
 course+0.048
ier+0.046
 winners+0.046
 luck+0.046
Neg logit contributions
 Maybe-0.034
 generation-0.025
keeping-0.025
 They-0.024
and-0.024
TokenI
Token ID40
Feature activation-1.43e-02

Pos logit contributions
 desks+0.059
 computers+0.055
packs+0.054
keeping+0.053
 Maybe+0.052
Neg logit contributions
 course-0.057
aid-0.056
agging-0.050
is-0.047
br-0.047
Token
Token ID198
Feature activation+2.05e-02

Pos logit contributions
agging+0.089
 course+0.089
 thank+0.084
 winners+0.084
br+0.083
Neg logit contributions
 Maybe-0.044
 generation-0.036
keeping-0.036
 desks-0.035
and-0.034
Token
Token ID198
Feature activation+2.92e-02

Pos logit contributions
 course+0.039
agging+0.038
 thank+0.036
br+0.035
 winners+0.034
Neg logit contributions
 Maybe-0.028
keeping-0.026
 generation-0.025
 desks-0.025
grand-0.023
TokenThe
Token ID464
Feature activation+5.06e-02

Pos logit contributions
agging+0.043
 course+0.042
 thank+0.039
 winners+0.039
br+0.038
Neg logit contributions
 Maybe-0.053
 generation-0.048
 desks-0.047
keeping-0.047
and-0.047
Token kind
Token ID1611
Feature activation+5.34e-02

Pos logit contributions
ier+0.094
ig+0.091
agging+0.089
ises+0.087
odic+0.085
Neg logit contributions
 Maybe-0.066
 alley-0.062
 generation-0.060
grand-0.056
 wanted-0.056
Token pigeon
Token ID46387
Feature activation+3.36e-02

Pos logit contributions
agging+0.084
 course+0.083
br+0.081
ises+0.080
ig+0.078
Neg logit contributions
 Maybe-0.089
 generation-0.077
grand-0.076
 alley-0.075
 They-0.075
Token and
Token ID290
Feature activation+7.15e-02

Pos logit contributions
agging+0.072
 winners+0.065
 course+0.065
oming+0.063
br+0.063
Neg logit contributions
 Maybe-0.044
 generation-0.032
and-0.032
grand-0.032
places-0.032
Token the
Token ID262
Feature activation+5.06e-02

Pos logit contributions
agging+0.172
ier+0.166
oming+0.166
ement+0.162
is+0.158
Neg logit contributions
 They-0.063
 Maybe-0.063
 generation-0.063
 wanted-0.060
 because-0.055
Token new
Token ID649
Feature activation+4.63e-02

Pos logit contributions
ig+0.095
inkle+0.094
agging+0.094
ier+0.094
ises+0.092
Neg logit contributions
 Maybe-0.066
 alley-0.059
 wanted-0.058
 generation-0.057
 topic-0.057
Token bird
Token ID6512
Feature activation+2.82e-02

Pos logit contributions
agging+0.068
 course+0.066
br+0.065
oming+0.063
ier+0.062
Neg logit contributions
 Maybe-0.085
 generation-0.075
keeping-0.074
packs-0.074
and-0.073
Token went
Token ID1816
Feature activation+2.53e-02

Pos logit contributions
agging+0.055
 course+0.051
 winners+0.050
br+0.048
oming+0.048
Neg logit contributions
 Maybe-0.038
packs-0.032
 generation-0.032
places-0.031
and-0.031
Token to
Token ID284
Feature activation+1.39e-02

Pos logit contributions
 course+0.048
agging+0.044
 thank+0.044
 eagle+0.044
aid+0.044
Neg logit contributions
 Maybe-0.033
and-0.031
packs-0.030
keeping-0.030
 desks-0.030
Token "
Token ID366
Feature activation+1.68e-02

Pos logit contributions
agging+0.078
 course+0.076
br+0.071
 winners+0.071
ier+0.071
Neg logit contributions
 Maybe-0.064
 generation-0.053
keeping-0.053
and-0.053
 desks-0.051
TokenYou
Token ID1639
Feature activation+3.64e-02

Pos logit contributions
agging+0.031
 course+0.030
 thank+0.029
br+0.028
 winners+0.028
Neg logit contributions
 Maybe-0.024
 generation-0.021
and-0.021
keeping-0.021
grand-0.020
Token're
Token ID821
Feature activation+3.60e-02

Pos logit contributions
agging+0.080
is+0.069
oming+0.068
 course+0.067
 gu+0.067
Neg logit contributions
 Maybe-0.044
 tonight-0.034
 generation-0.033
days-0.033
grand-0.033
Token welcome
Token ID7062
Feature activation+4.07e-02

Pos logit contributions
agging+0.070
is+0.068
aid+0.064
br+0.064
ises+0.063
Neg logit contributions
 because-0.046
 Maybe-0.043
 Grand-0.041
 tonight-0.040
 Jamie-0.039
Token,"
Token ID553
Feature activation+5.52e-02

Pos logit contributions
agging+0.067
 course+0.067
br+0.063
aid+0.060
 winners+0.060
Neg logit contributions
 Maybe-0.066
keeping-0.060
 generation-0.056
and-0.056
grand-0.055
Token said
Token ID531
Feature activation+7.11e-02

Pos logit contributions
agging+0.117
ier+0.100
 winners+0.098
br+0.097
bling+0.096
Neg logit contributions
 Maybe-0.080
 They-0.067
grand-0.065
keeping-0.065
 generation-0.064
Token the
Token ID262
Feature activation+3.92e-02

Pos logit contributions
agging+0.168
br+0.147
is+0.141
 winners+0.140
bling+0.140
Neg logit contributions
 Maybe-0.080
 Grand-0.072
 generation-0.068
 Jamie-0.065
 Jake-0.065
Token big
Token ID1263
Feature activation+3.94e-02

Pos logit contributions
ier+0.079
ises+0.077
agging+0.074
bly+0.071
bled+0.071
Neg logit contributions
 Maybe-0.046
 alley-0.039
 aid-0.039
 hall-0.037
 generation-0.037
Token bird
Token ID6512
Feature activation+4.27e-02

Pos logit contributions
bled+0.069
bly+0.069
agging+0.068
bling+0.068
odic+0.065
Neg logit contributions
 Maybe-0.062
 alley-0.055
 generation-0.050
 cigarettes-0.049
 knocks-0.049
Token.
Token ID13
Feature activation+3.79e-02

Pos logit contributions
agging+0.105
 course+0.103
 winners+0.101
br+0.098
ags+0.096
Neg logit contributions
 Maybe-0.036
and-0.029
 because-0.027
grand-0.026
keeping-0.025
Token "
Token ID366
Feature activation+1.33e-02

Pos logit contributions
agging+0.071
 course+0.069
br+0.065
 winners+0.065
ier+0.064
Neg logit contributions
 Maybe-0.057
 generation-0.047
keeping-0.047
and-0.046
 desks-0.044
Token,
Token ID11
Feature activation-1.46e-02

Pos logit contributions
agging+0.003
 course+0.003
 winners+0.003
br+0.003
 thank+0.003
Neg logit contributions
 Maybe-0.005
 desks-0.005
 generation-0.005
keeping-0.005
and-0.005
Token a
Token ID257
Feature activation+3.79e-02

Pos logit contributions
 Maybe+0.024
keeping+0.022
 generation+0.022
 desks+0.021
and+0.021
Neg logit contributions
 course-0.024
agging-0.023
 thank-0.023
br-0.022
 winners-0.022
Token dove
Token ID38862
Feature activation+2.80e-02

Pos logit contributions
agging+0.060
agged+0.056
ags+0.056
bled+0.054
 course+0.051
Neg logit contributions
 Maybe-0.063
 generation-0.062
grand-0.059
 alley-0.056
 topic-0.056
Token with
Token ID351
Feature activation+3.25e-02

Pos logit contributions
agging+0.052
 course+0.050
 winners+0.047
 thank+0.045
br+0.045
Neg logit contributions
 Maybe-0.040
 generation-0.035
keeping-0.035
grand-0.034
and-0.033
Token a
Token ID257
Feature activation+4.66e-02

Pos logit contributions
agging+0.061
ier+0.056
 winners+0.054
 course+0.054
aid+0.053
Neg logit contributions
 Maybe-0.044
 generation-0.042
 desks-0.038
keeping-0.038
grand-0.037
Token shiny
Token ID22441
Feature activation+7.10e-02

Pos logit contributions
agging+0.073
 course+0.072
agged+0.071
bled+0.069
ier+0.067
Neg logit contributions
 Maybe-0.073
 generation-0.070
 desk-0.066
 topic-0.064
 aid-0.064
Token be
Token ID307
Feature activation+3.44e-02

Pos logit contributions
 winners+0.107
agging+0.105
bling+0.105
 course+0.104
icious+0.098
Neg logit contributions
 generation-0.103
 Maybe-0.100
 desk-0.096
 because-0.091
 cigarettes-0.090
Tokenak
Token ID461
Feature activation+4.25e-02

Pos logit contributions
 course+0.048
 thank+0.044
agging+0.043
 medal+0.043
 winners+0.043
Neg logit contributions
keeping-0.059
grand-0.059
and-0.057
packs-0.057
 generation-0.057
Token!"
Token ID2474
Feature activation+2.46e-03

Pos logit contributions
 course+0.082
agging+0.081
 winners+0.076
 thank+0.076
aid+0.074
Neg logit contributions
 Maybe-0.053
keeping-0.050
grand-0.046
and-0.046
 generation-0.046
Token Sam
Token ID3409
Feature activation+5.24e-03

Pos logit contributions
 thank+0.005
 course+0.005
br+0.004
 best+0.004
 medal+0.004
Neg logit contributions
keeping-0.004
 generation-0.004
and-0.003
places-0.003
uries-0.003
Token said
Token ID531
Feature activation+3.04e-02

Pos logit contributions
 course+0.007
agging+0.007
 thank+0.007
br+0.007
 winners+0.007
Neg logit contributions
 Maybe-0.010
keeping-0.009
 generation-0.009
 desks-0.009
and-0.009
Token make
Token ID787
Feature activation+3.05e-02

Pos logit contributions
 course+0.006
agging+0.005
 thank+0.005
br+0.005
 winners+0.005
Neg logit contributions
 Maybe-0.005
keeping-0.005
 generation-0.005
 desks-0.005
and-0.005
Token a
Token ID257
Feature activation+8.35e-03

Pos logit contributions
agging+0.053
 course+0.051
 winners+0.048
 thank+0.047
br+0.047
Neg logit contributions
 Maybe-0.047
 generation-0.042
keeping-0.042
 desks-0.042
and-0.040
Token snow
Token ID6729
Feature activation+2.41e-02

Pos logit contributions
agging+0.014
 course+0.014
ier+0.013
agged+0.013
bled+0.012
Neg logit contributions
 Maybe-0.013
 generation-0.012
 desks-0.012
 laptop-0.012
 alley-0.011
Tokenman
Token ID805
Feature activation+2.84e-02

Pos logit contributions
agging+0.039
 course+0.039
 winners+0.035
 thank+0.035
br+0.034
Neg logit contributions
 Maybe-0.040
keeping-0.036
 generation-0.034
and-0.034
 desks-0.034
Token,"
Token ID553
Feature activation+3.93e-02

Pos logit contributions
agging+0.047
 course+0.046
br+0.042
 winners+0.040
 thank+0.039
Neg logit contributions
 Maybe-0.046
keeping-0.041
grand-0.041
 generation-0.041
and-0.040
Token said
Token ID531
Feature activation+7.09e-02

Pos logit contributions
agging+0.077
ier+0.068
 winners+0.066
ig+0.065
br+0.065
Neg logit contributions
 Maybe-0.063
grand-0.053
 generation-0.051
keeping-0.051
 They-0.050
Token Blue
Token ID4518
Feature activation-1.79e-02

Pos logit contributions
agging+0.148
ig+0.135
 winners+0.131
is+0.129
inking+0.128
Neg logit contributions
 Maybe-0.096
 Grand-0.083
 generation-0.083
 They-0.080
 Daddy-0.076
Token.
Token ID13
Feature activation+2.74e-02

Pos logit contributions
 Maybe+0.024
keeping+0.021
 generation+0.020
and+0.020
 desks+0.020
Neg logit contributions
 course-0.035
agging-0.034
br-0.032
 thank-0.032
 winners-0.032
Token Little
Token ID7703
Feature activation+1.62e-02

Pos logit contributions
agging+0.050
 course+0.050
 winners+0.046
ier+0.045
br+0.045
Neg logit contributions
 Maybe-0.043
 generation-0.036
keeping-0.035
and-0.034
 desks-0.033
Token Pink
Token ID14657
Feature activation-9.82e-03

Pos logit contributions
agging+0.028
ier+0.025
 winners+0.025
br+0.025
aid+0.024
Neg logit contributions
 Maybe-0.027
keeping-0.024
 generation-0.022
 alley-0.021
 Who-0.021
Token smiled
Token ID13541
Feature activation+3.61e-03

Pos logit contributions
 Maybe+0.013
keeping+0.010
places+0.010
 generation+0.010
and+0.010
Neg logit contributions
agging-0.020
 course-0.020
 winners-0.019
 thank-0.018
 soaring-0.018
Token,
Token ID11
Feature activation+2.36e-02

Pos logit contributions
agging+0.087
aid+0.082
ier+0.079
bled+0.078
bling+0.078
Neg logit contributions
 Maybe-0.078
 tonight-0.063
 because-0.062
 generation-0.060
 They-0.059
Token I
Token ID314
Feature activation+2.15e-03

Pos logit contributions
agging+0.049
ig+0.046
ier+0.045
agged+0.044
inking+0.043
Neg logit contributions
 Maybe-0.034
 Who-0.027
 Grand-0.026
 because-0.026
 They-0.025
Token am
Token ID716
Feature activation+1.31e-02

Pos logit contributions
 course+0.004
agging+0.004
 thank+0.004
br+0.004
 winners+0.004
Neg logit contributions
 Maybe-0.003
keeping-0.003
 desks-0.003
 generation-0.003
and-0.003
Token Blue
Token ID4518
Feature activation+1.58e-02

Pos logit contributions
agging+0.025
 course+0.024
aid+0.024
ier+0.024
ig+0.023
Neg logit contributions
 Maybe-0.017
 generation-0.015
keeping-0.014
 Jamie-0.014
 desks-0.014
Token!"
Token ID2474
Feature activation+3.96e-02

Pos logit contributions
 course+0.029
agging+0.029
br+0.027
 thank+0.027
aid+0.026
Neg logit contributions
 Maybe-0.023
keeping-0.020
 generation-0.019
and-0.019
places-0.018
Token said
Token ID531
Feature activation+7.07e-02

Pos logit contributions
agging+0.062
 course+0.057
ier+0.056
 winners+0.056
br+0.054
Neg logit contributions
 Maybe-0.072
 generation-0.063
grand-0.062
keeping-0.061
 desks-0.060
Token the
Token ID262
Feature activation+3.96e-02

Pos logit contributions
ig+0.147
agging+0.146
is+0.134
bling+0.130
 course+0.128
Neg logit contributions
 Maybe-0.096
 They-0.083
 generation-0.082
 Grand-0.078
 because-0.078
Token bird
Token ID6512
Feature activation+2.14e-02

Pos logit contributions
agging+0.068
ier+0.067
oming+0.063
ises+0.062
agged+0.061
Neg logit contributions
 Maybe-0.060
 generation-0.055
grand-0.051
 alley-0.051
 hall-0.050
Token.
Token ID13
Feature activation+2.97e-02

Pos logit contributions
 course+0.049
agging+0.048
 winners+0.047
br+0.046
 thank+0.045
Neg logit contributions
 Maybe-0.022
and-0.018
 generation-0.017
grand-0.016
keeping-0.015
Token "
Token ID366
Feature activation-4.15e-03

Pos logit contributions
agging+0.066
ig+0.063
 course+0.062
ier+0.062
 luck+0.062
Neg logit contributions
 Maybe-0.041
 They-0.031
 Who-0.028
 generation-0.026
 Sure-0.025
TokenWhat
Token ID2061
Feature activation-2.56e-03

Pos logit contributions
 Maybe+0.006
 desks+0.005
keeping+0.005
 generation+0.005
 computers+0.005
Neg logit contributions
 course-0.008
agging-0.008
br-0.007
 thank-0.007
 winners-0.007
Token a
Token ID257
Feature activation+1.17e-02

Pos logit contributions
 Maybe+0.019
keeping+0.018
 desks+0.017
places+0.017
grand+0.017
Neg logit contributions
 course-0.013
agging-0.012
 best-0.011
 soaring-0.011
 thank-0.011
Token happy
Token ID3772
Feature activation+2.42e-02

Pos logit contributions
agging+0.020
 course+0.020
ier+0.018
 winners+0.018
 soaring+0.018
Neg logit contributions
 Maybe-0.019
 desks-0.016
 generation-0.016
 alley-0.015
grand-0.015
Token bird
Token ID6512
Feature activation+1.81e-02

Pos logit contributions
 course+0.033
 thank+0.032
agging+0.032
br+0.030
 medal+0.030
Neg logit contributions
 Maybe-0.046
keeping-0.043
and-0.041
 desks-0.041
 generation-0.041
Token named
Token ID3706
Feature activation+1.22e-02

Pos logit contributions
 course+0.032
agging+0.031
 thank+0.029
br+0.029
 winners+0.028
Neg logit contributions
 Maybe-0.028
keeping-0.025
 desks-0.025
 generation-0.025
and-0.024
Token Raven
Token ID12552
Feature activation+3.85e-03

Pos logit contributions
agging+0.025
 course+0.025
 winners+0.023
br+0.023
 thank+0.023
Neg logit contributions
 Maybe-0.015
 generation-0.013
keeping-0.013
 desks-0.013
and-0.013
Token.
Token ID13
Feature activation+7.07e-02

Pos logit contributions
 course+0.008
agging+0.008
 thank+0.007
 winners+0.007
br+0.007
Neg logit contributions
 Maybe-0.005
 generation-0.004
keeping-0.004
 desks-0.004
and-0.004
Token Raven
Token ID12552
Feature activation+3.35e-02

Pos logit contributions
agging+0.103
 course+0.099
ier+0.098
ig+0.096
is+0.096
Neg logit contributions
 Maybe-0.141
 generation-0.118
 They-0.115
 Who-0.111
grand-0.110
Token had
Token ID550
Feature activation+2.95e-02

Pos logit contributions
agging+0.073
 course+0.071
 winners+0.070
 thank+0.066
oming+0.066
Neg logit contributions
 Maybe-0.038
grand-0.032
and-0.030
 generation-0.029
days-0.027
Token shiny
Token ID22441
Feature activation+5.58e-02

Pos logit contributions
agging+0.062
ier+0.062
 course+0.062
ises+0.061
ement+0.059
Neg logit contributions
 Maybe-0.032
and-0.027
 generation-0.027
keeping-0.027
grand-0.026
Token black
Token ID2042
Feature activation+2.96e-02

Pos logit contributions
 winners+0.083
bling+0.080
ier+0.078
 course+0.076
 victorious+0.075
Neg logit contributions
 Maybe-0.088
 cigarettes-0.088
 desks-0.085
 because-0.084
packs-0.083
Token feathers
Token ID27400
Feature activation+3.33e-02

Pos logit contributions
 course+0.039
 winners+0.039
agging+0.039
ier+0.038
 thank+0.037
Neg logit contributions
 Maybe-0.054
and-0.049
 desks-0.048
keeping-0.048
 alley-0.047
Token sunrise
Token ID40048
Feature activation+4.66e-02

Pos logit contributions
aid+0.096
agging+0.092
bled+0.091
ig+0.091
ier+0.087
Neg logit contributions
 Maybe-0.071
 hallway-0.064
 stove-0.063
 hall-0.063
 generation-0.061
Token is
Token ID318
Feature activation+3.64e-02

Pos logit contributions
agging+0.097
 course+0.085
aid+0.084
 winners+0.082
bling+0.082
Neg logit contributions
 Maybe-0.059
grand-0.048
keeping-0.048
 generation-0.047
 desks-0.046
Token so
Token ID523
Feature activation+1.69e-02

Pos logit contributions
agging+0.074
aid+0.074
 course+0.069
ig+0.069
agged+0.066
Neg logit contributions
 because-0.044
 Maybe-0.042
 tonight-0.042
 Grand-0.037
 generation-0.037
Token pretty
Token ID2495
Feature activation+3.27e-02

Pos logit contributions
agging+0.025
 course+0.024
ig+0.022
ier+0.022
 winners+0.021
Neg logit contributions
 Maybe-0.031
grand-0.028
and-0.027
 desks-0.027
 generation-0.027
Token,"
Token ID553
Feature activation+4.53e-02

Pos logit contributions
agging+0.053
 course+0.052
 thank+0.048
 winners+0.048
aid+0.047
Neg logit contributions
 Maybe-0.054
keeping-0.048
and-0.046
 desks-0.046
 generation-0.046
Token said
Token ID531
Feature activation+7.00e-02

Pos logit contributions
agging+0.078
ier+0.072
 winners+0.071
br+0.070
 course+0.070
Neg logit contributions
 Maybe-0.078
grand-0.067
 generation-0.066
keeping-0.064
 desks-0.062
Token Blue
Token ID4518
Feature activation-5.56e-03

Pos logit contributions
agging+0.159
is+0.150
ig+0.146
oming+0.143
bling+0.142
Neg logit contributions
 Maybe-0.093
 generation-0.076
 Grand-0.075
 They-0.074
 Who-0.071
Token.
Token ID13
Feature activation+4.64e-02

Pos logit contributions
 Maybe+0.007
keeping+0.006
and+0.005
 generation+0.005
 desks+0.005
Neg logit contributions
 course-0.012
agging-0.012
br-0.011
 winners-0.011
 thank-0.011
Token
Token ID198
Feature activation+2.35e-02

Pos logit contributions
agging+0.101
 course+0.100
ier+0.096
 winners+0.095
 luck+0.093
Neg logit contributions
 Maybe-0.058
 generation-0.043
keeping-0.042
and-0.041
 They-0.040
Token
Token ID198
Feature activation+3.83e-02

Pos logit contributions
 course+0.047
agging+0.046
 thank+0.044
br+0.043
 winners+0.043
Neg logit contributions
 Maybe-0.030
keeping-0.026
 generation-0.026
 desks-0.025
and-0.025
TokenBlue
Token ID14573
Feature activation+9.85e-03

Pos logit contributions
ier+0.064
 course+0.064
ig+0.063
 winners+0.063
 luck+0.063
Neg logit contributions
 Maybe-0.063
 generation-0.056
and-0.053
grand-0.050
 They-0.049
Token colors
Token ID7577
Feature activation+1.90e-02

Pos logit contributions
agging+0.081
 course+0.079
aid+0.079
isy+0.076
ier+0.076
Neg logit contributions
 Maybe-0.063
 desks-0.063
packs-0.061
 They-0.060
keeping-0.058
Token.
Token ID13
Feature activation+2.37e-02

Pos logit contributions
 course+0.045
agging+0.043
aid+0.042
 thank+0.042
 winners+0.042
Neg logit contributions
 Maybe-0.015
keeping-0.013
 desks-0.013
 because-0.012
packs-0.012
Token It
Token ID632
Feature activation+3.38e-02

Pos logit contributions
 course+0.044
 thank+0.040
agging+0.039
br+0.038
 soaring+0.037
Neg logit contributions
keeping-0.034
 Maybe-0.032
 generation-0.031
grand-0.031
 desks-0.031
Token was
Token ID373
Feature activation+1.59e-02

Pos logit contributions
agging+0.070
 course+0.068
 winners+0.064
 thank+0.063
ier+0.062
Neg logit contributions
 Maybe-0.043
 generation-0.036
 desks-0.034
and-0.033
grand-0.033
Token a
Token ID257
Feature activation+3.35e-02

Pos logit contributions
 course+0.031
agging+0.030
aid+0.030
ises+0.029
ig+0.029
Neg logit contributions
 Maybe-0.020
 because-0.019
 Grand-0.018
 generation-0.017
 desks-0.017
Token special
Token ID2041
Feature activation+7.00e-02

Pos logit contributions
agging+0.056
agged+0.052
 course+0.052
ier+0.052
aid+0.051
Neg logit contributions
 Maybe-0.054
 generation-0.052
 hallway-0.050
 desk-0.048
 topic-0.048
Token flag
Token ID6056
Feature activation+3.40e-02

Pos logit contributions
agging+0.113
aid+0.101
bling+0.100
 course+0.100
ier+0.097
Neg logit contributions
 generation-0.115
 Maybe-0.114
 hallway-0.103
 topic-0.102
 desk-0.100
Token,
Token ID11
Feature activation-5.11e-03

Pos logit contributions
 course+0.061
agging+0.061
 thank+0.057
 winners+0.057
aid+0.056
Neg logit contributions
 Maybe-0.048
 generation-0.043
keeping-0.042
 desks-0.042
and-0.041
Token and
Token ID290
Feature activation+1.74e-02

Pos logit contributions
 Maybe+0.008
keeping+0.008
 generation+0.008
 desks+0.008
grand+0.007
Neg logit contributions
 course-0.008
agging-0.008
 thank-0.008
 soaring-0.008
 winners-0.008
Token everyone
Token ID2506
Feature activation+2.16e-04

Pos logit contributions
agging+0.034
 course+0.032
br+0.030
ier+0.030
aid+0.029
Neg logit contributions
 Maybe-0.025
 generation-0.022
 desks-0.021
keeping-0.020
 computers-0.020
Token liked
Token ID8288
Feature activation+1.35e-02

Pos logit contributions
 course+0.000
agging+0.000
 thank+0.000
br+0.000
 winners+0.000
Neg logit contributions
 Maybe-0.000
keeping-0.000
 generation-0.000
 desks-0.000
and-0.000
Token.
Token ID13
Feature activation+2.90e-02

Pos logit contributions
 course+0.028
agging+0.028
 winners+0.027
br+0.027
 thank+0.026
Neg logit contributions
 Maybe-0.014
and-0.011
 generation-0.011
keeping-0.011
grand-0.010
Token
Token ID198
Feature activation+2.80e-02

Pos logit contributions
agging+0.056
 course+0.055
 winners+0.052
br+0.051
ier+0.051
Neg logit contributions
 Maybe-0.042
 generation-0.033
and-0.033
keeping-0.032
 desks-0.032
Token"
Token ID1
Feature activation+1.42e-03

Pos logit contributions
agging+0.069
 course+0.067
 winners+0.067
ier+0.065
 luck+0.065
Neg logit contributions
 Maybe-0.027
and-0.021
 They-0.017
 generation-0.016
 Who-0.015
TokenHi
Token ID17250
Feature activation+3.42e-02

Pos logit contributions
agging+0.003
 course+0.003
 thank+0.002
 winners+0.002
br+0.002
Neg logit contributions
 Maybe-0.002
 generation-0.002
keeping-0.002
and-0.002
 desks-0.002
Token,"
Token ID553
Feature activation+4.54e-02

Pos logit contributions
agging+0.060
aid+0.054
 course+0.054
 winners+0.052
ier+0.051
Neg logit contributions
 Maybe-0.058
 tonight-0.047
and-0.046
 Jamie-0.046
 because-0.045
Token said
Token ID531
Feature activation+6.97e-02

Pos logit contributions
agging+0.098
ier+0.085
 winners+0.084
ig+0.082
oming+0.081
Neg logit contributions
 Maybe-0.067
 They-0.053
 Who-0.052
 Jamie-0.050
grand-0.048
Token Buzz
Token ID15753
Feature activation-8.82e-03

Pos logit contributions
agging+0.164
is+0.142
br+0.141
 winners+0.140
ig+0.139
Neg logit contributions
 Maybe-0.087
 M-0.070
 Who-0.066
 They-0.066
 Jamie-0.066
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
 Maybe+0.009
and+0.008
keeping+0.007
 generation+0.007
places+0.006
Neg logit contributions
 course-0.021
agging-0.020
br-0.019
 winners-0.019
 thank-0.018
Token
Token ID198
Feature activation+3.66e-02

Pos logit contributions
 course+0.036
agging+0.036
 thank+0.033
 winners+0.033
br+0.033
Neg logit contributions
 Maybe-0.028
 generation-0.024
keeping-0.024
 desks-0.024
and-0.023
TokenThe
Token ID464
Feature activation+2.19e-02

Pos logit contributions
agging+0.086
 luck+0.085
 winners+0.084
ier+0.083
 medal+0.080
Neg logit contributions
 Maybe-0.040
and-0.034
 They-0.028
 generation-0.024
 Who-0.024
Token cater
Token ID20825
Feature activation+8.94e-03

Pos logit contributions
agging+0.039
ier+0.035
 course+0.035
 winners+0.034
agged+0.034
Neg logit contributions
 Maybe-0.034
 alley-0.029
 desks-0.029
and-0.028
 topic-0.028
Token your
Token ID534
Feature activation+2.62e-02

Pos logit contributions
agging+0.025
 course+0.024
ier+0.023
br+0.022
 winners+0.022
Neg logit contributions
 Maybe-0.023
 generation-0.020
keeping-0.020
and-0.020
 desks-0.020
Token pretty
Token ID2495
Feature activation+3.95e-02

Pos logit contributions
agging+0.036
 course+0.035
ier+0.034
br+0.033
aid+0.032
Neg logit contributions
 Maybe-0.049
 generation-0.044
keeping-0.044
 desks-0.044
and-0.042
Token wings
Token ID12098
Feature activation+4.36e-02

Pos logit contributions
 course+0.054
agging+0.054
 winners+0.051
ier+0.050
aid+0.050
Neg logit contributions
 Maybe-0.072
keeping-0.066
 desks-0.066
 generation-0.066
packs-0.064
Token."
Token ID526
Feature activation+2.20e-02

Pos logit contributions
agging+0.088
 course+0.087
aid+0.083
bled+0.081
br+0.080
Neg logit contributions
 Maybe-0.052
and-0.042
 because-0.042
keeping-0.042
 tonight-0.040
Token The
Token ID383
Feature activation+4.09e-02

Pos logit contributions
 course+0.040
 thank+0.038
agging+0.036
 soaring+0.035
aid+0.034
Neg logit contributions
 desks-0.029
 generation-0.029
keeping-0.029
 Maybe-0.029
packs-0.028
Token butterfly
Token ID35113
Feature activation+6.95e-02

Pos logit contributions
agging+0.067
ier+0.066
 seventeen+0.060
ags+0.058
br+0.057
Neg logit contributions
 Maybe-0.063
 alley-0.060
 generation-0.058
 hall-0.055
 hallway-0.052
Token replied
Token ID8712
Feature activation+4.34e-02

Pos logit contributions
agging+0.152
ags+0.143
 winners+0.140
ier+0.137
bling+0.134
Neg logit contributions
 Maybe-0.089
 wanted-0.077
 because-0.073
 dreamed-0.061
 They-0.060
Token,
Token ID11
Feature activation+2.78e-02

Pos logit contributions
 course+0.077
agging+0.074
 winners+0.073
ises+0.070
 medal+0.069
Neg logit contributions
 Maybe-0.066
 because-0.059
and-0.058
 They-0.054
 Jamie-0.050
Token "
Token ID366
Feature activation+7.81e-03

Pos logit contributions
 course+0.061
agging+0.061
 winners+0.059
ier+0.058
 luck+0.058
Neg logit contributions
 Maybe-0.032
and-0.022
 They-0.022
 generation-0.022
 because-0.022
TokenThank
Token ID10449
Feature activation+2.65e-02

Pos logit contributions
agging+0.015
 course+0.014
 thank+0.014
 winners+0.013
br+0.013
Neg logit contributions
 Maybe-0.011
 generation-0.010
and-0.010
keeping-0.009
grand-0.009
Token you
Token ID345
Feature activation+5.98e-03

Pos logit contributions
agging+0.048
 course+0.046
 winners+0.045
 thank+0.045
br+0.045
Neg logit contributions
 Maybe-0.037
and-0.033
keeping-0.032
 generation-0.032
grand-0.030
Token is
Token ID318
Feature activation-3.56e-02

Pos logit contributions
keeping+0.058
 generation+0.057
and+0.056
 Maybe+0.055
grand+0.052
Neg logit contributions
 course-0.034
 best-0.032
agging-0.032
 thank-0.030
br-0.029
Token very
Token ID845
Feature activation-5.19e-02

Pos logit contributions
 Maybe+0.065
keeping+0.060
 generation+0.059
and+0.058
 desks+0.058
Neg logit contributions
 course-0.051
agging-0.051
 thank-0.047
br-0.046
 winners-0.046
Token messy
Token ID24097
Feature activation-4.47e-02

Pos logit contributions
 Maybe+0.092
keeping+0.089
 generation+0.087
and+0.082
grand+0.078
Neg logit contributions
agging-0.077
 thank-0.074
 course-0.073
 best-0.071
br-0.071
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
 generation+0.083
 Maybe+0.083
grand+0.083
 Who+0.077
keeping+0.075
Neg logit contributions
ier-0.061
 course-0.060
agging-0.059
 best-0.058
 better-0.055
Token There
Token ID1318
Feature activation-2.16e-02

Pos logit contributions
keeping+0.018
 generation+0.017
 desks+0.017
grand+0.017
 computers+0.016
Neg logit contributions
agging-0.021
 course-0.021
 thank-0.020
 best-0.020
 winners-0.019
Token are
Token ID389
Feature activation-8.04e-02

Pos logit contributions
 Maybe+0.045
keeping+0.045
 generation+0.044
and+0.042
 alley+0.040
Neg logit contributions
 course-0.026
 best-0.024
 thank-0.023
 winners-0.021
agging-0.021
Token clothes
Token ID8242
Feature activation-3.52e-02

Pos logit contributions
 Maybe+0.151
 generation+0.146
keeping+0.145
grand+0.139
jo+0.132
Neg logit contributions
 best-0.109
 course-0.106
 winners-0.103
agging-0.100
 more-0.096
Token,
Token ID11
Feature activation-5.96e-02

Pos logit contributions
 Maybe+0.064
 generation+0.058
keeping+0.058
 desks+0.057
and+0.056
Neg logit contributions
 course-0.051
agging-0.051
 thank-0.047
br-0.046
 winners-0.046
Token papers
Token ID9473
Feature activation-3.67e-02

Pos logit contributions
 Maybe+0.097
 generation+0.093
keeping+0.089
and+0.086
grand+0.084
Neg logit contributions
 course-0.096
agging-0.092
 thank-0.090
 winners-0.087
aid-0.086
Token,
Token ID11
Feature activation-5.86e-02

Pos logit contributions
 Maybe+0.065
keeping+0.059
 desks+0.059
 generation+0.059
and+0.057
Neg logit contributions
 course-0.054
agging-0.053
 thank-0.049
 winners-0.048
br-0.048
Token and
Token ID290
Feature activation-5.20e-02

Pos logit contributions
 Maybe+0.087
 generation+0.079
keeping+0.077
and+0.075
grand+0.072
Neg logit contributions
 course-0.105
agging-0.103
 thank-0.098
br-0.097
 winners-0.096
Token was
Token ID373
Feature activation-5.85e-03

Pos logit contributions
 course+0.020
 best+0.018
 thank+0.018
agging+0.017
 soaring+0.017
Neg logit contributions
keeping-0.020
 Maybe-0.019
 generation-0.019
 desks-0.019
grand-0.018
Token a
Token ID257
Feature activation+1.22e-02

Pos logit contributions
 Maybe+0.011
keeping+0.010
 desks+0.010
grand+0.010
 generation+0.010
Neg logit contributions
 course-0.008
agging-0.008
 thank-0.007
 best-0.007
 winners-0.007
Token little
Token ID1310
Feature activation+1.32e-02

Pos logit contributions
agging+0.021
 course+0.021
ier+0.020
 winners+0.019
aid+0.019
Neg logit contributions
 Maybe-0.019
 generation-0.016
 desks-0.016
 alley-0.016
 hallway-0.015
Token girl
Token ID2576
Feature activation-1.87e-02

Pos logit contributions
agging+0.024
 course+0.023
 soaring+0.022
 winners+0.022
ier+0.022
Neg logit contributions
 Maybe-0.019
 generation-0.017
 desks-0.016
 alley-0.016
keeping-0.015
Token called
Token ID1444
Feature activation-2.77e-02

Pos logit contributions
 Maybe+0.036
keeping+0.036
places+0.034
 desks+0.034
 generation+0.034
Neg logit contributions
 course-0.026
agging-0.023
 thank-0.022
ig-0.021
br-0.021
Token Fel
Token ID13937
Feature activation-7.99e-02

Pos logit contributions
 Maybe+0.042
 generation+0.040
places+0.040
 desks+0.037
keeping+0.037
Neg logit contributions
 thank-0.047
 course-0.044
br-0.042
 tweet-0.040
 winners-0.039
Tokenicia
Token ID33577
Feature activation-1.96e-02

Pos logit contributions
 Maybe+0.124
and+0.110
keeping+0.107
grand+0.102
care+0.099
Neg logit contributions
agging-0.145
 course-0.135
ig-0.129
 thank-0.128
isy-0.127
Token.
Token ID13
Feature activation-1.92e-02

Pos logit contributions
 Maybe+0.035
keeping+0.034
 generation+0.033
 desks+0.032
grand+0.031
Neg logit contributions
 course-0.029
 thank-0.026
agging-0.025
 best-0.024
ig-0.023
Token She
Token ID1375
Feature activation+1.20e-02

Pos logit contributions
keeping+0.026
 generation+0.025
grand+0.023
 Maybe+0.023
and+0.023
Neg logit contributions
 course-0.035
br-0.035
 thank-0.034
agging-0.034
 best-0.033
Token loved
Token ID6151
Feature activation+1.04e-02

Pos logit contributions
 course+0.016
agging+0.016
br+0.015
 thank+0.015
 best+0.015
Neg logit contributions
 Maybe-0.023
keeping-0.022
 generation-0.021
packs-0.021
 desks-0.021
Token to
Token ID284
Feature activation+5.88e-03

Pos logit contributions
 course+0.020
agging+0.019
 thank+0.018
 soaring+0.018
 gu+0.018
Neg logit contributions
 Maybe-0.015
 generation-0.013
keeping-0.013
and-0.012
grand-0.012
Token Jake
Token ID14757
Feature activation-3.31e-02

Pos logit contributions
 Maybe+0.028
and+0.027
grand+0.026
keeping+0.026
 desks+0.025
Neg logit contributions
 course-0.027
 soaring-0.024
 thank-0.024
agging-0.024
 best-0.023
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
 Maybe+0.054
keeping+0.051
 desks+0.049
 generation+0.049
and+0.048
Neg logit contributions
 course-0.053
 thank-0.049
br-0.047
agging-0.047
ig-0.046
Token his
Token ID465
Feature activation+3.21e-03

Pos logit contributions
 thank+0.015
 course+0.015
 soaring+0.013
agging+0.013
 best+0.013
Neg logit contributions
 Maybe-0.018
and-0.016
 desks-0.016
keeping-0.016
 generation-0.016
Token dog
Token ID3290
Feature activation+7.93e-03

Pos logit contributions
 course+0.006
agging+0.006
br+0.006
ier+0.006
 nineteen+0.005
Neg logit contributions
 Maybe-0.004
 desks-0.004
 generation-0.004
keeping-0.004
and-0.004
Token,
Token ID11
Feature activation+1.17e-02

Pos logit contributions
agging+0.014
 course+0.013
 winners+0.013
br+0.012
aid+0.012
Neg logit contributions
 Maybe-0.012
 generation-0.011
 desks-0.011
keeping-0.011
and-0.010
Token F
Token ID376
Feature activation-7.75e-02

Pos logit contributions
 course+0.022
agging+0.021
 thank+0.021
 winners+0.020
 soaring+0.020
Neg logit contributions
 Maybe-0.016
 desks-0.015
keeping-0.014
 generation-0.014
grand-0.014
Tokenuzz
Token ID4715
Feature activation-2.21e-02

Pos logit contributions
 Maybe+0.105
 desks+0.093
 computers+0.092
keeping+0.091
 cigarettes+0.086
Neg logit contributions
agging-0.166
ig-0.135
is-0.133
ier-0.131
oming-0.127
Tokeny
Token ID88
Feature activation+1.17e-02

Pos logit contributions
 Maybe+0.034
keeping+0.030
 generation+0.030
 desks+0.029
and+0.029
Neg logit contributions
agging-0.039
 course-0.039
br-0.036
ier-0.036
 thank-0.035
Token,
Token ID11
Feature activation+1.06e-02

Pos logit contributions
 course+0.017
agging+0.017
br+0.016
 thank+0.016
 winners+0.015
Neg logit contributions
 Maybe-0.021
 generation-0.019
keeping-0.019
 desks-0.019
and-0.018
Token went
Token ID1816
Feature activation+1.87e-02

Pos logit contributions
 course+0.019
agging+0.018
 thank+0.017
 soaring+0.017
 winners+0.017
Neg logit contributions
 Maybe-0.016
keeping-0.015
 desks-0.015
 generation-0.014
and-0.014
Token on
Token ID319
Feature activation-5.92e-04

Pos logit contributions
 course+0.037
agging+0.035
 winners+0.035
aid+0.034
 thank+0.034
Neg logit contributions
 Maybe-0.023
and-0.020
 desks-0.020
 generation-0.020
keeping-0.020
Token dog
Token ID3290
Feature activation-7.31e-03

Pos logit contributions
agging+0.033
 course+0.032
 winners+0.030
 thank+0.029
br+0.029
Neg logit contributions
 Maybe-0.036
 generation-0.032
 desks-0.031
keeping-0.031
and-0.030
Token called
Token ID1444
Feature activation-6.77e-03

Pos logit contributions
 Maybe+0.011
keeping+0.010
 generation+0.010
 desks+0.010
and+0.010
Neg logit contributions
 course-0.012
agging-0.012
 thank-0.011
br-0.011
 winners-0.011
Token Bos
Token ID14548
Feature activation-7.02e-02

Pos logit contributions
 Maybe+0.010
keeping+0.009
 generation+0.009
 desks+0.009
and+0.009
Neg logit contributions
 course-0.012
agging-0.012
 thank-0.011
br-0.011
 winners-0.011
Tokenco
Token ID1073
Feature activation-2.15e-02

Pos logit contributions
 Maybe+0.158
 generation+0.147
keeping+0.142
 desks+0.139
 hallway+0.133
Neg logit contributions
 course-0.085
agging-0.081
is-0.074
ig-0.065
br-0.064
Token.
Token ID13
Feature activation-2.34e-02

Pos logit contributions
 Maybe+0.038
keeping+0.032
 generation+0.031
 desks+0.030
and+0.030
Neg logit contributions
 course-0.037
agging-0.034
is-0.031
 thank-0.031
br-0.030
Token Bos
Token ID14548
Feature activation-7.70e-02

Pos logit contributions
keeping+0.039
 generation+0.037
grand+0.034
and+0.033
 Maybe+0.032
Neg logit contributions
 course-0.039
 thank-0.038
br-0.034
agging-0.034
 best-0.033
Tokenco
Token ID1073
Feature activation-1.17e-02

Pos logit contributions
 generation+0.175
 Maybe+0.173
keeping+0.164
packs+0.153
 desks+0.151
Neg logit contributions
 course-0.089
agging-0.084
is-0.078
br-0.067
ig-0.067
Token liked
Token ID8288
Feature activation-1.95e-02

Pos logit contributions
 Maybe+0.020
keeping+0.018
 generation+0.017
 desks+0.017
and+0.016
Neg logit contributions
 course-0.021
agging-0.018
 thank-0.017
is-0.017
br-0.017
Token to
Token ID284
Feature activation-3.29e-03

Pos logit contributions
 Maybe+0.031
 generation+0.029
keeping+0.027
grand+0.027
packs+0.026
Neg logit contributions
 course-0.035
 thank-0.032
agging-0.032
 best-0.031
 soaring-0.030
Token go
Token ID467
Feature activation-3.42e-03

Pos logit contributions
 Maybe+0.005
 generation+0.004
 desks+0.004
keeping+0.004
and+0.004
Neg logit contributions
agging-0.006
 course-0.006
 winners-0.006
br-0.006
ier-0.005
Token outside
Token ID2354
Feature activation-2.88e-02

Pos logit contributions
 Maybe+0.006
keeping+0.006
 generation+0.005
 desks+0.005
grand+0.005
Neg logit contributions
 course-0.005
agging-0.005
br-0.005
 thank-0.005
 winners-0.005
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
 Maybe+0.035
 generation+0.031
keeping+0.031
 desks+0.031
and+0.030
Neg logit contributions
 course-0.036
agging-0.036
 thank-0.033
 winners-0.033
br-0.033
Token Suddenly
Token ID24975
Feature activation+1.07e-02

Pos logit contributions
 generation+0.018
keeping+0.018
 desks+0.017
 laptop+0.017
grand+0.017
Neg logit contributions
 course-0.021
 thank-0.020
agging-0.019
br-0.019
 best-0.018
Token she
Token ID673
Feature activation-4.02e-03

Pos logit contributions
agging+0.013
 course+0.013
 winners+0.011
 thank+0.011
br+0.011
Neg logit contributions
 Maybe-0.022
 desks-0.020
 generation-0.020
keeping-0.020
and-0.019
Token realized
Token ID6939
Feature activation-3.48e-03

Pos logit contributions
 Maybe+0.005
 desks+0.004
 generation+0.004
keeping+0.004
and+0.004
Neg logit contributions
agging-0.008
 course-0.008
 winners-0.008
 thank-0.008
br-0.008
Token that
Token ID326
Feature activation-1.08e-02

Pos logit contributions
 Maybe+0.006
keeping+0.006
 generation+0.006
 desks+0.006
and+0.005
Neg logit contributions
 course-0.005
agging-0.005
 thank-0.005
br-0.005
 winners-0.005
Token F
Token ID376
Feature activation-7.64e-02

Pos logit contributions
 Maybe+0.018
keeping+0.017
 generation+0.016
and+0.016
 desks+0.016
Neg logit contributions
 course-0.017
 thank-0.016
 soaring-0.016
 best-0.015
agging-0.015
Tokenuzz
Token ID4715
Feature activation-3.75e-02

Pos logit contributions
 computers+0.106
 desks+0.105
 hotels+0.103
 tables+0.097
 Maybe+0.096
Neg logit contributions
agging-0.141
ig-0.129
ier-0.125
inking-0.123
is-0.120
Tokeny
Token ID88
Feature activation-1.20e-02

Pos logit contributions
 Maybe+0.057
 generation+0.051
 desks+0.050
keeping+0.049
and+0.049
Neg logit contributions
 course-0.066
agging-0.066
ier-0.063
br-0.061
ig-0.060
Token must
Token ID1276
Feature activation-2.61e-03

Pos logit contributions
 Maybe+0.018
and+0.016
keeping+0.016
 generation+0.016
 desks+0.016
Neg logit contributions
 course-0.021
agging-0.020
 thank-0.020
br-0.019
 medal-0.019
Token have
Token ID423
Feature activation-7.91e-03

Pos logit contributions
 Maybe+0.005
keeping+0.004
 generation+0.004
 desks+0.004
and+0.004
Neg logit contributions
agging-0.004
 course-0.004
br-0.004
 thank-0.004
 winners-0.004
Token come
Token ID1282
Feature activation-2.24e-03

Pos logit contributions
 Maybe+0.012
 generation+0.010
keeping+0.010
 desks+0.010
and+0.010
Neg logit contributions
 course-0.014
agging-0.014
 thank-0.013
br-0.013
 winners-0.013
Token one
Token ID530
Feature activation-1.35e-02

Pos logit contributions
 Maybe+0.006
keeping+0.006
grand+0.006
 desks+0.006
places+0.006
Neg logit contributions
 course-0.004
agging-0.004
 best-0.004
 soaring-0.004
 more-0.004
Token day
Token ID1110
Feature activation-4.85e-03

Pos logit contributions
 Maybe+0.022
and+0.019
 generation+0.019
 desks+0.019
packs+0.018
Neg logit contributions
agging-0.023
 course-0.022
 thank-0.021
br-0.021
 winners-0.021
Token.
Token ID13
Feature activation-2.57e-02

Pos logit contributions
 Maybe+0.009
keeping+0.009
grand+0.009
 desks+0.008
 generation+0.008
Neg logit contributions
agging-0.007
 course-0.006
 soaring-0.006
 thank-0.006
 best-0.006
Token Suddenly
Token ID24975
Feature activation-3.22e-03

Pos logit contributions
grand+0.038
 laptop+0.034
 generation+0.034
keeping+0.034
 heaven+0.033
Neg logit contributions
 higher-0.043
 course-0.043
 soaring-0.042
 thank-0.042
 best-0.041
Token,
Token ID11
Feature activation-2.99e-02

Pos logit contributions
 Maybe+0.007
keeping+0.007
 generation+0.007
and+0.007
 desks+0.007
Neg logit contributions
 course-0.003
agging-0.003
br-0.003
 thank-0.003
 winners-0.003
Token F
Token ID376
Feature activation-7.60e-02

Pos logit contributions
 Maybe+0.049
grand+0.048
keeping+0.048
 generation+0.046
and+0.046
Neg logit contributions
 course-0.049
 soaring-0.045
br-0.044
 thank-0.044
 best-0.044
Tokenuzz
Token ID4715
Feature activation-3.38e-02

Pos logit contributions
 computers+0.104
 desks+0.103
keeping+0.102
 Maybe+0.099
grand+0.098
Neg logit contributions
agging-0.147
ig-0.134
agged-0.123
inking-0.122
ags-0.122
Tokeny
Token ID88
Feature activation-4.28e-03

Pos logit contributions
 Maybe+0.053
 generation+0.047
keeping+0.047
grand+0.046
and+0.046
Neg logit contributions
agging-0.059
 course-0.057
br-0.054
ier-0.054
ig-0.053
Token saw
Token ID2497
Feature activation+6.44e-04

Pos logit contributions
 Maybe+0.006
keeping+0.006
 generation+0.006
 desks+0.006
and+0.006
Neg logit contributions
 course-0.007
agging-0.007
 thank-0.007
br-0.007
 winners-0.007
Token something
Token ID1223
Feature activation-3.65e-02

Pos logit contributions
 course+0.001
agging+0.001
 thank+0.001
br+0.001
 winners+0.001
Neg logit contributions
 Maybe-0.001
keeping-0.001
 generation-0.001
 desks-0.001
and-0.001
Token up
Token ID510
Feature activation-8.55e-03

Pos logit contributions
 Maybe+0.065
 desks+0.058
and+0.057
 generation+0.056
 computers+0.056
Neg logit contributions
agging-0.053
 course-0.053
 winners-0.049
 thank-0.047
aid-0.047
Token walking
Token ID6155
Feature activation+6.81e-03

Pos logit contributions
 Maybe+0.019
keeping+0.018
 generation+0.017
and+0.017
 desks+0.016
Neg logit contributions
 course-0.020
agging-0.020
 thank-0.019
br-0.019
 soaring-0.019
Token in
Token ID287
Feature activation-3.88e-02

Pos logit contributions
 course+0.010
agging+0.010
 thank+0.009
 winners+0.009
br+0.009
Neg logit contributions
 Maybe-0.013
keeping-0.011
 generation-0.011
 desks-0.011
and-0.011
Token the
Token ID262
Feature activation-2.27e-02

Pos logit contributions
 Maybe+0.101
grand+0.098
and+0.098
uries+0.097
keeping+0.096
Neg logit contributions
 course-0.032
 thank-0.021
 best-0.019
 soaring-0.019
 better-0.018
Token park
Token ID3952
Feature activation-1.39e-02

Pos logit contributions
 Maybe+0.040
keeping+0.038
 generation+0.036
and+0.036
 desks+0.035
Neg logit contributions
 course-0.034
agging-0.033
br-0.030
 thank-0.030
 best-0.030
Token,
Token ID11
Feature activation-4.56e-02

Pos logit contributions
keeping+0.038
 generation+0.037
 Maybe+0.037
places+0.036
 computers+0.036
Neg logit contributions
 course-0.006
 thank-0.005
 best-0.005
br-0.004
 better-0.004
Token L
Token ID406
Feature activation-7.51e-02

Pos logit contributions
keeping+0.084
 generation+0.079
grand+0.076
 Maybe+0.076
and+0.071
Neg logit contributions
 course-0.067
 best-0.065
br-0.065
 thank-0.065
 soaring-0.063
Tokenizzy
Token ID40593
Feature activation-3.77e-02

Pos logit contributions
 generation+0.137
keeping+0.132
 Maybe+0.124
grand+0.116
 computers+0.115
Neg logit contributions
ig-0.125
agging-0.120
ier-0.112
is-0.112
 course-0.107
Token saw
Token ID2497
Feature activation+2.61e-03

Pos logit contributions
 Maybe+0.076
keeping+0.075
and+0.073
 generation+0.071
grand+0.071
Neg logit contributions
 course-0.041
 soaring-0.038
ig-0.037
 best-0.036
 eagle-0.034
Token something
Token ID1223
Feature activation-3.58e-02

Pos logit contributions
 course+0.004
agging+0.004
 thank+0.004
br+0.004
 winners+0.004
Neg logit contributions
 Maybe-0.004
keeping-0.004
 generation-0.004
 desks-0.004
and-0.004
Token spark
Token ID9009
Feature activation+3.79e-03

Pos logit contributions
 Maybe+0.066
 desks+0.060
and+0.058
 generation+0.058
 computers+0.058
Neg logit contributions
 course-0.050
agging-0.049
 winners-0.046
 thank-0.043
aid-0.042
Tokenly
Token ID306
Feature activation-1.72e-02

Pos logit contributions
 winners+0.006
 course+0.006
agging+0.006
 thank+0.006
aid+0.005
Neg logit contributions
 Maybe-0.006
 generation-0.005
places-0.005
and-0.005
keeping-0.005
Token day
Token ID1110
Feature activation-8.24e-03

Pos logit contributions
 course+0.006
agging+0.005
 thank+0.005
br+0.005
 soaring+0.005
Neg logit contributions
 Maybe-0.006
 desks-0.006
keeping-0.006
and-0.005
grand-0.005
Token.
Token ID13
Feature activation+4.77e-03

Pos logit contributions
 Maybe+0.013
 generation+0.011
 desks+0.011
keeping+0.011
grand+0.011
Neg logit contributions
agging-0.014
 course-0.014
 thank-0.013
br-0.013
 winners-0.013
Token One
Token ID1881
Feature activation-1.92e-02

Pos logit contributions
 course+0.008
 thank+0.007
agging+0.006
 best+0.006
 winners+0.006
Neg logit contributions
keeping-0.008
 desks-0.008
 laptop-0.007
 computers-0.007
 alley-0.007
Token day
Token ID1110
Feature activation+1.67e-02

Pos logit contributions
 Maybe+0.033
keeping+0.031
 desks+0.030
 generation+0.030
and+0.029
Neg logit contributions
 course-0.030
agging-0.029
 thank-0.027
br-0.027
 winners-0.026
Token,
Token ID11
Feature activation-3.24e-03

Pos logit contributions
 course+0.019
agging+0.018
 thank+0.017
br+0.017
 winners+0.017
Neg logit contributions
 Maybe-0.036
keeping-0.033
 generation-0.033
 desks-0.032
and-0.032
Token Fel
Token ID13937
Feature activation-7.50e-02

Pos logit contributions
 Maybe+0.006
and+0.005
keeping+0.005
grand+0.005
 desks+0.005
Neg logit contributions
 course-0.005
 thank-0.005
 best-0.005
 winners-0.005
 soaring-0.004
Tokenicity
Token ID8467
Feature activation-2.18e-02

Pos logit contributions
 Maybe+0.129
keeping+0.113
and+0.113
packs+0.106
care+0.106
Neg logit contributions
agging-0.128
isy-0.114
ig-0.113
is-0.112
ier-0.111
Token saw
Token ID2497
Feature activation+3.83e-03

Pos logit contributions
 Maybe+0.035
and+0.034
keeping+0.033
 desks+0.032
 generation+0.032
Neg logit contributions
 course-0.033
 thank-0.031
ig-0.030
 best-0.030
br-0.030
Token that
Token ID326
Feature activation-5.30e-03

Pos logit contributions
agging+0.006
 course+0.006
 thank+0.006
br+0.006
 winners+0.006
Neg logit contributions
 Maybe-0.006
keeping-0.005
 generation-0.005
 desks-0.005
and-0.005
Token the
Token ID262
Feature activation+9.30e-03

Pos logit contributions
 Maybe+0.010
keeping+0.009
and+0.009
 desks+0.009
grand+0.009
Neg logit contributions
 course-0.007
 best-0.007
 soaring-0.007
 thank-0.006
 medal-0.006
Token chickens
Token ID25972
Feature activation+1.35e-02

Pos logit contributions
agging+0.014
 course+0.013
 thank+0.012
br+0.012
aid+0.012
Neg logit contributions
 Maybe-0.016
 generation-0.015
 desks-0.015
keeping-0.015
 computers-0.014
Token friends
Token ID2460
Feature activation-1.37e-02

Pos logit contributions
 Maybe+0.016
keeping+0.015
 generation+0.015
 desks+0.014
and+0.014
Neg logit contributions
 course-0.014
agging-0.014
 thank-0.013
 winners-0.013
br-0.013
Token.
Token ID13
Feature activation-7.76e-03

Pos logit contributions
 Maybe+0.020
keeping+0.018
 generation+0.017
 desks+0.017
grand+0.017
Neg logit contributions
 course-0.026
agging-0.025
 thank-0.023
 winners-0.023
br-0.023
Token One
Token ID1881
Feature activation-2.91e-02

Pos logit contributions
keeping+0.015
 generation+0.013
grand+0.012
 alley+0.011
 cigarettes+0.011
Neg logit contributions
 course-0.011
 best-0.011
 thank-0.011
agging-0.011
br-0.010
Token day
Token ID1110
Feature activation+5.74e-03

Pos logit contributions
keeping+0.071
 Maybe+0.067
 desks+0.064
places+0.064
 generation+0.062
Neg logit contributions
 best-0.029
 course-0.028
 thank-0.026
agging-0.025
 better-0.024
Token,
Token ID11
Feature activation-1.76e-02

Pos logit contributions
 course+0.006
agging+0.006
 thank+0.005
 winners+0.005
br+0.005
Neg logit contributions
 Maybe-0.013
keeping-0.012
 generation-0.012
 desks-0.012
and-0.011
Token L
Token ID406
Feature activation-7.50e-02

Pos logit contributions
keeping+0.025
 generation+0.024
 Maybe+0.024
grand+0.022
and+0.021
Neg logit contributions
 course-0.033
 thank-0.032
agging-0.031
 best-0.031
br-0.031
Tokenizzy
Token ID40593
Feature activation-3.00e-02

Pos logit contributions
 generation+0.139
keeping+0.138
 Maybe+0.125
 habitat+0.118
 computers+0.115
Neg logit contributions
ig-0.123
agging-0.115
is-0.112
 course-0.110
ier-0.106
Token was
Token ID373
Feature activation-1.47e-02

Pos logit contributions
and+0.056
 Maybe+0.056
 generation+0.055
keeping+0.054
ometers+0.052
Neg logit contributions
 course-0.036
 best-0.032
 thank-0.032
ig-0.031
 better-0.031
Token in
Token ID287
Feature activation-3.86e-02

Pos logit contributions
 Maybe+0.026
keeping+0.023
 generation+0.023
 desks+0.022
grand+0.022
Neg logit contributions
agging-0.023
 course-0.023
 thank-0.021
 winners-0.021
br-0.020
Token her
Token ID607
Feature activation-3.00e-02

Pos logit contributions
 Maybe+0.094
and+0.087
keeping+0.085
grand+0.085
places+0.083
Neg logit contributions
 course-0.039
 thank-0.033
 soaring-0.031
 best-0.031
agging-0.030
Token room
Token ID2119
Feature activation-2.43e-02

Pos logit contributions
 Maybe+0.037
keeping+0.034
and+0.032
 generation+0.032
grand+0.031
Neg logit contributions
 course-0.061
agging-0.060
 best-0.058
 thank-0.056
 winners-0.056
Token is
Token ID318
Feature activation-3.72e-02

Pos logit contributions
keeping+0.059
 generation+0.059
and+0.057
 Maybe+0.056
grand+0.053
Neg logit contributions
 course-0.034
 best-0.034
agging-0.031
ig-0.029
 medal-0.029
Token very
Token ID845
Feature activation-5.10e-02

Pos logit contributions
 Maybe+0.070
keeping+0.066
 generation+0.065
and+0.063
grand+0.062
Neg logit contributions
agging-0.051
 course-0.050
 best-0.048
 thank-0.047
br-0.047
Token messy
Token ID24097
Feature activation-4.08e-02

Pos logit contributions
 Maybe+0.098
keeping+0.096
 generation+0.092
and+0.087
grand+0.085
Neg logit contributions
agging-0.069
 thank-0.065
 best-0.064
 luck-0.063
br-0.062
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
 Maybe+0.076
 generation+0.074
grand+0.073
 Who+0.069
 Sure+0.068
Neg logit contributions
ier-0.056
agging-0.055
 course-0.054
 best-0.053
 luck-0.050
Token There
Token ID1318
Feature activation-2.17e-02

Pos logit contributions
 generation+0.024
keeping+0.024
grand+0.022
places+0.021
days+0.021
Neg logit contributions
 course-0.025
agging-0.024
 thank-0.024
 winners-0.022
 best-0.022
Token are
Token ID389
Feature activation-7.46e-02

Pos logit contributions
 Maybe+0.042
keeping+0.042
 generation+0.041
and+0.040
 alley+0.038
Neg logit contributions
 course-0.029
 best-0.026
 thank-0.025
 winners-0.024
 medal-0.024
Token toys
Token ID14958
Feature activation-5.79e-02

Pos logit contributions
 Maybe+0.139
 generation+0.135
keeping+0.132
grand+0.129
 Sure+0.120
Neg logit contributions
 best-0.103
 course-0.100
 winners-0.096
agging-0.092
 more-0.091
Token everywhere
Token ID8347
Feature activation-3.88e-02

Pos logit contributions
 Maybe+0.120
 generation+0.114
keeping+0.110
grand+0.109
and+0.106
Neg logit contributions
 course-0.072
agging-0.067
br-0.063
ier-0.062
 thank-0.062
Token.
Token ID13
Feature activation-2.06e-02

Pos logit contributions
 Maybe+0.072
 generation+0.071
keeping+0.070
grand+0.067
 habitat+0.065
Neg logit contributions
 course-0.055
agging-0.050
ier-0.050
br-0.050
 best-0.048
Token Mom
Token ID11254
Feature activation-4.08e-02

Pos logit contributions
 generation+0.032
keeping+0.032
grand+0.029
places+0.028
days+0.028
Neg logit contributions
 course-0.034
 thank-0.032
agging-0.031
 winners-0.030
 best-0.029
Token says
Token ID1139
Feature activation-4.32e-03

Pos logit contributions
 Maybe+0.065
 generation+0.062
keeping+0.062
and+0.059
 desks+0.057
Neg logit contributions
 course-0.068
agging-0.066
 thank-0.062
br-0.061
 winners-0.060
Token very
Token ID845
Feature activation-2.75e-02

Pos logit contributions
 Maybe+0.039
keeping+0.035
 generation+0.035
and+0.034
 desks+0.034
Neg logit contributions
agging-0.031
 course-0.031
 thank-0.029
 winners-0.028
 soaring-0.028
Token dirty
Token ID11841
Feature activation-3.16e-02

Pos logit contributions
 Maybe+0.053
 desks+0.048
keeping+0.048
 generation+0.048
and+0.048
Neg logit contributions
 course-0.037
agging-0.036
br-0.033
 thank-0.032
 winners-0.032
Token and
Token ID290
Feature activation-5.38e-02

Pos logit contributions
 generation+0.057
 Maybe+0.056
grand+0.054
 Sure+0.052
 desks+0.051
Neg logit contributions
ier-0.046
agging-0.045
 course-0.042
 best-0.042
 better-0.041
Token sm
Token ID895
Feature activation-4.23e-02

Pos logit contributions
 Maybe+0.094
 generation+0.089
keeping+0.088
and+0.086
 desks+0.084
Neg logit contributions
 course-0.080
agging-0.077
 thank-0.076
br-0.071
 winners-0.071
Tokenelly
Token ID6148
Feature activation-2.48e-02

Pos logit contributions
 generation+0.057
 laptop+0.052
 Maybe+0.052
 computers+0.051
grand+0.049
Neg logit contributions
ier-0.084
agging-0.084
aid-0.078
 course-0.077
ags-0.075
Token.
Token ID13
Feature activation-7.44e-02

Pos logit contributions
 generation+0.043
 Maybe+0.042
grand+0.039
 Sure+0.037
and+0.037
Neg logit contributions
 course-0.037
agging-0.037
ier-0.036
 best-0.035
 thank-0.035
Token I
Token ID314
Feature activation-2.96e-02

Pos logit contributions
 generation+0.119
 desks+0.116
 alley+0.115
keeping+0.114
 corridor+0.112
Neg logit contributions
 thank-0.105
 sorry-0.096
 welcome-0.093
 best-0.091
 course-0.088
Token threw
Token ID9617
Feature activation-1.42e-02

Pos logit contributions
 desks+0.055
keeping+0.054
 generation+0.053
 computers+0.052
and+0.050
Neg logit contributions
 thank-0.040
aid-0.039
 better-0.039
 course-0.038
 healed-0.034
Token her
Token ID607
Feature activation-9.80e-03

Pos logit contributions
 Maybe+0.026
 generation+0.024
keeping+0.023
grand+0.023
 desks+0.023
Neg logit contributions
 course-0.021
agging-0.020
 thank-0.019
br-0.018
 winners-0.018
Token away
Token ID1497
Feature activation-2.11e-02

Pos logit contributions
 Maybe+0.019
 generation+0.018
keeping+0.017
 desks+0.017
and+0.017
Neg logit contributions
 course-0.013
agging-0.013
 thank-0.012
 winners-0.011
br-0.011
Token."
Token ID526
Feature activation-1.79e-02

Pos logit contributions
 Maybe+0.033
 generation+0.031
 desks+0.030
keeping+0.030
grand+0.030
Neg logit contributions
 course-0.035
agging-0.034
 thank-0.032
ier-0.032
br-0.032
Tokenizzy
Token ID40593
Feature activation-2.18e-02

Pos logit contributions
 Maybe+0.103
 generation+0.095
keeping+0.094
 desks+0.091
places+0.091
Neg logit contributions
agging-0.089
 course-0.088
is-0.086
ig-0.085
ier-0.083
Token.
Token ID13
Feature activation-1.92e-02

Pos logit contributions
 Maybe+0.042
 desks+0.039
keeping+0.037
 generation+0.037
 computers+0.037
Neg logit contributions
 course-0.028
isy-0.027
agging-0.026
 thank-0.026
is-0.024
Token One
Token ID1881
Feature activation-2.84e-02

Pos logit contributions
keeping+0.028
 generation+0.027
grand+0.025
 Maybe+0.023
places+0.022
Neg logit contributions
 course-0.034
 thank-0.034
agging-0.033
br-0.033
 best-0.033
Token day
Token ID1110
Feature activation+5.81e-04

Pos logit contributions
keeping+0.067
 Maybe+0.063
 desks+0.063
places+0.061
 computers+0.060
Neg logit contributions
 best-0.028
 course-0.026
 thank-0.026
 better-0.025
agging-0.024
Token,
Token ID11
Feature activation-2.60e-02

Pos logit contributions
 course+0.001
agging+0.001
 thank+0.001
br+0.001
 winners+0.000
Neg logit contributions
 Maybe-0.001
keeping-0.001
 generation-0.001
 desks-0.001
and-0.001
Token L
Token ID406
Feature activation-7.44e-02

Pos logit contributions
keeping+0.042
 generation+0.040
 Maybe+0.039
grand+0.037
places+0.036
Neg logit contributions
 course-0.044
 thank-0.043
br-0.040
agging-0.040
 best-0.040
Tokenizzy
Token ID40593
Feature activation-2.55e-02

Pos logit contributions
 generation+0.143
keeping+0.141
 Maybe+0.130
 habitat+0.123
 computers+0.121
Neg logit contributions
ig-0.118
agging-0.109
is-0.107
 course-0.102
isy-0.101
Token decided
Token ID3066
Feature activation-6.36e-03

Pos logit contributions
and+0.050
 desks+0.048
 Maybe+0.048
uries+0.048
 hotels+0.047
Neg logit contributions
 course-0.028
 eagle-0.025
 nodded-0.025
 thank-0.024
 flew-0.024
Token to
Token ID284
Feature activation+7.59e-03

Pos logit contributions
 Maybe+0.010
 generation+0.009
keeping+0.009
packs+0.009
places+0.009
Neg logit contributions
 thank-0.011
 course-0.011
 soaring-0.010
agging-0.010
 medal-0.010
Token take
Token ID1011
Feature activation+1.67e-02

Pos logit contributions
 course+0.010
 thank+0.010
br+0.009
 medal+0.009
agging+0.008
Neg logit contributions
 Maybe-0.015
and-0.014
keeping-0.013
places-0.013
 desks-0.013
Token a
Token ID257
Feature activation-1.64e-03

Pos logit contributions
agging+0.030
 course+0.030
 winners+0.028
 thank+0.027
ier+0.027
Neg logit contributions
 Maybe-0.024
 generation-0.022
keeping-0.022
 desks-0.021
and-0.021
Token umbrella
Token ID25510
Feature activation+5.91e-04

Pos logit contributions
 course+0.004
agging+0.004
 thank+0.003
 winners+0.003
br+0.003
Neg logit contributions
 Maybe-0.005
 generation-0.004
keeping-0.004
 desks-0.004
and-0.004
Token was
Token ID373
Feature activation-1.27e-02

Pos logit contributions
 course+0.001
agging+0.001
 thank+0.001
br+0.001
 winners+0.001
Neg logit contributions
 Maybe-0.001
 generation-0.001
keeping-0.001
 desks-0.001
and-0.001
Token too
Token ID1165
Feature activation-1.55e-02

Pos logit contributions
 Maybe+0.020
keeping+0.018
 generation+0.018
 desks+0.018
 computers+0.017
Neg logit contributions
 course-0.021
agging-0.020
br-0.019
 winners-0.019
 thank-0.019
Token heavy
Token ID4334
Feature activation-2.18e-02

Pos logit contributions
 Maybe+0.030
keeping+0.027
 generation+0.027
 desks+0.027
and+0.027
Neg logit contributions
agging-0.021
 course-0.021
br-0.019
 thank-0.019
 soaring-0.019
Token for
Token ID329
Feature activation-1.63e-02

Pos logit contributions
 Maybe+0.040
 generation+0.037
 desks+0.036
keeping+0.036
and+0.035
Neg logit contributions
agging-0.032
 course-0.031
 thank-0.029
ier-0.029
br-0.028
Token F
Token ID376
Feature activation-7.42e-02

Pos logit contributions
 Maybe+0.023
 generation+0.021
keeping+0.021
 desks+0.021
and+0.020
Neg logit contributions
 course-0.029
agging-0.029
 winners-0.027
 thank-0.027
br-0.027
Tokenido
Token ID17305
Feature activation-4.33e-02

Pos logit contributions
 desks+0.113
 computers+0.108
 Maybe+0.104
 hallway+0.096
 laptops+0.094
Neg logit contributions
agging-0.147
ig-0.124
ier-0.122
is-0.120
 course-0.117
Token.
Token ID13
Feature activation-6.57e-03

Pos logit contributions
 Maybe+0.070
 desks+0.063
 generation+0.063
and+0.060
keeping+0.059
Neg logit contributions
agging-0.069
 course-0.068
 best-0.065
 thank-0.065
 soaring-0.062
Token
Token ID198
Feature activation+2.63e-03

Pos logit contributions
 desks+0.012
 generation+0.012
keeping+0.012
 laptop+0.012
 computers+0.012
Neg logit contributions
 course-0.008
 thank-0.008
agging-0.008
 best-0.007
 soaring-0.007
Token
Token ID198
Feature activation+8.01e-03

Pos logit contributions
 course+0.005
agging+0.005
 thank+0.004
br+0.004
 winners+0.004
Neg logit contributions
 Maybe-0.004
 generation-0.003
keeping-0.003
 desks-0.003
 computers-0.003
TokenSuddenly
Token ID38582
Feature activation+1.96e-02

Pos logit contributions
 course+0.012
agging+0.012
 thank+0.011
 winners+0.011
br+0.011
Neg logit contributions
 Maybe-0.014
keeping-0.013
 generation-0.013
 desks-0.013
and-0.012
Token you
Token ID345
Feature activation-1.99e-02

Pos logit contributions
 generation+0.063
keeping+0.063
 Maybe+0.060
packs+0.058
grand+0.057
Neg logit contributions
 course-0.045
 thank-0.045
 best-0.037
 better-0.037
 winners-0.036
Token cannot
Token ID2314
Feature activation-2.09e-02

Pos logit contributions
 Maybe+0.036
 generation+0.036
keeping+0.035
 desks+0.034
and+0.034
Neg logit contributions
 course-0.028
 thank-0.027
agging-0.026
 winners-0.026
 best-0.025
Token hide
Token ID7808
Feature activation-3.96e-02

Pos logit contributions
 Maybe+0.037
 generation+0.034
 desks+0.034
keeping+0.034
and+0.033
Neg logit contributions
agging-0.032
 course-0.031
 thank-0.030
br-0.027
 better-0.027
Token from
Token ID422
Feature activation-3.54e-02

Pos logit contributions
 Maybe+0.069
grand+0.067
 generation+0.066
 Sure+0.061
 hallway+0.060
Neg logit contributions
agging-0.060
 course-0.059
ier-0.055
 best-0.054
 luck-0.054
Token me
Token ID502
Feature activation-4.84e-02

Pos logit contributions
 Maybe+0.078
grand+0.071
and+0.070
 generation+0.068
keeping+0.068
Neg logit contributions
 course-0.043
agging-0.039
 soaring-0.036
 best-0.035
 thank-0.034
Token.
Token ID13
Feature activation-7.41e-02

Pos logit contributions
 Maybe+0.091
 generation+0.090
grand+0.085
 Who+0.082
 Sure+0.082
Neg logit contributions
agging-0.064
 course-0.061
 best-0.059
ier-0.058
 better-0.058
Token You
Token ID921
Feature activation-3.70e-02

Pos logit contributions
and+0.117
 desks+0.116
packs+0.111
 hallway+0.106
cases+0.105
Neg logit contributions
 sorry-0.103
 thank-0.097
 course-0.091
ags-0.090
agging-0.087
Token need
Token ID761
Feature activation-2.39e-02

Pos logit contributions
 Maybe+0.074
 generation+0.072
 desks+0.070
keeping+0.069
and+0.069
Neg logit contributions
 course-0.046
 thank-0.045
 winners-0.041
agging-0.041
 better-0.041
Token a
Token ID257
Feature activation-1.52e-02

Pos logit contributions
 Maybe+0.034
 generation+0.030
keeping+0.029
 desks+0.029
grand+0.028
Neg logit contributions
 course-0.045
agging-0.043
 thank-0.042
br-0.041
 winners-0.041
Token bath
Token ID7837
Feature activation-1.11e-02

Pos logit contributions
 Maybe+0.023
 generation+0.020
 desks+0.020
and+0.020
keeping+0.020
Neg logit contributions
 course-0.027
agging-0.026
 winners-0.024
 soaring-0.024
ier-0.023
Token.
Token ID13
Feature activation-6.44e-02

Pos logit contributions
 Maybe+0.017
 generation+0.017
 desks+0.016
grand+0.015
keeping+0.015
Neg logit contributions
 course-0.019
agging-0.018
ier-0.017
 winners-0.017
 thank-0.017
Token the
Token ID262
Feature activation+1.16e-02

Pos logit contributions
 Maybe+0.052
 generation+0.047
keeping+0.046
 desks+0.046
and+0.045
Neg logit contributions
 course-0.047
agging-0.046
 thank-0.043
br-0.042
 winners-0.042
Token wind
Token ID2344
Feature activation-1.35e-03

Pos logit contributions
agging+0.017
 winners+0.017
 course+0.017
br+0.016
 medal+0.016
Neg logit contributions
 Maybe-0.019
 generation-0.018
 desks-0.017
 hallway-0.017
keeping-0.017
Token.
Token ID13
Feature activation-7.43e-03

Pos logit contributions
 Maybe+0.002
keeping+0.001
 generation+0.001
and+0.001
 desks+0.001
Neg logit contributions
 course-0.003
agging-0.003
 thank-0.003
 winners-0.003
br-0.003
Token
Token ID198
Feature activation-1.35e-02

Pos logit contributions
 Maybe+0.010
keeping+0.010
 generation+0.010
 desks+0.010
packs+0.010
Neg logit contributions
 course-0.013
agging-0.012
 thank-0.012
br-0.012
 soaring-0.012
Token
Token ID198
Feature activation-8.42e-03

Pos logit contributions
keeping+0.022
 Maybe+0.021
 desks+0.021
 generation+0.021
 computers+0.020
Neg logit contributions
 course-0.021
agging-0.020
br-0.020
 thank-0.019
aid-0.019
TokenB
Token ID33
Feature activation-7.40e-02

Pos logit contributions
 Maybe+0.014
keeping+0.013
 generation+0.013
 desks+0.013
and+0.012
Neg logit contributions
 course-0.013
agging-0.013
br-0.012
 thank-0.012
 winners-0.012
Tokenizzy
Token ID40593
Feature activation-4.35e-03

Pos logit contributions
 generation+0.140
keeping+0.130
 computers+0.122
grand+0.119
 heaven+0.116
Neg logit contributions
ig-0.122
is-0.108
agging-0.106
oming-0.096
isy-0.094
Token woke
Token ID19092
Feature activation+1.43e-03

Pos logit contributions
 Maybe+0.006
keeping+0.006
 generation+0.006
 desks+0.005
and+0.005
Neg logit contributions
 course-0.008
agging-0.008
 thank-0.007
br-0.007
 winners-0.007
Token up
Token ID510
Feature activation+2.75e-03

Pos logit contributions
 course+0.002
agging+0.002
 winners+0.002
br+0.002
 thank+0.002
Neg logit contributions
 Maybe-0.002
 generation-0.002
 desks-0.002
keeping-0.002
and-0.002
Token and
Token ID290
Feature activation-3.33e-03

Pos logit contributions
 course+0.005
agging+0.005
 winners+0.005
br+0.005
 thank+0.004
Neg logit contributions
 Maybe-0.004
 generation-0.004
keeping-0.004
and-0.004
 desks-0.003
Token couldn
Token ID3521
Feature activation+4.33e-03

Pos logit contributions
 Maybe+0.005
keeping+0.004
 generation+0.004
 desks+0.004
and+0.004
Neg logit contributions
 course-0.006
agging-0.006
 thank-0.005
 winners-0.005
br-0.005
Token big
Token ID1263
Feature activation+4.97e-03

Pos logit contributions
 Maybe+0.016
 desks+0.015
 generation+0.014
 computers+0.014
 alley+0.014
Neg logit contributions
agging-0.013
 course-0.012
ier-0.011
 winners-0.011
aid-0.011
Token pile
Token ID14540
Feature activation+2.27e-04

Pos logit contributions
agging+0.008
 course+0.008
aid+0.008
ier+0.008
br+0.007
Neg logit contributions
 Maybe-0.008
 desks-0.007
 generation-0.007
 alley-0.007
 computers-0.007
Token of
Token ID286
Feature activation-2.12e-02

Pos logit contributions
 course+0.000
agging+0.000
 thank+0.000
ier+0.000
 winners+0.000
Neg logit contributions
 Maybe-0.000
 generation-0.000
keeping-0.000
 desks-0.000
grand-0.000
Token garbage
Token ID15413
Feature activation-4.89e-02

Pos logit contributions
 Maybe+0.034
keeping+0.030
 generation+0.030
 desks+0.030
and+0.029
Neg logit contributions
 course-0.036
agging-0.035
 thank-0.033
br-0.032
 winners-0.032
Token on
Token ID319
Feature activation-4.30e-02

Pos logit contributions
 Maybe+0.085
 generation+0.081
and+0.080
grand+0.078
keeping+0.076
Neg logit contributions
 course-0.076
agging-0.072
ier-0.068
 thank-0.067
br-0.066
Token the
Token ID262
Feature activation-7.39e-02

Pos logit contributions
 Maybe+0.106
grand+0.103
and+0.102
days+0.095
eware+0.094
Neg logit contributions
 course-0.042
 medal-0.029
 better-0.028
 gu-0.027
 thank-0.027
Token street
Token ID4675
Feature activation-2.57e-02

Pos logit contributions
 Maybe+0.121
and+0.105
grand+0.104
packs+0.104
 generation+0.102
Neg logit contributions
 course-0.123
agging-0.117
 best-0.117
 medal-0.108
 thank-0.108
Token.
Token ID13
Feature activation-7.57e-03

Pos logit contributions
 Maybe+0.044
 generation+0.040
keeping+0.040
grand+0.039
and+0.039
Neg logit contributions
 course-0.040
agging-0.037
 winners-0.035
 thank-0.034
 soaring-0.034
Token He
Token ID679
Feature activation-1.63e-02

Pos logit contributions
grand+0.013
keeping+0.013
 generation+0.012
 cigarettes+0.012
ox+0.012
Neg logit contributions
 thank-0.010
agging-0.010
 better-0.010
 best-0.010
 course-0.010
Token thought
Token ID1807
Feature activation-1.67e-02

Pos logit contributions
keeping+0.025
 Maybe+0.024
 generation+0.023
grand+0.023
and+0.022
Neg logit contributions
 course-0.027
agging-0.025
 thank-0.025
 medal-0.024
 nodded-0.024
Token it
Token ID340
Feature activation-1.44e-02

Pos logit contributions
 Maybe+0.031
keeping+0.030
 generation+0.029
grand+0.028
and+0.027
Neg logit contributions
 course-0.024
 thank-0.024
agging-0.023
br-0.021
 medal-0.021
Token As
Token ID1081
Feature activation-3.03e-02

Pos logit contributions
keeping+0.029
 generation+0.028
grand+0.028
packs+0.025
 Maybe+0.024
Neg logit contributions
 thank-0.044
br-0.042
agging-0.042
 best-0.041
 nineteen-0.040
Token they
Token ID484
Feature activation+2.77e-02

Pos logit contributions
 Maybe+0.046
keeping+0.046
 generation+0.044
grand+0.043
and+0.041
Neg logit contributions
 course-0.052
 thank-0.050
 best-0.050
br-0.049
 better-0.048
Token were
Token ID547
Feature activation+1.01e-02

Pos logit contributions
agging+0.060
 winners+0.057
ier+0.056
 course+0.055
 thank+0.054
Neg logit contributions
 Maybe-0.030
 desks-0.025
packs-0.025
 alley-0.025
days-0.024
Token walking
Token ID6155
Feature activation+2.08e-02

Pos logit contributions
 course+0.020
aid+0.018
 medal+0.018
 winners+0.018
ier+0.018
Neg logit contributions
 desks-0.012
 Maybe-0.012
 computers-0.011
care-0.011
 hallway-0.011
Token,
Token ID11
Feature activation-3.35e-02

Pos logit contributions
agging+0.033
 course+0.033
isin+0.031
 gu+0.031
 winners+0.030
Neg logit contributions
 Maybe-0.036
and-0.030
 They-0.030
 because-0.030
days-0.029
Token L
Token ID406
Feature activation-7.39e-02

Pos logit contributions
keeping+0.053
grand+0.051
 Maybe+0.049
 generation+0.049
and+0.046
Neg logit contributions
 course-0.056
 best-0.056
 thank-0.054
br-0.053
 soaring-0.052
Tokenizzy
Token ID40593
Feature activation-1.86e-02

Pos logit contributions
 generation+0.129
keeping+0.126
 Maybe+0.117
grand+0.111
 computers+0.110
Neg logit contributions
ig-0.126
agging-0.121
ier-0.115
is-0.114
 course-0.112
Token saw
Token ID2497
Feature activation+1.15e-02

Pos logit contributions
 Maybe+0.030
keeping+0.028
 generation+0.028
and+0.027
grand+0.026
Neg logit contributions
 course-0.030
agging-0.029
 soaring-0.027
br-0.027
 thank-0.027
Token something
Token ID1223
Feature activation-3.06e-02

Pos logit contributions
agging+0.019
 course+0.019
 thank+0.018
br+0.018
 winners+0.018
Neg logit contributions
 Maybe-0.018
 generation-0.016
 desks-0.016
keeping-0.016
and-0.016
Token black
Token ID2042
Feature activation-9.34e-03

Pos logit contributions
 Maybe+0.054
 desks+0.049
and+0.047
 computers+0.047
 cigarettes+0.047
Neg logit contributions
agging-0.045
 course-0.045
 winners-0.042
aid-0.039
 thank-0.039
Token in
Token ID287
Feature activation-2.28e-02

Pos logit contributions
 Maybe+0.017
 generation+0.015
keeping+0.015
 desks+0.015
and+0.015
Neg logit contributions
 course-0.014
agging-0.014
 thank-0.012
br-0.012
 winners-0.012
Token was
Token ID373
Feature activation-8.85e-03

Pos logit contributions
 course+0.024
 best+0.021
 thank+0.021
agging+0.020
br+0.019
Neg logit contributions
keeping-0.023
 Maybe-0.023
 generation-0.022
 desks-0.022
grand-0.021
Token a
Token ID257
Feature activation+1.37e-02

Pos logit contributions
 Maybe+0.018
keeping+0.017
places+0.016
 desks+0.016
grand+0.016
Neg logit contributions
 course-0.012
agging-0.011
 best-0.010
 thank-0.010
 soaring-0.010
Token little
Token ID1310
Feature activation+1.36e-02

Pos logit contributions
 course+0.023
agging+0.023
ier+0.022
 winners+0.021
 soaring+0.021
Neg logit contributions
 Maybe-0.022
 generation-0.019
 desks-0.019
 alley-0.018
grand-0.018
Token spider
Token ID19230
Feature activation-2.66e-03

Pos logit contributions
agging+0.025
 course+0.024
 soaring+0.022
 winners+0.022
br+0.022
Neg logit contributions
 Maybe-0.020
 generation-0.017
 desks-0.017
 alley-0.016
keeping-0.016
Token named
Token ID3706
Feature activation-2.99e-02

Pos logit contributions
 Maybe+0.005
keeping+0.004
 desks+0.004
 generation+0.004
and+0.004
Neg logit contributions
 course-0.004
agging-0.004
br-0.004
 thank-0.004
 soaring-0.004
Token Sp
Token ID1338
Feature activation-7.38e-02

Pos logit contributions
 Maybe+0.040
keeping+0.039
 generation+0.038
grand+0.037
packs+0.036
Neg logit contributions
 course-0.056
 thank-0.053
agging-0.052
 soaring-0.052
br-0.052
Tokenide
Token ID485
Feature activation-3.66e-02

Pos logit contributions
 Maybe+0.121
 generation+0.108
keeping+0.108
 Who+0.103
 desks+0.103
Neg logit contributions
agging-0.133
is-0.130
isy-0.129
ier-0.122
 course-0.118
Tokeny
Token ID88
Feature activation-1.15e-02

Pos logit contributions
 Maybe+0.057
 generation+0.052
keeping+0.051
 desks+0.051
grand+0.048
Neg logit contributions
agging-0.062
 course-0.062
ier-0.059
 thank-0.056
br-0.056
Token.
Token ID13
Feature activation-1.29e-02

Pos logit contributions
 Maybe+0.017
keeping+0.016
 desks+0.016
 generation+0.015
and+0.015
Neg logit contributions
 course-0.020
agging-0.020
 thank-0.019
br-0.018
 soaring-0.018
Token He
Token ID679
Feature activation+1.37e-02

Pos logit contributions
keeping+0.017
 generation+0.016
grand+0.015
 Maybe+0.015
packs+0.015
Neg logit contributions
 course-0.025
 thank-0.023
br-0.023
agging-0.023
 best-0.022
Token lived
Token ID5615
Feature activation+1.03e-02

Pos logit contributions
 course+0.025
agging+0.025
 thank+0.023
 winners+0.023
br+0.023
Neg logit contributions
 Maybe-0.020
 generation-0.017
keeping-0.017
 desks-0.017
and-0.017
TokenJ
Token ID41
Feature activation-3.68e-02

Pos logit contributions
 Maybe+0.009
keeping+0.008
 generation+0.008
 desks+0.008
places+0.008
Neg logit contributions
 course-0.007
agging-0.007
br-0.007
aid-0.006
 thank-0.006
Tokenill
Token ID359
Feature activation-1.77e-02

Pos logit contributions
 Maybe+0.052
 generation+0.051
 hotels+0.046
 hallway+0.045
keeping+0.044
Neg logit contributions
agging-0.068
 course-0.068
ier-0.067
is-0.065
ig-0.064
Token looked
Token ID3114
Feature activation+2.56e-03

Pos logit contributions
 Maybe+0.024
 generation+0.021
keeping+0.021
 desks+0.020
and+0.020
Neg logit contributions
agging-0.034
 course-0.034
 winners-0.032
 thank-0.032
br-0.031
Token around
Token ID1088
Feature activation-1.75e-02

Pos logit contributions
 course+0.004
agging+0.004
 winners+0.004
br+0.004
 thank+0.004
Neg logit contributions
 Maybe-0.004
 generation-0.004
 desks-0.004
keeping-0.004
and-0.004
Token for
Token ID329
Feature activation+5.01e-03

Pos logit contributions
 Maybe+0.028
keeping+0.025
 generation+0.025
 desks+0.024
grand+0.024
Neg logit contributions
 course-0.030
agging-0.029
 thank-0.028
 soaring-0.026
br-0.026
Token F
Token ID376
Feature activation-7.37e-02

Pos logit contributions
 course+0.008
agging+0.008
 thank+0.008
 winners+0.007
br+0.007
Neg logit contributions
 Maybe-0.008
keeping-0.007
 generation-0.007
 desks-0.007
and-0.007
Tokenuzz
Token ID4715
Feature activation-3.65e-02

Pos logit contributions
 Maybe+0.109
 hotels+0.109
 desks+0.106
 computers+0.103
 generation+0.100
Neg logit contributions
agging-0.133
ig-0.120
is-0.117
inking-0.116
ier-0.113
Tokeny
Token ID88
Feature activation-2.33e-02

Pos logit contributions
 Maybe+0.054
 generation+0.048
keeping+0.047
 desks+0.046
grand+0.046
Neg logit contributions
agging-0.066
 course-0.066
ier-0.061
 thank-0.060
 soaring-0.060
Token,
Token ID11
Feature activation-1.78e-02

Pos logit contributions
 Maybe+0.041
grand+0.038
 generation+0.037
keeping+0.036
and+0.036
Neg logit contributions
 course-0.035
agging-0.032
 thank-0.031
 best-0.031
 soaring-0.030
Token but
Token ID475
Feature activation-8.47e-03

Pos logit contributions
 Maybe+0.025
keeping+0.024
grand+0.024
 desks+0.023
 generation+0.023
Neg logit contributions
 course-0.032
agging-0.031
 soaring-0.031
 thank-0.030
 higher-0.028
Token he
Token ID339
Feature activation-2.36e-02

Pos logit contributions
 Maybe+0.013
keeping+0.012
 generation+0.012
 desks+0.012
and+0.011
Neg logit contributions
 course-0.015
agging-0.014
 thank-0.014
 soaring-0.013
br-0.013
Token saw
Token ID2497
Feature activation+2.77e-04

Pos logit contributions
 Maybe+0.003
keeping+0.003
and+0.003
 generation+0.003
 desks+0.003
Neg logit contributions
 course-0.003
 thank-0.002
 best-0.002
 winners-0.002
 better-0.002
Token a
Token ID257
Feature activation-9.34e-03

Pos logit contributions
 course+0.000
agging+0.000
 thank+0.000
 winners+0.000
br+0.000
Neg logit contributions
 Maybe-0.000
keeping-0.000
 generation-0.000
and-0.000
grand-0.000
Token big
Token ID1263
Feature activation+2.29e-03

Pos logit contributions
 Maybe+0.017
 desks+0.015
 alley+0.015
 generation+0.015
 hallway+0.015
Neg logit contributions
agging-0.014
 course-0.013
ier-0.013
agged-0.012
aid-0.012
Token factory
Token ID8860
Feature activation-3.08e-02

Pos logit contributions
agging+0.004
ier+0.004
aid+0.004
 winners+0.004
 course+0.004
Neg logit contributions
 Maybe-0.003
 desks-0.003
 alley-0.003
 laptop-0.003
 hallway-0.003
Token near
Token ID1474
Feature activation-5.09e-02

Pos logit contributions
 Maybe+0.058
 generation+0.053
keeping+0.053
grand+0.050
and+0.050
Neg logit contributions
 course-0.044
agging-0.043
 thank-0.039
 winners-0.039
br-0.039
Token their
Token ID511
Feature activation-7.34e-02

Pos logit contributions
 Maybe+0.130
grand+0.123
storms+0.122
and+0.120
places+0.118
Neg logit contributions
 course-0.037
 more-0.033
 better-0.028
 best-0.025
 thank-0.022
Token home
Token ID1363
Feature activation-2.73e-02

Pos logit contributions
 Maybe+0.119
and+0.113
keeping+0.107
 generation+0.105
gent+0.103
Neg logit contributions
 best-0.115
 course-0.111
 thank-0.104
 winners-0.099
 friends-0.099
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
 generation+0.053
 Maybe+0.052
grand+0.050
gent+0.049
|+0.048
Neg logit contributions
 best-0.032
ier-0.032
agging-0.030
 course-0.029
 thank-0.029
Token They
Token ID1119
Feature activation-1.23e-02

Pos logit contributions
 generation+0.021
keeping+0.019
grand+0.019
days+0.018
places+0.018
Neg logit contributions
 thank-0.016
 course-0.015
 nineteen-0.013
 nodded-0.013
 replied-0.013
Token wanted
Token ID2227
Feature activation-2.24e-02

Pos logit contributions
keeping+0.019
 generation+0.018
 Maybe+0.017
and+0.017
grand+0.017
Neg logit contributions
 course-0.021
 thank-0.021
 winners-0.019
 nodded-0.019
 best-0.019
Token to
Token ID284
Feature activation-5.48e-03

Pos logit contributions
 Maybe+0.041
keeping+0.038
grand+0.038
 generation+0.038
|+0.036
Neg logit contributions
 more-0.032
 better-0.031
 course-0.030
 best-0.030
 thank-0.029
Token"
Token ID1
Feature activation-5.59e-03

Pos logit contributions
 course+0.003
agging+0.003
br+0.002
 thank+0.002
 winners+0.002
Neg logit contributions
 Maybe-0.004
keeping-0.004
 generation-0.004
 desks-0.004
places-0.004
TokenYes
Token ID5297
Feature activation+3.82e-03

Pos logit contributions
 Maybe+0.008
 generation+0.007
and+0.007
 desks+0.007
keeping+0.007
Neg logit contributions
agging-0.011
 course-0.010
 thank-0.010
br-0.010
 winners-0.010
Token,
Token ID11
Feature activation-1.59e-02

Pos logit contributions
agging+0.005
 course+0.004
 winners+0.004
br+0.004
 soaring+0.004
Neg logit contributions
 Maybe-0.008
 desks-0.007
 generation-0.007
and-0.007
keeping-0.007
Token really
Token ID1107
Feature activation-3.71e-03

Pos logit contributions
 Maybe+0.025
keeping+0.022
 generation+0.022
 desks+0.022
and+0.022
Neg logit contributions
agging-0.027
 course-0.027
 thank-0.025
br-0.025
 winners-0.025
Token,"
Token ID553
Feature activation+2.27e-02

Pos logit contributions
 Maybe+0.007
 desks+0.006
 generation+0.006
keeping+0.006
grand+0.006
Neg logit contributions
agging-0.006
 course-0.006
 winners-0.005
br-0.005
aid-0.005
Token said
Token ID531
Feature activation+4.04e-02

Pos logit contributions
agging+0.045
br+0.041
ier+0.041
oming+0.041
 soaring+0.040
Neg logit contributions
 Maybe-0.034
 Grand-0.028
grand-0.028
 desks-0.027
 Jake-0.026
Token Mom
Token ID11254
Feature activation+5.88e-03

Pos logit contributions
agging+0.105
 course+0.104
br+0.100
 winners+0.098
 soaring+0.097
Neg logit contributions
 Maybe-0.028
 desks-0.020
 Grand-0.018
keeping-0.018
 generation-0.018
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
 course+0.011
agging+0.011
 winners+0.010
br+0.010
 thank+0.010
Neg logit contributions
 Maybe-0.008
keeping-0.007
and-0.007
 desks-0.007
 generation-0.007
Token "
Token ID366
Feature activation-1.93e-02

Pos logit contributions
 generation+0.024
keeping+0.024
 desks+0.023
grand+0.023
 Maybe+0.022
Neg logit contributions
 course-0.022
 thank-0.021
agging-0.020
br-0.020
 winners-0.019
TokenBut
Token ID1537
Feature activation-1.80e-02

Pos logit contributions
 Maybe+0.032
 desks+0.031
keeping+0.030
packs+0.029
 computers+0.029
Neg logit contributions
 course-0.030
agging-0.028
br-0.027
 winners-0.026
is-0.026
Token first
Token ID717
Feature activation-1.51e-03

Pos logit contributions
 Maybe+0.025
 generation+0.019
places+0.019
 desks+0.019
 cigarettes+0.018
Neg logit contributions
agging-0.038
br-0.034
 course-0.033
aid-0.032
ier-0.032
Token Buzz
Token ID15753
Feature activation+5.84e-03

Pos logit contributions
agging+0.063
 course+0.063
 winners+0.057
br+0.057
 thank+0.056
Neg logit contributions
 Maybe-0.078
 generation-0.069
keeping-0.068
and-0.067
 desks-0.067
Tokeny
Token ID88
Feature activation+3.85e-02

Pos logit contributions
 course+0.012
agging+0.012
 thank+0.011
br+0.011
 winners+0.011
Neg logit contributions
 Maybe-0.007
 generation-0.006
 desks-0.006
keeping-0.006
and-0.006
Token lived
Token ID5615
Feature activation+3.01e-02

Pos logit contributions
 course+0.066
agging+0.064
 thank+0.061
br+0.060
 best+0.058
Neg logit contributions
 Maybe-0.059
keeping-0.055
 generation-0.054
 desks-0.054
and-0.052
Token in
Token ID287
Feature activation+1.10e-03

Pos logit contributions
 course+0.063
 seventeen+0.063
ises+0.059
isin+0.059
br+0.059
Neg logit contributions
and-0.036
 Maybe-0.035
 because-0.029
 upstairs-0.028
care-0.027
Token a
Token ID257
Feature activation+1.29e-02

Pos logit contributions
 course+0.002
agging+0.001
 thank+0.001
br+0.001
 soaring+0.001
Neg logit contributions
 Maybe-0.002
keeping-0.002
 desks-0.002
 generation-0.002
and-0.002
Token big
Token ID1263
Feature activation+4.18e-02

Pos logit contributions
ier+0.021
agging+0.021
 winners+0.020
 course+0.020
aid+0.020
Neg logit contributions
 Maybe-0.021
 generation-0.019
 hallway-0.018
 alley-0.018
 desks-0.018
Token yellow
Token ID7872
Feature activation+3.89e-02

Pos logit contributions
aid+0.075
ier+0.074
 winners+0.073
agging+0.071
bled+0.071
Neg logit contributions
 Maybe-0.059
 alley-0.056
 hall-0.056
 hallway-0.054
 generation-0.054
Token hive
Token ID35881
Feature activation+2.70e-02

Pos logit contributions
agging+0.056
 course+0.055
 winners+0.054
ier+0.053
 thank+0.052
Neg logit contributions
 Maybe-0.068
 generation-0.063
 desks-0.061
keeping-0.060
and-0.059
Token with
Token ID351
Feature activation+1.53e-02

Pos logit contributions
 course+0.049
agging+0.048
 winners+0.046
br+0.046
 thank+0.046
Neg logit contributions
 Maybe-0.038
 generation-0.034
and-0.033
keeping-0.033
grand-0.031
Token lots
Token ID6041
Feature activation+8.66e-03

Pos logit contributions
agging+0.026
 course+0.025
ier+0.024
 winners+0.024
br+0.023
Neg logit contributions
 Maybe-0.023
 desks-0.021
 generation-0.021
keeping-0.021
and-0.021
Token of
Token ID286
Feature activation+2.43e-02

Pos logit contributions
 course+0.013
agging+0.013
 winners+0.012
 thank+0.012
 medal+0.012
Neg logit contributions
 Maybe-0.015
and-0.014
 desks-0.013
keeping-0.013
 generation-0.013
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
 course+0.000
agging+0.000
 thank+0.000
 winners+0.000
br+0.000
Neg logit contributions
 Maybe-0.000
keeping-0.000
 generation-0.000
 desks-0.000
and-0.000
Token The
Token ID383
Feature activation+3.76e-02

Pos logit contributions
 course+0.038
 thank+0.036
agging+0.035
br+0.034
 soaring+0.033
Neg logit contributions
keeping-0.029
 Maybe-0.029
 desks-0.029
 generation-0.028
packs-0.027
Token sun
Token ID4252
Feature activation+3.68e-02

Pos logit contributions
agging+0.062
ier+0.059
 course+0.056
 winners+0.055
bled+0.054
Neg logit contributions
 alley-0.057
 Maybe-0.056
 hallway-0.052
 hall-0.052
 desks-0.048
Token's
Token ID338
Feature activation+2.23e-02

Pos logit contributions
agging+0.075
 winners+0.071
 course+0.071
 medal+0.065
ags+0.065
Neg logit contributions
 Maybe-0.048
 hallway-0.039
places-0.037
 generation-0.036
keeping-0.036
Token warm
Token ID5814
Feature activation+2.19e-02

Pos logit contributions
 course+0.037
br+0.034
 winners+0.034
 medal+0.034
agging+0.033
Neg logit contributions
 Maybe-0.033
 generation-0.032
 cigarettes-0.031
 raining-0.030
 because-0.030
Token rays
Token ID24823
Feature activation+4.17e-02

Pos logit contributions
agging+0.038
 course+0.038
 winners+0.037
br+0.036
 thank+0.036
Neg logit contributions
 Maybe-0.032
keeping-0.029
 generation-0.028
 desks-0.027
and-0.027
Token made
Token ID925
Feature activation+7.50e-03

Pos logit contributions
agging+0.081
 winners+0.079
 course+0.076
 medal+0.073
 luck+0.072
Neg logit contributions
 Maybe-0.052
and-0.042
 hallway-0.042
 because-0.042
days-0.041
Token the
Token ID262
Feature activation+2.94e-02

Pos logit contributions
agging+0.012
 course+0.011
 winners+0.011
 medal+0.011
br+0.011
Neg logit contributions
 Maybe-0.012
 generation-0.011
and-0.011
 desks-0.011
keeping-0.010
Token rose
Token ID8278
Feature activation+5.15e-03

Pos logit contributions
agging+0.043
 course+0.040
 winners+0.038
ier+0.038
br+0.038
Neg logit contributions
 Maybe-0.053
 alley-0.049
 generation-0.048
 desks-0.047
keeping-0.047
Token feel
Token ID1254
Feature activation+3.01e-02

Pos logit contributions
 course+0.008
agging+0.008
 winners+0.008
br+0.007
 thank+0.007
Neg logit contributions
 Maybe-0.009
and-0.008
keeping-0.007
 generation-0.007
 desks-0.007
Token happy
Token ID3772
Feature activation+2.27e-04

Pos logit contributions
agging+0.049
 course+0.045
 winners+0.044
ig+0.043
ises+0.043
Neg logit contributions
 Maybe-0.050
 because-0.045
and-0.043
 They-0.041
 generation-0.041
Token,
Token ID11
Feature activation-2.69e-03

Pos logit contributions
 course+0.012
agging+0.012
 thank+0.011
br+0.011
 winners+0.011
Neg logit contributions
 Maybe-0.020
 generation-0.019
keeping-0.019
 desks-0.018
and-0.018
Token hot
Token ID3024
Feature activation+3.24e-02

Pos logit contributions
 Maybe+0.004
 generation+0.004
keeping+0.004
 desks+0.004
and+0.004
Neg logit contributions
agging-0.004
 course-0.004
br-0.004
ier-0.004
 winners-0.004
Token red
Token ID2266
Feature activation+4.43e-02

Pos logit contributions
agging+0.052
aid+0.049
ier+0.048
 winners+0.047
 course+0.046
Neg logit contributions
 Maybe-0.050
keeping-0.046
 tonight-0.045
 alley-0.044
 desks-0.044
Token balloon
Token ID21190
Feature activation+2.92e-02

Pos logit contributions
agging+0.058
aid+0.054
 course+0.052
 winners+0.049
ier+0.048
Neg logit contributions
 Maybe-0.083
 generation-0.074
 desks-0.074
keeping-0.073
packs-0.073
Token?"
Token ID1701
Feature activation+2.02e-02

Pos logit contributions
agging+0.049
 course+0.047
 winners+0.043
 thank+0.043
aid+0.043
Neg logit contributions
 Maybe-0.047
 generation-0.041
keeping-0.041
 desks-0.040
and-0.040
Token The
Token ID383
Feature activation+4.13e-02

Pos logit contributions
 course+0.040
agging+0.038
 thank+0.037
br+0.036
 soaring+0.035
Neg logit contributions
 Maybe-0.026
keeping-0.024
 generation-0.024
 desks-0.023
grand-0.022
Token hot
Token ID3024
Feature activation+2.08e-02

Pos logit contributions
ier+0.062
agging+0.061
ig+0.061
br+0.061
aid+0.061
Neg logit contributions
 Maybe-0.067
 desks-0.063
 generation-0.062
 hallway-0.061
 computers-0.060
Token red
Token ID2266
Feature activation+4.75e-02

Pos logit contributions
agging+0.027
 winners+0.025
ier+0.025
 course+0.025
aid+0.024
Neg logit contributions
 Maybe-0.039
 generation-0.035
 desks-0.035
keeping-0.035
 alley-0.035
Token balloon
Token ID21190
Feature activation+4.41e-02

Pos logit contributions
agging+0.064
ier+0.057
 winners+0.056
 course+0.056
aid+0.056
Neg logit contributions
 Maybe-0.087
 generation-0.083
 alley-0.078
 desks-0.077
packs-0.076
Token was
Token ID373
Feature activation+1.48e-02

Pos logit contributions
agging+0.079
 winners+0.072
 course+0.071
ags+0.069
aid+0.067
Neg logit contributions
 Maybe-0.067
 because-0.054
 hallway-0.054
 generation-0.053
 They-0.052
Token so
Token ID523
Feature activation+1.28e-02

Pos logit contributions
 course+0.027
ises+0.025
agging+0.025
ig+0.025
aid+0.025
Neg logit contributions
 because-0.019
 Maybe-0.019
 raining-0.019
 Jamie-0.017
 Grand-0.017
Token and
Token ID290
Feature activation+1.84e-02

Pos logit contributions
agging+0.044
 course+0.044
aid+0.042
 winners+0.041
br+0.041
Neg logit contributions
 Maybe-0.032
 generation-0.028
and-0.028
keeping-0.027
 desks-0.026
Token felt
Token ID2936
Feature activation+3.96e-02

Pos logit contributions
agging+0.034
 course+0.031
 winners+0.030
aid+0.029
br+0.029
Neg logit contributions
 Maybe-0.030
 generation-0.025
keeping-0.025
 desks-0.024
and-0.023
Token free
Token ID1479
Feature activation+2.47e-02

Pos logit contributions
agging+0.069
ig+0.065
 course+0.063
is+0.063
ises+0.061
Neg logit contributions
 Maybe-0.063
and-0.055
 because-0.055
 generation-0.054
 They-0.052
Token.
Token ID13
Feature activation+3.08e-02

Pos logit contributions
 course+0.050
agging+0.048
 thank+0.047
aid+0.046
br+0.046
Neg logit contributions
 Maybe-0.029
and-0.027
keeping-0.027
 generation-0.025
 alley-0.023
Token The
Token ID383
Feature activation+4.13e-02

Pos logit contributions
 course+0.054
 thank+0.051
agging+0.047
 soaring+0.046
br+0.044
Neg logit contributions
 desks-0.047
keeping-0.047
 laptop-0.046
 computers-0.044
 Maybe-0.044
Token bird
Token ID6512
Feature activation+4.02e-02

Pos logit contributions
agging+0.074
ier+0.072
bly+0.072
br+0.070
bled+0.070
Neg logit contributions
 Maybe-0.056
 alley-0.053
 hall-0.051
 hallway-0.048
 generation-0.048
Token knew
Token ID2993
Feature activation+1.98e-02

Pos logit contributions
agging+0.091
 winners+0.086
ags+0.084
ier+0.080
 seventeen+0.080
Neg logit contributions
 Maybe-0.044
 because-0.036
and-0.036
 wanted-0.034
 dreamed-0.033
Token it
Token ID340
Feature activation+1.56e-02

Pos logit contributions
agging+0.042
 seventeen+0.039
br+0.039
aid+0.038
ags+0.037
Neg logit contributions
 Maybe-0.025
 because-0.020
 generation-0.020
and-0.019
 Jamie-0.018
Token could
Token ID714
Feature activation+4.34e-02

Pos logit contributions
 course+0.026
 best+0.024
 soaring+0.024
 thank+0.024
br+0.024
Neg logit contributions
 Maybe-0.024
 desks-0.024
keeping-0.023
 generation-0.022
and-0.022
Token always
Token ID1464
Feature activation+3.03e-02

Pos logit contributions
agging+0.076
 winners+0.069
oming+0.068
 course+0.068
ier+0.067
Neg logit contributions
 Maybe-0.068
 generation-0.060
 sneak-0.056
grand-0.056
 tonight-0.055
Token depend
Token ID4745
Feature activation+2.36e-02

Pos logit contributions
agging+0.056
oming+0.054
bled+0.052
ier+0.051
 winners+0.051
Neg logit contributions
 Maybe-0.043
 generation-0.040
 sneak-0.036
 because-0.034
keeping-0.033
Token I
Token ID314
Feature activation+6.79e-03

Pos logit contributions
keeping+0.029
 desks+0.029
and+0.026
 generation+0.026
 alley+0.026
Neg logit contributions
 thank-0.059
 course-0.057
 winners-0.054
 nineteen-0.052
agging-0.052
Token will
Token ID481
Feature activation+1.60e-02

Pos logit contributions
 course+0.009
 thank+0.009
agging+0.009
 winners+0.008
br+0.008
Neg logit contributions
 Maybe-0.013
keeping-0.012
 desks-0.012
 generation-0.012
and-0.011
Token help
Token ID1037
Feature activation-8.82e-04

Pos logit contributions
agging+0.027
 course+0.026
br+0.024
ier+0.024
 winners+0.024
Neg logit contributions
 Maybe-0.026
 generation-0.023
keeping-0.022
 desks-0.022
grand-0.022
Token you
Token ID345
Feature activation-1.39e-03

Pos logit contributions
 Maybe+0.002
 generation+0.001
keeping+0.001
 desks+0.001
and+0.001
Neg logit contributions
 course-0.001
agging-0.001
 thank-0.001
 winners-0.001
br-0.001
Token.
Token ID13
Feature activation-2.72e-02

Pos logit contributions
 Maybe+0.002
and+0.001
keeping+0.001
 generation+0.001
 desks+0.001
Neg logit contributions
 course-0.003
agging-0.003
br-0.003
 thank-0.003
 winners-0.003
Token You
Token ID921
Feature activation+7.11e-03

Pos logit contributions
 desks+0.032
keeping+0.032
and+0.030
 computers+0.029
 generation+0.029
Neg logit contributions
 thank-0.055
 course-0.054
 winners-0.050
agging-0.050
br-0.048
Token are
Token ID389
Feature activation+7.53e-03

Pos logit contributions
agging+0.011
 course+0.011
br+0.010
 soaring+0.010
 winners+0.010
Neg logit contributions
 Maybe-0.012
 generation-0.011
keeping-0.011
 desks-0.011
grand-0.010
Token my
Token ID616
Feature activation+1.56e-02

Pos logit contributions
 course+0.014
agging+0.013
ig+0.013
ement+0.012
is+0.012
Neg logit contributions
 Maybe-0.010
 Grand-0.010
 computers-0.010
 because-0.010
 tonight-0.010
Token friend
Token ID1545
Feature activation+3.39e-02

Pos logit contributions
agging+0.018
 course+0.018
br+0.016
 thank+0.015
ier+0.015
Neg logit contributions
 Maybe-0.034
 desks-0.030
 generation-0.030
and-0.030
keeping-0.030
Token."
Token ID526
Feature activation+1.27e-02

Pos logit contributions
agging+0.072
 course+0.071
br+0.067
 thank+0.066
 soaring+0.066
Neg logit contributions
 Maybe-0.040
and-0.031
keeping-0.031
 desks-0.030
 generation-0.030
Token
Token ID198
Feature activation+1.55e-02

Pos logit contributions
 course+0.020
 thank+0.020
agging+0.019
 best+0.017
 soaring+0.017
Neg logit contributions
keeping-0.021
 generation-0.020
 desks-0.020
days-0.019
and-0.019
Token enjoyed
Token ID8359
Feature activation+2.67e-02

Pos logit contributions
agging+0.036
 course+0.033
br+0.032
ier+0.031
 winners+0.031
Neg logit contributions
 Maybe-0.029
 generation-0.027
keeping-0.027
 desks-0.025
and-0.025
Token their
Token ID511
Feature activation+1.84e-02

Pos logit contributions
agging+0.048
br+0.046
aid+0.046
 course+0.045
ier+0.045
Neg logit contributions
keeping-0.037
 generation-0.036
 Maybe-0.036
 desks-0.034
and-0.033
Token time
Token ID640
Feature activation+5.95e-03

Pos logit contributions
agging+0.030
 course+0.030
ier+0.028
br+0.028
aid+0.028
Neg logit contributions
 Maybe-0.028
 generation-0.026
keeping-0.026
 desks-0.025
and-0.025
Token on
Token ID319
Feature activation+1.01e-02

Pos logit contributions
 course+0.009
agging+0.009
 thank+0.008
aid+0.008
br+0.008
Neg logit contributions
 Maybe-0.010
and-0.009
 generation-0.009
keeping-0.009
 desks-0.009
Token the
Token ID262
Feature activation+5.17e-03

Pos logit contributions
 course+0.015
agging+0.015
 thank+0.014
br+0.014
 winners+0.013
Neg logit contributions
 Maybe-0.018
keeping-0.016
 generation-0.016
 desks-0.016
and-0.016
Token island
Token ID7022
Feature activation+1.59e-02

Pos logit contributions
 course+0.008
agging+0.008
 winners+0.007
aid+0.007
br+0.007
Neg logit contributions
 Maybe-0.009
 generation-0.008
 desks-0.008
keeping-0.008
grand-0.008
Token forever
Token ID8097
Feature activation+1.27e-02

Pos logit contributions
agging+0.031
 course+0.031
br+0.029
 thank+0.029
 winners+0.029
Neg logit contributions
 Maybe-0.020
keeping-0.018
 generation-0.018
and-0.018
 desks-0.018
Token.
Token ID13
Feature activation+1.97e-02

Pos logit contributions
 course+0.027
agging+0.027
br+0.026
aid+0.026
 winners+0.025
Neg logit contributions
 Maybe-0.014
keeping-0.013
 generation-0.012
and-0.011
 desks-0.011
Token<|endoftext|>
Token ID50256
Feature activation+1.60e-02

Pos logit contributions
 course+0.031
 thank+0.029
 best+0.026
agging+0.026
 winners+0.026
Neg logit contributions
 desks-0.034
keeping-0.032
 computers-0.031
 Maybe-0.031
 laptop-0.031
TokenOne
Token ID3198
Feature activation-2.28e-02

Pos logit contributions
 course+0.027
agging+0.025
br+0.024
 thank+0.024
 winners+0.023
Neg logit contributions
 Maybe-0.025
 desks-0.024
 generation-0.024
grand-0.023
keeping-0.023
Token day
Token ID1110
Feature activation+2.73e-02

Pos logit contributions
 Maybe+0.041
keeping+0.038
 generation+0.037
 desks+0.036
and+0.035
Neg logit contributions
 course-0.035
agging-0.034
 thank-0.032
 winners-0.031
 soaring-0.030
Tokenily
Token ID813
Feature activation+7.70e-03

Pos logit contributions
 Maybe+0.033
keeping+0.029
 generation+0.029
 desks+0.029
and+0.029
Neg logit contributions
 course-0.038
agging-0.038
 thank-0.036
 winners-0.035
br-0.035
Token smiled
Token ID13541
Feature activation+1.05e-02

Pos logit contributions
agging+0.012
 course+0.012
 winners+0.011
br+0.011
 thank+0.010
Neg logit contributions
 Maybe-0.014
 desks-0.012
 generation-0.012
keeping-0.012
and-0.012
Token and
Token ID290
Feature activation+3.82e-02

Pos logit contributions
 course+0.023
agging+0.022
br+0.021
 winners+0.021
 thank+0.021
Neg logit contributions
 Maybe-0.013
and-0.010
 generation-0.009
keeping-0.009
 desks-0.009
Token said
Token ID531
Feature activation+4.20e-02

Pos logit contributions
agging+0.080
 course+0.070
 winners+0.067
ier+0.066
ags+0.065
Neg logit contributions
 Maybe-0.058
 Grand-0.047
 Jamie-0.046
 They-0.045
 Sure-0.045
Token,
Token ID11
Feature activation+1.58e-02

Pos logit contributions
agging+0.076
 course+0.075
 eagle+0.070
ags+0.070
 hawk+0.069
Neg logit contributions
 Maybe-0.063
and-0.053
 Jamie-0.049
 They-0.047
 Grand-0.044
Token "
Token ID366
Feature activation+2.87e-03

Pos logit contributions
 course+0.034
agging+0.033
 winners+0.031
br+0.031
 luck+0.031
Neg logit contributions
 Maybe-0.019
and-0.015
 generation-0.015
keeping-0.014
 desks-0.014
TokenSure
Token ID19457
Feature activation+1.72e-02

Pos logit contributions
agging+0.005
 course+0.005
 thank+0.005
br+0.005
ier+0.005
Neg logit contributions
 Maybe-0.004
 generation-0.004
and-0.004
keeping-0.004
 desks-0.003
Token!
Token ID0
Feature activation-9.91e-03

Pos logit contributions
agging+0.028
 course+0.025
aid+0.024
oming+0.024
 winners+0.024
Neg logit contributions
 Maybe-0.030
 Jamie-0.025
 desks-0.025
keeping-0.025
 cigarettes-0.025
Token Let
Token ID3914
Feature activation+2.12e-02

Pos logit contributions
 Maybe+0.010
keeping+0.009
 desks+0.009
 generation+0.009
and+0.008
Neg logit contributions
 course-0.022
agging-0.021
 thank-0.021
 winners-0.020
br-0.020
Token's
Token ID338
Feature activation+8.81e-03

Pos logit contributions
agging+0.045
 course+0.045
 winners+0.042
 thank+0.042
br+0.042
Neg logit contributions
 Maybe-0.024
 generation-0.021
keeping-0.020
 desks-0.020
and-0.020
Token go
Token ID467
Feature activation+1.35e-02

Pos logit contributions
agging+0.018
ier+0.017
 course+0.017
br+0.016
aid+0.016
Neg logit contributions
 Maybe-0.012
 generation-0.011
keeping-0.010
 desks-0.010
 computers-0.009
Token he
Token ID339
Feature activation+5.43e-03

Pos logit contributions
 Maybe+0.014
 generation+0.013
keeping+0.013
 desks+0.013
and+0.013
Neg logit contributions
 course-0.013
 thank-0.012
agging-0.012
 best-0.011
 winners-0.011
Token loved
Token ID6151
Feature activation-2.37e-02

Pos logit contributions
 course+0.010
agging+0.010
 thank+0.009
br+0.009
 winners+0.009
Neg logit contributions
 Maybe-0.008
keeping-0.007
 generation-0.007
 desks-0.007
and-0.007
Token her
Token ID607
Feature activation-1.12e-02

Pos logit contributions
 Maybe+0.043
 generation+0.038
grand+0.037
keeping+0.037
packs+0.036
Neg logit contributions
 course-0.037
agging-0.034
 thank-0.034
 luck-0.032
 medal-0.031
Token too
Token ID1165
Feature activation-1.35e-02

Pos logit contributions
 Maybe+0.020
keeping+0.019
 generation+0.018
and+0.018
grand+0.018
Neg logit contributions
agging-0.017
 course-0.016
 thank-0.016
 best-0.016
 winners-0.016
Token.
Token ID13
Feature activation-8.26e-03

Pos logit contributions
keeping+0.022
 Maybe+0.022
 generation+0.021
grand+0.021
 desks+0.021
Neg logit contributions
agging-0.021
 course-0.021
 thank-0.021
 best-0.020
 better-0.020
Token He
Token ID679
Feature activation+2.18e-02

Pos logit contributions
keeping+0.014
 generation+0.014
days+0.014
 desks+0.013
grand+0.013
Neg logit contributions
 thank-0.013
 course-0.011
agging-0.011
 better-0.011
 best-0.010
Token said
Token ID531
Feature activation+1.63e-02

Pos logit contributions
agging+0.034
 course+0.032
 winners+0.030
br+0.030
 soaring+0.029
Neg logit contributions
 Maybe-0.040
 desks-0.034
 generation-0.034
grand-0.034
keeping-0.033
Token he
Token ID339
Feature activation-2.24e-03

Pos logit contributions
agging+0.036
 course+0.034
ig+0.032
ags+0.032
ises+0.032
Neg logit contributions
 Maybe-0.019
and-0.015
grand-0.015
 desks-0.015
keeping-0.014
Token would
Token ID561
Feature activation+1.21e-02

Pos logit contributions
 Maybe+0.003
keeping+0.003
 generation+0.003
 desks+0.003
and+0.003
Neg logit contributions
 course-0.004
agging-0.004
 thank-0.004
aid-0.003
 best-0.003
Token be
Token ID307
Feature activation+1.74e-02

Pos logit contributions
 course+0.019
agging+0.019
br+0.017
 winners+0.017
 thank+0.017
Neg logit contributions
 Maybe-0.021
 generation-0.019
keeping-0.018
 desks-0.018
and-0.018
Token more
Token ID517
Feature activation+9.46e-03

Pos logit contributions
br+0.028
 course+0.027
isy+0.027
is+0.026
agging+0.026
Neg logit contributions
 Maybe-0.027
 heaven-0.025
packs-0.023
and-0.023
 Grand-0.023
TokenI
Token ID40
Feature activation-6.86e-03

Pos logit contributions
 Maybe+0.024
keeping+0.024
 generation+0.023
 desks+0.023
and+0.022
Neg logit contributions
 course-0.028
agging-0.026
aid-0.025
br-0.025
 thank-0.025
Token wanted
Token ID2227
Feature activation-4.50e-03

Pos logit contributions
keeping+0.012
 desks+0.012
 generation+0.011
 Maybe+0.011
 computers+0.011
Neg logit contributions
 thank-0.011
 course-0.010
 better-0.010
br-0.010
 medal-0.009
Token to
Token ID284
Feature activation+1.08e-02

Pos logit contributions
 Maybe+0.006
keeping+0.005
 generation+0.005
 desks+0.005
and+0.005
Neg logit contributions
 course-0.009
agging-0.009
 thank-0.008
br-0.008
 winners-0.008
Token catch
Token ID4929
Feature activation+2.24e-02

Pos logit contributions
agging+0.017
 course+0.017
br+0.016
 winners+0.016
ier+0.015
Neg logit contributions
 Maybe-0.018
 generation-0.016
keeping-0.016
 desks-0.016
grand-0.016
Token the
Token ID262
Feature activation+1.55e-02

Pos logit contributions
agging+0.035
 course+0.033
 winners+0.031
 thank+0.031
br+0.031
Neg logit contributions
 Maybe-0.038
keeping-0.034
 desks-0.034
 generation-0.034
and-0.034
Token butterfly
Token ID35113
Feature activation+3.44e-02

Pos logit contributions
agging+0.022
 course+0.021
br+0.019
ier+0.019
 thank+0.019
Neg logit contributions
 Maybe-0.028
 desks-0.026
 generation-0.026
keeping-0.025
 computers-0.025
Token,
Token ID11
Feature activation-1.32e-02

Pos logit contributions
agging+0.058
 course+0.054
 winners+0.051
bled+0.050
aid+0.050
Neg logit contributions
 Maybe-0.057
and-0.049
 generation-0.049
keeping-0.048
grand-0.047
Token but
Token ID475
Feature activation-5.52e-03

Pos logit contributions
 Maybe+0.018
keeping+0.017
 generation+0.016
 desks+0.016
and+0.015
Neg logit contributions
 course-0.025
 thank-0.024
agging-0.024
 winners-0.023
br-0.023
Token it
Token ID340
Feature activation+4.45e-03

Pos logit contributions
 Maybe+0.007
 generation+0.006
 desks+0.006
keeping+0.005
grand+0.005
Neg logit contributions
agging-0.012
 course-0.011
br-0.011
 winners-0.010
aid-0.010
Token flew
Token ID13112
Feature activation+1.63e-02

Pos logit contributions
agging+0.008
 course+0.008
 thank+0.008
 winners+0.008
br+0.008
Neg logit contributions
 Maybe-0.006
 generation-0.005
keeping-0.005
 desks-0.005
and-0.005
Token away
Token ID1497
Feature activation-1.72e-04

Pos logit contributions
agging+0.019
 course+0.019
 winners+0.017
 thank+0.017
br+0.017
Neg logit contributions
 Maybe-0.034
keeping-0.031
 generation-0.031
and-0.030
 desks-0.030
Token,
Token ID11
Feature activation-2.06e-02

Pos logit contributions
 Maybe+0.023
keeping+0.020
 generation+0.020
 desks+0.020
grand+0.019
Neg logit contributions
agging-0.014
 course-0.013
 winners-0.012
br-0.012
aid-0.012
Token I
Token ID314
Feature activation-4.93e-03

Pos logit contributions
 Maybe+0.034
keeping+0.034
 generation+0.033
 desks+0.032
and+0.031
Neg logit contributions
 course-0.034
 thank-0.033
 winners-0.030
agging-0.029
br-0.029
Token'm
Token ID1101
Feature activation+1.24e-03

Pos logit contributions
 Maybe+0.008
keeping+0.008
 desks+0.008
 generation+0.008
 computers+0.008
Neg logit contributions
 course-0.007
 thank-0.007
 better-0.007
agging-0.007
br-0.007
Token tired
Token ID10032
Feature activation-8.99e-03

Pos logit contributions
 course+0.002
agging+0.002
br+0.002
 thank+0.002
 winners+0.002
Neg logit contributions
 Maybe-0.002
 generation-0.002
keeping-0.002
 desks-0.002
and-0.002
Token.
Token ID13
Feature activation-4.31e-02

Pos logit contributions
 Maybe+0.016
 generation+0.016
 desks+0.015
 computers+0.015
keeping+0.015
Neg logit contributions
 course-0.012
agging-0.012
 thank-0.012
ier-0.012
br-0.012
Token Can
Token ID1680
Feature activation-9.79e-03

Pos logit contributions
 desks+0.072
 computers+0.069
keeping+0.067
packs+0.066
 alley+0.065
Neg logit contributions
 thank-0.073
 course-0.061
 sorry-0.061
 tweet-0.058
 winners-0.057
Token we
Token ID356
Feature activation+3.09e-02

Pos logit contributions
keeping+0.014
 generation+0.013
 desks+0.013
 Maybe+0.013
and+0.013
Neg logit contributions
 course-0.018
 thank-0.017
agging-0.016
br-0.015
 winners-0.015
Token stop
Token ID2245
Feature activation+1.02e-02

Pos logit contributions
agging+0.061
 course+0.058
 winners+0.055
br+0.055
ier+0.055
Neg logit contributions
 Maybe-0.042
 generation-0.034
 tonight-0.034
days-0.033
grand-0.033
Token playing
Token ID2712
Feature activation+3.25e-02

Pos logit contributions
 course+0.017
agging+0.016
 thank+0.015
br+0.015
 winners+0.015
Neg logit contributions
 Maybe-0.017
keeping-0.015
 generation-0.015
 desks-0.015
and-0.015
Token?"
Token ID1701
Feature activation-1.20e-02

Pos logit contributions
agging+0.057
 course+0.055
br+0.053
 winners+0.052
aid+0.052
Neg logit contributions
 Maybe-0.049
 desks-0.042
 generation-0.041
and-0.041
 tonight-0.041
Token Lily
Token ID20037
Feature activation-2.37e-02

Pos logit contributions
keeping+0.019
and+0.018
 generation+0.017
 habitat+0.016
 Maybe+0.016
Neg logit contributions
 thank-0.020
 course-0.020
 better-0.019
br-0.018
 best-0.018
Tokenunes
Token ID4015
Feature activation-1.28e-04

Pos logit contributions
 Maybe+0.002
keeping+0.002
 desks+0.002
and+0.002
places+0.002
Neg logit contributions
 course-0.004
agging-0.004
 medal-0.004
 winners-0.004
 thank-0.004
Token.
Token ID13
Feature activation-1.47e-02

Pos logit contributions
 Maybe+0.000
keeping+0.000
and+0.000
 desks+0.000
 generation+0.000
Neg logit contributions
agging-0.000
 course-0.000
 thank-0.000
 winners-0.000
 soaring-0.000
Token They
Token ID1119
Feature activation+1.54e-02

Pos logit contributions
keeping+0.019
 generation+0.018
grand+0.018
and+0.017
days+0.017
Neg logit contributions
 thank-0.029
 nineteen-0.027
 course-0.026
br-0.025
 winners-0.025
Token do
Token ID466
Feature activation+2.92e-03

Pos logit contributions
agging+0.030
ier+0.027
 course+0.027
br+0.026
oming+0.026
Neg logit contributions
 Maybe-0.022
 desks-0.019
 sneak-0.018
 generation-0.018
 They-0.017
Token not
Token ID407
Feature activation-1.90e-02

Pos logit contributions
agging+0.006
 course+0.006
 winners+0.006
ier+0.006
br+0.006
Neg logit contributions
 Maybe-0.003
 desks-0.003
and-0.002
 generation-0.002
 computers-0.002
Token see
Token ID766
Feature activation-1.40e-02

Pos logit contributions
 Maybe+0.027
 generation+0.023
keeping+0.023
 desks+0.023
and+0.022
Neg logit contributions
 course-0.036
agging-0.036
br-0.033
 winners-0.033
 thank-0.033
Token Mom
Token ID11254
Feature activation-2.82e-02

Pos logit contributions
 Maybe+0.025
keeping+0.022
and+0.022
 generation+0.022
 desks+0.022
Neg logit contributions
 course-0.021
agging-0.020
 thank-0.020
 winners-0.019
br-0.019
Token and
Token ID290
Feature activation-3.01e-03

Pos logit contributions
 Maybe+0.040
keeping+0.035
 desks+0.035
and+0.034
 generation+0.034
Neg logit contributions
 course-0.052
agging-0.051
 winners-0.048
br-0.048
 thank-0.048
Token Dad
Token ID17415
Feature activation-3.04e-02

Pos logit contributions
 Maybe+0.003
 generation+0.002
keeping+0.002
and+0.002
grand+0.002
Neg logit contributions
 course-0.007
 thank-0.007
agging-0.007
 winners-0.007
 luck-0.006
Token.
Token ID13
Feature activation-2.62e-02

Pos logit contributions
 Maybe+0.042
keeping+0.037
 generation+0.036
 desks+0.036
and+0.036
Neg logit contributions
 course-0.056
agging-0.056
br-0.053
 winners-0.052
 thank-0.052
Token Mom
Token ID11254
Feature activation-1.59e-02

Pos logit contributions
keeping+0.042
 generation+0.039
and+0.037
grand+0.037
uries+0.035
Neg logit contributions
 thank-0.045
 best-0.041
 nodded-0.040
 nineteen-0.039
 better-0.038
Token it
Token ID340
Feature activation+5.04e-03

Pos logit contributions
agging+0.041
 course+0.039
br+0.037
 winners+0.037
 thank+0.036
Neg logit contributions
 Maybe-0.044
keeping-0.039
 generation-0.039
 desks-0.039
and-0.038
Token.
Token ID13
Feature activation-7.81e-03

Pos logit contributions
 course+0.009
agging+0.009
 thank+0.009
br+0.008
 winners+0.008
Neg logit contributions
 Maybe-0.007
 generation-0.006
keeping-0.006
 desks-0.006
and-0.006
Token Lily
Token ID20037
Feature activation-6.96e-03

Pos logit contributions
 generation+0.013
 desks+0.012
keeping+0.012
 computers+0.012
 laptop+0.012
Neg logit contributions
 course-0.013
 thank-0.011
aid-0.010
br-0.010
 better-0.010
Token tried
Token ID3088
Feature activation-4.08e-03

Pos logit contributions
 Maybe+0.010
keeping+0.009
 generation+0.009
 desks+0.009
and+0.009
Neg logit contributions
 course-0.013
agging-0.012
 thank-0.012
br-0.012
 winners-0.011
Token first
Token ID717
Feature activation+5.08e-03

Pos logit contributions
 Maybe+0.005
and+0.004
 desks+0.004
keeping+0.004
 generation+0.004
Neg logit contributions
agging-0.009
 course-0.009
br-0.008
 winners-0.008
 thank-0.008
Token,
Token ID11
Feature activation-2.53e-03

Pos logit contributions
agging+0.008
 course+0.008
br+0.007
aid+0.007
 winners+0.007
Neg logit contributions
 Maybe-0.009
and-0.008
 because-0.007
 generation-0.007
 desks-0.007
Token but
Token ID475
Feature activation-2.90e-03

Pos logit contributions
 Maybe+0.002
 generation+0.002
keeping+0.002
 desks+0.002
and+0.002
Neg logit contributions
agging-0.006
 course-0.006
br-0.006
 winners-0.006
 thank-0.005
Token she
Token ID673
Feature activation-1.46e-02

Pos logit contributions
 Maybe+0.004
 generation+0.003
 desks+0.003
keeping+0.003
and+0.003
Neg logit contributions
agging-0.006
 course-0.005
br-0.005
 winners-0.005
aid-0.005
Token could
Token ID714
Feature activation+2.27e-02

Pos logit contributions
 Maybe+0.021
keeping+0.019
 generation+0.019
 desks+0.019
and+0.019
Neg logit contributions
 course-0.026
agging-0.026
 thank-0.024
br-0.024
 winners-0.024
Token not
Token ID407
Feature activation-5.54e-03

Pos logit contributions
 course+0.048
agging+0.048
br+0.045
 winners+0.044
ier+0.043
Neg logit contributions
 Maybe-0.027
 generation-0.022
keeping-0.022
 desks-0.021
and-0.021
Token catch
Token ID4929
Feature activation+1.27e-02

Pos logit contributions
 Maybe+0.008
 generation+0.007
keeping+0.007
 desks+0.007
and+0.007
Neg logit contributions
 course-0.010
agging-0.010
 winners-0.009
br-0.009
 thank-0.009
Token the
Token ID262
Feature activation+2.45e-02

Pos logit contributions
agging+0.051
 course+0.050
 winners+0.047
ier+0.047
 thank+0.046
Neg logit contributions
 Maybe-0.045
 desks-0.040
 generation-0.039
 computers-0.038
keeping-0.038
Token big
Token ID1263
Feature activation+1.58e-02

Pos logit contributions
agging+0.036
 course+0.034
ier+0.034
 nineteen+0.033
br+0.032
Neg logit contributions
 Maybe-0.040
 desks-0.039
 computers-0.038
 laptop-0.038
 generation-0.038
Token one
Token ID530
Feature activation+9.95e-03

Pos logit contributions
agging+0.025
ier+0.023
 course+0.023
aid+0.022
 winners+0.022
Neg logit contributions
 Maybe-0.026
 desks-0.025
 computers-0.024
 generation-0.024
 laptop-0.024
Token.
Token ID13
Feature activation-1.35e-02

Pos logit contributions
agging+0.017
 course+0.016
br+0.015
 winners+0.015
 thank+0.015
Neg logit contributions
 Maybe-0.016
 generation-0.014
keeping-0.014
and-0.014
 desks-0.014
Token
Token ID198
Feature activation-6.00e-03

Pos logit contributions
keeping+0.020
 desks+0.020
 generation+0.020
 computers+0.019
places+0.018
Neg logit contributions
 thank-0.022
 course-0.022
br-0.020
aid-0.019
 winners-0.019
Token
Token ID198
Feature activation-5.16e-03

Pos logit contributions
keeping+0.010
 desks+0.010
 generation+0.010
 Maybe+0.010
places+0.009
Neg logit contributions
 course-0.009
agging-0.009
 thank-0.008
aid-0.008
br-0.008
TokenL
Token ID43
Feature activation-4.44e-02

Pos logit contributions
keeping+0.010
 Maybe+0.010
 desks+0.010
 generation+0.010
places+0.010
Neg logit contributions
 course-0.006
agging-0.006
br-0.005
aid-0.005
 thank-0.005
Tokenily
Token ID813
Feature activation-1.16e-02

Pos logit contributions
 Maybe+0.088
 generation+0.084
 desks+0.082
keeping+0.081
 computers+0.081
Neg logit contributions
 course-0.059
agging-0.058
ier-0.058
is-0.054
aid-0.052
Token feels
Token ID5300
Feature activation+1.30e-02

Pos logit contributions
 Maybe+0.018
keeping+0.018
 generation+0.018
and+0.017
grand+0.017
Neg logit contributions
 course-0.019
agging-0.019
 thank-0.018
br-0.017
 winners-0.017
Token sad
Token ID6507
Feature activation-1.31e-02

Pos logit contributions
 course+0.020
agging+0.020
br+0.018
ig+0.018
is+0.018
Neg logit contributions
 Maybe-0.023
 generation-0.020
 desks-0.020
and-0.019
keeping-0.019
Token and
Token ID290
Feature activation-1.05e-02

Pos logit contributions
 Maybe+0.018
 generation+0.017
keeping+0.017
 desks+0.016
grand+0.016
Neg logit contributions
 course-0.024
agging-0.024
 thank-0.023
br-0.022
 winners-0.022
Token she
Token ID673
Feature activation+8.64e-03

Pos logit contributions
 Maybe+0.017
 generation+0.016
keeping+0.016
grand+0.016
 desks+0.015
Neg logit contributions
 course-0.011
 thank-0.011
agging-0.010
 soaring-0.010
 luck-0.010
Token had
Token ID550
Feature activation+1.15e-02

Pos logit contributions
 course+0.015
agging+0.014
 thank+0.013
br+0.013
ier+0.013
Neg logit contributions
 Maybe-0.014
 desks-0.013
 generation-0.012
keeping-0.012
and-0.012
Token been
Token ID587
Feature activation+1.41e-02

Pos logit contributions
agging+0.020
 course+0.020
br+0.019
 thank+0.019
 winners+0.019
Neg logit contributions
 Maybe-0.017
keeping-0.015
 generation-0.015
and-0.015
 desks-0.015
Token brave
Token ID14802
Feature activation+2.22e-03

Pos logit contributions
 course+0.022
agging+0.020
 medal+0.019
 thank+0.019
br+0.019
Neg logit contributions
 Maybe-0.023
and-0.023
keeping-0.022
 alley-0.021
 upstairs-0.021
Token enough
Token ID1576
Feature activation+1.15e-02

Pos logit contributions
 course+0.003
agging+0.003
 thank+0.003
 winners+0.003
br+0.003
Neg logit contributions
 Maybe-0.004
 generation-0.004
keeping-0.004
 desks-0.004
grand-0.004
Token to
Token ID284
Feature activation-1.21e-03

Pos logit contributions
 course+0.024
agging+0.023
 thank+0.022
 winners+0.022
br+0.022
Neg logit contributions
 Maybe-0.014
 generation-0.012
 desks-0.012
keeping-0.012
and-0.012
Token reverse
Token ID9575
Feature activation-2.81e-03

Pos logit contributions
 Maybe+0.002
 generation+0.001
keeping+0.001
 desks+0.001
 alley+0.001
Neg logit contributions
agging-0.002
 course-0.002
 winners-0.002
br-0.002
ier-0.002
Token the
Token ID262
Feature activation-2.52e-02

Pos logit contributions
 Maybe+0.005
 generation+0.005
grand+0.004
keeping+0.004
and+0.004
Neg logit contributions
 course-0.004
agging-0.004
 thank-0.004
br-0.004
 winners-0.004
Token lock
Token ID5793
Feature activation-1.15e-03

Pos logit contributions
 Maybe+0.043
and+0.039
keeping+0.038
 generation+0.037
grand+0.037
Neg logit contributions
 course-0.041
agging-0.039
 thank-0.038
 best-0.037
 winners-0.037
Token on
Token ID319
Feature activation-2.95e-02

Pos logit contributions
 Maybe+0.002
keeping+0.002
 generation+0.002
 desks+0.002
grand+0.002
Neg logit contributions
 course-0.002
agging-0.002
 thank-0.002
br-0.002
ier-0.002
Token the
Token ID262
Feature activation-3.00e-02

Pos logit contributions
 Maybe+0.060
grand+0.055
and+0.054
keeping+0.054
places+0.053
Neg logit contributions
 course-0.039
 thank-0.034
 best-0.033
agging-0.032
 luck-0.032
Token and
Token ID290
Feature activation+1.51e-02

Pos logit contributions
 Maybe+0.036
 generation+0.035
keeping+0.035
 desks+0.032
grand+0.032
Neg logit contributions
 course-0.032
agging-0.030
br-0.030
 best-0.029
 thank-0.029
Token Sue
Token ID26089
Feature activation-1.04e-02

Pos logit contributions
agging+0.034
 course+0.033
 winners+0.032
br+0.032
ier+0.031
Neg logit contributions
 Maybe-0.016
keeping-0.013
and-0.013
 desks-0.012
 generation-0.012
Token liked
Token ID8288
Feature activation-3.33e-03

Pos logit contributions
 Maybe+0.016
keeping+0.015
 generation+0.015
 desks+0.014
and+0.014
Neg logit contributions
 course-0.018
agging-0.017
 thank-0.017
br-0.016
 winners-0.016
Token to
Token ID284
Feature activation+1.19e-03

Pos logit contributions
 Maybe+0.005
 generation+0.005
keeping+0.005
grand+0.004
and+0.004
Neg logit contributions
 course-0.006
agging-0.006
 thank-0.006
 soaring-0.005
br-0.005
Token play
Token ID711
Feature activation+1.31e-02

Pos logit contributions
 course+0.002
agging+0.002
 thank+0.002
br+0.002
 winners+0.002
Neg logit contributions
 Maybe-0.002
keeping-0.002
 generation-0.002
 desks-0.002
and-0.002
Token in
Token ID287
Feature activation-4.07e-02

Pos logit contributions
 course+0.021
agging+0.021
 thank+0.020
 winners+0.020
aid+0.020
Neg logit contributions
 Maybe-0.021
and-0.018
 desks-0.018
 generation-0.018
 computers-0.018
Token the
Token ID262
Feature activation-2.16e-02

Pos logit contributions
 Maybe+0.102
keeping+0.100
grand+0.100
 generation+0.098
and+0.096
Neg logit contributions
 course-0.036
 thank-0.026
 best-0.026
 soaring-0.026
agging-0.025
Token woods
Token ID16479
Feature activation-3.15e-02

Pos logit contributions
 Maybe+0.034
 desks+0.031
 generation+0.029
 computers+0.029
and+0.029
Neg logit contributions
 course-0.036
agging-0.036
 winners-0.034
 thank-0.034
aid-0.033
Token.
Token ID13
Feature activation+4.12e-03

Pos logit contributions
 Maybe+0.056
grand+0.054
keeping+0.052
 generation+0.051
 desks+0.050
Neg logit contributions
 course-0.048
agging-0.044
 best-0.043
 winners-0.042
 soaring-0.041
Token They
Token ID1119
Feature activation-9.16e-03

Pos logit contributions
 course+0.006
 thank+0.006
 nineteen+0.006
agging+0.006
br+0.006
Neg logit contributions
 generation-0.007
keeping-0.007
grand-0.006
 desks-0.006
places-0.006
Token liked
Token ID8288
Feature activation-8.80e-03

Pos logit contributions
 Maybe+0.015
keeping+0.014
 generation+0.014
and+0.013
grand+0.013
Neg logit contributions
 course-0.015
agging-0.014
 thank-0.014
br-0.013
 winners-0.013
Token and
Token ID290
Feature activation+1.53e-02

Pos logit contributions
 Maybe+0.017
keeping+0.015
 generation+0.015
 desks+0.015
and+0.014
Neg logit contributions
 course-0.019
agging-0.018
 thank-0.017
br-0.017
 winners-0.017
Token his
Token ID465
Feature activation-1.81e-02

Pos logit contributions
 course+0.021
 thank+0.020
agging+0.019
 luck+0.018
 medal+0.018
Neg logit contributions
 Maybe-0.029
and-0.027
keeping-0.026
 generation-0.026
 desks-0.026
Token friends
Token ID2460
Feature activation+5.09e-04

Pos logit contributions
 Maybe+0.038
keeping+0.035
 generation+0.035
and+0.034
 desks+0.034
Neg logit contributions
 course-0.021
agging-0.021
 thank-0.019
 winners-0.019
br-0.018
Token went
Token ID1816
Feature activation-9.82e-03

Pos logit contributions
agging+0.001
 course+0.001
br+0.001
 winners+0.001
 soaring+0.001
Neg logit contributions
 Maybe-0.001
 generation-0.001
keeping-0.001
 desks-0.001
grand-0.001
Token to
Token ID284
Feature activation-2.39e-02

Pos logit contributions
 Maybe+0.014
keeping+0.012
 generation+0.012
 desks+0.012
and+0.012
Neg logit contributions
 course-0.018
agging-0.018
 thank-0.017
br-0.017
 winners-0.017
Token the
Token ID262
Feature activation-5.29e-02

Pos logit contributions
 Maybe+0.044
keeping+0.040
and+0.040
grand+0.039
places+0.038
Neg logit contributions
 course-0.035
 thank-0.033
agging-0.032
 soaring-0.031
 winners-0.031
Token tall
Token ID7331
Feature activation-4.82e-02

Pos logit contributions
keeping+0.089
 Maybe+0.082
packs+0.077
and+0.076
os+0.076
Neg logit contributions
 best-0.088
 course-0.086
 better-0.082
 more-0.079
 thank-0.078
Token closet
Token ID21615
Feature activation-2.42e-02

Pos logit contributions
 Maybe+0.068
keeping+0.064
and+0.061
 generation+0.059
places+0.057
Neg logit contributions
 course-0.096
 best-0.081
 thank-0.081
is-0.081
 more-0.080
Token room
Token ID2119
Feature activation-1.86e-02

Pos logit contributions
 Maybe+0.042
 generation+0.038
grand+0.038
keeping+0.038
 desks+0.037
Neg logit contributions
 course-0.038
agging-0.035
 thank-0.033
 winners-0.033
 best-0.033
Token every
Token ID790
Feature activation-1.70e-02

Pos logit contributions
 Maybe+0.033
grand+0.029
 generation+0.029
keeping+0.029
 desks+0.028
Neg logit contributions
 course-0.029
agging-0.027
 thank-0.026
 winners-0.026
 best-0.026
Token day
Token ID1110
Feature activation-2.03e-02

Pos logit contributions
 Maybe+0.035
keeping+0.032
grand+0.031
places+0.031
 desks+0.031
Neg logit contributions
 course-0.023
 best-0.020
 thank-0.020
agging-0.019
 better-0.019
Token it
Token ID340
Feature activation-4.63e-03

Pos logit contributions
 Maybe+0.024
keeping+0.022
and+0.021
 generation+0.021
 desks+0.021
Neg logit contributions
 course-0.019
 thank-0.018
 best-0.017
 soaring-0.016
 medal-0.016
Token was
Token ID373
Feature activation-6.55e-03

Pos logit contributions
 Maybe+0.007
keeping+0.006
and+0.006
 generation+0.006
 desks+0.006
Neg logit contributions
 course-0.008
agging-0.008
 thank-0.008
br-0.007
 best-0.007
Token a
Token ID257
Feature activation-1.15e-03

Pos logit contributions
 Maybe+0.010
 generation+0.009
keeping+0.009
 computers+0.009
 desks+0.009
Neg logit contributions
agging-0.011
 course-0.011
br-0.010
aid-0.010
ig-0.010
Token passage
Token ID10066
Feature activation-1.79e-02

Pos logit contributions
 Maybe+0.002
 generation+0.002
keeping+0.002
 desks+0.002
and+0.002
Neg logit contributions
agging-0.002
 course-0.002
 thank-0.001
br-0.001
 winners-0.001
Token to
Token ID284
Feature activation-7.97e-03

Pos logit contributions
 Maybe+0.032
keeping+0.029
 generation+0.029
 desks+0.028
and+0.028
Neg logit contributions
 course-0.028
agging-0.025
 thank-0.025
 soaring-0.024
br-0.024
Token somewhere
Token ID7382
Feature activation-4.05e-02

Pos logit contributions
 Maybe+0.012
 generation+0.011
keeping+0.011
 desks+0.011
and+0.011
Neg logit contributions
agging-0.014
 course-0.014
 thank-0.012
br-0.012
 winners-0.012
Token new
Token ID649
Feature activation-7.18e-03

Pos logit contributions
 Maybe+0.070
keeping+0.064
 generation+0.063
 desks+0.062
and+0.060
Neg logit contributions
 course-0.062
agging-0.061
 thank-0.058
br-0.057
 soaring-0.056
Token and
Token ID290
Feature activation-1.69e-02

Pos logit contributions
 Maybe+0.012
grand+0.011
 desks+0.011
 generation+0.011
and+0.011
Neg logit contributions
 course-0.011
 luck-0.010
agging-0.010
 thank-0.010
 best-0.010
Token exciting
Token ID7895
Feature activation-5.88e-03

Pos logit contributions
 Maybe+0.030
 generation+0.027
keeping+0.027
 desks+0.027
and+0.026
Neg logit contributions
agging-0.026
 course-0.025
br-0.023
 thank-0.022
 winners-0.022
Token.
Token ID13
Feature activation+2.58e-03

Pos logit contributions
 Maybe+0.009
 generation+0.008
keeping+0.008
 desks+0.008
and+0.008
Neg logit contributions
 course-0.010
agging-0.010
 thank-0.010
 winners-0.010
br-0.010
Token Harry
Token ID5850
Feature activation-1.02e-02

Pos logit contributions
 course+0.004
 thank+0.003
 winners+0.003
 nut+0.003
 soaring+0.003
Neg logit contributions
 desks-0.005
 computers-0.004
 laptop-0.004
keeping-0.004
packs-0.004
Token together
Token ID1978
Feature activation-9.53e-03

Pos logit contributions
 Maybe+0.015
packs+0.014
 desks+0.013
keeping+0.013
and+0.013
Neg logit contributions
 course-0.013
agging-0.012
 thank-0.011
br-0.011
 winners-0.011
Token and
Token ID290
Feature activation+4.27e-03

Pos logit contributions
 Maybe+0.013
keeping+0.011
 generation+0.011
 desks+0.011
and+0.011
Neg logit contributions
agging-0.018
 course-0.018
 thank-0.017
br-0.017
 winners-0.017
Token stuck
Token ID7819
Feature activation-1.82e-02

Pos logit contributions
agging+0.007
 course+0.007
br+0.006
 winners+0.006
 thank+0.006
Neg logit contributions
 Maybe-0.007
keeping-0.006
 generation-0.006
 desks-0.006
and-0.006
Token them
Token ID606
Feature activation-4.53e-02

Pos logit contributions
 Maybe+0.031
keeping+0.027
 generation+0.027
 desks+0.027
and+0.027
Neg logit contributions
 course-0.029
agging-0.029
 thank-0.026
br-0.026
 winners-0.026
Token over
Token ID625
Feature activation-4.18e-02

Pos logit contributions
 Maybe+0.082
 generation+0.074
keeping+0.074
 desks+0.073
grand+0.072
Neg logit contributions
 course-0.065
agging-0.065
 thank-0.060
br-0.059
 winners-0.059
Token the
Token ID262
Feature activation-4.15e-02

Pos logit contributions
 Maybe+0.078
grand+0.072
keeping+0.070
 generation+0.070
and+0.069
Neg logit contributions
 course-0.061
agging-0.055
 thank-0.053
 winners-0.052
 luck-0.051
Token crack
Token ID8469
Feature activation-2.78e-02

Pos logit contributions
 Maybe+0.050
keeping+0.043
 generation+0.042
and+0.041
grand+0.040
Neg logit contributions
 course-0.085
agging-0.084
 thank-0.081
br-0.079
 winners-0.079
Token.
Token ID13
Feature activation-1.11e-02

Pos logit contributions
 Maybe+0.042
 generation+0.037
keeping+0.037
 desks+0.036
grand+0.036
Neg logit contributions
 course-0.049
agging-0.048
br-0.046
 thank-0.045
 winners-0.044
Token
Token ID220
Feature activation-3.28e-02

Pos logit contributions
keeping+0.016
grand+0.016
packs+0.015
 desks+0.015
 laptop+0.015
Neg logit contributions
 course-0.018
 thank-0.017
br-0.016
 nineteen-0.015
 winners-0.015
Token
Token ID198
Feature activation-1.72e-02

Pos logit contributions
 generation+0.060
 desks+0.059
keeping+0.059
 alley+0.059
 hallway+0.056
Neg logit contributions
 course-0.039
agging-0.039
 best-0.038
 thank-0.036
 better-0.036
Token
Token ID198
Feature activation-4.02e-03

Pos logit contributions
keeping+0.028
 Maybe+0.028
 desks+0.028
 generation+0.028
 computers+0.026
Neg logit contributions
 course-0.026
agging-0.024
 thank-0.023
br-0.023
is-0.023
Token and
Token ID290
Feature activation-2.78e-02

Pos logit contributions
 Maybe+0.065
grand+0.061
keeping+0.061
 generation+0.060
and+0.058
Neg logit contributions
 course-0.048
agging-0.043
 thank-0.040
 best-0.039
ier-0.039
Token the
Token ID262
Feature activation-1.47e-02

Pos logit contributions
 Maybe+0.043
keeping+0.038
 generation+0.038
 desks+0.038
and+0.037
Neg logit contributions
agging-0.048
 course-0.048
 thank-0.044
 winners-0.044
br-0.044
Token sauce
Token ID10746
Feature activation-1.52e-02

Pos logit contributions
 Maybe+0.023
 generation+0.022
keeping+0.021
grand+0.021
and+0.021
Neg logit contributions
 course-0.024
agging-0.024
 best-0.023
 thank-0.023
 gu-0.023
Token spilled
Token ID31896
Feature activation-2.03e-02

Pos logit contributions
 Maybe+0.024
 generation+0.022
keeping+0.021
grand+0.021
 desks+0.021
Neg logit contributions
 course-0.027
agging-0.025
 thank-0.024
br-0.024
ier-0.024
Token on
Token ID319
Feature activation-3.05e-02

Pos logit contributions
 Maybe+0.039
 generation+0.035
keeping+0.035
 desks+0.035
grand+0.034
Neg logit contributions
 course-0.028
agging-0.027
 thank-0.026
br-0.025
 winners-0.024
Token the
Token ID262
Feature activation-4.41e-02

Pos logit contributions
 Maybe+0.063
keeping+0.058
and+0.058
grand+0.058
 generation+0.057
Neg logit contributions
 course-0.039
agging-0.034
 thank-0.033
br-0.032
 winners-0.030
Token floor
Token ID4314
Feature activation-4.41e-02

Pos logit contributions
 Maybe+0.054
 generation+0.048
packs+0.045
keeping+0.044
and+0.043
Neg logit contributions
agging-0.090
 course-0.089
 thank-0.086
 best-0.084
 butter-0.084
Token.
Token ID13
Feature activation-9.72e-03

Pos logit contributions
 Maybe+0.076
 generation+0.073
grand+0.072
keeping+0.068
 desks+0.066
Neg logit contributions
 course-0.068
ier-0.066
agging-0.065
br-0.061
 best-0.061
Token Lily
Token ID20037
Feature activation-2.38e-02

Pos logit contributions
keeping+0.013
 generation+0.013
packs+0.013
 Maybe+0.012
grand+0.012
Neg logit contributions
 course-0.017
 thank-0.017
agging-0.017
br-0.016
 best-0.016
Token felt
Token ID2936
Feature activation+1.34e-03

Pos logit contributions
 Maybe+0.038
 generation+0.037
keeping+0.036
grand+0.034
and+0.034
Neg logit contributions
 course-0.039
agging-0.037
 thank-0.035
 best-0.034
 better-0.034
Token bad
Token ID2089
Feature activation-7.66e-03

Pos logit contributions
agging+0.002
 course+0.002
br+0.002
 winners+0.002
ig+0.002
Neg logit contributions
 Maybe-0.002
and-0.002
 desks-0.002
 generation-0.002
keeping-0.002