TOP ACTIVATIONS
MAX = 0.113

stayed away from the all igator ," Mia said .
never walked near the all igator again . <|endoftext|> Ben and
't walk near the all igator . He is not your
be careful . The all igator might bite us if we
and Ben was the h ider . Anna counted to ten
and Ben is the h ider . Anna counts to ten
and Ben is the h ider . Ben runs behind a
." The all igator thought about it for a
. They left the all igator pond and went to see
He is my toy all igator . He is coming to
his hand . The all igator opened its mouth and snapped
was lucky . The all igator did not catch him .
and Ben was the h ider . Lily counted to ten
. The all igator was persistent though , and
I am the quiet all igator . Why are you sad
and Tom was the h ider . Lily counted to ten
" Look , an all igator !" Tim said . He
and apples to the all igator , too . The all
, too . The all igator did not bite them .
Suddenly , the all igator snapped its jaws and bit

MIN ACTIVATIONS
MIN = -0.092

she had with her or nam ents . <|endoftext|> Once upon
the rest of the or nam ents onto the tree .
time playing with her or nam ents and making her special
take one of the or nam ents outside for a walk
she could carry the or nam ents back with her .
she could carry the or nam ents to the place .
and lots of shiny or nam ents . L ila wanted
a letter about the or nam ents and asked for one
the sky and the or nam ents gl ittered in the
All of these lovely or nam ents reminded him of the
at all the different or nam ents and adm iring the
the tree with regular or nam ents !" <|endoftext|> Once upon
box full of beautiful or nam ents .
here and explore the or nam ents ." Da
the rest of the or nam ents onto the tree .
to help sort the or nam ents . She picked up
beautiful garden with the or nam ents . The
with lots of pretty or nam ents . One day ,
park filled with colourful or nam ents . Jimmy looked out
He quickly hung colorful or nam ents and tw ink ling

INTERVAL 0.062 - 0.113
CONTAINS 0.013%

much fun . The all igator and the bird were very
joy . <|endoftext|> The all igator was fair and happy .
I did . I sne aked into the attic one night
. They play hide and seek and tag with it .
to rise ! Owen be amed with excitement . His mom

INTERVAL 0.011 - 0.062
CONTAINS 16.484%

shouted . " Help ! Help ! My friend
able to help . The bee took the butterfly
good friend ," Mia said . They hugged each
were lots of small fishes swimming around his hands ! He
The van was old and sometimes it did not work .

INTERVAL -0.041 - 0.011
CONTAINS 82.855%

" Look , look !" Lily said . " A shooting
they were in a warm place , and they were happy
visited her bunny friends in the house in the tree .
L ila . L ila loved to go on walks in
can find a treasure or a fairy !" Ben

INTERVAL -0.092 - -0.041
CONTAINS 0.648%

his eyes and he told all his friends about the incredible
Tim my ." L ily , Ben , and
teacher is Miss Lily . Miss Lily is very nice .
was the wealthiest girl in all the land and the people
too hard for me ." Brown ie looked
Token stayed
Token ID9658
Feature activation+5.65e-02

Pos logit contributions
Ell+0.012
Da+0.010
 owl+0.009
Lu+0.009
aron+0.009
Neg logit contributions
angles-0.011
 wins-0.011
osh-0.011
upid-0.010
 scores-0.010
Token away
Token ID1497
Feature activation+2.98e-02

Pos logit contributions
Ell+0.105
Da+0.095
Mic+0.083
Bug+0.083
isy+0.083
Neg logit contributions
 wins-0.094
angles-0.089
 scores-0.086
osh-0.084
 recess-0.083
Token from
Token ID422
Feature activation+2.14e-03

Pos logit contributions
Ell+0.052
Da+0.045
age+0.041
Bug+0.041
Mic+0.041
Neg logit contributions
 wins-0.052
angles-0.047
upid-0.046
 scores-0.046
osh-0.046
Token the
Token ID262
Feature activation+6.39e-03

Pos logit contributions
Ell+0.003
 owl+0.003
Da+0.003
 Bug+0.003
 Hope+0.003
Neg logit contributions
angles-0.003
osh-0.003
 wins-0.003
upid-0.003
ret-0.003
Token all
Token ID477
Feature activation-3.58e-02

Pos logit contributions
Ell+0.011
Da+0.010
 owl+0.009
Bug+0.009
age+0.009
Neg logit contributions
angles-0.011
osh-0.011
 wins-0.011
upid-0.010
 scores-0.010
Tokenigator
Token ID23823
Feature activation+9.63e-02

Pos logit contributions
 wins+0.067
osh+0.066
angles+0.066
upid+0.062
 scores+0.062
Neg logit contributions
Ell-0.057
Da-0.050
aron-0.044
Bug-0.043
Mill-0.043
Token,"
Token ID553
Feature activation+8.65e-03

Pos logit contributions
Ell+0.186
Da+0.168
Mill+0.151
Bug+0.147
Mic+0.145
Neg logit contributions
 wins-0.163
osh-0.144
 scores-0.143
angles-0.142
upid-0.131
Token Mia
Token ID32189
Feature activation+9.52e-03

Pos logit contributions
Ell+0.019
Da+0.017
 owl+0.016
Mill+0.016
aron+0.016
Neg logit contributions
angles-0.011
 wins-0.011
osh-0.010
upid-0.010
 scores-0.009
Token said
Token ID531
Feature activation-3.68e-03

Pos logit contributions
Ell+0.021
Da+0.019
Mill+0.018
Bug+0.018
Mic+0.017
Neg logit contributions
 wins-0.011
osh-0.011
angles-0.011
 scores-0.010
 agree-0.010
Token.
Token ID13
Feature activation+9.62e-03

Pos logit contributions
 wins+0.005
osh+0.005
angles+0.005
 scores+0.005
ret+0.005
Neg logit contributions
Ell-0.007
Da-0.006
Bug-0.006
Mill-0.006
Lu-0.006
Token
Token ID198
Feature activation+7.39e-04

Pos logit contributions
Ell+0.021
isy+0.020
coon+0.019
eh+0.019
ivering+0.019
Neg logit contributions
 wins-0.010
osh-0.009
angles-0.009
yles-0.009
upid-0.008
Token never
Token ID1239
Feature activation+2.50e-02

Pos logit contributions
angles+0.008
 wins+0.008
osh+0.008
upid+0.007
 scores+0.007
Neg logit contributions
Ell-0.010
Da-0.009
 owl-0.008
aron-0.008
age-0.008
Token walked
Token ID6807
Feature activation+4.20e-03

Pos logit contributions
Ell+0.046
Da+0.040
 owl+0.038
age+0.037
Bug+0.036
Neg logit contributions
angles-0.041
 wins-0.038
osh-0.038
upid-0.037
yles-0.037
Token near
Token ID1474
Feature activation+6.29e-03

Pos logit contributions
Ell+0.007
Da+0.006
Mill+0.006
aron+0.006
isy+0.006
Neg logit contributions
 wins-0.007
angles-0.007
 scores-0.007
osh-0.007
 recess-0.007
Token the
Token ID262
Feature activation+7.80e-03

Pos logit contributions
Ell+0.014
Da+0.013
 owl+0.012
Bug+0.012
Mill+0.012
Neg logit contributions
angles-0.007
 wins-0.007
osh-0.007
upid-0.007
 scores-0.007
Token all
Token ID477
Feature activation-3.75e-02

Pos logit contributions
Ell+0.016
Da+0.014
 owl+0.013
aron+0.013
Mill+0.013
Neg logit contributions
angles-0.011
 wins-0.011
osh-0.011
 scores-0.010
upid-0.010
Tokenigator
Token ID23823
Feature activation+9.44e-02

Pos logit contributions
angles+0.071
 wins+0.071
osh+0.070
 scores+0.066
upid+0.066
Neg logit contributions
Ell-0.057
Da-0.050
 owl-0.044
aron-0.043
Mill-0.043
Token again
Token ID757
Feature activation-1.75e-02

Pos logit contributions
Ell+0.178
Da+0.160
 owl+0.145
aron+0.143
Mill+0.142
Neg logit contributions
angles-0.144
 wins-0.143
osh-0.141
upid-0.132
 scores-0.132
Token.
Token ID13
Feature activation+9.79e-03

Pos logit contributions
 wins+0.033
angles+0.032
osh+0.032
 scores+0.030
upid+0.030
Neg logit contributions
Ell-0.027
Da-0.024
 owl-0.021
aron-0.021
Mill-0.021
Token<|endoftext|>
Token ID50256
Feature activation-1.53e-02

Pos logit contributions
Ell+0.015
Da+0.013
aron+0.012
 owl+0.012
Mill+0.011
Neg logit contributions
angles-0.019
osh-0.018
 wins-0.018
 scores-0.017
upid-0.017
TokenBen
Token ID11696
Feature activation-2.95e-02

Pos logit contributions
 wins+0.022
osh+0.021
angles+0.020
 scores+0.020
upid+0.019
Neg logit contributions
Ell-0.031
Da-0.028
 owl-0.026
aron-0.026
isy-0.025
Token and
Token ID290
Feature activation-4.45e-04

Pos logit contributions
 wins+0.052
osh+0.052
angles+0.049
 scores+0.047
 turns+0.045
Neg logit contributions
Ell-0.052
Da-0.045
Mill-0.041
 masse-0.041
isy-0.041
Token't
Token ID470
Feature activation+3.64e-02

Pos logit contributions
 wins+0.020
osh+0.020
angles+0.020
upid+0.019
 scores+0.018
Neg logit contributions
Ell-0.025
Da-0.024
aron-0.022
edly-0.021
Mill-0.021
Token walk
Token ID2513
Feature activation-6.07e-03

Pos logit contributions
Ell+0.083
Da+0.077
aron+0.070
Mill+0.069
Lu+0.068
Neg logit contributions
 wins-0.043
osh-0.041
angles-0.041
 scores-0.039
upid-0.038
Token near
Token ID1474
Feature activation+6.53e-03

Pos logit contributions
 wins+0.011
angles+0.010
osh+0.010
 scores+0.010
upid+0.010
Neg logit contributions
Ell-0.010
Da-0.009
Mill-0.008
aron-0.008
Mic-0.008
Token the
Token ID262
Feature activation+7.16e-03

Pos logit contributions
Ell+0.015
Da+0.013
aron+0.012
age+0.012
Just+0.012
Neg logit contributions
 wins-0.008
angles-0.008
osh-0.008
upid-0.008
 scores-0.007
Token all
Token ID477
Feature activation-3.79e-02

Pos logit contributions
Ell+0.014
Da+0.012
 owl+0.011
aron+0.011
Mill+0.011
Neg logit contributions
 wins-0.011
angles-0.011
osh-0.011
 scores-0.010
upid-0.010
Tokenigator
Token ID23823
Feature activation+9.36e-02

Pos logit contributions
angles+0.068
 wins+0.067
osh+0.067
 scores+0.063
upid+0.063
Neg logit contributions
Ell-0.062
Da-0.054
 owl-0.049
aron-0.047
Mill-0.047
Token.
Token ID13
Feature activation+7.44e-03

Pos logit contributions
Ell+0.158
Da+0.139
Mill+0.122
aron+0.120
Lu+0.118
Neg logit contributions
 wins-0.173
osh-0.166
angles-0.165
 scores-0.158
upid-0.155
Token He
Token ID679
Feature activation-4.63e-03

Pos logit contributions
Ell+0.015
Da+0.013
isy+0.012
eh+0.012
Mill+0.012
Neg logit contributions
 wins-0.012
osh-0.011
upid-0.010
angles-0.010
 scores-0.010
Token is
Token ID318
Feature activation-3.07e-03

Pos logit contributions
 wins+0.006
osh+0.005
angles+0.005
 scores+0.005
upid+0.005
Neg logit contributions
Ell-0.011
Da-0.010
Mill-0.009
aron-0.009
Lu-0.009
Token not
Token ID407
Feature activation+1.69e-02

Pos logit contributions
 wins+0.005
osh+0.004
angles+0.004
 scores+0.004
upid+0.004
Neg logit contributions
Ell-0.006
Da-0.005
Bug-0.005
Mill-0.005
age-0.005
Token your
Token ID534
Feature activation+8.78e-03

Pos logit contributions
Ell+0.032
Da+0.028
Bug+0.025
Mill+0.025
Just+0.025
Neg logit contributions
 wins-0.027
osh-0.026
angles-0.025
 scores-0.025
upid-0.024
Token be
Token ID307
Feature activation+1.55e-02

Pos logit contributions
 wins+0.003
angles+0.003
osh+0.003
upid+0.003
 scores+0.003
Neg logit contributions
Ell-0.005
Da-0.005
 owl-0.004
Mill-0.004
aron-0.004
Token careful
Token ID8161
Feature activation+1.35e-02

Pos logit contributions
Ell+0.030
Da+0.026
Bug+0.023
Mill+0.023
 owl+0.023
Neg logit contributions
 wins-0.024
angles-0.024
osh-0.023
 scores-0.022
upid-0.022
Token.
Token ID13
Feature activation+3.18e-03

Pos logit contributions
Ell+0.025
Da+0.023
Mic+0.020
Charl+0.019
Bug+0.019
Neg logit contributions
 wins-0.024
angles-0.023
osh-0.022
upid-0.021
 scores-0.021
Token The
Token ID383
Feature activation+2.49e-03

Pos logit contributions
Ell+0.006
Da+0.005
 owl+0.004
Mill+0.004
isy+0.004
Neg logit contributions
 wins-0.005
osh-0.005
angles-0.005
upid-0.005
 scores-0.005
Token all
Token ID477
Feature activation-3.76e-02

Pos logit contributions
Ell+0.005
Da+0.005
aron+0.004
age+0.004
Mill+0.004
Neg logit contributions
 wins-0.003
angles-0.003
osh-0.003
 scores-0.003
upid-0.003
Tokenigator
Token ID23823
Feature activation+9.30e-02

Pos logit contributions
angles+0.081
 wins+0.081
osh+0.080
upid+0.077
 scores+0.077
Neg logit contributions
Ell-0.047
Da-0.040
 owl-0.034
aron-0.033
Bug-0.032
Token might
Token ID1244
Feature activation+4.88e-03

Pos logit contributions
Ell+0.207
Da+0.189
Mill+0.172
Lu+0.168
aron+0.167
Neg logit contributions
 wins-0.123
osh-0.119
angles-0.117
 scores-0.110
upid-0.104
Token bite
Token ID13197
Feature activation+9.29e-03

Pos logit contributions
Ell+0.010
Da+0.009
Mill+0.008
aron+0.008
 owl+0.008
Neg logit contributions
 wins-0.007
angles-0.006
osh-0.006
upid-0.006
 scores-0.006
Token us
Token ID514
Feature activation+2.34e-02

Pos logit contributions
Ell+0.016
Da+0.014
 owl+0.013
aron+0.013
Mill+0.013
Neg logit contributions
 wins-0.016
angles-0.016
osh-0.015
 scores-0.014
upid-0.014
Token if
Token ID611
Feature activation+1.87e-02

Pos logit contributions
Ell+0.034
Da+0.029
 owl+0.026
aron+0.025
age+0.025
Neg logit contributions
angles-0.046
 wins-0.046
osh-0.046
 scores-0.043
upid-0.043
Token we
Token ID356
Feature activation-6.03e-03

Pos logit contributions
Ell+0.035
Da+0.032
 owl+0.029
Bug+0.028
Lu+0.028
Neg logit contributions
angles-0.028
 wins-0.028
osh-0.027
upid-0.026
 scores-0.026
Token and
Token ID290
Feature activation+4.84e-03

Pos logit contributions
 wins+0.006
osh+0.006
angles+0.006
 scores+0.006
 recess+0.006
Neg logit contributions
Ell-0.005
Da-0.005
 masse-0.004
Bug-0.004
Mic-0.004
Token Ben
Token ID3932
Feature activation-2.50e-02

Pos logit contributions
Ell+0.010
Da+0.009
 owl+0.008
Lu+0.008
aron+0.008
Neg logit contributions
angles-0.007
 wins-0.007
osh-0.007
upid-0.006
 scores-0.006
Token was
Token ID373
Feature activation+1.49e-03

Pos logit contributions
 wins+0.038
osh+0.037
angles+0.035
 scores+0.034
upid+0.031
Neg logit contributions
Ell-0.049
Da-0.044
Mill-0.041
Mic-0.040
Bug-0.040
Token the
Token ID262
Feature activation+2.67e-03

Pos logit contributions
Ell+0.003
Da+0.003
 owl+0.002
aron+0.002
Mill+0.002
Neg logit contributions
angles-0.002
 wins-0.002
osh-0.002
upid-0.002
 scores-0.002
Token h
Token ID289
Feature activation+1.32e-02

Pos logit contributions
Ell+0.005
Da+0.004
aron+0.004
Mill+0.004
Bug+0.004
Neg logit contributions
 wins-0.004
angles-0.004
osh-0.004
 scores-0.004
upid-0.004
Tokenider
Token ID1304
Feature activation+9.29e-02

Pos logit contributions
Ell+0.014
Da+0.012
 owl+0.009
Bug+0.009
Mill+0.009
Neg logit contributions
angles-0.031
 wins-0.031
osh-0.031
upid-0.030
 scores-0.030
Token.
Token ID13
Feature activation+5.09e-03

Pos logit contributions
Ell+0.160
Da+0.139
Mic+0.125
Bug+0.125
Mill+0.124
Neg logit contributions
 wins-0.167
osh-0.161
angles-0.159
 scores-0.152
upid-0.151
Token Anna
Token ID11735
Feature activation+3.09e-03

Pos logit contributions
Ell+0.009
Da+0.008
 owl+0.008
aron+0.008
Mill+0.007
Neg logit contributions
 wins-0.008
angles-0.008
osh-0.008
 scores-0.007
upid-0.007
Token counted
Token ID14789
Feature activation+1.28e-02

Pos logit contributions
Ell+0.006
Da+0.005
 owl+0.005
Mill+0.005
aron+0.005
Neg logit contributions
 wins-0.005
angles-0.005
osh-0.005
 scores-0.005
upid-0.004
Token to
Token ID284
Feature activation-3.83e-03

Pos logit contributions
Ell+0.017
Da+0.015
 owl+0.015
Mill+0.013
Lu+0.013
Neg logit contributions
 wins-0.024
osh-0.024
angles-0.024
upid-0.024
 scores-0.023
Token ten
Token ID3478
Feature activation-2.77e-03

Pos logit contributions
angles+0.005
osh+0.005
upid+0.005
 wins+0.005
ret+0.005
Neg logit contributions
Ell-0.008
Da-0.007
 owl-0.007
Lu-0.007
 Hope-0.007
Token and
Token ID290
Feature activation+5.21e-03

Pos logit contributions
 wins+0.004
angles+0.004
osh+0.003
 scores+0.003
 recess+0.003
Neg logit contributions
Ell-0.003
Da-0.003
Bug-0.003
Mic-0.003
 masse-0.003
Token Ben
Token ID3932
Feature activation-2.18e-02

Pos logit contributions
Ell+0.010
Da+0.009
Lu+0.008
 owl+0.008
 Michelle+0.008
Neg logit contributions
angles-0.008
 wins-0.008
upid-0.007
osh-0.007
 scores-0.007
Token is
Token ID318
Feature activation-5.01e-03

Pos logit contributions
 wins+0.031
angles+0.027
osh+0.027
 scores+0.027
 turns+0.024
Neg logit contributions
Ell-0.048
Da-0.044
Mill-0.041
Mic-0.040
Bug-0.039
Token the
Token ID262
Feature activation+2.38e-03

Pos logit contributions
angles+0.008
 wins+0.008
osh+0.008
upid+0.008
 scores+0.008
Neg logit contributions
Ell-0.009
Da-0.008
 owl-0.007
aron-0.007
Lu-0.007
Token h
Token ID289
Feature activation+1.62e-02

Pos logit contributions
Ell+0.004
Da+0.004
 owl+0.003
aron+0.003
Mill+0.003
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
upid-0.004
 scores-0.004
Tokenider
Token ID1304
Feature activation+9.29e-02

Pos logit contributions
Ell+0.016
Da+0.013
 owl+0.010
Mill+0.010
Bug+0.010
Neg logit contributions
angles-0.040
 wins-0.040
osh-0.040
upid-0.038
 scores-0.038
Token.
Token ID13
Feature activation+7.21e-03

Pos logit contributions
Ell+0.161
Da+0.140
Bug+0.126
Mill+0.125
Mic+0.124
Neg logit contributions
 wins-0.166
osh-0.160
angles-0.159
 scores-0.152
upid-0.151
Token Anna
Token ID11735
Feature activation+4.99e-03

Pos logit contributions
Ell+0.014
Da+0.013
Lu+0.011
aron+0.011
Just+0.011
Neg logit contributions
angles-0.011
osh-0.011
 wins-0.011
upid-0.010
 scores-0.010
Token counts
Token ID9853
Feature activation+1.57e-02

Pos logit contributions
Ell+0.010
Da+0.009
 owl+0.009
aron+0.008
age+0.008
Neg logit contributions
angles-0.007
 wins-0.007
osh-0.007
upid-0.006
 scores-0.006
Token to
Token ID284
Feature activation-3.37e-03

Pos logit contributions
Ell+0.022
Da+0.020
 owl+0.018
Lu+0.017
aron+0.017
Neg logit contributions
angles-0.029
 wins-0.029
osh-0.029
upid-0.028
 scores-0.028
Token ten
Token ID3478
Feature activation-6.48e-04

Pos logit contributions
angles+0.005
osh+0.005
upid+0.005
ours+0.005
 wins+0.005
Neg logit contributions
Ell-0.007
 owl-0.006
Da-0.006
Lu-0.006
 Hope-0.005
Token and
Token ID290
Feature activation+1.05e-02

Pos logit contributions
Ell+0.001
Da+0.001
Bug+0.001
Mill+0.001
Mic+0.001
Neg logit contributions
 wins-0.001
angles-0.001
osh-0.001
 scores-0.001
upid-0.001
Token Ben
Token ID3932
Feature activation-2.22e-02

Pos logit contributions
Ell+0.018
Da+0.016
 owl+0.015
Lu+0.014
age+0.014
Neg logit contributions
angles-0.018
 wins-0.018
osh-0.017
upid-0.016
 scores-0.016
Token is
Token ID318
Feature activation-2.06e-03

Pos logit contributions
 wins+0.034
osh+0.033
angles+0.033
 scores+0.031
upid+0.030
Neg logit contributions
Ell-0.043
Da-0.039
 owl-0.035
Mill-0.035
Bug-0.035
Token the
Token ID262
Feature activation+4.99e-03

Pos logit contributions
 wins+0.004
angles+0.004
osh+0.004
 scores+0.003
upid+0.003
Neg logit contributions
Ell-0.003
Da-0.003
 owl-0.003
aron-0.003
Mill-0.003
Token h
Token ID289
Feature activation+1.51e-02

Pos logit contributions
Ell+0.009
Da+0.008
aron+0.007
Mill+0.007
Bug+0.007
Neg logit contributions
 wins-0.008
angles-0.008
osh-0.008
 scores-0.007
upid-0.007
Tokenider
Token ID1304
Feature activation+9.28e-02

Pos logit contributions
Ell+0.017
Da+0.014
 owl+0.011
Bug+0.011
Mill+0.011
Neg logit contributions
angles-0.035
 wins-0.035
osh-0.035
upid-0.034
 scores-0.033
Token.
Token ID13
Feature activation+6.58e-03

Pos logit contributions
Ell+0.152
Da+0.133
 owl+0.117
Bug+0.117
Mill+0.116
Neg logit contributions
 wins-0.171
osh-0.167
angles-0.166
 scores-0.158
upid-0.157
Token Ben
Token ID3932
Feature activation-2.15e-02

Pos logit contributions
Ell+0.011
Da+0.010
 owl+0.009
aron+0.009
Mill+0.009
Neg logit contributions
angles-0.011
 wins-0.011
osh-0.011
 scores-0.011
upid-0.011
Token runs
Token ID4539
Feature activation+6.68e-03

Pos logit contributions
angles+0.029
 wins+0.028
osh+0.028
upid+0.026
 scores+0.026
Neg logit contributions
Ell-0.044
Da-0.040
 owl-0.037
aron-0.037
age-0.036
Token behind
Token ID2157
Feature activation+1.22e-02

Pos logit contributions
Ell+0.011
Da+0.010
 owl+0.009
aron+0.009
Mill+0.009
Neg logit contributions
 wins-0.012
angles-0.012
osh-0.011
 scores-0.011
upid-0.011
Token a
Token ID257
Feature activation-2.13e-03

Pos logit contributions
Ell+0.023
Da+0.022
 owl+0.020
Lu+0.019
 Bug+0.019
Neg logit contributions
angles-0.018
 wins-0.018
osh-0.017
upid-0.016
yles-0.016
Token."
Token ID526
Feature activation+5.30e-03

Pos logit contributions
 wins+0.015
angles+0.014
osh+0.013
 scores+0.013
upid+0.013
Neg logit contributions
Ell-0.018
Da-0.017
aron-0.015
Mic-0.015
Lu-0.015
Token
Token ID198
Feature activation-3.11e-03

Pos logit contributions
Ell+0.010
Da+0.009
isy+0.009
age+0.009
aron+0.008
Neg logit contributions
 wins-0.008
osh-0.007
angles-0.007
 scores-0.007
upid-0.007
Token
Token ID198
Feature activation-1.35e-03

Pos logit contributions
osh+0.005
 wins+0.005
angles+0.005
upid+0.004
 scores+0.004
Neg logit contributions
Ell-0.005
Da-0.005
 owl-0.004
aron-0.004
age-0.004
TokenThe
Token ID464
Feature activation-3.59e-03

Pos logit contributions
osh+0.002
 wins+0.002
angles+0.002
 scores+0.002
upid+0.002
Neg logit contributions
Ell-0.002
 owl-0.002
aron-0.002
isy-0.002
Da-0.002
Token all
Token ID477
Feature activation-3.47e-02

Pos logit contributions
angles+0.007
 wins+0.007
osh+0.006
 scores+0.006
upid+0.006
Neg logit contributions
Ell-0.006
Da-0.005
 owl-0.004
aron-0.004
Mill-0.004
Tokenigator
Token ID23823
Feature activation+9.24e-02

Pos logit contributions
angles+0.077
 wins+0.076
osh+0.075
 scores+0.072
upid+0.072
Neg logit contributions
Ell-0.042
Da-0.036
 owl-0.030
aron-0.029
age-0.029
Token thought
Token ID1807
Feature activation+2.79e-02

Pos logit contributions
Ell+0.181
Da+0.163
Mill+0.149
Lu+0.145
aron+0.144
Neg logit contributions
 wins-0.141
osh-0.140
angles-0.135
 scores-0.129
upid-0.121
Token about
Token ID546
Feature activation-2.41e-02

Pos logit contributions
Ell+0.047
Da+0.041
 owl+0.038
aron+0.037
Lu+0.036
Neg logit contributions
angles-0.048
 wins-0.048
osh-0.047
 scores-0.045
upid-0.045
Token it
Token ID340
Feature activation-9.86e-03

Pos logit contributions
angles+0.028
osh+0.027
 wins+0.027
upid+0.025
 scores+0.024
Neg logit contributions
Ell-0.055
Da-0.050
 owl-0.047
Mill-0.046
Lu-0.046
Token for
Token ID329
Feature activation+1.64e-02

Pos logit contributions
 wins+0.018
angles+0.018
osh+0.018
 scores+0.017
upid+0.017
Neg logit contributions
Ell-0.016
Da-0.014
Mill-0.012
Bug-0.012
aron-0.012
Token a
Token ID257
Feature activation-6.90e-03

Pos logit contributions
Ell+0.027
Da+0.024
 owl+0.022
Mill+0.021
Lu+0.021
Neg logit contributions
angles-0.030
 wins-0.030
osh-0.029
upid-0.027
 scores-0.027
Token.
Token ID13
Feature activation+9.39e-03

Pos logit contributions
angles+0.008
osh+0.008
 wins+0.008
upid+0.008
 scores+0.008
Neg logit contributions
Ell-0.005
Da-0.004
 owl-0.004
aron-0.004
age-0.004
Token They
Token ID1119
Feature activation-4.21e-03

Pos logit contributions
Ell+0.019
Da+0.017
 owl+0.016
isy+0.015
Mill+0.015
Neg logit contributions
 wins-0.014
angles-0.013
osh-0.013
 scores-0.012
upid-0.012
Token left
Token ID1364
Feature activation+3.24e-03

Pos logit contributions
angles+0.007
 wins+0.007
osh+0.007
upid+0.006
 scores+0.006
Neg logit contributions
Ell-0.008
Da-0.007
 owl-0.006
aron-0.006
Bug-0.006
Token the
Token ID262
Feature activation+7.66e-03

Pos logit contributions
Ell+0.007
Da+0.006
 owl+0.006
aron+0.006
Lu+0.006
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
upid-0.004
 scores-0.004
Token all
Token ID477
Feature activation-3.75e-02

Pos logit contributions
Ell+0.014
Da+0.012
aron+0.011
Mill+0.011
age+0.011
Neg logit contributions
 wins-0.013
angles-0.013
osh-0.012
 scores-0.012
upid-0.012
Tokenigator
Token ID23823
Feature activation+9.21e-02

Pos logit contributions
 wins+0.073
angles+0.072
osh+0.072
 scores+0.068
upid+0.068
Neg logit contributions
Ell-0.056
Da-0.049
aron-0.043
 owl-0.043
Mill-0.042
Token pond
Token ID16723
Feature activation+2.06e-02

Pos logit contributions
Ell+0.140
Da+0.122
Mill+0.109
aron+0.105
Mic+0.104
Neg logit contributions
 wins-0.191
osh-0.180
 scores-0.177
angles-0.173
 surf-0.165
Token and
Token ID290
Feature activation+1.74e-03

Pos logit contributions
Ell+0.030
Da+0.025
 masse+0.022
Mill+0.022
Mic+0.022
Neg logit contributions
 wins-0.043
osh-0.042
angles-0.041
 scores-0.040
upid-0.038
Token went
Token ID1816
Feature activation-6.74e-03

Pos logit contributions
Ell+0.003
Da+0.003
 owl+0.002
age+0.002
aron+0.002
Neg logit contributions
angles-0.003
upid-0.003
 wins-0.003
osh-0.003
ret-0.003
Token to
Token ID284
Feature activation-3.62e-03

Pos logit contributions
 wins+0.009
angles+0.009
osh+0.009
 scores+0.009
 recess+0.009
Neg logit contributions
Ell-0.014
Da-0.012
aron-0.011
 owl-0.011
Bug-0.011
Token see
Token ID766
Feature activation-6.29e-03

Pos logit contributions
angles+0.006
osh+0.006
 wins+0.006
ret+0.005
yles+0.005
Neg logit contributions
Ell-0.006
 owl-0.006
 Bug-0.006
Da-0.006
 Hope-0.005
Token He
Token ID679
Feature activation-1.19e-02

Pos logit contributions
Ell+0.002
Da+0.001
 owl+0.001
aron+0.001
Mill+0.001
Neg logit contributions
angles-0.002
osh-0.001
 wins-0.001
 scores-0.001
upid-0.001
Token is
Token ID318
Feature activation-4.43e-03

Pos logit contributions
angles+0.020
osh+0.018
 wins+0.018
upid+0.018
 recess+0.018
Neg logit contributions
Ell-0.020
Da-0.017
 owl-0.017
aron-0.016
age-0.016
Token my
Token ID616
Feature activation+1.86e-02

Pos logit contributions
angles+0.008
 wins+0.008
osh+0.008
 scores+0.007
upid+0.007
Neg logit contributions
Ell-0.007
Da-0.007
 owl-0.006
aron-0.006
Mill-0.006
Token toy
Token ID13373
Feature activation+6.71e-03

Pos logit contributions
Ell+0.032
Da+0.028
 owl+0.028
aron+0.026
 masse+0.026
Neg logit contributions
angles-0.030
osh-0.030
 wins-0.029
upid-0.028
 scores-0.027
Token all
Token ID477
Feature activation-3.75e-02

Pos logit contributions
Ell+0.014
Da+0.014
Mill+0.012
Lu+0.012
aron+0.012
Neg logit contributions
 wins-0.010
 scores-0.009
angles-0.009
osh-0.008
 surf-0.008
Tokenigator
Token ID23823
Feature activation+9.20e-02

Pos logit contributions
angles+0.066
 wins+0.065
osh+0.064
upid+0.061
 scores+0.060
Neg logit contributions
Ell-0.062
Da-0.055
 owl-0.049
aron-0.049
Mill-0.048
Token.
Token ID13
Feature activation+1.97e-03

Pos logit contributions
Ell+0.171
Da+0.155
Mill+0.135
Bug+0.135
Mic+0.132
Neg logit contributions
 wins-0.168
osh-0.154
 scores-0.152
angles-0.147
 surf-0.139
Token He
Token ID679
Feature activation-1.20e-02

Pos logit contributions
Ell+0.003
Da+0.003
isy+0.003
 owl+0.003
Mill+0.003
Neg logit contributions
 wins-0.003
osh-0.003
angles-0.003
upid-0.003
 scores-0.003
Token is
Token ID318
Feature activation-5.80e-03

Pos logit contributions
 wins+0.017
osh+0.017
angles+0.017
 scores+0.016
upid+0.015
Neg logit contributions
Ell-0.025
Da-0.023
Mill-0.020
 owl-0.020
Lu-0.020
Token coming
Token ID2406
Feature activation+1.22e-02

Pos logit contributions
 wins+0.010
angles+0.009
osh+0.009
 scores+0.009
upid+0.008
Neg logit contributions
Ell-0.011
Da-0.010
age-0.009
Bug-0.009
Mill-0.009
Token to
Token ID284
Feature activation-7.47e-03

Pos logit contributions
Ell+0.018
Da+0.016
 owl+0.013
aron+0.013
Bug+0.013
Neg logit contributions
 wins-0.024
angles-0.024
osh-0.024
upid-0.022
 scores-0.022
Token his
Token ID465
Feature activation+4.54e-03

Pos logit contributions
 wins+0.001
angles+0.001
osh+0.001
upid+0.001
 scores+0.001
Neg logit contributions
Ell-0.001
Da-0.001
 owl-0.001
aron-0.001
Bug-0.001
Token hand
Token ID1021
Feature activation+9.94e-03

Pos logit contributions
Ell+0.010
Da+0.009
aron+0.008
 owl+0.008
Mill+0.008
Neg logit contributions
 wins-0.006
angles-0.006
osh-0.006
 scores-0.005
upid-0.005
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
Ell+0.018
Da+0.016
Mill+0.014
Mic+0.014
Bug+0.014
Neg logit contributions
 wins-0.017
osh-0.017
angles-0.016
upid-0.015
 scores-0.015
Token The
Token ID383
Feature activation+4.07e-03

Pos logit contributions
Ell+0.018
Da+0.016
isy+0.014
 owl+0.014
Mill+0.014
Neg logit contributions
 wins-0.018
osh-0.018
angles-0.017
upid-0.016
 scores-0.016
Token all
Token ID477
Feature activation-3.74e-02

Pos logit contributions
Ell+0.009
Da+0.008
aron+0.008
age+0.008
Mill+0.008
Neg logit contributions
 wins-0.005
angles-0.005
osh-0.005
 scores-0.004
upid-0.004
Tokenigator
Token ID23823
Feature activation+9.19e-02

Pos logit contributions
 wins+0.079
osh+0.079
angles+0.079
upid+0.074
 scores+0.074
Neg logit contributions
Ell-0.050
Da-0.042
 owl-0.036
aron-0.036
Mill-0.036
Token opened
Token ID4721
Feature activation+1.36e-02

Pos logit contributions
Ell+0.172
Da+0.153
Mill+0.138
aron+0.137
 owl+0.137
Neg logit contributions
 wins-0.145
osh-0.144
angles-0.143
 scores-0.134
upid-0.131
Token its
Token ID663
Feature activation-2.40e-03

Pos logit contributions
Ell+0.025
Da+0.022
 owl+0.020
aron+0.020
Mill+0.020
Neg logit contributions
 wins-0.022
angles-0.022
osh-0.021
upid-0.020
 scores-0.020
Token mouth
Token ID5422
Feature activation-2.04e-02

Pos logit contributions
 wins+0.002
angles+0.002
upid+0.002
osh+0.002
 scores+0.002
Neg logit contributions
Ell-0.006
Da-0.006
Lu-0.005
Charl-0.005
Mill-0.005
Token and
Token ID290
Feature activation+4.09e-03

Pos logit contributions
 wins+0.039
osh+0.039
angles+0.038
upid+0.036
 scores+0.035
Neg logit contributions
Ell-0.033
Da-0.027
Mic-0.025
Mill-0.025
Bug-0.024
Token snapped
Token ID20821
Feature activation+5.49e-03

Pos logit contributions
Ell+0.008
Da+0.007
 owl+0.006
aron+0.006
Mill+0.006
Neg logit contributions
 wins-0.006
angles-0.006
osh-0.006
 scores-0.006
upid-0.006
Token was
Token ID373
Feature activation+3.55e-03

Pos logit contributions
angles+0.006
 wins+0.006
osh+0.005
upid+0.005
 scores+0.005
Neg logit contributions
Ell-0.006
Da-0.006
 owl-0.005
aron-0.005
Bug-0.005
Token lucky
Token ID9670
Feature activation+5.30e-03

Pos logit contributions
Ell+0.007
Da+0.006
Mill+0.005
Bug+0.005
Lu+0.005
Neg logit contributions
 wins-0.005
osh-0.005
angles-0.005
 scores-0.005
upid-0.005
Token.
Token ID13
Feature activation+9.89e-03

Pos logit contributions
Ell+0.009
Da+0.008
Bug+0.007
Mill+0.007
age+0.007
Neg logit contributions
 wins-0.010
angles-0.009
osh-0.009
 scores-0.009
upid-0.009
Token The
Token ID383
Feature activation+5.41e-03

Pos logit contributions
Ell+0.019
Da+0.017
isy+0.017
 owl+0.016
 masse+0.016
Neg logit contributions
 wins-0.014
osh-0.014
upid-0.013
angles-0.013
 scores-0.012
Token all
Token ID477
Feature activation-3.71e-02

Pos logit contributions
Ell+0.013
Da+0.012
aron+0.011
age+0.011
Mill+0.011
Neg logit contributions
 wins-0.005
angles-0.005
 scores-0.005
osh-0.005
upid-0.005
Tokenigator
Token ID23823
Feature activation+9.18e-02

Pos logit contributions
osh+0.073
 wins+0.072
angles+0.069
 scores+0.066
upid+0.065
Neg logit contributions
Ell-0.057
Da-0.049
aron-0.044
Just-0.044
Mill-0.044
Token did
Token ID750
Feature activation-1.16e-02

Pos logit contributions
Ell+0.180
Da+0.161
Mill+0.148
aron+0.144
Charl+0.143
Neg logit contributions
osh-0.140
 wins-0.140
angles-0.134
 scores-0.129
upid-0.121
Token not
Token ID407
Feature activation+1.68e-02

Pos logit contributions
angles+0.026
osh+0.026
 wins+0.025
yles+0.024
upid+0.024
Neg logit contributions
Ell-0.013
Da-0.011
 owl-0.010
Just-0.010
age-0.009
Token catch
Token ID4929
Feature activation-3.70e-03

Pos logit contributions
Ell+0.033
Da+0.029
 owl+0.028
age+0.027
aron+0.027
Neg logit contributions
angles-0.026
 wins-0.025
osh-0.024
yles-0.023
upid-0.023
Token him
Token ID683
Feature activation+5.75e-03

Pos logit contributions
angles+0.005
 wins+0.004
osh+0.004
upid+0.004
 scores+0.004
Neg logit contributions
Ell-0.008
Da-0.007
 owl-0.007
Bug-0.007
aron-0.007
Token.
Token ID13
Feature activation+9.12e-03

Pos logit contributions
Ell+0.009
Da+0.008
 owl+0.007
aron+0.007
age+0.007
Neg logit contributions
angles-0.011
 wins-0.011
osh-0.011
upid-0.010
 scores-0.010
Token and
Token ID290
Feature activation+6.88e-03

Pos logit contributions
 wins+0.012
angles+0.012
osh+0.012
 scores+0.011
upid+0.011
Neg logit contributions
Ell-0.010
Da-0.009
Bug-0.008
 masse-0.008
Mill-0.008
Token Ben
Token ID3932
Feature activation-2.53e-02

Pos logit contributions
Ell+0.013
Da+0.011
 owl+0.010
aron+0.010
Mill+0.010
Neg logit contributions
angles-0.011
 wins-0.011
osh-0.011
upid-0.010
 scores-0.010
Token was
Token ID373
Feature activation+8.47e-04

Pos logit contributions
 wins+0.040
osh+0.039
angles+0.038
 scores+0.036
upid+0.035
Neg logit contributions
Ell-0.048
Da-0.043
Mill-0.039
 owl-0.039
Bug-0.038
Token the
Token ID262
Feature activation+1.98e-03

Pos logit contributions
Ell+0.001
Da+0.001
 owl+0.001
aron+0.001
Mill+0.001
Neg logit contributions
angles-0.001
 wins-0.001
osh-0.001
upid-0.001
 scores-0.001
Token h
Token ID289
Feature activation+1.15e-02

Pos logit contributions
Ell+0.004
Da+0.003
 owl+0.003
aron+0.003
Mill+0.003
Neg logit contributions
angles-0.003
 wins-0.003
osh-0.003
upid-0.003
 scores-0.003
Tokenider
Token ID1304
Feature activation+9.14e-02

Pos logit contributions
Ell+0.011
Da+0.009
 owl+0.007
Bug+0.007
Mill+0.007
Neg logit contributions
angles-0.028
 wins-0.028
osh-0.028
upid-0.027
 scores-0.027
Token.
Token ID13
Feature activation+4.63e-03

Pos logit contributions
Ell+0.151
Da+0.131
Bug+0.116
 owl+0.116
Mill+0.116
Neg logit contributions
 wins-0.168
osh-0.163
angles-0.163
 scores-0.154
upid-0.154
Token Lily
Token ID20037
Feature activation-6.24e-03

Pos logit contributions
Ell+0.009
Da+0.008
 owl+0.007
aron+0.007
age+0.007
Neg logit contributions
 wins-0.007
angles-0.007
osh-0.007
 scores-0.007
upid-0.007
Token counted
Token ID14789
Feature activation+1.26e-02

Pos logit contributions
angles+0.010
 wins+0.010
osh+0.010
upid+0.010
 scores+0.010
Neg logit contributions
Ell-0.011
Da-0.010
 owl-0.009
aron-0.009
age-0.009
Token to
Token ID284
Feature activation-3.44e-03

Pos logit contributions
Ell+0.017
Da+0.015
 owl+0.015
Mill+0.014
Lu+0.013
Neg logit contributions
 wins-0.023
osh-0.023
upid-0.023
angles-0.023
 scores-0.022
Token ten
Token ID3478
Feature activation-3.20e-03

Pos logit contributions
angles+0.004
osh+0.004
upid+0.004
 wins+0.004
ret+0.004
Neg logit contributions
Ell-0.007
Da-0.007
 owl-0.007
Lu-0.006
Mill-0.006
Token.
Token ID13
Feature activation+9.94e-03

Pos logit contributions
angles+0.029
 wins+0.026
yles+0.025
 scores+0.025
osh+0.025
Neg logit contributions
Ell-0.036
Da-0.033
age-0.032
aron-0.031
 owl-0.030
Token
Token ID198
Feature activation+2.13e-03

Pos logit contributions
Ell+0.017
Da+0.016
isy+0.014
 owl+0.014
aron+0.014
Neg logit contributions
 wins-0.017
osh-0.016
angles-0.015
upid-0.015
 scores-0.015
Token
Token ID198
Feature activation+1.10e-03

Pos logit contributions
Ell+0.003
Da+0.003
 owl+0.003
aron+0.003
Mill+0.003
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
 scores-0.004
upid-0.004
TokenThe
Token ID464
Feature activation-2.25e-04

Pos logit contributions
Ell+0.002
Da+0.001
 owl+0.001
aron+0.001
age+0.001
Neg logit contributions
 wins-0.002
osh-0.002
angles-0.002
upid-0.002
 scores-0.002
Token all
Token ID477
Feature activation-4.09e-02

Pos logit contributions
angles+0.000
 wins+0.000
osh+0.000
 scores+0.000
upid+0.000
Neg logit contributions
Ell-0.000
Da-0.000
 owl-0.000
aron-0.000
Mill-0.000
Tokenigator
Token ID23823
Feature activation+9.14e-02

Pos logit contributions
angles+0.086
 wins+0.084
osh+0.082
upid+0.081
 scores+0.080
Neg logit contributions
Ell-0.054
Da-0.047
 owl-0.041
Lu-0.039
aron-0.039
Token was
Token ID373
Feature activation+5.47e-04

Pos logit contributions
Ell+0.174
Da+0.157
Mill+0.140
aron+0.140
 owl+0.139
Neg logit contributions
 wins-0.140
osh-0.139
angles-0.138
 scores-0.129
upid-0.126
Token persistent
Token ID16218
Feature activation+1.15e-02

Pos logit contributions
Ell+0.001
Da+0.001
Bug+0.001
Lu+0.001
Mic+0.001
Neg logit contributions
 wins-0.001
osh-0.001
angles-0.001
upid-0.001
 scores-0.001
Token though
Token ID996
Feature activation+1.74e-02

Pos logit contributions
Ell+0.020
Da+0.017
Bug+0.015
Mill+0.015
Mic+0.015
Neg logit contributions
 wins-0.021
osh-0.020
angles-0.020
 scores-0.020
upid-0.018
Token,
Token ID11
Feature activation+1.53e-03

Pos logit contributions
Ell+0.033
Da+0.030
age+0.027
aron+0.027
Bug+0.027
Neg logit contributions
 wins-0.027
osh-0.026
angles-0.025
 scores-0.024
upid-0.023
Token and
Token ID290
Feature activation-9.47e-04

Pos logit contributions
Ell+0.003
Da+0.003
 owl+0.003
aron+0.003
Mill+0.003
Neg logit contributions
angles-0.002
 wins-0.002
osh-0.002
upid-0.002
 scores-0.002
Token I
Token ID314
Feature activation-2.41e-03

Pos logit contributions
Ell+0.009
Da+0.008
aron+0.007
age+0.007
Just+0.007
Neg logit contributions
 wins-0.008
osh-0.008
angles-0.008
upid-0.007
 scores-0.007
Token am
Token ID716
Feature activation+5.48e-04

Pos logit contributions
angles+0.004
 wins+0.003
osh+0.003
 scores+0.003
upid+0.003
Neg logit contributions
Ell-0.005
Da-0.004
age-0.004
 owl-0.004
aron-0.004
Token the
Token ID262
Feature activation+4.26e-03

Pos logit contributions
Ell+0.001
Da+0.001
 owl+0.001
aron+0.001
Mill+0.001
Neg logit contributions
angles-0.001
 wins-0.001
osh-0.001
upid-0.001
 scores-0.001
Token quiet
Token ID5897
Feature activation+4.80e-03

Pos logit contributions
Ell+0.007
Da+0.007
aron+0.006
edly+0.006
Just+0.006
Neg logit contributions
 wins-0.007
angles-0.007
osh-0.007
 scores-0.007
 recess-0.006
Token all
Token ID477
Feature activation-3.88e-02

Pos logit contributions
Ell+0.009
Da+0.008
Just+0.007
Mill+0.007
Mic+0.007
Neg logit contributions
 wins-0.008
osh-0.007
 scores-0.007
angles-0.007
 surf-0.007
Tokenigator
Token ID23823
Feature activation+9.12e-02

Pos logit contributions
 wins+0.077
osh+0.075
angles+0.072
 scores+0.070
upid+0.070
Neg logit contributions
Ell-0.057
Da-0.051
aron-0.045
Just-0.045
Mill-0.044
Token.
Token ID13
Feature activation-1.49e-03

Pos logit contributions
Ell+0.177
Da+0.162
Mill+0.147
Charl+0.145
Mic+0.143
Neg logit contributions
 wins-0.157
osh-0.147
 scores-0.140
angles-0.131
 surf-0.122
Token Why
Token ID4162
Feature activation+2.50e-02

Pos logit contributions
 wins+0.002
osh+0.002
upid+0.002
 L+0.002
angles+0.002
Neg logit contributions
Ell-0.003
Da-0.003
isy-0.002
han-0.002
eh-0.002
Token are
Token ID389
Feature activation-1.28e-02

Pos logit contributions
Ell+0.062
 owl+0.056
Da+0.056
Bug+0.053
Lu+0.053
Neg logit contributions
angles-0.021
 wins-0.019
osh-0.019
 recess-0.019
upid-0.019
Token you
Token ID345
Feature activation+6.12e-03

Pos logit contributions
 wins+0.023
angles+0.023
osh+0.023
 scores+0.022
upid+0.021
Neg logit contributions
Ell-0.021
Da-0.018
aron-0.017
age-0.016
 owl-0.016
Token sad
Token ID6507
Feature activation-1.39e-02

Pos logit contributions
Ell+0.011
Da+0.009
aron+0.008
Lu+0.008
Bug+0.008
Neg logit contributions
 wins-0.011
angles-0.010
osh-0.010
upid-0.010
 scores-0.010
Token and
Token ID290
Feature activation+4.77e-03

Pos logit contributions
 wins+0.007
angles+0.007
osh+0.007
 scores+0.007
 recess+0.007
Neg logit contributions
Ell-0.006
Da-0.006
 masse-0.005
Bug-0.005
Mic-0.005
Token Tom
Token ID4186
Feature activation-1.85e-02

Pos logit contributions
Ell+0.009
Da+0.008
 owl+0.007
Lu+0.007
aron+0.007
Neg logit contributions
angles-0.007
 wins-0.007
osh-0.007
upid-0.007
 scores-0.007
Token was
Token ID373
Feature activation+2.86e-03

Pos logit contributions
 wins+0.029
osh+0.029
angles+0.028
 scores+0.026
upid+0.024
Neg logit contributions
Ell-0.035
Da-0.031
Mill-0.028
 masse-0.028
Mic-0.028
Token the
Token ID262
Feature activation+2.46e-03

Pos logit contributions
Ell+0.006
Da+0.005
 owl+0.005
Lu+0.005
Mill+0.005
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
upid-0.004
 scores-0.004
Token h
Token ID289
Feature activation+1.38e-02

Pos logit contributions
Ell+0.004
Da+0.004
 owl+0.004
Lu+0.003
age+0.003
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
upid-0.004
 scores-0.004
Tokenider
Token ID1304
Feature activation+9.11e-02

Pos logit contributions
Ell+0.013
Da+0.010
 owl+0.008
aron+0.007
Mill+0.007
Neg logit contributions
angles-0.035
 wins-0.035
osh-0.034
upid-0.033
 scores-0.033
Token.
Token ID13
Feature activation+8.41e-03

Pos logit contributions
Ell+0.156
Da+0.135
Bug+0.122
Mill+0.121
Mic+0.120
Neg logit contributions
 wins-0.163
osh-0.158
angles-0.157
 scores-0.150
upid-0.149
Token Lily
Token ID20037
Feature activation-9.49e-04

Pos logit contributions
Ell+0.016
Da+0.015
Lu+0.013
aron+0.013
 owl+0.013
Neg logit contributions
angles-0.013
osh-0.013
 wins-0.013
upid-0.012
 scores-0.012
Token counted
Token ID14789
Feature activation+1.27e-02

Pos logit contributions
 wins+0.002
angles+0.002
osh+0.001
 scores+0.001
upid+0.001
Neg logit contributions
Ell-0.002
Da-0.002
 owl-0.001
aron-0.001
Mill-0.001
Token to
Token ID284
Feature activation-4.65e-03

Pos logit contributions
Ell+0.017
Da+0.014
 owl+0.014
Mill+0.013
Lu+0.012
Neg logit contributions
osh-0.025
 wins-0.025
upid-0.024
angles-0.024
 scores-0.024
Token ten
Token ID3478
Feature activation-2.99e-03

Pos logit contributions
angles+0.006
osh+0.006
 wins+0.006
upid+0.006
ours+0.006
Neg logit contributions
Ell-0.010
Da-0.009
 owl-0.009
Lu-0.008
Mill-0.008
Token"
Token ID1
Feature activation+2.23e-02

Pos logit contributions
Ell+0.005
Da+0.004
 owl+0.004
aron+0.004
age+0.004
Neg logit contributions
 wins-0.007
osh-0.007
angles-0.007
upid-0.007
 scores-0.007
TokenLook
Token ID8567
Feature activation-8.17e-03

Pos logit contributions
Ell+0.039
age+0.034
Da+0.033
isy+0.033
 owl+0.033
Neg logit contributions
osh-0.035
 wins-0.034
angles-0.033
 recess-0.031
 scores-0.031
Token,
Token ID11
Feature activation+2.30e-03

Pos logit contributions
 wins+0.018
osh+0.018
angles+0.018
 recess+0.017
 scores+0.017
Neg logit contributions
Ell-0.008
Da-0.007
 owl-0.007
isy-0.007
 Michelle-0.006
Token an
Token ID281
Feature activation-8.60e-04

Pos logit contributions
Ell+0.004
Da+0.004
 owl+0.004
 Bug+0.003
Bug+0.003
Neg logit contributions
angles-0.004
osh-0.003
 wins-0.003
upid-0.003
 scores-0.003
Token all
Token ID477
Feature activation-3.91e-02

Pos logit contributions
 wins+0.001
angles+0.001
upid+0.001
osh+0.001
 scores+0.001
Neg logit contributions
Ell-0.002
Da-0.002
Just-0.001
age-0.001
Charl-0.001
Tokenigator
Token ID23823
Feature activation+9.11e-02

Pos logit contributions
angles+0.084
 wins+0.084
osh+0.083
 scores+0.079
upid+0.079
Neg logit contributions
Ell-0.050
Da-0.042
 owl-0.036
aron-0.035
Mill-0.035
Token!"
Token ID2474
Feature activation-8.49e-04

Pos logit contributions
Ell+0.178
Da+0.161
Mill+0.144
Charl+0.142
Mic+0.142
Neg logit contributions
 wins-0.156
osh-0.154
 scores-0.140
angles-0.137
upid-0.130
Token Tim
Token ID5045
Feature activation-7.65e-03

Pos logit contributions
 wins+0.001
osh+0.001
angles+0.001
 scores+0.001
upid+0.001
Neg logit contributions
Ell-0.002
Da-0.002
age-0.002
aron-0.002
 owl-0.002
Token said
Token ID531
Feature activation-4.97e-03

Pos logit contributions
 wins+0.012
osh+0.011
angles+0.011
 scores+0.011
upid+0.010
Neg logit contributions
Ell-0.015
Da-0.013
 masse-0.012
Bug-0.012
Mill-0.012
Token.
Token ID13
Feature activation+7.08e-03

Pos logit contributions
 wins+0.008
osh+0.008
angles+0.007
 scores+0.007
upid+0.007
Neg logit contributions
Ell-0.009
Da-0.008
Bug-0.007
Mic-0.007
Charl-0.007
Token He
Token ID679
Feature activation-5.47e-03

Pos logit contributions
Ell+0.014
isy+0.012
Da+0.012
ivering+0.012
han+0.012
Neg logit contributions
 wins-0.010
osh-0.010
 L-0.009
 scores-0.008
 turns-0.008
Token and
Token ID290
Feature activation+1.09e-03

Pos logit contributions
 wins+0.013
angles+0.013
osh+0.013
 scores+0.012
upid+0.011
Neg logit contributions
Ell-0.013
Da-0.011
Mill-0.010
Charl-0.010
 masse-0.010
Token apples
Token ID22514
Feature activation+2.39e-02

Pos logit contributions
Ell+0.002
Da+0.002
 owl+0.002
aron+0.002
Mill+0.002
Neg logit contributions
 wins-0.002
angles-0.002
osh-0.002
 scores-0.002
upid-0.002
Token to
Token ID284
Feature activation-3.76e-03

Pos logit contributions
Ell+0.041
Da+0.035
Mill+0.031
 owl+0.031
han+0.031
Neg logit contributions
 wins-0.043
osh-0.042
angles-0.042
 scores-0.040
upid-0.039
Token the
Token ID262
Feature activation+5.16e-03

Pos logit contributions
angles+0.005
 wins+0.005
osh+0.005
upid+0.005
 scores+0.005
Neg logit contributions
Ell-0.007
Da-0.006
 owl-0.006
Bug-0.006
 Bug-0.006
Token all
Token ID477
Feature activation-3.82e-02

Pos logit contributions
Ell+0.011
Da+0.010
aron+0.009
Mill+0.009
age+0.009
Neg logit contributions
 wins-0.007
angles-0.007
osh-0.007
 scores-0.006
upid-0.006
Tokenigator
Token ID23823
Feature activation+9.10e-02

Pos logit contributions
angles+0.064
 wins+0.062
osh+0.061
upid+0.059
yles+0.059
Neg logit contributions
Ell-0.065
Da-0.059
 owl-0.053
Lu-0.052
age-0.052
Token,
Token ID11
Feature activation+2.79e-04

Pos logit contributions
Ell+0.154
Da+0.135
Mill+0.119
Bug+0.117
Lu+0.117
Neg logit contributions
 wins-0.163
angles-0.158
osh-0.158
 scores-0.151
upid-0.147
Token too
Token ID1165
Feature activation-1.25e-02

Pos logit contributions
Ell+0.000
Da+0.000
Bug+0.000
Mill+0.000
age+0.000
Neg logit contributions
 wins-0.001
osh-0.001
angles-0.001
 scores-0.000
upid-0.000
Token.
Token ID13
Feature activation+5.44e-03

Pos logit contributions
angles+0.028
 wins+0.026
osh+0.026
 recess+0.026
 scores+0.026
Neg logit contributions
Ell-0.014
Da-0.013
aron-0.012
 owl-0.011
Just-0.011
Token The
Token ID383
Feature activation+3.41e-03

Pos logit contributions
Ell+0.010
Da+0.009
 owl+0.008
age+0.008
Mill+0.008
Neg logit contributions
 wins-0.009
angles-0.009
osh-0.008
 scores-0.008
upid-0.008
Token all
Token ID477
Feature activation-3.82e-02

Pos logit contributions
Ell+0.008
Da+0.007
aron+0.007
age+0.007
Just+0.007
Neg logit contributions
 wins-0.004
angles-0.004
osh-0.003
 scores-0.003
upid-0.003
Token,
Token ID11
Feature activation+2.79e-04

Pos logit contributions
Ell+0.154
Da+0.135
Mill+0.119
Bug+0.117
Lu+0.117
Neg logit contributions
 wins-0.163
angles-0.158
osh-0.158
 scores-0.151
upid-0.147
Token too
Token ID1165
Feature activation-1.25e-02

Pos logit contributions
Ell+0.000
Da+0.000
Bug+0.000
Mill+0.000
age+0.000
Neg logit contributions
 wins-0.001
osh-0.001
angles-0.001
 scores-0.000
upid-0.000
Token.
Token ID13
Feature activation+5.44e-03

Pos logit contributions
angles+0.028
 wins+0.026
osh+0.026
 recess+0.026
 scores+0.026
Neg logit contributions
Ell-0.014
Da-0.013
aron-0.012
 owl-0.011
Just-0.011
Token The
Token ID383
Feature activation+3.41e-03

Pos logit contributions
Ell+0.010
Da+0.009
 owl+0.008
age+0.008
Mill+0.008
Neg logit contributions
 wins-0.009
angles-0.009
osh-0.008
 scores-0.008
upid-0.008
Token all
Token ID477
Feature activation-3.82e-02

Pos logit contributions
Ell+0.008
Da+0.007
aron+0.007
age+0.007
Just+0.007
Neg logit contributions
 wins-0.004
angles-0.004
osh-0.003
 scores-0.003
upid-0.003
Tokenigator
Token ID23823
Feature activation+9.09e-02

Pos logit contributions
osh+0.077
 wins+0.076
angles+0.075
 scores+0.070
upid+0.069
Neg logit contributions
Ell-0.057
Da-0.048
Just-0.043
Mill-0.043
aron-0.043
Token did
Token ID750
Feature activation-1.40e-02

Pos logit contributions
Ell+0.175
Da+0.156
Mill+0.142
Lu+0.138
Bug+0.138
Neg logit contributions
 wins-0.142
osh-0.141
angles-0.137
 scores-0.130
upid-0.121
Token not
Token ID407
Feature activation+1.58e-02

Pos logit contributions
osh+0.030
angles+0.030
 wins+0.029
yles+0.028
 recess+0.028
Neg logit contributions
Ell-0.017
Da-0.015
isy-0.013
 owl-0.013
aron-0.013
Token bite
Token ID13197
Feature activation+7.01e-03

Pos logit contributions
Ell+0.031
Da+0.028
 owl+0.026
aron+0.026
age+0.025
Neg logit contributions
angles-0.023
osh-0.022
 wins-0.022
yles-0.021
upid-0.021
Token them
Token ID606
Feature activation+1.41e-02

Pos logit contributions
Ell+0.014
Da+0.012
 owl+0.011
Lu+0.011
Bug+0.011
Neg logit contributions
angles-0.010
osh-0.010
 wins-0.010
upid-0.009
 scores-0.009
Token.
Token ID13
Feature activation+4.94e-03

Pos logit contributions
Ell+0.019
Da+0.016
 owl+0.015
aron+0.015
isy+0.014
Neg logit contributions
angles-0.028
osh-0.028
 wins-0.027
upid-0.026
 recess-0.026
Token
Token ID198
Feature activation+3.16e-03

Pos logit contributions
Ell+0.003
Da+0.003
Mill+0.002
Just+0.002
 owl+0.002
Neg logit contributions
angles-0.005
 wins-0.005
osh-0.005
 scores-0.005
yles-0.005
TokenSuddenly
Token ID38582
Feature activation+8.64e-03

Pos logit contributions
Ell+0.004
Da+0.004
 owl+0.003
aron+0.003
Lu+0.003
Neg logit contributions
angles-0.007
 wins-0.007
osh-0.007
 scores-0.006
upid-0.006
Token,
Token ID11
Feature activation+2.81e-03

Pos logit contributions
Ell+0.014
Da+0.012
aron+0.011
 owl+0.011
age+0.011
Neg logit contributions
 wins-0.016
angles-0.016
osh-0.016
upid-0.015
 scores-0.015
Token the
Token ID262
Feature activation+5.67e-03

Pos logit contributions
Ell+0.005
Da+0.004
 Hope+0.004
 owl+0.004
 Michelle+0.004
Neg logit contributions
angles-0.005
 wins-0.004
upid-0.004
osh-0.004
ret-0.004
Token all
Token ID477
Feature activation-3.98e-02

Pos logit contributions
Ell+0.012
Da+0.011
aron+0.011
age+0.011
Just+0.011
Neg logit contributions
 wins-0.007
angles-0.007
 scores-0.006
 recess-0.006
osh-0.006
Tokenigator
Token ID23823
Feature activation+9.09e-02

Pos logit contributions
 wins+0.084
osh+0.083
angles+0.083
 scores+0.078
upid+0.078
Neg logit contributions
Ell-0.054
Da-0.046
aron-0.040
Mill-0.039
 owl-0.039
Token snapped
Token ID20821
Feature activation+4.21e-03

Pos logit contributions
Ell+0.202
Da+0.182
Mill+0.171
Lu+0.168
aron+0.168
Neg logit contributions
osh-0.118
 wins-0.118
 scores-0.110
angles-0.109
upid-0.097
Token its
Token ID663
Feature activation-3.00e-03

Pos logit contributions
Ell+0.007
Da+0.006
 owl+0.005
Bug+0.005
age+0.005
Neg logit contributions
 wins-0.008
osh-0.008
angles-0.008
upid-0.007
 scores-0.007
Token jaws
Token ID39940
Feature activation+2.11e-02

Pos logit contributions
 wins+0.004
angles+0.004
 scores+0.004
upid+0.004
osh+0.004
Neg logit contributions
Ell-0.006
Da-0.006
Charl-0.005
Lu-0.005
 Michelle-0.005
Token and
Token ID290
Feature activation+1.02e-03

Pos logit contributions
Ell+0.038
Da+0.032
Mill+0.031
han+0.029
Mic+0.029
Neg logit contributions
 wins-0.039
osh-0.038
upid-0.035
 scores-0.035
angles-0.034
Token bit
Token ID1643
Feature activation+1.12e-02

Pos logit contributions
Ell+0.002
Da+0.002
 owl+0.002
Mill+0.002
aron+0.002
Neg logit contributions
 wins-0.002
angles-0.002
osh-0.002
 scores-0.001
upid-0.001
Token she
Token ID673
Feature activation-3.99e-03

Pos logit contributions
 wins+0.035
angles+0.034
osh+0.034
 scores+0.032
upid+0.031
Neg logit contributions
Ell-0.032
Da-0.028
Bug-0.025
 owl-0.025
aron-0.025
Token had
Token ID550
Feature activation+1.43e-03

Pos logit contributions
 wins+0.006
osh+0.006
angles+0.006
 scores+0.005
 recess+0.005
Neg logit contributions
Ell-0.008
Da-0.007
aron-0.006
Mill-0.006
Bug-0.006
Token with
Token ID351
Feature activation+2.19e-03

Pos logit contributions
Ell+0.003
Da+0.002
Bug+0.002
 masse+0.002
 owl+0.002
Neg logit contributions
 wins-0.002
osh-0.002
angles-0.002
 scores-0.002
 recess-0.002
Token her
Token ID607
Feature activation+2.52e-03

Pos logit contributions
Ell+0.003
Da+0.003
 owl+0.003
Mill+0.002
 Michelle+0.002
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
upid-0.004
 scores-0.004
Token or
Token ID393
Feature activation+3.47e-03

Pos logit contributions
Ell+0.005
Da+0.004
aron+0.004
 owl+0.004
age+0.004
Neg logit contributions
 wins-0.004
angles-0.004
osh-0.004
 scores-0.003
upid-0.003
Tokennam
Token ID7402
Feature activation-9.21e-02

Pos logit contributions
Ell+0.006
Da+0.006
 owl+0.005
aron+0.005
Mill+0.005
Neg logit contributions
angles-0.005
 wins-0.005
osh-0.005
upid-0.005
 scores-0.005
Tokenents
Token ID658
Feature activation-5.02e-02

Pos logit contributions
osh+0.102
 wins+0.100
angles+0.092
 recess+0.092
 scores+0.086
Neg logit contributions
Ell-0.218
Da-0.194
Mill-0.191
Just-0.182
aron-0.182
Token.
Token ID13
Feature activation-6.72e-03

Pos logit contributions
angles+0.095
 wins+0.095
osh+0.094
upid+0.089
 scores+0.089
Neg logit contributions
Ell-0.076
Da-0.066
 owl-0.059
aron-0.057
age-0.057
Token<|endoftext|>
Token ID50256
Feature activation-2.21e-02

Pos logit contributions
angles+0.012
 wins+0.012
osh+0.012
 scores+0.011
upid+0.011
Neg logit contributions
Ell-0.011
Da-0.010
 owl-0.009
aron-0.008
Mill-0.008
TokenOnce
Token ID7454
Feature activation-2.99e-02

Pos logit contributions
angles+0.038
 wins+0.038
osh+0.038
 scores+0.036
upid+0.036
Neg logit contributions
Ell-0.037
Da-0.032
 owl-0.029
aron-0.029
Mill-0.028
Token upon
Token ID2402
Feature activation+9.21e-03

Pos logit contributions
 wins+0.044
 scores+0.042
angles+0.042
toe+0.041
osh+0.040
Neg logit contributions
Ell-0.057
Da-0.052
Lu-0.047
 Betsy-0.047
isy-0.047
Token the
Token ID262
Feature activation+1.29e-03

Pos logit contributions
Ell+0.006
Da+0.006
 owl+0.005
 Michelle+0.005
aron+0.005
Neg logit contributions
angles-0.005
osh-0.005
 wins-0.005
yles-0.004
upid-0.004
Token rest
Token ID1334
Feature activation-1.07e-02

Pos logit contributions
Ell+0.002
Da+0.002
 owl+0.002
age+0.002
aron+0.002
Neg logit contributions
angles-0.002
osh-0.002
 wins-0.002
upid-0.002
yles-0.002
Token of
Token ID286
Feature activation+6.45e-03

Pos logit contributions
 wins+0.020
osh+0.019
angles+0.019
 scores+0.018
upid+0.018
Neg logit contributions
Ell-0.018
Da-0.015
 owl-0.014
Bug-0.013
Mill-0.013
Token the
Token ID262
Feature activation+3.91e-04

Pos logit contributions
Ell+0.012
Da+0.011
 owl+0.010
 Betsy+0.010
 Michelle+0.010
Neg logit contributions
angles-0.010
osh-0.010
 wins-0.009
upid-0.009
ret-0.009
Token or
Token ID393
Feature activation+1.01e-02

Pos logit contributions
Ell+0.001
Da+0.001
 owl+0.001
aron+0.001
Mill+0.001
Neg logit contributions
angles-0.001
 wins-0.001
osh-0.001
upid-0.001
 scores-0.001
Tokennam
Token ID7402
Feature activation-8.90e-02

Pos logit contributions
Ell+0.018
Da+0.016
 owl+0.015
 Michelle+0.014
 Hope+0.014
Neg logit contributions
upid-0.016
angles-0.016
osh-0.015
 wins-0.015
ret-0.014
Tokenents
Token ID658
Feature activation-4.72e-02

Pos logit contributions
 wins+0.101
osh+0.100
 recess+0.097
angles+0.092
 scores+0.089
Neg logit contributions
Ell-0.203
Da-0.180
Mill-0.177
Just-0.171
age-0.168
Token onto
Token ID4291
Feature activation+1.10e-02

Pos logit contributions
 wins+0.089
angles+0.089
osh+0.088
 scores+0.083
upid+0.083
Neg logit contributions
Ell-0.073
Da-0.063
 owl-0.055
aron-0.055
Bug-0.055
Token the
Token ID262
Feature activation-2.34e-03

Pos logit contributions
Ell+0.024
Da+0.022
 owl+0.020
aron+0.020
Bug+0.020
Neg logit contributions
 wins-0.014
angles-0.014
osh-0.013
 scores-0.012
upid-0.012
Token tree
Token ID5509
Feature activation-3.56e-03

Pos logit contributions
 wins+0.004
angles+0.003
osh+0.003
 scores+0.003
upid+0.003
Neg logit contributions
Ell-0.004
Da-0.004
Mill-0.003
aron-0.003
Bug-0.003
Token.
Token ID13
Feature activation-5.54e-03

Pos logit contributions
 wins+0.006
osh+0.006
angles+0.006
 scores+0.006
upid+0.006
Neg logit contributions
Ell-0.006
Da-0.005
Bug-0.005
Mill-0.005
 owl-0.005
Token time
Token ID640
Feature activation-1.65e-02

Pos logit contributions
Ell+0.028
Da+0.026
Bug+0.024
aron+0.023
Mic+0.023
Neg logit contributions
 wins-0.017
osh-0.016
angles-0.016
 scores-0.016
 recess-0.015
Token playing
Token ID2712
Feature activation-2.31e-03

Pos logit contributions
 wins+0.029
osh+0.028
angles+0.027
 scores+0.027
 recess+0.025
Neg logit contributions
Ell-0.029
Da-0.026
Bug-0.024
Mic-0.023
 masse-0.022
Token with
Token ID351
Feature activation+4.89e-03

Pos logit contributions
 wins+0.004
osh+0.004
 scores+0.004
 recess+0.004
angles+0.004
Neg logit contributions
Ell-0.003
Da-0.003
 masse-0.003
Mic-0.003
aron-0.003
Token her
Token ID607
Feature activation-4.11e-04

Pos logit contributions
Ell+0.007
Da+0.006
 owl+0.006
Mill+0.005
Lu+0.005
Neg logit contributions
angles-0.010
 wins-0.010
osh-0.010
upid-0.009
 scores-0.009
Token or
Token ID393
Feature activation+4.31e-03

Pos logit contributions
 wins+0.001
angles+0.001
osh+0.001
 scores+0.001
upid+0.001
Neg logit contributions
Ell-0.001
Da-0.001
aron-0.001
age-0.001
Bug-0.001
Tokennam
Token ID7402
Feature activation-8.80e-02

Pos logit contributions
Ell+0.007
Da+0.007
 owl+0.006
aron+0.006
Mill+0.006
Neg logit contributions
angles-0.007
 wins-0.007
osh-0.007
 scores-0.007
upid-0.007
Tokenents
Token ID658
Feature activation-5.32e-02

Pos logit contributions
 wins+0.088
osh+0.087
angles+0.079
upid+0.076
 recess+0.075
Neg logit contributions
Ell-0.219
Da-0.195
Mill-0.188
Just-0.181
age-0.181
Token and
Token ID290
Feature activation-7.24e-03

Pos logit contributions
 wins+0.101
angles+0.100
osh+0.100
 scores+0.094
upid+0.093
Neg logit contributions
Ell-0.082
Da-0.071
Bug-0.062
Mill-0.062
aron-0.061
Token making
Token ID1642
Feature activation+2.62e-02

Pos logit contributions
angles+0.011
 wins+0.011
osh+0.011
 scores+0.011
yles+0.010
Neg logit contributions
Ell-0.013
Da-0.012
aron-0.011
Bug-0.011
age-0.011
Token her
Token ID607
Feature activation+1.94e-03

Pos logit contributions
Ell+0.047
Da+0.042
 owl+0.038
Mill+0.038
Lu+0.037
Neg logit contributions
 wins-0.043
osh-0.043
angles-0.043
upid-0.041
 scores-0.038
Token special
Token ID2041
Feature activation+1.97e-02

Pos logit contributions
Ell+0.004
 owl+0.003
Da+0.003
age+0.003
isy+0.003
Neg logit contributions
angles-0.003
osh-0.003
 wins-0.003
upid-0.003
 recess-0.003
Token take
Token ID1011
Feature activation+1.16e-03

Pos logit contributions
 wins+0.012
angles+0.012
osh+0.012
upid+0.011
 scores+0.011
Neg logit contributions
Ell-0.022
Da-0.021
aron-0.019
 owl-0.018
Mill-0.018
Token one
Token ID530
Feature activation-1.09e-02

Pos logit contributions
Ell+0.002
Da+0.002
 owl+0.002
aron+0.002
Bug+0.002
Neg logit contributions
angles-0.002
 wins-0.002
osh-0.002
upid-0.002
 scores-0.002
Token of
Token ID286
Feature activation+3.54e-03

Pos logit contributions
 wins+0.015
osh+0.015
angles+0.015
 scores+0.014
upid+0.014
Neg logit contributions
Ell-0.023
Da-0.020
aron-0.019
Mill-0.018
Bug-0.018
Token the
Token ID262
Feature activation+1.09e-03

Pos logit contributions
Ell+0.007
Da+0.007
 owl+0.006
aron+0.006
Mill+0.006
Neg logit contributions
angles-0.005
 wins-0.005
osh-0.005
upid-0.004
 scores-0.004
Token or
Token ID393
Feature activation+8.10e-03

Pos logit contributions
Ell+0.002
Da+0.002
aron+0.002
age+0.002
Mill+0.002
Neg logit contributions
 wins-0.002
angles-0.002
osh-0.002
 scores-0.002
upid-0.001
Tokennam
Token ID7402
Feature activation-8.75e-02

Pos logit contributions
Ell+0.017
Da+0.015
 owl+0.014
aron+0.014
Mill+0.014
Neg logit contributions
angles-0.011
 wins-0.011
osh-0.011
upid-0.010
 scores-0.010
Tokenents
Token ID658
Feature activation-4.35e-02

Pos logit contributions
 wins+0.088
osh+0.087
angles+0.081
 recess+0.079
 scores+0.076
Neg logit contributions
Ell-0.214
Da-0.193
Mill-0.188
Just-0.180
age-0.180
Token outside
Token ID2354
Feature activation-1.52e-03

Pos logit contributions
 wins+0.082
angles+0.082
osh+0.081
upid+0.077
 scores+0.077
Neg logit contributions
Ell-0.067
Da-0.058
 owl-0.051
aron-0.050
Mill-0.050
Token for
Token ID329
Feature activation+1.08e-02

Pos logit contributions
angles+0.003
 wins+0.003
osh+0.003
upid+0.003
 scores+0.003
Neg logit contributions
Ell-0.002
Da-0.002
 owl-0.002
aron-0.002
age-0.002
Token a
Token ID257
Feature activation-6.22e-03

Pos logit contributions
Ell+0.019
Da+0.017
 owl+0.016
Mill+0.015
Bug+0.015
Neg logit contributions
angles-0.019
osh-0.018
 wins-0.018
upid-0.017
 scores-0.016
Token walk
Token ID2513
Feature activation-1.05e-02

Pos logit contributions
 wins+0.010
angles+0.010
osh+0.010
 scores+0.009
upid+0.009
Neg logit contributions
Ell-0.011
Da-0.010
aron-0.009
 owl-0.009
Bug-0.009
Token she
Token ID673
Feature activation+4.57e-03

Pos logit contributions
angles+0.012
osh+0.012
 wins+0.012
upid+0.012
 scores+0.011
Neg logit contributions
Ell-0.009
Da-0.008
 owl-0.007
Mill-0.007
Lu-0.007
Token could
Token ID714
Feature activation-2.76e-02

Pos logit contributions
Ell+0.008
Da+0.007
 owl+0.007
aron+0.007
age+0.007
Neg logit contributions
angles-0.007
 wins-0.007
osh-0.007
upid-0.007
 scores-0.007
Token carry
Token ID3283
Feature activation-4.21e-04

Pos logit contributions
 wins+0.035
angles+0.034
osh+0.034
 scores+0.031
upid+0.031
Neg logit contributions
Ell-0.061
Da-0.057
aron-0.051
Lu-0.050
Bug-0.050
Token the
Token ID262
Feature activation+3.13e-03

Pos logit contributions
 wins+0.000
angles+0.000
osh+0.000
 scores+0.000
upid+0.000
Neg logit contributions
Ell-0.001
Da-0.001
 owl-0.001
aron-0.001
age-0.001
Token or
Token ID393
Feature activation+8.75e-03

Pos logit contributions
Ell+0.006
Da+0.006
aron+0.005
Just+0.005
Mill+0.005
Neg logit contributions
 wins-0.005
angles-0.004
 scores-0.004
osh-0.004
 ease-0.004
Tokennam
Token ID7402
Feature activation-8.64e-02

Pos logit contributions
Ell+0.018
Da+0.016
 owl+0.015
Mill+0.015
Bug+0.015
Neg logit contributions
angles-0.012
osh-0.012
 wins-0.012
upid-0.011
 scores-0.011
Tokenents
Token ID658
Feature activation-3.73e-02

Pos logit contributions
 wins+0.084
osh+0.083
angles+0.078
 recess+0.075
 scores+0.074
Neg logit contributions
Ell-0.214
Da-0.193
Mill-0.187
age-0.181
 owl-0.179
Token back
Token ID736
Feature activation+6.81e-03

Pos logit contributions
 wins+0.070
angles+0.070
osh+0.069
 scores+0.065
upid+0.065
Neg logit contributions
Ell-0.057
Da-0.050
 owl-0.044
aron-0.043
Mill-0.043
Token with
Token ID351
Feature activation+6.31e-03

Pos logit contributions
Ell+0.011
Da+0.010
 owl+0.009
aron+0.009
Mill+0.009
Neg logit contributions
angles-0.012
osh-0.012
 wins-0.012
upid-0.011
 scores-0.011
Token her
Token ID607
Feature activation+1.06e-02

Pos logit contributions
Ell+0.010
Da+0.009
 owl+0.009
Mill+0.008
 Michelle+0.008
Neg logit contributions
angles-0.011
 wins-0.011
osh-0.011
upid-0.010
 scores-0.010
Token.
Token ID13
Feature activation-4.83e-04

Pos logit contributions
Ell+0.020
Da+0.018
aron+0.016
Mill+0.016
Bug+0.016
Neg logit contributions
 wins-0.017
angles-0.016
osh-0.016
 scores-0.015
upid-0.015
Token she
Token ID673
Feature activation+4.92e-03

Pos logit contributions
 wins+0.013
angles+0.013
osh+0.013
 scores+0.012
 recess+0.012
Neg logit contributions
Ell-0.014
Da-0.012
age-0.011
aron-0.011
Bug-0.011
Token could
Token ID714
Feature activation-2.71e-02

Pos logit contributions
Ell+0.010
Da+0.009
 masse+0.009
aron+0.009
 owl+0.009
Neg logit contributions
osh-0.007
 wins-0.006
angles-0.006
 scores-0.006
yles-0.005
Token carry
Token ID3283
Feature activation+9.58e-04

Pos logit contributions
 wins+0.031
osh+0.031
angles+0.030
upid+0.028
 recess+0.027
Neg logit contributions
Ell-0.062
Da-0.058
aron-0.052
Charl-0.052
Mic-0.051
Token the
Token ID262
Feature activation+2.51e-03

Pos logit contributions
Ell+0.002
Da+0.002
 owl+0.002
aron+0.002
age+0.002
Neg logit contributions
 wins-0.001
angles-0.001
osh-0.001
 scores-0.001
upid-0.001
Token or
Token ID393
Feature activation+9.25e-03

Pos logit contributions
Ell+0.005
Da+0.004
aron+0.004
Just+0.004
han+0.004
Neg logit contributions
 wins-0.004
angles-0.004
osh-0.003
 scores-0.003
 recess-0.003
Tokennam
Token ID7402
Feature activation-8.63e-02

Pos logit contributions
Ell+0.018
Da+0.016
 owl+0.016
Mill+0.015
age+0.015
Neg logit contributions
angles-0.013
osh-0.013
 wins-0.013
upid-0.012
 scores-0.012
Tokenents
Token ID658
Feature activation-3.80e-02

Pos logit contributions
 wins+0.088
osh+0.086
angles+0.080
 recess+0.079
 scores+0.078
Neg logit contributions
Ell-0.212
Da-0.191
Mill-0.186
age-0.178
Just-0.176
Token to
Token ID284
Feature activation-8.27e-03

Pos logit contributions
angles+0.071
 wins+0.071
osh+0.070
upid+0.067
 scores+0.067
Neg logit contributions
Ell-0.058
Da-0.050
 owl-0.045
age-0.044
aron-0.044
Token the
Token ID262
Feature activation+2.21e-03

Pos logit contributions
angles+0.014
osh+0.013
 wins+0.013
upid+0.013
 scores+0.012
Neg logit contributions
Ell-0.014
 owl-0.013
Da-0.013
Mill-0.012
 Hope-0.012
Token place
Token ID1295
Feature activation-4.08e-03

Pos logit contributions
Ell+0.004
Da+0.004
aron+0.003
Mill+0.003
age+0.003
Neg logit contributions
 wins-0.003
angles-0.003
osh-0.003
 scores-0.003
upid-0.003
Token.
Token ID13
Feature activation-1.98e-04

Pos logit contributions
angles+0.008
 wins+0.008
osh+0.008
upid+0.007
 scores+0.007
Neg logit contributions
Ell-0.006
Da-0.005
 owl-0.005
aron-0.005
Mill-0.005
Token and
Token ID290
Feature activation-6.32e-03

Pos logit contributions
 wins+0.010
angles+0.010
osh+0.010
 scores+0.009
yles+0.009
Neg logit contributions
Ell-0.010
Da-0.009
Bug-0.008
Lu-0.008
 masse-0.008
Token lots
Token ID6041
Feature activation-3.50e-02

Pos logit contributions
angles+0.010
osh+0.009
 wins+0.009
upid+0.009
 scores+0.009
Neg logit contributions
Ell-0.012
Da-0.011
 owl-0.010
Mill-0.010
aron-0.010
Token of
Token ID286
Feature activation+3.27e-03

Pos logit contributions
 wins+0.062
angles+0.061
osh+0.061
 scores+0.058
upid+0.057
Neg logit contributions
Ell-0.059
Da-0.051
 owl-0.045
Bug-0.045
Mill-0.045
Token shiny
Token ID22441
Feature activation+4.62e-03

Pos logit contributions
Ell+0.006
Da+0.005
 owl+0.005
aron+0.005
Mill+0.005
Neg logit contributions
angles-0.005
osh-0.005
upid-0.005
 wins-0.005
 scores-0.005
Token or
Token ID393
Feature activation+7.82e-03

Pos logit contributions
Ell+0.009
Da+0.008
 owl+0.008
aron+0.008
Mill+0.008
Neg logit contributions
angles-0.006
 wins-0.006
osh-0.006
upid-0.006
 scores-0.006
Tokennam
Token ID7402
Feature activation-8.63e-02

Pos logit contributions
Ell+0.014
Da+0.012
 owl+0.012
Mill+0.011
Lu+0.010
Neg logit contributions
angles-0.013
osh-0.013
upid-0.013
 wins-0.012
 scores-0.011
Tokenents
Token ID658
Feature activation-4.35e-02

Pos logit contributions
osh+0.102
 wins+0.099
 recess+0.092
angles+0.088
 scores+0.087
Neg logit contributions
Ell-0.197
Mill-0.176
Da-0.171
Just-0.162
aron-0.162
Token.
Token ID13
Feature activation-5.06e-03

Pos logit contributions
angles+0.089
 wins+0.087
osh+0.087
upid+0.084
 scores+0.083
Neg logit contributions
Ell-0.059
Da-0.050
 owl-0.045
aron-0.043
age-0.043
Token L
Token ID406
Feature activation-4.62e-02

Pos logit contributions
angles+0.008
osh+0.008
 wins+0.007
 scores+0.007
upid+0.007
Neg logit contributions
Ell-0.010
Da-0.009
Just-0.008
Lu-0.008
Mill-0.008
Tokenila
Token ID10102
Feature activation-2.44e-02

Pos logit contributions
angles+0.049
 wins+0.048
osh+0.046
 scores+0.043
upid+0.042
Neg logit contributions
Ell-0.107
Da-0.100
Lu-0.091
isy-0.091
aron-0.091
Token wanted
Token ID2227
Feature activation-8.58e-03

Pos logit contributions
 wins+0.041
angles+0.040
osh+0.040
 scores+0.038
upid+0.037
Neg logit contributions
Ell-0.043
Da-0.038
 owl-0.034
Bug-0.034
aron-0.034
Token a
Token ID257
Feature activation-5.96e-03

Pos logit contributions
 wins+0.014
angles+0.013
osh+0.013
upid+0.013
 scores+0.012
Neg logit contributions
Ell-0.016
Da-0.014
 owl-0.013
aron-0.013
Mill-0.013
Token letter
Token ID3850
Feature activation-8.40e-03

Pos logit contributions
angles+0.011
 wins+0.011
osh+0.010
 scores+0.010
upid+0.010
Neg logit contributions
Ell-0.010
Da-0.009
 owl-0.008
aron-0.008
Mill-0.008
Token about
Token ID546
Feature activation-2.63e-02

Pos logit contributions
osh+0.014
 wins+0.014
angles+0.014
 scores+0.013
upid+0.013
Neg logit contributions
Ell-0.015
Da-0.013
Bug-0.012
Mic-0.012
Lu-0.012
Token the
Token ID262
Feature activation+1.06e-03

Pos logit contributions
angles+0.040
osh+0.039
 wins+0.039
upid+0.037
 scores+0.036
Neg logit contributions
Ell-0.050
Da-0.045
 owl-0.043
Mill-0.041
aron-0.040
Token or
Token ID393
Feature activation+7.72e-03

Pos logit contributions
Ell+0.002
Da+0.002
aron+0.002
Just+0.001
Mill+0.001
Neg logit contributions
 wins-0.002
angles-0.002
osh-0.002
 scores-0.002
 recess-0.002
Tokennam
Token ID7402
Feature activation-8.56e-02

Pos logit contributions
Ell+0.015
Da+0.013
 owl+0.012
Mill+0.012
 Michelle+0.012
Neg logit contributions
angles-0.012
osh-0.012
upid-0.011
 wins-0.011
 scores-0.011
Tokenents
Token ID658
Feature activation-3.86e-02

Pos logit contributions
osh+0.093
 wins+0.092
 recess+0.085
angles+0.084
 scores+0.081
Neg logit contributions
Ell-0.202
Da-0.179
Mill-0.176
aron-0.170
Just-0.168
Token and
Token ID290
Feature activation-4.83e-03

Pos logit contributions
angles+0.077
 wins+0.076
osh+0.075
upid+0.072
 scores+0.072
Neg logit contributions
Ell-0.054
Da-0.046
 owl-0.042
aron-0.041
age-0.040
Token asked
Token ID1965
Feature activation-5.17e-03

Pos logit contributions
angles+0.008
osh+0.008
upid+0.008
 wins+0.008
ret+0.007
Neg logit contributions
Ell-0.009
Da-0.007
 owl-0.007
Mill-0.007
aron-0.007
Token for
Token ID329
Feature activation+1.13e-02

Pos logit contributions
angles+0.008
 wins+0.008
osh+0.008
 scores+0.008
upid+0.008
Neg logit contributions
Ell-0.009
Da-0.008
 owl-0.008
aron-0.008
age-0.008
Token one
Token ID530
Feature activation-1.09e-02

Pos logit contributions
Ell+0.019
 owl+0.017
Da+0.017
 Hope+0.016
Mill+0.015
Neg logit contributions
angles-0.019
 wins-0.019
osh-0.019
upid-0.019
 scores-0.017
Token the
Token ID262
Feature activation+2.83e-03

Pos logit contributions
Ell+0.004
Da+0.004
age+0.004
aron+0.004
 owl+0.003
Neg logit contributions
angles-0.002
 wins-0.002
osh-0.002
upid-0.001
 scores-0.001
Token sky
Token ID6766
Feature activation+1.55e-02

Pos logit contributions
Ell+0.005
Da+0.005
aron+0.004
 owl+0.004
Mill+0.004
Neg logit contributions
 wins-0.004
angles-0.004
osh-0.004
 scores-0.004
upid-0.004
Token and
Token ID290
Feature activation-2.66e-03

Pos logit contributions
Ell+0.023
Da+0.020
Mic+0.017
Bug+0.017
han+0.017
Neg logit contributions
osh-0.031
 wins-0.031
angles-0.030
 scores-0.028
upid-0.028
Token the
Token ID262
Feature activation+2.89e-03

Pos logit contributions
upid+0.004
angles+0.004
 wins+0.004
osh+0.004
 scores+0.003
Neg logit contributions
Ell-0.005
Da-0.005
 owl-0.005
Mill-0.004
Lu-0.004
Token or
Token ID393
Feature activation+9.68e-03

Pos logit contributions
Ell+0.005
Da+0.005
age+0.004
aron+0.004
Mill+0.004
Neg logit contributions
 wins-0.005
angles-0.005
osh-0.005
 scores-0.004
 recess-0.004
Tokennam
Token ID7402
Feature activation-8.54e-02

Pos logit contributions
Ell+0.018
Da+0.017
 owl+0.016
 Hope+0.015
 Michelle+0.015
Neg logit contributions
upid-0.015
angles-0.015
 wins-0.014
osh-0.014
 scores-0.014
Tokenents
Token ID658
Feature activation-3.75e-02

Pos logit contributions
 wins+0.100
osh+0.096
 recess+0.091
 scores+0.088
angles+0.087
Neg logit contributions
Ell-0.196
Da-0.175
Mill-0.173
age-0.165
Just-0.165
Token gl
Token ID1278
Feature activation-6.94e-03

Pos logit contributions
angles+0.062
 wins+0.060
upid+0.059
 scores+0.057
osh+0.057
Neg logit contributions
Ell-0.064
Da-0.056
age-0.054
 owl-0.053
isy-0.052
Tokenittered
Token ID36613
Feature activation+1.35e-02

Pos logit contributions
 wins+0.012
angles+0.012
osh+0.012
 scores+0.012
upid+0.011
Neg logit contributions
Ell-0.011
Da-0.010
aron-0.009
age-0.009
 owl-0.009
Token in
Token ID287
Feature activation+9.47e-04

Pos logit contributions
Ell+0.018
Da+0.015
Bug+0.013
 owl+0.013
Mill+0.013
Neg logit contributions
 wins-0.029
angles-0.028
osh-0.028
 scores-0.027
upid-0.026
Token the
Token ID262
Feature activation+2.02e-03

Pos logit contributions
Ell+0.002
Da+0.002
Mill+0.002
 owl+0.002
Lu+0.002
Neg logit contributions
angles-0.001
 wins-0.001
osh-0.001
upid-0.001
 scores-0.001
Token All
Token ID1439
Feature activation-2.92e-02

Pos logit contributions
angles+0.011
osh+0.011
 wins+0.010
 scores+0.010
upid+0.010
Neg logit contributions
Ell-0.010
Da-0.009
aron-0.008
 owl-0.008
Lu-0.008
Token of
Token ID286
Feature activation+6.78e-03

Pos logit contributions
angles+0.051
 wins+0.050
upid+0.048
osh+0.048
 scores+0.047
Neg logit contributions
Ell-0.048
Da-0.042
 owl-0.039
Lu-0.038
age-0.037
Token these
Token ID777
Feature activation+2.11e-02

Pos logit contributions
Ell+0.013
Da+0.012
 owl+0.011
Mill+0.011
Lu+0.011
Neg logit contributions
angles-0.010
 wins-0.010
osh-0.010
upid-0.009
 scores-0.009
Token lovely
Token ID14081
Feature activation+1.84e-02

Pos logit contributions
Ell+0.046
Da+0.041
 owl+0.039
aron+0.038
Bug+0.038
Neg logit contributions
angles-0.026
 wins-0.026
osh-0.026
upid-0.024
 scores-0.024
Token or
Token ID393
Feature activation+8.07e-03

Pos logit contributions
Ell+0.040
Da+0.036
aron+0.033
Bug+0.032
Just+0.032
Neg logit contributions
 wins-0.024
angles-0.023
osh-0.022
 scores-0.022
upid-0.020
Tokennam
Token ID7402
Feature activation-8.54e-02

Pos logit contributions
Ell+0.016
Da+0.014
 owl+0.014
Mill+0.013
Lu+0.013
Neg logit contributions
osh-0.012
angles-0.012
upid-0.011
 wins-0.011
ret-0.010
Tokenents
Token ID658
Feature activation-4.55e-02

Pos logit contributions
 wins+0.089
osh+0.088
 recess+0.080
angles+0.080
 scores+0.078
Neg logit contributions
Ell-0.208
Da-0.185
Mill-0.182
aron-0.174
age-0.173
Token reminded
Token ID14516
Feature activation+5.70e-03

Pos logit contributions
angles+0.067
 wins+0.066
osh+0.065
upid+0.062
 scores+0.062
Neg logit contributions
Ell-0.087
Da-0.078
 owl-0.072
age-0.071
aron-0.071
Token him
Token ID683
Feature activation+1.25e-04

Pos logit contributions
Ell+0.013
Da+0.012
 owl+0.011
aron+0.011
age+0.011
Neg logit contributions
angles-0.006
 wins-0.006
osh-0.006
 scores-0.005
upid-0.005
Token of
Token ID286
Feature activation+3.46e-03

Pos logit contributions
Ell+0.000
Da+0.000
 owl+0.000
age+0.000
aron+0.000
Neg logit contributions
angles-0.000
 wins-0.000
osh-0.000
upid-0.000
 scores-0.000
Token the
Token ID262
Feature activation-1.07e-03

Pos logit contributions
Ell+0.007
Da+0.006
 owl+0.006
aron+0.006
Mill+0.006
Neg logit contributions
angles-0.005
 wins-0.005
osh-0.005
upid-0.005
 scores-0.004
Token at
Token ID379
Feature activation-3.03e-02

Pos logit contributions
Ell+0.048
Da+0.042
Bug+0.039
Mill+0.038
 owl+0.038
Neg logit contributions
 wins-0.037
angles-0.037
osh-0.037
 scores-0.035
upid-0.034
Token all
Token ID477
Feature activation-3.95e-02

Pos logit contributions
angles+0.045
 wins+0.045
osh+0.044
upid+0.044
 scores+0.042
Neg logit contributions
Ell-0.054
Da-0.049
 owl-0.049
 Betsy-0.047
 Bug-0.047
Token the
Token ID262
Feature activation+1.93e-03

Pos logit contributions
 wins+0.045
angles+0.045
osh+0.044
 scores+0.040
upid+0.040
Neg logit contributions
Ell-0.091
Da-0.083
 owl-0.077
aron-0.076
Bug-0.076
Token different
Token ID1180
Feature activation-5.15e-04

Pos logit contributions
Ell+0.004
Da+0.003
 owl+0.003
Lu+0.003
aron+0.003
Neg logit contributions
angles-0.003
osh-0.003
 wins-0.003
upid-0.003
 scores-0.003
Token or
Token ID393
Feature activation+1.01e-02

Pos logit contributions
angles+0.001
osh+0.001
 wins+0.001
upid+0.001
 scores+0.001
Neg logit contributions
Ell-0.001
Da-0.001
 owl-0.001
aron-0.001
Lu-0.001
Tokennam
Token ID7402
Feature activation-8.51e-02

Pos logit contributions
Ell+0.018
Da+0.015
 owl+0.015
Mill+0.014
 Michelle+0.014
Neg logit contributions
osh-0.017
upid-0.017
angles-0.016
ret-0.015
 wins-0.015
Tokenents
Token ID658
Feature activation-4.29e-02

Pos logit contributions
 wins+0.093
osh+0.093
 recess+0.087
angles+0.083
 scores+0.081
Neg logit contributions
Ell-0.202
Da-0.178
Mill-0.177
Just-0.168
aron-0.166
Token and
Token ID290
Feature activation-3.81e-03

Pos logit contributions
angles+0.085
 wins+0.084
osh+0.084
upid+0.080
 scores+0.080
Neg logit contributions
Ell-0.060
Da-0.051
 owl-0.046
aron-0.045
age-0.044
Token adm
Token ID6178
Feature activation-1.75e-02

Pos logit contributions
angles+0.006
osh+0.006
upid+0.006
 wins+0.006
ret+0.006
Neg logit contributions
Ell-0.006
Da-0.006
 owl-0.005
Mill-0.005
 Michelle-0.005
Tokeniring
Token ID3428
Feature activation+1.25e-02

Pos logit contributions
angles+0.041
 wins+0.041
osh+0.040
upid+0.038
 scores+0.038
Neg logit contributions
Ell-0.020
Da-0.016
 owl-0.013
age-0.013
aron-0.013
Token the
Token ID262
Feature activation+1.84e-04

Pos logit contributions
Ell+0.026
Da+0.024
 owl+0.022
aron+0.022
Lu+0.022
Neg logit contributions
angles-0.016
osh-0.016
 wins-0.015
upid-0.015
 scores-0.014
Token the
Token ID262
Feature activation+2.74e-04

Pos logit contributions
Ell+0.046
Da+0.042
 owl+0.037
aron+0.037
age+0.037
Neg logit contributions
 wins-0.040
angles-0.040
osh-0.039
 scores-0.037
upid-0.037
Token tree
Token ID5509
Feature activation-1.99e-03

Pos logit contributions
Ell+0.000
Da+0.000
aron+0.000
han+0.000
Mill+0.000
Neg logit contributions
 wins-0.000
angles-0.000
osh-0.000
 scores-0.000
upid-0.000
Token with
Token ID351
Feature activation+5.68e-03

Pos logit contributions
 wins+0.004
angles+0.004
osh+0.004
 scores+0.004
upid+0.004
Neg logit contributions
Ell-0.003
Da-0.002
 owl-0.002
Bug-0.002
Mill-0.002
Token regular
Token ID3218
Feature activation+1.04e-02

Pos logit contributions
Ell+0.010
Da+0.009
 owl+0.008
Mill+0.008
Bug+0.008
Neg logit contributions
angles-0.009
 wins-0.009
osh-0.009
 scores-0.008
upid-0.008
Token or
Token ID393
Feature activation+6.74e-03

Pos logit contributions
Ell+0.020
Da+0.018
aron+0.016
Mill+0.016
 owl+0.016
Neg logit contributions
 wins-0.016
angles-0.016
osh-0.015
 scores-0.014
upid-0.014
Tokennam
Token ID7402
Feature activation-8.51e-02

Pos logit contributions
Ell+0.013
Da+0.011
 owl+0.011
Mill+0.010
 masse+0.010
Neg logit contributions
angles-0.010
osh-0.010
 wins-0.009
upid-0.009
 scores-0.009
Tokenents
Token ID658
Feature activation-4.31e-02

Pos logit contributions
osh+0.097
 wins+0.094
 recess+0.090
angles+0.086
 scores+0.084
Neg logit contributions
Ell-0.196
Da-0.173
Mill-0.173
aron-0.164
age-0.164
Token!"
Token ID2474
Feature activation-1.37e-02

Pos logit contributions
angles+0.083
osh+0.081
 wins+0.081
upid+0.077
 scores+0.077
Neg logit contributions
Ell-0.062
Da-0.054
 owl-0.049
aron-0.048
age-0.048
Token<|endoftext|>
Token ID50256
Feature activation-1.88e-02

Pos logit contributions
angles+0.018
 wins+0.018
osh+0.018
 scores+0.016
upid+0.016
Neg logit contributions
Ell-0.029
Da-0.026
 owl-0.024
aron-0.024
Bug-0.024
TokenOnce
Token ID7454
Feature activation-2.54e-02

Pos logit contributions
angles+0.033
 wins+0.033
osh+0.033
upid+0.031
 scores+0.031
Neg logit contributions
Ell-0.031
Da-0.027
 owl-0.024
aron-0.024
Mill-0.024
Token upon
Token ID2402
Feature activation+1.37e-02

Pos logit contributions
 wins+0.041
 scores+0.040
toe+0.039
angles+0.038
ws+0.037
Neg logit contributions
Ell-0.043
Da-0.039
 Betsy-0.035
Lu-0.035
 Daisy-0.035
Token box
Token ID3091
Feature activation+2.00e-02

Pos logit contributions
Ell+0.028
Da+0.025
Just+0.023
aron+0.022
Mill+0.022
Neg logit contributions
 wins-0.021
osh-0.021
angles-0.021
 scores-0.020
upid-0.019
Token full
Token ID1336
Feature activation-2.84e-02

Pos logit contributions
Ell+0.041
Da+0.039
Mic+0.037
Bug+0.037
Charl+0.036
Neg logit contributions
osh-0.026
 wins-0.026
 surf-0.026
angles-0.024
 surfing-0.024
Token of
Token ID286
Feature activation+4.29e-03

Pos logit contributions
 wins+0.033
osh+0.032
angles+0.030
 surf+0.030
upid+0.029
Neg logit contributions
Ell-0.065
Da-0.056
Just-0.055
Charl-0.055
Bug-0.054
Token beautiful
Token ID4950
Feature activation+9.35e-03

Pos logit contributions
Ell+0.008
Da+0.007
 owl+0.006
Mill+0.006
Lu+0.006
Neg logit contributions
angles-0.007
 wins-0.007
osh-0.007
upid-0.006
 scores-0.006
Token or
Token ID393
Feature activation+4.81e-03

Pos logit contributions
Ell+0.020
Da+0.018
aron+0.016
age+0.016
Bug+0.016
Neg logit contributions
 wins-0.012
angles-0.012
osh-0.012
 scores-0.011
upid-0.011
Tokennam
Token ID7402
Feature activation-8.51e-02

Pos logit contributions
Ell+0.010
Da+0.009
 owl+0.009
Mill+0.009
Lu+0.009
Neg logit contributions
angles-0.006
osh-0.006
 wins-0.006
upid-0.006
 scores-0.006
Tokenents
Token ID658
Feature activation-4.40e-02

Pos logit contributions
 wins+0.085
osh+0.084
angles+0.077
upid+0.073
 scores+0.073
Neg logit contributions
Ell-0.212
Da-0.191
Mill-0.185
Just-0.176
age-0.175
Token.
Token ID13
Feature activation-4.66e-03

Pos logit contributions
 wins+0.086
angles+0.085
osh+0.084
 scores+0.079
upid+0.078
Neg logit contributions
Ell-0.066
Da-0.058
Bug-0.050
aron-0.049
Charl-0.049
Token
Token ID220
Feature activation-4.02e-02

Pos logit contributions
 wins+0.007
angles+0.007
osh+0.007
 turns+0.007
 scores+0.007
Neg logit contributions
Ell-0.008
coon-0.007
Da-0.007
isy-0.007
aron-0.006
Token
Token ID198
Feature activation+3.06e-03

Pos logit contributions
 wins+0.077
angles+0.077
osh+0.077
 scores+0.073
upid+0.072
Neg logit contributions
Ell-0.060
Da-0.052
Lu-0.045
Mill-0.045
aron-0.045
Token
Token ID198
Feature activation+3.83e-03

Pos logit contributions
Ell+0.006
Da+0.005
 owl+0.005
aron+0.005
edly+0.005
Neg logit contributions
osh-0.004
 wins-0.004
angles-0.004
upid-0.004
 scores-0.004
Token here
Token ID994
Feature activation+3.74e-02

Pos logit contributions
Ell+0.070
Da+0.063
Mic+0.058
Bug+0.054
Lu+0.054
Neg logit contributions
 wins-0.073
angles-0.068
osh-0.067
 scores-0.065
 surf-0.065
Token and
Token ID290
Feature activation-5.39e-03

Pos logit contributions
Ell+0.068
Da+0.060
Mic+0.053
Bug+0.053
age+0.053
Neg logit contributions
 wins-0.064
angles-0.061
osh-0.059
 scores-0.056
upid-0.056
Token explore
Token ID7301
Feature activation+1.10e-02

Pos logit contributions
angles+0.007
 wins+0.007
osh+0.007
upid+0.006
 scores+0.006
Neg logit contributions
Ell-0.012
Da-0.011
 owl-0.010
aron-0.010
Mill-0.010
Token the
Token ID262
Feature activation+3.77e-03

Pos logit contributions
Ell+0.023
Da+0.020
age+0.019
 owl+0.018
aron+0.018
Neg logit contributions
 wins-0.016
angles-0.015
osh-0.015
 scores-0.014
upid-0.014
Token or
Token ID393
Feature activation+8.50e-03

Pos logit contributions
Ell+0.006
Da+0.006
 owl+0.005
Lu+0.005
aron+0.005
Neg logit contributions
angles-0.006
osh-0.006
 wins-0.006
upid-0.006
 scores-0.006
Tokennam
Token ID7402
Feature activation-8.50e-02

Pos logit contributions
Ell+0.016
Da+0.014
 owl+0.013
Mill+0.013
Bug+0.013
Neg logit contributions
angles-0.013
osh-0.013
upid-0.012
 wins-0.012
 scores-0.012
Tokenents
Token ID658
Feature activation-3.70e-02

Pos logit contributions
 wins+0.089
osh+0.087
 recess+0.082
angles+0.081
 scores+0.079
Neg logit contributions
Ell-0.201
Da-0.180
Mill-0.177
age-0.171
Just-0.171
Token."
Token ID526
Feature activation-8.15e-03

Pos logit contributions
angles+0.067
 wins+0.066
osh+0.065
upid+0.062
 scores+0.062
Neg logit contributions
Ell-0.058
Da-0.051
 owl-0.046
aron-0.045
age-0.045
Token
Token ID198
Feature activation+1.82e-03

Pos logit contributions
 wins+0.012
osh+0.011
angles+0.011
 scores+0.011
upid+0.010
Neg logit contributions
Ell-0.016
Da-0.014
 owl-0.014
age-0.013
isy-0.013
Token
Token ID198
Feature activation+2.49e-03

Pos logit contributions
Ell+0.004
 owl+0.003
Da+0.003
han+0.003
 calf+0.003
Neg logit contributions
osh-0.002
 wins-0.002
angles-0.002
upid-0.002
 scores-0.002
TokenDa
Token ID26531
Feature activation+4.56e-02

Pos logit contributions
 owl+0.004
Ell+0.004
 calf+0.003
han+0.003
 Michelle+0.003
Neg logit contributions
osh-0.004
 wins-0.004
angles-0.003
 scores-0.003
 turns-0.003
Token the
Token ID262
Feature activation+4.55e-03

Pos logit contributions
Ell+0.006
 Betsy+0.006
Da+0.006
 Hope+0.006
 owl+0.006
Neg logit contributions
osh-0.009
angles-0.008
upid-0.008
 wins-0.008
isel-0.008
Token rest
Token ID1334
Feature activation-1.17e-02

Pos logit contributions
Ell+0.007
Da+0.006
 owl+0.006
aron+0.005
Mill+0.005
Neg logit contributions
angles-0.008
 wins-0.008
osh-0.008
upid-0.008
 scores-0.008
Token of
Token ID286
Feature activation+4.31e-03

Pos logit contributions
angles+0.022
 wins+0.022
osh+0.022
 scores+0.021
upid+0.021
Neg logit contributions
Ell-0.018
Da-0.016
 owl-0.014
aron-0.014
age-0.014
Token the
Token ID262
Feature activation+4.29e-03

Pos logit contributions
Ell+0.008
Da+0.007
 Betsy+0.007
 Bug+0.007
 owl+0.007
Neg logit contributions
osh-0.007
angles-0.007
 wins-0.007
upid-0.007
 scores-0.006
Token or
Token ID393
Feature activation+1.03e-02

Pos logit contributions
Ell+0.007
Da+0.006
aron+0.006
Mill+0.006
age+0.006
Neg logit contributions
 wins-0.007
angles-0.007
osh-0.007
 scores-0.007
upid-0.007
Tokennam
Token ID7402
Feature activation-8.50e-02

Pos logit contributions
Ell+0.019
 owl+0.017
Da+0.017
 Michelle+0.015
 Hope+0.015
Neg logit contributions
upid-0.016
osh-0.015
angles-0.015
 wins-0.015
ret-0.014
Tokenents
Token ID658
Feature activation-3.58e-02

Pos logit contributions
 wins+0.100
osh+0.100
 recess+0.091
angles+0.090
 scores+0.089
Neg logit contributions
Ell-0.190
Da-0.167
Mill-0.165
Just-0.160
age-0.159
Token onto
Token ID4291
Feature activation+1.49e-02

Pos logit contributions
angles+0.071
 wins+0.069
osh+0.068
upid+0.067
 scores+0.066
Neg logit contributions
Ell-0.050
Da-0.043
age-0.039
 owl-0.039
aron-0.037
Token the
Token ID262
Feature activation+3.92e-03

Pos logit contributions
Ell+0.026
Da+0.023
 Betsy+0.022
 Hope+0.022
 owl+0.022
Neg logit contributions
osh-0.025
 wins-0.025
angles-0.025
upid-0.024
inch-0.022
Token tree
Token ID5509
Feature activation+2.37e-03

Pos logit contributions
Ell+0.007
Da+0.007
han+0.006
Mill+0.006
aron+0.006
Neg logit contributions
 wins-0.006
angles-0.006
osh-0.006
 scores-0.005
 recess-0.005
Token.
Token ID13
Feature activation-2.40e-04

Pos logit contributions
Ell+0.003
Da+0.003
age+0.002
 owl+0.002
aron+0.002
Neg logit contributions
angles-0.005
osh-0.005
 wins-0.005
upid-0.005
 scores-0.005
Token to
Token ID284
Feature activation-8.02e-03

Pos logit contributions
 wins+0.015
 scores+0.015
 recess+0.014
angles+0.014
osh+0.014
Neg logit contributions
Ell-0.018
Da-0.016
Bug-0.016
Mic-0.015
Twe-0.014
Token help
Token ID1037
Feature activation+3.33e-02

Pos logit contributions
 wins+0.007
osh+0.007
upid+0.006
angles+0.006
 recess+0.006
Neg logit contributions
Ell-0.021
Da-0.019
aron-0.018
Bug-0.018
Lu-0.018
Token sort
Token ID3297
Feature activation+1.70e-02

Pos logit contributions
Ell+0.065
Da+0.058
Bug+0.052
aron+0.052
age+0.052
Neg logit contributions
 wins-0.049
angles-0.049
osh-0.048
 scores-0.045
upid-0.045
Token the
Token ID262
Feature activation-1.23e-03

Pos logit contributions
Ell+0.029
Da+0.026
 owl+0.023
Lu+0.023
han+0.023
Neg logit contributions
osh-0.029
 wins-0.028
angles-0.028
upid-0.027
 scores-0.026
Token or
Token ID393
Feature activation+1.09e-02

Pos logit contributions
angles+0.002
osh+0.002
 wins+0.002
upid+0.002
 scores+0.002
Neg logit contributions
Ell-0.002
Da-0.002
 owl-0.002
aron-0.002
Mill-0.002
Tokennam
Token ID7402
Feature activation-8.47e-02

Pos logit contributions
Ell+0.022
Da+0.019
 owl+0.018
Mill+0.018
Lu+0.017
Neg logit contributions
osh-0.015
angles-0.015
 wins-0.015
upid-0.014
 scores-0.014
Tokenents
Token ID658
Feature activation-4.48e-02

Pos logit contributions
osh+0.089
 wins+0.088
 recess+0.081
upid+0.079
angles+0.078
Neg logit contributions
Ell-0.205
Da-0.182
Mill-0.177
Just-0.170
age-0.167
Token.
Token ID13
Feature activation-1.53e-03

Pos logit contributions
angles+0.089
 wins+0.087
osh+0.087
upid+0.083
 scores+0.083
Neg logit contributions
Ell-0.063
Da-0.054
 owl-0.049
age-0.047
aron-0.047
Token She
Token ID1375
Feature activation+2.77e-03

Pos logit contributions
angles+0.004
osh+0.004
 wins+0.003
upid+0.003
 scores+0.003
Neg logit contributions
Ell-0.002
Da-0.001
 owl-0.001
aron-0.001
Lu-0.001
Token picked
Token ID6497
Feature activation+4.48e-03

Pos logit contributions
Ell+0.005
Da+0.004
 owl+0.004
aron+0.004
Mill+0.004
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
upid-0.004
 scores-0.004
Token up
Token ID510
Feature activation-4.26e-03

Pos logit contributions
Ell+0.008
Da+0.007
 owl+0.007
 Michelle+0.007
Mill+0.007
Neg logit contributions
osh-0.007
 wins-0.007
angles-0.007
upid-0.007
 scores-0.006
Token beautiful
Token ID4950
Feature activation+8.79e-03

Pos logit contributions
angles+0.007
osh+0.007
 wins+0.007
yles+0.007
upid+0.007
Neg logit contributions
Ell-0.007
 owl-0.006
Da-0.006
isy-0.005
 masse-0.005
Token garden
Token ID11376
Feature activation+1.63e-02

Pos logit contributions
Ell+0.016
Da+0.015
Bug+0.013
Just+0.013
aron+0.013
Neg logit contributions
angles-0.013
 wins-0.013
osh-0.013
 scores-0.013
upid-0.012
Token with
Token ID351
Feature activation+7.92e-03

Pos logit contributions
Ell+0.024
Da+0.021
Bug+0.019
Mic+0.019
Charl+0.018
Neg logit contributions
 wins-0.033
angles-0.032
osh-0.031
 scores-0.031
 surf-0.029
Token the
Token ID262
Feature activation+2.04e-03

Pos logit contributions
Ell+0.015
Da+0.013
 owl+0.012
Mill+0.012
Lu+0.012
Neg logit contributions
 wins-0.013
angles-0.013
osh-0.013
upid-0.012
 scores-0.012
Token or
Token ID393
Feature activation+8.78e-03

Pos logit contributions
Ell+0.004
Da+0.003
aron+0.003
Mill+0.003
 owl+0.003
Neg logit contributions
 wins-0.003
angles-0.003
osh-0.003
 scores-0.003
upid-0.003
Tokennam
Token ID7402
Feature activation-8.43e-02

Pos logit contributions
Ell+0.019
Da+0.017
 owl+0.016
Mill+0.016
Lu+0.015
Neg logit contributions
angles-0.011
osh-0.011
 wins-0.011
upid-0.010
 scores-0.010
Tokenents
Token ID658
Feature activation-3.75e-02

Pos logit contributions
osh+0.094
 wins+0.092
 recess+0.085
angles+0.083
 scores+0.081
Neg logit contributions
Ell-0.198
Da-0.177
Mill-0.174
aron-0.164
age-0.164
Token.
Token ID13
Feature activation+1.86e-03

Pos logit contributions
angles+0.078
osh+0.077
 wins+0.077
upid+0.074
 scores+0.073
Neg logit contributions
Ell-0.048
Da-0.040
 owl-0.037
age-0.036
aron-0.035
Token
Token ID198
Feature activation+6.75e-03

Pos logit contributions
Ell+0.003
Da+0.002
 owl+0.002
Lu+0.002
Mill+0.002
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
 scores-0.004
upid-0.004
Token
Token ID198
Feature activation+7.29e-03

Pos logit contributions
Ell+0.011
Da+0.009
 owl+0.008
Mill+0.008
aron+0.008
Neg logit contributions
angles-0.013
 wins-0.013
osh-0.012
 scores-0.012
upid-0.012
TokenThe
Token ID464
Feature activation-3.47e-03

Pos logit contributions
Ell+0.011
Da+0.009
aron+0.009
 owl+0.009
han+0.009
Neg logit contributions
osh-0.013
angles-0.013
 wins-0.013
upid-0.012
 scores-0.012
Token with
Token ID351
Feature activation+8.79e-03

Pos logit contributions
 wins+0.004
angles+0.004
osh+0.004
 scores+0.003
 surf+0.003
Neg logit contributions
Ell-0.003
Da-0.003
Mic-0.002
Bug-0.002
Charl-0.002
Token lots
Token ID6041
Feature activation-3.67e-02

Pos logit contributions
Ell+0.015
Da+0.014
 owl+0.012
Mill+0.012
Lu+0.012
Neg logit contributions
 wins-0.015
angles-0.015
osh-0.015
upid-0.014
 scores-0.014
Token of
Token ID286
Feature activation+3.97e-03

Pos logit contributions
 wins+0.062
angles+0.061
osh+0.061
 scores+0.057
upid+0.057
Neg logit contributions
Ell-0.065
Da-0.057
 owl-0.050
Bug-0.050
Mill-0.050
Token pretty
Token ID2495
Feature activation+1.10e-02

Pos logit contributions
Ell+0.007
Da+0.006
 owl+0.006
aron+0.006
Mill+0.006
Neg logit contributions
angles-0.006
 wins-0.006
osh-0.006
upid-0.006
 scores-0.006
Token or
Token ID393
Feature activation+7.56e-03

Pos logit contributions
Ell+0.021
Da+0.019
 owl+0.017
aron+0.017
age+0.017
Neg logit contributions
angles-0.016
 wins-0.016
osh-0.016
 scores-0.015
upid-0.015
Tokennam
Token ID7402
Feature activation-8.42e-02

Pos logit contributions
Ell+0.016
Da+0.014
 owl+0.014
 masse+0.013
Mill+0.013
Neg logit contributions
angles-0.010
osh-0.010
 wins-0.010
upid-0.010
 scores-0.009
Tokenents
Token ID658
Feature activation-3.84e-02

Pos logit contributions
osh+0.097
 wins+0.095
angles+0.086
 recess+0.086
 scores+0.084
Neg logit contributions
Ell-0.193
Mill-0.170
Da-0.169
aron-0.161
age-0.161
Token.
Token ID13
Feature activation+3.36e-03

Pos logit contributions
angles+0.077
 wins+0.076
osh+0.076
upid+0.073
 scores+0.072
Neg logit contributions
Ell-0.053
Da-0.046
 owl-0.041
aron-0.040
age-0.040
Token One
Token ID1881
Feature activation-1.06e-02

Pos logit contributions
Ell+0.006
Da+0.005
 owl+0.005
Lu+0.005
aron+0.005
Neg logit contributions
 wins-0.006
angles-0.006
osh-0.006
upid-0.005
 scores-0.005
Token day
Token ID1110
Feature activation+1.92e-02

Pos logit contributions
angles+0.015
 wins+0.015
osh+0.015
upid+0.014
 scores+0.014
Neg logit contributions
Ell-0.021
Da-0.019
 owl-0.017
aron-0.017
Mill-0.017
Token,
Token ID11
Feature activation+3.65e-03

Pos logit contributions
Ell+0.030
Da+0.026
 owl+0.023
aron+0.022
Mill+0.022
Neg logit contributions
angles-0.036
 wins-0.036
osh-0.036
 scores-0.034
upid-0.034
Token park
Token ID3952
Feature activation-6.39e-03

Pos logit contributions
angles+0.010
 wins+0.010
osh+0.010
upid+0.009
yles+0.009
Neg logit contributions
Ell-0.009
Da-0.008
 owl-0.007
Lu-0.007
age-0.007
Token filled
Token ID5901
Feature activation-2.16e-02

Pos logit contributions
 wins+0.012
angles+0.012
osh+0.012
 scores+0.011
upid+0.011
Neg logit contributions
Ell-0.010
Da-0.009
 owl-0.007
aron-0.007
Mill-0.007
Token with
Token ID351
Feature activation+6.87e-03

Pos logit contributions
 wins+0.046
osh+0.046
angles+0.045
 scores+0.044
 surf+0.042
Neg logit contributions
Ell-0.028
Da-0.023
 owl-0.021
age-0.021
 masse-0.020
Token colourful
Token ID44799
Feature activation+1.10e-02

Pos logit contributions
Ell+0.012
Da+0.011
 owl+0.010
Mill+0.010
Lu+0.010
Neg logit contributions
angles-0.011
 wins-0.011
osh-0.011
upid-0.010
ret-0.010
Token or
Token ID393
Feature activation+6.93e-03

Pos logit contributions
Ell+0.023
Da+0.021
 owl+0.019
aron+0.019
Mill+0.019
Neg logit contributions
 wins-0.015
angles-0.014
osh-0.014
 scores-0.013
upid-0.013
Tokennam
Token ID7402
Feature activation-8.40e-02

Pos logit contributions
Ell+0.013
Da+0.011
 owl+0.010
Mill+0.010
Bug+0.010
Neg logit contributions
angles-0.012
osh-0.011
 wins-0.011
upid-0.011
ret-0.010
Tokenents
Token ID658
Feature activation-4.25e-02

Pos logit contributions
 wins+0.090
osh+0.088
angles+0.082
 recess+0.081
 scores+0.078
Neg logit contributions
Ell-0.203
Da-0.179
Mill-0.176
age-0.168
Just-0.168
Token.
Token ID13
Feature activation-6.00e-03

Pos logit contributions
 wins+0.081
angles+0.081
osh+0.080
 scores+0.075
upid+0.075
Neg logit contributions
Ell-0.065
Da-0.056
 owl-0.049
aron-0.048
Mill-0.048
Token Jimmy
Token ID12963
Feature activation-1.34e-02

Pos logit contributions
angles+0.010
 wins+0.010
osh+0.010
upid+0.010
 scores+0.010
Neg logit contributions
Ell-0.010
Da-0.009
aron-0.008
 owl-0.008
Lu-0.008
Token looked
Token ID3114
Feature activation+1.47e-02

Pos logit contributions
angles+0.023
 wins+0.022
osh+0.022
upid+0.021
 scores+0.021
Neg logit contributions
Ell-0.023
Da-0.020
 owl-0.019
aron-0.018
age-0.018
Token out
Token ID503
Feature activation-6.55e-03

Pos logit contributions
Ell+0.026
Da+0.023
age+0.021
Bug+0.020
 owl+0.020
Neg logit contributions
 wins-0.025
angles-0.025
osh-0.024
 scores-0.023
upid-0.023
Token He
Token ID679
Feature activation-7.23e-03

Pos logit contributions
angles+0.010
 wins+0.010
osh+0.010
 scores+0.010
upid+0.010
Neg logit contributions
Ell-0.008
Da-0.007
 owl-0.006
aron-0.006
Mill-0.006
Token quickly
Token ID2952
Feature activation+1.91e-02

Pos logit contributions
angles+0.013
upid+0.012
 wins+0.012
osh+0.012
 recess+0.011
Neg logit contributions
Ell-0.012
Da-0.011
 owl-0.010
aron-0.009
Lu-0.009
Token hung
Token ID9174
Feature activation-1.57e-02

Pos logit contributions
Ell+0.033
Da+0.029
 owl+0.027
aron+0.027
age+0.026
Neg logit contributions
angles-0.032
 wins-0.032
osh-0.031
upid-0.030
 scores-0.029
Token colorful
Token ID20239
Feature activation+1.11e-02

Pos logit contributions
 wins+0.022
osh+0.022
angles+0.022
upid+0.021
 scores+0.021
Neg logit contributions
Ell-0.028
Da-0.025
 owl-0.025
aron-0.024
Mill-0.023
Token or
Token ID393
Feature activation+1.08e-02

Pos logit contributions
Ell+0.023
Da+0.020
 owl+0.019
aron+0.019
Mill+0.018
Neg logit contributions
angles-0.015
 wins-0.015
osh-0.015
upid-0.014
 scores-0.014
Tokennam
Token ID7402
Feature activation-8.40e-02

Pos logit contributions
Ell+0.019
Da+0.016
 owl+0.016
Just+0.015
aron+0.015
Neg logit contributions
upid-0.017
angles-0.017
osh-0.017
 wins-0.016
inch-0.016
Tokenents
Token ID658
Feature activation-3.78e-02

Pos logit contributions
 wins+0.092
osh+0.087
 recess+0.082
angles+0.080
 scores+0.079
Neg logit contributions
Ell-0.200
Da-0.177
Mill-0.174
Just-0.167
aron-0.165
Token and
Token ID290
Feature activation-3.29e-03

Pos logit contributions
angles+0.080
 wins+0.079
osh+0.078
upid+0.076
 scores+0.076
Neg logit contributions
Ell-0.047
Da-0.039
 owl-0.035
aron-0.035
age-0.034
Token tw
Token ID665
Feature activation+1.46e-03

Pos logit contributions
upid+0.006
angles+0.006
 wins+0.005
osh+0.005
ret+0.005
Neg logit contributions
Ell-0.006
Da-0.005
 owl-0.005
aron-0.004
Mill-0.004
Tokenink
Token ID676
Feature activation-3.90e-03

Pos logit contributions
Ell+0.002
aron+0.002
Da+0.002
Mill+0.002
Just+0.002
Neg logit contributions
 wins-0.003
osh-0.003
upid-0.003
 scores-0.003
 recess-0.003
Tokenling
Token ID1359
Feature activation+3.85e-02

Pos logit contributions
angles+0.008
 wins+0.008
osh+0.008
upid+0.008
 scores+0.008
Neg logit contributions
Ell-0.005
Da-0.004
 owl-0.004
aron-0.004
age-0.004
Token much
Token ID881
Feature activation-5.90e-02

Pos logit contributions
angles+0.008
 wins+0.008
osh+0.007
upid+0.007
 scores+0.007
Neg logit contributions
Ell-0.011
Da-0.010
Mill-0.009
Just-0.009
Lu-0.009
Token fun
Token ID1257
Feature activation-2.69e-02

Pos logit contributions
 wins+0.061
osh+0.055
angles+0.054
 scores+0.053
 recess+0.050
Neg logit contributions
Ell-0.146
Da-0.132
Bug-0.124
age-0.124
aron-0.123
Token.
Token ID13
Feature activation-8.34e-03

Pos logit contributions
 wins+0.047
 scores+0.043
osh+0.043
angles+0.043
 recess+0.041
Neg logit contributions
Ell-0.049
Da-0.044
Bug-0.040
Mic-0.040
 masse-0.038
Token The
Token ID383
Feature activation+9.06e-05

Pos logit contributions
 wins+0.014
angles+0.014
osh+0.014
 scores+0.013
upid+0.013
Neg logit contributions
Ell-0.014
Da-0.013
 owl-0.012
aron-0.011
age-0.011
Token all
Token ID477
Feature activation-4.31e-02

Pos logit contributions
Ell+0.000
Da+0.000
 owl+0.000
isy+0.000
 masse+0.000
Neg logit contributions
angles-0.000
 wins-0.000
osh-0.000
upid-0.000
yles-0.000
Tokenigator
Token ID23823
Feature activation+8.55e-02

Pos logit contributions
angles+0.079
 wins+0.076
osh+0.073
upid+0.073
 scores+0.073
Neg logit contributions
Ell-0.069
Da-0.060
 owl-0.055
Lu-0.054
age-0.052
Token and
Token ID290
Feature activation-3.95e-03

Pos logit contributions
Ell+0.162
Da+0.146
aron+0.130
Mill+0.130
 owl+0.129
Neg logit contributions
 wins-0.133
osh-0.131
angles-0.129
 scores-0.122
upid-0.118
Token the
Token ID262
Feature activation+6.74e-04

Pos logit contributions
angles+0.005
 wins+0.005
osh+0.005
upid+0.005
 scores+0.005
Neg logit contributions
Ell-0.008
Da-0.008
 owl-0.007
aron-0.007
Mill-0.007
Token bird
Token ID6512
Feature activation-6.08e-03

Pos logit contributions
Ell+0.001
Da+0.001
 owl+0.001
aron+0.001
Mill+0.001
Neg logit contributions
 wins-0.001
angles-0.001
osh-0.001
 scores-0.001
upid-0.001
Token were
Token ID547
Feature activation-7.30e-03

Pos logit contributions
angles+0.009
 wins+0.008
osh+0.008
upid+0.008
 scores+0.008
Neg logit contributions
Ell-0.012
Da-0.011
 owl-0.010
age-0.010
Bug-0.010
Token very
Token ID845
Feature activation+7.04e-03

Pos logit contributions
angles+0.012
 wins+0.012
osh+0.012
upid+0.011
 scores+0.011
Neg logit contributions
Ell-0.013
Da-0.011
 owl-0.010
Mill-0.010
aron-0.010
Token joy
Token ID8716
Feature activation+5.13e-03

Pos logit contributions
Ell+0.016
Da+0.015
 owl+0.013
 Michelle+0.013
Charl+0.013
Neg logit contributions
angles-0.011
 wins-0.010
osh-0.010
yles-0.009
upid-0.009
Token.
Token ID13
Feature activation-2.27e-03

Pos logit contributions
Ell+0.009
Da+0.008
Bug+0.007
Mic+0.007
 owl+0.007
Neg logit contributions
 wins-0.009
angles-0.009
osh-0.009
 scores-0.008
upid-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.60e-02

Pos logit contributions
angles+0.005
osh+0.004
 wins+0.004
upid+0.004
 scores+0.004
Neg logit contributions
Ell-0.003
Da-0.003
Mill-0.002
Lu-0.002
Charl-0.002
TokenThe
Token ID464
Feature activation-4.62e-03

Pos logit contributions
angles+0.028
 wins+0.027
osh+0.027
upid+0.025
 scores+0.025
Neg logit contributions
Ell-0.027
Da-0.024
 owl-0.021
aron-0.021
Mill-0.021
Token all
Token ID477
Feature activation-4.04e-02

Pos logit contributions
 wins+0.008
osh+0.007
angles+0.007
 scores+0.007
upid+0.007
Neg logit contributions
Ell-0.008
Da-0.007
aron-0.007
age-0.007
Mill-0.006
Tokenigator
Token ID23823
Feature activation+8.69e-02

Pos logit contributions
angles+0.063
 wins+0.062
osh+0.061
 scores+0.057
upid+0.057
Neg logit contributions
Ell-0.076
Da-0.068
 owl-0.062
aron-0.060
Lu-0.060
Token was
Token ID373
Feature activation+9.73e-04

Pos logit contributions
Ell+0.171
Da+0.152
aron+0.139
 masse+0.138
Mic+0.138
Neg logit contributions
osh-0.135
 wins-0.135
angles-0.123
 scores-0.121
 surf-0.112
Token fair
Token ID3148
Feature activation+1.31e-02

Pos logit contributions
Ell+0.002
Da+0.002
han+0.001
coon+0.001
isy+0.001
Neg logit contributions
osh-0.001
 wins-0.001
angles-0.001
upid-0.001
 scores-0.001
Token and
Token ID290
Feature activation-3.01e-03

Pos logit contributions
Ell+0.022
Da+0.019
Bug+0.017
Mic+0.017
Lu+0.016
Neg logit contributions
 wins-0.024
osh-0.024
angles-0.023
 scores-0.023
upid-0.022
Token happy
Token ID3772
Feature activation-7.90e-03

Pos logit contributions
angles+0.005
 wins+0.005
osh+0.005
 scores+0.005
upid+0.005
Neg logit contributions
Ell-0.005
Da-0.005
 owl-0.004
aron-0.004
Mill-0.004
Token.
Token ID13
Feature activation+2.99e-03

Pos logit contributions
 wins+0.013
osh+0.012
 scores+0.012
angles+0.012
 recess+0.011
Neg logit contributions
Ell-0.015
Da-0.013
Mic-0.012
Bug-0.012
han-0.012
Token I
Token ID314
Feature activation-5.02e-03

Pos logit contributions
Ell+0.004
Da+0.004
age+0.003
aron+0.003
Mill+0.003
Neg logit contributions
 wins-0.004
angles-0.004
osh-0.004
 scores-0.004
upid-0.004
Token did
Token ID750
Feature activation-1.27e-02

Pos logit contributions
angles+0.008
 wins+0.007
osh+0.007
upid+0.007
 scores+0.007
Neg logit contributions
Ell-0.010
Da-0.008
age-0.008
 owl-0.008
Just-0.008
Token.
Token ID13
Feature activation+5.51e-03

Pos logit contributions
 wins+0.023
angles+0.022
osh+0.022
 scores+0.021
upid+0.021
Neg logit contributions
Ell-0.021
Da-0.019
 owl-0.016
aron-0.016
Bug-0.016
Token I
Token ID314
Feature activation-4.84e-03

Pos logit contributions
Ell+0.010
Da+0.009
Mill+0.008
isy+0.008
eh+0.007
Neg logit contributions
 wins-0.010
osh-0.009
angles-0.009
upid-0.009
 scores-0.008
Token sne
Token ID10505
Feature activation-5.88e-03

Pos logit contributions
angles+0.008
 wins+0.007
osh+0.007
upid+0.007
 scores+0.007
Neg logit contributions
Ell-0.009
Da-0.008
age-0.008
aron-0.007
 owl-0.007
Tokenaked
Token ID4335
Feature activation+6.27e-02

Pos logit contributions
angles+0.011
 wins+0.010
osh+0.010
yles+0.010
 scores+0.010
Neg logit contributions
Ell-0.009
Da-0.009
age-0.008
aron-0.008
 owl-0.007
Token into
Token ID656
Feature activation+2.52e-02

Pos logit contributions
Ell+0.107
Da+0.096
 owl+0.091
aron+0.088
Lu+0.085
Neg logit contributions
angles-0.105
osh-0.103
 wins-0.102
upid-0.097
yles-0.094
Token the
Token ID262
Feature activation+5.69e-03

Pos logit contributions
Ell+0.043
Da+0.040
 owl+0.039
Mill+0.036
Lu+0.035
Neg logit contributions
angles-0.043
osh-0.041
 wins-0.039
upid-0.037
inch-0.036
Token attic
Token ID46113
Feature activation+1.52e-02

Pos logit contributions
Ell+0.010
Da+0.009
aron+0.008
Mill+0.008
Bug+0.008
Neg logit contributions
 wins-0.009
angles-0.009
osh-0.009
 scores-0.009
 recess-0.009
Token one
Token ID530
Feature activation-7.98e-03

Pos logit contributions
Ell+0.023
Da+0.020
 owl+0.018
aron+0.017
Mill+0.017
Neg logit contributions
 wins-0.029
angles-0.029
osh-0.028
 scores-0.027
upid-0.027
Token night
Token ID1755
Feature activation+3.85e-03

Pos logit contributions
angles+0.014
 wins+0.013
osh+0.013
upid+0.013
 scores+0.013
Neg logit contributions
Ell-0.014
Da-0.012
 owl-0.011
aron-0.011
age-0.011
Token.
Token ID13
Feature activation+9.69e-03

Pos logit contributions
Ell+0.043
Da+0.037
Mill+0.034
Lu+0.032
Bug+0.032
Neg logit contributions
 wins-0.047
angles-0.045
osh-0.044
 scores-0.044
upid-0.042
Token They
Token ID1119
Feature activation-3.76e-03

Pos logit contributions
Ell+0.019
Da+0.016
 owl+0.016
age+0.015
isy+0.015
Neg logit contributions
 wins-0.015
osh-0.014
angles-0.014
upid-0.013
 scores-0.013
Token play
Token ID711
Feature activation-1.40e-02

Pos logit contributions
angles+0.005
osh+0.005
 wins+0.004
yles+0.004
upid+0.004
Neg logit contributions
Ell-0.008
Da-0.007
 owl-0.007
aron-0.007
Bug-0.006
Token hide
Token ID7808
Feature activation+2.59e-02

Pos logit contributions
 wins+0.030
angles+0.029
osh+0.029
 scores+0.028
 recess+0.026
Neg logit contributions
Ell-0.020
Da-0.016
 masse-0.015
Mill-0.015
 Michelle-0.014
Token and
Token ID290
Feature activation+1.01e-02

Pos logit contributions
Ell+0.033
Da+0.027
 owl+0.023
age+0.023
Mill+0.023
Neg logit contributions
 wins-0.056
angles-0.056
osh-0.055
 scores-0.053
upid-0.052
Token seek
Token ID5380
Feature activation+8.14e-02

Pos logit contributions
Ell+0.012
Da+0.010
aron+0.008
 owl+0.008
Mill+0.008
Neg logit contributions
 wins-0.023
angles-0.023
osh-0.023
 scores-0.022
upid-0.021
Token and
Token ID290
Feature activation+1.02e-02

Pos logit contributions
Ell+0.141
Da+0.121
Mill+0.111
age+0.110
Mic+0.109
Neg logit contributions
 wins-0.147
angles-0.140
osh-0.139
 scores-0.135
 recess-0.127
Token tag
Token ID7621
Feature activation+1.22e-02

Pos logit contributions
Ell+0.020
Da+0.018
 owl+0.016
aron+0.016
Mill+0.016
Neg logit contributions
angles-0.015
 wins-0.015
osh-0.015
 scores-0.014
upid-0.014
Token with
Token ID351
Feature activation+1.88e-02

Pos logit contributions
Ell+0.020
Da+0.017
Mill+0.015
 owl+0.015
Bug+0.015
Neg logit contributions
 wins-0.022
angles-0.022
osh-0.022
 scores-0.021
upid-0.020
Token it
Token ID340
Feature activation-4.77e-03

Pos logit contributions
Ell+0.033
Da+0.031
 owl+0.028
 Hope+0.028
 Bug+0.027
Neg logit contributions
angles-0.030
osh-0.029
 wins-0.028
upid-0.028
yles-0.027
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
 wins+0.009
angles+0.008
osh+0.008
 scores+0.008
upid+0.008
Neg logit contributions
Ell-0.008
Da-0.007
Mill-0.006
han-0.006
 owl-0.006
Token to
Token ID284
Feature activation-8.47e-03

Pos logit contributions
osh+0.042
 wins+0.042
angles+0.040
upid+0.038
 scores+0.038
Neg logit contributions
Ell-0.050
Da-0.044
Mic-0.041
Bug-0.041
Lu-0.040
Token rise
Token ID4485
Feature activation+6.09e-03

Pos logit contributions
angles+0.013
 wins+0.013
osh+0.013
upid+0.012
 scores+0.012
Neg logit contributions
Ell-0.016
Da-0.015
 owl-0.013
aron-0.013
Mill-0.013
Token!
Token ID0
Feature activation-9.72e-03

Pos logit contributions
Ell+0.011
Da+0.010
Mic+0.009
han+0.009
Charl+0.009
Neg logit contributions
 wins-0.010
osh-0.010
angles-0.010
 scores-0.009
upid-0.009
Token Owen
Token ID22605
Feature activation-1.77e-02

Pos logit contributions
angles+0.016
 wins+0.016
osh+0.016
 scores+0.015
upid+0.015
Neg logit contributions
Ell-0.017
Da-0.015
 owl-0.014
aron-0.014
Bug-0.014
Token be
Token ID307
Feature activation+1.22e-02

Pos logit contributions
 wins+0.028
osh+0.028
angles+0.028
 scores+0.026
upid+0.026
Neg logit contributions
Ell-0.032
Da-0.029
 owl-0.026
aron-0.025
Mill-0.025
Tokenamed
Token ID2434
Feature activation+7.02e-02

Pos logit contributions
Ell+0.016
Da+0.013
Bug+0.012
 masse+0.012
Mic+0.012
Neg logit contributions
osh-0.025
 wins-0.025
angles-0.025
upid-0.024
 scores-0.023
Token with
Token ID351
Feature activation+5.33e-03

Pos logit contributions
Ell+0.100
Da+0.082
Bug+0.073
Mic+0.072
age+0.071
Neg logit contributions
osh-0.140
 wins-0.138
angles-0.135
 scores-0.132
 surf-0.130
Token excitement
Token ID14067
Feature activation+1.55e-02

Pos logit contributions
Ell+0.011
Da+0.010
 owl+0.009
Mill+0.009
 Hope+0.009
Neg logit contributions
angles-0.008
 wins-0.008
osh-0.007
yles-0.007
 scores-0.007
Token.
Token ID13
Feature activation-1.63e-03

Pos logit contributions
Ell+0.024
Da+0.020
Bug+0.018
Mic+0.017
Lu+0.017
Neg logit contributions
 wins-0.030
angles-0.030
osh-0.029
 scores-0.028
upid-0.028
Token His
Token ID2399
Feature activation+2.21e-02

Pos logit contributions
angles+0.004
 wins+0.003
osh+0.003
 scores+0.003
 recess+0.003
Neg logit contributions
Ell-0.002
Da-0.002
aron-0.002
age-0.002
 owl-0.002
Token mom
Token ID1995
Feature activation-7.52e-03

Pos logit contributions
 owl+0.033
Ell+0.031
age+0.028
isy+0.028
Just+0.027
Neg logit contributions
angles-0.041
 wins-0.040
upid-0.038
osh-0.038
yles-0.037
Token shouted
Token ID17293
Feature activation+6.30e-03

Pos logit contributions
Ell+0.006
Da+0.005
age+0.005
Lu+0.005
Mill+0.005
Neg logit contributions
 wins-0.005
osh-0.005
angles-0.005
 scores-0.004
upid-0.004
Token.
Token ID13
Feature activation+1.11e-02

Pos logit contributions
Ell+0.010
Da+0.009
 owl+0.008
Mill+0.008
aron+0.008
Neg logit contributions
 wins-0.012
angles-0.011
osh-0.011
upid-0.011
 scores-0.011
Token
Token ID198
Feature activation+9.15e-03

Pos logit contributions
Ell+0.015
Da+0.014
Just+0.012
aron+0.012
 owl+0.012
Neg logit contributions
angles-0.023
 wins-0.022
osh-0.022
 scores-0.021
upid-0.021
Token
Token ID198
Feature activation+9.32e-03

Pos logit contributions
Ell+0.013
Da+0.011
Just+0.009
Mill+0.009
Lu+0.009
Neg logit contributions
angles-0.019
 wins-0.019
osh-0.018
 scores-0.018
upid-0.017
Token"
Token ID1
Feature activation+2.00e-02

Pos logit contributions
Ell+0.011
Da+0.010
 owl+0.008
Lu+0.008
aron+0.008
Neg logit contributions
angles-0.021
 wins-0.021
osh-0.021
 scores-0.020
upid-0.020
TokenHelp
Token ID22087
Feature activation+1.95e-02

Pos logit contributions
Ell+0.031
Da+0.027
aron+0.025
 owl+0.025
age+0.025
Neg logit contributions
osh-0.036
 wins-0.036
angles-0.036
 scores-0.034
upid-0.033
Token!
Token ID0
Feature activation-5.86e-03

Pos logit contributions
Ell+0.032
Da+0.027
aron+0.024
age+0.024
Mill+0.024
Neg logit contributions
 wins-0.035
osh-0.035
angles-0.035
upid-0.034
 scores-0.033
Token Help
Token ID10478
Feature activation+1.38e-02

Pos logit contributions
angles+0.012
 wins+0.011
osh+0.011
 scores+0.011
yles+0.011
Neg logit contributions
Ell-0.008
Da-0.007
 owl-0.006
Bug-0.006
 Bug-0.006
Token!
Token ID0
Feature activation-5.25e-03

Pos logit contributions
Ell+0.021
Da+0.018
 owl+0.016
aron+0.016
Mill+0.015
Neg logit contributions
angles-0.027
 wins-0.027
osh-0.026
upid-0.025
 scores-0.025
Token My
Token ID2011
Feature activation+3.35e-03

Pos logit contributions
angles+0.010
 wins+0.009
osh+0.009
 scores+0.009
 recess+0.009
Neg logit contributions
Ell-0.008
Da-0.007
Bug-0.007
 owl-0.007
age-0.007
Token friend
Token ID1545
Feature activation+1.36e-04

Pos logit contributions
Ell+0.006
Da+0.006
aron+0.005
Mill+0.005
Lu+0.005
Neg logit contributions
 wins-0.005
angles-0.005
osh-0.005
upid-0.005
 scores-0.005
Token able
Token ID1498
Feature activation-9.99e-03

Pos logit contributions
Ell+0.025
Da+0.023
aron+0.020
Bug+0.020
Lu+0.020
Neg logit contributions
 wins-0.022
angles-0.022
osh-0.021
 scores-0.021
upid-0.020
Token to
Token ID284
Feature activation-7.83e-03

Pos logit contributions
 wins+0.014
 scores+0.013
osh+0.012
upid+0.012
angles+0.012
Neg logit contributions
Ell-0.021
aron-0.019
Da-0.018
Mic-0.018
age-0.018
Token help
Token ID1037
Feature activation+2.80e-02

Pos logit contributions
 wins+0.010
angles+0.010
osh+0.010
 scores+0.010
upid+0.010
Neg logit contributions
Ell-0.016
Da-0.015
aron-0.014
Lu-0.013
Charl-0.013
Token.
Token ID13
Feature activation+3.12e-03

Pos logit contributions
Ell+0.064
Da+0.057
aron+0.055
age+0.054
Charl+0.054
Neg logit contributions
 wins-0.032
angles-0.030
 scores-0.029
osh-0.029
upid-0.027
Token
Token ID198
Feature activation+1.20e-02

Pos logit contributions
Ell+0.006
Da+0.005
aron+0.005
 owl+0.005
isy+0.005
Neg logit contributions
 wins-0.005
angles-0.005
osh-0.005
 scores-0.004
upid-0.004
Token
Token ID198
Feature activation+1.20e-02

Pos logit contributions
Ell+0.020
Da+0.018
 owl+0.016
aron+0.016
age+0.015
Neg logit contributions
angles-0.021
 wins-0.021
osh-0.021
upid-0.019
 scores-0.019
TokenThe
Token ID464
Feature activation-1.93e-03

Pos logit contributions
Ell+0.019
aron+0.018
 owl+0.017
edly+0.017
 masse+0.017
Neg logit contributions
osh-0.018
angles-0.018
 wins-0.018
 scores-0.017
upid-0.016
Token bee
Token ID20697
Feature activation+2.09e-02

Pos logit contributions
angles+0.004
 wins+0.004
osh+0.004
upid+0.004
 scores+0.004
Neg logit contributions
Ell-0.003
Da-0.002
 owl-0.002
aron-0.002
Bug-0.002
Token took
Token ID1718
Feature activation+1.01e-02

Pos logit contributions
Ell+0.041
 masse+0.037
Da+0.036
Charl+0.035
aron+0.035
Neg logit contributions
osh-0.029
 wins-0.027
angles-0.026
 scores-0.024
ret-0.024
Token the
Token ID262
Feature activation+4.14e-03

Pos logit contributions
Ell+0.020
Da+0.018
 owl+0.016
Bug+0.016
Mill+0.016
Neg logit contributions
 wins-0.015
angles-0.015
osh-0.015
upid-0.014
 scores-0.014
Token butterfly
Token ID35113
Feature activation+1.00e-02

Pos logit contributions
Ell+0.006
Da+0.005
 owl+0.005
aron+0.005
Mill+0.005
Neg logit contributions
angles-0.008
 wins-0.008
osh-0.008
 scores-0.007
upid-0.007
Token good
Token ID922
Feature activation+1.61e-02

Pos logit contributions
 wins+0.005
osh+0.004
 scores+0.004
upid+0.004
 surf+0.004
Neg logit contributions
Ell-0.009
Da-0.008
aron-0.007
han-0.007
edly-0.007
Token friend
Token ID1545
Feature activation+1.57e-03

Pos logit contributions
Ell+0.032
Da+0.029
Bug+0.026
Lu+0.025
Mill+0.025
Neg logit contributions
 wins-0.024
angles-0.023
osh-0.023
upid-0.022
 scores-0.022
Token,"
Token ID553
Feature activation+9.46e-03

Pos logit contributions
Ell+0.003
Da+0.003
Mic+0.003
 masse+0.003
Mill+0.003
Neg logit contributions
 wins-0.002
osh-0.002
 scores-0.002
angles-0.002
 agree-0.002
Token Mia
Token ID32189
Feature activation+1.03e-02

Pos logit contributions
Ell+0.017
Da+0.017
 owl+0.016
Lu+0.015
 Hope+0.014
Neg logit contributions
angles-0.015
upid-0.015
yles-0.014
ret-0.014
 wins-0.014
Token said
Token ID531
Feature activation-1.91e-03

Pos logit contributions
Ell+0.023
Da+0.021
Mill+0.020
Mic+0.020
Bug+0.019
Neg logit contributions
osh-0.012
 wins-0.012
angles-0.010
 scores-0.010
 agree-0.010
Token.
Token ID13
Feature activation+1.21e-02

Pos logit contributions
 wins+0.003
osh+0.003
angles+0.003
 scores+0.003
upid+0.003
Neg logit contributions
Ell-0.004
Da-0.003
Mill-0.003
Bug-0.003
Mic-0.003
Token
Token ID198
Feature activation+4.80e-03

Pos logit contributions
Ell+0.028
isy+0.026
han+0.025
age+0.025
ivering+0.025
Neg logit contributions
 wins-0.012
 Tom-0.011
osh-0.011
 Ben-0.011
 Dan-0.010
Token
Token ID198
Feature activation+5.32e-03

Pos logit contributions
Ell+0.009
Da+0.008
 owl+0.007
 masse+0.007
han+0.007
Neg logit contributions
 wins-0.007
osh-0.007
angles-0.007
upid-0.007
 scores-0.007
TokenThey
Token ID2990
Feature activation-9.50e-04

Pos logit contributions
Ell+0.008
Da+0.007
 masse+0.006
isy+0.006
 owl+0.006
Neg logit contributions
 wins-0.010
angles-0.010
osh-0.010
 scores-0.009
upid-0.009
Token hugged
Token ID41130
Feature activation+5.24e-03

Pos logit contributions
 wins+0.002
angles+0.002
upid+0.002
 scores+0.002
osh+0.002
Neg logit contributions
Ell-0.001
Da-0.001
 owl-0.001
Lu-0.001
Bug-0.001
Token each
Token ID1123
Feature activation-4.09e-03

Pos logit contributions
Ell+0.006
Da+0.005
 owl+0.005
aron+0.005
Lu+0.005
Neg logit contributions
angles-0.011
 wins-0.011
upid-0.011
yles-0.011
osh-0.011
Token were
Token ID547
Feature activation-4.96e-03

Pos logit contributions
Ell+0.006
Da+0.006
 owl+0.005
Mill+0.005
Lu+0.005
Neg logit contributions
angles-0.006
 wins-0.006
upid-0.006
 scores-0.006
 recess-0.006
Token lots
Token ID6041
Feature activation-3.72e-02

Pos logit contributions
angles+0.007
 wins+0.007
osh+0.007
upid+0.007
 scores+0.007
Neg logit contributions
Ell-0.009
Da-0.009
 owl-0.008
Mill-0.008
aron-0.008
Token of
Token ID286
Feature activation+3.06e-03

Pos logit contributions
angles+0.071
 wins+0.070
osh+0.069
upid+0.066
 scores+0.066
Neg logit contributions
Ell-0.056
Da-0.049
 owl-0.044
aron-0.043
Mill-0.042
Token small
Token ID1402
Feature activation+1.68e-03

Pos logit contributions
Ell+0.006
Da+0.005
 owl+0.005
Mill+0.005
aron+0.005
Neg logit contributions
angles-0.005
 wins-0.005
osh-0.005
upid-0.004
 scores-0.004
Token fishes
Token ID49765
Feature activation-3.35e-03

Pos logit contributions
Ell+0.004
Da+0.003
 owl+0.003
aron+0.003
Mill+0.003
Neg logit contributions
angles-0.002
 wins-0.002
osh-0.002
upid-0.002
 scores-0.002
Token swimming
Token ID14899
Feature activation+1.35e-02

Pos logit contributions
 wins+0.006
angles+0.006
osh+0.006
 scores+0.006
upid+0.005
Neg logit contributions
Ell-0.006
Da-0.005
aron-0.004
Bug-0.004
Lu-0.004
Token around
Token ID1088
Feature activation-1.29e-02

Pos logit contributions
Ell+0.021
Da+0.018
 masse+0.017
 Michelle+0.016
Mic+0.016
Neg logit contributions
 wins-0.027
osh-0.026
angles-0.025
 scores-0.025
 surf-0.024
Token his
Token ID465
Feature activation+3.42e-04

Pos logit contributions
 wins+0.023
osh+0.022
angles+0.022
 scores+0.021
upid+0.020
Neg logit contributions
Ell-0.022
Da-0.019
age-0.018
 masse-0.017
Bug-0.017
Token hands
Token ID2832
Feature activation-5.94e-04

Pos logit contributions
Ell+0.001
Da+0.000
 owl+0.000
aron+0.000
Mill+0.000
Neg logit contributions
angles-0.001
osh-0.001
 wins-0.001
upid-0.001
yles-0.001
Token!
Token ID0
Feature activation-9.47e-03

Pos logit contributions
 wins+0.001
angles+0.001
osh+0.001
 scores+0.001
upid+0.001
Neg logit contributions
Ell-0.001
Da-0.001
 owl-0.001
Bug-0.001
Mill-0.001
Token He
Token ID679
Feature activation-3.03e-03

Pos logit contributions
angles+0.017
 wins+0.017
osh+0.017
upid+0.016
 scores+0.016
Neg logit contributions
Ell-0.015
Da-0.014
 owl-0.012
aron-0.012
Mill-0.012
Token The
Token ID383
Feature activation+8.12e-03

Pos logit contributions
Ell+0.008
Da+0.007
aron+0.007
Lu+0.007
Just+0.006
Neg logit contributions
osh-0.015
angles-0.014
ret-0.014
upid-0.014
 wins-0.014
Token van
Token ID5719
Feature activation+2.17e-03

Pos logit contributions
Ell+0.013
Da+0.011
 owl+0.011
age+0.010
Mill+0.010
Neg logit contributions
angles-0.015
osh-0.015
 wins-0.015
upid-0.014
 scores-0.014
Token was
Token ID373
Feature activation+4.12e-03

Pos logit contributions
Ell+0.003
Da+0.003
 owl+0.003
age+0.003
aron+0.003
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
upid-0.004
 scores-0.004
Token old
Token ID1468
Feature activation+1.10e-02

Pos logit contributions
Ell+0.007
Da+0.006
 owl+0.006
aron+0.005
Mill+0.005
Neg logit contributions
angles-0.007
 wins-0.007
osh-0.007
upid-0.007
 scores-0.007
Token and
Token ID290
Feature activation+2.42e-03

Pos logit contributions
Ell+0.017
Da+0.014
Bug+0.012
Mill+0.012
Mic+0.012
Neg logit contributions
 wins-0.023
angles-0.022
osh-0.022
 scores-0.021
 surf-0.020
Token sometimes
Token ID3360
Feature activation+2.12e-02

Pos logit contributions
Ell+0.004
Da+0.003
 owl+0.003
Lu+0.003
Bug+0.003
Neg logit contributions
angles-0.004
upid-0.004
 wins-0.004
osh-0.004
inch-0.004
Token it
Token ID340
Feature activation-7.23e-03

Pos logit contributions
Ell+0.036
 owl+0.032
Da+0.031
 Michelle+0.029
age+0.029
Neg logit contributions
angles-0.035
upid-0.034
osh-0.033
 wins-0.032
ret-0.032
Token did
Token ID750
Feature activation-1.31e-02

Pos logit contributions
angles+0.013
upid+0.013
 recess+0.012
 wins+0.012
yles+0.012
Neg logit contributions
Ell-0.011
Da-0.009
 owl-0.009
age-0.009
 Bug-0.009
Token not
Token ID407
Feature activation+1.52e-02

Pos logit contributions
osh+0.029
angles+0.029
 wins+0.028
yles+0.028
upid+0.027
Neg logit contributions
Ell-0.015
Da-0.011
age-0.011
 owl-0.011
Just-0.011
Token work
Token ID670
Feature activation+4.75e-03

Pos logit contributions
Ell+0.027
 owl+0.025
age+0.024
aron+0.023
Bug+0.023
Neg logit contributions
angles-0.023
yles-0.023
osh-0.022
 wins-0.021
ours-0.021
Token.
Token ID13
Feature activation+6.03e-03

Pos logit contributions
Ell+0.007
Da+0.006
aron+0.005
 owl+0.005
Lu+0.005
Neg logit contributions
angles-0.009
osh-0.009
 wins-0.009
upid-0.009
 scores-0.009
Token"
Token ID1
Feature activation+1.75e-02

Pos logit contributions
Ell+0.004
 owl+0.004
Da+0.003
aron+0.003
 masse+0.003
Neg logit contributions
osh-0.005
 wins-0.005
angles-0.005
upid-0.005
 scores-0.005
TokenLook
Token ID8567
Feature activation-9.28e-03

Pos logit contributions
Ell+0.031
Da+0.028
age+0.027
aron+0.027
 owl+0.026
Neg logit contributions
osh-0.027
 wins-0.026
angles-0.026
yles-0.024
 scores-0.024
Token,
Token ID11
Feature activation+6.61e-04

Pos logit contributions
 wins+0.021
angles+0.020
osh+0.020
 recess+0.020
 scores+0.019
Neg logit contributions
Ell-0.009
Da-0.008
 owl-0.008
isy-0.008
han-0.007
Token look
Token ID804
Feature activation+4.32e-03

Pos logit contributions
Ell+0.001
 owl+0.001
Da+0.001
 Hope+0.001
 Bug+0.001
Neg logit contributions
angles-0.001
 wins-0.001
upid-0.001
 scores-0.001
anged-0.001
Token!"
Token ID2474
Feature activation-3.08e-03

Pos logit contributions
Ell+0.005
Da+0.005
aron+0.004
 owl+0.004
isy+0.004
Neg logit contributions
angles-0.009
osh-0.009
 wins-0.009
 recess-0.009
 scores-0.008
Token Lily
Token ID20037
Feature activation-6.09e-03

Pos logit contributions
 wins+0.003
osh+0.003
angles+0.003
 scores+0.003
upid+0.003
Neg logit contributions
Ell-0.007
Da-0.006
age-0.006
aron-0.006
 owl-0.006
Token said
Token ID531
Feature activation-5.19e-03

Pos logit contributions
 wins+0.009
osh+0.009
angles+0.008
 scores+0.008
upid+0.008
Neg logit contributions
Ell-0.012
Da-0.011
Bug-0.010
Lu-0.010
 masse-0.010
Token.
Token ID13
Feature activation+3.34e-03

Pos logit contributions
osh+0.008
 wins+0.008
angles+0.008
 scores+0.008
upid+0.008
Neg logit contributions
Ell-0.010
Da-0.008
Bug-0.008
Mic-0.008
Lu-0.007
Token "
Token ID366
Feature activation+1.37e-02

Pos logit contributions
Ell+0.007
isy+0.006
age+0.006
ivering+0.006
Da+0.006
Neg logit contributions
 wins-0.004
osh-0.004
 L-0.004
 J-0.004
 scores-0.004
TokenA
Token ID32
Feature activation-1.39e-02

Pos logit contributions
Ell+0.020
Da+0.017
 owl+0.015
Mill+0.015
aron+0.015
Neg logit contributions
angles-0.027
 wins-0.027
osh-0.026
upid-0.026
 scores-0.025
Token shooting
Token ID4395
Feature activation-2.42e-02

Pos logit contributions
angles+0.016
 wins+0.015
osh+0.014
upid+0.014
 scores+0.014
Neg logit contributions
Ell-0.031
Da-0.029
 owl-0.028
isy-0.027
age-0.026
Token they
Token ID484
Feature activation-1.73e-03

Pos logit contributions
Ell+0.030
Da+0.027
 owl+0.026
 protecting+0.026
 Bug+0.025
Neg logit contributions
upid-0.037
angles-0.035
 wins-0.034
osh-0.033
isel-0.033
Token were
Token ID547
Feature activation-5.46e-03

Pos logit contributions
angles+0.002
 wins+0.002
upid+0.002
 scores+0.002
osh+0.002
Neg logit contributions
Ell-0.003
Da-0.003
 owl-0.003
Lu-0.003
aron-0.003
Token in
Token ID287
Feature activation+3.47e-03

Pos logit contributions
angles+0.010
osh+0.010
 wins+0.010
upid+0.010
 scores+0.009
Neg logit contributions
Ell-0.008
Da-0.007
 owl-0.007
aron-0.006
Mill-0.006
Token a
Token ID257
Feature activation-6.27e-03

Pos logit contributions
Ell+0.005
Da+0.005
 owl+0.005
 masse+0.004
Charl+0.004
Neg logit contributions
angles-0.007
osh-0.006
 wins-0.006
upid-0.006
ret-0.006
Token warm
Token ID5814
Feature activation+1.40e-02

Pos logit contributions
angles+0.013
 wins+0.012
osh+0.012
yles+0.011
upid+0.011
Neg logit contributions
Ell-0.010
Da-0.009
P-0.008
 owl-0.008
Lu-0.007
Token place
Token ID1295
Feature activation+1.17e-03

Pos logit contributions
Ell+0.028
Da+0.025
Mic+0.023
Just+0.023
Mill+0.022
Neg logit contributions
 wins-0.020
osh-0.019
angles-0.019
 recess-0.018
 scores-0.018
Token,
Token ID11
Feature activation+9.82e-04

Pos logit contributions
Ell+0.002
Da+0.001
 owl+0.001
aron+0.001
Bug+0.001
Neg logit contributions
 wins-0.002
angles-0.002
osh-0.002
 scores-0.002
upid-0.002
Token and
Token ID290
Feature activation+1.42e-03

Pos logit contributions
Ell+0.002
Da+0.002
 owl+0.001
aron+0.001
age+0.001
Neg logit contributions
 wins-0.002
angles-0.002
osh-0.002
 scores-0.002
upid-0.002
Token they
Token ID484
Feature activation-2.16e-03

Pos logit contributions
Ell+0.003
Da+0.002
 owl+0.002
age+0.002
edly+0.002
Neg logit contributions
angles-0.002
osh-0.002
upid-0.002
 wins-0.002
ret-0.002
Token were
Token ID547
Feature activation-6.28e-03

Pos logit contributions
angles+0.004
upid+0.004
 wins+0.004
 scores+0.004
osh+0.003
Neg logit contributions
Ell-0.003
Da-0.003
 owl-0.003
aron-0.003
age-0.003
Token happy
Token ID3772
Feature activation-1.18e-02

Pos logit contributions
angles+0.010
osh+0.010
 wins+0.010
upid+0.009
 scores+0.009
Neg logit contributions
Ell-0.011
Da-0.010
 owl-0.009
aron-0.009
Mill-0.009
Token visited
Token ID8672
Feature activation+1.13e-02

Pos logit contributions
Ell+0.019
Da+0.017
 owl+0.015
aron+0.015
Mill+0.015
Neg logit contributions
 wins-0.016
angles-0.016
osh-0.016
 scores-0.015
upid-0.015
Token her
Token ID607
Feature activation+1.10e-02

Pos logit contributions
Ell+0.026
Da+0.024
 owl+0.022
aron+0.022
age+0.022
Neg logit contributions
angles-0.013
 wins-0.013
osh-0.013
 scores-0.011
upid-0.011
Token bunny
Token ID44915
Feature activation+2.69e-04

Pos logit contributions
Ell+0.020
Da+0.018
 owl+0.016
Mill+0.016
Lu+0.015
Neg logit contributions
angles-0.018
osh-0.018
 wins-0.018
upid-0.017
 scores-0.016
Token friends
Token ID2460
Feature activation-4.39e-03

Pos logit contributions
Ell+0.001
Da+0.000
aron+0.000
han+0.000
Mill+0.000
Neg logit contributions
 wins-0.000
angles-0.000
osh-0.000
 scores-0.000
upid-0.000
Token in
Token ID287
Feature activation+6.19e-05

Pos logit contributions
 wins+0.008
osh+0.007
angles+0.007
 scores+0.007
upid+0.006
Neg logit contributions
Ell-0.008
Da-0.007
Bug-0.006
aron-0.006
Mill-0.006
Token the
Token ID262
Feature activation+2.56e-03

Pos logit contributions
Ell+0.000
Da+0.000
 owl+0.000
Mill+0.000
Lu+0.000
Neg logit contributions
angles-0.000
 wins-0.000
osh-0.000
upid-0.000
 scores-0.000
Token house
Token ID2156
Feature activation+5.07e-03

Pos logit contributions
Ell+0.004
Da+0.004
aron+0.004
age+0.003
coon+0.003
Neg logit contributions
 wins-0.004
osh-0.004
angles-0.004
 scores-0.004
 recess-0.004
Token in
Token ID287
Feature activation-3.68e-04

Pos logit contributions
Ell+0.008
Da+0.007
 owl+0.006
aron+0.006
Mill+0.006
Neg logit contributions
angles-0.009
 wins-0.009
osh-0.009
 scores-0.009
upid-0.009
Token the
Token ID262
Feature activation+2.16e-03

Pos logit contributions
angles+0.000
 wins+0.000
osh+0.000
upid+0.000
 scores+0.000
Neg logit contributions
Ell-0.001
Da-0.001
 owl-0.001
Mill-0.001
Lu-0.001
Token tree
Token ID5509
Feature activation-3.23e-03

Pos logit contributions
Ell+0.004
Da+0.003
aron+0.003
Mill+0.003
age+0.003
Neg logit contributions
 wins-0.004
osh-0.003
angles-0.003
 scores-0.003
 recess-0.003
Token.
Token ID13
Feature activation-3.54e-03

Pos logit contributions
 wins+0.006
angles+0.006
osh+0.006
 scores+0.005
upid+0.005
Neg logit contributions
Ell-0.005
Da-0.005
 owl-0.004
aron-0.004
Bug-0.004
Token L
Token ID406
Feature activation-4.12e-02

Pos logit contributions
 wins+0.029
angles+0.029
osh+0.029
upid+0.027
 scores+0.026
Neg logit contributions
Ell-0.048
Da-0.043
 owl-0.041
aron-0.041
age-0.040
Tokenila
Token ID10102
Feature activation-2.35e-02

Pos logit contributions
 wins+0.055
angles+0.052
osh+0.051
 scores+0.050
upid+0.047
Neg logit contributions
Ell-0.087
Da-0.081
Lu-0.074
isy-0.073
Mill-0.073
Token.
Token ID13
Feature activation+5.90e-03

Pos logit contributions
osh+0.032
 wins+0.031
angles+0.030
ret+0.028
 surf+0.028
Neg logit contributions
Ell-0.046
Da-0.043
 masse-0.041
han-0.040
Bug-0.038
Token L
Token ID406
Feature activation-4.06e-02

Pos logit contributions
Ell+0.005
Da+0.003
Lu+0.002
Just+0.002
Bug+0.002
Neg logit contributions
angles-0.016
 wins-0.016
osh-0.016
 scores-0.015
yles-0.015
Tokenila
Token ID10102
Feature activation-2.27e-02

Pos logit contributions
 wins+0.051
angles+0.049
 scores+0.047
osh+0.046
upid+0.043
Neg logit contributions
Ell-0.085
Da-0.081
isy-0.075
P-0.074
Lu-0.074
Token loved
Token ID6151
Feature activation-1.07e-02

Pos logit contributions
osh+0.025
 wins+0.023
angles+0.021
 are+0.020
 surf+0.020
Neg logit contributions
Ell-0.048
Da-0.048
 masse-0.047
han-0.044
edly-0.043
Token to
Token ID284
Feature activation-1.00e-02

Pos logit contributions
angles+0.018
 wins+0.018
osh+0.018
 scores+0.017
upid+0.017
Neg logit contributions
Ell-0.018
Da-0.016
aron-0.014
 owl-0.014
age-0.014
Token go
Token ID467
Feature activation-1.36e-02

Pos logit contributions
angles+0.011
 wins+0.011
osh+0.011
upid+0.010
 scores+0.010
Neg logit contributions
Ell-0.023
Da-0.021
 owl-0.019
aron-0.019
age-0.019
Token on
Token ID319
Feature activation+1.42e-03

Pos logit contributions
 surf+0.021
 wins+0.021
 recess+0.021
angles+0.020
 surfing+0.020
Neg logit contributions
Ell-0.025
Da-0.024
aron-0.023
edly-0.021
Charl-0.021
Token walks
Token ID11114
Feature activation-3.17e-02

Pos logit contributions
Ell+0.003
Da+0.002
 owl+0.002
Mill+0.002
Lu+0.002
Neg logit contributions
angles-0.002
osh-0.002
 wins-0.002
upid-0.002
 scores-0.002
Token in
Token ID287
Feature activation+3.06e-03

Pos logit contributions
 recess+0.053
 wins+0.053
osh+0.049
 scores+0.049
 surf+0.048
Neg logit contributions
Ell-0.048
coon-0.047
Da-0.046
 masse-0.045
isy-0.045
Token can
Token ID460
Feature activation-1.21e-02

Pos logit contributions
angles+0.009
 recess+0.009
yles+0.009
osh+0.008
 scores+0.008
Neg logit contributions
Ell-0.012
 Bug-0.011
 owl-0.011
Bug-0.011
age-0.010
Token find
Token ID1064
Feature activation+2.30e-02

Pos logit contributions
angles+0.017
osh+0.017
 wins+0.016
 scores+0.015
yles+0.015
Neg logit contributions
Ell-0.023
 owl-0.021
Da-0.020
age-0.020
Bug-0.020
Token a
Token ID257
Feature activation-3.59e-03

Pos logit contributions
Ell+0.033
 owl+0.031
Da+0.030
 Bug+0.028
Bug+0.028
Neg logit contributions
angles-0.041
 wins-0.040
osh-0.039
 scores-0.039
upid-0.039
Token treasure
Token ID14068
Feature activation-7.08e-03

Pos logit contributions
angles+0.007
osh+0.007
yles+0.007
 wins+0.006
upid+0.006
Neg logit contributions
Ell-0.005
 owl-0.005
Da-0.004
 Bug-0.004
age-0.004
Token or
Token ID393
Feature activation+1.14e-02

Pos logit contributions
 wins+0.013
angles+0.012
osh+0.011
 scores+0.011
 recess+0.010
Neg logit contributions
Ell-0.013
Da-0.012
Mill-0.010
Charl-0.010
Lu-0.010
Token a
Token ID257
Feature activation-3.91e-03

Pos logit contributions
Ell+0.020
Da+0.018
 owl+0.016
Bug+0.016
Mill+0.016
Neg logit contributions
angles-0.019
 wins-0.018
osh-0.018
upid-0.017
 scores-0.017
Token fairy
Token ID25607
Feature activation+6.45e-03

Pos logit contributions
angles+0.007
osh+0.007
 wins+0.006
upid+0.006
yles+0.006
Neg logit contributions
Ell-0.007
Da-0.006
 owl-0.006
age-0.005
Bug-0.005
Token!"
Token ID2474
Feature activation-4.12e-03

Pos logit contributions
Ell+0.011
Da+0.010
Mill+0.009
Lu+0.009
 masse+0.009
Neg logit contributions
 wins-0.011
angles-0.011
osh-0.011
 scores-0.010
upid-0.010
Token
Token ID198
Feature activation+9.68e-03

Pos logit contributions
angles+0.004
 wins+0.004
osh+0.004
 scores+0.004
upid+0.004
Neg logit contributions
Ell-0.010
Da-0.009
 owl-0.009
aron-0.008
Mill-0.008
Token
Token ID198
Feature activation+1.03e-02

Pos logit contributions
Ell+0.016
Da+0.014
 owl+0.012
aron+0.012
Mill+0.012
Neg logit contributions
angles-0.018
 wins-0.018
osh-0.017
 scores-0.016
upid-0.016
TokenBen
Token ID11696
Feature activation-2.77e-02

Pos logit contributions
Ell+0.016
Da+0.014
 owl+0.013
aron+0.012
Mill+0.012
Neg logit contributions
angles-0.020
 wins-0.019
osh-0.019
upid-0.018
 scores-0.018
Token his
Token ID465
Feature activation-8.14e-04

Pos logit contributions
angles+0.003
 wins+0.003
osh+0.003
 scores+0.003
upid+0.003
Neg logit contributions
Ell-0.005
Da-0.005
age-0.004
 owl-0.004
Bug-0.004
Token eyes
Token ID2951
Feature activation-9.16e-03

Pos logit contributions
 wins+0.001
angles+0.001
osh+0.001
 scores+0.001
upid+0.001
Neg logit contributions
Ell-0.002
Da-0.002
Mill-0.002
Lu-0.002
Bug-0.002
Token and
Token ID290
Feature activation-6.05e-03

Pos logit contributions
osh+0.017
 wins+0.017
angles+0.017
upid+0.016
 scores+0.016
Neg logit contributions
Ell-0.014
Da-0.012
Mic-0.011
Bug-0.011
 owl-0.011
Token he
Token ID339
Feature activation-5.81e-03

Pos logit contributions
angles+0.010
 wins+0.010
upid+0.010
osh+0.009
 scores+0.009
Neg logit contributions
Ell-0.010
Da-0.009
 owl-0.009
Just-0.008
aron-0.008
Token told
Token ID1297
Feature activation-8.54e-03

Pos logit contributions
angles+0.010
 wins+0.010
upid+0.010
 scores+0.009
 recess+0.009
Neg logit contributions
Ell-0.009
Da-0.008
 owl-0.008
Just-0.008
age-0.008
Token all
Token ID477
Feature activation-4.36e-02

Pos logit contributions
 wins+0.015
osh+0.015
angles+0.014
upid+0.013
yles+0.013
Neg logit contributions
Ell-0.014
Da-0.013
 owl-0.013
Mill-0.012
han-0.011
Token his
Token ID465
Feature activation-1.52e-03

Pos logit contributions
angles+0.075
 wins+0.073
osh+0.072
 scores+0.068
yles+0.068
Neg logit contributions
Ell-0.074
Da-0.065
 owl-0.063
Mill-0.059
Lu-0.059
Token friends
Token ID2460
Feature activation-3.38e-03

Pos logit contributions
angles+0.003
 wins+0.003
osh+0.003
upid+0.002
yles+0.002
Neg logit contributions
Ell-0.002
 owl-0.002
Da-0.002
isy-0.002
Mill-0.002
Token about
Token ID546
Feature activation-2.87e-02

Pos logit contributions
angles+0.006
osh+0.006
 wins+0.006
upid+0.006
 recess+0.006
Neg logit contributions
Ell-0.005
Da-0.004
 owl-0.004
aron-0.004
isy-0.004
Token the
Token ID262
Feature activation-6.82e-04

Pos logit contributions
angles+0.041
osh+0.039
 wins+0.039
upid+0.038
 scores+0.035
Neg logit contributions
Ell-0.057
Da-0.052
 owl-0.050
Mill-0.047
 Bug-0.046
Token incredible
Token ID8082
Feature activation+1.21e-02

Pos logit contributions
angles+0.001
 wins+0.001
osh+0.001
upid+0.001
 scores+0.001
Neg logit contributions
Ell-0.001
Da-0.001
 owl-0.001
Mill-0.001
Bug-0.001
Token Tim
Token ID5045
Feature activation-9.64e-03

Pos logit contributions
Ell+0.001
Da+0.001
 owl+0.001
aron+0.001
Mill+0.001
Neg logit contributions
angles-0.000
 wins-0.000
osh-0.000
upid-0.000
 scores-0.000
Tokenmy
Token ID1820
Feature activation+9.21e-03

Pos logit contributions
angles+0.017
 wins+0.017
osh+0.017
upid+0.016
 scores+0.016
Neg logit contributions
Ell-0.015
Da-0.013
 owl-0.012
aron-0.012
age-0.012
Token."
Token ID526
Feature activation-1.93e-04

Pos logit contributions
Ell+0.015
Da+0.013
 owl+0.011
Mill+0.011
aron+0.011
Neg logit contributions
 wins-0.017
angles-0.017
osh-0.016
 scores-0.016
upid-0.016
Token
Token ID198
Feature activation+1.50e-03

Pos logit contributions
 wins+0.000
angles+0.000
osh+0.000
 scores+0.000
upid+0.000
Neg logit contributions
Ell-0.000
Da-0.000
 owl-0.000
age-0.000
Bug-0.000
Token
Token ID198
Feature activation+2.25e-03

Pos logit contributions
Ell+0.002
Da+0.002
 owl+0.002
aron+0.002
age+0.002
Neg logit contributions
angles-0.003
 wins-0.003
osh-0.003
upid-0.002
 scores-0.002
TokenL
Token ID43
Feature activation-4.11e-02

Pos logit contributions
Ell+0.004
 owl+0.003
 Bug+0.003
 calf+0.003
Da+0.003
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
 scores-0.003
upid-0.003
Tokenily
Token ID813
Feature activation+1.36e-03

Pos logit contributions
angles+0.071
 wins+0.070
osh+0.068
upid+0.066
ret+0.064
Neg logit contributions
Ell-0.072
Da-0.063
 masse-0.059
 owl-0.058
Bug-0.058
Token,
Token ID11
Feature activation+4.08e-03

Pos logit contributions
Ell+0.002
Da+0.002
Mill+0.002
Bug+0.002
Mic+0.002
Neg logit contributions
 wins-0.002
osh-0.002
angles-0.002
 scores-0.002
upid-0.002
Token Ben
Token ID3932
Feature activation-2.10e-02

Pos logit contributions
Ell+0.009
Da+0.008
age+0.008
Bug+0.008
aron+0.008
Neg logit contributions
 wins-0.005
osh-0.005
angles-0.005
 scores-0.004
 recess-0.004
Token,
Token ID11
Feature activation+3.41e-03

Pos logit contributions
 wins+0.037
osh+0.036
 scores+0.034
angles+0.034
upid+0.032
Neg logit contributions
Ell-0.037
Da-0.032
Mill-0.029
Bug-0.029
age-0.028
Token and
Token ID290
Feature activation+5.35e-03

Pos logit contributions
Ell+0.006
age+0.005
han+0.005
Da+0.005
Bug+0.005
Neg logit contributions
osh-0.005
 wins-0.005
angles-0.005
 scores-0.005
 recess-0.005
Token teacher
Token ID4701
Feature activation-5.14e-03

Pos logit contributions
Ell+0.035
Da+0.031
 owl+0.030
age+0.028
 masse+0.027
Neg logit contributions
osh-0.033
angles-0.033
 wins-0.033
upid-0.030
yles-0.029
Token is
Token ID318
Feature activation-5.65e-03

Pos logit contributions
angles+0.008
 wins+0.008
osh+0.008
upid+0.008
 scores+0.008
Neg logit contributions
Ell-0.009
Da-0.008
 owl-0.007
aron-0.007
Mill-0.007
Token Miss
Token ID4544
Feature activation-5.43e-02

Pos logit contributions
angles+0.010
 wins+0.010
osh+0.010
upid+0.009
 scores+0.009
Neg logit contributions
Ell-0.009
Da-0.008
 owl-0.007
aron-0.007
Mill-0.007
Token Lily
Token ID20037
Feature activation+9.67e-04

Pos logit contributions
 wins+0.076
angles+0.074
osh+0.072
 scores+0.068
yles+0.068
Neg logit contributions
Ell-0.111
Da-0.100
age-0.093
Bug-0.092
Mic-0.091
Token.
Token ID13
Feature activation+6.55e-03

Pos logit contributions
Ell+0.001
Da+0.001
 owl+0.001
aron+0.001
Mill+0.001
Neg logit contributions
angles-0.002
 wins-0.002
osh-0.002
upid-0.002
 scores-0.002
Token Miss
Token ID4544
Feature activation-5.40e-02

Pos logit contributions
Ell+0.008
Da+0.006
 owl+0.005
aron+0.005
Lu+0.005
Neg logit contributions
angles-0.015
osh-0.015
 wins-0.015
upid-0.014
 scores-0.014
Token Lily
Token ID20037
Feature activation+1.25e-03

Pos logit contributions
 wins+0.074
angles+0.073
osh+0.072
 scores+0.067
upid+0.065
Neg logit contributions
Ell-0.111
Da-0.101
age-0.092
Bug-0.092
 owl-0.091
Token is
Token ID318
Feature activation-5.22e-03

Pos logit contributions
Ell+0.002
Da+0.002
 owl+0.002
aron+0.002
Lu+0.002
Neg logit contributions
angles-0.002
osh-0.002
upid-0.002
 wins-0.002
 recess-0.002
Token very
Token ID845
Feature activation+1.38e-02

Pos logit contributions
 wins+0.009
angles+0.009
osh+0.008
 scores+0.008
upid+0.008
Neg logit contributions
Ell-0.009
Da-0.008
 owl-0.008
aron-0.007
Bug-0.007
Token nice
Token ID3621
Feature activation+1.50e-02

Pos logit contributions
Ell+0.021
Da+0.018
Lu+0.016
 owl+0.016
Mill+0.016
Neg logit contributions
angles-0.027
osh-0.026
 wins-0.025
upid-0.025
yles-0.024
Token.
Token ID13
Feature activation+6.46e-03

Pos logit contributions
Ell+0.021
Da+0.018
 owl+0.015
Bug+0.015
aron+0.015
Neg logit contributions
 wins-0.031
angles-0.030
osh-0.030
 scores-0.029
upid-0.029
Token was
Token ID373
Feature activation+2.00e-03

Pos logit contributions
 wins+0.009
angles+0.009
osh+0.009
 scores+0.008
upid+0.008
Neg logit contributions
Ell-0.010
Da-0.009
Bug-0.008
 owl-0.008
Mill-0.008
Token the
Token ID262
Feature activation+3.67e-03

Pos logit contributions
Ell+0.003
Da+0.003
 owl+0.003
aron+0.003
age+0.003
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
upid-0.003
 scores-0.003
Token wealthiest
Token ID35817
Feature activation-1.77e-02

Pos logit contributions
Ell+0.006
Da+0.006
 owl+0.005
 masse+0.005
age+0.005
Neg logit contributions
angles-0.006
osh-0.006
 wins-0.006
upid-0.006
 scores-0.006
Token girl
Token ID2576
Feature activation+1.78e-02

Pos logit contributions
 wins+0.030
angles+0.029
osh+0.029
 scores+0.028
 recess+0.027
Neg logit contributions
Ell-0.032
Da-0.028
Bug-0.025
Mill-0.025
age-0.024
Token in
Token ID287
Feature activation+1.73e-03

Pos logit contributions
Ell+0.031
Da+0.027
Bug+0.026
Mic+0.025
Mill+0.024
Neg logit contributions
 wins-0.032
osh-0.031
 scores-0.029
angles-0.028
 recess-0.026
Token all
Token ID477
Feature activation-4.24e-02

Pos logit contributions
Ell+0.004
Da+0.003
 owl+0.003
Mill+0.003
Lu+0.003
Neg logit contributions
angles-0.002
 wins-0.002
osh-0.002
upid-0.002
 scores-0.002
Token the
Token ID262
Feature activation+2.30e-03

Pos logit contributions
angles+0.075
 wins+0.074
osh+0.074
upid+0.069
 scores+0.069
Neg logit contributions
Ell-0.071
Da-0.062
 owl-0.055
aron-0.055
Mill-0.054
Token land
Token ID1956
Feature activation-2.73e-02

Pos logit contributions
Ell+0.004
Da+0.003
 owl+0.003
Mill+0.003
aron+0.003
Neg logit contributions
angles-0.004
 wins-0.004
osh-0.004
upid-0.004
 scores-0.004
Token and
Token ID290
Feature activation-5.06e-03

Pos logit contributions
 wins+0.051
osh+0.050
angles+0.049
 scores+0.047
upid+0.046
Neg logit contributions
Ell-0.044
Da-0.038
Bug-0.034
Mic-0.034
 owl-0.033
Token the
Token ID262
Feature activation+2.55e-03

Pos logit contributions
angles+0.009
osh+0.009
 wins+0.009
upid+0.008
 scores+0.008
Neg logit contributions
Ell-0.009
Da-0.007
 owl-0.007
Mill-0.007
age-0.007
Token people
Token ID661
Feature activation-7.24e-03

Pos logit contributions
Ell+0.004
Da+0.003
 owl+0.003
 masse+0.003
Lu+0.003
Neg logit contributions
angles-0.005
osh-0.005
 wins-0.005
upid-0.005
yles-0.004
Token too
Token ID1165
Feature activation-1.23e-02

Pos logit contributions
 wins+0.026
angles+0.026
osh+0.026
upid+0.024
 scores+0.024
Neg logit contributions
Ell-0.030
Da-0.027
aron-0.024
 owl-0.024
Mill-0.024
Token hard
Token ID1327
Feature activation+1.63e-02

Pos logit contributions
angles+0.019
osh+0.019
 wins+0.018
yles+0.018
 scores+0.018
Neg logit contributions
Ell-0.023
Da-0.020
 owl-0.019
Lu-0.019
Mill-0.018
Token for
Token ID329
Feature activation+1.60e-02

Pos logit contributions
Ell+0.032
Da+0.029
Mic+0.025
Bug+0.025
Lu+0.025
Neg logit contributions
 wins-0.027
angles-0.025
upid-0.024
osh-0.024
 scores-0.024
Token me
Token ID502
Feature activation+3.39e-02

Pos logit contributions
Ell+0.027
Da+0.024
 owl+0.021
Mill+0.021
Bug+0.021
Neg logit contributions
angles-0.028
 wins-0.028
osh-0.028
upid-0.026
 scores-0.026
Token."
Token ID526
Feature activation-5.47e-03

Pos logit contributions
Ell+0.060
Da+0.053
Bug+0.046
Mill+0.046
 masse+0.045
Neg logit contributions
 wins-0.060
osh-0.057
angles-0.056
upid-0.055
 scores-0.053
Token
Token ID220
Feature activation-4.35e-02

Pos logit contributions
 wins+0.008
osh+0.007
angles+0.007
 scores+0.007
upid+0.007
Neg logit contributions
Ell-0.011
Da-0.009
isy-0.009
 owl-0.009
age-0.009
Token
Token ID198
Feature activation+1.68e-03

Pos logit contributions
angles+0.084
 wins+0.084
osh+0.083
 scores+0.079
upid+0.078
Neg logit contributions
Ell-0.064
Da-0.055
aron-0.048
Mill-0.048
 owl-0.048
Token
Token ID198
Feature activation+2.77e-03

Pos logit contributions
Ell+0.003
Da+0.003
 owl+0.002
age+0.002
 masse+0.002
Neg logit contributions
 wins-0.003
osh-0.003
angles-0.003
upid-0.002
 scores-0.002
TokenBrown
Token ID20644
Feature activation+1.83e-02

Pos logit contributions
Ell+0.004
 owl+0.004
aron+0.003
age+0.003
 masse+0.003
Neg logit contributions
 wins-0.005
osh-0.005
angles-0.004
 scores-0.004
upid-0.004
Tokenie
Token ID494
Feature activation+3.26e-02

Pos logit contributions
Ell+0.031
Da+0.027
 owl+0.025
aron+0.025
Lu+0.024
Neg logit contributions
 wins-0.032
angles-0.032
osh-0.031
upid-0.030
 scores-0.029
Token looked
Token ID3114
Feature activation+1.52e-02

Pos logit contributions
Ell+0.063
Da+0.057
Mill+0.051
Bug+0.051
 owl+0.050
Neg logit contributions
 wins-0.049
osh-0.048
angles-0.048
 scores-0.045
upid-0.043