TOP ACTIVATIONS
MAX = 0.156

worked hard . The grass ho pper was spoiled and didn
a great he ave - ho , they gained the avocado
make it whole again . Amen ." Then ,
" Thank you , fire fly . You are very kind
introduced to a smooth grass ho pper called Jim . Ted
and picked up a grass ho pper ! Both friends were
" Hello , gir affe ," Anna said .
at circles , squares , rect angles and triangles .
ried ly , the grass ho pper rushed over and tried
saw monkeys , elephants , gir aff es , and lions
Gr ass ho pper , let â
see lions , elephants , gir aff es and more .
" Hello , gir affe ," Ben said .
The nos y grass ho pper hopped over to Tom
and brave people from long ago . One day
coins that people used long ago . They saw paintings and
and pointed at the grass ho pper . He said ,
El mo thanked the dragon fly and promised to take good
saw lions , elephants , gir aff es and ze br
she made a long time ago . Rose had promised always

MIN ACTIVATIONS
MIN = -0.100

area in the river where Spot could explore with no fear
They found a place where Spot could fit in . They
save his pup from the injuries and the pup died soon
the spot where he buried Spot . He wrote Spot 's
in a safe place where Tommy couldn 't reach it .
their mom and dad what Spot can do . They take
was the story of how Sally led the naughty monster away
it . She saw what Tommy had done . She cried
Tim my thought about what Billy said and decided to start
doing anything he set his mind to ! <|endoftext|> Once upon
â s how Spot explored the castle that day
do anything she set her mind to . One day ,
Tim my thought about what Tommy said and then said ,
home , he saw what Spot had done and said ,
woods . Jack had his rake with him . " What
open grass y area where Sally could do her favorite dance
had a metal truck which Max didn 't have . He
look as good as what Billy had made .
liked to do what his Dad did , so every day
's family saw how much Billy and Harriet loved each other

INTERVAL 0.092 - 0.156
CONTAINS 0.004%

. But be careful , crystals can be delicate ."
Sammy would often share av oc ados and other y ummy
They were here a minute ago ." " Maybe
treasure from a long time ago ." The
happy . He hugged t eddy and said , " Thank

INTERVAL 0.028 - 0.092
CONTAINS 5.872%

Her mom my said she was lucky it wasn 't worse
Sam said , " Yes , let 's try together ."
down and enjoyed the y ummy apple together . The end
In the end , the avocado realized that there was no
uses a teaspoon to make swirl s and dots on the

INTERVAL -0.036 - 0.028
CONTAINS 91.742%

things can break easily . <|endoftext|> Tim my was playing with
you ." The bird felt better and flew away .
One day , he decided to explore the woods . He
and brushes and made a beautiful picture of a rainbow .
with her coat . They knew she would be kept safe

INTERVAL -0.100 - -0.036
CONTAINS 2.382%

The little boy was a bit frightened and he stepped back
gun and went to see what it was . He found
a cow named Daisy . Daisy was friendly and helpful .
a band age on her ankle . He says , "
fit ! " What will you do with
Token worked
Token ID3111
Feature activation+2.07e-02

Pos logit contributions
 Amen+0.017
 teaches+0.017
 happens+0.016
 lived+0.016
icks+0.016
Neg logit contributions
arse-0.019
 destination-0.018
ge-0.018
ized-0.018
har-0.017
Token hard
Token ID1327
Feature activation+5.32e-02

Pos logit contributions
 teaches+0.020
 happens+0.019
 about+0.018
 lived+0.017
 is+0.016
Neg logit contributions
arse-0.058
oured-0.054
har-0.054
ishment-0.053
rolling-0.052
Token.
Token ID13
Feature activation+2.60e-02

Pos logit contributions
 happens+0.076
 teaches+0.074
 lived+0.070
 vitamins+0.068
 Amen+0.067
Neg logit contributions
arse-0.121
har-0.115
 destination-0.110
oured-0.110
ized-0.109
Token The
Token ID383
Feature activation+1.11e-02

Pos logit contributions
 happens+0.037
 Amen+0.036
 teaches+0.035
igator+0.034
ile+0.034
Neg logit contributions
arse-0.056
 destination-0.054
ge-0.053
har-0.053
ized-0.052
Token grass
Token ID8701
Feature activation+3.38e-02

Pos logit contributions
 Amen+0.017
 teaches+0.017
 lived+0.017
 happens+0.016
igator+0.016
Neg logit contributions
arse-0.023
ge-0.022
 destination-0.022
ized-0.022
har-0.021
Tokenho
Token ID8873
Feature activation+1.26e-01

Pos logit contributions
 lived+0.046
 teaches+0.046
 Amen+0.046
 happens+0.045
igator+0.044
Neg logit contributions
arse-0.077
ge-0.074
 destination-0.073
har-0.071
ized-0.071
Tokenpper
Token ID2848
Feature activation-1.33e-02

Pos logit contributions
 happens+0.244
 lived+0.244
 teaches+0.239
icks+0.234
 Amen+0.232
Neg logit contributions
arse-0.210
 destination-0.201
ishment-0.195
har-0.193
ge-0.191
Token was
Token ID373
Feature activation+1.36e-02

Pos logit contributions
arse+0.029
ge+0.027
 destination+0.027
har+0.026
ized+0.026
Neg logit contributions
 lived-0.021
 teaches-0.020
 happens-0.020
igator-0.019
 Amen-0.019
Token spoiled
Token ID34781
Feature activation-2.18e-03

Pos logit contributions
 Amen+0.022
 teaches+0.022
 lived+0.022
 happens+0.022
igator+0.021
Neg logit contributions
arse-0.028
ge-0.026
 destination-0.026
ized-0.026
har-0.025
Token and
Token ID290
Feature activation-1.63e-04

Pos logit contributions
arse+0.005
har+0.005
 destination+0.005
ge+0.005
rolling+0.005
Neg logit contributions
 lived-0.003
 happens-0.003
 teaches-0.003
igator-0.003
 about-0.003
Token didn
Token ID1422
Feature activation+6.77e-04

Pos logit contributions
arse+0.000
ge+0.000
 destination+0.000
ized+0.000
har+0.000
Neg logit contributions
 Amen-0.000
 teaches-0.000
 happens-0.000
 lived-0.000
igator-0.000
Token a
Token ID257
Feature activation-2.64e-03

Pos logit contributions
 teaches+0.037
 Amen+0.037
 lived+0.037
 happens+0.036
igator+0.035
Neg logit contributions
arse-0.057
ge-0.055
 destination-0.053
har-0.053
ized-0.052
Token great
Token ID1049
Feature activation-1.46e-02

Pos logit contributions
arse+0.005
ge+0.005
ized+0.005
ishment+0.005
 destination+0.005
Neg logit contributions
 Amen-0.004
 teaches-0.004
 happens-0.004
 lived-0.004
igator-0.004
Token he
Token ID339
Feature activation+6.51e-03

Pos logit contributions
arse+0.030
ge+0.029
ized+0.028
 destination+0.027
har+0.027
Neg logit contributions
 Amen-0.024
 teaches-0.023
 lived-0.023
 happens-0.023
igator-0.022
Tokenave
Token ID1015
Feature activation-3.47e-02

Pos logit contributions
 lived+0.010
 happens+0.010
 teaches+0.010
igator+0.009
 Amen+0.009
Neg logit contributions
arse-0.015
ge-0.014
 destination-0.013
ishment-0.013
har-0.013
Token-
Token ID12
Feature activation-1.40e-03

Pos logit contributions
arse+0.079
ge+0.074
 destination+0.073
har+0.072
ized+0.071
Neg logit contributions
 lived-0.050
 teaches-0.050
 Amen-0.048
igator-0.048
 happens-0.048
Tokenho
Token ID8873
Feature activation+1.22e-01

Pos logit contributions
ge+0.003
arse+0.003
ized+0.003
har+0.003
oured+0.002
Neg logit contributions
 Amen-0.002
 teaches-0.002
 happens-0.002
 lived-0.002
icks-0.002
Token,
Token ID11
Feature activation+8.04e-03

Pos logit contributions
 Amen+0.207
 teaches+0.201
 lived+0.197
 happens+0.197
icks+0.193
Neg logit contributions
arse-0.233
 destination-0.226
ge-0.220
ized-0.219
har-0.217
Token they
Token ID484
Feature activation+3.59e-03

Pos logit contributions
 lived+0.010
 teaches+0.009
igator+0.009
 Amen+0.009
 happens+0.009
Neg logit contributions
arse-0.021
ge-0.019
 destination-0.019
ishment-0.018
har-0.018
Token gained
Token ID8618
Feature activation-1.95e-02

Pos logit contributions
 lived+0.006
 teaches+0.006
igator+0.006
icks+0.005
 happens+0.005
Neg logit contributions
arse-0.008
ge-0.007
 destination-0.007
ized-0.007
ishment-0.007
Token the
Token ID262
Feature activation-6.79e-03

Pos logit contributions
arse+0.043
ge+0.040
 destination+0.040
har+0.039
ized+0.039
Neg logit contributions
 teaches-0.029
 happens-0.028
 lived-0.028
 Amen-0.028
igator-0.027
Token avocado
Token ID40377
Feature activation+2.19e-02

Pos logit contributions
arse+0.015
ge+0.014
 destination+0.014
har+0.014
ized+0.014
Neg logit contributions
 Amen-0.010
 teaches-0.010
 happens-0.010
 lived-0.010
igator-0.009
Token make
Token ID787
Feature activation-7.09e-04

Pos logit contributions
 Amen+0.021
 teaches+0.021
 lived+0.021
 happens+0.021
igator+0.020
Neg logit contributions
arse-0.031
ge-0.029
 destination-0.029
ized-0.029
har-0.028
Token it
Token ID340
Feature activation+1.14e-02

Pos logit contributions
arse+0.001
 destination+0.001
ge+0.001
ized+0.001
har+0.001
Neg logit contributions
 Amen-0.001
 teaches-0.001
 lived-0.001
 happens-0.001
igator-0.001
Token whole
Token ID2187
Feature activation-2.32e-02

Pos logit contributions
 Amen+0.020
 teaches+0.020
 lived+0.019
 happens+0.019
igator+0.019
Neg logit contributions
arse-0.022
ge-0.021
 destination-0.021
ized-0.020
har-0.020
Token again
Token ID757
Feature activation+1.75e-02

Pos logit contributions
arse+0.048
 destination+0.046
ge+0.046
ized+0.045
har+0.045
Neg logit contributions
 Amen-0.037
 lived-0.035
 teaches-0.035
 happens-0.035
igator-0.034
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
 Amen+0.021
 teaches+0.021
 lived+0.021
 happens+0.021
igator+0.020
Neg logit contributions
arse-0.042
 destination-0.041
ge-0.041
ized-0.040
har-0.039
Token Amen
Token ID34717
Feature activation+1.20e-01

Pos logit contributions
ile+0.019
py+0.018
eer+0.018
igator+0.018
igators+0.017
Neg logit contributions
har-0.021
-0.021
arse-0.020
 destination-0.020
ishment-0.020
Token."
Token ID526
Feature activation+2.22e-02

Pos logit contributions
igator+0.186
igators+0.185
 happens+0.176
 vitamins+0.176
 teaches+0.162
Neg logit contributions
 destination-0.237
ge-0.236
har-0.229
arse-0.225
ized-0.218
Token
Token ID198
Feature activation+9.07e-03

Pos logit contributions
 Amen+0.030
 teaches+0.030
 happens+0.030
 lived+0.029
igator+0.029
Neg logit contributions
arse-0.049
 destination-0.048
ge-0.047
ized-0.047
har-0.047
Token
Token ID198
Feature activation+4.58e-03

Pos logit contributions
 happens+0.014
 teaches+0.014
igators+0.014
igator+0.014
 Amen+0.014
Neg logit contributions
arse-0.018
ge-0.018
har-0.017
ized-0.017
 destination-0.017
TokenThen
Token ID6423
Feature activation-5.36e-04

Pos logit contributions
 Amen+0.008
 happens+0.008
 teaches+0.008
 lived+0.008
igator+0.007
Neg logit contributions
arse-0.009
ge-0.008
 destination-0.008
ized-0.008
har-0.008
Token,
Token ID11
Feature activation+2.51e-03

Pos logit contributions
arse+0.001
 destination+0.001
ge+0.001
ized+0.001
har+0.001
Neg logit contributions
 Amen-0.001
 teaches-0.001
 lived-0.001
 happens-0.001
igator-0.001
Token"
Token ID1
Feature activation-3.67e-02

Pos logit contributions
 Amen+0.002
 happens+0.002
 teaches+0.002
 lived+0.002
igator+0.002
Neg logit contributions
arse-0.002
ge-0.002
ized-0.002
har-0.002
 destination-0.002
TokenThank
Token ID10449
Feature activation+1.13e-02

Pos logit contributions
arse+0.058
ized+0.057
har+0.056
 sudden+0.054
ge+0.052
Neg logit contributions
icks-0.070
igator-0.068
igators-0.068
 about-0.066
 teaches-0.066
Token you
Token ID345
Feature activation+5.93e-02

Pos logit contributions
 teaches+0.022
igator+0.022
 vitamins+0.022
 lived+0.022
eer+0.021
Neg logit contributions
ized-0.017
arse-0.016
har-0.016
 breath-0.016
ipping-0.015
Token,
Token ID11
Feature activation+2.96e-02

Pos logit contributions
 happens+0.100
 lived+0.099
igator+0.097
 Amen+0.097
igators+0.097
Neg logit contributions
arse-0.114
 destination-0.112
har-0.105
ge-0.104
 breath-0.103
Token fire
Token ID2046
Feature activation+3.36e-02

Pos logit contributions
 Amen+0.040
 happens+0.040
 vitamins+0.039
 lived+0.039
igators+0.038
Neg logit contributions
 destination-0.062
arse-0.060
har-0.058
 breath-0.057
ishment-0.057
Tokenfly
Token ID12254
Feature activation+1.18e-01

Pos logit contributions
 Amen+0.057
 teaches+0.057
 vitamins+0.056
 happens+0.054
igator+0.051
Neg logit contributions
arse-0.065
 destination-0.062
ge-0.060
har-0.060
ized-0.057
Token.
Token ID13
Feature activation+1.80e-02

Pos logit contributions
 Amen+0.221
 vitamins+0.217
 teaches+0.198
igators+0.196
 happens+0.195
Neg logit contributions
 destination-0.187
 breath-0.183
har-0.175
ized-0.174
arse-0.173
Token You
Token ID921
Feature activation+2.98e-02

Pos logit contributions
igator+0.029
igators+0.028
 gor+0.027
ile+0.027
 happens+0.026
Neg logit contributions
arse-0.031
har-0.031
-0.030
ishment-0.030
 destination-0.029
Token are
Token ID389
Feature activation+5.58e-03

Pos logit contributions
 Amen+0.042
igators+0.037
igator+0.037
 gor+0.037
 happens+0.036
Neg logit contributions
arse-0.066
 destination-0.063
ized-0.060
har-0.059
 breath-0.059
Token very
Token ID845
Feature activation-8.19e-03

Pos logit contributions
 happens+0.011
 Amen+0.010
eer+0.010
 teaches+0.010
 vitamins+0.009
Neg logit contributions
har-0.009
arse-0.009
 completely-0.009
 destination-0.009
 breath-0.008
Token kind
Token ID1611
Feature activation+2.52e-02

Pos logit contributions
har+0.014
ized+0.014
arse+0.013
 breath+0.013
 destination+0.012
Neg logit contributions
 Amen-0.014
 happens-0.014
 teaches-0.014
 lions-0.014
 gor-0.013
Token introduced
Token ID5495
Feature activation+5.41e-03

Pos logit contributions
 Amen+0.005
 lived+0.005
 teaches+0.005
 happens+0.005
igator+0.005
Neg logit contributions
arse-0.006
ge-0.005
 destination-0.005
ized-0.005
har-0.005
Token to
Token ID284
Feature activation+1.82e-02

Pos logit contributions
 Amen+0.007
 lived+0.007
 teaches+0.007
 happens+0.007
igators+0.007
Neg logit contributions
arse-0.013
ge-0.012
 destination-0.012
ized-0.012
oured-0.011
Token a
Token ID257
Feature activation+5.77e-03

Pos logit contributions
 Amen+0.028
 teaches+0.027
 lived+0.027
 happens+0.027
igator+0.026
Neg logit contributions
arse-0.038
 destination-0.037
ge-0.037
ized-0.036
har-0.035
Token smooth
Token ID7209
Feature activation+2.44e-02

Pos logit contributions
 Amen+0.009
 teaches+0.009
 lived+0.009
 happens+0.009
igator+0.009
Neg logit contributions
arse-0.012
ge-0.011
 destination-0.011
ized-0.011
har-0.011
Token grass
Token ID8701
Feature activation+3.27e-02

Pos logit contributions
 Amen+0.039
 teaches+0.039
 lived+0.039
igator+0.038
 happens+0.038
Neg logit contributions
arse-0.050
ge-0.048
har-0.046
 destination-0.046
ized-0.046
Tokenho
Token ID8873
Feature activation+1.18e-01

Pos logit contributions
 Amen+0.055
 teaches+0.054
 happens+0.053
 lived+0.053
igator+0.051
Neg logit contributions
arse-0.064
 destination-0.061
ge-0.061
ized-0.059
har-0.059
Tokenpper
Token ID2848
Feature activation-5.25e-03

Pos logit contributions
 Amen+0.236
 teaches+0.232
 lived+0.231
 happens+0.230
igator+0.224
Neg logit contributions
arse-0.194
ge-0.183
 destination-0.183
ized-0.177
har-0.175
Token called
Token ID1444
Feature activation+5.22e-03

Pos logit contributions
arse+0.009
 destination+0.009
ge+0.009
ized+0.009
har+0.008
Neg logit contributions
 Amen-0.010
 happens-0.009
 teaches-0.009
 lived-0.009
igator-0.009
Token Jim
Token ID5395
Feature activation-2.17e-03

Pos logit contributions
 Amen+0.009
 happens+0.009
 teaches+0.009
 lived+0.008
igators+0.008
Neg logit contributions
arse-0.010
 destination-0.009
ge-0.009
oured-0.009
ized-0.009
Token.
Token ID13
Feature activation-3.47e-03

Pos logit contributions
arse+0.004
ge+0.004
 destination+0.003
oured+0.003
har+0.003
Neg logit contributions
 Amen-0.004
 happens-0.004
 teaches-0.004
 lived-0.004
igator-0.004
Token Ted
Token ID11396
Feature activation-1.38e-03

Pos logit contributions
arse+0.007
 destination+0.007
ge+0.007
ized+0.007
ishment+0.007
Neg logit contributions
 Amen-0.005
 happens-0.005
igator-0.005
 teaches-0.005
 lived-0.005
Token and
Token ID290
Feature activation+9.45e-03

Pos logit contributions
 lived+0.003
 happens+0.003
 lions+0.003
 teaches+0.003
igator+0.003
Neg logit contributions
arse-0.007
ge-0.006
 destination-0.006
har-0.006
ized-0.006
Token picked
Token ID6497
Feature activation-1.54e-02

Pos logit contributions
 lived+0.014
 happens+0.013
igator+0.013
 likes+0.013
 teaches+0.012
Neg logit contributions
arse-0.022
 destination-0.021
ge-0.020
 injuries-0.019
ipping-0.019
Token up
Token ID510
Feature activation-3.80e-03

Pos logit contributions
arse+0.027
 destination+0.025
ge+0.025
ized+0.024
har+0.024
Neg logit contributions
 Amen-0.030
 teaches-0.029
 lived-0.029
 happens-0.029
igator-0.028
Token a
Token ID257
Feature activation+1.14e-03

Pos logit contributions
arse+0.009
ge+0.008
 destination+0.008
har+0.008
ized+0.008
Neg logit contributions
 lived-0.005
 Amen-0.005
 happens-0.005
 teaches-0.005
igator-0.005
Token grass
Token ID8701
Feature activation+3.53e-02

Pos logit contributions
 lived+0.002
 Amen+0.002
 happens+0.002
 teaches+0.002
igator+0.002
Neg logit contributions
arse-0.002
ge-0.002
 destination-0.002
ized-0.002
har-0.002
Tokenho
Token ID8873
Feature activation+1.17e-01

Pos logit contributions
 lived+0.051
igator+0.050
 teaches+0.050
 Amen+0.050
 happens+0.049
Neg logit contributions
arse-0.077
 destination-0.072
ge-0.072
har-0.072
ized-0.072
Tokenpper
Token ID2848
Feature activation-9.41e-03

Pos logit contributions
 Amen+0.231
 teaches+0.226
 lived+0.224
 happens+0.223
igator+0.219
Neg logit contributions
arse-0.194
 destination-0.185
ge-0.180
ized-0.178
har-0.175
Token!
Token ID0
Feature activation-1.09e-04

Pos logit contributions
arse+0.020
 destination+0.019
ge+0.019
har+0.019
ized+0.019
Neg logit contributions
 lived-0.014
 teaches-0.014
 Amen-0.014
 happens-0.013
icks-0.013
Token Both
Token ID5747
Feature activation-1.39e-02

Pos logit contributions
arse+0.000
 destination+0.000
oured+0.000
har+0.000
ized+0.000
Neg logit contributions
 Amen-0.000
 teaches-0.000
 happens-0.000
ile-0.000
igator-0.000
Token friends
Token ID2460
Feature activation+1.40e-02

Pos logit contributions
arse+0.030
 destination+0.029
ge+0.028
ized+0.028
har+0.027
Neg logit contributions
 Amen-0.022
 teaches-0.020
 lived-0.020
 happens-0.020
igator-0.019
Token were
Token ID547
Feature activation+2.38e-02

Pos logit contributions
 lived+0.022
 teaches+0.018
 happens+0.018
igator+0.018
 likes+0.017
Neg logit contributions
arse-0.033
 destination-0.032
har-0.031
ge-0.031
 injuries-0.030
Token
Token ID198
Feature activation+1.86e-03

Pos logit contributions
ile+0.017
 happens+0.017
eer+0.016
 teaches+0.016
igator+0.016
Neg logit contributions
arse-0.020
 destination-0.020
-0.019
ge-0.019
oured-0.019
Token
Token ID198
Feature activation+8.41e-04

Pos logit contributions
 happens+0.003
igator+0.003
 Amen+0.003
 teaches+0.003
eer+0.003
Neg logit contributions
ge-0.004
arse-0.003
 destination-0.003
har-0.003
ized-0.003
Token"
Token ID1
Feature activation-3.66e-02

Pos logit contributions
 happens+0.001
 Amen+0.001
 teaches+0.001
 lived+0.001
eer+0.001
Neg logit contributions
arse-0.002
ge-0.002
har-0.002
ized-0.002
 destination-0.002
TokenHello
Token ID15496
Feature activation+3.83e-02

Pos logit contributions
arse+0.063
har+0.060
ized+0.060
ge+0.059
 destination+0.057
Neg logit contributions
 Amen-0.067
icks-0.066
 lived-0.065
 happens-0.065
 teaches-0.065
Token,
Token ID11
Feature activation+3.10e-02

Pos logit contributions
 Amen+0.060
 teaches+0.059
 lived+0.056
 happens+0.055
igator+0.055
Neg logit contributions
arse-0.080
 destination-0.078
ge-0.076
ized-0.074
har-0.074
Token gir
Token ID37370
Feature activation+1.17e-01

Pos logit contributions
 Amen+0.052
 happens+0.050
 teaches+0.049
 vitamins+0.048
icks+0.047
Neg logit contributions
arse-0.057
 destination-0.056
har-0.052
ge-0.052
ized-0.052
Tokenaffe
Token ID21223
Feature activation+6.72e-02

Pos logit contributions
 Amen+0.162
 lived+0.131
 teaches+0.124
 happens+0.121
 vitamins+0.120
Neg logit contributions
ized-0.275
ge-0.262
ipping-0.257
arse-0.255
 destination-0.253
Token,"
Token ID553
Feature activation+2.17e-02

Pos logit contributions
 vitamins+0.122
 teaches+0.116
 happens+0.116
 Amen+0.115
icks+0.112
Neg logit contributions
ized-0.112
ge-0.112
har-0.108
 destination-0.106
 forth-0.106
Token Anna
Token ID11735
Feature activation+6.55e-03

Pos logit contributions
 Amen+0.033
 happens+0.033
 teaches+0.033
 lived+0.032
icks+0.031
Neg logit contributions
arse-0.043
 destination-0.042
ge-0.042
ized-0.042
har-0.041
Token said
Token ID531
Feature activation+4.68e-05

Pos logit contributions
 Amen+0.009
igator+0.009
ile+0.009
igators+0.009
 vitamins+0.009
Neg logit contributions
 destination-0.013
arse-0.013
ishment-0.013
oured-0.013
ge-0.013
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
 Amen+0.000
 teaches+0.000
 lived+0.000
 happens+0.000
igator+0.000
Neg logit contributions
arse-0.000
 destination-0.000
ge-0.000
ized-0.000
har-0.000
Token at
Token ID379
Feature activation-2.69e-03

Pos logit contributions
arse+0.070
ge+0.067
 destination+0.065
har+0.064
ized+0.063
Neg logit contributions
 teaches-0.047
 Amen-0.047
 lived-0.047
 happens-0.047
igator-0.045
Token circles
Token ID13332
Feature activation+2.04e-02

Pos logit contributions
arse+0.006
ge+0.005
har+0.005
 destination+0.005
ized+0.005
Neg logit contributions
 teaches-0.004
 Amen-0.004
 lived-0.004
igator-0.004
 happens-0.004
Token,
Token ID11
Feature activation-1.65e-03

Pos logit contributions
 Amen+0.034
 teaches+0.032
 lived+0.032
 happens+0.032
igator+0.031
Neg logit contributions
arse-0.041
 destination-0.039
ge-0.039
ized-0.038
har-0.037
Token squares
Token ID24438
Feature activation+8.19e-02

Pos logit contributions
arse+0.003
 destination+0.003
ge+0.003
ized+0.003
har+0.003
Neg logit contributions
 Amen-0.003
 teaches-0.003
 happens-0.003
 lived-0.003
igator-0.003
Token,
Token ID11
Feature activation-3.77e-04

Pos logit contributions
 Amen+0.143
 teaches+0.129
 happens+0.129
 lived+0.126
igator+0.124
Neg logit contributions
arse-0.160
 destination-0.157
ge-0.157
ized-0.153
har-0.145
Token rect
Token ID13621
Feature activation+1.16e-01

Pos logit contributions
arse+0.001
 destination+0.001
ge+0.001
ized+0.001
har+0.001
Neg logit contributions
 Amen-0.001
 happens-0.001
 teaches-0.001
 lived-0.001
igator-0.001
Tokenangles
Token ID27787
Feature activation+4.95e-02

Pos logit contributions
 teaches+0.200
 Amen+0.198
 lived+0.198
 happens+0.197
igators+0.192
Neg logit contributions
arse-0.217
ge-0.206
 destination-0.203
har-0.202
ized-0.199
Token and
Token ID290
Feature activation+5.70e-04

Pos logit contributions
 Amen+0.075
 teaches+0.074
 lived+0.074
 happens+0.073
igator+0.071
Neg logit contributions
arse-0.104
 destination-0.099
ge-0.099
ized-0.097
har-0.097
Token triangles
Token ID44360
Feature activation+7.37e-02

Pos logit contributions
 Amen+0.001
 lived+0.001
 happens+0.001
 teaches+0.001
igator+0.001
Neg logit contributions
arse-0.001
 destination-0.001
ge-0.001
ized-0.001
 forth-0.001
Token.
Token ID13
Feature activation+5.17e-03

Pos logit contributions
 Amen+0.138
 teaches+0.127
 happens+0.127
 lived+0.125
igator+0.122
Neg logit contributions
arse-0.134
ge-0.132
 destination-0.130
ized-0.127
har-0.120
Token
Token ID198
Feature activation+1.29e-04

Pos logit contributions
 happens+0.008
 teaches+0.008
igator+0.008
 Amen+0.008
ile+0.008
Neg logit contributions
 destination-0.010
arse-0.010
ge-0.009
har-0.009
ized-0.009
Tokenried
Token ID2228
Feature activation+1.07e-02

Pos logit contributions
arse+0.011
 destination+0.010
ized+0.010
ge+0.010
ishment+0.010
Neg logit contributions
 lived-0.015
 Amen-0.015
 happens-0.015
igator-0.015
 teaches-0.014
Tokenly
Token ID306
Feature activation+4.82e-02

Pos logit contributions
igator+0.021
 happens+0.021
 lived+0.021
 teaches+0.021
 Amen+0.020
Neg logit contributions
arse-0.017
ge-0.016
har-0.015
ized-0.015
ishment-0.015
Token,
Token ID11
Feature activation+1.30e-02

Pos logit contributions
 lived+0.062
 happens+0.061
 teaches+0.061
icks+0.059
 about+0.056
Neg logit contributions
arse-0.111
har-0.105
ge-0.104
 destination-0.104
ishment-0.102
Token the
Token ID262
Feature activation+1.91e-02

Pos logit contributions
 lived+0.019
 teaches+0.018
 happens+0.018
igator+0.017
 Amen+0.017
Neg logit contributions
arse-0.029
ge-0.028
 destination-0.028
har-0.027
ized-0.027
Token grass
Token ID8701
Feature activation+3.30e-02

Pos logit contributions
igator+0.024
 lived+0.023
 lions+0.023
 teaches+0.023
 Amen+0.023
Neg logit contributions
arse-0.047
ge-0.045
 destination-0.043
ized-0.043
har-0.043
Tokenho
Token ID8873
Feature activation+1.16e-01

Pos logit contributions
 lived+0.046
igator+0.044
 happens+0.044
 teaches+0.044
igators+0.042
Neg logit contributions
arse-0.076
ge-0.072
 destination-0.071
har-0.070
ishment-0.069
Tokenpper
Token ID2848
Feature activation-1.33e-02

Pos logit contributions
 lived+0.204
ile+0.197
 Amen+0.197
 happens+0.196
icks+0.194
Neg logit contributions
 destination-0.205
arse-0.204
ishment-0.200
ized-0.189
har-0.187
Token rushed
Token ID13999
Feature activation-9.19e-03

Pos logit contributions
arse+0.028
ge+0.027
 destination+0.026
ishment+0.026
har+0.026
Neg logit contributions
 lived-0.021
 happens-0.020
 teaches-0.020
igator-0.019
 likes-0.018
Token over
Token ID625
Feature activation+1.05e-02

Pos logit contributions
arse+0.019
ge+0.018
ized+0.018
har+0.017
ishment+0.017
Neg logit contributions
igator-0.014
 Amen-0.013
 teaches-0.013
 lived-0.013
 happens-0.013
Token and
Token ID290
Feature activation+6.34e-03

Pos logit contributions
 teaches+0.011
 lived+0.010
 happens+0.010
 lions+0.010
 Amen+0.010
Neg logit contributions
arse-0.028
ized-0.026
ge-0.026
 destination-0.026
har-0.025
Token tried
Token ID3088
Feature activation-2.39e-04

Pos logit contributions
 lived+0.011
igator+0.011
 happens+0.011
 Amen+0.010
 teaches+0.010
Neg logit contributions
arse-0.012
ge-0.012
 destination-0.011
ized-0.011
ishment-0.011
Token saw
Token ID2497
Feature activation+2.49e-02

Pos logit contributions
arse+0.030
ge+0.029
 destination+0.029
ized+0.029
har+0.027
Neg logit contributions
 Amen-0.029
 teaches-0.026
 happens-0.026
 lived-0.026
icks-0.025
Token monkeys
Token ID26697
Feature activation+4.66e-02

Pos logit contributions
 Amen+0.039
 teaches+0.037
 happens+0.037
 lived+0.037
igator+0.035
Neg logit contributions
arse-0.052
ge-0.050
 destination-0.050
ized-0.049
har-0.048
Token,
Token ID11
Feature activation-9.68e-04

Pos logit contributions
 Amen+0.072
 teaches+0.070
 lived+0.070
 happens+0.069
igator+0.067
Neg logit contributions
arse-0.098
ge-0.093
 destination-0.093
ized-0.091
har-0.090
Token elephants
Token ID25368
Feature activation+6.72e-02

Pos logit contributions
arse+0.002
ge+0.002
 destination+0.002
ized+0.002
har+0.002
Neg logit contributions
 Amen-0.002
 happens-0.002
 teaches-0.001
icks-0.001
 lived-0.001
Token,
Token ID11
Feature activation+5.89e-03

Pos logit contributions
 Amen+0.104
 teaches+0.100
 happens+0.099
 lived+0.097
igator+0.094
Neg logit contributions
arse-0.141
ge-0.136
 destination-0.136
ized-0.134
har-0.131
Token gir
Token ID37370
Feature activation+1.16e-01

Pos logit contributions
 Amen+0.011
 happens+0.011
 teaches+0.010
icks+0.010
eer+0.010
Neg logit contributions
ized-0.010
ge-0.010
arse-0.010
 destination-0.009
 forth-0.009
Tokenaff
Token ID2001
Feature activation+7.26e-02

Pos logit contributions
 Amen+0.226
 happens+0.200
 lived+0.197
 teaches+0.196
 vitamins+0.181
Neg logit contributions
ized-0.213
ge-0.197
 destination-0.195
arse-0.195
oured-0.194
Tokenes
Token ID274
Feature activation+5.62e-02

Pos logit contributions
 Amen+0.143
 teaches+0.132
 happens+0.130
 vitamins+0.126
igator+0.120
Neg logit contributions
ge-0.130
ized-0.121
arse-0.117
aults-0.112
ooming-0.111
Token,
Token ID11
Feature activation-1.40e-03

Pos logit contributions
 Amen+0.090
 teaches+0.087
 happens+0.086
 lived+0.085
igator+0.084
Neg logit contributions
arse-0.114
ge-0.110
 destination-0.110
ized-0.107
har-0.105
Token and
Token ID290
Feature activation+6.66e-03

Pos logit contributions
ized+0.002
 destination+0.002
ge+0.002
arse+0.002
 forth+0.002
Neg logit contributions
 Amen-0.003
 happens-0.002
 teaches-0.002
igator-0.002
icks-0.002
Token lions
Token ID31420
Feature activation+5.84e-02

Pos logit contributions
 Amen+0.010
 happens+0.010
 teaches+0.009
ile+0.009
 lived+0.009
Neg logit contributions
ge-0.012
 sudden-0.012
 forth-0.012
 breath-0.012
 destination-0.012
Token
Token ID26391
Feature activation-1.77e-03

Pos logit contributions
arse+0.003
ge+0.003
 destination+0.003
ized+0.002
har+0.002
Neg logit contributions
 Amen-0.002
 teaches-0.002
 lived-0.002
 happens-0.002
igator-0.001
Token
Token ID129
Feature activation+1.46e-02

Pos logit contributions
arse+0.004
ge+0.004
 destination+0.004
ized+0.003
har+0.003
Neg logit contributions
 lived-0.003
 Amen-0.003
 happens-0.003
 teaches-0.003
icks-0.002
Token
Token ID241
Feature activation-1.10e-02

Pos logit contributions
 Amen+0.024
 teaches+0.023
 lived+0.023
 happens+0.023
igator+0.022
Neg logit contributions
arse-0.029
 destination-0.028
ge-0.028
ized-0.027
har-0.027
TokenGr
Token ID8642
Feature activation+1.27e-02

Pos logit contributions
arse+0.022
ge+0.021
 destination+0.021
ized+0.020
har+0.020
Neg logit contributions
 Amen-0.018
 teaches-0.017
 lived-0.017
 happens-0.017
igator-0.017
Tokenass
Token ID562
Feature activation+1.41e-02

Pos logit contributions
 Amen+0.023
 happens+0.022
 lived+0.021
igator+0.021
igators+0.020
Neg logit contributions
arse-0.024
ge-0.024
 destination-0.023
har-0.023
oured-0.022
Tokenho
Token ID8873
Feature activation+1.14e-01

Pos logit contributions
 Amen+0.023
 teaches+0.022
 vitamins+0.021
 happens+0.021
icks+0.020
Neg logit contributions
arse-0.028
ge-0.028
 destination-0.027
har-0.026
ized-0.026
Tokenpper
Token ID2848
Feature activation-9.77e-03

Pos logit contributions
 lived+0.180
 Amen+0.172
ile+0.169
 happens+0.167
icks+0.166
Neg logit contributions
ishment-0.230
 destination-0.227
arse-0.226
ized-0.215
ge-0.212
Token,
Token ID11
Feature activation-9.82e-03

Pos logit contributions
arse+0.014
 destination+0.013
ge+0.013
ized+0.012
har+0.012
Neg logit contributions
 Amen-0.022
 teaches-0.021
 happens-0.021
 lived-0.021
igator-0.021
Token let
Token ID1309
Feature activation-2.56e-02

Pos logit contributions
arse+0.023
ge+0.022
 destination+0.021
ized+0.021
har+0.021
Neg logit contributions
 Amen-0.013
 lived-0.013
 teaches-0.013
 happens-0.013
igator-0.012
Tokenâ
Token ID22940
Feature activation-9.50e-03

Pos logit contributions
arse+0.055
ge+0.053
 destination+0.052
ized+0.051
har+0.051
Neg logit contributions
 Amen-0.038
 teaches-0.038
 lived-0.038
 happens-0.037
igator-0.036
Token
Token ID26391
Feature activation-6.05e-03

Pos logit contributions
arse+0.019
 destination+0.019
ge+0.019
ized+0.018
har+0.018
Neg logit contributions
 Amen-0.015
 teaches-0.015
 lived-0.015
 happens-0.015
igator-0.014
Token see
Token ID766
Feature activation-7.45e-03

Pos logit contributions
arse+0.020
ized+0.019
ge+0.019
ishment+0.019
ipping+0.018
Neg logit contributions
 lived-0.015
 Amen-0.015
 teaches-0.014
igator-0.013
igators-0.013
Token lions
Token ID31420
Feature activation+4.12e-02

Pos logit contributions
arse+0.014
 destination+0.014
ge+0.013
ized+0.013
har+0.013
Neg logit contributions
 Amen-0.013
 teaches-0.013
 happens-0.012
 lived-0.012
igator-0.012
Token,
Token ID11
Feature activation-1.20e-02

Pos logit contributions
 Amen+0.063
 teaches+0.061
 happens+0.061
 lived+0.060
igator+0.058
Neg logit contributions
arse-0.087
 destination-0.084
ge-0.084
ized-0.082
har-0.080
Token elephants
Token ID25368
Feature activation+4.97e-02

Pos logit contributions
arse+0.031
ge+0.031
 destination+0.030
ized+0.029
har+0.029
Neg logit contributions
 Amen-0.012
 happens-0.012
 teaches-0.012
 lived-0.011
icks-0.011
Token,
Token ID11
Feature activation-1.89e-03

Pos logit contributions
 Amen+0.081
 happens+0.078
 teaches+0.077
 lived+0.075
igator+0.074
Neg logit contributions
arse-0.100
ge-0.098
 destination-0.098
ized-0.094
har-0.092
Token gir
Token ID37370
Feature activation+1.12e-01

Pos logit contributions
ge+0.004
arse+0.003
 destination+0.003
ized+0.003
aults+0.003
Neg logit contributions
 Amen-0.003
 happens-0.003
 teaches-0.003
 lived-0.003
icks-0.003
Tokenaff
Token ID2001
Feature activation+7.65e-02

Pos logit contributions
 Amen+0.209
 happens+0.188
 teaches+0.188
 lived+0.184
igator+0.181
Neg logit contributions
arse-0.208
ized-0.205
ge-0.204
 destination-0.203
oured-0.195
Tokenes
Token ID274
Feature activation+6.18e-02

Pos logit contributions
 Amen+0.142
 teaches+0.133
 happens+0.133
 vitamins+0.126
igator+0.123
Neg logit contributions
ge-0.144
arse-0.142
ized-0.132
 destination-0.128
aults-0.127
Token and
Token ID290
Feature activation-1.93e-03

Pos logit contributions
 Amen+0.099
 happens+0.096
 teaches+0.096
 lived+0.095
igator+0.092
Neg logit contributions
arse-0.125
 destination-0.121
ge-0.121
ized-0.117
har-0.115
Token more
Token ID517
Feature activation+2.85e-02

Pos logit contributions
ge+0.004
arse+0.004
 destination+0.004
 breath+0.004
ized+0.004
Neg logit contributions
 Amen-0.003
 happens-0.003
 teaches-0.003
ile-0.003
icks-0.002
Token.
Token ID13
Feature activation+2.88e-03

Pos logit contributions
 Amen+0.054
 teaches+0.051
 happens+0.051
 lived+0.051
icks+0.050
Neg logit contributions
ge-0.049
 destination-0.048
ized-0.047
arse-0.047
har-0.044
Token
Token ID198
Feature activation-1.16e-02

Pos logit contributions
 destination+0.007
arse+0.007
+0.006
oured+0.006
ishment+0.006
Neg logit contributions
igator-0.006
ile-0.006
 Amen-0.006
igators-0.006
eer-0.006
Token
Token ID198
Feature activation-1.30e-02

Pos logit contributions
arse+0.024
ge+0.023
 destination+0.023
ized+0.022
har+0.022
Neg logit contributions
 Amen-0.018
igator-0.018
igators-0.017
 happens-0.017
 teaches-0.017
Token"
Token ID1
Feature activation-4.98e-02

Pos logit contributions
arse+0.029
ge+0.029
ized+0.028
har+0.028
 destination+0.028
Neg logit contributions
 Amen-0.017
 happens-0.017
 lived-0.017
eer-0.017
igator-0.016
TokenHello
Token ID15496
Feature activation+3.44e-02

Pos logit contributions
arse+0.087
ized+0.082
har+0.082
ge+0.081
 sudden+0.077
Neg logit contributions
 lived-0.089
icks-0.088
 Amen-0.087
 teaches-0.087
igators-0.087
Token,
Token ID11
Feature activation+2.59e-02

Pos logit contributions
 Amen+0.060
 teaches+0.058
igator+0.056
 vitamins+0.055
 lived+0.055
Neg logit contributions
 destination-0.066
arse-0.065
ge-0.063
ized-0.060
har-0.058
Token gir
Token ID37370
Feature activation+1.12e-01

Pos logit contributions
 Amen+0.031
 happens+0.029
 lived+0.026
 teaches+0.026
eer+0.025
Neg logit contributions
arse-0.062
 destination-0.061
ge-0.058
ized-0.057
oured-0.057
Tokenaffe
Token ID21223
Feature activation+6.89e-02

Pos logit contributions
 Amen+0.150
 happens+0.125
 lived+0.124
 vitamins+0.123
 teaches+0.120
Neg logit contributions
ized-0.262
ge-0.257
arse-0.245
ipping-0.244
 destination-0.238
Token,"
Token ID553
Feature activation+1.28e-02

Pos logit contributions
 vitamins+0.132
 happens+0.126
 teaches+0.124
 Amen+0.123
icks+0.121
Neg logit contributions
ge-0.110
ized-0.109
 forth-0.103
har-0.103
 destination-0.099
Token Ben
Token ID3932
Feature activation-7.50e-03

Pos logit contributions
 Amen+0.019
ile+0.018
 teaches+0.018
 lived+0.018
eer+0.018
Neg logit contributions
arse-0.025
 destination-0.025
ized-0.025
ge-0.024
ishment-0.024
Token said
Token ID531
Feature activation-5.55e-03

Pos logit contributions
arse+0.012
 destination+0.011
ized+0.011
ge+0.011
oured+0.011
Neg logit contributions
 Amen-0.015
 happens-0.014
igators-0.014
igator-0.014
 lived-0.014
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
arse+0.010
ge+0.009
 destination+0.009
ized+0.009
har+0.009
Neg logit contributions
 Amen-0.010
 lived-0.010
 teaches-0.010
 happens-0.010
igators-0.010
Token
Token ID198
Feature activation+8.55e-05

Pos logit contributions
 Amen+0.007
 teaches+0.007
 happens+0.006
igator+0.006
 lived+0.006
Neg logit contributions
arse-0.012
ge-0.011
 destination-0.011
har-0.011
ized-0.011
TokenThe
Token ID464
Feature activation+4.16e-03

Pos logit contributions
 Amen+0.000
 teaches+0.000
 lived+0.000
 happens+0.000
igator+0.000
Neg logit contributions
arse-0.000
 destination-0.000
ge-0.000
ized-0.000
ishment-0.000
Token nos
Token ID43630
Feature activation-2.44e-02

Pos logit contributions
 Amen+0.007
 teaches+0.007
 happens+0.007
igator+0.007
igators+0.007
Neg logit contributions
arse-0.008
 destination-0.008
ized-0.007
ge-0.007
har-0.007
Tokeny
Token ID88
Feature activation+3.74e-02

Pos logit contributions
ge+0.046
arse+0.044
ized+0.044
 destination+0.043
har+0.042
Neg logit contributions
 Amen-0.045
 teaches-0.041
 happens-0.039
icks-0.038
 vitamins-0.038
Token grass
Token ID8701
Feature activation+2.51e-02

Pos logit contributions
 Amen+0.051
 teaches+0.050
 lived+0.049
 happens+0.049
igator+0.047
Neg logit contributions
arse-0.086
ge-0.082
 destination-0.082
ized-0.080
har-0.080
Tokenho
Token ID8873
Feature activation+1.11e-01

Pos logit contributions
 Amen+0.032
 lived+0.032
 teaches+0.032
 happens+0.031
igator+0.031
Neg logit contributions
arse-0.059
ge-0.057
 destination-0.056
ized-0.055
har-0.055
Tokenpper
Token ID2848
Feature activation-1.86e-02

Pos logit contributions
 lived+0.190
ile+0.175
 happens+0.173
 Amen+0.169
icks+0.169
Neg logit contributions
 destination-0.214
arse-0.213
ishment-0.208
ge-0.195
ized-0.194
Token hopped
Token ID45230
Feature activation-6.37e-03

Pos logit contributions
arse+0.039
ge+0.037
 destination+0.037
ized+0.036
har+0.036
Neg logit contributions
 lived-0.029
 teaches-0.029
 Amen-0.029
 happens-0.029
igator-0.028
Token over
Token ID625
Feature activation+3.86e-03

Pos logit contributions
arse+0.012
ge+0.011
 destination+0.011
har+0.011
ized+0.011
Neg logit contributions
igator-0.011
 teaches-0.011
 lived-0.011
 happens-0.011
 Amen-0.011
Token to
Token ID284
Feature activation+6.51e-03

Pos logit contributions
 lived+0.005
 teaches+0.005
 lions+0.005
 happens+0.005
igators+0.004
Neg logit contributions
arse-0.010
 destination-0.009
ge-0.009
har-0.009
ized-0.009
Token Tom
Token ID4186
Feature activation-3.34e-02

Pos logit contributions
 lived+0.009
 Amen+0.009
 lions+0.008
 teaches+0.008
igator+0.008
Neg logit contributions
arse-0.016
ized-0.015
ge-0.015
har-0.015
 destination-0.014
Token and
Token ID290
Feature activation+9.08e-03

Pos logit contributions
 destination+0.003
ge+0.003
arse+0.003
ized+0.003
har+0.003
Neg logit contributions
 Amen-0.003
 happens-0.003
 teaches-0.003
icks-0.003
igator-0.003
Token brave
Token ID14802
Feature activation+4.77e-03

Pos logit contributions
 Amen+0.015
icks+0.014
 teaches+0.014
ile+0.014
 happens+0.013
Neg logit contributions
 destination-0.017
ge-0.017
arse-0.017
ized-0.016
 forth-0.016
Token people
Token ID661
Feature activation+3.77e-02

Pos logit contributions
 Amen+0.008
 teaches+0.007
 lived+0.007
igator+0.007
 happens+0.007
Neg logit contributions
arse-0.010
 destination-0.010
ge-0.009
ized-0.009
ishment-0.009
Token from
Token ID422
Feature activation+3.67e-02

Pos logit contributions
 Amen+0.067
 happens+0.067
 teaches+0.065
 vitamins+0.062
icks+0.062
Neg logit contributions
ge-0.067
arse-0.067
 destination-0.065
ized-0.063
har-0.061
Token long
Token ID890
Feature activation+4.67e-02

Pos logit contributions
 Amen+0.060
 teaches+0.058
 lived+0.058
 happens+0.058
igator+0.056
Neg logit contributions
arse-0.074
ge-0.071
 destination-0.071
ized-0.069
har-0.068
Token ago
Token ID2084
Feature activation+1.11e-01

Pos logit contributions
 Amen+0.079
 teaches+0.075
 lived+0.075
 happens+0.075
igator+0.074
Neg logit contributions
arse-0.090
 destination-0.089
ge-0.088
ized-0.085
har-0.082
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
 Amen+0.201
 happens+0.195
 teaches+0.193
igator+0.189
 lived+0.188
Neg logit contributions
arse-0.199
ge-0.196
 destination-0.196
ized-0.188
har-0.182
Token
Token ID198
Feature activation+3.15e-03

Pos logit contributions
 happens+0.020
igator+0.018
 teaches+0.018
ile+0.018
icks+0.018
Neg logit contributions
 destination-0.027
arse-0.027
ishment-0.026
ge-0.026
har-0.025
Token
Token ID198
Feature activation-6.04e-04

Pos logit contributions
 happens+0.005
igator+0.005
icks+0.005
 Amen+0.005
 vitamins+0.005
Neg logit contributions
ge-0.006
 destination-0.006
arse-0.006
ized-0.005
har-0.005
TokenOne
Token ID3198
Feature activation+2.60e-02

Pos logit contributions
arse+0.001
ge+0.001
 destination+0.001
ized+0.001
har+0.001
Neg logit contributions
 happens-0.001
 Amen-0.001
icks-0.001
 teaches-0.001
 lived-0.001
Token day
Token ID1110
Feature activation+3.09e-03

Pos logit contributions
icks+0.040
 Amen+0.039
 vitamins+0.038
 happens+0.038
 teaches+0.035
Neg logit contributions
 destination-0.050
 breath-0.047
ized-0.046
ge-0.046
har-0.046
Token coins
Token ID10796
Feature activation+4.88e-02

Pos logit contributions
arse+0.040
ge+0.039
 destination+0.038
ized+0.037
har+0.037
Neg logit contributions
 Amen-0.034
 teaches-0.032
 happens-0.032
 lived-0.032
igator-0.031
Token that
Token ID326
Feature activation-3.24e-03

Pos logit contributions
 lived+0.066
 happens+0.063
 teaches+0.062
icks+0.062
igator+0.060
Neg logit contributions
arse-0.114
har-0.109
 destination-0.105
ized-0.102
ishment-0.102
Token people
Token ID661
Feature activation-2.77e-03

Pos logit contributions
 destination+0.006
arse+0.006
ge+0.006
ized+0.006
har+0.006
Neg logit contributions
 Amen-0.005
 teaches-0.005
 happens-0.005
icks-0.005
igator-0.005
Token used
Token ID973
Feature activation-2.57e-03

Pos logit contributions
arse+0.006
 destination+0.006
har+0.006
ge+0.006
ized+0.006
Neg logit contributions
 lived-0.004
 teaches-0.004
 happens-0.004
 Amen-0.004
igator-0.004
Token long
Token ID890
Feature activation+4.61e-02

Pos logit contributions
arse+0.005
 destination+0.005
ge+0.005
ized+0.005
har+0.005
Neg logit contributions
 Amen-0.004
 teaches-0.004
 lived-0.004
 happens-0.004
igator-0.004
Token ago
Token ID2084
Feature activation+1.11e-01

Pos logit contributions
 Amen+0.083
 teaches+0.079
 lived+0.077
 happens+0.077
igator+0.077
Neg logit contributions
arse-0.086
ge-0.084
 destination-0.083
ized-0.080
har-0.078
Token.
Token ID13
Feature activation+4.99e-03

Pos logit contributions
 Amen+0.194
 teaches+0.187
 happens+0.185
 lived+0.184
igator+0.180
Neg logit contributions
arse-0.210
ge-0.203
 destination-0.202
ized-0.196
har-0.193
Token They
Token ID1119
Feature activation-1.01e-02

Pos logit contributions
 Amen+0.007
igator+0.007
 teaches+0.007
ile+0.007
 happens+0.007
Neg logit contributions
 destination-0.011
arse-0.011
ized-0.010
ge-0.010
ishment-0.010
Token saw
Token ID2497
Feature activation-5.18e-03

Pos logit contributions
arse+0.024
 destination+0.022
har+0.022
ge+0.022
ipping+0.021
Neg logit contributions
 lived-0.015
 teaches-0.014
igator-0.014
 Amen-0.013
igators-0.013
Token paintings
Token ID21641
Feature activation+3.03e-02

Pos logit contributions
 destination+0.010
arse+0.010
ge+0.010
ized+0.010
ishment+0.009
Neg logit contributions
 Amen-0.009
 teaches-0.008
 happens-0.008
igator-0.008
 lived-0.007
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
 Amen+0.048
 teaches+0.046
 happens+0.045
 lived+0.045
igator+0.044
Neg logit contributions
arse-0.063
 destination-0.061
ge-0.061
ized-0.060
har-0.058
Token and
Token ID290
Feature activation-5.25e-03

Pos logit contributions
 teaches+0.023
igator+0.023
 lived+0.023
 happens+0.022
icks+0.022
Neg logit contributions
arse-0.025
 destination-0.024
ge-0.023
har-0.023
ized-0.023
Token pointed
Token ID6235
Feature activation-1.19e-02

Pos logit contributions
arse+0.011
ge+0.010
 destination+0.010
ized+0.010
har+0.010
Neg logit contributions
 happens-0.009
 lived-0.009
 teaches-0.008
igator-0.008
 likes-0.008
Token at
Token ID379
Feature activation-3.69e-03

Pos logit contributions
arse+0.024
 destination+0.023
ge+0.023
ized+0.023
har+0.022
Neg logit contributions
 Amen-0.020
 teaches-0.019
 lived-0.019
 happens-0.018
igator-0.018
Token the
Token ID262
Feature activation+1.85e-03

Pos logit contributions
arse+0.009
 destination+0.008
ge+0.008
har+0.008
ized+0.008
Neg logit contributions
 lived-0.005
 teaches-0.005
 happens-0.005
igator-0.005
 Amen-0.005
Token grass
Token ID8701
Feature activation+2.51e-02

Pos logit contributions
 happens+0.003
 likes+0.003
 teaches+0.003
 lived+0.003
igator+0.002
Neg logit contributions
arse-0.004
ge-0.004
 destination-0.004
ized-0.004
har-0.004
Tokenho
Token ID8873
Feature activation+1.11e-01

Pos logit contributions
 Amen+0.040
 teaches+0.040
 lived+0.039
 happens+0.039
igator+0.038
Neg logit contributions
arse-0.051
ge-0.048
 destination-0.048
har-0.047
ized-0.047
Tokenpper
Token ID2848
Feature activation-1.80e-02

Pos logit contributions
 Amen+0.212
 teaches+0.208
 happens+0.208
 lived+0.207
icks+0.201
Neg logit contributions
arse-0.189
 destination-0.181
ge-0.176
ized-0.173
har-0.172
Token.
Token ID13
Feature activation-6.79e-04

Pos logit contributions
arse+0.038
 destination+0.036
ge+0.036
har+0.035
ized+0.035
Neg logit contributions
 lived-0.027
 teaches-0.027
 Amen-0.027
 happens-0.027
icks-0.026
Token He
Token ID679
Feature activation+1.70e-02

Pos logit contributions
 destination+0.002
arse+0.002
ge+0.002
har+0.002
ized+0.002
Neg logit contributions
 Amen-0.001
 lived-0.001
 teaches-0.001
igators-0.001
igator-0.001
Token said
Token ID531
Feature activation+4.14e-03

Pos logit contributions
 lived+0.027
 teaches+0.027
 happens+0.026
 Amen+0.025
igator+0.024
Neg logit contributions
arse-0.037
 destination-0.034
ge-0.034
har-0.033
ized-0.032
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
 Amen+0.010
igators+0.009
eer+0.009
 vitamins+0.009
 teaches+0.009
Neg logit contributions
 destination-0.004
ooming-0.004
 breath-0.004
ized-0.004
ipping-0.004
Token El
Token ID2574
Feature activation+2.56e-02

Pos logit contributions
 destination+0.011
arse+0.011
har+0.010
oured+0.010
ge+0.010
Neg logit contributions
ile-0.009
igator-0.008
igators-0.008
 teaches-0.008
 happens-0.008
Tokenmo
Token ID5908
Feature activation+2.75e-02

Pos logit contributions
 Amen+0.037
 happens+0.036
 teaches+0.036
 lived+0.035
icks+0.034
Neg logit contributions
arse-0.057
ge-0.056
 destination-0.054
har-0.053
ized-0.053
Token thanked
Token ID26280
Feature activation+2.29e-02

Pos logit contributions
 lived+0.044
 Amen+0.043
 teaches+0.043
 happens+0.042
igator+0.041
Neg logit contributions
arse-0.057
ge-0.054
 destination-0.054
har-0.052
ized-0.052
Token the
Token ID262
Feature activation+3.66e-02

Pos logit contributions
 Amen+0.040
 teaches+0.040
 happens+0.040
 lived+0.039
igator+0.038
Neg logit contributions
arse-0.042
 destination-0.040
ge-0.040
ized-0.039
har-0.039
Token dragon
Token ID10441
Feature activation+6.33e-02

Pos logit contributions
icks+0.044
 lived+0.044
emon+0.043
igators+0.041
 happens+0.040
Neg logit contributions
arse-0.083
 destination-0.082
 source-0.081
 breath-0.080
ized-0.077
Tokenfly
Token ID12254
Feature activation+1.10e-01

Pos logit contributions
 happens+0.112
 Amen+0.108
 teaches+0.106
 vitamins+0.104
 lived+0.103
Neg logit contributions
arse-0.119
ge-0.116
 destination-0.116
har-0.108
ized-0.107
Token and
Token ID290
Feature activation+7.71e-03

Pos logit contributions
 Amen+0.183
 happens+0.180
 lived+0.176
 teaches+0.175
igators+0.172
Neg logit contributions
arse-0.216
 destination-0.211
ge-0.207
ized-0.200
har-0.198
Token promised
Token ID8072
Feature activation+2.11e-02

Pos logit contributions
 lived+0.013
 Amen+0.012
 teaches+0.012
 happens+0.012
igator+0.012
Neg logit contributions
arse-0.016
ge-0.015
 destination-0.015
ized-0.015
har-0.015
Token to
Token ID284
Feature activation+3.79e-02

Pos logit contributions
 Amen+0.026
 lived+0.026
 happens+0.025
 teaches+0.025
 lions+0.023
Neg logit contributions
arse-0.051
ge-0.047
ized-0.047
 destination-0.047
ishment-0.047
Token take
Token ID1011
Feature activation+1.55e-02

Pos logit contributions
 Amen+0.062
 lived+0.060
 teaches+0.060
 happens+0.057
igator+0.057
Neg logit contributions
arse-0.078
ge-0.074
 destination-0.073
ized-0.073
har-0.071
Token good
Token ID922
Feature activation+3.17e-02

Pos logit contributions
 happens+0.024
 Amen+0.024
 teaches+0.023
 lived+0.023
icks+0.022
Neg logit contributions
arse-0.034
ge-0.033
ized-0.031
 destination-0.031
har-0.031
Token saw
Token ID2497
Feature activation-9.32e-03

Pos logit contributions
arse+0.029
ge+0.028
 destination+0.028
ized+0.027
har+0.026
Neg logit contributions
 Amen-0.023
 happens-0.022
 teaches-0.022
 lived-0.021
igator-0.021
Token lions
Token ID31420
Feature activation+3.99e-02

Pos logit contributions
 destination+0.020
ized+0.020
arse+0.020
ge+0.019
 sudden+0.019
Neg logit contributions
 Amen-0.014
 teaches-0.012
 happens-0.012
igator-0.012
icks-0.011
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
 Amen+0.068
 happens+0.063
 teaches+0.063
igator+0.060
 vitamins+0.060
Neg logit contributions
arse-0.079
 destination-0.077
ge-0.077
ized-0.076
har-0.072
Token elephants
Token ID25368
Feature activation+4.88e-02

Pos logit contributions
arse+0.040
ge+0.039
ized+0.038
 destination+0.038
har+0.037
Neg logit contributions
 Amen-0.018
 happens-0.018
 teaches-0.017
icks-0.016
 lived-0.016
Token,
Token ID11
Feature activation-5.81e-03

Pos logit contributions
 Amen+0.088
 happens+0.082
 teaches+0.081
igator+0.078
 vitamins+0.077
Neg logit contributions
arse-0.091
ge-0.091
 destination-0.090
ized-0.087
har-0.083
Token gir
Token ID37370
Feature activation+1.10e-01

Pos logit contributions
ge+0.009
arse+0.009
ized+0.009
 destination+0.009
 breath+0.009
Neg logit contributions
 Amen-0.011
 happens-0.011
 teaches-0.010
 lived-0.010
icks-0.010
Tokenaff
Token ID2001
Feature activation+6.96e-02

Pos logit contributions
 Amen+0.210
 happens+0.185
 teaches+0.183
 lived+0.181
igator+0.178
Neg logit contributions
ized-0.202
arse-0.200
 destination-0.196
ge-0.196
oured-0.191
Tokenes
Token ID274
Feature activation+5.26e-02

Pos logit contributions
 Amen+0.139
 teaches+0.127
 happens+0.125
 vitamins+0.121
igator+0.119
Neg logit contributions
ge-0.125
arse-0.120
ized-0.113
aults-0.109
ooming-0.109
Token and
Token ID290
Feature activation-5.93e-03

Pos logit contributions
 Amen+0.093
 happens+0.089
 teaches+0.088
igator+0.085
 lived+0.085
Neg logit contributions
arse-0.098
 destination-0.097
ge-0.096
ized-0.094
har-0.089
Token ze
Token ID41271
Feature activation+4.64e-02

Pos logit contributions
ge+0.012
arse+0.012
 sudden+0.012
 destination+0.012
ized+0.012
Neg logit contributions
 Amen-0.008
 teaches-0.008
 happens-0.007
ile-0.007
icks-0.007
Tokenbr
Token ID1671
Feature activation+3.61e-02

Pos logit contributions
 Amen+0.057
 teaches+0.043
 lived+0.043
 vitamins+0.041
igator+0.041
Neg logit contributions
arse-0.115
ized-0.111
ipping-0.110
ooming-0.110
ge-0.110
Token she
Token ID673
Feature activation+2.04e-02

Pos logit contributions
 destination+0.049
arse+0.047
ge+0.047
ized+0.044
har+0.044
Neg logit contributions
 Amen-0.048
igators-0.044
igator-0.044
eer-0.042
 happens-0.042
Token made
Token ID925
Feature activation+2.15e-02

Pos logit contributions
 Amen+0.028
igators+0.026
igator+0.026
icks+0.025
 teaches+0.025
Neg logit contributions
arse-0.045
ge-0.044
 destination-0.044
ized-0.044
har-0.043
Token a
Token ID257
Feature activation-4.11e-03

Pos logit contributions
 teaches+0.028
 happens+0.027
 Amen+0.026
icks+0.025
 lived+0.025
Neg logit contributions
arse-0.053
ge-0.050
 destination-0.050
ized-0.049
har-0.049
Token long
Token ID890
Feature activation+2.09e-02

Pos logit contributions
arse+0.008
ge+0.007
 destination+0.007
ized+0.007
har+0.007
Neg logit contributions
 Amen-0.007
 teaches-0.007
 lived-0.007
 happens-0.007
igator-0.007
Token time
Token ID640
Feature activation-1.76e-02

Pos logit contributions
 Amen+0.036
 happens+0.036
 teaches+0.035
igators+0.035
 lived+0.035
Neg logit contributions
arse-0.039
 destination-0.038
ge-0.038
har-0.036
ized-0.035
Token ago
Token ID2084
Feature activation+1.10e-01

Pos logit contributions
arse+0.035
ge+0.034
 destination+0.034
ized+0.033
har+0.032
Neg logit contributions
 Amen-0.029
 teaches-0.028
 lived-0.028
 happens-0.028
igator-0.027
Token.
Token ID13
Feature activation+6.63e-03

Pos logit contributions
 Amen+0.182
 teaches+0.176
 happens+0.175
 lived+0.174
igator+0.170
Neg logit contributions
arse-0.217
ge-0.209
 destination-0.209
ized-0.203
har-0.201
Token Rose
Token ID8049
Feature activation+7.31e-03

Pos logit contributions
 Amen+0.010
igator+0.010
ile+0.010
igators+0.009
eer+0.009
Neg logit contributions
arse-0.014
 destination-0.014
ized-0.013
ge-0.013
har-0.013
Token had
Token ID550
Feature activation+1.21e-02

Pos logit contributions
 Amen+0.012
 teaches+0.012
 happens+0.012
 lived+0.012
igator+0.012
Neg logit contributions
arse-0.014
 destination-0.014
ge-0.013
ized-0.013
har-0.013
Token promised
Token ID8072
Feature activation+9.14e-03

Pos logit contributions
 teaches+0.016
 happens+0.016
 lived+0.016
 Amen+0.015
 vitamins+0.015
Neg logit contributions
arse-0.029
ge-0.028
 destination-0.027
har-0.027
ized-0.027
Token always
Token ID1464
Feature activation+4.61e-02

Pos logit contributions
 teaches+0.009
 about+0.009
 happens+0.009
 lived+0.009
 lions+0.008
Neg logit contributions
arse-0.026
ized-0.024
ishment-0.023
ge-0.023
 destination-0.023
Token area
Token ID1989
Feature activation+4.36e-03

Pos logit contributions
arse+0.051
ge+0.048
 destination+0.047
ized+0.046
har+0.046
Neg logit contributions
 Amen-0.046
 teaches-0.046
 lived-0.046
 happens-0.045
igator-0.045
Token in
Token ID287
Feature activation-9.04e-03

Pos logit contributions
 lived+0.006
 teaches+0.006
igator+0.006
 happens+0.006
igators+0.005
Neg logit contributions
arse-0.010
 destination-0.009
ge-0.009
har-0.009
ishment-0.009
Token the
Token ID262
Feature activation-9.85e-03

Pos logit contributions
arse+0.024
ipping+0.023
oured+0.023
ooming+0.022
har+0.022
Neg logit contributions
 teaches-0.010
 lived-0.010
 lions-0.010
 Amen-0.009
igators-0.009
Token river
Token ID7850
Feature activation+3.94e-03

Pos logit contributions
ge+0.022
arse+0.021
ized+0.019
oured+0.019
ipping+0.019
Neg logit contributions
 lived-0.016
 teaches-0.015
 Amen-0.015
igator-0.015
 lions-0.014
Token where
Token ID810
Feature activation-2.32e-02

Pos logit contributions
 lived+0.006
 teaches+0.006
 Amen+0.006
igator+0.006
 happens+0.006
Neg logit contributions
arse-0.008
ge-0.008
 destination-0.008
har-0.008
ized-0.008
Token Spot
Token ID15899
Feature activation-9.99e-02

Pos logit contributions
arse+0.050
ge+0.047
 destination+0.047
ized+0.046
har+0.046
Neg logit contributions
 Amen-0.035
 teaches-0.034
 lived-0.034
 happens-0.034
igator-0.033
Token could
Token ID714
Feature activation-2.68e-02

Pos logit contributions
arse+0.244
ge+0.230
har+0.229
 destination+0.225
ized+0.224
Neg logit contributions
 lived-0.137
 happens-0.121
 teaches-0.119
eer-0.113
 likes-0.113
Token explore
Token ID7301
Feature activation-5.48e-02

Pos logit contributions
arse+0.051
ge+0.047
ishment+0.046
ized+0.046
har+0.046
Neg logit contributions
 lived-0.049
 Amen-0.048
 teaches-0.047
igator-0.046
icks-0.046
Token with
Token ID351
Feature activation+2.12e-03

Pos logit contributions
arse+0.112
ge+0.104
 destination+0.104
har+0.103
ized+0.101
Neg logit contributions
 lived-0.089
 Amen-0.088
 teaches-0.088
 happens-0.087
igator-0.084
Token no
Token ID645
Feature activation+7.84e-03

Pos logit contributions
 lived+0.003
 teaches+0.002
 lions+0.002
igators+0.002
 happens+0.002
Neg logit contributions
arse-0.005
har-0.005
ge-0.005
oured-0.005
 destination-0.005
Token fear
Token ID3252
Feature activation-3.28e-02

Pos logit contributions
 Amen+0.013
 teaches+0.012
 lions+0.012
 lived+0.012
 happens+0.012
Neg logit contributions
arse-0.016
ge-0.015
ized-0.015
har-0.015
oured-0.015
Token They
Token ID1119
Feature activation-1.57e-02

Pos logit contributions
 destination+0.010
ishment+0.009
har+0.009
arse+0.009
ized+0.009
Neg logit contributions
 Amen-0.007
igator-0.006
 happens-0.006
 teaches-0.006
ile-0.006
Token found
Token ID1043
Feature activation-1.41e-02

Pos logit contributions
arse+0.039
ge+0.037
 destination+0.036
ipping+0.035
har+0.035
Neg logit contributions
 lived-0.023
 teaches-0.020
 happens-0.019
igator-0.019
 Amen-0.019
Token a
Token ID257
Feature activation-3.01e-02

Pos logit contributions
arse+0.026
 destination+0.026
ized+0.026
ge+0.025
oured+0.025
Neg logit contributions
 Amen-0.025
 teaches-0.023
 lived-0.022
 happens-0.022
igator-0.021
Token place
Token ID1295
Feature activation-4.36e-02

Pos logit contributions
arse+0.067
ge+0.063
ized+0.063
ishment+0.061
har+0.060
Neg logit contributions
 lived-0.043
 teaches-0.043
 Amen-0.042
 lions-0.042
 happens-0.042
Token where
Token ID810
Feature activation-1.86e-02

Pos logit contributions
arse+0.103
ge+0.097
 destination+0.097
ized+0.095
har+0.094
Neg logit contributions
 lived-0.057
 teaches-0.057
 Amen-0.056
 happens-0.055
igator-0.054
Token Spot
Token ID15899
Feature activation-9.43e-02

Pos logit contributions
arse+0.037
 destination+0.037
ge+0.036
ized+0.036
har+0.035
Neg logit contributions
 Amen-0.030
 teaches-0.028
 happens-0.027
icks-0.027
ile-0.026
Token could
Token ID714
Feature activation-2.24e-02

Pos logit contributions
arse+0.245
ge+0.228
har+0.227
 destination+0.223
ized+0.222
Neg logit contributions
 lived-0.120
 happens-0.107
 teaches-0.106
 likes-0.097
 lions-0.096
Token fit
Token ID4197
Feature activation-2.17e-02

Pos logit contributions
arse+0.043
ge+0.040
 destination+0.040
ized+0.040
har+0.039
Neg logit contributions
 Amen-0.039
 lived-0.039
 teaches-0.039
igator-0.037
 happens-0.037
Token in
Token ID287
Feature activation-4.39e-03

Pos logit contributions
arse+0.051
har+0.046
 destination+0.046
ge+0.046
oured+0.045
Neg logit contributions
 lived-0.030
igators-0.029
igator-0.029
 teaches-0.029
 happens-0.028
Token.
Token ID13
Feature activation-9.69e-03

Pos logit contributions
arse+0.012
oured+0.011
har+0.011
ized+0.010
ipping+0.010
Neg logit contributions
 lived-0.005
 teaches-0.005
 lions-0.005
 gor-0.005
 vitamins-0.005
Token They
Token ID1119
Feature activation-1.80e-02

Pos logit contributions
 destination+0.022
arse+0.021
har+0.021
ge+0.021
ishment+0.021
Neg logit contributions
 Amen-0.013
 teaches-0.012
 happens-0.012
igator-0.012
ile-0.011
Token save
Token ID3613
Feature activation-2.03e-02

Pos logit contributions
oured+0.009
ishment+0.009
arse+0.008
ipping+0.008
ized+0.008
Neg logit contributions
 teaches-0.006
 lived-0.006
 Amen-0.006
igator-0.006
ile-0.005
Token his
Token ID465
Feature activation-5.19e-03

Pos logit contributions
arse+0.050
har+0.048
oured+0.048
ized+0.046
rolling+0.046
Neg logit contributions
 happens-0.025
 teaches-0.024
 lived-0.023
 lions-0.023
igators-0.022
Token pup
Token ID15552
Feature activation-3.53e-02

Pos logit contributions
arse+0.011
ge+0.010
ized+0.010
har+0.010
oured+0.010
Neg logit contributions
 happens-0.009
 teaches-0.008
 lions-0.008
 vitamins-0.008
 likes-0.008
Token from
Token ID422
Feature activation+2.15e-02

Pos logit contributions
arse+0.079
har+0.073
ized+0.073
ge+0.072
 destination+0.072
Neg logit contributions
 teaches-0.051
 lived-0.049
igator-0.048
 happens-0.047
igators-0.047
Token the
Token ID262
Feature activation+7.71e-03

Pos logit contributions
 lived+0.030
 lions+0.029
 teaches+0.029
 Amen+0.029
 happens+0.028
Neg logit contributions
arse-0.049
har-0.046
oured-0.046
ized-0.044
ge-0.044
Token injuries
Token ID6821
Feature activation-9.37e-02

Pos logit contributions
 teaches+0.012
 Amen+0.011
 vitamins+0.011
 happens+0.011
 lions+0.011
Neg logit contributions
ge-0.017
arse-0.016
oured-0.016
ized-0.016
har-0.016
Token and
Token ID290
Feature activation-1.54e-02

Pos logit contributions
arse+0.206
har+0.200
 destination+0.197
ishment+0.196
ized+0.194
Neg logit contributions
 lived-0.133
 teaches-0.130
 Amen-0.125
 happens-0.124
igator-0.121
Token the
Token ID262
Feature activation-2.85e-03

Pos logit contributions
arse+0.033
ge+0.031
 destination+0.031
ized+0.030
har+0.030
Neg logit contributions
 Amen-0.024
 lived-0.023
 teaches-0.023
 happens-0.023
igator-0.022
Token pup
Token ID15552
Feature activation-4.53e-02

Pos logit contributions
ge+0.006
arse+0.006
ized+0.006
oured+0.006
ishment+0.006
Neg logit contributions
 teaches-0.004
 Amen-0.004
 lions-0.004
 vitamins-0.004
 happens-0.004
Token died
Token ID3724
Feature activation-3.02e-02

Pos logit contributions
arse+0.103
ge+0.097
ized+0.095
ishment+0.095
har+0.094
Neg logit contributions
 lived-0.067
 teaches-0.064
 happens-0.061
 Amen-0.060
igator-0.059
Token soon
Token ID2582
Feature activation+1.64e-02

Pos logit contributions
arse+0.072
oured+0.067
ized+0.067
ishment+0.067
har+0.066
Neg logit contributions
 lived-0.039
 happens-0.035
eer-0.034
 teaches-0.034
 about-0.034
Token the
Token ID262
Feature activation-9.06e-03

Pos logit contributions
arse+0.020
ge+0.018
har+0.018
ized+0.018
 destination+0.018
Neg logit contributions
 lived-0.009
 teaches-0.008
 Amen-0.008
igator-0.008
 lions-0.008
Token spot
Token ID4136
Feature activation-2.72e-02

Pos logit contributions
arse+0.019
ge+0.018
ized+0.018
ishment+0.017
oured+0.017
Neg logit contributions
 lived-0.015
 lions-0.015
igator-0.014
eer-0.014
 gorilla-0.014
Token where
Token ID810
Feature activation-1.39e-02

Pos logit contributions
arse+0.060
 destination+0.055
ge+0.055
har+0.055
ized+0.054
Neg logit contributions
 lived-0.040
 teaches-0.040
igator-0.039
 Amen-0.039
 happens-0.038
Token he
Token ID339
Feature activation-5.18e-02

Pos logit contributions
arse+0.032
ge+0.030
 destination+0.030
ized+0.029
har+0.029
Neg logit contributions
 Amen-0.019
 teaches-0.019
 lived-0.019
 happens-0.019
igator-0.018
Token buried
Token ID11694
Feature activation-2.69e-02

Pos logit contributions
arse+0.106
ge+0.099
 destination+0.098
ized+0.095
har+0.095
Neg logit contributions
 lived-0.088
 teaches-0.085
 happens-0.085
 Amen-0.083
igator-0.080
Token Spot
Token ID15899
Feature activation-9.37e-02

Pos logit contributions
arse+0.064
ge+0.061
 destination+0.060
har+0.060
ized+0.059
Neg logit contributions
 lived-0.034
 teaches-0.034
 Amen-0.034
 happens-0.033
igator-0.033
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
arse+0.202
 destination+0.190
har+0.190
ge+0.188
ized+0.188
Neg logit contributions
 lived-0.144
 teaches-0.137
 happens-0.136
 Amen-0.132
igator-0.132
Token He
Token ID679
Feature activation-3.74e-03

Pos logit contributions
 destination+0.031
arse+0.030
har+0.029
ge+0.029
ized+0.028
Neg logit contributions
 Amen-0.017
igator-0.016
ile-0.016
 happens-0.016
 teaches-0.016
Token wrote
Token ID2630
Feature activation+6.76e-03

Pos logit contributions
arse+0.007
 destination+0.007
ge+0.007
ized+0.007
har+0.007
Neg logit contributions
 Amen-0.007
 teaches-0.006
 happens-0.006
igator-0.006
igators-0.006
Token Spot
Token ID15899
Feature activation-6.91e-02

Pos logit contributions
 Amen+0.011
igators+0.010
igator+0.010
 happens+0.010
 teaches+0.010
Neg logit contributions
 destination-0.013
arse-0.013
ized-0.013
ge-0.013
oured-0.013
Token's
Token ID338
Feature activation-1.77e-02

Pos logit contributions
arse+0.142
 destination+0.135
ge+0.135
ized+0.132
har+0.131
Neg logit contributions
 Amen-0.110
 teaches-0.108
 lived-0.108
 happens-0.106
igator-0.103
Token in
Token ID287
Feature activation-9.11e-03

Pos logit contributions
arse+0.014
ge+0.012
 destination+0.012
ishment+0.012
har+0.012
Neg logit contributions
igator-0.013
 lived-0.013
igators-0.012
 teaches-0.012
 happens-0.012
Token a
Token ID257
Feature activation-1.97e-02

Pos logit contributions
arse+0.024
rolling+0.023
oured+0.023
ge+0.023
ipping+0.022
Neg logit contributions
 teaches-0.010
 lived-0.009
 Amen-0.009
 vitamins-0.008
 lions-0.008
Token safe
Token ID3338
Feature activation-5.05e-02

Pos logit contributions
arse+0.038
ge+0.037
ized+0.035
 destination+0.035
ishment+0.034
Neg logit contributions
 Amen-0.034
 teaches-0.033
 lived-0.033
igator-0.033
 happens-0.032
Token place
Token ID1295
Feature activation-2.80e-02

Pos logit contributions
arse+0.097
ge+0.086
ized+0.084
har+0.083
oured+0.083
Neg logit contributions
 lived-0.091
 Amen-0.090
igator-0.090
 teaches-0.089
igators-0.086
Token where
Token ID810
Feature activation-9.77e-03

Pos logit contributions
arse+0.068
ge+0.061
har+0.060
 destination+0.060
ipping+0.059
Neg logit contributions
 teaches-0.037
 lived-0.036
igator-0.035
 about-0.033
 happens-0.033
Token Tommy
Token ID19919
Feature activation-9.37e-02

Pos logit contributions
arse+0.021
 destination+0.020
ge+0.020
ized+0.020
har+0.019
Neg logit contributions
 Amen-0.015
 teaches-0.014
 lived-0.014
 happens-0.014
igator-0.013
Token couldn
Token ID3521
Feature activation-9.35e-03

Pos logit contributions
arse+0.223
 destination+0.207
ge+0.206
har+0.204
ized+0.202
Neg logit contributions
 lived-0.129
 teaches-0.122
 happens-0.121
igator-0.115
 Amen-0.115
Token't
Token ID470
Feature activation-2.16e-02

Pos logit contributions
arse+0.018
ge+0.017
 destination+0.017
ized+0.017
har+0.017
Neg logit contributions
 Amen-0.016
 teaches-0.015
 lived-0.015
 happens-0.015
igator-0.015
Token reach
Token ID3151
Feature activation-5.04e-02

Pos logit contributions
arse+0.043
ishment+0.040
oured+0.039
 destination+0.039
ipping+0.038
Neg logit contributions
 lived-0.038
 Amen-0.037
igator-0.037
 teaches-0.035
igators-0.033
Token it
Token ID340
Feature activation-1.34e-02

Pos logit contributions
arse+0.120
har+0.111
ge+0.109
 destination+0.108
ized+0.107
Neg logit contributions
 happens-0.064
 lived-0.063
 lions-0.062
igator-0.062
 teaches-0.061
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
arse+0.031
har+0.030
ishment+0.029
ized+0.029
oured+0.029
Neg logit contributions
 lived-0.017
 teaches-0.016
igator-0.016
 about-0.016
 happens-0.016
Token their
Token ID511
Feature activation-1.48e-02

Pos logit contributions
arse+0.008
ge+0.008
 destination+0.008
ized+0.007
har+0.007
Neg logit contributions
 Amen-0.004
 teaches-0.004
 lived-0.004
 happens-0.004
igator-0.004
Token mom
Token ID1995
Feature activation-2.16e-02

Pos logit contributions
arse+0.034
ge+0.033
 destination+0.032
ized+0.032
har+0.032
Neg logit contributions
 Amen-0.020
 teaches-0.019
 lived-0.019
 happens-0.019
igator-0.018
Token and
Token ID290
Feature activation+1.13e-02

Pos logit contributions
arse+0.039
ge+0.039
 destination+0.038
ized+0.036
har+0.036
Neg logit contributions
 Amen-0.039
 happens-0.037
 teaches-0.037
 lived-0.036
igator-0.035
Token dad
Token ID9955
Feature activation-9.59e-03

Pos logit contributions
 Amen+0.022
 teaches+0.021
 happens+0.021
 lived+0.021
ile+0.021
Neg logit contributions
arse-0.019
 destination-0.018
ge-0.018
ized-0.017
har-0.017
Token what
Token ID644
Feature activation-4.57e-02

Pos logit contributions
arse+0.019
ge+0.018
 destination+0.018
ized+0.017
har+0.017
Neg logit contributions
 Amen-0.016
 teaches-0.016
 happens-0.016
 lived-0.016
igator-0.015
Token Spot
Token ID15899
Feature activation-9.32e-02

Pos logit contributions
 destination+0.063
 breath+0.057
ge+0.057
 separate+0.057
 forth+0.056
Neg logit contributions
py-0.091
ile-0.090
 Amen-0.090
igator-0.084
eer-0.083
Token can
Token ID460
Feature activation-5.45e-02

Pos logit contributions
 destination+0.246
arse+0.243
ge+0.239
ized+0.234
oured+0.230
Neg logit contributions
 Amen-0.092
igator-0.069
 teaches-0.067
igators-0.064
ile-0.063
Token do
Token ID466
Feature activation-1.84e-02

Pos logit contributions
arse+0.101
ge+0.100
 destination+0.098
ized+0.094
har+0.091
Neg logit contributions
 Amen-0.095
 happens-0.091
 teaches-0.090
 lived-0.087
 vitamins-0.087
Token.
Token ID13
Feature activation-8.63e-03

Pos logit contributions
ge+0.029
ized+0.027
arse+0.026
 destination+0.026
ipping+0.025
Neg logit contributions
 Amen-0.039
 happens-0.036
eer-0.035
igator-0.035
igators-0.035
Token They
Token ID1119
Feature activation-1.77e-02

Pos logit contributions
arse+0.022
 destination+0.022
ge+0.022
ized+0.021
har+0.021
Neg logit contributions
 Amen-0.009
 teaches-0.009
 happens-0.008
 lived-0.008
igator-0.008
Token take
Token ID1011
Feature activation-8.90e-03

Pos logit contributions
arse+0.036
ge+0.035
har+0.035
ized+0.035
 destination+0.033
Neg logit contributions
 lived-0.029
 Amen-0.029
 teaches-0.029
igator-0.028
igators-0.028
Token was
Token ID373
Feature activation+2.00e-02

Pos logit contributions
 Amen+0.026
 teaches+0.024
 happens+0.024
 lived+0.023
igator+0.023
Neg logit contributions
arse-0.045
 destination-0.043
ge-0.043
ized-0.042
har-0.041
Token the
Token ID262
Feature activation-4.84e-03

Pos logit contributions
 Amen+0.032
 teaches+0.031
 happens+0.031
 lived+0.031
igator+0.030
Neg logit contributions
arse-0.041
 destination-0.039
ge-0.039
ized-0.038
har-0.038
Token story
Token ID1621
Feature activation+9.70e-03

Pos logit contributions
ge+0.012
arse+0.011
ized+0.011
har+0.011
oured+0.011
Neg logit contributions
 Amen-0.006
 teaches-0.006
igator-0.006
 happens-0.006
 lived-0.006
Token of
Token ID286
Feature activation-1.49e-02

Pos logit contributions
 teaches+0.010
 lived+0.010
 happens+0.010
 Amen+0.010
igator+0.009
Neg logit contributions
arse-0.025
ge-0.024
 destination-0.024
ized-0.024
har-0.023
Token how
Token ID703
Feature activation-1.57e-02

Pos logit contributions
arse+0.036
ge+0.034
 destination+0.033
oured+0.032
har+0.032
Neg logit contributions
 lived-0.022
 teaches-0.020
igator-0.019
 lions-0.019
ile-0.018
Token Sally
Token ID25737
Feature activation-9.32e-02

Pos logit contributions
arse+0.035
 destination+0.034
ge+0.033
ishment+0.032
har+0.031
Neg logit contributions
 lived-0.024
 happens-0.023
 teaches-0.022
 Amen-0.022
igator-0.022
Token led
Token ID2957
Feature activation-1.18e-02

Pos logit contributions
arse+0.210
 destination+0.197
ge+0.196
har+0.190
ized+0.190
Neg logit contributions
 lived-0.138
 teaches-0.137
 happens-0.133
 Amen-0.130
igator-0.130
Token the
Token ID262
Feature activation+1.38e-02

Pos logit contributions
arse+0.032
ge+0.030
har+0.029
ooming+0.029
rolling+0.028
Neg logit contributions
 about-0.012
 happens-0.012
 lived-0.012
 teaches-0.011
igators-0.011
Token naughty
Token ID45128
Feature activation-1.68e-02

Pos logit contributions
 Amen+0.016
 teaches+0.016
 happens+0.016
igator+0.016
 lived+0.015
Neg logit contributions
arse-0.034
ge-0.034
 destination-0.033
ized-0.033
har-0.032
Token monster
Token ID9234
Feature activation+1.71e-02

Pos logit contributions
arse+0.040
ge+0.038
 destination+0.038
ized+0.037
har+0.037
Neg logit contributions
 Amen-0.022
 teaches-0.021
 lived-0.021
 happens-0.021
igator-0.020
Token away
Token ID1497
Feature activation-4.32e-02

Pos logit contributions
 lived+0.028
 Amen+0.028
 teaches+0.028
 happens+0.028
igator+0.027
Neg logit contributions
arse-0.034
 destination-0.032
ge-0.032
ized-0.031
har-0.031
Token it
Token ID340
Feature activation+7.43e-03

Pos logit contributions
arse+0.006
har+0.005
ge+0.005
 destination+0.005
ized+0.005
Neg logit contributions
 happens-0.002
 lived-0.002
 teaches-0.002
 Amen-0.002
icks-0.002
Token.
Token ID13
Feature activation+2.13e-03

Pos logit contributions
 teaches+0.010
 happens+0.010
 lived+0.010
icks+0.010
 Amen+0.009
Neg logit contributions
arse-0.017
har-0.016
ge-0.016
ized-0.016
 destination-0.016
Token She
Token ID1375
Feature activation-3.37e-03

Pos logit contributions
ile+0.004
igator+0.004
 Amen+0.004
igators+0.004
eer+0.003
Neg logit contributions
 destination-0.004
arse-0.004
oured-0.004
har-0.004
-0.004
Token saw
Token ID2497
Feature activation-1.05e-02

Pos logit contributions
arse+0.007
ge+0.007
 destination+0.007
har+0.006
ipping+0.006
Neg logit contributions
 lived-0.006
 teaches-0.005
 happens-0.005
 likes-0.005
 Amen-0.005
Token what
Token ID644
Feature activation-4.79e-02

Pos logit contributions
arse+0.021
ge+0.021
 destination+0.021
ized+0.020
har+0.020
Neg logit contributions
 Amen-0.017
 teaches-0.016
 lived-0.016
 happens-0.016
igator-0.016
Token Tommy
Token ID19919
Feature activation-9.29e-02

Pos logit contributions
ized+0.094
 destination+0.093
ge+0.091
arse+0.089
 completely+0.089
Neg logit contributions
 Amen-0.080
eer-0.069
igators-0.067
igator-0.066
 teaches-0.065
Token had
Token ID550
Feature activation-3.43e-02

Pos logit contributions
arse+0.272
har+0.264
 destination+0.255
ishment+0.248
 breath+0.247
Neg logit contributions
 lived-0.105
 happens-0.082
 likes-0.082
 is-0.076
 wants-0.071
Token done
Token ID1760
Feature activation-4.52e-02

Pos logit contributions
arse+0.081
 destination+0.074
har+0.073
ge+0.072
ishment+0.072
Neg logit contributions
 lived-0.054
 happens-0.049
 teaches-0.048
icks-0.045
 about-0.045
Token.
Token ID13
Feature activation-5.85e-03

Pos logit contributions
arse+0.096
 destination+0.090
ge+0.089
har+0.088
ized+0.087
Neg logit contributions
 teaches-0.070
 lived-0.070
 happens-0.069
 Amen-0.069
igator-0.065
Token She
Token ID1375
Feature activation-8.34e-03

Pos logit contributions
arse+0.014
 destination+0.013
ge+0.013
ized+0.013
har+0.013
Neg logit contributions
 Amen-0.008
 teaches-0.007
igator-0.007
 happens-0.007
 lived-0.007
Token cried
Token ID16896
Feature activation-8.73e-03

Pos logit contributions
arse+0.017
 destination+0.016
ge+0.016
ized+0.015
har+0.015
Neg logit contributions
 lived-0.014
 teaches-0.014
 Amen-0.014
 happens-0.014
igator-0.013
TokenTim
Token ID14967
Feature activation-1.14e-02

Pos logit contributions
 lived+0.005
 teaches+0.004
 Amen+0.004
igator+0.004
igators+0.004
Neg logit contributions
arse-0.007
 destination-0.006
oured-0.006
ge-0.006
ipping-0.006
Tokenmy
Token ID1820
Feature activation-4.86e-03

Pos logit contributions
arse+0.024
 destination+0.023
ge+0.023
ized+0.022
har+0.022
Neg logit contributions
 lived-0.018
 teaches-0.017
 Amen-0.017
igator-0.016
ile-0.016
Token thought
Token ID1807
Feature activation-4.81e-03

Pos logit contributions
arse+0.010
ge+0.009
 destination+0.009
har+0.009
ized+0.009
Neg logit contributions
 lived-0.008
 teaches-0.008
 Amen-0.008
 happens-0.008
igator-0.007
Token about
Token ID546
Feature activation-3.26e-02

Pos logit contributions
arse+0.013
ge+0.013
 destination+0.013
ized+0.013
har+0.012
Neg logit contributions
 Amen-0.004
 teaches-0.004
 lived-0.004
 happens-0.004
igator-0.004
Token what
Token ID644
Feature activation-3.99e-02

Pos logit contributions
arse+0.064
ge+0.062
 destination+0.062
ized+0.060
har+0.058
Neg logit contributions
 Amen-0.054
 teaches-0.053
 happens-0.052
igator-0.051
 lived-0.051
Token Billy
Token ID15890
Feature activation-9.26e-02

Pos logit contributions
arse+0.054
 destination+0.053
ge+0.052
ized+0.052
har+0.047
Neg logit contributions
 Amen-0.088
icks-0.084
igators-0.084
igator-0.083
eer-0.083
Token said
Token ID531
Feature activation-2.82e-02

Pos logit contributions
arse+0.197
ge+0.190
 destination+0.189
ized+0.186
har+0.181
Neg logit contributions
 Amen-0.140
 teaches-0.134
 happens-0.131
 lived-0.130
igator-0.128
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
arse+0.063
har+0.060
 destination+0.060
ge+0.059
ized+0.057
Neg logit contributions
 lived-0.041
 teaches-0.040
 happens-0.039
 Amen-0.038
igator-0.036
Token decided
Token ID3066
Feature activation+1.37e-02

Pos logit contributions
arse+0.028
 destination+0.027
ge+0.027
ized+0.027
har+0.026
Neg logit contributions
 Amen-0.024
 teaches-0.024
 happens-0.023
 lived-0.023
igator-0.023
Token to
Token ID284
Feature activation+4.87e-03

Pos logit contributions
 happens+0.017
 teaches+0.016
 lived+0.016
 Amen+0.016
 vitamins+0.014
Neg logit contributions
arse-0.034
ge-0.032
 destination-0.032
har-0.032
ized-0.031
Token start
Token ID923
Feature activation-1.92e-02

Pos logit contributions
 teaches+0.009
 lived+0.009
 Amen+0.009
igator+0.008
 happens+0.008
Neg logit contributions
arse-0.009
ge-0.009
ized-0.009
har-0.008
 destination-0.008
Token doing
Token ID1804
Feature activation+8.01e-03

Pos logit contributions
 teaches+0.003
 lived+0.003
 Amen+0.003
 happens+0.003
igators+0.003
Neg logit contributions
arse-0.004
 destination-0.004
ge-0.004
har-0.004
ized-0.004
Token anything
Token ID1997
Feature activation-9.77e-03

Pos logit contributions
 teaches+0.012
 happens+0.011
 lived+0.011
 Amen+0.011
 vitamins+0.010
Neg logit contributions
arse-0.018
ge-0.017
har-0.017
 destination-0.017
ized-0.017
Token he
Token ID339
Feature activation-4.31e-02

Pos logit contributions
arse+0.019
ge+0.019
 destination+0.019
ized+0.018
har+0.018
Neg logit contributions
 Amen-0.016
 teaches-0.015
 lived-0.015
 happens-0.015
igator-0.015
Token set
Token ID900
Feature activation-1.86e-02

Pos logit contributions
arse+0.099
 destination+0.094
ge+0.094
ized+0.091
har+0.091
Neg logit contributions
 teaches-0.059
 Amen-0.059
 lived-0.059
 happens-0.058
igator-0.055
Token his
Token ID465
Feature activation-3.07e-02

Pos logit contributions
arse+0.041
har+0.038
 destination+0.038
ge+0.038
oured+0.038
Neg logit contributions
 teaches-0.026
 lived-0.026
 happens-0.026
 Amen-0.025
igator-0.025
Token mind
Token ID2000
Feature activation-9.24e-02

Pos logit contributions
arse+0.063
ge+0.059
ized+0.059
har+0.057
 destination+0.056
Neg logit contributions
 teaches-0.049
 lived-0.049
 Amen-0.048
 happens-0.048
eer-0.045
Token to
Token ID284
Feature activation-1.71e-02

Pos logit contributions
arse+0.187
 destination+0.178
ge+0.178
ized+0.174
har+0.173
Neg logit contributions
 Amen-0.148
 teaches-0.145
 lived-0.144
 happens-0.143
igator-0.139
Token!
Token ID0
Feature activation-1.05e-02

Pos logit contributions
arse+0.037
har+0.034
 destination+0.034
ge+0.034
oured+0.034
Neg logit contributions
 lived-0.027
 teaches-0.026
 Amen-0.026
 happens-0.025
igator-0.024
Token<|endoftext|>
Token ID50256
Feature activation-3.56e-02

Pos logit contributions
arse+0.022
 destination+0.021
ge+0.021
har+0.020
ized+0.020
Neg logit contributions
 happens-0.016
igator-0.015
 Amen-0.015
igators-0.015
 teaches-0.015
TokenOnce
Token ID7454
Feature activation+6.56e-03

Pos logit contributions
arse+0.080
 destination+0.077
ge+0.077
ized+0.075
har+0.074
Neg logit contributions
 Amen-0.049
 teaches-0.048
 happens-0.047
 lived-0.047
igator-0.046
Token upon
Token ID2402
Feature activation+7.28e-03

Pos logit contributions
 Amen+0.017
 teaches+0.017
 happens+0.016
 lived+0.016
igators+0.016
Neg logit contributions
arse-0.007
ge-0.007
 destination-0.006
ized-0.006
har-0.006
Tokenâ
Token ID22940
Feature activation-1.52e-02

Pos logit contributions
 Amen+0.034
icks+0.030
igator+0.030
 teaches+0.030
 happens+0.030
Neg logit contributions
arse-0.052
 destination-0.051
ge-0.049
ized-0.049
 forth-0.047
Token
Token ID26391
Feature activation-5.70e-03

Pos logit contributions
arse+0.032
ge+0.030
 destination+0.030
ized+0.029
har+0.029
Neg logit contributions
 Amen-0.024
 teaches-0.024
 lived-0.023
 happens-0.023
igator-0.023
Token
Token ID8151
Feature activation+2.64e-02

Pos logit contributions
arse+0.012
ge+0.011
 destination+0.011
ized+0.011
har+0.010
Neg logit contributions
 Amen-0.009
 lived-0.009
 teaches-0.009
 happens-0.009
icks-0.008
Tokens
Token ID82
Feature activation+1.02e-02

Pos logit contributions
 Amen+0.044
 teaches+0.042
 happens+0.042
 lived+0.042
igator+0.041
Neg logit contributions
arse-0.052
ge-0.050
 destination-0.050
ized-0.048
har-0.048
Token how
Token ID703
Feature activation-2.25e-02

Pos logit contributions
 Amen+0.016
 happens+0.015
 teaches+0.015
igator+0.015
igators+0.015
Neg logit contributions
arse-0.021
 destination-0.021
ge-0.020
har-0.019
ized-0.019
Token Spot
Token ID15899
Feature activation-9.16e-02

Pos logit contributions
arse+0.047
ge+0.046
 destination+0.045
ized+0.045
har+0.044
Neg logit contributions
 Amen-0.034
 teaches-0.033
 happens-0.032
 lived-0.032
igator-0.031
Token explored
Token ID18782
Feature activation-2.84e-02

Pos logit contributions
arse+0.218
ge+0.206
 destination+0.204
har+0.201
ized+0.201
Neg logit contributions
 lived-0.123
 teaches-0.121
 happens-0.117
 Amen-0.114
igator-0.111
Token the
Token ID262
Feature activation-1.40e-02

Pos logit contributions
arse+0.061
ge+0.056
har+0.055
 destination+0.055
ized+0.054
Neg logit contributions
 lived-0.045
 teaches-0.044
 happens-0.043
 Amen-0.043
igators-0.042
Token castle
Token ID16669
Feature activation+2.17e-02

Pos logit contributions
arse+0.031
ge+0.030
ized+0.028
ishment+0.027
oured+0.027
Neg logit contributions
 lived-0.021
 lions-0.021
 teaches-0.021
 happens-0.020
igator-0.020
Token that
Token ID326
Feature activation-1.68e-02

Pos logit contributions
 Amen+0.036
 teaches+0.036
 lived+0.036
 happens+0.035
igator+0.034
Neg logit contributions
arse-0.043
ge-0.040
 destination-0.040
ized-0.039
har-0.039
Token day
Token ID1110
Feature activation-2.71e-02

Pos logit contributions
arse+0.037
ge+0.036
 destination+0.035
ized+0.035
har+0.035
Neg logit contributions
 Amen-0.024
 lived-0.024
 teaches-0.024
 happens-0.023
igator-0.023
Token do
Token ID466
Feature activation-1.25e-02

Pos logit contributions
arse+0.038
ge+0.035
har+0.034
 destination+0.034
ishment+0.034
Neg logit contributions
 lived-0.027
 teaches-0.027
 Amen-0.027
igator-0.025
 happens-0.025
Token anything
Token ID1997
Feature activation-1.90e-02

Pos logit contributions
arse+0.029
har+0.027
 destination+0.027
ge+0.026
ishment+0.026
Neg logit contributions
 lived-0.018
 teaches-0.018
 happens-0.018
 Amen-0.016
 about-0.016
Token she
Token ID673
Feature activation-5.09e-02

Pos logit contributions
arse+0.037
ge+0.036
 destination+0.036
ized+0.035
har+0.034
Neg logit contributions
 Amen-0.032
 teaches-0.031
 lived-0.031
 happens-0.030
igator-0.030
Token set
Token ID900
Feature activation-1.69e-02

Pos logit contributions
arse+0.121
 destination+0.117
ge+0.116
ized+0.114
har+0.113
Neg logit contributions
 Amen-0.065
 teaches-0.062
 happens-0.061
 lived-0.061
igator-0.059
Token her
Token ID607
Feature activation-1.70e-02

Pos logit contributions
arse+0.042
oured+0.039
har+0.039
 sudden+0.039
 destination+0.038
Neg logit contributions
 about-0.020
 teaches-0.019
 lived-0.019
 happens-0.018
icks-0.018
Token mind
Token ID2000
Feature activation-9.14e-02

Pos logit contributions
arse+0.035
ge+0.033
ized+0.032
har+0.032
oured+0.031
Neg logit contributions
 teaches-0.028
 lived-0.028
 happens-0.027
 Amen-0.026
igator-0.026
Token to
Token ID284
Feature activation-1.68e-02

Pos logit contributions
arse+0.198
 destination+0.186
ge+0.184
har+0.184
ized+0.183
Neg logit contributions
 teaches-0.135
 lived-0.133
 happens-0.132
 Amen-0.131
igator-0.129
Token.
Token ID13
Feature activation-8.25e-03

Pos logit contributions
arse+0.038
har+0.035
oured+0.035
ge+0.035
 destination+0.034
Neg logit contributions
 lived-0.026
 teaches-0.025
 Amen-0.023
 happens-0.023
igator-0.022
Token One
Token ID1881
Feature activation+1.18e-02

Pos logit contributions
 destination+0.016
arse+0.016
ge+0.015
ized+0.015
har+0.015
Neg logit contributions
 Amen-0.013
 happens-0.013
 teaches-0.013
igator-0.013
ile-0.013
Token day
Token ID1110
Feature activation-6.26e-03

Pos logit contributions
 Amen+0.014
icks+0.013
 happens+0.013
 teaches+0.013
 vitamins+0.013
Neg logit contributions
arse-0.029
ge-0.028
 destination-0.028
ized-0.028
har-0.027
Token,
Token ID11
Feature activation-2.56e-03

Pos logit contributions
arse+0.013
ge+0.012
 destination+0.012
ized+0.012
ipping+0.012
Neg logit contributions
 Amen-0.010
 teaches-0.010
igators-0.010
igator-0.009
eer-0.009
TokenTim
Token ID14967
Feature activation-8.88e-03

Pos logit contributions
 lived+0.007
 teaches+0.007
 Amen+0.006
 happens+0.006
igator+0.006
Neg logit contributions
arse-0.008
 destination-0.008
 breath-0.007
ishment-0.007
ge-0.007
Tokenmy
Token ID1820
Feature activation-1.64e-03

Pos logit contributions
arse+0.020
 destination+0.019
ge+0.019
ized+0.018
har+0.018
Neg logit contributions
 lived-0.013
 teaches-0.013
 likes-0.012
 Amen-0.012
igator-0.012
Token thought
Token ID1807
Feature activation-1.68e-03

Pos logit contributions
arse+0.003
ge+0.003
 destination+0.003
har+0.003
ized+0.003
Neg logit contributions
 lived-0.003
 teaches-0.003
 Amen-0.003
 happens-0.003
igator-0.002
Token about
Token ID546
Feature activation-3.00e-02

Pos logit contributions
arse+0.004
ge+0.004
 destination+0.004
ized+0.004
har+0.004
Neg logit contributions
 Amen-0.002
 teaches-0.002
 lived-0.002
 happens-0.002
igator-0.002
Token what
Token ID644
Feature activation-3.81e-02

Pos logit contributions
ge+0.052
arse+0.050
 destination+0.050
ized+0.050
 forth+0.047
Neg logit contributions
 Amen-0.054
 teaches-0.052
igator-0.052
 happens-0.052
icks-0.051
Token Tommy
Token ID19919
Feature activation-9.12e-02

Pos logit contributions
ge+0.066
 destination+0.065
arse+0.065
ized+0.064
har+0.060
Neg logit contributions
 Amen-0.070
icks-0.066
igators-0.065
igator-0.065
ile-0.064
Token said
Token ID531
Feature activation-3.00e-02

Pos logit contributions
arse+0.214
 destination+0.203
har+0.203
ge+0.200
ized+0.194
Neg logit contributions
 lived-0.128
 happens-0.123
 teaches-0.123
 Amen-0.117
igator-0.114
Token and
Token ID290
Feature activation-1.44e-02

Pos logit contributions
arse+0.060
ge+0.058
 destination+0.057
ized+0.056
har+0.055
Neg logit contributions
 Amen-0.049
 teaches-0.048
 lived-0.047
 happens-0.047
igator-0.046
Token then
Token ID788
Feature activation-1.34e-02

Pos logit contributions
arse+0.030
ge+0.029
 destination+0.028
har+0.028
ized+0.027
Neg logit contributions
 Amen-0.023
 lived-0.023
 teaches-0.023
 happens-0.022
igator-0.021
Token said
Token ID531
Feature activation-3.15e-03

Pos logit contributions
arse+0.026
 destination+0.025
ge+0.025
ized+0.025
har+0.024
Neg logit contributions
 Amen-0.022
 teaches-0.021
 lived-0.021
 happens-0.021
igator-0.021
Token,
Token ID11
Feature activation+9.53e-03

Pos logit contributions
arse+0.005
 destination+0.005
ized+0.005
ge+0.005
ipping+0.005
Neg logit contributions
igators-0.006
ile-0.006
eer-0.006
 Amen-0.006
 teaches-0.006
Token home
Token ID1363
Feature activation-2.91e-03

Pos logit contributions
arse+0.022
ized+0.021
ge+0.021
 destination+0.021
ishment+0.020
Neg logit contributions
 lived-0.018
 Amen-0.018
igator-0.018
icks-0.018
 happens-0.018
Token,
Token ID11
Feature activation-3.56e-03

Pos logit contributions
arse+0.009
ishment+0.009
aults+0.008
 destination+0.008
oured+0.008
Neg logit contributions
 lived-0.002
 about-0.002
igator-0.002
 happens-0.002
 is-0.001
Token he
Token ID339
Feature activation+5.15e-03

Pos logit contributions
arse+0.010
 destination+0.009
ge+0.009
har+0.009
oured+0.009
Neg logit contributions
 lived-0.004
 Amen-0.004
igator-0.003
 happens-0.003
 teaches-0.003
Token saw
Token ID2497
Feature activation-5.38e-03

Pos logit contributions
 lived+0.008
 teaches+0.008
 happens+0.008
 Amen+0.007
igator+0.007
Neg logit contributions
arse-0.012
ge-0.011
 destination-0.010
ized-0.010
ishment-0.010
Token what
Token ID644
Feature activation-4.68e-02

Pos logit contributions
arse+0.011
ge+0.011
 destination+0.011
ized+0.011
har+0.010
Neg logit contributions
 Amen-0.008
 teaches-0.008
 happens-0.008
 lived-0.008
igators-0.008
Token Spot
Token ID15899
Feature activation-9.07e-02

Pos logit contributions
ized+0.077
ge+0.074
 completely+0.073
 sudden+0.072
 destination+0.071
Neg logit contributions
 Amen-0.089
icks-0.081
 teaches-0.078
 vitamins-0.077
eer-0.077
Token had
Token ID550
Feature activation-3.59e-02

Pos logit contributions
arse+0.245
har+0.236
 destination+0.232
ge+0.230
ishment+0.226
Neg logit contributions
 lived-0.111
 happens-0.095
 teaches-0.089
 likes-0.086
 lions-0.081
Token done
Token ID1760
Feature activation-4.51e-02

Pos logit contributions
arse+0.083
ishment+0.078
 destination+0.077
har+0.077
ge+0.076
Neg logit contributions
 lived-0.057
 happens-0.051
 teaches-0.050
 about-0.048
icks-0.047
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
arse+0.095
 destination+0.090
ge+0.088
har+0.087
ized+0.086
Neg logit contributions
 teaches-0.070
 lived-0.070
 Amen-0.070
 happens-0.069
igator-0.066
Token said
Token ID531
Feature activation-4.89e-03

Pos logit contributions
arse+0.039
ge+0.037
 destination+0.037
ized+0.036
har+0.036
Neg logit contributions
 Amen-0.031
 teaches-0.031
 happens-0.030
 lived-0.030
igator-0.029
Token,
Token ID11
Feature activation+5.81e-03

Pos logit contributions
ized+0.007
 breath+0.006
arse+0.006
ipping+0.006
 destination+0.006
Neg logit contributions
eer-0.010
 Amen-0.010
igators-0.010
 teaches-0.010
ile-0.010
Token woods
Token ID16479
Feature activation+1.69e-02

Pos logit contributions
 lived+0.009
 lions+0.009
 Amen+0.009
igators+0.009
igator+0.009
Neg logit contributions
ge-0.014
arse-0.014
har-0.012
oured-0.012
ized-0.012
Token.
Token ID13
Feature activation-4.71e-04

Pos logit contributions
 lived+0.023
 teaches+0.023
 Amen+0.022
igator+0.022
igators+0.022
Neg logit contributions
arse-0.039
 destination-0.036
har-0.036
ge-0.035
oured-0.035
Token Jack
Token ID3619
Feature activation-4.12e-02

Pos logit contributions
arse+0.001
ge+0.001
 destination+0.001
ized+0.001
har+0.001
Neg logit contributions
 Amen-0.001
 teaches-0.001
 lived-0.001
 happens-0.001
igator-0.001
Token had
Token ID550
Feature activation-1.83e-02

Pos logit contributions
arse+0.088
ge+0.084
 destination+0.084
ized+0.081
har+0.081
Neg logit contributions
 Amen-0.062
 lived-0.061
 teaches-0.061
 happens-0.060
igator-0.058
Token his
Token ID465
Feature activation-4.80e-02

Pos logit contributions
arse+0.045
ge+0.043
 destination+0.042
ized+0.042
har+0.041
Neg logit contributions
 lived-0.024
 teaches-0.022
 happens-0.022
 Amen-0.022
 lions-0.021
Token rake
Token ID47885
Feature activation-9.01e-02

Pos logit contributions
arse+0.094
ge+0.089
 destination+0.089
ized+0.087
har+0.086
Neg logit contributions
 Amen-0.081
 teaches-0.080
 lived-0.079
 happens-0.079
igator-0.076
Token with
Token ID351
Feature activation-6.02e-03

Pos logit contributions
arse+0.208
har+0.192
ge+0.189
 destination+0.188
ishment+0.184
Neg logit contributions
 lived-0.129
igator-0.124
 teaches-0.121
 happens-0.117
igators-0.116
Token him
Token ID683
Feature activation-1.28e-02

Pos logit contributions
arse+0.015
har+0.014
ge+0.014
 destination+0.014
ized+0.014
Neg logit contributions
 lived-0.007
igator-0.007
 teaches-0.007
 vitamins-0.007
 happens-0.007
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
arse+0.029
har+0.027
ge+0.027
 destination+0.027
ized+0.026
Neg logit contributions
 lived-0.019
 teaches-0.017
 happens-0.017
igator-0.017
 Amen-0.017
Token "
Token ID366
Feature activation-4.35e-02

Pos logit contributions
arse+0.030
ge+0.029
 destination+0.028
ized+0.028
har+0.028
Neg logit contributions
 Amen-0.013
 teaches-0.013
 lived-0.013
 happens-0.013
igator-0.012
TokenWhat
Token ID2061
Feature activation-3.72e-02

Pos logit contributions
arse+0.081
ge+0.077
 destination+0.076
ized+0.075
har+0.074
Neg logit contributions
 Amen-0.077
 teaches-0.076
 happens-0.076
 lived-0.075
icks-0.073
Token open
Token ID1280
Feature activation-6.89e-03

Pos logit contributions
arse+0.021
ge+0.020
 destination+0.020
ized+0.019
har+0.019
Neg logit contributions
 Amen-0.016
 teaches-0.016
 lived-0.016
 happens-0.016
igator-0.015
Token grass
Token ID8701
Feature activation+4.10e-02

Pos logit contributions
arse+0.012
ge+0.012
ized+0.011
har+0.011
 destination+0.011
Neg logit contributions
 lived-0.013
 teaches-0.013
 Amen-0.013
igator-0.013
 happens-0.013
Tokeny
Token ID88
Feature activation+5.52e-02

Pos logit contributions
 lived+0.058
 lions+0.054
 happens+0.053
igator+0.053
 teaches+0.053
Neg logit contributions
arse-0.093
har-0.088
ge-0.087
 destination-0.085
ized-0.083
Token area
Token ID1989
Feature activation+2.44e-02

Pos logit contributions
 Amen+0.102
 teaches+0.098
 happens+0.095
 lived+0.095
igator+0.093
Neg logit contributions
arse-0.099
 destination-0.097
ge-0.095
ized-0.094
har-0.092
Token where
Token ID810
Feature activation-1.13e-02

Pos logit contributions
 lived+0.034
 teaches+0.033
igator+0.032
 happens+0.031
 about+0.031
Neg logit contributions
arse-0.057
ge-0.052
 destination-0.051
har-0.051
ishment-0.050
Token Sally
Token ID25737
Feature activation-8.99e-02

Pos logit contributions
arse+0.023
ge+0.022
 destination+0.022
har+0.021
ized+0.021
Neg logit contributions
 Amen-0.018
 lived-0.017
 teaches-0.017
 happens-0.017
igator-0.017
Token could
Token ID714
Feature activation-1.81e-02

Pos logit contributions
arse+0.221
 destination+0.207
har+0.206
ge+0.205
ized+0.202
Neg logit contributions
 lived-0.120
 happens-0.111
 teaches-0.109
igator-0.103
 likes-0.101
Token do
Token ID466
Feature activation-8.09e-03

Pos logit contributions
arse+0.034
ge+0.033
 destination+0.032
ized+0.032
har+0.031
Neg logit contributions
 Amen-0.032
 lived-0.032
 teaches-0.032
 happens-0.031
igator-0.030
Token her
Token ID607
Feature activation-2.56e-02

Pos logit contributions
arse+0.018
 destination+0.017
ge+0.017
har+0.017
ized+0.016
Neg logit contributions
 lived-0.012
 happens-0.012
 teaches-0.012
 Amen-0.011
icks-0.011
Token favorite
Token ID4004
Feature activation-9.79e-03

Pos logit contributions
arse+0.051
ge+0.049
ized+0.047
har+0.047
 destination+0.046
Neg logit contributions
 teaches-0.043
 Amen-0.043
 happens-0.043
 lived-0.042
icks-0.040
Token dance
Token ID9280
Feature activation-2.55e-02

Pos logit contributions
 destination+0.018
arse+0.018
ge+0.017
ized+0.017
har+0.017
Neg logit contributions
 Amen-0.017
 teaches-0.017
 happens-0.017
 lived-0.016
icks-0.016
Token had
Token ID550
Feature activation-1.14e-02

Pos logit contributions
arse+0.059
ge+0.057
 destination+0.056
har+0.055
ishment+0.053
Neg logit contributions
 lived-0.040
 happens-0.036
 teaches-0.036
 Amen-0.034
 likes-0.033
Token a
Token ID257
Feature activation-3.34e-02

Pos logit contributions
arse+0.023
ge+0.022
 destination+0.021
ized+0.021
ishment+0.021
Neg logit contributions
 lived-0.019
 happens-0.019
 teaches-0.019
 Amen-0.018
igator-0.018
Token metal
Token ID6147
Feature activation+2.28e-02

Pos logit contributions
arse+0.073
ge+0.070
ized+0.069
 destination+0.068
har+0.067
Neg logit contributions
 Amen-0.050
 lived-0.048
 teaches-0.048
 happens-0.048
igator-0.046
Token truck
Token ID7779
Feature activation-2.42e-02

Pos logit contributions
 Amen+0.038
 lived+0.038
 teaches+0.038
 happens+0.037
igator+0.037
Neg logit contributions
arse-0.045
ge-0.043
 destination-0.042
ized-0.041
har-0.041
Token which
Token ID543
Feature activation-4.23e-02

Pos logit contributions
arse+0.060
 destination+0.055
ge+0.055
har+0.055
ized+0.054
Neg logit contributions
 lived-0.030
igator-0.029
 teaches-0.028
 lions-0.027
 Amen-0.027
Token Max
Token ID5436
Feature activation-8.98e-02

Pos logit contributions
arse+0.082
 destination+0.081
ized+0.079
ge+0.079
har+0.075
Neg logit contributions
 Amen-0.071
 teaches-0.063
icks-0.063
 lions-0.061
 happens-0.061
Token didn
Token ID1422
Feature activation-2.69e-02

Pos logit contributions
arse+0.221
har+0.208
ge+0.208
 destination+0.205
ized+0.202
Neg logit contributions
 lived-0.127
 happens-0.108
 teaches-0.106
 likes-0.104
 Amen-0.102
Token't
Token ID470
Feature activation-1.36e-02

Pos logit contributions
arse+0.051
ge+0.049
 destination+0.048
ized+0.047
har+0.046
Neg logit contributions
 Amen-0.047
 teaches-0.046
 lived-0.046
 happens-0.045
igator-0.044
Token have
Token ID423
Feature activation-4.27e-02

Pos logit contributions
arse+0.029
ge+0.029
 destination+0.028
ized+0.027
har+0.027
Neg logit contributions
 Amen-0.020
 teaches-0.020
 happens-0.019
 lived-0.019
icks-0.018
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
arse+0.082
ge+0.080
 destination+0.079
ized+0.075
har+0.075
Neg logit contributions
 Amen-0.076
 teaches-0.071
 lived-0.069
 happens-0.068
igator-0.068
Token He
Token ID679
Feature activation+1.99e-03

Pos logit contributions
arse+0.042
ge+0.042
har+0.040
 destination+0.040
ized+0.040
Neg logit contributions
 teaches-0.013
 lived-0.013
 Amen-0.013
 happens-0.013
igator-0.011
Token look
Token ID804
Feature activation-2.63e-02

Pos logit contributions
arse+0.007
ized+0.007
ipping+0.006
oured+0.006
 destination+0.006
Neg logit contributions
 teaches-0.003
 Amen-0.003
 happens-0.003
 lived-0.003
igator-0.003
Token as
Token ID355
Feature activation-2.97e-02

Pos logit contributions
arse+0.063
ipping+0.058
 destination+0.057
har+0.057
ge+0.057
Neg logit contributions
 lived-0.033
 happens-0.033
 teaches-0.033
igator-0.032
 about-0.032
Token good
Token ID922
Feature activation+3.39e-03

Pos logit contributions
arse+0.055
ge+0.054
 destination+0.054
ized+0.053
har+0.052
Neg logit contributions
 Amen-0.053
 teaches-0.049
 lived-0.049
 happens-0.048
ile-0.048
Token as
Token ID355
Feature activation-1.55e-02

Pos logit contributions
 happens+0.006
 teaches+0.006
 Amen+0.006
 lived+0.006
 vitamins+0.006
Neg logit contributions
arse-0.006
har-0.006
 destination-0.006
ge-0.006
ized-0.006
Token what
Token ID644
Feature activation-3.11e-02

Pos logit contributions
arse+0.035
ge+0.033
 destination+0.033
ized+0.032
har+0.032
Neg logit contributions
 Amen-0.021
 teaches-0.021
 lived-0.021
 happens-0.021
igator-0.020
Token Billy
Token ID15890
Feature activation-8.96e-02

Pos logit contributions
 destination+0.059
ge+0.057
arse+0.055
ized+0.055
har+0.053
Neg logit contributions
 Amen-0.054
ile-0.049
icks-0.048
 teaches-0.048
igator-0.048
Token had
Token ID550
Feature activation-2.94e-02

Pos logit contributions
arse+0.227
 destination+0.213
har+0.212
ge+0.208
ized+0.206
Neg logit contributions
 lived-0.115
 happens-0.107
 teaches-0.105
igator-0.100
eer-0.098
Token made
Token ID925
Feature activation-2.02e-02

Pos logit contributions
arse+0.068
har+0.063
 destination+0.061
ishment+0.060
 forth+0.060
Neg logit contributions
 lived-0.044
 teaches-0.043
 happens-0.042
 vitamins-0.038
 about-0.038
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
arse+0.046
har+0.043
 destination+0.042
ge+0.042
ized+0.042
Neg logit contributions
 teaches-0.029
 lived-0.028
 happens-0.028
 Amen-0.027
igator-0.027
Token
Token ID220
Feature activation-4.55e-02

Pos logit contributions
arse+0.041
ge+0.040
har+0.038
aults+0.038
 destination+0.037
Neg logit contributions
 lived-0.015
 Amen-0.014
 teaches-0.013
 happens-0.012
 And-0.012
Token
Token ID198
Feature activation-1.50e-02

Pos logit contributions
arse+0.115
 destination+0.110
ge+0.110
ized+0.108
har+0.108
Neg logit contributions
 Amen-0.050
 teaches-0.049
 lived-0.049
 happens-0.048
igator-0.045
Token liked
Token ID8288
Feature activation-2.95e-02

Pos logit contributions
 Amen+0.005
icks+0.005
 happens+0.005
igators+0.005
 teaches+0.005
Neg logit contributions
 destination-0.008
ge-0.008
ized-0.008
arse-0.008
har-0.008
Token to
Token ID284
Feature activation-1.61e-02

Pos logit contributions
ge+0.053
 destination+0.053
arse+0.053
ized+0.052
 breath+0.050
Neg logit contributions
 Amen-0.052
 teaches-0.051
 happens-0.049
 lived-0.049
icks-0.049
Token do
Token ID466
Feature activation+2.72e-03

Pos logit contributions
arse+0.029
ge+0.028
 destination+0.027
ized+0.027
har+0.027
Neg logit contributions
 Amen-0.030
 teaches-0.029
 lived-0.029
 happens-0.029
igator-0.028
Token what
Token ID644
Feature activation-4.82e-02

Pos logit contributions
 lived+0.004
 teaches+0.004
 happens+0.004
 Amen+0.004
igators+0.004
Neg logit contributions
arse-0.006
 destination-0.006
ge-0.006
har-0.006
ized-0.005
Token his
Token ID465
Feature activation-5.97e-02

Pos logit contributions
 destination+0.084
 forth+0.079
ized+0.078
ge+0.078
 completely+0.073
Neg logit contributions
 Amen-0.087
icks-0.084
ile-0.082
eer-0.079
py-0.078
Token Dad
Token ID17415
Feature activation-8.96e-02

Pos logit contributions
arse+0.117
ge+0.111
 destination+0.108
ized+0.107
har+0.106
Neg logit contributions
 Amen-0.103
 teaches-0.101
 lived-0.101
 happens-0.099
igator-0.097
Token did
Token ID750
Feature activation-2.96e-02

Pos logit contributions
ge+0.177
arse+0.177
 destination+0.175
ized+0.173
har+0.165
Neg logit contributions
 Amen-0.143
icks-0.132
 happens-0.131
 teaches-0.130
igators-0.128
Token,
Token ID11
Feature activation-3.39e-02

Pos logit contributions
arse+0.059
ge+0.057
 destination+0.057
ized+0.055
har+0.055
Neg logit contributions
 Amen-0.048
 teaches-0.047
 lived-0.047
 happens-0.047
igator-0.045
Token so
Token ID523
Feature activation-1.61e-03

Pos logit contributions
arse+0.065
 destination+0.061
ge+0.061
ized+0.059
har+0.059
Neg logit contributions
 Amen-0.059
 teaches-0.058
 lived-0.058
 happens-0.057
igator-0.056
Token every
Token ID790
Feature activation+7.20e-03

Pos logit contributions
 destination+0.003
arse+0.003
ge+0.003
ized+0.003
 forth+0.003
Neg logit contributions
 teaches-0.002
 Amen-0.002
 happens-0.002
icks-0.002
ile-0.002
Token day
Token ID1110
Feature activation-1.34e-02

Pos logit contributions
 Amen+0.008
icks+0.008
 teaches+0.008
 lived+0.008
 happens+0.007
Neg logit contributions
 destination-0.017
arse-0.017
ge-0.017
ized-0.017
 breath-0.016
Token's
Token ID338
Feature activation+6.63e-03

Pos logit contributions
arse+0.095
ge+0.090
har+0.090
 destination+0.089
ized+0.088
Neg logit contributions
 lived-0.066
 happens-0.065
 teaches-0.064
 Amen-0.062
igator-0.059
Token family
Token ID1641
Feature activation-2.05e-02

Pos logit contributions
 Amen+0.012
 teaches+0.011
 lived+0.011
 happens+0.011
igator+0.011
Neg logit contributions
arse-0.013
ge-0.012
 destination-0.012
ized-0.012
har-0.012
Token saw
Token ID2497
Feature activation-5.72e-03

Pos logit contributions
arse+0.050
har+0.048
ge+0.048
ishment+0.047
 destination+0.046
Neg logit contributions
 lived-0.028
 teaches-0.026
 happens-0.025
 Amen-0.023
igator-0.023
Token how
Token ID703
Feature activation-2.21e-02

Pos logit contributions
arse+0.012
ge+0.012
har+0.012
 destination+0.012
ized+0.011
Neg logit contributions
 lived-0.009
 teaches-0.008
 Amen-0.008
 happens-0.008
igator-0.008
Token much
Token ID881
Feature activation-3.84e-02

Pos logit contributions
arse+0.049
 destination+0.047
ge+0.046
har+0.045
ishment+0.044
Neg logit contributions
 lived-0.032
 Amen-0.032
 teaches-0.032
 happens-0.032
igator-0.031
Token Billy
Token ID15890
Feature activation-8.94e-02

Pos logit contributions
arse+0.076
 destination+0.074
ized+0.074
ge+0.073
ishment+0.069
Neg logit contributions
 Amen-0.063
 teaches-0.058
 lived-0.058
icks-0.057
 happens-0.057
Token and
Token ID290
Feature activation-3.50e-02

Pos logit contributions
arse+0.235
 destination+0.224
har+0.222
ge+0.217
ishment+0.214
Neg logit contributions
 lived-0.111
 teaches-0.098
 happens-0.096
igator-0.091
 likes-0.091
Token Harriet
Token ID48805
Feature activation-4.70e-02

Pos logit contributions
arse+0.063
 destination+0.060
ge+0.060
ized+0.059
har+0.057
Neg logit contributions
 Amen-0.064
 teaches-0.062
 happens-0.062
 lived-0.061
igator-0.060
Token loved
Token ID6151
Feature activation-3.47e-02

Pos logit contributions
arse+0.117
har+0.113
ge+0.111
 destination+0.111
ipping+0.110
Neg logit contributions
 lived-0.068
 teaches-0.058
 happens-0.055
 Amen-0.054
igator-0.051
Token each
Token ID1123
Feature activation+1.10e-02

Pos logit contributions
arse+0.076
 destination+0.072
ge+0.071
har+0.070
ized+0.069
Neg logit contributions
 Amen-0.051
 lived-0.051
 teaches-0.050
 happens-0.050
igator-0.048
Token other
Token ID584
Feature activation-7.49e-03

Pos logit contributions
 teaches+0.016
igator+0.016
 lived+0.015
 Amen+0.015
igators+0.015
Neg logit contributions
arse-0.025
 destination-0.024
har-0.024
ge-0.024
ized-0.023
Token.
Token ID13
Feature activation+1.83e-03

Pos logit contributions
 Amen+0.005
 teaches+0.005
 happens+0.005
 lived+0.005
igator+0.005
Neg logit contributions
arse-0.005
ge-0.005
 destination-0.005
ized-0.005
har-0.004
Token But
Token ID887
Feature activation-1.96e-02

Pos logit contributions
ile+0.003
 happens+0.003
igator+0.003
eer+0.003
 gor+0.003
Neg logit contributions
arse-0.004
 destination-0.004
har-0.004
ishment-0.004
oured-0.004
Token be
Token ID307
Feature activation-9.73e-03

Pos logit contributions
arse+0.042
 destination+0.041
ge+0.041
ized+0.040
har+0.039
Neg logit contributions
 Amen-0.029
 teaches-0.028
 happens-0.028
 lived-0.028
igator-0.027
Token careful
Token ID8161
Feature activation+1.16e-02

Pos logit contributions
arse+0.018
har+0.017
ized+0.017
ge+0.017
 destination+0.017
Neg logit contributions
 Amen-0.017
 happens-0.017
 teaches-0.017
 lived-0.017
igators-0.016
Token,
Token ID11
Feature activation+1.83e-02

Pos logit contributions
 Amen+0.021
 happens+0.020
 teaches+0.020
 lived+0.019
igator+0.019
Neg logit contributions
arse-0.022
ge-0.021
ized-0.020
 destination-0.020
har-0.020
Token crystals
Token ID24770
Feature activation+9.39e-02

Pos logit contributions
 Amen+0.030
 teaches+0.029
 lived+0.028
 happens+0.028
igator+0.027
Neg logit contributions
arse-0.037
 destination-0.036
ge-0.035
ized-0.035
har-0.034
Token can
Token ID460
Feature activation+1.13e-03

Pos logit contributions
 Amen+0.124
 teaches+0.113
 happens+0.108
 lived+0.107
igator+0.104
Neg logit contributions
arse-0.223
ge-0.219
 destination-0.215
ized-0.212
har-0.206
Token be
Token ID307
Feature activation-8.09e-03

Pos logit contributions
 lived+0.002
 teaches+0.002
 happens+0.002
 Amen+0.002
igator+0.002
Neg logit contributions
arse-0.002
ge-0.002
 destination-0.002
har-0.002
ized-0.002
Token delicate
Token ID19217
Feature activation+5.14e-02

Pos logit contributions
arse+0.017
 destination+0.016
ge+0.016
ized+0.015
har+0.015
Neg logit contributions
 Amen-0.013
 lived-0.013
 teaches-0.013
 happens-0.013
igator-0.012
Token."
Token ID526
Feature activation+1.26e-02

Pos logit contributions
 Amen+0.089
 lived+0.085
 teaches+0.084
 happens+0.083
igator+0.080
Neg logit contributions
ge-0.096
arse-0.095
 destination-0.095
ized-0.093
har-0.088
Token
Token ID198
Feature activation+1.19e-02

Pos logit contributions
 Amen+0.016
 teaches+0.016
 lived+0.015
 happens+0.015
igator+0.015
Neg logit contributions
arse-0.030
 destination-0.029
ge-0.029
ized-0.028
har-0.028
Token Sammy
Token ID44067
Feature activation-3.52e-02

Pos logit contributions
arse+0.010
 destination+0.010
har+0.009
ge+0.009
ized+0.009
Neg logit contributions
 lived-0.010
 Amen-0.010
 teaches-0.010
 happens-0.010
igator-0.010
Token would
Token ID561
Feature activation+7.00e-03

Pos logit contributions
arse+0.066
 destination+0.064
ge+0.064
ized+0.063
 forth+0.060
Neg logit contributions
 Amen-0.062
 happens-0.057
 teaches-0.057
igator-0.055
icks-0.055
Token often
Token ID1690
Feature activation+3.74e-02

Pos logit contributions
 happens+0.011
 Amen+0.011
 teaches+0.010
igator+0.010
eer+0.010
Neg logit contributions
arse-0.014
 destination-0.014
ge-0.014
ized-0.013
har-0.013
Token share
Token ID2648
Feature activation+1.62e-03

Pos logit contributions
 Amen+0.064
 teaches+0.063
 lived+0.063
 happens+0.062
igator+0.061
Neg logit contributions
arse-0.072
ge-0.068
 destination-0.068
ized-0.067
har-0.066
Token av
Token ID1196
Feature activation+7.88e-02

Pos logit contributions
 lived+0.002
 teaches+0.002
 happens+0.002
 lions+0.002
 Amen+0.002
Neg logit contributions
arse-0.004
ge-0.004
 destination-0.004
har-0.004
oured-0.003
Tokenoc
Token ID420
Feature activation+1.02e-01

Pos logit contributions
 happens+0.092
 Amen+0.092
 lived+0.088
 teaches+0.088
igator+0.084
Neg logit contributions
ge-0.192
arse-0.192
ized-0.189
 destination-0.186
har-0.184
Tokenados
Token ID22484
Feature activation+5.89e-02

Pos logit contributions
 Amen+0.201
 lived+0.196
 teaches+0.194
 happens+0.192
igator+0.189
Neg logit contributions
arse-0.172
ized-0.165
ge-0.164
 destination-0.162
har-0.158
Token and
Token ID290
Feature activation+9.67e-03

Pos logit contributions
 Amen+0.109
 teaches+0.106
 happens+0.106
 lived+0.105
igator+0.102
Neg logit contributions
arse-0.105
ge-0.101
 destination-0.100
ized-0.099
har-0.097
Token other
Token ID584
Feature activation+2.36e-02

Pos logit contributions
 Amen+0.017
 happens+0.016
 teaches+0.016
 lived+0.016
igator+0.016
Neg logit contributions
arse-0.018
 destination-0.018
ge-0.018
ized-0.017
har-0.017
Token y
Token ID331
Feature activation+1.79e-02

Pos logit contributions
 lived+0.034
 teaches+0.034
 happens+0.034
igators+0.033
 Amen+0.033
Neg logit contributions
arse-0.052
 destination-0.050
ge-0.049
har-0.049
ized-0.046
Tokenummy
Token ID13513
Feature activation+6.84e-02

Pos logit contributions
 happens+0.026
 Amen+0.025
icks+0.025
 lived+0.025
 teaches+0.025
Neg logit contributions
arse-0.038
 destination-0.036
ge-0.036
har-0.036
oured-0.034
TokenThey
Token ID2990
Feature activation+9.20e-04

Pos logit contributions
arse+0.038
ge+0.036
 destination+0.035
ized+0.035
har+0.035
Neg logit contributions
 Amen-0.034
 teaches-0.034
 happens-0.033
 lived-0.033
igator-0.032
Token were
Token ID547
Feature activation+3.26e-03

Pos logit contributions
 lived+0.001
 teaches+0.001
 happens+0.001
igator+0.001
 Amen+0.001
Neg logit contributions
arse-0.002
ge-0.002
 destination-0.002
har-0.002
ized-0.002
Token here
Token ID994
Feature activation-9.11e-03

Pos logit contributions
 Amen+0.006
 teaches+0.006
 happens+0.006
 lived+0.006
igator+0.005
Neg logit contributions
arse-0.006
 destination-0.006
ge-0.006
ized-0.006
har-0.006
Token a
Token ID257
Feature activation-1.67e-02

Pos logit contributions
arse+0.019
 destination+0.019
ge+0.019
ized+0.018
har+0.018
Neg logit contributions
 Amen-0.014
 teaches-0.013
 lived-0.013
 happens-0.013
igator-0.013
Token minute
Token ID5664
Feature activation-1.38e-02

Pos logit contributions
arse+0.034
 destination+0.032
ge+0.032
har+0.031
ized+0.031
Neg logit contributions
 Amen-0.027
 teaches-0.026
 happens-0.026
 lived-0.026
igator-0.025
Token ago
Token ID2084
Feature activation+9.97e-02

Pos logit contributions
ge+0.026
arse+0.025
 destination+0.025
ized+0.023
har+0.023
Neg logit contributions
 Amen-0.024
 teaches-0.023
 lived-0.023
 happens-0.023
igator-0.022
Token."
Token ID526
Feature activation+8.22e-03

Pos logit contributions
 Amen+0.168
 teaches+0.157
 happens+0.156
 lived+0.153
igator+0.153
Neg logit contributions
 destination-0.192
arse-0.192
ge-0.188
ized-0.182
har-0.180
Token
Token ID198
Feature activation+1.43e-03

Pos logit contributions
 Amen+0.012
igator+0.012
 teaches+0.012
igators+0.012
 lived+0.012
Neg logit contributions
arse-0.017
 destination-0.017
ge-0.017
ized-0.016
har-0.016
Token
Token ID198
Feature activation-1.89e-03

Pos logit contributions
igator+0.002
 Amen+0.002
igators+0.002
 teaches+0.002
eer+0.002
Neg logit contributions
ge-0.003
arse-0.003
har-0.003
 destination-0.003
ized-0.003
Token"
Token ID1
Feature activation-4.52e-02

Pos logit contributions
arse+0.004
ge+0.004
 destination+0.004
har+0.004
ized+0.004
Neg logit contributions
 Amen-0.003
 teaches-0.003
 happens-0.003
 lived-0.002
igator-0.002
TokenMaybe
Token ID13300
Feature activation+7.29e-03

Pos logit contributions
arse+0.070
ge+0.070
har+0.069
ized+0.069
 destination+0.065
Neg logit contributions
 Amen-0.088
icks-0.087
 happens-0.086
 teaches-0.086
 lived-0.084
Token treasure
Token ID14068
Feature activation+1.49e-02

Pos logit contributions
arse+0.019
ge+0.018
 destination+0.018
ized+0.018
har+0.018
Neg logit contributions
 Amen-0.013
 teaches-0.013
 lived-0.013
 happens-0.013
igator-0.012
Token from
Token ID422
Feature activation+2.01e-02

Pos logit contributions
 Amen+0.024
 teaches+0.023
 happens+0.023
 lived+0.023
igators+0.022
Neg logit contributions
 destination-0.030
arse-0.030
ge-0.029
ized-0.028
har-0.028
Token a
Token ID257
Feature activation-2.74e-03

Pos logit contributions
 Amen+0.029
 lived+0.029
 teaches+0.029
 happens+0.028
igator+0.028
Neg logit contributions
arse-0.045
ge-0.042
 destination-0.041
ized-0.041
har-0.041
Token long
Token ID890
Feature activation+2.12e-02

Pos logit contributions
arse+0.006
ge+0.006
ized+0.006
 destination+0.006
har+0.005
Neg logit contributions
 Amen-0.004
 lived-0.004
 teaches-0.004
 happens-0.004
igator-0.004
Token time
Token ID640
Feature activation-2.71e-02

Pos logit contributions
 Amen+0.030
 lived+0.029
 teaches+0.029
 happens+0.028
igator+0.028
Neg logit contributions
arse-0.048
ge-0.046
 destination-0.045
ized-0.045
har-0.045
Token ago
Token ID2084
Feature activation+9.71e-02

Pos logit contributions
arse+0.065
ge+0.063
 destination+0.063
ized+0.061
har+0.061
Neg logit contributions
 Amen-0.033
 teaches-0.032
 lived-0.032
 happens-0.032
igator-0.031
Token."
Token ID526
Feature activation+1.14e-02

Pos logit contributions
 Amen+0.149
 happens+0.143
 teaches+0.143
 lived+0.140
igator+0.138
Neg logit contributions
arse-0.202
 destination-0.199
ge-0.196
ized-0.191
har-0.188
Token
Token ID198
Feature activation+7.50e-03

Pos logit contributions
 Amen+0.013
 teaches+0.013
 lived+0.013
 happens+0.013
igator+0.012
Neg logit contributions
arse-0.028
ge-0.027
 destination-0.027
ized-0.027
har-0.026
Token
Token ID220
Feature activation-2.51e-02

Pos logit contributions
 Amen+0.011
 happens+0.011
igator+0.011
 teaches+0.010
igators+0.010
Neg logit contributions
arse-0.016
ge-0.016
 destination-0.015
ized-0.015
har-0.015
Token
Token ID198
Feature activation+2.29e-03

Pos logit contributions
arse+0.056
 destination+0.054
ge+0.053
har+0.052
ized+0.051
Neg logit contributions
 Amen-0.036
 happens-0.034
igator-0.034
 teaches-0.034
 lived-0.033
TokenThe
Token ID464
Feature activation+6.56e-03

Pos logit contributions
 Amen+0.004
 teaches+0.004
 lived+0.004
 happens+0.004
igator+0.004
Neg logit contributions
arse-0.004
 destination-0.004
ge-0.004
ized-0.004
har-0.003
Token happy
Token ID3772
Feature activation+2.04e-02

Pos logit contributions
 Amen+0.020
 teaches+0.019
 happens+0.019
 lived+0.019
igator+0.019
Neg logit contributions
arse-0.023
 destination-0.021
ge-0.021
ized-0.021
har-0.021
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
 Amen+0.038
 teaches+0.037
igator+0.036
igators+0.036
 happens+0.036
Neg logit contributions
arse-0.034
 destination-0.034
ge-0.034
ized-0.034
har-0.032
Token He
Token ID679
Feature activation+1.29e-02

Pos logit contributions
ile+0.020
 happens+0.020
igator+0.019
eer+0.019
 teaches+0.019
Neg logit contributions
arse-0.028
 destination-0.028
har-0.027
ishment-0.027
ge-0.027
Token hugged
Token ID41130
Feature activation-1.31e-02

Pos logit contributions
 Amen+0.021
 teaches+0.021
 lived+0.020
 happens+0.020
igator+0.020
Neg logit contributions
arse-0.026
 destination-0.024
ge-0.024
ized-0.024
har-0.024
Token t
Token ID256
Feature activation+5.95e-02

Pos logit contributions
arse+0.033
ge+0.031
 destination+0.031
har+0.030
ized+0.030
Neg logit contributions
 Amen-0.016
 lived-0.016
 teaches-0.015
igator-0.015
 happens-0.015
Tokeneddy
Token ID21874
Feature activation+9.79e-02

Pos logit contributions
 Amen+0.075
 lived+0.070
 teaches+0.067
igator+0.067
eer+0.067
Neg logit contributions
arse-0.144
 destination-0.141
ge-0.134
ishment-0.130
har-0.129
Token and
Token ID290
Feature activation+1.73e-02

Pos logit contributions
 Amen+0.169
 teaches+0.165
 lived+0.163
 happens+0.161
igator+0.158
Neg logit contributions
arse-0.190
ge-0.182
 destination-0.181
ized-0.176
har-0.174
Token said
Token ID531
Feature activation+2.05e-02

Pos logit contributions
 Amen+0.030
 teaches+0.029
 happens+0.029
 lived+0.029
igator+0.028
Neg logit contributions
arse-0.033
 destination-0.031
ge-0.031
ized-0.031
har-0.030
Token,
Token ID11
Feature activation+2.43e-02

Pos logit contributions
ile+0.049
igators+0.048
eer+0.047
igator+0.045
 teaches+0.044
Neg logit contributions
ized-0.024
 breath-0.023
ipping-0.021
ooming-0.021
 destination-0.021
Token "
Token ID366
Feature activation+2.69e-03

Pos logit contributions
igators+0.031
ile+0.030
 happens+0.030
igator+0.029
 teaches+0.028
Neg logit contributions
arse-0.058
ge-0.056
har-0.055
 destination-0.055
ized-0.054
TokenThank
Token ID10449
Feature activation+2.72e-02

Pos logit contributions
 Amen+0.004
 teaches+0.004
 happens+0.004
 lived+0.004
igators+0.004
Neg logit contributions
arse-0.005
ge-0.005
 destination-0.005
ized-0.005
har-0.005
Token Her
Token ID2332
Feature activation-2.18e-02

Pos logit contributions
arse+0.004
 destination+0.004
ge+0.004
ized+0.004
har+0.004
Neg logit contributions
 Amen-0.002
 teaches-0.002
 happens-0.002
igator-0.002
igators-0.002
Token mom
Token ID1995
Feature activation-2.58e-02

Pos logit contributions
arse+0.043
ge+0.041
 destination+0.041
ized+0.040
har+0.040
Neg logit contributions
 Amen-0.037
 teaches-0.036
 lived-0.035
 happens-0.035
igator-0.034
Tokenmy
Token ID1820
Feature activation-5.83e-03

Pos logit contributions
arse+0.047
ge+0.045
 destination+0.045
ized+0.044
har+0.043
Neg logit contributions
 Amen-0.047
 teaches-0.046
 happens-0.045
 lived-0.045
igator-0.044
Token said
Token ID531
Feature activation+1.49e-03

Pos logit contributions
arse+0.012
 destination+0.011
ge+0.011
ized+0.011
har+0.011
Neg logit contributions
 Amen-0.009
 teaches-0.009
 lived-0.009
 happens-0.009
igator-0.009
Token she
Token ID673
Feature activation+2.27e-02

Pos logit contributions
igators+0.003
eer+0.003
ile+0.003
igator+0.003
 teaches+0.003
Neg logit contributions
ized-0.002
arse-0.002
 breath-0.002
 sudden-0.002
 injuries-0.002
Token was
Token ID373
Feature activation+3.16e-02

Pos logit contributions
 Amen+0.034
 teaches+0.033
igator+0.032
igators+0.032
 happens+0.032
Neg logit contributions
arse-0.048
 destination-0.046
ge-0.046
ized-0.045
har-0.045
Token lucky
Token ID9670
Feature activation-5.15e-03

Pos logit contributions
 Amen+0.053
 teaches+0.052
 lived+0.052
 happens+0.052
igator+0.050
Neg logit contributions
arse-0.062
 destination-0.059
ge-0.058
ized-0.057
har-0.057
Token it
Token ID340
Feature activation-3.62e-03

Pos logit contributions
arse+0.010
 destination+0.010
ge+0.010
ized+0.010
har+0.010
Neg logit contributions
 Amen-0.008
 teaches-0.008
 lived-0.008
 happens-0.008
igator-0.008
Token wasn
Token ID2492
Feature activation-8.68e-04

Pos logit contributions
arse+0.008
ge+0.008
 destination+0.008
ized+0.007
har+0.007
Neg logit contributions
 Amen-0.005
 teaches-0.005
 happens-0.005
 lived-0.005
igator-0.005
Token't
Token ID470
Feature activation+2.07e-03

Pos logit contributions
arse+0.002
ge+0.002
 destination+0.002
ized+0.002
har+0.002
Neg logit contributions
 Amen-0.001
 teaches-0.001
 lived-0.001
 happens-0.001
igator-0.001
Token worse
Token ID4785
Feature activation+6.75e-04

Pos logit contributions
 Amen+0.003
 teaches+0.003
 happens+0.003
 lived+0.003
igators+0.003
Neg logit contributions
arse-0.004
ge-0.004
 destination-0.004
ized-0.004
har-0.004
Token Sam
Token ID3409
Feature activation-3.65e-02

Pos logit contributions
 Amen+0.007
 teaches+0.007
 happens+0.006
 lived+0.006
igator+0.006
Neg logit contributions
arse-0.010
 destination-0.009
ge-0.009
ized-0.009
har-0.009
Token said
Token ID531
Feature activation-9.94e-03

Pos logit contributions
arse+0.071
 destination+0.068
ge+0.068
ized+0.067
har+0.066
Neg logit contributions
 Amen-0.061
 teaches-0.059
 happens-0.059
 lived-0.059
igator-0.057
Token,
Token ID11
Feature activation-1.98e-03

Pos logit contributions
arse+0.018
ized+0.017
ge+0.017
oured+0.016
ipping+0.016
Neg logit contributions
 Amen-0.018
 teaches-0.017
igators-0.017
eer-0.017
 lived-0.017
Token "
Token ID366
Feature activation-1.02e-02

Pos logit contributions
arse+0.006
ge+0.005
 destination+0.005
ized+0.005
har+0.005
Neg logit contributions
 Amen-0.002
 teaches-0.002
 happens-0.002
 lived-0.001
igator-0.001
TokenYes
Token ID5297
Feature activation+2.71e-02

Pos logit contributions
arse+0.018
 destination+0.017
ge+0.017
ized+0.016
har+0.016
Neg logit contributions
 Amen-0.019
 teaches-0.019
 lived-0.019
 happens-0.019
igator-0.018
Token,
Token ID11
Feature activation+3.90e-02

Pos logit contributions
 Amen+0.037
 teaches+0.036
 lived+0.035
 happens+0.035
igator+0.034
Neg logit contributions
arse-0.062
ge-0.060
 destination-0.060
ized-0.058
har-0.058
Token let
Token ID1309
Feature activation+4.60e-03

Pos logit contributions
 Amen+0.066
 lived+0.061
icks+0.059
 teaches+0.059
 happens+0.058
Neg logit contributions
arse-0.071
 destination-0.070
ized-0.069
har-0.069
ge-0.068
Token's
Token ID338
Feature activation+4.19e-02

Pos logit contributions
 Amen+0.007
 teaches+0.007
 lived+0.007
 happens+0.007
igator+0.007
Neg logit contributions
arse-0.010
 destination-0.009
ge-0.009
ized-0.009
har-0.009
Token try
Token ID1949
Feature activation-1.42e-02

Pos logit contributions
 Amen+0.075
 lived+0.074
 teaches+0.073
 happens+0.071
igator+0.071
Neg logit contributions
arse-0.079
ge-0.075
 destination-0.074
ized-0.074
har-0.072
Token together
Token ID1978
Feature activation+2.16e-02

Pos logit contributions
arse+0.033
 destination+0.031
har+0.031
ge+0.031
ized+0.031
Neg logit contributions
 happens-0.018
 teaches-0.018
 Amen-0.018
icks-0.017
 lived-0.017
Token."
Token ID526
Feature activation+1.49e-02

Pos logit contributions
 Amen+0.032
 teaches+0.030
 lived+0.030
 happens+0.030
igator+0.029
Neg logit contributions
arse-0.048
ge-0.046
 destination-0.046
ized-0.045
har-0.044
Token down
Token ID866
Feature activation-2.01e-02

Pos logit contributions
 lived+0.014
 Amen+0.014
 likes+0.013
 lions+0.013
igator+0.013
Neg logit contributions
arse-0.019
ge-0.018
 destination-0.017
oured-0.016
har-0.016
Token and
Token ID290
Feature activation-2.28e-03

Pos logit contributions
arse+0.041
ge+0.039
 destination+0.039
ized+0.038
har+0.037
Neg logit contributions
 Amen-0.032
 teaches-0.032
 lived-0.032
 happens-0.031
igator-0.030
Token enjoyed
Token ID8359
Feature activation-2.10e-04

Pos logit contributions
arse+0.006
ge+0.005
 destination+0.005
ized+0.005
har+0.005
Neg logit contributions
 lived-0.003
 Amen-0.003
 teaches-0.003
 happens-0.003
igator-0.003
Token the
Token ID262
Feature activation+3.29e-03

Pos logit contributions
arse+0.000
ge+0.000
 destination+0.000
ized+0.000
har+0.000
Neg logit contributions
 Amen-0.000
 teaches-0.000
 happens-0.000
 lived-0.000
igator-0.000
Token y
Token ID331
Feature activation-7.85e-03

Pos logit contributions
 Amen+0.005
 happens+0.005
 teaches+0.005
 lived+0.005
igators+0.005
Neg logit contributions
arse-0.007
 destination-0.006
ized-0.006
ge-0.006
har-0.006
Tokenummy
Token ID13513
Feature activation+4.38e-02

Pos logit contributions
arse+0.015
ge+0.015
 destination+0.015
ized+0.014
har+0.014
Neg logit contributions
 Amen-0.013
 teaches-0.013
 lived-0.013
 happens-0.013
igator-0.012
Token apple
Token ID17180
Feature activation+1.03e-02

Pos logit contributions
 Amen+0.071
 teaches+0.068
 happens+0.065
icks+0.064
 lived+0.063
Neg logit contributions
ized-0.086
arse-0.086
ge-0.085
 destination-0.083
 forth-0.082
Token together
Token ID1978
Feature activation+1.42e-02

Pos logit contributions
 lived+0.015
 teaches+0.015
 Amen+0.015
igator+0.014
 happens+0.014
Neg logit contributions
arse-0.023
 destination-0.022
ge-0.021
har-0.021
ized-0.021
Token.
Token ID13
Feature activation+4.36e-03

Pos logit contributions
 lived+0.020
 teaches+0.020
 Amen+0.019
 happens+0.019
 lions+0.018
Neg logit contributions
arse-0.032
har-0.030
 destination-0.030
ge-0.030
ized-0.029
Token The
Token ID383
Feature activation+6.50e-03

Pos logit contributions
 Amen+0.007
 happens+0.007
igator+0.007
 teaches+0.007
ile+0.007
Neg logit contributions
arse-0.008
 destination-0.008
ized-0.008
ge-0.008
har-0.008
Token end
Token ID886
Feature activation-2.51e-02

Pos logit contributions
 Amen+0.011
 happens+0.011
 teaches+0.010
 lived+0.010
igators+0.010
Neg logit contributions
arse-0.013
 destination-0.012
ized-0.012
ge-0.012
har-0.012
TokenIn
Token ID818
Feature activation-2.09e-02

Pos logit contributions
 Amen+0.022
 happens+0.022
 teaches+0.022
 lived+0.022
igator+0.021
Neg logit contributions
arse-0.021
ge-0.020
 destination-0.020
ized-0.020
har-0.020
Token the
Token ID262
Feature activation+2.09e-02

Pos logit contributions
arse+0.045
ge+0.043
har+0.042
ized+0.041
 destination+0.040
Neg logit contributions
 lived-0.033
 Amen-0.032
 teaches-0.032
 happens-0.031
igator-0.030
Token end
Token ID886
Feature activation-1.68e-03

Pos logit contributions
 Amen+0.026
 teaches+0.026
 lived+0.025
 lions+0.024
igator+0.024
Neg logit contributions
ge-0.051
arse-0.050
ized-0.048
har-0.047
oured-0.047
Token,
Token ID11
Feature activation+1.50e-02

Pos logit contributions
 destination+0.003
arse+0.003
ge+0.003
ized+0.003
har+0.003
Neg logit contributions
 Amen-0.003
 teaches-0.003
eer-0.003
 happens-0.003
 vitamins-0.003
Token the
Token ID262
Feature activation+2.92e-02

Pos logit contributions
 lived+0.022
igator+0.020
 teaches+0.020
igators+0.020
 lions+0.020
Neg logit contributions
arse-0.034
ge-0.033
ized-0.031
 destination-0.031
har-0.031
Token avocado
Token ID40377
Feature activation+4.28e-02

Pos logit contributions
 Amen+0.044
 teaches+0.043
 lived+0.043
igator+0.043
 happens+0.042
Neg logit contributions
arse-0.063
ge-0.061
 destination-0.058
ized-0.058
har-0.057
Token realized
Token ID6939
Feature activation+1.44e-02

Pos logit contributions
 Amen+0.069
 teaches+0.065
 happens+0.065
 lived+0.064
igator+0.063
Neg logit contributions
arse-0.086
 destination-0.084
ge-0.083
ized-0.082
har-0.081
Token that
Token ID326
Feature activation+6.90e-03

Pos logit contributions
 lived+0.017
 teaches+0.016
 Amen+0.016
 happens+0.016
igator+0.015
Neg logit contributions
arse-0.037
ge-0.034
 destination-0.034
har-0.034
ized-0.033
Token there
Token ID612
Feature activation-2.31e-02

Pos logit contributions
 Amen+0.012
 teaches+0.011
 happens+0.011
icks+0.011
 lived+0.011
Neg logit contributions
arse-0.013
 destination-0.013
ge-0.013
ized-0.013
har-0.012
Token was
Token ID373
Feature activation+1.54e-02

Pos logit contributions
 destination+0.047
arse+0.047
ge+0.046
ized+0.046
har+0.043
Neg logit contributions
 Amen-0.037
 teaches-0.033
 happens-0.032
icks-0.032
eer-0.032
Token no
Token ID645
Feature activation+1.65e-02

Pos logit contributions
 Amen+0.026
 teaches+0.024
 happens+0.024
icks+0.024
 lived+0.024
Neg logit contributions
 destination-0.029
arse-0.029
ge-0.028
ized-0.028
har-0.028
Token uses
Token ID3544
Feature activation+1.60e-02

Pos logit contributions
 Amen+0.017
igator+0.016
ile+0.015
eer+0.015
 lived+0.015
Neg logit contributions
 destination-0.025
oured-0.024
arse-0.023
ishment-0.023
ized-0.023
Token a
Token ID257
Feature activation+1.26e-03

Pos logit contributions
 Amen+0.028
 lived+0.027
 teaches+0.027
 happens+0.027
igator+0.026
Neg logit contributions
arse-0.030
 destination-0.029
ge-0.029
ized-0.028
har-0.027
Token teaspoon
Token ID22326
Feature activation+3.77e-02

Pos logit contributions
 Amen+0.002
 teaches+0.002
 lived+0.002
 happens+0.002
igator+0.002
Neg logit contributions
arse-0.002
 destination-0.002
ge-0.002
ized-0.002
har-0.002
Token to
Token ID284
Feature activation+7.49e-05

Pos logit contributions
 Amen+0.054
 teaches+0.053
 lived+0.052
 happens+0.052
igator+0.050
Neg logit contributions
arse-0.084
 destination-0.080
ge-0.080
ized-0.078
har-0.078
Token make
Token ID787
Feature activation-2.71e-03

Pos logit contributions
 teaches+0.000
 lived+0.000
 Amen+0.000
igator+0.000
 happens+0.000
Neg logit contributions
arse-0.000
 destination-0.000
ge-0.000
ized-0.000
har-0.000
Token swirl
Token ID42835
Feature activation+3.20e-02

Pos logit contributions
arse+0.006
 destination+0.005
ge+0.005
ized+0.005
har+0.005
Neg logit contributions
 Amen-0.004
 teaches-0.004
 lived-0.004
 happens-0.004
igator-0.004
Tokens
Token ID82
Feature activation+5.18e-03

Pos logit contributions
 Amen+0.063
 teaches+0.060
 lived+0.060
 happens+0.059
igator+0.058
Neg logit contributions
arse-0.055
ge-0.053
 destination-0.052
ized-0.051
har-0.050
Token and
Token ID290
Feature activation+9.92e-04

Pos logit contributions
 Amen+0.009
 teaches+0.009
 lived+0.008
 happens+0.008
igator+0.008
Neg logit contributions
arse-0.010
 destination-0.010
ge-0.010
ized-0.009
har-0.009
Token dots
Token ID22969
Feature activation+6.01e-02

Pos logit contributions
 Amen+0.002
igator+0.002
 lived+0.002
ile+0.002
 teaches+0.002
Neg logit contributions
ge-0.002
arse-0.002
 destination-0.002
ized-0.002
 forth-0.002
Token on
Token ID319
Feature activation-2.44e-02

Pos logit contributions
 Amen+0.117
 teaches+0.108
 lived+0.108
 happens+0.106
igator+0.105
Neg logit contributions
arse-0.104
ge-0.103
 destination-0.101
ized-0.099
har-0.093
Token the
Token ID262
Feature activation-2.01e-02

Pos logit contributions
arse+0.049
ge+0.046
 destination+0.046
har+0.045
ized+0.045
Neg logit contributions
 teaches-0.040
 Amen-0.040
 lived-0.039
 happens-0.039
igator-0.038
Token things
Token ID1243
Feature activation+1.55e-02

Pos logit contributions
 Amen+0.032
 teaches+0.030
 lived+0.029
 happens+0.029
igator+0.029
Neg logit contributions
arse-0.053
 destination-0.051
ge-0.051
ized-0.050
har-0.049
Token can
Token ID460
Feature activation-1.20e-02

Pos logit contributions
 Amen+0.018
 teaches+0.016
 lived+0.015
 happens+0.015
igator+0.015
Neg logit contributions
arse-0.039
 destination-0.038
ge-0.038
ized-0.037
har-0.036
Token break
Token ID2270
Feature activation-1.56e-02

Pos logit contributions
arse+0.025
 destination+0.023
ge+0.023
har+0.023
ized+0.023
Neg logit contributions
 teaches-0.019
 lived-0.019
 Amen-0.018
 happens-0.018
igator-0.018
Token easily
Token ID3538
Feature activation+2.34e-02

Pos logit contributions
arse+0.033
 destination+0.031
har+0.031
ge+0.031
oured+0.031
Neg logit contributions
 happens-0.023
 teaches-0.023
 lived-0.023
igators-0.022
igator-0.022
Token.
Token ID13
Feature activation+3.91e-03

Pos logit contributions
 teaches+0.031
 happens+0.030
 lived+0.029
icks+0.027
igator+0.027
Neg logit contributions
arse-0.054
har-0.052
oured-0.051
ized-0.050
ishment-0.050
Token<|endoftext|>
Token ID50256
Feature activation-3.03e-02

Pos logit contributions
 happens+0.006
igator+0.005
 lived+0.005
eer+0.005
igators+0.005
Neg logit contributions
arse-0.008
har-0.008
 destination-0.008
ge-0.008
-0.008
TokenTim
Token ID14967
Feature activation+1.81e-02

Pos logit contributions
arse+0.066
 destination+0.064
ge+0.063
ized+0.061
har+0.061
Neg logit contributions
 Amen-0.044
 teaches-0.042
 happens-0.041
igator-0.041
 lived-0.040
Tokenmy
Token ID1820
Feature activation+1.88e-02

Pos logit contributions
 Amen+0.033
 teaches+0.031
 happens+0.031
igators+0.031
igator+0.030
Neg logit contributions
arse-0.032
 destination-0.032
ge-0.031
ized-0.030
har-0.029
Token was
Token ID373
Feature activation+2.52e-02

Pos logit contributions
 Amen+0.029
igators+0.026
igator+0.026
 teaches+0.026
icks+0.025
Neg logit contributions
 destination-0.038
arse-0.037
ized-0.037
ge-0.036
oured-0.035
Token playing
Token ID2712
Feature activation-6.97e-03

Pos logit contributions
 happens+0.042
 Amen+0.042
 teaches+0.042
 lived+0.041
igators+0.040
Neg logit contributions
arse-0.049
 destination-0.047
ge-0.046
ized-0.045
har-0.045
Token with
Token ID351
Feature activation+4.57e-04

Pos logit contributions
arse+0.015
ge+0.014
 destination+0.014
har+0.014
ized+0.013
Neg logit contributions
 lived-0.011
 Amen-0.011
 teaches-0.010
igator-0.010
 happens-0.010
Token you
Token ID345
Feature activation+2.09e-02

Pos logit contributions
arse+0.025
ge+0.024
 destination+0.024
ized+0.024
har+0.023
Neg logit contributions
 Amen-0.019
 teaches-0.019
 lived-0.018
 happens-0.018
igator-0.017
Token."
Token ID526
Feature activation+1.34e-02

Pos logit contributions
 Amen+0.027
 teaches+0.026
 happens+0.026
 lived+0.026
igators+0.024
Neg logit contributions
arse-0.050
ge-0.048
 destination-0.047
ized-0.046
har-0.046
Token The
Token ID383
Feature activation+1.19e-02

Pos logit contributions
 Amen+0.018
 teaches+0.017
 lived+0.017
 happens+0.017
igator+0.016
Neg logit contributions
arse-0.032
ge-0.030
 destination-0.030
ized-0.029
har-0.029
Token bird
Token ID6512
Feature activation+2.65e-02

Pos logit contributions
icks+0.019
 Amen+0.019
 happens+0.019
 teaches+0.019
 vitamins+0.018
Neg logit contributions
arse-0.022
ized-0.021
 destination-0.021
 sudden-0.021
 source-0.021
Token felt
Token ID2936
Feature activation+9.80e-03

Pos logit contributions
 Amen+0.050
 teaches+0.047
 vitamins+0.047
 happens+0.047
igators+0.047
Neg logit contributions
arse-0.046
ized-0.044
ge-0.044
har-0.043
 destination-0.043
Token better
Token ID1365
Feature activation+5.20e-03

Pos logit contributions
 lions+0.018
 Amen+0.018
igators+0.018
eer+0.018
ile+0.018
Neg logit contributions
ized-0.016
arse-0.016
 completely-0.015
 breath-0.015
 destination-0.014
Token and
Token ID290
Feature activation+1.39e-02

Pos logit contributions
 Amen+0.006
 lived+0.006
 about+0.005
 teaches+0.005
 happens+0.005
Neg logit contributions
arse-0.013
 destination-0.013
oured-0.012
har-0.012
ized-0.012
Token flew
Token ID13112
Feature activation+3.67e-02

Pos logit contributions
 Amen+0.026
 teaches+0.025
igators+0.024
icks+0.024
igator+0.024
Neg logit contributions
ized-0.023
arse-0.023
 destination-0.023
 forth-0.022
har-0.022
Token away
Token ID1497
Feature activation-1.59e-02

Pos logit contributions
 Amen+0.082
igator+0.080
 lived+0.080
 happens+0.080
 teaches+0.080
Neg logit contributions
arse-0.053
ge-0.049
 destination-0.047
ized-0.046
ishment-0.045
Token.
Token ID13
Feature activation+8.35e-03

Pos logit contributions
arse+0.038
 destination+0.036
ishment+0.035
ized+0.035
oured+0.035
Neg logit contributions
 lived-0.020
 Amen-0.019
 happens-0.019
 teaches-0.019
igator-0.018
Token
Token ID220
Feature activation-3.02e-02

Pos logit contributions
ile+0.017
igator+0.017
 Amen+0.017
 happens+0.017
igators+0.017
Neg logit contributions
arse-0.012
 destination-0.012
ized-0.011
oured-0.011
har-0.011
Token One
Token ID1881
Feature activation+2.07e-02

Pos logit contributions
 Amen+0.022
 happens+0.022
 teaches+0.021
ile+0.021
icks+0.021
Neg logit contributions
 destination-0.034
arse-0.034
ge-0.033
har-0.032
rolling-0.031
Token day
Token ID1110
Feature activation-2.23e-05

Pos logit contributions
 Amen+0.020
 teaches+0.019
 happens+0.019
icks+0.018
 lived+0.018
Neg logit contributions
arse-0.055
 destination-0.053
ge-0.053
ized-0.052
har-0.052
Token,
Token ID11
Feature activation+2.01e-03

Pos logit contributions
arse+0.000
ge+0.000
 destination+0.000
ized+0.000
har+0.000
Neg logit contributions
 Amen-0.000
 teaches-0.000
 lived-0.000
 happens-0.000
igator-0.000
Token he
Token ID339
Feature activation+1.28e-02

Pos logit contributions
 lived+0.002
igator+0.002
 lions+0.002
 happens+0.002
igators+0.002
Neg logit contributions
arse-0.006
 destination-0.006
ishment-0.005
har-0.005
ge-0.005
Token decided
Token ID3066
Feature activation+1.84e-02

Pos logit contributions
 lived+0.019
 Amen+0.018
 teaches+0.018
 happens+0.018
igator+0.017
Neg logit contributions
arse-0.028
ge-0.027
 destination-0.027
ized-0.026
har-0.026
Token to
Token ID284
Feature activation+5.54e-03

Pos logit contributions
 Amen+0.026
 teaches+0.025
 lived+0.025
 happens+0.025
igator+0.024
Neg logit contributions
arse-0.041
ge-0.039
 destination-0.039
ized-0.038
har-0.038
Token explore
Token ID7301
Feature activation-2.96e-02

Pos logit contributions
 Amen+0.009
 lived+0.009
 teaches+0.009
 happens+0.009
igator+0.008
Neg logit contributions
arse-0.011
ge-0.011
 destination-0.011
ized-0.011
har-0.010
Token the
Token ID262
Feature activation-1.23e-02

Pos logit contributions
arse+0.071
har+0.067
 destination+0.066
ge+0.065
oured+0.064
Neg logit contributions
 lived-0.041
 happens-0.038
 teaches-0.037
 Amen-0.037
igator-0.036
Token woods
Token ID16479
Feature activation+1.05e-02

Pos logit contributions
arse+0.026
ge+0.026
oured+0.024
har+0.024
ized+0.024
Neg logit contributions
 lived-0.019
 teaches-0.018
 Amen-0.018
 lions-0.018
igator-0.018
Token.
Token ID13
Feature activation+1.52e-03

Pos logit contributions
 lived+0.016
 teaches+0.016
 Amen+0.016
igator+0.015
 happens+0.015
Neg logit contributions
arse-0.023
 destination-0.021
ge-0.021
har-0.021
ized-0.020
Token He
Token ID679
Feature activation+7.35e-03

Pos logit contributions
 lived+0.001
 Amen+0.001
 teaches+0.001
 happens+0.001
igator+0.001
Neg logit contributions
arse-0.005
ized-0.004
ge-0.004
 destination-0.004
har-0.004
Token and
Token ID290
Feature activation-1.02e-02

Pos logit contributions
arse+0.072
ge+0.069
har+0.065
ishment+0.064
oured+0.063
Neg logit contributions
 happens-0.035
 teaches-0.034
 lived-0.034
 about-0.031
igator-0.030
Token brushes
Token ID36377
Feature activation-8.45e-04

Pos logit contributions
arse+0.023
ge+0.021
 destination+0.021
har+0.020
ipping+0.020
Neg logit contributions
 lived-0.017
 happens-0.016
 teaches-0.016
igator-0.014
 Amen-0.014
Token and
Token ID290
Feature activation-3.55e-03

Pos logit contributions
arse+0.002
har+0.002
ishment+0.002
ge+0.002
 destination+0.002
Neg logit contributions
 lived-0.001
 happens-0.001
 teaches-0.001
 about-0.001
icks-0.001
Token made
Token ID925
Feature activation+1.72e-02

Pos logit contributions
arse+0.008
ge+0.007
 destination+0.007
har+0.007
ipping+0.007
Neg logit contributions
 happens-0.006
 lived-0.006
 teaches-0.006
 Amen-0.005
igator-0.005
Token a
Token ID257
Feature activation-1.04e-02

Pos logit contributions
 happens+0.022
 teaches+0.022
 lived+0.022
 Amen+0.021
 lions+0.020
Neg logit contributions
arse-0.041
ge-0.040
 destination-0.039
har-0.038
ized-0.037
Token beautiful
Token ID4950
Feature activation+1.82e-02

Pos logit contributions
ge+0.024
arse+0.024
ized+0.022
oured+0.022
ipping+0.022
Neg logit contributions
 Amen-0.014
 teaches-0.014
 happens-0.014
 lived-0.014
 lions-0.013
Token picture
Token ID4286
Feature activation+1.29e-02

Pos logit contributions
 lived+0.029
 Amen+0.029
 teaches+0.028
 happens+0.028
 lions+0.027
Neg logit contributions
arse-0.038
ge-0.037
ized-0.034
 destination-0.034
har-0.033
Token of
Token ID286
Feature activation-2.04e-02

Pos logit contributions
 Amen+0.020
 lived+0.019
 teaches+0.019
 happens+0.019
igator+0.018
Neg logit contributions
arse-0.027
 destination-0.026
ge-0.026
ized-0.025
har-0.025
Token a
Token ID257
Feature activation-2.37e-02

Pos logit contributions
arse+0.047
ge+0.045
 destination+0.044
har+0.043
ized+0.043
Neg logit contributions
 lived-0.028
 Amen-0.028
 teaches-0.028
 happens-0.027
 lions-0.027
Token rainbow
Token ID27223
Feature activation+1.39e-02

Pos logit contributions
arse+0.051
ge+0.050
ized+0.048
 destination+0.047
ishment+0.047
Neg logit contributions
 Amen-0.036
 lived-0.035
 teaches-0.035
 happens-0.034
 lions-0.034
Token.
Token ID13
Feature activation+1.56e-03

Pos logit contributions
 lived+0.021
 teaches+0.021
 happens+0.021
 Amen+0.021
 lions+0.020
Neg logit contributions
arse-0.030
ge-0.028
 destination-0.028
har-0.027
ized-0.027
Token with
Token ID351
Feature activation+1.85e-02

Pos logit contributions
 Amen+0.007
 lived+0.007
 teaches+0.007
 happens+0.007
igator+0.006
Neg logit contributions
arse-0.009
 destination-0.008
ge-0.008
ized-0.008
har-0.008
Token her
Token ID607
Feature activation+8.00e-03

Pos logit contributions
 lived+0.023
 teaches+0.022
 Amen+0.022
igators+0.020
igator+0.020
Neg logit contributions
arse-0.048
ge-0.044
 destination-0.044
har-0.044
ooming-0.042
Token coat
Token ID13209
Feature activation-2.99e-02

Pos logit contributions
 lived+0.011
 teaches+0.011
 loves+0.011
 likes+0.010
 Amen+0.010
Neg logit contributions
arse-0.018
ge-0.017
ized-0.017
har-0.017
ishment-0.017
Token.
Token ID13
Feature activation-9.22e-03

Pos logit contributions
arse+0.070
ishment+0.066
ge+0.065
har+0.064
ized+0.063
Neg logit contributions
 lived-0.041
 teaches-0.038
 Amen-0.036
 about-0.036
igator-0.035
Token They
Token ID1119
Feature activation-1.54e-02

Pos logit contributions
arse+0.021
 destination+0.021
ge+0.020
har+0.020
ized+0.020
Neg logit contributions
 Amen-0.012
 happens-0.012
 teaches-0.011
igator-0.011
 lived-0.011
Token knew
Token ID2993
Feature activation+9.78e-03

Pos logit contributions
arse+0.034
ge+0.032
 destination+0.031
ized+0.031
har+0.031
Neg logit contributions
 lived-0.025
 teaches-0.023
 Amen-0.023
igator-0.022
 happens-0.021
Token she
Token ID673
Feature activation-3.48e-03

Pos logit contributions
 lived+0.012
 teaches+0.012
 Amen+0.012
 happens+0.012
igator+0.011
Neg logit contributions
arse-0.024
ge-0.023
 destination-0.023
har-0.022
ized-0.022
Token would
Token ID561
Feature activation+1.56e-03

Pos logit contributions
arse+0.009
ge+0.008
 destination+0.008
ized+0.008
har+0.008
Neg logit contributions
 lived-0.004
 Amen-0.004
 teaches-0.004
 happens-0.004
igator-0.004
Token be
Token ID307
Feature activation-2.25e-02

Pos logit contributions
 Amen+0.002
 lived+0.002
 teaches+0.002
 happens+0.002
igator+0.002
Neg logit contributions
arse-0.004
ge-0.004
ized-0.004
 destination-0.004
har-0.004
Token kept
Token ID4030
Feature activation-9.43e-04

Pos logit contributions
arse+0.045
ge+0.043
 destination+0.042
ized+0.041
har+0.040
Neg logit contributions
 Amen-0.038
 teaches-0.037
 lived-0.036
 happens-0.036
igator-0.035
Token safe
Token ID3338
Feature activation-1.59e-02

Pos logit contributions
arse+0.002
ge+0.002
 destination+0.002
har+0.002
oured+0.002
Neg logit contributions
 Amen-0.001
 teaches-0.001
 lived-0.001
 happens-0.001
icks-0.001
Token The
Token ID383
Feature activation-9.80e-03

Pos logit contributions
arse+0.018
ge+0.017
 destination+0.017
ized+0.017
har+0.016
Neg logit contributions
 lived-0.004
 teaches-0.004
 happens-0.004
 Amen-0.004
igator-0.003
Token little
Token ID1310
Feature activation-2.37e-02

Pos logit contributions
arse+0.025
ge+0.024
 destination+0.023
ized+0.023
har+0.023
Neg logit contributions
 teaches-0.012
igator-0.012
 Amen-0.011
 happens-0.011
 lived-0.011
Token boy
Token ID2933
Feature activation-8.09e-03

Pos logit contributions
arse+0.062
ge+0.058
har+0.057
 destination+0.057
ized+0.056
Neg logit contributions
igator-0.028
 lived-0.027
 teaches-0.026
 happens-0.026
eer-0.026
Token was
Token ID373
Feature activation+6.51e-03

Pos logit contributions
arse+0.018
 destination+0.017
ge+0.017
ishment+0.016
ized+0.016
Neg logit contributions
 lived-0.013
 happens-0.011
 teaches-0.011
igator-0.011
 Amen-0.011
Token a
Token ID257
Feature activation-2.48e-02

Pos logit contributions
 Amen+0.010
 lived+0.010
 teaches+0.010
 happens+0.010
igator+0.010
Neg logit contributions
arse-0.014
ge-0.013
 destination-0.013
ized-0.013
har-0.012
Token bit
Token ID1643
Feature activation-4.41e-02

Pos logit contributions
arse+0.042
ge+0.039
ized+0.039
 destination+0.039
ishment+0.038
Neg logit contributions
 Amen-0.049
 teaches-0.048
 lived-0.047
igator-0.047
 happens-0.047
Token frightened
Token ID24776
Feature activation-2.12e-02

Pos logit contributions
arse+0.096
 destination+0.093
ge+0.090
har+0.089
ized+0.086
Neg logit contributions
igator-0.065
 happens-0.064
 teaches-0.064
 Amen-0.063
igators-0.063
Token and
Token ID290
Feature activation-7.45e-03

Pos logit contributions
arse+0.046
 destination+0.044
ge+0.043
har+0.043
ized+0.042
Neg logit contributions
 Amen-0.031
 happens-0.030
 teaches-0.030
 lived-0.030
igator-0.030
Token he
Token ID339
Feature activation-1.32e-02

Pos logit contributions
arse+0.016
ge+0.015
 destination+0.015
har+0.015
ized+0.015
Neg logit contributions
 lived-0.012
 happens-0.011
 Amen-0.011
 teaches-0.011
igator-0.011
Token stepped
Token ID10764
Feature activation-3.71e-02

Pos logit contributions
arse+0.028
ge+0.026
 destination+0.025
ipping+0.025
ized+0.025
Neg logit contributions
 lived-0.023
 happens-0.021
 teaches-0.021
 Amen-0.021
igator-0.020
Token back
Token ID736
Feature activation-8.69e-03

Pos logit contributions
ge+0.073
arse+0.073
 destination+0.072
ishment+0.071
ized+0.068
Neg logit contributions
 Amen-0.059
icks-0.059
igator-0.058
ily-0.056
 about-0.054
Token gun
Token ID2485
Feature activation-1.00e-02

Pos logit contributions
ge+0.011
arse+0.011
har+0.010
ized+0.010
 destination+0.010
Neg logit contributions
 Amen-0.009
 lived-0.009
 teaches-0.009
eer-0.009
 happens-0.009
Token and
Token ID290
Feature activation-1.71e-03

Pos logit contributions
arse+0.024
har+0.023
ge+0.023
 destination+0.022
ized+0.022
Neg logit contributions
 teaches-0.012
 lived-0.012
igator-0.012
 happens-0.011
eer-0.011
Token went
Token ID1816
Feature activation+1.21e-02

Pos logit contributions
arse+0.004
 destination+0.004
ge+0.003
har+0.003
ipping+0.003
Neg logit contributions
 lived-0.003
 happens-0.003
 teaches-0.003
 Amen-0.002
 lions-0.002
Token to
Token ID284
Feature activation+1.68e-02

Pos logit contributions
 Amen+0.015
igator+0.015
igators+0.014
 teaches+0.014
icks+0.014
Neg logit contributions
arse-0.029
ized-0.028
ishment-0.028
 destination-0.027
ge-0.027
Token see
Token ID766
Feature activation-8.66e-03

Pos logit contributions
 Amen+0.026
 lived+0.025
 teaches+0.024
igator+0.024
igators+0.023
Neg logit contributions
arse-0.037
ized-0.037
ipping-0.035
oured-0.035
ge-0.034
Token what
Token ID644
Feature activation-4.26e-02

Pos logit contributions
arse+0.017
ge+0.017
 destination+0.016
ized+0.016
har+0.015
Neg logit contributions
 Amen-0.015
 teaches-0.014
 happens-0.014
 lived-0.014
igator-0.013
Token it
Token ID340
Feature activation-3.39e-02

Pos logit contributions
ized+0.061
 destination+0.060
ge+0.059
 completely+0.057
 sudden+0.057
Neg logit contributions
 Amen-0.090
icks-0.083
eer-0.077
 teaches-0.077
py-0.076
Token was
Token ID373
Feature activation-2.10e-02

Pos logit contributions
arse+0.075
ge+0.072
 destination+0.072
ized+0.071
har+0.069
Neg logit contributions
 Amen-0.049
 teaches-0.047
 happens-0.046
 lived-0.045
igator-0.045
Token.
Token ID13
Feature activation-7.71e-03

Pos logit contributions
arse+0.044
 destination+0.042
har+0.042
ge+0.042
ized+0.041
Neg logit contributions
 lived-0.032
 Amen-0.032
 teaches-0.032
 happens-0.031
igator-0.030
Token He
Token ID679
Feature activation-1.54e-03

Pos logit contributions
arse+0.017
 destination+0.017
ge+0.017
har+0.017
ized+0.016
Neg logit contributions
 Amen-0.010
 teaches-0.010
 happens-0.010
ile-0.010
igator-0.010
Token found
Token ID1043
Feature activation-1.75e-02

Pos logit contributions
arse+0.003
ge+0.003
 destination+0.003
ipping+0.003
ized+0.003
Neg logit contributions
 lived-0.003
 teaches-0.002
 happens-0.002
 Amen-0.002
igator-0.002
Token a
Token ID257
Feature activation-1.80e-02

Pos logit contributions
 happens+0.029
 teaches+0.029
 Amen+0.028
icks+0.027
igators+0.027
Neg logit contributions
arse-0.026
ized-0.025
 sudden-0.024
ge-0.024
 destination-0.024
Token cow
Token ID9875
Feature activation-7.16e-03

Pos logit contributions
arse+0.032
 destination+0.031
ge+0.031
ized+0.030
har+0.030
Neg logit contributions
 Amen-0.033
 teaches-0.032
 happens-0.032
 lived-0.032
igator-0.031
Token named
Token ID3706
Feature activation-2.09e-02

Pos logit contributions
arse+0.017
ized+0.016
ge+0.016
 destination+0.016
har+0.016
Neg logit contributions
 happens-0.009
 teaches-0.009
 vitamins-0.008
 Amen-0.008
igators-0.008
Token Daisy
Token ID40355
Feature activation-2.80e-02

Pos logit contributions
arse+0.043
 destination+0.040
ge+0.040
ized+0.039
har+0.039
Neg logit contributions
 teaches-0.033
 Amen-0.033
 happens-0.033
 lived-0.031
igator-0.031
Token.
Token ID13
Feature activation+2.08e-03

Pos logit contributions
ge+0.034
arse+0.029
ized+0.027
 destination+0.026
 completely+0.025
Neg logit contributions
 happens-0.070
 teaches-0.068
 vitamins-0.066
igators-0.065
 Amen-0.065
Token Daisy
Token ID40355
Feature activation-3.96e-02

Pos logit contributions
 happens+0.004
 teaches+0.004
 Amen+0.004
eer+0.004
igator+0.004
Neg logit contributions
arse-0.003
ized-0.003
 destination-0.003
ishment-0.003
ge-0.003
Token was
Token ID373
Feature activation-1.67e-03

Pos logit contributions
ized+0.076
har+0.070
 completely+0.069
 separate+0.069
ge+0.068
Neg logit contributions
 Amen-0.059
icks-0.058
igators-0.056
igator-0.055
 vitamins-0.055
Token friendly
Token ID8030
Feature activation-8.49e-03

Pos logit contributions
arse+0.003
 source+0.003
 breath+0.003
har+0.003
 completely+0.003
Neg logit contributions
 happens-0.003
 teaches-0.003
 Amen-0.003
igators-0.003
igator-0.003
Token and
Token ID290
Feature activation-1.34e-02

Pos logit contributions
 destination+0.013
arse+0.012
ized+0.012
ishment+0.012
 breath+0.012
Neg logit contributions
 Amen-0.017
 teaches-0.017
 happens-0.017
igator-0.017
igators-0.016
Token helpful
Token ID7613
Feature activation-9.41e-03

Pos logit contributions
 completely+0.022
ized+0.022
 forth+0.021
har+0.021
 separate+0.021
Neg logit contributions
ile-0.023
 happens-0.023
icks-0.022
igators-0.021
eer-0.020
Token.
Token ID13
Feature activation-5.15e-03

Pos logit contributions
arse+0.017
 destination+0.017
ge+0.016
ized+0.016
har+0.016
Neg logit contributions
 Amen-0.017
 teaches-0.016
 happens-0.016
igator-0.016
 lived-0.016
Token a
Token ID257
Feature activation-2.29e-02

Pos logit contributions
 Amen+0.003
 lived+0.003
 happens+0.003
 teaches+0.003
igator+0.003
Neg logit contributions
arse-0.005
ge-0.005
 destination-0.004
har-0.004
ized-0.004
Token band
Token ID4097
Feature activation-1.64e-02

Pos logit contributions
arse+0.041
ge+0.039
ized+0.039
ishment+0.038
 destination+0.038
Neg logit contributions
 Amen-0.043
 teaches-0.042
 happens-0.041
 lived-0.041
igator-0.041
Tokenage
Token ID496
Feature activation-9.70e-03

Pos logit contributions
ge+0.031
har+0.030
arse+0.030
ized+0.029
ooming+0.028
Neg logit contributions
 happens-0.028
 Amen-0.027
 teaches-0.027
igators-0.027
 vitamins-0.027
Token on
Token ID319
Feature activation-1.29e-02

Pos logit contributions
arse+0.017
 destination+0.017
ge+0.016
ized+0.016
har+0.016
Neg logit contributions
 Amen-0.018
 teaches-0.017
 lived-0.017
 happens-0.017
igators-0.017
Token her
Token ID607
Feature activation-1.02e-02

Pos logit contributions
arse+0.032
ge+0.030
 destination+0.030
har+0.029
ized+0.029
Neg logit contributions
 lived-0.016
 teaches-0.015
 Amen-0.015
 happens-0.015
icks-0.015
Token ankle
Token ID19847
Feature activation-3.66e-02

Pos logit contributions
arse+0.018
ge+0.017
ized+0.017
 destination+0.017
ishment+0.016
Neg logit contributions
 Amen-0.019
 teaches-0.019
 lived-0.019
 happens-0.018
eer-0.018
Token.
Token ID13
Feature activation-2.41e-03

Pos logit contributions
arse+0.070
ge+0.067
 destination+0.067
ized+0.065
har+0.065
Neg logit contributions
 Amen-0.063
 teaches-0.061
 lived-0.061
 happens-0.060
igator-0.059
Token He
Token ID679
Feature activation+6.14e-03

Pos logit contributions
 destination+0.005
ge+0.005
arse+0.005
har+0.005
ized+0.005
Neg logit contributions
 Amen-0.004
igator-0.004
ile-0.003
igators-0.003
 teaches-0.003
Token says
Token ID1139
Feature activation+5.35e-03

Pos logit contributions
 Amen+0.008
igator+0.007
ile+0.007
igators+0.007
 lived+0.007
Neg logit contributions
 destination-0.014
arse-0.014
ge-0.013
ized-0.013
oured-0.013
Token,
Token ID11
Feature activation+4.45e-03

Pos logit contributions
ile+0.012
igators+0.012
 Amen+0.012
igator+0.012
 teaches+0.012
Neg logit contributions
ipping-0.006
ge-0.006
ooming-0.006
 breath-0.006
 destination-0.006
Token "
Token ID366
Feature activation-9.02e-03

Pos logit contributions
igators+0.006
igator+0.006
ile+0.006
eer+0.006
 happens+0.006
Neg logit contributions
arse-0.010
ge-0.009
har-0.009
 destination-0.009
ized-0.009
Token fit
Token ID4197
Feature activation-2.50e-03

Pos logit contributions
arse+0.013
ge+0.012
 destination+0.012
ized+0.012
har+0.012
Neg logit contributions
 Amen-0.010
 teaches-0.010
 lived-0.010
 happens-0.009
igator-0.009
Token!
Token ID0
Feature activation-1.11e-03

Pos logit contributions
arse+0.005
 destination+0.005
ge+0.005
ized+0.005
har+0.005
Neg logit contributions
 lived-0.004
 teaches-0.004
 Amen-0.004
igator-0.004
 happens-0.004
Token
Token ID198
Feature activation-2.19e-03

Pos logit contributions
arse+0.002
 destination+0.002
har+0.002
ized+0.002
ge+0.002
Neg logit contributions
igator-0.002
ile-0.002
 Amen-0.002
igators-0.002
eer-0.002
Token
Token ID220
Feature activation-3.46e-02

Pos logit contributions
arse+0.005
 destination+0.004
ge+0.004
ized+0.004
har+0.004
Neg logit contributions
igator-0.003
 Amen-0.003
 happens-0.003
igators-0.003
 teaches-0.003
Token
Token ID198
Feature activation-3.98e-03

Pos logit contributions
arse+0.082
 destination+0.078
ge+0.078
ized+0.076
har+0.076
Neg logit contributions
 Amen-0.045
 teaches-0.043
 happens-0.042
 lived-0.042
igator-0.042
Token"
Token ID1
Feature activation-4.81e-02

Pos logit contributions
arse+0.008
 destination+0.007
ge+0.007
ized+0.007
har+0.007
Neg logit contributions
 Amen-0.007
 happens-0.007
 teaches-0.007
 lived-0.007
igator-0.006
TokenWhat
Token ID2061
Feature activation-1.96e-02

Pos logit contributions
arse+0.093
 destination+0.088
ge+0.088
ized+0.086
har+0.085
Neg logit contributions
 Amen-0.083
 teaches-0.081
 lived-0.081
 happens-0.080
igator-0.078
Token will
Token ID481
Feature activation-2.54e-02

Pos logit contributions
arse+0.035
 destination+0.033
aults+0.033
ge+0.032
 source+0.032
Neg logit contributions
 Amen-0.034
igator-0.030
eer-0.029
igators-0.029
py-0.028
Token you
Token ID345
Feature activation-7.81e-03

Pos logit contributions
arse+0.051
 destination+0.051
ge+0.050
ized+0.049
har+0.047
Neg logit contributions
 Amen-0.040
 teaches-0.038
icks-0.036
igators-0.036
 lived-0.036
Token do
Token ID466
Feature activation+1.13e-03

Pos logit contributions
arse+0.016
ge+0.015
 destination+0.015
ized+0.015
har+0.015
Neg logit contributions
 Amen-0.013
 teaches-0.013
 lived-0.013
 happens-0.012
igator-0.012
Token with
Token ID351
Feature activation+8.30e-03

Pos logit contributions
 Amen+0.002
 lived+0.002
igators+0.001
icks+0.001
eer+0.001
Neg logit contributions
 destination-0.002
ge-0.002
ized-0.002
arse-0.002
 breath-0.002