TOP ACTIVATIONS
MAX = 6.169

gradually spreading out of the coc oon . He was no
and patterns and even spun coc oons together . They liked
It was a tiny butterfly coc oon ! He felt so
down and enjoyed her popcorn coc oon . <|endoftext|> Once upon
palm trees with fruits and coc on uts . There were
tree and threw down some coc on uts for the sailor
ipped itself up in the coc oon . She waited patiently
the tree and found the coc oon . He went inside
, he emerged from the coc oon as a beautiful butterfly
coconut tree with lots of coc on uts . But the
, he spun a special coc oon and went to sleep
while . Soon , the tad p oles became frogs and
where he could find a coc oon . The butterfly told
the cater pillar in a coc oon so it could turn
that there was an available coc oon in the nearby tree
needed to go into a coc oon and then he would
to eat leaves and make coc oons . Maybe one day
and created the most beautiful coc oon . Finally
, it will make a coc oon and turn into a
twenty - eight , twenty - nine , thirty , thirty

MIN ACTIVATIONS
MIN = -5.631

. It 's okay . Acc idents happen . Just be
" It 's OK . Acc idents happen . I 'm
are proud of you . Acc idents happen . We can
's okay , Tom . Acc idents happen . I 'm
's okay , Lily . Acc idents happen . I 'm
's okay , Anna . Acc idents happen . I 'm
okay , sweet ie . Acc idents happen . Let 's
It was an accident . Acc idents happen . I 'm
't cry , boys . Acc idents happen . I have
. It is okay . Acc idents happen . Let me
's okay , Anna . Acc idents happen . You don
's okay , Lily . Acc idents happen . Just remember
's okay , Lily . Acc idents happen . Let 's
of the story is : Acc idents happen , but we
Lily were playing with their wagon in the yard . They
's okay , Lily . Acc idents happen . It 's
's okay , Fiona . Acc idents happen ." Then they
no time ." Enc our aged by the eagle
� � Its okay . Acc idents happen . â €
or chard and fill his wagon with juicy red apples .

INTERVAL 3.219 - 6.169
CONTAINS 0.100%

, a victim of the poison . Less on
A big pig , a fluffy sheep , a tiny mouse
One day , the furry body met a wise old
Do you have any hay I can use ?"
time . Suddenly , a loud voice bo omed in the

INTERVAL 0.269 - 3.219
CONTAINS 39.720%

day , she found a grape that was unknown to her
, he offered this idea . Some people thought the idea
her how she was feeling . " I 'm not feeling
The End <|endoftext|> Once upon a time there was a brave
free and alive all because he owned the oxygen . From

INTERVAL -2.681 - 0.269
CONTAINS 59.646%

, she noticed something in the distance . It was a
he walked . <|endoftext|> Once upon a time , there was
. They eat their sandwiches . They are happy .
gone . <|endoftext|> John wanted to play catch . He found
Cl are was so relieved and took off the bath robe

INTERVAL -5.631 - -2.681
CONTAINS 0.534%

he said . " Thank you , Mr .
I love helping you , Mom my ." Mitt ens me
comfort him . " I 'm sorry , you
L ily says , " Mom , you are the best
Lily said . " Or a message from a
Token gradually
Token ID11835
Feature activation+1.16e+00

Pos logit contributions
 coc+0.024
 sip+0.023
ocket+0.023
 pad+0.022
igsaw+0.022
Neg logit contributions
 ourselves-0.018
 neither-0.017
 best-0.016
 most-0.016
ate-0.016
Token spreading
Token ID14342
Feature activation+9.06e-01

Pos logit contributions
igsaw+0.028
ocket+0.028
oline+0.027
 tripod+0.027
roach+0.027
Neg logit contributions
 best-0.021
 neither-0.021
 ourselves-0.021
 most-0.021
 still-0.021
Token out
Token ID503
Feature activation+1.69e+00

Pos logit contributions
 coc+0.025
 sip+0.024
ocket+0.024
igsaw+0.024
 pad+0.024
Neg logit contributions
 ourselves-0.015
 neither-0.014
 best-0.014
 most-0.014
ate-0.013
Token of
Token ID286
Feature activation+1.79e-01

Pos logit contributions
 coc+0.045
 sip+0.044
ocket+0.043
igsaw+0.043
 pad+0.043
Neg logit contributions
 ourselves-0.030
 neither-0.029
 best-0.028
 most-0.028
ate-0.026
Token the
Token ID262
Feature activation-1.50e-02

Pos logit contributions
ocket+0.006
vel+0.006
 pad+0.006
oline+0.006
book+0.006
Neg logit contributions
 ourselves-0.002
 most-0.002
 best-0.002
 neither-0.002
ate-0.002
Token coc
Token ID8954
Feature activation+6.17e+00

Pos logit contributions
 most+0.000
 best+0.000
 ourselves+0.000
 fastest+0.000
 next+0.000
Neg logit contributions
run-0.000
book-0.000
 tripod-0.000
books-0.000
ocket-0.000
Tokenoon
Token ID2049
Feature activation+1.83e+00

Pos logit contributions
 sip+0.087
 coc+0.086
 bang+0.075
ocket+0.073
roach+0.073
Neg logit contributions
 ourselves-0.183
ate-0.180
iring-0.169
 best-0.169
 most-0.168
Token.
Token ID13
Feature activation+8.02e-01

Pos logit contributions
 coc+0.048
 sip+0.046
ocket+0.045
igsaw+0.045
 pad+0.044
Neg logit contributions
 ourselves-0.034
 neither-0.032
 best-0.031
 most-0.031
ate-0.030
Token He
Token ID679
Feature activation+1.39e-01

Pos logit contributions
 coc+0.020
 sip+0.020
ocket+0.019
igsaw+0.019
 pad+0.019
Neg logit contributions
 ourselves-0.016
 neither-0.015
 most-0.014
 best-0.014
ate-0.014
Token was
Token ID373
Feature activation+1.31e-01

Pos logit contributions
igsaw+0.004
andel+0.004
 sip+0.004
roach+0.004
book+0.004
Neg logit contributions
 most-0.002
 still-0.002
 ourselves-0.002
ate-0.002
 neither-0.002
Token no
Token ID645
Feature activation-1.38e+00

Pos logit contributions
 coc+0.003
 sip+0.003
ocket+0.003
 pad+0.003
igsaw+0.003
Neg logit contributions
 ourselves-0.003
 neither-0.002
 best-0.002
ate-0.002
 most-0.002
Token and
Token ID290
Feature activation-4.77e-01

Pos logit contributions
 coc+0.017
 sip+0.017
ocket+0.016
igsaw+0.016
 pad+0.016
Neg logit contributions
 ourselves-0.012
 neither-0.012
 best-0.011
 most-0.011
ate-0.011
Token patterns
Token ID7572
Feature activation+2.10e-01

Pos logit contributions
 ourselves+0.010
 neither+0.010
 most+0.010
 best+0.009
 when+0.009
Neg logit contributions
ocket-0.012
igsaw-0.011
andel-0.011
 coc-0.011
 sip-0.011
Token and
Token ID290
Feature activation+1.91e-01

Pos logit contributions
 coc+0.006
 sip+0.006
ocket+0.006
igsaw+0.006
 pad+0.006
Neg logit contributions
 ourselves-0.003
 neither-0.003
 best-0.003
 most-0.003
ate-0.003
Token even
Token ID772
Feature activation-1.64e-02

Pos logit contributions
ocket+0.005
igsaw+0.005
 sip+0.005
 coc+0.005
andel+0.004
Neg logit contributions
 ourselves-0.004
 most-0.004
 neither-0.004
 best-0.004
 when-0.003
Token spun
Token ID26843
Feature activation+1.51e-01

Pos logit contributions
 neither+0.000
 best+0.000
 most+0.000
 ourselves+0.000
ate+0.000
Neg logit contributions
 coc-0.000
ocket-0.000
igsaw-0.000
 sip-0.000
andel-0.000
Token coc
Token ID8954
Feature activation+6.14e+00

Pos logit contributions
ocket+0.004
 coc+0.003
 sip+0.003
igsaw+0.003
 pad+0.003
Neg logit contributions
 ourselves-0.003
 most-0.003
 best-0.003
 neither-0.003
ate-0.003
Tokenoons
Token ID13022
Feature activation-3.90e-01

Pos logit contributions
 coc+0.091
 sip+0.091
ocket+0.079
 bang+0.076
 pad+0.076
Neg logit contributions
 ourselves-0.179
ate-0.178
They-0.165
iring-0.164
 best-0.162
Token together
Token ID1978
Feature activation+8.46e-01

Pos logit contributions
 ourselves+0.007
 neither+0.006
 best+0.006
 most+0.006
ate+0.006
Neg logit contributions
 coc-0.011
 sip-0.011
ocket-0.010
igsaw-0.010
 pad-0.010
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
ocket+0.027
 coc+0.026
 sip+0.026
igsaw+0.026
roach+0.026
Neg logit contributions
 ourselves-0.012
 best-0.011
 neither-0.011
 most-0.010
 when-0.010
Token They
Token ID1119
Feature activation-2.88e-01

Pos logit contributions
 coc+0.004
ocket+0.004
 sip+0.004
igsaw+0.004
 pad+0.004
Neg logit contributions
 ourselves-0.002
 neither-0.002
 most-0.002
 best-0.002
ate-0.002
Token liked
Token ID8288
Feature activation+1.24e+00

Pos logit contributions
 neither+0.005
 still+0.005
 best+0.005
 most+0.005
 ourselves+0.005
Neg logit contributions
igsaw-0.008
run-0.008
andel-0.008
roach-0.007
ocket-0.007
Token It
Token ID632
Feature activation+8.54e-01

Pos logit contributions
 sip+0.051
 coc+0.051
 web+0.048
ocket+0.048
andel+0.046
Neg logit contributions
 most-0.019
 neither-0.017
ate-0.017
 ourselves-0.016
 best-0.015
Token was
Token ID373
Feature activation+9.84e-01

Pos logit contributions
 coc+0.025
igsaw+0.024
 sip+0.024
ocket+0.024
andel+0.024
Neg logit contributions
 ourselves-0.013
 most-0.013
 neither-0.012
 best-0.012
ate-0.012
Token a
Token ID257
Feature activation+1.34e+00

Pos logit contributions
 sip+0.030
igsaw+0.030
 coc+0.030
ocket+0.029
andel+0.029
Neg logit contributions
 most-0.013
 ourselves-0.013
 best-0.013
 neither-0.012
ate-0.011
Token tiny
Token ID7009
Feature activation+2.25e+00

Pos logit contributions
oline+0.034
run+0.034
andel+0.032
ocket+0.032
book+0.031
Neg logit contributions
 best-0.027
 most-0.027
 still-0.023
 ourselves-0.022
 fastest-0.021
Token butterfly
Token ID35113
Feature activation+1.31e+00

Pos logit contributions
ocket+0.050
book+0.050
oline+0.049
run+0.049
roach+0.049
Neg logit contributions
 best-0.046
 most-0.046
 neither-0.041
 ourselves-0.041
 still-0.040
Token coc
Token ID8954
Feature activation+6.14e+00

Pos logit contributions
 coc+0.033
 sip+0.033
igsaw+0.033
ocket+0.033
andel+0.032
Neg logit contributions
 ourselves-0.023
 best-0.022
 most-0.022
 neither-0.021
 when-0.020
Tokenoon
Token ID2049
Feature activation+1.87e+00

Pos logit contributions
 sip+0.095
 coc+0.092
 bang+0.085
roach+0.080
igsaw+0.079
Neg logit contributions
ate-0.170
 ourselves-0.169
iring-0.157
They-0.155
 most-0.154
Token!
Token ID0
Feature activation+1.08e+00

Pos logit contributions
 coc+0.047
 sip+0.046
ocket+0.045
igsaw+0.044
 pad+0.044
Neg logit contributions
 ourselves-0.036
 neither-0.034
 most-0.032
 best-0.032
ate-0.032
Token He
Token ID679
Feature activation+3.63e-01

Pos logit contributions
 coc+0.032
 sip+0.032
ocket+0.031
 web+0.031
 pad+0.030
Neg logit contributions
 ourselves-0.016
 neither-0.015
 most-0.014
 best-0.014
ate-0.013
Token felt
Token ID2936
Feature activation+9.35e-01

Pos logit contributions
igsaw+0.011
andel+0.010
roach+0.010
 sip+0.010
book+0.010
Neg logit contributions
 most-0.006
 still-0.006
ate-0.005
 neither-0.005
 best-0.005
Token so
Token ID523
Feature activation-1.22e+00

Pos logit contributions
 coc+0.024
 sip+0.023
ocket+0.023
 pad+0.022
igsaw+0.022
Neg logit contributions
 ourselves-0.018
 neither-0.017
ate-0.016
 best-0.016
 most-0.016
Token down
Token ID866
Feature activation+2.63e-01

Pos logit contributions
 ourselves+0.011
 neither+0.011
mm+0.010
Deal+0.010
 fastest+0.009
Neg logit contributions
 pad-0.010
 coc-0.010
ocket-0.010
run-0.009
 sip-0.009
Token and
Token ID290
Feature activation-1.78e-01

Pos logit contributions
 coc+0.006
 sip+0.006
ocket+0.006
 pad+0.006
igsaw+0.006
Neg logit contributions
 ourselves-0.006
 neither-0.005
ate-0.005
 best-0.005
Deal-0.005
Token enjoyed
Token ID8359
Feature activation+2.08e-01

Pos logit contributions
 ourselves+0.004
 neither+0.004
 most+0.003
 best+0.003
ate+0.003
Neg logit contributions
 coc-0.004
igsaw-0.004
 sip-0.004
ocket-0.004
 pad-0.004
Token her
Token ID607
Feature activation-2.74e-01

Pos logit contributions
 coc+0.006
 sip+0.006
ocket+0.006
 pad+0.006
igsaw+0.006
Neg logit contributions
 ourselves-0.003
 best-0.003
 neither-0.003
 most-0.003
ate-0.003
Token popcorn
Token ID38914
Feature activation-1.23e+00

Pos logit contributions
 ourselves+0.006
 best+0.006
 most+0.006
 neither+0.006
ate+0.006
Neg logit contributions
 coc-0.006
ocket-0.006
 pad-0.006
 sip-0.005
andel-0.005
Token coc
Token ID8954
Feature activation+6.13e+00

Pos logit contributions
 ourselves+0.023
 neither+0.022
 best+0.021
 most+0.021
ate+0.021
Neg logit contributions
 coc-0.031
 sip-0.031
ocket-0.030
igsaw-0.030
 pad-0.029
Tokenoon
Token ID2049
Feature activation+1.09e+00

Pos logit contributions
 coc+0.103
 sip+0.101
ocket+0.091
 pad+0.090
igsaw+0.086
Neg logit contributions
 ourselves-0.168
ate-0.163
 best-0.153
 most-0.151
iring-0.151
Token.
Token ID13
Feature activation+5.71e-01

Pos logit contributions
 coc+0.028
 sip+0.027
ocket+0.027
igsaw+0.026
 pad+0.026
Neg logit contributions
 ourselves-0.021
 neither-0.019
 best-0.019
 most-0.019
ate-0.018
Token<|endoftext|>
Token ID50256
Feature activation-6.52e-01

Pos logit contributions
 coc+0.016
 sip+0.015
ocket+0.015
igsaw+0.015
 pad+0.015
Neg logit contributions
 ourselves-0.010
 neither-0.009
 best-0.009
 most-0.009
ate-0.009
TokenOnce
Token ID7454
Feature activation-1.71e+00

Pos logit contributions
 ourselves+0.007
ate+0.006
 best+0.006
 most+0.006
 neither+0.005
Neg logit contributions
 coc-0.022
 sip-0.022
ocket-0.022
andel-0.021
igsaw-0.021
Token upon
Token ID2402
Feature activation-2.47e-01

Pos logit contributions
 ourselves+0.038
 neither+0.035
Sorry+0.035
ate+0.034
 most+0.034
Neg logit contributions
 coc-0.038
 sip-0.037
ocket-0.036
igsaw-0.036
 pad-0.036
Token palm
Token ID18057
Feature activation+2.46e+00

Pos logit contributions
 ourselves+0.021
 neither+0.019
ate+0.019
Sorry+0.019
 most+0.018
Neg logit contributions
 coc-0.026
 sip-0.025
ocket-0.024
 pad-0.024
 web-0.024
Token trees
Token ID7150
Feature activation+6.74e-01

Pos logit contributions
 coc+0.058
 sip+0.057
 pad+0.054
ocket+0.054
 web+0.054
Neg logit contributions
 ourselves-0.051
 neither-0.049
Sorry-0.046
ate-0.046
 best-0.045
Token with
Token ID351
Feature activation+7.99e-01

Pos logit contributions
 sip+0.015
 coc+0.015
 pad+0.014
 web+0.014
ocket+0.013
Neg logit contributions
 ourselves-0.015
Sorry-0.014
 neither-0.014
Deal-0.014
ate-0.014
Token fruits
Token ID15921
Feature activation+4.79e-01

Pos logit contributions
igsaw+0.022
ocket+0.021
 sip+0.021
 coc+0.020
book+0.020
Neg logit contributions
 ourselves-0.015
 best-0.014
 most-0.014
 neither-0.013
ate-0.012
Token and
Token ID290
Feature activation-3.50e-01

Pos logit contributions
 coc+0.012
 sip+0.012
ocket+0.011
 pad+0.011
igsaw+0.011
Neg logit contributions
 ourselves-0.009
 neither-0.009
ate-0.008
Sorry-0.008
 most-0.008
Token coc
Token ID8954
Feature activation+6.04e+00

Pos logit contributions
 ourselves+0.008
 most+0.007
 neither+0.007
 best+0.007
ate+0.007
Neg logit contributions
igsaw-0.008
 coc-0.008
ocket-0.008
 sip-0.008
 pad-0.007
Tokenon
Token ID261
Feature activation-1.50e-02

Pos logit contributions
 coc+0.120
 sip+0.117
igsaw+0.110
ocket+0.108
 pad+0.106
Neg logit contributions
 ourselves-0.150
ate-0.144
 best-0.133
 neither-0.132
iring-0.132
Tokenuts
Token ID5500
Feature activation-2.97e-01

Pos logit contributions
 ourselves+0.000
 neither+0.000
 best+0.000
ate+0.000
 most+0.000
Neg logit contributions
 coc-0.000
 sip-0.000
igsaw-0.000
roach-0.000
ocket-0.000
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
 ourselves+0.005
 neither+0.005
 best+0.005
 most+0.005
ate+0.004
Neg logit contributions
 coc-0.008
 sip-0.008
igsaw-0.008
ocket-0.008
andel-0.008
Token There
Token ID1318
Feature activation-1.14e+00

Pos logit contributions
 ourselves+0.023
 neither+0.022
 most+0.020
 best+0.020
ate+0.020
Neg logit contributions
 coc-0.029
 sip-0.029
ocket-0.028
 pad-0.028
igsaw-0.028
Token were
Token ID547
Feature activation-9.68e-01

Pos logit contributions
 ourselves+0.028
Sorry+0.026
 yourselves+0.026
ate+0.025
Deal+0.025
Neg logit contributions
 sip-0.023
 coc-0.022
ocket-0.021
book-0.020
 pad-0.020
Token tree
Token ID5509
Feature activation+1.16e+00

Pos logit contributions
 most+0.010
 best+0.010
 ourselves+0.010
 neither+0.010
 fastest+0.009
Neg logit contributions
book-0.012
run-0.012
ocket-0.012
 sip-0.012
vel-0.012
Token and
Token ID290
Feature activation+4.67e-01

Pos logit contributions
 coc+0.030
 sip+0.030
ocket+0.029
igsaw+0.029
 pad+0.028
Neg logit contributions
 ourselves-0.021
 neither-0.020
 best-0.019
 most-0.019
ate-0.019
Token threw
Token ID9617
Feature activation-5.76e-01

Pos logit contributions
 coc+0.010
 sip+0.010
ocket+0.010
 pad+0.010
igsaw+0.010
Neg logit contributions
 ourselves-0.011
 neither-0.010
 best-0.010
ate-0.010
 most-0.009
Token down
Token ID866
Feature activation+3.35e-01

Pos logit contributions
 ourselves+0.007
 most+0.007
 best+0.006
 neither+0.006
 when+0.005
Neg logit contributions
ocket-0.019
igsaw-0.018
book-0.018
 web-0.018
 coc-0.018
Token some
Token ID617
Feature activation+1.16e+00

Pos logit contributions
 coc+0.010
 sip+0.009
ocket+0.009
 pad+0.009
igsaw+0.009
Neg logit contributions
 ourselves-0.005
 neither-0.005
 best-0.005
 most-0.005
ate-0.005
Token coc
Token ID8954
Feature activation+5.93e+00

Pos logit contributions
 tripod+0.030
ocket+0.030
book+0.029
igsaw+0.028
books+0.028
Neg logit contributions
 best-0.023
 most-0.022
 ourselves-0.021
 fastest-0.019
 when-0.019
Tokenon
Token ID261
Feature activation+5.31e-01

Pos logit contributions
 coc+0.100
 sip+0.098
ocket+0.088
roach+0.088
igsaw+0.087
Neg logit contributions
 ourselves-0.161
ate-0.155
 neither-0.147
 best-0.146
iring-0.145
Tokenuts
Token ID5500
Feature activation-7.75e-01

Pos logit contributions
 coc+0.013
 sip+0.013
ocket+0.013
igsaw+0.013
 pad+0.012
Neg logit contributions
 ourselves-0.010
 neither-0.010
 best-0.009
 most-0.009
ate-0.009
Token for
Token ID329
Feature activation+1.15e+00

Pos logit contributions
 ourselves+0.016
 neither+0.015
 best+0.014
 most+0.014
ate+0.014
Neg logit contributions
 coc-0.019
 sip-0.019
ocket-0.018
igsaw-0.018
 web-0.017
Token the
Token ID262
Feature activation+5.52e-02

Pos logit contributions
 pad+0.038
 sip+0.038
oline+0.037
igsaw+0.037
ocket+0.037
Neg logit contributions
 ourselves-0.015
 neither-0.012
 best-0.012
 most-0.012
 yourselves-0.011
Token sailor
Token ID43272
Feature activation-8.15e-01

Pos logit contributions
 sip+0.001
run+0.001
igsaw+0.001
book+0.001
ocket+0.001
Neg logit contributions
 ourselves-0.001
 best-0.001
 most-0.001
 neither-0.001
 fastest-0.001
Tokenipped
Token ID3949
Feature activation+2.74e-02

Pos logit contributions
 coc+0.007
 sip+0.007
 pad+0.007
igsaw+0.007
ocket+0.007
Neg logit contributions
 ourselves-0.005
 neither-0.005
ate-0.004
 best-0.004
Sorry-0.004
Token itself
Token ID2346
Feature activation+2.35e+00

Pos logit contributions
 coc+0.001
ocket+0.001
 sip+0.001
igsaw+0.001
 pad+0.001
Neg logit contributions
 ourselves-0.000
 neither-0.000
 best-0.000
 most-0.000
ate-0.000
Token up
Token ID510
Feature activation+7.18e-01

Pos logit contributions
ocket+0.066
 coc+0.065
roach+0.064
 sip+0.064
igsaw+0.063
Neg logit contributions
 ourselves-0.038
 best-0.036
 neither-0.035
 most-0.035
ate-0.034
Token in
Token ID287
Feature activation+5.04e-01

Pos logit contributions
 coc+0.019
 sip+0.019
ocket+0.018
igsaw+0.018
 pad+0.018
Neg logit contributions
 ourselves-0.013
 neither-0.012
 best-0.012
 most-0.012
ate-0.012
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
ocket+0.019
roach+0.018
anna+0.018
oline+0.018
oke+0.017
Neg logit contributions
 ourselves-0.005
 most-0.004
 when-0.004
 best-0.004
 neither-0.004
Token coc
Token ID8954
Feature activation+5.92e+00

Pos logit contributions
 most+0.021
 best+0.020
 ourselves+0.017
 next+0.016
 still+0.016
Neg logit contributions
run-0.032
ocket-0.030
ocol-0.029
vel-0.029
book-0.028
Tokenoon
Token ID2049
Feature activation+1.60e+00

Pos logit contributions
 sip+0.080
 coc+0.078
 bang+0.068
roach+0.068
igsaw+0.068
Neg logit contributions
 ourselves-0.178
ate-0.177
iring-0.165
 best-0.165
 neither-0.164
Token.
Token ID13
Feature activation+8.80e-01

Pos logit contributions
 coc+0.044
 sip+0.043
ocket+0.042
igsaw+0.042
andel+0.041
Neg logit contributions
 ourselves-0.028
 neither-0.026
 best-0.025
 most-0.025
ate-0.024
Token She
Token ID1375
Feature activation+6.25e-01

Pos logit contributions
 coc+0.018
 sip+0.018
 pad+0.017
igsaw+0.017
 web+0.017
Neg logit contributions
 ourselves-0.021
 neither-0.020
 best-0.019
ate-0.019
 most-0.019
Token waited
Token ID13488
Feature activation+8.08e-01

Pos logit contributions
igsaw+0.019
ocket+0.017
andel+0.017
roach+0.017
 sip+0.017
Neg logit contributions
 neither-0.010
 most-0.010
 still-0.010
 best-0.009
Sorry-0.009
Token patiently
Token ID37220
Feature activation-3.91e-01

Pos logit contributions
 coc+0.023
igsaw+0.023
ocket+0.022
 sip+0.022
roach+0.021
Neg logit contributions
 ourselves-0.013
 best-0.012
 neither-0.012
 most-0.012
ate-0.011
Token the
Token ID262
Feature activation-4.11e-01

Pos logit contributions
 ourselves+0.010
ate+0.009
 most+0.008
 your+0.008
 not+0.008
Neg logit contributions
andel-0.020
roach-0.020
book-0.020
ocket-0.019
vel-0.019
Token tree
Token ID5509
Feature activation+6.27e-01

Pos logit contributions
 most+0.009
 best+0.009
 ourselves+0.009
 neither+0.008
ate+0.008
Neg logit contributions
run-0.010
book-0.009
 sip-0.009
igsaw-0.009
ocol-0.009
Token and
Token ID290
Feature activation+6.54e-01

Pos logit contributions
 coc+0.017
 sip+0.017
ocket+0.016
igsaw+0.016
roach+0.016
Neg logit contributions
 ourselves-0.011
 neither-0.010
 most-0.010
 best-0.010
ate-0.010
Token found
Token ID1043
Feature activation-1.96e-01

Pos logit contributions
 coc+0.015
igsaw+0.015
andel+0.015
ocket+0.015
 sip+0.015
Neg logit contributions
 ourselves-0.013
 neither-0.013
 most-0.012
ate-0.012
 best-0.012
Token the
Token ID262
Feature activation+6.28e-02

Pos logit contributions
 ourselves+0.003
 neither+0.003
Sorry+0.003
ate+0.003
 best+0.003
Neg logit contributions
 coc-0.006
 sip-0.005
ocket-0.005
 pad-0.005
igsaw-0.005
Token coc
Token ID8954
Feature activation+5.91e+00

Pos logit contributions
run+0.001
 sip+0.001
igsaw+0.001
book+0.001
roach+0.001
Neg logit contributions
 most-0.002
 best-0.002
 ourselves-0.002
 neither-0.002
 fastest-0.002
Tokenoon
Token ID2049
Feature activation+1.57e+00

Pos logit contributions
 sip+0.085
 coc+0.084
roach+0.074
 bang+0.074
igsaw+0.073
Neg logit contributions
 ourselves-0.173
ate-0.170
iring-0.160
 best-0.160
 most-0.159
Token.
Token ID13
Feature activation+7.63e-01

Pos logit contributions
 coc+0.041
 sip+0.040
ocket+0.039
igsaw+0.038
andel+0.038
Neg logit contributions
 ourselves-0.029
 neither-0.027
 best-0.026
 most-0.026
ate-0.026
Token He
Token ID679
Feature activation+1.46e-01

Pos logit contributions
 coc+0.019
igsaw+0.019
 pad+0.019
 sip+0.019
run+0.018
Neg logit contributions
 ourselves-0.015
 neither-0.014
 best-0.013
Sorry-0.013
ate-0.013
Token went
Token ID1816
Feature activation+4.39e-01

Pos logit contributions
igsaw+0.004
andel+0.004
 sip+0.004
 coc+0.004
ocket+0.004
Neg logit contributions
 most-0.002
 neither-0.002
ate-0.002
 ourselves-0.002
 best-0.002
Token inside
Token ID2641
Feature activation+2.75e-01

Pos logit contributions
 coc+0.012
 sip+0.012
ocket+0.012
igsaw+0.012
book+0.011
Neg logit contributions
 ourselves-0.007
 best-0.007
 most-0.007
 neither-0.006
ate-0.006
Token,
Token ID11
Feature activation+1.63e-01

Pos logit contributions
 ourselves+0.002
ate+0.001
 neither+0.001
Deal+0.001
 yourselves+0.001
Neg logit contributions
 sip-0.003
 coc-0.003
ocket-0.002
igsaw-0.002
book-0.002
Token he
Token ID339
Feature activation+1.80e+00

Pos logit contributions
 coc+0.004
 sip+0.004
 pad+0.004
igsaw+0.004
book+0.004
Neg logit contributions
 ourselves-0.003
 yourselves-0.003
ate-0.003
Deal-0.003
mm-0.003
Token emerged
Token ID9349
Feature activation+3.42e-01

Pos logit contributions
igsaw+0.049
andel+0.045
 sip+0.045
run+0.044
roach+0.044
Neg logit contributions
 still-0.032
ate-0.031
 best-0.031
 most-0.031
 neither-0.031
Token from
Token ID422
Feature activation-3.47e-01

Pos logit contributions
 coc+0.008
 web+0.007
ocket+0.007
 sip+0.007
andel+0.007
Neg logit contributions
Deal-0.008
 ourselves-0.008
Sorry-0.007
 neither-0.007
eday-0.007
Token the
Token ID262
Feature activation-1.61e-01

Pos logit contributions
 ourselves+0.006
 neither+0.006
 most+0.006
 best+0.005
ate+0.005
Neg logit contributions
 coc-0.009
 sip-0.009
ocket-0.009
 pad-0.009
igsaw-0.009
Token coc
Token ID8954
Feature activation+5.91e+00

Pos logit contributions
 most+0.005
 best+0.005
 ourselves+0.005
 still+0.005
 next+0.005
Neg logit contributions
run-0.002
book-0.002
ocol-0.002
igsaw-0.002
roach-0.002
Tokenoon
Token ID2049
Feature activation+1.80e+00

Pos logit contributions
 sip+0.084
 coc+0.081
roach+0.073
ocket+0.071
igsaw+0.071
Neg logit contributions
 ourselves-0.175
ate-0.170
iring-0.162
 best-0.161
 neither-0.160
Token as
Token ID355
Feature activation-9.68e-01

Pos logit contributions
 coc+0.049
 sip+0.048
ocket+0.047
igsaw+0.046
andel+0.045
Neg logit contributions
 ourselves-0.031
 neither-0.029
 best-0.028
 most-0.028
ate-0.027
Token a
Token ID257
Feature activation+1.85e+00

Pos logit contributions
 ourselves+0.018
Sorry+0.016
 neither+0.015
 yourselves+0.015
Deal+0.015
Neg logit contributions
 coc-0.025
ocket-0.024
book-0.024
 sip-0.024
run-0.023
Token beautiful
Token ID4950
Feature activation+2.80e+00

Pos logit contributions
oline+0.051
vet+0.047
run+0.046
ez+0.045
roach+0.045
Neg logit contributions
 best-0.035
 most-0.034
Sorry-0.032
But-0.032
 next-0.032
Token butterfly
Token ID35113
Feature activation+1.96e+00

Pos logit contributions
run+0.068
 microsc+0.061
ocket+0.061
 tripod+0.061
book+0.060
Neg logit contributions
 most-0.059
 when-0.058
 best-0.055
bye-0.054
 next-0.052
Token coconut
Token ID20132
Feature activation+7.80e-01

Pos logit contributions
run+0.039
ocket+0.037
igsaw+0.037
book+0.037
 sip+0.036
Neg logit contributions
 best-0.025
 ourselves-0.025
 most-0.025
 neither-0.024
ate-0.022
Token tree
Token ID5509
Feature activation+1.03e+00

Pos logit contributions
igsaw+0.020
 sip+0.020
 coc+0.020
ocket+0.019
run+0.019
Neg logit contributions
 ourselves-0.014
 best-0.013
 neither-0.013
 most-0.013
iring-0.012
Token with
Token ID351
Feature activation+1.07e+00

Pos logit contributions
 coc+0.027
 sip+0.026
ocket+0.025
 pad+0.025
 web+0.025
Neg logit contributions
 ourselves-0.019
 neither-0.018
 best-0.017
 most-0.017
ate-0.017
Token lots
Token ID6041
Feature activation+2.82e+00

Pos logit contributions
 coc+0.029
 sip+0.028
ocket+0.028
 pad+0.027
igsaw+0.027
Neg logit contributions
 ourselves-0.019
 neither-0.018
 best-0.017
 most-0.017
ate-0.017
Token of
Token ID286
Feature activation+4.88e-01

Pos logit contributions
igsaw+0.066
 sip+0.066
ocket+0.065
roach+0.064
 tripod+0.064
Neg logit contributions
iring-0.052
 ourselves-0.052
 best-0.051
 most-0.050
ate-0.048
Token coc
Token ID8954
Feature activation+5.87e+00

Pos logit contributions
igsaw+0.012
 sip+0.011
ocket+0.011
 coc+0.011
run+0.011
Neg logit contributions
 ourselves-0.010
 most-0.010
 best-0.010
 neither-0.009
ate-0.009
Tokenon
Token ID261
Feature activation+2.48e-01

Pos logit contributions
 sip+0.110
 coc+0.106
 bang+0.096
roach+0.095
igsaw+0.095
Neg logit contributions
ate-0.146
 ourselves-0.145
iring-0.134
They-0.132
 best-0.131
Tokenuts
Token ID5500
Feature activation-6.32e-01

Pos logit contributions
 sip+0.006
 coc+0.006
igsaw+0.006
ocket+0.006
andel+0.006
Neg logit contributions
 ourselves-0.005
 neither-0.004
 best-0.004
 most-0.004
ate-0.004
Token.
Token ID13
Feature activation+9.56e-01

Pos logit contributions
 ourselves+0.012
 neither+0.011
 best+0.011
 most+0.011
ate+0.011
Neg logit contributions
 coc-0.016
 sip-0.016
igsaw-0.015
ocket-0.015
 web-0.015
Token But
Token ID887
Feature activation-1.54e-01

Pos logit contributions
 coc+0.030
 sip+0.030
ocket+0.030
igsaw+0.029
 web+0.029
Neg logit contributions
 ourselves-0.012
 neither-0.011
 most-0.011
 best-0.010
ate-0.010
Token the
Token ID262
Feature activation-9.10e-01

Pos logit contributions
 ourselves+0.003
ate+0.003
Sorry+0.003
Deal+0.003
 yourselves+0.002
Neg logit contributions
 coc-0.004
 pad-0.004
 sip-0.004
ocket-0.004
 gob-0.004
Token,
Token ID11
Feature activation+7.24e-01

Pos logit contributions
 ourselves+0.012
Sorry+0.011
 yourselves+0.011
Deal+0.010
ate+0.010
Neg logit contributions
 coc-0.014
 sip-0.014
 pad-0.013
ocket-0.013
book-0.013
Token he
Token ID339
Feature activation+1.03e+00

Pos logit contributions
 coc+0.023
 sip+0.022
ocket+0.022
igsaw+0.022
 pad+0.022
Neg logit contributions
 ourselves-0.009
 neither-0.008
ate-0.008
Sorry-0.008
Deal-0.007
Token spun
Token ID26843
Feature activation+7.28e-01

Pos logit contributions
 coc+0.025
igsaw+0.025
 sip+0.024
ocket+0.023
andel+0.023
Neg logit contributions
 ourselves-0.020
 neither-0.020
 most-0.019
 best-0.019
ate-0.019
Token a
Token ID257
Feature activation+1.00e+00

Pos logit contributions
 coc+0.018
 sip+0.018
ocket+0.017
igsaw+0.017
 pad+0.017
Neg logit contributions
 ourselves-0.014
 neither-0.013
 most-0.013
 best-0.013
ate-0.013
Token special
Token ID2041
Feature activation+1.90e+00

Pos logit contributions
ocket+0.029
run+0.028
book+0.027
roach+0.027
ocol+0.027
Neg logit contributions
 most-0.017
 best-0.017
 fastest-0.015
 still-0.015
 ourselves-0.015
Token coc
Token ID8954
Feature activation+5.83e+00

Pos logit contributions
ocket+0.045
run+0.044
ocol+0.044
lee+0.043
 microsc+0.042
Neg logit contributions
 most-0.041
 best-0.040
 when-0.035
 ourselves-0.034
 next-0.033
Tokenoon
Token ID2049
Feature activation+1.94e+00

Pos logit contributions
 coc+0.079
 sip+0.079
ocket+0.068
igsaw+0.067
roach+0.067
Neg logit contributions
 ourselves-0.175
ate-0.172
 best-0.163
iring-0.163
 most-0.163
Token and
Token ID290
Feature activation+3.85e-01

Pos logit contributions
 coc+0.046
 sip+0.045
ocket+0.044
 pad+0.043
igsaw+0.043
Neg logit contributions
 ourselves-0.040
 neither-0.038
 best-0.036
ate-0.036
 most-0.036
Token went
Token ID1816
Feature activation+1.14e-01

Pos logit contributions
 coc+0.009
 sip+0.009
igsaw+0.009
ocket+0.009
andel+0.008
Neg logit contributions
 ourselves-0.008
 neither-0.008
 most-0.007
 best-0.007
ate-0.007
Token to
Token ID284
Feature activation-9.43e-01

Pos logit contributions
 coc+0.003
 sip+0.003
ocket+0.003
igsaw+0.003
 pad+0.003
Neg logit contributions
 ourselves-0.002
 best-0.002
 most-0.002
 neither-0.002
ate-0.002
Token sleep
Token ID3993
Feature activation+7.01e-01

Pos logit contributions
 ourselves+0.019
 most+0.017
 best+0.017
ate+0.016
 yourself+0.016
Neg logit contributions
book-0.025
andel-0.024
roach-0.024
ocket-0.024
run-0.024
Token while
Token ID981
Feature activation+7.65e-01

Pos logit contributions
 coc+0.019
ocket+0.019
 sip+0.019
igsaw+0.019
book+0.019
Neg logit contributions
 ourselves-0.015
 neither-0.014
 best-0.014
 most-0.014
Sorry-0.013
Token.
Token ID13
Feature activation+8.44e-01

Pos logit contributions
 coc+0.021
 sip+0.021
igsaw+0.020
ocket+0.020
 pad+0.020
Neg logit contributions
 ourselves-0.013
 neither-0.012
 best-0.012
 most-0.012
ate-0.011
Token Soon
Token ID15894
Feature activation+3.26e-01

Pos logit contributions
 coc+0.022
 sip+0.022
ocket+0.021
igsaw+0.021
 pad+0.021
Neg logit contributions
 ourselves-0.015
 neither-0.014
 most-0.014
 best-0.014
ate-0.014
Token,
Token ID11
Feature activation+7.02e-01

Pos logit contributions
 coc+0.009
 sip+0.009
ocket+0.008
 pad+0.008
 web+0.008
Neg logit contributions
 ourselves-0.005
ate-0.005
Sorry-0.005
Deal-0.005
 yourselves-0.005
Token the
Token ID262
Feature activation+1.94e-01

Pos logit contributions
 coc+0.020
 sip+0.019
ocket+0.018
 web+0.018
igsaw+0.018
Neg logit contributions
 ourselves-0.012
ate-0.011
 neither-0.010
Sorry-0.010
Deal-0.010
Token tad
Token ID36264
Feature activation+5.80e+00

Pos logit contributions
run+0.005
book+0.005
ocket+0.005
 pad+0.005
igsaw+0.005
Neg logit contributions
 most-0.004
 best-0.004
 ourselves-0.004
 neither-0.004
 fastest-0.003
Tokenp
Token ID79
Feature activation+4.00e+00

Pos logit contributions
igsaw+0.130
 sip+0.129
 coc+0.128
ocket+0.125
book+0.118
Neg logit contributions
 ourselves-0.130
 most-0.125
 best-0.122
 neither-0.121
iring-0.117
Tokenoles
Token ID4316
Feature activation+8.54e-01

Pos logit contributions
 coc+0.092
 sip+0.089
igsaw+0.089
ocket+0.088
 pad+0.084
Neg logit contributions
 ourselves-0.084
 neither-0.082
 most-0.080
 best-0.079
ate-0.078
Token became
Token ID2627
Feature activation-1.31e+00

Pos logit contributions
igsaw+0.022
 coc+0.021
book+0.020
ocket+0.020
andel+0.020
Neg logit contributions
 neither-0.016
 ourselves-0.016
 best-0.016
 still-0.015
 most-0.015
Token frogs
Token ID37475
Feature activation-5.58e-01

Pos logit contributions
 ourselves+0.025
mm+0.024
Sorry+0.023
Deal+0.023
ate+0.022
Neg logit contributions
 coc-0.034
 sip-0.032
 web-0.031
 pad-0.030
run-0.029
Token and
Token ID290
Feature activation+4.18e-01

Pos logit contributions
 ourselves+0.010
 neither+0.010
 best+0.009
 most+0.009
ate+0.009
Neg logit contributions
 coc-0.015
 sip-0.014
ocket-0.014
igsaw-0.014
 pad-0.013
Token where
Token ID810
Feature activation+1.22e+00

Pos logit contributions
 coc+0.033
 sip+0.033
 pad+0.031
ocket+0.031
igsaw+0.031
Neg logit contributions
 ourselves-0.025
 neither-0.023
Sorry-0.023
ate-0.022
 most-0.022
Token he
Token ID339
Feature activation+1.94e+00

Pos logit contributions
 coc+0.040
 sip+0.039
ocket+0.038
igsaw+0.038
 pad+0.037
Neg logit contributions
 ourselves-0.015
 neither-0.013
ate-0.012
 best-0.012
 most-0.012
Token could
Token ID714
Feature activation+6.07e-01

Pos logit contributions
 coc+0.046
 sip+0.045
igsaw+0.044
ocket+0.043
andel+0.042
Neg logit contributions
 ourselves-0.040
 neither-0.039
 best-0.037
 most-0.037
ate-0.037
Token find
Token ID1064
Feature activation+4.38e-01

Pos logit contributions
igsaw+0.015
 coc+0.015
 pad+0.015
ocket+0.015
book+0.015
Neg logit contributions
 ourselves-0.012
 neither-0.011
ate-0.011
 best-0.010
 most-0.010
Token a
Token ID257
Feature activation+8.99e-01

Pos logit contributions
 coc+0.012
 sip+0.012
ocket+0.011
igsaw+0.011
 pad+0.011
Neg logit contributions
 ourselves-0.008
 neither-0.007
 best-0.007
 most-0.007
ate-0.007
Token coc
Token ID8954
Feature activation+5.79e+00

Pos logit contributions
ocol+0.022
run+0.022
roach+0.022
oline+0.021
book+0.021
Neg logit contributions
 best-0.020
 most-0.019
 ourselves-0.017
 still-0.016
 neither-0.016
Tokenoon
Token ID2049
Feature activation+1.67e+00

Pos logit contributions
 sip+0.085
 coc+0.085
roach+0.073
 bang+0.073
 pad+0.072
Neg logit contributions
 ourselves-0.167
ate-0.164
 best-0.156
 most-0.155
iring-0.154
Token.
Token ID13
Feature activation+9.37e-01

Pos logit contributions
 coc+0.044
 sip+0.043
ocket+0.042
igsaw+0.042
andel+0.041
Neg logit contributions
 ourselves-0.030
 neither-0.028
 best-0.028
 most-0.027
ate-0.026
Token The
Token ID383
Feature activation-5.38e-01

Pos logit contributions
 pad+0.017
 coc+0.017
igsaw+0.017
run+0.017
 sip+0.016
Neg logit contributions
 ourselves-0.024
 yourselves-0.022
 best-0.022
 fastest-0.021
 neither-0.021
Token butterfly
Token ID35113
Feature activation+1.47e+00

Pos logit contributions
 best+0.013
 most+0.013
 ourselves+0.012
 neither+0.012
Sorry+0.011
Neg logit contributions
run-0.012
 sip-0.011
book-0.011
igsaw-0.011
roach-0.011
Token told
Token ID1297
Feature activation+9.47e-01

Pos logit contributions
igsaw+0.038
 coc+0.038
 sip+0.038
ocket+0.037
andel+0.037
Neg logit contributions
 ourselves-0.025
 neither-0.025
 most-0.024
 best-0.024
ate-0.023
Token the
Token ID262
Feature activation+4.79e-01

Pos logit contributions
 ourselves+0.005
 neither+0.005
Sorry+0.005
Deal+0.005
 best+0.005
Neg logit contributions
 coc-0.008
 sip-0.008
ocket-0.007
 web-0.007
run-0.007
Token cater
Token ID20825
Feature activation+4.48e+00

Pos logit contributions
run+0.010
ulp+0.010
igsaw+0.010
 tripod+0.010
book+0.010
Neg logit contributions
 most-0.013
 best-0.012
 next-0.010
 biggest-0.010
 fastest-0.010
Tokenpillar
Token ID41643
Feature activation+1.11e+00

Pos logit contributions
 tripod+0.074
ulp+0.074
 sip+0.072
 fingertips+0.070
 bang+0.068
Neg logit contributions
 most-0.108
 fastest-0.105
ite-0.102
 when-0.101
iring-0.101
Token in
Token ID287
Feature activation+6.52e-01

Pos logit contributions
 coc+0.030
 sip+0.029
ocket+0.029
igsaw+0.028
andel+0.028
Neg logit contributions
 ourselves-0.019
 most-0.018
 neither-0.018
 best-0.018
ate-0.017
Token a
Token ID257
Feature activation+1.12e+00

Pos logit contributions
 sip+0.020
 coc+0.020
igsaw+0.019
ocket+0.019
oline+0.019
Neg logit contributions
 ourselves-0.009
 most-0.009
 best-0.008
 neither-0.008
ate-0.008
Token coc
Token ID8954
Feature activation+5.78e+00

Pos logit contributions
oline+0.033
roach+0.032
ocket+0.031
andel+0.030
run+0.030
Neg logit contributions
 best-0.020
 most-0.019
 still-0.017
 biggest-0.015
 less-0.014
Tokenoon
Token ID2049
Feature activation+1.62e+00

Pos logit contributions
 sip+0.084
 coc+0.084
 bang+0.075
roach+0.074
andel+0.072
Neg logit contributions
 ourselves-0.166
ate-0.166
iring-0.154
 best-0.153
 neither-0.152
Token so
Token ID523
Feature activation-9.07e-01

Pos logit contributions
 coc+0.043
 sip+0.042
ocket+0.041
igsaw+0.040
 pad+0.040
Neg logit contributions
 ourselves-0.029
 neither-0.027
 best-0.026
 most-0.026
ate-0.026
Token it
Token ID340
Feature activation+1.41e+00

Pos logit contributions
 ourselves+0.015
 neither+0.014
ate+0.014
 best+0.013
 most+0.013
Neg logit contributions
 coc-0.025
 sip-0.025
ocket-0.024
 web-0.023
igsaw-0.023
Token could
Token ID714
Feature activation+1.07e+00

Pos logit contributions
 coc+0.032
 sip+0.032
ocket+0.030
 web+0.030
 pad+0.030
Neg logit contributions
 ourselves-0.031
 neither-0.029
ate-0.028
 best-0.028
Sorry-0.027
Token turn
Token ID1210
Feature activation+9.69e-01

Pos logit contributions
 coc+0.024
 sip+0.024
ocket+0.023
igsaw+0.023
 pad+0.023
Neg logit contributions
 ourselves-0.023
 neither-0.022
 best-0.021
 most-0.021
ate-0.021
Token that
Token ID326
Feature activation-6.82e-01

Pos logit contributions
 coc+0.026
 sip+0.026
ocket+0.025
 pad+0.025
igsaw+0.025
Neg logit contributions
 ourselves-0.018
 neither-0.016
Sorry-0.016
ate-0.016
 most-0.016
Token there
Token ID612
Feature activation-1.99e-01

Pos logit contributions
 ourselves+0.009
 neither+0.009
 best+0.008
 most+0.008
ate+0.008
Neg logit contributions
 coc-0.021
 sip-0.021
ocket-0.020
igsaw-0.020
 pad-0.020
Token was
Token ID373
Feature activation+5.80e-01

Pos logit contributions
 ourselves+0.004
 neither+0.004
Sorry+0.004
ate+0.004
 most+0.004
Neg logit contributions
 coc-0.005
 sip-0.005
ocket-0.004
 pad-0.004
 web-0.004
Token an
Token ID281
Feature activation+2.14e+00

Pos logit contributions
 coc+0.017
 sip+0.017
ocket+0.016
 pad+0.016
igsaw+0.016
Neg logit contributions
 ourselves-0.009
 neither-0.008
ate-0.008
 most-0.007
Sorry-0.007
Token available
Token ID1695
Feature activation+5.89e-01

Pos logit contributions
run+0.059
 sip+0.059
book+0.055
igsaw+0.055
roach+0.054
Neg logit contributions
 best-0.038
 most-0.038
 ourselves-0.036
ate-0.035
 neither-0.034
Token coc
Token ID8954
Feature activation+5.75e+00

Pos logit contributions
run+0.011
igsaw+0.010
ocol+0.010
roach+0.010
book+0.010
Neg logit contributions
 best-0.016
 most-0.016
 ourselves-0.015
 next-0.015
ate-0.014
Tokenoon
Token ID2049
Feature activation+1.73e+00

Pos logit contributions
 sip+0.082
 coc+0.081
 bang+0.068
roach+0.068
 gob+0.067
Neg logit contributions
 ourselves-0.170
ate-0.169
iring-0.159
 most-0.159
 best-0.158
Token in
Token ID287
Feature activation+1.15e+00

Pos logit contributions
 coc+0.042
 sip+0.041
ocket+0.040
igsaw+0.040
 pad+0.039
Neg logit contributions
 ourselves-0.035
 neither-0.033
 best-0.031
 most-0.031
ate-0.031
Token the
Token ID262
Feature activation-7.83e-01

Pos logit contributions
 sip+0.036
ocket+0.035
 coc+0.035
 pad+0.035
igsaw+0.034
Neg logit contributions
 ourselves-0.017
 neither-0.014
 most-0.014
ate-0.014
 best-0.013
Token nearby
Token ID6716
Feature activation+1.22e+00

Pos logit contributions
 most+0.019
 best+0.018
 ourselves+0.017
 fastest+0.016
 yourself+0.015
Neg logit contributions
run-0.020
igsaw-0.018
 sip-0.018
ocol-0.017
book-0.017
Token tree
Token ID5509
Feature activation+1.15e+00

Pos logit contributions
igsaw+0.031
 sip+0.030
run+0.030
andel+0.030
roach+0.029
Neg logit contributions
 most-0.023
 ourselves-0.023
 best-0.023
 when-0.021
 yourself-0.020
Token needed
Token ID2622
Feature activation+1.46e+00

Pos logit contributions
 coc+0.014
 sip+0.013
ocket+0.013
 pad+0.013
igsaw+0.012
Neg logit contributions
 ourselves-0.015
 neither-0.014
 best-0.014
ate-0.013
 most-0.013
Token to
Token ID284
Feature activation-2.35e-01

Pos logit contributions
igsaw+0.041
 sip+0.040
 pad+0.040
andel+0.039
ocket+0.039
Neg logit contributions
 ourselves-0.024
 most-0.022
 best-0.022
 neither-0.021
ate-0.020
Token go
Token ID467
Feature activation+1.54e-01

Pos logit contributions
 ourselves+0.005
 neither+0.005
 best+0.005
 most+0.005
ate+0.005
Neg logit contributions
 coc-0.005
 sip-0.005
ocket-0.005
igsaw-0.005
 pad-0.005
Token into
Token ID656
Feature activation+2.43e-01

Pos logit contributions
 coc+0.003
 sip+0.003
 pad+0.003
run+0.003
ocket+0.003
Neg logit contributions
 neither-0.004
Sorry-0.004
 ourselves-0.004
Deal-0.004
mm-0.004
Token a
Token ID257
Feature activation+6.81e-01

Pos logit contributions
 sip+0.008
 pad+0.008
ocket+0.008
roach+0.007
book+0.007
Neg logit contributions
 ourselves-0.003
 most-0.003
 best-0.003
 neither-0.003
 yourselves-0.003
Token coc
Token ID8954
Feature activation+5.75e+00

Pos logit contributions
ocket+0.019
run+0.018
roach+0.018
book+0.018
oline+0.017
Neg logit contributions
 best-0.012
 most-0.011
 ourselves-0.011
 neither-0.010
 fastest-0.010
Tokenoon
Token ID2049
Feature activation+1.65e+00

Pos logit contributions
 sip+0.073
 coc+0.073
ocket+0.063
igsaw+0.062
roach+0.062
Neg logit contributions
 ourselves-0.178
ate-0.174
 neither-0.167
 best-0.167
 most-0.166
Token and
Token ID290
Feature activation-2.57e-01

Pos logit contributions
 coc+0.040
 sip+0.040
 pad+0.039
ocket+0.038
 web+0.038
Neg logit contributions
 ourselves-0.032
 neither-0.031
Sorry-0.030
ate-0.029
 best-0.029
Token then
Token ID788
Feature activation-1.33e+00

Pos logit contributions
 ourselves+0.005
 neither+0.005
 best+0.005
 most+0.005
ate+0.005
Neg logit contributions
 coc-0.006
 sip-0.006
ocket-0.006
 pad-0.006
igsaw-0.006
Token he
Token ID339
Feature activation+5.05e-01

Pos logit contributions
 ourselves+0.023
 neither+0.021
ate+0.020
 most+0.020
 best+0.020
Neg logit contributions
 coc-0.036
 sip-0.036
ocket-0.035
 pad-0.034
igsaw-0.034
Token would
Token ID561
Feature activation+7.99e-02

Pos logit contributions
 coc+0.011
 sip+0.011
ocket+0.011
 pad+0.011
 web+0.011
Neg logit contributions
 ourselves-0.011
 neither-0.010
ate-0.010
 most-0.010
 best-0.010
Token to
Token ID284
Feature activation-6.07e-01

Pos logit contributions
igsaw+0.043
 sip+0.041
 coc+0.041
ocket+0.041
 pad+0.041
Neg logit contributions
 ourselves-0.028
 neither-0.025
 most-0.025
 best-0.025
ate-0.024
Token eat
Token ID4483
Feature activation+1.41e+00

Pos logit contributions
 ourselves+0.013
 neither+0.012
ate+0.011
 best+0.011
 most+0.011
Neg logit contributions
igsaw-0.014
ocket-0.014
 coc-0.014
 pad-0.014
run-0.014
Token leaves
Token ID5667
Feature activation+1.19e+00

Pos logit contributions
igsaw+0.041
 coc+0.041
 sip+0.040
ocket+0.040
run+0.039
Neg logit contributions
 ourselves-0.023
 neither-0.019
 best-0.019
 most-0.019
ate-0.018
Token and
Token ID290
Feature activation-6.17e-01

Pos logit contributions
 coc+0.030
 sip+0.029
 pad+0.028
ocket+0.028
 web+0.027
Neg logit contributions
 ourselves-0.022
 neither-0.021
Sorry-0.021
ate-0.021
 best-0.020
Token make
Token ID787
Feature activation+8.43e-01

Pos logit contributions
 ourselves+0.014
 neither+0.013
 most+0.013
 best+0.013
 yourselves+0.012
Neg logit contributions
igsaw-0.014
andel-0.013
ocket-0.013
book-0.013
 coc-0.013
Token coc
Token ID8954
Feature activation+5.67e+00

Pos logit contributions
 sip+0.023
 coc+0.023
igsaw+0.023
ocket+0.023
 pad+0.023
Neg logit contributions
 ourselves-0.014
 neither-0.013
 best-0.013
 most-0.012
ate-0.012
Tokenoons
Token ID13022
Feature activation-4.40e-01

Pos logit contributions
 coc+0.074
 sip+0.072
ocket+0.066
 pad+0.065
igsaw+0.065
Neg logit contributions
 ourselves-0.178
ate-0.171
 neither-0.164
 best-0.163
 most-0.163
Token.
Token ID13
Feature activation-5.88e-01

Pos logit contributions
 ourselves+0.009
 neither+0.008
Sorry+0.008
ate+0.008
 most+0.008
Neg logit contributions
 coc-0.010
 sip-0.010
ocket-0.010
 pad-0.010
igsaw-0.010
Token Maybe
Token ID6674
Feature activation-2.21e-01

Pos logit contributions
 ourselves+0.012
 neither+0.011
 best+0.011
ate+0.011
 most+0.011
Neg logit contributions
 coc-0.014
 sip-0.014
 pad-0.013
ocket-0.013
run-0.013
Token one
Token ID530
Feature activation-8.97e-01

Pos logit contributions
 ourselves+0.003
Deal+0.003
 neither+0.003
Sorry+0.003
ate+0.003
Neg logit contributions
 coc-0.006
 sip-0.006
ocket-0.006
 pad-0.006
 web-0.005
Token day
Token ID1110
Feature activation-5.10e-01

Pos logit contributions
 best+0.011
 when+0.011
 most+0.011
 neither+0.011
 ourselves+0.010
Neg logit contributions
igsaw-0.031
lee-0.030
andel-0.030
vel-0.028
roach-0.027
Token and
Token ID290
Feature activation+3.64e-01

Pos logit contributions
 coc+0.015
 sip+0.015
ocket+0.015
igsaw+0.014
 pad+0.014
Neg logit contributions
 ourselves-0.010
 neither-0.009
 best-0.009
 most-0.009
ate-0.009
Token created
Token ID2727
Feature activation-2.58e-02

Pos logit contributions
 coc+0.009
 sip+0.008
igsaw+0.008
ocket+0.008
 pad+0.008
Neg logit contributions
 ourselves-0.008
 neither-0.007
 most-0.007
 best-0.007
ate-0.007
Token the
Token ID262
Feature activation-2.15e-01

Pos logit contributions
 ourselves+0.000
 neither+0.000
 best+0.000
 most+0.000
ate+0.000
Neg logit contributions
 coc-0.001
 sip-0.001
ocket-0.001
igsaw-0.001
 pad-0.001
Token most
Token ID749
Feature activation+1.03e+00

Pos logit contributions
 most+0.003
 best+0.003
 ourselves+0.003
ate+0.003
 neither+0.003
Neg logit contributions
run-0.006
 sip-0.006
ocket-0.006
igsaw-0.006
 coc-0.006
Token beautiful
Token ID4950
Feature activation+2.38e+00

Pos logit contributions
 sip+0.026
ocket+0.025
 coc+0.025
igsaw+0.024
 pad+0.024
Neg logit contributions
 ourselves-0.020
 best-0.019
 most-0.019
 neither-0.018
ate-0.018
Token coc
Token ID8954
Feature activation+5.64e+00

Pos logit contributions
run+0.056
ocket+0.055
 sip+0.053
book+0.051
igsaw+0.051
Neg logit contributions
 most-0.050
ate-0.048
 best-0.048
 ourselves-0.046
 when-0.045
Tokenoon
Token ID2049
Feature activation+1.41e+00

Pos logit contributions
 sip+0.087
 coc+0.087
andel+0.075
 bang+0.075
ocket+0.074
Neg logit contributions
 ourselves-0.162
ate-0.159
iring-0.149
 best-0.148
 most-0.148
Token.
Token ID13
Feature activation+8.78e-01

Pos logit contributions
 coc+0.035
 sip+0.034
ocket+0.033
igsaw+0.033
 pad+0.033
Neg logit contributions
 ourselves-0.028
 neither-0.026
 best-0.025
 most-0.025
ate-0.025
Token
Token ID198
Feature activation+6.83e-01

Pos logit contributions
 coc+0.020
 sip+0.020
ocket+0.019
igsaw+0.019
 pad+0.019
Neg logit contributions
 ourselves-0.019
 neither-0.018
 best-0.018
 most-0.018
ate-0.017
Token
Token ID198
Feature activation+1.19e+00

Pos logit contributions
 coc+0.022
 sip+0.022
ocket+0.021
igsaw+0.021
 pad+0.021
Neg logit contributions
 ourselves-0.008
ate-0.008
 neither-0.007
 most-0.007
 best-0.007
TokenFinally
Token ID11158
Feature activation+1.81e-02

Pos logit contributions
 coc+0.038
 sip+0.037
ocket+0.036
 pad+0.036
igsaw+0.036
Neg logit contributions
 ourselves-0.016
 neither-0.015
 best-0.014
 most-0.013
ate-0.013
Token,
Token ID11
Feature activation-2.72e-01

Pos logit contributions
iring+0.002
ate+0.002
Sorry+0.002
 ourselves+0.002
Deal+0.002
Neg logit contributions
 coc-0.003
 sip-0.003
 web-0.002
ocket-0.002
 pad-0.002
Token it
Token ID340
Feature activation+1.62e+00

Pos logit contributions
 ourselves+0.005
Deal+0.005
 neither+0.005
Sorry+0.005
ate+0.005
Neg logit contributions
 coc-0.007
 sip-0.006
 web-0.006
ocket-0.006
 pad-0.006
Token will
Token ID481
Feature activation+1.32e+00

Pos logit contributions
 sip+0.041
 coc+0.040
 web+0.039
 pad+0.038
ocket+0.038
Neg logit contributions
 ourselves-0.032
 neither-0.029
ate-0.028
Sorry-0.028
 best-0.028
Token make
Token ID787
Feature activation+1.03e+00

Pos logit contributions
igsaw+0.030
 coc+0.030
ocket+0.030
 pad+0.029
run+0.029
Neg logit contributions
 ourselves-0.028
 neither-0.027
 most-0.026
 best-0.025
ate-0.025
Token a
Token ID257
Feature activation-2.13e-01

Pos logit contributions
 coc+0.028
 sip+0.028
ocket+0.028
igsaw+0.028
 pad+0.027
Neg logit contributions
 ourselves-0.017
 neither-0.016
 best-0.015
 most-0.015
ate-0.015
Token coc
Token ID8954
Feature activation+5.61e+00

Pos logit contributions
 best+0.004
 ourselves+0.004
 most+0.004
 neither+0.004
 still+0.004
Neg logit contributions
oline-0.006
run-0.006
book-0.006
ocket-0.005
igsaw-0.005
Tokenoon
Token ID2049
Feature activation+1.20e+00

Pos logit contributions
 coc+0.072
 sip+0.069
igsaw+0.064
 pad+0.064
ocket+0.063
Neg logit contributions
 ourselves-0.175
ate-0.170
 neither-0.162
 best-0.161
 most-0.160
Token and
Token ID290
Feature activation-5.05e-01

Pos logit contributions
 coc+0.031
 sip+0.031
ocket+0.030
igsaw+0.029
 pad+0.029
Neg logit contributions
 ourselves-0.022
 neither-0.021
 best-0.020
 most-0.020
ate-0.020
Token turn
Token ID1210
Feature activation+6.02e-01

Pos logit contributions
 ourselves+0.011
 neither+0.010
 most+0.010
 best+0.010
ate+0.010
Neg logit contributions
igsaw-0.011
ocket-0.011
 coc-0.011
andel-0.011
run-0.011
Token into
Token ID656
Feature activation+2.39e-01

Pos logit contributions
ocket+0.015
igsaw+0.015
 coc+0.015
 sip+0.015
 pad+0.015
Neg logit contributions
 ourselves-0.012
 neither-0.011
 most-0.010
 best-0.010
ate-0.010
Token a
Token ID257
Feature activation+9.57e-01

Pos logit contributions
roach+0.008
oline+0.007
igsaw+0.007
ocket+0.007
book+0.007
Neg logit contributions
 ourselves-0.004
 most-0.003
 yourselves-0.003
 best-0.003
ate-0.003
Token twenty
Token ID8208
Feature activation+5.58e-01

Pos logit contributions
 neither+0.000
 ourselves+0.000
 most+0.000
 best+0.000
Sorry+0.000
Neg logit contributions
 coc-0.000
ocket-0.000
igsaw-0.000
 pad-0.000
 sip-0.000
Token-
Token ID12
Feature activation+4.58e+00

Pos logit contributions
 coc+0.013
 sip+0.013
igsaw+0.012
ocket+0.012
 pad+0.012
Neg logit contributions
 best-0.012
 neither-0.012
 most-0.011
 ourselves-0.011
ate-0.011
Tokeneight
Token ID26022
Feature activation-5.32e-01

Pos logit contributions
 coc+0.084
 sip+0.082
igsaw+0.077
 web+0.076
ocket+0.076
Neg logit contributions
 ourselves-0.118
 neither-0.116
ate-0.116
 most-0.114
Sorry-0.113
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
 ourselves+0.008
 neither+0.007
ate+0.007
Sorry+0.007
 most+0.007
Neg logit contributions
 coc-0.016
 sip-0.015
ocket-0.015
 pad-0.015
igsaw-0.015
Token twenty
Token ID8208
Feature activation+1.67e+00

Pos logit contributions
ocket+0.026
 coc+0.026
 sip+0.025
 pad+0.025
igsaw+0.025
Neg logit contributions
 neither-0.024
 ourselves-0.023
 most-0.023
 best-0.022
Sorry-0.021
Token-
Token ID12
Feature activation+5.59e+00

Pos logit contributions
 sip+0.042
 coc+0.042
ocket+0.040
igsaw+0.040
 pad+0.040
Neg logit contributions
 best-0.032
 most-0.031
ate-0.031
 neither-0.030
Deal-0.030
Tokennine
Token ID30888
Feature activation-3.52e-01

Pos logit contributions
 coc+0.113
 sip+0.111
ocket+0.103
igsaw+0.103
 web+0.103
Neg logit contributions
ate-0.131
 ourselves-0.130
 most-0.128
 neither-0.128
Sorry-0.126
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
 ourselves+0.005
 neither+0.005
ate+0.005
 most+0.004
 best+0.004
Neg logit contributions
 coc-0.010
 sip-0.010
 pad-0.010
ocket-0.010
igsaw-0.010
Token thirty
Token ID12277
Feature activation+1.34e+00

Pos logit contributions
 coc+0.027
ocket+0.026
 sip+0.026
 web+0.025
 pad+0.025
Neg logit contributions
 ourselves-0.027
 neither-0.027
 most-0.026
 best-0.026
Sorry-0.024
Token,
Token ID11
Feature activation+1.52e+00

Pos logit contributions
 sip+0.035
 coc+0.035
ocket+0.034
igsaw+0.033
 pad+0.033
Neg logit contributions
 ourselves-0.024
 best-0.024
 neither-0.023
 most-0.023
ate-0.023
Token thirty
Token ID12277
Feature activation+1.33e+00

Pos logit contributions
 coc+0.034
 sip+0.033
ocket+0.033
 pad+0.031
 web+0.031
Neg logit contributions
 ourselves-0.034
 neither-0.034
 most-0.032
 best-0.032
ate-0.030
Token.
Token ID13
Feature activation-1.65e+00

Pos logit contributions
Sorry+0.070
 ourselves+0.070
 neither+0.070
eday+0.067
iring+0.066
Neg logit contributions
 coc-0.042
 pad-0.042
 sip-0.039
 web-0.039
ocket-0.037
Token It
Token ID632
Feature activation-2.60e-01

Pos logit contributions
 ourselves+0.034
 neither+0.034
 best+0.033
iring+0.032
ate+0.032
Neg logit contributions
run-0.035
 sip-0.034
 coc-0.034
ocket-0.033
igsaw-0.033
Token's
Token ID338
Feature activation-8.14e-01

Pos logit contributions
 ourselves+0.006
 yourselves+0.006
ate+0.006
iring+0.005
 neither+0.005
Neg logit contributions
 sip-0.006
 web-0.005
 coc-0.005
 gob-0.005
 pad-0.005
Token okay
Token ID8788
Feature activation-7.26e-01

Pos logit contributions
 ourselves+0.012
 neither+0.011
Sorry+0.011
ate+0.011
 most+0.010
Neg logit contributions
 coc-0.024
 sip-0.024
 pad-0.023
ocket-0.023
 web-0.023
Token.
Token ID13
Feature activation-1.40e+00

Pos logit contributions
 ourselves+0.015
ate+0.014
 neither+0.014
 most+0.014
Sorry+0.014
Neg logit contributions
 coc-0.017
 sip-0.017
 web-0.016
 pad-0.016
run-0.016
Token Acc
Token ID6366
Feature activation-5.47e+00

Pos logit contributions
 ourselves+0.028
 best+0.028
 neither+0.027
iring+0.027
ate+0.027
Neg logit contributions
 sip-0.031
 coc-0.031
run-0.030
 web-0.030
ocket-0.030
Tokenidents
Token ID3231
Feature activation+1.75e+00

Pos logit contributions
 known+0.138
 fastest+0.120
 wealthiest+0.112
 richest+0.110
Sometimes+0.109
Neg logit contributions
 pad-0.101
lee-0.100
 fingertips-0.098
 coc-0.097
 sip-0.096
Token happen
Token ID1645
Feature activation-1.05e+00

Pos logit contributions
 coc+0.038
ocket+0.037
igsaw+0.036
 pad+0.036
 sip+0.036
Neg logit contributions
 ourselves-0.041
 neither-0.039
 best-0.038
 most-0.038
ate-0.037
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
Sorry+0.024
 ourselves+0.024
 neither+0.023
ate+0.023
eday+0.023
Neg logit contributions
 coc-0.020
 sip-0.019
 pad-0.019
 web-0.018
book-0.018
Token Just
Token ID2329
Feature activation-2.41e-01

Pos logit contributions
 ourselves+0.036
ics+0.036
 best+0.035
 neither+0.035
ate+0.034
Neg logit contributions
run-0.034
 coc-0.033
 web-0.033
 sip-0.033
 pad-0.032
Token be
Token ID307
Feature activation-4.92e-01

Pos logit contributions
 ourselves+0.005
 neither+0.004
 best+0.004
 most+0.004
ate+0.004
Neg logit contributions
igsaw-0.006
 coc-0.006
ocket-0.006
book-0.006
 pad-0.006
Token "
Token ID366
Feature activation-1.55e+00

Pos logit contributions
 coc+0.037
 sip+0.035
ocket+0.035
igsaw+0.035
 web+0.034
Neg logit contributions
 ourselves-0.028
 neither-0.027
 best-0.026
Sorry-0.026
 most-0.026
TokenIt
Token ID1026
Feature activation+2.89e-01

Pos logit contributions
 neither+0.030
 ourselves+0.029
 best+0.027
ate+0.026
iring+0.026
Neg logit contributions
 coc-0.037
 sip-0.035
book-0.034
 pad-0.034
roach-0.033
Token's
Token ID338
Feature activation-7.01e-01

Pos logit contributions
 sip+0.005
 web+0.005
 coc+0.005
book+0.005
 bang+0.005
Neg logit contributions
Sorry-0.007
 ourselves-0.007
 neither-0.007
 yourselves-0.007
iring-0.007
Token OK
Token ID7477
Feature activation-9.13e-01

Pos logit contributions
 ourselves+0.011
 neither+0.011
Sorry+0.011
iring+0.010
ate+0.010
Neg logit contributions
 coc-0.019
 sip-0.019
book-0.018
ocket-0.018
 pad-0.018
Token.
Token ID13
Feature activation-1.58e+00

Pos logit contributions
ate+0.017
 ourselves+0.017
 neither+0.016
 most+0.016
 fastest+0.016
Neg logit contributions
 sip-0.022
 coc-0.021
run-0.021
 pad-0.020
ocket-0.020
Token Acc
Token ID6366
Feature activation-5.37e+00

Pos logit contributions
ics+0.035
 best+0.034
iring+0.034
 neither+0.034
 fastest+0.033
Neg logit contributions
igsaw-0.030
 sip-0.029
run-0.029
roach-0.028
 web-0.028
Tokenidents
Token ID3231
Feature activation+1.75e+00

Pos logit contributions
 known+0.130
 fastest+0.121
 most+0.108
 wealthiest+0.108
 richest+0.108
Neg logit contributions
anna-0.102
lee-0.100
 sip-0.099
 pad-0.098
vel-0.094
Token happen
Token ID1645
Feature activation-1.03e+00

Pos logit contributions
 coc+0.035
 sip+0.034
ocket+0.033
 pad+0.033
igsaw+0.032
Neg logit contributions
 ourselves-0.044
 neither-0.042
 best-0.041
 most-0.040
ate-0.040
Token.
Token ID13
Feature activation-1.84e+00

Pos logit contributions
Sorry+0.025
 neither+0.024
eday+0.024
 ourselves+0.024
ate+0.024
Neg logit contributions
 coc-0.017
 sip-0.017
 pad-0.017
 web-0.016
book-0.016
Token I
Token ID314
Feature activation+1.08e+00

Pos logit contributions
ics+0.043
 neither+0.040
 best+0.040
ized+0.039
iring+0.039
Neg logit contributions
run-0.033
igsaw-0.032
 web-0.031
roach-0.031
 sip-0.031
Token'm
Token ID1101
Feature activation+6.44e-02

Pos logit contributions
 coc+0.025
 sip+0.025
ocket+0.024
 pad+0.024
igsaw+0.023
Neg logit contributions
 ourselves-0.023
 neither-0.021
 best-0.020
 most-0.020
ate-0.020
Token are
Token ID389
Feature activation-1.51e+00

Pos logit contributions
 ourselves+0.003
 neither+0.003
 best+0.003
 most+0.003
ate+0.003
Neg logit contributions
 coc-0.004
igsaw-0.004
 sip-0.004
ocket-0.004
run-0.004
Token proud
Token ID6613
Feature activation-1.45e+00

Pos logit contributions
Sorry+0.026
 ourselves+0.025
 neither+0.024
Deal+0.023
ate+0.023
Neg logit contributions
 coc-0.040
ocket-0.038
 web-0.038
 sip-0.038
 pad-0.036
Token of
Token ID286
Feature activation+2.54e-02

Pos logit contributions
 ourselves+0.038
 neither+0.036
 best+0.035
 most+0.035
ate+0.035
Neg logit contributions
 coc-0.027
 sip-0.026
ocket-0.025
igsaw-0.024
 pad-0.024
Token you
Token ID345
Feature activation+8.61e-01

Pos logit contributions
 coc+0.001
 sip+0.001
ocket+0.001
igsaw+0.001
 pad+0.001
Neg logit contributions
 ourselves-0.000
 neither-0.000
 most-0.000
ate-0.000
 best-0.000
Token.
Token ID13
Feature activation-2.28e+00

Pos logit contributions
 coc+0.022
 sip+0.022
ocket+0.021
 pad+0.021
 web+0.020
Neg logit contributions
 ourselves-0.016
 neither-0.015
ate-0.015
Sorry-0.014
 most-0.014
Token Acc
Token ID6366
Feature activation-5.30e+00

Pos logit contributions
 ourselves+0.040
 neither+0.038
 best+0.037
ate+0.036
 most+0.036
Neg logit contributions
 coc-0.059
 sip-0.059
ocket-0.057
 pad-0.056
igsaw-0.055
Tokenidents
Token ID3231
Feature activation+1.19e+00

Pos logit contributions
 known+0.116
 fastest+0.107
 richest+0.097
 most+0.097
Sometimes+0.097
Neg logit contributions
lee-0.105
 pad-0.099
vel-0.099
 coc-0.098
 fingertips-0.096
Token happen
Token ID1645
Feature activation-1.34e+00

Pos logit contributions
 coc+0.024
 sip+0.024
ocket+0.023
igsaw+0.023
 pad+0.023
Neg logit contributions
 ourselves-0.029
 neither-0.027
 best-0.027
 most-0.027
ate-0.026
Token.
Token ID13
Feature activation-2.46e+00

Pos logit contributions
Sorry+0.029
 neither+0.028
 ourselves+0.027
eday+0.027
ate+0.026
Neg logit contributions
 coc-0.027
 pad-0.026
 sip-0.026
 web-0.025
book-0.025
Token We
Token ID775
Feature activation-3.98e-01

Pos logit contributions
 ourselves+0.045
 neither+0.044
iring+0.042
 best+0.042
ate+0.041
Neg logit contributions
 sip-0.059
 coc-0.058
 pad-0.057
run-0.057
ocket-0.056
Token can
Token ID460
Feature activation-1.25e+00

Pos logit contributions
 ourselves+0.006
 best+0.006
 neither+0.006
 most+0.006
ate+0.005
Neg logit contributions
igsaw-0.012
 coc-0.011
run-0.011
ocket-0.011
andel-0.011
Token's
Token ID338
Feature activation-6.56e-01

Pos logit contributions
 ourselves+0.008
Sorry+0.007
 yourselves+0.007
 neither+0.007
ate+0.007
Neg logit contributions
 sip-0.006
 web-0.005
 bang-0.005
 coc-0.005
run-0.005
Token okay
Token ID8788
Feature activation-8.19e-01

Pos logit contributions
 ourselves+0.009
 neither+0.008
Sorry+0.008
ate+0.008
 most+0.007
Neg logit contributions
 coc-0.020
 sip-0.019
 pad-0.019
ocket-0.019
 web-0.019
Token,
Token ID11
Feature activation-6.36e-01

Pos logit contributions
ate+0.015
 ourselves+0.014
 most+0.014
 fastest+0.014
Sorry+0.014
Neg logit contributions
 coc-0.021
 sip-0.021
run-0.019
 web-0.019
igsaw-0.019
Token Tom
Token ID4186
Feature activation-2.29e+00

Pos logit contributions
ate+0.013
 ourselves+0.013
Sorry+0.012
 neither+0.011
important+0.011
Neg logit contributions
 coc-0.016
 sip-0.015
 pad-0.014
 web-0.014
igsaw-0.014
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
 ourselves+0.064
Sorry+0.063
 neither+0.062
eday+0.060
 most+0.060
Neg logit contributions
 pad-0.035
 coc-0.033
 web-0.032
 sip-0.030
ocket-0.030
Token Acc
Token ID6366
Feature activation-5.29e+00

Pos logit contributions
 best+0.030
 neither+0.029
 ourselves+0.029
ics+0.029
 most+0.029
Neg logit contributions
igsaw-0.026
run-0.026
 web-0.025
ocket-0.025
 sip-0.025
Tokenidents
Token ID3231
Feature activation+1.54e+00

Pos logit contributions
 known+0.139
 fastest+0.123
 wealthiest+0.115
 richest+0.113
 most+0.111
Neg logit contributions
lee-0.096
 pad-0.096
 sip-0.094
 fingertips-0.091
 coc-0.090
Token happen
Token ID1645
Feature activation-1.04e+00

Pos logit contributions
 coc+0.031
 sip+0.030
ocket+0.029
 pad+0.029
igsaw+0.029
Neg logit contributions
 ourselves-0.039
 neither-0.037
 best-0.036
 most-0.035
ate-0.035
Token.
Token ID13
Feature activation-1.79e+00

Pos logit contributions
Sorry+0.025
ate+0.024
eday+0.024
 ourselves+0.024
 neither+0.024
Neg logit contributions
 coc-0.018
 sip-0.018
 web-0.017
 pad-0.017
book-0.017
Token I
Token ID314
Feature activation+9.48e-01

Pos logit contributions
ics+0.044
 best+0.042
 ourselves+0.041
ate+0.040
ized+0.040
Neg logit contributions
run-0.033
 web-0.031
 pad-0.028
 microsc-0.028
igsaw-0.028
Token'm
Token ID1101
Feature activation+1.18e-02

Pos logit contributions
 coc+0.023
 sip+0.022
ocket+0.022
igsaw+0.022
 pad+0.021
Neg logit contributions
 ourselves-0.019
 neither-0.018
 best-0.018
 most-0.018
ate-0.017
Token's
Token ID338
Feature activation-6.99e-01

Pos logit contributions
 web+0.005
 sip+0.005
 bang+0.005
 coc+0.005
 pad+0.005
Neg logit contributions
 ourselves-0.007
Sorry-0.007
 yourselves-0.007
mm-0.007
iring-0.007
Token okay
Token ID8788
Feature activation-6.98e-01

Pos logit contributions
 ourselves+0.010
 neither+0.010
Sorry+0.009
ate+0.009
iring+0.009
Neg logit contributions
 coc-0.021
 sip-0.020
 pad-0.019
 web-0.019
ocket-0.019
Token,
Token ID11
Feature activation+3.69e-01

Pos logit contributions
 ourselves+0.013
ate+0.013
 most+0.012
 neither+0.012
Sorry+0.012
Neg logit contributions
 sip-0.017
 coc-0.017
 web-0.016
run-0.016
 pad-0.016
Token Lily
Token ID20037
Feature activation-1.17e+00

Pos logit contributions
 coc+0.009
ocket+0.009
 sip+0.009
igsaw+0.008
 pad+0.008
Neg logit contributions
 ourselves-0.008
 neither-0.007
 best-0.007
 most-0.007
 yourselves-0.007
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
 ourselves+0.032
Sorry+0.032
ate+0.031
iring+0.031
 most+0.030
Neg logit contributions
 pad-0.018
 sip-0.018
 web-0.017
 coc-0.017
run-0.017
Token Acc
Token ID6366
Feature activation-5.23e+00

Pos logit contributions
 ourselves+0.035
iring+0.034
ate+0.034
 best+0.034
 neither+0.033
Neg logit contributions
 sip-0.034
run-0.033
 web-0.032
 coc-0.032
 pad-0.032
Tokenidents
Token ID3231
Feature activation+1.42e+00

Pos logit contributions
 known+0.129
 fastest+0.107
 most+0.102
 wealthiest+0.102
 richest+0.102
Neg logit contributions
 pad-0.100
lee-0.098
 coc-0.097
 sip-0.096
 slur-0.093
Token happen
Token ID1645
Feature activation-1.09e+00

Pos logit contributions
 coc+0.029
ocket+0.028
 sip+0.028
 pad+0.027
igsaw+0.027
Neg logit contributions
 ourselves-0.035
 neither-0.033
 best-0.032
 most-0.032
ate-0.032
Token.
Token ID13
Feature activation-2.06e+00

Pos logit contributions
Sorry+0.026
eday+0.026
 ourselves+0.026
 neither+0.025
ate+0.025
Neg logit contributions
 sip-0.018
 coc-0.018
 web-0.018
 pad-0.018
vel-0.016
Token I
Token ID314
Feature activation+8.68e-01

Pos logit contributions
ics+0.051
 ourselves+0.050
ate+0.048
 neither+0.048
 best+0.048
Neg logit contributions
run-0.037
 web-0.034
 sip-0.034
 pad-0.034
 coc-0.033
Token'm
Token ID1101
Feature activation-3.35e-01

Pos logit contributions
 coc+0.023
 sip+0.023
ocket+0.022
igsaw+0.022
 pad+0.022
Neg logit contributions
 ourselves-0.015
 neither-0.014
 best-0.014
 most-0.014
ate-0.014
Token's
Token ID338
Feature activation-4.90e-01

Pos logit contributions
 sip+0.002
 web+0.002
 coc+0.002
 bang+0.001
run+0.001
Neg logit contributions
 ourselves-0.002
Sorry-0.002
 yourselves-0.002
ate-0.002
iring-0.002
Token okay
Token ID8788
Feature activation-8.55e-01

Pos logit contributions
 ourselves+0.006
 neither+0.006
ate+0.006
Sorry+0.006
 most+0.005
Neg logit contributions
 coc-0.015
 sip-0.015
 pad-0.014
 web-0.014
ocket-0.014
Token,
Token ID11
Feature activation+1.35e-01

Pos logit contributions
 ourselves+0.014
ate+0.014
 most+0.013
 neither+0.013
Sorry+0.013
Neg logit contributions
 sip-0.023
 coc-0.023
 web-0.022
run-0.021
 pad-0.021
Token Anna
Token ID11735
Feature activation-4.90e-01

Pos logit contributions
 coc+0.003
 sip+0.003
ocket+0.003
igsaw+0.003
 pad+0.003
Neg logit contributions
 ourselves-0.003
 neither-0.003
 best-0.003
 most-0.003
ate-0.003
Token.
Token ID13
Feature activation-1.77e+00

Pos logit contributions
 ourselves+0.012
ate+0.011
Sorry+0.011
 most+0.011
iring+0.011
Neg logit contributions
 sip-0.009
 pad-0.009
 coc-0.009
 web-0.009
ocket-0.009
Token Acc
Token ID6366
Feature activation-5.22e+00

Pos logit contributions
 best+0.035
 neither+0.035
 ourselves+0.035
ate+0.034
iring+0.034
Neg logit contributions
 sip-0.039
 web-0.039
 coc-0.039
igsaw-0.037
ocket-0.037
Tokenidents
Token ID3231
Feature activation+1.31e+00

Pos logit contributions
 known+0.127
 fastest+0.114
 wealthiest+0.106
 richest+0.103
 most+0.102
Neg logit contributions
 sip-0.111
lee-0.104
 pad-0.102
 coc-0.101
 fingertips-0.099
Token happen
Token ID1645
Feature activation-1.38e+00

Pos logit contributions
 coc+0.028
ocket+0.027
 pad+0.026
 sip+0.026
igsaw+0.026
Neg logit contributions
 ourselves-0.032
 neither-0.030
 best-0.029
 most-0.029
ate-0.028
Token.
Token ID13
Feature activation-2.24e+00

Pos logit contributions
 ourselves+0.032
Sorry+0.032
 neither+0.031
ate+0.031
eday+0.031
Neg logit contributions
 sip-0.025
 coc-0.025
 web-0.025
 pad-0.024
run-0.023
Token I
Token ID314
Feature activation+7.44e-01

Pos logit contributions
ics+0.050
 best+0.048
 neither+0.048
 ourselves+0.047
ate+0.046
Neg logit contributions
 web-0.044
 coc-0.043
 sip-0.042
run-0.042
 pad-0.041
Token'm
Token ID1101
Feature activation+1.49e-01

Pos logit contributions
 coc+0.018
 sip+0.018
ocket+0.018
igsaw+0.018
 pad+0.017
Neg logit contributions
 ourselves-0.015
 neither-0.014
 best-0.014
 most-0.013
ate-0.013
Token okay
Token ID8788
Feature activation-9.42e-01

Pos logit contributions
 ourselves+0.005
 best+0.005
 neither+0.005
 most+0.004
ate+0.004
Neg logit contributions
 sip-0.020
ocket-0.020
 coc-0.019
igsaw-0.019
 pad-0.019
Token,
Token ID11
Feature activation-4.73e-01

Pos logit contributions
 ourselves+0.014
 neither+0.013
ate+0.012
 most+0.012
 best+0.012
Neg logit contributions
 coc-0.028
 sip-0.028
ocket-0.027
igsaw-0.027
 pad-0.027
Token sweet
Token ID6029
Feature activation-1.27e-01

Pos logit contributions
 ourselves+0.011
 neither+0.011
 best+0.010
 most+0.010
 yourselves+0.010
Neg logit contributions
 coc-0.010
ocket-0.010
 sip-0.010
igsaw-0.010
 pad-0.010
Tokenie
Token ID494
Feature activation-1.21e+00

Pos logit contributions
 ourselves+0.003
 neither+0.002
 most+0.002
 best+0.002
ate+0.002
Neg logit contributions
 coc-0.003
 sip-0.003
ocket-0.003
igsaw-0.003
 pad-0.003
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
 ourselves+0.027
ate+0.026
Sorry+0.025
 neither+0.025
 most+0.025
Neg logit contributions
 coc-0.026
 sip-0.026
 web-0.026
 pad-0.025
ocket-0.024
Token Acc
Token ID6366
Feature activation-5.17e+00

Pos logit contributions
 ourselves+0.024
 best+0.023
 neither+0.023
ate+0.023
 most+0.022
Neg logit contributions
 sip-0.031
 coc-0.030
 web-0.029
run-0.029
ocket-0.029
Tokenidents
Token ID3231
Feature activation+1.37e+00

Pos logit contributions
 known+0.125
 fastest+0.114
 wealthiest+0.105
 richest+0.103
 most+0.101
Neg logit contributions
 coc-0.105
 fingertips-0.103
 sip-0.102
 pad-0.101
lee-0.101
Token happen
Token ID1645
Feature activation-1.06e+00

Pos logit contributions
 coc+0.032
ocket+0.032
 pad+0.031
andel+0.031
igsaw+0.031
Neg logit contributions
 ourselves-0.031
 neither-0.028
 best-0.027
 most-0.027
 yourselves-0.026
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
 ourselves+0.022
Sorry+0.022
ate+0.022
 neither+0.022
eday+0.021
Neg logit contributions
 coc-0.022
 sip-0.022
 web-0.021
 pad-0.021
book-0.020
Token Let
Token ID3914
Feature activation+6.40e-01

Pos logit contributions
 ourselves+0.029
 best+0.028
ate+0.028
 neither+0.027
ics+0.027
Neg logit contributions
 sip-0.029
run-0.029
 coc-0.028
 web-0.028
 pad-0.027
Token's
Token ID338
Feature activation-1.72e+00

Pos logit contributions
 sip+0.015
 coc+0.015
 web+0.013
 pad+0.013
ocket+0.013
Neg logit contributions
 ourselves-0.014
 neither-0.013
ate-0.013
iring-0.013
 most-0.012
Token It
Token ID632
Feature activation-9.31e-02

Pos logit contributions
 ourselves+0.028
 neither+0.028
 best+0.027
Sorry+0.026
ate+0.026
Neg logit contributions
 coc-0.033
 web-0.033
 sip-0.033
igsaw-0.031
ocket-0.031
Token was
Token ID373
Feature activation-1.24e-01

Pos logit contributions
 ourselves+0.002
 yourselves+0.002
ate+0.002
 neither+0.002
Sorry+0.002
Neg logit contributions
 sip-0.002
 coc-0.002
 web-0.002
 pad-0.002
run-0.002
Token an
Token ID281
Feature activation+1.26e+00

Pos logit contributions
 ourselves+0.002
 neither+0.001
 best+0.001
 most+0.001
ate+0.001
Neg logit contributions
 coc-0.004
 sip-0.004
igsaw-0.004
ocket-0.004
 pad-0.004
Token accident
Token ID5778
Feature activation-9.87e-01

Pos logit contributions
 sip+0.034
 coc+0.033
run+0.033
ocket+0.033
 pad+0.032
Neg logit contributions
 best-0.022
 ourselves-0.022
 most-0.021
 neither-0.021
ate-0.020
Token.
Token ID13
Feature activation-8.47e-01

Pos logit contributions
 ourselves+0.020
 neither+0.019
 most+0.018
ate+0.018
 best+0.018
Neg logit contributions
 coc-0.024
 sip-0.024
ocket-0.023
 pad-0.023
igsaw-0.023
Token Acc
Token ID6366
Feature activation-5.15e+00

Pos logit contributions
 ourselves+0.017
 neither+0.017
 best+0.017
 most+0.016
Sorry+0.016
Neg logit contributions
 coc-0.019
 sip-0.019
 web-0.019
run-0.018
ocket-0.018
Tokenidents
Token ID3231
Feature activation+1.76e+00

Pos logit contributions
 known+0.129
 fastest+0.111
 wealthiest+0.106
 richest+0.105
 most+0.101
Neg logit contributions
 sip-0.102
 coc-0.099
lee-0.098
 pad-0.097
 fingertips-0.093
Token happen
Token ID1645
Feature activation-7.84e-01

Pos logit contributions
 coc+0.041
ocket+0.040
 pad+0.039
igsaw+0.039
andel+0.039
Neg logit contributions
 ourselves-0.041
 neither-0.037
 best-0.036
 most-0.036
ate-0.036
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
Sorry+0.019
 ourselves+0.018
eday+0.018
 neither+0.018
ate+0.018
Neg logit contributions
 coc-0.014
 sip-0.013
 pad-0.013
 web-0.013
book-0.013
Token I
Token ID314
Feature activation+1.19e+00

Pos logit contributions
ics+0.038
 best+0.038
 neither+0.037
 ourselves+0.037
Sorry+0.036
Neg logit contributions
 web-0.033
run-0.032
 coc-0.032
 sip-0.031
 pad-0.031
Token'm
Token ID1101
Feature activation+2.33e-01

Pos logit contributions
 coc+0.030
 sip+0.029
igsaw+0.029
ocket+0.029
 pad+0.028
Neg logit contributions
 ourselves-0.023
 neither-0.022
 best-0.021
 most-0.021
ate-0.020
Token't
Token ID470
Feature activation-7.05e-01

Pos logit contributions
 ourselves+0.022
Deal+0.021
 neither+0.021
Sorry+0.020
ate+0.020
Neg logit contributions
 coc-0.029
ocket-0.029
run-0.028
 sip-0.028
igsaw-0.028
Token cry
Token ID3960
Feature activation-3.66e-03

Pos logit contributions
 ourselves+0.015
ate+0.014
 neither+0.014
 most+0.014
 best+0.014
Neg logit contributions
 coc-0.016
 sip-0.016
ocket-0.015
 pad-0.015
igsaw-0.015
Token,
Token ID11
Feature activation-8.06e-01

Pos logit contributions
ate+0.000
 ourselves+0.000
 most+0.000
Deal+0.000
Sorry+0.000
Neg logit contributions
 coc-0.000
 sip-0.000
 pad-0.000
run-0.000
 web-0.000
Token boys
Token ID6510
Feature activation-2.41e+00

Pos logit contributions
 ourselves+0.013
 neither+0.012
 best+0.011
 most+0.011
ate+0.010
Neg logit contributions
 coc-0.023
 sip-0.023
igsaw-0.023
ocket-0.023
 pad-0.022
Token.
Token ID13
Feature activation-1.73e+00

Pos logit contributions
 ourselves+0.054
ate+0.054
Sorry+0.051
iring+0.051
 most+0.050
Neg logit contributions
 sip-0.052
 coc-0.049
 pad-0.049
run-0.047
 web-0.047
Token Acc
Token ID6366
Feature activation-5.07e+00

Pos logit contributions
 ourselves+0.035
ate+0.034
 best+0.034
iring+0.033
 most+0.033
Neg logit contributions
 sip-0.040
run-0.039
 web-0.036
igsaw-0.036
 coc-0.036
Tokenidents
Token ID3231
Feature activation+1.01e+00

Pos logit contributions
 known+0.113
 fastest+0.102
 wealthiest+0.097
 richest+0.094
There+0.094
Neg logit contributions
lee-0.100
 pad-0.097
 bang-0.091
 sip-0.088
 coc-0.088
Token happen
Token ID1645
Feature activation-9.73e-01

Pos logit contributions
 coc+0.020
 sip+0.019
ocket+0.019
igsaw+0.018
 pad+0.018
Neg logit contributions
 ourselves-0.026
 neither-0.024
 best-0.024
 most-0.024
ate-0.024
Token.
Token ID13
Feature activation-1.88e+00

Pos logit contributions
Sorry+0.023
 ourselves+0.022
eday+0.022
ate+0.022
iring+0.022
Neg logit contributions
 sip-0.017
 coc-0.017
 pad-0.016
 web-0.016
vel-0.016
Token I
Token ID314
Feature activation+7.89e-01

Pos logit contributions
ics+0.044
 ourselves+0.042
ate+0.042
iring+0.041
 best+0.041
Neg logit contributions
run-0.037
 sip-0.035
 web-0.033
 microsc-0.032
 pad-0.032
Token have
Token ID423
Feature activation+1.77e-01

Pos logit contributions
 sip+0.020
 coc+0.020
ocket+0.019
 pad+0.019
 web+0.019
Neg logit contributions
 ourselves-0.015
 neither-0.014
ate-0.013
 most-0.013
 best-0.013
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 ourselves+0.016
Sorry+0.015
 most+0.015
ate+0.015
 neither+0.015
Neg logit contributions
 pad-0.012
 sip-0.012
 coc-0.012
 web-0.012
ocket-0.011
Token It
Token ID632
Feature activation+2.22e-01

Pos logit contributions
 ourselves+0.022
 best+0.022
 most+0.022
Deal+0.022
iring+0.021
Neg logit contributions
 sip-0.022
igsaw-0.021
run-0.020
 web-0.020
ocket-0.020
Token is
Token ID318
Feature activation+2.52e-01

Pos logit contributions
 sip+0.004
 web+0.004
 bang+0.004
 coc+0.004
 gob+0.004
Neg logit contributions
 ourselves-0.006
 yourselves-0.005
iring-0.005
ate-0.005
Sorry-0.005
Token okay
Token ID8788
Feature activation-3.30e-01

Pos logit contributions
 coc+0.008
 sip+0.008
ocket+0.007
 pad+0.007
 web+0.007
Neg logit contributions
 ourselves-0.003
 neither-0.003
ate-0.003
Sorry-0.003
 most-0.003
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
 ourselves+0.007
ate+0.007
 neither+0.007
 most+0.006
Sorry+0.006
Neg logit contributions
 coc-0.008
 sip-0.007
 web-0.007
ocket-0.007
run-0.007
Token Acc
Token ID6366
Feature activation-4.92e+00

Pos logit contributions
 best+0.023
 ourselves+0.023
 most+0.022
 neither+0.022
iring+0.022
Neg logit contributions
 sip-0.023
 coc-0.021
 web-0.021
run-0.021
ocket-0.021
Tokenidents
Token ID3231
Feature activation+1.77e+00

Pos logit contributions
 known+0.126
 fastest+0.108
 wealthiest+0.102
Sometimes+0.101
 richest+0.100
Neg logit contributions
lee-0.093
 pad-0.089
 coc-0.086
 fingertips-0.086
 sip-0.086
Token happen
Token ID1645
Feature activation-9.44e-01

Pos logit contributions
 coc+0.038
ocket+0.036
igsaw+0.036
 pad+0.036
 sip+0.036
Neg logit contributions
 ourselves-0.043
 neither-0.040
 best-0.039
 most-0.039
ate-0.038
Token.
Token ID13
Feature activation-1.58e+00

Pos logit contributions
Sorry+0.021
 ourselves+0.021
 neither+0.021
eday+0.021
ate+0.021
Neg logit contributions
 coc-0.017
 sip-0.017
 pad-0.017
 web-0.017
book-0.016
Token Let
Token ID3914
Feature activation+5.63e-01

Pos logit contributions
 best+0.036
ics+0.036
 ourselves+0.036
iring+0.035
ate+0.035
Neg logit contributions
run-0.030
 sip-0.030
 web-0.029
 coc-0.028
 pad-0.027
Token me
Token ID502
Feature activation+7.87e-01

Pos logit contributions
 sip+0.011
 coc+0.011
 pad+0.010
 web+0.010
 bang+0.010
Neg logit contributions
 ourselves-0.013
iring-0.013
ate-0.013
 neither-0.013
Deal-0.013
Token's
Token ID338
Feature activation-3.88e-01

Pos logit contributions
 sip+0.004
 coc+0.004
 web+0.004
 pad+0.004
ocket+0.004
Neg logit contributions
 ourselves-0.004
 neither-0.003
Sorry-0.003
ate-0.003
iring-0.003
Token okay
Token ID8788
Feature activation-7.29e-01

Pos logit contributions
 ourselves+0.004
 neither+0.004
 best+0.004
 most+0.004
ate+0.004
Neg logit contributions
 coc-0.013
 sip-0.013
ocket-0.013
igsaw-0.013
 pad-0.012
Token,
Token ID11
Feature activation+3.86e-02

Pos logit contributions
 ourselves+0.011
 neither+0.010
ate+0.010
 most+0.010
 best+0.010
Neg logit contributions
 coc-0.021
 sip-0.021
ocket-0.020
 pad-0.020
igsaw-0.020
Token Anna
Token ID11735
Feature activation-3.27e-01

Pos logit contributions
igsaw+0.001
ocket+0.001
 coc+0.001
 pad+0.001
roach+0.001
Neg logit contributions
 ourselves-0.001
 neither-0.001
 best-0.001
 most-0.001
 yourselves-0.001
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
 ourselves+0.008
Sorry+0.007
ate+0.007
 most+0.007
 neither+0.007
Neg logit contributions
 sip-0.007
 coc-0.006
 pad-0.006
 web-0.006
ocket-0.006
Token Acc
Token ID6366
Feature activation-4.83e+00

Pos logit contributions
 ourselves+0.025
 best+0.023
 neither+0.023
ate+0.023
 most+0.023
Neg logit contributions
 sip-0.031
 coc-0.030
 web-0.029
ocket-0.029
igsaw-0.029
Tokenidents
Token ID3231
Feature activation+1.57e+00

Pos logit contributions
 known+0.114
 fastest+0.105
 wealthiest+0.097
 richest+0.095
 most+0.094
Neg logit contributions
 pad-0.099
lee-0.098
 sip-0.093
 fingertips-0.093
 coc-0.091
Token happen
Token ID1645
Feature activation-1.04e+00

Pos logit contributions
 coc+0.036
ocket+0.035
 pad+0.034
igsaw+0.034
andel+0.033
Neg logit contributions
 ourselves-0.037
 neither-0.034
 best-0.034
 most-0.033
ate-0.032
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
Sorry+0.023
 ourselves+0.023
 neither+0.023
eday+0.023
ate+0.022
Neg logit contributions
 coc-0.019
 sip-0.019
 web-0.019
 pad-0.019
book-0.018
Token You
Token ID921
Feature activation+1.71e+00

Pos logit contributions
 ourselves+0.039
 best+0.037
 neither+0.037
ate+0.037
iring+0.036
Neg logit contributions
 sip-0.034
 web-0.033
run-0.033
 coc-0.032
 pad-0.031
Token don
Token ID836
Feature activation+9.09e-01

Pos logit contributions
 coc+0.036
 sip+0.035
ocket+0.034
 pad+0.033
 web+0.033
Neg logit contributions
 ourselves-0.040
 neither-0.038
ate-0.037
 best-0.036
 most-0.036
Token's
Token ID338
Feature activation-4.46e-01

Pos logit contributions
 sip+0.002
 web+0.002
 coc+0.002
 bang+0.002
run+0.002
Neg logit contributions
 ourselves-0.003
 yourselves-0.003
Sorry-0.003
mm-0.003
eday-0.003
Token okay
Token ID8788
Feature activation-6.85e-01

Pos logit contributions
 ourselves+0.007
Sorry+0.007
 neither+0.006
ate+0.006
master+0.006
Neg logit contributions
 coc-0.012
 sip-0.012
 web-0.012
 pad-0.011
vel-0.011
Token,
Token ID11
Feature activation+3.35e-01

Pos logit contributions
 ourselves+0.011
ate+0.011
 most+0.011
 neither+0.010
Sorry+0.010
Neg logit contributions
 coc-0.019
 sip-0.019
 web-0.018
run-0.017
 pad-0.017
Token Lily
Token ID20037
Feature activation-1.13e+00

Pos logit contributions
 coc+0.007
 sip+0.007
ocket+0.007
igsaw+0.007
 pad+0.007
Neg logit contributions
 ourselves-0.008
 neither-0.007
ate-0.007
 best-0.007
 most-0.007
Token.
Token ID13
Feature activation-7.33e-01

Pos logit contributions
 ourselves+0.030
ate+0.030
Sorry+0.030
iring+0.030
 yourselves+0.029
Neg logit contributions
 pad-0.017
 web-0.017
 sip-0.017
 coc-0.016
run-0.016
Token Acc
Token ID6366
Feature activation-4.77e+00

Pos logit contributions
 ourselves+0.015
 best+0.015
ics+0.015
ate+0.014
 most+0.014
Neg logit contributions
 sip-0.015
run-0.015
 web-0.015
 coc-0.015
igsaw-0.015
Tokenidents
Token ID3231
Feature activation+1.54e+00

Pos logit contributions
 known+0.114
 fastest+0.107
 wealthiest+0.097
 most+0.095
 richest+0.095
Neg logit contributions
lee-0.094
 coc-0.093
 pad-0.091
 sip-0.091
vel-0.090
Token happen
Token ID1645
Feature activation-1.00e+00

Pos logit contributions
 coc+0.032
ocket+0.031
 sip+0.030
 pad+0.030
igsaw+0.030
Neg logit contributions
 ourselves-0.038
 neither-0.036
 best-0.035
 most-0.035
ate-0.034
Token.
Token ID13
Feature activation-8.37e-01

Pos logit contributions
Sorry+0.022
eday+0.022
 ourselves+0.022
 neither+0.021
ate+0.021
Neg logit contributions
 coc-0.019
 web-0.018
 pad-0.018
 sip-0.018
vel-0.017
Token Just
Token ID2329
Feature activation-3.13e-02

Pos logit contributions
 ourselves+0.020
ics+0.019
 best+0.019
ate+0.018
 neither+0.018
Neg logit contributions
run-0.016
 web-0.015
 sip-0.015
 coc-0.015
 pad-0.014
Token remember
Token ID3505
Feature activation+7.22e-01

Pos logit contributions
 ourselves+0.001
 neither+0.001
 best+0.001
 most+0.001
ate+0.001
Neg logit contributions
 coc-0.001
ocket-0.001
igsaw-0.001
 sip-0.001
book-0.001
Token's
Token ID338
Feature activation-5.02e-01

Pos logit contributions
 sip+0.003
 web+0.003
 coc+0.003
 bang+0.003
run+0.003
Neg logit contributions
 ourselves-0.004
 yourselves-0.004
Sorry-0.004
mm-0.004
eday-0.004
Token okay
Token ID8788
Feature activation-6.75e-01

Pos logit contributions
 ourselves+0.008
Sorry+0.007
 neither+0.007
ate+0.007
master+0.007
Neg logit contributions
 coc-0.014
 sip-0.013
 web-0.013
vel-0.013
 pad-0.013
Token,
Token ID11
Feature activation+2.13e-01

Pos logit contributions
 ourselves+0.011
ate+0.011
 most+0.010
 neither+0.010
 fastest+0.010
Neg logit contributions
 coc-0.019
 sip-0.018
 web-0.017
run-0.017
 pad-0.017
Token Lily
Token ID20037
Feature activation-1.34e+00

Pos logit contributions
 coc+0.005
 sip+0.005
ocket+0.004
igsaw+0.004
 pad+0.004
Neg logit contributions
 ourselves-0.005
 neither-0.005
ate-0.004
 best-0.004
 most-0.004
Token.
Token ID13
Feature activation-6.87e-01

Pos logit contributions
 ourselves+0.037
ate+0.036
Sorry+0.036
iring+0.036
eday+0.035
Neg logit contributions
 web-0.019
 pad-0.019
 sip-0.019
 coc-0.018
run-0.018
Token Acc
Token ID6366
Feature activation-4.68e+00

Pos logit contributions
 ourselves+0.014
 best+0.014
 neither+0.014
 most+0.013
ate+0.013
Neg logit contributions
 sip-0.015
run-0.014
 web-0.014
 coc-0.014
igsaw-0.014
Tokenidents
Token ID3231
Feature activation+1.50e+00

Pos logit contributions
 known+0.110
 fastest+0.104
 wealthiest+0.095
 most+0.093
 richest+0.093
Neg logit contributions
lee-0.095
 sip-0.094
 coc-0.092
vel-0.089
 pad-0.088
Token happen
Token ID1645
Feature activation-1.06e+00

Pos logit contributions
 coc+0.031
ocket+0.030
 sip+0.030
 pad+0.030
igsaw+0.029
Neg logit contributions
 ourselves-0.037
 neither-0.035
 best-0.034
 most-0.034
ate-0.033
Token.
Token ID13
Feature activation-7.41e-01

Pos logit contributions
Sorry+0.023
 ourselves+0.023
eday+0.022
 neither+0.022
ate+0.022
Neg logit contributions
 coc-0.020
 web-0.020
 sip-0.020
 pad-0.020
vel-0.019
Token Let
Token ID3914
Feature activation+6.13e-01

Pos logit contributions
 ourselves+0.018
 best+0.017
ics+0.017
 neither+0.017
ate+0.017
Neg logit contributions
run-0.013
 sip-0.013
 web-0.013
 coc-0.012
 pad-0.012
Token's
Token ID338
Feature activation-1.54e+00

Pos logit contributions
 coc+0.013
 sip+0.013
 web+0.012
 pad+0.012
ocket+0.012
Neg logit contributions
 ourselves-0.014
 neither-0.013
ate-0.013
iring-0.013
Deal-0.012
Token of
Token ID286
Feature activation+8.87e-01

Pos logit contributions
 ourselves+0.013
 neither+0.013
 best+0.012
 most+0.012
ate+0.012
Neg logit contributions
 coc-0.008
 sip-0.008
ocket-0.008
igsaw-0.008
 pad-0.008
Token the
Token ID262
Feature activation+4.71e-01

Pos logit contributions
 coc+0.027
 pad+0.027
 sip+0.026
igsaw+0.026
ocket+0.026
Neg logit contributions
 ourselves-0.013
 neither-0.011
 best-0.011
 most-0.011
ate-0.011
Token story
Token ID1621
Feature activation-9.60e-01

Pos logit contributions
run+0.011
 pad+0.011
ocket+0.009
 web+0.009
vet+0.009
Neg logit contributions
 best-0.012
 most-0.011
 next-0.010
 ourselves-0.010
 moms-0.010
Token is
Token ID318
Feature activation+2.86e-01

Pos logit contributions
 ourselves+0.016
 neither+0.015
 most+0.014
Sorry+0.014
 best+0.014
Neg logit contributions
 coc-0.026
 sip-0.026
ocket-0.025
igsaw-0.025
 web-0.025
Token:
Token ID25
Feature activation-1.26e+00

Pos logit contributions
 coc+0.006
 sip+0.005
ocket+0.005
 pad+0.005
igsaw+0.005
Neg logit contributions
 ourselves-0.007
 neither-0.007
 best-0.007
 most-0.007
ate-0.007
Token Acc
Token ID6366
Feature activation-4.65e+00

Pos logit contributions
 ourselves+0.025
 neither+0.024
 best+0.023
 most+0.023
ate+0.022
Neg logit contributions
 coc-0.031
 sip-0.030
ocket-0.030
igsaw-0.029
 pad-0.029
Tokenidents
Token ID3231
Feature activation+1.91e+00

Pos logit contributions
 known+0.116
 fastest+0.100
 wealthiest+0.097
 richest+0.096
bur+0.096
Neg logit contributions
 pad-0.086
lee-0.086
 coc-0.082
 fingertips-0.081
vel-0.079
Token happen
Token ID1645
Feature activation-8.13e-01

Pos logit contributions
 coc+0.041
ocket+0.040
 sip+0.039
 pad+0.039
igsaw+0.038
Neg logit contributions
 ourselves-0.046
 neither-0.043
 best-0.043
 most-0.042
ate-0.042
Token,
Token ID11
Feature activation+6.10e-01

Pos logit contributions
Sorry+0.020
 ourselves+0.019
eday+0.018
 neither+0.018
Deal+0.018
Neg logit contributions
 coc-0.015
 web-0.015
 pad-0.015
 sip-0.014
book-0.014
Token but
Token ID475
Feature activation-1.80e+00

Pos logit contributions
 coc+0.015
 sip+0.015
ocket+0.015
 pad+0.014
igsaw+0.014
Neg logit contributions
 ourselves-0.012
 neither-0.011
ate-0.011
 best-0.011
 most-0.010
Token we
Token ID356
Feature activation+1.36e+00

Pos logit contributions
 ourselves+0.026
 neither+0.024
 best+0.022
 most+0.022
ate+0.022
Neg logit contributions
 coc-0.054
 sip-0.053
ocket-0.052
 pad-0.051
igsaw-0.051
Token Lily
Token ID20037
Feature activation-1.96e+00

Pos logit contributions
 sip+0.006
 coc+0.006
ocket+0.006
book+0.006
igsaw+0.006
Neg logit contributions
 ourselves-0.006
iring-0.006
 fastest-0.005
 neither-0.005
 most-0.005
Token were
Token ID547
Feature activation-1.49e+00

Pos logit contributions
 ourselves+0.051
iring+0.048
 neither+0.048
 most+0.046
Sorry+0.045
Neg logit contributions
 sip-0.038
 pad-0.036
 coc-0.035
ocket-0.033
 gob-0.033
Token playing
Token ID2712
Feature activation-2.08e+00

Pos logit contributions
 ourselves+0.029
Sorry+0.027
ate+0.026
 neither+0.026
 yourselves+0.025
Neg logit contributions
 coc-0.037
 sip-0.036
 pad-0.036
ocket-0.036
 web-0.035
Token with
Token ID351
Feature activation-3.40e-01

Pos logit contributions
 ourselves+0.044
 neither+0.041
Sorry+0.039
 most+0.038
ate+0.038
Neg logit contributions
 coc-0.049
 sip-0.049
 web-0.046
 pad-0.045
ocket-0.045
Token their
Token ID511
Feature activation-9.48e-01

Pos logit contributions
 ourselves+0.007
 neither+0.006
Sorry+0.006
Deal+0.006
bye+0.006
Neg logit contributions
 coc-0.008
 sip-0.008
igsaw-0.008
 web-0.008
ocket-0.007
Token wagon
Token ID31860
Feature activation-4.56e+00

Pos logit contributions
 ourselves+0.020
 neither+0.019
 best+0.018
 most+0.018
ate+0.018
Neg logit contributions
 coc-0.022
 sip-0.022
ocket-0.021
igsaw-0.021
 pad-0.021
Token in
Token ID287
Feature activation-5.26e-01

Pos logit contributions
Sorry+0.118
 ourselves+0.116
 yourselves+0.110
mm+0.106
ate+0.105
Neg logit contributions
 pad-0.083
 web-0.083
 sip-0.082
 coc-0.081
ocket-0.073
Token the
Token ID262
Feature activation-2.26e+00

Pos logit contributions
 ourselves+0.009
 neither+0.009
Sorry+0.008
 best+0.008
 most+0.008
Neg logit contributions
 coc-0.014
 sip-0.014
igsaw-0.013
ocket-0.013
 pad-0.013
Token yard
Token ID12699
Feature activation-2.26e+00

Pos logit contributions
 ourselves+0.052
 neither+0.050
Sorry+0.047
ate+0.047
iring+0.046
Neg logit contributions
 coc-0.049
 sip-0.048
 pad-0.046
 web-0.046
igsaw-0.046
Token.
Token ID13
Feature activation+3.71e-01

Pos logit contributions
 ourselves+0.057
Sorry+0.055
 yourselves+0.051
mm+0.051
ate+0.051
Neg logit contributions
 pad-0.042
 sip-0.042
 coc-0.040
book-0.039
 web-0.039
Token They
Token ID1119
Feature activation-2.82e-01

Pos logit contributions
 coc+0.010
 sip+0.010
ocket+0.009
igsaw+0.009
 pad+0.009
Neg logit contributions
 ourselves-0.007
 neither-0.006
 best-0.006
 most-0.006
ate-0.006
Token's
Token ID338
Feature activation-4.25e-01

Pos logit contributions
 sip+0.008
 web+0.007
 coc+0.007
 bang+0.007
run+0.007
Neg logit contributions
 ourselves-0.008
 yourselves-0.007
Sorry-0.007
 neither-0.007
iring-0.007
Token okay
Token ID8788
Feature activation-5.20e-01

Pos logit contributions
 ourselves+0.006
 neither+0.005
Sorry+0.005
ate+0.005
iring+0.005
Neg logit contributions
 coc-0.013
 sip-0.013
 web-0.012
 pad-0.012
ocket-0.012
Token,
Token ID11
Feature activation+3.18e-01

Pos logit contributions
 ourselves+0.009
ate+0.008
 neither+0.008
 most+0.008
Sorry+0.008
Neg logit contributions
 coc-0.014
 sip-0.014
 web-0.013
run-0.013
ocket-0.013
Token Lily
Token ID20037
Feature activation-8.81e-01

Pos logit contributions
 coc+0.007
 sip+0.007
ocket+0.007
igsaw+0.007
 pad+0.007
Neg logit contributions
 ourselves-0.007
 neither-0.006
ate-0.006
 best-0.006
 most-0.006
Token.
Token ID13
Feature activation-8.19e-01

Pos logit contributions
 ourselves+0.023
Sorry+0.022
ate+0.022
iring+0.022
 most+0.021
Neg logit contributions
 pad-0.015
 sip-0.015
 web-0.015
 coc-0.014
run-0.014
Token Acc
Token ID6366
Feature activation-4.54e+00

Pos logit contributions
 ourselves+0.017
 best+0.016
 neither+0.016
 most+0.016
ate+0.015
Neg logit contributions
 sip-0.018
 coc-0.018
igsaw-0.018
 web-0.018
run-0.018
Tokenidents
Token ID3231
Feature activation+1.61e+00

Pos logit contributions
 known+0.107
 fastest+0.101
 wealthiest+0.092
 most+0.091
 richest+0.090
Neg logit contributions
lee-0.090
 pad-0.088
 sip-0.087
vel-0.084
 fingertips-0.084
Token happen
Token ID1645
Feature activation-8.65e-01

Pos logit contributions
 coc+0.034
ocket+0.033
 sip+0.032
 pad+0.032
igsaw+0.032
Neg logit contributions
 ourselves-0.040
 neither-0.037
 best-0.036
 most-0.036
ate-0.035
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
Sorry+0.019
 ourselves+0.019
eday+0.019
 neither+0.018
ate+0.018
Neg logit contributions
 coc-0.016
 web-0.016
 sip-0.016
 pad-0.016
vel-0.015
Token It
Token ID632
Feature activation+1.26e-01

Pos logit contributions
 ourselves+0.025
 best+0.024
ics+0.024
 neither+0.024
ate+0.023
Neg logit contributions
run-0.020
 sip-0.019
 web-0.019
 coc-0.018
 pad-0.017
Token's
Token ID338
Feature activation-6.44e-01

Pos logit contributions
 sip+0.003
 coc+0.002
 web+0.002
 bang+0.002
 pad+0.002
Neg logit contributions
 ourselves-0.003
 yourselves-0.003
Sorry-0.003
ate-0.003
iring-0.003
Token's
Token ID338
Feature activation-4.44e-01

Pos logit contributions
 sip+0.007
 coc+0.007
 web+0.006
run+0.006
igsaw+0.006
Neg logit contributions
 ourselves-0.007
Sorry-0.007
 yourselves-0.007
 neither-0.007
ate-0.007
Token okay
Token ID8788
Feature activation-4.51e-01

Pos logit contributions
 ourselves+0.007
 neither+0.006
Sorry+0.006
ate+0.006
Deal+0.006
Neg logit contributions
 coc-0.013
 sip-0.012
ocket-0.012
 pad-0.012
 web-0.012
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
 ourselves+0.007
ate+0.007
 neither+0.007
Sorry+0.006
 most+0.006
Neg logit contributions
 coc-0.013
 sip-0.013
 web-0.012
run-0.012
ocket-0.012
Token Fiona
Token ID42207
Feature activation-1.58e-01

Pos logit contributions
 coc+0.022
 sip+0.021
ocket+0.021
igsaw+0.021
 pad+0.020
Neg logit contributions
 ourselves-0.025
 neither-0.024
 best-0.023
 most-0.023
ate-0.023
Token.
Token ID13
Feature activation-2.08e-01

Pos logit contributions
 ourselves+0.003
ate+0.003
Sorry+0.003
 neither+0.003
 most+0.003
Neg logit contributions
 sip-0.004
 coc-0.004
 web-0.004
ocket-0.004
 pad-0.004
Token Acc
Token ID6366
Feature activation-4.50e+00

Pos logit contributions
 ourselves+0.004
 best+0.004
 neither+0.004
Sorry+0.004
 most+0.004
Neg logit contributions
run-0.005
 sip-0.004
igsaw-0.004
 coc-0.004
 web-0.004
Tokenidents
Token ID3231
Feature activation+1.70e+00

Pos logit contributions
 known+0.102
 fastest+0.095
 wealthiest+0.085
 most+0.085
 richest+0.085
Neg logit contributions
lee-0.097
 fingertips-0.092
 pad-0.091
anna-0.089
 sip-0.087
Token happen
Token ID1645
Feature activation-9.13e-01

Pos logit contributions
 coc+0.040
ocket+0.039
 pad+0.038
igsaw+0.037
 sip+0.037
Neg logit contributions
 ourselves-0.039
 neither-0.036
 most-0.035
 best-0.035
ate-0.033
Token."
Token ID526
Feature activation+3.82e-01

Pos logit contributions
Sorry+0.019
 ourselves+0.018
 neither+0.018
ate+0.017
eday+0.017
Neg logit contributions
 coc-0.019
 sip-0.019
 web-0.019
 pad-0.019
book-0.018
Token Then
Token ID3244
Feature activation-2.51e+00

Pos logit contributions
ocket+0.012
 coc+0.012
igsaw+0.012
 sip+0.012
 pad+0.011
Neg logit contributions
 ourselves-0.005
 neither-0.004
 best-0.004
 most-0.004
ate-0.004
Token they
Token ID484
Feature activation-4.18e-01

Pos logit contributions
ate+0.041
 ourselves+0.040
Deal+0.038
important+0.038
 yourselves+0.037
Neg logit contributions
 sip-0.070
 coc-0.070
 web-0.064
run-0.063
 pad-0.061
Token no
Token ID645
Feature activation-1.62e+00

Pos logit contributions
 ourselves+0.014
 neither+0.013
 most+0.012
 best+0.012
 yourselves+0.011
Neg logit contributions
ocket-0.035
 coc-0.033
 sip-0.033
roach-0.032
igsaw-0.032
Token time
Token ID640
Feature activation-2.42e-01

Pos logit contributions
 best+0.038
 ourselves+0.038
 neither+0.038
 most+0.037
 fastest+0.035
Neg logit contributions
ocket-0.034
 coc-0.032
 sip-0.032
igsaw-0.031
book-0.031
Token."
Token ID526
Feature activation-1.53e+00

Pos logit contributions
 ourselves+0.004
 neither+0.004
 best+0.004
 most+0.004
ate+0.004
Neg logit contributions
 coc-0.006
 sip-0.006
ocket-0.006
igsaw-0.006
 pad-0.006
Token
Token ID198
Feature activation-4.53e-01

Pos logit contributions
 ourselves+0.028
 neither+0.026
ate+0.026
 best+0.026
 most+0.025
Neg logit contributions
 coc-0.039
 sip-0.039
 pad-0.037
igsaw-0.037
ocket-0.037
Token
Token ID198
Feature activation-3.02e-01

Pos logit contributions
 ourselves+0.007
 neither+0.006
ate+0.006
 best+0.006
 most+0.005
Neg logit contributions
 coc-0.014
 sip-0.013
ocket-0.013
 pad-0.013
igsaw-0.013
TokenEnc
Token ID27195
Feature activation-4.46e+00

Pos logit contributions
 ourselves+0.005
 neither+0.005
 best+0.005
 most+0.005
ate+0.004
Neg logit contributions
 coc-0.008
 sip-0.008
igsaw-0.008
 pad-0.008
ocket-0.008
Tokenour
Token ID454
Feature activation+5.41e-01

Pos logit contributions
 most+0.086
 ones+0.085
 ourselves+0.081
 yourselves+0.081
 best+0.079
Neg logit contributions
vel-0.094
igsaw-0.093
 pad-0.093
ocket-0.091
 sip-0.090
Tokenaged
Token ID1886
Feature activation-1.43e+00

Pos logit contributions
 coc+0.014
 sip+0.014
ocket+0.014
igsaw+0.013
 pad+0.013
Neg logit contributions
 ourselves-0.010
 neither-0.009
 best-0.009
 most-0.009
ate-0.009
Token by
Token ID416
Feature activation+5.18e-01

Pos logit contributions
 ourselves+0.029
 neither+0.027
ate+0.026
 most+0.026
Sorry+0.026
Neg logit contributions
 coc-0.035
 sip-0.034
ocket-0.033
igsaw-0.033
 pad-0.032
Token the
Token ID262
Feature activation-8.41e-01

Pos logit contributions
 coc+0.014
ocket+0.014
 pad+0.013
 sip+0.013
book+0.013
Neg logit contributions
 ourselves-0.010
 neither-0.010
 best-0.009
 most-0.009
 yourselves-0.008
Token eagle
Token ID31176
Feature activation-7.64e-01

Pos logit contributions
 best+0.017
 most+0.017
 ourselves+0.016
 neither+0.015
 fastest+0.015
Neg logit contributions
ocket-0.020
 sip-0.020
book-0.020
run-0.019
 coc-0.019
Token�
Token ID129
Feature activation+1.08e+00

Pos logit contributions
 ourselves+0.014
iring+0.014
 most+0.013
eday+0.013
 fastest+0.013
Neg logit contributions
 pad-0.014
run-0.014
vel-0.014
 web-0.014
book-0.014
Token�
Token ID241
Feature activation-1.78e+00

Pos logit contributions
 coc+0.027
 sip+0.026
ocket+0.025
igsaw+0.025
 pad+0.024
Neg logit contributions
 ourselves-0.022
 neither-0.020
ate-0.020
 best-0.019
 most-0.019
TokenIts
Token ID20459
Feature activation-3.04e-01

Pos logit contributions
 ourselves+0.027
 neither+0.025
 best+0.024
ate+0.024
 most+0.023
Neg logit contributions
 coc-0.051
 sip-0.051
 pad-0.049
ocket-0.049
run-0.049
Token okay
Token ID8788
Feature activation+1.78e-02

Pos logit contributions
 ourselves+0.006
 neither+0.005
Sorry+0.005
ate+0.005
Deal+0.005
Neg logit contributions
 coc-0.007
 sip-0.007
 web-0.007
igsaw-0.007
ocket-0.007
Token.
Token ID13
Feature activation-5.80e-02

Pos logit contributions
run+0.000
 sip+0.000
 coc+0.000
 pad+0.000
 web+0.000
Neg logit contributions
 ourselves-0.000
ate-0.000
 neither-0.000
 most-0.000
 fastest-0.000
Token Acc
Token ID6366
Feature activation-4.44e+00

Pos logit contributions
 ourselves+0.001
 neither+0.001
 best+0.001
 most+0.001
ate+0.001
Neg logit contributions
 coc-0.001
 sip-0.001
igsaw-0.001
run-0.001
ocket-0.001
Tokenidents
Token ID3231
Feature activation+1.77e+00

Pos logit contributions
 known+0.101
 fastest+0.095
 wealthiest+0.087
 most+0.086
 richest+0.085
Neg logit contributions
lee-0.091
 fingertips-0.089
 pad-0.089
vel-0.088
 sip-0.085
Token happen
Token ID1645
Feature activation-1.06e+00

Pos logit contributions
 coc+0.039
ocket+0.038
 sip+0.037
 pad+0.037
igsaw+0.036
Neg logit contributions
 ourselves-0.042
 neither-0.039
 best-0.038
 most-0.038
ate-0.037
Token.
Token ID13
Feature activation-4.72e-01

Pos logit contributions
Sorry+0.023
 ourselves+0.023
 neither+0.022
ate+0.022
eday+0.022
Neg logit contributions
 pad-0.020
 coc-0.020
 sip-0.020
book-0.020
 web-0.020
Tokenâ
Token ID22940
Feature activation-2.11e+00

Pos logit contributions
 ourselves+0.010
 neither+0.009
 best+0.009
ate+0.009
 most+0.009
Neg logit contributions
run-0.011
 coc-0.010
 pad-0.010
igsaw-0.010
 sip-0.010
Token€
Token ID26391
Feature activation-8.03e-01

Pos logit contributions
 ourselves+0.043
 neither+0.042
 best+0.040
 most+0.039
iring+0.038
Neg logit contributions
 sip-0.050
 coc-0.049
igsaw-0.048
ocket-0.048
book-0.047
Token or
Token ID393
Feature activation-1.55e+00

Pos logit contributions
 ourselves+0.002
 neither+0.002
 best+0.002
 most+0.002
ate+0.002
Neg logit contributions
 coc-0.002
 sip-0.002
ocket-0.002
igsaw-0.002
 pad-0.002
Tokenchard
Token ID30215
Feature activation-1.61e+00

Pos logit contributions
 ourselves+0.026
 neither+0.025
 most+0.023
 best+0.022
ate+0.022
Neg logit contributions
 coc-0.043
igsaw-0.043
ocket-0.042
andel-0.042
 pad-0.041
Token and
Token ID290
Feature activation-4.35e-01

Pos logit contributions
 ourselves+0.032
Sorry+0.031
 neither+0.029
mm+0.028
 yourselves+0.028
Neg logit contributions
 coc-0.040
 sip-0.038
 pad-0.038
ocket-0.036
 web-0.036
Token fill
Token ID6070
Feature activation-1.26e+00

Pos logit contributions
 ourselves+0.011
 neither+0.010
 most+0.010
 best+0.010
ate+0.010
Neg logit contributions
 coc-0.009
ocket-0.008
igsaw-0.008
 sip-0.008
andel-0.008
Token his
Token ID465
Feature activation-1.24e+00

Pos logit contributions
 ourselves+0.028
 neither+0.027
Sorry+0.027
Deal+0.026
 best+0.026
Neg logit contributions
 sip-0.027
 coc-0.027
ocket-0.025
igsaw-0.024
book-0.024
Token wagon
Token ID31860
Feature activation-4.44e+00

Pos logit contributions
 ourselves+0.028
 neither+0.026
 best+0.025
 most+0.025
ate+0.025
Neg logit contributions
 coc-0.028
 sip-0.027
ocket-0.026
igsaw-0.026
 pad-0.026
Token with
Token ID351
Feature activation+9.19e-02

Pos logit contributions
Sorry+0.082
 ourselves+0.078
 neither+0.076
mm+0.073
 yourselves+0.073
Neg logit contributions
 sip-0.117
 coc-0.114
 gob-0.106
igsaw-0.105
 web-0.104
Token juicy
Token ID35075
Feature activation+1.33e+00

Pos logit contributions
 coc+0.002
 sip+0.002
ocket+0.002
 pad+0.002
igsaw+0.002
Neg logit contributions
 ourselves-0.002
 neither-0.002
ate-0.002
 best-0.002
Sorry-0.002
Token red
Token ID2266
Feature activation+1.79e-02

Pos logit contributions
 coc+0.033
 sip+0.032
ocket+0.032
igsaw+0.031
 pad+0.031
Neg logit contributions
 ourselves-0.026
 neither-0.024
 best-0.024
 most-0.023
ate-0.023
Token apples
Token ID22514
Feature activation-9.68e-01

Pos logit contributions
ocket+0.000
 coc+0.000
igsaw+0.000
 sip+0.000
 web+0.000
Neg logit contributions
 ourselves-0.000
 neither-0.000
 best-0.000
ate-0.000
 most-0.000
Token.
Token ID13
Feature activation+7.87e-01

Pos logit contributions
 ourselves+0.020
 neither+0.019
Sorry+0.018
ate+0.018
 most+0.018
Neg logit contributions
 coc-0.023
 sip-0.022
 pad-0.022
ocket-0.022
igsaw-0.021
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
 coc+0.010
 sip+0.010
ocket+0.010
igsaw+0.009
 pad+0.009
Neg logit contributions
 ourselves-0.007
 neither-0.007
 best-0.007
 most-0.007
ate-0.007
Token a
Token ID257
Feature activation+2.36e+00

Pos logit contributions
 coc+0.027
 sip+0.027
ocket+0.026
igsaw+0.026
 pad+0.025
Neg logit contributions
 ourselves-0.020
 neither-0.019
 most-0.018
 best-0.018
ate-0.018
Token victim
Token ID3117
Feature activation+1.88e+00

Pos logit contributions
oline+0.067
ocket+0.064
vet+0.061
andel+0.060
igsaw+0.060
Neg logit contributions
 best-0.042
 most-0.041
 still-0.035
Sorry-0.035
 neither-0.034
Token of
Token ID286
Feature activation+8.43e-01

Pos logit contributions
 coc+0.044
 sip+0.043
ocket+0.042
igsaw+0.042
 pad+0.040
Neg logit contributions
 ourselves-0.040
 neither-0.039
 best-0.037
 most-0.036
iring-0.034
Token the
Token ID262
Feature activation+6.38e-01

Pos logit contributions
 coc+0.026
 sip+0.026
igsaw+0.025
 pad+0.025
ocket+0.025
Neg logit contributions
 ourselves-0.012
 neither-0.011
 best-0.011
 most-0.011
 yourselves-0.010
Token poison
Token ID8764
Feature activation+3.22e+00

Pos logit contributions
run+0.014
igsaw+0.014
book+0.014
 sip+0.013
ocket+0.013
Neg logit contributions
 best-0.014
 ourselves-0.014
 most-0.014
 neither-0.013
Sorry-0.013
Token.
Token ID13
Feature activation+1.41e+00

Pos logit contributions
igsaw+0.082
 coc+0.080
andel+0.079
ocket+0.078
 sip+0.077
Neg logit contributions
 ourselves-0.058
 best-0.057
 neither-0.057
 most-0.056
 when-0.053
Token
Token ID198
Feature activation+1.14e+00

Pos logit contributions
 coc+0.041
 sip+0.041
ocket+0.040
 web+0.039
 pad+0.039
Neg logit contributions
 ourselves-0.021
 neither-0.020
 most-0.019
ate-0.019
 best-0.018
Token
Token ID198
Feature activation+1.35e+00

Pos logit contributions
 coc+0.033
 sip+0.032
 pad+0.032
ocket+0.032
igsaw+0.032
Neg logit contributions
 ourselves-0.017
ate-0.016
They-0.015
Sorry-0.015
 neither-0.015
TokenLess
Token ID22058
Feature activation-5.26e-01

Pos logit contributions
 coc+0.046
 sip+0.046
ocket+0.045
 pad+0.044
igsaw+0.044
Neg logit contributions
 ourselves-0.014
 neither-0.012
Sorry-0.012
ate-0.011
 most-0.011
Tokenon
Token ID261
Feature activation-6.95e-01

Pos logit contributions
 ourselves+0.015
 neither+0.014
ate+0.014
 best+0.013
 most+0.013
Neg logit contributions
 coc-0.009
 sip-0.009
 pad-0.008
igsaw-0.008
ocket-0.008
Token A
Token ID317
Feature activation+2.48e+00

Pos logit contributions
 coc+0.049
ocket+0.049
 sip+0.047
 web+0.047
vel+0.046
Neg logit contributions
 ourselves-0.023
 neither-0.022
 most-0.022
 best-0.020
ate-0.020
Token big
Token ID1263
Feature activation+1.57e+00

Pos logit contributions
ocket+0.065
igsaw+0.064
run+0.064
 sip+0.064
 coc+0.063
Neg logit contributions
 best-0.046
 most-0.045
 ourselves-0.041
 neither-0.040
Sorry-0.039
Token pig
Token ID12967
Feature activation+2.96e-01

Pos logit contributions
run+0.042
ocket+0.041
 coc+0.041
igsaw+0.041
 sip+0.041
Neg logit contributions
 best-0.028
 most-0.027
 ourselves-0.026
 neither-0.026
 fastest-0.023
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
 pad+0.005
ocket+0.005
 sip+0.005
 web+0.005
 coc+0.005
Neg logit contributions
 ourselves-0.008
Sorry-0.008
 neither-0.007
 yourselves-0.007
Deal-0.007
Token a
Token ID257
Feature activation+2.42e+00

Pos logit contributions
 coc+0.028
 sip+0.028
igsaw+0.027
ocket+0.027
 pad+0.027
Neg logit contributions
 ourselves-0.018
 neither-0.017
 most-0.017
 best-0.017
ate-0.016
Token fluffy
Token ID39145
Feature activation+4.00e+00

Pos logit contributions
igsaw+0.064
run+0.063
 sip+0.063
 coc+0.063
ocket+0.063
Neg logit contributions
 best-0.044
 most-0.043
 neither-0.038
 ourselves-0.037
 still-0.036
Token sheep
Token ID15900
Feature activation+4.86e-01

Pos logit contributions
igsaw+0.106
run+0.104
 coc+0.104
 sip+0.103
ocket+0.103
Neg logit contributions
 best-0.068
 most-0.067
 ourselves-0.066
 neither-0.062
ate-0.059
Token,
Token ID11
Feature activation+1.27e+00

Pos logit contributions
 coc+0.014
igsaw+0.013
 sip+0.013
ocket+0.013
andel+0.013
Neg logit contributions
 ourselves-0.008
 neither-0.008
 best-0.007
 most-0.007
ate-0.007
Token a
Token ID257
Feature activation+2.32e+00

Pos logit contributions
 pad+0.028
 sip+0.028
 coc+0.028
 web+0.026
ocket+0.026
Neg logit contributions
 ourselves-0.029
ate-0.026
 neither-0.026
Sorry-0.026
iring-0.025
Token tiny
Token ID7009
Feature activation+2.91e+00

Pos logit contributions
igsaw+0.062
ocket+0.060
run+0.060
 sip+0.059
 coc+0.059
Neg logit contributions
 best-0.043
 most-0.042
 neither-0.036
 ourselves-0.036
 still-0.035
Token mouse
Token ID10211
Feature activation+2.04e+00

Pos logit contributions
igsaw+0.087
ocket+0.084
book+0.082
run+0.082
oline+0.081
Neg logit contributions
 best-0.043
 most-0.040
 neither-0.035
 ourselves-0.033
 when-0.032
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
 coc+0.038
 sip+0.038
ocket+0.037
igsaw+0.036
 pad+0.036
Neg logit contributions
 ourselves-0.021
 neither-0.019
 best-0.018
 most-0.018
ate-0.018
TokenOne
Token ID3198
Feature activation-1.31e-02

Pos logit contributions
 coc+0.044
 sip+0.043
igsaw+0.042
 pad+0.041
ocket+0.041
Neg logit contributions
 ourselves-0.022
 neither-0.021
 best-0.019
 most-0.019
ate-0.018
Token day
Token ID1110
Feature activation+9.83e-01

Pos logit contributions
 best+0.000
 most+0.000
 neither+0.000
 ourselves+0.000
ate+0.000
Neg logit contributions
 coc-0.000
ocket-0.000
 sip-0.000
 pad-0.000
igsaw-0.000
Token,
Token ID11
Feature activation+1.18e+00

Pos logit contributions
 coc+0.030
 sip+0.030
ocket+0.029
igsaw+0.028
 pad+0.028
Neg logit contributions
 ourselves-0.014
 neither-0.013
 best-0.012
 most-0.012
ate-0.012
Token the
Token ID262
Feature activation+3.30e-01

Pos logit contributions
 coc+0.030
 sip+0.030
igsaw+0.028
ocket+0.028
 web+0.028
Neg logit contributions
 ourselves-0.021
ate-0.020
Sorry-0.019
Deal-0.019
 neither-0.019
Token furry
Token ID40857
Feature activation+3.43e+00

Pos logit contributions
 sip+0.008
 coc+0.008
igsaw+0.008
run+0.008
ocket+0.008
Neg logit contributions
 most-0.006
 ourselves-0.006
 best-0.006
 neither-0.006
ate-0.006
Token body
Token ID1767
Feature activation+1.02e+00

Pos logit contributions
run+0.079
igsaw+0.079
 sip+0.078
ocket+0.077
book+0.077
Neg logit contributions
 most-0.072
 best-0.072
ate-0.068
 when-0.066
 still-0.065
Token met
Token ID1138
Feature activation+4.74e-01

Pos logit contributions
 coc+0.024
 sip+0.024
ocket+0.023
igsaw+0.023
 pad+0.023
Neg logit contributions
 ourselves-0.021
 neither-0.020
 best-0.019
 most-0.019
ate-0.019
Token a
Token ID257
Feature activation+2.22e+00

Pos logit contributions
 sip+0.015
 coc+0.015
igsaw+0.015
ocket+0.015
andel+0.015
Neg logit contributions
 ourselves-0.006
 best-0.005
 most-0.005
 neither-0.005
ate-0.005
Token wise
Token ID10787
Feature activation+2.29e+00

Pos logit contributions
ocket+0.061
oline+0.060
run+0.059
andel+0.058
 sip+0.058
Neg logit contributions
 best-0.039
 most-0.038
 neither-0.033
 still-0.032
 fastest-0.031
Token old
Token ID1468
Feature activation+1.37e+00

Pos logit contributions
 sip+0.070
run+0.066
ocket+0.066
 coc+0.066
book+0.064
Neg logit contributions
 best-0.032
 most-0.031
ate-0.027
 neither-0.027
 ourselves-0.027
TokenDo
Token ID5211
Feature activation+3.00e-01

Pos logit contributions
 coc+0.006
 sip+0.005
igsaw+0.005
 web+0.005
ocket+0.005
Neg logit contributions
 ourselves-0.003
Sorry-0.003
 neither-0.003
 most-0.003
ate-0.003
Token you
Token ID345
Feature activation+2.89e+00

Pos logit contributions
 coc+0.009
ocket+0.009
igsaw+0.008
 sip+0.008
 microsc+0.008
Neg logit contributions
 ourselves-0.004
ate-0.004
Sorry-0.004
 neither-0.004
iring-0.004
Token have
Token ID423
Feature activation+7.98e-01

Pos logit contributions
 coc+0.068
igsaw+0.068
andel+0.064
 web+0.064
run+0.063
Neg logit contributions
 ourselves-0.059
 best-0.057
 most-0.057
 neither-0.057
ate-0.054
Token any
Token ID597
Feature activation+2.98e-01

Pos logit contributions
 sip+0.025
 coc+0.024
igsaw+0.024
 pad+0.024
 web+0.024
Neg logit contributions
 ourselves-0.011
 best-0.010
 neither-0.010
 most-0.010
ate-0.009
Token hay
Token ID27678
Feature activation-6.84e-01

Pos logit contributions
 coc+0.007
 sip+0.007
ocket+0.007
igsaw+0.007
 pad+0.007
Neg logit contributions
 ourselves-0.006
 neither-0.006
 best-0.006
 most-0.006
ate-0.006
Token I
Token ID314
Feature activation+3.52e+00

Pos logit contributions
 ourselves+0.012
 neither+0.012
 best+0.011
 most+0.011
ate+0.011
Neg logit contributions
 coc-0.018
 sip-0.018
ocket-0.017
igsaw-0.017
 pad-0.017
Token can
Token ID460
Feature activation+1.44e+00

Pos logit contributions
 coc+0.087
igsaw+0.085
andel+0.083
 web+0.081
 sip+0.078
Neg logit contributions
 ourselves-0.068
ate-0.066
 neither-0.066
 most-0.066
 still-0.065
Token use
Token ID779
Feature activation+1.22e+00

Pos logit contributions
 coc+0.032
 sip+0.031
ocket+0.030
igsaw+0.030
 pad+0.030
Neg logit contributions
 ourselves-0.032
 neither-0.030
 best-0.029
 most-0.029
ate-0.029
Token?"
Token ID1701
Feature activation+7.33e-01

Pos logit contributions
 coc+0.035
 sip+0.034
igsaw+0.033
ocket+0.033
 pad+0.033
Neg logit contributions
 ourselves-0.020
 neither-0.018
 most-0.018
 best-0.017
ate-0.017
Token
Token ID198
Feature activation+1.52e+00

Pos logit contributions
 coc+0.023
ocket+0.022
 sip+0.022
 pad+0.021
igsaw+0.021
Neg logit contributions
 ourselves-0.010
 neither-0.009
 most-0.009
ate-0.008
Sorry-0.008
Token
Token ID198
Feature activation+1.64e+00

Pos logit contributions
 coc+0.048
 sip+0.047
ocket+0.047
 pad+0.047
igsaw+0.046
Neg logit contributions
 ourselves-0.019
ate-0.017
 neither-0.016
Sorry-0.016
 most-0.016
Token time
Token ID640
Feature activation+3.45e-01

Pos logit contributions
ocket+0.032
run+0.032
roach+0.031
 sip+0.030
igsaw+0.029
Neg logit contributions
 best-0.030
 most-0.029
 when-0.027
ate-0.026
 ourselves-0.026
Token.
Token ID13
Feature activation+3.98e-01

Pos logit contributions
 coc+0.009
 sip+0.009
ocket+0.009
igsaw+0.009
 web+0.009
Neg logit contributions
 ourselves-0.006
 neither-0.006
 best-0.006
 most-0.006
ate-0.005
Token Suddenly
Token ID24975
Feature activation-9.77e-01

Pos logit contributions
 sip+0.010
 coc+0.010
igsaw+0.010
 pad+0.010
ocket+0.010
Neg logit contributions
 ourselves-0.007
 neither-0.007
 best-0.007
ate-0.007
 most-0.007
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
 ourselves+0.018
Deal+0.018
ate+0.017
Sorry+0.017
iring+0.017
Neg logit contributions
 sip-0.024
 coc-0.024
igsaw-0.023
book-0.022
ocket-0.022
Token a
Token ID257
Feature activation+2.55e+00

Pos logit contributions
 ourselves+0.002
ate+0.002
Deal+0.002
Sorry+0.002
iring+0.002
Neg logit contributions
 sip-0.003
 coc-0.003
igsaw-0.003
book-0.003
ocket-0.003
Token loud
Token ID7812
Feature activation+3.62e+00

Pos logit contributions
ocket+0.068
oline+0.067
 sip+0.067
run+0.066
ulp+0.065
Neg logit contributions
 most-0.044
 best-0.043
Sorry-0.039
 neither-0.037
 still-0.036
Token voice
Token ID3809
Feature activation+2.18e+00

Pos logit contributions
ocket+0.103
 coc+0.100
run+0.094
 microsc+0.093
igsaw+0.092
Neg logit contributions
 neither-0.056
man-0.054
 most-0.051
 best-0.051
 not-0.050
Token bo
Token ID1489
Feature activation+2.00e+00

Pos logit contributions
 coc+0.066
igsaw+0.064
 pad+0.061
 web+0.060
 sip+0.059
Neg logit contributions
 neither-0.033
 ourselves-0.032
Sorry-0.030
 most-0.030
 when-0.030
Tokenomed
Token ID12657
Feature activation+3.01e+00

Pos logit contributions
 coc+0.035
 sip+0.030
 web+0.030
 pad+0.027
ocket+0.027
Neg logit contributions
 ourselves-0.056
 neither-0.055
Sorry-0.055
ate-0.054
 best-0.053
Token in
Token ID287
Feature activation+5.00e-01

Pos logit contributions
 coc+0.082
ocket+0.080
 sip+0.079
igsaw+0.077
 web+0.077
Neg logit contributions
 ourselves-0.052
 neither-0.049
 best-0.047
ate-0.047
iring-0.047
Token the
Token ID262
Feature activation-5.12e-01

Pos logit contributions
 coc+0.014
ocket+0.014
 pad+0.014
 web+0.014
igsaw+0.014
Neg logit contributions
 ourselves-0.008
 neither-0.008
 most-0.007
Sorry-0.007
 yourselves-0.007
Token day
Token ID1110
Feature activation-2.81e-01

Pos logit contributions
 best+0.012
 ourselves+0.012
 neither+0.012
 most+0.012
ate+0.011
Neg logit contributions
 coc-0.020
ocket-0.019
run-0.019
igsaw-0.019
 sip-0.019
Token,
Token ID11
Feature activation+1.54e-01

Pos logit contributions
 ourselves+0.005
ate+0.004
 neither+0.004
 most+0.004
 best+0.004
Neg logit contributions
 sip-0.008
 coc-0.008
ocket-0.008
igsaw-0.007
 pad-0.007
Token she
Token ID673
Feature activation+1.33e+00

Pos logit contributions
 sip+0.004
 coc+0.004
book+0.004
 web+0.004
ocket+0.004
Neg logit contributions
 ourselves-0.003
ate-0.003
 fastest-0.002
iring-0.002
 best-0.002
Token found
Token ID1043
Feature activation-8.54e-01

Pos logit contributions
 sip+0.026
 coc+0.025
 pad+0.024
ocket+0.023
 web+0.023
Neg logit contributions
 ourselves-0.036
 neither-0.032
iring-0.032
 most-0.031
 yourselves-0.031
Token a
Token ID257
Feature activation+9.32e-01

Pos logit contributions
 ourselves+0.015
 neither+0.013
Sorry+0.013
ate+0.013
 best+0.012
Neg logit contributions
 coc-0.024
 sip-0.023
ocket-0.023
 web-0.022
 pad-0.022
Token grape
Token ID30777
Feature activation+1.09e+00

Pos logit contributions
run+0.027
ocket+0.026
oline+0.026
igsaw+0.026
ez+0.026
Neg logit contributions
 best-0.015
 most-0.015
 neither-0.012
 still-0.011
 ourselves-0.011
Token that
Token ID326
Feature activation-7.54e-01

Pos logit contributions
run+0.033
igsaw+0.033
andel+0.032
roach+0.032
 fingertips+0.032
Neg logit contributions
 best-0.015
 when-0.015
 most-0.014
 neither-0.014
 next-0.013
Token was
Token ID373
Feature activation+4.19e-01

Pos logit contributions
 ourselves+0.015
 neither+0.014
ate+0.013
Sorry+0.013
 yourselves+0.013
Neg logit contributions
 coc-0.018
 sip-0.018
ocket-0.018
 web-0.017
book-0.017
Token unknown
Token ID6439
Feature activation-4.89e-01

Pos logit contributions
igsaw+0.013
 coc+0.012
 sip+0.012
ocket+0.012
roach+0.012
Neg logit contributions
 best-0.006
 most-0.006
 neither-0.006
 not-0.005
 still-0.005
Token to
Token ID284
Feature activation+1.06e+00

Pos logit contributions
 when+0.006
 best+0.006
 most+0.006
 ourselves+0.006
 neither+0.006
Neg logit contributions
igsaw-0.015
vel-0.015
ocket-0.015
 sip-0.015
run-0.015
Token her
Token ID607
Feature activation+1.10e+00

Pos logit contributions
run+0.034
igsaw+0.034
 pad+0.033
 coc+0.033
andel+0.032
Neg logit contributions
 ourselves-0.015
 neither-0.013
ate-0.013
 best-0.012
 most-0.012
Token,
Token ID11
Feature activation-2.44e-01

Pos logit contributions
book+0.007
 pad+0.007
igsaw+0.007
oline+0.007
run+0.007
Neg logit contributions
 ourselves-0.007
Sorry-0.007
Deal-0.007
ate-0.006
 neither-0.006
Token he
Token ID339
Feature activation+1.01e+00

Pos logit contributions
 ourselves+0.004
ate+0.004
Sorry+0.004
Deal+0.004
eday+0.003
Neg logit contributions
 coc-0.006
 sip-0.006
ocket-0.006
 bang-0.006
book-0.006
Token offered
Token ID4438
Feature activation-1.02e-02

Pos logit contributions
 coc+0.021
 sip+0.021
ocket+0.020
 pad+0.020
 web+0.019
Neg logit contributions
 ourselves-0.024
 neither-0.022
 best-0.021
 most-0.021
Sorry-0.021
Token this
Token ID428
Feature activation+2.47e-01

Pos logit contributions
 ourselves+0.000
 neither+0.000
 best+0.000
 most+0.000
ate+0.000
Neg logit contributions
 coc-0.000
 sip-0.000
igsaw-0.000
ocket-0.000
 web-0.000
Token idea
Token ID2126
Feature activation-2.24e+00

Pos logit contributions
 coc+0.006
 sip+0.006
ocket+0.006
igsaw+0.006
 pad+0.006
Neg logit contributions
 ourselves-0.005
 best-0.004
 most-0.004
 neither-0.004
ate-0.004
Token.
Token ID13
Feature activation+1.47e+00

Pos logit contributions
 ourselves+0.044
 neither+0.042
 best+0.040
 most+0.040
ate+0.039
Neg logit contributions
 coc-0.056
 sip-0.055
ocket-0.053
igsaw-0.053
 pad-0.052
Token Some
Token ID2773
Feature activation-6.96e-01

Pos logit contributions
 coc+0.039
 sip+0.039
igsaw+0.038
 pad+0.038
ocket+0.037
Neg logit contributions
 ourselves-0.026
 neither-0.024
 best-0.023
 most-0.023
Sorry-0.023
Token people
Token ID661
Feature activation+5.22e-01

Pos logit contributions
 ourselves+0.013
 neither+0.012
 best+0.012
 most+0.012
ate+0.011
Neg logit contributions
 coc-0.018
 sip-0.018
ocket-0.018
igsaw-0.017
 pad-0.017
Token thought
Token ID1807
Feature activation+2.50e+00

Pos logit contributions
 coc+0.012
 sip+0.012
ocket+0.012
igsaw+0.012
 pad+0.011
Neg logit contributions
 ourselves-0.011
 neither-0.010
 best-0.010
 most-0.010
ate-0.010
Token the
Token ID262
Feature activation-1.14e+00

Pos logit contributions
 coc+0.073
 sip+0.071
ocket+0.070
igsaw+0.069
 pad+0.069
Neg logit contributions
 ourselves-0.039
 neither-0.038
 best-0.036
 most-0.036
ate-0.034
Token idea
Token ID2126
Feature activation-1.99e+00

Pos logit contributions
 ourselves+0.023
 neither+0.022
 best+0.022
 most+0.022
ate+0.021
Neg logit contributions
 coc-0.027
 sip-0.027
ocket-0.026
igsaw-0.026
 pad-0.025
Token her
Token ID607
Feature activation+9.33e-01

Pos logit contributions
 ourselves+0.004
 when+0.004
 neither+0.004
 best+0.003
 whenever+0.003
Neg logit contributions
igsaw-0.012
 coc-0.012
andel-0.011
ocket-0.011
 web-0.011
Token how
Token ID703
Feature activation-2.58e-02

Pos logit contributions
 coc+0.025
 sip+0.024
ocket+0.024
 pad+0.024
igsaw+0.024
Neg logit contributions
 ourselves-0.017
 neither-0.016
 best-0.015
 most-0.015
ate-0.015
Token she
Token ID673
Feature activation+1.49e+00

Pos logit contributions
 ourselves+0.000
 neither+0.000
ate+0.000
 best+0.000
 most+0.000
Neg logit contributions
 coc-0.001
 sip-0.001
ocket-0.001
igsaw-0.001
 pad-0.001
Token was
Token ID373
Feature activation+1.82e-01

Pos logit contributions
 sip+0.035
 coc+0.035
ocket+0.034
 pad+0.033
 web+0.033
Neg logit contributions
 ourselves-0.031
 neither-0.029
ate-0.028
 most-0.027
 best-0.027
Token feeling
Token ID4203
Feature activation+8.17e-02

Pos logit contributions
 coc+0.004
 sip+0.004
ocket+0.004
 pad+0.004
 web+0.004
Neg logit contributions
 ourselves-0.004
 neither-0.003
ate-0.003
 most-0.003
Sorry-0.003
Token.
Token ID13
Feature activation+1.32e+00

Pos logit contributions
 coc+0.002
 sip+0.002
igsaw+0.002
ocket+0.002
 pad+0.002
Neg logit contributions
 ourselves-0.001
 neither-0.001
 best-0.001
 most-0.001
ate-0.001
Token "
Token ID366
Feature activation-2.22e+00

Pos logit contributions
ocket+0.039
 coc+0.039
andel+0.037
 sip+0.037
 web+0.036
Neg logit contributions
 ourselves-0.019
 neither-0.019
 most-0.018
 best-0.018
ate-0.018
TokenI
Token ID40
Feature activation+1.83e+00

Pos logit contributions
 ourselves+0.032
 neither+0.029
 best+0.028
ate+0.027
 most+0.026
Neg logit contributions
 coc-0.068
 sip-0.068
ocket-0.065
 pad-0.065
 web-0.063
Token'm
Token ID1101
Feature activation+4.83e-01

Pos logit contributions
 sip+0.043
 coc+0.041
ocket+0.040
 pad+0.039
run+0.038
Neg logit contributions
 ourselves-0.041
 neither-0.037
ate-0.037
 yourselves-0.036
 best-0.036
Token not
Token ID407
Feature activation+2.19e-01

Pos logit contributions
 coc+0.014
 sip+0.013
ocket+0.013
igsaw+0.013
 pad+0.013
Neg logit contributions
 ourselves-0.008
 neither-0.007
ate-0.007
 most-0.007
 best-0.007
Token feeling
Token ID4203
Feature activation+1.99e-02

Pos logit contributions
 sip+0.006
igsaw+0.006
oline+0.006
 coc+0.006
ocket+0.006
Neg logit contributions
 best-0.003
 ourselves-0.003
 neither-0.003
 most-0.003
 yourselves-0.003
TokenThe
Token ID464
Feature activation+1.56e-01

Pos logit contributions
 coc+0.050
 sip+0.049
ocket+0.047
 pad+0.047
igsaw+0.047
Neg logit contributions
 ourselves-0.018
 neither-0.017
 best-0.015
 most-0.015
ate-0.015
Token End
Token ID5268
Feature activation-1.18e+00

Pos logit contributions
 coc+0.003
 sip+0.003
ocket+0.003
igsaw+0.003
run+0.003
Neg logit contributions
 ourselves-0.004
 neither-0.003
 best-0.003
 most-0.003
ate-0.003
Token<|endoftext|>
Token ID50256
Feature activation+6.20e-02

Pos logit contributions
 ourselves+0.014
They+0.013
 neither+0.013
 best+0.013
 most+0.012
Neg logit contributions
 pad-0.039
ocket-0.038
 sip-0.037
 coc-0.037
andel-0.037
TokenOnce
Token ID7454
Feature activation-1.09e+00

Pos logit contributions
 coc+0.002
ocket+0.002
 sip+0.002
igsaw+0.002
 web+0.002
Neg logit contributions
 ourselves-0.001
ate-0.001
 most-0.001
 best-0.001
 neither-0.000
Token upon
Token ID2402
Feature activation-2.54e-01

Pos logit contributions
 ourselves+0.025
 neither+0.023
Sorry+0.022
ate+0.022
 most+0.022
Neg logit contributions
 coc-0.024
 sip-0.024
igsaw-0.023
ocket-0.023
 pad-0.022
Token a
Token ID257
Feature activation+1.69e+00

Pos logit contributions
 ourselves+0.004
 neither+0.004
 best+0.004
ate+0.004
Sorry+0.004
Neg logit contributions
 coc-0.007
 sip-0.007
ocket-0.007
 pad-0.007
igsaw-0.007
Token time
Token ID640
Feature activation-5.67e-02

Pos logit contributions
ocket+0.032
 coc+0.031
roach+0.029
andel+0.028
 sip+0.028
Neg logit contributions
 best-0.043
 neither-0.041
 most-0.041
 ourselves-0.041
Deal-0.040
Token there
Token ID612
Feature activation-7.17e-01

Pos logit contributions
 ourselves+0.001
ate+0.001
 neither+0.001
Sorry+0.001
 most+0.001
Neg logit contributions
 coc-0.002
 sip-0.002
igsaw-0.002
 pad-0.002
run-0.002
Token was
Token ID373
Feature activation+8.36e-01

Pos logit contributions
 ourselves+0.014
 neither+0.013
ate+0.012
 most+0.012
 best+0.012
Neg logit contributions
 coc-0.018
 sip-0.018
ocket-0.017
 pad-0.017
igsaw-0.017
Token a
Token ID257
Feature activation+1.70e+00

Pos logit contributions
 coc+0.022
 sip+0.021
 pad+0.020
 web+0.019
igsaw+0.019
Neg logit contributions
 ourselves-0.016
 neither-0.015
Sorry-0.015
ate-0.014
Deal-0.014
Token brave
Token ID14802
Feature activation+2.75e+00

Pos logit contributions
ocket+0.042
run+0.041
 coc+0.040
igsaw+0.040
 sip+0.040
Neg logit contributions
 best-0.033
 most-0.031
 ourselves-0.031
 neither-0.031
Sorry-0.030
Token free
Token ID1479
Feature activation+1.32e+00

Pos logit contributions
 coc+0.011
 sip+0.011
ocket+0.010
run+0.010
igsaw+0.010
Neg logit contributions
 ourselves-0.010
 neither-0.009
Sorry-0.009
mm-0.009
Deal-0.009
Token and
Token ID290
Feature activation-1.58e+00

Pos logit contributions
 coc+0.032
 sip+0.031
ocket+0.031
 pad+0.031
igsaw+0.030
Neg logit contributions
 ourselves-0.027
 neither-0.025
ate-0.024
Sorry-0.024
 most-0.023
Token alive
Token ID6776
Feature activation+1.42e+00

Pos logit contributions
 ourselves+0.032
 neither+0.030
ate+0.029
 yourselves+0.029
Sorry+0.028
Neg logit contributions
 coc-0.038
 sip-0.036
ocket-0.035
run-0.034
 pad-0.034
Token all
Token ID477
Feature activation+5.32e-01

Pos logit contributions
 coc+0.037
 sip+0.037
ocket+0.035
igsaw+0.035
 pad+0.035
Neg logit contributions
 ourselves-0.026
 neither-0.025
 best-0.024
 most-0.024
ate-0.023
Token because
Token ID780
Feature activation+5.40e-01

Pos logit contributions
 coc+0.014
 sip+0.014
ocket+0.013
igsaw+0.013
 pad+0.013
Neg logit contributions
 ourselves-0.010
 neither-0.009
 best-0.009
 most-0.009
ate-0.009
Token he
Token ID339
Feature activation+1.15e+00

Pos logit contributions
 coc+0.014
 sip+0.014
igsaw+0.014
run+0.014
ocket+0.014
Neg logit contributions
 ourselves-0.009
 neither-0.008
ate-0.008
Sorry-0.008
Deal-0.008
Token owned
Token ID6898
Feature activation+1.06e+00

Pos logit contributions
 coc+0.029
 sip+0.029
igsaw+0.029
andel+0.028
ocket+0.028
Neg logit contributions
 ourselves-0.021
 most-0.021
 neither-0.021
 best-0.020
ate-0.020
Token the
Token ID262
Feature activation+9.15e-01

Pos logit contributions
 sip+0.032
 coc+0.032
ocket+0.031
igsaw+0.031
 pad+0.030
Neg logit contributions
 ourselves-0.016
 most-0.015
 best-0.014
ate-0.014
 neither-0.014
Token oxygen
Token ID11863
Feature activation+8.05e-01

Pos logit contributions
 coc+0.021
 sip+0.021
ocket+0.020
 pad+0.020
 web+0.019
Neg logit contributions
 ourselves-0.020
 neither-0.019
ate-0.018
Sorry-0.018
 best-0.017
Token.
Token ID13
Feature activation+5.34e-01

Pos logit contributions
 coc+0.021
 sip+0.021
ocket+0.020
igsaw+0.020
 pad+0.019
Neg logit contributions
 ourselves-0.015
 neither-0.014
 best-0.014
 most-0.014
ate-0.013
Token From
Token ID3574
Feature activation-6.04e-01

Pos logit contributions
 coc+0.014
 sip+0.014
ocket+0.014
igsaw+0.014
 pad+0.014
Neg logit contributions
 ourselves-0.009
 neither-0.009
 best-0.008
 most-0.008
ate-0.008
Token,
Token ID11
Feature activation-4.76e-02

Pos logit contributions
 sip+0.015
 coc+0.015
 pad+0.015
 web+0.014
 curtain+0.014
Neg logit contributions
 ourselves-0.015
 neither-0.014
Sorry-0.014
mm-0.014
 yourselves-0.014
Token she
Token ID673
Feature activation+1.13e+00

Pos logit contributions
 ourselves+0.001
ate+0.001
 neither+0.000
Sorry+0.000
Deal+0.000
Neg logit contributions
 coc-0.002
 sip-0.002
ocket-0.001
 web-0.001
 pad-0.001
Token noticed
Token ID6810
Feature activation-8.64e-01

Pos logit contributions
 coc+0.023
 sip+0.023
 pad+0.022
ocket+0.022
 web+0.021
Neg logit contributions
 ourselves-0.028
 neither-0.025
ate-0.024
 best-0.024
 most-0.024
Token something
Token ID1223
Feature activation+1.94e-01

Pos logit contributions
 ourselves+0.015
 neither+0.013
Sorry+0.013
ate+0.013
Deal+0.013
Neg logit contributions
 coc-0.024
 sip-0.023
ocket-0.022
 web-0.022
 pad-0.022
Token in
Token ID287
Feature activation+1.66e-01

Pos logit contributions
igsaw+0.006
ocket+0.005
 pad+0.005
andel+0.005
 coc+0.005
Neg logit contributions
 best-0.003
 most-0.003
 still-0.003
 when-0.003
 neither-0.003
Token the
Token ID262
Feature activation-7.90e-01

Pos logit contributions
ocket+0.005
 pad+0.005
 coc+0.005
 sip+0.005
run+0.005
Neg logit contributions
 ourselves-0.002
 neither-0.002
 most-0.002
ate-0.002
 yourselves-0.002
Token distance
Token ID5253
Feature activation-2.18e-01

Pos logit contributions
 ourselves+0.016
 most+0.016
 best+0.015
 neither+0.015
ate+0.015
Neg logit contributions
 sip-0.019
 coc-0.018
igsaw-0.018
ocket-0.018
run-0.018
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
 ourselves+0.004
 neither+0.004
 most+0.003
 best+0.003
ate+0.003
Neg logit contributions
 coc-0.006
 sip-0.006
igsaw-0.006
ocket-0.006
 pad-0.006
Token It
Token ID632
Feature activation+2.92e-01

Pos logit contributions
ocket+0.039
 coc+0.039
ils+0.037
vel+0.037
 gob+0.037
Neg logit contributions
 most-0.013
 She-0.013
 Then-0.013
 when-0.012
 she-0.011
Token was
Token ID373
Feature activation+6.40e-01

Pos logit contributions
 coc+0.008
ocket+0.008
igsaw+0.008
 sip+0.008
 pad+0.008
Neg logit contributions
 ourselves-0.004
 neither-0.004
 most-0.004
 best-0.004
ate-0.004
Token a
Token ID257
Feature activation+7.49e-01

Pos logit contributions
igsaw+0.020
 sip+0.020
ocket+0.020
 coc+0.020
 pad+0.020
Neg logit contributions
 most-0.008
 ourselves-0.008
 best-0.008
 neither-0.007
ate-0.006
Token he
Token ID339
Feature activation+1.98e+00

Pos logit contributions
 ourselves+0.013
 neither+0.012
Sorry+0.012
ate+0.012
Deal+0.012
Neg logit contributions
 coc-0.020
ocket-0.019
run-0.018
book-0.018
 sip-0.018
Token walked
Token ID6807
Feature activation+7.59e-01

Pos logit contributions
 coc+0.048
 sip+0.047
igsaw+0.047
ocket+0.046
andel+0.044
Neg logit contributions
 ourselves-0.038
 neither-0.037
 best-0.036
 most-0.036
ate-0.036
Token.
Token ID13
Feature activation+5.32e-01

Pos logit contributions
 coc+0.018
 sip+0.017
 pad+0.017
ocket+0.017
igsaw+0.017
Neg logit contributions
 ourselves-0.016
 neither-0.015
Sorry-0.014
ate-0.014
 most-0.014
Token<|endoftext|>
Token ID50256
Feature activation-9.75e-01

Pos logit contributions
 coc+0.015
 sip+0.015
ocket+0.014
igsaw+0.014
 pad+0.014
Neg logit contributions
 ourselves-0.009
 neither-0.008
 best-0.008
 most-0.008
ate-0.007
TokenOnce
Token ID7454
Feature activation-1.83e+00

Pos logit contributions
 ourselves+0.011
ate+0.010
 best+0.010
 most+0.010
 neither+0.009
Neg logit contributions
 coc-0.032
 sip-0.032
ocket-0.031
igsaw-0.031
andel-0.031
Token upon
Token ID2402
Feature activation-6.09e-01

Pos logit contributions
 ourselves+0.044
Sorry+0.040
 neither+0.039
ate+0.039
 yourselves+0.039
Neg logit contributions
 coc-0.038
 sip-0.037
igsaw-0.036
ocket-0.036
 pad-0.036
Token a
Token ID257
Feature activation+1.03e+00

Pos logit contributions
 ourselves+0.010
 neither+0.009
 best+0.009
ate+0.009
Sorry+0.009
Neg logit contributions
 coc-0.017
 sip-0.017
ocket-0.016
 pad-0.016
 web-0.016
Token time
Token ID640
Feature activation-6.81e-01

Pos logit contributions
 coc+0.015
 sip+0.014
ocket+0.014
igsaw+0.013
 pad+0.013
Neg logit contributions
 ourselves-0.030
 neither-0.030
 best-0.029
 most-0.029
ate-0.028
Token,
Token ID11
Feature activation+4.94e-01

Pos logit contributions
 ourselves+0.013
ate+0.012
Sorry+0.012
Deal+0.012
 yourselves+0.011
Neg logit contributions
 sip-0.017
igsaw-0.017
 coc-0.017
run-0.016
 pad-0.016
Token there
Token ID612
Feature activation-1.07e+00

Pos logit contributions
 sip+0.013
 coc+0.013
 pad+0.013
run+0.013
 web+0.012
Neg logit contributions
 ourselves-0.009
 neither-0.008
Sorry-0.008
ate-0.007
 yourselves-0.007
Token was
Token ID373
Feature activation+8.53e-01

Pos logit contributions
 ourselves+0.022
 neither+0.020
Sorry+0.020
ate+0.019
 most+0.019
Neg logit contributions
 coc-0.025
 sip-0.025
 pad-0.024
ocket-0.024
 web-0.023
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
 coc+0.013
 sip+0.013
ocket+0.013
 pad+0.013
igsaw+0.012
Neg logit contributions
 ourselves-0.011
 neither-0.010
ate-0.010
 most-0.009
 best-0.009
Token They
Token ID1119
Feature activation+3.33e-01

Pos logit contributions
 ourselves+0.024
 neither+0.022
 best+0.021
 most+0.021
ate+0.021
Neg logit contributions
 sip-0.037
 coc-0.037
 pad-0.035
igsaw-0.035
ocket-0.035
Token eat
Token ID4483
Feature activation+6.64e-01

Pos logit contributions
 coc+0.007
igsaw+0.007
ocket+0.007
 pad+0.007
run+0.007
Neg logit contributions
 ourselves-0.008
 neither-0.007
 best-0.007
 most-0.007
ate-0.007
Token their
Token ID511
Feature activation-1.33e+00

Pos logit contributions
 sip+0.018
 coc+0.018
ocket+0.017
igsaw+0.017
 pad+0.017
Neg logit contributions
 ourselves-0.011
 neither-0.011
 most-0.010
 best-0.010
ate-0.010
Token sandwiches
Token ID29777
Feature activation-1.87e-01

Pos logit contributions
 ourselves+0.031
 best+0.028
 neither+0.028
 most+0.028
ate+0.027
Neg logit contributions
 coc-0.029
ocket-0.029
igsaw-0.028
 sip-0.028
 pad-0.028
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
 ourselves+0.003
 neither+0.003
 best+0.003
 most+0.003
ate+0.002
Neg logit contributions
igsaw-0.005
 coc-0.005
ocket-0.005
andel-0.005
run-0.005
Token They
Token ID1119
Feature activation+2.63e-01

Pos logit contributions
 ourselves+0.023
 neither+0.021
 best+0.020
 most+0.020
ate+0.020
Neg logit contributions
 coc-0.036
 sip-0.036
 pad-0.033
ocket-0.033
igsaw-0.033
Token are
Token ID389
Feature activation-2.03e-01

Pos logit contributions
 coc+0.006
igsaw+0.005
ocket+0.005
run+0.005
 pad+0.005
Neg logit contributions
 ourselves-0.006
 neither-0.006
 best-0.006
 most-0.006
ate-0.005
Token happy
Token ID3772
Feature activation-1.04e+00

Pos logit contributions
 ourselves+0.004
 neither+0.004
ate+0.004
Sorry+0.004
 most+0.003
Neg logit contributions
 coc-0.005
 sip-0.005
ocket-0.005
 web-0.005
 pad-0.005
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
 ourselves+0.021
 neither+0.020
 most+0.019
ate+0.019
 best+0.019
Neg logit contributions
 coc-0.025
 sip-0.025
ocket-0.024
 pad-0.024
igsaw-0.023
Token
Token ID198
Feature activation-5.07e-01

Pos logit contributions
 ourselves+0.032
 neither+0.030
 best+0.029
 most+0.028
ate+0.028
Neg logit contributions
 coc-0.041
 sip-0.041
 pad-0.038
run-0.037
book-0.037
Token gone
Token ID3750
Feature activation-5.71e-01

Pos logit contributions
 neither+0.003
 ourselves+0.003
 best+0.003
 most+0.002
 still+0.002
Neg logit contributions
igsaw-0.004
 coc-0.004
 sip-0.004
ocket-0.004
oline-0.004
Token.
Token ID13
Feature activation+4.41e-01

Pos logit contributions
 ourselves+0.011
 neither+0.010
ate+0.010
Sorry+0.010
 most+0.010
Neg logit contributions
 coc-0.014
 sip-0.014
ocket-0.014
 pad-0.014
igsaw-0.013
Token<|endoftext|>
Token ID50256
Feature activation-9.78e-01

Pos logit contributions
 coc+0.012
 sip+0.012
ocket+0.012
 pad+0.011
igsaw+0.011
Neg logit contributions
 ourselves-0.008
 neither-0.007
 most-0.007
 best-0.007
ate-0.007
TokenJohn
Token ID7554
Feature activation-7.88e-01

Pos logit contributions
 ourselves+0.010
ate+0.010
 best+0.009
 most+0.009
 neither+0.008
Neg logit contributions
 coc-0.033
 sip-0.032
ocket-0.032
andel-0.032
igsaw-0.032
Token wanted
Token ID2227
Feature activation+3.53e-01

Pos logit contributions
 ourselves+0.019
 neither+0.018
 best+0.017
 most+0.017
Sorry+0.017
Neg logit contributions
 sip-0.017
 pad-0.016
 coc-0.016
ocket-0.016
 web-0.015
Token to
Token ID284
Feature activation-2.12e-01

Pos logit contributions
 coc+0.009
 sip+0.009
ocket+0.008
igsaw+0.008
 pad+0.008
Neg logit contributions
 ourselves-0.007
 neither-0.007
 most-0.006
 best-0.006
ate-0.006
Token play
Token ID711
Feature activation-9.90e-03

Pos logit contributions
 ourselves+0.005
 neither+0.004
ate+0.004
 most+0.004
 best+0.004
Neg logit contributions
 coc-0.005
 sip-0.005
ocket-0.005
 pad-0.004
igsaw-0.004
Token catch
Token ID4929
Feature activation+4.58e-01

Pos logit contributions
 ourselves+0.000
 best+0.000
 most+0.000
ate+0.000
 neither+0.000
Neg logit contributions
ocket-0.000
 coc-0.000
 pad-0.000
igsaw-0.000
andel-0.000
Token.
Token ID13
Feature activation+5.75e-01

Pos logit contributions
ocket+0.016
 pad+0.015
 coc+0.015
andel+0.015
igsaw+0.015
Neg logit contributions
 ourselves-0.005
ate-0.004
 most-0.004
 best-0.004
 when-0.004
Token He
Token ID679
Feature activation-8.12e-02

Pos logit contributions
 pad+0.013
igsaw+0.013
run+0.013
 coc+0.013
book+0.013
Neg logit contributions
 ourselves-0.011
 neither-0.011
Sorry-0.010
ich-0.010
 best-0.010
Token found
Token ID1043
Feature activation-6.62e-01

Pos logit contributions
 neither+0.001
 most+0.001
 best+0.001
 still+0.001
 ourselves+0.001
Neg logit contributions
igsaw-0.002
 coc-0.002
ocket-0.002
andel-0.002
 sip-0.002
TokenCl
Token ID2601
Feature activation+3.93e-01

Pos logit contributions
 neither+0.002
 ourselves+0.002
 best+0.002
 most+0.002
 fastest+0.002
Neg logit contributions
 coc-0.002
 sip-0.002
ocket-0.002
vel-0.002
 pad-0.002
Tokenare
Token ID533
Feature activation+8.63e-01

Pos logit contributions
 coc+0.009
ocket+0.009
 sip+0.009
book+0.009
 pad+0.009
Neg logit contributions
 neither-0.008
 ourselves-0.008
 most-0.008
 best-0.008
 fastest-0.008
Token was
Token ID373
Feature activation-7.22e-02

Pos logit contributions
 coc+0.021
 sip+0.021
ocket+0.020
igsaw+0.020
 pad+0.020
Neg logit contributions
 ourselves-0.017
 neither-0.016
ate-0.016
 best-0.016
 most-0.016
Token so
Token ID523
Feature activation-7.25e-01

Pos logit contributions
 ourselves+0.002
Sorry+0.001
 neither+0.001
ate+0.001
Deal+0.001
Neg logit contributions
 coc-0.002
 sip-0.002
 web-0.002
run-0.002
ocket-0.002
Token relieved
Token ID22598
Feature activation-6.98e-01

Pos logit contributions
 ourselves+0.016
 neither+0.015
 yourselves+0.014
Sorry+0.014
 most+0.014
Neg logit contributions
 sip-0.016
 coc-0.016
run-0.016
ocket-0.015
igsaw-0.015
Token and
Token ID290
Feature activation-1.94e+00

Pos logit contributions
 ourselves+0.013
 neither+0.013
 best+0.012
 most+0.012
ate+0.012
Neg logit contributions
 coc-0.017
 sip-0.017
ocket-0.016
 pad-0.016
igsaw-0.016
Token took
Token ID1718
Feature activation-4.74e-01

Pos logit contributions
 ourselves+0.038
ate+0.036
 neither+0.035
 yourselves+0.034
Sorry+0.034
Neg logit contributions
 coc-0.048
 sip-0.048
 gob-0.045
ocket-0.044
 web-0.044
Token off
Token ID572
Feature activation+1.08e+00

Pos logit contributions
 ourselves+0.009
 neither+0.009
Sorry+0.008
 most+0.008
Deal+0.008
Neg logit contributions
 sip-0.012
 coc-0.012
ocket-0.011
igsaw-0.011
book-0.011
Token the
Token ID262
Feature activation-3.59e-01

Pos logit contributions
 coc+0.033
 pad+0.032
ocket+0.032
 sip+0.032
 web+0.031
Neg logit contributions
 ourselves-0.015
 neither-0.014
 best-0.013
 most-0.013
ate-0.013
Token bath
Token ID7837
Feature activation+5.45e-02

Pos logit contributions
 ourselves+0.010
 best+0.010
 most+0.010
 neither+0.009
Sorry+0.009
Neg logit contributions
 coc-0.006
ocket-0.006
 pad-0.006
 sip-0.006
run-0.005
Tokenrobe
Token ID25481
Feature activation-2.51e-01

Pos logit contributions
 sip+0.001
 coc+0.001
igsaw+0.001
ocket+0.001
book+0.001
Neg logit contributions
 ourselves-0.001
 neither-0.001
Sorry-0.001
ate-0.001
 most-0.001
Token he
Token ID339
Feature activation-3.05e-01

Pos logit contributions
ate+0.062
 ourselves+0.062
iring+0.060
 fastest+0.060
ained+0.058
Neg logit contributions
 coc-0.070
 sip-0.069
 gob-0.062
ocket-0.061
 bang-0.059
Token said
Token ID531
Feature activation-9.05e-01

Pos logit contributions
 ourselves+0.005
 neither+0.005
 most+0.004
 best+0.004
ate+0.004
Neg logit contributions
 coc-0.009
 sip-0.008
ocket-0.008
igsaw-0.008
 pad-0.008
Token.
Token ID13
Feature activation-4.18e-01

Pos logit contributions
 ourselves+0.018
 neither+0.017
 best+0.016
 most+0.016
ate+0.016
Neg logit contributions
 coc-0.023
 sip-0.022
ocket-0.022
igsaw-0.021
 pad-0.021
Token
Token ID198
Feature activation-7.32e-01

Pos logit contributions
 ourselves+0.006
 neither+0.006
 best+0.006
 most+0.006
ate+0.006
Neg logit contributions
 coc-0.012
 sip-0.012
ocket-0.012
igsaw-0.012
 pad-0.012
Token
Token ID198
Feature activation-6.71e-01

Pos logit contributions
 ourselves+0.013
 neither+0.013
 most+0.012
 best+0.012
Sorry+0.012
Neg logit contributions
 coc-0.019
 sip-0.019
run-0.018
ocket-0.018
igsaw-0.018
Token"
Token ID1
Feature activation-2.89e+00

Pos logit contributions
 ourselves+0.016
 neither+0.016
eday+0.015
 best+0.015
 most+0.015
Neg logit contributions
 coc-0.014
 pad-0.013
run-0.012
book-0.012
 sip-0.012
TokenThank
Token ID10449
Feature activation-2.70e-01

Pos logit contributions
 ourselves+0.050
 neither+0.047
 best+0.044
ate+0.042
iring+0.041
Neg logit contributions
 coc-0.079
 sip-0.078
book-0.073
 pad-0.072
ocket-0.072
Token you
Token ID345
Feature activation+7.79e-01

Pos logit contributions
 neither+0.003
 ourselves+0.003
ate+0.003
 most+0.003
 best+0.003
Neg logit contributions
 coc-0.009
 sip-0.008
ocket-0.008
igsaw-0.008
run-0.008
Token,
Token ID11
Feature activation+4.08e-01

Pos logit contributions
 coc+0.022
 sip+0.021
ocket+0.020
 pad+0.020
igsaw+0.020
Neg logit contributions
 ourselves-0.013
ate-0.012
 neither-0.012
Sorry-0.012
 most-0.011
Token Mr
Token ID1770
Feature activation-1.41e+00

Pos logit contributions
 sip+0.012
ocket+0.011
 coc+0.011
 pad+0.011
 web+0.011
Neg logit contributions
 ourselves-0.006
 neither-0.006
 best-0.006
 most-0.006
mm-0.006
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
 ourselves+0.033
 most+0.032
 neither+0.032
Sorry+0.031
iring+0.031
Neg logit contributions
 coc-0.028
ocket-0.027
 pad-0.027
 sip-0.027
book-0.025
TokenI
Token ID40
Feature activation+1.31e+00

Pos logit contributions
 ourselves+0.036
 neither+0.036
 best+0.032
ate+0.030
iring+0.030
Neg logit contributions
 coc-0.069
 sip-0.066
book-0.064
 pad-0.064
vel-0.063
Token love
Token ID1842
Feature activation+1.02e+00

Pos logit contributions
 coc+0.034
 sip+0.033
ocket+0.032
igsaw+0.032
 pad+0.031
Neg logit contributions
 ourselves-0.025
 neither-0.023
 best-0.023
 most-0.023
ate-0.022
Token helping
Token ID5742
Feature activation-5.46e-01

Pos logit contributions
 coc+0.030
 sip+0.030
ocket+0.029
igsaw+0.028
 pad+0.028
Neg logit contributions
 ourselves-0.016
 neither-0.014
 best-0.014
 most-0.013
ate-0.013
Token you
Token ID345
Feature activation+7.32e-01

Pos logit contributions
 ourselves+0.008
 neither+0.007
Sorry+0.007
Deal+0.007
 most+0.007
Neg logit contributions
 coc-0.016
 sip-0.016
 web-0.015
ocket-0.015
igsaw-0.015
Token,
Token ID11
Feature activation+3.02e-01

Pos logit contributions
 coc+0.019
 sip+0.019
 web+0.018
igsaw+0.018
 pad+0.018
Neg logit contributions
 ourselves-0.013
 neither-0.012
Sorry-0.012
 best-0.012
Deal-0.012
Token Mom
Token ID11254
Feature activation-3.46e+00

Pos logit contributions
 coc+0.008
 sip+0.008
ocket+0.008
igsaw+0.008
 pad+0.008
Neg logit contributions
 ourselves-0.005
 neither-0.005
 best-0.004
 most-0.004
ate-0.004
Tokenmy
Token ID1820
Feature activation-2.29e+00

Pos logit contributions
 ourselves+0.076
 neither+0.072
Sorry+0.072
 yourselves+0.068
Deal+0.067
Neg logit contributions
 pad-0.074
 coc-0.072
 sip-0.069
run-0.069
book-0.065
Token."
Token ID526
Feature activation-7.04e-01

Pos logit contributions
 ourselves+0.054
Sorry+0.053
 neither+0.053
iring+0.051
 yourselves+0.049
Neg logit contributions
 pad-0.043
 coc-0.043
 sip-0.041
 web-0.040
vel-0.039
Token Mitt
Token ID16627
Feature activation-3.34e-01

Pos logit contributions
 ourselves+0.013
 neither+0.012
ate+0.012
 most+0.012
 best+0.011
Neg logit contributions
 coc-0.019
 sip-0.018
 pad-0.017
ocket-0.017
 web-0.017
Tokenens
Token ID641
Feature activation-5.55e-02

Pos logit contributions
 ourselves+0.008
 neither+0.008
ate+0.008
 most+0.008
 best+0.008
Neg logit contributions
 coc-0.007
 sip-0.006
 pad-0.006
roach-0.006
ocket-0.006
Token me
Token ID502
Feature activation+1.15e+00

Pos logit contributions
 ourselves+0.001
 neither+0.001
 best+0.001
 most+0.001
ate+0.001
Neg logit contributions
 coc-0.001
 sip-0.001
ocket-0.001
igsaw-0.001
 pad-0.001
Token comfort
Token ID4467
Feature activation+1.34e-01

Pos logit contributions
 ourselves+0.034
 neither+0.032
 best+0.031
 most+0.031
ate+0.030
Neg logit contributions
 coc-0.029
ocket-0.029
igsaw-0.029
 pad-0.028
book-0.028
Token him
Token ID683
Feature activation+2.94e-01

Pos logit contributions
 coc+0.003
 sip+0.003
ocket+0.003
 pad+0.003
run+0.003
Neg logit contributions
 ourselves-0.002
 neither-0.002
ate-0.002
 most-0.002
 best-0.002
Token.
Token ID13
Feature activation-1.54e+00

Pos logit contributions
 coc+0.008
 sip+0.008
ocket+0.007
igsaw+0.007
 pad+0.007
Neg logit contributions
 ourselves-0.005
 neither-0.005
 best-0.005
 most-0.005
ate-0.005
Token
Token ID198
Feature activation-6.01e-01

Pos logit contributions
 ourselves+0.026
 neither+0.024
 best+0.024
 most+0.024
ate+0.024
Neg logit contributions
 coc-0.042
 sip-0.041
 pad-0.040
run-0.040
igsaw-0.040
Token
Token ID198
Feature activation-7.02e-01

Pos logit contributions
 ourselves+0.009
 neither+0.008
ate+0.008
 best+0.008
 most+0.008
Neg logit contributions
 coc-0.018
 sip-0.017
ocket-0.017
igsaw-0.017
 pad-0.017
Token"
Token ID1
Feature activation-3.00e+00

Pos logit contributions
 neither+0.010
 ourselves+0.010
 best+0.010
 most+0.010
ate+0.009
Neg logit contributions
 coc-0.021
 sip-0.020
igsaw-0.020
 pad-0.019
 web-0.019
TokenI
Token ID40
Feature activation+1.14e+00

Pos logit contributions
 ourselves+0.055
 neither+0.051
 best+0.050
ate+0.049
 most+0.048
Neg logit contributions
 sip-0.080
 coc-0.079
 pad-0.075
ocket-0.074
book-0.073
Token'm
Token ID1101
Feature activation-9.63e-02

Pos logit contributions
 sip+0.029
 coc+0.029
ocket+0.028
 pad+0.027
 web+0.027
Neg logit contributions
 ourselves-0.022
 neither-0.020
ate-0.020
 most-0.020
 best-0.019
Token sorry
Token ID7926
Feature activation-2.76e+00

Pos logit contributions
 ourselves+0.001
 neither+0.001
 best+0.001
 most+0.001
ate+0.001
Neg logit contributions
 coc-0.003
 sip-0.003
ocket-0.003
igsaw-0.003
 pad-0.003
Token,
Token ID11
Feature activation-5.27e-03

Pos logit contributions
 ourselves+0.040
 neither+0.037
ate+0.036
 most+0.035
Sorry+0.034
Neg logit contributions
 coc-0.083
 sip-0.082
ocket-0.079
igsaw-0.078
 pad-0.078
Token you
Token ID345
Feature activation+1.58e+00

Pos logit contributions
 ourselves+0.000
 neither+0.000
 best+0.000
 most+0.000
 yourselves+0.000
Neg logit contributions
 sip-0.000
ocket-0.000
igsaw-0.000
 coc-0.000
book-0.000
TokenL
Token ID43
Feature activation+1.83e+00

Pos logit contributions
 neither+0.008
 best+0.008
 most+0.008
 ourselves+0.008
 fastest+0.007
Neg logit contributions
 sip-0.021
 coc-0.021
 web-0.019
book-0.019
 pad-0.019
Tokenily
Token ID813
Feature activation-1.98e-01

Pos logit contributions
 coc+0.030
 sip+0.029
ocket+0.027
igsaw+0.026
 pad+0.026
Neg logit contributions
 ourselves-0.052
ate-0.050
 neither-0.049
 best-0.048
 most-0.048
Token says
Token ID1139
Feature activation-4.26e-01

Pos logit contributions
 ourselves+0.004
 neither+0.004
Sorry+0.004
iring+0.004
 most+0.004
Neg logit contributions
 coc-0.005
 sip-0.005
 pad-0.004
 web-0.004
ocket-0.004
Token,
Token ID11
Feature activation+6.50e-01

Pos logit contributions
 ourselves+0.007
ate+0.007
 most+0.006
 neither+0.006
 yourselves+0.006
Neg logit contributions
 coc-0.012
 sip-0.012
ocket-0.011
igsaw-0.011
 pad-0.011
Token "
Token ID366
Feature activation-2.91e+00

Pos logit contributions
 coc+0.018
igsaw+0.017
 sip+0.017
ocket+0.017
andel+0.016
Neg logit contributions
 neither-0.011
 ourselves-0.011
 best-0.011
Sorry-0.011
 most-0.011
TokenMom
Token ID29252
Feature activation-2.76e+00

Pos logit contributions
iring+0.051
 neither+0.050
 best+0.050
 ourselves+0.048
ate+0.046
Neg logit contributions
 coc-0.076
 sip-0.075
 tripod-0.072
 pad-0.071
book-0.070
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
 ourselves+0.064
iring+0.060
Sorry+0.059
ate+0.059
 yourselves+0.056
Neg logit contributions
 pad-0.057
 sip-0.056
run-0.052
 coc-0.051
 bang-0.051
Token you
Token ID345
Feature activation+2.59e+00

Pos logit contributions
 ourselves+0.015
 neither+0.014
 best+0.014
 most+0.013
Sorry+0.011
Neg logit contributions
 coc-0.033
andel-0.033
igsaw-0.033
 pad-0.033
 sip-0.033
Token are
Token ID389
Feature activation-1.46e+00

Pos logit contributions
 coc+0.058
 sip+0.057
ocket+0.055
igsaw+0.054
 pad+0.054
Neg logit contributions
 ourselves-0.058
 neither-0.055
ate-0.053
 most-0.052
 best-0.052
Token the
Token ID262
Feature activation-2.19e+00

Pos logit contributions
 ourselves+0.027
Sorry+0.026
 neither+0.026
ate+0.025
Deal+0.024
Neg logit contributions
 coc-0.037
ocket-0.035
 sip-0.035
 web-0.034
igsaw-0.034
Token best
Token ID1266
Feature activation-7.48e-01

Pos logit contributions
 ourselves+0.019
 neither+0.016
Sorry+0.015
ate+0.014
limits+0.013
Neg logit contributions
 coc-0.079
 pad-0.075
 sip-0.073
 web-0.073
igsaw-0.073
Token Lily
Token ID20037
Feature activation-1.13e+00

Pos logit contributions
 ourselves+0.048
ate+0.044
 neither+0.043
 best+0.042
 most+0.042
Neg logit contributions
 sip-0.060
 coc-0.059
ocket-0.056
 pad-0.056
run-0.055
Token said
Token ID531
Feature activation-1.10e+00

Pos logit contributions
 ourselves+0.021
 neither+0.020
ate+0.019
 most+0.019
 best+0.019
Neg logit contributions
 coc-0.029
 sip-0.029
 pad-0.028
ocket-0.027
 web-0.027
Token.
Token ID13
Feature activation-7.28e-01

Pos logit contributions
 ourselves+0.023
 neither+0.021
ate+0.021
 most+0.021
Sorry+0.021
Neg logit contributions
 sip-0.026
 coc-0.025
ocket-0.025
run-0.024
 pad-0.024
Token
Token ID198
Feature activation-1.01e+00

Pos logit contributions
 ourselves+0.015
 neither+0.014
 best+0.013
 most+0.013
ate+0.013
Neg logit contributions
 coc-0.018
 sip-0.017
ocket-0.017
igsaw-0.017
 pad-0.016
Token
Token ID198
Feature activation-9.63e-01

Pos logit contributions
 neither+0.018
 ourselves+0.018
 most+0.018
 best+0.018
Deal+0.017
Neg logit contributions
 coc-0.026
 sip-0.025
igsaw-0.024
run-0.024
 web-0.024
Token"
Token ID1
Feature activation-3.10e+00

Pos logit contributions
 neither+0.016
 best+0.016
 most+0.016
iring+0.014
 ourselves+0.014
Neg logit contributions
 coc-0.026
 sip-0.026
igsaw-0.025
 web-0.025
 pad-0.025
TokenOr
Token ID5574
Feature activation-2.93e+00

Pos logit contributions
 ourselves+0.056
 best+0.051
 neither+0.050
ate+0.048
iring+0.048
Neg logit contributions
 sip-0.083
 coc-0.079
 pad-0.078
book-0.076
 web-0.074
Token a
Token ID257
Feature activation+1.11e+00

Pos logit contributions
 ourselves+0.051
ate+0.046
 neither+0.046
Sorry+0.045
iring+0.044
Neg logit contributions
 coc-0.077
 sip-0.076
ocket-0.074
run-0.073
 pad-0.073
Token message
Token ID3275
Feature activation-8.30e-02

Pos logit contributions
ocket+0.025
run+0.025
andel+0.025
 coc+0.025
oline+0.024
Neg logit contributions
 best-0.024
 most-0.023
 neither-0.020
 ourselves-0.020
 fastest-0.020
Token from
Token ID422
Feature activation+9.30e-01

Pos logit contributions
 ourselves+0.001
 neither+0.001
 best+0.001
 most+0.001
ate+0.001
Neg logit contributions
 coc-0.002
 sip-0.002
igsaw-0.002
ocket-0.002
 pad-0.002
Token a
Token ID257
Feature activation+1.70e+00

Pos logit contributions
 sip+0.028
 coc+0.028
igsaw+0.027
 pad+0.027
run+0.027
Neg logit contributions
 ourselves-0.014
 most-0.012
 neither-0.012
 best-0.012
 yourselves-0.012