TOP ACTIVATIONS
MAX = 0.066

're the best rh in oc eros ever !" Tim my
Tim thought it was the best toy he had ever seen
quickly checked for a fire extingu isher , but there wasn
. Tom was the biggest of them all ! They were
. " It is the best tower ever !"
sees aw , the merry - go - round . They
! I 'm the best gol fer in the world and
the right size , the right color and the right price
yellow ball were the best of friends . And they lived
Lily thought it was the best toy car she had ever
it was the most beautiful ig loo he had ever seen
" The cave is the best shelter ." So , it
car . It is the best car ever . No one
" Hey , what 's going on ? Why are you
. It was the best birthday ever ! <|endoftext|> Once upon
not the best . The best thing is to share and
He thinks it is the best top ever . He shows
coaster , and the merry - go - round . He
smiled . It was the best night ever ! <|endoftext|> George
, thought it was the best rug ever .

MIN ACTIVATIONS
MIN = -0.056

The oct opus had eight long arms and loved to swim
It made eight tentacles into eight pieces and put them inside
and ch ia and s esame and sun flower and pumpkin
. The oct opus had eight arms and could swim very
a small sun flower called Willow . Willow was so small
. The oct opus had eight long arms and loved to
a ch ubby girl with curly hair and a big smile
says : " Dr ink some water . It
a pet ham ster named Spark y . One day ,
One day , Bob was feeling very enthusiastic . He loved
water . It had eight long arms and a round head
was a little pig named Pink y . Pink y loved
the water . It had eight long arms and a round
a unique little girl named Elsa . She went for a
a short tail and big cheeks . That is a ham
a time , Mark was feeling a bit bored . He
desert , with lots of c act uses all around her
a big sle igh with eight re ind eer . The
big and brown , with long ears and a fuzzy tail
flake is small and has eight points . They are both

INTERVAL 0.036 - 0.066
CONTAINS 0.148%

body . It felt so good ! Pretty
the barking dog . The man picked her up and held
house looks so tidy !" Grand ma smiled and said ,
had found . He called out to his friends and showed
the muff ins were the best they ever had ! <|endoftext|>

INTERVAL 0.005 - 0.036
CONTAINS 30.395%

here ". The monster was very surprised and she said ,
Sam . " Let 's go explore the ship
heard about the dream and said to his daughter , "
" Sorry , mom ," they said . " We
. Lily noticed her friend 's feelings and shared her toys

INTERVAL -0.026 - 0.005
CONTAINS 68.783%

. One day , Ben sees a new book
take the toy . L ily 's mother told
being so patient . The bear was happy .
collecting . Every morning she would walk to the park and
They both enjoyed it . They finished all the jelly but

INTERVAL -0.056 - -0.026
CONTAINS 0.675%

gears for the eyes , nose , and mouth . They
special rock called " flex ible ". Tim didn 't know
was a little bird named Tweet . Tweet loved to sing
John who lived in a small town . John loved
time . Every time they thought about the impressive city ,
Token're
Token ID821
Feature activation-4.03e-03

Pos logit contributions
One+0.004
 One+0.004
 pine+0.004
+0.004
orse+0.004
Neg logit contributions
ied-0.005
enge-0.005
uttered-0.005
ics-0.005
apped-0.005
Token the
Token ID262
Feature activation+9.62e-03

Pos logit contributions
+0.006
One+0.005
 know+0.005
 One+0.005
 think+0.005
Neg logit contributions
ied-0.007
uttered-0.006
enge-0.006
uddled-0.006
apped-0.006
Token best
Token ID1266
Feature activation+3.65e-02

Pos logit contributions
ied+0.013
enge+0.012
uttered+0.012
ics+0.011
uddled+0.011
Neg logit contributions
-0.016
One-0.015
 know-0.015
 One-0.015
 think-0.015
Token rh
Token ID9529
Feature activation+4.61e-02

Pos logit contributions
ied+0.073
enge+0.071
uttered+0.070
apped+0.068
ics+0.067
Neg logit contributions
One-0.035
-0.034
 One-0.032
 know-0.031
 think-0.030
Tokenin
Token ID259
Feature activation+2.59e-02

Pos logit contributions
ied+0.096
uttered+0.095
uddled+0.090
overed+0.089
ics+0.088
Neg logit contributions
."-0.040
One-0.039
-0.038
,"-0.036
"-0.036
Tokenoc
Token ID420
Feature activation+6.63e-02

Pos logit contributions
ied+0.049
uttered+0.046
enge+0.044
uddled+0.044
ics+0.043
Neg logit contributions
-0.031
One-0.030
 know-0.029
."-0.029
 think-0.028
Tokeneros
Token ID27498
Feature activation+3.11e-02

Pos logit contributions
ied+0.102
uttered+0.094
enge+0.091
ics+0.089
uddled+0.087
Neg logit contributions
-0.093
."-0.092
One-0.092
 One-0.088
 know-0.088
Token ever
Token ID1683
Feature activation-7.36e-04

Pos logit contributions
ied+0.063
uttered+0.060
enge+0.058
uddled+0.058
ics+0.056
Neg logit contributions
-0.033
."-0.032
 know-0.031
,"-0.030
 think-0.029
Token!"
Token ID2474
Feature activation-1.66e-05

Pos logit contributions
One+0.000
 One+0.000
+0.000
 pine+0.000
 observe+0.000
Neg logit contributions
ied-0.002
ics-0.002
enge-0.002
uttered-0.002
apped-0.002
Token Tim
Token ID5045
Feature activation+1.20e-02

Pos logit contributions
One+0.000
+0.000
 know+0.000
 One+0.000
 think+0.000
Neg logit contributions
ied-0.000
enge-0.000
uttered-0.000
ics-0.000
uddled-0.000
Tokenmy
Token ID1820
Feature activation+2.70e-02

Pos logit contributions
ied+0.023
enge+0.021
ics+0.020
uttered+0.020
uddled+0.020
Neg logit contributions
One-0.013
 One-0.012
-0.012
 know-0.011
 think-0.011
Token Tim
Token ID5045
Feature activation+1.40e-02

Pos logit contributions
ied+0.007
uttered+0.006
enge+0.006
uddled+0.006
ics+0.006
Neg logit contributions
 One-0.004
-0.004
One-0.004
 know-0.004
."-0.004
Token thought
Token ID1807
Feature activation-7.98e-03

Pos logit contributions
ied+0.029
uttered+0.027
enge+0.027
overed+0.026
uddled+0.026
Neg logit contributions
-0.015
 know-0.014
One-0.014
 think-0.014
 One-0.014
Token it
Token ID340
Feature activation+2.99e-02

Pos logit contributions
 One+0.008
+0.008
."+0.008
 know+0.007
,"+0.007
Neg logit contributions
enge-0.017
uttered-0.017
rol-0.016
uddled-0.016
ied-0.016
Token was
Token ID373
Feature activation+2.12e-02

Pos logit contributions
ied+0.060
uttered+0.055
enge+0.054
overed+0.054
uddled+0.054
Neg logit contributions
-0.038
 One-0.035
One-0.034
 think-0.034
 c-0.033
Token the
Token ID262
Feature activation+3.18e-02

Pos logit contributions
ied+0.044
uttered+0.042
uddled+0.041
enge+0.040
apped+0.039
Neg logit contributions
-0.023
 One-0.022
."-0.022
One-0.022
 know-0.022
Token best
Token ID1266
Feature activation+6.49e-02

Pos logit contributions
ied+0.062
uddled+0.058
uttered+0.058
overed+0.055
awed+0.055
Neg logit contributions
One-0.044
 One-0.043
."-0.042
 think-0.041
-0.040
Token toy
Token ID13373
Feature activation+4.19e-02

Pos logit contributions
ied+0.125
uddled+0.121
awed+0.121
uttered+0.119
enge+0.115
Neg logit contributions
."-0.089
 One-0.083
.-0.082
,"-0.077
One-0.072
Token he
Token ID339
Feature activation+3.66e-02

Pos logit contributions
ied+0.096
uttered+0.092
uddled+0.090
outed+0.090
overed+0.089
Neg logit contributions
-0.043
."-0.042
.-0.039
,"-0.036
 One-0.032
Token had
Token ID550
Feature activation+2.80e-02

Pos logit contributions
ied+0.082
uttered+0.081
ics+0.078
overed+0.077
uddled+0.077
Neg logit contributions
-0.033
."-0.031
 know-0.031
 think-0.030
 want-0.029
Token ever
Token ID1683
Feature activation-6.93e-03

Pos logit contributions
uttered+0.055
ied+0.055
enge+0.053
uddled+0.053
ics+0.050
Neg logit contributions
."-0.033
One-0.029
 know-0.028
-0.028
 think-0.027
Token seen
Token ID1775
Feature activation-2.27e-03

Pos logit contributions
+0.009
One+0.009
 One+0.009
 know+0.009
 think+0.008
Neg logit contributions
ied-0.012
enge-0.011
uttered-0.011
uddled-0.011
ics-0.010
Token quickly
Token ID2952
Feature activation+6.19e-03

Pos logit contributions
ied+0.016
enge+0.015
uttered+0.015
ics+0.015
uddled+0.015
Neg logit contributions
-0.012
One-0.012
 know-0.012
 One-0.012
 think-0.011
Token checked
Token ID10667
Feature activation-2.86e-03

Pos logit contributions
ied+0.009
uttered+0.008
uddled+0.008
enge+0.008
box+0.007
Neg logit contributions
One-0.010
 One-0.009
-0.009
."-0.009
 know-0.008
Token for
Token ID329
Feature activation+8.44e-03

Pos logit contributions
+0.004
One+0.004
 know+0.003
 One+0.003
 think+0.003
Neg logit contributions
ied-0.005
uttered-0.005
enge-0.005
uddled-0.005
ics-0.005
Token a
Token ID257
Feature activation+3.93e-04

Pos logit contributions
ied+0.017
enge+0.015
uttered+0.015
uddled+0.015
ics+0.015
Neg logit contributions
One-0.009
-0.009
 know-0.009
 One-0.009
 think-0.008
Token fire
Token ID2046
Feature activation+4.92e-03

Pos logit contributions
ied+0.001
enge+0.001
uttered+0.001
uddled+0.001
ics+0.001
Neg logit contributions
-0.000
One-0.000
 know-0.000
 One-0.000
 think-0.000
Token extingu
Token ID24995
Feature activation+6.45e-02

Pos logit contributions
ied+0.010
uttered+0.009
enge+0.009
uddled+0.009
ics+0.009
Neg logit contributions
-0.005
."-0.005
One-0.005
 One-0.005
 know-0.005
Tokenisher
Token ID4828
Feature activation+1.29e-02

Pos logit contributions
ics+0.071
ied+0.066
uttered+0.066
uddled+0.062
overed+0.061
Neg logit contributions
use-0.115
 think-0.114
,"-0.113
."-0.112
 know-0.112
Token,
Token ID11
Feature activation+3.55e-03

Pos logit contributions
ied+0.034
enge+0.033
uttered+0.033
uddled+0.032
ics+0.032
Neg logit contributions
-0.005
."-0.004
One-0.004
 know-0.004
 One-0.003
Token but
Token ID475
Feature activation-1.59e-03

Pos logit contributions
ied+0.009
enge+0.008
uttered+0.008
ics+0.008
uddled+0.008
Neg logit contributions
-0.003
 think-0.003
One-0.003
 One-0.002
 know-0.002
Token there
Token ID612
Feature activation+1.32e-02

Pos logit contributions
+0.001
 then+0.001
 see+0.001
 sometimes+0.001
 think+0.001
Neg logit contributions
enge-0.004
ied-0.004
uttered-0.004
ics-0.004
rol-0.004
Token wasn
Token ID2492
Feature activation+9.26e-03

Pos logit contributions
enge+0.031
plug+0.028
uttered+0.028
uddled+0.028
bel+0.028
Neg logit contributions
-0.014
.-0.011
 One-0.010
 sometimes-0.010
,-0.010
Token.
Token ID13
Feature activation-4.96e-03

Pos logit contributions
ied+0.015
enge+0.014
box+0.014
uttered+0.014
uddled+0.013
Neg logit contributions
One-0.009
-0.008
 know-0.008
 One-0.007
ppy-0.007
Token Tom
Token ID4186
Feature activation+1.02e-02

Pos logit contributions
 One+0.005
+0.004
One+0.004
."+0.003
 know+0.003
Neg logit contributions
ied-0.012
ics-0.011
bled-0.011
enge-0.011
uddled-0.011
Token was
Token ID373
Feature activation-3.95e-03

Pos logit contributions
ied+0.021
enge+0.019
uttered+0.019
uddled+0.019
ics+0.018
Neg logit contributions
One-0.010
-0.010
 know-0.010
 One-0.010
 think-0.009
Token the
Token ID262
Feature activation+1.20e-02

Pos logit contributions
+0.004
 know+0.004
One+0.004
."+0.004
 think+0.004
Neg logit contributions
ied-0.008
enge-0.008
uttered-0.008
uddled-0.008
ipt-0.007
Token biggest
Token ID4094
Feature activation+1.29e-02

Pos logit contributions
ied+0.019
enge+0.018
uttered+0.017
ics+0.017
uddled+0.017
Neg logit contributions
-0.017
One-0.017
 know-0.016
 One-0.016
 think-0.016
Token of
Token ID286
Feature activation+6.32e-02

Pos logit contributions
ied+0.026
enge+0.025
uttered+0.024
uddled+0.024
box+0.024
Neg logit contributions
One-0.012
-0.012
 know-0.011
 One-0.011
 think-0.011
Token them
Token ID606
Feature activation+2.09e-02

Pos logit contributions
uddled+0.130
ied+0.127
enge+0.125
uttered+0.123
apped+0.117
Neg logit contributions
 One-0.062
 pine-0.061
One-0.061
 know-0.060
 think-0.058
Token all
Token ID477
Feature activation+2.93e-02

Pos logit contributions
enge+0.041
ied+0.041
uttered+0.040
ipt+0.039
uddled+0.039
Neg logit contributions
-0.023
."-0.021
 know-0.021
 think-0.020
 c-0.020
Token!
Token ID0
Feature activation+4.19e-03

Pos logit contributions
ied+0.074
enge+0.073
uttered+0.072
uddled+0.071
apped+0.069
Neg logit contributions
-0.016
."-0.015
One-0.013
 One-0.013
 know-0.013
Token They
Token ID1119
Feature activation+5.95e-04

Pos logit contributions
ied+0.010
ics+0.009
enge+0.009
uttered+0.009
uddled+0.009
Neg logit contributions
 One-0.004
-0.003
One-0.003
 know-0.003
."-0.003
Token were
Token ID547
Feature activation-3.18e-03

Pos logit contributions
ied+0.001
enge+0.001
uttered+0.001
uddled+0.001
ics+0.001
Neg logit contributions
-0.001
One-0.001
 know-0.001
 One-0.001
 think-0.001
Token.
Token ID13
Feature activation+9.70e-03

Pos logit contributions
ied+0.082
enge+0.079
uttered+0.078
uddled+0.077
ics+0.076
Neg logit contributions
-0.011
One-0.009
 know-0.009
 One-0.009
."-0.008
Token "
Token ID366
Feature activation+6.17e-03

Pos logit contributions
ied+0.019
uttered+0.018
enge+0.018
uddled+0.017
apped+0.017
Neg logit contributions
-0.010
One-0.010
 know-0.009
 One-0.009
 think-0.009
TokenIt
Token ID1026
Feature activation+1.68e-02

Pos logit contributions
enge+0.009
ied+0.009
apped+0.008
uttered+0.008
ics+0.008
Neg logit contributions
 know-0.009
 c-0.008
 One-0.008
-0.008
 pine-0.008
Token is
Token ID318
Feature activation+5.60e-03

Pos logit contributions
ied+0.027
enge+0.025
uttered+0.024
ics+0.024
apped+0.023
Neg logit contributions
One-0.023
-0.022
 know-0.022
 One-0.022
 think-0.021
Token the
Token ID262
Feature activation+2.83e-02

Pos logit contributions
ied+0.009
enge+0.008
uttered+0.008
ics+0.008
uddled+0.008
Neg logit contributions
One-0.008
-0.008
 know-0.008
 One-0.008
 think-0.007
Token best
Token ID1266
Feature activation+6.29e-02

Pos logit contributions
ied+0.046
uttered+0.043
uddled+0.042
enge+0.040
apped+0.039
Neg logit contributions
One-0.043
-0.042
 One-0.041
 know-0.040
."-0.040
Token tower
Token ID10580
Feature activation+4.26e-02

Pos logit contributions
ied+0.116
uttered+0.107
uddled+0.105
enge+0.105
ics+0.100
Neg logit contributions
One-0.079
-0.079
."-0.079
 One-0.078
 know-0.076
Token ever
Token ID1683
Feature activation+1.52e-02

Pos logit contributions
uttered+0.090
ied+0.088
uddled+0.087
enge+0.085
ipt+0.083
Neg logit contributions
."-0.047
-0.047
,"-0.041
!"-0.041
.-0.038
Token!"
Token ID2474
Feature activation+8.15e-03

Pos logit contributions
ied+0.038
ics+0.035
enge+0.034
uttered+0.034
uddled+0.034
Neg logit contributions
One-0.008
 One-0.008
-0.007
 know-0.007
 think-0.006
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
ied+0.014
uttered+0.014
apped+0.014
enge+0.013
overed+0.013
Neg logit contributions
-0.010
One-0.009
use-0.009
 c-0.009
ppy-0.009
Token
Token ID198
Feature activation-6.70e-03

Pos logit contributions
ied+0.020
uttered+0.019
apped+0.018
enge+0.018
uddled+0.017
Neg logit contributions
use-0.011
-0.011
 tul-0.010
ulent-0.010
 c-0.010
Token sees
Token ID7224
Feature activation-9.08e-03

Pos logit contributions
+0.001
One+0.001
 One+0.001
 know+0.001
 think+0.001
Neg logit contributions
ied-0.002
enge-0.002
uttered-0.002
uddled-0.001
ics-0.001
Tokenaw
Token ID707
Feature activation-1.02e-03

Pos logit contributions
+0.006
One+0.006
."+0.005
 One+0.005
 know+0.005
Neg logit contributions
ied-0.022
enge-0.021
uttered-0.021
ics-0.020
uddled-0.020
Token,
Token ID11
Feature activation-5.03e-03

Pos logit contributions
One+0.001
+0.001
 know+0.001
 One+0.001
 think+0.001
Neg logit contributions
ied-0.002
uttered-0.002
enge-0.002
uddled-0.002
ics-0.002
Token the
Token ID262
Feature activation-2.52e-03

Pos logit contributions
+0.005
 think+0.005
 c+0.005
One+0.004
."+0.004
Neg logit contributions
ied-0.011
enge-0.010
uttered-0.010
uddled-0.010
apped-0.009
Token merry
Token ID44570
Feature activation+1.94e-02

Pos logit contributions
+0.003
One+0.003
 One+0.003
 know+0.003
 think+0.003
Neg logit contributions
ied-0.005
enge-0.004
uttered-0.004
uddled-0.004
ics-0.004
Token-
Token ID12
Feature activation+6.18e-02

Pos logit contributions
ied+0.033
uttered+0.030
enge+0.030
uddled+0.029
ics+0.029
Neg logit contributions
One-0.025
-0.025
 know-0.025
 One-0.025
."-0.024
Tokengo
Token ID2188
Feature activation+3.51e-02

Pos logit contributions
ied+0.088
enge+0.081
uttered+0.080
uddled+0.077
ics+0.075
Neg logit contributions
-0.097
One-0.097
 know-0.094
."-0.093
 One-0.092
Token-
Token ID12
Feature activation+4.19e-02

Pos logit contributions
ied+0.055
enge+0.052
uttered+0.051
uddled+0.050
box+0.047
Neg logit contributions
-0.051
."-0.050
One-0.049
 know-0.049
 One-0.047
Tokenround
Token ID744
Feature activation-2.92e-03

Pos logit contributions
ied+0.063
enge+0.057
uttered+0.056
uddled+0.054
ics+0.054
Neg logit contributions
-0.064
One-0.064
 know-0.063
 One-0.062
 think-0.060
Token.
Token ID13
Feature activation+4.91e-04

Pos logit contributions
+0.001
One+0.001
."+0.001
 One+0.001
 know+0.001
Neg logit contributions
ied-0.008
enge-0.008
uttered-0.008
uddled-0.007
apped-0.007
Token They
Token ID1119
Feature activation-5.63e-03

Pos logit contributions
ied+0.001
enge+0.001
uttered+0.001
uddled+0.001
apped+0.001
Neg logit contributions
-0.001
One-0.001
 know-0.001
 One-0.001
 think-0.001
Token!
Token ID0
Feature activation+8.58e-03

Pos logit contributions
One+0.003
 tul+0.003
 pine+0.003
 One+0.003
ppy+0.003
Neg logit contributions
ied-0.006
enge-0.006
ics-0.006
uddled-0.006
uttered-0.006
Token I
Token ID314
Feature activation+1.24e-03

Pos logit contributions
ied+0.015
enge+0.014
uttered+0.014
apped+0.014
uddled+0.013
Neg logit contributions
One-0.011
 know-0.010
 c-0.010
 pine-0.010
 think-0.010
Token'm
Token ID1101
Feature activation-8.03e-03

Pos logit contributions
ied+0.002
enge+0.002
uttered+0.002
uddled+0.002
ics+0.002
Neg logit contributions
One-0.002
-0.002
 One-0.001
 know-0.001
 think-0.001
Token the
Token ID262
Feature activation-2.33e-04

Pos logit contributions
+0.010
 know+0.010
One+0.009
 think+0.009
."+0.009
Neg logit contributions
ied-0.015
enge-0.014
uttered-0.014
uddled-0.014
ics-0.013
Token best
Token ID1266
Feature activation+2.62e-02

Pos logit contributions
+0.000
One+0.000
 One+0.000
 know+0.000
 think+0.000
Neg logit contributions
ied-0.000
uttered-0.000
uddled-0.000
enge-0.000
ics-0.000
Token gol
Token ID43791
Feature activation+6.16e-02

Pos logit contributions
ied+0.041
uttered+0.037
enge+0.037
uddled+0.036
ics+0.035
Neg logit contributions
-0.039
One-0.039
 know-0.038
 One-0.038
."-0.037
Tokenfer
Token ID2232
Feature activation+3.14e-02

Pos logit contributions
ied+0.092
uttered+0.085
uddled+0.077
outed+0.076
enge+0.075
Neg logit contributions
-0.096
."-0.092
One-0.092
"-0.090
orse-0.087
Token in
Token ID287
Feature activation+6.14e-03

Pos logit contributions
ied+0.070
uttered+0.069
enge+0.066
uddled+0.066
ics+0.064
Neg logit contributions
-0.028
."-0.026
 know-0.023
,"-0.023
 think-0.023
Token the
Token ID262
Feature activation+2.60e-04

Pos logit contributions
ied+0.012
uttered+0.011
enge+0.011
uddled+0.011
ics+0.010
Neg logit contributions
-0.007
One-0.007
 know-0.007
 One-0.006
 think-0.006
Token world
Token ID995
Feature activation+6.90e-03

Pos logit contributions
ied+0.000
enge+0.000
uttered+0.000
box+0.000
apped+0.000
Neg logit contributions
 know-0.000
One-0.000
-0.000
 think-0.000
 One-0.000
Token and
Token ID290
Feature activation+5.41e-03

Pos logit contributions
ied+0.019
uttered+0.019
enge+0.018
uddled+0.018
apped+0.018
Neg logit contributions
-0.002
."-0.002
One-0.002
 One-0.001
 know-0.001
Token the
Token ID262
Feature activation+2.39e-02

Pos logit contributions
ied+0.015
enge+0.015
uttered+0.015
uddled+0.015
apped+0.014
Neg logit contributions
-0.007
 One-0.007
One-0.007
 c-0.007
."-0.006
Token right
Token ID826
Feature activation+5.56e-02

Pos logit contributions
ied+0.040
uttered+0.036
enge+0.036
uddled+0.035
ics+0.034
Neg logit contributions
One-0.034
-0.034
 One-0.033
 know-0.032
 think-0.032
Token size
Token ID2546
Feature activation+1.52e-02

Pos logit contributions
ied+0.089
enge+0.083
uttered+0.083
uddled+0.082
ics+0.080
Neg logit contributions
One-0.079
."-0.078
 One-0.077
-0.076
 pine-0.074
Token,
Token ID11
Feature activation+3.73e-04

Pos logit contributions
ied+0.028
enge+0.027
uttered+0.027
uddled+0.026
ics+0.025
Neg logit contributions
-0.018
One-0.017
 know-0.017
 One-0.016
."-0.016
Token the
Token ID262
Feature activation+1.50e-02

Pos logit contributions
ied+0.001
uttered+0.001
enge+0.001
ics+0.001
uddled+0.001
Neg logit contributions
-0.000
 think-0.000
 c-0.000
."-0.000
 One-0.000
Token right
Token ID826
Feature activation+6.15e-02

Pos logit contributions
ied+0.026
enge+0.024
uttered+0.024
uddled+0.024
ics+0.023
Neg logit contributions
-0.019
One-0.019
 know-0.018
 One-0.018
 think-0.017
Token color
Token ID3124
Feature activation+1.13e-02

Pos logit contributions
ied+0.093
uttered+0.085
enge+0.085
uddled+0.083
ics+0.082
Neg logit contributions
One-0.094
-0.093
 One-0.092
."-0.091
 know-0.090
Token and
Token ID290
Feature activation+3.28e-03

Pos logit contributions
enge+0.024
ied+0.023
uttered+0.022
uddled+0.022
apped+0.021
Neg logit contributions
-0.012
."-0.011
,"-0.010
 c-0.010
.-0.009
Token the
Token ID262
Feature activation+1.82e-02

Pos logit contributions
ied+0.006
uttered+0.006
enge+0.006
uddled+0.006
overed+0.006
Neg logit contributions
-0.004
 know-0.003
."-0.003
 c-0.003
 think-0.003
Token right
Token ID826
Feature activation+5.78e-02

Pos logit contributions
ied+0.031
enge+0.029
uttered+0.029
uddled+0.028
ics+0.028
Neg logit contributions
-0.023
One-0.023
 know-0.022
 One-0.022
 think-0.021
Token price
Token ID2756
Feature activation+2.09e-02

Pos logit contributions
ied+0.094
enge+0.087
uttered+0.086
uddled+0.086
ics+0.084
Neg logit contributions
One-0.082
."-0.081
 One-0.080
-0.080
 know-0.078
Token yellow
Token ID7872
Feature activation+2.62e-02

Pos logit contributions
ied+0.031
enge+0.029
uttered+0.029
uddled+0.028
apped+0.027
Neg logit contributions
One-0.021
-0.021
 One-0.020
 c-0.019
."-0.019
Token ball
Token ID2613
Feature activation+1.79e-02

Pos logit contributions
ied+0.050
enge+0.048
cards+0.046
ics+0.046
plug+0.045
Neg logit contributions
 One-0.031
."-0.029
 c-0.029
-0.029
.-0.028
Token were
Token ID547
Feature activation+2.19e-03

Pos logit contributions
enge+0.034
ied+0.034
aughter+0.032
overed+0.031
ipt+0.031
Neg logit contributions
 c-0.020
-0.020
 know-0.019
 sometimes-0.019
.-0.018
Token the
Token ID262
Feature activation+1.95e-02

Pos logit contributions
ied+0.004
enge+0.004
uttered+0.004
uddled+0.004
overed+0.004
Neg logit contributions
-0.002
 know-0.002
."-0.002
One-0.002
 One-0.002
Token best
Token ID1266
Feature activation+4.19e-02

Pos logit contributions
ied+0.034
uttered+0.031
overed+0.030
uddled+0.030
apped+0.028
Neg logit contributions
One-0.030
-0.030
 One-0.028
 think-0.028
."-0.028
Token of
Token ID286
Feature activation+6.15e-02

Pos logit contributions
ied+0.083
enge+0.081
uttered+0.080
ics+0.076
cards+0.075
Neg logit contributions
."-0.045
-0.045
 One-0.041
 know-0.041
One-0.039
Token friends
Token ID2460
Feature activation+1.12e-02

Pos logit contributions
ied+0.122
enge+0.120
uttered+0.119
uddled+0.115
ics+0.114
Neg logit contributions
 know-0.061
One-0.060
-0.059
."-0.058
 One-0.058
Token.
Token ID13
Feature activation+4.46e-03

Pos logit contributions
ied+0.029
uttered+0.028
enge+0.028
uddled+0.027
ics+0.027
Neg logit contributions
-0.006
."-0.005
One-0.005
 know-0.004
 One-0.004
Token And
Token ID843
Feature activation+3.66e-03

Pos logit contributions
ied+0.009
enge+0.008
uttered+0.008
apped+0.008
uddled+0.008
Neg logit contributions
One-0.004
-0.004
 know-0.004
 think-0.004
 c-0.004
Token they
Token ID484
Feature activation-1.92e-03

Pos logit contributions
ied+0.007
uttered+0.007
uddled+0.007
enge+0.007
overed+0.007
Neg logit contributions
-0.005
 know-0.004
 One-0.004
 think-0.004
One-0.004
Token lived
Token ID5615
Feature activation+1.62e-03

Pos logit contributions
One+0.003
+0.003
 One+0.003
 know+0.003
."+0.003
Neg logit contributions
ied-0.003
enge-0.003
uttered-0.003
ics-0.003
uddled-0.003
Token Lily
Token ID20037
Feature activation+1.00e-02

Pos logit contributions
ied+0.012
enge+0.012
uttered+0.011
ics+0.011
uddled+0.011
Neg logit contributions
 One-0.006
-0.006
One-0.006
 know-0.006
."-0.006
Token thought
Token ID1807
Feature activation-7.84e-03

Pos logit contributions
ied+0.018
uttered+0.017
enge+0.017
uddled+0.016
ics+0.016
Neg logit contributions
-0.013
One-0.012
 know-0.012
 One-0.012
 think-0.012
Token it
Token ID340
Feature activation+2.80e-02

Pos logit contributions
+0.009
 One+0.009
 know+0.008
One+0.008
."+0.008
Neg logit contributions
enge-0.015
ied-0.014
uttered-0.014
apped-0.014
uddled-0.014
Token was
Token ID373
Feature activation+1.81e-02

Pos logit contributions
ied+0.055
uttered+0.051
enge+0.051
apped+0.050
overed+0.049
Neg logit contributions
-0.037
 One-0.034
One-0.034
 think-0.032
 c-0.031
Token the
Token ID262
Feature activation+2.33e-02

Pos logit contributions
ied+0.039
uttered+0.037
uddled+0.037
enge+0.036
apped+0.035
Neg logit contributions
-0.020
 One-0.019
."-0.018
One-0.018
 know-0.017
Token best
Token ID1266
Feature activation+6.14e-02

Pos logit contributions
ied+0.041
uttered+0.037
uddled+0.037
overed+0.035
apped+0.034
Neg logit contributions
One-0.036
-0.034
 One-0.033
 think-0.033
."-0.032
Token toy
Token ID13373
Feature activation+4.23e-02

Pos logit contributions
ied+0.119
uttered+0.111
enge+0.111
bled+0.110
uddled+0.109
Neg logit contributions
."-0.077
 One-0.075
-0.070
One-0.070
.-0.069
Token car
Token ID1097
Feature activation+4.29e-02

Pos logit contributions
ied+0.094
outed+0.092
uttered+0.089
uddled+0.086
bled+0.086
Neg logit contributions
-0.047
."-0.042
.-0.040
 c-0.037
,"-0.036
Token she
Token ID673
Feature activation+3.21e-02

Pos logit contributions
ipt+0.098
outed+0.098
uttered+0.098
ied+0.098
bled+0.094
Neg logit contributions
-0.040
."-0.039
.-0.038
 like-0.031
 c-0.031
Token had
Token ID550
Feature activation+1.61e-02

Pos logit contributions
ied+0.071
uttered+0.069
ics+0.067
overed+0.067
ipt+0.066
Neg logit contributions
-0.030
."-0.028
 know-0.027
 think-0.027
 want-0.026
Token ever
Token ID1683
Feature activation-1.43e-02

Pos logit contributions
ied+0.031
uttered+0.030
enge+0.030
uddled+0.028
apped+0.028
Neg logit contributions
."-0.019
-0.017
One-0.017
 know-0.017
 think-0.016
Token it
Token ID340
Feature activation+2.66e-02

Pos logit contributions
 One+0.007
+0.007
."+0.007
One+0.007
,"+0.007
Neg logit contributions
enge-0.014
uttered-0.013
ied-0.013
ics-0.013
uddled-0.013
Token was
Token ID373
Feature activation+2.00e-02

Pos logit contributions
ied+0.052
uttered+0.049
enge+0.049
overed+0.048
uddled+0.048
Neg logit contributions
-0.033
 One-0.031
One-0.030
."-0.029
 think-0.029
Token the
Token ID262
Feature activation+3.19e-02

Pos logit contributions
ied+0.040
uttered+0.039
uddled+0.038
enge+0.037
apped+0.036
Neg logit contributions
-0.022
One-0.022
 One-0.022
."-0.021
 know-0.021
Token most
Token ID749
Feature activation+2.90e-02

Pos logit contributions
ied+0.044
enge+0.041
uttered+0.040
uddled+0.038
ics+0.038
Neg logit contributions
-0.052
One-0.051
 know-0.050
 One-0.050
 think-0.048
Token beautiful
Token ID4950
Feature activation+4.09e-02

Pos logit contributions
ied+0.047
enge+0.044
uttered+0.042
uddled+0.041
ics+0.041
Neg logit contributions
One-0.040
-0.038
 One-0.038
 know-0.037
 think-0.036
Token ig
Token ID45329
Feature activation+6.11e-02

Pos logit contributions
ied+0.069
enge+0.064
uttered+0.063
uddled+0.061
apped+0.060
Neg logit contributions
-0.055
One-0.054
 know-0.052
 One-0.050
 think-0.050
Tokenloo
Token ID29680
Feature activation+4.07e-02

Pos logit contributions
ied+0.084
uttered+0.083
enge+0.077
ics+0.075
overed+0.074
Neg logit contributions
."-0.093
One-0.090
-0.087
,"-0.086
 pine-0.086
Token he
Token ID339
Feature activation+3.84e-02

Pos logit contributions
uttered+0.110
ied+0.106
uddled+0.104
outed+0.104
apped+0.102
Neg logit contributions
-0.026
.-0.022
."-0.021
,"-0.015
 c-0.015
Token had
Token ID550
Feature activation+2.47e-02

Pos logit contributions
ied+0.080
uttered+0.077
enge+0.074
ics+0.074
uddled+0.074
Neg logit contributions
-0.038
One-0.037
 know-0.036
 think-0.035
 One-0.034
Token ever
Token ID1683
Feature activation-1.03e-02

Pos logit contributions
ied+0.046
uttered+0.045
enge+0.044
uddled+0.043
apped+0.041
Neg logit contributions
."-0.030
One-0.029
-0.028
 know-0.027
 One-0.026
Token seen
Token ID1775
Feature activation-1.76e-03

Pos logit contributions
+0.015
One+0.015
 One+0.014
 know+0.014
 think+0.014
Neg logit contributions
ied-0.016
enge-0.014
uttered-0.014
ics-0.014
uddled-0.014
Token "
Token ID366
Feature activation+5.58e-03

Pos logit contributions
ied+0.008
uttered+0.007
uddled+0.007
enge+0.007
apped+0.007
Neg logit contributions
One-0.008
-0.008
 know-0.008
 think-0.007
 One-0.007
TokenThe
Token ID464
Feature activation+6.25e-03

Pos logit contributions
ied+0.010
uttered+0.010
enge+0.009
uddled+0.009
ics+0.009
Neg logit contributions
One-0.007
-0.007
 One-0.006
 know-0.006
 think-0.006
Token cave
Token ID11527
Feature activation+2.13e-02

Pos logit contributions
ied+0.010
uttered+0.010
enge+0.010
uddled+0.009
ics+0.009
Neg logit contributions
-0.008
One-0.008
 know-0.008
 think-0.007
 One-0.007
Token is
Token ID318
Feature activation+1.02e-02

Pos logit contributions
ied+0.042
enge+0.040
uttered+0.040
uddled+0.038
apped+0.038
Neg logit contributions
-0.023
."-0.023
 know-0.021
 think-0.021
,"-0.021
Token the
Token ID262
Feature activation+2.67e-02

Pos logit contributions
ied+0.019
enge+0.018
uttered+0.018
uddled+0.018
apped+0.017
Neg logit contributions
-0.012
One-0.012
."-0.012
 One-0.012
 know-0.012
Token best
Token ID1266
Feature activation+6.08e-02

Pos logit contributions
ied+0.049
uttered+0.045
enge+0.044
uddled+0.044
apped+0.042
Neg logit contributions
One-0.034
-0.033
 One-0.033
 know-0.032
."-0.032
Token shelter
Token ID11772
Feature activation+3.79e-02

Pos logit contributions
ied+0.105
enge+0.097
uttered+0.096
uddled+0.093
apped+0.090
Neg logit contributions
."-0.083
-0.082
One-0.081
 One-0.080
 know-0.079
Token."
Token ID526
Feature activation+5.94e-03

Pos logit contributions
ied+0.078
uttered+0.075
enge+0.074
uddled+0.074
ipt+0.072
Neg logit contributions
."-0.042
-0.039
,"-0.035
!"-0.034
 think-0.032
Token So
Token ID1406
Feature activation+3.91e-03

Pos logit contributions
ied+0.012
enge+0.012
uttered+0.012
apped+0.011
uddled+0.011
Neg logit contributions
One-0.006
-0.006
 know-0.005
 think-0.005
 One-0.005
Token,
Token ID11
Feature activation+8.19e-03

Pos logit contributions
ied+0.010
enge+0.009
uttered+0.009
uddled+0.009
ics+0.009
Neg logit contributions
-0.002
One-0.002
 know-0.002
 One-0.002
 think-0.002
Token it
Token ID340
Feature activation+9.58e-03

Pos logit contributions
ied+0.018
enge+0.017
uddled+0.016
uttered+0.016
box+0.016
Neg logit contributions
-0.008
."-0.008
 One-0.008
 think-0.007
 know-0.007
Token car
Token ID1097
Feature activation+1.49e-02

Pos logit contributions
ied+0.029
uttered+0.026
enge+0.026
uddled+0.025
ics+0.025
Neg logit contributions
 One-0.028
-0.028
."-0.028
One-0.028
 know-0.027
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
ied+0.040
uttered+0.039
enge+0.038
uddled+0.037
overed+0.037
Neg logit contributions
-0.007
."-0.006
One-0.005
 know-0.005
 One-0.005
Token It
Token ID632
Feature activation+1.43e-02

Pos logit contributions
enge+0.016
apped+0.016
uttered+0.015
ied+0.015
box+0.014
Neg logit contributions
One-0.016
 know-0.015
 pine-0.015
 think-0.015
 c-0.014
Token is
Token ID318
Feature activation+4.37e-03

Pos logit contributions
ied+0.031
uttered+0.029
uddled+0.029
enge+0.028
overed+0.028
Neg logit contributions
-0.016
."-0.014
 One-0.014
One-0.013
 know-0.013
Token the
Token ID262
Feature activation+3.07e-02

Pos logit contributions
ied+0.008
uttered+0.008
enge+0.008
uddled+0.008
apped+0.007
Neg logit contributions
-0.005
One-0.005
 One-0.005
."-0.005
 know-0.005
Token best
Token ID1266
Feature activation+6.05e-02

Pos logit contributions
ied+0.052
uttered+0.049
uddled+0.047
enge+0.045
overed+0.045
Neg logit contributions
One-0.046
-0.045
."-0.044
 One-0.043
 think-0.042
Token car
Token ID1097
Feature activation+4.27e-02

Pos logit contributions
ied+0.098
uttered+0.091
uddled+0.088
enge+0.088
ics+0.084
Neg logit contributions
."-0.096
 One-0.090
One-0.090
-0.090
,"-0.088
Token ever
Token ID1683
Feature activation+9.44e-03

Pos logit contributions
ied+0.090
uttered+0.090
uddled+0.086
ipt+0.084
outed+0.083
Neg logit contributions
."-0.049
-0.045
,"-0.041
!"-0.041
.-0.038
Token.
Token ID13
Feature activation+4.12e-03

Pos logit contributions
ied+0.024
ics+0.022
enge+0.021
uddled+0.021
uttered+0.021
Neg logit contributions
One-0.005
-0.005
 One-0.005
 know-0.004
 pine-0.004
Token No
Token ID1400
Feature activation+1.03e-02

Pos logit contributions
enge+0.006
apped+0.006
uttered+0.005
ied+0.005
boxes+0.005
Neg logit contributions
 pine-0.006
One-0.006
 c-0.006
 think-0.006
orse-0.006
Token one
Token ID530
Feature activation+1.40e-02

Pos logit contributions
ied+0.019
enge+0.017
uttered+0.017
box+0.017
apped+0.017
Neg logit contributions
-0.012
One-0.011
 know-0.011
 One-0.011
 think-0.010
Token "
Token ID366
Feature activation+1.45e-03

Pos logit contributions
ied+0.029
enge+0.027
uttered+0.027
uddled+0.026
ics+0.025
Neg logit contributions
-0.024
One-0.024
 know-0.023
 One-0.023
 think-0.022
TokenHey
Token ID10814
Feature activation-7.49e-03

Pos logit contributions
ied+0.003
uttered+0.003
uddled+0.003
enge+0.003
ics+0.003
Neg logit contributions
One-0.002
-0.002
 One-0.001
 think-0.001
 know-0.001
Token,
Token ID11
Feature activation+9.82e-03

Pos logit contributions
One+0.008
 One+0.007
 know+0.007
+0.007
 think+0.007
Neg logit contributions
ied-0.014
enge-0.014
uddled-0.013
uttered-0.013
box-0.013
Token what
Token ID644
Feature activation+1.77e-02

Pos logit contributions
ied+0.020
enge+0.019
uttered+0.018
uddled+0.018
ics+0.018
Neg logit contributions
One-0.010
 One-0.009
-0.009
 know-0.009
."-0.008
Token's
Token ID338
Feature activation+2.63e-02

Pos logit contributions
ied+0.036
enge+0.034
uttered+0.034
uddled+0.033
ics+0.032
Neg logit contributions
-0.018
One-0.017
 know-0.017
 One-0.016
 think-0.016
Token going
Token ID1016
Feature activation+6.02e-02

Pos logit contributions
uddled+0.046
uttered+0.046
ied+0.045
apped+0.044
umbled+0.043
Neg logit contributions
-0.034
 think-0.032
."-0.032
One-0.031
 c-0.031
Token on
Token ID319
Feature activation+2.40e-02

Pos logit contributions
ied+0.116
uddled+0.110
uttered+0.110
atively+0.108
enge+0.105
Neg logit contributions
-0.071
."-0.069
 see-0.067
?"-0.066
One-0.064
Token?
Token ID30
Feature activation+9.34e-03

Pos logit contributions
ied+0.055
uttered+0.052
enge+0.052
uddled+0.050
apped+0.050
Neg logit contributions
-0.019
One-0.018
."-0.017
 One-0.017
 know-0.016
Token Why
Token ID4162
Feature activation-4.20e-03

Pos logit contributions
ied+0.015
uttered+0.015
enge+0.015
apped+0.014
uddled+0.014
Neg logit contributions
One-0.012
 pine-0.011
 know-0.011
 think-0.010
 c-0.010
Token are
Token ID389
Feature activation+1.23e-02

Pos logit contributions
+0.004
 know+0.004
One+0.004
 One+0.004
 think+0.004
Neg logit contributions
ied-0.009
uttered-0.008
enge-0.008
uddled-0.008
apped-0.008
Token you
Token ID345
Feature activation-3.70e-03

Pos logit contributions
ied+0.028
uttered+0.027
enge+0.027
uddled+0.026
apped+0.026
Neg logit contributions
-0.010
One-0.009
 One-0.009
 know-0.008
."-0.008
Token.
Token ID13
Feature activation-1.71e-05

Pos logit contributions
ied+0.002
enge+0.001
uttered+0.001
uddled+0.001
ics+0.001
Neg logit contributions
-0.000
One-0.000
 One-0.000
 know-0.000
."-0.000
Token It
Token ID632
Feature activation+1.18e-02

Pos logit contributions
 One+0.000
+0.000
One+0.000
 know+0.000
."+0.000
Neg logit contributions
ied-0.000
enge-0.000
uttered-0.000
ics-0.000
uddled-0.000
Token was
Token ID373
Feature activation+7.40e-03

Pos logit contributions
ied+0.023
enge+0.022
uttered+0.022
overed+0.021
box+0.021
Neg logit contributions
-0.017
 c-0.014
 know-0.014
 think-0.013
 One-0.013
Token the
Token ID262
Feature activation+2.25e-02

Pos logit contributions
ied+0.015
uttered+0.015
enge+0.014
uddled+0.014
apped+0.014
Neg logit contributions
-0.008
One-0.008
 One-0.008
."-0.008
 know-0.008
Token best
Token ID1266
Feature activation+5.45e-02

Pos logit contributions
ied+0.041
uttered+0.038
uddled+0.037
overed+0.037
awed+0.036
Neg logit contributions
One-0.034
-0.033
 think-0.030
 One-0.030
 love-0.030
Token birthday
Token ID10955
Feature activation+5.99e-02

Pos logit contributions
ied+0.110
uttered+0.103
outed+0.100
uddled+0.099
enge+0.099
Neg logit contributions
."-0.064
-0.063
 One-0.062
 love-0.061
One-0.060
Token ever
Token ID1683
Feature activation+1.98e-02

Pos logit contributions
uttered+0.145
outed+0.145
ied+0.139
overed+0.137
awed+0.133
Neg logit contributions
-0.055
."-0.051
.-0.049
 love-0.046
 have-0.046
Token!
Token ID0
Feature activation-1.84e-03

Pos logit contributions
ied+0.048
enge+0.045
uttered+0.045
uddled+0.044
ics+0.043
Neg logit contributions
One-0.012
-0.012
 know-0.011
 One-0.011
 think-0.010
Token<|endoftext|>
Token ID50256
Feature activation-7.46e-03

Pos logit contributions
+0.002
 One+0.001
One+0.001
 know+0.001
."+0.001
Neg logit contributions
ied-0.004
enge-0.004
uttered-0.004
uddled-0.004
ics-0.004
TokenOnce
Token ID7454
Feature activation+7.76e-03

Pos logit contributions
+0.005
One+0.005
 know+0.005
 think+0.005
 One+0.005
Neg logit contributions
ied-0.017
uttered-0.016
enge-0.016
ics-0.016
uddled-0.015
Token upon
Token ID2402
Feature activation+1.82e-02

Pos logit contributions
ied+0.013
enge+0.012
uttered+0.012
uddled+0.012
ics+0.011
Neg logit contributions
One-0.011
 One-0.010
-0.010
 know-0.010
."-0.010
Token not
Token ID407
Feature activation+1.02e-04

Pos logit contributions
ied+0.008
enge+0.008
uttered+0.008
uddled+0.007
apped+0.007
Neg logit contributions
-0.006
One-0.006
 know-0.006
 One-0.006
."-0.006
Token the
Token ID262
Feature activation+2.99e-02

Pos logit contributions
ied+0.000
uttered+0.000
enge+0.000
uddled+0.000
apped+0.000
Neg logit contributions
."-0.000
 know-0.000
-0.000
,"-0.000
 think-0.000
Token best
Token ID1266
Feature activation+4.93e-02

Pos logit contributions
ied+0.047
uttered+0.043
enge+0.043
uddled+0.042
ics+0.040
Neg logit contributions
One-0.044
-0.044
 One-0.043
 know-0.042
 think-0.041
Token.
Token ID13
Feature activation+6.54e-03

Pos logit contributions
ied+0.103
enge+0.097
uttered+0.097
uddled+0.094
ics+0.093
Neg logit contributions
-0.047
One-0.047
 One-0.045
 know-0.044
."-0.043
Token The
Token ID383
Feature activation+1.30e-02

Pos logit contributions
apped+0.009
uttered+0.009
enge+0.008
ied+0.008
overed+0.007
Neg logit contributions
One-0.011
 pine-0.010
 c-0.010
 know-0.010
use-0.010
Token best
Token ID1266
Feature activation+5.99e-02

Pos logit contributions
ied+0.020
enge+0.019
uttered+0.018
box+0.018
ics+0.018
Neg logit contributions
-0.018
 know-0.017
orse-0.017
One-0.017
 One-0.016
Token thing
Token ID1517
Feature activation+3.84e-02

Pos logit contributions
ied+0.098
enge+0.090
uttered+0.090
uddled+0.087
ics+0.085
Neg logit contributions
-0.084
One-0.084
 One-0.081
 know-0.081
 think-0.078
Token is
Token ID318
Feature activation+7.93e-03

Pos logit contributions
uttered+0.069
ied+0.068
enge+0.068
ipt+0.066
apped+0.066
Neg logit contributions
-0.050
."-0.048
,"-0.044
 think-0.042
.-0.041
Token to
Token ID284
Feature activation+2.05e-02

Pos logit contributions
ied+0.014
enge+0.014
uttered+0.014
uddled+0.014
apped+0.014
Neg logit contributions
."-0.010
-0.010
 think-0.009
 know-0.009
,"-0.009
Token share
Token ID2648
Feature activation+1.42e-02

Pos logit contributions
ied+0.039
enge+0.038
ics+0.038
uttered+0.036
apped+0.035
Neg logit contributions
-0.019
 pine-0.018
One-0.017
orse-0.017
 One-0.016
Token and
Token ID290
Feature activation+3.04e-02

Pos logit contributions
ied+0.032
enge+0.030
uttered+0.030
uddled+0.029
ics+0.029
Neg logit contributions
One-0.010
-0.010
 One-0.010
 know-0.010
 think-0.009
Token He
Token ID679
Feature activation+7.64e-03

Pos logit contributions
 One+0.005
One+0.004
+0.004
 know+0.004
."+0.004
Neg logit contributions
ied-0.011
uttered-0.010
enge-0.010
ics-0.010
uddled-0.010
Token thinks
Token ID6834
Feature activation-1.66e-02

Pos logit contributions
ied+0.015
enge+0.014
uttered+0.014
uddled+0.013
apped+0.013
Neg logit contributions
One-0.008
-0.008
 know-0.007
 One-0.007
."-0.007
Token it
Token ID340
Feature activation+2.23e-02

Pos logit contributions
+0.018
 One+0.018
 know+0.017
."+0.017
,"+0.017
Neg logit contributions
enge-0.033
ied-0.032
uddled-0.031
uttered-0.031
ics-0.030
Token is
Token ID318
Feature activation+1.16e-02

Pos logit contributions
ied+0.043
enge+0.039
uttered+0.039
uddled+0.039
ics+0.037
Neg logit contributions
-0.027
One-0.026
 One-0.026
 know-0.025
 think-0.025
Token the
Token ID262
Feature activation+2.84e-02

Pos logit contributions
ied+0.021
uttered+0.020
enge+0.020
uddled+0.019
ics+0.019
Neg logit contributions
-0.014
One-0.014
 One-0.013
 know-0.013
 think-0.013
Token best
Token ID1266
Feature activation+5.99e-02

Pos logit contributions
ied+0.045
uttered+0.041
uddled+0.041
enge+0.040
ics+0.038
Neg logit contributions
One-0.043
-0.043
 One-0.041
 know-0.041
 think-0.040
Token top
Token ID1353
Feature activation+2.99e-02

Pos logit contributions
ied+0.113
uddled+0.104
enge+0.103
uttered+0.103
ics+0.098
Neg logit contributions
."-0.074
-0.074
One-0.073
 One-0.073
 know-0.071
Token ever
Token ID1683
Feature activation+9.01e-03

Pos logit contributions
ied+0.070
enge+0.068
uddled+0.068
uttered+0.068
apped+0.064
Neg logit contributions
-0.025
."-0.022
One-0.019
 know-0.019
,"-0.019
Token.
Token ID13
Feature activation-3.44e-03

Pos logit contributions
ied+0.022
uttered+0.020
enge+0.020
ics+0.020
uddled+0.020
Neg logit contributions
One-0.005
-0.005
 One-0.005
 know-0.005
 think-0.004
Token He
Token ID679
Feature activation+6.35e-03

Pos logit contributions
+0.004
One+0.004
 know+0.003
 One+0.003
 think+0.003
Neg logit contributions
ied-0.007
enge-0.006
uttered-0.006
uddled-0.006
apped-0.006
Token shows
Token ID2523
Feature activation+1.54e-02

Pos logit contributions
ied+0.012
enge+0.011
uttered+0.011
uddled+0.010
apped+0.010
Neg logit contributions
One-0.007
-0.007
 know-0.007
 One-0.007
orse-0.006
Token coaster
Token ID42450
Feature activation+5.28e-03

Pos logit contributions
ied+0.032
uttered+0.030
enge+0.029
apped+0.027
uddled+0.027
Neg logit contributions
-0.037
One-0.036
."-0.036
 One-0.035
 c-0.034
Token,
Token ID11
Feature activation-1.09e-02

Pos logit contributions
enge+0.015
ied+0.014
uttered+0.014
ipt+0.014
outed+0.014
Neg logit contributions
.-0.002
."-0.002
-0.002
,-0.001
 One-0.001
Token and
Token ID290
Feature activation+3.46e-03

Pos logit contributions
+0.006
One+0.006
 One+0.005
 c+0.005
 think+0.005
Neg logit contributions
ied-0.028
enge-0.027
uttered-0.026
uddled-0.025
ics-0.025
Token the
Token ID262
Feature activation+6.49e-03

Pos logit contributions
ied+0.007
uttered+0.007
enge+0.007
uddled+0.007
ings+0.007
Neg logit contributions
-0.003
 see-0.003
 know-0.003
 c-0.003
 pine-0.003
Token merry
Token ID44570
Feature activation+2.89e-02

Pos logit contributions
ied+0.012
uttered+0.011
enge+0.011
uddled+0.011
ics+0.011
Neg logit contributions
-0.008
One-0.008
 know-0.008
 One-0.007
 think-0.007
Token-
Token ID12
Feature activation+5.99e-02

Pos logit contributions
ied+0.047
uttered+0.046
enge+0.044
ics+0.043
uddled+0.041
Neg logit contributions
."-0.039
One-0.039
 One-0.037
 know-0.037
 pine-0.037
Tokengo
Token ID2188
Feature activation+3.34e-02

Pos logit contributions
ied+0.091
uttered+0.087
enge+0.083
uddled+0.081
ics+0.080
Neg logit contributions
-0.087
One-0.086
 know-0.083
."-0.081
 One-0.079
Token-
Token ID12
Feature activation+3.64e-02

Pos logit contributions
ied+0.058
uttered+0.056
enge+0.055
uddled+0.054
overed+0.052
Neg logit contributions
."-0.041
-0.041
 know-0.039
One-0.038
r-0.038
Tokenround
Token ID744
Feature activation+5.30e-03

Pos logit contributions
ied+0.054
enge+0.049
uttered+0.049
uddled+0.047
ics+0.046
Neg logit contributions
-0.056
One-0.056
 know-0.054
 One-0.054
 think-0.053
Token.
Token ID13
Feature activation-3.36e-03

Pos logit contributions
enge+0.013
ied+0.013
uttered+0.012
ipt+0.012
plug+0.012
Neg logit contributions
."-0.003
.-0.003
,"-0.003
-0.003
 One-0.002
Token He
Token ID679
Feature activation+3.45e-03

Pos logit contributions
 One+0.003
+0.003
One+0.003
 know+0.003
."+0.003
Neg logit contributions
ied-0.007
enge-0.007
uttered-0.007
uddled-0.007
ics-0.007
Token smiled
Token ID13541
Feature activation+4.66e-04

Pos logit contributions
ied+0.023
uttered+0.023
enge+0.022
uddled+0.021
ics+0.021
Neg logit contributions
-0.015
 c-0.013
 know-0.013
 think-0.013
 One-0.013
Token.
Token ID13
Feature activation-5.74e-03

Pos logit contributions
ied+0.001
enge+0.001
uttered+0.001
uddled+0.001
ics+0.001
Neg logit contributions
-0.000
One-0.000
 One-0.000
."-0.000
 know-0.000
Token It
Token ID632
Feature activation+2.03e-02

Pos logit contributions
One+0.007
+0.007
 know+0.007
 c+0.007
 pine+0.007
Neg logit contributions
ied-0.010
uttered-0.009
enge-0.009
apped-0.009
uddled-0.009
Token was
Token ID373
Feature activation+1.06e-02

Pos logit contributions
ied+0.040
uttered+0.038
enge+0.037
apped+0.036
overed+0.036
Neg logit contributions
-0.029
 c-0.024
 think-0.023
One-0.023
 One-0.023
Token the
Token ID262
Feature activation+2.40e-02

Pos logit contributions
uttered+0.022
ied+0.022
uddled+0.022
enge+0.022
apped+0.021
Neg logit contributions
-0.011
 One-0.011
."-0.010
One-0.010
 know-0.010
Token best
Token ID1266
Feature activation+5.98e-02

Pos logit contributions
ied+0.041
uttered+0.039
uddled+0.037
overed+0.036
apped+0.036
Neg logit contributions
One-0.037
-0.036
 One-0.034
."-0.034
 think-0.034
Token night
Token ID1755
Feature activation+3.24e-02

Pos logit contributions
ied+0.107
uttered+0.105
enge+0.100
bled+0.098
uddled+0.098
Neg logit contributions
."-0.082
-0.080
 One-0.079
One-0.077
 c-0.076
Token ever
Token ID1683
Feature activation+1.88e-02

Pos logit contributions
uttered+0.065
ied+0.064
enge+0.063
uddled+0.062
ipt+0.061
Neg logit contributions
-0.040
."-0.033
.-0.031
 c-0.029
One-0.029
Token!
Token ID0
Feature activation-4.63e-03

Pos logit contributions
ied+0.048
enge+0.045
uttered+0.045
uddled+0.044
ics+0.044
Neg logit contributions
-0.010
One-0.009
 know-0.008
 One-0.008
 think-0.007
Token<|endoftext|>
Token ID50256
Feature activation-5.08e-03

Pos logit contributions
One+0.004
+0.004
 know+0.004
 pine+0.004
 c+0.004
Neg logit contributions
ied-0.010
uttered-0.009
enge-0.009
apped-0.009
uddled-0.009
TokenGeorge
Token ID20191
Feature activation-3.53e-03

Pos logit contributions
One+0.004
 know+0.004
+0.004
 think+0.003
 One+0.003
Neg logit contributions
ied-0.011
uttered-0.011
enge-0.011
ics-0.010
apped-0.010
Token,
Token ID11
Feature activation-4.62e-03

Pos logit contributions
uttered+0.020
ied+0.019
enge+0.019
box+0.019
uddled+0.019
Neg logit contributions
-0.008
 c-0.008
."-0.007
 know-0.007
 think-0.007
Token thought
Token ID1807
Feature activation-1.77e-02

Pos logit contributions
+0.005
 c+0.004
 think+0.004
 pine+0.004
."+0.004
Neg logit contributions
ied-0.010
uttered-0.010
enge-0.010
ics-0.009
uddled-0.009
Token it
Token ID340
Feature activation+1.89e-02

Pos logit contributions
+0.022
."+0.021
One+0.020
 know+0.020
 One+0.020
Neg logit contributions
enge-0.033
ied-0.032
uttered-0.031
uddled-0.030
apped-0.030
Token was
Token ID373
Feature activation+2.14e-02

Pos logit contributions
ied+0.036
enge+0.035
uddled+0.034
uttered+0.034
apped+0.033
Neg logit contributions
-0.025
One-0.022
 One-0.022
 c-0.021
 think-0.021
Token the
Token ID262
Feature activation+2.70e-02

Pos logit contributions
ied+0.047
uddled+0.047
uttered+0.047
enge+0.045
apped+0.044
Neg logit contributions
-0.023
."-0.021
 One-0.019
One-0.019
 know-0.019
Token best
Token ID1266
Feature activation+5.98e-02

Pos logit contributions
ied+0.041
uttered+0.037
uddled+0.036
enge+0.036
ics+0.034
Neg logit contributions
-0.044
One-0.044
 One-0.042
 know-0.041
 think-0.041
Token rug
Token ID14477
Feature activation+4.85e-02

Pos logit contributions
ied+0.113
enge+0.106
bled+0.104
uddled+0.104
uttered+0.103
Neg logit contributions
."-0.075
-0.073
 One-0.073
One-0.069
 know-0.068
Token ever
Token ID1683
Feature activation+1.51e-02

Pos logit contributions
ipt+0.104
uttered+0.104
outed+0.104
ied+0.103
enge+0.102
Neg logit contributions
-0.052
.-0.046
."-0.045
 c-0.039
,"-0.038
Token.
Token ID13
Feature activation-4.97e-04

Pos logit contributions
ied+0.038
enge+0.036
uttered+0.036
uddled+0.035
ics+0.035
Neg logit contributions
One-0.008
-0.008
 One-0.007
 know-0.007
 think-0.007
Token
Token ID220
Feature activation+2.14e-02

Pos logit contributions
One+0.001
 know+0.001
+0.001
 think+0.001
 c+0.001
Neg logit contributions
ied-0.001
uttered-0.001
enge-0.001
apped-0.001
uddled-0.001
Token
Token ID198
Feature activation+1.83e-02

Pos logit contributions
ied+0.052
enge+0.049
uttered+0.049
uddled+0.048
ics+0.048
Neg logit contributions
-0.013
One-0.013
 know-0.011
 One-0.011
 think-0.011
Token The
Token ID383
Feature activation-1.14e-02

Pos logit contributions
+0.006
One+0.006
 know+0.005
 think+0.005
 c+0.005
Neg logit contributions
ied-0.013
enge-0.012
uttered-0.012
apped-0.012
ics-0.012
Token oct
Token ID19318
Feature activation+1.50e-02

Pos logit contributions
+0.014
One+0.013
 know+0.013
 One+0.013
use+0.012
Neg logit contributions
ied-0.020
enge-0.018
apped-0.018
uttered-0.017
box-0.017
Tokenopus
Token ID25790
Feature activation+1.16e-02

Pos logit contributions
ied+0.026
enge+0.024
uttered+0.024
uddled+0.023
ics+0.023
Neg logit contributions
One-0.019
-0.019
 know-0.019
 One-0.019
 think-0.018
Token had
Token ID550
Feature activation-2.26e-03

Pos logit contributions
ied+0.018
enge+0.016
uttered+0.016
uddled+0.016
apped+0.016
Neg logit contributions
One-0.017
-0.017
 know-0.016
 One-0.016
 think-0.015
Token eight
Token ID3624
Feature activation-5.21e-02

Pos logit contributions
One+0.003
+0.003
 know+0.002
 One+0.002
."+0.002
Neg logit contributions
ied-0.004
enge-0.004
uttered-0.004
uddled-0.004
ics-0.004
Token long
Token ID890
Feature activation-5.64e-02

Pos logit contributions
 Amen+0.074
 know+0.069
 think+0.067
One+0.067
+0.066
Neg logit contributions
ied-0.075
awed-0.067
uttered-0.065
apped-0.065
uddled-0.064
Token arms
Token ID5101
Feature activation-3.24e-02

Pos logit contributions
+0.081
 Amen+0.078
 think+0.074
 know+0.073
One+0.073
Neg logit contributions
ied-0.085
awed-0.078
apped-0.078
uttered-0.076
uddled-0.076
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
+0.033
One+0.032
 think+0.030
 know+0.030
 One+0.028
Neg logit contributions
ied-0.068
uttered-0.063
apped-0.061
uddled-0.060
ics-0.059
Token loved
Token ID6151
Feature activation-8.87e-03

Pos logit contributions
+0.017
One+0.017
 know+0.016
 One+0.016
 think+0.016
Neg logit contributions
ied-0.022
uttered-0.020
enge-0.020
apped-0.020
uddled-0.019
Token to
Token ID284
Feature activation+3.41e-05

Pos logit contributions
+0.014
One+0.014
 know+0.014
 One+0.014
 think+0.013
Neg logit contributions
ied-0.013
enge-0.012
uttered-0.012
uddled-0.011
ics-0.011
Token swim
Token ID9422
Feature activation-1.60e-02

Pos logit contributions
ics+0.000
ied+0.000
uttered+0.000
apped+0.000
enge+0.000
Neg logit contributions
-0.000
One-0.000
ppy-0.000
 Deal-0.000
ulent-0.000
Token It
Token ID632
Feature activation+6.81e-04

Pos logit contributions
One+0.004
+0.004
 know+0.004
 One+0.004
 think+0.004
Neg logit contributions
ied-0.009
uttered-0.008
enge-0.008
apped-0.008
uddled-0.008
Token made
Token ID925
Feature activation-7.71e-03

Pos logit contributions
ied+0.001
enge+0.001
uttered+0.001
ics+0.001
uddled+0.001
Neg logit contributions
-0.001
One-0.001
 know-0.001
 One-0.001
 think-0.001
Token eight
Token ID3624
Feature activation-4.16e-02

Pos logit contributions
 know+0.006
+0.006
 One+0.006
 c+0.006
 think+0.005
Neg logit contributions
ied-0.017
uddled-0.017
enge-0.017
uttered-0.016
box-0.016
Token tentacles
Token ID41192
Feature activation-1.75e-02

Pos logit contributions
One+0.056
 Amen+0.055
 know+0.050
 One+0.050
 observe+0.048
Neg logit contributions
ied-0.067
awed-0.057
uddled-0.057
uttered-0.056
apped-0.056
Token into
Token ID656
Feature activation-1.33e-02

Pos logit contributions
+0.014
One+0.012
 know+0.012
 One+0.012
 think+0.011
Neg logit contributions
ied-0.040
enge-0.039
uttered-0.038
uddled-0.037
ics-0.036
Token eight
Token ID3624
Feature activation-5.56e-02

Pos logit contributions
+0.015
One+0.015
 know+0.014
 One+0.014
 think+0.014
Neg logit contributions
ied-0.025
enge-0.023
uttered-0.023
uddled-0.023
ics-0.022
Token pieces
Token ID5207
Feature activation-1.54e-02

Pos logit contributions
 Amen+0.069
One+0.069
ppy+0.063
 know+0.063
 think+0.062
Neg logit contributions
ied-0.089
ings-0.080
uttered-0.077
awed-0.075
apped-0.075
Token and
Token ID290
Feature activation-9.73e-03

Pos logit contributions
+0.011
One+0.011
 know+0.010
 One+0.010
 think+0.009
Neg logit contributions
ied-0.036
enge-0.034
uttered-0.034
uddled-0.033
ics-0.032
Token put
Token ID1234
Feature activation+4.60e-03

Pos logit contributions
+0.011
 know+0.011
One+0.010
 think+0.010
 One+0.010
Neg logit contributions
ied-0.019
enge-0.018
uttered-0.017
uddled-0.017
ics-0.017
Token them
Token ID606
Feature activation+5.99e-03

Pos logit contributions
uddled+0.009
ied+0.009
enge+0.009
ics+0.009
uttered+0.008
Neg logit contributions
-0.005
."-0.005
 c-0.004
 One-0.004
 pine-0.004
Token inside
Token ID2641
Feature activation-3.72e-04

Pos logit contributions
enge+0.010
ied+0.010
uddled+0.009
uttered+0.009
overed+0.009
Neg logit contributions
-0.008
."-0.007
 c-0.007
One-0.007
 think-0.007
Token and
Token ID290
Feature activation-9.60e-03

Pos logit contributions
One+0.013
+0.012
 One+0.012
 know+0.012
 think+0.011
Neg logit contributions
ied-0.027
uttered-0.025
enge-0.025
uddled-0.024
ics-0.024
Token ch
Token ID442
Feature activation-2.75e-02

Pos logit contributions
One+0.010
+0.010
 One+0.009
 know+0.009
 think+0.009
Neg logit contributions
ied-0.019
enge-0.018
uttered-0.018
ics-0.017
uddled-0.017
Tokenia
Token ID544
Feature activation-1.77e-02

Pos logit contributions
+0.039
One+0.039
 One+0.037
 know+0.035
 think+0.035
Neg logit contributions
ied-0.047
uttered-0.043
enge-0.042
apped-0.041
uddled-0.040
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
One+0.017
 One+0.015
+0.015
 know+0.015
 think+0.015
Neg logit contributions
ied-0.037
uttered-0.034
enge-0.033
ics-0.033
uddled-0.033
Token s
Token ID264
Feature activation-3.67e-02

Pos logit contributions
+0.015
One+0.015
 know+0.015
 One+0.014
 think+0.014
Neg logit contributions
ied-0.028
enge-0.027
uttered-0.026
uddled-0.026
ics-0.025
Tokenesame
Token ID34038
Feature activation-5.54e-02

Pos logit contributions
+0.072
One+0.071
 One+0.070
 know+0.070
 think+0.069
Neg logit contributions
ied-0.042
uttered-0.036
enge-0.036
uddled-0.034
apped-0.033
Token and
Token ID290
Feature activation-1.60e-02

Pos logit contributions
One+0.066
 One+0.064
 know+0.063
+0.062
 think+0.061
Neg logit contributions
ied-0.102
ics-0.092
uttered-0.092
uddled-0.091
enge-0.089
Token sun
Token ID4252
Feature activation-2.58e-02

Pos logit contributions
+0.018
One+0.018
 know+0.017
 One+0.017
 think+0.016
Neg logit contributions
ied-0.031
enge-0.028
uttered-0.028
uddled-0.028
ics-0.027
Tokenflower
Token ID25547
Feature activation-1.93e-02

Pos logit contributions
+0.018
One+0.017
 One+0.016
 know+0.016
."+0.016
Neg logit contributions
ied-0.060
enge-0.057
uttered-0.057
uddled-0.055
ics-0.054
Token and
Token ID290
Feature activation-1.31e-02

Pos logit contributions
+0.017
One+0.016
 know+0.015
 One+0.015
."+0.015
Neg logit contributions
ied-0.042
enge-0.039
uttered-0.039
uddled-0.038
ics-0.038
Token pumpkin
Token ID30089
Feature activation-2.48e-02

Pos logit contributions
+0.015
One+0.015
 know+0.014
 One+0.014
 think+0.013
Neg logit contributions
ied-0.025
enge-0.023
uttered-0.023
uddled-0.023
ics-0.022
Token.
Token ID13
Feature activation-4.66e-03

Pos logit contributions
 know+0.003
One+0.003
+0.003
 think+0.003
 One+0.003
Neg logit contributions
ied-0.008
apped-0.008
enge-0.007
uttered-0.007
ics-0.007
Token The
Token ID383
Feature activation-1.31e-02

Pos logit contributions
+0.005
ppy+0.005
One+0.005
 think+0.005
 know+0.005
Neg logit contributions
ied-0.008
enge-0.008
uttered-0.008
apped-0.008
ics-0.007
Token oct
Token ID19318
Feature activation+1.45e-02

Pos logit contributions
+0.015
One+0.015
 know+0.014
 One+0.014
use+0.014
Neg logit contributions
ied-0.023
apped-0.021
enge-0.021
uttered-0.021
ipt-0.020
Tokenopus
Token ID25790
Feature activation+1.18e-02

Pos logit contributions
ied+0.024
enge+0.022
uttered+0.022
uddled+0.021
apped+0.021
Neg logit contributions
One-0.020
-0.020
 One-0.020
 know-0.020
 think-0.019
Token had
Token ID550
Feature activation+8.26e-04

Pos logit contributions
ied+0.019
uttered+0.017
enge+0.017
uddled+0.017
apped+0.017
Neg logit contributions
One-0.017
-0.016
 know-0.016
 One-0.016
 think-0.015
Token eight
Token ID3624
Feature activation-5.34e-02

Pos logit contributions
ied+0.002
enge+0.002
uddled+0.002
uttered+0.002
overed+0.001
Neg logit contributions
 know-0.001
One-0.001
-0.001
 One-0.001
."-0.001
Token arms
Token ID5101
Feature activation-3.25e-02

Pos logit contributions
 Amen+0.070
 know+0.068
ppy+0.067
 think+0.067
One+0.067
Neg logit contributions
ied-0.080
uttered-0.072
apped-0.071
awed-0.070
uddled-0.068
Token and
Token ID290
Feature activation-9.63e-03

Pos logit contributions
One+0.032
+0.031
 know+0.029
 think+0.029
 One+0.028
Neg logit contributions
ied-0.069
uttered-0.064
apped-0.062
uddled-0.061
ics-0.060
Token could
Token ID714
Feature activation+3.06e-03

Pos logit contributions
+0.013
One+0.013
 know+0.012
 One+0.012
 think+0.012
Neg logit contributions
ied-0.016
uttered-0.015
enge-0.015
apped-0.015
uddled-0.014
Token swim
Token ID9422
Feature activation-1.34e-02

Pos logit contributions
ied+0.006
uttered+0.006
ics+0.006
enge+0.006
apped+0.006
Neg logit contributions
-0.003
One-0.003
 One-0.002
."-0.002
 know-0.002
Token very
Token ID845
Feature activation-5.85e-03

Pos logit contributions
+0.016
One+0.016
 know+0.015
 One+0.015
 think+0.015
Neg logit contributions
ied-0.025
enge-0.023
uttered-0.023
uddled-0.022
ics-0.022
Token a
Token ID257
Feature activation-1.46e-02

Pos logit contributions
+0.006
One+0.006
 know+0.006
 One+0.006
 think+0.006
Neg logit contributions
ied-0.008
uttered-0.008
enge-0.008
uddled-0.007
ics-0.007
Token small
Token ID1402
Feature activation-1.31e-02

Pos logit contributions
 know+0.020
+0.020
r+0.019
One+0.019
 One+0.019
Neg logit contributions
enge-0.020
ied-0.019
uddled-0.019
 manager-0.019
apped-0.018
Token sun
Token ID4252
Feature activation-8.83e-03

Pos logit contributions
One+0.021
+0.020
 know+0.020
 One+0.019
 think+0.019
Neg logit contributions
ied-0.018
enge-0.017
uttered-0.016
apped-0.016
box-0.016
Tokenflower
Token ID25547
Feature activation-1.05e-02

Pos logit contributions
 know+0.010
+0.010
One+0.009
 think+0.009
 One+0.009
Neg logit contributions
ied-0.017
uttered-0.016
apped-0.016
ics-0.016
uddled-0.016
Token called
Token ID1444
Feature activation-2.06e-02

Pos logit contributions
 know+0.011
 think+0.010
 Deal+0.010
use+0.010
 disl+0.010
Neg logit contributions
ied-0.020
apped-0.019
ics-0.019
uttered-0.018
box-0.018
Token Willow
Token ID33021
Feature activation-5.26e-02

Pos logit contributions
 know+0.026
One+0.026
 think+0.025
+0.024
 want+0.023
Neg logit contributions
ied-0.034
uttered-0.033
enge-0.032
apped-0.031
ics-0.031
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
One+0.055
 know+0.054
 One+0.053
 think+0.053
 conclusions+0.053
Neg logit contributions
ied-0.099
apped-0.089
ics-0.088
uddled-0.088
uttered-0.086
Token Willow
Token ID33021
Feature activation-2.24e-02

Pos logit contributions
orse+0.014
 think+0.014
 know+0.014
+0.014
 disl+0.013
Neg logit contributions
uttered-0.018
ied-0.018
apped-0.017
enge-0.017
ics-0.016
Token was
Token ID373
Feature activation-1.55e-02

Pos logit contributions
One+0.039
 Deal+0.039
 One+0.038
 conclusions+0.037
 know+0.036
Neg logit contributions
ied-0.027
uddled-0.024
ics-0.023
uttered-0.023
enge-0.022
Token so
Token ID523
Feature activation-7.07e-03

Pos logit contributions
+0.022
One+0.022
 know+0.021
 One+0.021
 think+0.020
Neg logit contributions
ied-0.025
enge-0.023
uttered-0.023
uddled-0.022
ics-0.022
Token small
Token ID1402
Feature activation-1.20e-02

Pos logit contributions
+0.010
One+0.010
 One+0.009
 conclusions+0.009
use+0.009
Neg logit contributions
ied-0.011
uttered-0.011
ics-0.011
apped-0.010
enge-0.010
Token.
Token ID13
Feature activation-6.22e-03

Pos logit contributions
+0.017
 know+0.017
One+0.015
ppy+0.015
use+0.015
Neg logit contributions
ied-0.042
ics-0.039
apped-0.039
uttered-0.038
box-0.037
Token The
Token ID383
Feature activation-1.14e-02

Pos logit contributions
+0.006
One+0.006
 know+0.005
 think+0.005
 c+0.005
Neg logit contributions
ied-0.013
enge-0.012
uttered-0.012
apped-0.012
ics-0.012
Token oct
Token ID19318
Feature activation+1.50e-02

Pos logit contributions
+0.014
One+0.013
 know+0.013
 One+0.013
use+0.012
Neg logit contributions
ied-0.020
enge-0.018
apped-0.018
uttered-0.017
box-0.017
Tokenopus
Token ID25790
Feature activation+1.16e-02

Pos logit contributions
ied+0.026
enge+0.024
uttered+0.024
uddled+0.023
ics+0.023
Neg logit contributions
One-0.019
-0.019
 know-0.019
 One-0.019
 think-0.018
Token had
Token ID550
Feature activation-2.26e-03

Pos logit contributions
ied+0.018
enge+0.016
uttered+0.016
uddled+0.016
apped+0.016
Neg logit contributions
One-0.017
-0.017
 know-0.016
 One-0.016
 think-0.015
Token eight
Token ID3624
Feature activation-5.21e-02

Pos logit contributions
One+0.003
+0.003
 know+0.002
 One+0.002
."+0.002
Neg logit contributions
ied-0.004
enge-0.004
uttered-0.004
uddled-0.004
ics-0.004
Token long
Token ID890
Feature activation-5.64e-02

Pos logit contributions
 Amen+0.074
 know+0.069
 think+0.067
One+0.067
+0.066
Neg logit contributions
ied-0.075
awed-0.067
uttered-0.065
apped-0.065
uddled-0.064
Token arms
Token ID5101
Feature activation-3.24e-02

Pos logit contributions
+0.081
 Amen+0.078
 think+0.074
 know+0.073
One+0.073
Neg logit contributions
ied-0.085
awed-0.078
apped-0.078
uttered-0.076
uddled-0.076
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
+0.033
One+0.032
 think+0.030
 know+0.030
 One+0.028
Neg logit contributions
ied-0.068
uttered-0.063
apped-0.061
uddled-0.060
ics-0.059
Token loved
Token ID6151
Feature activation-8.87e-03

Pos logit contributions
+0.017
One+0.017
 know+0.016
 One+0.016
 think+0.016
Neg logit contributions
ied-0.022
uttered-0.020
enge-0.020
apped-0.020
uddled-0.019
Token to
Token ID284
Feature activation+3.41e-05

Pos logit contributions
+0.014
One+0.014
 know+0.014
 One+0.014
 think+0.013
Neg logit contributions
ied-0.013
enge-0.012
uttered-0.012
uddled-0.011
ics-0.011
Token a
Token ID257
Feature activation-5.75e-03

Pos logit contributions
+0.013
One+0.012
 know+0.012
."+0.012
 One+0.012
Neg logit contributions
ied-0.025
uttered-0.023
enge-0.023
uddled-0.023
ipt-0.021
Token ch
Token ID442
Feature activation-2.72e-02

Pos logit contributions
+0.007
One+0.007
 know+0.007
 One+0.007
."+0.006
Neg logit contributions
ied-0.010
enge-0.010
uttered-0.009
uddled-0.009
apped-0.009
Tokenubby
Token ID38393
Feature activation-2.17e-02

Pos logit contributions
+0.035
One+0.032
 One+0.030
 know+0.030
 c+0.029
Neg logit contributions
ied-0.053
uttered-0.050
enge-0.050
apped-0.049
overed-0.045
Token girl
Token ID2576
Feature activation-9.37e-03

Pos logit contributions
One+0.026
+0.025
 know+0.024
 One+0.024
 think+0.023
Neg logit contributions
ied-0.040
enge-0.037
uttered-0.037
uddled-0.036
apped-0.035
Token with
Token ID351
Feature activation-1.55e-02

Pos logit contributions
One+0.008
 know+0.008
 One+0.008
 think+0.007
 pine+0.007
Neg logit contributions
ied-0.019
enge-0.017
apped-0.017
box-0.017
uttered-0.017
Token curly
Token ID45731
Feature activation-5.19e-02

Pos logit contributions
+0.019
One+0.019
 know+0.018
 One+0.018
 think+0.017
Neg logit contributions
ied-0.028
enge-0.027
uttered-0.026
uddled-0.026
ics-0.025
Token hair
Token ID4190
Feature activation-3.59e-02

Pos logit contributions
+0.083
One+0.082
 think+0.078
 know+0.078
use+0.073
Neg logit contributions
ied-0.066
apped-0.061
uttered-0.060
uddled-0.060
awed-0.059
Token and
Token ID290
Feature activation-1.21e-02

Pos logit contributions
One+0.030
 know+0.029
+0.029
 think+0.027
 One+0.027
Neg logit contributions
ied-0.078
uddled-0.073
uttered-0.073
enge-0.071
apped-0.071
Token a
Token ID257
Feature activation-1.46e-02

Pos logit contributions
One+0.016
+0.015
 One+0.015
 know+0.014
 think+0.014
Neg logit contributions
ied-0.020
uttered-0.019
enge-0.019
uddled-0.018
apped-0.018
Token big
Token ID1263
Feature activation-7.48e-03

Pos logit contributions
One+0.017
+0.016
 One+0.016
 know+0.016
."+0.015
Neg logit contributions
ied-0.026
uttered-0.025
enge-0.025
uddled-0.024
apped-0.024
Token smile
Token ID8212
Feature activation-2.19e-02

Pos logit contributions
One+0.009
+0.008
 know+0.008
 One+0.008
."+0.007
Neg logit contributions
ied-0.014
enge-0.013
uttered-0.013
apped-0.012
box-0.012
Token says
Token ID1139
Feature activation-4.16e-03

Pos logit contributions
One+0.001
+0.001
 One+0.001
 know+0.001
 think+0.001
Neg logit contributions
ied-0.002
enge-0.002
uttered-0.002
uddled-0.002
ics-0.002
Token:
Token ID25
Feature activation-9.50e-03

Pos logit contributions
One+0.004
+0.004
 know+0.004
 think+0.003
 One+0.003
Neg logit contributions
ied-0.009
uttered-0.008
uddled-0.008
apped-0.008
enge-0.008
Token
Token ID198
Feature activation-1.27e-02

Pos logit contributions
 know+0.014
+0.013
One+0.013
ppy+0.013
 pine+0.013
Neg logit contributions
ied-0.013
uttered-0.013
apped-0.013
overed-0.012
uddled-0.012
Token
Token ID198
Feature activation-1.93e-02

Pos logit contributions
ppy+0.017
bee+0.015
 tul+0.015
 eagle+0.015
mo+0.015
Neg logit contributions
ied-0.019
ings-0.018
ics-0.018
uttered-0.018
ipt-0.018
Token"
Token ID1
Feature activation-8.47e-03

Pos logit contributions
orse+0.020
ppy+0.019
 eagle+0.018
bee+0.018
mo+0.018
Neg logit contributions
ied-0.034
enge-0.033
owner-0.033
ipt-0.032
 clerk-0.032
TokenDr
Token ID6187
Feature activation-5.09e-02

Pos logit contributions
One+0.010
+0.009
 One+0.009
 know+0.009
 think+0.008
Neg logit contributions
ied-0.017
uttered-0.016
enge-0.015
ics-0.015
uddled-0.015
Tokenink
Token ID676
Feature activation-1.19e-02

Pos logit contributions
 One+0.066
 know+0.065
 think+0.064
One+0.063
+0.063
Neg logit contributions
ied-0.083
ics-0.081
ings-0.081
apped-0.075
uddled-0.075
Token some
Token ID617
Feature activation-1.66e-02

Pos logit contributions
One+0.013
+0.012
 One+0.012
 know+0.012
 think+0.011
Neg logit contributions
ied-0.024
enge-0.022
uttered-0.022
ics-0.021
uddled-0.021
Token water
Token ID1660
Feature activation-6.55e-03

Pos logit contributions
One+0.022
+0.021
 One+0.020
 know+0.020
 think+0.019
Neg logit contributions
ied-0.029
enge-0.026
uttered-0.026
uddled-0.026
apped-0.025
Token.
Token ID13
Feature activation+1.57e-03

Pos logit contributions
+0.007
One+0.006
 know+0.006
 One+0.006
 think+0.006
Neg logit contributions
ied-0.013
enge-0.012
uttered-0.012
uddled-0.012
ics-0.012
Token It
Token ID632
Feature activation+2.60e-03

Pos logit contributions
apped+0.002
uttered+0.002
enge+0.002
box+0.002
ied+0.002
Neg logit contributions
One-0.002
 know-0.002
-0.002
 think-0.002
orse-0.002
Token a
Token ID257
Feature activation+2.17e-03

Pos logit contributions
ied+0.004
enge+0.004
uttered+0.004
uddled+0.004
apped+0.003
Neg logit contributions
-0.002
One-0.002
 One-0.002
 know-0.002
 c-0.002
Token pet
Token ID4273
Feature activation-1.55e-02

Pos logit contributions
ied+0.004
uttered+0.004
enge+0.004
uddled+0.004
ics+0.004
Neg logit contributions
-0.002
One-0.002
 One-0.002
 c-0.002
 know-0.002
Token ham
Token ID8891
Feature activation-2.26e-02

Pos logit contributions
+0.017
One+0.016
 One+0.016
 know+0.016
 think+0.015
Neg logit contributions
ied-0.030
enge-0.028
uttered-0.028
uddled-0.027
ics-0.027
Tokenster
Token ID1706
Feature activation-1.75e-02

Pos logit contributions
+0.031
One+0.031
 know+0.030
 One+0.030
 think+0.029
Neg logit contributions
ied-0.037
enge-0.034
uttered-0.034
uddled-0.033
ics-0.032
Token named
Token ID3706
Feature activation-9.78e-03

Pos logit contributions
 know+0.021
One+0.020
+0.020
 think+0.020
 One+0.019
Neg logit contributions
ied-0.032
box-0.029
enge-0.028
uddled-0.028
uttered-0.028
Token Spark
Token ID17732
Feature activation-5.07e-02

Pos logit contributions
One+0.013
 know+0.013
 think+0.013
+0.013
 observe+0.012
Neg logit contributions
ied-0.014
enge-0.013
uttered-0.013
ics-0.013
box-0.012
Tokeny
Token ID88
Feature activation-2.03e-02

Pos logit contributions
+0.082
One+0.082
 know+0.080
 One+0.079
 think+0.077
Neg logit contributions
ied-0.072
enge-0.065
uttered-0.065
uddled-0.062
ics-0.061
Token.
Token ID13
Feature activation-8.43e-03

Pos logit contributions
+0.011
 know+0.011
One+0.011
 think+0.010
 One+0.009
Neg logit contributions
ied-0.049
uttered-0.046
enge-0.046
uddled-0.045
apped-0.045
Token One
Token ID1881
Feature activation-1.36e-02

Pos logit contributions
+0.006
 know+0.006
One+0.006
 think+0.006
 pine+0.005
Neg logit contributions
ied-0.018
uttered-0.018
enge-0.018
apped-0.017
uddled-0.017
Token day
Token ID1110
Feature activation+2.77e-03

Pos logit contributions
+0.015
One+0.015
 One+0.015
 know+0.014
 c+0.014
Neg logit contributions
ied-0.026
enge-0.025
uttered-0.025
uddled-0.025
ics-0.024
Token,
Token ID11
Feature activation+3.30e-03

Pos logit contributions
ied+0.007
enge+0.007
uttered+0.007
ics+0.007
uddled+0.007
Neg logit contributions
-0.001
One-0.001
 One-0.001
 know-0.001
."-0.001
TokenOne
Token ID3198
Feature activation-1.10e-02

Pos logit contributions
 know+0.025
One+0.025
 think+0.025
+0.024
 c+0.023
Neg logit contributions
ied-0.043
uttered-0.041
enge-0.040
apped-0.039
ics-0.039
Token day
Token ID1110
Feature activation-1.51e-02

Pos logit contributions
One+0.016
+0.016
."+0.016
ppy+0.016
use+0.016
Neg logit contributions
ied-0.016
uttered-0.014
enge-0.013
ics-0.013
apped-0.013
Token,
Token ID11
Feature activation+1.13e-04

Pos logit contributions
+0.008
One+0.007
 One+0.007
 know+0.006
."+0.006
Neg logit contributions
ied-0.038
enge-0.037
uttered-0.036
uddled-0.036
ics-0.035
Token Bob
Token ID5811
Feature activation-2.31e-02

Pos logit contributions
ied+0.000
enge+0.000
uddled+0.000
cards+0.000
uttered+0.000
Neg logit contributions
-0.000
One-0.000
 One-0.000
."-0.000
 think-0.000
Token was
Token ID373
Feature activation-1.61e-02

Pos logit contributions
One+0.031
 One+0.029
+0.029
 know+0.028
 think+0.027
Neg logit contributions
ied-0.038
enge-0.034
uttered-0.034
ics-0.033
apped-0.033
Token feeling
Token ID4203
Feature activation-5.04e-02

Pos logit contributions
+0.018
One+0.017
 One+0.017
 know+0.016
 think+0.016
Neg logit contributions
ied-0.032
enge-0.031
uttered-0.030
uddled-0.030
overed-0.028
Token very
Token ID845
Feature activation-9.19e-03

Pos logit contributions
One+0.069
+0.066
 know+0.065
 One+0.065
 think+0.062
Neg logit contributions
ied-0.084
uttered-0.076
enge-0.075
apped-0.074
uddled-0.074
Token enthusiastic
Token ID21848
Feature activation+1.71e-02

Pos logit contributions
One+0.014
+0.013
 One+0.012
 pine+0.012
r+0.012
Neg logit contributions
ied-0.015
uttered-0.014
apped-0.013
uddled-0.013
ics-0.013
Token.
Token ID13
Feature activation-6.28e-03

Pos logit contributions
ied+0.041
enge+0.039
uttered+0.039
uddled+0.038
ics+0.038
Neg logit contributions
-0.011
One-0.011
 One-0.010
."-0.010
 know-0.010
Token He
Token ID679
Feature activation-6.06e-03

Pos logit contributions
+0.007
One+0.007
 know+0.007
 think+0.006
ppy+0.006
Neg logit contributions
ied-0.012
uttered-0.011
enge-0.011
apped-0.010
ics-0.010
Token loved
Token ID6151
Feature activation-7.02e-03

Pos logit contributions
One+0.008
+0.007
 know+0.007
 One+0.007
."+0.007
Neg logit contributions
ied-0.010
enge-0.010
uttered-0.010
uddled-0.009
apped-0.009
Token water
Token ID1660
Feature activation-1.12e-02

Pos logit contributions
 know+0.005
+0.005
One+0.005
 One+0.005
 think+0.005
Neg logit contributions
ied-0.007
enge-0.007
box-0.006
uttered-0.006
ics-0.006
Token.
Token ID13
Feature activation+3.55e-03

Pos logit contributions
+0.004
One+0.004
 One+0.004
."+0.004
 know+0.004
Neg logit contributions
ied-0.029
enge-0.028
uttered-0.028
uddled-0.027
ics-0.027
Token It
Token ID632
Feature activation+1.26e-02

Pos logit contributions
ied+0.007
enge+0.007
uttered+0.007
uddled+0.007
ics+0.007
Neg logit contributions
-0.003
One-0.003
 One-0.003
 know-0.003
."-0.003
Token had
Token ID550
Feature activation+1.15e-02

Pos logit contributions
ied+0.015
uttered+0.014
enge+0.013
uddled+0.013
apped+0.013
Neg logit contributions
One-0.021
 know-0.021
-0.021
 One-0.020
 think-0.020
Token eight
Token ID3624
Feature activation-4.99e-02

Pos logit contributions
ied+0.023
enge+0.021
uttered+0.021
uddled+0.021
ics+0.020
Neg logit contributions
One-0.012
-0.012
 know-0.012
 One-0.012
 think-0.011
Token long
Token ID890
Feature activation-5.01e-02

Pos logit contributions
 Amen+0.073
One+0.070
 know+0.067
ppy+0.066
+0.065
Neg logit contributions
ied-0.071
ings-0.066
awed-0.065
uttered-0.062
umbled-0.062
Token arms
Token ID5101
Feature activation-3.22e-02

Pos logit contributions
+0.070
One+0.067
 Amen+0.065
 know+0.063
 think+0.063
Neg logit contributions
ied-0.079
apped-0.073
awed-0.073
uttered-0.072
uddled-0.072
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
One+0.034
+0.033
 know+0.032
 think+0.031
 One+0.031
Neg logit contributions
ied-0.065
uttered-0.060
apped-0.058
uddled-0.057
ics-0.056
Token a
Token ID257
Feature activation-1.22e-02

Pos logit contributions
One+0.017
+0.017
 One+0.016
 know+0.016
use+0.016
Neg logit contributions
ied-0.020
uttered-0.019
apped-0.019
enge-0.018
uddled-0.018
Token round
Token ID2835
Feature activation-9.62e-03

Pos logit contributions
One+0.014
+0.014
 One+0.014
 know+0.013
."+0.013
Neg logit contributions
ied-0.021
uttered-0.020
enge-0.020
apped-0.020
box-0.019
Token head
Token ID1182
Feature activation-1.51e-02

Pos logit contributions
+0.012
One+0.012
 know+0.012
 One+0.011
 think+0.011
Neg logit contributions
ied-0.017
uttered-0.016
enge-0.016
apped-0.015
uddled-0.015
Token was
Token ID373
Feature activation-2.09e-03

Pos logit contributions
cards+0.013
ied+0.013
enge+0.013
uttered+0.012
uddled+0.012
Neg logit contributions
-0.009
 One-0.008
One-0.007
 know-0.007
 see-0.007
Token a
Token ID257
Feature activation-1.02e-02

Pos logit contributions
+0.002
One+0.002
 One+0.002
 know+0.002
 think+0.002
Neg logit contributions
ied-0.004
enge-0.004
uttered-0.004
uddled-0.004
ics-0.003
Token little
Token ID1310
Feature activation+4.57e-03

Pos logit contributions
+0.014
 know+0.013
One+0.013
 One+0.013
r+0.012
Neg logit contributions
enge-0.015
ied-0.014
uddled-0.014
apped-0.014
 manager-0.014
Token pig
Token ID12967
Feature activation-1.31e-02

Pos logit contributions
enge+0.005
ied+0.005
apped+0.005
uttered+0.005
uddled+0.005
Neg logit contributions
One-0.008
-0.008
 know-0.008
 One-0.008
 think-0.007
Token named
Token ID3706
Feature activation-4.49e-03

Pos logit contributions
+0.015
 Deal+0.015
use+0.014
 know+0.013
One+0.013
Neg logit contributions
ied-0.024
ics-0.023
apped-0.023
enge-0.022
man-0.021
Token Pink
Token ID14657
Feature activation-5.00e-02

Pos logit contributions
+0.006
 know+0.006
One+0.006
 think+0.005
 One+0.005
Neg logit contributions
ied-0.008
uttered-0.007
enge-0.007
ics-0.007
apped-0.006
Tokeny
Token ID88
Feature activation-2.72e-02

Pos logit contributions
+0.111
 Deal+0.109
 One+0.104
 think+0.103
One+0.102
Neg logit contributions
ied-0.039
ics-0.028
apped-0.026
man-0.025
uttered-0.024
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
 Deal+0.038
+0.037
use+0.037
 know+0.036
 think+0.034
Neg logit contributions
apped-0.040
man-0.039
ied-0.039
uddled-0.036
keeper-0.036
Token Pink
Token ID14657
Feature activation-2.78e-02

Pos logit contributions
+0.011
 know+0.011
 think+0.010
One+0.010
orse+0.009
Neg logit contributions
ied-0.020
uttered-0.020
enge-0.019
apped-0.019
uddled-0.018
Tokeny
Token ID88
Feature activation+6.72e-03

Pos logit contributions
+0.052
One+0.051
 One+0.050
 know+0.049
 think+0.049
Neg logit contributions
ied-0.033
uttered-0.028
enge-0.028
ics-0.027
apped-0.027
Token loved
Token ID6151
Feature activation-3.52e-03

Pos logit contributions
ied+0.008
apped+0.007
uttered+0.007
uddled+0.007
enge+0.007
Neg logit contributions
 know-0.011
One-0.011
-0.011
 conclusions-0.010
use-0.010
Token the
Token ID262
Feature activation-4.16e-03

Pos logit contributions
One+0.004
+0.004
 One+0.004
 know+0.004
 think+0.003
Neg logit contributions
ied-0.007
enge-0.006
uttered-0.006
uddled-0.006
apped-0.006
Token water
Token ID1660
Feature activation-1.12e-02

Pos logit contributions
 know+0.005
+0.005
One+0.005
 One+0.005
 think+0.005
Neg logit contributions
ied-0.007
enge-0.007
box-0.006
uttered-0.006
ics-0.006
Token.
Token ID13
Feature activation+3.55e-03

Pos logit contributions
+0.004
One+0.004
 One+0.004
."+0.004
 know+0.004
Neg logit contributions
ied-0.029
enge-0.028
uttered-0.028
uddled-0.027
ics-0.027
Token It
Token ID632
Feature activation+1.26e-02

Pos logit contributions
ied+0.007
enge+0.007
uttered+0.007
uddled+0.007
ics+0.007
Neg logit contributions
-0.003
One-0.003
 One-0.003
 know-0.003
."-0.003
Token had
Token ID550
Feature activation+1.15e-02

Pos logit contributions
ied+0.015
uttered+0.014
enge+0.013
uddled+0.013
apped+0.013
Neg logit contributions
One-0.021
 know-0.021
-0.021
 One-0.020
 think-0.020
Token eight
Token ID3624
Feature activation-4.99e-02

Pos logit contributions
ied+0.023
enge+0.021
uttered+0.021
uddled+0.021
ics+0.020
Neg logit contributions
One-0.012
-0.012
 know-0.012
 One-0.012
 think-0.011
Token long
Token ID890
Feature activation-5.01e-02

Pos logit contributions
 Amen+0.073
One+0.070
 know+0.067
ppy+0.066
+0.065
Neg logit contributions
ied-0.071
ings-0.066
awed-0.065
uttered-0.062
umbled-0.062
Token arms
Token ID5101
Feature activation-3.22e-02

Pos logit contributions
+0.070
One+0.067
 Amen+0.065
 know+0.063
 think+0.063
Neg logit contributions
ied-0.079
apped-0.073
awed-0.073
uttered-0.072
uddled-0.072
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
One+0.034
+0.033
 know+0.032
 think+0.031
 One+0.031
Neg logit contributions
ied-0.065
uttered-0.060
apped-0.058
uddled-0.057
ics-0.056
Token a
Token ID257
Feature activation-1.22e-02

Pos logit contributions
One+0.017
+0.017
 One+0.016
 know+0.016
use+0.016
Neg logit contributions
ied-0.020
uttered-0.019
apped-0.019
enge-0.018
uddled-0.018
Token round
Token ID2835
Feature activation-9.62e-03

Pos logit contributions
One+0.014
+0.014
 One+0.014
 know+0.013
."+0.013
Neg logit contributions
ied-0.021
uttered-0.020
enge-0.020
apped-0.020
box-0.019
Token a
Token ID257
Feature activation-1.56e-02

Pos logit contributions
+0.007
One+0.007
 know+0.006
 One+0.006
 think+0.006
Neg logit contributions
ied-0.009
enge-0.008
uttered-0.008
uddled-0.008
ics-0.008
Token unique
Token ID3748
Feature activation-7.32e-03

Pos logit contributions
 know+0.022
+0.021
r+0.021
One+0.021
 One+0.021
Neg logit contributions
enge-0.021
 manager-0.020
uddled-0.019
ied-0.019
 clerk-0.019
Token little
Token ID1310
Feature activation+3.36e-03

Pos logit contributions
One+0.009
+0.009
 know+0.009
 One+0.009
 think+0.008
Neg logit contributions
ied-0.013
enge-0.012
uttered-0.012
uddled-0.011
apped-0.011
Token girl
Token ID2576
Feature activation-1.04e-02

Pos logit contributions
ied+0.004
enge+0.004
uttered+0.004
apped+0.004
uddled+0.004
Neg logit contributions
One-0.006
 know-0.006
-0.005
 One-0.005
 think-0.005
Token named
Token ID3706
Feature activation-6.48e-03

Pos logit contributions
 Deal+0.017
 fiber+0.016
 know+0.015
 conclusions+0.015
One+0.014
Neg logit contributions
ied-0.014
apped-0.013
enge-0.012
box-0.012
uttered-0.011
Token Elsa
Token ID19226
Feature activation-4.97e-02

Pos logit contributions
orse+0.013
oc+0.012
 eagle+0.012
 know+0.011
 pine+0.011
Neg logit contributions
ied-0.007
uttered-0.006
enge-0.005
ics-0.005
uddled-0.004
Token.
Token ID13
Feature activation-1.47e-02

Pos logit contributions
 Deal+0.071
 conclusions+0.069
 fiber+0.060
 pine+0.059
ulent+0.059
Neg logit contributions
ied-0.064
man-0.062
apped-0.059
box-0.059
 lasted-0.057
Token She
Token ID1375
Feature activation-9.08e-03

Pos logit contributions
 know+0.016
orse+0.016
+0.016
 think+0.015
 pine+0.015
Neg logit contributions
ied-0.027
uttered-0.025
enge-0.025
apped-0.024
uddled-0.024
Token went
Token ID1816
Feature activation-7.38e-03

Pos logit contributions
 know+0.014
One+0.014
+0.013
 conclusions+0.013
 One+0.013
Neg logit contributions
ied-0.013
uddled-0.012
enge-0.011
uttered-0.011
apped-0.011
Token for
Token ID329
Feature activation-3.17e-03

Pos logit contributions
One+0.011
+0.011
 One+0.011
 know+0.011
 think+0.010
Neg logit contributions
ied-0.011
enge-0.010
uttered-0.010
uddled-0.010
ics-0.010
Token a
Token ID257
Feature activation-5.60e-03

Pos logit contributions
+0.005
One+0.004
."+0.004
 One+0.004
 know+0.004
Neg logit contributions
ied-0.004
enge-0.004
ics-0.004
uttered-0.004
uddled-0.004
Token a
Token ID257
Feature activation-5.91e-03

Pos logit contributions
."+0.001
+0.001
One+0.001
 One+0.001
 know+0.001
Neg logit contributions
ied-0.002
uttered-0.002
enge-0.002
uddled-0.002
ics-0.002
Token short
Token ID1790
Feature activation-2.29e-02

Pos logit contributions
+0.008
One+0.008
 know+0.008
 One+0.008
."+0.007
Neg logit contributions
ied-0.010
enge-0.009
uttered-0.009
uddled-0.009
ics-0.009
Token tail
Token ID7894
Feature activation-2.94e-02

Pos logit contributions
+0.034
One+0.034
 know+0.033
 One+0.033
 think+0.032
Neg logit contributions
ied-0.034
uttered-0.031
enge-0.031
uddled-0.030
apped-0.030
Token and
Token ID290
Feature activation-1.58e-02

Pos logit contributions
One+0.020
 know+0.018
 One+0.018
+0.017
 think+0.017
Neg logit contributions
ied-0.068
enge-0.063
uttered-0.063
box-0.063
apped-0.063
Token big
Token ID1263
Feature activation-1.72e-02

Pos logit contributions
One+0.021
+0.020
 know+0.019
 One+0.019
 think+0.018
Neg logit contributions
ied-0.026
enge-0.025
uttered-0.025
apped-0.024
uddled-0.023
Token cheeks
Token ID25839
Feature activation-4.95e-02

Pos logit contributions
One+0.026
+0.024
 know+0.023
 One+0.023
ppy+0.021
Neg logit contributions
ied-0.024
box-0.023
apped-0.023
enge-0.023
uttered-0.023
Token.
Token ID13
Feature activation-9.09e-03

Pos logit contributions
One+0.029
 know+0.027
+0.026
 One+0.026
 think+0.025
Neg logit contributions
ied-0.120
ics-0.112
uttered-0.111
enge-0.111
uddled-0.109
Token That
Token ID1320
Feature activation-2.22e-03

Pos logit contributions
One+0.014
 know+0.013
 pine+0.013
orse+0.013
 think+0.012
Neg logit contributions
enge-0.013
apped-0.013
uttered-0.012
ied-0.011
overed-0.010
Token is
Token ID318
Feature activation-9.55e-03

Pos logit contributions
+0.003
."+0.002
One+0.002
 One+0.002
 know+0.002
Neg logit contributions
ied-0.005
uttered-0.004
uddled-0.004
enge-0.004
overed-0.004
Token a
Token ID257
Feature activation+6.52e-04

Pos logit contributions
+0.011
One+0.010
."+0.010
 know+0.010
 think+0.010
Neg logit contributions
ied-0.019
uttered-0.019
uddled-0.018
enge-0.018
umbled-0.017
Token ham
Token ID8891
Feature activation-1.41e-02

Pos logit contributions
ied+0.001
uttered+0.001
enge+0.001
uddled+0.001
ics+0.001
Neg logit contributions
-0.001
One-0.001
 know-0.001
 One-0.001
 c-0.001
Token a
Token ID257
Feature activation+2.91e-03

Pos logit contributions
ied+0.016
enge+0.015
uttered+0.015
ics+0.015
uddled+0.014
Neg logit contributions
-0.015
One-0.015
 know-0.014
 One-0.014
 think-0.014
Token time
Token ID640
Feature activation+1.99e-02

Pos logit contributions
ied+0.004
enge+0.003
uttered+0.003
uddled+0.003
ics+0.003
Neg logit contributions
 know-0.005
."-0.004
 c-0.004
-0.004
 pine-0.004
Token,
Token ID11
Feature activation+8.95e-03

Pos logit contributions
ied+0.049
enge+0.046
uttered+0.046
uddled+0.045
ics+0.045
Neg logit contributions
One-0.012
-0.011
 know-0.011
 One-0.010
 think-0.010
Token Mark
Token ID2940
Feature activation-2.05e-02

Pos logit contributions
ied+0.015
uttered+0.014
enge+0.013
uddled+0.013
ics+0.013
Neg logit contributions
-0.012
One-0.012
 know-0.012
 One-0.012
 think-0.011
Token was
Token ID373
Feature activation-1.25e-02

Pos logit contributions
One+0.027
+0.025
 One+0.025
 know+0.024
."+0.024
Neg logit contributions
ied-0.036
enge-0.031
ics-0.031
uttered-0.031
uddled-0.030
Token feeling
Token ID4203
Feature activation-4.92e-02

Pos logit contributions
+0.013
One+0.012
 One+0.012
 know+0.012
 think+0.012
Neg logit contributions
ied-0.026
enge-0.025
uddled-0.024
uttered-0.024
owner-0.023
Token a
Token ID257
Feature activation+2.01e-03

Pos logit contributions
One+0.067
+0.065
 One+0.063
 know+0.063
 think+0.060
Neg logit contributions
ied-0.083
uttered-0.075
enge-0.074
apped-0.072
uddled-0.072
Token bit
Token ID1643
Feature activation-2.18e-02

Pos logit contributions
enge+0.003
ied+0.003
uttered+0.003
uddled+0.003
apped+0.003
Neg logit contributions
One-0.002
-0.002
 know-0.002
 One-0.002
."-0.002
Token bored
Token ID17533
Feature activation-1.14e-02

Pos logit contributions
One+0.036
+0.032
orse+0.032
 One+0.032
use+0.031
Neg logit contributions
ied-0.030
uttered-0.026
apped-0.026
outed-0.025
enge-0.024
Token.
Token ID13
Feature activation-4.99e-03

Pos logit contributions
One+0.008
 know+0.007
+0.007
 think+0.006
 One+0.006
Neg logit contributions
ied-0.026
enge-0.024
uttered-0.024
box-0.024
uddled-0.024
Token He
Token ID679
Feature activation-2.08e-03

Pos logit contributions
+0.006
One+0.006
ppy+0.005
orse+0.005
 c+0.005
Neg logit contributions
ied-0.009
enge-0.008
uttered-0.008
box-0.008
ics-0.008
Token desert
Token ID10326
Feature activation-7.43e-03

Pos logit contributions
 know+0.014
One+0.014
+0.013
 One+0.013
."+0.013
Neg logit contributions
ied-0.020
enge-0.019
uttered-0.018
box-0.018
uddled-0.018
Token,
Token ID11
Feature activation-8.66e-03

Pos logit contributions
+0.004
One+0.004
 One+0.003
 know+0.003
."+0.003
Neg logit contributions
ied-0.019
enge-0.018
uttered-0.018
uddled-0.017
apped-0.017
Token with
Token ID351
Feature activation-4.87e-03

Pos logit contributions
+0.009
 c+0.009
 think+0.008
One+0.008
 know+0.008
Neg logit contributions
ied-0.018
enge-0.018
uttered-0.017
uddled-0.016
apped-0.016
Token lots
Token ID6041
Feature activation-3.33e-02

Pos logit contributions
+0.006
One+0.006
 c+0.006
 know+0.005
 One+0.005
Neg logit contributions
ied-0.009
enge-0.009
uddled-0.009
uttered-0.009
overed-0.008
Token of
Token ID286
Feature activation+3.68e-03

Pos logit contributions
One+0.054
+0.051
 One+0.051
 think+0.051
 know+0.050
Neg logit contributions
ied-0.050
uttered-0.042
uddled-0.042
enge-0.040
overed-0.038
Token c
Token ID269
Feature activation-4.92e-02

Pos logit contributions
ied+0.007
enge+0.006
uttered+0.006
uddled+0.006
ics+0.006
Neg logit contributions
-0.005
One-0.004
 One-0.004
 know-0.004
."-0.004
Tokenact
Token ID529
Feature activation-2.27e-02

Pos logit contributions
+0.077
One+0.073
 think+0.071
 know+0.071
 One+0.070
Neg logit contributions
ied-0.084
uddled-0.080
uttered-0.076
enge-0.072
awed-0.070
Tokenuses
Token ID2664
Feature activation-2.20e-02

Pos logit contributions
One+0.033
 One+0.031
+0.031
 know+0.030
 think+0.028
Neg logit contributions
ied-0.043
uttered-0.036
uddled-0.035
ics-0.035
awed-0.033
Token all
Token ID477
Feature activation+6.55e-03

Pos logit contributions
One+0.024
 know+0.020
 One+0.020
 think+0.019
 Amen+0.019
Neg logit contributions
ied-0.045
uttered-0.040
ics-0.040
uddled-0.039
apped-0.039
Token around
Token ID1088
Feature activation-9.45e-04

Pos logit contributions
ied+0.012
enge+0.011
uttered+0.011
uddled+0.011
apped+0.011
Neg logit contributions
-0.008
One-0.008
 One-0.007
 know-0.007
."-0.007
Token her
Token ID607
Feature activation-2.33e-02

Pos logit contributions
One+0.001
+0.001
 know+0.001
 One+0.001
 think+0.001
Neg logit contributions
ied-0.002
uttered-0.002
enge-0.002
uddled-0.002
ics-0.002
Token a
Token ID257
Feature activation+1.22e-03

Pos logit contributions
+0.013
 know+0.012
 c+0.012
 think+0.012
 pine+0.012
Neg logit contributions
ied-0.025
enge-0.025
uddled-0.024
uttered-0.024
ics-0.023
Token big
Token ID1263
Feature activation+1.48e-03

Pos logit contributions
ied+0.002
uttered+0.002
enge+0.002
uddled+0.002
ics+0.002
Neg logit contributions
-0.001
One-0.001
 know-0.001
 One-0.001
 think-0.001
Token sle
Token ID3133
Feature activation-6.86e-03

Pos logit contributions
ied+0.003
enge+0.003
uttered+0.003
uddled+0.002
ics+0.002
Neg logit contributions
One-0.002
-0.002
 know-0.002
 One-0.002
 think-0.001
Tokenigh
Token ID394
Feature activation-2.25e-02

Pos logit contributions
+0.007
One+0.007
 know+0.006
 One+0.006
 think+0.006
Neg logit contributions
ied-0.014
enge-0.013
uttered-0.013
uddled-0.013
ics-0.013
Token with
Token ID351
Feature activation-2.89e-03

Pos logit contributions
+0.022
One+0.020
."+0.019
 One+0.018
 know+0.018
Neg logit contributions
ied-0.047
enge-0.046
uttered-0.045
uddled-0.043
apped-0.042
Token eight
Token ID3624
Feature activation-4.89e-02

Pos logit contributions
+0.003
One+0.003
 One+0.003
 know+0.003
 c+0.003
Neg logit contributions
ied-0.006
enge-0.005
uddled-0.005
uttered-0.005
ics-0.005
Token re
Token ID302
Feature activation-1.34e-02

Pos logit contributions
 think+0.067
One+0.067
ppy+0.065
 know+0.065
 appreciate+0.064
Neg logit contributions
ied-0.068
box-0.062
ings-0.061
awed-0.060
uttered-0.057
Tokenind
Token ID521
Feature activation-1.12e-02

Pos logit contributions
+0.020
 One+0.019
One+0.019
 know+0.018
 think+0.018
Neg logit contributions
ied-0.022
uttered-0.020
enge-0.020
ics-0.019
uddled-0.019
Tokeneer
Token ID28153
Feature activation-2.29e-02

Pos logit contributions
One+0.017
+0.017
 One+0.015
 think+0.015
 know+0.014
Neg logit contributions
ied-0.019
uttered-0.016
enge-0.016
apped-0.016
umbled-0.015
Token.
Token ID13
Feature activation-5.85e-03

Pos logit contributions
One+0.021
 One+0.019
 know+0.019
+0.019
 think+0.018
Neg logit contributions
ied-0.049
uttered-0.046
uddled-0.044
ics-0.044
enge-0.044
Token The
Token ID383
Feature activation-2.04e-02

Pos logit contributions
+0.005
 One+0.005
One+0.005
 know+0.005
."+0.005
Neg logit contributions
ied-0.013
enge-0.012
ics-0.012
uddled-0.012
uttered-0.012
Token big
Token ID1263
Feature activation-1.78e-02

Pos logit contributions
+0.011
One+0.011
 One+0.011
 know+0.011
."+0.010
Neg logit contributions
ied-0.019
enge-0.018
uttered-0.017
uddled-0.017
apped-0.016
Token and
Token ID290
Feature activation-7.69e-03

Pos logit contributions
One+0.022
 know+0.021
+0.020
 One+0.019
orse+0.019
Neg logit contributions
ied-0.030
enge-0.029
box-0.029
uddled-0.027
apped-0.027
Token brown
Token ID7586
Feature activation-2.37e-02

Pos logit contributions
One+0.008
+0.008
 know+0.008
 One+0.007
."+0.007
Neg logit contributions
ied-0.015
enge-0.014
uttered-0.013
ics-0.013
uddled-0.013
Token,
Token ID11
Feature activation-1.26e-02

Pos logit contributions
One+0.017
+0.016
 know+0.016
 One+0.014
 think+0.014
Neg logit contributions
ied-0.055
enge-0.052
uttered-0.052
uddled-0.051
apped-0.050
Token with
Token ID351
Feature activation-4.82e-03

Pos logit contributions
+0.012
One+0.011
 One+0.011
 know+0.010
 c+0.010
Neg logit contributions
ied-0.027
enge-0.026
uttered-0.026
uddled-0.025
ics-0.025
Token long
Token ID890
Feature activation-4.89e-02

Pos logit contributions
+0.005
One+0.005
 know+0.005
 One+0.005
 think+0.005
Neg logit contributions
ied-0.009
enge-0.009
uttered-0.009
uddled-0.008
ics-0.008
Token ears
Token ID11368
Feature activation-3.80e-02

Pos logit contributions
+0.072
One+0.069
 think+0.067
 know+0.065
 conclusions+0.062
Neg logit contributions
ied-0.073
ings-0.066
uddled-0.064
awed-0.064
box-0.063
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
One+0.037
 think+0.035
 know+0.034
+0.034
 One+0.031
Neg logit contributions
ied-0.079
uttered-0.074
uddled-0.071
ics-0.071
apped-0.070
Token a
Token ID257
Feature activation-1.83e-02

Pos logit contributions
One+0.017
+0.016
use+0.016
 One+0.015
ppy+0.015
Neg logit contributions
ied-0.020
uttered-0.019
apped-0.018
enge-0.018
cards-0.017
Token fuzzy
Token ID34669
Feature activation-2.12e-02

Pos logit contributions
One+0.024
+0.023
 One+0.022
ppy+0.021
r+0.021
Neg logit contributions
ied-0.027
box-0.026
uttered-0.026
enge-0.025
apped-0.025
Token tail
Token ID7894
Feature activation-2.48e-02

Pos logit contributions
+0.029
One+0.029
 know+0.026
 think+0.025
 One+0.025
Neg logit contributions
ied-0.034
uttered-0.033
apped-0.032
uddled-0.031
enge-0.031
Tokenflake
Token ID47597
Feature activation+1.91e-03

Pos logit contributions
ied+0.016
enge+0.014
uttered+0.014
uddled+0.013
ics+0.013
Neg logit contributions
-0.025
."-0.024
One-0.024
 c-0.024
 think-0.024
Token is
Token ID318
Feature activation-1.29e-02

Pos logit contributions
ied+0.003
uttered+0.003
ics+0.003
uddled+0.003
enge+0.003
Neg logit contributions
One-0.002
 One-0.002
-0.002
 know-0.002
 think-0.002
Token small
Token ID1402
Feature activation-8.33e-03

Pos logit contributions
+0.015
One+0.015
 know+0.015
 One+0.015
 think+0.014
Neg logit contributions
ied-0.024
enge-0.022
uttered-0.022
uddled-0.022
ics-0.021
Token and
Token ID290
Feature activation-8.51e-03

Pos logit contributions
One+0.008
+0.007
 know+0.007
 One+0.007
 think+0.007
Neg logit contributions
ied-0.017
enge-0.016
uttered-0.016
uddled-0.016
box-0.016
Token has
Token ID468
Feature activation-3.31e-03

Pos logit contributions
One+0.010
+0.009
 know+0.009
 One+0.009
 think+0.008
Neg logit contributions
ied-0.016
enge-0.015
uttered-0.015
uddled-0.014
apped-0.014
Token eight
Token ID3624
Feature activation-4.83e-02

Pos logit contributions
+0.004
One+0.004
 know+0.004
 One+0.004
 think+0.004
Neg logit contributions
ied-0.006
enge-0.006
uttered-0.006
uddled-0.006
ics-0.005
Token points
Token ID2173
Feature activation-1.54e-02

Pos logit contributions
 know+0.070
One+0.070
 Amen+0.069
 think+0.068
 observe+0.066
Neg logit contributions
ied-0.063
ings-0.060
awed-0.059
box-0.056
uddled-0.053
Token.
Token ID13
Feature activation-6.85e-03

Pos logit contributions
+0.013
One+0.012
 know+0.012
 One+0.012
."+0.011
Neg logit contributions
ied-0.034
enge-0.032
uttered-0.032
uddled-0.032
ics-0.031
Token They
Token ID1119
Feature activation-1.09e-02

Pos logit contributions
 One+0.007
One+0.006
+0.006
 know+0.006
."+0.006
Neg logit contributions
ied-0.015
enge-0.014
uttered-0.013
ics-0.013
uddled-0.013
Token are
Token ID389
Feature activation-1.26e-02

Pos logit contributions
emy+0.015
One+0.015
unn+0.014
ulent+0.014
orse+0.014
Neg logit contributions
 lasted-0.013
ics-0.013
atively-0.012
uttered-0.012
ied-0.012
Token both
Token ID1111
Feature activation-9.68e-03

Pos logit contributions
One+0.017
+0.017
 One+0.016
 know+0.016
 think+0.015
Neg logit contributions
ied-0.021
uttered-0.019
enge-0.019
ics-0.019
apped-0.018
Token body
Token ID1767
Feature activation-5.51e-03

Pos logit contributions
ied+0.008
enge+0.008
uttered+0.008
uddled+0.007
ics+0.007
Neg logit contributions
-0.005
One-0.005
 know-0.004
 One-0.004
 think-0.004
Token.
Token ID13
Feature activation+4.00e-03

Pos logit contributions
One+0.004
+0.004
 One+0.004
 know+0.004
 think+0.004
Neg logit contributions
ied-0.012
uttered-0.012
enge-0.012
uddled-0.011
ics-0.011
Token It
Token ID632
Feature activation+9.08e-03

Pos logit contributions
ied+0.007
uttered+0.007
apped+0.007
enge+0.007
uddled+0.007
Neg logit contributions
One-0.004
-0.004
 know-0.004
 think-0.004
 c-0.004
Token felt
Token ID2936
Feature activation-5.28e-03

Pos logit contributions
ied+0.014
enge+0.014
uttered+0.013
uddled+0.013
ics+0.013
Neg logit contributions
-0.014
One-0.013
 know-0.013
 One-0.013
 think-0.013
Token so
Token ID523
Feature activation+1.71e-02

Pos logit contributions
+0.006
One+0.006
 know+0.006
 One+0.006
 think+0.006
Neg logit contributions
ied-0.010
enge-0.009
uttered-0.009
uddled-0.009
ics-0.009
Token good
Token ID922
Feature activation+3.65e-02

Pos logit contributions
ied+0.027
uttered+0.025
apped+0.023
uddled+0.023
overed+0.022
Neg logit contributions
One-0.025
use-0.024
ppy-0.023
 tul-0.023
 pine-0.023
Token!
Token ID0
Feature activation+5.98e-03

Pos logit contributions
ied+0.078
enge+0.076
uttered+0.075
ics+0.072
apped+0.070
Neg logit contributions
-0.032
."-0.030
 know-0.028
One-0.028
 think-0.028
Token
Token ID220
Feature activation+2.16e-02

Pos logit contributions
ied+0.012
uttered+0.011
enge+0.011
apped+0.011
uddled+0.011
Neg logit contributions
One-0.006
-0.006
 know-0.006
 think-0.006
 c-0.006
Token
Token ID198
Feature activation+2.07e-02

Pos logit contributions
ied+0.051
uttered+0.048
enge+0.048
uddled+0.047
ics+0.047
Neg logit contributions
One-0.014
-0.014
 know-0.012
 One-0.012
 think-0.012
Token
Token ID198
Feature activation+2.72e-03

Pos logit contributions
ied+0.039
ics+0.036
uttered+0.036
uddled+0.035
apped+0.034
Neg logit contributions
 pine-0.020
-0.020
use-0.020
 c-0.020
 tul-0.020
TokenPretty
Token ID35700
Feature activation+2.17e-03

Pos logit contributions
ied+0.005
enge+0.004
uddled+0.004
ics+0.004
uttered+0.004
Neg logit contributions
 know-0.003
,"-0.003
."-0.003
 One-0.003
 pine-0.003
Token the
Token ID262
Feature activation+1.69e-03

Pos logit contributions
ied+0.003
enge+0.003
uttered+0.003
ics+0.003
uddled+0.003
Neg logit contributions
-0.002
One-0.002
 know-0.002
 One-0.002
 think-0.002
Token barking
Token ID48961
Feature activation+1.03e-03

Pos logit contributions
ied+0.003
uttered+0.003
enge+0.003
uddled+0.003
apped+0.003
Neg logit contributions
One-0.002
 know-0.002
-0.002
 think-0.002
 c-0.002
Token dog
Token ID3290
Feature activation+1.57e-02

Pos logit contributions
enge+0.002
ied+0.002
box+0.002
uttered+0.002
ics+0.002
Neg logit contributions
."-0.001
.-0.001
-0.001
 c-0.001
 know-0.001
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
enge+0.032
ied+0.032
uttered+0.031
overed+0.030
apped+0.030
Neg logit contributions
-0.018
 c-0.016
."-0.015
 know-0.015
 One-0.015
Token The
Token ID383
Feature activation-9.08e-03

Pos logit contributions
ied+0.025
enge+0.023
uttered+0.023
ics+0.023
uddled+0.022
Neg logit contributions
 One-0.013
-0.013
One-0.012
 know-0.012
."-0.012
Token man
Token ID582
Feature activation+3.71e-02

Pos logit contributions
One+0.014
+0.014
 know+0.014
 One+0.013
 think+0.013
Neg logit contributions
ied-0.014
uttered-0.013
enge-0.013
uddled-0.012
ics-0.012
Token picked
Token ID6497
Feature activation+2.27e-02

Pos logit contributions
ied+0.081
enge+0.081
uttered+0.079
ics+0.079
rol+0.079
Neg logit contributions
-0.036
 c-0.033
 know-0.030
 think-0.030
,"-0.028
Token her
Token ID607
Feature activation+8.79e-03

Pos logit contributions
uddled+0.038
uttered+0.037
enge+0.036
ics+0.035
box+0.035
Neg logit contributions
-0.035
 c-0.031
 One-0.031
."-0.030
 know-0.029
Token up
Token ID510
Feature activation+2.53e-02

Pos logit contributions
ied+0.012
uttered+0.012
enge+0.012
apped+0.011
uddled+0.011
Neg logit contributions
-0.015
 think-0.015
 c-0.015
 pine-0.014
."-0.014
Token and
Token ID290
Feature activation+1.52e-02

Pos logit contributions
ied+0.069
ics+0.069
uttered+0.068
enge+0.067
uddled+0.066
Neg logit contributions
-0.008
 c-0.008
 pine-0.008
.-0.007
."-0.007
Token held
Token ID2714
Feature activation+1.88e-02

Pos logit contributions
ied+0.033
uttered+0.031
umbled+0.030
enge+0.030
apped+0.030
Neg logit contributions
 c-0.016
-0.015
 think-0.014
 pine-0.014
 know-0.013
Token house
Token ID2156
Feature activation+1.63e-02

Pos logit contributions
ied+0.032
uttered+0.029
enge+0.029
overed+0.028
ics+0.028
Neg logit contributions
."-0.021
 c-0.021
-0.021
 know-0.020
 love-0.020
Token looks
Token ID3073
Feature activation+8.60e-03

Pos logit contributions
ied+0.033
uttered+0.032
enge+0.032
overed+0.031
apped+0.031
Neg logit contributions
-0.018
."-0.017
 know-0.016
,"-0.015
 think-0.015
Token so
Token ID523
Feature activation+2.34e-02

Pos logit contributions
ied+0.016
uttered+0.015
enge+0.015
ics+0.014
uddled+0.014
Neg logit contributions
-0.011
One-0.010
 One-0.010
 know-0.010
."-0.010
Token tidy
Token ID43044
Feature activation+2.08e-02

Pos logit contributions
ied+0.044
enge+0.041
uttered+0.041
uddled+0.039
ics+0.039
Neg logit contributions
-0.027
One-0.027
 know-0.026
 One-0.026
 think-0.025
Token!"
Token ID2474
Feature activation+6.06e-03

Pos logit contributions
ied+0.053
uttered+0.051
enge+0.051
uddled+0.049
apped+0.049
Neg logit contributions
-0.010
One-0.010
."-0.009
 One-0.009
 know-0.008
Token Grand
Token ID5675
Feature activation+3.84e-02

Pos logit contributions
ied+0.013
uttered+0.012
enge+0.012
ics+0.012
rol+0.012
Neg logit contributions
 One-0.006
-0.006
."-0.006
 know-0.005
One-0.005
Tokenma
Token ID2611
Feature activation+2.81e-02

Pos logit contributions
ied+0.069
uttered+0.068
enge+0.065
overed+0.064
ics+0.061
Neg logit contributions
-0.048
One-0.047
 One-0.044
."-0.044
,"-0.044
Token smiled
Token ID13541
Feature activation+9.22e-03

Pos logit contributions
ied+0.055
enge+0.053
uttered+0.051
ics+0.050
overed+0.048
Neg logit contributions
-0.034
 c-0.030
 know-0.030
 think-0.029
 One-0.029
Token and
Token ID290
Feature activation+6.29e-03

Pos logit contributions
ied+0.021
enge+0.020
uttered+0.020
ics+0.019
uddled+0.019
Neg logit contributions
-0.007
One-0.007
 One-0.007
 know-0.006
."-0.006
Token said
Token ID531
Feature activation+1.31e-02

Pos logit contributions
ied+0.014
enge+0.013
uttered+0.013
ics+0.013
ings+0.013
Neg logit contributions
 c-0.006
-0.006
 know-0.005
 think-0.005
 see-0.005
Token,
Token ID11
Feature activation+1.61e-02

Pos logit contributions
ied+0.033
enge+0.032
uttered+0.031
uddled+0.031
ics+0.031
Neg logit contributions
-0.007
One-0.006
 One-0.006
 know-0.006
."-0.005
Token had
Token ID550
Feature activation+1.53e-02

Pos logit contributions
ied+0.031
enge+0.029
uttered+0.029
uddled+0.028
apped+0.027
Neg logit contributions
One-0.022
-0.022
 One-0.022
 know-0.021
."-0.021
Token found
Token ID1043
Feature activation+9.55e-03

Pos logit contributions
ied+0.029
uttered+0.028
enge+0.028
uddled+0.027
apped+0.026
Neg logit contributions
 know-0.018
One-0.018
-0.017
 think-0.017
."-0.017
Token.
Token ID13
Feature activation-1.25e-04

Pos logit contributions
ied+0.024
uttered+0.024
enge+0.023
uddled+0.023
apped+0.022
Neg logit contributions
-0.005
."-0.005
 know-0.004
,"-0.004
One-0.004
Token He
Token ID679
Feature activation+2.52e-03

Pos logit contributions
 One+0.000
+0.000
One+0.000
 know+0.000
."+0.000
Neg logit contributions
ied-0.000
ics-0.000
enge-0.000
uddled-0.000
uttered-0.000
Token called
Token ID1444
Feature activation+1.08e-02

Pos logit contributions
ied+0.005
enge+0.005
uttered+0.005
ics+0.004
uddled+0.004
Neg logit contributions
-0.003
One-0.003
 know-0.003
 think-0.003
 One-0.003
Token out
Token ID503
Feature activation+3.63e-02

Pos logit contributions
uddled+0.018
enge+0.017
awed+0.017
ied+0.017
uttered+0.016
Neg logit contributions
 know-0.016
-0.016
 One-0.016
 think-0.016
."-0.015
Token to
Token ID284
Feature activation-2.74e-03

Pos logit contributions
ied+0.062
enge+0.059
uttered+0.057
uddled+0.056
ics+0.056
Neg logit contributions
-0.048
One-0.048
 One-0.046
 know-0.046
 think-0.044
Token his
Token ID465
Feature activation-1.17e-02

Pos logit contributions
 know+0.003
 think+0.003
+0.003
One+0.003
 One+0.003
Neg logit contributions
ied-0.005
uddled-0.005
uttered-0.005
enge-0.005
ics-0.005
Token friends
Token ID2460
Feature activation-1.24e-02

Pos logit contributions
One+0.015
+0.015
 One+0.014
 know+0.014
 think+0.014
Neg logit contributions
ied-0.021
enge-0.019
uttered-0.019
uddled-0.018
ics-0.018
Token and
Token ID290
Feature activation+8.28e-03

Pos logit contributions
+0.009
One+0.009
 One+0.008
 know+0.008
 think+0.008
Neg logit contributions
ied-0.029
enge-0.027
uttered-0.027
uddled-0.026
ics-0.026
Token showed
Token ID3751
Feature activation+2.24e-02

Pos logit contributions
ied+0.017
uddled+0.017
enge+0.016
uttered+0.016
awed+0.016
Neg logit contributions
 know-0.009
-0.009
 think-0.008
 c-0.008
 see-0.008
Token the
Token ID262
Feature activation+1.08e-02

Pos logit contributions
+0.003
."+0.003
 One+0.003
One+0.003
,"+0.003
Neg logit contributions
enge-0.006
ied-0.005
uttered-0.005
uddled-0.005
ics-0.005
Token muff
Token ID27563
Feature activation+1.66e-02

Pos logit contributions
ied+0.019
uttered+0.017
enge+0.017
uddled+0.017
apped+0.016
Neg logit contributions
-0.015
One-0.015
 One-0.014
 know-0.014
 think-0.014
Tokenins
Token ID1040
Feature activation+1.27e-02

Pos logit contributions
uttered+0.031
enge+0.030
ied+0.030
overed+0.029
ics+0.029
Neg logit contributions
-0.016
One-0.015
."-0.015
 c-0.015
 know-0.014
Token were
Token ID547
Feature activation+1.47e-04

Pos logit contributions
ied+0.022
enge+0.022
uttered+0.021
uddled+0.021
ipt+0.020
Neg logit contributions
-0.017
."-0.015
 c-0.015
 know-0.015
 think-0.014
Token the
Token ID262
Feature activation+1.73e-02

Pos logit contributions
enge+0.000
uttered+0.000
ied+0.000
uddled+0.000
apped+0.000
Neg logit contributions
-0.000
."-0.000
One-0.000
 know-0.000
 c-0.000
Token best
Token ID1266
Feature activation+4.07e-02

Pos logit contributions
ied+0.029
uttered+0.027
uddled+0.027
overed+0.025
enge+0.025
Neg logit contributions
-0.027
One-0.026
."-0.025
 think-0.025
 c-0.024
Token they
Token ID484
Feature activation+2.91e-02

Pos logit contributions
ied+0.084
enge+0.082
uttered+0.081
bled+0.079
uddled+0.078
Neg logit contributions
-0.042
."-0.041
 know-0.038
 c-0.037
 One-0.036
Token ever
Token ID1683
Feature activation-9.16e-03

Pos logit contributions
ied+0.061
uttered+0.061
ipt+0.060
uddled+0.059
enge+0.059
Neg logit contributions
-0.028
 know-0.027
 think-0.027
."-0.026
 want-0.025
Token had
Token ID550
Feature activation+3.31e-03

Pos logit contributions
One+0.011
+0.010
 One+0.010
 know+0.010
 think+0.010
Neg logit contributions
ied-0.017
enge-0.015
uttered-0.015
uddled-0.015
ics-0.015
Token!
Token ID0
Feature activation-1.55e-02

Pos logit contributions
ied+0.008
uttered+0.008
enge+0.008
uddled+0.007
overed+0.007
Neg logit contributions
."-0.002
-0.002
One-0.002
 know-0.002
,"-0.002
Token<|endoftext|>
Token ID50256
Feature activation-1.65e-02

Pos logit contributions
+0.018
One+0.017
 One+0.017
 know+0.017
 think+0.016
Neg logit contributions
ied-0.030
enge-0.028
uttered-0.028
uddled-0.027
ics-0.027
Token here
Token ID994
Feature activation-2.72e-03

Pos logit contributions
ied+0.008
enge+0.007
ics+0.007
uttered+0.007
box+0.007
Neg logit contributions
One-0.011
 One-0.011
-0.011
 know-0.010
orse-0.010
Token".
Token ID1911
Feature activation-1.04e-03

Pos logit contributions
 know+0.003
One+0.002
+0.002
 pine+0.002
 One+0.002
Neg logit contributions
ied-0.005
ics-0.005
uttered-0.005
enge-0.005
box-0.005
Token The
Token ID383
Feature activation-3.89e-04

Pos logit contributions
+0.001
One+0.001
 One+0.001
 know+0.001
 think+0.001
Neg logit contributions
ied-0.002
enge-0.002
uttered-0.002
uddled-0.002
ics-0.002
Token monster
Token ID9234
Feature activation+2.63e-02

Pos logit contributions
+0.001
One+0.001
orse+0.001
 know+0.001
 One+0.001
Neg logit contributions
ied-0.001
enge-0.001
uttered-0.001
 manager-0.001
apped-0.000
Token was
Token ID373
Feature activation+2.81e-03

Pos logit contributions
ied+0.051
enge+0.048
ics+0.047
uttered+0.047
box+0.045
Neg logit contributions
-0.033
 c-0.030
 think-0.030
 know-0.029
One-0.028
Token very
Token ID845
Feature activation+7.52e-03

Pos logit contributions
ied+0.006
enge+0.006
uddled+0.005
uttered+0.005
ics+0.005
Neg logit contributions
-0.003
 know-0.003
One-0.003
 c-0.003
 think-0.003
Token surprised
Token ID6655
Feature activation+1.04e-02

Pos logit contributions
ied+0.014
enge+0.013
uttered+0.013
uddled+0.013
ics+0.013
Neg logit contributions
-0.009
 know-0.009
One-0.009
 think-0.008
 One-0.008
Token and
Token ID290
Feature activation+5.84e-03

Pos logit contributions
ied+0.023
enge+0.022
uttered+0.022
uddled+0.021
ics+0.021
Neg logit contributions
One-0.008
-0.008
 know-0.008
 One-0.008
 think-0.007
Token she
Token ID673
Feature activation+7.88e-03

Pos logit contributions
ied+0.012
enge+0.012
uttered+0.012
uddled+0.011
box+0.011
Neg logit contributions
-0.006
 c-0.006
 know-0.006
 think-0.006
 One-0.005
Token said
Token ID531
Feature activation+1.23e-02

Pos logit contributions
ied+0.016
enge+0.014
uttered+0.014
ics+0.014
box+0.014
Neg logit contributions
-0.010
 know-0.009
 c-0.009
 think-0.008
One-0.008
Token,
Token ID11
Feature activation+8.91e-03

Pos logit contributions
ied+0.027
enge+0.026
uttered+0.025
uddled+0.025
ics+0.025
Neg logit contributions
-0.010
One-0.010
 One-0.010
 know-0.009
."-0.009
Token Sam
Token ID3409
Feature activation-5.04e-03

Pos logit contributions
ied+0.008
uddled+0.008
uttered+0.008
overed+0.007
enge+0.007
Neg logit contributions
-0.005
 know-0.005
One-0.005
 One-0.005
 think-0.005
Token.
Token ID13
Feature activation-9.95e-03

Pos logit contributions
One+0.004
+0.004
 know+0.003
 One+0.003
 pine+0.003
Neg logit contributions
ied-0.011
uttered-0.010
enge-0.010
ics-0.010
apped-0.010
Token
Token ID198
Feature activation+1.68e-03

Pos logit contributions
+0.010
One+0.010
 know+0.009
 think+0.009
 c+0.009
Neg logit contributions
ied-0.020
uttered-0.019
enge-0.019
apped-0.018
ics-0.018
Token
Token ID198
Feature activation-1.97e-02

Pos logit contributions
ied+0.004
uttered+0.003
enge+0.003
ics+0.003
apped+0.003
Neg logit contributions
-0.001
 c-0.001
 know-0.001
 think-0.001
ppy-0.001
Token"
Token ID1
Feature activation+6.63e-04

Pos logit contributions
 know+0.007
+0.006
ppy+0.006
 c+0.006
 think+0.006
Neg logit contributions
ied-0.050
enge-0.049
uttered-0.047
ics-0.047
apped-0.046
TokenLet
Token ID5756
Feature activation+8.02e-03

Pos logit contributions
ied+0.001
enge+0.001
uttered+0.001
apped+0.001
ics+0.001
Neg logit contributions
 know-0.001
-0.001
 think-0.001
."-0.001
 One-0.001
Token's
Token ID338
Feature activation-5.42e-03

Pos logit contributions
ied+0.016
enge+0.015
uttered+0.015
uddled+0.015
ics+0.015
Neg logit contributions
-0.008
One-0.008
 know-0.008
 One-0.008
 think-0.007
Token go
Token ID467
Feature activation+1.35e-02

Pos logit contributions
One+0.007
ppy+0.006
Bee+0.006
emy+0.006
+0.006
Neg logit contributions
ics-0.008
enge-0.008
ied-0.007
box-0.007
apped-0.007
Token explore
Token ID7301
Feature activation-2.09e-03

Pos logit contributions
ied+0.017
ics+0.016
enge+0.016
uttered+0.015
apped+0.015
Neg logit contributions
One-0.021
-0.021
 One-0.020
 tul-0.020
 disl-0.020
Token the
Token ID262
Feature activation+7.57e-03

Pos logit contributions
 know+0.002
One+0.002
 One+0.002
+0.002
ppy+0.002
Neg logit contributions
ied-0.004
enge-0.003
ics-0.003
uddled-0.003
apped-0.003
Token ship
Token ID4074
Feature activation+6.12e-03

Pos logit contributions
ied+0.012
enge+0.011
box+0.011
ics+0.011
uttered+0.011
Neg logit contributions
-0.010
One-0.010
 know-0.010
 One-0.010
 think-0.009
Token heard
Token ID2982
Feature activation-1.77e-02

Pos logit contributions
ied+0.011
enge+0.010
uttered+0.010
uddled+0.010
ics+0.010
Neg logit contributions
-0.007
One-0.007
 know-0.006
 One-0.006
 think-0.006
Token about
Token ID546
Feature activation+1.69e-02

Pos logit contributions
+0.020
 know+0.019
 One+0.019
One+0.019
 c+0.018
Neg logit contributions
ied-0.034
enge-0.034
uttered-0.033
uddled-0.032
ics-0.031
Token the
Token ID262
Feature activation-4.39e-03

Pos logit contributions
ied+0.031
enge+0.030
uttered+0.029
uddled+0.029
apped+0.028
Neg logit contributions
-0.020
One-0.020
 know-0.019
 One-0.019
 think-0.019
Token dream
Token ID4320
Feature activation-4.17e-03

Pos logit contributions
+0.005
One+0.005
 know+0.005
 One+0.005
."+0.005
Neg logit contributions
ied-0.008
enge-0.007
uttered-0.007
box-0.007
ics-0.007
Token and
Token ID290
Feature activation-1.25e-03

Pos logit contributions
+0.004
One+0.004
 know+0.003
 One+0.003
."+0.003
Neg logit contributions
ied-0.009
enge-0.009
uttered-0.009
uddled-0.008
apped-0.008
Token said
Token ID531
Feature activation+7.75e-03

Pos logit contributions
+0.002
 know+0.001
One+0.001
 think+0.001
 One+0.001
Neg logit contributions
ied-0.002
enge-0.002
uttered-0.002
uddled-0.002
ics-0.002
Token to
Token ID284
Feature activation-5.75e-03

Pos logit contributions
ied+0.018
enge+0.017
uttered+0.017
uddled+0.017
ics+0.017
Neg logit contributions
-0.005
One-0.005
 know-0.005
 One-0.005
 think-0.004
Token his
Token ID465
Feature activation-1.30e-02

Pos logit contributions
One+0.008
+0.008
 One+0.007
 know+0.007
."+0.007
Neg logit contributions
ied-0.009
enge-0.009
uttered-0.009
ics-0.008
apped-0.008
Token daughter
Token ID4957
Feature activation+7.59e-03

Pos logit contributions
One+0.018
+0.017
 One+0.017
 know+0.016
."+0.015
Neg logit contributions
ied-0.021
enge-0.020
uttered-0.019
box-0.019
apped-0.019
Token,
Token ID11
Feature activation+5.61e-03

Pos logit contributions
ied+0.017
uttered+0.016
enge+0.016
ics+0.016
apped+0.016
Neg logit contributions
One-0.006
-0.005
 know-0.005
 One-0.005
 think-0.005
Token "
Token ID366
Feature activation-5.25e-03

Pos logit contributions
ied+0.010
enge+0.009
uttered+0.009
uddled+0.009
ics+0.008
Neg logit contributions
-0.007
One-0.007
 One-0.007
 know-0.007
 think-0.007
Token
Token ID198
Feature activation-8.11e-03

Pos logit contributions
ied+0.009
uttered+0.008
uddled+0.008
ics+0.008
apped+0.008
Neg logit contributions
ppy-0.005
-0.005
use-0.005
 think-0.005
 want-0.005
Token"
Token ID1
Feature activation-4.03e-03

Pos logit contributions
 know+0.010
 One+0.009
 think+0.009
."+0.009
 c+0.009
Neg logit contributions
enge-0.013
ied-0.013
owner-0.013
ipt-0.012
uddled-0.012
TokenSorry
Token ID14385
Feature activation-1.74e-02

Pos logit contributions
One+0.005
+0.005
 think+0.004
 One+0.004
 c+0.004
Neg logit contributions
ied-0.008
ics-0.008
uttered-0.008
box-0.007
enge-0.007
Token,
Token ID11
Feature activation-1.30e-03

Pos logit contributions
One+0.017
orse+0.015
Bo+0.015
 One+0.015
 know+0.014
Neg logit contributions
ied-0.033
box-0.031
enge-0.031
ics-0.030
ings-0.030
Token mom
Token ID1995
Feature activation-1.37e-02

Pos logit contributions
One+0.002
+0.002
 One+0.002
 know+0.001
 think+0.001
Neg logit contributions
ied-0.002
enge-0.002
uttered-0.002
uddled-0.002
ics-0.002
Token,"
Token ID553
Feature activation+6.18e-03

Pos logit contributions
One+0.010
 One+0.008
Bo+0.008
+0.007
 conclusions+0.007
Neg logit contributions
ied-0.032
ics-0.030
box-0.029
enge-0.029
apped-0.028
Token they
Token ID484
Feature activation+2.21e-03

Pos logit contributions
ied+0.012
uttered+0.011
enge+0.011
uddled+0.011
ics+0.010
Neg logit contributions
-0.008
One-0.008
 One-0.007
 know-0.007
 think-0.007
Token said
Token ID531
Feature activation+1.72e-02

Pos logit contributions
ied+0.004
enge+0.004
uttered+0.004
uddled+0.004
ics+0.004
Neg logit contributions
-0.002
 know-0.002
One-0.002
 One-0.002
 think-0.002
Token.
Token ID13
Feature activation+3.56e-03

Pos logit contributions
ied+0.049
enge+0.049
uttered+0.047
ics+0.047
uddled+0.046
Neg logit contributions
-0.003
."-0.003
,"-0.002
 One-0.001
 know-0.001
Token "
Token ID366
Feature activation-3.71e-03

Pos logit contributions
ied+0.007
enge+0.006
uttered+0.006
ics+0.006
uddled+0.006
Neg logit contributions
One-0.004
-0.004
 One-0.004
 know-0.004
."-0.004
TokenWe
Token ID1135
Feature activation-9.93e-03

Pos logit contributions
One+0.004
+0.004
 know+0.004
 One+0.004
 think+0.004
Neg logit contributions
ied-0.007
uttered-0.006
enge-0.006
uddled-0.006
ics-0.006
Token.
Token ID13
Feature activation+7.74e-03

Pos logit contributions
ied+0.028
uttered+0.026
enge+0.026
uddled+0.025
ics+0.025
Neg logit contributions
-0.020
One-0.020
 know-0.019
 One-0.018
 think-0.018
Token Lily
Token ID20037
Feature activation+1.48e-02

Pos logit contributions
ied+0.015
enge+0.014
uttered+0.014
uddled+0.013
apped+0.013
Neg logit contributions
One-0.008
-0.008
 know-0.008
 One-0.008
 think-0.008
Token noticed
Token ID6810
Feature activation+9.38e-03

Pos logit contributions
ied+0.030
enge+0.028
uttered+0.028
ics+0.027
overed+0.027
Neg logit contributions
-0.017
 know-0.015
One-0.015
 One-0.015
 c-0.015
Token her
Token ID607
Feature activation+8.14e-03

Pos logit contributions
ied+0.020
enge+0.020
uttered+0.020
ics+0.019
apped+0.019
Neg logit contributions
-0.008
One-0.008
 One-0.007
 c-0.007
 know-0.007
Token friend
Token ID1545
Feature activation+1.01e-02

Pos logit contributions
ied+0.015
enge+0.014
uttered+0.014
ics+0.013
uddled+0.013
Neg logit contributions
-0.010
 know-0.010
One-0.009
 think-0.009
 One-0.009
Token's
Token ID338
Feature activation+1.34e-02

Pos logit contributions
ied+0.023
enge+0.022
uttered+0.022
uddled+0.021
ics+0.021
Neg logit contributions
-0.010
 One-0.009
 know-0.008
 c-0.008
One-0.008
Token feelings
Token ID7666
Feature activation-1.12e-02

Pos logit contributions
ied+0.023
enge+0.022
uttered+0.021
uddled+0.021
ics+0.020
Neg logit contributions
-0.017
One-0.017
 know-0.017
 One-0.017
 think-0.016
Token and
Token ID290
Feature activation+2.14e-02

Pos logit contributions
+0.009
One+0.008
 know+0.008
 One+0.008
."+0.008
Neg logit contributions
ied-0.025
enge-0.025
uttered-0.024
ics-0.023
uddled-0.023
Token shared
Token ID4888
Feature activation+2.66e-02

Pos logit contributions
ied+0.047
enge+0.046
uttered+0.046
ics+0.044
overed+0.044
Neg logit contributions
 know-0.019
 c-0.017
 think-0.017
-0.017
 One-0.017
Token her
Token ID607
Feature activation+2.49e-02

Pos logit contributions
ied+0.049
uttered+0.048
uddled+0.047
enge+0.047
ics+0.045
Neg logit contributions
-0.033
One-0.033
 know-0.032
."-0.032
 One-0.032
Token toys
Token ID14958
Feature activation+1.96e-02

Pos logit contributions
ied+0.032
enge+0.029
uttered+0.028
uddled+0.028
ics+0.027
Neg logit contributions
One-0.043
-0.043
 One-0.041
 know-0.040
 think-0.039
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
ied+0.021
uttered+0.020
enge+0.020
uddled+0.020
apped+0.019
Neg logit contributions
-0.006
One-0.006
."-0.005
 One-0.005
 know-0.005
Token
Token ID198
Feature activation+2.21e-03

Pos logit contributions
+0.014
 c+0.014
r+0.014
use+0.013
ulent+0.013
Neg logit contributions
apped-0.018
ied-0.017
uttered-0.016
ipt-0.016
enge-0.016
Token
Token ID198
Feature activation-1.93e-02

Pos logit contributions
ied+0.005
uttered+0.004
enge+0.004
ics+0.004
uddled+0.004
Neg logit contributions
-0.002
 c-0.002
 think-0.002
ppy-0.002
 know-0.002
TokenOne
Token ID3198
Feature activation-1.27e-02

Pos logit contributions
 know+0.013
 c+0.011
 think+0.010
,"+0.010
 disl+0.010
Neg logit contributions
ied-0.045
enge-0.041
box-0.039
apped-0.039
uddled-0.038
Token day
Token ID1110
Feature activation-1.40e-02

Pos logit contributions
+0.019
One+0.018
 know+0.018
 One+0.017
 think+0.017
Neg logit contributions
ied-0.020
enge-0.017
uttered-0.017
apped-0.017
ics-0.016
Token,
Token ID11
Feature activation-1.07e-02

Pos logit contributions
One+0.011
 know+0.011
+0.011
 think+0.010
 One+0.010
Neg logit contributions
ied-0.031
apped-0.028
uttered-0.028
enge-0.028
uddled-0.028
Token Ben
Token ID3932
Feature activation-1.20e-02

Pos logit contributions
 One+0.009
One+0.009
+0.009
."+0.009
 think+0.008
Neg logit contributions
ied-0.024
uttered-0.023
enge-0.023
ics-0.023
uddled-0.022
Token sees
Token ID7224
Feature activation-1.53e-02

Pos logit contributions
One+0.014
+0.013
 One+0.013
 know+0.013
 think+0.012
Neg logit contributions
ied-0.022
enge-0.021
uttered-0.020
uddled-0.020
ics-0.020
Token a
Token ID257
Feature activation+3.63e-03

Pos logit contributions
One+0.019
+0.019
 One+0.018
 know+0.018
 think+0.017
Neg logit contributions
ied-0.028
enge-0.026
uttered-0.026
uddled-0.025
ics-0.025
Token new
Token ID649
Feature activation+2.31e-02

Pos logit contributions
ied+0.007
uttered+0.007
enge+0.007
ics+0.007
apped+0.007
Neg logit contributions
One-0.004
-0.004
 c-0.004
 pine-0.004
 know-0.004
Token book
Token ID1492
Feature activation-1.70e-03

Pos logit contributions
ied+0.032
uttered+0.029
enge+0.029
uddled+0.028
ics+0.027
Neg logit contributions
One-0.039
-0.038
 One-0.037
 know-0.037
 think-0.036
Token take
Token ID1011
Feature activation-2.44e-04

Pos logit contributions
ied+0.008
ics+0.008
ings+0.008
enge+0.008
 sale+0.007
Neg logit contributions
orse-0.007
Bee-0.007
ulent-0.007
ppy-0.006
orses-0.006
Token the
Token ID262
Feature activation+3.47e-05

Pos logit contributions
 know+0.000
+0.000
One+0.000
 One+0.000
."+0.000
Neg logit contributions
ied-0.000
uddled-0.000
uttered-0.000
enge-0.000
ics-0.000
Token toy
Token ID13373
Feature activation+6.34e-03

Pos logit contributions
ied+0.000
uttered+0.000
enge+0.000
uddled+0.000
apped+0.000
Neg logit contributions
-0.000
One-0.000
."-0.000
 think-0.000
 c-0.000
Token.
Token ID13
Feature activation-2.15e-03

Pos logit contributions
ied+0.016
uttered+0.015
uddled+0.015
enge+0.015
outed+0.015
Neg logit contributions
."-0.004
-0.004
.-0.004
,"-0.004
 c-0.003
Token
Token ID198
Feature activation+1.88e-02

Pos logit contributions
One+0.002
+0.002
 know+0.002
 think+0.001
 One+0.001
Neg logit contributions
ied-0.005
uttered-0.005
enge-0.005
uddled-0.004
apped-0.004
Token
Token ID198
Feature activation+1.94e-03

Pos logit contributions
ied+0.039
uttered+0.036
ics+0.035
uddled+0.035
enge+0.035
Neg logit contributions
-0.016
One-0.016
 pine-0.015
 think-0.015
 c-0.015
TokenL
Token ID43
Feature activation+1.09e-02

Pos logit contributions
ied+0.004
uddled+0.003
enge+0.003
uttered+0.003
ics+0.003
Neg logit contributions
 know-0.002
 One-0.002
 pine-0.002
."-0.002
 think-0.002
Tokenily
Token ID813
Feature activation-3.30e-03

Pos logit contributions
ied+0.024
uttered+0.023
enge+0.023
ics+0.022
uddled+0.022
Neg logit contributions
One-0.009
-0.009
 know-0.008
."-0.008
 One-0.008
Token's
Token ID338
Feature activation-4.78e-04

Pos logit contributions
+0.004
One+0.004
 One+0.004
 know+0.004
 think+0.003
Neg logit contributions
ied-0.006
enge-0.006
uttered-0.006
ics-0.006
uddled-0.006
Token mother
Token ID2802
Feature activation+1.47e-02

Pos logit contributions
+0.001
One+0.001
."+0.001
 One+0.001
 know+0.001
Neg logit contributions
ied-0.001
uttered-0.001
box-0.001
enge-0.001
uddled-0.001
Token told
Token ID1297
Feature activation+6.49e-03

Pos logit contributions
enge+0.029
uttered+0.028
ied+0.028
ics+0.028
overed+0.027
Neg logit contributions
-0.017
 c-0.015
 know-0.015
 think-0.014
One-0.014
Token being
Token ID852
Feature activation+9.18e-03

Pos logit contributions
uttered+0.008
enge+0.008
uddled+0.008
ied+0.008
umbled+0.007
Neg logit contributions
 c-0.004
 know-0.004
 think-0.004
-0.004
 help-0.004
Token so
Token ID523
Feature activation+6.02e-03

Pos logit contributions
ied+0.018
uddled+0.017
uttered+0.017
overed+0.016
enge+0.016
Neg logit contributions
-0.011
 know-0.010
."-0.010
One-0.010
 think-0.010
Token patient
Token ID5827
Feature activation+1.38e-04

Pos logit contributions
ied+0.013
enge+0.012
uttered+0.012
uddled+0.012
ics+0.012
Neg logit contributions
-0.005
One-0.005
 know-0.005
 One-0.005
 think-0.005
Token.
Token ID13
Feature activation-6.43e-03

Pos logit contributions
ied+0.000
enge+0.000
uttered+0.000
uddled+0.000
ics+0.000
Neg logit contributions
-0.000
One-0.000
."-0.000
 One-0.000
 know-0.000
Token
Token ID198
Feature activation+1.04e-02

Pos logit contributions
One+0.006
+0.006
 know+0.006
 think+0.005
 c+0.005
Neg logit contributions
ied-0.013
uttered-0.013
enge-0.012
apped-0.012
uddled-0.012
Token
Token ID198
Feature activation-2.15e-03

Pos logit contributions
ied+0.021
ics+0.020
uttered+0.019
uddled+0.019
enge+0.019
Neg logit contributions
 c-0.009
One-0.009
 think-0.009
 One-0.009
use-0.009
TokenThe
Token ID464
Feature activation-5.20e-03

Pos logit contributions
 One+0.002
 know+0.002
."+0.002
 c+0.002
 think+0.002
Neg logit contributions
ied-0.004
enge-0.004
ics-0.004
uttered-0.004
uddled-0.004
Token bear
Token ID6842
Feature activation+1.16e-02

Pos logit contributions
+0.006
 know+0.006
One+0.006
."+0.006
 One+0.006
Neg logit contributions
ied-0.009
enge-0.009
uttered-0.008
box-0.008
 manager-0.008
Token was
Token ID373
Feature activation-1.28e-04

Pos logit contributions
ied+0.021
enge+0.019
uttered+0.019
ics+0.018
uddled+0.018
Neg logit contributions
-0.015
One-0.014
 know-0.014
 One-0.014
 think-0.014
Token happy
Token ID3772
Feature activation+9.56e-03

Pos logit contributions
+0.000
 know+0.000
One+0.000
 think+0.000
."+0.000
Neg logit contributions
ied-0.000
enge-0.000
uttered-0.000
uddled-0.000
ics-0.000
Token.
Token ID13
Feature activation-7.46e-03

Pos logit contributions
ied+0.019
enge+0.017
uttered+0.017
uddled+0.017
ics+0.017
Neg logit contributions
-0.011
One-0.010
 know-0.010
 One-0.010
 think-0.010
Token collecting
Token ID13157
Feature activation-6.64e-03

Pos logit contributions
One+0.002
+0.002
 know+0.002
 One+0.002
 think+0.001
Neg logit contributions
ied-0.002
uttered-0.002
enge-0.002
uddled-0.002
ics-0.002
Token.
Token ID13
Feature activation-3.67e-03

Pos logit contributions
+0.008
One+0.008
 know+0.008
 One+0.008
 think+0.008
Neg logit contributions
ied-0.012
enge-0.011
uttered-0.011
uddled-0.011
ics-0.010
Token Every
Token ID3887
Feature activation+1.71e-03

Pos logit contributions
One+0.003
 know+0.003
 think+0.003
 c+0.003
+0.003
Neg logit contributions
ied-0.007
uttered-0.007
enge-0.007
apped-0.007
uddled-0.006
Token morning
Token ID3329
Feature activation-4.73e-03

Pos logit contributions
ied+0.003
enge+0.003
uddled+0.003
uttered+0.003
ics+0.003
Neg logit contributions
-0.002
One-0.002
 One-0.002
 know-0.002
 think-0.002
Token she
Token ID673
Feature activation-1.08e-02

Pos logit contributions
+0.005
One+0.005
 know+0.004
 One+0.004
 think+0.004
Neg logit contributions
ied-0.010
enge-0.009
uttered-0.009
uddled-0.009
ics-0.009
Token would
Token ID561
Feature activation-7.28e-03

Pos logit contributions
One+0.017
 One+0.016
 know+0.016
+0.016
."+0.016
Neg logit contributions
ied-0.015
uttered-0.013
enge-0.013
uddled-0.013
ics-0.012
Token walk
Token ID2513
Feature activation-1.89e-02

Pos logit contributions
One+0.008
emy+0.007
orse+0.007
 One+0.007
."+0.007
Neg logit contributions
ied-0.012
ics-0.012
uttered-0.012
box-0.011
enge-0.011
Token to
Token ID284
Feature activation-1.28e-02

Pos logit contributions
+0.022
One+0.022
 One+0.021
 c+0.021
."+0.020
Neg logit contributions
ied-0.035
enge-0.034
uttered-0.033
uddled-0.033
ics-0.032
Token the
Token ID262
Feature activation-6.51e-03

Pos logit contributions
One+0.018
+0.018
ppy+0.017
orse+0.017
 One+0.016
Neg logit contributions
ied-0.017
ics-0.017
enge-0.016
uttered-0.016
box-0.015
Token park
Token ID3952
Feature activation-4.37e-03

Pos logit contributions
 know+0.008
+0.008
One+0.008
 One+0.008
 think+0.008
Neg logit contributions
ied-0.011
enge-0.011
uttered-0.010
ics-0.010
uddled-0.010
Token and
Token ID290
Feature activation-7.92e-03

Pos logit contributions
+0.003
One+0.003
 One+0.003
."+0.003
 think+0.003
Neg logit contributions
ied-0.010
enge-0.010
uttered-0.009
uddled-0.009
apped-0.009
Token They
Token ID1119
Feature activation-1.20e-02

Pos logit contributions
One+0.016
+0.016
 know+0.015
 One+0.015
 think+0.015
Neg logit contributions
ied-0.025
uttered-0.024
enge-0.023
uddled-0.023
apped-0.022
Token both
Token ID1111
Feature activation-1.53e-03

Pos logit contributions
+0.017
One+0.016
 know+0.016
 One+0.016
 think+0.015
Neg logit contributions
ied-0.020
enge-0.018
uttered-0.018
uddled-0.018
ics-0.017
Token enjoyed
Token ID8359
Feature activation-1.04e-02

Pos logit contributions
One+0.002
+0.002
 One+0.002
 know+0.002
 think+0.002
Neg logit contributions
ied-0.003
uttered-0.002
enge-0.002
uddled-0.002
apped-0.002
Token it
Token ID340
Feature activation+1.35e-02

Pos logit contributions
+0.014
One+0.014
 know+0.014
 One+0.014
 think+0.013
Neg logit contributions
ied-0.017
enge-0.016
uttered-0.016
uddled-0.015
ics-0.015
Token.
Token ID13
Feature activation-1.45e-02

Pos logit contributions
ied+0.030
enge+0.029
uttered+0.028
uddled+0.027
ipt+0.027
Neg logit contributions
-0.013
."-0.012
One-0.011
 One-0.011
 think-0.011
Token They
Token ID1119
Feature activation-1.14e-02

Pos logit contributions
One+0.018
 know+0.017
 pine+0.017
+0.017
 c+0.017
Neg logit contributions
ied-0.025
uttered-0.024
enge-0.023
apped-0.023
uddled-0.022
Token finished
Token ID5201
Feature activation+1.50e-03

Pos logit contributions
+0.016
One+0.016
 know+0.015
 One+0.015
 think+0.015
Neg logit contributions
ied-0.019
enge-0.017
uttered-0.017
uddled-0.016
ics-0.016
Token all
Token ID477
Feature activation+4.67e-03

Pos logit contributions
ied+0.002
enge+0.002
uttered+0.002
uddled+0.002
ics+0.002
Neg logit contributions
-0.002
One-0.002
 know-0.002
 One-0.002
 think-0.002
Token the
Token ID262
Feature activation-4.55e-03

Pos logit contributions
enge+0.009
ied+0.009
uttered+0.008
uddled+0.008
box+0.008
Neg logit contributions
-0.006
 know-0.005
 think-0.005
One-0.005
."-0.005
Token jelly
Token ID27644
Feature activation+3.79e-03

Pos logit contributions
+0.006
One+0.006
 know+0.006
 think+0.006
 One+0.006
Neg logit contributions
ied-0.008
uttered-0.008
enge-0.008
uddled-0.007
apped-0.007
Token but
Token ID475
Feature activation-8.19e-03

Pos logit contributions
ied+0.007
enge+0.007
uttered+0.007
apped+0.006
uddled+0.006
Neg logit contributions
-0.004
."-0.004
 c-0.004
 think-0.004
 know-0.004
Token gears
Token ID28713
Feature activation-8.96e-03

Pos logit contributions
+0.025
One+0.025
 know+0.024
 think+0.024
 c+0.023
Neg logit contributions
ied-0.020
box-0.018
enge-0.017
uttered-0.017
uddled-0.017
Token for
Token ID329
Feature activation-1.24e-03

Pos logit contributions
+0.007
."+0.006
,"+0.006
One+0.006
 One+0.006
Neg logit contributions
enge-0.020
ied-0.020
uttered-0.020
uddled-0.019
ics-0.019
Token the
Token ID262
Feature activation+5.00e-03

Pos logit contributions
One+0.002
+0.002
 One+0.002
 know+0.002
 think+0.001
Neg logit contributions
ied-0.002
enge-0.002
uttered-0.002
uddled-0.002
ics-0.002
Token eyes
Token ID2951
Feature activation-2.75e-02

Pos logit contributions
ied+0.008
enge+0.007
uttered+0.007
box+0.007
uddled+0.007
Neg logit contributions
One-0.007
-0.007
 know-0.007
 One-0.007
 think-0.007
Token,
Token ID11
Feature activation-1.24e-02

Pos logit contributions
+0.020
One+0.019
 One+0.018
 know+0.018
."+0.017
Neg logit contributions
ied-0.063
enge-0.061
uttered-0.060
uddled-0.059
ics-0.058
Token nose
Token ID9686
Feature activation-2.79e-02

Pos logit contributions
One+0.015
+0.015
 know+0.014
 One+0.014
."+0.013
Neg logit contributions
ied-0.022
enge-0.021
uttered-0.021
uddled-0.020
apped-0.020
Token,
Token ID11
Feature activation-1.27e-02

Pos logit contributions
+0.014
One+0.012
 One+0.012
 know+0.012
."+0.011
Neg logit contributions
ied-0.071
enge-0.068
uttered-0.067
uddled-0.065
ics-0.065
Token and
Token ID290
Feature activation-8.27e-05

Pos logit contributions
+0.014
 know+0.014
One+0.014
."+0.013
 think+0.013
Neg logit contributions
ied-0.023
uttered-0.021
box-0.021
enge-0.021
apped-0.020
Token mouth
Token ID5422
Feature activation-2.72e-02

Pos logit contributions
One+0.000
+0.000
."+0.000
 know+0.000
 One+0.000
Neg logit contributions
ied-0.000
box-0.000
uttered-0.000
apped-0.000
enge-0.000
Token.
Token ID13
Feature activation-3.28e-03

Pos logit contributions
+0.016
One+0.016
 know+0.014
 One+0.014
 think+0.013
Neg logit contributions
ied-0.066
enge-0.063
uttered-0.063
uddled-0.061
ics-0.061
Token They
Token ID1119
Feature activation-8.68e-03

Pos logit contributions
+0.004
One+0.004
 know+0.004
 c+0.003
 think+0.003
Neg logit contributions
ied-0.006
uttered-0.006
enge-0.006
apped-0.006
uddled-0.005
Token special
Token ID2041
Feature activation+7.85e-03

Pos logit contributions
ied+0.019
uttered+0.018
enge+0.018
uddled+0.017
ics+0.017
Neg logit contributions
-0.013
One-0.013
 know-0.012
 One-0.012
 think-0.012
Token rock
Token ID3881
Feature activation-7.27e-03

Pos logit contributions
ied+0.013
enge+0.012
uttered+0.012
box+0.012
uddled+0.012
Neg logit contributions
One-0.010
-0.010
 One-0.010
 know-0.009
ppy-0.009
Token called
Token ID1444
Feature activation-1.80e-03

Pos logit contributions
+0.004
One+0.004
."+0.003
 One+0.003
 know+0.003
Neg logit contributions
ied-0.018
uttered-0.018
enge-0.017
uddled-0.017
ics-0.017
Token "
Token ID366
Feature activation+5.80e-04

Pos logit contributions
One+0.002
 know+0.002
+0.002
 One+0.002
 think+0.002
Neg logit contributions
ied-0.003
uttered-0.003
enge-0.003
apped-0.003
box-0.003
Tokenflex
Token ID32880
Feature activation-1.23e-02

Pos logit contributions
ied+0.001
uttered+0.001
enge+0.001
uddled+0.001
ics+0.001
Neg logit contributions
One-0.001
-0.001
 One-0.001
 know-0.001
."-0.001
Tokenible
Token ID856
Feature activation-2.60e-02

Pos logit contributions
 know+0.021
One+0.021
 think+0.020
 One+0.020
+0.020
Neg logit contributions
ied-0.016
enge-0.015
uttered-0.013
box-0.013
awed-0.013
Token".
Token ID1911
Feature activation+5.40e-04

Pos logit contributions
One+0.020
 know+0.018
 One+0.018
 think+0.017
 c+0.017
Neg logit contributions
ied-0.059
enge-0.054
box-0.053
ics-0.052
uttered-0.052
Token Tim
Token ID5045
Feature activation+2.79e-04

Pos logit contributions
ied+0.001
enge+0.001
ics+0.001
uddled+0.001
uttered+0.001
Neg logit contributions
 One-0.001
-0.001
One-0.000
 know-0.000
."-0.000
Token didn
Token ID1422
Feature activation+5.77e-03

Pos logit contributions
ied+0.000
enge+0.000
uttered+0.000
uddled+0.000
apped+0.000
Neg logit contributions
One-0.000
-0.000
 One-0.000
 know-0.000
."-0.000
Token't
Token ID470
Feature activation-2.72e-03

Pos logit contributions
ied+0.009
enge+0.009
uttered+0.009
uddled+0.008
ics+0.008
Neg logit contributions
One-0.008
-0.008
 know-0.008
 One-0.008
 think-0.007
Token know
Token ID760
Feature activation-1.05e-02

Pos logit contributions
 pine+0.002
 tul+0.002
One+0.002
+0.002
ulent+0.002
Neg logit contributions
ied-0.005
enge-0.005
 manager-0.005
apped-0.005
 ensued-0.004
Token was
Token ID373
Feature activation-1.21e-03

Pos logit contributions
ied+0.017
cards+0.017
enge+0.017
uddled+0.016
uttered+0.016
Neg logit contributions
-0.011
 One-0.010
One-0.010
 know-0.009
 see-0.009
Token a
Token ID257
Feature activation-9.03e-03

Pos logit contributions
+0.001
One+0.001
 One+0.001
 know+0.001
 think+0.001
Neg logit contributions
ied-0.002
enge-0.002
uttered-0.002
uddled-0.002
ics-0.002
Token little
Token ID1310
Feature activation+5.02e-03

Pos logit contributions
+0.013
 know+0.012
One+0.012
 One+0.012
r+0.012
Neg logit contributions
enge-0.013
ied-0.012
uddled-0.012
 manager-0.012
apped-0.012
Token bird
Token ID6512
Feature activation-2.08e-02

Pos logit contributions
enge+0.006
ied+0.006
apped+0.005
uttered+0.005
uddled+0.005
Neg logit contributions
One-0.009
-0.009
 know-0.009
 One-0.008
 think-0.008
Token named
Token ID3706
Feature activation-3.08e-03

Pos logit contributions
 Deal+0.027
+0.026
 fiber+0.024
 know+0.024
use+0.023
Neg logit contributions
ied-0.037
apped-0.033
ics-0.033
uttered-0.031
uddled-0.031
Token Tweet
Token ID18752
Feature activation-3.83e-02

Pos logit contributions
+0.005
One+0.005
 know+0.005
 think+0.005
 want+0.004
Neg logit contributions
ied-0.004
uttered-0.004
ics-0.004
enge-0.004
apped-0.003
Token.
Token ID13
Feature activation-3.66e-03

Pos logit contributions
+0.064
 Deal+0.061
 know+0.056
 think+0.055
use+0.055
Neg logit contributions
ied-0.055
ics-0.047
uddled-0.042
man-0.042
 lasted-0.040
Token Tweet
Token ID18752
Feature activation-1.06e-02

Pos logit contributions
+0.007
 think+0.006
orse+0.006
 know+0.006
use+0.006
Neg logit contributions
uttered-0.004
apped-0.004
enge-0.003
ied-0.003
ics-0.003
Token loved
Token ID6151
Feature activation-6.04e-04

Pos logit contributions
 Deal+0.020
+0.020
ulent+0.019
 One+0.019
One+0.019
Neg logit contributions
ied-0.012
ics-0.010
uddled-0.009
uttered-0.009
 lasted-0.009
Token to
Token ID284
Feature activation+5.21e-03

Pos logit contributions
One+0.001
+0.001
 know+0.001
 One+0.001
 think+0.001
Neg logit contributions
ied-0.001
enge-0.001
uttered-0.001
uddled-0.001
apped-0.001
Token sing
Token ID1702
Feature activation-2.74e-02

Pos logit contributions
ics+0.008
uttered+0.007
ied+0.006
enge+0.006
apped+0.006
Neg logit contributions
-0.008
One-0.007
ppy-0.007
 Deal-0.007
use-0.007
Token John
Token ID1757
Feature activation-2.33e-02

Pos logit contributions
One+0.011
+0.011
 know+0.011
 think+0.010
 One+0.010
Neg logit contributions
ied-0.010
enge-0.009
uttered-0.009
ics-0.008
apped-0.008
Token who
Token ID508
Feature activation-2.46e-03

Pos logit contributions
One+0.018
+0.018
 conclusions+0.017
 know+0.017
 One+0.016
Neg logit contributions
ied-0.050
box-0.044
ics-0.044
apped-0.043
enge-0.043
Token lived
Token ID5615
Feature activation-1.07e-02

Pos logit contributions
+0.003
One+0.003
 know+0.003
 One+0.003
 think+0.002
Neg logit contributions
ied-0.005
uttered-0.004
enge-0.004
uddled-0.004
ics-0.004
Token in
Token ID287
Feature activation-1.69e-02

Pos logit contributions
One+0.015
+0.014
."+0.014
 One+0.014
 know+0.014
Neg logit contributions
ied-0.018
enge-0.017
uttered-0.016
uddled-0.016
ics-0.015
Token a
Token ID257
Feature activation-1.97e-02

Pos logit contributions
+0.021
One+0.021
 know+0.020
 One+0.020
 think+0.020
Neg logit contributions
ied-0.029
enge-0.027
uttered-0.027
ics-0.026
uddled-0.026
Token small
Token ID1402
Feature activation-3.53e-02

Pos logit contributions
 know+0.025
+0.024
 think+0.024
ppy+0.023
One+0.023
Neg logit contributions
ied-0.031
enge-0.029
box-0.028
uttered-0.028
ics-0.028
Token town
Token ID3240
Feature activation-1.84e-02

Pos logit contributions
 know+0.049
+0.049
One+0.049
 think+0.047
orse+0.046
Neg logit contributions
ied-0.051
enge-0.048
box-0.047
uttered-0.047
apped-0.047
Token.
Token ID13
Feature activation-9.40e-03

Pos logit contributions
+0.013
 know+0.013
ppy+0.012
One+0.012
 think+0.012
Neg logit contributions
ied-0.041
apped-0.038
ics-0.038
uttered-0.038
enge-0.038
Token
Token ID198
Feature activation-3.61e-03

Pos logit contributions
+0.009
One+0.009
 know+0.009
 think+0.009
 c+0.008
Neg logit contributions
ied-0.019
enge-0.018
uttered-0.017
ics-0.017
uddled-0.017
TokenJohn
Token ID7554
Feature activation-1.37e-03

Pos logit contributions
+0.003
 think+0.003
 know+0.003
 c+0.003
ppy+0.003
Neg logit contributions
ied-0.008
uttered-0.007
enge-0.007
ics-0.007
uddled-0.007
Token loved
Token ID6151
Feature activation-3.43e-03

Pos logit contributions
One+0.002
 know+0.002
 One+0.002
+0.002
 conclusions+0.002
Neg logit contributions
ied-0.002
uttered-0.002
uddled-0.002
enge-0.002
ics-0.002
Token time
Token ID640
Feature activation-3.33e-03

Pos logit contributions
ied+0.019
enge+0.018
uttered+0.018
uddled+0.018
ics+0.017
Neg logit contributions
One-0.011
-0.011
 One-0.010
 know-0.010
 think-0.010
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
+0.002
One+0.002
 know+0.002
 One+0.002
."+0.002
Neg logit contributions
ied-0.008
enge-0.008
uttered-0.007
uddled-0.007
ics-0.007
Token Every
Token ID3887
Feature activation+1.79e-03

Pos logit contributions
+0.010
 One+0.010
One+0.009
 know+0.009
."+0.008
Neg logit contributions
ied-0.022
enge-0.021
uddled-0.020
ics-0.020
uttered-0.020
Token time
Token ID640
Feature activation+2.46e-03

Pos logit contributions
ied+0.004
enge+0.004
uddled+0.004
uttered+0.003
ics+0.003
Neg logit contributions
-0.002
One-0.002
 One-0.002
 know-0.002
 think-0.002
Token they
Token ID484
Feature activation-1.62e-02

Pos logit contributions
ied+0.005
uddled+0.005
enge+0.005
awed+0.005
plug+0.005
Neg logit contributions
-0.003
 One-0.002
,"-0.002
 know-0.002
 see-0.002
Token thought
Token ID1807
Feature activation-2.88e-02

Pos logit contributions
+0.022
One+0.022
 know+0.021
 One+0.021
 think+0.020
Neg logit contributions
ied-0.027
enge-0.025
uttered-0.025
uddled-0.024
ics-0.024
Token about
Token ID546
Feature activation+7.19e-03

Pos logit contributions
+0.029
 know+0.023
."+0.023
 One+0.023
,"+0.022
Neg logit contributions
enge-0.064
ied-0.060
ipt-0.060
uttered-0.060
uddled-0.060
Token the
Token ID262
Feature activation+4.07e-03

Pos logit contributions
ied+0.013
enge+0.012
uttered+0.012
uddled+0.011
ics+0.011
Neg logit contributions
One-0.009
-0.009
 know-0.008
 One-0.008
 think-0.008
Token impressive
Token ID8036
Feature activation+2.29e-02

Pos logit contributions
ied+0.007
enge+0.007
uttered+0.007
uddled+0.007
ics+0.007
Neg logit contributions
-0.005
One-0.005
 know-0.005
 One-0.005
 think-0.005
Token city
Token ID1748
Feature activation+6.10e-03

Pos logit contributions
ied+0.041
enge+0.037
uttered+0.037
uddled+0.037
ics+0.035
Neg logit contributions
One-0.029
-0.029
 One-0.029
 know-0.028
 think-0.028
Token,
Token ID11
Feature activation-9.64e-03

Pos logit contributions
ied+0.014
enge+0.014
uttered+0.014
uddled+0.014
ipt+0.013
Neg logit contributions
-0.005
 One-0.004
."-0.004
,"-0.004
 know-0.004