TOP ACTIVATIONS
MAX = 4.452

day and shared a spicy pie every week . <|endoftext|> Once
and put it in a pie pan . They add the
and the hole in the pie . She was not happy
then put them in a pie dish . She put it
pour the mixture into a pie tin and attach it to
saw her mother holding a freshly - b aked muff in
and put it in the pie crust . Then he put
and put it in a pie dish . Then she poured
and decided to bake a pie . They both laughed and
some bread and a warm pie that his neighbour had left
all laughed and shared the pie . It was the best
crust on and put the pie in the oven .
to turn it into a pie . He took it to
They smiled and ate the pie . They were happy .
They smiled and ate the pie . They were happy .
happy and enjoyed eating her pie . <|endoftext|> Once upon a
's mom took out the pie . She put it on
balloons . He threw a pie at another clown . He
Tim my a warm , freshly baked cookie .
set to making a delicious pie . They took out the

MIN ACTIVATIONS
MIN = -4.184

and grand pa taught the grand child how to play .
The kids all hugged the grand pa and thanked him for
asked , " What other planets are there ?" Her mom
When the suitcase was ready , the mom
you want to see the attic , Nib bles ? There
. She wrote about how grand baby is always smiling ,
Plan ets ? What are planets like ?" Ben asked .
rounded the corner of the harbour , it was surprised to
others . That is what grand pa taught me . That
seek . Tom was the seeker and Lily was the h
and asked , " What papers ?" M um
park . Sara is the seeker and Ben is the h
park . Lily was the seeker and Tom was the h
lunch , he handed each grand kid a special present .
house . Lily was the seeker and Max was the h
park . Lily was the seeker and Max was the h
was very elderly . The grand pa was always teaching the
" In the attic , Grand ma . We
park . Lily was the seeker and Tom was the h
seas hell . " That seas hell belongs to you ,

INTERVAL 2.293 - 4.452
CONTAINS 0.103%

brown hair and a big smile . He walked up to
The girl picked out a cookie from the cart and happily
picked out some apples and bananas . When they got to
of snow men and snow balls and fun . They were
saw Tom and Lily and Spot . They were not happy

INTERVAL 0.134 - 2.293
CONTAINS 39.649%

decided to take a nap in the woods . When he
forest until he was tired and then he went to bed
t eddy bear . He is soft and brown and has
and a drink and cheers him on . " You are
They are sad . They have no one to play with

INTERVAL -2.025 - 0.134
CONTAINS 60.074%

by giving him his wallet ! Tim was so
you think I look silly ?" he asked .
played together for a while . Suddenly , a big storm
Billy said to his mom , � � €
can be dangerous ! <|endoftext|> Once upon a time , there

INTERVAL -4.184 - -2.025
CONTAINS 0.174%

car . They were still tired and frustrated , but they
the room . On the wall was a big black board
go home . She said goodbye to the monster and the
home . He taught the other birds that it 's important
elt down , hugged her tightly and said , " It
Token day
Token ID1110
Feature activation+2.37e+00

Pos logit contributions
chair+0.012
pins+0.010
 bed+0.010
washer+0.010
 of+0.010
Neg logit contributions
 appearance-0.019
ho-0.019
 cheating-0.019
 matter-0.019
iqu-0.018
Token and
Token ID290
Feature activation-4.38e-01

Pos logit contributions
 matter+0.105
 disagreement+0.090
 swoop+0.088
 problem+0.084
 beginning+0.084
Neg logit contributions
pins-0.053
uts-0.049
ouses-0.045
 Eve-0.045
ugg-0.041
Token shared
Token ID4888
Feature activation-1.93e-01

Pos logit contributions
pins+0.012
 Eve+0.011
ugg+0.011
ouses+0.011
uts+0.010
Neg logit contributions
 matter-0.015
 disagreement-0.013
 problem-0.013
 beginning-0.013
urious-0.013
Token a
Token ID257
Feature activation+4.58e-02

Pos logit contributions
pins+0.004
ouses+0.003
 Eve+0.003
hes+0.003
 Santa+0.003
Neg logit contributions
 matter-0.008
ho-0.008
 beginning-0.008
 disagreement-0.007
 problem-0.007
Token spicy
Token ID26880
Feature activation+1.53e+00

Pos logit contributions
 matter+0.002
 beginning+0.001
 disagreement+0.001
ho+0.001
 swoop+0.001
Neg logit contributions
pins-0.001
ouses-0.001
ugg-0.001
uts-0.001
 Eve-0.001
Token pie
Token ID2508
Feature activation+4.45e+00

Pos logit contributions
 matter+0.062
 beginning+0.058
 cheating+0.055
 impossible+0.055
 disagreement+0.053
Neg logit contributions
ugg-0.030
pins-0.029
ouses-0.027
uts-0.026
aid-0.026
Token every
Token ID790
Feature activation-1.35e+00

Pos logit contributions
 matter+0.186
 disagreement+0.165
 beginning+0.159
 swoop+0.156
 problem+0.152
Neg logit contributions
pins-0.106
uts-0.097
ouses-0.095
 Eve-0.090
ugg-0.089
Token week
Token ID1285
Feature activation+1.54e+00

Pos logit contributions
chair+0.029
ugg+0.025
 of+0.025
."+0.025
 Eve+0.025
Neg logit contributions
ho-0.043
 appearance-0.043
 cheating-0.042
ric-0.041
ths-0.040
Token.
Token ID13
Feature activation-3.44e-01

Pos logit contributions
 matter+0.075
 disagreement+0.067
Uh+0.065
 swoop+0.065
urious+0.063
Neg logit contributions
pins-0.028
uts-0.027
ouses-0.023
 Eve-0.023
obs-0.018
Token<|endoftext|>
Token ID50256
Feature activation+2.64e-01

Pos logit contributions
pins+0.007
ouses+0.006
uts+0.006
unn+0.006
ugg+0.005
Neg logit contributions
 matter-0.015
 disagreement-0.014
 beginning-0.013
ho-0.013
 swoop-0.013
TokenOnce
Token ID7454
Feature activation+1.82e-01

Pos logit contributions
 matter+0.011
 disagreement+0.010
 beginning+0.010
ho+0.009
 swoop+0.009
Neg logit contributions
pins-0.006
ouses-0.005
uts-0.005
ugg-0.005
 Eve-0.005
Token and
Token ID290
Feature activation-4.69e-02

Pos logit contributions
 matter+0.093
 disagreement+0.080
 beginning+0.078
 swoop+0.076
 problem+0.073
Neg logit contributions
pins-0.047
uts-0.040
ouses-0.039
 Eve-0.037
ugg-0.037
Token put
Token ID1234
Feature activation-2.53e-01

Pos logit contributions
pins+0.001
ouses+0.001
 Eve+0.001
ugg+0.001
uts+0.001
Neg logit contributions
 matter-0.002
 disagreement-0.001
 beginning-0.001
ho-0.001
 swoop-0.001
Token it
Token ID340
Feature activation+3.43e-01

Pos logit contributions
pins+0.008
 Eve+0.007
ouses+0.007
hes+0.007
ugg+0.006
Neg logit contributions
 matter-0.009
 disagreement-0.007
 beginning-0.007
ho-0.007
 swoop-0.007
Token in
Token ID287
Feature activation+1.85e-01

Pos logit contributions
 matter+0.016
 disagreement+0.014
where+0.012
 problem+0.012
 beginning+0.012
Neg logit contributions
pins-0.008
ouses-0.007
ugg-0.006
uts-0.006
hes-0.006
Token a
Token ID257
Feature activation+1.38e-01

Pos logit contributions
 matter+0.007
 disagreement+0.006
ho+0.006
 beginning+0.006
 swoop+0.006
Neg logit contributions
pins-0.005
 Eve-0.004
ouses-0.004
uts-0.004
ugg-0.004
Token pie
Token ID2508
Feature activation+4.42e+00

Pos logit contributions
 matter+0.007
 disagreement+0.006
 beginning+0.006
ho+0.006
 cheating+0.006
Neg logit contributions
pins-0.002
 Eve-0.002
ouses-0.002
washer-0.002
 bed-0.002
Token pan
Token ID3425
Feature activation+9.34e-01

Pos logit contributions
 matter+0.160
 disagreement+0.138
 beginning+0.135
 swoop+0.131
ho+0.131
Neg logit contributions
pins-0.128
ouses-0.116
ugg-0.112
uts-0.112
 Eve-0.111
Token.
Token ID13
Feature activation+1.94e-01

Pos logit contributions
 matter+0.037
 disagreement+0.033
 beginning+0.031
 swoop+0.031
 problem+0.030
Neg logit contributions
pins-0.025
ouses-0.022
uts-0.021
 Eve-0.021
ugg-0.021
Token They
Token ID1119
Feature activation-4.15e-01

Pos logit contributions
 matter+0.007
 disagreement+0.006
 beginning+0.006
ho+0.006
 total+0.006
Neg logit contributions
pins-0.005
ouses-0.005
unn-0.005
ugg-0.005
 complexion-0.005
Token add
Token ID751
Feature activation+4.40e-01

Pos logit contributions
pins+0.012
ouses+0.011
uts+0.011
ugg+0.011
 Eve+0.011
Neg logit contributions
 matter-0.015
 disagreement-0.012
 beginning-0.012
 swoop-0.012
ho-0.012
Token the
Token ID262
Feature activation-3.57e-01

Pos logit contributions
 matter+0.018
 disagreement+0.015
 beginning+0.015
ho+0.015
 swoop+0.015
Neg logit contributions
pins-0.011
ouses-0.010
ugg-0.009
uts-0.009
 Eve-0.009
Token and
Token ID290
Feature activation+5.84e-01

Pos logit contributions
 matter+0.024
 disagreement+0.020
 swoop+0.019
ah+0.019
where+0.019
Neg logit contributions
pins-0.012
uts-0.010
 opener-0.008
 Eve-0.008
washer-0.008
Token the
Token ID262
Feature activation+7.53e-01

Pos logit contributions
 matter+0.023
 disagreement+0.021
 beginning+0.020
ho+0.020
 swoop+0.019
Neg logit contributions
pins-0.014
ouses-0.012
 Eve-0.012
uts-0.011
ugg-0.011
Token hole
Token ID7604
Feature activation+5.37e-01

Pos logit contributions
 matter+0.035
ah+0.030
ths+0.030
urious+0.030
 swoop+0.030
Neg logit contributions
pins-0.013
 sofa-0.010
ouses-0.009
 buck-0.009
 CDs-0.009
Token in
Token ID287
Feature activation+1.02e+00

Pos logit contributions
 matter+0.027
 disagreement+0.024
where+0.023
Really+0.022
 swoop+0.022
Neg logit contributions
pins-0.009
uts-0.008
hes-0.007
washer-0.006
 opener-0.005
Token the
Token ID262
Feature activation+1.31e+00

Pos logit contributions
 matter+0.041
ho+0.037
 disagreement+0.037
 beginning+0.037
 swoop+0.036
Neg logit contributions
pins-0.024
 Eve-0.021
uts-0.019
ouses-0.018
ugg-0.017
Token pie
Token ID2508
Feature activation+4.39e+00

Pos logit contributions
 matter+0.060
 disagreement+0.051
 swoop+0.050
ths+0.049
ah+0.048
Neg logit contributions
pins-0.026
 Eve-0.022
ugg-0.021
ouses-0.020
washer-0.020
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
 matter+0.196
 disagreement+0.169
 swoop+0.161
 beginning+0.160
where+0.155
Neg logit contributions
pins-0.098
uts-0.086
 Eve-0.082
ouses-0.077
washer-0.072
Token She
Token ID1375
Feature activation-3.32e-01

Pos logit contributions
ouses+0.005
pins+0.005
unn+0.005
ugg+0.005
 complexion+0.004
Neg logit contributions
 matter-0.004
 disagreement-0.003
ho-0.003
What-0.003
 beginning-0.003
Token was
Token ID373
Feature activation-1.01e-01

Pos logit contributions
pins+0.010
ouses+0.009
 Eve+0.009
ugg+0.009
uts+0.009
Neg logit contributions
 matter-0.011
 beginning-0.009
 disagreement-0.009
ho-0.009
 swoop-0.008
Token not
Token ID407
Feature activation+1.36e+00

Pos logit contributions
pins+0.003
hes+0.003
uts+0.003
 Eve+0.003
ouses+0.003
Neg logit contributions
 matter-0.004
ho-0.003
 disagreement-0.003
ah-0.003
 swoop-0.003
Token happy
Token ID3772
Feature activation+3.65e-01

Pos logit contributions
 matter+0.050
 disagreement+0.044
ho+0.043
 beginning+0.042
 swoop+0.041
Neg logit contributions
pins-0.039
ouses-0.035
uts-0.034
 Eve-0.033
ugg-0.031
Token then
Token ID788
Feature activation-1.36e-01

Pos logit contributions
 matter+0.018
 disagreement+0.017
 beginning+0.016
ho+0.016
 swoop+0.016
Neg logit contributions
pins-0.010
ouses-0.008
 Eve-0.008
uts-0.008
ugg-0.008
Token put
Token ID1234
Feature activation-3.68e-01

Pos logit contributions
pins+0.003
 Eve+0.003
ouses+0.003
uts+0.002
ugg+0.002
Neg logit contributions
 matter-0.006
 disagreement-0.005
 beginning-0.005
ho-0.005
 problem-0.005
Token them
Token ID606
Feature activation+1.19e+00

Pos logit contributions
pins+0.012
 Eve+0.010
ouses+0.010
hes+0.009
ugg+0.009
Neg logit contributions
 matter-0.013
 disagreement-0.011
 beginning-0.010
 cheating-0.010
 problem-0.010
Token in
Token ID287
Feature activation+2.72e-01

Pos logit contributions
 matter+0.050
 disagreement+0.043
 problem+0.040
 beginning+0.039
 cheating+0.039
Neg logit contributions
pins-0.032
ouses-0.027
uts-0.025
 Eve-0.025
ugg-0.025
Token a
Token ID257
Feature activation+4.97e-01

Pos logit contributions
 matter+0.011
 disagreement+0.010
ho+0.010
 swoop+0.010
 beginning+0.010
Neg logit contributions
pins-0.006
 Eve-0.005
uts-0.005
ouses-0.005
 Santa-0.004
Token pie
Token ID2508
Feature activation+4.35e+00

Pos logit contributions
 matter+0.025
 cheating+0.022
ths+0.020
 beginning+0.020
 disagreement+0.020
Neg logit contributions
pins-0.008
 Eve-0.007
ouses-0.006
washer-0.006
 bed-0.005
Token dish
Token ID9433
Feature activation+8.73e-01

Pos logit contributions
 matter+0.168
 disagreement+0.146
 beginning+0.142
 swoop+0.138
ho+0.135
Neg logit contributions
pins-0.117
ouses-0.105
 Eve-0.102
uts-0.102
ugg-0.100
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
 matter+0.031
ho+0.028
 beginning+0.028
 disagreement+0.027
 swoop+0.026
Neg logit contributions
pins-0.023
ouses-0.020
 Eve-0.020
ugg-0.020
unn-0.020
Token She
Token ID1375
Feature activation+1.46e-01

Pos logit contributions
 matter+0.001
 disagreement+0.001
 beginning+0.001
ho+0.001
 total+0.001
Neg logit contributions
pins-0.001
ouses-0.000
unn-0.000
 complexion-0.000
uts-0.000
Token put
Token ID1234
Feature activation-6.63e-01

Pos logit contributions
 matter+0.005
 disagreement+0.004
 beginning+0.004
 swoop+0.004
ho+0.004
Neg logit contributions
pins-0.005
ouses-0.004
uts-0.004
ugg-0.004
 Eve-0.004
Token it
Token ID340
Feature activation-3.54e-01

Pos logit contributions
pins+0.020
 Eve+0.017
ouses+0.017
ugg+0.017
uts+0.016
Neg logit contributions
 matter-0.024
 disagreement-0.020
 beginning-0.019
ho-0.019
 swoop-0.019
Token pour
Token ID12797
Feature activation+8.06e-01

Pos logit contributions
pins+0.005
ouses+0.005
 Eve+0.005
ugg+0.004
uts+0.004
Neg logit contributions
 matter-0.009
 disagreement-0.008
 beginning-0.008
ho-0.007
 problem-0.007
Token the
Token ID262
Feature activation+3.21e-01

Pos logit contributions
 matter+0.033
 disagreement+0.029
 beginning+0.028
 swoop+0.028
ho+0.028
Neg logit contributions
pins-0.020
 Eve-0.016
ouses-0.016
ugg-0.016
uts-0.015
Token mixture
Token ID11710
Feature activation+2.75e+00

Pos logit contributions
 matter+0.014
 cheating+0.012
 disagreement+0.012
 comparison+0.012
 beginning+0.011
Neg logit contributions
pins-0.006
 milk-0.006
 Eve-0.005
ouses-0.005
aid-0.005
Token into
Token ID656
Feature activation+2.61e-01

Pos logit contributions
 matter+0.112
 disagreement+0.098
 beginning+0.094
 swoop+0.093
ho+0.090
Neg logit contributions
pins-0.070
ouses-0.060
uts-0.060
ugg-0.057
 Eve-0.057
Token a
Token ID257
Feature activation+4.97e-01

Pos logit contributions
 matter+0.012
ho+0.011
 disagreement+0.011
 beginning+0.010
ah+0.010
Neg logit contributions
pins-0.005
 Eve-0.004
 Santa-0.004
ouses-0.003
 buck-0.003
Token pie
Token ID2508
Feature activation+4.31e+00

Pos logit contributions
 matter+0.022
urious+0.019
 cheating+0.019
 appearance+0.018
ths+0.018
Neg logit contributions
pins-0.008
 Eve-0.008
washer-0.007
 bed-0.007
ouses-0.007
Token tin
Token ID19783
Feature activation+7.11e-01

Pos logit contributions
 matter+0.177
 disagreement+0.155
 beginning+0.147
 swoop+0.145
 problem+0.141
Neg logit contributions
pins-0.110
ouses-0.095
uts-0.094
 Eve-0.094
ugg-0.089
Token and
Token ID290
Feature activation+4.75e-01

Pos logit contributions
 matter+0.027
 disagreement+0.023
 beginning+0.023
ho+0.023
 swoop+0.023
Neg logit contributions
pins-0.019
ouses-0.017
ugg-0.016
uts-0.016
 Eve-0.016
Token attach
Token ID10199
Feature activation+4.70e-01

Pos logit contributions
 matter+0.019
 disagreement+0.017
 beginning+0.017
ho+0.016
 problem+0.016
Neg logit contributions
pins-0.012
 Eve-0.010
ouses-0.010
ugg-0.009
uts-0.009
Token it
Token ID340
Feature activation+1.26e+00

Pos logit contributions
 matter+0.020
 disagreement+0.018
 beginning+0.017
 quarrel+0.017
 cheating+0.017
Neg logit contributions
pins-0.011
 Eve-0.009
uts-0.008
ouses-0.008
hes-0.008
Token to
Token ID284
Feature activation+7.52e-01

Pos logit contributions
 matter+0.062
 disagreement+0.053
Uh+0.052
 cheating+0.051
 End+0.050
Neg logit contributions
pins-0.023
uts-0.021
ouses-0.020
ugg-0.018
 Eve-0.017
Token saw
Token ID2497
Feature activation+7.05e-02

Pos logit contributions
pins+0.005
ouses+0.005
 Eve+0.005
ugg+0.005
uts+0.005
Neg logit contributions
 matter-0.005
 disagreement-0.004
 beginning-0.004
 problem-0.004
ho-0.004
Token her
Token ID607
Feature activation-2.54e-01

Pos logit contributions
 matter+0.003
 disagreement+0.003
 beginning+0.002
ho+0.002
 swoop+0.002
Neg logit contributions
pins-0.002
ouses-0.001
uts-0.001
 Eve-0.001
ugg-0.001
Token mother
Token ID2802
Feature activation-2.55e-01

Pos logit contributions
pins+0.006
uts+0.005
 complexion+0.005
 Eve+0.005
ouses+0.005
Neg logit contributions
 matter-0.010
 swoop-0.009
 disagreement-0.008
Time-0.008
ho-0.008
Token holding
Token ID4769
Feature activation+4.37e-01

Pos logit contributions
 Eve+0.005
pins+0.005
 dressed+0.005
 buck+0.005
 The+0.005
Neg logit contributions
 matter-0.011
 disagreement-0.009
OO-0.009
heit-0.009
ho-0.009
Token a
Token ID257
Feature activation+5.04e-01

Pos logit contributions
 matter+0.020
 disagreement+0.017
ho+0.017
 swoop+0.017
 problem+0.017
Neg logit contributions
pins-0.009
 Eve-0.008
ouses-0.007
 Santa-0.006
ugg-0.006
Token freshly
Token ID29026
Feature activation+4.30e+00

Pos logit contributions
 matter+0.020
ths+0.019
urious+0.018
 End+0.018
 beginning+0.018
Neg logit contributions
pins-0.010
 complexion-0.009
ouses-0.009
 buck-0.009
 Eve-0.009
Token-
Token ID12
Feature activation+1.67e+00

Pos logit contributions
 matter+0.161
 disagreement+0.139
ho+0.135
 beginning+0.129
 impossible+0.129
Neg logit contributions
ugg-0.110
pins-0.106
ouses-0.102
hes-0.102
aid-0.099
Tokenb
Token ID65
Feature activation+1.75e+00

Pos logit contributions
 matter+0.065
 disagreement+0.059
 beginning+0.057
 problem+0.056
 quarrel+0.053
Neg logit contributions
pins-0.043
ugg-0.039
ouses-0.038
uts-0.036
 Eve-0.035
Tokenaked
Token ID4335
Feature activation+2.72e+00

Pos logit contributions
 matter+0.076
 disagreement+0.067
 beginning+0.064
ho+0.063
 swoop+0.062
Neg logit contributions
pins-0.040
ouses-0.035
uts-0.035
ugg-0.033
 Eve-0.032
Token muff
Token ID27563
Feature activation+1.42e+00

Pos logit contributions
 matter+0.116
 disagreement+0.102
 total+0.096
ho+0.096
Really+0.092
Neg logit contributions
pins-0.066
 buck-0.052
 dressed-0.051
obs-0.048
uts-0.048
Tokenin
Token ID259
Feature activation+7.44e-01

Pos logit contributions
 matter+0.059
 disagreement+0.052
 beginning+0.049
 quarrel+0.048
 swoop+0.048
Neg logit contributions
pins-0.036
ouses-0.034
uts-0.032
ugg-0.032
unn-0.031
Token and
Token ID290
Feature activation+3.85e-01

Pos logit contributions
 matter+0.020
 disagreement+0.019
 beginning+0.018
 cheating+0.017
 problem+0.017
Neg logit contributions
pins-0.011
 Eve-0.009
ouses-0.009
ests-0.008
uts-0.008
Token put
Token ID1234
Feature activation+1.73e-01

Pos logit contributions
 matter+0.014
 disagreement+0.013
 beginning+0.013
ho+0.012
 problem+0.012
Neg logit contributions
pins-0.010
ouses-0.009
 Eve-0.009
 buck-0.008
ugg-0.008
Token it
Token ID340
Feature activation+6.82e-01

Pos logit contributions
 matter+0.007
 disagreement+0.006
 beginning+0.006
ho+0.006
urious+0.006
Neg logit contributions
pins-0.005
 Eve-0.004
ouses-0.004
hes-0.004
 Santa-0.004
Token in
Token ID287
Feature activation+4.44e-01

Pos logit contributions
 matter+0.029
 disagreement+0.025
 problem+0.023
 cheating+0.023
 quarrel+0.023
Neg logit contributions
pins-0.017
ouses-0.015
uts-0.014
ugg-0.014
 Eve-0.013
Token the
Token ID262
Feature activation+7.19e-01

Pos logit contributions
 matter+0.019
ho+0.018
 disagreement+0.017
 swoop+0.017
 problem+0.017
Neg logit contributions
pins-0.008
 Eve-0.007
 Santa-0.007
 opener-0.006
 buck-0.006
Token pie
Token ID2508
Feature activation+4.27e+00

Pos logit contributions
 matter+0.031
 disagreement+0.028
ho+0.027
 cheating+0.027
 swoop+0.026
Neg logit contributions
pins-0.012
 Eve-0.012
 milk-0.011
ugg-0.011
aid-0.010
Token crust
Token ID20048
Feature activation+2.26e+00

Pos logit contributions
 matter+0.169
 disagreement+0.148
 beginning+0.141
 swoop+0.139
 problem+0.135
Neg logit contributions
pins-0.113
ouses-0.100
uts-0.098
 Eve-0.098
ugg-0.094
Token.
Token ID13
Feature activation-2.63e-02

Pos logit contributions
 matter+0.095
 disagreement+0.084
 swoop+0.079
 beginning+0.078
 problem+0.077
Neg logit contributions
pins-0.058
uts-0.051
ouses-0.049
 Eve-0.046
ugg-0.045
Token Then
Token ID3244
Feature activation+6.70e-01

Pos logit contributions
pins+0.000
uts+0.000
 Eve+0.000
ouses+0.000
ugg+0.000
Neg logit contributions
 matter-0.001
 disagreement-0.001
 problem-0.001
ho-0.001
 swoop-0.001
Token he
Token ID339
Feature activation+8.94e-02

Pos logit contributions
 matter+0.033
ho+0.030
 disagreement+0.030
 beginning+0.029
urious+0.029
Neg logit contributions
pins-0.008
 Eve-0.007
 Santa-0.006
ouses-0.005
uts-0.005
Token put
Token ID1234
Feature activation-2.04e-01

Pos logit contributions
 matter+0.003
 disagreement+0.003
 beginning+0.003
ho+0.003
 problem+0.003
Neg logit contributions
pins-0.002
ouses-0.002
uts-0.002
 Eve-0.002
ugg-0.002
Token and
Token ID290
Feature activation-1.89e-01

Pos logit contributions
 matter+0.072
 disagreement+0.062
 beginning+0.061
 swoop+0.059
 problem+0.057
Neg logit contributions
pins-0.033
uts-0.029
ouses-0.026
 Eve-0.025
ugg-0.024
Token put
Token ID1234
Feature activation-4.07e-01

Pos logit contributions
pins+0.005
ouses+0.004
 Eve+0.004
uts+0.004
 buck+0.004
Neg logit contributions
 matter-0.007
 disagreement-0.007
 beginning-0.007
 problem-0.006
urious-0.006
Token it
Token ID340
Feature activation+2.14e-01

Pos logit contributions
pins+0.010
hes+0.008
 Eve+0.008
aid+0.008
ouses+0.008
Neg logit contributions
 matter-0.017
 disagreement-0.015
urious-0.014
ho-0.014
 beginning-0.014
Token in
Token ID287
Feature activation+4.70e-02

Pos logit contributions
 matter+0.009
 disagreement+0.008
where+0.007
urious+0.007
 problem+0.007
Neg logit contributions
pins-0.005
ouses-0.004
ugg-0.004
uts-0.004
hes-0.004
Token a
Token ID257
Feature activation+1.09e-01

Pos logit contributions
 matter+0.002
 disagreement+0.002
ho+0.002
 beginning+0.002
 swoop+0.002
Neg logit contributions
pins-0.001
 Eve-0.001
uts-0.001
ouses-0.001
 opener-0.001
Token pie
Token ID2508
Feature activation+4.15e+00

Pos logit contributions
 matter+0.005
ths+0.004
ho+0.004
 beginning+0.004
 disagreement+0.004
Neg logit contributions
pins-0.002
 Eve-0.002
ouses-0.001
washer-0.001
 opener-0.001
Token dish
Token ID9433
Feature activation+8.20e-01

Pos logit contributions
 matter+0.152
 disagreement+0.131
 beginning+0.128
 swoop+0.125
ho+0.124
Neg logit contributions
pins-0.118
ouses-0.107
uts-0.103
ugg-0.103
 Eve-0.103
Token.
Token ID13
Feature activation+2.29e-01

Pos logit contributions
 matter+0.025
ho+0.024
 beginning+0.023
 disagreement+0.022
 problem+0.022
Neg logit contributions
pins-0.024
ouses-0.022
unn-0.022
 Eve-0.022
ugg-0.022
Token Then
Token ID3244
Feature activation+3.99e-01

Pos logit contributions
 matter+0.006
 disagreement+0.005
What+0.005
 beginning+0.005
 total+0.005
Neg logit contributions
pins-0.007
unn-0.007
ouses-0.007
 complexion-0.007
grand-0.007
Token she
Token ID673
Feature activation-2.54e-02

Pos logit contributions
 matter+0.017
 beginning+0.015
 disagreement+0.015
 swoop+0.015
ho+0.015
Neg logit contributions
pins-0.009
 Eve-0.008
ouses-0.007
uts-0.007
ugg-0.007
Token poured
Token ID19036
Feature activation+2.00e-02

Pos logit contributions
pins+0.001
ouses+0.001
uts+0.001
ugg+0.001
 Eve+0.001
Neg logit contributions
 matter-0.001
 swoop-0.001
 disagreement-0.001
 beginning-0.001
ho-0.001
Token and
Token ID290
Feature activation-4.21e-01

Pos logit contributions
 matter+0.048
 disagreement+0.041
 swoop+0.040
 beginning+0.039
ho+0.039
Neg logit contributions
pins-0.022
uts-0.019
ouses-0.019
 Eve-0.018
ugg-0.017
Token decided
Token ID3066
Feature activation-4.68e-03

Pos logit contributions
pins+0.012
ouses+0.011
ugg+0.011
uts+0.011
 Eve+0.011
Neg logit contributions
 matter-0.015
 disagreement-0.013
 beginning-0.013
ho-0.012
 swoop-0.012
Token to
Token ID284
Feature activation+7.70e-01

Pos logit contributions
pins+0.000
uts+0.000
ouses+0.000
 Eve+0.000
 buck+0.000
Neg logit contributions
 matter-0.000
 disagreement-0.000
ah-0.000
 beginning-0.000
ho-0.000
Token bake
Token ID28450
Feature activation+5.17e-01

Pos logit contributions
 matter+0.027
 disagreement+0.024
 beginning+0.024
ho+0.023
 swoop+0.023
Neg logit contributions
pins-0.022
ouses-0.020
ugg-0.019
 Eve-0.019
uts-0.019
Token a
Token ID257
Feature activation+7.39e-01

Pos logit contributions
 matter+0.023
ho+0.021
 disagreement+0.020
 swoop+0.019
 beginning+0.019
Neg logit contributions
pins-0.011
hes-0.010
ouses-0.010
 Eve-0.009
uts-0.009
Token pie
Token ID2508
Feature activation+4.12e+00

Pos logit contributions
 matter+0.030
 beginning+0.027
ho+0.026
 disagreement+0.026
 swoop+0.026
Neg logit contributions
pins-0.016
 Eve-0.014
ouses-0.014
washer-0.013
grand-0.013
Token.
Token ID13
Feature activation-3.67e-01

Pos logit contributions
 matter+0.166
 disagreement+0.146
 beginning+0.139
 swoop+0.134
ho+0.134
Neg logit contributions
pins-0.106
ouses-0.095
uts-0.095
 Eve-0.091
ugg-0.087
Token They
Token ID1119
Feature activation-3.61e-01

Pos logit contributions
pins+0.011
unn+0.010
ouses+0.010
ugg+0.010
aid+0.010
Neg logit contributions
 matter-0.012
 disagreement-0.010
 beginning-0.010
 total-0.010
ho-0.009
Token both
Token ID1111
Feature activation+5.89e-01

Pos logit contributions
pins+0.011
ouses+0.010
uts+0.010
ugg+0.010
 Eve+0.010
Neg logit contributions
 matter-0.012
 disagreement-0.011
 beginning-0.010
 swoop-0.010
ho-0.010
Token laughed
Token ID13818
Feature activation+1.19e+00

Pos logit contributions
 matter+0.021
 disagreement+0.019
 beginning+0.018
 problem+0.017
ho+0.017
Neg logit contributions
pins-0.016
ouses-0.015
ugg-0.015
 Eve-0.014
uts-0.014
Token and
Token ID290
Feature activation-8.82e-01

Pos logit contributions
 matter+0.055
 disagreement+0.049
 beginning+0.048
ho+0.047
 swoop+0.047
Neg logit contributions
pins-0.023
ouses-0.019
uts-0.018
 Eve-0.018
ugg-0.018
Token some
Token ID617
Feature activation-3.84e-01

Pos logit contributions
pins+0.005
ouses+0.005
uts+0.004
ugg+0.004
unn+0.004
Neg logit contributions
 matter-0.006
 beginning-0.005
 swoop-0.005
 disagreement-0.005
ho-0.005
Token bread
Token ID8509
Feature activation+2.78e-01

Pos logit contributions
pins+0.010
ouses+0.009
 Eve+0.009
ugg+0.008
uts+0.008
Neg logit contributions
 matter-0.015
 disagreement-0.013
 beginning-0.013
ho-0.013
 swoop-0.013
Token and
Token ID290
Feature activation+8.85e-02

Pos logit contributions
 matter+0.013
 disagreement+0.012
 beginning+0.011
 swoop+0.011
 problem+0.011
Neg logit contributions
pins-0.006
uts-0.005
ouses-0.005
ugg-0.004
 Eve-0.004
Token a
Token ID257
Feature activation+5.43e-01

Pos logit contributions
 matter+0.003
 swoop+0.003
 disagreement+0.003
 beginning+0.003
ho+0.002
Neg logit contributions
pins-0.003
ouses-0.002
ugg-0.002
uts-0.002
unn-0.002
Token warm
Token ID5814
Feature activation-6.23e-01

Pos logit contributions
 matter+0.024
ths+0.021
urious+0.020
 beginning+0.020
 disagreement+0.020
Neg logit contributions
pins-0.011
 Eve-0.009
ouses-0.009
 complexion-0.009
washer-0.009
Token pie
Token ID2508
Feature activation+4.02e+00

Pos logit contributions
pins+0.015
 Eve+0.013
ugg+0.013
ouses+0.012
 complexion+0.011
Neg logit contributions
 matter-0.025
 disagreement-0.023
 beginning-0.022
 cheating-0.022
ho-0.022
Token that
Token ID326
Feature activation+5.68e-01

Pos logit contributions
 matter+0.178
 disagreement+0.158
 beginning+0.145
urious+0.142
 swoop+0.141
Neg logit contributions
pins-0.093
uts-0.084
 Eve-0.079
ouses-0.077
washer-0.074
Token his
Token ID465
Feature activation-7.13e-02

Pos logit contributions
 matter+0.019
 disagreement+0.016
 beginning+0.015
ho+0.015
 swoop+0.015
Neg logit contributions
pins-0.018
ouses-0.017
ugg-0.016
uts-0.016
 Eve-0.016
Token neighbour
Token ID12250
Feature activation+3.57e-01

Pos logit contributions
pins+0.001
ouses+0.001
 Eve+0.001
uts+0.001
ugg+0.001
Neg logit contributions
 matter-0.003
 swoop-0.003
ho-0.003
 disagreement-0.003
 beginning-0.003
Token had
Token ID550
Feature activation+8.49e-02

Pos logit contributions
 matter+0.014
 disagreement+0.012
 comparison+0.012
 beginning+0.012
where+0.011
Neg logit contributions
 Eve-0.009
pins-0.009
ouses-0.008
grand-0.008
uts-0.008
Token left
Token ID1364
Feature activation-1.33e+00

Pos logit contributions
 matter+0.003
 disagreement+0.003
ho+0.003
 forth+0.003
 beginning+0.003
Neg logit contributions
pins-0.002
uts-0.002
ouses-0.002
 Eve-0.002
ugg-0.002
Token all
Token ID477
Feature activation+5.40e-01

Pos logit contributions
pins+0.014
ouses+0.013
uts+0.013
ugg+0.013
 Eve+0.013
Neg logit contributions
 matter-0.014
 disagreement-0.012
 beginning-0.012
 swoop-0.012
ho-0.012
Token laughed
Token ID13818
Feature activation+1.18e+00

Pos logit contributions
 matter+0.019
 disagreement+0.016
 beginning+0.016
ho+0.016
 swoop+0.016
Neg logit contributions
pins-0.016
ouses-0.015
ugg-0.014
uts-0.014
 Eve-0.014
Token and
Token ID290
Feature activation-7.75e-01

Pos logit contributions
 matter+0.055
 disagreement+0.049
 beginning+0.047
ho+0.047
 swoop+0.047
Neg logit contributions
pins-0.023
ouses-0.019
uts-0.018
ugg-0.018
 Eve-0.018
Token shared
Token ID4888
Feature activation-1.11e-01

Pos logit contributions
pins+0.022
ouses+0.021
uts+0.020
ugg+0.020
 Eve+0.020
Neg logit contributions
 matter-0.028
 disagreement-0.024
 beginning-0.023
 swoop-0.023
ho-0.023
Token the
Token ID262
Feature activation-5.38e-01

Pos logit contributions
pins+0.003
ouses+0.002
uts+0.002
 Eve+0.002
 Santa+0.002
Neg logit contributions
 matter-0.005
ho-0.004
 beginning-0.004
 swoop-0.004
 disagreement-0.004
Token pie
Token ID2508
Feature activation+4.01e+00

Pos logit contributions
pins+0.011
aid+0.011
grand+0.010
 milk+0.010
hes+0.009
Neg logit contributions
 matter-0.021
ho-0.020
 beginning-0.019
ah-0.018
 forth-0.018
Token.
Token ID13
Feature activation-4.18e-01

Pos logit contributions
 matter+0.156
 disagreement+0.135
 beginning+0.133
 swoop+0.129
ho+0.127
Neg logit contributions
pins-0.107
ouses-0.097
uts-0.096
 Eve-0.092
ugg-0.091
Token It
Token ID632
Feature activation-3.58e-01

Pos logit contributions
pins+0.011
ouses+0.010
unn+0.010
 complexion+0.009
ugg+0.009
Neg logit contributions
 matter-0.015
 disagreement-0.014
 beginning-0.013
 quarrel-0.013
 total-0.013
Token was
Token ID373
Feature activation-7.22e-01

Pos logit contributions
pins+0.011
 Eve+0.010
ouses+0.010
uts+0.010
ugg+0.010
Neg logit contributions
 matter-0.011
 disagreement-0.010
 beginning-0.010
ho-0.009
 problem-0.009
Token the
Token ID262
Feature activation-5.88e-01

Pos logit contributions
pins+0.015
 Santa+0.013
 Eve+0.013
hes+0.012
uts+0.011
Neg logit contributions
ho-0.029
 matter-0.028
 disagreement-0.026
ah-0.026
 swoop-0.026
Token best
Token ID1266
Feature activation+6.07e-01

Pos logit contributions
pins+0.013
 Santa+0.011
aid+0.010
 Eve+0.010
washer+0.010
Neg logit contributions
 inspection-0.021
 matter-0.020
 swoop-0.020
ho-0.020
urious-0.019
Token crust
Token ID20048
Feature activation+1.39e+00

Pos logit contributions
pins+0.002
uts+0.001
ouses+0.001
ugg+0.001
 Eve+0.001
Neg logit contributions
 matter-0.003
 disagreement-0.003
 beginning-0.002
 cheating-0.002
 quarrel-0.002
Token on
Token ID319
Feature activation+5.44e-01

Pos logit contributions
 matter+0.058
 disagreement+0.052
 problem+0.047
 cheating+0.046
 quarrel+0.046
Neg logit contributions
pins-0.038
uts-0.032
ouses-0.030
ugg-0.030
 Eve-0.028
Token and
Token ID290
Feature activation+6.75e-02

Pos logit contributions
 matter+0.025
 disagreement+0.023
 problem+0.023
ho+0.022
 cheating+0.022
Neg logit contributions
pins-0.009
 The-0.007
 Eve-0.007
 Santa-0.006
 opener-0.006
Token put
Token ID1234
Feature activation+2.59e-02

Pos logit contributions
 matter+0.003
 disagreement+0.002
 beginning+0.002
 problem+0.002
ho+0.002
Neg logit contributions
pins-0.002
 Eve-0.001
ouses-0.001
ugg-0.001
uts-0.001
Token the
Token ID262
Feature activation+2.38e-01

Pos logit contributions
 matter+0.001
 disagreement+0.001
 appearance+0.001
urious+0.001
ho+0.001
Neg logit contributions
pins-0.001
hes-0.000
 Eve-0.000
 Santa-0.000
cream-0.000
Token pie
Token ID2508
Feature activation+4.01e+00

Pos logit contributions
 matter+0.010
 disagreement+0.009
 cheating+0.009
 hilarious+0.009
 forth+0.008
Neg logit contributions
pins-0.004
 milk-0.004
aid-0.004
 buck-0.003
 stickers-0.003
Token in
Token ID287
Feature activation+6.63e-01

Pos logit contributions
 matter+0.166
 disagreement+0.145
 problem+0.130
 beginning+0.129
 cheating+0.129
Neg logit contributions
pins-0.109
ouses-0.094
uts-0.093
ugg-0.090
 Eve-0.087
Token the
Token ID262
Feature activation+6.35e-01

Pos logit contributions
 matter+0.031
ho+0.029
 disagreement+0.028
 problem+0.028
urious+0.027
Neg logit contributions
pins-0.010
 Eve-0.008
 Santa-0.008
 The-0.008
 opener-0.007
Token oven
Token ID14361
Feature activation+1.61e+00

Pos logit contributions
 matter+0.029
 cheating+0.025
 disagreement+0.025
ho+0.023
ah+0.023
Neg logit contributions
 milk-0.010
pins-0.010
 The-0.009
ugg-0.009
 Eve-0.009
Token.
Token ID13
Feature activation-4.70e-02

Pos logit contributions
 matter+0.069
 disagreement+0.060
 swoop+0.056
 beginning+0.056
 problem+0.055
Neg logit contributions
pins-0.039
uts-0.033
 Eve-0.033
ouses-0.033
ugg-0.031
Token
Token ID198
Feature activation-9.00e-02

Pos logit contributions
pins+0.001
ouses+0.001
uts+0.001
 Eve+0.001
ugg+0.001
Neg logit contributions
 matter-0.002
 disagreement-0.002
 beginning-0.002
ho-0.002
 swoop-0.002
Token to
Token ID284
Feature activation+8.23e-01

Pos logit contributions
 matter+0.011
 disagreement+0.010
ho+0.010
ah+0.009
 total+0.009
Neg logit contributions
pins-0.005
uts-0.004
 Eve-0.004
 Santa-0.003
grand-0.003
Token turn
Token ID1210
Feature activation+5.26e-01

Pos logit contributions
 matter+0.029
 disagreement+0.024
 beginning+0.024
 swoop+0.023
ho+0.023
Neg logit contributions
pins-0.025
ouses-0.023
uts-0.022
ugg-0.022
 Eve-0.022
Token it
Token ID340
Feature activation+3.65e-01

Pos logit contributions
 matter+0.020
ho+0.018
 disagreement+0.018
 swoop+0.017
 beginning+0.017
Neg logit contributions
pins-0.014
 Eve-0.012
 Santa-0.011
ouses-0.011
taker-0.011
Token into
Token ID656
Feature activation+7.14e-02

Pos logit contributions
 matter+0.015
 disagreement+0.014
ho+0.013
Uh+0.013
 cheating+0.012
Neg logit contributions
pins-0.008
 Eve-0.007
ouses-0.007
uts-0.007
 dressed-0.006
Token a
Token ID257
Feature activation+3.10e-01

Pos logit contributions
 matter+0.003
ho+0.003
 disagreement+0.003
ah+0.003
 beginning+0.003
Neg logit contributions
pins-0.001
 Eve-0.001
 Santa-0.001
ouses-0.001
 buck-0.001
Token pie
Token ID2508
Feature activation+4.01e+00

Pos logit contributions
 matter+0.011
ho+0.010
urious+0.010
 cheating+0.010
ths+0.010
Neg logit contributions
 Eve-0.007
 Santa-0.006
washer-0.006
 bed-0.006
pins-0.006
Token.
Token ID13
Feature activation-3.43e-01

Pos logit contributions
 matter+0.166
 disagreement+0.149
 total+0.134
 problem+0.133
 swoop+0.133
Neg logit contributions
pins-0.098
 Eve-0.087
uts-0.081
washer-0.078
ouses-0.077
Token He
Token ID679
Feature activation-2.23e-02

Pos logit contributions
pins+0.008
ouses+0.007
ugg+0.007
uts+0.007
unn+0.007
Neg logit contributions
 matter-0.014
 disagreement-0.012
 beginning-0.012
 swoop-0.012
ho-0.012
Token took
Token ID1718
Feature activation+4.06e-01

Pos logit contributions
pins+0.001
ouses+0.001
 Eve+0.001
ugg+0.001
uts+0.001
Neg logit contributions
 matter-0.001
 disagreement-0.001
 beginning-0.001
ho-0.001
 problem-0.001
Token it
Token ID340
Feature activation-5.57e-02

Pos logit contributions
 matter+0.016
 disagreement+0.014
ho+0.014
 beginning+0.014
 swoop+0.013
Neg logit contributions
pins-0.010
ouses-0.009
 Eve-0.009
ugg-0.009
uts-0.008
Token to
Token ID284
Feature activation-2.22e-02

Pos logit contributions
pins+0.002
uts+0.001
 Eve+0.001
ouses+0.001
ugg+0.001
Neg logit contributions
 matter-0.002
 disagreement-0.002
 beginning-0.002
ho-0.002
 swoop-0.002
TokenThey
Token ID2990
Feature activation-1.75e-02

Pos logit contributions
 matter+0.006
 swoop+0.005
 disagreement+0.005
 beginning+0.005
ho+0.005
Neg logit contributions
pins-0.008
ouses-0.007
ugg-0.007
 Eve-0.007
unn-0.007
Token smiled
Token ID13541
Feature activation+8.79e-01

Pos logit contributions
pins+0.001
ouses+0.001
uts+0.001
 Eve+0.001
unn+0.001
Neg logit contributions
 matter-0.001
 swoop-0.000
ho-0.000
 beginning-0.000
 disagreement-0.000
Token and
Token ID290
Feature activation-6.05e-01

Pos logit contributions
 matter+0.031
 beginning+0.028
 disagreement+0.026
 problem+0.026
ho+0.026
Neg logit contributions
pins-0.024
grand-0.023
ouses-0.023
 Eve-0.023
unn-0.023
Token ate
Token ID15063
Feature activation+5.01e-01

Pos logit contributions
pins+0.020
uts+0.020
ouses+0.020
ests+0.019
grand+0.019
Neg logit contributions
 matter-0.017
 swoop-0.016
ho-0.015
 beginning-0.014
 disagreement-0.014
Token the
Token ID262
Feature activation+3.22e-01

Pos logit contributions
 matter+0.018
 disagreement+0.016
 beginning+0.015
ho+0.015
 swoop+0.015
Neg logit contributions
pins-0.014
ouses-0.013
uts-0.013
ugg-0.013
 Eve-0.013
Token pie
Token ID2508
Feature activation+3.99e+00

Pos logit contributions
 matter+0.014
ho+0.012
 swoop+0.012
 cheating+0.012
 appearance+0.012
Neg logit contributions
pins-0.006
hes-0.005
 milk-0.005
ugg-0.005
 Eve-0.005
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
 matter+0.144
 disagreement+0.125
 beginning+0.124
ho+0.122
 swoop+0.120
Neg logit contributions
pins-0.111
ouses-0.101
ugg-0.099
 Eve-0.097
uts-0.097
Token They
Token ID1119
Feature activation-9.54e-01

Pos logit contributions
unn+0.004
pins+0.004
ouses+0.004
grand+0.004
ugg+0.003
Neg logit contributions
 matter-0.005
 disagreement-0.004
 beginning-0.004
What-0.004
 total-0.004
Token were
Token ID547
Feature activation-1.35e-01

Pos logit contributions
pins+0.032
ouses+0.031
uts+0.030
 Eve+0.030
unn+0.030
Neg logit contributions
 matter-0.029
ho-0.024
 swoop-0.024
 disagreement-0.024
 beginning-0.024
Token happy
Token ID3772
Feature activation+1.47e-01

Pos logit contributions
pins+0.004
ouses+0.003
uts+0.003
 Eve+0.003
ugg+0.003
Neg logit contributions
 matter-0.005
 disagreement-0.004
ho-0.004
 beginning-0.004
 swoop-0.004
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
 matter+0.006
 disagreement+0.005
 beginning+0.005
ho+0.005
 swoop+0.005
Neg logit contributions
pins-0.004
ouses-0.004
ugg-0.004
uts-0.004
 Eve-0.004
TokenThey
Token ID2990
Feature activation+6.16e-03

Pos logit contributions
 matter+0.006
 disagreement+0.005
 beginning+0.005
 swoop+0.005
ho+0.005
Neg logit contributions
pins-0.006
ouses-0.006
ugg-0.006
uts-0.006
 Eve-0.005
Token smiled
Token ID13541
Feature activation+9.68e-01

Pos logit contributions
 matter+0.000
 disagreement+0.000
 beginning+0.000
 swoop+0.000
ho+0.000
Neg logit contributions
pins-0.000
ouses-0.000
uts-0.000
 Eve-0.000
ugg-0.000
Token and
Token ID290
Feature activation-4.84e-01

Pos logit contributions
 matter+0.037
 beginning+0.033
 disagreement+0.032
 problem+0.031
ho+0.031
Neg logit contributions
pins-0.025
ouses-0.023
ugg-0.023
 Eve-0.023
unn-0.022
Token ate
Token ID15063
Feature activation+6.41e-01

Pos logit contributions
pins+0.015
uts+0.015
ouses+0.015
 Eve+0.014
grand+0.014
Neg logit contributions
 matter-0.015
 swoop-0.013
 disagreement-0.013
 beginning-0.013
ho-0.013
Token the
Token ID262
Feature activation+3.03e-01

Pos logit contributions
 matter+0.024
 disagreement+0.021
 beginning+0.021
 swoop+0.020
ho+0.020
Neg logit contributions
pins-0.017
ouses-0.015
ugg-0.015
 Eve-0.015
uts-0.015
Token pie
Token ID2508
Feature activation+3.99e+00

Pos logit contributions
 matter+0.013
ho+0.012
 appearance+0.012
ah+0.012
 cheating+0.011
Neg logit contributions
pins-0.005
 milk-0.005
hes-0.004
aid-0.004
cream-0.004
Token.
Token ID13
Feature activation-1.83e-01

Pos logit contributions
 matter+0.146
 disagreement+0.126
 beginning+0.124
ho+0.121
 swoop+0.121
Neg logit contributions
pins-0.112
ouses-0.101
ugg-0.098
uts-0.098
 Eve-0.097
Token They
Token ID1119
Feature activation-8.71e-01

Pos logit contributions
pins+0.005
unn+0.005
ouses+0.005
ugg+0.004
grand+0.004
Neg logit contributions
 matter-0.007
 disagreement-0.006
 beginning-0.006
 total-0.005
ho-0.005
Token were
Token ID547
Feature activation-7.88e-03

Pos logit contributions
pins+0.030
ouses+0.028
uts+0.028
 Eve+0.027
unn+0.027
Neg logit contributions
 matter-0.027
 disagreement-0.022
 swoop-0.022
ho-0.022
 beginning-0.022
Token happy
Token ID3772
Feature activation+2.30e-01

Pos logit contributions
pins+0.000
ouses+0.000
uts+0.000
 Eve+0.000
hes+0.000
Neg logit contributions
 matter-0.000
 disagreement-0.000
ho-0.000
 beginning-0.000
 problem-0.000
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
 matter+0.009
 disagreement+0.008
 beginning+0.008
 swoop+0.008
ho+0.008
Neg logit contributions
pins-0.006
ouses-0.005
uts-0.005
ugg-0.005
 Eve-0.005
Token happy
Token ID3772
Feature activation+5.03e-01

Pos logit contributions
 matter+0.004
 disagreement+0.003
 beginning+0.003
ho+0.003
 swoop+0.003
Neg logit contributions
pins-0.003
ouses-0.003
uts-0.003
ugg-0.003
 Eve-0.003
Token and
Token ID290
Feature activation-2.52e-01

Pos logit contributions
 matter+0.025
 disagreement+0.023
 swoop+0.022
ho+0.022
 total+0.021
Neg logit contributions
pins-0.007
uts-0.007
 Eve-0.006
ugg-0.006
ouses-0.006
Token enjoyed
Token ID8359
Feature activation-6.41e-01

Pos logit contributions
pins+0.007
ouses+0.007
ests+0.007
uts+0.006
unn+0.006
Neg logit contributions
 matter-0.009
 beginning-0.008
 disagreement-0.007
ho-0.007
 swoop-0.007
Token eating
Token ID6600
Feature activation+1.53e-01

Pos logit contributions
pins+0.012
 Eve+0.011
 Santa+0.010
uts+0.010
hes+0.010
Neg logit contributions
 matter-0.030
rated-0.025
ah-0.025
urious-0.025
 problem-0.025
Token her
Token ID607
Feature activation-6.25e-01

Pos logit contributions
 matter+0.006
 disagreement+0.005
 beginning+0.005
ho+0.005
 swoop+0.005
Neg logit contributions
pins-0.004
 Eve-0.003
uts-0.003
ouses-0.003
ugg-0.003
Token pie
Token ID2508
Feature activation+3.94e+00

Pos logit contributions
pins+0.017
ouses+0.016
ugg+0.015
uts+0.015
 Eve+0.014
Neg logit contributions
 matter-0.023
ho-0.019
 swoop-0.019
 beginning-0.019
 disagreement-0.019
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
 matter+0.155
 disagreement+0.137
 beginning+0.132
 swoop+0.131
ho+0.129
Neg logit contributions
pins-0.101
ouses-0.092
uts-0.091
 Eve-0.088
ugg-0.086
Token<|endoftext|>
Token ID50256
Feature activation+2.94e-01

Pos logit contributions
pins+0.015
ouses+0.013
unn+0.013
uts+0.012
ests+0.012
Neg logit contributions
 matter-0.018
 beginning-0.016
 disagreement-0.016
ho-0.015
where-0.015
TokenOnce
Token ID7454
Feature activation+2.41e-01

Pos logit contributions
 matter+0.012
 disagreement+0.010
 beginning+0.010
 swoop+0.010
ho+0.010
Neg logit contributions
pins-0.007
ouses-0.007
ugg-0.006
uts-0.006
 Eve-0.006
Token upon
Token ID2402
Feature activation+6.17e-01

Pos logit contributions
 matter+0.005
 beginning+0.004
 swoop+0.004
 disagreement+0.004
ho+0.003
Neg logit contributions
pins-0.010
 Eve-0.010
ouses-0.010
ugg-0.009
uts-0.009
Token a
Token ID257
Feature activation+2.91e-01

Pos logit contributions
 matter+0.023
 beginning+0.020
 disagreement+0.020
 swoop+0.019
ho+0.018
Neg logit contributions
pins-0.017
ouses-0.016
ugg-0.015
unn-0.015
uts-0.015
Token's
Token ID338
Feature activation+4.42e-01

Pos logit contributions
 matter+0.053
 disagreement+0.046
 beginning+0.045
ho+0.044
 swoop+0.044
Neg logit contributions
pins-0.031
ouses-0.027
uts-0.027
ugg-0.026
 Eve-0.026
Token mom
Token ID1995
Feature activation+2.79e-01

Pos logit contributions
 matter+0.022
ah+0.019
where+0.019
 swoop+0.019
 cheating+0.019
Neg logit contributions
pins-0.006
uts-0.005
 knee-0.004
 bed-0.004
 complexion-0.004
Token took
Token ID1718
Feature activation+1.06e+00

Pos logit contributions
 matter+0.013
 disagreement+0.012
 beginning+0.012
where+0.011
 impossible+0.011
Neg logit contributions
pins-0.005
 Eve-0.004
 buck-0.004
washer-0.004
uts-0.004
Token out
Token ID503
Feature activation-9.31e-04

Pos logit contributions
 matter+0.043
 disagreement+0.038
 beginning+0.038
ho+0.037
 problem+0.036
Neg logit contributions
pins-0.025
 Eve-0.022
 Santa-0.019
 buck-0.019
obs-0.018
Token the
Token ID262
Feature activation+3.12e-01

Pos logit contributions
pins+0.000
 Eve+0.000
uts+0.000
ouses+0.000
ugg+0.000
Neg logit contributions
 matter-0.000
 disagreement-0.000
ho-0.000
 beginning-0.000
 swoop-0.000
Token pie
Token ID2508
Feature activation+3.94e+00

Pos logit contributions
 matter+0.014
ah+0.012
 swoop+0.011
ho+0.011
 beginning+0.011
Neg logit contributions
pins-0.006
 milk-0.005
hes-0.005
washer-0.005
 Eve-0.005
Token.
Token ID13
Feature activation-3.77e-01

Pos logit contributions
 matter+0.137
 disagreement+0.118
 beginning+0.117
ho+0.115
 swoop+0.114
Neg logit contributions
pins-0.115
ouses-0.106
ugg-0.104
 Eve-0.101
uts-0.101
Token She
Token ID1375
Feature activation-5.92e-01

Pos logit contributions
ouses+0.010
pins+0.010
unn+0.010
ugg+0.010
 Eve+0.009
Neg logit contributions
 matter-0.013
ho-0.012
 disagreement-0.011
 swoop-0.011
 beginning-0.011
Token put
Token ID1234
Feature activation-9.63e-01

Pos logit contributions
pins+0.019
ouses+0.017
uts+0.017
ugg+0.017
 Eve+0.017
Neg logit contributions
 matter-0.020
 disagreement-0.017
 beginning-0.016
 swoop-0.016
ho-0.016
Token it
Token ID340
Feature activation-3.63e-01

Pos logit contributions
pins+0.023
hes+0.018
 Eve+0.018
aid+0.017
 Santa+0.016
Neg logit contributions
 matter-0.043
 disagreement-0.038
urious-0.036
 quarrel-0.035
 appearance-0.035
Token on
Token ID319
Feature activation-1.03e+00

Pos logit contributions
pins+0.010
ugg+0.008
ouses+0.008
uts+0.008
hes+0.007
Neg logit contributions
 matter-0.015
 disagreement-0.013
 problem-0.012
 beginning-0.012
ho-0.012
Token balloons
Token ID35485
Feature activation+6.95e-01

Pos logit contributions
pins+0.006
ouses+0.006
uts+0.006
 Eve+0.006
ugg+0.005
Neg logit contributions
 matter-0.008
 disagreement-0.007
 beginning-0.007
ho-0.007
 problem-0.007
Token.
Token ID13
Feature activation+3.40e-01

Pos logit contributions
 matter+0.031
 disagreement+0.028
urious+0.026
 problem+0.026
 beginning+0.025
Neg logit contributions
pins-0.016
uts-0.013
ouses-0.012
 Eve-0.011
ugg-0.011
Token He
Token ID679
Feature activation-4.92e-01

Pos logit contributions
 matter+0.012
 disagreement+0.010
 beginning+0.010
 total+0.010
 problem+0.009
Neg logit contributions
pins-0.010
ouses-0.009
unn-0.009
ugg-0.009
grand-0.008
Token threw
Token ID9617
Feature activation+1.56e-01

Pos logit contributions
pins+0.016
ouses+0.015
ugg+0.014
uts+0.014
 Eve+0.014
Neg logit contributions
 matter-0.016
 disagreement-0.013
 beginning-0.013
ho-0.013
 swoop-0.013
Token a
Token ID257
Feature activation+4.95e-01

Pos logit contributions
 matter+0.007
urious+0.006
 disagreement+0.006
 beginning+0.006
ths+0.006
Neg logit contributions
pins-0.003
 Santa-0.003
 buck-0.003
 CDs-0.003
 Eve-0.003
Token pie
Token ID2508
Feature activation+3.94e+00

Pos logit contributions
 matter+0.018
 beginning+0.016
 disagreement+0.016
ho+0.015
ths+0.015
Neg logit contributions
pins-0.014
ouses-0.012
uts-0.012
ugg-0.011
 Eve-0.011
Token at
Token ID379
Feature activation+1.43e-01

Pos logit contributions
 matter+0.155
 disagreement+0.135
 beginning+0.128
 swoop+0.122
 problem+0.120
Neg logit contributions
pins-0.106
uts-0.093
ouses-0.090
ugg-0.089
 Eve-0.087
Token another
Token ID1194
Feature activation-1.95e-01

Pos logit contributions
 matter+0.005
 disagreement+0.004
 beginning+0.004
ho+0.004
 swoop+0.004
Neg logit contributions
pins-0.004
ouses-0.004
uts-0.004
ugg-0.004
 Eve-0.004
Token clown
Token ID25573
Feature activation+9.72e-01

Pos logit contributions
pins+0.004
 Eve+0.004
ouses+0.004
uts+0.004
chair+0.004
Neg logit contributions
 matter-0.008
 disagreement-0.007
 beginning-0.007
ho-0.007
 swoop-0.007
Token.
Token ID13
Feature activation+3.96e-01

Pos logit contributions
 matter+0.044
 disagreement+0.038
 beginning+0.037
 impossible+0.035
 problem+0.035
Neg logit contributions
pins-0.022
uts-0.020
ouses-0.019
ugg-0.018
 Eve-0.017
Token He
Token ID679
Feature activation-4.91e-01

Pos logit contributions
 matter+0.012
 disagreement+0.011
 beginning+0.011
 total+0.011
 swoop+0.010
Neg logit contributions
pins-0.012
unn-0.011
ouses-0.011
ugg-0.010
grand-0.010
Token Tim
Token ID5045
Feature activation+4.35e-01

Pos logit contributions
 matter+0.003
 disagreement+0.003
 beginning+0.003
 swoop+0.003
 problem+0.003
Neg logit contributions
 Eve-0.002
pins-0.002
ugg-0.002
ouses-0.002
 Santa-0.002
Tokenmy
Token ID1820
Feature activation+7.13e-01

Pos logit contributions
 matter+0.012
 disagreement+0.011
 beginning+0.010
ho+0.010
 swoop+0.010
Neg logit contributions
pins-0.016
ugg-0.015
ouses-0.015
 Eve-0.015
uts-0.014
Token a
Token ID257
Feature activation+7.09e-01

Pos logit contributions
 matter+0.027
 disagreement+0.023
 beginning+0.023
 swoop+0.023
ho+0.023
Neg logit contributions
pins-0.019
ouses-0.017
uts-0.016
 Eve-0.016
ugg-0.016
Token warm
Token ID5814
Feature activation+1.18e-01

Pos logit contributions
 matter+0.026
 disagreement+0.022
 beginning+0.022
ho+0.021
 swoop+0.021
Neg logit contributions
pins-0.020
ouses-0.018
ugg-0.018
uts-0.017
 Eve-0.017
Token,
Token ID11
Feature activation+5.74e-01

Pos logit contributions
 matter+0.004
 disagreement+0.004
 beginning+0.004
 swoop+0.004
ho+0.004
Neg logit contributions
pins-0.003
ugg-0.003
ouses-0.003
 Eve-0.003
uts-0.003
Token freshly
Token ID29026
Feature activation+3.93e+00

Pos logit contributions
 matter+0.018
 disagreement+0.015
 beginning+0.015
 swoop+0.014
ho+0.014
Neg logit contributions
pins-0.019
ouses-0.018
ugg-0.017
uts-0.017
 Eve-0.017
Token baked
Token ID22979
Feature activation+1.88e+00

Pos logit contributions
 matter+0.156
 disagreement+0.138
ho+0.133
 beginning+0.132
 impossible+0.128
Neg logit contributions
ugg-0.092
pins-0.090
ouses-0.086
hes-0.084
aid-0.084
Token cookie
Token ID19751
Feature activation+1.97e+00

Pos logit contributions
 matter+0.089
 disagreement+0.080
 beginning+0.077
ho+0.075
 quarrel+0.074
Neg logit contributions
pins-0.034
 Eve-0.031
uts-0.030
hes-0.029
ouses-0.029
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
 matter+0.087
 disagreement+0.076
 beginning+0.075
 swoop+0.070
urious+0.070
Neg logit contributions
pins-0.045
uts-0.042
ouses-0.041
 Eve-0.040
ugg-0.037
Token
Token ID198
Feature activation-8.32e-02

Pos logit contributions
 matter+0.006
 disagreement+0.005
 beginning+0.005
ho+0.005
 swoop+0.005
Neg logit contributions
pins-0.005
ouses-0.004
unn-0.004
uts-0.004
ugg-0.004
Token
Token ID198
Feature activation-2.90e-01

Pos logit contributions
pins+0.002
ouses+0.002
unn+0.002
uts+0.002
ugg+0.002
Neg logit contributions
 matter-0.003
 disagreement-0.003
 swoop-0.003
ho-0.003
 beginning-0.003
Token set
Token ID900
Feature activation+4.64e-01

Pos logit contributions
 matter+0.026
 disagreement+0.022
 beginning+0.021
 problem+0.021
ho+0.021
Neg logit contributions
pins-0.021
ugg-0.018
ouses-0.018
 Eve-0.018
uts-0.017
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
 matter+0.017
 disagreement+0.015
ho+0.014
 beginning+0.014
 swoop+0.014
Neg logit contributions
pins-0.014
ouses-0.011
 Eve-0.011
uts-0.011
ugg-0.011
Token making
Token ID1642
Feature activation+6.76e-01

Pos logit contributions
 matter+0.043
 disagreement+0.037
 beginning+0.036
 swoop+0.035
ho+0.035
Neg logit contributions
pins-0.039
ouses-0.036
uts-0.035
ugg-0.035
 Eve-0.035
Token a
Token ID257
Feature activation+5.23e-01

Pos logit contributions
 matter+0.027
ho+0.024
 disagreement+0.023
 beginning+0.023
 swoop+0.023
Neg logit contributions
pins-0.017
 Eve-0.014
ouses-0.014
uts-0.014
ugg-0.013
Token delicious
Token ID12625
Feature activation+2.42e+00

Pos logit contributions
 matter+0.022
 beginning+0.019
 cheating+0.019
 disagreement+0.019
where+0.018
Neg logit contributions
pins-0.009
 Eve-0.009
ouses-0.008
 bed-0.008
 Santa-0.008
Token pie
Token ID2508
Feature activation+3.91e+00

Pos logit contributions
 matter+0.094
 disagreement+0.087
 beginning+0.084
ho+0.081
 cheating+0.081
Neg logit contributions
pins-0.049
 Eve-0.049
ouses-0.044
ugg-0.044
 complexion-0.043
Token.
Token ID13
Feature activation-3.54e-01

Pos logit contributions
 matter+0.158
 disagreement+0.139
 beginning+0.133
 swoop+0.128
ho+0.125
Neg logit contributions
pins-0.101
uts-0.091
ouses-0.090
 Eve-0.087
 opener-0.079
Token They
Token ID1119
Feature activation-3.26e-01

Pos logit contributions
pins+0.008
ouses+0.007
uts+0.007
ugg+0.007
 Eve+0.007
Neg logit contributions
 matter-0.015
 disagreement-0.013
 beginning-0.013
 swoop-0.013
ho-0.013
Token took
Token ID1718
Feature activation+4.59e-01

Pos logit contributions
pins+0.009
ouses+0.007
 Eve+0.007
ugg+0.007
uts+0.007
Neg logit contributions
 matter-0.013
 disagreement-0.011
 beginning-0.011
 problem-0.011
ho-0.010
Token out
Token ID503
Feature activation-6.43e-01

Pos logit contributions
 matter+0.019
ho+0.017
 beginning+0.016
 disagreement+0.016
 cheating+0.016
Neg logit contributions
pins-0.011
 Eve-0.009
ouses-0.009
hes-0.008
ugg-0.008
Token the
Token ID262
Feature activation-1.93e-01

Pos logit contributions
pins+0.016
ouses+0.013
 Eve+0.013
ugg+0.013
uts+0.013
Neg logit contributions
 matter-0.027
 disagreement-0.023
 beginning-0.023
ho-0.022
 swoop-0.022
Token and
Token ID290
Feature activation-1.11e-01

Pos logit contributions
pins+0.003
ouses+0.002
 Eve+0.002
uts+0.002
ugg+0.002
Neg logit contributions
 matter-0.004
 disagreement-0.004
ho-0.003
 swoop-0.003
 problem-0.003
Token grand
Token ID4490
Feature activation-1.63e+00

Pos logit contributions
pins+0.003
 Eve+0.003
ouses+0.003
uts+0.002
ugg+0.002
Neg logit contributions
 matter-0.004
 disagreement-0.004
 swoop-0.004
ho-0.004
 beginning-0.004
Tokenpa
Token ID8957
Feature activation-4.01e-02

Pos logit contributions
pins+0.033
 Eve+0.030
ouses+0.027
unn+0.026
grand+0.026
Neg logit contributions
 matter-0.070
ho-0.064
 beginning-0.063
 swoop-0.063
 disagreement-0.062
Token taught
Token ID7817
Feature activation-1.14e+00

Pos logit contributions
pins+0.001
ouses+0.001
 Eve+0.001
uts+0.001
ugg+0.001
Neg logit contributions
 matter-0.001
 disagreement-0.001
 beginning-0.001
ho-0.001
 swoop-0.001
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
 Eve+0.035
pins+0.033
 Santa+0.031
ouses+0.030
grand+0.029
Neg logit contributions
 matter-0.036
 beginning-0.034
ho-0.034
 swoop-0.034
 disagreement-0.031
Token grand
Token ID4490
Feature activation-4.18e+00

Pos logit contributions
pins+0.033
 Eve+0.030
ouses+0.030
washer+0.029
grand+0.028
Neg logit contributions
 matter-0.048
ho-0.045
 swoop-0.045
 beginning-0.043
 disagreement-0.043
Tokenchild
Token ID9410
Feature activation-1.28e+00

Pos logit contributions
pins+0.113
ouses+0.102
ugg+0.099
uts+0.098
 Eve+0.095
Neg logit contributions
 matter-0.160
 disagreement-0.137
 beginning-0.134
ho-0.130
 swoop-0.130
Token how
Token ID703
Feature activation+2.04e-01

Pos logit contributions
pins+0.033
ouses+0.031
uts+0.030
 Eve+0.028
ugg+0.028
Neg logit contributions
 matter-0.050
 disagreement-0.044
ho-0.043
 beginning-0.043
 swoop-0.042
Token to
Token ID284
Feature activation-6.19e-02

Pos logit contributions
 matter+0.008
 disagreement+0.007
 beginning+0.007
ho+0.007
 swoop+0.007
Neg logit contributions
pins-0.005
ouses-0.005
uts-0.005
ugg-0.004
 Eve-0.004
Token play
Token ID711
Feature activation-7.67e-01

Pos logit contributions
pins+0.002
ouses+0.002
ugg+0.002
uts+0.002
unn+0.002
Neg logit contributions
 matter-0.002
 disagreement-0.002
 beginning-0.002
 swoop-0.002
ho-0.002
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
pins+0.020
ugg+0.018
unn+0.017
ouses+0.017
uts+0.017
Neg logit contributions
 matter-0.030
 disagreement-0.026
 beginning-0.026
 swoop-0.025
 problem-0.025
TokenThe
Token ID464
Feature activation+3.66e-01

Pos logit contributions
pins+0.016
ouses+0.014
 Eve+0.014
ugg+0.014
uts+0.014
Neg logit contributions
 matter-0.027
 disagreement-0.023
 beginning-0.023
ho-0.022
 swoop-0.022
Token kids
Token ID3988
Feature activation-2.75e-01

Pos logit contributions
 matter+0.014
 swoop+0.013
ho+0.013
 disagreement+0.012
 impossible+0.011
Neg logit contributions
pins-0.010
 Eve-0.009
 CDs-0.008
grand-0.008
 buck-0.008
Token all
Token ID477
Feature activation+8.50e-02

Pos logit contributions
pins+0.007
ugg+0.006
ouses+0.006
 Eve+0.006
uts+0.006
Neg logit contributions
 matter-0.011
 disagreement-0.009
 beginning-0.009
 swoop-0.009
 problem-0.008
Token hugged
Token ID41130
Feature activation-1.13e+00

Pos logit contributions
 matter+0.003
 disagreement+0.003
 beginning+0.003
ho+0.003
 swoop+0.003
Neg logit contributions
pins-0.002
ouses-0.002
ugg-0.002
 Eve-0.002
uts-0.002
Token the
Token ID262
Feature activation-1.58e+00

Pos logit contributions
pins+0.030
ouses+0.027
 Eve+0.026
ugg+0.026
uts+0.026
Neg logit contributions
 matter-0.043
 disagreement-0.037
 beginning-0.037
 swoop-0.036
ho-0.036
Token grand
Token ID4490
Feature activation-4.06e+00

Pos logit contributions
pins+0.035
 Eve+0.031
 CDs+0.028
ouses+0.028
grand+0.027
Neg logit contributions
 matter-0.064
 swoop-0.060
ho-0.058
ah-0.057
 beginning-0.055
Tokenpa
Token ID8957
Feature activation-1.70e+00

Pos logit contributions
pins+0.113
ouses+0.103
ugg+0.100
uts+0.099
 Eve+0.096
Neg logit contributions
 matter-0.152
 disagreement-0.131
 beginning-0.128
ho-0.124
 problem-0.123
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
pins+0.027
ugg+0.022
ouses+0.022
 Eve+0.022
uts+0.020
Neg logit contributions
 matter-0.084
 disagreement-0.076
 swoop-0.073
 beginning-0.072
ho-0.071
Token thanked
Token ID26280
Feature activation-9.97e-01

Pos logit contributions
pins+0.027
ouses+0.025
uts+0.024
ugg+0.024
unn+0.024
Neg logit contributions
 matter-0.039
 disagreement-0.033
 beginning-0.033
 swoop-0.032
ho-0.032
Token him
Token ID683
Feature activation+1.41e-02

Pos logit contributions
pins+0.032
 Eve+0.029
ouses+0.029
uts+0.028
ugg+0.028
Neg logit contributions
 matter-0.032
 disagreement-0.027
 beginning-0.027
ho-0.027
 swoop-0.026
Token for
Token ID329
Feature activation+9.70e-01

Pos logit contributions
 matter+0.001
 disagreement+0.000
 swoop+0.000
ho+0.000
 beginning+0.000
Neg logit contributions
pins-0.000
ouses-0.000
 Eve-0.000
ugg-0.000
uts-0.000
Token asked
Token ID1965
Feature activation-9.70e-01

Pos logit contributions
 matter+0.001
 disagreement+0.001
 beginning+0.001
ho+0.001
 swoop+0.001
Neg logit contributions
pins-0.001
ouses-0.001
 Eve-0.001
uts-0.001
ugg-0.001
Token,
Token ID11
Feature activation+1.21e-01

Pos logit contributions
pins+0.030
ouses+0.027
 Eve+0.027
ugg+0.026
uts+0.026
Neg logit contributions
 matter-0.033
 disagreement-0.028
 beginning-0.027
 swoop-0.027
ho-0.027
Token "
Token ID366
Feature activation+6.34e-01

Pos logit contributions
 matter+0.004
 disagreement+0.004
 beginning+0.003
 swoop+0.003
ho+0.003
Neg logit contributions
pins-0.004
ouses-0.003
 Eve-0.003
ugg-0.003
uts-0.003
TokenWhat
Token ID2061
Feature activation+4.40e-01

Pos logit contributions
 matter+0.016
 disagreement+0.015
 problem+0.015
 impossible+0.013
 decision+0.013
Neg logit contributions
pins-0.023
 Eve-0.021
 Santa-0.020
ugg-0.020
ests-0.020
Token other
Token ID584
Feature activation-1.93e+00

Pos logit contributions
 matter+0.012
 disagreement+0.010
 beginning+0.010
 problem+0.009
ho+0.009
Neg logit contributions
pins-0.017
ouses-0.015
ugg-0.015
 Eve-0.015
uts-0.015
Token planets
Token ID14705
Feature activation-3.96e+00

Pos logit contributions
pins+0.055
ouses+0.050
uts+0.048
 Eve+0.048
ugg+0.048
Neg logit contributions
 matter-0.069
 disagreement-0.060
 beginning-0.059
ho-0.057
 swoop-0.057
Token are
Token ID389
Feature activation-1.03e+00

Pos logit contributions
pins+0.114
ouses+0.102
uts+0.100
 Eve+0.099
ugg+0.098
Neg logit contributions
 matter-0.139
 disagreement-0.121
 beginning-0.119
ho-0.117
 swoop-0.115
Token there
Token ID612
Feature activation-1.12e+00

Pos logit contributions
pins+0.026
ouses+0.023
uts+0.023
 Eve+0.023
unn+0.022
Neg logit contributions
 matter-0.041
 disagreement-0.035
 beginning-0.034
ho-0.034
 swoop-0.033
Token?"
Token ID1701
Feature activation-5.10e-01

Pos logit contributions
pins+0.020
ouses+0.017
uts+0.017
 Eve+0.016
ugg+0.016
Neg logit contributions
 matter-0.052
 disagreement-0.046
 beginning-0.045
 swoop-0.045
ho-0.045
Token Her
Token ID2332
Feature activation+2.71e-01

Pos logit contributions
pins+0.012
ouses+0.010
uts+0.010
unn+0.010
ugg+0.010
Neg logit contributions
 matter-0.021
 disagreement-0.019
 beginning-0.019
ho-0.018
 swoop-0.018
Token mom
Token ID1995
Feature activation-9.95e-01

Pos logit contributions
 matter+0.011
 beginning+0.010
 disagreement+0.010
 swoop+0.010
ho+0.010
Neg logit contributions
pins-0.006
ouses-0.005
uts-0.005
 Eve-0.005
ugg-0.005
Token
Token ID220
Feature activation+9.45e-01

Pos logit contributions
 matter+0.007
 disagreement+0.006
 beginning+0.006
ho+0.006
 swoop+0.006
Neg logit contributions
pins-0.004
ouses-0.004
ugg-0.004
unn-0.004
uts-0.004
Token
Token ID198
Feature activation+5.19e-02

Pos logit contributions
 matter+0.034
 disagreement+0.031
 beginning+0.031
ho+0.030
 swoop+0.030
Neg logit contributions
pins-0.024
unn-0.020
ouses-0.020
ugg-0.020
 complexion-0.020
Token
Token ID198
Feature activation-1.68e-01

Pos logit contributions
 matter+0.002
 disagreement+0.002
 beginning+0.002
 swoop+0.002
ho+0.002
Neg logit contributions
pins-0.001
ouses-0.001
ugg-0.001
uts-0.001
 Eve-0.001
TokenWhen
Token ID2215
Feature activation+4.15e-01

Pos logit contributions
pins+0.005
 Eve+0.004
ouses+0.004
uts+0.004
ugg+0.004
Neg logit contributions
 matter-0.006
 beginning-0.006
 disagreement-0.006
 problem-0.006
ho-0.005
Token the
Token ID262
Feature activation+3.94e-01

Pos logit contributions
 matter+0.019
urious+0.018
 swoop+0.017
 beginning+0.017
 disagreement+0.017
Neg logit contributions
pins-0.008
 Eve-0.008
 Santa-0.007
ugg-0.005
 Teddy-0.005
Token suitcase
Token ID45391
Feature activation-3.88e+00

Pos logit contributions
 matter+0.015
 swoop+0.013
urious+0.013
ho+0.013
 disagreement+0.013
Neg logit contributions
pins-0.009
 Eve-0.009
ugg-0.008
 Santa-0.008
grand-0.008
Token was
Token ID373
Feature activation-3.94e-01

Pos logit contributions
pins+0.137
ouses+0.125
 Eve+0.124
ugg+0.124
uts+0.123
Neg logit contributions
 matter-0.115
 disagreement-0.094
 beginning-0.092
 swoop-0.088
ho-0.087
Token ready
Token ID3492
Feature activation-7.56e-01

Pos logit contributions
pins+0.013
 Eve+0.012
ouses+0.012
ugg+0.011
uts+0.011
Neg logit contributions
 matter-0.012
 disagreement-0.010
ho-0.010
 beginning-0.010
 swoop-0.010
Token,
Token ID11
Feature activation-3.98e-01

Pos logit contributions
pins+0.013
 Eve+0.012
uts+0.010
ugg+0.010
ouses+0.009
Neg logit contributions
 matter-0.037
 disagreement-0.032
 swoop-0.032
 beginning-0.032
urious-0.031
Token the
Token ID262
Feature activation+2.36e-01

Pos logit contributions
pins+0.009
 Eve+0.008
ouses+0.008
uts+0.007
 Santa+0.007
Neg logit contributions
 matter-0.017
 disagreement-0.015
 beginning-0.015
ho-0.015
 swoop-0.015
Token mom
Token ID1995
Feature activation-1.01e+00

Pos logit contributions
 matter+0.010
ho+0.009
 disagreement+0.009
 swoop+0.009
 beginning+0.008
Neg logit contributions
pins-0.005
 Eve-0.005
ouses-0.004
 Santa-0.004
ugg-0.004
Token you
Token ID345
Feature activation-6.04e-01

Pos logit contributions
 matter+0.002
 beginning+0.002
 disagreement+0.002
 swoop+0.002
ho+0.002
Neg logit contributions
pins-0.001
uts-0.001
 Eve-0.001
ugg-0.001
ouses-0.001
Token want
Token ID765
Feature activation-6.95e-01

Pos logit contributions
pins+0.021
ouses+0.020
uts+0.019
ugg+0.019
 Eve+0.019
Neg logit contributions
 matter-0.019
 disagreement-0.015
 beginning-0.015
 swoop-0.015
ho-0.014
Token to
Token ID284
Feature activation-1.05e+00

Pos logit contributions
pins+0.025
ouses+0.023
uts+0.022
ugg+0.021
 Eve+0.021
Neg logit contributions
 matter-0.021
 disagreement-0.017
 beginning-0.017
 swoop-0.016
ho-0.016
Token see
Token ID766
Feature activation-1.61e+00

Pos logit contributions
pins+0.041
ouses+0.040
ugg+0.039
uts+0.038
unn+0.038
Neg logit contributions
 matter-0.028
 disagreement-0.022
 swoop-0.022
 beginning-0.021
 quarrel-0.020
Token the
Token ID262
Feature activation-9.17e-01

Pos logit contributions
pins+0.059
ouses+0.054
ugg+0.053
uts+0.053
 Eve+0.053
Neg logit contributions
 matter-0.046
 disagreement-0.038
 beginning-0.037
 swoop-0.036
ho-0.035
Token attic
Token ID46113
Feature activation-3.85e+00

Pos logit contributions
 bed+0.016
 Santa+0.015
 sticker+0.015
 buck+0.015
 sofa+0.015
Neg logit contributions
ths-0.036
rated-0.035
 inspection-0.034
 matter-0.034
urious-0.034
Token,
Token ID11
Feature activation-4.22e-01

Pos logit contributions
pins+0.096
 Eve+0.084
ouses+0.082
ugg+0.080
uts+0.079
Neg logit contributions
 matter-0.153
 disagreement-0.135
 beginning-0.129
 swoop-0.129
ho-0.127
Token Nib
Token ID49532
Feature activation-4.85e-01

Pos logit contributions
pins+0.009
 Eve+0.008
uts+0.007
ouses+0.007
 Santa+0.007
Neg logit contributions
 matter-0.018
 disagreement-0.017
 beginning-0.016
ho-0.015
 swoop-0.015
Tokenbles
Token ID7689
Feature activation-5.18e-01

Pos logit contributions
pins+0.011
ouses+0.011
uts+0.010
unn+0.009
ugg+0.009
Neg logit contributions
 matter-0.021
 disagreement-0.019
 beginning-0.018
 swoop-0.017
ho-0.017
Token?
Token ID30
Feature activation+4.89e-02

Pos logit contributions
pins+0.008
ugg+0.006
uts+0.006
ouses+0.006
 Eve+0.005
Neg logit contributions
 matter-0.026
 disagreement-0.023
 beginning-0.023
ths-0.022
 swoop-0.021
Token There
Token ID1318
Feature activation-7.13e-02

Pos logit contributions
 matter+0.001
ho+0.001
 beginning+0.001
 swoop+0.001
 disagreement+0.001
Neg logit contributions
pins-0.002
ouses-0.001
 Eve-0.001
uts-0.001
 complexion-0.001
Token.
Token ID13
Feature activation-3.77e-01

Pos logit contributions
 matter+0.029
 disagreement+0.025
 beginning+0.025
 swoop+0.024
ho+0.024
Neg logit contributions
pins-0.020
ouses-0.019
uts-0.018
 Eve-0.018
ugg-0.018
Token She
Token ID1375
Feature activation-3.73e-01

Pos logit contributions
unn+0.011
pins+0.010
ouses+0.010
ugg+0.010
ests+0.010
Neg logit contributions
 matter-0.013
 beginning-0.011
 disagreement-0.010
 total-0.010
What-0.010
Token wrote
Token ID2630
Feature activation-6.39e-01

Pos logit contributions
pins+0.011
ouses+0.010
uts+0.010
ugg+0.010
unn+0.009
Neg logit contributions
 matter-0.013
 swoop-0.011
 disagreement-0.011
 beginning-0.011
ho-0.011
Token about
Token ID546
Feature activation-1.05e+00

Pos logit contributions
pins+0.017
ouses+0.015
uts+0.015
 Eve+0.015
ugg+0.014
Neg logit contributions
 matter-0.025
 disagreement-0.021
ho-0.021
 beginning-0.020
 swoop-0.020
Token how
Token ID703
Feature activation-6.52e-02

Pos logit contributions
pins+0.026
 Eve+0.024
 Santa+0.023
uts+0.022
ouses+0.022
Neg logit contributions
 matter-0.040
ho-0.036
 disagreement-0.036
 swoop-0.035
 beginning-0.035
Token grand
Token ID4490
Feature activation-3.82e+00

Pos logit contributions
pins+0.002
ouses+0.002
uts+0.002
ugg+0.002
 Eve+0.002
Neg logit contributions
 matter-0.003
 disagreement-0.002
 beginning-0.002
 swoop-0.002
ho-0.002
Tokenbaby
Token ID40252
Feature activation-1.08e+00

Pos logit contributions
pins+0.115
ouses+0.105
ugg+0.100
uts+0.099
unn+0.095
Neg logit contributions
 matter-0.136
 disagreement-0.112
 beginning-0.110
 problem-0.106
ho-0.106
Token is
Token ID318
Feature activation-7.09e-01

Pos logit contributions
pins+0.035
ouses+0.032
uts+0.031
ugg+0.031
 Eve+0.031
Neg logit contributions
 matter-0.035
 disagreement-0.030
 beginning-0.029
ho-0.028
 swoop-0.028
Token always
Token ID1464
Feature activation-1.06e+00

Pos logit contributions
pins+0.020
ouses+0.018
uts+0.018
 Eve+0.017
ugg+0.016
Neg logit contributions
 matter-0.026
 disagreement-0.023
ho-0.022
 swoop-0.022
 beginning-0.022
Token smiling
Token ID16755
Feature activation+2.44e-01

Pos logit contributions
pins+0.028
ouses+0.026
 Eve+0.026
 buck+0.024
uts+0.023
Neg logit contributions
 matter-0.036
ho-0.036
 disagreement-0.033
 swoop-0.032
ah-0.031
Token,
Token ID11
Feature activation+2.69e-02

Pos logit contributions
 matter+0.009
 disagreement+0.008
 beginning+0.008
 swoop+0.008
ho+0.008
Neg logit contributions
pins-0.006
ouses-0.006
uts-0.006
ugg-0.006
 Eve-0.006
TokenPlan
Token ID20854
Feature activation+3.33e-02

Pos logit contributions
pins+0.015
ouses+0.014
ugg+0.013
uts+0.013
 Eve+0.013
Neg logit contributions
 matter-0.011
 disagreement-0.009
 beginning-0.008
ho-0.008
 swoop-0.008
Tokenets
Token ID1039
Feature activation-1.19e+00

Pos logit contributions
 matter+0.001
 disagreement+0.001
 beginning+0.001
ho+0.001
 swoop+0.001
Neg logit contributions
pins-0.001
ouses-0.001
ugg-0.001
 Eve-0.001
uts-0.001
Token?
Token ID30
Feature activation-7.01e-02

Pos logit contributions
pins+0.029
 Eve+0.025
uts+0.024
ouses+0.024
unn+0.023
Neg logit contributions
 matter-0.048
 disagreement-0.043
 beginning-0.042
ho-0.041
 swoop-0.040
Token What
Token ID1867
Feature activation+6.19e-01

Pos logit contributions
pins+0.002
ouses+0.002
ugg+0.002
 complexion+0.002
unn+0.002
Neg logit contributions
 matter-0.002
 beginning-0.002
ho-0.002
 disagreement-0.002
 swoop-0.002
Token are
Token ID389
Feature activation-7.69e-01

Pos logit contributions
 matter+0.018
 disagreement+0.014
 beginning+0.014
 problem+0.014
 swoop+0.013
Neg logit contributions
pins-0.023
ugg-0.022
ouses-0.021
 Eve-0.020
uts-0.020
Token planets
Token ID14705
Feature activation-3.79e+00

Pos logit contributions
pins+0.025
ouses+0.023
ugg+0.023
uts+0.022
 Eve+0.022
Neg logit contributions
 matter-0.024
 disagreement-0.020
 beginning-0.020
ho-0.020
 swoop-0.020
Token like
Token ID588
Feature activation+1.24e-02

Pos logit contributions
pins+0.083
ouses+0.072
uts+0.070
 Eve+0.069
ugg+0.067
Neg logit contributions
 matter-0.161
 disagreement-0.143
 beginning-0.141
ho-0.139
 swoop-0.137
Token?"
Token ID1701
Feature activation-8.74e-01

Pos logit contributions
 matter+0.000
 disagreement+0.000
 beginning+0.000
ho+0.000
 swoop+0.000
Neg logit contributions
pins-0.000
ouses-0.000
 Eve-0.000
uts-0.000
ugg-0.000
Token Ben
Token ID3932
Feature activation-7.24e-01

Pos logit contributions
pins+0.026
ouses+0.024
ugg+0.024
uts+0.023
ests+0.022
Neg logit contributions
 matter-0.030
 disagreement-0.027
 beginning-0.026
 swoop-0.025
ho-0.025
Token asked
Token ID1965
Feature activation-3.87e-01

Pos logit contributions
pins+0.020
ouses+0.018
ugg+0.017
uts+0.016
 Eve+0.016
Neg logit contributions
 matter-0.028
 disagreement-0.024
 beginning-0.024
 swoop-0.023
ho-0.023
Token.
Token ID13
Feature activation+5.96e-01

Pos logit contributions
pins+0.008
ouses+0.008
 Eve+0.007
uts+0.007
unn+0.006
Neg logit contributions
 matter-0.017
 disagreement-0.015
 beginning-0.014
 swoop-0.014
ho-0.014
Token rounded
Token ID19273
Feature activation-1.07e+00

Pos logit contributions
pins+0.017
ugg+0.015
 Eve+0.015
ouses+0.015
uts+0.015
Neg logit contributions
 matter-0.015
 disagreement-0.013
ho-0.012
 beginning-0.012
 swoop-0.012
Token the
Token ID262
Feature activation-4.11e-01

Pos logit contributions
pins+0.038
uts+0.038
unn+0.036
ouses+0.036
ugg+0.034
Neg logit contributions
 matter-0.031
 beginning-0.028
 disagreement-0.026
 swoop-0.026
 problem-0.025
Token corner
Token ID5228
Feature activation-1.95e+00

Pos logit contributions
uts+0.015
unn+0.014
ouses+0.014
pins+0.013
ests+0.013
Neg logit contributions
 beginning-0.013
 matter-0.012
 end-0.011
 disagreement-0.010
 problem-0.010
Token of
Token ID286
Feature activation+4.09e-01

Pos logit contributions
pins+0.036
 Eve+0.031
ugg+0.030
ouses+0.029
uts+0.029
Neg logit contributions
 matter-0.090
 disagreement-0.077
 swoop-0.076
 beginning-0.075
ho-0.073
Token the
Token ID262
Feature activation-7.31e-01

Pos logit contributions
 matter+0.014
 beginning+0.012
 disagreement+0.012
 swoop+0.011
ho+0.011
Neg logit contributions
pins-0.013
ouses-0.012
uts-0.012
ugg-0.012
unn-0.011
Token harbour
Token ID39720
Feature activation-3.77e+00

Pos logit contributions
uts+0.025
unn+0.023
ouses+0.023
ugg+0.023
ests+0.022
Neg logit contributions
 beginning-0.023
 matter-0.023
 problem-0.021
 end-0.020
 disagreement-0.020
Token,
Token ID11
Feature activation-1.25e+00

Pos logit contributions
pins+0.086
ouses+0.072
 Eve+0.072
ugg+0.070
uts+0.069
Neg logit contributions
 matter-0.161
 disagreement-0.140
 swoop-0.134
 beginning-0.134
ho-0.132
Token it
Token ID340
Feature activation+1.02e-01

Pos logit contributions
pins+0.034
 Eve+0.030
ouses+0.029
uts+0.028
ugg+0.027
Neg logit contributions
 matter-0.047
 disagreement-0.041
ho-0.040
 beginning-0.039
 swoop-0.039
Token was
Token ID373
Feature activation+4.49e-02

Pos logit contributions
 matter+0.003
 disagreement+0.003
ho+0.003
 beginning+0.003
 swoop+0.003
Neg logit contributions
pins-0.003
ouses-0.003
uts-0.003
 Eve-0.003
ugg-0.003
Token surprised
Token ID6655
Feature activation-5.53e-01

Pos logit contributions
 matter+0.001
ho+0.001
 disagreement+0.001
 swoop+0.001
 forth+0.001
Neg logit contributions
pins-0.001
 Eve-0.001
uts-0.001
ugg-0.001
ouses-0.001
Token to
Token ID284
Feature activation+5.71e-01

Pos logit contributions
uts+0.006
pins+0.005
 and+0.005
obs+0.004
 of+0.004
Neg logit contributions
 matter-0.027
 disagreement-0.026
 swoop-0.026
 total-0.026
 End-0.025
Token others
Token ID1854
Feature activation-7.65e-01

Pos logit contributions
 matter+0.018
ho+0.017
ah+0.016
 disagreement+0.016
 beginning+0.016
Neg logit contributions
pins-0.017
 Eve-0.015
uts-0.015
ouses-0.015
 Santa-0.014
Token.
Token ID13
Feature activation+3.21e-02

Pos logit contributions
pins+0.022
ouses+0.020
ugg+0.019
uts+0.019
 Eve+0.019
Neg logit contributions
 matter-0.028
 disagreement-0.024
 beginning-0.023
ho-0.023
 swoop-0.023
Token That
Token ID1320
Feature activation+1.36e+00

Pos logit contributions
 matter+0.001
 beginning+0.001
What+0.001
where+0.001
 End+0.001
Neg logit contributions
pins-0.001
ouses-0.001
 complexion-0.001
washer-0.001
uts-0.001
Token is
Token ID318
Feature activation-3.59e-01

Pos logit contributions
 matter+0.042
 swoop+0.036
 disagreement+0.035
 beginning+0.035
ho+0.035
Neg logit contributions
pins-0.043
 Eve-0.040
uts-0.038
ugg-0.038
ouses-0.038
Token what
Token ID644
Feature activation+9.15e-01

Pos logit contributions
pins+0.010
 Santa+0.009
 Eve+0.008
hes+0.008
aid+0.008
Neg logit contributions
 matter-0.012
ho-0.012
 disagreement-0.012
 swoop-0.011
 beginning-0.011
Token grand
Token ID4490
Feature activation-3.70e+00

Pos logit contributions
 matter+0.034
 beginning+0.030
 disagreement+0.030
 swoop+0.030
ho+0.029
Neg logit contributions
pins-0.024
uts-0.022
ouses-0.021
 Eve-0.021
unn-0.021
Tokenpa
Token ID8957
Feature activation-1.79e+00

Pos logit contributions
pins+0.100
ouses+0.090
ugg+0.087
uts+0.086
 Eve+0.086
Neg logit contributions
 matter-0.141
 disagreement-0.122
 beginning-0.120
ho-0.117
 swoop-0.117
Token taught
Token ID7817
Feature activation-1.71e+00

Pos logit contributions
pins+0.060
ouses+0.055
ugg+0.053
uts+0.053
 Eve+0.053
Neg logit contributions
 matter-0.056
 disagreement-0.047
 beginning-0.046
ho-0.045
 swoop-0.045
Token me
Token ID502
Feature activation-1.98e+00

Pos logit contributions
pins+0.060
 Eve+0.059
uts+0.056
unn+0.055
ouses+0.054
Neg logit contributions
 matter-0.046
 beginning-0.041
 disagreement-0.040
 swoop-0.039
ho-0.039
Token.
Token ID13
Feature activation+2.77e-01

Pos logit contributions
pins+0.055
ouses+0.049
ugg+0.048
 Eve+0.048
uts+0.047
Neg logit contributions
 matter-0.072
 beginning-0.062
 disagreement-0.061
ho-0.059
 problem-0.059
Token That
Token ID1320
Feature activation+1.33e+00

Pos logit contributions
 matter+0.007
What+0.006
 beginning+0.006
Where+0.006
Time+0.006
Neg logit contributions
pins-0.010
 complexion-0.009
ouses-0.009
unn-0.008
ugg-0.008
Token seek
Token ID5380
Feature activation-3.12e-01

Pos logit contributions
pins+0.017
ouses+0.016
 Eve+0.015
uts+0.014
ugg+0.014
Neg logit contributions
 matter-0.023
 disagreement-0.020
 beginning-0.020
 problem-0.019
ho-0.019
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
pins+0.008
ouses+0.007
uts+0.007
 Eve+0.007
ugg+0.006
Neg logit contributions
 matter-0.012
 disagreement-0.011
ho-0.011
 beginning-0.010
 swoop-0.010
Token Tom
Token ID4186
Feature activation-1.79e-01

Pos logit contributions
pins+0.004
 Eve+0.003
ouses+0.003
uts+0.003
ugg+0.003
Neg logit contributions
 matter-0.010
 swoop-0.009
ho-0.009
 disagreement-0.009
 beginning-0.008
Token was
Token ID373
Feature activation-5.01e-01

Pos logit contributions
pins+0.006
ouses+0.006
 Eve+0.006
uts+0.006
ugg+0.005
Neg logit contributions
 matter-0.005
 disagreement-0.005
 beginning-0.004
ho-0.004
 swoop-0.004
Token the
Token ID262
Feature activation-8.27e-01

Pos logit contributions
pins+0.012
ouses+0.011
 Eve+0.011
 buck+0.011
hes+0.011
Neg logit contributions
 matter-0.019
ho-0.017
 disagreement-0.017
 swoop-0.016
 problem-0.016
Token seeker
Token ID45993
Feature activation-3.68e+00

Pos logit contributions
pins+0.016
 Eve+0.013
 buck+0.013
washer+0.013
ugg+0.013
Neg logit contributions
 matter-0.034
 disagreement-0.032
 inspection-0.031
 swoop-0.031
rated-0.030
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
pins+0.078
ouses+0.067
uts+0.066
 Eve+0.065
ugg+0.064
Neg logit contributions
 matter-0.161
 disagreement-0.143
 beginning-0.140
 swoop-0.137
ho-0.137
Token Lily
Token ID20037
Feature activation+7.59e-01

Pos logit contributions
 matter+0.048
 disagreement+0.042
 beginning+0.041
ho+0.040
 swoop+0.040
Neg logit contributions
pins-0.034
ouses-0.030
ugg-0.029
uts-0.029
 Eve-0.029
Token was
Token ID373
Feature activation+7.07e-01

Pos logit contributions
 matter+0.020
 disagreement+0.016
 swoop+0.016
 beginning+0.016
ho+0.015
Neg logit contributions
pins-0.029
ouses-0.027
ugg-0.027
uts-0.026
unn-0.026
Token the
Token ID262
Feature activation+3.56e-02

Pos logit contributions
 matter+0.029
 disagreement+0.025
ho+0.025
 swoop+0.024
 problem+0.024
Neg logit contributions
pins-0.016
 Eve-0.015
ouses-0.014
uts-0.014
hes-0.014
Token h
Token ID289
Feature activation-3.54e-01

Pos logit contributions
 matter+0.001
ho+0.001
 disagreement+0.001
 swoop+0.001
heit+0.001
Neg logit contributions
pins-0.001
 Eve-0.001
 buck-0.001
 bed-0.001
washer-0.001
Token and
Token ID290
Feature activation+9.70e-01

Pos logit contributions
pins+0.013
uts+0.011
ouses+0.010
 Eve+0.010
ugg+0.008
Neg logit contributions
 matter-0.033
 swoop-0.031
ho-0.030
 disagreement-0.030
 total-0.029
Token asked
Token ID1965
Feature activation-1.77e-01

Pos logit contributions
 matter+0.041
 total+0.039
 perspective+0.039
where+0.039
ah+0.038
Neg logit contributions
pins-0.014
 thanked-0.012
 The-0.012
 cried-0.011
 Eve-0.011
Token,
Token ID11
Feature activation+8.89e-01

Pos logit contributions
pins+0.004
ouses+0.003
 Eve+0.003
uts+0.003
ugg+0.003
Neg logit contributions
 matter-0.008
 disagreement-0.007
 swoop-0.007
 beginning-0.007
ho-0.006
Token "
Token ID366
Feature activation+9.95e-01

Pos logit contributions
 matter+0.035
 disagreement+0.030
ho+0.029
 swoop+0.029
 beginning+0.028
Neg logit contributions
pins-0.024
 Eve-0.020
uts-0.020
ouses-0.019
ugg-0.018
TokenWhat
Token ID2061
Feature activation+9.77e-01

Pos logit contributions
 matter+0.025
 problem+0.022
 disagreement+0.022
 swoop+0.020
 impossible+0.018
Neg logit contributions
pins-0.039
uts-0.034
ouses-0.033
ugg-0.033
grand-0.032
Token papers
Token ID9473
Feature activation-3.68e+00

Pos logit contributions
 matter+0.027
 disagreement+0.021
 beginning+0.020
 problem+0.020
 quarrel+0.019
Neg logit contributions
pins-0.038
ugg-0.036
ouses-0.035
 Eve-0.034
unn-0.034
Token?"
Token ID1701
Feature activation-4.30e-01

Pos logit contributions
pins+0.115
ouses+0.105
ugg+0.103
uts+0.102
 Eve+0.101
Neg logit contributions
 matter-0.123
 disagreement-0.104
 beginning-0.103
 swoop-0.099
ho-0.099
Token
Token ID198
Feature activation+3.67e-01

Pos logit contributions
pins+0.010
ouses+0.009
ugg+0.008
uts+0.008
 Eve+0.008
Neg logit contributions
 matter-0.018
 disagreement-0.016
 beginning-0.016
ho-0.016
 swoop-0.015
Token
Token ID198
Feature activation+2.10e-01

Pos logit contributions
 matter+0.013
 disagreement+0.012
ho+0.012
 swoop+0.012
 beginning+0.012
Neg logit contributions
pins-0.010
ouses-0.008
unn-0.008
ugg-0.008
 Eve-0.008
TokenM
Token ID44
Feature activation+5.19e-01

Pos logit contributions
 matter+0.007
 disagreement+0.006
 beginning+0.006
ho+0.006
 swoop+0.006
Neg logit contributions
pins-0.007
ouses-0.006
ugg-0.006
uts-0.006
 Eve-0.006
Tokenum
Token ID388
Feature activation-9.42e-02

Pos logit contributions
 matter+0.018
 disagreement+0.015
 beginning+0.015
 swoop+0.015
ho+0.015
Neg logit contributions
pins-0.016
ouses-0.014
ugg-0.014
uts-0.014
 Eve-0.014
Token park
Token ID3952
Feature activation+5.79e-01

Pos logit contributions
 Eve+0.001
pins+0.001
 bed+0.001
 Santa+0.001
ouses+0.001
Neg logit contributions
 matter-0.002
ho-0.001
 swoop-0.001
 disagreement-0.001
ah-0.001
Token.
Token ID13
Feature activation-2.71e-01

Pos logit contributions
 matter+0.031
 disagreement+0.027
Uh+0.026
 problem+0.026
 swoop+0.026
Neg logit contributions
pins-0.007
ouses-0.007
uts-0.006
 Eve-0.006
."-0.005
Token Sara
Token ID24799
Feature activation-1.40e+00

Pos logit contributions
pins+0.006
ouses+0.005
ugg+0.005
 Eve+0.005
uts+0.005
Neg logit contributions
 matter-0.012
 disagreement-0.010
 beginning-0.010
 swoop-0.010
ho-0.010
Token is
Token ID318
Feature activation-2.10e-02

Pos logit contributions
pins+0.052
ouses+0.048
ugg+0.047
uts+0.047
 Eve+0.046
Neg logit contributions
 matter-0.038
 disagreement-0.032
 beginning-0.031
 swoop-0.030
ho-0.030
Token the
Token ID262
Feature activation-8.35e-01

Pos logit contributions
pins+0.001
 Eve+0.000
ouses+0.000
uts+0.000
 buck+0.000
Neg logit contributions
 matter-0.001
ho-0.001
 disagreement-0.001
 swoop-0.001
 beginning-0.001
Token seeker
Token ID45993
Feature activation-3.65e+00

Pos logit contributions
pins+0.012
 Eve+0.010
robe+0.010
 CDs+0.010
 buck+0.009
Neg logit contributions
 matter-0.037
 disagreement-0.035
 swoop-0.035
ths-0.034
 inspection-0.034
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
pins+0.073
uts+0.063
ouses+0.061
 Eve+0.061
ugg+0.058
Neg logit contributions
 matter-0.166
 disagreement-0.148
 beginning-0.144
 swoop-0.141
ho-0.139
Token Ben
Token ID3932
Feature activation-3.23e-01

Pos logit contributions
 matter+0.049
 disagreement+0.043
 beginning+0.042
 swoop+0.041
ho+0.040
Neg logit contributions
pins-0.029
ouses-0.026
ugg-0.025
uts-0.025
 Eve-0.024
Token is
Token ID318
Feature activation+1.23e+00

Pos logit contributions
pins+0.011
ouses+0.011
uts+0.010
 Eve+0.010
ugg+0.010
Neg logit contributions
 matter-0.009
 disagreement-0.008
 beginning-0.008
ho-0.007
 swoop-0.007
Token the
Token ID262
Feature activation+1.31e-02

Pos logit contributions
 matter+0.053
 disagreement+0.047
ho+0.046
 swoop+0.044
 beginning+0.044
Neg logit contributions
pins-0.026
 Eve-0.024
ouses-0.023
uts-0.022
 buck-0.020
Token h
Token ID289
Feature activation-4.02e-01

Pos logit contributions
 matter+0.000
ho+0.000
ering+0.000
cking+0.000
 disagreement+0.000
Neg logit contributions
 bed-0.000
 Eve-0.000
 sofa-0.000
 buck-0.000
robe-0.000
Token park
Token ID3952
Feature activation+5.04e-01

Pos logit contributions
 Eve+0.002
pins+0.002
ouses+0.002
ugg+0.002
 Santa+0.002
Neg logit contributions
 matter-0.004
ho-0.004
 swoop-0.003
 disagreement-0.003
ah-0.003
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
 matter+0.027
 disagreement+0.024
Uh+0.023
 swoop+0.023
 problem+0.022
Neg logit contributions
pins-0.006
uts-0.006
ouses-0.005
 Eve-0.005
."-0.004
Token Lily
Token ID20037
Feature activation-3.96e-01

Pos logit contributions
pins+0.005
 Eve+0.004
ouses+0.004
uts+0.003
ugg+0.003
Neg logit contributions
 matter-0.014
 swoop-0.013
ho-0.013
 disagreement-0.012
 beginning-0.012
Token was
Token ID373
Feature activation-4.16e-01

Pos logit contributions
pins+0.014
ouses+0.013
ugg+0.013
uts+0.013
 Eve+0.013
Neg logit contributions
 matter-0.011
 disagreement-0.009
 beginning-0.009
 swoop-0.009
ho-0.009
Token the
Token ID262
Feature activation-8.37e-01

Pos logit contributions
pins+0.011
 Eve+0.010
ouses+0.009
uts+0.009
 buck+0.009
Neg logit contributions
 matter-0.016
ho-0.014
 disagreement-0.014
 swoop-0.013
 problem-0.013
Token seeker
Token ID45993
Feature activation-3.64e+00

Pos logit contributions
pins+0.017
 Eve+0.015
 elderly+0.014
 CDs+0.014
 buck+0.014
Neg logit contributions
 matter-0.032
 inspection-0.031
 disagreement-0.031
 swoop-0.030
rated-0.030
Token and
Token ID290
Feature activation+9.43e-01

Pos logit contributions
pins+0.071
uts+0.061
 Eve+0.059
ouses+0.058
ugg+0.055
Neg logit contributions
 matter-0.166
 disagreement-0.149
 beginning-0.145
 swoop-0.142
ho-0.139
Token Tom
Token ID4186
Feature activation+8.81e-01

Pos logit contributions
 matter+0.037
 disagreement+0.032
 beginning+0.032
 swoop+0.031
ho+0.031
Neg logit contributions
pins-0.024
ouses-0.022
uts-0.021
ugg-0.021
 Eve-0.021
Token was
Token ID373
Feature activation+7.06e-01

Pos logit contributions
 matter+0.029
 disagreement+0.025
 beginning+0.024
ho+0.023
 problem+0.022
Neg logit contributions
pins-0.027
ouses-0.026
 Eve-0.026
uts-0.025
ugg-0.024
Token the
Token ID262
Feature activation+1.15e-02

Pos logit contributions
 matter+0.031
 disagreement+0.028
ho+0.027
 problem+0.026
 swoop+0.026
Neg logit contributions
pins-0.013
 Eve-0.013
ouses-0.012
uts-0.012
hes-0.011
Token h
Token ID289
Feature activation-3.38e-01

Pos logit contributions
ho+0.000
 matter+0.000
 disagreement+0.000
 swoop+0.000
heit+0.000
Neg logit contributions
 Eve-0.000
pins-0.000
 buck-0.000
 bed-0.000
 CDs-0.000
Token lunch
Token ID9965
Feature activation-1.11e+00

Pos logit contributions
 matter+0.029
 disagreement+0.025
 beginning+0.025
ho+0.024
 swoop+0.024
Neg logit contributions
pins-0.019
ouses-0.017
ugg-0.017
uts-0.016
 Eve-0.016
Token,
Token ID11
Feature activation-5.55e-01

Pos logit contributions
pins+0.031
ouses+0.028
uts+0.026
ugg+0.026
 Eve+0.026
Neg logit contributions
 matter-0.041
 disagreement-0.036
 beginning-0.035
ho-0.035
 problem-0.034
Token he
Token ID339
Feature activation-4.29e-01

Pos logit contributions
pins+0.016
unn+0.016
ouses+0.016
ugg+0.016
uts+0.015
Neg logit contributions
 matter-0.019
 beginning-0.017
 disagreement-0.016
ho-0.015
 problem-0.015
Token handed
Token ID10158
Feature activation-4.81e-01

Pos logit contributions
pins+0.015
ouses+0.014
uts+0.014
ugg+0.014
 Eve+0.013
Neg logit contributions
 matter-0.012
 disagreement-0.011
 beginning-0.010
 swoop-0.010
ho-0.010
Token each
Token ID1123
Feature activation-1.42e+00

Pos logit contributions
pins+0.019
ouses+0.018
uts+0.018
ugg+0.018
unn+0.018
Neg logit contributions
 matter-0.012
 beginning-0.010
 disagreement-0.009
ho-0.009
 swoop-0.009
Token grand
Token ID4490
Feature activation-3.61e+00

Pos logit contributions
pins+0.043
ouses+0.039
ugg+0.038
uts+0.038
 Eve+0.038
Neg logit contributions
 matter-0.048
 disagreement-0.041
 beginning-0.041
ho-0.039
 swoop-0.039
Tokenkid
Token ID38439
Feature activation-1.14e+00

Pos logit contributions
pins+0.091
ouses+0.081
ugg+0.079
uts+0.078
 Eve+0.076
Neg logit contributions
 matter-0.144
 disagreement-0.125
 beginning-0.122
ho-0.119
 swoop-0.119
Token a
Token ID257
Feature activation-1.59e-01

Pos logit contributions
pins+0.033
unn+0.028
 Eve+0.028
ouses+0.028
ugg+0.028
Neg logit contributions
 matter-0.039
 disagreement-0.034
 beginning-0.034
 swoop-0.034
ho-0.032
Token special
Token ID2041
Feature activation-7.78e-01

Pos logit contributions
pins+0.004
ouses+0.004
ugg+0.004
 Eve+0.004
uts+0.003
Neg logit contributions
 matter-0.006
 beginning-0.005
 disagreement-0.005
ho-0.005
 swoop-0.005
Token present
Token ID1944
Feature activation+4.28e-01

Pos logit contributions
pins+0.019
ugg+0.018
ouses+0.018
 Eve+0.017
 complexion+0.017
Neg logit contributions
 matter-0.029
 beginning-0.025
 disagreement-0.025
ho-0.025
 swoop-0.025
Token.
Token ID13
Feature activation-1.99e-01

Pos logit contributions
 matter+0.017
 disagreement+0.015
 beginning+0.014
 swoop+0.014
 problem+0.014
Neg logit contributions
pins-0.011
uts-0.010
ouses-0.010
 Eve-0.010
ugg-0.010
Token house
Token ID2156
Feature activation+7.04e-02

Pos logit contributions
pins+0.017
 Eve+0.016
ouses+0.015
ugg+0.014
 complexion+0.014
Neg logit contributions
 matter-0.028
ho-0.024
 beginning-0.024
 disagreement-0.024
 swoop-0.024
Token.
Token ID13
Feature activation-9.43e-02

Pos logit contributions
 matter+0.004
 disagreement+0.003
 swoop+0.003
Uh+0.003
heit+0.003
Neg logit contributions
pins-0.001
uts-0.001
 Eve-0.001
ouses-0.001
washer-0.001
Token Lily
Token ID20037
Feature activation-3.95e-01

Pos logit contributions
pins+0.002
 Eve+0.002
ouses+0.001
uts+0.001
ugg+0.001
Neg logit contributions
 matter-0.005
ho-0.004
 swoop-0.004
 disagreement-0.004
 beginning-0.004
Token was
Token ID373
Feature activation-3.92e-01

Pos logit contributions
pins+0.014
ouses+0.013
 Eve+0.013
uts+0.013
ugg+0.013
Neg logit contributions
 matter-0.012
 disagreement-0.010
 beginning-0.009
ho-0.009
 swoop-0.009
Token the
Token ID262
Feature activation-6.72e-01

Pos logit contributions
pins+0.011
 Eve+0.010
ouses+0.009
uts+0.009
 buck+0.009
Neg logit contributions
 matter-0.014
ho-0.013
 disagreement-0.012
 swoop-0.012
 problem-0.011
Token seeker
Token ID45993
Feature activation-3.61e+00

Pos logit contributions
pins+0.014
 Eve+0.013
 CDs+0.011
 buck+0.011
 Santa+0.011
Neg logit contributions
 matter-0.026
 swoop-0.025
 disagreement-0.024
 inspection-0.024
ho-0.023
Token and
Token ID290
Feature activation+1.11e+00

Pos logit contributions
pins+0.071
uts+0.063
 Eve+0.060
ouses+0.060
ugg+0.057
Neg logit contributions
 matter-0.163
 disagreement-0.147
 beginning-0.142
 swoop-0.139
ho-0.137
Token Max
Token ID5436
Feature activation+6.28e-01

Pos logit contributions
 matter+0.044
 disagreement+0.038
 beginning+0.037
ho+0.037
 swoop+0.036
Neg logit contributions
pins-0.029
ouses-0.026
ugg-0.025
uts-0.025
 Eve-0.025
Token was
Token ID373
Feature activation+7.28e-01

Pos logit contributions
 matter+0.020
 disagreement+0.017
 beginning+0.017
ho+0.016
 problem+0.016
Neg logit contributions
pins-0.020
ouses-0.019
 Eve-0.018
uts-0.018
ugg-0.018
Token the
Token ID262
Feature activation+1.84e-01

Pos logit contributions
 matter+0.030
ho+0.026
 disagreement+0.026
 swoop+0.025
 problem+0.024
Neg logit contributions
pins-0.016
 Eve-0.016
uts-0.014
ouses-0.014
 buck-0.014
Token h
Token ID289
Feature activation-4.09e-01

Pos logit contributions
 swoop+0.006
ho+0.006
heit+0.006
 matter+0.006
cking+0.006
Neg logit contributions
 Eve-0.004
 bed-0.004
pins-0.004
 Teddy-0.004
washer-0.004
Token park
Token ID3952
Feature activation+4.49e-01

Pos logit contributions
 Eve+0.002
pins+0.002
 bed+0.001
ugg+0.001
ouses+0.001
Neg logit contributions
 matter-0.004
ho-0.003
 swoop-0.003
 disagreement-0.003
ah-0.003
Token.
Token ID13
Feature activation-2.22e-01

Pos logit contributions
 matter+0.023
 disagreement+0.020
 swoop+0.020
 problem+0.020
Uh+0.019
Neg logit contributions
pins-0.006
 Eve-0.005
ouses-0.005
uts-0.005
."-0.004
Token Lily
Token ID20037
Feature activation-4.94e-01

Pos logit contributions
pins+0.004
 Eve+0.004
ouses+0.004
uts+0.003
ugg+0.003
Neg logit contributions
 matter-0.010
ho-0.009
 disagreement-0.009
 beginning-0.009
 swoop-0.009
Token was
Token ID373
Feature activation-4.59e-01

Pos logit contributions
pins+0.018
ouses+0.016
uts+0.016
 Eve+0.016
ugg+0.016
Neg logit contributions
 matter-0.014
 disagreement-0.012
 beginning-0.012
ho-0.011
 swoop-0.011
Token the
Token ID262
Feature activation-8.49e-01

Pos logit contributions
pins+0.012
 Eve+0.011
ouses+0.011
uts+0.010
 buck+0.010
Neg logit contributions
 matter-0.017
ho-0.015
 disagreement-0.015
 swoop-0.014
 problem-0.014
Token seeker
Token ID45993
Feature activation-3.61e+00

Pos logit contributions
pins+0.011
 Eve+0.009
 CDs+0.008
 elderly+0.008
 buck+0.008
Neg logit contributions
 matter-0.039
 inspection-0.037
 swoop-0.037
 disagreement-0.036
rated-0.036
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
pins+0.071
uts+0.060
ouses+0.059
 Eve+0.058
ugg+0.057
Neg logit contributions
 matter-0.164
 disagreement-0.147
 beginning-0.143
 swoop-0.140
ho-0.139
Token Max
Token ID5436
Feature activation+6.16e-01

Pos logit contributions
 matter+0.046
 disagreement+0.041
 beginning+0.040
ho+0.039
 swoop+0.039
Neg logit contributions
pins-0.030
ouses-0.027
ugg-0.026
uts-0.026
 Eve-0.026
Token was
Token ID373
Feature activation+7.40e-01

Pos logit contributions
 matter+0.021
 disagreement+0.018
 beginning+0.017
ho+0.016
 problem+0.016
Neg logit contributions
pins-0.019
ouses-0.018
uts-0.018
 Eve-0.018
ugg-0.017
Token the
Token ID262
Feature activation+9.92e-02

Pos logit contributions
 matter+0.033
ho+0.030
 disagreement+0.029
urious+0.028
 swoop+0.028
Neg logit contributions
 Eve-0.013
pins-0.012
uts-0.011
 buck-0.011
hes-0.011
Token h
Token ID289
Feature activation-3.45e-01

Pos logit contributions
cking+0.003
ering+0.003
ho+0.003
heit+0.003
 swoop+0.003
Neg logit contributions
 bed-0.002
 Eve-0.002
 sofa-0.002
 buck-0.002
 opener-0.002
Token was
Token ID373
Feature activation-6.84e-02

Pos logit contributions
pins+0.026
ouses+0.024
uts+0.023
unn+0.023
ugg+0.022
Neg logit contributions
 matter-0.021
 disagreement-0.017
 beginning-0.017
 swoop-0.017
ho-0.017
Token very
Token ID845
Feature activation-4.15e-02

Pos logit contributions
pins+0.002
ouses+0.002
uts+0.002
ugg+0.002
 Eve+0.002
Neg logit contributions
 matter-0.002
 disagreement-0.002
 beginning-0.002
 swoop-0.002
ho-0.002
Token elderly
Token ID13830
Feature activation-1.02e+00

Pos logit contributions
pins+0.001
uts+0.001
ests+0.001
ouses+0.001
ugg+0.001
Neg logit contributions
 matter-0.001
 disagreement-0.001
 beginning-0.001
 problem-0.001
ho-0.001
Token.
Token ID13
Feature activation-4.64e-02

Pos logit contributions
ugg+0.041
unn+0.041
pins+0.040
ouses+0.039
angers+0.038
Neg logit contributions
 matter-0.018
 beginning-0.015
 problem-0.014
 coming-0.014
ho-0.014
Token The
Token ID383
Feature activation+4.87e-01

Pos logit contributions
unn+0.002
ouses+0.002
pins+0.002
aid+0.002
uts+0.002
Neg logit contributions
 matter-0.001
 disagreement-0.001
 total-0.001
 beginning-0.001
 quarrel-0.001
Token grand
Token ID4490
Feature activation-3.57e+00

Pos logit contributions
 matter+0.019
 beginning+0.017
 disagreement+0.016
 problem+0.016
 quarrel+0.015
Neg logit contributions
pins-0.013
uts-0.013
ouses-0.013
ugg-0.012
unn-0.012
Tokenpa
Token ID8957
Feature activation-1.31e+00

Pos logit contributions
pins+0.125
ouses+0.121
uts+0.119
ugg+0.118
aid+0.113
Neg logit contributions
 matter-0.109
 problem-0.089
 beginning-0.088
 disagreement-0.088
 size-0.085
Token was
Token ID373
Feature activation-2.27e-01

Pos logit contributions
pins+0.048
ouses+0.044
uts+0.043
ugg+0.042
unn+0.042
Neg logit contributions
 matter-0.037
 disagreement-0.030
 swoop-0.030
 beginning-0.029
ho-0.029
Token always
Token ID1464
Feature activation-7.89e-01

Pos logit contributions
pins+0.008
ouses+0.007
uts+0.007
ugg+0.007
 Eve+0.007
Neg logit contributions
 matter-0.007
 disagreement-0.006
 beginning-0.006
 swoop-0.006
ho-0.006
Token teaching
Token ID7743
Feature activation-1.01e+00

Pos logit contributions
pins+0.022
 Eve+0.020
ouses+0.019
uts+0.018
 buck+0.018
Neg logit contributions
 matter-0.029
ho-0.026
 disagreement-0.025
 swoop-0.024
 beginning-0.024
Token the
Token ID262
Feature activation-8.43e-01

Pos logit contributions
pins+0.028
 Eve+0.026
uts+0.025
ouses+0.025
unn+0.024
Neg logit contributions
 matter-0.035
ho-0.031
 swoop-0.031
 beginning-0.031
 disagreement-0.030
Token
Token ID198
Feature activation-9.52e-02

Pos logit contributions
unn+0.003
pins+0.003
ouses+0.003
ugg+0.003
ests+0.003
Neg logit contributions
 matter-0.002
ho-0.002
 beginning-0.002
What-0.002
Where-0.002
Token
Token ID198
Feature activation-2.84e-01

Pos logit contributions
pins+0.003
ouses+0.003
ugg+0.003
unn+0.003
 complexion+0.003
Neg logit contributions
 matter-0.003
 swoop-0.003
 disagreement-0.002
ho-0.002
What-0.002
Token"
Token ID1
Feature activation-7.40e-01

Pos logit contributions
pins+0.010
ouses+0.010
ugg+0.010
unn+0.009
uts+0.009
Neg logit contributions
 matter-0.008
 swoop-0.006
 disagreement-0.006
ho-0.006
 beginning-0.006
TokenIn
Token ID818
Feature activation+3.20e-01

Pos logit contributions
pins+0.027
ouses+0.026
unn+0.026
ugg+0.025
 Eve+0.025
Neg logit contributions
 matter-0.020
 beginning-0.016
ho-0.016
 disagreement-0.016
 swoop-0.015
Token the
Token ID262
Feature activation-2.12e-01

Pos logit contributions
 matter+0.012
 disagreement+0.010
 beginning+0.010
ho+0.010
 swoop+0.010
Neg logit contributions
pins-0.009
ouses-0.008
 Eve-0.008
uts-0.007
ugg-0.007
Token attic
Token ID46113
Feature activation-3.57e+00

Pos logit contributions
 Eve+0.005
pins+0.005
grand+0.004
ouses+0.004
ugg+0.004
Neg logit contributions
 matter-0.008
 swoop-0.007
ho-0.007
 disagreement-0.007
ths-0.006
Token,
Token ID11
Feature activation-6.14e-01

Pos logit contributions
pins+0.071
 Eve+0.065
ouses+0.058
uts+0.057
ugg+0.054
Neg logit contributions
 matter-0.159
 disagreement-0.141
 beginning-0.139
 swoop-0.135
ho-0.133
Token Grand
Token ID5675
Feature activation-1.29e+00

Pos logit contributions
pins+0.017
 Eve+0.015
ouses+0.015
uts+0.014
ugg+0.014
Neg logit contributions
 matter-0.023
 disagreement-0.021
 beginning-0.020
ho-0.020
 swoop-0.019
Tokenma
Token ID2611
Feature activation-3.23e-01

Pos logit contributions
pins+0.042
ouses+0.039
uts+0.038
ugg+0.038
 Eve+0.037
Neg logit contributions
 matter-0.042
 disagreement-0.035
 beginning-0.034
ho-0.033
 swoop-0.033
Token.
Token ID13
Feature activation+2.49e-02

Pos logit contributions
pins+0.008
ouses+0.007
 Eve+0.007
uts+0.007
ugg+0.007
Neg logit contributions
 matter-0.013
 disagreement-0.012
 beginning-0.011
ho-0.011
 swoop-0.011
Token We
Token ID775
Feature activation-5.90e-01

Pos logit contributions
 matter+0.001
ho+0.001
 beginning+0.001
where+0.001
 total+0.001
Neg logit contributions
pins-0.001
ouses-0.001
unn-0.001
 complexion-0.001
uts-0.001
Token park
Token ID3952
Feature activation+8.02e-01

Pos logit contributions
 matter+0.005
ho+0.005
 swoop+0.005
 disagreement+0.004
ah+0.004
Neg logit contributions
 Eve-0.003
pins-0.003
ugg-0.002
ouses-0.002
 Santa-0.002
Token.
Token ID13
Feature activation-2.13e-01

Pos logit contributions
 matter+0.043
 disagreement+0.037
Uh+0.036
heit+0.036
 swoop+0.036
Neg logit contributions
pins-0.009
uts-0.009
ouses-0.008
 Eve-0.008
 opener-0.006
Token Lily
Token ID20037
Feature activation-3.24e-01

Pos logit contributions
pins+0.004
 Eve+0.003
ouses+0.003
uts+0.003
ugg+0.003
Neg logit contributions
 matter-0.010
 swoop-0.009
ho-0.009
 disagreement-0.009
 beginning-0.009
Token was
Token ID373
Feature activation-2.86e-01

Pos logit contributions
pins+0.012
ouses+0.011
ugg+0.011
uts+0.011
 Eve+0.011
Neg logit contributions
 matter-0.009
 disagreement-0.007
 beginning-0.007
 swoop-0.007
ho-0.007
Token the
Token ID262
Feature activation-6.21e-01

Pos logit contributions
pins+0.008
ouses+0.007
 Eve+0.007
uts+0.007
ugg+0.007
Neg logit contributions
 matter-0.010
 disagreement-0.009
ho-0.009
 swoop-0.009
 beginning-0.008
Token seeker
Token ID45993
Feature activation-3.56e+00

Pos logit contributions
pins+0.012
 Eve+0.011
 elderly+0.010
 buck+0.010
 Santa+0.010
Neg logit contributions
 matter-0.024
 disagreement-0.023
 swoop-0.023
 inspection-0.023
rated-0.022
Token and
Token ID290
Feature activation+1.14e+00

Pos logit contributions
pins+0.071
ouses+0.061
uts+0.060
 Eve+0.059
ugg+0.058
Neg logit contributions
 matter-0.160
 disagreement-0.143
 beginning-0.139
 swoop-0.137
ho-0.136
Token Tom
Token ID4186
Feature activation+1.11e+00

Pos logit contributions
 matter+0.044
 disagreement+0.038
 beginning+0.037
 swoop+0.036
ho+0.036
Neg logit contributions
pins-0.031
ouses-0.027
ugg-0.027
uts-0.027
 Eve-0.026
Token was
Token ID373
Feature activation+9.05e-01

Pos logit contributions
 matter+0.037
 disagreement+0.032
 beginning+0.031
ho+0.029
 problem+0.028
Neg logit contributions
pins-0.034
ouses-0.032
 Eve-0.032
uts-0.032
ugg-0.030
Token the
Token ID262
Feature activation+1.08e-01

Pos logit contributions
 matter+0.038
 disagreement+0.033
ho+0.033
 swoop+0.031
 beginning+0.031
Neg logit contributions
pins-0.020
 Eve-0.018
ouses-0.017
uts-0.017
 buck-0.016
Token h
Token ID289
Feature activation-2.27e-01

Pos logit contributions
 matter+0.004
ho+0.004
 disagreement+0.004
heit+0.004
 swoop+0.004
Neg logit contributions
 Eve-0.002
pins-0.002
 buck-0.002
 bed-0.002
 opener-0.002
Token seas
Token ID21547
Feature activation-2.19e+00

Pos logit contributions
pins+0.016
ouses+0.015
 complexion+0.015
 Eve+0.013
 buck+0.013
Neg logit contributions
 matter-0.027
ths-0.025
ho-0.025
 disagreement-0.025
 cheating-0.024
Tokenhell
Token ID12758
Feature activation+1.24e+00

Pos logit contributions
pins+0.060
ouses+0.054
uts+0.054
ugg+0.052
 Eve+0.051
Neg logit contributions
 matter-0.083
 disagreement-0.072
 beginning-0.070
 swoop-0.067
ho-0.067
Token.
Token ID13
Feature activation-8.70e-03

Pos logit contributions
 matter+0.056
 disagreement+0.051
 End+0.046
 beginning+0.045
 cheating+0.045
Neg logit contributions
pins-0.028
uts-0.027
 Eve-0.021
ouses-0.021
ugg-0.020
Token "
Token ID366
Feature activation+5.66e-02

Pos logit contributions
pins+0.000
ouses+0.000
unn+0.000
uts+0.000
grand+0.000
Neg logit contributions
 matter-0.000
 beginning-0.000
ho-0.000
 disagreement-0.000
 swoop-0.000
TokenThat
Token ID2504
Feature activation+1.15e+00

Pos logit contributions
 matter+0.002
 problem+0.001
 disagreement+0.001
 beginning+0.001
ho+0.001
Neg logit contributions
pins-0.002
ugg-0.002
 Eve-0.002
grand-0.002
ouses-0.002
Token seas
Token ID21547
Feature activation-3.54e+00

Pos logit contributions
 matter+0.042
 disagreement+0.037
 beginning+0.035
 cheating+0.034
ho+0.033
Neg logit contributions
pins-0.029
 Eve-0.028
washer-0.026
uts-0.025
grand-0.025
Tokenhell
Token ID12758
Feature activation-5.05e-01

Pos logit contributions
pins+0.083
uts+0.081
ouses+0.077
unn+0.074
ugg+0.073
Neg logit contributions
 matter-0.140
 beginning-0.126
 disagreement-0.125
where-0.115
ho-0.114
Token belongs
Token ID14448
Feature activation-7.69e-01

Pos logit contributions
uts+0.010
pins+0.009
unn+0.009
ouses+0.008
ugg+0.008
Neg logit contributions
 matter-0.023
 beginning-0.020
 disagreement-0.020
 End-0.019
Where-0.019
Token to
Token ID284
Feature activation-9.09e-01

Pos logit contributions
pins+0.019
ouses+0.018
uts+0.017
ugg+0.017
 Eve+0.017
Neg logit contributions
 matter-0.030
 disagreement-0.027
 beginning-0.026
 swoop-0.025
ho-0.025
Token you
Token ID345
Feature activation-9.75e-01

Pos logit contributions
pins+0.030
 Eve+0.029
ouses+0.028
ugg+0.027
uts+0.027
Neg logit contributions
 matter-0.027
 disagreement-0.023
 beginning-0.022
ho-0.022
 swoop-0.021
Token,
Token ID11
Feature activation+6.22e-02

Pos logit contributions
pins+0.012
 Eve+0.012
uts+0.010
unn+0.010
ouses+0.009
Neg logit contributions
 matter-0.049
 disagreement-0.044
 beginning-0.043
 swoop-0.041
ho-0.041
Token brown
Token ID7586
Feature activation-7.57e-01

Pos logit contributions
pins+0.045
ouses+0.041
ugg+0.040
uts+0.040
 Eve+0.039
Neg logit contributions
 matter-0.056
 disagreement-0.049
 beginning-0.048
 swoop-0.047
ho-0.046
Token hair
Token ID4190
Feature activation-8.98e-01

Pos logit contributions
pins+0.022
ouses+0.020
ugg+0.019
uts+0.019
 Eve+0.019
Neg logit contributions
 matter-0.027
 disagreement-0.024
 beginning-0.023
 swoop-0.023
ho-0.022
Token and
Token ID290
Feature activation+5.90e-01

Pos logit contributions
pins+0.018
uts+0.015
ouses+0.014
ugg+0.014
 Eve+0.013
Neg logit contributions
 matter-0.042
 disagreement-0.038
 swoop-0.036
 beginning-0.035
 problem-0.035
Token a
Token ID257
Feature activation+6.22e-01

Pos logit contributions
 matter+0.021
 disagreement+0.019
 beginning+0.018
ho+0.018
 problem+0.017
Neg logit contributions
pins-0.017
ouses-0.016
uts-0.015
 Eve-0.015
ugg-0.015
Token big
Token ID1263
Feature activation+8.03e-02

Pos logit contributions
 matter+0.025
urious+0.024
ho+0.022
ths+0.022
rated+0.022
Neg logit contributions
pins-0.013
 complexion-0.012
 buck-0.011
ouses-0.011
 Eve-0.011
Token smile
Token ID8212
Feature activation+2.37e+00

Pos logit contributions
 matter+0.003
urious+0.003
ho+0.003
 beginning+0.003
ths+0.003
Neg logit contributions
pins-0.002
 complexion-0.002
 Eve-0.002
 bed-0.001
 buck-0.001
Token.
Token ID13
Feature activation+2.64e-01

Pos logit contributions
 matter+0.102
 disagreement+0.097
urious+0.091
ths+0.084
 impossible+0.084
Neg logit contributions
pins-0.053
uts-0.050
 complexion-0.044
ouses-0.043
ugg-0.042
Token He
Token ID679
Feature activation-2.64e-01

Pos logit contributions
 matter+0.007
where+0.006
 beginning+0.006
ho+0.006
 ï¿½+0.006
Neg logit contributions
pins-0.009
ouses-0.008
aid-0.007
 CDs-0.007
unn-0.007
Token walked
Token ID6807
Feature activation-5.69e-01

Pos logit contributions
pins+0.009
ouses+0.009
ugg+0.008
uts+0.008
 Eve+0.008
Neg logit contributions
 matter-0.008
 disagreement-0.006
 swoop-0.006
 beginning-0.006
ho-0.006
Token up
Token ID510
Feature activation-4.97e-01

Pos logit contributions
pins+0.016
 Eve+0.014
ouses+0.014
ugg+0.014
uts+0.013
Neg logit contributions
 matter-0.022
 disagreement-0.018
 beginning-0.017
 problem-0.017
 swoop-0.017
Token to
Token ID284
Feature activation+4.96e-01

Pos logit contributions
pins+0.015
ouses+0.014
uts+0.014
ugg+0.014
unn+0.013
Neg logit contributions
 matter-0.017
 disagreement-0.015
 beginning-0.014
ho-0.014
 swoop-0.014
TokenThe
Token ID464
Feature activation+6.31e-01

Pos logit contributions
pins+0.003
ouses+0.003
uts+0.003
ugg+0.003
 Eve+0.003
Neg logit contributions
 matter-0.006
 disagreement-0.005
 beginning-0.005
 problem-0.005
ho-0.005
Token girl
Token ID2576
Feature activation+3.15e-01

Pos logit contributions
 matter+0.027
 disagreement+0.025
ho+0.025
 swoop+0.024
ah+0.023
Neg logit contributions
pins-0.012
 Eve-0.011
grand-0.009
ouses-0.009
ugg-0.009
Token picked
Token ID6497
Feature activation+1.35e-01

Pos logit contributions
 matter+0.011
 disagreement+0.010
 beginning+0.010
ho+0.009
 swoop+0.009
Neg logit contributions
pins-0.009
ouses-0.008
ugg-0.008
 Eve-0.008
uts-0.008
Token out
Token ID503
Feature activation-8.77e-01

Pos logit contributions
 matter+0.005
 disagreement+0.004
ho+0.004
 swoop+0.004
 beginning+0.004
Neg logit contributions
pins-0.004
 Eve-0.003
ouses-0.003
uts-0.003
ugg-0.003
Token a
Token ID257
Feature activation+6.17e-01

Pos logit contributions
pins+0.022
ouses+0.019
 Eve+0.019
ugg+0.019
uts+0.019
Neg logit contributions
 matter-0.035
 disagreement-0.030
 beginning-0.030
ho-0.029
 swoop-0.029
Token cookie
Token ID19751
Feature activation+2.68e+00

Pos logit contributions
 matter+0.025
ho+0.022
ths+0.022
urious+0.021
 forth+0.021
Neg logit contributions
pins-0.014
ouses-0.012
 Eve-0.012
washer-0.011
 buck-0.011
Token from
Token ID422
Feature activation-9.76e-01

Pos logit contributions
 matter+0.130
 disagreement+0.116
 beginning+0.110
 swoop+0.109
urious+0.106
Neg logit contributions
pins-0.049
uts-0.040
ouses-0.038
 Eve-0.037
ugg-0.036
Token the
Token ID262
Feature activation+5.65e-02

Pos logit contributions
pins+0.025
 Eve+0.022
ugg+0.020
ouses+0.020
 Santa+0.019
Neg logit contributions
 matter-0.038
 disagreement-0.033
 beginning-0.032
ho-0.032
 swoop-0.032
Token cart
Token ID6383
Feature activation+2.57e-01

Pos logit contributions
 matter+0.003
 swoop+0.002
 disagreement+0.002
ah+0.002
urious+0.002
Neg logit contributions
pins-0.001
 Eve-0.001
ugg-0.001
 Santa-0.001
 sofa-0.001
Token and
Token ID290
Feature activation-1.71e-01

Pos logit contributions
 matter+0.010
 beginning+0.009
 disagreement+0.009
ho+0.009
 swoop+0.008
Neg logit contributions
pins-0.006
ouses-0.006
ugg-0.006
unn-0.006
uts-0.006
Token happily
Token ID18177
Feature activation+9.74e-01

Pos logit contributions
pins+0.005
ouses+0.004
 Eve+0.004
ugg+0.004
uts+0.004
Neg logit contributions
 matter-0.006
 disagreement-0.006
 beginning-0.005
ho-0.005
 problem-0.005
Token picked
Token ID6497
Feature activation+8.73e-01

Pos logit contributions
 matter+0.047
 disagreement+0.040
 swoop+0.040
 beginning+0.040
ho+0.039
Neg logit contributions
pins-0.059
ouses-0.054
uts-0.053
unn-0.053
ugg-0.053
Token out
Token ID503
Feature activation-2.99e-01

Pos logit contributions
 matter+0.032
 disagreement+0.028
ho+0.027
 beginning+0.027
 swoop+0.026
Neg logit contributions
pins-0.024
ouses-0.022
 Eve-0.021
uts-0.020
ugg-0.020
Token some
Token ID617
Feature activation+1.74e-01

Pos logit contributions
pins+0.008
ouses+0.007
 Eve+0.006
uts+0.006
ugg+0.006
Neg logit contributions
 matter-0.012
 disagreement-0.010
 beginning-0.010
ho-0.010
 swoop-0.010
Token apples
Token ID22514
Feature activation+1.89e+00

Pos logit contributions
 matter+0.006
 disagreement+0.006
ho+0.005
 beginning+0.005
 swoop+0.005
Neg logit contributions
pins-0.005
ouses-0.004
 Eve-0.004
uts-0.004
unn-0.004
Token and
Token ID290
Feature activation+2.42e-01

Pos logit contributions
 matter+0.074
 disagreement+0.065
 beginning+0.062
 swoop+0.061
ho+0.060
Neg logit contributions
pins-0.051
uts-0.044
ouses-0.044
 Eve-0.042
ugg-0.042
Token bananas
Token ID35484
Feature activation+2.48e+00

Pos logit contributions
 matter+0.009
 disagreement+0.008
 beginning+0.008
ho+0.008
 swoop+0.007
Neg logit contributions
pins-0.007
ouses-0.006
 Eve-0.006
uts-0.006
ugg-0.006
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
 matter+0.096
 disagreement+0.084
 beginning+0.081
 swoop+0.077
 problem+0.077
Neg logit contributions
pins-0.068
uts-0.060
ouses-0.060
ugg-0.057
 Eve-0.057
Token When
Token ID1649
Feature activation+1.46e+00

Pos logit contributions
pins+0.003
unn+0.003
ouses+0.003
ugg+0.003
uts+0.003
Neg logit contributions
 matter-0.004
 disagreement-0.004
 beginning-0.003
ho-0.003
 quarrel-0.003
Token they
Token ID484
Feature activation-2.34e-01

Pos logit contributions
 matter+0.062
 beginning+0.054
 disagreement+0.054
ho+0.053
 swoop+0.053
Neg logit contributions
pins-0.034
 Eve-0.029
ouses-0.027
uts-0.026
ugg-0.026
Token got
Token ID1392
Feature activation-9.30e-01

Pos logit contributions
pins+0.008
ouses+0.008
uts+0.007
 Eve+0.007
ugg+0.007
Neg logit contributions
 matter-0.007
 disagreement-0.006
 beginning-0.006
ho-0.006
 swoop-0.006
Token to
Token ID284
Feature activation-4.10e-02

Pos logit contributions
pins+0.031
ouses+0.029
uts+0.028
ugg+0.028
 Eve+0.028
Neg logit contributions
 matter-0.029
 disagreement-0.025
 beginning-0.024
ho-0.023
 swoop-0.023
Token of
Token ID286
Feature activation+9.44e-01

Pos logit contributions
pins+0.022
ouses+0.019
ugg+0.018
uts+0.018
 Eve+0.018
Neg logit contributions
 matter-0.057
 disagreement-0.051
 beginning-0.051
 swoop-0.050
ho-0.050
Token snow
Token ID6729
Feature activation+3.47e-01

Pos logit contributions
 matter+0.034
 disagreement+0.029
ho+0.029
 swoop+0.029
 beginning+0.028
Neg logit contributions
pins-0.027
 Eve-0.024
uts-0.023
ouses-0.023
ugg-0.023
Tokenmen
Token ID3653
Feature activation+5.14e-01

Pos logit contributions
 matter+0.013
 disagreement+0.011
 beginning+0.011
 swoop+0.011
 problem+0.010
Neg logit contributions
pins-0.010
uts-0.009
ouses-0.009
 Eve-0.009
ugg-0.009
Token and
Token ID290
Feature activation-1.55e-01

Pos logit contributions
 matter+0.024
 disagreement+0.022
 beginning+0.021
 swoop+0.021
ho+0.020
Neg logit contributions
pins-0.010
 Eve-0.008
uts-0.008
ugg-0.008
ouses-0.008
Token snow
Token ID6729
Feature activation+8.27e-01

Pos logit contributions
pins+0.004
ouses+0.004
uts+0.004
ugg+0.004
 Eve+0.004
Neg logit contributions
 matter-0.006
 disagreement-0.005
 beginning-0.005
 swoop-0.005
ho-0.005
Tokenballs
Token ID21591
Feature activation+2.36e+00

Pos logit contributions
 matter+0.033
 disagreement+0.028
 beginning+0.027
 swoop+0.026
 problem+0.025
Neg logit contributions
pins-0.023
uts-0.021
ouses-0.020
 Eve-0.020
ugg-0.020
Token and
Token ID290
Feature activation+3.34e-01

Pos logit contributions
 matter+0.088
 disagreement+0.077
 beginning+0.076
ho+0.074
 swoop+0.073
Neg logit contributions
pins-0.064
ouses-0.058
ugg-0.056
uts-0.056
 Eve-0.055
Token fun
Token ID1257
Feature activation+1.43e+00

Pos logit contributions
 matter+0.012
 disagreement+0.010
 beginning+0.010
 swoop+0.010
ho+0.010
Neg logit contributions
pins-0.010
ouses-0.009
uts-0.009
ugg-0.009
 Eve-0.008
Token.
Token ID13
Feature activation+6.31e-02

Pos logit contributions
 matter+0.062
 disagreement+0.055
 swoop+0.055
ho+0.054
 beginning+0.053
Neg logit contributions
pins-0.030
ouses-0.028
uts-0.028
 Eve-0.027
ugg-0.025
Token They
Token ID1119
Feature activation-1.04e+00

Pos logit contributions
 matter+0.002
 disagreement+0.002
 beginning+0.002
 total+0.002
 quarrel+0.002
Neg logit contributions
pins-0.002
ouses-0.001
ugg-0.001
unn-0.001
uts-0.001
Token were
Token ID547
Feature activation-2.85e-01

Pos logit contributions
pins+0.037
ouses+0.037
uts+0.036
unn+0.035
angers+0.034
Neg logit contributions
 matter-0.029
 swoop-0.024
 beginning-0.023
ho-0.023
 disagreement-0.022
Token saw
Token ID2497
Feature activation-3.75e-01

Pos logit contributions
pins+0.031
ouses+0.031
ugg+0.030
uts+0.030
unn+0.030
Neg logit contributions
 matter-0.023
 disagreement-0.019
 swoop-0.019
 beginning-0.019
ho-0.018
Token Tom
Token ID4186
Feature activation+2.54e-01

Pos logit contributions
pins+0.011
ouses+0.010
ugg+0.010
uts+0.010
 Eve+0.009
Neg logit contributions
 matter-0.013
 disagreement-0.012
 beginning-0.011
ho-0.011
 swoop-0.011
Token and
Token ID290
Feature activation+6.17e-01

Pos logit contributions
 matter+0.014
 disagreement+0.012
 beginning+0.012
ho+0.011
 total+0.011
Neg logit contributions
pins-0.003
uts-0.003
 Eve-0.002
 buck-0.002
ugg-0.002
Token Lily
Token ID20037
Feature activation+2.01e+00

Pos logit contributions
 matter+0.025
 disagreement+0.022
 beginning+0.022
ho+0.021
 swoop+0.021
Neg logit contributions
pins-0.015
ouses-0.013
 Eve-0.013
ugg-0.013
uts-0.012
Token and
Token ID290
Feature activation+1.14e+00

Pos logit contributions
 matter+0.086
 disagreement+0.072
ho+0.071
 swoop+0.070
ah+0.070
Neg logit contributions
pins-0.047
uts-0.042
 Eve-0.038
ouses-0.037
ugg-0.036
Token Spot
Token ID15899
Feature activation+2.73e+00

Pos logit contributions
 matter+0.047
 disagreement+0.042
 beginning+0.041
ho+0.040
 swoop+0.040
Neg logit contributions
pins-0.027
ouses-0.023
 Eve-0.022
ugg-0.022
uts-0.022
Token.
Token ID13
Feature activation+4.68e-01

Pos logit contributions
 matter+0.111
ah+0.091
 disagreement+0.091
ho+0.090
 swoop+0.089
Neg logit contributions
pins-0.070
uts-0.062
 Eve-0.056
 buck-0.055
ugg-0.054
Token They
Token ID1119
Feature activation-7.51e-01

Pos logit contributions
 matter+0.012
 disagreement+0.012
What+0.012
Where+0.011
 quarrel+0.011
Neg logit contributions
unn-0.016
ouses-0.016
pins-0.015
ugg-0.015
 complexion-0.014
Token were
Token ID547
Feature activation+2.51e-01

Pos logit contributions
ouses+0.027
pins+0.027
ugg+0.026
uts+0.025
unn+0.025
Neg logit contributions
 matter-0.021
 swoop-0.018
 disagreement-0.017
 beginning-0.017
ho-0.017
Token not
Token ID407
Feature activation+1.30e+00

Pos logit contributions
 matter+0.008
 disagreement+0.007
 beginning+0.007
ho+0.006
 swoop+0.006
Neg logit contributions
pins-0.008
ouses-0.008
uts-0.007
 Eve-0.007
ugg-0.007
Token happy
Token ID3772
Feature activation+5.07e-01

Pos logit contributions
 matter+0.043
 disagreement+0.036
 beginning+0.036
 swoop+0.034
ho+0.034
Neg logit contributions
pins-0.042
ouses-0.038
ugg-0.038
uts-0.037
 Eve-0.037
Token decided
Token ID3066
Feature activation+9.41e-02

Pos logit contributions
 matter+0.004
 beginning+0.003
 disagreement+0.003
ho+0.003
 problem+0.003
Neg logit contributions
pins-0.003
ouses-0.003
 Eve-0.003
uts-0.003
washer-0.003
Token to
Token ID284
Feature activation+6.09e-01

Pos logit contributions
 matter+0.004
 disagreement+0.004
 total+0.004
 beginning+0.004
ho+0.004
Neg logit contributions
pins-0.002
uts-0.001
grand-0.001
 buck-0.001
 Eve-0.001
Token take
Token ID1011
Feature activation+4.66e-01

Pos logit contributions
 matter+0.020
 beginning+0.017
 disagreement+0.017
ho+0.017
 swoop+0.016
Neg logit contributions
pins-0.019
ouses-0.017
 Eve-0.016
uts-0.016
ugg-0.016
Token a
Token ID257
Feature activation+7.26e-01

Pos logit contributions
 matter+0.019
 disagreement+0.016
 beginning+0.016
ho+0.016
 swoop+0.016
Neg logit contributions
pins-0.012
ouses-0.010
uts-0.010
 Eve-0.010
ugg-0.010
Token nap
Token ID25422
Feature activation+8.44e-01

Pos logit contributions
 matter+0.025
 beginning+0.023
ths+0.023
 End+0.023
 size+0.023
Neg logit contributions
pins-0.018
washer-0.016
 opener-0.016
 Eve-0.015
ouses-0.015
Token in
Token ID287
Feature activation+2.52e-01

Pos logit contributions
 matter+0.030
 disagreement+0.026
 beginning+0.025
 swoop+0.024
ho+0.024
Neg logit contributions
pins-0.025
ouses-0.023
uts-0.022
ugg-0.022
 Eve-0.022
Token the
Token ID262
Feature activation-2.48e-01

Pos logit contributions
 matter+0.011
ho+0.010
 disagreement+0.010
 swoop+0.010
 beginning+0.009
Neg logit contributions
pins-0.005
 Eve-0.005
 Santa-0.004
 buck-0.004
ouses-0.004
Token woods
Token ID16479
Feature activation-1.59e-01

Pos logit contributions
 Eve+0.005
 Santa+0.005
 bed+0.005
ugg+0.005
pins+0.005
Neg logit contributions
 matter-0.010
ho-0.009
 disagreement-0.009
ths-0.008
 inspection-0.008
Token.
Token ID13
Feature activation+5.49e-02

Pos logit contributions
pins+0.003
 Eve+0.003
uts+0.003
ouses+0.003
washer+0.003
Neg logit contributions
 matter-0.007
 disagreement-0.006
 swoop-0.006
 problem-0.006
 beginning-0.006
Token When
Token ID1649
Feature activation+1.19e+00

Pos logit contributions
 matter+0.002
 beginning+0.002
 disagreement+0.002
 swoop+0.002
 total+0.002
Neg logit contributions
pins-0.001
ouses-0.001
unn-0.001
ugg-0.001
uts-0.001
Token he
Token ID339
Feature activation-2.08e-01

Pos logit contributions
 matter+0.056
ho+0.051
 swoop+0.051
 disagreement+0.051
 beginning+0.049
Neg logit contributions
pins-0.019
 Eve-0.018
 Santa-0.015
 buck-0.013
uts-0.013
Token forest
Token ID8222
Feature activation-3.74e-01

Pos logit contributions
 Eve+0.012
pins+0.011
 Santa+0.011
washer+0.011
ugg+0.010
Neg logit contributions
 matter-0.025
 disagreement-0.023
ho-0.022
 swoop-0.021
ah-0.021
Token until
Token ID1566
Feature activation+1.10e+00

Pos logit contributions
pins+0.006
uts+0.005
ouses+0.005
 Eve+0.005
 opener+0.004
Neg logit contributions
 matter-0.018
 disagreement-0.016
 problem-0.015
 swoop-0.015
 beginning-0.015
Token he
Token ID339
Feature activation+5.16e-01

Pos logit contributions
 matter+0.055
urious+0.047
 disagreement+0.047
 problem+0.045
ho+0.045
Neg logit contributions
pins-0.018
 Eve-0.014
 Santa-0.014
ugg-0.012
ouses-0.012
Token was
Token ID373
Feature activation+3.65e-01

Pos logit contributions
 matter+0.020
 disagreement+0.017
 beginning+0.016
 problem+0.016
 forth+0.015
Neg logit contributions
pins-0.013
ouses-0.012
 Eve-0.012
ugg-0.012
uts-0.012
Token tired
Token ID10032
Feature activation-1.52e+00

Pos logit contributions
 matter+0.013
 disagreement+0.011
ho+0.011
 beginning+0.011
 swoop+0.010
Neg logit contributions
pins-0.011
ouses-0.010
 Eve-0.009
uts-0.009
ugg-0.009
Token and
Token ID290
Feature activation+5.96e-01

Pos logit contributions
pins+0.034
 Eve+0.029
uts+0.028
ouses+0.026
ugg+0.025
Neg logit contributions
 matter-0.064
 disagreement-0.059
 swoop-0.056
ho-0.055
 beginning-0.054
Token then
Token ID788
Feature activation+4.15e-02

Pos logit contributions
 matter+0.024
 disagreement+0.021
ah+0.020
ho+0.020
 problem+0.020
Neg logit contributions
pins-0.014
 Eve-0.012
ugg-0.012
ouses-0.011
 buck-0.011
Token he
Token ID339
Feature activation+4.29e-01

Pos logit contributions
 matter+0.002
 disagreement+0.002
ho+0.002
 total+0.002
 beginning+0.002
Neg logit contributions
pins-0.001
ouses-0.001
 Eve-0.001
 buck-0.001
 Santa-0.001
Token went
Token ID1816
Feature activation-5.33e-02

Pos logit contributions
 matter+0.016
 disagreement+0.014
 beginning+0.013
ho+0.013
where+0.012
Neg logit contributions
pins-0.011
ouses-0.011
 Eve-0.010
 buck-0.010
ugg-0.010
Token to
Token ID284
Feature activation+8.76e-01

Pos logit contributions
pins+0.002
ouses+0.001
 Eve+0.001
ugg+0.001
uts+0.001
Neg logit contributions
 matter-0.002
 disagreement-0.002
ho-0.002
 beginning-0.002
 swoop-0.002
Token bed
Token ID3996
Feature activation-1.21e-01

Pos logit contributions
 matter+0.034
 disagreement+0.030
 beginning+0.029
ho+0.029
 swoop+0.028
Neg logit contributions
pins-0.022
 Eve-0.020
ouses-0.020
ugg-0.019
uts-0.019
Token t
Token ID256
Feature activation-1.32e+00

Pos logit contributions
 Eve+0.007
pins+0.007
ouses+0.007
ugg+0.007
 Santa+0.006
Neg logit contributions
 matter-0.014
 swoop-0.013
 disagreement-0.012
ho-0.012
 beginning-0.012
Tokeneddy
Token ID21874
Feature activation-1.02e+00

Pos logit contributions
pins+0.022
ouses+0.015
 Eve+0.015
uts+0.015
ugg+0.014
Neg logit contributions
 matter-0.062
ho-0.057
ah-0.054
 beginning-0.054
 problem-0.054
Token bear
Token ID6842
Feature activation+7.07e-02

Pos logit contributions
pins+0.026
 Eve+0.024
ouses+0.024
ugg+0.023
uts+0.022
Neg logit contributions
 matter-0.038
 disagreement-0.034
 beginning-0.033
ho-0.033
 swoop-0.032
Token.
Token ID13
Feature activation+6.46e-01

Pos logit contributions
 matter+0.003
 disagreement+0.002
 beginning+0.002
 swoop+0.002
ho+0.002
Neg logit contributions
pins-0.002
ouses-0.002
uts-0.002
 Eve-0.002
ugg-0.002
Token He
Token ID679
Feature activation-2.77e-01

Pos logit contributions
 matter+0.021
ho+0.017
 beginning+0.017
 swoop+0.016
 disagreement+0.016
Neg logit contributions
pins-0.020
ouses-0.018
unn-0.017
 complexion-0.017
ugg-0.017
Token is
Token ID318
Feature activation+3.11e-01

Pos logit contributions
pins+0.010
ouses+0.009
ugg+0.009
 Eve+0.009
uts+0.009
Neg logit contributions
 matter-0.008
 disagreement-0.007
 beginning-0.007
ho-0.007
 swoop-0.006
Token soft
Token ID2705
Feature activation-8.79e-01

Pos logit contributions
 matter+0.012
ho+0.011
 disagreement+0.011
urious+0.011
ah+0.010
Neg logit contributions
pins-0.007
 Santa-0.007
 Eve-0.007
hes-0.006
 buck-0.006
Token and
Token ID290
Feature activation+6.22e-01

Pos logit contributions
pins+0.022
ouses+0.021
ugg+0.020
unn+0.019
uts+0.019
Neg logit contributions
 matter-0.033
 beginning-0.030
 disagreement-0.029
 problem-0.028
ho-0.028
Token brown
Token ID7586
Feature activation-1.79e-01

Pos logit contributions
 matter+0.023
 disagreement+0.021
 beginning+0.020
ho+0.020
 swoop+0.019
Neg logit contributions
pins-0.017
ouses-0.015
 Eve-0.014
ugg-0.014
uts-0.014
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
pins+0.004
ouses+0.004
uts+0.003
ugg+0.003
 Eve+0.003
Neg logit contributions
 matter-0.007
 disagreement-0.007
 beginning-0.006
 swoop-0.006
ho-0.006
Token has
Token ID468
Feature activation+5.37e-01

Pos logit contributions
 matter+0.039
 disagreement+0.034
 beginning+0.033
ho+0.032
 swoop+0.032
Neg logit contributions
pins-0.031
ouses-0.028
 Eve-0.027
ugg-0.027
uts-0.027
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
 matter+0.072
ho+0.067
 beginning+0.064
 swoop+0.062
 disagreement+0.062
Neg logit contributions
pins-0.049
ouses-0.048
unn-0.046
ugg-0.045
 Eve-0.044
Token a
Token ID257
Feature activation+6.96e-01

Pos logit contributions
pins+0.029
ouses+0.027
ugg+0.027
uts+0.026
 Eve+0.025
Neg logit contributions
 matter-0.038
 disagreement-0.032
 beginning-0.032
 swoop-0.031
 problem-0.030
Token drink
Token ID4144
Feature activation+3.54e-01

Pos logit contributions
 matter+0.028
 disagreement+0.024
 beginning+0.024
ho+0.024
ths+0.023
Neg logit contributions
pins-0.017
ouses-0.015
ugg-0.014
 Eve-0.014
uts-0.013
Token and
Token ID290
Feature activation-6.43e-01

Pos logit contributions
 matter+0.014
 beginning+0.012
 disagreement+0.012
ho+0.012
 swoop+0.012
Neg logit contributions
pins-0.009
ouses-0.008
 Eve-0.008
uts-0.008
ugg-0.008
Token cheers
Token ID34550
Feature activation+9.78e-01

Pos logit contributions
pins+0.017
ouses+0.015
ugg+0.015
uts+0.015
 Eve+0.014
Neg logit contributions
 matter-0.025
 disagreement-0.021
 beginning-0.021
 swoop-0.021
ho-0.020
Token him
Token ID683
Feature activation+6.28e-01

Pos logit contributions
 matter+0.045
 disagreement+0.042
 swoop+0.042
ho+0.040
urious+0.039
Neg logit contributions
pins-0.019
uts-0.017
 Eve-0.017
 Santa-0.016
ouses-0.015
Token on
Token ID319
Feature activation+4.59e-01

Pos logit contributions
 matter+0.029
 disagreement+0.025
 beginning+0.024
ho+0.023
 problem+0.023
Neg logit contributions
pins-0.013
ouses-0.013
uts-0.012
ugg-0.012
hes-0.011
Token.
Token ID13
Feature activation-1.80e-01

Pos logit contributions
 matter+0.021
ho+0.019
 disagreement+0.018
 swoop+0.018
 beginning+0.018
Neg logit contributions
pins-0.009
 Eve-0.008
 Santa-0.008
uts-0.007
 bed-0.006
Token "
Token ID366
Feature activation-1.73e-01

Pos logit contributions
ouses+0.006
unn+0.005
pins+0.005
ugg+0.005
washer+0.005
Neg logit contributions
 matter-0.005
 disagreement-0.004
 beginning-0.004
What-0.004
ho-0.004
TokenYou
Token ID1639
Feature activation-6.91e-01

Pos logit contributions
pins+0.006
ouses+0.006
ugg+0.006
uts+0.006
 Eve+0.006
Neg logit contributions
 matter-0.005
 disagreement-0.004
 beginning-0.004
ho-0.004
 swoop-0.004
Token are
Token ID389
Feature activation-4.18e-01

Pos logit contributions
pins+0.025
ugg+0.023
ouses+0.023
uts+0.023
unn+0.022
Neg logit contributions
 matter-0.021
 disagreement-0.017
 beginning-0.017
 swoop-0.016
 quarrel-0.016
Token They
Token ID1119
Feature activation-6.74e-01

Pos logit contributions
 matter+0.014
 disagreement+0.013
 beginning+0.013
 swoop+0.012
ho+0.012
Neg logit contributions
pins-0.006
ouses-0.005
uts-0.005
ugg-0.005
 Eve-0.005
Token are
Token ID389
Feature activation+3.61e-01

Pos logit contributions
pins+0.017
ugg+0.014
 Eve+0.014
ouses+0.014
uts+0.014
Neg logit contributions
 matter-0.026
 disagreement-0.022
 beginning-0.022
ho-0.022
 swoop-0.022
Token sad
Token ID6507
Feature activation-5.02e-01

Pos logit contributions
 matter+0.013
ho+0.011
 disagreement+0.011
 swoop+0.011
where+0.011
Neg logit contributions
pins-0.010
uts-0.009
ouses-0.009
 Eve-0.009
ugg-0.009
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
pins+0.012
uts+0.011
ouses+0.011
ugg+0.010
 Eve+0.010
Neg logit contributions
 matter-0.020
 disagreement-0.018
 swoop-0.018
 beginning-0.017
ho-0.017
Token They
Token ID1119
Feature activation-7.82e-01

Pos logit contributions
 matter+0.010
 disagreement+0.009
 beginning+0.008
ho+0.008
 swoop+0.008
Neg logit contributions
pins-0.005
ouses-0.004
ugg-0.004
 Eve-0.004
unn-0.004
Token have
Token ID423
Feature activation+3.35e-01

Pos logit contributions
pins+0.023
ugg+0.020
ouses+0.020
 Eve+0.019
uts+0.019
Neg logit contributions
 matter-0.028
 disagreement-0.024
 beginning-0.023
ho-0.023
 swoop-0.023
Token no
Token ID645
Feature activation+5.14e-01

Pos logit contributions
 matter+0.010
 disagreement+0.008
 beginning+0.008
 swoop+0.008
ho+0.008
Neg logit contributions
pins-0.012
ouses-0.011
ugg-0.011
uts-0.011
 Eve-0.011
Token one
Token ID530
Feature activation-4.80e-01

Pos logit contributions
 matter+0.020
ho+0.017
 beginning+0.017
 disagreement+0.017
 swoop+0.017
Neg logit contributions
pins-0.013
ouses-0.012
 Eve-0.011
ugg-0.011
uts-0.011
Token to
Token ID284
Feature activation+1.13e+00

Pos logit contributions
pins+0.011
washer+0.010
uts+0.010
ouses+0.010
ugg+0.009
Neg logit contributions
 matter-0.019
 beginning-0.017
 swoop-0.017
 disagreement-0.016
ho-0.016
Token play
Token ID711
Feature activation+1.43e-01

Pos logit contributions
 matter+0.041
 beginning+0.037
ho+0.036
 disagreement+0.036
 swoop+0.035
Neg logit contributions
pins-0.031
 Eve-0.027
ouses-0.027
uts-0.025
ugg-0.025
Token with
Token ID351
Feature activation-1.93e-01

Pos logit contributions
 matter+0.006
 swoop+0.005
 disagreement+0.005
 beginning+0.005
ho+0.005
Neg logit contributions
pins-0.003
ouses-0.003
uts-0.003
 Eve-0.003
ugg-0.002
Token by
Token ID416
Feature activation+1.90e-01

Pos logit contributions
 matter+0.010
ho+0.009
 disagreement+0.009
 swoop+0.009
 beginning+0.009
Neg logit contributions
uts-0.005
pins-0.005
ouses-0.004
 Eve-0.004
washer-0.004
Token giving
Token ID3501
Feature activation-8.28e-02

Pos logit contributions
ho+0.007
 matter+0.007
ah+0.006
urious+0.006
 disagreement+0.006
Neg logit contributions
pins-0.004
 bed-0.004
 buck-0.004
 Eve-0.004
 Santa-0.004
Token him
Token ID683
Feature activation-6.55e-03

Pos logit contributions
 Eve+0.002
pins+0.002
 Santa+0.002
 Teddy+0.002
ouses+0.002
Neg logit contributions
 matter-0.003
ho-0.003
 beginning-0.003
 disagreement-0.003
 swoop-0.003
Token his
Token ID465
Feature activation-3.52e-01

Pos logit contributions
pins+0.000
ouses+0.000
uts+0.000
 Eve+0.000
ugg+0.000
Neg logit contributions
 matter-0.000
ho-0.000
 disagreement-0.000
 beginning-0.000
 swoop-0.000
Token wallet
Token ID13008
Feature activation+3.11e-01

Pos logit contributions
pins+0.007
grand+0.007
ouses+0.007
 buck+0.006
uts+0.006
Neg logit contributions
 matter-0.014
ho-0.013
 beginning-0.013
 swoop-0.013
 disagreement-0.012
Token!
Token ID0
Feature activation+5.22e-02

Pos logit contributions
 matter+0.012
 disagreement+0.011
 beginning+0.010
ho+0.010
 swoop+0.010
Neg logit contributions
pins-0.008
uts-0.007
ouses-0.007
 Eve-0.007
ugg-0.006
Token
Token ID198
Feature activation-4.33e-01

Pos logit contributions
 matter+0.002
 disagreement+0.002
 beginning+0.002
ho+0.002
 swoop+0.002
Neg logit contributions
pins-0.001
ouses-0.001
ugg-0.001
uts-0.001
unn-0.001
Token
Token ID198
Feature activation-5.81e-01

Pos logit contributions
pins+0.010
ouses+0.009
ugg+0.008
uts+0.008
 Eve+0.008
Neg logit contributions
 matter-0.018
 disagreement-0.016
 beginning-0.016
 swoop-0.016
ho-0.015
TokenTim
Token ID14967
Feature activation+2.85e-01

Pos logit contributions
pins+0.013
uts+0.012
 Eve+0.012
ugg+0.012
ouses+0.012
Neg logit contributions
 matter-0.024
 beginning-0.021
 disagreement-0.021
ho-0.021
 problem-0.021
Token was
Token ID373
Feature activation+3.35e-02

Pos logit contributions
 matter+0.010
 disagreement+0.009
 comparison+0.009
 beginning+0.009
ho+0.009
Neg logit contributions
 Eve-0.007
pins-0.007
ugg-0.007
 buck-0.007
washer-0.006
Token so
Token ID523
Feature activation-9.50e-01

Pos logit contributions
 matter+0.001
ho+0.001
 disagreement+0.001
 swoop+0.001
 beginning+0.001
Neg logit contributions
pins-0.001
ouses-0.001
uts-0.001
 Eve-0.001
ugg-0.001
Token you
Token ID345
Feature activation-6.39e-01

Pos logit contributions
pins+0.017
uts+0.017
 Eve+0.016
ouses+0.016
grand+0.015
Neg logit contributions
 matter-0.025
ho-0.023
 beginning-0.023
 disagreement-0.022
 swoop-0.022
Token think
Token ID892
Feature activation-1.62e-01

Pos logit contributions
pins+0.018
uts+0.016
ugg+0.015
ouses+0.015
 Eve+0.015
Neg logit contributions
 matter-0.022
 disagreement-0.019
 beginning-0.019
ho-0.019
 problem-0.019
Token I
Token ID314
Feature activation-1.08e+00

Pos logit contributions
pins+0.004
uts+0.004
 Eve+0.004
ouses+0.004
ugg+0.004
Neg logit contributions
 matter-0.006
 disagreement-0.005
ho-0.005
 beginning-0.005
 swoop-0.005
Token look
Token ID804
Feature activation-2.85e-01

Pos logit contributions
pins+0.032
uts+0.029
ugg+0.029
ouses+0.029
 Eve+0.028
Neg logit contributions
 matter-0.036
 beginning-0.031
 disagreement-0.031
ho-0.029
 problem-0.029
Token silly
Token ID14397
Feature activation-2.17e-01

Pos logit contributions
pins+0.006
ouses+0.006
 Santa+0.006
taker+0.005
 Eve+0.005
Neg logit contributions
 matter-0.011
ho-0.011
 swoop-0.010
 disagreement-0.010
ths-0.009
Token?"
Token ID1701
Feature activation-8.63e-01

Pos logit contributions
pins+0.004
ouses+0.003
uts+0.003
 Eve+0.003
ugg+0.003
Neg logit contributions
 matter-0.010
 disagreement-0.009
 beginning-0.009
 swoop-0.009
ho-0.009
Token he
Token ID339
Feature activation-4.61e-01

Pos logit contributions
pins+0.019
ouses+0.017
ugg+0.016
uts+0.016
 Eve+0.015
Neg logit contributions
 matter-0.037
 disagreement-0.033
 beginning-0.032
 swoop-0.032
ho-0.032
Token asked
Token ID1965
Feature activation-6.08e-01

Pos logit contributions
pins+0.006
 Eve+0.005
uts+0.005
 buck+0.005
ouses+0.004
Neg logit contributions
 matter-0.022
 disagreement-0.020
ho-0.020
 beginning-0.020
where-0.019
Token.
Token ID13
Feature activation+2.18e-01

Pos logit contributions
pins+0.010
 Eve+0.010
 Santa+0.008
taker+0.008
ouses+0.008
Neg logit contributions
 matter-0.028
 disagreement-0.025
 beginning-0.024
 total-0.024
 swoop-0.024
Token
Token ID198
Feature activation+2.35e-01

Pos logit contributions
 matter+0.009
 disagreement+0.008
 beginning+0.008
ho+0.008
 swoop+0.008
Neg logit contributions
pins-0.005
ouses-0.005
ugg-0.004
uts-0.004
unn-0.004
Token
Token ID198
Feature activation+6.53e-02

Pos logit contributions
 matter+0.009
 disagreement+0.008
 swoop+0.008
 beginning+0.008
ho+0.008
Neg logit contributions
pins-0.006
ouses-0.005
ugg-0.005
uts-0.005
unn-0.005
Token played
Token ID2826
Feature activation-1.84e-01

Pos logit contributions
 matter+0.012
 disagreement+0.010
 beginning+0.009
 problem+0.009
ho+0.009
Neg logit contributions
pins-0.009
ouses-0.008
ugg-0.008
 Eve-0.007
uts-0.007
Token together
Token ID1978
Feature activation+1.64e+00

Pos logit contributions
pins+0.005
ouses+0.004
 Eve+0.004
uts+0.004
ugg+0.004
Neg logit contributions
 matter-0.007
 disagreement-0.006
 beginning-0.006
ho-0.006
 swoop-0.006
Token for
Token ID329
Feature activation+7.12e-01

Pos logit contributions
 matter+0.081
Uh+0.070
 disagreement+0.068
urious+0.066
Time+0.066
Neg logit contributions
pins-0.025
uts-0.021
 opener-0.019
ouses-0.018
 Eve-0.018
Token a
Token ID257
Feature activation+2.66e-01

Pos logit contributions
 matter+0.026
 disagreement+0.022
 beginning+0.022
 swoop+0.021
 problem+0.020
Neg logit contributions
pins-0.022
ouses-0.020
uts-0.020
ugg-0.020
unn-0.019
Token while
Token ID981
Feature activation+8.47e-01

Pos logit contributions
 matter+0.010
 beginning+0.009
 cheating+0.009
urious+0.009
 disagreement+0.009
Neg logit contributions
pins-0.006
 Eve-0.006
ouses-0.006
ugg-0.005
uts-0.005
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
 matter+0.037
 disagreement+0.033
 swoop+0.032
ho+0.032
 beginning+0.032
Neg logit contributions
pins-0.017
 Eve-0.014
uts-0.014
ouses-0.013
unn-0.012
Token Suddenly
Token ID24975
Feature activation+2.43e-01

Pos logit contributions
pins+0.006
ouses+0.005
unn+0.005
ugg+0.005
uts+0.005
Neg logit contributions
 matter-0.008
 disagreement-0.007
 beginning-0.007
 quarrel-0.007
 impossible-0.007
Token,
Token ID11
Feature activation+3.29e-01

Pos logit contributions
 matter+0.008
 disagreement+0.007
 beginning+0.007
 problem+0.007
 swoop+0.007
Neg logit contributions
pins-0.008
ouses-0.007
ugg-0.007
uts-0.007
grand-0.007
Token a
Token ID257
Feature activation+6.55e-01

Pos logit contributions
 matter+0.011
 disagreement+0.010
 beginning+0.010
 problem+0.009
 swoop+0.009
Neg logit contributions
pins-0.010
ouses-0.010
ugg-0.010
unn-0.010
uts-0.010
Token big
Token ID1263
Feature activation+3.46e-02

Pos logit contributions
 matter+0.026
 beginning+0.022
urious+0.022
ho+0.022
 forth+0.021
Neg logit contributions
pins-0.016
 Eve-0.015
washer-0.013
 complexion-0.013
ouses-0.013
Token storm
Token ID6388
Feature activation+5.40e-01

Pos logit contributions
 matter+0.001
 beginning+0.001
 disagreement+0.001
ho+0.001
urious+0.001
Neg logit contributions
pins-0.001
 Eve-0.001
 complexion-0.001
uts-0.001
 Santa-0.001
Token
Token ID198
Feature activation-2.14e-02

Pos logit contributions
 matter+0.001
What+0.001
 disagreement+0.001
 beginning+0.001
where+0.001
Neg logit contributions
unn-0.001
pins-0.001
ouses-0.001
ests-0.001
ands-0.001
Token
Token ID198
Feature activation-2.11e-01

Pos logit contributions
pins+0.001
unn+0.001
ouses+0.001
 complexion+0.001
ugg+0.001
Neg logit contributions
 matter-0.001
 disagreement-0.000
 swoop-0.000
where-0.000
 moment-0.000
TokenBilly
Token ID49640
Feature activation+4.30e-03

Pos logit contributions
pins+0.007
unn+0.007
ouses+0.007
ugg+0.007
 Eve+0.007
Neg logit contributions
 matter-0.006
 swoop-0.005
 disagreement-0.005
 beginning-0.005
ho-0.005
Token said
Token ID531
Feature activation+8.98e-01

Pos logit contributions
 matter+0.000
 swoop+0.000
 disagreement+0.000
 beginning+0.000
ho+0.000
Neg logit contributions
pins-0.000
ouses-0.000
uts-0.000
unn-0.000
ests-0.000
Token to
Token ID284
Feature activation+2.70e-01

Pos logit contributions
 matter+0.032
 disagreement+0.028
 beginning+0.027
ho+0.027
 swoop+0.027
Neg logit contributions
pins-0.026
ouses-0.023
 Eve-0.023
ugg-0.023
uts-0.022
Token his
Token ID465
Feature activation+1.33e-01

Pos logit contributions
 matter+0.009
 beginning+0.007
 disagreement+0.007
 swoop+0.007
ho+0.007
Neg logit contributions
pins-0.009
ouses-0.009
uts-0.008
ugg-0.008
unn-0.008
Token mom
Token ID1995
Feature activation-9.27e-02

Pos logit contributions
 matter+0.005
 disagreement+0.004
 beginning+0.004
ho+0.004
 swoop+0.004
Neg logit contributions
pins-0.004
ouses-0.003
ugg-0.003
uts-0.003
 Eve-0.003
Token,
Token ID11
Feature activation+7.17e-02

Pos logit contributions
pins+0.002
 Eve+0.001
uts+0.001
washer+0.001
ugg+0.001
Neg logit contributions
 matter-0.005
 disagreement-0.004
 swoop-0.004
 beginning-0.004
Uh-0.004
Token �
Token ID6184
Feature activation+9.96e-01

Pos logit contributions
 matter+0.003
 disagreement+0.002
 swoop+0.002
 beginning+0.002
ho+0.002
Neg logit contributions
pins-0.002
 Eve-0.002
uts-0.002
ugg-0.002
ouses-0.002
Token�
Token ID95
Feature activation-1.01e-01

Pos logit contributions
 matter+0.028
 disagreement+0.025
 beginning+0.025
ho+0.024
 quarrel+0.024
Neg logit contributions
pins-0.031
unn-0.031
ouses-0.031
ests-0.031
uts-0.030
Token€
Token ID26391
Feature activation-4.00e-02

Pos logit contributions
pins+0.003
unn+0.003
ouses+0.003
uts+0.003
 complexion+0.003
Neg logit contributions
 matter-0.003
 beginning-0.003
 disagreement-0.002
where-0.002
 swoop-0.002
Token can
Token ID460
Feature activation-4.58e-01

Pos logit contributions
uts+0.002
pins+0.002
unn+0.002
washer+0.002
 Eve+0.002
Neg logit contributions
 matter-0.003
 disagreement-0.003
ho-0.003
 beginning-0.003
 total-0.003
Token be
Token ID307
Feature activation+3.52e-01

Pos logit contributions
pins+0.014
uts+0.013
 Eve+0.013
unn+0.013
grand+0.013
Neg logit contributions
 matter-0.014
 beginning-0.012
 disagreement-0.012
ho-0.011
 problem-0.011
Token dangerous
Token ID4923
Feature activation-3.10e-01

Pos logit contributions
ho+0.013
 matter+0.012
where+0.012
 beginning+0.011
 disagreement+0.011
Neg logit contributions
pins-0.007
 Santa-0.007
ouses-0.007
 buck-0.007
,"-0.007
Token!
Token ID0
Feature activation+3.78e-01

Pos logit contributions
uts+0.005
pins+0.004
."+0.004
unn+0.004
,"+0.003
Neg logit contributions
 matter-0.014
 swoop-0.013
 disagreement-0.013
 total-0.013
ah-0.013
Token<|endoftext|>
Token ID50256
Feature activation+2.11e-01

Pos logit contributions
 matter+0.013
 disagreement+0.011
 beginning+0.010
where+0.010
ho+0.010
Neg logit contributions
pins-0.012
ouses-0.011
 complexion-0.011
ugg-0.010
unn-0.010
TokenOnce
Token ID7454
Feature activation+1.05e-01

Pos logit contributions
 matter+0.009
 disagreement+0.008
 beginning+0.007
 swoop+0.007
ho+0.007
Neg logit contributions
pins-0.005
ouses-0.004
 Eve-0.004
ugg-0.004
uts-0.004
Token upon
Token ID2402
Feature activation+4.97e-01

Pos logit contributions
 matter+0.002
 disagreement+0.001
 beginning+0.001
 swoop+0.001
ho+0.001
Neg logit contributions
pins-0.005
ouses-0.005
uts-0.004
ugg-0.004
 Eve-0.004
Token a
Token ID257
Feature activation+2.25e-01

Pos logit contributions
 matter+0.018
 beginning+0.015
 disagreement+0.015
 swoop+0.015
 quarrel+0.014
Neg logit contributions
pins-0.014
ouses-0.013
ugg-0.013
unn-0.013
uts-0.012
Token time
Token ID640
Feature activation-8.47e-02

Pos logit contributions
 matter+0.005
 beginning+0.004
ho+0.004
 swoop+0.004
 forth+0.004
Neg logit contributions
pins-0.009
 Eve-0.009
ugg-0.008
ouses-0.008
uts-0.008
Token,
Token ID11
Feature activation-1.89e-01

Pos logit contributions
pins+0.002
uts+0.002
ouses+0.002
 Eve+0.001
ugg+0.001
Neg logit contributions
 matter-0.004
 beginning-0.003
 disagreement-0.003
 swoop-0.003
ho-0.003
Token there
Token ID612
Feature activation-2.83e-01

Pos logit contributions
pins+0.004
 Eve+0.004
ugg+0.003
ouses+0.003
 The+0.003
Neg logit contributions
 matter-0.008
 disagreement-0.007
 beginning-0.007
 swoop-0.007
ho-0.007
Token car
Token ID1097
Feature activation+8.19e-01

Pos logit contributions
 matter+0.024
ho+0.021
 swoop+0.021
 disagreement+0.021
 impossible+0.020
Neg logit contributions
 Eve-0.008
pins-0.008
ugg-0.007
 buck-0.007
 Santa-0.007
Token.
Token ID13
Feature activation+1.30e-01

Pos logit contributions
 matter+0.026
 beginning+0.022
 disagreement+0.022
ho+0.022
 swoop+0.021
Neg logit contributions
pins-0.026
ouses-0.024
ugg-0.024
unn-0.024
uts-0.023
Token They
Token ID1119
Feature activation-8.06e-01

Pos logit contributions
 matter+0.003
 disagreement+0.003
 total+0.003
What+0.003
 beginning+0.003
Neg logit contributions
unn-0.005
ugg-0.004
pins-0.004
ouses-0.004
grand-0.004
Token were
Token ID547
Feature activation+6.31e-02

Pos logit contributions
uts+0.030
angers+0.030
unn+0.029
ouses+0.029
grand+0.028
Neg logit contributions
 matter-0.019
 swoop-0.019
ho-0.016
 quarrel-0.016
ah-0.015
Token still
Token ID991
Feature activation-1.03e+00

Pos logit contributions
 matter+0.002
 beginning+0.002
 disagreement+0.002
 swoop+0.002
ho+0.002
Neg logit contributions
pins-0.002
ouses-0.002
ugg-0.002
uts-0.002
 Eve-0.002
Token tired
Token ID10032
Feature activation-2.07e+00

Pos logit contributions
pins+0.027
ouses+0.023
uts+0.023
 Eve+0.023
ugg+0.022
Neg logit contributions
 matter-0.041
 disagreement-0.035
ho-0.035
 beginning-0.034
 swoop-0.034
Token and
Token ID290
Feature activation+8.14e-01

Pos logit contributions
pins+0.051
ouses+0.047
ugg+0.045
uts+0.044
 Eve+0.043
Neg logit contributions
 matter-0.083
 disagreement-0.072
 beginning-0.072
ho-0.070
 swoop-0.068
Token frustrated
Token ID14718
Feature activation-4.56e-01

Pos logit contributions
 matter+0.028
 disagreement+0.024
 beginning+0.024
ho+0.023
 swoop+0.023
Neg logit contributions
pins-0.024
ouses-0.022
uts-0.022
ugg-0.022
 Eve-0.022
Token,
Token ID11
Feature activation+1.40e+00

Pos logit contributions
pins+0.010
uts+0.009
 Eve+0.009
ouses+0.008
ugg+0.008
Neg logit contributions
 matter-0.019
 swoop-0.018
 disagreement-0.017
ho-0.017
 beginning-0.016
Token but
Token ID475
Feature activation-1.21e-01

Pos logit contributions
 matter+0.057
 disagreement+0.050
 swoop+0.049
ho+0.049
 beginning+0.048
Neg logit contributions
pins-0.034
uts-0.029
 Eve-0.028
ouses-0.028
ugg-0.026
Token they
Token ID484
Feature activation+4.66e-01

Pos logit contributions
pins+0.004
ouses+0.004
ugg+0.004
unn+0.004
ests+0.003
Neg logit contributions
 matter-0.004
 beginning-0.003
 disagreement-0.003
ho-0.003
 swoop-0.003
Token the
Token ID262
Feature activation-1.41e-01

Pos logit contributions
 matter+0.041
ho+0.037
 swoop+0.037
 disagreement+0.037
 problem+0.035
Neg logit contributions
pins-0.028
 Eve-0.025
 Santa-0.022
ouses-0.021
uts-0.021
Token room
Token ID2119
Feature activation-5.31e-01

Pos logit contributions
pins+0.003
ouses+0.003
 Eve+0.003
ugg+0.003
uts+0.003
Neg logit contributions
 matter-0.006
 swoop-0.005
 disagreement-0.005
ho-0.005
 beginning-0.005
Token.
Token ID13
Feature activation+2.72e-01

Pos logit contributions
pins+0.012
ouses+0.011
uts+0.011
 Eve+0.011
ugg+0.010
Neg logit contributions
 matter-0.023
 disagreement-0.020
 swoop-0.020
 beginning-0.019
ho-0.019
Token On
Token ID1550
Feature activation-8.39e-05

Pos logit contributions
 matter+0.011
 disagreement+0.010
 beginning+0.010
 swoop+0.009
ho+0.009
Neg logit contributions
pins-0.007
ouses-0.006
ugg-0.006
uts-0.005
unn-0.005
Token the
Token ID262
Feature activation+5.74e-02

Pos logit contributions
pins+0.000
 Eve+0.000
ouses+0.000
uts+0.000
ugg+0.000
Neg logit contributions
 matter-0.000
 disagreement-0.000
 beginning-0.000
 swoop-0.000
ho-0.000
Token wall
Token ID3355
Feature activation-2.14e+00

Pos logit contributions
 matter+0.002
 swoop+0.002
 disagreement+0.002
ho+0.002
 beginning+0.002
Neg logit contributions
pins-0.002
ouses-0.001
 Eve-0.001
uts-0.001
ugg-0.001
Token was
Token ID373
Feature activation-7.00e-01

Pos logit contributions
pins+0.046
 Eve+0.041
ouses+0.040
ugg+0.040
uts+0.039
Neg logit contributions
 matter-0.092
 disagreement-0.079
 beginning-0.078
 swoop-0.078
ho-0.076
Token a
Token ID257
Feature activation-5.85e-01

Pos logit contributions
pins+0.014
 Eve+0.012
ouses+0.012
uts+0.011
ugg+0.011
Neg logit contributions
 matter-0.030
 disagreement-0.028
ho-0.027
 swoop-0.027
 beginning-0.026
Token big
Token ID1263
Feature activation-1.05e+00

Pos logit contributions
pins+0.012
 complexion+0.011
 Eve+0.011
washer+0.011
ouses+0.011
Neg logit contributions
 matter-0.022
 swoop-0.021
ho-0.020
urious-0.020
 beginning-0.020
Token black
Token ID2042
Feature activation-7.22e-01

Pos logit contributions
pins+0.026
 Eve+0.023
ouses+0.022
ugg+0.022
 complexion+0.021
Neg logit contributions
 matter-0.040
ho-0.035
 beginning-0.035
 disagreement-0.035
 swoop-0.035
Tokenboard
Token ID3526
Feature activation+4.87e-02

Pos logit contributions
pins+0.015
ugg+0.013
ouses+0.012
 Eve+0.012
uts+0.012
Neg logit contributions
 matter-0.031
 disagreement-0.028
 swoop-0.028
 beginning-0.027
ho-0.027
Token go
Token ID467
Feature activation-2.92e-02

Pos logit contributions
 matter+0.028
ho+0.024
 disagreement+0.024
 beginning+0.024
 problem+0.023
Neg logit contributions
pins-0.031
 Eve-0.028
ugg-0.026
ouses-0.026
uts-0.025
Token home
Token ID1363
Feature activation-2.23e-01

Pos logit contributions
pins+0.001
ouses+0.001
uts+0.001
ugg+0.001
unn+0.001
Neg logit contributions
 matter-0.001
 beginning-0.001
 disagreement-0.001
 swoop-0.001
 problem-0.001
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
pins+0.008
unn+0.007
ugg+0.007
ouses+0.007
ests+0.007
Neg logit contributions
 matter-0.006
 beginning-0.005
 disagreement-0.005
 problem-0.005
 quarrel-0.005
Token She
Token ID1375
Feature activation-6.93e-02

Pos logit contributions
pins+0.009
ouses+0.008
unn+0.008
 Eve+0.008
ugg+0.008
Neg logit contributions
 matter-0.012
 disagreement-0.011
 beginning-0.011
ho-0.010
 swoop-0.010
Token said
Token ID531
Feature activation+2.10e-01

Pos logit contributions
pins+0.002
ouses+0.002
uts+0.002
ugg+0.002
 Eve+0.002
Neg logit contributions
 matter-0.002
 disagreement-0.002
 beginning-0.002
 swoop-0.002
ho-0.002
Token goodbye
Token ID24829
Feature activation-2.05e+00

Pos logit contributions
 matter+0.009
 disagreement+0.008
 beginning+0.008
 swoop+0.007
ho+0.007
Neg logit contributions
pins-0.005
ouses-0.004
uts-0.004
ugg-0.004
 Eve-0.004
Token to
Token ID284
Feature activation+2.72e-01

Pos logit contributions
pins+0.061
ouses+0.056
unn+0.054
ests+0.053
ugg+0.053
Neg logit contributions
 matter-0.067
ho-0.059
 beginning-0.059
 disagreement-0.058
 swoop-0.058
Token the
Token ID262
Feature activation-7.49e-02

Pos logit contributions
 matter+0.010
 disagreement+0.009
ho+0.009
 beginning+0.009
 swoop+0.009
Neg logit contributions
pins-0.007
 Eve-0.006
ouses-0.006
ugg-0.006
uts-0.006
Token monster
Token ID9234
Feature activation-8.46e-03

Pos logit contributions
pins+0.002
washer+0.002
taker+0.002
 Eve+0.002
ugg+0.002
Neg logit contributions
 matter-0.003
 swoop-0.002
ho-0.002
ah-0.002
 disagreement-0.002
Token and
Token ID290
Feature activation-4.46e-01

Pos logit contributions
pins+0.000
 Eve+0.000
uts+0.000
ouses+0.000
ugg+0.000
Neg logit contributions
 matter-0.000
 disagreement-0.000
 beginning-0.000
ho-0.000
 swoop-0.000
Token the
Token ID262
Feature activation+7.44e-01

Pos logit contributions
pins+0.013
ouses+0.012
ugg+0.012
uts+0.012
 Eve+0.012
Neg logit contributions
 matter-0.016
 disagreement-0.013
 beginning-0.013
ho-0.013
 swoop-0.013
Token home
Token ID1363
Feature activation-1.72e-01

Pos logit contributions
pins+0.009
 Eve+0.009
ouses+0.008
 bed+0.008
unn+0.008
Neg logit contributions
 matter-0.014
 swoop-0.013
ho-0.013
ah-0.013
Uh-0.012
Token.
Token ID13
Feature activation-1.20e-01

Pos logit contributions
pins+0.004
ouses+0.004
uts+0.004
 Eve+0.004
ugg+0.003
Neg logit contributions
 matter-0.007
 disagreement-0.006
 swoop-0.006
 beginning-0.006
ho-0.006
Token He
Token ID679
Feature activation-1.70e-01

Pos logit contributions
pins+0.002
ouses+0.002
ugg+0.002
uts+0.002
 Eve+0.002
Neg logit contributions
 matter-0.005
 disagreement-0.005
 beginning-0.005
ho-0.005
 swoop-0.005
Token taught
Token ID7817
Feature activation-8.77e-01

Pos logit contributions
pins+0.005
ouses+0.005
 Eve+0.004
ugg+0.004
uts+0.004
Neg logit contributions
 matter-0.006
 disagreement-0.005
 beginning-0.005
ho-0.005
 swoop-0.005
Token the
Token ID262
Feature activation-7.56e-01

Pos logit contributions
 Eve+0.023
pins+0.022
unn+0.021
 Santa+0.021
uts+0.020
Neg logit contributions
 matter-0.031
 beginning-0.027
ho-0.027
 disagreement-0.027
 swoop-0.026
Token other
Token ID584
Feature activation-2.10e+00

Pos logit contributions
 Eve+0.018
pins+0.017
 buck+0.016
 Santa+0.015
grand+0.015
Neg logit contributions
 matter-0.028
ho-0.026
 forth-0.025
 swoop-0.024
ah-0.024
Token birds
Token ID10087
Feature activation-4.59e-01

Pos logit contributions
 Eve+0.044
pins+0.042
unn+0.038
ouses+0.037
 buck+0.037
Neg logit contributions
 matter-0.084
ho-0.079
 disagreement-0.077
 swoop-0.076
 beginning-0.074
Token that
Token ID326
Feature activation+3.56e-01

Pos logit contributions
pins+0.011
uts+0.009
ouses+0.009
 Eve+0.009
ugg+0.008
Neg logit contributions
 matter-0.019
 disagreement-0.017
ho-0.016
 beginning-0.016
 swoop-0.016
Token it
Token ID340
Feature activation-1.75e-01

Pos logit contributions
 matter+0.013
 disagreement+0.012
 beginning+0.011
 swoop+0.011
ho+0.011
Neg logit contributions
pins-0.009
 Eve-0.008
uts-0.008
ouses-0.008
grand-0.008
Token's
Token ID338
Feature activation-1.48e-01

Pos logit contributions
pins+0.006
uts+0.005
ouses+0.005
 Eve+0.005
ugg+0.005
Neg logit contributions
 matter-0.005
 disagreement-0.004
 beginning-0.004
ho-0.004
 swoop-0.004
Token important
Token ID1593
Feature activation-2.41e-02

Pos logit contributions
pins+0.003
 Eve+0.003
 Santa+0.003
 buck+0.003
 dressed+0.003
Neg logit contributions
ho-0.006
ah-0.005
 matter-0.005
ue-0.005
 swoop-0.005
Tokenelt
Token ID2120
Feature activation+2.70e-01

Pos logit contributions
pins+0.000
ouses+0.000
 Eve+0.000
uts+0.000
ugg+0.000
Neg logit contributions
 matter-0.001
 disagreement-0.001
 beginning-0.001
ho-0.001
 swoop-0.001
Token down
Token ID866
Feature activation-3.33e-01

Pos logit contributions
 matter+0.010
 disagreement+0.008
 quarrel+0.008
ho+0.008
 beginning+0.008
Neg logit contributions
pins-0.008
ouses-0.007
 Eve-0.007
ugg-0.007
 buck-0.007
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
pins+0.006
ugg+0.005
ouses+0.005
uts+0.005
 Eve+0.005
Neg logit contributions
 matter-0.016
 disagreement-0.014
 beginning-0.013
 swoop-0.013
ho-0.013
Token hugged
Token ID41130
Feature activation-8.36e-01

Pos logit contributions
pins+0.007
 Eve+0.006
ouses+0.006
uts+0.006
ugg+0.005
Neg logit contributions
 matter-0.010
 disagreement-0.009
ho-0.009
 problem-0.009
 beginning-0.008
Token her
Token ID607
Feature activation-1.55e+00

Pos logit contributions
pins+0.017
 Eve+0.015
taker+0.012
 Santa+0.012
 The+0.012
Neg logit contributions
 matter-0.036
 disagreement-0.033
 swoop-0.033
 problem-0.033
 beginning-0.031
Token tightly
Token ID17707
Feature activation-2.09e+00

Pos logit contributions
pins+0.028
ouses+0.023
 Eve+0.023
ugg+0.022
uts+0.020
Neg logit contributions
 matter-0.073
 disagreement-0.067
 swoop-0.064
ho-0.063
 beginning-0.062
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
pins+0.045
ouses+0.038
uts+0.038
 Eve+0.037
ugg+0.036
Neg logit contributions
 matter-0.091
 disagreement-0.081
 swoop-0.077
 beginning-0.077
ho-0.077
Token said
Token ID531
Feature activation+6.97e-01

Pos logit contributions
pins+0.023
ouses+0.020
 Eve+0.019
ugg+0.019
uts+0.019
Neg logit contributions
 matter-0.042
 disagreement-0.037
 beginning-0.037
ho-0.036
 swoop-0.035
Token,
Token ID11
Feature activation-2.43e-01

Pos logit contributions
 matter+0.031
 disagreement+0.027
 beginning+0.026
 swoop+0.026
ho+0.025
Neg logit contributions
pins-0.015
ouses-0.013
uts-0.013
ugg-0.013
 Eve-0.013
Token "
Token ID366
Feature activation-6.99e-03

Pos logit contributions
pins+0.007
uts+0.006
 Eve+0.006
ouses+0.006
ugg+0.006
Neg logit contributions
 matter-0.009
 disagreement-0.008
 beginning-0.007
 problem-0.007
 swoop-0.007
TokenIt
Token ID1026
Feature activation+7.19e-01

Pos logit contributions
pins+0.000
ugg+0.000
aid+0.000
 Eve+0.000
uts+0.000
Neg logit contributions
 matter-0.000
 disagreement-0.000
 problem-0.000
 quarrel-0.000
 beginning-0.000