TOP ACTIVATIONS
MAX = 0.094

and how Ben wanted to chop the tree and how Sam
, " Boo ! Here I am !" L
to herself , " should I follow the path ?" She
mum used the axe to chop the vegetables for their breakfast
!" said Sammy . " These are my nuts . You
crown and playing in the freshly cut grass . It looked
leaves from the garden and put them on the itch .
Tom used his axe to chop some leaves and tw igs
ty said . Bunny suggested putting it in his bur row
someone using an axe to chop a tree . Stanley went
. The rabbit helped Tom put the carrots on a table
so he got ready to chop it down . He stood
a little boy decided to put on his special outfit .
went to the forest to pick some berries . She saw
out into the woods to chop timber . He had to
you can do anything ! <|endoftext|> Once upon a time ,
go to the pond and put the boat in the water
He found a carrot and put it in the trap .
pick up the pieces and put them back together .
Ta - da ! Here I am !" Mom cl apped

MIN ACTIVATIONS
MIN = -0.088

. Everyone thought she was rather curious . One
. It made Mom my very happy that her little girl
the yog is and was so proud of herself for doing
and smiled . She was proud of her daughter . She
Z oe 's mom was very proud of her and she
." Lily smiled and felt proud of herself .
L ily 's mom was proud of her for making a
ily 's mom my was proud of her for remembering to
!" Lily smiled and felt proud of herself . She replied
." Lily smiled and felt proud of herself . She wanted
. Lily smiled and felt proud . " Thank you ,
." Lily smiled and felt proud of her work . She
Lily smiled and felt very proud of herself . <|endoftext|> Once
mom my and daddy were proud of her for loving the
mom my and daddy were proud of her for being so
happy , and Lily felt proud . As a treat ,
all day . They were very happy to have found a
and smiled . She was proud of them . She said
liked counting and he was very good at it . He
!" Sue smiled and felt proud . From that day on

INTERVAL 0.049 - 0.094
CONTAINS 2.142%

your health for granted . <|endoftext|> Once upon a time ,
the park . They liked to find sticks and pretend they
let â € ™ s bury a treasure box ! â
Tim my realized that sometimes it 's fun to hide ,
it . He takes the carrot and goes to

INTERVAL 0.003 - 0.049
CONTAINS 43.572%

On this farm lived a happy hen . The hen had
B ella closed her eyes and sn ugg led with her
. Suddenly the pictures in the book started to come alive
to travel with her family . One day , they decided
everyone . <|endoftext|> Sam and his mom were walking through the

INTERVAL -0.043 - 0.003
CONTAINS 50.288%

sit down . Her friends saw her and
veil . She found a pen and wrote " S ue
it . They had a lot of fun .
'm going to a costume party later ." Lily listened to
boy who owned it very happy . One day

INTERVAL -0.088 - -0.043
CONTAINS 3.999%

problems . He was feeling very worried and did not know
things . <|endoftext|> Once upon a time , there was a
play , but Tom was not happy . He wanted to
Fl opsy . He was very silly and always wanted to
away and she was feeling very sad . She decided to
Token and
Token ID290
Feature activation+1.57e-02

Pos logit contributions
 meets+0.053
izons+0.052
 valleys+0.051
 forests+0.048
ators+0.047
Neg logit contributions
 badly-0.028
mmm-0.022
 dear-0.020
 understanding-0.019
 importantly-0.019
Token how
Token ID703
Feature activation+5.40e-03

Pos logit contributions
izons+0.029
 meets+0.028
 valleys+0.028
ips+0.026
sey+0.026
Neg logit contributions
 badly-0.027
 understanding-0.026
 dear-0.026
mmm-0.025
 importantly-0.024
Token Ben
Token ID3932
Feature activation+2.74e-02

Pos logit contributions
izons+0.010
 meets+0.010
 valleys+0.010
ators+0.010
sey+0.009
Neg logit contributions
 badly-0.010
mmm-0.009
 dear-0.008
 importantly-0.008
 understanding-0.008
Token wanted
Token ID2227
Feature activation+2.63e-02

Pos logit contributions
 valleys+0.056
izons+0.054
 meets+0.054
 forests+0.052
 bridges+0.052
Neg logit contributions
 badly-0.043
mmm-0.041
 dear-0.039
 persuasive-0.036
 understanding-0.035
Token to
Token ID284
Feature activation+4.09e-02

Pos logit contributions
 valleys+0.053
izons+0.052
 meets+0.051
ators+0.051
 driveway+0.048
Neg logit contributions
 badly-0.043
mmm-0.038
 dear-0.038
 caring-0.037
 understanding-0.036
Token chop
Token ID30506
Feature activation+9.42e-02

Pos logit contributions
 meets+0.066
 valleys+0.065
izons+0.063
 forests+0.059
ators+0.058
Neg logit contributions
mmm-0.082
 badly-0.082
 dear-0.076
 importantly-0.072
 understanding-0.072
Token the
Token ID262
Feature activation+5.49e-02

Pos logit contributions
 meets+0.189
izons+0.179
 driveway+0.165
 valleys+0.163
ators+0.159
Neg logit contributions
 badly-0.169
mmm-0.155
 dear-0.132
 because-0.132
ittens-0.130
Token tree
Token ID5509
Feature activation+3.22e-02

Pos logit contributions
 meets+0.104
izons+0.103
 valleys+0.096
ips+0.095
ators+0.094
Neg logit contributions
 badly-0.098
 dear-0.088
mmm-0.086
 painful-0.083
 understanding-0.080
Token and
Token ID290
Feature activation+2.28e-02

Pos logit contributions
 meets+0.078
izons+0.076
 valleys+0.074
 forests+0.069
ators+0.069
Neg logit contributions
 badly-0.043
mmm-0.036
 dear-0.031
 caring-0.030
 understanding-0.030
Token how
Token ID703
Feature activation+7.79e-03

Pos logit contributions
izons+0.044
ips+0.042
 meets+0.042
 valleys+0.041
ators+0.041
Neg logit contributions
 badly-0.040
 understanding-0.037
 dear-0.036
 caring-0.035
mmm-0.033
Token Sam
Token ID3409
Feature activation+3.55e-02

Pos logit contributions
izons+0.015
 meets+0.015
 valleys+0.014
ators+0.013
sey+0.013
Neg logit contributions
 badly-0.014
mmm-0.013
 dear-0.012
 importantly-0.012
 understanding-0.012
Token,
Token ID11
Feature activation-9.42e-03

Pos logit contributions
mmm+0.024
 badly+0.022
 importantly+0.022
 dear+0.021
ittens+0.020
Neg logit contributions
 meets-0.025
 valleys-0.024
izons-0.023
 forests-0.022
 mountains-0.022
Token "
Token ID366
Feature activation+2.22e-02

Pos logit contributions
mmm+0.014
 badly+0.013
 dear+0.012
 importantly+0.012
 understanding+0.012
Neg logit contributions
 meets-0.020
izons-0.019
 valleys-0.019
ators-0.018
 ease-0.018
TokenBoo
Token ID46120
Feature activation-1.71e-02

Pos logit contributions
 meets+0.039
 valleys+0.038
izons+0.037
ators+0.037
 mountains+0.036
Neg logit contributions
mmm-0.039
 badly-0.037
ww-0.035
From-0.034
 importantly-0.034
Token!
Token ID0
Feature activation+2.65e-02

Pos logit contributions
mmm+0.020
 badly+0.019
 dear+0.017
 importantly+0.016
 understanding+0.015
Neg logit contributions
 meets-0.042
 valleys-0.041
izons-0.040
 ease-0.038
 forests-0.038
Token Here
Token ID3423
Feature activation+1.91e-02

Pos logit contributions
 valleys+0.053
 meets+0.052
izons+0.050
ators+0.050
 forests+0.049
Neg logit contributions
mmm-0.043
 badly-0.040
 dear-0.039
 importantly-0.037
ww-0.035
Token I
Token ID314
Feature activation+9.41e-02

Pos logit contributions
 meets+0.039
izons+0.037
 valleys+0.037
 forests+0.034
 ease+0.034
Neg logit contributions
mmm-0.030
 badly-0.029
 importantly-0.027
 dear-0.027
 understanding-0.025
Token am
Token ID716
Feature activation-8.97e-03

Pos logit contributions
 valleys+0.177
 forests+0.161
 mountains+0.157
izons+0.157
iece+0.157
Neg logit contributions
mmm-0.147
 badly-0.145
 dear-0.142
ww-0.135
 persuasive-0.134
Token!"
Token ID2474
Feature activation+1.01e-03

Pos logit contributions
mmm+0.012
 badly+0.012
 importantly+0.011
 dear+0.011
 understanding+0.010
Neg logit contributions
 meets-0.020
izons-0.019
 valleys-0.019
 ease-0.018
 forests-0.018
Token
Token ID198
Feature activation-5.40e-03

Pos logit contributions
 meets+0.002
izons+0.002
 valleys+0.002
ators+0.002
 forests+0.002
Neg logit contributions
mmm-0.002
 badly-0.002
 dear-0.002
 importantly-0.002
 understanding-0.002
Token
Token ID198
Feature activation-1.69e-02

Pos logit contributions
mmm+0.010
 badly+0.009
 importantly+0.009
 dear+0.009
 understanding+0.008
Neg logit contributions
 meets-0.010
izons-0.009
 valleys-0.009
ators-0.009
 forests-0.009
TokenL
Token ID43
Feature activation-2.65e-03

Pos logit contributions
mmm+0.023
 badly+0.021
 dear+0.020
 importantly+0.020
 understanding+0.018
Neg logit contributions
 meets-0.039
 valleys-0.038
izons-0.037
 forests-0.035
iece-0.035
Token to
Token ID284
Feature activation+2.96e-03

Pos logit contributions
mmm+0.059
 badly+0.056
 dear+0.053
 importantly+0.053
ww+0.049
Neg logit contributions
 meets-0.061
 valleys-0.058
izons-0.058
 forests-0.053
 ease-0.053
Token herself
Token ID5223
Feature activation-3.70e-02

Pos logit contributions
 meets+0.006
 valleys+0.006
ips+0.006
 bridges+0.006
 driveway+0.006
Neg logit contributions
mmm-0.004
 understanding-0.004
 badly-0.004
 dear-0.004
 caring-0.004
Token,
Token ID11
Feature activation-2.69e-02

Pos logit contributions
mmm+0.062
 badly+0.057
 dear+0.055
 importantly+0.055
ww+0.053
Neg logit contributions
 meets-0.071
izons-0.069
 valleys-0.069
sey-0.065
iece-0.064
Token "
Token ID366
Feature activation+3.26e-02

Pos logit contributions
mmm+0.048
 badly+0.045
 dear+0.043
 importantly+0.043
ww+0.041
Neg logit contributions
 meets-0.049
 valleys-0.048
izons-0.047
 forests-0.044
sey-0.044
Tokenshould
Token ID21754
Feature activation+1.04e-02

Pos logit contributions
 meets+0.066
izons+0.062
 finds+0.060
 ease+0.060
 valleys+0.060
Neg logit contributions
mmm-0.050
 badly-0.047
From-0.044
So-0.041
S-0.041
Token I
Token ID314
Feature activation+9.35e-02

Pos logit contributions
izons+0.028
 valleys+0.028
 meets+0.028
iece+0.027
ASH+0.027
Neg logit contributions
 dear-0.010
 badly-0.010
mmm-0.010
 importantly-0.008
 understanding-0.007
Token follow
Token ID1061
Feature activation+2.65e-02

Pos logit contributions
 valleys+0.157
 meets+0.153
izons+0.152
 forests+0.142
 bridges+0.142
Neg logit contributions
mmm-0.176
 badly-0.175
 dear-0.159
 understanding-0.151
 persuasive-0.150
Token the
Token ID262
Feature activation+3.41e-02

Pos logit contributions
 meets+0.052
izons+0.051
 valleys+0.050
iece+0.047
ASH+0.046
Neg logit contributions
mmm-0.045
 badly-0.044
 dear-0.040
 understanding-0.037
 caring-0.036
Token path
Token ID3108
Feature activation+1.58e-02

Pos logit contributions
 meets+0.060
izons+0.059
 valleys+0.056
ators+0.055
iece+0.053
Neg logit contributions
 badly-0.062
mmm-0.061
 dear-0.059
 importantly-0.057
 understanding-0.057
Token?"
Token ID1701
Feature activation-8.41e-03

Pos logit contributions
izons+0.033
 meets+0.032
 valleys+0.031
sey+0.029
ASH+0.029
Neg logit contributions
 badly-0.025
mmm-0.023
 dear-0.021
 understanding-0.019
 importantly-0.019
Token She
Token ID1375
Feature activation-9.32e-03

Pos logit contributions
mmm+0.014
 badly+0.012
 dear+0.012
 importantly+0.012
 understanding+0.012
Neg logit contributions
 meets-0.017
 valleys-0.016
ators-0.016
izons-0.016
 bridges-0.016
Token mum
Token ID25682
Feature activation+4.69e-03

Pos logit contributions
mmm+0.009
 badly+0.008
 dear+0.008
 importantly+0.008
 understanding+0.007
Neg logit contributions
 meets-0.014
 valleys-0.014
izons-0.014
ators-0.013
iece-0.013
Token used
Token ID973
Feature activation+3.70e-02

Pos logit contributions
 meets+0.009
izons+0.008
 valleys+0.008
ASH+0.008
 forests+0.008
Neg logit contributions
mmm-0.008
 badly-0.008
 dear-0.007
 importantly-0.007
 understanding-0.007
Token the
Token ID262
Feature activation+4.11e-02

Pos logit contributions
 meets+0.069
izons+0.067
 valleys+0.063
iece+0.061
ators+0.060
Neg logit contributions
 badly-0.065
mmm-0.063
 understanding-0.060
 caring-0.060
 dear-0.059
Token axe
Token ID23272
Feature activation+2.25e-02

Pos logit contributions
 meets+0.078
izons+0.077
 valleys+0.074
ators+0.070
iece+0.069
Neg logit contributions
mmm-0.070
 badly-0.068
 dear-0.066
 importantly-0.062
 understanding-0.062
Token to
Token ID284
Feature activation+4.65e-02

Pos logit contributions
 meets+0.046
izons+0.044
 valleys+0.042
 forests+0.039
 parks+0.039
Neg logit contributions
 badly-0.037
mmm-0.036
 dear-0.033
 caring-0.031
 understanding-0.031
Token chop
Token ID30506
Feature activation+9.33e-02

Pos logit contributions
 meets+0.088
izons+0.082
 valleys+0.081
iece+0.077
ips+0.077
Neg logit contributions
mmm-0.086
 badly-0.078
 dear-0.073
 understanding-0.073
 caring-0.072
Token the
Token ID262
Feature activation+5.34e-02

Pos logit contributions
 meets+0.190
izons+0.181
 driveway+0.167
 hall+0.159
iece+0.157
Neg logit contributions
mmm-0.160
 badly-0.143
 sweet-0.127
 dear-0.124
 because-0.122
Token vegetables
Token ID13701
Feature activation+3.30e-02

Pos logit contributions
 meets+0.099
izons+0.097
 valleys+0.091
ips+0.090
ators+0.087
Neg logit contributions
mmm-0.093
 badly-0.092
 dear-0.088
 painful-0.081
 importantly-0.080
Token for
Token ID329
Feature activation+2.02e-02

Pos logit contributions
 meets+0.072
izons+0.070
 valleys+0.067
 forests+0.063
ators+0.063
Neg logit contributions
mmm-0.049
 badly-0.047
 dear-0.041
 importantly-0.040
 understanding-0.038
Token their
Token ID511
Feature activation+5.48e-03

Pos logit contributions
 meets+0.033
izons+0.031
 valleys+0.030
ators+0.028
sey+0.028
Neg logit contributions
mmm-0.042
 badly-0.040
 dear-0.038
 importantly-0.037
 understanding-0.036
Token breakfast
Token ID12607
Feature activation+4.67e-03

Pos logit contributions
 meets+0.010
izons+0.009
 valleys+0.009
iece+0.008
ators+0.008
Neg logit contributions
mmm-0.010
 badly-0.010
 dear-0.009
 importantly-0.009
 understanding-0.009
Token!"
Token ID2474
Feature activation-2.17e-02

Pos logit contributions
mmm+0.036
 badly+0.035
 importantly+0.035
 persuasive+0.032
From+0.032
Neg logit contributions
 meets-0.037
izons-0.036
 ease-0.034
sey-0.034
 ones-0.034
Token said
Token ID531
Feature activation-2.18e-02

Pos logit contributions
mmm+0.030
 badly+0.029
 importantly+0.027
 dear+0.026
 enjoyable+0.024
Neg logit contributions
 meets-0.047
 valleys-0.045
izons-0.044
 ease-0.043
 mountains-0.043
Token Sammy
Token ID44067
Feature activation-3.13e-02

Pos logit contributions
mmm+0.042
 badly+0.040
 importantly+0.038
 dear+0.038
ww+0.035
Neg logit contributions
 meets-0.036
izons-0.035
 valleys-0.035
 ease-0.033
sey-0.032
Token.
Token ID13
Feature activation+2.32e-05

Pos logit contributions
mmm+0.054
 badly+0.049
 importantly+0.048
 dear+0.047
ww+0.044
Neg logit contributions
 meets-0.059
izons-0.058
 valleys-0.057
sey-0.055
iece-0.053
Token "
Token ID366
Feature activation+3.50e-02

Pos logit contributions
 meets+0.000
 valleys+0.000
izons+0.000
 forests+0.000
 ease+0.000
Neg logit contributions
mmm-0.000
 badly-0.000
 dear-0.000
 importantly-0.000
 understanding-0.000
TokenThese
Token ID4711
Feature activation+9.20e-02

Pos logit contributions
 meets+0.066
izons+0.063
 valleys+0.063
 forests+0.058
 mountains+0.058
Neg logit contributions
mmm-0.061
 badly-0.059
 importantly-0.054
 dear-0.053
 understanding-0.051
Token are
Token ID389
Feature activation-4.55e-03

Pos logit contributions
izons+0.160
 meets+0.154
 valleys+0.152
iece+0.150
ators+0.147
Neg logit contributions
 badly-0.174
mmm-0.170
 dear-0.164
 importantly-0.149
 caring-0.148
Token my
Token ID616
Feature activation+3.84e-02

Pos logit contributions
mmm+0.007
 importantly+0.007
 badly+0.006
 dear+0.006
ittens+0.006
Neg logit contributions
 meets-0.009
 valleys-0.008
 anywhere-0.008
 ease-0.008
 bridges-0.008
Token nuts
Token ID14380
Feature activation-2.45e-03

Pos logit contributions
izons+0.080
 valleys+0.077
 meets+0.077
iece+0.076
ators+0.074
Neg logit contributions
 badly-0.060
 dear-0.057
mmm-0.057
 importantly-0.052
 caring-0.051
Token.
Token ID13
Feature activation+2.18e-02

Pos logit contributions
mmm+0.004
 badly+0.003
 importantly+0.003
 dear+0.003
 understanding+0.003
Neg logit contributions
 meets-0.005
 valleys-0.005
izons-0.005
 ease-0.005
 forests-0.005
Token You
Token ID921
Feature activation+5.45e-02

Pos logit contributions
 meets+0.037
izons+0.035
 valleys+0.034
 parks+0.031
 mountains+0.031
Neg logit contributions
 badly-0.040
mmm-0.040
 importantly-0.037
 dear-0.037
 understanding-0.036
Token crown
Token ID12389
Feature activation+4.59e-03

Pos logit contributions
 meets+0.016
izons+0.016
 valleys+0.015
ators+0.014
ASH+0.014
Neg logit contributions
 badly-0.015
mmm-0.014
 dear-0.014
 caring-0.013
 understanding-0.013
Token and
Token ID290
Feature activation-2.49e-02

Pos logit contributions
 meets+0.009
izons+0.009
 valleys+0.009
 forests+0.008
ators+0.008
Neg logit contributions
mmm-0.008
 badly-0.008
 dear-0.007
 importantly-0.006
 understanding-0.006
Token playing
Token ID2712
Feature activation+3.25e-02

Pos logit contributions
 dear+0.038
 caring+0.037
 understanding+0.037
mmm+0.037
 importantly+0.036
Neg logit contributions
izons-0.050
 meets-0.049
 valleys-0.049
ators-0.047
neath-0.047
Token in
Token ID287
Feature activation+1.39e-02

Pos logit contributions
 meets+0.078
izons+0.076
 valleys+0.072
 driveway+0.071
ators+0.071
Neg logit contributions
 badly-0.041
mmm-0.038
 understanding-0.034
 caring-0.034
 sweet-0.033
Token the
Token ID262
Feature activation+4.15e-02

Pos logit contributions
 meets+0.030
izons+0.029
rays+0.027
iece+0.027
ators+0.026
Neg logit contributions
 badly-0.021
 understanding-0.019
mmm-0.019
 sweet-0.019
 caring-0.018
Token freshly
Token ID29026
Feature activation+9.14e-02

Pos logit contributions
 meets+0.064
izons+0.064
ators+0.059
ips+0.056
iece+0.056
Neg logit contributions
 badly-0.079
 sweet-0.078
mmm-0.078
 dear-0.077
ww-0.075
Token cut
Token ID2005
Feature activation+3.34e-02

Pos logit contributions
 valleys+0.156
izons+0.155
 meets+0.155
 ease+0.150
 asleep+0.148
Neg logit contributions
mmm-0.145
ww-0.137
 sweet-0.136
 caring-0.135
 dear-0.133
Token grass
Token ID8701
Feature activation-1.09e-02

Pos logit contributions
 meets+0.076
izons+0.068
 valleys+0.066
 driveway+0.066
ators+0.064
Neg logit contributions
 badly-0.048
mmm-0.046
 dear-0.044
 sweet-0.041
isy-0.039
Token.
Token ID13
Feature activation-8.32e-03

Pos logit contributions
 badly+0.014
mmm+0.012
 dear+0.010
 sweet+0.010
 because+0.010
Neg logit contributions
 meets-0.028
izons-0.026
 valleys-0.025
 ease-0.024
ASH-0.023
Token It
Token ID632
Feature activation+2.78e-02

Pos logit contributions
mmm+0.014
 badly+0.013
 dear+0.013
 importantly+0.012
 understanding+0.012
Neg logit contributions
 meets-0.016
 valleys-0.016
izons-0.015
ators-0.014
 forests-0.014
Token looked
Token ID3114
Feature activation-2.79e-02

Pos logit contributions
 meets+0.044
izons+0.044
 valleys+0.043
ips+0.040
iece+0.039
Neg logit contributions
mmm-0.053
 badly-0.052
 dear-0.049
 caring-0.049
 importantly-0.048
Token leaves
Token ID5667
Feature activation+2.55e-02

Pos logit contributions
 meets+0.036
 valleys+0.034
izons+0.034
 ease+0.031
 parks+0.031
Neg logit contributions
mmm-0.047
 badly-0.044
 importantly-0.042
 dear-0.042
 understanding-0.040
Token from
Token ID422
Feature activation+9.49e-03

Pos logit contributions
 meets+0.062
izons+0.060
 valleys+0.058
 forests+0.055
ators+0.054
Neg logit contributions
 badly-0.034
mmm-0.031
 dear-0.026
 understanding-0.025
 importantly-0.024
Token the
Token ID262
Feature activation+5.30e-02

Pos logit contributions
 meets+0.019
izons+0.018
iece+0.017
 valleys+0.017
ips+0.017
Neg logit contributions
 badly-0.016
mmm-0.014
 understanding-0.013
 dear-0.013
 caring-0.013
Token garden
Token ID11376
Feature activation-3.84e-03

Pos logit contributions
 meets+0.082
izons+0.080
 valleys+0.076
ators+0.074
iece+0.071
Neg logit contributions
 badly-0.112
 dear-0.105
mmm-0.104
 painful-0.099
 understanding-0.098
Token and
Token ID290
Feature activation+4.61e-03

Pos logit contributions
 badly+0.005
mmm+0.004
 dear+0.004
 understanding+0.004
 importantly+0.004
Neg logit contributions
 meets-0.009
izons-0.009
 valleys-0.009
 ease-0.008
sey-0.008
Token put
Token ID1234
Feature activation+9.09e-02

Pos logit contributions
 valleys+0.009
izons+0.009
 meets+0.009
ips+0.009
ators+0.008
Neg logit contributions
 badly-0.007
mmm-0.007
 dear-0.007
 understanding-0.007
 caring-0.007
Token them
Token ID606
Feature activation+2.31e-02

Pos logit contributions
 meets+0.161
izons+0.158
 driveway+0.147
iece+0.146
 hall+0.144
Neg logit contributions
 badly-0.164
 sweet-0.156
mmm-0.150
 caring-0.143
 juicy-0.143
Token on
Token ID319
Feature activation+1.28e-02

Pos logit contributions
 meets+0.041
izons+0.040
 valleys+0.037
 forests+0.034
 nursery+0.034
Neg logit contributions
 badly-0.045
mmm-0.038
 sweet-0.037
 understanding-0.036
 caring-0.036
Token the
Token ID262
Feature activation+4.87e-02

Pos logit contributions
 meets+0.024
izons+0.022
 valleys+0.021
iece+0.020
sey+0.019
Neg logit contributions
 badly-0.024
mmm-0.023
 dear-0.021
 caring-0.020
 understanding-0.020
Token itch
Token ID49700
Feature activation+2.40e-02

Pos logit contributions
 meets+0.068
izons+0.064
 valleys+0.063
ators+0.057
 forests+0.055
Neg logit contributions
 badly-0.108
mmm-0.107
 dear-0.104
 importantly-0.097
 understanding-0.097
Token.
Token ID13
Feature activation+9.02e-03

Pos logit contributions
 meets+0.056
izons+0.053
 valleys+0.053
 parks+0.049
 forests+0.049
Neg logit contributions
 badly-0.033
mmm-0.029
 dear-0.026
 importantly-0.025
ittens-0.024
TokenTom
Token ID13787
Feature activation+8.32e-03

Pos logit contributions
mmm+0.045
 dear+0.044
 importantly+0.044
 badly+0.044
 darling+0.040
Neg logit contributions
 meets-0.049
 valleys-0.047
 bridges-0.046
 parks-0.046
izons-0.045
Token used
Token ID973
Feature activation+3.96e-02

Pos logit contributions
 meets+0.015
 valleys+0.015
izons+0.015
 forests+0.014
ators+0.013
Neg logit contributions
mmm-0.015
 badly-0.014
 dear-0.013
 importantly-0.013
 understanding-0.012
Token his
Token ID465
Feature activation+4.88e-02

Pos logit contributions
 meets+0.079
izons+0.076
iece+0.071
 valleys+0.070
ators+0.070
Neg logit contributions
 badly-0.065
mmm-0.062
 caring-0.060
 understanding-0.060
 dear-0.059
Token axe
Token ID23272
Feature activation+2.44e-02

Pos logit contributions
 meets+0.093
izons+0.090
 valleys+0.089
ators+0.083
 forests+0.082
Neg logit contributions
mmm-0.083
 badly-0.080
 dear-0.076
 importantly-0.074
 understanding-0.071
Token to
Token ID284
Feature activation+3.93e-02

Pos logit contributions
 meets+0.049
izons+0.048
 valleys+0.046
ators+0.043
 parks+0.041
Neg logit contributions
 badly-0.042
mmm-0.037
 dear-0.036
 understanding-0.035
 caring-0.035
Token chop
Token ID30506
Feature activation+9.09e-02

Pos logit contributions
 meets+0.074
izons+0.069
 valleys+0.067
ators+0.064
ips+0.062
Neg logit contributions
mmm-0.071
 badly-0.070
 dear-0.065
 understanding-0.065
 caring-0.063
Token some
Token ID617
Feature activation+2.16e-02

Pos logit contributions
 meets+0.179
izons+0.173
 driveway+0.162
iece+0.154
 hallway+0.154
Neg logit contributions
mmm-0.150
 badly-0.148
 sweet-0.131
ittens-0.128
 caring-0.128
Token leaves
Token ID5667
Feature activation+2.59e-02

Pos logit contributions
 meets+0.032
izons+0.031
 valleys+0.029
ators+0.027
sey+0.027
Neg logit contributions
mmm-0.046
 badly-0.046
 dear-0.043
 importantly-0.043
 understanding-0.041
Token and
Token ID290
Feature activation+6.36e-03

Pos logit contributions
 meets+0.058
izons+0.055
 valleys+0.053
ators+0.051
 forests+0.050
Neg logit contributions
 badly-0.038
mmm-0.037
 dear-0.032
 importantly-0.031
 understanding-0.030
Token tw
Token ID665
Feature activation+2.79e-02

Pos logit contributions
izons+0.012
 meets+0.012
ips+0.011
ators+0.011
sey+0.011
Neg logit contributions
mmm-0.011
 dear-0.010
 caring-0.010
 understanding-0.010
 badly-0.010
Tokenigs
Token ID9235
Feature activation+2.67e-02

Pos logit contributions
 meets+0.051
izons+0.049
 valleys+0.049
 forests+0.045
ators+0.045
Neg logit contributions
mmm-0.051
 badly-0.048
 dear-0.045
 importantly-0.044
ittens-0.043
Tokenty
Token ID774
Feature activation-3.60e-02

Pos logit contributions
mmm+0.040
 importantly+0.037
 dear+0.037
 badly+0.036
 enjoyable+0.035
Neg logit contributions
sey-0.023
izons-0.023
 meets-0.023
 valleys-0.020
 ease-0.019
Token said
Token ID531
Feature activation-3.75e-02

Pos logit contributions
mmm+0.067
 importantly+0.058
 enjoyable+0.054
ww+0.054
 badly+0.053
Neg logit contributions
 meets-0.065
sey-0.063
izons-0.062
 valleys-0.059
iece-0.058
Token.
Token ID13
Feature activation+1.48e-02

Pos logit contributions
mmm+0.082
ww+0.075
 importantly+0.072
www+0.070
 dear+0.068
Neg logit contributions
izons-0.050
iece-0.049
 meets-0.048
 out-0.048
 asleep-0.047
Token Bunny
Token ID28683
Feature activation-3.90e-03

Pos logit contributions
 meets+0.025
 valleys+0.023
izons+0.023
sey+0.021
 forests+0.021
Neg logit contributions
mmm-0.029
 badly-0.028
 importantly-0.026
 dear-0.026
 understanding-0.025
Token suggested
Token ID5220
Feature activation-2.06e-02

Pos logit contributions
mmm+0.008
 badly+0.007
 importantly+0.007
 dear+0.007
ww+0.006
Neg logit contributions
 meets-0.007
izons-0.006
 valleys-0.006
sey-0.006
 forests-0.006
Token putting
Token ID5137
Feature activation+9.07e-02

Pos logit contributions
mmm+0.034
 importantly+0.030
 badly+0.030
 dear+0.030
ww+0.029
Neg logit contributions
 meets-0.039
izons-0.038
 valleys-0.037
 ease-0.035
 mountains-0.035
Token it
Token ID340
Feature activation+3.66e-02

Pos logit contributions
 meets+0.184
izons+0.177
iece+0.169
 valleys+0.163
 driveway+0.162
Neg logit contributions
 badly-0.151
mmm-0.137
 caring-0.131
 sweet-0.128
 understanding-0.128
Token in
Token ID287
Feature activation+2.57e-02

Pos logit contributions
izons+0.066
 meets+0.065
ric+0.059
 valleys+0.058
rob+0.057
Neg logit contributions
 badly-0.071
 sweet-0.055
 because-0.055
 understanding-0.055
 grateful-0.055
Token his
Token ID465
Feature activation+5.63e-02

Pos logit contributions
 meets+0.044
izons+0.042
 valleys+0.040
iece+0.038
ators+0.038
Neg logit contributions
 badly-0.050
mmm-0.049
 importantly-0.045
 dear-0.045
 understanding-0.044
Token bur
Token ID4356
Feature activation+2.07e-02

Pos logit contributions
izons+0.095
 meets+0.093
 valleys+0.092
ators+0.090
iece+0.089
Neg logit contributions
 badly-0.107
 dear-0.103
mmm-0.101
 importantly-0.094
 understanding-0.092
Tokenrow
Token ID808
Feature activation+2.79e-02

Pos logit contributions
 meets+0.044
izons+0.042
 valleys+0.041
ators+0.039
 parks+0.038
Neg logit contributions
 badly-0.031
mmm-0.031
 dear-0.027
ittens-0.027
 importantly-0.027
Token someone
Token ID2130
Feature activation-2.43e-03

Pos logit contributions
 meets+0.060
izons+0.058
 valleys+0.056
ators+0.056
 driveway+0.056
Neg logit contributions
 badly-0.046
mmm-0.043
 dear-0.042
 understanding-0.040
 importantly-0.040
Token using
Token ID1262
Feature activation+6.27e-02

Pos logit contributions
 badly+0.004
mmm+0.004
 importantly+0.004
 dear+0.004
 understanding+0.004
Neg logit contributions
 meets-0.004
 valleys-0.004
izons-0.004
ators-0.004
ASH-0.004
Token an
Token ID281
Feature activation+6.33e-03

Pos logit contributions
 meets+0.125
izons+0.120
 valleys+0.120
ators+0.116
iece+0.112
Neg logit contributions
 badly-0.105
mmm-0.097
 dear-0.094
 understanding-0.094
 caring-0.089
Token axe
Token ID23272
Feature activation+2.07e-02

Pos logit contributions
 meets+0.008
 ease+0.007
sey+0.007
 mountains+0.007
 bottom+0.007
Neg logit contributions
mmm-0.014
 importantly-0.013
ww-0.013
 badly-0.013
From-0.012
Token to
Token ID284
Feature activation+5.25e-02

Pos logit contributions
 meets+0.048
 valleys+0.047
izons+0.046
 forests+0.043
 ease+0.043
Neg logit contributions
 badly-0.028
mmm-0.024
 dear-0.023
 importantly-0.022
 understanding-0.021
Token chop
Token ID30506
Feature activation+9.05e-02

Pos logit contributions
 meets+0.095
 valleys+0.091
izons+0.089
ators+0.085
sey+0.085
Neg logit contributions
mmm-0.096
 badly-0.095
 understanding-0.089
 dear-0.088
 caring-0.084
Token a
Token ID257
Feature activation+2.16e-02

Pos logit contributions
 meets+0.178
izons+0.168
 valleys+0.154
 nursery+0.150
 driveway+0.149
Neg logit contributions
 badly-0.158
mmm-0.154
 dear-0.130
 because-0.125
ittens-0.125
Token tree
Token ID5509
Feature activation+2.41e-02

Pos logit contributions
 meets+0.036
izons+0.035
 valleys+0.034
ators+0.032
ips+0.031
Neg logit contributions
 badly-0.043
mmm-0.041
 dear-0.039
 importantly-0.038
 understanding-0.037
Token.
Token ID13
Feature activation+3.28e-02

Pos logit contributions
 meets+0.055
izons+0.053
 valleys+0.052
 ease+0.049
 forests+0.049
Neg logit contributions
 badly-0.033
mmm-0.032
 dear-0.027
 importantly-0.027
 understanding-0.026
Token Stanley
Token ID14167
Feature activation-8.93e-03

Pos logit contributions
 meets+0.060
 valleys+0.057
izons+0.057
 forests+0.053
 mountains+0.053
Neg logit contributions
mmm-0.059
 badly-0.057
 importantly-0.054
 dear-0.053
ittens-0.050
Token went
Token ID1816
Feature activation+1.91e-02

Pos logit contributions
mmm+0.016
 badly+0.016
 dear+0.015
 importantly+0.015
 understanding+0.014
Neg logit contributions
 meets-0.016
 valleys-0.015
izons-0.015
ators-0.014
 forests-0.014
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
 meets+0.044
 valleys+0.042
izons+0.041
 forests+0.039
 mountains+0.038
Neg logit contributions
mmm-0.038
 badly-0.036
 importantly-0.034
 dear-0.034
 understanding-0.031
Token The
Token ID383
Feature activation+3.37e-02

Pos logit contributions
 meets+0.018
 valleys+0.017
 parks+0.016
izons+0.016
 mountains+0.016
Neg logit contributions
mmm-0.021
 badly-0.020
 importantly-0.020
 dear-0.018
ww-0.018
Token rabbit
Token ID22746
Feature activation+2.99e-02

Pos logit contributions
 meets+0.061
 valleys+0.058
izons+0.058
 forests+0.053
ators+0.053
Neg logit contributions
mmm-0.062
 badly-0.059
 dear-0.056
 importantly-0.056
 understanding-0.053
Token helped
Token ID4193
Feature activation+2.66e-02

Pos logit contributions
 meets+0.055
 valleys+0.054
izons+0.054
 forests+0.049
ators+0.049
Neg logit contributions
mmm-0.052
 badly-0.050
 dear-0.047
 importantly-0.046
 understanding-0.044
Token Tom
Token ID4186
Feature activation+2.28e-02

Pos logit contributions
 valleys+0.059
 meets+0.058
izons+0.056
 forests+0.054
sey+0.053
Neg logit contributions
 badly-0.037
mmm-0.037
 dear-0.032
 understanding-0.031
 caring-0.030
Token put
Token ID1234
Feature activation+9.04e-02

Pos logit contributions
 valleys+0.040
izons+0.039
 meets+0.039
ators+0.036
 forests+0.036
Neg logit contributions
 badly-0.042
mmm-0.041
 dear-0.038
 understanding-0.037
 caring-0.036
Token the
Token ID262
Feature activation+5.75e-02

Pos logit contributions
 meets+0.166
izons+0.163
iece+0.154
 valleys+0.154
 driveway+0.153
Neg logit contributions
 badly-0.158
mmm-0.150
 sweet-0.141
 dear-0.138
 caring-0.137
Token carrots
Token ID34397
Feature activation+3.88e-02

Pos logit contributions
 meets+0.104
izons+0.104
 valleys+0.102
ators+0.097
iece+0.095
Neg logit contributions
mmm-0.099
 badly-0.098
 dear-0.096
 importantly-0.088
 persuasive-0.088
Token on
Token ID319
Feature activation+2.25e-02

Pos logit contributions
 meets+0.069
izons+0.066
 valleys+0.066
 forests+0.060
ators+0.060
Neg logit contributions
mmm-0.071
 badly-0.070
 dear-0.064
 importantly-0.063
 understanding-0.061
Token a
Token ID257
Feature activation+3.08e-03

Pos logit contributions
 meets+0.037
 valleys+0.034
izons+0.034
ators+0.031
sey+0.030
Neg logit contributions
mmm-0.045
 badly-0.045
 dear-0.041
 importantly-0.040
 understanding-0.039
Token table
Token ID3084
Feature activation-3.53e-03

Pos logit contributions
 meets+0.005
 valleys+0.004
izons+0.004
 bottom+0.004
 forests+0.004
Neg logit contributions
mmm-0.007
 importantly-0.006
 badly-0.006
 dear-0.006
 understanding-0.006
Token so
Token ID523
Feature activation-3.10e-02

Pos logit contributions
mmm+0.008
 badly+0.008
 dear+0.007
 importantly+0.007
 understanding+0.007
Neg logit contributions
 meets-0.016
izons-0.015
 valleys-0.015
ators-0.014
sey-0.014
Token he
Token ID339
Feature activation-1.01e-02

Pos logit contributions
mmm+0.054
 badly+0.051
 dear+0.048
 importantly+0.048
 understanding+0.045
Neg logit contributions
 meets-0.059
 valleys-0.057
izons-0.056
 forests-0.052
 ease-0.052
Token got
Token ID1392
Feature activation+2.29e-03

Pos logit contributions
mmm+0.023
 dear+0.020
 badly+0.020
 importantly+0.020
 enjoyable+0.019
Neg logit contributions
 meets-0.015
 valleys-0.013
 mountains-0.013
sey-0.012
 forests-0.012
Token ready
Token ID3492
Feature activation-8.69e-03

Pos logit contributions
 meets+0.004
izons+0.004
 valleys+0.004
ators+0.004
sey+0.004
Neg logit contributions
 badly-0.004
mmm-0.004
 dear-0.004
 importantly-0.004
 understanding-0.004
Token to
Token ID284
Feature activation+3.90e-02

Pos logit contributions
mmm+0.019
 badly+0.018
 importantly+0.017
 dear+0.017
ww+0.016
Neg logit contributions
 meets-0.013
 valleys-0.012
izons-0.012
 forests-0.011
 mountains-0.011
Token chop
Token ID30506
Feature activation+9.00e-02

Pos logit contributions
 meets+0.071
izons+0.069
 valleys+0.068
ators+0.064
ips+0.063
Neg logit contributions
mmm-0.071
 badly-0.066
 understanding-0.063
 dear-0.062
 caring-0.059
Token it
Token ID340
Feature activation+3.07e-02

Pos logit contributions
 meets+0.171
izons+0.166
 valleys+0.147
 nursery+0.146
 driveway+0.145
Neg logit contributions
mmm-0.162
 badly-0.154
 dear-0.131
ittens-0.128
 sweet-0.127
Token down
Token ID866
Feature activation+1.75e-02

Pos logit contributions
 meets+0.061
izons+0.058
 valleys+0.056
 forests+0.052
ators+0.051
Neg logit contributions
 badly-0.053
mmm-0.052
 dear-0.045
 importantly-0.042
 understanding-0.042
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
 meets+0.041
izons+0.040
 valleys+0.038
sey+0.036
ators+0.036
Neg logit contributions
 badly-0.023
mmm-0.023
 importantly-0.019
 dear-0.019
 understanding-0.019
Token He
Token ID679
Feature activation+8.25e-03

Pos logit contributions
 meets+0.043
izons+0.042
 valleys+0.041
ators+0.041
iece+0.039
Neg logit contributions
mmm-0.036
 badly-0.032
 dear-0.031
 importantly-0.030
ittens-0.029
Token stood
Token ID6204
Feature activation+1.79e-02

Pos logit contributions
 meets+0.013
 valleys+0.012
izons+0.012
 forests+0.011
sey+0.011
Neg logit contributions
mmm-0.017
 badly-0.016
 dear-0.015
 importantly-0.015
 understanding-0.014
Token a
Token ID257
Feature activation-5.71e-02

Pos logit contributions
 meets+0.020
 valleys+0.019
izons+0.019
sey+0.017
 ease+0.017
Neg logit contributions
mmm-0.028
 badly-0.026
 dear-0.026
 importantly-0.026
 caring-0.024
Token little
Token ID1310
Feature activation-3.22e-02

Pos logit contributions
mmm+0.106
 badly+0.093
 importantly+0.088
 enjoyable+0.080
ittens+0.080
Neg logit contributions
 meets-0.100
 valleys-0.095
izons-0.094
 mountains-0.093
 bottom-0.093
Token boy
Token ID2933
Feature activation-1.61e-02

Pos logit contributions
mmm+0.068
 badly+0.063
 importantly+0.060
 dear+0.059
 understanding+0.057
Neg logit contributions
 meets-0.048
 valleys-0.047
izons-0.046
 mountains-0.044
 forests-0.043
Token decided
Token ID3066
Feature activation-1.17e-02

Pos logit contributions
mmm+0.017
 badly+0.016
 dear+0.014
 importantly+0.014
 understanding+0.012
Neg logit contributions
 meets-0.041
 valleys-0.040
izons-0.040
 forests-0.038
ators-0.038
Token to
Token ID284
Feature activation+2.96e-02

Pos logit contributions
mmm+0.028
 importantly+0.026
 badly+0.025
 dear+0.024
ittens+0.024
Neg logit contributions
 meets-0.015
izons-0.013
 valleys-0.013
 mountains-0.012
 forests-0.012
Token put
Token ID1234
Feature activation+8.98e-02

Pos logit contributions
 valleys+0.048
 meets+0.046
iece+0.044
izons+0.043
 forests+0.042
Neg logit contributions
mmm-0.059
 badly-0.057
 dear-0.055
 importantly-0.052
 understanding-0.052
Token on
Token ID319
Feature activation+3.27e-02

Pos logit contributions
 meets+0.174
izons+0.172
 valleys+0.166
iece+0.157
 driveway+0.150
Neg logit contributions
 badly-0.149
mmm-0.141
 sweet-0.135
ittens-0.133
 dear-0.132
Token his
Token ID465
Feature activation+3.60e-02

Pos logit contributions
 meets+0.060
izons+0.057
 valleys+0.056
iece+0.053
ators+0.052
Neg logit contributions
 badly-0.058
mmm-0.057
 dear-0.053
 importantly-0.051
 understanding-0.051
Token special
Token ID2041
Feature activation+3.18e-02

Pos logit contributions
 meets+0.063
 valleys+0.061
izons+0.061
iece+0.055
 forests+0.055
Neg logit contributions
mmm-0.066
 badly-0.065
 dear-0.062
 importantly-0.060
ittens-0.057
Token outfit
Token ID16313
Feature activation+1.26e-02

Pos logit contributions
 meets+0.051
 valleys+0.050
izons+0.049
iece+0.045
 ease+0.044
Neg logit contributions
 badly-0.062
mmm-0.062
 dear-0.060
 importantly-0.056
ittens-0.055
Token.
Token ID13
Feature activation+4.01e-02

Pos logit contributions
 meets+0.028
 valleys+0.027
izons+0.027
 forests+0.025
 ease+0.025
Neg logit contributions
mmm-0.017
 badly-0.017
 dear-0.016
 importantly-0.015
ittens-0.014
Token went
Token ID1816
Feature activation+4.00e-02

Pos logit contributions
 meets+0.016
 valleys+0.015
izons+0.015
 forests+0.014
ators+0.014
Neg logit contributions
mmm-0.020
 badly-0.019
 dear-0.018
 importantly-0.018
 understanding-0.017
Token to
Token ID284
Feature activation+3.98e-02

Pos logit contributions
izons+0.074
 valleys+0.073
 meets+0.071
ators+0.069
 ease+0.068
Neg logit contributions
 badly-0.066
mmm-0.065
 dear-0.061
 importantly-0.060
ittens-0.059
Token the
Token ID262
Feature activation+3.84e-02

Pos logit contributions
izons+0.071
 valleys+0.071
iece+0.071
 meets+0.069
ators+0.068
Neg logit contributions
mmm-0.068
 dear-0.064
 badly-0.063
 understanding-0.062
 caring-0.060
Token forest
Token ID8222
Feature activation+4.63e-03

Pos logit contributions
 meets+0.044
 valleys+0.044
izons+0.041
iece+0.040
 ease+0.039
Neg logit contributions
mmm-0.089
 badly-0.089
 dear-0.087
 caring-0.085
ww-0.083
Token to
Token ID284
Feature activation+5.88e-02

Pos logit contributions
 meets+0.008
 valleys+0.008
izons+0.008
 forests+0.007
 mountains+0.007
Neg logit contributions
mmm-0.009
 badly-0.008
 dear-0.008
 importantly-0.008
ww-0.007
Token pick
Token ID2298
Feature activation+8.97e-02

Pos logit contributions
ators+0.106
iece+0.106
ips+0.103
izons+0.102
 valleys+0.101
Neg logit contributions
mmm-0.093
 understanding-0.089
 caring-0.088
 inviting-0.087
 sweet-0.087
Token some
Token ID617
Feature activation+2.98e-02

Pos logit contributions
izons+0.130
 meets+0.130
 driveway+0.127
ators+0.126
 valleys+0.121
Neg logit contributions
 badly-0.183
mmm-0.181
 dear-0.176
 sweet-0.171
 importantly-0.166
Token berries
Token ID36322
Feature activation+3.41e-02

Pos logit contributions
 meets+0.030
 valleys+0.028
 mountains+0.027
 forests+0.026
izons+0.026
Neg logit contributions
mmm-0.077
 badly-0.073
 importantly-0.069
 dear-0.068
 persuasive-0.068
Token.
Token ID13
Feature activation+3.13e-02

Pos logit contributions
 meets+0.060
 valleys+0.057
izons+0.055
 mountains+0.054
 anywhere+0.053
Neg logit contributions
mmm-0.062
 badly-0.057
 dear-0.057
 importantly-0.057
 understanding-0.053
Token She
Token ID1375
Feature activation+2.20e-02

Pos logit contributions
 meets+0.070
izons+0.068
ators+0.068
 valleys+0.068
sey+0.064
Neg logit contributions
mmm-0.041
 dear-0.040
 badly-0.037
 importantly-0.033
 sweet-0.033
Token saw
Token ID2497
Feature activation+3.39e-02

Pos logit contributions
 meets+0.035
izons+0.034
 valleys+0.033
ators+0.030
iece+0.030
Neg logit contributions
mmm-0.044
 badly-0.043
 dear-0.041
 importantly-0.040
 understanding-0.038
Token out
Token ID503
Feature activation+2.95e-03

Pos logit contributions
izons+0.078
 meets+0.069
ators+0.069
ASH+0.069
 valleys+0.068
Neg logit contributions
 badly-0.066
mmm-0.061
ittens-0.060
 because-0.058
 dear-0.057
Token into
Token ID656
Feature activation+2.87e-02

Pos logit contributions
izons+0.007
 meets+0.006
ators+0.006
ASH+0.005
 valleys+0.005
Neg logit contributions
 badly-0.004
 importantly-0.004
 understanding-0.004
mmm-0.004
 because-0.004
Token the
Token ID262
Feature activation+3.64e-02

Pos logit contributions
izons+0.058
 meets+0.057
ators+0.054
robe+0.053
iece+0.052
Neg logit contributions
 badly-0.045
 understanding-0.042
mmm-0.041
 importantly-0.038
 painful-0.038
Token woods
Token ID16479
Feature activation+4.71e-03

Pos logit contributions
izons+0.046
 meets+0.044
ips+0.044
ators+0.043
iece+0.041
Neg logit contributions
 badly-0.078
 painful-0.077
mmm-0.075
 dear-0.075
ww-0.075
Token to
Token ID284
Feature activation+6.45e-02

Pos logit contributions
izons+0.012
 meets+0.012
 valleys+0.011
ators+0.011
 ease+0.011
Neg logit contributions
 badly-0.005
mmm-0.004
 understanding-0.004
 importantly-0.004
 dear-0.004
Token chop
Token ID30506
Feature activation+8.95e-02

Pos logit contributions
ips+0.125
ators+0.118
pit+0.115
 meets+0.114
izons+0.113
Neg logit contributions
 badly-0.096
mmm-0.095
 understanding-0.094
 dear-0.090
 caring-0.090
Token timber
Token ID30605
Feature activation+2.25e-02

Pos logit contributions
 meets+0.173
izons+0.170
 driveway+0.152
 nursery+0.152
ators+0.151
Neg logit contributions
mmm-0.151
 badly-0.147
 dear-0.126
ittens-0.122
 sweet-0.122
Token.
Token ID13
Feature activation+3.13e-02

Pos logit contributions
 meets+0.053
izons+0.051
 valleys+0.050
 forests+0.047
ators+0.046
Neg logit contributions
 badly-0.030
mmm-0.030
 dear-0.025
 importantly-0.023
 understanding-0.023
Token He
Token ID679
Feature activation+3.19e-02

Pos logit contributions
izons+0.071
 meets+0.070
iece+0.069
 valleys+0.068
ators+0.068
Neg logit contributions
mmm-0.042
 dear-0.034
 badly-0.034
 importantly-0.033
ittens-0.033
Token had
Token ID550
Feature activation+2.92e-02

Pos logit contributions
izons+0.056
 meets+0.052
iece+0.050
 valleys+0.050
ips+0.049
Neg logit contributions
 badly-0.058
mmm-0.055
 dear-0.052
 caring-0.052
 importantly-0.051
Token to
Token ID284
Feature activation+4.10e-02

Pos logit contributions
izons+0.060
 meets+0.057
ators+0.055
iece+0.055
 valleys+0.054
Neg logit contributions
 badly-0.046
mmm-0.043
 dear-0.042
 understanding-0.041
 caring-0.041
Token you
Token ID345
Feature activation+2.33e-02

Pos logit contributions
 badly+0.017
mmm+0.016
 importantly+0.016
 dear+0.016
 understanding+0.015
Neg logit contributions
izons-0.027
 meets-0.026
sey-0.025
 valleys-0.025
ators-0.025
Token can
Token ID460
Feature activation+2.50e-02

Pos logit contributions
izons+0.040
 valleys+0.038
ators+0.037
 meets+0.037
ASH+0.037
Neg logit contributions
 badly-0.044
mmm-0.044
 dear-0.042
 understanding-0.039
 importantly-0.038
Token do
Token ID466
Feature activation-1.72e-03

Pos logit contributions
 valleys+0.043
 meets+0.043
izons+0.042
 forests+0.039
ASH+0.039
Neg logit contributions
mmm-0.049
 badly-0.045
 understanding-0.041
 dear-0.041
 caring-0.041
Token anything
Token ID1997
Feature activation-2.70e-02

Pos logit contributions
 badly+0.003
mmm+0.003
 caring+0.003
 dear+0.003
 enjoyable+0.003
Neg logit contributions
izons-0.003
 driveway-0.003
 meets-0.003
ators-0.003
ASH-0.003
Token!
Token ID0
Feature activation+2.05e-02

Pos logit contributions
mmm+0.040
 badly+0.039
 importantly+0.036
 dear+0.035
 caring+0.033
Neg logit contributions
 meets-0.057
izons-0.057
 valleys-0.056
ators-0.053
sey-0.052
Token<|endoftext|>
Token ID50256
Feature activation+8.95e-02

Pos logit contributions
 meets+0.039
 valleys+0.038
izons+0.037
ators+0.035
 forests+0.035
Neg logit contributions
mmm-0.036
 badly-0.033
 importantly-0.032
 dear-0.032
 understanding-0.029
TokenOnce
Token ID7454
Feature activation+4.02e-02

Pos logit contributions
izons+0.223
ators+0.220
iece+0.218
 valleys+0.216
 meets+0.214
Neg logit contributions
 badly-0.070
mmm-0.069
 importantly-0.060
From-0.055
S-0.053
Token upon
Token ID2402
Feature activation+2.97e-02

Pos logit contributions
 meets+0.057
izons+0.057
 valleys+0.056
ators+0.050
 forests+0.050
Neg logit contributions
mmm-0.086
 badly-0.085
 dear-0.080
 importantly-0.080
 understanding-0.075
Token a
Token ID257
Feature activation-4.88e-02

Pos logit contributions
 meets+0.053
izons+0.051
 valleys+0.051
ators+0.047
sey+0.046
Neg logit contributions
mmm-0.054
 badly-0.054
 dear-0.050
 importantly-0.049
 understanding-0.047
Token time
Token ID640
Feature activation+2.13e-02

Pos logit contributions
mmm+0.126
 importantly+0.118
 badly+0.118
 dear+0.116
 understanding+0.115
Neg logit contributions
 meets-0.048
 finds-0.037
 bottom-0.035
 valleys-0.035
izons-0.034
Token,
Token ID11
Feature activation+2.87e-02

Pos logit contributions
 meets+0.042
 valleys+0.040
izons+0.040
 forests+0.037
 ease+0.037
Neg logit contributions
mmm-0.034
 badly-0.032
 importantly-0.031
 dear-0.030
 understanding-0.029
Token go
Token ID467
Feature activation+2.32e-02

Pos logit contributions
izons+0.003
 meets+0.003
 valleys+0.003
ators+0.002
ASH+0.002
Neg logit contributions
mmm-0.003
 badly-0.003
 dear-0.003
 understanding-0.002
 persuasive-0.002
Token to
Token ID284
Feature activation+3.36e-02

Pos logit contributions
izons+0.055
 ease+0.049
 valleys+0.049
neath+0.048
sey+0.048
Neg logit contributions
 grateful-0.026
 badly-0.025
 obedient-0.025
 sweet-0.024
 understanding-0.024
Token the
Token ID262
Feature activation+3.23e-02

Pos logit contributions
 valleys+0.068
izons+0.067
iece+0.066
 meets+0.065
ators+0.064
Neg logit contributions
 understanding-0.048
 badly-0.047
mmm-0.046
 dear-0.045
 caring-0.044
Token pond
Token ID16723
Feature activation+5.74e-03

Pos logit contributions
ips+0.046
izons+0.044
 meets+0.043
 ease+0.042
 valleys+0.042
Neg logit contributions
 dear-0.064
 badly-0.064
 caring-0.063
 sweet-0.063
 understanding-0.060
Token and
Token ID290
Feature activation+1.97e-02

Pos logit contributions
izons+0.014
 meets+0.014
 valleys+0.013
 ease+0.013
ators+0.013
Neg logit contributions
 badly-0.006
mmm-0.006
 understanding-0.005
 dear-0.005
 grateful-0.005
Token put
Token ID1234
Feature activation+8.94e-02

Pos logit contributions
izons+0.042
ators+0.039
iece+0.038
ips+0.038
 valleys+0.037
Neg logit contributions
 understanding-0.024
 dear-0.024
mmm-0.024
 because-0.024
 juicy-0.024
Token the
Token ID262
Feature activation+4.85e-02

Pos logit contributions
izons+0.185
 meets+0.181
 valleys+0.169
iece+0.167
 driveway+0.166
Neg logit contributions
 badly-0.125
 sweet-0.121
mmm-0.119
 juicy-0.113
 dear-0.111
Token boat
Token ID8848
Feature activation+2.86e-02

Pos logit contributions
 meets+0.090
izons+0.089
 valleys+0.089
sey+0.081
ators+0.080
Neg logit contributions
 badly-0.081
 dear-0.079
mmm-0.077
 importantly-0.073
 understanding-0.073
Token in
Token ID287
Feature activation+1.66e-02

Pos logit contributions
izons+0.056
 meets+0.056
 valleys+0.050
 ease+0.049
 nursery+0.048
Neg logit contributions
 badly-0.048
 sweet-0.039
 grateful-0.038
 dear-0.038
mmm-0.038
Token the
Token ID262
Feature activation+4.48e-02

Pos logit contributions
 meets+0.030
izons+0.029
 valleys+0.027
ators+0.026
iece+0.026
Neg logit contributions
 badly-0.030
mmm-0.030
 understanding-0.027
 dear-0.027
 importantly-0.026
Token water
Token ID1660
Feature activation+1.52e-02

Pos logit contributions
izons+0.075
 meets+0.073
ators+0.072
sey+0.070
rays+0.068
Neg logit contributions
 badly-0.076
 dear-0.076
 sweet-0.074
 awful-0.073
 painful-0.071
Token He
Token ID679
Feature activation-1.65e-03

Pos logit contributions
 meets+0.016
 valleys+0.015
izons+0.015
 forests+0.014
 mountains+0.014
Neg logit contributions
mmm-0.015
 badly-0.015
 dear-0.014
 importantly-0.014
 understanding-0.013
Token found
Token ID1043
Feature activation+4.84e-02

Pos logit contributions
mmm+0.004
 badly+0.003
 importantly+0.003
 dear+0.003
ittens+0.003
Neg logit contributions
 meets-0.003
 finds-0.002
 valleys-0.002
izons-0.002
 parks-0.002
Token a
Token ID257
Feature activation-2.53e-03

Pos logit contributions
 meets+0.110
izons+0.108
 driveway+0.108
 valleys+0.105
ators+0.104
Neg logit contributions
 badly-0.063
 dear-0.059
 caring-0.057
 sweet-0.056
 understanding-0.056
Token carrot
Token ID40562
Feature activation+1.35e-02

Pos logit contributions
mmm+0.004
 badly+0.004
 dear+0.004
 importantly+0.004
 understanding+0.003
Neg logit contributions
 meets-0.005
 valleys-0.005
izons-0.005
 forests-0.004
ators-0.004
Token and
Token ID290
Feature activation+1.19e-02

Pos logit contributions
 meets+0.027
 valleys+0.026
izons+0.026
 forests+0.024
ators+0.024
Neg logit contributions
mmm-0.022
 badly-0.021
 dear-0.020
 importantly-0.020
 understanding-0.018
Token put
Token ID1234
Feature activation+8.91e-02

Pos logit contributions
 meets+0.020
 valleys+0.019
izons+0.018
 forests+0.017
 mountains+0.017
Neg logit contributions
mmm-0.023
 badly-0.022
 importantly-0.021
 dear-0.021
ittens-0.020
Token it
Token ID340
Feature activation+2.83e-02

Pos logit contributions
izons+0.198
 meets+0.197
 driveway+0.193
 hall+0.190
ators+0.190
Neg logit contributions
mmm-0.122
 badly-0.121
 sweet-0.108
 dear-0.105
 juicy-0.101
Token in
Token ID287
Feature activation+1.58e-02

Pos logit contributions
 meets+0.052
izons+0.051
 valleys+0.049
ators+0.046
 nursery+0.045
Neg logit contributions
 badly-0.052
mmm-0.046
 sweet-0.042
 importantly-0.042
 understanding-0.042
Token the
Token ID262
Feature activation+6.14e-02

Pos logit contributions
 meets+0.026
izons+0.024
 valleys+0.024
ators+0.022
iece+0.022
Neg logit contributions
mmm-0.031
 badly-0.031
 importantly-0.028
 dear-0.028
 understanding-0.028
Token trap
Token ID12840
Feature activation+1.71e-02

Pos logit contributions
 meets+0.132
ators+0.130
izons+0.129
 valleys+0.128
ips+0.125
Neg logit contributions
mmm-0.083
 badly-0.082
 dear-0.079
 painful-0.073
 importantly-0.070
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
 meets+0.038
 valleys+0.036
izons+0.036
ators+0.034
 forests+0.034
Neg logit contributions
mmm-0.023
 badly-0.023
 dear-0.021
 importantly-0.020
 understanding-0.019
Token pick
Token ID2298
Feature activation+8.23e-02

Pos logit contributions
 valleys+0.040
 meets+0.038
 forests+0.036
ips+0.035
 mountains+0.035
Neg logit contributions
 badly-0.034
mmm-0.034
 dear-0.032
 understanding-0.031
 caring-0.030
Token up
Token ID510
Feature activation+1.64e-02

Pos logit contributions
 driveway+0.210
illas+0.204
ators+0.202
 hallway+0.202
izons+0.201
Neg logit contributions
 badly-0.085
 sweet-0.079
 juicy-0.069
mmm-0.067
 understanding-0.066
Token the
Token ID262
Feature activation+4.26e-02

Pos logit contributions
 meets+0.031
izons+0.031
 valleys+0.028
 driveway+0.028
sey+0.027
Neg logit contributions
 badly-0.029
mmm-0.027
 dear-0.026
 understanding-0.026
 sweet-0.025
Token pieces
Token ID5207
Feature activation+4.83e-03

Pos logit contributions
izons+0.079
 meets+0.072
ips+0.071
iece+0.069
 valleys+0.067
Neg logit contributions
 dear-0.080
 badly-0.077
 sweet-0.075
 adorable-0.069
 understanding-0.067
Token and
Token ID290
Feature activation+1.69e-02

Pos logit contributions
 meets+0.010
izons+0.010
 valleys+0.010
 forests+0.009
ators+0.009
Neg logit contributions
mmm-0.007
 badly-0.007
 dear-0.006
 importantly-0.006
 understanding-0.006
Token put
Token ID1234
Feature activation+8.89e-02

Pos logit contributions
 valleys+0.032
 meets+0.032
izons+0.032
ips+0.031
sey+0.030
Neg logit contributions
mmm-0.028
 badly-0.027
 dear-0.027
 understanding-0.026
 caring-0.025
Token them
Token ID606
Feature activation+1.82e-02

Pos logit contributions
 meets+0.164
izons+0.163
 hallway+0.153
 driveway+0.151
 hall+0.147
Neg logit contributions
 badly-0.140
 sweet-0.135
 as-0.129
 juicy-0.123
 caring-0.122
Token back
Token ID736
Feature activation+2.18e-03

Pos logit contributions
 meets+0.033
izons+0.033
 valleys+0.030
 forests+0.028
 nursery+0.027
Neg logit contributions
 badly-0.035
mmm-0.030
 understanding-0.029
 dear-0.028
 sweet-0.028
Token together
Token ID1978
Feature activation+5.20e-03

Pos logit contributions
izons+0.005
 meets+0.005
 valleys+0.004
 forests+0.004
 ease+0.004
Neg logit contributions
 badly-0.004
mmm-0.003
 importantly-0.003
 because-0.003
 dear-0.003
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
 meets+0.013
izons+0.013
 valleys+0.013
 forests+0.012
ASH+0.012
Neg logit contributions
 badly-0.006
mmm-0.005
 understanding-0.004
 importantly-0.004
 dear-0.004
Token
Token ID198
Feature activation-1.41e-02

Pos logit contributions
 meets+0.025
izons+0.025
ators+0.025
 valleys+0.024
 bridges+0.023
Neg logit contributions
mmm-0.018
 badly-0.016
 dear-0.016
 importantly-0.015
 understanding-0.015
TokenTa
Token ID38586
Feature activation-4.14e-02

Pos logit contributions
 meets+0.028
 valleys+0.027
izons+0.027
 ease+0.025
ators+0.025
Neg logit contributions
mmm-0.023
 badly-0.022
 dear-0.021
 importantly-0.020
ww-0.020
Token-
Token ID12
Feature activation+3.08e-03

Pos logit contributions
 badly+0.061
mmm+0.060
 importantly+0.058
 persuasive+0.056
 painful+0.055
Neg logit contributions
 meets-0.079
izons-0.077
 valleys-0.075
sey-0.074
iece-0.073
Tokenda
Token ID6814
Feature activation-4.06e-02

Pos logit contributions
 meets+0.005
 valleys+0.005
izons+0.005
 bridges+0.005
 ease+0.005
Neg logit contributions
mmm-0.006
 badly-0.006
ww-0.005
 dear-0.005
www-0.005
Token!
Token ID0
Feature activation+1.80e-02

Pos logit contributions
mmm+0.051
 badly+0.049
 importantly+0.047
 dear+0.044
 understanding+0.042
Neg logit contributions
 meets-0.093
izons-0.090
 valleys-0.089
sey-0.084
ators-0.083
Token Here
Token ID3423
Feature activation+1.97e-02

Pos logit contributions
 meets+0.036
 valleys+0.035
izons+0.034
 ease+0.033
ators+0.033
Neg logit contributions
mmm-0.030
 dear-0.028
 badly-0.026
 importantly-0.025
ww-0.024
Token I
Token ID314
Feature activation+8.88e-02

Pos logit contributions
 meets+0.038
 valleys+0.037
izons+0.037
ators+0.034
 forests+0.034
Neg logit contributions
mmm-0.033
 badly-0.032
 dear-0.030
 importantly-0.029
ww-0.028
Token am
Token ID716
Feature activation-1.24e-02

Pos logit contributions
 valleys+0.173
 forests+0.156
 mountains+0.155
 bridges+0.153
iece+0.151
Neg logit contributions
 dear-0.134
 badly-0.134
mmm-0.133
 persuasive-0.122
 reassured-0.122
Token!"
Token ID2474
Feature activation-1.96e-02

Pos logit contributions
mmm+0.020
 badly+0.019
 dear+0.017
 importantly+0.017
 understanding+0.016
Neg logit contributions
 meets-0.025
 valleys-0.025
izons-0.024
 forests-0.023
ators-0.023
Token Mom
Token ID11254
Feature activation-1.75e-02

Pos logit contributions
mmm+0.037
 badly+0.036
 importantly+0.034
 dear+0.033
 persuasive+0.032
Neg logit contributions
 meets-0.033
 valleys-0.031
izons-0.031
 mountains-0.029
 forests-0.029
Token cl
Token ID537
Feature activation-2.73e-02

Pos logit contributions
mmm+0.025
 badly+0.023
 dear+0.022
 importantly+0.022
 understanding+0.021
Neg logit contributions
 meets-0.038
 valleys-0.037
izons-0.037
 forests-0.035
 ease-0.034
Tokenapped
Token ID6320
Feature activation-3.63e-02

Pos logit contributions
mmm+0.065
 badly+0.064
 dear+0.061
 importantly+0.060
 enjoyable+0.060
Neg logit contributions
 meets-0.033
izons-0.032
iece-0.029
ips-0.029
sey-0.028
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
 meets+0.014
izons+0.014
 valleys+0.014
ASH+0.013
 ease+0.013
Neg logit contributions
mmm-0.007
 badly-0.007
 dear-0.006
 importantly-0.006
 understanding-0.006
Token Everyone
Token ID11075
Feature activation-7.28e-03

Pos logit contributions
 meets+0.044
izons+0.042
 valleys+0.042
iece+0.041
ators+0.040
Neg logit contributions
mmm-0.026
 dear-0.023
 badly-0.021
 importantly-0.021
 understanding-0.019
Token thought
Token ID1807
Feature activation-9.15e-04

Pos logit contributions
mmm+0.016
 badly+0.015
 dear+0.014
 importantly+0.014
ittens+0.013
Neg logit contributions
 meets-0.011
 valleys-0.010
izons-0.010
sey-0.009
 parks-0.009
Token she
Token ID673
Feature activation+4.03e-02

Pos logit contributions
mmm+0.001
 badly+0.001
 dear+0.001
 importantly+0.001
 understanding+0.001
Neg logit contributions
izons-0.002
 meets-0.002
 valleys-0.002
ASH-0.002
ators-0.002
Token was
Token ID373
Feature activation-2.46e-02

Pos logit contributions
 valleys+0.072
izons+0.069
 meets+0.067
 bridges+0.067
iece+0.066
Neg logit contributions
 badly-0.071
mmm-0.070
 dear-0.069
 importantly-0.066
 caring-0.064
Token rather
Token ID2138
Feature activation-8.83e-02

Pos logit contributions
 badly+0.032
mmm+0.032
 dear+0.030
 importantly+0.029
 caring+0.029
Neg logit contributions
 meets-0.054
 valleys-0.054
izons-0.054
iece-0.051
ASH-0.051
Token curious
Token ID11040
Feature activation-4.08e-02

Pos logit contributions
mmm+0.171
ittens+0.134
mmmm+0.132
ww+0.122
keys+0.118
Neg logit contributions
 meets-0.163
 finds-0.148
 ones-0.148
 woods-0.141
 places-0.140
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
mmm+0.058
 badly+0.055
 dear+0.051
 importantly+0.050
 understanding+0.046
Neg logit contributions
 meets-0.089
 valleys-0.086
izons-0.086
 forests-0.080
 ease-0.080
Token
Token ID198
Feature activation-1.05e-02

Pos logit contributions
 meets+0.021
izons+0.020
 valleys+0.020
iece+0.019
ators+0.019
Neg logit contributions
mmm-0.017
 badly-0.015
 dear-0.015
 importantly-0.015
 understanding-0.013
Token
Token ID198
Feature activation-2.93e-02

Pos logit contributions
mmm+0.018
 badly+0.018
 importantly+0.016
 dear+0.016
 understanding+0.015
Neg logit contributions
 meets-0.020
izons-0.019
 valleys-0.019
ators-0.017
iece-0.017
TokenOne
Token ID3198
Feature activation-1.13e-02

Pos logit contributions
mmm+0.057
 dear+0.054
 badly+0.053
 importantly+0.051
ittens+0.048
Neg logit contributions
 meets-0.049
 valleys-0.049
izons-0.047
 forests-0.046
 parks-0.045
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
 badly+0.050
mmm+0.047
 dear+0.045
 importantly+0.044
 understanding+0.040
Neg logit contributions
 meets-0.090
izons-0.088
 valleys-0.087
 ease-0.084
neath-0.082
Token It
Token ID632
Feature activation+1.92e-02

Pos logit contributions
mmm+0.030
 badly+0.030
 importantly+0.027
 dear+0.027
 persuasive+0.026
Neg logit contributions
 meets-0.026
 valleys-0.025
izons-0.025
 mountains-0.023
 forests-0.023
Token made
Token ID925
Feature activation-1.79e-02

Pos logit contributions
 valleys+0.030
izons+0.030
 meets+0.030
iece+0.028
 forests+0.028
Neg logit contributions
mmm-0.036
 badly-0.036
 importantly-0.034
 caring-0.034
 dear-0.034
Token Mom
Token ID11254
Feature activation-2.20e-02

Pos logit contributions
mmm+0.032
 badly+0.031
 dear+0.030
 caring+0.029
 importantly+0.029
Neg logit contributions
 meets-0.034
 valleys-0.033
izons-0.032
 driveway-0.030
iece-0.030
Tokenmy
Token ID1820
Feature activation-2.50e-02

Pos logit contributions
 badly+0.051
mmm+0.048
 understanding+0.048
 dear+0.047
 caring+0.047
Neg logit contributions
 meets-0.029
 valleys-0.029
izons-0.028
 mountains-0.025
 forests-0.025
Token very
Token ID845
Feature activation-8.47e-02

Pos logit contributions
 badly+0.033
mmm+0.032
 dear+0.030
 understanding+0.029
 caring+0.028
Neg logit contributions
izons-0.057
 meets-0.057
 valleys-0.056
ators-0.053
 forests-0.052
Token happy
Token ID3772
Feature activation-4.31e-02

Pos logit contributions
mmm+0.135
ittens+0.116
ww+0.111
www+0.106
 importantly+0.105
Neg logit contributions
 meets-0.171
 valleys-0.156
 bottom-0.155
 ease-0.152
iece-0.149
Token that
Token ID326
Feature activation-2.70e-03

Pos logit contributions
mmm+0.065
 badly+0.063
 dear+0.058
 importantly+0.057
 understanding+0.054
Neg logit contributions
 meets-0.092
 valleys-0.089
izons-0.088
 forests-0.082
 ease-0.082
Token her
Token ID607
Feature activation-1.89e-02

Pos logit contributions
mmm+0.004
 badly+0.004
 dear+0.004
 importantly+0.004
 caring+0.003
Neg logit contributions
 meets-0.006
 valleys-0.006
izons-0.006
 forests-0.005
iece-0.005
Token little
Token ID1310
Feature activation-5.55e-03

Pos logit contributions
mmm+0.028
 badly+0.027
 dear+0.025
 importantly+0.025
 understanding+0.023
Neg logit contributions
 meets-0.041
 valleys-0.040
izons-0.039
 forests-0.037
ators-0.036
Token girl
Token ID2576
Feature activation-1.41e-03

Pos logit contributions
 badly+0.009
 dear+0.008
mmm+0.008
 importantly+0.008
 understanding+0.008
Neg logit contributions
 meets-0.011
 valleys-0.011
izons-0.011
ators-0.011
neath-0.010
Token the
Token ID262
Feature activation+7.28e-03

Pos logit contributions
 badly+0.026
mmm+0.026
 importantly+0.025
 dear+0.025
 understanding+0.024
Neg logit contributions
 meets-0.030
izons-0.028
 valleys-0.027
 driveway-0.027
ASH-0.026
Token yog
Token ID27423
Feature activation-2.74e-02

Pos logit contributions
 meets+0.014
izons+0.013
 valleys+0.013
ASH+0.013
ators+0.013
Neg logit contributions
 dear-0.012
mmm-0.012
 badly-0.011
 importantly-0.011
 understanding-0.011
Tokenis
Token ID271
Feature activation-5.73e-03

Pos logit contributions
mmm+0.046
 badly+0.044
 importantly+0.041
 dear+0.041
 understanding+0.038
Neg logit contributions
 meets-0.053
 valleys-0.051
izons-0.051
 forests-0.047
sey-0.047
Token and
Token ID290
Feature activation-3.30e-02

Pos logit contributions
mmm+0.011
 badly+0.010
 dear+0.010
 importantly+0.010
ittens+0.009
Neg logit contributions
 meets-0.010
 valleys-0.010
izons-0.010
 driveway-0.009
 forests-0.009
Token was
Token ID373
Feature activation-5.89e-02

Pos logit contributions
mmm+0.049
 understanding+0.049
 dear+0.048
 importantly+0.048
 badly+0.047
Neg logit contributions
izons-0.066
 meets-0.066
 valleys-0.064
ators-0.064
ASH-0.063
Token so
Token ID523
Feature activation-8.40e-02

Pos logit contributions
mmm+0.073
 badly+0.070
 importantly+0.065
 dear+0.065
 understanding+0.062
Neg logit contributions
 meets-0.138
izons-0.136
 valleys-0.135
ators-0.127
 forests-0.126
Token proud
Token ID6613
Feature activation-7.83e-02

Pos logit contributions
mmm+0.090
 badly+0.082
 dear+0.072
ittens+0.069
 importantly+0.069
Neg logit contributions
 meets-0.213
 valleys-0.208
izons-0.199
 ease-0.196
 forests-0.194
Token of
Token ID286
Feature activation-1.72e-02

Pos logit contributions
mmm+0.125
 badly+0.118
 dear+0.106
 importantly+0.101
ittens+0.101
Neg logit contributions
 meets-0.157
 valleys-0.150
izons-0.145
 mountains-0.142
 forests-0.141
Token herself
Token ID5223
Feature activation-5.17e-02

Pos logit contributions
mmm+0.032
 badly+0.032
 importantly+0.030
 understanding+0.030
 dear+0.029
Neg logit contributions
 meets-0.032
 valleys-0.029
izons-0.029
ASH-0.028
 driveway-0.027
Token for
Token ID329
Feature activation-3.66e-03

Pos logit contributions
mmm+0.081
 badly+0.075
 dear+0.071
 importantly+0.071
ww+0.066
Neg logit contributions
 meets-0.103
 valleys-0.099
izons-0.098
sey-0.093
 forests-0.092
Token doing
Token ID1804
Feature activation-1.09e-02

Pos logit contributions
 understanding+0.006
 badly+0.006
mmm+0.006
 caring+0.006
 importantly+0.006
Neg logit contributions
 meets-0.007
 valleys-0.007
izons-0.007
ASH-0.007
 driveway-0.007
Token and
Token ID290
Feature activation-2.83e-02

Pos logit contributions
mmm+0.026
 badly+0.025
 dear+0.023
 understanding+0.022
 importantly+0.022
Neg logit contributions
 meets-0.036
izons-0.035
 valleys-0.035
ators-0.032
 forests-0.032
Token smiled
Token ID13541
Feature activation-6.23e-02

Pos logit contributions
 understanding+0.031
 badly+0.030
mmm+0.029
 dear+0.029
 caring+0.028
Neg logit contributions
izons-0.067
 meets-0.066
 valleys-0.065
ators-0.064
neath-0.063
Token.
Token ID13
Feature activation-1.00e-02

Pos logit contributions
mmm+0.143
ittens+0.124
ww+0.123
www+0.123
 importantly+0.117
Neg logit contributions
 asleep-0.087
 valleys-0.086
 meets-0.086
 mountains-0.082
iece-0.081
Token She
Token ID1375
Feature activation-7.41e-03

Pos logit contributions
mmm+0.018
 badly+0.018
 importantly+0.016
 dear+0.016
 understanding+0.016
Neg logit contributions
 meets-0.018
 valleys-0.017
izons-0.017
 forests-0.016
 mountains-0.016
Token was
Token ID373
Feature activation-5.09e-02

Pos logit contributions
mmm+0.015
 badly+0.014
 importantly+0.013
 dear+0.013
ittens+0.013
Neg logit contributions
 meets-0.012
 valleys-0.011
izons-0.011
 forests-0.010
 parks-0.010
Token proud
Token ID6613
Feature activation-8.39e-02

Pos logit contributions
mmm+0.055
 badly+0.049
 dear+0.044
 importantly+0.044
ww+0.040
Neg logit contributions
 meets-0.131
 valleys-0.125
izons-0.125
 parks-0.118
 forests-0.118
Token of
Token ID286
Feature activation-8.19e-03

Pos logit contributions
mmm+0.180
 badly+0.160
ww+0.152
www+0.148
ittens+0.143
Neg logit contributions
 meets-0.128
 mountains-0.122
 bridges-0.118
 asleep-0.118
 valleys-0.117
Token her
Token ID607
Feature activation-3.09e-02

Pos logit contributions
mmm+0.015
 badly+0.015
 importantly+0.014
 dear+0.013
 caring+0.013
Neg logit contributions
 meets-0.015
izons-0.014
 valleys-0.014
 driveway-0.014
ators-0.013
Token daughter
Token ID4957
Feature activation-2.99e-02

Pos logit contributions
mmm+0.059
 badly+0.052
ww+0.051
 importantly+0.051
www+0.049
Neg logit contributions
 meets-0.053
izons-0.049
 valleys-0.048
sey-0.048
 parks-0.047
Token.
Token ID13
Feature activation-9.98e-03

Pos logit contributions
mmm+0.048
 badly+0.045
 importantly+0.042
 dear+0.042
ww+0.039
Neg logit contributions
 meets-0.061
 valleys-0.058
izons-0.058
 forests-0.054
sey-0.054
Token She
Token ID1375
Feature activation-1.13e-02

Pos logit contributions
mmm+0.020
 badly+0.020
 dear+0.018
 importantly+0.018
ww+0.018
Neg logit contributions
 meets-0.016
 valleys-0.016
izons-0.015
 mountains-0.014
 forests-0.014
TokenZ
Token ID57
Feature activation-1.07e-02

Pos logit contributions
mmm+0.052
 badly+0.049
 dear+0.045
 importantly+0.045
 understanding+0.041
Neg logit contributions
 meets-0.100
 valleys-0.097
izons-0.096
 forests-0.091
ators-0.090
Tokenoe
Token ID2577
Feature activation-2.02e-02

Pos logit contributions
mmm+0.022
 badly+0.021
 dear+0.020
 importantly+0.020
 understanding+0.019
Neg logit contributions
 meets-0.017
 valleys-0.016
izons-0.016
 forests-0.015
ators-0.014
Token's
Token ID338
Feature activation-2.56e-02

Pos logit contributions
mmm+0.039
 badly+0.038
 dear+0.036
 importantly+0.035
 understanding+0.034
Neg logit contributions
 meets-0.033
 valleys-0.033
izons-0.032
 forests-0.030
 bridges-0.029
Token mom
Token ID1995
Feature activation-2.30e-02

Pos logit contributions
mmm+0.040
 badly+0.038
 dear+0.035
 importantly+0.035
 understanding+0.033
Neg logit contributions
 meets-0.053
 valleys-0.051
izons-0.051
 forests-0.047
ators-0.047
Token was
Token ID373
Feature activation-6.02e-02

Pos logit contributions
mmm+0.037
 badly+0.035
 dear+0.033
 understanding+0.033
 importantly+0.033
Neg logit contributions
 valleys-0.045
 meets-0.045
izons-0.045
 forests-0.041
 bridges-0.041
Token very
Token ID845
Feature activation-8.39e-02

Pos logit contributions
mmm+0.066
 badly+0.060
 dear+0.054
 importantly+0.054
ittens+0.047
Neg logit contributions
 meets-0.153
 valleys-0.147
izons-0.147
 forests-0.139
 ease-0.139
Token proud
Token ID6613
Feature activation-7.84e-02

Pos logit contributions
mmm+0.126
ittens+0.115
 badly+0.100
emy+0.099
ww+0.098
Neg logit contributions
 meets-0.177
 nursery-0.152
 bottom-0.152
 ease-0.152
 valleys-0.151
Token of
Token ID286
Feature activation-1.14e-02

Pos logit contributions
mmm+0.143
 badly+0.131
ww+0.122
ittens+0.120
 dear+0.117
Neg logit contributions
 meets-0.136
 valleys-0.123
 mountains-0.122
izons-0.120
 parks-0.120
Token her
Token ID607
Feature activation-4.08e-02

Pos logit contributions
 badly+0.020
mmm+0.019
 importantly+0.018
 dear+0.018
 understanding+0.018
Neg logit contributions
 meets-0.022
izons-0.021
 valleys-0.021
 driveway-0.020
ASH-0.020
Token and
Token ID290
Feature activation-4.58e-02

Pos logit contributions
mmm+0.068
 badly+0.062
 importantly+0.060
ww+0.059
ittens+0.057
Neg logit contributions
 meets-0.077
izons-0.073
 valleys-0.072
sey-0.069
iece-0.068
Token she
Token ID673
Feature activation-1.95e-02

Pos logit contributions
mmm+0.064
 badly+0.060
 dear+0.055
 importantly+0.055
ittens+0.051
Neg logit contributions
 meets-0.102
 valleys-0.097
izons-0.097
 forests-0.091
 ease-0.091
Token."
Token ID526
Feature activation-1.27e-02

Pos logit contributions
mmm+0.062
 badly+0.059
 dear+0.055
 importantly+0.055
 understanding+0.051
Neg logit contributions
 meets-0.074
 valleys-0.071
izons-0.070
 forests-0.065
 mountains-0.065
Token Lily
Token ID20037
Feature activation-2.44e-02

Pos logit contributions
mmm+0.024
 badly+0.024
 importantly+0.022
 dear+0.021
 understanding+0.021
Neg logit contributions
 meets-0.022
izons-0.020
 valleys-0.020
 parks-0.018
 forests-0.018
Token smiled
Token ID13541
Feature activation-4.03e-02

Pos logit contributions
mmm+0.054
 badly+0.052
 importantly+0.051
 dear+0.049
ittens+0.047
Neg logit contributions
 meets-0.034
izons-0.031
sey-0.030
 valleys-0.030
ASH-0.028
Token and
Token ID290
Feature activation-2.97e-02

Pos logit contributions
mmm+0.091
ww+0.081
www+0.081
 importantly+0.079
ittens+0.078
Neg logit contributions
 asleep-0.057
 meets-0.053
 out-0.052
 valleys-0.051
iece-0.051
Token felt
Token ID2936
Feature activation-5.65e-02

Pos logit contributions
mmm+0.047
 badly+0.047
 importantly+0.042
 dear+0.040
ittens+0.039
Neg logit contributions
 meets-0.060
 valleys-0.056
izons-0.056
sey-0.052
 mountains-0.052
Token proud
Token ID6613
Feature activation-8.38e-02

Pos logit contributions
mmm+0.090
ittens+0.079
 badly+0.074
 importantly+0.072
 dear+0.070
Neg logit contributions
 meets-0.119
 valleys-0.111
 ease-0.110
izons-0.108
 anywhere-0.103
Token of
Token ID286
Feature activation-1.62e-02

Pos logit contributions
mmm+0.176
 badly+0.159
ww+0.151
ittens+0.147
www+0.146
Neg logit contributions
 meets-0.127
 asleep-0.121
 mountains-0.118
 ones-0.115
 valleys-0.114
Token herself
Token ID5223
Feature activation-3.82e-02

Pos logit contributions
mmm+0.035
 badly+0.034
 importantly+0.032
 dear+0.032
 understanding+0.031
Neg logit contributions
 meets-0.025
 valleys-0.024
izons-0.023
ators-0.021
 forests-0.021
Token.
Token ID13
Feature activation-7.22e-04

Pos logit contributions
mmm+0.085
ww+0.076
 badly+0.075
 importantly+0.073
www+0.072
Neg logit contributions
iece-0.048
 out-0.047
sey-0.047
 meets-0.047
 anywhere-0.046
Token
Token ID198
Feature activation-2.03e-02

Pos logit contributions
mmm+0.001
 dear+0.001
 badly+0.001
 importantly+0.001
 understanding+0.001
Neg logit contributions
 meets-0.001
 valleys-0.001
izons-0.001
ators-0.001
 forests-0.001
Token
Token ID198
Feature activation-4.21e-02

Pos logit contributions
mmm+0.040
 badly+0.039
 dear+0.037
 importantly+0.037
 understanding+0.035
Neg logit contributions
 meets-0.033
 valleys-0.031
izons-0.030
 forests-0.028
 ease-0.028
TokenL
Token ID43
Feature activation-1.89e-02

Pos logit contributions
mmm+0.067
 dear+0.063
 importantly+0.061
 badly+0.061
 enjoyable+0.056
Neg logit contributions
 meets-0.065
izons-0.064
 valleys-0.057
 parks-0.057
 driveway-0.057
Tokenily
Token ID813
Feature activation-1.48e-02

Pos logit contributions
mmm+0.034
 badly+0.032
 importantly+0.031
 dear+0.030
 understanding+0.029
Neg logit contributions
 meets-0.034
izons-0.033
 valleys-0.031
ators-0.030
sey-0.030
Token's
Token ID338
Feature activation-1.67e-02

Pos logit contributions
mmm+0.028
 badly+0.027
 importantly+0.026
 dear+0.026
 understanding+0.024
Neg logit contributions
 meets-0.025
izons-0.024
 valleys-0.024
 forests-0.022
sey-0.022
Token mom
Token ID1995
Feature activation-1.91e-02

Pos logit contributions
mmm+0.026
 badly+0.025
 importantly+0.023
 dear+0.023
ww+0.021
Neg logit contributions
 meets-0.035
izons-0.033
 valleys-0.033
 forests-0.031
 ease-0.031
Token was
Token ID373
Feature activation-5.31e-02

Pos logit contributions
mmm+0.031
 badly+0.030
 dear+0.028
 importantly+0.028
 understanding+0.026
Neg logit contributions
 meets-0.038
 valleys-0.037
izons-0.037
 forests-0.034
ators-0.034
Token proud
Token ID6613
Feature activation-8.34e-02

Pos logit contributions
mmm+0.074
 dear+0.059
 badly+0.059
ittens+0.058
ww+0.058
Neg logit contributions
 meets-0.123
izons-0.117
 valleys-0.114
 parks-0.110
 ease-0.110
Token of
Token ID286
Feature activation-8.47e-04

Pos logit contributions
mmm+0.188
ww+0.162
 badly+0.161
www+0.154
ittens+0.150
Neg logit contributions
 meets-0.111
 asleep-0.104
 ones-0.102
 again-0.101
 mountains-0.099
Token her
Token ID607
Feature activation-3.10e-02

Pos logit contributions
 badly+0.001
mmm+0.001
 dear+0.001
 importantly+0.001
 caring+0.001
Neg logit contributions
 meets-0.002
 valleys-0.002
izons-0.002
 driveway-0.002
 mountains-0.002
Token for
Token ID329
Feature activation+6.18e-03

Pos logit contributions
mmm+0.058
ww+0.051
 badly+0.050
 importantly+0.050
www+0.048
Neg logit contributions
 meets-0.054
izons-0.051
sey-0.049
iece-0.048
 nursery-0.047
Token making
Token ID1642
Feature activation+1.37e-02

Pos logit contributions
 meets+0.013
 valleys+0.013
izons+0.012
ators+0.012
sey+0.012
Neg logit contributions
 badly-0.010
mmm-0.009
 understanding-0.009
 caring-0.009
 importantly-0.009
Token a
Token ID257
Feature activation-2.46e-02

Pos logit contributions
 valleys+0.027
 meets+0.026
izons+0.024
ators+0.024
 driveway+0.024
Neg logit contributions
 badly-0.024
mmm-0.023
 caring-0.022
 dear-0.022
 understanding-0.022
Tokenily
Token ID813
Feature activation-5.96e-03

Pos logit contributions
mmm+0.037
 badly+0.035
 importantly+0.034
 dear+0.033
 understanding+0.032
Neg logit contributions
 meets-0.036
izons-0.034
 valleys-0.032
iece-0.031
ators-0.031
Token's
Token ID338
Feature activation-1.13e-02

Pos logit contributions
mmm+0.013
 badly+0.012
 importantly+0.012
 dear+0.012
 understanding+0.011
Neg logit contributions
 meets-0.009
 valleys-0.008
izons-0.008
sey-0.007
 forests-0.007
Token mom
Token ID1995
Feature activation-1.41e-02

Pos logit contributions
mmm+0.020
 badly+0.018
 importantly+0.017
 dear+0.016
ww+0.016
Neg logit contributions
 meets-0.021
izons-0.020
 valleys-0.020
 ease-0.019
sey-0.019
Tokenmy
Token ID1820
Feature activation-1.10e-02

Pos logit contributions
mmm+0.037
 badly+0.036
 dear+0.035
 importantly+0.035
 understanding+0.034
Neg logit contributions
 meets-0.013
 valleys-0.013
izons-0.012
 forests-0.011
 mountains-0.011
Token was
Token ID373
Feature activation-5.01e-02

Pos logit contributions
mmm+0.019
 badly+0.018
 dear+0.017
 importantly+0.017
 understanding+0.016
Neg logit contributions
 meets-0.021
 valleys-0.020
izons-0.020
 forests-0.019
ators-0.019
Token proud
Token ID6613
Feature activation-8.31e-02

Pos logit contributions
mmm+0.074
 dear+0.061
ww+0.059
 importantly+0.058
 badly+0.057
Neg logit contributions
 meets-0.112
izons-0.106
 valleys-0.103
 parks-0.100
 ease-0.100
Token of
Token ID286
Feature activation-4.36e-03

Pos logit contributions
mmm+0.201
ww+0.173
 badly+0.173
www+0.167
ittens+0.160
Neg logit contributions
 meets-0.096
 again-0.091
 asleep-0.090
 ones-0.087
 circles-0.087
Token her
Token ID607
Feature activation-2.86e-02

Pos logit contributions
mmm+0.008
 badly+0.008
 dear+0.007
 importantly+0.007
 caring+0.007
Neg logit contributions
 meets-0.008
 valleys-0.008
izons-0.008
 forests-0.007
 mountains-0.007
Token for
Token ID329
Feature activation+3.80e-03

Pos logit contributions
mmm+0.059
ww+0.054
www+0.052
 importantly+0.050
 badly+0.048
Neg logit contributions
 meets-0.039
 nursery-0.038
 finds-0.038
sey-0.038
izons-0.037
Token remembering
Token ID24865
Feature activation-2.63e-02

Pos logit contributions
 meets+0.007
 valleys+0.007
izons+0.006
 forests+0.006
ators+0.006
Neg logit contributions
mmm-0.007
 badly-0.007
 importantly-0.006
 dear-0.006
 understanding-0.006
Token to
Token ID284
Feature activation+1.21e-02

Pos logit contributions
mmm+0.049
 badly+0.047
 dear+0.044
 importantly+0.044
 understanding+0.041
Neg logit contributions
 meets-0.046
 valleys-0.044
izons-0.044
 forests-0.041
 ease-0.040
Token!"
Token ID2474
Feature activation-2.14e-02

Pos logit contributions
mmm+0.058
 badly+0.055
 importantly+0.051
 dear+0.050
 understanding+0.047
Neg logit contributions
 meets-0.077
izons-0.073
 valleys-0.073
ators-0.068
 forests-0.068
Token Lily
Token ID20037
Feature activation-3.33e-02

Pos logit contributions
mmm+0.043
 badly+0.041
 importantly+0.039
 dear+0.038
 persuasive+0.037
Neg logit contributions
 meets-0.035
izons-0.032
 valleys-0.031
sey-0.029
 mountains-0.029
Token smiled
Token ID13541
Feature activation-4.67e-02

Pos logit contributions
mmm+0.068
 badly+0.066
 importantly+0.064
 dear+0.061
ittens+0.059
Neg logit contributions
 meets-0.053
izons-0.049
sey-0.047
 valleys-0.047
 parks-0.043
Token and
Token ID290
Feature activation-2.86e-02

Pos logit contributions
mmm+0.107
ww+0.095
www+0.095
ittens+0.091
 importantly+0.090
Neg logit contributions
 asleep-0.066
 meets-0.062
iece-0.061
 valleys-0.060
 out-0.060
Token felt
Token ID2936
Feature activation-5.60e-02

Pos logit contributions
mmm+0.043
 badly+0.042
 importantly+0.038
 dear+0.036
ittens+0.035
Neg logit contributions
 meets-0.062
izons-0.057
 valleys-0.057
 mountains-0.054
 ease-0.053
Token proud
Token ID6613
Feature activation-8.30e-02

Pos logit contributions
mmm+0.084
ittens+0.074
 badly+0.068
ww+0.064
 dear+0.064
Neg logit contributions
 meets-0.125
 ease-0.116
 valleys-0.115
izons-0.113
 anywhere-0.107
Token of
Token ID286
Feature activation-1.37e-02

Pos logit contributions
mmm+0.170
 badly+0.153
ww+0.147
www+0.143
ittens+0.141
Neg logit contributions
 meets-0.127
 asleep-0.124
 mountains-0.118
 ones-0.116
 valleys-0.114
Token herself
Token ID5223
Feature activation-4.17e-02

Pos logit contributions
mmm+0.028
 badly+0.028
 importantly+0.026
 dear+0.026
 understanding+0.025
Neg logit contributions
 meets-0.022
 valleys-0.021
izons-0.020
ators-0.019
 forests-0.019
Token.
Token ID13
Feature activation-3.68e-03

Pos logit contributions
mmm+0.089
ww+0.078
 badly+0.077
www+0.075
 importantly+0.075
Neg logit contributions
 meets-0.059
iece-0.056
sey-0.056
izons-0.056
 anywhere-0.054
Token She
Token ID1375
Feature activation-2.17e-02

Pos logit contributions
mmm+0.006
 badly+0.006
 importantly+0.005
 dear+0.005
 understanding+0.005
Neg logit contributions
 meets-0.007
 valleys-0.007
izons-0.007
 forests-0.006
sey-0.006
Token replied
Token ID8712
Feature activation-2.54e-02

Pos logit contributions
mmm+0.045
 badly+0.043
 importantly+0.040
ittens+0.040
 dear+0.040
Neg logit contributions
 meets-0.036
 valleys-0.030
izons-0.030
sey-0.029
 finds-0.029
Token."
Token ID526
Feature activation-1.28e-02

Pos logit contributions
mmm+0.047
 badly+0.045
 dear+0.042
 importantly+0.041
 understanding+0.038
Neg logit contributions
 meets-0.068
izons-0.066
 valleys-0.066
 ease-0.061
 forests-0.061
Token Lily
Token ID20037
Feature activation-2.14e-02

Pos logit contributions
mmm+0.022
 badly+0.021
 dear+0.020
 importantly+0.019
 understanding+0.018
Neg logit contributions
 meets-0.025
 valleys-0.024
izons-0.024
 forests-0.022
ators-0.022
Token smiled
Token ID13541
Feature activation-3.96e-02

Pos logit contributions
mmm+0.046
 importantly+0.045
 badly+0.044
 dear+0.042
ittens+0.040
Neg logit contributions
 meets-0.031
izons-0.028
sey-0.028
 valleys-0.026
 finds-0.025
Token and
Token ID290
Feature activation-3.06e-02

Pos logit contributions
mmm+0.090
www+0.081
ww+0.081
 importantly+0.079
ittens+0.077
Neg logit contributions
 asleep-0.058
 out-0.052
 meets-0.051
 valleys-0.051
iece-0.050
Token felt
Token ID2936
Feature activation-5.43e-02

Pos logit contributions
mmm+0.046
 badly+0.044
 importantly+0.040
 dear+0.038
ittens+0.037
Neg logit contributions
 meets-0.065
izons-0.061
 valleys-0.061
 mountains-0.058
sey-0.057
Token proud
Token ID6613
Feature activation-8.29e-02

Pos logit contributions
mmm+0.088
ittens+0.079
 badly+0.072
 importantly+0.070
ww+0.069
Neg logit contributions
 meets-0.114
 valleys-0.106
 ease-0.106
izons-0.102
 anywhere-0.099
Token of
Token ID286
Feature activation-1.36e-02

Pos logit contributions
mmm+0.177
 badly+0.157
ww+0.153
ittens+0.148
www+0.148
Neg logit contributions
 meets-0.123
 asleep-0.120
 mountains-0.116
 ones-0.113
 valleys-0.111
Token herself
Token ID5223
Feature activation-3.81e-02

Pos logit contributions
mmm+0.029
 badly+0.028
 importantly+0.026
 dear+0.026
 understanding+0.025
Neg logit contributions
 meets-0.022
 valleys-0.021
izons-0.020
ators-0.019
 driveway-0.019
Token.
Token ID13
Feature activation-2.50e-03

Pos logit contributions
mmm+0.083
ww+0.074
 importantly+0.072
 badly+0.071
www+0.071
Neg logit contributions
 meets-0.050
sey-0.050
iece-0.049
 anywhere-0.049
izons-0.049
Token She
Token ID1375
Feature activation-2.10e-02

Pos logit contributions
mmm+0.004
 dear+0.003
 badly+0.003
 importantly+0.003
 understanding+0.003
Neg logit contributions
 meets-0.005
 valleys-0.005
izons-0.005
ators-0.005
 ease-0.005
Token wanted
Token ID2227
Feature activation+8.83e-03

Pos logit contributions
mmm+0.043
 badly+0.042
 importantly+0.039
ittens+0.038
 dear+0.038
Neg logit contributions
 meets-0.035
 valleys-0.030
sey-0.029
 finds-0.029
izons-0.029
Token.
Token ID13
Feature activation-4.28e-04

Pos logit contributions
mmm+0.051
 badly+0.048
 dear+0.045
 importantly+0.045
 understanding+0.042
Neg logit contributions
 meets-0.062
 valleys-0.059
izons-0.059
 forests-0.055
sey-0.054
Token Lily
Token ID20037
Feature activation-1.44e-02

Pos logit contributions
mmm+0.001
 badly+0.001
 importantly+0.001
ww+0.001
ittens+0.001
Neg logit contributions
 meets-0.001
izons-0.001
 valleys-0.001
 parks-0.001
 hall-0.001
Token smiled
Token ID13541
Feature activation-4.11e-02

Pos logit contributions
mmm+0.028
 badly+0.027
 importantly+0.025
 dear+0.025
ittens+0.024
Neg logit contributions
 meets-0.025
izons-0.023
 valleys-0.023
sey-0.021
 forests-0.021
Token and
Token ID290
Feature activation-2.54e-02

Pos logit contributions
mmm+0.094
ww+0.083
www+0.082
ittens+0.081
 importantly+0.081
Neg logit contributions
 asleep-0.060
 meets-0.055
 valleys-0.055
 out-0.054
 mountains-0.054
Token felt
Token ID2936
Feature activation-4.42e-02

Pos logit contributions
mmm+0.040
 badly+0.038
 importantly+0.034
 dear+0.033
ittens+0.032
Neg logit contributions
 meets-0.054
 valleys-0.050
izons-0.049
 forests-0.047
 mountains-0.047
Token proud
Token ID6613
Feature activation-8.28e-02

Pos logit contributions
mmm+0.070
ittens+0.060
 badly+0.059
 importantly+0.057
 dear+0.055
Neg logit contributions
 meets-0.096
 valleys-0.090
izons-0.087
 ease-0.086
 forests-0.084
Token.
Token ID13
Feature activation-2.35e-03

Pos logit contributions
mmm+0.182
 badly+0.163
ww+0.157
www+0.152
ittens+0.151
Neg logit contributions
 meets-0.115
 mountains-0.114
 asleep-0.114
 ones-0.107
 curtains-0.106
Token "
Token ID366
Feature activation-1.56e-02

Pos logit contributions
mmm+0.004
 badly+0.003
 dear+0.003
 importantly+0.003
 understanding+0.003
Neg logit contributions
 meets-0.005
 valleys-0.005
izons-0.005
ators-0.004
 ease-0.004
TokenThank
Token ID10449
Feature activation-2.12e-02

Pos logit contributions
 badly+0.034
mmm+0.033
 dear+0.031
ittens+0.031
 importantly+0.029
Neg logit contributions
 valleys-0.022
 meets-0.021
izons-0.021
 forests-0.020
ators-0.019
Token you
Token ID345
Feature activation-3.55e-02

Pos logit contributions
 badly+0.040
mmm+0.038
 dear+0.037
 importantly+0.037
 enjoyable+0.037
Neg logit contributions
 meets-0.034
 valleys-0.031
izons-0.031
 mountains-0.030
 forests-0.030
Token,
Token ID11
Feature activation-2.76e-02

Pos logit contributions
 importantly+0.071
mmm+0.068
 persuasive+0.067
 enjoyable+0.066
 annoying+0.065
Neg logit contributions
sey-0.045
 asleep-0.045
 curtains-0.045
 out-0.044
 anywhere-0.043
Token."
Token ID526
Feature activation-1.43e-02

Pos logit contributions
mmm+0.033
 badly+0.030
 dear+0.027
 importantly+0.025
 understanding+0.024
Neg logit contributions
 meets-0.069
izons-0.068
 valleys-0.067
 ease-0.063
 forests-0.063
Token Lily
Token ID20037
Feature activation-2.40e-02

Pos logit contributions
mmm+0.026
 badly+0.026
 importantly+0.024
 dear+0.023
 persuasive+0.023
Neg logit contributions
 meets-0.025
izons-0.024
 valleys-0.023
 parks-0.022
sey-0.022
Token smiled
Token ID13541
Feature activation-4.36e-02

Pos logit contributions
mmm+0.044
 badly+0.042
 importantly+0.040
 dear+0.039
ittens+0.037
Neg logit contributions
 meets-0.044
izons-0.041
 valleys-0.040
sey-0.038
 forests-0.037
Token and
Token ID290
Feature activation-2.92e-02

Pos logit contributions
mmm+0.100
ww+0.091
www+0.089
ittens+0.088
 importantly+0.087
Neg logit contributions
 asleep-0.060
 meets-0.058
 valleys-0.057
iece-0.056
sey-0.055
Token felt
Token ID2936
Feature activation-5.24e-02

Pos logit contributions
mmm+0.045
 badly+0.043
 importantly+0.040
 dear+0.039
ittens+0.036
Neg logit contributions
 meets-0.061
 valleys-0.058
izons-0.058
 forests-0.054
 ease-0.054
Token proud
Token ID6613
Feature activation-8.28e-02

Pos logit contributions
mmm+0.085
ittens+0.076
 badly+0.069
ww+0.069
 importantly+0.066
Neg logit contributions
 meets-0.111
 ease-0.103
 valleys-0.102
izons-0.099
 parks-0.096
Token of
Token ID286
Feature activation-1.37e-02

Pos logit contributions
mmm+0.167
 badly+0.155
ww+0.148
ittens+0.142
www+0.141
Neg logit contributions
 meets-0.129
 mountains-0.121
 asleep-0.120
 valleys-0.118
 bridges-0.116
Token her
Token ID607
Feature activation-9.78e-03

Pos logit contributions
mmm+0.029
 badly+0.028
 importantly+0.026
 dear+0.026
 understanding+0.026
Neg logit contributions
 meets-0.022
 valleys-0.021
izons-0.020
 forests-0.019
ators-0.019
Token work
Token ID670
Feature activation-2.49e-02

Pos logit contributions
mmm+0.021
 badly+0.020
 dear+0.019
 importantly+0.019
 understanding+0.018
Neg logit contributions
 meets-0.014
 valleys-0.014
izons-0.014
 forests-0.012
ators-0.012
Token.
Token ID13
Feature activation+2.46e-03

Pos logit contributions
mmm+0.037
 badly+0.035
 dear+0.032
 importantly+0.032
 understanding+0.030
Neg logit contributions
 meets-0.053
 valleys-0.052
izons-0.051
 forests-0.048
ators-0.047
Token She
Token ID1375
Feature activation-1.74e-02

Pos logit contributions
 meets+0.005
 valleys+0.005
izons+0.005
 forests+0.004
ators+0.004
Neg logit contributions
mmm-0.004
 badly-0.004
 dear-0.004
 importantly-0.004
 understanding-0.003
Token Lily
Token ID20037
Feature activation-2.04e-02

Pos logit contributions
 meets+0.008
 valleys+0.007
izons+0.007
 forests+0.007
 ease+0.007
Neg logit contributions
mmm-0.007
 badly-0.007
 importantly-0.006
 dear-0.006
 understanding-0.006
Token smiled
Token ID13541
Feature activation-4.00e-02

Pos logit contributions
mmm+0.042
 badly+0.041
 dear+0.038
 importantly+0.038
 understanding+0.036
Neg logit contributions
 meets-0.032
 valleys-0.030
izons-0.030
 forests-0.027
ators-0.027
Token and
Token ID290
Feature activation-3.14e-02

Pos logit contributions
mmm+0.088
ww+0.078
www+0.078
ittens+0.077
 importantly+0.075
Neg logit contributions
 asleep-0.058
 meets-0.057
 valleys-0.055
iece-0.055
izons-0.053
Token felt
Token ID2936
Feature activation-5.82e-02

Pos logit contributions
mmm+0.047
 badly+0.046
 importantly+0.041
 dear+0.040
ittens+0.039
Neg logit contributions
 meets-0.068
izons-0.063
 valleys-0.062
 ease-0.059
 mountains-0.059
Token very
Token ID845
Feature activation-7.34e-02

Pos logit contributions
mmm+0.094
ittens+0.086
 badly+0.077
 importantly+0.074
ww+0.073
Neg logit contributions
 meets-0.121
 ease-0.113
 valleys-0.111
izons-0.108
 anywhere-0.105
Token proud
Token ID6613
Feature activation-8.26e-02

Pos logit contributions
mmm+0.099
ittens+0.095
 badly+0.094
 importantly+0.081
ww+0.079
Neg logit contributions
 meets-0.165
izons-0.145
 valleys-0.145
 ease-0.144
 finds-0.143
Token of
Token ID286
Feature activation-1.78e-02

Pos logit contributions
mmm+0.174
 badly+0.155
ww+0.152
ittens+0.148
www+0.146
Neg logit contributions
 meets-0.123
 asleep-0.119
 mountains-0.114
 ones-0.113
 valleys-0.110
Token herself
Token ID5223
Feature activation-3.82e-02

Pos logit contributions
mmm+0.038
 badly+0.037
 importantly+0.035
 dear+0.035
 understanding+0.034
Neg logit contributions
 meets-0.027
 valleys-0.026
izons-0.025
ators-0.023
 forests-0.023
Token.
Token ID13
Feature activation+6.38e-03

Pos logit contributions
mmm+0.077
 badly+0.069
ww+0.068
 importantly+0.068
ittens+0.066
Neg logit contributions
 meets-0.059
izons-0.055
iece-0.054
sey-0.053
 anywhere-0.052
Token<|endoftext|>
Token ID50256
Feature activation+6.97e-02

Pos logit contributions
 meets+0.013
 valleys+0.013
izons+0.012
ators+0.012
 bridges+0.012
Neg logit contributions
mmm-0.010
 badly-0.009
 dear-0.009
 importantly-0.009
 understanding-0.008
TokenOnce
Token ID7454
Feature activation+1.86e-02

Pos logit contributions
izons+0.162
 valleys+0.161
 meets+0.159
ators+0.159
iece+0.156
Neg logit contributions
 badly-0.073
mmm-0.072
 importantly-0.063
From-0.061
 dear-0.057
Token mom
Token ID1995
Feature activation-1.35e-02

Pos logit contributions
 badly+0.001
mmm+0.001
 dear+0.001
 importantly+0.001
 understanding+0.001
Neg logit contributions
 meets-0.002
 valleys-0.002
izons-0.002
iece-0.002
 forests-0.002
Tokenmy
Token ID1820
Feature activation-5.78e-03

Pos logit contributions
 badly+0.030
mmm+0.029
 understanding+0.029
 dear+0.029
 importantly+0.028
Neg logit contributions
 valleys-0.018
 meets-0.017
izons-0.017
 mountains-0.015
 forests-0.015
Token and
Token ID290
Feature activation-2.59e-02

Pos logit contributions
mmm+0.008
 badly+0.008
 dear+0.008
 understanding+0.008
 importantly+0.008
Neg logit contributions
 valleys-0.012
 meets-0.012
izons-0.012
 forests-0.011
ators-0.011
Token daddy
Token ID35822
Feature activation+4.47e-04

Pos logit contributions
mmm+0.046
 badly+0.044
 dear+0.042
 importantly+0.041
 understanding+0.039
Neg logit contributions
 meets-0.048
 valleys-0.046
izons-0.046
 forests-0.042
 ease-0.042
Token were
Token ID547
Feature activation-4.94e-02

Pos logit contributions
 meets+0.001
 valleys+0.001
izons+0.001
 forests+0.001
 mountains+0.001
Neg logit contributions
 badly-0.001
mmm-0.001
 understanding-0.001
 dear-0.001
 importantly-0.001
Token proud
Token ID6613
Feature activation-8.26e-02

Pos logit contributions
mmm+0.076
 dear+0.062
ww+0.061
 importantly+0.060
 badly+0.059
Neg logit contributions
 meets-0.108
 valleys-0.102
izons-0.100
 parks-0.096
 ease-0.096
Token of
Token ID286
Feature activation-6.93e-03

Pos logit contributions
mmm+0.186
 badly+0.161
ww+0.154
www+0.150
ittens+0.147
Neg logit contributions
 meets-0.117
 mountains-0.112
 asleep-0.111
 bridges-0.108
 ones-0.108
Token her
Token ID607
Feature activation-3.44e-02

Pos logit contributions
mmm+0.013
 badly+0.013
 importantly+0.012
 dear+0.012
 caring+0.011
Neg logit contributions
 meets-0.012
 valleys-0.012
izons-0.012
ators-0.011
 mountains-0.011
Token for
Token ID329
Feature activation+4.52e-03

Pos logit contributions
mmm+0.068
ww+0.060
www+0.057
 importantly+0.057
 badly+0.056
Neg logit contributions
 meets-0.056
izons-0.052
sey-0.051
 valleys-0.050
 anywhere-0.050
Token loving
Token ID14442
Feature activation-7.82e-03

Pos logit contributions
 meets+0.008
 valleys+0.008
izons+0.008
ators+0.007
sey+0.007
Neg logit contributions
mmm-0.008
 badly-0.008
 importantly-0.007
 dear-0.007
 understanding-0.007
Token the
Token ID262
Feature activation+1.94e-02

Pos logit contributions
 dear+0.012
 badly+0.012
 understanding+0.012
 importantly+0.011
mmm+0.011
Neg logit contributions
 meets-0.016
 valleys-0.015
izons-0.015
neath-0.015
ators-0.014
Token mom
Token ID1995
Feature activation-2.31e-02

Pos logit contributions
mmm+0.011
 badly+0.011
 importantly+0.010
 dear+0.010
ww+0.009
Neg logit contributions
 meets-0.013
izons-0.013
 valleys-0.013
 forests-0.012
sey-0.012
Tokenmy
Token ID1820
Feature activation-1.87e-02

Pos logit contributions
mmm+0.053
 badly+0.052
 dear+0.051
 understanding+0.051
 importantly+0.050
Neg logit contributions
 valleys-0.028
 meets-0.027
izons-0.027
 ease-0.024
 mountains-0.024
Token and
Token ID290
Feature activation-2.95e-02

Pos logit contributions
mmm+0.027
 badly+0.025
 dear+0.025
 understanding+0.024
 importantly+0.024
Neg logit contributions
 valleys-0.039
 meets-0.039
izons-0.038
 bridges-0.036
 ease-0.036
Token daddy
Token ID35822
Feature activation-9.25e-03

Pos logit contributions
mmm+0.048
 badly+0.046
 dear+0.044
 importantly+0.044
 understanding+0.042
Neg logit contributions
 meets-0.058
 valleys-0.056
izons-0.056
 forests-0.051
ators-0.051
Token were
Token ID547
Feature activation-4.30e-02

Pos logit contributions
 badly+0.015
mmm+0.015
 understanding+0.014
 dear+0.014
 importantly+0.014
Neg logit contributions
 meets-0.017
 valleys-0.017
izons-0.017
 forests-0.016
 bridges-0.015
Token proud
Token ID6613
Feature activation-8.24e-02

Pos logit contributions
mmm+0.061
 badly+0.059
 importantly+0.053
 dear+0.053
 understanding+0.050
Neg logit contributions
 meets-0.095
 valleys-0.091
izons-0.091
 forests-0.085
 ease-0.085
Token of
Token ID286
Feature activation-7.65e-03

Pos logit contributions
mmm+0.188
 badly+0.165
ww+0.158
www+0.153
ittens+0.149
Neg logit contributions
 meets-0.109
 mountains-0.106
 asleep-0.105
 ones-0.105
 curtains-0.104
Token her
Token ID607
Feature activation-2.24e-02

Pos logit contributions
mmm+0.014
 badly+0.014
 importantly+0.013
 dear+0.013
 caring+0.012
Neg logit contributions
 meets-0.014
 valleys-0.013
izons-0.013
 forests-0.012
 mountains-0.012
Token for
Token ID329
Feature activation+5.82e-03

Pos logit contributions
mmm+0.044
ww+0.038
 badly+0.038
 importantly+0.037
www+0.036
Neg logit contributions
 meets-0.037
izons-0.035
 nursery-0.033
sey-0.033
 valleys-0.033
Token being
Token ID852
Feature activation-2.38e-02

Pos logit contributions
 meets+0.010
 valleys+0.010
izons+0.010
 forests+0.009
sey+0.009
Neg logit contributions
mmm-0.011
 badly-0.011
 importantly-0.010
 dear-0.010
 understanding-0.009
Token so
Token ID523
Feature activation-5.06e-02

Pos logit contributions
mmm+0.033
 badly+0.029
 dear+0.028
 importantly+0.027
ittens+0.026
Neg logit contributions
 meets-0.054
 valleys-0.052
izons-0.051
ators-0.048
 forests-0.048
Token happy
Token ID3772
Feature activation-5.78e-02

Pos logit contributions
mmm+0.074
 badly+0.062
 dear+0.061
 importantly+0.059
ww+0.059
Neg logit contributions
 meets-0.118
 valleys-0.111
izons-0.110
 parks-0.106
 ease-0.105
Token,
Token ID11
Feature activation-4.40e-02

Pos logit contributions
mmm+0.112
 badly+0.105
 dear+0.099
 importantly+0.098
ww+0.096
Neg logit contributions
 meets-0.099
 valleys-0.091
izons-0.090
 mountains-0.085
 forests-0.085
Token and
Token ID290
Feature activation-3.64e-02

Pos logit contributions
mmm+0.085
 badly+0.077
ww+0.073
 importantly+0.072
ittens+0.072
Neg logit contributions
 meets-0.072
 asleep-0.067
 valleys-0.065
 parks-0.064
 mountains-0.064
Token Lily
Token ID20037
Feature activation-1.48e-02

Pos logit contributions
mmm+0.070
 badly+0.063
ittens+0.059
 importantly+0.056
ww+0.056
Neg logit contributions
 meets-0.064
izons-0.054
 finds-0.054
 asleep-0.054
 valleys-0.054
Token felt
Token ID2936
Feature activation-5.54e-02

Pos logit contributions
mmm+0.026
 badly+0.025
 dear+0.024
 importantly+0.023
 understanding+0.023
Neg logit contributions
 meets-0.026
 valleys-0.026
izons-0.026
 forests-0.024
 bridges-0.024
Token proud
Token ID6613
Feature activation-8.24e-02

Pos logit contributions
mmm+0.089
ittens+0.076
ww+0.070
 badly+0.069
www+0.068
Neg logit contributions
 meets-0.118
 ease-0.109
 valleys-0.108
 anywhere-0.106
izons-0.105
Token.
Token ID13
Feature activation+1.28e-03

Pos logit contributions
mmm+0.173
 badly+0.153
ww+0.148
ittens+0.144
www+0.143
Neg logit contributions
 meets-0.126
 asleep-0.120
 mountains-0.115
 ones-0.112
 curtains-0.111
Token As
Token ID1081
Feature activation-4.08e-02

Pos logit contributions
 meets+0.002
 valleys+0.002
izons+0.002
 forests+0.002
ators+0.002
Neg logit contributions
mmm-0.002
 badly-0.002
 dear-0.002
 importantly-0.002
 understanding-0.002
Token a
Token ID257
Feature activation-5.97e-02

Pos logit contributions
mmm+0.066
 badly+0.063
 dear+0.059
 importantly+0.058
 persuasive+0.054
Neg logit contributions
 meets-0.082
 valleys-0.079
izons-0.078
 forests-0.073
 ease-0.073
Token treat
Token ID2190
Feature activation+1.26e-02

Pos logit contributions
mmm+0.120
 badly+0.110
ittens+0.098
 importantly+0.097
 enjoyable+0.092
Neg logit contributions
 meets-0.091
 bottom-0.089
 theater-0.087
 nursery-0.084
 hall-0.083
Token,
Token ID11
Feature activation-7.32e-03

Pos logit contributions
 valleys+0.026
 meets+0.026
izons+0.025
 forests+0.024
ators+0.023
Neg logit contributions
mmm-0.019
 badly-0.019
 dear-0.018
 importantly-0.018
 understanding-0.017
Token all
Token ID477
Feature activation-1.01e-02

Pos logit contributions
 meets+0.031
izons+0.029
 valleys+0.029
ators+0.027
sey+0.027
Neg logit contributions
mmm-0.029
 badly-0.029
 dear-0.026
 importantly-0.026
 understanding-0.025
Token day
Token ID1110
Feature activation-4.04e-02

Pos logit contributions
mmm+0.022
 badly+0.022
 dear+0.021
 importantly+0.020
 understanding+0.019
Neg logit contributions
 meets-0.015
 valleys-0.014
izons-0.013
 forests-0.012
sey-0.012
Token.
Token ID13
Feature activation+2.66e-03

Pos logit contributions
mmm+0.068
 dear+0.063
 importantly+0.060
 badly+0.060
 understanding+0.057
Neg logit contributions
 meets-0.076
 valleys-0.075
 mountains-0.070
 bottom-0.070
sey-0.069
Token They
Token ID1119
Feature activation-1.13e-02

Pos logit contributions
 meets+0.005
 valleys+0.005
izons+0.004
 forests+0.004
 mountains+0.004
Neg logit contributions
mmm-0.005
 badly-0.005
 importantly-0.004
 dear-0.004
 persuasive-0.004
Token were
Token ID547
Feature activation-4.82e-02

Pos logit contributions
mmm+0.021
 badly+0.021
 importantly+0.019
 dear+0.019
 understanding+0.019
Neg logit contributions
 meets-0.019
izons-0.019
 valleys-0.018
ators-0.017
 forests-0.017
Token very
Token ID845
Feature activation-8.24e-02

Pos logit contributions
mmm+0.078
 badly+0.072
 dear+0.068
 importantly+0.068
ww+0.063
Neg logit contributions
 meets-0.098
 valleys-0.094
izons-0.092
 ease-0.088
 forests-0.087
Token happy
Token ID3772
Feature activation-5.41e-02

Pos logit contributions
mmm+0.146
 badly+0.124
 importantly+0.119
 dear+0.118
ittens+0.113
Neg logit contributions
 meets-0.151
 valleys-0.139
 bridges-0.133
 ease-0.130
 bottom-0.128
Token to
Token ID284
Feature activation+9.14e-03

Pos logit contributions
mmm+0.094
 badly+0.087
 dear+0.082
 importantly+0.081
 persuasive+0.077
Neg logit contributions
 meets-0.104
 valleys-0.099
izons-0.097
 mountains-0.093
 forests-0.092
Token have
Token ID423
Feature activation+1.54e-02

Pos logit contributions
 meets+0.014
 valleys+0.014
izons+0.014
 forests+0.013
ators+0.013
Neg logit contributions
mmm-0.019
 badly-0.018
 dear-0.017
 understanding-0.016
 caring-0.016
Token found
Token ID1043
Feature activation+3.12e-02

Pos logit contributions
 meets+0.030
 valleys+0.029
izons+0.029
 forests+0.027
sey+0.027
Neg logit contributions
mmm-0.026
 badly-0.025
 importantly-0.023
 dear-0.023
 understanding-0.022
Token a
Token ID257
Feature activation-1.42e-02

Pos logit contributions
 meets+0.061
izons+0.060
 valleys+0.059
 driveway+0.057
ators+0.056
Neg logit contributions
 badly-0.052
 dear-0.048
 caring-0.048
mmm-0.047
 understanding-0.046
Token and
Token ID290
Feature activation-2.44e-02

Pos logit contributions
mmm+0.019
 badly+0.018
 dear+0.016
 understanding+0.016
 importantly+0.016
Neg logit contributions
 meets-0.024
 valleys-0.023
izons-0.023
ators-0.021
 forests-0.021
Token smiled
Token ID13541
Feature activation-6.17e-02

Pos logit contributions
mmm+0.031
 badly+0.029
 understanding+0.029
 importantly+0.028
 dear+0.028
Neg logit contributions
 meets-0.055
izons-0.055
 valleys-0.054
 ease-0.051
sey-0.051
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
mmm+0.148
ittens+0.129
www+0.127
ww+0.126
 importantly+0.120
Neg logit contributions
 asleep-0.082
 meets-0.077
 valleys-0.076
 out-0.075
 ones-0.075
Token She
Token ID1375
Feature activation-1.08e-02

Pos logit contributions
mmm+0.026
 badly+0.025
 importantly+0.023
ww+0.022
 dear+0.022
Neg logit contributions
 meets-0.019
 valleys-0.019
izons-0.018
 mountains-0.017
 parks-0.017
Token was
Token ID373
Feature activation-5.33e-02

Pos logit contributions
mmm+0.023
 badly+0.022
 importantly+0.021
 dear+0.021
ittens+0.020
Neg logit contributions
 meets-0.016
 valleys-0.015
izons-0.015
 forests-0.014
 parks-0.013
Token proud
Token ID6613
Feature activation-8.22e-02

Pos logit contributions
mmm+0.058
 badly+0.050
 dear+0.046
 importantly+0.045
ittens+0.042
Neg logit contributions
 meets-0.139
izons-0.131
 valleys-0.131
 ease-0.125
 parks-0.125
Token of
Token ID286
Feature activation-6.51e-03

Pos logit contributions
mmm+0.186
 badly+0.166
ww+0.156
www+0.152
ittens+0.147
Neg logit contributions
 meets-0.116
 mountains-0.107
 ones-0.107
 bridges-0.106
 asleep-0.104
Token them
Token ID606
Feature activation-2.96e-02

Pos logit contributions
mmm+0.016
 badly+0.016
 importantly+0.015
 dear+0.015
 caring+0.014
Neg logit contributions
 meets-0.008
 valleys-0.007
izons-0.007
 driveway-0.006
 forests-0.006
Token.
Token ID13
Feature activation-1.32e-02

Pos logit contributions
mmm+0.060
ww+0.055
 importantly+0.054
 badly+0.053
www+0.052
Neg logit contributions
 meets-0.046
izons-0.042
 anywhere-0.041
sey-0.041
 bottom-0.040
Token She
Token ID1375
Feature activation-1.64e-02

Pos logit contributions
 badly+0.029
mmm+0.029
ww+0.026
 persuasive+0.026
 importantly+0.025
Neg logit contributions
 meets-0.018
 valleys-0.017
 mountains-0.016
 parks-0.016
izons-0.016
Token said
Token ID531
Feature activation-9.40e-03

Pos logit contributions
mmm+0.033
 badly+0.031
 importantly+0.028
ittens+0.028
 dear+0.028
Neg logit contributions
 meets-0.029
izons-0.025
 valleys-0.025
 parks-0.024
sey-0.023
Token liked
Token ID8288
Feature activation-1.14e-02

Pos logit contributions
izons+0.006
 meets+0.005
 valleys+0.005
neath+0.005
 ease+0.005
Neg logit contributions
mmm-0.005
 badly-0.004
 understanding-0.004
 dear-0.004
 importantly-0.004
Token counting
Token ID14143
Feature activation-1.57e-03

Pos logit contributions
 caring+0.016
mmm+0.016
 sweet+0.015
 understanding+0.015
 importantly+0.015
Neg logit contributions
izons-0.025
 meets-0.023
neath-0.021
iece-0.021
 driveway-0.021
Token and
Token ID290
Feature activation-3.27e-02

Pos logit contributions
mmm+0.002
 badly+0.002
 dear+0.002
 importantly+0.002
 sweet+0.002
Neg logit contributions
 meets-0.003
izons-0.003
 driveway-0.003
ASH-0.003
neath-0.003
Token he
Token ID339
Feature activation+4.97e-03

Pos logit contributions
mmm+0.058
 understanding+0.057
 caring+0.055
 importantly+0.054
 dear+0.053
Neg logit contributions
izons-0.058
 meets-0.053
ators-0.052
ips-0.052
robe-0.052
Token was
Token ID373
Feature activation-5.17e-02

Pos logit contributions
izons+0.010
 meets+0.009
neath+0.009
 valleys+0.009
 asleep+0.009
Neg logit contributions
mmm-0.007
 understanding-0.007
 importantly-0.007
 badly-0.007
 dear-0.007
Token very
Token ID845
Feature activation-8.21e-02

Pos logit contributions
mmm+0.084
 badly+0.082
 importantly+0.078
 dear+0.077
 understanding+0.075
Neg logit contributions
 meets-0.101
izons-0.098
 valleys-0.096
ators-0.089
 forests-0.088
Token good
Token ID922
Feature activation-5.05e-02

Pos logit contributions
mmm+0.126
 badly+0.117
ittens+0.108
 dear+0.104
ww+0.103
Neg logit contributions
 meets-0.171
 valleys-0.167
 bridges-0.155
 parks-0.154
 mountains-0.154
Token at
Token ID379
Feature activation+1.84e-02

Pos logit contributions
mmm+0.075
 badly+0.069
 dear+0.064
 importantly+0.063
ww+0.060
Neg logit contributions
 meets-0.109
 valleys-0.104
izons-0.104
 mountains-0.098
 forests-0.098
Token it
Token ID340
Feature activation-2.37e-02

Pos logit contributions
izons+0.037
 meets+0.034
robe+0.034
 driveway+0.031
ancer+0.031
Neg logit contributions
 caring-0.027
 understanding-0.027
mmm-0.026
 sweet-0.025
 badly-0.025
Token.
Token ID13
Feature activation+4.66e-03

Pos logit contributions
mmm+0.033
 badly+0.032
 importantly+0.029
 dear+0.029
 understanding+0.027
Neg logit contributions
 meets-0.053
izons-0.051
 valleys-0.051
ators-0.048
 forests-0.047
Token He
Token ID679
Feature activation-6.36e-03

Pos logit contributions
 meets+0.010
izons+0.010
 valleys+0.010
ators+0.010
 anywhere+0.009
Neg logit contributions
mmm-0.007
 dear-0.006
 badly-0.006
 importantly-0.006
 understanding-0.005
Token!"
Token ID2474
Feature activation-1.75e-02

Pos logit contributions
mmm+0.046
 badly+0.043
 dear+0.040
 importantly+0.038
 understanding+0.035
Neg logit contributions
 meets-0.087
 valleys-0.085
izons-0.084
 forests-0.080
 ease-0.079
Token Sue
Token ID26089
Feature activation-2.06e-02

Pos logit contributions
mmm+0.029
 badly+0.028
 importantly+0.027
 dear+0.026
ww+0.025
Neg logit contributions
 meets-0.033
izons-0.031
 valleys-0.031
 parks-0.029
 ease-0.029
Token smiled
Token ID13541
Feature activation-4.87e-02

Pos logit contributions
mmm+0.043
 badly+0.041
 importantly+0.039
 dear+0.039
ww+0.036
Neg logit contributions
 meets-0.032
izons-0.030
 valleys-0.030
sey-0.028
 forests-0.027
Token and
Token ID290
Feature activation-2.94e-02

Pos logit contributions
mmm+0.113
ww+0.102
www+0.100
ittens+0.099
 importantly+0.097
Neg logit contributions
 asleep-0.068
 meets-0.063
 out-0.062
iece-0.062
 valleys-0.062
Token felt
Token ID2936
Feature activation-5.53e-02

Pos logit contributions
mmm+0.049
 badly+0.047
 importantly+0.043
 dear+0.041
ittens+0.040
Neg logit contributions
 meets-0.059
 valleys-0.055
izons-0.054
 ease-0.051
sey-0.051
Token proud
Token ID6613
Feature activation-8.19e-02

Pos logit contributions
mmm+0.090
ittens+0.081
 badly+0.074
ww+0.073
 importantly+0.071
Neg logit contributions
 meets-0.115
 ease-0.108
 valleys-0.106
izons-0.103
 anywhere-0.100
Token.
Token ID13
Feature activation+4.85e-04

Pos logit contributions
mmm+0.176
 badly+0.155
ww+0.155
ittens+0.149
www+0.149
Neg logit contributions
 meets-0.116
 asleep-0.113
 mountains-0.109
 ones-0.107
 valleys-0.106
Token From
Token ID3574
Feature activation-3.00e-02

Pos logit contributions
 meets+0.001
 valleys+0.001
izons+0.001
 forests+0.001
 ease+0.001
Neg logit contributions
mmm-0.001
 badly-0.001
 dear-0.001
 importantly-0.001
 understanding-0.001
Token that
Token ID326
Feature activation+1.71e-02

Pos logit contributions
mmm+0.055
 badly+0.050
 importantly+0.050
ittens+0.049
 dear+0.048
Neg logit contributions
 meets-0.050
 mountains-0.047
 parks-0.047
 up-0.046
ators-0.046
Token day
Token ID1110
Feature activation-4.69e-03

Pos logit contributions
 meets+0.024
izons+0.023
 valleys+0.022
sey+0.020
 forests+0.020
Neg logit contributions
mmm-0.038
 badly-0.037
 dear-0.035
 importantly-0.035
 understanding-0.033
Token on
Token ID319
Feature activation+9.07e-03

Pos logit contributions
mmm+0.013
 badly+0.012
 dear+0.012
 importantly+0.012
 enjoyable+0.012
Neg logit contributions
 meets-0.004
izons-0.004
 valleys-0.004
ips-0.003
iece-0.003
Token your
Token ID534
Feature activation+1.18e-02

Pos logit contributions
izons+0.086
ators+0.085
pit+0.085
 driveway+0.084
 meets+0.083
Neg logit contributions
 badly-0.080
 dear-0.073
 caring-0.071
 understanding-0.068
mmm-0.067
Token health
Token ID1535
Feature activation+1.14e-02

Pos logit contributions
izons+0.021
 valleys+0.020
 meets+0.020
neath+0.019
ators+0.019
Neg logit contributions
 dear-0.021
 badly-0.021
 importantly-0.020
mmm-0.019
 caring-0.019
Token for
Token ID329
Feature activation+1.95e-02

Pos logit contributions
 meets+0.027
 valleys+0.026
izons+0.026
 forests+0.025
 ease+0.024
Neg logit contributions
 badly-0.015
mmm-0.013
 dear-0.012
 importantly-0.012
 enjoyable-0.011
Token granted
Token ID7520
Feature activation-2.50e-03

Pos logit contributions
 meets+0.033
izons+0.032
 valleys+0.032
ators+0.029
sey+0.029
Neg logit contributions
mmm-0.037
 badly-0.037
 dear-0.034
 importantly-0.034
 understanding-0.033
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
 badly+0.004
mmm+0.004
 dear+0.003
 importantly+0.003
 understanding+0.003
Neg logit contributions
 meets-0.005
 valleys-0.005
izons-0.005
sey-0.005
 forests-0.005
Token<|endoftext|>
Token ID50256
Feature activation+8.07e-02

Pos logit contributions
 meets+0.026
 valleys+0.026
izons+0.025
ators+0.024
 bridges+0.023
Neg logit contributions
mmm-0.025
 badly-0.023
 importantly-0.022
 dear-0.022
 understanding-0.020
TokenOnce
Token ID7454
Feature activation+2.50e-02

Pos logit contributions
iece+0.172
izons+0.169
ators+0.169
 bridges+0.165
 meets+0.164
Neg logit contributions
mmm-0.091
 badly-0.083
 importantly-0.079
S-0.077
From-0.072
Token upon
Token ID2402
Feature activation+1.53e-02

Pos logit contributions
 meets+0.035
izons+0.034
 valleys+0.034
ators+0.030
 forests+0.030
Neg logit contributions
 badly-0.055
mmm-0.054
 dear-0.051
 importantly-0.051
 understanding-0.048
Token a
Token ID257
Feature activation-5.36e-02

Pos logit contributions
 meets+0.027
izons+0.026
 valleys+0.026
ators+0.024
sey+0.023
Neg logit contributions
mmm-0.028
 badly-0.028
 dear-0.026
 importantly-0.025
 understanding-0.024
Token time
Token ID640
Feature activation+1.66e-02

Pos logit contributions
mmm+0.134
 badly+0.127
 importantly+0.124
 dear+0.122
 understanding+0.119
Neg logit contributions
 meets-0.056
 finds-0.043
 valleys-0.043
 bottom-0.042
izons-0.042
Token,
Token ID11
Feature activation+2.36e-02

Pos logit contributions
 meets+0.033
 valleys+0.031
izons+0.031
 forests+0.029
 ease+0.029
Neg logit contributions
mmm-0.026
 badly-0.025
 importantly-0.024
 dear-0.023
 understanding-0.022
Token the
Token ID262
Feature activation+3.51e-02

Pos logit contributions
 meets+0.068
izons+0.067
iece+0.062
rays+0.060
ators+0.060
Neg logit contributions
 badly-0.045
mmm-0.043
 understanding-0.043
 inviting-0.042
 importantly-0.042
Token park
Token ID3952
Feature activation-1.04e-03

Pos logit contributions
 meets+0.045
izons+0.044
iece+0.042
rays+0.042
ators+0.041
Neg logit contributions
 adorable-0.069
 sweet-0.069
 inviting-0.069
ww-0.067
 badly-0.067
Token.
Token ID13
Feature activation+3.49e-02

Pos logit contributions
mmm+0.002
 badly+0.002
 dear+0.001
 importantly+0.001
 understanding+0.001
Neg logit contributions
 meets-0.002
 valleys-0.002
izons-0.002
 forests-0.002
ators-0.002
Token They
Token ID1119
Feature activation+1.26e-02

Pos logit contributions
 meets+0.070
izons+0.067
 valleys+0.067
ators+0.064
 ease+0.062
Neg logit contributions
mmm-0.056
 badly-0.053
 dear-0.050
 importantly-0.050
 understanding-0.046
Token liked
Token ID8288
Feature activation+1.37e-02

Pos logit contributions
izons+0.026
 meets+0.025
ators+0.024
 valleys+0.024
iece+0.023
Neg logit contributions
 badly-0.018
mmm-0.018
 dear-0.017
 understanding-0.017
 importantly-0.017
Token to
Token ID284
Feature activation+5.16e-02

Pos logit contributions
izons+0.029
 meets+0.028
sey+0.026
iece+0.026
ators+0.026
Neg logit contributions
 badly-0.021
 caring-0.019
 understanding-0.019
 dear-0.019
mmm-0.018
Token find
Token ID1064
Feature activation+4.53e-02

Pos logit contributions
 meets+0.090
izons+0.088
 valleys+0.087
rays+0.083
ators+0.083
Neg logit contributions
mmm-0.087
 badly-0.086
 understanding-0.081
 dear-0.080
 persuasive-0.079
Token sticks
Token ID16461
Feature activation+1.47e-02

Pos logit contributions
 meets+0.083
izons+0.081
 valleys+0.075
sey+0.073
ators+0.073
Neg logit contributions
 badly-0.079
 dear-0.077
 sweet-0.073
 understanding-0.073
 importantly-0.072
Token and
Token ID290
Feature activation+7.77e-03

Pos logit contributions
 meets+0.036
izons+0.036
 valleys+0.034
ators+0.033
ASH+0.033
Neg logit contributions
 badly-0.018
mmm-0.016
 dear-0.014
 importantly-0.014
 caring-0.013
Token pretend
Token ID16614
Feature activation+1.00e-02

Pos logit contributions
izons+0.011
 meets+0.011
ators+0.010
ips+0.010
neath+0.009
Neg logit contributions
 badly-0.016
mmm-0.016
 dear-0.015
 caring-0.015
 sweet-0.015
Token they
Token ID484
Feature activation+5.78e-02

Pos logit contributions
 meets+0.019
izons+0.019
 valleys+0.018
sey+0.017
ators+0.017
Neg logit contributions
 badly-0.017
mmm-0.017
 importantly-0.015
 dear-0.015
 caring-0.014
Token let
Token ID1309
Feature activation+4.33e-02

Pos logit contributions
mmm+0.025
 badly+0.024
 dear+0.023
 importantly+0.023
 understanding+0.022
Neg logit contributions
 meets-0.023
izons-0.022
 valleys-0.022
sey-0.020
 ease-0.020
Tokenâ
Token ID22940
Feature activation-6.92e-03

Pos logit contributions
 valleys+0.067
 meets+0.065
izons+0.062
 forests+0.060
ators+0.060
Neg logit contributions
mmm-0.087
 badly-0.085
 dear-0.082
ww-0.079
ittens-0.076
Token€
Token ID26391
Feature activation-3.07e-03

Pos logit contributions
 badly+0.014
mmm+0.014
 importantly+0.014
 dear+0.013
 understanding+0.012
Neg logit contributions
 meets-0.011
izons-0.010
 valleys-0.010
iece-0.009
sey-0.009
Tokenâ„¢
Token ID8151
Feature activation+1.98e-02

Pos logit contributions
mmm+0.006
 badly+0.006
 importantly+0.006
 dear+0.006
 understanding+0.005
Neg logit contributions
 meets-0.005
 valleys-0.005
izons-0.005
 forests-0.004
ators-0.004
Tokens
Token ID82
Feature activation+1.48e-03

Pos logit contributions
 meets+0.034
izons+0.032
 valleys+0.032
 ease+0.029
sey+0.029
Neg logit contributions
mmm-0.038
 badly-0.036
 dear-0.034
 importantly-0.034
 understanding-0.032
Token bury
Token ID30033
Feature activation+5.23e-02

Pos logit contributions
 valleys+0.003
 meets+0.003
izons+0.002
 forests+0.002
 mountains+0.002
Neg logit contributions
mmm-0.003
 badly-0.003
 dear-0.003
 understanding-0.002
 importantly-0.002
Token a
Token ID257
Feature activation+2.42e-04

Pos logit contributions
 meets+0.092
izons+0.088
 valleys+0.086
 driveway+0.080
iece+0.079
Neg logit contributions
mmm-0.099
 badly-0.098
 dear-0.092
 importantly-0.085
ww-0.084
Token treasure
Token ID14068
Feature activation+2.39e-02

Pos logit contributions
 meets+0.000
izons+0.000
 valleys+0.000
ators+0.000
iece+0.000
Neg logit contributions
mmm-0.000
 badly-0.000
 dear-0.000
 importantly-0.000
 understanding-0.000
Token box
Token ID3091
Feature activation-7.51e-03

Pos logit contributions
 meets+0.039
izons+0.038
 valleys+0.037
 ease+0.035
 forests+0.033
Neg logit contributions
 badly-0.046
mmm-0.046
 dear-0.043
 importantly-0.041
ww-0.040
Token!
Token ID0
Feature activation+1.42e-02

Pos logit contributions
mmm+0.013
 badly+0.013
 dear+0.012
 importantly+0.011
 understanding+0.011
Neg logit contributions
 meets-0.014
izons-0.014
 valleys-0.014
 forests-0.013
 ease-0.013
Tokenâ
Token ID22940
Feature activation-1.26e-02

Pos logit contributions
 meets+0.023
 valleys+0.023
izons+0.022
 mountains+0.021
 forests+0.020
Neg logit contributions
mmm-0.027
 badly-0.027
 importantly-0.025
 dear-0.024
 understanding-0.024
Token Tim
Token ID5045
Feature activation-2.44e-03

Pos logit contributions
 meets+0.011
 valleys+0.010
izons+0.010
 ease+0.009
 forests+0.009
Neg logit contributions
mmm-0.010
 badly-0.009
 dear-0.009
 importantly-0.009
 persuasive-0.008
Tokenmy
Token ID1820
Feature activation-1.47e-02

Pos logit contributions
mmm+0.006
 badly+0.006
 dear+0.005
 importantly+0.005
 understanding+0.005
Neg logit contributions
 valleys-0.003
 meets-0.003
izons-0.003
 mountains-0.002
 bridges-0.002
Token realized
Token ID6939
Feature activation-2.16e-02

Pos logit contributions
mmm+0.029
 badly+0.028
 dear+0.026
 importantly+0.026
 understanding+0.025
Neg logit contributions
 meets-0.024
izons-0.023
 valleys-0.023
ators-0.021
 forests-0.021
Token that
Token ID326
Feature activation+1.19e-02

Pos logit contributions
mmm+0.031
 badly+0.028
 dear+0.027
 importantly+0.026
 persuasive+0.025
Neg logit contributions
 meets-0.046
 valleys-0.045
izons-0.044
 mountains-0.042
 forests-0.042
Token sometimes
Token ID3360
Feature activation+9.87e-03

Pos logit contributions
 meets+0.025
izons+0.025
 valleys+0.024
ators+0.023
ASH+0.023
Neg logit contributions
 badly-0.018
mmm-0.017
 importantly-0.017
 caring-0.016
 understanding-0.016
Token it
Token ID340
Feature activation+4.87e-02

Pos logit contributions
 meets+0.020
izons+0.020
 valleys+0.020
ators+0.019
sey+0.019
Neg logit contributions
 badly-0.015
mmm-0.015
 importantly-0.014
 dear-0.014
 understanding-0.013
Token's
Token ID338
Feature activation-1.01e-02

Pos logit contributions
 valleys+0.096
izons+0.091
iece+0.090
 driveway+0.089
 forests+0.087
Neg logit contributions
 badly-0.079
 dear-0.069
 annoying-0.069
 important-0.068
mmm-0.068
Token fun
Token ID1257
Feature activation-8.91e-03

Pos logit contributions
 badly+0.011
mmm+0.011
 importantly+0.009
 dear+0.009
 caring+0.008
Neg logit contributions
 meets-0.026
 valleys-0.025
izons-0.025
 forests-0.024
iece-0.023
Token to
Token ID284
Feature activation+2.94e-02

Pos logit contributions
 badly+0.015
mmm+0.013
 importantly+0.013
 dear+0.013
 enjoyable+0.013
Neg logit contributions
izons-0.018
 valleys-0.017
 meets-0.016
ators-0.016
iece-0.016
Token hide
Token ID7808
Feature activation+3.90e-03

Pos logit contributions
 valleys+0.058
ators+0.056
 meets+0.056
ips+0.055
rays+0.054
Neg logit contributions
 badly-0.047
mmm-0.047
 understanding-0.043
 caring-0.042
 dear-0.041
Token,
Token ID11
Feature activation-1.88e-02

Pos logit contributions
 meets+0.009
izons+0.009
 valleys+0.008
 ease+0.008
 driveway+0.008
Neg logit contributions
 badly-0.006
mmm-0.005
 dear-0.005
 importantly-0.005
 understanding-0.004
Token it
Token ID340
Feature activation-8.13e-03

Pos logit contributions
 meets+0.031
izons+0.030
 valleys+0.028
ators+0.028
 driveway+0.028
Neg logit contributions
mmm-0.024
 badly-0.024
 dear-0.023
 sweet-0.022
 understanding-0.021
Token.
Token ID13
Feature activation+3.55e-03

Pos logit contributions
mmm+0.012
 badly+0.011
 dear+0.010
 importantly+0.010
 understanding+0.010
Neg logit contributions
 meets-0.018
 valleys-0.017
izons-0.017
 forests-0.016
ators-0.016
Token
Token ID198
Feature activation-1.43e-02

Pos logit contributions
 meets+0.007
 valleys+0.006
izons+0.006
 forests+0.006
sey+0.006
Neg logit contributions
mmm-0.007
 badly-0.006
 importantly-0.006
 dear-0.006
 understanding-0.006
Token
Token ID198
Feature activation-3.40e-02

Pos logit contributions
mmm+0.028
 badly+0.027
 dear+0.026
 importantly+0.025
 understanding+0.024
Neg logit contributions
 meets-0.024
 valleys-0.023
izons-0.022
 forests-0.021
 ease-0.020
TokenHe
Token ID1544
Feature activation-6.29e-04

Pos logit contributions
mmm+0.062
 dear+0.060
 badly+0.056
 importantly+0.056
 caring+0.052
Neg logit contributions
 meets-0.059
 valleys-0.058
izons-0.056
iece-0.055
Ready-0.053
Token takes
Token ID2753
Feature activation+5.12e-02

Pos logit contributions
mmm+0.001
 badly+0.001
 dear+0.001
 persuasive+0.001
 annoying+0.001
Neg logit contributions
izons-0.001
 valleys-0.001
 ease-0.001
 meets-0.001
ators-0.001
Token the
Token ID262
Feature activation+4.90e-02

Pos logit contributions
izons+0.119
 driveway+0.117
 meets+0.117
ric+0.115
ators+0.115
Neg logit contributions
 sweet-0.056
 badly-0.054
mmm-0.052
 loud-0.049
 dear-0.048
Token carrot
Token ID40562
Feature activation+1.97e-02

Pos logit contributions
 meets+0.098
izons+0.095
 valleys+0.094
ators+0.088
 forests+0.086
Neg logit contributions
mmm-0.081
 badly-0.075
 dear-0.074
 importantly-0.070
 understanding-0.067
Token and
Token ID290
Feature activation+1.74e-02

Pos logit contributions
 meets+0.042
izons+0.040
 valleys+0.040
ators+0.037
 parks+0.037
Neg logit contributions
mmm-0.030
 badly-0.029
 dear-0.026
 importantly-0.025
 understanding-0.025
Token goes
Token ID2925
Feature activation+2.96e-02

Pos logit contributions
izons+0.029
 meets+0.027
 valleys+0.026
ators+0.026
iece+0.024
Neg logit contributions
mmm-0.033
 badly-0.030
 dear-0.030
 understanding-0.028
 sweet-0.028
Token to
Token ID284
Feature activation+3.38e-02

Pos logit contributions
izons+0.070
 valleys+0.065
ators+0.063
ains+0.060
 meets+0.060
Neg logit contributions
 badly-0.033
 understanding-0.032
 because-0.032
 sweet-0.031
 obedient-0.030
Token On
Token ID1550
Feature activation+4.10e-02

Pos logit contributions
 meets+0.076
 valleys+0.075
izons+0.072
 forests+0.067
 mountains+0.067
Neg logit contributions
 badly-0.106
mmm-0.105
 importantly-0.098
 dear-0.097
 caring-0.094
Token this
Token ID428
Feature activation+3.59e-02

Pos logit contributions
 meets+0.061
 valleys+0.058
izons+0.058
 forests+0.052
ators+0.052
Neg logit contributions
mmm-0.087
 badly-0.084
 importantly-0.080
 dear-0.080
 understanding-0.076
Token farm
Token ID5318
Feature activation+3.29e-02

Pos logit contributions
 meets+0.045
 valleys+0.041
izons+0.040
 mountains+0.038
 forests+0.038
Neg logit contributions
mmm-0.086
 badly-0.086
 importantly-0.083
 dear-0.079
 understanding-0.078
Token lived
Token ID5615
Feature activation+5.12e-02

Pos logit contributions
 meets+0.077
 valleys+0.077
izons+0.077
 ease+0.074
sey+0.073
Neg logit contributions
 badly-0.039
mmm-0.038
 dear-0.035
 importantly-0.034
 understanding-0.031
Token a
Token ID257
Feature activation-4.17e-02

Pos logit contributions
 meets+0.088
izons+0.084
 valleys+0.084
ators+0.077
sey+0.077
Neg logit contributions
mmm-0.096
 badly-0.093
 importantly-0.087
 dear-0.087
 understanding-0.082
Token happy
Token ID3772
Feature activation+5.74e-03

Pos logit contributions
mmm+0.085
 badly+0.077
 importantly+0.072
ittens+0.068
 understanding+0.066
Neg logit contributions
 meets-0.068
 mountains-0.065
 hall-0.064
 valleys-0.064
 woods-0.063
Token hen
Token ID30963
Feature activation-3.45e-03

Pos logit contributions
 meets+0.010
 valleys+0.009
izons+0.009
 mountains+0.009
 forests+0.009
Neg logit contributions
mmm-0.011
 badly-0.011
 importantly-0.010
 dear-0.009
 persuasive-0.009
Token.
Token ID13
Feature activation+4.02e-02

Pos logit contributions
mmm+0.005
 badly+0.005
 dear+0.004
 importantly+0.004
 understanding+0.004
Neg logit contributions
 meets-0.007
 valleys-0.007
izons-0.007
 forests-0.007
 ease-0.007
Token The
Token ID383
Feature activation+2.55e-02

Pos logit contributions
 meets+0.060
 valleys+0.059
izons+0.057
 forests+0.053
 mountains+0.053
Neg logit contributions
 badly-0.083
mmm-0.083
 importantly-0.077
 dear-0.076
 caring-0.074
Token hen
Token ID30963
Feature activation+1.71e-02

Pos logit contributions
 meets+0.046
 valleys+0.044
 mountains+0.043
 forests+0.042
izons+0.041
Neg logit contributions
mmm-0.047
 badly-0.045
 importantly-0.042
 persuasive-0.038
 understanding-0.037
Token had
Token ID550
Feature activation+2.66e-02

Pos logit contributions
izons+0.030
 meets+0.030
 valleys+0.030
 ease+0.027
ators+0.027
Neg logit contributions
mmm-0.031
 badly-0.030
 dear-0.029
 importantly-0.028
 understanding-0.027
TokenB
Token ID33
Feature activation-1.80e-02

Pos logit contributions
mmm+0.073
 badly+0.072
 dear+0.070
 enjoyable+0.062
 importantly+0.062
Neg logit contributions
 valleys-0.077
 meets-0.076
 forests-0.071
 bridges-0.070
izons-0.070
Tokenella
Token ID12627
Feature activation-6.17e-03

Pos logit contributions
mmm+0.033
 badly+0.032
 dear+0.030
 importantly+0.029
ww+0.028
Neg logit contributions
 meets-0.033
 valleys-0.031
izons-0.031
 forests-0.029
 mountains-0.029
Token closed
Token ID4838
Feature activation+1.17e-02

Pos logit contributions
mmm+0.011
 badly+0.010
 dear+0.010
 importantly+0.010
 understanding+0.009
Neg logit contributions
 meets-0.011
 valleys-0.011
izons-0.011
ators-0.010
 ease-0.010
Token her
Token ID607
Feature activation+2.96e-02

Pos logit contributions
 meets+0.022
izons+0.020
 valleys+0.020
ators+0.018
 forests+0.018
Neg logit contributions
 badly-0.021
mmm-0.021
 dear-0.019
 importantly-0.019
 understanding-0.018
Token eyes
Token ID2951
Feature activation-1.78e-02

Pos logit contributions
izons+0.062
 meets+0.062
 valleys+0.060
ators+0.057
 ease+0.057
Neg logit contributions
 badly-0.043
 dear-0.042
mmm-0.042
 importantly-0.038
 understanding-0.037
Token and
Token ID290
Feature activation+4.65e-03

Pos logit contributions
mmm+0.026
 badly+0.025
 dear+0.023
 importantly+0.023
 understanding+0.021
Neg logit contributions
 meets-0.038
 valleys-0.037
izons-0.037
 forests-0.034
ators-0.034
Token sn
Token ID3013
Feature activation+8.61e-03

Pos logit contributions
 meets+0.009
izons+0.009
 valleys+0.009
ators+0.009
 bridges+0.008
Neg logit contributions
mmm-0.007
 badly-0.007
 dear-0.007
 importantly-0.006
 understanding-0.006
Tokenugg
Token ID6837
Feature activation-2.05e-03

Pos logit contributions
 meets+0.017
 valleys+0.017
izons+0.016
 bridges+0.016
 ease+0.015
Neg logit contributions
mmm-0.015
 badly-0.015
ittens-0.013
 dear-0.013
 importantly-0.012
Tokenled
Token ID992
Feature activation-1.06e-02

Pos logit contributions
 badly+0.003
mmm+0.003
 dear+0.003
 importantly+0.003
ittens+0.002
Neg logit contributions
 meets-0.005
izons-0.004
 valleys-0.004
 ease-0.004
 driveway-0.004
Token with
Token ID351
Feature activation+8.02e-03

Pos logit contributions
 badly+0.020
mmm+0.018
 understanding+0.018
 dear+0.017
 importantly+0.017
Neg logit contributions
 meets-0.019
izons-0.019
 valleys-0.017
ators-0.017
 driveway-0.016
Token her
Token ID607
Feature activation+1.17e-02

Pos logit contributions
 meets+0.017
izons+0.017
ators+0.016
 valleys+0.016
 driveway+0.015
Neg logit contributions
 badly-0.011
 dear-0.011
 importantly-0.011
 understanding-0.011
mmm-0.011
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
mmm+0.013
 badly+0.013
 dear+0.012
 importantly+0.012
 understanding+0.011
Neg logit contributions
 meets-0.022
 valleys-0.022
izons-0.021
ators-0.020
 forests-0.020
Token Suddenly
Token ID24975
Feature activation+5.97e-03

Pos logit contributions
 meets+0.047
ators+0.046
izons+0.046
 valleys+0.044
 bridges+0.043
Neg logit contributions
mmm-0.032
 dear-0.028
 badly-0.027
 importantly-0.026
From-0.024
Token the
Token ID262
Feature activation+2.04e-02

Pos logit contributions
 meets+0.015
izons+0.015
ators+0.014
 valleys+0.014
 driveway+0.014
Neg logit contributions
 badly-0.007
mmm-0.007
 dear-0.006
 importantly-0.005
 understanding-0.005
Token pictures
Token ID5986
Feature activation+1.65e-02

Pos logit contributions
 meets+0.036
izons+0.035
 valleys+0.035
ators+0.033
neath+0.032
Neg logit contributions
 dear-0.037
 badly-0.036
mmm-0.036
 importantly-0.035
 understanding-0.034
Token in
Token ID287
Feature activation-4.70e-03

Pos logit contributions
 meets+0.030
izons+0.029
 valleys+0.027
ASH+0.026
sey+0.026
Neg logit contributions
 badly-0.030
mmm-0.028
 importantly-0.027
 dear-0.026
 understanding-0.026
Token the
Token ID262
Feature activation+3.65e-02

Pos logit contributions
 badly+0.008
mmm+0.008
 dear+0.008
 importantly+0.008
 understanding+0.008
Neg logit contributions
 meets-0.009
izons-0.008
iece-0.008
 valleys-0.008
ips-0.007
Token book
Token ID1492
Feature activation+1.53e-02

Pos logit contributions
izons+0.053
ips+0.052
 meets+0.050
ators+0.049
 valleys+0.049
Neg logit contributions
 dear-0.074
mmm-0.069
 badly-0.069
 sweet-0.068
 importantly-0.067
Token started
Token ID2067
Feature activation-5.83e-03

Pos logit contributions
 meets+0.028
izons+0.027
 valleys+0.026
ators+0.025
 ease+0.025
Neg logit contributions
mmm-0.027
 badly-0.027
 dear-0.025
 importantly-0.024
ww-0.023
Token to
Token ID284
Feature activation+1.77e-02

Pos logit contributions
 badly+0.010
mmm+0.009
 dear+0.009
 understanding+0.009
 importantly+0.009
Neg logit contributions
 meets-0.011
izons-0.011
 valleys-0.010
ators-0.010
iece-0.009
Token come
Token ID1282
Feature activation+1.99e-02

Pos logit contributions
 meets+0.034
izons+0.031
ips+0.031
 valleys+0.030
ators+0.029
Neg logit contributions
mmm-0.030
 badly-0.029
 understanding-0.028
 annoying-0.027
 dear-0.027
Token alive
Token ID6776
Feature activation-7.90e-03

Pos logit contributions
 meets+0.036
izons+0.034
 valleys+0.033
ators+0.031
ips+0.031
Neg logit contributions
 badly-0.037
mmm-0.035
 dear-0.032
 understanding-0.031
 caring-0.030
Token to
Token ID284
Feature activation+3.79e-02

Pos logit contributions
 meets+0.013
izons+0.013
 valleys+0.013
iece+0.012
ators+0.012
Neg logit contributions
mmm-0.017
 badly-0.016
 dear-0.015
 importantly-0.015
 caring-0.015
Token travel
Token ID3067
Feature activation+2.04e-02

Pos logit contributions
 meets+0.047
 valleys+0.043
izons+0.043
 ease+0.038
 forests+0.038
Neg logit contributions
mmm-0.091
 badly-0.089
 importantly-0.085
 dear-0.084
 understanding-0.080
Token with
Token ID351
Feature activation+6.60e-03

Pos logit contributions
 meets+0.038
 valleys+0.036
izons+0.036
 forests+0.033
 mountains+0.033
Neg logit contributions
mmm-0.037
 badly-0.035
 dear-0.033
 importantly-0.033
 understanding-0.031
Token her
Token ID607
Feature activation+1.74e-02

Pos logit contributions
 meets+0.010
 valleys+0.009
izons+0.009
 forests+0.009
 mountains+0.008
Neg logit contributions
mmm-0.015
 badly-0.014
 importantly-0.013
 dear-0.013
 persuasive-0.013
Token family
Token ID1641
Feature activation-1.11e-02

Pos logit contributions
 meets+0.034
 valleys+0.032
izons+0.032
 forests+0.030
 mountains+0.030
Neg logit contributions
mmm-0.030
 badly-0.028
 importantly-0.026
 dear-0.026
 understanding-0.025
Token.
Token ID13
Feature activation+3.14e-02

Pos logit contributions
mmm+0.020
 badly+0.018
 dear+0.018
 importantly+0.017
ww+0.016
Neg logit contributions
 meets-0.021
 valleys-0.020
izons-0.020
 forests-0.019
sey-0.019
Token One
Token ID1881
Feature activation+9.24e-03

Pos logit contributions
 meets+0.068
 valleys+0.068
izons+0.066
iece+0.063
ators+0.062
Neg logit contributions
mmm-0.045
 dear-0.040
 badly-0.039
 importantly-0.038
ittens-0.036
Token day
Token ID1110
Feature activation-3.12e-03

Pos logit contributions
 meets+0.009
 valleys+0.008
 hall+0.008
izons+0.007
 mountains+0.007
Neg logit contributions
mmm-0.025
 badly-0.024
 importantly-0.023
 dear-0.022
 persuasive-0.022
Token,
Token ID11
Feature activation+1.47e-03

Pos logit contributions
mmm+0.006
 badly+0.005
 enjoyable+0.005
 persuasive+0.005
 importantly+0.005
Neg logit contributions
 meets-0.005
izons-0.005
 valleys-0.005
 bottom-0.005
 ones-0.004
Token they
Token ID484
Feature activation+8.63e-03

Pos logit contributions
 meets+0.002
izons+0.002
 valleys+0.002
 bridges+0.002
 ones+0.002
Neg logit contributions
mmm-0.003
 badly-0.003
 importantly-0.003
 persuasive-0.002
 dear-0.002
Token decided
Token ID3066
Feature activation-4.11e-03

Pos logit contributions
izons+0.014
 meets+0.014
 valleys+0.014
sey+0.013
iece+0.013
Neg logit contributions
 badly-0.015
mmm-0.015
 importantly-0.014
 dear-0.014
 understanding-0.014
Token everyone
Token ID2506
Feature activation-2.83e-02

Pos logit contributions
 badly+0.000
mmm+0.000
 importantly+0.000
 understanding+0.000
 dear+0.000
Neg logit contributions
 meets-0.000
 valleys-0.000
izons-0.000
 driveway-0.000
ware-0.000
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
mmm+0.044
 badly+0.043
 importantly+0.040
 dear+0.039
 understanding+0.037
Neg logit contributions
 meets-0.058
izons-0.056
 valleys-0.056
 forests-0.052
ators-0.052
Token<|endoftext|>
Token ID50256
Feature activation+7.52e-02

Pos logit contributions
 meets+0.022
 valleys+0.022
izons+0.022
ators+0.020
 forests+0.020
Neg logit contributions
mmm-0.017
 badly-0.016
 importantly-0.015
 dear-0.015
 understanding-0.014
TokenSam
Token ID16305
Feature activation+2.28e-02

Pos logit contributions
iece+0.178
izons+0.177
ators+0.175
ware+0.171
 meets+0.170
Neg logit contributions
mmm-0.064
 importantly-0.060
 badly-0.058
S-0.054
From-0.053
Token and
Token ID290
Feature activation-1.19e-02

Pos logit contributions
 valleys+0.038
izons+0.038
 meets+0.037
iece+0.035
 bridges+0.035
Neg logit contributions
mmm-0.041
 badly-0.039
 dear-0.038
 understanding-0.035
ww-0.035
Token his
Token ID465
Feature activation+3.87e-03

Pos logit contributions
mmm+0.021
 badly+0.020
 dear+0.019
 importantly+0.019
 understanding+0.018
Neg logit contributions
 meets-0.022
 valleys-0.021
izons-0.021
ators-0.019
 forests-0.019
Token mom
Token ID1995
Feature activation+8.39e-03

Pos logit contributions
 meets+0.008
 valleys+0.007
izons+0.007
 ease+0.007
 forests+0.007
Neg logit contributions
mmm-0.006
 badly-0.006
 importantly-0.006
 dear-0.005
 understanding-0.005
Token were
Token ID547
Feature activation-3.11e-02

Pos logit contributions
 valleys+0.014
izons+0.013
 meets+0.013
iece+0.012
 bridges+0.012
Neg logit contributions
mmm-0.016
 dear-0.015
 badly-0.015
 understanding-0.014
 importantly-0.014
Token walking
Token ID6155
Feature activation+3.94e-02

Pos logit contributions
 badly+0.055
mmm+0.054
 importantly+0.051
 dear+0.050
 understanding+0.049
Neg logit contributions
 valleys-0.056
 meets-0.056
izons-0.055
 forests-0.050
 ease-0.050
Token through
Token ID832
Feature activation+1.17e-02

Pos logit contributions
 meets+0.066
izons+0.065
 valleys+0.057
 ease+0.055
iece+0.054
Neg logit contributions
 badly-0.076
 understanding-0.069
mmm-0.066
 dear-0.064
 importantly-0.064
Token the
Token ID262
Feature activation+7.05e-03

Pos logit contributions
 meets+0.025
izons+0.025
ips+0.023
iece+0.023
robe+0.023
Neg logit contributions
 badly-0.017
 understanding-0.015
 importantly-0.015
 dear-0.014
mmm-0.014
Token sit
Token ID1650
Feature activation+1.46e-02

Pos logit contributions
 meets+0.020
 valleys+0.019
izons+0.018
 forests+0.018
ators+0.017
Neg logit contributions
 badly-0.023
mmm-0.023
 dear-0.021
 understanding-0.020
 importantly-0.019
Token down
Token ID866
Feature activation+1.05e-02

Pos logit contributions
 meets+0.019
izons+0.017
 valleys+0.017
 forests+0.015
 ease+0.015
Neg logit contributions
mmm-0.035
 badly-0.035
 dear-0.032
 importantly-0.032
 understanding-0.031
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
 meets+0.024
izons+0.023
 valleys+0.022
ators+0.021
 forests+0.021
Neg logit contributions
 badly-0.015
mmm-0.014
 importantly-0.012
 dear-0.012
 understanding-0.012
Token
Token ID220
Feature activation+3.57e-02

Pos logit contributions
 meets+0.034
izons+0.032
 valleys+0.032
ators+0.031
iece+0.030
Neg logit contributions
mmm-0.029
 badly-0.028
 dear-0.027
 importantly-0.025
 understanding-0.024
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
 meets+0.074
 valleys+0.073
izons+0.072
 forests+0.069
ators+0.068
Neg logit contributions
mmm-0.056
 badly-0.053
 dear-0.048
 importantly-0.047
ittens-0.045
Token
Token ID198
Feature activation-2.93e-02

Pos logit contributions
mmm+0.019
 badly+0.018
 importantly+0.017
 dear+0.017
 understanding+0.016
Neg logit contributions
 meets-0.019
izons-0.019
 valleys-0.018
ators-0.017
 forests-0.017
TokenHer
Token ID9360
Feature activation-4.62e-03

Pos logit contributions
mmm+0.045
 dear+0.042
 importantly+0.041
 badly+0.039
 enjoyable+0.039
Neg logit contributions
 valleys-0.057
 meets-0.057
izons-0.056
 forests-0.052
 parks-0.052
Token friends
Token ID2460
Feature activation-1.02e-02

Pos logit contributions
 badly+0.007
 dear+0.007
mmm+0.007
 importantly+0.006
 understanding+0.006
Neg logit contributions
 meets-0.010
 valleys-0.010
izons-0.009
ators-0.009
iece-0.009
Token saw
Token ID2497
Feature activation-4.44e-03

Pos logit contributions
 badly+0.020
mmm+0.019
 dear+0.017
 importantly+0.017
 understanding+0.017
Neg logit contributions
 meets-0.017
 valleys-0.017
izons-0.017
ators-0.015
 forests-0.015
Token her
Token ID607
Feature activation-1.24e-02

Pos logit contributions
 badly+0.007
mmm+0.007
 dear+0.007
 importantly+0.007
 understanding+0.007
Neg logit contributions
 meets-0.009
izons-0.008
 valleys-0.008
iece-0.008
sey-0.008
Token and
Token ID290
Feature activation-2.45e-02

Pos logit contributions
mmm+0.018
 badly+0.018
 dear+0.017
 importantly+0.016
 understanding+0.015
Neg logit contributions
 meets-0.026
 valleys-0.026
izons-0.025
 forests-0.024
ators-0.024
Token veil
Token ID30615
Feature activation+5.05e-04

Pos logit contributions
 meets+0.067
 valleys+0.063
izons+0.062
 ease+0.058
ators+0.058
Neg logit contributions
mmm-0.078
 badly-0.074
 dear-0.074
 importantly-0.068
 understanding-0.067
Token.
Token ID13
Feature activation+3.79e-03

Pos logit contributions
 meets+0.001
 valleys+0.001
izons+0.001
 forests+0.001
 ease+0.001
Neg logit contributions
mmm-0.001
 badly-0.001
 dear-0.001
 importantly-0.001
 understanding-0.001
Token She
Token ID1375
Feature activation+3.55e-03

Pos logit contributions
 meets+0.007
izons+0.007
 valleys+0.007
 parks+0.006
 mountains+0.006
Neg logit contributions
mmm-0.007
 badly-0.006
 importantly-0.006
 dear-0.006
ww-0.006
Token found
Token ID1043
Feature activation+3.65e-02

Pos logit contributions
 meets+0.007
 valleys+0.006
izons+0.006
 forests+0.006
 ease+0.006
Neg logit contributions
mmm-0.006
 badly-0.006
 dear-0.006
 importantly-0.005
 understanding-0.005
Token a
Token ID257
Feature activation-1.93e-02

Pos logit contributions
 meets+0.084
izons+0.080
 driveway+0.079
 valleys+0.077
ators+0.077
Neg logit contributions
 badly-0.052
 dear-0.051
 sweet-0.047
 understanding-0.046
 caring-0.045
Token pen
Token ID3112
Feature activation+4.18e-04

Pos logit contributions
 badly+0.032
mmm+0.031
 dear+0.030
 importantly+0.029
 understanding+0.028
Neg logit contributions
 meets-0.038
 valleys-0.037
izons-0.036
ators-0.034
iece-0.034
Token and
Token ID290
Feature activation+8.72e-03

Pos logit contributions
 meets+0.001
 valleys+0.001
izons+0.001
ators+0.001
 ease+0.001
Neg logit contributions
 badly-0.001
mmm-0.001
 dear-0.001
 importantly-0.001
 understanding-0.000
Token wrote
Token ID2630
Feature activation+1.47e-02

Pos logit contributions
 meets+0.016
izons+0.015
 valleys+0.015
ators+0.014
sey+0.014
Neg logit contributions
mmm-0.016
 badly-0.015
 dear-0.015
 importantly-0.014
 understanding-0.014
Token "
Token ID366
Feature activation+5.18e-02

Pos logit contributions
 meets+0.029
 valleys+0.028
izons+0.028
 driveway+0.027
sey+0.026
Neg logit contributions
mmm-0.024
 badly-0.024
 dear-0.023
 understanding-0.022
 importantly-0.021
TokenS
Token ID50
Feature activation-5.14e-03

Pos logit contributions
 meets+0.097
 valleys+0.093
izons+0.093
 bridges+0.086
sey+0.086
Neg logit contributions
mmm-0.092
 badly-0.088
ww-0.078
 dear-0.077
 importantly-0.076
Tokenue
Token ID518
Feature activation-9.19e-03

Pos logit contributions
mmm+0.007
 badly+0.007
 dear+0.007
 importantly+0.007
 understanding+0.006
Neg logit contributions
 meets-0.011
 valleys-0.011
izons-0.011
 forests-0.010
ators-0.010
Token it
Token ID340
Feature activation-3.34e-03

Pos logit contributions
 meets+0.054
izons+0.051
 valleys+0.051
ators+0.047
 forests+0.046
Neg logit contributions
mmm-0.063
 badly-0.062
 dear-0.058
 importantly-0.057
 understanding-0.054
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 badly+0.006
mmm+0.005
 dear+0.005
 importantly+0.005
 understanding+0.005
Neg logit contributions
 meets-0.006
izons-0.006
 valleys-0.006
ators-0.006
 forests-0.006
Token They
Token ID1119
Feature activation+7.55e-03

Pos logit contributions
 meets+0.022
izons+0.021
 valleys+0.021
ators+0.020
iece+0.019
Neg logit contributions
mmm-0.021
 badly-0.020
 dear-0.019
 importantly-0.019
 understanding-0.017
Token had
Token ID550
Feature activation+1.95e-02

Pos logit contributions
 meets+0.015
izons+0.014
 valleys+0.014
ators+0.013
ASH+0.013
Neg logit contributions
 badly-0.012
mmm-0.011
 dear-0.011
 understanding-0.011
 importantly-0.010
Token a
Token ID257
Feature activation-3.76e-02

Pos logit contributions
izons+0.042
 meets+0.042
 valleys+0.041
ators+0.039
iece+0.039
Neg logit contributions
 badly-0.028
mmm-0.026
 dear-0.024
 importantly-0.024
 understanding-0.023
Token lot
Token ID1256
Feature activation-2.57e-02

Pos logit contributions
mmm+0.064
 badly+0.059
 importantly+0.056
 dear+0.056
 understanding+0.052
Neg logit contributions
 meets-0.074
 valleys-0.071
izons-0.070
 forests-0.065
 ease-0.065
Token of
Token ID286
Feature activation-4.09e-04

Pos logit contributions
mmm+0.031
 badly+0.031
 importantly+0.028
 dear+0.027
 understanding+0.025
Neg logit contributions
 meets-0.062
izons-0.060
 valleys-0.059
ators-0.056
sey-0.056
Token fun
Token ID1257
Feature activation-1.94e-02

Pos logit contributions
mmm+0.000
 badly+0.000
 importantly+0.000
 dear+0.000
 understanding+0.000
Neg logit contributions
 meets-0.001
izons-0.001
 valleys-0.001
ators-0.001
 forests-0.001
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
mmm+0.035
 badly+0.033
 dear+0.032
 importantly+0.031
 understanding+0.029
Neg logit contributions
 meets-0.036
 valleys-0.034
izons-0.034
 forests-0.031
 mountains-0.031
Token
Token ID198
Feature activation-1.39e-02

Pos logit contributions
 meets+0.019
izons+0.018
 valleys+0.018
ators+0.017
 ease+0.017
Neg logit contributions
mmm-0.019
 badly-0.018
 dear-0.017
 importantly-0.017
 understanding-0.016
Token
Token ID198
Feature activation-3.51e-02

Pos logit contributions
mmm+0.026
 badly+0.025
 importantly+0.023
 dear+0.023
 understanding+0.022
Neg logit contributions
 meets-0.024
 valleys-0.023
izons-0.023
 forests-0.021
ators-0.021
Token'm
Token ID1101
Feature activation-3.00e-02

Pos logit contributions
 meets+0.043
 valleys+0.042
izons+0.042
 forests+0.039
ators+0.039
Neg logit contributions
mmm-0.049
 badly-0.047
 dear-0.045
 importantly-0.044
 understanding-0.042
Token going
Token ID1016
Feature activation+1.47e-02

Pos logit contributions
mmm+0.041
 badly+0.038
 dear+0.035
 importantly+0.035
 understanding+0.032
Neg logit contributions
 meets-0.068
 valleys-0.065
izons-0.065
 forests-0.061
 ease-0.061
Token to
Token ID284
Feature activation+2.95e-02

Pos logit contributions
izons+0.031
 valleys+0.029
iece+0.028
ators+0.028
sey+0.027
Neg logit contributions
 badly-0.022
mmm-0.021
 dear-0.020
 caring-0.019
 because-0.019
Token a
Token ID257
Feature activation-1.26e-02

Pos logit contributions
 valleys+0.051
izons+0.048
ators+0.048
iece+0.046
ips+0.046
Neg logit contributions
mmm-0.054
 badly-0.053
 dear-0.051
 understanding-0.051
 caring-0.050
Token costume
Token ID16569
Feature activation-1.93e-02

Pos logit contributions
mmm+0.024
 badly+0.023
 dear+0.022
 importantly+0.021
 caring+0.021
Neg logit contributions
 meets-0.021
 valleys-0.021
izons-0.021
sey-0.019
ators-0.019
Token party
Token ID2151
Feature activation-1.92e-02

Pos logit contributions
mmm+0.027
 badly+0.026
 dear+0.024
 importantly+0.023
 understanding+0.021
Neg logit contributions
 meets-0.043
 valleys-0.042
izons-0.041
 ease-0.039
 forests-0.039
Token later
Token ID1568
Feature activation-3.18e-02

Pos logit contributions
mmm+0.025
 badly+0.023
 dear+0.021
 importantly+0.021
 understanding+0.019
Neg logit contributions
 meets-0.045
 valleys-0.044
izons-0.043
 ease-0.041
 forests-0.041
Token."
Token ID526
Feature activation-1.17e-03

Pos logit contributions
mmm+0.042
 badly+0.039
 dear+0.036
 importantly+0.035
 understanding+0.032
Neg logit contributions
 meets-0.074
 valleys-0.072
izons-0.071
 forests-0.067
 ease-0.067
Token Lily
Token ID20037
Feature activation-1.58e-02

Pos logit contributions
mmm+0.002
 badly+0.002
 dear+0.002
 importantly+0.002
 understanding+0.002
Neg logit contributions
 meets-0.002
 valleys-0.002
izons-0.002
ators-0.002
 forests-0.002
Token listened
Token ID16399
Feature activation-4.55e-02

Pos logit contributions
mmm+0.031
 badly+0.030
 importantly+0.028
 dear+0.028
 painful+0.026
Neg logit contributions
 meets-0.027
izons-0.025
 valleys-0.025
sey-0.024
 forests-0.023
Token to
Token ID284
Feature activation+3.54e-03

Pos logit contributions
mmm+0.105
 importantly+0.100
 badly+0.097
 dear+0.095
ww+0.095
Neg logit contributions
 meets-0.057
 valleys-0.052
 asleep-0.050
sey-0.049
izons-0.049
Token boy
Token ID2933
Feature activation-2.18e-02

Pos logit contributions
 meets+0.007
izons+0.007
 valleys+0.006
 ease+0.006
ips+0.006
Neg logit contributions
 badly-0.010
 dear-0.010
 understanding-0.009
mmm-0.009
 importantly-0.009
Token who
Token ID508
Feature activation+3.03e-02

Pos logit contributions
 badly+0.035
mmm+0.034
 dear+0.032
 importantly+0.031
 understanding+0.031
Neg logit contributions
 meets-0.043
izons-0.042
 valleys-0.041
ators-0.039
 forests-0.038
Token owned
Token ID6898
Feature activation+4.87e-02

Pos logit contributions
izons+0.062
ASH+0.060
 meets+0.058
 valleys+0.058
iece+0.055
Neg logit contributions
 badly-0.047
 understanding-0.042
 importantly-0.041
 annoying-0.039
 dear-0.039
Token it
Token ID340
Feature activation+2.25e-04

Pos logit contributions
 meets+0.108
izons+0.107
iece+0.100
ips+0.099
pit+0.097
Neg logit contributions
 badly-0.068
 caring-0.058
 sweet-0.057
 understanding-0.057
 inviting-0.055
Token very
Token ID845
Feature activation-5.40e-02

Pos logit contributions
izons+0.001
 meets+0.001
 valleys+0.000
ASH+0.000
ators+0.000
Neg logit contributions
 badly-0.000
mmm-0.000
 caring-0.000
 enjoyable-0.000
 understanding-0.000
Token happy
Token ID3772
Feature activation-1.97e-02

Pos logit contributions
mmm+0.080
 importantly+0.066
 dear+0.065
 badly+0.065
ww+0.060
Neg logit contributions
 meets-0.120
 valleys-0.116
izons-0.109
 bridges-0.108
 ease-0.107
Token.
Token ID13
Feature activation+2.34e-02

Pos logit contributions
 badly+0.022
mmm+0.020
 importantly+0.019
 dear+0.019
 understanding+0.017
Neg logit contributions
 meets-0.049
izons-0.048
 valleys-0.047
neath-0.046
 ease-0.045
Token
Token ID198
Feature activation-9.18e-03

Pos logit contributions
izons+0.049
 meets+0.048
 valleys+0.047
iece+0.046
ators+0.046
Neg logit contributions
mmm-0.035
 badly-0.030
 importantly-0.030
 dear-0.030
From-0.028
Token
Token ID198
Feature activation-2.47e-02

Pos logit contributions
mmm+0.018
 badly+0.017
 dear+0.017
 importantly+0.016
 understanding+0.016
Neg logit contributions
 meets-0.015
 valleys-0.014
izons-0.014
 forests-0.013
 ease-0.013
TokenOne
Token ID3198
Feature activation-1.31e-02

Pos logit contributions
mmm+0.050
 dear+0.050
 badly+0.048
 importantly+0.047
 darling+0.044
Neg logit contributions
 meets-0.039
 valleys-0.039
izons-0.036
 forests-0.036
 parks-0.035
Token day
Token ID1110
Feature activation+4.83e-03

Pos logit contributions
mmm+0.036
 badly+0.032
 importantly+0.031
 dear+0.031
ittens+0.030
Neg logit contributions
 meets-0.012
 hall-0.011
 theater-0.011
 valleys-0.010
 nursery-0.010
Token problems
Token ID2761
Feature activation-2.10e-03

Pos logit contributions
mmm+0.027
 badly+0.026
 dear+0.025
 importantly+0.025
 understanding+0.023
Neg logit contributions
 meets-0.025
izons-0.024
 valleys-0.024
 forests-0.021
ators-0.021
Token.
Token ID13
Feature activation+4.07e-02

Pos logit contributions
 badly+0.003
mmm+0.003
 importantly+0.002
 dear+0.002
 understanding+0.002
Neg logit contributions
 meets-0.005
izons-0.005
 valleys-0.005
ASH-0.004
 ease-0.004
Token He
Token ID679
Feature activation+1.80e-02

Pos logit contributions
 meets+0.093
izons+0.089
 valleys+0.086
ators+0.085
iece+0.085
Neg logit contributions
mmm-0.056
 dear-0.048
 badly-0.048
 importantly-0.047
 understanding-0.043
Token was
Token ID373
Feature activation-1.85e-02

Pos logit contributions
izons+0.035
 meets+0.032
iece+0.032
 ease+0.032
ips+0.032
Neg logit contributions
 badly-0.028
mmm-0.027
 understanding-0.025
 caring-0.025
 importantly-0.025
Token feeling
Token ID4203
Feature activation-1.76e-02

Pos logit contributions
 badly+0.021
 understanding+0.019
mmm+0.019
 caring+0.018
 importantly+0.018
Neg logit contributions
 meets-0.046
izons-0.046
 valleys-0.044
ASH-0.043
ators-0.042
Token very
Token ID845
Feature activation-5.28e-02

Pos logit contributions
 badly+0.019
mmm+0.017
 dear+0.016
 importantly+0.016
 understanding+0.015
Neg logit contributions
 meets-0.045
izons-0.045
ators-0.043
 valleys-0.042
ASH-0.042
Token worried
Token ID7960
Feature activation-4.37e-02

Pos logit contributions
mmm+0.064
 badly+0.057
 importantly+0.052
 dear+0.052
ittens+0.048
Neg logit contributions
 meets-0.129
 valleys-0.125
izons-0.121
 forests-0.117
 mountains-0.116
Token and
Token ID290
Feature activation-2.39e-02

Pos logit contributions
mmm+0.052
 badly+0.050
 dear+0.045
 importantly+0.044
 understanding+0.040
Neg logit contributions
 meets-0.107
izons-0.104
 valleys-0.103
ators-0.097
 ease-0.097
Token did
Token ID750
Feature activation-1.38e-02

Pos logit contributions
 badly+0.032
mmm+0.029
 dear+0.027
 understanding+0.027
 importantly+0.026
Neg logit contributions
izons-0.054
 meets-0.054
 valleys-0.052
ators-0.051
ASH-0.050
Token not
Token ID407
Feature activation-1.45e-02

Pos logit contributions
 badly+0.013
mmm+0.010
 understanding+0.009
 dear+0.009
 caring+0.008
Neg logit contributions
izons-0.038
 meets-0.038
 valleys-0.038
ators-0.037
ips-0.037
Token know
Token ID760
Feature activation+1.40e-04

Pos logit contributions
mmm+0.019
 badly+0.019
 dear+0.017
 importantly+0.016
 understanding+0.016
Neg logit contributions
 meets-0.033
 valleys-0.032
izons-0.032
ators-0.030
 forests-0.030
Token things
Token ID1243
Feature activation-2.59e-02

Pos logit contributions
 meets+0.019
 valleys+0.019
izons+0.019
 forests+0.017
ators+0.017
Neg logit contributions
mmm-0.017
 badly-0.016
 dear-0.015
 importantly-0.015
 understanding-0.014
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
mmm+0.049
 badly+0.045
 dear+0.044
 importantly+0.044
ww+0.041
Neg logit contributions
 meets-0.045
 valleys-0.042
izons-0.041
ators-0.039
 mountains-0.039
Token<|endoftext|>
Token ID50256
Feature activation+6.66e-02

Pos logit contributions
 meets+0.022
 valleys+0.021
izons+0.021
 forests+0.019
 ease+0.019
Neg logit contributions
mmm-0.025
 badly-0.024
 dear-0.023
 importantly-0.023
 understanding-0.022
TokenOnce
Token ID7454
Feature activation+1.14e-02

Pos logit contributions
izons+0.157
iece+0.155
 meets+0.154
 valleys+0.154
ators+0.152
Neg logit contributions
 badly-0.068
mmm-0.065
 importantly-0.056
From-0.054
 dear-0.049
Token upon
Token ID2402
Feature activation+1.88e-02

Pos logit contributions
 meets+0.014
 valleys+0.013
izons+0.013
 forests+0.011
ators+0.011
Neg logit contributions
mmm-0.027
 badly-0.027
 dear-0.025
 importantly-0.025
 understanding-0.024
Token a
Token ID257
Feature activation-4.92e-02

Pos logit contributions
 meets+0.032
 valleys+0.030
izons+0.030
ators+0.028
 forests+0.027
Neg logit contributions
mmm-0.036
 badly-0.035
 dear-0.033
 importantly-0.033
 understanding-0.031
Token time
Token ID640
Feature activation+1.82e-02

Pos logit contributions
mmm+0.122
 badly+0.113
 importantly+0.112
 dear+0.109
 understanding+0.108
Neg logit contributions
 meets-0.056
 finds-0.044
izons-0.043
 ease-0.043
 valleys-0.043
Token,
Token ID11
Feature activation+1.62e-02

Pos logit contributions
 meets+0.035
 valleys+0.034
izons+0.033
 forests+0.031
 ease+0.031
Neg logit contributions
mmm-0.030
 badly-0.028
 importantly-0.027
 dear-0.027
 understanding-0.025
Token there
Token ID612
Feature activation+6.26e-02

Pos logit contributions
 meets+0.026
izons+0.025
 valleys+0.025
 forests+0.022
 ease+0.022
Neg logit contributions
mmm-0.033
 badly-0.032
 importantly-0.030
 dear-0.030
 understanding-0.028
Token was
Token ID373
Feature activation-8.70e-03

Pos logit contributions
 valleys+0.114
 meets+0.112
izons+0.110
ators+0.106
iece+0.106
Neg logit contributions
 badly-0.110
mmm-0.101
 dear-0.095
 importantly-0.093
 sweet-0.088
Token a
Token ID257
Feature activation-5.39e-02

Pos logit contributions
mmm+0.020
 badly+0.019
 dear+0.018
 importantly+0.018
 persuasive+0.018
Neg logit contributions
 meets-0.011
 valleys-0.011
izons-0.010
 ease-0.010
 mountains-0.010
Token play
Token ID711
Feature activation+3.64e-02

Pos logit contributions
 meets+0.025
 valleys+0.024
izons+0.023
rays+0.023
 mountains+0.023
Neg logit contributions
 badly-0.022
mmm-0.021
 understanding-0.020
 caring-0.020
 annoying-0.020
Token,
Token ID11
Feature activation-9.42e-03

Pos logit contributions
 meets+0.075
 valleys+0.072
izons+0.072
sey+0.067
 driveway+0.067
Neg logit contributions
 badly-0.056
mmm-0.052
 dear-0.047
 understanding-0.047
 importantly-0.047
Token but
Token ID475
Feature activation-2.51e-03

Pos logit contributions
 badly+0.009
 understanding+0.009
mmm+0.008
 caring+0.008
 annoying+0.008
Neg logit contributions
izons-0.024
 meets-0.023
ators-0.023
 valleys-0.023
neath-0.022
Token Tom
Token ID4186
Feature activation-5.11e-03

Pos logit contributions
 badly+0.004
mmm+0.004
 dear+0.003
 importantly+0.003
 understanding+0.003
Neg logit contributions
 meets-0.006
izons-0.005
 valleys-0.005
sey-0.005
 forests-0.005
Token was
Token ID373
Feature activation-3.40e-02

Pos logit contributions
 badly+0.009
mmm+0.009
 dear+0.008
 importantly+0.008
 understanding+0.008
Neg logit contributions
 meets-0.009
 valleys-0.009
izons-0.009
 forests-0.008
ators-0.008
Token not
Token ID407
Feature activation-4.58e-02

Pos logit contributions
mmm+0.046
 badly+0.043
 dear+0.039
 importantly+0.039
 understanding+0.036
Neg logit contributions
 meets-0.078
 valleys-0.075
izons-0.075
 forests-0.070
ators-0.070
Token happy
Token ID3772
Feature activation-4.11e-02

Pos logit contributions
mmm+0.066
 badly+0.056
 dear+0.055
 importantly+0.055
ittens+0.051
Neg logit contributions
 meets-0.101
izons-0.096
 valleys-0.096
 bridges-0.091
 ease-0.091
Token.
Token ID13
Feature activation+1.67e-02

Pos logit contributions
mmm+0.068
 badly+0.063
 dear+0.060
 importantly+0.059
ww+0.055
Neg logit contributions
 meets-0.082
izons-0.078
 valleys-0.078
 forests-0.072
ators-0.072
Token He
Token ID679
Feature activation+6.14e-03

Pos logit contributions
 meets+0.032
 valleys+0.030
izons+0.030
 forests+0.028
ators+0.028
Neg logit contributions
mmm-0.029
 badly-0.028
 dear-0.026
 importantly-0.026
 understanding-0.024
Token wanted
Token ID2227
Feature activation+4.34e-03

Pos logit contributions
 meets+0.011
 valleys+0.011
izons+0.011
 forests+0.010
ators+0.010
Neg logit contributions
mmm-0.010
 badly-0.010
 dear-0.009
 importantly-0.009
 understanding-0.009
Token to
Token ID284
Feature activation+2.26e-02

Pos logit contributions
izons+0.010
 valleys+0.010
neath+0.009
 meets+0.009
iece+0.009
Neg logit contributions
 badly-0.006
 caring-0.006
 understanding-0.006
 sweet-0.005
 dear-0.005
Token Fl
Token ID1610
Feature activation-9.93e-04

Pos logit contributions
 meets+0.078
 valleys+0.075
izons+0.075
 forests+0.070
 ease+0.069
Neg logit contributions
mmm-0.064
 badly-0.061
 importantly-0.057
 dear-0.057
 understanding-0.053
Tokenopsy
Token ID44522
Feature activation-1.98e-02

Pos logit contributions
mmm+0.002
 badly+0.002
 dear+0.002
 importantly+0.002
 understanding+0.001
Neg logit contributions
 meets-0.002
 valleys-0.002
izons-0.002
 forests-0.002
sey-0.002
Token.
Token ID13
Feature activation+3.79e-02

Pos logit contributions
mmm+0.035
 badly+0.031
 dear+0.030
 importantly+0.030
ww+0.028
Neg logit contributions
 meets-0.038
 valleys-0.036
izons-0.035
sey-0.034
 forests-0.034
Token He
Token ID679
Feature activation+2.61e-02

Pos logit contributions
 meets+0.061
 valleys+0.060
izons+0.056
 mountains+0.054
 forests+0.054
Neg logit contributions
mmm-0.076
 badly-0.071
 importantly-0.069
 dear-0.068
 understanding-0.067
Token was
Token ID373
Feature activation-9.61e-03

Pos logit contributions
 meets+0.046
 valleys+0.044
izons+0.043
 forests+0.040
sey+0.040
Neg logit contributions
mmm-0.050
 badly-0.047
 dear-0.044
 importantly-0.044
 understanding-0.042
Token very
Token ID845
Feature activation-4.43e-02

Pos logit contributions
mmm+0.016
 badly+0.015
 dear+0.014
 importantly+0.014
 understanding+0.013
Neg logit contributions
 meets-0.019
 valleys-0.018
izons-0.018
 forests-0.017
sey-0.016
Token silly
Token ID14397
Feature activation-4.23e-02

Pos logit contributions
mmm+0.070
 badly+0.059
 importantly+0.054
 dear+0.053
From+0.051
Neg logit contributions
 meets-0.091
 valleys-0.082
 ease-0.079
sey-0.079
izons-0.078
Token and
Token ID290
Feature activation-9.36e-03

Pos logit contributions
mmm+0.073
 badly+0.059
 dear+0.056
ww+0.056
From+0.055
Neg logit contributions
 meets-0.082
 valleys-0.080
 mountains-0.078
sey-0.077
izons-0.077
Token always
Token ID1464
Feature activation+3.30e-03

Pos logit contributions
mmm+0.016
 badly+0.015
 importantly+0.014
 dear+0.014
 understanding+0.013
Neg logit contributions
 meets-0.018
 valleys-0.017
izons-0.017
sey-0.016
 ease-0.016
Token wanted
Token ID2227
Feature activation+2.68e-02

Pos logit contributions
 meets+0.007
izons+0.007
 valleys+0.007
ators+0.006
 forests+0.006
Neg logit contributions
 badly-0.005
mmm-0.005
 importantly-0.004
 dear-0.004
 understanding-0.004
Token to
Token ID284
Feature activation+4.51e-02

Pos logit contributions
izons+0.057
 meets+0.055
neath+0.054
ators+0.054
iece+0.054
Neg logit contributions
 badly-0.043
 caring-0.038
 dear-0.037
 understanding-0.037
 importantly-0.037
Token away
Token ID1497
Feature activation-1.33e-02

Pos logit contributions
 meets+0.066
 valleys+0.063
izons+0.060
ators+0.057
 ease+0.057
Neg logit contributions
 badly-0.061
 dear-0.055
mmm-0.055
 importantly-0.049
ww-0.048
Token and
Token ID290
Feature activation-1.11e-02

Pos logit contributions
 badly+0.018
mmm+0.018
 dear+0.016
 importantly+0.015
 understanding+0.015
Neg logit contributions
 meets-0.031
izons-0.029
 valleys-0.029
ators-0.027
 forests-0.027
Token she
Token ID673
Feature activation-2.78e-03

Pos logit contributions
 badly+0.016
 understanding+0.015
 dear+0.015
mmm+0.014
 caring+0.014
Neg logit contributions
 valleys-0.023
izons-0.023
 meets-0.023
ators-0.023
ips-0.022
Token was
Token ID373
Feature activation-3.19e-02

Pos logit contributions
 badly+0.005
mmm+0.004
 dear+0.004
 understanding+0.004
 importantly+0.004
Neg logit contributions
 valleys-0.005
izons-0.005
 meets-0.005
 bridges-0.005
ators-0.005
Token feeling
Token ID4203
Feature activation-2.88e-02

Pos logit contributions
 badly+0.041
mmm+0.037
 importantly+0.035
 understanding+0.034
 dear+0.034
Neg logit contributions
 meets-0.076
izons-0.074
 valleys-0.074
ators-0.070
 forests-0.069
Token very
Token ID845
Feature activation-5.77e-02

Pos logit contributions
 badly+0.034
mmm+0.029
 importantly+0.029
 dear+0.028
 understanding+0.028
Neg logit contributions
 meets-0.071
izons-0.071
 valleys-0.069
ators-0.069
ASH-0.067
Token sad
Token ID6507
Feature activation-5.64e-02

Pos logit contributions
mmm+0.074
 badly+0.061
 dear+0.058
 importantly+0.058
ittens+0.057
Neg logit contributions
 meets-0.139
 valleys-0.132
izons-0.130
 ease-0.124
 forests-0.123
Token.
Token ID13
Feature activation-1.99e-03

Pos logit contributions
mmm+0.089
 badly+0.082
 dear+0.077
 importantly+0.077
 understanding+0.071
Neg logit contributions
 meets-0.115
 valleys-0.111
izons-0.110
 forests-0.103
sey-0.103
Token She
Token ID1375
Feature activation-1.08e-02

Pos logit contributions
mmm+0.003
 badly+0.003
 dear+0.003
 importantly+0.003
 understanding+0.003
Neg logit contributions
 meets-0.004
 valleys-0.004
izons-0.004
ators-0.004
 bridges-0.004
Token decided
Token ID3066
Feature activation-1.53e-02

Pos logit contributions
mmm+0.018
 badly+0.017
 dear+0.016
 importantly+0.016
 understanding+0.015
Neg logit contributions
 meets-0.021
 valleys-0.020
izons-0.020
 forests-0.018
 ease-0.018
Token to
Token ID284
Feature activation+1.33e-02

Pos logit contributions
 badly+0.028
mmm+0.027
 dear+0.026
 understanding+0.025
 importantly+0.025
Neg logit contributions
 meets-0.027
 valleys-0.027
izons-0.027
ators-0.025
sey-0.025