TOP ACTIVATIONS
MAX = 0.096

" Wow ! Look how wonderful it looks ,"
" Look ! Look !" Tim my yelled to
" Mom ! Look at what I am doing
" Look ! Look ! Benny found wood !"
" Wow ! Look what we did !" Tom
, " M ama ! Look at the cow !" Her
said , " Wow ! Look at all the toys !"
" Wow ! Look at the balloons !" Tom
" Oh no ! Look what you did !" Lily
" Wow ! Look at that dolphin !" Tom
say , " Wow ! Look at the rainbow !"
" Wow ! Look at all these gifts !"
and shouted " Look ! Look !" His mom smiled and
" Wow ! Look , Mom my ! The
said , " Wow ! Look at the pretty butterfly .
? Boy : Look , Mom ! I got
said , " Wow ! Look , mom ! A huge
. " Hey ! Look at me ! I figured
R aa ah ! Look at my snake ! â
" That 's amazing ! Look at it , its so

MIN ACTIVATIONS
MIN = -0.098

she didn 't like the bitter ones . One day ,
share . We hope your wing will heal soon ," Sara
you okay ? Does your leg hurt ?" Lily nodded and
? And what if the hospital is sm elly ?" Ben
doctor said that Lily 's arm was broken and she needed
that had lots of organized le ashes to choose from .
and drink . But the lemon ade is not sweet .
my told her that his knee hurt a lot . His
a bad injury and needed surgery . He said they were
here and rest until your leg feels better ."
but be careful . The creek is deep and fast .
Maybe later , when the rain stops ," Mom said .
I play with your ball ?" Spot asked . Fl uffy
about the flowers ," the bee says . The
warm coat to prevent the bitter weather . It was a
lady . The lady puts the leash on the dog .
is sick . He has to stay in bed . He
mom my explained that a muscle is a part of your
saw a foolish cat named Mitt ens . Mitt ens liked
They say they hope her leg will heal soon .

INTERVAL 0.047 - 0.096
CONTAINS 0.314%

Wow !" Tim said . " Look at this sword !
Mom my said , " Yes , Lily . Sw ans
and said , " Thank you for listening . Do you
, she slowly peek ed out from under the sheet and
Yes ! That is a very

INTERVAL -0.001 - 0.047
CONTAINS 52.342%

, fairy ," Anna said . " Do you like your
you , Mia , for helping me find my own special
digs it out . He wants to play with it .
Teddy 's neck and hugged him . She said , "
." She grabbed the last box with both of her hands

INTERVAL -0.050 - -0.001
CONTAINS 47.275%

The end . <|endoftext|> Once upon a time , there was
what she would find at the end of the fog .
the tree , mom ?" Sara asks . "
it . But Ben 's juice was not water
on its next adventure . <|endoftext|> Jack was a busy 3

INTERVAL -0.098 - -0.050
CONTAINS 0.069%

exercise . He loved to exercise and felt very happy when
ow , ow . My arm hurts . My bike is
mom told her that eating healthy food was important to stay
Tim asked . " The mosquitoes are bothering us ."
day , Tim my 's knee started to hurt even more
Token
Token ID198
Feature activation-1.13e-02

Pos logit contributions
storm+0.021
 worse+0.021
 meant+0.020
 means+0.020
 cost+0.020
Neg logit contributions
 amaz-0.025
 fingertips-0.024
 hands-0.021
joy-0.021
 aston-0.021
Token
Token ID198
Feature activation-5.54e-03

Pos logit contributions
storm+0.017
 worse+0.017
 meant+0.017
 means+0.017
 cost+0.017
Neg logit contributions
 amaz-0.021
 fingertips-0.021
 hands-0.019
joy-0.018
 aston-0.018
Token"
Token ID1
Feature activation+1.59e-02

Pos logit contributions
storm+0.008
 worse+0.008
 means+0.008
 meant+0.008
 cost+0.008
Neg logit contributions
 amaz-0.011
 fingertips-0.011
 hands-0.010
joy-0.009
 aston-0.009
TokenWow
Token ID22017
Feature activation+5.08e-02

Pos logit contributions
 amaz+0.028
 fingertips+0.025
 approval+0.024
 aston+0.024
joy+0.024
Neg logit contributions
 worse-0.027
storm-0.026
 cost-0.025
 meant-0.025
 means-0.025
Token!
Token ID0
Feature activation+4.38e-02

Pos logit contributions
 amaz+0.114
 fingertips+0.114
years+0.112
 aston+0.100
 approval+0.100
Neg logit contributions
 means-0.065
 worse-0.063
 meant-0.059
 wrong-0.058
 happen-0.058
Token Look
Token ID6803
Feature activation+9.56e-02

Pos logit contributions
 fingertips+0.076
 amaz+0.074
joy+0.065
 hands+0.064
 approval+0.062
Neg logit contributions
 means-0.078
storm-0.074
 meant-0.073
 cost-0.071
 worse-0.069
Token how
Token ID703
Feature activation+2.90e-02

Pos logit contributions
 fingertips+0.202
 amaz+0.194
 aston+0.177
years+0.177
joy+0.174
Neg logit contributions
 means-0.114
 worse-0.112
 wrong-0.111
ty-0.107
 meant-0.100
Token wonderful
Token ID7932
Feature activation-1.41e-03

Pos logit contributions
 amaz+0.057
 fingertips+0.055
joy+0.050
 hands+0.049
 aston+0.049
Neg logit contributions
 worse-0.044
storm-0.043
 meant-0.042
 means-0.042
 cost-0.041
Token it
Token ID340
Feature activation-1.11e-02

Pos logit contributions
storm+0.002
 worse+0.002
 meant+0.002
 cost+0.002
 means+0.002
Neg logit contributions
 amaz-0.003
 fingertips-0.003
 hands-0.002
joy-0.002
 aston-0.002
Token looks
Token ID3073
Feature activation+2.32e-02

Pos logit contributions
storm+0.017
 worse+0.017
 meant+0.016
 cost+0.016
 means+0.016
Neg logit contributions
 amaz-0.021
 fingertips-0.020
 hands-0.019
 aston-0.018
joy-0.018
Token,"
Token ID553
Feature activation+8.94e-04

Pos logit contributions
 amaz+0.041
 fingertips+0.039
 hands+0.035
joy+0.035
 aston+0.034
Neg logit contributions
 worse-0.040
 meant-0.039
storm-0.039
 means-0.039
 cost-0.038
Token
Token ID198
Feature activation-1.13e-02

Pos logit contributions
storm+0.021
 worse+0.021
 meant+0.020
 means+0.020
 cost+0.020
Neg logit contributions
 amaz-0.025
 fingertips-0.024
 hands-0.021
joy-0.021
 aston-0.021
Token
Token ID198
Feature activation-5.54e-03

Pos logit contributions
storm+0.017
 worse+0.017
 meant+0.017
 means+0.017
 cost+0.017
Neg logit contributions
 amaz-0.021
 fingertips-0.021
 hands-0.019
joy-0.018
 aston-0.018
Token"
Token ID1
Feature activation+1.59e-02

Pos logit contributions
storm+0.008
 worse+0.008
 means+0.008
 meant+0.008
 cost+0.008
Neg logit contributions
 amaz-0.011
 fingertips-0.011
 hands-0.010
joy-0.009
 aston-0.009
TokenLook
Token ID8567
Feature activation+6.81e-02

Pos logit contributions
 amaz+0.028
 fingertips+0.025
 approval+0.024
 aston+0.024
joy+0.024
Neg logit contributions
 worse-0.027
storm-0.026
 cost-0.025
 meant-0.025
 means-0.025
Token!
Token ID0
Feature activation+4.73e-02

Pos logit contributions
 amaz+0.148
 fingertips+0.144
years+0.133
joy+0.133
ushes+0.131
Neg logit contributions
 worse-0.087
 means-0.087
ty-0.085
 wrong-0.080
 meant-0.078
Token Look
Token ID6803
Feature activation+9.52e-02

Pos logit contributions
 amaz+0.082
 fingertips+0.082
 hands+0.072
joy+0.071
 approval+0.069
Neg logit contributions
 means-0.080
storm-0.078
 meant-0.077
 worse-0.075
 cost-0.075
Token!"
Token ID2474
Feature activation-6.44e-04

Pos logit contributions
 fingertips+0.202
 amaz+0.200
 aston+0.181
years+0.178
joy+0.176
Neg logit contributions
 means-0.119
 worse-0.111
ty-0.111
 wrong-0.110
 meant-0.105
Token Tim
Token ID5045
Feature activation-1.19e-02

Pos logit contributions
storm+0.001
 worse+0.001
 meant+0.001
 cost+0.001
 means+0.001
Neg logit contributions
 amaz-0.001
 fingertips-0.001
 hands-0.001
joy-0.001
 aston-0.001
Tokenmy
Token ID1820
Feature activation-1.72e-03

Pos logit contributions
storm+0.016
 worse+0.015
 cost+0.015
 meant+0.015
 means+0.014
Neg logit contributions
 amaz-0.026
 fingertips-0.025
 hands-0.023
joy-0.022
 aston-0.022
Token yelled
Token ID22760
Feature activation+2.51e-02

Pos logit contributions
storm+0.002
 worse+0.002
 meant+0.002
 means+0.002
 cost+0.002
Neg logit contributions
 amaz-0.004
 fingertips-0.004
 hands-0.003
joy-0.003
 aston-0.003
Token to
Token ID284
Feature activation+1.54e-02

Pos logit contributions
 amaz+0.052
 fingertips+0.049
ushes+0.043
 aston+0.043
joy+0.042
Neg logit contributions
 worse-0.037
 meant-0.036
 means-0.035
 cost-0.032
storm-0.032
Token
Token ID198
Feature activation+3.35e-03

Pos logit contributions
 amaz+0.030
 fingertips+0.029
 hands+0.026
joy+0.026
 aston+0.026
Neg logit contributions
 meant-0.022
 means-0.022
 worse-0.022
storm-0.022
 cost-0.021
Token
Token ID198
Feature activation+7.04e-03

Pos logit contributions
 amaz+0.007
 fingertips+0.007
 hands+0.007
joy+0.006
 toes+0.006
Neg logit contributions
 meant-0.004
 means-0.004
 worse-0.004
storm-0.004
 cost-0.004
Token"
Token ID1
Feature activation+3.22e-02

Pos logit contributions
 fingertips+0.015
 amaz+0.015
 hands+0.014
joy+0.014
 toes+0.013
Neg logit contributions
storm-0.009
 meant-0.009
 worse-0.009
 means-0.009
 cost-0.008
TokenMom
Token ID29252
Feature activation+3.79e-02

Pos logit contributions
 amaz+0.065
 fingertips+0.063
 hands+0.058
joy+0.057
 aston+0.056
Neg logit contributions
storm-0.047
 worse-0.046
 meant-0.045
 means-0.045
 cost-0.045
Token!
Token ID0
Feature activation+4.30e-02

Pos logit contributions
 amaz+0.091
 fingertips+0.091
 aston+0.083
joy+0.081
 approval+0.080
Neg logit contributions
 meant-0.041
 means-0.041
 worse-0.038
 cost-0.037
storm-0.035
Token Look
Token ID6803
Feature activation+9.49e-02

Pos logit contributions
 amaz+0.079
 fingertips+0.077
joy+0.070
 approval+0.068
 hands+0.068
Neg logit contributions
 means-0.068
storm-0.066
 meant-0.064
 cost-0.061
 worse-0.059
Token at
Token ID379
Feature activation+3.39e-02

Pos logit contributions
 fingertips+0.199
 amaz+0.197
 aston+0.184
 unison+0.179
 fascination+0.176
Neg logit contributions
 means-0.115
 worse-0.115
ty-0.109
 wrong-0.107
 meant-0.105
Token what
Token ID644
Feature activation+3.41e-02

Pos logit contributions
 amaz+0.070
 fingertips+0.068
 hands+0.062
joy+0.061
 aston+0.060
Neg logit contributions
storm-0.047
 worse-0.047
 meant-0.046
 means-0.045
 cost-0.045
Token I
Token ID314
Feature activation+3.29e-02

Pos logit contributions
 amaz+0.068
 fingertips+0.065
 hands+0.060
joy+0.059
 aston+0.058
Neg logit contributions
storm-0.050
 worse-0.050
 meant-0.049
 cost-0.048
 means-0.048
Token am
Token ID716
Feature activation+3.10e-02

Pos logit contributions
 amaz+0.061
 fingertips+0.059
 hands+0.054
joy+0.054
 aston+0.052
Neg logit contributions
 meant-0.053
storm-0.052
 worse-0.052
 means-0.052
 cost-0.052
Token doing
Token ID1804
Feature activation+1.09e-02

Pos logit contributions
 amaz+0.056
 fingertips+0.053
 hands+0.048
joy+0.048
 aston+0.047
Neg logit contributions
storm-0.052
 worse-0.051
 meant-0.051
 means-0.050
 cost-0.050
Token
Token ID198
Feature activation-1.15e-02

Pos logit contributions
storm+0.012
 meant+0.012
 means+0.012
 worse+0.011
 cost+0.011
Neg logit contributions
 amaz-0.015
 fingertips-0.015
 hands-0.013
joy-0.013
 aston-0.013
Token
Token ID198
Feature activation-4.67e-03

Pos logit contributions
storm+0.017
 worse+0.017
 meant+0.017
 means+0.017
 cost+0.016
Neg logit contributions
 amaz-0.022
 fingertips-0.022
 hands-0.020
joy-0.019
 aston-0.019
Token"
Token ID1
Feature activation+2.45e-02

Pos logit contributions
storm+0.007
 worse+0.007
 meant+0.006
 cost+0.006
 means+0.006
Neg logit contributions
 amaz-0.009
 fingertips-0.009
 hands-0.008
joy-0.008
 aston-0.008
TokenLook
Token ID8567
Feature activation+6.95e-02

Pos logit contributions
 amaz+0.047
 fingertips+0.043
joy+0.040
 aston+0.040
 hands+0.040
Neg logit contributions
 worse-0.039
storm-0.038
 means-0.036
 cost-0.036
 meant-0.036
Token!
Token ID0
Feature activation+4.65e-02

Pos logit contributions
 amaz+0.147
 fingertips+0.141
ushes+0.133
joy+0.133
 aston+0.130
Neg logit contributions
ty-0.091
 worse-0.089
 means-0.088
 wrong-0.081
 meant-0.081
Token Look
Token ID6803
Feature activation+9.48e-02

Pos logit contributions
 fingertips+0.084
 amaz+0.084
 hands+0.073
joy+0.073
 approval+0.071
Neg logit contributions
 means-0.076
storm-0.075
 meant-0.073
 cost-0.071
 worse-0.069
Token!
Token ID0
Feature activation+4.67e-02

Pos logit contributions
 fingertips+0.202
 amaz+0.198
 aston+0.182
ushes+0.176
 unison+0.175
Neg logit contributions
ty-0.114
 means-0.110
 worse-0.105
 wrong-0.104
 meant-0.099
Token Benny
Token ID44275
Feature activation+2.29e-02

Pos logit contributions
 amaz+0.083
 fingertips+0.082
joy+0.072
 hands+0.072
 approval+0.069
Neg logit contributions
 means-0.078
 meant-0.076
storm-0.076
 cost-0.073
 worse-0.071
Token found
Token ID1043
Feature activation+3.07e-02

Pos logit contributions
 amaz+0.034
 fingertips+0.032
joy+0.028
 hands+0.028
 aston+0.027
Neg logit contributions
storm-0.046
 meant-0.045
 means-0.045
 worse-0.045
 cost-0.045
Token wood
Token ID4898
Feature activation-3.09e-04

Pos logit contributions
 amaz+0.067
 fingertips+0.067
joy+0.060
 aston+0.059
 hands+0.057
Neg logit contributions
 worse-0.039
 means-0.039
 meant-0.038
 cost-0.036
ty-0.035
Token!"
Token ID2474
Feature activation-2.02e-02

Pos logit contributions
storm+0.001
 worse+0.001
 meant+0.001
 cost+0.001
 means+0.001
Neg logit contributions
 amaz-0.000
 fingertips-0.000
 hands-0.000
joy-0.000
 aston-0.000
Token
Token ID198
Feature activation-7.60e-03

Pos logit contributions
 amaz+0.029
 fingertips+0.028
 hands+0.025
joy+0.025
 aston+0.024
Neg logit contributions
storm-0.025
 worse-0.025
 meant-0.025
 means-0.025
 cost-0.024
Token
Token ID198
Feature activation-7.51e-03

Pos logit contributions
 worse+0.012
storm+0.012
 meant+0.011
 means+0.011
 cost+0.011
Neg logit contributions
 amaz-0.015
 fingertips-0.014
 hands-0.013
joy-0.013
 aston-0.012
Token"
Token ID1
Feature activation+2.64e-02

Pos logit contributions
 worse+0.010
storm+0.010
 means+0.010
 meant+0.010
 cost+0.010
Neg logit contributions
 amaz-0.016
 fingertips-0.016
 hands-0.014
joy-0.014
 aston-0.013
TokenWow
Token ID22017
Feature activation+5.59e-02

Pos logit contributions
 amaz+0.047
 fingertips+0.042
 aston+0.040
joy+0.039
 approval+0.039
Neg logit contributions
storm-0.044
 worse-0.044
 cost-0.042
 means-0.042
 meant-0.042
Token!
Token ID0
Feature activation+5.18e-02

Pos logit contributions
 amaz+0.142
 fingertips+0.137
years+0.133
 fascination+0.125
 aston+0.124
Neg logit contributions
 means-0.060
 worse-0.060
 happen-0.056
 wrong-0.056
ty-0.055
Token Look
Token ID6803
Feature activation+9.46e-02

Pos logit contributions
 fingertips+0.104
 amaz+0.098
 approval+0.091
years+0.090
ushes+0.089
Neg logit contributions
 means-0.082
 meant-0.071
storm-0.069
 cost-0.064
 worse-0.063
Token what
Token ID644
Feature activation+3.65e-02

Pos logit contributions
 fingertips+0.227
 amaz+0.214
 unison+0.206
 fascination+0.198
 aston+0.198
Neg logit contributions
 worse-0.093
 wrong-0.089
 means-0.085
ty-0.083
 bad-0.078
Token we
Token ID356
Feature activation+4.34e-02

Pos logit contributions
 amaz+0.075
 fingertips+0.072
 hands+0.066
joy+0.065
 aston+0.064
Neg logit contributions
storm-0.052
 worse-0.051
 meant-0.050
 means-0.050
 cost-0.049
Token did
Token ID750
Feature activation+2.07e-02

Pos logit contributions
 fingertips+0.083
 amaz+0.082
joy+0.077
 hands+0.071
 approval+0.070
Neg logit contributions
 meant-0.073
 means-0.069
 cost-0.069
 worse-0.068
storm-0.064
Token!"
Token ID2474
Feature activation-5.37e-03

Pos logit contributions
 amaz+0.031
 fingertips+0.030
 hands+0.026
joy+0.026
 aston+0.025
Neg logit contributions
storm-0.040
 worse-0.040
 meant-0.040
 means-0.040
 cost-0.039
Token Tom
Token ID4186
Feature activation-1.37e-02

Pos logit contributions
storm+0.010
 worse+0.010
 cost+0.010
 meant+0.010
rob+0.009
Neg logit contributions
 amaz-0.008
 fingertips-0.008
 hands-0.008
 aston-0.007
 delight-0.007
Token,
Token ID11
Feature activation+1.38e-02

Pos logit contributions
 amaz+0.082
 fingertips+0.078
ushes+0.069
 unison+0.069
 aston+0.068
Neg logit contributions
 worse-0.049
 means-0.048
 meant-0.046
storm-0.043
 hurt-0.041
Token "
Token ID366
Feature activation+2.67e-02

Pos logit contributions
 amaz+0.023
 fingertips+0.022
 hands+0.019
 aston+0.019
joy+0.019
Neg logit contributions
storm-0.025
 meant-0.025
 worse-0.024
 means-0.024
 cost-0.024
TokenM
Token ID44
Feature activation+1.00e-02

Pos logit contributions
 amaz+0.050
 fingertips+0.048
 hands+0.044
joy+0.044
 aston+0.042
Neg logit contributions
storm-0.043
 meant-0.041
 worse-0.041
 means-0.041
 cost-0.041
Tokenama
Token ID1689
Feature activation+2.24e-02

Pos logit contributions
 amaz+0.013
 fingertips+0.013
 hands+0.011
joy+0.010
 aston+0.010
Neg logit contributions
 worse-0.022
storm-0.021
 meant-0.021
 cost-0.021
 means-0.021
Token!
Token ID0
Feature activation+4.65e-02

Pos logit contributions
 amaz+0.042
 fingertips+0.041
 aston+0.035
ushes+0.035
 approval+0.035
Neg logit contributions
 meant-0.038
 means-0.038
storm-0.037
 cost-0.035
ty-0.034
Token Look
Token ID6803
Feature activation+9.44e-02

Pos logit contributions
 amaz+0.080
 fingertips+0.077
joy+0.071
 approval+0.067
 hands+0.067
Neg logit contributions
 means-0.081
storm-0.080
 meant-0.077
 worse-0.074
 cost-0.074
Token at
Token ID379
Feature activation+2.85e-02

Pos logit contributions
 fingertips+0.207
 amaz+0.206
ushes+0.197
 unison+0.193
 aston+0.192
Neg logit contributions
ty-0.102
 worse-0.097
 wrong-0.092
 means-0.092
 that-0.083
Token the
Token ID262
Feature activation-4.68e-03

Pos logit contributions
 amaz+0.054
 fingertips+0.050
 hands+0.048
 delight+0.045
joy+0.045
Neg logit contributions
storm-0.045
 worse-0.045
 meant-0.044
 cost-0.044
 means-0.042
Token cow
Token ID9875
Feature activation+1.14e-02

Pos logit contributions
 worse+0.007
storm+0.007
 means+0.007
 meant+0.007
 cost+0.007
Neg logit contributions
 amaz-0.009
 fingertips-0.008
 hands-0.008
joy-0.008
 delight-0.008
Token!"
Token ID2474
Feature activation-1.17e-03

Pos logit contributions
 amaz+0.017
 fingertips+0.016
joy+0.015
 aston+0.014
 hands+0.014
Neg logit contributions
storm-0.023
 meant-0.022
 means-0.022
 worse-0.022
 cost-0.022
Token Her
Token ID2332
Feature activation+9.05e-03

Pos logit contributions
 worse+0.002
storm+0.002
rob+0.002
 cost+0.002
 meant+0.002
Neg logit contributions
 amaz-0.002
 fingertips-0.002
 hands-0.002
joy-0.002
 delight-0.002
Token said
Token ID531
Feature activation+2.42e-02

Pos logit contributions
 meant+0.003
 worse+0.003
 means+0.003
storm+0.003
 cost+0.003
Neg logit contributions
 amaz-0.005
 fingertips-0.005
joy-0.005
 hands-0.005
 aston-0.005
Token,
Token ID11
Feature activation+1.20e-02

Pos logit contributions
 amaz+0.046
 fingertips+0.044
 hands+0.038
 aston+0.037
 approval+0.037
Neg logit contributions
 worse-0.041
 means-0.040
 meant-0.039
storm-0.039
 cost-0.037
Token "
Token ID366
Feature activation+1.42e-02

Pos logit contributions
 amaz+0.019
 fingertips+0.018
 hands+0.016
joy+0.016
 aston+0.016
Neg logit contributions
storm-0.023
 worse-0.022
 meant-0.022
 means-0.022
 cost-0.022
TokenWow
Token ID22017
Feature activation+7.11e-02

Pos logit contributions
 amaz+0.028
 fingertips+0.026
joy+0.025
 hands+0.024
 aston+0.024
Neg logit contributions
 worse-0.021
storm-0.021
 meant-0.020
 means-0.020
 cost-0.020
Token!
Token ID0
Feature activation+5.29e-02

Pos logit contributions
 amaz+0.177
 fingertips+0.169
years+0.164
ushes+0.161
 approval+0.159
Neg logit contributions
 worse-0.071
 wrong-0.070
 happen-0.067
ty-0.066
 means-0.064
Token Look
Token ID6803
Feature activation+9.42e-02

Pos logit contributions
 fingertips+0.111
 amaz+0.109
 approval+0.096
ushes+0.095
 toes+0.094
Neg logit contributions
 means-0.080
storm-0.072
 meant-0.071
 cost-0.069
 worse-0.061
Token at
Token ID379
Feature activation+2.70e-02

Pos logit contributions
 fingertips+0.226
 amaz+0.212
 unison+0.201
 aston+0.198
ushes+0.198
Neg logit contributions
 wrong-0.090
 worse-0.086
ty-0.084
 that-0.081
 means-0.079
Token all
Token ID477
Feature activation+2.03e-02

Pos logit contributions
 amaz+0.052
 fingertips+0.049
 hands+0.046
joy+0.045
 aston+0.044
Neg logit contributions
storm-0.042
 worse-0.041
 meant-0.040
 cost-0.039
 means-0.039
Token the
Token ID262
Feature activation-4.21e-03

Pos logit contributions
 amaz+0.041
 fingertips+0.039
 hands+0.036
joy+0.036
 aston+0.035
Neg logit contributions
storm-0.029
 worse-0.029
 meant-0.029
 means-0.029
 cost-0.028
Token toys
Token ID14958
Feature activation+2.56e-02

Pos logit contributions
 meant+0.006
 cost+0.006
 means+0.005
 worse+0.005
 ger+0.005
Neg logit contributions
 fingertips-0.009
 amaz-0.009
joy-0.008
years-0.008
apped-0.008
Token!"
Token ID2474
Feature activation-1.63e-02

Pos logit contributions
 amaz+0.043
 fingertips+0.041
joy+0.037
 aston+0.036
 hands+0.036
Neg logit contributions
storm-0.046
 meant-0.046
 worse-0.046
 cost-0.045
 means-0.045
Token
Token ID198
Feature activation+1.09e-03

Pos logit contributions
 amaz+0.025
 fingertips+0.024
joy+0.021
 hands+0.021
 aston+0.021
Neg logit contributions
 meant-0.017
 means-0.017
storm-0.017
 worse-0.017
 cost-0.016
Token
Token ID198
Feature activation+2.09e-03

Pos logit contributions
 amaz+0.002
 fingertips+0.002
joy+0.002
 hands+0.002
 approval+0.002
Neg logit contributions
 means-0.001
 meant-0.001
 worse-0.001
storm-0.001
 cost-0.001
Token"
Token ID1
Feature activation+2.39e-02

Pos logit contributions
 amaz+0.005
 fingertips+0.005
joy+0.004
maid+0.004
 hands+0.004
Neg logit contributions
 means-0.003
 worse-0.003
 meant-0.003
storm-0.003
guard-0.003
TokenWow
Token ID22017
Feature activation+5.26e-02

Pos logit contributions
 amaz+0.050
 fingertips+0.048
 hands+0.044
joy+0.044
 aston+0.042
Neg logit contributions
storm-0.033
 meant-0.033
 worse-0.033
 means-0.033
 cost-0.032
Token!
Token ID0
Feature activation+4.30e-02

Pos logit contributions
 amaz+0.133
 fingertips+0.129
years+0.128
 aston+0.116
 fascination+0.115
Neg logit contributions
 means-0.057
 worse-0.054
 happen-0.053
ty-0.051
 wrong-0.050
Token Look
Token ID6803
Feature activation+9.37e-02

Pos logit contributions
 fingertips+0.082
 amaz+0.077
years+0.072
 approval+0.071
ushes+0.070
Neg logit contributions
 means-0.073
storm-0.064
 meant-0.061
 cost-0.056
 chip-0.055
Token at
Token ID379
Feature activation+2.41e-02

Pos logit contributions
 fingertips+0.229
 amaz+0.215
years+0.201
 aston+0.199
 fascination+0.197
Neg logit contributions
 wrong-0.084
 means-0.084
 worse-0.080
ty-0.078
 that-0.070
Token the
Token ID262
Feature activation-1.00e-02

Pos logit contributions
 amaz+0.048
 fingertips+0.046
 hands+0.042
joy+0.042
 aston+0.041
Neg logit contributions
storm-0.036
 worse-0.035
 meant-0.035
 cost-0.034
 means-0.034
Token balloons
Token ID35485
Feature activation+1.06e-02

Pos logit contributions
storm+0.015
 cost+0.015
 worse+0.015
 meant+0.015
 means+0.015
Neg logit contributions
 amaz-0.019
 fingertips-0.019
joy-0.017
 hands-0.017
 aston-0.017
Token!"
Token ID2474
Feature activation-3.92e-03

Pos logit contributions
 amaz+0.015
 fingertips+0.014
 hands+0.012
joy+0.012
 aston+0.012
Neg logit contributions
storm-0.022
 worse-0.022
 meant-0.021
 means-0.021
 cost-0.021
Token Tom
Token ID4186
Feature activation-3.36e-02

Pos logit contributions
storm+0.007
 worse+0.007
 meant+0.007
 cost+0.007
 means+0.007
Neg logit contributions
 amaz-0.007
 fingertips-0.006
 hands-0.006
joy-0.006
 aston-0.006
Token
Token ID198
Feature activation-2.38e-03

Pos logit contributions
 worse+0.008
storm+0.008
 meant+0.008
 means+0.008
 cost+0.008
Neg logit contributions
 amaz-0.011
 fingertips-0.010
 hands-0.009
joy-0.009
 aston-0.009
Token"
Token ID1
Feature activation+2.99e-02

Pos logit contributions
 worse+0.004
storm+0.004
 means+0.004
 meant+0.004
 cost+0.003
Neg logit contributions
 amaz-0.005
 fingertips-0.005
 hands-0.004
joy-0.004
 aston-0.004
TokenOh
Token ID5812
Feature activation+4.50e-02

Pos logit contributions
 amaz+0.061
 fingertips+0.057
 hands+0.053
 aston+0.053
joy+0.052
Neg logit contributions
storm-0.042
 meant-0.041
 worse-0.041
 means-0.041
 cost-0.041
Token no
Token ID645
Feature activation+3.00e-02

Pos logit contributions
 fingertips+0.093
 amaz+0.090
ushes+0.087
joy+0.084
 approval+0.083
Neg logit contributions
 worse-0.066
 means-0.060
 hurt-0.058
 meant-0.057
storm-0.056
Token!
Token ID0
Feature activation+4.41e-02

Pos logit contributions
 amaz+0.048
 fingertips+0.046
 hands+0.040
joy+0.039
 aston+0.039
Neg logit contributions
 worse-0.057
storm-0.057
 means-0.056
 meant-0.056
 cost-0.055
Token Look
Token ID6803
Feature activation+9.37e-02

Pos logit contributions
 amaz+0.081
 fingertips+0.077
joy+0.069
 approval+0.068
 aston+0.067
Neg logit contributions
 means-0.074
 worse-0.071
storm-0.071
 meant-0.069
 cost-0.067
Token what
Token ID644
Feature activation+3.85e-02

Pos logit contributions
 fingertips+0.213
 amaz+0.211
 aston+0.195
 fascination+0.193
 unison+0.193
Neg logit contributions
 worse-0.124
 wrong-0.114
 means-0.101
 hurt-0.096
 bad-0.091
Token you
Token ID345
Feature activation+5.26e-02

Pos logit contributions
 amaz+0.081
 fingertips+0.078
 hands+0.071
joy+0.071
 aston+0.070
Neg logit contributions
storm-0.052
 worse-0.052
 meant-0.051
 means-0.051
 cost-0.050
Token did
Token ID750
Feature activation+1.47e-02

Pos logit contributions
 fingertips+0.122
 amaz+0.121
joy+0.116
 approval+0.108
 toes+0.106
Neg logit contributions
 meant-0.068
 cost-0.066
 means-0.063
 worse-0.062
 hurt-0.058
Token!"
Token ID2474
Feature activation-4.92e-03

Pos logit contributions
 amaz+0.019
 fingertips+0.018
 hands+0.016
joy+0.015
 aston+0.015
Neg logit contributions
storm-0.031
 worse-0.031
 meant-0.031
 cost-0.030
 means-0.030
Token Lily
Token ID20037
Feature activation-1.16e-02

Pos logit contributions
storm+0.009
 worse+0.008
 cost+0.008
 meant+0.008
ained+0.008
Neg logit contributions
 amaz-0.008
 fingertips-0.008
 hands-0.008
joy-0.007
 aston-0.007
Token
Token ID198
Feature activation+2.16e-03

Pos logit contributions
 amaz+0.035
 fingertips+0.034
 hands+0.030
joy+0.030
 aston+0.029
Neg logit contributions
storm-0.029
 worse-0.029
 means-0.029
 meant-0.028
 cost-0.027
Token
Token ID198
Feature activation+4.00e-03

Pos logit contributions
 amaz+0.004
 fingertips+0.004
 hands+0.004
joy+0.004
 aston+0.004
Neg logit contributions
 worse-0.003
storm-0.003
 meant-0.003
 means-0.003
 cost-0.003
Token"
Token ID1
Feature activation+2.90e-02

Pos logit contributions
 amaz+0.008
 fingertips+0.008
 hands+0.007
joy+0.007
 aston+0.007
Neg logit contributions
 worse-0.006
storm-0.006
 means-0.006
 meant-0.006
guard-0.006
TokenWow
Token ID22017
Feature activation+5.63e-02

Pos logit contributions
 amaz+0.058
 fingertips+0.053
 aston+0.050
joy+0.048
 hands+0.048
Neg logit contributions
 worse-0.043
 cost-0.041
storm-0.041
 means-0.041
 meant-0.041
Token!
Token ID0
Feature activation+5.10e-02

Pos logit contributions
 amaz+0.131
years+0.130
 fingertips+0.126
 aston+0.116
ushes+0.115
Neg logit contributions
 means-0.069
 worse-0.068
 happen-0.065
 meant-0.062
ty-0.062
Token Look
Token ID6803
Feature activation+9.32e-02

Pos logit contributions
 fingertips+0.091
 amaz+0.089
 approval+0.080
years+0.079
ushes+0.078
Neg logit contributions
 means-0.088
storm-0.081
 meant-0.079
 worse-0.075
 cost-0.074
Token at
Token ID379
Feature activation+2.02e-02

Pos logit contributions
 fingertips+0.208
 amaz+0.202
ushes+0.191
years+0.191
 aston+0.188
Neg logit contributions
 worse-0.103
 wrong-0.098
ty-0.097
 means-0.093
 that-0.083
Token that
Token ID326
Feature activation+1.70e-02

Pos logit contributions
 amaz+0.040
 fingertips+0.037
 hands+0.036
 delight+0.034
joy+0.033
Neg logit contributions
storm-0.031
 meant-0.030
 worse-0.029
 cost-0.029
 means-0.028
Token dolphin
Token ID44136
Feature activation+2.18e-03

Pos logit contributions
 amaz+0.031
 fingertips+0.028
 hands+0.027
joy+0.026
 delight+0.025
Neg logit contributions
storm-0.029
 worse-0.028
 cost-0.027
 meant-0.027
ained-0.026
Token!"
Token ID2474
Feature activation-3.59e-03

Pos logit contributions
 amaz+0.003
 fingertips+0.003
 hands+0.003
joy+0.002
 aston+0.002
Neg logit contributions
storm-0.004
 worse-0.004
 meant-0.004
 cost-0.004
 means-0.004
Token Tom
Token ID4186
Feature activation-1.22e-02

Pos logit contributions
storm+0.006
 worse+0.006
 cost+0.006
 meant+0.006
ained+0.006
Neg logit contributions
 amaz-0.006
 fingertips-0.006
 hands-0.006
 delight-0.005
 aston-0.005
Token say
Token ID910
Feature activation+4.77e-02

Pos logit contributions
 fingertips+0.015
joy+0.015
 amaz+0.015
years+0.015
ushes+0.013
Neg logit contributions
 means-0.016
 meant-0.016
 worse-0.016
 cost-0.016
storm-0.015
Token,
Token ID11
Feature activation+3.40e-02

Pos logit contributions
 fingertips+0.096
 amaz+0.096
 aston+0.084
joy+0.083
years+0.083
Neg logit contributions
 means-0.072
 worse-0.067
 meant-0.067
 happen-0.061
 wrong-0.060
Token "
Token ID366
Feature activation+4.00e-02

Pos logit contributions
 amaz+0.064
 fingertips+0.063
 hands+0.056
 aston+0.055
ushes+0.054
Neg logit contributions
 means-0.052
storm-0.051
 meant-0.050
 worse-0.047
 cost-0.047
TokenWow
Token ID22017
Feature activation+6.36e-02

Pos logit contributions
 amaz+0.085
 fingertips+0.083
joy+0.077
 hands+0.076
 aston+0.073
Neg logit contributions
storm-0.054
 means-0.052
 meant-0.052
 cost-0.051
 worse-0.049
Token!
Token ID0
Feature activation+4.97e-02

Pos logit contributions
 amaz+0.167
years+0.156
 fingertips+0.155
 aston+0.148
 fascination+0.148
Neg logit contributions
 means-0.063
 happen-0.060
 wrong-0.054
ty-0.054
 worse-0.052
Token Look
Token ID6803
Feature activation+9.31e-02

Pos logit contributions
 fingertips+0.094
 amaz+0.092
joy+0.083
years+0.082
 approval+0.082
Neg logit contributions
 means-0.082
 meant-0.072
storm-0.071
ained-0.068
 cost-0.065
Token at
Token ID379
Feature activation+3.44e-02

Pos logit contributions
 fingertips+0.218
 amaz+0.209
 aston+0.194
 fascination+0.194
ushes+0.189
Neg logit contributions
 means-0.092
 wrong-0.082
 worse-0.080
ty-0.077
 that-0.073
Token the
Token ID262
Feature activation+1.28e-02

Pos logit contributions
 amaz+0.070
 fingertips+0.067
 hands+0.062
joy+0.060
 aston+0.059
Neg logit contributions
storm-0.050
 worse-0.049
 meant-0.048
 cost-0.048
 means-0.047
Token rainbow
Token ID27223
Feature activation+2.39e-02

Pos logit contributions
 amaz+0.025
 fingertips+0.024
 hands+0.022
joy+0.022
 aston+0.021
Neg logit contributions
storm-0.020
 worse-0.019
 meant-0.019
 means-0.019
 cost-0.019
Token!"
Token ID2474
Feature activation-8.36e-03

Pos logit contributions
 amaz+0.036
 fingertips+0.035
 hands+0.031
joy+0.030
 aston+0.030
Neg logit contributions
storm-0.046
 worse-0.046
 meant-0.045
 means-0.045
 cost-0.045
Token
Token ID198
Feature activation-4.39e-04

Pos logit contributions
storm+0.016
 worse+0.015
 cost+0.015
 meant+0.015
 means+0.014
Neg logit contributions
 amaz-0.014
 fingertips-0.013
 hands-0.012
joy-0.012
 aston-0.011
Token
Token ID198
Feature activation-4.25e-03

Pos logit contributions
 amaz+0.001
 fingertips+0.001
 hands+0.001
joy+0.001
 aston+0.001
Neg logit contributions
 means-0.001
 meant-0.001
 worse-0.001
storm-0.001
 cost-0.001
Token
Token ID198
Feature activation-8.84e-03

Pos logit contributions
 worse+0.006
 meant+0.006
 means+0.006
storm+0.006
 cost+0.006
Neg logit contributions
 amaz-0.009
 fingertips-0.008
 hands-0.008
joy-0.008
 aston-0.007
Token"
Token ID1
Feature activation+1.47e-02

Pos logit contributions
 worse+0.013
 means+0.013
storm+0.013
 meant+0.013
 cost+0.012
Neg logit contributions
 amaz-0.018
 fingertips-0.017
joy-0.015
 hands-0.015
maid-0.015
TokenWow
Token ID22017
Feature activation+6.02e-02

Pos logit contributions
 amaz+0.028
 fingertips+0.026
 hands+0.024
joy+0.023
 aston+0.023
Neg logit contributions
storm-0.023
 worse-0.023
 meant-0.023
 cost-0.023
 means-0.022
Token!
Token ID0
Feature activation+4.09e-02

Pos logit contributions
 amaz+0.142
 fingertips+0.135
years+0.132
 aston+0.125
 approval+0.124
Neg logit contributions
 worse-0.074
 means-0.071
ty-0.069
 happen-0.067
 wrong-0.065
Token Look
Token ID6803
Feature activation+9.30e-02

Pos logit contributions
 fingertips+0.078
 amaz+0.074
 approval+0.067
ushes+0.067
years+0.066
Neg logit contributions
 means-0.066
storm-0.060
 meant-0.058
 cost-0.055
 surgery-0.054
Token at
Token ID379
Feature activation+2.71e-02

Pos logit contributions
 fingertips+0.218
 amaz+0.210
 unison+0.198
 aston+0.195
ushes+0.193
Neg logit contributions
 worse-0.092
 wrong-0.091
ty-0.090
 means-0.083
 that-0.078
Token all
Token ID477
Feature activation+2.76e-02

Pos logit contributions
 amaz+0.055
 fingertips+0.053
 hands+0.049
joy+0.048
 aston+0.047
Neg logit contributions
storm-0.039
 worse-0.039
 meant-0.038
 cost-0.037
 means-0.037
Token these
Token ID777
Feature activation+2.86e-02

Pos logit contributions
 amaz+0.057
 fingertips+0.055
joy+0.050
 hands+0.049
 aston+0.049
Neg logit contributions
 meant-0.038
 means-0.038
storm-0.038
 worse-0.038
 cost-0.037
Token gifts
Token ID13201
Feature activation+3.29e-02

Pos logit contributions
 fingertips+0.055
 amaz+0.055
joy+0.052
 aston+0.047
years+0.047
Neg logit contributions
 means-0.046
 meant-0.045
 cost-0.044
 worse-0.041
 happen-0.041
Token!"
Token ID2474
Feature activation-2.10e-02

Pos logit contributions
 amaz+0.048
 fingertips+0.046
joy+0.041
 hands+0.040
 aston+0.039
Neg logit contributions
storm-0.065
 meant-0.065
 worse-0.065
 means-0.065
 cost-0.064
Token and
Token ID290
Feature activation+3.89e-03

Pos logit contributions
 amaz+0.004
 fingertips+0.004
 hands+0.004
joy+0.003
 aston+0.003
Neg logit contributions
storm-0.004
 worse-0.004
 cost-0.004
 meant-0.004
rob-0.004
Token shouted
Token ID17293
Feature activation+3.35e-02

Pos logit contributions
 amaz+0.009
 fingertips+0.009
joy+0.008
 hands+0.008
 aston+0.008
Neg logit contributions
 meant-0.004
 worse-0.004
 means-0.004
 cost-0.004
storm-0.004
Token "
Token ID366
Feature activation+1.64e-02

Pos logit contributions
 amaz+0.067
 fingertips+0.064
ushes+0.059
years+0.057
 hands+0.055
Neg logit contributions
 worse-0.047
 means-0.046
 meant-0.045
storm-0.042
 happening-0.042
TokenLook
Token ID8567
Feature activation+7.50e-02

Pos logit contributions
 amaz+0.030
 fingertips+0.028
 hands+0.026
joy+0.025
 aston+0.025
Neg logit contributions
 worse-0.027
storm-0.027
 means-0.026
 meant-0.026
 cost-0.026
Token!
Token ID0
Feature activation+4.41e-02

Pos logit contributions
 amaz+0.160
joy+0.146
 unison+0.145
 fingertips+0.144
 aston+0.143
Neg logit contributions
ty-0.100
 worse-0.090
 means-0.089
 meant-0.084
 happen-0.081
Token Look
Token ID6803
Feature activation+9.25e-02

Pos logit contributions
 amaz+0.080
 fingertips+0.079
joy+0.070
 hands+0.070
 aston+0.067
Neg logit contributions
 means-0.071
storm-0.070
 meant-0.069
 worse-0.067
 cost-0.066
Token!"
Token ID2474
Feature activation-8.14e-03

Pos logit contributions
 amaz+0.202
 fingertips+0.198
 aston+0.187
 unison+0.184
ushes+0.180
Neg logit contributions
ty-0.107
 means-0.097
ock-0.093
 worse-0.093
 wrong-0.089
Token His
Token ID2399
Feature activation+8.10e-03

Pos logit contributions
storm+0.013
 worse+0.013
 meant+0.012
 cost+0.012
 means+0.012
Neg logit contributions
 amaz-0.015
 fingertips-0.015
 hands-0.014
joy-0.013
 aston-0.013
Token mom
Token ID1995
Feature activation+1.58e-02

Pos logit contributions
years+0.013
 amaz+0.013
joy+0.012
oor+0.012
 fingertips+0.012
Neg logit contributions
 meant-0.014
 worse-0.013
 cost-0.013
 hurt-0.012
 means-0.012
Token smiled
Token ID13541
Feature activation+3.17e-02

Pos logit contributions
 amaz+0.030
 fingertips+0.029
joy+0.028
ushes+0.027
 majesty+0.026
Neg logit contributions
 meant-0.026
 worse-0.024
 cost-0.023
 means-0.023
storm-0.023
Token and
Token ID290
Feature activation-7.92e-04

Pos logit contributions
 fingertips+0.065
 amaz+0.065
 hands+0.058
 majesty+0.057
joy+0.056
Neg logit contributions
 meant-0.043
 worse-0.043
 means-0.041
storm-0.040
 cost-0.038
Token
Token ID198
Feature activation-1.13e-02

Pos logit contributions
storm+0.026
 worse+0.026
 meant+0.025
 means+0.025
 cost+0.025
Neg logit contributions
 amaz-0.035
 fingertips-0.033
 hands-0.031
joy-0.030
 aston-0.029
Token
Token ID198
Feature activation-9.58e-03

Pos logit contributions
storm+0.020
 worse+0.019
 cost+0.019
 meant+0.019
 means+0.019
Neg logit contributions
 amaz-0.019
 fingertips-0.018
 hands-0.017
joy-0.017
 aston-0.016
Token"
Token ID1
Feature activation+1.97e-02

Pos logit contributions
storm+0.015
 worse+0.015
 meant+0.015
 means+0.015
 cost+0.015
Neg logit contributions
 amaz-0.018
 fingertips-0.017
joy-0.016
 hands-0.016
 aston-0.015
TokenWow
Token ID22017
Feature activation+5.30e-02

Pos logit contributions
 amaz+0.036
 fingertips+0.033
 aston+0.030
 hands+0.030
joy+0.030
Neg logit contributions
 worse-0.032
storm-0.032
 means-0.031
 meant-0.031
 cost-0.031
Token!
Token ID0
Feature activation+5.11e-02

Pos logit contributions
 amaz+0.123
 fingertips+0.117
years+0.113
 aston+0.109
ushes+0.108
Neg logit contributions
 worse-0.070
 means-0.064
ty-0.063
 happen-0.062
 wrong-0.059
Token Look
Token ID6803
Feature activation+9.24e-02

Pos logit contributions
 amaz+0.103
 fingertips+0.102
 fascination+0.090
 approval+0.090
ushes+0.090
Neg logit contributions
 means-0.074
storm-0.071
 meant-0.068
 cost-0.067
 worse-0.064
Token,
Token ID11
Feature activation+2.85e-02

Pos logit contributions
 fingertips+0.211
 amaz+0.202
 unison+0.195
ushes+0.186
 aston+0.186
Neg logit contributions
 worse-0.111
 wrong-0.102
ty-0.097
 means-0.094
 bad-0.087
Token Mom
Token ID11254
Feature activation+1.24e-02

Pos logit contributions
 fingertips+0.060
 amaz+0.059
joy+0.056
 aston+0.053
ushes+0.053
Neg logit contributions
 worse-0.039
 meant-0.038
 means-0.038
 cost-0.037
storm-0.036
Tokenmy
Token ID1820
Feature activation+2.29e-02

Pos logit contributions
 amaz+0.026
 fingertips+0.025
 hands+0.024
joy+0.023
 aston+0.023
Neg logit contributions
storm-0.017
 worse-0.016
 meant-0.016
 cost-0.016
 means-0.016
Token!
Token ID0
Feature activation+3.75e-02

Pos logit contributions
 amaz+0.033
 fingertips+0.032
 hands+0.027
joy+0.027
 aston+0.026
Neg logit contributions
 meant-0.047
 means-0.047
 worse-0.047
storm-0.047
 cost-0.046
Token The
Token ID383
Feature activation-2.07e-02

Pos logit contributions
 amaz+0.076
 fingertips+0.073
 approval+0.066
joy+0.065
 hands+0.064
Neg logit contributions
storm-0.055
 means-0.055
 meant-0.053
 cost-0.052
 worse-0.050
Token said
Token ID531
Feature activation+1.25e-02

Pos logit contributions
 amaz+0.009
 fingertips+0.009
joy+0.008
 hands+0.008
 approval+0.007
Neg logit contributions
 meant-0.006
 means-0.006
 cost-0.006
storm-0.006
 worse-0.006
Token,
Token ID11
Feature activation+9.27e-03

Pos logit contributions
 amaz+0.024
 fingertips+0.023
 aston+0.020
ushes+0.020
joy+0.020
Neg logit contributions
 worse-0.020
 means-0.019
 meant-0.019
storm-0.018
 cost-0.018
Token "
Token ID366
Feature activation+1.15e-02

Pos logit contributions
 amaz+0.017
 fingertips+0.016
 hands+0.014
 aston+0.014
ushes+0.014
Neg logit contributions
storm-0.016
 meant-0.015
 means-0.015
 worse-0.015
 cost-0.015
TokenWow
Token ID22017
Feature activation+6.09e-02

Pos logit contributions
 amaz+0.023
 fingertips+0.022
 hands+0.020
joy+0.020
 aston+0.019
Neg logit contributions
 worse-0.018
storm-0.017
 meant-0.017
 cost-0.017
 means-0.017
Token!
Token ID0
Feature activation+4.09e-02

Pos logit contributions
 amaz+0.153
 fingertips+0.145
years+0.144
 aston+0.136
 fascination+0.133
Neg logit contributions
 worse-0.060
 means-0.059
 wrong-0.059
 happen-0.057
ty-0.054
Token Look
Token ID6803
Feature activation+9.23e-02

Pos logit contributions
 fingertips+0.081
 amaz+0.080
joy+0.072
 hands+0.070
 toes+0.069
Neg logit contributions
 means-0.064
storm-0.059
 meant-0.059
 cost-0.054
ained-0.051
Token at
Token ID379
Feature activation+2.11e-02

Pos logit contributions
 fingertips+0.208
 amaz+0.207
 aston+0.191
 fascination+0.188
 unison+0.185
Neg logit contributions
ty-0.087
 means-0.082
 wrong-0.082
 worse-0.078
 that-0.077
Token the
Token ID262
Feature activation-2.35e-03

Pos logit contributions
 amaz+0.040
 fingertips+0.038
 hands+0.036
joy+0.034
 delight+0.034
Neg logit contributions
storm-0.033
 worse-0.033
 meant-0.032
 cost-0.032
 means-0.031
Token pretty
Token ID2495
Feature activation-2.33e-03

Pos logit contributions
storm+0.004
 worse+0.004
 means+0.004
 meant+0.004
 cost+0.004
Neg logit contributions
 amaz-0.004
 fingertips-0.004
 hands-0.004
joy-0.004
 aston-0.004
Token butterfly
Token ID35113
Feature activation+1.69e-03

Pos logit contributions
storm+0.004
 worse+0.004
 meant+0.003
 means+0.003
 cost+0.003
Neg logit contributions
 amaz-0.004
 fingertips-0.004
 hands-0.004
joy-0.004
 aston-0.004
Token.
Token ID13
Feature activation+3.76e-02

Pos logit contributions
 amaz+0.003
 fingertips+0.002
 hands+0.002
joy+0.002
 aston+0.002
Neg logit contributions
storm-0.003
 worse-0.003
 meant-0.003
 means-0.003
 cost-0.003
Token?
Token ID30
Feature activation+1.11e-02

Pos logit contributions
storm+0.007
 worse+0.007
 meant+0.007
 cost+0.007
 means+0.007
Neg logit contributions
 amaz-0.011
 fingertips-0.011
 hands-0.010
joy-0.010
 aston-0.010
Token
Token ID198
Feature activation-1.49e-02

Pos logit contributions
 amaz+0.021
 fingertips+0.021
joy+0.019
 hands+0.019
 toes+0.018
Neg logit contributions
 means-0.017
 meant-0.017
 cost-0.016
storm-0.016
 worse-0.016
Token
Token ID198
Feature activation-9.77e-03

Pos logit contributions
storm+0.022
 worse+0.022
 meant+0.022
 means+0.022
 cost+0.021
Neg logit contributions
 amaz-0.029
 fingertips-0.028
 hands-0.025
joy-0.024
 aston-0.024
TokenBoy
Token ID26554
Feature activation+3.99e-03

Pos logit contributions
storm+0.015
 worse+0.015
 means+0.014
 meant+0.014
 cost+0.014
Neg logit contributions
 amaz-0.019
 fingertips-0.018
 hands-0.017
joy-0.016
 aston-0.016
Token:
Token ID25
Feature activation+1.26e-02

Pos logit contributions
 amaz+0.007
 fingertips+0.007
 hands+0.006
joy+0.006
 aston+0.006
Neg logit contributions
 means-0.007
 meant-0.007
 worse-0.007
storm-0.007
 cost-0.007
Token Look
Token ID6803
Feature activation+9.21e-02

Pos logit contributions
 fingertips+0.021
 amaz+0.021
 hands+0.019
joy+0.018
 aston+0.018
Neg logit contributions
 means-0.022
 meant-0.022
storm-0.022
 worse-0.021
 cost-0.021
Token,
Token ID11
Feature activation+2.26e-02

Pos logit contributions
 fingertips+0.210
 amaz+0.208
 unison+0.194
 aston+0.189
 satisfaction+0.187
Neg logit contributions
 means-0.099
 wrong-0.095
ty-0.084
 worse-0.082
 meant-0.078
Token Mom
Token ID11254
Feature activation+4.78e-03

Pos logit contributions
 fingertips+0.046
 amaz+0.045
joy+0.042
ushes+0.041
 hands+0.040
Neg logit contributions
 means-0.031
 meant-0.030
 worse-0.029
storm-0.028
 cost-0.028
Token!
Token ID0
Feature activation+2.00e-02

Pos logit contributions
 amaz+0.009
 fingertips+0.009
 hands+0.008
joy+0.008
 aston+0.008
Neg logit contributions
storm-0.007
 meant-0.007
 means-0.007
 worse-0.007
 cost-0.007
Token I
Token ID314
Feature activation+2.71e-02

Pos logit contributions
 fingertips+0.040
 amaz+0.040
 hands+0.035
 approval+0.034
 toes+0.034
Neg logit contributions
 means-0.028
 meant-0.027
storm-0.027
 worse-0.025
 cost-0.024
Token got
Token ID1392
Feature activation-1.25e-03

Pos logit contributions
 fingertips+0.052
 amaz+0.052
 hands+0.047
joy+0.046
 toes+0.044
Neg logit contributions
 meant-0.042
 means-0.041
 cost-0.041
storm-0.040
 worse-0.040
Token said
Token ID531
Feature activation-5.20e-03

Pos logit contributions
 amaz+0.054
 fingertips+0.051
 aston+0.047
joy+0.046
 approval+0.045
Neg logit contributions
 worse-0.030
 means-0.028
 meant-0.027
 cost-0.026
ty-0.026
Token,
Token ID11
Feature activation-2.82e-03

Pos logit contributions
 worse+0.009
storm+0.009
 meant+0.009
 means+0.009
 cost+0.009
Neg logit contributions
 amaz-0.009
 fingertips-0.008
 hands-0.007
 aston-0.007
joy-0.007
Token "
Token ID366
Feature activation+8.02e-03

Pos logit contributions
storm+0.006
 worse+0.006
 meant+0.006
 cost+0.005
 means+0.005
Neg logit contributions
 amaz-0.004
 fingertips-0.004
 hands-0.003
joy-0.003
 aston-0.003
TokenWow
Token ID22017
Feature activation+6.38e-02

Pos logit contributions
 amaz+0.015
 fingertips+0.014
joy+0.013
 hands+0.013
 approval+0.013
Neg logit contributions
 worse-0.013
 meant-0.012
storm-0.012
 cost-0.012
 means-0.012
Token!
Token ID0
Feature activation+4.00e-02

Pos logit contributions
 amaz+0.157
years+0.157
 fingertips+0.155
ushes+0.141
 approval+0.140
Neg logit contributions
 means-0.064
 wrong-0.062
 worse-0.061
 happen-0.059
 meant-0.056
Token Look
Token ID6803
Feature activation+9.15e-02

Pos logit contributions
 fingertips+0.076
 amaz+0.075
joy+0.067
 hands+0.065
 approval+0.064
Neg logit contributions
 means-0.066
storm-0.063
 meant-0.062
 cost-0.060
 worse-0.057
Token,
Token ID11
Feature activation+3.02e-02

Pos logit contributions
 fingertips+0.195
 amaz+0.191
 aston+0.176
 satisfaction+0.170
 fascination+0.170
Neg logit contributions
 means-0.102
ty-0.101
 wrong-0.099
 worse-0.096
ock-0.090
Token mom
Token ID1995
Feature activation+2.77e-02

Pos logit contributions
 amaz+0.056
 fingertips+0.055
joy+0.049
 hands+0.049
 aston+0.048
Neg logit contributions
storm-0.047
 worse-0.047
 meant-0.047
 means-0.046
 cost-0.046
Token!
Token ID0
Feature activation+3.27e-02

Pos logit contributions
 amaz+0.052
 fingertips+0.051
 hands+0.045
joy+0.045
 aston+0.045
Neg logit contributions
 meant-0.045
 means-0.044
storm-0.044
 worse-0.043
 cost-0.043
Token A
Token ID317
Feature activation+1.57e-02

Pos logit contributions
 amaz+0.062
 fingertips+0.061
joy+0.055
 hands+0.053
 approval+0.053
Neg logit contributions
 means-0.052
storm-0.051
 meant-0.050
 cost-0.048
 worse-0.047
Token huge
Token ID3236
Feature activation+2.03e-02

Pos logit contributions
 amaz+0.031
 fingertips+0.030
 hands+0.027
joy+0.027
 aston+0.026
Neg logit contributions
storm-0.024
 worse-0.023
 meant-0.022
 cost-0.022
 means-0.022
Token.
Token ID13
Feature activation-1.57e-03

Pos logit contributions
 amaz+0.013
 fingertips+0.012
 hands+0.011
joy+0.011
 aston+0.011
Neg logit contributions
storm-0.013
 worse-0.013
 meant-0.013
 means-0.013
 cost-0.013
Token
Token ID198
Feature activation-4.37e-03

Pos logit contributions
 means+0.002
 meant+0.002
storm+0.002
 worse+0.002
 cost+0.002
Neg logit contributions
 amaz-0.003
 fingertips-0.003
joy-0.003
 hands-0.003
 aston-0.003
Token"
Token ID1
Feature activation+1.79e-02

Pos logit contributions
storm+0.007
 worse+0.007
 meant+0.007
 means+0.006
 cost+0.006
Neg logit contributions
 amaz-0.009
 fingertips-0.008
 hands-0.008
joy-0.007
 aston-0.007
TokenHey
Token ID10814
Feature activation+5.41e-02

Pos logit contributions
 amaz+0.036
 fingertips+0.033
joy+0.031
 hands+0.030
 approval+0.030
Neg logit contributions
 worse-0.027
storm-0.027
 means-0.025
 cost-0.025
 meant-0.025
Token!
Token ID0
Feature activation+4.13e-02

Pos logit contributions
 fingertips+0.106
 amaz+0.106
ushes+0.101
years+0.093
 aston+0.093
Neg logit contributions
 worse-0.084
storm-0.081
 means-0.081
 meant-0.081
 cost-0.076
Token Look
Token ID6803
Feature activation+9.14e-02

Pos logit contributions
 amaz+0.084
 fingertips+0.082
ushes+0.073
joy+0.071
 toes+0.071
Neg logit contributions
 means-0.061
storm-0.056
 meant-0.054
 cost-0.054
 worse-0.051
Token at
Token ID379
Feature activation+2.54e-02

Pos logit contributions
 fingertips+0.204
 amaz+0.197
ushes+0.189
 aston+0.184
 unison+0.183
Neg logit contributions
ty-0.104
 worse-0.101
 wrong-0.101
 means-0.097
 happening-0.087
Token me
Token ID502
Feature activation+3.63e-02

Pos logit contributions
 amaz+0.051
 fingertips+0.048
 hands+0.045
joy+0.044
 aston+0.043
Neg logit contributions
storm-0.038
 worse-0.037
 meant-0.037
 cost-0.036
 means-0.036
Token!
Token ID0
Feature activation+3.81e-02

Pos logit contributions
 amaz+0.060
 fingertips+0.058
joy+0.051
 hands+0.051
 aston+0.051
Neg logit contributions
 worse-0.067
storm-0.065
 meant-0.065
 means-0.065
 cost-0.064
Token I
Token ID314
Feature activation+4.15e-02

Pos logit contributions
 amaz+0.075
 fingertips+0.072
joy+0.065
 hands+0.064
 aston+0.063
Neg logit contributions
 means-0.058
storm-0.056
 meant-0.055
 cost-0.054
 worse-0.052
Token figured
Token ID11638
Feature activation+2.44e-02

Pos logit contributions
 amaz+0.084
 fingertips+0.084
joy+0.076
 hands+0.075
 aston+0.072
Neg logit contributions
 meant-0.062
 means-0.060
 cost-0.060
storm-0.059
 worse-0.058
Token
Token ID241
Feature activation+1.07e-02

Pos logit contributions
 worse+0.007
storm+0.007
 meant+0.006
 cost+0.006
 means+0.006
Neg logit contributions
 amaz-0.006
 fingertips-0.006
 hands-0.005
joy-0.005
 aston-0.005
TokenR
Token ID49
Feature activation+2.31e-02

Pos logit contributions
 amaz+0.021
 fingertips+0.020
 hands+0.018
joy+0.018
 aston+0.017
Neg logit contributions
 worse-0.016
storm-0.016
 meant-0.016
 cost-0.016
 means-0.016
Tokenaa
Token ID7252
Feature activation+1.08e-02

Pos logit contributions
 amaz+0.042
 fingertips+0.037
joy+0.034
 approval+0.034
 aston+0.034
Neg logit contributions
storm-0.040
 means-0.039
 meant-0.038
ock-0.038
 worse-0.037
Tokenah
Token ID993
Feature activation+1.36e-02

Pos logit contributions
 amaz+0.016
 fingertips+0.015
 hands+0.013
joy+0.013
 aston+0.013
Neg logit contributions
storm-0.022
 worse-0.021
 meant-0.021
 means-0.021
 cost-0.021
Token!
Token ID0
Feature activation+2.63e-02

Pos logit contributions
 amaz+0.024
 fingertips+0.022
joy+0.021
 aston+0.020
 approval+0.020
Neg logit contributions
storm-0.024
 meant-0.023
 means-0.023
 worse-0.023
 cost-0.023
Token Look
Token ID6803
Feature activation+9.11e-02

Pos logit contributions
 amaz+0.054
 fingertips+0.054
 aston+0.048
 hands+0.047
joy+0.047
Neg logit contributions
 means-0.035
 meant-0.035
storm-0.032
 worse-0.032
 cost-0.031
Token at
Token ID379
Feature activation+1.08e-02

Pos logit contributions
 amaz+0.206
 fingertips+0.197
 aston+0.187
 unison+0.185
joy+0.177
Neg logit contributions
 means-0.106
 worse-0.100
ty-0.098
 meant-0.096
ock-0.096
Token my
Token ID616
Feature activation+2.88e-02

Pos logit contributions
 amaz+0.020
 fingertips+0.019
 hands+0.017
joy+0.017
 aston+0.016
Neg logit contributions
storm-0.018
 worse-0.018
 meant-0.017
 cost-0.017
 means-0.017
Token snake
Token ID17522
Feature activation+5.94e-03

Pos logit contributions
 amaz+0.063
 fingertips+0.060
joy+0.058
 aston+0.057
years+0.056
Neg logit contributions
 meant-0.036
 means-0.036
 worse-0.035
 cost-0.034
storm-0.034
Token!
Token ID0
Feature activation+1.83e-02

Pos logit contributions
 amaz+0.009
 fingertips+0.009
 hands+0.008
joy+0.008
 aston+0.008
Neg logit contributions
storm-0.011
 worse-0.011
 meant-0.011
 cost-0.011
 means-0.011
Tokenâ
Token ID22940
Feature activation+3.05e-03

Pos logit contributions
 amaz+0.036
 fingertips+0.035
 hands+0.031
 aston+0.031
joy+0.031
Neg logit contributions
 means-0.026
 meant-0.026
storm-0.025
 worse-0.024
 cost-0.024
Token "
Token ID366
Feature activation+3.63e-02

Pos logit contributions
 amaz+0.025
 fingertips+0.024
ushes+0.021
 hands+0.021
 aston+0.021
Neg logit contributions
 meant-0.021
 worse-0.021
storm-0.021
 cost-0.020
 means-0.020
TokenThat
Token ID2504
Feature activation+3.41e-02

Pos logit contributions
 amaz+0.076
 fingertips+0.073
 hands+0.068
joy+0.067
 aston+0.065
Neg logit contributions
storm-0.050
 worse-0.049
 meant-0.049
 means-0.048
 cost-0.048
Token's
Token ID338
Feature activation+2.04e-02

Pos logit contributions
 amaz+0.067
 fingertips+0.064
 hands+0.059
 aston+0.056
joy+0.056
Neg logit contributions
storm-0.051
 worse-0.051
 cost-0.049
 meant-0.049
 means-0.048
Token amazing
Token ID4998
Feature activation+3.03e-02

Pos logit contributions
 amaz+0.038
 fingertips+0.035
 hands+0.033
 aston+0.032
 delight+0.031
Neg logit contributions
storm-0.034
 worse-0.032
rob-0.032
 cost-0.032
 means-0.031
Token!
Token ID0
Feature activation+3.85e-02

Pos logit contributions
 amaz+0.046
 fingertips+0.044
 hands+0.038
joy+0.038
 aston+0.038
Neg logit contributions
storm-0.059
 meant-0.058
 worse-0.058
 means-0.058
 cost-0.057
Token Look
Token ID6803
Feature activation+9.11e-02

Pos logit contributions
 amaz+0.071
 fingertips+0.070
joy+0.064
 hands+0.063
 aston+0.060
Neg logit contributions
 means-0.063
 meant-0.061
storm-0.059
 cost-0.056
 worse-0.055
Token at
Token ID379
Feature activation+2.86e-02

Pos logit contributions
 amaz+0.194
 fingertips+0.191
 aston+0.179
 fascination+0.173
 unison+0.172
Neg logit contributions
 means-0.103
ty-0.101
 worse-0.099
 wrong-0.096
 meant-0.093
Token it
Token ID340
Feature activation+2.93e-02

Pos logit contributions
 amaz+0.056
 fingertips+0.054
 hands+0.050
joy+0.048
 aston+0.047
Neg logit contributions
storm-0.043
 worse-0.043
 meant-0.042
 cost-0.041
 means-0.041
Token,
Token ID11
Feature activation+3.95e-02

Pos logit contributions
 amaz+0.046
 fingertips+0.043
 hands+0.039
joy+0.038
 aston+0.037
Neg logit contributions
storm-0.056
 worse-0.055
 meant-0.055
 means-0.054
 cost-0.054
Token its
Token ID663
Feature activation+1.48e-02

Pos logit contributions
 fingertips+0.082
 amaz+0.080
joy+0.078
years+0.074
ushes+0.074
Neg logit contributions
 means-0.051
 meant-0.051
 worse-0.048
storm-0.047
 cost-0.046
Token so
Token ID523
Feature activation+1.58e-02

Pos logit contributions
 amaz+0.025
 fingertips+0.024
joy+0.022
 aston+0.021
 hands+0.021
Neg logit contributions
 worse-0.025
 meant-0.025
storm-0.025
 cost-0.024
 means-0.024
Token she
Token ID673
Feature activation-1.97e-02

Pos logit contributions
storm+0.071
rob+0.066
 outage+0.065
ock+0.064
bank+0.063
Neg logit contributions
 amaz-0.060
 hands-0.057
 fingertips-0.054
 toes-0.051
 ear-0.050
Token didn
Token ID1422
Feature activation-2.74e-02

Pos logit contributions
storm+0.035
ock+0.035
rob+0.034
 outage+0.034
 worse+0.034
Neg logit contributions
 fingertips-0.027
 hands-0.027
 amaz-0.026
 perfectly-0.024
years-0.023
Token't
Token ID470
Feature activation-2.74e-02

Pos logit contributions
rob+0.047
storm+0.047
bank+0.044
 ger+0.044
 meant+0.044
Neg logit contributions
 fingertips-0.038
oor-0.037
 amaz-0.036
aps-0.035
 aston-0.034
Token like
Token ID588
Feature activation-4.42e-02

Pos logit contributions
storm+0.055
rob+0.055
 worse+0.052
 means+0.051
guard+0.049
Neg logit contributions
 fingertips-0.034
 hands-0.034
 amaz-0.033
 ear-0.031
 pride-0.030
Token the
Token ID262
Feature activation-5.41e-02

Pos logit contributions
storm+0.075
 outage+0.072
ained+0.068
 worse+0.065
rob+0.065
Neg logit contributions
 hands-0.075
 fingertips-0.073
 amaz-0.073
joy-0.070
 delight-0.067
Token bitter
Token ID12922
Feature activation-7.48e-02

Pos logit contributions
storm+0.083
 outage+0.080
rob+0.076
 means+0.076
ained+0.074
Neg logit contributions
 amaz-0.098
 hands-0.095
 fingertips-0.094
joy-0.090
 delight-0.088
Token ones
Token ID3392
Feature activation-1.04e-02

Pos logit contributions
ained+0.109
 outage+0.108
storm+0.107
limits+0.102
 worse+0.099
Neg logit contributions
 hands-0.134
 fingertips-0.126
 amaz-0.125
 eyes-0.125
 toes-0.122
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
storm+0.018
rob+0.017
ained+0.016
 worse+0.016
bank+0.015
Neg logit contributions
 fingertips-0.017
 hands-0.017
 amaz-0.016
 toes-0.015
 perfectly-0.015
Token One
Token ID1881
Feature activation-1.11e-02

Pos logit contributions
storm+0.019
 worse+0.019
 meant+0.018
 means+0.018
 cost+0.018
Neg logit contributions
 amaz-0.027
 fingertips-0.026
 hands-0.024
joy-0.024
 aston-0.023
Token day
Token ID1110
Feature activation-7.01e-03

Pos logit contributions
rob+0.016
 worse+0.015
storm+0.014
 means+0.014
bank+0.013
Neg logit contributions
 amaz-0.023
 hands-0.022
 fingertips-0.022
 ear-0.020
 toes-0.020
Token,
Token ID11
Feature activation-2.18e-02

Pos logit contributions
 means+0.011
 meant+0.011
 worse+0.011
storm+0.010
 cost+0.010
Neg logit contributions
 amaz-0.014
 fingertips-0.013
 approval-0.012
 aston-0.012
joy-0.011
Token share
Token ID2648
Feature activation-1.27e-02

Pos logit contributions
 amaz+0.001
 fingertips+0.001
 hands+0.001
joy+0.001
 aston+0.001
Neg logit contributions
storm-0.001
 worse-0.001
 meant-0.001
 means-0.001
 cost-0.001
Token.
Token ID13
Feature activation+9.05e-03

Pos logit contributions
 worse+0.019
storm+0.019
 meant+0.019
 means+0.019
 cost+0.018
Neg logit contributions
 amaz-0.025
 fingertips-0.024
 hands-0.022
joy-0.022
 aston-0.021
Token We
Token ID775
Feature activation-6.25e-04

Pos logit contributions
 amaz+0.016
 fingertips+0.016
joy+0.014
 hands+0.013
 aston+0.013
Neg logit contributions
 means-0.016
 meant-0.015
storm-0.015
 worse-0.015
 cost-0.014
Token hope
Token ID2911
Feature activation-1.85e-02

Pos logit contributions
 cost+0.001
 meant+0.001
 worse+0.001
 means+0.001
storm+0.001
Neg logit contributions
 amaz-0.001
 fingertips-0.001
joy-0.001
 aston-0.001
 hands-0.001
Token your
Token ID534
Feature activation-1.59e-02

Pos logit contributions
storm+0.032
rob+0.031
 meant+0.030
 cost+0.030
 means+0.030
Neg logit contributions
 fingertips-0.032
 amaz-0.031
 hands-0.031
joy-0.028
 toes-0.027
Token wing
Token ID8539
Feature activation-7.25e-02

Pos logit contributions
 worse+0.023
storm+0.023
 meant+0.022
 cost+0.022
 means+0.022
Neg logit contributions
 amaz-0.032
 fingertips-0.031
 aston-0.028
joy-0.027
 hands-0.027
Token will
Token ID481
Feature activation-2.07e-02

Pos logit contributions
storm+0.127
rob+0.120
 cost+0.112
 meant+0.111
 virus+0.109
Neg logit contributions
 amaz-0.120
 hands-0.111
ushes-0.107
 fingertips-0.106
 approval-0.103
Token heal
Token ID12035
Feature activation-2.79e-02

Pos logit contributions
storm+0.035
rob+0.035
 means+0.035
 meant+0.034
 worse+0.033
Neg logit contributions
 amaz-0.035
 fingertips-0.033
 hands-0.030
 approval-0.029
 aston-0.029
Token soon
Token ID2582
Feature activation-3.75e-02

Pos logit contributions
storm+0.052
 cost+0.050
 meant+0.049
ained+0.048
rob+0.048
Neg logit contributions
 amaz-0.043
 fingertips-0.042
 hands-0.040
 approval-0.037
joy-0.037
Token,"
Token ID553
Feature activation-7.72e-03

Pos logit contributions
rob+0.079
storm+0.078
ained+0.076
 cost+0.071
 meant+0.070
Neg logit contributions
 hands-0.046
 amaz-0.044
 fingertips-0.044
 delight-0.037
 approval-0.037
Token Sara
Token ID24799
Feature activation-4.08e-02

Pos logit contributions
storm+0.013
 worse+0.012
 cost+0.012
 meant+0.012
ained+0.012
Neg logit contributions
 amaz-0.014
 fingertips-0.013
 hands-0.013
joy-0.012
 aston-0.012
Token you
Token ID345
Feature activation+1.44e-02

Pos logit contributions
storm+0.026
 meant+0.024
 cost+0.024
rob+0.024
 means+0.024
Neg logit contributions
 amaz-0.026
 fingertips-0.026
 hands-0.025
joy-0.023
years-0.023
Token okay
Token ID8788
Feature activation+2.08e-02

Pos logit contributions
 amaz+0.022
 fingertips+0.020
 hands+0.020
joy+0.018
 approval+0.018
Neg logit contributions
storm-0.029
rob-0.028
 means-0.026
 cost-0.026
guard-0.026
Token?
Token ID30
Feature activation+1.27e-02

Pos logit contributions
 amaz+0.032
 fingertips+0.031
 hands+0.028
joy+0.028
 aston+0.026
Neg logit contributions
storm-0.039
 worse-0.038
 cost-0.038
 meant-0.038
rob-0.037
Token Does
Token ID8314
Feature activation-3.37e-02

Pos logit contributions
 amaz+0.022
 fingertips+0.021
 hands+0.019
joy+0.019
 aston+0.018
Neg logit contributions
storm-0.021
 worse-0.021
 meant-0.021
 means-0.021
 cost-0.021
Token your
Token ID534
Feature activation-1.29e-02

Pos logit contributions
rob+0.067
storm+0.067
ained+0.064
 cost+0.062
 meant+0.061
Neg logit contributions
 fingertips-0.045
 hands-0.042
 amaz-0.039
 eyes-0.038
 toes-0.038
Token leg
Token ID1232
Feature activation-7.15e-02

Pos logit contributions
storm+0.023
 worse+0.022
 meant+0.022
 cost+0.022
 means+0.022
Neg logit contributions
 amaz-0.022
 fingertips-0.022
 hands-0.020
joy-0.019
 delight-0.018
Token hurt
Token ID5938
Feature activation-2.16e-02

Pos logit contributions
rob+0.103
storm+0.097
bank+0.083
guard+0.081
 means+0.078
Neg logit contributions
 hands-0.135
 amaz-0.131
 fingertips-0.130
 perfectly-0.128
 pride-0.118
Token?"
Token ID1701
Feature activation-1.79e-02

Pos logit contributions
rob+0.034
storm+0.033
bank+0.030
 cost+0.029
guard+0.029
Neg logit contributions
 hands-0.037
 fingertips-0.035
 amaz-0.035
 perfectly-0.033
 approval-0.032
Token Lily
Token ID20037
Feature activation-1.49e-02

Pos logit contributions
storm+0.030
 worse+0.029
 meant+0.029
 cost+0.028
 means+0.028
Neg logit contributions
 amaz-0.032
 fingertips-0.031
 hands-0.029
joy-0.028
 delight-0.027
Token nodded
Token ID14464
Feature activation+6.16e-03

Pos logit contributions
storm+0.027
rob+0.025
ained+0.025
 worse+0.025
 outage+0.025
Neg logit contributions
 amaz-0.025
 fingertips-0.024
 hands-0.023
joy-0.021
 delight-0.021
Token and
Token ID290
Feature activation-1.92e-02

Pos logit contributions
 amaz+0.012
 fingertips+0.012
 hands+0.011
joy+0.010
 aston+0.010
Neg logit contributions
 worse-0.009
 meant-0.009
storm-0.009
 means-0.009
 cost-0.009
Token?
Token ID30
Feature activation+8.02e-03

Pos logit contributions
rob+0.053
storm+0.053
 cost+0.049
ained+0.049
bank+0.048
Neg logit contributions
 hands-0.027
 amaz-0.022
 fingertips-0.020
aps-0.020
 eyes-0.020
Token And
Token ID843
Feature activation-7.19e-03

Pos logit contributions
 amaz+0.014
 fingertips+0.013
 hands+0.012
joy+0.012
 aston+0.012
Neg logit contributions
storm-0.014
 worse-0.014
 meant-0.013
 cost-0.013
 means-0.013
Token what
Token ID644
Feature activation-1.03e-02

Pos logit contributions
storm+0.011
 worse+0.011
 meant+0.011
 means+0.010
 cost+0.010
Neg logit contributions
 amaz-0.014
 fingertips-0.013
 hands-0.012
joy-0.012
 aston-0.012
Token if
Token ID611
Feature activation-1.40e-02

Pos logit contributions
storm+0.019
rob+0.018
bank+0.016
 cost+0.016
 worse+0.016
Neg logit contributions
 amaz-0.017
 hands-0.017
 fingertips-0.016
 aston-0.014
 toes-0.014
Token the
Token ID262
Feature activation-2.43e-02

Pos logit contributions
storm+0.026
rob+0.025
ained+0.024
ock+0.023
bank+0.022
Neg logit contributions
 fingertips-0.022
 amaz-0.022
 hands-0.022
 toes-0.019
 delight-0.018
Token hospital
Token ID4436
Feature activation-7.13e-02

Pos logit contributions
storm+0.030
ained+0.029
rob+0.028
 meant+0.027
 means+0.027
Neg logit contributions
 amaz-0.052
 hands-0.049
 fingertips-0.048
 ear-0.048
 delight-0.046
Token is
Token ID318
Feature activation-2.05e-02

Pos logit contributions
storm+0.121
rob+0.116
 worse+0.114
ained+0.112
ock+0.108
Neg logit contributions
 amaz-0.119
 hands-0.114
 fingertips-0.112
 aston-0.099
 delight-0.097
Token sm
Token ID895
Feature activation-2.49e-02

Pos logit contributions
rob+0.035
storm+0.033
guard+0.031
ock+0.029
 cost+0.028
Neg logit contributions
 hands-0.034
 amaz-0.033
 ear-0.033
 fingertips-0.031
 pride-0.030
Tokenelly
Token ID6148
Feature activation-2.15e-02

Pos logit contributions
 cost+0.041
storm+0.039
 worse+0.038
 happen+0.036
rob+0.035
Neg logit contributions
 amaz-0.040
aps-0.040
 hands-0.040
joy-0.039
ushes-0.039
Token?"
Token ID1701
Feature activation-9.51e-03

Pos logit contributions
storm+0.046
rob+0.046
 cost+0.044
ained+0.044
 worse+0.042
Neg logit contributions
 hands-0.025
 amaz-0.024
 fingertips-0.022
 toes-0.021
 ear-0.019
Token Ben
Token ID3932
Feature activation-2.95e-02

Pos logit contributions
storm+0.019
 worse+0.018
 meant+0.018
 cost+0.018
 means+0.018
Neg logit contributions
 amaz-0.014
 fingertips-0.014
 hands-0.012
joy-0.012
 aston-0.012
Token doctor
Token ID6253
Feature activation-1.96e-02

Pos logit contributions
storm+0.028
rob+0.026
 meant+0.026
 means+0.026
ained+0.024
Neg logit contributions
 amaz-0.058
 fingertips-0.055
 hands-0.054
joy-0.053
 aston-0.052
Token said
Token ID531
Feature activation+4.25e-03

Pos logit contributions
 worse+0.029
storm+0.028
 meant+0.028
 means+0.028
 cost+0.028
Neg logit contributions
 amaz-0.039
 fingertips-0.038
 hands-0.034
joy-0.034
 aston-0.033
Token that
Token ID326
Feature activation-2.65e-02

Pos logit contributions
 amaz+0.007
 fingertips+0.007
 hands+0.006
joy+0.006
 aston+0.006
Neg logit contributions
storm-0.008
rob-0.007
 cost-0.007
 meant-0.007
 means-0.007
Token Lily
Token ID20037
Feature activation-2.46e-02

Pos logit contributions
rob+0.050
storm+0.050
ock+0.046
ained+0.045
 outage+0.044
Neg logit contributions
 amaz-0.037
 hands-0.036
 fingertips-0.034
joy-0.032
 aston-0.030
Token's
Token ID338
Feature activation-3.42e-02

Pos logit contributions
rob+0.046
storm+0.046
 worse+0.045
ock+0.043
ained+0.042
Neg logit contributions
 hands-0.034
 fingertips-0.034
 amaz-0.031
 perfectly-0.028
 toes-0.028
Token arm
Token ID3211
Feature activation-7.13e-02

Pos logit contributions
storm+0.061
rob+0.059
 means+0.056
 worse+0.056
 meant+0.055
Neg logit contributions
 amaz-0.057
 fingertips-0.057
 hands-0.057
joy-0.050
 ear-0.049
Token was
Token ID373
Feature activation-3.48e-02

Pos logit contributions
rob+0.136
storm+0.135
 hotter+0.124
 weekend+0.123
bank+0.123
Neg logit contributions
 hands-0.101
 fingertips-0.087
 perfectly-0.081
 eyes-0.076
 amaz-0.076
Token broken
Token ID5445
Feature activation-1.72e-02

Pos logit contributions
rob+0.068
storm+0.063
guard+0.059
 means+0.057
bank+0.055
Neg logit contributions
 hands-0.055
 fingertips-0.052
 perfectly-0.051
 amaz-0.051
 ear-0.049
Token and
Token ID290
Feature activation-2.70e-02

Pos logit contributions
storm+0.036
rob+0.035
 cost+0.033
guard+0.032
 means+0.032
Neg logit contributions
 amaz-0.021
 hands-0.021
 fingertips-0.020
 perfectly-0.019
joy-0.018
Token she
Token ID673
Feature activation-1.70e-02

Pos logit contributions
rob+0.049
storm+0.046
ock+0.043
ained+0.041
bank+0.040
Neg logit contributions
 hands-0.044
 amaz-0.042
 fingertips-0.041
 perfectly-0.039
years-0.038
Token needed
Token ID2622
Feature activation-2.76e-02

Pos logit contributions
storm+0.034
rob+0.033
ained+0.031
ock+0.031
bank+0.030
Neg logit contributions
 hands-0.023
 fingertips-0.023
 amaz-0.021
 perfectly-0.019
 pride-0.018
Token that
Token ID326
Feature activation-2.21e-02

Pos logit contributions
storm+0.025
 worse+0.024
 meant+0.024
 means+0.024
 cost+0.023
Neg logit contributions
 amaz-0.035
 fingertips-0.034
 hands-0.031
joy-0.031
 aston-0.030
Token had
Token ID550
Feature activation-1.53e-02

Pos logit contributions
storm+0.034
 worse+0.033
ock+0.032
rob+0.032
ained+0.031
Neg logit contributions
 amaz-0.041
 fingertips-0.039
 hands-0.037
 aston-0.034
 delight-0.034
Token lots
Token ID6041
Feature activation-1.86e-02

Pos logit contributions
storm+0.025
 means+0.024
 worse+0.024
rob+0.024
 meant+0.023
Neg logit contributions
 amaz-0.027
 fingertips-0.026
 hands-0.025
joy-0.023
 pride-0.023
Token of
Token ID286
Feature activation-4.95e-02

Pos logit contributions
storm+0.031
 worse+0.030
 means+0.029
 meant+0.029
 cost+0.029
Neg logit contributions
 amaz-0.034
 fingertips-0.032
 hands-0.030
joy-0.029
 aston-0.029
Token organized
Token ID8389
Feature activation-4.35e-02

Pos logit contributions
storm+0.092
 worse+0.080
guard+0.076
 outage+0.073
 means+0.073
Neg logit contributions
 amaz-0.086
 hands-0.085
 fingertips-0.082
 delight-0.077
 pride-0.076
Token le
Token ID443
Feature activation-7.03e-02

Pos logit contributions
storm+0.079
 worse+0.072
rob+0.066
ained+0.065
 means+0.065
Neg logit contributions
 amaz-0.076
 hands-0.073
 fingertips-0.071
joy-0.065
 delight-0.065
Tokenashes
Token ID7465
Feature activation-1.83e-02

Pos logit contributions
storm+0.122
 worse+0.120
 means+0.117
 meant+0.117
 cost+0.114
Neg logit contributions
 fingertips-0.113
 hands-0.109
 amaz-0.104
joy-0.104
 toes-0.099
Token to
Token ID284
Feature activation-1.74e-02

Pos logit contributions
storm+0.031
 worse+0.031
 cost+0.030
 meant+0.030
 means+0.030
Neg logit contributions
 amaz-0.032
 fingertips-0.030
 hands-0.029
joy-0.027
 aston-0.026
Token choose
Token ID3853
Feature activation-1.54e-02

Pos logit contributions
storm+0.031
 worse+0.029
 means+0.029
 meant+0.029
ained+0.028
Neg logit contributions
 amaz-0.030
 fingertips-0.029
 hands-0.027
 delight-0.025
 approval-0.025
Token from
Token ID422
Feature activation-1.51e-02

Pos logit contributions
storm+0.027
 worse+0.025
rob+0.024
 means+0.024
 cost+0.024
Neg logit contributions
 amaz-0.026
 fingertips-0.025
 hands-0.024
joy-0.023
 aston-0.022
Token.
Token ID13
Feature activation-2.42e-03

Pos logit contributions
storm+0.026
 worse+0.025
 meant+0.025
 means+0.025
 cost+0.024
Neg logit contributions
 amaz-0.026
 fingertips-0.025
 hands-0.023
joy-0.023
 aston-0.022
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
storm+0.061
 means+0.061
 worse+0.061
rob+0.061
 meant+0.060
Neg logit contributions
 amaz-0.050
 hands-0.049
 fingertips-0.049
 toes-0.041
joy-0.041
Token drink
Token ID4144
Feature activation-3.64e-02

Pos logit contributions
storm+0.043
 worse+0.040
 meant+0.039
rob+0.039
 means+0.039
Neg logit contributions
 amaz-0.047
 fingertips-0.044
 hands-0.042
 aston-0.039
joy-0.038
Token.
Token ID13
Feature activation+1.66e-02

Pos logit contributions
storm+0.064
rob+0.059
 outage+0.056
 cost+0.056
 worse+0.056
Neg logit contributions
 hands-0.061
 amaz-0.061
 fingertips-0.060
 approval-0.053
 delight-0.053
Token But
Token ID887
Feature activation-1.10e-02

Pos logit contributions
 amaz+0.036
 fingertips+0.035
joy+0.032
 hands+0.032
 aston+0.031
Neg logit contributions
 means-0.021
storm-0.021
 meant-0.021
 worse-0.021
 cost-0.020
Token the
Token ID262
Feature activation-1.86e-02

Pos logit contributions
 worse+0.014
storm+0.014
 meant+0.014
 means+0.013
 cost+0.013
Neg logit contributions
 amaz-0.024
 fingertips-0.024
 hands-0.022
joy-0.022
 aston-0.021
Token lemon
Token ID18873
Feature activation-6.95e-02

Pos logit contributions
storm+0.026
 means+0.025
 meant+0.025
 worse+0.025
ock+0.024
Neg logit contributions
 amaz-0.039
 fingertips-0.036
 hands-0.035
joy-0.034
 delight-0.033
Tokenade
Token ID671
Feature activation-5.73e-02

Pos logit contributions
storm+0.129
rob+0.125
bike+0.121
 operation+0.120
 worse+0.120
Neg logit contributions
 hands-0.095
 amaz-0.086
 fingertips-0.084
ushes-0.084
 toes-0.080
Token is
Token ID318
Feature activation-4.14e-02

Pos logit contributions
storm+0.098
rob+0.094
 worse+0.093
 meant+0.090
ock+0.089
Neg logit contributions
 amaz-0.097
 hands-0.092
 fingertips-0.090
 delight-0.080
 aston-0.080
Token not
Token ID407
Feature activation-1.71e-02

Pos logit contributions
rob+0.076
storm+0.072
 means+0.066
ock+0.065
bank+0.062
Neg logit contributions
 amaz-0.070
 fingertips-0.069
 hands-0.068
 toes-0.063
ushes-0.061
Token sweet
Token ID6029
Feature activation-1.58e-02

Pos logit contributions
storm+0.032
rob+0.030
 means+0.028
 worse+0.026
guard+0.026
Neg logit contributions
 hands-0.027
 amaz-0.027
 fingertips-0.027
 toes-0.025
joy-0.024
Token.
Token ID13
Feature activation+2.92e-02

Pos logit contributions
storm+0.034
rob+0.033
ained+0.032
Bug+0.031
 worse+0.031
Neg logit contributions
 amaz-0.019
 hands-0.018
 fingertips-0.018
joy-0.016
 toes-0.016
Tokenmy
Token ID1820
Feature activation-8.64e-03

Pos logit contributions
storm+0.038
 worse+0.034
rob+0.034
guard+0.034
 cost+0.034
Neg logit contributions
 amaz-0.034
 hands-0.033
 fingertips-0.032
joy-0.030
 aston-0.028
Token told
Token ID1297
Feature activation-7.65e-03

Pos logit contributions
storm+0.013
 worse+0.013
ained+0.013
rob+0.012
 cost+0.012
Neg logit contributions
 amaz-0.016
 fingertips-0.016
 hands-0.015
joy-0.014
 aston-0.014
Token her
Token ID607
Feature activation-3.39e-03

Pos logit contributions
storm+0.016
ained+0.015
 outage+0.014
rob+0.014
 cost+0.014
Neg logit contributions
 amaz-0.012
 hands-0.011
 fingertips-0.011
 delight-0.011
 approval-0.010
Token that
Token ID326
Feature activation-3.05e-02

Pos logit contributions
storm+0.005
 worse+0.005
 meant+0.004
 cost+0.004
 means+0.004
Neg logit contributions
 amaz-0.007
 fingertips-0.007
 hands-0.006
joy-0.006
 aston-0.006
Token his
Token ID465
Feature activation-2.60e-02

Pos logit contributions
storm+0.050
rob+0.048
ained+0.046
 cost+0.044
ooky+0.044
Neg logit contributions
 amaz-0.053
 hands-0.050
 fingertips-0.048
joy-0.046
 toes-0.045
Token knee
Token ID10329
Feature activation-6.91e-02

Pos logit contributions
storm+0.058
rob+0.057
 means+0.055
 cost+0.054
 meant+0.054
Neg logit contributions
 hands-0.034
 fingertips-0.033
 amaz-0.032
 toes-0.028
joy-0.026
Token hurt
Token ID5938
Feature activation-2.58e-02

Pos logit contributions
rob+0.151
igators+0.142
storm+0.139
ock+0.134
igator+0.132
Neg logit contributions
 hands-0.083
 fingertips-0.072
 perfectly-0.069
 eyes-0.065
 amaz-0.064
Token a
Token ID257
Feature activation-8.64e-03

Pos logit contributions
storm+0.054
rob+0.052
guard+0.048
 cost+0.048
 outage+0.046
Neg logit contributions
 hands-0.038
 fingertips-0.034
 amaz-0.032
 perfectly-0.031
 toes-0.031
Token lot
Token ID1256
Feature activation-2.40e-02

Pos logit contributions
rob+0.018
 means+0.018
storm+0.018
ained+0.018
 meant+0.017
Neg logit contributions
 amaz-0.011
 fingertips-0.010
 hands-0.010
joy-0.009
 approval-0.009
Token.
Token ID13
Feature activation-7.60e-03

Pos logit contributions
storm+0.056
rob+0.054
 cost+0.052
making+0.048
burn+0.048
Neg logit contributions
 hands-0.022
 perfectly-0.019
 amaz-0.019
 fingertips-0.018
 pride-0.018
Token His
Token ID2399
Feature activation-7.44e-03

Pos logit contributions
storm+0.013
 worse+0.013
rob+0.013
 meant+0.012
 cost+0.012
Neg logit contributions
 amaz-0.013
 fingertips-0.013
 hands-0.012
joy-0.012
 delight-0.011
Token a
Token ID257
Feature activation-7.72e-03

Pos logit contributions
storm+0.027
rob+0.025
 worse+0.024
 means+0.024
 meant+0.023
Neg logit contributions
 amaz-0.033
 fingertips-0.032
 hands-0.032
joy-0.030
 aston-0.028
Token bad
Token ID2089
Feature activation-4.42e-02

Pos logit contributions
storm+0.012
 means+0.012
 meant+0.012
 worse+0.012
rob+0.012
Neg logit contributions
 amaz-0.015
 fingertips-0.014
 hands-0.013
joy-0.013
 aston-0.012
Token injury
Token ID5095
Feature activation-4.86e-02

Pos logit contributions
rob+0.060
storm+0.059
ained+0.059
 worse+0.057
 meant+0.057
Neg logit contributions
 amaz-0.086
 hands-0.086
 fingertips-0.084
joy-0.079
 ear-0.078
Token and
Token ID290
Feature activation-2.59e-02

Pos logit contributions
storm+0.104
 worse+0.101
rob+0.101
 cost+0.099
 colder+0.098
Neg logit contributions
 hands-0.050
 amaz-0.044
 fingertips-0.044
 perfectly-0.042
 ear-0.041
Token needed
Token ID2622
Feature activation-3.15e-02

Pos logit contributions
rob+0.046
storm+0.045
ained+0.043
ock+0.042
bank+0.038
Neg logit contributions
 hands-0.043
 amaz-0.041
 fingertips-0.040
 ear-0.035
 pride-0.035
Token surgery
Token ID8185
Feature activation-6.87e-02

Pos logit contributions
storm+0.059
rob+0.057
ained+0.053
 means+0.052
 meant+0.052
Neg logit contributions
 hands-0.050
 fingertips-0.049
 amaz-0.048
 approval-0.044
 pride-0.044
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
storm+0.146
rob+0.145
 worse+0.138
ock+0.135
 cost+0.135
Neg logit contributions
 hands-0.085
 fingertips-0.077
 amaz-0.069
 toes-0.069
 perfectly-0.063
Token He
Token ID679
Feature activation-1.75e-02

Pos logit contributions
 amaz+0.023
 fingertips+0.022
 hands+0.020
joy+0.020
 aston+0.019
Neg logit contributions
storm-0.024
 worse-0.023
 cost-0.022
 meant-0.022
rob-0.022
Token said
Token ID531
Feature activation+1.06e-02

Pos logit contributions
storm+0.032
ained+0.032
ock+0.029
rob+0.029
bank+0.028
Neg logit contributions
 hands-0.026
 amaz-0.025
 fingertips-0.024
 aston-0.022
 ear-0.022
Token they
Token ID484
Feature activation-1.28e-02

Pos logit contributions
 amaz+0.019
 fingertips+0.018
 hands+0.018
joy+0.017
 aston+0.016
Neg logit contributions
storm-0.017
rob-0.016
 cost-0.016
ained-0.016
 meant-0.016
Token were
Token ID547
Feature activation-1.61e-02

Pos logit contributions
storm+0.024
rob+0.023
 worse+0.022
 means+0.022
ock+0.022
Neg logit contributions
 fingertips-0.019
 amaz-0.019
 hands-0.019
 aston-0.016
 pride-0.016
Token here
Token ID994
Feature activation+9.90e-03

Pos logit contributions
 amaz+0.002
 fingertips+0.002
 aston+0.001
joy+0.001
 hands+0.001
Neg logit contributions
 worse-0.001
 meant-0.001
 means-0.001
storm-0.001
 operation-0.001
Token and
Token ID290
Feature activation-2.37e-02

Pos logit contributions
 amaz+0.021
 fingertips+0.020
 aston+0.017
joy+0.017
 hands+0.017
Neg logit contributions
 worse-0.015
 meant-0.014
storm-0.014
 means-0.014
 cost-0.014
Token rest
Token ID1334
Feature activation-1.61e-02

Pos logit contributions
 meant+0.043
storm+0.043
 means+0.043
rob+0.042
ained+0.042
Neg logit contributions
 amaz-0.038
 fingertips-0.037
 hands-0.034
 delight-0.031
 aston-0.030
Token until
Token ID1566
Feature activation-9.90e-03

Pos logit contributions
storm+0.028
 worse+0.027
 meant+0.027
 cost+0.027
 means+0.027
Neg logit contributions
 amaz-0.028
 fingertips-0.027
 hands-0.025
joy-0.024
 aston-0.023
Token your
Token ID534
Feature activation-2.80e-02

Pos logit contributions
storm+0.016
 meant+0.015
 worse+0.015
 means+0.015
 cost+0.015
Neg logit contributions
 amaz-0.018
 fingertips-0.018
 hands-0.016
joy-0.016
 aston-0.015
Token leg
Token ID1232
Feature activation-6.86e-02

Pos logit contributions
storm+0.051
 meant+0.050
 worse+0.050
 means+0.049
 cost+0.049
Neg logit contributions
 amaz-0.046
 fingertips-0.045
 hands-0.042
joy-0.039
 aston-0.038
Token feels
Token ID5300
Feature activation-3.48e-02

Pos logit contributions
storm+0.115
rob+0.113
 meant+0.108
 worse+0.107
guard+0.105
Neg logit contributions
 hands-0.109
 fingertips-0.104
 amaz-0.104
 perfectly-0.090
 toes-0.089
Token better
Token ID1365
Feature activation-1.33e-02

Pos logit contributions
rob+0.057
storm+0.055
 means+0.052
guard+0.049
 cost+0.048
Neg logit contributions
 amaz-0.065
 fingertips-0.061
 hands-0.060
 delight-0.057
 pride-0.057
Token."
Token ID526
Feature activation-1.44e-02

Pos logit contributions
storm+0.025
 cost+0.025
rob+0.024
 meant+0.024
 worse+0.024
Neg logit contributions
 amaz-0.019
 fingertips-0.019
 hands-0.018
joy-0.016
 toes-0.016
Token
Token ID220
Feature activation+2.50e-03

Pos logit contributions
 worse+0.025
 means+0.025
storm+0.024
 meant+0.024
 cost+0.024
Neg logit contributions
 amaz-0.025
 fingertips-0.024
 aston-0.021
joy-0.021
 hands-0.020
Token
Token ID198
Feature activation+4.75e-05

Pos logit contributions
 amaz+0.005
 fingertips+0.004
 hands+0.004
joy+0.004
 aston+0.004
Neg logit contributions
storm-0.004
 worse-0.004
 meant-0.004
 means-0.004
 cost-0.004
Token but
Token ID475
Feature activation-2.42e-02

Pos logit contributions
 amaz+0.015
 fingertips+0.015
joy+0.014
 hands+0.013
 aston+0.013
Neg logit contributions
 worse-0.011
storm-0.010
 meant-0.010
 means-0.010
 cost-0.010
Token be
Token ID307
Feature activation-1.14e-02

Pos logit contributions
rob+0.046
storm+0.045
 worse+0.042
ock+0.042
ained+0.041
Neg logit contributions
 amaz-0.039
 fingertips-0.038
 hands-0.037
 toes-0.032
 delight-0.032
Token careful
Token ID8161
Feature activation-4.32e-02

Pos logit contributions
 worse+0.021
 meant+0.021
 means+0.020
ained+0.020
 cost+0.019
Neg logit contributions
 amaz-0.019
 fingertips-0.019
 aston-0.016
joy-0.016
 hands-0.016
Token.
Token ID13
Feature activation+3.39e-03

Pos logit contributions
storm+0.083
rob+0.082
 cost+0.080
 worse+0.079
 meant+0.077
Neg logit contributions
 fingertips-0.064
 hands-0.063
 amaz-0.062
 toes-0.056
 aston-0.051
Token The
Token ID383
Feature activation-2.79e-02

Pos logit contributions
 amaz+0.006
 fingertips+0.006
joy+0.005
 hands+0.005
 aston+0.005
Neg logit contributions
storm-0.006
 worse-0.006
 means-0.006
 meant-0.006
 cost-0.006
Token creek
Token ID39459
Feature activation-6.85e-02

Pos logit contributions
storm+0.041
 worse+0.039
rob+0.039
 meant+0.038
 cost+0.037
Neg logit contributions
 amaz-0.055
 hands-0.052
 fingertips-0.051
joy-0.049
 delight-0.048
Token is
Token ID318
Feature activation-4.65e-02

Pos logit contributions
storm+0.129
rob+0.124
burn+0.123
 worse+0.121
ock+0.119
Neg logit contributions
 hands-0.094
 amaz-0.092
 fingertips-0.086
 perfectly-0.080
 aston-0.076
Token deep
Token ID2769
Feature activation-3.17e-02

Pos logit contributions
rob+0.090
storm+0.083
ock+0.075
 means+0.074
guard+0.073
Neg logit contributions
 amaz-0.075
 hands-0.074
 fingertips-0.070
 toes-0.066
 perfectly-0.065
Token and
Token ID290
Feature activation-2.58e-02

Pos logit contributions
storm+0.072
 cost+0.068
rob+0.067
making+0.065
 surgery+0.064
Neg logit contributions
 hands-0.036
 amaz-0.033
joy-0.030
 toes-0.029
 perfectly-0.027
Token fast
Token ID3049
Feature activation-3.40e-02

Pos logit contributions
storm+0.039
rob+0.039
ock+0.036
 meant+0.036
 worse+0.036
Neg logit contributions
 amaz-0.050
 fingertips-0.048
 hands-0.046
 aston-0.042
 toes-0.042
Token.
Token ID13
Feature activation+6.97e-03

Pos logit contributions
storm+0.070
rob+0.066
 cost+0.064
ained+0.063
 worse+0.062
Neg logit contributions
 hands-0.048
 amaz-0.043
 fingertips-0.040
 toes-0.039
joy-0.037
TokenMaybe
Token ID13300
Feature activation+6.07e-04

Pos logit contributions
storm+0.007
rob+0.006
 worse+0.006
 meant+0.006
ained+0.006
Neg logit contributions
 amaz-0.005
 fingertips-0.005
 hands-0.005
 aston-0.004
 approval-0.004
Token later
Token ID1568
Feature activation-1.95e-02

Pos logit contributions
 amaz+0.001
 fingertips+0.001
 hands+0.001
joy+0.001
 aston+0.001
Neg logit contributions
storm-0.001
 worse-0.001
 cost-0.001
 meant-0.001
rob-0.001
Token,
Token ID11
Feature activation+9.90e-03

Pos logit contributions
storm+0.034
 worse+0.034
 meant+0.033
 cost+0.033
 means+0.033
Neg logit contributions
 amaz-0.033
 fingertips-0.031
 hands-0.029
joy-0.028
 aston-0.027
Token when
Token ID618
Feature activation-7.60e-03

Pos logit contributions
 amaz+0.019
 fingertips+0.018
 hands+0.017
joy+0.017
 aston+0.016
Neg logit contributions
storm-0.015
 worse-0.015
 meant-0.015
 means-0.015
 cost-0.015
Token the
Token ID262
Feature activation-2.92e-02

Pos logit contributions
storm+0.012
 worse+0.011
 meant+0.011
 cost+0.011
 means+0.011
Neg logit contributions
 amaz-0.015
 fingertips-0.014
 hands-0.013
joy-0.012
 aston-0.012
Token rain
Token ID6290
Feature activation-6.84e-02

Pos logit contributions
 worse+0.042
 means+0.042
 meant+0.041
 cost+0.041
storm+0.041
Neg logit contributions
 amaz-0.058
 fingertips-0.053
 hands-0.052
joy-0.050
 delight-0.049
Token stops
Token ID9911
Feature activation-2.09e-02

Pos logit contributions
 worse+0.122
rob+0.122
ock+0.117
storm+0.112
 surgery+0.111
Neg logit contributions
 hands-0.093
 amaz-0.093
 fingertips-0.083
oor-0.076
apped-0.074
Token,"
Token ID553
Feature activation-9.87e-03

Pos logit contributions
rob+0.035
storm+0.033
 cost+0.033
 meant+0.031
 worse+0.031
Neg logit contributions
 hands-0.035
 fingertips-0.035
 amaz-0.034
 toes-0.031
joy-0.030
Token Mom
Token ID11254
Feature activation-3.79e-02

Pos logit contributions
storm+0.021
 worse+0.020
 means+0.020
 meant+0.020
 cost+0.020
Neg logit contributions
 amaz-0.014
 fingertips-0.013
 hands-0.012
joy-0.011
 aston-0.011
Token said
Token ID531
Feature activation-1.80e-02

Pos logit contributions
storm+0.054
 worse+0.052
ained+0.051
 ger+0.048
rob+0.048
Neg logit contributions
 amaz-0.076
 fingertips-0.072
 hands-0.070
 aston-0.066
 approval-0.065
Token.
Token ID13
Feature activation-1.48e-02

Pos logit contributions
 worse+0.029
 meant+0.028
 means+0.027
storm+0.027
 cost+0.027
Neg logit contributions
 amaz-0.035
 fingertips-0.033
joy-0.029
 aston-0.029
 hands-0.029
Token I
Token ID314
Feature activation+3.65e-03

Pos logit contributions
storm+0.076
 meant+0.075
ained+0.075
ock+0.073
 worse+0.072
Neg logit contributions
 fingertips-0.054
 hands-0.050
 amaz-0.049
joy-0.043
 aston-0.042
Token play
Token ID711
Feature activation+1.11e-02

Pos logit contributions
 amaz+0.008
 fingertips+0.007
joy+0.007
 aston+0.006
 hands+0.006
Neg logit contributions
 cost-0.005
 meant-0.005
storm-0.005
 worse-0.005
 means-0.005
Token with
Token ID351
Feature activation-5.26e-06

Pos logit contributions
 amaz+0.018
 fingertips+0.017
 hands+0.015
joy+0.015
 aston+0.015
Neg logit contributions
storm-0.021
 worse-0.021
 meant-0.020
 means-0.020
 cost-0.020
Token your
Token ID534
Feature activation+2.74e-02

Feature not active on prev token;
no predictions were affected.
Token ball
Token ID2613
Feature activation-1.38e-02

Pos logit contributions
 amaz+0.059
 fingertips+0.057
joy+0.053
 aston+0.052
years+0.051
Neg logit contributions
 cost-0.035
 meant-0.034
 worse-0.034
 means-0.034
storm-0.034
Token?"
Token ID1701
Feature activation-6.82e-02

Pos logit contributions
storm+0.022
 meant+0.021
 worse+0.021
 means+0.021
rob+0.021
Neg logit contributions
 amaz-0.024
 fingertips-0.023
 hands-0.023
joy-0.021
 delight-0.020
Token Spot
Token ID15899
Feature activation-3.52e-02

Pos logit contributions
storm+0.138
ained+0.137
ock+0.134
 outage+0.131
 worse+0.125
Neg logit contributions
 hands-0.102
 fingertips-0.088
 delight-0.087
 ear-0.084
 amaz-0.084
Token asked
Token ID1965
Feature activation-3.45e-02

Pos logit contributions
storm+0.053
ained+0.048
 worse+0.047
rob+0.047
ock+0.046
Neg logit contributions
 amaz-0.067
 fingertips-0.067
 hands-0.066
joy-0.060
 delight-0.060
Token.
Token ID13
Feature activation-4.88e-02

Pos logit contributions
storm+0.065
 cost+0.059
 worse+0.058
rob+0.058
 means+0.058
Neg logit contributions
 amaz-0.053
 fingertips-0.053
 hands-0.052
joy-0.048
 aston-0.046
Token Fl
Token ID1610
Feature activation-1.86e-03

Pos logit contributions
ock+0.095
 worse+0.094
storm+0.093
 outage+0.092
 meant+0.087
Neg logit contributions
 hands-0.070
 fingertips-0.066
 amaz-0.064
joy-0.062
 delight-0.060
Tokenuffy
Token ID15352
Feature activation-2.54e-02

Pos logit contributions
storm+0.002
 worse+0.002
 means+0.002
 meant+0.002
ained+0.002
Neg logit contributions
 amaz-0.004
 fingertips-0.004
 approval-0.004
joy-0.004
 delight-0.004
Token about
Token ID546
Feature activation-1.14e-02

Pos logit contributions
storm+0.002
 cost+0.002
rob+0.002
 worse+0.002
 meant+0.002
Neg logit contributions
 amaz-0.002
 fingertips-0.002
 hands-0.002
 toes-0.002
joy-0.002
Token the
Token ID262
Feature activation-1.80e-02

Pos logit contributions
storm+0.018
 worse+0.017
 meant+0.017
 cost+0.017
 means+0.016
Neg logit contributions
 amaz-0.022
 fingertips-0.021
 hands-0.020
joy-0.019
 aston-0.019
Token flowers
Token ID12734
Feature activation-1.85e-02

Pos logit contributions
storm+0.026
 worse+0.026
 meant+0.026
 cost+0.025
 means+0.025
Neg logit contributions
 amaz-0.036
 fingertips-0.035
 hands-0.031
joy-0.031
 aston-0.031
Token,"
Token ID553
Feature activation-1.41e-02

Pos logit contributions
storm+0.040
rob+0.039
ained+0.039
 cost+0.038
 meant+0.038
Neg logit contributions
 amaz-0.020
 hands-0.020
 fingertips-0.019
 toes-0.017
joy-0.016
Token the
Token ID262
Feature activation-1.01e-02

Pos logit contributions
storm+0.030
 cost+0.028
 worse+0.028
 meant+0.028
 outage+0.027
Neg logit contributions
 amaz-0.020
 fingertips-0.019
 hands-0.018
joy-0.017
 aston-0.016
Token bee
Token ID20697
Feature activation-6.82e-02

Pos logit contributions
storm+0.013
 worse+0.013
 meant+0.013
 means+0.013
 cost+0.013
Neg logit contributions
 amaz-0.022
 fingertips-0.021
 hands-0.019
joy-0.019
 aston-0.019
Token says
Token ID1139
Feature activation+4.64e-03

Pos logit contributions
storm+0.128
ained+0.120
rob+0.116
 cost+0.114
 outage+0.114
Neg logit contributions
 amaz-0.107
 hands-0.104
 fingertips-0.093
 ear-0.091
 delight-0.091
Token.
Token ID13
Feature activation+3.95e-03

Pos logit contributions
 amaz+0.009
 fingertips+0.008
joy+0.007
 aston+0.007
 hands+0.007
Neg logit contributions
 worse-0.007
 meant-0.007
 means-0.007
storm-0.007
 cost-0.007
Token
Token ID198
Feature activation-1.41e-02

Pos logit contributions
 amaz+0.007
 fingertips+0.007
 hands+0.006
joy+0.006
 aston+0.006
Neg logit contributions
storm-0.007
 worse-0.006
 meant-0.006
 cost-0.006
 means-0.006
Token
Token ID198
Feature activation-1.00e-02

Pos logit contributions
storm+0.024
 worse+0.023
 meant+0.023
 cost+0.023
 means+0.023
Neg logit contributions
 amaz-0.025
 fingertips-0.024
 hands-0.022
joy-0.022
 aston-0.021
TokenThe
Token ID464
Feature activation-1.06e-02

Pos logit contributions
storm+0.016
 worse+0.015
 meant+0.015
 cost+0.015
 means+0.015
Neg logit contributions
 amaz-0.019
 fingertips-0.018
 hands-0.017
joy-0.017
 aston-0.017
Token warm
Token ID5814
Feature activation-3.48e-02

Pos logit contributions
storm+0.015
 worse+0.015
 meant+0.015
 means+0.015
 cost+0.015
Neg logit contributions
 amaz-0.017
 fingertips-0.016
 hands-0.015
joy-0.015
 aston-0.014
Token coat
Token ID13209
Feature activation-1.00e-02

Pos logit contributions
storm+0.061
 worse+0.061
 meant+0.059
 means+0.059
 cost+0.058
Neg logit contributions
 amaz-0.059
 fingertips-0.056
 hands-0.052
joy-0.050
 aston-0.049
Token to
Token ID284
Feature activation-2.13e-02

Pos logit contributions
storm+0.017
 worse+0.017
 means+0.017
 meant+0.017
 cost+0.016
Neg logit contributions
 amaz-0.017
 fingertips-0.017
 hands-0.015
joy-0.015
 aston-0.014
Token prevent
Token ID2948
Feature activation-3.30e-02

Pos logit contributions
storm+0.037
 worse+0.036
 means+0.035
 meant+0.035
 cost+0.034
Neg logit contributions
 amaz-0.037
 fingertips-0.036
 hands-0.033
 aston-0.031
 approval-0.031
Token the
Token ID262
Feature activation-3.92e-02

Pos logit contributions
storm+0.066
 outage+0.063
rob+0.062
guard+0.059
 cost+0.059
Neg logit contributions
 hands-0.051
 fingertips-0.050
 amaz-0.044
 delight-0.044
 toes-0.043
Token bitter
Token ID12922
Feature activation-6.81e-02

Pos logit contributions
rob+0.055
 means+0.051
ained+0.048
 meant+0.048
 worse+0.047
Neg logit contributions
 amaz-0.078
 fingertips-0.077
 hands-0.076
 toes-0.070
 delight-0.070
Token weather
Token ID6193
Feature activation-4.87e-02

Pos logit contributions
rob+0.120
storm+0.108
guard+0.104
igators+0.102
igator+0.098
Neg logit contributions
 hands-0.118
 toes-0.109
 fingertips-0.106
 eyes-0.105
 amaz-0.104
Token.
Token ID13
Feature activation-3.62e-02

Pos logit contributions
rob+0.110
storm+0.097
 means+0.097
 cost+0.092
 meant+0.092
Neg logit contributions
 hands-0.060
 fingertips-0.060
 amaz-0.059
 toes-0.050
 perfectly-0.047
Token It
Token ID632
Feature activation-2.16e-02

Pos logit contributions
storm+0.051
 worse+0.051
 meant+0.050
 means+0.050
 cost+0.049
Neg logit contributions
 amaz-0.074
 fingertips-0.071
 hands-0.064
joy-0.064
 aston-0.063
Token was
Token ID373
Feature activation-2.45e-02

Pos logit contributions
rob+0.042
 inconvenience+0.039
storm+0.039
ock+0.039
ained+0.039
Neg logit contributions
 amaz-0.031
 hands-0.029
 fingertips-0.029
 aston-0.025
 grinned-0.025
Token a
Token ID257
Feature activation-6.20e-03

Pos logit contributions
rob+0.048
storm+0.045
guard+0.044
 means+0.042
 outage+0.041
Neg logit contributions
 amaz-0.040
 fingertips-0.037
 hands-0.036
 delight-0.033
 perfectly-0.033
Token lady
Token ID10846
Feature activation-9.88e-03

Pos logit contributions
storm+0.050
 meant+0.049
 means+0.048
 worse+0.048
 cost+0.047
Neg logit contributions
 amaz-0.069
 fingertips-0.066
 hands-0.063
joy-0.061
 aston-0.058
Token.
Token ID13
Feature activation-4.79e-03

Pos logit contributions
storm+0.017
 worse+0.016
 meant+0.016
 means+0.016
 cost+0.016
Neg logit contributions
 amaz-0.018
 fingertips-0.017
 hands-0.015
joy-0.015
 aston-0.015
Token The
Token ID383
Feature activation-3.44e-02

Pos logit contributions
storm+0.008
 worse+0.008
 cost+0.008
 meant+0.008
 means+0.008
Neg logit contributions
 amaz-0.008
 fingertips-0.008
 hands-0.007
joy-0.007
 aston-0.007
Token lady
Token ID10846
Feature activation-2.19e-02

Pos logit contributions
storm+0.050
 worse+0.049
 means+0.048
 meant+0.048
 cost+0.047
Neg logit contributions
 amaz-0.070
 fingertips-0.066
 hands-0.063
joy-0.062
 aston-0.059
Token puts
Token ID7584
Feature activation-2.90e-02

Pos logit contributions
 means+0.032
 worse+0.031
 meant+0.031
 cost+0.030
storm+0.029
Neg logit contributions
 amaz-0.044
 fingertips-0.044
joy-0.040
 hands-0.038
 approval-0.038
Token the
Token ID262
Feature activation-6.80e-02

Pos logit contributions
storm+0.044
 worse+0.043
 meant+0.043
 means+0.042
 cost+0.042
Neg logit contributions
 amaz-0.057
 fingertips-0.054
 hands-0.050
joy-0.049
 aston-0.048
Token leash
Token ID40529
Feature activation-1.69e-02

Pos logit contributions
storm+0.140
bank+0.120
 means+0.118
 outage+0.118
 worse+0.117
Neg logit contributions
 hands-0.110
 amaz-0.096
 hand-0.096
 ear-0.091
 backpack-0.088
Token on
Token ID319
Feature activation+6.05e-04

Pos logit contributions
 worse+0.025
 meant+0.025
 means+0.024
storm+0.024
 cost+0.024
Neg logit contributions
 amaz-0.035
 fingertips-0.033
joy-0.030
 aston-0.029
 hands-0.029
Token the
Token ID262
Feature activation-4.64e-02

Pos logit contributions
 amaz+0.001
 fingertips+0.001
joy+0.001
 hands+0.001
 aston+0.001
Neg logit contributions
 worse-0.001
storm-0.001
 means-0.001
 meant-0.001
 cost-0.001
Token dog
Token ID3290
Feature activation-2.52e-02

Pos logit contributions
storm+0.074
 means+0.073
 meant+0.072
ained+0.070
 worse+0.069
Neg logit contributions
 amaz-0.085
 fingertips-0.084
 hands-0.083
 ear-0.076
joy-0.075
Token.
Token ID13
Feature activation-3.37e-03

Pos logit contributions
storm+0.041
 worse+0.041
 meant+0.040
 means+0.040
 cost+0.040
Neg logit contributions
 amaz-0.046
 fingertips-0.044
 hands-0.040
joy-0.039
 aston-0.038
Token is
Token ID318
Feature activation-4.34e-02

Pos logit contributions
storm+0.081
 worse+0.080
 cost+0.075
ock+0.074
rob+0.073
Neg logit contributions
 amaz-0.055
 hands-0.053
joy-0.050
 fingertips-0.049
 aston-0.047
Token sick
Token ID6639
Feature activation-5.17e-02

Pos logit contributions
storm+0.101
 cost+0.094
rob+0.094
 means+0.091
ock+0.089
Neg logit contributions
 hands-0.052
joy-0.050
 toes-0.045
 fingertips-0.045
 amaz-0.043
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
rob+0.125
storm+0.124
making+0.121
 cost+0.120
ained+0.115
Neg logit contributions
 hands-0.048
 fingertips-0.040
 toes-0.039
 amaz-0.037
 ear-0.035
Token He
Token ID679
Feature activation-4.82e-02

Pos logit contributions
storm+0.014
 worse+0.014
 meant+0.014
 cost+0.014
 means+0.013
Neg logit contributions
 amaz-0.022
 fingertips-0.021
 hands-0.020
joy-0.020
 aston-0.019
Token has
Token ID468
Feature activation-6.18e-02

Pos logit contributions
storm+0.092
rob+0.089
ock+0.088
 outage+0.084
ained+0.084
Neg logit contributions
 hands-0.073
 amaz-0.070
 fingertips-0.070
 ear-0.062
 aston-0.061
Token to
Token ID284
Feature activation-6.78e-02

Pos logit contributions
storm+0.136
rob+0.135
ock+0.128
 means+0.122
 outage+0.121
Neg logit contributions
 hands-0.088
 fingertips-0.076
 amaz-0.074
joy-0.071
 toes-0.068
Token stay
Token ID2652
Feature activation-3.49e-02

Pos logit contributions
storm+0.147
rob+0.145
 outage+0.138
 means+0.134
 meant+0.133
Neg logit contributions
 hands-0.093
 amaz-0.090
 fingertips-0.083
 ear-0.082
 delight-0.082
Token in
Token ID287
Feature activation-2.04e-02

Pos logit contributions
storm+0.070
rob+0.064
 means+0.063
 cost+0.063
 meant+0.061
Neg logit contributions
 hands-0.053
 amaz-0.051
 fingertips-0.049
joy-0.046
 toes-0.044
Token bed
Token ID3996
Feature activation-2.05e-02

Pos logit contributions
storm+0.039
 meant+0.036
 means+0.036
 cost+0.035
 worse+0.035
Neg logit contributions
 amaz-0.035
 hands-0.031
 fingertips-0.030
joy-0.030
 delight-0.028
Token.
Token ID13
Feature activation-3.33e-03

Pos logit contributions
storm+0.040
rob+0.040
 means+0.040
 meant+0.040
 cost+0.039
Neg logit contributions
 amaz-0.025
 fingertips-0.025
 hands-0.024
 toes-0.022
joy-0.022
Token He
Token ID679
Feature activation-4.55e-02

Pos logit contributions
 worse+0.005
storm+0.005
 meant+0.005
 means+0.005
 cost+0.004
Neg logit contributions
 amaz-0.007
 fingertips-0.007
 hands-0.006
joy-0.006
 aston-0.006
Token mom
Token ID1995
Feature activation-8.69e-03

Pos logit contributions
 amaz+0.004
 fingertips+0.004
joy+0.003
oor+0.003
 aston+0.003
Neg logit contributions
 worse-0.005
 hospital-0.005
 operation-0.004
 cost-0.004
 medicine-0.004
Tokenmy
Token ID1820
Feature activation-2.45e-03

Pos logit contributions
storm+0.012
 worse+0.012
 meant+0.012
 means+0.012
 cost+0.012
Neg logit contributions
 amaz-0.018
 fingertips-0.017
 hands-0.016
joy-0.016
 aston-0.015
Token explained
Token ID4893
Feature activation-1.83e-03

Pos logit contributions
storm+0.004
 worse+0.004
 meant+0.004
 means+0.004
 cost+0.003
Neg logit contributions
 amaz-0.005
 fingertips-0.005
 hands-0.004
joy-0.004
 aston-0.004
Token that
Token ID326
Feature activation-2.97e-02

Pos logit contributions
storm+0.003
rob+0.003
ained+0.003
 cost+0.003
 meant+0.003
Neg logit contributions
 amaz-0.003
 hands-0.003
 fingertips-0.003
joy-0.003
 delight-0.003
Token a
Token ID257
Feature activation-1.59e-02

Pos logit contributions
rob+0.056
storm+0.055
ock+0.049
ained+0.049
ty+0.048
Neg logit contributions
 amaz-0.047
 hands-0.045
 fingertips-0.042
 toes-0.038
joy-0.037
Token muscle
Token ID8280
Feature activation-6.74e-02

Pos logit contributions
storm+0.025
 worse+0.024
 meant+0.024
 means+0.024
rob+0.024
Neg logit contributions
 amaz-0.030
 fingertips-0.028
 hands-0.027
joy-0.026
 aston-0.025
Token is
Token ID318
Feature activation-4.29e-02

Pos logit contributions
rob+0.112
ained+0.110
storm+0.106
 worse+0.105
ock+0.105
Neg logit contributions
 hands-0.106
 amaz-0.101
 fingertips-0.100
 perfectly-0.086
 toes-0.085
Token a
Token ID257
Feature activation-2.29e-02

Pos logit contributions
rob+0.087
storm+0.081
guard+0.071
ock+0.070
ican+0.066
Neg logit contributions
 hands-0.061
 amaz-0.058
 perfectly-0.056
 fingertips-0.054
 ear-0.053
Token part
Token ID636
Feature activation-3.56e-02

Pos logit contributions
rob+0.037
ained+0.036
 means+0.036
 meant+0.036
storm+0.035
Neg logit contributions
 amaz-0.041
 fingertips-0.039
 hands-0.037
joy-0.035
 aston-0.034
Token of
Token ID286
Feature activation-2.95e-02

Pos logit contributions
rob+0.072
storm+0.067
 worse+0.065
ained+0.065
 cost+0.062
Neg logit contributions
 amaz-0.049
 hands-0.048
 fingertips-0.048
 toes-0.041
 perfectly-0.040
Token your
Token ID534
Feature activation-1.91e-02

Pos logit contributions
storm+0.051
rob+0.048
 worse+0.044
guard+0.044
ained+0.044
Neg logit contributions
 amaz-0.053
 hands-0.052
 fingertips-0.050
 pride-0.045
 delight-0.045
Token saw
Token ID2497
Feature activation-4.25e-02

Pos logit contributions
storm+0.044
 worse+0.044
 cost+0.042
 meant+0.041
 means+0.041
Neg logit contributions
 amaz-0.032
 fingertips-0.030
 hands-0.028
joy-0.027
 aston-0.026
Token a
Token ID257
Feature activation-7.44e-04

Pos logit contributions
storm+0.107
 outage+0.100
 cost+0.099
 affect+0.094
rob+0.091
Neg logit contributions
 hands-0.040
 amaz-0.037
joy-0.033
 eyes-0.031
 ear-0.031
Token foolish
Token ID19538
Feature activation-3.54e-02

Pos logit contributions
storm+0.001
 means+0.001
 worse+0.001
 meant+0.001
 cost+0.001
Neg logit contributions
 amaz-0.001
 fingertips-0.001
joy-0.001
 hands-0.001
 aston-0.001
Token cat
Token ID3797
Feature activation-2.80e-02

Pos logit contributions
ained+0.055
 worse+0.055
storm+0.054
rob+0.053
 means+0.053
Neg logit contributions
 amaz-0.063
 fingertips-0.060
joy-0.059
 hands-0.059
 ear-0.054
Token named
Token ID3706
Feature activation-8.26e-03

Pos logit contributions
storm+0.054
 cost+0.054
 meant+0.053
 means+0.053
 worse+0.053
Neg logit contributions
 amaz-0.036
 fingertips-0.035
 hands-0.034
joy-0.031
 ear-0.031
Token Mitt
Token ID16627
Feature activation-6.74e-02

Pos logit contributions
storm+0.015
 worse+0.014
rob+0.013
ained+0.013
 cost+0.013
Neg logit contributions
 amaz-0.013
 fingertips-0.013
 hands-0.013
joy-0.012
 delight-0.011
Tokenens
Token ID641
Feature activation-2.33e-02

Pos logit contributions
 outage+0.103
 cost+0.092
 worse+0.087
storm+0.086
guard+0.082
Neg logit contributions
joy-0.111
 hands-0.102
 fingertips-0.099
 goodness-0.093
 toes-0.088
Token.
Token ID13
Feature activation-3.21e-02

Pos logit contributions
storm+0.045
 worse+0.043
 cost+0.043
 meant+0.043
 means+0.043
Neg logit contributions
 amaz-0.034
 fingertips-0.033
 hands-0.031
joy-0.030
 aston-0.028
Token Mitt
Token ID16627
Feature activation-6.24e-02

Pos logit contributions
rob+0.061
guard+0.061
storm+0.060
ock+0.059
 worse+0.059
Neg logit contributions
 hands-0.048
 fingertips-0.047
 amaz-0.045
 delight-0.040
joy-0.040
Tokenens
Token ID641
Feature activation-3.41e-02

Pos logit contributions
 outage+0.093
 cost+0.085
guard+0.082
 worse+0.081
storm+0.079
Neg logit contributions
joy-0.107
 hands-0.098
 goodness-0.094
 fingertips-0.092
 amaz-0.086
Token liked
Token ID8288
Feature activation-3.90e-02

Pos logit contributions
storm+0.058
 worse+0.057
rob+0.056
ock+0.056
 outage+0.055
Neg logit contributions
 amaz-0.054
 fingertips-0.053
 hands-0.052
joy-0.047
 aston-0.046
Token They
Token ID1119
Feature activation-1.92e-02

Pos logit contributions
storm+0.002
 worse+0.002
 means+0.002
 meant+0.002
 cost+0.002
Neg logit contributions
 amaz-0.004
 fingertips-0.003
joy-0.003
 hands-0.003
 aston-0.003
Token say
Token ID910
Feature activation+3.43e-03

Pos logit contributions
 cost+0.029
 meant+0.028
 worse+0.027
 means+0.027
storm+0.026
Neg logit contributions
 fingertips-0.039
 amaz-0.038
joy-0.036
 hands-0.033
 aston-0.032
Token they
Token ID484
Feature activation-2.18e-02

Pos logit contributions
 amaz+0.007
 fingertips+0.007
 hands+0.006
joy+0.006
 aston+0.006
Neg logit contributions
storm-0.005
 worse-0.005
 meant-0.005
 cost-0.005
 means-0.005
Token hope
Token ID2911
Feature activation-2.16e-02

Pos logit contributions
 meant+0.034
 worse+0.033
 cost+0.033
 means+0.033
storm+0.030
Neg logit contributions
 amaz-0.044
 fingertips-0.043
joy-0.039
 aston-0.037
 hands-0.037
Token her
Token ID607
Feature activation-1.41e-02

Pos logit contributions
storm+0.035
rob+0.034
 means+0.033
ained+0.033
 meant+0.032
Neg logit contributions
 amaz-0.039
 fingertips-0.039
 hands-0.037
joy-0.035
 delight-0.034
Token leg
Token ID1232
Feature activation-6.73e-02

Pos logit contributions
storm+0.022
 worse+0.022
 meant+0.021
 means+0.021
 cost+0.021
Neg logit contributions
 amaz-0.027
 fingertips-0.026
 hands-0.023
joy-0.023
 aston-0.023
Token will
Token ID481
Feature activation-2.20e-02

Pos logit contributions
storm+0.127
rob+0.122
 meant+0.113
 cost+0.110
ained+0.109
Neg logit contributions
 amaz-0.102
 hands-0.097
 fingertips-0.092
ushes-0.088
 approval-0.085
Token heal
Token ID12035
Feature activation-2.54e-02

Pos logit contributions
rob+0.039
storm+0.038
 means+0.038
 meant+0.036
 worse+0.036
Neg logit contributions
 amaz-0.036
 fingertips-0.034
 hands-0.031
 approval-0.030
 aston-0.030
Token soon
Token ID2582
Feature activation-1.92e-02

Pos logit contributions
storm+0.048
 cost+0.046
 meant+0.045
 means+0.044
ained+0.044
Neg logit contributions
 amaz-0.039
 fingertips-0.038
 hands-0.036
joy-0.033
 approval-0.033
Token.
Token ID13
Feature activation-2.61e-03

Pos logit contributions
rob+0.040
storm+0.040
ained+0.038
 worse+0.038
 cost+0.037
Neg logit contributions
 amaz-0.024
 fingertips-0.024
 hands-0.023
joy-0.020
 approval-0.019
Token
Token ID198
Feature activation-8.04e-03

Pos logit contributions
storm+0.004
 worse+0.004
 meant+0.004
 means+0.004
 cost+0.004
Neg logit contributions
 amaz-0.005
 fingertips-0.005
 hands-0.004
joy-0.004
 aston-0.004
TokenWow
Token ID22017
Feature activation+6.11e-02

Pos logit contributions
 amaz+0.066
 fingertips+0.064
 hands+0.058
joy+0.058
 aston+0.056
Neg logit contributions
storm-0.051
 worse-0.050
 meant-0.049
 means-0.049
 cost-0.049
Token!"
Token ID2474
Feature activation+1.88e-03

Pos logit contributions
 amaz+0.155
 fingertips+0.147
years+0.145
 fascination+0.137
 aston+0.136
Neg logit contributions
 means-0.064
 happen-0.060
ty-0.060
 worse-0.059
 wrong-0.056
Token Tim
Token ID5045
Feature activation-4.88e-03

Pos logit contributions
 amaz+0.004
 fingertips+0.003
joy+0.003
 hands+0.003
 aston+0.003
Neg logit contributions
 worse-0.003
storm-0.003
 meant-0.003
 means-0.003
 cost-0.003
Token said
Token ID531
Feature activation+8.66e-03

Pos logit contributions
 meant+0.006
 means+0.005
 worse+0.005
 cost+0.005
storm+0.005
Neg logit contributions
 amaz-0.011
 fingertips-0.011
joy-0.010
 aston-0.010
 hands-0.010
Token.
Token ID13
Feature activation+2.32e-02

Pos logit contributions
 amaz+0.017
 fingertips+0.016
joy+0.015
 aston+0.014
 hands+0.014
Neg logit contributions
 worse-0.013
 meant-0.013
 means-0.013
 cost-0.012
storm-0.012
Token "
Token ID366
Feature activation+5.38e-02

Pos logit contributions
 amaz+0.038
 fingertips+0.036
 hands+0.032
joy+0.031
 aston+0.031
Neg logit contributions
storm-0.042
 worse-0.042
 meant-0.042
 means-0.042
 cost-0.041
TokenLook
Token ID8567
Feature activation+7.13e-02

Pos logit contributions
 amaz+0.119
 fingertips+0.116
 hands+0.108
joy+0.107
 aston+0.102
Neg logit contributions
storm-0.067
 worse-0.066
 meant-0.065
 means-0.065
 cost-0.064
Token at
Token ID379
Feature activation+3.48e-02

Pos logit contributions
 amaz+0.170
 fingertips+0.164
 unison+0.159
joy+0.158
 fascination+0.157
Neg logit contributions
 worse-0.073
 means-0.072
 wrong-0.072
ty-0.071
 happen-0.066
Token this
Token ID428
Feature activation+3.03e-02

Pos logit contributions
 amaz+0.073
 fingertips+0.071
 hands+0.065
joy+0.064
 aston+0.063
Neg logit contributions
storm-0.048
 worse-0.047
 meant-0.046
 means-0.046
 cost-0.046
Token sword
Token ID8429
Feature activation+2.93e-02

Pos logit contributions
 amaz+0.053
 fingertips+0.053
joy+0.047
 hands+0.046
 aston+0.046
Neg logit contributions
 means-0.051
 meant-0.051
 worse-0.051
 cost-0.051
storm-0.050
Token!
Token ID0
Feature activation+5.29e-02

Pos logit contributions
 amaz+0.046
 fingertips+0.044
joy+0.039
 hands+0.038
 aston+0.038
Neg logit contributions
storm-0.056
 worse-0.056
 meant-0.055
 means-0.055
 cost-0.055
TokenMom
Token ID29252
Feature activation+3.10e-03

Pos logit contributions
 amaz+0.002
 fingertips+0.001
 hands+0.001
joy+0.001
 aston+0.001
Neg logit contributions
storm-0.001
 worse-0.001
 meant-0.001
 means-0.001
 cost-0.001
Tokenmy
Token ID1820
Feature activation+9.90e-03

Pos logit contributions
 amaz+0.008
 fingertips+0.008
joy+0.007
 aston+0.007
 hands+0.007
Neg logit contributions
 meant-0.003
 means-0.003
ty-0.002
ock-0.002
storm-0.002
Token said
Token ID531
Feature activation+1.36e-02

Pos logit contributions
 amaz+0.021
 fingertips+0.021
joy+0.018
ushes+0.018
 hands+0.018
Neg logit contributions
 meant-0.014
 means-0.013
storm-0.013
 worse-0.013
 cost-0.013
Token,
Token ID11
Feature activation+2.06e-02

Pos logit contributions
 amaz+0.026
 fingertips+0.025
 aston+0.021
ushes+0.021
 approval+0.021
Neg logit contributions
 worse-0.022
 means-0.022
 meant-0.021
storm-0.021
 cost-0.020
Token "
Token ID366
Feature activation+1.16e-02

Pos logit contributions
 amaz+0.038
 fingertips+0.036
 aston+0.032
 hands+0.032
ushes+0.031
Neg logit contributions
storm-0.034
 meant-0.033
 means-0.033
 worse-0.033
 cost-0.032
TokenYes
Token ID5297
Feature activation+5.45e-02

Pos logit contributions
 amaz+0.020
 fingertips+0.018
joy+0.017
 approval+0.017
 hands+0.016
Neg logit contributions
 worse-0.021
 meant-0.020
 means-0.020
storm-0.020
 cost-0.020
Token,
Token ID11
Feature activation+2.80e-02

Pos logit contributions
 amaz+0.122
 fingertips+0.111
ushes+0.106
 aston+0.103
joy+0.103
Neg logit contributions
 worse-0.075
storm-0.068
 means-0.066
ty-0.065
 meant-0.064
Token Lily
Token ID20037
Feature activation+1.46e-02

Pos logit contributions
 amaz+0.056
 fingertips+0.054
 hands+0.050
joy+0.049
 aston+0.048
Neg logit contributions
storm-0.041
 worse-0.040
 meant-0.039
 means-0.039
 cost-0.039
Token.
Token ID13
Feature activation+2.30e-02

Pos logit contributions
 amaz+0.026
 fingertips+0.025
 aston+0.022
joy+0.021
 approval+0.021
Neg logit contributions
 meant-0.026
 means-0.026
storm-0.025
 worse-0.025
 cost-0.025
Token Sw
Token ID2451
Feature activation+7.97e-03

Pos logit contributions
 amaz+0.045
 fingertips+0.044
joy+0.039
 hands+0.038
 aston+0.038
Neg logit contributions
 means-0.036
storm-0.034
 meant-0.033
 cost-0.032
 worse-0.031
Tokenans
Token ID504
Feature activation+4.50e-04

Pos logit contributions
 amaz+0.017
 fingertips+0.016
 hands+0.015
joy+0.015
 aston+0.014
Neg logit contributions
storm-0.011
 worse-0.011
 meant-0.011
 means-0.010
 cost-0.010
Token and
Token ID290
Feature activation-6.44e-03

Pos logit contributions
 fingertips+0.049
 amaz+0.048
 hands+0.043
 aston+0.042
 approval+0.042
Neg logit contributions
 meant-0.033
 worse-0.033
 means-0.031
storm-0.029
 cost-0.028
Token said
Token ID531
Feature activation+1.85e-02

Pos logit contributions
storm+0.008
 worse+0.008
 meant+0.008
 means+0.008
 cost+0.008
Neg logit contributions
 amaz-0.014
 fingertips-0.013
 hands-0.012
joy-0.012
 aston-0.012
Token,
Token ID11
Feature activation+1.77e-02

Pos logit contributions
 amaz+0.030
 fingertips+0.029
 hands+0.025
joy+0.024
 aston+0.024
Neg logit contributions
 worse-0.035
 means-0.034
 meant-0.034
storm-0.034
 cost-0.033
Token "
Token ID366
Feature activation+1.76e-02

Pos logit contributions
 amaz+0.028
 fingertips+0.026
 hands+0.023
joy+0.023
 aston+0.022
Neg logit contributions
storm-0.034
 worse-0.033
 meant-0.033
 means-0.033
 cost-0.032
TokenThank
Token ID10449
Feature activation+2.03e-02

Pos logit contributions
 amaz+0.035
 fingertips+0.033
joy+0.031
 hands+0.031
 approval+0.030
Neg logit contributions
storm-0.026
 worse-0.026
 cost-0.024
 meant-0.024
 means-0.023
Token you
Token ID345
Feature activation+5.03e-02

Pos logit contributions
 amaz+0.053
 fingertips+0.051
joy+0.047
 aston+0.046
 hands+0.046
Neg logit contributions
 means-0.019
 worse-0.018
 meant-0.018
ock-0.017
guard-0.017
Token for
Token ID329
Feature activation+1.32e-02

Pos logit contributions
 amaz+0.111
 fingertips+0.109
 aston+0.096
ushes+0.092
joy+0.091
Neg logit contributions
 worse-0.067
 meant-0.063
ty-0.063
 means-0.063
ock-0.060
Token listening
Token ID8680
Feature activation+9.95e-04

Pos logit contributions
 amaz+0.028
 fingertips+0.027
 hands+0.025
joy+0.025
 aston+0.024
Neg logit contributions
 worse-0.018
storm-0.018
 meant-0.018
 means-0.017
 cost-0.017
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
 amaz+0.002
 fingertips+0.002
joy+0.002
 hands+0.002
 aston+0.002
Neg logit contributions
 worse-0.002
storm-0.002
 meant-0.002
 means-0.002
 cost-0.001
Token Do
Token ID2141
Feature activation-5.10e-04

Pos logit contributions
 amaz+0.053
 fingertips+0.052
joy+0.047
 hands+0.046
 aston+0.045
Neg logit contributions
 means-0.038
 meant-0.037
storm-0.036
 cost-0.035
 worse-0.034
Token you
Token ID345
Feature activation+1.44e-02

Pos logit contributions
storm+0.001
 worse+0.001
 meant+0.001
 cost+0.001
 means+0.001
Neg logit contributions
 amaz-0.001
 fingertips-0.001
 hands-0.001
joy-0.001
 aston-0.001
Token,
Token ID11
Feature activation-1.01e-02

Pos logit contributions
 worse+0.014
 meant+0.014
 means+0.014
storm+0.014
 cost+0.014
Neg logit contributions
 amaz-0.023
 fingertips-0.022
joy-0.020
 hands-0.020
 aston-0.020
Token she
Token ID673
Feature activation+1.05e-02

Pos logit contributions
storm+0.013
 worse+0.013
 meant+0.013
 means+0.012
 cost+0.012
Neg logit contributions
 amaz-0.022
 fingertips-0.021
 hands-0.019
joy-0.019
 aston-0.019
Token slowly
Token ID6364
Feature activation+3.48e-03

Pos logit contributions
 amaz+0.020
 fingertips+0.020
joy+0.019
 approval+0.017
 hands+0.017
Neg logit contributions
 meant-0.016
 means-0.016
 worse-0.016
 cost-0.015
storm-0.015
Token peek
Token ID27185
Feature activation+5.22e-02

Pos logit contributions
 amaz+0.006
 fingertips+0.006
 hands+0.005
joy+0.005
 aston+0.005
Neg logit contributions
storm-0.006
 worse-0.006
 meant-0.006
 means-0.006
 cost-0.006
Tokened
Token ID276
Feature activation+3.11e-02

Pos logit contributions
 fingertips+0.098
 amaz+0.098
 majesty+0.083
joy+0.082
 aston+0.082
Neg logit contributions
 worse-0.083
 meant-0.078
 means-0.077
ty-0.074
ock-0.070
Token out
Token ID503
Feature activation+4.91e-02

Pos logit contributions
 fingertips+0.064
 amaz+0.061
 majesty+0.059
oor+0.057
years+0.057
Neg logit contributions
 worse-0.042
 meant-0.039
 means-0.038
ty-0.034
storm-0.033
Token from
Token ID422
Feature activation+2.49e-02

Pos logit contributions
 fingertips+0.111
 amaz+0.109
 Majesty+0.104
 majesty+0.103
joy+0.101
Neg logit contributions
 worse-0.056
 meant-0.053
 means-0.050
 cost-0.044
ock-0.043
Token under
Token ID739
Feature activation+3.99e-02

Pos logit contributions
 amaz+0.045
 fingertips+0.044
 hands+0.039
joy+0.039
 aston+0.038
Neg logit contributions
 worse-0.040
 meant-0.039
storm-0.039
 means-0.039
 cost-0.038
Token the
Token ID262
Feature activation+5.51e-03

Pos logit contributions
 amaz+0.090
 fingertips+0.088
ushes+0.082
maid+0.082
oor+0.081
Neg logit contributions
 worse-0.041
 meant-0.039
storm-0.039
 means-0.038
 surgery-0.037
Token sheet
Token ID9629
Feature activation+7.00e-03

Pos logit contributions
 amaz+0.009
 fingertips+0.009
 hands+0.008
joy+0.008
 aston+0.008
Neg logit contributions
 meant-0.010
storm-0.010
 means-0.010
 worse-0.010
 cost-0.009
Token and
Token ID290
Feature activation+8.89e-03

Pos logit contributions
 amaz+0.011
 fingertips+0.010
 hands+0.009
joy+0.009
 delight+0.008
Neg logit contributions
 worse-0.013
 means-0.013
storm-0.013
 meant-0.013
 cost-0.013
Token
Token ID6184
Feature activation+2.26e-02

Pos logit contributions
 fingertips+0.016
 amaz+0.016
 hands+0.014
 aston+0.014
 approval+0.014
Neg logit contributions
 means-0.009
 meant-0.009
storm-0.008
 worse-0.008
 cost-0.008
Token
Token ID95
Feature activation+4.14e-03

Pos logit contributions
 amaz+0.042
 fingertips+0.040
 hands+0.037
joy+0.035
 aston+0.034
Neg logit contributions
storm-0.038
 means-0.036
 cost-0.036
 worse-0.035
 meant-0.035
Token
Token ID26391
Feature activation+8.75e-03

Pos logit contributions
 amaz+0.009
 fingertips+0.008
 hands+0.008
joy+0.007
 aston+0.007
Neg logit contributions
storm-0.006
 worse-0.006
 meant-0.006
 means-0.006
 cost-0.006
Token
Token ID129
Feature activation-7.87e-03

Pos logit contributions
 amaz+0.017
 approval+0.017
 fingertips+0.016
years+0.015
 aston+0.015
Neg logit contributions
storm-0.013
 means-0.012
 meant-0.012
ained-0.011
rob-0.011
Token
Token ID241
Feature activation-5.77e-03

Pos logit contributions
 worse+0.014
storm+0.014
 meant+0.014
 means+0.013
 cost+0.013
Neg logit contributions
 amaz-0.013
 fingertips-0.013
 hands-0.011
joy-0.011
 aston-0.011
TokenYes
Token ID5297
Feature activation+6.15e-02

Pos logit contributions
 worse+0.009
storm+0.009
 cost+0.009
 means+0.009
 meant+0.009
Neg logit contributions
 amaz-0.011
 fingertips-0.010
joy-0.010
 hands-0.009
 aston-0.009
Token!
Token ID0
Feature activation+2.24e-02

Pos logit contributions
 amaz+0.110
 fingertips+0.101
 aston+0.092
joy+0.091
years+0.088
Neg logit contributions
 worse-0.107
 meant-0.106
 means-0.105
storm-0.103
 cost-0.102
Token That
Token ID1320
Feature activation+1.46e-02

Pos logit contributions
 amaz+0.041
 fingertips+0.040
 hands+0.035
joy+0.035
 aston+0.035
Neg logit contributions
 means-0.036
 meant-0.035
storm-0.034
 worse-0.034
 cost-0.034
Token is
Token ID318
Feature activation+7.47e-03

Pos logit contributions
 amaz+0.027
 fingertips+0.024
 hands+0.023
 delight+0.021
 aston+0.021
Neg logit contributions
storm-0.025
 worse-0.024
ained-0.023
rob-0.023
guard-0.022
Token a
Token ID257
Feature activation+2.04e-02

Pos logit contributions
 amaz+0.015
 fingertips+0.014
 hands+0.013
joy+0.013
 aston+0.012
Neg logit contributions
 worse-0.011
 meant-0.011
storm-0.011
 means-0.011
 cost-0.011
Token very
Token ID845
Feature activation-4.88e-04

Pos logit contributions
 fingertips+0.040
 amaz+0.040
 hands+0.036
ushes+0.036
joy+0.035
Neg logit contributions
 cost-0.029
storm-0.029
 worse-0.029
 meant-0.027
 operation-0.026
Token,
Token ID11
Feature activation+3.38e-02

Pos logit contributions
 fingertips+0.075
 amaz+0.074
years+0.066
 approval+0.066
 pride+0.065
Neg logit contributions
 means-0.058
 meant-0.056
 worse-0.053
storm-0.051
ty-0.049
Token fairy
Token ID25607
Feature activation+7.98e-03

Pos logit contributions
 amaz+0.067
 fingertips+0.066
joy+0.061
 hands+0.059
 aston+0.058
Neg logit contributions
 meant-0.050
 worse-0.049
storm-0.049
 means-0.049
 cost-0.048
Token,"
Token ID553
Feature activation-1.48e-02

Pos logit contributions
 amaz+0.014
 fingertips+0.013
 hands+0.012
joy+0.012
 aston+0.012
Neg logit contributions
storm-0.014
 worse-0.014
 meant-0.013
 means-0.013
 cost-0.013
Token Anna
Token ID11735
Feature activation-1.11e-02

Pos logit contributions
storm+0.024
 worse+0.022
ained+0.022
 cost+0.021
 meant+0.021
Neg logit contributions
 amaz-0.029
 fingertips-0.027
 hands-0.026
 aston-0.024
 delight-0.024
Token said
Token ID531
Feature activation+3.37e-03

Pos logit contributions
 meant+0.012
 means+0.012
 worse+0.011
storm+0.011
 cost+0.011
Neg logit contributions
 amaz-0.027
 fingertips-0.026
joy-0.024
 hands-0.024
 aston-0.023
Token.
Token ID13
Feature activation+2.79e-03

Pos logit contributions
 amaz+0.007
 fingertips+0.007
joy+0.006
 aston+0.006
ushes+0.006
Neg logit contributions
 worse-0.005
 meant-0.005
 means-0.004
 cost-0.004
storm-0.004
Token "
Token ID366
Feature activation+2.92e-02

Pos logit contributions
 amaz+0.005
 fingertips+0.005
joy+0.004
 hands+0.004
 aston+0.004
Neg logit contributions
 meant-0.005
 means-0.005
 worse-0.005
storm-0.005
 cost-0.005
TokenDo
Token ID5211
Feature activation-6.69e-03

Pos logit contributions
 amaz+0.066
 fingertips+0.064
 hands+0.060
joy+0.058
 aston+0.057
Neg logit contributions
storm-0.035
 worse-0.034
 meant-0.033
 means-0.033
 cost-0.033
Token you
Token ID345
Feature activation+2.67e-02

Pos logit contributions
storm+0.009
 worse+0.009
 meant+0.009
 means+0.008
 cost+0.008
Neg logit contributions
 amaz-0.014
 fingertips-0.014
 hands-0.013
joy-0.013
 aston-0.012
Token like
Token ID588
Feature activation-7.43e-03

Pos logit contributions
 amaz+0.061
 fingertips+0.058
joy+0.057
 hands+0.052
ushes+0.052
Neg logit contributions
 meant-0.034
 cost-0.033
ock-0.031
 worse-0.031
 means-0.030
Token your
Token ID534
Feature activation+9.33e-03

Pos logit contributions
storm+0.011
 worse+0.011
 meant+0.011
 cost+0.011
 means+0.011
Neg logit contributions
 amaz-0.015
 fingertips-0.014
 hands-0.013
joy-0.013
 aston-0.012
Token you
Token ID345
Feature activation+5.35e-02

Pos logit contributions
 amaz+0.062
 fingertips+0.059
joy+0.055
 aston+0.054
 hands+0.053
Neg logit contributions
 means-0.020
 worse-0.020
storm-0.019
 meant-0.018
guard-0.018
Token,
Token ID11
Feature activation+3.36e-02

Pos logit contributions
 amaz+0.126
 fingertips+0.117
 aston+0.110
joy+0.105
 fascination+0.103
Neg logit contributions
 worse-0.065
ty-0.064
 meant-0.060
 means-0.058
 chip-0.058
Token Mia
Token ID32189
Feature activation+2.38e-02

Pos logit contributions
 fingertips+0.060
 amaz+0.060
joy+0.054
 aston+0.052
 hands+0.050
Neg logit contributions
 meant-0.054
 means-0.054
 worse-0.054
 cost-0.054
storm-0.052
Token,
Token ID11
Feature activation+2.20e-02

Pos logit contributions
 amaz+0.042
 fingertips+0.040
 aston+0.034
joy+0.034
 hands+0.033
Neg logit contributions
 meant-0.043
 means-0.042
storm-0.042
 worse-0.042
ty-0.041
Token for
Token ID329
Feature activation+1.76e-02

Pos logit contributions
 fingertips+0.047
 amaz+0.047
joy+0.043
 aston+0.042
ushes+0.041
Neg logit contributions
 means-0.027
 cost-0.027
storm-0.027
 meant-0.027
 worse-0.026
Token helping
Token ID5742
Feature activation+1.23e-02

Pos logit contributions
 fingertips+0.038
 amaz+0.037
joy+0.033
 hands+0.033
 aston+0.033
Neg logit contributions
 worse-0.024
 meant-0.023
 cost-0.023
storm-0.022
 means-0.022
Token me
Token ID502
Feature activation+2.43e-02

Pos logit contributions
 amaz+0.025
 fingertips+0.024
joy+0.022
 hands+0.022
 aston+0.021
Neg logit contributions
 worse-0.018
storm-0.018
 meant-0.018
 means-0.017
 cost-0.017
Token find
Token ID1064
Feature activation+2.86e-02

Pos logit contributions
 amaz+0.045
 fingertips+0.042
joy+0.040
 aston+0.038
 hands+0.037
Neg logit contributions
 worse-0.041
 cost-0.039
storm-0.039
 meant-0.039
 means-0.038
Token my
Token ID616
Feature activation+2.01e-02

Pos logit contributions
 fingertips+0.072
 amaz+0.072
joy+0.066
 aston+0.064
 unison+0.062
Neg logit contributions
 worse-0.029
 means-0.028
ty-0.026
 medicine-0.025
ock-0.024
Token own
Token ID898
Feature activation+2.09e-02

Pos logit contributions
 amaz+0.044
 fingertips+0.043
years+0.042
ushes+0.041
 unison+0.040
Neg logit contributions
 cost-0.023
 worse-0.022
 meant-0.022
 means-0.020
 hurt-0.020
Token special
Token ID2041
Feature activation+8.04e-03

Pos logit contributions
 amaz+0.047
 fingertips+0.046
years+0.044
ushes+0.043
 aston+0.042
Neg logit contributions
 cost-0.024
 meant-0.023
 worse-0.022
 means-0.022
 happen-0.020
Token digs
Token ID46561
Feature activation+2.06e-02

Pos logit contributions
 worse+0.014
 means+0.014
 meant+0.014
storm+0.014
 cost+0.014
Neg logit contributions
 amaz-0.017
 fingertips-0.016
joy-0.015
 hands-0.014
 aston-0.014
Token it
Token ID340
Feature activation+1.03e-02

Pos logit contributions
 fingertips+0.050
 amaz+0.048
 majesty+0.045
 Majesty+0.045
years+0.044
Neg logit contributions
 worse-0.024
 means-0.023
 meant-0.022
 cost-0.019
ty-0.019
Token out
Token ID503
Feature activation+3.61e-02

Pos logit contributions
 amaz+0.022
 fingertips+0.022
joy+0.020
years+0.019
 aston+0.019
Neg logit contributions
 worse-0.014
 cost-0.013
 means-0.013
 meant-0.013
storm-0.012
Token.
Token ID13
Feature activation-4.28e-03

Pos logit contributions
 amaz+0.079
 fingertips+0.079
joy+0.072
 majesty+0.070
 Majesty+0.070
Neg logit contributions
 worse-0.048
 meant-0.043
 means-0.042
ock-0.037
 cost-0.037
Token He
Token ID679
Feature activation-1.04e-03

Pos logit contributions
 means+0.005
 worse+0.005
 meant+0.005
storm+0.005
 cost+0.005
Neg logit contributions
 amaz-0.010
 fingertips-0.009
 hands-0.009
joy-0.009
 aston-0.009
Token wants
Token ID3382
Feature activation-8.93e-04

Pos logit contributions
 means+0.002
 cost+0.002
 worse+0.002
 meant+0.001
storm+0.001
Neg logit contributions
 fingertips-0.002
 amaz-0.002
joy-0.002
 approval-0.002
 hands-0.002
Token to
Token ID284
Feature activation-3.05e-03

Pos logit contributions
 worse+0.001
 meant+0.001
 means+0.001
 cost+0.001
storm+0.001
Neg logit contributions
 amaz-0.002
 fingertips-0.002
joy-0.002
 aston-0.002
 hands-0.002
Token play
Token ID711
Feature activation+3.52e-02

Pos logit contributions
 cost+0.005
 worse+0.005
storm+0.005
 meant+0.005
 means+0.005
Neg logit contributions
 amaz-0.006
 fingertips-0.005
joy-0.005
 hands-0.005
 aston-0.005
Token with
Token ID351
Feature activation+2.02e-02

Pos logit contributions
 amaz+0.067
 fingertips+0.067
 hands+0.059
joy+0.058
 aston+0.057
Neg logit contributions
 worse-0.058
 meant-0.053
 means-0.053
 cost-0.053
storm-0.053
Token it
Token ID340
Feature activation+1.92e-02

Pos logit contributions
 amaz+0.049
 fingertips+0.046
joy+0.044
 aston+0.043
oor+0.042
Neg logit contributions
 worse-0.023
 means-0.021
 meant-0.021
 cost-0.020
ty-0.019
Token.
Token ID13
Feature activation-7.45e-03

Pos logit contributions
 amaz+0.037
 fingertips+0.036
joy+0.032
 hands+0.032
 aston+0.031
Neg logit contributions
 worse-0.031
 meant-0.030
 means-0.029
storm-0.029
 cost-0.029
Token Teddy
Token ID29345
Feature activation+2.79e-03

Pos logit contributions
 amaz+0.030
 fingertips+0.029
 aston+0.027
joy+0.026
 hands+0.026
Neg logit contributions
 worse-0.015
 meant-0.014
 means-0.014
 cost-0.013
storm-0.013
Token's
Token ID338
Feature activation+2.94e-03

Pos logit contributions
 amaz+0.006
 fingertips+0.005
joy+0.005
 aston+0.005
 hands+0.005
Neg logit contributions
 worse-0.004
storm-0.004
 meant-0.004
 means-0.004
 cost-0.004
Token neck
Token ID7393
Feature activation+7.63e-03

Pos logit contributions
 amaz+0.005
 fingertips+0.004
years+0.004
 aston+0.004
joy+0.004
Neg logit contributions
 medicine-0.005
 worse-0.005
 cost-0.005
 ger-0.005
 oil-0.005
Token and
Token ID290
Feature activation+2.21e-03

Pos logit contributions
 amaz+0.013
 fingertips+0.013
 hands+0.011
joy+0.011
 aston+0.011
Neg logit contributions
storm-0.013
 worse-0.013
 meant-0.013
 means-0.013
 cost-0.013
Token hugged
Token ID41130
Feature activation+9.85e-03

Pos logit contributions
 amaz+0.004
 fingertips+0.004
joy+0.004
 aston+0.004
 approval+0.003
Neg logit contributions
 meant-0.003
 worse-0.003
 means-0.003
 cost-0.003
storm-0.003
Token him
Token ID683
Feature activation+2.99e-02

Pos logit contributions
 amaz+0.022
 fingertips+0.021
 aston+0.020
joy+0.019
 hands+0.019
Neg logit contributions
 worse-0.013
 meant-0.012
 means-0.012
 cost-0.011
storm-0.011
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
 amaz+0.080
 fingertips+0.073
joy+0.068
 aston+0.068
 majesty+0.065
Neg logit contributions
 worse-0.032
ty-0.026
 cost-0.025
 meant-0.025
 medicine-0.025
Token She
Token ID1375
Feature activation+2.27e-03

Pos logit contributions
 amaz+0.028
 fingertips+0.027
 hands+0.025
joy+0.024
 aston+0.024
Neg logit contributions
 worse-0.015
 meant-0.015
 means-0.015
storm-0.015
 cost-0.014
Token said
Token ID531
Feature activation+1.28e-02

Pos logit contributions
 amaz+0.005
 fingertips+0.004
joy+0.004
 hands+0.004
 aston+0.004
Neg logit contributions
 meant-0.003
 worse-0.003
 cost-0.003
storm-0.003
 means-0.003
Token,
Token ID11
Feature activation+2.04e-02

Pos logit contributions
 amaz+0.023
 fingertips+0.022
 hands+0.019
joy+0.019
 aston+0.019
Neg logit contributions
 worse-0.022
 means-0.022
 meant-0.022
storm-0.021
 cost-0.021
Token "
Token ID366
Feature activation+1.12e-02

Pos logit contributions
 amaz+0.036
 fingertips+0.035
 hands+0.031
 aston+0.031
joy+0.030
Neg logit contributions
storm-0.034
 meant-0.034
 means-0.034
 worse-0.033
 cost-0.032
Token."
Token ID526
Feature activation-4.47e-03

Pos logit contributions
 amaz+0.014
 fingertips+0.013
joy+0.013
 aston+0.011
 hands+0.011
Neg logit contributions
 cost-0.010
 means-0.009
 meant-0.009
 worse-0.009
storm-0.009
Token She
Token ID1375
Feature activation+1.34e-02

Pos logit contributions
 means+0.006
 meant+0.005
storm+0.005
bank+0.005
 cost+0.005
Neg logit contributions
 amaz-0.010
 fingertips-0.009
 aston-0.008
ushes-0.008
 approval-0.008
Token grabbed
Token ID13176
Feature activation+1.44e-02

Pos logit contributions
 amaz+0.029
 fingertips+0.028
joy+0.026
 approval+0.025
 aston+0.024
Neg logit contributions
 cost-0.019
 meant-0.019
 means-0.018
 worse-0.017
storm-0.016
Token the
Token ID262
Feature activation+2.02e-02

Pos logit contributions
 amaz+0.032
 fingertips+0.031
joy+0.028
 aston+0.028
 majesty+0.027
Neg logit contributions
 worse-0.017
 means-0.017
 meant-0.017
 cost-0.016
storm-0.015
Token last
Token ID938
Feature activation+1.36e-02

Pos logit contributions
 fingertips+0.040
 amaz+0.040
joy+0.036
years+0.036
 aston+0.035
Neg logit contributions
 cost-0.030
 worse-0.029
 oil-0.029
 meant-0.028
 means-0.027
Token box
Token ID3091
Feature activation+2.25e-02

Pos logit contributions
 amaz+0.025
 fingertips+0.024
joy+0.022
 aston+0.021
 hands+0.021
Neg logit contributions
 worse-0.022
 cost-0.021
storm-0.021
 meant-0.021
 means-0.021
Token with
Token ID351
Feature activation+2.90e-02

Pos logit contributions
 amaz+0.044
 fingertips+0.043
joy+0.039
 hands+0.038
 aston+0.038
Neg logit contributions
 worse-0.033
 meant-0.032
storm-0.032
 cost-0.032
 means-0.031
Token both
Token ID1111
Feature activation+3.23e-02

Pos logit contributions
 amaz+0.064
 majesty+0.063
 fingertips+0.063
 Majesty+0.062
joy+0.061
Neg logit contributions
 meant-0.030
 worse-0.029
 cost-0.026
 oil-0.025
 means-0.025
Token of
Token ID286
Feature activation+1.71e-02

Pos logit contributions
 fingertips+0.033
 amaz+0.032
joy+0.031
maid+0.029
years+0.028
Neg logit contributions
 meant-0.073
 means-0.073
 worse-0.070
 oil-0.069
 cost-0.068
Token her
Token ID607
Feature activation+3.50e-02

Pos logit contributions
 amaz+0.039
 fingertips+0.038
joy+0.035
 aston+0.035
maid+0.034
Neg logit contributions
 meant-0.018
 worse-0.018
 means-0.017
 oil-0.017
 cost-0.017
Token hands
Token ID2832
Feature activation+2.53e-02

Pos logit contributions
 amaz+0.059
 fingertips+0.053
oor+0.050
 aston+0.049
joy+0.049
Neg logit contributions
 meant-0.058
 cost-0.058
 worse-0.057
 oil-0.056
 means-0.055
Token The
Token ID383
Feature activation-6.60e-03

Pos logit contributions
 worse+0.016
storm+0.016
 meant+0.015
 cost+0.015
 means+0.015
Neg logit contributions
 amaz-0.015
 fingertips-0.014
 hands-0.013
joy-0.013
 aston-0.012
Token end
Token ID886
Feature activation-1.24e-02

Pos logit contributions
storm+0.010
 worse+0.010
 meant+0.010
 means+0.010
 cost+0.009
Neg logit contributions
 amaz-0.013
 fingertips-0.012
 hands-0.012
joy-0.011
 aston-0.011
Token.
Token ID13
Feature activation-7.07e-03

Pos logit contributions
storm+0.023
 worse+0.022
 cost+0.022
 meant+0.022
 means+0.022
Neg logit contributions
 amaz-0.020
 fingertips-0.019
 hands-0.017
joy-0.017
 aston-0.016
Token<|endoftext|>
Token ID50256
Feature activation-2.09e-02

Pos logit contributions
storm+0.012
 worse+0.012
 meant+0.011
 means+0.011
 cost+0.011
Neg logit contributions
 amaz-0.013
 fingertips-0.012
 hands-0.011
joy-0.011
 aston-0.011
TokenOnce
Token ID7454
Feature activation+2.74e-02

Pos logit contributions
storm+0.033
 worse+0.033
 meant+0.032
 means+0.032
 cost+0.032
Neg logit contributions
 amaz-0.039
 fingertips-0.037
 hands-0.034
joy-0.033
 aston-0.033
Token upon
Token ID2402
Feature activation-2.32e-02

Pos logit contributions
 amaz+0.037
 fingertips+0.034
 hands+0.030
joy+0.029
 aston+0.028
Neg logit contributions
 worse-0.058
storm-0.058
 meant-0.057
 means-0.057
 cost-0.056
Token a
Token ID257
Feature activation-9.07e-03

Pos logit contributions
storm+0.045
 means+0.043
 cost+0.043
 worse+0.042
 meant+0.042
Neg logit contributions
 amaz-0.035
 fingertips-0.032
 hands-0.031
joy-0.029
 aston-0.029
Token time
Token ID640
Feature activation+1.11e-02

Pos logit contributions
 worse+0.010
storm+0.010
 meant+0.010
 means+0.010
 cost+0.009
Neg logit contributions
 amaz-0.021
 fingertips-0.021
 hands-0.019
joy-0.019
 aston-0.019
Token,
Token ID11
Feature activation+6.26e-03

Pos logit contributions
 amaz+0.021
 fingertips+0.021
 approval+0.018
 delight+0.017
 pride+0.017
Neg logit contributions
 means-0.017
 meant-0.017
storm-0.017
 worse-0.017
ained-0.016
Token there
Token ID612
Feature activation+1.58e-02

Pos logit contributions
 amaz+0.010
 fingertips+0.009
 hands+0.009
 aston+0.008
joy+0.008
Neg logit contributions
storm-0.012
 worse-0.011
 cost-0.011
rob-0.011
 meant-0.011
Token was
Token ID373
Feature activation-1.28e-02

Pos logit contributions
 amaz+0.032
 fingertips+0.031
 hands+0.028
joy+0.028
 aston+0.028
Neg logit contributions
storm-0.023
 worse-0.023
 cost-0.022
 meant-0.021
 means-0.021
Token what
Token ID644
Feature activation+3.39e-03

Pos logit contributions
 amaz+0.036
 fingertips+0.036
joy+0.033
 hands+0.032
 aston+0.032
Neg logit contributions
 worse-0.032
 meant-0.031
 means-0.030
storm-0.029
 cost-0.029
Token she
Token ID673
Feature activation+1.62e-02

Pos logit contributions
 amaz+0.007
 fingertips+0.006
 hands+0.006
joy+0.006
 aston+0.006
Neg logit contributions
storm-0.005
 worse-0.005
rob-0.005
 cost-0.005
ained-0.005
Token would
Token ID561
Feature activation+6.15e-03

Pos logit contributions
 amaz+0.032
 fingertips+0.031
 hands+0.028
joy+0.028
 aston+0.027
Neg logit contributions
 meant-0.025
storm-0.024
 cost-0.024
 means-0.024
 worse-0.024
Token find
Token ID1064
Feature activation+1.39e-02

Pos logit contributions
 amaz+0.010
 fingertips+0.010
 hands+0.009
joy+0.009
 aston+0.008
Neg logit contributions
storm-0.011
 worse-0.011
 meant-0.011
 cost-0.011
 means-0.011
Token at
Token ID379
Feature activation+2.74e-03

Pos logit contributions
 amaz+0.025
 fingertips+0.024
 hands+0.021
joy+0.021
 aston+0.021
Neg logit contributions
 worse-0.023
 meant-0.023
 means-0.023
storm-0.023
 cost-0.022
Token the
Token ID262
Feature activation-1.49e-03

Pos logit contributions
 amaz+0.005
 fingertips+0.005
 hands+0.004
joy+0.004
 aston+0.004
Neg logit contributions
storm-0.005
 worse-0.005
 cost-0.004
 meant-0.004
 means-0.004
Token end
Token ID886
Feature activation+1.23e-02

Pos logit contributions
 means+0.002
 meant+0.002
storm+0.002
 worse+0.002
rob+0.002
Neg logit contributions
 amaz-0.003
 fingertips-0.003
 delight-0.003
joy-0.003
 aston-0.003
Token of
Token ID286
Feature activation-7.38e-05

Pos logit contributions
 amaz+0.024
 fingertips+0.023
 hands+0.021
joy+0.020
 aston+0.020
Neg logit contributions
storm-0.019
 worse-0.019
 meant-0.018
 means-0.018
 cost-0.018
Token the
Token ID262
Feature activation-7.24e-03

Pos logit contributions
storm+0.000
 worse+0.000
 cost+0.000
 meant+0.000
 means+0.000
Neg logit contributions
 amaz-0.000
 fingertips-0.000
 hands-0.000
joy-0.000
 aston-0.000
Token fog
Token ID19558
Feature activation-1.84e-02

Pos logit contributions
 means+0.008
 meant+0.008
 worse+0.008
 cost+0.008
ained+0.008
Neg logit contributions
 amaz-0.017
 fingertips-0.015
 hands-0.014
 delight-0.014
joy-0.014
Token.
Token ID13
Feature activation+7.70e-04

Pos logit contributions
 worse+0.034
storm+0.034
 meant+0.034
 means+0.033
 cost+0.033
Neg logit contributions
 amaz-0.029
 fingertips-0.027
 hands-0.024
joy-0.024
 aston-0.023
Token the
Token ID262
Feature activation+1.79e-02

Pos logit contributions
storm+0.034
 cost+0.033
 meant+0.032
 worse+0.032
 outage+0.030
Neg logit contributions
 amaz-0.027
 hands-0.025
 fingertips-0.025
joy-0.023
 aston-0.022
Token tree
Token ID5509
Feature activation-1.12e-02

Pos logit contributions
 amaz+0.040
 fingertips+0.038
joy+0.036
 aston+0.035
 hands+0.035
Neg logit contributions
 worse-0.022
storm-0.022
 cost-0.022
 meant-0.021
 means-0.021
Token,
Token ID11
Feature activation+1.48e-02

Pos logit contributions
rob+0.020
storm+0.020
 ger+0.020
 worse+0.020
 meant+0.019
Neg logit contributions
 amaz-0.017
 hands-0.015
 fingertips-0.014
aps-0.013
ushes-0.013
Token mom
Token ID1995
Feature activation-1.00e-02

Pos logit contributions
 amaz+0.019
 fingertips+0.018
joy+0.017
 aston+0.015
 hands+0.015
Neg logit contributions
 worse-0.032
 meant-0.032
 means-0.031
 cost-0.031
storm-0.031
Token?"
Token ID1701
Feature activation-3.50e-02

Pos logit contributions
storm+0.017
rob+0.016
 worse+0.016
 surgery+0.015
 ger+0.015
Neg logit contributions
 amaz-0.016
 hands-0.016
 fingertips-0.014
maid-0.014
ushes-0.014
Token Sara
Token ID24799
Feature activation-2.89e-02

Pos logit contributions
storm+0.072
 worse+0.066
ock+0.066
 outage+0.066
rob+0.065
Neg logit contributions
 amaz-0.053
 hands-0.052
 fingertips-0.051
 delight-0.048
 aston-0.046
Token asks
Token ID7893
Feature activation-1.08e-03

Pos logit contributions
storm+0.048
 worse+0.043
 cost+0.043
rob+0.041
ained+0.040
Neg logit contributions
 amaz-0.056
 hands-0.053
 fingertips-0.052
joy-0.049
 aston-0.049
Token.
Token ID13
Feature activation-2.38e-02

Pos logit contributions
 worse+0.002
 meant+0.002
 means+0.002
storm+0.002
 cost+0.002
Neg logit contributions
 amaz-0.002
 fingertips-0.002
joy-0.002
 aston-0.002
 hands-0.002
Token
Token ID198
Feature activation-1.59e-02

Pos logit contributions
storm+0.040
 worse+0.039
 meant+0.039
 means+0.038
 cost+0.038
Neg logit contributions
 amaz-0.043
 fingertips-0.041
 hands-0.037
joy-0.037
 aston-0.036
Token
Token ID198
Feature activation-9.46e-03

Pos logit contributions
storm+0.025
 worse+0.025
 meant+0.025
 means+0.024
 cost+0.024
Neg logit contributions
 amaz-0.030
 fingertips-0.029
 hands-0.026
joy-0.026
 aston-0.025
Token"
Token ID1
Feature activation-1.59e-03

Pos logit contributions
 means+0.017
 worse+0.017
storm+0.017
 meant+0.017
 cost+0.016
Neg logit contributions
 amaz-0.016
 fingertips-0.015
joy-0.014
 hands-0.013
 aston-0.013
Token it
Token ID340
Feature activation+4.41e-03

Pos logit contributions
storm+0.052
rob+0.048
 cost+0.047
 outage+0.047
 meant+0.047
Neg logit contributions
 amaz-0.047
 hands-0.044
 fingertips-0.044
 delight-0.040
 approval-0.039
Token.
Token ID13
Feature activation+3.75e-03

Pos logit contributions
 amaz+0.007
 fingertips+0.007
 hands+0.006
joy+0.006
 aston+0.006
Neg logit contributions
storm-0.008
 meant-0.008
 means-0.008
 worse-0.008
 cost-0.008
Token
Token ID198
Feature activation-4.79e-03

Pos logit contributions
 amaz+0.007
 fingertips+0.007
 hands+0.006
joy+0.006
 aston+0.006
Neg logit contributions
storm-0.006
 worse-0.006
 meant-0.006
 cost-0.006
 means-0.005
Token
Token ID198
Feature activation-3.62e-03

Pos logit contributions
storm+0.009
 worse+0.009
 cost+0.009
 meant+0.008
 means+0.008
Neg logit contributions
 amaz-0.008
 fingertips-0.007
 hands-0.007
joy-0.007
 aston-0.007
TokenBut
Token ID1537
Feature activation-6.68e-03

Pos logit contributions
storm+0.006
 worse+0.006
 meant+0.006
 means+0.006
 cost+0.006
Neg logit contributions
 amaz-0.006
 fingertips-0.006
 hands-0.005
joy-0.005
 aston-0.005
Token Ben
Token ID3932
Feature activation-1.56e-02

Pos logit contributions
 worse+0.010
storm+0.010
 meant+0.010
 means+0.010
 cost+0.010
Neg logit contributions
 amaz-0.013
 fingertips-0.013
joy-0.011
 hands-0.011
 aston-0.011
Token's
Token ID338
Feature activation-1.26e-02

Pos logit contributions
storm+0.024
 worse+0.024
 meant+0.023
 means+0.023
 cost+0.023
Neg logit contributions
 amaz-0.030
 fingertips-0.029
 hands-0.026
joy-0.026
 aston-0.025
Token juice
Token ID13135
Feature activation-4.54e-02

Pos logit contributions
 worse+0.023
 cost+0.022
 meant+0.022
storm+0.022
 means+0.022
Neg logit contributions
 amaz-0.020
 fingertips-0.019
joy-0.017
 aston-0.017
 hands-0.016
Token was
Token ID373
Feature activation-3.52e-02

Pos logit contributions
rob+0.066
storm+0.066
 means+0.064
 outage+0.062
 worse+0.062
Neg logit contributions
 fingertips-0.086
 hands-0.083
 amaz-0.082
 toes-0.075
 perfectly-0.072
Token not
Token ID407
Feature activation-1.10e-02

Pos logit contributions
rob+0.060
storm+0.057
 means+0.056
 outage+0.052
ock+0.051
Neg logit contributions
 fingertips-0.063
 hands-0.061
 amaz-0.061
 perfectly-0.057
 toes-0.057
Token water
Token ID1660
Feature activation-3.45e-02

Pos logit contributions
storm+0.021
rob+0.020
 means+0.020
guard+0.018
 outage+0.018
Neg logit contributions
 hands-0.017
 fingertips-0.017
 amaz-0.017
 toes-0.016
joy-0.015
Token on
Token ID319
Feature activation+1.24e-02

Pos logit contributions
 amaz+0.001
 fingertips+0.001
 aston+0.001
 approval+0.001
 hands+0.001
Neg logit contributions
storm-0.001
 worse-0.001
 meant-0.001
 means-0.001
 cost-0.001
Token its
Token ID663
Feature activation+1.28e-02

Pos logit contributions
 fingertips+0.026
 amaz+0.026
ushes+0.023
 hands+0.023
years+0.023
Neg logit contributions
 worse-0.015
 means-0.014
 meant-0.014
 cost-0.014
storm-0.014
Token next
Token ID1306
Feature activation+1.72e-02

Pos logit contributions
 amaz+0.028
 fingertips+0.027
apped+0.026
ushes+0.026
joy+0.026
Neg logit contributions
 worse-0.014
 cost-0.014
storm-0.013
ty-0.013
 happen-0.013
Token adventure
Token ID8855
Feature activation+9.91e-03

Pos logit contributions
 amaz+0.039
 fingertips+0.038
 hands+0.035
joy+0.035
 aston+0.034
Neg logit contributions
storm-0.020
 worse-0.019
 cost-0.018
 meant-0.018
 means-0.018
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
 amaz+0.018
 fingertips+0.017
 hands+0.015
joy+0.015
 aston+0.015
Neg logit contributions
storm-0.017
 worse-0.017
 meant-0.016
 cost-0.016
 means-0.016
Token<|endoftext|>
Token ID50256
Feature activation-1.26e-02

Pos logit contributions
 amaz+0.023
 fingertips+0.022
maid+0.020
 toes+0.020
joy+0.020
Neg logit contributions
 means-0.013
 meant-0.013
ained-0.012
storm-0.012
 cost-0.012
TokenJack
Token ID14295
Feature activation+9.76e-03

Pos logit contributions
storm+0.018
 meant+0.017
 worse+0.017
 means+0.017
 cost+0.017
Neg logit contributions
 amaz-0.026
 fingertips-0.025
 hands-0.023
joy-0.022
 aston-0.022
Token was
Token ID373
Feature activation+3.22e-03

Pos logit contributions
 amaz+0.018
 fingertips+0.018
 hands+0.016
joy+0.016
 aston+0.016
Neg logit contributions
storm-0.015
 meant-0.015
 worse-0.015
 means-0.015
 cost-0.015
Token a
Token ID257
Feature activation+2.03e-02

Pos logit contributions
 amaz+0.006
 fingertips+0.006
 hands+0.005
joy+0.005
 aston+0.005
Neg logit contributions
 worse-0.005
 meant-0.005
storm-0.005
 means-0.005
 cost-0.005
Token busy
Token ID8179
Feature activation-3.22e-03

Pos logit contributions
 fingertips+0.043
 amaz+0.042
ushes+0.040
oor+0.038
 approval+0.038
Neg logit contributions
storm-0.027
 worse-0.024
 hospital-0.024
 cost-0.024
 operation-0.023
Token 3
Token ID513
Feature activation+1.44e-02

Pos logit contributions
storm+0.005
 worse+0.005
 meant+0.005
 means+0.005
 cost+0.005
Neg logit contributions
 amaz-0.006
 fingertips-0.006
 hands-0.005
joy-0.005
 aston-0.005
Token exercise
Token ID5517
Feature activation-5.65e-02

Pos logit contributions
storm+0.056
 worse+0.048
 means+0.048
rob+0.047
 cost+0.047
Neg logit contributions
 amaz-0.063
 hands-0.061
 fingertips-0.060
 toes-0.056
 delight-0.055
Token.
Token ID13
Feature activation-1.97e-03

Pos logit contributions
storm+0.125
 worse+0.119
 meant+0.119
 outage+0.116
 cost+0.116
Neg logit contributions
 hands-0.057
 amaz-0.050
 toes-0.050
 perfectly-0.050
 fingertips-0.050
Token He
Token ID679
Feature activation-1.73e-02

Pos logit contributions
 worse+0.003
storm+0.003
 meant+0.003
 cost+0.003
 means+0.003
Neg logit contributions
 amaz-0.004
 fingertips-0.004
 hands-0.003
joy-0.003
 aston-0.003
Token loved
Token ID6151
Feature activation-4.31e-02

Pos logit contributions
storm+0.027
 worse+0.027
 cost+0.025
 meant+0.025
rob+0.024
Neg logit contributions
 amaz-0.031
 fingertips-0.030
 hands-0.030
 toes-0.027
joy-0.027
Token to
Token ID284
Feature activation-2.52e-02

Pos logit contributions
storm+0.087
 cost+0.084
 meant+0.080
 worse+0.080
rob+0.080
Neg logit contributions
 hands-0.062
joy-0.057
 amaz-0.055
 toes-0.054
 fingertips-0.054
Token exercise
Token ID5517
Feature activation-5.27e-02

Pos logit contributions
storm+0.038
 meant+0.037
 worse+0.036
 means+0.036
 cost+0.035
Neg logit contributions
 amaz-0.049
 fingertips-0.048
 hands-0.045
 delight-0.043
 approval-0.042
Token and
Token ID290
Feature activation-4.40e-02

Pos logit contributions
storm+0.116
 outage+0.110
 worse+0.108
 meant+0.108
 cost+0.108
Neg logit contributions
 amaz-0.053
 hands-0.053
 perfectly-0.050
 fingertips-0.048
 toes-0.047
Token felt
Token ID2936
Feature activation-2.16e-02

Pos logit contributions
storm+0.082
rob+0.080
 meant+0.078
 worse+0.078
ained+0.077
Neg logit contributions
 amaz-0.071
 fingertips-0.067
 hands-0.066
 delight-0.061
 toes-0.060
Token very
Token ID845
Feature activation-2.51e-02

Pos logit contributions
storm+0.037
 worse+0.036
 meant+0.035
 means+0.035
 cost+0.035
Neg logit contributions
 amaz-0.038
 fingertips-0.036
 hands-0.034
joy-0.033
 aston-0.032
Token happy
Token ID3772
Feature activation-1.25e-02

Pos logit contributions
 worse+0.041
 meant+0.039
storm+0.039
 cost+0.039
 means+0.038
Neg logit contributions
 amaz-0.047
 fingertips-0.045
joy-0.040
 hands-0.040
 aston-0.040
Token when
Token ID618
Feature activation-2.39e-02

Pos logit contributions
storm+0.020
 worse+0.020
 meant+0.019
 means+0.019
 cost+0.019
Neg logit contributions
 amaz-0.024
 fingertips-0.023
 hands-0.021
 aston-0.020
joy-0.020
Token ow
Token ID12334
Feature activation+2.83e-02

Pos logit contributions
 amaz+0.079
 fingertips+0.076
 hands+0.071
joy+0.069
 aston+0.068
Neg logit contributions
storm-0.046
 worse-0.045
 meant-0.044
 means-0.044
 cost-0.044
Token,
Token ID11
Feature activation+3.40e-02

Pos logit contributions
 amaz+0.054
 fingertips+0.050
 aston+0.045
 approval+0.044
joy+0.044
Neg logit contributions
 worse-0.050
storm-0.047
 means-0.047
 meant-0.046
ock-0.044
Token ow
Token ID12334
Feature activation+2.58e-02

Pos logit contributions
 amaz+0.074
 fingertips+0.072
 hands+0.067
joy+0.065
 aston+0.064
Neg logit contributions
storm-0.043
 worse-0.042
 meant-0.042
 means-0.042
 cost-0.041
Token.
Token ID13
Feature activation+2.64e-02

Pos logit contributions
 amaz+0.041
 fingertips+0.038
joy+0.033
 aston+0.033
 hands+0.032
Neg logit contributions
 worse-0.051
storm-0.051
 means-0.049
 meant-0.049
 cost-0.048
Token My
Token ID2011
Feature activation+1.90e-03

Pos logit contributions
 amaz+0.047
 fingertips+0.045
joy+0.041
 hands+0.040
 aston+0.039
Neg logit contributions
 worse-0.044
storm-0.044
 means-0.043
 meant-0.043
 cost-0.042
Token arm
Token ID3211
Feature activation-5.44e-02

Pos logit contributions
 amaz+0.003
 fingertips+0.003
joy+0.003
 hands+0.003
 aston+0.003
Neg logit contributions
 worse-0.003
storm-0.003
 meant-0.003
 means-0.003
 cost-0.003
Token hurts
Token ID20406
Feature activation+2.37e-03

Pos logit contributions
storm+0.053
rob+0.049
guard+0.048
 cost+0.046
 meant+0.045
Neg logit contributions
 hands-0.128
 fingertips-0.126
 amaz-0.123
 ear-0.113
 perfectly-0.111
Token.
Token ID13
Feature activation+2.52e-02

Pos logit contributions
 amaz+0.004
 fingertips+0.004
 hands+0.003
joy+0.003
 aston+0.003
Neg logit contributions
storm-0.004
 worse-0.004
 meant-0.004
 cost-0.004
 means-0.004
Token My
Token ID2011
Feature activation-2.15e-03

Pos logit contributions
 amaz+0.044
 fingertips+0.042
joy+0.038
 hands+0.038
 aston+0.037
Neg logit contributions
 worse-0.043
storm-0.042
 means-0.042
 meant-0.042
 cost-0.041
Token bike
Token ID7161
Feature activation-3.93e-02

Pos logit contributions
storm+0.004
 worse+0.004
 meant+0.004
 means+0.004
 cost+0.004
Neg logit contributions
 amaz-0.003
 fingertips-0.003
 hands-0.003
joy-0.003
 aston-0.003
Token is
Token ID318
Feature activation-2.76e-02

Pos logit contributions
storm+0.059
rob+0.055
ained+0.053
 meant+0.053
 worse+0.052
Neg logit contributions
 amaz-0.071
 fingertips-0.070
 hands-0.070
 aston-0.062
 perfectly-0.061
Token mom
Token ID1995
Feature activation-2.97e-02

Pos logit contributions
storm+0.025
 worse+0.025
 meant+0.024
 cost+0.024
 means+0.024
Neg logit contributions
 amaz-0.014
 fingertips-0.013
joy-0.011
 hands-0.011
 aston-0.011
Token told
Token ID1297
Feature activation-2.21e-02

Pos logit contributions
storm+0.052
 worse+0.051
rob+0.050
ained+0.047
 outage+0.047
Neg logit contributions
 amaz-0.046
 hands-0.045
 fingertips-0.041
 aston-0.039
 pride-0.038
Token her
Token ID607
Feature activation-1.42e-02

Pos logit contributions
rob+0.048
storm+0.046
 outage+0.043
 cost+0.043
guard+0.043
Neg logit contributions
 hands-0.030
 amaz-0.029
 fingertips-0.028
 delight-0.026
 approval-0.025
Token that
Token ID326
Feature activation-3.78e-02

Pos logit contributions
storm+0.018
 worse+0.017
 meant+0.017
 cost+0.017
 means+0.017
Neg logit contributions
 amaz-0.031
 fingertips-0.030
 hands-0.028
joy-0.027
 aston-0.027
Token eating
Token ID6600
Feature activation-5.59e-02

Pos logit contributions
storm+0.061
rob+0.060
 outage+0.060
ock+0.060
ained+0.058
Neg logit contributions
 amaz-0.064
 hands-0.061
 fingertips-0.059
joy-0.054
 delight-0.053
Token healthy
Token ID5448
Feature activation-6.32e-02

Pos logit contributions
 outage+0.111
storm+0.110
rob+0.093
guard+0.091
ock+0.090
Neg logit contributions
 hands-0.085
 amaz-0.076
 perfectly-0.072
 fingertips-0.072
 toes-0.069
Token food
Token ID2057
Feature activation-3.06e-02

Pos logit contributions
storm+0.115
 outage+0.109
 worse+0.102
ained+0.098
guard+0.097
Neg logit contributions
 hands-0.113
 amaz-0.104
 fingertips-0.104
 toes-0.097
 delight-0.095
Token was
Token ID373
Feature activation-6.03e-02

Pos logit contributions
rob+0.060
storm+0.060
 worse+0.057
ock+0.055
ained+0.054
Neg logit contributions
 fingertips-0.040
 amaz-0.039
 hands-0.039
 perfectly-0.035
 toes-0.034
Token important
Token ID1593
Feature activation-3.90e-02

Pos logit contributions
rob+0.128
storm+0.113
 outage+0.105
ock+0.104
bike+0.103
Neg logit contributions
 hands-0.081
 perfectly-0.076
 fingertips-0.075
 amaz-0.074
 toes-0.070
Token to
Token ID284
Feature activation-2.38e-02

Pos logit contributions
storm+0.077
 worse+0.074
rob+0.070
 cost+0.070
 means+0.069
Neg logit contributions
 amaz-0.056
 fingertips-0.056
 hands-0.055
joy-0.050
 toes-0.049
Token stay
Token ID2652
Feature activation-1.75e-02

Pos logit contributions
rob+0.044
storm+0.044
 outage+0.042
ained+0.042
 means+0.042
Neg logit contributions
 amaz-0.038
 fingertips-0.038
 hands-0.035
 delight-0.034
 approval-0.032
Token Tim
Token ID5045
Feature activation-1.86e-02

Pos logit contributions
storm+0.032
 worse+0.030
 cost+0.029
rob+0.029
 meant+0.029
Neg logit contributions
 amaz-0.027
 fingertips-0.025
 hands-0.025
joy-0.023
 delight-0.022
Token asked
Token ID1965
Feature activation+2.12e-04

Pos logit contributions
storm+0.030
 worse+0.027
ained+0.026
rob+0.026
 cost+0.025
Neg logit contributions
 amaz-0.037
 hands-0.035
 fingertips-0.035
joy-0.032
 delight-0.032
Token.
Token ID13
Feature activation+7.42e-03

Pos logit contributions
 amaz+0.000
 fingertips+0.000
joy+0.000
 aston+0.000
 hands+0.000
Neg logit contributions
 worse-0.000
 meant-0.000
 means-0.000
storm-0.000
 cost-0.000
Token "
Token ID366
Feature activation+1.28e-02

Pos logit contributions
 amaz+0.014
 fingertips+0.014
 hands+0.012
joy+0.012
 aston+0.012
Neg logit contributions
storm-0.011
 worse-0.011
 meant-0.011
 means-0.011
 cost-0.011
TokenThe
Token ID464
Feature activation-2.37e-02

Pos logit contributions
 amaz+0.024
 fingertips+0.023
 hands+0.021
joy+0.020
 aston+0.020
Neg logit contributions
storm-0.021
 worse-0.020
 meant-0.020
 means-0.020
 cost-0.020
Token mosquitoes
Token ID35573
Feature activation-5.14e-02

Pos logit contributions
storm+0.036
 worse+0.036
 meant+0.035
 means+0.035
 cost+0.035
Neg logit contributions
 amaz-0.045
 fingertips-0.043
 hands-0.040
joy-0.039
 aston-0.038
Token are
Token ID389
Feature activation-2.86e-02

Pos logit contributions
 worse+0.097
rob+0.096
storm+0.096
ained+0.094
 surgery+0.091
Neg logit contributions
 amaz-0.070
 fingertips-0.068
 hands-0.066
 aston-0.056
 toes-0.056
Token bothering
Token ID41656
Feature activation+2.87e-03

Pos logit contributions
rob+0.046
storm+0.043
guard+0.041
 means+0.040
ock+0.039
Neg logit contributions
 hands-0.052
 amaz-0.051
 fingertips-0.051
 toes-0.047
 ear-0.046
Token us
Token ID514
Feature activation-7.06e-03

Pos logit contributions
 amaz+0.005
 fingertips+0.005
 hands+0.005
 approval+0.005
joy+0.005
Neg logit contributions
storm-0.004
 worse-0.004
 cost-0.004
rob-0.004
 meant-0.004
Token."
Token ID526
Feature activation-1.06e-02

Pos logit contributions
storm+0.013
rob+0.013
 worse+0.013
 cost+0.013
 meant+0.012
Neg logit contributions
 amaz-0.011
 fingertips-0.010
 hands-0.010
 toes-0.009
 approval-0.009
Token
Token ID198
Feature activation-8.22e-03

Pos logit contributions
storm+0.018
 worse+0.018
 meant+0.017
 means+0.017
 cost+0.017
Neg logit contributions
 amaz-0.019
 fingertips-0.018
 hands-0.016
joy-0.016
 aston-0.016
Token day
Token ID1110
Feature activation+8.24e-03

Pos logit contributions
 amaz+0.003
 hands+0.003
 delight+0.003
joy+0.003
 fingertips+0.003
Neg logit contributions
 worse-0.003
rob-0.003
storm-0.002
 ger-0.002
guard-0.002
Token,
Token ID11
Feature activation-5.60e-03

Pos logit contributions
 amaz+0.014
 fingertips+0.013
 approval+0.012
joy+0.012
 aston+0.012
Neg logit contributions
 worse-0.015
storm-0.015
 meant-0.015
 means-0.014
 cost-0.014
Token Tim
Token ID5045
Feature activation-7.55e-03

Pos logit contributions
storm+0.009
 worse+0.009
 means+0.009
 meant+0.009
 cost+0.009
Neg logit contributions
 amaz-0.010
 fingertips-0.010
 hands-0.009
joy-0.009
 aston-0.009
Tokenmy
Token ID1820
Feature activation-6.10e-03

Pos logit contributions
storm+0.013
rob+0.012
 worse+0.012
 means+0.012
guard+0.012
Neg logit contributions
 amaz-0.013
 hands-0.012
 fingertips-0.012
joy-0.011
 aston-0.010
Token's
Token ID338
Feature activation-1.07e-02

Pos logit contributions
storm+0.010
 worse+0.010
rob+0.009
 cost+0.009
 means+0.009
Neg logit contributions
 amaz-0.011
 fingertips-0.010
 hands-0.010
joy-0.009
 aston-0.009
Token knee
Token ID10329
Feature activation-5.80e-02

Pos logit contributions
 worse+0.023
storm+0.023
 meant+0.023
 cost+0.023
 means+0.023
Neg logit contributions
 amaz-0.014
 fingertips-0.013
joy-0.011
 hands-0.011
 aston-0.011
Token started
Token ID2067
Feature activation-3.65e-02

Pos logit contributions
igators+0.107
rob+0.107
 outage+0.101
 means+0.100
guard+0.099
Neg logit contributions
 hands-0.081
 fingertips-0.080
 amaz-0.071
 perfectly-0.068
 grinned-0.067
Token to
Token ID284
Feature activation-2.18e-02

Pos logit contributions
rob+0.072
storm+0.067
guard+0.066
 means+0.065
igators+0.064
Neg logit contributions
 hands-0.059
 fingertips-0.057
 amaz-0.056
joy-0.054
 eyes-0.048
Token hurt
Token ID5938
Feature activation-1.82e-02

Pos logit contributions
rob+0.038
 means+0.035
ained+0.034
storm+0.033
guard+0.033
Neg logit contributions
 amaz-0.035
 fingertips-0.033
 hands-0.032
 smile-0.031
 delight-0.030
Token even
Token ID772
Feature activation-3.93e-02

Pos logit contributions
storm+0.035
 cost+0.034
 means+0.033
rob+0.033
 meant+0.032
Neg logit contributions
 amaz-0.027
 fingertips-0.027
 hands-0.026
joy-0.024
 toes-0.023
Token more
Token ID517
Feature activation-2.31e-02

Pos logit contributions
storm+0.083
rob+0.074
bank+0.064
ained+0.062
forest+0.062
Neg logit contributions
 amaz-0.061
 hands-0.056
 approval-0.054
 proudly-0.051
 ear-0.050