TOP ACTIVATIONS
MAX = 0.107

THE BE LL WHEN YOU FE EL LIKE IT ".
Tim my quickly yelled " ST OP !" and ran to
. The sign said " ST OP ". "
Emily , you look AM AZ ING !" Emily was so
AND YOU WILL GET IN TR OU BLE !"
morning ." The mom was perplex ed . She knew that
No , it 's not ! It 's a diamond !
The baby 's name was Ag nes . Ag nes loved
deep breath and shouted " ST OP !"
man stepped closer and said gru ff ly , " You
so surprised that they started tor oll away .
price tag . It said $ 1 . 99 per pound
her thoughtful idea . <|endoftext|> Jes se was very excited when
It says : " No ! You are bad ! You
that the zipper had become unst uck . She was relieved
for being so well - beh aved and Jack was delighted
a sign that said " RE VERS E ". L ila
No , no , no ! You are a bad girl
Mia realised that it wasn ’ t a star
the fridge . She was perplex ed by what she saw

MIN ACTIVATIONS
MIN = -0.100

As he cleaned more , Tim found more things he
. While he was swinging , he saw a group of
. While resting , the bird ie saw a
. While she was sweeping , she kept cl apping her
her friends . While playing , she found a toy car
tree branch . While resting , Twe ety saw a little
. While she was collecting , she saw a bee flying
his friends . While playing , Tim my saw a beautiful
As they cooked , Tim my 's mom told
As Tim my left , he said , " Thanks
As she played , she found a piece of
. While swimming , the mole met a kind
As Red was driving , he saw a big truck
While playing on the swings , she saw a rope lying
While Tim was resting , a spoiled cat named Max
As they were driving , they saw a big hill
As they walked further , they saw a scary tree
As she was walking , she saw a big wolf
As she was eating , she accidentally spilled the noodles
As they baked the cookies , Lily 's mom told her

INTERVAL 0.056 - 0.107
CONTAINS 2.545%

, " No , Tim . It is not safe ."
!" Tim and Mia said . They waved and smiled .
Go up again ! ” The jet
white and has no cord . She says it is a
They learned that kindness is . <|endoftext|> S ara likes to

INTERVAL 0.004 - 0.056
CONTAINS 44.077%

was a big party inside . All of her friends and
ibrarian said . " Thank you ," Anna and
rainbow . There , he found a big box of y
before . He shouted , Look
their surprise , they all agreed and the team of families

INTERVAL -0.048 - 0.004
CONTAINS 47.643%

. Lucy was very sad , but she tried hard not
promised to come back again soon . They felt so happy
encourage her to draw more , so she got her a
he took a sip . The coffee was delicious and the
mom found a special fl ute for her daughter , one

INTERVAL -0.100 - -0.048
CONTAINS 5.736%

When she went back home , the girl was full of
her toys , but one day she lost the key to
Tim always helped his mom and dad , even in the
all day long . One day , Tim my was walking
the park with her mom and saw some noisy kids playing
Token THE
Token ID3336
Feature activation+1.83e-02

Pos logit contributions
 Give+0.051
 agrees+0.050
 Go+0.049
ives+0.045
 Share+0.044
Neg logit contributions
field-0.039
str-0.038
 sudden-0.036
 raft-0.034
 wheat-0.033
Token BE
Token ID9348
Feature activation+6.62e-02

Pos logit contributions
 agrees+0.035
 Give+0.032
 Go+0.031
ives+0.030
 consequences+0.028
Neg logit contributions
str-0.036
field-0.033
 cherry-0.030
 pine-0.029
pron-0.029
TokenLL
Token ID3069
Feature activation+9.28e-02

Pos logit contributions
 agrees+0.124
 Give+0.120
 Go+0.113
ives+0.111
 acts+0.108
Neg logit contributions
str-0.124
field-0.119
 sudden-0.110
 wheat-0.108
 raft-0.106
Token WHEN
Token ID42099
Feature activation+8.59e-02

Pos logit contributions
 agrees+0.202
 Give+0.188
 Go+0.181
ives+0.175
 acts+0.173
Neg logit contributions
str-0.151
field-0.144
 sudden-0.132
 wheat-0.127
aster-0.127
Token YOU
Token ID7013
Feature activation+8.00e-02

Pos logit contributions
 agrees+0.183
 Give+0.162
ives+0.160
 aback+0.153
 teaches+0.153
Neg logit contributions
str-0.142
field-0.137
aster-0.127
lf-0.124
 wheat-0.124
Token FE
Token ID18630
Feature activation+1.07e-01

Pos logit contributions
 agrees+0.158
 Give+0.155
 Go+0.148
ives+0.142
 Come+0.137
Neg logit contributions
str-0.139
field-0.137
 sudden-0.124
 wheat-0.121
 raft-0.120
TokenEL
Token ID3698
Feature activation+8.03e-02

Pos logit contributions
 agrees+0.204
 Give+0.195
 acts+0.188
 Go+0.185
 aback+0.179
Neg logit contributions
str-0.204
field-0.174
pron-0.171
aster-0.162
aman-0.159
Token LIKE
Token ID34178
Feature activation+3.40e-02

Pos logit contributions
 agrees+0.187
 Give+0.178
 Go+0.174
 Come+0.166
 Share+0.163
Neg logit contributions
str-0.114
aster-0.100
 pine-0.100
field-0.100
 wheat-0.098
Token IT
Token ID7283
Feature activation+2.48e-02

Pos logit contributions
 agrees+0.068
 Give+0.066
 Go+0.063
ives+0.061
 Come+0.058
Neg logit contributions
str-0.058
field-0.058
 sudden-0.053
 wheat-0.051
 raft-0.051
Token".
Token ID1911
Feature activation+2.44e-02

Pos logit contributions
 agrees+0.043
 Give+0.042
 Go+0.040
ives+0.038
 Come+0.036
Neg logit contributions
field-0.049
str-0.049
 sudden-0.045
 wheat-0.044
 raft-0.044
Token
Token ID198
Feature activation-3.80e-02

Pos logit contributions
 agrees+0.050
 Give+0.049
 Go+0.047
ives+0.045
 Share+0.043
Neg logit contributions
str-0.041
field-0.041
 sudden-0.038
 wheat-0.036
 raft-0.035
Token Tim
Token ID5045
Feature activation+1.65e-02

Pos logit contributions
 Give+0.024
 agrees+0.024
 Go+0.023
ives+0.022
 Come+0.021
Neg logit contributions
field-0.021
str-0.021
 sudden-0.019
 wheat-0.019
 raft-0.018
Tokenmy
Token ID1820
Feature activation-6.46e-03

Pos logit contributions
 agrees+0.033
 Give+0.032
 Go+0.031
ives+0.030
 Share+0.029
Neg logit contributions
str-0.028
field-0.028
 sudden-0.027
 raft-0.025
 pine-0.024
Token quickly
Token ID2952
Feature activation-2.04e-02

Pos logit contributions
field+0.013
str+0.012
 wheat+0.011
aster+0.011
aman+0.011
Neg logit contributions
 Give-0.012
 agrees-0.011
 Go-0.011
ives-0.010
 disagrees-0.010
Token yelled
Token ID22760
Feature activation+6.12e-02

Pos logit contributions
field+0.037
str+0.033
 sudden+0.031
pron+0.031
 wheat+0.030
Neg logit contributions
 Give-0.041
 agrees-0.041
 Go-0.039
ives-0.038
 disagrees-0.035
Token "
Token ID366
Feature activation+5.38e-02

Pos logit contributions
 agrees+0.162
 Give+0.152
ives+0.151
 acts+0.148
 Go+0.148
Neg logit contributions
 which-0.056
str-0.054
 sudden-0.053
 raft-0.052
 while-0.052
TokenST
Token ID2257
Feature activation+1.07e-01

Pos logit contributions
 Give+0.105
 Go+0.101
 agrees+0.100
ives+0.093
 Come+0.090
Neg logit contributions
field-0.098
str-0.095
 wheat-0.086
 raft-0.084
 sudden-0.083
TokenOP
Token ID3185
Feature activation+5.96e-02

Pos logit contributions
 agrees+0.239
 Give+0.215
 demands+0.208
 acts+0.208
 disagrees+0.202
Neg logit contributions
str-0.150
aster-0.148
field-0.128
aman-0.128
Wil-0.126
Token!"
Token ID2474
Feature activation-2.66e-03

Pos logit contributions
 agrees+0.112
 Give+0.104
 Go+0.102
 acts+0.094
ives+0.094
Neg logit contributions
str-0.110
field-0.107
 sudden-0.104
 pine-0.096
pron-0.096
Token and
Token ID290
Feature activation-4.69e-02

Pos logit contributions
field+0.006
str+0.006
 wheat+0.005
 sudden+0.005
 pine+0.005
Neg logit contributions
 Give-0.004
 Go-0.004
 agrees-0.004
 Come-0.004
 You-0.004
Token ran
Token ID4966
Feature activation+2.95e-02

Pos logit contributions
field+0.096
str+0.091
pron+0.084
iri+0.083
 wheat+0.082
Neg logit contributions
 Give-0.085
 Go-0.080
 agrees-0.078
 disagrees-0.072
 demands-0.070
Token to
Token ID284
Feature activation-3.76e-02

Pos logit contributions
 agrees+0.070
 Give+0.068
 Go+0.066
ives+0.063
 acts+0.063
Neg logit contributions
str-0.039
field-0.038
 sudden-0.036
 which-0.035
 raft-0.035
Token.
Token ID13
Feature activation+3.75e-02

Pos logit contributions
 agrees+0.027
 Give+0.027
 Go+0.025
ives+0.024
 Share+0.023
Neg logit contributions
field-0.030
str-0.030
 sudden-0.028
 wheat-0.027
 raft-0.026
Token The
Token ID383
Feature activation+3.43e-03

Pos logit contributions
 agrees+0.073
 Give+0.070
 Go+0.068
ives+0.066
 Share+0.062
Neg logit contributions
str-0.066
field-0.066
 sudden-0.062
 wheat-0.060
 raft-0.059
Token sign
Token ID1051
Feature activation-1.49e-02

Pos logit contributions
 agrees+0.009
 Give+0.009
 Go+0.008
 disagrees+0.008
ives+0.008
Neg logit contributions
field-0.004
str-0.004
 sudden-0.004
Wil-0.004
 Wil-0.003
Token said
Token ID531
Feature activation+5.67e-02

Pos logit contributions
field+0.025
str+0.024
 sudden+0.022
 wheat+0.021
 raft+0.021
Neg logit contributions
 Give-0.032
 agrees-0.031
 Go-0.030
ives-0.028
 teaches-0.027
Token "
Token ID366
Feature activation+7.87e-02

Pos logit contributions
 Give+0.115
 agrees+0.110
 Go+0.110
 Share+0.100
 Come+0.098
Neg logit contributions
field-0.105
 sudden-0.097
str-0.096
aman-0.089
 raft-0.088
TokenST
Token ID2257
Feature activation+1.06e-01

Pos logit contributions
 Give+0.152
 Go+0.147
 agrees+0.146
ives+0.138
 Share+0.133
Neg logit contributions
field-0.141
str-0.137
 sudden-0.129
 raft-0.129
pron-0.124
TokenOP
Token ID3185
Feature activation+6.87e-02

Pos logit contributions
 agrees+0.222
 Give+0.208
 demands+0.196
 Go+0.193
 disagrees+0.192
Neg logit contributions
str-0.159
aster-0.156
field-0.149
 wheat-0.134
cle-0.133
Token".
Token ID1911
Feature activation+3.16e-02

Pos logit contributions
 agrees+0.118
 Give+0.108
 Go+0.105
 disagrees+0.096
 acts+0.095
Neg logit contributions
str-0.137
field-0.136
 sudden-0.129
pron-0.125
 wheat-0.124
Token
Token ID198
Feature activation-3.79e-02

Pos logit contributions
 agrees+0.063
 Give+0.062
 Go+0.059
ives+0.057
 Share+0.054
Neg logit contributions
field-0.054
str-0.054
 sudden-0.051
 wheat-0.049
 raft-0.048
Token
Token ID198
Feature activation-3.99e-02

Pos logit contributions
field+0.064
str+0.064
 sudden+0.059
 wheat+0.056
 raft+0.056
Neg logit contributions
 agrees-0.077
 Give-0.076
 Go-0.073
ives-0.069
 Share-0.066
Token"
Token ID1
Feature activation+1.02e-02

Pos logit contributions
field+0.078
str+0.073
pron+0.069
 peach+0.066
 sudden+0.065
Neg logit contributions
 Give-0.074
 agrees-0.071
 Go-0.070
 Come-0.063
 Share-0.061
Token Emily
Token ID17608
Feature activation+2.48e-02

Pos logit contributions
 agrees+0.074
 Give+0.066
 Go+0.061
ives+0.061
 disagrees+0.059
Neg logit contributions
str-0.126
field-0.125
 sudden-0.119
 which-0.116
 wheat-0.116
Token,
Token ID11
Feature activation-1.84e-02

Pos logit contributions
 agrees+0.046
 Give+0.045
 Go+0.043
ives+0.041
 Share+0.039
Neg logit contributions
field-0.046
str-0.046
 sudden-0.043
 wheat-0.041
 raft-0.040
Token you
Token ID345
Feature activation+5.00e-02

Pos logit contributions
field+0.034
str+0.033
 sudden+0.031
 raft+0.028
 wheat+0.028
Neg logit contributions
 Give-0.037
 agrees-0.036
 Go-0.035
ives-0.032
 Come-0.031
Token look
Token ID804
Feature activation+4.71e-02

Pos logit contributions
 agrees+0.095
 Give+0.094
 Go+0.090
ives+0.085
 Share+0.081
Neg logit contributions
field-0.091
str-0.090
 sudden-0.084
 wheat-0.081
 raft-0.080
Token AM
Token ID3001
Feature activation+6.13e-02

Pos logit contributions
 agrees+0.106
 Give+0.105
 Go+0.101
ives+0.096
 Share+0.093
Neg logit contributions
field-0.069
str-0.069
 sudden-0.063
 wheat-0.059
 raft-0.059
TokenAZ
Token ID22778
Feature activation+1.04e-01

Pos logit contributions
 agrees+0.113
 Give+0.107
 Go+0.102
ives+0.097
 acts+0.095
Neg logit contributions
str-0.114
field-0.112
 sudden-0.103
pron-0.101
 which-0.098
TokenING
Token ID2751
Feature activation+2.64e-02

Pos logit contributions
 agrees+0.200
 Give+0.191
 acts+0.178
 Go+0.176
 disagrees+0.173
Neg logit contributions
str-0.176
field-0.174
 sudden-0.158
aman-0.157
pron-0.155
Token!"
Token ID2474
Feature activation+1.97e-03

Pos logit contributions
 Give+0.047
 agrees+0.047
 Go+0.045
ives+0.042
 Share+0.040
Neg logit contributions
field-0.050
str-0.050
 sudden-0.047
 wheat-0.045
 raft-0.045
Token Emily
Token ID17608
Feature activation+1.84e-02

Pos logit contributions
 Give+0.004
 agrees+0.004
 Go+0.004
ives+0.003
 Share+0.003
Neg logit contributions
field-0.004
str-0.003
 sudden-0.003
 wheat-0.003
 which-0.003
Token was
Token ID373
Feature activation+1.10e-02

Pos logit contributions
 agrees+0.037
 Give+0.036
 Go+0.034
ives+0.032
 Share+0.031
Neg logit contributions
field-0.032
str-0.032
 sudden-0.029
 wheat-0.029
 raft-0.028
Token so
Token ID523
Feature activation-1.51e-02

Pos logit contributions
 agrees+0.025
 Give+0.025
 Go+0.024
ives+0.023
 Share+0.022
Neg logit contributions
field-0.016
str-0.016
 sudden-0.015
 wheat-0.014
 raft-0.014
Token AND
Token ID5357
Feature activation+4.30e-03

Pos logit contributions
 agrees+0.119
 Give+0.104
 disagrees+0.101
ives+0.100
 Go+0.098
Neg logit contributions
str-0.132
field-0.127
 sudden-0.123
Wil-0.121
 pine-0.120
Token YOU
Token ID7013
Feature activation+6.17e-02

Pos logit contributions
 agrees+0.009
 Give+0.009
 Go+0.008
ives+0.008
 Share+0.008
Neg logit contributions
field-0.007
str-0.007
 sudden-0.007
 wheat-0.006
 raft-0.006
Token WILL
Token ID17682
Feature activation+5.06e-02

Pos logit contributions
 agrees+0.105
 Give+0.104
 Go+0.099
ives+0.092
 Share+0.089
Neg logit contributions
str-0.122
field-0.122
 sudden-0.113
 wheat-0.109
 raft-0.108
Token GET
Token ID17151
Feature activation+6.62e-02

Pos logit contributions
 Give+0.100
 agrees+0.098
 Go+0.096
ives+0.089
 Share+0.086
Neg logit contributions
field-0.089
str-0.088
 sudden-0.082
 wheat-0.078
 raft-0.078
Token IN
Token ID3268
Feature activation+6.73e-02

Pos logit contributions
 agrees+0.141
 Give+0.132
ives+0.127
 Go+0.126
 Share+0.124
Neg logit contributions
str-0.102
field-0.096
 sudden-0.093
 wheat-0.092
 pine-0.091
Token TR
Token ID7579
Feature activation+1.03e-01

Pos logit contributions
 agrees+0.131
 Give+0.129
 Go+0.123
ives+0.117
 Share+0.113
Neg logit contributions
str-0.118
field-0.117
 sudden-0.109
 wheat-0.106
 raft-0.103
TokenOU
Token ID2606
Feature activation+9.13e-02

Pos logit contributions
 agrees+0.198
 teaches+0.176
 acts+0.175
 Give+0.174
 disagrees+0.169
Neg logit contributions
str-0.177
field-0.170
aman-0.163
Wil-0.162
aster-0.162
TokenBLE
Token ID19146
Feature activation+6.05e-02

Pos logit contributions
 agrees+0.202
 Give+0.186
 Go+0.181
 teaches+0.176
ives+0.175
Neg logit contributions
str-0.140
field-0.133
 sudden-0.124
 pine-0.118
 raft-0.118
Token!"
Token ID2474
Feature activation+1.74e-02

Pos logit contributions
 agrees+0.111
 Give+0.106
 Go+0.102
ives+0.097
 Share+0.096
Neg logit contributions
str-0.114
field-0.112
 sudden-0.102
 raft-0.101
 wheat-0.100
Token
Token ID198
Feature activation-2.01e-02

Pos logit contributions
 Give+0.034
 agrees+0.034
 Go+0.033
ives+0.030
 Come+0.030
Neg logit contributions
field-0.031
str-0.031
 sudden-0.028
 raft-0.027
 wheat-0.027
Token
Token ID198
Feature activation-2.31e-02

Pos logit contributions
field+0.034
str+0.034
 sudden+0.031
 wheat+0.030
 raft+0.030
Neg logit contributions
 agrees-0.040
 Give-0.040
 Go-0.038
ives-0.036
 Share-0.035
Token morning
Token ID3329
Feature activation-1.01e-02

Pos logit contributions
 Give+0.049
 agrees+0.047
 acts+0.043
 teaches+0.043
 Go+0.043
Neg logit contributions
 sudden-0.042
apy-0.041
ittens-0.041
Wil-0.040
str-0.040
Token."
Token ID526
Feature activation+1.83e-02

Pos logit contributions
str+0.023
field+0.023
 sudden+0.022
 raft+0.021
Wil+0.021
Neg logit contributions
 Give-0.014
 agrees-0.014
 Go-0.013
ives-0.012
 Come-0.011
Token The
Token ID383
Feature activation-6.98e-03

Pos logit contributions
 agrees+0.032
 Give+0.032
 Go+0.030
ives+0.028
 Share+0.027
Neg logit contributions
field-0.036
str-0.036
 sudden-0.034
 wheat-0.032
 raft-0.032
Token mom
Token ID1995
Feature activation-2.06e-02

Pos logit contributions
str+0.010
field+0.009
Wil+0.008
 sudden+0.008
Bet+0.008
Neg logit contributions
 Give-0.016
 agrees-0.016
 Go-0.016
 teaches-0.015
 Share-0.014
Token was
Token ID373
Feature activation+1.26e-02

Pos logit contributions
field+0.037
str+0.036
 sudden+0.034
 wheat+0.033
 raft+0.032
Neg logit contributions
 agrees-0.041
 Give-0.040
 Go-0.037
ives-0.035
 teaches-0.035
Token perplex
Token ID35682
Feature activation+1.00e-01

Pos logit contributions
 agrees+0.026
 Give+0.026
 Go+0.025
 Share+0.023
 Believe+0.022
Neg logit contributions
field-0.021
str-0.021
 sudden-0.019
 raft-0.019
 pine-0.019
Tokened
Token ID276
Feature activation+3.66e-02

Pos logit contributions
 Go+0.159
 agrees+0.159
 Give+0.157
 Share+0.144
 Believe+0.141
Neg logit contributions
str-0.181
 which-0.167
 they-0.165
field-0.164
 sudden-0.162
Token.
Token ID13
Feature activation+3.01e-02

Pos logit contributions
 agrees+0.092
 Give+0.091
 Go+0.089
ives+0.088
 Share+0.086
Neg logit contributions
 sudden-0.037
 which-0.036
str-0.036
 wheat-0.035
field-0.034
Token She
Token ID1375
Feature activation+3.44e-03

Pos logit contributions
 agrees+0.059
 Give+0.057
 Go+0.055
ives+0.053
 Share+0.050
Neg logit contributions
str-0.053
field-0.053
 sudden-0.049
 wheat-0.046
 which-0.046
Token knew
Token ID2993
Feature activation+8.37e-03

Pos logit contributions
 agrees+0.007
 Give+0.007
 Go+0.006
 teaches+0.006
 disagrees+0.006
Neg logit contributions
field-0.006
str-0.006
 wheat-0.005
 sudden-0.005
aster-0.005
Token that
Token ID326
Feature activation-5.23e-03

Pos logit contributions
 Go+0.018
 Give+0.018
 agrees+0.018
 You+0.016
 Believe+0.016
Neg logit contributions
field-0.013
 sudden-0.011
str-0.011
pron-0.011
 cloudy-0.010
TokenNo
Token ID2949
Feature activation+1.70e-03

Pos logit contributions
 agrees+0.061
 Give+0.060
 Go+0.058
ives+0.056
 Share+0.053
Neg logit contributions
str-0.044
field-0.043
 sudden-0.042
 wheat-0.038
 pine-0.037
Token,
Token ID11
Feature activation-4.19e-03

Pos logit contributions
 agrees+0.004
 Give+0.004
 Go+0.004
ives+0.004
 Share+0.004
Neg logit contributions
str-0.002
field-0.002
 sudden-0.002
 wheat-0.002
 raft-0.002
Token it
Token ID340
Feature activation+2.24e-02

Pos logit contributions
str+0.007
field+0.007
 sudden+0.007
 wheat+0.007
 raft+0.007
Neg logit contributions
 agrees-0.008
 Give-0.008
 Go-0.008
ives-0.007
 Share-0.007
Token's
Token ID338
Feature activation+3.81e-02

Pos logit contributions
 agrees+0.033
 Give+0.032
ives+0.032
 Go+0.031
 Share+0.030
Neg logit contributions
str-0.049
field-0.046
 pine-0.043
 raft-0.043
 peach-0.043
Token not
Token ID407
Feature activation+1.95e-02

Pos logit contributions
 Give+0.063
 agrees+0.062
 Go+0.062
ives+0.054
 Share+0.053
Neg logit contributions
field-0.080
str-0.078
 sudden-0.074
 raft-0.069
 wheat-0.069
Token!
Token ID0
Feature activation+9.94e-02

Pos logit contributions
 agrees+0.039
 Give+0.039
 Go+0.038
ives+0.035
 Share+0.034
Neg logit contributions
field-0.033
str-0.033
 sudden-0.031
 wheat-0.029
 raft-0.029
Token It
Token ID632
Feature activation+2.72e-02

Pos logit contributions
 agrees+0.153
ives+0.131
 aback+0.129
 disagrees+0.127
 teaches+0.122
Neg logit contributions
 pine-0.191
 which-0.189
str-0.189
 cherry-0.187
 peach-0.185
Token's
Token ID338
Feature activation+3.87e-02

Pos logit contributions
 agrees+0.038
 Give+0.037
ives+0.036
 Go+0.034
 Share+0.033
Neg logit contributions
str-0.062
field-0.060
 pine-0.056
 which-0.055
 wheat-0.055
Token a
Token ID257
Feature activation+3.12e-02

Pos logit contributions
 Give+0.065
 Go+0.065
 agrees+0.063
 You+0.056
 teaches+0.056
Neg logit contributions
field-0.078
str-0.077
 sudden-0.075
 raft-0.067
pron-0.067
Token diamond
Token ID15291
Feature activation-4.52e-03

Pos logit contributions
 Give+0.063
 Go+0.060
 agrees+0.059
 Share+0.054
 consequences+0.054
Neg logit contributions
field-0.050
str-0.049
 sudden-0.044
aster-0.042
ittens-0.042
Token!
Token ID0
Feature activation+9.14e-02

Pos logit contributions
str+0.009
field+0.009
 sudden+0.008
 wheat+0.008
 raft+0.008
Neg logit contributions
 agrees-0.008
 Give-0.008
 Go-0.007
ives-0.007
 Share-0.007
Token The
Token ID383
Feature activation+1.02e-02

Pos logit contributions
 Give+0.040
 agrees+0.038
 Go+0.037
 Come+0.035
 Share+0.035
Neg logit contributions
str-0.067
field-0.064
 sudden-0.061
 raft-0.058
 wheat-0.058
Token baby
Token ID5156
Feature activation-2.50e-03

Pos logit contributions
 Give+0.024
 agrees+0.024
 Go+0.023
 Share+0.022
 Come+0.022
Neg logit contributions
str-0.014
field-0.014
 sudden-0.011
 raft-0.011
aster-0.011
Token's
Token ID338
Feature activation+4.62e-02

Pos logit contributions
field+0.005
str+0.005
 sudden+0.004
 raft+0.004
aster+0.004
Neg logit contributions
 agrees-0.004
 Give-0.004
 Go-0.004
 Come-0.004
ives-0.004
Token name
Token ID1438
Feature activation+4.38e-02

Pos logit contributions
 Give+0.094
 agrees+0.094
 Go+0.090
ives+0.084
 Share+0.082
Neg logit contributions
str-0.077
field-0.077
 sudden-0.071
 wheat-0.067
 raft-0.067
Token was
Token ID373
Feature activation+3.06e-02

Pos logit contributions
 agrees+0.091
 Give+0.089
 Go+0.086
ives+0.082
 Come+0.078
Neg logit contributions
str-0.071
field-0.071
 sudden-0.065
 wheat-0.063
 raft-0.061
Token Ag
Token ID2449
Feature activation+9.90e-02

Pos logit contributions
 Give+0.057
 Go+0.056
 agrees+0.052
 You+0.051
 Come+0.050
Neg logit contributions
field-0.058
 sudden-0.055
str-0.053
 raft-0.052
aman-0.049
Tokennes
Token ID2516
Feature activation+5.20e-02

Pos logit contributions
 agrees+0.190
 Give+0.180
 Go+0.167
 consequences+0.167
 acts+0.165
Neg logit contributions
str-0.193
field-0.174
pron-0.158
aster-0.156
Wil-0.156
Token.
Token ID13
Feature activation+3.55e-02

Pos logit contributions
 Give+0.090
 agrees+0.089
 Go+0.086
ives+0.078
 Share+0.076
Neg logit contributions
field-0.101
str-0.100
 sudden-0.097
 wheat-0.091
 raft-0.091
Token Ag
Token ID2449
Feature activation+6.99e-02

Pos logit contributions
 agrees+0.073
 Give+0.070
 Go+0.068
ives+0.066
 disagrees+0.063
Neg logit contributions
str-0.059
field-0.058
 sudden-0.055
 raft-0.053
 which-0.052
Tokennes
Token ID2516
Feature activation+3.60e-02

Pos logit contributions
 agrees+0.132
 Give+0.128
 Go+0.119
 consequences+0.118
 Share+0.118
Neg logit contributions
str-0.130
field-0.118
 which-0.108
 sudden-0.107
Wil-0.106
Token loved
Token ID6151
Feature activation+3.00e-02

Pos logit contributions
 agrees+0.073
 Give+0.072
 Go+0.069
 Share+0.064
ives+0.064
Neg logit contributions
field-0.060
str-0.059
 sudden-0.057
 wheat-0.054
 raft-0.054
Token deep
Token ID2769
Feature activation+7.31e-03

Pos logit contributions
field+0.026
str+0.023
aman+0.021
 sudden+0.021
Wil+0.021
Neg logit contributions
 Give-0.034
 agrees-0.033
ives-0.032
 Go-0.030
 aback-0.030
Token breath
Token ID8033
Feature activation+3.58e-04

Pos logit contributions
 agrees+0.016
 Give+0.015
 aback+0.014
 Go+0.014
 acts+0.013
Neg logit contributions
str-0.012
field-0.012
 sudden-0.010
chard-0.010
aman-0.010
Token and
Token ID290
Feature activation-5.18e-02

Pos logit contributions
 agrees+0.001
 Give+0.001
 Go+0.001
ives+0.001
 aback+0.001
Neg logit contributions
str-0.001
field-0.001
 sudden-0.001
 wheat-0.001
 raft-0.001
Token shouted
Token ID17293
Feature activation+5.13e-02

Pos logit contributions
str+0.088
field+0.084
Wil+0.075
pron+0.074
aman+0.074
Neg logit contributions
 agrees-0.111
 Give-0.101
 Go-0.096
 Come-0.093
ives-0.093
Token "
Token ID366
Feature activation+5.32e-02

Pos logit contributions
 agrees+0.123
 Give+0.121
 Go+0.116
ives+0.113
 disagrees+0.110
Neg logit contributions
str-0.063
field-0.062
 sudden-0.057
 which-0.056
 wheat-0.056
TokenST
Token ID2257
Feature activation+9.86e-02

Pos logit contributions
 Give+0.102
 agrees+0.100
 Go+0.097
ives+0.093
 Come+0.089
Neg logit contributions
field-0.097
str-0.096
 sudden-0.088
 raft-0.087
 wheat-0.086
TokenOP
Token ID3185
Feature activation+5.74e-02

Pos logit contributions
 agrees+0.209
 Give+0.198
 acts+0.183
 Go+0.183
 disagrees+0.183
Neg logit contributions
str-0.148
aster-0.139
field-0.138
aman-0.130
Wil-0.124
Token!"
Token ID2474
Feature activation-1.02e-03

Pos logit contributions
 agrees+0.104
 Give+0.097
 Go+0.094
ives+0.088
 Share+0.087
Neg logit contributions
str-0.109
field-0.108
 sudden-0.104
 pine-0.098
 wheat-0.097
Token
Token ID220
Feature activation+4.18e-02

Pos logit contributions
str+0.002
field+0.002
 raft+0.002
 sudden+0.002
 wheat+0.002
Neg logit contributions
 Give-0.002
 Go-0.002
 agrees-0.002
ives-0.002
 Come-0.001
Token
Token ID198
Feature activation-3.96e-02

Pos logit contributions
 agrees+0.094
 Give+0.093
 Go+0.088
ives+0.084
 disagrees+0.084
Neg logit contributions
str-0.060
field-0.058
 sudden-0.053
 wheat-0.051
 which-0.050
Token
Token ID198
Feature activation-4.14e-02

Pos logit contributions
field+0.066
str+0.064
 sudden+0.058
 raft+0.058
 wheat+0.057
Neg logit contributions
 Give-0.082
 agrees-0.082
 Go-0.080
ives-0.075
 Come-0.072
Token man
Token ID582
Feature activation-1.59e-02

Pos logit contributions
str+0.018
field+0.018
 sudden+0.017
 wheat+0.016
 raft+0.016
Neg logit contributions
 agrees-0.027
 Give-0.027
 Go-0.026
ives-0.025
 Share-0.024
Token stepped
Token ID10764
Feature activation+4.71e-02

Pos logit contributions
field+0.029
str+0.028
 wheat+0.026
 sudden+0.026
 raft+0.026
Neg logit contributions
 agrees-0.031
 Give-0.030
 Go-0.028
ives-0.026
 teaches-0.026
Token closer
Token ID5699
Feature activation+1.37e-04

Pos logit contributions
 agrees+0.093
 Give+0.092
 Go+0.089
ives+0.084
 Share+0.081
Neg logit contributions
field-0.081
str-0.080
 sudden-0.077
 wheat-0.072
 raft-0.072
Token and
Token ID290
Feature activation-5.37e-02

Pos logit contributions
 agrees+0.000
 Give+0.000
 Go+0.000
ives+0.000
 aback+0.000
Neg logit contributions
str-0.000
field-0.000
 sudden-0.000
pron-0.000
 raft-0.000
Token said
Token ID531
Feature activation+2.70e-02

Pos logit contributions
field+0.088
str+0.082
ittens+0.082
 sudden+0.079
 wheat+0.077
Neg logit contributions
 agrees-0.111
 Give-0.106
 aback-0.102
 Go-0.098
 Come-0.097
Token gru
Token ID22848
Feature activation+9.80e-02

Pos logit contributions
 agrees+0.067
 Give+0.066
 Go+0.064
ives+0.062
 Share+0.060
Neg logit contributions
str-0.033
field-0.033
 sudden-0.030
 wheat-0.028
 raft-0.027
Tokenff
Token ID487
Feature activation+2.99e-02

Pos logit contributions
ives+0.189
 Go+0.186
 Give+0.186
 agrees+0.175
 teaches+0.175
Neg logit contributions
 pine-0.131
str-0.130
 wheat-0.124
cker-0.123
 mel-0.122
Tokenly
Token ID306
Feature activation+1.30e-02

Pos logit contributions
 agrees+0.056
 Give+0.055
 Go+0.053
ives+0.050
 Share+0.048
Neg logit contributions
str-0.055
field-0.054
 sudden-0.051
 wheat-0.049
 raft-0.048
Token,
Token ID11
Feature activation-1.95e-02

Pos logit contributions
 agrees+0.032
 Give+0.031
 Go+0.030
ives+0.029
 Share+0.028
Neg logit contributions
str-0.016
field-0.016
 sudden-0.015
 wheat-0.014
 raft-0.014
Token "
Token ID366
Feature activation+4.30e-02

Pos logit contributions
field+0.039
str+0.038
 sudden+0.036
 wheat+0.035
 raft+0.035
Neg logit contributions
 Give-0.034
 agrees-0.032
 Go-0.032
 Come-0.029
 You-0.029
TokenYou
Token ID1639
Feature activation+5.43e-02

Pos logit contributions
 Give+0.080
 agrees+0.077
 Go+0.076
ives+0.069
 Come+0.068
Neg logit contributions
field-0.082
str-0.079
 raft-0.074
 sudden-0.073
 wheat-0.072
Token so
Token ID523
Feature activation-1.59e-02

Pos logit contributions
 agrees+0.007
 Give+0.007
 Go+0.007
ives+0.006
 Share+0.006
Neg logit contributions
field-0.005
str-0.005
 sudden-0.005
 raft-0.005
aster-0.005
Token surprised
Token ID6655
Feature activation-3.63e-03

Pos logit contributions
str+0.024
field+0.024
 sudden+0.023
 wheat+0.021
 raft+0.021
Neg logit contributions
 agrees-0.035
 Give-0.034
 Go-0.033
ives-0.032
 Come-0.030
Token that
Token ID326
Feature activation-2.80e-02

Pos logit contributions
field+0.005
str+0.005
 sudden+0.005
pron+0.004
 raft+0.004
Neg logit contributions
 agrees-0.008
 Give-0.008
 Go-0.008
ives-0.008
 Come-0.007
Token they
Token ID484
Feature activation+1.22e-02

Pos logit contributions
str+0.036
field+0.035
 sudden+0.032
pron+0.030
 raft+0.029
Neg logit contributions
 Give-0.068
 agrees-0.067
 Go-0.067
 You-0.062
 Share-0.061
Token started
Token ID2067
Feature activation+8.14e-03

Pos logit contributions
 agrees+0.025
 Give+0.024
 Go+0.023
ives+0.022
 disagrees+0.021
Neg logit contributions
str-0.021
field-0.021
 sudden-0.019
 wheat-0.019
aster-0.019
Token tor
Token ID7332
Feature activation+9.80e-02

Pos logit contributions
 agrees+0.017
 Give+0.017
 Go+0.016
ives+0.015
 Share+0.015
Neg logit contributions
str-0.013
field-0.013
 sudden-0.012
 wheat-0.012
 raft-0.012
Tokenoll
Token ID692
Feature activation-2.07e-02

Pos logit contributions
 Give+0.145
 agrees+0.144
 wants+0.143
 disagrees+0.141
 demands+0.138
Neg logit contributions
field-0.165
 which-0.154
 raft-0.145
 reflecting-0.145
aman-0.145
Token away
Token ID1497
Feature activation-6.44e-03

Pos logit contributions
field+0.028
str+0.028
 sudden+0.026
 wheat+0.024
 raft+0.023
Neg logit contributions
 agrees-0.049
 Give-0.048
 Go-0.047
ives-0.045
 disagrees-0.042
Token.
Token ID13
Feature activation+2.29e-02

Pos logit contributions
str+0.011
field+0.011
 sudden+0.010
pron+0.010
 wheat+0.010
Neg logit contributions
 agrees-0.013
 Give-0.013
 Go-0.012
ives-0.011
 Share-0.011
Token
Token ID220
Feature activation+3.17e-02

Pos logit contributions
 agrees+0.046
 Give+0.045
 Go+0.043
ives+0.042
 Share+0.040
Neg logit contributions
field-0.039
str-0.039
 sudden-0.036
 wheat-0.034
 raft-0.033
Token
Token ID198
Feature activation-3.97e-02

Pos logit contributions
 agrees+0.074
 Give+0.073
 Go+0.068
 acts+0.067
 disagrees+0.066
Neg logit contributions
str-0.044
field-0.042
 sudden-0.038
Wil-0.035
 which-0.035
Token price
Token ID2756
Feature activation+9.85e-03

Pos logit contributions
 agrees+0.027
 Give+0.027
 Go+0.026
ives+0.025
 Share+0.025
Neg logit contributions
field-0.012
str-0.012
 sudden-0.011
 wheat-0.010
 raft-0.010
Token tag
Token ID7621
Feature activation+1.93e-03

Pos logit contributions
 Give+0.022
 agrees+0.022
 Go+0.021
ives+0.020
 Share+0.020
Neg logit contributions
field-0.014
str-0.013
 which-0.013
 wheat-0.013
 sudden-0.012
Token.
Token ID13
Feature activation+6.60e-02

Pos logit contributions
 agrees+0.004
 Give+0.004
 Go+0.004
ives+0.004
 Share+0.004
Neg logit contributions
str-0.003
field-0.003
 sudden-0.003
 wheat-0.003
 raft-0.003
Token It
Token ID632
Feature activation+1.39e-02

Pos logit contributions
 agrees+0.146
 Give+0.140
 Go+0.134
ives+0.128
 disagrees+0.128
Neg logit contributions
str-0.093
field-0.092
 sudden-0.089
 wheat-0.086
 which-0.084
Token said
Token ID531
Feature activation+5.69e-02

Pos logit contributions
 agrees+0.027
 Give+0.027
 Go+0.025
ives+0.023
 teaches+0.023
Neg logit contributions
str-0.025
field-0.025
 sudden-0.023
 wheat-0.022
 raft-0.022
Token $
Token ID720
Feature activation+9.77e-02

Pos logit contributions
 agrees+0.124
 Give+0.124
 Go+0.120
ives+0.111
 Share+0.109
Neg logit contributions
field-0.092
str-0.089
 sudden-0.084
 raft-0.077
 wheat-0.076
Token1
Token ID16
Feature activation+3.29e-02

Pos logit contributions
 agrees+0.184
 Give+0.176
 teaches+0.175
 Go+0.168
 acts+0.164
Neg logit contributions
str-0.158
 sudden-0.155
 wheat-0.155
field-0.150
 pine-0.147
Token.
Token ID13
Feature activation+6.00e-02

Pos logit contributions
 agrees+0.076
 Give+0.076
 Go+0.073
ives+0.069
 Share+0.067
Neg logit contributions
str-0.045
field-0.045
 sudden-0.041
 wheat-0.039
 raft-0.038
Token99
Token ID2079
Feature activation+5.45e-02

Pos logit contributions
 agrees+0.121
 Give+0.120
 Go+0.115
ives+0.109
 Share+0.105
Neg logit contributions
field-0.100
str-0.100
 sudden-0.093
 wheat-0.089
 raft-0.087
Token per
Token ID583
Feature activation+5.11e-02

Pos logit contributions
 Give+0.111
 agrees+0.107
 Go+0.105
 Come+0.099
 Share+0.098
Neg logit contributions
str-0.092
field-0.089
 which-0.079
 wheat-0.078
pron-0.078
Token pound
Token ID14896
Feature activation+6.44e-04

Pos logit contributions
 agrees+0.090
 Give+0.088
 Go+0.083
 disagrees+0.080
 Share+0.077
Neg logit contributions
str-0.095
field-0.090
 pine-0.085
 wheat-0.085
 branch-0.083
Token her
Token ID607
Feature activation+2.27e-02

Pos logit contributions
str+0.047
field+0.046
 raft+0.042
 sudden+0.041
 wheat+0.039
Neg logit contributions
 Give-0.063
 Go-0.062
 agrees-0.062
 You-0.057
 teaches-0.056
Token thoughtful
Token ID22677
Feature activation+2.17e-02

Pos logit contributions
 agrees+0.046
 Give+0.045
 Go+0.043
ives+0.041
 Share+0.039
Neg logit contributions
str-0.039
field-0.039
 sudden-0.036
 wheat-0.034
 raft-0.034
Token idea
Token ID2126
Feature activation+6.37e-03

Pos logit contributions
 Give+0.038
 agrees+0.037
 Go+0.037
ives+0.035
 Share+0.033
Neg logit contributions
field-0.041
str-0.040
 sudden-0.039
 wheat-0.038
 raft-0.037
Token.
Token ID13
Feature activation+3.78e-02

Pos logit contributions
 agrees+0.012
 Give+0.012
 Go+0.011
ives+0.011
 Share+0.010
Neg logit contributions
field-0.011
str-0.011
 sudden-0.011
 wheat-0.010
 raft-0.010
Token<|endoftext|>
Token ID50256
Feature activation+5.20e-02

Pos logit contributions
 agrees+0.073
 Give+0.072
 Go+0.069
ives+0.065
 Share+0.063
Neg logit contributions
field-0.067
str-0.067
 sudden-0.063
 wheat-0.060
 raft-0.059
TokenJes
Token ID22290
Feature activation+9.76e-02

Pos logit contributions
 agrees+0.128
 Give+0.121
 Go+0.117
ives+0.113
 demands+0.112
Neg logit contributions
field-0.069
str-0.063
 sudden-0.057
 pine-0.054
 wheat-0.054
Tokense
Token ID325
Feature activation-2.67e-02

Pos logit contributions
 agrees+0.147
 Give+0.144
 Go+0.136
ives+0.126
 Share+0.125
Neg logit contributions
field-0.207
str-0.205
 sudden-0.187
aman-0.184
 pine-0.183
Token was
Token ID373
Feature activation+4.11e-03

Pos logit contributions
field+0.049
str+0.049
 raft+0.047
 wheat+0.044
pron+0.044
Neg logit contributions
 agrees-0.048
 Give-0.048
 Go-0.046
ives-0.042
 acts-0.042
Token very
Token ID845
Feature activation-3.21e-03

Pos logit contributions
 Give+0.008
 Go+0.008
 agrees+0.008
 Come+0.007
 You+0.007
Neg logit contributions
field-0.007
str-0.007
 raft-0.006
 sudden-0.006
 pine-0.006
Token excited
Token ID6568
Feature activation+1.37e-02

Pos logit contributions
field+0.006
str+0.006
 raft+0.005
aman+0.005
 sudden+0.005
Neg logit contributions
 agrees-0.006
 Give-0.006
 Go-0.006
 Share-0.005
 Come-0.005
Token when
Token ID618
Feature activation-4.95e-02

Pos logit contributions
 agrees+0.026
 Give+0.026
 Go+0.025
ives+0.023
 Come+0.022
Neg logit contributions
str-0.025
field-0.025
 sudden-0.023
 wheat-0.022
 raft-0.022
Token It
Token ID632
Feature activation+1.33e-02

Pos logit contributions
 agrees+0.122
 Give+0.119
 Go+0.113
ives+0.108
 disagrees+0.102
Neg logit contributions
str-0.141
field-0.141
 sudden-0.132
 wheat-0.129
 raft-0.127
Token says
Token ID1139
Feature activation+6.57e-02

Pos logit contributions
 Give+0.018
 agrees+0.017
 Go+0.017
ives+0.016
 Come+0.015
Neg logit contributions
field-0.030
str-0.029
 sudden-0.028
 wheat-0.028
 raft-0.028
Token:
Token ID25
Feature activation+2.99e-02

Pos logit contributions
 Give+0.121
 agrees+0.120
 Go+0.116
ives+0.109
 Share+0.105
Neg logit contributions
field-0.127
str-0.121
 sudden-0.118
 pine-0.110
 raft-0.109
Token "
Token ID366
Feature activation+7.57e-02

Pos logit contributions
 agrees+0.059
 Give+0.059
 Go+0.057
ives+0.053
 Share+0.052
Neg logit contributions
field-0.052
str-0.051
 sudden-0.048
 wheat-0.045
 raft-0.045
TokenNo
Token ID2949
Feature activation+1.07e-02

Pos logit contributions
 agrees+0.158
 Give+0.156
 Go+0.150
ives+0.142
 Share+0.137
Neg logit contributions
str-0.124
field-0.124
 sudden-0.115
 wheat-0.109
 raft-0.107
Token!
Token ID0
Feature activation+9.73e-02

Pos logit contributions
 agrees+0.028
 Give+0.027
 Go+0.027
ives+0.026
 Share+0.025
Neg logit contributions
str-0.012
field-0.011
 sudden-0.011
 wheat-0.010
 raft-0.010
Token You
Token ID921
Feature activation+5.09e-02

Pos logit contributions
 agrees+0.146
 aback+0.121
ives+0.120
 disagrees+0.117
 Give+0.110
Neg logit contributions
aster-0.199
str-0.198
field-0.187
 wheat-0.186
 Wil-0.186
Token are
Token ID389
Feature activation+2.52e-02

Pos logit contributions
 agrees+0.073
 Give+0.072
 Go+0.068
ives+0.063
 Share+0.060
Neg logit contributions
field-0.116
str-0.115
 sudden-0.109
 wheat-0.105
 raft-0.105
Token bad
Token ID2089
Feature activation+3.51e-02

Pos logit contributions
 agrees+0.060
 Give+0.058
 Go+0.055
ives+0.054
 Come+0.052
Neg logit contributions
str-0.034
field-0.033
 sudden-0.031
 wheat-0.030
 which-0.029
Token!
Token ID0
Feature activation+9.30e-02

Pos logit contributions
 agrees+0.077
 Give+0.073
ives+0.070
 aback+0.069
 Go+0.069
Neg logit contributions
 wheat-0.046
 sudden-0.045
 which-0.044
field-0.043
str-0.043
Token You
Token ID921
Feature activation+5.05e-02

Pos logit contributions
 agrees+0.108
ives+0.083
 aback+0.082
 disagrees+0.074
viously+0.066
Neg logit contributions
aster-0.226
str-0.221
 Wil-0.211
 wheat-0.208
 pine-0.207
Token that
Token ID326
Feature activation-1.38e-02

Pos logit contributions
field+0.030
str+0.028
pron+0.027
 sudden+0.026
cle+0.025
Neg logit contributions
 Give-0.043
 Go-0.042
 agrees-0.042
 You-0.039
 Believe-0.038
Token the
Token ID262
Feature activation+8.40e-03

Pos logit contributions
field+0.020
str+0.020
 sudden+0.018
pron+0.017
 raft+0.017
Neg logit contributions
 Give-0.031
 agrees-0.031
 Go-0.031
 Come-0.028
 acts-0.028
Token zipper
Token ID48992
Feature activation-4.20e-03

Pos logit contributions
 agrees+0.020
 Give+0.020
 Go+0.020
ives+0.019
 Share+0.018
Neg logit contributions
str-0.011
field-0.011
 sudden-0.010
 wheat-0.009
 raft-0.009
Token had
Token ID550
Feature activation+2.16e-02

Pos logit contributions
str+0.007
field+0.006
 sudden+0.006
 raft+0.006
 which+0.006
Neg logit contributions
 agrees-0.009
 Give-0.009
 Go-0.009
ives-0.008
 Share-0.008
Token become
Token ID1716
Feature activation+2.06e-02

Pos logit contributions
 agrees+0.043
 Give+0.043
 Go+0.041
ives+0.039
 Share+0.037
Neg logit contributions
field-0.037
str-0.037
 sudden-0.034
 wheat-0.033
 raft-0.032
Token unst
Token ID15014
Feature activation+9.71e-02

Pos logit contributions
 agrees+0.039
 Give+0.038
 Go+0.037
 aback+0.035
ives+0.034
Neg logit contributions
field-0.037
str-0.036
 sudden-0.032
aster-0.032
pron-0.031
Tokenuck
Token ID1347
Feature activation+5.74e-03

Pos logit contributions
 agrees+0.192
 teaches+0.177
 Give+0.174
 Go+0.164
 disagrees+0.163
Neg logit contributions
str-0.168
 sudden-0.160
field-0.158
 raft-0.156
Wil-0.154
Token.
Token ID13
Feature activation+3.03e-02

Pos logit contributions
 agrees+0.013
 Give+0.013
 Go+0.012
ives+0.012
 Share+0.011
Neg logit contributions
field-0.009
str-0.009
 sudden-0.008
 wheat-0.007
 raft-0.007
Token She
Token ID1375
Feature activation-1.10e-02

Pos logit contributions
 agrees+0.068
 Give+0.063
 Go+0.061
 acts+0.061
 disagrees+0.060
Neg logit contributions
str-0.044
field-0.041
 which-0.038
 sudden-0.038
 Wil-0.037
Token was
Token ID373
Feature activation+1.37e-03

Pos logit contributions
field+0.019
str+0.019
 sudden+0.017
 wheat+0.017
aster+0.017
Neg logit contributions
 agrees-0.022
 Give-0.021
 Go-0.020
 teaches-0.019
ives-0.019
Token relieved
Token ID22598
Feature activation+9.86e-03

Pos logit contributions
 agrees+0.003
 Give+0.002
 Go+0.002
 aback+0.002
 Believe+0.002
Neg logit contributions
field-0.003
str-0.002
 sudden-0.002
 raft-0.002
aman-0.002
Token for
Token ID329
Feature activation-2.17e-02

Pos logit contributions
str+0.036
 sudden+0.035
field+0.034
 wheat+0.030
Wil+0.030
Neg logit contributions
 agrees-0.038
 Give-0.038
 Go-0.037
ives-0.035
 Come-0.033
Token being
Token ID852
Feature activation+2.69e-02

Pos logit contributions
str+0.036
field+0.035
 sudden+0.031
 raft+0.030
 wheat+0.029
Neg logit contributions
 Give-0.045
 agrees-0.045
 Go-0.044
ives-0.040
 Share-0.040
Token so
Token ID523
Feature activation-8.14e-03

Pos logit contributions
 agrees+0.058
 Give+0.058
 Go+0.056
ives+0.053
 Share+0.051
Neg logit contributions
field-0.042
str-0.041
 sudden-0.038
 raft-0.036
 wheat-0.035
Token well
Token ID880
Feature activation-1.49e-03

Pos logit contributions
field+0.014
str+0.014
 sudden+0.013
 wheat+0.012
 raft+0.012
Neg logit contributions
 agrees-0.016
 Give-0.016
 Go-0.016
ives-0.015
 Share-0.014
Token-
Token ID12
Feature activation+2.00e-03

Pos logit contributions
str+0.003
field+0.003
 sudden+0.002
 wheat+0.002
 raft+0.002
Neg logit contributions
 agrees-0.003
 Give-0.003
 Go-0.003
ives-0.003
 Share-0.002
Tokenbeh
Token ID20709
Feature activation+9.69e-02

Pos logit contributions
 Give+0.003
 agrees+0.003
 Go+0.003
ives+0.003
 Come+0.002
Neg logit contributions
field-0.005
str-0.004
 sudden-0.004
 raft-0.004
 wheat-0.004
Tokenaved
Token ID9586
Feature activation+1.65e-02

Pos logit contributions
 agrees+0.195
 consequences+0.191
 begs+0.189
 demands+0.188
 Give+0.187
Neg logit contributions
field-0.139
str-0.133
 boarded-0.126
ding-0.120
 roasted-0.119
Token and
Token ID290
Feature activation-5.81e-02

Pos logit contributions
 agrees+0.036
 Give+0.035
 Go+0.034
ives+0.033
 Share+0.031
Neg logit contributions
field-0.025
str-0.025
 sudden-0.023
 wheat-0.022
 raft-0.022
Token Jack
Token ID3619
Feature activation+1.99e-02

Pos logit contributions
str+0.089
field+0.087
 sudden+0.078
 wheat+0.077
 raft+0.075
Neg logit contributions
 Give-0.127
 agrees-0.126
 Go-0.122
ives-0.113
 Share-0.112
Token was
Token ID373
Feature activation+5.38e-03

Pos logit contributions
 agrees+0.041
 Give+0.041
 Go+0.039
ives+0.037
 Share+0.036
Neg logit contributions
str-0.032
field-0.032
 sudden-0.030
 wheat-0.028
 raft-0.028
Token delighted
Token ID20707
Feature activation+1.82e-02

Pos logit contributions
 agrees+0.011
 Give+0.011
 Go+0.011
 teaches+0.010
 Share+0.010
Neg logit contributions
str-0.009
field-0.009
 sudden-0.008
 raft-0.008
 wheat-0.008
Token a
Token ID257
Feature activation+1.31e-02

Pos logit contributions
 Give+0.044
 agrees+0.044
 Go+0.042
ives+0.039
 Share+0.039
Neg logit contributions
field-0.031
str-0.030
 sudden-0.027
pron-0.026
 raft-0.025
Token sign
Token ID1051
Feature activation+5.80e-03

Pos logit contributions
 Give+0.029
 agrees+0.027
 Go+0.026
 demands+0.024
 Share+0.024
Neg logit contributions
field-0.020
str-0.019
aster-0.018
aman-0.017
Wil-0.017
Token that
Token ID326
Feature activation-1.32e-02

Pos logit contributions
 Give+0.016
 agrees+0.015
 Go+0.015
ives+0.014
 Share+0.014
Neg logit contributions
field-0.006
str-0.006
 sudden-0.006
pron-0.005
 raft-0.005
Token said
Token ID531
Feature activation+5.21e-02

Pos logit contributions
str+0.021
field+0.021
 sudden+0.020
 wheat+0.018
pron+0.018
Neg logit contributions
 Give-0.028
 agrees-0.028
 Go-0.027
ives-0.025
 teaches-0.024
Token "
Token ID366
Feature activation+8.52e-02

Pos logit contributions
 Give+0.097
 Go+0.093
 agrees+0.093
ives+0.084
 Share+0.083
Neg logit contributions
field-0.101
str-0.097
 sudden-0.096
aman-0.087
 raft-0.086
TokenRE
Token ID2200
Feature activation+9.68e-02

Pos logit contributions
 Give+0.166
 agrees+0.162
 Go+0.160
ives+0.151
 Come+0.144
Neg logit contributions
field-0.151
str-0.149
 sudden-0.139
 raft-0.134
 wheat-0.133
TokenVERS
Token ID28884
Feature activation+5.44e-02

Pos logit contributions
 agrees+0.227
 Give+0.213
 Go+0.203
ives+0.198
 disagrees+0.196
Neg logit contributions
field-0.134
str-0.134
 wheat-0.122
 raft-0.118
 pine-0.116
TokenE
Token ID36
Feature activation+7.01e-02

Pos logit contributions
 agrees+0.116
 Give+0.108
 Go+0.103
ives+0.102
 acts+0.100
Neg logit contributions
str-0.087
field-0.083
 sudden-0.082
 wheat-0.077
Wil-0.075
Token".
Token ID1911
Feature activation+4.39e-02

Pos logit contributions
 agrees+0.118
 Give+0.117
 Go+0.111
ives+0.104
 Share+0.100
Neg logit contributions
field-0.141
str-0.141
 sudden-0.132
 wheat-0.128
 raft-0.126
Token L
Token ID406
Feature activation+3.00e-02

Pos logit contributions
 agrees+0.089
 Give+0.083
ives+0.080
 Go+0.079
 disagrees+0.078
Neg logit contributions
field-0.071
str-0.069
 sudden-0.068
 which-0.065
 wheat-0.064
Tokenila
Token ID10102
Feature activation+2.42e-02

Pos logit contributions
 agrees+0.068
 Give+0.066
 Go+0.063
ives+0.061
 aback+0.061
Neg logit contributions
str-0.045
field-0.041
 sudden-0.036
 peach-0.035
 wheat-0.034
TokenNo
Token ID2949
Feature activation+6.91e-03

Pos logit contributions
 Give+0.130
 agrees+0.127
 Go+0.123
ives+0.115
 Come+0.112
Neg logit contributions
field-0.125
str-0.122
 sudden-0.111
 raft-0.110
 wheat-0.108
Token,
Token ID11
Feature activation+9.43e-04

Pos logit contributions
 agrees+0.018
 Give+0.017
 Go+0.017
ives+0.017
 Share+0.016
Neg logit contributions
str-0.008
field-0.008
 sudden-0.007
 wheat-0.007
 raft-0.007
Token no
Token ID645
Feature activation-1.88e-02

Pos logit contributions
 Give+0.002
 agrees+0.002
 Go+0.002
ives+0.002
 Come+0.002
Neg logit contributions
field-0.002
str-0.002
 sudden-0.002
 wheat-0.002
 raft-0.002
Token,
Token ID11
Feature activation-6.66e-03

Pos logit contributions
field+0.030
str+0.030
 sudden+0.027
 wheat+0.026
 raft+0.026
Neg logit contributions
 agrees-0.040
 Give-0.040
 Go-0.038
ives-0.036
 Share-0.035
Token no
Token ID645
Feature activation-1.73e-02

Pos logit contributions
field+0.013
str+0.012
 sudden+0.012
 wheat+0.011
 raft+0.011
Neg logit contributions
 Give-0.013
 Go-0.013
 agrees-0.012
 You-0.011
ives-0.011
Token!
Token ID0
Feature activation+9.67e-02

Pos logit contributions
field+0.031
str+0.031
 sudden+0.029
 wheat+0.027
 raft+0.027
Neg logit contributions
 agrees-0.034
 Give-0.033
 Go-0.032
ives-0.030
 Share-0.029
Token You
Token ID921
Feature activation+5.80e-02

Pos logit contributions
 agrees+0.125
 aback+0.094
 disagrees+0.091
ives+0.090
 teaches+0.089
Neg logit contributions
str-0.225
aster-0.220
 sudden-0.211
 cherry-0.210
field-0.210
Token are
Token ID389
Feature activation+2.60e-02

Pos logit contributions
 Give+0.109
 agrees+0.108
 Go+0.104
ives+0.099
 Share+0.094
Neg logit contributions
field-0.107
str-0.106
 sudden-0.100
 wheat-0.095
 raft-0.094
Token a
Token ID257
Feature activation+3.05e-02

Pos logit contributions
 agrees+0.057
 Give+0.057
 Go+0.055
ives+0.052
 Share+0.050
Neg logit contributions
field-0.040
str-0.040
 sudden-0.036
 wheat-0.034
 raft-0.034
Token bad
Token ID2089
Feature activation+2.62e-02

Pos logit contributions
 agrees+0.062
 Give+0.061
 Go+0.059
ives+0.056
 Share+0.053
Neg logit contributions
field-0.052
str-0.051
 sudden-0.048
 wheat-0.045
 raft-0.045
Token girl
Token ID2576
Feature activation+2.93e-02

Pos logit contributions
 agrees+0.058
 Give+0.055
 Go+0.054
ives+0.053
 Share+0.050
Neg logit contributions
str-0.040
field-0.039
 sudden-0.036
 wheat-0.036
 pine-0.036
Token Mia
Token ID32189
Feature activation+9.69e-03

Pos logit contributions
str+0.105
field+0.100
pron+0.088
Wil+0.087
 sudden+0.086
Neg logit contributions
 agrees-0.145
 Give-0.142
 Go-0.137
 You-0.128
 Come-0.128
Token realised
Token ID19169
Feature activation-1.66e-02

Pos logit contributions
 agrees+0.020
 Give+0.020
 Go+0.019
ives+0.018
 disagrees+0.017
Neg logit contributions
str-0.016
field-0.016
 wheat-0.014
 sudden-0.014
 raft-0.014
Token that
Token ID326
Feature activation-1.13e-02

Pos logit contributions
field+0.025
str+0.024
 sudden+0.023
pron+0.023
cle+0.021
Neg logit contributions
 Give-0.036
 agrees-0.035
 Go-0.035
 acts-0.032
 You-0.032
Token it
Token ID340
Feature activation+3.51e-02

Pos logit contributions
str+0.015
field+0.015
 sudden+0.014
pron+0.013
 raft+0.012
Neg logit contributions
 Give-0.027
 agrees-0.027
 Go-0.026
 acts-0.025
 Believe-0.024
Token wasn
Token ID2492
Feature activation+5.77e-02

Pos logit contributions
 Give+0.080
 agrees+0.080
 Go+0.077
ives+0.075
 Share+0.074
Neg logit contributions
str-0.051
field-0.050
 sudden-0.044
 blowing-0.041
 which-0.040
Tokenâ
Token ID22940
Feature activation+9.65e-02

Pos logit contributions
 Give+0.091
 Go+0.089
 agrees+0.086
ives+0.077
 Share+0.076
Neg logit contributions
field-0.125
str-0.123
 sudden-0.118
 wheat-0.114
 raft-0.111
Token
Token ID26391
Feature activation+6.76e-02

Pos logit contributions
 agrees+0.179
 Give+0.168
 Go+0.155
ives+0.147
 demands+0.145
Neg logit contributions
str-0.184
field-0.177
aster-0.170
pron-0.154
aman-0.152
Token
Token ID8151
Feature activation+2.69e-02

Pos logit contributions
 agrees+0.132
 Give+0.131
 Go+0.124
ives+0.117
 Share+0.114
Neg logit contributions
str-0.119
field-0.118
 sudden-0.110
 wheat-0.103
 raft-0.102
Tokent
Token ID83
Feature activation+2.18e-02

Pos logit contributions
 Give+0.044
 agrees+0.043
 Go+0.042
ives+0.040
 Share+0.036
Neg logit contributions
field-0.056
str-0.054
 sudden-0.053
 wheat-0.051
 raft-0.050
Token a
Token ID257
Feature activation+3.19e-02

Pos logit contributions
 agrees+0.040
 Give+0.040
 Go+0.039
 aback+0.037
 You+0.035
Neg logit contributions
field-0.041
str-0.037
 sudden-0.037
 raft-0.034
aman-0.034
Token star
Token ID3491
Feature activation+4.43e-03

Pos logit contributions
 Give+0.070
 agrees+0.070
 Go+0.067
ives+0.063
 Share+0.062
Neg logit contributions
field-0.047
str-0.047
 sudden-0.042
 wheat-0.040
 raft-0.040
Token the
Token ID262
Feature activation+6.14e-03

Pos logit contributions
str+0.103
field+0.096
cker+0.085
cle+0.085
pron+0.082
Neg logit contributions
 Give-0.094
 Go-0.090
 agrees-0.087
 acts-0.084
 You-0.083
Token fridge
Token ID25772
Feature activation-2.53e-02

Pos logit contributions
 agrees+0.015
 Give+0.015
 Go+0.014
ives+0.014
 Come+0.013
Neg logit contributions
str-0.008
field-0.008
 sudden-0.007
 wheat-0.007
 raft-0.007
Token.
Token ID13
Feature activation+2.63e-02

Pos logit contributions
field+0.049
str+0.048
 sudden+0.045
 wheat+0.043
 raft+0.043
Neg logit contributions
 agrees-0.045
 Give-0.045
 Go-0.043
ives-0.040
 Share-0.039
Token She
Token ID1375
Feature activation-1.18e-02

Pos logit contributions
 agrees+0.049
 Give+0.046
 Go+0.045
ives+0.044
 acts+0.042
Neg logit contributions
field-0.046
str-0.045
 sudden-0.043
 pine-0.042
 wheat-0.041
Token was
Token ID373
Feature activation+1.01e-02

Pos logit contributions
str+0.021
field+0.021
 raft+0.020
 wheat+0.019
Wil+0.019
Neg logit contributions
 agrees-0.022
 Give-0.022
 Go-0.020
 disagrees-0.019
 wants-0.019
Token perplex
Token ID35682
Feature activation+9.62e-02

Pos logit contributions
 agrees+0.018
 Give+0.018
 Go+0.017
 aback+0.016
 Believe+0.016
Neg logit contributions
field-0.019
str-0.019
 raft-0.018
 wheat-0.017
 sudden-0.017
Tokened
Token ID276
Feature activation+3.12e-02

Pos logit contributions
 agrees+0.146
 Give+0.140
 Go+0.138
 teaches+0.130
 Share+0.128
Neg logit contributions
str-0.188
field-0.179
 which-0.177
 sudden-0.177
 while-0.171
Token by
Token ID416
Feature activation-7.93e-03

Pos logit contributions
 agrees+0.071
 Give+0.070
 Go+0.068
ives+0.066
 Share+0.064
Neg logit contributions
field-0.043
str-0.043
 sudden-0.041
 wheat-0.038
 which-0.038
Token what
Token ID644
Feature activation+1.41e-02

Pos logit contributions
str+0.015
field+0.015
pron+0.013
 raft+0.013
 sudden+0.013
Neg logit contributions
 Give-0.015
 Go-0.014
 agrees-0.014
 consequences-0.013
 acts-0.013
Token she
Token ID673
Feature activation+3.66e-02

Pos logit contributions
 agrees+0.035
 Give+0.035
 Go+0.034
ives+0.033
 Share+0.032
Neg logit contributions
field-0.017
str-0.017
 sudden-0.015
 raft-0.014
 wheat-0.014
Token saw
Token ID2497
Feature activation+2.25e-02

Pos logit contributions
 agrees+0.085
 Give+0.083
 Go+0.081
ives+0.077
 Share+0.076
Neg logit contributions
str-0.052
field-0.052
 sudden-0.047
 pine-0.044
 wheat-0.042
Token\n
Token ID198
Feature activation-5.02e-02

Pos logit contributions
field+0.076
str+0.075
 sudden+0.069
 wheat+0.066
 raft+0.066
Neg logit contributions
 Give-0.108
 agrees-0.108
 Go-0.104
ives-0.099
 Share-0.096
TokenAs
Token ID1722
Feature activation-9.37e-02

Pos logit contributions
field+0.077
str+0.066
 pumpkin+0.059
chard+0.059
 wheat+0.059
Neg logit contributions
 Give-0.121
 Go-0.115
 agrees-0.108
 Share-0.104
ives-0.103
Token he
Token ID339
Feature activation-7.82e-03

Pos logit contributions
pron+0.234
field+0.230
iri+0.218
str+0.212
aman+0.212
Neg logit contributions
 Give-0.079
 Go-0.078
 You-0.075
 Hide-0.068
 Come-0.063
Token cleaned
Token ID20750
Feature activation-1.40e-02

Pos logit contributions
str+0.016
field+0.015
 wheat+0.014
chard+0.014
 hurricane+0.014
Neg logit contributions
 agrees-0.013
 Give-0.013
 Go-0.012
 acts-0.011
 disagrees-0.011
Token more
Token ID517
Feature activation-6.28e-02

Pos logit contributions
str+0.026
field+0.025
 sudden+0.022
pron+0.022
iri+0.021
Neg logit contributions
 Give-0.026
 agrees-0.026
 Go-0.025
 aback-0.024
 You-0.023
Token,
Token ID11
Feature activation-1.00e-01

Pos logit contributions
str+0.112
field+0.112
 sudden+0.098
 raft+0.096
 wheat+0.093
Neg logit contributions
 agrees-0.125
 Give-0.122
 Go-0.117
ives-0.111
 acts-0.106
Token Tim
Token ID5045
Feature activation-3.27e-03

Pos logit contributions
field+0.222
pron+0.213
str+0.209
iri+0.204
aman+0.202
Neg logit contributions
 Give-0.139
 You-0.136
 Go-0.134
 consequences-0.119
 Come-0.118
Token found
Token ID1043
Feature activation-7.40e-03

Pos logit contributions
str+0.006
field+0.006
 sudden+0.005
 wheat+0.005
 raft+0.005
Neg logit contributions
 agrees-0.006
 Give-0.006
 Go-0.006
ives-0.005
 Come-0.005
Token more
Token ID517
Feature activation-4.32e-02

Pos logit contributions
field+0.010
str+0.010
 sudden+0.009
 wheat+0.009
 raft+0.008
Neg logit contributions
 agrees-0.017
 Give-0.017
 Go-0.016
ives-0.016
 Share-0.015
Token things
Token ID1243
Feature activation-4.00e-02

Pos logit contributions
field+0.078
str+0.077
 sudden+0.070
 raft+0.067
 wheat+0.066
Neg logit contributions
 agrees-0.084
 Give-0.083
 Go-0.079
ives-0.075
 Share-0.071
Token he
Token ID339
Feature activation+1.51e-02

Pos logit contributions
field+0.059
str+0.059
 sudden+0.055
 wheat+0.051
 raft+0.050
Neg logit contributions
 agrees-0.089
 Give-0.088
 Go-0.085
ives-0.082
 Share-0.078
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
str+0.047
field+0.047
 sudden+0.044
 wheat+0.042
 raft+0.042
Neg logit contributions
 agrees-0.043
 Give-0.042
 Go-0.041
ives-0.038
 Share-0.036
Token While
Token ID2893
Feature activation-8.66e-02

Pos logit contributions
 Give+0.023
 agrees+0.022
 Go+0.021
ives+0.020
 Come+0.019
Neg logit contributions
field-0.028
str-0.027
 sudden-0.026
 wheat-0.025
 raft-0.024
Token he
Token ID339
Feature activation-6.84e-03

Pos logit contributions
field+0.165
iri+0.164
str+0.163
aman+0.157
pron+0.147
Neg logit contributions
 Give-0.142
 Go-0.141
 You-0.132
 Come-0.125
 Hide-0.118
Token was
Token ID373
Feature activation-1.24e-02

Pos logit contributions
field+0.014
str+0.013
 wheat+0.013
 raft+0.012
chard+0.012
Neg logit contributions
 agrees-0.012
 Give-0.011
 Go-0.011
ives-0.010
 disagrees-0.010
Token swinging
Token ID25635
Feature activation-1.93e-02

Pos logit contributions
field+0.023
str+0.022
cle+0.020
Wil+0.019
aman+0.019
Neg logit contributions
 Give-0.023
 agrees-0.023
 Go-0.023
ives-0.021
 Come-0.021
Token,
Token ID11
Feature activation-9.89e-02

Pos logit contributions
field+0.037
str+0.034
 sudden+0.032
pron+0.031
aman+0.031
Neg logit contributions
 Give-0.037
 agrees-0.037
 Go-0.036
 aback-0.033
 Come-0.032
Token he
Token ID339
Feature activation-3.82e-03

Pos logit contributions
field+0.236
str+0.223
iri+0.217
aman+0.203
cker+0.200
Neg logit contributions
 You-0.117
 Give-0.115
 Go-0.110
 Hide-0.100
 Come-0.100
Token saw
Token ID2497
Feature activation-7.60e-03

Pos logit contributions
field+0.006
str+0.006
aman+0.006
Wil+0.006
 wheat+0.006
Neg logit contributions
 agrees-0.008
 Give-0.007
 Go-0.007
 disagrees-0.007
ives-0.007
Token a
Token ID257
Feature activation-8.24e-03

Pos logit contributions
field+0.015
str+0.014
pron+0.013
aman+0.013
cle+0.013
Neg logit contributions
 Give-0.014
 agrees-0.014
 Go-0.013
 aback-0.012
 consequences-0.012
Token group
Token ID1448
Feature activation+9.45e-05

Pos logit contributions
field+0.014
aster+0.013
aman+0.012
 Wil+0.012
ittens+0.012
Neg logit contributions
 Give-0.016
 agrees-0.014
 Go-0.014
 Hide-0.013
 disagrees-0.013
Token of
Token ID286
Feature activation-2.04e-02

Pos logit contributions
 agrees+0.000
 Give+0.000
 Go+0.000
ives+0.000
 Share+0.000
Neg logit contributions
str-0.000
field-0.000
 sudden-0.000
 wheat-0.000
 raft-0.000
Token.
Token ID13
Feature activation+1.77e-02

Pos logit contributions
str+0.041
field+0.037
 sudden+0.036
pron+0.033
 raft+0.032
Neg logit contributions
 Give-0.064
 agrees-0.064
 Go-0.063
 Share-0.059
 You-0.057
Token\n
Token ID198
Feature activation-4.17e-02

Pos logit contributions
 agrees+0.031
 Give+0.031
 Go+0.030
ives+0.028
 Come+0.027
Neg logit contributions
str-0.034
field-0.034
 sudden-0.031
pron-0.030
 wheat-0.030
Token\n
Token ID198
Feature activation-4.12e-02

Pos logit contributions
field+0.074
str+0.073
 sudden+0.067
 wheat+0.066
 raft+0.065
Neg logit contributions
 agrees-0.080
 Give-0.080
 Go-0.077
ives-0.072
 Share-0.070
TokenWhile
Token ID3633
Feature activation-8.75e-02

Pos logit contributions
field+0.049
str+0.046
 pumpkin+0.039
 raft+0.039
aman+0.038
Neg logit contributions
 Give-0.104
 agrees-0.101
 Go-0.100
ives-0.095
 Believe-0.093
Token resting
Token ID19186
Feature activation-3.88e-02

Pos logit contributions
aman+0.156
pron+0.151
str+0.146
field+0.140
iri+0.138
Neg logit contributions
 Go-0.159
 You-0.156
 Give-0.155
 Come-0.144
 Hide-0.139
Token,
Token ID11
Feature activation-9.84e-02

Pos logit contributions
str+0.070
field+0.070
aman+0.066
pron+0.064
cle+0.063
Neg logit contributions
 agrees-0.070
 Give-0.070
 Go-0.066
 aback-0.065
 Come-0.061
Token the
Token ID262
Feature activation-1.66e-02

Pos logit contributions
str+0.182
aman+0.181
pron+0.170
field+0.170
cle+0.166
Neg logit contributions
 You-0.160
 Give-0.147
 Go-0.146
 agrees-0.142
 aback-0.142
Token bird
Token ID6512
Feature activation-3.03e-02

Pos logit contributions
field+0.028
str+0.027
aster+0.026
aman+0.026
pron+0.025
Neg logit contributions
 Give-0.030
 agrees-0.030
 Go-0.029
 consequences-0.027
 acts-0.027
Tokenie
Token ID494
Feature activation-9.82e-03

Pos logit contributions
field+0.052
pron+0.052
str+0.051
cle+0.051
apy+0.051
Neg logit contributions
 agrees-0.051
 Give-0.047
 wants-0.047
vious-0.044
ives-0.043
Token saw
Token ID2497
Feature activation-1.08e-02

Pos logit contributions
str+0.017
field+0.016
pron+0.015
aman+0.015
aster+0.015
Neg logit contributions
 agrees-0.019
 Give-0.018
 Go-0.017
ives-0.017
 disagrees-0.016
Token a
Token ID257
Feature activation-1.18e-02

Pos logit contributions
field+0.021
str+0.020
pron+0.019
cle+0.018
aman+0.017
Neg logit contributions
 agrees-0.020
 Give-0.020
 Go-0.018
 acts-0.017
 Believe-0.017
Token.
Token ID13
Feature activation+2.85e-02

Pos logit contributions
str+0.039
field+0.037
 sudden+0.036
 wheat+0.034
 raft+0.033
Neg logit contributions
 agrees-0.039
 Give-0.039
 Go-0.037
ives-0.035
 Share-0.033
Token While
Token ID2893
Feature activation-8.42e-02

Pos logit contributions
 agrees+0.049
 Give+0.047
 Go+0.045
ives+0.044
 disagrees+0.041
Neg logit contributions
field-0.056
str-0.056
 sudden-0.053
 wheat-0.050
 raft-0.050
Token she
Token ID673
Feature activation-2.22e-02

Pos logit contributions
str+0.120
field+0.105
Wil+0.094
aman+0.094
 sudden+0.093
Neg logit contributions
 Go-0.195
 Give-0.195
 agrees-0.186
 You-0.181
 Come-0.175
Token was
Token ID373
Feature activation-2.04e-02

Pos logit contributions
str+0.041
field+0.041
 sudden+0.037
 wheat+0.037
 raft+0.036
Neg logit contributions
 agrees-0.042
 Give-0.041
 Go-0.039
ives-0.037
 Come-0.036
Token sweeping
Token ID18404
Feature activation-1.92e-02

Pos logit contributions
str+0.038
field+0.037
iri+0.033
Wil+0.033
cle+0.033
Neg logit contributions
 Give-0.036
 agrees-0.036
 Go-0.034
ives-0.032
 disagrees-0.031
Token,
Token ID11
Feature activation-9.82e-02

Pos logit contributions
str+0.030
field+0.030
 sudden+0.026
 raft+0.025
 wheat+0.024
Neg logit contributions
 agrees-0.041
 Give-0.040
 Go-0.039
 aback-0.037
ives-0.036
Token she
Token ID673
Feature activation-1.35e-02

Pos logit contributions
str+0.155
field+0.149
 raft+0.132
cker+0.131
iri+0.124
Neg logit contributions
 Give-0.199
 Go-0.197
 agrees-0.195
 You-0.193
 Come-0.181
Token kept
Token ID4030
Feature activation-1.41e-02

Pos logit contributions
field+0.024
str+0.024
 raft+0.023
 wheat+0.022
Wil+0.021
Neg logit contributions
 agrees-0.024
 Give-0.023
 Go-0.022
ives-0.021
 disagrees-0.021
Token cl
Token ID537
Feature activation+5.29e-02

Pos logit contributions
str+0.027
field+0.027
iri+0.024
cle+0.022
cker+0.022
Neg logit contributions
 Give-0.024
 agrees-0.023
 Go-0.023
 Come-0.022
 aback-0.022
Tokenapping
Token ID5912
Feature activation+4.46e-03

Pos logit contributions
 Give+0.099
 agrees+0.099
 Go+0.094
ives+0.088
 disagrees+0.088
Neg logit contributions
field-0.094
str-0.093
 sudden-0.090
aster-0.084
pron-0.083
Token her
Token ID607
Feature activation+3.42e-02

Pos logit contributions
 agrees+0.010
 Give+0.009
 Go+0.009
ives+0.009
 Share+0.008
Neg logit contributions
str-0.007
field-0.007
 sudden-0.006
 wheat-0.006
 raft-0.006
Token her
Token ID607
Feature activation+1.98e-02

Pos logit contributions
str+0.031
field+0.031
 sudden+0.027
 wheat+0.025
 raft+0.025
Neg logit contributions
 agrees-0.049
 Give-0.048
 Go-0.047
ives-0.044
 Share-0.043
Token friends
Token ID2460
Feature activation-3.98e-02

Pos logit contributions
 agrees+0.035
 Give+0.034
 Go+0.033
ives+0.030
 acts+0.029
Neg logit contributions
str-0.038
field-0.037
 sudden-0.035
aster-0.035
aman-0.033
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
str+0.082
field+0.080
 sudden+0.077
 wheat+0.073
 raft+0.072
Neg logit contributions
 agrees-0.067
 Give-0.065
 Go-0.063
ives-0.058
 Share-0.056
Token While
Token ID2893
Feature activation-8.56e-02

Pos logit contributions
 agrees+0.027
 Give+0.027
 Go+0.025
ives+0.024
 disagrees+0.023
Neg logit contributions
field-0.029
str-0.029
 sudden-0.027
 wheat-0.026
 raft-0.026
Token playing
Token ID2712
Feature activation-1.85e-02

Pos logit contributions
str+0.124
field+0.120
aman+0.105
Wil+0.105
pron+0.103
Neg logit contributions
 Go-0.197
 Give-0.193
 You-0.180
 agrees-0.177
 Come-0.174
Token,
Token ID11
Feature activation-9.82e-02

Pos logit contributions
field+0.027
str+0.027
 sudden+0.024
 wheat+0.022
pron+0.022
Neg logit contributions
 Give-0.042
 agrees-0.042
 Go-0.041
ives-0.038
 Come-0.037
Token she
Token ID673
Feature activation-1.39e-02

Pos logit contributions
str+0.190
field+0.189
 raft+0.164
aman+0.163
pron+0.161
Neg logit contributions
 Give-0.165
 Go-0.162
 You-0.158
 agrees-0.154
 Come-0.143
Token found
Token ID1043
Feature activation-5.62e-03

Pos logit contributions
str+0.019
field+0.019
 sudden+0.017
 wheat+0.016
 raft+0.016
Neg logit contributions
 agrees-0.032
 Give-0.032
 Go-0.031
ives-0.030
 Share-0.029
Token a
Token ID257
Feature activation-1.02e-02

Pos logit contributions
field+0.007
str+0.007
 sudden+0.007
 wheat+0.006
 raft+0.006
Neg logit contributions
 agrees-0.014
 Give-0.014
 Go-0.013
ives-0.013
 Share-0.012
Token toy
Token ID13373
Feature activation-1.09e-02

Pos logit contributions
field+0.016
aster+0.015
str+0.015
 Wil+0.014
aman+0.014
Neg logit contributions
 Give-0.021
 agrees-0.019
 Go-0.018
 disagrees-0.017
 truth-0.017
Token car
Token ID1097
Feature activation-1.82e-02

Pos logit contributions
str+0.013
field+0.013
 sudden+0.011
 wheat+0.010
 raft+0.010
Neg logit contributions
 Give-0.028
 agrees-0.028
 Go-0.027
ives-0.026
 Share-0.025
Token tree
Token ID5509
Feature activation-3.27e-02

Pos logit contributions
str+0.021
field+0.021
 Wil+0.020
aman+0.020
apy+0.019
Neg logit contributions
 Give-0.031
 Go-0.031
 agrees-0.031
 demands-0.028
 disagrees-0.028
Token branch
Token ID8478
Feature activation-3.90e-02

Pos logit contributions
str+0.068
 sudden+0.065
Wil+0.060
field+0.059
 raft+0.058
Neg logit contributions
 agrees-0.054
 Give-0.051
ives-0.049
 Go-0.049
 Share-0.044
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
str+0.078
 sudden+0.077
field+0.073
Wil+0.069
 raft+0.069
Neg logit contributions
 agrees-0.067
 Give-0.065
ives-0.064
 Go-0.062
 aback-0.056
Token While
Token ID2893
Feature activation-8.31e-02

Pos logit contributions
 Give+0.028
 agrees+0.027
 Go+0.026
ives+0.024
 Come+0.024
Neg logit contributions
field-0.032
str-0.032
 sudden-0.030
pron-0.028
 wheat-0.028
Token resting
Token ID19186
Feature activation-3.71e-02

Pos logit contributions
pron+0.142
field+0.141
iet+0.140
aman+0.136
iri+0.131
Neg logit contributions
 Go-0.146
 Give-0.146
 You-0.143
 Come-0.135
 Share-0.124
Token,
Token ID11
Feature activation-9.80e-02

Pos logit contributions
field+0.071
str+0.067
aman+0.063
pron+0.062
 sudden+0.062
Neg logit contributions
 Give-0.069
 agrees-0.066
 Go-0.064
 aback-0.062
ives-0.060
Token Twe
Token ID24205
Feature activation+3.43e-02

Pos logit contributions
field+0.199
pron+0.184
str+0.176
aman+0.174
iet+0.169
Neg logit contributions
 You-0.152
 Give-0.150
 Go-0.140
 Come-0.126
 Share-0.120
Tokenety
Token ID2963
Feature activation-8.66e-03

Pos logit contributions
 agrees+0.078
 Give+0.077
 Go+0.075
ives+0.072
 Share+0.069
Neg logit contributions
field-0.049
str-0.048
 sudden-0.045
 wheat-0.042
 raft-0.042
Token saw
Token ID2497
Feature activation-9.87e-03

Pos logit contributions
str+0.014
field+0.014
aman+0.013
Wil+0.013
 raft+0.013
Neg logit contributions
 agrees-0.017
 Give-0.016
 Go-0.015
ives-0.015
 disagrees-0.014
Token a
Token ID257
Feature activation-1.00e-02

Pos logit contributions
field+0.022
pron+0.020
str+0.020
cle+0.018
aman+0.018
Neg logit contributions
 Give-0.016
 agrees-0.015
 Go-0.014
 consequences-0.014
 aback-0.014
Token little
Token ID1310
Feature activation+3.63e-03

Pos logit contributions
field+0.017
aster+0.017
side+0.016
aman+0.015
 Wil+0.015
Neg logit contributions
 Give-0.018
 truth-0.016
 agrees-0.016
 Go-0.015
 disagrees-0.014
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
str+0.012
field+0.012
 sudden+0.012
 raft+0.011
aster+0.011
Neg logit contributions
 agrees-0.010
 Give-0.010
 Go-0.010
ives-0.009
 Share-0.009
Token While
Token ID2893
Feature activation-8.22e-02

Pos logit contributions
 Give+0.018
 agrees+0.017
 Go+0.017
ives+0.015
 Come+0.015
Neg logit contributions
str-0.024
field-0.024
 sudden-0.022
 wheat-0.021
 raft-0.021
Token she
Token ID673
Feature activation-1.66e-02

Pos logit contributions
str+0.133
field+0.127
iri+0.122
cker+0.118
aman+0.114
Neg logit contributions
 Go-0.165
 Give-0.164
 You-0.154
 Come-0.149
 Hide-0.142
Token was
Token ID373
Feature activation-1.39e-02

Pos logit contributions
field+0.035
str+0.034
 wheat+0.033
 raft+0.032
 tractor+0.031
Neg logit contributions
 Give-0.025
 agrees-0.024
 Go-0.023
ives-0.021
 Share-0.021
Token collecting
Token ID13157
Feature activation-1.02e-02

Pos logit contributions
field+0.028
cle+0.027
str+0.025
aman+0.024
Big+0.024
Neg logit contributions
 Give-0.021
 Go-0.020
 agrees-0.019
 Hide-0.018
 Come-0.018
Token,
Token ID11
Feature activation-9.78e-02

Pos logit contributions
field+0.016
str+0.015
 sudden+0.014
pron+0.013
cle+0.013
Neg logit contributions
 agrees-0.023
 Give-0.022
 Go-0.021
ives-0.020
 aback-0.020
Token she
Token ID673
Feature activation-1.10e-02

Pos logit contributions
str+0.199
field+0.196
cker+0.184
cle+0.167
iri+0.163
Neg logit contributions
 Give-0.162
 Go-0.156
 You-0.154
 Come-0.140
 Hide-0.139
Token saw
Token ID2497
Feature activation-7.43e-03

Pos logit contributions
str+0.020
field+0.020
 wheat+0.018
aman+0.018
 raft+0.018
Neg logit contributions
 Give-0.019
 agrees-0.019
 Go-0.018
ives-0.017
 disagrees-0.017
Token a
Token ID257
Feature activation-5.89e-03

Pos logit contributions
field+0.015
str+0.014
pron+0.013
cle+0.013
aman+0.012
Neg logit contributions
 Give-0.013
 agrees-0.012
 Go-0.011
 consequences-0.011
 aback-0.011
Token bee
Token ID20697
Feature activation-6.94e-04

Pos logit contributions
field+0.009
aster+0.009
aman+0.008
ittens+0.008
 Wil+0.008
Neg logit contributions
 Give-0.011
 agrees-0.010
 Go-0.010
 truth-0.009
 disagrees-0.009
Token flying
Token ID7348
Feature activation-7.03e-04

Pos logit contributions
str+0.001
field+0.001
pron+0.001
aman+0.001
 sudden+0.001
Neg logit contributions
 Give-0.002
 agrees-0.001
 Go-0.001
ives-0.001
 You-0.001
Token his
Token ID465
Feature activation+2.63e-02

Pos logit contributions
field+0.027
str+0.027
 sudden+0.024
 wheat+0.022
 raft+0.022
Neg logit contributions
 agrees-0.052
 Give-0.051
 Go-0.050
ives-0.047
 Share-0.046
Token friends
Token ID2460
Feature activation-3.54e-02

Pos logit contributions
 Give+0.047
 agrees+0.047
 Go+0.045
ives+0.040
 Share+0.040
Neg logit contributions
field-0.050
str-0.050
aster-0.045
 sudden-0.044
 wheat-0.044
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
field+0.064
str+0.063
 sudden+0.059
 raft+0.057
 wheat+0.057
Neg logit contributions
 agrees-0.068
 Give-0.066
 Go-0.064
ives-0.061
 Share-0.058
Token While
Token ID2893
Feature activation-8.35e-02

Pos logit contributions
 Give+0.016
 agrees+0.016
 Go+0.015
ives+0.014
 Share+0.014
Neg logit contributions
field-0.022
str-0.022
 sudden-0.021
 wheat-0.020
 raft-0.020
Token playing
Token ID2712
Feature activation-1.06e-02

Pos logit contributions
aman+0.153
field+0.151
str+0.151
pron+0.144
iri+0.144
Neg logit contributions
 Give-0.154
 Go-0.153
 You-0.143
 Come-0.140
 Share-0.131
Token,
Token ID11
Feature activation-9.78e-02

Pos logit contributions
field+0.015
str+0.015
 sudden+0.013
 wheat+0.013
aman+0.012
Neg logit contributions
 Give-0.025
 agrees-0.025
 Go-0.024
ives-0.022
 Come-0.022
Token Tim
Token ID5045
Feature activation+1.40e-02

Pos logit contributions
field+0.236
str+0.228
aman+0.226
iri+0.214
pron+0.210
Neg logit contributions
 Give-0.114
 You-0.112
 Go-0.104
 Hide-0.094
 Come-0.091
Tokenmy
Token ID1820
Feature activation-1.19e-02

Pos logit contributions
 agrees+0.023
 Give+0.023
 Go+0.022
ives+0.020
 Come+0.019
Neg logit contributions
field-0.029
str-0.028
 sudden-0.027
 wheat-0.026
 raft-0.026
Token saw
Token ID2497
Feature activation-7.90e-03

Pos logit contributions
field+0.021
str+0.020
 wheat+0.018
aman+0.018
Wil+0.018
Neg logit contributions
 Give-0.023
 agrees-0.022
 Go-0.021
ives-0.020
 Come-0.020
Token a
Token ID257
Feature activation-1.01e-02

Pos logit contributions
field+0.016
str+0.015
pron+0.014
aman+0.013
 sudden+0.013
Neg logit contributions
 Give-0.015
 agrees-0.013
 Go-0.013
 Share-0.012
 aback-0.012
Token beautiful
Token ID4950
Feature activation+6.06e-03

Pos logit contributions
field+0.018
aster+0.016
 Wil+0.015
aman+0.015
Wil+0.015
Neg logit contributions
 Give-0.019
 agrees-0.017
 Go-0.016
 disagrees-0.016
 Share-0.015
Token\n
Token ID198
Feature activation-4.41e-02

Pos logit contributions
 Give+0.024
 agrees+0.024
 Go+0.023
ives+0.022
 Come+0.021
Neg logit contributions
field-0.023
str-0.023
 sudden-0.021
 wheat-0.020
 raft-0.020
Token\n
Token ID198
Feature activation-4.51e-02

Pos logit contributions
field+0.079
str+0.078
 sudden+0.072
 wheat+0.070
 raft+0.070
Neg logit contributions
 Give-0.084
 agrees-0.084
 Go-0.080
ives-0.076
 Share-0.073
TokenAs
Token ID1722
Feature activation-8.81e-02

Pos logit contributions
field+0.071
str+0.063
 peach+0.058
 pumpkin+0.056
 corn+0.054
Neg logit contributions
 Give-0.107
 Go-0.100
 agrees-0.096
ives-0.092
 Share-0.092
Token they
Token ID484
Feature activation-1.78e-02

Pos logit contributions
pron+0.199
field+0.197
iri+0.191
 veil+0.184
str+0.184
Neg logit contributions
 You-0.073
 Go-0.066
 He-0.062
 Jack-0.061
 Give-0.061
Token cooked
Token ID15847
Feature activation-2.41e-02

Pos logit contributions
str+0.033
field+0.031
ich+0.031
aman+0.029
 raft+0.028
Neg logit contributions
 agrees-0.032
 Give-0.032
ives-0.030
 Go-0.030
 teaches-0.029
Token,
Token ID11
Feature activation-9.77e-02

Pos logit contributions
str+0.046
field+0.044
 sudden+0.038
aman+0.038
Wil+0.037
Neg logit contributions
 agrees-0.045
 Give-0.045
 Go-0.043
 Come-0.040
 You-0.040
Token Tim
Token ID5045
Feature activation+9.68e-03

Pos logit contributions
field+0.261
str+0.249
iri+0.245
aman+0.237
pron+0.235
Neg logit contributions
 Give-0.085
 You-0.085
 Go-0.080
 Come-0.067
 Hide-0.061
Tokenmy
Token ID1820
Feature activation-1.33e-02

Pos logit contributions
 agrees+0.015
 Give+0.014
 Go+0.013
ives+0.013
 Come+0.012
Neg logit contributions
field-0.021
str-0.021
 sudden-0.019
 wheat-0.019
 raft-0.019
Token's
Token ID338
Feature activation+5.65e-03

Pos logit contributions
str+0.030
field+0.030
Wil+0.028
aman+0.028
 raft+0.028
Neg logit contributions
 agrees-0.018
 Give-0.017
 Go-0.016
ives-0.015
 disagrees-0.014
Token mom
Token ID1995
Feature activation-3.58e-02

Pos logit contributions
 Give+0.013
 agrees+0.013
 Go+0.013
ives+0.012
 demands+0.012
Neg logit contributions
str-0.008
field-0.008
Wil-0.006
 sudden-0.006
aman-0.006
Token told
Token ID1297
Feature activation+1.66e-02

Pos logit contributions
field+0.066
str+0.064
 raft+0.059
 wheat+0.057
 sudden+0.057
Neg logit contributions
 agrees-0.067
 Give-0.066
 Go-0.061
ives-0.060
 teaches-0.058
Token\n
Token ID198
Feature activation-4.41e-02

Pos logit contributions
field+0.082
str+0.080
 sudden+0.074
 wheat+0.073
 raft+0.072
Neg logit contributions
 Give-0.085
 agrees-0.084
 Go-0.081
ives-0.075
 Come-0.073
TokenAs
Token ID1722
Feature activation-9.29e-02

Pos logit contributions
field+0.062
str+0.057
 wheat+0.050
 raft+0.049
 peach+0.049
Neg logit contributions
 Give-0.107
 Go-0.104
 agrees-0.102
 Come-0.094
 Share-0.094
Token Tim
Token ID5045
Feature activation+6.31e-03

Pos logit contributions
field+0.204
pron+0.201
str+0.192
cker+0.184
ich+0.181
Neg logit contributions
 Go-0.117
 Give-0.111
 You-0.110
 Come-0.096
 Hide-0.088
Tokenmy
Token ID1820
Feature activation-1.22e-02

Pos logit contributions
 agrees+0.010
 Give+0.010
 Go+0.009
ives+0.009
 Come+0.008
Neg logit contributions
str-0.013
field-0.013
 sudden-0.013
 wheat-0.013
 raft-0.012
Token left
Token ID1364
Feature activation-4.99e-02

Pos logit contributions
str+0.022
chard+0.022
field+0.021
 wheat+0.020
pron+0.020
Neg logit contributions
 agrees-0.024
 Give-0.023
 Go-0.022
ives-0.020
 acts-0.020
Token,
Token ID11
Feature activation-9.74e-02

Pos logit contributions
str+0.083
field+0.077
 sudden+0.074
 wheat+0.069
pron+0.067
Neg logit contributions
 Give-0.102
 Go-0.099
 agrees-0.098
 aback-0.093
 Come-0.091
Token he
Token ID339
Feature activation-1.42e-02

Pos logit contributions
field+0.196
str+0.190
pron+0.176
iri+0.176
cker+0.170
Neg logit contributions
 Give-0.154
 You-0.153
 Go-0.151
 Come-0.139
 aback-0.134
Token said
Token ID531
Feature activation+2.51e-02

Pos logit contributions
str+0.025
field+0.023
 wheat+0.022
ittens+0.022
Wil+0.022
Neg logit contributions
 agrees-0.028
 Give-0.026
 Go-0.025
 disagrees-0.024
 acts-0.023
Token,
Token ID11
Feature activation-4.13e-02

Pos logit contributions
 Give+0.050
 agrees+0.050
 Go+0.048
ives+0.044
 Share+0.043
Neg logit contributions
field-0.045
str-0.044
 sudden-0.041
 wheat-0.039
 raft-0.038
Token "
Token ID366
Feature activation+3.76e-02

Pos logit contributions
field+0.094
str+0.093
 sudden+0.087
aster+0.084
 wheat+0.084
Neg logit contributions
 Give-0.064
 Go-0.058
 Come-0.056
 agrees-0.055
 You-0.054
TokenThanks
Token ID9690
Feature activation+4.86e-02

Pos logit contributions
 Give+0.074
 agrees+0.071
 Go+0.070
ives+0.065
 Come+0.065
Neg logit contributions
field-0.068
str-0.067
 sudden-0.061
 raft-0.060
 wheat-0.060
Token\n
Token ID198
Feature activation-4.40e-02

Pos logit contributions
 agrees+0.050
 Give+0.046
 Go+0.044
ives+0.044
 acts+0.043
Neg logit contributions
str-0.041
 sudden-0.040
field-0.040
 which-0.037
 wheat-0.036
Token\n
Token ID198
Feature activation-4.51e-02

Pos logit contributions
str+0.066
field+0.064
 sudden+0.062
 wheat+0.056
 pine+0.055
Neg logit contributions
 agrees-0.098
 Give-0.095
 Go-0.092
ives-0.088
 acts-0.086
TokenAs
Token ID1722
Feature activation-9.28e-02

Pos logit contributions
field+0.064
str+0.061
 raft+0.052
 sudden+0.052
 wheat+0.052
Neg logit contributions
 Give-0.109
 agrees-0.106
 Go-0.105
ives-0.099
 Come-0.096
Token she
Token ID673
Feature activation-1.45e-02

Pos logit contributions
str+0.182
field+0.180
cker+0.170
 raft+0.155
iri+0.154
Neg logit contributions
 Go-0.157
 Give-0.156
 agrees-0.142
 You-0.141
 Hide-0.133
Token played
Token ID2826
Feature activation-8.07e-03

Pos logit contributions
str+0.028
field+0.027
chard+0.027
 wheat+0.026
 hurricane+0.026
Neg logit contributions
 agrees-0.024
 Give-0.022
 Go-0.021
 Come-0.020
 acts-0.020
Token,
Token ID11
Feature activation-9.73e-02

Pos logit contributions
field+0.012
str+0.012
 sudden+0.011
 wheat+0.010
aman+0.010
Neg logit contributions
 agrees-0.018
 Give-0.018
 Go-0.018
ives-0.016
 Come-0.016
Token she
Token ID673
Feature activation-1.68e-02

Pos logit contributions
field+0.180
str+0.176
cker+0.152
 raft+0.152
iri+0.150
Neg logit contributions
 Give-0.185
 Go-0.180
 agrees-0.173
 You-0.170
 Come-0.160
Token found
Token ID1043
Feature activation-8.42e-03

Pos logit contributions
str+0.031
field+0.030
aman+0.029
Wil+0.028
 wheat+0.028
Neg logit contributions
 agrees-0.030
 Give-0.028
 Go-0.026
 disagrees-0.026
ives-0.025
Token a
Token ID257
Feature activation-1.40e-02

Pos logit contributions
field+0.012
str+0.012
 sudden+0.010
 wheat+0.010
 raft+0.010
Neg logit contributions
 agrees-0.020
 Give-0.020
 Go-0.019
ives-0.018
 Come-0.017
Token piece
Token ID3704
Feature activation-4.04e-02

Pos logit contributions
field+0.022
aster+0.021
str+0.020
 Wil+0.020
Wil+0.019
Neg logit contributions
 Give-0.027
 agrees-0.025
 Go-0.024
 disagrees-0.023
 truth-0.023
Token of
Token ID286
Feature activation-3.08e-02

Pos logit contributions
str+0.069
field+0.069
 sudden+0.065
 wheat+0.060
 raft+0.060
Neg logit contributions
 Give-0.082
 agrees-0.081
 Go-0.078
ives-0.075
 Share-0.070
Token.
Token ID13
Feature activation+2.95e-02

Pos logit contributions
 agrees+0.017
 Give+0.016
 Go+0.016
ives+0.015
 Come+0.014
Neg logit contributions
str-0.014
field-0.013
 sudden-0.013
Wil-0.012
 wheat-0.012
Token\n
Token ID198
Feature activation-3.51e-02

Pos logit contributions
 agrees+0.049
 Give+0.048
 Go+0.046
ives+0.043
 Share+0.041
Neg logit contributions
field-0.060
str-0.060
 sudden-0.057
 wheat-0.054
 raft-0.054
Token\n
Token ID198
Feature activation-3.89e-02

Pos logit contributions
field+0.056
str+0.056
 sudden+0.051
 wheat+0.049
 raft+0.049
Neg logit contributions
 agrees-0.074
 Give-0.074
 Go-0.071
ives-0.067
 Share-0.065
TokenWhile
Token ID3633
Feature activation-8.19e-02

Pos logit contributions
field+0.054
str+0.052
 raft+0.045
 sudden+0.045
 wheat+0.045
Neg logit contributions
 Give-0.093
 agrees-0.091
 Go-0.091
 Come-0.083
ives-0.083
Token swimming
Token ID14899
Feature activation-7.90e-03

Pos logit contributions
str+0.131
field+0.119
pron+0.109
aman+0.106
Wil+0.105
Neg logit contributions
 Go-0.177
 Give-0.172
 You-0.162
 agrees-0.158
 Come-0.156
Token,
Token ID11
Feature activation-9.73e-02

Pos logit contributions
field+0.012
str+0.012
 sudden+0.011
 wheat+0.011
pron+0.010
Neg logit contributions
 agrees-0.017
 Give-0.017
 Go-0.016
ives-0.016
 Come-0.015
Token the
Token ID262
Feature activation-9.77e-03

Pos logit contributions
str+0.199
field+0.194
pron+0.176
 raft+0.171
 blowing+0.169
Neg logit contributions
 Go-0.152
 Give-0.150
 You-0.146
 agrees-0.136
 Come-0.131
Token mole
Token ID9411
Feature activation-2.20e-02

Pos logit contributions
str+0.015
field+0.015
aman+0.013
 sudden+0.013
Wil+0.013
Neg logit contributions
 Give-0.020
 agrees-0.020
 Go-0.019
ives-0.018
 Come-0.018
Token met
Token ID1138
Feature activation-8.60e-04

Pos logit contributions
str+0.033
field+0.032
 sudden+0.029
 wheat+0.028
pron+0.028
Neg logit contributions
 agrees-0.049
 Give-0.048
 Go-0.046
ives-0.044
 disagrees-0.043
Token a
Token ID257
Feature activation-1.25e-02

Pos logit contributions
str+0.001
field+0.001
 sudden+0.001
 wheat+0.001
 raft+0.001
Neg logit contributions
 agrees-0.002
 Give-0.002
 Go-0.002
ives-0.002
 Share-0.002
Token kind
Token ID1611
Feature activation+6.53e-03

Pos logit contributions
field+0.021
aster+0.020
str+0.020
aman+0.020
ittens+0.019
Neg logit contributions
 Give-0.023
 agrees-0.022
 Go-0.020
 Believe-0.019
 Share-0.019
Token\n
Token ID198
Feature activation-4.21e-02

Pos logit contributions
str+0.069
field+0.069
 sudden+0.064
 wheat+0.061
 raft+0.060
Neg logit contributions
 agrees-0.089
 Give-0.088
 Go-0.084
ives-0.080
 Share-0.077
TokenAs
Token ID1722
Feature activation-8.74e-02

Pos logit contributions
field+0.070
str+0.068
 sudden+0.062
 wheat+0.060
 raft+0.060
Neg logit contributions
 Give-0.088
 agrees-0.087
 Go-0.085
ives-0.080
 Share-0.077
Token Red
Token ID2297
Feature activation+1.05e-03

Pos logit contributions
field+0.167
str+0.163
iri+0.152
cker+0.148
aman+0.146
Neg logit contributions
 Give-0.146
 Go-0.142
 You-0.133
 Share-0.128
 teaches-0.124
Token was
Token ID373
Feature activation-9.19e-03

Pos logit contributions
 agrees+0.002
 Give+0.002
 Go+0.002
ives+0.002
 Come+0.002
Neg logit contributions
field-0.002
str-0.002
pron-0.001
 sudden-0.001
 wheat-0.001
Token driving
Token ID5059
Feature activation-1.50e-02

Pos logit contributions
field+0.017
str+0.016
iri+0.015
opsy+0.014
Bet+0.014
Neg logit contributions
 Give-0.017
 Go-0.017
 Share-0.015
 agrees-0.015
 Come-0.015
Token,
Token ID11
Feature activation-9.72e-02

Pos logit contributions
field+0.026
str+0.025
 sudden+0.023
pron+0.022
 wheat+0.022
Neg logit contributions
 agrees-0.030
 Give-0.030
 Go-0.029
 aback-0.028
ives-0.027
Token he
Token ID339
Feature activation-4.63e-03

Pos logit contributions
field+0.215
str+0.210
iri+0.198
aman+0.193
opsy+0.186
Neg logit contributions
 Give-0.144
 Go-0.134
 You-0.132
 Share-0.117
 consequences-0.116
Token saw
Token ID2497
Feature activation-6.29e-03

Pos logit contributions
str+0.008
field+0.008
aman+0.008
ittens+0.008
 cherry+0.008
Neg logit contributions
 Give-0.008
 agrees-0.008
 disagrees-0.007
 Go-0.007
 wants-0.007
Token a
Token ID257
Feature activation-7.18e-03

Pos logit contributions
field+0.012
str+0.012
pron+0.010
aman+0.010
 sudden+0.010
Neg logit contributions
 Give-0.012
 agrees-0.011
 Go-0.011
 Share-0.010
 consequences-0.010
Token big
Token ID1263
Feature activation+3.35e-03

Pos logit contributions
field+0.012
aster+0.011
str+0.011
aman+0.010
ittens+0.010
Neg logit contributions
 Give-0.014
 agrees-0.013
 Go-0.012
 Share-0.012
 Hide-0.011
Token truck
Token ID7779
Feature activation-7.34e-03

Pos logit contributions
 Give+0.007
 agrees+0.007
 Go+0.006
 acts+0.006
 disagrees+0.006
Neg logit contributions
field-0.005
str-0.005
 sudden-0.005
pron-0.005
Wil-0.005
Token While
Token ID2893
Feature activation-8.08e-02

Pos logit contributions
 Give+0.023
 Go+0.022
 agrees+0.022
 Come+0.020
ives+0.019
Neg logit contributions
str-0.033
field-0.033
 sudden-0.030
 wheat-0.029
 raft-0.029
Token playing
Token ID2712
Feature activation-1.51e-02

Pos logit contributions
str+0.152
field+0.151
iri+0.143
aman+0.140
cker+0.139
Neg logit contributions
 Go-0.149
 Give-0.145
 You-0.133
 Come-0.133
 Share-0.119
Token on
Token ID319
Feature activation-4.74e-02

Pos logit contributions
field+0.025
str+0.025
 sudden+0.022
aman+0.021
 wheat+0.020
Neg logit contributions
 Give-0.033
 agrees-0.032
 Go-0.031
ives-0.029
 Come-0.029
Token the
Token ID262
Feature activation-1.40e-02

Pos logit contributions
str+0.075
field+0.075
pron+0.071
 sudden+0.069
cle+0.066
Neg logit contributions
 Give-0.099
 Go-0.099
 agrees-0.094
 You-0.087
 Share-0.086
Token swings
Token ID26728
Feature activation-3.69e-02

Pos logit contributions
field+0.026
str+0.025
 sudden+0.025
cle+0.023
pron+0.023
Neg logit contributions
 Give-0.026
 agrees-0.025
 Go-0.024
ives-0.021
 disagrees-0.021
Token,
Token ID11
Feature activation-9.71e-02

Pos logit contributions
field+0.072
 sudden+0.068
str+0.067
pron+0.065
cle+0.061
Neg logit contributions
 Give-0.069
 agrees-0.069
 Go-0.066
ives-0.060
 Come-0.059
Token she
Token ID673
Feature activation-1.18e-02

Pos logit contributions
str+0.206
field+0.206
cker+0.184
cle+0.177
aman+0.171
Neg logit contributions
 Give-0.153
 Go-0.151
 You-0.147
 Come-0.135
 agrees-0.130
Token saw
Token ID2497
Feature activation-7.86e-03

Pos logit contributions
field+0.020
str+0.020
 wheat+0.019
aman+0.019
Wil+0.018
Neg logit contributions
 Give-0.020
 agrees-0.020
 Go-0.019
 disagrees-0.018
 wants-0.018
Token a
Token ID257
Feature activation-1.06e-02

Pos logit contributions
field+0.016
str+0.015
pron+0.014
cle+0.013
aman+0.013
Neg logit contributions
 Give-0.014
 agrees-0.014
 Go-0.013
ives-0.012
 consequences-0.012
Token rope
Token ID17182
Feature activation-8.68e-03

Pos logit contributions
field+0.018
aster+0.016
 Wil+0.016
aman+0.016
ittens+0.015
Neg logit contributions
 Give-0.020
 agrees-0.018
 Go-0.017
 disagrees-0.016
 You-0.016
Token lying
Token ID9105
Feature activation-1.80e-04

Pos logit contributions
str+0.014
 sudden+0.014
field+0.013
 wheat+0.012
pron+0.012
Neg logit contributions
 Give-0.019
 Go-0.017
 agrees-0.017
ives-0.016
 You-0.016
Token\n
Token ID198
Feature activation-3.93e-02

Pos logit contributions
field+0.061
str+0.061
 sudden+0.056
 wheat+0.054
 raft+0.054
Neg logit contributions
 agrees-0.073
 Give-0.072
 Go-0.070
ives-0.065
 Share-0.063
TokenWhile
Token ID3633
Feature activation-8.15e-02

Pos logit contributions
field+0.057
str+0.055
 sudden+0.048
 raft+0.048
 wheat+0.048
Neg logit contributions
 Give-0.091
 agrees-0.089
 Go-0.089
 Come-0.081
ives-0.081
Token Tim
Token ID5045
Feature activation+1.13e-02

Pos logit contributions
str+0.126
field+0.114
aman+0.101
 sudden+0.100
pron+0.100
Neg logit contributions
 Go-0.181
 Give-0.177
 You-0.166
 agrees-0.165
 Come-0.161
Token was
Token ID373
Feature activation-1.11e-02

Pos logit contributions
 Give+0.023
 agrees+0.022
 Go+0.022
ives+0.020
 Share+0.020
Neg logit contributions
str-0.019
field-0.019
 sudden-0.018
 pine-0.017
 wheat-0.017
Token resting
Token ID19186
Feature activation-3.65e-02

Pos logit contributions
field+0.022
str+0.022
cle+0.020
Wil+0.019
 pine+0.019
Neg logit contributions
 Go-0.018
 Give-0.018
 agrees-0.017
ives-0.016
 Share-0.015
Token,
Token ID11
Feature activation-9.70e-02

Pos logit contributions
str+0.067
field+0.064
 sudden+0.059
aman+0.057
 wheat+0.056
Neg logit contributions
 Give-0.068
 agrees-0.066
 Go-0.064
 aback-0.062
 You-0.059
Token a
Token ID257
Feature activation-3.79e-02

Pos logit contributions
str+0.168
field+0.167
pron+0.146
aman+0.142
cker+0.134
Neg logit contributions
 Give-0.184
 Go-0.179
 You-0.171
 agrees-0.166
 Come-0.164
Token spoiled
Token ID34781
Feature activation-9.09e-03

Pos logit contributions
field+0.067
aman+0.060
str+0.059
 Wil+0.058
Wil+0.057
Neg logit contributions
 Give-0.069
 agrees-0.063
 Go-0.062
 Come-0.058
 demands-0.057
Token cat
Token ID3797
Feature activation-1.19e-02

Pos logit contributions
field+0.015
str+0.015
 sudden+0.014
Wil+0.013
 raft+0.013
Neg logit contributions
 Give-0.019
 agrees-0.018
 Go-0.017
ives-0.016
 Share-0.016
Token named
Token ID3706
Feature activation-3.53e-03

Pos logit contributions
field+0.018
str+0.018
 sudden+0.016
pron+0.016
 raft+0.015
Neg logit contributions
 Give-0.026
 agrees-0.025
 Go-0.025
ives-0.024
 Come-0.023
Token Max
Token ID5436
Feature activation+2.16e-02

Pos logit contributions
str+0.005
field+0.005
 sudden+0.005
 wheat+0.005
 pine+0.005
Neg logit contributions
 agrees-0.008
 Give-0.007
 Go-0.007
ives-0.007
 Share-0.007
Token\n
Token ID198
Feature activation-5.13e-02

Pos logit contributions
field+0.096
str+0.092
 wheat+0.088
 raft+0.087
 sudden+0.086
Neg logit contributions
 Give-0.080
 agrees-0.076
 Go-0.076
ives-0.070
 Share-0.068
TokenAs
Token ID1722
Feature activation-9.03e-02

Pos logit contributions
field+0.074
str+0.062
 peach+0.060
tha+0.058
 pumpkin+0.058
Neg logit contributions
 Give-0.127
 Go-0.120
 Share-0.109
 teaches-0.109
 agrees-0.109
Token they
Token ID484
Feature activation-1.04e-02

Pos logit contributions
pron+0.212
field+0.198
iri+0.197
bel+0.182
comb+0.182
Neg logit contributions
 You-0.086
 Go-0.085
 Give-0.080
 Hide-0.075
 is-0.072
Token were
Token ID547
Feature activation-3.32e-03

Pos logit contributions
field+0.017
pron+0.017
str+0.017
 wheat+0.017
 vine+0.016
Neg logit contributions
 Give-0.020
 agrees-0.019
 Go-0.018
ives-0.018
 teaches-0.017
Token driving
Token ID5059
Feature activation-1.62e-02

Pos logit contributions
field+0.006
str+0.005
pron+0.005
 peach+0.005
opsy+0.005
Neg logit contributions
 Give-0.007
 Go-0.006
 agrees-0.006
 Come-0.006
ives-0.006
Token,
Token ID11
Feature activation-9.69e-02

Pos logit contributions
field+0.027
str+0.026
pron+0.024
 sudden+0.024
 wheat+0.023
Neg logit contributions
 agrees-0.034
 Give-0.033
 Go-0.032
 aback-0.030
ives-0.029
Token they
Token ID484
Feature activation-1.07e-02

Pos logit contributions
field+0.252
iri+0.238
pron+0.237
str+0.236
aman+0.229
Neg logit contributions
 You-0.099
 Give-0.095
 Go-0.092
 Come-0.076
 That-0.072
Token saw
Token ID2497
Feature activation-5.25e-03

Pos logit contributions
ittens+0.020
 cherry+0.020
pron+0.020
field+0.020
str+0.020
Neg logit contributions
 agrees-0.016
 Give-0.015
ives-0.014
 disagrees-0.014
 Go-0.014
Token a
Token ID257
Feature activation-9.55e-03

Pos logit contributions
field+0.010
str+0.009
pron+0.009
aman+0.008
 sudden+0.008
Neg logit contributions
 Give-0.011
 agrees-0.010
 Go-0.010
ives-0.009
 Share-0.009
Token big
Token ID1263
Feature activation-7.25e-04

Pos logit contributions
field+0.016
aster+0.014
ittens+0.014
aman+0.014
iri+0.014
Neg logit contributions
 Give-0.018
 agrees-0.016
 Go-0.015
 Hide-0.015
 truth-0.014
Token hill
Token ID12788
Feature activation-2.64e-03

Pos logit contributions
field+0.001
str+0.001
 sudden+0.001
pron+0.001
ittens+0.001
Neg logit contributions
 Give-0.002
 agrees-0.001
 Go-0.001
 acts-0.001
 disagrees-0.001
Token\n
Token ID198
Feature activation-3.77e-02

Pos logit contributions
str+0.054
field+0.054
 sudden+0.050
 wheat+0.048
 raft+0.047
Neg logit contributions
 agrees-0.070
 Give-0.069
 Go-0.067
ives-0.064
 Share-0.061
TokenAs
Token ID1722
Feature activation-8.59e-02

Pos logit contributions
field+0.054
str+0.052
 sudden+0.046
 raft+0.045
 wheat+0.045
Neg logit contributions
 Give-0.088
 agrees-0.087
 Go-0.085
ives-0.079
 Share-0.078
Token they
Token ID484
Feature activation-1.98e-02

Pos logit contributions
str+0.130
field+0.118
pron+0.104
 sudden+0.100
Wil+0.098
Neg logit contributions
 Go-0.185
 Give-0.182
 agrees-0.176
 You-0.172
 Come-0.163
Token walked
Token ID6807
Feature activation+1.77e-02

Pos logit contributions
str+0.034
field+0.033
ich+0.032
 wheat+0.031
 sudden+0.030
Neg logit contributions
 agrees-0.041
 Give-0.039
ives-0.037
 Go-0.037
 disagrees-0.036
Token further
Token ID2252
Feature activation-4.25e-02

Pos logit contributions
 agrees+0.031
 Give+0.029
 Go+0.027
 aback+0.027
ives+0.026
Neg logit contributions
str-0.036
field-0.035
 sudden-0.034
pron-0.034
aster-0.032
Token,
Token ID11
Feature activation-9.68e-02

Pos logit contributions
pron+0.083
str+0.083
field+0.076
 sudden+0.074
aster+0.074
Neg logit contributions
 agrees-0.078
 Give-0.069
 aback-0.068
 Go-0.065
 acts-0.063
Token they
Token ID484
Feature activation-2.27e-02

Pos logit contributions
str+0.142
field+0.138
pron+0.127
aman+0.117
 wheat+0.115
Neg logit contributions
 Go-0.208
 Give-0.207
 agrees-0.200
 You-0.194
 Come-0.189
Token saw
Token ID2497
Feature activation-1.17e-02

Pos logit contributions
ittens+0.046
pron+0.041
 wheat+0.041
apy+0.041
str+0.041
Neg logit contributions
 agrees-0.034
ives-0.030
 disagrees-0.030
 Give-0.030
 aback-0.029
Token a
Token ID257
Feature activation-1.61e-02

Pos logit contributions
field+0.020
str+0.020
pron+0.018
 sudden+0.017
Wil+0.017
Neg logit contributions
 agrees-0.024
 Give-0.023
 Go-0.022
 acts-0.021
ives-0.020
Token scary
Token ID14343
Feature activation+3.17e-03

Pos logit contributions
field+0.024
str+0.023
aster+0.023
 Wil+0.023
ittens+0.022
Neg logit contributions
 Give-0.031
 agrees-0.029
 Go-0.027
 disagrees-0.026
 Hide-0.026
Token tree
Token ID5509
Feature activation-2.72e-02

Pos logit contributions
 Give+0.006
 agrees+0.006
 Go+0.006
 acts+0.005
 Come+0.005
Neg logit contributions
field-0.005
str-0.005
 sudden-0.005
Wil-0.005
pron-0.005
Token\n
Token ID198
Feature activation-4.57e-02

Pos logit contributions
field+0.091
str+0.088
 raft+0.083
 wheat+0.083
 sudden+0.082
Neg logit contributions
 Give-0.084
 agrees-0.082
 Go-0.080
ives-0.075
 Come-0.072
TokenAs
Token ID1722
Feature activation-9.02e-02

Pos logit contributions
field+0.056
str+0.051
 raft+0.045
 pumpkin+0.044
 wheat+0.043
Neg logit contributions
 Give-0.123
 Go-0.119
 agrees-0.114
ives-0.109
 Share-0.108
Token she
Token ID673
Feature activation-1.04e-02

Pos logit contributions
cker+0.156
field+0.153
str+0.143
 raft+0.137
iri+0.136
Neg logit contributions
 Go-0.158
 Give-0.155
 You-0.149
 Come-0.135
 teaches-0.134
Token was
Token ID373
Feature activation-8.67e-03

Pos logit contributions
str+0.021
chard+0.020
field+0.019
 wheat+0.018
 hurricane+0.018
Neg logit contributions
 agrees-0.016
 Give-0.016
 Go-0.015
 acts-0.015
 Come-0.014
Token walking
Token ID6155
Feature activation+1.33e-02

Pos logit contributions
str+0.016
field+0.016
cle+0.014
pron+0.014
 sudden+0.014
Neg logit contributions
 Give-0.015
 Go-0.015
 agrees-0.015
 Come-0.014
 Share-0.013
Token,
Token ID11
Feature activation-9.68e-02

Pos logit contributions
 agrees+0.025
 Give+0.024
 Go+0.022
 aback+0.022
ives+0.021
Neg logit contributions
str-0.025
field-0.025
pron-0.023
 sudden-0.023
aman-0.022
Token she
Token ID673
Feature activation-1.17e-02

Pos logit contributions
field+0.185
str+0.173
cker+0.168
iri+0.163
 raft+0.161
Neg logit contributions
 Give-0.167
 Go-0.162
 You-0.162
 Come-0.146
 aback-0.143
Token saw
Token ID2497
Feature activation-1.14e-02

Pos logit contributions
str+0.021
 cherry+0.021
field+0.021
 raft+0.020
 wheat+0.020
Neg logit contributions
 agrees-0.018
 Give-0.018
 disagrees-0.017
 Go-0.017
 Come-0.016
Token a
Token ID257
Feature activation-1.19e-02

Pos logit contributions
field+0.025
str+0.023
pron+0.021
cle+0.020
cker+0.020
Neg logit contributions
 Give-0.019
 agrees-0.018
 Go-0.017
 aback-0.016
 consequences-0.016
Token big
Token ID1263
Feature activation-3.47e-03

Pos logit contributions
field+0.019
aster+0.019
ittens+0.017
aman+0.017
 Wil+0.017
Neg logit contributions
 Give-0.022
 agrees-0.019
 Go-0.018
 truth-0.018
 Hide-0.018
Token wolf
Token ID17481
Feature activation-1.03e-02

Pos logit contributions
str+0.005
field+0.005
 sudden+0.005
Wil+0.005
After+0.005
Neg logit contributions
 Give-0.007
 agrees-0.007
 Go-0.007
 acts-0.006
 disagrees-0.006
Token\n
Token ID198
Feature activation-5.08e-02

Pos logit contributions
field+0.088
str+0.085
 sudden+0.080
 raft+0.078
 wheat+0.078
Neg logit contributions
 Give-0.093
 agrees-0.092
 Go-0.089
ives-0.084
 Come-0.081
TokenAs
Token ID1722
Feature activation-8.92e-02

Pos logit contributions
field+0.073
str+0.065
 raft+0.062
 pumpkin+0.059
 wheat+0.058
Neg logit contributions
 Give-0.124
 Go-0.120
 agrees-0.116
ives-0.112
 Come-0.109
Token she
Token ID673
Feature activation-5.63e-03

Pos logit contributions
str+0.168
cker+0.166
field+0.160
 raft+0.150
Wil+0.143
Neg logit contributions
 Go-0.140
 Give-0.134
 You-0.129
 Come-0.120
 agrees-0.118
Token was
Token ID373
Feature activation-6.66e-03

Pos logit contributions
str+0.011
field+0.010
chard+0.010
 raft+0.010
aman+0.009
Neg logit contributions
 agrees-0.010
 Give-0.009
 Go-0.009
ives-0.009
 teaches-0.009
Token eating
Token ID6600
Feature activation-2.95e-02

Pos logit contributions
str+0.011
field+0.010
 sudden+0.009
cle+0.009
 raft+0.009
Neg logit contributions
 Give-0.013
 agrees-0.013
 Go-0.013
ives-0.012
 Share-0.012
Token,
Token ID11
Feature activation-9.67e-02

Pos logit contributions
str+0.056
field+0.054
 sudden+0.047
Wil+0.047
cle+0.047
Neg logit contributions
 agrees-0.057
 Give-0.054
 Go-0.052
 disagrees-0.049
ives-0.048
Token she
Token ID673
Feature activation-9.43e-03

Pos logit contributions
str+0.194
field+0.192
cker+0.176
 raft+0.166
cle+0.164
Neg logit contributions
 Give-0.163
 You-0.157
 Go-0.156
 agrees-0.143
 Come-0.139
Token accidentally
Token ID14716
Feature activation-6.93e-03

Pos logit contributions
str+0.017
field+0.017
Wil+0.016
 raft+0.016
aman+0.015
Neg logit contributions
 agrees-0.016
 Give-0.015
 Go-0.015
 disagrees-0.014
 demands-0.014
Token spilled
Token ID31896
Feature activation-1.63e-02

Pos logit contributions
field+0.012
str+0.012
 sudden+0.011
 raft+0.010
aman+0.010
Neg logit contributions
 Give-0.013
 agrees-0.013
 Go-0.013
ives-0.012
 disagrees-0.011
Token the
Token ID262
Feature activation+4.25e-03

Pos logit contributions
str+0.028
field+0.027
 sudden+0.023
cle+0.022
 raft+0.022
Neg logit contributions
 Give-0.034
 agrees-0.034
 Go-0.033
ives-0.031
 aback-0.030
Token noodles
Token ID31103
Feature activation-3.64e-02

Pos logit contributions
 agrees+0.009
 Give+0.009
 Go+0.009
ives+0.008
 Share+0.008
Neg logit contributions
field-0.006
str-0.006
 sudden-0.006
 wheat-0.006
 raft-0.006
TokenAs
Token ID1722
Feature activation-8.24e-02

Pos logit contributions
field+0.064
str+0.062
 sudden+0.056
 wheat+0.054
 raft+0.054
Neg logit contributions
 Give-0.100
 agrees-0.100
 Go-0.097
ives-0.091
 Share-0.089
Token they
Token ID484
Feature activation-1.45e-02

Pos logit contributions
str+0.129
field+0.124
pron+0.109
cker+0.106
 sudden+0.102
Neg logit contributions
 Go-0.165
 Give-0.157
 You-0.153
 Come-0.144
 agrees-0.144
Token baked
Token ID22979
Feature activation-3.28e-02

Pos logit contributions
str+0.028
ich+0.027
 wheat+0.025
pron+0.025
field+0.025
Neg logit contributions
 agrees-0.025
 Give-0.022
ives-0.022
 disagrees-0.021
 Share-0.021
Token the
Token ID262
Feature activation-9.79e-03

Pos logit contributions
field+0.060
str+0.059
 sudden+0.054
pron+0.051
aman+0.050
Neg logit contributions
 agrees-0.063
 Give-0.061
 Go-0.060
 Come-0.056
 Share-0.054
Token cookies
Token ID14746
Feature activation-6.62e-02

Pos logit contributions
str+0.014
field+0.014
 sudden+0.013
Wil+0.012
 raft+0.012
Neg logit contributions
 agrees-0.022
 Give-0.022
 Go-0.021
ives-0.020
 acts-0.019
Token,
Token ID11
Feature activation-9.67e-02

Pos logit contributions
 sudden+0.122
str+0.120
field+0.108
Wil+0.108
 pine+0.104
Neg logit contributions
 agrees-0.127
ives-0.116
 Give-0.114
 Go-0.110
 teaches-0.104
Token Lily
Token ID20037
Feature activation-1.59e-03

Pos logit contributions
str+0.204
field+0.202
aman+0.174
pron+0.171
cker+0.170
Neg logit contributions
 Give-0.154
 Go-0.151
 agrees-0.143
 You-0.140
 Come-0.130
Token's
Token ID338
Feature activation+1.04e-02

Pos logit contributions
str+0.003
field+0.003
 sudden+0.003
 raft+0.003
 wheat+0.002
Neg logit contributions
 agrees-0.003
 Give-0.003
 Go-0.003
ives-0.003
 disagrees-0.003
Token mom
Token ID1995
Feature activation-3.37e-02

Pos logit contributions
 agrees+0.024
 Give+0.023
 Go+0.022
ives+0.021
 demands+0.021
Neg logit contributions
str-0.015
field-0.015
Wil-0.013
 sudden-0.012
 raft-0.012
Token told
Token ID1297
Feature activation+1.76e-02

Pos logit contributions
field+0.061
str+0.060
 raft+0.056
 sudden+0.055
 wheat+0.055
Neg logit contributions
 agrees-0.065
 Give-0.063
 Go-0.058
ives-0.057
 teaches-0.055
Token her
Token ID607
Feature activation+3.59e-02

Pos logit contributions
 agrees+0.044
 Give+0.044
 Go+0.042
ives+0.041
 Share+0.039
Neg logit contributions
str-0.020
field-0.020
 sudden-0.019
 wheat-0.018
 raft-0.017
Token,
Token ID11
Feature activation-2.35e-02

Pos logit contributions
 agrees+0.092
 Give+0.091
 Go+0.088
ives+0.085
 Share+0.083
Neg logit contributions
str-0.042
field-0.041
 sudden-0.037
 wheat-0.034
 raft-0.034
Token "
Token ID366
Feature activation+4.08e-02

Pos logit contributions
field+0.054
str+0.053
 sudden+0.049
 raft+0.048
 wheat+0.048
Neg logit contributions
 Give-0.037
 Go-0.034
 agrees-0.033
 Come-0.031
 You-0.031
TokenNo
Token ID2949
Feature activation+7.44e-03

Pos logit contributions
 agrees+0.081
 Give+0.080
 Go+0.077
ives+0.073
 Share+0.070
Neg logit contributions
field-0.071
str-0.071
 sudden-0.066
 wheat-0.063
 raft-0.062
Token,
Token ID11
Feature activation-1.48e-02

Pos logit contributions
 agrees+0.020
 Give+0.020
ives+0.019
 Go+0.019
 Share+0.018
Neg logit contributions
str-0.008
field-0.007
 sudden-0.007
 raft-0.006
 wheat-0.006
Token Tim
Token ID5045
Feature activation+2.67e-02

Pos logit contributions
field+0.032
str+0.029
 sudden+0.027
pron+0.027
chard+0.025
Neg logit contributions
 Give-0.027
 Go-0.026
 agrees-0.025
 You-0.023
 Come-0.023
Token.
Token ID13
Feature activation+5.63e-02

Pos logit contributions
 agrees+0.068
 Give+0.067
ives+0.065
 Go+0.064
 Believe+0.062
Neg logit contributions
str-0.033
field-0.028
 which-0.025
 pine-0.025
 Wil-0.024
Token It
Token ID632
Feature activation+1.52e-02

Pos logit contributions
 agrees+0.073
 Give+0.063
ives+0.061
 Go+0.056
 disagrees+0.055
Neg logit contributions
str-0.136
field-0.135
 sudden-0.125
 raft-0.124
 wheat-0.124
Token is
Token ID318
Feature activation+3.00e-02

Pos logit contributions
 agrees+0.024
 Give+0.024
ives+0.024
 Go+0.022
 Share+0.022
Neg logit contributions
str-0.032
field-0.031
 raft-0.028
 pine-0.028
 wheat-0.027
Token not
Token ID407
Feature activation+1.72e-02

Pos logit contributions
 Go+0.053
 Give+0.053
 agrees+0.050
 teaches+0.047
 You+0.047
Neg logit contributions
 sudden-0.057
field-0.056
str-0.056
aman-0.049
iri-0.049
Token safe
Token ID3338
Feature activation+3.07e-02

Pos logit contributions
 Go+0.031
 Give+0.031
 agrees+0.031
 Come+0.028
 You+0.028
Neg logit contributions
field-0.033
 sudden-0.032
str-0.031
pron-0.027
aman-0.027
Token."
Token ID526
Feature activation+1.18e-02

Pos logit contributions
 agrees+0.063
 Give+0.061
 Go+0.058
ives+0.057
 Share+0.054
Neg logit contributions
field-0.051
str-0.049
 wheat-0.047
 which-0.047
 raft-0.045
Token!"
Token ID2474
Feature activation-1.93e-02

Pos logit contributions
field+0.009
str+0.008
 pine+0.007
 raft+0.007
 wheat+0.007
Neg logit contributions
 Give-0.010
 agrees-0.010
 Go-0.010
ives-0.009
 Share-0.009
Token Tim
Token ID5045
Feature activation+2.73e-02

Pos logit contributions
field+0.033
str+0.032
 sudden+0.030
pron+0.029
 wheat+0.028
Neg logit contributions
 agrees-0.040
 Give-0.040
 Go-0.038
ives-0.035
 Come-0.035
Token and
Token ID290
Feature activation-2.36e-02

Pos logit contributions
 Give+0.066
 agrees+0.065
 Go+0.064
ives+0.062
 Share+0.061
Neg logit contributions
str-0.032
field-0.032
 sudden-0.029
 pine-0.028
 which-0.027
Token Mia
Token ID32189
Feature activation+3.76e-03

Pos logit contributions
str+0.037
field+0.037
 sudden+0.034
 pine+0.033
 wheat+0.033
Neg logit contributions
 agrees-0.049
 Give-0.049
 Go-0.047
ives-0.046
 Share-0.044
Token said
Token ID531
Feature activation+3.80e-02

Pos logit contributions
 agrees+0.008
 Give+0.008
 Go+0.007
ives+0.007
 Share+0.007
Neg logit contributions
str-0.006
field-0.006
 sudden-0.006
 wheat-0.006
pron-0.005
Token.
Token ID13
Feature activation+7.62e-02

Pos logit contributions
 Give+0.082
 agrees+0.079
 Go+0.076
 disagrees+0.076
ives+0.075
Neg logit contributions
 they-0.052
field-0.052
str-0.049
 while-0.048
 which-0.048
Token They
Token ID1119
Feature activation+4.10e-03

Pos logit contributions
 agrees+0.143
 Give+0.139
 Go+0.134
ives+0.131
 disagrees+0.126
Neg logit contributions
field-0.135
str-0.128
 sudden-0.127
 which-0.124
 raft-0.121
Token waved
Token ID26834
Feature activation+3.73e-02

Pos logit contributions
 agrees+0.009
 Give+0.009
 Go+0.008
ives+0.008
 Share+0.008
Neg logit contributions
str-0.006
field-0.006
 sudden-0.006
 wheat-0.006
pron-0.006
Token and
Token ID290
Feature activation-4.57e-02

Pos logit contributions
 Give+0.080
 agrees+0.079
 Go+0.077
ives+0.074
 Share+0.071
Neg logit contributions
field-0.054
str-0.053
 sudden-0.052
 wheat-0.051
 raft-0.049
Token smiled
Token ID13541
Feature activation+5.31e-02

Pos logit contributions
str+0.083
field+0.082
pron+0.074
aman+0.074
Wil+0.074
Neg logit contributions
 agrees-0.092
 Give-0.084
 Go-0.079
 acts-0.076
 disagrees-0.075
Token.
Token ID13
Feature activation+7.25e-02

Pos logit contributions
 Give+0.103
 agrees+0.103
 Go+0.099
ives+0.094
 Share+0.089
Neg logit contributions
field-0.093
str-0.092
 sudden-0.087
 wheat-0.083
 raft-0.082
Token
Token ID241
Feature activation+5.83e-02

Pos logit contributions
 agrees+0.046
 Give+0.044
ives+0.042
 Go+0.041
 Come+0.039
Neg logit contributions
str-0.048
field-0.047
 wheat-0.043
 sudden-0.042
 raft-0.042
TokenGo
Token ID5247
Feature activation+5.88e-02

Pos logit contributions
 Give+0.119
 agrees+0.119
 Go+0.115
ives+0.108
 Come+0.104
Neg logit contributions
field-0.096
str-0.096
 sudden-0.088
 raft-0.084
 wheat-0.084
Token up
Token ID510
Feature activation-7.79e-03

Pos logit contributions
 agrees+0.117
 Give+0.116
 Go+0.111
ives+0.106
 Share+0.101
Neg logit contributions
field-0.101
str-0.101
 sudden-0.094
 wheat-0.090
 raft-0.089
Token again
Token ID757
Feature activation-1.90e-02

Pos logit contributions
str+0.013
field+0.013
 sudden+0.012
 wheat+0.012
 raft+0.012
Neg logit contributions
 Give-0.016
 agrees-0.016
 Go-0.015
ives-0.014
 Share-0.014
Token!
Token ID0
Feature activation+5.38e-02

Pos logit contributions
str+0.036
field+0.034
 sudden+0.033
pron+0.031
 raft+0.031
Neg logit contributions
 agrees-0.035
 Give-0.035
 Go-0.033
 Come-0.030
ives-0.030
Tokenâ
Token ID22940
Feature activation+5.57e-02

Pos logit contributions
 agrees+0.070
 consequences+0.058
ives+0.057
 disagrees+0.056
 acts+0.055
Neg logit contributions
str-0.123
field-0.120
Wil-0.116
 Wil-0.116
 sudden-0.115
Token
Token ID26391
Feature activation+4.84e-02

Pos logit contributions
 agrees+0.099
 Give+0.096
 Go+0.093
ives+0.089
 Share+0.083
Neg logit contributions
str-0.112
field-0.106
aster-0.094
 wheat-0.092
 raft-0.090
Token
Token ID198
Feature activation-3.69e-02

Pos logit contributions
 agrees+0.097
 Give+0.094
 Go+0.089
ives+0.087
 Share+0.083
Neg logit contributions
str-0.085
field-0.083
 sudden-0.075
 wheat-0.072
 which-0.071
Token
Token ID198
Feature activation-4.04e-02

Pos logit contributions
str+0.050
field+0.048
 sudden+0.045
 wheat+0.041
 pine+0.041
Neg logit contributions
 agrees-0.088
 Give-0.085
 Go-0.081
ives-0.079
 acts-0.079
TokenThe
Token ID464
Feature activation-1.33e-02

Pos logit contributions
field+0.040
str+0.038
 sudden+0.032
 raft+0.031
 wheat+0.031
Neg logit contributions
 Give-0.112
 agrees-0.110
 Go-0.108
ives-0.104
 Share-0.102
Token jet
Token ID12644
Feature activation-6.48e-03

Pos logit contributions
field+0.010
str+0.010
 sudden+0.009
 wheat+0.008
 raft+0.007
Neg logit contributions
 agrees-0.039
 Give-0.039
 Go-0.038
ives-0.036
 Share-0.035
Token white
Token ID2330
Feature activation+3.36e-02

Pos logit contributions
field+0.059
str+0.058
 sudden+0.053
aman+0.050
 raft+0.049
Neg logit contributions
 Give-0.081
 agrees-0.080
 Go-0.078
 teaches-0.072
 You-0.071
Token and
Token ID290
Feature activation-2.94e-02

Pos logit contributions
 agrees+0.074
 Give+0.073
 Go+0.070
ives+0.067
 Share+0.065
Neg logit contributions
field-0.050
str-0.050
 sudden-0.046
 wheat-0.044
 raft-0.043
Token has
Token ID468
Feature activation+2.27e-02

Pos logit contributions
field+0.055
str+0.055
 sudden+0.051
aman+0.048
 raft+0.048
Neg logit contributions
 Give-0.055
 agrees-0.054
 Go-0.053
 teaches-0.048
 You-0.047
Token no
Token ID645
Feature activation-1.18e-02

Pos logit contributions
 Give+0.049
 agrees+0.049
 Go+0.048
ives+0.044
 Share+0.043
Neg logit contributions
field-0.037
str-0.036
 sudden-0.031
 raft-0.031
pron-0.030
Token cord
Token ID15050
Feature activation+1.33e-02

Pos logit contributions
str+0.022
field+0.022
 sudden+0.019
 raft+0.019
Wil+0.018
Neg logit contributions
 agrees-0.022
 Give-0.022
 Go-0.021
ives-0.020
 acts-0.019
Token.
Token ID13
Feature activation+5.83e-02

Pos logit contributions
 agrees+0.025
 Give+0.025
 Go+0.024
ives+0.022
 Share+0.021
Neg logit contributions
field-0.024
str-0.024
 sudden-0.023
 wheat-0.022
 raft-0.022
Token She
Token ID1375
Feature activation+1.22e-04

Pos logit contributions
 agrees+0.117
ives+0.105
 Give+0.104
 acts+0.103
 Go+0.102
Neg logit contributions
 which-0.089
str-0.089
 wheat-0.084
 sudden-0.084
aster-0.083
Token says
Token ID1139
Feature activation+4.90e-02

Pos logit contributions
ives+0.000
 Give+0.000
 Go+0.000
 aback+0.000
 agrees+0.000
Neg logit contributions
field-0.000
 pine-0.000
 which-0.000
 sudden-0.000
 corn-0.000
Token it
Token ID340
Feature activation+5.18e-02

Pos logit contributions
 Give+0.115
 agrees+0.112
 Go+0.111
 Share+0.101
ives+0.101
Neg logit contributions
field-0.076
str-0.069
 sudden-0.067
 raft-0.059
pron-0.058
Token is
Token ID318
Feature activation+4.83e-02

Pos logit contributions
 agrees+0.068
 Give+0.062
ives+0.060
 Go+0.058
 aback+0.055
Neg logit contributions
str-0.117
field-0.115
 which-0.110
 they-0.108
 pine-0.108
Token a
Token ID257
Feature activation+3.49e-02

Pos logit contributions
 Give+0.098
 Go+0.097
 agrees+0.097
 teaches+0.088
 You+0.088
Neg logit contributions
field-0.082
str-0.080
 sudden-0.076
 raft-0.068
aman-0.067
Token They
Token ID1119
Feature activation+1.63e-03

Pos logit contributions
 agrees+0.136
 Give+0.127
 disagrees+0.122
 Go+0.119
ives+0.119
Neg logit contributions
str-0.102
field-0.102
 sudden-0.098
 which-0.096
 wheat-0.095
Token learned
Token ID4499
Feature activation+1.24e-02

Pos logit contributions
 Give+0.003
 agrees+0.003
 Go+0.003
ives+0.003
 Come+0.003
Neg logit contributions
field-0.003
str-0.003
 sudden-0.003
 wheat-0.002
 raft-0.002
Token that
Token ID326
Feature activation-1.30e-02

Pos logit contributions
 agrees+0.034
 Give+0.033
 Go+0.033
ives+0.031
 Share+0.031
Neg logit contributions
field-0.013
str-0.013
 sudden-0.011
pron-0.010
 raft-0.009
Token kindness
Token ID23887
Feature activation+9.69e-03

Pos logit contributions
field+0.020
str+0.020
 sudden+0.018
 wheat+0.017
 raft+0.017
Neg logit contributions
 agrees-0.028
 Give-0.028
 Go-0.027
ives-0.025
 Share-0.025
Token is
Token ID318
Feature activation+9.57e-03

Pos logit contributions
 Give+0.014
 agrees+0.014
 Go+0.013
 Come+0.012
ives+0.011
Neg logit contributions
field-0.021
str-0.021
 wheat-0.019
 which-0.019
 sudden-0.019
Token.
Token ID13
Feature activation+6.45e-02

Pos logit contributions
 agrees+0.018
 Give+0.018
 Go+0.018
 teaches+0.016
 Share+0.016
Neg logit contributions
field-0.018
str-0.017
 sudden-0.016
pron-0.015
 raft-0.015
Token<|endoftext|>
Token ID50256
Feature activation+6.20e-02

Pos logit contributions
 agrees+0.143
 Give+0.133
ives+0.128
 disagrees+0.127
 Go+0.126
Neg logit contributions
str-0.093
field-0.091
 sudden-0.087
 which-0.083
 wheat-0.082
TokenS
Token ID50
Feature activation+2.85e-02

Pos logit contributions
 agrees+0.164
 Give+0.152
 Go+0.146
 demands+0.145
ives+0.142
Neg logit contributions
field-0.070
str-0.061
 sudden-0.055
 wheat-0.054
 pine-0.053
Tokenara
Token ID3301
Feature activation+2.71e-02

Pos logit contributions
 agrees+0.054
 Give+0.053
 Go+0.050
ives+0.050
 Share+0.047
Neg logit contributions
field-0.050
 sudden-0.049
str-0.048
 raft-0.046
 wheat-0.046
Token likes
Token ID7832
Feature activation+9.89e-04

Pos logit contributions
 Give+0.066
 agrees+0.065
 Go+0.064
ives+0.060
 Share+0.058
Neg logit contributions
field-0.034
str-0.032
 wheat-0.028
 pine-0.028
 which-0.027
Token to
Token ID284
Feature activation+6.41e-03

Pos logit contributions
 Give+0.002
 Go+0.002
 agrees+0.002
 Share+0.001
 Come+0.001
Neg logit contributions
str-0.002
field-0.002
 sudden-0.002
aman-0.002
pron-0.002
Token was
Token ID373
Feature activation-4.68e-03

Pos logit contributions
field+0.036
str+0.036
 sudden+0.034
 raft+0.032
 wheat+0.032
Neg logit contributions
 agrees-0.034
 Give-0.033
 Go-0.032
ives-0.029
 Come-0.029
Token a
Token ID257
Feature activation-2.74e-03

Pos logit contributions
str+0.009
field+0.009
 sudden+0.008
 raft+0.008
ittens+0.008
Neg logit contributions
 Give-0.008
 agrees-0.008
 Go-0.008
 Come-0.007
 teaches-0.007
Token big
Token ID1263
Feature activation+4.34e-03

Pos logit contributions
field+0.004
str+0.004
aster+0.004
aman+0.004
cker+0.004
Neg logit contributions
 Give-0.006
 agrees-0.005
 Go-0.005
 Come-0.005
 Share-0.005
Token party
Token ID2151
Feature activation+3.93e-04

Pos logit contributions
 Give+0.009
 agrees+0.009
 Go+0.009
 Share+0.008
 disagrees+0.008
Neg logit contributions
str-0.007
field-0.007
 sudden-0.006
Wil-0.006
 wheat-0.006
Token inside
Token ID2641
Feature activation-4.62e-02

Pos logit contributions
 agrees+0.001
 Give+0.001
 Go+0.001
ives+0.001
 Share+0.001
Neg logit contributions
str-0.001
field-0.001
 sudden-0.001
 wheat-0.001
 raft-0.001
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
str+0.089
field+0.089
 sudden+0.084
 wheat+0.080
 raft+0.080
Neg logit contributions
 agrees-0.082
 Give-0.082
 Go-0.078
ives-0.073
 Share-0.070
Token All
Token ID1439
Feature activation-2.76e-02

Pos logit contributions
 agrees+0.040
 Give+0.039
 Go+0.037
ives+0.035
 disagrees+0.034
Neg logit contributions
field-0.039
str-0.039
 sudden-0.037
 wheat-0.035
 pine-0.035
Token of
Token ID286
Feature activation-5.93e-02

Pos logit contributions
field+0.041
str+0.041
 sudden+0.037
 raft+0.035
pron+0.034
Neg logit contributions
 agrees-0.063
 Give-0.062
 Go-0.060
ives-0.056
 teaches-0.055
Token her
Token ID607
Feature activation+1.82e-02

Pos logit contributions
str+0.121
field+0.116
 raft+0.099
cle+0.097
pron+0.096
Neg logit contributions
 agrees-0.099
 Go-0.095
 Give-0.095
 teaches-0.089
 Share-0.088
Token friends
Token ID2460
Feature activation-3.03e-02

Pos logit contributions
 Give+0.042
 agrees+0.041
 Go+0.040
ives+0.039
 Come+0.038
Neg logit contributions
field-0.025
 sudden-0.025
str-0.025
 wheat-0.022
 pine-0.021
Token and
Token ID290
Feature activation-4.96e-02

Pos logit contributions
field+0.042
str+0.042
 sudden+0.038
 wheat+0.036
 raft+0.035
Neg logit contributions
 agrees-0.070
 Give-0.069
 Go-0.067
ives-0.064
 Share-0.062
Tokenibrarian
Token ID35808
Feature activation-1.56e-02

Pos logit contributions
 agrees+0.028
 Give+0.027
 Go+0.026
 Come+0.024
 disagrees+0.024
Neg logit contributions
str-0.034
field-0.034
lf-0.031
aster-0.030
 peach-0.030
Token said
Token ID531
Feature activation+3.52e-02

Pos logit contributions
field+0.022
str+0.022
 sudden+0.020
 wheat+0.019
 raft+0.019
Neg logit contributions
 agrees-0.036
 Give-0.036
 Go-0.034
ives-0.033
 Share-0.032
Token.
Token ID13
Feature activation+7.25e-02

Pos logit contributions
 Give+0.083
 Go+0.080
 agrees+0.079
 Congratulations+0.079
 disagrees+0.078
Neg logit contributions
 they-0.043
 while-0.039
 which-0.038
str-0.036
 pumpkin-0.035
Token
Token ID198
Feature activation-1.78e-02

Pos logit contributions
 agrees+0.172
 Give+0.169
 Go+0.167
 acts+0.158
 disagrees+0.158
Neg logit contributions
field-0.084
 sudden-0.083
 which-0.080
 they-0.078
str-0.078
Token
Token ID198
Feature activation-2.19e-02

Pos logit contributions
str+0.026
field+0.026
 sudden+0.024
 wheat+0.023
 pine+0.023
Neg logit contributions
 agrees-0.039
 Give-0.039
 Go-0.038
ives-0.036
 disagrees-0.035
Token"
Token ID1
Feature activation+2.45e-02

Pos logit contributions
field+0.028
str+0.028
 sudden+0.025
 wheat+0.024
 raft+0.023
Neg logit contributions
 agrees-0.054
 Give-0.053
 Go-0.052
ives-0.049
 Share-0.048
TokenThank
Token ID10449
Feature activation+2.79e-02

Pos logit contributions
 agrees+0.048
 Give+0.048
 Go+0.046
ives+0.044
 Share+0.042
Neg logit contributions
str-0.043
field-0.043
 sudden-0.039
 raft-0.038
 wheat-0.038
Token you
Token ID345
Feature activation+3.22e-02

Pos logit contributions
 agrees+0.059
 Give+0.057
 fault+0.056
ives+0.055
 aback+0.055
Neg logit contributions
 they-0.036
str-0.036
field-0.035
 sudden-0.034
aster-0.034
Token,"
Token ID553
Feature activation-2.91e-02

Pos logit contributions
 Give+0.083
 agrees+0.081
 Go+0.079
ives+0.078
 disagrees+0.076
Neg logit contributions
str-0.034
field-0.032
 sudden-0.030
 wheat-0.029
 pine-0.029
Token Anna
Token ID11735
Feature activation+2.44e-02

Pos logit contributions
field+0.053
str+0.053
 sudden+0.050
 wheat+0.047
 raft+0.047
Neg logit contributions
 agrees-0.055
 Give-0.054
 Go-0.052
ives-0.049
 Share-0.047
Token and
Token ID290
Feature activation-2.38e-02

Pos logit contributions
 Give+0.058
 agrees+0.058
 Go+0.057
ives+0.054
 Share+0.053
Neg logit contributions
str-0.031
field-0.031
 sudden-0.027
 wheat-0.026
 pine-0.025
Token rainbow
Token ID27223
Feature activation-2.36e-02

Pos logit contributions
field+0.016
str+0.015
pron+0.014
chard+0.014
aster+0.014
Neg logit contributions
 agrees-0.031
 Give-0.031
 Go-0.029
 acts-0.027
 Share-0.027
Token.
Token ID13
Feature activation+2.41e-02

Pos logit contributions
str+0.049
 sudden+0.047
Wil+0.044
field+0.044
pron+0.043
Neg logit contributions
 agrees-0.039
 Give-0.036
 Go-0.035
ives-0.034
 acts-0.031
Token There
Token ID1318
Feature activation-2.48e-02

Pos logit contributions
 agrees+0.043
 Give+0.039
 Go+0.038
ives+0.037
 acts+0.037
Neg logit contributions
str-0.045
field-0.044
 which-0.041
 pine-0.041
 sudden-0.041
Token,
Token ID11
Feature activation-7.55e-02

Pos logit contributions
str+0.035
field+0.035
 sudden+0.032
 wheat+0.030
 raft+0.030
Neg logit contributions
 agrees-0.056
 Give-0.056
 Go-0.054
ives-0.052
 Share-0.050
Token he
Token ID339
Feature activation+2.85e-03

Pos logit contributions
str+0.117
field+0.116
aman+0.102
iri+0.099
pron+0.097
Neg logit contributions
 Give-0.171
 Go-0.163
 agrees-0.163
 You-0.153
 Come-0.149
Token found
Token ID1043
Feature activation+9.42e-03

Pos logit contributions
 agrees+0.006
 Give+0.006
 Go+0.006
ives+0.005
 disagrees+0.005
Neg logit contributions
str-0.004
field-0.004
 sudden-0.004
 wheat-0.004
aman-0.004
Token a
Token ID257
Feature activation-1.61e-03

Pos logit contributions
 agrees+0.022
 Give+0.022
 Go+0.021
ives+0.020
 Share+0.019
Neg logit contributions
field-0.013
str-0.013
 sudden-0.012
pron-0.011
 raft-0.011
Token big
Token ID1263
Feature activation+2.36e-03

Pos logit contributions
aster+0.003
ittens+0.003
field+0.002
aman+0.002
str+0.002
Neg logit contributions
 Give-0.003
 agrees-0.003
 Go-0.003
 Believe-0.002
 truth-0.002
Token box
Token ID3091
Feature activation+4.55e-03

Pos logit contributions
 Give+0.005
 agrees+0.005
 Go+0.004
 Share+0.004
 acts+0.004
Neg logit contributions
str-0.004
field-0.004
 sudden-0.003
Wil-0.003
ittens-0.003
Token of
Token ID286
Feature activation-2.46e-02

Pos logit contributions
 Give+0.010
 agrees+0.009
 Go+0.009
ives+0.009
 Share+0.008
Neg logit contributions
str-0.007
field-0.007
 sudden-0.007
 wheat-0.006
 raft-0.006
Token y
Token ID331
Feature activation+3.32e-02

Pos logit contributions
field+0.039
str+0.039
 sudden+0.032
aman+0.032
pron+0.031
Neg logit contributions
 Give-0.053
 agrees-0.053
 Go-0.051
 acts-0.047
 teaches-0.046
Token before
Token ID878
Feature activation-2.44e-02

Pos logit contributions
 agrees+0.105
 Give+0.104
 Go+0.100
ives+0.095
 Share+0.092
Neg logit contributions
field-0.075
str-0.075
 sudden-0.070
 wheat-0.065
 raft-0.065
Token.
Token ID13
Feature activation+2.57e-02

Pos logit contributions
str+0.057
field+0.052
pron+0.050
chard+0.048
 sudden+0.048
Neg logit contributions
 Give-0.034
 agrees-0.033
 Go-0.033
 Share-0.028
 You-0.028
Token He
Token ID679
Feature activation-5.83e-03

Pos logit contributions
 agrees+0.051
 Give+0.049
 Go+0.047
ives+0.047
 disagrees+0.044
Neg logit contributions
field-0.045
str-0.043
 sudden-0.041
 pine-0.039
 wheat-0.039
Token shouted
Token ID17293
Feature activation+6.33e-02

Pos logit contributions
field+0.011
str+0.011
aster+0.011
aman+0.010
 wheat+0.010
Neg logit contributions
 agrees-0.010
 Give-0.010
 Go-0.009
 teaches-0.008
 disagrees-0.008
Token,
Token ID11
Feature activation-2.44e-02

Pos logit contributions
 agrees+0.154
 Give+0.151
 Go+0.146
ives+0.144
 disagrees+0.138
Neg logit contributions
str-0.072
field-0.072
 sudden-0.069
 which-0.067
 raft-0.066
Token
Token ID6184
Feature activation+5.35e-02

Pos logit contributions
str+0.054
field+0.054
 sudden+0.050
aster+0.050
 wheat+0.049
Neg logit contributions
 Give-0.038
 agrees-0.035
 Go-0.034
 Come-0.032
 You-0.031
Token
Token ID95
Feature activation+5.24e-02

Pos logit contributions
 agrees+0.106
 Give+0.103
 Go+0.098
ives+0.096
 Come+0.090
Neg logit contributions
str-0.096
field-0.094
 sudden-0.084
 raft-0.083
 wheat-0.082
Token
Token ID26391
Feature activation+4.72e-02

Pos logit contributions
 agrees+0.100
 Give+0.090
 acts+0.087
 Go+0.086
ives+0.085
Neg logit contributions
str-0.099
field-0.092
aster-0.087
 which-0.083
Wil-0.081
Token
Token ID129
Feature activation+2.63e-02

Pos logit contributions
 agrees+0.102
 Give+0.100
 Go+0.095
ives+0.091
 Share+0.088
Neg logit contributions
str-0.076
field-0.074
 sudden-0.068
 wheat-0.063
 raft-0.062
Token
Token ID241
Feature activation+5.76e-02

Pos logit contributions
 agrees+0.047
 Give+0.045
 Go+0.042
ives+0.042
 acts+0.040
Neg logit contributions
str-0.052
field-0.051
 sudden-0.046
 wheat-0.046
 raft-0.045
TokenLook
Token ID8567
Feature activation+3.75e-02

Pos logit contributions
 Give+0.113
 agrees+0.113
 Go+0.108
ives+0.102
 Come+0.098
Neg logit contributions
field-0.100
str-0.100
 sudden-0.092
 raft-0.088
 wheat-0.088
Token their
Token ID511
Feature activation+1.45e-02

Pos logit contributions
str+0.035
field+0.035
 sudden+0.034
 wheat+0.032
 pine+0.031
Neg logit contributions
 agrees-0.038
 teaches-0.037
 Give-0.037
 Go-0.036
 Share-0.035
Token surprise
Token ID5975
Feature activation-1.45e-02

Pos logit contributions
 Give+0.029
 agrees+0.028
 Go+0.027
ives+0.026
 Come+0.025
Neg logit contributions
field-0.025
str-0.025
 sudden-0.024
 wheat-0.023
 raft-0.022
Token,
Token ID11
Feature activation-6.81e-02

Pos logit contributions
field+0.022
str+0.021
 sudden+0.020
pron+0.019
 wheat+0.018
Neg logit contributions
 agrees-0.031
 Give-0.031
 Go-0.030
ives-0.028
 Come-0.027
Token they
Token ID484
Feature activation+1.33e-04

Pos logit contributions
ittens+0.112
str+0.112
iri+0.104
pron+0.102
field+0.102
Neg logit contributions
 agrees-0.132
 Give-0.122
 teaches-0.121
 aback-0.119
 Go-0.118
Token all
Token ID477
Feature activation-2.04e-02

Pos logit contributions
 agrees+0.000
 Give+0.000
ives+0.000
 Go+0.000
 teaches+0.000
Neg logit contributions
str-0.000
field-0.000
 sudden-0.000
ittens-0.000
aster-0.000
Token agreed
Token ID4987
Feature activation+2.16e-02

Pos logit contributions
field+0.035
str+0.035
 sudden+0.034
ittens+0.032
Wil+0.032
Neg logit contributions
 agrees-0.041
 Give-0.038
 teaches-0.036
 Go-0.036
 disagrees-0.034
Token and
Token ID290
Feature activation-4.85e-02

Pos logit contributions
 agrees+0.047
 Give+0.047
 Go+0.045
ives+0.043
 Share+0.041
Neg logit contributions
str-0.033
field-0.033
 sudden-0.030
 wheat-0.028
 pine-0.028
Token the
Token ID262
Feature activation+8.99e-03

Pos logit contributions
str+0.070
field+0.069
pron+0.064
 sudden+0.064
ittens+0.061
Neg logit contributions
 agrees-0.112
 Give-0.106
 Go-0.103
 teaches-0.098
ives-0.097
Token team
Token ID1074
Feature activation-1.38e-02

Pos logit contributions
 agrees+0.024
 Give+0.024
 Go+0.023
ives+0.023
 Share+0.022
Neg logit contributions
field-0.009
str-0.009
 sudden-0.008
 wheat-0.008
 raft-0.007
Token of
Token ID286
Feature activation-3.86e-02

Pos logit contributions
str+0.022
field+0.021
 sudden+0.020
 wheat+0.019
 raft+0.019
Neg logit contributions
 agrees-0.030
 Give-0.029
 Go-0.028
ives-0.027
 Share-0.026
Token families
Token ID4172
Feature activation-3.48e-02

Pos logit contributions
str+0.074
field+0.073
pron+0.064
 sudden+0.063
Wil+0.062
Neg logit contributions
 agrees-0.071
 Give-0.070
 Go-0.068
 teaches-0.064
 Share-0.063
Token.
Token ID13
Feature activation+2.91e-02

Pos logit contributions
field+0.037
str+0.036
 sudden+0.034
 wheat+0.033
 raft+0.032
Neg logit contributions
 agrees-0.044
 Give-0.043
 Go-0.041
ives-0.039
 Share-0.038
Token Lucy
Token ID22162
Feature activation+4.32e-03

Pos logit contributions
 agrees+0.061
 Give+0.059
 Go+0.057
ives+0.056
 disagrees+0.053
Neg logit contributions
field-0.046
str-0.046
 sudden-0.043
 wheat-0.040
 which-0.040
Token was
Token ID373
Feature activation+8.86e-03

Pos logit contributions
 agrees+0.008
 Give+0.008
 Go+0.008
ives+0.007
 Share+0.007
Neg logit contributions
field-0.008
str-0.008
 sudden-0.007
 wheat-0.007
 raft-0.007
Token very
Token ID845
Feature activation+6.85e-03

Pos logit contributions
 agrees+0.017
 Give+0.017
 Go+0.017
ives+0.015
 Share+0.015
Neg logit contributions
field-0.016
str-0.015
 sudden-0.014
 raft-0.014
 wheat-0.014
Token sad
Token ID6507
Feature activation+2.21e-02

Pos logit contributions
 agrees+0.014
 Give+0.014
 Go+0.013
ives+0.013
 Share+0.012
Neg logit contributions
str-0.011
field-0.011
 sudden-0.011
 wheat-0.010
 raft-0.010
Token,
Token ID11
Feature activation-3.81e-02

Pos logit contributions
 agrees+0.049
 Give+0.049
 Go+0.047
ives+0.045
 Share+0.043
Neg logit contributions
str-0.033
field-0.033
 sudden-0.030
 wheat-0.028
 raft-0.028
Token but
Token ID475
Feature activation-5.26e-02

Pos logit contributions
str+0.076
field+0.076
 raft+0.067
 wheat+0.065
aster+0.065
Neg logit contributions
 Give-0.067
 agrees-0.064
 Go-0.062
 You-0.061
 aback-0.056
Token she
Token ID673
Feature activation-3.72e-03

Pos logit contributions
str+0.075
field+0.074
 raft+0.067
pron+0.066
 sudden+0.064
Neg logit contributions
 Give-0.121
 agrees-0.121
 Go-0.116
 Believe-0.109
 Come-0.109
Token tried
Token ID3088
Feature activation+2.67e-02

Pos logit contributions
field+0.006
str+0.006
 sudden+0.005
 wheat+0.005
 raft+0.005
Neg logit contributions
 agrees-0.008
 Give-0.008
 Go-0.008
ives-0.007
 Share-0.007
Token hard
Token ID1327
Feature activation+5.00e-03

Pos logit contributions
 agrees+0.051
 Give+0.051
 Go+0.049
ives+0.046
 Share+0.044
Neg logit contributions
field-0.048
str-0.048
 sudden-0.044
 wheat-0.042
 raft-0.042
Token not
Token ID407
Feature activation-3.29e-03

Pos logit contributions
 Give+0.012
 agrees+0.012
 Go+0.012
 Share+0.011
 Come+0.011
Neg logit contributions
field-0.006
str-0.006
 sudden-0.006
 which-0.006
 wheat-0.005
Token promised
Token ID8072
Feature activation+7.39e-03

Pos logit contributions
str+0.103
field+0.098
pron+0.091
 sudden+0.091
Wil+0.090
Neg logit contributions
 agrees-0.123
 Give-0.113
 Go-0.110
 acts-0.104
 teaches-0.103
Token to
Token ID284
Feature activation-1.35e-02

Pos logit contributions
 agrees+0.014
 Give+0.014
 Go+0.013
 Come+0.012
 Share+0.012
Neg logit contributions
field-0.014
str-0.014
 sudden-0.013
pron-0.012
 wheat-0.012
Token come
Token ID1282
Feature activation+2.40e-02

Pos logit contributions
str+0.026
 sudden+0.025
field+0.024
 wheat+0.023
pron+0.022
Neg logit contributions
 Give-0.024
 agrees-0.024
 Go-0.024
 Share-0.022
ives-0.021
Token back
Token ID736
Feature activation-1.02e-02

Pos logit contributions
 agrees+0.035
 Give+0.035
 Go+0.033
ives+0.030
 Share+0.029
Neg logit contributions
str-0.055
field-0.054
 sudden-0.051
 wheat-0.049
pron-0.049
Token again
Token ID757
Feature activation-3.18e-02

Pos logit contributions
str+0.019
field+0.017
 sudden+0.017
pron+0.016
Wil+0.015
Neg logit contributions
 agrees-0.020
 Give-0.019
 Go-0.019
ives-0.017
 Come-0.017
Token soon
Token ID2582
Feature activation-3.79e-02

Pos logit contributions
str+0.060
field+0.054
 sudden+0.053
pron+0.051
 raft+0.051
Neg logit contributions
 agrees-0.061
 Give-0.056
 Go-0.055
 Come-0.050
 teaches-0.050
Token.
Token ID13
Feature activation+2.40e-02

Pos logit contributions
str+0.074
field+0.074
 sudden+0.070
 wheat+0.066
 raft+0.066
Neg logit contributions
 agrees-0.067
 Give-0.066
 Go-0.063
ives-0.059
 Share-0.056
Token They
Token ID1119
Feature activation-1.43e-02

Pos logit contributions
 agrees+0.048
 Give+0.047
 Go+0.045
ives+0.044
 disagrees+0.042
Neg logit contributions
field-0.041
str-0.040
 sudden-0.037
 wheat-0.035
 raft-0.035
Token felt
Token ID2936
Feature activation-4.75e-03

Pos logit contributions
str+0.024
 sudden+0.024
field+0.023
pron+0.023
 wheat+0.022
Neg logit contributions
 agrees-0.029
 Give-0.025
 Go-0.025
ives-0.024
 teaches-0.024
Token so
Token ID523
Feature activation-2.12e-02

Pos logit contributions
field+0.007
str+0.007
 sudden+0.006
 wheat+0.006
 raft+0.006
Neg logit contributions
 agrees-0.011
 Give-0.011
 Go-0.010
ives-0.010
 Share-0.009
Token happy
Token ID3772
Feature activation-1.10e-02

Pos logit contributions
str+0.026
field+0.026
 sudden+0.024
 wheat+0.022
 raft+0.022
Neg logit contributions
 Give-0.052
 agrees-0.052
 Go-0.050
ives-0.048
 Share-0.047
Token encourage
Token ID7898
Feature activation+1.90e-02

Pos logit contributions
 Give+0.000
 agrees+0.000
 Go+0.000
 Share+0.000
ives+0.000
Neg logit contributions
str-0.000
field-0.000
 sudden-0.000
aman-0.000
 wheat-0.000
Token her
Token ID607
Feature activation+2.63e-02

Pos logit contributions
 agrees+0.049
 Give+0.048
ives+0.046
 Go+0.046
 Share+0.044
Neg logit contributions
 sudden-0.019
field-0.019
str-0.018
 wheat-0.017
 which-0.017
Token to
Token ID284
Feature activation-1.08e-02

Pos logit contributions
 agrees+0.054
 Give+0.054
 Go+0.052
ives+0.049
 Share+0.047
Neg logit contributions
field-0.044
str-0.044
 sudden-0.040
 wheat-0.038
 raft-0.038
Token draw
Token ID3197
Feature activation+1.26e-02

Pos logit contributions
str+0.023
field+0.022
 sudden+0.022
 wheat+0.020
 pine+0.020
Neg logit contributions
 Give-0.017
 agrees-0.017
 Go-0.016
 Share-0.016
 teaches-0.014
Token more
Token ID517
Feature activation-3.67e-02

Pos logit contributions
 agrees+0.029
 Give+0.028
 Go+0.027
ives+0.026
 Share+0.025
Neg logit contributions
field-0.018
str-0.018
 sudden-0.017
 raft-0.016
 wheat-0.016
Token,
Token ID11
Feature activation-4.80e-02

Pos logit contributions
field+0.067
str+0.066
 sudden+0.062
 wheat+0.059
 raft+0.058
Neg logit contributions
 agrees-0.070
 Give-0.069
 Go-0.066
ives-0.063
 Share-0.060
Token so
Token ID523
Feature activation-4.07e-02

Pos logit contributions
field+0.100
str+0.100
cle+0.083
 sudden+0.082
ich+0.081
Neg logit contributions
 Give-0.086
 agrees-0.083
 Go-0.083
 You-0.075
 Come-0.072
Token she
Token ID673
Feature activation+5.73e-04

Pos logit contributions
str+0.055
field+0.052
 raft+0.044
Wil+0.043
 wheat+0.042
Neg logit contributions
 Give-0.101
 agrees-0.099
 Go-0.098
 teaches-0.092
 You-0.090
Token got
Token ID1392
Feature activation+5.76e-03

Pos logit contributions
 agrees+0.001
 Give+0.001
 Go+0.001
ives+0.001
 teaches+0.001
Neg logit contributions
field-0.001
str-0.001
 sudden-0.001
 wheat-0.001
 raft-0.001
Token her
Token ID607
Feature activation+4.26e-02

Pos logit contributions
 agrees+0.013
 Give+0.013
 Go+0.013
ives+0.012
 Share+0.012
Neg logit contributions
str-0.008
field-0.008
 sudden-0.008
 wheat-0.007
 raft-0.007
Token a
Token ID257
Feature activation+9.17e-03

Pos logit contributions
 agrees+0.083
 Give+0.082
 Go+0.078
ives+0.074
 acts+0.072
Neg logit contributions
field-0.074
str-0.074
 sudden-0.066
Wil-0.064
 raft-0.064
Token he
Token ID339
Feature activation+3.14e-03

Pos logit contributions
str+0.090
ittens+0.089
field+0.088
aman+0.084
iri+0.083
Neg logit contributions
 agrees-0.084
 Give-0.083
 Go-0.075
 Come-0.073
 wants-0.073
Token took
Token ID1718
Feature activation+1.68e-02

Pos logit contributions
 agrees+0.006
 Give+0.006
 Go+0.005
ives+0.005
 acts+0.005
Neg logit contributions
str-0.005
field-0.005
aman-0.005
 wheat-0.005
 hurricane-0.005
Token a
Token ID257
Feature activation-1.26e-02

Pos logit contributions
 agrees+0.039
 Give+0.039
 Go+0.037
ives+0.036
 Come+0.034
Neg logit contributions
field-0.024
str-0.024
 sudden-0.021
 raft-0.020
 wheat-0.020
Token sip
Token ID31145
Feature activation-6.54e-03

Pos logit contributions
field+0.022
 Wil+0.019
ths+0.019
Wil+0.019
cle+0.019
Neg logit contributions
 Give-0.023
 agrees-0.022
ives-0.022
 aback-0.021
 Go-0.020
Token.
Token ID13
Feature activation+1.62e-02

Pos logit contributions
str+0.013
field+0.013
 sudden+0.012
 raft+0.011
 wheat+0.011
Neg logit contributions
 agrees-0.011
 Give-0.011
 Go-0.011
ives-0.010
 Share-0.010
Token The
Token ID383
Feature activation-1.00e-02

Pos logit contributions
 agrees+0.027
 Give+0.027
 Go+0.025
ives+0.024
 Share+0.023
Neg logit contributions
field-0.033
str-0.033
 sudden-0.031
 wheat-0.030
 raft-0.030
Token coffee
Token ID6891
Feature activation+1.86e-03

Pos logit contributions
field+0.016
str+0.015
Wil+0.013
chard+0.013
 Wil+0.013
Neg logit contributions
 Give-0.022
 agrees-0.021
 Go-0.020
ives-0.019
 acts-0.019
Token was
Token ID373
Feature activation+1.40e-02

Pos logit contributions
 agrees+0.003
 Give+0.003
 Go+0.003
ives+0.003
 disagrees+0.003
Neg logit contributions
str-0.003
field-0.003
 sudden-0.003
Wil-0.003
 raft-0.003
Token delicious
Token ID12625
Feature activation+2.13e-02

Pos logit contributions
 agrees+0.021
 Give+0.020
 Go+0.019
 teaches+0.019
 aback+0.018
Neg logit contributions
field-0.031
str-0.029
cle-0.027
 raft-0.026
aman-0.026
Token and
Token ID290
Feature activation-5.16e-02

Pos logit contributions
 agrees+0.042
 Give+0.042
 Go+0.040
ives+0.038
 Share+0.037
Neg logit contributions
str-0.036
field-0.036
 sudden-0.034
 wheat-0.032
 raft-0.032
Token the
Token ID262
Feature activation+6.04e-03

Pos logit contributions
field+0.091
str+0.088
aman+0.081
Wil+0.078
 raft+0.077
Neg logit contributions
 agrees-0.099
 Give-0.095
 Go-0.091
 teaches-0.089
 You-0.085
Token mom
Token ID1995
Feature activation-2.36e-02

Pos logit contributions
 agrees+0.054
 Give+0.054
 Go+0.053
ives+0.049
 Come+0.049
Neg logit contributions
str-0.029
field-0.027
 sudden-0.024
 wheat-0.022
Wil-0.022
Token found
Token ID1043
Feature activation+8.63e-03

Pos logit contributions
str+0.042
field+0.042
 sudden+0.040
 raft+0.039
 wheat+0.038
Neg logit contributions
 agrees-0.045
 Give-0.044
 Go-0.042
ives-0.040
 Come-0.039
Token a
Token ID257
Feature activation+1.10e-03

Pos logit contributions
 Give+0.018
 agrees+0.018
 Go+0.018
ives+0.016
 aback+0.016
Neg logit contributions
str-0.014
field-0.014
 sudden-0.012
 raft-0.012
pron-0.011
Token special
Token ID2041
Feature activation+1.24e-02

Pos logit contributions
 Give+0.002
 agrees+0.002
 Go+0.002
 disagrees+0.002
 teaches+0.002
Neg logit contributions
field-0.002
str-0.002
aster-0.002
Wil-0.002
ittens-0.002
Token fl
Token ID781
Feature activation+2.16e-02

Pos logit contributions
 Give+0.025
 agrees+0.025
 Go+0.024
 acts+0.022
ives+0.022
Neg logit contributions
str-0.020
field-0.020
 sudden-0.019
Wil-0.017
 wheat-0.017
Tokenute
Token ID1133
Feature activation-4.31e-03

Pos logit contributions
 Give+0.052
 agrees+0.052
 Go+0.050
ives+0.048
 Share+0.046
Neg logit contributions
str-0.028
field-0.027
 sudden-0.025
 wheat-0.023
 raft-0.022
Token for
Token ID329
Feature activation-1.92e-02

Pos logit contributions
str+0.007
field+0.007
 sudden+0.007
 wheat+0.006
Wil+0.006
Neg logit contributions
 Give-0.009
 agrees-0.009
 Go-0.008
ives-0.008
 Come-0.007
Token her
Token ID607
Feature activation+2.91e-02

Pos logit contributions
str+0.034
field+0.033
 raft+0.029
 sudden+0.028
pron+0.027
Neg logit contributions
 Go-0.039
 Give-0.038
 agrees-0.036
 You-0.034
 Share-0.033
Token daughter
Token ID4957
Feature activation-1.42e-02

Pos logit contributions
 agrees+0.058
 Give+0.058
 Go+0.056
ives+0.052
 Share+0.050
Neg logit contributions
str-0.049
field-0.049
 sudden-0.046
 wheat-0.043
 raft-0.043
Token,
Token ID11
Feature activation-3.98e-02

Pos logit contributions
str+0.027
field+0.027
 sudden+0.025
 wheat+0.024
 raft+0.024
Neg logit contributions
 agrees-0.025
 Give-0.025
 Go-0.024
ives-0.023
 Share-0.021
Token one
Token ID530
Feature activation-2.45e-02

Pos logit contributions
str+0.069
field+0.067
 sudden+0.058
 raft+0.055
aster+0.054
Neg logit contributions
 Give-0.084
 agrees-0.081
 Go-0.080
 Come-0.074
 You-0.073
TokenWhen
Token ID2215
Feature activation-8.20e-02

Pos logit contributions
field+0.050
str+0.046
 raft+0.042
 wheat+0.040
 pumpkin+0.039
Neg logit contributions
 Give-0.102
 Go-0.100
 agrees-0.100
ives-0.094
 Come-0.090
Token she
Token ID673
Feature activation-1.56e-02

Pos logit contributions
str+0.139
cker+0.134
field+0.123
 raft+0.115
eh+0.113
Neg logit contributions
 Go-0.164
 Give-0.161
 You-0.156
 agrees-0.149
 Come-0.148
Token went
Token ID1816
Feature activation+9.79e-03

Pos logit contributions
str+0.025
field+0.025
 wheat+0.024
 raft+0.024
 sudden+0.023
Neg logit contributions
 Give-0.033
 agrees-0.032
 Go-0.031
ives-0.029
 Come-0.028
Token back
Token ID736
Feature activation-1.53e-02

Pos logit contributions
 Give+0.017
 agrees+0.016
 Go+0.016
ives+0.015
 Come+0.014
Neg logit contributions
str-0.020
field-0.020
 sudden-0.020
 wheat-0.018
pron-0.018
Token home
Token ID1363
Feature activation-3.15e-02

Pos logit contributions
str+0.029
field+0.028
 sudden+0.028
pron+0.026
 raft+0.026
Neg logit contributions
 Give-0.028
 agrees-0.028
 Go-0.026
ives-0.025
 Come-0.025
Token,
Token ID11
Feature activation-8.16e-02

Pos logit contributions
str+0.049
field+0.048
 sudden+0.045
 wheat+0.045
 raft+0.045
Neg logit contributions
 agrees-0.067
 Give-0.066
 Go-0.063
ives-0.060
 Share-0.058
Token the
Token ID262
Feature activation-5.04e-03

Pos logit contributions
str+0.137
field+0.133
cker+0.124
 raft+0.119
iri+0.114
Neg logit contributions
 Give-0.159
 Go-0.159
 agrees-0.156
 You-0.153
 Come-0.144
Token girl
Token ID2576
Feature activation-1.20e-02

Pos logit contributions
str+0.008
field+0.008
 raft+0.007
 sudden+0.006
 wheat+0.006
Neg logit contributions
 Give-0.011
 agrees-0.011
 Go-0.011
ives-0.010
 disagrees-0.010
Token was
Token ID373
Feature activation-7.02e-03

Pos logit contributions
str+0.021
field+0.021
 raft+0.019
 wheat+0.019
 sudden+0.019
Neg logit contributions
 agrees-0.024
 Give-0.023
 Go-0.022
ives-0.021
 Come-0.020
Token full
Token ID1336
Feature activation+7.06e-03

Pos logit contributions
field+0.013
str+0.013
 raft+0.012
 wheat+0.012
 sudden+0.011
Neg logit contributions
 Give-0.012
 agrees-0.012
 Go-0.012
 Share-0.011
 aback-0.011
Token of
Token ID286
Feature activation-2.90e-02

Pos logit contributions
 agrees+0.015
 Give+0.015
 Go+0.014
ives+0.013
 Share+0.013
Neg logit contributions
str-0.012
field-0.011
 sudden-0.011
 raft-0.010
 wheat-0.010
Token her
Token ID607
Feature activation+2.92e-02

Pos logit contributions
field+0.035
str+0.033
cle+0.027
aman+0.027
 sudden+0.027
Neg logit contributions
 Give-0.028
 Go-0.028
 agrees-0.026
 Share-0.024
ives-0.023
Token toys
Token ID14958
Feature activation-4.08e-02

Pos logit contributions
 Give+0.035
 agrees+0.035
 Go+0.033
 acts+0.029
 disagrees+0.029
Neg logit contributions
field-0.068
str-0.068
aster-0.068
aman-0.063
iri-0.060
Token,
Token ID11
Feature activation-5.78e-02

Pos logit contributions
str+0.076
field+0.072
 sudden+0.070
 wheat+0.065
Wil+0.065
Neg logit contributions
 agrees-0.077
 Give-0.075
 Go-0.072
ives-0.070
 disagrees-0.066
Token but
Token ID475
Feature activation-4.64e-02

Pos logit contributions
field+0.124
str+0.122
aman+0.109
 sudden+0.105
cle+0.103
Neg logit contributions
 Give-0.094
 Go-0.089
 agrees-0.083
 You-0.083
 disagrees-0.077
Token one
Token ID530
Feature activation-2.99e-02

Pos logit contributions
str+0.085
field+0.085
 sudden+0.081
 raft+0.074
 wheat+0.070
Neg logit contributions
 Give-0.090
 Go-0.087
 agrees-0.079
 You-0.079
 Come-0.076
Token day
Token ID1110
Feature activation-5.18e-02

Pos logit contributions
field+0.042
str+0.039
 sudden+0.035
 raft+0.034
Wil+0.033
Neg logit contributions
 Give-0.072
 Go-0.068
 agrees-0.068
 Come-0.064
 disagrees-0.063
Token she
Token ID673
Feature activation+5.53e-03

Pos logit contributions
field+0.106
str+0.088
 raft+0.087
aman+0.086
cker+0.085
Neg logit contributions
 Give-0.104
 Go-0.098
 You-0.086
 Come-0.085
 agrees-0.082
Token lost
Token ID2626
Feature activation+9.08e-03

Pos logit contributions
 agrees+0.013
 Give+0.013
 Go+0.013
ives+0.012
 Share+0.012
Neg logit contributions
field-0.007
str-0.007
 sudden-0.007
 wheat-0.006
 raft-0.006
Token the
Token ID262
Feature activation+1.81e-02

Pos logit contributions
 Give+0.017
 Go+0.017
 agrees+0.016
 consequences+0.015
 You+0.015
Neg logit contributions
str-0.017
field-0.016
 sudden-0.015
aman-0.013
 raft-0.013
Token key
Token ID1994
Feature activation-1.66e-02

Pos logit contributions
 agrees+0.038
 Give+0.038
 Go+0.036
ives+0.035
 Share+0.033
Neg logit contributions
field-0.029
str-0.029
 sudden-0.026
 wheat-0.025
 raft-0.025
Token to
Token ID284
Feature activation-1.43e-02

Pos logit contributions
str+0.031
field+0.030
 sudden+0.030
 wheat+0.028
 raft+0.027
Neg logit contributions
 Give-0.030
 agrees-0.029
 Go-0.029
ives-0.027
 You-0.025
Token Tim
Token ID5045
Feature activation+3.70e-03

Pos logit contributions
field+0.168
pron+0.151
str+0.151
iri+0.146
aman+0.145
Neg logit contributions
 Give-0.110
 Go-0.107
 agrees-0.103
 You-0.099
 Share-0.096
Token always
Token ID1464
Feature activation-2.10e-02

Pos logit contributions
 agrees+0.009
 Give+0.008
 Go+0.008
ives+0.008
 Share+0.008
Neg logit contributions
field-0.005
str-0.005
 sudden-0.005
 wheat-0.004
 raft-0.004
Token helped
Token ID4193
Feature activation-1.07e-03

Pos logit contributions
field+0.030
str+0.030
 sudden+0.027
 wheat+0.026
 raft+0.025
Neg logit contributions
 agrees-0.048
 Give-0.048
 Go-0.046
ives-0.044
 Share-0.042
Token his
Token ID465
Feature activation+1.35e-02

Pos logit contributions
field+0.002
str+0.002
 sudden+0.001
 raft+0.001
pron+0.001
Neg logit contributions
 Give-0.002
 agrees-0.002
 Go-0.002
ives-0.002
 Share-0.002
Token mom
Token ID1995
Feature activation-4.04e-02

Pos logit contributions
 agrees+0.037
 Give+0.037
 Go+0.035
ives+0.034
 Share+0.033
Neg logit contributions
str-0.014
field-0.014
 sudden-0.012
 wheat-0.011
 raft-0.010
Token and
Token ID290
Feature activation-5.93e-02

Pos logit contributions
str+0.075
field+0.072
 sudden+0.070
 raft+0.065
aman+0.064
Neg logit contributions
 agrees-0.076
 Give-0.074
ives-0.071
 Go-0.070
 Share-0.066
Token dad
Token ID9955
Feature activation-3.51e-02

Pos logit contributions
str+0.086
field+0.082
 sudden+0.078
ittens+0.071
 pine+0.071
Neg logit contributions
 agrees-0.130
 Give-0.127
 Go-0.124
 Share-0.119
 teaches-0.117
Token,
Token ID11
Feature activation-4.71e-02

Pos logit contributions
str+0.057
field+0.056
 sudden+0.053
 raft+0.049
 wheat+0.049
Neg logit contributions
 agrees-0.074
 Give-0.073
 Go-0.070
ives-0.068
 Share-0.065
Token even
Token ID772
Feature activation-5.96e-02

Pos logit contributions
field+0.086
str+0.085
 sudden+0.078
pron+0.072
aman+0.072
Neg logit contributions
 agrees-0.087
 Give-0.086
 Go-0.085
ives-0.079
 Come-0.077
Token in
Token ID287
Feature activation-3.31e-02

Pos logit contributions
field+0.110
str+0.108
aman+0.097
 raft+0.092
Wil+0.090
Neg logit contributions
 Give-0.115
 agrees-0.114
 Go-0.109
 acts-0.103
ives-0.103
Token the
Token ID262
Feature activation-1.18e-02

Pos logit contributions
str+0.046
field+0.044
 sudden+0.039
 raft+0.037
 wheat+0.036
Neg logit contributions
 Give-0.077
 agrees-0.077
 Go-0.075
 teaches-0.070
 Share-0.070
Token all
Token ID477
Feature activation-2.58e-02

Pos logit contributions
str+0.015
field+0.015
 sudden+0.014
 wheat+0.013
 raft+0.013
Neg logit contributions
 agrees-0.019
 Give-0.019
 Go-0.018
ives-0.017
 Share-0.016
Token day
Token ID1110
Feature activation-3.91e-02

Pos logit contributions
 sudden+0.038
field+0.038
str+0.035
 raft+0.033
pron+0.032
Neg logit contributions
 Give-0.056
 agrees-0.056
 Go-0.054
 You-0.052
 teaches-0.051
Token long
Token ID890
Feature activation-2.18e-02

Pos logit contributions
str+0.059
field+0.056
 sudden+0.050
 raft+0.048
aman+0.048
Neg logit contributions
 Give-0.085
 Go-0.084
 agrees-0.083
ives-0.081
 You-0.075
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
field+0.040
str+0.039
 sudden+0.038
 wheat+0.036
 which+0.035
Neg logit contributions
 agrees-0.041
 Give-0.040
 Go-0.038
ives-0.036
 Share-0.035
Token One
Token ID1881
Feature activation-3.72e-02

Pos logit contributions
 agrees+0.027
 Give+0.026
 Go+0.025
ives+0.025
 disagrees+0.023
Neg logit contributions
field-0.027
str-0.027
 sudden-0.025
 pine-0.024
 raft-0.024
Token day
Token ID1110
Feature activation-6.24e-02

Pos logit contributions
field+0.067
 sudden+0.063
str+0.060
aman+0.060
 raft+0.058
Neg logit contributions
 Give-0.068
 Go-0.062
 agrees-0.060
 Come-0.060
 You-0.059
Token,
Token ID11
Feature activation-7.99e-02

Pos logit contributions
field+0.132
str+0.114
pron+0.106
 sudden+0.102
iri+0.101
Neg logit contributions
 Give-0.119
 Go-0.113
 Come-0.105
 agrees-0.102
 You-0.101
Token Tim
Token ID5045
Feature activation+1.61e-02

Pos logit contributions
field+0.217
str+0.207
iri+0.190
 flock+0.189
 crop+0.184
Neg logit contributions
 Give-0.082
 You-0.082
 Go-0.074
 Come-0.066
 Hide-0.056
Tokenmy
Token ID1820
Feature activation-4.15e-03

Pos logit contributions
 agrees+0.032
 Give+0.032
 Go+0.031
ives+0.029
 Share+0.028
Neg logit contributions
str-0.027
field-0.027
 sudden-0.026
 wheat-0.024
 pine-0.024
Token was
Token ID373
Feature activation+9.15e-03

Pos logit contributions
str+0.008
field+0.008
 sudden+0.007
 wheat+0.007
 raft+0.007
Neg logit contributions
 agrees-0.008
 Give-0.008
 Go-0.007
ives-0.007
 Share-0.007
Token walking
Token ID6155
Feature activation+3.30e-02

Pos logit contributions
 Give+0.018
 Go+0.017
 agrees+0.016
 Come+0.015
ives+0.015
Neg logit contributions
field-0.017
str-0.017
 raft-0.015
aman-0.014
 wheat-0.014
Token the
Token ID262
Feature activation+9.73e-03

Pos logit contributions
str+0.023
field+0.023
 sudden+0.021
pron+0.019
 raft+0.019
Neg logit contributions
 Give-0.041
 agrees-0.040
 Go-0.040
 Share-0.037
 Come-0.037
Token park
Token ID3952
Feature activation-1.28e-02

Pos logit contributions
 Give+0.018
 Go+0.017
 agrees+0.016
 Share+0.015
ives+0.015
Neg logit contributions
str-0.018
field-0.017
apy-0.016
 sudden-0.016
ittens-0.016
Token with
Token ID351
Feature activation-2.83e-02

Pos logit contributions
str+0.024
field+0.023
 sudden+0.023
 raft+0.021
 wheat+0.021
Neg logit contributions
 Give-0.024
 agrees-0.024
 Go-0.023
ives-0.021
 Come-0.021
Token her
Token ID607
Feature activation+2.57e-02

Pos logit contributions
str+0.044
field+0.044
 sudden+0.039
 raft+0.037
 wheat+0.036
Neg logit contributions
 Give-0.063
 agrees-0.063
 Go-0.061
ives-0.057
 Share-0.056
Token mom
Token ID1995
Feature activation-3.91e-02

Pos logit contributions
 Give+0.060
 agrees+0.060
 Go+0.058
ives+0.055
 Share+0.054
Neg logit contributions
str-0.034
field-0.034
aster-0.030
 sudden-0.029
 wheat-0.027
Token and
Token ID290
Feature activation-5.66e-02

Pos logit contributions
str+0.086
field+0.085
 sudden+0.083
 raft+0.077
apy+0.077
Neg logit contributions
 Give-0.058
 agrees-0.058
 Go-0.054
ives-0.052
 Come-0.050
Token saw
Token ID2497
Feature activation-4.79e-03

Pos logit contributions
field+0.091
str+0.087
 sudden+0.082
ittens+0.080
 pine+0.080
Neg logit contributions
 Give-0.113
 agrees-0.109
 Go-0.107
 Come-0.106
 disagrees-0.102
Token some
Token ID617
Feature activation-3.00e-02

Pos logit contributions
field+0.009
str+0.009
pron+0.008
 sudden+0.008
cle+0.008
Neg logit contributions
 Give-0.009
 agrees-0.009
 Go-0.009
 Come-0.008
 Share-0.008
Token noisy
Token ID31210
Feature activation-1.36e-03

Pos logit contributions
field+0.050
str+0.048
 sudden+0.044
pron+0.041
aster+0.041
Neg logit contributions
 agrees-0.063
 Give-0.063
 Go-0.060
ives-0.057
 Share-0.055
Token kids
Token ID3988
Feature activation-1.60e-02

Pos logit contributions
str+0.002
field+0.002
 sudden+0.002
 wheat+0.002
 raft+0.002
Neg logit contributions
 agrees-0.003
 Give-0.003
 Go-0.003
ives-0.003
 Share-0.002
Token playing
Token ID2712
Feature activation+1.31e-02

Pos logit contributions
field+0.024
str+0.024
 sudden+0.022
 wheat+0.021
 raft+0.021
Neg logit contributions
 agrees-0.035
 Give-0.035
 Go-0.033
ives-0.032
 Share-0.031