TOP ACTIVATIONS
MAX = 5.166

good friend to everyone . One day , Yoga
the sun every day . One day , Lily
a great time together . One day , Tim
made them so happy . Soon , Lily became
But Lily never listened . One day , Lily
and always played together . One day , Lucy
loudly while jumping around . One day , Freddie
loved to play together . One sunny day ,
Max was always grateful . One day , Lily
fun in her class . One day , the
you " a lot . One day , Sam
songs and play games . One day , they
together all day long . The moral of the
explore the big fields . One day , Lily
on adventures every day . One day , Bob
awake and never tired . One day , the
but he never listened . One day , Tim
play together every day . One day , Tim
friends all day long . One day , the
someone to help him . One day , a

MIN ACTIVATIONS
MIN = -5.708

have a lot of buttons . What are you doing ?"
can make your own bread . Just add some water ,
both add some chocolate chips . But not too many ,
. Here is a teaspoon . Be careful not to spill
use some of my paints . We can paint together !"
. " Now you try . And have fun ."
. Be careful with them . They are sharp ."
Tom . Drink some milk . It will make your mouth
use any buttons you want . I have plenty . I
you are red and round . Tomato , tomato , you
you can peel the orange . You can use your fingers
is a nice lot ion . It smells like flowers .
. You measured the flour . You yielded the flour .
kids , have some cookies . They are chocolate chip .
have some of my blocks . We can build a new
rad ishes from our garden . They are pink and crunch
the red cr ay on . I like hearts too .
this wrench and this hammer . And you can hand me
these carrots and cel ery . They are crunch y and
you can keep the coins . You can buy something nice

INTERVAL 2.447 - 5.166
CONTAINS 2.902%

Once upon a time , there was a nos y bear
cake , and smiled . <|endoftext|> Once upon a time there
made something really special . <|endoftext|> Sam was a curious 3
after ! <|endoftext|> Once upon a time there was a lady
will be back soon ". Sure enough , the

INTERVAL -0.271 - 2.447
CONTAINS 57.039%

, and planes . She liked stories that made her laugh
still happy from seeing the bird display its feathers . When
the batter . They put the cups in a tray .
. We don 't have a bigger slide . But we
. " Thank you , Lily ," said the

INTERVAL -2.990 - -0.271
CONTAINS 38.587%

and make it go fast . He thought maybe Sara would
The pig gy bank !" said the little boy , proudly
waves . After a while of ro wing , she reached
" Yes , he is . He likes the snow
They hope the bear will not find them .

INTERVAL -5.708 - -2.990
CONTAINS 1.472%

M um said , “ Yes
" I want it to be big enough to fit
we can 't eat steam . But we can eat what
't like rac co ons . They are mean and they
Lily said . " We can share with you ."

Token good
Token ID922
Feature activation-2.18e-01

Pos logit contributions
 nowhere+0.022
awed+0.020
ud+0.020
icer+0.020
 sudden+0.019
Neg logit contributions
 give-0.031
 share-0.029
 listen-0.029
 wait-0.028
 leave-0.027
Token friend
Token ID1545
Feature activation+6.35e-01

Pos logit contributions
 give+0.011
 share+0.010
 listen+0.010
 leave+0.010
 wait+0.010
Neg logit contributions
 nowhere-0.008
 sudden-0.008
 year-0.007
 neighbourhood-0.007
icer-0.007
Token to
Token ID284
Feature activation-1.24e+00

Pos logit contributions
 sudden+0.029
 nowhere+0.027
 neighbourhood+0.026
icer+0.026
 seas+0.025
Neg logit contributions
 give-0.028
 share-0.026
 listen-0.025
 leave-0.024
 wait-0.023
Token everyone
Token ID2506
Feature activation-5.17e-01

Pos logit contributions
 give+0.036
 share+0.031
 listen+0.030
 leave+0.030
 wait+0.029
Neg logit contributions
 nowhere-0.066
 year-0.065
 sudden-0.065
 neighbourhood-0.063
 woods-0.062
Token.
Token ID13
Feature activation+9.60e-01

Pos logit contributions
 give+0.025
 share+0.024
 listen+0.023
 leave+0.023
 wait+0.022
Neg logit contributions
 nowhere-0.019
 sudden-0.018
 year-0.018
 neighbourhood-0.017
 woods-0.017
Token
Token ID198
Feature activation+5.17e+00

Pos logit contributions
 sudden+0.036
 nowhere+0.036
 year+0.033
 seas+0.032
eh+0.032
Neg logit contributions
 give-0.045
 share-0.044
 ask-0.042
 I-0.041
 listen-0.041
Token
Token ID198
Feature activation+2.02e+00

Pos logit contributions
 sudden+0.282
pit+0.282
eled+0.281
ymes+0.281
awed+0.278
Neg logit contributions
 ask-0.106
 give-0.103
 share-0.102
 But-0.097
 listen-0.095
TokenOne
Token ID3198
Feature activation+1.10e+00

Pos logit contributions
awed+0.095
 nowhere+0.094
 doorway+0.092
 sudden+0.091
eled+0.091
Neg logit contributions
 share-0.066
 listen-0.064
 give-0.060
 ask-0.058
 wait-0.052
Token day
Token ID1110
Feature activation-9.05e-01

Pos logit contributions
 sudden+0.035
 nowhere+0.034
icer+0.033
awed+0.032
 neighbourhood+0.031
Neg logit contributions
 listen-0.062
 give-0.061
 share-0.058
 wait-0.058
 leave-0.054
Token,
Token ID11
Feature activation+8.39e-02

Pos logit contributions
 give+0.040
 share+0.038
 listen+0.037
 leave+0.037
 wait+0.036
Neg logit contributions
 nowhere-0.036
 sudden-0.036
 year-0.035
 neighbourhood-0.033
 woods-0.033
Token Yoga
Token ID32856
Feature activation+1.35e+00

Pos logit contributions
 neighbourhood+0.003
 nowhere+0.003
 year+0.003
 sudden+0.003
ud+0.003
Neg logit contributions
 give-0.004
 share-0.003
 listen-0.003
 wait-0.003
 leave-0.003
Token the
Token ID262
Feature activation+2.90e+00

Pos logit contributions
actus+0.119
awed+0.118
arrow+0.118
 sudden+0.115
ud+0.113
Neg logit contributions
 give-0.070
 share-0.067
 listen-0.057
 leave-0.055
 I-0.053
Token sun
Token ID4252
Feature activation+3.96e-01

Pos logit contributions
awed+0.114
ud+0.111
 sudden+0.103
 nowhere+0.102
eled+0.098
Neg logit contributions
 give-0.129
 leave-0.125
 share-0.122
 listen-0.119
 wait-0.118
Token every
Token ID790
Feature activation+2.54e+00

Pos logit contributions
 nowhere+0.016
 sudden+0.016
 year+0.015
 neighbourhood+0.015
 woods+0.014
Neg logit contributions
 give-0.018
 share-0.017
 listen-0.016
 leave-0.016
 wait-0.016
Token day
Token ID1110
Feature activation+4.87e-01

Pos logit contributions
awed+0.089
icer+0.089
aws+0.086
actus+0.082
ud+0.081
Neg logit contributions
 give-0.128
 wait-0.123
 listen-0.119
 leave-0.119
 share-0.116
Token.
Token ID13
Feature activation+1.54e+00

Pos logit contributions
 nowhere+0.020
 sudden+0.019
icer+0.019
 year+0.019
 neighbourhood+0.018
Neg logit contributions
 give-0.021
 share-0.020
 listen-0.019
 wait-0.019
 leave-0.018
Token
Token ID198
Feature activation+5.09e+00

Pos logit contributions
 sudden+0.074
 nowhere+0.072
 seas+0.070
ymes+0.068
in+0.068
Neg logit contributions
 give-0.056
 share-0.054
 I-0.052
 ask-0.051
 listen-0.050
Token
Token ID198
Feature activation+2.62e+00

Pos logit contributions
 seas+0.233
led+0.231
awed+0.229
ymes+0.222
 pieces+0.217
Neg logit contributions
 ask-0.149
 listen-0.143
 share-0.142
 please-0.139
 give-0.137
TokenOne
Token ID3198
Feature activation+1.32e+00

Pos logit contributions
awed+0.114
 seas+0.111
 pieces+0.110
actus+0.108
 nowhere+0.107
Neg logit contributions
 listen-0.098
 share-0.097
 please-0.089
 give-0.088
 wait-0.085
Token day
Token ID1110
Feature activation-4.82e-01

Pos logit contributions
awed+0.046
 nowhere+0.046
icer+0.045
 sudden+0.045
ud+0.042
Neg logit contributions
 listen-0.069
 give-0.067
 wait-0.066
 share-0.064
 leave-0.060
Token,
Token ID11
Feature activation+8.65e-01

Pos logit contributions
 give+0.020
 share+0.018
 listen+0.018
 leave+0.017
 wait+0.017
Neg logit contributions
 nowhere-0.021
 sudden-0.021
 year-0.020
 neighbourhood-0.020
 woods-0.019
Token Lily
Token ID20037
Feature activation+7.39e-01

Pos logit contributions
 nowhere+0.040
ud+0.039
 neighbourhood+0.038
 year+0.038
icer+0.038
Neg logit contributions
 give-0.033
 share-0.031
 listen-0.030
 wait-0.029
 ask-0.028
Token a
Token ID257
Feature activation+2.92e-01

Pos logit contributions
icer+0.034
 nowhere+0.032
 sudden+0.032
ud+0.032
 year+0.032
Neg logit contributions
 give-0.037
 share-0.036
 listen-0.035
 leave-0.034
 wait-0.033
Token great
Token ID1049
Feature activation+1.01e-01

Pos logit contributions
 nowhere+0.011
 sudden+0.010
 year+0.010
icer+0.010
ud+0.010
Neg logit contributions
 give-0.014
 listen-0.013
 share-0.013
 wait-0.013
 leave-0.013
Token time
Token ID640
Feature activation-1.09e+00

Pos logit contributions
 nowhere+0.002
 sudden+0.002
 year+0.001
in+0.001
icer+0.001
Neg logit contributions
 give-0.007
 listen-0.006
 share-0.006
 leave-0.006
 wait-0.006
Token together
Token ID1978
Feature activation+2.25e-03

Pos logit contributions
 give+0.058
 share+0.055
 listen+0.053
 leave+0.053
 wait+0.052
Neg logit contributions
 nowhere-0.034
 year-0.033
 sudden-0.033
 neighbourhood-0.031
 woods-0.030
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
 nowhere+0.000
 sudden+0.000
 year+0.000
 neighbourhood+0.000
icer+0.000
Neg logit contributions
 give-0.000
 share-0.000
 listen-0.000
 leave-0.000
 wait-0.000
Token
Token ID198
Feature activation+5.08e+00

Pos logit contributions
 sudden+0.077
 seas+0.073
 nowhere+0.072
in+0.072
ymes+0.070
Neg logit contributions
 give-0.074
 share-0.074
 ask-0.071
 listen-0.070
 I-0.067
Token
Token ID198
Feature activation+2.68e+00

Pos logit contributions
 seas+0.265
ymes+0.258
 sudden+0.254
 pieces+0.252
awed+0.247
Neg logit contributions
 listen-0.135
 give-0.133
 ask-0.133
 share-0.127
 please-0.117
TokenOne
Token ID3198
Feature activation+5.60e-01

Pos logit contributions
 seas+0.130
actus+0.129
joy+0.127
ymes+0.126
 pieces+0.126
Neg logit contributions
 listen-0.088
 share-0.081
 give-0.073
 ask-0.072
 wait-0.070
Token day
Token ID1110
Feature activation-8.22e-01

Pos logit contributions
 nowhere+0.015
 sudden+0.015
icer+0.013
awed+0.013
 neighbourhood+0.013
Neg logit contributions
 listen-0.034
 give-0.034
 share-0.032
 wait-0.032
 leave-0.031
Token,
Token ID11
Feature activation+5.99e-01

Pos logit contributions
 give+0.036
 share+0.034
 listen+0.033
 leave+0.033
 wait+0.032
Neg logit contributions
 nowhere-0.033
 sudden-0.033
 year-0.032
 neighbourhood-0.030
 woods-0.030
Token Tim
Token ID5045
Feature activation+1.53e+00

Pos logit contributions
 neighbourhood+0.029
icer+0.028
ud+0.027
 nowhere+0.027
actus+0.027
Neg logit contributions
 give-0.023
 share-0.022
 listen-0.021
 wait-0.020
 please-0.019
Token made
Token ID925
Feature activation+4.26e-01

Pos logit contributions
 give+0.086
 share+0.083
 listen+0.081
 leave+0.080
 wait+0.079
Neg logit contributions
 nowhere-0.054
 year-0.051
 sudden-0.050
 neighbourhood-0.049
 woods-0.049
Token them
Token ID606
Feature activation-2.04e+00

Pos logit contributions
 nowhere+0.019
 sudden+0.018
icer+0.017
 year+0.017
 neighbourhood+0.017
Neg logit contributions
 give-0.019
 share-0.017
 listen-0.017
 leave-0.017
 ask-0.016
Token so
Token ID523
Feature activation-9.49e-01

Pos logit contributions
 give+0.082
 share+0.078
 listen+0.074
 ask+0.073
 leave+0.073
Neg logit contributions
 sudden-0.080
 year-0.079
 nowhere-0.078
 neighbourhood-0.076
 pieces-0.075
Token happy
Token ID3772
Feature activation-7.89e-01

Pos logit contributions
 give+0.053
 share+0.050
 listen+0.050
 leave+0.049
 wait+0.048
Neg logit contributions
 sudden-0.027
 nowhere-0.026
 year-0.026
 neighbourhood-0.024
 woods-0.023
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
 give+0.039
 share+0.037
 listen+0.036
 leave+0.036
 wait+0.035
Neg logit contributions
 nowhere-0.028
 sudden-0.027
 year-0.026
 neighbourhood-0.025
 woods-0.025
Token
Token ID198
Feature activation+5.06e+00

Pos logit contributions
 sudden+0.080
 nowhere+0.075
eh+0.073
ymes+0.072
 seas+0.072
Neg logit contributions
 give-0.071
 ask-0.070
 share-0.068
 I-0.065
 But-0.064
Token
Token ID198
Feature activation+1.86e+00

Pos logit contributions
 seas+0.283
ymes+0.273
 sudden+0.273
eh+0.270
 pieces+0.267
Neg logit contributions
 ask-0.115
 give-0.102
 But-0.099
 listen-0.096
 share-0.095
TokenSoon
Token ID28093
Feature activation-1.12e+00

Pos logit contributions
awed+0.087
 seas+0.087
 nowhere+0.087
 doorway+0.085
 sudden+0.083
Neg logit contributions
 share-0.060
 listen-0.059
 ask-0.056
 give-0.056
 please-0.050
Token,
Token ID11
Feature activation+3.97e-01

Pos logit contributions
 give+0.050
 share+0.047
 listen+0.046
 leave+0.045
 wait+0.044
Neg logit contributions
 sudden-0.044
 nowhere-0.044
 year-0.043
 neighbourhood-0.040
 woods-0.040
Token Lily
Token ID20037
Feature activation+4.31e-01

Pos logit contributions
ud+0.019
 neighbourhood+0.019
 year+0.019
 nowhere+0.019
 sudden+0.018
Neg logit contributions
 give-0.015
 share-0.014
 listen-0.013
 wait-0.012
 leave-0.012
Token became
Token ID2627
Feature activation-1.28e-01

Pos logit contributions
 nowhere+0.016
 sudden+0.015
 year+0.015
icer+0.014
 seas+0.014
Neg logit contributions
 give-0.021
 share-0.020
 leave-0.019
 listen-0.019
 ask-0.019
Token But
Token ID887
Feature activation-5.55e-01

Pos logit contributions
 nowhere+0.017
 sudden+0.017
in+0.016
 year+0.016
 seas+0.016
Neg logit contributions
 give-0.021
 share-0.020
 ask-0.019
 listen-0.019
 leave-0.018
Token Lily
Token ID20037
Feature activation+1.18e+00

Pos logit contributions
 give+0.024
 share+0.023
 listen+0.022
 leave+0.022
 wait+0.021
Neg logit contributions
 nowhere-0.023
 sudden-0.022
 year-0.022
 neighbourhood-0.021
 woods-0.021
Token never
Token ID1239
Feature activation-1.06e+00

Pos logit contributions
 seas+0.054
 sudden+0.052
 nowhere+0.052
icer+0.050
 woods+0.049
Neg logit contributions
 give-0.046
 listen-0.044
 share-0.043
 wait-0.043
 ask-0.042
Token listened
Token ID16399
Feature activation+4.41e-02

Pos logit contributions
 give+0.053
 share+0.051
 listen+0.048
 leave+0.048
 wait+0.047
Neg logit contributions
 nowhere-0.036
 year-0.034
 sudden-0.034
 neighbourhood-0.033
 woods-0.032
Token.
Token ID13
Feature activation+6.68e-01

Pos logit contributions
 nowhere+0.002
 sudden+0.002
 neighbourhood+0.002
icer+0.002
 woods+0.002
Neg logit contributions
 give-0.002
 listen-0.002
 share-0.002
 leave-0.001
 wait-0.001
Token
Token ID198
Feature activation+5.03e+00

Pos logit contributions
 sudden+0.029
 nowhere+0.029
 seas+0.027
 year+0.027
in+0.027
Neg logit contributions
 give-0.027
 share-0.026
 ask-0.025
 listen-0.025
 leave-0.024
Token
Token ID198
Feature activation+1.66e+00

Pos logit contributions
 seas+0.284
ymes+0.275
pit+0.271
 sudden+0.269
 pieces+0.267
Neg logit contributions
 ask-0.116
 give-0.110
 listen-0.102
 please-0.101
 But-0.096
TokenOne
Token ID3198
Feature activation+8.61e-01

Pos logit contributions
 seas+0.077
 nowhere+0.074
 pieces+0.073
 sudden+0.071
joy+0.071
Neg logit contributions
 listen-0.056
 share-0.053
 give-0.053
 ask-0.052
 please-0.050
Token day
Token ID1110
Feature activation-1.38e+00

Pos logit contributions
 nowhere+0.026
 sudden+0.025
 neighbourhood+0.023
awed+0.023
icer+0.023
Neg logit contributions
 give-0.049
 listen-0.049
 wait-0.046
 share-0.045
 leave-0.045
Token,
Token ID11
Feature activation-3.55e-01

Pos logit contributions
 give+0.067
 share+0.064
 leave+0.062
 listen+0.061
 wait+0.061
Neg logit contributions
 sudden-0.049
 nowhere-0.048
 year-0.047
 woods-0.042
 neighbourhood-0.042
Token Lily
Token ID20037
Feature activation+8.60e-01

Pos logit contributions
 give+0.015
 share+0.014
 listen+0.013
 leave+0.013
 wait+0.013
Neg logit contributions
 nowhere-0.015
 sudden-0.015
 year-0.015
 neighbourhood-0.015
 woods-0.014
Token and
Token ID290
Feature activation+5.45e-02

Pos logit contributions
 sudden+0.028
 nowhere+0.027
icer+0.025
 year+0.025
 neighbourhood+0.025
Neg logit contributions
 give-0.028
 share-0.027
 listen-0.026
 leave-0.025
 wait-0.025
Token always
Token ID1464
Feature activation-1.58e+00

Pos logit contributions
 nowhere+0.002
 sudden+0.002
 year+0.002
icer+0.002
 neighbourhood+0.002
Neg logit contributions
 give-0.002
 share-0.002
 listen-0.002
 leave-0.002
 ask-0.002
Token played
Token ID2826
Feature activation+3.21e-01

Pos logit contributions
 give+0.065
 share+0.060
 listen+0.058
 leave+0.058
 wait+0.057
Neg logit contributions
 nowhere-0.065
 year-0.063
 sudden-0.063
 neighbourhood-0.061
 woods-0.059
Token together
Token ID1978
Feature activation+1.75e+00

Pos logit contributions
 nowhere+0.014
 sudden+0.014
 year+0.013
ud+0.013
 neighbourhood+0.013
Neg logit contributions
 give-0.014
 share-0.013
 listen-0.012
 leave-0.012
 ask-0.012
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
 sudden+0.083
 nowhere+0.078
icer+0.077
awed+0.076
 neighbourhood+0.075
Neg logit contributions
 give-0.069
 share-0.066
 listen-0.061
 leave-0.060
 wait-0.059
Token
Token ID198
Feature activation+5.01e+00

Pos logit contributions
 sudden+0.070
 nowhere+0.067
 seas+0.066
ymes+0.064
 pieces+0.063
Neg logit contributions
 give-0.067
 share-0.064
 I-0.062
 ask-0.061
 listen-0.060
Token
Token ID198
Feature activation+2.72e+00

Pos logit contributions
awed+0.219
 seas+0.217
ymes+0.214
pit+0.214
led+0.212
Neg logit contributions
 share-0.154
 ask-0.151
 listen-0.150
 give-0.140
 learn-0.139
TokenOne
Token ID3198
Feature activation+1.48e+00

Pos logit contributions
awed+0.118
actus+0.117
 pieces+0.116
 seas+0.116
 nowhere+0.113
Neg logit contributions
 share-0.101
 listen-0.101
 please-0.091
 give-0.091
 wait-0.087
Token day
Token ID1110
Feature activation-4.35e-01

Pos logit contributions
awed+0.052
 sudden+0.051
 nowhere+0.051
icer+0.051
ud+0.047
Neg logit contributions
 listen-0.077
 give-0.075
 wait-0.072
 share-0.072
 ask-0.067
Token,
Token ID11
Feature activation+9.08e-01

Pos logit contributions
 give+0.018
 share+0.017
 listen+0.016
 leave+0.016
 wait+0.015
Neg logit contributions
 nowhere-0.019
 sudden-0.019
 year-0.018
 neighbourhood-0.018
 woods-0.017
Token Lucy
Token ID22162
Feature activation+1.52e+00

Pos logit contributions
 nowhere+0.041
ud+0.041
 neighbourhood+0.040
 year+0.040
icer+0.040
Neg logit contributions
 give-0.035
 share-0.033
 listen-0.032
 wait-0.031
 ask-0.030
Token loudly
Token ID23112
Feature activation+1.13e-01

Pos logit contributions
 nowhere+0.006
 sudden+0.006
 year+0.006
 neighbourhood+0.006
icer+0.006
Neg logit contributions
 give-0.007
 share-0.006
 listen-0.006
 leave-0.006
 wait-0.006
Token while
Token ID981
Feature activation+4.30e-01

Pos logit contributions
 nowhere+0.004
 sudden+0.004
 year+0.004
 neighbourhood+0.004
icer+0.004
Neg logit contributions
 give-0.005
 share-0.005
 listen-0.005
 leave-0.005
 wait-0.005
Token jumping
Token ID14284
Feature activation+2.71e-01

Pos logit contributions
 nowhere+0.019
 sudden+0.019
 year+0.018
 neighbourhood+0.018
icer+0.018
Neg logit contributions
 give-0.017
 share-0.016
 listen-0.016
 leave-0.016
 wait-0.015
Token around
Token ID1088
Feature activation+1.54e+00

Pos logit contributions
 sudden+0.011
 nowhere+0.011
 year+0.011
 neighbourhood+0.011
icer+0.010
Neg logit contributions
 give-0.012
 share-0.011
 listen-0.011
 leave-0.011
 wait-0.010
Token.
Token ID13
Feature activation+2.00e+00

Pos logit contributions
 sudden+0.071
 nowhere+0.065
 neighbourhood+0.064
awed+0.064
ud+0.064
Neg logit contributions
 give-0.062
 share-0.059
 listen-0.056
 leave-0.055
 wait-0.052
Token
Token ID198
Feature activation+4.98e+00

Pos logit contributions
 nowhere+0.091
 seas+0.090
 sudden+0.089
ymes+0.087
 pieces+0.087
Neg logit contributions
 give-0.073
 ask-0.071
 I-0.070
 share-0.069
 listen-0.067
Token
Token ID198
Feature activation+3.00e+00

Pos logit contributions
 seas+0.269
awed+0.249
led+0.248
ymes+0.247
actus+0.247
Neg logit contributions
 please-0.133
 ask-0.129
 listen-0.111
 I-0.108
 But-0.104
TokenOne
Token ID3198
Feature activation+1.77e+00

Pos logit contributions
 seas+0.136
 pieces+0.132
actus+0.130
awed+0.128
 nowhere+0.124
Neg logit contributions
 listen-0.107
 please-0.104
 share-0.103
 ask-0.096
 give-0.095
Token day
Token ID1110
Feature activation-4.84e-02

Pos logit contributions
awed+0.064
icer+0.064
 nowhere+0.062
rones+0.058
 sudden+0.058
Neg logit contributions
 listen-0.089
 give-0.088
 wait-0.084
 share-0.080
 ask-0.078
Token,
Token ID11
Feature activation+1.16e+00

Pos logit contributions
 give+0.002
 share+0.002
 listen+0.002
 leave+0.002
 wait+0.002
Neg logit contributions
 nowhere-0.002
 sudden-0.002
 year-0.002
 neighbourhood-0.002
icer-0.002
Token Freddie
Token ID31658
Feature activation+1.33e+00

Pos logit contributions
 nowhere+0.052
ud+0.051
 neighbourhood+0.051
icer+0.050
 year+0.050
Neg logit contributions
 give-0.045
 share-0.041
 listen-0.041
 wait-0.039
 ask-0.037
Token loved
Token ID6151
Feature activation+8.91e-01

Pos logit contributions
icer+0.056
 sudden+0.056
 nowhere+0.054
ud+0.051
smith+0.050
Neg logit contributions
 give-0.067
 share-0.064
 listen-0.062
 leave-0.061
 wait-0.058
Token to
Token ID284
Feature activation-2.68e+00

Pos logit contributions
 nowhere+0.039
icer+0.038
 sudden+0.037
 year+0.037
 neighbourhood+0.037
Neg logit contributions
 give-0.037
 share-0.035
 listen-0.033
 leave-0.032
 wait-0.031
Token play
Token ID711
Feature activation+4.74e-01

Pos logit contributions
 give+0.085
ourage+0.083
 Sorry+0.083
 share+0.076
 leave+0.076
Neg logit contributions
 year-0.120
<|endoftext|>-0.117
 sudden-0.115
 woods-0.110
 neighbourhood-0.110
Token together
Token ID1978
Feature activation+1.20e+00

Pos logit contributions
 sudden+0.020
 nowhere+0.020
 neighbourhood+0.019
 year+0.019
ud+0.019
Neg logit contributions
 give-0.020
 share-0.019
 listen-0.018
 wait-0.017
 ask-0.017
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
 sudden+0.057
 nowhere+0.054
 neighbourhood+0.052
icer+0.052
awed+0.051
Neg logit contributions
 give-0.047
 share-0.045
 listen-0.042
 wait-0.041
 leave-0.040
Token
Token ID198
Feature activation+4.96e+00

Pos logit contributions
 sudden+0.061
 nowhere+0.061
 year+0.058
 seas+0.058
in+0.057
Neg logit contributions
 give-0.038
 share-0.035
 ask-0.034
 I-0.033
 listen-0.033
Token
Token ID198
Feature activation+2.42e+00

Pos logit contributions
 seas+0.240
led+0.240
awed+0.239
actus+0.237
ymes+0.225
Neg logit contributions
 ask-0.138
 listen-0.127
 please-0.122
 share-0.121
 give-0.117
TokenOne
Token ID3198
Feature activation+1.26e+00

Pos logit contributions
actus+0.105
awed+0.105
 nowhere+0.104
 seas+0.104
 pieces+0.100
Neg logit contributions
 share-0.090
 listen-0.090
 give-0.083
 please-0.082
 ask-0.080
Token sunny
Token ID27737
Feature activation+2.13e+00

Pos logit contributions
awed+0.045
icer+0.044
 nowhere+0.043
 sudden+0.041
ud+0.041
Neg logit contributions
 listen-0.066
 give-0.065
 wait-0.062
 share-0.061
 please-0.057
Token day
Token ID1110
Feature activation-1.95e-01

Pos logit contributions
 nowhere+0.070
icer+0.070
 sudden+0.065
ymes+0.063
awed+0.061
Neg logit contributions
 give-0.107
 listen-0.106
 share-0.103
 wait-0.099
 ask-0.097
Token,
Token ID11
Feature activation+9.47e-01

Pos logit contributions
 give+0.008
 share+0.007
 listen+0.007
 leave+0.007
 wait+0.007
Neg logit contributions
 nowhere-0.009
 sudden-0.009
 year-0.009
 neighbourhood-0.008
 woods-0.008
Token Max
Token ID5436
Feature activation+4.00e-01

Pos logit contributions
 give+0.052
 share+0.048
 listen+0.047
 leave+0.047
 wait+0.046
Neg logit contributions
 nowhere-0.050
 sudden-0.049
 year-0.048
 neighbourhood-0.046
 woods-0.046
Token was
Token ID373
Feature activation+5.95e-01

Pos logit contributions
 nowhere+0.014
 sudden+0.014
 year+0.013
 neighbourhood+0.013
 seas+0.013
Neg logit contributions
 give-0.021
 share-0.020
 listen-0.019
 leave-0.019
 wait-0.018
Token always
Token ID1464
Feature activation+4.97e-01

Pos logit contributions
 sudden+0.026
 seas+0.025
 nowhere+0.024
 year+0.024
icer+0.024
Neg logit contributions
 give-0.025
 share-0.024
 listen-0.023
 leave-0.022
 ask-0.022
Token grateful
Token ID14066
Feature activation-4.89e-01

Pos logit contributions
 sudden+0.021
 year+0.020
 nowhere+0.020
 neighbourhood+0.020
icer+0.020
Neg logit contributions
 give-0.021
 share-0.020
 listen-0.019
 ask-0.018
 leave-0.018
Token.
Token ID13
Feature activation+1.33e+00

Pos logit contributions
 give+0.019
 share+0.018
 listen+0.018
 leave+0.017
 wait+0.017
Neg logit contributions
 nowhere-0.022
 sudden-0.022
 year-0.021
 neighbourhood-0.021
 woods-0.020
Token
Token ID198
Feature activation+4.96e+00

Pos logit contributions
 sudden+0.055
 nowhere+0.053
 seas+0.053
 year+0.052
in+0.050
Neg logit contributions
 give-0.056
 ask-0.055
 share-0.054
 I-0.053
 listen-0.051
Token
Token ID198
Feature activation+2.10e+00

Pos logit contributions
 seas+0.282
ymes+0.269
 pieces+0.265
pit+0.263
 sudden+0.260
Neg logit contributions
 ask-0.104
 But-0.096
 listen-0.095
 give-0.094
't-0.091
TokenOne
Token ID3198
Feature activation+9.84e-01

Pos logit contributions
 seas+0.101
awed+0.098
 nowhere+0.098
 doorway+0.096
 pieces+0.095
Neg logit contributions
 listen-0.067
 share-0.064
 give-0.061
 ask-0.059
 please-0.055
Token day
Token ID1110
Feature activation-1.02e+00

Pos logit contributions
 nowhere+0.032
 sudden+0.032
icer+0.031
awed+0.030
 seas+0.029
Neg logit contributions
 listen-0.055
 give-0.054
 wait-0.051
 share-0.050
 ask-0.049
Token,
Token ID11
Feature activation+4.95e-01

Pos logit contributions
 give+0.047
 share+0.044
 leave+0.043
 listen+0.042
 wait+0.042
Neg logit contributions
 nowhere-0.039
 sudden-0.039
 year-0.038
 neighbourhood-0.036
 woods-0.036
Token Lily
Token ID20037
Feature activation+8.14e-01

Pos logit contributions
 neighbourhood+0.025
ud+0.024
 year+0.024
 nowhere+0.024
 area+0.024
Neg logit contributions
 give-0.017
 share-0.016
 listen-0.015
 wait-0.014
 ask-0.014
Token fun
Token ID1257
Feature activation-1.93e-01

Pos logit contributions
 nowhere+0.015
icer+0.015
 year+0.015
 sudden+0.015
ud+0.014
Neg logit contributions
 give-0.020
 share-0.020
 listen-0.019
 leave-0.018
 ask-0.018
Token in
Token ID287
Feature activation+6.54e-01

Pos logit contributions
 give+0.009
 share+0.008
 listen+0.008
 leave+0.008
 ask+0.008
Neg logit contributions
 nowhere-0.007
 sudden-0.007
 year-0.007
 neighbourhood-0.007
 woods-0.007
Token her
Token ID607
Feature activation-1.07e-01

Pos logit contributions
 sudden+0.031
awed+0.029
actus+0.029
eh+0.029
in+0.029
Neg logit contributions
 give-0.027
 share-0.025
 listen-0.024
 leave-0.023
 ask-0.022
Token class
Token ID1398
Feature activation+5.93e-01

Pos logit contributions
 give+0.006
 share+0.006
 listen+0.006
 leave+0.006
 wait+0.005
Neg logit contributions
 nowhere-0.003
 sudden-0.003
 year-0.003
ud-0.003
icer-0.003
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
 sudden+0.025
 nowhere+0.024
icer+0.023
 snowball+0.022
 woods+0.022
Neg logit contributions
 give-0.026
 share-0.025
 listen-0.024
 wait-0.024
 leave-0.023
Token
Token ID198
Feature activation+4.96e+00

Pos logit contributions
 sudden+0.070
 nowhere+0.066
 seas+0.064
eh+0.061
in+0.060
Neg logit contributions
 give-0.064
 share-0.062
 ask-0.062
 listen-0.058
 I-0.057
Token
Token ID198
Feature activation+1.62e+00

Pos logit contributions
 seas+0.264
 sudden+0.252
pit+0.250
awed+0.250
 pieces+0.248
Neg logit contributions
 listen-0.135
 ask-0.135
 give-0.134
 share-0.128
 please-0.119
TokenOne
Token ID3198
Feature activation+8.25e-01

Pos logit contributions
 nowhere+0.074
 seas+0.072
 pieces+0.071
joy+0.070
awed+0.070
Neg logit contributions
 listen-0.062
 share-0.061
 give-0.059
 ask-0.056
 wait-0.053
Token day
Token ID1110
Feature activation-1.08e+00

Pos logit contributions
 nowhere+0.024
 sudden+0.024
icer+0.023
awed+0.022
 neighbourhood+0.021
Neg logit contributions
 listen-0.048
 give-0.048
 share-0.046
 wait-0.045
 ask-0.042
Token,
Token ID11
Feature activation+2.07e-01

Pos logit contributions
 give+0.050
 share+0.047
 leave+0.045
 listen+0.045
 wait+0.044
Neg logit contributions
 nowhere-0.041
 sudden-0.040
 year-0.040
 woods-0.037
 neighbourhood-0.037
Token the
Token ID262
Feature activation+3.44e-02

Pos logit contributions
 nowhere+0.009
 neighbourhood+0.009
ud+0.008
 sudden+0.008
icer+0.008
Neg logit contributions
 give-0.009
 share-0.009
 listen-0.008
 wait-0.008
 leave-0.008
Token you
Token ID345
Feature activation-1.41e+00

Pos logit contributions
 give+0.038
 share+0.034
 leave+0.033
 listen+0.033
 wait+0.032
Neg logit contributions
 nowhere-0.071
 year-0.071
 sudden-0.069
 neighbourhood-0.067
 woods-0.066
Token"
Token ID1
Feature activation-1.12e-02

Pos logit contributions
 give+0.060
 share+0.056
 listen+0.054
 leave+0.054
 wait+0.053
Neg logit contributions
 year-0.058
 nowhere-0.056
 sudden-0.054
 neighbourhood-0.053
 woods-0.052
Token a
Token ID257
Feature activation-1.99e-01

Pos logit contributions
 give+0.000
 share+0.000
 listen+0.000
 leave+0.000
 ask+0.000
Neg logit contributions
 sudden-0.000
 nowhere-0.000
 year-0.000
 neighbourhood-0.000
 woods-0.000
Token lot
Token ID1256
Feature activation-1.36e+00

Pos logit contributions
 give+0.011
 share+0.010
 listen+0.010
 leave+0.010
 wait+0.010
Neg logit contributions
 nowhere-0.006
 sudden-0.006
 year-0.006
 neighbourhood-0.005
 woods-0.005
Token.
Token ID13
Feature activation+7.94e-01

Pos logit contributions
 give+0.066
 share+0.063
 leave+0.061
 listen+0.061
 wait+0.060
Neg logit contributions
 year-0.048
 nowhere-0.047
 sudden-0.044
 neighbourhood-0.043
in-0.042
Token
Token ID198
Feature activation+4.95e+00

Pos logit contributions
 nowhere+0.037
 sudden+0.036
 year+0.034
ymes+0.033
 seas+0.033
Neg logit contributions
 give-0.032
 share-0.030
 ask-0.029
 listen-0.028
 leave-0.028
Token
Token ID198
Feature activation+1.85e+00

Pos logit contributions
 sudden+0.267
 seas+0.260
eh+0.258
 nowhere+0.257
awed+0.256
Neg logit contributions
 ask-0.129
 give-0.118
 share-0.112
 But-0.110
 pay-0.109
TokenOne
Token ID3198
Feature activation+1.50e-01

Pos logit contributions
 nowhere+0.082
awed+0.079
ymes+0.078
 sudden+0.077
actus+0.075
Neg logit contributions
 listen-0.065
 ask-0.064
 share-0.064
 give-0.062
 pay-0.057
Token day
Token ID1110
Feature activation-1.62e+00

Pos logit contributions
 nowhere+0.003
 sudden+0.003
 neighbourhood+0.003
 seas+0.003
awed+0.003
Neg logit contributions
 give-0.010
 listen-0.009
 share-0.009
 wait-0.009
 ask-0.009
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
 give+0.082
 share+0.077
 leave+0.075
 wait+0.073
 listen+0.073
Neg logit contributions
 sudden-0.053
 nowhere-0.051
 year-0.050
 woods-0.049
 seas-0.049
Token Sam
Token ID3409
Feature activation+5.42e-01

Pos logit contributions
 give+0.053
 share+0.050
 leave+0.049
 listen+0.048
 wait+0.047
Neg logit contributions
 nowhere-0.040
 sudden-0.039
 year-0.037
 seas-0.036
 woods-0.036
Token songs
Token ID7259
Feature activation-1.15e-01

Pos logit contributions
 nowhere+0.016
 sudden+0.016
 neighbourhood+0.015
icer+0.015
 year+0.015
Neg logit contributions
 give-0.018
 share-0.017
 listen-0.016
 leave-0.016
 wait-0.015
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
 give+0.005
 share+0.005
 listen+0.005
 leave+0.005
 wait+0.005
Neg logit contributions
 nowhere-0.004
 sudden-0.004
 year-0.004
 neighbourhood-0.004
 woods-0.004
Token play
Token ID711
Feature activation+8.02e-01

Pos logit contributions
 give+0.050
 share+0.046
 leave+0.045
 listen+0.044
 wait+0.044
Neg logit contributions
 sudden-0.061
 nowhere-0.060
 year-0.059
 neighbourhood-0.057
 woods-0.057
Token games
Token ID1830
Feature activation+7.74e-01

Pos logit contributions
 sudden+0.036
 nowhere+0.036
ud+0.034
 neighbourhood+0.033
icer+0.033
Neg logit contributions
 share-0.032
 give-0.032
 listen-0.030
 wait-0.027
 ask-0.027
Token.
Token ID13
Feature activation+8.44e-01

Pos logit contributions
 sudden+0.035
 nowhere+0.033
icer+0.032
 neighbourhood+0.031
 year+0.030
Neg logit contributions
 give-0.033
 share-0.032
 listen-0.030
 wait-0.029
 leave-0.029
Token
Token ID198
Feature activation+4.93e+00

Pos logit contributions
 nowhere+0.041
 sudden+0.041
 year+0.039
 seas+0.038
 neighbourhood+0.037
Neg logit contributions
 give-0.031
 share-0.029
 ask-0.028
 listen-0.027
 I-0.026
Token
Token ID198
Feature activation+2.12e+00

Pos logit contributions
 seas+0.265
led+0.261
awed+0.259
 pieces+0.252
ymes+0.245
Neg logit contributions
 ask-0.121
 please-0.111
 listen-0.101
 share-0.097
 But-0.093
TokenOne
Token ID3198
Feature activation+1.18e+00

Pos logit contributions
 nowhere+0.091
awed+0.088
 pieces+0.088
 seas+0.087
actus+0.086
Neg logit contributions
 listen-0.081
 share-0.079
 give-0.074
 please-0.074
 ask-0.074
Token day
Token ID1110
Feature activation-8.20e-01

Pos logit contributions
 nowhere+0.040
awed+0.040
 sudden+0.040
icer+0.039
 neighbourhood+0.035
Neg logit contributions
 listen-0.064
 give-0.063
 wait-0.060
 share-0.060
 ask-0.057
Token,
Token ID11
Feature activation+6.64e-01

Pos logit contributions
 give+0.035
 share+0.033
 listen+0.032
 leave+0.032
 wait+0.031
Neg logit contributions
 nowhere-0.034
 sudden-0.033
 year-0.033
 neighbourhood-0.031
 woods-0.031
Token they
Token ID484
Feature activation+2.70e-01

Pos logit contributions
 nowhere+0.030
 neighbourhood+0.029
ud+0.029
 year+0.029
icer+0.028
Neg logit contributions
 give-0.027
 share-0.025
 listen-0.024
 wait-0.024
 ask-0.022
Token together
Token ID1978
Feature activation+1.03e+00

Pos logit contributions
 sudden+0.076
 nowhere+0.075
 neighbourhood+0.074
icer+0.072
ud+0.071
Neg logit contributions
 share-0.054
 give-0.053
 listen-0.049
 wait-0.043
 please-0.043
Token all
Token ID477
Feature activation+1.46e+00

Pos logit contributions
 sudden+0.047
 nowhere+0.045
icer+0.043
 neighbourhood+0.043
 woods+0.042
Neg logit contributions
 give-0.045
 share-0.043
 listen-0.040
 leave-0.039
 wait-0.038
Token day
Token ID1110
Feature activation-2.89e-01

Pos logit contributions
 sudden+0.061
 nowhere+0.054
icer+0.053
smith+0.050
awed+0.050
Neg logit contributions
 share-0.071
 give-0.068
 listen-0.064
 leave-0.060
 ask-0.059
Token long
Token ID890
Feature activation+2.74e+00

Pos logit contributions
 give+0.014
 share+0.013
 listen+0.013
 leave+0.013
 wait+0.013
Neg logit contributions
 nowhere-0.010
 sudden-0.010
 year-0.010
 neighbourhood-0.010
 woods-0.009
Token.
Token ID13
Feature activation+1.56e+00

Pos logit contributions
icer+0.133
 sudden+0.132
actus+0.123
 nowhere+0.122
 neighbourhood+0.121
Neg logit contributions
 give-0.097
 share-0.091
 listen-0.082
 leave-0.081
 I-0.081
Token
Token ID198
Feature activation+4.92e+00

Pos logit contributions
 sudden+0.076
 nowhere+0.074
 seas+0.073
 year+0.071
in+0.071
Neg logit contributions
 give-0.055
 share-0.052
 I-0.050
 ask-0.049
 listen-0.048
Token
Token ID198
Feature activation+2.56e+00

Pos logit contributions
 seas+0.244
led+0.233
awed+0.231
pit+0.228
ymes+0.227
Neg logit contributions
 ask-0.131
 share-0.123
 please-0.120
 give-0.116
 listen-0.116
TokenThe
Token ID464
Feature activation+3.20e+00

Pos logit contributions
 seas+0.110
awed+0.106
actus+0.106
 pieces+0.106
 nowhere+0.105
Neg logit contributions
 share-0.097
 listen-0.095
 please-0.089
 give-0.088
 ask-0.085
Token moral
Token ID6573
Feature activation+8.11e-02

Pos logit contributions
ud+0.137
awed+0.136
rones+0.122
 nowhere+0.119
icer+0.115
Neg logit contributions
 give-0.153
 wait-0.146
 listen-0.142
 thank-0.135
 share-0.135
Token of
Token ID286
Feature activation+6.76e-01

Pos logit contributions
 nowhere+0.003
 sudden+0.003
icer+0.003
 neighbourhood+0.002
eh+0.002
Neg logit contributions
 give-0.004
 share-0.004
 listen-0.004
 leave-0.004
 ask-0.004
Token the
Token ID262
Feature activation+4.41e-01

Pos logit contributions
 sudden+0.034
 neighbourhood+0.032
ud+0.031
awed+0.031
 year+0.031
Neg logit contributions
 give-0.026
 share-0.024
 listen-0.023
 leave-0.021
 ask-0.021
Token explore
Token ID7301
Feature activation+1.28e+00

Pos logit contributions
 give+0.081
ourage+0.079
 Sorry+0.076
 leave+0.075
istence+0.071
Neg logit contributions
 sudden-0.112
 woods-0.110
 nowhere-0.108
<|endoftext|>-0.106
 forest-0.106
Token the
Token ID262
Feature activation+2.63e+00

Pos logit contributions
 sudden+0.062
 nowhere+0.059
icer+0.059
ud+0.058
awed+0.056
Neg logit contributions
 give-0.046
 share-0.046
 listen-0.045
 ask-0.041
 leave-0.039
Token big
Token ID1263
Feature activation+1.14e+00

Pos logit contributions
 nowhere+0.092
awed+0.090
ud+0.086
eled+0.083
 sudden+0.079
Neg logit contributions
 listen-0.129
 ask-0.127
 share-0.125
 give-0.124
 follow-0.117
Token fields
Token ID7032
Feature activation+6.60e-01

Pos logit contributions
 sudden+0.033
 nowhere+0.032
awed+0.027
ud+0.027
 year+0.026
Neg logit contributions
 give-0.066
 share-0.063
 leave-0.062
 listen-0.061
 ask-0.061
Token.
Token ID13
Feature activation+9.08e-01

Pos logit contributions
 sudden+0.030
 nowhere+0.028
icer+0.026
 neighbourhood+0.026
ud+0.026
Neg logit contributions
 give-0.028
 share-0.027
 listen-0.026
 leave-0.025
 wait-0.025
Token
Token ID198
Feature activation+4.92e+00

Pos logit contributions
 sudden+0.053
 nowhere+0.052
 year+0.051
in+0.050
ymes+0.050
Neg logit contributions
 give-0.024
 ask-0.023
 share-0.022
 listen-0.022
 leave-0.020
Token
Token ID198
Feature activation+1.73e+00

Pos logit contributions
 seas+0.273
ymes+0.268
awed+0.266
 pieces+0.261
led+0.259
Neg logit contributions
 ask-0.118
 listen-0.102
 please-0.094
 give-0.093
 But-0.089
TokenOne
Token ID3198
Feature activation+6.66e-01

Pos logit contributions
 nowhere+0.075
 pieces+0.074
awed+0.074
 seas+0.073
 sudden+0.071
Neg logit contributions
 listen-0.067
 share-0.063
 ask-0.062
 give-0.061
 please-0.059
Token day
Token ID1110
Feature activation-6.97e-01

Pos logit contributions
 nowhere+0.019
 sudden+0.019
icer+0.018
awed+0.017
ud+0.017
Neg logit contributions
 listen-0.038
 give-0.038
 wait-0.036
 share-0.036
 ask-0.035
Token,
Token ID11
Feature activation+3.94e-01

Pos logit contributions
 give+0.030
 share+0.028
 listen+0.027
 leave+0.027
 wait+0.026
Neg logit contributions
 nowhere-0.029
 sudden-0.029
 year-0.028
 neighbourhood-0.027
 woods-0.027
Token Lily
Token ID20037
Feature activation+8.85e-01

Pos logit contributions
 neighbourhood+0.019
 nowhere+0.019
 year+0.019
ud+0.019
 area+0.018
Neg logit contributions
 give-0.014
 share-0.013
 listen-0.013
 wait-0.012
 ask-0.011
Token on
Token ID319
Feature activation+1.12e+00

Pos logit contributions
 give+0.055
 share+0.052
 listen+0.050
 leave+0.049
 wait+0.048
Neg logit contributions
 nowhere-0.043
 year-0.042
 sudden-0.041
 woods-0.040
 neighbourhood-0.040
Token adventures
Token ID17545
Feature activation+1.59e-01

Pos logit contributions
 sudden+0.050
in+0.047
 neighbourhood+0.047
awed+0.047
 nowhere+0.047
Neg logit contributions
 give-0.044
 share-0.040
 listen-0.040
 leave-0.039
 wait-0.038
Token every
Token ID790
Feature activation+1.62e+00

Pos logit contributions
 sudden+0.006
 nowhere+0.006
 year+0.006
 neighbourhood+0.006
icer+0.006
Neg logit contributions
 give-0.007
 share-0.007
 listen-0.007
 leave-0.006
 wait-0.006
Token day
Token ID1110
Feature activation-1.31e-01

Pos logit contributions
icer+0.044
awed+0.044
ymes+0.041
 sudden+0.040
ud+0.040
Neg logit contributions
 give-0.095
 leave-0.090
 listen-0.088
 wait-0.087
 share-0.086
Token.
Token ID13
Feature activation+1.32e+00

Pos logit contributions
 give+0.006
 share+0.006
 listen+0.006
 leave+0.006
 wait+0.005
Neg logit contributions
 nowhere-0.005
 sudden-0.005
 year-0.005
 neighbourhood-0.005
icer-0.004
Token
Token ID198
Feature activation+4.91e+00

Pos logit contributions
 nowhere+0.060
 sudden+0.060
 seas+0.057
 pieces+0.056
 year+0.056
Neg logit contributions
 give-0.052
 share-0.048
 ask-0.047
 I-0.046
 listen-0.046
Token
Token ID198
Feature activation+2.46e+00

Pos logit contributions
awed+0.215
led+0.212
 seas+0.207
ymes+0.203
 pieces+0.200
Neg logit contributions
 ask-0.168
 please-0.156
 listen-0.156
 give-0.153
 share-0.152
TokenOne
Token ID3198
Feature activation+1.23e+00

Pos logit contributions
awed+0.102
 nowhere+0.101
 seas+0.101
 pieces+0.100
actus+0.098
Neg logit contributions
 listen-0.096
 share-0.095
 give-0.091
 please-0.089
 ask-0.089
Token day
Token ID1110
Feature activation-2.84e-01

Pos logit contributions
 nowhere+0.041
awed+0.040
 sudden+0.039
icer+0.038
ud+0.036
Neg logit contributions
 listen-0.066
 give-0.066
 wait-0.063
 share-0.061
 ask-0.060
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
 give+0.011
 share+0.011
 listen+0.010
 leave+0.010
 wait+0.010
Neg logit contributions
 nowhere-0.013
 sudden-0.012
 year-0.012
 neighbourhood-0.012
 woods-0.012
Token Bob
Token ID5811
Feature activation+1.77e+00

Pos logit contributions
 nowhere+0.048
ud+0.046
 neighbourhood+0.046
 year+0.045
 sudden+0.045
Neg logit contributions
 give-0.040
 share-0.037
 listen-0.036
 wait-0.035
 ask-0.034
Token awake
Token ID21693
Feature activation-5.63e-01

Pos logit contributions
 give+0.030
 share+0.028
 listen+0.027
 leave+0.027
 wait+0.027
Neg logit contributions
 nowhere-0.023
 sudden-0.023
 year-0.022
 neighbourhood-0.021
 woods-0.021
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
 give+0.025
 share+0.024
 listen+0.023
 leave+0.023
 wait+0.022
Neg logit contributions
 nowhere-0.022
 sudden-0.022
 year-0.021
 neighbourhood-0.021
 woods-0.020
Token never
Token ID1239
Feature activation-1.13e+00

Pos logit contributions
 give+0.052
 share+0.049
 listen+0.048
 leave+0.047
 wait+0.046
Neg logit contributions
 nowhere-0.042
 sudden-0.040
 year-0.040
 neighbourhood-0.038
 woods-0.038
Token tired
Token ID10032
Feature activation-1.26e+00

Pos logit contributions
 give+0.054
 share+0.052
 listen+0.050
 leave+0.049
 wait+0.049
Neg logit contributions
 nowhere-0.041
 sudden-0.038
 year-0.038
 neighbourhood-0.036
 woods-0.036
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 give+0.057
 share+0.055
 listen+0.053
 ask+0.051
 wait+0.051
Neg logit contributions
 nowhere-0.048
 year-0.048
 sudden-0.045
 woods-0.044
 neighbourhood-0.043
Token
Token ID198
Feature activation+4.88e+00

Pos logit contributions
 sudden+0.048
ymes+0.044
 nowhere+0.044
 seas+0.043
awed+0.042
Neg logit contributions
 give-0.050
 share-0.047
 ask-0.047
 I-0.044
 listen-0.044
Token
Token ID198
Feature activation+1.87e+00

Pos logit contributions
 seas+0.269
pit+0.269
ymes+0.268
awed+0.260
 pieces+0.254
Neg logit contributions
 ask-0.111
 give-0.109
 But-0.099
 listen-0.099
 share-0.098
TokenOne
Token ID3198
Feature activation+1.05e+00

Pos logit contributions
awed+0.092
 pieces+0.086
ymes+0.086
 seas+0.085
joy+0.085
Neg logit contributions
 listen-0.059
 share-0.056
 give-0.055
 ask-0.052
 wait-0.050
Token day
Token ID1110
Feature activation-1.40e+00

Pos logit contributions
 sudden+0.032
awed+0.031
 nowhere+0.030
icer+0.029
 neighbourhood+0.029
Neg logit contributions
 give-0.060
 listen-0.059
 wait-0.056
 share-0.056
 leave-0.054
Token,
Token ID11
Feature activation-1.96e-01

Pos logit contributions
 give+0.068
 share+0.065
 leave+0.063
 listen+0.062
 wait+0.062
Neg logit contributions
 sudden-0.048
 nowhere-0.048
 year-0.048
 woods-0.045
 seas-0.044
Token the
Token ID262
Feature activation-6.77e-01

Pos logit contributions
 give+0.009
 share+0.008
 listen+0.008
 wait+0.008
 leave+0.008
Neg logit contributions
 nowhere-0.008
 sudden-0.007
 year-0.007
 neighbourhood-0.007
ud-0.007
Token but
Token ID475
Feature activation-1.00e+00

Pos logit contributions
 give+0.052
 share+0.050
 listen+0.047
 leave+0.047
 wait+0.046
Neg logit contributions
 nowhere-0.054
 sudden-0.052
 year-0.051
 neighbourhood-0.047
 seas-0.047
Token he
Token ID339
Feature activation+6.14e-01

Pos logit contributions
 give+0.042
 share+0.039
 listen+0.038
 leave+0.037
 wait+0.036
Neg logit contributions
 nowhere-0.044
 sudden-0.041
 year-0.041
 woods-0.039
 neighbourhood-0.039
Token never
Token ID1239
Feature activation-1.54e+00

Pos logit contributions
 sudden+0.028
 nowhere+0.027
 seas+0.027
 neighbourhood+0.026
 woods+0.026
Neg logit contributions
 give-0.025
 share-0.023
 listen-0.023
 wait-0.022
 ask-0.022
Token listened
Token ID16399
Feature activation-6.53e-02

Pos logit contributions
 give+0.067
 share+0.065
 Sorry+0.062
 listen+0.061
 leave+0.061
Neg logit contributions
 nowhere-0.057
 year-0.054
 sudden-0.052
 neighbourhood-0.050
 neighborhood-0.047
Token.
Token ID13
Feature activation+1.34e+00

Pos logit contributions
 give+0.003
 share+0.003
 listen+0.002
 leave+0.002
 wait+0.002
Neg logit contributions
 nowhere-0.003
 sudden-0.003
 neighbourhood-0.003
 woods-0.003
 year-0.003
Token
Token ID198
Feature activation+4.87e+00

Pos logit contributions
 sudden+0.055
 nowhere+0.054
ymes+0.052
 seas+0.052
in+0.051
Neg logit contributions
 give-0.055
 ask-0.054
 share-0.052
 I-0.051
 please-0.050
Token
Token ID198
Feature activation+1.73e+00

Pos logit contributions
 seas+0.273
ymes+0.270
 sudden+0.257
 pieces+0.257
pit+0.254
Neg logit contributions
 ask-0.109
 give-0.104
 please-0.097
 But-0.095
 listen-0.095
TokenOne
Token ID3198
Feature activation+8.61e-01

Pos logit contributions
 seas+0.077
 nowhere+0.074
 pieces+0.074
ymes+0.074
 doorway+0.074
Neg logit contributions
 listen-0.061
 share-0.059
 give-0.058
 ask-0.055
 please-0.055
Token day
Token ID1110
Feature activation-1.65e+00

Pos logit contributions
 nowhere+0.026
awed+0.025
 sudden+0.025
icer+0.024
 neighbourhood+0.024
Neg logit contributions
 listen-0.049
 give-0.049
 wait-0.046
 share-0.045
 ask-0.044
Token,
Token ID11
Feature activation-5.11e-01

Pos logit contributions
 give+0.085
 share+0.081
 leave+0.079
 wait+0.078
 listen+0.077
Neg logit contributions
 sudden-0.054
 nowhere-0.052
 year-0.051
<|endoftext|>-0.046
 woods-0.046
Token Tim
Token ID5045
Feature activation+1.93e+00

Pos logit contributions
 give+0.021
 share+0.019
 listen+0.018
 leave+0.018
 wait+0.018
Neg logit contributions
 nowhere-0.023
 sudden-0.022
 year-0.022
 neighbourhood-0.022
 woods-0.021
Token play
Token ID711
Feature activation+1.89e+00

Pos logit contributions
 give+0.019
 share+0.018
 listen+0.017
 leave+0.017
 wait+0.016
Neg logit contributions
 nowhere-0.018
 sudden-0.017
 year-0.017
 neighbourhood-0.016
 woods-0.016
Token together
Token ID1978
Feature activation+1.51e+00

Pos logit contributions
 sudden+0.093
 nowhere+0.090
ud+0.089
icer+0.087
actus+0.086
Neg logit contributions
 share-0.072
 give-0.067
 listen-0.064
 wait-0.059
 please-0.057
Token every
Token ID790
Feature activation+2.35e+00

Pos logit contributions
 sudden+0.071
 nowhere+0.069
icer+0.066
 neighbourhood+0.063
 woods+0.063
Neg logit contributions
 give-0.061
 share-0.060
 listen-0.055
 wait-0.054
 leave-0.053
Token day
Token ID1110
Feature activation+7.34e-01

Pos logit contributions
awed+0.083
icer+0.082
actus+0.077
aws+0.074
ymes+0.071
Neg logit contributions
 give-0.123
 wait-0.118
 share-0.115
 listen-0.115
 leave-0.114
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
 nowhere+0.030
 sudden+0.029
icer+0.028
 year+0.028
 neighbourhood+0.028
Neg logit contributions
 give-0.032
 share-0.030
 listen-0.029
 wait-0.028
 leave-0.028
Token
Token ID198
Feature activation+4.86e+00

Pos logit contributions
 sudden+0.075
 nowhere+0.073
 seas+0.071
ymes+0.068
 year+0.068
Neg logit contributions
 give-0.058
 share-0.055
 ask-0.053
 I-0.053
 listen-0.051
Token
Token ID198
Feature activation+2.56e+00

Pos logit contributions
 seas+0.211
awed+0.210
led+0.203
ymes+0.196
 pieces+0.195
Neg logit contributions
 share-0.163
 ask-0.161
 listen-0.160
 give-0.153
 learn-0.151
TokenOne
Token ID3198
Feature activation+1.27e+00

Pos logit contributions
awed+0.110
 seas+0.108
actus+0.107
 nowhere+0.106
 pieces+0.104
Neg logit contributions
 share-0.100
 listen-0.098
 give-0.091
 please-0.088
 ask-0.086
Token day
Token ID1110
Feature activation-3.86e-01

Pos logit contributions
 nowhere+0.043
 sudden+0.042
awed+0.042
icer+0.041
ud+0.038
Neg logit contributions
 listen-0.069
 give-0.068
 wait-0.065
 share-0.065
 ask-0.060
Token,
Token ID11
Feature activation+9.64e-01

Pos logit contributions
 give+0.016
 share+0.015
 listen+0.014
 leave+0.014
 wait+0.013
Neg logit contributions
 nowhere-0.017
 sudden-0.017
 year-0.016
 neighbourhood-0.016
 woods-0.016
Token Tim
Token ID5045
Feature activation+1.92e+00

Pos logit contributions
 nowhere+0.044
ud+0.044
icer+0.043
 year+0.043
 neighbourhood+0.043
Neg logit contributions
 give-0.037
 share-0.035
 listen-0.034
 wait-0.033
 ask-0.031
Token friends
Token ID2460
Feature activation+1.34e+00

Pos logit contributions
ud+0.145
 year+0.129
eled+0.128
Ah+0.126
Jamie+0.123
Neg logit contributions
 give-0.124
 share-0.121
 leave-0.107
 I-0.106
 ask-0.105
Token all
Token ID477
Feature activation+9.07e-01

Pos logit contributions
 sudden+0.063
 nowhere+0.057
 year+0.057
icer+0.057
ud+0.056
Neg logit contributions
 give-0.055
 share-0.052
 leave-0.048
 listen-0.048
 ask-0.047
Token day
Token ID1110
Feature activation-1.33e-01

Pos logit contributions
 sudden+0.034
 nowhere+0.032
icer+0.030
 seas+0.029
ud+0.028
Neg logit contributions
 give-0.046
 share-0.046
 listen-0.043
 ask-0.042
 leave-0.042
Token long
Token ID890
Feature activation+2.24e+00

Pos logit contributions
 give+0.006
 share+0.006
 listen+0.006
 leave+0.006
 wait+0.006
Neg logit contributions
 nowhere-0.005
 sudden-0.005
 year-0.005
 neighbourhood-0.004
 woods-0.004
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
icer+0.110
 sudden+0.108
 nowhere+0.104
in+0.100
 neighbourhood+0.099
Neg logit contributions
 give-0.082
 share-0.077
 leave-0.070
 listen-0.070
 wait-0.066
Token
Token ID198
Feature activation+4.85e+00

Pos logit contributions
 sudden+0.057
 nowhere+0.057
 seas+0.054
in+0.054
 pieces+0.054
Neg logit contributions
 give-0.041
 share-0.038
 ask-0.037
 I-0.036
 listen-0.035
Token
Token ID198
Feature activation+2.11e+00

Pos logit contributions
 seas+0.231
ymes+0.226
awed+0.223
led+0.221
 pieces+0.221
Neg logit contributions
 ask-0.142
 share-0.127
 give-0.127
 please-0.124
 listen-0.122
TokenOne
Token ID3198
Feature activation+1.02e+00

Pos logit contributions
 pieces+0.086
 seas+0.086
 nowhere+0.085
awed+0.083
actus+0.081
Neg logit contributions
 share-0.084
 listen-0.083
 give-0.079
 please-0.077
 wait-0.076
Token day
Token ID1110
Feature activation-6.70e-01

Pos logit contributions
 nowhere+0.033
awed+0.032
 sudden+0.032
icer+0.032
ud+0.030
Neg logit contributions
 give-0.055
 listen-0.055
 wait-0.052
 share-0.052
 ask-0.048
Token,
Token ID11
Feature activation+5.68e-01

Pos logit contributions
 give+0.028
 share+0.026
 listen+0.026
 leave+0.025
 wait+0.025
Neg logit contributions
 nowhere-0.029
 sudden-0.028
 year-0.028
 neighbourhood-0.027
 woods-0.026
Token the
Token ID262
Feature activation+1.91e+00

Pos logit contributions
 nowhere+0.026
ud+0.025
 year+0.025
 neighbourhood+0.025
 sudden+0.025
Neg logit contributions
 give-0.022
 share-0.020
 listen-0.020
 wait-0.019
 ask-0.018
Token someone
Token ID2130
Feature activation+1.17e-02

Pos logit contributions
 give+0.043
 share+0.041
 listen+0.040
 leave+0.039
 wait+0.039
Neg logit contributions
 nowhere-0.029
 sudden-0.028
 year-0.028
 neighbourhood-0.026
 woods-0.026
Token to
Token ID284
Feature activation-2.46e+00

Pos logit contributions
 nowhere+0.001
 sudden+0.001
 year+0.001
icer+0.001
 neighbourhood+0.001
Neg logit contributions
 give-0.000
 share-0.000
 listen-0.000
 leave-0.000
 ask-0.000
Token help
Token ID1037
Feature activation-1.21e+00

Pos logit contributions
 Sorry+0.057
ourage+0.052
 give+0.051
 share+0.046
istence+0.046
Neg logit contributions
 year-0.134
 nowhere-0.125
 neighbourhood-0.125
<|endoftext|>-0.124
 woods-0.123
Token him
Token ID683
Feature activation-1.80e+00

Pos logit contributions
 give+0.056
 share+0.053
 listen+0.052
 leave+0.051
 wait+0.050
Neg logit contributions
 nowhere-0.043
 year-0.043
 sudden-0.042
 neighbourhood-0.040
 woods-0.039
Token.
Token ID13
Feature activation+7.55e-01

Pos logit contributions
 give+0.083
 listen+0.079
 share+0.079
 Sorry+0.078
 wait+0.077
Neg logit contributions
 year-0.059
 nowhere-0.057
 neighbourhood-0.056
 sudden-0.056
 area-0.056
Token
Token ID198
Feature activation+4.85e+00

Pos logit contributions
 sudden+0.029
 nowhere+0.029
ymes+0.027
 year+0.026
 seas+0.026
Neg logit contributions
 give-0.035
 ask-0.033
 share-0.033
 listen-0.032
 I-0.030
Token
Token ID198
Feature activation+1.39e+00

Pos logit contributions
 seas+0.266
awed+0.261
ymes+0.259
pit+0.257
 pieces+0.251
Neg logit contributions
 ask-0.116
 give-0.113
 listen-0.112
 please-0.104
 share-0.101
TokenOne
Token ID3198
Feature activation+7.73e-01

Pos logit contributions
awed+0.062
 nowhere+0.060
 seas+0.059
actus+0.058
 pieces+0.057
Neg logit contributions
 listen-0.053
 share-0.050
 give-0.049
 ask-0.046
 wait-0.045
Token day
Token ID1110
Feature activation-1.39e+00

Pos logit contributions
 nowhere+0.022
 sudden+0.022
awed+0.021
icer+0.021
 neighbourhood+0.020
Neg logit contributions
 listen-0.045
 give-0.045
 wait-0.043
 share-0.042
 ask-0.040
Token,
Token ID11
Feature activation-2.28e-01

Pos logit contributions
 give+0.068
 share+0.065
 leave+0.063
 listen+0.062
 wait+0.061
Neg logit contributions
 sudden-0.049
 nowhere-0.048
 year-0.048
 woods-0.044
 seas-0.042
Token a
Token ID257
Feature activation+1.01e+00

Pos logit contributions
 give+0.010
 share+0.010
 listen+0.010
 wait+0.009
 leave+0.009
Neg logit contributions
 nowhere-0.009
 sudden-0.009
 neighbourhood-0.009
 year-0.009
 woods-0.008
Token have
Token ID423
Feature activation-2.37e-01

Pos logit contributions
 give+0.095
 listen+0.091
 leave+0.091
 share+0.088
 ask+0.088
Neg logit contributions
 nowhere-0.087
 year-0.081
 sudden-0.080
 pieces-0.079
 was-0.077
Token a
Token ID257
Feature activation+5.00e-02

Pos logit contributions
 give+0.011
 share+0.011
 listen+0.010
 leave+0.010
 wait+0.010
Neg logit contributions
 nowhere-0.009
 sudden-0.009
 year-0.009
 neighbourhood-0.008
 woods-0.008
Token lot
Token ID1256
Feature activation-1.29e+00

Pos logit contributions
 nowhere+0.002
 sudden+0.002
 year+0.002
 neighbourhood+0.001
 woods+0.001
Neg logit contributions
 give-0.003
 share-0.002
 listen-0.002
 wait-0.002
 leave-0.002
Token of
Token ID286
Feature activation-1.02e+00

Pos logit contributions
 give+0.078
 share+0.076
 leave+0.074
 listen+0.074
 ask+0.073
Neg logit contributions
 year-0.029
 nowhere-0.028
 pieces-0.025
 sudden-0.025
in-0.024
Token buttons
Token ID12163
Feature activation-2.62e+00

Pos logit contributions
 give+0.050
 share+0.048
 leave+0.047
 listen+0.046
 wait+0.045
Neg logit contributions
 year-0.035
 sudden-0.035
 nowhere-0.035
 neighbourhood-0.033
 woods-0.032
Token.
Token ID13
Feature activation-5.71e+00

Pos logit contributions
 give+0.140
 share+0.138
 ask+0.132
 listen+0.132
 Sorry+0.129
Neg logit contributions
 nowhere-0.065
 year-0.065
 pieces-0.062
in-0.059
 area-0.056
Token What
Token ID1867
Feature activation-8.38e-01

Pos logit contributions
 give+0.235
 leave+0.233
 lose+0.230
 wait+0.229
 listen+0.227
Neg logit contributions
 woods-0.168
-0.162
 neighbourhood-0.155
 sky-0.154
 world-0.153
Token are
Token ID389
Feature activation-3.09e+00

Pos logit contributions
 give+0.033
 share+0.031
 leave+0.030
 listen+0.030
 wait+0.029
Neg logit contributions
 nowhere-0.037
 year-0.036
 sudden-0.035
 neighbourhood-0.034
 woods-0.034
Token you
Token ID345
Feature activation-1.98e+00

Pos logit contributions
 give+0.121
 leave+0.118
istent+0.107
istence+0.106
 listen+0.106
Neg logit contributions
 seas-0.128
 nowhere-0.124
 pieces-0.113
 year-0.112
 sudden-0.109
Token doing
Token ID1804
Feature activation-2.40e+00

Pos logit contributions
 give+0.095
 leave+0.091
 share+0.089
 Sorry+0.087
 listen+0.086
Neg logit contributions
 nowhere-0.071
 year-0.063
 neighbourhood-0.063
 sudden-0.062
 time-0.060
Token?"
Token ID1701
Feature activation+2.74e-01

Pos logit contributions
 give+0.112
 leave+0.106
 wait+0.103
 share+0.103
 listen+0.101
Neg logit contributions
 nowhere-0.090
 year-0.082
 neighbourhood-0.079
 world-0.078
 area-0.077
Token can
Token ID460
Feature activation-4.21e+00

Pos logit contributions
 Sorry+0.097
 give+0.094
 leave+0.091
 listen+0.091
 share+0.090
Neg logit contributions
 year-0.097
 time-0.091
 pieces-0.090
icer-0.087
<|endoftext|>-0.086
Token make
Token ID787
Feature activation-1.38e+00

Pos logit contributions
osity+0.140
 Sorry+0.136
beh+0.136
severe+0.131
istence+0.129
Neg logit contributions
 year-0.171
 was-0.162
icer-0.160
in-0.157
<|endoftext|>-0.156
Token your
Token ID534
Feature activation+3.40e-01

Pos logit contributions
 give+0.074
 share+0.070
 leave+0.069
 listen+0.069
 wait+0.066
Neg logit contributions
 sudden-0.041
 nowhere-0.040
 year-0.039
 pieces-0.036
 seas-0.036
Token own
Token ID898
Feature activation+6.37e-02

Pos logit contributions
 nowhere+0.013
 sudden+0.012
 year+0.012
ud+0.011
 neighbourhood+0.011
Neg logit contributions
 give-0.017
 share-0.016
 wait-0.016
 listen-0.016
 leave-0.016
Token bread
Token ID8509
Feature activation-1.83e+00

Pos logit contributions
 nowhere+0.002
 sudden+0.002
 year+0.002
 neighbourhood+0.002
 woods+0.002
Neg logit contributions
 give-0.003
 share-0.003
 listen-0.003
 leave-0.003
 wait-0.003
Token.
Token ID13
Feature activation-5.43e+00

Pos logit contributions
 give+0.086
 listen+0.080
 share+0.079
 leave+0.079
 wait+0.077
Neg logit contributions
 year-0.060
 nowhere-0.059
 pieces-0.057
 area-0.056
in-0.056
Token Just
Token ID2329
Feature activation-3.49e+00

Pos logit contributions
 give+0.213
 listen+0.211
 lose+0.210
 leave+0.205
 share+0.203
Neg logit contributions
icer-0.165
 snowball-0.154
 woods-0.152
<|endoftext|>-0.145
 world-0.145
Token add
Token ID751
Feature activation-1.15e+00

Pos logit contributions
 Sorry+0.095
 give+0.083
istence+0.080
 share+0.080
ourage+0.079
Neg logit contributions
 time-0.166
 sudden-0.158
 nowhere-0.157
 year-0.157
<|endoftext|>-0.153
Token some
Token ID617
Feature activation-7.49e-01

Pos logit contributions
 give+0.055
 share+0.052
 leave+0.051
 listen+0.051
 wait+0.049
Neg logit contributions
 nowhere-0.041
 sudden-0.040
 year-0.039
 seas-0.037
 neighbourhood-0.037
Token water
Token ID1660
Feature activation-2.44e+00

Pos logit contributions
 give+0.039
 share+0.037
 listen+0.036
 leave+0.036
 wait+0.035
Neg logit contributions
 nowhere-0.024
 sudden-0.023
 year-0.023
 neighbourhood-0.021
 woods-0.021
Token,
Token ID11
Feature activation-2.55e+00

Pos logit contributions
 give+0.111
 share+0.106
 listen+0.103
 leave+0.102
 ask+0.100
Neg logit contributions
 year-0.080
 nowhere-0.078
 pieces-0.078
in-0.077
 area-0.074
Token both
Token ID1111
Feature activation-1.86e+00

Pos logit contributions
 Sorry+0.119
 leave+0.117
 give+0.114
istence+0.113
osity+0.112
Neg logit contributions
 year-0.111
in-0.101
 was-0.096
 time-0.096
<|endoftext|>-0.095
Token add
Token ID751
Feature activation-1.11e-01

Pos logit contributions
 give+0.059
 listen+0.054
 leave+0.054
 share+0.052
 Sorry+0.049
Neg logit contributions
 year-0.090
 sudden-0.086
 nowhere-0.082
 neighbourhood-0.082
in-0.082
Token some
Token ID617
Feature activation+2.57e-01

Pos logit contributions
 give+0.005
 share+0.005
 listen+0.005
 leave+0.004
 wait+0.004
Neg logit contributions
 nowhere-0.005
 sudden-0.004
 year-0.004
 neighbourhood-0.004
 woods-0.004
Token chocolate
Token ID11311
Feature activation-1.11e+00

Pos logit contributions
 nowhere+0.010
 sudden+0.009
 neighbourhood+0.009
 year+0.009
 woods+0.008
Neg logit contributions
 give-0.013
 share-0.012
 listen-0.012
 leave-0.011
 ask-0.011
Token chips
Token ID12014
Feature activation-9.52e-01

Pos logit contributions
 give+0.052
 share+0.049
 listen+0.047
 leave+0.047
 ask+0.046
Neg logit contributions
 nowhere-0.041
 year-0.040
 sudden-0.040
 neighbourhood-0.037
 woods-0.037
Token.
Token ID13
Feature activation-5.39e+00

Pos logit contributions
 give+0.041
 share+0.038
 listen+0.037
 leave+0.037
 wait+0.036
Neg logit contributions
 nowhere-0.039
 year-0.038
 sudden-0.038
 neighbourhood-0.036
 woods-0.035
Token But
Token ID887
Feature activation-3.34e+00

Pos logit contributions
 lose+0.232
 leave+0.226
 give+0.222
 listen+0.209
 happen+0.206
Neg logit contributions
icer-0.156
 snowball-0.154
-0.147
-0.143
<|endoftext|>-0.142
Token not
Token ID407
Feature activation-6.16e-01

Pos logit contributions
 give+0.107
istence+0.104
 leave+0.102
 share+0.101
 Sorry+0.099
Neg logit contributions
 nowhere-0.136
 year-0.127
 time-0.126
 seas-0.126
 was-0.125
Token too
Token ID1165
Feature activation+2.55e-02

Pos logit contributions
 give+0.026
 share+0.025
 listen+0.024
 leave+0.023
 wait+0.023
Neg logit contributions
 nowhere-0.026
 sudden-0.025
 year-0.025
 neighbourhood-0.024
 woods-0.024
Token many
Token ID867
Feature activation-2.12e-01

Pos logit contributions
 nowhere+0.001
 sudden+0.001
 neighbourhood+0.001
 woods+0.001
 year+0.001
Neg logit contributions
 give-0.001
 share-0.001
 listen-0.001
 leave-0.001
 ask-0.001
Token,
Token ID11
Feature activation-1.40e+00

Pos logit contributions
 give+0.010
 share+0.010
 listen+0.009
 leave+0.009
 wait+0.009
Neg logit contributions
 nowhere-0.008
 sudden-0.008
 year-0.008
 neighbourhood-0.008
 woods-0.007
Token.
Token ID13
Feature activation-4.55e+00

Pos logit contributions
 give+0.033
 share+0.031
 listen+0.030
 leave+0.029
 wait+0.029
Neg logit contributions
 nowhere-0.028
 sudden-0.027
 year-0.027
 neighbourhood-0.026
 woods-0.025
Token Here
Token ID3423
Feature activation-1.36e+00

Pos logit contributions
 lose+0.159
 give+0.159
 leave+0.157
 share+0.150
 listen+0.145
Neg logit contributions
<|endoftext|>-0.168
 year-0.162
 neighbourhood-0.160
 world-0.157
icer-0.157
Token is
Token ID318
Feature activation-1.92e+00

Pos logit contributions
 give+0.053
 share+0.050
 listen+0.048
 leave+0.048
 wait+0.046
Neg logit contributions
 nowhere-0.058
 year-0.057
 sudden-0.056
 neighbourhood-0.054
 pieces-0.053
Token a
Token ID257
Feature activation-2.37e-01

Pos logit contributions
 give+0.087
 share+0.082
 leave+0.082
 listen+0.081
 wait+0.077
Neg logit contributions
 nowhere-0.067
 year-0.066
 sudden-0.065
 pieces-0.060
 seas-0.060
Token teaspoon
Token ID22326
Feature activation-1.76e+00

Pos logit contributions
 give+0.012
 share+0.012
 listen+0.011
 wait+0.011
 leave+0.011
Neg logit contributions
 nowhere-0.008
 sudden-0.008
 year-0.008
 neighbourhood-0.007
 woods-0.007
Token.
Token ID13
Feature activation-5.36e+00

Pos logit contributions
 give+0.081
 share+0.077
 listen+0.076
 leave+0.075
 Sorry+0.073
Neg logit contributions
 year-0.064
 nowhere-0.062
in-0.058
 pieces-0.057
 area-0.057
Token Be
Token ID1355
Feature activation-4.84e-01

Pos logit contributions
 lose+0.195
 leave+0.187
 disappoint+0.184
 give+0.179
 listen+0.178
Neg logit contributions
<|endoftext|>-0.178
icer-0.170
 year-0.164
-0.164
 sky-0.163
Token careful
Token ID8161
Feature activation-1.46e+00

Pos logit contributions
 give+0.022
 share+0.020
 listen+0.020
 leave+0.019
 wait+0.019
Neg logit contributions
 nowhere-0.020
 sudden-0.019
 year-0.019
 neighbourhood-0.018
 woods-0.018
Token not
Token ID407
Feature activation-2.10e+00

Pos logit contributions
 give+0.066
 share+0.062
 leave+0.060
 listen+0.059
 wait+0.058
Neg logit contributions
 year-0.055
 nowhere-0.055
 sudden-0.051
 neighbourhood-0.049
 pieces-0.049
Token to
Token ID284
Feature activation-2.59e+00

Pos logit contributions
 give+0.095
 share+0.091
 Sorry+0.091
 leave+0.088
 listen+0.087
Neg logit contributions
 nowhere-0.070
 year-0.068
 sudden-0.066
 pieces-0.066
 woods-0.061
Token spill
Token ID19431
Feature activation-1.66e+00

Pos logit contributions
 Sorry+0.078
 give+0.077
istence+0.077
ourage+0.075
 listen+0.073
Neg logit contributions
 year-0.114
 sudden-0.113
 time-0.108
 nowhere-0.108
in-0.103
Token use
Token ID779
Feature activation-7.65e-01

Pos logit contributions
 Sorry+0.135
 leave+0.131
 give+0.131
severe+0.124
 listen+0.123
Neg logit contributions
 was-0.116
 year-0.111
 neighbourhood-0.106
<|endoftext|>-0.104
 world-0.102
Token some
Token ID617
Feature activation-1.08e-01

Pos logit contributions
 give+0.031
 share+0.029
 listen+0.028
 leave+0.027
 wait+0.027
Neg logit contributions
 nowhere-0.034
 sudden-0.033
 year-0.033
 neighbourhood-0.032
 woods-0.031
Token of
Token ID286
Feature activation-7.88e-01

Pos logit contributions
 give+0.005
 share+0.004
 listen+0.004
 leave+0.004
 wait+0.004
Neg logit contributions
 nowhere-0.005
 sudden-0.005
 year-0.004
 neighbourhood-0.004
 woods-0.004
Token my
Token ID616
Feature activation+5.48e-01

Pos logit contributions
 give+0.024
 share+0.022
 listen+0.021
 leave+0.020
 wait+0.020
Neg logit contributions
 nowhere-0.043
 sudden-0.042
 year-0.041
 neighbourhood-0.040
 woods-0.040
Token paints
Token ID32281
Feature activation-1.39e+00

Pos logit contributions
 nowhere+0.024
ud+0.023
 sudden+0.023
 year+0.023
icer+0.022
Neg logit contributions
 give-0.023
 share-0.021
 listen-0.021
 wait-0.021
 leave-0.020
Token.
Token ID13
Feature activation-5.32e+00

Pos logit contributions
 give+0.069
 share+0.065
 listen+0.064
 leave+0.063
 ask+0.062
Neg logit contributions
 nowhere-0.047
 year-0.045
 sudden-0.043
 neighbourhood-0.042
 pieces-0.041
Token We
Token ID775
Feature activation-2.52e+00

Pos logit contributions
 lose+0.218
 leave+0.214
 give+0.209
 listen+0.199
 share+0.198
Neg logit contributions
 sky-0.154
-0.154
<|endoftext|>-0.152
Today-0.151
-0.151
Token can
Token ID460
Feature activation-4.01e+00

Pos logit contributions
 give+0.108
 listen+0.105
 leave+0.103
 Sorry+0.102
 share+0.099
Neg logit contributions
 year-0.083
 nowhere-0.080
 pieces-0.079
 neighbourhood-0.077
 sudden-0.077
Token paint
Token ID7521
Feature activation-7.85e-01

Pos logit contributions
 Sorry+0.148
beh+0.140
istence+0.137
severe+0.136
 give+0.136
Neg logit contributions
 was-0.134
 year-0.133
 neighbourhood-0.129
 sudden-0.129
<|endoftext|>-0.128
Token together
Token ID1978
Feature activation-5.76e-01

Pos logit contributions
 give+0.038
 share+0.036
 listen+0.035
 leave+0.035
 wait+0.034
Neg logit contributions
 nowhere-0.028
 sudden-0.027
 year-0.027
 neighbourhood-0.025
 woods-0.025
Token!"
Token ID2474
Feature activation-2.78e-01

Pos logit contributions
 give+0.023
 share+0.022
 listen+0.021
 leave+0.020
 wait+0.020
Neg logit contributions
 nowhere-0.026
 sudden-0.025
 year-0.025
 neighbourhood-0.024
 woods-0.024
Token.
Token ID13
Feature activation-4.87e-01

Pos logit contributions
 give+0.106
 share+0.103
 listen+0.100
 wait+0.100
 leave+0.100
Neg logit contributions
 nowhere-0.044
 year-0.040
in-0.038
 sudden-0.037
 pieces-0.036
Token "
Token ID366
Feature activation-3.15e-01

Pos logit contributions
 give+0.021
 share+0.020
 listen+0.019
 leave+0.019
 wait+0.018
Neg logit contributions
 nowhere-0.020
 sudden-0.020
 year-0.020
 neighbourhood-0.019
 woods-0.018
TokenNow
Token ID3844
Feature activation-3.28e+00

Pos logit contributions
 give+0.015
 share+0.014
 listen+0.014
 leave+0.013
 wait+0.013
Neg logit contributions
 nowhere-0.012
 sudden-0.012
 year-0.011
 neighbourhood-0.011
 woods-0.011
Token you
Token ID345
Feature activation-2.11e+00

Pos logit contributions
 give+0.120
istence+0.115
 share+0.114
 listen+0.114
 Sorry+0.113
Neg logit contributions
 was-0.106
 time-0.105
 seas-0.104
 sudden-0.103
 nowhere-0.103
Token try
Token ID1949
Feature activation-4.02e-01

Pos logit contributions
 give+0.064
 Sorry+0.061
 listen+0.059
 share+0.059
 leave+0.059
Neg logit contributions
 nowhere-0.097
 year-0.096
 sudden-0.093
 neighbourhood-0.091
 pieces-0.090
Token.
Token ID13
Feature activation-5.28e+00

Pos logit contributions
 give+0.019
 share+0.018
 listen+0.018
 leave+0.017
 ask+0.017
Neg logit contributions
 nowhere-0.015
 sudden-0.014
 year-0.014
 neighbourhood-0.014
 woods-0.013
Token And
Token ID843
Feature activation-3.59e+00

Pos logit contributions
 wait+0.217
 leave+0.213
 give+0.209
 listen+0.207
 lose+0.199
Neg logit contributions
-0.162
 sky-0.156
 woods-0.154
icer-0.151
<|endoftext|>-0.151
Token have
Token ID423
Feature activation+3.96e-01

Pos logit contributions
 give+0.099
 Sorry+0.098
 leave+0.096
 listen+0.092
 share+0.089
Neg logit contributions
 sudden-0.153
 nowhere-0.152
 seas-0.149
 pieces-0.149
 world-0.145
Token fun
Token ID1257
Feature activation-1.66e+00

Pos logit contributions
 nowhere+0.014
 year+0.014
 sudden+0.014
 neighbourhood+0.014
icer+0.014
Neg logit contributions
 give-0.019
 share-0.019
 listen-0.018
 ask-0.017
 leave-0.017
Token."
Token ID526
Feature activation-1.97e-02

Pos logit contributions
 give+0.081
 leave+0.076
 listen+0.075
 share+0.075
 wait+0.074
Neg logit contributions
 nowhere-0.055
 year-0.055
 sudden-0.054
 pieces-0.052
 neighbourhood-0.051
Token
Token ID198
Feature activation+2.28e+00

Pos logit contributions
 give+0.001
 share+0.001
 listen+0.001
 ask+0.001
 leave+0.001
Neg logit contributions
 nowhere-0.001
 year-0.001
 sudden-0.001
 neighbourhood-0.001
 seas-0.001
Token.
Token ID13
Feature activation-4.89e+00

Pos logit contributions
 give+0.085
 share+0.082
 listen+0.080
 leave+0.079
 wait+0.078
Neg logit contributions
 nowhere-0.047
 year-0.046
in-0.042
 pieces-0.042
 sudden-0.041
Token Be
Token ID1355
Feature activation-3.21e-01

Pos logit contributions
 listen+0.169
 leave+0.167
 disappoint+0.160
 wait+0.157
 give+0.155
Neg logit contributions
icer-0.169
ase-0.168
-0.163
<|endoftext|>-0.160
 woods-0.160
Token careful
Token ID8161
Feature activation-1.24e+00

Pos logit contributions
 give+0.019
 share+0.018
 listen+0.018
 leave+0.018
 wait+0.017
Neg logit contributions
 nowhere-0.008
 sudden-0.008
 year-0.008
 neighbourhood-0.007
 woods-0.007
Token with
Token ID351
Feature activation+8.56e-02

Pos logit contributions
 give+0.063
 share+0.060
 listen+0.058
 leave+0.058
 wait+0.058
Neg logit contributions
 nowhere-0.040
 year-0.039
 sudden-0.036
 neighbourhood-0.035
 area-0.034
Token them
Token ID606
Feature activation-4.51e-01

Pos logit contributions
 nowhere+0.004
 sudden+0.004
 year+0.004
 neighbourhood+0.004
 woods+0.004
Neg logit contributions
 give-0.003
 share-0.003
 listen-0.003
 leave-0.003
 ask-0.003
Token.
Token ID13
Feature activation-5.27e+00

Pos logit contributions
 give+0.021
 share+0.020
 listen+0.019
 leave+0.019
 wait+0.018
Neg logit contributions
 nowhere-0.018
 sudden-0.017
 year-0.017
 neighbourhood-0.016
 woods-0.016
Token They
Token ID1119
Feature activation-3.25e+00

Pos logit contributions
 listen+0.205
 wait+0.203
 leave+0.199
 give+0.194
 lose+0.187
Neg logit contributions
icer-0.164
-0.157
ud-0.155
<|endoftext|>-0.148
ase-0.147
Token are
Token ID389
Feature activation-1.69e+00

Pos logit contributions
 Sorry+0.118
asty+0.118
 listen+0.117
 give+0.116
 leave+0.112
Neg logit contributions
 pieces-0.121
 year-0.108
 nowhere-0.104
 was-0.104
 sudden-0.099
Token sharp
Token ID7786
Feature activation-4.18e-01

Pos logit contributions
 give+0.077
 listen+0.074
 leave+0.072
 share+0.072
 wait+0.071
Neg logit contributions
 nowhere-0.063
 sudden-0.056
 pieces-0.055
 year-0.054
<|endoftext|>-0.052
Token."
Token ID526
Feature activation+5.06e-01

Pos logit contributions
 give+0.021
 share+0.020
 listen+0.019
 leave+0.019
 wait+0.019
Neg logit contributions
 nowhere-0.015
 sudden-0.014
 year-0.014
 neighbourhood-0.013
 woods-0.013
Token
Token ID198
Feature activation+2.24e+00

Pos logit contributions
 sudden+0.019
 nowhere+0.019
 year+0.018
icer+0.018
 neighbourhood+0.017
Neg logit contributions
 give-0.025
 share-0.023
 ask-0.022
 listen-0.022
 leave-0.021
Token Tom
Token ID4186
Feature activation-1.33e+00

Pos logit contributions
 give+0.050
 share+0.048
 listen+0.046
 leave+0.046
ourage+0.044
Neg logit contributions
 nowhere-0.061
 year-0.059
 sudden-0.056
 pieces-0.055
<|endoftext|>-0.055
Token.
Token ID13
Feature activation-4.34e+00

Pos logit contributions
 give+0.065
 share+0.062
 listen+0.061
 leave+0.060
 wait+0.060
Neg logit contributions
 nowhere-0.043
 year-0.043
 sudden-0.040
 woods-0.040
 neighbourhood-0.039
Token Drink
Token ID32906
Feature activation-6.85e-01

Pos logit contributions
 give+0.151
 listen+0.151
 leave+0.150
 wait+0.148
 share+0.145
Neg logit contributions
icer-0.172
-0.154
ase-0.151
 woods-0.148
 snowball-0.142
Token some
Token ID617
Feature activation-7.41e-01

Pos logit contributions
 give+0.028
 share+0.026
 listen+0.026
 leave+0.025
 wait+0.025
Neg logit contributions
 nowhere-0.030
 sudden-0.029
 year-0.029
 neighbourhood-0.028
 woods-0.027
Token milk
Token ID7545
Feature activation-1.43e+00

Pos logit contributions
 give+0.036
 share+0.034
 listen+0.033
 leave+0.033
 wait+0.032
Neg logit contributions
 nowhere-0.026
 sudden-0.025
 year-0.025
 neighbourhood-0.024
 woods-0.023
Token.
Token ID13
Feature activation-5.25e+00

Pos logit contributions
 give+0.072
 share+0.068
 listen+0.067
 leave+0.067
 ask+0.065
Neg logit contributions
 nowhere-0.045
 year-0.045
 sudden-0.042
 pieces-0.041
in-0.040
Token It
Token ID632
Feature activation-2.11e+00

Pos logit contributions
 leave+0.211
 give+0.209
 lose+0.207
 wait+0.206
 listen+0.202
Neg logit contributions
icer-0.150
-0.144
<|endoftext|>-0.140
 snowball-0.139
ara-0.135
Token will
Token ID481
Feature activation-2.57e+00

Pos logit contributions
 give+0.110
 share+0.106
 leave+0.104
 listen+0.104
 wait+0.101
Neg logit contributions
 year-0.056
 pieces-0.056
 nowhere-0.055
 was-0.054
in-0.051
Token make
Token ID787
Feature activation+5.09e-01

Pos logit contributions
 give+0.066
 Sorry+0.062
ourage+0.061
 listen+0.061
 leave+0.061
Neg logit contributions
 year-0.125
 nowhere-0.125
<|endoftext|>-0.122
 sudden-0.122
 pieces-0.119
Token your
Token ID534
Feature activation+4.95e-01

Pos logit contributions
 nowhere+0.031
 neighbourhood+0.030
 sudden+0.029
 year+0.029
 woods+0.029
Neg logit contributions
 give-0.014
 share-0.013
 listen-0.012
 leave-0.011
 wait-0.011
Token mouth
Token ID5422
Feature activation-1.02e+00

Pos logit contributions
 nowhere+0.022
 sudden+0.021
 year+0.020
ud+0.020
 neighbourhood+0.020
Neg logit contributions
 give-0.022
 share-0.021
 leave-0.020
 wait-0.020
 listen-0.020
Token use
Token ID779
Feature activation-5.99e-01

Pos logit contributions
 Sorry+0.090
 give+0.083
 leave+0.080
osity+0.079
 sorry+0.077
Neg logit contributions
 year-0.121
in-0.120
<|endoftext|>-0.119
 was-0.117
 neighbourhood-0.114
Token any
Token ID597
Feature activation+2.45e-02

Pos logit contributions
 give+0.025
 share+0.023
 listen+0.022
 leave+0.022
 wait+0.022
Neg logit contributions
 nowhere-0.026
 sudden-0.025
 year-0.025
 neighbourhood-0.024
 woods-0.024
Token buttons
Token ID12163
Feature activation-2.73e+00

Pos logit contributions
 nowhere+0.001
 sudden+0.001
 year+0.001
 neighbourhood+0.001
 woods+0.001
Neg logit contributions
 give-0.001
 share-0.001
 listen-0.001
 leave-0.001
 wait-0.001
Token you
Token ID345
Feature activation-1.38e+00

Pos logit contributions
 give+0.135
 share+0.130
 listen+0.127
 Sorry+0.126
 leave+0.124
Neg logit contributions
 pieces-0.078
 year-0.075
in-0.075
 nowhere-0.074
 seas-0.072
Token want
Token ID765
Feature activation+3.54e-02

Pos logit contributions
 give+0.051
 share+0.047
 listen+0.045
 leave+0.045
 wait+0.043
Neg logit contributions
 nowhere-0.063
 year-0.061
 sudden-0.061
 neighbourhood-0.059
 woods-0.058
Token.
Token ID13
Feature activation-5.21e+00

Pos logit contributions
 sudden+0.001
 nowhere+0.001
 year+0.001
 neighbourhood+0.001
 woods+0.001
Neg logit contributions
 give-0.002
 share-0.001
 listen-0.001
 leave-0.001
 ask-0.001
Token I
Token ID314
Feature activation-1.42e+00

Pos logit contributions
 give+0.209
 lose+0.207
 leave+0.203
 share+0.191
 listen+0.189
Neg logit contributions
 woods-0.156
-0.147
icer-0.147
<|endoftext|>-0.144
 neighbourhood-0.144
Token have
Token ID423
Feature activation+7.39e-02

Pos logit contributions
 give+0.049
 share+0.045
 leave+0.045
 listen+0.044
 Sorry+0.042
Neg logit contributions
 year-0.065
 nowhere-0.064
 sudden-0.063
 neighbourhood-0.062
in-0.060
Token plenty
Token ID6088
Feature activation-2.20e+00

Pos logit contributions
 nowhere+0.003
 sudden+0.003
 year+0.003
 neighbourhood+0.003
 woods+0.003
Neg logit contributions
 give-0.003
 share-0.003
 listen-0.003
 leave-0.003
 wait-0.003
Token.
Token ID13
Feature activation-4.99e+00

Pos logit contributions
 give+0.133
 share+0.131
 Sorry+0.129
 leave+0.128
 listen+0.127
Neg logit contributions
 year-0.045
 nowhere-0.041
 pieces-0.040
 seas-0.038
 sudden-0.036
Token I
Token ID314
Feature activation-1.50e+00

Pos logit contributions
 lose+0.185
 give+0.184
 leave+0.183
 share+0.172
 listen+0.170
Neg logit contributions
 woods-0.170
<|endoftext|>-0.163
-0.161
Ah-0.161
icer-0.158
Token you
Token ID345
Feature activation-2.15e+00

Pos logit contributions
 give+0.067
 share+0.063
 listen+0.061
 leave+0.060
 ask+0.060
Neg logit contributions
 nowhere-0.075
 year-0.074
 seas-0.073
 sudden-0.073
 woods-0.069
Token are
Token ID389
Feature activation-1.49e+00

Pos logit contributions
 give+0.072
 leave+0.068
 share+0.066
 Sorry+0.065
 listen+0.064
Neg logit contributions
 year-0.090
 nowhere-0.085
 sudden-0.085
 pieces-0.084
 time-0.084
Token red
Token ID2266
Feature activation-7.78e-01

Pos logit contributions
 give+0.078
 listen+0.073
 leave+0.073
 share+0.072
 wait+0.071
Neg logit contributions
 nowhere-0.046
 sudden-0.043
 year-0.042
 woods-0.039
 pieces-0.038
Token and
Token ID290
Feature activation-2.43e+00

Pos logit contributions
 give+0.037
 share+0.035
 listen+0.034
 leave+0.033
 wait+0.033
Neg logit contributions
 nowhere-0.029
 sudden-0.028
 year-0.028
 neighbourhood-0.026
 woods-0.026
Token round
Token ID2835
Feature activation-3.44e-01

Pos logit contributions
 leave+0.138
 give+0.136
 wait+0.128
 listen+0.127
 pay+0.124
Neg logit contributions
 sudden-0.058
 nowhere-0.056
 pieces-0.050
 seas-0.048
 year-0.047
Token.
Token ID13
Feature activation-5.20e+00

Pos logit contributions
 give+0.015
 share+0.014
 listen+0.014
 leave+0.014
 wait+0.013
Neg logit contributions
 nowhere-0.014
 sudden-0.014
 year-0.013
 neighbourhood-0.013
 woods-0.013
Token Tomato
Token ID47777
Feature activation-1.33e+00

Pos logit contributions
 leave+0.207
 wait+0.203
 give+0.202
 listen+0.195
 lose+0.194
Neg logit contributions
icer-0.160
 woods-0.159
 snowball-0.156
ud-0.150
-0.150
Token,
Token ID11
Feature activation-1.60e+00

Pos logit contributions
 give+0.061
 share+0.057
 listen+0.056
 leave+0.055
 wait+0.054
Neg logit contributions
 nowhere-0.048
 year-0.048
 sudden-0.047
 neighbourhood-0.046
 woods-0.045
Token tomato
Token ID24240
Feature activation-1.64e+00

Pos logit contributions
 give+0.059
 share+0.054
 listen+0.053
 leave+0.053
 wait+0.052
Neg logit contributions
 nowhere-0.070
 sudden-0.069
 year-0.068
 seas-0.065
 neighbourhood-0.064
Token,
Token ID11
Feature activation-1.87e+00

Pos logit contributions
 give+0.075
 share+0.070
 listen+0.069
 leave+0.069
 wait+0.068
Neg logit contributions
 year-0.057
 nowhere-0.056
 sudden-0.055
 woods-0.054
in-0.054
Token you
Token ID345
Feature activation-2.20e+00

Pos logit contributions
 give+0.077
 share+0.072
 leave+0.071
 listen+0.070
 wait+0.070
Neg logit contributions
 sudden-0.070
 nowhere-0.070
 year-0.069
 seas-0.067
 woods-0.065
Token you
Token ID345
Feature activation-1.78e+00

Pos logit contributions
istence+0.134
 give+0.129
 share+0.124
ourage+0.124
anks+0.122
Neg logit contributions
 time-0.094
 was-0.090
 nowhere-0.087
 year-0.085
 sudden-0.085
Token can
Token ID460
Feature activation-2.94e+00

Pos logit contributions
 give+0.066
 share+0.062
 Sorry+0.062
 listen+0.061
 leave+0.060
Neg logit contributions
 nowhere-0.071
 year-0.070
 sudden-0.068
 pieces-0.068
in-0.066
Token peel
Token ID31738
Feature activation-2.52e-01

Pos logit contributions
 Sorry+0.094
 give+0.083
ourage+0.082
 Please+0.081
osity+0.080
Neg logit contributions
in-0.126
<|endoftext|>-0.122
 year-0.121
 was-0.121
 snowball-0.119
Token the
Token ID262
Feature activation+6.82e-01

Pos logit contributions
 give+0.011
 share+0.010
 listen+0.010
 leave+0.010
 wait+0.009
Neg logit contributions
 nowhere-0.011
 sudden-0.010
 year-0.010
 neighbourhood-0.010
 woods-0.010
Token orange
Token ID10912
Feature activation-4.99e-01

Pos logit contributions
 year+0.025
 sudden+0.025
 nowhere+0.025
 neighbourhood+0.024
ud+0.023
Neg logit contributions
 give-0.034
 share-0.031
 listen-0.030
 wait-0.030
 leave-0.030
Token.
Token ID13
Feature activation-5.18e+00

Pos logit contributions
 give+0.026
 share+0.024
 listen+0.023
 leave+0.023
 wait+0.023
Neg logit contributions
 nowhere-0.017
 sudden-0.017
 year-0.016
 neighbourhood-0.016
 woods-0.015
Token You
Token ID921
Feature activation-2.64e+00

Pos logit contributions
 leave+0.199
 give+0.192
 listen+0.190
 wait+0.189
 lose+0.189
Neg logit contributions
 snowball-0.148
icer-0.144
-0.142
ud-0.137
<|endoftext|>-0.136
Token can
Token ID460
Feature activation-2.74e+00

Pos logit contributions
 Sorry+0.108
 give+0.104
 leave+0.101
 listen+0.098
 share+0.097
Neg logit contributions
 pieces-0.089
 nowhere-0.088
 year-0.087
in-0.084
 time-0.083
Token use
Token ID779
Feature activation-3.72e-01

Pos logit contributions
 Sorry+0.085
 give+0.078
 leave+0.073
 listen+0.072
ourage+0.072
Neg logit contributions
in-0.120
<|endoftext|>-0.118
 year-0.116
icer-0.115
 snowball-0.115
Token your
Token ID534
Feature activation+4.02e-01

Pos logit contributions
 give+0.015
 share+0.014
 listen+0.013
 leave+0.013
 wait+0.013
Neg logit contributions
 nowhere-0.017
 sudden-0.017
 year-0.016
 neighbourhood-0.016
 woods-0.016
Token fingers
Token ID9353
Feature activation-1.17e+00

Pos logit contributions
 nowhere+0.015
 year+0.015
 sudden+0.015
 neighbourhood+0.014
 woods+0.014
Neg logit contributions
 give-0.019
 share-0.018
 listen-0.017
 wait-0.017
 leave-0.017
Token is
Token ID318
Feature activation-8.84e-01

Pos logit contributions
 give+0.037
 share+0.035
 listen+0.034
 leave+0.034
 wait+0.033
Neg logit contributions
 nowhere-0.021
 sudden-0.020
 year-0.020
 neighbourhood-0.019
 woods-0.018
Token a
Token ID257
Feature activation+2.00e-03

Pos logit contributions
 give+0.042
 share+0.040
 listen+0.039
 leave+0.038
 wait+0.037
Neg logit contributions
 nowhere-0.033
 sudden-0.031
 year-0.031
 neighbourhood-0.029
 woods-0.029
Token nice
Token ID3621
Feature activation-7.56e-01

Pos logit contributions
 nowhere+0.000
 sudden+0.000
 year+0.000
 neighbourhood+0.000
 woods+0.000
Neg logit contributions
 give-0.000
 share-0.000
 listen-0.000
 wait-0.000
 leave-0.000
Token lot
Token ID1256
Feature activation-1.23e+00

Pos logit contributions
 give+0.041
 share+0.039
 listen+0.038
 leave+0.038
 wait+0.037
Neg logit contributions
 nowhere-0.022
 sudden-0.022
 year-0.022
 neighbourhood-0.020
 woods-0.020
Tokenion
Token ID295
Feature activation-2.31e+00

Pos logit contributions
 give+0.074
 share+0.072
 leave+0.070
 listen+0.070
 wait+0.068
Neg logit contributions
 year-0.028
 nowhere-0.027
 sudden-0.026
 neighbourhood-0.024
 woods-0.023
Token.
Token ID13
Feature activation-5.17e+00

Pos logit contributions
 give+0.122
 share+0.118
 leave+0.115
 listen+0.114
 ask+0.114
Neg logit contributions
 year-0.063
 nowhere-0.060
 pieces-0.057
in-0.056
 area-0.056
Token It
Token ID632
Feature activation-2.58e+00

Pos logit contributions
 give+0.213
 leave+0.210
 listen+0.207
 lose+0.206
 share+0.203
Neg logit contributions
-0.152
icer-0.152
 woods-0.150
 sky-0.145
ud-0.144
Token smells
Token ID25760
Feature activation-2.02e+00

Pos logit contributions
 give+0.123
 share+0.122
 listen+0.121
 leave+0.120
 wait+0.119
Neg logit contributions
 pieces-0.077
 was-0.076
 year-0.070
 nowhere-0.070
 woods-0.068
Token like
Token ID588
Feature activation-4.64e-01

Pos logit contributions
 give+0.094
 share+0.090
 leave+0.089
 listen+0.088
 ask+0.088
Neg logit contributions
 year-0.069
 sudden-0.068
 nowhere-0.065
 time-0.062
 seas-0.061
Token flowers
Token ID12734
Feature activation-2.19e+00

Pos logit contributions
 give+0.022
 share+0.021
 listen+0.020
 leave+0.020
 wait+0.019
Neg logit contributions
 nowhere-0.017
 sudden-0.017
 year-0.017
 neighbourhood-0.016
 woods-0.016
Token.
Token ID13
Feature activation-5.04e+00

Pos logit contributions
 give+0.122
 Sorry+0.116
 leave+0.115
 share+0.115
 listen+0.113
Neg logit contributions
 year-0.054
 nowhere-0.053
 woods-0.050
 pieces-0.050
 area-0.047
Token.
Token ID13
Feature activation-4.35e+00

Pos logit contributions
 give+0.047
 share+0.044
 listen+0.043
 leave+0.043
 wait+0.042
Neg logit contributions
 nowhere-0.030
 year-0.029
 sudden-0.028
 neighbourhood-0.027
 woods-0.026
Token You
Token ID921
Feature activation-2.10e+00

Pos logit contributions
 give+0.179
 leave+0.176
 share+0.165
 listen+0.165
 lose+0.163
Neg logit contributions
-0.147
 woods-0.147
 neighbourhood-0.135
icer-0.133
 world-0.132
Token measured
Token ID8630
Feature activation+9.59e-01

Pos logit contributions
 give+0.089
 leave+0.087
 Sorry+0.085
 listen+0.084
 share+0.082
Neg logit contributions
 nowhere-0.071
 year-0.070
 pieces-0.070
in-0.066
 sudden-0.066
Token the
Token ID262
Feature activation+1.29e+00

Pos logit contributions
 sudden+0.044
 nowhere+0.044
 neighbourhood+0.042
 year+0.042
 woods+0.040
Neg logit contributions
 give-0.038
 share-0.037
 listen-0.035
 ask-0.033
 leave-0.033
Token flour
Token ID10601
Feature activation-4.39e-01

Pos logit contributions
 nowhere+0.054
 sudden+0.047
awed+0.046
jo+0.045
 neighbourhood+0.045
Neg logit contributions
 give-0.060
 wait-0.056
 listen-0.054
 share-0.053
 leave-0.051
Token.
Token ID13
Feature activation-5.17e+00

Pos logit contributions
 give+0.021
 share+0.020
 listen+0.019
 leave+0.019
 wait+0.019
Neg logit contributions
 nowhere-0.016
 sudden-0.016
 year-0.015
 neighbourhood-0.015
 woods-0.014
Token You
Token ID921
Feature activation-2.20e+00

Pos logit contributions
 leave+0.228
 give+0.221
 lose+0.219
 listen+0.216
 share+0.209
Neg logit contributions
-0.155
icer-0.149
 woods-0.144
 sky-0.136
 world-0.133
Token yielded
Token ID26403
Feature activation+3.54e-01

Pos logit contributions
 give+0.094
 leave+0.092
 listen+0.089
 Sorry+0.089
 share+0.087
Neg logit contributions
 nowhere-0.076
 year-0.075
 pieces-0.073
 sudden-0.071
in-0.070
Token the
Token ID262
Feature activation+1.04e+00

Pos logit contributions
 nowhere+0.016
 sudden+0.015
 neighbourhood+0.015
 year+0.015
 woods+0.014
Neg logit contributions
 give-0.015
 share-0.014
 listen-0.013
 leave-0.013
 ask-0.013
Token flour
Token ID10601
Feature activation-1.34e+00

Pos logit contributions
 nowhere+0.041
 sudden+0.037
awed+0.036
 neighbourhood+0.036
 woods+0.036
Neg logit contributions
 give-0.052
 wait-0.048
 listen-0.047
 share-0.046
 ask-0.045
Token.
Token ID13
Feature activation-5.05e+00

Pos logit contributions
 give+0.068
 share+0.065
 listen+0.064
 leave+0.062
 ask+0.062
Neg logit contributions
 nowhere-0.042
 year-0.042
 sudden-0.039
 pieces-0.038
 neighbourhood-0.037
Token kids
Token ID3988
Feature activation-1.47e+00

Pos logit contributions
 give+0.052
 wait+0.049
 listen+0.049
 share+0.048
 leave+0.047
Neg logit contributions
 year-0.056
 nowhere-0.055
in-0.054
<|endoftext|>-0.054
 neighbourhood-0.054
Token,
Token ID11
Feature activation-1.52e+00

Pos logit contributions
 give+0.060
 share+0.056
 listen+0.055
 leave+0.055
 wait+0.053
Neg logit contributions
 nowhere-0.058
 year-0.058
 neighbourhood-0.055
 sudden-0.055
 woods-0.053
Token have
Token ID423
Feature activation+6.50e-01

Pos logit contributions
 give+0.040
 share+0.038
 listen+0.036
 leave+0.035
 wait+0.035
Neg logit contributions
 nowhere-0.081
 year-0.078
 seas-0.075
 sudden-0.075
 neighbourhood-0.074
Token some
Token ID617
Feature activation-4.64e-01

Pos logit contributions
 sudden+0.030
icer+0.030
 year+0.030
 neighbourhood+0.030
 nowhere+0.029
Neg logit contributions
 give-0.025
 share-0.025
 listen-0.023
 wait-0.022
 ask-0.021
Token cookies
Token ID14746
Feature activation-7.96e-01

Pos logit contributions
 give+0.023
 share+0.022
 listen+0.021
 leave+0.021
 wait+0.021
Neg logit contributions
 nowhere-0.016
 sudden-0.016
 year-0.015
 neighbourhood-0.015
 woods-0.015
Token.
Token ID13
Feature activation-5.17e+00

Pos logit contributions
 give+0.035
 share+0.033
 listen+0.032
 leave+0.031
 wait+0.031
Neg logit contributions
 nowhere-0.032
 sudden-0.031
 year-0.031
 neighbourhood-0.030
 woods-0.029
Token They
Token ID1119
Feature activation-2.87e+00

Pos logit contributions
 lose+0.204
 leave+0.203
 give+0.200
 wait+0.197
 listen+0.193
Neg logit contributions
-0.153
 year-0.151
icer-0.149
 woods-0.149
ase-0.143
Token are
Token ID389
Feature activation-1.20e+00

Pos logit contributions
 give+0.103
 Sorry+0.103
 leave+0.100
 listen+0.099
 please+0.095
Neg logit contributions
 year-0.109
 pieces-0.106
 nowhere-0.100
 neighbourhood-0.100
 was-0.100
Token chocolate
Token ID11311
Feature activation-8.96e-01

Pos logit contributions
 give+0.057
 share+0.052
 listen+0.052
 leave+0.052
 wait+0.051
Neg logit contributions
 nowhere-0.043
 year-0.041
 sudden-0.041
 neighbourhood-0.038
 woods-0.038
Token chip
Token ID11594
Feature activation+3.88e-01

Pos logit contributions
 give+0.045
 share+0.042
 listen+0.041
 leave+0.041
 ask+0.040
Neg logit contributions
 nowhere-0.030
 year-0.030
 sudden-0.029
 neighbourhood-0.028
 woods-0.027
Token.
Token ID13
Feature activation-4.91e+00

Pos logit contributions
 nowhere+0.015
 sudden+0.014
 neighbourhood+0.013
 woods+0.013
 year+0.013
Neg logit contributions
 give-0.019
 share-0.018
 listen-0.017
 leave-0.017
 wait-0.017
Token have
Token ID423
Feature activation-2.43e-01

Pos logit contributions
 give+0.071
 Sorry+0.070
ourage+0.069
 listen+0.067
istence+0.065
Neg logit contributions
 year-0.113
in-0.112
 was-0.109
 neighbourhood-0.108
<|endoftext|>-0.107
Token some
Token ID617
Feature activation-2.65e-01

Pos logit contributions
 give+0.010
 share+0.010
 listen+0.009
 leave+0.009
 wait+0.009
Neg logit contributions
 nowhere-0.011
 sudden-0.010
 year-0.010
 neighbourhood-0.010
 woods-0.010
Token of
Token ID286
Feature activation-6.70e-01

Pos logit contributions
 give+0.014
 share+0.013
 listen+0.013
 leave+0.013
 wait+0.013
Neg logit contributions
 nowhere-0.009
 sudden-0.008
 year-0.008
 neighbourhood-0.008
 woods-0.007
Token my
Token ID616
Feature activation+6.22e-01

Pos logit contributions
 give+0.020
 share+0.018
 listen+0.017
 leave+0.017
 wait+0.016
Neg logit contributions
 nowhere-0.037
 sudden-0.036
 year-0.036
 neighbourhood-0.035
 woods-0.034
Token blocks
Token ID7021
Feature activation-9.84e-01

Pos logit contributions
 nowhere+0.021
 sudden+0.021
ud+0.019
 year+0.019
awed+0.018
Neg logit contributions
 give-0.033
 share-0.032
 listen-0.030
 wait-0.030
 leave-0.030
Token.
Token ID13
Feature activation-5.16e+00

Pos logit contributions
 give+0.050
 share+0.047
 listen+0.046
 leave+0.046
 wait+0.045
Neg logit contributions
 nowhere-0.033
 year-0.031
 sudden-0.030
 neighbourhood-0.029
 woods-0.028
Token We
Token ID775
Feature activation-1.89e+00

Pos logit contributions
 leave+0.192
 give+0.188
 lose+0.187
 wait+0.186
 listen+0.184
Neg logit contributions
 woods-0.156
-0.150
ase-0.146
icer-0.146
aws-0.145
Token can
Token ID460
Feature activation-3.37e+00

Pos logit contributions
 give+0.072
 listen+0.068
 leave+0.067
 share+0.065
 wait+0.064
Neg logit contributions
 nowhere-0.075
 year-0.074
in-0.070
 pieces-0.070
 sudden-0.069
Token build
Token ID1382
Feature activation+3.99e-01

Pos logit contributions
istence+0.083
 Sorry+0.081
ourage+0.081
 give+0.080
 listen+0.079
Neg logit contributions
 year-0.150
 was-0.147
in-0.146
 neighbourhood-0.144
<|endoftext|>-0.143
Token a
Token ID257
Feature activation+1.09e+00

Pos logit contributions
 nowhere+0.016
 sudden+0.016
 year+0.016
 neighbourhood+0.016
ud+0.015
Neg logit contributions
 give-0.018
 share-0.017
 listen-0.016
 leave-0.016
 wait-0.016
Token new
Token ID649
Feature activation+1.09e+00

Pos logit contributions
 nowhere+0.040
awed+0.034
 sudden+0.034
ud+0.033
 year+0.032
Neg logit contributions
 give-0.056
 share-0.055
 listen-0.052
 wait-0.052
 ask-0.051
Token rad
Token ID2511
Feature activation-1.23e+00

Pos logit contributions
 sudden+0.018
 nowhere+0.017
 year+0.017
 neighbourhood+0.017
icer+0.017
Neg logit contributions
 give-0.018
 share-0.017
 listen-0.016
 leave-0.016
 wait-0.016
Tokenishes
Token ID5614
Feature activation-9.38e-01

Pos logit contributions
 give+0.061
 share+0.058
 listen+0.057
 wait+0.056
 leave+0.056
Neg logit contributions
 nowhere-0.042
 sudden-0.041
 year-0.039
 neighbourhood-0.039
icer-0.038
Token from
Token ID422
Feature activation+1.27e+00

Pos logit contributions
 give+0.040
 share+0.037
 listen+0.036
 leave+0.036
 wait+0.035
Neg logit contributions
 nowhere-0.039
 sudden-0.038
 year-0.038
 neighbourhood-0.036
 woods-0.035
Token our
Token ID674
Feature activation+1.22e+00

Pos logit contributions
 sudden+0.060
ud+0.057
 nowhere+0.056
 year+0.056
 neighbourhood+0.056
Neg logit contributions
 give-0.050
 share-0.047
 leave-0.044
 listen-0.044
 wait-0.043
Token garden
Token ID11376
Feature activation+1.56e+00

Pos logit contributions
 nowhere+0.042
 sudden+0.039
 year+0.036
ud+0.035
eh+0.035
Neg logit contributions
 give-0.065
 share-0.062
 leave-0.062
 wait-0.059
 ask-0.059
Token.
Token ID13
Feature activation-5.16e+00

Pos logit contributions
 sudden+0.072
 nowhere+0.071
icer+0.063
ud+0.062
 neighbourhood+0.062
Neg logit contributions
 give-0.066
 share-0.060
 leave-0.059
 ask-0.057
 listen-0.057
Token They
Token ID1119
Feature activation-3.07e+00

Pos logit contributions
 leave+0.206
 listen+0.205
 wait+0.205
 give+0.204
 lose+0.202
Neg logit contributions
-0.157
 woods-0.156
 year-0.151
-0.144
<|endoftext|>-0.144
Token are
Token ID389
Feature activation-1.22e+00

Pos logit contributions
 give+0.112
 listen+0.109
 Sorry+0.108
 leave+0.106
 share+0.104
Neg logit contributions
 year-0.119
 pieces-0.116
 nowhere-0.110
 was-0.110
 sudden-0.108
Token pink
Token ID11398
Feature activation-2.94e-01

Pos logit contributions
 give+0.063
 share+0.059
 listen+0.059
 leave+0.058
 wait+0.057
Neg logit contributions
 nowhere-0.040
 sudden-0.038
 year-0.038
 neighbourhood-0.035
 woods-0.035
Token and
Token ID290
Feature activation-2.13e+00

Pos logit contributions
 give+0.015
 share+0.014
 listen+0.014
 leave+0.013
 wait+0.013
Neg logit contributions
 nowhere-0.010
 sudden-0.010
 year-0.010
 neighbourhood-0.009
 woods-0.009
Token crunch
Token ID23527
Feature activation+5.93e-01

Pos logit contributions
 give+0.116
 leave+0.111
 listen+0.110
 share+0.107
 wait+0.107
Neg logit contributions
 nowhere-0.058
 sudden-0.056
 year-0.055
 pieces-0.050
 seas-0.049
Token the
Token ID262
Feature activation+2.42e-01

Pos logit contributions
 give+0.021
 share+0.019
 listen+0.019
 leave+0.018
 wait+0.018
Neg logit contributions
 nowhere-0.024
 sudden-0.024
 year-0.023
 neighbourhood-0.023
 woods-0.022
Token red
Token ID2266
Feature activation-6.15e-01

Pos logit contributions
 nowhere+0.008
 sudden+0.008
 year+0.007
 neighbourhood+0.007
ud+0.007
Neg logit contributions
 give-0.013
 share-0.012
 listen-0.012
 ask-0.012
 leave-0.012
Token cr
Token ID1067
Feature activation-1.02e+00

Pos logit contributions
 give+0.035
 share+0.034
 listen+0.033
 leave+0.033
 wait+0.032
Neg logit contributions
 nowhere-0.017
 sudden-0.016
 year-0.016
 neighbourhood-0.015
 woods-0.014
Tokenay
Token ID323
Feature activation-1.15e-01

Pos logit contributions
 give+0.063
 share+0.060
 listen+0.059
 leave+0.058
 wait+0.058
Neg logit contributions
 nowhere-0.023
 year-0.022
 sudden-0.021
 neighbourhood-0.020
icer-0.019
Tokenon
Token ID261
Feature activation-1.08e+00

Pos logit contributions
 give+0.007
 share+0.006
 listen+0.006
 leave+0.006
 wait+0.006
Neg logit contributions
 nowhere-0.003
 sudden-0.003
 neighbourhood-0.003
 year-0.003
 woods-0.003
Token.
Token ID13
Feature activation-5.14e+00

Pos logit contributions
 give+0.053
 share+0.050
 listen+0.049
 leave+0.048
 wait+0.047
Neg logit contributions
 nowhere-0.037
 year-0.036
 sudden-0.034
 neighbourhood-0.033
 woods-0.033
Token I
Token ID314
Feature activation-1.62e+00

Pos logit contributions
 lose+0.197
 leave+0.195
 disappoint+0.188
beh+0.188
 give+0.186
Neg logit contributions
<|endoftext|>-0.144
-0.143
-0.138
 sky-0.136
Ah-0.133
Token like
Token ID588
Feature activation-5.32e-01

Pos logit contributions
 give+0.061
 leave+0.057
 share+0.057
 listen+0.056
 Sorry+0.054
Neg logit contributions
 year-0.067
 nowhere-0.065
 sudden-0.063
 neighbourhood-0.063
in-0.062
Token hearts
Token ID11954
Feature activation-1.89e+00

Pos logit contributions
 give+0.021
 share+0.020
 listen+0.019
 leave+0.019
 wait+0.018
Neg logit contributions
 nowhere-0.024
 sudden-0.024
 year-0.023
 neighbourhood-0.023
 woods-0.022
Token too
Token ID1165
Feature activation-1.11e+00

Pos logit contributions
 give+0.094
 leave+0.087
 share+0.086
 listen+0.086
 wait+0.086
Neg logit contributions
 nowhere-0.060
 year-0.059
 pieces-0.056
in-0.056
 world-0.054
Token.
Token ID13
Feature activation-4.79e+00

Pos logit contributions
 give+0.051
 share+0.048
 leave+0.047
 listen+0.046
 wait+0.045
Neg logit contributions
 year-0.043
 nowhere-0.043
 sudden-0.041
 neighbourhood-0.039
 woods-0.038
Token this
Token ID428
Feature activation-3.87e-01

Pos logit contributions
 give+0.002
 share+0.002
 listen+0.002
 leave+0.002
 ask+0.002
Neg logit contributions
 nowhere-0.002
 sudden-0.002
 year-0.002
 neighbourhood-0.002
 woods-0.002
Token wrench
Token ID34561
Feature activation-1.21e+00

Pos logit contributions
 give+0.020
 share+0.018
 listen+0.018
 leave+0.018
 wait+0.017
Neg logit contributions
 nowhere-0.014
 sudden-0.013
 year-0.013
 neighbourhood-0.012
 woods-0.012
Token and
Token ID290
Feature activation-3.82e+00

Pos logit contributions
 give+0.060
 share+0.057
 listen+0.056
 leave+0.055
 wait+0.054
Neg logit contributions
 nowhere-0.041
 year-0.040
 sudden-0.038
 neighbourhood-0.037
 woods-0.036
Token this
Token ID428
Feature activation-1.01e+00

Pos logit contributions
 Sorry+0.103
ourage+0.100
 disappoint+0.094
aring+0.094
asty+0.093
Neg logit contributions
 pieces-0.177
 sudden-0.169
 nowhere-0.166
 was-0.163
<|endoftext|>-0.161
Token hammer
Token ID15554
Feature activation-8.55e-01

Pos logit contributions
 give+0.047
 share+0.045
 listen+0.044
 leave+0.043
 wait+0.042
Neg logit contributions
 nowhere-0.037
 sudden-0.036
 year-0.036
 neighbourhood-0.035
 woods-0.034
Token.
Token ID13
Feature activation-5.13e+00

Pos logit contributions
 give+0.038
 share+0.036
 listen+0.035
 leave+0.034
 wait+0.033
Neg logit contributions
 nowhere-0.034
 sudden-0.033
 year-0.033
 neighbourhood-0.031
 woods-0.031
Token And
Token ID843
Feature activation-3.56e+00

Pos logit contributions
 leave+0.204
 give+0.204
 wait+0.204
 lose+0.200
 listen+0.198
Neg logit contributions
<|endoftext|>-0.155
-0.148
 woods-0.147
 neighbourhood-0.145
-0.142
Token you
Token ID345
Feature activation-2.24e+00

Pos logit contributions
 give+0.103
 Sorry+0.103
 share+0.099
 listen+0.097
 leave+0.097
Neg logit contributions
 nowhere-0.150
 sudden-0.148
 pieces-0.147
 seas-0.140
 was-0.138
Token can
Token ID460
Feature activation-3.30e+00

Pos logit contributions
 give+0.086
 listen+0.086
 Sorry+0.084
 share+0.083
 wait+0.081
Neg logit contributions
 nowhere-0.084
 year-0.083
 pieces-0.081
 sudden-0.077
<|endoftext|>-0.076
Token hand
Token ID1021
Feature activation-1.03e+00

Pos logit contributions
 Sorry+0.104
osity+0.093
 Thank+0.090
 give+0.087
 Please+0.086
Neg logit contributions
in-0.138
<|endoftext|>-0.136
 year-0.136
 was-0.133
 neighbourhood-0.133
Token me
Token ID502
Feature activation-1.70e+00

Pos logit contributions
 give+0.044
 share+0.041
 listen+0.040
 leave+0.040
 wait+0.039
Neg logit contributions
 nowhere-0.042
 sudden-0.041
 year-0.040
 neighbourhood-0.039
 woods-0.038
Token these
Token ID777
Feature activation-5.96e-01

Pos logit contributions
 give+0.051
 share+0.048
 leave+0.047
 listen+0.046
 wait+0.046
Neg logit contributions
 nowhere-0.041
 sudden-0.039
 year-0.038
 neighbourhood-0.037
 woods-0.036
Token carrots
Token ID34397
Feature activation-9.93e-01

Pos logit contributions
 give+0.028
 share+0.026
 listen+0.026
 leave+0.025
 wait+0.025
Neg logit contributions
 nowhere-0.023
 sudden-0.022
 year-0.021
 neighbourhood-0.021
 woods-0.020
Token and
Token ID290
Feature activation-2.11e+00

Pos logit contributions
 give+0.045
 share+0.042
 listen+0.041
 leave+0.040
 wait+0.039
Neg logit contributions
 nowhere-0.039
 year-0.037
 sudden-0.036
 neighbourhood-0.035
 woods-0.035
Token cel
Token ID18725
Feature activation-2.51e-01

Pos logit contributions
 give+0.094
 listen+0.087
 share+0.087
 leave+0.087
ourage+0.086
Neg logit contributions
 nowhere-0.074
 sudden-0.072
 pieces-0.069
 seas-0.068
 year-0.067
Tokenery
Token ID1924
Feature activation-2.51e-01

Pos logit contributions
 give+0.016
 share+0.016
 listen+0.015
 leave+0.015
 wait+0.015
Neg logit contributions
 nowhere-0.005
 sudden-0.005
 year-0.005
 neighbourhood-0.004
 woods-0.004
Token.
Token ID13
Feature activation-5.10e+00

Pos logit contributions
 give+0.011
 share+0.010
 listen+0.010
 leave+0.010
 wait+0.009
Neg logit contributions
 nowhere-0.011
 sudden-0.011
 year-0.010
 neighbourhood-0.010
 woods-0.010
Token They
Token ID1119
Feature activation-2.92e+00

Pos logit contributions
 leave+0.217
 lose+0.210
 give+0.208
 wait+0.203
 share+0.195
Neg logit contributions
-0.159
 woods-0.146
-0.145
<|endoftext|>-0.139
Ah-0.138
Token are
Token ID389
Feature activation-1.14e+00

Pos logit contributions
 Sorry+0.114
 give+0.108
 listen+0.105
 leave+0.105
 share+0.101
Neg logit contributions
 pieces-0.106
 year-0.100
 world-0.096
in-0.095
 area-0.094
Token crunch
Token ID23527
Feature activation+1.91e-01

Pos logit contributions
 give+0.057
 listen+0.053
 share+0.053
 leave+0.052
 wait+0.051
Neg logit contributions
 nowhere-0.039
 sudden-0.037
 year-0.037
 neighbourhood-0.035
 woods-0.034
Tokeny
Token ID88
Feature activation-8.69e-01

Pos logit contributions
 sudden+0.004
 nowhere+0.004
 neighbourhood+0.003
 woods+0.003
 year+0.003
Neg logit contributions
 give-0.013
 share-0.012
 listen-0.012
 leave-0.012
 wait-0.011
Token and
Token ID290
Feature activation-2.38e+00

Pos logit contributions
 give+0.043
 share+0.040
 listen+0.039
 leave+0.039
 wait+0.038
Neg logit contributions
 nowhere-0.031
 sudden-0.030
 year-0.030
 neighbourhood-0.028
 woods-0.027
Token you
Token ID345
Feature activation-1.84e+00

Pos logit contributions
 give+0.053
 share+0.049
ourage+0.048
 listen+0.047
 leave+0.047
Neg logit contributions
 year-0.136
 nowhere-0.132
 neighbourhood-0.127
<|endoftext|>-0.126
 woods-0.125
Token can
Token ID460
Feature activation-2.51e+00

Pos logit contributions
 give+0.071
 listen+0.068
 share+0.067
 wait+0.066
 Sorry+0.065
Neg logit contributions
 year-0.069
 nowhere-0.066
in-0.065
 pieces-0.065
 area-0.064
Token keep
Token ID1394
Feature activation-9.26e-01

Pos logit contributions
 Sorry+0.070
 give+0.067
 listen+0.067
ourage+0.064
istence+0.063
Neg logit contributions
 year-0.116
in-0.113
 neighbourhood-0.110
 area-0.109
 was-0.108
Token the
Token ID262
Feature activation-1.12e-01

Pos logit contributions
 give+0.041
 share+0.038
 listen+0.037
 leave+0.037
 wait+0.036
Neg logit contributions
 nowhere-0.038
 sudden-0.036
 year-0.036
 neighbourhood-0.035
 woods-0.034
Token coins
Token ID10796
Feature activation-1.81e+00

Pos logit contributions
 give+0.006
 share+0.006
 listen+0.006
 leave+0.006
 wait+0.006
Neg logit contributions
 nowhere-0.003
 sudden-0.003
 year-0.003
 neighbourhood-0.003
ud-0.003
Token.
Token ID13
Feature activation-5.10e+00

Pos logit contributions
 give+0.089
 share+0.084
 listen+0.083
 leave+0.081
 Sorry+0.081
Neg logit contributions
 year-0.059
 nowhere-0.059
in-0.054
 seas-0.053
 pieces-0.053
Token You
Token ID921
Feature activation-2.51e+00

Pos logit contributions
 lose+0.201
 leave+0.201
 bleed+0.197
 wait+0.194
 give+0.194
Neg logit contributions
-0.159
<|endoftext|>-0.158
-0.154
 woods-0.147
 year-0.145
Token can
Token ID460
Feature activation-2.99e+00

Pos logit contributions
 Sorry+0.100
 give+0.099
asty+0.097
 listen+0.097
 leave+0.095
Neg logit contributions
 nowhere-0.085
 year-0.085
 pieces-0.081
in-0.077
 time-0.076
Token buy
Token ID2822
Feature activation+4.63e-01

Pos logit contributions
 Sorry+0.083
osity+0.077
 give+0.072
ourage+0.071
 listen+0.070
Neg logit contributions
 year-0.140
 neighbourhood-0.131
<|endoftext|>-0.131
in-0.130
 forest-0.127
Token something
Token ID1223
Feature activation-1.56e+00

Pos logit contributions
 sudden+0.019
 nowhere+0.018
 year+0.018
 neighbourhood+0.018
ud+0.017
Neg logit contributions
 give-0.021
 share-0.020
 listen-0.019
 leave-0.019
 ask-0.018
Token nice
Token ID3621
Feature activation-6.75e-01

Pos logit contributions
 give+0.079
 share+0.075
 listen+0.074
 leave+0.074
 wait+0.072
Neg logit contributions
 nowhere-0.048
 year-0.047
 sudden-0.045
 seas-0.044
 neighbourhood-0.043
TokenOnce
Token ID7454
Feature activation+2.90e+00

Pos logit contributions
 sudden+0.136
 seas+0.123
eh+0.120
 nowhere+0.117
by+0.111
Neg logit contributions
 give-0.116
 I-0.116
 share-0.109
 ask-0.107
 listen-0.099
Token upon
Token ID2402
Feature activation+1.57e+00

Pos logit contributions
 sudden+0.103
 snowball+0.093
Ah+0.091
eh+0.091
 woods+0.088
Neg logit contributions
 give-0.145
 please-0.137
 ask-0.137
 leave-0.135
 I-0.135
Token a
Token ID257
Feature activation+2.30e+00

Pos logit contributions
icer+0.077
 neighbourhood+0.077
ud+0.076
 sudden+0.075
actus+0.073
Neg logit contributions
 listen-0.056
 give-0.056
 share-0.053
 wait-0.049
 ask-0.046
Token time
Token ID640
Feature activation+2.35e-01

Pos logit contributions
 pieces+0.061
 woods+0.060
 seas+0.060
 nowhere+0.058
in+0.057
Neg logit contributions
 listen-0.129
 wait-0.126
 give-0.125
 ask-0.116
 share-0.116
Token,
Token ID11
Feature activation+1.52e+00

Pos logit contributions
 sudden+0.012
 nowhere+0.011
 neighbourhood+0.011
 year+0.011
 woods+0.011
Neg logit contributions
 give-0.008
 share-0.007
 listen-0.007
 leave-0.007
 please-0.007
Token there
Token ID612
Feature activation+2.72e+00

Pos logit contributions
actus+0.073
 neighbourhood+0.072
ud+0.071
smith+0.071
 seas+0.069
Neg logit contributions
 give-0.053
 share-0.051
 please-0.050
 I-0.050
 listen-0.050
Token was
Token ID373
Feature activation+2.28e+00

Pos logit contributions
eh+0.117
ymes+0.110
 sudden+0.108
 seas+0.104
actus+0.099
Neg logit contributions
 give-0.122
 share-0.114
 leave-0.108
 I-0.105
 listen-0.103
Token a
Token ID257
Feature activation+5.13e-01

Pos logit contributions
actus+0.129
arrow+0.127
 neighbourhood+0.126
icer+0.126
ud+0.125
Neg logit contributions
 listen-0.053
 give-0.053
 share-0.051
 wait-0.047
 I-0.041
Token nos
Token ID43630
Feature activation+1.07e+00

Pos logit contributions
 nowhere+0.015
 sudden+0.012
 year+0.012
ud+0.012
 neighbourhood+0.012
Neg logit contributions
 give-0.030
 share-0.028
 listen-0.028
 wait-0.027
 ask-0.027
Tokeny
Token ID88
Feature activation+8.71e-01

Pos logit contributions
 sudden+0.025
 nowhere+0.023
 neighbourhood+0.021
 seas+0.019
 woods+0.018
Neg logit contributions
 give-0.066
 share-0.062
 listen-0.061
 leave-0.061
 ask-0.059
Token bear
Token ID6842
Feature activation+1.47e-01

Pos logit contributions
 nowhere+0.027
 sudden+0.026
 neighbourhood+0.024
icer+0.023
 pieces+0.022
Neg logit contributions
 give-0.049
 listen-0.045
 share-0.044
 wait-0.043
 leave-0.043
Token cake
Token ID12187
Feature activation-4.88e-02

Pos logit contributions
 nowhere+0.044
 sudden+0.041
ud+0.039
 woods+0.037
icer+0.037
Neg logit contributions
 give-0.045
 share-0.042
 listen-0.041
 wait-0.041
 leave-0.039
Token,
Token ID11
Feature activation+5.97e-02

Pos logit contributions
 give+0.002
 share+0.002
 listen+0.002
 leave+0.002
 wait+0.002
Neg logit contributions
 nowhere-0.002
 sudden-0.002
 year-0.002
 neighbourhood-0.002
 woods-0.002
Token and
Token ID290
Feature activation+4.62e-01

Pos logit contributions
 nowhere+0.002
 sudden+0.002
 year+0.002
 neighbourhood+0.002
 woods+0.002
Neg logit contributions
 give-0.003
 share-0.003
 listen-0.003
 leave-0.003
 wait-0.003
Token smiled
Token ID13541
Feature activation-1.00e+00

Pos logit contributions
 nowhere+0.020
icer+0.019
 sudden+0.019
 year+0.019
 neighbourhood+0.019
Neg logit contributions
 give-0.020
 share-0.019
 listen-0.018
 wait-0.018
 leave-0.018
Token.
Token ID13
Feature activation+8.17e-01

Pos logit contributions
 give+0.045
 share+0.042
 listen+0.041
 leave+0.040
 wait+0.040
Neg logit contributions
 nowhere-0.040
 sudden-0.039
 year-0.038
 neighbourhood-0.037
 woods-0.036
Token<|endoftext|>
Token ID50256
Feature activation+3.84e+00

Pos logit contributions
 sudden+0.034
 nowhere+0.034
 seas+0.033
icer+0.031
 year+0.031
Neg logit contributions
 give-0.035
 share-0.033
 listen-0.031
 ask-0.031
 I-0.030
TokenOnce
Token ID7454
Feature activation+3.14e+00

Pos logit contributions
 sudden+0.177
eh+0.160
 seas+0.151
 nowhere+0.147
 tant+0.147
Neg logit contributions
 I-0.145
 give-0.135
 ask-0.126
 share-0.126
 You-0.118
Token upon
Token ID2402
Feature activation+2.31e+00

Pos logit contributions
 sudden+0.112
 snowball+0.100
ymes+0.099
eh+0.098
Ah+0.096
Neg logit contributions
 give-0.159
 please-0.151
 leave-0.148
 I-0.146
 ask-0.146
Token a
Token ID257
Feature activation+2.85e+00

Pos logit contributions
icer+0.127
 neighbourhood+0.126
ud+0.124
 sudden+0.123
actus+0.122
Neg logit contributions
 listen-0.076
 give-0.072
 share-0.068
 wait-0.062
 ask-0.057
Token time
Token ID640
Feature activation+9.94e-01

Pos logit contributions
 pieces+0.088
 seas+0.086
 woods+0.082
ymes+0.082
in+0.082
Neg logit contributions
 listen-0.150
 wait-0.144
 give-0.141
 please-0.132
 ask-0.132
Token there
Token ID612
Feature activation+3.33e+00

Pos logit contributions
 sudden+0.055
 neighbourhood+0.052
 nowhere+0.052
 woods+0.051
 pieces+0.050
Neg logit contributions
 give-0.030
 listen-0.026
 share-0.026
 please-0.026
 leave-0.025
Token made
Token ID925
Feature activation+1.77e+00

Pos logit contributions
 give+0.016
 share+0.015
 listen+0.014
 leave+0.014
 wait+0.014
Neg logit contributions
 nowhere-0.013
 sudden-0.013
 year-0.013
 neighbourhood-0.012
 woods-0.012
Token something
Token ID1223
Feature activation-1.33e-01

Pos logit contributions
 nowhere+0.090
 year+0.087
icer+0.083
orn+0.081
eh+0.080
Neg logit contributions
 give-0.067
 share-0.063
 listen-0.061
 ask-0.057
 leave-0.056
Token really
Token ID1107
Feature activation-1.26e+00

Pos logit contributions
 give+0.008
 share+0.007
 listen+0.007
 leave+0.007
 wait+0.007
Neg logit contributions
 nowhere-0.004
 sudden-0.004
 year-0.004
 neighbourhood-0.003
 woods-0.003
Token special
Token ID2041
Feature activation-2.08e-01

Pos logit contributions
 give+0.065
 share+0.062
 leave+0.061
 listen+0.060
 ask+0.058
Neg logit contributions
 nowhere-0.040
 sudden-0.038
 year-0.038
 neighbourhood-0.035
 seas-0.034
Token.
Token ID13
Feature activation+9.19e-01

Pos logit contributions
 give+0.010
 share+0.009
 listen+0.009
 leave+0.009
 wait+0.009
Neg logit contributions
 sudden-0.008
 nowhere-0.008
 year-0.007
 neighbourhood-0.007
 woods-0.007
Token<|endoftext|>
Token ID50256
Feature activation+3.80e+00

Pos logit contributions
 sudden+0.041
 nowhere+0.041
 seas+0.040
 year+0.040
in+0.039
Neg logit contributions
 give-0.036
 share-0.034
 ask-0.032
 I-0.031
 listen-0.031
TokenSam
Token ID16305
Feature activation+2.35e+00

Pos logit contributions
 sudden+0.209
eh+0.182
 seas+0.178
 nowhere+0.169
 tant+0.168
Neg logit contributions
 I-0.117
 give-0.111
 share-0.107
 ask-0.098
 You-0.090
Token was
Token ID373
Feature activation+1.19e+00

Pos logit contributions
 sudden+0.108
 seas+0.094
 nowhere+0.093
ymes+0.089
 year+0.084
Neg logit contributions
 give-0.110
 share-0.106
 ask-0.104
 leave-0.102
 listen-0.098
Token a
Token ID257
Feature activation+8.15e-01

Pos logit contributions
 seas+0.051
ymes+0.051
 year+0.051
actus+0.050
 sudden+0.050
Neg logit contributions
 give-0.045
 share-0.042
 ask-0.041
 listen-0.040
 wait-0.040
Token curious
Token ID11040
Feature activation+1.21e+00

Pos logit contributions
 nowhere+0.029
 sudden+0.026
 pieces+0.025
 year+0.025
awed+0.025
Neg logit contributions
 give-0.041
 share-0.038
 listen-0.038
 wait-0.038
 ask-0.037
Token 3
Token ID513
Feature activation+1.70e+00

Pos logit contributions
 nowhere+0.037
 sudden+0.037
awed+0.032
icer+0.031
 neighbourhood+0.031
Neg logit contributions
 give-0.069
 listen-0.064
 leave-0.064
 share-0.062
 wait-0.062
Token after
Token ID706
Feature activation+7.46e-01

Pos logit contributions
 give+0.022
 share+0.020
 listen+0.019
 leave+0.018
 wait+0.018
Neg logit contributions
 nowhere-0.035
 sudden-0.035
 year-0.034
 neighbourhood-0.033
 woods-0.033
Token!
Token ID0
Feature activation+1.13e+00

Pos logit contributions
icer+0.033
 sudden+0.032
 nowhere+0.031
ud+0.031
 neighbourhood+0.030
Neg logit contributions
 give-0.031
 share-0.029
 listen-0.028
 leave-0.027
 wait-0.026
Token<|endoftext|>
Token ID50256
Feature activation+3.40e+00

Pos logit contributions
 sudden+0.041
 seas+0.039
 nowhere+0.038
 year+0.037
 pieces+0.036
Neg logit contributions
 give-0.056
 share-0.053
 listen-0.051
 ask-0.050
 I-0.050
TokenOnce
Token ID7454
Feature activation+3.46e+00

Pos logit contributions
 sudden+0.165
eh+0.144
 seas+0.143
 tant+0.138
 nowhere+0.136
Neg logit contributions
 I-0.120
 give-0.119
 share-0.117
 ask-0.109
 listen-0.101
Token upon
Token ID2402
Feature activation+2.20e+00

Pos logit contributions
 sudden+0.128
 snowball+0.121
rob+0.117
Ah+0.117
ymes+0.113
Neg logit contributions
 give-0.160
 please-0.153
 leave-0.150
 I-0.148
 ask-0.147
Token a
Token ID257
Feature activation+3.06e+00

Pos logit contributions
icer+0.119
ud+0.117
 neighbourhood+0.117
 sudden+0.116
actus+0.111
Neg logit contributions
 listen-0.071
 give-0.068
 share-0.066
 wait-0.059
 ask-0.055
Token time
Token ID640
Feature activation+3.12e-01

Pos logit contributions
 pieces+0.096
 seas+0.095
 nowhere+0.090
 woods+0.089
 doorway+0.088
Neg logit contributions
 listen-0.152
 wait-0.144
 give-0.141
 share-0.137
 ask-0.137
Token there
Token ID612
Feature activation+3.47e+00

Pos logit contributions
 sudden+0.015
 nowhere+0.015
 neighbourhood+0.015
 year+0.015
 woods+0.014
Neg logit contributions
 give-0.011
 share-0.010
 listen-0.010
 leave-0.009
 wait-0.009
Token was
Token ID373
Feature activation+1.75e+00

Pos logit contributions
eh+0.165
ymes+0.163
 sudden+0.157
 seas+0.153
actus+0.144
Neg logit contributions
 give-0.137
 share-0.124
 I-0.120
 leave-0.120
 don-0.116
Token a
Token ID257
Feature activation+1.05e+00

Pos logit contributions
 neighbourhood+0.084
icer+0.083
actus+0.083
 year+0.082
arrow+0.081
Neg logit contributions
 listen-0.054
 share-0.053
 give-0.052
 wait-0.052
 ask-0.045
Token lady
Token ID10846
Feature activation+1.88e-01

Pos logit contributions
 nowhere+0.033
ud+0.028
awed+0.028
 pieces+0.028
 sudden+0.028
Neg logit contributions
 give-0.057
 share-0.054
 listen-0.054
 ask-0.052
 wait-0.052
Token will
Token ID481
Feature activation-2.65e+00

Pos logit contributions
 give+0.028
 share+0.026
 listen+0.025
 leave+0.025
 wait+0.024
Neg logit contributions
 nowhere-0.021
 sudden-0.021
 year-0.020
 neighbourhood-0.019
 woods-0.019
Token be
Token ID307
Feature activation+1.84e-01

Pos logit contributions
 Sorry+0.075
 give+0.074
 share+0.071
ourage+0.067
 leave+0.067
Neg logit contributions
 year-0.125
<|endoftext|>-0.118
 time-0.118
 nowhere-0.117
 world-0.116
Token back
Token ID736
Feature activation-3.64e-01

Pos logit contributions
 year+0.008
 nowhere+0.008
 sudden+0.008
 neighbourhood+0.008
 woods+0.008
Neg logit contributions
 give-0.007
 share-0.007
 listen-0.007
 leave-0.007
 ask-0.006
Token soon
Token ID2582
Feature activation-1.30e+00

Pos logit contributions
 give+0.015
 share+0.014
 listen+0.014
 leave+0.014
 wait+0.013
Neg logit contributions
 nowhere-0.016
 sudden-0.015
 year-0.015
 neighbourhood-0.015
 woods-0.014
Token".
Token ID1911
Feature activation+3.39e-01

Pos logit contributions
 give+0.054
 share+0.051
 listen+0.049
 leave+0.049
 wait+0.047
Neg logit contributions
 nowhere-0.054
 year-0.053
 sudden-0.052
 neighbourhood-0.050
 woods-0.049
Token
Token ID198
Feature activation+3.46e+00

Pos logit contributions
 nowhere+0.012
 sudden+0.012
 year+0.012
 seas+0.011
eh+0.011
Neg logit contributions
 give-0.017
 share-0.016
 ask-0.015
 listen-0.015
 leave-0.015
Token
Token ID198
Feature activation+2.06e+00

Pos logit contributions
 seas+0.159
icer+0.140
 pieces+0.139
 nowhere+0.139
 woods+0.138
Neg logit contributions
 ask-0.126
 listen-0.123
 give-0.121
 share-0.118
 thank-0.115
TokenSure
Token ID19457
Feature activation+4.23e-01

Pos logit contributions
 seas+0.078
 nowhere+0.077
 woods+0.074
actus+0.072
 pieces+0.070
Neg logit contributions
 listen-0.090
 share-0.087
 give-0.086
 ask-0.083
 wait-0.083
Token enough
Token ID1576
Feature activation-7.12e-01

Pos logit contributions
 nowhere+0.018
 neighbourhood+0.018
 sudden+0.018
 woods+0.018
 neighborhood+0.017
Neg logit contributions
 give-0.018
 wait-0.017
 listen-0.017
 share-0.016
 leave-0.016
Token,
Token ID11
Feature activation-2.10e-01

Pos logit contributions
 give+0.030
 share+0.028
 listen+0.027
 leave+0.027
 wait+0.026
Neg logit contributions
 nowhere-0.030
 sudden-0.029
 year-0.029
 neighbourhood-0.028
 woods-0.027
Token the
Token ID262
Feature activation+1.69e+00

Pos logit contributions
 give+0.010
 share+0.009
 listen+0.009
 leave+0.009
 wait+0.009
Neg logit contributions
 nowhere-0.008
 sudden-0.008
 year-0.008
 neighbourhood-0.007
 woods-0.007
Token,
Token ID11
Feature activation-2.14e-01

Pos logit contributions
 give+0.012
 share+0.011
 listen+0.011
 leave+0.011
 wait+0.010
Neg logit contributions
 nowhere-0.012
 sudden-0.012
 year-0.011
 neighbourhood-0.011
 woods-0.011
Token and
Token ID290
Feature activation-7.93e-01

Pos logit contributions
 give+0.011
 share+0.010
 listen+0.010
 leave+0.010
 wait+0.010
Neg logit contributions
 nowhere-0.007
 sudden-0.007
 year-0.007
 neighbourhood-0.007
 woods-0.006
Token planes
Token ID13016
Feature activation-4.28e-01

Pos logit contributions
 give+0.044
 share+0.042
 listen+0.041
 leave+0.040
 wait+0.040
Neg logit contributions
 nowhere-0.023
 sudden-0.022
 year-0.022
 neighbourhood-0.021
 woods-0.020
Token.
Token ID13
Feature activation-8.36e-01

Pos logit contributions
 give+0.021
 share+0.020
 listen+0.019
 leave+0.019
 wait+0.018
Neg logit contributions
 nowhere-0.016
 sudden-0.016
 year-0.015
 neighbourhood-0.015
 woods-0.014
Token She
Token ID1375
Feature activation+7.20e-01

Pos logit contributions
 give+0.031
 share+0.029
 listen+0.028
 leave+0.027
 wait+0.027
Neg logit contributions
 nowhere-0.040
 sudden-0.039
 year-0.038
 neighbourhood-0.037
 woods-0.036
Token liked
Token ID8288
Feature activation+7.98e-01

Pos logit contributions
 sudden+0.028
 nowhere+0.028
 year+0.026
icer+0.026
 seas+0.026
Neg logit contributions
 give-0.034
 share-0.032
 listen-0.031
 ask-0.031
 leave-0.030
Token stories
Token ID3923
Feature activation-1.01e+00

Pos logit contributions
 nowhere+0.038
 sudden+0.038
icer+0.037
 year+0.037
 neighbourhood+0.036
Neg logit contributions
 give-0.031
 share-0.029
 listen-0.027
 leave-0.026
 ask-0.026
Token that
Token ID326
Feature activation-2.03e-01

Pos logit contributions
 give+0.046
 share+0.043
 listen+0.042
 leave+0.042
 wait+0.041
Neg logit contributions
 nowhere-0.039
 year-0.037
 sudden-0.037
 neighbourhood-0.035
 woods-0.035
Token made
Token ID925
Feature activation+5.57e-01

Pos logit contributions
 give+0.009
 share+0.009
 listen+0.008
 leave+0.008
 wait+0.008
Neg logit contributions
 nowhere-0.008
 sudden-0.008
 year-0.008
 neighbourhood-0.007
icer-0.007
Token her
Token ID607
Feature activation+8.86e-01

Pos logit contributions
 nowhere+0.024
 sudden+0.023
 year+0.022
 neighbourhood+0.022
icer+0.021
Neg logit contributions
 give-0.025
 share-0.024
 listen-0.023
 leave-0.022
 ask-0.022
Token laugh
Token ID6487
Feature activation-3.22e-01

Pos logit contributions
 nowhere+0.041
ud+0.041
 sudden+0.040
eh+0.039
 year+0.039
Neg logit contributions
 give-0.037
 share-0.034
 leave-0.034
 listen-0.033
 ask-0.031
Token still
Token ID991
Feature activation+1.38e+00

Pos logit contributions
 sudden+0.017
 seas+0.017
 year+0.017
icer+0.017
 nowhere+0.017
Neg logit contributions
 give-0.016
 share-0.015
 leave-0.015
 listen-0.015
 wait-0.014
Token happy
Token ID3772
Feature activation+1.11e-01

Pos logit contributions
icer+0.061
 seas+0.059
 year+0.058
 sudden+0.057
ud+0.057
Neg logit contributions
 give-0.055
 leave-0.050
 share-0.049
 listen-0.048
 ask-0.046
Token from
Token ID422
Feature activation-3.41e-01

Pos logit contributions
 nowhere+0.005
 sudden+0.005
icer+0.004
 year+0.004
 neighbourhood+0.004
Neg logit contributions
 give-0.005
 share-0.005
 listen-0.004
 leave-0.004
 wait-0.004
Token seeing
Token ID4379
Feature activation+9.58e-01

Pos logit contributions
 give+0.015
 share+0.014
 listen+0.014
 leave+0.013
 wait+0.013
Neg logit contributions
 nowhere-0.014
 sudden-0.014
 year-0.014
 neighbourhood-0.013
 woods-0.013
Token the
Token ID262
Feature activation+3.24e+00

Pos logit contributions
 sudden+0.045
 year+0.044
 nowhere+0.043
 neighbourhood+0.042
icer+0.042
Neg logit contributions
 give-0.039
 share-0.035
 leave-0.034
 listen-0.034
 ask-0.032
Token bird
Token ID6512
Feature activation+8.85e-01

Pos logit contributions
awed+0.129
ud+0.128
 nowhere+0.125
rones+0.116
Le+0.116
Neg logit contributions
 give-0.146
 leave-0.133
 listen-0.128
 share-0.127
 wait-0.125
Token display
Token ID3359
Feature activation+1.77e+00

Pos logit contributions
 nowhere+0.036
 sudden+0.036
 neighbourhood+0.035
ud+0.034
icer+0.034
Neg logit contributions
 give-0.041
 share-0.038
 leave-0.037
 listen-0.036
 wait-0.034
Token its
Token ID663
Feature activation+1.55e+00

Pos logit contributions
 sudden+0.084
icer+0.077
 nowhere+0.076
 neighbourhood+0.074
ud+0.073
Neg logit contributions
 give-0.073
 share-0.069
 leave-0.067
 listen-0.065
 please-0.061
Token feathers
Token ID27400
Feature activation+2.08e-01

Pos logit contributions
ud+0.059
 nowhere+0.057
 sudden+0.055
icer+0.054
in+0.053
Neg logit contributions
 give-0.075
 share-0.070
 leave-0.069
 listen-0.067
 wait-0.065
Token.
Token ID13
Feature activation+6.93e-01

Pos logit contributions
 sudden+0.008
 nowhere+0.008
 year+0.008
 neighbourhood+0.008
 woods+0.007
Neg logit contributions
 give-0.010
 share-0.009
 listen-0.009
 leave-0.009
 wait-0.008
Token When
Token ID1649
Feature activation-6.49e-01

Pos logit contributions
 nowhere+0.031
 sudden+0.031
 seas+0.030
 year+0.030
in+0.029
Neg logit contributions
 give-0.027
 share-0.025
 ask-0.025
 listen-0.024
 I-0.024
Token the
Token ID262
Feature activation+1.64e+00

Pos logit contributions
 nowhere+0.039
 neighbourhood+0.038
 year+0.037
 neighborhood+0.036
eh+0.036
Neg logit contributions
 give-0.040
 share-0.038
 listen-0.035
 leave-0.034
 wait-0.034
Token batter
Token ID4984
Feature activation-1.13e+00

Pos logit contributions
 nowhere+0.068
 year+0.060
ud+0.059
 sudden+0.058
awed+0.058
Neg logit contributions
 give-0.079
 share-0.074
 wait-0.072
 listen-0.068
 leave-0.068
Token.
Token ID13
Feature activation-3.10e-02

Pos logit contributions
 give+0.057
 share+0.054
 listen+0.053
 leave+0.052
 ask+0.051
Neg logit contributions
 nowhere-0.037
 year-0.036
 sudden-0.036
 neighbourhood-0.034
 seas-0.033
Token They
Token ID1119
Feature activation-2.22e+00

Pos logit contributions
 give+0.001
 share+0.001
 listen+0.001
 ask+0.001
 leave+0.001
Neg logit contributions
 nowhere-0.002
 year-0.002
 sudden-0.002
 neighbourhood-0.002
in-0.002
Token put
Token ID1234
Feature activation+6.56e-01

Pos logit contributions
 give+0.068
 Sorry+0.067
 leave+0.063
 listen+0.063
ourage+0.062
Neg logit contributions
 year-0.097
 sudden-0.097
 nowhere-0.096
 pieces-0.096
in-0.093
Token the
Token ID262
Feature activation+1.52e+00

Pos logit contributions
 neighbourhood+0.030
 year+0.029
 sudden+0.029
 nowhere+0.029
 neighborhood+0.028
Neg logit contributions
 give-0.028
 share-0.026
 listen-0.024
 leave-0.024
 wait-0.024
Token cups
Token ID14180
Feature activation-1.19e+00

Pos logit contributions
 nowhere+0.066
jo+0.062
 neighbourhood+0.062
ud+0.062
 sudden+0.059
Neg logit contributions
 give-0.068
 wait-0.064
 share-0.062
 listen-0.059
 leave-0.056
Token in
Token ID287
Feature activation+8.10e-01

Pos logit contributions
 give+0.062
 share+0.059
 listen+0.058
 leave+0.057
 ask+0.056
Neg logit contributions
 nowhere-0.038
 year-0.036
 sudden-0.036
 pieces-0.033
 seas-0.033
Token a
Token ID257
Feature activation+9.34e-01

Pos logit contributions
 neighbourhood+0.037
eh+0.036
 nowhere+0.036
 sudden+0.035
 neighborhood+0.035
Neg logit contributions
 give-0.033
 share-0.030
 listen-0.028
 wait-0.027
 leave-0.027
Token tray
Token ID26473
Feature activation+1.68e-02

Pos logit contributions
 nowhere+0.031
 neighbourhood+0.027
ud+0.027
 sudden+0.027
 year+0.026
Neg logit contributions
 give-0.051
 share-0.049
 wait-0.046
 listen-0.045
 please-0.045
Token.
Token ID13
Feature activation-4.42e-01

Pos logit contributions
 sudden+0.001
 nowhere+0.001
 neighbourhood+0.001
 year+0.001
 woods+0.001
Neg logit contributions
 give-0.001
 share-0.001
 listen-0.001
 leave-0.001
 wait-0.001
Token.
Token ID13
Feature activation-3.71e+00

Pos logit contributions
 give+0.033
 share+0.031
 listen+0.030
 leave+0.030
 wait+0.029
Neg logit contributions
 nowhere-0.031
 year-0.030
 sudden-0.030
 neighbourhood-0.029
 woods-0.028
Token We
Token ID775
Feature activation-1.49e+00

Pos logit contributions
 give+0.142
 share+0.135
 listen+0.135
 wait+0.125
 leave+0.124
Neg logit contributions
icer-0.148
 neighbourhood-0.141
 nowhere-0.139
ud-0.138
 woods-0.138
Token don
Token ID836
Feature activation-3.58e+00

Pos logit contributions
 give+0.044
 share+0.040
 listen+0.039
 leave+0.038
 Sorry+0.037
Neg logit contributions
 nowhere-0.076
 year-0.075
 sudden-0.072
 neighbourhood-0.071
<|endoftext|>-0.070
Token't
Token ID470
Feature activation-2.71e+00

Pos logit contributions
 give+0.145
 wait+0.145
 leave+0.144
 share+0.141
 ask+0.140
Neg logit contributions
 year-0.145
<|endoftext|>-0.139
 nowhere-0.130
in-0.121
 pieces-0.120
Token have
Token ID423
Feature activation-4.36e-01

Pos logit contributions
 Sorry+0.072
 give+0.065
 leave+0.060
 share+0.060
ured+0.058
Neg logit contributions
 year-0.143
 time-0.135
 world-0.128
 neighbourhood-0.125
 nowhere-0.125
Token a
Token ID257
Feature activation-1.51e-01

Pos logit contributions
 give+0.019
 share+0.018
 listen+0.018
 leave+0.017
 wait+0.017
Neg logit contributions
 nowhere-0.018
 sudden-0.017
 year-0.017
 neighbourhood-0.017
 woods-0.016
Token bigger
Token ID5749
Feature activation-3.32e-01

Pos logit contributions
 give+0.007
 share+0.007
 listen+0.007
 leave+0.007
 wait+0.006
Neg logit contributions
 nowhere-0.006
 sudden-0.005
 year-0.005
 neighbourhood-0.005
 woods-0.005
Token slide
Token ID10649
Feature activation-8.25e-01

Pos logit contributions
 give+0.016
 share+0.015
 listen+0.015
 leave+0.014
 wait+0.014
Neg logit contributions
 nowhere-0.013
 sudden-0.012
 year-0.012
 neighbourhood-0.011
 woods-0.011
Token.
Token ID13
Feature activation-3.77e+00

Pos logit contributions
 give+0.040
 share+0.037
 listen+0.037
 leave+0.036
 wait+0.035
Neg logit contributions
 nowhere-0.030
 year-0.029
 sudden-0.029
 neighbourhood-0.027
 woods-0.027
Token But
Token ID887
Feature activation-2.34e+00

Pos logit contributions
 give+0.151
 listen+0.141
 share+0.141
 leave+0.131
 wait+0.127
Neg logit contributions
icer-0.136
 neighbourhood-0.131
 woods-0.129
ud-0.129
 snowball-0.129
Token we
Token ID356
Feature activation-2.17e+00

Pos logit contributions
 give+0.069
 share+0.065
 leave+0.060
 listen+0.060
 ask+0.056
Neg logit contributions
 nowhere-0.116
 year-0.110
 sudden-0.108
 seas-0.107
 pieces-0.104
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
 give+0.057
 share+0.055
 listen+0.053
 leave+0.053
 ask+0.052
Neg logit contributions
 nowhere-0.033
 year-0.030
 sudden-0.030
 woods-0.027
 neighbourhood-0.027
Token
Token ID198
Feature activation+2.18e+00

Pos logit contributions
 give+0.078
 wait+0.075
 listen+0.075
 leave+0.075
 share+0.074
Neg logit contributions
 nowhere-0.043
 sudden-0.042
<|endoftext|>-0.041
 year-0.039
 woods-0.039
Token
Token ID198
Feature activation-1.27e+00

Pos logit contributions
icer+0.099
 nowhere+0.097
 seas+0.097
 sudden+0.095
 neighbourhood+0.095
Neg logit contributions
 give-0.084
 share-0.083
 listen-0.081
 ask-0.079
 thank-0.069
Token"
Token ID1
Feature activation-1.70e-01

Pos logit contributions
 give+0.072
 leave+0.068
 share+0.066
 listen+0.066
 wait+0.065
Neg logit contributions
 sudden-0.036
 year-0.036
 nowhere-0.035
 neighbourhood-0.031
 woods-0.031
TokenThank
Token ID10449
Feature activation-1.43e+00

Pos logit contributions
 give+0.006
 share+0.006
 listen+0.006
 wait+0.005
 leave+0.005
Neg logit contributions
 nowhere-0.008
 year-0.008
 sudden-0.008
 neighbourhood-0.008
 woods-0.007
Token you
Token ID345
Feature activation-3.45e-02

Pos logit contributions
 give+0.046
 leave+0.041
 share+0.040
 listen+0.039
 ask+0.038
Neg logit contributions
 nowhere-0.075
 year-0.073
 sudden-0.071
in-0.071
<|endoftext|>-0.068
Token,
Token ID11
Feature activation-5.96e-01

Pos logit contributions
 give+0.001
 share+0.001
 listen+0.001
 leave+0.001
 wait+0.001
Neg logit contributions
 nowhere-0.002
 sudden-0.002
 neighbourhood-0.002
 woods-0.002
 year-0.002
Token Lily
Token ID20037
Feature activation+9.55e-02

Pos logit contributions
 give+0.027
 share+0.026
 listen+0.025
 leave+0.025
 wait+0.024
Neg logit contributions
 nowhere-0.024
 sudden-0.023
 year-0.022
 neighbourhood-0.022
 woods-0.021
Token,"
Token ID553
Feature activation+8.47e-02

Pos logit contributions
 sudden+0.005
 nowhere+0.005
 neighbourhood+0.004
 year+0.004
 woods+0.004
Neg logit contributions
 give-0.003
 share-0.003
 listen-0.003
 leave-0.003
 ask-0.003
Token said
Token ID531
Feature activation-7.49e-01

Pos logit contributions
 nowhere+0.003
 year+0.003
 neighbourhood+0.003
 sudden+0.003
ud+0.003
Neg logit contributions
 give-0.004
 share-0.004
 listen-0.004
 ask-0.004
 leave-0.004
Token the
Token ID262
Feature activation+1.42e+00

Pos logit contributions
 give+0.036
 share+0.034
 listen+0.033
 leave+0.033
 wait+0.032
Neg logit contributions
 nowhere-0.027
 sudden-0.026
 year-0.026
 neighbourhood-0.024
 woods-0.024
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
 sudden+0.029
 nowhere+0.027
 neighbourhood+0.026
 year+0.025
icer+0.025
Neg logit contributions
 give-0.028
 share-0.026
 leave-0.025
 listen-0.025
 wait-0.024
Token make
Token ID787
Feature activation+1.05e+00

Pos logit contributions
 give+0.051
 share+0.047
 listen+0.046
 leave+0.046
 wait+0.045
Neg logit contributions
 nowhere-0.043
 sudden-0.041
 year-0.040
 woods-0.039
 neighbourhood-0.039
Token it
Token ID340
Feature activation-1.81e+00

Pos logit contributions
 nowhere+0.049
 year+0.046
 neighbourhood+0.046
 sudden+0.045
ud+0.044
Neg logit contributions
 give-0.046
 share-0.043
 listen-0.040
 leave-0.039
 ask-0.039
Token go
Token ID467
Feature activation+4.16e-02

Pos logit contributions
 give+0.068
 share+0.062
 listen+0.061
 wait+0.060
 ask+0.060
Neg logit contributions
 nowhere-0.074
 year-0.073
 sudden-0.072
 seas-0.071
 woods-0.070
Token fast
Token ID3049
Feature activation+5.39e-01

Pos logit contributions
 nowhere+0.002
 sudden+0.002
 neighbourhood+0.002
 year+0.002
 woods+0.002
Neg logit contributions
 give-0.002
 share-0.002
 leave-0.002
 listen-0.002
 wait-0.002
Token.
Token ID13
Feature activation-4.71e-01

Pos logit contributions
 sudden+0.023
 nowhere+0.022
 neighbourhood+0.021
 year+0.021
eh+0.020
Neg logit contributions
 give-0.024
 share-0.023
 listen-0.022
 leave-0.022
 wait-0.021
Token He
Token ID679
Feature activation+8.85e-01

Pos logit contributions
 give+0.020
 share+0.018
 listen+0.018
 leave+0.017
 ask+0.017
Neg logit contributions
 nowhere-0.021
 sudden-0.020
 year-0.020
 neighbourhood-0.019
 woods-0.019
Token thought
Token ID1807
Feature activation+1.95e-01

Pos logit contributions
 sudden+0.039
 nowhere+0.038
 neighbourhood+0.038
 year+0.037
icer+0.037
Neg logit contributions
 give-0.038
 share-0.035
 ask-0.033
 listen-0.033
 leave-0.033
Token maybe
Token ID3863
Feature activation-6.06e-01

Pos logit contributions
 sudden+0.009
 neighbourhood+0.009
 nowhere+0.009
 year+0.009
 woods+0.008
Neg logit contributions
 give-0.008
 share-0.007
 listen-0.007
 leave-0.007
 wait-0.006
Token Sara
Token ID24799
Feature activation+1.55e+00

Pos logit contributions
 give+0.027
 share+0.025
 listen+0.024
 leave+0.024
 wait+0.024
Neg logit contributions
 nowhere-0.025
 sudden-0.024
 year-0.024
 neighbourhood-0.023
 woods-0.022
Token would
Token ID561
Feature activation-1.60e+00

Pos logit contributions
 sudden+0.068
 nowhere+0.061
 seas+0.061
 neighbourhood+0.059
 year+0.058
Neg logit contributions
 give-0.071
 share-0.068
 leave-0.064
 listen-0.064
 wait-0.062
TokenThe
Token ID464
Feature activation+3.39e-01

Pos logit contributions
 nowhere+0.005
 year+0.005
 sudden+0.005
 neighbourhood+0.005
 woods+0.004
Neg logit contributions
 give-0.006
 share-0.006
 listen-0.006
 wait-0.005
 ask-0.005
Token pig
Token ID12967
Feature activation-8.88e-01

Pos logit contributions
 nowhere+0.013
 sudden+0.012
ud+0.012
awed+0.011
 year+0.011
Neg logit contributions
 give-0.017
 share-0.016
 wait-0.016
 listen-0.016
 leave-0.015
Tokengy
Token ID1360
Feature activation+7.74e-01

Pos logit contributions
 give+0.046
 share+0.043
 listen+0.042
 leave+0.042
 wait+0.041
Neg logit contributions
 nowhere-0.029
 sudden-0.028
 year-0.028
 neighbourhood-0.027
 woods-0.026
Token bank
Token ID3331
Feature activation-4.69e-01

Pos logit contributions
 nowhere+0.026
 sudden+0.024
 year+0.022
ud+0.022
 seas+0.022
Neg logit contributions
 give-0.042
 share-0.039
 listen-0.038
 leave-0.038
 wait-0.038
Token!"
Token ID2474
Feature activation-6.21e-01

Pos logit contributions
 give+0.023
 share+0.021
 listen+0.021
 leave+0.020
 wait+0.020
Neg logit contributions
 nowhere-0.018
 sudden-0.017
 year-0.016
 neighbourhood-0.016
 woods-0.016
Token said
Token ID531
Feature activation-4.09e-01

Pos logit contributions
 give+0.027
 share+0.025
 listen+0.025
 leave+0.024
 wait+0.024
Neg logit contributions
 nowhere-0.026
 sudden-0.025
 year-0.025
 neighbourhood-0.024
 woods-0.023
Token the
Token ID262
Feature activation+7.29e-01

Pos logit contributions
 give+0.018
 share+0.017
 listen+0.016
 leave+0.016
 wait+0.016
Neg logit contributions
 nowhere-0.017
 sudden-0.016
 year-0.016
 neighbourhood-0.016
 woods-0.015
Token little
Token ID1310
Feature activation+3.31e-01

Pos logit contributions
 nowhere+0.027
ud+0.026
awed+0.024
eh+0.024
ymes+0.024
Neg logit contributions
 give-0.039
 leave-0.035
 share-0.034
 ask-0.034
 wait-0.034
Token boy
Token ID2933
Feature activation-7.41e-01

Pos logit contributions
 nowhere+0.010
 sudden+0.009
 year+0.008
 neighbourhood+0.008
ud+0.008
Neg logit contributions
 give-0.020
 leave-0.018
 share-0.018
 ask-0.018
 listen-0.018
Token,
Token ID11
Feature activation-1.40e+00

Pos logit contributions
 give+0.036
 share+0.034
 listen+0.034
 leave+0.033
 wait+0.032
Neg logit contributions
 nowhere-0.026
 sudden-0.026
 year-0.025
 neighbourhood-0.024
 woods-0.023
Token proudly
Token ID23017
Feature activation-1.99e+00

Pos logit contributions
 give+0.061
 share+0.059
 listen+0.057
 leave+0.056
 wait+0.055
Neg logit contributions
 nowhere-0.056
 sudden-0.055
 year-0.053
 neighbourhood-0.050
 woods-0.048
Token waves
Token ID9813
Feature activation-8.29e-01

Pos logit contributions
 give+0.030
 share+0.029
 listen+0.028
 leave+0.028
 wait+0.027
Neg logit contributions
 nowhere-0.018
 sudden-0.018
 year-0.017
 neighbourhood-0.016
 woods-0.016
Token.
Token ID13
Feature activation-1.91e+00

Pos logit contributions
 give+0.043
 share+0.041
 listen+0.040
 leave+0.039
 wait+0.039
Neg logit contributions
 nowhere-0.027
 year-0.026
 sudden-0.026
 neighbourhood-0.024
 woods-0.024
Token After
Token ID2293
Feature activation-1.97e-01

Pos logit contributions
 give+0.075
 share+0.072
 leave+0.066
 listen+0.064
 wait+0.064
Neg logit contributions
 neighbourhood-0.083
 year-0.083
 woods-0.083
 nowhere-0.080
 neighborhood-0.078
Token a
Token ID257
Feature activation-8.76e-01

Pos logit contributions
 give+0.010
 share+0.009
 listen+0.009
 leave+0.008
 wait+0.008
Neg logit contributions
 nowhere-0.007
 sudden-0.007
 year-0.007
 neighbourhood-0.007
icer-0.007
Token while
Token ID981
Feature activation-1.02e+00

Pos logit contributions
 give+0.044
 share+0.041
 leave+0.040
 listen+0.040
 ask+0.038
Neg logit contributions
 sudden-0.031
 nowhere-0.030
 year-0.030
 neighbourhood-0.027
 woods-0.027
Token of
Token ID286
Feature activation-1.07e+00

Pos logit contributions
 give+0.046
 share+0.044
 listen+0.042
 leave+0.042
 ask+0.040
Neg logit contributions
 nowhere-0.040
 year-0.040
 sudden-0.039
 woods-0.037
 neighbourhood-0.036
Token ro
Token ID686
Feature activation-7.53e-01

Pos logit contributions
 give+0.053
 share+0.051
 listen+0.049
 leave+0.048
 wait+0.048
Neg logit contributions
 nowhere-0.038
 sudden-0.036
 year-0.035
 woods-0.034
 neighbourhood-0.033
Tokenwing
Token ID5469
Feature activation-2.85e-01

Pos logit contributions
 give+0.040
 share+0.038
 listen+0.037
 leave+0.037
 wait+0.036
Neg logit contributions
 nowhere-0.024
 year-0.023
 sudden-0.023
 neighbourhood-0.022
 woods-0.021
Token,
Token ID11
Feature activation-1.15e-01

Pos logit contributions
 give+0.011
 share+0.010
 listen+0.010
 leave+0.010
 wait+0.010
Neg logit contributions
 nowhere-0.013
 sudden-0.013
 year-0.012
 neighbourhood-0.012
 woods-0.012
Token she
Token ID673
Feature activation+9.36e-01

Pos logit contributions
 give+0.005
 listen+0.004
 share+0.004
 wait+0.004
 leave+0.004
Neg logit contributions
 nowhere-0.005
 sudden-0.005
 year-0.005
 neighbourhood-0.005
icer-0.005
Token reached
Token ID4251
Feature activation-4.89e-01

Pos logit contributions
icer+0.032
 year+0.031
 nowhere+0.030
 sudden+0.029
 snowball+0.029
Neg logit contributions
 give-0.047
 share-0.043
 wait-0.042
 leave-0.042
 listen-0.042
Token
Token ID198
Feature activation-7.40e-01

Pos logit contributions
 seas+0.054
 nowhere+0.054
 year+0.052
 sudden+0.051
icer+0.050
Neg logit contributions
 give-0.060
 ask-0.058
 share-0.057
 listen-0.057
 please-0.053
Token"
Token ID1
Feature activation-9.30e-01

Pos logit contributions
 give+0.035
 share+0.033
 leave+0.032
 listen+0.032
 wait+0.031
Neg logit contributions
 nowhere-0.028
 sudden-0.028
 year-0.027
 neighbourhood-0.026
 woods-0.025
TokenYes
Token ID5297
Feature activation-1.27e+00

Pos logit contributions
 give+0.048
 share+0.045
 leave+0.044
 listen+0.043
 wait+0.042
Neg logit contributions
 nowhere-0.031
 sudden-0.031
 year-0.029
 neighbourhood-0.028
 woods-0.027
Token,
Token ID11
Feature activation-1.15e+00

Pos logit contributions
 give+0.059
 share+0.057
 listen+0.055
 leave+0.053
 ask+0.053
Neg logit contributions
 year-0.046
 nowhere-0.044
 sudden-0.044
 seas-0.040
in-0.040
Token he
Token ID339
Feature activation-5.51e-01

Pos logit contributions
 give+0.046
 share+0.044
 leave+0.042
 listen+0.042
 ask+0.041
Neg logit contributions
 nowhere-0.049
 year-0.048
 sudden-0.047
 seas-0.045
<|endoftext|>-0.043
Token is
Token ID318
Feature activation-3.57e-01

Pos logit contributions
 give+0.029
 share+0.027
 listen+0.026
 leave+0.026
 wait+0.026
Neg logit contributions
 nowhere-0.018
 sudden-0.018
 year-0.017
 neighbourhood-0.016
 woods-0.016
Token.
Token ID13
Feature activation-3.37e+00

Pos logit contributions
 give+0.015
 share+0.014
 listen+0.013
 leave+0.013
 wait+0.013
Neg logit contributions
 nowhere-0.016
 sudden-0.016
 year-0.015
 neighbourhood-0.015
 woods-0.014
Token He
Token ID679
Feature activation-9.71e-01

Pos logit contributions
 give+0.146
 leave+0.136
 wait+0.136
 share+0.134
 listen+0.133
Neg logit contributions
 woods-0.121
-0.108
 forest-0.106
 snowball-0.105
<|endoftext|>-0.105
Token likes
Token ID7832
Feature activation+9.79e-01

Pos logit contributions
 give+0.052
 share+0.049
 listen+0.048
 leave+0.048
 wait+0.046
Neg logit contributions
 nowhere-0.029
 year-0.028
 sudden-0.028
 neighbourhood-0.027
 woods-0.026
Token the
Token ID262
Feature activation+2.06e+00

Pos logit contributions
 nowhere+0.047
 sudden+0.046
 neighbourhood+0.046
 year+0.044
icer+0.044
Neg logit contributions
 give-0.037
 share-0.035
 listen-0.034
 leave-0.032
 ask-0.032
Token snow
Token ID6729
Feature activation-3.07e-02

Pos logit contributions
 nowhere+0.089
awed+0.087
ud+0.082
 sudden+0.080
ase+0.076
Neg logit contributions
 give-0.093
 listen-0.093
 ask-0.088
 wait-0.086
 leave-0.085
Token They
Token ID1119
Feature activation-2.13e+00

Pos logit contributions
 give+0.014
 share+0.012
 listen+0.012
 leave+0.012
 ask+0.011
Neg logit contributions
 nowhere-0.021
 sudden-0.021
 year-0.021
 neighbourhood-0.020
 woods-0.020
Token hope
Token ID2911
Feature activation+2.61e-01

Pos logit contributions
 give+0.059
 Sorry+0.057
 share+0.056
 listen+0.053
ourage+0.053
Neg logit contributions
 nowhere-0.102
 woods-0.099
 neighbourhood-0.099
 year-0.098
 forest-0.096
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
 sudden+0.012
icer+0.012
 nowhere+0.012
eh+0.012
 year+0.012
Neg logit contributions
 give-0.011
 leave-0.010
 listen-0.010
 share-0.010
 wait-0.009
Token bear
Token ID6842
Feature activation+5.10e-02

Pos logit contributions
ud+0.044
 nowhere+0.042
awed+0.041
eh+0.039
 sudden+0.039
Neg logit contributions
 give-0.048
 ask-0.042
 share-0.042
 listen-0.041
 leave-0.041
Token will
Token ID481
Feature activation-1.30e+00

Pos logit contributions
 sudden+0.002
 nowhere+0.002
 year+0.002
 neighbourhood+0.002
 seas+0.002
Neg logit contributions
 give-0.002
 share-0.002
 listen-0.002
 leave-0.002
 wait-0.002
Token not
Token ID407
Feature activation-1.35e+00

Pos logit contributions
 give+0.036
 share+0.033
 listen+0.032
 leave+0.030
 Sorry+0.030
Neg logit contributions
 nowhere-0.071
 year-0.068
 sudden-0.066
 woods-0.066
 neighbourhood-0.065
Token find
Token ID1064
Feature activation+9.80e-01

Pos logit contributions
 give+0.040
 share+0.037
 listen+0.036
 Sorry+0.034
 leave+0.033
Neg logit contributions
 nowhere-0.069
 year-0.068
 sudden-0.067
 woods-0.065
 neighbourhood-0.065
Token them
Token ID606
Feature activation+4.64e-01

Pos logit contributions
 sudden+0.048
 neighbourhood+0.048
 year+0.047
 nowhere+0.046
 neighborhood+0.045
Neg logit contributions
 give-0.038
 share-0.035
 leave-0.034
 listen-0.033
 wait-0.033
Token.
Token ID13
Feature activation-4.98e-01

Pos logit contributions
 sudden+0.021
 nowhere+0.019
 neighbourhood+0.019
 year+0.018
icer+0.018
Neg logit contributions
 give-0.021
 share-0.020
 leave-0.019
 listen-0.019
 wait-0.018
Token
Token ID198
Feature activation+2.32e+00

Pos logit contributions
 give+0.023
 share+0.022
 listen+0.021
 leave+0.021
 wait+0.020
Neg logit contributions
 nowhere-0.019
 sudden-0.019
 year-0.019
 neighbourhood-0.018
 woods-0.017
Token
Token ID198
Feature activation+3.01e-01

Pos logit contributions
 seas+0.107
icer+0.106
 nowhere+0.104
eh+0.102
pit+0.101
Neg logit contributions
 give-0.088
 ask-0.085
 share-0.085
 listen-0.083
 pay-0.075
Token
Token ID198
Feature activation+3.83e-01

Pos logit contributions
 nowhere+0.038
 seas+0.037
icer+0.034
 pieces+0.034
 woods+0.033
Neg logit contributions
 give-0.051
 share-0.048
 listen-0.048
 ask-0.047
 please-0.046
TokenM
Token ID44
Feature activation-8.54e-02

Pos logit contributions
 nowhere+0.013
 seas+0.012
icer+0.011
 woods+0.011
 pieces+0.011
Neg logit contributions
 give-0.019
 share-0.019
 listen-0.019
 please-0.018
 ask-0.017
Tokenum
Token ID388
Feature activation+1.74e+00

Pos logit contributions
 give+0.004
 share+0.004
 listen+0.004
 leave+0.004
 wait+0.004
Neg logit contributions
 nowhere-0.003
 sudden-0.003
 neighbourhood-0.003
 year-0.003
 woods-0.003
Token said
Token ID531
Feature activation-8.52e-01

Pos logit contributions
 woods+0.074
 nowhere+0.074
 sudden+0.072
 neighbourhood+0.072
 seas+0.072
Neg logit contributions
 give-0.077
 listen-0.066
 share-0.066
 leave-0.064
 ask-0.063
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
 give+0.037
 share+0.035
 listen+0.033
 leave+0.033
 wait+0.032
Neg logit contributions
 nowhere-0.035
 sudden-0.034
 year-0.034
 neighbourhood-0.032
 woods-0.032
Token �
Token ID6184
Feature activation-3.45e+00

Pos logit contributions
 give+0.070
 share+0.068
 leave+0.066
 listen+0.064
 wait+0.063
Neg logit contributions
 nowhere-0.060
 sudden-0.059
 year-0.058
 neighbourhood-0.054
<|endoftext|>-0.053
Token�
Token ID95
Feature activation-2.52e+00

Pos logit contributions
 give+0.155
 share+0.148
 leave+0.147
 listen+0.136
 ask+0.131
Neg logit contributions
 year-0.120
 neighbourhood-0.113
in-0.111
 sudden-0.107
 area-0.105
Token€
Token ID26391
Feature activation-1.39e+00

Pos logit contributions
 give+0.119
 share+0.112
 leave+0.112
 listen+0.102
 Sorry+0.101
Neg logit contributions
 year-0.095
 neighbourhood-0.095
 area-0.090
 nowhere-0.090
 world-0.087
Token�
Token ID129
Feature activation-2.21e+00

Pos logit contributions
 give+0.067
 share+0.063
 leave+0.061
 listen+0.060
 wait+0.059
Neg logit contributions
 nowhere-0.051
 sudden-0.050
 year-0.050
 neighbourhood-0.049
 woods-0.046
Token�
Token ID241
Feature activation+1.00e+00

Pos logit contributions
 give+0.092
 share+0.086
 leave+0.082
 Sorry+0.082
 listen+0.081
Neg logit contributions
 year-0.090
 nowhere-0.088
 neighbourhood-0.087
 sudden-0.086
<|endoftext|>-0.085
TokenYes
Token ID5297
Feature activation-1.10e+00

Pos logit contributions
 nowhere+0.041
 sudden+0.037
 neighbourhood+0.037
icer+0.037
 woods+0.037
Neg logit contributions
 give-0.044
 listen-0.043
 share-0.041
 wait-0.041
 ask-0.039
Token
Token ID198
Feature activation-1.15e-01

Pos logit contributions
 seas+0.113
 nowhere+0.110
icer+0.106
 sudden+0.102
 woods+0.102
Neg logit contributions
 give-0.085
 share-0.083
 ask-0.082
 listen-0.078
 please-0.073
Token"
Token ID1
Feature activation-9.89e-01

Pos logit contributions
 give+0.007
 share+0.006
 listen+0.006
 ask+0.006
 wait+0.006
Neg logit contributions
 nowhere-0.003
 sudden-0.003
 year-0.003
 neighbourhood-0.003
 woods-0.003
TokenI
Token ID40
Feature activation-1.82e+00

Pos logit contributions
 give+0.053
 leave+0.050
 share+0.049
 listen+0.048
 ask+0.046
Neg logit contributions
 nowhere-0.033
 sudden-0.032
 year-0.031
 neighbourhood-0.029
 woods-0.028
Token want
Token ID765
Feature activation-8.51e-01

Pos logit contributions
 give+0.058
 leave+0.057
 share+0.056
 Sorry+0.055
 listen+0.051
Neg logit contributions
 year-0.087
 nowhere-0.084
 sudden-0.081
 neighbourhood-0.078
in-0.078
Token it
Token ID340
Feature activation-6.27e-01

Pos logit contributions
 give+0.036
 share+0.034
 listen+0.033
 leave+0.033
 wait+0.032
Neg logit contributions
 nowhere-0.036
 sudden-0.035
 year-0.034
 neighbourhood-0.032
 woods-0.032
Token to
Token ID284
Feature activation-3.13e+00

Pos logit contributions
 give+0.026
 share+0.024
 listen+0.023
 leave+0.023
 wait+0.022
Neg logit contributions
 nowhere-0.027
 sudden-0.027
 year-0.026
 neighbourhood-0.025
 woods-0.025
Token be
Token ID307
Feature activation-1.68e+00

Pos logit contributions
ourage+0.085
 Sorry+0.085
aring+0.084
osity+0.081
 give+0.080
Neg logit contributions
 year-0.139
<|endoftext|>-0.139
 time-0.134
 nowhere-0.130
 forest-0.128
Token big
Token ID1263
Feature activation-1.12e+00

Pos logit contributions
 give+0.087
 leave+0.084
 listen+0.082
 wait+0.082
 share+0.080
Neg logit contributions
 nowhere-0.053
 sudden-0.049
 year-0.047
 pieces-0.045
eh-0.044
Token enough
Token ID1576
Feature activation-1.22e+00

Pos logit contributions
 give+0.051
 share+0.048
 listen+0.046
 leave+0.046
 wait+0.045
Neg logit contributions
 nowhere-0.043
 year-0.042
 sudden-0.042
 neighbourhood-0.040
 woods-0.039
Token to
Token ID284
Feature activation-2.90e+00

Pos logit contributions
 give+0.053
 share+0.050
 leave+0.048
 listen+0.048
 wait+0.046
Neg logit contributions
 year-0.049
 nowhere-0.049
 sudden-0.048
 woods-0.045
 neighbourhood-0.045
Token fit
Token ID4197
Feature activation-6.89e-01

Pos logit contributions
 Sorry+0.097
aring+0.087
ourage+0.086
 give+0.085
osity+0.084
Neg logit contributions
 year-0.119
 sudden-0.117
 time-0.115
<|endoftext|>-0.114
 nowhere-0.111
Token we
Token ID356
Feature activation-1.46e+00

Pos logit contributions
 give+0.088
 share+0.087
 listen+0.083
 ask+0.082
ourage+0.081
Neg logit contributions
 nowhere-0.087
 year-0.080
 sudden-0.080
 pieces-0.076
 seas-0.076
Token can
Token ID460
Feature activation-2.82e+00

Pos logit contributions
 give+0.055
 share+0.051
 listen+0.050
 leave+0.048
 wait+0.047
Neg logit contributions
 nowhere-0.065
 year-0.063
 sudden-0.062
 neighbourhood-0.060
<|endoftext|>-0.058
Token't
Token ID470
Feature activation-2.73e+00

Pos logit contributions
 give+0.089
 Sorry+0.087
ourage+0.084
 share+0.081
istence+0.080
Neg logit contributions
 year-0.120
<|endoftext|>-0.115
 was-0.113
 sudden-0.111
in-0.107
Token eat
Token ID4483
Feature activation-1.29e-01

Pos logit contributions
 give+0.117
 Sorry+0.116
osity+0.114
 share+0.112
istence+0.112
Neg logit contributions
 year-0.090
<|endoftext|>-0.082
 area-0.080
in-0.079
 sudden-0.078
Token steam
Token ID13324
Feature activation-1.73e+00

Pos logit contributions
 give+0.006
 share+0.005
 listen+0.005
 leave+0.005
 wait+0.005
Neg logit contributions
 nowhere-0.005
 sudden-0.005
 year-0.005
 neighbourhood-0.005
 woods-0.005
Token.
Token ID13
Feature activation-3.42e+00

Pos logit contributions
 give+0.083
 share+0.080
 listen+0.076
 leave+0.075
 ask+0.075
Neg logit contributions
 year-0.058
 nowhere-0.054
 sudden-0.053
 area-0.052
 woods-0.052
Token But
Token ID887
Feature activation-2.49e+00

Pos logit contributions
 give+0.141
 share+0.134
 listen+0.133
 wait+0.127
 leave+0.122
Neg logit contributions
 snowball-0.131
 sudden-0.125
 year-0.125
<|endoftext|>-0.124
icer-0.123
Token we
Token ID356
Feature activation-2.13e+00

Pos logit contributions
 give+0.081
 share+0.077
 listen+0.074
ourage+0.071
 pay+0.069
Neg logit contributions
 nowhere-0.116
 sudden-0.108
 year-0.105
 seas-0.104
 pieces-0.104
Token can
Token ID460
Feature activation-3.03e+00

Pos logit contributions
 give+0.071
 listen+0.066
 share+0.065
 Sorry+0.064
 leave+0.061
Neg logit contributions
 nowhere-0.097
 year-0.094
 sudden-0.089
<|endoftext|>-0.087
 neighbourhood-0.087
Token eat
Token ID4483
Feature activation+3.13e-02

Pos logit contributions
 give+0.088
ourage+0.085
istence+0.084
 Sorry+0.082
 listen+0.079
Neg logit contributions
 year-0.135
<|endoftext|>-0.132
 sudden-0.127
 was-0.122
 forest-0.122
Token what
Token ID644
Feature activation+1.12e-01

Pos logit contributions
 nowhere+0.001
 sudden+0.001
 neighbourhood+0.001
 year+0.001
 woods+0.001
Neg logit contributions
 give-0.001
 share-0.001
 listen-0.001
 leave-0.001
 wait-0.001
Token't
Token ID470
Feature activation-2.08e+00

Pos logit contributions
 give+0.105
 wait+0.103
 share+0.102
 leave+0.101
 listen+0.098
Neg logit contributions
 year-0.120
 nowhere-0.119
<|endoftext|>-0.118
 pieces-0.109
 sudden-0.108
Token like
Token ID588
Feature activation+7.43e-01

Pos logit contributions
 give+0.049
 Sorry+0.049
 share+0.045
ourage+0.044
 leave+0.044
Neg logit contributions
 year-0.111
 nowhere-0.104
 time-0.103
<|endoftext|>-0.100
in-0.098
Token rac
Token ID3444
Feature activation-8.23e-01

Pos logit contributions
 sudden+0.033
 neighbourhood+0.032
 year+0.032
 nowhere+0.031
icer+0.031
Neg logit contributions
 give-0.031
 share-0.029
 listen-0.028
 leave-0.028
 ask-0.026
Tokenco
Token ID1073
Feature activation+9.70e-01

Pos logit contributions
 give+0.044
 share+0.042
 leave+0.041
 listen+0.041
 wait+0.040
Neg logit contributions
 nowhere-0.026
 year-0.024
 sudden-0.024
 neighbourhood-0.023
 woods-0.022
Tokenons
Token ID684
Feature activation-2.04e-01

Pos logit contributions
 nowhere+0.035
 sudden+0.035
 woods+0.033
 neighbourhood+0.033
 seas+0.030
Neg logit contributions
 give-0.049
 share-0.048
 listen-0.048
 wait-0.044
 pay-0.043
Token.
Token ID13
Feature activation-3.39e+00

Pos logit contributions
 give+0.009
 share+0.009
 listen+0.009
 leave+0.008
 wait+0.008
Neg logit contributions
 nowhere-0.008
 sudden-0.008
 year-0.008
 neighbourhood-0.008
 woods-0.007
Token They
Token ID1119
Feature activation-2.11e+00

Pos logit contributions
 give+0.138
 wait+0.131
 listen+0.131
 leave+0.126
 share+0.125
Neg logit contributions
 year-0.124
icer-0.122
 woods-0.121
 snowball-0.118
aws-0.116
Token are
Token ID389
Feature activation-6.00e-01

Pos logit contributions
 give+0.077
 listen+0.071
 share+0.071
 leave+0.069
 ask+0.067
Neg logit contributions
 year-0.089
 nowhere-0.088
 sudden-0.085
 pieces-0.084
 neighbourhood-0.082
Token mean
Token ID1612
Feature activation-3.32e-01

Pos logit contributions
 give+0.026
 share+0.025
 listen+0.024
 leave+0.024
 wait+0.023
Neg logit contributions
 nowhere-0.024
 sudden-0.024
 year-0.024
 neighbourhood-0.023
 woods-0.022
Token and
Token ID290
Feature activation-2.31e+00

Pos logit contributions
 give+0.016
 share+0.015
 listen+0.015
 leave+0.014
 wait+0.014
Neg logit contributions
 nowhere-0.013
 sudden-0.012
 year-0.012
 neighbourhood-0.012
 woods-0.011
Token they
Token ID484
Feature activation-1.86e+00

Pos logit contributions
 give+0.108
 leave+0.101
 listen+0.100
 share+0.100
 wait+0.098
Neg logit contributions
 nowhere-0.078
 sudden-0.072
 year-0.070
 pieces-0.065
 seas-0.063
Token Lily
Token ID20037
Feature activation-7.98e-01

Pos logit contributions
 give+0.027
 share+0.026
 listen+0.025
 leave+0.025
 wait+0.024
Neg logit contributions
 nowhere-0.027
 sudden-0.026
 year-0.025
 neighbourhood-0.025
 woods-0.024
Token said
Token ID531
Feature activation-1.65e+00

Pos logit contributions
 give+0.043
 share+0.041
 listen+0.040
 leave+0.040
 wait+0.039
Neg logit contributions
 nowhere-0.025
 sudden-0.024
 year-0.023
 neighbourhood-0.022
 woods-0.021
Token.
Token ID13
Feature activation-6.33e-01

Pos logit contributions
 give+0.095
 share+0.092
 leave+0.089
 listen+0.088
 wait+0.088
Neg logit contributions
 nowhere-0.043
 sudden-0.037
 year-0.036
 pieces-0.035
in-0.035
Token "
Token ID366
Feature activation-1.98e-01

Pos logit contributions
 give+0.029
 share+0.027
 listen+0.026
 leave+0.026
 wait+0.025
Neg logit contributions
 nowhere-0.025
 sudden-0.025
 year-0.024
 neighbourhood-0.023
 woods-0.023
TokenWe
Token ID1135
Feature activation-1.55e+00

Pos logit contributions
 give+0.010
 share+0.009
 listen+0.009
 wait+0.008
 leave+0.008
Neg logit contributions
 nowhere-0.007
 sudden-0.007
 year-0.007
 neighbourhood-0.007
 woods-0.007
Token can
Token ID460
Feature activation-3.41e+00

Pos logit contributions
 give+0.057
 listen+0.053
 share+0.053
 leave+0.051
 wait+0.050
Neg logit contributions
 nowhere-0.067
 year-0.066
 sudden-0.063
 neighbourhood-0.063
in-0.061
Token share
Token ID2648
Feature activation+5.52e-02

Pos logit contributions
istence+0.084
osity+0.083
 listen+0.076
ourage+0.076
 Sorry+0.074
Neg logit contributions
 year-0.162
 was-0.155
<|endoftext|>-0.152
 neighbourhood-0.152
 neighborhood-0.144
Token with
Token ID351
Feature activation+7.45e-01

Pos logit contributions
 sudden+0.002
 nowhere+0.002
 year+0.002
 neighbourhood+0.002
 woods+0.002
Neg logit contributions
 give-0.002
 share-0.002
 listen-0.002
 leave-0.002
 ask-0.002
Token you
Token ID345
Feature activation+5.24e-01

Pos logit contributions
 year+0.043
 sudden+0.043
 nowhere+0.042
 neighbourhood+0.041
eh+0.041
Neg logit contributions
 give-0.021
 share-0.020
 listen-0.019
 leave-0.018
 wait-0.017
Token."
Token ID526
Feature activation-4.61e-01

Pos logit contributions
 sudden+0.026
 nowhere+0.025
 neighbourhood+0.025
 woods+0.024
 seas+0.024
Neg logit contributions
 give-0.019
 share-0.019
 listen-0.017
 leave-0.017
 wait-0.016
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 give+0.028
 share+0.027
 listen+0.026
 leave+0.026
 ask+0.025
Neg logit contributions
 nowhere-0.011
 sudden-0.011
 year-0.011
 neighbourhood-0.010
 woods-0.010