TOP ACTIVATIONS
MAX = 0.092

laughed and said , " Oh no , now I have
scream . " Oh no , Lily . You
ase . " Oh no , Tom , what
. They said , " Oh no ! Where is our
bite . " Y uck !" Lily said .
button . " Oh no , the goose wants
floor . " Oh no !" everyone said .
He cries out , " Oh no , my car !"
grass . " Oh no , Wally !" they
ground . " Oh no !" Anna cried .
mess and said , " Oh no , Kitty ! What
sky and said , " Look ! It 's a comet
happy and said , " Look , Mom ! A white
happy . " Really ?" Sara says .
. Tom said , " Oh no ! My cake is
bed . " Oh no , Tom , I
Tim my said , " Oh no ! Our cars are
airport . " Look , Lily , we are
ed . " Oh no , Rex , stop
mouth . " Oh no , our ball !"

MIN ACTIVATIONS
MIN = -0.085

. She knew that the only way to open the folder
things are not as important as the people we love .
. He knew that the only way he could find True
loved that v ase more than anything else in the house
things were not as important as she thought . <|endoftext|> Once
that yellow was not the only color in the world .
The dragon knew that the only way to reach the fairy
mighty is not as important as being nice . <|endoftext|> S
to others is more important than being jealous and selfish .
that material things are not as important as the people we
red hair and the biggest , brightest , green eyes .
things are not what they seem . And they learned that
giving up was sometimes the only way to succeed . <|endoftext|>
things are not what they seem , just like the lion
and never worried about what might happen . One
wealthy is not as important as being kind . They learned
things are not what they seem . They thanked mom my
" This is not the only red cr ay on .
. Jack was very smart , but he was having trouble
scary things can have happy endings . <|endoftext|> Once upon a

INTERVAL 0.048 - 0.092
CONTAINS 0.969%

was . " Where should we put him
haircut ." Tom said , " Yes , mom , let
flowers . " Wow !" said Maria , with
. They went back to the park . They
ended , Tim my went back outside to play . He

INTERVAL 0.003 - 0.048
CONTAINS 39.694%

pile of leaves blowing in the wind and making a weird
way . The end . <|endoftext|> Once upon a time ,
'll help you ." Roc co pulled the carrot out of
out to see a guitar shop . The bunny gazed at
bites the stick . The stick breaks . Tim falls into

INTERVAL -0.041 - 0.003
CONTAINS 58.273%

't understand . M illy still hated the task
play in the hall at school . The hall is big
and have more time to play . The moral of the
shelf all day , wishing for a friend .
it , but she didn 't know how . Mitt ens

INTERVAL -0.085 - -0.041
CONTAINS 1.064%

. Jack was only three years old and he loved to
She was very com fy when she sat down in her
, it found a nice , y ummy treat . It
down all the things he wanted , and all the things
boy . He was three years old and had lots of
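
A note on the interval buckets above: the four INTERVAL edges are consistent, up to rounding, with four equal-width bins between MIN = -0.085 and MAX = 0.092, and the four CONTAINS percentages sum to 100%. The sketch below shows one way such shares could be computed. It is an assumption about the method, not the dashboard's actual code; the function name `interval_shares` and the synthetic data are hypothetical.

```python
import numpy as np

def interval_shares(activations, n_bins=4):
    """Split activations into equal-width bins spanning min..max.

    Returns (edges, percent): the n_bins + 1 bin edges and the share of
    values (in percent) that falls into each bin, in ascending order.
    Assumption: equal-width binning; the dashboard may bin differently.
    """
    acts = np.asarray(activations, dtype=float)
    edges = np.linspace(acts.min(), acts.max(), n_bins + 1)
    # np.histogram includes the right edge in the last bin, so every
    # value is counted exactly once.
    counts, _ = np.histogram(acts, bins=edges)
    percent = 100.0 * counts / acts.size
    return edges, percent

# Synthetic stand-in data (NOT the real feature activations).
rng = np.random.default_rng(0)
demo = rng.normal(0.0, 0.02, size=10_000)
edges, percent = interval_shares(demo)
# Printed lowest interval first; the dashboard lists the highest first.
for lo, hi, p in zip(edges[:-1], edges[1:], percent):
    print(f"INTERVAL {lo:.3f} - {hi:.3f}  CONTAINS {p:.3f}%")
```

With real data, `activations` would be the flat array of this feature's activation at every token position in the evaluated text.
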
Token laughed
Token ID13818
Feature activation+1.32e-02

Pos logit contributions
Sorry+0.032
Grab+0.026
W+0.025
Boo+0.024
ted+0.024
Neg logit contributions
 idea-0.021
 supposed-0.021
 pretending-0.021
 balancing-0.021
 way-0.020
Token and
Token ID290
Feature activation+3.21e-02

Pos logit contributions
Sorry+0.033
Grab+0.029
W+0.028
Boo+0.028
ted+0.026
Neg logit contributions
 idea-0.016
 supposed-0.015
epad-0.015
 possible-0.014
 pretending-0.014
Token said
Token ID531
Feature activation+2.00e-02

Pos logit contributions
Sorry+0.070
Grab+0.059
W+0.056
Boo+0.055
ipped+0.054
Neg logit contributions
 idea-0.049
 pretending-0.048
 supposed-0.047
 way-0.046
 balancing-0.045
Token,
Token ID11
Feature activation-2.37e-02

Pos logit contributions
Sorry+0.048
Grab+0.042
W+0.040
Boo+0.039
ted+0.038
Neg logit contributions
 idea-0.027
 supposed-0.026
epad-0.025
 possible-0.024
 happen-0.023
Token "
Token ID366
Feature activation+5.84e-02

Pos logit contributions
 supposed+0.035
 idea+0.034
epad+0.034
 possible+0.032
 recognise+0.031
Neg logit contributions
Sorry-0.054
W-0.045
Grab-0.045
Boo-0.043
ted-0.043
TokenOh
Token ID5812
Feature activation+9.23e-02

Pos logit contributions
Sorry+0.075
ipped+0.063
ted+0.059
Grab+0.058
olded+0.053
Neg logit contributions
 supposed-0.123
 possible-0.122
 idea-0.121
epad-0.121
 recognise-0.117
Token no
Token ID645
Feature activation+4.16e-02

Pos logit contributions
Sorry+0.237
Grab+0.213
ted+0.201
W+0.201
Boo+0.198
Neg logit contributions
 idea-0.116
 way-0.098
 supposed-0.097
 possible-0.092
 pretending-0.091
Token,
Token ID11
Feature activation+2.98e-02

Pos logit contributions
Sorry+0.101
Grab+0.088
W+0.086
Boo+0.084
ted+0.083
Neg logit contributions
 idea-0.056
 supposed-0.055
epad-0.052
 possible-0.051
 pretending-0.049
Token now
Token ID783
Feature activation+1.65e-02

Pos logit contributions
Sorry+0.060
Grab+0.049
W+0.049
Boo+0.047
ted+0.046
Neg logit contributions
 idea-0.051
 supposed-0.051
epad-0.050
 possible-0.048
 happen-0.047
Token I
Token ID314
Feature activation+3.99e-02

Pos logit contributions
Sorry+0.034
Grab+0.028
W+0.027
Boo+0.027
ted+0.026
Neg logit contributions
 idea-0.028
 supposed-0.028
epad-0.027
 possible-0.027
 happen-0.026
Token have
Token ID423
Feature activation+3.68e-02

Pos logit contributions
Sorry+0.079
Grab+0.066
W+0.064
ted+0.063
Boo+0.061
Neg logit contributions
 supposed-0.071
 idea-0.070
epad-0.066
 recognise-0.064
 possible-0.063
Token scream
Token ID8196
Feature activation+3.01e-03

Pos logit contributions
Sorry+0.006
Boo+0.005
ted+0.005
W+0.005
Grab+0.005
Neg logit contributions
 possible-0.006
epad-0.006
 supposed-0.005
 idea-0.005
 quest-0.005
Token.
Token ID13
Feature activation-4.60e-03

Pos logit contributions
Sorry+0.006
W+0.005
Grab+0.005
ted+0.005
Boo+0.005
Neg logit contributions
 supposed-0.005
epad-0.005
 possible-0.005
 idea-0.005
 recognise-0.004
Token
Token ID198
Feature activation+7.45e-03

Pos logit contributions
 idea+0.007
 supposed+0.007
 way+0.006
epad+0.006
 recognise+0.006
Neg logit contributions
Sorry-0.009
Grab-0.008
ipped-0.008
W-0.008
ted-0.008
Token
Token ID198
Feature activation+2.03e-02

Pos logit contributions
Sorry+0.015
Grab+0.012
W+0.012
Boo+0.012
ted+0.011
Neg logit contributions
 supposed-0.013
 idea-0.013
epad-0.012
 possible-0.012
 recognise-0.012
Token"
Token ID1
Feature activation+6.99e-02

Pos logit contributions
ishly+0.040
ted+0.039
ipped+0.037
ternally+0.032
rified+0.032
Neg logit contributions
 supposed-0.025
 recognise-0.025
 long-0.024
 impressed-0.024
 exc-0.024
TokenOh
Token ID5812
Feature activation+9.17e-02

Pos logit contributions
ugg+0.094
ipped+0.091
olded+0.090
ishly+0.089
ted+0.084
Neg logit contributions
 happen-0.120
 answer-0.117
 waking-0.116
 time-0.116
 idea-0.114
Token no
Token ID645
Feature activation+4.63e-02

Pos logit contributions
Sorry+0.239
Grab+0.228
ted+0.213
W+0.212
Boo+0.209
Neg logit contributions
 idea-0.092
 way-0.085
 pretending-0.079
 supposed-0.076
 happen-0.074
Token,
Token ID11
Feature activation+2.96e-02

Pos logit contributions
Sorry+0.114
Grab+0.101
W+0.098
Boo+0.095
ted+0.094
Neg logit contributions
 idea-0.062
 supposed-0.059
epad-0.055
 possible-0.055
 happen-0.053
Token Lily
Token ID20037
Feature activation+6.48e-03

Pos logit contributions
Sorry+0.066
W+0.055
Grab+0.055
ted+0.054
Boo+0.054
Neg logit contributions
 idea-0.045
 supposed-0.044
 possible-0.042
 happen-0.042
 pretending-0.041
Token.
Token ID13
Feature activation+3.99e-02

Pos logit contributions
Sorry+0.014
W+0.012
Grab+0.012
Boo+0.012
ted+0.011
Neg logit contributions
epad-0.010
 supposed-0.010
 idea-0.010
 possible-0.009
 recognise-0.009
Token You
Token ID921
Feature activation+5.76e-02

Pos logit contributions
Sorry+0.080
Grab+0.069
aid+0.068
H+0.068
W+0.067
Neg logit contributions
 pretending-0.058
arter-0.054
 image-0.052
 marking-0.050
 stove-0.049
Tokenase
Token ID589
Feature activation+2.70e-02

Pos logit contributions
Sorry+0.088
Grab+0.080
W+0.072
ted+0.071
Boo+0.069
Neg logit contributions
 supposed-0.057
 idea-0.057
epad-0.056
 trick-0.053
od-0.053
Token.
Token ID13
Feature activation-5.22e-03

Pos logit contributions
Sorry+0.046
ted+0.036
W+0.036
Boo+0.035
ipped+0.034
Neg logit contributions
 supposed-0.049
epad-0.049
 recognise-0.048
 possible-0.047
 idea-0.047
Token
Token ID198
Feature activation+9.85e-03

Pos logit contributions
 idea+0.009
 supposed+0.008
epad+0.008
 recognise+0.008
 possible+0.008
Neg logit contributions
Sorry-0.010
Grab-0.008
W-0.008
ted-0.008
Boo-0.008
Token
Token ID198
Feature activation+2.28e-02

Pos logit contributions
Sorry+0.020
Grab+0.016
W+0.016
Boo+0.016
ted+0.016
Neg logit contributions
 supposed-0.016
 idea-0.016
epad-0.016
 possible-0.015
 recognise-0.015
Token"
Token ID1
Feature activation+6.83e-02

Pos logit contributions
ishly+0.047
ted+0.046
ipped+0.044
ternally+0.040
Sorry+0.038
Neg logit contributions
 supposed-0.027
 impressed-0.025
 recognise-0.025
 long-0.024
 possible-0.024
TokenOh
Token ID5812
Feature activation+9.10e-02

Pos logit contributions
ipped+0.090
ishly+0.088
ugg+0.087
ted+0.087
olded+0.083
Neg logit contributions
epad-0.123
 idea-0.121
 answer-0.121
 supposed-0.120
 happen-0.120
Token no
Token ID645
Feature activation+4.73e-02

Pos logit contributions
Sorry+0.231
Grab+0.212
W+0.201
ted+0.201
Boo+0.195
Neg logit contributions
 idea-0.109
 way-0.095
 supposed-0.094
 pretending-0.091
 happen-0.090
Token,
Token ID11
Feature activation+3.29e-02

Pos logit contributions
Sorry+0.108
Grab+0.090
W+0.090
Boo+0.087
ted+0.085
Neg logit contributions
 supposed-0.069
 idea-0.069
epad-0.068
 possible-0.064
 recognise-0.061
Token Tom
Token ID4186
Feature activation+6.79e-03

Pos logit contributions
Sorry+0.073
W+0.062
Grab+0.061
Boo+0.060
ted+0.060
Neg logit contributions
 idea-0.050
 supposed-0.049
 possible-0.047
 happen-0.047
epad-0.046
Token,
Token ID11
Feature activation+3.57e-02

Pos logit contributions
Sorry+0.013
W+0.011
Grab+0.011
Boo+0.011
ted+0.010
Neg logit contributions
epad-0.012
 supposed-0.011
 idea-0.011
 possible-0.011
 recognise-0.011
Token what
Token ID644
Feature activation+4.95e-02

Pos logit contributions
Sorry+0.074
Grab+0.061
W+0.060
Boo+0.058
ted+0.057
Neg logit contributions
 supposed-0.059
 idea-0.059
epad-0.058
 possible-0.056
 happen-0.055
Token.
Token ID13
Feature activation-5.72e-04

Pos logit contributions
Sorry+0.006
W+0.005
ted+0.005
Grab+0.005
ipped+0.005
Neg logit contributions
epad-0.006
 recognise-0.006
 supposed-0.006
 idea-0.006
 possible-0.006
Token They
Token ID1119
Feature activation+2.07e-02

Pos logit contributions
 idea+0.001
 supposed+0.001
epad+0.001
 possible+0.001
 pretending+0.001
Neg logit contributions
Sorry-0.001
Grab-0.001
Boo-0.001
W-0.001
ipped-0.001
Token said
Token ID531
Feature activation+2.23e-02

Pos logit contributions
Sorry+0.043
Grab+0.037
W+0.036
Boo+0.035
ipped+0.034
Neg logit contributions
 idea-0.031
 supposed-0.031
epad-0.030
 answer-0.029
 recognise-0.029
Token,
Token ID11
Feature activation-8.69e-03

Pos logit contributions
Sorry+0.052
Grab+0.045
W+0.044
Boo+0.043
ted+0.042
Neg logit contributions
 idea-0.031
 supposed-0.030
epad-0.030
 possible-0.029
 happen-0.027
Token "
Token ID366
Feature activation+5.76e-02

Pos logit contributions
 supposed+0.012
 idea+0.012
epad+0.012
 possible+0.011
 recognise+0.010
Neg logit contributions
Sorry-0.021
Grab-0.017
W-0.017
Boo-0.017
ted-0.017
TokenOh
Token ID5812
Feature activation+9.07e-02

Pos logit contributions
olded+0.065
Sorry+0.064
ipped+0.064
ugg+0.060
ted+0.059
Neg logit contributions
 happen-0.109
 idea-0.106
 possible-0.106
epad-0.105
 supposed-0.103
Token no
Token ID645
Feature activation+4.30e-02

Pos logit contributions
Sorry+0.230
Grab+0.213
W+0.198
ted+0.198
Boo+0.194
Neg logit contributions
 idea-0.106
 way-0.094
 supposed-0.092
 pretending-0.091
 happen-0.090
Token!
Token ID0
Feature activation+4.76e-02

Pos logit contributions
Sorry+0.101
Grab+0.087
W+0.085
Boo+0.083
ted+0.082
Neg logit contributions
 idea-0.060
 supposed-0.059
epad-0.058
 possible-0.055
 pretending-0.053
Token Where
Token ID6350
Feature activation+5.40e-02

Pos logit contributions
Sorry+0.086
Grab+0.072
W+0.070
ted+0.068
Sp+0.066
Neg logit contributions
 pretending-0.081
 supposed-0.080
 idea-0.077
epad-0.076
 balancing-0.075
Token is
Token ID318
Feature activation+1.05e-02

Pos logit contributions
Sorry+0.099
Grab+0.079
W+0.078
Boo+0.075
ted+0.073
Neg logit contributions
 supposed-0.102
 idea-0.102
epad-0.101
 possible-0.097
 recognise-0.094
Token our
Token ID674
Feature activation+1.16e-02

Pos logit contributions
Sorry+0.020
Grab+0.016
W+0.016
Boo+0.016
ted+0.015
Neg logit contributions
 supposed-0.019
 idea-0.019
epad-0.019
 possible-0.018
 happen-0.017
Token bite
Token ID13197
Feature activation+2.36e-02

Pos logit contributions
Sorry+0.020
Grab+0.018
W+0.017
Boo+0.017
ted+0.017
Neg logit contributions
 supposed-0.014
epad-0.013
 possible-0.012
 recognise-0.012
 pretending-0.011
Token.
Token ID13
Feature activation+6.56e-03

Pos logit contributions
Sorry+0.053
W+0.044
Grab+0.044
Boo+0.043
ted+0.043
Neg logit contributions
 supposed-0.035
epad-0.035
 idea-0.034
 possible-0.032
 recognise-0.032
Token
Token ID198
Feature activation+1.27e-02

Pos logit contributions
Sorry+0.012
Grab+0.010
W+0.010
Boo+0.010
ted+0.009
Neg logit contributions
 idea-0.012
 supposed-0.011
epad-0.011
 pretending-0.011
 possible-0.011
Token
Token ID198
Feature activation+2.63e-02

Pos logit contributions
Sorry+0.024
Grab+0.019
W+0.019
Boo+0.019
ted+0.018
Neg logit contributions
 supposed-0.023
 idea-0.023
epad-0.023
 possible-0.022
 recognise-0.021
Token"
Token ID1
Feature activation+7.00e-02

Pos logit contributions
ted+0.053
ishly+0.049
ipped+0.047
als+0.043
rified+0.041
Neg logit contributions
 supposed-0.033
 impressed-0.033
 recognise-0.033
 balancing-0.033
 possible-0.033
TokenY
Token ID56
Feature activation+9.03e-02

Pos logit contributions
Sorry+0.080
ipped+0.073
olded+0.068
ted+0.068
ishly+0.062
Neg logit contributions
epad-0.146
 idea-0.144
 supposed-0.140
 possible-0.139
 page-0.139
Tokenuck
Token ID1347
Feature activation+5.10e-02

Pos logit contributions
Sorry+0.161
Sp+0.126
Grab+0.126
W+0.124
Boo+0.120
Neg logit contributions
 idea-0.174
 supposed-0.165
 happen-0.158
 answer-0.152
epad-0.152
Token!"
Token ID2474
Feature activation+2.04e-02

Pos logit contributions
Sorry+0.117
Grab+0.098
W+0.098
Boo+0.095
ted+0.092
Neg logit contributions
 supposed-0.073
 idea-0.073
epad-0.071
 possible-0.068
 happen-0.065
Token Lily
Token ID20037
Feature activation+1.68e-02

Pos logit contributions
Sorry+0.047
W+0.041
Grab+0.041
Boo+0.040
ipped+0.040
Neg logit contributions
 idea-0.030
 pretending-0.025
 answer-0.024
 happen-0.024
 way-0.023
Token said
Token ID531
Feature activation+2.12e-02

Pos logit contributions
Sorry+0.030
Grab+0.024
W+0.024
Boo+0.023
ted+0.022
Neg logit contributions
 idea-0.032
 supposed-0.032
epad-0.032
 possible-0.030
 pretending-0.030
Token.
Token ID13
Feature activation-1.37e-02

Pos logit contributions
Sorry+0.043
W+0.035
Grab+0.035
ted+0.034
Boo+0.033
Neg logit contributions
 supposed-0.035
 idea-0.034
epad-0.034
 possible-0.032
 recognise-0.032
Token button
Token ID4936
Feature activation+2.53e-02

Pos logit contributions
Sorry+0.017
W+0.014
Grab+0.014
Boo+0.013
ted+0.013
Neg logit contributions
epad-0.014
 supposed-0.014
 recognise-0.013
 possible-0.013
 idea-0.012
Token.
Token ID13
Feature activation+1.34e-03

Pos logit contributions
Sorry+0.052
W+0.043
ted+0.043
Boo+0.042
Grab+0.041
Neg logit contributions
 supposed-0.041
 idea-0.040
epad-0.040
 recognise-0.038
 possible-0.038
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
Sorry+0.003
Boo+0.002
W+0.002
Grab+0.002
ted+0.002
Neg logit contributions
 idea-0.002
 supposed-0.002
epad-0.002
 pretending-0.002
 possible-0.002
Token
Token ID198
Feature activation+2.36e-02

Pos logit contributions
Sorry+0.022
Grab+0.017
W+0.017
Boo+0.017
ted+0.016
Neg logit contributions
 supposed-0.022
 idea-0.022
epad-0.022
 possible-0.021
 recognise-0.020
Token"
Token ID1
Feature activation+7.12e-02

Pos logit contributions
ted+0.050
ishly+0.047
Sorry+0.046
ipped+0.045
Boo+0.044
Neg logit contributions
 impressed-0.024
 long-0.023
 supposed-0.022
 balancing-0.022
 exc-0.021
TokenOh
Token ID5812
Feature activation+9.01e-02

Pos logit contributions
ipped+0.091
olded+0.088
ishly+0.086
ugg+0.085
uddled+0.078
Neg logit contributions
 happen-0.124
 comet-0.121
 giant-0.120
 trick-0.119
 page-0.117
Token no
Token ID645
Feature activation+4.39e-02

Pos logit contributions
Sorry+0.230
Grab+0.206
W+0.198
ted+0.193
Boo+0.192
Neg logit contributions
 idea-0.112
 supposed-0.098
 way-0.097
 happen-0.093
 pretending-0.093
Token,
Token ID11
Feature activation+3.38e-02

Pos logit contributions
Sorry+0.092
W+0.075
Grab+0.074
Boo+0.073
ted+0.070
Neg logit contributions
epad-0.072
 supposed-0.071
 idea-0.068
 possible-0.067
 recognise-0.066
Token the
Token ID262
Feature activation-4.72e-03

Pos logit contributions
Sorry+0.077
W+0.065
Boo+0.064
Grab+0.063
ted+0.062
Neg logit contributions
 idea-0.050
 supposed-0.048
 happen-0.047
 possible-0.045
 way-0.044
Token goose
Token ID37246
Feature activation+2.39e-02

Pos logit contributions
epad+0.008
 supposed+0.008
 recognise+0.007
 possible+0.007
 pretending+0.007
Neg logit contributions
Sorry-0.010
W-0.008
Grab-0.008
Boo-0.008
ted-0.008
Token wants
Token ID3382
Feature activation+3.16e-03

Pos logit contributions
Sorry+0.050
Grab+0.041
W+0.041
Boo+0.040
ted+0.039
Neg logit contributions
 supposed-0.039
 idea-0.039
epad-0.039
 possible-0.037
 recognise-0.035
Token floor
Token ID4314
Feature activation+1.10e-02

Pos logit contributions
Sorry+0.066
W+0.059
Grab+0.057
ted+0.057
Boo+0.057
Neg logit contributions
epad-0.035
 supposed-0.035
 possible-0.034
 recognise-0.034
 idea-0.032
Token.
Token ID13
Feature activation-4.07e-03

Pos logit contributions
Sorry+0.019
W+0.015
ted+0.015
Boo+0.014
L+0.014
Neg logit contributions
 supposed-0.020
 idea-0.020
epad-0.019
 recognise-0.019
 possible-0.019
Token
Token ID198
Feature activation+1.21e-02

Pos logit contributions
 idea+0.007
 supposed+0.007
epad+0.007
 possible+0.007
 recognise+0.007
Neg logit contributions
Sorry-0.008
Grab-0.006
W-0.006
Boo-0.006
ted-0.006
Token
Token ID198
Feature activation+2.65e-02

Pos logit contributions
Sorry+0.022
Grab+0.018
W+0.018
Boo+0.017
ted+0.017
Neg logit contributions
 supposed-0.022
 idea-0.022
epad-0.022
 possible-0.021
 recognise-0.021
Token"
Token ID1
Feature activation+6.87e-02

Pos logit contributions
ted+0.048
ishly+0.044
Sorry+0.043
ipped+0.043
uddled+0.038
Neg logit contributions
 balancing-0.038
 possible-0.036
 supposed-0.036
onna-0.036
 recognise-0.036
TokenOh
Token ID5812
Feature activation+8.98e-02

Pos logit contributions
ishly+0.080
ipped+0.078
olded+0.077
Sorry+0.074
ugg+0.073
Neg logit contributions
epad-0.124
 comet-0.124
 happen-0.124
 answer-0.123
 possible-0.122
Token no
Token ID645
Feature activation+4.59e-02

Pos logit contributions
Sorry+0.224
Grab+0.202
W+0.191
ted+0.190
Boo+0.186
Neg logit contributions
 idea-0.110
 supposed-0.097
 way-0.095
 happen-0.093
 pretending-0.093
Token!"
Token ID2474
Feature activation+1.98e-02

Pos logit contributions
Sorry+0.102
W+0.084
Grab+0.084
Boo+0.082
ted+0.080
Neg logit contributions
 supposed-0.069
epad-0.069
 idea-0.068
 possible-0.065
 recognise-0.063
Token everyone
Token ID2506
Feature activation+2.75e-03

Pos logit contributions
Sorry+0.047
Grab+0.040
W+0.040
Boo+0.040
ipped+0.038
Neg logit contributions
 idea-0.027
 answer-0.023
epad-0.023
 pretending-0.023
 way-0.022
Token said
Token ID531
Feature activation+1.76e-02

Pos logit contributions
Sorry+0.005
Grab+0.004
W+0.004
Boo+0.004
Sp+0.004
Neg logit contributions
 idea-0.005
 supposed-0.005
 answer-0.005
 happen-0.005
 recognise-0.004
Token.
Token ID13
Feature activation-4.94e-03

Pos logit contributions
Sorry+0.040
Grab+0.033
W+0.033
Boo+0.032
ted+0.032
Neg logit contributions
 supposed-0.026
epad-0.025
 idea-0.025
 possible-0.024
 recognise-0.023
TokenHe
Token ID1544
Feature activation+2.43e-02

Pos logit contributions
Sorry+0.034
ted+0.030
ipped+0.028
ishly+0.027
Grab+0.027
Neg logit contributions
 balancing-0.030
 recognise-0.029
 idea-0.029
 supposed-0.029
epad-0.029
Token cries
Token ID24691
Feature activation+1.61e-02

Pos logit contributions
Sorry+0.056
W+0.045
Grab+0.044
Sp+0.042
Boo+0.042
Neg logit contributions
 idea-0.036
 way-0.034
 balancing-0.034
 pretending-0.034
 supposed-0.033
Token out
Token ID503
Feature activation+7.55e-03

Pos logit contributions
Sorry+0.038
Grab+0.032
W+0.032
Boo+0.031
ted+0.030
Neg logit contributions
 supposed-0.022
 idea-0.022
epad-0.022
 possible-0.021
 recognise-0.020
Token,
Token ID11
Feature activation-6.90e-03

Pos logit contributions
Sorry+0.015
W+0.013
ted+0.013
Grab+0.013
Boo+0.013
Neg logit contributions
 supposed-0.013
epad-0.013
 possible-0.012
 idea-0.011
 recognise-0.011
Token "
Token ID366
Feature activation+6.45e-02

Pos logit contributions
 supposed+0.009
 idea+0.009
epad+0.009
 possible+0.008
 recognise+0.008
Neg logit contributions
Sorry-0.017
Grab-0.014
W-0.014
Boo-0.014
ted-0.013
TokenOh
Token ID5812
Feature activation+8.95e-02

Pos logit contributions
Sorry+0.075
olded+0.070
ishly+0.063
ted+0.062
ugg+0.062
Neg logit contributions
 balance-0.124
 idea-0.123
 balancing-0.123
 answer-0.122
epad-0.120
Token no
Token ID645
Feature activation+4.40e-02

Pos logit contributions
Sorry+0.215
Grab+0.187
W+0.177
ted+0.176
Boo+0.172
Neg logit contributions
 idea-0.121
 supposed-0.114
 pretending-0.109
 possible-0.108
 balancing-0.107
Token,
Token ID11
Feature activation+3.13e-02

Pos logit contributions
Sorry+0.098
W+0.081
Grab+0.081
Boo+0.079
ted+0.076
Neg logit contributions
 supposed-0.066
epad-0.066
 idea-0.065
 possible-0.062
 recognise-0.060
Token my
Token ID616
Feature activation+3.11e-02

Pos logit contributions
Sorry+0.063
W+0.052
Grab+0.052
Boo+0.050
ted+0.048
Neg logit contributions
 supposed-0.054
epad-0.053
 idea-0.053
 possible-0.050
 recognise-0.049
Token car
Token ID1097
Feature activation+3.57e-02

Pos logit contributions
Sorry+0.059
Grab+0.049
W+0.047
Boo+0.046
ted+0.044
Neg logit contributions
 idea-0.062
 supposed-0.058
epad-0.056
 pretending-0.056
 way-0.055
Token!"
Token ID2474
Feature activation+1.27e-02

Pos logit contributions
Sorry+0.083
Grab+0.070
W+0.069
Boo+0.066
ted+0.065
Neg logit contributions
 idea-0.052
 supposed-0.051
epad-0.048
 happen-0.047
 possible-0.046
Token grass
Token ID8701
Feature activation+1.24e-02

Pos logit contributions
Sorry+0.043
Grab+0.037
W+0.037
ted+0.035
Boo+0.035
Neg logit contributions
epad-0.027
 supposed-0.027
 idea-0.025
 possible-0.025
 recognise-0.024
Token.
Token ID13
Feature activation-1.33e-04

Pos logit contributions
Sorry+0.028
Grab+0.024
W+0.024
Boo+0.023
ted+0.023
Neg logit contributions
 supposed-0.018
 idea-0.017
epad-0.017
 possible-0.016
 recognise-0.016
Token
Token ID198
Feature activation+8.24e-03

Pos logit contributions
 idea+0.000
 supposed+0.000
epad+0.000
 possible+0.000
 recognise+0.000
Neg logit contributions
Sorry-0.000
Grab-0.000
W-0.000
Boo-0.000
ted-0.000
Token
Token ID198
Feature activation+2.15e-02

Pos logit contributions
Sorry+0.018
Grab+0.015
ted+0.014
Boo+0.014
W+0.014
Neg logit contributions
 supposed-0.012
 idea-0.012
 recognise-0.011
 possible-0.011
epad-0.011
Token"
Token ID1
Feature activation+6.70e-02

Pos logit contributions
ted+0.039
ipped+0.038
Sorry+0.037
ishly+0.035
cream+0.033
Neg logit contributions
 supposed-0.027
 time-0.026
 possible-0.026
 impressed-0.025
 sunny-0.025
TokenOh
Token ID5812
Feature activation+8.94e-02

Pos logit contributions
Sorry+0.074
ipped+0.070
olded+0.065
ray+0.065
ted+0.061
Neg logit contributions
 possible-0.130
 happen-0.128
 time-0.127
 idea-0.127
 supposed-0.126
Token no
Token ID645
Feature activation+4.62e-02

Pos logit contributions
Sorry+0.227
Grab+0.205
W+0.191
ted+0.191
Boo+0.186
Neg logit contributions
 idea-0.109
 way-0.095
 happen-0.092
 supposed-0.091
 possible-0.090
Token,
Token ID11
Feature activation+3.09e-02

Pos logit contributions
Sorry+0.106
Grab+0.089
W+0.088
Boo+0.086
ted+0.084
Neg logit contributions
 supposed-0.067
 idea-0.067
epad-0.066
 possible-0.063
 recognise-0.060
Token Wally
Token ID50117
Feature activation+2.02e-02

Pos logit contributions
Sorry+0.066
Grab+0.053
W+0.053
Boo+0.053
ted+0.051
Neg logit contributions
 idea-0.050
 supposed-0.049
 possible-0.047
epad-0.047
 happen-0.046
Token!"
Token ID2474
Feature activation+1.27e-02

Pos logit contributions
Sorry+0.041
W+0.034
Grab+0.033
Boo+0.033
ted+0.031
Neg logit contributions
epad-0.035
 supposed-0.033
 idea-0.032
 possible-0.032
 recognise-0.031
Token they
Token ID484
Feature activation+1.87e-02

Pos logit contributions
Sorry+0.027
Grab+0.023
ipped+0.023
W+0.023
Boo+0.022
Neg logit contributions
 idea-0.020
 happen-0.017
 answer-0.017
 way-0.017
 possible-0.016
Token ground
Token ID2323
Feature activation+1.10e-02

Pos logit contributions
Sorry+0.070
Grab+0.060
W+0.060
Boo+0.059
ted+0.057
Neg logit contributions
 supposed-0.032
 idea-0.031
epad-0.031
 possible-0.029
 recognise-0.027
Token.
Token ID13
Feature activation-3.11e-03

Pos logit contributions
Sorry+0.021
W+0.017
ted+0.017
Grab+0.017
Boo+0.017
Neg logit contributions
epad-0.019
 supposed-0.019
 idea-0.018
 possible-0.018
 recognise-0.017
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
 idea+0.006
 supposed+0.005
epad+0.005
 recognise+0.005
 possible+0.005
Neg logit contributions
Sorry-0.006
W-0.005
Boo-0.005
Grab-0.005
ted-0.004
Token
Token ID198
Feature activation+1.90e-02

Pos logit contributions
Sorry+0.019
Grab+0.015
W+0.015
Boo+0.015
ted+0.014
Neg logit contributions
 supposed-0.020
 idea-0.019
epad-0.019
 possible-0.019
 recognise-0.018
Token"
Token ID1
Feature activation+6.91e-02

Pos logit contributions
ted+0.037
Sorry+0.035
ipped+0.035
ugg+0.033
Boo+0.033
Neg logit contributions
 reaching-0.022
 supposed-0.022
 sunny-0.021
 exc-0.020
 recognise-0.020
TokenOh
Token ID5812
Feature activation+8.93e-02

Pos logit contributions
ipped+0.087
olded+0.084
ugg+0.081
ishly+0.078
Sorry+0.077
Neg logit contributions
 happen-0.126
 comet-0.119
 waking-0.119
 possible-0.119
 supposed-0.119
Token no
Token ID645
Feature activation+4.51e-02

Pos logit contributions
Sorry+0.228
Grab+0.204
W+0.195
ted+0.193
Sp+0.189
Neg logit contributions
 idea-0.107
 supposed-0.095
 way-0.093
 happen-0.090
 pretending-0.087
Token!"
Token ID2474
Feature activation+1.54e-02

Pos logit contributions
Sorry+0.099
W+0.082
Grab+0.082
Boo+0.080
ted+0.077
Neg logit contributions
 supposed-0.069
epad-0.069
 idea-0.068
 possible-0.065
 recognise-0.063
Token Anna
Token ID11735
Feature activation+1.36e-02

Pos logit contributions
Sorry+0.034
W+0.029
ipped+0.028
Grab+0.028
Boo+0.028
Neg logit contributions
 idea-0.024
 pretending-0.021
 happen-0.020
 answer-0.020
 way-0.020
Token cried
Token ID16896
Feature activation+2.43e-02

Pos logit contributions
Sorry+0.025
Grab+0.020
W+0.019
Boo+0.019
ted+0.018
Neg logit contributions
 idea-0.026
 supposed-0.026
epad-0.025
 pretending-0.024
 recognise-0.024
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
Sorry+0.062
Grab+0.054
Boo+0.052
W+0.052
ted+0.050
Neg logit contributions
 idea-0.029
 supposed-0.027
epad-0.027
 answer-0.026
 possible-0.025
Token mess
Token ID2085
Feature activation+4.67e-03

Pos logit contributions
epad+0.005
 supposed+0.005
 possible+0.005
 recognise+0.005
od+0.005
Neg logit contributions
Sorry-0.006
W-0.006
Grab-0.005
Sp-0.005
Boo-0.005
Token and
Token ID290
Feature activation+9.61e-03

Pos logit contributions
Sorry+0.010
W+0.009
Grab+0.008
Boo+0.008
ted+0.008
Neg logit contributions
 idea-0.007
 supposed-0.007
epad-0.007
 possible-0.007
 recognise-0.006
Token said
Token ID531
Feature activation+4.55e-03

Pos logit contributions
Sorry+0.020
Grab+0.017
W+0.017
Boo+0.016
Sp+0.016
Neg logit contributions
 idea-0.016
 supposed-0.016
 pretending-0.015
epad-0.015
 way-0.015
Token,
Token ID11
Feature activation-3.66e-02

Pos logit contributions
Sorry+0.012
Grab+0.010
W+0.010
ted+0.010
Boo+0.010
Neg logit contributions
 idea-0.005
 supposed-0.005
epad-0.005
 happen-0.005
 possible-0.005
Token "
Token ID366
Feature activation+5.30e-02

Pos logit contributions
 supposed+0.050
 idea+0.049
epad+0.049
 possible+0.046
 recognise+0.044
Neg logit contributions
Sorry-0.087
W-0.073
Grab-0.073
Boo-0.071
ted-0.069
TokenOh
Token ID5812
Feature activation+8.93e-02

Pos logit contributions
ipped+0.076
olded+0.072
Sorry+0.070
uddle+0.069
uddled+0.068
Neg logit contributions
 supposed-0.086
 waking-0.081
 recognise-0.079
 possible-0.077
 unlike-0.077
Token no
Token ID645
Feature activation+3.94e-02

Pos logit contributions
Sorry+0.237
Grab+0.211
Boo+0.197
W+0.197
ted+0.197
Neg logit contributions
 idea-0.098
 way-0.090
 supposed-0.087
 happen-0.080
 possible-0.078
Token,
Token ID11
Feature activation+1.99e-02

Pos logit contributions
Sorry+0.095
Grab+0.083
W+0.080
Boo+0.079
ted+0.078
Neg logit contributions
 idea-0.053
 supposed-0.053
epad-0.050
 possible-0.048
 pretending-0.047
Token Kitty
Token ID21168
Feature activation+4.23e-03

Pos logit contributions
Sorry+0.043
Boo+0.034
Grab+0.034
W+0.033
ted+0.032
Neg logit contributions
 idea-0.032
 supposed-0.032
 happen-0.030
 way-0.030
 answer-0.029
Token!
Token ID0
Feature activation+2.93e-02

Pos logit contributions
Sorry+0.009
Grab+0.008
W+0.008
Boo+0.008
ted+0.007
Neg logit contributions
 supposed-0.006
epad-0.006
 idea-0.006
 possible-0.006
 recognise-0.006
Token What
Token ID1867
Feature activation+3.82e-02

Pos logit contributions
Sorry+0.064
Grab+0.051
W+0.051
Boo+0.050
ted+0.050
Neg logit contributions
 supposed-0.037
 pretending-0.035
arter-0.034
 writing-0.034
epad-0.033
Token sky
Token ID6766
Feature activation+2.16e-03

Pos logit contributions
Sorry+0.052
W+0.045
Grab+0.045
Boo+0.044
ted+0.044
Neg logit contributions
 supposed-0.016
epad-0.016
 idea-0.014
 recognise-0.014
 possible-0.013
Token and
Token ID290
Feature activation+3.72e-02

Pos logit contributions
Sorry+0.004
ted+0.003
W+0.003
Grab+0.003
Boo+0.003
Neg logit contributions
 supposed-0.004
epad-0.004
 idea-0.004
 possible-0.003
 recognise-0.003
Token said
Token ID531
Feature activation+1.45e-02

Pos logit contributions
Sorry+0.078
W+0.067
Grab+0.067
Sp+0.064
Boo+0.063
Neg logit contributions
 seeing-0.056
 pretending-0.056
epad-0.055
 idea-0.054
 supposed-0.053
Token,
Token ID11
Feature activation-1.98e-02

Pos logit contributions
Sorry+0.032
W+0.026
Grab+0.026
Boo+0.026
ted+0.025
Neg logit contributions
 supposed-0.022
epad-0.022
 idea-0.022
 possible-0.021
 recognise-0.020
Token "
Token ID366
Feature activation+6.08e-02

Pos logit contributions
 supposed+0.032
 idea+0.032
epad+0.031
 possible+0.030
 recognise+0.029
Neg logit contributions
Sorry-0.042
W-0.035
Grab-0.034
ted-0.033
H-0.032
TokenLook
Token ID8567
Feature activation+8.92e-02

Pos logit contributions
Sorry+0.078
ipped+0.067
Grab+0.062
olded+0.062
ted+0.059
Neg logit contributions
epad-0.127
 possible-0.122
 supposed-0.120
 recognise-0.118
 comet-0.118
Token!
Token ID0
Feature activation+2.74e-02

Pos logit contributions
Sorry+0.202
Grab+0.171
W+0.166
Boo+0.162
ted+0.156
Neg logit contributions
 idea-0.130
 supposed-0.130
epad-0.126
 possible-0.122
 happen-0.118
Token It
Token ID632
Feature activation+1.91e-02

Pos logit contributions
Sorry+0.056
Grab+0.046
W+0.044
ipped+0.044
Boo+0.043
Neg logit contributions
epad-0.042
 supposed-0.040
 possible-0.040
 idea-0.039
 pretending-0.039
Token's
Token ID338
Feature activation+1.82e-02

Pos logit contributions
Sorry+0.036
Grab+0.028
W+0.028
Boo+0.027
ted+0.026
Neg logit contributions
 supposed-0.036
 idea-0.036
epad-0.035
 possible-0.034
 happen-0.033
Token a
Token ID257
Feature activation-1.44e-02

Pos logit contributions
Sorry+0.033
W+0.028
Grab+0.027
Boo+0.025
ted+0.025
Neg logit contributions
epad-0.032
 idea-0.029
 supposed-0.029
 recognise-0.029
 quest-0.027
Token comet
Token ID31733
Feature activation+2.24e-02

Pos logit contributions
epad+0.013
 supposed+0.013
 idea+0.011
 recognise+0.011
 possible+0.010
Neg logit contributions
Sorry-0.040
W-0.035
Boo-0.034
Grab-0.033
H-0.033
Token happy
Token ID3772
Feature activation+1.13e-02

Pos logit contributions
Sorry+0.017
W+0.015
Grab+0.014
Boo+0.014
L+0.014
Neg logit contributions
epad-0.010
 supposed-0.009
 possible-0.009
 idea-0.008
 quest-0.008
Token and
Token ID290
Feature activation+2.20e-03

Pos logit contributions
Sorry+0.021
ted+0.017
W+0.016
Boo+0.016
Grab+0.015
Neg logit contributions
epad-0.021
 idea-0.021
 supposed-0.020
 recognise-0.019
 possible-0.019
Token said
Token ID531
Feature activation+1.08e-02

Pos logit contributions
Sorry+0.004
W+0.003
Grab+0.003
Boo+0.003
ted+0.003
Neg logit contributions
epad-0.004
 supposed-0.004
 idea-0.004
 possible-0.004
 recognise-0.004
Token,
Token ID11
Feature activation-3.77e-02

Pos logit contributions
Sorry+0.024
W+0.020
Grab+0.020
Boo+0.019
ted+0.019
Neg logit contributions
 supposed-0.017
epad-0.016
 idea-0.016
 possible-0.015
 recognise-0.015
Token "
Token ID366
Feature activation+5.39e-02

Pos logit contributions
 supposed+0.056
 idea+0.055
epad+0.055
 possible+0.052
 recognise+0.050
Neg logit contributions
Sorry-0.085
W-0.071
Grab-0.071
Boo-0.067
ted-0.067
TokenLook
Token ID8567
Feature activation+8.91e-02

Pos logit contributions
Sorry+0.065
ipped+0.058
olded+0.052
Grab+0.051
uddled+0.049
Neg logit contributions
epad-0.111
 idea-0.108
 giant-0.106
 image-0.106
 page-0.106
Token,
Token ID11
Feature activation+1.73e-02

Pos logit contributions
Sorry+0.196
Grab+0.163
W+0.162
Boo+0.158
ted+0.154
Neg logit contributions
 supposed-0.135
 idea-0.134
epad-0.133
 possible-0.126
 recognise-0.121
Token Mom
Token ID11254
Feature activation+8.66e-03

Pos logit contributions
Sorry+0.033
W+0.027
Grab+0.026
Boo+0.026
ted+0.025
Neg logit contributions
 supposed-0.032
epad-0.032
 idea-0.031
 possible-0.030
 recognise-0.029
Token!
Token ID0
Feature activation+2.67e-02

Pos logit contributions
Sorry+0.016
W+0.013
Grab+0.013
Boo+0.013
ted+0.012
Neg logit contributions
epad-0.016
 supposed-0.015
 idea-0.015
 possible-0.015
 recognise-0.014
Token A
Token ID317
Feature activation-1.24e-02

Pos logit contributions
Sorry+0.052
Grab+0.042
W+0.041
ipped+0.040
Boo+0.040
Neg logit contributions
epad-0.043
 idea-0.043
 supposed-0.042
 possible-0.041
 pretending-0.041
Token white
Token ID2330
Feature activation-2.26e-03

Pos logit contributions
epad+0.018
 supposed+0.018
 recognise+0.017
 possible+0.016
 idea+0.016
Neg logit contributions
Sorry-0.027
W-0.025
Boo-0.023
Grab-0.022
Sp-0.022
Token happy
Token ID3772
Feature activation+2.68e-02

Pos logit contributions
Sorry+0.029
Grab+0.025
W+0.025
Boo+0.024
ted+0.024
Neg logit contributions
 supposed-0.013
epad-0.012
 idea-0.012
 possible-0.012
 recognise-0.011
Token.
Token ID13
Feature activation-1.65e-03

Pos logit contributions
Sorry+0.062
W+0.052
Grab+0.052
Boo+0.050
ted+0.049
Neg logit contributions
 supposed-0.038
 idea-0.038
epad-0.038
 possible-0.036
 recognise-0.034
Token\n
Token ID198
Feature activation+1.10e-02

Pos logit contributions
 idea+0.003
 supposed+0.003
epad+0.003
 possible+0.003
 pretending+0.003
Neg logit contributions
Sorry-0.003
Grab-0.003
Boo-0.003
W-0.003
ipped-0.002
Token\n
Token ID198
Feature activation+2.17e-02

Pos logit contributions
Sorry+0.021
Grab+0.017
W+0.016
Boo+0.016
ted+0.015
Neg logit contributions
 supposed-0.020
 idea-0.020
epad-0.020
 possible-0.019
 recognise-0.019
Token"
Token ID1
Feature activation+6.57e-02

Pos logit contributions
ted+0.039
Sorry+0.038
ipped+0.035
Boo+0.034
Grab+0.034
Neg logit contributions
 supposed-0.031
 possible-0.030
 recognise-0.030
 balancing-0.029
 seeing-0.029
TokenReally
Token ID26392
Feature activation+8.87e-02

Pos logit contributions
ipped+0.078
ted+0.075
olded+0.073
Sorry+0.068
ishly+0.066
Neg logit contributions
 possible-0.123
epad-0.122
 idea-0.120
 answer-0.119
 happen-0.118
Token?"
Token ID1701
Feature activation+7.00e-03

Pos logit contributions
Sorry+0.214
Grab+0.190
W+0.183
Boo+0.183
ted+0.180
Neg logit contributions
 idea-0.112
 supposed-0.110
epad-0.108
 possible-0.106
 happen-0.100
Token Sara
Token ID24799
Feature activation+6.72e-03

Pos logit contributions
Sorry+0.014
Grab+0.012
W+0.012
Boo+0.012
ted+0.011
Neg logit contributions
 idea-0.012
 supposed-0.011
epad-0.011
 possible-0.011
 recognise-0.010
Token says
Token ID1139
Feature activation+2.53e-02

Pos logit contributions
Sorry+0.011
W+0.009
Grab+0.009
H+0.008
ted+0.008
Neg logit contributions
epad-0.013
 supposed-0.013
 possible-0.013
 happen-0.013
 idea-0.012
Token.
Token ID13
Feature activation-1.66e-02

Pos logit contributions
Sorry+0.050
W+0.040
ted+0.039
Grab+0.039
ipped+0.037
Neg logit contributions
 supposed-0.045
 idea-0.042
epad-0.041
 possible-0.041
 recognise-0.040
Token\n
Token ID198
Feature activation+1.28e-02

Pos logit contributions
 supposed+0.035
epad+0.034
 idea+0.034
 possible+0.033
 happen+0.032
Neg logit contributions
Sorry-0.028
W-0.022
Grab-0.022
Boo-0.020
ted-0.020
Token.
Token ID13
Feature activation+4.77e-04

Pos logit contributions
Sorry+0.063
W+0.053
Grab+0.052
ted+0.052
Boo+0.050
Neg logit contributions
 supposed-0.048
epad-0.047
 idea-0.047
 possible-0.045
 recognise-0.044
Token Tom
Token ID4186
Feature activation+1.10e-02

Pos logit contributions
Sorry+0.001
W+0.001
Boo+0.001
Grab+0.001
Sp+0.001
Neg logit contributions
 idea-0.001
 supposed-0.001
 way-0.001
 about-0.001
 answer-0.001
Token said
Token ID531
Feature activation+1.69e-02

Pos logit contributions
Sorry+0.024
Grab+0.019
W+0.019
Boo+0.018
ted+0.017
Neg logit contributions
 supposed-0.017
 idea-0.017
 pretending-0.016
 way-0.016
 balancing-0.016
Token,
Token ID11
Feature activation-2.39e-02

Pos logit contributions
Sorry+0.041
Grab+0.035
W+0.034
Boo+0.033
ted+0.033
Neg logit contributions
 idea-0.022
 supposed-0.022
epad-0.021
 possible-0.021
 happen-0.020
Token "
Token ID366
Feature activation+5.75e-02

Pos logit contributions
 supposed+0.035
 idea+0.034
epad+0.034
 possible+0.033
 recognise+0.031
Neg logit contributions
Sorry-0.055
W-0.046
Grab-0.046
Boo-0.044
ted-0.044
TokenOh
Token ID5812
Feature activation+8.87e-02

Pos logit contributions
ipped+0.078
ishly+0.070
uddled+0.068
olded+0.068
ugg+0.067
Neg logit contributions
 supposed-0.095
 recognise-0.091
 page-0.091
 angle-0.090
 giant-0.090
Token no
Token ID645
Feature activation+3.80e-02

Pos logit contributions
Sorry+0.225
Grab+0.204
W+0.193
ted+0.192
Sp+0.190
Neg logit contributions
 idea-0.108
 supposed-0.095
 way-0.095
 possible-0.088
 happen-0.088
Token!
Token ID0
Feature activation+3.69e-02

Pos logit contributions
Sorry+0.084
W+0.070
Grab+0.070
Boo+0.068
ted+0.066
Neg logit contributions
 supposed-0.057
epad-0.057
 idea-0.057
 possible-0.054
 recognise-0.052
Token My
Token ID2011
Feature activation+2.69e-02

Pos logit contributions
Sorry+0.077
Grab+0.066
Sp+0.064
ted+0.064
W+0.063
Neg logit contributions
 supposed-0.042
 pretending-0.041
arter-0.041
epad-0.039
 image-0.039
Token cake
Token ID12187
Feature activation+3.54e-02

Pos logit contributions
Sorry+0.059
Grab+0.049
W+0.049
Boo+0.048
ted+0.046
Neg logit contributions
 supposed-0.041
 idea-0.041
epad-0.041
 possible-0.039
 recognise-0.037
Token is
Token ID318
Feature activation+1.92e-02

Pos logit contributions
Sorry+0.063
Grab+0.051
W+0.050
Boo+0.048
ted+0.047
Neg logit contributions
 supposed-0.069
 idea-0.068
epad-0.067
 possible-0.065
 happen-0.063
Token bed
Token ID3996
Feature activation+1.13e-02

Pos logit contributions
Sorry+0.091
W+0.079
Grab+0.078
Boo+0.077
ted+0.075
Neg logit contributions
 supposed-0.042
epad-0.041
 idea-0.040
 possible-0.039
 recognise-0.036
Token.
Token ID13
Feature activation+5.83e-03

Pos logit contributions
Sorry+0.023
W+0.019
Boo+0.018
ted+0.018
Grab+0.018
Neg logit contributions
 supposed-0.019
epad-0.018
 idea-0.018
 possible-0.017
 recognise-0.017
Token\n
Token ID198
Feature activation+1.51e-02

Pos logit contributions
Sorry+0.013
W+0.010
Grab+0.010
Boo+0.010
ted+0.010
Neg logit contributions
 idea-0.009
 supposed-0.009
epad-0.008
 recognise-0.008
 possible-0.008
Token\n
Token ID198
Feature activation+2.76e-02

Pos logit contributions
Sorry+0.029
Grab+0.023
W+0.023
Boo+0.022
ted+0.022
Neg logit contributions
 supposed-0.028
 idea-0.027
epad-0.027
 possible-0.026
 recognise-0.025
Token"
Token ID1
Feature activation+7.25e-02

Pos logit contributions
ted+0.046
Sorry+0.045
ishly+0.043
ipped+0.043
Grab+0.039
Neg logit contributions
 supposed-0.043
 recognise-0.041
 reaching-0.039
 seeing-0.039
 balancing-0.038
TokenOh
Token ID5812
Feature activation+8.87e-02

Pos logit contributions
ipped+0.092
ishly+0.091
olded+0.090
ugg+0.088
Sorry+0.084
Neg logit contributions
 supposed-0.124
 pretending-0.121
 possible-0.121
 waking-0.121
epad-0.121
Token no
Token ID645
Feature activation+4.65e-02

Pos logit contributions
Sorry+0.227
Grab+0.208
W+0.197
ted+0.196
Sp+0.194
Neg logit contributions
 idea-0.105
 supposed-0.092
 way-0.089
 pretending-0.086
 happen-0.083
Token,
Token ID11
Feature activation+3.19e-02

Pos logit contributions
Sorry+0.109
Grab+0.093
W+0.092
Boo+0.089
ted+0.087
Neg logit contributions
 idea-0.066
 supposed-0.065
epad-0.063
 possible-0.061
 happen-0.058
Token Tom
Token ID4186
Feature activation+5.72e-03

Pos logit contributions
Sorry+0.075
W+0.063
Grab+0.063
Boo+0.061
ted+0.061
Neg logit contributions
 idea-0.045
 supposed-0.044
 possible-0.041
 happen-0.041
 pretending-0.041
Token,
Token ID11
Feature activation+3.04e-02

Pos logit contributions
Sorry+0.011
W+0.009
Grab+0.009
Boo+0.009
ted+0.009
Neg logit contributions
epad-0.010
 supposed-0.009
 idea-0.009
 possible-0.009
 recognise-0.009
Token I
Token ID314
Feature activation+5.12e-02

Pos logit contributions
Sorry+0.062
Grab+0.051
W+0.050
Boo+0.049
ted+0.048
Neg logit contributions
 supposed-0.052
 idea-0.051
epad-0.050
 possible-0.049
 happen-0.048
TokenTim
Token ID14967
Feature activation+5.99e-03

Pos logit contributions
Sorry+0.007
ipped+0.007
ted+0.007
Grab+0.007
Boo+0.006
Neg logit contributions
 supposed-0.009
 recognise-0.008
 idea-0.008
 pretending-0.008
 seeing-0.008
Tokenmy
Token ID1820
Feature activation+1.92e-02

Pos logit contributions
Sorry+0.010
Grab+0.008
W+0.008
Boo+0.008
ted+0.007
Neg logit contributions
 supposed-0.012
 idea-0.012
epad-0.012
 possible-0.012
 recognise-0.011
Token said
Token ID531
Feature activation+1.42e-02

Pos logit contributions
Sorry+0.040
Grab+0.033
W+0.031
Boo+0.031
Sp+0.029
Neg logit contributions
 supposed-0.031
 idea-0.031
 way-0.030
 pretending-0.029
 balancing-0.029
Token,
Token ID11
Feature activation-2.68e-02

Pos logit contributions
Sorry+0.037
Grab+0.034
W+0.032
Boo+0.031
H+0.031
Neg logit contributions
 idea-0.016
 supposed-0.014
epad-0.014
 happen-0.014
 possible-0.014
Token "
Token ID366
Feature activation+6.03e-02

Pos logit contributions
 supposed+0.032
 idea+0.032
epad+0.031
 possible+0.029
 pretending+0.028
Neg logit contributions
Sorry-0.067
Grab-0.057
W-0.057
Boo-0.056
ted-0.054
TokenOh
Token ID5812
Feature activation+8.85e-02

Pos logit contributions
Sorry+0.072
ipped+0.069
ted+0.061
Grab+0.059
ugg+0.058
Neg logit contributions
 supposed-0.126
epad-0.120
 possible-0.119
 recognise-0.119
 idea-0.118
Token no
Token ID645
Feature activation+3.88e-02

Pos logit contributions
Sorry+0.228
Grab+0.210
W+0.199
Boo+0.197
ted+0.196
Neg logit contributions
 idea-0.100
 supposed-0.088
 way-0.086
 happen-0.084
 pretending-0.082
Token!
Token ID0
Feature activation+4.30e-02

Pos logit contributions
Sorry+0.095
Grab+0.085
W+0.081
Boo+0.080
ted+0.079
Neg logit contributions
 idea-0.051
 supposed-0.050
epad-0.047
 possible-0.046
 pretending-0.045
Token Our
Token ID3954
Feature activation+1.86e-02

Pos logit contributions
Sorry+0.085
Grab+0.074
W+0.070
ted+0.070
Boo+0.069
Neg logit contributions
 supposed-0.058
 pretending-0.057
 marking-0.055
arter-0.054
 balancing-0.053
Token cars
Token ID5006
Feature activation+2.74e-02

Pos logit contributions
Sorry+0.034
Grab+0.028
W+0.027
Boo+0.027
ted+0.026
Neg logit contributions
 idea-0.037
 supposed-0.035
epad-0.034
 way-0.034
 pretending-0.034
Token are
Token ID389
Feature activation+2.68e-02

Pos logit contributions
Sorry+0.059
Grab+0.049
W+0.049
Boo+0.048
ted+0.047
Neg logit contributions
 idea-0.045
 supposed-0.044
 happen-0.042
epad-0.041
 way-0.040
Token airport
Token ID9003
Feature activation+4.54e-03

Pos logit contributions
 supposed+0.007
epad+0.007
 idea+0.007
 recognise+0.007
 possible+0.007
Neg logit contributions
Sorry-0.011
W-0.009
Grab-0.009
Boo-0.009
ted-0.009
Token.
Token ID13
Feature activation+4.61e-04

Pos logit contributions
Sorry+0.009
W+0.007
ted+0.007
Boo+0.007
Grab+0.007
Neg logit contributions
 supposed-0.008
epad-0.007
 idea-0.007
 recognise-0.007
 possible-0.007
Token\n
Token ID198
Feature activation+9.68e-03

Pos logit contributions
Sorry+0.001
Grab+0.001
Boo+0.001
ipped+0.001
ted+0.001
Neg logit contributions
 idea-0.001
 supposed-0.001
epad-0.001
 possible-0.001
 pretending-0.001
Token\n
Token ID198
Feature activation+2.12e-02

Pos logit contributions
Sorry+0.017
W+0.014
Grab+0.014
Boo+0.013
ted+0.013
Neg logit contributions
 supposed-0.019
epad-0.019
 idea-0.019
 possible-0.018
 recognise-0.017
Token"
Token ID1
Feature activation+6.17e-02

Pos logit contributions
ted+0.036
ishly+0.035
ipped+0.034
Sorry+0.032
Grab+0.030
Neg logit contributions
 supposed-0.034
 impressed-0.033
 possible-0.033
 balancing-0.032
 recognise-0.032
TokenLook
Token ID8567
Feature activation+8.85e-02

Pos logit contributions
Sorry+0.077
ipped+0.066
Grab+0.064
olded+0.063
ted+0.063
Neg logit contributions
epad-0.129
 idea-0.127
 possible-0.125
 supposed-0.122
 answer-0.122
Token,
Token ID11
Feature activation+2.53e-02

Pos logit contributions
Sorry+0.205
Grab+0.174
W+0.172
Boo+0.168
ted+0.162
Neg logit contributions
 idea-0.124
 supposed-0.123
epad-0.120
 possible-0.115
 happen-0.110
Token Lily
Token ID20037
Feature activation+3.27e-03

Pos logit contributions
Sorry+0.052
Grab+0.043
W+0.043
Boo+0.042
ted+0.041
Neg logit contributions
 idea-0.043
 supposed-0.041
epad-0.040
 possible-0.040
 happen-0.039
Token,
Token ID11
Feature activation+2.51e-02

Pos logit contributions
Sorry+0.006
W+0.005
Grab+0.005
Boo+0.005
ted+0.005
Neg logit contributions
epad-0.006
 supposed-0.006
 possible-0.005
 idea-0.005
 recognise-0.005
Token we
Token ID356
Feature activation+3.02e-02

Pos logit contributions
Sorry+0.047
W+0.038
Grab+0.038
Boo+0.036
ted+0.035
Neg logit contributions
 supposed-0.047
epad-0.046
 idea-0.046
 possible-0.043
 recognise-0.043
Token are
Token ID389
Feature activation+2.41e-02

Pos logit contributions
Sorry+0.063
Grab+0.052
W+0.051
ted+0.050
Boo+0.050
Neg logit contributions
 supposed-0.050
 idea-0.050
epad-0.049
 possible-0.046
 recognise-0.045
Tokened
Token ID276
Feature activation+2.51e-02

Pos logit contributions
Sorry+0.017
Grab+0.013
W+0.012
Boo+0.011
ted+0.010
Neg logit contributions
 idea-0.029
 supposed-0.029
epad-0.029
 pretending-0.028
 possible-0.027
Token.
Token ID13
Feature activation-9.53e-03

Pos logit contributions
Sorry+0.060
Grab+0.051
W+0.050
Boo+0.049
ted+0.047
Neg logit contributions
 idea-0.034
 supposed-0.033
epad-0.033
 possible-0.031
 pretending-0.031
Token\n
Token ID198
Feature activation+5.50e-03

Pos logit contributions
 idea+0.016
 supposed+0.015
epad+0.015
 possible+0.014
 balancing+0.014
Neg logit contributions
Sorry-0.019
Grab-0.016
W-0.015
Boo-0.015
ted-0.015
Token\n
Token ID198
Feature activation+1.63e-02

Pos logit contributions
Sorry+0.010
Grab+0.008
W+0.008
Boo+0.008
ted+0.008
Neg logit contributions
 supposed-0.010
 idea-0.010
epad-0.010
 possible-0.010
 recognise-0.009
Token"
Token ID1
Feature activation+6.61e-02

Pos logit contributions
ted+0.030
ipped+0.030
ishly+0.029
Sorry+0.029
cream+0.027
Neg logit contributions
 long-0.017
 time-0.017
 balancing-0.017
 sunny-0.017
 recognise-0.016
TokenOh
Token ID5812
Feature activation+8.85e-02

Pos logit contributions
ipped+0.083
olded+0.079
Sorry+0.075
ugg+0.075
ishly+0.072
Neg logit contributions
 trick-0.120
 answer-0.115
 idea-0.113
 giant-0.112
 happen-0.112
Token no
Token ID645
Feature activation+4.29e-02

Pos logit contributions
Sorry+0.222
Grab+0.198
W+0.187
ted+0.184
Boo+0.182
Neg logit contributions
 idea-0.109
 way-0.098
 supposed-0.095
 pretending-0.095
 happen-0.092
Token,
Token ID11
Feature activation+2.73e-02

Pos logit contributions
Sorry+0.093
W+0.076
Grab+0.076
Boo+0.074
ted+0.072
Neg logit contributions
epad-0.067
 supposed-0.067
 idea-0.065
 possible-0.063
 recognise-0.061
Token Rex
Token ID17853
Feature activation+6.20e-03

Pos logit contributions
Sorry+0.058
Grab+0.046
W+0.046
Boo+0.046
ted+0.044
Neg logit contributions
 idea-0.044
 supposed-0.043
 possible-0.041
 happen-0.041
 pretending-0.040
Token,
Token ID11
Feature activation+3.28e-02

Pos logit contributions
Sorry+0.013
W+0.011
Grab+0.011
Boo+0.011
ted+0.010
Neg logit contributions
epad-0.010
 supposed-0.010
 idea-0.009
 possible-0.009
 recognise-0.009
Token stop
Token ID2245
Feature activation+2.37e-02

Pos logit contributions
Sorry+0.062
Grab+0.050
W+0.050
Boo+0.048
ted+0.046
Neg logit contributions
 supposed-0.060
epad-0.060
 idea-0.060
 possible-0.057
 recognise-0.055
Token mouth
Token ID5422
Feature activation+1.99e-02

Pos logit contributions
Sorry+0.088
Grab+0.072
W+0.071
Boo+0.070
ted+0.068
Neg logit contributions
 idea-0.060
 supposed-0.058
epad-0.056
 way-0.055
 possible-0.053
Token.
Token ID13
Feature activation-1.36e-03

Pos logit contributions
Sorry+0.045
W+0.037
Grab+0.037
Boo+0.036
ted+0.036
Neg logit contributions
 supposed-0.029
 idea-0.029
epad-0.029
 possible-0.027
 recognise-0.026
Token\n
Token ID198
Feature activation+1.19e-02

Pos logit contributions
 supposed+0.003
epad+0.003
 idea+0.003
 possible+0.002
 recognise+0.002
Neg logit contributions
Sorry-0.002
Grab-0.002
W-0.002
Boo-0.002
ted-0.002
Token\n
Token ID198
Feature activation+2.28e-02

Pos logit contributions
Sorry+0.021
W+0.016
Grab+0.016
Boo+0.016
ted+0.015
Neg logit contributions
 supposed-0.024
epad-0.024
 idea-0.024
 possible-0.023
 recognise-0.022
Token"
Token ID1
Feature activation+7.02e-02

Pos logit contributions
Sorry+0.030
ishly+0.030
ted+0.030
ipped+0.025
Grab+0.025
Neg logit contributions
 possible-0.042
 balancing-0.042
onna-0.042
 idea-0.041
 recognise-0.041
TokenOh
Token ID5812
Feature activation+8.84e-02

Pos logit contributions
Sorry+0.078
olded+0.077
ipped+0.077
ugg+0.071
ishly+0.067
Neg logit contributions
 giant-0.134
 idea-0.131
 answer-0.131
 happen-0.130
 possible-0.130
Token no
Token ID645
Feature activation+4.54e-02

Pos logit contributions
Sorry+0.223
Grab+0.201
W+0.188
ted+0.188
Sp+0.182
Neg logit contributions
 idea-0.110
 way-0.095
 supposed-0.094
 pretending-0.093
 balancing-0.089
Token,
Token ID11
Feature activation+2.75e-02

Pos logit contributions
Sorry+0.105
Grab+0.089
W+0.088
Boo+0.085
ted+0.084
Neg logit contributions
 idea-0.065
 supposed-0.065
epad-0.063
 possible-0.061
 happen-0.058
Token our
Token ID674
Feature activation+2.42e-02

Pos logit contributions
Sorry+0.055
Grab+0.045
W+0.045
Boo+0.044
ted+0.042
Neg logit contributions
 supposed-0.047
 idea-0.047
epad-0.047
 possible-0.045
 recognise-0.043
Token ball
Token ID2613
Feature activation+4.25e-02

Pos logit contributions
Sorry+0.056
Grab+0.047
W+0.046
Boo+0.046
ted+0.044
Neg logit contributions
 idea-0.036
 supposed-0.034
 way-0.034
 happen-0.032
epad-0.032
Token!"
Token ID2474
Feature activation+1.80e-02

Pos logit contributions
Sorry+0.089
Grab+0.074
W+0.073
Boo+0.070
ted+0.068
Neg logit contributions
 idea-0.070
 supposed-0.069
epad-0.066
 possible-0.064
 happen-0.064
Token.
Token ID13
Feature activation-9.47e-03

Pos logit contributions
epad+0.051
 supposed+0.051
 recognise+0.046
 idea+0.046
onna+0.045
Neg logit contributions
Sorry-0.059
W-0.050
ted-0.050
Boo-0.048
aw-0.047
Token She
Token ID1375
Feature activation-4.72e-03

Pos logit contributions
 idea+0.015
epad+0.015
 supposed+0.014
 recognise+0.014
 possible+0.014
Neg logit contributions
Sorry-0.020
Boo-0.016
Grab-0.016
W-0.016
ted-0.015
Token knew
Token ID2993
Feature activation-1.11e-02

Pos logit contributions
 idea+0.008
 supposed+0.007
 way+0.007
epad+0.007
 recognise+0.007
Neg logit contributions
Sorry-0.010
Grab-0.008
W-0.008
Boo-0.008
ted-0.008
Token that
Token ID326
Feature activation-3.10e-02

Pos logit contributions
 supposed+0.017
 idea+0.017
 possible+0.017
epad+0.017
 balancing+0.015
Neg logit contributions
Sorry-0.025
Grab-0.021
W-0.020
Boo-0.020
ted-0.019
Token the
Token ID262
Feature activation-3.24e-02

Pos logit contributions
 supposed+0.049
epad+0.049
 idea+0.048
 possible+0.045
 recognise+0.045
Neg logit contributions
Sorry-0.065
W-0.053
Grab-0.053
Boo-0.051
ted-0.051
Token only
Token ID691
Feature activation-8.45e-02

Pos logit contributions
 supposed+0.037
epad+0.036
 idea+0.036
 possible+0.034
 recognise+0.032
Neg logit contributions
Sorry-0.083
Grab-0.071
W-0.071
Boo-0.070
ted-0.068
Token way
Token ID835
Feature activation-2.71e-02

Pos logit contributions
 supposed+0.113
epad+0.110
 idea+0.105
 possible+0.100
 recognise+0.100
Neg logit contributions
Sorry-0.195
W-0.166
Grab-0.165
Boo-0.163
ted-0.159
Token to
Token ID284
Feature activation-2.21e-02

Pos logit contributions
 supposed+0.040
 idea+0.040
epad+0.040
 possible+0.037
 recognise+0.036
Neg logit contributions
Sorry-0.060
W-0.050
Grab-0.050
Boo-0.048
ted-0.047
Token open
Token ID1280
Feature activation-2.05e-03

Pos logit contributions
 supposed+0.029
epad+0.028
 idea+0.027
 possible+0.025
 pretending+0.025
Neg logit contributions
Sorry-0.051
W-0.043
Grab-0.043
Boo-0.042
ted-0.040
Token the
Token ID262
Feature activation-2.27e-03

Pos logit contributions
 supposed+0.003
 idea+0.003
epad+0.003
 possible+0.003
 recognise+0.003
Neg logit contributions
Sorry-0.005
Grab-0.004
W-0.004
Boo-0.004
ted-0.004
Token folder
Token ID9483
Feature activation+1.23e-02

Pos logit contributions
 supposed+0.003
epad+0.003
 idea+0.003
 possible+0.003
 recognise+0.002
Neg logit contributions
Sorry-0.006
Grab-0.005
W-0.005
Boo-0.005
ted-0.005
Token things
Token ID1243
Feature activation-4.95e-02

Pos logit contributions
 supposed+0.064
 idea+0.064
epad+0.062
 possible+0.061
 happen+0.059
Neg logit contributions
Sorry-0.080
Grab-0.066
W-0.065
Boo-0.064
ted-0.060
Token are
Token ID389
Feature activation-4.14e-02

Pos logit contributions
 supposed+0.089
epad+0.089
 idea+0.089
 possible+0.085
 recognise+0.082
Neg logit contributions
Sorry-0.094
Grab-0.075
W-0.075
Boo-0.073
ted-0.070
Token not
Token ID407
Feature activation-7.58e-02

Pos logit contributions
epad+0.057
 idea+0.054
 supposed+0.052
 angle+0.051
 recognise+0.051
Neg logit contributions
Sorry-0.091
W-0.077
Grab-0.075
Sp-0.072
H-0.071
Token as
Token ID355
Feature activation-7.98e-02

Pos logit contributions
epad+0.104
 supposed+0.097
 angle+0.097
 idea+0.096
 recognise+0.096
Neg logit contributions
Sorry-0.155
W-0.131
Grab-0.127
ted-0.122
L-0.121
Token important
Token ID1593
Feature activation-1.97e-02

Pos logit contributions
epad+0.097
 supposed+0.094
 idea+0.093
 recognise+0.086
 angle+0.084
Neg logit contributions
Sorry-0.190
W-0.162
Grab-0.159
Boo-0.153
ted-0.150
Token as
Token ID355
Feature activation-8.38e-02

Pos logit contributions
 supposed+0.032
 idea+0.031
epad+0.031
 recognise+0.030
 possible+0.029
Neg logit contributions
Sorry-0.039
ted-0.032
W-0.032
Grab-0.031
Boo-0.030
Token the
Token ID262
Feature activation-4.11e-02

Pos logit contributions
 idea+0.122
epad+0.122
 supposed+0.120
 recognise+0.112
 possible+0.110
Neg logit contributions
Sorry-0.182
W-0.152
Grab-0.150
Boo-0.146
ted-0.144
Token people
Token ID661
Feature activation-5.94e-02

Pos logit contributions
 supposed+0.057
epad+0.056
 idea+0.056
 possible+0.053
 recognise+0.052
Neg logit contributions
Sorry-0.095
W-0.079
Grab-0.079
Boo-0.077
ted-0.075
Token we
Token ID356
Feature activation-2.76e-02

Pos logit contributions
 supposed+0.091
 idea+0.091
epad+0.090
 possible+0.086
 recognise+0.082
Neg logit contributions
Sorry-0.128
W-0.106
Grab-0.106
Boo-0.103
ted-0.101
Token love
Token ID1842
Feature activation-3.46e-02

Pos logit contributions
 supposed+0.034
 idea+0.034
epad+0.033
 recognise+0.031
 answer+0.031
Neg logit contributions
Sorry-0.070
Grab-0.059
W-0.059
Boo-0.059
ted-0.056
Token.
Token ID13
Feature activation+1.26e-03

Pos logit contributions
 idea+0.058
 supposed+0.058
epad+0.058
 possible+0.055
 recognise+0.054
Neg logit contributions
Sorry-0.069
W-0.057
Grab-0.056
Boo-0.054
ted-0.054
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
epad+0.041
 supposed+0.039
 idea+0.039
 possible+0.037
 recognise+0.037
Neg logit contributions
Sorry-0.040
ted-0.033
W-0.032
Boo-0.030
Grab-0.030
Token He
Token ID679
Feature activation-6.51e-03

Pos logit contributions
 idea+0.019
epad+0.018
 supposed+0.018
 possible+0.017
 answer+0.017
Neg logit contributions
Sorry-0.025
Boo-0.020
W-0.020
Grab-0.019
ted-0.019
Token knew
Token ID2993
Feature activation-1.07e-02

Pos logit contributions
 idea+0.011
 supposed+0.011
epad+0.010
 recognise+0.010
 answer+0.010
Neg logit contributions
Sorry-0.014
Grab-0.011
W-0.011
Boo-0.011
Sp-0.010
Token that
Token ID326
Feature activation-2.90e-02

Pos logit contributions
 supposed+0.016
 idea+0.016
epad+0.016
 possible+0.016
 happen+0.015
Neg logit contributions
Sorry-0.024
Grab-0.020
W-0.020
Boo-0.019
ted-0.018
Token the
Token ID262
Feature activation-3.56e-02

Pos logit contributions
 supposed+0.045
epad+0.044
 idea+0.044
 recognise+0.041
 possible+0.039
Neg logit contributions
Sorry-0.062
W-0.051
Grab-0.051
Boo-0.049
ted-0.049
Token only
Token ID691
Feature activation-8.38e-02

Pos logit contributions
 supposed+0.052
epad+0.051
 idea+0.047
 recognise+0.047
 possible+0.047
Neg logit contributions
Sorry-0.081
W-0.068
Grab-0.067
Boo-0.066
ted-0.062
Token way
Token ID835
Feature activation-2.50e-02

Pos logit contributions
 supposed+0.119
epad+0.116
 idea+0.111
 possible+0.107
 recognise+0.106
Neg logit contributions
Sorry-0.186
W-0.159
Grab-0.158
Boo-0.157
ted-0.151
Token he
Token ID339
Feature activation-2.84e-02

Pos logit contributions
 idea+0.036
 supposed+0.036
epad+0.036
 possible+0.034
 recognise+0.033
Neg logit contributions
Sorry-0.055
W-0.046
Grab-0.046
Boo-0.045
ted-0.044
Token could
Token ID714
Feature activation-3.47e-02

Pos logit contributions
 supposed+0.048
 idea+0.048
epad+0.047
 possible+0.045
 recognise+0.044
Neg logit contributions
Sorry-0.058
Grab-0.047
W-0.047
Boo-0.046
ted-0.044
Token find
Token ID1064
Feature activation-7.92e-03

Pos logit contributions
epad+0.048
 supposed+0.047
 possible+0.043
 idea+0.042
 pretending+0.041
Neg logit contributions
Sorry-0.078
W-0.065
Grab-0.064
Boo-0.063
H-0.060
Token True
Token ID6407
Feature activation-1.90e-02

Pos logit contributions
 supposed+0.013
 idea+0.013
epad+0.012
 possible+0.012
 recognise+0.011
Neg logit contributions
Sorry-0.017
Grab-0.014
W-0.014
Boo-0.013
ted-0.013
Token loved
Token ID6151
Feature activation-3.18e-02

Pos logit contributions
 idea+0.021
epad+0.020
 supposed+0.020
 possible+0.019
 recognise+0.019
Neg logit contributions
Sorry-0.029
W-0.025
Grab-0.024
ted-0.023
Boo-0.023
Token that
Token ID326
Feature activation-3.01e-02

Pos logit contributions
epad+0.048
 idea+0.047
 supposed+0.046
 possible+0.045
 recognise+0.044
Neg logit contributions
Sorry-0.067
W-0.057
ted-0.057
Boo-0.054
Grab-0.054
Token v
Token ID410
Feature activation+1.22e-02

Pos logit contributions
 supposed+0.041
epad+0.041
 idea+0.040
 possible+0.038
 recognise+0.037
Neg logit contributions
Sorry-0.070
W-0.059
Grab-0.058
Boo-0.057
ted-0.056
Tokenase
Token ID589
Feature activation-1.40e-03

Pos logit contributions
Sorry+0.027
Grab+0.025
W+0.023
Boo+0.022
ted+0.022
Neg logit contributions
 supposed-0.017
 idea-0.017
od-0.016
epad-0.015
 balancing-0.014
Token more
Token ID517
Feature activation-2.55e-02

Pos logit contributions
 supposed+0.002
 idea+0.002
epad+0.002
 possible+0.002
 recognise+0.002
Neg logit contributions
Sorry-0.003
Grab-0.003
W-0.003
Boo-0.002
ted-0.002
Token than
Token ID621
Feature activation-8.28e-02

Pos logit contributions
epad+0.028
 supposed+0.027
 idea+0.027
 possible+0.024
 recognise+0.024
Neg logit contributions
Sorry-0.067
W-0.058
Grab-0.056
Boo-0.055
ted-0.054
Token anything
Token ID1997
Feature activation-4.00e-02

Pos logit contributions
 supposed+0.119
epad+0.116
 possible+0.115
 idea+0.113
 balancing+0.105
Neg logit contributions
Sorry-0.195
Grab-0.165
W-0.161
Boo-0.160
ted-0.151
Token else
Token ID2073
Feature activation-6.12e-02

Pos logit contributions
epad+0.067
 idea+0.066
 supposed+0.065
 recognise+0.061
 possible+0.060
Neg logit contributions
Sorry-0.081
W-0.067
Boo-0.064
Grab-0.064
ted-0.063
Token in
Token ID287
Feature activation-4.62e-02

Pos logit contributions
 idea+0.100
epad+0.100
 supposed+0.099
 recognise+0.092
 possible+0.091
Neg logit contributions
Sorry-0.124
W-0.102
Grab-0.098
Boo-0.097
ted-0.096
Token the
Token ID262
Feature activation+1.22e-03

Pos logit contributions
 supposed+0.070
 idea+0.070
epad+0.069
 possible+0.065
 recognise+0.063
Neg logit contributions
Sorry-0.100
W-0.083
Grab-0.083
Boo-0.080
ted-0.078
Token house
Token ID2156
Feature activation-3.71e-02

Pos logit contributions
Sorry+0.003
Grab+0.003
W+0.003
Boo+0.003
ted+0.003
Neg logit contributions
 idea-0.001
 supposed-0.001
epad-0.001
 possible-0.001
 way-0.001
Token things
Token ID1243
Feature activation-3.94e-02

Pos logit contributions
 idea+0.044
 supposed+0.043
 possible+0.040
epad+0.040
 happen+0.040
Neg logit contributions
Sorry-0.059
Grab-0.049
W-0.049
Boo-0.049
Sp-0.045
Token were
Token ID547
Feature activation-3.81e-02

Pos logit contributions
 supposed+0.067
 idea+0.067
epad+0.067
 possible+0.064
 recognise+0.061
Neg logit contributions
Sorry-0.078
Grab-0.064
W-0.064
Boo-0.062
ted-0.060
Token not
Token ID407
Feature activation-6.70e-02

Pos logit contributions
epad+0.049
 idea+0.046
 supposed+0.045
 recognise+0.043
 balancing+0.042
Neg logit contributions
Sorry-0.090
W-0.077
Grab-0.075
ted-0.072
Boo-0.071
Token as
Token ID355
Feature activation-7.37e-02

Pos logit contributions
epad+0.087
 idea+0.082
 supposed+0.082
 balancing+0.080
 recognise+0.080
Neg logit contributions
Sorry-0.148
W-0.125
Grab-0.124
ted-0.118
H-0.116
Token important
Token ID1593
Feature activation-1.81e-02

Pos logit contributions
epad+0.085
 supposed+0.084
 idea+0.083
 possible+0.075
 recognise+0.075
Neg logit contributions
Sorry-0.183
W-0.156
Grab-0.156
Boo-0.150
ted-0.147
Token as
Token ID355
Feature activation-8.14e-02

Pos logit contributions
epad+0.029
 idea+0.028
 supposed+0.028
 possible+0.027
 recognise+0.027
Neg logit contributions
Sorry-0.037
W-0.030
Grab-0.030
ted-0.030
Boo-0.029
Token she
Token ID673
Feature activation-3.61e-02

Pos logit contributions
 idea+0.113
epad+0.113
 supposed+0.113
 possible+0.104
 recognise+0.103
Neg logit contributions
Sorry-0.185
W-0.155
Grab-0.154
Boo-0.150
ted-0.148
Token thought
Token ID1807
Feature activation-6.09e-02

Pos logit contributions
 supposed+0.053
 idea+0.053
 answer+0.050
epad+0.049
 happen+0.048
Neg logit contributions
Sorry-0.081
Boo-0.067
Grab-0.066
W-0.066
Sp-0.062
Token.
Token ID13
Feature activation-7.49e-03

Pos logit contributions
epad+0.094
 idea+0.092
 supposed+0.092
 recognise+0.086
 possible+0.085
Neg logit contributions
Sorry-0.124
W-0.103
ted-0.102
Grab-0.100
Boo-0.097
Token<|endoftext|>
Token ID50256
Feature activation+6.20e-03

Pos logit contributions
 supposed+0.013
 idea+0.013
epad+0.012
 way+0.012
 pretending+0.011
Neg logit contributions
Sorry-0.015
Grab-0.012
Boo-0.012
W-0.011
ted-0.011
TokenOnce
Token ID7454
Feature activation+1.40e-02

Pos logit contributions
Sorry+0.011
W+0.008
Grab+0.008
Boo+0.008
ted+0.008
Neg logit contributions
 supposed-0.012
epad-0.012
 idea-0.012
 possible-0.012
 recognise-0.011
Token that
Token ID326
Feature activation-1.83e-02

Pos logit contributions
Sorry+0.003
Grab+0.002
Boo+0.002
W+0.002
ted+0.002
Neg logit contributions
 idea-0.001
 possible-0.001
 supposed-0.001
 happen-0.001
epad-0.001
Token yellow
Token ID7872
Feature activation-1.91e-02

Pos logit contributions
 supposed+0.026
epad+0.026
 idea+0.025
 recognise+0.024
 possible+0.023
Neg logit contributions
Sorry-0.041
W-0.034
Grab-0.034
Boo-0.032
ted-0.032
Token was
Token ID373
Feature activation-2.04e-02

Pos logit contributions
 idea+0.032
epad+0.029
 supposed+0.029
 happen+0.028
 possible+0.028
Neg logit contributions
Sorry-0.042
W-0.034
Grab-0.034
Boo-0.033
H-0.032
Token not
Token ID407
Feature activation-6.25e-02

Pos logit contributions
 supposed+0.032
epad+0.032
 idea+0.032
 possible+0.029
 recognise+0.029
Neg logit contributions
Sorry-0.044
W-0.036
Grab-0.036
Boo-0.035
ted-0.034
Token the
Token ID262
Feature activation-2.75e-02

Pos logit contributions
epad+0.096
 supposed+0.092
 idea+0.092
 recognise+0.089
 angle+0.087
Neg logit contributions
Sorry-0.125
W-0.106
Grab-0.104
Boo-0.098
ted-0.097
Token only
Token ID691
Feature activation-8.11e-02

Pos logit contributions
 supposed+0.041
epad+0.039
 recognise+0.037
 idea+0.036
 happen+0.035
Neg logit contributions
Sorry-0.061
W-0.051
Grab-0.050
Boo-0.048
Sp-0.047
Token color
Token ID3124
Feature activation-3.53e-02

Pos logit contributions
 supposed+0.135
epad+0.130
 recognise+0.125
 idea+0.125
 possible+0.122
Neg logit contributions
Sorry-0.156
W-0.129
Grab-0.129
ted-0.125
Boo-0.124
Token in
Token ID287
Feature activation-3.61e-02

Pos logit contributions
 supposed+0.056
epad+0.055
 idea+0.054
 recognise+0.051
 possible+0.050
Neg logit contributions
Sorry-0.073
W-0.061
Grab-0.059
Boo-0.059
ted-0.058
Token the
Token ID262
Feature activation+3.33e-03

Pos logit contributions
 supposed+0.056
 idea+0.056
epad+0.056
 possible+0.052
 recognise+0.051
Neg logit contributions
Sorry-0.077
W-0.064
Grab-0.064
Boo-0.062
ted-0.060
Token world
Token ID995
Feature activation-3.86e-02

Pos logit contributions
Sorry+0.008
Grab+0.006
W+0.006
Boo+0.006
ted+0.006
Neg logit contributions
 idea-0.005
epad-0.005
 supposed-0.005
 way-0.004
 possible-0.004
Token.
Token ID13
Feature activation-7.30e-03

Pos logit contributions
 supposed+0.059
epad+0.058
 idea+0.058
 recognise+0.054
 possible+0.054
Neg logit contributions
Sorry-0.082
W-0.069
Boo-0.067
Grab-0.067
ted-0.066
TokenThe
Token ID464
Feature activation-1.88e-02

Pos logit contributions
 possible+0.001
 recognise+0.001
 supposed+0.001
epad+0.001
 balancing+0.001
Neg logit contributions
Sorry-0.001
ted-0.001
ipped-0.001
Grab-0.001
ishly-0.001
Token dragon
Token ID10441
Feature activation-9.40e-03

Pos logit contributions
 supposed+0.021
epad+0.021
 idea+0.020
 possible+0.019
 recognise+0.018
Neg logit contributions
Sorry-0.049
W-0.042
Grab-0.042
Boo-0.041
ted-0.040
Token knew
Token ID2993
Feature activation-1.85e-02

Pos logit contributions
 idea+0.017
 supposed+0.016
epad+0.016
 possible+0.016
 balancing+0.015
Neg logit contributions
Sorry-0.018
Grab-0.015
W-0.015
Boo-0.014
ted-0.013
Token that
Token ID326
Feature activation-2.87e-02

Pos logit contributions
 supposed+0.029
 idea+0.029
epad+0.029
 possible+0.027
 recognise+0.026
Neg logit contributions
Sorry-0.039
Grab-0.033
W-0.033
Boo-0.032
ted-0.031
Token the
Token ID262
Feature activation-3.63e-02

Pos logit contributions
 supposed+0.045
epad+0.044
 idea+0.044
 recognise+0.040
 possible+0.040
Neg logit contributions
Sorry-0.061
W-0.051
Grab-0.050
Boo-0.049
ted-0.048
Token only
Token ID691
Feature activation-8.02e-02

Pos logit contributions
 supposed+0.043
epad+0.039
 recognise+0.037
 idea+0.035
 possible+0.035
Neg logit contributions
Sorry-0.091
W-0.078
Grab-0.077
Boo-0.077
ted-0.074
Token way
Token ID835
Feature activation-2.23e-02

Pos logit contributions
 supposed+0.119
epad+0.115
 idea+0.109
 recognise+0.106
 possible+0.104
Neg logit contributions
Sorry-0.169
W-0.142
Boo-0.142
Grab-0.141
ted-0.141
Token to
Token ID284
Feature activation-1.76e-02

Pos logit contributions
 supposed+0.034
 idea+0.034
epad+0.033
 possible+0.031
 recognise+0.030
Neg logit contributions
Sorry-0.048
W-0.040
Grab-0.040
Boo-0.039
ted-0.038
Token reach
Token ID3151
Feature activation-2.53e-03

Pos logit contributions
 supposed+0.024
epad+0.024
 idea+0.023
 pretending+0.021
 possible+0.021
Neg logit contributions
Sorry-0.040
W-0.034
Grab-0.034
Boo-0.033
ted-0.032
Token the
Token ID262
Feature activation-1.02e-03

Pos logit contributions
 supposed+0.004
epad+0.004
 idea+0.004
 recognise+0.004
 possible+0.004
Neg logit contributions
Sorry-0.005
W-0.004
Grab-0.004
ted-0.004
Boo-0.004
Token fairy
Token ID25607
Feature activation-9.96e-03

Pos logit contributions
 supposed+0.001
epad+0.001
 idea+0.001
 recognise+0.001
 possible+0.001
Neg logit contributions
Sorry-0.002
W-0.002
Grab-0.002
Boo-0.002
ted-0.002
Token mighty
Token ID18680
Feature activation-1.47e-02

Pos logit contributions
 supposed+0.020
 idea+0.019
 possible+0.018
epad+0.018
 pretending+0.017
Neg logit contributions
Sorry-0.044
Grab-0.038
ted-0.037
Boo-0.037
W-0.037
Token is
Token ID318
Feature activation-2.83e-02

Pos logit contributions
 supposed+0.027
 idea+0.027
epad+0.027
 possible+0.026
 recognise+0.025
Neg logit contributions
Sorry-0.027
Grab-0.022
W-0.022
Boo-0.021
ted-0.021
Token not
Token ID407
Feature activation-6.75e-02

Pos logit contributions
epad+0.039
 idea+0.039
 supposed+0.038
 recognise+0.035
 possible+0.035
Neg logit contributions
Sorry-0.065
W-0.055
Grab-0.054
Boo-0.052
ted-0.051
Token as
Token ID355
Feature activation-7.08e-02

Pos logit contributions
epad+0.100
 idea+0.097
 supposed+0.096
 recognise+0.093
 angle+0.090
Neg logit contributions
Sorry-0.140
W-0.118
Grab-0.115
Boo-0.110
ted-0.109
Token important
Token ID1593
Feature activation-1.36e-02

Pos logit contributions
epad+0.091
 supposed+0.090
 idea+0.089
 recognise+0.081
 possible+0.081
Neg logit contributions
Sorry-0.168
W-0.142
Grab-0.141
Boo-0.137
ted-0.133
Token as
Token ID355
Feature activation-8.01e-02

Pos logit contributions
epad+0.022
 supposed+0.022
 idea+0.021
 recognise+0.021
 possible+0.020
Neg logit contributions
Sorry-0.028
W-0.023
ted-0.022
Grab-0.022
Boo-0.021
Token being
Token ID852
Feature activation-3.27e-02

Pos logit contributions
 idea+0.112
epad+0.112
 supposed+0.111
 recognise+0.103
 possible+0.102
Neg logit contributions
Sorry-0.181
W-0.151
Grab-0.150
Boo-0.146
ted-0.143
Token nice
Token ID3621
Feature activation-1.42e-02

Pos logit contributions
 supposed+0.047
 idea+0.047
epad+0.047
 possible+0.044
 recognise+0.043
Neg logit contributions
Sorry-0.074
W-0.062
Grab-0.061
Boo-0.060
ted-0.058
Token.
Token ID13
Feature activation-9.28e-03

Pos logit contributions
epad+0.021
 supposed+0.021
 idea+0.021
 possible+0.020
 recognise+0.020
Neg logit contributions
Sorry-0.031
W-0.026
Grab-0.025
ted-0.025
Boo-0.025
Token<|endoftext|>
Token ID50256
Feature activation+2.83e-03

Pos logit contributions
 supposed+0.016
 idea+0.015
epad+0.015
 possible+0.015
 pretending+0.014
Neg logit contributions
Sorry-0.019
Grab-0.016
Boo-0.015
ted-0.015
W-0.015
TokenS
Token ID50
Feature activation+4.00e-02

Pos logit contributions
Sorry+0.004
Grab+0.003
W+0.003
Boo+0.003
ted+0.003
Neg logit contributions
 supposed-0.006
 idea-0.006
epad-0.006
 possible-0.006
 recognise-0.006
Token to
Token ID284
Feature activation-4.83e-02

Pos logit contributions
 idea+0.049
 supposed+0.049
epad+0.049
 possible+0.046
 recognise+0.044
Neg logit contributions
Sorry-0.076
Grab-0.064
W-0.063
Boo-0.061
ted-0.059
Token others
Token ID1854
Feature activation-5.06e-02

Pos logit contributions
 supposed+0.076
 idea+0.075
epad+0.075
 possible+0.071
 recognise+0.069
Neg logit contributions
Sorry-0.103
Grab-0.085
W-0.085
Boo-0.083
ted-0.080
Token is
Token ID318
Feature activation-3.93e-02

Pos logit contributions
 supposed+0.074
 idea+0.073
epad+0.073
 possible+0.069
 recognise+0.066
Neg logit contributions
Sorry-0.114
Grab-0.095
W-0.095
Boo-0.092
ted-0.090
Token more
Token ID517
Feature activation-3.71e-02

Pos logit contributions
epad+0.052
 idea+0.051
 supposed+0.050
 recognise+0.046
 possible+0.046
Neg logit contributions
Sorry-0.091
W-0.076
Grab-0.073
Boo-0.071
ted-0.070
Token important
Token ID1593
Feature activation-1.55e-02

Pos logit contributions
epad+0.045
 idea+0.043
 supposed+0.042
 happen+0.040
 recognise+0.039
Neg logit contributions
Sorry-0.090
W-0.078
Grab-0.074
H-0.073
Sp-0.072
Token than
Token ID621
Feature activation-7.99e-02

Pos logit contributions
 supposed+0.017
 idea+0.017
epad+0.017
 possible+0.016
 recognise+0.015
Neg logit contributions
Sorry-0.040
Grab-0.035
W-0.035
Boo-0.034
ted-0.033
Token being
Token ID852
Feature activation-4.03e-02

Pos logit contributions
 supposed+0.113
 idea+0.112
epad+0.112
 possible+0.106
 recognise+0.101
Neg logit contributions
Sorry-0.184
Grab-0.154
W-0.153
Boo-0.150
ted-0.146
Token jealous
Token ID19354
Feature activation-2.77e-02

Pos logit contributions
 supposed+0.055
epad+0.053
 idea+0.053
 possible+0.051
 recognise+0.047
Neg logit contributions
Sorry-0.097
Grab-0.083
W-0.081
Boo-0.080
ted-0.078
Token and
Token ID290
Feature activation-3.56e-02

Pos logit contributions
 idea+0.037
 supposed+0.037
epad+0.036
 possible+0.034
 happen+0.032
Neg logit contributions
Sorry-0.067
Grab-0.057
W-0.056
Boo-0.055
ted-0.053
Token selfish
Token ID20363
Feature activation-2.15e-02

Pos logit contributions
 supposed+0.046
epad+0.043
 possible+0.042
 idea+0.041
 happen+0.038
Neg logit contributions
Sorry-0.089
Grab-0.078
Boo-0.076
W-0.076
ted-0.074
Token.
Token ID13
Feature activation-2.33e-02

Pos logit contributions
 idea+0.027
 supposed+0.026
epad+0.025
 possible+0.024
 happen+0.023
Neg logit contributions
Sorry-0.054
Grab-0.047
W-0.046
Boo-0.045
ted-0.043
Token that
Token ID326
Feature activation-2.82e-02

Pos logit contributions
 supposed+0.020
 idea+0.020
 possible+0.019
epad+0.019
 balancing+0.018
Neg logit contributions
Sorry-0.040
Grab-0.035
Boo-0.034
W-0.033
ted-0.032
Token material
Token ID2587
Feature activation-3.85e-02

Pos logit contributions
 idea+0.041
 supposed+0.041
epad+0.041
 recognise+0.038
 possible+0.037
Neg logit contributions
Sorry-0.062
W-0.052
Grab-0.051
Sp-0.049
Boo-0.049
Token things
Token ID1243
Feature activation-4.95e-02

Pos logit contributions
 supposed+0.064
 idea+0.064
epad+0.062
 possible+0.061
 happen+0.059
Neg logit contributions
Sorry-0.080
Grab-0.066
W-0.065
Boo-0.064
ted-0.060
Token are
Token ID389
Feature activation-4.14e-02

Pos logit contributions
 supposed+0.089
epad+0.089
 idea+0.089
 possible+0.085
 recognise+0.082
Neg logit contributions
Sorry-0.094
Grab-0.075
W-0.075
Boo-0.073
ted-0.070
Token not
Token ID407
Feature activation-7.58e-02

Pos logit contributions
epad+0.057
 idea+0.054
 supposed+0.052
 angle+0.051
 recognise+0.051
Neg logit contributions
Sorry-0.091
W-0.077
Grab-0.075
Sp-0.072
H-0.071
Token as
Token ID355
Feature activation-7.98e-02

Pos logit contributions
epad+0.104
 supposed+0.097
 angle+0.097
 idea+0.096
 recognise+0.096
Neg logit contributions
Sorry-0.155
W-0.131
Grab-0.127
ted-0.122
L-0.121
Token important
Token ID1593
Feature activation-1.97e-02

Pos logit contributions
epad+0.097
 supposed+0.094
 idea+0.093
 recognise+0.086
 angle+0.084
Neg logit contributions
Sorry-0.190
W-0.162
Grab-0.159
Boo-0.153
ted-0.150
Token as
Token ID355
Feature activation-8.38e-02

Pos logit contributions
 supposed+0.032
 idea+0.031
epad+0.031
 recognise+0.030
 possible+0.029
Neg logit contributions
Sorry-0.039
ted-0.032
W-0.032
Grab-0.031
Boo-0.030
Token the
Token ID262
Feature activation-4.11e-02

Pos logit contributions
 idea+0.122
epad+0.122
 supposed+0.120
 recognise+0.112
 possible+0.110
Neg logit contributions
Sorry-0.182
W-0.152
Grab-0.150
Boo-0.146
ted-0.144
Token people
Token ID661
Feature activation-5.94e-02

Pos logit contributions
 supposed+0.057
epad+0.056
 idea+0.056
 possible+0.053
 recognise+0.052
Neg logit contributions
Sorry-0.095
W-0.079
Grab-0.079
Boo-0.077
ted-0.075
Token we
Token ID356
Feature activation-2.76e-02

Pos logit contributions
 supposed+0.091
 idea+0.091
epad+0.090
 possible+0.086
 recognise+0.082
Neg logit contributions
Sorry-0.128
W-0.106
Grab-0.106
Boo-0.103
ted-0.101
Token red
Token ID2266
Feature activation+5.71e-03

Pos logit contributions
epad+0.080
 supposed+0.079
 idea+0.077
 possible+0.075
 recognise+0.073
Neg logit contributions
Sorry-0.143
W-0.130
Grab-0.126
L-0.121
Boo-0.118
Token hair
Token ID4190
Feature activation-7.92e-03

Pos logit contributions
Sorry+0.014
Grab+0.012
W+0.011
Boo+0.011
H+0.011
Neg logit contributions
 idea-0.008
epad-0.007
 answer-0.007
 supposed-0.007
 way-0.007
Token and
Token ID290
Feature activation-2.28e-02

Pos logit contributions
epad+0.014
 supposed+0.014
 idea+0.014
 possible+0.013
 recognise+0.013
Neg logit contributions
Sorry-0.015
W-0.012
ted-0.012
Grab-0.012
Boo-0.012
Token the
Token ID262
Feature activation-2.84e-02

Pos logit contributions
 supposed+0.033
 idea+0.032
epad+0.032
 possible+0.030
 pretending+0.030
Neg logit contributions
Sorry-0.053
Grab-0.044
Boo-0.043
W-0.043
ted-0.042
Token biggest
Token ID4094
Feature activation-1.97e-02

Pos logit contributions
 supposed+0.038
epad+0.036
 idea+0.036
 possible+0.035
 recognise+0.033
Neg logit contributions
Sorry-0.068
W-0.058
Grab-0.058
Boo-0.056
ted-0.055
Token,
Token ID11
Feature activation-7.97e-02

Pos logit contributions
 supposed+0.031
epad+0.030
 possible+0.029
 recognise+0.028
 idea+0.028
Neg logit contributions
Sorry-0.041
W-0.035
Grab-0.034
Boo-0.033
ted-0.032
Token brightest
Token ID33871
Feature activation-1.82e-02

Pos logit contributions
 supposed+0.107
epad+0.106
 idea+0.099
 recognise+0.098
 quest+0.094
Neg logit contributions
Sorry-0.175
W-0.160
Grab-0.151
ted-0.147
Sp-0.147
Token,
Token ID11
Feature activation-7.39e-02

Pos logit contributions
 supposed+0.025
epad+0.025
 idea+0.024
 possible+0.023
 recognise+0.023
Neg logit contributions
Sorry-0.042
W-0.035
Grab-0.035
Boo-0.034
ted-0.034
Token green
Token ID4077
Feature activation-1.94e-02

Pos logit contributions
 supposed+0.095
epad+0.094
 idea+0.089
 recognise+0.085
 possible+0.083
Neg logit contributions
Sorry-0.168
W-0.152
Grab-0.145
ted-0.142
Boo-0.141
Token eyes
Token ID2951
Feature activation-3.72e-02

Pos logit contributions
 idea+0.027
 supposed+0.027
epad+0.027
 possible+0.025
 happen+0.024
Neg logit contributions
Sorry-0.045
Grab-0.038
W-0.037
Boo-0.036
ted-0.035
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
 idea+0.068
 supposed+0.066
epad+0.065
 quest+0.063
 recognise+0.063
Neg logit contributions
Sorry-0.065
ted-0.057
W-0.053
Boo-0.050
ipped-0.048
Token things
Token ID1243
Feature activation-2.36e-02

Pos logit contributions
 supposed+0.037
 idea+0.037
epad+0.036
 possible+0.035
 recognise+0.034
Neg logit contributions
Sorry-0.056
Grab-0.047
W-0.047
Boo-0.046
ted-0.044
Token are
Token ID389
Feature activation-3.17e-02

Pos logit contributions
epad+0.043
 supposed+0.043
 idea+0.043
 possible+0.041
 recognise+0.040
Neg logit contributions
Sorry-0.044
Grab-0.035
W-0.035
Boo-0.034
ted-0.033
Token not
Token ID407
Feature activation-6.86e-02

Pos logit contributions
epad+0.044
 idea+0.041
 supposed+0.040
 angle+0.039
 recognise+0.039
Neg logit contributions
Sorry-0.071
W-0.060
Grab-0.059
Sp-0.056
ted-0.055
Token what
Token ID644
Feature activation-5.89e-02

Pos logit contributions
epad+0.101
 angle+0.093
 idea+0.092
 recognise+0.092
 balancing+0.091
Neg logit contributions
Sorry-0.134
W-0.115
Grab-0.112
ted-0.107
Boo-0.104
Token they
Token ID484
Feature activation-5.63e-02

Pos logit contributions
epad+0.106
 supposed+0.103
 idea+0.103
 recognise+0.097
 possible+0.096
Neg logit contributions
Sorry-0.110
W-0.090
Grab-0.088
Boo-0.085
ted-0.084
Token seem
Token ID1283
Feature activation-7.90e-02

Pos logit contributions
epad+0.079
 supposed+0.076
 idea+0.075
 possible+0.074
 balancing+0.070
Neg logit contributions
Sorry-0.129
W-0.108
Grab-0.107
Boo-0.103
ted-0.103
Token.
Token ID13
Feature activation-3.59e-03

Pos logit contributions
epad+0.137
 throne+0.137
 angle+0.131
 quest+0.128
 page+0.126
Neg logit contributions
Sorry-0.126
W-0.097
s-0.094
olded-0.093
 turns-0.093
Token And
Token ID843
Feature activation-5.53e-03

Pos logit contributions
 supposed+0.007
 idea+0.007
epad+0.007
 possible+0.006
 pretending+0.006
Neg logit contributions
Sorry-0.007
Grab-0.005
Boo-0.005
W-0.005
ted-0.005
Token they
Token ID484
Feature activation+7.49e-03

Pos logit contributions
 supposed+0.010
epad+0.010
 idea+0.010
 possible+0.009
 recognise+0.009
Neg logit contributions
Sorry-0.010
W-0.008
Grab-0.008
Boo-0.008
ted-0.008
Token learned
Token ID4499
Feature activation-4.33e-03

Pos logit contributions
Sorry+0.017
Grab+0.014
Boo+0.014
W+0.013
ted+0.013
Neg logit contributions
 idea-0.012
 supposed-0.011
 answer-0.010
 recognise-0.010
epad-0.010
Token that
Token ID326
Feature activation-1.34e-02

Pos logit contributions
 supposed+0.005
 idea+0.005
 possible+0.005
 happen+0.005
 answer+0.005
Neg logit contributions
Sorry-0.011
Grab-0.010
Boo-0.010
W-0.010
ipped-0.009
Token giving
Token ID3501
Feature activation-2.45e-02

Pos logit contributions
epad+0.068
 idea+0.067
 supposed+0.066
 possible+0.061
 recognise+0.061
Neg logit contributions
Sorry-0.099
W-0.082
Grab-0.081
Boo-0.080
ted-0.078
Token up
Token ID510
Feature activation-3.19e-02

Pos logit contributions
 supposed+0.035
 idea+0.034
 possible+0.033
epad+0.033
 way+0.031
Neg logit contributions
Sorry-0.059
Grab-0.049
W-0.048
Boo-0.047
H-0.045
Token was
Token ID373
Feature activation-2.20e-02

Pos logit contributions
epad+0.049
 idea+0.048
 supposed+0.048
 recognise+0.045
 possible+0.044
Neg logit contributions
Sorry-0.067
W-0.056
Grab-0.054
Boo-0.053
ted-0.053
Token sometimes
Token ID3360
Feature activation-2.81e-02

Pos logit contributions
 supposed+0.032
 idea+0.032
epad+0.032
 possible+0.030
 recognise+0.029
Neg logit contributions
Sorry-0.049
Grab-0.041
W-0.041
Boo-0.040
ted-0.039
Token the
Token ID262
Feature activation-2.89e-02

Pos logit contributions
epad+0.047
 supposed+0.045
 recognise+0.044
 idea+0.043
arter+0.041
Neg logit contributions
Sorry-0.054
W-0.043
Grab-0.041
ted-0.040
Boo-0.040
Token only
Token ID691
Feature activation-7.80e-02

Pos logit contributions
epad+0.039
 supposed+0.039
 happen+0.036
 recognise+0.035
 idea+0.034
Neg logit contributions
Sorry-0.065
W-0.055
Grab-0.055
Boo-0.052
Sp-0.051
Token way
Token ID835
Feature activation-2.54e-02

Pos logit contributions
epad+0.098
 supposed+0.096
 idea+0.091
 recognise+0.087
 possible+0.085
Neg logit contributions
Sorry-0.180
Grab-0.154
W-0.152
Boo-0.149
ted-0.147
Token to
Token ID284
Feature activation-2.08e-02

Pos logit contributions
 idea+0.037
epad+0.037
 supposed+0.037
 possible+0.034
 recognise+0.033
Neg logit contributions
Sorry-0.056
W-0.047
Grab-0.047
Boo-0.046
ted-0.045
Token succeed
Token ID6758
Feature activation-3.94e-02

Pos logit contributions
 idea+0.026
 supposed+0.026
epad+0.026
 possible+0.025
 happen+0.023
Neg logit contributions
Sorry-0.051
Grab-0.044
W-0.043
Boo-0.043
ted-0.042
Token.
Token ID13
Feature activation+5.60e-03

Pos logit contributions
 idea+0.063
epad+0.063
 supposed+0.062
 recognise+0.057
 pretending+0.055
Neg logit contributions
Sorry-0.080
W-0.067
Grab-0.063
Boo-0.062
ted-0.062
Token<|endoftext|>
Token ID50256
Feature activation-2.66e-03

Pos logit contributions
Sorry+0.012
Grab+0.009
Boo+0.009
W+0.009
ted+0.009
Neg logit contributions
 supposed-0.009
 idea-0.009
epad-0.009
 possible-0.009
 recognise-0.009
Token things
Token ID1243
Feature activation-2.82e-02

Pos logit contributions
 supposed+0.050
epad+0.050
 idea+0.048
 possible+0.046
 recognise+0.046
Neg logit contributions
Sorry-0.058
W-0.047
Grab-0.047
Boo-0.045
ted-0.044
Token are
Token ID389
Feature activation-3.07e-02

Pos logit contributions
 supposed+0.046
 idea+0.045
epad+0.045
 possible+0.043
 recognise+0.041
Neg logit contributions
Sorry-0.059
Grab-0.048
W-0.048
Boo-0.047
ted-0.045
Token not
Token ID407
Feature activation-6.18e-02

Pos logit contributions
epad+0.041
 idea+0.038
 supposed+0.037
 recognise+0.036
 angle+0.036
Neg logit contributions
Sorry-0.069
W-0.059
Grab-0.057
Sp-0.055
ted-0.054
Token what
Token ID644
Feature activation-5.18e-02

Pos logit contributions
epad+0.083
 angle+0.078
 balancing+0.077
 throne+0.076
 quest+0.076
Neg logit contributions
Sorry-0.126
W-0.106
Grab-0.103
ted-0.098
H-0.097
Token they
Token ID484
Feature activation-5.18e-02

Pos logit contributions
epad+0.096
 supposed+0.096
 idea+0.094
 recognise+0.089
 possible+0.089
Neg logit contributions
Sorry-0.093
W-0.074
Grab-0.072
Boo-0.071
ted-0.069
Token seem
Token ID1283
Feature activation-7.77e-02

Pos logit contributions
epad+0.071
 supposed+0.069
 idea+0.068
 possible+0.066
 balancing+0.063
Neg logit contributions
Sorry-0.120
W-0.100
Grab-0.100
Boo-0.097
ted-0.096
Token,
Token ID11
Feature activation-5.03e-02

Pos logit contributions
 throne+0.122
epad+0.115
 page+0.111
 angle+0.109
 quest+0.108
Neg logit contributions
Sorry-0.138
s-0.110
W-0.109
olded-0.106
 turns-0.105
Token just
Token ID655
Feature activation-3.98e-02

Pos logit contributions
epad+0.053
 supposed+0.053
 idea+0.049
 possible+0.045
 recognise+0.045
Neg logit contributions
Sorry-0.132
W-0.114
Grab-0.112
Boo-0.109
Sp-0.108
Token like
Token ID588
Feature activation-3.48e-02

Pos logit contributions
epad+0.060
 supposed+0.058
 idea+0.057
 recognise+0.053
 possible+0.052
Neg logit contributions
Sorry-0.082
Grab-0.069
W-0.068
ted-0.066
Boo-0.065
Token the
Token ID262
Feature activation-2.76e-02

Pos logit contributions
epad+0.053
 supposed+0.053
 idea+0.053
 possible+0.049
 recognise+0.049
Neg logit contributions
Sorry-0.073
W-0.061
Grab-0.060
Boo-0.059
ted-0.058
Token lion
Token ID18744
Feature activation-3.84e-02

Pos logit contributions
 supposed+0.041
epad+0.039
 recognise+0.037
onna+0.037
 possible+0.034
Neg logit contributions
Sorry-0.058
W-0.049
Boo-0.048
Grab-0.047
ted-0.046
Token and
Token ID290
Feature activation-2.07e-02

Pos logit contributions
 supposed+0.027
 idea+0.026
epad+0.026
 possible+0.024
 happen+0.023
Neg logit contributions
Sorry-0.045
Grab-0.038
W-0.037
Boo-0.037
ted-0.035
Token never
Token ID1239
Feature activation-4.13e-02

Pos logit contributions
 supposed+0.029
 idea+0.029
epad+0.028
 possible+0.027
 pretending+0.026
Neg logit contributions
Sorry-0.049
Grab-0.041
W-0.041
Boo-0.040
ted-0.038
Token worried
Token ID7960
Feature activation-2.95e-02

Pos logit contributions
 supposed+0.073
 idea+0.073
epad+0.073
 possible+0.069
 recognise+0.067
Neg logit contributions
Sorry-0.080
Grab-0.065
W-0.065
Boo-0.063
ted-0.061
Token about
Token ID546
Feature activation-2.81e-02

Pos logit contributions
 idea+0.033
epad+0.031
 supposed+0.031
 recognise+0.031
 possible+0.030
Neg logit contributions
Sorry-0.065
ted-0.061
W-0.057
Boo-0.055
Sp-0.053
Token what
Token ID644
Feature activation-2.86e-02

Pos logit contributions
epad+0.046
 idea+0.046
 supposed+0.043
 recognise+0.042
 possible+0.042
Neg logit contributions
Sorry-0.051
W-0.045
ted-0.043
Grab-0.042
Sp-0.041
Token might
Token ID1244
Feature activation-7.74e-02

Pos logit contributions
epad+0.051
 idea+0.049
 supposed+0.048
 recognise+0.046
 possible+0.046
Neg logit contributions
Sorry-0.055
W-0.047
Boo-0.044
Grab-0.044
ted-0.043
Token happen
Token ID1645
Feature activation-3.04e-02

Pos logit contributions
epad+0.132
od+0.111
 idea+0.110
eways+0.110
 angle+0.107
Neg logit contributions
Sorry-0.130
W-0.116
Boo-0.106
ted-0.101
Grab-0.101
Token.
Token ID13
Feature activation-3.20e-02

Pos logit contributions
epad+0.058
 idea+0.058
 supposed+0.057
 recognise+0.055
 comment+0.055
Neg logit contributions
Sorry-0.049
W-0.041
H-0.036
Sp-0.036
L-0.036
Token\n
Token ID198
Feature activation-3.21e-02

Pos logit contributions
 idea+0.058
 supposed+0.058
epad+0.057
 possible+0.055
 recognise+0.053
Neg logit contributions
Sorry-0.062
Grab-0.049
W-0.049
Boo-0.048
ted-0.046
Token\n
Token ID198
Feature activation-2.32e-02

Pos logit contributions
epad+0.079
 supposed+0.077
 idea+0.075
 possible+0.074
 happen+0.073
Neg logit contributions
Sorry-0.042
W-0.034
L-0.031
H-0.030
Boo-0.030
TokenOne
Token ID3198
Feature activation-2.51e-02

Pos logit contributions
 supposed+0.044
 idea+0.043
epad+0.043
 possible+0.042
 recognise+0.041
Neg logit contributions
Sorry-0.042
Grab-0.033
ted-0.033
Boo-0.032
W-0.032
Token wealthy
Token ID11574
Feature activation-7.91e-03

Pos logit contributions
 supposed+0.027
 idea+0.025
 possible+0.024
epad+0.023
 answer+0.022
Neg logit contributions
Sorry-0.056
Grab-0.048
Boo-0.047
ted-0.047
W-0.047
Token is
Token ID318
Feature activation-1.96e-02

Pos logit contributions
 idea+0.015
 supposed+0.014
epad+0.014
 possible+0.014
 happen+0.013
Neg logit contributions
Sorry-0.015
Grab-0.012
W-0.012
Boo-0.012
ted-0.011
Token not
Token ID407
Feature activation-6.37e-02

Pos logit contributions
 supposed+0.028
 idea+0.027
epad+0.027
 possible+0.026
 recognise+0.024
Neg logit contributions
Sorry-0.046
Grab-0.038
W-0.038
Boo-0.037
ted-0.036
Token as
Token ID355
Feature activation-6.89e-02

Pos logit contributions
epad+0.094
 idea+0.091
 supposed+0.091
 recognise+0.086
 possible+0.085
Neg logit contributions
Sorry-0.136
W-0.114
Grab-0.113
Boo-0.108
ted-0.106
Token important
Token ID1593
Feature activation-9.89e-03

Pos logit contributions
epad+0.086
 supposed+0.085
 idea+0.085
 possible+0.078
 recognise+0.077
Neg logit contributions
Sorry-0.166
W-0.140
Grab-0.140
Boo-0.135
ted-0.131
Token as
Token ID355
Feature activation-7.73e-02

Pos logit contributions
 supposed+0.015
epad+0.015
 idea+0.015
 recognise+0.014
 possible+0.014
Neg logit contributions
Sorry-0.021
W-0.017
Grab-0.017
ted-0.017
Boo-0.016
Token being
Token ID852
Feature activation-2.81e-02

Pos logit contributions
epad+0.106
 idea+0.106
 supposed+0.105
 recognise+0.097
 possible+0.097
Neg logit contributions
Sorry-0.176
W-0.147
Grab-0.146
Boo-0.141
ted-0.139
Token kind
Token ID1611
Feature activation-2.52e-02

Pos logit contributions
 supposed+0.039
 idea+0.039
epad+0.038
 possible+0.036
 recognise+0.034
Neg logit contributions
Sorry-0.066
Grab-0.056
W-0.055
Boo-0.054
ted-0.053
Token.
Token ID13
Feature activation-2.48e-03

Pos logit contributions
 supposed+0.035
 idea+0.034
epad+0.034
 possible+0.032
 recognise+0.031
Neg logit contributions
Sorry-0.059
Grab-0.049
W-0.049
Boo-0.048
ted-0.047
Token They
Token ID1119
Feature activation+1.78e-02

Pos logit contributions
 supposed+0.005
 idea+0.004
epad+0.004
 possible+0.004
 recognise+0.004
Neg logit contributions
Sorry-0.005
Grab-0.004
Boo-0.004
W-0.004
ted-0.004
Token learned
Token ID4499
Feature activation+2.04e-03

Pos logit contributions
Sorry+0.041
Grab+0.035
ipped+0.034
ted+0.033
Boo+0.033
Neg logit contributions
 idea-0.027
 answer-0.024
 supposed-0.024
 recognise-0.023
 comment-0.022
Token things
Token ID1243
Feature activation-2.78e-02

Pos logit contributions
 supposed+0.037
epad+0.037
 idea+0.036
 possible+0.034
 recognise+0.034
Neg logit contributions
Sorry-0.043
W-0.035
Grab-0.034
Boo-0.033
ted-0.032
Token are
Token ID389
Feature activation-3.41e-02

Pos logit contributions
epad+0.050
 supposed+0.050
 idea+0.049
 possible+0.047
 recognise+0.046
Neg logit contributions
Sorry-0.052
Grab-0.042
W-0.042
Boo-0.040
ted-0.040
Token not
Token ID407
Feature activation-6.21e-02

Pos logit contributions
epad+0.048
 idea+0.043
 supposed+0.043
 angle+0.042
 recognise+0.042
Neg logit contributions
Sorry-0.074
W-0.063
Grab-0.060
Sp-0.058
H-0.057
Token what
Token ID644
Feature activation-5.70e-02

Pos logit contributions
epad+0.089
 angle+0.085
 balancing+0.082
 throne+0.082
 quest+0.082
Neg logit contributions
Sorry-0.117
W-0.099
Grab-0.095
ted-0.092
H-0.091
Token they
Token ID484
Feature activation-5.32e-02

Pos logit contributions
epad+0.103
 supposed+0.101
 idea+0.099
 recognise+0.095
 possible+0.094
Neg logit contributions
Sorry-0.106
W-0.086
Grab-0.083
Boo-0.082
ted-0.080
Token seem
Token ID1283
Feature activation-7.69e-02

Pos logit contributions
epad+0.077
 supposed+0.074
 possible+0.072
 idea+0.072
 balancing+0.069
Neg logit contributions
Sorry-0.118
W-0.098
Grab-0.097
Boo-0.094
ted-0.093
Token.
Token ID13
Feature activation+9.52e-04

Pos logit contributions
 throne+0.129
epad+0.123
 angle+0.119
 quest+0.118
lf+0.117
Neg logit contributions
Sorry-0.128
W-0.099
s-0.096
olded-0.095
H-0.093
Token They
Token ID1119
Feature activation+8.16e-03

Pos logit contributions
Sorry+0.002
Grab+0.002
Boo+0.001
ted+0.001
W+0.001
Neg logit contributions
 supposed-0.002
epad-0.002
 idea-0.002
 possible-0.002
 pretending-0.002
Token thanked
Token ID26280
Feature activation-1.20e-02

Pos logit contributions
Sorry+0.019
Grab+0.016
Boo+0.016
W+0.016
ipped+0.016
Neg logit contributions
 idea-0.012
 supposed-0.011
 answer-0.011
 recognise-0.010
epad-0.010
Token mom
Token ID1995
Feature activation+2.85e-03

Pos logit contributions
 idea+0.018
epad+0.018
 supposed+0.018
 possible+0.017
 happen+0.016
Neg logit contributions
Sorry-0.027
Grab-0.022
W-0.022
Boo-0.022
ted-0.021
Tokenmy
Token ID1820
Feature activation+1.31e-03

Pos logit contributions
Sorry+0.007
W+0.006
Grab+0.006
Boo+0.005
ted+0.005
Neg logit contributions
 supposed-0.004
epad-0.004
 idea-0.004
 possible-0.004
 recognise-0.003
Token "
Token ID366
Feature activation+2.18e-02

Pos logit contributions
 supposed+0.034
 idea+0.034
epad+0.034
 possible+0.032
 recognise+0.031
Neg logit contributions
Sorry-0.054
Grab-0.045
W-0.045
Boo-0.044
ted-0.043
TokenThis
Token ID1212
Feature activation+9.88e-03

Pos logit contributions
Sorry+0.030
ted+0.023
Grab+0.023
ipped+0.022
W+0.021
Neg logit contributions
epad-0.047
 supposed-0.046
 idea-0.046
 possible-0.045
 recognise-0.045
Token is
Token ID318
Feature activation-1.61e-02

Pos logit contributions
Sorry+0.016
W+0.012
Grab+0.012
Boo+0.012
ted+0.012
Neg logit contributions
epad-0.021
 supposed-0.021
 idea-0.020
 possible-0.020
 recognise-0.019
Token not
Token ID407
Feature activation-4.77e-02

Pos logit contributions
epad+0.027
 idea+0.026
 supposed+0.026
 recognise+0.025
 possible+0.025
Neg logit contributions
Sorry-0.032
W-0.027
Grab-0.027
Boo-0.026
ted-0.025
Token the
Token ID262
Feature activation-2.31e-02

Pos logit contributions
epad+0.080
 idea+0.076
 recognise+0.076
 supposed+0.075
 angle+0.075
Neg logit contributions
Sorry-0.086
W-0.073
Grab-0.070
L-0.067
Boo-0.067
Token only
Token ID691
Feature activation-7.69e-02

Pos logit contributions
 supposed+0.031
epad+0.031
 recognise+0.030
 idea+0.028
 possible+0.027
Neg logit contributions
Sorry-0.051
W-0.043
Grab-0.042
Boo-0.041
ted-0.040
Token red
Token ID2266
Feature activation-2.74e-02

Pos logit contributions
 supposed+0.112
epad+0.110
 idea+0.104
 recognise+0.102
 possible+0.101
Neg logit contributions
Sorry-0.164
W-0.139
Grab-0.139
Boo-0.135
ted-0.134
Token cr
Token ID1067
Feature activation-1.74e-02

Pos logit contributions
 idea+0.039
epad+0.038
 supposed+0.037
 possible+0.036
 answer+0.034
Neg logit contributions
Sorry-0.064
Grab-0.054
W-0.054
Boo-0.053
ted-0.050
Tokenay
Token ID323
Feature activation-2.33e-02

Pos logit contributions
 supposed+0.035
 idea+0.035
epad+0.035
 possible+0.034
 recognise+0.033
Neg logit contributions
Sorry-0.029
Grab-0.023
W-0.023
Boo-0.022
ted-0.021
Tokenon
Token ID261
Feature activation-1.40e-02

Pos logit contributions
 supposed+0.038
 idea+0.038
epad+0.037
 possible+0.036
 recognise+0.035
Neg logit contributions
Sorry-0.049
Grab-0.040
W-0.040
Boo-0.039
ted-0.038
Token.
Token ID13
Feature activation-2.58e-02

Pos logit contributions
 supposed+0.023
epad+0.022
 idea+0.022
 recognise+0.022
 possible+0.021
Neg logit contributions
Sorry-0.028
W-0.023
ted-0.022
Boo-0.022
Grab-0.021
Token.
Token ID13
Feature activation-1.82e-02

Pos logit contributions
epad+0.057
 idea+0.056
 supposed+0.055
 possible+0.055
 happen+0.052
Neg logit contributions
Sorry-0.061
W-0.053
L-0.049
H-0.049
ted-0.048
Token Jack
Token ID3619
Feature activation-3.34e-02

Pos logit contributions
 supposed+0.038
 possible+0.037
epad+0.036
 happen+0.034
 idea+0.034
Neg logit contributions
Sorry-0.030
Grab-0.025
W-0.025
Boo-0.023
Sp-0.023
Token was
Token ID373
Feature activation-1.24e-02

Pos logit contributions
 supposed+0.059
 idea+0.059
epad+0.058
 possible+0.056
 recognise+0.054
Neg logit contributions
Sorry-0.065
Grab-0.053
W-0.052
Boo-0.051
ted-0.049
Token very
Token ID845
Feature activation-6.47e-03

Pos logit contributions
epad+0.024
 happen+0.023
 possible+0.022
 idea+0.022
 supposed+0.022
Neg logit contributions
Sorry-0.022
W-0.019
Grab-0.018
H-0.016
L-0.016
Token smart
Token ID4451
Feature activation-1.77e-02

Pos logit contributions
epad+0.009
 supposed+0.008
 idea+0.008
 possible+0.008
 happen+0.007
Neg logit contributions
Sorry-0.016
W-0.014
Grab-0.013
Boo-0.013
H-0.012
Token,
Token ID11
Feature activation-7.64e-02

Pos logit contributions
epad+0.033
 possible+0.033
 recognise+0.032
 supposed+0.032
 happen+0.032
Neg logit contributions
Sorry-0.030
W-0.025
ted-0.025
L-0.023
H-0.023
Token but
Token ID475
Feature activation-3.76e-02

Pos logit contributions
epad+0.127
 idea+0.122
 supposed+0.116
 possible+0.115
 recognise+0.113
Neg logit contributions
Sorry-0.144
W-0.133
L-0.126
ted-0.122
Grab-0.122
Token he
Token ID339
Feature activation-2.18e-02

Pos logit contributions
 supposed+0.054
epad+0.053
 idea+0.053
 possible+0.050
 recognise+0.048
Neg logit contributions
Sorry-0.086
W-0.072
Grab-0.072
Boo-0.070
ted-0.068
Token was
Token ID373
Feature activation-1.90e-02

Pos logit contributions
 supposed+0.035
 idea+0.034
 balancing+0.032
epad+0.031
 pretending+0.031
Neg logit contributions
Sorry-0.046
Grab-0.039
ted-0.038
Boo-0.037
W-0.037
Token having
Token ID1719
Feature activation+1.50e-03

Pos logit contributions
epad+0.030
 idea+0.029
 happen+0.028
 angle+0.027
 await+0.027
Neg logit contributions
Sorry-0.039
W-0.034
Grab-0.032
H-0.031
Boo-0.030
Token trouble
Token ID5876
Feature activation-2.05e-02

Pos logit contributions
Sorry+0.003
W+0.003
Grab+0.003
Boo+0.003
ted+0.003
Neg logit contributions
 supposed-0.002
epad-0.002
 idea-0.002
 possible-0.002
 recognise-0.002
Token scary
Token ID14343
Feature activation-2.57e-02

Pos logit contributions
 supposed+0.035
 idea+0.035
epad+0.035
 possible+0.033
 recognise+0.032
Neg logit contributions
Sorry-0.054
Grab-0.045
W-0.045
Boo-0.044
ted-0.043
Token things
Token ID1243
Feature activation-3.33e-02

Pos logit contributions
epad+0.044
 supposed+0.044
 idea+0.043
 possible+0.041
 recognise+0.040
Neg logit contributions
Sorry-0.051
W-0.041
Grab-0.041
Boo-0.040
ted-0.039
Token can
Token ID460
Feature activation-5.01e-02

Pos logit contributions
 supposed+0.059
epad+0.059
 idea+0.058
 possible+0.056
 recognise+0.054
Neg logit contributions
Sorry-0.064
Grab-0.051
W-0.051
Boo-0.050
ted-0.048
Token have
Token ID423
Feature activation-2.58e-02

Pos logit contributions
epad+0.074
 supposed+0.073
 idea+0.069
 possible+0.065
 pretending+0.064
Neg logit contributions
Sorry-0.112
W-0.094
Grab-0.092
Boo-0.090
ted-0.086
Token happy
Token ID3772
Feature activation-1.65e-02

Pos logit contributions
 supposed+0.037
 idea+0.037
epad+0.037
 possible+0.035
 recognise+0.033
Neg logit contributions
Sorry-0.059
Grab-0.049
W-0.049
Boo-0.048
ted-0.047
Token endings
Token ID38168
Feature activation-7.62e-02

Pos logit contributions
 supposed+0.026
epad+0.026
 idea+0.025
 possible+0.024
 recognise+0.023
Neg logit contributions
Sorry-0.035
W-0.029
Grab-0.028
Boo-0.028
ted-0.027
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
epad+0.129
 supposed+0.129
 idea+0.126
 angle+0.120
arter+0.119
Neg logit contributions
Sorry-0.134
W-0.108
Sp-0.103
Boo-0.102
ted-0.100
Token<|endoftext|>
Token ID50256
Feature activation+3.50e-03

Pos logit contributions
Sorry+0.025
Grab+0.019
Boo+0.019
W+0.019
ted+0.019
Neg logit contributions
 supposed-0.018
epad-0.018
 idea-0.018
 possible-0.017
 balancing-0.017
TokenOnce
Token ID7454
Feature activation+1.10e-02

Pos logit contributions
Sorry+0.006
Grab+0.005
W+0.004
Boo+0.004
ted+0.004
Neg logit contributions
 idea-0.007
 supposed-0.007
epad-0.007
 possible-0.007
 balancing-0.006
Token upon
Token ID2402
Feature activation+1.81e-02

Pos logit contributions
Sorry+0.015
L+0.012
W+0.012
ted+0.011
ipped+0.011
Neg logit contributions
epad-0.024
 supposed-0.024
 recognise-0.022
 recipe-0.021
 pretending-0.021
Token a
Token ID257
Feature activation+5.14e-03

Pos logit contributions
Sorry+0.031
W+0.025
Grab+0.025
Boo+0.024
H+0.022
Neg logit contributions
 supposed-0.035
epad-0.035
 idea-0.035
 possible-0.034
 recognise-0.033
Token was
Token ID373
Feature activation-2.03e-02

Pos logit contributions
 supposed+0.054
 idea+0.053
epad+0.053
 possible+0.051
 recognise+0.049
Neg logit contributions
Sorry-0.054
Grab-0.043
W-0.043
Boo-0.041
ted-0.040
Token.
Token ID13
Feature activation-6.07e-03

Pos logit contributions
epad+0.029
 angle+0.026
 recognise+0.026
 idea+0.026
 quest+0.026
Neg logit contributions
Sorry-0.038
W-0.033
ted-0.031
H-0.031
Grab-0.029
Token
Token ID220
Feature activation+4.17e-04

Pos logit contributions
 idea+0.010
epad+0.009
 supposed+0.009
 way+0.009
 recognise+0.009
Neg logit contributions
Sorry-0.013
Grab-0.011
W-0.010
Boo-0.010
ted-0.010
Token
Token ID198
Feature activation-7.41e-03

Pos logit contributions
Sorry+0.001
Grab+0.001
W+0.001
Boo+0.001
ted+0.001
Neg logit contributions
 supposed-0.001
 idea-0.001
epad-0.001
 possible-0.001
 recognise-0.001
Token
Token ID198
Feature activation+1.90e-03

Pos logit contributions
 supposed+0.012
 idea+0.012
epad+0.012
 possible+0.012
 recognise+0.012
Neg logit contributions
Sorry-0.015
Grab-0.012
ted-0.012
Boo-0.012
W-0.012
Token"
Token ID1
Feature activation+5.41e-02

Pos logit contributions
ted+0.003
Sorry+0.003
ipped+0.003
Grab+0.003
Boo+0.003
Neg logit contributions
 supposed-0.003
 recognise-0.003
 possible-0.003
 impressed-0.003
 idea-0.003
TokenWhere
Token ID8496
Feature activation+4.24e-02

Pos logit contributions
Sorry+0.074
ipped+0.072
Grab+0.065
ugg+0.064
ted+0.064
Neg logit contributions
 possible-0.105
 supposed-0.102
epad-0.100
 recognise-0.097
 idea-0.097
Token should
Token ID815
Feature activation-7.25e-04

Pos logit contributions
Sorry+0.079
Grab+0.063
W+0.063
Boo+0.060
ted+0.059
Neg logit contributions
 supposed-0.080
 idea-0.079
epad-0.079
 possible-0.076
 recognise-0.073
Token we
Token ID356
Feature activation+6.85e-03

Pos logit contributions
 supposed+0.001
 idea+0.001
epad+0.001
 possible+0.001
 happen+0.001
Neg logit contributions
Sorry-0.001
Grab-0.001
W-0.001
Boo-0.001
ted-0.001
Token put
Token ID1234
Feature activation+4.80e-03

Pos logit contributions
Sorry+0.016
W+0.013
Grab+0.013
Boo+0.013
ted+0.012
Neg logit contributions
epad-0.010
 supposed-0.009
 idea-0.009
 possible-0.009
 pretending-0.009
Token him
Token ID683
Feature activation+2.98e-03

Pos logit contributions
Sorry+0.012
Boo+0.010
Grab+0.010
W+0.009
Fine+0.009
Neg logit contributions
 idea-0.006
 possible-0.006
 supposed-0.006
 happen-0.006
 way-0.006
Token haircut
Token ID45548
Feature activation+3.15e-02

Pos logit contributions
Sorry+0.026
W+0.021
Grab+0.021
Boo+0.020
ipped+0.020
Neg logit contributions
epad-0.018
 supposed-0.018
 recognise-0.017
 possible-0.017
 idea-0.016
Token."
Token ID526
Feature activation+2.02e-02

Pos logit contributions
Sorry+0.075
Grab+0.065
W+0.064
Boo+0.062
ted+0.060
Neg logit contributions
 idea-0.043
 supposed-0.042
epad-0.042
 possible-0.040
 happen-0.038
Token Tom
Token ID4186
Feature activation+1.29e-02

Pos logit contributions
Sorry+0.039
Grab+0.032
W+0.032
Boo+0.031
ted+0.030
Neg logit contributions
 idea-0.035
 supposed-0.035
epad-0.034
 possible-0.033
 pretending-0.032
Token said
Token ID531
Feature activation+1.49e-02

Pos logit contributions
Sorry+0.032
Grab+0.026
Boo+0.026
W+0.026
Sp+0.026
Neg logit contributions
 idea-0.017
 way-0.015
 answer-0.015
 pretending-0.014
 balance-0.014
Token,
Token ID11
Feature activation-6.75e-03

Pos logit contributions
Sorry+0.031
W+0.025
Grab+0.025
Boo+0.024
ted+0.024
Neg logit contributions
 supposed-0.025
epad-0.024
 idea-0.023
 possible-0.023
 recognise-0.022
Token "
Token ID366
Feature activation+5.88e-02

Pos logit contributions
 supposed+0.010
 idea+0.010
epad+0.010
 possible+0.009
 recognise+0.009
Neg logit contributions
Sorry-0.016
W-0.013
Grab-0.013
Boo-0.012
ted-0.012
TokenYes
Token ID5297
Feature activation+4.63e-02

Pos logit contributions
ipped+0.083
ted+0.076
Sorry+0.073
Grab+0.072
uddled+0.068
Neg logit contributions
epad-0.103
 possible-0.103
 idea-0.103
cest-0.098
 interested-0.096
Token,
Token ID11
Feature activation+4.43e-03

Pos logit contributions
Sorry+0.111
Grab+0.097
W+0.094
Boo+0.093
ted+0.090
Neg logit contributions
 idea-0.061
 supposed-0.059
epad-0.059
 possible-0.057
 happen-0.055
Token mom
Token ID1995
Feature activation+1.15e-02

Pos logit contributions
Sorry+0.009
W+0.007
Grab+0.007
Boo+0.007
L+0.006
Neg logit contributions
 supposed-0.008
epad-0.008
 idea-0.008
 recognise-0.007
 possible-0.007
Token,
Token ID11
Feature activation+1.88e-02

Pos logit contributions
Sorry+0.028
Grab+0.024
W+0.023
Boo+0.023
ted+0.022
Neg logit contributions
 supposed-0.016
 idea-0.015
epad-0.015
 possible-0.014
 happen-0.014
Token let
Token ID1309
Feature activation+4.35e-02

Pos logit contributions
Sorry+0.031
W+0.025
Grab+0.024
Boo+0.023
ted+0.022
Neg logit contributions
 supposed-0.039
epad-0.039
 idea-0.038
 recognise-0.036
 possible-0.036
Token flowers
Token ID12734
Feature activation-7.00e-03

Pos logit contributions
epad+0.026
 supposed+0.026
 idea+0.025
 possible+0.024
 recognise+0.023
Neg logit contributions
Sorry-0.039
Grab-0.033
W-0.033
Boo-0.031
ted-0.031
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
epad+0.013
 idea+0.012
 supposed+0.012
 recognise+0.011
 possible+0.011
Neg logit contributions
Sorry-0.012
ted-0.011
W-0.010
Grab-0.010
Boo-0.010
Token
Token ID198
Feature activation-1.69e-02

Pos logit contributions
 supposed+0.028
epad+0.028
 idea+0.028
 possible+0.027
 happen+0.026
Neg logit contributions
Sorry-0.024
Grab-0.019
W-0.019
Boo-0.018
ted-0.018
Token
Token ID198
Feature activation-7.79e-03

Pos logit contributions
epad+0.039
 supposed+0.039
 idea+0.038
 possible+0.038
 happen+0.036
Neg logit contributions
Sorry-0.023
W-0.019
Boo-0.017
Grab-0.017
L-0.017
Token"
Token ID1
Feature activation+5.26e-02

Pos logit contributions
 supposed+0.014
 idea+0.014
epad+0.014
 possible+0.014
 recognise+0.013
Neg logit contributions
Sorry-0.014
Grab-0.012
W-0.012
Boo-0.011
ted-0.011
TokenWow
Token ID22017
Feature activation+4.95e-02

Pos logit contributions
Sorry+0.064
Grab+0.045
ted+0.043
ipped+0.043
W+0.043
Neg logit contributions
epad-0.126
 supposed-0.125
 idea-0.123
 possible-0.121
 recognise-0.119
Token!"
Token ID2474
Feature activation+3.93e-03

Pos logit contributions
Sorry+0.092
W+0.075
Boo+0.075
Grab+0.073
ted+0.072
Neg logit contributions
epad-0.089
 supposed-0.087
 idea-0.082
 possible-0.081
 recognise-0.080
Token said
Token ID531
Feature activation+9.34e-03

Pos logit contributions
Sorry+0.009
W+0.008
Grab+0.008
Boo+0.007
Sp+0.007
Neg logit contributions
 idea-0.006
 supposed-0.005
epad-0.005
 recognise-0.005
 answer-0.005
Token Maria
Token ID14200
Feature activation-2.05e-02

Pos logit contributions
Sorry+0.023
Grab+0.019
W+0.019
Boo+0.019
ted+0.019
Neg logit contributions
 supposed-0.012
epad-0.011
 idea-0.011
 possible-0.011
 recognise-0.010
Token,
Token ID11
Feature activation-1.47e-02

Pos logit contributions
 supposed+0.035
epad+0.034
 idea+0.033
 possible+0.032
 recognise+0.031
Neg logit contributions
Sorry-0.040
W-0.033
ted-0.032
Boo-0.032
Grab-0.031
Token with
Token ID351
Feature activation+2.43e-02

Pos logit contributions
 supposed+0.018
 idea+0.017
epad+0.017
 possible+0.016
 recognise+0.015
Neg logit contributions
Sorry-0.037
Grab-0.031
W-0.031
Boo-0.031
ted-0.030
Token.
Token ID13
Feature activation+1.39e-03

Pos logit contributions
Sorry+0.040
W+0.033
Grab+0.032
Boo+0.032
ted+0.031
Neg logit contributions
 supposed-0.030
 idea-0.029
epad-0.028
 possible-0.027
 recognise-0.027
Token
Token ID198
Feature activation+1.71e-02

Pos logit contributions
Sorry+0.003
Grab+0.002
W+0.002
Boo+0.002
ted+0.002
Neg logit contributions
 idea-0.003
epad-0.003
 supposed-0.003
 possible-0.002
 recognise-0.002
Token
Token ID198
Feature activation+3.07e-02

Pos logit contributions
Sorry+0.035
Grab+0.029
Boo+0.028
ted+0.028
W+0.028
Neg logit contributions
 supposed-0.027
 idea-0.027
epad-0.026
 possible-0.026
 recognise-0.026
TokenThey
Token ID2990
Feature activation+2.62e-02

Pos logit contributions
ted+0.060
ishly+0.059
ipped+0.057
Sorry+0.050
outed+0.048
Neg logit contributions
 possible-0.040
 impressed-0.039
 guessing-0.038
 sunny-0.038
 recognise-0.037
Token went
Token ID1816
Feature activation+6.71e-03

Pos logit contributions
Sorry+0.057
Grab+0.049
W+0.045
Boo+0.045
ipped+0.044
Neg logit contributions
 idea-0.039
epad-0.038
 way-0.037
 answer-0.037
 supposed-0.036
Token back
Token ID736
Feature activation+5.52e-02

Pos logit contributions
Sorry+0.015
Grab+0.013
W+0.012
Boo+0.012
ted+0.012
Neg logit contributions
 idea-0.010
 supposed-0.010
 possible-0.010
epad-0.010
 answer-0.009
Token to
Token ID284
Feature activation+3.37e-02

Pos logit contributions
Sorry+0.147
Grab+0.127
W+0.123
Boo+0.122
ted+0.117
Neg logit contributions
 supposed-0.062
 idea-0.060
epad-0.058
 possible-0.057
 pretending-0.054
Token the
Token ID262
Feature activation+3.87e-02

Pos logit contributions
Sorry+0.055
W+0.045
Grab+0.044
ted+0.041
Boo+0.041
Neg logit contributions
epad-0.066
 supposed-0.066
 idea-0.064
 possible-0.062
 recognise-0.060
Token park
Token ID3952
Feature activation+9.28e-03

Pos logit contributions
Sorry+0.100
Grab+0.089
ted+0.088
Boo+0.087
W+0.086
Neg logit contributions
 idea-0.043
 way-0.042
 supposed-0.042
 answer-0.039
 possible-0.038
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
Sorry+0.021
W+0.018
Grab+0.018
Boo+0.017
ted+0.017
Neg logit contributions
 supposed-0.013
 idea-0.013
epad-0.013
 possible-0.012
 recognise-0.012
Token They
Token ID1119
Feature activation+2.47e-02

Pos logit contributions
Sorry+0.024
Grab+0.019
W+0.018
Boo+0.018
ted+0.018
Neg logit contributions
 idea-0.023
epad-0.022
 supposed-0.022
 possible-0.021
 recognise-0.021
Token ended
Token ID4444
Feature activation+3.67e-02

Pos logit contributions
Sorry+0.061
Grab+0.050
W+0.050
Boo+0.047
H+0.046
Neg logit contributions
 idea-0.042
 supposed-0.040
epad-0.039
 way-0.038
 balancing-0.038
Token,
Token ID11
Feature activation+8.25e-03

Pos logit contributions
Sorry+0.077
W+0.064
Grab+0.064
Boo+0.061
ted+0.061
Neg logit contributions
epad-0.059
 supposed-0.058
 idea-0.057
 possible-0.054
 recognise-0.053
Token Tim
Token ID5045
Feature activation+5.98e-04

Pos logit contributions
Sorry+0.017
Boo+0.015
W+0.015
Grab+0.014
H+0.014
Neg logit contributions
 idea-0.013
 pretending-0.013
 way-0.013
 possible-0.013
 supposed-0.012
Tokenmy
Token ID1820
Feature activation+1.50e-02

Pos logit contributions
Sorry+0.001
W+0.001
Boo+0.001
Grab+0.001
H+0.001
Neg logit contributions
epad-0.001
 possible-0.001
 idea-0.001
 supposed-0.001
 happen-0.001
Token went
Token ID1816
Feature activation+5.70e-03

Pos logit contributions
Sorry+0.027
Grab+0.022
W+0.022
Boo+0.021
ted+0.020
Neg logit contributions
 supposed-0.029
 idea-0.028
epad-0.028
 possible-0.027
 recognise-0.026
Token back
Token ID736
Feature activation+5.05e-02

Pos logit contributions
Sorry+0.012
Grab+0.010
W+0.010
Boo+0.009
ted+0.009
Neg logit contributions
 supposed-0.010
 idea-0.009
epad-0.009
 possible-0.009
 balancing-0.009
Token outside
Token ID2354
Feature activation+3.25e-03

Pos logit contributions
Sorry+0.121
Grab+0.102
W+0.102
Boo+0.099
ted+0.097
Neg logit contributions
 supposed-0.067
 idea-0.066
epad-0.066
 possible-0.062
 recognise-0.060
Token to
Token ID284
Feature activation+2.05e-02

Pos logit contributions
Sorry+0.005
ted+0.004
W+0.004
ipped+0.004
Grab+0.004
Neg logit contributions
epad-0.006
 supposed-0.006
 possible-0.006
 idea-0.006
 happen-0.006
Token play
Token ID711
Feature activation-6.61e-03

Pos logit contributions
Sorry+0.037
Grab+0.031
W+0.031
Boo+0.030
ted+0.029
Neg logit contributions
epad-0.037
 supposed-0.036
 idea-0.036
 possible-0.035
 recognise-0.032
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
 supposed+0.011
epad+0.011
 idea+0.011
 possible+0.011
 recognise+0.010
Neg logit contributions
Sorry-0.013
W-0.011
Grab-0.011
Boo-0.011
ted-0.010
Token He
Token ID679
Feature activation+2.49e-02

Pos logit contributions
Sorry+0.022
Grab+0.018
W+0.018
Boo+0.018
ted+0.017
Neg logit contributions
 supposed-0.017
 idea-0.017
epad-0.017
 possible-0.016
 recognise-0.015
Token pile
Token ID14540
Feature activation-8.53e-03

Pos logit contributions
 supposed+0.012
epad+0.012
 recognise+0.011
 possible+0.011
 idea+0.010
Neg logit contributions
Sorry-0.014
Grab-0.011
W-0.011
ted-0.011
Boo-0.011
Token of
Token ID286
Feature activation-1.95e-02

Pos logit contributions
epad+0.016
 supposed+0.016
 idea+0.016
 recognise+0.016
 possible+0.015
Neg logit contributions
Sorry-0.013
ted-0.011
W-0.011
Grab-0.010
ipped-0.010
Token leaves
Token ID5667
Feature activation-8.76e-03

Pos logit contributions
 idea+0.027
 supposed+0.026
epad+0.026
 possible+0.025
 recognise+0.024
Neg logit contributions
Sorry-0.047
W-0.039
Grab-0.039
Boo-0.038
ted-0.037
Token blowing
Token ID19280
Feature activation-8.51e-03

Pos logit contributions
epad+0.016
 supposed+0.015
 idea+0.015
 recognise+0.014
 possible+0.014
Neg logit contributions
Sorry-0.015
ted-0.013
Grab-0.012
W-0.012
Boo-0.012
Token in
Token ID287
Feature activation-9.06e-03

Pos logit contributions
 idea+0.011
 supposed+0.011
epad+0.011
 possible+0.011
 happen+0.010
Neg logit contributions
Sorry-0.021
Grab-0.017
W-0.017
Boo-0.017
ted-0.016
Token the
Token ID262
Feature activation+1.80e-02

Pos logit contributions
 supposed+0.015
epad+0.015
 idea+0.015
 possible+0.014
 recognise+0.014
Neg logit contributions
Sorry-0.018
Grab-0.015
W-0.015
Boo-0.014
ted-0.014
Token wind
Token ID2344
Feature activation-3.05e-02

Pos logit contributions
Sorry+0.045
W+0.036
Grab+0.036
Boo+0.036
ted+0.035
Neg logit contributions
 idea-0.023
 way-0.022
 supposed-0.022
epad-0.021
 answer-0.021
Token and
Token ID290
Feature activation-7.21e-03

Pos logit contributions
 supposed+0.050
epad+0.050
 idea+0.050
 recognise+0.047
 possible+0.047
Neg logit contributions
Sorry-0.059
ted-0.048
W-0.048
Boo-0.046
Grab-0.045
Token making
Token ID1642
Feature activation-6.65e-03

Pos logit contributions
 supposed+0.010
 idea+0.010
epad+0.010
 possible+0.009
 recognise+0.009
Neg logit contributions
Sorry-0.017
Grab-0.014
W-0.014
Boo-0.014
ted-0.014
Token a
Token ID257
Feature activation-5.15e-03

Pos logit contributions
epad+0.011
 supposed+0.011
 idea+0.010
 possible+0.010
 recognise+0.010
Neg logit contributions
Sorry-0.013
W-0.011
Grab-0.011
Boo-0.011
ted-0.011
Token weird
Token ID7650
Feature activation-3.52e-03

Pos logit contributions
 supposed+0.006
epad+0.006
 idea+0.006
 possible+0.006
 recognise+0.005
Neg logit contributions
Sorry-0.013
W-0.011
Grab-0.011
Boo-0.011
ted-0.010
Token way
Token ID835
Feature activation-1.25e-02

Pos logit contributions
Sorry+0.027
W+0.024
Grab+0.024
Boo+0.024
ted+0.024
Neg logit contributions
 supposed-0.011
epad-0.011
 possible-0.010
 idea-0.010
 recognise-0.009
Token.
Token ID13
Feature activation+8.54e-03

Pos logit contributions
epad+0.019
 idea+0.019
 supposed+0.019
 recognise+0.017
 possible+0.017
Neg logit contributions
Sorry-0.026
W-0.022
Grab-0.021
ted-0.021
Boo-0.021
Token The
Token ID383
Feature activation-6.83e-03

Pos logit contributions
Sorry+0.016
Grab+0.013
W+0.013
Boo+0.013
ted+0.012
Neg logit contributions
 supposed-0.016
 idea-0.015
epad-0.015
 possible-0.015
 recognise-0.014
Token end
Token ID886
Feature activation-8.73e-03

Pos logit contributions
 idea+0.010
 supposed+0.009
 answer+0.009
epad+0.008
 possible+0.008
Neg logit contributions
Sorry-0.016
Boo-0.014
Grab-0.014
W-0.013
ted-0.013
Token.
Token ID13
Feature activation+2.38e-02

Pos logit contributions
 supposed+0.012
epad+0.012
 idea+0.012
 possible+0.011
 recognise+0.011
Neg logit contributions
Sorry-0.020
W-0.017
Grab-0.017
Boo-0.017
ted-0.016
Token<|endoftext|>
Token ID50256
Feature activation+7.72e-03

Pos logit contributions
Sorry+0.044
Grab+0.035
W+0.035
Boo+0.034
ted+0.033
Neg logit contributions
 supposed-0.045
 idea-0.044
epad-0.044
 possible-0.043
 recognise-0.041
TokenOnce
Token ID7454
Feature activation+1.23e-02

Pos logit contributions
Sorry+0.013
W+0.010
Grab+0.010
Boo+0.010
ted+0.010
Neg logit contributions
 supposed-0.015
epad-0.015
 idea-0.015
 possible-0.015
 recognise-0.014
Token upon
Token ID2402
Feature activation+1.99e-02

Pos logit contributions
Sorry+0.016
L+0.013
s+0.013
ted+0.012
W+0.012
Neg logit contributions
epad-0.028
 supposed-0.026
 recognise-0.025
 quest-0.024
 recipe-0.024
Token a
Token ID257
Feature activation+5.02e-03

Pos logit contributions
Sorry+0.034
W+0.028
Grab+0.027
Boo+0.026
H+0.025
Neg logit contributions
epad-0.039
 supposed-0.039
 idea-0.039
 possible-0.038
 recognise-0.036
Token time
Token ID640
Feature activation+8.58e-03

Pos logit contributions
Sorry+0.013
W+0.012
L+0.012
H+0.012
Grab+0.011
Neg logit contributions
epad-0.006
 possible-0.005
 supposed-0.005
 recognise-0.004
 happen-0.004
Token,
Token ID11
Feature activation-5.55e-03

Pos logit contributions
Sorry+0.016
W+0.013
Grab+0.013
Boo+0.013
Sp+0.012
Neg logit contributions
epad-0.016
 supposed-0.015
 idea-0.015
 possible-0.014
 recognise-0.014
Token'll
Token ID1183
Feature activation+1.34e-03

Pos logit contributions
Sorry+0.053
Grab+0.043
W+0.042
ted+0.042
Boo+0.041
Neg logit contributions
 idea-0.047
 supposed-0.044
 answer-0.041
 recognise-0.041
epad-0.040
Token help
Token ID1037
Feature activation+8.02e-03

Pos logit contributions
Sorry+0.003
Grab+0.003
W+0.003
Boo+0.003
ted+0.003
Neg logit contributions
 supposed-0.002
 idea-0.002
epad-0.002
 possible-0.002
 recognise-0.002
Token you
Token ID345
Feature activation+1.94e-02

Pos logit contributions
Sorry+0.013
Grab+0.010
Boo+0.009
W+0.009
ted+0.008
Neg logit contributions
 supposed-0.017
 idea-0.017
epad-0.017
 possible-0.017
 happen-0.017
Token."
Token ID526
Feature activation+7.35e-03

Pos logit contributions
Sorry+0.045
W+0.038
Grab+0.037
Boo+0.036
ted+0.035
Neg logit contributions
 supposed-0.027
epad-0.027
 idea-0.026
 possible-0.025
 recognise-0.024
Token Roc
Token ID13545
Feature activation+7.69e-03

Pos logit contributions
Sorry+0.016
Boo+0.013
Grab+0.012
W+0.012
Sp+0.011
Neg logit contributions
 idea-0.011
 way-0.011
 pretending-0.010
 supposed-0.010
 balancing-0.010
Tokenco
Token ID1073
Feature activation+1.41e-02

Pos logit contributions
Sorry+0.012
Grab+0.010
W+0.009
Boo+0.009
ted+0.009
Neg logit contributions
 idea-0.016
 supposed-0.016
epad-0.015
 balancing-0.015
 possible-0.015
Token pulled
Token ID5954
Feature activation+1.44e-02

Pos logit contributions
Sorry+0.030
W+0.025
Grab+0.024
Boo+0.024
Sp+0.023
Neg logit contributions
 idea-0.024
 supposed-0.022
epad-0.021
 recognise-0.021
 answer-0.020
Token the
Token ID262
Feature activation-2.49e-03

Pos logit contributions
Sorry+0.032
Grab+0.027
W+0.026
Boo+0.026
ted+0.025
Neg logit contributions
 supposed-0.021
 idea-0.021
epad-0.021
 possible-0.020
 recognise-0.019
Token carrot
Token ID40562
Feature activation+3.05e-02

Pos logit contributions
epad+0.004
 supposed+0.004
 possible+0.003
 recognise+0.003
od+0.003
Neg logit contributions
Sorry-0.006
Grab-0.005
W-0.005
ted-0.005
Boo-0.004
Token out
Token ID503
Feature activation+1.72e-03

Pos logit contributions
Sorry+0.056
Grab+0.047
W+0.046
ted+0.046
Boo+0.043
Neg logit contributions
 supposed-0.055
epad-0.054
 possible-0.050
 idea-0.049
 recognise-0.049
Token of
Token ID286
Feature activation+1.30e-02

Pos logit contributions
Sorry+0.004
Grab+0.003
W+0.003
ted+0.003
Boo+0.003
Neg logit contributions
 supposed-0.003
epad-0.003
 idea-0.003
 possible-0.002
 recognise-0.002
Token out
Token ID503
Feature activation-6.07e-03

Pos logit contributions
Sorry+0.009
W+0.008
Grab+0.008
ted+0.008
Boo+0.007
Neg logit contributions
 idea-0.007
epad-0.007
 supposed-0.007
 possible-0.006
 happen-0.006
Token to
Token ID284
Feature activation+9.53e-03

Pos logit contributions
 supposed+0.009
epad+0.009
 idea+0.009
 possible+0.009
 recognise+0.008
Neg logit contributions
Sorry-0.013
Grab-0.011
W-0.011
ted-0.011
Boo-0.010
Token see
Token ID766
Feature activation+2.99e-03

Pos logit contributions
Sorry+0.017
Grab+0.015
W+0.014
Boo+0.013
L+0.013
Neg logit contributions
 supposed-0.017
epad-0.017
 idea-0.017
 possible-0.016
arter-0.015
Token a
Token ID257
Feature activation-3.68e-03

Pos logit contributions
Sorry+0.006
Grab+0.005
W+0.005
ted+0.005
Boo+0.005
Neg logit contributions
 supposed-0.005
epad-0.005
 idea-0.005
 possible-0.005
 recognise-0.005
Token guitar
Token ID10047
Feature activation+2.24e-02

Pos logit contributions
 supposed+0.005
epad+0.005
 idea+0.004
 recognise+0.004
 possible+0.004
Neg logit contributions
Sorry-0.009
W-0.008
Grab-0.008
Boo-0.007
H-0.007
Token shop
Token ID6128
Feature activation+1.18e-02

Pos logit contributions
Sorry+0.044
W+0.036
ted+0.036
Grab+0.035
Boo+0.035
Neg logit contributions
 supposed-0.037
epad-0.036
 idea-0.035
 possible-0.034
 recognise-0.034
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
Sorry+0.023
W+0.018
Grab+0.017
H+0.017
ted+0.017
Neg logit contributions
 idea-0.020
 supposed-0.019
epad-0.019
 recognise-0.018
 happen-0.018
Token The
Token ID383
Feature activation-1.60e-02

Pos logit contributions
 idea+0.020
epad+0.019
 supposed+0.019
 pretending+0.019
 recognise+0.018
Neg logit contributions
Sorry-0.022
Boo-0.017
W-0.017
Grab-0.017
ted-0.016
Token bunny
Token ID44915
Feature activation-8.24e-04

Pos logit contributions
 idea+0.023
epad+0.022
 supposed+0.022
 possible+0.021
 balancing+0.020
Neg logit contributions
Sorry-0.037
Grab-0.030
W-0.030
Boo-0.030
ted-0.029
Token gazed
Token ID50255
Feature activation-1.94e-02

Pos logit contributions
 idea+0.001
 supposed+0.001
 way+0.001
 pretending+0.001
epad+0.001
Neg logit contributions
Sorry-0.002
Grab-0.001
W-0.001
Boo-0.001
Sp-0.001
Token at
Token ID379
Feature activation+3.00e-03

Pos logit contributions
 supposed+0.037
epad+0.036
 happen+0.035
 idea+0.035
 possible+0.035
Neg logit contributions
Sorry-0.034
ted-0.027
W-0.027
Grab-0.027
ipped-0.025
Token bites
Token ID26081
Feature activation+3.25e-02

Pos logit contributions
Sorry+0.028
Grab+0.023
W+0.023
Boo+0.022
ted+0.021
Neg logit contributions
 idea-0.022
 supposed-0.022
epad-0.021
 possible-0.020
 happen-0.020
Token the
Token ID262
Feature activation-1.73e-03

Pos logit contributions
Sorry+0.078
Grab+0.065
W+0.064
Boo+0.064
H+0.061
Neg logit contributions
 idea-0.042
 supposed-0.039
 possible-0.038
 answer-0.037
 way-0.037
Token stick
Token ID4859
Feature activation+1.28e-02

Pos logit contributions
 idea+0.003
 supposed+0.002
epad+0.002
 possible+0.002
 happen+0.002
Neg logit contributions
Sorry-0.004
W-0.003
Grab-0.003
Boo-0.003
ted-0.003
Token.
Token ID13
Feature activation-1.57e-03

Pos logit contributions
Sorry+0.024
W+0.019
Grab+0.019
ted+0.019
Boo+0.018
Neg logit contributions
 supposed-0.023
epad-0.023
 recognise-0.021
 idea-0.021
 possible-0.021
Token The
Token ID383
Feature activation+2.63e-03

Pos logit contributions
 idea+0.003
 supposed+0.003
epad+0.003
 possible+0.003
 recognise+0.003
Neg logit contributions
Sorry-0.003
W-0.002
Grab-0.002
Boo-0.002
ted-0.002
Token stick
Token ID4859
Feature activation+1.95e-02

Pos logit contributions
Sorry+0.006
W+0.005
Grab+0.005
Boo+0.005
ted+0.005
Neg logit contributions
 idea-0.004
 balancing-0.004
 way-0.003
 answer-0.003
 happen-0.003
Token breaks
Token ID9457
Feature activation+3.00e-02

Pos logit contributions
Sorry+0.038
Grab+0.031
W+0.031
Boo+0.030
ted+0.029
Neg logit contributions
 supposed-0.034
 idea-0.034
epad-0.034
 possible-0.032
 recognise-0.031
Token.
Token ID13
Feature activation+9.71e-04

Pos logit contributions
Sorry+0.068
Grab+0.057
W+0.057
Boo+0.055
ted+0.054
Neg logit contributions
 supposed-0.043
 idea-0.043
epad-0.043
 possible-0.041
 recognise-0.039
Token Tim
Token ID5045
Feature activation+8.70e-03

Pos logit contributions
Sorry+0.002
W+0.001
Grab+0.001
Boo+0.001
ted+0.001
Neg logit contributions
 idea-0.002
 supposed-0.002
epad-0.002
 possible-0.002
 recognise-0.002
Token falls
Token ID8953
Feature activation+4.08e-02

Pos logit contributions
Sorry+0.020
Grab+0.017
W+0.017
Boo+0.016
ted+0.016
Neg logit contributions
 supposed-0.012
epad-0.012
 idea-0.012
 possible-0.012
 recognise-0.011
Token into
Token ID656
Feature activation+2.51e-02

Pos logit contributions
Sorry+0.083
W+0.069
Grab+0.068
Boo+0.066
ted+0.065
Neg logit contributions
epad-0.068
 supposed-0.068
 idea-0.068
 possible-0.064
 recognise-0.063
Token't
Token ID470
Feature activation-2.66e-02

Pos logit contributions
Sorry+0.001
Grab+0.001
Boo+0.001
W+0.001
ted+0.001
Neg logit contributions
 supposed-0.001
 idea-0.001
 possible-0.001
 happen-0.001
epad-0.001
Token understand
Token ID1833
Feature activation-2.37e-02

Pos logit contributions
 supposed+0.038
epad+0.037
 idea+0.034
 possible+0.034
 pretending+0.033
Neg logit contributions
Sorry-0.062
W-0.052
Grab-0.050
Boo-0.050
H-0.049
Token.
Token ID13
Feature activation-7.90e-03

Pos logit contributions
epad+0.041
 idea+0.040
 supposed+0.040
 recognise+0.038
 possible+0.038
Neg logit contributions
Sorry-0.042
ted-0.035
W-0.034
Grab-0.033
Boo-0.032
Token
Token ID198
Feature activation-1.35e-02

Pos logit contributions
 idea+0.013
 supposed+0.012
epad+0.012
 way+0.012
 possible+0.011
Neg logit contributions
Sorry-0.016
Boo-0.013
Grab-0.013
W-0.013
ted-0.012
Token
Token ID198
Feature activation-2.46e-03

Pos logit contributions
 supposed+0.025
 idea+0.024
epad+0.024
 possible+0.023
 recognise+0.023
Neg logit contributions
Sorry-0.025
Grab-0.020
W-0.020
Boo-0.020
ted-0.019
TokenM
Token ID44
Feature activation+1.44e-03

Pos logit contributions
 recognise+0.004
 supposed+0.004
 balancing+0.004
 idea+0.004
 possible+0.004
Neg logit contributions
Sorry-0.004
ted-0.004
Grab-0.003
ipped-0.003
Boo-0.003
Tokenilly
Token ID6548
Feature activation+8.33e-03

Pos logit contributions
Sorry+0.003
Grab+0.003
W+0.003
Boo+0.003
ted+0.003
Neg logit contributions
 supposed-0.002
epad-0.002
 idea-0.002
 possible-0.002
 recognise-0.002
Token still
Token ID991
Feature activation-1.07e-02

Pos logit contributions
Sorry+0.017
Grab+0.015
W+0.014
Boo+0.014
Sp+0.014
Neg logit contributions
 idea-0.013
 way-0.012
 balancing-0.011
 supposed-0.011
 pretending-0.011
Token hated
Token ID16563
Feature activation-1.60e-02

Pos logit contributions
 supposed+0.017
 idea+0.017
epad+0.016
 pretending+0.015
 recognise+0.015
Neg logit contributions
Sorry-0.023
Grab-0.020
W-0.019
Boo-0.019
Sp-0.018
Token the
Token ID262
Feature activation-1.65e-02

Pos logit contributions
epad+0.025
 idea+0.024
 supposed+0.024
 recognise+0.024
 possible+0.023
Neg logit contributions
Sorry-0.031
ted-0.026
W-0.025
Grab-0.025
Sp-0.024
Token task
Token ID4876
Feature activation-9.03e-03

Pos logit contributions
 supposed+0.019
epad+0.019
 recognise+0.018
od+0.018
 possible+0.017
Neg logit contributions
Sorry-0.040
Grab-0.035
W-0.034
Boo-0.032
H-0.032
Token play
Token ID711
Feature activation-2.33e-02

Pos logit contributions
 supposed+0.061
epad+0.059
 idea+0.058
 possible+0.058
 recognise+0.054
Neg logit contributions
Sorry-0.063
W-0.051
Grab-0.051
Boo-0.049
ted-0.046
Token in
Token ID287
Feature activation-1.36e-02

Pos logit contributions
 idea+0.038
 pretending+0.035
 balancing+0.034
 supposed+0.033
 trick+0.033
Neg logit contributions
Sorry-0.051
Grab-0.042
W-0.040
Boo-0.040
ted-0.038
Token the
Token ID262
Feature activation+2.78e-02

Pos logit contributions
epad+0.028
 supposed+0.027
 idea+0.026
 possible+0.025
 recognise+0.025
Neg logit contributions
Sorry-0.021
W-0.019
Grab-0.018
L-0.017
Sp-0.017
Token hall
Token ID6899
Feature activation-1.88e-02

Pos logit contributions
Sorry+0.066
Grab+0.055
W+0.053
ted+0.053
Boo+0.053
Neg logit contributions
 idea-0.038
 supposed-0.037
 possible-0.035
 way-0.035
epad-0.035
Token at
Token ID379
Feature activation-1.15e-02

Pos logit contributions
 supposed+0.029
epad+0.028
 idea+0.028
 possible+0.027
 recognise+0.026
Neg logit contributions
Sorry-0.040
W-0.033
Grab-0.032
Boo-0.032
ted-0.032
Token school
Token ID1524
Feature activation-1.42e-02

Pos logit contributions
epad+0.019
 supposed+0.019
 idea+0.018
 possible+0.017
 recognise+0.017
Neg logit contributions
Sorry-0.022
W-0.019
Grab-0.018
Boo-0.018
L-0.018
Token.
Token ID13
Feature activation-2.22e-02

Pos logit contributions
 idea+0.019
 supposed+0.019
epad+0.019
 possible+0.017
 pretending+0.017
Neg logit contributions
Sorry-0.034
Grab-0.029
W-0.029
Boo-0.028
ted-0.027
Token The
Token ID383
Feature activation-2.06e-02

Pos logit contributions
 idea+0.037
 supposed+0.037
epad+0.037
 possible+0.035
 pretending+0.034
Neg logit contributions
Sorry-0.045
Grab-0.037
W-0.037
Boo-0.036
ted-0.034
Token hall
Token ID6899
Feature activation-1.85e-02

Pos logit contributions
 idea+0.023
 supposed+0.022
epad+0.021
 possible+0.019
 pretending+0.019
Neg logit contributions
Sorry-0.055
Grab-0.048
W-0.047
Boo-0.046
ted-0.045
Token is
Token ID318
Feature activation-1.79e-02

Pos logit contributions
 idea+0.032
 supposed+0.031
epad+0.030
 way+0.029
 possible+0.029
Neg logit contributions
Sorry-0.037
Grab-0.032
W-0.031
Boo-0.030
ted-0.029
Token big
Token ID1263
Feature activation-7.63e-03

Pos logit contributions
epad+0.029
 happen+0.028
arter+0.027
 hesitation+0.027
 quest+0.027
Neg logit contributions
Sorry-0.034
W-0.028
L-0.028
Grab-0.027
H-0.025
Token and
Token ID290
Feature activation+4.84e-03

Pos logit contributions
epad+0.018
 supposed+0.018
 idea+0.018
 possible+0.016
 recognise+0.016
Neg logit contributions
Sorry-0.025
W-0.021
Grab-0.020
Boo-0.020
ted-0.020
Token have
Token ID423
Feature activation+1.82e-02

Pos logit contributions
Sorry+0.010
Grab+0.009
W+0.009
Boo+0.008
ted+0.008
Neg logit contributions
 idea-0.008
 supposed-0.008
epad-0.008
 possible-0.007
 recognise-0.007
Token more
Token ID517
Feature activation-2.77e-02

Pos logit contributions
Sorry+0.041
Grab+0.034
W+0.034
Boo+0.033
ted+0.032
Neg logit contributions
 supposed-0.027
 idea-0.027
epad-0.027
 possible-0.026
 happen-0.024
Token time
Token ID640
Feature activation-2.83e-02

Pos logit contributions
epad+0.039
 supposed+0.039
 idea+0.037
 recognise+0.036
 possible+0.036
Neg logit contributions
Sorry-0.061
W-0.051
Grab-0.050
Boo-0.049
ted-0.048
Token to
Token ID284
Feature activation-1.69e-02

Pos logit contributions
 idea+0.040
 supposed+0.040
epad+0.040
 possible+0.037
 recognise+0.036
Neg logit contributions
Sorry-0.064
W-0.054
Grab-0.053
Boo-0.052
ted-0.051
Token play
Token ID711
Feature activation-1.77e-02

Pos logit contributions
 idea+0.022
 supposed+0.022
epad+0.022
 possible+0.021
 happen+0.020
Neg logit contributions
Sorry-0.041
Grab-0.035
W-0.035
Boo-0.034
ted-0.033
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 idea+0.027
 supposed+0.024
epad+0.024
 pretending+0.024
 happen+0.023
Neg logit contributions
Sorry-0.041
Grab-0.034
W-0.033
Boo-0.032
ted-0.032
Token The
Token ID383
Feature activation-5.15e-03

Pos logit contributions
 idea+0.016
 supposed+0.016
epad+0.016
 possible+0.015
 pretending+0.015
Neg logit contributions
Sorry-0.021
Grab-0.017
ted-0.017
ipped-0.017
Boo-0.017
Token moral
Token ID6573
Feature activation+2.50e-03

Pos logit contributions
 idea+0.007
 supposed+0.007
epad+0.007
 possible+0.007
 balancing+0.006
Neg logit contributions
Sorry-0.012
Grab-0.010
W-0.010
Boo-0.010
ted-0.010
Token of
Token ID286
Feature activation-2.16e-02

Pos logit contributions
Sorry+0.007
Grab+0.006
ted+0.006
ipped+0.006
Boo+0.006
Neg logit contributions
 idea-0.002
 supposed-0.002
 answer-0.002
 possible-0.002
 way-0.002
Token the
Token ID262
Feature activation-1.52e-02

Pos logit contributions
epad+0.044
 idea+0.041
 supposed+0.041
 recognise+0.039
 possible+0.038
Neg logit contributions
Sorry-0.035
W-0.031
Grab-0.030
L-0.028
ted-0.028
Token shelf
Token ID18316
Feature activation+2.63e-02

Pos logit contributions
Sorry+0.051
W+0.045
Grab+0.044
Boo+0.043
ted+0.042
Neg logit contributions
 supposed-0.027
epad-0.027
 idea-0.027
 possible-0.025
 recognise-0.024
Token all
Token ID477
Feature activation-3.74e-02

Pos logit contributions
Sorry+0.059
Grab+0.049
W+0.049
Boo+0.047
ted+0.047
Neg logit contributions
 supposed-0.038
epad-0.038
 idea-0.037
 possible-0.036
 recognise-0.035
Token day
Token ID1110
Feature activation-4.87e-03

Pos logit contributions
 supposed+0.039
epad+0.038
 idea+0.038
 possible+0.035
 recognise+0.033
Neg logit contributions
Sorry-0.100
Grab-0.087
W-0.086
Boo-0.084
ted-0.083
Token,
Token ID11
Feature activation-2.41e-02

Pos logit contributions
 idea+0.009
 supposed+0.008
epad+0.008
 possible+0.008
 happen+0.008
Neg logit contributions
Sorry-0.008
W-0.007
ted-0.007
H-0.007
Grab-0.007
Token wishing
Token ID24433
Feature activation-2.14e-02

Pos logit contributions
 idea+0.028
 supposed+0.028
epad+0.027
 possible+0.026
 happen+0.025
Neg logit contributions
Sorry-0.060
Grab-0.052
W-0.052
ted-0.050
Boo-0.049
Token for
Token ID329
Feature activation-7.17e-03

Pos logit contributions
 supposed+0.035
epad+0.034
 idea+0.034
 recognise+0.032
 possible+0.032
Neg logit contributions
Sorry-0.043
W-0.035
ted-0.034
Grab-0.034
Boo-0.034
Token a
Token ID257
Feature activation-4.86e-02

Pos logit contributions
 idea+0.011
 supposed+0.011
epad+0.011
 possible+0.011
 happen+0.010
Neg logit contributions
Sorry-0.016
Grab-0.013
W-0.013
Boo-0.013
ted-0.012
Token friend
Token ID1545
Feature activation-3.40e-02

Pos logit contributions
 supposed+0.064
epad+0.062
 idea+0.062
 possible+0.058
 recognise+0.057
Neg logit contributions
Sorry-0.116
W-0.100
Grab-0.098
Boo-0.096
Sp-0.092
Token.
Token ID13
Feature activation-2.06e-02

Pos logit contributions
epad+0.052
 idea+0.052
 supposed+0.051
 possible+0.048
 happen+0.048
Neg logit contributions
Sorry-0.070
W-0.061
ted-0.059
Boo-0.058
Grab-0.058
Token\n
Token ID198
Feature activation-2.24e-02

Pos logit contributions
 idea+0.031
 supposed+0.030
epad+0.030
 pretending+0.029
 recognise+0.028
Neg logit contributions
Sorry-0.045
Boo-0.036
Grab-0.035
W-0.035
ipped-0.035
Token\n
Token ID198
Feature activation-8.26e-03

Pos logit contributions
 supposed+0.043
 idea+0.043
epad+0.043
 possible+0.041
 recognise+0.040
Neg logit contributions
Sorry-0.040
Grab-0.032
W-0.032
Boo-0.030
ted-0.029
Token it
Token ID340
Feature activation+1.52e-02

Pos logit contributions
Sorry+0.021
Grab+0.017
W+0.017
Boo+0.017
ipped+0.016
Neg logit contributions
 idea-0.011
 supposed-0.011
 happen-0.010
epad-0.010
 possible-0.010
Token,
Token ID11
Feature activation-2.57e-02

Pos logit contributions
Sorry+0.024
W+0.021
ted+0.020
Grab+0.020
Sp+0.019
Neg logit contributions
epad-0.030
 supposed-0.029
 recognise-0.027
 possible-0.026
arter-0.026
Token but
Token ID475
Feature activation-1.41e-02

Pos logit contributions
epad+0.036
 supposed+0.034
 possible+0.033
arter+0.032
 idea+0.031
Neg logit contributions
Sorry-0.057
Grab-0.051
W-0.050
ted-0.048
Sure-0.047
Token she
Token ID673
Feature activation-7.27e-03

Pos logit contributions
 idea+0.016
 possible+0.015
 supposed+0.015
 way+0.015
 pretending+0.015
Neg logit contributions
Sorry-0.037
Boo-0.031
Grab-0.031
W-0.031
ipped-0.030
Token didn
Token ID1422
Feature activation-5.00e-03

Pos logit contributions
 idea+0.011
 supposed+0.011
 way+0.010
 balancing+0.010
 balance+0.010
Neg logit contributions
Sorry-0.016
Grab-0.013
Boo-0.013
W-0.012
ted-0.012
Token't
Token ID470
Feature activation-2.67e-02

Pos logit contributions
epad+0.010
 idea+0.009
 supposed+0.009
 possible+0.009
 pretending+0.009
Neg logit contributions
Sorry-0.009
W-0.007
Grab-0.007
Boo-0.007
H-0.007
Token know
Token ID760
Feature activation-2.05e-02

Pos logit contributions
epad+0.043
 recipe+0.040
 pretending+0.040
 comet+0.040
 stove+0.040
Neg logit contributions
Sorry-0.051
W-0.047
Boo-0.043
L-0.043
H-0.042
Token how
Token ID703
Feature activation-1.87e-02

Pos logit contributions
epad+0.040
 idea+0.040
 supposed+0.039
 recognise+0.037
 possible+0.036
Neg logit contributions
Sorry-0.034
ted-0.028
W-0.028
Grab-0.027
Boo-0.026
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
epad+0.030
 idea+0.029
 supposed+0.028
 recognise+0.026
 pretending+0.026
Neg logit contributions
Sorry-0.040
W-0.034
Boo-0.032
Grab-0.031
ted-0.031
Token Mitt
Token ID16627
Feature activation+4.69e-04

Pos logit contributions
 idea+0.028
 way+0.026
 supposed+0.024
 pretending+0.024
epad+0.024
Neg logit contributions
Sorry-0.037
Boo-0.030
Grab-0.030
W-0.029
ipped-0.028
Tokenens
Token ID641
Feature activation-9.35e-03

Pos logit contributions
Sorry+0.001
W+0.001
Grab+0.001
Boo+0.001
Sp+0.001
Neg logit contributions
 idea-0.001
 supposed-0.001
epad-0.001
 happen-0.001
 possible-0.001
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
epad+0.067
 possible+0.065
 idea+0.064
 supposed+0.063
 recognise+0.061
Neg logit contributions
Sorry-0.053
W-0.047
L-0.046
H-0.044
ted-0.042
Token Jack
Token ID3619
Feature activation-3.39e-02

Pos logit contributions
 supposed+0.035
 possible+0.034
epad+0.033
 happen+0.032
 idea+0.032
Neg logit contributions
Sorry-0.033
Grab-0.027
W-0.027
Sp-0.025
Boo-0.025
Token was
Token ID373
Feature activation-1.56e-02

Pos logit contributions
epad+0.064
 supposed+0.063
 idea+0.063
 possible+0.061
 recognise+0.059
Neg logit contributions
Sorry-0.062
W-0.050
Grab-0.049
Boo-0.048
ted-0.046
Token only
Token ID691
Feature activation-1.72e-02

Pos logit contributions
epad+0.031
 happen+0.029
 possible+0.029
 idea+0.028
 angle+0.028
Neg logit contributions
Sorry-0.026
W-0.023
Grab-0.022
H-0.020
L-0.020
Token three
Token ID1115
Feature activation-2.57e-02

Pos logit contributions
epad+0.029
 happen+0.028
 supposed+0.028
 idea+0.028
 possible+0.028
Neg logit contributions
Sorry-0.033
W-0.028
Grab-0.028
Boo-0.026
ted-0.026
Token years
Token ID812
Feature activation-4.50e-02

Pos logit contributions
epad+0.049
 supposed+0.048
 idea+0.048
 possible+0.047
 recognise+0.045
Neg logit contributions
Sorry-0.047
W-0.038
Grab-0.037
Boo-0.036
ted-0.035
Token old
Token ID1468
Feature activation+8.07e-03

Pos logit contributions
epad+0.052
 supposed+0.052
 recognise+0.048
 idea+0.047
 possible+0.045
Neg logit contributions
Sorry-0.110
W-0.096
ted-0.092
Grab-0.091
L-0.090
Token and
Token ID290
Feature activation-2.56e-02

Pos logit contributions
Sorry+0.021
Grab+0.019
Boo+0.018
W+0.018
ted+0.017
Neg logit contributions
 idea-0.009
 supposed-0.009
epad-0.008
 pretending-0.008
 answer-0.008
Token he
Token ID339
Feature activation-2.84e-02

Pos logit contributions
 supposed+0.041
 idea+0.040
epad+0.040
 possible+0.038
 recognise+0.036
Neg logit contributions
Sorry-0.055
Grab-0.046
W-0.045
Boo-0.044
ted-0.043
Token loved
Token ID6151
Feature activation-3.36e-02

Pos logit contributions
 supposed+0.046
 idea+0.043
epad+0.041
 pretending+0.040
 balancing+0.040
Neg logit contributions
Sorry-0.062
Grab-0.052
Boo-0.049
ted-0.049
W-0.049
Token to
Token ID284
Feature activation-3.47e-02

Pos logit contributions
 idea+0.056
epad+0.055
 possible+0.054
 recognise+0.053
 supposed+0.052
Neg logit contributions
Sorry-0.064
W-0.055
ted-0.053
Grab-0.049
Boo-0.048
Token She
Token ID1375
Feature activation-2.95e-02

Pos logit contributions
 supposed+0.030
epad+0.030
 possible+0.029
 idea+0.028
 balancing+0.028
Neg logit contributions
Sorry-0.025
Grab-0.020
W-0.020
Boo-0.019
ted-0.018
Token was
Token ID373
Feature activation-1.27e-02

Pos logit contributions
epad+0.053
 supposed+0.053
 idea+0.053
 possible+0.051
 recognise+0.049
Neg logit contributions
Sorry-0.057
W-0.046
Grab-0.046
Boo-0.044
ted-0.043
Token very
Token ID845
Feature activation-4.13e-03

Pos logit contributions
epad+0.023
 happen+0.022
 possible+0.022
 idea+0.021
 supposed+0.021
Neg logit contributions
Sorry-0.023
W-0.020
Grab-0.020
H-0.018
L-0.018
Token com
Token ID401
Feature activation+2.76e-02

Pos logit contributions
epad+0.005
 supposed+0.005
 idea+0.005
 possible+0.005
 happen+0.005
Neg logit contributions
Sorry-0.010
W-0.009
Grab-0.008
Boo-0.008
H-0.008
Tokenfy
Token ID24928
Feature activation-1.41e-03

Pos logit contributions
Sorry+0.055
Grab+0.046
Boo+0.044
ted+0.042
W+0.042
Neg logit contributions
 supposed-0.045
 idea-0.045
epad-0.044
 possible-0.041
 interested-0.041
Token when
Token ID618
Feature activation-4.62e-02

Pos logit contributions
epad+0.003
 recognise+0.002
 supposed+0.002
 idea+0.002
 possible+0.002
Neg logit contributions
Sorry-0.003
W-0.002
ted-0.002
H-0.002
Grab-0.002
Token she
Token ID673
Feature activation-1.45e-02

Pos logit contributions
 supposed+0.074
epad+0.074
 idea+0.073
 possible+0.069
 recognise+0.067
Neg logit contributions
Sorry-0.098
W-0.081
Grab-0.080
Boo-0.078
ted-0.076
Token sat
Token ID3332
Feature activation+2.73e-03

Pos logit contributions
 supposed+0.026
 idea+0.026
epad+0.025
 possible+0.024
 balancing+0.024
Neg logit contributions
Sorry-0.028
Grab-0.022
W-0.022
Boo-0.021
ted-0.021
Token down
Token ID866
Feature activation+2.08e-02

Pos logit contributions
Sorry+0.007
Grab+0.006
Boo+0.006
W+0.006
Sp+0.005
Neg logit contributions
 supposed-0.003
 idea-0.003
epad-0.003
 pretending-0.003
 possible-0.003
Token in
Token ID287
Feature activation-6.46e-05

Pos logit contributions
Sorry+0.045
Grab+0.037
W+0.037
Boo+0.036
ted+0.036
Neg logit contributions
 supposed-0.032
 idea-0.032
epad-0.032
 possible-0.030
 recognise-0.029
Token her
Token ID607
Feature activation+2.52e-02

Pos logit contributions
 supposed+0.000
epad+0.000
 idea+0.000
 possible+0.000
 recognise+0.000
Neg logit contributions
Sorry-0.000
W-0.000
Grab-0.000
Boo-0.000
ted-0.000
Token,
Token ID11
Feature activation-1.24e-02

Pos logit contributions
Sorry+0.031
Grab+0.026
W+0.026
Boo+0.026
ted+0.025
Neg logit contributions
 idea-0.015
 supposed-0.015
epad-0.014
 possible-0.014
 pretending-0.013
Token it
Token ID340
Feature activation+6.34e-03

Pos logit contributions
 idea+0.016
 pretending+0.014
 supposed+0.014
epad+0.014
 possible+0.014
Neg logit contributions
Sorry-0.030
W-0.026
Boo-0.026
Grab-0.025
ipped-0.024
Token found
Token ID1043
Feature activation+2.23e-02

Pos logit contributions
Sorry+0.012
Grab+0.010
W+0.010
Boo+0.009
ted+0.009
Neg logit contributions
 idea-0.011
 supposed-0.011
epad-0.011
 possible-0.011
 way-0.010
Token a
Token ID257
Feature activation-1.84e-02

Pos logit contributions
Sorry+0.055
Grab+0.044
W+0.043
Boo+0.043
ipped+0.041
Neg logit contributions
 supposed-0.030
 idea-0.029
 possible-0.028
 way-0.027
 answer-0.027
Token nice
Token ID3621
Feature activation-6.49e-03

Pos logit contributions
epad+0.025
 supposed+0.025
 recognise+0.023
 possible+0.023
 idea+0.022
Neg logit contributions
Sorry-0.042
W-0.036
Grab-0.035
Boo-0.034
H-0.033
Token,
Token ID11
Feature activation-4.11e-02

Pos logit contributions
 supposed+0.010
epad+0.010
 possible+0.009
 recognise+0.009
 happen+0.009
Neg logit contributions
Sorry-0.013
W-0.012
Grab-0.011
Boo-0.011
L-0.011
Token y
Token ID331
Feature activation+5.04e-02

Pos logit contributions
epad+0.056
 supposed+0.056
 possible+0.052
 idea+0.050
 recognise+0.050
Neg logit contributions
Sorry-0.093
W-0.083
Grab-0.082
Boo-0.076
L-0.075
Tokenummy
Token ID13513
Feature activation-4.87e-03

Pos logit contributions
Sorry+0.124
Grab+0.105
W+0.104
Boo+0.103
ted+0.100
Neg logit contributions
 idea-0.063
 supposed-0.063
epad-0.061
 possible-0.058
 recognise-0.055
Token treat
Token ID2190
Feature activation+1.65e-02

Pos logit contributions
epad+0.008
 supposed+0.008
 possible+0.008
 recognise+0.008
 achieving+0.007
Neg logit contributions
Sorry-0.009
W-0.008
Grab-0.008
Boo-0.008
ted-0.007
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
Sorry+0.035
W+0.030
Grab+0.029
ted+0.029
Boo+0.028
Neg logit contributions
epad-0.025
 idea-0.024
 supposed-0.024
 recognise-0.023
 possible-0.023
Token It
Token ID632
Feature activation+8.26e-03

Pos logit contributions
 idea+0.019
 supposed+0.017
epad+0.017
 way+0.017
 recognise+0.017
Neg logit contributions
Sorry-0.026
Boo-0.021
W-0.021
ipped-0.020
Grab-0.020
Token down
Token ID866
Feature activation+2.11e-04

Pos logit contributions
 supposed+0.003
 idea+0.002
epad+0.002
 possible+0.002
 recognise+0.002
Neg logit contributions
Sorry-0.004
W-0.003
ted-0.003
Grab-0.003
Boo-0.003
Token all
Token ID477
Feature activation-4.06e-02

Pos logit contributions
Sorry+0.000
Grab+0.000
W+0.000
Boo+0.000
ted+0.000
Neg logit contributions
 supposed-0.000
 idea-0.000
epad-0.000
 possible-0.000
 recognise-0.000
Token the
Token ID262
Feature activation-1.00e-02

Pos logit contributions
epad+0.073
 supposed+0.071
 idea+0.066
 recognise+0.066
 possible+0.065
Neg logit contributions
Sorry-0.069
ted-0.062
W-0.060
Grab-0.060
ipped-0.056
Token things
Token ID1243
Feature activation-1.90e-02

Pos logit contributions
arter+0.013
onna+0.013
 supposed+0.012
 possible+0.012
epad+0.011
Neg logit contributions
Sorry-0.021
W-0.020
L-0.019
Grab-0.019
Boo-0.019
Token he
Token ID339
Feature activation-3.33e-02

Pos logit contributions
 supposed+0.031
epad+0.030
 idea+0.030
 recognise+0.028
 possible+0.028
Neg logit contributions
Sorry-0.038
Grab-0.031
ted-0.031
W-0.031
Boo-0.030
Token wanted
Token ID2227
Feature activation-5.11e-02

Pos logit contributions
 supposed+0.059
 idea+0.058
epad+0.058
 possible+0.056
 recognise+0.054
Neg logit contributions
Sorry-0.065
W-0.053
Grab-0.052
Boo-0.051
ted-0.049
Token,
Token ID11
Feature activation-3.44e-02

Pos logit contributions
 idea+0.085
 supposed+0.084
epad+0.083
 recognise+0.081
arter+0.078
Neg logit contributions
Sorry-0.092
ted-0.078
W-0.077
Sp-0.073
Boo-0.072
Token and
Token ID290
Feature activation+4.66e-03

Pos logit contributions
 supposed+0.047
 idea+0.045
epad+0.044
 possible+0.042
 recognise+0.040
Neg logit contributions
Sorry-0.077
W-0.065
Grab-0.065
ted-0.063
Boo-0.062
Token all
Token ID477
Feature activation-3.79e-02

Pos logit contributions
Sorry+0.010
Grab+0.008
W+0.008
Boo+0.008
ted+0.008
Neg logit contributions
 supposed-0.007
 idea-0.007
epad-0.007
 possible-0.007
 recognise-0.007
Token the
Token ID262
Feature activation-1.98e-02

Pos logit contributions
 supposed+0.066
epad+0.065
 possible+0.060
 idea+0.060
 recognise+0.059
Neg logit contributions
Sorry-0.064
ted-0.054
W-0.050
Grab-0.049
ipped-0.049
Token things
Token ID1243
Feature activation-1.59e-02

Pos logit contributions
 supposed+0.031
 possible+0.030
epad+0.030
 idea+0.029
arter+0.028
Neg logit contributions
Sorry-0.040
W-0.034
Grab-0.033
Boo-0.033
Sp-0.032
Token boy
Token ID2933
Feature activation-7.33e-03

Pos logit contributions
Sorry+0.033
Grab+0.027
Boo+0.026
W+0.026
ted+0.025
Neg logit contributions
 idea-0.024
 supposed-0.023
epad-0.022
 possible-0.022
 balancing-0.021
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
epad+0.016
 idea+0.016
 supposed+0.016
 possible+0.015
 recognise+0.015
Neg logit contributions
Sorry-0.011
W-0.009
Grab-0.009
Boo-0.008
ted-0.008
Token He
Token ID679
Feature activation-2.97e-02

Pos logit contributions
 supposed+0.039
 possible+0.038
epad+0.037
 idea+0.037
 happen+0.036
Neg logit contributions
Sorry-0.028
Grab-0.023
W-0.023
Boo-0.022
ted-0.021
Token was
Token ID373
Feature activation-1.70e-02

Pos logit contributions
 supposed+0.052
 idea+0.051
epad+0.051
 possible+0.049
 recognise+0.047
Neg logit contributions
Sorry-0.059
Grab-0.048
W-0.048
Boo-0.046
ted-0.045
Token three
Token ID1115
Feature activation-2.40e-02

Pos logit contributions
epad+0.032
 happen+0.030
 possible+0.030
 idea+0.029
 supposed+0.029
Neg logit contributions
Sorry-0.030
W-0.026
Grab-0.026
L-0.023
H-0.023
Token years
Token ID812
Feature activation-4.12e-02

Pos logit contributions
epad+0.046
 supposed+0.045
 idea+0.044
 possible+0.044
 recognise+0.042
Neg logit contributions
Sorry-0.044
W-0.036
Grab-0.035
Boo-0.034
ted-0.033
Token old
Token ID1468
Feature activation+9.14e-03

Pos logit contributions
epad+0.055
 supposed+0.054
 recognise+0.050
 possible+0.049
 happen+0.049
Neg logit contributions
Sorry-0.091
W-0.079
L-0.076
ted-0.075
Grab-0.075
Token and
Token ID290
Feature activation-2.40e-02

Pos logit contributions
Sorry+0.024
Grab+0.021
Boo+0.020
W+0.020
ted+0.019
Neg logit contributions
 idea-0.011
 supposed-0.010
epad-0.009
 pretending-0.009
 answer-0.009
Token had
Token ID550
Feature activation+6.34e-03

Pos logit contributions
 supposed+0.040
epad+0.040
 idea+0.040
 possible+0.038
 recognise+0.036
Neg logit contributions
Sorry-0.049
W-0.041
Grab-0.040
Boo-0.039
ted-0.038
Token lots
Token ID6041
Feature activation-3.15e-02

Pos logit contributions
Sorry+0.012
Grab+0.010
W+0.010
Boo+0.010
ted+0.009
Neg logit contributions
 supposed-0.011
 idea-0.011
epad-0.011
 possible-0.011
 recognise-0.010
Token of
Token ID286
Feature activation-2.93e-02

Pos logit contributions
 idea+0.040
 supposed+0.040
epad+0.039
 possible+0.036
 recognise+0.035
Neg logit contributions
Sorry-0.078
Grab-0.066
W-0.065
Boo-0.065
ted-0.062