TOP ACTIVATIONS
MAX = 7.226

. Rex sn arl s . He does not
It was hiding under a seaw eed . Lily said ,
your fault , my d arl ings . You didn 't
Spot grow ls and sn arl s . He tries to
when suddenly he heard a loud roar . He was scared
let out a big sn arl and the mouse knew he
Suddenly , a loud boom made them stop and
he hopped up onto a fallen leaf and floated across the
's okay , my d arl ings . I forgive you
sudden , he heard a loud " Z oom !"
when he suddenly heard a loud noise . It scared him
when suddenly he heard a loud noise . He was shocked
it . Suddenly , a loud noise came out of the
suddenly , she heard a sharp noise behind her . Pink
Suddenly , he heard a loud noise happen and he got
Suddenly , a loud noise came from the sky
playing , Benny heard a loud noise . It was a
a sudden she heard a loud noise . Mad eline panicked
but then they heard a loud noise . They
trap . Suddenly , a loud noise came from inside and

MIN ACTIVATIONS
MIN = -6.638

of their feelings . By being thoughtful and kind we can
the sunrise together . They became the best of friends ,
and the sheep continued to be friends , sharing y ummy
mighty blue berry would always be united , no matter the
ittens on , and enjoyed being warm and snug all winter
flying every day and they became very good at it .
extra scoop of wheat for being so eager ! <|endoftext|> M
fun on the farm and became best friends . <|endoftext|> Once
she sings a song about being home . Fl uffy listens
talked , laughed and enjoyed being together . <|endoftext|> S ally
him every day and they became best friends . One day
fun all day . They became best friends and shared many
Elephant , and the cat became the best of friends .
ined together , and they became the best of friends .
it was a reward for being good and helping them ."
oy ster and the crab became good friends and protected each
played all day . They became best friends and had many
every day , and they became the best of friends .
<|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> L ila and Ben are friends
and had fun . They became best friends and lived happily

INTERVAL 3.760 - 7.226
CONTAINS 1.708%

Lily shouted . " No , you can 't
" Stop !" a voice said . It was
shiny one . " Sorry , kids , I
noise . It was a high - p itched , sque
. He only made a funny sound . He tried again

INTERVAL 0.294 - 3.760
CONTAINS 40.855%

. But then the bunny had a surprise . The doctor
worry , my love . I can help you ," her
the measure . She liked to measure things . She measured
tired to run anymore . He was very scared and his
floor . One day , Mom says , "

INTERVAL -3.172 - 0.294
CONTAINS 54.345%

goodbye to the soft tank and thanked it for the wonderful
" Sorry , mom , we were just playing
touch them , and look at their colors . She had
a big castle with her mom my and daddy , the
very thick and strong . One day , the staff decided

INTERVAL -6.638 - -3.172
CONTAINS 3.093%

my pill to make me feel better . Mom supports me
he made mistakes . <|endoftext|> M ama and Daddy were having
have any ideas ? â The girl smiled
Tim and Sam played with the cars . They raced
Sara . We love you very much . Please don 't
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
driver+0.061
 unravel+0.060
paper+0.060
 bud+0.056
ro+0.055
Neg logit contributions
 much-0.044
 best-0.042
 And-0.040
 Everyone-0.040
 important-0.040
Token
Token ID198
Feature activation+1.19e+00

Pos logit contributions
driver+0.013
paper+0.013
 unravel+0.013
 bud+0.012
ge+0.012
Neg logit contributions
 much-0.006
 And-0.006
 Everyone-0.006
 best-0.005
 important-0.005
Token
Token ID198
Feature activation+1.65e+00

Pos logit contributions
driver+0.035
 unravel+0.033
paper+0.033
 bud+0.032
ro+0.031
Neg logit contributions
 best-0.020
 much-0.019
 important-0.019
 Everyone-0.018
 And-0.018
TokenRex
Token ID47389
Feature activation+1.92e+00

Pos logit contributions
driver+0.034
paper+0.033
 unravel+0.033
 bud+0.030
pan+0.029
Neg logit contributions
 much-0.043
 best-0.042
 Everyone-0.041
 And-0.041
 important-0.041
Token sn
Token ID3013
Feature activation+5.48e-01

Pos logit contributions
driver+0.050
paper+0.046
 unravel+0.045
ge+0.044
 bud+0.043
Neg logit contributions
 best-0.041
 much-0.040
 And-0.040
 Everyone-0.038
 important-0.038
Tokenarl
Token ID7063
Feature activation+7.23e+00

Pos logit contributions
driver+0.010
 unravel+0.010
paper+0.010
 bud+0.009
 rip+0.008
Neg logit contributions
 much-0.015
 best-0.015
 And-0.015
 Everyone-0.015
 important-0.015
Tokens
Token ID82
Feature activation+1.57e+00

Pos logit contributions
 unravel+0.155
paper+0.150
driver+0.147
 rip+0.131
ge+0.131
Neg logit contributions
 best-0.184
 Everyone-0.179
 And-0.175
 much-0.171
 many-0.168
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
driver+0.042
paper+0.041
 unravel+0.039
 bud+0.036
pan+0.035
Neg logit contributions
 much-0.033
 best-0.031
 many-0.031
 important-0.028
 named-0.028
Token He
Token ID679
Feature activation+2.34e+00

Pos logit contributions
driver+0.004
 unravel+0.004
paper+0.004
ro+0.003
 bud+0.003
Neg logit contributions
 best-0.005
 much-0.005
 important-0.005
 Everyone-0.004
ovable-0.004
Token does
Token ID857
Feature activation+2.25e+00

Pos logit contributions
driver+0.055
paper+0.052
 unravel+0.049
 bud+0.048
ge+0.047
Neg logit contributions
 best-0.055
 much-0.054
 And-0.052
 important-0.052
 many-0.051
Token not
Token ID407
Feature activation+2.89e+00

Pos logit contributions
driver+0.071
paper+0.069
 unravel+0.067
 bud+0.065
ho+0.063
Neg logit contributions
 much-0.036
 best-0.033
 important-0.033
 Everyone-0.032
 And-0.031
Token It
Token ID632
Feature activation+2.70e+00

Pos logit contributions
 much+0.005
 best+0.005
 important+0.005
 Everyone+0.005
 And+0.005
Neg logit contributions
driver-0.010
paper-0.010
 unravel-0.010
 bud-0.009
ro-0.009
Token was
Token ID373
Feature activation-1.81e+00

Pos logit contributions
driver+0.080
paper+0.075
 unravel+0.073
 bud+0.071
pan+0.069
Neg logit contributions
 much-0.048
 best-0.046
 important-0.045
 named-0.043
 Everyone-0.042
Token hiding
Token ID11816
Feature activation+9.01e-01

Pos logit contributions
 much+0.032
 Everyone+0.032
 best+0.032
 important+0.031
 And+0.031
Neg logit contributions
driver-0.050
 unravel-0.048
paper-0.048
 bud-0.046
ro-0.044
Token under
Token ID739
Feature activation-7.84e-01

Pos logit contributions
driver+0.026
 unravel+0.025
paper+0.025
 bud+0.024
ro+0.023
Neg logit contributions
 much-0.016
 best-0.015
 important-0.015
 And-0.014
 Everyone-0.014
Token a
Token ID257
Feature activation+2.31e+00

Pos logit contributions
 much+0.008
 Everyone+0.006
 best+0.006
 many+0.006
 And+0.006
Neg logit contributions
driver-0.030
paper-0.028
 unravel-0.028
 bud-0.027
ge-0.027
Token seaw
Token ID32085
Feature activation+7.06e+00

Pos logit contributions
driver+0.056
paper+0.053
 unravel+0.051
ge+0.050
ASH+0.047
Neg logit contributions
 much-0.057
 best-0.049
 many-0.048
 important-0.047
 Everyone-0.046
Tokeneed
Token ID2308
Feature activation+2.99e+00

Pos logit contributions
 unravel+0.138
driver+0.138
paper+0.129
ge+0.122
pan+0.119
Neg logit contributions
 much-0.179
 best-0.172
 And-0.168
 important-0.165
 Everyone-0.164
Token.
Token ID13
Feature activation-8.84e-01

Pos logit contributions
driver+0.085
paper+0.085
 unravel+0.084
ASH+0.079
Mine+0.074
Neg logit contributions
 best-0.048
 much-0.048
 many-0.045
 named-0.044
 day-0.041
Token Lily
Token ID20037
Feature activation+2.71e-01

Pos logit contributions
 best+0.017
 important+0.017
 much+0.016
ovable+0.015
 Everyone+0.015
Neg logit contributions
driver-0.024
 unravel-0.023
paper-0.022
ro-0.022
 bud-0.021
Token said
Token ID531
Feature activation+1.31e+00

Pos logit contributions
driver+0.008
paper+0.007
 unravel+0.007
pan+0.007
 bud+0.007
Neg logit contributions
 And-0.005
 much-0.005
 named-0.005
 best-0.005
 important-0.005
Token,
Token ID11
Feature activation+2.88e+00

Pos logit contributions
driver+0.038
 unravel+0.037
paper+0.036
 bud+0.035
ro+0.034
Neg logit contributions
 much-0.023
 best-0.023
 important-0.021
 Everyone-0.021
 And-0.021
Token your
Token ID534
Feature activation+8.12e-01

Pos logit contributions
driver+0.031
 unravel+0.030
paper+0.030
 bud+0.029
 rip+0.028
Neg logit contributions
 much-0.015
 Everyone-0.015
 best-0.014
 And-0.014
 important-0.014
Token fault
Token ID8046
Feature activation+4.82e-01

Pos logit contributions
driver+0.022
 unravel+0.021
paper+0.021
 bud+0.019
ro+0.019
Neg logit contributions
 best-0.017
 important-0.016
 much-0.016
 And-0.015
 Everyone-0.014
Token,
Token ID11
Feature activation+1.91e+00

Pos logit contributions
 unravel+0.012
driver+0.012
paper+0.012
ro+0.011
 bud+0.011
Neg logit contributions
 best-0.010
 much-0.010
 important-0.009
 Everyone-0.009
 And-0.009
Token my
Token ID616
Feature activation+9.69e-01

Pos logit contributions
driver+0.053
paper+0.050
 unravel+0.050
 bud+0.048
ro+0.046
Neg logit contributions
 much-0.035
 best-0.034
 Everyone-0.034
 important-0.034
 And-0.033
Token d
Token ID288
Feature activation+9.95e-01

Pos logit contributions
driver+0.028
paper+0.027
 unravel+0.027
 bud+0.026
ro+0.025
Neg logit contributions
 much-0.017
 best-0.016
 Everyone-0.016
 important-0.016
 And-0.015
Tokenarl
Token ID7063
Feature activation+6.82e+00

Pos logit contributions
 unravel+0.016
paper+0.016
driver+0.015
 bud+0.014
 rip+0.013
Neg logit contributions
 much-0.029
 Everyone-0.028
 best-0.028
 And-0.028
 important-0.027
Tokenings
Token ID654
Feature activation-3.17e-01

Pos logit contributions
 unravel+0.161
paper+0.149
 snatch+0.138
driver+0.137
ASH+0.136
Neg logit contributions
 best-0.166
 And-0.147
 Everyone-0.147
 much-0.140
 important-0.140
Token.
Token ID13
Feature activation-3.25e-01

Pos logit contributions
 much+0.006
 Everyone+0.005
 important+0.005
 best+0.005
 And+0.005
Neg logit contributions
 unravel-0.009
paper-0.009
driver-0.009
ge-0.009
 clip-0.008
Token You
Token ID921
Feature activation-1.29e+00

Pos logit contributions
 best+0.007
 much+0.007
 important+0.007
 many+0.006
 days+0.006
Neg logit contributions
driver-0.008
ro-0.007
paper-0.007
 unravel-0.007
 rip-0.007
Token didn
Token ID1422
Feature activation+2.81e+00

Pos logit contributions
 much+0.027
 best+0.026
 important+0.025
 Everyone+0.025
 And+0.025
Neg logit contributions
driver-0.033
 unravel-0.032
paper-0.031
 bud-0.030
ro-0.029
Token't
Token ID470
Feature activation+3.20e+00

Pos logit contributions
driver+0.049
 unravel+0.046
paper+0.045
 bud+0.042
ro+0.039
Neg logit contributions
 best-0.080
 much-0.080
 important-0.078
 Everyone-0.077
 And-0.076
TokenSpot
Token ID32565
Feature activation+2.00e+00

Pos logit contributions
driver+0.036
paper+0.034
 unravel+0.034
 bud+0.032
ro+0.030
Neg logit contributions
 much-0.041
 best-0.040
 important-0.039
 Everyone-0.039
 And-0.038
Token grow
Token ID1663
Feature activation+1.69e+00

Pos logit contributions
driver+0.047
paper+0.044
 unravel+0.043
 bud+0.041
ge+0.041
Neg logit contributions
 best-0.046
 much-0.045
 And-0.045
 Everyone-0.043
 important-0.043
Tokenls
Token ID7278
Feature activation-1.07e-01

Pos logit contributions
driver+0.035
paper+0.034
 unravel+0.033
 bud+0.032
ro+0.030
Neg logit contributions
 much-0.042
 best-0.041
 important-0.040
 Everyone-0.040
 And-0.040
Token and
Token ID290
Feature activation+1.46e+00

Pos logit contributions
 much+0.002
 best+0.002
 And+0.002
 many+0.002
 Everyone+0.002
Neg logit contributions
driver-0.003
paper-0.003
 unravel-0.003
 bud-0.003
pan-0.003
Token sn
Token ID3013
Feature activation-1.49e-01

Pos logit contributions
driver+0.034
paper+0.032
 unravel+0.029
ge+0.028
 bud+0.028
Neg logit contributions
 much-0.034
 best-0.034
 And-0.033
 important-0.032
 many-0.031
Tokenarl
Token ID7063
Feature activation+6.72e+00

Pos logit contributions
 much+0.004
 best+0.004
 important+0.004
 Everyone+0.004
 And+0.004
Neg logit contributions
driver-0.003
 unravel-0.002
paper-0.002
 bud-0.002
ro-0.002
Tokens
Token ID82
Feature activation+3.88e-01

Pos logit contributions
 unravel+0.143
paper+0.138
driver+0.136
 bud+0.122
 rip+0.120
Neg logit contributions
 best-0.172
 much-0.167
 Everyone-0.165
 And-0.164
 many-0.160
Token.
Token ID13
Feature activation-3.77e-03

Pos logit contributions
driver+0.010
paper+0.010
 unravel+0.010
 bud+0.009
pan+0.008
Neg logit contributions
 much-0.009
 best-0.008
 many-0.008
 important-0.007
 And-0.007
Token He
Token ID679
Feature activation+2.26e+00

Pos logit contributions
 best+0.000
 much+0.000
 important+0.000
 Everyone+0.000
 And+0.000
Neg logit contributions
driver-0.000
 unravel-0.000
paper-0.000
 bud-0.000
ro-0.000
Token tries
Token ID8404
Feature activation+3.35e+00

Pos logit contributions
driver+0.050
paper+0.048
 unravel+0.046
 bud+0.044
ge+0.042
Neg logit contributions
 best-0.054
 much-0.054
 important-0.052
 And-0.052
 Everyone-0.051
Token to
Token ID284
Feature activation+4.16e+00

Pos logit contributions
driver+0.084
paper+0.080
 unravel+0.080
pan+0.076
 bud+0.073
Neg logit contributions
 much-0.076
 best-0.071
 many-0.069
 important-0.068
 And-0.066
Token when
Token ID618
Feature activation-1.03e+00

Pos logit contributions
driver+0.005
paper+0.005
 unravel+0.005
 bud+0.004
ro+0.004
Neg logit contributions
 much-0.003
 best-0.003
 important-0.003
 Everyone-0.003
 And-0.003
Token suddenly
Token ID6451
Feature activation+1.76e+00

Pos logit contributions
 important+0.028
 And+0.028
 best+0.028
ovable+0.027
 Everyone+0.027
Neg logit contributions
paper-0.017
driver-0.017
 unravel-0.016
ho-0.016
 bud-0.015
Token he
Token ID339
Feature activation+2.71e+00

Pos logit contributions
driver+0.041
paper+0.041
 unravel+0.040
 bud+0.038
 swoop+0.036
Neg logit contributions
 best-0.039
 important-0.038
 And-0.037
 Everyone-0.036
 much-0.036
Token heard
Token ID2982
Feature activation+1.61e+00

Pos logit contributions
driver+0.049
paper+0.047
 unravel+0.047
 bud+0.044
ro+0.041
Neg logit contributions
 much-0.075
 best-0.074
 important-0.073
 Everyone-0.073
 And-0.072
Token a
Token ID257
Feature activation+5.36e+00

Pos logit contributions
driver+0.047
 unravel+0.047
paper+0.046
 bud+0.044
 rip+0.043
Neg logit contributions
 best-0.026
 important-0.023
 glad-0.023
 And-0.023
ovable-0.022
Token loud
Token ID7812
Feature activation+6.72e+00

Pos logit contributions
driver+0.163
paper+0.152
 unravel+0.144
ro+0.143
pan+0.141
Neg logit contributions
 much-0.099
 important-0.093
 best-0.089
 Everyone-0.086
 And-0.083
Token roar
Token ID34532
Feature activation+2.62e+00

Pos logit contributions
driver+0.174
paper+0.166
 clip+0.156
 unravel+0.155
ge+0.152
Neg logit contributions
 much-0.142
 Everyone-0.138
 And-0.136
 important-0.133
 best-0.127
Token.
Token ID13
Feature activation+1.14e+00

Pos logit contributions
driver+0.077
paper+0.074
 unravel+0.072
ro+0.068
 clip+0.068
Neg logit contributions
 much-0.048
 best-0.044
 important-0.043
 Everyone-0.042
 And-0.041
Token He
Token ID679
Feature activation+3.65e+00

Pos logit contributions
driver+0.030
 unravel+0.029
paper+0.029
 bud+0.027
ro+0.026
Neg logit contributions
 best-0.023
 much-0.022
 important-0.022
 Everyone-0.021
 And-0.020
Token was
Token ID373
Feature activation-8.46e-01

Pos logit contributions
driver+0.078
paper+0.074
 unravel+0.073
 bud+0.069
ro+0.066
Neg logit contributions
 much-0.091
 best-0.089
 important-0.087
 Everyone-0.086
 And-0.085
Token scared
Token ID12008
Feature activation+1.85e+00

Pos logit contributions
 Everyone+0.017
 best+0.017
 much+0.017
 important+0.017
 And+0.017
Neg logit contributions
 unravel-0.020
driver-0.020
paper-0.019
 bud-0.019
 rip-0.018
Token let
Token ID1309
Feature activation+1.82e+00

Pos logit contributions
 much+0.007
 best+0.007
 important+0.007
 named+0.006
 And+0.006
Neg logit contributions
driver-0.010
paper-0.010
 unravel-0.009
pan-0.009
ro-0.009
Token out
Token ID503
Feature activation+1.82e-01

Pos logit contributions
driver+0.048
 unravel+0.046
paper+0.046
 bud+0.044
ro+0.042
Neg logit contributions
 much-0.036
 best-0.036
 important-0.035
 Everyone-0.034
 And-0.033
Token a
Token ID257
Feature activation+3.87e+00

Pos logit contributions
driver+0.006
 unravel+0.006
paper+0.005
 bud+0.005
 rip+0.005
Neg logit contributions
 best-0.003
 important-0.002
 much-0.002
 Everyone-0.002
 And-0.002
Token big
Token ID1263
Feature activation+4.71e+00

Pos logit contributions
driver+0.115
paper+0.112
 unravel+0.104
ro+0.102
pan+0.100
Neg logit contributions
 much-0.073
 best-0.067
 important-0.065
 And-0.060
 lucky-0.060
Token sn
Token ID3013
Feature activation+9.39e-01

Pos logit contributions
paper+0.114
driver+0.113
ge+0.106
 unravel+0.103
ro+0.100
Neg logit contributions
 much-0.105
 best-0.098
 And-0.097
 important-0.095
 named-0.094
Tokenarl
Token ID7063
Feature activation+6.72e+00

Pos logit contributions
driver+0.022
 unravel+0.021
paper+0.021
ro+0.020
 bud+0.020
Neg logit contributions
 best-0.021
 much-0.021
 important-0.021
 Everyone-0.020
 And-0.019
Token and
Token ID290
Feature activation+5.27e-01

Pos logit contributions
paper+0.178
 unravel+0.177
driver+0.167
ASH+0.167
 snatch+0.161
Neg logit contributions
 much-0.127
 best-0.126
 And-0.123
 Everyone-0.122
 many-0.113
Token the
Token ID262
Feature activation+1.65e+00

Pos logit contributions
driver+0.013
paper+0.012
 unravel+0.012
 bud+0.012
ro+0.011
Neg logit contributions
 much-0.012
 best-0.011
 important-0.011
 Everyone-0.011
 And-0.011
Token mouse
Token ID10211
Feature activation+1.34e+00

Pos logit contributions
driver+0.045
paper+0.043
 unravel+0.042
 bud+0.039
ro+0.038
Neg logit contributions
 best-0.033
 much-0.033
 important-0.032
 Everyone-0.030
 And-0.030
Token knew
Token ID2993
Feature activation+2.39e+00

Pos logit contributions
driver+0.033
paper+0.031
ro+0.029
 unravel+0.028
 clip+0.027
Neg logit contributions
 much-0.031
 best-0.030
 important-0.028
 named-0.028
 Everyone-0.027
Token he
Token ID339
Feature activation+1.95e+00

Pos logit contributions
driver+0.068
 unravel+0.065
paper+0.065
 bud+0.063
 clip+0.058
Neg logit contributions
 much-0.041
 Everyone-0.041
 best-0.041
 important-0.039
 And-0.039
Token
Token ID198
Feature activation+2.26e-02

Pos logit contributions
 best+0.021
 important+0.021
ovable+0.020
 much+0.019
 adventurous+0.018
Neg logit contributions
driver-0.019
ro-0.018
 unravel-0.018
paper-0.017
ho-0.016
Token
Token ID198
Feature activation+5.02e-01

Pos logit contributions
driver+0.000
 unravel+0.000
paper+0.000
ro+0.000
 bud+0.000
Neg logit contributions
 best-0.001
 important-0.000
ovable-0.000
 adventurous-0.000
 Everyone-0.000
TokenSuddenly
Token ID38582
Feature activation+1.70e+00

Pos logit contributions
driver+0.010
 unravel+0.009
paper+0.009
 bud+0.009
ro+0.009
Neg logit contributions
 best-0.013
 much-0.012
 important-0.012
 Everyone-0.012
ovable-0.012
Token,
Token ID11
Feature activation+1.03e+00

Pos logit contributions
 bud+0.032
driver+0.031
paper+0.028
 unravel+0.027
ho+0.026
Neg logit contributions
 important-0.045
ovable-0.043
 Everyone-0.042
 glad-0.042
 dearly-0.041
Token a
Token ID257
Feature activation+5.06e+00

Pos logit contributions
 bud+0.023
 unravel+0.022
driver+0.022
paper+0.021
 rip+0.021
Neg logit contributions
 important-0.023
ovable-0.023
 dearly-0.021
 And-0.021
 Everyone-0.020
Token loud
Token ID7812
Feature activation+6.61e+00

Pos logit contributions
driver+0.148
paper+0.145
 unravel+0.139
ge+0.135
ro+0.133
Neg logit contributions
 much-0.093
 best-0.087
 important-0.084
 Everyone-0.081
 And-0.078
Token boom
Token ID14166
Feature activation+3.94e+00

Pos logit contributions
driver+0.162
paper+0.161
 unravel+0.152
ge+0.151
 clip+0.144
Neg logit contributions
 much-0.140
 Everyone-0.139
 And-0.137
 named-0.132
 important-0.129
Token made
Token ID925
Feature activation-1.84e-01

Pos logit contributions
driver+0.105
paper+0.101
 unravel+0.099
 clip+0.095
ro+0.091
Neg logit contributions
 much-0.077
 Everyone-0.076
 best-0.070
 named-0.070
 And-0.069
Token them
Token ID606
Feature activation-2.26e-02

Pos logit contributions
 important+0.004
 best+0.004
 glad+0.004
 much+0.004
 And+0.004
Neg logit contributions
 unravel-0.004
driver-0.004
 bud-0.004
paper-0.004
 rip-0.004
Token stop
Token ID2245
Feature activation+6.93e-01

Pos logit contributions
 much+0.001
 best+0.001
 important+0.000
 Everyone+0.000
 And+0.000
Neg logit contributions
driver-0.001
paper-0.001
 unravel-0.001
 bud-0.000
ro-0.000
Token and
Token ID290
Feature activation+1.78e+00

Pos logit contributions
driver+0.020
paper+0.019
 unravel+0.018
 bud+0.017
ro+0.017
Neg logit contributions
 much-0.013
 best-0.013
 important-0.012
 And-0.012
 Everyone-0.012
Token he
Token ID339
Feature activation+1.89e+00

Pos logit contributions
 best+0.023
 Everyone+0.023
 much+0.023
 important+0.023
 And+0.023
Neg logit contributions
driver-0.018
paper-0.018
 unravel-0.017
 bud-0.016
ho-0.016
Token hopped
Token ID45230
Feature activation+1.80e+00

Pos logit contributions
driver+0.046
paper+0.044
 unravel+0.043
 bud+0.040
pan+0.040
Neg logit contributions
 much-0.041
 important-0.040
 best-0.040
 named-0.038
 Everyone-0.038
Token up
Token ID510
Feature activation-5.40e-01

Pos logit contributions
driver+0.051
paper+0.049
 unravel+0.047
 bud+0.044
 dent+0.044
Neg logit contributions
 much-0.035
 best-0.033
 important-0.032
 And-0.031
 Everyone-0.031
Token onto
Token ID4291
Feature activation+7.07e-01

Pos logit contributions
 much+0.010
 many+0.009
 best+0.008
 And+0.008
 important+0.008
Neg logit contributions
driver-0.016
paper-0.016
 unravel-0.015
 bud-0.014
 clip-0.014
Token a
Token ID257
Feature activation+3.48e+00

Pos logit contributions
driver+0.023
paper+0.022
 unravel+0.022
 bud+0.021
ro+0.021
Neg logit contributions
 much-0.010
 best-0.009
 important-0.009
 Everyone-0.009
 And-0.009
Token fallen
Token ID9292
Feature activation+6.60e+00

Pos logit contributions
driver+0.075
paper+0.068
 unravel+0.066
pan+0.058
ASH+0.058
Neg logit contributions
 much-0.094
 best-0.087
 important-0.086
 many-0.082
 lucky-0.081
Token leaf
Token ID12835
Feature activation+3.20e+00

Pos logit contributions
driver+0.136
ge+0.133
paper+0.131
 unravel+0.125
Mine+0.113
Neg logit contributions
 much-0.156
 best-0.147
 Everyone-0.147
 And-0.145
 named-0.141
Token and
Token ID290
Feature activation+7.11e-01

Pos logit contributions
paper+0.101
driver+0.098
use+0.094
 unravel+0.094
Mine+0.093
Neg logit contributions
 much-0.046
 best-0.039
 about-0.038
 And-0.037
 many-0.037
Token floated
Token ID29742
Feature activation+9.85e-01

Pos logit contributions
driver+0.017
paper+0.016
 unravel+0.015
 bud+0.015
ge+0.014
Neg logit contributions
 much-0.016
 best-0.015
 important-0.015
 many-0.015
 Everyone-0.015
Token across
Token ID1973
Feature activation-5.26e-01

Pos logit contributions
driver+0.030
 unravel+0.026
paper+0.026
ge+0.025
 dent+0.025
Neg logit contributions
 much-0.018
 best-0.017
 many-0.016
 important-0.016
 about-0.015
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
 much+0.005
 many+0.004
 Everyone+0.004
 best+0.004
 And+0.004
Neg logit contributions
driver-0.021
paper-0.019
 unravel-0.019
 clip-0.019
 bud-0.019
Token's
Token ID338
Feature activation-1.64e+00

Pos logit contributions
driver+0.048
 unravel+0.046
paper+0.046
 bud+0.044
 rip+0.042
Neg logit contributions
 much-0.033
 Everyone-0.031
 best-0.031
 important-0.031
 And-0.030
Token okay
Token ID8788
Feature activation-1.92e-01

Pos logit contributions
 Everyone+0.034
ovable+0.032
 Everybody+0.031
 People+0.030
orry+0.028
Neg logit contributions
 unravel-0.040
 bud-0.039
driver-0.037
 rip-0.035
 clip-0.035
Token,
Token ID11
Feature activation+1.08e+00

Pos logit contributions
 much+0.004
 best+0.004
 Everyone+0.004
 important+0.004
 And+0.004
Neg logit contributions
driver-0.005
paper-0.005
 unravel-0.005
 bud-0.005
ro-0.004
Token my
Token ID616
Feature activation+1.35e+00

Pos logit contributions
driver+0.031
paper+0.029
 unravel+0.029
 bud+0.028
ro+0.027
Neg logit contributions
 much-0.019
 best-0.019
 important-0.018
 Everyone-0.018
 And-0.018
Token d
Token ID288
Feature activation+1.02e+00

Pos logit contributions
driver+0.040
paper+0.038
 unravel+0.037
 bud+0.037
ro+0.036
Neg logit contributions
 much-0.021
 Everyone-0.020
 best-0.020
 important-0.019
ovable-0.019
Tokenarl
Token ID7063
Feature activation+6.58e+00

Pos logit contributions
 unravel+0.015
driver+0.015
paper+0.014
 bud+0.013
ho+0.012
Neg logit contributions
 much-0.032
 Everyone-0.031
 best-0.031
 And-0.030
 important-0.030
Tokenings
Token ID654
Feature activation-7.98e-01

Pos logit contributions
 unravel+0.151
paper+0.143
driver+0.133
 swoop+0.129
 snatch+0.128
Neg logit contributions
 best-0.160
 Everyone-0.148
 And-0.145
 much-0.139
 friends-0.138
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
 much+0.014
 best+0.013
 And+0.013
 Everyone+0.013
 important+0.013
Neg logit contributions
 unravel-0.023
paper-0.023
driver-0.023
 bud-0.021
 clip-0.021
Token I
Token ID314
Feature activation+1.31e+00

Pos logit contributions
 best+0.027
 important+0.027
 much+0.027
 many+0.024
 lucky+0.024
Neg logit contributions
driver-0.026
ro-0.023
paper-0.022
 unravel-0.022
 rip-0.021
Token forgive
Token ID20927
Feature activation+3.66e-01

Pos logit contributions
driver+0.032
paper+0.031
 unravel+0.030
 bud+0.029
ro+0.027
Neg logit contributions
 much-0.029
 best-0.029
 important-0.028
 And-0.027
 Everyone-0.027
Token you
Token ID345
Feature activation-1.73e+00

Pos logit contributions
driver+0.010
paper+0.010
 unravel+0.010
 bud+0.009
ro+0.009
Neg logit contributions
 much-0.007
 best-0.006
 important-0.006
 Everyone-0.006
 And-0.006
Token sudden
Token ID4802
Feature activation+1.65e+00

Pos logit contributions
driver+0.053
paper+0.051
 unravel+0.050
ro+0.047
 bud+0.047
Neg logit contributions
 much-0.032
 best-0.030
 important-0.029
 Everyone-0.027
 many-0.026
Token,
Token ID11
Feature activation+1.27e+00

Pos logit contributions
driver+0.041
paper+0.039
 unravel+0.039
 bud+0.038
 swoop+0.035
Neg logit contributions
 best-0.034
 important-0.034
 And-0.033
 glad-0.032
 much-0.032
Token he
Token ID339
Feature activation+2.84e+00

Pos logit contributions
driver+0.029
paper+0.028
 unravel+0.028
 bud+0.026
 rip+0.025
Neg logit contributions
 important-0.028
ovable-0.027
 best-0.027
 And-0.026
 Everyone-0.026
Token heard
Token ID2982
Feature activation+1.65e+00

Pos logit contributions
driver+0.061
paper+0.056
 unravel+0.056
 bud+0.052
ro+0.051
Neg logit contributions
 much-0.072
 best-0.072
 important-0.068
 Everyone-0.067
 And-0.065
Token a
Token ID257
Feature activation+5.12e+00

Pos logit contributions
driver+0.051
 unravel+0.050
paper+0.050
 bud+0.047
 rip+0.046
Neg logit contributions
 best-0.024
 important-0.022
 much-0.021
 And-0.021
 glad-0.021
Token loud
Token ID7812
Feature activation+6.57e+00

Pos logit contributions
driver+0.155
paper+0.147
 unravel+0.139
ro+0.136
ge+0.134
Neg logit contributions
 much-0.096
 important-0.088
 best-0.086
 Everyone-0.083
 lucky-0.080
Token "
Token ID366
Feature activation+4.13e+00

Pos logit contributions
driver+0.170
paper+0.162
 unravel+0.150
 clip+0.149
ge+0.146
Neg logit contributions
 much-0.140
 Everyone-0.136
 And-0.132
 important-0.129
 best-0.123
TokenZ
Token ID57
Feature activation-5.35e-01

Pos logit contributions
driver+0.092
paper+0.082
 bud+0.079
pan+0.077
 unravel+0.076
Neg logit contributions
 much-0.102
 Everyone-0.099
 best-0.095
 important-0.093
 And-0.093
Tokenoom
Token ID4207
Feature activation+1.06e+00

Pos logit contributions
 much+0.013
 best+0.012
 important+0.012
 Everyone+0.012
 And+0.012
Neg logit contributions
driver-0.012
paper-0.012
 unravel-0.012
 bud-0.011
ro-0.010
Token!"
Token ID2474
Feature activation+8.59e-01

Pos logit contributions
driver+0.025
paper+0.024
 unravel+0.023
 bud+0.022
 clip+0.021
Neg logit contributions
 Everyone-0.024
 much-0.024
 best-0.023
 important-0.022
 And-0.022
Token
Token ID198
Feature activation+2.30e+00

Pos logit contributions
driver+0.021
paper+0.020
ro+0.020
 unravel+0.020
 bud+0.019
Neg logit contributions
 best-0.019
 important-0.018
 much-0.017
ovable-0.017
 glad-0.017
Token when
Token ID618
Feature activation-1.40e+00

Pos logit contributions
driver+0.034
 unravel+0.033
paper+0.033
 bud+0.031
ro+0.030
Neg logit contributions
 much-0.025
 best-0.025
 Everyone-0.024
 important-0.024
 And-0.023
Token he
Token ID339
Feature activation+1.05e+00

Pos logit contributions
ovable+0.039
 And+0.038
 important+0.038
 best+0.037
 Everyone+0.037
Neg logit contributions
paper-0.022
 unravel-0.022
driver-0.021
 bud-0.020
ho-0.019
Token suddenly
Token ID6451
Feature activation+1.83e+00

Pos logit contributions
driver+0.023
paper+0.021
 unravel+0.020
 bud+0.019
ro+0.018
Neg logit contributions
 much-0.027
 best-0.027
 important-0.025
 Everyone-0.025
 And-0.025
Token heard
Token ID2982
Feature activation+1.39e+00

Pos logit contributions
driver+0.037
paper+0.036
 unravel+0.036
 bud+0.033
ro+0.031
Neg logit contributions
 much-0.047
 best-0.047
 important-0.045
 Everyone-0.045
 And-0.045
Token a
Token ID257
Feature activation+4.97e+00

Pos logit contributions
driver+0.040
 unravel+0.039
paper+0.038
 bud+0.037
 rip+0.036
Neg logit contributions
 best-0.024
 important-0.021
 glad-0.021
ovable-0.020
 And-0.020
Token loud
Token ID7812
Feature activation+6.55e+00

Pos logit contributions
driver+0.148
paper+0.140
 unravel+0.133
ro+0.130
ge+0.127
Neg logit contributions
 much-0.095
 important-0.088
 best-0.087
 Everyone-0.084
 And-0.081
Token noise
Token ID7838
Feature activation+2.05e+00

Pos logit contributions
driver+0.170
paper+0.164
 unravel+0.153
 clip+0.152
ge+0.149
Neg logit contributions
 much-0.137
 Everyone-0.133
 And-0.131
 important-0.128
 best-0.123
Token.
Token ID13
Feature activation+7.72e-01

Pos logit contributions
driver+0.061
paper+0.059
 unravel+0.057
ro+0.054
 bud+0.054
Neg logit contributions
 much-0.036
 Everyone-0.032
 best-0.032
 important-0.032
 And-0.031
Token It
Token ID632
Feature activation+3.50e+00

Pos logit contributions
driver+0.020
 unravel+0.019
paper+0.019
 bud+0.018
ro+0.018
Neg logit contributions
 best-0.016
 much-0.015
 important-0.015
 Everyone-0.014
 And-0.014
Token scared
Token ID12008
Feature activation+1.99e+00

Pos logit contributions
driver+0.102
paper+0.097
 unravel+0.092
ro+0.090
 clip+0.090
Neg logit contributions
 much-0.065
 best-0.062
 important-0.061
 Everyone-0.060
 And-0.057
Token him
Token ID683
Feature activation+1.10e+00

Pos logit contributions
paper+0.037
driver+0.037
 unravel+0.036
 bud+0.035
 rip+0.033
Neg logit contributions
 best-0.051
 important-0.051
 much-0.049
 glad-0.049
ovable-0.048
Token when
Token ID618
Feature activation-1.42e+00

Pos logit contributions
driver+0.028
 unravel+0.028
paper+0.028
 bud+0.026
ro+0.025
Neg logit contributions
 much-0.021
 best-0.021
 Everyone-0.020
 important-0.020
 And-0.019
Token suddenly
Token ID6451
Feature activation+1.30e+00

Pos logit contributions
 important+0.043
ovable+0.042
 And+0.041
anny+0.040
 Everyone+0.040
Neg logit contributions
paper-0.019
 unravel-0.018
driver-0.017
 bud-0.017
ho-0.017
Token he
Token ID339
Feature activation+2.13e+00

Pos logit contributions
driver+0.032
paper+0.031
 unravel+0.030
 bud+0.030
 swoop+0.028
Neg logit contributions
 best-0.027
 important-0.027
 And-0.025
ovable-0.025
 glad-0.025
Token heard
Token ID2982
Feature activation+1.71e+00

Pos logit contributions
driver+0.041
paper+0.038
 unravel+0.038
 bud+0.036
ro+0.034
Neg logit contributions
 much-0.058
 best-0.057
 important-0.056
 Everyone-0.055
 And-0.054
Token a
Token ID257
Feature activation+5.19e+00

Pos logit contributions
driver+0.050
 unravel+0.049
paper+0.048
 bud+0.046
 rip+0.046
Neg logit contributions
 best-0.029
 important-0.026
 glad-0.025
ovable-0.024
 And-0.024
Token loud
Token ID7812
Feature activation+6.54e+00

Pos logit contributions
driver+0.153
paper+0.146
 unravel+0.138
ro+0.136
ge+0.134
Neg logit contributions
 much-0.099
 important-0.092
 best-0.089
 Everyone-0.088
 And-0.084
Token noise
Token ID7838
Feature activation+2.06e+00

Pos logit contributions
driver+0.170
paper+0.165
 unravel+0.153
 clip+0.152
ge+0.151
Neg logit contributions
 much-0.135
 Everyone-0.133
 And-0.131
 important-0.125
 named-0.120
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
driver+0.059
paper+0.057
 unravel+0.056
 bud+0.053
ro+0.052
Neg logit contributions
 much-0.038
 best-0.035
 Everyone-0.034
 important-0.034
 And-0.034
Token He
Token ID679
Feature activation+3.50e+00

Pos logit contributions
driver+0.035
 unravel+0.032
paper+0.032
ro+0.031
 bud+0.031
Neg logit contributions
 best-0.035
 important-0.034
 much-0.033
 glad-0.031
ovable-0.031
Token was
Token ID373
Feature activation-9.70e-01

Pos logit contributions
driver+0.076
paper+0.072
 unravel+0.071
 bud+0.066
ro+0.064
Neg logit contributions
 much-0.087
 best-0.085
 important-0.083
 Everyone-0.082
 And-0.081
Token shocked
Token ID11472
Feature activation+1.47e+00

Pos logit contributions
 Everyone+0.020
 important+0.020
 best+0.020
 much+0.020
 And+0.020
Neg logit contributions
 unravel-0.023
driver-0.022
 bud-0.021
paper-0.021
 rip-0.020
Token it
Token ID340
Feature activation+5.84e-01

Pos logit contributions
driver+0.017
paper+0.017
 unravel+0.016
 bud+0.016
ho+0.016
Neg logit contributions
 much-0.009
 best-0.008
 Everyone-0.008
 many-0.008
 important-0.008
Token.
Token ID13
Feature activation-6.11e-01

Pos logit contributions
driver+0.017
paper+0.017
 unravel+0.016
 bud+0.016
ro+0.016
Neg logit contributions
 much-0.010
 best-0.010
 important-0.009
 Everyone-0.009
 many-0.009
Token Suddenly
Token ID24975
Feature activation+1.33e+00

Pos logit contributions
 best+0.012
 much+0.012
 important+0.012
 glad+0.011
ovable+0.011
Neg logit contributions
driver-0.016
 unravel-0.015
paper-0.015
 bud-0.014
ro-0.014
Token,
Token ID11
Feature activation+7.45e-01

Pos logit contributions
driver+0.032
 unravel+0.031
 bud+0.030
paper+0.030
 rip+0.027
Neg logit contributions
 important-0.028
 much-0.028
 best-0.028
 Everyone-0.028
 And-0.027
Token a
Token ID257
Feature activation+4.50e+00

Pos logit contributions
 unravel+0.019
driver+0.019
 bud+0.018
paper+0.018
 rip+0.017
Neg logit contributions
 important-0.014
 And-0.014
 best-0.013
ovable-0.013
 Everyone-0.013
Token loud
Token ID7812
Feature activation+6.53e+00

Pos logit contributions
driver+0.128
paper+0.126
 unravel+0.121
ge+0.116
ro+0.115
Neg logit contributions
 much-0.085
 best-0.081
 important-0.078
 Everyone-0.075
 And-0.072
Token noise
Token ID7838
Feature activation+2.42e+00

Pos logit contributions
paper+0.159
driver+0.158
ge+0.147
 unravel+0.147
 clip+0.140
Neg logit contributions
 Everyone-0.140
 much-0.140
 And-0.133
 important-0.132
 named-0.131
Token came
Token ID1625
Feature activation+2.89e+00

Pos logit contributions
driver+0.060
paper+0.057
 unravel+0.054
ro+0.052
 clip+0.052
Neg logit contributions
 much-0.053
 Everyone-0.050
 named-0.049
 important-0.048
 best-0.048
Token out
Token ID503
Feature activation+2.07e+00

Pos logit contributions
driver+0.084
paper+0.080
 unravel+0.077
ro+0.075
 clip+0.074
Neg logit contributions
 much-0.053
 best-0.051
 important-0.047
 And-0.046
 Everyone-0.045
Token of
Token ID286
Feature activation-6.29e-01

Pos logit contributions
driver+0.064
paper+0.063
 unravel+0.060
ro+0.057
 clip+0.057
Neg logit contributions
 much-0.034
 best-0.031
 Everyone-0.030
 And-0.030
 important-0.030
Token the
Token ID262
Feature activation-9.61e-03

Pos logit contributions
 much+0.006
 named+0.005
 many+0.005
 days+0.004
 best+0.004
Neg logit contributions
paper-0.024
driver-0.024
ro-0.022
ho-0.022
ge-0.022
Token suddenly
Token ID6451
Feature activation+1.46e+00

Pos logit contributions
 Everyone+0.006
ovable+0.006
 much+0.006
 And+0.006
anny+0.006
Neg logit contributions
 unravel-0.006
 bud-0.006
ho-0.006
driver-0.006
 clip-0.006
Token,
Token ID11
Feature activation+7.22e-01

Pos logit contributions
paper+0.028
 bud+0.028
 unravel+0.028
driver+0.028
 swoop+0.026
Neg logit contributions
ovable-0.035
 important-0.035
 best-0.035
 Everyone-0.034
 And-0.034
Token she
Token ID673
Feature activation+2.36e+00

Pos logit contributions
 unravel+0.018
 bud+0.017
paper+0.017
driver+0.016
ho+0.016
Neg logit contributions
anny-0.014
ovable-0.014
 important-0.013
 And-0.013
 Everyone-0.013
Token heard
Token ID2982
Feature activation+1.31e+00

Pos logit contributions
driver+0.047
paper+0.044
 unravel+0.044
 bud+0.041
ro+0.039
Neg logit contributions
 much-0.062
 best-0.061
 important-0.060
 Everyone-0.059
 And-0.058
Token a
Token ID257
Feature activation+4.96e+00

Pos logit contributions
 unravel+0.033
driver+0.031
 rip+0.030
 bud+0.030
paper+0.030
Neg logit contributions
 best-0.025
 glad-0.023
ovable-0.023
 And-0.022
 important-0.022
Token sharp
Token ID7786
Feature activation+6.50e+00

Pos logit contributions
driver+0.157
paper+0.149
 unravel+0.140
ro+0.140
ge+0.137
Neg logit contributions
 much-0.087
 important-0.080
 best-0.077
 Everyone-0.072
 lucky-0.070
Token noise
Token ID7838
Feature activation+2.17e+00

Pos logit contributions
driver+0.162
paper+0.158
 unravel+0.146
ge+0.144
 clip+0.141
Neg logit contributions
 much-0.136
 important-0.133
 best-0.131
 And-0.125
 named-0.123
Token behind
Token ID2157
Feature activation+1.21e+00

Pos logit contributions
driver+0.062
paper+0.060
 unravel+0.059
 bud+0.056
ro+0.055
Neg logit contributions
 much-0.040
 best-0.036
 important-0.036
 And-0.035
 Everyone-0.035
Token her
Token ID607
Feature activation+1.68e-01

Pos logit contributions
 unravel+0.031
driver+0.030
paper+0.029
 bud+0.028
 rip+0.027
Neg logit contributions
 best-0.025
 important-0.024
 much-0.023
 And-0.022
 Everyone-0.022
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
driver+0.004
paper+0.004
 unravel+0.004
 bud+0.004
ro+0.004
Neg logit contributions
 much-0.004
 best-0.003
 important-0.003
 Everyone-0.003
 And-0.003
Token Pink
Token ID14657
Feature activation+1.05e+00

Pos logit contributions
driver+0.029
 unravel+0.028
paper+0.028
 bud+0.027
ro+0.026
Neg logit contributions
 best-0.025
 much-0.025
 important-0.024
 glad-0.023
 Everyone-0.023
Token Suddenly
Token ID24975
Feature activation+1.91e+00

Pos logit contributions
 much+0.004
 Everyone+0.004
 best+0.004
 And+0.004
 important+0.003
Neg logit contributions
driver-0.008
paper-0.008
 unravel-0.008
 bud-0.008
 clip-0.008
Token,
Token ID11
Feature activation+6.30e-01

Pos logit contributions
driver+0.050
paper+0.047
 unravel+0.047
 bud+0.045
ho+0.042
Neg logit contributions
 best-0.038
 much-0.037
 important-0.037
 Everyone-0.037
 And-0.036
Token he
Token ID339
Feature activation+3.11e+00

Pos logit contributions
driver+0.016
 unravel+0.016
paper+0.016
 bud+0.015
 rip+0.014
Neg logit contributions
 best-0.012
 important-0.012
ovable-0.012
 And-0.012
 Everyone-0.012
Token heard
Token ID2982
Feature activation+1.63e+00

Pos logit contributions
driver+0.072
paper+0.067
 unravel+0.066
pan+0.063
ro+0.062
Neg logit contributions
 much-0.075
 best-0.073
 important-0.070
 named-0.066
 And-0.066
Token a
Token ID257
Feature activation+4.95e+00

Pos logit contributions
driver+0.050
 unravel+0.049
paper+0.048
 bud+0.045
 rip+0.045
Neg logit contributions
 best-0.026
 important-0.022
 And-0.022
ovable-0.022
 much-0.022
Token loud
Token ID7812
Feature activation+6.49e+00

Pos logit contributions
driver+0.150
paper+0.144
ro+0.134
 unravel+0.134
pan+0.131
Neg logit contributions
 much-0.092
 important-0.084
 best-0.080
 And-0.078
 Everyone-0.078
Token noise
Token ID7838
Feature activation+2.05e+00

Pos logit contributions
driver+0.167
paper+0.164
 clip+0.153
 unravel+0.152
ge+0.149
Neg logit contributions
 much-0.133
 And-0.130
 Everyone-0.129
 important-0.123
 named-0.120
Token happen
Token ID1645
Feature activation+1.95e+00

Pos logit contributions
driver+0.061
paper+0.059
 unravel+0.058
ro+0.055
 bud+0.054
Neg logit contributions
 much-0.035
 And-0.031
 named-0.031
 important-0.031
 Everyone-0.030
Token and
Token ID290
Feature activation+1.31e+00

Pos logit contributions
driver+0.064
 unravel+0.061
paper+0.061
ro+0.057
 clip+0.057
Neg logit contributions
 much-0.031
 best-0.028
 important-0.028
 And-0.025
 many-0.025
Token he
Token ID339
Feature activation+2.64e+00

Pos logit contributions
driver+0.029
paper+0.028
 unravel+0.028
 bud+0.026
ho+0.024
Neg logit contributions
 best-0.032
 much-0.032
 important-0.031
 Everyone-0.030
 And-0.030
Token got
Token ID1392
Feature activation+3.26e-01

Pos logit contributions
driver+0.058
paper+0.055
 unravel+0.054
 bud+0.051
ro+0.049
Neg logit contributions
 much-0.065
 best-0.064
 important-0.062
 Everyone-0.062
 And-0.061
Token
Token ID220
Feature activation+7.23e-01

Pos logit contributions
driver+0.002
 unravel+0.002
paper+0.001
 bud+0.001
ro+0.001
Neg logit contributions
 best-0.001
 important-0.001
ovable-0.001
 lucky-0.001
 much-0.001
Token
Token ID198
Feature activation+2.72e+00

Pos logit contributions
driver+0.020
 unravel+0.018
paper+0.018
ro+0.017
 bud+0.017
Neg logit contributions
 best-0.013
 important-0.013
 much-0.013
ovable-0.012
 glad-0.012
TokenSuddenly
Token ID38582
Feature activation+1.57e+00

Pos logit contributions
driver+0.072
 unravel+0.068
paper+0.068
 bud+0.065
ro+0.063
Neg logit contributions
 best-0.054
 much-0.054
 important-0.052
 Everyone-0.050
 And-0.050
Token,
Token ID11
Feature activation+1.01e+00

Pos logit contributions
driver+0.037
 bud+0.034
paper+0.034
 unravel+0.033
ho+0.029
Neg logit contributions
 important-0.035
 Everyone-0.034
 best-0.034
ovable-0.034
 much-0.033
Token a
Token ID257
Feature activation+4.57e+00

Pos logit contributions
driver+0.027
 bud+0.025
 unravel+0.025
paper+0.025
 rip+0.023
Neg logit contributions
 important-0.019
ovable-0.018
 And-0.018
 best-0.018
 glad-0.017
Token loud
Token ID7812
Feature activation+6.47e+00

Pos logit contributions
driver+0.132
paper+0.127
 unravel+0.126
 bud+0.119
ro+0.117
Neg logit contributions
 much-0.081
 best-0.079
 important-0.077
 Everyone-0.075
 And-0.072
Token noise
Token ID7838
Feature activation+2.45e+00

Pos logit contributions
driver+0.156
paper+0.153
 unravel+0.145
ge+0.140
 clip+0.137
Neg logit contributions
 much-0.142
 Everyone-0.140
 And-0.135
 best-0.135
 important-0.134
Token came
Token ID1625
Feature activation+2.95e+00

Pos logit contributions
driver+0.059
paper+0.056
 unravel+0.055
ro+0.051
 bud+0.051
Neg logit contributions
 much-0.055
 Everyone-0.052
 best-0.052
 important-0.052
 And-0.050
Token from
Token ID422
Feature activation-1.59e-01

Pos logit contributions
driver+0.080
paper+0.076
 unravel+0.075
ro+0.070
 clip+0.070
Neg logit contributions
 much-0.059
 best-0.057
 important-0.054
 And-0.053
 Everyone-0.053
Token the
Token ID262
Feature activation+8.16e-04

Pos logit contributions
 much+0.002
 best+0.002
 important+0.002
 Everyone+0.002
 And+0.002
Neg logit contributions
driver-0.005
paper-0.005
 unravel-0.005
 bud-0.005
ro-0.005
Token sky
Token ID6766
Feature activation+1.85e+00

Pos logit contributions
driver+0.000
paper+0.000
 unravel+0.000
ge+0.000
 bud+0.000
Neg logit contributions
 best-0.000
 much-0.000
 important-0.000
 Everyone-0.000
 many-0.000
Token playing
Token ID2712
Feature activation+1.20e+00

Pos logit contributions
 Everyone+0.037
 And+0.034
 best+0.034
 Everybody+0.033
 People+0.033
Neg logit contributions
 unravel-0.049
driver-0.049
 bud-0.049
paper-0.046
 swoop-0.045
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
driver+0.033
 unravel+0.031
paper+0.031
 bud+0.030
ro+0.029
Neg logit contributions
 much-0.023
 best-0.022
 important-0.022
 Everyone-0.022
 And-0.021
Token Benny
Token ID44275
Feature activation+1.80e+00

Pos logit contributions
 unravel+0.023
driver+0.022
paper+0.021
 bud+0.020
 rip+0.019
Neg logit contributions
ovable-0.023
 And-0.023
 best-0.023
anny-0.022
 Everyone-0.022
Token heard
Token ID2982
Feature activation+2.15e+00

Pos logit contributions
driver+0.041
paper+0.038
 unravel+0.038
 bud+0.036
ro+0.034
Neg logit contributions
 much-0.043
 best-0.043
 important-0.041
 Everyone-0.040
 And-0.040
Token a
Token ID257
Feature activation+5.48e+00

Pos logit contributions
driver+0.061
 unravel+0.060
paper+0.057
 rip+0.055
 bud+0.055
Neg logit contributions
 best-0.039
 glad-0.033
 important-0.033
ovable-0.032
 And-0.032
Token loud
Token ID7812
Feature activation+6.47e+00

Pos logit contributions
driver+0.164
paper+0.156
 unravel+0.147
ro+0.147
ge+0.145
Neg logit contributions
 much-0.102
 important-0.095
 best-0.092
 Everyone-0.089
 And-0.086
Token noise
Token ID7838
Feature activation+2.69e+00

Pos logit contributions
driver+0.162
paper+0.156
 unravel+0.147
 clip+0.146
ge+0.143
Neg logit contributions
 much-0.140
 Everyone-0.135
 And-0.134
 important-0.130
 best-0.125
Token.
Token ID13
Feature activation+1.84e+00

Pos logit contributions
driver+0.080
paper+0.077
 unravel+0.075
ro+0.071
 bud+0.071
Neg logit contributions
 much-0.047
 Everyone-0.042
 important-0.041
 And-0.041
 best-0.041
Token It
Token ID632
Feature activation+4.16e+00

Pos logit contributions
driver+0.048
paper+0.046
 unravel+0.046
 bud+0.044
ro+0.042
Neg logit contributions
 much-0.037
 best-0.036
 important-0.035
 Everyone-0.035
 And-0.034
Token was
Token ID373
Feature activation+3.77e-01

Pos logit contributions
driver+0.126
paper+0.120
ro+0.112
 clip+0.112
 unravel+0.111
Neg logit contributions
 much-0.074
 best-0.070
 important-0.070
 Everyone-0.067
 And-0.063
Token a
Token ID257
Feature activation+4.06e+00

Pos logit contributions
driver+0.013
paper+0.012
 unravel+0.012
 bud+0.012
ro+0.011
Neg logit contributions
 much-0.005
 best-0.005
 important-0.005
 Everyone-0.005
 And-0.004
Token a
Token ID257
Feature activation+1.49e+00

Pos logit contributions
 much+0.036
 best+0.035
 important+0.034
 Everyone+0.033
 And+0.033
Neg logit contributions
driver-0.074
 unravel-0.072
paper-0.072
 bud-0.069
ro-0.067
Token sudden
Token ID4802
Feature activation+1.48e+00

Pos logit contributions
driver+0.040
 unravel+0.039
paper+0.038
 bud+0.037
ro+0.035
Neg logit contributions
 much-0.028
 best-0.027
 Everyone-0.027
 And-0.026
 important-0.026
Token she
Token ID673
Feature activation+1.77e+00

Pos logit contributions
driver+0.038
paper+0.036
 unravel+0.036
 bud+0.035
 rip+0.032
Neg logit contributions
 best-0.030
 much-0.029
 important-0.029
 Everyone-0.029
 And-0.029
Token heard
Token ID2982
Feature activation+1.81e+00

Pos logit contributions
driver+0.031
paper+0.030
 unravel+0.030
 bud+0.027
ro+0.026
Neg logit contributions
 much-0.049
 Everyone-0.049
 important-0.049
 And-0.049
 best-0.049
Token a
Token ID257
Feature activation+5.20e+00

Pos logit contributions
driver+0.056
 unravel+0.056
paper+0.054
 bud+0.053
 rip+0.052
Neg logit contributions
 best-0.026
 important-0.023
 much-0.023
 And-0.023
 glad-0.022
Token loud
Token ID7812
Feature activation+6.46e+00

Pos logit contributions
driver+0.145
paper+0.139
 unravel+0.133
ro+0.128
 clip+0.124
Neg logit contributions
 much-0.103
 important-0.097
 best-0.097
 Everyone-0.094
 And-0.090
Token noise
Token ID7838
Feature activation+2.29e+00

Pos logit contributions
driver+0.167
paper+0.164
 unravel+0.152
 clip+0.149
ge+0.146
Neg logit contributions
 much-0.133
 Everyone-0.132
 And-0.128
 important-0.125
 best-0.119
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
driver+0.068
paper+0.066
 unravel+0.065
ro+0.061
 bud+0.061
Neg logit contributions
 much-0.039
 Everyone-0.035
 important-0.035
 best-0.035
 And-0.034
Token Mad
Token ID4627
Feature activation-1.96e+00

Pos logit contributions
driver+0.028
paper+0.026
 unravel+0.026
 bud+0.025
ro+0.024
Neg logit contributions
 much-0.020
 best-0.019
 Everyone-0.019
 important-0.019
 And-0.018
Tokeneline
Token ID4470
Feature activation+9.35e-01

Pos logit contributions
 much+0.034
 best+0.033
 important+0.032
 Everyone+0.032
 And+0.031
Neg logit contributions
driver-0.056
paper-0.054
 unravel-0.054
 bud-0.052
ro-0.050
Token panicked
Token ID38225
Feature activation+3.71e+00

Pos logit contributions
driver+0.021
 unravel+0.020
paper+0.020
 bud+0.019
ho+0.018
Neg logit contributions
 much-0.022
 best-0.022
 important-0.021
 Everyone-0.021
 And-0.021
Token but
Token ID475
Feature activation+2.95e-01

Pos logit contributions
driver+0.035
paper+0.034
 unravel+0.034
 bud+0.032
ro+0.031
Neg logit contributions
 much-0.030
 best-0.029
 important-0.028
 And-0.027
 Everyone-0.027
Token then
Token ID788
Feature activation+2.20e+00

Pos logit contributions
driver+0.007
 unravel+0.007
paper+0.007
 bud+0.007
ho+0.007
Neg logit contributions
 And-0.006
 Everyone-0.006
ovable-0.006
 important-0.006
 much-0.005
Token they
Token ID484
Feature activation+1.06e+00

Pos logit contributions
driver+0.054
paper+0.051
 unravel+0.050
 bud+0.050
 swoop+0.046
Neg logit contributions
 important-0.045
ovable-0.044
 And-0.043
 Everyone-0.043
 best-0.042
Token heard
Token ID2982
Feature activation+1.37e+00

Pos logit contributions
driver+0.027
paper+0.026
 unravel+0.026
ro+0.026
 bud+0.024
Neg logit contributions
 much-0.022
 best-0.022
 important-0.020
 many-0.020
 named-0.020
Token a
Token ID257
Feature activation+4.77e+00

Pos logit contributions
driver+0.041
 unravel+0.040
paper+0.038
 rip+0.037
 bud+0.036
Neg logit contributions
 best-0.021
 important-0.019
ovable-0.019
 glad-0.019
 And-0.019
Token loud
Token ID7812
Feature activation+6.46e+00

Pos logit contributions
driver+0.154
paper+0.149
ro+0.140
 unravel+0.138
ge+0.135
Neg logit contributions
 much-0.082
 important-0.075
 best-0.071
 Everyone-0.067
 lucky-0.066
Token noise
Token ID7838
Feature activation+2.13e+00

Pos logit contributions
driver+0.172
paper+0.169
 clip+0.159
 unravel+0.156
ge+0.155
Neg logit contributions
 much-0.127
 Everyone-0.125
 And-0.123
 important-0.118
 named-0.113
Token.
Token ID13
Feature activation+9.20e-02

Pos logit contributions
driver+0.064
paper+0.062
 unravel+0.061
ro+0.058
 bud+0.058
Neg logit contributions
 much-0.036
 important-0.032
 best-0.031
 Everyone-0.031
 And-0.031
Token
Token ID198
Feature activation+2.64e+00

Pos logit contributions
driver+0.002
 unravel+0.002
paper+0.002
ro+0.002
 rip+0.002
Neg logit contributions
 best-0.002
 important-0.002
ovable-0.002
 much-0.002
 glad-0.002
Token
Token ID198
Feature activation+2.78e+00

Pos logit contributions
driver+0.067
 unravel+0.064
paper+0.061
 bud+0.059
ro+0.058
Neg logit contributions
 best-0.054
 important-0.051
 much-0.048
ovable-0.048
 glad-0.046
TokenThey
Token ID2990
Feature activation+1.38e+00

Pos logit contributions
driver+0.061
 unravel+0.058
paper+0.054
 bud+0.053
ro+0.053
Neg logit contributions
 best-0.070
 much-0.065
 important-0.064
 And-0.062
ovable-0.062
Token trap
Token ID12840
Feature activation+2.02e+00

Pos logit contributions
driver+0.010
paper+0.009
 unravel+0.009
ro+0.008
pan+0.008
Neg logit contributions
 best-0.011
 much-0.011
 important-0.010
 many-0.010
 And-0.010
Token.
Token ID13
Feature activation+6.56e-01

Pos logit contributions
paper+0.063
driver+0.063
 unravel+0.061
ho+0.058
 swoop+0.058
Neg logit contributions
 best-0.030
 much-0.030
 many-0.027
 And-0.026
 important-0.026
Token Suddenly
Token ID24975
Feature activation+1.36e+00

Pos logit contributions
driver+0.018
paper+0.017
 unravel+0.017
 bud+0.016
ho+0.016
Neg logit contributions
 much-0.013
 Everyone-0.012
 best-0.012
 important-0.012
 And-0.012
Token,
Token ID11
Feature activation+7.63e-01

Pos logit contributions
driver+0.035
paper+0.033
 unravel+0.033
 bud+0.032
 rip+0.029
Neg logit contributions
 best-0.027
 important-0.027
 much-0.027
 Everyone-0.027
 And-0.026
Token a
Token ID257
Feature activation+4.55e+00

Pos logit contributions
driver+0.020
 unravel+0.019
paper+0.019
 bud+0.018
 rip+0.017
Neg logit contributions
 important-0.014
 best-0.014
 And-0.014
 much-0.014
 Everyone-0.014
Token loud
Token ID7812
Feature activation+6.45e+00

Pos logit contributions
driver+0.142
paper+0.139
 unravel+0.132
ge+0.130
ro+0.129
Neg logit contributions
 much-0.077
 best-0.071
 important-0.070
 Everyone-0.064
 lucky-0.063
Token noise
Token ID7838
Feature activation+2.33e+00

Pos logit contributions
driver+0.157
paper+0.157
ge+0.148
 unravel+0.145
 clip+0.143
Neg logit contributions
 much-0.138
 Everyone-0.135
 And-0.131
 important-0.129
 named-0.129
Token came
Token ID1625
Feature activation+3.05e+00

Pos logit contributions
driver+0.054
paper+0.052
 unravel+0.049
ro+0.048
 clip+0.048
Neg logit contributions
 much-0.053
 Everyone-0.050
 named-0.050
 important-0.049
 best-0.049
Token from
Token ID422
Feature activation-5.43e-01

Pos logit contributions
driver+0.086
paper+0.082
 unravel+0.080
ro+0.077
 clip+0.077
Neg logit contributions
 much-0.057
 best-0.055
 important-0.052
 And-0.051
 Everyone-0.050
Token inside
Token ID2641
Feature activation+1.95e+00

Pos logit contributions
 much+0.008
 best+0.007
 important+0.007
 Everyone+0.007
 And+0.007
Neg logit contributions
driver-0.018
paper-0.017
 unravel-0.017
 bud-0.016
ro-0.016
Token and
Token ID290
Feature activation+6.33e-01

Pos logit contributions
driver+0.059
paper+0.057
 unravel+0.054
 bud+0.052
 clip+0.052
Neg logit contributions
 much-0.035
 Everyone-0.032
 And-0.032
 best-0.031
 important-0.031
Token of
Token ID286
Feature activation-2.22e+00

Pos logit contributions
 much+0.012
 best+0.012
 important+0.012
 And+0.012
 Everyone+0.012
Neg logit contributions
driver-0.016
paper-0.015
 unravel-0.015
 bud-0.014
ro-0.014
Token their
Token ID511
Feature activation-2.49e+00

Pos logit contributions
 Everyone+0.036
 much+0.036
 best+0.036
 And+0.035
ovable+0.034
Neg logit contributions
driver-0.057
 unravel-0.056
paper-0.055
 bud-0.054
ho-0.051
Token feelings
Token ID7666
Feature activation-1.86e+00

Pos logit contributions
 much+0.039
 best+0.037
 Everyone+0.037
 important+0.036
 And+0.036
Neg logit contributions
driver-0.074
 unravel-0.072
paper-0.072
 bud-0.069
ro-0.067
Token.
Token ID13
Feature activation-2.75e+00

Pos logit contributions
 much+0.036
 best+0.036
 important+0.035
 Everyone+0.034
 And+0.033
Neg logit contributions
driver-0.050
paper-0.049
 unravel-0.048
 bud-0.045
ho-0.044
Token By
Token ID2750
Feature activation-2.68e+00

Pos logit contributions
 best+0.048
 important+0.046
ovable+0.044
 adventurous+0.043
 much+0.042
Neg logit contributions
driver-0.069
 unravel-0.068
ro-0.068
ho-0.065
paper-0.062
Token being
Token ID852
Feature activation-6.64e+00

Pos logit contributions
 much+0.047
 best+0.046
 Everyone+0.045
 important+0.044
 And+0.044
Neg logit contributions
driver-0.074
 unravel-0.073
paper-0.072
 bud-0.070
ro-0.066
Token thoughtful
Token ID22677
Feature activation-4.92e-01

Pos logit contributions
 Everyone+0.113
 And+0.104
 People+0.098
 Everybody+0.098
ovable+0.097
Neg logit contributions
 unravel-0.177
driver-0.172
paper-0.167
 bud-0.166
 swoop-0.159
Token and
Token ID290
Feature activation-4.01e+00

Pos logit contributions
 best+0.009
 much+0.009
 important+0.009
 Everyone+0.009
 And+0.009
Neg logit contributions
driver-0.013
paper-0.013
 unravel-0.013
 bud-0.012
 clip-0.012
Token kind
Token ID1611
Feature activation-1.18e+00

Pos logit contributions
 much+0.066
 best+0.065
 Everyone+0.063
 important+0.062
 And+0.061
Neg logit contributions
driver-0.117
 unravel-0.113
paper-0.112
 bud-0.108
ro-0.104
Token we
Token ID356
Feature activation-2.07e+00

Pos logit contributions
 best+0.022
 much+0.022
 Everyone+0.021
 important+0.021
 And+0.021
Neg logit contributions
driver-0.033
paper-0.031
 unravel-0.031
 bud-0.029
 clip-0.028
Token can
Token ID460
Feature activation+1.56e+00

Pos logit contributions
 much+0.041
 best+0.040
 important+0.039
 Everyone+0.039
 And+0.038
Neg logit contributions
driver-0.055
paper-0.053
 unravel-0.052
 bud-0.049
ro-0.048
Token the
Token ID262
Feature activation-1.46e+00

Pos logit contributions
 best+0.027
 much+0.027
 important+0.026
ovable+0.025
 And+0.024
Neg logit contributions
driver-0.042
 unravel-0.042
paper-0.042
 bud-0.039
 rip-0.039
Token sunrise
Token ID40048
Feature activation-2.12e+00

Pos logit contributions
 much+0.027
 best+0.026
 And+0.025
 Everyone+0.025
 important+0.025
Neg logit contributions
driver-0.038
 unravel-0.038
paper-0.038
 bud-0.037
 rip-0.035
Token together
Token ID1978
Feature activation-1.16e+00

Pos logit contributions
 much+0.042
 best+0.041
 important+0.040
 Everyone+0.039
 And+0.039
Neg logit contributions
driver-0.056
paper-0.053
 unravel-0.053
 bud-0.051
ro-0.049
Token.
Token ID13
Feature activation-2.45e+00

Pos logit contributions
 much+0.022
 best+0.021
 important+0.020
 Everyone+0.020
 And+0.020
Neg logit contributions
driver-0.034
paper-0.032
 unravel-0.031
 bud-0.030
ro-0.029
Token They
Token ID1119
Feature activation-2.88e+00

Pos logit contributions
 important+0.050
 best+0.049
ovable+0.048
 adventurous+0.047
 lucky+0.047
Neg logit contributions
driver-0.060
 unravel-0.054
paper-0.051
ro-0.048
ho-0.048
Token became
Token ID2627
Feature activation-6.50e+00

Pos logit contributions
 Everyone+0.063
 important+0.061
 much+0.060
 And+0.059
 best+0.058
Neg logit contributions
 unravel-0.067
 bud-0.064
paper-0.064
ho-0.064
driver-0.062
Token the
Token ID262
Feature activation-3.61e+00

Pos logit contributions
 Everyone+0.110
ovable+0.109
orry+0.107
 Everybody+0.102
anna+0.102
Neg logit contributions
 unravel-0.155
paper-0.148
 bud-0.138
 swoop-0.137
 clip-0.127
Token best
Token ID1266
Feature activation-1.49e+00

Pos logit contributions
 Everyone+0.029
 much+0.024
 And+0.024
 From+0.019
 Everybody+0.019
Neg logit contributions
 unravel-0.131
 bud-0.131
paper-0.128
 clip-0.125
driver-0.122
Token of
Token ID286
Feature activation-4.20e+00

Pos logit contributions
 much+0.022
 Everyone+0.021
 important+0.021
 best+0.020
 And+0.020
Neg logit contributions
driver-0.043
 bud-0.042
 unravel-0.042
paper-0.041
pan-0.039
Token friends
Token ID2460
Feature activation-4.65e+00

Pos logit contributions
 Everyone+0.046
 And+0.044
 much+0.043
 People+0.042
 Everybody+0.039
Neg logit contributions
driver-0.131
 bud-0.131
 unravel-0.128
 swoop-0.123
paper-0.122
Token,
Token ID11
Feature activation-2.08e+00

Pos logit contributions
 important+0.123
 Everyone+0.123
 much+0.119
 best+0.115
 People+0.114
Neg logit contributions
driver-0.081
paper-0.073
 unravel-0.069
 bud-0.069
ho-0.067
Token and
Token ID290
Feature activation-1.40e+00

Pos logit contributions
 much+0.058
 best+0.056
 important+0.055
 Everyone+0.054
 And+0.052
Neg logit contributions
driver-0.057
 unravel-0.056
paper-0.054
 bud-0.053
ro-0.051
Token the
Token ID262
Feature activation+4.51e-02

Pos logit contributions
 much+0.026
 important+0.024
 best+0.024
ovable+0.023
 Everyone+0.023
Neg logit contributions
 unravel-0.038
driver-0.036
 bud-0.035
paper-0.035
ho-0.034
Token sheep
Token ID15900
Feature activation-3.25e+00

Pos logit contributions
driver+0.001
 unravel+0.001
paper+0.001
 bud+0.001
ro+0.001
Neg logit contributions
 much-0.001
 best-0.001
 Everyone-0.001
 important-0.001
 And-0.001
Token continued
Token ID3767
Feature activation-2.29e+00

Pos logit contributions
 much+0.062
 best+0.060
 important+0.060
 Everyone+0.057
 And+0.056
Neg logit contributions
driver-0.087
 unravel-0.084
paper-0.083
 bud-0.080
ho-0.077
Token to
Token ID284
Feature activation-7.74e-01

Pos logit contributions
 much+0.048
 best+0.048
 important+0.047
 Everyone+0.047
 And+0.046
Neg logit contributions
driver-0.056
paper-0.054
 unravel-0.054
 bud-0.050
ho-0.048
Token be
Token ID307
Feature activation-6.49e+00

Pos logit contributions
 much+0.015
 best+0.015
 Everyone+0.014
 important+0.014
 And+0.014
Neg logit contributions
driver-0.021
paper-0.020
 unravel-0.019
 bud-0.019
ro-0.018
Token friends
Token ID2460
Feature activation-4.84e+00

Pos logit contributions
 Everyone+0.092
 And+0.083
ovable+0.082
 much+0.077
 Everybody+0.077
Neg logit contributions
 unravel-0.187
paper-0.185
driver-0.180
 swoop-0.179
 bud-0.177
Token,
Token ID11
Feature activation-1.85e+00

Pos logit contributions
 Everyone+0.112
 important+0.112
 much+0.110
 best+0.107
ovable+0.104
Neg logit contributions
driver-0.098
paper-0.095
 unravel-0.094
 bud-0.089
ho-0.086
Token sharing
Token ID7373
Feature activation-1.64e+00

Pos logit contributions
 Everyone+0.034
 much+0.034
 important+0.034
 best+0.033
ovable+0.031
Neg logit contributions
 unravel-0.048
driver-0.048
paper-0.047
 bud-0.045
ho-0.044
Token y
Token ID331
Feature activation+1.52e-01

Pos logit contributions
 much+0.028
 best+0.027
 important+0.026
 Everyone+0.026
 And+0.026
Neg logit contributions
driver-0.048
paper-0.046
 unravel-0.046
 bud-0.044
ro-0.042
Tokenummy
Token ID13513
Feature activation-1.70e+00

Pos logit contributions
driver+0.004
paper+0.004
 unravel+0.004
 bud+0.004
ro+0.004
Neg logit contributions
 much-0.003
 best-0.003
 important-0.003
 Everyone-0.003
 And-0.002
Token mighty
Token ID18680
Feature activation+1.21e+00

Pos logit contributions
 much+0.023
 best+0.022
 Everyone+0.022
 important+0.022
 And+0.022
Neg logit contributions
driver-0.038
 unravel-0.037
paper-0.036
 bud-0.035
ro-0.034
Token blue
Token ID4171
Feature activation-1.68e-01

Pos logit contributions
driver+0.031
paper+0.029
 unravel+0.029
 bud+0.028
ro+0.026
Neg logit contributions
 much-0.026
 best-0.025
 important-0.025
 Everyone-0.024
 And-0.024
Tokenberry
Token ID8396
Feature activation-2.38e+00

Pos logit contributions
 best+0.004
 much+0.004
 Everyone+0.004
 And+0.004
 important+0.004
Neg logit contributions
driver-0.004
paper-0.004
 unravel-0.004
 bud-0.003
ge-0.003
Token would
Token ID561
Feature activation+1.02e+00

Pos logit contributions
 much+0.046
 best+0.045
 important+0.044
 Everyone+0.043
 And+0.042
Neg logit contributions
driver-0.064
paper-0.061
 unravel-0.061
 bud-0.058
ro-0.056
Token always
Token ID1464
Feature activation+3.14e-01

Pos logit contributions
driver+0.033
paper+0.032
 unravel+0.031
 bud+0.030
ro+0.029
Neg logit contributions
 much-0.015
 best-0.014
 Everyone-0.013
 important-0.013
 And-0.013
Token be
Token ID307
Feature activation-6.41e+00

Pos logit contributions
driver+0.008
paper+0.008
 unravel+0.008
 bud+0.007
ro+0.007
Neg logit contributions
 best-0.006
 much-0.006
 important-0.006
 Everyone-0.006
 And-0.006
Token united
Token ID16503
Feature activation-5.70e-01

Pos logit contributions
 Everyone+0.108
 And+0.101
 much+0.095
 People+0.093
ovable+0.092
Neg logit contributions
 unravel-0.174
paper-0.171
driver-0.169
 swoop-0.161
 bud-0.161
Token,
Token ID11
Feature activation-1.62e+00

Pos logit contributions
 much+0.010
 best+0.010
 important+0.010
 Everyone+0.009
 And+0.009
Neg logit contributions
driver-0.016
paper-0.016
 unravel-0.016
 bud-0.015
ro-0.014
Token no
Token ID645
Feature activation-1.64e+00

Pos logit contributions
 Everyone+0.032
 much+0.032
 important+0.031
 best+0.031
ovable+0.030
Neg logit contributions
driver-0.040
 unravel-0.039
paper-0.039
 bud-0.036
ro-0.035
Token matter
Token ID2300
Feature activation-2.64e+00

Pos logit contributions
 Everyone+0.041
 And+0.040
ovable+0.040
 much+0.038
 glad+0.037
Neg logit contributions
 unravel-0.031
 bud-0.029
paper-0.028
driver-0.028
ro-0.028
Token the
Token ID262
Feature activation-1.83e+00

Pos logit contributions
 Everyone+0.062
 best+0.062
ovable+0.061
 important+0.060
 And+0.060
Neg logit contributions
driver-0.050
ge-0.048
 bud-0.047
 unravel-0.046
paper-0.043
Tokenittens
Token ID34978
Feature activation-2.18e+00

Pos logit contributions
 much+0.004
 best+0.004
 Everyone+0.004
 important+0.004
 And+0.004
Neg logit contributions
driver-0.008
 unravel-0.008
paper-0.008
 bud-0.007
ro-0.007
Token on
Token ID319
Feature activation-4.57e-01

Pos logit contributions
 much+0.039
 best+0.039
 important+0.036
 Everyone+0.036
 And+0.035
Neg logit contributions
driver-0.061
 unravel-0.059
paper-0.058
 bud-0.057
ro-0.054
Token,
Token ID11
Feature activation-6.14e-01

Pos logit contributions
 much+0.007
 best+0.007
 important+0.007
 Everyone+0.007
 And+0.007
Neg logit contributions
driver-0.014
paper-0.013
 unravel-0.013
 bud-0.013
ro-0.012
Token and
Token ID290
Feature activation-1.68e+00

Pos logit contributions
 much+0.013
 best+0.012
 important+0.012
 Everyone+0.012
 And+0.012
Neg logit contributions
driver-0.016
paper-0.015
 unravel-0.015
 bud-0.014
ro-0.014
Token enjoyed
Token ID8359
Feature activation-3.65e+00

Pos logit contributions
 much+0.031
 best+0.031
 important+0.030
 Everyone+0.029
 And+0.029
Neg logit contributions
driver-0.045
 unravel-0.044
paper-0.044
 bud-0.041
ro-0.041
Token being
Token ID852
Feature activation-6.38e+00

Pos logit contributions
 much+0.057
 best+0.057
 important+0.055
 Everyone+0.055
 And+0.053
Neg logit contributions
driver-0.107
paper-0.105
 unravel-0.104
 bud-0.098
ro-0.096
Token warm
Token ID5814
Feature activation-2.11e+00

Pos logit contributions
 Everyone+0.107
 And+0.103
 much+0.097
 best+0.096
 important+0.095
Neg logit contributions
paper-0.167
 unravel-0.166
driver-0.165
 swoop-0.152
ro-0.151
Token and
Token ID290
Feature activation-3.95e+00

Pos logit contributions
 best+0.041
 much+0.040
 important+0.038
 And+0.037
 Everyone+0.036
Neg logit contributions
driver-0.059
 unravel-0.056
paper-0.055
 bud-0.053
 clip-0.052
Token snug
Token ID47032
Feature activation-4.20e-01

Pos logit contributions
 much+0.058
 Everyone+0.057
 best+0.057
 And+0.055
 important+0.054
Neg logit contributions
driver-0.119
 unravel-0.117
paper-0.117
 bud-0.111
ro-0.108
Token all
Token ID477
Feature activation-3.45e+00

Pos logit contributions
 best+0.008
 much+0.007
 important+0.007
 And+0.007
 Everyone+0.007
Neg logit contributions
driver-0.012
 unravel-0.012
paper-0.012
 bud-0.011
 clip-0.011
Token winter
Token ID7374
Feature activation-8.61e-01

Pos logit contributions
 Everyone+0.046
 And+0.045
ovable+0.044
 much+0.042
 best+0.041
Neg logit contributions
 unravel-0.100
paper-0.100
driver-0.098
ro-0.095
 bud-0.093
Token flying
Token ID7348
Feature activation-6.09e-01

Pos logit contributions
 much+0.007
 best+0.006
 important+0.006
 Everyone+0.006
 And+0.006
Neg logit contributions
driver-0.014
 unravel-0.013
paper-0.013
 bud-0.013
ro-0.012
Token every
Token ID790
Feature activation-1.37e+00

Pos logit contributions
 much+0.011
 best+0.011
 important+0.010
 Everyone+0.010
 And+0.010
Neg logit contributions
driver-0.018
paper-0.017
 unravel-0.017
 bud-0.016
ro-0.015
Token day
Token ID1110
Feature activation-1.45e+00

Pos logit contributions
 much+0.009
 best+0.008
 Everyone+0.007
 important+0.007
 And+0.007
Neg logit contributions
driver-0.054
 unravel-0.053
paper-0.053
 bud-0.051
ro-0.050
Token and
Token ID290
Feature activation-2.49e+00

Pos logit contributions
 much+0.030
 best+0.029
 important+0.028
 Everyone+0.028
 And+0.027
Neg logit contributions
driver-0.037
paper-0.036
 unravel-0.036
 bud-0.034
ro-0.033
Token they
Token ID484
Feature activation-2.95e+00

Pos logit contributions
 much+0.045
 best+0.044
 important+0.043
 Everyone+0.042
 And+0.041
Neg logit contributions
driver-0.070
paper-0.067
 unravel-0.067
 bud-0.064
ro-0.062
Token became
Token ID2627
Feature activation-6.36e+00

Pos logit contributions
 much+0.056
 best+0.055
 important+0.053
 Everyone+0.052
 And+0.051
Neg logit contributions
driver-0.081
paper-0.077
 unravel-0.077
 bud-0.073
ro-0.071
Token very
Token ID845
Feature activation-5.31e+00

Pos logit contributions
ovable+0.084
 Everyone+0.083
 And+0.079
orry+0.076
 People+0.073
Neg logit contributions
 unravel-0.185
paper-0.180
driver-0.170
 bud-0.169
 swoop-0.164
Token good
Token ID922
Feature activation-2.07e+00

Pos logit contributions
 Everyone+0.083
 And+0.081
 best+0.074
ovable+0.074
 named+0.073
Neg logit contributions
 unravel-0.150
driver-0.147
paper-0.147
 bud-0.145
 rip-0.137
Token at
Token ID379
Feature activation-1.76e+00

Pos logit contributions
 much+0.015
 best+0.012
 Everyone+0.012
 important+0.012
 And+0.012
Neg logit contributions
driver-0.080
paper-0.078
 unravel-0.077
 bud-0.076
ro-0.073
Token it
Token ID340
Feature activation-2.62e+00

Pos logit contributions
 much+0.029
 best+0.029
 important+0.027
 Everyone+0.027
 And+0.026
Neg logit contributions
driver-0.053
paper-0.051
 unravel-0.050
 bud-0.048
ro-0.047
Token.
Token ID13
Feature activation-2.86e+00

Pos logit contributions
 much+0.054
 best+0.053
 Everyone+0.052
 important+0.052
 And+0.050
Neg logit contributions
driver-0.065
 unravel-0.062
paper-0.062
 bud-0.059
ro-0.056
Token extra
Token ID3131
Feature activation+1.07e+00

Pos logit contributions
driver+0.069
 unravel+0.067
 bud+0.066
paper+0.065
 rip+0.061
Neg logit contributions
 much-0.051
 Everyone-0.050
 best-0.049
 And-0.048
 important-0.045
Token scoop
Token ID23529
Feature activation+1.22e+00

Pos logit contributions
driver+0.027
 unravel+0.026
paper+0.026
 bud+0.025
ro+0.024
Neg logit contributions
 much-0.022
 best-0.022
 Everyone-0.021
 important-0.021
 And-0.021
Token of
Token ID286
Feature activation-1.44e+00

Pos logit contributions
driver+0.039
 unravel+0.038
paper+0.038
 bud+0.036
ge+0.035
Neg logit contributions
 much-0.018
 best-0.017
 Everyone-0.016
 important-0.016
 And-0.016
Token wheat
Token ID17135
Feature activation-3.29e-01

Pos logit contributions
 much+0.029
 best+0.028
 Everyone+0.027
 important+0.027
 And+0.027
Neg logit contributions
driver-0.037
 unravel-0.036
paper-0.035
 bud-0.034
 clip-0.032
Token for
Token ID329
Feature activation-2.61e+00

Pos logit contributions
 best+0.006
 much+0.006
 important+0.006
 And+0.006
 Everyone+0.005
Neg logit contributions
driver-0.009
 unravel-0.009
paper-0.009
 bud-0.008
ge-0.008
Token being
Token ID852
Feature activation-6.34e+00

Pos logit contributions
 best+0.042
 much+0.040
 important+0.040
 And+0.040
 Everyone+0.039
Neg logit contributions
driver-0.074
 unravel-0.073
paper-0.072
 bud-0.071
 rip-0.066
Token so
Token ID523
Feature activation-3.09e+00

Pos logit contributions
 Everyone+0.097
 And+0.096
 best+0.092
 much+0.092
 important+0.088
Neg logit contributions
driver-0.181
 unravel-0.180
paper-0.178
 bud-0.168
 swoop-0.163
Token eager
Token ID11069
Feature activation+1.10e+00

Pos logit contributions
 best+0.048
 And+0.048
 much+0.047
 Everyone+0.047
 important+0.045
Neg logit contributions
driver-0.089
paper-0.088
 unravel-0.087
 bud-0.084
ho-0.082
Token!
Token ID0
Feature activation-1.92e+00

Pos logit contributions
driver+0.027
 unravel+0.026
paper+0.026
 bud+0.024
 clip+0.024
Neg logit contributions
 much-0.024
 best-0.024
 important-0.024
 Everyone-0.023
 And-0.022
Token<|endoftext|>
Token ID50256
Feature activation+3.59e-01

Pos logit contributions
 important+0.041
 best+0.041
ovable+0.035
 much+0.034
 adventurous+0.034
Neg logit contributions
driver-0.049
 unravel-0.045
ro-0.045
paper-0.044
ho-0.041
TokenM
Token ID44
Feature activation-3.50e+00

Pos logit contributions
driver+0.010
 unravel+0.010
paper+0.010
 bud+0.009
ro+0.009
Neg logit contributions
 best-0.006
 much-0.006
 important-0.006
 Everyone-0.006
 And-0.006
Token fun
Token ID1257
Feature activation-2.02e+00

Pos logit contributions
 Everyone+0.063
ovable+0.060
 Everybody+0.058
 Summer+0.054
 And+0.054
Neg logit contributions
 unravel-0.064
 bud-0.064
driver-0.063
paper-0.063
 rip-0.061
Token on
Token ID319
Feature activation-3.28e+00

Pos logit contributions
 Everyone+0.043
 much+0.042
ovable+0.042
 best+0.041
 important+0.041
Neg logit contributions
driver-0.046
paper-0.044
 unravel-0.043
 bud-0.041
ro-0.040
Token the
Token ID262
Feature activation-2.14e+00

Pos logit contributions
ovable+0.061
 Everyone+0.059
 And+0.056
 important+0.055
 best+0.055
Neg logit contributions
 unravel-0.084
paper-0.082
driver-0.080
 bud-0.075
 rip-0.074
Token farm
Token ID5318
Feature activation-2.31e+00

Pos logit contributions
 much+0.039
 Everyone+0.039
 best+0.037
 And+0.036
 important+0.035
Neg logit contributions
driver-0.057
paper-0.056
 unravel-0.055
 bud-0.052
 rip-0.052
Token and
Token ID290
Feature activation-3.00e+00

Pos logit contributions
 much+0.047
 best+0.046
 Everyone+0.046
 important+0.046
 And+0.044
Neg logit contributions
driver-0.057
paper-0.055
 unravel-0.055
 bud-0.052
ro-0.050
Token became
Token ID2627
Feature activation-6.33e+00

Pos logit contributions
 Everyone+0.056
 much+0.054
 best+0.053
ovable+0.053
 important+0.052
Neg logit contributions
paper-0.078
 unravel-0.077
driver-0.076
ho-0.073
 bud-0.070
Token best
Token ID1266
Feature activation-3.35e+00

Pos logit contributions
 Everyone+0.103
ovable+0.101
 Everybody+0.095
orry+0.094
 People+0.092
Neg logit contributions
 unravel-0.165
paper-0.158
 bud-0.151
 swoop-0.148
 clip-0.142
Token friends
Token ID2460
Feature activation-4.59e+00

Pos logit contributions
 much+0.027
 important+0.026
 Everyone+0.026
 best+0.025
 And+0.024
Neg logit contributions
driver-0.120
paper-0.113
 bud-0.112
 unravel-0.111
ro-0.106
Token.
Token ID13
Feature activation-3.28e+00

Pos logit contributions
 important+0.125
 Everyone+0.123
 best+0.121
 People+0.119
ovable+0.116
Neg logit contributions
driver-0.075
paper-0.066
 unravel-0.064
 bud-0.062
ho-0.057
Token<|endoftext|>
Token ID50256
Feature activation-3.90e-01

Pos logit contributions
ovable+0.083
 adventurous+0.076
 important+0.076
 best+0.075
affles+0.074
Neg logit contributions
 unravel-0.054
driver-0.052
paper-0.048
ASH-0.042
ho-0.042
TokenOnce
Token ID7454
Feature activation-2.28e+00

Pos logit contributions
 important+0.008
 best+0.008
 much+0.008
ovable+0.007
 glad+0.007
Neg logit contributions
driver-0.009
 unravel-0.009
paper-0.009
ro-0.009
 bud-0.008
Token she
Token ID673
Feature activation-1.24e+00

Pos logit contributions
 much+0.018
 best+0.018
 important+0.017
 Everyone+0.017
 And+0.017
Neg logit contributions
driver-0.021
 unravel-0.020
paper-0.020
 bud-0.019
ro-0.018
Token sings
Token ID33041
Feature activation-2.72e+00

Pos logit contributions
 much+0.024
 best+0.023
 important+0.023
 Everyone+0.023
 And+0.022
Neg logit contributions
driver-0.032
 unravel-0.031
paper-0.031
 bud-0.029
ro-0.029
Token a
Token ID257
Feature activation+6.54e-01

Pos logit contributions
 much+0.050
 best+0.050
 important+0.048
 Everyone+0.047
 And+0.046
Neg logit contributions
driver-0.073
 unravel-0.070
paper-0.070
 bud-0.067
ro-0.065
Token song
Token ID3496
Feature activation-1.40e+00

Pos logit contributions
driver+0.017
 unravel+0.017
paper+0.016
 bud+0.016
ro+0.015
Neg logit contributions
 much-0.012
 best-0.012
 Everyone-0.012
 important-0.012
 And-0.012
Token about
Token ID546
Feature activation-3.06e+00

Pos logit contributions
 much+0.025
 best+0.025
 important+0.024
 Everyone+0.024
 And+0.023
Neg logit contributions
driver-0.040
paper-0.038
 unravel-0.038
 bud-0.036
ro-0.035
Token being
Token ID852
Feature activation-6.30e+00

Pos logit contributions
 much+0.051
 best+0.050
 important+0.049
 Everyone+0.049
 And+0.047
Neg logit contributions
driver-0.088
 unravel-0.085
paper-0.085
 bud-0.081
ro-0.079
Token home
Token ID1363
Feature activation-2.05e+00

Pos logit contributions
 Everyone+0.106
 And+0.104
 much+0.103
 best+0.101
 important+0.098
Neg logit contributions
 unravel-0.166
paper-0.161
driver-0.160
 bud-0.153
ro-0.152
Token.
Token ID13
Feature activation-1.78e+00

Pos logit contributions
 Everyone+0.045
 best+0.044
 much+0.044
 important+0.043
 And+0.043
Neg logit contributions
driver-0.046
paper-0.045
 unravel-0.044
 bud-0.043
ro-0.042
Token Fl
Token ID1610
Feature activation-2.01e+00

Pos logit contributions
 important+0.041
 best+0.041
 much+0.038
ovable+0.038
 lucky+0.037
Neg logit contributions
driver-0.039
 unravel-0.037
ro-0.036
paper-0.036
 bud-0.033
Tokenuffy
Token ID15352
Feature activation-1.33e+00

Pos logit contributions
 much+0.039
 best+0.038
 important+0.037
 Everyone+0.037
 And+0.035
Neg logit contributions
driver-0.054
paper-0.052
 unravel-0.052
 bud-0.050
ro-0.048
Token listens
Token ID35019
Feature activation+1.25e-01

Pos logit contributions
 much+0.030
 best+0.029
 important+0.029
 Everyone+0.028
 And+0.027
Neg logit contributions
driver-0.031
 unravel-0.030
paper-0.030
 bud-0.028
ro-0.027
Token talked
Token ID6619
Feature activation-7.88e-01

Pos logit contributions
 much+0.046
 best+0.045
 important+0.044
 Everyone+0.044
 And+0.043
Neg logit contributions
driver-0.060
 unravel-0.058
paper-0.058
 bud-0.055
ho-0.053
Token,
Token ID11
Feature activation-4.21e-01

Pos logit contributions
 much+0.015
 best+0.015
 important+0.014
 Everyone+0.014
 And+0.014
Neg logit contributions
driver-0.021
paper-0.020
 unravel-0.020
 bud-0.019
ro-0.019
Token laughed
Token ID13818
Feature activation-1.34e+00

Pos logit contributions
 much+0.008
 best+0.008
 important+0.008
 Everyone+0.008
 And+0.008
Neg logit contributions
driver-0.011
 unravel-0.010
paper-0.010
 bud-0.010
ro-0.009
Token and
Token ID290
Feature activation-3.11e+00

Pos logit contributions
 much+0.029
 best+0.029
 important+0.028
 Everyone+0.028
 And+0.027
Neg logit contributions
driver-0.032
paper-0.031
 unravel-0.031
 bud-0.029
ro-0.028
Token enjoyed
Token ID8359
Feature activation-3.95e+00

Pos logit contributions
 important+0.067
 best+0.067
 much+0.067
 Everyone+0.065
ovable+0.064
Neg logit contributions
 unravel-0.070
paper-0.068
driver-0.068
 bud-0.063
ho-0.063
Token being
Token ID852
Feature activation-6.30e+00

Pos logit contributions
 much+0.056
 best+0.055
 important+0.054
 Everyone+0.054
 And+0.052
Neg logit contributions
driver-0.120
paper-0.119
 unravel-0.118
 bud-0.112
ro-0.109
Token together
Token ID1978
Feature activation-2.42e+00

Pos logit contributions
 Everyone+0.107
 And+0.105
 much+0.097
 important+0.096
 People+0.095
Neg logit contributions
 unravel-0.163
paper-0.162
driver-0.155
 swoop-0.149
ro-0.146
Token.
Token ID13
Feature activation-2.88e+00

Pos logit contributions
 much+0.048
 best+0.047
 important+0.045
 Everyone+0.044
 And+0.044
Neg logit contributions
driver-0.065
paper-0.062
 unravel-0.062
 bud-0.059
ro-0.057
Token<|endoftext|>
Token ID50256
Feature activation-2.83e-01

Pos logit contributions
 best+0.063
ovable+0.063
 important+0.063
 adventurous+0.059
 named+0.056
Neg logit contributions
 unravel-0.060
driver-0.057
ro-0.054
paper-0.053
Mine-0.052
TokenS
Token ID50
Feature activation-3.90e+00

Pos logit contributions
 important+0.006
 much+0.006
 best+0.006
 Everyone+0.005
 glad+0.005
Neg logit contributions
driver-0.007
 unravel-0.007
paper-0.007
ro-0.006
 bud-0.006
Tokenally
Token ID453
Feature activation-2.41e+00

Pos logit contributions
 much+0.073
 best+0.071
 important+0.069
 Everyone+0.068
 And+0.067
Neg logit contributions
driver-0.107
paper-0.103
 unravel-0.103
 bud-0.098
ro-0.095
Token him
Token ID683
Feature activation-1.75e+00

Pos logit contributions
 much+0.031
 best+0.030
 important+0.030
 Everyone+0.030
 And+0.029
Neg logit contributions
 unravel-0.045
driver-0.044
paper-0.044
 bud-0.042
 rip-0.040
Token every
Token ID790
Feature activation-9.18e-01

Pos logit contributions
 Everyone+0.040
 much+0.040
 best+0.039
 important+0.039
 And+0.037
Neg logit contributions
driver-0.038
 unravel-0.037
paper-0.037
 bud-0.035
ho-0.034
Token day
Token ID1110
Feature activation-1.21e+00

Pos logit contributions
 much+0.009
 best+0.008
 Everyone+0.008
 important+0.008
 And+0.008
Neg logit contributions
driver-0.033
 unravel-0.032
paper-0.032
 bud-0.031
ro-0.031
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
 much+0.026
 best+0.026
 important+0.025
 Everyone+0.025
 And+0.024
Neg logit contributions
driver-0.030
paper-0.028
 unravel-0.028
 bud-0.027
ro-0.026
Token they
Token ID484
Feature activation-2.31e+00

Pos logit contributions
 much+0.024
 best+0.023
 important+0.022
 Everyone+0.022
 And+0.022
Neg logit contributions
driver-0.028
paper-0.027
 unravel-0.027
 bud-0.025
ro-0.024
Token became
Token ID2627
Feature activation-6.29e+00

Pos logit contributions
 much+0.043
 best+0.042
 important+0.042
 Everyone+0.041
 And+0.041
Neg logit contributions
driver-0.062
 unravel-0.061
paper-0.061
 bud-0.057
ho-0.056
Token best
Token ID1266
Feature activation-3.18e+00

Pos logit contributions
 Everyone+0.089
orry+0.086
ovable+0.084
 And+0.083
 Everybody+0.082
Neg logit contributions
 unravel-0.175
paper-0.175
 swoop-0.161
 bud-0.160
driver-0.158
Token friends
Token ID2460
Feature activation-4.23e+00

Pos logit contributions
 much+0.025
 important+0.023
 Everyone+0.023
 best+0.022
 And+0.021
Neg logit contributions
driver-0.117
paper-0.112
 bud-0.111
 unravel-0.109
ro-0.106
Token.
Token ID13
Feature activation-2.07e+00

Pos logit contributions
 important+0.100
 Everyone+0.100
 best+0.100
 much+0.097
 People+0.092
Neg logit contributions
driver-0.089
paper-0.084
 unravel-0.082
 bud-0.079
ho-0.076
Token One
Token ID1881
Feature activation-1.41e+00

Pos logit contributions
 best+0.046
 important+0.042
ovable+0.041
 lucky+0.041
 glad+0.039
Neg logit contributions
driver-0.048
 unravel-0.047
ro-0.046
paper-0.044
 rip-0.041
Token day
Token ID1110
Feature activation+7.69e-01

Pos logit contributions
ooks+0.023
 Everyone+0.023
ovable+0.023
 Everybody+0.021
 People+0.020
Neg logit contributions
ro-0.036
ho-0.035
 swoop-0.032
 ring-0.032
 bite-0.032
Token fun
Token ID1257
Feature activation-6.76e-01

Pos logit contributions
 best+0.055
 important+0.053
 much+0.053
 And+0.052
 Everyone+0.052
Neg logit contributions
driver-0.141
 unravel-0.140
paper-0.140
 bud-0.131
 rip-0.127
Token all
Token ID477
Feature activation-2.94e+00

Pos logit contributions
 much+0.013
 best+0.013
 important+0.012
 Everyone+0.012
 And+0.012
Neg logit contributions
driver-0.019
 unravel-0.018
paper-0.018
 bud-0.017
ho-0.017
Token day
Token ID1110
Feature activation-1.36e+00

Pos logit contributions
ovable+0.044
 Everyone+0.043
 best+0.042
 much+0.042
 And+0.041
Neg logit contributions
 unravel-0.085
driver-0.084
paper-0.083
 bud-0.080
 rip-0.077
Token.
Token ID13
Feature activation-2.16e+00

Pos logit contributions
 much+0.026
 best+0.025
 important+0.024
 Everyone+0.024
 And+0.023
Neg logit contributions
driver-0.037
paper-0.036
 unravel-0.036
 bud-0.034
ro-0.033
Token They
Token ID1119
Feature activation-1.88e+00

Pos logit contributions
 best+0.046
ovable+0.045
 important+0.040
 much+0.039
 lucky+0.038
Neg logit contributions
driver-0.049
 unravel-0.048
ro-0.045
 rip-0.045
paper-0.043
Token became
Token ID2627
Feature activation-6.29e+00

Pos logit contributions
 much+0.036
 best+0.036
 important+0.035
 Everyone+0.035
 And+0.034
Neg logit contributions
driver-0.050
 unravel-0.049
paper-0.048
 bud-0.046
ro-0.044
Token best
Token ID1266
Feature activation-3.03e+00

Pos logit contributions
ovable+0.109
 Everyone+0.106
anna+0.102
orry+0.101
 Everybody+0.100
Neg logit contributions
 unravel-0.158
paper-0.146
 bud-0.143
 swoop-0.137
 clip-0.132
Token friends
Token ID2460
Feature activation-4.26e+00

Pos logit contributions
 important+0.027
 much+0.027
 Everyone+0.024
 best+0.023
ovable+0.022
Neg logit contributions
driver-0.107
 bud-0.102
paper-0.101
 unravel-0.101
ro-0.096
Token and
Token ID290
Feature activation-3.30e+00

Pos logit contributions
 important+0.122
 Everyone+0.119
 best+0.117
 People+0.113
ovable+0.113
Neg logit contributions
driver-0.065
paper-0.057
 unravel-0.056
 bud-0.054
ro-0.049
Token shared
Token ID4888
Feature activation-3.13e+00

Pos logit contributions
ovable+0.076
 much+0.071
 important+0.070
 Everyone+0.070
 best+0.069
Neg logit contributions
ho-0.065
paper-0.065
driver-0.065
 unravel-0.064
 bud-0.061
Token many
Token ID867
Feature activation-3.40e+00

Pos logit contributions
ovable+0.048
 best+0.045
 important+0.045
 much+0.043
 Everyone+0.043
Neg logit contributions
 unravel-0.090
paper-0.089
driver-0.088
 bud-0.083
 rip-0.083
Token Elephant
Token ID42651
Feature activation-3.31e+00

Pos logit contributions
 much+0.036
 important+0.036
 Everyone+0.035
 best+0.035
ovable+0.034
Neg logit contributions
driver-0.051
paper-0.050
 unravel-0.050
 bud-0.046
ho-0.046
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
 much+0.062
 best+0.061
 important+0.059
 Everyone+0.058
 And+0.057
Neg logit contributions
driver-0.091
paper-0.087
 unravel-0.087
 bud-0.083
ro-0.080
Token and
Token ID290
Feature activation-1.33e+00

Pos logit contributions
 much+0.038
 important+0.037
 Everyone+0.037
 best+0.036
ovable+0.035
Neg logit contributions
driver-0.044
paper-0.043
 unravel-0.042
 bud-0.039
ho-0.039
Token the
Token ID262
Feature activation+5.50e-02

Pos logit contributions
 much+0.024
 important+0.023
 best+0.022
ovable+0.022
 Everyone+0.022
Neg logit contributions
driver-0.036
 unravel-0.036
paper-0.036
 bud-0.034
ho-0.033
Token cat
Token ID3797
Feature activation-2.95e+00

Pos logit contributions
driver+0.001
 unravel+0.001
paper+0.001
 bud+0.001
 rip+0.001
Neg logit contributions
 much-0.001
 Everyone-0.001
 important-0.001
 best-0.001
 And-0.001
Token became
Token ID2627
Feature activation-6.29e+00

Pos logit contributions
 much+0.055
 best+0.054
 important+0.053
 Everyone+0.053
 And+0.050
Neg logit contributions
driver-0.079
paper-0.077
 unravel-0.077
 bud-0.073
ho-0.071
Token the
Token ID262
Feature activation-3.11e+00

Pos logit contributions
 Everyone+0.091
ovable+0.086
 Everybody+0.082
orry+0.079
anny+0.078
Neg logit contributions
 unravel-0.181
paper-0.177
 bud-0.170
 swoop-0.168
driver-0.161
Token best
Token ID1266
Feature activation-1.53e+00

Pos logit contributions
 Everyone+0.017
 much+0.016
 And+0.013
 Everybody+0.010
 named+0.010
Neg logit contributions
 unravel-0.121
 bud-0.121
driver-0.120
paper-0.118
 clip-0.113
Token of
Token ID286
Feature activation-3.83e+00

Pos logit contributions
 much+0.023
 Everyone+0.022
 important+0.022
 best+0.022
 glad+0.021
Neg logit contributions
driver-0.046
 unravel-0.043
 bud-0.043
paper-0.042
ro-0.040
Token friends
Token ID2460
Feature activation-4.97e+00

Pos logit contributions
 Everyone+0.034
 much+0.032
 And+0.030
 Summer+0.027
 People+0.027
Neg logit contributions
driver-0.134
 bud-0.129
 unravel-0.128
paper-0.124
 swoop-0.121
Token.
Token ID13
Feature activation-2.78e+00

Pos logit contributions
 important+0.124
 Everyone+0.120
 much+0.119
 best+0.118
 glad+0.112
Neg logit contributions
driver-0.096
paper-0.091
 unravel-0.086
 bud-0.083
ho-0.082
Tokenined
Token ID1389
Feature activation-1.64e+00

Pos logit contributions
 best+0.024
 much+0.024
 important+0.023
 Everyone+0.022
 And+0.022
Neg logit contributions
 unravel-0.020
driver-0.019
paper-0.019
ro-0.018
 bud-0.017
Token together
Token ID1978
Feature activation-1.45e+00

Pos logit contributions
 much+0.029
 best+0.029
 important+0.028
 Everyone+0.028
 And+0.027
Neg logit contributions
driver-0.046
paper-0.044
 unravel-0.044
 bud-0.042
ro-0.040
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
 much+0.025
 best+0.025
 important+0.023
 And+0.022
 Everyone+0.022
Neg logit contributions
driver-0.044
paper-0.042
 unravel-0.041
 bud-0.039
ro-0.038
Token and
Token ID290
Feature activation-2.76e+00

Pos logit contributions
 much+0.021
 best+0.021
 important+0.020
 Everyone+0.020
 And+0.020
Neg logit contributions
driver-0.029
paper-0.027
 unravel-0.027
 bud-0.026
ro-0.025
Token they
Token ID484
Feature activation-2.80e+00

Pos logit contributions
 much+0.047
 best+0.047
 important+0.046
 Everyone+0.046
ovable+0.045
Neg logit contributions
 unravel-0.074
paper-0.073
driver-0.072
ho-0.070
 bud-0.069
Token became
Token ID2627
Feature activation-6.29e+00

Pos logit contributions
 much+0.055
 important+0.054
 Everyone+0.054
 best+0.054
 And+0.053
Neg logit contributions
 unravel-0.071
driver-0.070
paper-0.070
 bud-0.067
ho-0.064
Token the
Token ID262
Feature activation-3.56e+00

Pos logit contributions
 Everyone+0.093
ovable+0.089
orry+0.085
 Everybody+0.083
 People+0.083
Neg logit contributions
 unravel-0.179
paper-0.174
 bud-0.165
 swoop-0.159
 clip-0.152
Token best
Token ID1266
Feature activation-1.46e+00

Pos logit contributions
 Everyone+0.021
 much+0.018
 And+0.017
 important+0.012
 People+0.012
Neg logit contributions
 unravel-0.138
 bud-0.138
paper-0.136
driver-0.132
 clip-0.130
Token of
Token ID286
Feature activation-4.19e+00

Pos logit contributions
 much+0.022
 Everyone+0.021
 important+0.021
 best+0.020
 And+0.020
Neg logit contributions
driver-0.042
 unravel-0.042
paper-0.041
 bud-0.041
ro-0.039
Token friends
Token ID2460
Feature activation-4.41e+00

Pos logit contributions
 And+0.037
 Everyone+0.036
 much+0.033
 People+0.030
 best+0.030
Neg logit contributions
 unravel-0.143
 bud-0.142
driver-0.141
paper-0.137
 swoop-0.135
Token.
Token ID13
Feature activation-2.81e+00

Pos logit contributions
 important+0.108
 Everyone+0.107
 best+0.105
 much+0.105
 People+0.099
Neg logit contributions
driver-0.086
paper-0.081
 unravel-0.079
 bud-0.076
ho-0.073
Token it
Token ID340
Feature activation+1.05e+00

Pos logit contributions
 best+0.006
 much+0.006
 important+0.006
 Everyone+0.006
 And+0.006
Neg logit contributions
driver-0.009
paper-0.009
 unravel-0.008
 bud-0.008
ho-0.008
Token was
Token ID373
Feature activation-2.52e+00

Pos logit contributions
driver+0.029
paper+0.028
 unravel+0.027
 bud+0.026
ro+0.025
Neg logit contributions
 much-0.020
 best-0.020
 important-0.019
 Everyone-0.019
 And-0.018
Token a
Token ID257
Feature activation+1.09e+00

Pos logit contributions
 Everyone+0.047
ovable+0.043
 Everybody+0.042
 And+0.041
 People+0.040
Neg logit contributions
 unravel-0.065
paper-0.063
 bud-0.061
driver-0.060
 rip-0.059
Token reward
Token ID6721
Feature activation+1.63e-01

Pos logit contributions
driver+0.034
 unravel+0.033
paper+0.032
 bud+0.032
 clip+0.030
Neg logit contributions
 much-0.015
 Everyone-0.015
 best-0.015
 important-0.014
 And-0.014
Token for
Token ID329
Feature activation-3.06e+00

Pos logit contributions
driver+0.005
paper+0.005
 unravel+0.005
 bud+0.004
ro+0.004
Neg logit contributions
 much-0.003
 best-0.003
 important-0.003
 Everyone-0.003
 And-0.002
Token being
Token ID852
Feature activation-6.29e+00

Pos logit contributions
 much+0.051
 best+0.051
 And+0.049
 Everyone+0.049
 important+0.049
Neg logit contributions
driver-0.085
 unravel-0.084
paper-0.082
 bud-0.081
 rip-0.077
Token good
Token ID922
Feature activation-1.42e+00

Pos logit contributions
 Everyone+0.096
 And+0.091
 best+0.090
 much+0.090
 important+0.085
Neg logit contributions
driver-0.183
paper-0.181
 unravel-0.178
 bud-0.168
 swoop-0.163
Token and
Token ID290
Feature activation-3.01e+00

Pos logit contributions
 best+0.024
 much+0.023
 important+0.022
 And+0.021
 Everyone+0.021
Neg logit contributions
driver-0.042
 unravel-0.042
paper-0.041
 bud-0.039
ho-0.039
Token helping
Token ID5742
Feature activation-1.18e-01

Pos logit contributions
 much+0.051
 best+0.050
 important+0.050
 Everyone+0.044
 And+0.044
Neg logit contributions
driver-0.090
 unravel-0.087
paper-0.086
 bud-0.083
ro-0.082
Token them
Token ID606
Feature activation-2.27e+00

Pos logit contributions
 best+0.002
 much+0.002
 And+0.002
 important+0.002
 Everyone+0.002
Neg logit contributions
driver-0.003
 unravel-0.003
paper-0.003
 bud-0.003
 rip-0.003
Token."
Token ID526
Feature activation-9.19e-01

Pos logit contributions
 much+0.047
 best+0.045
 Everyone+0.044
 important+0.043
 many+0.042
Neg logit contributions
driver-0.060
paper-0.058
 unravel-0.056
ho-0.055
 bud-0.053
Token oy
Token ID35104
Feature activation+3.00e-01

Pos logit contributions
 much+0.013
 best+0.013
 Everyone+0.012
 And+0.012
 important+0.012
Neg logit contributions
driver-0.013
paper-0.012
 unravel-0.012
 bud-0.012
ho-0.011
Tokenster
Token ID1706
Feature activation-1.88e+00

Pos logit contributions
driver+0.007
 unravel+0.006
paper+0.006
 rip+0.006
pan+0.006
Neg logit contributions
 best-0.008
 important-0.007
 lucky-0.007
 much-0.007
 And-0.007
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
 much+0.036
 best+0.035
 important+0.034
 Everyone+0.033
 And+0.033
Neg logit contributions
driver-0.051
paper-0.049
 unravel-0.049
 bud-0.046
ro-0.045
Token the
Token ID262
Feature activation-1.30e-01

Pos logit contributions
 much+0.029
ovable+0.029
 Everyone+0.028
 best+0.028
 And+0.027
Neg logit contributions
 unravel-0.044
paper-0.042
 bud-0.041
driver-0.041
 rip-0.040
Token crab
Token ID32202
Feature activation-2.97e+00

Pos logit contributions
 much+0.003
 best+0.003
 Everyone+0.003
 important+0.003
 And+0.003
Neg logit contributions
driver-0.003
paper-0.003
 unravel-0.003
 bud-0.002
ro-0.002
Token became
Token ID2627
Feature activation-6.28e+00

Pos logit contributions
 much+0.055
 Everyone+0.054
 important+0.053
 best+0.053
 And+0.050
Neg logit contributions
driver-0.079
 unravel-0.076
paper-0.076
 bud-0.072
ho-0.071
Token good
Token ID922
Feature activation-2.46e+00

Pos logit contributions
ovable+0.099
 Everyone+0.097
 Everybody+0.093
anny+0.090
orry+0.089
Neg logit contributions
 unravel-0.169
paper-0.166
 bud-0.162
 swoop-0.158
driver-0.142
Token friends
Token ID2460
Feature activation-4.99e+00

Pos logit contributions
 much+0.026
 Everyone+0.025
 important+0.021
 Everybody+0.021
 Summer+0.020
Neg logit contributions
paper-0.081
driver-0.080
 bud-0.079
 unravel-0.077
ro-0.075
Token and
Token ID290
Feature activation-3.59e+00

Pos logit contributions
 important+0.134
 Everyone+0.132
 much+0.125
 best+0.125
 Everybody+0.124
Neg logit contributions
driver-0.077
paper-0.076
ho-0.070
 unravel-0.069
 bud-0.068
Token protected
Token ID6861
Feature activation-8.69e-01

Pos logit contributions
 much+0.068
 best+0.068
ovable+0.068
 Everyone+0.068
 important+0.067
Neg logit contributions
paper-0.086
ho-0.086
 unravel-0.086
driver-0.085
 bud-0.082
Token each
Token ID1123
Feature activation-3.35e+00

Pos logit contributions
 much+0.017
 best+0.017
 Everyone+0.016
ovable+0.016
 important+0.016
Neg logit contributions
paper-0.021
 unravel-0.021
driver-0.020
 bud-0.020
ho-0.020
Token played
Token ID2826
Feature activation-7.83e-01

Pos logit contributions
 best+0.048
 much+0.048
 important+0.047
 Everyone+0.047
 And+0.045
Neg logit contributions
 unravel-0.049
driver-0.048
paper-0.048
 bud-0.045
ho-0.044
Token all
Token ID477
Feature activation-3.86e+00

Pos logit contributions
 much+0.015
 best+0.015
 important+0.015
 Everyone+0.015
 And+0.014
Neg logit contributions
driver-0.020
 unravel-0.020
paper-0.019
 bud-0.018
ro-0.018
Token day
Token ID1110
Feature activation-1.57e+00

Pos logit contributions
ovable+0.061
 importantly+0.053
 best+0.050
 Everyone+0.049
 And+0.046
Neg logit contributions
ho-0.107
ro-0.106
 swoop-0.104
paper-0.101
 unravel-0.101
Token.
Token ID13
Feature activation-1.67e+00

Pos logit contributions
 Everyone+0.030
 best+0.030
 much+0.030
 important+0.030
 And+0.029
Neg logit contributions
driver-0.041
paper-0.039
 unravel-0.039
 bud-0.037
ro-0.036
Token They
Token ID1119
Feature activation-2.06e+00

Pos logit contributions
 best+0.030
ovable+0.030
 important+0.029
 much+0.027
 lucky+0.026
Neg logit contributions
 unravel-0.043
driver-0.043
paper-0.042
ro-0.041
 rip-0.039
Token became
Token ID2627
Feature activation-6.27e+00

Pos logit contributions
 Everyone+0.045
 much+0.045
 best+0.044
 important+0.044
 And+0.043
Neg logit contributions
 unravel-0.049
paper-0.047
driver-0.047
 bud-0.045
ho-0.044
Token best
Token ID1266
Feature activation-3.05e+00

Pos logit contributions
 Everyone+0.107
 Everybody+0.101
ovable+0.098
 People+0.095
orry+0.094
Neg logit contributions
 unravel-0.163
paper-0.157
 swoop-0.156
 bud-0.153
 clip-0.141
Token friends
Token ID2460
Feature activation-4.19e+00

Pos logit contributions
 important+0.031
 much+0.031
 Everyone+0.031
 And+0.027
 best+0.026
Neg logit contributions
 bud-0.101
driver-0.100
paper-0.096
 unravel-0.095
ho-0.091
Token and
Token ID290
Feature activation-3.25e+00

Pos logit contributions
 important+0.119
 Everyone+0.117
 best+0.113
 Everybody+0.111
 People+0.111
Neg logit contributions
driver-0.065
paper-0.059
 bud-0.056
 unravel-0.056
ho-0.052
Token had
Token ID550
Feature activation-3.51e+00

Pos logit contributions
ovable+0.077
 Everyone+0.075
 important+0.074
 Everybody+0.073
 much+0.073
Neg logit contributions
ho-0.067
paper-0.062
 unravel-0.061
 swoop-0.059
 bud-0.057
Token many
Token ID867
Feature activation-3.04e+00

Pos logit contributions
ovable+0.060
 Everyone+0.053
 Everybody+0.052
 importantly+0.051
 glad+0.051
Neg logit contributions
paper-0.096
 unravel-0.089
 swoop-0.082
ASH-0.079
driver-0.077
Token every
Token ID790
Feature activation-1.89e+00

Pos logit contributions
 much+0.026
 best+0.025
 important+0.024
 And+0.022
 Everyone+0.022
Neg logit contributions
driver-0.058
paper-0.055
 unravel-0.055
 bud-0.053
ro-0.051
Token day
Token ID1110
Feature activation-2.19e+00

Pos logit contributions
 Everyone+0.015
 best+0.014
 much+0.014
 And+0.013
 important+0.012
Neg logit contributions
driver-0.071
 unravel-0.070
paper-0.068
 bud-0.068
ro-0.065
Token,
Token ID11
Feature activation-1.49e+00

Pos logit contributions
 Everyone+0.049
 best+0.049
 important+0.048
 much+0.048
 And+0.046
Neg logit contributions
driver-0.049
paper-0.048
 unravel-0.046
 bud-0.044
ho-0.042
Token and
Token ID290
Feature activation-2.28e+00

Pos logit contributions
 Everyone+0.034
 much+0.033
 best+0.033
 important+0.032
ovable+0.031
Neg logit contributions
driver-0.034
paper-0.033
 unravel-0.033
 bud-0.031
ro-0.029
Token they
Token ID484
Feature activation-3.23e+00

Pos logit contributions
ovable+0.045
 Everyone+0.044
 much+0.042
 best+0.041
 Everybody+0.041
Neg logit contributions
driver-0.055
 unravel-0.055
paper-0.054
ho-0.050
 bud-0.049
Token became
Token ID2627
Feature activation-6.27e+00

Pos logit contributions
 Everyone+0.061
 much+0.059
 important+0.058
 best+0.058
 And+0.056
Neg logit contributions
 unravel-0.085
paper-0.084
driver-0.083
 bud-0.080
ho-0.078
Token the
Token ID262
Feature activation-3.01e+00

Pos logit contributions
 Everyone+0.101
ovable+0.101
 Everybody+0.095
anna+0.094
orry+0.093
Neg logit contributions
 unravel-0.166
paper-0.162
 bud-0.154
driver-0.141
 swoop-0.139
Token best
Token ID1266
Feature activation-1.52e+00

Pos logit contributions
 Everyone+0.016
 much+0.014
 And+0.012
 Everybody+0.009
 named+0.009
Neg logit contributions
 unravel-0.118
 bud-0.118
driver-0.117
paper-0.116
 clip-0.111
Token of
Token ID286
Feature activation-3.90e+00

Pos logit contributions
 much+0.022
 Everyone+0.022
 best+0.022
 important+0.021
 glad+0.020
Neg logit contributions
driver-0.046
 unravel-0.043
paper-0.043
 bud-0.043
ro-0.040
Token friends
Token ID2460
Feature activation-4.42e+00

Pos logit contributions
 Everyone+0.040
 And+0.037
 much+0.037
 People+0.035
 Everybody+0.033
Neg logit contributions
driver-0.128
 bud-0.125
 unravel-0.124
paper-0.119
 swoop-0.114
Token.
Token ID13
Feature activation-2.61e+00

Pos logit contributions
 important+0.118
 Everyone+0.117
 best+0.112
 much+0.112
 People+0.111
Neg logit contributions
driver-0.074
paper-0.067
 unravel-0.063
 bud-0.062
ho-0.059
Token<|endoftext|>
Token ID50256
Feature activation+5.00e-01

Pos logit contributions
 best+0.027
 important+0.027
ovable+0.025
 much+0.024
 adventurous+0.024
Neg logit contributions
driver-0.029
 unravel-0.028
ro-0.027
paper-0.026
 bud-0.025
Token<|endoftext|>
Token ID50256
Feature activation-3.04e-01

Pos logit contributions
driver+0.013
 unravel+0.013
paper+0.013
 bud+0.012
ro+0.012
Neg logit contributions
 much-0.010
 best-0.010
 important-0.009
 Everyone-0.009
 And-0.009
Token<|endoftext|>
Token ID50256
Feature activation-8.31e-01

Pos logit contributions
 much+0.006
 important+0.006
 best+0.006
 Everyone+0.006
 And+0.005
Neg logit contributions
driver-0.008
 unravel-0.008
paper-0.007
 bud-0.007
ro-0.007
Token<|endoftext|>
Token ID50256
Feature activation-1.05e+00

Pos logit contributions
 much+0.016
 important+0.016
 best+0.016
 Everyone+0.015
 And+0.015
Neg logit contributions
driver-0.021
 unravel-0.020
paper-0.020
 bud-0.019
ro-0.019
Token<|endoftext|>
Token ID50256
Feature activation-1.13e+00

Pos logit contributions
 much+0.021
 important+0.021
 best+0.021
 Everyone+0.020
 And+0.019
Neg logit contributions
driver-0.027
 unravel-0.026
paper-0.026
 bud-0.024
ro-0.024
TokenL
Token ID43
Feature activation-6.26e+00

Pos logit contributions
 much+0.022
 important+0.022
 best+0.022
 Everyone+0.021
 And+0.020
Neg logit contributions
driver-0.028
 unravel-0.028
paper-0.028
 bud-0.026
ro-0.026
Tokenila
Token ID10102
Feature activation-5.14e+00

Pos logit contributions
 much+0.108
 best+0.107
 important+0.107
 Everyone+0.106
 glad+0.097
Neg logit contributions
driver-0.169
paper-0.162
 unravel-0.161
 bud-0.159
ro-0.155
Token and
Token ID290
Feature activation-1.89e+00

Pos logit contributions
 much+0.143
 best+0.142
 important+0.140
 Everyone+0.139
ovable+0.135
Neg logit contributions
driver-0.082
 unravel-0.080
paper-0.079
ro-0.078
 bud-0.075
Token Ben
Token ID3932
Feature activation-2.49e+00

Pos logit contributions
 important+0.037
 best+0.037
 much+0.035
ovable+0.034
 Everyone+0.033
Neg logit contributions
 unravel-0.046
paper-0.044
driver-0.044
 rip-0.043
 swoop-0.042
Token are
Token ID389
Feature activation-4.42e+00

Pos logit contributions
 much+0.056
 best+0.056
 important+0.055
 Everyone+0.054
 And+0.051
Neg logit contributions
driver-0.056
paper-0.055
 unravel-0.054
 bud-0.051
ro-0.050
Token friends
Token ID2460
Feature activation-4.02e+00

Pos logit contributions
 Everyone+0.069
 Everybody+0.064
 People+0.059
 important+0.057
ovable+0.057
Neg logit contributions
driver-0.118
paper-0.113
ho-0.112
 bud-0.111
 unravel-0.108
Token and
Token ID290
Feature activation-2.49e+00

Pos logit contributions
 much+0.012
 important+0.011
 best+0.011
 Everyone+0.011
ovable+0.010
Neg logit contributions
driver-0.016
 unravel-0.016
paper-0.015
 bud-0.015
ro-0.014
Token had
Token ID550
Feature activation-3.22e+00

Pos logit contributions
 Everyone+0.055
 best+0.055
 much+0.055
 important+0.054
 And+0.053
Neg logit contributions
driver-0.057
paper-0.057
 unravel-0.056
 bud-0.053
ho-0.053
Token fun
Token ID1257
Feature activation-1.67e+00

Pos logit contributions
ovable+0.045
 Everyone+0.045
 important+0.043
 best+0.041
 And+0.040
Neg logit contributions
paper-0.097
 unravel-0.095
driver-0.093
 bud-0.088
 swoop-0.086
Token.
Token ID13
Feature activation-2.13e+00

Pos logit contributions
 important+0.038
 much+0.038
 best+0.038
 Everyone+0.038
ovable+0.037
Neg logit contributions
driver-0.036
 unravel-0.034
paper-0.034
 bud-0.033
ro-0.031
Token They
Token ID1119
Feature activation-2.57e+00

Pos logit contributions
ovable+0.047
 best+0.047
 important+0.045
 much+0.043
 importantly+0.043
Neg logit contributions
ro-0.047
 unravel-0.046
driver-0.044
paper-0.041
 rip-0.039
Token became
Token ID2627
Feature activation-6.25e+00

Pos logit contributions
 Everyone+0.057
 important+0.056
 much+0.056
 best+0.055
 And+0.055
Neg logit contributions
 unravel-0.060
paper-0.058
 bud-0.056
driver-0.056
ho-0.056
Token best
Token ID1266
Feature activation-3.10e+00

Pos logit contributions
 Everyone+0.102
 Everybody+0.096
ovable+0.096
orry+0.091
anny+0.089
Neg logit contributions
 unravel-0.167
paper-0.161
 bud-0.159
 swoop-0.158
 clip-0.141
Token friends
Token ID2460
Feature activation-4.26e+00

Pos logit contributions
 important+0.031
 Everyone+0.030
 much+0.030
 best+0.026
 And+0.026
Neg logit contributions
driver-0.103
 bud-0.101
paper-0.099
 unravel-0.095
ro-0.092
Token and
Token ID290
Feature activation-3.37e+00

Pos logit contributions
 Everyone+0.121
 important+0.120
 People+0.116
 best+0.115
 Everybody+0.115
Neg logit contributions
driver-0.063
paper-0.058
 bud-0.056
 unravel-0.056
ho-0.050
Token lived
Token ID5615
Feature activation-3.67e+00

Pos logit contributions
ovable+0.078
 Everyone+0.077
 important+0.075
 much+0.074
 Everybody+0.074
Neg logit contributions
ho-0.071
paper-0.066
 unravel-0.064
 swoop-0.064
 bud-0.060
Token happily
Token ID18177
Feature activation-4.09e+00

Pos logit contributions
 Everyone+0.090
 much+0.087
 important+0.087
 best+0.085
 And+0.085
Neg logit contributions
 unravel-0.072
driver-0.070
paper-0.069
 bud-0.067
ho-0.062
Token Lily
Token ID20037
Feature activation+1.58e+00

Pos logit contributions
 best+0.005
 important+0.005
 much+0.004
ovable+0.004
 glad+0.004
Neg logit contributions
 unravel-0.005
driver-0.005
 rip-0.005
 bud-0.005
ro-0.005
Token shouted
Token ID17293
Feature activation+1.52e+00

Pos logit contributions
driver+0.036
paper+0.035
 unravel+0.034
 bud+0.032
ho+0.031
Neg logit contributions
 much-0.037
 best-0.037
 Everyone-0.036
 And-0.036
 important-0.035
Token.
Token ID13
Feature activation+5.71e-01

Pos logit contributions
driver+0.037
 unravel+0.035
paper+0.034
 bud+0.033
ro+0.031
Neg logit contributions
 best-0.033
 much-0.033
 important-0.032
 Everyone-0.031
 And-0.030
Token
Token ID198
Feature activation+1.74e+00

Pos logit contributions
driver+0.016
 unravel+0.016
paper+0.016
 bud+0.015
ro+0.014
Neg logit contributions
 much-0.010
 best-0.010
 important-0.010
 Everyone-0.009
 And-0.009
Token
Token ID198
Feature activation+1.59e+00

Pos logit contributions
driver+0.047
 unravel+0.045
paper+0.044
 bud+0.042
ro+0.042
Neg logit contributions
 best-0.032
 important-0.032
 much-0.031
ovable-0.028
 Everyone-0.028
Token"
Token ID1
Feature activation+4.41e+00

Pos logit contributions
driver+0.032
paper+0.030
 unravel+0.030
 bud+0.028
ro+0.026
Neg logit contributions
 much-0.042
 best-0.041
 important-0.040
 Everyone-0.040
 And-0.040
TokenNo
Token ID2949
Feature activation+3.96e-01

Pos logit contributions
driver+0.090
paper+0.085
 unravel+0.079
 bud+0.079
pan+0.074
Neg logit contributions
 Everyone-0.117
 much-0.117
 best-0.112
 important-0.110
 And-0.108
Token,
Token ID11
Feature activation+2.29e+00

Pos logit contributions
driver+0.011
paper+0.010
 unravel+0.010
 bud+0.010
ro+0.009
Neg logit contributions
 much-0.008
 best-0.007
 important-0.007
 Everyone-0.007
 And-0.007
Token you
Token ID345
Feature activation+3.05e-01

Pos logit contributions
driver+0.065
paper+0.062
 unravel+0.062
 bud+0.059
ro+0.057
Neg logit contributions
 much-0.041
 best-0.041
 important-0.039
 Everyone-0.039
 And-0.038
Token can
Token ID460
Feature activation+3.04e+00

Pos logit contributions
driver+0.007
paper+0.007
 unravel+0.007
 bud+0.007
ro+0.006
Neg logit contributions
 much-0.007
 best-0.007
 important-0.007
 Everyone-0.007
 And-0.006
Token't
Token ID470
Feature activation+2.27e+00

Pos logit contributions
driver+0.071
paper+0.070
pan+0.067
ro+0.066
 unravel+0.065
Neg logit contributions
 much-0.071
 best-0.068
 Everyone-0.068
 important-0.063
 And-0.062
Token
Token ID198
Feature activation+1.10e+00

Pos logit contributions
 best+0.003
 important+0.003
 much+0.003
ovable+0.003
 lucky+0.003
Neg logit contributions
driver-0.005
 unravel-0.005
paper-0.004
ro-0.004
 bud-0.004
Token
Token ID198
Feature activation+1.55e+00

Pos logit contributions
driver+0.029
 unravel+0.028
paper+0.027
 bud+0.026
ro+0.026
Neg logit contributions
 best-0.021
 important-0.020
 much-0.019
ovable-0.018
 Everyone-0.018
Token"
Token ID1
Feature activation+5.27e+00

Pos logit contributions
driver+0.032
 unravel+0.030
paper+0.029
 bud+0.028
ro+0.027
Neg logit contributions
 best-0.040
 much-0.039
 important-0.038
 Everyone-0.038
 And-0.037
TokenStop
Token ID19485
Feature activation+1.15e+00

Pos logit contributions
driver+0.109
paper+0.106
 bud+0.093
ge+0.091
 unravel+0.091
Neg logit contributions
 much-0.135
 Everyone-0.134
 best-0.131
 important-0.130
 And-0.125
Token!"
Token ID2474
Feature activation+8.69e-01

Pos logit contributions
driver+0.031
 unravel+0.030
paper+0.030
ro+0.030
 bud+0.028
Neg logit contributions
 much-0.021
 best-0.021
 Everyone-0.020
 important-0.019
 And-0.019
Token a
Token ID257
Feature activation+4.75e+00

Pos logit contributions
driver+0.022
 unravel+0.021
paper+0.020
 bud+0.020
 rip+0.020
Neg logit contributions
 best-0.018
 important-0.018
 much-0.017
ovable-0.016
 glad-0.015
Token voice
Token ID3809
Feature activation+2.51e+00

Pos logit contributions
driver+0.105
paper+0.105
 unravel+0.100
ro+0.090
ge+0.090
Neg logit contributions
 much-0.118
 best-0.114
 important-0.112
 Everyone-0.111
 And-0.106
Token said
Token ID531
Feature activation+2.29e+00

Pos logit contributions
driver+0.061
paper+0.059
 unravel+0.059
 bud+0.054
ro+0.054
Neg logit contributions
 much-0.054
 best-0.052
 Everyone-0.052
 important-0.052
 And-0.051
Token.
Token ID13
Feature activation+1.90e+00

Pos logit contributions
driver+0.064
paper+0.063
 unravel+0.063
 bud+0.059
ho+0.058
Neg logit contributions
 much-0.041
 best-0.040
 Everyone-0.038
 And-0.038
 important-0.038
Token It
Token ID632
Feature activation+4.00e+00

Pos logit contributions
paper+0.058
driver+0.057
 unravel+0.056
ge+0.054
 bud+0.054
Neg logit contributions
 And-0.030
 much-0.029
 Everyone-0.029
 best-0.027
 many-0.025
Token was
Token ID373
Feature activation+1.07e-01

Pos logit contributions
driver+0.140
paper+0.138
ge+0.131
ro+0.129
pan+0.126
Neg logit contributions
 named-0.050
 lived-0.047
 best-0.046
 much-0.045
 important-0.043
Token shiny
Token ID22441
Feature activation+3.13e+00

Pos logit contributions
 best+0.022
 much+0.020
 important+0.020
 many+0.018
 lucky+0.018
Neg logit contributions
driver-0.028
 unravel-0.025
paper-0.025
ge-0.023
 bud-0.023
Token one
Token ID530
Feature activation-1.54e+00

Pos logit contributions
driver+0.088
 unravel+0.084
ocated+0.074
paper+0.074
pan+0.073
Neg logit contributions
 best-0.065
 much-0.057
 friends-0.055
 many-0.054
 important-0.054
Token.
Token ID13
Feature activation-1.74e+00

Pos logit contributions
 much+0.029
 best+0.029
 important+0.028
 And+0.027
 Everyone+0.027
Neg logit contributions
driver-0.042
 unravel-0.041
paper-0.040
 bud-0.039
ro-0.037
Token
Token ID198
Feature activation-5.89e-01

Pos logit contributions
 best+0.032
 important+0.032
 much+0.030
ovable+0.029
 adventurous+0.028
Neg logit contributions
driver-0.047
 unravel-0.044
paper-0.043
ro-0.043
 bud-0.040
Token
Token ID198
Feature activation+4.46e-02

Pos logit contributions
 best+0.013
ovable+0.012
 important+0.012
 happiest+0.011
 adventurous+0.011
Neg logit contributions
driver-0.013
 unravel-0.012
paper-0.011
 bud-0.011
ho-0.011
Token"
Token ID1
Feature activation+4.07e+00

Pos logit contributions
driver+0.001
 unravel+0.001
paper+0.001
 bud+0.001
ho+0.001
Neg logit contributions
 best-0.001
 much-0.001
 Everyone-0.001
 important-0.001
ovable-0.001
TokenSorry
Token ID14385
Feature activation+1.81e+00

Pos logit contributions
driver+0.101
paper+0.097
 bud+0.090
ge+0.088
 unravel+0.086
Neg logit contributions
 much-0.089
 Everyone-0.088
 important-0.087
 best-0.086
 And-0.086
Token,
Token ID11
Feature activation+2.67e+00

Pos logit contributions
driver+0.053
 unravel+0.052
paper+0.051
 bud+0.048
ro+0.048
Neg logit contributions
 much-0.031
 best-0.031
 important-0.029
 Everyone-0.029
 And-0.028
Token kids
Token ID3988
Feature activation-1.98e+00

Pos logit contributions
driver+0.066
 unravel+0.063
paper+0.062
 bud+0.061
 rip+0.057
Neg logit contributions
 much-0.056
 best-0.055
 Everyone-0.054
 important-0.053
 And-0.052
Token,
Token ID11
Feature activation+1.84e+00

Pos logit contributions
 much+0.038
 best+0.037
 And+0.035
 important+0.035
 named+0.034
Neg logit contributions
driver-0.055
 unravel-0.054
paper-0.053
ro-0.050
 bud-0.048
Token I
Token ID314
Feature activation+2.37e+00

Pos logit contributions
driver+0.047
paper+0.045
 unravel+0.045
 bud+0.043
ro+0.041
Neg logit contributions
 much-0.038
 best-0.037
 important-0.036
 Everyone-0.036
 And-0.035
Token noise
Token ID7838
Feature activation+2.06e+00

Pos logit contributions
driver+0.125
paper+0.120
 unravel+0.112
ro+0.109
 clip+0.106
Neg logit contributions
 much-0.099
 important-0.091
 best-0.091
 Everyone-0.089
 And-0.086
Token.
Token ID13
Feature activation+4.95e-01

Pos logit contributions
driver+0.064
paper+0.062
 unravel+0.059
ro+0.058
 clip+0.057
Neg logit contributions
 much-0.034
 about-0.030
 Everyone-0.030
 important-0.029
 named-0.029
Token It
Token ID632
Feature activation+2.76e+00

Pos logit contributions
driver+0.013
 unravel+0.013
paper+0.012
 bud+0.012
ro+0.011
Neg logit contributions
 best-0.010
 much-0.009
 important-0.009
ovable-0.008
 glad-0.008
Token was
Token ID373
Feature activation-7.29e-01

Pos logit contributions
driver+0.078
paper+0.075
 unravel+0.069
ro+0.067
 clip+0.066
Neg logit contributions
 much-0.054
 important-0.051
 best-0.051
 Everyone-0.050
 And-0.048
Token a
Token ID257
Feature activation+3.09e+00

Pos logit contributions
 much+0.010
 best+0.010
 important+0.010
 Everyone+0.010
 And+0.009
Neg logit contributions
driver-0.023
paper-0.023
 unravel-0.022
 bud-0.022
ro-0.021
Token high
Token ID1029
Feature activation+5.02e+00

Pos logit contributions
driver+0.087
paper+0.084
 unravel+0.083
 bud+0.078
ro+0.077
Neg logit contributions
 much-0.057
 best-0.055
 important-0.054
 Everyone-0.052
 And-0.051
Token-
Token ID12
Feature activation+3.03e-01

Pos logit contributions
driver+0.119
paper+0.118
 unravel+0.103
ge+0.095
pan+0.094
Neg logit contributions
 much-0.113
 important-0.112
 best-0.110
 named-0.105
 day-0.103
Tokenp
Token ID79
Feature activation-5.14e-01

Pos logit contributions
driver+0.007
paper+0.007
 unravel+0.007
 bud+0.006
ro+0.006
Neg logit contributions
 much-0.007
 best-0.007
 important-0.007
 Everyone-0.007
 And-0.007
Tokenitched
Token ID10981
Feature activation+4.57e+00

Pos logit contributions
 much+0.011
 best+0.011
 Everyone+0.011
 important+0.010
 And+0.010
Neg logit contributions
driver-0.013
paper-0.012
 unravel-0.012
 bud-0.011
ro-0.011
Token,
Token ID11
Feature activation+1.85e+00

Pos logit contributions
paper+0.115
driver+0.115
 unravel+0.101
ge+0.098
 clip+0.095
Neg logit contributions
 much-0.100
 important-0.097
 best-0.096
 Everyone-0.094
 And-0.089
Token sque
Token ID35776
Feature activation-4.25e-02

Pos logit contributions
driver+0.054
paper+0.051
 unravel+0.048
 bud+0.047
ge+0.046
Neg logit contributions
 much-0.036
 important-0.034
 best-0.033
 And-0.031
 glad-0.031
Token.
Token ID13
Feature activation+4.63e-01

Pos logit contributions
driver+0.082
paper+0.078
 unravel+0.075
ho+0.074
 wra+0.071
Neg logit contributions
 much-0.065
 best-0.061
 Everyone-0.060
 And-0.058
 important-0.058
Token He
Token ID679
Feature activation+3.57e+00

Pos logit contributions
driver+0.012
paper+0.011
 unravel+0.011
 bud+0.011
ho+0.011
Neg logit contributions
 much-0.010
 Everyone-0.010
 And-0.009
 best-0.009
 important-0.009
Token only
Token ID691
Feature activation+9.53e-01

Pos logit contributions
driver+0.080
paper+0.076
 unravel+0.074
 bud+0.071
pan+0.066
Neg logit contributions
 much-0.086
 best-0.085
 important-0.082
 Everyone-0.080
 And-0.079
Token made
Token ID925
Feature activation-7.69e-01

Pos logit contributions
driver+0.023
paper+0.022
 unravel+0.022
 bud+0.021
pan+0.020
Neg logit contributions
 much-0.022
 best-0.021
 important-0.020
 And-0.019
 Everyone-0.019
Token a
Token ID257
Feature activation+3.27e+00

Pos logit contributions
 much+0.011
 best+0.011
 important+0.010
 Everyone+0.010
 And+0.010
Neg logit contributions
driver-0.024
paper-0.024
 unravel-0.024
 bud-0.023
ro-0.022
Token funny
Token ID8258
Feature activation+4.05e+00

Pos logit contributions
driver+0.097
paper+0.091
 unravel+0.088
ro+0.084
 bud+0.082
Neg logit contributions
 much-0.064
 best-0.062
 important-0.059
 many-0.054
 And-0.053
Token sound
Token ID2128
Feature activation+7.15e-01

Pos logit contributions
driver+0.114
paper+0.106
 unravel+0.106
 bud+0.100
ge+0.099
Neg logit contributions
 best-0.080
 much-0.080
 important-0.075
 Everyone-0.073
 And-0.073
Token.
Token ID13
Feature activation+6.48e-02

Pos logit contributions
driver+0.020
 unravel+0.020
paper+0.019
 bud+0.019
ro+0.018
Neg logit contributions
 much-0.013
 best-0.012
 important-0.012
 And-0.012
 Everyone-0.012
Token He
Token ID679
Feature activation+2.81e+00

Pos logit contributions
driver+0.002
paper+0.002
 unravel+0.002
 bud+0.002
ho+0.001
Neg logit contributions
 much-0.001
 best-0.001
 Everyone-0.001
 And-0.001
 important-0.001
Token tried
Token ID3088
Feature activation+3.95e+00

Pos logit contributions
driver+0.066
paper+0.062
 unravel+0.060
 bud+0.058
ro+0.055
Neg logit contributions
 much-0.065
 best-0.064
 important-0.062
 Everyone-0.060
 And-0.060
Token again
Token ID757
Feature activation+1.55e+00

Pos logit contributions
driver+0.098
paper+0.092
 unravel+0.092
 bud+0.085
pan+0.085
Neg logit contributions
 much-0.092
 best-0.088
 many-0.083
 important-0.083
 And-0.080
Token.
Token ID13
Feature activation-7.08e-01

Pos logit contributions
 much+0.024
 best+0.024
 important+0.023
 Everyone+0.023
 And+0.022
Neg logit contributions
driver-0.032
paper-0.030
 unravel-0.030
 bud-0.029
ro-0.027
Token But
Token ID887
Feature activation-1.56e-01

Pos logit contributions
 best+0.013
 much+0.012
 important+0.012
 Everyone+0.011
ovable+0.011
Neg logit contributions
driver-0.020
 unravel-0.019
paper-0.019
 bud-0.018
ro-0.018
Token then
Token ID788
Feature activation+1.13e+00

Pos logit contributions
 much+0.003
 best+0.003
 important+0.003
 Everyone+0.003
 And+0.003
Neg logit contributions
driver-0.004
paper-0.004
 unravel-0.004
 bud-0.004
ro-0.004
Token the
Token ID262
Feature activation+4.31e-01

Pos logit contributions
driver+0.029
 unravel+0.028
paper+0.028
 bud+0.027
ro+0.025
Neg logit contributions
 much-0.023
 best-0.022
 important-0.022
 Everyone-0.022
 And-0.021
Token bunny
Token ID44915
Feature activation+1.14e+00

Pos logit contributions
driver+0.011
paper+0.011
 unravel+0.010
 bud+0.010
ro+0.009
Neg logit contributions
 best-0.009
 much-0.009
 important-0.009
 Everyone-0.009
 And-0.008
Token had
Token ID550
Feature activation+7.83e-01

Pos logit contributions
driver+0.030
paper+0.029
 unravel+0.027
 bud+0.026
ro+0.025
Neg logit contributions
 best-0.023
 much-0.023
 important-0.023
 named-0.021
 And-0.021
Token a
Token ID257
Feature activation+2.85e+00

Pos logit contributions
driver+0.023
paper+0.022
 unravel+0.022
 bud+0.021
ro+0.021
Neg logit contributions
 much-0.014
 best-0.013
 important-0.013
 many-0.012
 Everyone-0.012
Token surprise
Token ID5975
Feature activation+2.17e+00

Pos logit contributions
driver+0.084
paper+0.082
 unravel+0.079
ro+0.075
 bud+0.075
Neg logit contributions
 much-0.051
 best-0.050
 important-0.048
 And-0.043
 Everyone-0.043
Token.
Token ID13
Feature activation-2.08e-01

Pos logit contributions
driver+0.069
paper+0.068
 unravel+0.067
ro+0.063
 bud+0.062
Neg logit contributions
 much-0.033
 best-0.032
 important-0.030
 And-0.028
 named-0.028
Token The
Token ID383
Feature activation+7.32e-01

Pos logit contributions
 best+0.004
 important+0.004
 much+0.004
ovable+0.003
 lucky+0.003
Neg logit contributions
driver-0.006
 unravel-0.006
paper-0.005
ro-0.005
 bud-0.005
Token doctor
Token ID6253
Feature activation-1.16e-01

Pos logit contributions
driver+0.018
paper+0.018
 unravel+0.017
 bud+0.016
ro+0.016
Neg logit contributions
 best-0.016
 much-0.016
 important-0.015
 Everyone-0.015
 And-0.014
Token worry
Token ID5490
Feature activation+2.46e+00

Pos logit contributions
driver+0.048
paper+0.046
 unravel+0.045
 bud+0.042
ho+0.041
Neg logit contributions
 much-0.046
 best-0.044
 important-0.044
 Everyone-0.043
 And-0.041
Token,
Token ID11
Feature activation+2.28e+00

Pos logit contributions
driver+0.062
paper+0.059
 unravel+0.059
 bud+0.057
ro+0.054
Neg logit contributions
 much-0.051
 best-0.051
 important-0.049
 Everyone-0.049
 And-0.048
Token my
Token ID616
Feature activation+1.35e+00

Pos logit contributions
driver+0.060
 unravel+0.058
paper+0.057
 bud+0.056
 clip+0.053
Neg logit contributions
 much-0.044
 best-0.043
 Everyone-0.042
 important-0.042
 And-0.041
Token love
Token ID1842
Feature activation+9.35e-01

Pos logit contributions
driver+0.035
 unravel+0.033
paper+0.033
 bud+0.033
ro+0.031
Neg logit contributions
 much-0.026
 Everyone-0.025
 best-0.025
 important-0.025
 And-0.024
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
driver+0.024
 unravel+0.024
paper+0.023
 bud+0.022
ho+0.022
Neg logit contributions
 much-0.020
 important-0.019
 best-0.019
 And-0.018
 Everyone-0.017
Token I
Token ID314
Feature activation+1.82e+00

Pos logit contributions
 best+0.023
 much+0.023
 important+0.022
 many+0.021
 lucky+0.020
Neg logit contributions
driver-0.025
 unravel-0.022
ro-0.022
paper-0.021
 bud-0.021
Token can
Token ID460
Feature activation+2.63e+00

Pos logit contributions
driver+0.048
paper+0.048
 unravel+0.044
 bud+0.043
ho+0.042
Neg logit contributions
 much-0.038
 best-0.036
 important-0.036
 And-0.034
 Everyone-0.033
Token help
Token ID1037
Feature activation+1.07e+00

Pos logit contributions
paper+0.069
driver+0.067
ho+0.064
 unravel+0.062
ro+0.062
Neg logit contributions
 Everyone-0.053
 much-0.053
 best-0.052
 important-0.050
 And-0.048
Token you
Token ID345
Feature activation-8.04e-01

Pos logit contributions
driver+0.032
paper+0.030
 unravel+0.030
 bud+0.029
ro+0.028
Neg logit contributions
 much-0.018
 best-0.017
 important-0.017
 Everyone-0.017
 And-0.016
Token,"
Token ID553
Feature activation+5.54e-01

Pos logit contributions
 much+0.018
 best+0.017
 important+0.017
 Everyone+0.016
 And+0.016
Neg logit contributions
driver-0.019
paper-0.019
ho-0.019
 unravel-0.018
ro-0.017
Token her
Token ID607
Feature activation+5.39e-01

Pos logit contributions
driver+0.012
 unravel+0.011
paper+0.011
 bud+0.011
 clip+0.010
Neg logit contributions
 much-0.013
 best-0.013
 important-0.013
 And-0.012
 Everyone-0.012
Token the
Token ID262
Feature activation+1.28e+00

Pos logit contributions
driver+0.013
 unravel+0.012
paper+0.012
 bud+0.012
 rip+0.011
Neg logit contributions
 important-0.008
 much-0.007
 best-0.007
 glad-0.007
 And-0.007
Token measure
Token ID3953
Feature activation+9.16e-01

Pos logit contributions
driver+0.028
paper+0.027
 unravel+0.027
ro+0.024
 bud+0.024
Neg logit contributions
 best-0.032
 much-0.031
 important-0.030
 Everyone-0.029
 And-0.029
Token.
Token ID13
Feature activation+1.99e-02

Pos logit contributions
 unravel+0.024
driver+0.024
paper+0.024
 bud+0.022
ro+0.022
Neg logit contributions
 much-0.018
 best-0.017
 And-0.016
 Everyone-0.016
 important-0.016
Token She
Token ID1375
Feature activation+1.59e+00

Pos logit contributions
driver+0.001
paper+0.001
 unravel+0.001
 bud+0.000
ge+0.000
Neg logit contributions
 much-0.000
 Everyone-0.000
 And-0.000
 best-0.000
 important-0.000
Token liked
Token ID8288
Feature activation+6.81e-01

Pos logit contributions
driver+0.039
paper+0.038
 unravel+0.035
ge+0.034
pan+0.034
Neg logit contributions
 much-0.035
 best-0.034
 named-0.034
 And-0.033
 important-0.033
Token to
Token ID284
Feature activation+3.15e+00

Pos logit contributions
driver+0.021
 unravel+0.020
paper+0.020
 bud+0.019
ro+0.019
Neg logit contributions
 much-0.011
 best-0.011
 important-0.010
 many-0.010
 Everyone-0.010
Token measure
Token ID3953
Feature activation-6.15e-01

Pos logit contributions
driver+0.082
paper+0.080
ho+0.074
 unravel+0.074
ge+0.072
Neg logit contributions
 much-0.065
 Everyone-0.061
 And-0.060
 best-0.060
 important-0.059
Token things
Token ID1243
Feature activation-1.27e+00

Pos logit contributions
 much+0.010
 best+0.009
 important+0.009
 Everyone+0.009
 And+0.009
Neg logit contributions
driver-0.019
 unravel-0.018
paper-0.018
 bud-0.017
ro-0.017
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
 much+0.020
 best+0.020
 about+0.017
 And+0.017
 important+0.017
Neg logit contributions
driver-0.039
 unravel-0.039
paper-0.039
ASH-0.038
 swoop-0.037
Token She
Token ID1375
Feature activation+1.41e+00

Pos logit contributions
 much+0.021
 best+0.021
 important+0.020
 Everyone+0.020
 And+0.020
Neg logit contributions
driver-0.026
paper-0.024
 unravel-0.024
 bud-0.023
ro-0.022
Token measured
Token ID8630
Feature activation+1.36e+00

Pos logit contributions
driver+0.037
paper+0.036
 unravel+0.035
ge+0.034
pan+0.034
Neg logit contributions
 much-0.028
 best-0.028
 important-0.027
 named-0.027
 And-0.027
Token tired
Token ID10032
Feature activation+2.66e+00

Pos logit contributions
 much+0.026
 best+0.025
 important+0.024
 Everyone+0.024
 And+0.023
Neg logit contributions
driver-0.055
paper-0.053
 unravel-0.053
 bud-0.050
ro-0.049
Token to
Token ID284
Feature activation+2.90e+00

Pos logit contributions
driver+0.076
 unravel+0.075
paper+0.072
 clip+0.068
 swoop+0.067
Neg logit contributions
 much-0.052
 best-0.048
 important-0.044
 many-0.044
 about-0.044
Token run
Token ID1057
Feature activation+1.98e+00

Pos logit contributions
driver+0.071
paper+0.068
 unravel+0.066
 bud+0.062
ho+0.060
Neg logit contributions
 much-0.064
 best-0.063
 Everyone-0.060
 important-0.060
 And-0.059
Token anymore
Token ID7471
Feature activation-2.99e-01

Pos logit contributions
driver+0.052
paper+0.050
 unravel+0.050
 bud+0.047
ho+0.045
Neg logit contributions
 much-0.041
 best-0.039
 important-0.038
 Everyone-0.037
 And-0.037
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
 much+0.006
 best+0.006
 important+0.005
 Everyone+0.005
 And+0.005
Neg logit contributions
driver-0.008
paper-0.008
 unravel-0.008
 bud-0.007
ro-0.007
Token He
Token ID679
Feature activation+1.01e+00

Pos logit contributions
 best+0.033
 important+0.032
 much+0.030
ovable+0.030
 lucky+0.028
Neg logit contributions
driver-0.036
 unravel-0.035
ro-0.034
paper-0.033
ho-0.031
Token was
Token ID373
Feature activation-2.92e+00

Pos logit contributions
 unravel+0.022
driver+0.022
paper+0.021
 bud+0.020
ro+0.019
Neg logit contributions
 much-0.024
 best-0.024
 Everyone-0.023
 important-0.023
 And-0.022
Token very
Token ID845
Feature activation-3.55e+00

Pos logit contributions
 Everyone+0.050
 And+0.047
 People+0.045
 Everybody+0.043
ovable+0.042
Neg logit contributions
 unravel-0.080
driver-0.075
 bud-0.073
paper-0.072
 swoop-0.071
Token scared
Token ID12008
Feature activation+9.79e-01

Pos logit contributions
 Everyone+0.054
 best+0.050
 And+0.050
 People+0.048
 films+0.046
Neg logit contributions
 unravel-0.100
driver-0.096
 bud-0.096
paper-0.094
 rip-0.094
Token and
Token ID290
Feature activation-8.08e-01

Pos logit contributions
driver+0.026
 unravel+0.025
paper+0.025
 bud+0.023
ro+0.023
Neg logit contributions
 much-0.020
 best-0.019
 important-0.019
 Everyone-0.018
 And-0.018
Token his
Token ID465
Feature activation-2.52e-02

Pos logit contributions
 best+0.016
 much+0.016
 Everyone+0.016
 important+0.016
 And+0.016
Neg logit contributions
driver-0.020
 unravel-0.020
paper-0.019
 bud-0.018
ho-0.018
Token floor
Token ID4314
Feature activation+1.33e+00

Pos logit contributions
 much+0.010
 best+0.010
 important+0.010
 Everyone+0.010
 And+0.009
Neg logit contributions
driver-0.009
paper-0.009
 unravel-0.008
 bud-0.008
ro-0.008
Token.
Token ID13
Feature activation-1.49e+00

Pos logit contributions
 unravel+0.041
driver+0.041
paper+0.040
 swoop+0.038
 bud+0.038
Neg logit contributions
 much-0.021
 best-0.020
 And-0.019
 many-0.019
 Everyone-0.017
Token
Token ID198
Feature activation+1.62e-02

Pos logit contributions
 important+0.029
 best+0.029
ovable+0.026
 much+0.026
 lucky+0.025
Neg logit contributions
driver-0.039
ro-0.036
 unravel-0.036
paper-0.036
 bud-0.034
Token
Token ID198
Feature activation+2.81e-01

Pos logit contributions
driver+0.000
 unravel+0.000
paper+0.000
 bud+0.000
ro+0.000
Neg logit contributions
 best-0.000
 important-0.000
ovable-0.000
 Everyone-0.000
 much-0.000
TokenOne
Token ID3198
Feature activation-1.42e+00

Pos logit contributions
driver+0.006
 unravel+0.005
 bud+0.005
paper+0.005
ro+0.005
Neg logit contributions
 best-0.007
 important-0.007
 much-0.007
 Everyone-0.007
ovable-0.007
Token day
Token ID1110
Feature activation+8.35e-01

Pos logit contributions
ovable+0.024
 Everyone+0.023
 Everybody+0.022
 From+0.022
 People+0.020
Neg logit contributions
ro-0.038
 bud-0.037
driver-0.035
 rip-0.035
 bite-0.034
Token,
Token ID11
Feature activation+2.03e-01

Pos logit contributions
driver+0.016
 bud+0.016
paper+0.015
 unravel+0.014
ro+0.013
Neg logit contributions
 important-0.022
ovable-0.021
 dearly-0.020
 glad-0.020
 best-0.020
Token Mom
Token ID11254
Feature activation+5.03e-01

Pos logit contributions
driver+0.005
 unravel+0.005
 bud+0.005
paper+0.005
 clip+0.004
Neg logit contributions
ovable-0.004
 important-0.004
 much-0.004
 And-0.004
 best-0.004
Token says
Token ID1139
Feature activation+1.26e+00

Pos logit contributions
driver+0.012
paper+0.012
 unravel+0.012
 bud+0.011
ro+0.011
Neg logit contributions
 best-0.011
 much-0.011
 important-0.011
 And-0.011
 Everyone-0.010
Token,
Token ID11
Feature activation+2.72e+00

Pos logit contributions
driver+0.035
 unravel+0.033
paper+0.033
 bud+0.032
ro+0.031
Neg logit contributions
 much-0.023
 best-0.023
 important-0.022
 Everyone-0.022
 And-0.021
Token "
Token ID366
Feature activation+4.36e+00

Pos logit contributions
driver+0.070
paper+0.067
 unravel+0.067
 bud+0.064
ro+0.061
Neg logit contributions
 much-0.056
 best-0.055
 important-0.053
 Everyone-0.053
 And-0.052
Token goodbye
Token ID24829
Feature activation+1.34e+00

Pos logit contributions
driver+0.018
paper+0.017
 unravel+0.017
 bud+0.016
ro+0.015
Neg logit contributions
 much-0.010
 best-0.010
 important-0.010
 Everyone-0.010
 And-0.009
Token to
Token ID284
Feature activation+2.01e-01

Pos logit contributions
driver+0.032
 unravel+0.031
paper+0.030
 bud+0.029
pan+0.027
Neg logit contributions
 much-0.031
 best-0.030
 And-0.029
 important-0.029
 Everyone-0.028
Token the
Token ID262
Feature activation+1.39e-01

Pos logit contributions
 unravel+0.005
 bud+0.004
 rip+0.004
paper+0.004
driver+0.004
Neg logit contributions
 important-0.004
 much-0.004
 best-0.004
ovable-0.004
 And-0.004
Token soft
Token ID2705
Feature activation+2.57e+00

Pos logit contributions
driver+0.004
paper+0.004
 unravel+0.004
 bud+0.003
ho+0.003
Neg logit contributions
 best-0.003
 important-0.003
 much-0.003
 Everyone-0.003
 And-0.002
Token tank
Token ID6873
Feature activation+1.35e-01

Pos logit contributions
driver+0.071
paper+0.069
 unravel+0.067
 bud+0.064
 clip+0.062
Neg logit contributions
 best-0.048
 much-0.047
 important-0.047
 Everyone-0.047
 And-0.045
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
driver+0.003
paper+0.003
 unravel+0.003
 bud+0.003
ro+0.003
Neg logit contributions
 much-0.003
 best-0.003
 important-0.003
 And-0.003
 Everyone-0.003
Token thanked
Token ID26280
Feature activation+7.00e-01

Pos logit contributions
 much+0.024
 best+0.023
 important+0.023
 Everyone+0.023
 And+0.022
Neg logit contributions
driver-0.023
 unravel-0.022
paper-0.022
 bud-0.021
ro-0.020
Token it
Token ID340
Feature activation-1.38e+00

Pos logit contributions
 unravel+0.020
driver+0.020
paper+0.019
 bud+0.019
 rip+0.018
Neg logit contributions
 much-0.012
 best-0.012
 important-0.011
 And-0.011
 Everyone-0.010
Token for
Token ID329
Feature activation-1.51e+00

Pos logit contributions
ovable+0.026
 important+0.026
 much+0.025
 Everyone+0.025
 glad+0.025
Neg logit contributions
 unravel-0.031
 bud-0.031
driver-0.030
ro-0.028
paper-0.028
Token the
Token ID262
Feature activation+4.70e-01

Pos logit contributions
 much+0.028
 Everyone+0.027
 best+0.027
 And+0.027
 important+0.026
Neg logit contributions
 unravel-0.040
driver-0.039
paper-0.038
 bud-0.038
pan-0.035
Token wonderful
Token ID7932
Feature activation+2.16e+00

Pos logit contributions
driver+0.013
paper+0.012
 unravel+0.012
 bud+0.011
ro+0.011
Neg logit contributions
 much-0.009
 best-0.009
 important-0.009
 Everyone-0.009
 And-0.008
Token
Token ID198
Feature activation+2.42e-02

Pos logit contributions
 best+0.021
 much+0.021
 important+0.020
 Everyone+0.019
 And+0.018
Neg logit contributions
driver-0.029
 unravel-0.028
paper-0.028
 bud-0.026
ro-0.026
Token
Token ID198
Feature activation+6.46e-01

Pos logit contributions
driver+0.001
 unravel+0.001
paper+0.001
 bud+0.001
ro+0.001
Neg logit contributions
 best-0.000
 much-0.000
 important-0.000
 Everyone-0.000
 And-0.000
Token"
Token ID1
Feature activation+4.54e+00

Pos logit contributions
driver+0.014
paper+0.014
 unravel+0.014
 bud+0.013
ro+0.012
Neg logit contributions
 much-0.016
 best-0.015
 Everyone-0.015
 And-0.015
 important-0.015
TokenSorry
Token ID14385
Feature activation+2.01e+00

Pos logit contributions
driver+0.120
paper+0.116
 unravel+0.111
ge+0.104
 bud+0.102
Neg logit contributions
 best-0.092
 much-0.091
 Everyone-0.091
 And-0.089
 important-0.086
Token,
Token ID11
Feature activation+2.57e+00

Pos logit contributions
driver+0.056
paper+0.054
 unravel+0.054
 bud+0.051
ro+0.050
Neg logit contributions
 much-0.037
 best-0.036
 important-0.035
 Everyone-0.034
 And-0.034
Token mom
Token ID1995
Feature activation-1.53e+00

Pos logit contributions
driver+0.080
 unravel+0.077
paper+0.077
 bud+0.075
 clip+0.072
Neg logit contributions
 much-0.038
 best-0.037
 important-0.036
 Everyone-0.036
 And-0.035
Token,
Token ID11
Feature activation+2.36e+00

Pos logit contributions
 much+0.029
 best+0.029
 important+0.026
 And+0.026
 named+0.025
Neg logit contributions
 unravel-0.043
driver-0.043
paper-0.042
ro-0.040
pan-0.038
Token we
Token ID356
Feature activation+1.18e+00

Pos logit contributions
driver+0.061
paper+0.057
 unravel+0.057
 bud+0.056
 clip+0.053
Neg logit contributions
 much-0.048
 best-0.047
 important-0.046
 Everyone-0.046
 And-0.044
Token were
Token ID547
Feature activation-2.46e+00

Pos logit contributions
driver+0.028
 unravel+0.026
paper+0.026
 bud+0.025
ro+0.024
Neg logit contributions
 much-0.027
 best-0.026
 important-0.026
 Everyone-0.025
 And-0.025
Token just
Token ID655
Feature activation-2.18e+00

Pos logit contributions
 Everyone+0.053
 And+0.049
 Everybody+0.048
 People+0.047
 much+0.047
Neg logit contributions
driver-0.056
 unravel-0.056
 bud-0.056
paper-0.055
 rip-0.053
Token playing
Token ID2712
Feature activation-2.77e-01

Pos logit contributions
 Everyone+0.033
 much+0.032
 And+0.031
 best+0.031
 important+0.030
Neg logit contributions
driver-0.063
 unravel-0.062
paper-0.061
 bud-0.060
 rip-0.058
Token touch
Token ID3638
Feature activation-1.18e+00

Pos logit contributions
driver+0.046
paper+0.045
 unravel+0.045
 bud+0.043
ro+0.041
Neg logit contributions
 much-0.034
 best-0.033
 important-0.032
 Everyone-0.032
 And-0.031
Token them
Token ID606
Feature activation-2.10e+00

Pos logit contributions
 much+0.024
 best+0.024
 important+0.023
 And+0.023
 Everyone+0.023
Neg logit contributions
driver-0.028
paper-0.028
 unravel-0.027
 bud-0.026
 rip-0.025
Token,
Token ID11
Feature activation+1.35e+00

Pos logit contributions
 much+0.042
 best+0.041
 important+0.040
 Everyone+0.040
 And+0.039
Neg logit contributions
driver-0.054
paper-0.052
 unravel-0.052
 bud-0.050
ro-0.048
Token and
Token ID290
Feature activation+1.47e-01

Pos logit contributions
driver+0.036
paper+0.034
 unravel+0.034
 bud+0.033
ge+0.031
Neg logit contributions
 much-0.027
 best-0.026
 important-0.025
 And-0.024
 Everyone-0.024
Token look
Token ID804
Feature activation-6.10e-01

Pos logit contributions
driver+0.004
paper+0.004
 unravel+0.004
 bud+0.003
ro+0.003
Neg logit contributions
 much-0.003
 best-0.003
 important-0.003
 Everyone-0.003
 And-0.003
Token at
Token ID379
Feature activation-1.52e+00

Pos logit contributions
 Everyone+0.012
 And+0.011
ovable+0.011
 best+0.011
 People+0.011
Neg logit contributions
paper-0.014
 unravel-0.013
driver-0.013
 bud-0.013
 rip-0.013
Token their
Token ID511
Feature activation-2.32e+00

Pos logit contributions
 much+0.029
 best+0.028
 And+0.028
 Everyone+0.028
 important+0.027
Neg logit contributions
driver-0.039
paper-0.038
 unravel-0.038
 bud-0.036
 rip-0.035
Token colors
Token ID7577
Feature activation-1.28e+00

Pos logit contributions
 much+0.048
 best+0.048
 important+0.046
 Everyone+0.046
 And+0.045
Neg logit contributions
driver-0.058
paper-0.056
 unravel-0.056
 bud-0.053
ro-0.051
Token.
Token ID13
Feature activation-2.05e+00

Pos logit contributions
 best+0.020
 much+0.020
 important+0.017
 about+0.017
 many+0.017
Neg logit contributions
driver-0.041
 unravel-0.039
paper-0.038
ASH-0.037
ho-0.036
Token She
Token ID1375
Feature activation+2.16e-01

Pos logit contributions
 important+0.042
 best+0.041
 much+0.040
ovable+0.038
 lucky+0.037
Neg logit contributions
driver-0.049
 unravel-0.048
ro-0.046
paper-0.046
 bud-0.044
Token had
Token ID550
Feature activation-1.70e+00

Pos logit contributions
 unravel+0.005
driver+0.005
paper+0.005
 bud+0.005
ho+0.005
Neg logit contributions
 much-0.004
 Everyone-0.004
 best-0.004
 important-0.004
 And-0.004
Token a
Token ID257
Feature activation-1.51e+00

Pos logit contributions
 best+0.058
 important+0.055
 Everyone+0.054
ovable+0.054
 And+0.053
Neg logit contributions
 unravel-0.098
 rip-0.092
driver-0.092
paper-0.091
 bud-0.087
Token big
Token ID1263
Feature activation+7.19e-01

Pos logit contributions
 Everyone+0.031
 best+0.029
 And+0.029
ovable+0.029
 much+0.028
Neg logit contributions
 unravel-0.036
paper-0.035
driver-0.034
 bud-0.033
 rip-0.033
Token castle
Token ID16669
Feature activation-7.55e-01

Pos logit contributions
paper+0.014
driver+0.014
 unravel+0.014
 bud+0.014
ho+0.013
Neg logit contributions
 much-0.017
 Everyone-0.017
 best-0.017
 important-0.017
 And-0.016
Token with
Token ID351
Feature activation-5.17e+00

Pos logit contributions
 much+0.017
 best+0.016
 Everyone+0.016
 important+0.016
 And+0.015
Neg logit contributions
driver-0.017
 unravel-0.017
paper-0.017
 bud-0.016
ro-0.015
Token her
Token ID607
Feature activation-3.26e+00

Pos logit contributions
 And+0.060
 best+0.058
 important+0.057
 Everyone+0.054
 much+0.052
Neg logit contributions
paper-0.168
driver-0.167
 unravel-0.167
 bud-0.159
 rip-0.155
Token mom
Token ID1995
Feature activation-2.72e+00

Pos logit contributions
 much+0.030
 Everyone+0.029
 important+0.026
 And+0.026
 best+0.022
Neg logit contributions
driver-0.115
paper-0.114
 unravel-0.112
 bud-0.112
ho-0.108
Tokenmy
Token ID1820
Feature activation-2.32e+00

Pos logit contributions
 much+0.065
 best+0.063
 important+0.062
 Everyone+0.062
 And+0.060
Neg logit contributions
driver-0.061
paper-0.058
 unravel-0.058
 bud-0.054
ro-0.052
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
 much+0.052
 best+0.051
 important+0.050
 Everyone+0.049
 And+0.048
Neg logit contributions
driver-0.055
paper-0.052
 unravel-0.052
 bud-0.050
ro-0.047
Token daddy
Token ID35822
Feature activation-2.33e+00

Pos logit contributions
 much+0.023
 best+0.023
 important+0.022
 Everyone+0.022
 And+0.021
Neg logit contributions
driver-0.049
paper-0.047
 unravel-0.047
 bud-0.045
ro-0.044
Token,
Token ID11
Feature activation-6.66e-01

Pos logit contributions
 much+0.048
 best+0.047
 important+0.046
 Everyone+0.045
 And+0.044
Neg logit contributions
driver-0.060
paper-0.057
 unravel-0.057
 bud-0.054
ro-0.052
Token the
Token ID262
Feature activation-4.74e-01

Pos logit contributions
 Everyone+0.012
 important+0.012
 much+0.012
 best+0.011
ovable+0.011
Neg logit contributions
 unravel-0.017
driver-0.017
paper-0.017
 bud-0.017
 clip-0.016
Token very
Token ID845
Feature activation-2.94e+00

Pos logit contributions
 Everyone+0.053
 And+0.048
 Everybody+0.046
 People+0.042
 best+0.042
Neg logit contributions
 unravel-0.110
driver-0.109
paper-0.107
 bud-0.106
 clip-0.104
Token thick
Token ID6546
Feature activation-3.07e-01

Pos logit contributions
 Everyone+0.051
 And+0.050
 best+0.049
 much+0.047
 Everybody+0.045
Neg logit contributions
 bud-0.075
driver-0.075
 unravel-0.074
paper-0.074
ho-0.072
Token and
Token ID290
Feature activation-2.59e+00

Pos logit contributions
 much+0.007
 Everyone+0.007
 best+0.007
 important+0.007
 And+0.006
Neg logit contributions
driver-0.007
paper-0.007
 unravel-0.007
 bud-0.006
ro-0.006
Token strong
Token ID1913
Feature activation+2.68e-01

Pos logit contributions
 Everyone+0.051
 best+0.050
 And+0.049
 much+0.048
ovable+0.046
Neg logit contributions
driver-0.065
paper-0.063
 unravel-0.063
 bud-0.060
 rip-0.058
Token.
Token ID13
Feature activation-1.67e+00

Pos logit contributions
driver+0.006
paper+0.006
 unravel+0.006
 bud+0.006
ro+0.006
Neg logit contributions
 much-0.006
 best-0.006
 Everyone-0.006
 important-0.006
 And-0.005
Token One
Token ID1881
Feature activation-1.64e+00

Pos logit contributions
 best+0.032
 important+0.031
 much+0.030
ovable+0.029
 glad+0.029
Neg logit contributions
driver-0.044
 unravel-0.042
paper-0.041
ro-0.040
 rip-0.038
Token day
Token ID1110
Feature activation+6.40e-01

Pos logit contributions
 Everyone+0.031
ooks+0.031
 Everybody+0.029
ovable+0.028
 importantly+0.027
Neg logit contributions
 knife-0.037
 bite-0.037
ho-0.037
 bud-0.036
ro-0.036
Token,
Token ID11
Feature activation-6.16e-01

Pos logit contributions
driver+0.015
paper+0.014
 unravel+0.014
 bud+0.014
 rip+0.013
Neg logit contributions
 best-0.014
 important-0.014
ovable-0.014
 much-0.013
 Everyone-0.013
Token the
Token ID262
Feature activation-2.44e-01

Pos logit contributions
 best+0.009
 much+0.009
 important+0.009
 And+0.009
 Everyone+0.009
Neg logit contributions
driver-0.019
 unravel-0.018
paper-0.018
 bud-0.017
 rip-0.017
Token staff
Token ID3085
Feature activation-1.27e+00

Pos logit contributions
 much+0.006
 And+0.006
 Everyone+0.006
 best+0.006
 important+0.006
Neg logit contributions
driver-0.005
 bud-0.005
paper-0.005
 unravel-0.005
ro-0.004
Token decided
Token ID3066
Feature activation+1.85e+00

Pos logit contributions
 much+0.027
 best+0.027
 important+0.026
 Everyone+0.026
 And+0.025
Neg logit contributions
driver-0.031
paper-0.030
 unravel-0.030
 bud-0.028
ro-0.027
Token my
Token ID616
Feature activation+5.94e-01

Pos logit contributions
 Everyone+0.026
 Everybody+0.024
ovable+0.024
 People+0.022
 And+0.022
Neg logit contributions
 unravel-0.040
 bud-0.039
driver-0.039
paper-0.038
 rip-0.037
Token pill
Token ID9582
Feature activation+1.14e+00

Pos logit contributions
driver+0.018
paper+0.017
 unravel+0.017
 bud+0.017
ro+0.016
Neg logit contributions
 much-0.009
 Everyone-0.009
 best-0.009
 important-0.009
 And-0.008
Token to
Token ID284
Feature activation+2.87e+00

Pos logit contributions
 unravel+0.029
paper+0.028
driver+0.028
 swoop+0.027
ge+0.027
Neg logit contributions
 best-0.024
 much-0.023
 important-0.022
 And-0.021
 Everyone-0.020
Token make
Token ID787
Feature activation-1.74e+00

Pos logit contributions
driver+0.075
paper+0.072
 unravel+0.068
 bud+0.065
ho+0.064
Neg logit contributions
 much-0.061
 best-0.057
 important-0.057
 Everyone-0.056
 And-0.055
Token me
Token ID502
Feature activation-1.07e+00

Pos logit contributions
 best+0.032
 much+0.031
 And+0.031
 Everyone+0.031
 important+0.031
Neg logit contributions
driver-0.045
 unravel-0.044
paper-0.044
 bud-0.041
 rip-0.041
Token feel
Token ID1254
Feature activation-4.08e+00

Pos logit contributions
 much+0.021
 best+0.021
 Everyone+0.020
 important+0.020
 And+0.020
Neg logit contributions
driver-0.028
paper-0.027
 unravel-0.027
 bud-0.025
ro-0.024
Token better
Token ID1365
Feature activation+4.15e-01

Pos logit contributions
 Everyone+0.069
 best+0.066
 And+0.065
ovable+0.061
 much+0.061
Neg logit contributions
 unravel-0.110
driver-0.106
paper-0.105
 bud-0.103
 rip-0.102
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
driver+0.012
 unravel+0.011
paper+0.011
 bud+0.010
ho+0.010
Neg logit contributions
 much-0.008
 best-0.008
 important-0.008
 And-0.007
 Everyone-0.007
Token Mom
Token ID11254
Feature activation-3.92e-01

Pos logit contributions
 best+0.028
 important+0.027
 much+0.027
 many+0.025
ovable+0.025
Neg logit contributions
driver-0.024
 unravel-0.022
ro-0.021
paper-0.020
 bud-0.020
Token supports
Token ID6971
Feature activation-3.54e-01

Pos logit contributions
 best+0.008
 much+0.008
 important+0.008
 And+0.008
 Everyone+0.008
Neg logit contributions
driver-0.010
paper-0.009
 unravel-0.009
 bud-0.009
ro-0.009
Token me
Token ID502
Feature activation-2.06e+00

Pos logit contributions
 much+0.007
 best+0.006
 important+0.006
 Everyone+0.006
 And+0.006
Neg logit contributions
driver-0.010
 unravel-0.009
paper-0.009
 bud-0.009
ro-0.009
Token he
Token ID339
Feature activation-1.99e-01

Pos logit contributions
 best+0.062
 much+0.061
 And+0.061
 Everyone+0.060
 important+0.060
Neg logit contributions
driver-0.067
 unravel-0.064
paper-0.063
 bud-0.060
 rip-0.056
Token made
Token ID925
Feature activation-2.34e+00

Pos logit contributions
 much+0.004
 best+0.004
 important+0.004
 Everyone+0.004
 And+0.004
Neg logit contributions
driver-0.006
paper-0.005
 unravel-0.005
 bud-0.005
pan-0.005
Token mistakes
Token ID10135
Feature activation+2.87e-01

Pos logit contributions
 much+0.048
 best+0.047
 And+0.047
 important+0.047
ovable+0.046
Neg logit contributions
driver-0.055
 unravel-0.053
paper-0.053
 bud-0.048
 rip-0.047
Token.
Token ID13
Feature activation-1.73e+00

Pos logit contributions
driver+0.008
paper+0.008
 unravel+0.008
ho+0.008
 bud+0.008
Neg logit contributions
 much-0.005
 best-0.005
 important-0.005
 And-0.005
 Everyone-0.005
Token<|endoftext|>
Token ID50256
Feature activation+8.49e-02

Pos logit contributions
 best+0.038
 important+0.036
 much+0.034
ovable+0.033
 adventurous+0.032
Neg logit contributions
driver-0.042
 unravel-0.039
paper-0.037
ro-0.036
 bud-0.034
TokenM
Token ID44
Feature activation-3.38e+00

Pos logit contributions
driver+0.002
 unravel+0.002
paper+0.002
 bud+0.002
ro+0.002
Neg logit contributions
 much-0.002
 best-0.002
 important-0.002
 Everyone-0.001
 And-0.001
Tokenama
Token ID1689
Feature activation-2.23e+00

Pos logit contributions
 best+0.061
 much+0.061
 important+0.060
 Everyone+0.058
 And+0.057
Neg logit contributions
driver-0.084
 bud-0.080
paper-0.080
 unravel-0.080
ho-0.078
Token and
Token ID290
Feature activation-6.66e-01

Pos logit contributions
 much+0.048
 best+0.047
 Everyone+0.046
 important+0.046
 And+0.045
Neg logit contributions
driver-0.055
paper-0.052
 unravel-0.052
 bud-0.049
ro-0.047
Token Daddy
Token ID26926
Feature activation-1.31e+00

Pos logit contributions
 much+0.012
 best+0.012
 important+0.012
 Everyone+0.011
 And+0.011
Neg logit contributions
driver-0.018
 unravel-0.018
paper-0.018
 bud-0.017
ho-0.017
Token were
Token ID547
Feature activation-4.47e+00

Pos logit contributions
 much+0.027
 best+0.026
 Everyone+0.026
 important+0.025
 And+0.025
Neg logit contributions
driver-0.034
paper-0.032
 unravel-0.032
 bud-0.030
ro-0.029
Token having
Token ID1719
Feature activation-3.67e+00

Pos logit contributions
 Everyone+0.071
 important+0.066
 much+0.065
ovable+0.064
 And+0.064
Neg logit contributions
driver-0.118
 unravel-0.113
paper-0.108
 bud-0.105
ho-0.104
Token have
Token ID423
Feature activation-7.75e-01

Pos logit contributions
 much+0.018
 best+0.018
 important+0.017
 Everyone+0.017
 And+0.017
Neg logit contributions
driver-0.019
 unravel-0.018
paper-0.018
 bud-0.017
ro-0.016
Token any
Token ID597
Feature activation-1.21e+00

Pos logit contributions
 best+0.013
 Everyone+0.013
 much+0.013
 And+0.013
 important+0.013
Neg logit contributions
driver-0.022
paper-0.021
 unravel-0.021
 bud-0.020
 clip-0.019
Token ideas
Token ID4213
Feature activation-5.05e-01

Pos logit contributions
 much+0.022
 best+0.022
 Everyone+0.021
 important+0.021
 And+0.021
Neg logit contributions
driver-0.033
 unravel-0.032
paper-0.032
 bud-0.031
ro-0.029
Token?
Token ID30
Feature activation-2.09e+00

Pos logit contributions
 best+0.008
 much+0.008
 important+0.007
 about+0.007
 Everyone+0.007
Neg logit contributions
driver-0.016
 unravel-0.015
paper-0.015
ho-0.015
 trigger-0.014
Tokenâ
Token ID22940
Feature activation-3.41e-01

Pos logit contributions
 best+0.044
 important+0.043
 much+0.043
 glad+0.037
 lucky+0.037
Neg logit contributions
driver-0.054
 unravel-0.049
paper-0.047
ro-0.045
 bud-0.045
Token
Token ID26391
Feature activation-3.44e+00

Pos logit contributions
 much+0.008
 best+0.008
 important+0.007
 Everyone+0.007
 And+0.007
Neg logit contributions
driver-0.008
 unravel-0.008
paper-0.008
 bud-0.007
ro-0.007
Token
Token ID198
Feature activation+1.45e-01

Pos logit contributions
 much+0.087
 best+0.086
 important+0.085
 Everyone+0.083
 And+0.080
Neg logit contributions
driver-0.071
paper-0.068
 unravel-0.067
 bud-0.062
ro-0.059
Token
Token ID198
Feature activation+6.55e-01

Pos logit contributions
driver+0.004
 unravel+0.004
paper+0.004
 bud+0.004
ro+0.004
Neg logit contributions
 best-0.003
 important-0.003
 much-0.003
 Everyone-0.002
 And-0.002
TokenThe
Token ID464
Feature activation-8.85e-02

Pos logit contributions
driver+0.019
paper+0.018
 unravel+0.018
 bud+0.017
ho+0.016
Neg logit contributions
 much-0.012
 Everyone-0.012
 best-0.011
 important-0.011
 And-0.011
Token girl
Token ID2576
Feature activation-1.48e+00

Pos logit contributions
 much+0.002
 best+0.002
 Everyone+0.002
 And+0.002
 important+0.002
Neg logit contributions
driver-0.002
 unravel-0.002
paper-0.002
 bud-0.002
ro-0.002
Token smiled
Token ID13541
Feature activation-6.12e-02

Pos logit contributions
 best+0.032
 much+0.032
 important+0.031
 And+0.031
 Everyone+0.031
Neg logit contributions
driver-0.036
paper-0.035
 unravel-0.034
 bud-0.032
ge-0.031
Token
Token ID198
Feature activation+2.11e+00

Pos logit contributions
driver+0.034
 unravel+0.032
paper+0.032
 bud+0.030
ro+0.030
Neg logit contributions
 best-0.032
 important-0.030
ovable-0.029
 much-0.028
 lucky-0.027
TokenTim
Token ID14967
Feature activation+1.27e-01

Pos logit contributions
driver+0.039
 unravel+0.037
paper+0.037
 bud+0.034
ro+0.032
Neg logit contributions
 best-0.058
 much-0.057
 important-0.056
 Everyone-0.055
 And-0.055
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
driver+0.003
paper+0.003
 unravel+0.003
 bud+0.003
ro+0.003
Neg logit contributions
 best-0.002
 much-0.002
 important-0.002
 And-0.002
 Everyone-0.002
Token Sam
Token ID3409
Feature activation-9.73e-01

Pos logit contributions
driver+0.029
paper+0.028
 unravel+0.028
 bud+0.026
ro+0.025
Neg logit contributions
 much-0.019
 best-0.019
 important-0.018
 Everyone-0.018
 And-0.017
Token played
Token ID2826
Feature activation+3.24e-02

Pos logit contributions
 much+0.020
 best+0.020
 important+0.019
 Everyone+0.019
 And+0.019
Neg logit contributions
driver-0.024
 unravel-0.024
paper-0.023
 bud-0.022
ho-0.021
Token with
Token ID351
Feature activation-3.99e+00

Pos logit contributions
driver+0.001
 unravel+0.001
paper+0.001
 bud+0.001
ro+0.001
Neg logit contributions
 much-0.001
 best-0.001
 important-0.001
 Everyone-0.001
 And-0.001
Token the
Token ID262
Feature activation-3.98e-01

Pos logit contributions
ovable+0.071
 important+0.070
 best+0.069
 Everyone+0.069
 much+0.069
Neg logit contributions
driver-0.107
paper-0.105
 unravel-0.103
 rip-0.094
 bud-0.092
Token cars
Token ID5006
Feature activation-2.30e+00

Pos logit contributions
 best+0.007
 much+0.006
 important+0.006
 Everyone+0.006
 many+0.005
Neg logit contributions
driver-0.012
paper-0.012
 unravel-0.012
ho-0.012
 bud-0.012
Token.
Token ID13
Feature activation-9.26e-01

Pos logit contributions
 much+0.040
 best+0.039
 important+0.037
 Everyone+0.036
 And+0.036
Neg logit contributions
driver-0.067
 unravel-0.065
paper-0.065
 bud-0.062
ro-0.060
Token They
Token ID1119
Feature activation-1.60e+00

Pos logit contributions
 best+0.015
 much+0.014
 important+0.014
ovable+0.013
 glad+0.013
Neg logit contributions
driver-0.028
paper-0.027
 unravel-0.027
ro-0.025
 bud-0.025
Token raced
Token ID32897
Feature activation+2.17e-01

Pos logit contributions
 much+0.033
 best+0.032
 important+0.032
 Everyone+0.032
 And+0.031
Neg logit contributions
driver-0.041
 unravel-0.039
paper-0.039
 bud-0.037
ro-0.035
Token Sara
Token ID24799
Feature activation-4.13e+00

Pos logit contributions
driver+0.007
 unravel+0.007
paper+0.007
 bud+0.007
 clip+0.007
Neg logit contributions
 much-0.005
 Everyone-0.005
 best-0.005
 important-0.005
 And-0.005
Token.
Token ID13
Feature activation-2.34e+00

Pos logit contributions
 much+0.092
 Everyone+0.090
 best+0.089
 important+0.088
 And+0.084
Neg logit contributions
driver-0.096
paper-0.091
 unravel-0.090
 bud-0.086
ro-0.082
Token We
Token ID775
Feature activation-3.77e-01

Pos logit contributions
 important+0.059
 best+0.058
 much+0.057
ovable+0.054
 adventurous+0.053
Neg logit contributions
driver-0.044
ro-0.040
 unravel-0.037
ASH-0.036
 rip-0.035
Token love
Token ID1842
Feature activation-4.21e-01

Pos logit contributions
 much+0.008
 Everyone+0.008
 important+0.008
 best+0.008
 And+0.008
Neg logit contributions
driver-0.009
 unravel-0.009
 bud-0.008
paper-0.008
 clip-0.008
Token you
Token ID345
Feature activation-3.01e+00

Pos logit contributions
 much+0.009
 best+0.008
ovable+0.008
 Everyone+0.008
 important+0.008
Neg logit contributions
paper-0.010
driver-0.010
 unravel-0.010
 bud-0.009
ho-0.009
Token very
Token ID845
Feature activation-3.43e+00

Pos logit contributions
 Everyone+0.066
ovable+0.063
 important+0.063
 much+0.063
 People+0.061
Neg logit contributions
driver-0.065
 bud-0.062
paper-0.060
 unravel-0.059
 clip-0.056
Token much
Token ID881
Feature activation-2.39e+00

Pos logit contributions
 Summer+0.026
 named+0.025
ovable+0.025
 best+0.025
 Everyone+0.025
Neg logit contributions
paper-0.109
driver-0.104
 rip-0.103
 bud-0.103
 unravel-0.100
Token.
Token ID13
Feature activation-2.64e+00

Pos logit contributions
 much+0.050
 best+0.050
 important+0.048
 Everyone+0.048
 And+0.047
Neg logit contributions
driver-0.059
paper-0.057
 unravel-0.056
 bud-0.054
ro-0.052
Token Please
Token ID4222
Feature activation+1.08e+00

Pos logit contributions
 best+0.061
 important+0.061
ovable+0.060
 much+0.059
 many+0.056
Neg logit contributions
driver-0.050
ro-0.045
 unravel-0.044
ASH-0.042
 rip-0.042
Token don
Token ID836
Feature activation+4.24e-01

Pos logit contributions
driver+0.031
paper+0.030
 unravel+0.029
ro+0.027
 bud+0.027
Neg logit contributions
 best-0.019
 much-0.019
 And-0.018
 Everyone-0.018
 important-0.018
Token't
Token ID470
Feature activation+2.27e+00

Pos logit contributions
driver+0.008
paper+0.008
 unravel+0.008
 bud+0.007
ro+0.007
Neg logit contributions
 much-0.011
 best-0.011
 Everyone-0.011
 important-0.011
 And-0.011