TOP ACTIVATIONS
MAX = 6.131

trick - or - t reating with her . Tim my
elephants , lions and ac rob ats . They are amazed
trick - or - t reating anymore . She sadly put
clown s and the ac rob ats and the lions and
it to me for safe keeping . He told me to
saw clown s and ac rob ats and lions and elephants
thought they looked like ac rob ats . She enjoyed the
of fun watching the ac rob ats and clown s .
the animals , the ac rob ats and the clown s
truck . They were different sizes and colors . Some had
at the frame from different angles and adm iring it .
look at it from all angles . It was so colorful
name their university The Gener ous University . Both of
he could get some sa us ages for his dinner .
a king with all his angles . But ,
trucks of different colors and sizes . They make sounds like
and some had short t us ks . Sara thought they
cars of different colors and sizes . He likes to make
. Some had long t us ks and some had short
clown s juggling and ac rob ats doing flips . But

MIN ACTIVATIONS
MIN = -4.253

sure his z ig z ags were neat and tidy .
lines to z ig z ags . One day
enjoyed writing z ig z ags . He wrote z ig
around in z ig z ags . When Zig
He wrote z ig z ags on walls , floors ,
so many z ig z ags ," he said nervously .
running around with the r ags , so this kind of
of her mom 's r ags and soap . She washed
him to turn the r ags she kept in the closet
Jack was turning the r ags , he looked out the
writing his z ig z ags , he heard a funny
She found some old r ags which were dirty . She
But the z ig z ags weren 't as easy as
around in z ig z ags , going left then right
The dog w ags its tail and l icks
Paul put r ags and hay into a box
named Lucy who loved r ags . She would always let
She loved to bring r ags from the kitchen around the
and showed her the r ags . She said that she
it was difficult finding r ags and water . But Bob

INTERVAL 3.535 - 6.131
CONTAINS 0.015%

, he found the tast iest ac orn he had ever
m itt and a spat ula . They work together to
happy to see the sun fl owers growing tall and strong
have the biggest and prett iest tower . They frowned and
picked the prett iest pe b bles and started to thread

INTERVAL 0.939 - 3.535
CONTAINS 6.918%

<|endoftext|> Once there was an ostr ich . It was very
claws and made a loud noise . Sara and Ben jumped
park . The k ang aroo is big and brown .
, Little Billy and the snow man were best friends !
Dad sigh s and picks up the navy boat .

INTERVAL -1.657 - 0.939
CONTAINS 92.048%

you supply me with these y ummy snacks ," said the
was sad . He walked and walked and stopped at a
when the sun had gone down , the man and Tommy
's okay because there 's always a way to make things
her hand . She smiled . " Come on

INTERVAL -4.253 - -1.657
CONTAINS 1.018%

Jill noticed something surprising . There were two little pink beans
a loud noise and he looked up . There was a
grinned with joy . She looked around and asked her mom
flew to the mountain . At the top was a healing
she decided to take it out of her pocket and have
Token trick
Token ID6908
Feature activation+6.35e-01

Pos logit contributions
 distance+0.020
Ar+0.020
 possible+0.020
Mill+0.020
Welcome+0.019
Neg logit contributions
okes-0.019
ish-0.019
ours-0.018
our-0.018
ap-0.017
Token-
Token ID12
Feature activation+1.11e-01

Pos logit contributions
Ar+0.062
Mill+0.061
 possible+0.061
Welcome+0.059
 distance+0.059
Neg logit contributions
ish-0.066
okes-0.065
ours-0.064
our-0.063
 favorites-0.061
Tokenor
Token ID273
Feature activation+1.83e+00

Pos logit contributions
 distance+0.014
Ar+0.013
Mill+0.013
 possible+0.013
 winding+0.012
Neg logit contributions
okes-0.009
ish-0.009
our-0.008
ours-0.008
ap-0.008
Token-
Token ID12
Feature activation+1.05e-01

Pos logit contributions
 possible+0.190
Ar+0.190
 distance+0.189
Mill+0.188
Welcome+0.178
Neg logit contributions
okes-0.178
ish-0.175
ours-0.170
 favorites-0.166
our-0.165
Tokent
Token ID83
Feature activation+1.80e+00

Pos logit contributions
 distance+0.013
Ar+0.013
Mill+0.012
 possible+0.012
 winding+0.012
Neg logit contributions
okes-0.009
ish-0.008
our-0.008
ours-0.008
ap-0.008
Tokenreating
Token ID34567
Feature activation+5.03e+00

Pos logit contributions
Ar+0.175
 distance+0.168
Mill+0.167
 possible+0.157
Welcome+0.156
Neg logit contributions
ish-0.196
okes-0.195
our-0.186
ap-0.184
ours-0.184
Token with
Token ID351
Feature activation+1.36e-01

Pos logit contributions
 possible+0.564
 distance+0.533
Ar+0.530
Mill+0.505
Welcome+0.504
Neg logit contributions
okes-0.486
ish-0.481
ours-0.465
our-0.439
ishes-0.431
Token her
Token ID607
Feature activation-1.46e-01

Pos logit contributions
 distance+0.013
Mill+0.012
 possible+0.012
Ar+0.012
Welcome+0.012
Neg logit contributions
okes-0.015
ish-0.015
our-0.015
ours-0.015
ap-0.015
Token.
Token ID13
Feature activation-7.79e-02

Pos logit contributions
okes+0.016
ours+0.015
int+0.014
ish+0.014
ap+0.014
Neg logit contributions
 distance-0.015
 possible-0.014
Ar-0.013
 moment-0.013
 path-0.013
Token Tim
Token ID5045
Feature activation-6.33e-01

Pos logit contributions
ish+0.008
okes+0.008
int+0.008
ap+0.008
ishes+0.008
Neg logit contributions
Mill-0.008
Welcome-0.007
 distance-0.007
Ab-0.007
Ar-0.007
Tokenmy
Token ID1820
Feature activation-3.16e-01

Pos logit contributions
ours+0.075
okes+0.075
ids+0.074
ap+0.072
our+0.070
Neg logit contributions
 distance-0.051
Mill-0.051
 moment-0.047
Ar-0.046
fl-0.043
Token elephants
Token ID25368
Feature activation+3.81e-01

Pos logit contributions
Mill+0.032
Ar+0.032
 possible+0.032
 distance+0.032
Welcome+0.030
Neg logit contributions
okes-0.024
ish-0.023
ours-0.022
our-0.022
ap-0.022
Token,
Token ID11
Feature activation+2.60e-01

Pos logit contributions
 distance+0.052
Ar+0.051
Mill+0.051
 possible+0.050
Welcome+0.049
Neg logit contributions
ish-0.026
okes-0.026
ours-0.024
our-0.024
ap-0.023
Token lions
Token ID31420
Feature activation+4.80e-02

Pos logit contributions
 distance+0.034
Mill+0.034
Ar+0.034
 possible+0.033
Welcome+0.032
Neg logit contributions
okes-0.019
ish-0.019
our-0.018
ours-0.018
ap-0.017
Token and
Token ID290
Feature activation+7.69e-02

Pos logit contributions
 distance+0.006
Ar+0.006
Mill+0.006
 possible+0.006
Welcome+0.006
Neg logit contributions
okes-0.003
ish-0.003
ours-0.003
our-0.003
ap-0.003
Token ac
Token ID936
Feature activation+2.91e-01

Pos logit contributions
Mill+0.009
 possible+0.009
 distance+0.009
Ar+0.008
Ab+0.008
Neg logit contributions
okes-0.007
ish-0.006
 favorites-0.006
ours-0.006
ap-0.006
Tokenrob
Token ID22609
Feature activation+4.92e+00

Pos logit contributions
Ar+0.021
 distance+0.021
Mill+0.020
 possible+0.019
Welcome+0.019
Neg logit contributions
okes-0.039
ish-0.039
our-0.038
ours-0.038
ap-0.038
Tokenats
Token ID1381
Feature activation-1.11e-01

Pos logit contributions
Ar+0.544
Mill+0.502
Ab+0.498
 possible+0.487
Kay+0.475
Neg logit contributions
ish-0.341
ours-0.322
our-0.321
 brother-0.311
ravings-0.311
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
ish+0.008
okes+0.008
our+0.008
ours+0.007
ap+0.007
Neg logit contributions
 distance-0.015
Ar-0.015
Mill-0.015
 possible-0.014
Welcome-0.014
Token They
Token ID1119
Feature activation-8.23e-02

Pos logit contributions
Ar+0.019
 distance+0.018
Welcome+0.018
Mill+0.017
Hello+0.017
Neg logit contributions
ish-0.023
our-0.021
okes-0.021
int-0.020
ig-0.020
Token are
Token ID389
Feature activation-9.76e-01

Pos logit contributions
ish+0.007
okes+0.007
our+0.007
ap+0.007
ours+0.007
Neg logit contributions
 distance-0.010
Mill-0.010
Ar-0.010
 possible-0.009
Welcome-0.009
Token amazed
Token ID24562
Feature activation+9.71e-01

Pos logit contributions
okes+0.092
ap+0.089
ish+0.089
 favorites+0.088
ours+0.085
Neg logit contributions
Mill-0.105
 distance-0.103
Ar-0.103
Welcome-0.102
 possible-0.098
Token trick
Token ID6908
Feature activation+4.03e-01

Pos logit contributions
ish+0.005
okes+0.004
our+0.004
ap+0.004
ours+0.004
Neg logit contributions
Ar-0.007
Mill-0.007
 distance-0.007
Welcome-0.007
 possible-0.007
Token-
Token ID12
Feature activation-5.14e-03

Pos logit contributions
 distance+0.041
Ar+0.041
Mill+0.039
 possible+0.038
Welcome+0.038
Neg logit contributions
ish-0.042
okes-0.042
our-0.040
ap-0.040
ours-0.040
Tokenor
Token ID273
Feature activation+1.55e+00

Pos logit contributions
okes+0.000
ish+0.000
our+0.000
ours+0.000
ap+0.000
Neg logit contributions
Ar-0.001
 distance-0.001
 winding-0.001
Mill-0.001
 possible-0.001
Token-
Token ID12
Feature activation-1.72e-02

Pos logit contributions
 distance+0.158
Ar+0.156
Mill+0.152
 possible+0.147
Welcome+0.147
Neg logit contributions
ish-0.164
okes-0.163
our-0.157
ours-0.154
ap-0.154
Tokent
Token ID83
Feature activation+1.47e+00

Pos logit contributions
okes+0.001
ish+0.001
our+0.001
ours+0.001
ap+0.001
Neg logit contributions
Ar-0.002
 distance-0.002
 winding-0.002
 possible-0.002
Mill-0.002
Tokenreating
Token ID34567
Feature activation+4.92e+00

Pos logit contributions
Ar+0.171
 distance+0.156
Mill+0.149
Welcome+0.146
 winding+0.140
Neg logit contributions
ish-0.136
okes-0.136
 anymore-0.128
ours-0.128
ap-0.127
Token anymore
Token ID7471
Feature activation-2.65e+00

Pos logit contributions
 distance+0.602
Ar+0.595
Mill+0.589
 possible+0.580
Welcome+0.568
Neg logit contributions
ish-0.406
okes-0.405
our-0.383
ours-0.378
ap-0.373
Token.
Token ID13
Feature activation-6.31e-01

Pos logit contributions
okes+0.199
ish+0.199
our+0.187
ours+0.184
ap+0.181
Neg logit contributions
 distance-0.344
Ar-0.340
Mill-0.336
 possible-0.331
Welcome-0.325
Token She
Token ID1375
Feature activation-9.06e-01

Pos logit contributions
ish+0.074
okes+0.071
our+0.069
ap+0.067
ours+0.067
Neg logit contributions
Mill-0.061
 distance-0.059
Ar-0.056
Ab-0.056
Welcome-0.056
Token sadly
Token ID21098
Feature activation-9.98e-01

Pos logit contributions
okes+0.097
ish+0.097
our+0.093
ours+0.092
ap+0.091
Neg logit contributions
 distance-0.089
Ar-0.088
Mill-0.086
 possible-0.085
Welcome-0.082
Token put
Token ID1234
Feature activation-1.13e+00

Pos logit contributions
ish+0.100
okes+0.100
our+0.096
ap+0.094
ours+0.094
Neg logit contributions
 distance-0.105
Ar-0.105
Mill-0.102
 possible-0.102
Welcome-0.099
Token clown
Token ID25573
Feature activation+3.54e-01

Pos logit contributions
Mill+0.003
Ar+0.003
 distance+0.003
 possible+0.003
Welcome+0.003
Neg logit contributions
ish-0.003
okes-0.003
our-0.003
ours-0.003
 favorites-0.003
Tokens
Token ID82
Feature activation+2.31e+00

Pos logit contributions
 distance+0.041
Ar+0.040
Mill+0.039
 possible+0.039
Welcome+0.039
Neg logit contributions
okes-0.031
ish-0.031
our-0.030
ours-0.029
ap-0.029
Token and
Token ID290
Feature activation+2.87e-01

Pos logit contributions
 distance+0.280
Ar+0.268
Mill+0.265
 possible+0.264
Welcome+0.257
Neg logit contributions
ish-0.199
okes-0.198
ours-0.187
our-0.185
 favorites-0.179
Token the
Token ID262
Feature activation+3.11e-02

Pos logit contributions
Mill+0.032
Ar+0.031
 possible+0.030
 distance+0.030
Welcome+0.029
Neg logit contributions
okes-0.027
ish-0.026
ours-0.026
our-0.025
ap-0.025
Token ac
Token ID936
Feature activation+3.38e-01

Pos logit contributions
Mill+0.003
Ar+0.003
 distance+0.003
 possible+0.003
Welcome+0.003
Neg logit contributions
ish-0.003
okes-0.003
our-0.003
ap-0.003
 favorites-0.003
Tokenrob
Token ID22609
Feature activation+4.88e+00

Pos logit contributions
 distance+0.026
Ar+0.026
Mill+0.026
 possible+0.025
Welcome+0.024
Neg logit contributions
ish-0.044
okes-0.043
our-0.042
ours-0.041
ap-0.041
Tokenats
Token ID1381
Feature activation-7.77e-02

Pos logit contributions
Ar+0.653
 distance+0.640
 possible+0.639
Mill+0.635
Ab+0.628
Neg logit contributions
ish-0.271
ours-0.261
okes-0.246
our-0.244
ap-0.228
Token and
Token ID290
Feature activation+2.36e-01

Pos logit contributions
okes+0.006
ish+0.006
our+0.005
ours+0.005
ap+0.005
Neg logit contributions
Ar-0.010
 distance-0.010
Mill-0.010
 possible-0.010
Welcome-0.010
Token the
Token ID262
Feature activation+2.49e-02

Pos logit contributions
Mill+0.026
Ar+0.025
 distance+0.025
 possible+0.025
Welcome+0.024
Neg logit contributions
okes-0.023
ish-0.023
ours-0.021
 favorites-0.021
our-0.021
Token lions
Token ID31420
Feature activation+1.29e-01

Pos logit contributions
Mill+0.003
Ar+0.002
 possible+0.002
 distance+0.002
Welcome+0.002
Neg logit contributions
ish-0.003
okes-0.002
our-0.002
 favorites-0.002
ap-0.002
Token and
Token ID290
Feature activation+2.02e-01

Pos logit contributions
 distance+0.016
Ar+0.016
Mill+0.016
 possible+0.016
Welcome+0.015
Neg logit contributions
okes-0.010
ish-0.010
our-0.010
ours-0.010
ap-0.009
Token it
Token ID340
Feature activation+9.24e-01

Pos logit contributions
ish+0.021
okes+0.021
ours+0.020
our+0.020
ap+0.020
Neg logit contributions
 distance-0.018
Ar-0.018
Mill-0.017
 possible-0.017
Welcome-0.017
Token to
Token ID284
Feature activation+3.79e-01

Pos logit contributions
 distance+0.088
Ar+0.087
 possible+0.085
Mill+0.082
Welcome+0.081
Neg logit contributions
okes-0.103
our-0.102
ish-0.099
ours-0.099
ap-0.095
Token me
Token ID502
Feature activation-9.83e-01

Pos logit contributions
 distance+0.040
Ar+0.040
Mill+0.039
 possible+0.038
Welcome+0.037
Neg logit contributions
ish-0.039
okes-0.039
ours-0.037
our-0.036
ap-0.035
Token for
Token ID329
Feature activation-5.57e-01

Pos logit contributions
okes+0.100
ours+0.093
ish+0.091
our+0.088
 favorites+0.088
Neg logit contributions
 possible-0.100
 distance-0.098
Mill-0.096
Ar-0.094
Ab-0.089
Token safe
Token ID3338
Feature activation-6.14e-01

Pos logit contributions
okes+0.055
ish+0.055
our+0.053
ours+0.052
ap+0.052
Neg logit contributions
 distance-0.059
Ar-0.058
Mill-0.058
 possible-0.057
Welcome-0.055
Tokenkeeping
Token ID19934
Feature activation+4.83e+00

Pos logit contributions
ish+0.077
okes+0.076
our+0.074
ap+0.073
ours+0.072
Neg logit contributions
 distance-0.048
Ar-0.048
Mill-0.048
 possible-0.046
Welcome-0.045
Token.
Token ID13
Feature activation+3.42e-01

Pos logit contributions
 distance+0.508
 possible+0.497
Ar+0.477
Mill+0.471
keeping+0.461
Neg logit contributions
ish-0.471
okes-0.468
ours-0.456
int-0.443
 favorites-0.429
Token He
Token ID679
Feature activation+3.59e-03

Pos logit contributions
 distance+0.036
Ar+0.036
Mill+0.036
 possible+0.035
Welcome+0.034
Neg logit contributions
ish-0.034
okes-0.034
our-0.032
ours-0.032
ap-0.032
Token told
Token ID1297
Feature activation+4.25e-01

Pos logit contributions
Ar+0.000
 distance+0.000
 possible+0.000
Welcome+0.000
Mill+0.000
Neg logit contributions
ish-0.000
okes-0.000
ap-0.000
our-0.000
ours-0.000
Token me
Token ID502
Feature activation-1.01e+00

Pos logit contributions
 distance+0.044
Ar+0.043
Mill+0.043
 possible+0.042
Welcome+0.041
Neg logit contributions
ish-0.044
okes-0.044
our-0.042
ours-0.041
ap-0.041
Token to
Token ID284
Feature activation+3.16e-01

Pos logit contributions
okes+0.103
ish+0.102
our+0.099
 favorites+0.096
ours+0.096
Neg logit contributions
 possible-0.103
 distance-0.102
Mill-0.098
Ar-0.094
Welcome-0.093
Token saw
Token ID2497
Feature activation-8.11e-01

Pos logit contributions
okes+0.014
ish+0.013
ours+0.013
our+0.013
 favorites+0.013
Neg logit contributions
 possible-0.016
Mill-0.016
 distance-0.016
Ar-0.016
Welcome-0.015
Token clown
Token ID25573
Feature activation+2.29e-01

Pos logit contributions
okes+0.077
ish+0.075
ours+0.072
ap+0.070
our+0.069
Neg logit contributions
 distance-0.090
Ar-0.090
Mill-0.090
 possible-0.087
Welcome-0.084
Tokens
Token ID82
Feature activation+2.23e+00

Pos logit contributions
 distance+0.022
Ar+0.022
Mill+0.021
 possible+0.021
Welcome+0.021
Neg logit contributions
okes-0.025
ish-0.025
our-0.024
ours-0.023
ap-0.023
Token and
Token ID290
Feature activation+4.84e-02

Pos logit contributions
 distance+0.289
Ar+0.284
Mill+0.281
 possible+0.279
Welcome+0.273
Neg logit contributions
ish-0.169
okes-0.168
our-0.158
ours-0.156
ap-0.153
Token ac
Token ID936
Feature activation+2.11e-01

Pos logit contributions
Mill+0.006
 possible+0.005
Ar+0.005
Ab+0.005
 distance+0.005
Neg logit contributions
okes-0.004
 favorites-0.004
ish-0.004
 brother-0.004
ours-0.004
Tokenrob
Token ID22609
Feature activation+4.82e+00

Pos logit contributions
 distance+0.017
Ar+0.017
Mill+0.017
 possible+0.016
Welcome+0.016
Neg logit contributions
okes-0.026
ish-0.026
our-0.025
ours-0.025
ap-0.025
Tokenats
Token ID1381
Feature activation-2.03e-01

Pos logit contributions
Ar+0.552
Mill+0.519
Ab+0.516
 possible+0.500
 distance+0.486
Neg logit contributions
ish-0.315
our-0.299
ours-0.296
 brother-0.282
int-0.270
Token and
Token ID290
Feature activation+1.88e-02

Pos logit contributions
okes+0.013
ish+0.013
our+0.012
ap+0.012
ours+0.012
Neg logit contributions
Ar-0.028
Mill-0.028
 distance-0.028
 possible-0.028
Ab-0.027
Token lions
Token ID31420
Feature activation-7.29e-02

Pos logit contributions
Mill+0.002
 possible+0.002
 distance+0.002
Ar+0.002
Ab+0.002
Neg logit contributions
okes-0.002
ish-0.002
 favorites-0.002
ours-0.002
ap-0.002
Token and
Token ID290
Feature activation-7.73e-04

Pos logit contributions
okes+0.006
ish+0.006
ours+0.005
our+0.005
ap+0.005
Neg logit contributions
 distance-0.009
Ar-0.009
Mill-0.009
 possible-0.009
Welcome-0.009
Token elephants
Token ID25368
Feature activation+2.73e-01

Pos logit contributions
okes+0.000
ish+0.000
 favorites+0.000
ours+0.000
ap+0.000
Neg logit contributions
Mill-0.000
 possible-0.000
Ar-0.000
 distance-0.000
Ab-0.000
Token thought
Token ID1807
Feature activation+7.61e-01

Pos logit contributions
okes+0.057
ours+0.056
ish+0.056
ags+0.054
ap+0.053
Neg logit contributions
 distance-0.043
Ar-0.042
Mill-0.041
 possible-0.037
Welcome-0.037
Token they
Token ID484
Feature activation+3.19e-01

Pos logit contributions
 possible+0.059
Mill+0.054
 distance+0.054
Welcome+0.051
Ar+0.050
Neg logit contributions
ish-0.100
okes-0.096
int-0.094
our-0.093
ishes-0.093
Token looked
Token ID3114
Feature activation-1.67e+00

Pos logit contributions
 distance+0.027
Mill+0.026
Ar+0.026
Welcome+0.025
 possible+0.025
Neg logit contributions
okes-0.039
ours-0.037
ish-0.037
ap-0.036
our-0.036
Token like
Token ID588
Feature activation-9.04e-01

Pos logit contributions
okes+0.215
our+0.207
ish+0.204
ours+0.201
ap+0.198
Neg logit contributions
 distance-0.136
 possible-0.129
 winding-0.123
Ar-0.123
Welcome-0.121
Token ac
Token ID936
Feature activation+2.57e-01

Pos logit contributions
okes+0.086
ish+0.084
 favorites+0.082
ours+0.081
ap+0.081
Neg logit contributions
 distance-0.098
Ar-0.096
Mill-0.094
 possible-0.089
Ab-0.088
Tokenrob
Token ID22609
Feature activation+4.81e+00

Pos logit contributions
 distance+0.020
Ar+0.019
Mill+0.019
 possible+0.019
Welcome+0.018
Neg logit contributions
okes-0.033
ish-0.033
our-0.032
ours-0.032
ap-0.032
Tokenats
Token ID1381
Feature activation-8.45e-02

Pos logit contributions
 possible+0.454
Ab+0.445
Mill+0.444
Ar+0.444
lled+0.429
Neg logit contributions
ish-0.329
ours-0.318
 brother-0.318
our-0.309
 eyes-0.299
Token.
Token ID13
Feature activation-4.56e-02

Pos logit contributions
ish+0.006
okes+0.006
our+0.006
ap+0.006
ours+0.006
Neg logit contributions
Ar-0.011
 distance-0.011
Mill-0.011
 possible-0.011
Ab-0.011
Token She
Token ID1375
Feature activation-3.89e-01

Pos logit contributions
ish+0.006
okes+0.006
int+0.006
ap+0.005
ishes+0.005
Neg logit contributions
Mill-0.003
Welcome-0.003
 distance-0.003
Hello-0.003
Ar-0.003
Token enjoyed
Token ID8359
Feature activation+5.93e-01

Pos logit contributions
ish+0.044
okes+0.044
ours+0.043
our+0.042
ap+0.042
Neg logit contributions
Ar-0.035
 distance-0.035
Mill-0.034
 possible-0.033
Welcome-0.032
Token the
Token ID262
Feature activation-8.55e-02

Pos logit contributions
 distance+0.077
Ar+0.074
 possible+0.070
Mill+0.070
Welcome+0.070
Neg logit contributions
okes-0.046
ap-0.043
ours-0.043
 favorites-0.042
our-0.042
Token of
Token ID286
Feature activation+1.15e+00

Pos logit contributions
 distance+0.025
Ar+0.025
Mill+0.025
 possible+0.024
Welcome+0.024
Neg logit contributions
okes-0.021
ish-0.021
our-0.020
ours-0.020
ap-0.019
Token fun
Token ID1257
Feature activation-1.66e-02

Pos logit contributions
Ar+0.106
 distance+0.103
 possible+0.098
Welcome+0.096
Mill+0.095
Neg logit contributions
ish-0.141
okes-0.136
our-0.133
ours-0.132
ap-0.129
Token watching
Token ID4964
Feature activation+9.55e-01

Pos logit contributions
ish+0.001
okes+0.001
ours+0.001
our+0.001
ap+0.001
Neg logit contributions
 distance-0.002
Ar-0.002
Mill-0.002
 possible-0.002
Welcome-0.002
Token the
Token ID262
Feature activation-1.47e-01

Pos logit contributions
 distance+0.105
Ar+0.103
Mill+0.102
 possible+0.102
Welcome+0.100
Neg logit contributions
okes-0.090
ish-0.090
our-0.087
ours-0.086
ap-0.083
Token ac
Token ID936
Feature activation+2.32e-01

Pos logit contributions
ish+0.014
okes+0.013
 favorites+0.013
ap+0.013
our+0.013
Neg logit contributions
Mill-0.017
 distance-0.016
 possible-0.015
Ab-0.015
Ar-0.015
Tokenrob
Token ID22609
Feature activation+4.76e+00

Pos logit contributions
 distance+0.017
Ar+0.016
Mill+0.016
 possible+0.016
Welcome+0.015
Neg logit contributions
ish-0.031
okes-0.031
our-0.030
ours-0.030
ap-0.029
Tokenats
Token ID1381
Feature activation-7.63e-02

Pos logit contributions
 possible+0.601
Ar+0.593
 distance+0.585
Ab+0.577
Mill+0.565
Neg logit contributions
ish-0.296
ours-0.288
okes-0.274
our-0.265
ids-0.256
Token and
Token ID290
Feature activation-2.26e-01

Pos logit contributions
ish+0.007
okes+0.006
our+0.006
ap+0.006
ours+0.006
Neg logit contributions
Ar-0.009
 distance-0.009
Mill-0.009
 possible-0.009
Ab-0.009
Token clown
Token ID25573
Feature activation+2.17e-01

Pos logit contributions
okes+0.019
 favorites+0.019
ish+0.019
ishes+0.018
ap+0.018
Neg logit contributions
Mill-0.026
Ar-0.025
 distance-0.025
Ab-0.025
 possible-0.025
Tokens
Token ID82
Feature activation+2.04e+00

Pos logit contributions
 distance+0.021
Ar+0.020
Mill+0.020
 possible+0.020
Welcome+0.019
Neg logit contributions
okes-0.024
ish-0.023
ours-0.023
our-0.022
ap-0.022
Token.
Token ID13
Feature activation-6.79e-02

Pos logit contributions
 distance+0.251
Ar+0.245
 possible+0.242
Mill+0.242
Welcome+0.235
Neg logit contributions
okes-0.170
ish-0.169
ours-0.159
our-0.159
ap-0.153
Token the
Token ID262
Feature activation+4.31e-02

Pos logit contributions
ish+0.003
okes+0.003
our+0.003
ap+0.003
ours+0.003
Neg logit contributions
 distance-0.002
Ar-0.002
Mill-0.002
 possible-0.002
Welcome-0.002
Token animals
Token ID4695
Feature activation-4.70e-01

Pos logit contributions
Mill+0.005
 distance+0.004
 possible+0.004
Ab+0.004
Ar+0.004
Neg logit contributions
ish-0.004
 favorites-0.004
okes-0.004
ours-0.004
ap-0.004
Token,
Token ID11
Feature activation+3.06e-01

Pos logit contributions
okes+0.039
ish+0.037
ours+0.036
our+0.035
ap+0.034
Neg logit contributions
 distance-0.059
 possible-0.057
Ar-0.057
Mill-0.056
Welcome-0.055
Token the
Token ID262
Feature activation+5.65e-02

Pos logit contributions
 distance+0.035
Ar+0.034
Mill+0.034
 possible+0.034
Welcome+0.033
Neg logit contributions
ish-0.028
okes-0.028
our-0.027
ours-0.026
ap-0.026
Token ac
Token ID936
Feature activation+2.01e-01

Pos logit contributions
Mill+0.006
 possible+0.006
 distance+0.006
Ar+0.005
Ab+0.005
Neg logit contributions
ish-0.006
 favorites-0.006
okes-0.005
ours-0.005
ap-0.005
Tokenrob
Token ID22609
Feature activation+4.75e+00

Pos logit contributions
 distance+0.014
Ar+0.014
Mill+0.014
 possible+0.013
Welcome+0.013
Neg logit contributions
okes-0.027
ish-0.027
our-0.026
ours-0.026
ap-0.026
Tokenats
Token ID1381
Feature activation-1.24e-01

Pos logit contributions
Ar+0.591
 possible+0.574
 distance+0.573
Ab+0.568
Mill+0.555
Neg logit contributions
ish-0.277
ours-0.270
okes-0.262
our-0.258
int-0.255
Token and
Token ID290
Feature activation-1.40e-02

Pos logit contributions
okes+0.009
ish+0.009
our+0.008
ours+0.008
ap+0.008
Neg logit contributions
 distance-0.017
Ar-0.016
Mill-0.016
 possible-0.016
Welcome-0.016
Token the
Token ID262
Feature activation+4.49e-02

Pos logit contributions
ish+0.001
okes+0.001
 favorites+0.001
ours+0.001
our+0.001
Neg logit contributions
Mill-0.002
Ar-0.001
 distance-0.001
 possible-0.001
Welcome-0.001
Token clown
Token ID25573
Feature activation+1.60e-01

Pos logit contributions
Mill+0.005
 possible+0.004
 distance+0.004
Ar+0.004
Welcome+0.004
Neg logit contributions
ish-0.005
okes-0.004
 favorites-0.004
our-0.004
ours-0.004
Tokens
Token ID82
Feature activation+2.40e+00

Pos logit contributions
 distance+0.014
Ar+0.014
 possible+0.013
Welcome+0.013
Mill+0.013
Neg logit contributions
okes-0.018
ish-0.018
ours-0.017
our-0.017
ap-0.017
Token truck
Token ID7779
Feature activation-4.85e-01

Pos logit contributions
 distance+0.062
Ar+0.060
Mill+0.059
 possible+0.058
Welcome+0.056
Neg logit contributions
okes-0.069
ish-0.069
our-0.066
ours-0.066
ap-0.064
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
okes+0.040
ish+0.040
ours+0.038
our+0.037
 favorites+0.036
Neg logit contributions
 distance-0.060
Ar-0.058
 possible-0.057
Mill-0.057
Welcome-0.055
Token They
Token ID1119
Feature activation+4.28e-01

Pos logit contributions
 distance+0.046
Ar+0.045
Mill+0.045
 possible+0.044
Welcome+0.043
Neg logit contributions
ish-0.038
okes-0.037
our-0.036
ours-0.035
ap-0.034
Token were
Token ID547
Feature activation+5.68e-01

Pos logit contributions
 distance+0.046
Ar+0.046
Mill+0.044
 possible+0.044
Welcome+0.043
Neg logit contributions
okes-0.042
ish-0.042
ours-0.040
our-0.040
ap-0.039
Token different
Token ID1180
Feature activation+3.94e-01

Pos logit contributions
 distance+0.059
Ar+0.059
Mill+0.058
Welcome+0.056
 possible+0.056
Neg logit contributions
ish-0.057
okes-0.057
ap-0.054
ours-0.054
our-0.053
Token sizes
Token ID10620
Feature activation+4.71e+00

Pos logit contributions
 distance+0.043
Ar+0.042
Mill+0.042
 possible+0.042
Welcome+0.040
Neg logit contributions
okes-0.038
ish-0.038
our-0.036
ours-0.036
ap-0.035
Token and
Token ID290
Feature activation+4.31e-01

Pos logit contributions
 distance+0.594
 possible+0.582
Mill+0.576
Ar+0.574
Welcome+0.558
Neg logit contributions
okes-0.370
ish-0.364
ours-0.347
our-0.347
ap-0.331
Token colors
Token ID7577
Feature activation+1.96e+00

Pos logit contributions
Mill+0.042
Ab+0.041
 distance+0.041
Ar+0.041
Welcome+0.039
Neg logit contributions
ish-0.044
 favorites-0.043
ishes-0.042
okes-0.040
 promises-0.039
Token.
Token ID13
Feature activation+4.66e-01

Pos logit contributions
Ar+0.272
 distance+0.269
Mill+0.265
Welcome+0.257
Ab+0.254
Neg logit contributions
ish-0.137
okes-0.134
our-0.126
ap-0.123
ours-0.122
Token Some
Token ID2773
Feature activation+1.93e-01

Pos logit contributions
 distance+0.050
Ar+0.050
Mill+0.050
 possible+0.048
Welcome+0.048
Neg logit contributions
ish-0.046
okes-0.045
our-0.043
ours-0.042
ap-0.042
Token had
Token ID550
Feature activation+8.43e-01

Pos logit contributions
 distance+0.019
Ar+0.019
Mill+0.019
 possible+0.019
Welcome+0.018
Neg logit contributions
ish-0.020
okes-0.020
our-0.019
ours-0.019
ap-0.019
Token at
Token ID379
Feature activation-3.27e-01

Pos logit contributions
okes+0.037
ish+0.035
our+0.034
ours+0.033
ap+0.032
Neg logit contributions
 distance-0.075
 possible-0.072
Ar-0.071
Mill-0.071
Welcome-0.070
Token the
Token ID262
Feature activation-1.93e-01

Pos logit contributions
ish+0.035
okes+0.035
our+0.033
ours+0.033
ap+0.032
Neg logit contributions
 distance-0.033
Ar-0.033
Mill-0.032
 possible-0.031
Welcome-0.031
Token frame
Token ID5739
Feature activation-4.31e-01

Pos logit contributions
ish+0.021
okes+0.020
our+0.020
 favorites+0.019
ap+0.019
Neg logit contributions
Ar-0.019
Mill-0.019
 distance-0.018
Welcome-0.017
 possible-0.017
Token from
Token ID422
Feature activation+1.38e+00

Pos logit contributions
ish+0.033
okes+0.033
our+0.031
ours+0.030
ap+0.030
Neg logit contributions
 distance-0.055
Ar-0.055
Mill-0.054
 possible-0.053
Welcome-0.052
Token different
Token ID1180
Feature activation-3.18e-01

Pos logit contributions
Ar+0.143
Mill+0.142
 distance+0.141
Welcome+0.136
 possible+0.136
Neg logit contributions
ish-0.137
okes-0.136
our-0.132
ours-0.129
ap-0.129
Token angles
Token ID18333
Feature activation+4.69e+00

Pos logit contributions
ish+0.032
okes+0.032
our+0.031
ours+0.030
ap+0.030
Neg logit contributions
 distance-0.033
Ar-0.033
Mill-0.032
 possible-0.032
Welcome-0.031
Token and
Token ID290
Feature activation-1.24e-01

Pos logit contributions
 distance+0.617
Ar+0.612
Mill+0.605
 possible+0.595
Welcome+0.586
Neg logit contributions
ish-0.344
okes-0.344
our-0.324
ours-0.318
ap-0.315
Token adm
Token ID6178
Feature activation+6.89e-01

Pos logit contributions
ish+0.011
okes+0.011
our+0.011
 favorites+0.011
ours+0.011
Neg logit contributions
Ar-0.014
Mill-0.014
Ab-0.013
Welcome-0.013
 distance-0.013
Tokeniring
Token ID3428
Feature activation+1.74e+00

Pos logit contributions
 distance+0.052
Mill+0.051
Ar+0.051
 possible+0.051
Welcome+0.048
Neg logit contributions
okes-0.088
ish-0.088
our-0.085
ours-0.084
ap-0.083
Token it
Token ID340
Feature activation+6.37e-01

Pos logit contributions
Ar+0.178
 distance+0.177
Mill+0.173
Welcome+0.166
 possible+0.165
Neg logit contributions
ish-0.181
okes-0.178
ours-0.171
our-0.171
ap-0.169
Token.
Token ID13
Feature activation-1.36e-01

Pos logit contributions
 distance+0.081
Ar+0.081
Mill+0.080
 possible+0.078
Welcome+0.077
Neg logit contributions
ish-0.049
okes-0.049
our-0.046
ours-0.045
ap-0.045
Token look
Token ID804
Feature activation-1.46e+00

Pos logit contributions
Ar+0.029
 distance+0.029
Mill+0.028
 possible+0.028
Welcome+0.027
Neg logit contributions
ish-0.024
okes-0.023
our-0.023
ap-0.022
ours-0.022
Token at
Token ID379
Feature activation-2.89e-01

Pos logit contributions
okes+0.086
ish+0.083
our+0.078
ours+0.078
ap+0.075
Neg logit contributions
 distance-0.215
 possible-0.210
Mill-0.210
Ar-0.208
Welcome-0.204
Token it
Token ID340
Feature activation+6.43e-01

Pos logit contributions
ish+0.029
okes+0.029
our+0.028
ours+0.028
ap+0.027
Neg logit contributions
 distance-0.030
Ar-0.030
Mill-0.029
 possible-0.029
Welcome-0.028
Token from
Token ID422
Feature activation+1.44e+00

Pos logit contributions
 distance+0.078
Ar+0.076
 possible+0.076
Mill+0.076
Welcome+0.073
Neg logit contributions
okes-0.054
ish-0.053
ours-0.050
our-0.050
ap-0.048
Token all
Token ID477
Feature activation+6.11e-01

Pos logit contributions
Ar+0.152
 distance+0.151
Mill+0.150
 possible+0.145
Welcome+0.143
Neg logit contributions
ish-0.143
okes-0.143
our-0.137
ours-0.135
ap-0.134
Token angles
Token ID18333
Feature activation+4.69e+00

Pos logit contributions
Ar+0.076
Mill+0.070
Welcome+0.068
 distance+0.067
Ab+0.066
Neg logit contributions
ish-0.054
okes-0.052
ap-0.051
ours-0.050
our-0.050
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
Ar+0.655
 distance+0.647
Mill+0.626
Welcome+0.617
 possible+0.613
Neg logit contributions
ish-0.321
okes-0.319
our-0.306
ap-0.300
ours-0.293
Token It
Token ID632
Feature activation-3.69e-01

Pos logit contributions
ish+0.001
okes+0.001
our+0.001
ishes+0.001
int+0.001
Neg logit contributions
 distance-0.001
Mill-0.001
Welcome-0.001
Ar-0.001
Ab-0.001
Token was
Token ID373
Feature activation+1.93e-01

Pos logit contributions
ish+0.033
okes+0.032
our+0.030
ap+0.030
ours+0.030
Neg logit contributions
 distance-0.041
Ar-0.041
Mill-0.041
 possible-0.040
Welcome-0.040
Token so
Token ID523
Feature activation+1.02e+00

Pos logit contributions
Ar+0.024
 distance+0.023
Welcome+0.023
Mill+0.023
Ab+0.022
Neg logit contributions
okes-0.015
 favorites-0.015
ish-0.014
ap-0.014
ours-0.014
Token colorful
Token ID20239
Feature activation+4.02e-01

Pos logit contributions
 distance+0.119
keeping+0.109
Ar+0.107
Mill+0.102
Welcome+0.101
Neg logit contributions
okes-0.094
ours-0.089
ap-0.089
 favorites-0.084
udd-0.084
Token name
Token ID1438
Feature activation-8.06e-02

Pos logit contributions
ish+0.007
okes+0.007
ap+0.007
our+0.007
ours+0.007
Neg logit contributions
Ar-0.009
 distance-0.009
Mill-0.009
 possible-0.009
Welcome-0.009
Token their
Token ID511
Feature activation+1.39e+00

Pos logit contributions
ish+0.009
okes+0.009
our+0.008
ours+0.008
ap+0.008
Neg logit contributions
Mill-0.008
 distance-0.008
 possible-0.008
Welcome-0.008
Ar-0.008
Token university
Token ID6403
Feature activation+2.54e-01

Pos logit contributions
 distance+0.162
Ar+0.160
Mill+0.158
 possible+0.156
Welcome+0.152
Neg logit contributions
okes-0.121
ish-0.121
our-0.116
ours-0.114
ap-0.112
Token The
Token ID383
Feature activation-7.69e-01

Pos logit contributions
 distance+0.030
 possible+0.030
Ar+0.029
Mill+0.029
Welcome+0.028
Neg logit contributions
okes-0.022
ish-0.022
ours-0.021
our-0.020
ap-0.020
Token Gener
Token ID2980
Feature activation-5.89e-01

Pos logit contributions
ish+0.077
okes+0.077
our+0.074
ours+0.073
ap+0.072
Neg logit contributions
 distance-0.080
Ar-0.079
Mill-0.078
 possible-0.076
Welcome-0.074
Tokenous
Token ID516
Feature activation+4.68e+00

Pos logit contributions
ish+0.074
okes+0.074
our+0.070
ours+0.070
ap+0.069
Neg logit contributions
 distance-0.045
 possible-0.044
Mill-0.044
Ar-0.044
Welcome-0.042
Token University
Token ID2059
Feature activation-1.71e-01

Pos logit contributions
 distance+0.475
Ar+0.470
Mill+0.464
 possible+0.456
Welcome+0.444
Neg logit contributions
ish-0.483
okes-0.481
our-0.461
ours-0.455
ap-0.452
Token.
Token ID13
Feature activation+3.25e-01

Pos logit contributions
ish+0.017
okes+0.017
our+0.016
ours+0.016
ap+0.015
Neg logit contributions
 distance-0.018
Ar-0.018
Mill-0.018
 possible-0.018
Welcome-0.017
Token
Token ID198
Feature activation+5.18e-01

Pos logit contributions
 distance+0.037
Ar+0.037
Mill+0.036
 possible+0.036
Welcome+0.035
Neg logit contributions
ish-0.030
okes-0.030
our-0.028
ours-0.028
ap-0.027
TokenBoth
Token ID10265
Feature activation+7.11e-01

Pos logit contributions
 distance+0.059
 possible+0.058
Ar+0.056
Mill+0.056
Welcome+0.055
Neg logit contributions
okes-0.046
ours-0.043
ish-0.043
ap-0.042
 favorites-0.041
Token of
Token ID286
Feature activation+1.15e+00

Pos logit contributions
 distance+0.076
Mill+0.074
Ar+0.074
Welcome+0.074
 possible+0.073
Neg logit contributions
okes-0.070
ish-0.069
our-0.067
ours-0.065
int-0.064
Token he
Token ID339
Feature activation+9.77e-01

Pos logit contributions
ish+0.029
okes+0.029
our+0.029
ap+0.028
ours+0.027
Neg logit contributions
Ar-0.022
 distance-0.022
Mill-0.020
Welcome-0.019
Ab-0.018
Token could
Token ID714
Feature activation+5.07e-01

Pos logit contributions
 distance+0.092
 possible+0.090
Ar+0.090
Mill+0.089
Welcome+0.086
Neg logit contributions
okes-0.107
ish-0.106
ours-0.104
ap-0.102
our-0.102
Token get
Token ID651
Feature activation+4.38e-01

Pos logit contributions
 distance+0.056
 possible+0.055
Ar+0.055
Mill+0.054
Welcome+0.052
Neg logit contributions
okes-0.048
ish-0.047
ours-0.046
our-0.044
 favorites-0.043
Token some
Token ID617
Feature activation+4.91e-01

Pos logit contributions
 distance+0.049
Ar+0.049
Mill+0.048
 possible+0.046
Welcome+0.046
Neg logit contributions
ish-0.041
okes-0.041
our-0.038
ours-0.038
ap-0.038
Token sa
Token ID473
Feature activation+6.14e-01

Pos logit contributions
 distance+0.053
 possible+0.051
Ar+0.051
Mill+0.050
Welcome+0.049
Neg logit contributions
okes-0.049
ish-0.047
our-0.046
ours-0.045
ap-0.044
Tokenus
Token ID385
Feature activation+4.65e+00

Pos logit contributions
 distance+0.039
Ar+0.038
Mill+0.037
 possible+0.036
Welcome+0.034
Neg logit contributions
okes-0.088
ish-0.087
our-0.085
ours-0.084
ap-0.084
Tokenages
Token ID1095
Feature activation+6.96e-01

Pos logit contributions
 distance+0.506
Ar+0.501
 possible+0.497
Mill+0.491
Welcome+0.469
Neg logit contributions
okes-0.443
ish-0.428
ours-0.417
our-0.410
ap-0.402
Token for
Token ID329
Feature activation-3.34e-01

Pos logit contributions
 distance+0.087
 possible+0.085
Ar+0.085
Mill+0.084
Welcome+0.081
Neg logit contributions
okes-0.056
ish-0.055
ours-0.052
our-0.052
ap-0.050
Token his
Token ID465
Feature activation+1.04e+00

Pos logit contributions
okes+0.031
ish+0.031
our+0.030
ours+0.029
ap+0.029
Neg logit contributions
 distance-0.037
Ar-0.037
Mill-0.036
 possible-0.036
Welcome-0.035
Token dinner
Token ID8073
Feature activation-1.99e-01

Pos logit contributions
 distance+0.122
Ar+0.120
Mill+0.119
 possible+0.118
Welcome+0.114
Neg logit contributions
okes-0.091
ish-0.090
ours-0.085
our-0.085
ap-0.083
Token.
Token ID13
Feature activation+9.31e-03

Pos logit contributions
okes+0.017
ish+0.016
ours+0.016
our+0.015
ap+0.015
Neg logit contributions
 distance-0.025
Ar-0.024
 possible-0.024
Mill-0.024
Welcome-0.023
Token a
Token ID257
Feature activation-2.33e-01

Pos logit contributions
okes+0.094
ish+0.094
 favorites+0.091
ours+0.090
ap+0.090
Neg logit contributions
 distance-0.108
Ar-0.107
Mill-0.106
Ab-0.098
Welcome-0.098
Token king
Token ID5822
Feature activation-1.19e+00

Pos logit contributions
okes+0.026
ish+0.026
our+0.025
ours+0.025
ap+0.024
Neg logit contributions
 distance-0.022
Ar-0.022
Mill-0.021
 possible-0.021
Welcome-0.021
Token with
Token ID351
Feature activation+3.04e-01

Pos logit contributions
okes+0.103
ours+0.099
ish+0.097
our+0.095
ap+0.091
Neg logit contributions
 distance-0.146
 possible-0.142
Ar-0.140
Mill-0.139
Welcome-0.136
Token all
Token ID477
Feature activation+4.98e-01

Pos logit contributions
Mill+0.034
Ar+0.034
 distance+0.034
Ab+0.032
 possible+0.032
Neg logit contributions
okes-0.028
ish-0.027
 favorites-0.026
ours-0.026
ap-0.025
Token his
Token ID465
Feature activation+4.12e-01

Pos logit contributions
Ar+0.051
 distance+0.050
Mill+0.050
 possible+0.048
Welcome+0.048
Neg logit contributions
ish-0.052
okes-0.051
ours-0.049
our-0.049
ap-0.048
Token angles
Token ID18333
Feature activation+4.64e+00

Pos logit contributions
 distance+0.040
Ar+0.038
 possible+0.038
Mill+0.037
Welcome+0.035
Neg logit contributions
okes-0.044
ours-0.044
ish-0.044
ap-0.043
our-0.042
Token.
Token ID13
Feature activation-1.76e-01

Pos logit contributions
 distance+0.603
Ar+0.597
Mill+0.590
 possible+0.582
Welcome+0.571
Neg logit contributions
ish-0.346
okes-0.346
our-0.325
ours-0.321
ap-0.316
Token
Token ID198
Feature activation+2.51e-01

Pos logit contributions
ish+0.018
okes+0.018
our+0.017
ours+0.017
ap+0.017
Neg logit contributions
 distance-0.018
Ar-0.018
Mill-0.018
 possible-0.018
Welcome-0.017
Token
Token ID198
Feature activation+2.61e-01

Pos logit contributions
 distance+0.029
 possible+0.028
Ar+0.028
Mill+0.027
Welcome+0.027
Neg logit contributions
okes-0.022
ish-0.021
our-0.021
ap-0.021
ours-0.020
TokenBut
Token ID1537
Feature activation+7.41e-01

Pos logit contributions
 possible+0.022
 distance+0.021
Ar+0.020
fl+0.019
ho+0.019
Neg logit contributions
ish-0.028
okes-0.028
ap-0.027
int-0.027
our-0.027
Token,
Token ID11
Feature activation-1.26e-01

Pos logit contributions
Ar+0.085
 distance+0.083
Mill+0.082
Welcome+0.080
 possible+0.080
Neg logit contributions
ish-0.069
okes-0.066
our-0.065
ap-0.063
ours-0.061
Token trucks
Token ID13960
Feature activation+1.27e+00

Pos logit contributions
Ar+0.147
 distance+0.142
 winding+0.134
 path+0.125
Mill+0.122
Neg logit contributions
our-0.156
ish-0.149
okes-0.148
int-0.148
ap-0.147
Token of
Token ID286
Feature activation+1.43e+00

Pos logit contributions
 distance+0.124
 possible+0.120
Ar+0.117
Welcome+0.110
Mill+0.108
Neg logit contributions
int-0.135
our-0.134
ish-0.133
okes-0.131
ours-0.128
Token different
Token ID1180
Feature activation+2.82e-02

Pos logit contributions
Ar+0.119
 distance+0.115
Mill+0.110
 possible+0.106
Welcome+0.103
Neg logit contributions
ish-0.182
okes-0.180
our-0.177
int-0.173
ap-0.173
Token colors
Token ID7577
Feature activation+1.86e+00

Pos logit contributions
Ar+0.002
 distance+0.002
 possible+0.002
Welcome+0.002
Mill+0.002
Neg logit contributions
okes-0.003
ish-0.003
our-0.003
ours-0.003
ap-0.003
Token and
Token ID290
Feature activation+3.63e-01

Pos logit contributions
 distance+0.242
Ar+0.238
Mill+0.237
 possible+0.235
Welcome+0.229
Neg logit contributions
okes-0.140
ish-0.140
our-0.131
ours-0.130
ap-0.128
Token sizes
Token ID10620
Feature activation+4.63e+00

Pos logit contributions
Mill+0.035
 distance+0.034
Ab+0.033
Ar+0.033
 possible+0.032
Neg logit contributions
ish-0.040
 favorites-0.039
okes-0.039
our-0.039
ap-0.038
Token.
Token ID13
Feature activation+4.97e-01

Pos logit contributions
 possible+0.498
 distance+0.491
Ar+0.472
Mill+0.470
Welcome+0.464
Neg logit contributions
okes-0.439
ish-0.427
int-0.414
ours-0.410
our-0.409
Token They
Token ID1119
Feature activation+2.85e-01

Pos logit contributions
Mill+0.050
 distance+0.050
Ar+0.049
Ab+0.049
Welcome+0.049
Neg logit contributions
ish-0.052
our-0.049
okes-0.049
int-0.047
ishes-0.047
Token make
Token ID787
Feature activation+5.79e-01

Pos logit contributions
Ar+0.030
 distance+0.030
 possible+0.028
Welcome+0.027
Mill+0.027
Neg logit contributions
okes-0.030
ish-0.028
ours-0.028
our-0.027
int-0.027
Token sounds
Token ID5238
Feature activation+1.93e+00

Pos logit contributions
Ar+0.054
 distance+0.054
 possible+0.051
Mill+0.051
Welcome+0.049
Neg logit contributions
ish-0.065
okes-0.064
our-0.063
ours-0.061
int-0.060
Token like
Token ID588
Feature activation-4.79e-01

Pos logit contributions
 distance+0.239
Ar+0.236
Mill+0.234
 possible+0.231
Welcome+0.226
Neg logit contributions
okes-0.158
ish-0.158
our-0.148
ours-0.147
ap-0.145
Token and
Token ID290
Feature activation+3.42e-01

Pos logit contributions
 distance+0.193
Ar+0.191
Mill+0.189
 possible+0.187
Welcome+0.183
Neg logit contributions
okes-0.110
ish-0.110
our-0.103
ours-0.102
ap-0.100
Token some
Token ID617
Feature activation+5.65e-01

Pos logit contributions
 distance+0.041
Mill+0.040
Ar+0.040
 possible+0.038
Welcome+0.038
Neg logit contributions
okes-0.031
ours-0.028
ish-0.028
our-0.028
 favorites-0.027
Token had
Token ID550
Feature activation+7.79e-01

Pos logit contributions
Mill+0.059
Ar+0.059
 distance+0.058
 possible+0.057
Welcome+0.055
Neg logit contributions
okes-0.056
ish-0.056
our-0.053
ap-0.053
ours-0.053
Token short
Token ID1790
Feature activation+8.68e-02

Pos logit contributions
 distance+0.088
Mill+0.087
Ar+0.086
 possible+0.084
Welcome+0.083
Neg logit contributions
okes-0.072
ish-0.068
ours-0.066
 favorites-0.066
our-0.065
Token t
Token ID256
Feature activation+5.18e-01

Pos logit contributions
 distance+0.012
Ar+0.012
Mill+0.012
 possible+0.012
Welcome+0.011
Neg logit contributions
okes-0.006
ish-0.006
our-0.006
ours-0.005
ap-0.005
Tokenus
Token ID385
Feature activation+4.61e+00

Pos logit contributions
 distance+0.058
Ar+0.055
 possible+0.055
Welcome+0.054
Mill+0.054
Neg logit contributions
okes-0.053
ish-0.053
our-0.050
ours-0.048
ap-0.047
Tokenks
Token ID591
Feature activation+1.47e+00

Pos logit contributions
 distance+0.562
Ar+0.552
Mill+0.550
 possible+0.539
Welcome+0.531
Neg logit contributions
ish-0.393
okes-0.392
our-0.371
ours-0.362
ap-0.360
Token.
Token ID13
Feature activation+3.61e-01

Pos logit contributions
 distance+0.190
Ar+0.188
Mill+0.186
 possible+0.183
Welcome+0.179
Neg logit contributions
okes-0.111
ish-0.111
our-0.104
ours-0.103
ap-0.101
Token Sara
Token ID24799
Feature activation+3.75e-01

Pos logit contributions
Ar+0.037
Mill+0.036
Welcome+0.036
 distance+0.035
Ab+0.035
Neg logit contributions
ish-0.040
our-0.036
okes-0.035
int-0.035
ishes-0.034
Token thought
Token ID1807
Feature activation+1.09e+00

Pos logit contributions
Ar+0.038
 distance+0.037
Mill+0.036
Welcome+0.034
Ab+0.033
Neg logit contributions
ish-0.038
okes-0.037
ours-0.037
ap-0.036
our-0.035
Token they
Token ID484
Feature activation+8.43e-01

Pos logit contributions
 distance+0.086
 possible+0.086
Ar+0.084
Mill+0.084
Welcome+0.079
Neg logit contributions
ish-0.137
okes-0.135
our-0.131
ap-0.129
ours-0.128
Token cars
Token ID5006
Feature activation-2.26e-01

Pos logit contributions
Ar+0.139
 distance+0.136
 winding+0.127
 path+0.120
Mill+0.118
Neg logit contributions
our-0.134
ish-0.130
int-0.126
ours-0.126
okes-0.125
Token of
Token ID286
Feature activation+1.34e+00

Pos logit contributions
ish+0.021
okes+0.021
ours+0.021
our+0.021
int+0.020
Neg logit contributions
 distance-0.025
Ar-0.024
 possible-0.024
Welcome-0.022
Mill-0.022
Token different
Token ID1180
Feature activation-1.09e-01

Pos logit contributions
Ar+0.113
 distance+0.108
Mill+0.100
 possible+0.099
Welcome+0.096
Neg logit contributions
ish-0.175
okes-0.171
our-0.167
int-0.166
ours-0.165
Token colors
Token ID7577
Feature activation+1.82e+00

Pos logit contributions
ish+0.013
okes+0.013
ours+0.013
our+0.012
int+0.012
Neg logit contributions
Ar-0.009
 distance-0.009
 possible-0.008
Welcome-0.008
Mill-0.008
Token and
Token ID290
Feature activation+2.32e-01

Pos logit contributions
 distance+0.241
Ar+0.237
Mill+0.235
 possible+0.234
Welcome+0.228
Neg logit contributions
okes-0.132
ish-0.131
our-0.123
ours-0.122
ap-0.119
Token sizes
Token ID10620
Feature activation+4.59e+00

Pos logit contributions
Mill+0.023
 distance+0.023
Ab+0.022
Ar+0.022
Welcome+0.021
Neg logit contributions
 favorites-0.024
okes-0.024
ish-0.024
our-0.023
ap-0.023
Token.
Token ID13
Feature activation+4.88e-01

Pos logit contributions
 possible+0.510
 distance+0.507
Ar+0.491
Mill+0.481
Welcome+0.478
Neg logit contributions
okes-0.420
ish-0.411
ours-0.396
our-0.390
int-0.389
Token He
Token ID679
Feature activation+1.21e-01

Pos logit contributions
 distance+0.047
Ar+0.045
Welcome+0.045
Ab+0.043
 possible+0.043
Neg logit contributions
ish-0.058
ishes-0.053
okes-0.052
our-0.051
ap-0.051
Token likes
Token ID7832
Feature activation-7.37e-01

Pos logit contributions
Ar+0.013
 distance+0.011
 possible+0.011
Welcome+0.011
 winding+0.011
Neg logit contributions
ish-0.012
okes-0.012
our-0.012
int-0.012
ap-0.011
Token to
Token ID284
Feature activation+3.45e-01

Pos logit contributions
okes+0.070
ish+0.070
our+0.066
ours+0.066
ap+0.065
Neg logit contributions
 distance-0.081
Ar-0.080
Mill-0.080
 possible-0.078
Welcome-0.076
Token make
Token ID787
Feature activation+4.88e-01

Pos logit contributions
 distance+0.036
Ar+0.036
Mill+0.034
 possible+0.034
Welcome+0.033
Neg logit contributions
okes-0.035
ish-0.035
ours-0.033
our-0.033
 favorites-0.032
Token.
Token ID13
Feature activation+3.45e-01

Pos logit contributions
Ar+0.086
 distance+0.084
Mill+0.084
 possible+0.082
Welcome+0.081
Neg logit contributions
okes-0.040
ish-0.039
our-0.037
ours-0.036
ap-0.035
Token Some
Token ID2773
Feature activation+1.90e-01

Pos logit contributions
Ar+0.037
Mill+0.037
Welcome+0.036
 distance+0.035
Ab+0.035
Neg logit contributions
ish-0.036
our-0.032
okes-0.032
int-0.031
ishes-0.031
Token had
Token ID550
Feature activation+8.28e-01

Pos logit contributions
 distance+0.020
Ar+0.020
Mill+0.019
 possible+0.019
Welcome+0.018
Neg logit contributions
okes-0.019
ish-0.019
our-0.018
ours-0.018
ap-0.018
Token long
Token ID890
Feature activation+1.95e+00

Pos logit contributions
 distance+0.091
Mill+0.091
Ar+0.090
Welcome+0.088
 possible+0.088
Neg logit contributions
okes-0.078
ish-0.074
ours-0.072
our-0.071
 favorites-0.071
Token t
Token ID256
Feature activation+5.45e-01

Pos logit contributions
 distance+0.258
Ar+0.255
Mill+0.254
 possible+0.250
Welcome+0.246
Neg logit contributions
okes-0.144
ish-0.143
our-0.135
ours-0.132
ap-0.130
Tokenus
Token ID385
Feature activation+4.59e+00

Pos logit contributions
 distance+0.070
Ar+0.067
 possible+0.067
Mill+0.065
Welcome+0.065
Neg logit contributions
okes-0.047
ish-0.046
our-0.043
ours-0.042
ap-0.041
Tokenks
Token ID591
Feature activation+1.48e+00

Pos logit contributions
 distance+0.559
Ar+0.551
Mill+0.546
 possible+0.539
Welcome+0.527
Neg logit contributions
okes-0.385
ish-0.385
our-0.364
ours-0.359
ap-0.354
Token and
Token ID290
Feature activation+3.42e-01

Pos logit contributions
 distance+0.193
Ar+0.191
Mill+0.189
 possible+0.187
Welcome+0.183
Neg logit contributions
okes-0.110
ish-0.110
our-0.103
ours-0.102
ap-0.100
Token some
Token ID617
Feature activation+5.65e-01

Pos logit contributions
 distance+0.041
Mill+0.040
Ar+0.040
 possible+0.038
Welcome+0.038
Neg logit contributions
okes-0.031
ours-0.028
ish-0.028
our-0.028
 favorites-0.027
Token had
Token ID550
Feature activation+7.79e-01

Pos logit contributions
Mill+0.059
Ar+0.059
 distance+0.058
 possible+0.057
Welcome+0.055
Neg logit contributions
okes-0.056
ish-0.056
our-0.053
ap-0.053
ours-0.053
Token short
Token ID1790
Feature activation+8.68e-02

Pos logit contributions
 distance+0.088
Mill+0.087
Ar+0.086
 possible+0.084
Welcome+0.083
Neg logit contributions
okes-0.072
ish-0.068
ours-0.066
 favorites-0.066
our-0.065
Token clown
Token ID25573
Feature activation-8.76e-02

Pos logit contributions
okes+0.097
ish+0.093
 favorites+0.092
ours+0.091
ap+0.087
Neg logit contributions
Ar-0.152
 distance-0.146
Mill-0.146
 possible-0.143
Welcome-0.139
Tokens
Token ID82
Feature activation+1.82e+00

Pos logit contributions
okes+0.009
ish+0.009
our+0.009
ours+0.009
ap+0.009
Neg logit contributions
 distance-0.008
Ar-0.008
Mill-0.008
 possible-0.008
Welcome-0.008
Token juggling
Token ID48097
Feature activation+1.07e+00

Pos logit contributions
 distance+0.233
Ar+0.232
Mill+0.229
 possible+0.226
Welcome+0.221
Neg logit contributions
ish-0.139
okes-0.139
our-0.131
ours-0.129
ap-0.128
Token and
Token ID290
Feature activation-3.03e-01

Pos logit contributions
 distance+0.125
Ar+0.124
Mill+0.122
 possible+0.121
Welcome+0.118
Neg logit contributions
okes-0.094
ish-0.094
our-0.089
ours-0.088
ap-0.087
Token ac
Token ID936
Feature activation-4.54e-02

Pos logit contributions
okes+0.028
ish+0.026
 favorites+0.026
ap+0.025
 brother+0.025
Neg logit contributions
Mill-0.033
 distance-0.032
Ar-0.032
 possible-0.031
Ab-0.031
Tokenrob
Token ID22609
Feature activation+4.59e+00

Pos logit contributions
ish+0.005
okes+0.005
our+0.005
ours+0.005
ap+0.005
Neg logit contributions
 distance-0.004
Mill-0.004
 possible-0.004
Ar-0.004
Welcome-0.004
Tokenats
Token ID1381
Feature activation-5.05e-01

Pos logit contributions
Ar+0.491
Mill+0.481
Ab+0.473
 possible+0.466
lled+0.455
Neg logit contributions
ish-0.315
our-0.303
ours-0.301
 brother-0.284
ap-0.274
Token doing
Token ID1804
Feature activation+4.60e-01

Pos logit contributions
ish+0.035
okes+0.035
our+0.033
ap+0.032
ours+0.032
Neg logit contributions
Ar-0.069
Mill-0.068
 distance-0.067
 possible-0.066
Ab-0.065
Token flips
Token ID45971
Feature activation+6.81e-01

Pos logit contributions
 distance+0.048
Ar+0.048
Mill+0.047
 possible+0.046
Welcome+0.045
Neg logit contributions
okes-0.046
ish-0.046
ours-0.044
our-0.043
ap-0.043
Token.
Token ID13
Feature activation-7.69e-02

Pos logit contributions
Mill+0.091
Ar+0.089
 distance+0.089
Ab+0.086
Welcome+0.085
Neg logit contributions
ish-0.046
okes-0.045
ap-0.044
 anymore-0.044
our-0.041
Token But
Token ID887
Feature activation-6.79e-01

Pos logit contributions
ish+0.008
okes+0.008
our+0.008
ours+0.008
ap+0.007
Neg logit contributions
Mill-0.008
 distance-0.008
Ar-0.008
Welcome-0.007
Ab-0.007
Token sure
Token ID1654
Feature activation-1.41e+00

Pos logit contributions
okes+0.015
ish+0.015
our+0.015
ours+0.015
ap+0.014
Neg logit contributions
 possible-0.015
 distance-0.015
Ar-0.015
Mill-0.015
Ab-0.014
Token his
Token ID465
Feature activation+3.65e-03

Pos logit contributions
ish+0.149
okes+0.149
our+0.142
ours+0.142
ap+0.142
Neg logit contributions
Ar-0.140
 distance-0.137
Mill-0.136
Welcome-0.129
 possible-0.127
Token z
Token ID1976
Feature activation+8.11e-01

Pos logit contributions
 distance+0.000
 possible+0.000
Ar+0.000
Mill+0.000
Welcome+0.000
Neg logit contributions
okes-0.000
ish-0.000
ours-0.000
our-0.000
ap-0.000
Tokenig
Token ID328
Feature activation-1.70e+00

Pos logit contributions
Ar+0.101
 possible+0.100
fl+0.089
 seek+0.087
 pel+0.087
Neg logit contributions
 anymore-0.044
ours-0.043
 about-0.040
okes-0.040
 So-0.038
Tokenz
Token ID89
Feature activation+1.23e+00

Pos logit contributions
ish+0.178
okes+0.177
our+0.170
ours+0.168
ap+0.167
Neg logit contributions
 distance-0.172
Ar-0.169
Mill-0.167
 possible-0.163
Welcome-0.160
Tokenags
Token ID3775
Feature activation-4.25e+00

Pos logit contributions
 possible+0.126
Ar+0.116
 distance+0.116
Welcome+0.110
Ab+0.110
Neg logit contributions
ours-0.114
okes-0.113
 favorites-0.103
ish-0.102
our-0.100
Token were
Token ID547
Feature activation-2.02e-02

Pos logit contributions
ish+0.329
our+0.311
okes+0.310
ap+0.308
ours+0.285
Neg logit contributions
Ar-0.553
Mill-0.535
 distance-0.531
Welcome-0.530
Ab-0.513
Token neat
Token ID15049
Feature activation-1.42e-01

Pos logit contributions
ish+0.002
okes+0.002
our+0.002
ap+0.002
ours+0.002
Neg logit contributions
Ar-0.002
 distance-0.002
Mill-0.002
Welcome-0.002
Ab-0.002
Token and
Token ID290
Feature activation-3.33e-01

Pos logit contributions
okes+0.010
ish+0.010
our+0.010
ours+0.009
ap+0.009
Neg logit contributions
 distance-0.019
Ar-0.019
Mill-0.018
 possible-0.018
Welcome-0.018
Token tidy
Token ID43044
Feature activation-5.95e-01

Pos logit contributions
ish+0.029
our+0.028
ap+0.027
 promises+0.027
 favorites+0.024
Neg logit contributions
Mill-0.039
Ar-0.039
Ab-0.038
 distance-0.037
Welcome-0.036
Token.
Token ID13
Feature activation-3.68e-01

Pos logit contributions
ish+0.043
okes+0.040
our+0.040
ap+0.038
ours+0.037
Neg logit contributions
Ar-0.080
Mill-0.079
 distance-0.079
Welcome-0.076
Ab-0.076
Token lines
Token ID3951
Feature activation+1.83e+00

Pos logit contributions
okes+0.050
ours+0.048
ish+0.048
ap+0.047
int+0.047
Neg logit contributions
 distance-0.043
Ar-0.041
 possible-0.040
 path-0.038
 winding-0.037
Token to
Token ID284
Feature activation+5.67e-03

Pos logit contributions
 distance+0.238
Ar+0.235
 possible+0.234
Mill+0.231
Welcome+0.225
Neg logit contributions
okes-0.136
ish-0.135
ours-0.127
our-0.126
ap-0.122
Token z
Token ID1976
Feature activation+8.56e-01

Pos logit contributions
 distance+0.001
Ar+0.001
 possible+0.001
Mill+0.001
Welcome+0.001
Neg logit contributions
okes-0.001
ish-0.001
ours-0.001
our-0.001
ap-0.001
Tokenig
Token ID328
Feature activation-1.51e+00

Pos logit contributions
 possible+0.086
Ar+0.080
Car+0.078
Back+0.076
Jay+0.076
Neg logit contributions
 tum-0.062
 appetite-0.059
 favorites-0.053
ish-0.053
 mouths-0.053
Tokenz
Token ID89
Feature activation+1.31e+00

Pos logit contributions
ish+0.145
okes+0.145
ours+0.138
our+0.137
ap+0.135
Neg logit contributions
 distance-0.160
 possible-0.159
Mill-0.158
Ar-0.158
Welcome-0.150
Tokenags
Token ID3775
Feature activation-4.24e+00

Pos logit contributions
 possible+0.147
Back+0.144
iest+0.141
Ar+0.137
lling+0.136
Neg logit contributions
ours-0.085
okes-0.078
house-0.076
ish-0.075
 favorites-0.075
Token.
Token ID13
Feature activation-1.21e-01

Pos logit contributions
ish+0.310
okes+0.304
our+0.294
ap+0.283
ours+0.277
Neg logit contributions
Ar-0.565
 distance-0.557
Mill-0.552
Welcome-0.544
Ab-0.535
Token
Token ID198
Feature activation+3.27e-01

Pos logit contributions
okes+0.017
ish+0.017
our+0.017
ishes+0.017
ours+0.016
Neg logit contributions
Ar-0.008
 distance-0.008
Welcome-0.007
Mill-0.007
 moment-0.006
Token
Token ID198
Feature activation+3.15e-01

Pos logit contributions
 distance+0.032
Ar+0.032
Mill+0.031
 possible+0.030
Welcome+0.029
Neg logit contributions
ish-0.036
okes-0.035
our-0.034
ours-0.034
ap-0.033
TokenOne
Token ID3198
Feature activation-3.21e-01

Pos logit contributions
 distance+0.027
Mill+0.027
 possible+0.026
Ar+0.026
Welcome+0.025
Neg logit contributions
okes-0.036
ish-0.036
ap-0.035
our-0.035
ours-0.035
Token day
Token ID1110
Feature activation-2.34e-02

Pos logit contributions
okes+0.047
ish+0.047
our+0.045
ours+0.045
ap+0.044
Neg logit contributions
 distance-0.020
Ar-0.019
Mill-0.019
 possible-0.018
Welcome-0.017
Token enjoyed
Token ID8359
Feature activation+3.83e-01

Pos logit contributions
okes+0.030
ish+0.030
ours+0.029
our+0.028
ap+0.028
Neg logit contributions
Ar-0.026
 distance-0.026
Mill-0.025
Welcome-0.024
 possible-0.024
Token writing
Token ID3597
Feature activation+9.34e-02

Pos logit contributions
 distance+0.053
Ar+0.052
Mill+0.051
 possible+0.051
Welcome+0.050
Neg logit contributions
okes-0.024
ish-0.023
our-0.023
ap-0.023
ours-0.022
Token z
Token ID1976
Feature activation+8.66e-01

Pos logit contributions
Ar+0.010
 distance+0.009
 possible+0.009
Mill+0.009
Welcome+0.009
Neg logit contributions
okes-0.010
ish-0.010
ours-0.009
our-0.009
ap-0.009
Tokenig
Token ID328
Feature activation-1.55e+00

Pos logit contributions
Ar+0.102
 possible+0.100
Mill+0.092
Ab+0.092
Kay+0.089
Neg logit contributions
ish-0.059
ours-0.057
 anymore-0.053
ishes-0.053
 oven-0.051
Tokenz
Token ID89
Feature activation+1.33e+00

Pos logit contributions
ish+0.167
okes+0.167
our+0.160
ours+0.158
ap+0.157
Neg logit contributions
 distance-0.152
Ar-0.149
Mill-0.147
 possible-0.144
Welcome-0.141
Tokenags
Token ID3775
Feature activation-4.22e+00

Pos logit contributions
 possible+0.139
Ar+0.134
Ab+0.132
Mill+0.132
 distance+0.131
Neg logit contributions
ours-0.114
okes-0.105
ish-0.104
ishes-0.098
ids-0.096
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
ish+0.299
okes+0.294
our+0.288
ap+0.281
 favorites+0.268
Neg logit contributions
Ar-0.567
 distance-0.560
Mill-0.556
Welcome-0.549
Ab-0.540
Token He
Token ID679
Feature activation-2.81e-01

Pos logit contributions
 distance+0.002
Ar+0.002
Welcome+0.002
 possible+0.002
Hello+0.002
Neg logit contributions
ish-0.002
okes-0.002
ishes-0.002
our-0.002
ours-0.002
Token wrote
Token ID2630
Feature activation-5.76e-02

Pos logit contributions
okes+0.033
ish+0.033
ours+0.032
our+0.031
ap+0.031
Neg logit contributions
Ar-0.024
 distance-0.024
 possible-0.023
Mill-0.022
Welcome-0.022
Token z
Token ID1976
Feature activation+8.57e-01

Pos logit contributions
ish+0.005
okes+0.005
our+0.005
ours+0.005
ap+0.004
Neg logit contributions
 distance-0.007
Ar-0.007
Mill-0.007
 possible-0.007
Welcome-0.007
Tokenig
Token ID328
Feature activation-1.47e+00

Pos logit contributions
Ar+0.095
 possible+0.089
fl+0.088
Ab+0.085
Mill+0.085
Neg logit contributions
ish-0.058
 anymore-0.057
 oven-0.056
ravings-0.055
ours-0.054
Token around
Token ID1088
Feature activation-3.63e-01

Pos logit contributions
ish+0.039
okes+0.035
ap+0.035
our+0.034
 at+0.033
Neg logit contributions
Ar-0.054
Ab-0.053
Mill-0.053
 possible-0.051
Welcome-0.050
Token in
Token ID287
Feature activation-4.82e-01

Pos logit contributions
ish+0.029
okes+0.029
our+0.028
ap+0.028
ours+0.027
Neg logit contributions
Ar-0.045
 distance-0.044
Mill-0.044
 possible-0.043
Welcome-0.042
Token z
Token ID1976
Feature activation+8.80e-01

Pos logit contributions
ish+0.044
okes+0.044
ap+0.043
 favorites+0.042
ours+0.041
Neg logit contributions
Mill-0.053
Ar-0.052
 possible-0.051
Ab-0.050
 distance-0.050
Tokenig
Token ID328
Feature activation-1.20e+00

Pos logit contributions
Ab+0.105
 distance+0.101
Kay+0.101
Pat+0.101
Ar+0.101
Neg logit contributions
 favorites-0.056
ish-0.051
ishes-0.049
 anymore-0.047
 brother-0.047
Tokenz
Token ID89
Feature activation+1.49e+00

Pos logit contributions
okes+0.119
ish+0.119
our+0.113
ours+0.112
ap+0.111
Neg logit contributions
 distance-0.128
Ar-0.127
Mill-0.125
 possible-0.123
Welcome-0.120
Tokenags
Token ID3775
Feature activation-4.22e+00

Pos logit contributions
 distance+0.181
Ab+0.178
Ar+0.172
Mill+0.171
 possible+0.170
Neg logit contributions
ours-0.098
 favorites-0.088
ish-0.088
ccoli-0.088
okes-0.084
Token.
Token ID13
Feature activation-4.05e-01

Pos logit contributions
ish+0.215
ap+0.205
okes+0.196
our+0.193
 at+0.180
Neg logit contributions
Ar-0.675
Welcome-0.650
Mill-0.635
Ab-0.616
 distance-0.614
Token
Token ID198
Feature activation+1.63e-02

Pos logit contributions
okes+0.039
ish+0.038
ours+0.036
our+0.036
ap+0.036
Neg logit contributions
 distance-0.045
Mill-0.043
 possible-0.043
Ar-0.043
Welcome-0.042
Token
Token ID198
Feature activation+4.54e-02

Pos logit contributions
 distance+0.002
 possible+0.002
Welcome+0.002
Mill+0.002
Ar+0.002
Neg logit contributions
okes-0.001
ap-0.001
ours-0.001
ish-0.001
 favorites-0.001
TokenWhen
Token ID2215
Feature activation-7.69e-02

Pos logit contributions
 possible+0.004
 distance+0.004
keeping+0.004
Mill+0.004
Welcome+0.003
Neg logit contributions
ap-0.004
okes-0.004
int-0.004
 So-0.004
ig-0.004
Token Zig
Token ID24992
Feature activation+5.02e-01

Pos logit contributions
ap+0.007
ish+0.007
 at+0.007
 anymore+0.007
okes+0.006
Neg logit contributions
Mill-0.008
keeping-0.008
 distance-0.007
 possible-0.007
Ar-0.007
Token He
Token ID679
Feature activation-2.81e-01

Pos logit contributions
 distance+0.002
Ar+0.002
Welcome+0.002
 possible+0.002
Hello+0.002
Neg logit contributions
ish-0.002
okes-0.002
ishes-0.002
our-0.002
ours-0.002
Token wrote
Token ID2630
Feature activation-5.76e-02

Pos logit contributions
okes+0.033
ish+0.033
ours+0.032
our+0.031
ap+0.031
Neg logit contributions
Ar-0.024
 distance-0.024
 possible-0.023
Mill-0.022
Welcome-0.022
Token z
Token ID1976
Feature activation+8.57e-01

Pos logit contributions
ish+0.005
okes+0.005
our+0.005
ours+0.005
ap+0.004
Neg logit contributions
 distance-0.007
Ar-0.007
Mill-0.007
 possible-0.007
Welcome-0.007
Tokenig
Token ID328
Feature activation-1.47e+00

Pos logit contributions
Ar+0.095
 possible+0.089
fl+0.088
Ab+0.085
Mill+0.085
Neg logit contributions
ish-0.058
 anymore-0.057
 oven-0.056
ravings-0.055
ours-0.054
Tokenz
Token ID89
Feature activation+1.35e+00

Pos logit contributions
ish+0.169
okes+0.169
our+0.163
ours+0.160
ap+0.159
Neg logit contributions
 distance-0.135
Ar-0.132
Mill-0.130
 possible-0.126
Welcome-0.125
Tokenags
Token ID3775
Feature activation-4.21e+00

Pos logit contributions
 possible+0.154
iest+0.148
Ar+0.147
 distance+0.147
Ab+0.146
Neg logit contributions
ours-0.092
okes-0.088
ish-0.086
ishes-0.083
house-0.081
Token on
Token ID319
Feature activation+6.49e-01

Pos logit contributions
ish+0.327
okes+0.323
our+0.322
ap+0.318
 favorites+0.302
Neg logit contributions
Ar-0.528
Welcome-0.519
Mill-0.517
 distance-0.513
Ab-0.505
Token walls
Token ID7714
Feature activation+4.95e-01

Pos logit contributions
 distance+0.067
Mill+0.066
Ar+0.065
 possible+0.065
Welcome+0.062
Neg logit contributions
ish-0.065
okes-0.065
our-0.062
ap-0.061
 favorites-0.061
Token,
Token ID11
Feature activation+7.17e-02

Pos logit contributions
 distance+0.062
 possible+0.060
Ar+0.059
Mill+0.058
Welcome+0.056
Neg logit contributions
okes-0.041
ish-0.040
ours-0.039
our-0.037
ishes-0.036
Token floors
Token ID18570
Feature activation+1.21e+00

Pos logit contributions
Mill+0.009
Ar+0.009
 distance+0.009
 possible+0.009
Ab+0.009
Neg logit contributions
okes-0.006
ish-0.006
our-0.005
ours-0.005
ap-0.005
Token,
Token ID11
Feature activation+5.11e-02

Pos logit contributions
 distance+0.154
 possible+0.150
Ar+0.147
Mill+0.144
Welcome+0.140
Neg logit contributions
ish-0.094
okes-0.093
ours-0.087
int-0.083
ishes-0.083
Token so
Token ID523
Feature activation+4.42e-01

Pos logit contributions
ish+0.136
okes+0.136
ours+0.129
our+0.127
ap+0.127
Neg logit contributions
 distance-0.168
Ar-0.167
Mill-0.165
 possible-0.158
Welcome-0.158
Token many
Token ID867
Feature activation+7.55e-01

Pos logit contributions
 distance+0.047
Ar+0.046
Mill+0.045
 possible+0.042
Ab+0.042
Neg logit contributions
okes-0.043
ours-0.042
ish-0.041
ap-0.040
our-0.039
Token z
Token ID1976
Feature activation+8.66e-01

Pos logit contributions
 distance+0.075
Ar+0.075
Mill+0.074
 possible+0.071
Welcome+0.070
Neg logit contributions
okes-0.079
ish-0.079
our-0.075
ours-0.074
ap-0.073
Tokenig
Token ID328
Feature activation-1.51e+00

Pos logit contributions
 possible+0.119
Ar+0.113
Mill+0.111
Ab+0.109
 distance+0.107
Neg logit contributions
 favorites-0.052
ours-0.050
okes-0.047
ish-0.047
ishes-0.044
Tokenz
Token ID89
Feature activation+1.36e+00

Pos logit contributions
ish+0.147
okes+0.146
our+0.141
ap+0.137
ours+0.135
Neg logit contributions
 distance-0.167
Ar-0.167
Mill-0.161
Welcome-0.157
 possible-0.153
Tokenags
Token ID3775
Feature activation-4.15e+00

Pos logit contributions
 possible+0.150
 distance+0.143
Ab+0.142
Mill+0.141
Ar+0.140
Neg logit contributions
ours-0.112
okes-0.112
 favorites-0.110
ish-0.098
our-0.097
Token,"
Token ID553
Feature activation-3.61e-01

Pos logit contributions
ish+0.343
okes+0.335
ap+0.320
our+0.320
ours+0.310
Neg logit contributions
Ar-0.516
 distance-0.505
Mill-0.497
Welcome-0.489
 possible-0.478
Token he
Token ID339
Feature activation+4.97e-01

Pos logit contributions
ish+0.040
okes+0.039
our+0.038
ap+0.037
ours+0.037
Neg logit contributions
 distance-0.034
Ar-0.033
 possible-0.033
Mill-0.032
Welcome-0.031
Token said
Token ID531
Feature activation-7.30e-01

Pos logit contributions
 distance+0.056
Ar+0.055
Mill+0.054
 possible+0.054
Welcome+0.053
Neg logit contributions
ish-0.046
okes-0.046
our-0.044
ours-0.043
ap-0.043
Token nervously
Token ID43324
Feature activation-9.93e-02

Pos logit contributions
ish+0.056
okes+0.056
our+0.053
ours+0.052
ap+0.052
Neg logit contributions
 distance-0.093
Ar-0.092
Mill-0.091
 possible-0.090
Welcome-0.088
Token.
Token ID13
Feature activation-2.53e-01

Pos logit contributions
ish+0.007
okes+0.007
our+0.007
ap+0.007
ours+0.007
Neg logit contributions
 distance-0.013
Ar-0.013
Mill-0.013
 possible-0.013
Welcome-0.013
Token running
Token ID2491
Feature activation+9.46e-01

Pos logit contributions
okes+0.124
our+0.124
ish+0.123
ap+0.122
ours+0.120
Neg logit contributions
Ar-0.112
 distance-0.108
Mill-0.100
Welcome-0.100
 possible-0.096
Token around
Token ID1088
Feature activation-3.21e-01

Pos logit contributions
Ar+0.117
Mill+0.116
 distance+0.116
 possible+0.113
Welcome+0.112
Neg logit contributions
ish-0.077
okes-0.075
our-0.071
ap-0.071
ours-0.069
Token with
Token ID351
Feature activation+7.22e-03

Pos logit contributions
okes+0.029
ish+0.029
our+0.028
ours+0.027
ap+0.027
Neg logit contributions
 distance-0.037
Ar-0.036
Mill-0.036
 possible-0.035
Welcome-0.035
Token the
Token ID262
Feature activation-4.50e-01

Pos logit contributions
Ar+0.001
 distance+0.001
Mill+0.001
 possible+0.001
Welcome+0.001
Neg logit contributions
ish-0.001
okes-0.001
our-0.001
ours-0.001
ap-0.001
Token r
Token ID374
Feature activation-3.50e-01

Pos logit contributions
ish+0.043
okes+0.041
our+0.040
ours+0.039
 favorites+0.039
Neg logit contributions
Ar-0.050
 distance-0.049
 possible-0.049
Mill-0.048
Ab-0.047
Tokenags
Token ID3775
Feature activation-4.15e+00

Pos logit contributions
ish+0.025
okes+0.025
 favorites+0.025
ours+0.024
ap+0.024
Neg logit contributions
Mill-0.043
 distance-0.040
 possible-0.039
Ar-0.039
Ab-0.038
Token,
Token ID11
Feature activation-3.18e-01

Pos logit contributions
ish+0.306
okes+0.298
ap+0.285
our+0.285
ours+0.277
Neg logit contributions
Ar-0.554
 distance-0.537
Mill-0.532
Welcome-0.523
 possible-0.519
Token so
Token ID523
Feature activation+6.03e-01

Pos logit contributions
okes+0.030
ish+0.030
our+0.028
ours+0.028
ap+0.028
Neg logit contributions
 distance-0.036
Ar-0.035
Mill-0.035
 possible-0.034
Welcome-0.033
Token this
Token ID428
Feature activation-1.55e-01

Pos logit contributions
Ar+0.075
 distance+0.071
Welcome+0.068
Mill+0.067
Ab+0.067
Neg logit contributions
ap-0.054
ish-0.050
 promises-0.049
okes-0.049
ours-0.049
Token kind
Token ID1611
Feature activation-7.89e-01

Pos logit contributions
ish+0.017
okes+0.017
our+0.016
ours+0.016
ap+0.016
Neg logit contributions
Ar-0.015
 distance-0.015
Mill-0.015
 possible-0.014
Welcome-0.014
Token of
Token ID286
Feature activation+8.68e-01

Pos logit contributions
ish+0.075
okes+0.072
our+0.070
ap+0.070
ours+0.069
Neg logit contributions
Ar-0.089
 distance-0.086
Mill-0.086
 possible-0.085
Welcome-0.082
Token of
Token ID286
Feature activation+1.00e+00

Pos logit contributions
 distance+0.007
Ar+0.007
Mill+0.007
 possible+0.007
Welcome+0.006
Neg logit contributions
okes-0.006
ish-0.006
our-0.006
ours-0.006
ap-0.006
Token her
Token ID607
Feature activation-1.28e-01

Pos logit contributions
Ar+0.101
 distance+0.101
Mill+0.098
 possible+0.096
Welcome+0.095
Neg logit contributions
okes-0.103
ish-0.102
ours-0.099
our-0.097
 favorites-0.096
Token mom
Token ID1995
Feature activation-6.07e-01

Pos logit contributions
okes+0.012
ish+0.011
int+0.011
ap+0.011
ours+0.011
Neg logit contributions
 distance-0.014
 possible-0.013
Ar-0.013
Mill-0.013
Ab-0.013
Token's
Token ID338
Feature activation-1.40e+00

Pos logit contributions
okes+0.049
ish+0.046
ours+0.045
ap+0.044
our+0.044
Neg logit contributions
 distance-0.077
Mill-0.075
 possible-0.073
Welcome-0.072
Ar-0.071
Token r
Token ID374
Feature activation-1.00e-01

Pos logit contributions
okes+0.129
ish+0.122
our+0.117
ours+0.117
ap+0.114
Neg logit contributions
 distance-0.165
Ar-0.156
Mill-0.154
 possible-0.151
Welcome-0.148
Tokenags
Token ID3775
Feature activation-4.13e+00

Pos logit contributions
ish+0.007
okes+0.007
our+0.007
 favorites+0.007
 promises+0.007
Neg logit contributions
Mill-0.012
Ab-0.011
 distance-0.011
keeping-0.011
Ar-0.011
Token and
Token ID290
Feature activation-2.04e-01

Pos logit contributions
ish+0.258
our+0.244
okes+0.244
ap+0.238
ours+0.223
Neg logit contributions
Ar-0.607
Welcome-0.582
 distance-0.568
Mill-0.561
 possible-0.555
Token soap
Token ID19533
Feature activation+5.65e-01

Pos logit contributions
ish+0.019
okes+0.018
ours+0.018
our+0.018
ap+0.017
Neg logit contributions
Ar-0.024
 distance-0.023
Mill-0.023
 possible-0.022
Welcome-0.022
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
 distance+0.072
Ar+0.072
Mill+0.071
 possible+0.070
Welcome+0.068
Neg logit contributions
ish-0.044
okes-0.044
our-0.041
ours-0.041
ap-0.040
Token She
Token ID1375
Feature activation-4.19e-01

Pos logit contributions
ish+0.014
okes+0.014
our+0.014
ap+0.013
ours+0.013
Neg logit contributions
Mill-0.013
 distance-0.013
Ar-0.013
Welcome-0.013
Ab-0.012
Token washed
Token ID18989
Feature activation-4.44e-01

Pos logit contributions
okes+0.048
ish+0.048
our+0.047
ap+0.046
ours+0.046
Neg logit contributions
 distance-0.037
Ar-0.037
Mill-0.037
 possible-0.035
Ab-0.034
Token him
Token ID683
Feature activation-7.57e-01

Pos logit contributions
okes+0.060
ish+0.060
ours+0.058
int+0.056
our+0.055
Neg logit contributions
 distance-0.049
 possible-0.048
Mill-0.047
Welcome-0.044
Ar-0.044
Token to
Token ID284
Feature activation+3.05e-02

Pos logit contributions
okes+0.088
ours+0.082
ish+0.079
int+0.078
our+0.076
Neg logit contributions
 possible-0.073
 distance-0.070
Mill-0.064
 path-0.063
 moment-0.063
Token turn
Token ID1210
Feature activation-7.94e-02

Pos logit contributions
 distance+0.003
Ar+0.003
Mill+0.003
 possible+0.003
Welcome+0.003
Neg logit contributions
okes-0.003
ish-0.003
ours-0.003
our-0.003
 favorites-0.003
Token the
Token ID262
Feature activation-1.77e-01

Pos logit contributions
okes+0.009
ish+0.009
our+0.008
ours+0.008
ap+0.008
Neg logit contributions
 distance-0.008
 possible-0.007
Ar-0.007
Mill-0.007
Welcome-0.007
Token r
Token ID374
Feature activation-2.52e-01

Pos logit contributions
okes+0.020
ish+0.019
ours+0.018
our+0.018
ap+0.018
Neg logit contributions
 distance-0.018
 possible-0.017
Ar-0.017
Welcome-0.016
Mill-0.016
Tokenags
Token ID3775
Feature activation-4.10e+00

Pos logit contributions
 favorites+0.020
okes+0.020
ish+0.019
ours+0.019
 anymore+0.019
Neg logit contributions
 possible-0.026
Ar-0.025
 distance-0.025
fl-0.025
Mill-0.024
Token she
Token ID673
Feature activation+6.32e-01

Pos logit contributions
ish+0.351
okes+0.346
our+0.331
ours+0.323
ap+0.323
Neg logit contributions
Ar-0.497
 distance-0.490
Mill-0.484
Welcome-0.471
 possible-0.467
Token kept
Token ID4030
Feature activation+9.42e-01

Pos logit contributions
Ar+0.057
 distance+0.056
Welcome+0.053
Mill+0.051
 possible+0.049
Neg logit contributions
ish-0.073
okes-0.072
ours-0.071
ap-0.070
our-0.069
Token in
Token ID287
Feature activation-3.08e-01

Pos logit contributions
 distance+0.101
Ar+0.101
Mill+0.099
 possible+0.096
Welcome+0.095
Neg logit contributions
ish-0.092
okes-0.092
our-0.089
ap-0.087
ours-0.086
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
ish+0.028
okes+0.028
our+0.027
ap+0.027
ours+0.027
Neg logit contributions
Ar-0.034
 distance-0.034
Mill-0.034
Welcome-0.032
 possible-0.032
Token closet
Token ID21615
Feature activation-4.22e-01

Pos logit contributions
okes+0.014
ish+0.013
our+0.013
ours+0.013
ishes+0.013
Neg logit contributions
 distance-0.014
Ar-0.013
Welcome-0.012
 possible-0.012
Mill-0.012
Token Jack
Token ID3619
Feature activation-2.81e-01

Pos logit contributions
ish+0.006
okes+0.006
ap+0.006
our+0.006
ours+0.005
Neg logit contributions
Ar-0.012
Mill-0.012
 distance-0.012
Ab-0.012
 possible-0.012
Token was
Token ID373
Feature activation+6.81e-02

Pos logit contributions
okes+0.036
ish+0.036
ours+0.036
ap+0.035
ags+0.035
Neg logit contributions
 distance-0.022
Ar-0.021
Mill-0.020
Welcome-0.020
Ab-0.019
Token turning
Token ID6225
Feature activation+5.04e-01

Pos logit contributions
 distance+0.006
 possible+0.006
Mill+0.006
 heading+0.005
 winding+0.005
Neg logit contributions
ish-0.008
ishes-0.008
okes-0.007
 favorites-0.007
ours-0.007
Token the
Token ID262
Feature activation-1.63e-01

Pos logit contributions
 distance+0.049
Mill+0.046
 possible+0.046
Ar+0.045
Welcome+0.043
Neg logit contributions
okes-0.055
ish-0.054
our-0.052
ours-0.052
ap-0.052
Token r
Token ID374
Feature activation-2.43e-01

Pos logit contributions
okes+0.016
ish+0.016
ours+0.015
our+0.015
ap+0.015
Neg logit contributions
 distance-0.018
Ar-0.017
Mill-0.017
 possible-0.017
Welcome-0.016
Tokenags
Token ID3775
Feature activation-4.08e+00

Pos logit contributions
ish+0.020
 favorites+0.019
okes+0.018
ours+0.018
ishes+0.018
Neg logit contributions
 distance-0.025
 possible-0.025
Mill-0.025
Ar-0.024
fl-0.024
Token,
Token ID11
Feature activation-1.79e-01

Pos logit contributions
ish+0.325
okes+0.321
our+0.306
ours+0.299
ap+0.297
Neg logit contributions
Ar-0.519
 distance-0.515
Mill-0.507
 possible-0.495
Welcome-0.492
Token he
Token ID339
Feature activation+6.28e-01

Pos logit contributions
ish+0.022
okes+0.022
our+0.021
ours+0.021
ap+0.021
Neg logit contributions
Ar-0.015
 distance-0.014
Mill-0.014
 possible-0.013
Welcome-0.013
Token looked
Token ID3114
Feature activation-1.83e+00

Pos logit contributions
 distance+0.063
Ar+0.062
Mill+0.061
 possible+0.058
Welcome+0.058
Neg logit contributions
ish-0.066
okes-0.066
ours-0.064
our-0.063
ap-0.062
Token out
Token ID503
Feature activation-1.64e+00

Pos logit contributions
okes+0.201
our+0.189
ap+0.186
ish+0.184
ours+0.182
Neg logit contributions
 distance-0.184
 possible-0.164
 winding-0.161
Mill-0.160
Welcome-0.151
Token the
Token ID262
Feature activation-9.84e-02

Pos logit contributions
okes+0.145
ish+0.145
our+0.137
ours+0.136
ap+0.135
Neg logit contributions
 distance-0.192
Ar-0.191
Mill-0.189
 possible-0.186
Welcome-0.181
Token writing
Token ID3597
Feature activation+9.38e-04

Pos logit contributions
okes+0.103
ap+0.101
ish+0.100
our+0.097
ours+0.095
Neg logit contributions
 distance-0.118
Ar-0.115
Mill-0.111
 possible-0.108
Welcome-0.106
Token his
Token ID465
Feature activation+2.35e-01

Pos logit contributions
 distance+0.000
Ar+0.000
Mill+0.000
 possible+0.000
Welcome+0.000
Neg logit contributions
ish-0.000
okes-0.000
our-0.000
ours-0.000
ap-0.000
Token z
Token ID1976
Feature activation+1.01e+00

Pos logit contributions
 distance+0.024
Ar+0.024
 possible+0.024
Mill+0.023
Welcome+0.022
Neg logit contributions
okes-0.024
ish-0.023
ours-0.023
our-0.022
ap-0.022
Tokenig
Token ID328
Feature activation-1.38e+00

Pos logit contributions
 possible+0.134
Ar+0.132
Mill+0.126
Ab+0.124
fl+0.123
Neg logit contributions
ours-0.055
ishes-0.050
okes-0.049
 anymore-0.049
ish-0.048
Tokenz
Token ID89
Feature activation+1.43e+00

Pos logit contributions
okes+0.134
ish+0.134
our+0.127
ours+0.127
ap+0.125
Neg logit contributions
 distance-0.148
Ar-0.146
Mill-0.145
 possible-0.143
Welcome-0.139
Tokenags
Token ID3775
Feature activation-4.07e+00

Pos logit contributions
 possible+0.150
 distance+0.144
Ab+0.138
iest+0.137
Ar+0.135
Neg logit contributions
ours-0.123
okes-0.120
 favorites-0.112
ish-0.109
ishes-0.105
Token,
Token ID11
Feature activation-2.20e-01

Pos logit contributions
ish+0.294
okes+0.283
our+0.278
ap+0.271
ours+0.257
Neg logit contributions
Ar-0.553
Mill-0.533
 distance-0.531
Welcome-0.525
Ab-0.515
Token he
Token ID339
Feature activation+5.42e-01

Pos logit contributions
ish+0.023
okes+0.023
our+0.023
ap+0.022
ours+0.022
Neg logit contributions
Ar-0.021
 distance-0.020
Mill-0.020
 possible-0.019
Welcome-0.019
Token heard
Token ID2982
Feature activation-3.26e-01

Pos logit contributions
 distance+0.050
Ar+0.049
Mill+0.047
 possible+0.046
Welcome+0.045
Neg logit contributions
okes-0.062
ish-0.061
ours-0.059
our-0.059
ap-0.058
Token a
Token ID257
Feature activation-2.48e-01

Pos logit contributions
okes+0.027
ish+0.026
ap+0.025
ours+0.025
our+0.025
Neg logit contributions
Ar-0.039
 distance-0.039
Mill-0.038
 possible-0.037
Welcome-0.036
Token funny
Token ID8258
Feature activation+9.06e-02

Pos logit contributions
ap+0.025
ish+0.025
our+0.024
okes+0.023
 brother+0.022
Neg logit contributions
 possible-0.025
Mill-0.024
Ar-0.024
keeping-0.023
Ab-0.023
Token She
Token ID1375
Feature activation-6.85e-01

Pos logit contributions
ish+0.052
okes+0.052
our+0.048
ishes+0.048
ap+0.048
Neg logit contributions
 distance-0.044
Mill-0.044
Ar-0.042
Ab-0.041
Welcome-0.041
Token found
Token ID1043
Feature activation-1.11e+00

Pos logit contributions
okes+0.068
ish+0.066
ours+0.064
our+0.064
ap+0.063
Neg logit contributions
 distance-0.074
Ar-0.073
Mill-0.072
 possible-0.069
Welcome-0.068
Token some
Token ID617
Feature activation+8.45e-03

Pos logit contributions
ish+0.097
okes+0.095
ours+0.091
our+0.087
 favorites+0.087
Neg logit contributions
Ar-0.133
 distance-0.133
Mill-0.128
 possible-0.127
Welcome-0.126
Token old
Token ID1468
Feature activation+8.27e-01

Pos logit contributions
 distance+0.001
Ar+0.001
Mill+0.001
 possible+0.001
Welcome+0.001
Neg logit contributions
ish-0.001
okes-0.001
ours-0.001
our-0.001
ap-0.001
Token r
Token ID374
Feature activation-2.42e-01

Pos logit contributions
 distance+0.094
Ar+0.094
Mill+0.092
 possible+0.092
Welcome+0.090
Neg logit contributions
ish-0.076
okes-0.075
our-0.071
ours-0.071
ap-0.069
Tokenags
Token ID3775
Feature activation-4.06e+00

Pos logit contributions
okes+0.016
ish+0.016
our+0.016
 favorites+0.016
ap+0.015
Neg logit contributions
Mill-0.031
Ab-0.029
 distance-0.029
Ar-0.029
 possible-0.028
Token which
Token ID543
Feature activation+1.66e-01

Pos logit contributions
ish+0.216
okes+0.195
our+0.189
ours+0.182
ap+0.177
Neg logit contributions
Ar-0.672
Welcome-0.631
 distance-0.598
Hello-0.590
 possible-0.588
Token were
Token ID547
Feature activation+1.11e-01

Pos logit contributions
Ar+0.019
Mill+0.017
 possible+0.017
Welcome+0.017
 distance+0.017
Neg logit contributions
ish-0.016
okes-0.014
ours-0.014
our-0.014
 favorites-0.014
Token dirty
Token ID11841
Feature activation+1.67e-01

Pos logit contributions
Ar+0.013
 distance+0.012
Welcome+0.012
Mill+0.011
 possible+0.011
Neg logit contributions
ish-0.010
okes-0.010
ours-0.010
 favorites-0.010
our-0.009
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
Ar+0.023
 distance+0.023
Mill+0.023
 possible+0.022
Welcome+0.022
Neg logit contributions
ish-0.012
okes-0.011
our-0.011
ap-0.010
ours-0.010
Token She
Token ID1375
Feature activation-4.45e-01

Pos logit contributions
ish+0.035
okes+0.034
our+0.032
ishes+0.032
ap+0.031
Neg logit contributions
Mill-0.024
 distance-0.024
Ab-0.023
Welcome-0.023
Oh-0.023
Token But
Token ID887
Feature activation-7.30e-01

Pos logit contributions
ish+0.026
okes+0.025
ishes+0.025
our+0.024
ours+0.024
Neg logit contributions
Welcome-0.017
Ar-0.017
 distance-0.017
Mill-0.017
 possible-0.015
Token the
Token ID262
Feature activation-2.39e-01

Pos logit contributions
ish+0.075
okes+0.075
our+0.072
ours+0.071
ap+0.071
Neg logit contributions
 distance-0.075
Ar-0.074
Mill-0.073
 possible-0.071
Welcome-0.070
Token z
Token ID1976
Feature activation+9.89e-01

Pos logit contributions
ish+0.027
okes+0.026
ap+0.025
our+0.025
 favorites+0.025
Neg logit contributions
Mill-0.022
Ar-0.021
 possible-0.021
 distance-0.021
Ab-0.020
Tokenig
Token ID328
Feature activation-1.32e+00

Pos logit contributions
Ar+0.105
 distance+0.098
 possible+0.095
 seek+0.095
boat+0.093
Neg logit contributions
ishes-0.077
 favorites-0.076
ours-0.074
 So-0.070
okes-0.068
Tokenz
Token ID89
Feature activation+1.45e+00

Pos logit contributions
ish+0.147
okes+0.146
our+0.140
ours+0.139
ap+0.138
Neg logit contributions
 distance-0.124
Ar-0.122
Mill-0.120
 possible-0.118
Welcome-0.115
Tokenags
Token ID3775
Feature activation-4.05e+00

Pos logit contributions
 distance+0.147
iest+0.147
 moment+0.139
Welcome+0.137
Oh+0.136
Neg logit contributions
 favorites-0.124
ours-0.123
okes-0.122
ishes-0.111
ids-0.109
Token weren
Token ID6304
Feature activation+1.14e-01

Pos logit contributions
ish+0.364
okes+0.348
our+0.344
ap+0.331
ours+0.317
Neg logit contributions
Ar-0.496
Mill-0.476
Welcome-0.466
 distance-0.460
Ab-0.456
Token't
Token ID470
Feature activation+1.55e+00

Pos logit contributions
 distance+0.012
Ar+0.012
Mill+0.011
Ab+0.011
keeping+0.011
Neg logit contributions
okes-0.011
ish-0.011
ours-0.010
our-0.010
 anymore-0.010
Token as
Token ID355
Feature activation+1.73e-01

Pos logit contributions
Ar+0.168
 distance+0.168
Mill+0.164
Welcome+0.159
 possible+0.159
Neg logit contributions
ish-0.148
okes-0.147
our-0.140
ours-0.137
ap-0.136
Token easy
Token ID2562
Feature activation-6.27e-01

Pos logit contributions
 distance+0.017
Ar+0.017
Mill+0.016
Welcome+0.016
Ab+0.016
Neg logit contributions
okes-0.017
ish-0.016
our-0.016
ap-0.016
 brother-0.015
Token as
Token ID355
Feature activation+1.67e-01

Pos logit contributions
ish+0.052
okes+0.051
ours+0.049
our+0.048
ap+0.047
Neg logit contributions
 distance-0.077
 possible-0.076
Ar-0.075
Mill-0.074
Welcome-0.073
Token around
Token ID1088
Feature activation-1.81e-01

Pos logit contributions
 distance+0.058
Ar+0.058
Mill+0.057
 possible+0.056
Welcome+0.055
Neg logit contributions
ish-0.055
okes-0.055
our-0.053
ours-0.052
ap-0.052
Token in
Token ID287
Feature activation-1.82e-01

Pos logit contributions
okes+0.018
ish+0.018
ours+0.018
our+0.017
int+0.017
Neg logit contributions
 distance-0.020
Ar-0.019
Mill-0.018
 possible-0.018
Welcome-0.018
Token z
Token ID1976
Feature activation+1.11e+00

Pos logit contributions
okes+0.016
ish+0.016
ap+0.016
 favorites+0.015
our+0.015
Neg logit contributions
Mill-0.020
Ar-0.020
 possible-0.020
 distance-0.019
Ab-0.019
Tokenig
Token ID328
Feature activation-1.35e+00

Pos logit contributions
Phil+0.119
Ar+0.117
Car+0.117
Pat+0.115
G+0.115
Neg logit contributions
ishes-0.072
 favorites-0.070
ish-0.069
 anymore-0.065
 appetite-0.064
Tokenz
Token ID89
Feature activation+1.56e+00

Pos logit contributions
ish+0.160
okes+0.159
ours+0.154
our+0.152
ap+0.149
Neg logit contributions
 distance-0.116
Ar-0.113
Mill-0.112
 possible-0.111
Ab-0.104
Tokenags
Token ID3775
Feature activation-4.05e+00

Pos logit contributions
 distance+0.188
Ar+0.186
Ab+0.177
Back+0.175
iest+0.173
Neg logit contributions
ours-0.103
ish-0.087
ccoli-0.087
 favorites-0.085
okes-0.084
Token,
Token ID11
Feature activation-2.42e-02

Pos logit contributions
ish+0.241
okes+0.234
ap+0.229
our+0.221
ours+0.194
Neg logit contributions
Ar-0.595
Mill-0.589
Welcome-0.586
 distance-0.577
Ab-0.573
Token going
Token ID1016
Feature activation+1.37e-01

Pos logit contributions
okes+0.002
ish+0.002
our+0.002
ap+0.002
ours+0.002
Neg logit contributions
Mill-0.003
Ar-0.003
 possible-0.003
Ab-0.003
Welcome-0.003
Token left
Token ID1364
Feature activation+2.16e-02

Pos logit contributions
Mill+0.013
Ar+0.013
Ab+0.013
 distance+0.012
Welcome+0.012
Neg logit contributions
ish-0.015
okes-0.014
our-0.014
ap-0.014
ours-0.013
Token then
Token ID788
Feature activation+3.99e-01

Pos logit contributions
 distance+0.002
Ar+0.002
Mill+0.002
 possible+0.002
Welcome+0.002
Neg logit contributions
okes-0.002
ish-0.002
our-0.002
ap-0.002
ours-0.002
Token right
Token ID826
Feature activation-7.12e-01

Pos logit contributions
Mill+0.044
Ar+0.044
 distance+0.043
Ab+0.042
Welcome+0.041
Neg logit contributions
ish-0.038
okes-0.038
ap-0.037
our-0.037
ours-0.035
Token
Token ID198
Feature activation+5.37e-01

Pos logit contributions
okes+0.077
ish+0.076
our+0.073
ours+0.073
ap+0.072
Neg logit contributions
 distance-0.083
Ar-0.082
Mill-0.081
 possible-0.080
Welcome-0.078
Token
Token ID198
Feature activation+5.04e-01

Pos logit contributions
 distance+0.063
 possible+0.062
Ar+0.060
Mill+0.060
Welcome+0.059
Neg logit contributions
okes-0.047
ish-0.043
ap-0.043
ours-0.042
our-0.042
TokenThe
Token ID464
Feature activation+2.19e-01

Pos logit contributions
 possible+0.054
 distance+0.050
fl+0.048
Welcome+0.045
Mill+0.045
Neg logit contributions
ap-0.046
okes-0.046
ish-0.041
ird-0.040
ours-0.039
Token dog
Token ID3290
Feature activation-8.34e-01

Pos logit contributions
Mill+0.031
 possible+0.030
 distance+0.029
Ar+0.029
Ab+0.029
Neg logit contributions
ish-0.015
okes-0.015
ap-0.014
ours-0.013
 brother-0.013
Token w
Token ID266
Feature activation+1.13e-01

Pos logit contributions
ish+0.083
our+0.081
ird+0.079
okes+0.079
ap+0.078
Neg logit contributions
Ar-0.081
 distance-0.081
Hello-0.077
Welcome-0.075
 winding-0.072
Tokenags
Token ID3775
Feature activation-4.04e+00

Pos logit contributions
Ar+0.014
Pat+0.012
Ab+0.012
Bo+0.012
Kay+0.012
Neg logit contributions
ish-0.007
 favorites-0.007
 brother-0.007
ishes-0.007
ird-0.007
Token its
Token ID663
Feature activation+1.30e+00

Pos logit contributions
okes+0.324
ap+0.322
 at+0.303
ish+0.301
ow+0.288
Neg logit contributions
Ar-0.485
Welcome-0.484
 possible-0.481
 distance-0.475
Mill-0.469
Token tail
Token ID7894
Feature activation+1.19e-01

Pos logit contributions
 distance+0.138
Ar+0.136
Mill+0.134
 possible+0.132
Welcome+0.128
Neg logit contributions
okes-0.130
ish-0.130
our-0.124
ours-0.123
ap-0.121
Token and
Token ID290
Feature activation-5.17e-02

Pos logit contributions
 distance+0.014
Ar+0.013
 possible+0.012
Welcome+0.012
Mill+0.012
Neg logit contributions
okes-0.011
ish-0.011
ours-0.011
our-0.011
ishes-0.011
Token l
Token ID300
Feature activation+8.36e-02

Pos logit contributions
okes+0.005
ish+0.005
our+0.005
ours+0.005
ap+0.005
Neg logit contributions
 distance-0.006
Ar-0.005
Mill-0.005
 possible-0.005
Welcome-0.005
Tokenicks
Token ID3378
Feature activation-4.08e-01

Pos logit contributions
Ar+0.012
 possible+0.012
 distance+0.011
Welcome+0.011
Mill+0.011
Neg logit contributions
ish-0.005
our-0.005
 favorites-0.005
ishes-0.004
okes-0.004
Token
Token ID198
Feature activation+2.78e-01

Pos logit contributions
okes+0.013
ish+0.012
our+0.012
ap+0.012
ours+0.012
Neg logit contributions
 distance-0.017
 possible-0.017
Ar-0.016
Mill-0.016
Welcome-0.016
Token
Token ID198
Feature activation+2.53e-01

Pos logit contributions
 distance+0.034
 possible+0.033
Ar+0.032
Mill+0.031
Welcome+0.031
Neg logit contributions
okes-0.023
our-0.021
ap-0.021
ish-0.021
ours-0.021
TokenPaul
Token ID12041
Feature activation+2.23e+00

Pos logit contributions
 distance+0.021
 possible+0.020
keeping+0.018
Welcome+0.017
Ab+0.017
Neg logit contributions
okes-0.029
ap-0.029
our-0.029
ish-0.027
 favorites-0.027
Token put
Token ID1234
Feature activation-7.82e-01

Pos logit contributions
Mill+0.210
Ar+0.209
 distance+0.204
Ab+0.198
Welcome+0.184
Neg logit contributions
okes-0.239
ags-0.233
ours-0.231
ish-0.227
 favorites-0.220
Token r
Token ID374
Feature activation-1.64e-01

Pos logit contributions
okes+0.099
our+0.095
ish+0.094
ap+0.094
ours+0.092
Neg logit contributions
 distance-0.064
Ar-0.062
Mill-0.059
 possible-0.058
Welcome-0.056
Tokenags
Token ID3775
Feature activation-4.04e+00

Pos logit contributions
ish+0.014
 anymore+0.014
ours+0.014
 favorites+0.014
okes+0.013
Neg logit contributions
Mill-0.018
Ar-0.017
Ab-0.016
 distance-0.016
keeping-0.016
Token and
Token ID290
Feature activation-1.22e-01

Pos logit contributions
ish+0.375
okes+0.369
our+0.354
ours+0.351
ap+0.348
Neg logit contributions
Ar-0.450
 distance-0.447
 possible-0.434
Welcome-0.433
Mill-0.431
Token hay
Token ID27678
Feature activation+5.88e-02

Pos logit contributions
okes+0.012
ish+0.012
our+0.012
ours+0.012
ap+0.012
Neg logit contributions
 distance-0.013
Ar-0.012
Mill-0.012
 possible-0.012
Welcome-0.012
Token into
Token ID656
Feature activation+6.10e-01

Pos logit contributions
 distance+0.007
Ar+0.007
Mill+0.007
 possible+0.007
Welcome+0.007
Neg logit contributions
okes-0.005
ish-0.005
ours-0.005
our-0.005
ap-0.004
Token a
Token ID257
Feature activation-2.39e-01

Pos logit contributions
 distance+0.065
Ar+0.064
 possible+0.064
Mill+0.063
Welcome+0.062
Neg logit contributions
okes-0.058
ish-0.057
ours-0.055
our-0.055
 favorites-0.054
Token box
Token ID3091
Feature activation-8.91e-01

Pos logit contributions
ish+0.024
okes+0.023
our+0.022
ours+0.022
ap+0.022
Neg logit contributions
Ar-0.025
 possible-0.025
Mill-0.025
 distance-0.024
Welcome-0.023
Token named
Token ID3706
Feature activation+8.06e-02

Pos logit contributions
 distance+0.037
Mill+0.036
Ar+0.035
Welcome+0.034
 possible+0.034
Neg logit contributions
okes-0.022
ish-0.021
ours-0.020
our-0.020
 favorites-0.020
Token Lucy
Token ID22162
Feature activation-7.75e-01

Pos logit contributions
Mill+0.010
 distance+0.010
 possible+0.010
Ar+0.010
Welcome+0.009
Neg logit contributions
ish-0.007
okes-0.007
our-0.006
ours-0.006
ap-0.006
Token who
Token ID508
Feature activation-1.03e+00

Pos logit contributions
ish+0.057
okes+0.057
our+0.054
ours+0.053
ap+0.052
Neg logit contributions
 distance-0.102
Ar-0.101
Mill-0.100
 possible-0.099
Welcome-0.097
Token loved
Token ID6151
Feature activation-4.20e-01

Pos logit contributions
ish+0.090
okes+0.087
our+0.084
ap+0.083
ours+0.082
Neg logit contributions
 distance-0.122
Ar-0.122
 possible-0.119
Mill-0.119
Welcome-0.114
Token r
Token ID374
Feature activation-2.19e-01

Pos logit contributions
okes+0.038
ish+0.038
ours+0.036
our+0.036
 favorites+0.035
Neg logit contributions
 distance-0.048
Ar-0.048
Mill-0.046
 possible-0.046
Welcome-0.044
Tokenags
Token ID3775
Feature activation-4.01e+00

Pos logit contributions
okes+0.019
ish+0.019
our+0.018
ours+0.018
 favorites+0.017
Neg logit contributions
Mill-0.026
Ab-0.024
 distance-0.024
Ar-0.024
 possible-0.023
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
ish+0.451
okes+0.448
our+0.433
ours+0.429
ap+0.425
Neg logit contributions
Ar-0.379
 distance-0.372
Mill-0.361
 possible-0.360
Welcome-0.350
Token She
Token ID1375
Feature activation-2.82e-01

Pos logit contributions
 distance+0.012
Mill+0.012
Welcome+0.012
Ab+0.011
Ar+0.011
Neg logit contributions
ish-0.017
okes-0.017
ishes-0.016
our-0.016
ours-0.016
Token would
Token ID561
Feature activation+3.52e-01

Pos logit contributions
okes+0.033
ish+0.032
our+0.031
ours+0.031
ap+0.031
Neg logit contributions
 distance-0.026
Ar-0.025
Mill-0.025
 possible-0.024
Welcome-0.024
Token always
Token ID1464
Feature activation-7.19e-01

Pos logit contributions
 distance+0.040
Ar+0.039
Mill+0.039
 possible+0.038
Welcome+0.037
Neg logit contributions
ish-0.033
okes-0.032
our-0.031
ours-0.031
ap-0.030
Token let
Token ID1309
Feature activation+7.50e-01

Pos logit contributions
okes+0.068
ish+0.067
ours+0.064
our+0.063
ap+0.061
Neg logit contributions
 distance-0.081
Ar-0.078
Mill-0.078
 possible-0.077
Welcome-0.075
Token She
Token ID1375
Feature activation-3.54e-01

Pos logit contributions
okes+0.009
ish+0.009
our+0.009
ours+0.009
ap+0.008
Neg logit contributions
 distance-0.009
Ar-0.009
Mill-0.009
 possible-0.008
Welcome-0.008
Token loved
Token ID6151
Feature activation-3.67e-01

Pos logit contributions
ish+0.028
okes+0.028
our+0.027
ap+0.027
 favorites+0.026
Neg logit contributions
 possible-0.044
 distance-0.044
Ar-0.042
Mill-0.042
Welcome-0.040
Token to
Token ID284
Feature activation+2.17e-01

Pos logit contributions
okes+0.035
ish+0.035
our+0.033
ours+0.033
ap+0.032
Neg logit contributions
 distance-0.041
Ar-0.040
Mill-0.040
 possible-0.039
Welcome-0.038
Token bring
Token ID2222
Feature activation-1.98e-01

Pos logit contributions
 distance+0.023
Mill+0.023
Ar+0.023
 possible+0.022
Welcome+0.022
Neg logit contributions
okes-0.021
ish-0.021
ours-0.020
our-0.020
ishes-0.019
Token r
Token ID374
Feature activation-1.95e-01

Pos logit contributions
okes+0.024
ish+0.024
our+0.023
ours+0.023
ap+0.022
Neg logit contributions
 distance-0.017
Mill-0.017
Ar-0.017
 possible-0.016
Welcome-0.016
Tokenags
Token ID3775
Feature activation-4.00e+00

Pos logit contributions
okes+0.015
ish+0.015
our+0.015
ours+0.014
 favorites+0.014
Neg logit contributions
Mill-0.024
Ab-0.022
Ar-0.022
 distance-0.022
fl-0.021
Token from
Token ID422
Feature activation+1.43e+00

Pos logit contributions
ish+0.418
okes+0.415
ours+0.397
our+0.395
ap+0.391
Neg logit contributions
Ar-0.409
 distance-0.404
 possible-0.392
Welcome-0.384
Mill-0.384
Token the
Token ID262
Feature activation-2.27e-02

Pos logit contributions
Ar+0.150
 distance+0.145
 possible+0.143
Mill+0.141
Welcome+0.137
Neg logit contributions
ish-0.150
okes-0.146
ours-0.142
our-0.140
 favorites-0.137
Token kitchen
Token ID9592
Feature activation+1.01e-01

Pos logit contributions
ish+0.002
 favorites+0.002
ours+0.002
our+0.002
ap+0.002
Neg logit contributions
 possible-0.003
Ar-0.003
Mill-0.003
 distance-0.002
Welcome-0.002
Token around
Token ID1088
Feature activation-1.38e-01

Pos logit contributions
 distance+0.011
Ar+0.010
Mill+0.010
 possible+0.010
Welcome+0.010
Neg logit contributions
okes-0.010
ish-0.010
ours-0.009
int-0.009
our-0.009
Token the
Token ID262
Feature activation+6.64e-04

Pos logit contributions
okes+0.017
ish+0.017
ours+0.016
our+0.016
ap+0.016
Neg logit contributions
 distance-0.012
Mill-0.011
Ar-0.011
 possible-0.011
Welcome-0.011
Token and
Token ID290
Feature activation-3.53e-01

Pos logit contributions
 distance+0.086
Mill+0.083
 possible+0.079
Welcome+0.079
Ar+0.076
Neg logit contributions
okes-0.069
ish-0.065
ours-0.063
ags-0.061
our-0.060
Token showed
Token ID3751
Feature activation-8.05e-01

Pos logit contributions
ish+0.032
okes+0.032
our+0.031
ours+0.030
ap+0.030
Neg logit contributions
Ar-0.040
 distance-0.040
 possible-0.039
Mill-0.039
Welcome-0.038
Token her
Token ID607
Feature activation-1.18e-01

Pos logit contributions
okes+0.100
ish+0.099
our+0.097
ours+0.095
ap+0.094
Neg logit contributions
 distance-0.065
Mill-0.063
Ar-0.063
 possible-0.062
Welcome-0.060
Token the
Token ID262
Feature activation-2.59e-01

Pos logit contributions
okes+0.012
ish+0.012
our+0.012
ap+0.011
int+0.011
Neg logit contributions
 distance-0.012
 possible-0.012
Mill-0.012
Ab-0.012
Ar-0.011
Token r
Token ID374
Feature activation-2.29e-01

Pos logit contributions
okes+0.029
ish+0.029
our+0.027
ours+0.027
ap+0.027
Neg logit contributions
 distance-0.025
Ar-0.024
Mill-0.024
 possible-0.024
Welcome-0.023
Tokenags
Token ID3775
Feature activation-4.00e+00

Pos logit contributions
okes+0.018
ish+0.018
our+0.017
ap+0.017
 favorites+0.016
Neg logit contributions
Mill-0.027
Ar-0.026
Ab-0.026
 distance-0.025
 possible-0.025
Token.
Token ID13
Feature activation-3.78e-02

Pos logit contributions
ish+0.332
okes+0.324
our+0.313
ours+0.311
ap+0.310
Neg logit contributions
Ar-0.526
Welcome-0.478
 distance-0.471
Mill-0.465
 possible-0.457
Token She
Token ID1375
Feature activation-3.70e-01

Pos logit contributions
ish+0.004
okes+0.004
our+0.004
ours+0.003
ap+0.003
Neg logit contributions
 distance-0.004
Mill-0.004
Ar-0.004
Welcome-0.004
 possible-0.004
Token said
Token ID531
Feature activation-4.43e-01

Pos logit contributions
okes+0.042
ish+0.042
our+0.040
ap+0.040
ours+0.039
Neg logit contributions
 distance-0.034
Mill-0.033
Ar-0.033
 possible-0.032
Ab-0.031
Token that
Token ID326
Feature activation+4.91e-01

Pos logit contributions
okes+0.037
ish+0.037
our+0.035
ours+0.034
ap+0.034
Neg logit contributions
 distance-0.054
Ar-0.053
Mill-0.053
 possible-0.052
Welcome-0.051
Token she
Token ID673
Feature activation+6.29e-01

Pos logit contributions
Ar+0.047
 distance+0.046
Mill+0.044
 possible+0.043
Welcome+0.042
Neg logit contributions
ish-0.055
okes-0.054
ours-0.053
our-0.052
ap-0.052
Token it
Token ID340
Feature activation+5.39e-01

Pos logit contributions
ish+0.020
ap+0.019
our+0.018
okes+0.018
ours+0.018
Neg logit contributions
Ar-0.028
Mill-0.026
Welcome-0.026
 distance-0.026
 possible-0.025
Token was
Token ID373
Feature activation+5.52e-02

Pos logit contributions
 distance+0.052
Ar+0.051
Mill+0.051
 possible+0.050
Welcome+0.049
Neg logit contributions
ish-0.059
okes-0.058
our-0.056
ours-0.056
ap-0.055
Token difficult
Token ID2408
Feature activation-5.89e-01

Pos logit contributions
 distance+0.006
Ar+0.005
 possible+0.005
Mill+0.005
Welcome+0.005
Neg logit contributions
okes-0.006
ish-0.006
our-0.006
ours-0.006
ap-0.005
Token finding
Token ID4917
Feature activation+6.18e-01

Pos logit contributions
okes+0.054
ish+0.054
ours+0.051
our+0.050
ap+0.049
Neg logit contributions
 distance-0.067
 possible-0.066
Mill-0.065
Ar-0.065
Welcome-0.062
Token r
Token ID374
Feature activation-1.99e-01

Pos logit contributions
 distance+0.064
Ar+0.062
Mill+0.062
 possible+0.061
Welcome+0.059
Neg logit contributions
okes-0.065
ish-0.063
our-0.062
ours-0.061
ap-0.060
Tokenags
Token ID3775
Feature activation-3.99e+00

Pos logit contributions
ish+0.018
okes+0.017
ours+0.017
 favorites+0.016
our+0.016
Neg logit contributions
Mill-0.022
Ar-0.022
 possible-0.022
 distance-0.021
Ab-0.020
Token and
Token ID290
Feature activation-3.01e-01

Pos logit contributions
ish+0.311
okes+0.295
our+0.288
ours+0.278
ap+0.274
Neg logit contributions
Ar-0.531
 distance-0.507
Welcome-0.502
Mill-0.501
Ab-0.481
Token water
Token ID1660
Feature activation+8.58e-01

Pos logit contributions
ish+0.029
okes+0.029
ours+0.028
our+0.027
ap+0.027
Neg logit contributions
Ar-0.033
Mill-0.031
 distance-0.030
Welcome-0.030
Ab-0.029
Token.
Token ID13
Feature activation-2.25e-01

Pos logit contributions
 distance+0.096
Ar+0.096
Mill+0.095
 possible+0.092
Welcome+0.091
Neg logit contributions
ish-0.080
okes-0.079
our-0.076
ours-0.075
ap-0.074
Token But
Token ID887
Feature activation-7.38e-01

Pos logit contributions
ish+0.025
okes+0.025
our+0.024
ours+0.024
ap+0.023
Neg logit contributions
 distance-0.021
Ar-0.021
Mill-0.021
Ab-0.020
Welcome-0.020
Token Bob
Token ID5811
Feature activation-1.10e+00

Pos logit contributions
ish+0.066
our+0.061
ap+0.061
okes+0.061
ours+0.059
Neg logit contributions
Ar-0.087
Mill-0.084
 distance-0.083
Welcome-0.082
 possible-0.079
Token,
Token ID11
Feature activation+1.86e-02

Pos logit contributions
 distance+0.047
Ar+0.046
Mill+0.046
 possible+0.045
Welcome+0.045
Neg logit contributions
ish-0.022
okes-0.022
our-0.021
ours-0.021
ap-0.020
Token he
Token ID339
Feature activation+7.18e-01

Pos logit contributions
Mill+0.001
Ar+0.001
 possible+0.001
 distance+0.001
Welcome+0.001
Neg logit contributions
ish-0.002
okes-0.002
our-0.002
ours-0.002
ap-0.002
Token found
Token ID1043
Feature activation-1.11e+00

Pos logit contributions
 distance+0.082
Ar+0.081
Mill+0.080
 possible+0.079
Welcome+0.077
Neg logit contributions
okes-0.066
ish-0.066
our-0.062
ours-0.062
ap-0.061
Token the
Token ID262
Feature activation-8.32e-02

Pos logit contributions
ish+0.095
okes+0.094
ours+0.087
 favorites+0.084
our+0.083
Neg logit contributions
 distance-0.131
Ar-0.131
Mill-0.129
 possible-0.127
Welcome-0.125
Token tast
Token ID14854
Feature activation-1.09e+00

Pos logit contributions
ish+0.008
okes+0.008
 favorites+0.008
our+0.008
ishes+0.007
Neg logit contributions
Ar-0.009
Mill-0.009
 distance-0.008
 possible-0.008
Welcome-0.008
Tokeniest
Token ID6386
Feature activation+3.62e+00

Pos logit contributions
ish+0.171
okes+0.168
our+0.165
ours+0.163
ap+0.161
Neg logit contributions
 distance-0.056
Ar-0.055
Mill-0.054
 possible-0.049
Welcome-0.049
Token ac
Token ID936
Feature activation+3.00e-01

Pos logit contributions
 distance+0.374
Ar+0.362
Mill+0.357
 possible+0.351
Welcome+0.344
Neg logit contributions
okes-0.379
ish-0.367
our-0.360
ours-0.359
ap-0.356
Tokenorn
Token ID1211
Feature activation+6.52e-01

Pos logit contributions
 distance+0.034
Ar+0.034
Mill+0.033
 possible+0.033
Welcome+0.032
Neg logit contributions
ish-0.028
okes-0.028
our-0.026
ours-0.026
ap-0.025
Token he
Token ID339
Feature activation+8.19e-01

Pos logit contributions
Ar+0.068
 distance+0.068
Mill+0.067
 possible+0.064
Welcome+0.064
Neg logit contributions
ish-0.067
okes-0.065
our-0.063
ours-0.062
ap-0.061
Token had
Token ID550
Feature activation+6.06e-01

Pos logit contributions
Ar+0.060
 distance+0.059
 possible+0.058
Mill+0.056
Welcome+0.056
Neg logit contributions
okes-0.110
ap-0.108
ish-0.107
ours-0.106
our-0.106
Token ever
Token ID1683
Feature activation-7.66e-02

Pos logit contributions
 distance+0.038
Ar+0.038
Mill+0.037
 possible+0.035
Welcome+0.034
Neg logit contributions
okes-0.086
ish-0.085
our-0.083
ours-0.082
ap-0.081
Token m
Token ID285
Feature activation+2.61e-01

Pos logit contributions
ish+0.004
okes+0.004
our+0.004
ours+0.004
ap+0.004
Neg logit contributions
 distance-0.006
Ar-0.006
Mill-0.006
 possible-0.006
Welcome-0.006
Tokenitt
Token ID715
Feature activation+1.01e+00

Pos logit contributions
 distance+0.030
Mill+0.030
Ar+0.029
 possible+0.029
Ab+0.028
Neg logit contributions
ish-0.022
okes-0.022
 favorites-0.021
ours-0.021
our-0.021
Token and
Token ID290
Feature activation+4.18e-01

Pos logit contributions
Ar+0.118
Mill+0.118
 distance+0.117
Welcome+0.112
 possible+0.112
Neg logit contributions
ish-0.092
okes-0.091
our-0.087
ours-0.085
ap-0.084
Token a
Token ID257
Feature activation-1.99e-03

Pos logit contributions
 distance+0.042
Ar+0.041
 possible+0.040
Welcome+0.039
Mill+0.038
Neg logit contributions
ish-0.043
okes-0.043
our-0.041
ap-0.040
ours-0.039
Token spat
Token ID15246
Feature activation+4.23e-01

Pos logit contributions
ish+0.000
okes+0.000
our+0.000
ours+0.000
ap+0.000
Neg logit contributions
Mill-0.000
 distance-0.000
Ar-0.000
 possible-0.000
Welcome-0.000
Tokenula
Token ID4712
Feature activation+3.59e+00

Pos logit contributions
 distance+0.044
Ar+0.043
 possible+0.043
Mill+0.043
Welcome+0.041
Neg logit contributions
ish-0.042
okes-0.042
our-0.040
ours-0.039
 favorites-0.039
Token.
Token ID13
Feature activation+3.53e-01

Pos logit contributions
 distance+0.405
Ar+0.396
 possible+0.395
Mill+0.388
Welcome+0.384
Neg logit contributions
okes-0.311
ish-0.298
 favorites-0.294
ours-0.287
our-0.285
Token They
Token ID1119
Feature activation+4.48e-02

Pos logit contributions
Ar+0.032
 distance+0.031
Welcome+0.031
Mill+0.029
Oh+0.029
Neg logit contributions
ish-0.043
okes-0.039
our-0.038
ishes-0.038
int-0.038
Token work
Token ID670
Feature activation+6.03e-01

Pos logit contributions
 distance+0.005
Ar+0.004
 possible+0.004
Ab+0.004
Mill+0.004
Neg logit contributions
okes-0.005
ish-0.005
ours-0.004
ap-0.004
 favorites-0.004
Token together
Token ID1978
Feature activation-2.90e-01

Pos logit contributions
 distance+0.057
Ar+0.056
 possible+0.054
Mill+0.054
Welcome+0.053
Neg logit contributions
ish-0.068
okes-0.067
our-0.065
ours-0.064
ap-0.063
Token to
Token ID284
Feature activation+4.47e-01

Pos logit contributions
ish+0.028
okes+0.028
 favorites+0.026
our+0.026
ours+0.026
Neg logit contributions
 distance-0.032
 possible-0.031
Ar-0.030
Mill-0.029
Welcome-0.029
Token happy
Token ID3772
Feature activation-8.59e-01

Pos logit contributions
Ar+0.071
 distance+0.070
Mill+0.069
 possible+0.068
Welcome+0.067
Neg logit contributions
ish-0.063
okes-0.063
ap-0.062
ours-0.060
our-0.060
Token to
Token ID284
Feature activation+3.27e-02

Pos logit contributions
okes+0.075
ish+0.075
our+0.071
ours+0.070
ap+0.069
Neg logit contributions
 distance-0.102
Ar-0.102
Mill-0.100
 possible-0.098
Welcome-0.096
Token see
Token ID766
Feature activation-4.12e-01

Pos logit contributions
Ar+0.004
 distance+0.004
Mill+0.004
Welcome+0.004
 possible+0.004
Neg logit contributions
our-0.002
ap-0.002
ish-0.002
okes-0.002
ours-0.002
Token the
Token ID262
Feature activation-3.80e-01

Pos logit contributions
okes+0.046
ish+0.046
our+0.044
ours+0.044
ap+0.043
Neg logit contributions
 distance-0.039
Mill-0.038
Ar-0.038
 possible-0.038
Welcome-0.036
Token sun
Token ID4252
Feature activation+7.78e-01

Pos logit contributions
ish+0.041
our+0.037
 favorites+0.036
okes+0.036
ishes+0.036
Neg logit contributions
Ar-0.040
Mill-0.037
Welcome-0.037
 possible-0.036
 distance-0.036
Tokenfl
Token ID2704
Feature activation+3.84e+00

Pos logit contributions
Ar+0.077
Mill+0.066
Ab+0.064
 distance+0.063
 possible+0.063
Neg logit contributions
ish-0.099
our-0.092
ap-0.088
ishes-0.086
okes-0.086
Tokenowers
Token ID3618
Feature activation-9.25e-01

Pos logit contributions
 distance+0.432
Mill+0.427
 possible+0.414
Welcome+0.409
Ar+0.408
Neg logit contributions
okes-0.346
ours-0.319
ish-0.315
ap-0.313
 favorites-0.308
Token growing
Token ID3957
Feature activation-1.82e-01

Pos logit contributions
ish+0.078
our+0.072
okes+0.070
ap+0.069
ours+0.068
Neg logit contributions
Ar-0.119
Mill-0.113
 distance-0.111
Ab-0.108
Welcome-0.107
Token tall
Token ID7331
Feature activation-7.82e-02

Pos logit contributions
ish+0.015
our+0.013
okes+0.013
ours+0.013
ap+0.013
Neg logit contributions
Ar-0.024
Mill-0.023
 distance-0.022
Welcome-0.022
Ab-0.021
Token and
Token ID290
Feature activation-3.46e-01

Pos logit contributions
ish+0.005
okes+0.005
our+0.005
ours+0.005
ap+0.005
Neg logit contributions
Ar-0.011
 distance-0.011
Mill-0.010
 possible-0.010
Welcome-0.010
Token strong
Token ID1913
Feature activation-9.04e-01

Pos logit contributions
ish+0.031
ours+0.030
okes+0.029
ishes+0.029
 favorites+0.028
Neg logit contributions
Ar-0.040
Mill-0.036
Ab-0.034
 distance-0.034
Welcome-0.034
Token have
Token ID423
Feature activation-2.35e-01

Pos logit contributions
 distance+0.044
Ar+0.044
Mill+0.043
 possible+0.042
Welcome+0.042
Neg logit contributions
okes-0.033
ish-0.033
our-0.031
ap-0.030
ours-0.030
Token the
Token ID262
Feature activation-9.19e-03

Pos logit contributions
ish+0.027
okes+0.026
our+0.026
ours+0.025
int+0.025
Neg logit contributions
 distance-0.022
 possible-0.022
Ar-0.022
Mill-0.021
Welcome-0.021
Token biggest
Token ID4094
Feature activation+4.30e-02

Pos logit contributions
okes+0.001
ours+0.001
int+0.001
our+0.001
ish+0.001
Neg logit contributions
 possible-0.001
 distance-0.001
Ar-0.001
 moment-0.001
 path-0.001
Token and
Token ID290
Feature activation+2.08e-01

Pos logit contributions
 distance+0.005
 possible+0.004
 moment+0.004
Ar+0.004
 path+0.004
Neg logit contributions
ours-0.004
okes-0.004
int-0.004
ish-0.004
our-0.004
Token prett
Token ID46442
Feature activation-1.48e+00

Pos logit contributions
Mill+0.021
Ar+0.020
 distance+0.020
Ab+0.019
Welcome+0.019
Neg logit contributions
ish-0.022
okes-0.021
 favorites-0.020
ours-0.020
our-0.020
Tokeniest
Token ID6386
Feature activation+3.56e+00

Pos logit contributions
okes+0.248
ish+0.244
ours+0.238
our+0.238
 favorites+0.237
Neg logit contributions
 possible-0.055
 distance-0.046
Ar-0.045
Welcome-0.042
Mill-0.041
Token tower
Token ID10580
Feature activation-1.07e+00

Pos logit contributions
 distance+0.431
 possible+0.415
Ar+0.405
Mill+0.395
Welcome+0.387
Neg logit contributions
okes-0.308
ish-0.305
ours-0.297
our-0.294
int-0.283
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
ish+0.115
okes+0.114
our+0.109
ap+0.108
ours+0.108
Neg logit contributions
Ar-0.104
Mill-0.103
 distance-0.103
Welcome-0.095
Ab-0.095
Token They
Token ID1119
Feature activation+1.16e-02

Pos logit contributions
Ar+0.026
 distance+0.025
Mill+0.025
Welcome+0.025
Oh+0.023
Neg logit contributions
ish-0.031
okes-0.030
our-0.029
int-0.029
ours-0.028
Token frowned
Token ID32198
Feature activation+1.19e+00

Pos logit contributions
 distance+0.001
Ar+0.001
Mill+0.001
Ab+0.001
Welcome+0.001
Neg logit contributions
okes-0.001
ish-0.001
ours-0.001
ap-0.001
our-0.001
Token and
Token ID290
Feature activation+2.85e-01

Pos logit contributions
 distance+0.148
Mill+0.142
Ar+0.142
 possible+0.141
Welcome+0.139
Neg logit contributions
okes-0.095
ish-0.095
ours-0.090
our-0.089
 favorites-0.088
Token picked
Token ID6497
Feature activation+3.52e-01

Pos logit contributions
Ar+0.010
 distance+0.010
Mill+0.010
Ab+0.009
Welcome+0.009
Neg logit contributions
okes-0.015
ish-0.014
ap-0.014
ags-0.014
our-0.014
Token the
Token ID262
Feature activation+1.60e-01

Pos logit contributions
 distance+0.035
Ar+0.034
Mill+0.034
 possible+0.033
Welcome+0.033
Neg logit contributions
okes-0.037
ish-0.037
our-0.036
ours-0.035
ap-0.035
Token prett
Token ID46442
Feature activation-1.33e+00

Pos logit contributions
 distance+0.016
Ar+0.015
Mill+0.015
 possible+0.015
Welcome+0.014
Neg logit contributions
okes-0.018
ish-0.017
our-0.017
ours-0.017
ap-0.016
Tokeniest
Token ID6386
Feature activation+3.79e+00

Pos logit contributions
okes+0.211
ish+0.208
our+0.204
ours+0.203
ap+0.203
Neg logit contributions
 distance-0.058
Ar-0.057
 possible-0.056
Mill-0.055
Welcome-0.049
Token pe
Token ID613
Feature activation-2.19e-01

Pos logit contributions
 distance+0.407
Ar+0.402
Mill+0.397
 possible+0.391
Welcome+0.381
Neg logit contributions
ish-0.371
okes-0.371
our-0.353
ours-0.350
ap-0.345
Tokenb
Token ID65
Feature activation+3.57e+00

Pos logit contributions
ish+0.034
okes+0.034
our+0.033
ours+0.033
ap+0.032
Neg logit contributions
 distance-0.011
Ar-0.011
Mill-0.010
 possible-0.010
Welcome-0.010
Tokenbles
Token ID7689
Feature activation+1.53e+00

Pos logit contributions
 distance+0.417
Ar+0.417
Mill+0.413
 possible+0.411
Welcome+0.395
Neg logit contributions
okes-0.305
ish-0.303
our-0.288
ours-0.286
ap-0.282
Token and
Token ID290
Feature activation+3.91e-01

Pos logit contributions
 distance+0.183
Ar+0.181
Mill+0.179
 possible+0.177
Welcome+0.173
Neg logit contributions
okes-0.130
ish-0.130
our-0.123
ours-0.122
ap-0.120
Token started
Token ID2067
Feature activation+1.06e+00

Pos logit contributions
 distance+0.047
Ar+0.046
Mill+0.046
 possible+0.045
Welcome+0.044
Neg logit contributions
ish-0.034
okes-0.034
our-0.032
ours-0.032
ap-0.031
Token to
Token ID284
Feature activation+4.71e-01

Pos logit contributions
 distance+0.117
Ar+0.115
Mill+0.114
 possible+0.112
Welcome+0.110
Neg logit contributions
okes-0.100
ish-0.100
our-0.095
ours-0.095
ap-0.093
Token thread
Token ID4704
Feature activation+1.37e-01

Pos logit contributions
 distance+0.055
Ar+0.054
Mill+0.053
 possible+0.052
Welcome+0.051
Neg logit contributions
ish-0.043
okes-0.042
our-0.041
ours-0.040
ap-0.040
Token<|endoftext|>
Token ID50256
Feature activation-6.16e-01

Pos logit contributions
ish+0.041
okes+0.041
our+0.039
ours+0.039
ap+0.038
Neg logit contributions
 distance-0.032
Mill-0.032
Ar-0.031
Welcome-0.030
Ab-0.029
TokenOnce
Token ID7454
Feature activation-2.40e-01

Pos logit contributions
ish+0.062
okes+0.062
our+0.059
ours+0.059
ap+0.058
Neg logit contributions
 distance-0.065
Ar-0.064
Mill-0.063
 possible-0.061
Welcome-0.061
Token there
Token ID612
Feature activation-4.68e-01

Pos logit contributions
ish+0.026
okes+0.026
our+0.025
ours+0.024
ap+0.024
Neg logit contributions
 distance-0.023
 possible-0.023
Mill-0.022
Ar-0.021
Ab-0.021
Token was
Token ID373
Feature activation+5.55e-02

Pos logit contributions
okes+0.058
ish+0.057
our+0.055
ours+0.055
ap+0.054
Neg logit contributions
 distance-0.039
Ar-0.038
Mill-0.037
 possible-0.037
Welcome-0.036
Token an
Token ID281
Feature activation+1.57e-01

Pos logit contributions
 distance+0.006
Ar+0.006
 possible+0.006
Mill+0.006
Welcome+0.006
Neg logit contributions
ish-0.005
okes-0.005
ap-0.005
ours-0.005
our-0.005
Token ostr
Token ID42387
Feature activation+1.53e+00

Pos logit contributions
 distance+0.013
Mill+0.013
Ar+0.013
 possible+0.012
Ab+0.012
Neg logit contributions
okes-0.019
ish-0.019
our-0.019
ours-0.018
ap-0.018
Tokenich
Token ID488
Feature activation-1.52e-01

Pos logit contributions
Mill+0.088
 distance+0.085
Ar+0.080
Ab+0.079
 possible+0.078
Neg logit contributions
okes-0.223
ish-0.221
ours-0.221
 favorites-0.214
ap-0.213
Token.
Token ID13
Feature activation-2.60e-01

Pos logit contributions
okes+0.013
ours+0.012
ish+0.012
our+0.012
 favorites+0.012
Neg logit contributions
 distance-0.019
Mill-0.018
 possible-0.017
Ar-0.017
Ab-0.017
Token It
Token ID632
Feature activation-6.76e-01

Pos logit contributions
okes+0.020
ish+0.020
our+0.019
ours+0.019
ap+0.019
Neg logit contributions
 distance-0.033
Ar-0.033
Mill-0.033
 possible-0.032
Welcome-0.031
Token was
Token ID373
Feature activation+1.21e-01

Pos logit contributions
ish+0.061
okes+0.056
our+0.054
 favorites+0.054
ours+0.054
Neg logit contributions
 possible-0.079
 distance-0.075
Mill-0.074
Ar-0.073
Welcome-0.071
Token very
Token ID845
Feature activation+4.48e-02

Pos logit contributions
Ar+0.014
 possible+0.014
 distance+0.014
Mill+0.013
Welcome+0.013
Neg logit contributions
ish-0.010
 favorites-0.010
ap-0.009
okes-0.009
 brother-0.009
Token claws
Token ID28421
Feature activation+6.27e-01

Pos logit contributions
 distance+0.143
Ar+0.142
Mill+0.140
 possible+0.138
Welcome+0.135
Neg logit contributions
okes-0.085
ish-0.085
our-0.080
ours-0.080
ap-0.078
Token and
Token ID290
Feature activation-2.33e-01

Pos logit contributions
 distance+0.082
Ar+0.081
Mill+0.080
 possible+0.079
Welcome+0.078
Neg logit contributions
okes-0.046
ish-0.046
our-0.043
ours-0.043
ap-0.042
Token made
Token ID925
Feature activation-1.97e-01

Pos logit contributions
okes+0.024
ish+0.023
ours+0.022
our+0.021
ap+0.021
Neg logit contributions
Mill-0.025
 distance-0.024
Ar-0.023
 possible-0.023
Welcome-0.023
Token a
Token ID257
Feature activation-1.56e-01

Pos logit contributions
okes+0.019
ish+0.019
ours+0.018
ap+0.018
our+0.017
Neg logit contributions
 distance-0.022
Ar-0.021
Mill-0.021
 possible-0.020
Welcome-0.020
Token loud
Token ID7812
Feature activation+1.19e+00

Pos logit contributions
ish+0.016
ap+0.015
okes+0.015
our+0.014
ours+0.014
Neg logit contributions
Mill-0.017
Welcome-0.016
 possible-0.016
Ab-0.015
Ar-0.015
Token noise
Token ID7838
Feature activation+1.21e+00

Pos logit contributions
 distance+0.104
Ar+0.102
Mill+0.102
 possible+0.101
Welcome+0.097
Neg logit contributions
ish-0.142
okes-0.141
ap-0.136
our-0.136
ours-0.132
Token.
Token ID13
Feature activation-1.66e-01

Pos logit contributions
Ar+0.163
 distance+0.162
Mill+0.161
 possible+0.157
Welcome+0.155
Neg logit contributions
ish-0.085
okes-0.085
our-0.079
ap-0.078
ours-0.077
Token Sara
Token ID24799
Feature activation+4.03e-02

Pos logit contributions
ish+0.017
okes+0.015
our+0.015
ishes+0.015
ours+0.015
Neg logit contributions
Mill-0.018
Ar-0.018
 distance-0.018
Welcome-0.017
Ab-0.017
Token and
Token ID290
Feature activation+6.15e-02

Pos logit contributions
 distance+0.004
Ar+0.004
Mill+0.004
Ab+0.003
Welcome+0.003
Neg logit contributions
ours-0.004
 favorites-0.004
okes-0.004
ish-0.004
ags-0.004
Token Ben
Token ID3932
Feature activation-3.89e-01

Pos logit contributions
 distance+0.009
Ar+0.008
 possible+0.008
Mill+0.008
Welcome+0.008
Neg logit contributions
ish-0.004
okes-0.004
our-0.004
ours-0.004
ap-0.003
Token jumped
Token ID11687
Feature activation-2.54e-01

Pos logit contributions
okes+0.046
ours+0.046
ish+0.045
 favorites+0.044
ap+0.043
Neg logit contributions
 distance-0.035
Ar-0.032
Mill-0.031
Welcome-0.030
Ab-0.029
Token park
Token ID3952
Feature activation+2.11e-01

Pos logit contributions
 possible+0.021
Mill+0.020
Ar+0.020
Ab+0.019
Welcome+0.018
Neg logit contributions
ish-0.013
okes-0.012
ap-0.012
 favorites-0.012
ours-0.011
Token.
Token ID13
Feature activation+3.91e-01

Pos logit contributions
Ar+0.028
Mill+0.028
 distance+0.028
 possible+0.028
Welcome+0.027
Neg logit contributions
okes-0.015
ish-0.015
our-0.014
ap-0.014
ours-0.014
Token The
Token ID383
Feature activation-8.88e-01

Pos logit contributions
 distance+0.048
Ar+0.047
Mill+0.046
Welcome+0.046
 possible+0.045
Neg logit contributions
ish-0.033
okes-0.032
our-0.031
ap-0.030
ours-0.030
Token k
Token ID479
Feature activation+9.68e-01

Pos logit contributions
ish+0.081
 favorites+0.072
ap+0.071
okes+0.066
ours+0.066
Neg logit contributions
Mill-0.104
 possible-0.104
keeping-0.097
Ar-0.094
Jess-0.093
Tokenang
Token ID648
Feature activation+1.52e+00

Pos logit contributions
 distance+0.111
Ar+0.110
 possible+0.109
Mill+0.109
Welcome+0.104
Neg logit contributions
ish-0.089
okes-0.089
ours-0.083
our-0.083
ap-0.082
Tokenaroo
Token ID38049
Feature activation+1.27e+00

Pos logit contributions
 distance+0.100
Mill+0.099
Ar+0.098
 possible+0.097
Welcome+0.090
Neg logit contributions
ish-0.216
okes-0.216
ours-0.207
our-0.206
ap-0.205
Token is
Token ID318
Feature activation-6.26e-01

Pos logit contributions
 distance+0.148
Ar+0.146
Welcome+0.140
Mill+0.140
 possible+0.138
Neg logit contributions
ish-0.105
okes-0.105
our-0.102
ap-0.100
ours-0.100
Token big
Token ID1263
Feature activation+3.27e-01

Pos logit contributions
okes+0.057
ish+0.056
ap+0.055
ours+0.053
 favorites+0.052
Neg logit contributions
 distance-0.069
Ar-0.069
Mill-0.068
Welcome-0.066
 possible-0.064
Token and
Token ID290
Feature activation+1.88e-01

Pos logit contributions
 distance+0.036
 possible+0.034
Ar+0.033
Welcome+0.033
Mill+0.032
Neg logit contributions
okes-0.031
ours-0.030
our-0.030
ish-0.029
int-0.029
Token brown
Token ID7586
Feature activation+5.07e-01

Pos logit contributions
Mill+0.021
 distance+0.020
Ar+0.019
Ab+0.019
keeping+0.019
Neg logit contributions
okes-0.018
ish-0.018
ours-0.018
our-0.016
 favorites-0.016
Token.
Token ID13
Feature activation+4.54e-01

Pos logit contributions
 distance+0.054
Welcome+0.051
 possible+0.051
Ar+0.051
Mill+0.051
Neg logit contributions
okes-0.047
ish-0.046
ours-0.046
our-0.045
 favorites-0.045
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
 distance+0.100
Mill+0.099
Ar+0.098
 possible+0.097
Welcome+0.094
Neg logit contributions
okes-0.052
ish-0.051
ours-0.048
our-0.047
 favorites-0.045
Token Little
Token ID7703
Feature activation-2.97e-01

Pos logit contributions
ish+0.013
okes+0.013
our+0.012
ap+0.012
ours+0.012
Neg logit contributions
Ar-0.014
 distance-0.014
Mill-0.013
 possible-0.013
Ab-0.013
Token Billy
Token ID15890
Feature activation-7.18e-01

Pos logit contributions
okes+0.031
ours+0.030
ap+0.029
 favorites+0.028
ish+0.028
Neg logit contributions
 distance-0.031
Welcome-0.030
Mill-0.030
Ar-0.030
 possible-0.029
Token and
Token ID290
Feature activation-2.93e-01

Pos logit contributions
okes+0.075
ish+0.074
ours+0.072
our+0.071
ap+0.070
Neg logit contributions
 distance-0.073
Ar-0.072
Mill-0.071
 possible-0.069
Welcome-0.067
Token the
Token ID262
Feature activation-2.33e-01

Pos logit contributions
ish+0.027
okes+0.026
our+0.025
ap+0.025
ours+0.025
Neg logit contributions
Ar-0.033
 distance-0.033
Mill-0.032
 possible-0.032
Welcome-0.031
Token snow
Token ID6729
Feature activation+9.54e-01

Pos logit contributions
okes+0.026
ish+0.026
our+0.025
ours+0.024
ap+0.024
Neg logit contributions
 distance-0.023
Ar-0.022
Mill-0.022
 possible-0.021
Welcome-0.021
Tokenman
Token ID805
Feature activation+1.36e+00

Pos logit contributions
 distance+0.054
Ar+0.052
Mill+0.050
 possible+0.048
Welcome+0.046
Neg logit contributions
okes-0.144
ish-0.141
our-0.138
ours-0.138
ap-0.136
Token were
Token ID547
Feature activation+3.76e-02

Pos logit contributions
 distance+0.132
Ar+0.127
Mill+0.123
Welcome+0.119
 possible+0.118
Neg logit contributions
okes-0.155
ours-0.147
our-0.145
ish-0.145
ap-0.138
Token best
Token ID1266
Feature activation-1.82e+00

Pos logit contributions
 distance+0.004
Ar+0.004
Mill+0.004
 possible+0.004
Welcome+0.004
Neg logit contributions
ish-0.003
okes-0.003
ap-0.003
ours-0.003
our-0.003
Token friends
Token ID2460
Feature activation-5.07e-01

Pos logit contributions
ish+0.127
okes+0.124
our+0.124
ours+0.120
 favorites+0.110
Neg logit contributions
Mill-0.242
Ar-0.240
 distance-0.240
 possible-0.237
Welcome-0.231
Token!
Token ID0
Feature activation-5.92e-02

Pos logit contributions
okes+0.051
ours+0.049
ish+0.048
our+0.047
 favorites+0.045
Neg logit contributions
 distance-0.056
 possible-0.054
Mill-0.054
Welcome-0.053
Ar-0.052
Token
Token ID198
Feature activation+7.11e-01

Pos logit contributions
 distance+0.087
 possible+0.086
Mill+0.084
Ar+0.083
Welcome+0.082
Neg logit contributions
okes-0.065
ish-0.060
ours-0.060
ap-0.059
our-0.059
TokenDad
Token ID46270
Feature activation+1.84e+00

Pos logit contributions
 possible+0.076
 distance+0.072
Mill+0.069
Welcome+0.067
fl+0.067
Neg logit contributions
ap-0.072
okes-0.072
ours-0.066
our-0.065
ish-0.065
Token sigh
Token ID19680
Feature activation-9.26e-01

Pos logit contributions
 distance+0.155
Ar+0.149
Welcome+0.146
Hello+0.143
Oh+0.138
Neg logit contributions
 favorites-0.205
ish-0.192
ravings-0.179
ap-0.179
ishes-0.178
Tokens
Token ID82
Feature activation+2.15e+00

Pos logit contributions
ish+0.113
 favorites+0.110
our+0.110
okes+0.109
ap+0.106
Neg logit contributions
 distance-0.070
Ar-0.070
Welcome-0.068
Mill-0.066
 possible-0.065
Token and
Token ID290
Feature activation+2.67e-01

Pos logit contributions
 distance+0.258
Ar+0.251
Mill+0.245
 possible+0.241
Welcome+0.240
Neg logit contributions
okes-0.184
ish-0.183
 favorites-0.176
ours-0.173
our-0.172
Token picks
Token ID11103
Feature activation+1.01e+00

Pos logit contributions
 distance+0.032
Ar+0.031
 possible+0.030
Welcome+0.030
Mill+0.029
Neg logit contributions
ish-0.022
 favorites-0.021
our-0.021
okes-0.021
ap-0.021
Token up
Token ID510
Feature activation-3.97e-02

Pos logit contributions
 distance+0.079
Ar+0.078
Welcome+0.076
Mill+0.074
 possible+0.074
Neg logit contributions
okes-0.123
our-0.123
ish-0.122
 favorites-0.118
ap-0.117
Token the
Token ID262
Feature activation-1.02e-01

Pos logit contributions
ish+0.005
okes+0.005
our+0.005
 favorites+0.005
int+0.005
Neg logit contributions
 distance-0.003
Ar-0.003
Mill-0.003
Welcome-0.003
Ab-0.003
Token navy
Token ID23956
Feature activation+1.46e+00

Pos logit contributions
ish+0.010
okes+0.010
ours+0.010
our+0.010
ap+0.009
Neg logit contributions
Mill-0.011
 distance-0.011
Ar-0.010
 possible-0.010
Welcome-0.010
Token boat
Token ID8848
Feature activation+9.51e-02

Pos logit contributions
 distance+0.148
Ar+0.141
 possible+0.137
Welcome+0.131
Mill+0.130
Neg logit contributions
okes-0.156
ish-0.148
our-0.146
ours-0.145
 favorites-0.143
Token.
Token ID13
Feature activation+4.99e-01

Pos logit contributions
 distance+0.011
Ar+0.011
 possible+0.011
Mill+0.010
Welcome+0.010
Neg logit contributions
okes-0.008
ish-0.008
 favorites-0.008
our-0.008
ours-0.008
Token you
Token ID345
Feature activation-1.11e+00

Pos logit contributions
ish+0.066
okes+0.066
our+0.063
ours+0.062
ap+0.061
Neg logit contributions
 distance-0.074
Ar-0.073
Mill-0.073
 possible-0.072
Welcome-0.070
Token supply
Token ID5127
Feature activation-7.12e-01

Pos logit contributions
okes+0.113
ours+0.104
ish+0.100
our+0.100
ap+0.098
Neg logit contributions
 distance-0.123
Ar-0.123
 possible-0.117
Welcome-0.116
Mill-0.115
Token me
Token ID502
Feature activation-1.16e+00

Pos logit contributions
ish+0.068
okes+0.068
our+0.065
ours+0.064
ap+0.063
Neg logit contributions
 distance-0.078
Ar-0.077
Mill-0.076
 possible-0.075
Welcome-0.073
Token with
Token ID351
Feature activation+2.20e-01

Pos logit contributions
okes+0.113
ish+0.111
our+0.107
ours+0.106
ap+0.104
Neg logit contributions
 distance-0.126
Ar-0.124
Mill-0.124
 possible-0.122
Welcome-0.117
Token these
Token ID777
Feature activation+6.48e-01

Pos logit contributions
 distance+0.023
Ar+0.023
Mill+0.023
 possible+0.022
Welcome+0.022
Neg logit contributions
okes-0.022
ish-0.022
our-0.021
ours-0.021
ap-0.021
Token y
Token ID331
Feature activation-1.25e+00

Pos logit contributions
 distance+0.064
Ar+0.063
 possible+0.062
Mill+0.061
Welcome+0.060
Neg logit contributions
okes-0.071
our-0.066
ours-0.066
ish-0.065
ap-0.065
Tokenummy
Token ID13513
Feature activation+1.05e+00

Pos logit contributions
okes+0.112
ish+0.106
our+0.105
ours+0.105
ap+0.101
Neg logit contributions
 distance-0.140
Ar-0.138
Mill-0.136
 possible-0.134
Welcome-0.133
Token snacks
Token ID28487
Feature activation-3.27e-01

Pos logit contributions
 distance+0.096
Ar+0.096
 possible+0.092
Welcome+0.092
Mill+0.091
Neg logit contributions
okes-0.119
our-0.113
ours-0.113
ap-0.112
ish-0.109
Token,"
Token ID553
Feature activation-2.82e-01

Pos logit contributions
okes+0.029
ish+0.029
ours+0.027
our+0.027
ap+0.027
Neg logit contributions
 distance-0.038
Ar-0.037
 possible-0.037
Mill-0.037
Welcome-0.036
Token said
Token ID531
Feature activation-4.02e-01

Pos logit contributions
ish+0.028
okes+0.027
our+0.026
ours+0.026
ap+0.026
Neg logit contributions
 distance-0.031
Ar-0.030
Mill-0.030
 possible-0.029
Welcome-0.028
Token the
Token ID262
Feature activation-1.62e-01

Pos logit contributions
okes+0.045
ish+0.045
our+0.043
ours+0.043
ap+0.042
Neg logit contributions
 distance-0.038
Ar-0.037
Mill-0.037
 possible-0.036
Welcome-0.035
Token was
Token ID373
Feature activation-6.12e-02

Pos logit contributions
 distance+0.047
Ar+0.046
Mill+0.046
 possible+0.045
Welcome+0.044
Neg logit contributions
ish-0.051
okes-0.051
our-0.049
ours-0.048
ap-0.048
Token sad
Token ID6507
Feature activation-1.02e+00

Pos logit contributions
ish+0.006
okes+0.006
our+0.006
ours+0.006
ap+0.006
Neg logit contributions
 distance-0.006
Ar-0.006
Mill-0.006
 possible-0.006
Welcome-0.006
Token.
Token ID13
Feature activation-5.41e-01

Pos logit contributions
okes+0.084
ish+0.084
our+0.079
ours+0.079
ap+0.077
Neg logit contributions
 distance-0.126
 possible-0.123
Mill-0.122
Ar-0.122
Welcome-0.118
Token He
Token ID679
Feature activation-5.86e-01

Pos logit contributions
ish+0.060
okes+0.059
our+0.057
ours+0.056
ap+0.055
Neg logit contributions
 distance-0.053
Ar-0.053
Mill-0.052
Welcome-0.050
 possible-0.049
Token walked
Token ID6807
Feature activation-7.98e-01

Pos logit contributions
ish+0.063
okes+0.063
our+0.061
ours+0.060
ap+0.059
Neg logit contributions
 distance-0.057
Ar-0.056
Mill-0.055
 possible-0.055
Welcome-0.053
Token and
Token ID290
Feature activation-4.25e-01

Pos logit contributions
ish+0.061
ap+0.058
 at+0.055
 favorites+0.055
ags+0.053
Neg logit contributions
Welcome-0.096
Mill-0.095
 possible-0.094
Ar-0.093
Ab-0.092
Token walked
Token ID6807
Feature activation-8.27e-01

Pos logit contributions
ish+0.036
okes+0.035
our+0.034
ours+0.032
ap+0.032
Neg logit contributions
 possible-0.051
Ar-0.050
Mill-0.050
 distance-0.049
Welcome-0.048
Token and
Token ID290
Feature activation-4.58e-01

Pos logit contributions
ish+0.045
ap+0.045
our+0.040
 favorites+0.039
 anymore+0.039
Neg logit contributions
Welcome-0.118
Mill-0.116
Oh-0.114
Ar-0.114
 possible-0.113
Token stopped
Token ID5025
Feature activation-7.33e-01

Pos logit contributions
ish+0.040
our+0.037
okes+0.037
ap+0.036
 favorites+0.035
Neg logit contributions
 possible-0.053
Ar-0.052
Mill-0.052
Ab-0.050
Welcome-0.050
Token at
Token ID379
Feature activation-6.44e-01

Pos logit contributions
ish+0.061
okes+0.060
our+0.058
ap+0.057
ours+0.055
Neg logit contributions
Ar-0.091
 distance-0.089
Mill-0.087
Welcome-0.086
 possible-0.085
Token a
Token ID257
Feature activation-4.60e-01

Pos logit contributions
ish+0.061
okes+0.059
our+0.057
 favorites+0.056
ours+0.056
Neg logit contributions
Mill-0.069
Ar-0.069
 distance-0.068
Welcome-0.067
Ab-0.066
Token when
Token ID618
Feature activation-3.88e-01

Pos logit contributions
ish+0.034
okes+0.034
our+0.033
ours+0.032
ap+0.032
Neg logit contributions
Ar-0.047
 distance-0.047
Mill-0.046
 possible-0.045
Welcome-0.044
Token the
Token ID262
Feature activation-3.76e-01

Pos logit contributions
our+0.028
ish+0.027
okes+0.027
ap+0.027
ours+0.026
Neg logit contributions
Mill-0.050
Ar-0.049
 distance-0.049
 possible-0.047
Welcome-0.047
Token sun
Token ID4252
Feature activation+5.30e-01

Pos logit contributions
ish+0.045
okes+0.044
our+0.044
ap+0.043
ours+0.042
Neg logit contributions
Mill-0.033
Ar-0.031
 possible-0.031
 distance-0.031
Ab-0.030
Token had
Token ID550
Feature activation+1.68e-01

Pos logit contributions
 possible+0.062
Mill+0.061
Ar+0.058
Ab+0.056
 distance+0.056
Neg logit contributions
ish-0.050
our-0.049
okes-0.047
int-0.045
ours-0.045
Token gone
Token ID3750
Feature activation-1.22e+00

Pos logit contributions
Mill+0.018
Ar+0.018
 distance+0.018
 possible+0.017
Welcome+0.017
Neg logit contributions
okes-0.016
ish-0.016
our-0.015
ap-0.015
ours-0.015
Token down
Token ID866
Feature activation-1.05e+00

Pos logit contributions
ish+0.089
our+0.085
okes+0.084
ap+0.080
ours+0.078
Neg logit contributions
Ar-0.167
Mill-0.161
 distance-0.160
 possible-0.156
Welcome-0.155
Token,
Token ID11
Feature activation-4.22e-01

Pos logit contributions
ish+0.076
okes+0.075
our+0.074
ap+0.071
ours+0.070
Neg logit contributions
Ar-0.139
Mill-0.136
 distance-0.133
 possible-0.132
Welcome-0.130
Token the
Token ID262
Feature activation-3.67e-01

Pos logit contributions
okes+0.030
ish+0.030
our+0.029
ours+0.029
ap+0.028
Neg logit contributions
Ar-0.055
Mill-0.054
 distance-0.054
 possible-0.053
Welcome-0.052
Token man
Token ID582
Feature activation+6.26e-01

Pos logit contributions
okes+0.049
ish+0.049
our+0.048
ours+0.047
ap+0.047
Neg logit contributions
 distance-0.026
Ar-0.026
Mill-0.025
 possible-0.025
Welcome-0.024
Token and
Token ID290
Feature activation-4.27e-01

Pos logit contributions
 distance+0.063
Ar+0.061
Mill+0.061
 possible+0.059
Welcome+0.058
Neg logit contributions
okes-0.066
ish-0.066
our-0.063
ours-0.063
ap-0.062
Token Tommy
Token ID19919
Feature activation-2.71e-01

Pos logit contributions
ish+0.028
okes+0.027
our+0.026
ours+0.025
ap+0.025
Neg logit contributions
Ar-0.059
Mill-0.059
 distance-0.058
 possible-0.058
Welcome-0.056
Token's
Token ID338
Feature activation-1.56e+00

Pos logit contributions
 possible+0.047
 distance+0.046
Ar+0.045
Welcome+0.045
Mill+0.045
Neg logit contributions
okes-0.033
our-0.029
ish-0.029
int-0.029
ours-0.029
Token okay
Token ID8788
Feature activation-2.42e+00

Pos logit contributions
okes+0.189
ish+0.178
our+0.176
ours+0.174
ap+0.168
Neg logit contributions
 possible-0.144
 distance-0.140
Mill-0.136
Ar-0.135
Welcome-0.128
Token because
Token ID780
Feature activation-1.78e+00

Pos logit contributions
okes+0.240
ish+0.238
our+0.227
ours+0.225
ap+0.221
Neg logit contributions
 distance-0.256
 possible-0.253
Mill-0.252
Ar-0.249
Welcome-0.242
Token there
Token ID612
Feature activation-5.23e-01

Pos logit contributions
okes+0.184
ish+0.181
ours+0.180
our+0.179
ap+0.177
Neg logit contributions
Ar-0.182
 distance-0.179
Mill-0.166
Welcome-0.159
Ab-0.156
Token's
Token ID338
Feature activation-1.53e+00

Pos logit contributions
okes+0.043
ish+0.042
our+0.040
ours+0.039
int+0.038
Neg logit contributions
 distance-0.065
 possible-0.064
Mill-0.063
Ar-0.063
Welcome-0.061
Token always
Token ID1464
Feature activation-9.67e-01

Pos logit contributions
ish+0.137
okes+0.136
our+0.129
ours+0.128
ap+0.127
Neg logit contributions
 distance-0.176
Ar-0.174
Mill-0.172
 possible-0.169
Welcome-0.166
Token a
Token ID257
Feature activation-4.20e-01

Pos logit contributions
ish+0.091
okes+0.090
our+0.087
ap+0.087
ours+0.085
Neg logit contributions
 distance-0.106
Ar-0.106
Mill-0.105
Welcome-0.100
 possible-0.100
Token way
Token ID835
Feature activation+3.04e-01

Pos logit contributions
ish+0.044
okes+0.043
our+0.042
ap+0.042
ours+0.042
Neg logit contributions
 distance-0.041
Ar-0.041
Mill-0.041
 possible-0.039
Welcome-0.039
Token to
Token ID284
Feature activation+4.10e-02

Pos logit contributions
Ar+0.038
 distance+0.037
Mill+0.035
Welcome+0.034
Ab+0.034
Neg logit contributions
ish-0.026
okes-0.025
ap-0.025
our-0.024
ours-0.024
Token make
Token ID787
Feature activation+1.19e-01

Pos logit contributions
 distance+0.005
Ar+0.005
Mill+0.005
 possible+0.004
Welcome+0.004
Neg logit contributions
ish-0.004
okes-0.004
our-0.004
ap-0.004
ours-0.004
Token things
Token ID1243
Feature activation+4.43e-01

Pos logit contributions
 possible+0.012
Mill+0.012
 distance+0.011
Ar+0.011
Welcome+0.011
Neg logit contributions
okes-0.013
ish-0.013
our-0.012
ours-0.012
 favorites-0.012
Token her
Token ID607
Feature activation+1.79e-01

Pos logit contributions
okes+0.011
ish+0.011
our+0.011
ours+0.011
 favorites+0.010
Neg logit contributions
 distance-0.012
Mill-0.012
Ar-0.012
Welcome-0.012
 possible-0.012
Token hand
Token ID1021
Feature activation-8.80e-01

Pos logit contributions
 distance+0.019
 possible+0.018
Ar+0.017
Mill+0.017
Welcome+0.017
Neg logit contributions
okes-0.019
ours-0.018
ish-0.017
 favorites-0.017
our-0.017
Token.
Token ID13
Feature activation+4.72e-01

Pos logit contributions
okes+0.083
ish+0.079
ours+0.078
 favorites+0.075
our+0.074
Neg logit contributions
 distance-0.101
 possible-0.095
Ar-0.094
Mill-0.093
Welcome-0.093
Token She
Token ID1375
Feature activation+1.23e-01

Pos logit contributions
 distance+0.042
Mill+0.041
Ar+0.040
Welcome+0.040
 possible+0.039
Neg logit contributions
ish-0.057
okes-0.055
our-0.054
ap-0.053
ours-0.052
Token smiled
Token ID13541
Feature activation+9.87e-01

Pos logit contributions
 distance+0.012
Ar+0.012
Mill+0.011
Ab+0.011
Welcome+0.011
Neg logit contributions
ish-0.014
okes-0.013
our-0.013
ap-0.013
ours-0.013
Token.
Token ID13
Feature activation+4.78e-01

Pos logit contributions
 distance+0.120
 possible+0.117
Ar+0.114
Mill+0.114
Welcome+0.113
Neg logit contributions
okes-0.085
ish-0.083
ours-0.080
our-0.079
 favorites-0.076
Token
Token ID198
Feature activation+8.05e-01

Pos logit contributions
 distance+0.047
Ar+0.046
Mill+0.046
 possible+0.044
Welcome+0.044
Neg logit contributions
ish-0.052
okes-0.051
our-0.049
ours-0.049
ap-0.049
Token
Token ID198
Feature activation+7.85e-01

Pos logit contributions
 distance+0.103
 possible+0.102
Ar+0.099
Mill+0.096
Welcome+0.094
Neg logit contributions
okes-0.064
ours-0.057
ap-0.055
our-0.054
ish-0.053
Token"
Token ID1
Feature activation+3.50e-01

Pos logit contributions
 possible+0.090
 distance+0.088
Ar+0.084
Mill+0.081
keeping+0.078
Neg logit contributions
okes-0.072
ap-0.067
ours-0.067
our-0.064
ish-0.063
TokenCome
Token ID16773
Feature activation+1.44e+00

Pos logit contributions
 distance+0.027
 possible+0.027
Ar+0.025
Mill+0.025
 winding+0.024
Neg logit contributions
okes-0.040
ours-0.039
our-0.038
ags-0.037
ish-0.037
Token on
Token ID319
Feature activation+9.87e-01

Pos logit contributions
 distance+0.125
Ar+0.120
Welcome+0.118
Mill+0.117
 possible+0.112
Neg logit contributions
okes-0.172
ish-0.165
ours-0.163
our-0.163
 favorites-0.161
Token Jill
Token ID22772
Feature activation+7.54e-01

Pos logit contributions
okes+0.031
ish+0.031
our+0.030
ours+0.030
ap+0.029
Neg logit contributions
 distance-0.035
Ar-0.035
Mill-0.034
 possible-0.033
Welcome-0.033
Token noticed
Token ID6810
Feature activation-1.85e-01

Pos logit contributions
 distance+0.087
Ar+0.086
Mill+0.085
 possible+0.084
Welcome+0.082
Neg logit contributions
okes-0.067
ish-0.067
our-0.064
ours-0.063
ap-0.062
Token something
Token ID1223
Feature activation-8.76e-02

Pos logit contributions
okes+0.016
ish+0.016
our+0.015
ours+0.015
ap+0.015
Neg logit contributions
Ar-0.022
 distance-0.022
Mill-0.021
 possible-0.021
Welcome-0.020
Token surprising
Token ID6452
Feature activation+2.45e-01

Pos logit contributions
ish+0.009
okes+0.009
ours+0.008
our+0.008
ap+0.008
Neg logit contributions
Ar-0.009
Mill-0.009
 distance-0.009
Welcome-0.008
Ab-0.008
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
Ar+0.032
 distance+0.031
Mill+0.031
 possible+0.030
Welcome+0.030
Neg logit contributions
ish-0.019
okes-0.018
our-0.018
ours-0.017
ap-0.017
Token There
Token ID1318
Feature activation-1.74e+00

Pos logit contributions
ish+0.029
okes+0.027
ishes+0.026
our+0.026
ap+0.025
Neg logit contributions
Mill-0.027
 distance-0.026
Welcome-0.026
Ar-0.025
Ab-0.025
Token were
Token ID547
Feature activation-6.52e-02

Pos logit contributions
ish+0.177
okes+0.176
our+0.172
ap+0.169
ours+0.168
Neg logit contributions
Ar-0.181
 distance-0.178
Mill-0.172
 possible-0.170
Welcome-0.165
Token two
Token ID734
Feature activation+4.98e-01

Pos logit contributions
 favorites+0.005
ish+0.005
ours+0.005
okes+0.005
 eyes+0.005
Neg logit contributions
Ar-0.008
 distance-0.007
Mill-0.007
Welcome-0.007
 possible-0.007
Token little
Token ID1310
Feature activation-5.81e-01

Pos logit contributions
 distance+0.050
Ar+0.050
Mill+0.049
 possible+0.048
Welcome+0.047
Neg logit contributions
ish-0.052
okes-0.052
our-0.049
ours-0.049
ap-0.048
Token pink
Token ID11398
Feature activation+7.94e-01

Pos logit contributions
ish+0.052
okes+0.052
our+0.049
ours+0.049
ap+0.048
Neg logit contributions
 distance-0.067
Ar-0.066
Mill-0.066
 possible-0.065
Welcome-0.063
Token beans
Token ID16567
Feature activation-2.70e-01

Pos logit contributions
 distance+0.100
Ar+0.100
Mill+0.098
 possible+0.097
Welcome+0.095
Neg logit contributions
ish-0.064
okes-0.063
our-0.060
ours-0.059
ap-0.058
Token a
Token ID257
Feature activation-2.17e-01

Pos logit contributions
okes+0.025
ish+0.025
ap+0.023
our+0.023
ours+0.023
Neg logit contributions
 distance-0.034
Ar-0.034
 possible-0.033
Mill-0.033
Welcome-0.032
Token loud
Token ID7812
Feature activation+1.10e+00

Pos logit contributions
ish+0.021
ap+0.021
our+0.020
okes+0.019
 brother+0.019
Neg logit contributions
 possible-0.024
Ar-0.022
Mill-0.022
keeping-0.021
 distance-0.021
Token noise
Token ID7838
Feature activation+1.18e+00

Pos logit contributions
 possible+0.100
Ar+0.099
 distance+0.099
Mill+0.097
Ab+0.092
Neg logit contributions
ish-0.125
okes-0.123
our-0.121
ap-0.120
ours-0.116
Token and
Token ID290
Feature activation-7.58e-02

Pos logit contributions
Ar+0.171
 possible+0.165
Mill+0.164
 distance+0.162
Ab+0.161
Neg logit contributions
ish-0.073
okes-0.072
our-0.070
ap-0.069
int-0.062
Token he
Token ID339
Feature activation+7.05e-01

Pos logit contributions
okes+0.007
ish+0.006
 brother+0.006
our+0.006
ours+0.006
Neg logit contributions
 possible-0.008
Ar-0.008
 distance-0.008
Mill-0.008
Ab-0.008
Token looked
Token ID3114
Feature activation-1.79e+00

Pos logit contributions
 possible+0.085
 distance+0.079
Mill+0.078
Welcome+0.077
Ar+0.077
Neg logit contributions
ish-0.063
okes-0.063
our-0.060
int-0.059
ap-0.058
Token up
Token ID510
Feature activation-9.46e-02

Pos logit contributions
ish+0.181
okes+0.180
our+0.171
ours+0.170
ap+0.168
Neg logit contributions
 distance-0.188
Ar-0.187
Mill-0.185
 possible-0.182
Welcome-0.177
Token.
Token ID13
Feature activation-9.99e-02

Pos logit contributions
ish+0.004
 at+0.004
our+0.004
okes+0.004
ap+0.004
Neg logit contributions
Ar-0.015
Mill-0.015
Welcome-0.014
 possible-0.014
Ab-0.014
Token There
Token ID1318
Feature activation-1.61e+00

Pos logit contributions
ish+0.009
okes+0.009
our+0.009
ours+0.009
ap+0.009
Neg logit contributions
 distance-0.011
Ar-0.011
Mill-0.011
 possible-0.011
Welcome-0.010
Token was
Token ID373
Feature activation+9.20e-02

Pos logit contributions
ish+0.152
okes+0.148
our+0.148
ap+0.145
ours+0.138
Neg logit contributions
Ar-0.176
 distance-0.174
 possible-0.174
Mill-0.172
Welcome-0.167
Token a
Token ID257
Feature activation-2.11e-01

Pos logit contributions
Ar+0.012
Mill+0.012
 possible+0.012
 distance+0.012
Welcome+0.012
Neg logit contributions
okes-0.006
 brother-0.005
 favorites-0.005
ours-0.005
ish-0.005
Token grinned
Token ID36268
Feature activation+1.03e+00

Pos logit contributions
okes+0.020
ours+0.018
ish+0.018
our+0.018
 favorites+0.018
Neg logit contributions
Ar-0.028
 distance-0.027
 possible-0.026
Welcome-0.025
Mill-0.025
Token with
Token ID351
Feature activation+1.31e-01

Pos logit contributions
 distance+0.134
Ar+0.133
Mill+0.131
 possible+0.130
Welcome+0.127
Neg logit contributions
okes-0.078
ish-0.078
our-0.073
ours-0.073
ap-0.071
Token joy
Token ID8716
Feature activation+8.89e-01

Pos logit contributions
Ar+0.015
 distance+0.015
Mill+0.014
 possible+0.014
Ab+0.014
Neg logit contributions
ish-0.012
ap-0.012
okes-0.012
 favorites-0.011
ours-0.011
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
Ar+0.125
 distance+0.123
Mill+0.119
 possible+0.118
Welcome+0.117
Neg logit contributions
ish-0.061
okes-0.060
our-0.059
ap-0.055
ours-0.055
Token She
Token ID1375
Feature activation-2.09e-01

Pos logit contributions
 distance+0.013
Ar+0.013
 possible+0.012
Mill+0.012
Welcome+0.012
Neg logit contributions
okes-0.011
ish-0.011
our-0.010
ours-0.010
ap-0.010
Token looked
Token ID3114
Feature activation-1.82e+00

Pos logit contributions
okes+0.022
ish+0.022
our+0.021
ours+0.021
ap+0.021
Neg logit contributions
 distance-0.021
Ar-0.020
Mill-0.020
 possible-0.020
Welcome-0.019
Token around
Token ID1088
Feature activation-2.38e-01

Pos logit contributions
okes+0.119
ish+0.110
our+0.109
ours+0.107
ap+0.105
Neg logit contributions
 distance-0.260
Mill-0.252
 possible-0.245
Ar-0.244
Welcome-0.244
Token and
Token ID290
Feature activation-3.45e-01

Pos logit contributions
okes+0.019
ish+0.019
ours+0.018
our+0.017
ap+0.017
Neg logit contributions
 distance-0.031
Mill-0.030
Ar-0.030
 possible-0.029
Welcome-0.029
Token asked
Token ID1965
Feature activation+9.51e-02

Pos logit contributions
okes+0.031
ish+0.031
our+0.029
ours+0.029
ap+0.028
Neg logit contributions
Ar-0.040
 distance-0.040
Mill-0.039
 possible-0.039
Welcome-0.038
Token her
Token ID607
Feature activation-3.41e-02

Pos logit contributions
 distance+0.009
Mill+0.009
Ar+0.009
 possible+0.009
Welcome+0.009
Neg logit contributions
ish-0.010
okes-0.010
our-0.010
ours-0.010
ap-0.009
Token mom
Token ID1995
Feature activation-6.98e-01

Pos logit contributions
okes+0.003
our+0.003
ish+0.003
ours+0.003
int+0.003
Neg logit contributions
Mill-0.004
 possible-0.004
 distance-0.004
Ar-0.003
Ab-0.003
Token flew
Token ID13112
Feature activation+3.42e-01

Pos logit contributions
 distance+0.001
Ar+0.001
Mill+0.001
 possible+0.001
Welcome+0.001
Neg logit contributions
okes-0.002
ish-0.002
ours-0.002
our-0.002
ap-0.002
Token to
Token ID284
Feature activation+6.88e-02

Pos logit contributions
Mill+0.039
Ar+0.039
Ab+0.038
Welcome+0.038
 distance+0.037
Neg logit contributions
ish-0.031
ap-0.028
okes-0.027
our-0.027
ishes-0.025
Token the
Token ID262
Feature activation-2.92e-01

Pos logit contributions
 distance+0.007
Ar+0.007
Mill+0.007
 possible+0.007
Welcome+0.007
Neg logit contributions
ish-0.006
okes-0.006
our-0.006
ap-0.006
ours-0.006
Token mountain
Token ID8598
Feature activation+2.57e-01

Pos logit contributions
ish+0.035
our+0.033
okes+0.033
ap+0.033
ours+0.032
Neg logit contributions
Mill-0.025
Ar-0.024
Welcome-0.023
Ab-0.023
 possible-0.023
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
 distance+0.033
Ar+0.032
 possible+0.032
Mill+0.032
Welcome+0.030
Neg logit contributions
okes-0.020
ish-0.020
ours-0.019
our-0.019
 favorites-0.018
Token At
Token ID1629
Feature activation-1.97e+00

Pos logit contributions
ish+0.010
okes+0.010
our+0.010
ap+0.010
ours+0.009
Neg logit contributions
 distance-0.011
Ar-0.011
Mill-0.010
Welcome-0.010
 possible-0.010
Token the
Token ID262
Feature activation-2.62e-01

Pos logit contributions
our+0.174
ish+0.173
ap+0.170
okes+0.170
ours+0.166
Neg logit contributions
Mill-0.221
Ar-0.219
Welcome-0.214
Ab-0.210
 distance-0.206
Token top
Token ID1353
Feature activation+8.61e-01

Pos logit contributions
ish+0.030
our+0.028
ap+0.028
okes+0.027
 favorites+0.027
Neg logit contributions
Mill-0.025
Ar-0.023
Welcome-0.022
Ab-0.022
 distance-0.022
Token was
Token ID373
Feature activation-2.06e-02

Pos logit contributions
 distance+0.104
Ar+0.100
 possible+0.099
Mill+0.099
Welcome+0.096
Neg logit contributions
okes-0.074
ish-0.074
ours-0.070
our-0.069
ap-0.068
Token a
Token ID257
Feature activation-3.69e-01

Pos logit contributions
okes+0.002
ish+0.002
ours+0.002
our+0.002
ap+0.002
Neg logit contributions
Ar-0.002
 distance-0.002
Mill-0.002
Welcome-0.002
 possible-0.002
Token healing
Token ID11516
Feature activation-3.45e-01

Pos logit contributions
ish+0.039
ap+0.037
our+0.036
okes+0.036
 favorites+0.035
Neg logit contributions
Mill-0.035
Ar-0.034
Welcome-0.033
 possible-0.033
Ab-0.033
Token she
Token ID673
Feature activation+7.03e-01

Pos logit contributions
ish+0.003
okes+0.003
our+0.002
ours+0.002
ap+0.002
Neg logit contributions
Ar-0.002
 distance-0.002
Mill-0.002
 possible-0.002
Welcome-0.002
Token decided
Token ID3066
Feature activation+2.03e+00

Pos logit contributions
 distance+0.067
Ar+0.066
Mill+0.062
Welcome+0.062
 possible+0.060
Neg logit contributions
okes-0.076
ish-0.076
ours-0.074
our-0.074
ap-0.073
Token to
Token ID284
Feature activation+7.91e-02

Pos logit contributions
Ar+0.274
 distance+0.260
Mill+0.253
Welcome+0.244
 possible+0.243
Neg logit contributions
okes-0.157
ish-0.148
our-0.147
ours-0.140
ap-0.137
Token take
Token ID1011
Feature activation+2.62e-01

Pos logit contributions
 distance+0.008
Ar+0.008
Mill+0.008
 possible+0.008
Welcome+0.008
Neg logit contributions
okes-0.008
ish-0.008
ours-0.008
our-0.007
 favorites-0.007
Token it
Token ID340
Feature activation+5.51e-01

Pos logit contributions
 distance+0.026
Ar+0.026
Mill+0.026
 possible+0.025
Welcome+0.025
Neg logit contributions
okes-0.028
ish-0.028
our-0.027
ours-0.026
ap-0.026
Token out
Token ID503
Feature activation-1.69e+00

Pos logit contributions
 distance+0.054
 possible+0.050
Ar+0.048
Mill+0.047
Welcome+0.044
Neg logit contributions
ours-0.067
okes-0.067
our-0.064
ish-0.063
int-0.059
Token of
Token ID286
Feature activation+1.01e+00

Pos logit contributions
ish+0.153
okes+0.153
our+0.145
ours+0.145
 favorites+0.139
Neg logit contributions
 distance-0.196
 possible-0.189
Mill-0.187
Ar-0.186
Welcome-0.180
Token her
Token ID607
Feature activation+2.99e-03

Pos logit contributions
 distance+0.095
Ar+0.094
Mill+0.093
 possible+0.091
Welcome+0.089
Neg logit contributions
okes-0.111
ish-0.111
our-0.107
ours-0.106
ap-0.105
Token pocket
Token ID10000
Feature activation-1.05e+00

Pos logit contributions
 distance+0.000
Ar+0.000
 path+0.000
 possible+0.000
Mill+0.000
Neg logit contributions
okes-0.000
ours-0.000
int-0.000
oking-0.000
ids-0.000
Token and
Token ID290
Feature activation-1.49e-01

Pos logit contributions
okes+0.104
ours+0.101
ish+0.099
ap+0.094
our+0.092
Neg logit contributions
 distance-0.119
 possible-0.110
Mill-0.105
Ar-0.103
 path-0.101
Token have
Token ID423
Feature activation-3.48e-01

Pos logit contributions
ish+0.013
okes+0.013
our+0.013
ours+0.012
ap+0.012
Neg logit contributions
 distance-0.017
Ar-0.017
Mill-0.017
 possible-0.017
Welcome-0.016