TOP ACTIVATIONS
MAX = 0.158

and the way the paint smelled as he was rolling it
it and see how it smelled . Ben wanted to leave
turned dark and the air smelled funny . Amy had to
shining and how the flowers smelled nice . Benny was happy
her nose at how it smelled . But then , she
als and the way it smelled . The little girl couldn
He wrote about how food smelled in the oven .
it looked , how it smelled , and how it tasted
delicious everything inside the grill smelled ! " Do
on the table and it smelled so good ! The little
grill and he thought it smelled so good ! He decided
and picked up something that smelled spicy . Finally
took it out and it smelled so good ! They let
of the oven and it smelled so good . She put
pie was apple and it smelled good . Sara and Grand
affle was ready and it smelled so good . Stacy picked
cooked the mixture until it smelled y ummy .
while . Soon , the tad p oles became frogs and
, but he thought it smelled good . Max jumped into
mom baked some cookies that smelled so good , Tim my

MIN ACTIVATIONS
MIN = -0.150

him . But to no avail , they were too scared
out , but to no avail . He felt sad and
times , but to no avail . Jane 's
deep paws , to no avail . He finally thought of
and he was su pr ised at how quickly he started
to use it with adult supervision ," Mr . Lee said
, but only with our supervision . And maybe you can
said , " I like Tues days too ! It 's
a little girl who loved Tues days . Every Tuesday ,
little girl was su pr ised . She had never heard
clown s juggling and ac rob ats doing flips . But
You don 't need to shave your face . You are
with excitement . He loved Tues days so much because they
and T idy were in se per able . They would
, he was su pr ised to see a gray bird
holding things . He pract ised every day , often with
cool ! Sammy was mesmer ised by its beauty . His
horn . He was mesmer ised by its beauty and wanted
ep inside and su pr ised himself when he saw that
blue wheels and pink handle bars . She liked to go

INTERVAL 0.081 - 0.158
CONTAINS 0.099%

hopped around and explored the swamp . They made sure the
! I wonder if it smells nice ."
park or bake cookies . How does that sound ?"
as he watched the rain come down . Jack
but be careful . The bush has th orns ." Mia

INTERVAL 0.004 - 0.081
CONTAINS 42.537%

in something , but it 's also okay if it doesn
, Lily saw a bird with a broken wing on the
friends that a factory up the river was dumping pink water
forest with her friends . One day , they saw a
to . At the market , he saw lots of people

INTERVAL -0.073 - 0.004
CONTAINS 57.166%

" What do you mean ?" asked Lucy .
was good to try new things and speak to each other
the rabbit and squirrel vowed to never separate again . They
instead of fear . From then on , the er aser
No , Tom ! That sugar is disgusting ! â €

INTERVAL -0.150 - -0.073
CONTAINS 0.197%

on the ground . Sadd ened , the rabbit thanked the
felt sad and angry at each other . But
a good holiday , no matter what . They said they
She played for hours and hours until she was too tired
wanted to have a big muscle too . They thought of
Token and
Token ID290
Feature activation+4.02e-04

Pos logit contributions
 pods+0.012
 lav+0.012
 tad+0.012
 bou+0.012
mary+0.011
Neg logit contributions
 rider-0.009
Brian-0.007
John-0.007
Paul-0.007
lf-0.007
Token the
Token ID262
Feature activation-1.30e-02

Pos logit contributions
 pods+0.001
 lav+0.001
 tad+0.001
 bou+0.001
mary+0.001
Neg logit contributions
 rider-0.001
Brian-0.000
John-0.000
Paul-0.000
Thomas-0.000
Token way
Token ID835
Feature activation-1.44e-02

Pos logit contributions
 rider+0.023
lf+0.019
Brian+0.019
 riding+0.019
John+0.019
Neg logit contributions
 pods-0.024
 tad-0.023
 lav-0.023
mary-0.022
 bou-0.022
Token the
Token ID262
Feature activation+1.95e-02

Pos logit contributions
 rider+0.020
Brian+0.016
lf+0.016
John+0.016
Thomas+0.015
Neg logit contributions
 pods-0.031
 tad-0.030
 lav-0.029
 bou-0.029
mary-0.028
Token paint
Token ID7521
Feature activation+5.00e-02

Pos logit contributions
 pods+0.036
 tad+0.036
 lav+0.035
 bou+0.034
mary+0.034
Neg logit contributions
 rider-0.034
lf-0.029
Brian-0.029
 riding-0.028
John-0.028
Token smelled
Token ID37609
Feature activation+1.58e-01

Pos logit contributions
 pods+0.098
 tad+0.092
 lav+0.092
 bou+0.090
mary+0.088
Neg logit contributions
 rider-0.079
Brian-0.071
John-0.068
Paul-0.068
Thomas-0.068
Token as
Token ID355
Feature activation-2.59e-02

Pos logit contributions
 pods+0.331
 tad+0.330
 lav+0.327
 bou+0.316
mary+0.310
Neg logit contributions
 rider-0.226
 riding-0.189
John-0.187
lf-0.184
Brian-0.183
Token he
Token ID339
Feature activation-2.35e-02

Pos logit contributions
 rider+0.033
Brian+0.027
lf+0.026
Paul+0.026
John+0.026
Neg logit contributions
 pods-0.060
 lav-0.060
 tad-0.059
 bou-0.057
mary-0.056
Token was
Token ID373
Feature activation+6.77e-03

Pos logit contributions
 rider+0.035
lf+0.030
Brian+0.029
 riding+0.029
Paul+0.029
Neg logit contributions
 pods-0.049
 tad-0.048
 lav-0.048
 bou-0.047
mary-0.044
Token rolling
Token ID10708
Feature activation+2.72e-02

Pos logit contributions
 pods+0.015
 tad+0.014
 lav+0.014
 bou+0.014
mary+0.013
Neg logit contributions
 rider-0.010
 riding-0.008
lf-0.008
Brian-0.008
John-0.008
Token it
Token ID340
Feature activation+6.12e-03

Pos logit contributions
 pods+0.064
 tad+0.063
 lav+0.063
 bou+0.061
mary+0.060
Neg logit contributions
 rider-0.033
Brian-0.027
John-0.026
lf-0.026
Paul-0.026
Token it
Token ID340
Feature activation+1.55e-02

Pos logit contributions
 pods+0.148
 lav+0.146
 tad+0.146
 bou+0.143
mary+0.140
Neg logit contributions
 rider-0.047
Brian-0.034
John-0.033
 riding-0.033
Paul-0.032
Token and
Token ID290
Feature activation-5.00e-03

Pos logit contributions
 lav+0.040
 tad+0.039
 pulp+0.038
 pods+0.038
 bou+0.037
Neg logit contributions
 rider-0.014
 his-0.013
 riding-0.012
 day-0.011
His-0.011
Token see
Token ID766
Feature activation-1.28e-03

Pos logit contributions
 rider+0.007
lf+0.006
 riding+0.006
John+0.006
 his+0.006
Neg logit contributions
 pods-0.011
 lav-0.011
 tad-0.010
mary-0.010
 bou-0.010
Token how
Token ID703
Feature activation-1.22e-02

Pos logit contributions
 rider+0.002
 riding+0.002
John+0.002
Brian+0.002
Paul+0.002
Neg logit contributions
 pods-0.003
 tad-0.002
 lav-0.002
 bou-0.002
mary-0.002
Token it
Token ID340
Feature activation+1.78e-02

Pos logit contributions
 rider+0.008
John+0.006
working+0.006
 riding+0.006
His+0.006
Neg logit contributions
 lav-0.034
 pods-0.034
 tad-0.034
 bou-0.034
mary-0.033
Token smelled
Token ID37609
Feature activation+1.57e-01

Pos logit contributions
 bou+0.041
 lav+0.039
 pods+0.039
 tad+0.038
umes+0.037
Neg logit contributions
 rider-0.019
 rode-0.017
 he-0.016
lf-0.016
 hit-0.016
Token.
Token ID13
Feature activation-8.00e-03

Pos logit contributions
 tad+0.383
 pods+0.375
 lav+0.367
 bou+0.366
flies+0.366
Neg logit contributions
 rider-0.147
 time-0.128
 riding-0.125
 his-0.125
 day-0.117
Token Ben
Token ID3932
Feature activation-3.16e-02

Pos logit contributions
 rider+0.011
Brian+0.010
John+0.009
 riding+0.009
Paul+0.009
Neg logit contributions
 pods-0.017
 lav-0.017
 tad-0.017
 bou-0.016
mary-0.016
Token wanted
Token ID2227
Feature activation+4.67e-03

Pos logit contributions
 rider+0.034
John+0.028
lf+0.028
His+0.027
Tom+0.026
Neg logit contributions
 lav-0.076
 bou-0.075
 pods-0.075
 tad-0.074
ount-0.069
Token to
Token ID284
Feature activation-1.99e-03

Pos logit contributions
 bou+0.013
 tad+0.013
 pods+0.012
 lav+0.012
flies+0.012
Neg logit contributions
 rider-0.003
 his-0.003
 riding-0.003
John-0.003
His-0.002
Token leave
Token ID2666
Feature activation-1.72e-02

Pos logit contributions
 rider+0.003
His+0.003
 riding+0.003
John+0.003
lf+0.003
Neg logit contributions
 pods-0.004
 lav-0.004
 bou-0.004
 tad-0.004
flies-0.004
Token turned
Token ID2900
Feature activation+7.18e-02

Pos logit contributions
 lav+0.039
 pods+0.039
 tad+0.039
 bou+0.038
mary+0.036
Neg logit contributions
 rider-0.024
lf-0.020
 riding-0.020
John-0.020
Brian-0.019
Token dark
Token ID3223
Feature activation+4.17e-02

Pos logit contributions
 pods+0.154
 lav+0.153
 tad+0.153
 bou+0.148
mary+0.142
Neg logit contributions
 rider-0.102
Brian-0.084
 riding-0.082
John-0.082
lf-0.081
Token and
Token ID290
Feature activation+5.17e-02

Pos logit contributions
 pods+0.097
 tad+0.096
 lav+0.096
 bou+0.093
mary+0.090
Neg logit contributions
 rider-0.053
Brian-0.042
lf-0.042
 riding-0.042
John-0.041
Token the
Token ID262
Feature activation+2.19e-02

Pos logit contributions
 pods+0.113
 lav+0.113
 tad+0.112
 bou+0.109
mary+0.106
Neg logit contributions
 rider-0.072
Brian-0.060
lf-0.059
John-0.059
 riding-0.057
Token air
Token ID1633
Feature activation-5.12e-03

Pos logit contributions
 lav+0.048
 pods+0.048
 tad+0.047
 bou+0.046
mary+0.046
Neg logit contributions
 rider-0.031
 riding-0.025
lf-0.025
Brian-0.025
John-0.025
Token smelled
Token ID37609
Feature activation+1.53e-01

Pos logit contributions
 rider+0.007
lf+0.006
John+0.006
 riding+0.006
Brian+0.005
Neg logit contributions
 lav-0.011
 pods-0.011
 tad-0.011
 bou-0.011
 basil-0.010
Token funny
Token ID8258
Feature activation+8.01e-02

Pos logit contributions
 tad+0.356
 pods+0.352
 lav+0.351
 bou+0.345
flies+0.344
Neg logit contributions
 rider-0.163
 time-0.158
 riding-0.140
 day-0.138
 his-0.138
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
 tad+0.227
 lav+0.227
 bou+0.216
mary+0.215
 basil+0.213
Neg logit contributions
 rider-0.046
 riding-0.041
 time-0.039
lf-0.039
 day-0.037
Token Amy
Token ID14235
Feature activation+1.23e-02

Pos logit contributions
 lav+0.046
 tad+0.046
 pods+0.046
 bou+0.044
mary+0.043
Neg logit contributions
 rider-0.032
Brian-0.027
John-0.026
lf-0.026
 riding-0.026
Token had
Token ID550
Feature activation-2.36e-02

Pos logit contributions
 pods+0.029
 lav+0.029
 bou+0.029
 tad+0.029
 basil+0.027
Neg logit contributions
 rider-0.014
 riding-0.011
lf-0.011
 rode-0.011
John-0.011
Token to
Token ID284
Feature activation-1.85e-02

Pos logit contributions
 rider+0.029
 riding+0.024
John+0.024
Brian+0.024
Paul+0.023
Neg logit contributions
 pods-0.055
 lav-0.055
 tad-0.054
 bou-0.053
mary-0.051
Token shining
Token ID22751
Feature activation+5.80e-02

Pos logit contributions
 tad+0.135
 pods+0.134
 lav+0.132
 bou+0.128
mary+0.122
Neg logit contributions
 rider-0.096
 riding-0.082
John-0.079
lf-0.079
working-0.079
Token and
Token ID290
Feature activation+4.38e-02

Pos logit contributions
 tad+0.138
 lav+0.137
 pods+0.136
 bou+0.132
mary+0.128
Neg logit contributions
 rider-0.069
 riding-0.053
John-0.052
Brian-0.051
working-0.051
Token how
Token ID703
Feature activation+3.85e-03

Pos logit contributions
 pods+0.094
 tad+0.094
 lav+0.093
 bou+0.090
mary+0.088
Neg logit contributions
 rider-0.062
 riding-0.051
Brian-0.051
lf-0.051
John-0.051
Token the
Token ID262
Feature activation+3.44e-02

Pos logit contributions
 pods+0.008
 tad+0.008
 lav+0.008
 bou+0.008
mary+0.007
Neg logit contributions
 rider-0.006
Brian-0.005
lf-0.005
Paul-0.005
John-0.005
Token flowers
Token ID12734
Feature activation+5.93e-02

Pos logit contributions
 pods+0.062
 lav+0.061
 tad+0.061
 bou+0.059
mary+0.057
Neg logit contributions
 rider-0.062
Brian-0.054
John-0.053
lf-0.052
 riding-0.052
Token smelled
Token ID37609
Feature activation+1.52e-01

Pos logit contributions
 lav+0.117
 pods+0.117
 tad+0.116
 bou+0.114
mary+0.107
Neg logit contributions
 rider-0.088
 riding-0.076
lf-0.074
working-0.072
 day-0.071
Token nice
Token ID3621
Feature activation+6.51e-02

Pos logit contributions
 tad+0.349
 pods+0.347
 lav+0.338
 bou+0.328
mary+0.321
Neg logit contributions
 rider-0.181
 riding-0.151
John-0.141
 time-0.137
working-0.137
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
 tad+0.161
 lav+0.158
 pods+0.155
 bou+0.154
mary+0.151
Neg logit contributions
 rider-0.062
 riding-0.050
 day-0.045
 days-0.045
working-0.043
Token Benny
Token ID44275
Feature activation+2.45e-04

Pos logit contributions
 tad+0.037
 lav+0.037
 pods+0.037
 bou+0.036
mary+0.034
Neg logit contributions
 rider-0.016
Brian-0.014
His-0.013
 riding-0.013
Thomas-0.013
Token was
Token ID373
Feature activation-8.83e-03

Pos logit contributions
 pods+0.001
 lav+0.001
 tad+0.001
 bou+0.000
 basil+0.000
Neg logit contributions
 rider-0.000
lf-0.000
Brian-0.000
John-0.000
 riding-0.000
Token happy
Token ID3772
Feature activation+6.29e-03

Pos logit contributions
 rider+0.013
 riding+0.011
John+0.011
Brian+0.011
Paul+0.011
Neg logit contributions
 pods-0.019
 lav-0.019
 tad-0.018
 bou-0.018
mary-0.017
Token her
Token ID607
Feature activation+2.55e-02

Pos logit contributions
 pods+0.015
 lav+0.015
 tad+0.015
 bou+0.015
mary+0.014
Neg logit contributions
 rider-0.009
Brian-0.008
John-0.007
lf-0.007
 riding-0.007
Token nose
Token ID9686
Feature activation-1.13e-03

Pos logit contributions
 pods+0.045
 tad+0.045
 lav+0.045
 bou+0.043
mary+0.042
Neg logit contributions
 rider-0.046
 riding-0.040
lf-0.039
Brian-0.039
John-0.038
Token at
Token ID379
Feature activation-2.00e-03

Pos logit contributions
 rider+0.001
 riding+0.001
lf+0.001
John+0.001
Brian+0.001
Neg logit contributions
 tad-0.003
 lav-0.003
 pods-0.003
 bou-0.003
mary-0.002
Token how
Token ID703
Feature activation-1.25e-02

Pos logit contributions
 rider+0.003
Brian+0.002
John+0.002
lf+0.002
Paul+0.002
Neg logit contributions
 pods-0.004
 lav-0.004
 tad-0.004
 bou-0.004
mary-0.004
Token it
Token ID340
Feature activation+1.85e-02

Pos logit contributions
 rider+0.015
John+0.014
 riding+0.013
working+0.013
His+0.013
Neg logit contributions
 lav-0.027
 tad-0.027
 pods-0.027
 bou-0.027
flies-0.026
Token smelled
Token ID37609
Feature activation+1.50e-01

Pos logit contributions
 bou+0.040
 lav+0.039
 tad+0.039
 pods+0.039
 basil+0.036
Neg logit contributions
 rider-0.025
lf-0.022
working-0.020
Brian-0.019
 riding-0.019
Token.
Token ID13
Feature activation+1.06e-02

Pos logit contributions
 tad+0.333
 pods+0.326
 lav+0.322
 bou+0.316
mary+0.306
Neg logit contributions
 rider-0.189
 riding-0.158
John-0.155
lf-0.155
His-0.150
Token But
Token ID887
Feature activation-2.92e-02

Pos logit contributions
 pods+0.022
 tad+0.022
 lav+0.022
 bou+0.021
mary+0.020
Neg logit contributions
 rider-0.015
Brian-0.014
John-0.013
 riding-0.013
Paul-0.013
Token then
Token ID788
Feature activation-4.26e-02

Pos logit contributions
 rider+0.037
Brian+0.030
lf+0.029
John+0.029
 riding+0.029
Neg logit contributions
 pods-0.068
 lav-0.067
 tad-0.067
 bou-0.065
mary-0.063
Token,
Token ID11
Feature activation-3.71e-03

Pos logit contributions
 rider+0.057
Brian+0.047
John+0.045
lf+0.045
Paul+0.045
Neg logit contributions
 pods-0.097
 lav-0.096
 tad-0.095
 bou-0.092
mary-0.090
Token she
Token ID673
Feature activation-1.13e-02

Pos logit contributions
 rider+0.005
Brian+0.004
John+0.004
 riding+0.004
lf+0.004
Neg logit contributions
 pods-0.008
 tad-0.008
 lav-0.008
 bou-0.008
mary-0.008
Tokenals
Token ID874
Feature activation+5.09e-02

Pos logit contributions
 tad+0.097
 pods+0.095
 lav+0.092
 bou+0.088
 scr+0.088
Neg logit contributions
 rider-0.151
 riding-0.127
 day-0.125
working-0.123
lf-0.122
Token and
Token ID290
Feature activation-3.63e-03

Pos logit contributions
 tad+0.132
 lav+0.128
 pods+0.128
 bou+0.122
 pulp+0.120
Neg logit contributions
 rider-0.046
 riding-0.038
working-0.034
lf-0.033
 day-0.032
Token the
Token ID262
Feature activation-1.34e-02

Pos logit contributions
 rider+0.005
 riding+0.004
lf+0.004
working+0.004
John+0.004
Neg logit contributions
 pods-0.008
 tad-0.008
 lav-0.008
 bou-0.007
mary-0.007
Token way
Token ID835
Feature activation-1.69e-02

Pos logit contributions
 rider+0.022
 day+0.019
 riding+0.019
 time+0.018
lf+0.018
Neg logit contributions
 pods-0.024
 tad-0.023
 lav-0.023
mary-0.022
ount-0.022
Token it
Token ID340
Feature activation+1.82e-02

Pos logit contributions
 rider+0.020
 riding+0.016
Brian+0.016
Paul+0.015
John+0.015
Neg logit contributions
 lav-0.040
 pods-0.040
 tad-0.040
 bou-0.039
mary-0.037
Token smelled
Token ID37609
Feature activation+1.50e-01

Pos logit contributions
 tad+0.038
 pods+0.038
 lav+0.037
 bou+0.036
 pulp+0.034
Neg logit contributions
 rider-0.025
lf-0.020
 riding-0.020
working-0.020
 rode-0.019
Token.
Token ID13
Feature activation+9.92e-03

Pos logit contributions
 tad+0.332
 pods+0.330
 lav+0.323
 bou+0.313
flies+0.309
Neg logit contributions
 rider-0.186
 riding-0.158
John-0.147
lf-0.147
working-0.145
Token The
Token ID383
Feature activation+3.14e-02

Pos logit contributions
 pods+0.021
 tad+0.021
 lav+0.020
 bou+0.020
mary+0.019
Neg logit contributions
 rider-0.015
Brian-0.013
 riding-0.013
John-0.012
Paul-0.012
Token little
Token ID1310
Feature activation+6.20e-02

Pos logit contributions
 pods+0.070
 tad+0.067
 lav+0.066
flies+0.065
mary+0.065
Neg logit contributions
 rider-0.040
lf-0.035
 riding-0.034
 day-0.032
 musician-0.031
Token girl
Token ID2576
Feature activation+1.61e-02

Pos logit contributions
 pods+0.130
 lav+0.127
 tad+0.125
mary+0.124
 scr+0.123
Neg logit contributions
 rider-0.087
 riding-0.068
 day-0.068
working-0.067
 musician-0.066
Token couldn
Token ID3521
Feature activation-6.90e-03

Pos logit contributions
 pods+0.037
 lav+0.036
 tad+0.036
 bou+0.035
 basil+0.033
Neg logit contributions
 rider-0.021
lf-0.017
 riding-0.017
John-0.016
working-0.016
Token He
Token ID679
Feature activation-1.30e-02

Pos logit contributions
 pods+0.027
 tad+0.027
 lav+0.027
 bou+0.026
mary+0.025
Neg logit contributions
 rider-0.015
Brian-0.012
lf-0.012
Paul-0.012
John-0.012
Token wrote
Token ID2630
Feature activation-1.93e-02

Pos logit contributions
 rider+0.020
Brian+0.017
John+0.016
Paul+0.016
 riding+0.016
Neg logit contributions
 pods-0.027
 lav-0.027
 tad-0.027
 bou-0.026
mary-0.025
Token about
Token ID546
Feature activation-2.71e-02

Pos logit contributions
 rider+0.033
Brian+0.028
working+0.028
 runner+0.027
Paul+0.027
Neg logit contributions
 pods-0.034
 lav-0.034
 scr-0.033
 smells-0.032
 bou-0.032
Token how
Token ID703
Feature activation-1.01e-02

Pos logit contributions
 rider+0.032
 riding+0.027
lf+0.027
John+0.025
Brian+0.025
Neg logit contributions
 pods-0.064
 tad-0.064
 lav-0.062
mary-0.061
 bou-0.061
Token food
Token ID2057
Feature activation+1.91e-02

Pos logit contributions
 rider+0.012
John+0.010
working+0.009
 riding+0.009
lf+0.009
Neg logit contributions
 pods-0.024
 tad-0.024
 lav-0.024
 bou-0.024
mary-0.023
Token smelled
Token ID37609
Feature activation+1.49e-01

Pos logit contributions
 pods+0.043
 tad+0.043
 lav+0.043
 bou+0.042
mary+0.040
Neg logit contributions
 rider-0.025
lf-0.020
Brian-0.020
 riding-0.020
John-0.020
Token in
Token ID287
Feature activation+3.58e-02

Pos logit contributions
 tad+0.348
 pods+0.344
 lav+0.339
 bou+0.335
mary+0.322
Neg logit contributions
 rider-0.180
 riding-0.144
lf-0.142
John-0.139
working-0.138
Token the
Token ID262
Feature activation+1.13e-02

Pos logit contributions
 pods+0.083
 lav+0.082
 tad+0.081
 bou+0.079
mary+0.077
Neg logit contributions
 rider-0.046
Brian-0.038
Paul-0.036
John-0.036
lf-0.035
Token oven
Token ID14361
Feature activation+8.13e-02

Pos logit contributions
 tad+0.017
 pods+0.016
mary+0.016
flies+0.015
 bou+0.015
Neg logit contributions
 rider-0.023
 day-0.021
working-0.020
 days-0.020
lf-0.020
Token.
Token ID13
Feature activation+9.08e-03

Pos logit contributions
 tad+0.181
 pods+0.180
 lav+0.179
 bou+0.173
mary+0.167
Neg logit contributions
 rider-0.112
lf-0.090
 riding-0.089
Brian-0.088
John-0.087
Token
Token ID198
Feature activation-8.23e-03

Pos logit contributions
 pods+0.020
 tad+0.019
 lav+0.019
 bou+0.018
mary+0.018
Neg logit contributions
 rider-0.013
Brian-0.011
lf-0.011
working-0.010
Paul-0.010
Token it
Token ID340
Feature activation-6.40e-03

Pos logit contributions
 rider+0.039
John+0.033
working+0.032
 riding+0.032
His+0.031
Neg logit contributions
 pods-0.076
 tad-0.076
 bou-0.076
 lav-0.076
mary-0.072
Token looked
Token ID3114
Feature activation+5.37e-02

Pos logit contributions
 rider+0.009
lf+0.008
working+0.007
 riding+0.007
 rode+0.007
Neg logit contributions
 bou-0.014
 tad-0.014
 lav-0.014
 pods-0.014
 basil-0.013
Token,
Token ID11
Feature activation-5.93e-03

Pos logit contributions
 tad+0.123
 pods+0.121
 lav+0.121
 bou+0.116
mary+0.112
Neg logit contributions
 rider-0.069
 riding-0.056
working-0.054
lf-0.053
Brian-0.053
Token how
Token ID703
Feature activation-1.39e-02

Pos logit contributions
 rider+0.009
Brian+0.008
Paul+0.008
John+0.008
lf+0.007
Neg logit contributions
 pods-0.012
 lav-0.012
 tad-0.012
 bou-0.012
mary-0.011
Token it
Token ID340
Feature activation+1.28e-02

Pos logit contributions
 rider+0.018
John+0.014
Brian+0.014
 riding+0.014
lf+0.014
Neg logit contributions
 pods-0.032
 tad-0.032
 lav-0.032
 bou-0.031
mary-0.030
Token smelled
Token ID37609
Feature activation+1.49e-01

Pos logit contributions
 bou+0.027
 tad+0.026
 lav+0.025
 pods+0.025
mary+0.023
Neg logit contributions
 rider-0.020
lf-0.016
working-0.016
 rode-0.015
 riding-0.015
Token,
Token ID11
Feature activation+5.04e-03

Pos logit contributions
 tad+0.339
 pods+0.337
 lav+0.333
 bou+0.325
mary+0.314
Neg logit contributions
 rider-0.191
lf-0.152
 riding-0.152
John-0.151
Brian-0.148
Token and
Token ID290
Feature activation-1.01e-02

Pos logit contributions
 pods+0.011
 tad+0.011
 lav+0.011
 bou+0.011
mary+0.011
Neg logit contributions
 rider-0.007
 riding-0.005
lf-0.005
John-0.005
Brian-0.005
Token how
Token ID703
Feature activation-2.33e-02

Pos logit contributions
 rider+0.011
 riding+0.008
lf+0.008
John+0.008
Brian+0.008
Neg logit contributions
 pods-0.025
 tad-0.025
 lav-0.025
 bou-0.024
mary-0.024
Token it
Token ID340
Feature activation+7.56e-03

Pos logit contributions
 rider+0.032
Brian+0.027
lf+0.026
Paul+0.026
John+0.026
Neg logit contributions
 pods-0.051
 lav-0.051
 tad-0.050
 bou-0.049
mary-0.048
Token tasted
Token ID29187
Feature activation+1.37e-01

Pos logit contributions
 bou+0.015
 tad+0.015
 pods+0.015
 lav+0.015
mary+0.013
Neg logit contributions
 rider-0.012
lf-0.010
working-0.010
 rode-0.009
 riding-0.009
Token delicious
Token ID12625
Feature activation+6.71e-02

Pos logit contributions
 rider+0.022
John+0.019
Brian+0.019
 riding+0.018
Paul+0.018
Neg logit contributions
 pods-0.036
 lav-0.036
 tad-0.035
 bou-0.035
mary-0.034
Token everything
Token ID2279
Feature activation+3.80e-03

Pos logit contributions
 pods+0.168
 lav+0.166
 tad+0.166
 bou+0.161
mary+0.157
Neg logit contributions
 rider-0.073
Brian-0.057
John-0.055
lf-0.055
 riding-0.054
Token inside
Token ID2641
Feature activation+1.58e-03

Pos logit contributions
 lav+0.009
 tad+0.009
 pods+0.009
 bou+0.009
mary+0.008
Neg logit contributions
 rider-0.005
lf-0.004
John-0.004
 riding-0.004
Brian-0.004
Token the
Token ID262
Feature activation+3.48e-02

Pos logit contributions
 pods+0.004
 lav+0.004
 tad+0.004
 bou+0.004
mary+0.003
Neg logit contributions
 rider-0.002
lf-0.002
John-0.002
 riding-0.002
Brian-0.002
Token grill
Token ID29901
Feature activation+8.02e-02

Pos logit contributions
 pods+0.060
 tad+0.059
 lav+0.059
 bou+0.056
mary+0.056
Neg logit contributions
 rider-0.064
Brian-0.054
 riding-0.054
lf-0.054
John-0.053
Token smelled
Token ID37609
Feature activation+1.48e-01

Pos logit contributions
 lav+0.181
 tad+0.179
 pods+0.175
 bou+0.173
mary+0.166
Neg logit contributions
 rider-0.108
lf-0.092
 riding-0.088
John-0.086
Brian-0.083
Token!
Token ID0
Feature activation-1.08e-02

Pos logit contributions
 tad+0.342
 pods+0.339
 lav+0.337
 bou+0.323
mary+0.319
Neg logit contributions
 rider-0.179
 riding-0.147
John-0.144
lf-0.142
working-0.134
Token
Token ID198
Feature activation-1.73e-02

Pos logit contributions
 rider+0.015
Brian+0.013
John+0.012
lf+0.012
Paul+0.012
Neg logit contributions
 pods-0.023
 lav-0.023
 tad-0.023
 bou-0.022
mary-0.022
Token
Token ID198
Feature activation-1.41e-02

Pos logit contributions
 rider+0.028
working+0.023
lf+0.021
Brian+0.021
 riding+0.021
Neg logit contributions
 tad-0.035
 pods-0.034
 lav-0.034
 bou-0.033
mary-0.033
Token"
Token ID1
Feature activation-4.74e-03

Pos logit contributions
 rider+0.020
working+0.016
Brian+0.016
lf+0.015
 riding+0.015
Neg logit contributions
 pods-0.031
 tad-0.031
 lav-0.031
 bou-0.029
mary-0.029
TokenDo
Token ID5211
Feature activation+5.66e-02

Pos logit contributions
 rider+0.008
John+0.007
Brian+0.007
Paul+0.006
lf+0.006
Neg logit contributions
 pods-0.010
 lav-0.009
 tad-0.009
 bou-0.009
mary-0.009
Token on
Token ID319
Feature activation-5.42e-03

Pos logit contributions
 rider+0.013
lf+0.010
 riding+0.010
working+0.010
Brian+0.009
Neg logit contributions
 tad-0.031
 lav-0.030
 pods-0.030
 bou-0.029
mary-0.028
Token the
Token ID262
Feature activation-3.66e-03

Pos logit contributions
 rider+0.007
Brian+0.006
Paul+0.005
John+0.005
lf+0.005
Neg logit contributions
 pods-0.013
 lav-0.013
 tad-0.012
 bou-0.012
mary-0.012
Token table
Token ID3084
Feature activation+2.01e-02

Pos logit contributions
 rider+0.008
lf+0.007
 riding+0.007
John+0.007
working+0.007
Neg logit contributions
 tad-0.006
 lav-0.005
 pods-0.005
 bou-0.005
mary-0.005
Token and
Token ID290
Feature activation+1.22e-03

Pos logit contributions
 lav+0.045
 pods+0.045
 tad+0.045
 bou+0.043
mary+0.042
Neg logit contributions
 rider-0.027
lf-0.022
 riding-0.021
John-0.021
Brian-0.021
Token it
Token ID340
Feature activation-1.09e-02

Pos logit contributions
 tad+0.002
 pods+0.002
 lav+0.002
 bou+0.002
mary+0.002
Neg logit contributions
 rider-0.002
lf-0.002
John-0.002
 riding-0.002
working-0.001
Token smelled
Token ID37609
Feature activation+1.48e-01

Pos logit contributions
 rider+0.016
lf+0.015
working+0.013
John+0.013
His+0.013
Neg logit contributions
 tad-0.023
 bou-0.022
 lav-0.022
 pods-0.021
flies-0.020
Token so
Token ID523
Feature activation+7.58e-02

Pos logit contributions
 tad+0.259
 pods+0.256
flies+0.251
 lav+0.249
 bou+0.243
Neg logit contributions
 rider-0.251
 time-0.226
John-0.223
 his-0.222
 riding-0.220
Token good
Token ID922
Feature activation+2.82e-02

Pos logit contributions
 tad+0.134
 pods+0.132
 bou+0.130
 lav+0.129
mary+0.124
Neg logit contributions
 rider-0.133
lf-0.116
John-0.115
working-0.114
 riding-0.112
Token!
Token ID0
Feature activation-3.02e-03

Pos logit contributions
 pods+0.059
 lav+0.058
 tad+0.057
 bou+0.055
mary+0.054
Neg logit contributions
 rider-0.043
Brian-0.037
John-0.036
Paul-0.036
lf-0.035
Token The
Token ID383
Feature activation+3.01e-02

Pos logit contributions
 rider+0.005
Brian+0.004
lf+0.004
Paul+0.004
John+0.004
Neg logit contributions
 pods-0.006
 lav-0.006
 tad-0.006
 bou-0.006
mary-0.006
Token little
Token ID1310
Feature activation+5.98e-02

Pos logit contributions
 tad+0.063
 pods+0.063
 lav+0.062
 bou+0.061
mary+0.060
Neg logit contributions
 rider-0.047
lf-0.039
John-0.037
 riding-0.037
Brian-0.036
Token grill
Token ID29901
Feature activation+2.12e-02

Pos logit contributions
 pods+0.014
 lav+0.014
 tad+0.014
 bou+0.013
mary+0.013
Neg logit contributions
 rider-0.011
Brian-0.009
lf-0.009
John-0.009
 riding-0.009
Token and
Token ID290
Feature activation-3.66e-03

Pos logit contributions
 pods+0.047
 lav+0.047
 tad+0.047
 bou+0.045
mary+0.044
Neg logit contributions
 rider-0.029
Brian-0.023
lf-0.023
John-0.023
 riding-0.023
Token he
Token ID339
Feature activation-4.98e-02

Pos logit contributions
 rider+0.005
John+0.004
lf+0.004
 riding+0.004
Paul+0.004
Neg logit contributions
 lav-0.008
 pods-0.008
 bou-0.008
 tad-0.008
mary-0.008
Token thought
Token ID1807
Feature activation+2.47e-02

Pos logit contributions
 rider+0.074
Brian+0.062
John+0.062
lf+0.062
Paul+0.061
Neg logit contributions
 pods-0.105
 lav-0.104
 tad-0.102
 bou-0.100
mary-0.096
Token it
Token ID340
Feature activation+3.39e-02

Pos logit contributions
 pods+0.060
 lav+0.059
 tad+0.059
 bou+0.056
mary+0.055
Neg logit contributions
 rider-0.030
Brian-0.025
Paul-0.023
Thomas-0.022
John-0.022
Token smelled
Token ID37609
Feature activation+1.48e-01

Pos logit contributions
 pods+0.070
 lav+0.069
 tad+0.069
 bou+0.067
mary+0.064
Neg logit contributions
 rider-0.052
Brian-0.044
lf-0.044
John-0.043
Paul-0.043
Token so
Token ID523
Feature activation+7.61e-02

Pos logit contributions
 pods+0.309
 tad+0.308
 lav+0.305
 bou+0.298
mary+0.290
Neg logit contributions
 rider-0.209
John-0.176
 riding-0.173
lf-0.170
His-0.168
Token good
Token ID922
Feature activation+2.67e-02

Pos logit contributions
 bou+0.153
 tad+0.151
 pods+0.150
 lav+0.150
flies+0.142
Neg logit contributions
 rider-0.113
John-0.097
working-0.097
lf-0.094
 riding-0.093
Token!
Token ID0
Feature activation+9.04e-03

Pos logit contributions
 pods+0.052
 lav+0.050
 tad+0.050
 bou+0.048
mary+0.046
Neg logit contributions
 rider-0.045
Brian-0.040
Paul-0.039
John-0.038
Thomas-0.038
Token He
Token ID679
Feature activation-1.94e-02

Pos logit contributions
 lav+0.023
 pods+0.023
 tad+0.023
 bou+0.022
mary+0.022
Neg logit contributions
 rider-0.009
Brian-0.007
John-0.007
 riding-0.007
Paul-0.007
Token decided
Token ID3066
Feature activation+4.79e-02

Pos logit contributions
 rider+0.029
lf+0.024
John+0.024
Brian+0.024
 riding+0.024
Neg logit contributions
 pods-0.041
 lav-0.040
 tad-0.039
 bou-0.039
mary-0.037
Token and
Token ID290
Feature activation+1.25e-02

Pos logit contributions
 pods+0.071
 lav+0.070
 tad+0.070
 bou+0.067
mary+0.066
Neg logit contributions
 rider-0.044
Brian-0.037
John-0.036
lf-0.036
Paul-0.035
Token picked
Token ID6497
Feature activation+5.59e-02

Pos logit contributions
 lav+0.030
 pods+0.029
 tad+0.029
 bou+0.029
flies+0.027
Neg logit contributions
 rider-0.015
John-0.013
 riding-0.013
lf-0.012
Brian-0.012
Token up
Token ID510
Feature activation+2.54e-02

Pos logit contributions
 pods+0.130
 lav+0.128
 tad+0.127
flies+0.122
 scr+0.122
Neg logit contributions
 rider-0.060
 his-0.051
 riding-0.049
John-0.048
working-0.047
Token something
Token ID1223
Feature activation+3.68e-02

Pos logit contributions
comb+0.064
 pods+0.063
 lav+0.063
flies+0.062
 tad+0.062
Neg logit contributions
 his-0.022
 rider-0.017
 riding-0.016
 Tommy-0.016
 time-0.015
Token that
Token ID326
Feature activation+2.55e-02

Pos logit contributions
 tad+0.076
 pods+0.075
 lav+0.073
flies+0.072
 bou+0.071
Neg logit contributions
 rider-0.051
 riding-0.044
lf-0.043
John-0.042
 he-0.041
Token smelled
Token ID37609
Feature activation+1.48e-01

Pos logit contributions
 pods+0.052
 lav+0.052
 tad+0.051
 bou+0.050
mary+0.048
Neg logit contributions
 rider-0.039
Brian-0.032
lf-0.032
John-0.032
 riding-0.032
Token spicy
Token ID26880
Feature activation+5.91e-02

Pos logit contributions
 tad+0.313
flies+0.304
 pods+0.301
 lav+0.297
 bou+0.294
Neg logit contributions
 rider-0.185
 time-0.176
 riding-0.168
 his-0.164
 day-0.163
Token.
Token ID13
Feature activation+1.43e-02

Pos logit contributions
 tad+0.146
 bou+0.141
 lav+0.139
 pods+0.135
mary+0.134
Neg logit contributions
 rider-0.058
working-0.047
 riding-0.047
lf-0.046
 day-0.044
Token
Token ID198
Feature activation-5.19e-03

Pos logit contributions
 pods+0.029
 lav+0.029
 tad+0.029
 bou+0.028
mary+0.027
Neg logit contributions
 rider-0.022
Brian-0.019
John-0.019
 riding-0.018
Paul-0.018
Token
Token ID198
Feature activation-4.13e-03

Pos logit contributions
 rider+0.011
working+0.009
lf+0.009
 runner+0.009
 musician+0.008
Neg logit contributions
 tad-0.008
mary-0.008
 pods-0.007
 bou-0.007
 lav-0.007
TokenFinally
Token ID11158
Feature activation+1.44e-02

Pos logit contributions
 rider+0.006
lf+0.005
 riding+0.005
Brian+0.005
Thomas+0.005
Neg logit contributions
 tad-0.008
 pods-0.008
mary-0.008
 lav-0.008
 bou-0.008
Token took
Token ID1718
Feature activation+2.07e-02

Pos logit contributions
 tad+0.033
 pods+0.032
 lav+0.032
flies+0.032
 bou+0.031
Neg logit contributions
 rider-0.014
lf-0.014
 rode-0.013
John-0.012
 kicked-0.011
Token it
Token ID340
Feature activation-4.64e-03

Pos logit contributions
 pods+0.049
 tad+0.049
 lav+0.048
 bou+0.047
mary+0.046
Neg logit contributions
 rider-0.025
John-0.020
Brian-0.019
 riding-0.019
lf-0.019
Token out
Token ID503
Feature activation+5.64e-03

Pos logit contributions
 rider+0.005
 riding+0.004
lf+0.004
working+0.004
John+0.004
Neg logit contributions
 tad-0.012
 lav-0.012
 pods-0.011
 bou-0.011
mary-0.011
Token and
Token ID290
Feature activation+9.56e-03

Pos logit contributions
 tad+0.016
 pods+0.016
 lav+0.016
mary+0.015
 bou+0.015
Neg logit contributions
 rider-0.004
 riding-0.003
John-0.003
lf-0.003
 his-0.003
Token it
Token ID340
Feature activation-8.55e-03

Pos logit contributions
 pods+0.021
 tad+0.021
 lav+0.020
 bou+0.019
mary+0.019
Neg logit contributions
 rider-0.013
lf-0.011
John-0.011
 riding-0.011
working-0.010
Token smelled
Token ID37609
Feature activation+1.48e-01

Pos logit contributions
 rider+0.010
lf+0.009
working+0.008
John+0.008
 day+0.008
Neg logit contributions
 tad-0.020
 lav-0.019
 pods-0.019
 bou-0.019
flies-0.018
Token so
Token ID523
Feature activation+7.85e-02

Pos logit contributions
 tad+0.280
 pods+0.279
 lav+0.272
flies+0.265
 bou+0.263
Neg logit contributions
 rider-0.234
John-0.205
 riding-0.201
lf-0.200
His-0.195
Token good
Token ID922
Feature activation+3.06e-02

Pos logit contributions
 pods+0.163
 tad+0.163
 lav+0.159
 bou+0.159
mary+0.154
Neg logit contributions
 rider-0.109
lf-0.092
John-0.092
working-0.091
 riding-0.089
Token!
Token ID0
Feature activation-1.38e-03

Pos logit contributions
 pods+0.065
 tad+0.065
 lav+0.065
 bou+0.062
mary+0.061
Neg logit contributions
 rider-0.045
Brian-0.037
lf-0.036
John-0.036
 riding-0.036
Token They
Token ID1119
Feature activation+2.53e-02

Pos logit contributions
 rider+0.002
 riding+0.002
Brian+0.002
John+0.002
lf+0.002
Neg logit contributions
 tad-0.003
 pods-0.003
 lav-0.003
 bou-0.003
flies-0.003
Token let
Token ID1309
Feature activation+3.99e-02

Pos logit contributions
 tad+0.057
 pods+0.055
 lav+0.055
 bou+0.053
flies+0.053
Neg logit contributions
 rider-0.029
lf-0.029
 rode-0.028
John-0.026
working-0.025
Token of
Token ID286
Feature activation+3.79e-02

Pos logit contributions
 pods+0.001
 tad+0.001
 lav+0.001
 bou+0.001
mary+0.001
Neg logit contributions
 rider-0.000
 riding-0.000
lf-0.000
Brian-0.000
John-0.000
Token the
Token ID262
Feature activation+1.38e-03

Pos logit contributions
 lav+0.082
 pods+0.082
 tad+0.080
 bou+0.079
mary+0.075
Neg logit contributions
 rider-0.057
Brian-0.046
Paul-0.045
John-0.045
working-0.044
Token oven
Token ID14361
Feature activation+7.67e-02

Pos logit contributions
 tad+0.002
 pods+0.002
flies+0.002
 bou+0.002
mary+0.002
Neg logit contributions
 day-0.003
 rider-0.003
 time-0.003
working-0.003
lf-0.003
Token and
Token ID290
Feature activation+8.93e-03

Pos logit contributions
 tad+0.186
 lav+0.184
 pods+0.183
 bou+0.175
mary+0.171
Neg logit contributions
 rider-0.091
lf-0.070
 riding-0.070
working-0.066
John-0.065
Token it
Token ID340
Feature activation-6.98e-03

Pos logit contributions
 pods+0.019
 tad+0.019
 lav+0.019
 bou+0.018
mary+0.018
Neg logit contributions
 rider-0.013
lf-0.011
 riding-0.011
John-0.011
Brian-0.010
Token smelled
Token ID37609
Feature activation+1.47e-01

Pos logit contributions
 rider+0.008
lf+0.007
working+0.006
John+0.006
 riding+0.006
Neg logit contributions
 tad-0.017
 lav-0.016
 pods-0.016
 bou-0.016
 basil-0.015
Token so
Token ID523
Feature activation+7.90e-02

Pos logit contributions
 pods+0.296
 tad+0.296
 lav+0.288
flies+0.288
mary+0.283
Neg logit contributions
 rider-0.206
 time-0.184
John-0.183
 riding-0.178
lf-0.174
Token good
Token ID922
Feature activation+3.16e-02

Pos logit contributions
 pods+0.167
 tad+0.165
 bou+0.163
 lav+0.161
flies+0.159
Neg logit contributions
 rider-0.104
John-0.089
working-0.088
lf-0.087
 riding-0.085
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
 pods+0.069
 tad+0.069
 lav+0.068
 bou+0.066
mary+0.064
Neg logit contributions
 rider-0.045
 riding-0.036
lf-0.036
Brian-0.036
John-0.036
Token She
Token ID1375
Feature activation+2.61e-02

Pos logit contributions
 tad+0.026
 pods+0.025
 lav+0.025
 bou+0.025
mary+0.024
Neg logit contributions
 rider-0.020
 riding-0.017
Brian-0.017
working-0.017
John-0.016
Token put
Token ID1234
Feature activation-4.05e-02

Pos logit contributions
 pods+0.060
 tad+0.059
 lav+0.058
 bou+0.057
mary+0.054
Neg logit contributions
 rider-0.033
lf-0.029
 rode-0.026
 riding-0.026
John-0.025
Token pie
Token ID2508
Feature activation+6.14e-02

Pos logit contributions
 tad+0.050
 pods+0.047
flies+0.047
 lav+0.046
mary+0.046
Neg logit contributions
 rider-0.053
 day-0.048
Tom-0.046
lf-0.046
 cop-0.046
Token was
Token ID373
Feature activation+1.64e-02

Pos logit contributions
 tad+0.139
 lav+0.138
 pods+0.136
 bou+0.133
mary+0.127
Neg logit contributions
 rider-0.081
lf-0.073
 riding-0.061
John-0.060
working-0.059
Token apple
Token ID17180
Feature activation+5.54e-03

Pos logit contributions
 pods+0.031
 tad+0.030
 lav+0.030
 bou+0.029
mary+0.028
Neg logit contributions
 rider-0.028
lf-0.024
Brian-0.024
John-0.024
 riding-0.024
Token and
Token ID290
Feature activation+5.90e-03

Pos logit contributions
 tad+0.015
 lav+0.014
 pods+0.014
 bou+0.013
ount+0.013
Neg logit contributions
 rider-0.006
lf-0.005
 riding-0.004
 day-0.004
working-0.004
Token it
Token ID340
Feature activation-1.35e-02

Pos logit contributions
 tad+0.014
 pods+0.014
 lav+0.013
 bou+0.013
mary+0.013
Neg logit contributions
 rider-0.007
lf-0.006
John-0.006
 riding-0.006
Brian-0.005
Token smelled
Token ID37609
Feature activation+1.47e-01

Pos logit contributions
 rider+0.018
lf+0.016
working+0.013
 rode+0.013
 he+0.013
Neg logit contributions
 tad-0.031
 bou-0.030
 lav-0.029
 pods-0.029
ount-0.027
Token good
Token ID922
Feature activation+3.55e-02

Pos logit contributions
 tad+0.292
 pods+0.285
 lav+0.277
flies+0.274
 bou+0.270
Neg logit contributions
 rider-0.224
 time-0.189
 riding-0.189
John-0.187
lf-0.184
Token.
Token ID13
Feature activation+4.34e-03

Pos logit contributions
 tad+0.079
 pods+0.078
 lav+0.078
 bou+0.075
mary+0.074
Neg logit contributions
 rider-0.049
 riding-0.039
lf-0.038
Brian-0.038
working-0.038
Token Sara
Token ID24799
Feature activation-8.91e-04

Pos logit contributions
 pods+0.008
 tad+0.008
 bou+0.008
 lav+0.008
flies+0.008
Neg logit contributions
 rider-0.007
Tom-0.006
John-0.006
 riding-0.006
His-0.006
Token and
Token ID290
Feature activation-1.89e-02

Pos logit contributions
 rider+0.001
lf+0.001
 riding+0.001
John+0.001
Brian+0.001
Neg logit contributions
 pods-0.002
 lav-0.002
 tad-0.002
 bou-0.002
mary-0.002
Token Grand
Token ID5675
Feature activation-6.94e-02

Pos logit contributions
 rider+0.030
Brian+0.026
Paul+0.025
John+0.025
lf+0.025
Neg logit contributions
 pods-0.038
 lav-0.038
 tad-0.037
 bou-0.036
mary-0.035
Tokenaffle
Token ID30697
Feature activation+3.55e-02

Pos logit contributions
 pods+0.074
 lav+0.073
 tad+0.073
 bou+0.069
mary+0.067
Neg logit contributions
 rider-0.076
Brian-0.066
Paul-0.065
John-0.065
 riding-0.064
Token was
Token ID373
Feature activation+1.98e-02

Pos logit contributions
 lav+0.073
 bou+0.072
 tad+0.072
 pods+0.071
mary+0.067
Neg logit contributions
 rider-0.055
lf-0.050
working-0.044
John-0.044
 riding-0.043
Token ready
Token ID3492
Feature activation+2.47e-02

Pos logit contributions
 pods+0.036
 lav+0.036
 tad+0.035
 bou+0.034
mary+0.033
Neg logit contributions
 rider-0.035
Brian-0.031
Paul-0.030
John-0.030
lf-0.029
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
 tad+0.060
 lav+0.059
 pods+0.059
 bou+0.057
mary+0.056
Neg logit contributions
 rider-0.028
 riding-0.023
lf-0.022
John-0.022
working-0.021
Token it
Token ID340
Feature activation+1.39e-05

Pos logit contributions
 pods+0.042
 tad+0.042
 lav+0.041
 bou+0.041
mary+0.038
Neg logit contributions
 rider-0.028
lf-0.024
 riding-0.024
working-0.023
Brian-0.023
Token smelled
Token ID37609
Feature activation+1.47e-01

Pos logit contributions
 lav+0.000
 bou+0.000
 tad+0.000
 pods+0.000
flies+0.000
Neg logit contributions
 rider-0.000
lf-0.000
John-0.000
His-0.000
working-0.000
Token so
Token ID523
Feature activation+7.87e-02

Pos logit contributions
 tad+0.284
 pods+0.282
 lav+0.280
 bou+0.272
flies+0.266
Neg logit contributions
 rider-0.229
John-0.202
lf-0.197
 riding-0.193
His-0.192
Token good
Token ID922
Feature activation+3.13e-02

Pos logit contributions
 tad+0.160
 pods+0.159
 bou+0.159
 lav+0.156
flies+0.153
Neg logit contributions
 rider-0.110
John-0.095
lf-0.095
working-0.094
His-0.092
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
 pods+0.068
 tad+0.068
 lav+0.068
 bou+0.065
mary+0.063
Neg logit contributions
 rider-0.044
Brian-0.036
lf-0.036
 riding-0.036
John-0.035
Token Stacy
Token ID49523
Feature activation-2.20e-02

Pos logit contributions
 tad+0.033
 pods+0.032
 bou+0.032
 lav+0.030
flies+0.030
Neg logit contributions
 rider-0.030
Brian-0.027
Thomas-0.026
 riding-0.026
working-0.026
Token picked
Token ID6497
Feature activation+4.60e-02

Pos logit contributions
 rider+0.027
lf+0.025
John+0.023
 riding+0.022
working+0.022
Neg logit contributions
 pods-0.049
 tad-0.048
 lav-0.048
 bou-0.048
flies-0.045
Token cooked
Token ID15847
Feature activation+3.42e-02

Pos logit contributions
 pods+0.059
 bou+0.059
 tad+0.059
 lav+0.056
flies+0.052
Neg logit contributions
 rider-0.056
lf-0.053
working-0.049
John-0.049
 riding-0.048
Token the
Token ID262
Feature activation+1.47e-02

Pos logit contributions
 tad+0.085
 pods+0.083
 lav+0.081
 bou+0.080
flies+0.079
Neg logit contributions
 rider-0.029
lf-0.028
John-0.028
 his-0.026
 riding-0.025
Token mixture
Token ID11710
Feature activation+5.43e-02

Pos logit contributions
bugs+0.022
ocular+0.022
ount+0.021
flies+0.021
 bou+0.021
Neg logit contributions
 day-0.027
 time-0.026
 days-0.025
working-0.025
lf-0.023
Token until
Token ID1566
Feature activation+1.10e-02

Pos logit contributions
 tad+0.144
 lav+0.139
 bou+0.136
 pods+0.135
 chicks+0.132
Neg logit contributions
 day-0.036
 rider-0.033
 his-0.033
 he-0.031
working-0.031
Token it
Token ID340
Feature activation+2.36e-02

Pos logit contributions
 tad+0.032
 pods+0.031
 bou+0.031
 lav+0.031
flies+0.030
Neg logit contributions
 rider-0.006
lf-0.005
John-0.005
 riding-0.005
working-0.004
Token smelled
Token ID37609
Feature activation+1.47e-01

Pos logit contributions
 bou+0.056
 tad+0.055
 lav+0.055
 pods+0.054
 tul+0.051
Neg logit contributions
lf-0.022
 rider-0.022
 time-0.021
 day-0.020
His-0.019
Token y
Token ID331
Feature activation+1.03e-01

Pos logit contributions
 tad+0.311
 pods+0.306
flies+0.305
 lav+0.301
 bou+0.294
Neg logit contributions
 time-0.182
 rider-0.172
 his-0.160
John-0.158
 riding-0.154
Tokenummy
Token ID13513
Feature activation+6.88e-02

Pos logit contributions
 lav+0.220
 tad+0.219
 bou+0.214
 pods+0.209
mary+0.208
Neg logit contributions
 rider-0.122
working-0.107
John-0.107
dd-0.103
Tom-0.102
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
 tad+0.179
 lav+0.167
 bou+0.163
 pods+0.163
mary+0.160
Neg logit contributions
 rider-0.057
 time-0.051
lf-0.048
working-0.048
 days-0.048
Token
Token ID198
Feature activation-5.04e-03

Pos logit contributions
 pods+0.027
 tad+0.026
 lav+0.026
 bou+0.025
mary+0.024
Neg logit contributions
 rider-0.022
Brian-0.018
John-0.018
 riding-0.018
Paul-0.018
Token
Token ID198
Feature activation-4.29e-03

Pos logit contributions
 rider+0.009
working+0.008
lf+0.007
 racer+0.007
 runner+0.007
Neg logit contributions
 tad-0.009
 pods-0.009
 lav-0.009
mary-0.009
 bou-0.008
Token while
Token ID981
Feature activation-1.12e-02

Pos logit contributions
 pods+0.012
 lav+0.012
 tad+0.012
 bou+0.011
mary+0.011
Neg logit contributions
 rider-0.009
Brian-0.007
 riding-0.007
lf-0.007
working-0.007
Token.
Token ID13
Feature activation+4.66e-03

Pos logit contributions
 rider+0.016
Brian+0.014
lf+0.013
John+0.013
Paul+0.013
Neg logit contributions
 pods-0.024
 lav-0.024
 tad-0.024
 bou-0.023
mary-0.022
Token Soon
Token ID15894
Feature activation+1.24e-02

Pos logit contributions
 pods+0.011
 lav+0.011
 tad+0.011
 bou+0.010
 pulp+0.010
Neg logit contributions
 rider-0.006
Brian-0.005
John-0.005
 riding-0.005
Paul-0.005
Token,
Token ID11
Feature activation+1.39e-02

Pos logit contributions
 pods+0.033
 lav+0.032
 tad+0.032
 bou+0.031
mary+0.031
Neg logit contributions
 rider-0.012
 riding-0.009
Brian-0.009
John-0.008
Paul-0.008
Token the
Token ID262
Feature activation+6.50e-04

Pos logit contributions
 pods+0.036
 lav+0.036
mary+0.035
 tad+0.035
 bou+0.034
Neg logit contributions
 rider-0.012
 riding-0.010
John-0.009
working-0.009
lf-0.009
Token tad
Token ID36264
Feature activation+1.46e-01

Pos logit contributions
 pods+0.001
 lav+0.001
 bou+0.001
mary+0.001
 tad+0.001
Neg logit contributions
 rider-0.001
lf-0.001
 riding-0.001
working-0.001
 day-0.001
Tokenp
Token ID79
Feature activation+3.44e-03

Pos logit contributions
 pods+0.328
 lav+0.321
 tad+0.309
 bou+0.308
mary+0.299
Neg logit contributions
 rider-0.193
Brian-0.156
 riding-0.153
John-0.153
Paul-0.150
Tokenoles
Token ID4316
Feature activation+6.77e-02

Pos logit contributions
 pods+0.006
 lav+0.006
 bou+0.006
 tad+0.005
mary+0.005
Neg logit contributions
 rider-0.006
lf-0.006
John-0.005
 riding-0.005
Brian-0.005
Token became
Token ID2627
Feature activation-2.32e-03

Pos logit contributions
 pods+0.137
 lav+0.137
 tad+0.134
 bou+0.130
mary+0.125
Neg logit contributions
 rider-0.103
 riding-0.089
Brian-0.087
lf-0.085
John-0.084
Token frogs
Token ID37475
Feature activation+1.03e-02

Pos logit contributions
 rider+0.003
 riding+0.002
Brian+0.002
Paul+0.002
John+0.002
Neg logit contributions
 pods-0.005
 lav-0.005
 tad-0.005
 bou-0.005
mary-0.005
Token and
Token ID290
Feature activation-1.01e-02

Pos logit contributions
 pods+0.024
 lav+0.024
 tad+0.024
 bou+0.023
mary+0.022
Neg logit contributions
 rider-0.013
 riding-0.011
Brian-0.010
John-0.010
lf-0.009
Token,
Token ID11
Feature activation-4.44e-03

Pos logit contributions
 pods+0.103
 lav+0.102
 tad+0.102
 bou+0.098
mary+0.096
Neg logit contributions
 rider-0.056
Brian-0.045
John-0.044
lf-0.044
 riding-0.043
Token but
Token ID475
Feature activation-3.41e-02

Pos logit contributions
 rider+0.008
Brian+0.006
John+0.006
lf+0.006
 riding+0.006
Neg logit contributions
 pods-0.008
 lav-0.008
 tad-0.008
 bou-0.008
mary-0.008
Token he
Token ID339
Feature activation-4.85e-02

Pos logit contributions
Paul+0.048
working+0.046
 rider+0.045
John+0.045
rob+0.045
Neg logit contributions
 pods-0.061
 berries-0.058
 sp-0.058
 honey-0.056
 smells-0.056
Token thought
Token ID1807
Feature activation+2.51e-02

Pos logit contributions
 rider+0.084
Brian+0.074
Paul+0.072
 riding+0.071
John+0.071
Neg logit contributions
 tad-0.089
 pods-0.089
 lav-0.087
mary-0.083
 bou-0.081
Token it
Token ID340
Feature activation+3.66e-02

Pos logit contributions
 pods+0.053
 lav+0.051
 basil+0.048
 cups+0.048
 herbs+0.048
Neg logit contributions
 rider-0.038
Paul-0.033
Brian-0.033
Thomas-0.031
John-0.030
Token smelled
Token ID37609
Feature activation+1.45e-01

Pos logit contributions
 pods+0.073
 tad+0.073
 lav+0.073
 bou+0.071
mary+0.067
Neg logit contributions
 rider-0.058
Brian-0.049
lf-0.048
John-0.047
Paul-0.047
Token good
Token ID922
Feature activation+3.47e-02

Pos logit contributions
 pods+0.307
 tad+0.307
 lav+0.303
 bou+0.298
mary+0.288
Neg logit contributions
 rider-0.208
 riding-0.170
John-0.169
lf-0.168
Brian-0.166
Token.
Token ID13
Feature activation+1.54e-02

Pos logit contributions
 pods+0.072
 lav+0.071
 tad+0.071
 bou+0.069
mary+0.067
Neg logit contributions
 rider-0.053
Brian-0.045
John-0.044
lf-0.044
Paul-0.043
Token Max
Token ID5436
Feature activation-3.17e-02

Pos logit contributions
 pods+0.029
 smells+0.026
 tad+0.025
 berries+0.025
 cups+0.025
Neg logit contributions
 rider-0.025
dd-0.023
Paul-0.023
John-0.023
working-0.022
Token jumped
Token ID11687
Feature activation+3.84e-02

Pos logit contributions
 rider+0.042
lf+0.035
John+0.034
Brian+0.034
His+0.033
Neg logit contributions
 lav-0.071
 pods-0.070
 tad-0.069
 bou-0.069
mary-0.064
Token into
Token ID656
Feature activation+8.17e-03

Pos logit contributions
 lav+0.092
 pods+0.091
 tad+0.091
 bou+0.089
mary+0.085
Neg logit contributions
 rider-0.045
 riding-0.036
Brian-0.036
John-0.034
lf-0.034
Token mom
Token ID1995
Feature activation+5.47e-03

Pos logit contributions
 tad+0.007
 pods+0.007
 lav+0.007
 bou+0.007
mary+0.007
Neg logit contributions
 rider-0.005
 riding-0.004
lf-0.004
Brian-0.003
working-0.003
Token baked
Token ID22979
Feature activation+5.83e-02

Pos logit contributions
 pods+0.011
 lav+0.011
 tad+0.011
 bou+0.011
mary+0.011
Neg logit contributions
 rider-0.008
Brian-0.007
Paul-0.007
John-0.007
lf-0.007
Token some
Token ID617
Feature activation+5.10e-02

Pos logit contributions
 pods+0.122
 tad+0.121
 lav+0.120
 bou+0.116
mary+0.113
Neg logit contributions
 rider-0.087
John-0.073
Brian-0.072
lf-0.072
 riding-0.072
Token cookies
Token ID14746
Feature activation+4.62e-02

Pos logit contributions
 tad+0.083
 pods+0.081
flies+0.079
 lav+0.075
 bou+0.073
Neg logit contributions
 rider-0.095
lf-0.084
 riding-0.084
working-0.084
 day-0.081
Token that
Token ID326
Feature activation+8.85e-03

Pos logit contributions
 tad+0.115
 pods+0.111
 lav+0.111
 bou+0.106
 pulp+0.103
Neg logit contributions
 rider-0.050
lf-0.042
 riding-0.040
working-0.039
John-0.038
Token smelled
Token ID37609
Feature activation+1.45e-01

Pos logit contributions
 tad+0.019
 lav+0.019
 pods+0.018
 bou+0.018
mary+0.017
Neg logit contributions
 rider-0.013
lf-0.011
John-0.011
working-0.010
 riding-0.010
Token so
Token ID523
Feature activation+7.34e-02

Pos logit contributions
 tad+0.283
flies+0.270
 pods+0.267
 lav+0.260
 bou+0.256
Neg logit contributions
 rider-0.216
 time-0.206
lf-0.197
John-0.192
 day-0.191
Token good
Token ID922
Feature activation+2.76e-02

Pos logit contributions
 tad+0.133
 pods+0.128
 bou+0.127
flies+0.125
 lav+0.125
Neg logit contributions
 rider-0.121
lf-0.110
working-0.109
John-0.105
 day-0.101
Token,
Token ID11
Feature activation+8.93e-03

Pos logit contributions
 pods+0.059
 lav+0.058
 tad+0.058
 bou+0.056
mary+0.054
Neg logit contributions
 rider-0.040
Brian-0.034
Paul-0.033
John-0.033
lf-0.033
Token Tim
Token ID5045
Feature activation+9.57e-03

Pos logit contributions
 lav+0.021
 pods+0.021
 tad+0.021
 bou+0.020
mary+0.020
Neg logit contributions
 rider-0.011
 riding-0.009
John-0.009
lf-0.009
His-0.008
Tokenmy
Token ID1820
Feature activation-2.30e-02

Pos logit contributions
 pods+0.019
 lav+0.019
 tad+0.019
 bou+0.018
mary+0.018
Neg logit contributions
 rider-0.015
Brian-0.013
Paul-0.012
John-0.012
lf-0.012
Token him
Token ID683
Feature activation-1.99e-02

Pos logit contributions
 pods+0.044
 lav+0.043
 tad+0.043
 bou+0.042
mary+0.041
Neg logit contributions
 rider-0.022
Brian-0.018
 riding-0.017
John-0.017
lf-0.017
Token.
Token ID13
Feature activation+5.42e-03

Pos logit contributions
 rider+0.024
 riding+0.020
Brian+0.019
lf+0.019
John+0.018
Neg logit contributions
 lav-0.047
 pods-0.046
 tad-0.046
 bou-0.045
mary-0.043
Token But
Token ID887
Feature activation-2.92e-02

Pos logit contributions
 lav+0.013
 pods+0.013
 bou+0.012
 tad+0.012
mary+0.012
Neg logit contributions
 rider-0.007
Brian-0.006
John-0.006
Paul-0.005
 riding-0.005
Token to
Token ID284
Feature activation-8.40e-03

Pos logit contributions
 rider+0.039
Brian+0.033
Paul+0.031
John+0.031
lf+0.031
Neg logit contributions
 tad-0.066
 pods-0.065
 lav-0.064
 bou-0.061
mary-0.060
Token no
Token ID645
Feature activation-2.57e-02

Pos logit contributions
 rider+0.012
Brian+0.010
Paul+0.010
John+0.010
lf+0.010
Neg logit contributions
 tad-0.018
 lav-0.018
 pods-0.017
 bou-0.017
 scr-0.016
Token avail
Token ID29107
Feature activation-1.50e-01

Pos logit contributions
 rider+0.037
Paul+0.034
Brian+0.034
John+0.034
Thomas+0.033
Neg logit contributions
 tad-0.054
 pods-0.053
 lav-0.052
 bou-0.052
 spr-0.050
Token,
Token ID11
Feature activation-1.60e-02

Pos logit contributions
 rider+0.220
Brian+0.185
John+0.179
lf+0.178
Paul+0.177
Neg logit contributions
 pods-0.315
 tad-0.311
 lav-0.311
 bou-0.300
mary-0.294
Token they
Token ID484
Feature activation-4.62e-04

Pos logit contributions
 rider+0.018
Brian+0.014
John+0.014
lf+0.014
 riding+0.014
Neg logit contributions
 pods-0.039
 lav-0.039
 tad-0.039
 bou-0.038
mary-0.037
Token were
Token ID547
Feature activation+8.27e-03

Pos logit contributions
 rider+0.001
Brian+0.001
lf+0.001
John+0.001
 riding+0.001
Neg logit contributions
 lav-0.001
 pods-0.001
 tad-0.001
 bou-0.001
 basil-0.001
Token too
Token ID1165
Feature activation-1.51e-02

Pos logit contributions
 pods+0.017
 lav+0.016
 tad+0.016
 bou+0.016
mary+0.015
Neg logit contributions
 rider-0.013
Brian-0.011
John-0.011
lf-0.011
 riding-0.011
Token scared
Token ID12008
Feature activation+4.99e-03

Pos logit contributions
 rider+0.023
Brian+0.019
John+0.019
 riding+0.019
Paul+0.019
Neg logit contributions
 pods-0.031
 lav-0.031
 tad-0.031
 bou-0.030
mary-0.029
Token out
Token ID503
Feature activation+1.23e-02

Pos logit contributions
 rider+0.025
Brian+0.022
lf+0.022
John+0.021
Paul+0.021
Neg logit contributions
 pods-0.030
 tad-0.030
 lav-0.029
 bou-0.029
mary-0.028
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
 pods+0.030
 tad+0.030
 lav+0.029
 bou+0.029
mary+0.028
Neg logit contributions
 rider-0.014
Brian-0.011
John-0.011
lf-0.011
Paul-0.011
Token but
Token ID475
Feature activation-3.27e-02

Pos logit contributions
 pods+0.026
 lav+0.025
 tad+0.025
 bou+0.024
mary+0.024
Neg logit contributions
 rider-0.022
Brian-0.019
John-0.018
lf-0.018
 riding-0.018
Token to
Token ID284
Feature activation-5.68e-03

Pos logit contributions
 rider+0.042
Brian+0.035
Paul+0.033
Thomas+0.033
working+0.033
Neg logit contributions
 tad-0.075
 pods-0.074
 lav-0.070
 spr-0.067
 scr-0.067
Token no
Token ID645
Feature activation-2.05e-02

Pos logit contributions
 rider+0.009
Brian+0.007
Paul+0.007
lf+0.007
John+0.007
Neg logit contributions
 tad-0.011
 lav-0.011
 pods-0.011
 bou-0.011
 scr-0.011
Token avail
Token ID29107
Feature activation-1.46e-01

Pos logit contributions
 rider+0.027
Brian+0.025
Paul+0.025
lf+0.025
John+0.024
Neg logit contributions
 tad-0.044
 lav-0.043
 pods-0.043
 bou-0.043
 spr-0.042
Token.
Token ID13
Feature activation+1.01e-03

Pos logit contributions
 rider+0.247
Brian+0.215
Thomas+0.206
John+0.205
lf+0.205
Neg logit contributions
 pods-0.272
 tad-0.269
 lav-0.266
 bou-0.256
mary-0.253
Token He
Token ID679
Feature activation-2.00e-02

Pos logit contributions
 pods+0.002
 lav+0.002
 tad+0.002
 bou+0.002
mary+0.002
Neg logit contributions
 rider-0.001
Brian-0.001
John-0.001
Paul-0.001
 riding-0.001
Token felt
Token ID2936
Feature activation-1.78e-02

Pos logit contributions
 rider+0.033
Brian+0.029
John+0.028
Paul+0.028
 riding+0.028
Neg logit contributions
 pods-0.039
 tad-0.039
 lav-0.038
 bou-0.036
mary-0.036
Token sad
Token ID6507
Feature activation+2.44e-02

Pos logit contributions
 rider+0.029
Brian+0.025
lf+0.025
John+0.025
Paul+0.024
Neg logit contributions
 pods-0.035
 tad-0.034
 lav-0.034
 bou-0.033
mary-0.032
Token and
Token ID290
Feature activation+3.81e-03

Pos logit contributions
 pods+0.051
 lav+0.050
 tad+0.050
 bou+0.048
mary+0.046
Neg logit contributions
 rider-0.037
Brian-0.032
John-0.031
lf-0.031
Paul-0.031
Token times
Token ID1661
Feature activation-4.14e-02

Pos logit contributions
 rider+0.033
Brian+0.029
Paul+0.029
John+0.029
Thomas+0.028
Neg logit contributions
 lav-0.047
 pods-0.046
 tad-0.046
 bou-0.045
mary-0.043
Token,
Token ID11
Feature activation+9.18e-03

Pos logit contributions
 rider+0.060
Brian+0.050
John+0.049
lf+0.049
Thomas+0.049
Neg logit contributions
 pods-0.087
 lav-0.086
 tad-0.085
 bou-0.084
mary-0.081
Token but
Token ID475
Feature activation-3.24e-02

Pos logit contributions
 pods+0.019
 lav+0.018
 tad+0.018
 bou+0.018
mary+0.017
Neg logit contributions
 rider-0.014
 riding-0.012
Brian-0.012
John-0.012
lf-0.012
Token to
Token ID284
Feature activation-8.77e-03

Pos logit contributions
 rider+0.044
Brian+0.036
John+0.035
lf+0.035
 riding+0.035
Neg logit contributions
 pods-0.073
 lav-0.072
 tad-0.072
 bou-0.070
mary-0.068
Token no
Token ID645
Feature activation-1.97e-02

Pos logit contributions
 rider+0.016
Brian+0.014
John+0.013
Paul+0.013
lf+0.013
Neg logit contributions
 lav-0.016
 tad-0.015
 pods-0.015
 bou-0.014
 scr-0.014
Token avail
Token ID29107
Feature activation-1.46e-01

Pos logit contributions
 rider+0.028
Paul+0.027
John+0.026
Brian+0.026
Thomas+0.025
Neg logit contributions
 lav-0.041
 bou-0.040
 pods-0.040
 spr-0.039
 tad-0.039
Token.
Token ID13
Feature activation-9.78e-04

Pos logit contributions
 rider+0.224
Brian+0.190
John+0.184
lf+0.184
Paul+0.183
Neg logit contributions
 pods-0.299
 lav-0.295
 tad-0.295
 bou-0.285
mary-0.278
Token
Token ID220
Feature activation+1.20e-02

Pos logit contributions
John+0.001
Paul+0.001
James+0.001
Brian+0.001
 riding+0.001
Neg logit contributions
 lav-0.002
 pods-0.002
 tad-0.002
 bou-0.002
mary-0.002
Token
Token ID628
Feature activation-4.47e-03

Pos logit contributions
 pods+0.029
 lav+0.028
 tad+0.028
 bou+0.027
mary+0.027
Neg logit contributions
 rider-0.014
Brian-0.012
John-0.011
lf-0.011
Paul-0.011
Token Jane
Token ID12091
Feature activation+1.65e-02

Pos logit contributions
 rider+0.004
John+0.004
His+0.004
lf+0.003
Paul+0.003
Neg logit contributions
 lav-0.011
 pods-0.011
 bou-0.011
 tad-0.011
mary-0.011
Token's
Token ID338
Feature activation-1.03e-02

Pos logit contributions
 pods+0.030
 tad+0.029
 lav+0.029
 bou+0.028
mary+0.028
Neg logit contributions
 rider-0.029
Brian-0.025
John-0.025
Paul-0.025
lf-0.024
Token deep
Token ID2769
Feature activation-2.24e-02

Pos logit contributions
 rider+0.049
Brian+0.043
John+0.042
Paul+0.042
lf+0.041
Neg logit contributions
 pods-0.049
 lav-0.048
 tad-0.048
 bou-0.046
mary-0.044
Token paws
Token ID49193
Feature activation-2.46e-02

Pos logit contributions
 rider+0.047
John+0.043
Brian+0.043
Paul+0.042
Thomas+0.042
Neg logit contributions
 pods-0.034
 lav-0.033
 bou-0.033
 tad-0.032
 spr-0.030
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
 rider+0.035
Brian+0.029
John+0.028
lf+0.028
Paul+0.028
Neg logit contributions
 pods-0.054
 tad-0.053
 lav-0.052
 bou-0.051
mary-0.050
Token to
Token ID284
Feature activation+3.18e-03

Pos logit contributions
 pods+0.026
 tad+0.026
 lav+0.026
 bou+0.025
mary+0.024
Neg logit contributions
 rider-0.021
Brian-0.018
lf-0.018
John-0.018
Paul-0.017
Token no
Token ID645
Feature activation-2.06e-02

Pos logit contributions
 tad+0.005
 lav+0.005
 pods+0.005
 bou+0.005
 spr+0.005
Neg logit contributions
 rider-0.006
Brian-0.005
Thomas-0.005
John-0.005
Paul-0.005
Token avail
Token ID29107
Feature activation-1.43e-01

Pos logit contributions
 rider+0.031
John+0.029
Paul+0.029
Brian+0.029
Thomas+0.028
Neg logit contributions
 bou-0.040
 pods-0.040
 lav-0.040
 tad-0.039
 spr-0.039
Token.
Token ID13
Feature activation-6.06e-03

Pos logit contributions
 rider+0.257
Brian+0.224
Thomas+0.217
lf+0.213
John+0.211
Neg logit contributions
 pods-0.247
 tad-0.242
 lav-0.238
 bou-0.232
mary-0.231
Token He
Token ID679
Feature activation-2.38e-02

Pos logit contributions
 rider+0.009
Brian+0.007
lf+0.007
John+0.007
 riding+0.007
Neg logit contributions
 pods-0.013
 tad-0.013
 lav-0.013
 bou-0.012
mary-0.012
Token finally
Token ID3443
Feature activation-3.06e-02

Pos logit contributions
 rider+0.038
Brian+0.033
John+0.032
lf+0.032
Paul+0.032
Neg logit contributions
 pods-0.047
 lav-0.046
 tad-0.046
 bou-0.045
mary-0.043
Token thought
Token ID1807
Feature activation+1.23e-02

Pos logit contributions
 rider+0.050
Brian+0.043
John+0.041
Paul+0.041
 riding+0.041
Neg logit contributions
 pods-0.060
 tad-0.060
 lav-0.059
 bou-0.057
mary-0.056
Token of
Token ID286
Feature activation+2.50e-02

Pos logit contributions
 pods+0.026
 lav+0.026
 tad+0.026
 bou+0.025
 spr+0.024
Neg logit contributions
 rider-0.017
Brian-0.014
Paul-0.014
Thomas-0.013
John-0.013
Token and
Token ID290
Feature activation-4.38e-03

Pos logit contributions
Rob+0.045
Thomas+0.045
dd+0.045
John+0.044
 rider+0.044
Neg logit contributions
 pods-0.042
 bou-0.041
 lav-0.041
 scr-0.041
 tad-0.039
Token he
Token ID339
Feature activation-3.78e-02

Pos logit contributions
 rider+0.006
Brian+0.006
Thomas+0.005
John+0.005
Paul+0.005
Neg logit contributions
 pods-0.009
 tad-0.009
 lav-0.009
 bou-0.008
mary-0.008
Token was
Token ID373
Feature activation-1.21e-02

Pos logit contributions
 rider+0.060
Brian+0.052
John+0.051
Paul+0.051
lf+0.050
Neg logit contributions
 pods-0.074
 tad-0.073
 lav-0.073
 bou-0.070
mary-0.069
Token su
Token ID424
Feature activation-2.06e-02

Pos logit contributions
 rider+0.021
Brian+0.018
lf+0.018
John+0.018
Paul+0.018
Neg logit contributions
 lav-0.022
 tad-0.022
 pods-0.022
 bou-0.021
mary-0.020
Tokenpr
Token ID1050
Feature activation+2.24e-04

Pos logit contributions
 rider+0.038
Brian+0.033
John+0.033
lf+0.033
Paul+0.033
Neg logit contributions
 pods-0.036
 lav-0.035
 tad-0.035
 bou-0.034
mary-0.033
Tokenised
Token ID1417
Feature activation-1.36e-01

Pos logit contributions
 tad+0.000
 pods+0.000
 lav+0.000
 scr+0.000
 bou+0.000
Neg logit contributions
 rider-0.000
Brian-0.000
 riding-0.000
John-0.000
lf-0.000
Token at
Token ID379
Feature activation-2.67e-02

Pos logit contributions
 rider+0.222
Bow+0.190
Thomas+0.190
Brian+0.190
Rob+0.188
Neg logit contributions
 pods-0.240
 tad-0.240
 lav-0.236
 smells-0.221
flies-0.221
Token how
Token ID703
Feature activation-1.84e-02

Pos logit contributions
 rider+0.044
Rob+0.036
Thomas+0.036
John+0.035
Brian+0.035
Neg logit contributions
 tad-0.049
 pods-0.049
 lav-0.049
 bou-0.046
 herbs-0.044
Token quickly
Token ID2952
Feature activation+2.40e-02

Pos logit contributions
 rider+0.025
Brian+0.020
lf+0.020
Paul+0.019
John+0.019
Neg logit contributions
 pods-0.041
 tad-0.041
 lav-0.041
 bou-0.039
mary-0.038
Token he
Token ID339
Feature activation-1.14e-02

Pos logit contributions
 pods+0.052
 lav+0.051
 tad+0.051
 bou+0.049
 spr+0.048
Neg logit contributions
 rider-0.031
Brian-0.030
Paul-0.028
Thomas-0.027
John-0.027
Token started
Token ID2067
Feature activation-6.76e-03

Pos logit contributions
 rider+0.019
Brian+0.016
John+0.016
Paul+0.016
lf+0.016
Neg logit contributions
 pods-0.021
 tad-0.021
 lav-0.021
 bou-0.020
mary-0.020
Token to
Token ID284
Feature activation-2.05e-02

Pos logit contributions
 rider+0.032
Brian+0.027
lf+0.026
John+0.026
Paul+0.026
Neg logit contributions
 pods-0.055
 lav-0.054
 tad-0.054
 bou-0.052
mary-0.051
Token use
Token ID779
Feature activation-5.61e-02

Pos logit contributions
 rider+0.037
Brian+0.033
John+0.032
Paul+0.032
lf+0.032
Neg logit contributions
 lav-0.036
 pods-0.035
 tad-0.035
 bou-0.034
mary-0.033
Token it
Token ID340
Feature activation-2.74e-02

Pos logit contributions
 rider+0.078
Brian+0.063
lf+0.061
John+0.061
Paul+0.059
Neg logit contributions
 lav-0.128
 pods-0.127
 tad-0.122
 bou-0.121
mary-0.117
Token with
Token ID351
Feature activation+1.35e-02

Pos logit contributions
 rider+0.040
Brian+0.034
John+0.033
Paul+0.033
lf+0.033
Neg logit contributions
 pods-0.058
 lav-0.057
 tad-0.056
 bou-0.055
mary-0.054
Token adult
Token ID4044
Feature activation-3.67e-02

Pos logit contributions
 pods+0.030
 lav+0.030
 tad+0.029
 bou+0.028
mary+0.027
Neg logit contributions
 rider-0.019
Brian-0.016
lf-0.016
John-0.015
Paul-0.015
Token supervision
Token ID20865
Feature activation-1.35e-01

Pos logit contributions
 rider+0.057
Brian+0.049
lf+0.048
John+0.047
Paul+0.047
Neg logit contributions
 pods-0.075
 lav-0.074
 tad-0.073
 bou-0.071
mary-0.069
Token,"
Token ID553
Feature activation-5.21e-03

Pos logit contributions
 rider+0.231
Brian+0.206
Paul+0.205
lf+0.204
John+0.198
Neg logit contributions
 pods-0.250
 lav-0.230
 tad-0.228
mary-0.228
 bou-0.227
Token Mr
Token ID1770
Feature activation+1.33e-02

Pos logit contributions
 rider+0.005
 riding+0.004
John+0.004
Brian+0.004
lf+0.004
Neg logit contributions
 pods-0.014
 tad-0.013
 lav-0.013
 bou-0.013
mary-0.013
Token.
Token ID13
Feature activation+9.00e-03

Pos logit contributions
 pods+0.029
 lav+0.028
 tad+0.028
 bou+0.028
mary+0.026
Neg logit contributions
 rider-0.020
Brian-0.016
John-0.016
Paul-0.016
 riding-0.016
Token Lee
Token ID5741
Feature activation-2.76e-03

Pos logit contributions
 pods+0.017
 lav+0.016
 tad+0.016
 bou+0.016
mary+0.015
Neg logit contributions
 rider-0.016
Brian-0.014
John-0.014
lf-0.014
Paul-0.014
Token said
Token ID531
Feature activation-1.78e-02

Pos logit contributions
 rider+0.004
Brian+0.003
lf+0.003
John+0.003
Paul+0.003
Neg logit contributions
 pods-0.006
 lav-0.006
 tad-0.006
 bou-0.006
mary-0.006
Token,
Token ID11
Feature activation-2.16e-02

Pos logit contributions
 pods+0.004
 lav+0.004
 tad+0.004
 bou+0.004
mary+0.004
Neg logit contributions
 rider-0.002
Brian-0.002
Paul-0.001
John-0.001
lf-0.001
Token but
Token ID475
Feature activation-5.16e-02

Pos logit contributions
 rider+0.029
John+0.024
 riding+0.024
lf+0.024
Tom+0.023
Neg logit contributions
 pods-0.049
mary-0.047
 tad-0.046
 lav-0.046
 bou-0.045
Token only
Token ID691
Feature activation-5.87e-02

Pos logit contributions
 rider+0.085
Brian+0.073
John+0.071
Paul+0.071
lf+0.071
Neg logit contributions
 lav-0.102
 pods-0.101
 tad-0.100
 bou-0.096
mary-0.093
Token with
Token ID351
Feature activation-1.66e-02

Pos logit contributions
 rider+0.099
Brian+0.088
John+0.086
Paul+0.086
lf+0.085
Neg logit contributions
 lav-0.112
 pods-0.111
 tad-0.109
 bou-0.105
mary-0.101
Token our
Token ID674
Feature activation-5.90e-03

Pos logit contributions
 rider+0.025
Brian+0.021
 riding+0.021
lf+0.021
John+0.021
Neg logit contributions
 pods-0.034
 tad-0.034
 lav-0.034
 bou-0.033
mary-0.032
Token supervision
Token ID20865
Feature activation-1.32e-01

Pos logit contributions
 rider+0.010
Brian+0.008
 riding+0.008
lf+0.008
John+0.008
Neg logit contributions
 pods-0.011
 tad-0.011
 lav-0.011
 bou-0.011
mary-0.010
Token.
Token ID13
Feature activation-7.25e-03

Pos logit contributions
 rider+0.196
Brian+0.167
lf+0.165
Paul+0.163
John+0.161
Neg logit contributions
 pods-0.274
 lav-0.266
 tad-0.265
 bou-0.259
mary-0.252
Token And
Token ID843
Feature activation+1.08e-02

Pos logit contributions
 rider+0.011
Brian+0.010
 riding+0.009
Thomas+0.009
John+0.009
Neg logit contributions
 pods-0.015
 bou-0.015
 tad-0.015
 lav-0.014
mary-0.014
Token maybe
Token ID3863
Feature activation-1.39e-02

Pos logit contributions
 pods+0.022
 tad+0.021
 lav+0.021
 bou+0.021
mary+0.020
Neg logit contributions
 rider-0.017
 riding-0.014
Brian-0.014
John-0.014
lf-0.014
Token you
Token ID345
Feature activation+1.39e-02

Pos logit contributions
 rider+0.025
Brian+0.022
Paul+0.021
John+0.021
Thomas+0.021
Neg logit contributions
 lav-0.026
 pods-0.025
 tad-0.024
 bou-0.024
 spr-0.024
Token can
Token ID460
Feature activation+7.79e-03

Pos logit contributions
 pods+0.026
 lav+0.026
 bou+0.026
 tad+0.026
mary+0.023
Neg logit contributions
 rider-0.025
 riding-0.020
working-0.020
Brian-0.020
lf-0.019
Token said
Token ID531
Feature activation-2.05e-03

Pos logit contributions
 rider+0.012
Brian+0.010
John+0.010
Paul+0.010
lf+0.010
Neg logit contributions
 pods-0.014
 lav-0.014
 tad-0.014
 bou-0.013
mary-0.013
Token,
Token ID11
Feature activation+1.96e-02

Pos logit contributions
 rider+0.003
Brian+0.002
Paul+0.002
lf+0.002
John+0.002
Neg logit contributions
 lav-0.005
 pods-0.005
 tad-0.005
 bou-0.004
mary-0.004
Token "
Token ID366
Feature activation+7.71e-03

Pos logit contributions
 pods+0.038
 tad+0.038
 lav+0.038
 bou+0.036
mary+0.035
Neg logit contributions
 rider-0.032
Brian-0.027
 riding-0.027
John-0.027
lf-0.027
TokenI
Token ID40
Feature activation+1.50e-02

Pos logit contributions
 tad+0.013
 lav+0.013
 pods+0.012
mary+0.011
 basil+0.011
Neg logit contributions
 rider-0.014
lf-0.013
 riding-0.013
working-0.012
Brian-0.012
Token like
Token ID588
Feature activation+3.74e-03

Pos logit contributions
 pods+0.030
 lav+0.028
 tad+0.028
 bou+0.027
 pulp+0.027
Neg logit contributions
 rider-0.025
working-0.022
 riding-0.021
Tom-0.020
John-0.020
Token Tues
Token ID48496
Feature activation-1.29e-01

Pos logit contributions
 pods+0.009
 tad+0.009
 lav+0.008
 bou+0.008
mary+0.008
Neg logit contributions
 rider-0.005
 riding-0.004
lf-0.004
John-0.004
Brian-0.004
Tokendays
Token ID12545
Feature activation+4.94e-02

Pos logit contributions
 rider+0.349
Brian+0.311
working+0.304
 cop+0.302
Bow+0.301
Neg logit contributions
 lav-0.109
 pods-0.105
 tad-0.104
 bou-0.097
mary-0.093
Token too
Token ID1165
Feature activation-1.72e-02

Pos logit contributions
 pods+0.097
 lav+0.095
 tad+0.092
 bou+0.092
mary+0.090
Neg logit contributions
 rider-0.076
Brian-0.069
Paul-0.066
lf-0.066
John-0.065
Token!
Token ID0
Feature activation-4.94e-03

Pos logit contributions
 rider+0.026
Brian+0.022
lf+0.022
John+0.022
Paul+0.022
Neg logit contributions
 pods-0.035
 lav-0.035
 tad-0.035
 bou-0.034
mary-0.033
Token It
Token ID632
Feature activation+8.30e-03

Pos logit contributions
 rider+0.008
Brian+0.007
lf+0.007
John+0.007
Paul+0.007
Neg logit contributions
 pods-0.010
 lav-0.010
 tad-0.010
 bou-0.009
mary-0.009
Token's
Token ID338
Feature activation+2.50e-02

Pos logit contributions
 pods+0.019
 bou+0.018
 tad+0.018
 lav+0.018
 pulp+0.018
Neg logit contributions
 rider-0.012
 riding-0.009
Brian-0.008
 day-0.008
working-0.008
Token a
Token ID257
Feature activation-2.58e-02

Pos logit contributions
 rider+0.059
Brian+0.050
lf+0.050
Paul+0.048
John+0.048
Neg logit contributions
 lav-0.073
 tad-0.073
 pods-0.073
 bou-0.070
mary-0.068
Token little
Token ID1310
Feature activation+6.79e-03

Pos logit contributions
 rider+0.041
Brian+0.033
lf+0.033
 riding+0.032
John+0.032
Neg logit contributions
 pods-0.053
 tad-0.052
 lav-0.052
 bou-0.050
mary-0.049
Token girl
Token ID2576
Feature activation-3.16e-02

Pos logit contributions
 pods+0.013
 lav+0.013
 tad+0.013
 bou+0.013
mary+0.012
Neg logit contributions
 rider-0.011
Brian-0.009
 riding-0.009
working-0.009
John-0.009
Token who
Token ID508
Feature activation-5.29e-02

Pos logit contributions
 rider+0.051
Brian+0.044
John+0.043
lf+0.043
Paul+0.042
Neg logit contributions
 pods-0.062
 lav-0.062
 tad-0.061
 bou-0.059
mary-0.058
Token loved
Token ID6151
Feature activation-2.21e-02

Pos logit contributions
 rider+0.058
Brian+0.043
 riding+0.043
working+0.042
John+0.042
Neg logit contributions
 pods-0.134
 tad-0.132
 lav-0.131
 bou-0.129
mary-0.123
Token Tues
Token ID48496
Feature activation-1.28e-01

Pos logit contributions
 rider+0.024
 riding+0.021
lf+0.019
John+0.018
working+0.018
Neg logit contributions
 tad-0.055
 pods-0.054
 lav-0.053
 bou-0.052
mary-0.052
Tokendays
Token ID12545
Feature activation+5.27e-02

Pos logit contributions
 rider+0.355
Brian+0.313
 cop+0.309
Bow+0.307
working+0.305
Neg logit contributions
 pods-0.097
 lav-0.095
 tad-0.094
mary-0.087
 bou-0.084
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
 pods+0.101
 lav+0.099
 tad+0.096
 bou+0.095
mary+0.094
Neg logit contributions
 rider-0.085
Brian-0.076
John-0.073
lf-0.073
Paul-0.072
Token Every
Token ID3887
Feature activation-5.23e-03

Pos logit contributions
 rider+0.019
 riding+0.017
John+0.016
 day+0.016
Paul+0.015
Neg logit contributions
 tad-0.040
 pods-0.039
 lav-0.038
 pulp-0.038
flies-0.037
Token Tuesday
Token ID3431
Feature activation+3.27e-02

Pos logit contributions
 rider+0.005
Brian+0.005
Paul+0.004
lf+0.004
John+0.004
Neg logit contributions
 pods-0.013
 lav-0.013
 tad-0.013
mary-0.013
 bou-0.013
Token,
Token ID11
Feature activation-3.74e-03

Pos logit contributions
 pods+0.068
 lav+0.067
mary+0.064
 tad+0.064
 bou+0.064
Neg logit contributions
 rider-0.049
Brian-0.044
lf-0.041
Paul-0.041
John-0.040
Token little
Token ID1310
Feature activation+6.39e-02

Pos logit contributions
 pods+0.049
 lav+0.049
 tad+0.048
 bou+0.046
mary+0.045
Neg logit contributions
 rider-0.052
Brian-0.045
John-0.044
lf-0.044
Paul-0.044
Token girl
Token ID2576
Feature activation+2.13e-02

Pos logit contributions
 pods+0.115
 lav+0.112
 tad+0.111
 bou+0.107
mary+0.105
Neg logit contributions
 rider-0.116
Brian-0.098
 riding-0.097
John-0.097
lf-0.096
Token was
Token ID373
Feature activation-1.64e-02

Pos logit contributions
 pods+0.043
 lav+0.043
 tad+0.042
 bou+0.041
mary+0.039
Neg logit contributions
 rider-0.033
lf-0.028
Brian-0.028
John-0.028
 riding-0.027
Token su
Token ID424
Feature activation-1.51e-02

Pos logit contributions
 rider+0.029
Brian+0.026
lf+0.025
John+0.025
Paul+0.025
Neg logit contributions
 pods-0.029
 tad-0.029
 lav-0.029
 bou-0.028
mary-0.027
Tokenpr
Token ID1050
Feature activation+2.27e-03

Pos logit contributions
 rider+0.028
Brian+0.025
John+0.025
Paul+0.024
lf+0.024
Neg logit contributions
 pods-0.027
 lav-0.026
 tad-0.025
 bou-0.024
mary-0.024
Tokenised
Token ID1417
Feature activation-1.28e-01

Pos logit contributions
 pods+0.005
 tad+0.005
 lav+0.005
 bou+0.005
mary+0.005
Neg logit contributions
 rider-0.003
Brian-0.002
lf-0.002
John-0.002
 riding-0.002
Token.
Token ID13
Feature activation-2.37e-03

Pos logit contributions
 rider+0.185
Brian+0.153
lf+0.149
Paul+0.147
John+0.146
Neg logit contributions
 pods-0.274
 lav-0.272
 tad-0.272
 bou-0.262
mary-0.256
Token She
Token ID1375
Feature activation+2.27e-02

Pos logit contributions
 rider+0.004
 riding+0.003
Brian+0.003
Paul+0.003
working+0.003
Neg logit contributions
 pods-0.005
 tad-0.005
 lav-0.005
 bou-0.004
 bowls-0.004
Token had
Token ID550
Feature activation-3.07e-02

Pos logit contributions
 pods+0.045
 lav+0.044
 tad+0.044
 bou+0.042
mary+0.041
Neg logit contributions
 rider-0.037
Brian-0.032
John-0.031
lf-0.031
Paul-0.031
Token never
Token ID1239
Feature activation-4.57e-02

Pos logit contributions
 rider+0.042
Brian+0.035
 riding+0.034
John+0.034
Paul+0.034
Neg logit contributions
 pods-0.068
 lav-0.067
 tad-0.067
 bou-0.065
mary-0.063
Token heard
Token ID2982
Feature activation+2.49e-02

Pos logit contributions
 rider+0.064
Brian+0.055
lf+0.053
Paul+0.053
Thomas+0.052
Neg logit contributions
 tad-0.099
 pods-0.098
 lav-0.097
 bou-0.094
mary-0.092
Token clown
Token ID25573
Feature activation-1.15e-02

Pos logit contributions
 rider+0.010
Brian+0.008
lf+0.008
Paul+0.008
John+0.008
Neg logit contributions
 pods-0.012
 tad-0.012
 lav-0.012
 bou-0.012
mary-0.011
Tokens
Token ID82
Feature activation+3.59e-03

Pos logit contributions
 rider+0.022
Brian+0.019
 riding+0.019
John+0.019
lf+0.019
Neg logit contributions
 pods-0.020
 lav-0.020
 tad-0.019
 bou-0.019
mary-0.018
Token juggling
Token ID48097
Feature activation+2.29e-02

Pos logit contributions
 pods+0.008
 tad+0.008
 lav+0.008
 bou+0.008
mary+0.008
Neg logit contributions
 rider-0.005
Brian-0.004
John-0.004
lf-0.004
Paul-0.004
Token and
Token ID290
Feature activation+2.89e-03

Pos logit contributions
 tad+0.043
 pods+0.042
 lav+0.040
 bou+0.040
 scr+0.038
Neg logit contributions
 rider-0.040
Paul-0.035
Brian-0.035
John-0.035
Thomas-0.035
Token ac
Token ID936
Feature activation-6.35e-02

Pos logit contributions
 pods+0.006
 tad+0.006
 lav+0.005
 bou+0.005
mary+0.005
Neg logit contributions
 rider-0.005
Brian-0.004
John-0.004
Paul-0.004
lf-0.004
Tokenrob
Token ID22609
Feature activation-1.28e-01

Pos logit contributions
 rider+0.073
 mechanic+0.072
 musician+0.067
 runner+0.064
working+0.064
Neg logit contributions
 pods-0.133
mary-0.130
 tad-0.128
 lav-0.122
 bou-0.122
Tokenats
Token ID1381
Feature activation-3.49e-02

Pos logit contributions
 rider+0.244
Brian+0.215
Thomas+0.213
Paul+0.209
John+0.205
Neg logit contributions
 pods-0.202
 bou-0.198
 tad-0.197
 lav-0.191
mary-0.185
Token doing
Token ID1804
Feature activation-8.95e-03

Pos logit contributions
 rider+0.052
Brian+0.044
John+0.043
lf+0.043
Paul+0.042
Neg logit contributions
 pods-0.073
 tad-0.071
 lav-0.071
 bou-0.069
mary-0.068
Token flips
Token ID45971
Feature activation+1.74e-02

Pos logit contributions
 rider+0.013
 riding+0.010
Brian+0.010
John+0.010
lf+0.010
Neg logit contributions
 pods-0.020
 lav-0.019
 tad-0.019
 bou-0.018
mary-0.018
Token.
Token ID13
Feature activation-5.86e-03

Pos logit contributions
 pods+0.036
 tad+0.035
 lav+0.035
 bou+0.034
mary+0.034
Neg logit contributions
 rider-0.026
Brian-0.022
lf-0.022
John-0.021
Paul-0.021
Token But
Token ID887
Feature activation-3.33e-02

Pos logit contributions
 rider+0.008
Brian+0.007
John+0.007
Paul+0.007
 riding+0.007
Neg logit contributions
 pods-0.013
 lav-0.012
 tad-0.012
 bou-0.012
mary-0.012
Token You
Token ID921
Feature activation+1.18e-02

Pos logit contributions
 rider+0.055
lf+0.050
Brian+0.047
Paul+0.046
working+0.046
Neg logit contributions
 lav-0.049
 tad-0.048
 pods-0.048
 basil-0.043
mary-0.043
Token don
Token ID836
Feature activation-3.56e-02

Pos logit contributions
 pods+0.020
 lav+0.020
 tad+0.020
 bou+0.019
mary+0.019
Neg logit contributions
 rider-0.022
Brian-0.019
 riding-0.019
John-0.019
lf-0.019
Token't
Token ID470
Feature activation-2.92e-02

Pos logit contributions
 rider+0.045
Brian+0.034
working+0.034
Paul+0.034
John+0.034
Neg logit contributions
 pods-0.081
 lav-0.079
 tad-0.079
mary-0.077
 bou-0.075
Token need
Token ID761
Feature activation-4.53e-02

Pos logit contributions
 rider+0.055
lf+0.052
Brian+0.051
Paul+0.051
Thomas+0.050
Neg logit contributions
 lav-0.046
 pods-0.045
 tad-0.045
mary-0.042
 spr-0.041
Token to
Token ID284
Feature activation-1.96e-02

Pos logit contributions
 rider+0.074
lf+0.063
Brian+0.060
Paul+0.059
John+0.058
Neg logit contributions
 lav-0.094
 pods-0.092
 tad-0.090
 herbs-0.086
 bou-0.086
Token shave
Token ID34494
Feature activation-1.27e-01

Pos logit contributions
 rider+0.037
Brian+0.033
lf+0.032
Paul+0.032
John+0.032
Neg logit contributions
 lav-0.033
 pods-0.032
 tad-0.031
 bou-0.030
mary-0.030
Token your
Token ID534
Feature activation-1.50e-02

Pos logit contributions
 rider+0.187
Brian+0.158
John+0.154
Paul+0.154
lf+0.153
Neg logit contributions
 pods-0.269
 lav-0.265
 tad-0.260
 bou-0.251
mary-0.247
Token face
Token ID1986
Feature activation+6.39e-04

Pos logit contributions
 rider+0.032
Brian+0.029
John+0.029
Paul+0.029
lf+0.029
Neg logit contributions
 pods-0.021
 lav-0.021
 tad-0.021
 bou-0.020
 scr-0.018
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
 tad+0.001
 lav+0.001
 pods+0.001
 bou+0.001
mary+0.001
Neg logit contributions
 rider-0.001
 riding-0.001
lf-0.001
working-0.001
Brian-0.001
Token You
Token ID921
Feature activation+1.50e-02

Pos logit contributions
 rider+0.037
lf+0.032
Brian+0.032
John+0.031
Paul+0.031
Neg logit contributions
 lav-0.037
 pods-0.037
 tad-0.037
 bou-0.034
mary-0.034
Token are
Token ID389
Feature activation+1.39e-02

Pos logit contributions
 pods+0.025
 lav+0.024
 tad+0.024
 bou+0.023
mary+0.022
Neg logit contributions
 rider-0.030
Brian-0.026
John-0.025
 riding-0.025
lf-0.025
Token with
Token ID351
Feature activation+1.39e-02

Pos logit contributions
 rider+0.055
Brian+0.047
Paul+0.045
John+0.044
lf+0.044
Neg logit contributions
 pods-0.073
 lav-0.071
 tad-0.071
 bou-0.069
mary-0.066
Token excitement
Token ID14067
Feature activation-5.25e-02

Pos logit contributions
 pods+0.033
 tad+0.032
 lav+0.032
 bou+0.031
 bowls+0.030
Neg logit contributions
 rider-0.017
Brian-0.014
working-0.014
lf-0.013
Paul-0.013
Token.
Token ID13
Feature activation-3.25e-03

Pos logit contributions
 rider+0.077
Brian+0.064
lf+0.063
John+0.063
 riding+0.062
Neg logit contributions
 pods-0.112
 lav-0.111
 tad-0.110
 bou-0.106
mary-0.103
Token He
Token ID679
Feature activation-1.68e-02

Pos logit contributions
 rider+0.005
lf+0.004
Brian+0.004
working+0.004
Paul+0.004
Neg logit contributions
 pods-0.007
 tad-0.006
 lav-0.006
 bou-0.006
mary-0.006
Token loved
Token ID6151
Feature activation-1.37e-02

Pos logit contributions
 rider+0.028
Brian+0.024
John+0.024
Paul+0.023
 riding+0.023
Neg logit contributions
 pods-0.032
 lav-0.032
 tad-0.032
 bou-0.030
mary-0.030
Token Tues
Token ID48496
Feature activation-1.26e-01

Pos logit contributions
 rider+0.020
Brian+0.017
Paul+0.016
lf+0.016
John+0.016
Neg logit contributions
 lav-0.029
 pods-0.029
 tad-0.028
 bou-0.027
mary-0.026
Tokendays
Token ID12545
Feature activation+5.51e-02

Pos logit contributions
 rider+0.363
 cop+0.324
Brian+0.323
Bow+0.320
working+0.319
Neg logit contributions
 pods-0.075
 tad-0.074
 lav-0.070
bugs-0.066
 bou-0.063
Token so
Token ID523
Feature activation+2.02e-02

Pos logit contributions
 pods+0.092
 lav+0.088
 scr+0.087
mary+0.087
 tad+0.086
Neg logit contributions
Brian-0.087
 rider-0.084
Bow-0.081
Paul-0.078
lf-0.078
Token much
Token ID881
Feature activation+2.89e-03

Pos logit contributions
 scr+0.028
 pods+0.027
 tad+0.027
 lav+0.026
 bou+0.025
Neg logit contributions
 rider-0.038
Brian-0.036
bench-0.034
Ryan-0.034
Thomas-0.033
Token because
Token ID780
Feature activation-1.54e-02

Pos logit contributions
 lav+0.006
 pods+0.005
 tad+0.005
 smells+0.005
 herbs+0.005
Neg logit contributions
 rider-0.004
Brian-0.004
Paul-0.004
Thomas-0.004
John-0.004
Token they
Token ID484
Feature activation-4.86e-03

Pos logit contributions
Brian+0.019
 rider+0.018
Ryan+0.017
Bow+0.017
gh+0.017
Neg logit contributions
 pods-0.032
 lav-0.030
 cups-0.030
 tad-0.030
 spr-0.030
Token and
Token ID290
Feature activation-2.66e-02

Pos logit contributions
 rider+0.028
Brian+0.024
John+0.023
lf+0.023
Paul+0.023
Neg logit contributions
 pods-0.039
 lav-0.039
 tad-0.038
 bou-0.037
mary-0.037
Token T
Token ID309
Feature activation-2.52e-02

Pos logit contributions
 rider+0.051
John+0.045
Brian+0.044
lf+0.044
Paul+0.043
Neg logit contributions
 lav-0.044
 pods-0.044
 tad-0.043
 bou-0.041
mary-0.040
Tokenidy
Token ID19325
Feature activation-5.72e-02

Pos logit contributions
 rider+0.043
Brian+0.036
lf+0.036
John+0.036
Paul+0.035
Neg logit contributions
 lav-0.047
 pods-0.047
 tad-0.046
 bou-0.045
mary-0.043
Token were
Token ID547
Feature activation+3.89e-03

Pos logit contributions
 rider+0.085
Brian+0.069
 riding+0.069
lf+0.068
working+0.067
Neg logit contributions
 tad-0.122
 lav-0.121
 pods-0.121
 bou-0.114
 basil-0.111
Token in
Token ID287
Feature activation-2.98e-02

Pos logit contributions
 pods+0.009
 lav+0.009
 tad+0.009
 bou+0.009
mary+0.008
Neg logit contributions
 rider-0.005
Brian-0.004
John-0.004
lf-0.004
Paul-0.004
Tokense
Token ID325
Feature activation-1.26e-01

Pos logit contributions
 rider+0.045
lf+0.038
John+0.037
Brian+0.037
Paul+0.036
Neg logit contributions
 lav-0.062
 tad-0.061
 pods-0.061
 bou-0.060
mary-0.056
Tokenper
Token ID525
Feature activation-8.47e-02

Pos logit contributions
 rider+0.220
Brian+0.190
lf+0.188
John+0.187
Paul+0.184
Neg logit contributions
 pods-0.231
 tad-0.230
 lav-0.227
 bou-0.220
mary-0.214
Tokenable
Token ID540
Feature activation-5.53e-02

Pos logit contributions
 rider+0.113
Brian+0.092
 riding+0.091
working+0.089
Paul+0.088
Neg logit contributions
 pods-0.193
 lav-0.192
 tad-0.186
 bou-0.181
mary-0.175
Token.
Token ID13
Feature activation-3.57e-03

Pos logit contributions
 rider+0.087
John+0.074
Brian+0.074
lf+0.073
Paul+0.072
Neg logit contributions
 pods-0.111
 tad-0.110
 lav-0.109
 bou-0.106
mary-0.101
Token They
Token ID1119
Feature activation+1.60e-02

Pos logit contributions
 rider+0.006
Brian+0.006
Paul+0.006
 riding+0.005
John+0.005
Neg logit contributions
 pods-0.007
 tad-0.006
 lav-0.006
 bou-0.006
mary-0.006
Token would
Token ID561
Feature activation-5.47e-03

Pos logit contributions
 pods+0.035
 tad+0.034
 lav+0.034
 bou+0.033
mary+0.032
Neg logit contributions
 rider-0.023
Brian-0.019
lf-0.019
John-0.018
Paul-0.018
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
 pods+0.059
 tad+0.059
 lav+0.058
 bou+0.056
mary+0.054
Neg logit contributions
 rider-0.041
lf-0.034
Brian-0.034
John-0.033
Paul-0.033
Token he
Token ID339
Feature activation-3.43e-02

Pos logit contributions
 pods+0.027
 tad+0.027
 lav+0.027
 bou+0.025
mary+0.025
Neg logit contributions
 rider-0.014
Brian-0.011
Paul-0.011
lf-0.011
John-0.011
Token was
Token ID373
Feature activation-8.10e-03

Pos logit contributions
 rider+0.049
Brian+0.040
John+0.039
lf+0.039
Paul+0.039
Neg logit contributions
 pods-0.075
 tad-0.074
 lav-0.074
 bou-0.071
mary-0.070
Token su
Token ID424
Feature activation-1.60e-02

Pos logit contributions
 rider+0.015
Brian+0.013
lf+0.013
working+0.013
Paul+0.013
Neg logit contributions
 tad-0.014
 lav-0.014
 pods-0.014
 bou-0.013
mary-0.013
Tokenpr
Token ID1050
Feature activation+7.38e-03

Pos logit contributions
 rider+0.027
Brian+0.024
John+0.024
Paul+0.024
lf+0.023
Neg logit contributions
 pods-0.030
 lav-0.030
 tad-0.029
 bou-0.028
mary-0.027
Tokenised
Token ID1417
Feature activation-1.26e-01

Pos logit contributions
 pods+0.014
 lav+0.014
 tad+0.014
 bou+0.013
mary+0.013
Neg logit contributions
 rider-0.012
Brian-0.011
John-0.010
lf-0.010
Paul-0.010
Token to
Token ID284
Feature activation+4.02e-03

Pos logit contributions
 rider+0.206
Brian+0.173
lf+0.169
Thomas+0.165
Paul+0.165
Neg logit contributions
 tad-0.242
 pods-0.238
 lav-0.233
 bou-0.222
mary-0.222
Token see
Token ID766
Feature activation-5.00e-03

Pos logit contributions
 pods+0.008
 tad+0.008
 lav+0.008
 bou+0.007
mary+0.007
Neg logit contributions
 rider-0.007
Brian-0.006
lf-0.006
John-0.006
Paul-0.006
Token a
Token ID257
Feature activation+2.12e-02

Pos logit contributions
 rider+0.007
Brian+0.006
lf+0.006
John+0.006
Paul+0.006
Neg logit contributions
 pods-0.011
 lav-0.011
 tad-0.011
 bou-0.010
mary-0.010
Token gray
Token ID12768
Feature activation-1.44e-02

Pos logit contributions
 pods+0.044
 lav+0.041
 tad+0.041
mary+0.040
ount+0.040
Neg logit contributions
 rider-0.032
lf-0.027
Brian-0.026
 cop-0.025
 musician-0.025
Token bird
Token ID6512
Feature activation+3.07e-02

Pos logit contributions
 rider+0.029
lf+0.026
 riding+0.025
working+0.024
Brian+0.024
Neg logit contributions
 pods-0.022
 lav-0.021
 tad-0.021
 bou-0.020
mary-0.020
Token holding
Token ID4769
Feature activation-3.96e-02

Pos logit contributions
 rider+0.037
Brian+0.031
John+0.030
Paul+0.030
lf+0.029
Neg logit contributions
 pods-0.064
 lav-0.064
 tad-0.063
 bou-0.062
mary-0.060
Token things
Token ID1243
Feature activation-1.94e-02

Pos logit contributions
 rider+0.054
Brian+0.044
 riding+0.043
John+0.043
lf+0.042
Neg logit contributions
 pods-0.088
 lav-0.087
 tad-0.087
 bou-0.085
mary-0.082
Token.
Token ID13
Feature activation-1.99e-03

Pos logit contributions
 rider+0.029
Brian+0.025
John+0.024
lf+0.024
Paul+0.024
Neg logit contributions
 pods-0.041
 lav-0.040
 tad-0.039
 bou-0.038
mary-0.037
Token He
Token ID679
Feature activation-1.85e-02

Pos logit contributions
 rider+0.002
John+0.002
Brian+0.002
 riding+0.002
Paul+0.002
Neg logit contributions
 pods-0.005
 lav-0.005
 tad-0.005
 bou-0.005
mary-0.005
Token pract
Token ID1970
Feature activation-7.28e-02

Pos logit contributions
 rider+0.029
Brian+0.025
John+0.024
lf+0.024
Paul+0.024
Neg logit contributions
 pods-0.038
 lav-0.037
 tad-0.037
 bou-0.036
mary-0.035
Tokenised
Token ID1417
Feature activation-1.25e-01

Pos logit contributions
 rider+0.142
 cop+0.136
 mechanic+0.131
unt+0.126
 lawyer+0.125
Neg logit contributions
ishes-0.100
 scr-0.089
 tad-0.089
 pods-0.082
 bou-0.078
Token every
Token ID790
Feature activation-3.84e-02

Pos logit contributions
 rider+0.215
John+0.165
lf+0.164
Brian+0.164
Rob+0.164
Neg logit contributions
 lav-0.233
 pods-0.230
 tad-0.227
mary-0.220
 smells-0.219
Token day
Token ID1110
Feature activation-2.62e-02

Pos logit contributions
 rider+0.035
Paul+0.035
Brian+0.035
John+0.031
Thomas+0.030
Neg logit contributions
 pods-0.096
 lav-0.096
 tad-0.094
 bou-0.092
 spr-0.091
Token,
Token ID11
Feature activation-4.24e-04

Pos logit contributions
 rider+0.047
Brian+0.044
lf+0.044
John+0.043
Thomas+0.043
Neg logit contributions
 tad-0.042
 pods-0.041
 lav-0.041
mary-0.039
 scr-0.038
Token often
Token ID1690
Feature activation-1.17e-02

Pos logit contributions
 rider+0.001
Brian+0.000
lf+0.000
John+0.000
Paul+0.000
Neg logit contributions
 pods-0.001
 tad-0.001
 lav-0.001
 bou-0.001
mary-0.001
Token with
Token ID351
Feature activation-1.91e-03

Pos logit contributions
 rider+0.016
Brian+0.015
John+0.015
Paul+0.014
Thomas+0.013
Neg logit contributions
 lav-0.025
 pods-0.024
 tad-0.023
 bou-0.023
 spr-0.023
Token cool
Token ID3608
Feature activation+2.21e-02

Pos logit contributions
 pods+0.100
 lav+0.099
 tad+0.097
 bou+0.096
mary+0.092
Neg logit contributions
 rider-0.074
John-0.062
lf-0.062
Brian-0.061
 riding-0.061
Token!
Token ID0
Feature activation+8.42e-03

Pos logit contributions
 pods+0.046
 tad+0.045
 lav+0.045
 bou+0.043
mary+0.042
Neg logit contributions
 rider-0.034
Brian-0.029
John-0.028
Paul-0.028
lf-0.028
Token Sammy
Token ID44067
Feature activation-2.35e-02

Pos logit contributions
 pods+0.018
 tad+0.018
 lav+0.018
 bou+0.017
mary+0.017
Neg logit contributions
 rider-0.012
Brian-0.010
lf-0.010
John-0.010
Paul-0.010
Token was
Token ID373
Feature activation-2.39e-02

Pos logit contributions
 rider+0.038
Brian+0.032
John+0.031
lf+0.031
Paul+0.031
Neg logit contributions
 pods-0.047
 lav-0.046
 tad-0.046
 bou-0.044
mary-0.043
Token mesmer
Token ID46814
Feature activation-1.35e-02

Pos logit contributions
 rider+0.041
Brian+0.036
 riding+0.035
John+0.035
Paul+0.035
Neg logit contributions
 pods-0.045
 lav-0.044
 tad-0.043
 bou-0.042
mary-0.041
Tokenised
Token ID1417
Feature activation-1.24e-01

Pos logit contributions
 rider+0.023
Brian+0.020
lf+0.019
Thomas+0.018
Paul+0.018
Neg logit contributions
 tad-0.026
 pods-0.026
 lav-0.025
 bou-0.025
 spr-0.024
Token by
Token ID416
Feature activation-1.30e-02

Pos logit contributions
 rider+0.170
Brian+0.140
John+0.135
lf+0.135
Paul+0.135
Neg logit contributions
 pods-0.279
 lav-0.275
 tad-0.275
 bou-0.265
mary-0.258
Token its
Token ID663
Feature activation+1.03e-02

Pos logit contributions
 rider+0.012
 riding+0.011
lf+0.010
John+0.010
Paul+0.010
Neg logit contributions
 pods-0.033
 lav-0.033
 tad-0.032
mary-0.032
 bou-0.032
Token beauty
Token ID8737
Feature activation+1.16e-02

Pos logit contributions
 lav+0.023
 pods+0.022
 pulp+0.021
mary+0.021
 bou+0.021
Neg logit contributions
 rider-0.014
 riding-0.012
 day-0.012
working-0.011
 his-0.011
Token.
Token ID13
Feature activation-4.43e-03

Pos logit contributions
 pods+0.024
 lav+0.024
 tad+0.023
 bou+0.023
mary+0.022
Neg logit contributions
 rider-0.018
Brian-0.015
Paul-0.015
John-0.015
lf-0.014
Token His
Token ID2399
Feature activation-1.14e-02

Pos logit contributions
 rider+0.006
lf+0.005
Brian+0.005
John+0.005
Paul+0.005
Neg logit contributions
 pods-0.010
 tad-0.010
 lav-0.010
 bou-0.009
mary-0.009
Token horn
Token ID12718
Feature activation-4.15e-02

Pos logit contributions
 rider+0.028
lf+0.023
 riding+0.023
Brian+0.023
John+0.022
Neg logit contributions
 pods-0.028
 lav-0.027
 tad-0.027
mary-0.026
 bou-0.026
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
 rider+0.057
Brian+0.047
lf+0.046
 riding+0.046
John+0.046
Neg logit contributions
 pods-0.092
 lav-0.092
 tad-0.091
 bou-0.088
mary-0.086
Token He
Token ID679
Feature activation-1.34e-02

Pos logit contributions
 pods+0.032
 lav+0.031
 tad+0.031
 bou+0.031
mary+0.029
Neg logit contributions
 rider-0.019
Brian-0.016
John-0.015
 riding-0.015
lf-0.015
Token was
Token ID373
Feature activation-1.84e-02

Pos logit contributions
 rider+0.024
Brian+0.020
working+0.020
 riding+0.019
John+0.019
Neg logit contributions
 tad-0.025
 pods-0.024
 lav-0.024
mary-0.023
 bou-0.023
Token mesmer
Token ID46814
Feature activation-1.01e-02

Pos logit contributions
 rider+0.035
Brian+0.030
lf+0.030
John+0.030
Paul+0.030
Neg logit contributions
 tad-0.031
 pods-0.031
 lav-0.031
 bou-0.029
mary-0.029
Tokenised
Token ID1417
Feature activation-1.24e-01

Pos logit contributions
 rider+0.018
Brian+0.015
lf+0.015
Thomas+0.015
Paul+0.014
Neg logit contributions
 tad-0.019
 lav-0.018
 pods-0.018
 bou-0.018
 spr-0.017
Token by
Token ID416
Feature activation-1.09e-02

Pos logit contributions
 rider+0.184
Brian+0.152
lf+0.147
Paul+0.146
John+0.145
Neg logit contributions
 pods-0.266
 tad-0.263
 lav-0.261
 bou-0.249
mary-0.245
Token its
Token ID663
Feature activation+1.29e-02

Pos logit contributions
 rider+0.013
 riding+0.011
Brian+0.010
John+0.010
lf+0.010
Neg logit contributions
 pods-0.027
 lav-0.026
 tad-0.026
 bou-0.025
mary-0.025
Token beauty
Token ID8737
Feature activation+1.16e-02

Pos logit contributions
 pods+0.025
 lav+0.025
 tad+0.024
 bou+0.023
mary+0.023
Neg logit contributions
 rider-0.022
Brian-0.019
 riding-0.018
lf-0.018
John-0.018
Token and
Token ID290
Feature activation+3.19e-03

Pos logit contributions
 pods+0.024
 lav+0.023
 tad+0.023
 bou+0.022
mary+0.022
Neg logit contributions
 rider-0.018
Brian-0.016
Paul-0.016
John-0.015
Thomas-0.015
Token wanted
Token ID2227
Feature activation+1.04e-02

Pos logit contributions
 pods+0.007
 tad+0.007
 lav+0.007
 bou+0.006
mary+0.006
Neg logit contributions
 rider-0.005
Brian-0.004
John-0.004
lf-0.004
Paul-0.004
Tokenep
Token ID538
Feature activation-4.66e-02

Pos logit contributions
 rider+0.028
 riding+0.024
Brian+0.024
Paul+0.024
lf+0.023
Neg logit contributions
 pods-0.043
 lav-0.041
 tad-0.041
 bou-0.039
mary-0.038
Token inside
Token ID2641
Feature activation-2.47e-02

Pos logit contributions
 rider+0.068
Brian+0.058
lf+0.056
John+0.056
Thomas+0.055
Neg logit contributions
 tad-0.098
 pods-0.097
 lav-0.095
 bou-0.093
mary-0.090
Token and
Token ID290
Feature activation-8.49e-03

Pos logit contributions
 rider+0.027
 riding+0.022
lf+0.021
John+0.021
Brian+0.020
Neg logit contributions
 pods-0.061
 lav-0.061
 tad-0.061
 bou-0.059
mary-0.058
Token su
Token ID424
Feature activation-1.19e-02

Pos logit contributions
 rider+0.013
Brian+0.011
lf+0.011
 riding+0.011
John+0.011
Neg logit contributions
 pods-0.017
 lav-0.017
 tad-0.017
 bou-0.017
mary-0.016
Tokenpr
Token ID1050
Feature activation+1.27e-02

Pos logit contributions
 rider+0.021
Brian+0.019
John+0.019
Paul+0.018
lf+0.018
Neg logit contributions
 pods-0.021
 lav-0.021
 tad-0.021
 bou-0.020
mary-0.019
Tokenised
Token ID1417
Feature activation-1.24e-01

Pos logit contributions
 pods+0.029
 tad+0.029
 lav+0.029
 bou+0.028
mary+0.027
Neg logit contributions
 rider-0.017
Brian-0.014
 riding-0.014
John-0.013
lf-0.013
Token himself
Token ID2241
Feature activation-3.37e-02

Pos logit contributions
 rider+0.202
Brian+0.166
John+0.161
Paul+0.160
lf+0.160
Neg logit contributions
 pods-0.252
 tad-0.251
 lav-0.247
 bou-0.237
flies-0.235
Token when
Token ID618
Feature activation+3.39e-03

Pos logit contributions
 rider+0.040
 riding+0.033
lf+0.032
Brian+0.031
working+0.030
Neg logit contributions
 pods-0.081
 lav-0.081
 tad-0.080
 bou-0.078
mary-0.075
Token he
Token ID339
Feature activation+1.57e-03

Pos logit contributions
 pods+0.009
 tad+0.009
 lav+0.009
 bou+0.008
 spr+0.008
Neg logit contributions
 rider-0.003
Brian-0.003
Paul-0.002
Thomas-0.002
lf-0.002
Token saw
Token ID2497
Feature activation+2.29e-02

Pos logit contributions
 pods+0.003
 tad+0.003
 lav+0.003
 bou+0.003
mary+0.003
Neg logit contributions
 rider-0.003
Brian-0.002
John-0.002
Paul-0.002
Thomas-0.002
Token that
Token ID326
Feature activation-1.90e-03

Pos logit contributions
 lav+0.048
 pods+0.048
 tad+0.048
 bou+0.046
 spr+0.045
Neg logit contributions
 rider-0.034
Brian-0.029
lf-0.028
John-0.027
Paul-0.027
Token blue
Token ID4171
Feature activation-2.51e-02

Pos logit contributions
 lav+0.010
 pods+0.010
 tad+0.010
 bou+0.010
 scr+0.010
Neg logit contributions
 rider-0.012
lf-0.011
Brian-0.011
Thomas-0.011
working-0.010
Token wheels
Token ID13666
Feature activation-6.89e-02

Pos logit contributions
 rider+0.040
Brian+0.040
lf+0.039
Thomas+0.039
dd+0.039
Neg logit contributions
 bou-0.043
 pods-0.042
 spr-0.040
 scr-0.040
 cups-0.040
Token and
Token ID290
Feature activation+7.58e-03

Pos logit contributions
Brian+0.124
 rider+0.123
John+0.120
Thomas+0.120
lf+0.119
Neg logit contributions
 pods-0.106
mary-0.103
 scr-0.102
 bou-0.102
 lav-0.100
Token pink
Token ID11398
Feature activation-6.39e-03

Pos logit contributions
 pods+0.013
 scr+0.013
 lav+0.013
 tad+0.013
 smells+0.012
Neg logit contributions
 rider-0.012
lf-0.012
Brian-0.011
Thomas-0.011
John-0.011
Token handle
Token ID5412
Feature activation-4.55e-02

Pos logit contributions
 rider+0.011
Brian+0.010
John+0.010
Thomas+0.010
lf+0.010
Neg logit contributions
 pods-0.011
 bou-0.011
 scr-0.010
 lav-0.010
 spr-0.010
Tokenbars
Token ID34046
Feature activation-1.24e-01

Pos logit contributions
 rider+0.078
lf+0.073
Brian+0.072
John+0.071
Thomas+0.070
Neg logit contributions
 pods-0.078
 lav-0.074
 bou-0.073
mary-0.072
 tad-0.071
Token.
Token ID13
Feature activation+6.96e-03

Pos logit contributions
lf+0.229
John+0.225
Brian+0.223
 rider+0.220
Thomas+0.216
Neg logit contributions
 pods-0.188
 lav-0.184
 bou-0.180
 scr-0.179
mary-0.177
Token She
Token ID1375
Feature activation+2.32e-02

Pos logit contributions
 tad+0.015
 pods+0.015
 lav+0.014
 bou+0.014
mary+0.014
Neg logit contributions
 rider-0.010
 riding-0.009
Brian-0.009
John-0.009
Paul-0.009
Token liked
Token ID8288
Feature activation-5.01e-03

Pos logit contributions
 scr+0.032
 spr+0.030
 sp+0.029
 smells+0.029
 smelled+0.029
Neg logit contributions
Brian-0.041
over-0.039
han-0.039
Roger-0.039
 rider-0.038
Token to
Token ID284
Feature activation-2.64e-02

Pos logit contributions
 rider+0.008
dd+0.008
lf+0.008
Brian+0.007
over+0.007
Neg logit contributions
 lav-0.008
 bou-0.008
 pods-0.008
 tul-0.008
 scr-0.008
Token go
Token ID467
Feature activation+6.49e-04

Pos logit contributions
dd+0.041
Thomas+0.038
Brian+0.038
John+0.037
Em+0.037
Neg logit contributions
 scr-0.047
 bou-0.045
 spr-0.042
 smells-0.041
 lav-0.041
Token hopped
Token ID45230
Feature activation+1.88e-02

Pos logit contributions
 pods+0.050
 lav+0.049
 tad+0.049
 bou+0.047
mary+0.045
Neg logit contributions
 rider-0.030
 riding-0.024
Brian-0.024
working-0.023
lf-0.023
Token around
Token ID1088
Feature activation-4.33e-02

Pos logit contributions
 lav+0.045
 pods+0.045
 tad+0.044
 bou+0.042
mary+0.042
Neg logit contributions
 rider-0.022
Brian-0.018
 riding-0.017
John-0.017
Paul-0.016
Token and
Token ID290
Feature activation+4.71e-03

Pos logit contributions
 rider+0.041
 riding+0.034
Brian+0.032
lf+0.032
John+0.031
Neg logit contributions
 pods-0.111
 lav-0.109
 tad-0.106
mary-0.105
 bou-0.104
Token explored
Token ID18782
Feature activation+1.22e-02

Pos logit contributions
 pods+0.010
 tad+0.010
 lav+0.010
 bou+0.009
mary+0.009
Neg logit contributions
 rider-0.007
Brian-0.006
John-0.006
lf-0.006
Paul-0.006
Token the
Token ID262
Feature activation-7.21e-03

Pos logit contributions
 pods+0.030
 lav+0.030
 tad+0.029
mary+0.028
 bou+0.028
Neg logit contributions
 rider-0.013
 riding-0.010
Brian-0.010
John-0.010
lf-0.010
Token swamp
Token ID26837
Feature activation+1.00e-01

Pos logit contributions
 day+0.012
 rider+0.012
 time+0.011
 riding+0.011
lf+0.011
Neg logit contributions
 lav-0.012
 pods-0.012
 scr-0.011
 pulp-0.011
comb-0.011
Token.
Token ID13
Feature activation+7.99e-03

Pos logit contributions
 lav+0.257
 pods+0.245
 tad+0.241
 pulp+0.237
 basil+0.235
Neg logit contributions
 rider-0.101
 riding-0.083
lf-0.078
 he-0.074
working-0.072
Token They
Token ID1119
Feature activation+2.55e-02

Pos logit contributions
 pods+0.019
 lav+0.019
 bou+0.018
 tad+0.018
mary+0.017
Neg logit contributions
 rider-0.010
Brian-0.009
John-0.009
 riding-0.008
Paul-0.008
Token made
Token ID925
Feature activation-2.68e-02

Pos logit contributions
 pods+0.055
 lav+0.054
 tad+0.052
 bou+0.051
mary+0.049
Neg logit contributions
 rider-0.037
lf-0.030
Brian-0.030
 riding-0.030
John-0.029
Token sure
Token ID1654
Feature activation-6.64e-03

Pos logit contributions
 rider+0.037
John+0.033
 riding+0.032
Brian+0.032
Paul+0.031
Neg logit contributions
 pods-0.058
 lav-0.057
 tad-0.054
mary-0.054
 bou-0.052
Token the
Token ID262
Feature activation+1.59e-02

Pos logit contributions
 rider+0.009
Brian+0.007
lf+0.007
Paul+0.007
John+0.007
Neg logit contributions
 pods-0.015
 tad-0.015
 lav-0.015
 bou-0.014
mary-0.014
Token!
Token ID0
Feature activation-4.97e-03

Pos logit contributions
 tad+0.122
 pulp+0.114
 pods+0.113
 lav+0.113
 gag+0.109
Neg logit contributions
 rider-0.033
 that-0.030
 day-0.028
lf-0.028
 riding-0.028
Token I
Token ID314
Feature activation+3.46e-03

Pos logit contributions
 rider+0.005
 riding+0.004
 Red+0.003
unt+0.003
Max+0.003
Neg logit contributions
 pods-0.012
 tad-0.012
 lav-0.012
 bou-0.012
 pulp-0.012
Token wonder
Token ID4240
Feature activation-3.43e-02

Pos logit contributions
 pods+0.007
 lav+0.007
 tad+0.007
 bou+0.007
 pulp+0.007
Neg logit contributions
 rider-0.005
working-0.004
 riding-0.004
 day-0.004
 time-0.004
Token if
Token ID611
Feature activation-5.54e-03

Pos logit contributions
 rider+0.053
 riding+0.048
John+0.043
lf+0.043
Paul+0.043
Neg logit contributions
 pods-0.070
 tad-0.068
 lav-0.066
mary-0.063
 bou-0.062
Token it
Token ID340
Feature activation+2.94e-02

Pos logit contributions
 rider+0.006
Brian+0.005
 riding+0.004
lf+0.004
John+0.004
Neg logit contributions
 pods-0.014
 tad-0.014
 lav-0.014
 bou-0.013
mary-0.013
Token smells
Token ID25760
Feature activation+1.07e-01

Pos logit contributions
 tad+0.058
 pods+0.057
 bou+0.055
 lav+0.054
 pulp+0.054
Neg logit contributions
 rider-0.042
working-0.036
 riding-0.036
 time-0.034
 day-0.034
Token nice
Token ID3621
Feature activation+4.72e-02

Pos logit contributions
 pods+0.233
 tad+0.232
 lav+0.220
 bou+0.216
flies+0.214
Neg logit contributions
 rider-0.137
 riding-0.117
working-0.111
 time-0.111
John-0.106
Token."
Token ID526
Feature activation+9.02e-03

Pos logit contributions
 tad+0.102
 pods+0.100
 lav+0.100
 bou+0.096
mary+0.093
Neg logit contributions
 rider-0.065
 riding-0.053
lf-0.052
working-0.052
John-0.051
Token
Token ID220
Feature activation+2.52e-02

Pos logit contributions
 pods+0.019
 lav+0.019
 tad+0.019
 bou+0.018
mary+0.018
Neg logit contributions
 rider-0.013
Brian-0.011
lf-0.011
John-0.011
Paul-0.011
Token
Token ID198
Feature activation+8.27e-04

Pos logit contributions
 pods+0.053
 tad+0.053
 lav+0.052
 bou+0.051
mary+0.050
Neg logit contributions
 rider-0.037
Brian-0.030
working-0.030
lf-0.030
Paul-0.029
Token
Token ID198
Feature activation-8.35e-04

Pos logit contributions
 bou+0.001
mary+0.001
 pods+0.001
 lav+0.001
 tad+0.001
Neg logit contributions
 rider-0.002
working-0.002
 runner-0.001
 racer-0.001
Brian-0.001
Token park
Token ID3952
Feature activation+2.34e-02

Pos logit contributions
 rider+0.028
 cop+0.023
lf+0.023
 riding+0.023
 mechanic+0.022
Neg logit contributions
 tad-0.035
 pods-0.035
 lav-0.033
flies-0.033
mary-0.032
Token or
Token ID393
Feature activation+1.36e-03

Pos logit contributions
 pods+0.051
 lav+0.051
 tad+0.051
 bou+0.049
mary+0.047
Neg logit contributions
 rider-0.033
Brian-0.027
lf-0.026
 riding-0.026
John-0.026
Token bake
Token ID28450
Feature activation+6.49e-02

Pos logit contributions
 pods+0.003
 lav+0.003
 tad+0.003
 bou+0.003
mary+0.002
Neg logit contributions
 rider-0.002
Brian-0.002
John-0.002
Paul-0.002
lf-0.002
Token cookies
Token ID14746
Feature activation+5.42e-02

Pos logit contributions
 pods+0.126
 tad+0.126
 lav+0.124
 bou+0.118
mary+0.115
Neg logit contributions
 rider-0.107
 riding-0.091
lf-0.091
John-0.090
Brian-0.089
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
 pods+0.114
 lav+0.112
 tad+0.111
 bou+0.109
mary+0.106
Neg logit contributions
 rider-0.080
Brian-0.069
Paul-0.067
John-0.066
lf-0.065
Token How
Token ID1374
Feature activation+1.01e-01

Pos logit contributions
 rider+0.015
John+0.012
Tom+0.012
Brian+0.012
 riding+0.012
Neg logit contributions
 pods-0.022
 tad-0.021
 bou-0.021
 lav-0.020
mary-0.020
Token does
Token ID857
Feature activation+3.64e-02

Pos logit contributions
 pods+0.205
 tad+0.204
 lav+0.200
 bou+0.194
mary+0.187
Neg logit contributions
 rider-0.163
John-0.134
lf-0.133
 riding-0.132
Brian-0.131
Token that
Token ID326
Feature activation+2.25e-03

Pos logit contributions
 pods+0.090
 lav+0.090
 tad+0.089
 bou+0.087
mary+0.085
Neg logit contributions
 rider-0.040
Brian-0.032
John-0.031
Paul-0.030
lf-0.030
Token sound
Token ID2128
Feature activation-1.87e-02

Pos logit contributions
 lav+0.004
 pods+0.004
 bou+0.004
 tad+0.004
 scr+0.004
Neg logit contributions
 rider-0.003
Brian-0.003
Paul-0.003
John-0.003
 riding-0.003
Token?"
Token ID1701
Feature activation-1.80e-02

Pos logit contributions
 rider+0.023
Brian+0.019
Paul+0.019
John+0.018
 riding+0.018
Neg logit contributions
 pods-0.044
 lav-0.044
 tad-0.043
 bou-0.042
mary-0.042
Token
Token ID198
Feature activation-8.24e-03

Pos logit contributions
 rider+0.024
Brian+0.019
lf+0.019
John+0.018
Paul+0.018
Neg logit contributions
 pods-0.041
 lav-0.041
 tad-0.041
 bou-0.040
mary-0.039
Token as
Token ID355
Feature activation-5.33e-02

Pos logit contributions
 pods+0.025
 lav+0.025
 tad+0.025
 bou+0.024
mary+0.023
Neg logit contributions
 rider-0.021
Brian-0.019
lf-0.018
John-0.018
Paul-0.018
Token he
Token ID339
Feature activation-2.96e-02

Pos logit contributions
 rider+0.065
Brian+0.063
Thomas+0.058
Paul+0.056
Ryan+0.055
Neg logit contributions
 tad-0.113
 lav-0.112
 pods-0.109
 spr-0.105
 scr-0.104
Token watched
Token ID7342
Feature activation+1.88e-02

Pos logit contributions
 rider+0.043
lf+0.036
Brian+0.035
 riding+0.035
Paul+0.035
Neg logit contributions
 pods-0.063
 lav-0.062
 tad-0.061
 bou-0.060
mary-0.056
Token the
Token ID262
Feature activation+1.78e-02

Pos logit contributions
 lav+0.041
 pods+0.041
 tad+0.041
 bou+0.041
 scr+0.039
Neg logit contributions
 rider-0.026
Brian-0.023
Paul-0.021
Thomas-0.020
working-0.020
Token rain
Token ID6290
Feature activation+7.28e-02

Pos logit contributions
 lav+0.038
 pods+0.037
mary+0.037
bugs+0.035
flies+0.035
Neg logit contributions
 rider-0.023
 day-0.023
 riding-0.021
Tom-0.021
 days-0.020
Token come
Token ID1282
Feature activation+8.32e-02

Pos logit contributions
 lav+0.128
 pods+0.127
 tad+0.126
 bou+0.117
 basil+0.116
Neg logit contributions
 rider-0.133
 riding-0.121
working-0.115
lf-0.112
 day-0.110
Token down
Token ID866
Feature activation+1.12e-02

Pos logit contributions
 lav+0.201
 pods+0.200
 tad+0.197
 basil+0.188
 pulp+0.186
Neg logit contributions
 rider-0.097
 riding-0.088
Paul-0.077
Brian-0.074
John-0.074
Token.
Token ID13
Feature activation+1.65e-03

Pos logit contributions
 lav+0.028
 pods+0.027
 bou+0.027
 tad+0.027
mary+0.026
Neg logit contributions
 rider-0.011
 riding-0.010
John-0.009
His-0.009
 his-0.009
Token
Token ID198
Feature activation-8.44e-03

Pos logit contributions
 pods+0.003
 tad+0.003
 lav+0.003
 bou+0.003
mary+0.003
Neg logit contributions
 rider-0.003
lf-0.002
working-0.002
Brian-0.002
Paul-0.002
Token
Token ID198
Feature activation-6.43e-03

Pos logit contributions
 rider+0.018
working+0.016
 runner+0.015
 cop+0.014
 racer+0.014
Neg logit contributions
 tad-0.012
 pods-0.011
mary-0.011
 bou-0.011
 lav-0.011
TokenJack
Token ID14295
Feature activation-1.23e-02

Pos logit contributions
 rider+0.011
 riding+0.009
working+0.008
rob+0.008
Brian+0.008
Neg logit contributions
 pods-0.012
 tad-0.012
mary-0.011
 lav-0.011
 scr-0.011
Token but
Token ID475
Feature activation-4.60e-02

Pos logit contributions
 rider+0.017
Tom+0.015
lf+0.014
John+0.014
 he+0.014
Neg logit contributions
mary-0.038
 lav-0.038
 pods-0.038
 bou-0.036
 tad-0.035
Token be
Token ID307
Feature activation-4.90e-02

Pos logit contributions
 rider+0.060
 riding+0.050
lf+0.049
Tom+0.046
John+0.046
Neg logit contributions
 pods-0.100
 lav-0.098
 tad-0.097
 bou-0.097
mary-0.096
Token careful
Token ID8161
Feature activation-4.12e-02

Pos logit contributions
 rider+0.066
 riding+0.054
Brian+0.054
John+0.053
lf+0.051
Neg logit contributions
 pods-0.111
 lav-0.107
 tad-0.107
 bou-0.105
mary-0.101
Token.
Token ID13
Feature activation-1.05e-03

Pos logit contributions
 rider+0.065
Brian+0.057
Paul+0.055
Thomas+0.054
John+0.052
Neg logit contributions
 pods-0.083
 tad-0.080
 lav-0.079
 bou-0.077
mary-0.077
Token The
Token ID383
Feature activation+2.02e-02

Pos logit contributions
 rider+0.001
Tom+0.001
 Red+0.001
 Speed+0.001
 riding+0.001
Neg logit contributions
 bou-0.002
 pods-0.002
 lav-0.002
 tad-0.002
mary-0.002
Token bush
Token ID24484
Feature activation+8.92e-02

Pos logit contributions
flies+0.037
 tad+0.036
 lav+0.036
mary+0.035
umes+0.035
Neg logit contributions
 day-0.029
 rider-0.028
lf-0.027
 time-0.027
 days-0.026
Token has
Token ID468
Feature activation+1.46e-02

Pos logit contributions
 lav+0.221
 tad+0.216
 bou+0.211
 tul+0.207
 pods+0.205
Neg logit contributions
 rider-0.085
 day-0.066
 that-0.065
 time-0.064
chair-0.063
Token th
Token ID294
Feature activation+1.08e-03

Pos logit contributions
 tad+0.028
 pods+0.028
 bou+0.028
 lav+0.027
mary+0.027
Neg logit contributions
 rider-0.020
 riding-0.018
 his-0.017
Tom-0.017
John-0.016
Tokenorns
Token ID19942
Feature activation-3.40e-02

Pos logit contributions
 pods+0.002
 tad+0.002
 lav+0.002
 bou+0.002
mary+0.002
Neg logit contributions
 rider-0.001
Brian-0.001
 riding-0.001
John-0.001
lf-0.001
Token."
Token ID526
Feature activation+2.08e-02

Pos logit contributions
 rider+0.039
 riding+0.032
working+0.031
lf+0.029
Brian+0.028
Neg logit contributions
 tad-0.082
 lav-0.081
 pods-0.080
 bou-0.079
mary-0.075
Token Mia
Token ID32189
Feature activation+3.34e-02

Pos logit contributions
 lav+0.034
 pods+0.033
 bou+0.033
 tad+0.031
mary+0.031
Neg logit contributions
 rider-0.040
John-0.037
His-0.036
lf-0.035
Tom-0.035
Token in
Token ID287
Feature activation+2.05e-02

Pos logit contributions
 rider+0.026
Brian+0.022
John+0.021
Paul+0.021
Thomas+0.020
Neg logit contributions
 pods-0.039
 lav-0.039
 tad-0.039
flies-0.037
 bou-0.036
Token something
Token ID1223
Feature activation+2.24e-02

Pos logit contributions
 lav+0.041
 tad+0.040
 pods+0.039
 bou+0.037
 herbs+0.036
Neg logit contributions
 rider-0.035
Brian-0.029
lf-0.029
John-0.028
Paul-0.028
Token,
Token ID11
Feature activation+5.70e-03

Pos logit contributions
 pods+0.051
 lav+0.050
 tad+0.050
 bou+0.049
mary+0.048
Neg logit contributions
 rider-0.029
 riding-0.024
Brian-0.023
lf-0.023
John-0.023
Token but
Token ID475
Feature activation-2.80e-02

Pos logit contributions
 pods+0.012
 lav+0.011
 tad+0.011
 bou+0.011
mary+0.011
Neg logit contributions
 rider-0.009
Brian-0.007
 riding-0.007
John-0.007
lf-0.007
Token it
Token ID340
Feature activation-1.36e-02

Pos logit contributions
 rider+0.042
Brian+0.036
Thomas+0.036
Paul+0.035
John+0.035
Neg logit contributions
 tad-0.060
 lav-0.058
 pods-0.058
 spr-0.055
 bou-0.054
Token's
Token ID338
Feature activation+2.98e-02

Pos logit contributions
 rider+0.020
Brian+0.017
 riding+0.016
working+0.016
lf+0.016
Neg logit contributions
 bou-0.029
 pods-0.029
 lav-0.028
 tad-0.027
mary-0.026
Token also
Token ID635
Feature activation-4.34e-03

Pos logit contributions
 lav+0.058
 pods+0.057
 tad+0.056
mary+0.054
 bou+0.053
Neg logit contributions
 rider-0.049
lf-0.043
Brian-0.043
John-0.042
Paul-0.042
Token okay
Token ID8788
Feature activation+4.83e-02

Pos logit contributions
 rider+0.005
Brian+0.004
 riding+0.004
John+0.004
lf+0.004
Neg logit contributions
 pods-0.010
 lav-0.010
 tad-0.010
 bou-0.010
mary-0.010
Token if
Token ID611
Feature activation+3.39e-03

Pos logit contributions
 pods+0.118
 tad+0.118
 lav+0.117
 bou+0.116
mary+0.111
Neg logit contributions
 rider-0.056
 riding-0.044
Brian-0.042
John-0.042
lf-0.042
Token it
Token ID340
Feature activation+3.18e-02

Pos logit contributions
 tad+0.008
 pods+0.008
 lav+0.008
 herbs+0.007
 bou+0.007
Neg logit contributions
 rider-0.004
Brian-0.003
Paul-0.003
lf-0.003
John-0.003
Token doesn
Token ID1595
Feature activation+2.00e-02

Pos logit contributions
 bou+0.064
 pods+0.063
 lav+0.063
 tad+0.061
mary+0.056
Neg logit contributions
 rider-0.053
 riding-0.044
working-0.043
Brian-0.043
lf-0.040
Token,
Token ID11
Feature activation+4.49e-03

Pos logit contributions
 pods+0.078
 lav+0.077
 tad+0.077
 bou+0.075
mary+0.073
Neg logit contributions
 rider-0.029
Brian-0.022
John-0.021
lf-0.021
Paul-0.021
Token Lily
Token ID20037
Feature activation+2.24e-02

Pos logit contributions
 pods+0.010
 tad+0.010
 lav+0.010
 bou+0.009
mary+0.009
Neg logit contributions
 rider-0.006
lf-0.005
John-0.005
 riding-0.005
Brian-0.005
Token saw
Token ID2497
Feature activation+6.87e-03

Pos logit contributions
 pods+0.044
 tad+0.043
 lav+0.043
mary+0.042
 bou+0.041
Neg logit contributions
 rider-0.036
Brian-0.031
Paul-0.029
John-0.029
lf-0.029
Token a
Token ID257
Feature activation+4.25e-03

Pos logit contributions
 pods+0.015
 lav+0.015
 tad+0.015
 bou+0.015
mary+0.014
Neg logit contributions
 rider-0.009
Brian-0.008
John-0.008
lf-0.008
 riding-0.008
Token bird
Token ID6512
Feature activation+1.61e-02

Pos logit contributions
 pods+0.009
 lav+0.009
 tad+0.009
 bou+0.008
mary+0.008
Neg logit contributions
 rider-0.006
lf-0.005
Brian-0.005
 riding-0.005
John-0.005
Token with
Token ID351
Feature activation+9.74e-03

Pos logit contributions
 pods+0.036
 lav+0.036
 tad+0.036
 bou+0.034
mary+0.034
Neg logit contributions
 rider-0.022
Brian-0.018
lf-0.018
 riding-0.018
John-0.017
Token a
Token ID257
Feature activation-2.97e-03

Pos logit contributions
 pods+0.020
 tad+0.020
 lav+0.020
 bou+0.019
mary+0.018
Neg logit contributions
 rider-0.015
Brian-0.013
lf-0.012
Paul-0.012
John-0.012
Token broken
Token ID5445
Feature activation-2.35e-02

Pos logit contributions
 rider+0.005
Brian+0.004
John+0.004
lf+0.004
Paul+0.004
Neg logit contributions
 pods-0.006
 lav-0.006
 tad-0.006
 bou-0.006
mary-0.005
Token wing
Token ID8539
Feature activation-2.99e-02

Pos logit contributions
 rider+0.046
Brian+0.043
John+0.042
Paul+0.042
Thomas+0.042
Neg logit contributions
 pods-0.038
 lav-0.037
 bou-0.037
 tad-0.037
 spr-0.035
Token on
Token ID319
Feature activation-9.37e-03

Pos logit contributions
 rider+0.048
Brian+0.043
John+0.042
Paul+0.041
Thomas+0.041
Neg logit contributions
 pods-0.059
 lav-0.058
 tad-0.057
 bou-0.056
mary-0.055
Token the
Token ID262
Feature activation-1.15e-02

Pos logit contributions
 rider+0.013
Brian+0.010
lf+0.010
John+0.010
 riding+0.010
Neg logit contributions
 pods-0.021
 tad-0.021
 lav-0.021
 bou-0.020
mary-0.019
Token friends
Token ID2460
Feature activation-9.12e-03

Pos logit contributions
 rider+0.006
 riding+0.005
 his+0.005
lf+0.004
His+0.004
Neg logit contributions
 lav-0.021
 pods-0.021
 bou-0.020
mary-0.020
 tad-0.019
Token that
Token ID326
Feature activation-1.15e-02

Pos logit contributions
 rider+0.013
 riding+0.011
John+0.011
Brian+0.010
lf+0.010
Neg logit contributions
 lav-0.020
 pods-0.020
 tad-0.020
 bou-0.019
mary-0.018
Token a
Token ID257
Feature activation+1.74e-02

Pos logit contributions
 rider+0.015
 riding+0.013
lf+0.012
Brian+0.012
John+0.012
Neg logit contributions
 pods-0.026
 lav-0.026
 tad-0.025
 bou-0.025
mary-0.024
Token factory
Token ID8860
Feature activation+3.79e-02

Pos logit contributions
 pods+0.035
 lav+0.034
 bou+0.033
mary+0.032
 tad+0.032
Neg logit contributions
 rider-0.028
Brian-0.022
 riding-0.022
lf-0.022
working-0.022
Token up
Token ID510
Feature activation-8.10e-03

Pos logit contributions
 lav+0.079
 pods+0.079
 tad+0.078
 bou+0.077
mary+0.074
Neg logit contributions
 rider-0.057
lf-0.047
 riding-0.046
Brian-0.046
John-0.046
Token the
Token ID262
Feature activation+6.50e-03

Pos logit contributions
 rider+0.012
lf+0.011
Brian+0.010
Paul+0.010
John+0.010
Neg logit contributions
 pods-0.017
 tad-0.017
 lav-0.017
mary-0.016
 bou-0.016
Token river
Token ID7850
Feature activation+4.30e-02

Pos logit contributions
 lav+0.011
 pods+0.011
 bou+0.010
 tad+0.010
mary+0.010
Neg logit contributions
 rider-0.013
Brian-0.011
 riding-0.011
lf-0.011
working-0.011
Token was
Token ID373
Feature activation+1.72e-02

Pos logit contributions
 lav+0.097
 pods+0.094
 bou+0.093
 tad+0.093
 basil+0.087
Neg logit contributions
 rider-0.057
 riding-0.048
lf-0.046
working-0.045
John-0.044
Token dumping
Token ID30231
Feature activation+4.41e-02

Pos logit contributions
 pods+0.034
 lav+0.034
 tad+0.033
 bou+0.032
mary+0.031
Neg logit contributions
 rider-0.028
Brian-0.024
John-0.023
lf-0.023
Paul-0.023
Token pink
Token ID11398
Feature activation+4.86e-02

Pos logit contributions
 lav+0.075
mary+0.073
 pods+0.073
 tad+0.072
 bou+0.072
Neg logit contributions
 rider-0.073
 riding-0.067
 his-0.065
John-0.064
lf-0.063
Token water
Token ID1660
Feature activation+2.18e-02

Pos logit contributions
 lav+0.094
 bou+0.092
 tad+0.091
mary+0.090
 pods+0.086
Neg logit contributions
 rider-0.069
 riding-0.062
working-0.059
 days-0.059
lf-0.057
Token forest
Token ID8222
Feature activation+1.85e-02

Pos logit contributions
 rider+0.040
Brian+0.034
John+0.034
lf+0.033
 riding+0.033
Neg logit contributions
 pods-0.050
 lav-0.050
 tad-0.050
 bou-0.048
mary-0.046
Token with
Token ID351
Feature activation+1.74e-03

Pos logit contributions
 pods+0.043
 lav+0.043
 tad+0.042
 bou+0.041
mary+0.039
Neg logit contributions
 rider-0.024
 riding-0.019
lf-0.019
Brian-0.019
working-0.019
Token her
Token ID607
Feature activation-9.29e-03

Pos logit contributions
 pods+0.003
 lav+0.003
 tad+0.003
 bou+0.003
mary+0.003
Neg logit contributions
 rider-0.003
Brian-0.003
Paul-0.002
John-0.002
lf-0.002
Token friends
Token ID2460
Feature activation-8.34e-03

Pos logit contributions
 rider+0.015
Brian+0.012
lf+0.012
John+0.012
Paul+0.012
Neg logit contributions
 pods-0.019
 lav-0.019
 tad-0.019
 bou-0.018
mary-0.017
Token.
Token ID13
Feature activation-5.95e-03

Pos logit contributions
 rider+0.013
Brian+0.012
John+0.011
lf+0.011
Paul+0.011
Neg logit contributions
 pods-0.016
 lav-0.016
 tad-0.016
 bou-0.016
mary-0.015
Token One
Token ID1881
Feature activation+1.25e-02

Pos logit contributions
 rider+0.007
 riding+0.006
Brian+0.006
John+0.006
Paul+0.006
Neg logit contributions
 tad-0.014
 pods-0.014
 lav-0.013
 bou-0.013
mary-0.012
Token day
Token ID1110
Feature activation-4.97e-02

Pos logit contributions
 pods+0.033
 tad+0.033
 lav+0.033
 bou+0.032
mary+0.032
Neg logit contributions
 rider-0.010
Brian-0.009
Paul-0.008
Thomas-0.008
Ryan-0.008
Token,
Token ID11
Feature activation-3.01e-03

Pos logit contributions
 rider+0.085
Brian+0.073
Paul+0.070
Thomas+0.068
John+0.067
Neg logit contributions
 lav-0.101
 pods-0.101
 tad-0.095
 cups-0.093
 bou-0.092
Token they
Token ID484
Feature activation+3.23e-03

Pos logit contributions
 rider+0.005
Brian+0.005
Paul+0.004
John+0.004
lf+0.004
Neg logit contributions
 pods-0.006
 lav-0.006
 tad-0.006
 bou-0.005
mary-0.005
Token saw
Token ID2497
Feature activation+6.50e-03

Pos logit contributions
 pods+0.007
 tad+0.007
 lav+0.007
 bou+0.006
mary+0.006
Neg logit contributions
 rider-0.005
Brian-0.004
lf-0.004
John-0.004
Paul-0.004
Token a
Token ID257
Feature activation+7.01e-03

Pos logit contributions
 pods+0.016
 tad+0.015
 lav+0.015
 bou+0.015
mary+0.015
Neg logit contributions
 rider-0.007
John-0.006
 riding-0.006
Brian-0.006
Paul-0.006
Token to
Token ID284
Feature activation+3.11e-02

Pos logit contributions
 pods+0.052
 lav+0.051
 tad+0.051
 bou+0.049
mary+0.048
Neg logit contributions
 rider-0.036
Brian-0.031
John-0.030
Paul-0.030
lf-0.030
Token.
Token ID13
Feature activation-9.73e-03

Pos logit contributions
 tad+0.065
 lav+0.065
 pods+0.065
 bou+0.064
mary+0.060
Neg logit contributions
 rider-0.044
Brian-0.037
Paul-0.035
Thomas-0.035
working-0.035
Token At
Token ID1629
Feature activation+1.58e-02

Pos logit contributions
 rider+0.013
Brian+0.011
John+0.011
Paul+0.011
 riding+0.010
Neg logit contributions
 pods-0.022
 lav-0.022
 tad-0.022
 bou-0.021
mary-0.020
Token the
Token ID262
Feature activation-1.34e-02

Pos logit contributions
 pods+0.029
 bou+0.029
 lav+0.028
 tad+0.028
mary+0.027
Neg logit contributions
 rider-0.026
Brian-0.022
Thomas-0.022
Paul-0.021
His-0.021
Token market
Token ID1910
Feature activation+2.38e-02

Pos logit contributions
 rider+0.023
Brian+0.019
lf+0.019
John+0.019
 riding+0.019
Neg logit contributions
 pods-0.026
 lav-0.025
 tad-0.025
 bou-0.024
mary-0.024
Token,
Token ID11
Feature activation+9.64e-03

Pos logit contributions
 pods+0.055
 tad+0.055
 lav+0.053
 bou+0.053
mary+0.051
Neg logit contributions
 rider-0.029
Brian-0.025
Thomas-0.024
Paul-0.024
lf-0.022
Token he
Token ID339
Feature activation-3.79e-02

Pos logit contributions
 pods+0.025
 tad+0.025
 lav+0.024
 bou+0.024
mary+0.023
Neg logit contributions
 rider-0.010
Brian-0.007
lf-0.007
Paul-0.007
John-0.007
Token saw
Token ID2497
Feature activation+4.58e-03

Pos logit contributions
 rider+0.055
Brian+0.046
John+0.045
lf+0.045
Paul+0.045
Neg logit contributions
 pods-0.081
 lav-0.080
 tad-0.080
 bou-0.077
mary-0.075
Token lots
Token ID6041
Feature activation+1.86e-02

Pos logit contributions
 pods+0.009
 tad+0.009
 lav+0.009
 bou+0.009
mary+0.008
Neg logit contributions
 rider-0.007
Brian-0.006
lf-0.006
Paul-0.006
John-0.006
Token of
Token ID286
Feature activation+1.92e-02

Pos logit contributions
 pods+0.045
 tad+0.045
 lav+0.044
 bou+0.043
mary+0.041
Neg logit contributions
 rider-0.022
Brian-0.018
Paul-0.017
John-0.017
Thomas-0.017
Token people
Token ID661
Feature activation-3.55e-02

Pos logit contributions
 tad+0.035
 pods+0.034
 lav+0.034
 bou+0.033
 herbs+0.031
Neg logit contributions
 rider-0.034
Brian-0.030
Paul-0.029
lf-0.029
Thomas-0.028
Token"
Token ID1
Feature activation-7.91e-03

Pos logit contributions
 rider+0.022
 riding+0.017
working+0.017
lf+0.017
Brian+0.017
Neg logit contributions
 tad-0.040
 pods-0.040
 lav-0.040
 bou-0.038
mary-0.038
TokenWhat
Token ID2061
Feature activation+3.88e-02

Pos logit contributions
 rider+0.013
Brian+0.011
lf+0.011
John+0.011
 riding+0.011
Neg logit contributions
 pods-0.015
 lav-0.015
 tad-0.015
 bou-0.015
mary-0.014
Token do
Token ID466
Feature activation-1.16e-02

Pos logit contributions
 pods+0.070
 tad+0.069
 lav+0.068
 bou+0.067
mary+0.065
Neg logit contributions
 rider-0.071
 riding-0.060
Brian-0.060
lf-0.060
John-0.060
Token you
Token ID345
Feature activation+7.93e-03

Pos logit contributions
 rider+0.022
Brian+0.020
Paul+0.019
 riding+0.019
Thomas+0.019
Neg logit contributions
 lav-0.018
 tad-0.018
 pods-0.017
 smells-0.016
flies-0.016
Token mean
Token ID1612
Feature activation+3.15e-03

Pos logit contributions
 pods+0.014
 lav+0.014
 tad+0.014
 bou+0.013
mary+0.013
Neg logit contributions
 rider-0.015
Brian-0.013
John-0.013
Paul-0.013
lf-0.012
Token?"
Token ID1701
Feature activation-1.05e-02

Pos logit contributions
 lav+0.006
 pods+0.006
 bou+0.006
 tad+0.006
 smells+0.006
Neg logit contributions
 rider-0.005
Paul-0.004
Brian-0.004
His-0.004
John-0.004
Token asked
Token ID1965
Feature activation-5.13e-02

Pos logit contributions
 rider+0.018
Brian+0.015
Paul+0.015
lf+0.015
working+0.015
Neg logit contributions
 tad-0.020
 lav-0.019
 pods-0.019
 bou-0.018
 basil-0.018
Token Lucy
Token ID22162
Feature activation+1.88e-02

Pos logit contributions
 rider+0.088
 riding+0.072
working+0.072
lf+0.071
Brian+0.070
Neg logit contributions
 tad-0.096
 lav-0.095
 pods-0.095
 bou-0.090
mary-0.088
Token.
Token ID13
Feature activation+6.80e-03

Pos logit contributions
 pods+0.038
 lav+0.037
 tad+0.037
 bou+0.036
mary+0.036
Neg logit contributions
 rider-0.029
Brian-0.025
John-0.024
Paul-0.024
lf-0.024
Token
Token ID198
Feature activation-1.10e-02

Pos logit contributions
 pods+0.015
 lav+0.015
 tad+0.015
 bou+0.015
mary+0.014
Neg logit contributions
 rider-0.009
Brian-0.007
lf-0.007
John-0.007
Paul-0.007
Token
Token ID198
Feature activation-1.16e-02

Pos logit contributions
 rider+0.020
working+0.017
lf+0.015
 runner+0.015
 riding+0.015
Neg logit contributions
 lav-0.019
 tad-0.019
mary-0.018
 pods-0.018
 bou-0.018
Token was
Token ID373
Feature activation+2.31e-02

Pos logit contributions
 bou+0.017
 tad+0.016
 pods+0.016
 lav+0.016
 pulp+0.015
Neg logit contributions
 rider-0.010
Brian-0.009
lf-0.008
 riding-0.008
Max-0.008
Token good
Token ID922
Feature activation+2.76e-03

Pos logit contributions
 pods+0.050
 tad+0.050
 lav+0.049
 bou+0.048
mary+0.046
Neg logit contributions
 rider-0.033
Brian-0.027
 riding-0.027
John-0.026
lf-0.026
Token to
Token ID284
Feature activation+1.55e-02

Pos logit contributions
 pods+0.006
 lav+0.006
 tad+0.006
 bou+0.006
mary+0.005
Neg logit contributions
 rider-0.004
Brian-0.004
Paul-0.004
John-0.003
lf-0.003
Token try
Token ID1949
Feature activation+1.55e-02

Pos logit contributions
 lav+0.028
 pods+0.028
 tad+0.027
 bou+0.026
mary+0.026
Neg logit contributions
 rider-0.028
Brian-0.024
John-0.024
Paul-0.024
lf-0.024
Token new
Token ID649
Feature activation+4.28e-03

Pos logit contributions
 pods+0.035
 tad+0.035
 lav+0.034
 bou+0.033
mary+0.032
Neg logit contributions
 rider-0.020
 riding-0.017
Brian-0.017
lf-0.016
John-0.016
Token things
Token ID1243
Feature activation-6.04e-03

Pos logit contributions
 pods+0.009
 lav+0.009
 tad+0.009
 bou+0.008
mary+0.008
Neg logit contributions
 rider-0.006
Brian-0.005
 riding-0.005
lf-0.005
John-0.005
Token and
Token ID290
Feature activation+3.15e-03

Pos logit contributions
 rider+0.008
Brian+0.007
John+0.007
lf+0.007
Paul+0.007
Neg logit contributions
 pods-0.013
 lav-0.013
 tad-0.013
 bou-0.013
mary-0.012
Token speak
Token ID2740
Feature activation+3.25e-02

Pos logit contributions
 pods+0.006
 lav+0.006
 tad+0.006
 bou+0.006
mary+0.006
Neg logit contributions
 rider-0.005
Brian-0.004
John-0.004
lf-0.004
Paul-0.004
Token to
Token ID284
Feature activation+2.94e-02

Pos logit contributions
 pods+0.076
 tad+0.075
 lav+0.075
 bou+0.072
mary+0.070
Neg logit contributions
 rider-0.040
 riding-0.031
Brian-0.031
His-0.029
John-0.029
Token each
Token ID1123
Feature activation-6.22e-02

Pos logit contributions
 pods+0.061
 lav+0.061
 tad+0.060
 bou+0.058
mary+0.057
Neg logit contributions
 rider-0.044
Brian-0.037
John-0.036
lf-0.036
Paul-0.036
Token other
Token ID584
Feature activation+1.02e-02

Pos logit contributions
 rider+0.100
Brian+0.084
John+0.083
lf+0.082
 riding+0.082
Neg logit contributions
 pods-0.125
 lav-0.123
 tad-0.122
 bou-0.119
mary-0.116
Token the
Token ID262
Feature activation-1.86e-02

Pos logit contributions
 rider+0.022
Brian+0.018
John+0.018
lf+0.018
 riding+0.018
Neg logit contributions
 pods-0.034
 lav-0.034
 tad-0.034
 bou-0.033
mary-0.032
Token rabbit
Token ID22746
Feature activation+4.16e-02

Pos logit contributions
 rider+0.030
Brian+0.025
 riding+0.025
lf+0.024
John+0.024
Neg logit contributions
 pods-0.037
 lav-0.037
 tad-0.036
 bou-0.035
mary-0.035
Token and
Token ID290
Feature activation-1.98e-02

Pos logit contributions
 pods+0.087
 lav+0.086
 tad+0.086
 bou+0.083
mary+0.080
Neg logit contributions
 rider-0.062
Brian-0.052
John-0.051
lf-0.051
 riding-0.050
Token squirrel
Token ID33039
Feature activation-1.12e-02

Pos logit contributions
 rider+0.021
Brian+0.016
 riding+0.015
lf+0.015
John+0.015
Neg logit contributions
 pods-0.051
 lav-0.050
 tad-0.050
 bou-0.049
mary-0.048
Token vowed
Token ID19982
Feature activation-1.37e-02

Pos logit contributions
 rider+0.019
Brian+0.016
 riding+0.016
John+0.016
lf+0.016
Neg logit contributions
 pods-0.021
 lav-0.021
 tad-0.021
 bou-0.020
mary-0.019
Token to
Token ID284
Feature activation+1.23e-03

Pos logit contributions
 rider+0.020
lf+0.018
Brian+0.017
John+0.017
Thomas+0.017
Neg logit contributions
 pods-0.028
 lav-0.028
 tad-0.028
mary-0.027
 bou-0.027
Token never
Token ID1239
Feature activation-3.10e-02

Pos logit contributions
 pods+0.003
 lav+0.003
 tad+0.003
 bou+0.003
mary+0.002
Neg logit contributions
 rider-0.002
Brian-0.001
John-0.001
lf-0.001
Paul-0.001
Token separate
Token ID4553
Feature activation+6.12e-04

Pos logit contributions
 rider+0.058
Brian+0.050
John+0.049
lf+0.049
Paul+0.049
Neg logit contributions
 pods-0.054
 lav-0.053
 tad-0.053
 bou-0.051
mary-0.049
Token again
Token ID757
Feature activation-1.61e-02

Pos logit contributions
 pods+0.001
 lav+0.001
 tad+0.001
 bou+0.001
mary+0.001
Neg logit contributions
 rider-0.001
Brian-0.001
John-0.001
 riding-0.001
Paul-0.001
Token.
Token ID13
Feature activation-3.52e-03

Pos logit contributions
 rider+0.021
 riding+0.018
Brian+0.017
John+0.017
lf+0.017
Neg logit contributions
 pods-0.036
 lav-0.036
 tad-0.036
 bou-0.035
mary-0.034
Token They
Token ID1119
Feature activation+1.96e-02

Pos logit contributions
 rider+0.004
Brian+0.004
John+0.004
Paul+0.004
 riding+0.003
Neg logit contributions
 pods-0.008
 lav-0.008
 tad-0.008
 bou-0.008
mary-0.008
Token instead
Token ID2427
Feature activation+8.31e-03

Pos logit contributions
 tad+0.007
 lav+0.007
 pods+0.007
 bou+0.006
mary+0.006
Neg logit contributions
 rider-0.004
 riding-0.003
lf-0.003
Brian-0.003
working-0.003
Token of
Token ID286
Feature activation+3.86e-02

Pos logit contributions
 pods+0.017
 lav+0.017
 tad+0.017
 bou+0.016
mary+0.016
Neg logit contributions
 rider-0.012
Brian-0.011
Paul-0.010
John-0.010
lf-0.010
Token fear
Token ID3252
Feature activation-1.59e-02

Pos logit contributions
 pods+0.082
 lav+0.082
 tad+0.081
 bou+0.078
mary+0.075
Neg logit contributions
 rider-0.058
Brian-0.048
John-0.047
Paul-0.046
lf-0.046
Token.
Token ID13
Feature activation-3.33e-03

Pos logit contributions
 rider+0.023
Brian+0.019
 riding+0.018
lf+0.018
John+0.018
Neg logit contributions
 pods-0.034
 lav-0.034
 tad-0.034
 bou-0.033
mary-0.032
Token From
Token ID3574
Feature activation+3.72e-02

Pos logit contributions
 rider+0.004
Brian+0.004
John+0.003
Paul+0.003
 riding+0.003
Neg logit contributions
 pods-0.008
 lav-0.008
 tad-0.008
 bou-0.007
mary-0.007
Token then
Token ID788
Feature activation-5.74e-02

Pos logit contributions
 lav+0.067
 pods+0.065
mary+0.065
flies+0.065
 spr+0.065
Neg logit contributions
 rider-0.061
Brian-0.054
working-0.053
gh-0.053
bench-0.053
Token on
Token ID319
Feature activation-3.96e-02

Pos logit contributions
 rider+0.093
John+0.086
lf+0.084
Paul+0.083
James+0.082
Neg logit contributions
 lav-0.103
 pods-0.101
 tad-0.095
 spr-0.090
 bou-0.090
Token,
Token ID11
Feature activation-1.57e-02

Pos logit contributions
 rider+0.066
Brian+0.056
Paul+0.055
 ridden+0.055
John+0.053
Neg logit contributions
 pods-0.073
 lav-0.070
 tad-0.067
mary-0.066
 bou-0.064
Token the
Token ID262
Feature activation-1.65e-02

Pos logit contributions
 rider+0.019
Brian+0.015
John+0.015
lf+0.015
Paul+0.015
Neg logit contributions
 pods-0.038
 lav-0.037
 tad-0.037
 bou-0.036
mary-0.035
Token er
Token ID1931
Feature activation-3.51e-02

Pos logit contributions
 rider+0.031
Brian+0.027
lf+0.027
 riding+0.026
working+0.026
Neg logit contributions
 pods-0.029
 lav-0.028
 tad-0.028
 bou-0.027
mary-0.026
Tokenaser
Token ID6005
Feature activation-2.37e-02

Pos logit contributions
 rider+0.063
John+0.055
Brian+0.055
lf+0.054
 riding+0.054
Neg logit contributions
 pods-0.062
 tad-0.062
 lav-0.062
 bou-0.059
mary-0.056
TokenNo
Token ID2949
Feature activation-2.73e-02

Pos logit contributions
 rider+0.023
lf+0.022
Brian+0.020
working+0.019
 riding+0.019
Neg logit contributions
 tad-0.028
 pods-0.027
 lav-0.027
 scr-0.026
 bou-0.025
Token,
Token ID11
Feature activation-1.83e-02

Pos logit contributions
 rider+0.036
Brian+0.030
lf+0.029
Paul+0.029
working+0.028
Neg logit contributions
 tad-0.060
 pods-0.060
 lav-0.059
 bou-0.059
 scr-0.056
Token Tom
Token ID4186
Feature activation-4.41e-02

Pos logit contributions
 rider+0.024
Brian+0.021
working+0.020
Paul+0.020
lf+0.019
Neg logit contributions
 pods-0.040
 tad-0.040
 lav-0.039
 bou-0.038
 scr-0.037
Token!
Token ID0
Feature activation-1.96e-02

Pos logit contributions
 rider+0.055
Brian+0.044
lf+0.043
John+0.043
 riding+0.042
Neg logit contributions
 pods-0.104
 lav-0.103
 tad-0.103
 bou-0.100
mary-0.097
Token That
Token ID1320
Feature activation+3.80e-02

Pos logit contributions
 rider+0.031
Brian+0.027
John+0.026
Paul+0.026
 riding+0.026
Neg logit contributions
 lav-0.040
 pods-0.040
 tad-0.039
 bou-0.039
mary-0.038
Token sugar
Token ID7543
Feature activation-1.36e-02

Pos logit contributions
 tad+0.075
 lav+0.071
 bou+0.070
 pods+0.070
ount+0.066
Neg logit contributions
 rider-0.066
working-0.054
lf-0.052
 riding-0.051
 day-0.050
Token is
Token ID318
Feature activation+2.62e-02

Pos logit contributions
 rider+0.020
lf+0.018
working+0.017
John+0.016
 riding+0.016
Neg logit contributions
 tad-0.028
 lav-0.028
 bou-0.027
 pods-0.026
 tul-0.025
Token disgusting
Token ID23374
Feature activation+5.71e-02

Pos logit contributions
 tad+0.058
 bou+0.056
 pods+0.055
 lav+0.054
flies+0.052
Neg logit contributions
 rider-0.035
 riding-0.030
working-0.029
lf-0.028
 his-0.028
Token!
Token ID0
Feature activation-1.31e-02

Pos logit contributions
 tad+0.127
 lav+0.126
 pods+0.126
 bou+0.123
mary+0.119
Neg logit contributions
 rider-0.078
 riding-0.063
lf-0.063
John-0.062
Brian-0.062
Tokenâ
Token ID22940
Feature activation+5.55e-03

Pos logit contributions
 rider+0.020
Brian+0.017
lf+0.017
John+0.016
Paul+0.016
Neg logit contributions
 pods-0.027
 tad-0.026
 lav-0.026
 bou-0.025
mary-0.024
Token€
Token ID26391
Feature activation-2.76e-02

Pos logit contributions
 pods+0.009
 lav+0.009
 tad+0.009
 smells+0.009
mary+0.009
Neg logit contributions
 rider-0.009
lf-0.008
 riding-0.007
Brian-0.007
Thomas-0.007
Token on
Token ID319
Feature activation-1.71e-02

Pos logit contributions
 rider+0.037
Brian+0.029
John+0.028
lf+0.028
Paul+0.028
Neg logit contributions
 pods-0.077
 lav-0.076
 tad-0.076
 bou-0.074
mary-0.072
Token the
Token ID262
Feature activation-6.35e-03

Pos logit contributions
 rider+0.026
Brian+0.021
working+0.021
Paul+0.021
Thomas+0.020
Neg logit contributions
 pods-0.036
 tad-0.035
 lav-0.034
 bou-0.034
mary-0.032
Token ground
Token ID2323
Feature activation-2.02e-02

Pos logit contributions
 rider+0.012
lf+0.010
Brian+0.010
 riding+0.010
John+0.010
Neg logit contributions
 pods-0.011
 lav-0.011
 tad-0.011
 bou-0.010
mary-0.010
Token.
Token ID13
Feature activation+8.50e-03

Pos logit contributions
 rider+0.028
Brian+0.023
lf+0.022
John+0.022
 riding+0.022
Neg logit contributions
 pods-0.044
 lav-0.044
 tad-0.044
 bou-0.042
mary-0.041
Token Sadd
Token ID20454
Feature activation+1.83e-02

Pos logit contributions
 pods+0.018
 lav+0.018
 tad+0.018
 bou+0.017
mary+0.017
Neg logit contributions
 rider-0.012
Brian-0.010
John-0.010
 riding-0.010
Paul-0.010
Tokenened
Token ID2945
Feature activation-8.31e-02

Pos logit contributions
 pods+0.038
 lav+0.038
 tad+0.037
 bou+0.037
mary+0.036
Neg logit contributions
 rider-0.028
John-0.024
Brian-0.024
lf-0.024
Paul-0.024
Token,
Token ID11
Feature activation+1.09e-03

Pos logit contributions
 rider+0.106
Brian+0.086
John+0.084
lf+0.084
Paul+0.083
Neg logit contributions
 pods-0.192
 lav-0.190
 tad-0.189
 bou-0.184
mary-0.179
Token the
Token ID262
Feature activation-6.11e-03

Pos logit contributions
 pods+0.003
 lav+0.002
 tad+0.002
 bou+0.002
mary+0.002
Neg logit contributions
 rider-0.001
Brian-0.001
John-0.001
 riding-0.001
lf-0.001
Token rabbit
Token ID22746
Feature activation+4.04e-02

Pos logit contributions
 rider+0.010
lf+0.009
Brian+0.009
 riding+0.009
John+0.008
Neg logit contributions
 pods-0.012
 lav-0.011
 tad-0.011
 bou-0.011
mary-0.011
Token thanked
Token ID26280
Feature activation-2.56e-03

Pos logit contributions
 pods+0.083
 lav+0.083
 tad+0.082
 bou+0.079
mary+0.076
Neg logit contributions
 rider-0.061
Brian-0.051
lf-0.051
 riding-0.051
John-0.050
Token the
Token ID262
Feature activation+2.16e-02

Pos logit contributions
 rider+0.003
Brian+0.002
John+0.002
Paul+0.002
lf+0.002
Neg logit contributions
 pods-0.006
 lav-0.006
 tad-0.006
 bou-0.006
mary-0.006
Token felt
Token ID2936
Feature activation-8.83e-03

Pos logit contributions
 rider+0.027
lf+0.023
John+0.023
Brian+0.023
 riding+0.022
Neg logit contributions
 pods-0.038
 lav-0.038
 tad-0.038
 bou-0.036
mary-0.035
Token sad
Token ID6507
Feature activation+2.89e-02

Pos logit contributions
 rider+0.012
 riding+0.011
John+0.010
Brian+0.010
His+0.010
Neg logit contributions
 pods-0.019
 lav-0.018
 tad-0.018
 bou-0.018
mary-0.018
Token and
Token ID290
Feature activation+8.26e-03

Pos logit contributions
 pods+0.058
 lav+0.057
 tad+0.057
 bou+0.055
mary+0.053
Neg logit contributions
 rider-0.046
Brian-0.040
John-0.039
Paul-0.038
lf-0.038
Token angry
Token ID7954
Feature activation+2.21e-02

Pos logit contributions
 pods+0.017
 lav+0.017
 tad+0.017
 bou+0.017
mary+0.016
Neg logit contributions
 rider-0.012
 riding-0.010
lf-0.010
Brian-0.010
John-0.010
Token at
Token ID379
Feature activation-1.62e-02

Pos logit contributions
 pods+0.048
 lav+0.048
 tad+0.048
 bou+0.046
mary+0.045
Neg logit contributions
 rider-0.031
Brian-0.026
 riding-0.025
lf-0.025
John-0.025
Token each
Token ID1123
Feature activation-7.57e-02

Pos logit contributions
 rider+0.026
Brian+0.022
John+0.021
Paul+0.021
lf+0.021
Neg logit contributions
 pods-0.033
 lav-0.032
 tad-0.032
 bou-0.031
mary-0.030
Token other
Token ID584
Feature activation+5.37e-03

Pos logit contributions
 rider+0.134
Brian+0.115
Paul+0.114
John+0.114
lf+0.112
Neg logit contributions
 pods-0.138
 tad-0.136
 lav-0.135
 bou-0.129
mary-0.124
Token.
Token ID13
Feature activation+7.94e-03

Pos logit contributions
 tad+0.012
 lav+0.012
 pods+0.012
 bou+0.012
mary+0.011
Neg logit contributions
 rider-0.007
 riding-0.006
Brian-0.006
lf-0.006
John-0.006
Token
Token ID198
Feature activation-9.99e-03

Pos logit contributions
 pods+0.018
 lav+0.018
 tad+0.018
 bou+0.017
mary+0.017
Neg logit contributions
 rider-0.010
Brian-0.009
John-0.009
Paul-0.009
lf-0.008
Token
Token ID198
Feature activation-9.23e-03

Pos logit contributions
 rider+0.019
working+0.016
 runner+0.014
lf+0.014
 helmet+0.014
Neg logit contributions
 tad-0.018
 lav-0.017
 pods-0.017
mary-0.017
 bou-0.016
TokenBut
Token ID1537
Feature activation-2.08e-02

Pos logit contributions
 rider+0.013
 riding+0.010
Brian+0.010
working+0.010
lf+0.010
Neg logit contributions
 pods-0.020
 tad-0.020
 lav-0.020
mary-0.019
 bou-0.019
Token a
Token ID257
Feature activation-1.69e-02

Pos logit contributions
 rider+0.022
Brian+0.018
lf+0.018
Paul+0.017
John+0.017
Neg logit contributions
 pods-0.030
 lav-0.029
 tad-0.029
 bou-0.028
mary-0.028
Token good
Token ID922
Feature activation-2.59e-02

Pos logit contributions
 rider+0.025
 riding+0.020
Brian+0.020
John+0.020
lf+0.020
Neg logit contributions
 pods-0.037
 tad-0.036
 lav-0.035
 bou-0.034
mary-0.033
Token holiday
Token ID9912
Feature activation-2.08e-02

Pos logit contributions
 rider+0.027
Brian+0.025
Paul+0.023
lf+0.022
John+0.022
Neg logit contributions
 pods-0.063
 lav-0.063
 bou-0.063
 tad-0.062
mary-0.059
Token,
Token ID11
Feature activation+7.96e-03

Pos logit contributions
 rider+0.033
Brian+0.031
Paul+0.030
John+0.029
Thomas+0.029
Neg logit contributions
 pods-0.038
 bou-0.037
 tad-0.037
 lav-0.036
mary-0.036
Token no
Token ID645
Feature activation-2.21e-02

Pos logit contributions
 pods+0.018
 lav+0.017
 tad+0.017
mary+0.016
 bou+0.016
Neg logit contributions
 rider-0.011
 riding-0.009
John-0.009
lf-0.008
Brian-0.008
Token matter
Token ID2300
Feature activation-8.13e-02

Pos logit contributions
 rider+0.032
Brian+0.030
Paul+0.028
lf+0.027
John+0.027
Neg logit contributions
 pods-0.046
 lav-0.046
 tad-0.046
 bou-0.045
 cups-0.042
Token what
Token ID644
Feature activation+1.56e-03

Pos logit contributions
 rider+0.163
Brian+0.144
John+0.139
lf+0.139
Paul+0.139
Neg logit contributions
 pods-0.129
 lav-0.127
 tad-0.125
 bou-0.120
mary-0.115
Token.
Token ID13
Feature activation-2.19e-04

Pos logit contributions
 tad+0.004
 lav+0.004
 pods+0.004
 bou+0.004
mary+0.003
Neg logit contributions
 rider-0.002
lf-0.002
John-0.002
 riding-0.002
working-0.002
Token They
Token ID1119
Feature activation+2.41e-02

Pos logit contributions
 rider+0.000
John+0.000
Tom+0.000
Brian+0.000
Paul+0.000
Neg logit contributions
 pods-0.001
 lav-0.001
 bou-0.001
 tad-0.001
mary-0.001
Token said
Token ID531
Feature activation-5.49e-03

Pos logit contributions
 pods+0.053
 lav+0.052
 tad+0.051
 bou+0.049
mary+0.048
Neg logit contributions
 rider-0.035
Brian-0.028
John-0.028
lf-0.028
 riding-0.028
Token they
Token ID484
Feature activation+3.76e-02

Pos logit contributions
 rider+0.008
 riding+0.006
John+0.006
lf+0.006
Brian+0.006
Neg logit contributions
 pods-0.012
 tad-0.012
 lav-0.012
 bou-0.011
mary-0.011
Token She
Token ID1375
Feature activation+2.47e-02

Pos logit contributions
 tad+0.028
 pods+0.028
 lav+0.028
 bou+0.027
mary+0.025
Neg logit contributions
 rider-0.020
Brian-0.018
 riding-0.018
John-0.017
Paul-0.017
Token played
Token ID2826
Feature activation+2.95e-03

Pos logit contributions
 pods+0.049
 tad+0.048
 lav+0.048
 bou+0.046
mary+0.045
Neg logit contributions
 rider-0.040
Brian-0.034
John-0.033
Paul-0.033
 riding-0.033
Token for
Token ID329
Feature activation-4.07e-02

Pos logit contributions
 pods+0.007
 tad+0.007
 lav+0.007
 bou+0.007
mary+0.006
Neg logit contributions
 rider-0.004
Brian-0.003
Paul-0.003
John-0.003
working-0.003
Token hours
Token ID2250
Feature activation-8.91e-02

Pos logit contributions
 rider+0.067
Brian+0.053
working+0.053
Thomas+0.051
His+0.051
Neg logit contributions
 lav-0.080
 tad-0.078
 pods-0.077
 bou-0.075
 herbs-0.071
Token and
Token ID290
Feature activation-4.40e-03

Pos logit contributions
 rider+0.140
Brian+0.125
Thomas+0.120
Paul+0.118
His+0.117
Neg logit contributions
 pods-0.174
 tad-0.172
 bou-0.170
 lav-0.169
mary-0.163
Token hours
Token ID2250
Feature activation-8.80e-02

Pos logit contributions
 rider+0.006
Brian+0.004
Thomas+0.004
John+0.004
Paul+0.004
Neg logit contributions
 tad-0.010
 pods-0.010
 lav-0.010
 bou-0.010
mary-0.010
Token until
Token ID1566
Feature activation+2.98e-03

Pos logit contributions
 rider+0.151
Brian+0.134
Thomas+0.128
Paul+0.126
John+0.124
Neg logit contributions
 pods-0.163
 tad-0.162
 lav-0.160
 bou-0.158
mary-0.152
Token she
Token ID673
Feature activation+2.30e-02

Pos logit contributions
 pods+0.006
 lav+0.006
 tad+0.006
 bou+0.006
mary+0.006
Neg logit contributions
 rider-0.005
Brian-0.004
Paul-0.004
Thomas-0.004
John-0.004
Token was
Token ID373
Feature activation+9.25e-03

Pos logit contributions
 pods+0.047
 tad+0.047
 lav+0.046
 bou+0.045
mary+0.044
Neg logit contributions
 rider-0.036
Brian-0.030
John-0.029
Paul-0.029
 riding-0.029
Token too
Token ID1165
Feature activation+2.12e-03

Pos logit contributions
 pods+0.015
 lav+0.015
 tad+0.015
 bou+0.014
mary+0.014
Neg logit contributions
 rider-0.018
Brian-0.016
Paul-0.015
John-0.015
lf-0.015
Token tired
Token ID10032
Feature activation+1.65e-02

Pos logit contributions
 pods+0.004
 tad+0.004
 lav+0.004
 bou+0.004
mary+0.004
Neg logit contributions
 rider-0.004
Brian-0.003
lf-0.003
Paul-0.003
John-0.003
Token wanted
Token ID2227
Feature activation+6.57e-03

Pos logit contributions
 pods+0.010
 lav+0.009
 tad+0.009
 bou+0.009
mary+0.009
Neg logit contributions
 rider-0.007
Brian-0.006
John-0.006
Paul-0.006
lf-0.006
Token to
Token ID284
Feature activation-4.55e-03

Pos logit contributions
 pods+0.013
 lav+0.013
 tad+0.013
mary+0.012
 herbs+0.012
Neg logit contributions
 rider-0.011
lf-0.009
Thomas-0.009
Paul-0.009
Brian-0.009
Token have
Token ID423
Feature activation-1.30e-02

Pos logit contributions
 rider+0.008
Brian+0.007
John+0.006
Paul+0.006
lf+0.006
Neg logit contributions
 pods-0.009
 lav-0.009
 tad-0.009
 bou-0.008
mary-0.008
Token a
Token ID257
Feature activation-1.34e-02

Pos logit contributions
 rider+0.025
working+0.022
lf+0.022
Brian+0.021
 runner+0.020
Neg logit contributions
 lav-0.020
 pods-0.020
 berries-0.020
 scr-0.020
 cups-0.020
Token big
Token ID1263
Feature activation-1.97e-02

Pos logit contributions
 rider+0.022
lf+0.022
Paul+0.022
John+0.021
Brian+0.021
Neg logit contributions
 lav-0.023
 bou-0.023
 pods-0.022
 tad-0.022
 spr-0.021
Token muscle
Token ID8280
Feature activation-8.00e-02

Pos logit contributions
Paul+0.036
John+0.034
 rider+0.034
Brian+0.033
Thomas+0.033
Neg logit contributions
 bou-0.033
 pods-0.032
 lav-0.031
 tad-0.031
 spr-0.031
Token too
Token ID1165
Feature activation+8.30e-03

Pos logit contributions
 rider+0.124
Brian+0.110
John+0.109
Paul+0.108
lf+0.107
Neg logit contributions
 pods-0.158
 tad-0.156
 lav-0.155
 bou-0.150
mary-0.145
Token.
Token ID13
Feature activation-2.57e-03

Pos logit contributions
 pods+0.014
 tad+0.014
 lav+0.013
mary+0.013
 bou+0.013
Neg logit contributions
 rider-0.015
Brian-0.014
Paul-0.014
Thomas-0.014
lf-0.014
Token They
Token ID1119
Feature activation+2.10e-02

Pos logit contributions
 rider+0.004
Brian+0.003
lf+0.003
Paul+0.003
John+0.003
Neg logit contributions
 pods-0.006
 tad-0.005
 lav-0.005
 bou-0.005
mary-0.005
Token thought
Token ID1807
Feature activation+2.22e-02

Pos logit contributions
 lav+0.045
 pods+0.044
 tad+0.044
 bou+0.043
mary+0.042
Neg logit contributions
 rider-0.030
Brian-0.025
John-0.024
His-0.024
lf-0.024
Token of
Token ID286
Feature activation+3.44e-02

Pos logit contributions
 pods+0.048
 lav+0.048
 tad+0.047
 bou+0.045
 basil+0.044
Neg logit contributions
 rider-0.033
Brian-0.027
Paul-0.027
Thomas-0.026
John-0.025