TOP ACTIVATIONS
MAX = 15.379

They have names like Black beard or Hook ."
Tom watched as the lady bugs flew around the garden and
. Like butterflies and lady bugs and cater pill ars ."
he found some colourful lady bugs that he quickly released into
girl who loved to eat sandwiches . Every day she would
girl who loved to eat sandwiches . Every day she would
a purple bird named Bob o . Bob o loved to
tasty seeds and took n aps in the warm sun .
who loved to take n aps . Tom was a nos
every day and took n aps under the sun . <|endoftext|>
James saw lady bugs , ants and even a
princess and her graceful n aps . From that day on
know what kind of n aps are best for her .
saw some butterflies and lady bugs . They tried to catch
a big boo - bo o on his knee . He
he wanted to borrow n aps forever . <|endoftext|> Once upon
and playing with the lady bugs . At
filled with cheers and cl aps . Ad eline was so
friends and always take n aps together when they are feeling
. They smiled and cl apped along . The man waved

MIN ACTIVATIONS
MIN = -15.249

careful and listen to your gut ." Tom and
he was a very brave gut . He let curiosity get
should always listen to his gut and not do anything too
she should always trust her gut and <|endoftext|> One day ,
that he had followed his gut . He had been right
outside ." S aman tha looked at him and
her . S aman tha knew that the par
fun ." S aman tha didn 't know what
and love . But the big boy does not
and messy !" But the dog does not stop
fly away ." But Anna does not listen .
, move !" But the cows do not move
says . " But I don 't like jam
," said Jack . " But can you please turn around
€ . S aman tha was amazed and she
sting you !" But Anna does not hear him
, stop !" But Rex does not stop .
me away ." But the officer did not listen
it back ." But the boy did not care
said , nodding . " But can we come back to

INTERVAL 7.722 - 15.379
CONTAINS 0.904%

his toy boat . Tim my wanted to remove the raft
I will not , Lily ," Ben says . " I
day , he asked his three - year - old neighbor
" That dog was mean ," Tom says .
threat !" Mom my was very scared , and

INTERVAL 0.065 - 7.722
CONTAINS 41.276%

clean up her toys and put them in order . Lily
He looked around and saw a bottle of red sauce
nodded and thanked mom . They went back to their room
, Tweet ie never balanced on a hook again , but
morning he would set off , exploring the world and playing

INTERVAL -7.592 - 0.065
CONTAINS 56.643%

. The sandwich was cool because it was in the fridge
cream cone . The bird was happy and flew away .
the dog . They were still scared . But they also
r every time she saw him . <|endoftext|> Once upon a
. He worked hard to make sure his family had enough

INTERVAL -15.249 - -7.592
CONTAINS 1.177%

them on the tray . But Tim wanted more cheese .
tests . The next day , Lily went to
Lucy asked Jack , " Why don 't you lift the
the bread . But then a big wind comes
that day on , Betsy knew that anything was possible if
Token They
Token ID1119
Feature activation+6.07e+00

Pos logit contributions
But+0.072
 tune+0.067
 rules+0.067
 instructions+0.067
 lesson+0.066
Neg logit contributions
icked-0.078
apped-0.065
icks-0.065
ids-0.056
bug-0.056
Token have
Token ID423
Feature activation+7.54e+00

Pos logit contributions
icked+0.298
apped+0.252
icks+0.246
ids+0.183
o+0.179
Neg logit contributions
 tune-0.488
 lesson-0.472
 instructions-0.468
 path-0.459
 rules-0.459
Token names
Token ID3891
Feature activation+6.22e+00

Pos logit contributions
icked+0.555
apped+0.497
icks+0.494
ids+0.416
bug+0.413
Neg logit contributions
 tune-0.422
 lesson-0.406
 instructions-0.402
 rules-0.395
 path-0.384
Token like
Token ID588
Feature activation+3.74e+00

Pos logit contributions
icked+0.425
icks+0.375
apped+0.372
ids+0.312
bug+0.309
Neg logit contributions
 tune-0.375
 instructions-0.358
 lesson-0.357
 rules-0.350
But-0.347
Token Black
Token ID2619
Feature activation+3.91e+00

Pos logit contributions
icked+0.264
icks+0.236
apped+0.231
ids+0.194
bug+0.191
Neg logit contributions
 tune-0.222
 instructions-0.208
 lesson-0.208
izing-0.203
 rules-0.202
Tokenbeard
Token ID39433
Feature activation+1.54e+01

Pos logit contributions
icked+0.253
apped+0.223
icks+0.219
ids+0.175
bug+0.168
Neg logit contributions
 tune-0.257
 instructions-0.241
izing-0.239
 rules-0.237
 lesson-0.237
Token or
Token ID393
Feature activation-1.56e+00

Pos logit contributions
icked+1.110
icks+0.990
apped+0.975
ids+0.824
bug+0.805
Neg logit contributions
 tune-0.897
 lesson-0.839
 instructions-0.836
izing-0.813
 rules-0.813
Token Hook
Token ID18531
Feature activation+7.99e+00

Pos logit contributions
 tune+0.087
 instructions+0.081
 lesson+0.081
 rules+0.080
 path+0.078
Neg logit contributions
icked-0.117
apped-0.105
icks-0.104
ids-0.088
o-0.086
Token."
Token ID526
Feature activation-2.87e-02

Pos logit contributions
icked+0.559
icks+0.497
apped+0.493
ids+0.418
bug+0.411
Neg logit contributions
 tune-0.466
 lesson-0.443
 instructions-0.441
izing-0.429
 rules-0.429
Token
Token ID198
Feature activation+2.96e-01

Pos logit contributions
 tune+0.002
 lesson+0.002
 instructions+0.002
But+0.002
izing+0.002
Neg logit contributions
icked-0.002
icks-0.002
apped-0.002
ids-0.001
bug-0.001
Token
Token ID198
Feature activation+3.02e-01

Pos logit contributions
icked+0.018
icks+0.016
apped+0.016
o+0.013
ids+0.013
Neg logit contributions
 tune-0.020
 instructions-0.019
 lesson-0.019
izing-0.019
 path-0.018
TokenTom
Token ID13787
Feature activation-6.87e-01

Pos logit contributions
icked+0.022
icks+0.020
apped+0.019
o+0.018
aves+0.017
Neg logit contributions
 tune-0.016
 instructions-0.015
 instincts-0.014
izing-0.014
 rightful-0.014
Token watched
Token ID7342
Feature activation-2.32e+00

Pos logit contributions
 tune+0.035
 instructions+0.033
 path+0.033
 lesson+0.032
But+0.032
Neg logit contributions
icked-0.053
icks-0.049
apped-0.047
ids-0.042
aves-0.039
Token as
Token ID355
Feature activation+1.22e+00

Pos logit contributions
 tune+0.141
 lesson+0.132
 instructions+0.132
izing+0.129
 path+0.128
Neg logit contributions
icked-0.161
icks-0.144
apped-0.141
ids-0.118
bug-0.116
Token the
Token ID262
Feature activation-1.78e+00

Pos logit contributions
icked+0.085
icks+0.076
apped+0.075
bug+0.063
ids+0.063
Neg logit contributions
 tune-0.073
 instructions-0.067
izing-0.067
 rules-0.065
 lesson-0.065
Token lady
Token ID10846
Feature activation-5.42e+00

Pos logit contributions
 tune+0.050
izing+0.042
 instructions+0.042
 lesson+0.041
But+0.039
Neg logit contributions
icked-0.181
icks-0.168
apped-0.167
bug-0.148
ids-0.148
Tokenbugs
Token ID32965
Feature activation+1.38e+01

Pos logit contributions
 tune+0.350
 path+0.326
 instructions+0.322
But+0.316
izing+0.311
Neg logit contributions
icked-0.338
icks-0.321
apped-0.284
ids-0.241
aves-0.231
Token flew
Token ID13112
Feature activation+1.02e+00

Pos logit contributions
icked+0.885
icks+0.793
apped+0.767
ids+0.634
o+0.627
Neg logit contributions
 tune-0.901
 instructions-0.840
 lesson-0.839
 path-0.834
izing-0.821
Token around
Token ID1088
Feature activation+2.74e+00

Pos logit contributions
icked+0.075
icks+0.069
apped+0.065
ids+0.057
aves+0.055
Neg logit contributions
 tune-0.057
 instructions-0.054
 path-0.053
 lesson-0.053
izing-0.052
Token the
Token ID262
Feature activation-1.73e+00

Pos logit contributions
icked+0.197
icks+0.177
apped+0.172
ids+0.146
bug+0.144
Neg logit contributions
 tune-0.159
 instructions-0.149
 lesson-0.149
izing-0.145
 path-0.144
Token garden
Token ID11376
Feature activation-2.97e+00

Pos logit contributions
 tune+0.078
 lesson+0.071
 instructions+0.070
izing+0.069
 rules+0.067
Neg logit contributions
icked-0.145
icks-0.133
apped-0.133
bug-0.117
ids-0.115
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
 tune+0.184
 instructions+0.174
 path+0.171
 lesson+0.169
izing+0.168
Neg logit contributions
icked-0.202
icks-0.181
apped-0.169
ids-0.146
aves-0.142
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
 tune+0.099
 instructions+0.092
 lesson+0.092
izing+0.090
 rules+0.089
Neg logit contributions
icked-0.130
icks-0.117
apped-0.115
ids-0.098
o-0.096
Token Like
Token ID4525
Feature activation+2.47e+00

Pos logit contributions
 tune+0.080
But+0.079
 lesson+0.077
 instructions+0.076
 rules+0.074
Neg logit contributions
icked-0.088
icks-0.076
apped-0.076
ids-0.063
bug-0.061
Token butterflies
Token ID44571
Feature activation+5.97e-01

Pos logit contributions
icked+0.184
apped+0.165
icks+0.164
o+0.139
ids+0.137
Neg logit contributions
 tune-0.134
 instructions-0.128
 lesson-0.126
 rules-0.124
izing-0.124
Token and
Token ID290
Feature activation-1.49e+00

Pos logit contributions
icked+0.041
apped+0.036
icks+0.036
ids+0.029
o+0.029
Neg logit contributions
 tune-0.036
 instructions-0.034
 lesson-0.034
But-0.033
izing-0.033
Token lady
Token ID10846
Feature activation-5.65e+00

Pos logit contributions
 tune+0.074
 instructions+0.069
 lesson+0.069
 rules+0.067
 path+0.066
Neg logit contributions
icked-0.119
icks-0.107
apped-0.107
ids-0.091
o-0.091
Tokenbugs
Token ID32965
Feature activation+1.37e+01

Pos logit contributions
 tune+0.424
 instructions+0.398
But+0.394
 rules+0.391
away+0.386
Neg logit contributions
icked-0.287
apped-0.251
icks-0.235
ids-0.164
igh-0.160
Token and
Token ID290
Feature activation-1.46e+00

Pos logit contributions
icked+0.935
icks+0.833
apped+0.817
ids+0.681
bug+0.670
Neg logit contributions
 tune-0.838
 lesson-0.789
 instructions-0.787
izing-0.769
 rules-0.762
Token cater
Token ID20825
Feature activation-3.68e+00

Pos logit contributions
 tune+0.070
 lesson+0.065
 instructions+0.065
 rules+0.064
 path+0.063
Neg logit contributions
icked-0.118
apped-0.108
icks-0.106
ids-0.092
o-0.092
Tokenpill
Token ID27215
Feature activation+3.79e+00

Pos logit contributions
 tune+0.251
 instructions+0.235
izing+0.234
 lesson+0.233
 rules+0.230
Neg logit contributions
icked-0.222
apped-0.196
icks-0.190
o-0.152
ids-0.151
Tokenars
Token ID945
Feature activation+7.89e+00

Pos logit contributions
icked+0.220
apped+0.196
icks+0.184
o+0.155
bug+0.150
Neg logit contributions
 tune-0.261
 lesson-0.255
 instructions-0.251
 rules-0.249
But-0.244
Token."
Token ID526
Feature activation-1.33e-01

Pos logit contributions
icked+0.569
icks+0.514
apped+0.497
ids+0.423
bug+0.417
Neg logit contributions
 tune-0.456
 instructions-0.424
 lesson-0.423
izing-0.419
 path-0.412
Token he
Token ID339
Feature activation-2.83e+00

Pos logit contributions
icked+0.412
icks+0.373
apped+0.354
ids+0.303
aves+0.293
Neg logit contributions
 tune-0.352
 lesson-0.342
 instructions-0.338
 path-0.336
izing-0.332
Token found
Token ID1043
Feature activation+6.18e+00

Pos logit contributions
But+0.181
 tune+0.180
 path+0.178
 lesson+0.174
 instructions+0.171
Neg logit contributions
icked-0.174
icks-0.163
apped-0.145
ids-0.120
aves-0.115
Token some
Token ID617
Feature activation+5.59e+00

Pos logit contributions
icked+0.386
icks+0.366
apped+0.347
ids+0.321
aves+0.310
Neg logit contributions
 instructions-0.373
 path-0.369
 lesson-0.368
 man-0.357
 rules-0.355
Token colourful
Token ID44799
Feature activation-7.00e-01

Pos logit contributions
icked+0.499
icks+0.459
apped+0.447
ids+0.399
o+0.391
Neg logit contributions
 tune-0.230
 lesson-0.218
 path-0.214
 instructions-0.214
 rules-0.206
Token lady
Token ID10846
Feature activation-5.60e+00

Pos logit contributions
 path+0.024
 tune+0.023
 instructions+0.023
 lesson+0.022
 rules+0.021
Neg logit contributions
icked-0.068
icks-0.063
apped-0.060
o-0.055
ids-0.055
Tokenbugs
Token ID32965
Feature activation+1.37e+01

Pos logit contributions
 tune+0.313
 path+0.295
 instructions+0.289
 man+0.285
But+0.284
Neg logit contributions
icked-0.408
icks-0.381
apped-0.346
ids-0.283
're-0.267
Token that
Token ID326
Feature activation+6.65e-01

Pos logit contributions
icked+0.968
icks+0.868
apped+0.860
ids+0.717
aves+0.686
Neg logit contributions
 tune-0.778
 path-0.758
 instructions-0.738
 lesson-0.730
izing-0.723
Token he
Token ID339
Feature activation-2.76e+00

Pos logit contributions
icked+0.047
icks+0.042
apped+0.041
ids+0.035
aves+0.034
Neg logit contributions
 tune-0.039
 lesson-0.037
 instructions-0.036
izing-0.036
 path-0.036
Token quickly
Token ID2952
Feature activation-3.28e+00

Pos logit contributions
 tune+0.177
 lesson+0.167
 instructions+0.166
izing+0.163
 path+0.162
Neg logit contributions
icked-0.180
icks-0.162
apped-0.156
ids-0.130
aves-0.125
Token released
Token ID2716
Feature activation+3.69e+00

Pos logit contributions
 tune+0.213
 lesson+0.200
 instructions+0.200
izing+0.196
 path+0.194
Neg logit contributions
icked-0.213
icks-0.189
apped-0.184
ids-0.152
bug-0.150
Token into
Token ID656
Feature activation+1.14e+00

Pos logit contributions
icked+0.282
icks+0.255
apped+0.250
ids+0.214
bug+0.209
Neg logit contributions
 tune-0.197
 instructions-0.184
 lesson-0.183
izing-0.177
 rules-0.177
Token girl
Token ID2576
Feature activation+1.68e+00

Pos logit contributions
izing+0.082
 tune+0.080
 lesson+0.079
 instincts+0.078
 rightful+0.077
Neg logit contributions
icked-0.113
icks-0.108
apped-0.101
ids-0.092
o-0.089
Token who
Token ID508
Feature activation+4.59e+00

Pos logit contributions
icked+0.110
icks+0.099
apped+0.094
ids+0.083
aves+0.080
Neg logit contributions
 tune-0.099
But-0.091
 man-0.089
 instructions-0.088
 lesson-0.087
Token loved
Token ID6151
Feature activation+1.98e+00

Pos logit contributions
icked+0.313
icks+0.281
apped+0.272
ids+0.230
aves+0.223
Neg logit contributions
 tune-0.280
 lesson-0.265
 instructions-0.261
 path-0.255
izing-0.254
Token to
Token ID284
Feature activation-6.75e-01

Pos logit contributions
icked+0.118
icks+0.108
apped+0.101
o+0.092
bugs+0.087
Neg logit contributions
 tune-0.121
izing-0.121
But-0.117
 lesson-0.115
 path-0.115
Token eat
Token ID4483
Feature activation+1.77e+00

Pos logit contributions
 tune+0.038
izing+0.038
 lesson+0.037
 instructions+0.037
 rules+0.035
Neg logit contributions
icked-0.047
icks-0.044
apped-0.041
ids-0.035
bug-0.035
Token sandwiches
Token ID29777
Feature activation+1.36e+01

Pos logit contributions
icked+0.118
icks+0.108
apped+0.100
o+0.086
bug+0.085
Neg logit contributions
 tune-0.111
 instructions-0.101
 lesson-0.100
izing-0.100
 path-0.100
Token.
Token ID13
Feature activation-5.31e-01

Pos logit contributions
icked+0.978
icks+0.880
apped+0.858
ids+0.727
bug+0.715
Neg logit contributions
 tune-0.786
 lesson-0.737
 instructions-0.734
izing-0.719
 path-0.710
Token Every
Token ID3887
Feature activation+1.88e+00

Pos logit contributions
 tune+0.035
 lesson+0.033
 instructions+0.033
But+0.033
 path+0.033
Neg logit contributions
icked-0.034
icks-0.030
apped-0.029
ids-0.023
aves-0.023
Token day
Token ID1110
Feature activation-4.01e+00

Pos logit contributions
icked+0.158
icks+0.148
apped+0.145
o+0.132
bug+0.129
Neg logit contributions
 tune-0.079
izing-0.078
 instructions-0.076
 instincts-0.072
 rules-0.069
Token she
Token ID673
Feature activation-2.92e+00

Pos logit contributions
 tune+0.222
 lesson+0.206
 instructions+0.206
izing+0.202
 path+0.199
Neg logit contributions
icked-0.299
icks-0.269
apped-0.263
ids-0.225
bug-0.222
Token would
Token ID561
Feature activation-2.76e+00

Pos logit contributions
 tune+0.173
 instructions+0.162
 lesson+0.162
izing+0.159
 path+0.156
Neg logit contributions
icked-0.206
icks-0.185
apped-0.180
ids-0.152
bug-0.150
Token girl
Token ID2576
Feature activation+1.68e+00

Pos logit contributions
izing+0.082
 tune+0.080
 lesson+0.079
 instincts+0.078
 rightful+0.077
Neg logit contributions
icked-0.113
icks-0.108
apped-0.101
ids-0.092
o-0.089
Token who
Token ID508
Feature activation+4.59e+00

Pos logit contributions
icked+0.110
icks+0.099
apped+0.094
ids+0.083
aves+0.080
Neg logit contributions
 tune-0.099
But-0.091
 man-0.089
 instructions-0.088
 lesson-0.087
Token loved
Token ID6151
Feature activation+1.98e+00

Pos logit contributions
icked+0.313
icks+0.281
apped+0.272
ids+0.230
aves+0.223
Neg logit contributions
 tune-0.280
 lesson-0.265
 instructions-0.261
 path-0.255
izing-0.254
Token to
Token ID284
Feature activation-6.75e-01

Pos logit contributions
icked+0.118
icks+0.108
apped+0.101
o+0.092
bugs+0.087
Neg logit contributions
 tune-0.121
izing-0.121
But-0.117
 lesson-0.115
 path-0.115
Token eat
Token ID4483
Feature activation+1.77e+00

Pos logit contributions
 tune+0.038
izing+0.038
 lesson+0.037
 instructions+0.037
 rules+0.035
Neg logit contributions
icked-0.047
icks-0.044
apped-0.041
ids-0.035
bug-0.035
Token sandwiches
Token ID29777
Feature activation+1.36e+01

Pos logit contributions
icked+0.118
icks+0.108
apped+0.100
o+0.086
bug+0.085
Neg logit contributions
 tune-0.111
 instructions-0.101
 lesson-0.100
izing-0.100
 path-0.100
Token.
Token ID13
Feature activation-5.31e-01

Pos logit contributions
icked+0.978
icks+0.880
apped+0.858
ids+0.727
bug+0.715
Neg logit contributions
 tune-0.786
 lesson-0.737
 instructions-0.734
izing-0.719
 path-0.710
Token Every
Token ID3887
Feature activation+1.88e+00

Pos logit contributions
 tune+0.035
 lesson+0.033
 instructions+0.033
But+0.033
 path+0.033
Neg logit contributions
icked-0.034
icks-0.030
apped-0.029
ids-0.023
aves-0.023
Token day
Token ID1110
Feature activation-4.01e+00

Pos logit contributions
icked+0.158
icks+0.148
apped+0.145
o+0.132
bug+0.129
Neg logit contributions
 tune-0.079
izing-0.078
 instructions-0.076
 instincts-0.072
 rules-0.069
Token she
Token ID673
Feature activation-2.92e+00

Pos logit contributions
 tune+0.222
 lesson+0.206
 instructions+0.206
izing+0.202
 path+0.199
Neg logit contributions
icked-0.299
icks-0.269
apped-0.263
ids-0.225
bug-0.222
Token would
Token ID561
Feature activation-2.76e+00

Pos logit contributions
 tune+0.173
 instructions+0.162
 lesson+0.162
izing+0.159
 path+0.156
Neg logit contributions
icked-0.206
icks-0.185
apped-0.180
ids-0.152
bug-0.150
Token a
Token ID257
Feature activation+8.26e+00

Pos logit contributions
 tune+0.142
 lesson+0.137
But+0.135
 path+0.131
 instructions+0.130
Neg logit contributions
icked-0.114
icks-0.099
apped-0.093
ids-0.080
aves-0.074
Token purple
Token ID14032
Feature activation+2.94e-01

Pos logit contributions
icked+0.636
apped+0.587
icks+0.587
bug+0.523
o+0.512
Neg logit contributions
 tune-0.376
 instructions-0.363
izing-0.355
 lesson-0.350
 rules-0.341
Token bird
Token ID6512
Feature activation+1.55e+00

Pos logit contributions
icked+0.024
icks+0.022
apped+0.022
ids+0.019
bug+0.019
Neg logit contributions
 tune-0.014
 instructions-0.013
 lesson-0.013
izing-0.013
 rules-0.012
Token named
Token ID3706
Feature activation+8.48e+00

Pos logit contributions
icked+0.097
icks+0.085
apped+0.082
ids+0.069
aves+0.065
Neg logit contributions
 tune-0.106
But-0.096
 path-0.096
izing-0.096
 lesson-0.096
Token Bob
Token ID5811
Feature activation+2.51e+00

Pos logit contributions
icked+0.551
icks+0.493
apped+0.486
aves+0.394
ids+0.393
Neg logit contributions
 tune-0.522
But-0.492
 lesson-0.481
 path-0.477
izing-0.476
Tokeno
Token ID78
Feature activation+1.36e+01

Pos logit contributions
icked+0.196
icks+0.180
apped+0.177
ids+0.153
aves+0.143
Neg logit contributions
 tune-0.124
But-0.115
 lesson-0.114
 path-0.113
 instructions-0.109
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
icked+1.073
icks+0.980
apped+0.943
ids+0.830
bug+0.799
Neg logit contributions
 tune-0.691
 lesson-0.624
 instructions-0.620
But-0.618
 path-0.612
Token Bob
Token ID5811
Feature activation+2.25e+00

Pos logit contributions
icked+0.015
icks+0.013
apped+0.013
ids+0.011
bug+0.010
Neg logit contributions
 tune-0.012
But-0.012
 path-0.012
 lesson-0.012
 instructions-0.011
Tokeno
Token ID78
Feature activation+1.33e+01

Pos logit contributions
icked+0.070
icks+0.054
apped+0.049
ids+0.032
aves+0.020
Neg logit contributions
 tune-0.222
But-0.212
 lesson-0.211
 path-0.210
izing-0.206
Token loved
Token ID6151
Feature activation+1.42e+00

Pos logit contributions
icked+1.003
icks+0.916
apped+0.862
ids+0.780
aves+0.719
Neg logit contributions
 tune-0.733
But-0.686
 lesson-0.681
 path-0.673
 instructions-0.647
Token to
Token ID284
Feature activation-1.01e+00

Pos logit contributions
icked+0.079
icks+0.072
o+0.065
apped+0.065
bug+0.055
Neg logit contributions
izing-0.095
 tune-0.093
 rightful-0.092
But-0.090
 decision-0.089
Token tasty
Token ID25103
Feature activation-6.27e-01

Pos logit contributions
icked+0.358
icks+0.318
apped+0.311
ids+0.269
o+0.260
Neg logit contributions
 tune-0.311
 lesson-0.304
 instructions-0.294
But-0.288
 path-0.286
Token seeds
Token ID11904
Feature activation+1.33e+00

Pos logit contributions
 tune+0.032
 lesson+0.030
 instructions+0.030
izing+0.029
 rules+0.029
Neg logit contributions
icked-0.050
icks-0.045
apped-0.044
ids-0.038
bug-0.038
Token and
Token ID290
Feature activation-9.78e-01

Pos logit contributions
icked+0.090
icks+0.080
apped+0.077
ids+0.065
bug+0.063
Neg logit contributions
 tune-0.083
 lesson-0.078
 instructions-0.078
 path-0.077
izing-0.076
Token took
Token ID1718
Feature activation+1.23e+00

Pos logit contributions
 tune+0.066
 lesson+0.063
 instructions+0.062
izing+0.061
 path+0.060
Neg logit contributions
icked-0.061
icks-0.054
apped-0.053
ids-0.043
bug-0.042
Token n
Token ID299
Feature activation+2.92e+00

Pos logit contributions
icked+0.108
apped+0.099
icks+0.098
ids+0.088
bug+0.088
Neg logit contributions
 tune-0.050
 lesson-0.049
 instructions-0.047
 rules-0.042
away-0.042
Tokenaps
Token ID1686
Feature activation+1.35e+01

Pos logit contributions
icked+0.166
icks+0.155
're+0.129
 Four+0.127
ids+0.126
Neg logit contributions
 tune-0.174
 surroundings-0.169
away-0.168
 instincts-0.166
izing-0.165
Token in
Token ID287
Feature activation+4.18e+00

Pos logit contributions
icked+0.859
icks+0.758
apped+0.742
ids+0.607
bug+0.598
Neg logit contributions
 tune-0.892
 instructions-0.842
 lesson-0.840
izing-0.826
 path-0.819
Token the
Token ID262
Feature activation-1.61e+00

Pos logit contributions
icked+0.290
icks+0.266
apped+0.253
bug+0.216
aves+0.214
Neg logit contributions
 tune-0.248
 lesson-0.231
izing-0.229
 instructions-0.227
 path-0.223
Token warm
Token ID5814
Feature activation-1.03e+00

Pos logit contributions
izing+0.086
 tune+0.083
Years+0.081
 instincts+0.080
But+0.080
Neg logit contributions
icked-0.107
icks-0.102
apped-0.098
aves-0.083
bug-0.082
Token sun
Token ID4252
Feature activation-2.90e-01

Pos logit contributions
Years+0.050
 tune+0.049
izing+0.048
But+0.048
 lesson+0.048
Neg logit contributions
icked-0.075
icks-0.071
apped-0.071
o-0.058
ids-0.058
Token.
Token ID13
Feature activation-6.33e-01

Pos logit contributions
 tune+0.016
 lesson+0.015
 instructions+0.015
izing+0.015
 rules+0.015
Neg logit contributions
icked-0.021
icks-0.019
apped-0.019
ids-0.016
bug-0.016
Token who
Token ID508
Feature activation+3.27e+00

Pos logit contributions
icked+0.569
apped+0.530
icks+0.525
bug+0.474
o+0.470
Neg logit contributions
 tune-0.365
 instructions-0.361
izing-0.348
 lesson-0.341
 rules-0.338
Token loved
Token ID6151
Feature activation+1.87e+00

Pos logit contributions
icked+0.213
icks+0.190
apped+0.184
ids+0.154
aves+0.149
Neg logit contributions
 tune-0.209
 lesson-0.199
 instructions-0.196
 path-0.192
But-0.191
Token to
Token ID284
Feature activation-6.86e-01

Pos logit contributions
icked+0.114
icks+0.106
apped+0.098
o+0.089
bugs+0.082
Neg logit contributions
 tune-0.115
izing-0.115
But-0.111
 lesson-0.108
 instructions-0.108
Token take
Token ID1011
Feature activation-8.11e-01

Pos logit contributions
 tune+0.040
izing+0.039
 instructions+0.038
 lesson+0.038
 rules+0.036
Neg logit contributions
icked-0.047
icks-0.044
apped-0.040
ids-0.036
aves-0.035
Token n
Token ID299
Feature activation+2.97e+00

Pos logit contributions
 tune+0.042
 lesson+0.039
 instructions+0.039
izing+0.037
 rules+0.037
Neg logit contributions
icked-0.064
icks-0.058
apped-0.057
bug-0.049
ids-0.049
Tokenaps
Token ID1686
Feature activation+1.35e+01

Pos logit contributions
icked+0.131
icks+0.114
apped+0.088
're+0.086
ids+0.084
Neg logit contributions
 tune-0.229
But-0.223
 surroundings-0.218
away-0.215
 instructions-0.215
Token.
Token ID13
Feature activation-8.84e-01

Pos logit contributions
icked+0.877
icks+0.780
apped+0.760
ids+0.628
bug+0.619
Neg logit contributions
 tune-0.867
 instructions-0.817
 lesson-0.815
izing-0.804
 path-0.793
Token Tom
Token ID4186
Feature activation+5.23e+00

Pos logit contributions
 tune+0.055
But+0.053
 lesson+0.052
 instructions+0.051
 path+0.051
Neg logit contributions
icked-0.060
icks-0.053
apped-0.051
ids-0.041
aves-0.040
Token was
Token ID373
Feature activation-2.51e+00

Pos logit contributions
icked+0.390
icks+0.357
apped+0.351
ids+0.295
aves+0.276
Neg logit contributions
 tune-0.278
But-0.262
 lesson-0.259
 path-0.257
 man-0.248
Token a
Token ID257
Feature activation+7.61e+00

Pos logit contributions
 tune+0.134
But+0.125
 lesson+0.123
 sound+0.120
 instructions+0.118
Neg logit contributions
icked-0.190
icks-0.171
apped-0.168
ids-0.146
aves-0.145
Token nos
Token ID43630
Feature activation-4.32e-01

Pos logit contributions
icked+0.471
apped+0.446
icks+0.430
bug+0.405
o+0.405
Neg logit contributions
 instructions-0.432
izing-0.424
 rules-0.417
 tune-0.402
 directions-0.390
Token every
Token ID790
Feature activation-5.45e+00

Pos logit contributions
icked+0.401
icks+0.359
apped+0.350
o+0.286
bug+0.284
Neg logit contributions
 tune-0.432
izing-0.412
 instructions-0.409
 lesson-0.404
 rules-0.397
Token day
Token ID1110
Feature activation-4.42e+00

Pos logit contributions
 tune+0.198
izing+0.192
 instructions+0.187
 rightful+0.172
 rules+0.172
Neg logit contributions
icked-0.488
icks-0.452
apped-0.446
o-0.400
bug-0.399
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
 tune+0.304
 instructions+0.285
 lesson+0.284
izing+0.281
 path+0.278
Neg logit contributions
icked-0.274
icks-0.241
apped-0.233
ids-0.189
bug-0.187
Token took
Token ID1718
Feature activation+9.98e-01

Pos logit contributions
 tune+0.068
 instructions+0.065
 lesson+0.064
izing+0.063
 rules+0.062
Neg logit contributions
icked-0.076
icks-0.068
apped-0.067
ids-0.055
o-0.055
Token n
Token ID299
Feature activation+2.88e+00

Pos logit contributions
icked+0.080
icks+0.072
apped+0.072
ids+0.063
bug+0.063
Neg logit contributions
 tune-0.049
 lesson-0.047
 instructions-0.046
 rules-0.044
But-0.043
Tokenaps
Token ID1686
Feature activation+1.35e+01

Pos logit contributions
icked+0.119
icks+0.104
apped+0.081
ids+0.080
're+0.076
Neg logit contributions
 tune-0.232
But-0.225
away-0.221
 instincts-0.221
 instructions-0.220
Token under
Token ID739
Feature activation+2.74e+00

Pos logit contributions
icked+0.816
icks+0.712
apped+0.711
ids+0.560
o+0.552
Neg logit contributions
 tune-0.925
 instructions-0.879
izing-0.872
 lesson-0.871
 path-0.863
Token the
Token ID262
Feature activation-1.76e+00

Pos logit contributions
icked+0.183
icks+0.170
apped+0.158
bug+0.134
aves+0.133
Neg logit contributions
 tune-0.166
izing-0.157
 lesson-0.157
 instructions-0.153
But-0.152
Token sun
Token ID4252
Feature activation-2.51e-01

Pos logit contributions
 tune+0.096
izing+0.091
But+0.089
 instructions+0.088
 lesson+0.088
Neg logit contributions
icked-0.125
icks-0.117
apped-0.110
o-0.095
aves-0.095
Token.
Token ID13
Feature activation-7.98e-01

Pos logit contributions
 tune+0.015
 lesson+0.015
 instructions+0.015
 rules+0.015
izing+0.015
Neg logit contributions
icked-0.017
apped-0.015
icks-0.015
o-0.013
ids-0.012
Token<|endoftext|>
Token ID50256
Feature activation+4.88e+00

Pos logit contributions
 tune+0.049
 lesson+0.048
But+0.046
 instructions+0.046
 path+0.045
Neg logit contributions
icked-0.054
icks-0.047
apped-0.046
ids-0.039
bug-0.038
Token
Token ID198
Feature activation-2.77e-01

Pos logit contributions
 instructions+0.075
But+0.074
 tune+0.073
 lesson+0.072
 rules+0.070
Neg logit contributions
icked-0.111
icks-0.100
apped-0.094
bug-0.085
ids-0.083
Token
Token ID198
Feature activation-2.85e-01

Pos logit contributions
 tune+0.018
 lesson+0.017
 instructions+0.017
izing+0.017
 path+0.016
Neg logit contributions
icked-0.018
icks-0.016
apped-0.015
ids-0.013
o-0.013
TokenJames
Token ID14731
Feature activation+5.94e+00

Pos logit contributions
 tune+0.015
 lesson+0.014
 path+0.013
izing+0.013
 instructions+0.013
Neg logit contributions
icked-0.021
icks-0.019
apped-0.019
aves-0.016
o-0.016
Token saw
Token ID2497
Feature activation+3.23e+00

Pos logit contributions
icked+0.420
icks+0.389
apped+0.367
ids+0.315
aves+0.297
Neg logit contributions
 tune-0.353
But-0.324
 instructions-0.323
 path-0.317
izing-0.316
Token lady
Token ID10846
Feature activation-5.81e+00

Pos logit contributions
icked+0.217
icks+0.195
apped+0.189
ids+0.161
aves+0.155
Neg logit contributions
 tune-0.203
 instructions-0.199
 lesson-0.195
 rules-0.192
 path-0.191
Tokenbugs
Token ID32965
Feature activation+1.35e+01

Pos logit contributions
 tune+0.428
 instructions+0.405
izing+0.393
 path+0.392
But+0.389
Neg logit contributions
icked-0.337
icks-0.294
apped-0.278
ids-0.223
aves-0.205
Token,
Token ID11
Feature activation+3.96e+00

Pos logit contributions
icked+0.908
icks+0.791
apped+0.788
ids+0.660
o+0.626
Neg logit contributions
 tune-0.831
 path-0.787
 instructions-0.782
izing-0.762
But-0.759
Token ants
Token ID27842
Feature activation+7.94e-01

Pos logit contributions
icked+0.317
icks+0.292
apped+0.288
ids+0.244
aves+0.224
Neg logit contributions
 tune-0.196
 instructions-0.183
 path-0.181
But-0.180
 man-0.172
Token and
Token ID290
Feature activation-1.58e+00

Pos logit contributions
icked+0.051
apped+0.044
icks+0.044
ids+0.035
aves+0.033
Neg logit contributions
 tune-0.050
 path-0.049
 instructions-0.049
izing-0.048
But-0.048
Token even
Token ID772
Feature activation+1.13e+00

Pos logit contributions
 tune+0.081
 instructions+0.075
 lesson+0.075
 path+0.075
But+0.073
Neg logit contributions
icked-0.124
icks-0.114
apped-0.112
ids-0.096
igh-0.093
Token a
Token ID257
Feature activation+7.18e+00

Pos logit contributions
icked+0.080
apped+0.074
icks+0.073
ids+0.063
igh+0.062
Neg logit contributions
 tune-0.063
 instructions-0.060
 path-0.059
 lesson-0.058
 sound-0.057
Token princess
Token ID21752
Feature activation+5.98e+00

Pos logit contributions
izing+0.335
 tune+0.322
But+0.314
away+0.286
Years+0.285
Neg logit contributions
icked-0.434
icks-0.407
apped-0.388
ids-0.340
aves-0.333
Token and
Token ID290
Feature activation-1.04e+00

Pos logit contributions
icked+0.386
icks+0.343
apped+0.333
ids+0.276
bug+0.273
Neg logit contributions
 tune-0.387
 instructions-0.365
 lesson-0.364
izing-0.361
 rules-0.355
Token her
Token ID607
Feature activation-5.60e+00

Pos logit contributions
 tune+0.058
izing+0.055
 instructions+0.055
 rules+0.053
 lesson+0.052
Neg logit contributions
icked-0.077
icks-0.069
apped-0.067
bug-0.056
o-0.056
Token graceful
Token ID44363
Feature activation-2.68e+00

Pos logit contributions
izing+0.323
 tune+0.305
But+0.301
away+0.288
 instructions+0.286
Neg logit contributions
icked-0.377
icks-0.347
apped-0.334
ids-0.285
bug-0.278
Token n
Token ID299
Feature activation+2.90e+00

Pos logit contributions
izing+0.149
But+0.140
 tune+0.135
 instructions+0.134
Years+0.133
Neg logit contributions
icked-0.180
apped-0.167
icks-0.166
ids-0.145
bug-0.138
Tokenaps
Token ID1686
Feature activation+1.35e+01

Pos logit contributions
icked+0.145
icks+0.128
apped+0.112
ids+0.094
're+0.092
Neg logit contributions
 tune-0.222
 instincts-0.208
 instructions-0.207
izing-0.206
 lesson-0.206
Token.
Token ID13
Feature activation-7.93e-01

Pos logit contributions
icked+0.862
icks+0.756
apped+0.751
ids+0.608
bug+0.601
Neg logit contributions
 tune-0.881
 instructions-0.842
 lesson-0.827
izing-0.824
 rules-0.815
Token From
Token ID3574
Feature activation-2.66e-01

Pos logit contributions
 tune+0.059
izing+0.055
 instructions+0.055
 path+0.054
 lesson+0.053
Neg logit contributions
icked-0.045
icks-0.040
apped-0.039
o-0.030
ids-0.030
Token that
Token ID326
Feature activation+9.96e-01

Pos logit contributions
 tune+0.017
 instructions+0.016
 lesson+0.016
 rules+0.015
izing+0.015
Neg logit contributions
icked-0.018
icks-0.016
apped-0.015
bug-0.013
o-0.013
Token day
Token ID1110
Feature activation-4.39e+00

Pos logit contributions
icked+0.082
icks+0.077
apped+0.076
o+0.069
igh+0.066
Neg logit contributions
 tune-0.041
izing-0.041
 instructions-0.040
 rules-0.037
 rightful-0.037
Token on
Token ID319
Feature activation+4.42e+00

Pos logit contributions
But+0.303
 tune+0.302
 lesson+0.300
 instructions+0.288
izing+0.287
Neg logit contributions
icked-0.240
icks-0.212
apped-0.201
bug-0.180
bugs-0.177
Token know
Token ID760
Feature activation+4.58e+00

Pos logit contributions
 tune+0.057
izing+0.053
 instructions+0.051
 lesson+0.051
 path+0.050
Neg logit contributions
icked-0.075
icks-0.069
apped-0.065
bug-0.056
're-0.056
Token what
Token ID644
Feature activation+6.59e+00

Pos logit contributions
icked+0.268
icks+0.238
apped+0.224
ids+0.183
bug+0.179
Neg logit contributions
 tune-0.328
 lesson-0.304
 instructions-0.302
 path-0.298
izing-0.297
Token kind
Token ID1611
Feature activation-1.33e-01

Pos logit contributions
icked+0.406
icks+0.381
apped+0.336
es+0.299
bugs+0.292
Neg logit contributions
 tune-0.439
izing-0.398
Years-0.394
 path-0.388
 instructions-0.387
Token of
Token ID286
Feature activation+5.57e+00

Pos logit contributions
 tune+0.011
 instructions+0.010
 lesson+0.010
izing+0.010
 path+0.010
Neg logit contributions
icked-0.006
icks-0.005
apped-0.005
ids-0.004
bug-0.004
Token n
Token ID299
Feature activation+2.80e+00

Pos logit contributions
icked+0.430
icks+0.414
apped+0.379
bug+0.339
ids+0.334
Neg logit contributions
 tune-0.262
izing-0.244
Years-0.240
But-0.240
 instructions-0.238
Tokenaps
Token ID1686
Feature activation+1.34e+01

Pos logit contributions
icked+0.145
icks+0.122
apped+0.111
ids+0.092
bug+0.089
Neg logit contributions
izing-0.194
 tune-0.190
But-0.188
 surroundings-0.188
 future-0.188
Token are
Token ID389
Feature activation+6.83e+00

Pos logit contributions
icked+0.786
icks+0.690
apped+0.667
ids+0.538
bug+0.529
Neg logit contributions
 tune-0.953
 instructions-0.901
 lesson-0.900
izing-0.886
 path-0.876
Token best
Token ID1266
Feature activation+2.13e+00

Pos logit contributions
icked+0.430
icks+0.389
apped+0.352
ids+0.311
aves+0.310
Neg logit contributions
 tune-0.453
 instructions-0.407
izing-0.406
 lesson-0.394
 melody-0.392
Token for
Token ID329
Feature activation+3.67e+00

Pos logit contributions
icked+0.146
icks+0.131
apped+0.126
ids+0.106
aves+0.103
Neg logit contributions
 tune-0.132
 lesson-0.121
izing-0.121
 instructions-0.121
But-0.119
Token her
Token ID607
Feature activation-5.55e+00

Pos logit contributions
icked+0.273
icks+0.248
apped+0.239
ids+0.207
aves+0.203
Neg logit contributions
 tune-0.203
 lesson-0.189
 instructions-0.185
 path-0.182
izing-0.178
Token.
Token ID13
Feature activation-6.89e-01

Pos logit contributions
 tune+0.327
izing+0.307
 instructions+0.304
But+0.299
 lesson+0.298
Neg logit contributions
icked-0.384
icks-0.350
apped-0.338
ids-0.286
bug-0.281
Token saw
Token ID2497
Feature activation+3.27e+00

Pos logit contributions
icked+0.025
icks+0.023
apped+0.023
ids+0.019
igh+0.019
Neg logit contributions
 tune-0.020
But-0.019
 lesson-0.019
 path-0.019
 instructions-0.019
Token some
Token ID617
Feature activation+5.32e+00

Pos logit contributions
icked+0.234
icks+0.211
apped+0.208
ids+0.184
igh+0.182
Neg logit contributions
 tune-0.176
 lesson-0.175
 instructions-0.175
 rules-0.170
 sound-0.163
Token butterflies
Token ID44571
Feature activation+5.60e-01

Pos logit contributions
icked+0.452
icks+0.413
apped+0.405
ids+0.353
bug+0.349
Neg logit contributions
 tune-0.239
 lesson-0.219
 instructions-0.219
izing-0.212
 path-0.209
Token and
Token ID290
Feature activation-1.72e+00

Pos logit contributions
icked+0.040
apped+0.035
icks+0.035
o+0.029
ids+0.029
Neg logit contributions
 tune-0.032
 path-0.030
 instructions-0.030
izing-0.030
But-0.029
Token lady
Token ID10846
Feature activation-5.88e+00

Pos logit contributions
 tune+0.071
 instructions+0.066
 lesson+0.065
 path+0.064
 rules+0.063
Neg logit contributions
icked-0.153
apped-0.141
icks-0.139
o-0.122
ids-0.121
Tokenbugs
Token ID32965
Feature activation+1.34e+01

Pos logit contributions
 tune+0.508
 instructions+0.492
izing+0.472
 path+0.466
 man+0.464
Neg logit contributions
icked-0.234
icks-0.190
apped-0.179
igh-0.108
're-0.107
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
icked+1.030
apped+0.925
icks+0.923
ids+0.780
o+0.772
Neg logit contributions
 tune-0.692
 instructions-0.653
 lesson-0.645
 path-0.631
But-0.625
Token They
Token ID1119
Feature activation+5.68e+00

Pos logit contributions
But+0.100
 tune+0.100
 lesson+0.098
 instructions+0.097
 rules+0.096
Neg logit contributions
icked-0.084
icks-0.072
apped-0.068
igh-0.059
bug-0.056
Token tried
Token ID3088
Feature activation-2.08e-01

Pos logit contributions
icked+0.381
icks+0.353
apped+0.341
ids+0.289
o+0.275
Neg logit contributions
 tune-0.331
But-0.323
 path-0.317
 lesson-0.314
 man-0.310
Token to
Token ID284
Feature activation-1.97e+00

Pos logit contributions
 tune+0.011
 lesson+0.011
 instructions+0.011
izing+0.010
 rules+0.010
Neg logit contributions
icked-0.016
icks-0.014
apped-0.014
ids-0.012
bug-0.012
Token catch
Token ID4929
Feature activation-2.90e+00

Pos logit contributions
 tune+0.103
 lesson+0.099
izing+0.099
 instructions+0.095
 instincts+0.095
Neg logit contributions
icked-0.145
icks-0.134
apped-0.125
bug-0.118
ids-0.110
Token a
Token ID257
Feature activation+8.84e+00

Pos logit contributions
icked+0.392
icks+0.335
apped+0.322
ids+0.247
o+0.245
Neg logit contributions
 tune-0.595
izing-0.566
 instructions-0.562
 lesson-0.559
 path-0.551
Token big
Token ID1263
Feature activation+1.60e+00

Pos logit contributions
icked+0.644
icks+0.604
apped+0.587
o+0.519
aves+0.517
Neg logit contributions
izing-0.434
 tune-0.429
 instructions-0.411
But-0.386
 rules-0.384
Token boo
Token ID22513
Feature activation-2.50e+00

Pos logit contributions
icked+0.131
icks+0.116
apped+0.114
ids+0.098
bug+0.097
Neg logit contributions
 tune-0.083
 lesson-0.078
 path-0.076
 instructions-0.075
 rules-0.073
Token-
Token ID12
Feature activation-2.52e+00

Pos logit contributions
 tune+0.118
 instructions+0.112
 lesson+0.110
izing+0.108
 rules+0.106
Neg logit contributions
icked-0.206
icks-0.187
apped-0.184
ids-0.159
o-0.158
Tokenbo
Token ID2127
Feature activation+6.18e+00

Pos logit contributions
 tune+0.172
 instructions+0.169
 path+0.168
 instincts+0.167
 rules+0.165
Neg logit contributions
icked-0.153
icks-0.130
apped-0.126
o-0.117
bug-0.109
Tokeno
Token ID78
Feature activation+1.34e+01

Pos logit contributions
icked+0.225
icks+0.183
apped+0.173
ids+0.110
aves+0.097
Neg logit contributions
 tune-0.558
 lesson-0.536
 path-0.522
But-0.521
 instructions-0.516
Token on
Token ID319
Feature activation+4.76e+00

Pos logit contributions
icked+0.917
icks+0.818
apped+0.791
o+0.680
ids+0.674
Neg logit contributions
 tune-0.800
 instructions-0.772
 lesson-0.765
izing-0.760
 rules-0.741
Token his
Token ID465
Feature activation-5.79e+00

Pos logit contributions
icked+0.320
icks+0.301
apped+0.275
o+0.247
ids+0.235
Neg logit contributions
 tune-0.259
 instructions-0.252
 instincts-0.251
izing-0.251
 lesson-0.246
Token knee
Token ID10329
Feature activation-1.67e+00

Pos logit contributions
izing+0.330
 lesson+0.298
Years+0.294
 tune+0.292
But+0.285
Neg logit contributions
icked-0.399
icks-0.379
apped-0.346
bug-0.311
o-0.311
Token.
Token ID13
Feature activation-6.01e-01

Pos logit contributions
 tune+0.106
 instructions+0.102
izing+0.102
 lesson+0.101
 rules+0.098
Neg logit contributions
icked-0.106
icks-0.093
apped-0.092
o-0.079
ids-0.076
Token He
Token ID679
Feature activation+7.73e-01

Pos logit contributions
 tune+0.041
 instructions+0.038
 lesson+0.038
But+0.038
izing+0.038
Neg logit contributions
icked-0.038
icks-0.033
apped-0.032
ids-0.026
bug-0.026
Token he
Token ID339
Feature activation-2.57e+00

Pos logit contributions
icked+0.068
apped+0.061
icks+0.061
o+0.053
bug+0.051
Neg logit contributions
 tune-0.045
 instructions-0.043
izing-0.042
 rules-0.040
Years-0.040
Token wanted
Token ID2227
Feature activation-1.51e+00

Pos logit contributions
 tune+0.166
 instructions+0.161
 rules+0.156
izing+0.156
 lesson+0.156
Neg logit contributions
icked-0.169
icks-0.145
apped-0.144
bug-0.118
o-0.117
Token to
Token ID284
Feature activation-1.57e+00

Pos logit contributions
izing+0.103
 tune+0.100
 instructions+0.091
 path+0.091
 melody+0.089
Neg logit contributions
icked-0.093
icks-0.083
apped-0.075
o-0.068
ids-0.065
Token borrow
Token ID8804
Feature activation+4.65e+00

Pos logit contributions
izing+0.094
 tune+0.094
 instructions+0.089
 lesson+0.086
 rules+0.084
Neg logit contributions
icked-0.107
icks-0.100
apped-0.091
ids-0.080
bug-0.079
Token n
Token ID299
Feature activation+2.72e+00

Pos logit contributions
icked+0.316
icks+0.283
apped+0.274
ids+0.229
bug+0.226
Neg logit contributions
 tune-0.290
 instructions-0.271
 lesson-0.271
izing-0.268
 path-0.263
Tokenaps
Token ID1686
Feature activation+1.34e+01

Pos logit contributions
icked+0.114
icks+0.101
apped+0.081
ids+0.074
're+0.071
Neg logit contributions
 tune-0.217
But-0.212
away-0.207
 instincts-0.205
 gut-0.204
Token forever
Token ID8097
Feature activation-1.29e+00

Pos logit contributions
icked+0.838
icks+0.728
apped+0.724
ids+0.583
o+0.581
Neg logit contributions
 tune-0.894
 instructions-0.854
 lesson-0.839
izing-0.839
 rules-0.828
Token.
Token ID13
Feature activation-7.41e-01

Pos logit contributions
 tune+0.083
 lesson+0.079
Years+0.079
 instructions+0.078
izing+0.078
Neg logit contributions
icked-0.078
icks-0.069
apped-0.066
o-0.057
bug-0.057
Token<|endoftext|>
Token ID50256
Feature activation+4.92e+00

Pos logit contributions
 tune+0.047
 lesson+0.046
 instructions+0.045
But+0.044
 path+0.044
Neg logit contributions
icked-0.048
icks-0.043
apped-0.041
ids-0.035
bug-0.034
TokenOnce
Token ID7454
Feature activation+2.14e+00

Pos logit contributions
icked+0.411
icks+0.370
apped+0.358
ids+0.325
aves+0.311
Neg logit contributions
But-0.224
 lesson-0.212
 tune-0.206
 path-0.205
 instructions-0.198
Token upon
Token ID2402
Feature activation+5.02e+00

Pos logit contributions
icked+0.134
icks+0.120
apped+0.114
ids+0.096
bug+0.093
Neg logit contributions
 lesson-0.138
 tune-0.133
 path-0.131
But-0.130
 instructions-0.129
Token and
Token ID290
Feature activation-1.69e+00

Pos logit contributions
icked+0.169
icks+0.151
apped+0.151
es+0.123
aves+0.120
Neg logit contributions
 tune-0.193
 instructions-0.180
 lesson-0.179
izing-0.178
 rightful-0.174
Token playing
Token ID2712
Feature activation+5.53e+00

Pos logit contributions
 tune+0.113
 instructions+0.106
 lesson+0.106
izing+0.105
 rules+0.103
Neg logit contributions
icked-0.107
icks-0.094
apped-0.092
ids-0.075
o-0.075
Token with
Token ID351
Feature activation+7.48e-01

Pos logit contributions
icked+0.346
icks+0.307
apped+0.302
ids+0.246
bug+0.240
Neg logit contributions
 tune-0.365
 lesson-0.347
 instructions-0.346
izing-0.341
 rules-0.334
Token the
Token ID262
Feature activation-2.39e+00

Pos logit contributions
icked+0.061
icks+0.056
apped+0.054
ids+0.047
bug+0.047
Neg logit contributions
 tune-0.036
 lesson-0.033
 instructions-0.033
izing-0.033
 path-0.032
Token lady
Token ID10846
Feature activation-5.93e+00

Pos logit contributions
 tune+0.111
izing+0.105
But+0.100
 instructions+0.100
 lesson+0.098
Neg logit contributions
icked-0.182
icks-0.169
apped-0.167
bug-0.149
aves-0.146
Tokenbugs
Token ID32965
Feature activation+1.34e+01

Pos logit contributions
 tune+0.367
 instructions+0.342
 path+0.339
 lesson+0.329
But+0.327
Neg logit contributions
icked-0.405
icks-0.362
apped-0.348
ids-0.282
aves-0.270
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
icked+0.932
icks+0.835
apped+0.813
ids+0.683
bug+0.669
Neg logit contributions
 tune-0.804
 instructions-0.753
 lesson-0.752
izing-0.734
 path-0.732
Token
Token ID220
Feature activation+1.89e+00

Pos logit contributions
 tune+0.094
 lesson+0.092
But+0.091
 instructions+0.090
 path+0.089
Neg logit contributions
icked-0.102
icks-0.089
apped-0.087
ids-0.072
bug-0.072
Token
Token ID198
Feature activation-2.67e-01

Pos logit contributions
icked+0.147
icks+0.131
apped+0.127
bug+0.116
ids+0.113
Neg logit contributions
But-0.084
 instructions-0.080
 lesson-0.079
 tune-0.076
 rules-0.075
Token
Token ID198
Feature activation-2.99e-01

Pos logit contributions
 tune+0.018
izing+0.017
 instructions+0.017
 lesson+0.017
 path+0.016
Neg logit contributions
icked-0.017
icks-0.015
apped-0.014
o-0.012
ids-0.012
TokenAt
Token ID2953
Feature activation-6.98e+00

Pos logit contributions
 tune+0.014
izing+0.014
 instructions+0.013
 instincts+0.013
 rightful+0.012
Neg logit contributions
icked-0.023
icks-0.022
apped-0.021
o-0.020
bug-0.019
Token filled
Token ID5901
Feature activation+6.04e+00

Pos logit contributions
 tune+0.173
 instructions+0.163
 lesson+0.162
izing+0.160
 rules+0.158
Neg logit contributions
icked-0.173
icks-0.153
apped-0.149
ids-0.123
bug-0.122
Token with
Token ID351
Feature activation+1.17e+00

Pos logit contributions
icked+0.307
apped+0.288
icks+0.262
bug+0.226
o+0.221
Neg logit contributions
 tune-0.420
 instincts-0.415
 instructions-0.414
 lesson-0.412
 path-0.410
Token cheers
Token ID34550
Feature activation+5.93e+00

Pos logit contributions
icked+0.095
icks+0.086
apped+0.084
ids+0.074
o+0.072
Neg logit contributions
 tune-0.057
 lesson-0.053
 instructions-0.052
izing-0.051
 rules-0.050
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
icked+0.394
icks+0.350
apped+0.345
ids+0.285
bug+0.279
Neg logit contributions
 tune-0.371
 lesson-0.351
 instructions-0.350
izing-0.342
 path-0.340
Token cl
Token ID537
Feature activation-1.52e+00

Pos logit contributions
 tune+0.066
 lesson+0.062
 instructions+0.061
 path+0.059
izing+0.059
Neg logit contributions
icked-0.089
icks-0.080
apped-0.077
ids-0.067
aves-0.066
Tokenaps
Token ID1686
Feature activation+1.34e+01

Pos logit contributions
 tune+0.158
izing+0.150
 rightful+0.145
away+0.145
 melody+0.144
Neg logit contributions
icked-0.037
icks-0.030
ids-0.014
aves-0.011
apped-0.011
Token.
Token ID13
Feature activation-9.45e-01

Pos logit contributions
icked+0.910
icks+0.801
apped+0.800
ids+0.644
bug+0.635
Neg logit contributions
 tune-0.825
 instructions-0.785
 lesson-0.773
 path-0.766
 rules-0.764
Token Ad
Token ID1215
Feature activation+6.18e-02

Pos logit contributions
 tune+0.062
 lesson+0.058
 instructions+0.058
izing+0.057
But+0.056
Neg logit contributions
icked-0.061
icks-0.054
apped-0.052
ids-0.044
bug-0.043
Tokeneline
Token ID4470
Feature activation+5.74e+00

Pos logit contributions
icked+0.004
icks+0.004
apped+0.003
ids+0.003
o+0.003
Neg logit contributions
 tune-0.004
 instructions-0.004
izing-0.004
 lesson-0.004
But-0.004
Token was
Token ID373
Feature activation-2.57e+00

Pos logit contributions
icked+0.416
icks+0.380
apped+0.361
ids+0.316
aves+0.306
Neg logit contributions
 tune-0.328
 lesson-0.305
 instructions-0.301
But-0.294
 path-0.294
Token so
Token ID523
Feature activation+2.70e+00

Pos logit contributions
 tune+0.164
 lesson+0.151
But+0.151
 instructions+0.147
 sound+0.146
Neg logit contributions
icked-0.174
icks-0.155
apped-0.145
ids-0.127
aves-0.126
Token friends
Token ID2460
Feature activation+6.43e+00

Pos logit contributions
 tune+0.170
izing+0.161
 instructions+0.160
 lesson+0.154
But+0.154
Neg logit contributions
icked-0.193
icks-0.174
apped-0.170
o-0.143
bug-0.141
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
icked+0.456
icks+0.409
apped+0.400
ids+0.338
bug+0.331
Neg logit contributions
 tune-0.377
 lesson-0.355
 instructions-0.351
izing-0.343
But-0.342
Token always
Token ID1464
Feature activation-2.50e+00

Pos logit contributions
 tune+0.063
 lesson+0.059
 instructions+0.059
izing+0.057
 path+0.057
Neg logit contributions
icked-0.071
icks-0.063
apped-0.062
ids-0.052
bug-0.051
Token take
Token ID1011
Feature activation-1.11e+00

Pos logit contributions
 tune+0.156
 lesson+0.147
 instructions+0.144
 path+0.141
 rules+0.141
Neg logit contributions
icked-0.168
icks-0.150
apped-0.146
ids-0.126
bug-0.120
Token n
Token ID299
Feature activation+3.03e+00

Pos logit contributions
 tune+0.058
 instructions+0.055
 lesson+0.055
 rules+0.052
izing+0.052
Neg logit contributions
icked-0.085
icks-0.077
apped-0.076
ids-0.065
bug-0.064
Tokenaps
Token ID1686
Feature activation+1.34e+01

Pos logit contributions
icked+0.126
icks+0.108
apped+0.087
're+0.077
ids+0.076
Neg logit contributions
 tune-0.248
But-0.237
 instructions-0.237
 instincts-0.236
izing-0.233
Token together
Token ID1978
Feature activation+6.45e+00

Pos logit contributions
icked+0.861
icks+0.764
apped+0.748
ids+0.615
bug+0.606
Neg logit contributions
 tune-0.867
 instructions-0.817
 lesson-0.812
izing-0.806
 path-0.797
Token when
Token ID618
Feature activation-4.44e+00

Pos logit contributions
icked+0.417
icks+0.375
apped+0.365
bug+0.308
ids+0.306
Neg logit contributions
 tune-0.392
 lesson-0.370
izing-0.368
 instructions-0.365
 path-0.361
Token they
Token ID484
Feature activation+2.56e+00

Pos logit contributions
 tune+0.248
 instructions+0.230
 lesson+0.230
izing+0.227
 path+0.222
Neg logit contributions
icked-0.327
icks-0.295
apped-0.288
ids-0.245
bug-0.242
Token are
Token ID389
Feature activation+6.37e+00

Pos logit contributions
icked+0.165
icks+0.146
apped+0.144
ids+0.119
o+0.115
Neg logit contributions
 tune-0.165
 lesson-0.157
 instructions-0.155
 path-0.152
 rules-0.151
Token feeling
Token ID4203
Feature activation+1.61e+00

Pos logit contributions
icked+0.471
icks+0.423
apped+0.421
ids+0.359
aves+0.347
Neg logit contributions
 tune-0.350
 lesson-0.329
 instructions-0.324
 rules-0.316
 path-0.314
Token.
Token ID13
Feature activation-8.25e-01

Pos logit contributions
 tune+0.135
 instructions+0.128
izing+0.127
 lesson+0.127
But+0.125
Neg logit contributions
icked-0.149
icks-0.135
apped-0.132
ids-0.109
es-0.108
Token They
Token ID1119
Feature activation+6.18e+00

Pos logit contributions
 tune+0.058
 instructions+0.055
 lesson+0.055
izing+0.054
But+0.054
Neg logit contributions
icked-0.050
icks-0.044
apped-0.042
ids-0.034
bug-0.033
Token smiled
Token ID13541
Feature activation+3.69e+00

Pos logit contributions
icked+0.417
icks+0.383
apped+0.361
ids+0.310
aves+0.294
Neg logit contributions
 tune-0.386
 path-0.354
 lesson-0.353
 instructions-0.351
But-0.346
Token and
Token ID290
Feature activation-1.08e+00

Pos logit contributions
icked+0.247
icks+0.222
apped+0.210
ids+0.181
bug+0.177
Neg logit contributions
 tune-0.236
 instructions-0.219
 lesson-0.219
izing-0.216
But-0.214
Token cl
Token ID537
Feature activation-1.50e+00

Pos logit contributions
 tune+0.069
 lesson+0.064
 instructions+0.063
 path+0.062
But+0.062
Neg logit contributions
icked-0.071
icks-0.065
apped-0.061
ids-0.053
aves-0.052
Tokenapped
Token ID6320
Feature activation+1.33e+01

Pos logit contributions
 tune+0.145
izing+0.136
away+0.134
But+0.132
 rightful+0.131
Neg logit contributions
icked-0.033
icks-0.031
ids-0.019
izards-0.014
bugs-0.014
Token along
Token ID1863
Feature activation+5.08e-01

Pos logit contributions
icked+0.755
apped+0.681
icks+0.665
aves+0.508
ids+0.505
Neg logit contributions
 tune-0.890
izing-0.873
 lesson-0.866
 instructions-0.865
 path-0.855
Token.
Token ID13
Feature activation-7.59e-01

Pos logit contributions
icked+0.029
apped+0.026
icks+0.025
ids+0.020
bug+0.020
Neg logit contributions
 tune-0.033
 lesson-0.033
izing-0.032
 instincts-0.032
 rightful-0.032
Token The
Token ID383
Feature activation+5.54e-01

Pos logit contributions
 tune+0.047
But+0.045
 instructions+0.044
 lesson+0.044
 rules+0.042
Neg logit contributions
icked-0.052
icks-0.047
apped-0.042
bug-0.038
ids-0.037
Token man
Token ID582
Feature activation-6.49e+00

Pos logit contributions
icked+0.044
apped+0.041
icks+0.040
bug+0.034
o+0.034
Neg logit contributions
 tune-0.025
izing-0.025
 instructions-0.023
But-0.023
away-0.023
Token waved
Token ID26834
Feature activation+3.66e+00

Pos logit contributions
 tune+0.395
 lesson+0.361
 instructions+0.359
But+0.355
 path+0.353
Neg logit contributions
icked-0.452
icks-0.412
apped-0.389
ids-0.335
aves-0.321
Token careful
Token ID8161
Feature activation-6.52e+00

Pos logit contributions
 tune+0.116
 lesson+0.107
 instructions+0.106
izing+0.103
But+0.102
Neg logit contributions
icked-0.220
icks-0.201
apped-0.197
ids-0.173
bug-0.169
Token and
Token ID290
Feature activation-1.96e+00

Pos logit contributions
 tune+0.404
 lesson+0.379
 instructions+0.378
izing+0.371
 path+0.367
Neg logit contributions
icked-0.442
icks-0.397
apped-0.385
ids-0.322
bug-0.317
Token listen
Token ID6004
Feature activation-2.72e+00

Pos logit contributions
 tune+0.104
izing+0.098
 instructions+0.095
 lesson+0.094
Years+0.092
Neg logit contributions
icked-0.143
icks-0.135
apped-0.122
o-0.111
bug-0.108
Token to
Token ID284
Feature activation-2.22e+00

Pos logit contributions
 tune+0.169
izing+0.157
 path+0.155
 instructions+0.154
 lesson+0.153
Neg logit contributions
icked-0.180
icks-0.163
apped-0.156
o-0.132
ids-0.130
Token your
Token ID534
Feature activation-4.55e+00

Pos logit contributions
 tune+0.107
izing+0.105
 lesson+0.101
 path+0.098
 instructions+0.096
Neg logit contributions
icked-0.168
icks-0.162
apped-0.143
ids-0.131
o-0.130
Token gut
Token ID12500
Feature activation-1.52e+01

Pos logit contributions
izing+0.259
 tune+0.249
away+0.231
But+0.231
Years+0.229
Neg logit contributions
icked-0.303
icks-0.295
apped-0.275
o-0.235
ids-0.232
Token."
Token ID526
Feature activation-8.97e-01

Pos logit contributions
 tune+0.918
izing+0.876
 instructions+0.873
 lesson+0.870
 path+0.845
Neg logit contributions
icked-1.021
icks-0.930
apped-0.899
o-0.752
ids-0.751
Token
Token ID198
Feature activation-5.35e-01

Pos logit contributions
 tune+0.059
 lesson+0.055
 instructions+0.055
izing+0.054
 path+0.053
Neg logit contributions
icked-0.058
icks-0.052
apped-0.050
ids-0.041
bug-0.041
Token
Token ID198
Feature activation-5.76e-01

Pos logit contributions
 tune+0.039
izing+0.037
 instructions+0.037
 lesson+0.036
 path+0.036
Neg logit contributions
icked-0.030
icks-0.026
apped-0.025
o-0.022
bug-0.020
TokenTom
Token ID13787
Feature activation-1.45e+00

Pos logit contributions
 tune+0.035
 lesson+0.033
 decision+0.032
 path+0.032
 instructions+0.032
Neg logit contributions
icked-0.034
apped-0.031
o-0.031
icks-0.030
bug-0.028
Token and
Token ID290
Feature activation-2.03e+00

Pos logit contributions
 tune+0.091
 lesson+0.085
 instructions+0.084
But+0.083
 path+0.082
Neg logit contributions
icked-0.097
icks-0.088
apped-0.084
ids-0.073
aves-0.068
Token he
Token ID339
Feature activation-3.27e+00

Pos logit contributions
 tune+0.270
 instructions+0.251
 lesson+0.247
izing+0.246
 rules+0.243
Neg logit contributions
icked-0.413
apped-0.368
icks-0.367
bug-0.318
o-0.316
Token was
Token ID373
Feature activation-3.22e+00

Pos logit contributions
 tune+0.206
 instructions+0.200
izing+0.199
 rules+0.198
 lesson+0.191
Neg logit contributions
icked-0.225
apped-0.191
icks-0.184
bug-0.166
o-0.165
Token a
Token ID257
Feature activation+7.08e+00

Pos logit contributions
 tune+0.171
 lesson+0.160
 instructions+0.159
But+0.159
 path+0.154
Neg logit contributions
icked-0.245
icks-0.222
apped-0.220
ids-0.187
aves-0.183
Token very
Token ID845
Feature activation+1.63e+00

Pos logit contributions
icked+0.426
icks+0.388
apped+0.383
bug+0.350
aves+0.341
Neg logit contributions
izing-0.412
 instructions-0.385
 tune-0.384
 rules-0.372
 rightful-0.364
Token brave
Token ID14802
Feature activation-1.44e+00

Pos logit contributions
icked+0.105
icks+0.095
apped+0.089
o+0.081
bug+0.079
Neg logit contributions
 tune-0.100
izing-0.096
 instructions-0.094
 tunes-0.089
 melody-0.088
Token gut
Token ID12500
Feature activation-1.49e+01

Pos logit contributions
 tune+0.072
 lesson+0.066
 instructions+0.066
izing+0.064
 path+0.063
Neg logit contributions
icked-0.117
icks-0.107
apped-0.104
ids-0.090
o-0.089
Token.
Token ID13
Feature activation-1.47e+00

Pos logit contributions
 tune+0.878
izing+0.851
 instructions+0.848
 lesson+0.846
But+0.823
Neg logit contributions
icked-0.984
icks-0.891
apped-0.871
bug-0.740
o-0.739
Token He
Token ID679
Feature activation-2.02e-01

Pos logit contributions
 tune+0.090
 instructions+0.084
 lesson+0.084
izing+0.083
 path+0.082
Neg logit contributions
icked-0.102
icks-0.091
apped-0.089
ids-0.074
bug-0.073
Token let
Token ID1309
Feature activation-2.47e+00

Pos logit contributions
 tune+0.013
izing+0.013
 instructions+0.012
 rules+0.012
 lesson+0.012
Neg logit contributions
icked-0.013
apped-0.011
icks-0.011
bug-0.010
o-0.009
Token curiosity
Token ID20136
Feature activation-2.97e+00

Pos logit contributions
 tune+0.187
izing+0.178
 instructions+0.173
 lesson+0.171
But+0.169
Neg logit contributions
icked-0.137
icks-0.119
apped-0.114
ids-0.090
o-0.090
Token get
Token ID651
Feature activation-1.69e+00

Pos logit contributions
 tune+0.201
 lesson+0.189
 instructions+0.189
izing+0.186
 path+0.183
Neg logit contributions
icked-0.185
icks-0.164
apped-0.159
ids-0.131
bug-0.128
Token should
Token ID815
Feature activation-3.22e-01

Pos logit contributions
 tune+0.171
 lesson+0.168
 instructions+0.162
 path+0.159
 rules+0.158
Neg logit contributions
icked-0.210
icks-0.190
apped-0.187
ids-0.156
bug-0.151
Token always
Token ID1464
Feature activation-3.24e+00

Pos logit contributions
 tune+0.019
izing+0.019
 instructions+0.018
 lesson+0.017
away+0.017
Neg logit contributions
icked-0.021
icks-0.020
apped-0.018
're-0.017
ids-0.016
Token listen
Token ID6004
Feature activation-2.44e+00

Pos logit contributions
 tune+0.181
izing+0.178
 lesson+0.169
 instructions+0.168
Years+0.158
Neg logit contributions
icked-0.224
icks-0.212
apped-0.195
're-0.173
o-0.172
Token to
Token ID284
Feature activation-1.87e+00

Pos logit contributions
 tune+0.159
izing+0.151
 rightful+0.146
 path+0.146
 instructions+0.144
Neg logit contributions
icked-0.147
icks-0.129
apped-0.126
o-0.106
aves-0.105
Token his
Token ID465
Feature activation-6.62e+00

Pos logit contributions
izing+0.109
 tune+0.102
 lesson+0.098
Okay+0.094
Years+0.094
Neg logit contributions
icked-0.117
icks-0.115
apped-0.095
ow-0.088
o-0.088
Token gut
Token ID12500
Feature activation-1.47e+01

Pos logit contributions
izing+0.452
 tune+0.402
away+0.388
if+0.369
Years+0.367
Neg logit contributions
icked-0.341
icks-0.329
apped-0.328
o-0.274
ids-0.271
Token and
Token ID290
Feature activation-1.33e+00

Pos logit contributions
 tune+0.895
izing+0.859
 instructions+0.853
 lesson+0.852
But+0.823
Neg logit contributions
icked-0.976
icks-0.882
apped-0.859
ids-0.720
o-0.717
Token not
Token ID407
Feature activation-2.44e+00

Pos logit contributions
izing+0.078
 tune+0.078
 lesson+0.070
 instructions+0.069
Days+0.069
Neg logit contributions
icked-0.086
icks-0.080
apped-0.074
o-0.067
're-0.066
Token do
Token ID466
Feature activation-1.05e+00

Pos logit contributions
 tune+0.135
izing+0.131
 lesson+0.129
 instructions+0.128
Years+0.122
Neg logit contributions
icked-0.173
icks-0.161
apped-0.150
o-0.136
bug-0.132
Token anything
Token ID1997
Feature activation-1.74e-01

Pos logit contributions
 tune+0.073
izing+0.065
 path+0.065
 melody+0.065
 rightful+0.064
Neg logit contributions
icked-0.062
icks-0.057
apped-0.048
es-0.045
ids-0.044
Token too
Token ID1165
Feature activation+3.27e+00

Pos logit contributions
 tune+0.011
 melody+0.010
 instructions+0.010
Years+0.010
 path+0.010
Neg logit contributions
icked-0.011
icks-0.010
apped-0.009
es-0.008
o-0.008
Token she
Token ID673
Feature activation-3.44e+00

Pos logit contributions
icked+0.049
icks+0.043
apped+0.042
o+0.038
bug+0.037
Neg logit contributions
 tune-0.046
izing-0.044
 instructions-0.042
away-0.042
 path-0.041
Token should
Token ID815
Feature activation-5.52e-02

Pos logit contributions
 tune+0.218
 instructions+0.205
 lesson+0.203
izing+0.201
 rules+0.199
Neg logit contributions
icked-0.231
icks-0.204
apped-0.199
ids-0.166
o-0.166
Token always
Token ID1464
Feature activation-3.00e+00

Pos logit contributions
 tune+0.003
izing+0.003
 instructions+0.003
 lesson+0.003
away+0.003
Neg logit contributions
icked-0.004
icks-0.004
apped-0.003
're-0.003
ow-0.003
Token trust
Token ID3774
Feature activation-2.74e+00

Pos logit contributions
 tune+0.165
izing+0.163
 instructions+0.153
 lesson+0.152
 rules+0.145
Neg logit contributions
icked-0.212
icks-0.200
apped-0.182
o-0.163
're-0.161
Token her
Token ID607
Feature activation-5.72e+00

Pos logit contributions
 tune+0.148
 lesson+0.140
izing+0.137
 rightful+0.131
 path+0.131
Neg logit contributions
icked-0.203
icks-0.184
apped-0.174
o-0.160
ids-0.158
Token gut
Token ID12500
Feature activation-1.46e+01

Pos logit contributions
 tune+0.350
izing+0.344
 melody+0.310
 sensation+0.309
away+0.308
Neg logit contributions
icked-0.337
icks-0.304
apped-0.284
es-0.265
igh-0.262
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
 tune+0.907
izing+0.881
 instructions+0.867
 lesson+0.854
 path+0.840
Neg logit contributions
icked-0.938
icks-0.849
apped-0.823
o-0.689
ids-0.683
Token<|endoftext|>
Token ID50256
Feature activation+4.85e+00

Pos logit contributions
 tune+0.065
izing+0.064
 instructions+0.058
away+0.057
 melody+0.056
Neg logit contributions
icked-0.075
icks-0.069
apped-0.064
o-0.056
're-0.056
TokenOne
Token ID3198
Feature activation+1.05e+00

Pos logit contributions
icked+0.366
icks+0.329
apped+0.312
ids+0.275
aves+0.265
Neg logit contributions
But-0.259
 tune-0.240
 instructions-0.237
 lesson-0.237
 future-0.235
Token day
Token ID1110
Feature activation-4.38e+00

Pos logit contributions
icked+0.098
icks+0.090
apped+0.089
o+0.080
bug+0.079
Neg logit contributions
 tune-0.039
 instructions-0.035
izing-0.035
 lesson-0.033
 rules-0.033
Token,
Token ID11
Feature activation+4.55e+00

Pos logit contributions
But+0.280
 lesson+0.269
 tune+0.267
 path+0.252
izing+0.250
Neg logit contributions
icked-0.266
icks-0.253
apped-0.224
bug-0.202
ids-0.199
Token that
Token ID326
Feature activation+6.72e-01

Pos logit contributions
icked+0.055
icks+0.049
apped+0.049
o+0.041
bug+0.040
Neg logit contributions
 tune-0.047
izing-0.044
 instructions-0.044
 lesson-0.043
 path-0.043
Token he
Token ID339
Feature activation-2.86e+00

Pos logit contributions
icked+0.052
icks+0.047
apped+0.046
o+0.040
bug+0.039
Neg logit contributions
 tune-0.034
izing-0.032
 instructions-0.031
But-0.030
 lesson-0.029
Token had
Token ID550
Feature activation+5.70e+00

Pos logit contributions
 tune+0.220
 instructions+0.213
 lesson+0.210
 rules+0.209
izing+0.208
Neg logit contributions
icked-0.151
icks-0.126
apped-0.125
bug-0.099
o-0.098
Token followed
Token ID3940
Feature activation-1.68e+00

Pos logit contributions
icked+0.338
icks+0.295
apped+0.287
ids+0.229
bug+0.226
Neg logit contributions
 tune-0.405
 instructions-0.383
 lesson-0.383
izing-0.379
 path-0.373
Token his
Token ID465
Feature activation-6.55e+00

Pos logit contributions
 tune+0.095
izing+0.092
 lesson+0.088
But+0.086
Years+0.086
Neg logit contributions
icked-0.112
icks-0.106
apped-0.097
ids-0.087
aves-0.083
Token gut
Token ID12500
Feature activation-1.44e+01

Pos logit contributions
izing+0.346
But+0.342
 tune+0.334
Years+0.314
away+0.313
Neg logit contributions
icked-0.418
icks-0.408
apped-0.393
bugs-0.334
aves-0.334
Token.
Token ID13
Feature activation-9.94e-01

Pos logit contributions
 tune+0.828
 lesson+0.778
 instructions+0.776
izing+0.768
But+0.750
Neg logit contributions
icked-1.029
icks-0.929
apped-0.905
ids-0.766
bug-0.758
Token He
Token ID679
Feature activation+2.48e-01

Pos logit contributions
 tune+0.068
izing+0.063
 instructions+0.063
 lesson+0.062
 path+0.061
Neg logit contributions
icked-0.062
icks-0.056
apped-0.054
o-0.044
ids-0.044
Token had
Token ID550
Feature activation+5.77e+00

Pos logit contributions
icked+0.017
icks+0.015
apped+0.015
o+0.013
bug+0.013
Neg logit contributions
 tune-0.015
 instructions-0.014
izing-0.014
 lesson-0.014
 rules-0.014
Token been
Token ID587
Feature activation+1.81e+00

Pos logit contributions
icked+0.357
icks+0.317
apped+0.307
ids+0.252
bug+0.247
Neg logit contributions
 tune-0.393
 lesson-0.372
 instructions-0.371
izing-0.360
 path-0.360
Token right
Token ID826
Feature activation-3.93e+00

Pos logit contributions
icked+0.125
icks+0.113
apped+0.108
ids+0.092
aves+0.088
Neg logit contributions
 tune-0.112
 instructions-0.104
But-0.104
 lesson-0.103
izing-0.099
Token outside
Token ID2354
Feature activation+2.50e+00

Pos logit contributions
 tune+0.216
 lesson+0.204
 instructions+0.203
izing+0.199
 path+0.197
Neg logit contributions
icked-0.236
icks-0.210
apped-0.204
ids-0.170
bug-0.168
Token."
Token ID526
Feature activation-6.10e-01

Pos logit contributions
icked+0.171
icks+0.152
apped+0.148
ids+0.124
bug+0.122
Neg logit contributions
 tune-0.154
 instructions-0.145
 lesson-0.145
izing-0.142
 rules-0.141
Token
Token ID198
Feature activation-4.23e-01

Pos logit contributions
 tune+0.044
izing+0.042
 instructions+0.042
 lesson+0.041
 rules+0.041
Neg logit contributions
icked-0.034
apped-0.030
icks-0.030
o-0.024
ids-0.023
Token
Token ID198
Feature activation-4.37e-01

Pos logit contributions
 tune+0.033
izing+0.031
 instructions+0.030
 path+0.029
 rules+0.029
Neg logit contributions
icked-0.023
icks-0.020
apped-0.019
o-0.017
bug-0.014
TokenS
Token ID50
Feature activation-5.01e+00

Pos logit contributions
 tune+0.025
 instructions+0.023
izing+0.023
 path+0.022
 temptation+0.022
Neg logit contributions
icked-0.028
apped-0.026
o-0.024
icks-0.024
bug-0.022
Tokenaman
Token ID10546
Feature activation-1.40e+01

Pos logit contributions
 tune+0.198
 instructions+0.176
 path+0.172
 lesson+0.171
 rules+0.167
Neg logit contributions
icked-0.460
apped-0.418
icks-0.416
o-0.366
bug-0.365
Tokentha
Token ID12898
Feature activation-5.08e+00

Pos logit contributions
 tune+0.667
 lesson+0.640
 rules+0.612
 instructions+0.607
 path+0.588
Neg logit contributions
icked-1.167
apped-1.044
icks-1.014
o-0.910
bug-0.895
Token looked
Token ID3114
Feature activation+4.54e+00

Pos logit contributions
 tune+0.304
 instructions+0.284
 lesson+0.284
izing+0.278
 path+0.275
Neg logit contributions
icked-0.354
icks-0.320
apped-0.309
ids-0.261
bug-0.255
Token at
Token ID379
Feature activation-2.24e+00

Pos logit contributions
icked+0.320
icks+0.290
apped+0.276
ids+0.239
o+0.234
Neg logit contributions
 tune-0.268
 lesson-0.253
 instructions-0.252
 path-0.248
But-0.246
Token him
Token ID683
Feature activation-1.53e+00

Pos logit contributions
 tune+0.158
 instructions+0.148
 lesson+0.147
izing+0.147
 rules+0.144
Neg logit contributions
icked-0.126
apped-0.109
icks-0.108
bug-0.089
o-0.086
Token and
Token ID290
Feature activation-1.70e+00

Pos logit contributions
 tune+0.102
 lesson+0.098
 instructions+0.095
izing+0.095
 rules+0.094
Neg logit contributions
icked-0.093
icks-0.083
apped-0.081
bug-0.067
ids-0.066
Token her
Token ID607
Feature activation-6.45e+00

Pos logit contributions
 tune+0.128
 lesson+0.117
izing+0.116
 instructions+0.115
But+0.112
Neg logit contributions
icked-0.174
icks-0.157
apped-0.154
ids-0.132
bug-0.132
Token.
Token ID13
Feature activation-1.71e+00

Pos logit contributions
 tune+0.384
 lesson+0.370
izing+0.368
But+0.362
Years+0.362
Neg logit contributions
icked-0.422
icks-0.386
apped-0.371
bug-0.326
ids-0.317
Token
Token ID198
Feature activation-4.83e-01

Pos logit contributions
 tune+0.119
 instructions+0.111
 lesson+0.111
izing+0.110
 rules+0.109
Neg logit contributions
icked-0.104
icks-0.092
apped-0.089
ids-0.073
bug-0.071
Token
Token ID198
Feature activation-5.08e-01

Pos logit contributions
 tune+0.035
izing+0.032
 instructions+0.032
 lesson+0.031
 rules+0.031
Neg logit contributions
icked-0.029
icks-0.025
apped-0.024
o-0.020
ids-0.020
TokenS
Token ID50
Feature activation-4.94e+00

Pos logit contributions
 tune+0.026
 instructions+0.024
izing+0.023
 instincts+0.023
 rules+0.023
Neg logit contributions
icked-0.038
icks-0.034
apped-0.034
o-0.031
bug-0.031
Tokenaman
Token ID10546
Feature activation-1.39e+01

Pos logit contributions
 tune+0.193
 instructions+0.169
 lesson+0.167
 path+0.163
 rules+0.162
Neg logit contributions
icked-0.459
icks-0.414
apped-0.412
o-0.375
bug-0.373
Tokentha
Token ID12898
Feature activation-4.99e+00

Pos logit contributions
 tune+0.661
 lesson+0.642
 rules+0.610
 instructions+0.599
 path+0.578
Neg logit contributions
icked-1.155
apped-1.025
icks-1.002
o-0.903
bug-0.898
Token knew
Token ID2993
Feature activation-8.05e+00

Pos logit contributions
 tune+0.289
 instructions+0.269
 lesson+0.267
izing+0.261
But+0.261
Neg logit contributions
icked-0.356
icks-0.325
apped-0.312
ids-0.266
aves-0.258
Token that
Token ID326
Feature activation+1.04e-01

Pos logit contributions
 tune+0.519
 instructions+0.456
izing+0.452
 lesson+0.448
 rules+0.447
Neg logit contributions
icked-0.548
icks-0.467
apped-0.456
bug-0.398
ids-0.397
Token the
Token ID262
Feature activation-2.45e+00

Pos logit contributions
icked+0.007
icks+0.006
apped+0.006
bug+0.006
bugs+0.005
Neg logit contributions
 tune-0.006
izing-0.006
 instructions-0.005
Years-0.005
away-0.005
Token par
Token ID1582
Feature activation-1.19e+00

Pos logit contributions
 tune+0.135
 instructions+0.126
izing+0.126
 lesson+0.125
But+0.122
Neg logit contributions
icked-0.179
icks-0.163
apped-0.159
bug-0.136
ids-0.135
Token fun
Token ID1257
Feature activation-2.77e-01

Pos logit contributions
 tune+0.019
 instructions+0.017
izing+0.017
 lesson+0.017
 path+0.017
Neg logit contributions
icked-0.020
icks-0.018
apped-0.017
o-0.015
bug-0.015
Token."
Token ID526
Feature activation-5.34e-01

Pos logit contributions
 tune+0.019
 lesson+0.018
 instructions+0.018
izing+0.018
 path+0.017
Neg logit contributions
icked-0.017
icks-0.015
apped-0.015
ids-0.012
bug-0.012
Token
Token ID198
Feature activation-3.10e-01

Pos logit contributions
 tune+0.037
 instructions+0.034
 lesson+0.034
izing+0.034
 rules+0.033
Neg logit contributions
icked-0.033
icks-0.029
apped-0.028
o-0.023
ids-0.023
Token
Token ID198
Feature activation-3.04e-01

Pos logit contributions
 tune+0.023
izing+0.022
 instructions+0.021
 rules+0.021
 path+0.021
Neg logit contributions
icked-0.017
icks-0.015
apped-0.015
o-0.013
bug-0.011
TokenS
Token ID50
Feature activation-4.91e+00

Pos logit contributions
 tune+0.018
izing+0.017
 temptation+0.017
 instructions+0.017
 rules+0.017
Neg logit contributions
icked-0.018
apped-0.016
o-0.016
icks-0.015
bug-0.014
Tokenaman
Token ID10546
Feature activation-1.39e+01

Pos logit contributions
 tune+0.188
 instructions+0.165
 path+0.163
 lesson+0.159
 rules+0.157
Neg logit contributions
icked-0.455
apped-0.415
icks-0.411
o-0.366
bug-0.361
Tokentha
Token ID12898
Feature activation-5.01e+00

Pos logit contributions
 tune+0.680
 lesson+0.654
 rules+0.631
 instructions+0.621
 path+0.601
Neg logit contributions
icked-1.141
apped-1.021
icks-0.980
o-0.884
bug-0.860
Token didn
Token ID1422
Feature activation-5.17e+00

Pos logit contributions
 tune+0.303
 instructions+0.284
 lesson+0.283
But+0.276
 path+0.275
Neg logit contributions
icked-0.345
icks-0.314
apped-0.299
ids-0.256
aves-0.247
Token't
Token ID470
Feature activation-1.27e+00

Pos logit contributions
 tune+0.363
 rightful+0.324
izing+0.323
 path+0.321
 instructions+0.320
Neg logit contributions
icked-0.320
icks-0.294
apped-0.269
ids-0.224
o-0.214
Token know
Token ID760
Feature activation+4.02e+00

Pos logit contributions
 tune+0.076
 instructions+0.071
 lesson+0.071
izing+0.070
 path+0.069
Neg logit contributions
icked-0.090
icks-0.081
apped-0.078
ids-0.066
bug-0.065
Token what
Token ID644
Feature activation+5.98e+00

Pos logit contributions
icked+0.202
icks+0.172
apped+0.162
ids+0.128
bug+0.127
Neg logit contributions
 tune-0.323
 lesson-0.302
 instructions-0.301
 path-0.295
izing-0.294
Token and
Token ID290
Feature activation-2.39e+00

Pos logit contributions
 tune+0.038
 instructions+0.035
 lesson+0.035
izing+0.035
But+0.034
Neg logit contributions
icked-0.044
icks-0.039
apped-0.038
ids-0.032
bug-0.032
Token love
Token ID1842
Feature activation-1.33e+00

Pos logit contributions
 tune+0.122
izing+0.111
 instructions+0.110
 lesson+0.109
 path+0.107
Neg logit contributions
icked-0.187
icks-0.172
apped-0.165
ids-0.142
aves-0.141
Token.
Token ID13
Feature activation-2.27e+00

Pos logit contributions
 tune+0.083
izing+0.076
 instructions+0.075
 lesson+0.075
 path+0.074
Neg logit contributions
icked-0.090
icks-0.082
apped-0.078
o-0.066
ids-0.066
Token
Token ID198
Feature activation-8.94e-01

Pos logit contributions
But+0.125
 lesson+0.119
 tune+0.116
 instructions+0.115
 rules+0.113
Neg logit contributions
icked-0.169
apped-0.142
icks-0.141
igh-0.126
o-0.125
Token
Token ID198
Feature activation-9.17e-01

Pos logit contributions
 tune+0.051
 lesson+0.050
 instructions+0.049
But+0.048
 rules+0.047
Neg logit contributions
icked-0.065
icks-0.057
apped-0.056
ids-0.049
bug-0.048
TokenBut
Token ID1537
Feature activation-1.38e+01

Pos logit contributions
 tune+0.048
 lesson+0.043
 instructions+0.043
 path+0.042
izing+0.042
Neg logit contributions
icked-0.071
icks-0.064
apped-0.064
aves-0.055
o-0.055
Token the
Token ID262
Feature activation-2.81e+00

Pos logit contributions
 tune+0.718
But+0.702
 lesson+0.695
 instructions+0.675
izing+0.653
Neg logit contributions
icked-1.057
icks-0.981
apped-0.921
aves-0.816
ids-0.805
Token big
Token ID1263
Feature activation-6.93e-01

Pos logit contributions
 tune+0.119
izing+0.109
 instructions+0.107
 lesson+0.106
But+0.102
Neg logit contributions
icked-0.240
icks-0.223
apped-0.220
ids-0.191
bug-0.189
Token boy
Token ID2933
Feature activation+1.81e+00

Pos logit contributions
 tune+0.027
 lesson+0.025
 path+0.025
 instructions+0.024
 rules+0.024
Neg logit contributions
icked-0.065
icks-0.059
apped-0.057
ids-0.051
o-0.051
Token does
Token ID857
Feature activation-2.88e+00

Pos logit contributions
icked+0.124
apped+0.110
icks+0.109
ids+0.091
aves+0.087
Neg logit contributions
 tune-0.106
 lesson-0.103
 instructions-0.101
But-0.100
 rules-0.100
Token not
Token ID407
Feature activation-3.24e+00

Pos logit contributions
 tune+0.187
izing+0.173
 path+0.173
 instructions+0.169
 environment+0.168
Neg logit contributions
icked-0.184
icks-0.177
apped-0.156
ids-0.136
aves-0.132
Token and
Token ID290
Feature activation-2.39e+00

Pos logit contributions
 tune+0.200
 lesson+0.190
 instructions+0.189
 path+0.185
izing+0.185
Neg logit contributions
icked-0.191
icks-0.171
apped-0.163
o-0.134
ids-0.133
Token messy
Token ID24097
Feature activation-2.77e+00

Pos logit contributions
 tune+0.121
 instructions+0.112
izing+0.110
 lesson+0.110
But+0.107
Neg logit contributions
icked-0.187
icks-0.172
apped-0.164
o-0.145
bug-0.143
Token!"
Token ID2474
Feature activation+4.34e+00

Pos logit contributions
 tune+0.183
 lesson+0.174
 instructions+0.172
izing+0.169
 path+0.168
Neg logit contributions
icked-0.175
icks-0.156
apped-0.150
ids-0.123
o-0.122
Token
Token ID198
Feature activation-9.66e-01

Pos logit contributions
icked+0.268
icks+0.239
apped+0.232
ids+0.189
o+0.187
Neg logit contributions
 tune-0.297
izing-0.276
 instructions-0.276
 lesson-0.275
 path-0.270
Token
Token ID198
Feature activation-9.66e-01

Pos logit contributions
 tune+0.063
 lesson+0.059
 instructions+0.059
izing+0.059
 path+0.058
Neg logit contributions
icked-0.062
icks-0.055
apped-0.054
ids-0.044
o-0.044
TokenBut
Token ID1537
Feature activation-1.37e+01

Pos logit contributions
 tune+0.052
 lesson+0.050
izing+0.047
 instructions+0.047
 path+0.046
Neg logit contributions
icked-0.070
icks-0.064
apped-0.063
o-0.057
bug-0.055
Token the
Token ID262
Feature activation-2.80e+00

Pos logit contributions
 tune+0.845
But+0.836
 lesson+0.813
 instructions+0.793
 path+0.772
Neg logit contributions
icked-0.935
icks-0.864
apped-0.795
aves-0.701
ids-0.674
Token dog
Token ID3290
Feature activation-1.73e+00

Pos logit contributions
 tune+0.156
izing+0.154
 instructions+0.146
away+0.145
 lesson+0.142
Neg logit contributions
icked-0.193
icks-0.179
apped-0.176
o-0.147
bug-0.145
Token does
Token ID857
Feature activation-2.85e+00

Pos logit contributions
 tune+0.118
 instructions+0.111
 lesson+0.110
izing+0.109
 path+0.107
Neg logit contributions
icked-0.108
icks-0.096
apped-0.091
bug-0.075
ids-0.075
Token not
Token ID407
Feature activation-3.14e+00

Pos logit contributions
 tune+0.202
 rightful+0.200
 environment+0.198
 path+0.197
 surroundings+0.193
Neg logit contributions
icked-0.159
icks-0.151
apped-0.127
ow-0.106
o-0.106
Token stop
Token ID2245
Feature activation-4.50e+00

Pos logit contributions
 tune+0.186
 instructions+0.174
 lesson+0.174
izing+0.172
 path+0.168
Neg logit contributions
icked-0.222
icks-0.200
apped-0.194
ids-0.164
bug-0.162
Token fly
Token ID6129
Feature activation-4.35e-01

Pos logit contributions
 tune+0.134
 lesson+0.125
 instructions+0.125
izing+0.124
But+0.119
Neg logit contributions
icked-0.195
icks-0.178
apped-0.172
ids-0.148
bug-0.148
Token away
Token ID1497
Feature activation-6.59e+00

Pos logit contributions
 tune+0.026
 instructions+0.025
 lesson+0.025
izing+0.024
 path+0.024
Neg logit contributions
icked-0.030
icks-0.027
apped-0.026
ids-0.022
bug-0.022
Token."
Token ID526
Feature activation-1.24e+00

Pos logit contributions
 tune+0.389
 lesson+0.364
 instructions+0.364
izing+0.357
 path+0.352
Neg logit contributions
icked-0.467
icks-0.419
apped-0.408
ids-0.345
bug-0.340
Token
Token ID198
Feature activation-8.30e-01

Pos logit contributions
 tune+0.078
 lesson+0.074
 instructions+0.073
But+0.072
izing+0.071
Neg logit contributions
icked-0.084
icks-0.075
apped-0.073
ids-0.061
bug-0.060
Token
Token ID198
Feature activation-8.36e-01

Pos logit contributions
 tune+0.055
 instructions+0.051
 lesson+0.051
izing+0.050
 path+0.050
Neg logit contributions
icked-0.053
icks-0.048
apped-0.046
ids-0.038
o-0.038
TokenBut
Token ID1537
Feature activation-1.37e+01

Pos logit contributions
 tune+0.050
 lesson+0.046
izing+0.045
 instructions+0.045
 instincts+0.044
Neg logit contributions
icked-0.057
icks-0.052
apped-0.051
o-0.045
bug-0.044
Token Anna
Token ID11735
Feature activation+5.31e+00

Pos logit contributions
 tune+0.873
But+0.853
 lesson+0.851
 instructions+0.825
 rules+0.798
Neg logit contributions
icked-0.901
icks-0.824
apped-0.764
aves-0.661
ids-0.644
Token does
Token ID857
Feature activation-2.88e+00

Pos logit contributions
icked+0.343
icks+0.304
apped+0.297
ids+0.245
bug+0.240
Neg logit contributions
 tune-0.345
 lesson-0.326
 instructions-0.326
izing-0.319
 path-0.316
Token not
Token ID407
Feature activation-3.12e+00

Pos logit contributions
 tune+0.206
 path+0.192
 environment+0.190
 rightful+0.188
 instructions+0.185
Neg logit contributions
icked-0.172
icks-0.163
apped-0.136
ids-0.124
ow-0.124
Token listen
Token ID6004
Feature activation-3.00e+00

Pos logit contributions
 tune+0.167
izing+0.161
 instructions+0.157
 lesson+0.154
 path+0.150
Neg logit contributions
icked-0.237
icks-0.219
apped-0.205
ids-0.182
o-0.180
Token.
Token ID13
Feature activation-2.22e+00

Pos logit contributions
 tune+0.231
 rightful+0.229
 environment+0.222
izing+0.221
 path+0.219
Neg logit contributions
icked-0.141
icks-0.122
apped-0.110
o-0.098
es-0.098
Token,
Token ID11
Feature activation+3.25e+00

Pos logit contributions
 tune+0.170
 instructions+0.163
 lesson+0.160
izing+0.159
 rules+0.159
Neg logit contributions
icked-0.153
icks-0.133
apped-0.131
ids-0.106
bug-0.105
Token move
Token ID1445
Feature activation-2.50e+00

Pos logit contributions
icked+0.244
apped+0.216
icks+0.216
bug+0.180
ids+0.178
Neg logit contributions
 tune-0.178
 instructions-0.170
 lesson-0.165
 rules-0.164
 path-0.164
Token!"
Token ID2474
Feature activation+4.49e+00

Pos logit contributions
 tune+0.172
 instructions+0.165
 lesson+0.163
izing+0.162
 rules+0.161
Neg logit contributions
icked-0.153
icks-0.133
apped-0.131
ids-0.106
bug-0.105
Token
Token ID198
Feature activation-8.06e-01

Pos logit contributions
icked+0.278
icks+0.245
apped+0.236
ids+0.193
bug+0.192
Neg logit contributions
 tune-0.309
 lesson-0.292
 instructions-0.292
izing-0.286
But-0.285
Token
Token ID198
Feature activation-8.21e-01

Pos logit contributions
 tune+0.055
izing+0.052
 instructions+0.051
 lesson+0.051
 path+0.050
Neg logit contributions
icked-0.050
icks-0.045
apped-0.043
o-0.035
ids-0.035
TokenBut
Token ID1537
Feature activation-1.37e+01

Pos logit contributions
 tune+0.041
izing+0.038
 lesson+0.037
 instructions+0.036
 path+0.035
Neg logit contributions
icked-0.064
icks-0.059
apped-0.059
o-0.053
aves-0.051
Token the
Token ID262
Feature activation-2.74e+00

Pos logit contributions
 tune+0.746
But+0.720
 instructions+0.679
 lesson+0.678
 path+0.666
Neg logit contributions
icked-1.033
icks-0.965
apped-0.883
aves-0.780
ids-0.771
Token cows
Token ID22575
Feature activation+1.14e+00

Pos logit contributions
izing+0.120
 tune+0.115
away+0.107
 instructions+0.102
But+0.102
Neg logit contributions
icked-0.217
icks-0.206
apped-0.202
ids-0.176
o-0.173
Token do
Token ID466
Feature activation-1.65e+00

Pos logit contributions
icked+0.068
icks+0.059
apped+0.058
ids+0.047
bug+0.046
Neg logit contributions
 tune-0.080
 instructions-0.076
 lesson-0.076
izing-0.074
 path-0.074
Token not
Token ID407
Feature activation-3.19e+00

Pos logit contributions
 tune+0.093
 instructions+0.085
izing+0.085
 path+0.084
 rightful+0.081
Neg logit contributions
icked-0.119
icks-0.110
apped-0.106
ids-0.090
o-0.089
Token move
Token ID1445
Feature activation-2.42e+00

Pos logit contributions
 tune+0.205
 instructions+0.194
 lesson+0.194
izing+0.193
 path+0.188
Neg logit contributions
icked-0.208
icks-0.186
apped-0.179
ids-0.150
bug-0.148
Token says
Token ID1139
Feature activation+2.45e+00

Pos logit contributions
icked+0.273
apped+0.245
icks+0.237
ids+0.200
bug+0.191
Neg logit contributions
 instructions-0.271
 tune-0.268
 rules-0.262
 lesson-0.262
But-0.258
Token.
Token ID13
Feature activation-2.46e+00

Pos logit contributions
icked+0.198
icks+0.180
apped+0.175
ids+0.153
bug+0.152
Neg logit contributions
 tune-0.118
 instructions-0.112
 lesson-0.107
 rules-0.104
But-0.103
Token
Token ID198
Feature activation-1.01e+00

Pos logit contributions
 tune+0.153
 lesson+0.144
 instructions+0.144
izing+0.140
But+0.139
Neg logit contributions
icked-0.167
icks-0.149
apped-0.145
ids-0.121
bug-0.120
Token
Token ID198
Feature activation-1.01e+00

Pos logit contributions
 tune+0.066
 instructions+0.062
 lesson+0.062
izing+0.061
 path+0.060
Neg logit contributions
icked-0.066
icks-0.059
apped-0.057
ids-0.047
o-0.046
Token"
Token ID1
Feature activation-1.41e+00

Pos logit contributions
 tune+0.058
izing+0.052
 lesson+0.052
 instructions+0.052
 path+0.050
Neg logit contributions
icked-0.075
icks-0.067
apped-0.067
o-0.059
aves-0.057
TokenBut
Token ID1537
Feature activation-1.37e+01

Pos logit contributions
 decision+0.074
 instincts+0.074
 path+0.070
 tune+0.069
 experience+0.067
Neg logit contributions
icked-0.096
apped-0.091
bug-0.079
icks-0.078
bugs-0.076
Token I
Token ID314
Feature activation+1.80e+00

Pos logit contributions
 tune+0.776
 instructions+0.757
 lesson+0.746
But+0.739
izing+0.730
Neg logit contributions
icked-0.971
icks-0.879
apped-0.869
ids-0.720
o-0.711
Token don
Token ID836
Feature activation-4.48e+00

Pos logit contributions
icked+0.122
apped+0.111
icks+0.107
ids+0.085
bug+0.084
Neg logit contributions
 lesson-0.111
But-0.108
 tune-0.106
 instructions-0.100
 mistake-0.100
Token't
Token ID470
Feature activation-1.71e+00

Pos logit contributions
 tune+0.234
 lesson+0.225
 instructions+0.224
izing+0.219
But+0.218
Neg logit contributions
icked-0.341
icks-0.304
apped-0.303
ids-0.260
bug-0.259
Token like
Token ID588
Feature activation+2.95e+00

Pos logit contributions
 tune+0.107
 lesson+0.104
But+0.101
 rules+0.100
 instructions+0.099
Neg logit contributions
icked-0.117
apped-0.108
icks-0.100
es-0.083
ids-0.083
Token jam
Token ID16853
Feature activation+1.32e+00

Pos logit contributions
icked+0.223
icks+0.207
apped+0.187
bug+0.171
ids+0.167
Neg logit contributions
 tune-0.162
 lesson-0.146
 path-0.145
izing-0.144
 instructions-0.142
Token,"
Token ID553
Feature activation+8.09e+00

Pos logit contributions
icked+0.118
icks+0.105
apped+0.101
ids+0.084
bug+0.083
Neg logit contributions
 tune-0.120
 lesson-0.113
 instructions-0.113
izing-0.111
 path-0.109
Token said
Token ID531
Feature activation+2.75e-01

Pos logit contributions
icked+0.539
icks+0.478
apped+0.470
bug+0.392
ids+0.386
Neg logit contributions
 tune-0.510
 lesson-0.477
 instructions-0.475
izing-0.474
 path-0.464
Token Jack
Token ID3619
Feature activation+5.65e+00

Pos logit contributions
icked+0.021
icks+0.019
apped+0.018
ids+0.016
bug+0.015
Neg logit contributions
 tune-0.015
 instructions-0.014
 lesson-0.014
izing-0.014
But-0.013
Token.
Token ID13
Feature activation-2.10e+00

Pos logit contributions
icked+0.430
icks+0.395
apped+0.382
ids+0.329
aves+0.321
Neg logit contributions
 tune-0.302
 instructions-0.284
 lesson-0.272
izing-0.264
 path-0.263
Token "
Token ID366
Feature activation-4.18e+00

Pos logit contributions
 tune+0.124
 instructions+0.118
 lesson+0.117
But+0.117
 rules+0.113
Neg logit contributions
icked-0.147
icks-0.132
apped-0.127
ids-0.108
aves-0.106
TokenBut
Token ID1537
Feature activation-1.37e+01

Pos logit contributions
 decision+0.192
 mistake+0.189
 instincts+0.189
izing+0.183
 lesson+0.182
Neg logit contributions
icked-0.320
apped-0.308
icks-0.286
bug-0.281
o-0.267
Token can
Token ID460
Feature activation+1.01e-01

Pos logit contributions
 tune+0.800
 instructions+0.751
 lesson+0.749
izing+0.734
 path+0.725
Neg logit contributions
icked-0.981
icks-0.883
apped-0.860
ids-0.726
bug-0.714
Token you
Token ID345
Feature activation-2.27e+00

Pos logit contributions
icked+0.007
icks+0.006
apped+0.006
ids+0.005
bug+0.005
Neg logit contributions
 tune-0.006
 lesson-0.006
 instructions-0.006
izing-0.006
 path-0.006
Token please
Token ID3387
Feature activation-1.92e+00

Pos logit contributions
 tune+0.121
 lesson+0.113
 instructions+0.113
izing+0.111
 rules+0.108
Neg logit contributions
icked-0.173
icks-0.157
apped-0.153
ids-0.131
bug-0.130
Token turn
Token ID1210
Feature activation-2.23e+00

Pos logit contributions
 tune+0.105
 lesson+0.098
 instructions+0.097
izing+0.095
 path+0.094
Neg logit contributions
icked-0.145
icks-0.131
apped-0.128
ids-0.109
bug-0.108
Token around
Token ID1088
Feature activation+1.67e+00

Pos logit contributions
 tune+0.144
 instructions+0.136
 lesson+0.135
izing+0.133
 path+0.131
Neg logit contributions
icked-0.146
icks-0.129
apped-0.126
ids-0.104
bug-0.103
Token€
Token ID26391
Feature activation-5.07e+00

Pos logit contributions
 tune+0.361
 instructions+0.339
 lesson+0.338
 rules+0.322
izing+0.322
Neg logit contributions
icked-0.642
icks-0.587
apped-0.580
o-0.511
bug-0.506
Token.
Token ID13
Feature activation-1.77e+00

Pos logit contributions
But+0.268
 tune+0.255
 path+0.245
 man+0.237
 instructions+0.236
Neg logit contributions
icked-0.390
icks-0.342
apped-0.327
bug-0.274
ids-0.264
Token
Token ID198
Feature activation-3.16e-01

Pos logit contributions
But+0.108
 tune+0.103
 instructions+0.100
 rules+0.095
 path+0.094
Neg logit contributions
icked-0.123
icks-0.108
apped-0.103
ids-0.082
aves-0.082
Token
Token ID198
Feature activation-3.90e-01

Pos logit contributions
 tune+0.020
 lesson+0.019
 instructions+0.019
izing+0.018
But+0.018
Neg logit contributions
icked-0.021
icks-0.019
apped-0.019
ids-0.015
bug-0.015
TokenS
Token ID50
Feature activation-4.81e+00

Pos logit contributions
 tune+0.018
 lesson+0.017
 instructions+0.017
izing+0.016
 path+0.016
Neg logit contributions
icked-0.032
icks-0.029
apped-0.029
o-0.026
bug-0.025
Tokenaman
Token ID10546
Feature activation-1.37e+01

Pos logit contributions
 tune+0.257
 instructions+0.242
 lesson+0.242
izing+0.240
 rules+0.233
Neg logit contributions
icked-0.364
icks-0.332
apped-0.321
ids-0.275
aves-0.266
Tokentha
Token ID12898
Feature activation-4.82e+00

Pos logit contributions
 tune+0.624
 lesson+0.585
 instructions+0.568
 rules+0.553
izing+0.547
Neg logit contributions
icked-1.163
icks-1.044
apped-1.041
o-0.910
bug-0.908
Token was
Token ID373
Feature activation-3.12e+00

Pos logit contributions
 tune+0.276
 instructions+0.259
 lesson+0.257
 path+0.255
But+0.253
Neg logit contributions
icked-0.340
icks-0.310
apped-0.300
ids-0.255
aves-0.246
Token amazed
Token ID24562
Feature activation+4.38e+00

Pos logit contributions
 tune+0.190
But+0.178
 lesson+0.174
 instructions+0.171
 path+0.170
Neg logit contributions
icked-0.215
icks-0.190
apped-0.182
aves-0.156
ids-0.156
Token and
Token ID290
Feature activation-1.69e+00

Pos logit contributions
icked+0.247
apped+0.222
icks+0.203
o+0.185
bug+0.184
Neg logit contributions
 lesson-0.296
 tune-0.295
 instructions-0.284
 path-0.280
 rules-0.275
Token she
Token ID673
Feature activation-3.83e+00

Pos logit contributions
 tune+0.106
 instructions+0.102
 lesson+0.101
izing+0.100
 rules+0.100
Neg logit contributions
icked-0.109
apped-0.096
icks-0.093
bug-0.080
o-0.078
Token sting
Token ID24276
Feature activation-5.56e+00

Pos logit contributions
 tune+0.129
izing+0.125
 instructions+0.118
 surroundings+0.117
 instincts+0.116
Neg logit contributions
icked-0.143
icks-0.135
're-0.114
apped-0.113
ow-0.108
Token you
Token ID345
Feature activation-2.16e+00

Pos logit contributions
 tune+0.245
 instructions+0.225
 lesson+0.222
 rules+0.217
But+0.217
Neg logit contributions
icked-0.479
icks-0.436
apped-0.431
ids-0.377
bug-0.367
Token!"
Token ID2474
Feature activation+4.41e+00

Pos logit contributions
 tune+0.142
 lesson+0.134
 instructions+0.134
izing+0.132
 path+0.130
Neg logit contributions
icked-0.138
icks-0.123
apped-0.118
ids-0.098
bug-0.097
Token
Token ID198
Feature activation-9.22e-01

Pos logit contributions
icked+0.260
icks+0.232
apped+0.226
ids+0.182
o+0.179
Neg logit contributions
 tune-0.312
izing-0.298
 instructions-0.289
 lesson-0.287
 path-0.283
Token
Token ID198
Feature activation-9.20e-01

Pos logit contributions
 tune+0.063
izing+0.059
 instructions+0.059
 lesson+0.059
 path+0.057
Neg logit contributions
icked-0.057
icks-0.051
apped-0.049
o-0.040
ids-0.040
TokenBut
Token ID1537
Feature activation-1.37e+01

Pos logit contributions
 tune+0.051
 lesson+0.048
izing+0.047
 instructions+0.046
 rightful+0.044
Neg logit contributions
icked-0.066
apped-0.060
icks-0.059
o-0.053
bug-0.052
Token Anna
Token ID11735
Feature activation+5.21e+00

Pos logit contributions
 tune+0.835
But+0.799
 lesson+0.788
 instructions+0.773
 path+0.755
Neg logit contributions
icked-0.943
icks-0.874
apped-0.802
aves-0.699
ids-0.680
Token does
Token ID857
Feature activation-2.96e+00

Pos logit contributions
icked+0.333
icks+0.295
apped+0.288
ids+0.237
bug+0.233
Neg logit contributions
 tune-0.342
 lesson-0.323
 instructions-0.322
izing-0.316
 path-0.313
Token not
Token ID407
Feature activation-3.17e+00

Pos logit contributions
 tune+0.215
 environment+0.203
 rightful+0.202
 path+0.202
 scenery+0.196
Neg logit contributions
icked-0.164
icks-0.157
apped-0.126
ow-0.124
ids-0.120
Token hear
Token ID3285
Feature activation+3.22e+00

Pos logit contributions
 tune+0.176
izing+0.174
 instructions+0.165
 lesson+0.162
 rightful+0.159
Neg logit contributions
icked-0.234
icks-0.217
apped-0.198
ids-0.178
bug-0.175
Token him
Token ID683
Feature activation-1.91e+00

Pos logit contributions
icked+0.218
icks+0.202
apped+0.186
ids+0.163
ow+0.157
Neg logit contributions
 tune-0.192
izing-0.185
 lesson-0.184
 path-0.181
But-0.176
Token,
Token ID11
Feature activation+3.29e+00

Pos logit contributions
 tune+0.336
izing+0.303
 instructions+0.301
But+0.300
 lesson+0.297
Neg logit contributions
icked-0.507
icks-0.460
apped-0.454
ids-0.388
bug-0.382
Token stop
Token ID2245
Feature activation-4.59e+00

Pos logit contributions
icked+0.241
icks+0.218
apped+0.213
ids+0.180
bug+0.178
Neg logit contributions
 tune-0.187
 lesson-0.175
 instructions-0.174
izing-0.169
 path-0.169
Token!"
Token ID2474
Feature activation+4.46e+00

Pos logit contributions
 tune+0.298
 instructions+0.280
 lesson+0.280
izing+0.275
 path+0.271
Neg logit contributions
icked-0.302
icks-0.268
apped-0.261
ids-0.216
bug-0.213
Token
Token ID198
Feature activation-8.45e-01

Pos logit contributions
icked+0.276
icks+0.245
apped+0.239
ids+0.196
o+0.192
Neg logit contributions
 tune-0.304
izing-0.283
 lesson-0.283
 instructions-0.282
 path-0.275
Token
Token ID198
Feature activation-8.44e-01

Pos logit contributions
 tune+0.058
 lesson+0.054
 instructions+0.054
izing+0.054
 path+0.053
Neg logit contributions
icked-0.052
icks-0.046
apped-0.045
o-0.037
ids-0.036
TokenBut
Token ID1537
Feature activation-1.37e+01

Pos logit contributions
 tune+0.041
 lesson+0.038
izing+0.036
 instructions+0.036
 rightful+0.035
Neg logit contributions
icked-0.068
icks-0.062
apped-0.061
o-0.057
aves-0.054
Token Rex
Token ID17853
Feature activation+7.88e-01

Pos logit contributions
 tune+0.772
But+0.764
 lesson+0.736
 path+0.723
 instructions+0.720
Neg logit contributions
icked-0.984
icks-0.914
apped-0.856
aves-0.749
bug-0.723
Token does
Token ID857
Feature activation-2.83e+00

Pos logit contributions
icked+0.049
icks+0.043
apped+0.041
ids+0.034
o+0.033
Neg logit contributions
 tune-0.054
 instructions-0.051
 lesson-0.051
izing-0.050
 path-0.049
Token not
Token ID407
Feature activation-3.13e+00

Pos logit contributions
 tune+0.207
 rightful+0.202
 environment+0.198
 surroundings+0.192
 path+0.192
Neg logit contributions
icked-0.143
icks-0.140
apped-0.111
ow-0.108
ids-0.099
Token stop
Token ID2245
Feature activation-4.55e+00

Pos logit contributions
 tune+0.176
izing+0.175
 instructions+0.166
 lesson+0.165
 rightful+0.164
Neg logit contributions
icked-0.224
icks-0.207
apped-0.190
ids-0.170
o-0.167
Token.
Token ID13
Feature activation-2.15e+00

Pos logit contributions
 lesson+0.294
 tune+0.285
Years+0.281
izing+0.278
 rightful+0.273
Neg logit contributions
icked-0.284
icks-0.259
apped-0.246
o-0.215
ids-0.203
Token me
Token ID502
Feature activation-5.75e+00

Pos logit contributions
 tune+0.108
 instructions+0.107
 rules+0.103
 lesson+0.102
But+0.096
Neg logit contributions
icked-0.204
apped-0.189
icks-0.183
ids-0.163
bug-0.160
Token away
Token ID1497
Feature activation-6.65e+00

Pos logit contributions
 tune+0.301
 instructions+0.292
 lesson+0.288
 rules+0.286
But+0.273
Neg logit contributions
icked-0.444
apped-0.405
icks-0.394
ids-0.334
bug-0.329
Token."
Token ID526
Feature activation-1.21e+00

Pos logit contributions
 tune+0.391
 lesson+0.370
 instructions+0.370
izing+0.361
 rules+0.361
Neg logit contributions
icked-0.476
icks-0.424
apped-0.419
ids-0.353
bug-0.346
Token
Token ID198
Feature activation-7.88e-01

Pos logit contributions
 tune+0.081
 lesson+0.076
 instructions+0.076
izing+0.076
 path+0.074
Neg logit contributions
icked-0.077
icks-0.068
apped-0.067
o-0.055
ids-0.054
Token
Token ID198
Feature activation-7.85e-01

Pos logit contributions
 tune+0.059
izing+0.055
 instructions+0.054
 lesson+0.054
 path+0.054
Neg logit contributions
icked-0.044
icks-0.040
apped-0.038
o-0.032
bug-0.029
TokenBut
Token ID1537
Feature activation-1.37e+01

Pos logit contributions
 tune+0.041
izing+0.039
 lesson+0.038
 instructions+0.037
 rules+0.036
Neg logit contributions
icked-0.057
apped-0.052
icks-0.052
o-0.052
bug-0.047
Token the
Token ID262
Feature activation-2.80e+00

Pos logit contributions
 tune+1.011
But+1.008
 lesson+0.972
 instructions+0.954
 man+0.944
Neg logit contributions
icked-0.742
icks-0.681
apped-0.601
aves-0.520
ids-0.487
Token officer
Token ID3818
Feature activation+3.46e+00

Pos logit contributions
 tune+0.128
izing+0.121
 instructions+0.116
 lesson+0.114
But+0.113
Neg logit contributions
icked-0.231
icks-0.213
apped-0.209
bug-0.181
ids-0.180
Token did
Token ID750
Feature activation-5.19e-01

Pos logit contributions
icked+0.241
icks+0.225
apped+0.213
ids+0.185
aves+0.178
Neg logit contributions
 tune-0.202
 lesson-0.192
 instructions-0.189
But-0.186
izing-0.184
Token not
Token ID407
Feature activation-3.10e+00

Pos logit contributions
 tune+0.033
 path+0.031
izing+0.030
 instructions+0.030
 environment+0.029
Neg logit contributions
icked-0.035
icks-0.031
apped-0.028
o-0.025
ow-0.025
Token listen
Token ID6004
Feature activation-3.09e+00

Pos logit contributions
 tune+0.170
izing+0.161
 instructions+0.159
 lesson+0.157
 path+0.153
Neg logit contributions
icked-0.232
icks-0.211
apped-0.201
ids-0.173
o-0.173
Token it
Token ID340
Feature activation-2.98e+00

Pos logit contributions
 tune+0.067
 lesson+0.060
izing+0.059
 instructions+0.058
 path+0.058
Neg logit contributions
icked-0.146
icks-0.134
apped-0.130
bug-0.116
o-0.115
Token back
Token ID736
Feature activation-5.16e-01

Pos logit contributions
 tune+0.157
 lesson+0.152
 instructions+0.149
 rules+0.144
 path+0.141
Neg logit contributions
icked-0.228
apped-0.205
icks-0.205
ids-0.175
bug-0.171
Token."
Token ID526
Feature activation-1.24e+00

Pos logit contributions
 tune+0.028
 lesson+0.027
 instructions+0.027
But+0.026
 rules+0.026
Neg logit contributions
icked-0.038
icks-0.035
apped-0.034
ids-0.029
bug-0.028
Token
Token ID198
Feature activation-9.18e-01

Pos logit contributions
 tune+0.078
 instructions+0.074
 lesson+0.073
izing+0.072
But+0.072
Neg logit contributions
icked-0.083
icks-0.074
apped-0.072
ids-0.060
bug-0.059
Token
Token ID198
Feature activation-9.15e-01

Pos logit contributions
 tune+0.067
izing+0.063
 lesson+0.063
 instructions+0.062
 path+0.062
Neg logit contributions
icked-0.052
icks-0.047
apped-0.044
o-0.038
bug-0.035
TokenBut
Token ID1537
Feature activation-1.37e+01

Pos logit contributions
 lesson+0.047
 tune+0.047
 rightful+0.044
 decision+0.044
izing+0.042
Neg logit contributions
icked-0.064
icks-0.061
apped-0.058
o-0.058
bug-0.056
Token the
Token ID262
Feature activation-2.74e+00

Pos logit contributions
 tune+0.876
But+0.843
 instructions+0.830
 lesson+0.823
izing+0.800
Neg logit contributions
icked-0.894
icks-0.817
apped-0.766
aves-0.650
ids-0.633
Token boy
Token ID2933
Feature activation+1.84e+00

Pos logit contributions
 tune+0.120
izing+0.115
 instructions+0.107
But+0.105
 lesson+0.104
Neg logit contributions
icked-0.230
icks-0.213
apped-0.207
bug-0.182
o-0.177
Token did
Token ID750
Feature activation-5.23e-01

Pos logit contributions
icked+0.112
icks+0.100
apped+0.097
ids+0.079
aves+0.076
Neg logit contributions
 tune-0.126
 lesson-0.119
 instructions-0.118
izing-0.116
But-0.116
Token not
Token ID407
Feature activation-3.17e+00

Pos logit contributions
 tune+0.033
 path+0.031
 instructions+0.030
izing+0.029
 environment+0.029
Neg logit contributions
icked-0.036
icks-0.033
apped-0.030
ids-0.026
o-0.026
Token care
Token ID1337
Feature activation-1.45e+00

Pos logit contributions
 tune+0.161
izing+0.150
 instructions+0.150
 lesson+0.148
 path+0.144
Neg logit contributions
icked-0.251
icks-0.231
apped-0.221
ids-0.193
bug-0.191
Token said
Token ID531
Feature activation+3.48e-01

Pos logit contributions
icked+0.344
icks+0.317
apped+0.304
ids+0.256
igh+0.246
Neg logit contributions
 tune-0.329
 lesson-0.311
 instructions-0.307
But-0.304
 path-0.300
Token,
Token ID11
Feature activation+3.31e+00

Pos logit contributions
icked+0.024
icks+0.021
apped+0.021
bug+0.019
aves+0.018
Neg logit contributions
 tune-0.019
 instructions-0.019
 lesson-0.018
 rules-0.018
 instincts-0.017
Token nodding
Token ID47583
Feature activation-3.52e-02

Pos logit contributions
icked+0.224
icks+0.201
apped+0.197
bug+0.165
ids+0.162
Neg logit contributions
 tune-0.199
 lesson-0.199
 instructions-0.191
izing-0.190
 rules-0.189
Token.
Token ID13
Feature activation-2.15e+00

Pos logit contributions
 tune+0.002
 lesson+0.002
izing+0.002
 instructions+0.002
 path+0.002
Neg logit contributions
icked-0.002
icks-0.002
apped-0.002
o-0.002
ids-0.002
Token "
Token ID366
Feature activation-4.23e+00

Pos logit contributions
But+0.109
 tune+0.106
 instructions+0.104
 lesson+0.102
 path+0.099
Neg logit contributions
icked-0.164
icks-0.145
apped-0.141
igh-0.127
bug-0.126
TokenBut
Token ID1537
Feature activation-1.37e+01

Pos logit contributions
 instincts+0.202
 decision+0.201
 mistake+0.200
izing+0.196
 rightful+0.189
Neg logit contributions
icked-0.303
apped-0.301
icks-0.277
o-0.259
bug-0.253
Token can
Token ID460
Feature activation+1.22e-01

Pos logit contributions
 tune+0.826
 lesson+0.774
 instructions+0.774
izing+0.758
 path+0.748
Neg logit contributions
icked-0.957
icks-0.858
apped-0.835
ids-0.702
bug-0.693
Token we
Token ID356
Feature activation+1.18e+00

Pos logit contributions
icked+0.008
icks+0.007
apped+0.007
ids+0.006
bug+0.006
Neg logit contributions
 tune-0.008
 lesson-0.007
 instructions-0.007
izing-0.007
 path-0.007
Token come
Token ID1282
Feature activation+9.28e-01

Pos logit contributions
icked+0.087
icks+0.078
apped+0.076
ids+0.065
bug+0.064
Neg logit contributions
 tune-0.067
 lesson-0.063
 instructions-0.063
 path-0.061
izing-0.061
Token back
Token ID736
Feature activation-4.87e-01

Pos logit contributions
icked+0.076
icks+0.069
apped+0.066
ids+0.058
aves+0.056
Neg logit contributions
 tune-0.045
But-0.044
away-0.041
 path-0.041
 lesson-0.040
Token to
Token ID284
Feature activation-2.61e+00

Pos logit contributions
 tune+0.026
But+0.025
 lesson+0.025
 instructions+0.024
 path+0.024
Neg logit contributions
icked-0.037
icks-0.034
apped-0.033
ids-0.028
aves-0.027
Token his
Token ID465
Feature activation-6.78e+00

Pos logit contributions
icked+0.220
icks+0.194
apped+0.187
o+0.156
ids+0.155
Neg logit contributions
 tune-0.205
But-0.189
izing-0.188
 instructions-0.186
 lesson-0.186
Token toy
Token ID13373
Feature activation-4.44e+00

Pos logit contributions
 tune+0.349
izing+0.345
But+0.334
 instructions+0.334
Years+0.322
Neg logit contributions
icked-0.500
apped-0.462
icks-0.458
o-0.392
ids-0.381
Token boat
Token ID8848
Feature activation-1.93e+00

Pos logit contributions
 tune+0.233
 instructions+0.216
 lesson+0.213
izing+0.210
But+0.206
Neg logit contributions
icked-0.346
icks-0.313
apped-0.308
o-0.263
ids-0.261
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
 tune+0.112
 instructions+0.105
 lesson+0.105
izing+0.102
But+0.101
Neg logit contributions
icked-0.138
icks-0.124
apped-0.120
ids-0.102
o-0.101
Token Tim
Token ID5045
Feature activation+2.69e+00

Pos logit contributions
 tune+0.094
 lesson+0.089
 instructions+0.089
izing+0.087
But+0.086
Neg logit contributions
icked-0.083
icks-0.073
apped-0.070
ids-0.058
bug-0.057
Tokenmy
Token ID1820
Feature activation+9.11e+00

Pos logit contributions
icked+0.131
icks+0.112
apped+0.107
ids+0.083
bug+0.077
Neg logit contributions
 tune-0.220
 lesson-0.210
 instructions-0.209
 path-0.206
izing-0.205
Token wanted
Token ID2227
Feature activation-1.88e+00

Pos logit contributions
icked+0.625
apped+0.551
icks+0.547
o+0.465
bug+0.455
Neg logit contributions
 tune-0.552
 instructions-0.526
 lesson-0.518
izing-0.514
 rules-0.510
Token to
Token ID284
Feature activation-2.04e+00

Pos logit contributions
 tune+0.125
izing+0.123
 instructions+0.113
 rightful+0.112
 lesson+0.111
Neg logit contributions
icked-0.120
icks-0.108
apped-0.099
o-0.092
bug-0.085
Token remove
Token ID4781
Feature activation-4.82e+00

Pos logit contributions
izing+0.139
 tune+0.135
 instructions+0.128
 instincts+0.125
 rules+0.124
Neg logit contributions
icked-0.120
icks-0.114
apped-0.101
're-0.092
o-0.087
Token the
Token ID262
Feature activation-2.14e+00

Pos logit contributions
 tune+0.257
izing+0.228
 lesson+0.228
 instructions+0.227
Years+0.222
Neg logit contributions
icked-0.376
icks-0.343
apped-0.326
ids-0.284
o-0.277
Token raft
Token ID31812
Feature activation-3.94e+00

Pos logit contributions
 tune+0.123
 rightful+0.113
izing+0.112
Years+0.111
But+0.108
Neg logit contributions
icked-0.142
icks-0.131
apped-0.123
're-0.113
aves-0.104
TokenI
Token ID40
Feature activation-3.31e+00

Pos logit contributions
 path+0.059
 instincts+0.059
 decision+0.059
 instructions+0.056
 gut+0.056
Neg logit contributions
icked-0.089
apped-0.086
icks-0.077
bug-0.075
bugs-0.071
Token will
Token ID481
Feature activation-1.98e+00

Pos logit contributions
 lesson+0.190
 tune+0.182
But+0.179
 instructions+0.169
 trick+0.167
Neg logit contributions
icked-0.242
icks-0.217
apped-0.216
ids-0.179
bug-0.174
Token not
Token ID407
Feature activation-3.09e+00

Pos logit contributions
 tune+0.113
izing+0.108
 instructions+0.106
 lesson+0.105
 rules+0.102
Neg logit contributions
icked-0.142
icks-0.132
apped-0.122
ids-0.106
aves-0.104
Token,
Token ID11
Feature activation+3.36e+00

Pos logit contributions
 tune+0.167
 lesson+0.153
 instructions+0.152
But+0.149
izing+0.148
Neg logit contributions
icked-0.240
icks-0.215
apped-0.215
ids-0.182
bug-0.177
Token Lily
Token ID20037
Feature activation+4.03e+00

Pos logit contributions
icked+0.182
icks+0.158
apped+0.152
ids+0.121
bug+0.120
Neg logit contributions
 tune-0.254
 instructions-0.241
 lesson-0.240
izing-0.239
 path-0.235
Token,"
Token ID553
Feature activation+8.45e+00

Pos logit contributions
icked+0.249
icks+0.215
apped+0.215
ids+0.174
aves+0.164
Neg logit contributions
 tune-0.275
 lesson-0.264
 instructions-0.263
 rules-0.259
But-0.258
Token Ben
Token ID3932
Feature activation+4.58e+00

Pos logit contributions
icked+0.485
icks+0.413
apped+0.406
ids+0.334
aves+0.316
Neg logit contributions
 tune-0.602
But-0.593
 instructions-0.592
 lesson-0.590
 rules-0.580
Token says
Token ID1139
Feature activation+2.74e+00

Pos logit contributions
icked+0.303
apped+0.269
icks+0.254
ids+0.225
o+0.214
Neg logit contributions
 instructions-0.286
 lesson-0.283
 rules-0.280
 tune-0.279
 man-0.272
Token.
Token ID13
Feature activation-2.13e+00

Pos logit contributions
icked+0.227
icks+0.203
apped+0.202
igh+0.181
o+0.179
Neg logit contributions
 tune-0.121
 instructions-0.120
 lesson-0.114
 rules-0.113
But-0.108
Token "
Token ID366
Feature activation-4.02e+00

Pos logit contributions
 tune+0.131
 lesson+0.121
 instructions+0.121
izing+0.120
 path+0.118
Neg logit contributions
icked-0.149
icks-0.133
apped-0.130
ids-0.108
o-0.107
TokenI
Token ID40
Feature activation-3.27e+00

Pos logit contributions
 instincts+0.205
 decision+0.201
 rightful+0.196
 path+0.195
 avail+0.194
Neg logit contributions
icked-0.289
apped-0.275
icks-0.261
bug-0.245
o-0.240
Token day
Token ID1110
Feature activation-4.59e+00

Pos logit contributions
icked+0.297
apped+0.278
icks+0.272
o+0.261
bug+0.257
Neg logit contributions
 instructions-0.161
izing-0.156
 tune-0.154
 rules-0.144
 instincts-0.144
Token,
Token ID11
Feature activation+4.33e+00

Pos logit contributions
But+0.243
 lesson+0.241
 man+0.233
 tune+0.227
 thing+0.224
Neg logit contributions
icked-0.332
icks-0.315
apped-0.281
bug-0.277
ids-0.266
Token he
Token ID339
Feature activation-2.69e+00

Pos logit contributions
icked+0.334
icks+0.303
apped+0.296
ids+0.254
bug+0.251
Neg logit contributions
 tune-0.230
 instructions-0.214
 lesson-0.213
izing-0.209
 path-0.205
Token asked
Token ID1965
Feature activation-1.09e+00

Pos logit contributions
 tune+0.173
 lesson+0.166
But+0.164
 path+0.163
 instructions+0.158
Neg logit contributions
icked-0.176
icks-0.161
apped-0.151
ids-0.130
aves-0.121
Token his
Token ID465
Feature activation-6.48e+00

Pos logit contributions
 tune+0.035
 lesson+0.035
 instructions+0.032
 rules+0.031
But+0.030
Neg logit contributions
icked-0.105
icks-0.097
apped-0.096
ids-0.087
bug-0.087
Token three
Token ID1115
Feature activation+9.21e+00

Pos logit contributions
 tune+0.419
izing+0.396
 instructions+0.384
But+0.380
 lesson+0.379
Neg logit contributions
icked-0.427
icks-0.382
apped-0.369
o-0.306
bug-0.304
Token-
Token ID12
Feature activation-3.13e+00

Pos logit contributions
icked+0.807
icks+0.740
apped+0.724
ids+0.637
bug+0.628
Neg logit contributions
 tune-0.392
 lesson-0.360
 instructions-0.357
izing-0.344
 path-0.342
Tokenyear
Token ID1941
Feature activation+3.43e+00

Pos logit contributions
 tune+0.239
 lesson+0.231
 instructions+0.229
But+0.225
 rules+0.223
Neg logit contributions
icked-0.162
icks-0.143
apped-0.137
ids-0.108
aves-0.102
Token-
Token ID12
Feature activation-3.12e+00

Pos logit contributions
icked+0.259
icks+0.238
apped+0.227
o+0.194
ids+0.187
Neg logit contributions
 tune-0.187
But-0.174
izing-0.174
 instructions-0.169
 rightful-0.161
Tokenold
Token ID727
Feature activation+5.07e+00

Pos logit contributions
 tune+0.222
 lesson+0.215
 instructions+0.212
 rules+0.207
But+0.206
Neg logit contributions
icked-0.181
icks-0.159
apped-0.154
ids-0.125
bug-0.119
Token neighbor
Token ID4780
Feature activation-2.32e+00

Pos logit contributions
icked+0.379
icks+0.344
apped+0.331
ids+0.287
aves+0.276
Neg logit contributions
 tune-0.282
 instructions-0.272
 lesson-0.269
 rules-0.261
 instincts-0.254
Token"
Token ID1
Feature activation-8.51e-01

Pos logit contributions
 tune+0.018
 lesson+0.017
 instructions+0.016
izing+0.016
 instincts+0.016
Neg logit contributions
icked-0.018
icks-0.016
apped-0.016
o-0.014
bug-0.014
TokenThat
Token ID2504
Feature activation-7.06e+00

Pos logit contributions
 tune+0.043
izing+0.041
 instincts+0.041
 decision+0.041
 path+0.040
Neg logit contributions
icked-0.065
apped-0.059
icks-0.057
bug-0.053
o-0.052
Token dog
Token ID3290
Feature activation-1.28e+00

Pos logit contributions
 tune+0.467
izing+0.461
 instructions+0.447
away+0.438
Years+0.433
Neg logit contributions
icked-0.432
icks-0.394
apped-0.366
o-0.342
bug-0.311
Token was
Token ID373
Feature activation-3.12e+00

Pos logit contributions
 tune+0.066
 lesson+0.066
 sound+0.062
 path+0.061
 instructions+0.061
Neg logit contributions
icked-0.094
apped-0.087
icks-0.085
ids-0.076
aves-0.069
Token mean
Token ID1612
Feature activation+1.61e+00

Pos logit contributions
 tune+0.166
 instructions+0.154
 lesson+0.153
izing+0.151
 path+0.148
Neg logit contributions
icked-0.242
icks-0.220
apped-0.211
ids-0.182
o-0.182
Token,"
Token ID553
Feature activation+8.88e+00

Pos logit contributions
icked+0.118
apped+0.108
icks+0.103
ids+0.094
bugs+0.088
Neg logit contributions
 rules-0.090
 lesson-0.089
 instructions-0.086
 tune-0.086
 man-0.082
Token Tom
Token ID4186
Feature activation+4.79e+00

Pos logit contributions
icked+0.385
icks+0.318
apped+0.303
ids+0.217
o+0.208
Neg logit contributions
 tune-0.776
 lesson-0.749
 instructions-0.746
But-0.736
izing-0.734
Token says
Token ID1139
Feature activation+3.13e+00

Pos logit contributions
icked+0.274
apped+0.237
icks+0.235
ids+0.188
bug+0.180
Neg logit contributions
 tune-0.344
 instructions-0.333
 lesson-0.333
 rules-0.326
But-0.323
Token.
Token ID13
Feature activation-1.57e+00

Pos logit contributions
icked+0.259
icks+0.234
apped+0.232
igh+0.213
o+0.208
Neg logit contributions
 tune-0.133
 instructions-0.132
 lesson-0.127
 rules-0.123
But-0.117
Token
Token ID198
Feature activation-2.29e-01

Pos logit contributions
 tune+0.106
izing+0.100
 instructions+0.099
 lesson+0.099
 path+0.096
Neg logit contributions
icked-0.098
icks-0.087
apped-0.085
o-0.069
ids-0.069
Token
Token ID198
Feature activation-2.18e-01

Pos logit contributions
 tune+0.016
 instructions+0.015
 lesson+0.015
izing+0.015
 path+0.014
Neg logit contributions
icked-0.014
icks-0.013
apped-0.012
o-0.010
ids-0.010
Token threat
Token ID2372
Feature activation-5.99e+00

Pos logit contributions
icked+0.488
icks+0.479
apped+0.441
bug+0.411
aves+0.400
Neg logit contributions
izing-0.321
 tune-0.316
But-0.295
 instructions-0.287
 lesson-0.280
Token!"
Token ID2474
Feature activation+4.60e+00

Pos logit contributions
 tune+0.369
 instructions+0.346
 lesson+0.345
izing+0.336
 path+0.336
Neg logit contributions
icked-0.410
icks-0.364
apped-0.356
ids-0.297
bug-0.292
Token
Token ID198
Feature activation-5.37e-01

Pos logit contributions
icked+0.282
icks+0.249
apped+0.242
ids+0.197
bug+0.195
Neg logit contributions
 tune-0.316
 lesson-0.298
 instructions-0.297
izing-0.294
 path-0.290
Token
Token ID198
Feature activation-5.90e-01

Pos logit contributions
 tune+0.040
izing+0.037
 lesson+0.037
 instructions+0.036
 path+0.036
Neg logit contributions
icked-0.030
icks-0.027
apped-0.026
o-0.022
bug-0.021
TokenMom
Token ID29252
Feature activation-1.29e+00

Pos logit contributions
 tune+0.031
 lesson+0.030
izing+0.030
 decision+0.029
 instincts+0.029
Neg logit contributions
icked-0.040
apped-0.036
icks-0.036
o-0.035
bug-0.034
Tokenmy
Token ID1820
Feature activation+8.55e+00

Pos logit contributions
 instructions+0.073
 tune+0.073
 man+0.071
But+0.069
 sound+0.068
Neg logit contributions
icked-0.087
icks-0.083
apped-0.076
ids-0.069
aves-0.062
Token was
Token ID373
Feature activation-3.35e+00

Pos logit contributions
icked+0.599
icks+0.559
apped+0.524
ids+0.458
aves+0.430
Neg logit contributions
 tune-0.503
 instructions-0.481
But-0.468
 man-0.460
 lesson-0.459
Token very
Token ID845
Feature activation+1.73e+00

Pos logit contributions
 tune+0.187
 instructions+0.175
But+0.175
 lesson+0.171
 rules+0.167
Neg logit contributions
icked-0.247
icks-0.222
apped-0.215
ids-0.186
aves-0.184
Token scared
Token ID12008
Feature activation+1.28e+00

Pos logit contributions
icked+0.126
icks+0.114
apped+0.111
ids+0.094
bug+0.093
Neg logit contributions
 tune-0.099
 lesson-0.092
 instructions-0.092
izing-0.090
 path-0.089
Token,
Token ID11
Feature activation+3.71e+00

Pos logit contributions
icked+0.083
icks+0.074
apped+0.072
ids+0.060
bug+0.059
Neg logit contributions
 tune-0.082
 lesson-0.078
 instructions-0.077
izing-0.076
 path-0.075
Token and
Token ID290
Feature activation-1.89e+00

Pos logit contributions
icked+0.223
icks+0.193
apped+0.193
o+0.164
bug+0.162
Neg logit contributions
 tune-0.233
 rightful-0.226
 lesson-0.226
away-0.223
 instructions-0.222
Token clean
Token ID3424
Feature activation-2.83e+00

Pos logit contributions
 tune+0.081
 lesson+0.076
 instructions+0.075
izing+0.074
 path+0.072
Neg logit contributions
icked-0.120
icks-0.109
apped-0.106
ids-0.091
bug-0.090
Token up
Token ID510
Feature activation+3.95e-01

Pos logit contributions
 tune+0.173
 lesson+0.162
Years+0.157
 path+0.156
Days+0.155
Neg logit contributions
icked-0.190
icks-0.173
apped-0.164
es-0.142
ids-0.140
Token her
Token ID607
Feature activation-5.82e+00

Pos logit contributions
icked+0.028
icks+0.025
apped+0.025
ids+0.021
bug+0.021
Neg logit contributions
 tune-0.022
izing-0.020
 lesson-0.020
 path-0.020
 instructions-0.020
Token toys
Token ID14958
Feature activation+4.24e+00

Pos logit contributions
 tune+0.367
izing+0.340
 lesson+0.338
 instructions+0.335
But+0.334
Neg logit contributions
icked-0.374
icks-0.333
apped-0.330
bug-0.272
ids-0.270
Token and
Token ID290
Feature activation-1.25e+00

Pos logit contributions
icked+0.266
apped+0.234
icks+0.233
ids+0.194
bug+0.191
Neg logit contributions
 tune-0.282
 lesson-0.261
 instructions-0.255
izing-0.253
 instincts-0.250
Token put
Token ID1234
Feature activation+1.32e+00

Pos logit contributions
 tune+0.070
 instructions+0.065
 lesson+0.065
izing+0.064
 path+0.063
Neg logit contributions
icked-0.093
icks-0.084
apped-0.082
ids-0.070
bug-0.069
Token them
Token ID606
Feature activation+9.20e-01

Pos logit contributions
icked+0.102
icks+0.095
apped+0.092
bug+0.079
aves+0.079
Neg logit contributions
 instructions-0.063
 tune-0.062
 lesson-0.061
But-0.059
 rules-0.058
Token in
Token ID287
Feature activation+3.79e+00

Pos logit contributions
icked+0.059
icks+0.055
apped+0.051
ids+0.043
bug+0.042
Neg logit contributions
 tune-0.057
 instructions-0.056
 lesson-0.055
But-0.053
 path-0.053
Token order
Token ID1502
Feature activation-1.88e+00

Pos logit contributions
icked+0.235
icks+0.211
apped+0.205
ids+0.170
aves+0.167
Neg logit contributions
 tune-0.251
izing-0.233
 lesson-0.229
 instructions-0.228
 path-0.227
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
 tune+0.106
 instructions+0.101
 lesson+0.100
izing+0.097
 rules+0.097
Neg logit contributions
icked-0.137
icks-0.123
apped-0.120
ids-0.102
bug-0.101
Token Lily
Token ID20037
Feature activation+4.57e+00

Pos logit contributions
 tune+0.093
 lesson+0.088
 instructions+0.088
But+0.087
izing+0.086
Neg logit contributions
icked-0.081
icks-0.072
apped-0.069
ids-0.056
bug-0.055
Token
Token ID198
Feature activation-4.87e-01

Pos logit contributions
 tune+0.031
 lesson+0.029
 instructions+0.029
izing+0.029
But+0.028
Neg logit contributions
icked-0.033
icks-0.029
apped-0.028
ids-0.023
bug-0.023
TokenHe
Token ID1544
Feature activation-2.16e+00

Pos logit contributions
 tune+0.030
 lesson+0.028
 instincts+0.027
izing+0.027
 instructions+0.027
Neg logit contributions
icked-0.032
apped-0.030
icks-0.028
aves-0.024
o-0.024
Token looked
Token ID3114
Feature activation+4.49e+00

Pos logit contributions
 tune+0.131
 instructions+0.124
 lesson+0.124
But+0.124
 path+0.119
Neg logit contributions
icked-0.142
icks-0.134
apped-0.124
ids-0.108
igh-0.104
Token around
Token ID1088
Feature activation+2.09e+00

Pos logit contributions
icked+0.302
icks+0.273
apped+0.263
ids+0.225
o+0.224
Neg logit contributions
 tune-0.262
 lesson-0.259
But-0.255
 instructions-0.255
away-0.249
Token and
Token ID290
Feature activation-1.87e+00

Pos logit contributions
icked+0.113
apped+0.102
icks+0.101
ids+0.078
aves+0.076
Neg logit contributions
 tune-0.148
izing-0.142
 lesson-0.136
 path-0.136
 rightful-0.136
Token saw
Token ID2497
Feature activation+3.06e+00

Pos logit contributions
 tune+0.129
 instructions+0.121
But+0.121
 lesson+0.120
izing+0.119
Neg logit contributions
icked-0.114
icks-0.103
apped-0.097
ids-0.080
bug-0.077
Token a
Token ID257
Feature activation+6.87e+00

Pos logit contributions
icked+0.187
icks+0.166
apped+0.164
ids+0.143
o+0.138
Neg logit contributions
 instructions-0.199
 lesson-0.198
 tune-0.193
 rules-0.192
 sound-0.189
Token bottle
Token ID9294
Feature activation+1.32e+00

Pos logit contributions
icked+0.529
icks+0.483
apped+0.475
bug+0.409
ids+0.408
Neg logit contributions
 tune-0.344
 lesson-0.320
 instructions-0.319
izing-0.316
 rules-0.306
Token of
Token ID286
Feature activation+4.74e+00

Pos logit contributions
icked+0.081
icks+0.072
apped+0.070
ids+0.060
o+0.058
Neg logit contributions
 tune-0.085
 instructions-0.082
 path-0.082
 lesson-0.081
 sound-0.081
Token red
Token ID2266
Feature activation+6.87e-01

Pos logit contributions
icked+0.375
icks+0.338
apped+0.333
ids+0.286
o+0.284
Neg logit contributions
 tune-0.244
 lesson-0.230
 instructions-0.229
 rules-0.221
izing-0.219
Token sauce
Token ID10746
Feature activation+4.07e+00

Pos logit contributions
icked+0.054
icks+0.049
apped+0.048
o+0.042
ids+0.041
Neg logit contributions
 tune-0.035
 instructions-0.033
 lesson-0.033
 rules-0.032
izing-0.032
Token nodded
Token ID14464
Feature activation+4.19e-01

Pos logit contributions
icked+0.290
icks+0.272
apped+0.252
ids+0.229
o+0.216
Neg logit contributions
 tune-0.234
But-0.217
 instructions-0.213
 path-0.213
 lesson-0.210
Token and
Token ID290
Feature activation-2.01e+00

Pos logit contributions
icked+0.029
icks+0.026
apped+0.026
ids+0.022
bug+0.021
Neg logit contributions
 tune-0.025
 lesson-0.024
 instructions-0.024
izing-0.023
 path-0.023
Token thanked
Token ID26280
Feature activation-2.48e+00

Pos logit contributions
 tune+0.123
 lesson+0.115
 instructions+0.115
 path+0.112
But+0.112
Neg logit contributions
icked-0.134
icks-0.125
apped-0.116
ids-0.105
aves-0.101
Token mom
Token ID1995
Feature activation-1.77e+00

Pos logit contributions
 tune+0.162
izing+0.157
 lesson+0.153
 path+0.151
 instructions+0.150
Neg logit contributions
icked-0.157
icks-0.141
apped-0.137
bug-0.114
o-0.112
Token.
Token ID13
Feature activation-1.91e+00

Pos logit contributions
 tune+0.093
 instructions+0.090
 lesson+0.088
 rules+0.086
 path+0.084
Neg logit contributions
icked-0.138
icks-0.124
apped-0.121
ids-0.104
aves-0.101
Token They
Token ID1119
Feature activation+5.43e+00

Pos logit contributions
 tune+0.140
But+0.138
 instructions+0.137
 lesson+0.136
 path+0.134
Neg logit contributions
icked-0.105
icks-0.090
apped-0.083
ids-0.071
igh-0.069
Token went
Token ID1816
Feature activation-6.63e-01

Pos logit contributions
icked+0.371
icks+0.350
apped+0.333
ids+0.295
o+0.279
Neg logit contributions
 tune-0.304
 lesson-0.296
But-0.288
 path-0.288
 instructions-0.281
Token back
Token ID736
Feature activation-1.95e-01

Pos logit contributions
 tune+0.030
 instructions+0.030
 path+0.028
 lesson+0.028
 rules+0.028
Neg logit contributions
icked-0.056
icks-0.050
apped-0.049
ids-0.043
aves-0.043
Token to
Token ID284
Feature activation-2.31e+00

Pos logit contributions
 tune+0.009
 instructions+0.008
 lesson+0.008
 path+0.008
away+0.008
Neg logit contributions
icked-0.016
icks-0.015
apped-0.014
ids-0.013
aves-0.012
Token their
Token ID511
Feature activation-1.19e+00

Pos logit contributions
 tune+0.143
izing+0.136
 lesson+0.135
 instructions+0.134
 path+0.131
Neg logit contributions
icked-0.156
icks-0.142
apped-0.136
bug-0.113
ids-0.113
Token room
Token ID2119
Feature activation+3.50e-01

Pos logit contributions
izing+0.069
 tune+0.069
But+0.065
Years+0.062
 lesson+0.062
Neg logit contributions
icked-0.075
apped-0.070
icks-0.069
bug-0.061
o-0.061
Token,
Token ID11
Feature activation+3.73e+00

Pos logit contributions
icked+0.186
icks+0.158
apped+0.154
ids+0.119
bug+0.118
Neg logit contributions
 tune-0.284
 lesson-0.279
 instructions-0.273
izing-0.272
 path-0.270
Token Tweet
Token ID18752
Feature activation+3.68e+00

Pos logit contributions
icked+0.230
icks+0.203
apped+0.197
ids+0.161
bug+0.158
Neg logit contributions
 tune-0.256
 instructions-0.242
 lesson-0.241
izing-0.237
 path-0.235
Tokenie
Token ID494
Feature activation+3.62e+00

Pos logit contributions
icked+0.277
icks+0.248
apped+0.242
o+0.218
bug+0.212
Neg logit contributions
 tune-0.199
 instructions-0.192
 lesson-0.184
 rules-0.182
izing-0.182
Token never
Token ID1239
Feature activation-6.07e+00

Pos logit contributions
icked+0.229
icks+0.202
apped+0.197
o+0.175
es+0.165
Neg logit contributions
 tune-0.241
 instructions-0.233
izing-0.230
 rules-0.229
 path-0.219
Token balanced
Token ID12974
Feature activation-1.20e+00

Pos logit contributions
 tune+0.368
 lesson+0.339
 path+0.333
 instructions+0.329
 rules+0.322
Neg logit contributions
icked-0.424
icks-0.386
apped-0.379
ids-0.326
aves-0.310
Token on
Token ID319
Feature activation+3.67e+00

Pos logit contributions
 tune+0.092
 instructions+0.086
 lesson+0.086
izing+0.085
 path+0.084
Neg logit contributions
icked-0.065
icks-0.055
apped-0.051
ids-0.043
bug-0.042
Token a
Token ID257
Feature activation+6.88e+00

Pos logit contributions
icked+0.214
icks+0.193
apped+0.175
bug+0.150
o+0.148
Neg logit contributions
 tune-0.254
izing-0.236
 lesson-0.231
But-0.231
 instructions-0.231
Token hook
Token ID8011
Feature activation-2.99e+00

Pos logit contributions
icked+0.406
icks+0.391
aves+0.380
bug+0.373
ids+0.364
Neg logit contributions
izing-0.347
Years-0.333
 rightful-0.328
 instructions-0.323
 lesson-0.319
Token again
Token ID757
Feature activation-1.02e+00

Pos logit contributions
 tune+0.167
 lesson+0.159
 instructions+0.159
izing+0.157
 rules+0.152
Neg logit contributions
icked-0.215
icks-0.194
apped-0.189
bug-0.163
ids-0.163
Token,
Token ID11
Feature activation+3.78e+00

Pos logit contributions
 lesson+0.070
 tune+0.069
 instructions+0.068
Years+0.068
 rules+0.066
Neg logit contributions
icked-0.057
icks-0.051
apped-0.047
bug-0.046
es-0.044
Token but
Token ID475
Feature activation-5.32e+00

Pos logit contributions
icked+0.214
apped+0.192
icks+0.191
bug+0.173
o+0.170
Neg logit contributions
 instructions-0.250
 rules-0.239
izing-0.235
 tune-0.231
 lesson-0.230
Token morning
Token ID3329
Feature activation-4.04e+00

Pos logit contributions
icked+0.070
apped+0.068
icks+0.066
o+0.063
ow+0.057
Neg logit contributions
izing-0.050
 instructions-0.050
 instincts-0.047
 tune-0.046
 rightful-0.045
Token he
Token ID339
Feature activation-3.16e+00

Pos logit contributions
 tune+0.265
 lesson+0.252
But+0.252
 instructions+0.245
 path+0.243
Neg logit contributions
icked-0.269
icks-0.243
apped-0.219
ids-0.194
aves-0.194
Token would
Token ID561
Feature activation-3.64e+00

Pos logit contributions
 tune+0.165
 instructions+0.154
 lesson+0.153
izing+0.152
 rules+0.148
Neg logit contributions
icked-0.245
icks-0.222
apped-0.217
ids-0.187
o-0.185
Token set
Token ID900
Feature activation-3.28e+00

Pos logit contributions
 tune+0.212
izing+0.212
 lesson+0.203
 instructions+0.202
 rules+0.194
Neg logit contributions
icked-0.246
icks-0.230
apped-0.214
ids-0.188
aves-0.185
Token off
Token ID572
Feature activation-1.35e+00

Pos logit contributions
 tune+0.189
 lesson+0.175
izing+0.175
 instructions+0.174
But+0.169
Neg logit contributions
icked-0.237
icks-0.215
apped-0.206
ids-0.176
o-0.172
Token,
Token ID11
Feature activation+4.00e+00

Pos logit contributions
 tune+0.088
 lesson+0.083
 instructions+0.083
 path+0.081
izing+0.081
Neg logit contributions
icked-0.087
icks-0.077
apped-0.075
ids-0.062
bug-0.061
Token exploring
Token ID13504
Feature activation+2.75e+00

Pos logit contributions
icked+0.260
icks+0.236
apped+0.228
o+0.196
ids+0.195
Neg logit contributions
 tune-0.245
izing-0.237
 lesson-0.237
 instructions-0.234
 rules-0.227
Token the
Token ID262
Feature activation-2.31e+00

Pos logit contributions
icked+0.161
icks+0.148
apped+0.145
es+0.124
aves+0.121
Neg logit contributions
 tune-0.177
 instructions-0.164
 lesson-0.163
izing-0.162
 rightful-0.162
Token world
Token ID995
Feature activation-6.04e+00

Pos logit contributions
 tune+0.104
izing+0.103
 instincts+0.095
 instructions+0.094
 lesson+0.090
Neg logit contributions
icked-0.160
icks-0.156
apped-0.155
o-0.150
bug-0.143
Token and
Token ID290
Feature activation-1.60e+00

Pos logit contributions
 tune+0.401
 lesson+0.379
 instructions+0.377
izing+0.369
 rules+0.364
Neg logit contributions
icked-0.378
icks-0.337
apped-0.328
bug-0.272
o-0.272
Token playing
Token ID2712
Feature activation+5.45e+00

Pos logit contributions
 tune+0.102
izing+0.098
 lesson+0.096
 instructions+0.096
 rules+0.093
Neg logit contributions
icked-0.101
icks-0.092
apped-0.089
o-0.075
bug-0.074
Token.
Token ID13
Feature activation-8.31e-01

Pos logit contributions
icked+0.118
icks+0.107
apped+0.105
ids+0.088
bug+0.087
Neg logit contributions
 tune-0.101
 lesson-0.096
 instructions-0.096
izing-0.093
 path-0.092
Token The
Token ID383
Feature activation+5.00e-01

Pos logit contributions
 tune+0.051
But+0.050
 lesson+0.049
 instructions+0.048
 rules+0.046
Neg logit contributions
icked-0.057
icks-0.051
apped-0.047
igh-0.043
ids-0.041
Token sandwich
Token ID20433
Feature activation+9.22e+00

Pos logit contributions
icked+0.031
icks+0.027
apped+0.027
o+0.022
bug+0.021
Neg logit contributions
 tune-0.033
izing-0.033
 instructions-0.031
away-0.031
 instincts-0.030
Token was
Token ID373
Feature activation-2.46e+00

Pos logit contributions
icked+0.667
icks+0.591
apped+0.582
bug+0.489
o+0.488
Neg logit contributions
 tune-0.537
 instructions-0.503
izing-0.492
 lesson-0.491
 rules-0.487
Token cool
Token ID3608
Feature activation-2.84e+00

Pos logit contributions
 tune+0.140
 lesson+0.136
 instructions+0.130
But+0.129
izing+0.128
Neg logit contributions
icked-0.176
icks-0.160
apped-0.158
ids-0.136
aves-0.132
Token because
Token ID780
Feature activation-2.95e+00

Pos logit contributions
 tune+0.164
 instructions+0.154
izing+0.154
 lesson+0.153
 path+0.150
Neg logit contributions
icked-0.207
icks-0.187
apped-0.182
ids-0.157
bug-0.150
Token it
Token ID340
Feature activation-2.00e+00

Pos logit contributions
 tune+0.159
 lesson+0.149
 instructions+0.148
izing+0.145
But+0.143
Neg logit contributions
icked-0.224
icks-0.203
apped-0.198
ids-0.169
bug-0.167
Token was
Token ID373
Feature activation-2.43e+00

Pos logit contributions
 tune+0.148
 instructions+0.142
 rules+0.140
 instincts+0.137
izing+0.137
Neg logit contributions
icked-0.115
icks-0.094
apped-0.091
o-0.081
bug-0.080
Token in
Token ID287
Feature activation+4.03e+00

Pos logit contributions
 tune+0.141
 instructions+0.132
 lesson+0.131
izing+0.129
 path+0.127
Neg logit contributions
icked-0.175
icks-0.157
apped-0.152
ids-0.129
bug-0.128
Token the
Token ID262
Feature activation-1.71e+00

Pos logit contributions
icked+0.240
icks+0.218
apped+0.200
bug+0.169
aves+0.167
Neg logit contributions
 tune-0.272
 instincts-0.253
izing-0.249
 instructions-0.244
 lesson-0.244
Token fridge
Token ID25772
Feature activation+8.39e-01

Pos logit contributions
izing+0.103
 tune+0.101
 instincts+0.095
 instructions+0.093
 lesson+0.093
Neg logit contributions
icked-0.111
icks-0.103
apped-0.098
o-0.086
bug-0.085
Token cream
Token ID8566
Feature activation-1.51e+00

Pos logit contributions
icked+0.043
icks+0.039
apped+0.038
o+0.034
ids+0.033
Neg logit contributions
 tune-0.027
 lesson-0.024
 instructions-0.024
izing-0.024
But-0.023
Token cone
Token ID27763
Feature activation-7.83e-01

Pos logit contributions
 tune+0.095
 lesson+0.089
 instructions+0.088
izing+0.088
 rules+0.086
Neg logit contributions
icked-0.101
icks-0.089
apped-0.087
ids-0.073
o-0.072
Token.
Token ID13
Feature activation-1.45e+00

Pos logit contributions
 tune+0.047
 instructions+0.044
 lesson+0.044
izing+0.043
 path+0.043
Neg logit contributions
icked-0.054
icks-0.049
apped-0.047
ids-0.040
bug-0.039
Token The
Token ID383
Feature activation+2.71e-01

Pos logit contributions
 tune+0.091
But+0.086
 instructions+0.086
 lesson+0.085
 path+0.083
Neg logit contributions
icked-0.097
icks-0.086
apped-0.082
ids-0.068
aves-0.067
Token bird
Token ID6512
Feature activation+8.81e-01

Pos logit contributions
icked+0.021
apped+0.019
icks+0.019
bug+0.017
o+0.016
Neg logit contributions
 tune-0.013
izing-0.013
 instructions-0.012
But-0.012
 lesson-0.012
Token was
Token ID373
Feature activation-2.78e+00

Pos logit contributions
icked+0.061
icks+0.055
apped+0.053
ids+0.045
bug+0.045
Neg logit contributions
 tune-0.053
 instructions-0.050
 lesson-0.050
izing-0.049
 rules-0.048
Token happy
Token ID3772
Feature activation-1.39e+00

Pos logit contributions
 tune+0.166
 lesson+0.153
But+0.152
 instructions+0.151
 sound+0.151
Neg logit contributions
icked-0.191
apped-0.172
icks-0.171
ids-0.147
aves-0.142
Token and
Token ID290
Feature activation-1.33e+00

Pos logit contributions
 tune+0.090
 lesson+0.086
 instructions+0.085
izing+0.084
 path+0.083
Neg logit contributions
icked-0.089
icks-0.079
apped-0.077
bug-0.065
o-0.065
Token flew
Token ID13112
Feature activation+8.65e-01

Pos logit contributions
 tune+0.088
 instructions+0.085
 rules+0.084
izing+0.083
 lesson+0.082
Neg logit contributions
icked-0.085
icks-0.073
apped-0.073
o-0.059
bug-0.058
Token away
Token ID1497
Feature activation-5.98e+00

Pos logit contributions
icked+0.062
apped+0.055
icks+0.055
o+0.046
bug+0.046
Neg logit contributions
 tune-0.050
izing-0.047
 instructions-0.047
 lesson-0.046
 rules-0.046
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
 tune+0.389
 instructions+0.367
 lesson+0.366
Years+0.363
izing+0.362
Neg logit contributions
icked-0.376
icks-0.335
apped-0.331
bug-0.264
o-0.262
Token the
Token ID262
Feature activation-2.01e+00

Pos logit contributions
icked+0.287
apped+0.257
icks+0.255
o+0.232
ids+0.230
Neg logit contributions
 lesson-0.173
 instructions-0.170
 tune-0.169
 rules-0.167
 sound-0.158
Token dog
Token ID3290
Feature activation-8.06e-01

Pos logit contributions
 tune+0.108
izing+0.101
 instructions+0.100
 lesson+0.098
But+0.097
Neg logit contributions
icked-0.148
icks-0.137
apped-0.133
ids-0.113
aves-0.111
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
 tune+0.044
 instructions+0.041
 lesson+0.040
 path+0.040
 rules+0.040
Neg logit contributions
icked-0.061
icks-0.055
apped-0.054
ids-0.046
o-0.045
Token They
Token ID1119
Feature activation+6.00e+00

Pos logit contributions
 tune+0.076
 lesson+0.075
 instructions+0.074
But+0.073
 path+0.072
Neg logit contributions
icked-0.069
icks-0.056
apped-0.055
igh-0.049
ids-0.048
Token were
Token ID547
Feature activation+4.86e+00

Pos logit contributions
icked+0.382
icks+0.351
apped+0.342
ids+0.293
aves+0.270
Neg logit contributions
 tune-0.378
But-0.373
 path-0.365
 instructions-0.354
 lesson-0.350
Token still
Token ID991
Feature activation-5.16e+00

Pos logit contributions
icked+0.366
apped+0.340
icks+0.323
ids+0.307
igh+0.303
Neg logit contributions
 sound-0.228
 tune-0.227
 path-0.220
But-0.214
 instructions-0.212
Token scared
Token ID12008
Feature activation+1.71e+00

Pos logit contributions
 tune+0.307
 lesson+0.280
 instructions+0.280
 path+0.280
 sound+0.276
Neg logit contributions
icked-0.367
icks-0.326
apped-0.313
ids-0.275
aves-0.261
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
icked+0.128
icks+0.115
apped+0.113
ids+0.096
bug+0.095
Neg logit contributions
 tune-0.095
 instructions-0.089
 lesson-0.089
izing-0.087
 rules-0.086
Token But
Token ID887
Feature activation-9.27e+00

Pos logit contributions
But+0.074
 lesson+0.071
 tune+0.071
 instructions+0.071
 rules+0.069
Neg logit contributions
icked-0.066
icks-0.056
apped-0.053
igh-0.048
bug-0.047
Token they
Token ID484
Feature activation+2.65e+00

Pos logit contributions
But+0.445
 lesson+0.418
 path+0.393
 man+0.388
 sound+0.385
Neg logit contributions
icked-0.746
icks-0.707
apped-0.663
ids-0.619
igh-0.618
Token also
Token ID635
Feature activation+7.79e-01

Pos logit contributions
icked+0.170
icks+0.158
apped+0.153
ids+0.131
aves+0.126
Neg logit contributions
 tune-0.159
But-0.157
 lesson-0.153
 path-0.152
 instructions-0.149
Tokenr
Token ID81
Feature activation+5.96e+00

Pos logit contributions
icked+0.023
icks+0.020
apped+0.019
ids+0.016
bug+0.015
Neg logit contributions
 tune-0.032
izing-0.028
 instructions-0.028
 melody-0.028
away-0.027
Token every
Token ID790
Feature activation-5.91e+00

Pos logit contributions
icked+0.415
icks+0.376
apped+0.365
es+0.306
ids+0.301
Neg logit contributions
 tune-0.342
 lesson-0.332
 path-0.327
But-0.325
 instructions-0.325
Token time
Token ID640
Feature activation-2.94e+00

Pos logit contributions
izing+0.286
 tune+0.273
 instincts+0.269
 instructions+0.266
But+0.265
Neg logit contributions
icked-0.457
icks-0.422
apped-0.410
bug-0.360
o-0.346
Token she
Token ID673
Feature activation-3.85e+00

Pos logit contributions
 tune+0.161
 lesson+0.150
 instructions+0.148
izing+0.146
 path+0.144
Neg logit contributions
icked-0.220
icks-0.198
apped-0.194
ids-0.167
o-0.164
Token saw
Token ID2497
Feature activation+3.28e+00

Pos logit contributions
 tune+0.252
 instructions+0.237
 lesson+0.237
izing+0.233
 path+0.229
Neg logit contributions
icked-0.249
icks-0.221
apped-0.215
ids-0.177
bug-0.175
Token him
Token ID683
Feature activation-1.29e+00

Pos logit contributions
icked+0.235
icks+0.208
apped+0.205
bug+0.174
o+0.172
Neg logit contributions
 tune-0.191
 lesson-0.179
 path-0.176
izing-0.176
 instructions-0.176
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
 tune+0.074
 instructions+0.069
 lesson+0.069
izing+0.067
 path+0.067
Neg logit contributions
icked-0.095
icks-0.085
apped-0.083
ids-0.070
o-0.068
Token<|endoftext|>
Token ID50256
Feature activation+4.30e+00

Pos logit contributions
 tune+0.080
But+0.078
 lesson+0.077
 instructions+0.076
 path+0.074
Neg logit contributions
icked-0.094
icks-0.083
apped-0.082
ids-0.068
bug-0.067
TokenOnce
Token ID7454
Feature activation+1.55e+00

Pos logit contributions
icked+0.340
icks+0.304
apped+0.296
ids+0.262
bug+0.251
Neg logit contributions
But-0.212
 lesson-0.205
 tune-0.204
 instructions-0.198
 path-0.197
Token upon
Token ID2402
Feature activation+4.38e+00

Pos logit contributions
icked+0.097
icks+0.086
apped+0.081
ids+0.068
aves+0.066
Neg logit contributions
 lesson-0.100
 tune-0.099
But-0.095
 path-0.095
 instructions-0.095
Token a
Token ID257
Feature activation+7.20e+00

Pos logit contributions
icked+0.250
icks+0.215
apped+0.211
ids+0.174
igh+0.160
Neg logit contributions
 tune-0.325
But-0.313
 lesson-0.311
 instructions-0.300
 sound-0.297
Token.
Token ID13
Feature activation-1.47e+00

Pos logit contributions
 tune+0.236
 lesson+0.220
 instructions+0.220
izing+0.216
 path+0.213
Neg logit contributions
icked-0.299
icks-0.269
apped-0.263
ids-0.222
o-0.219
Token He
Token ID679
Feature activation-1.03e-02

Pos logit contributions
 tune+0.086
 lesson+0.083
But+0.081
 instructions+0.081
 rules+0.079
Neg logit contributions
icked-0.107
icks-0.093
apped-0.091
ids-0.077
bug-0.077
Token worked
Token ID3111
Feature activation+1.79e+00

Pos logit contributions
 tune+0.001
 instructions+0.001
izing+0.001
 lesson+0.001
 rules+0.001
Neg logit contributions
icked-0.001
icks-0.001
apped-0.001
o-0.001
ids-0.001
Token hard
Token ID1327
Feature activation-1.90e+00

Pos logit contributions
icked+0.128
icks+0.114
apped+0.112
o+0.094
aves+0.093
Neg logit contributions
 tune-0.105
 path-0.094
 instructions-0.094
 rightful-0.092
izing-0.092
Token to
Token ID284
Feature activation-2.08e+00

Pos logit contributions
 tune+0.127
 rightful+0.119
 instructions+0.116
 instincts+0.116
izing+0.115
Neg logit contributions
icked-0.116
icks-0.103
apped-0.102
ids-0.082
o-0.080
Token make
Token ID787
Feature activation-1.83e+00

Pos logit contributions
 lesson+0.123
 tune+0.122
izing+0.120
 instincts+0.115
Years+0.115
Neg logit contributions
icked-0.120
icks-0.118
apped-0.101
ids-0.095
aves-0.093
Token sure
Token ID1654
Feature activation-5.00e-01

Pos logit contributions
 tune+0.109
 path+0.102
 lesson+0.099
But+0.099
izing+0.099
Neg logit contributions
icked-0.125
icks-0.115
apped-0.108
es-0.091
aves-0.090
Token his
Token ID465
Feature activation-6.77e+00

Pos logit contributions
 tune+0.029
izing+0.027
Years+0.026
 melody+0.026
But+0.026
Neg logit contributions
icked-0.035
icks-0.032
apped-0.030
o-0.025
ids-0.025
Token family
Token ID1641
Feature activation+2.37e-01

Pos logit contributions
izing+0.371
 tune+0.365
But+0.354
Years+0.352
away+0.352
Neg logit contributions
icked-0.461
icks-0.428
apped-0.425
're-0.341
aves-0.339
Token had
Token ID550
Feature activation+5.58e+00

Pos logit contributions
icked+0.014
icks+0.012
apped+0.012
bug+0.009
o+0.009
Neg logit contributions
 tune-0.017
 instructions-0.016
izing-0.016
 lesson-0.016
 rules-0.015
Token enough
Token ID1576
Feature activation+6.51e+00

Pos logit contributions
icked+0.357
icks+0.316
apped+0.304
ids+0.249
aves+0.246
Neg logit contributions
 tune-0.368
izing-0.344
 instructions-0.340
 lesson-0.339
 path-0.339
Token them
Token ID606
Feature activation+9.73e-01

Pos logit contributions
icked+0.109
icks+0.099
apped+0.098
ids+0.084
bug+0.083
Neg logit contributions
 tune-0.065
 instructions-0.062
 lesson-0.062
But-0.059
izing-0.059
Token on
Token ID319
Feature activation+4.16e+00

Pos logit contributions
icked+0.055
icks+0.049
apped+0.048
ids+0.039
bug+0.038
Neg logit contributions
 tune-0.069
 instructions-0.066
 lesson-0.066
 path-0.064
But-0.064
Token the
Token ID262
Feature activation-1.98e+00

Pos logit contributions
icked+0.234
icks+0.215
apped+0.196
aves+0.172
ids+0.168
Neg logit contributions
 tune-0.287
 instincts-0.266
 path-0.264
izing-0.263
 lesson-0.260
Token tray
Token ID26473
Feature activation+1.97e+00

Pos logit contributions
 tune+0.113
izing+0.109
 instincts+0.103
Years+0.102
away+0.102
Neg logit contributions
icked-0.135
icks-0.131
apped-0.116
ids-0.105
aves-0.099
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
icked+0.144
icks+0.130
apped+0.127
ids+0.107
bug+0.106
Neg logit contributions
 tune-0.113
 lesson-0.105
 instructions-0.104
izing-0.104
But-0.101
Token But
Token ID887
Feature activation-9.35e+00

Pos logit contributions
But+0.075
 lesson+0.074
 tune+0.074
 instructions+0.073
 path+0.071
Neg logit contributions
icked-0.066
icks-0.058
apped-0.053
bug-0.047
igh-0.047
Token Tim
Token ID5045
Feature activation+2.90e+00

Pos logit contributions
 tune+0.533
 lesson+0.508
But+0.501
 instructions+0.498
izing+0.495
Neg logit contributions
icked-0.675
icks-0.621
apped-0.587
ids-0.503
aves-0.496
Token wanted
Token ID2227
Feature activation-1.52e+00

Pos logit contributions
icked+0.179
icks+0.155
apped+0.152
o+0.129
bug+0.126
Neg logit contributions
 tune-0.198
 instructions-0.188
 lesson-0.186
izing-0.185
 rules-0.183
Token more
Token ID517
Feature activation+2.51e+00

Pos logit contributions
 tune+0.089
 lesson+0.081
izing+0.081
 instructions+0.081
 path+0.080
Neg logit contributions
icked-0.111
icks-0.100
apped-0.096
ids-0.082
bug-0.082
Token cheese
Token ID9891
Feature activation+7.66e+00

Pos logit contributions
icked+0.173
icks+0.156
apped+0.150
ids+0.128
bug+0.126
Neg logit contributions
 tune-0.152
 instructions-0.142
 lesson-0.142
izing-0.139
 path-0.138
Token.
Token ID13
Feature activation-9.97e-01

Pos logit contributions
icked+0.513
icks+0.458
apped+0.442
ids+0.371
bug+0.367
Neg logit contributions
 tune-0.483
 instructions-0.452
 lesson-0.450
izing-0.445
 path-0.438
Token tests
Token ID5254
Feature activation-4.48e+00

Pos logit contributions
 tune+0.282
 rightful+0.278
away+0.269
 path+0.265
izing+0.260
Neg logit contributions
icked-0.243
icks-0.228
apped-0.212
ids-0.192
ow-0.190
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 tune+0.285
 instructions+0.267
 lesson+0.266
izing+0.262
 path+0.260
Neg logit contributions
icked-0.294
icks-0.262
apped-0.255
ids-0.213
o-0.208
Token
Token ID198
Feature activation+4.19e-01

Pos logit contributions
 tune+0.076
 instructions+0.071
izing+0.071
 lesson+0.070
 path+0.070
Neg logit contributions
icked-0.055
apped-0.049
icks-0.047
o-0.038
ids-0.036
Token
Token ID198
Feature activation+4.00e-01

Pos logit contributions
icked+0.025
icks+0.022
apped+0.022
o+0.018
ids+0.017
Neg logit contributions
 tune-0.029
 instructions-0.028
izing-0.027
 lesson-0.027
 path-0.027
TokenThe
Token ID464
Feature activation-6.01e+00

Pos logit contributions
icked+0.025
apped+0.024
icks+0.022
o+0.021
aves+0.020
Neg logit contributions
 instincts-0.023
 tune-0.022
 instructions-0.022
 gut-0.022
 rightful-0.021
Token next
Token ID1306
Feature activation-7.92e+00

Pos logit contributions
izing+0.293
 tune+0.280
 rightful+0.275
away+0.270
 instincts+0.257
Neg logit contributions
icked-0.418
apped-0.409
icks-0.391
o-0.318
ids-0.318
Token day
Token ID1110
Feature activation-4.64e+00

Pos logit contributions
izing+0.320
 tune+0.318
 instructions+0.303
 instincts+0.303
 rightful+0.300
Neg logit contributions
icked-0.655
apped-0.639
icks-0.601
o-0.555
bug-0.526
Token,
Token ID11
Feature activation+4.47e+00

Pos logit contributions
 lesson+0.274
But+0.268
 tune+0.268
 man+0.256
 sound+0.247
Neg logit contributions
icked-0.323
icks-0.305
apped-0.250
aves-0.242
igh-0.235
Token Lily
Token ID20037
Feature activation+4.85e+00

Pos logit contributions
icked+0.272
icks+0.244
apped+0.227
ids+0.189
o+0.188
Neg logit contributions
 tune-0.316
 lesson-0.305
But-0.299
 instructions-0.297
izing-0.295
Token went
Token ID1816
Feature activation-5.42e-02

Pos logit contributions
icked+0.304
icks+0.269
apped+0.262
ids+0.215
bug+0.212
Neg logit contributions
 tune-0.325
 instructions-0.307
 lesson-0.306
izing-0.302
 rules-0.297
Token to
Token ID284
Feature activation-1.65e+00

Pos logit contributions
 tune+0.003
izing+0.003
 lesson+0.003
 instructions+0.003
But+0.003
Neg logit contributions
icked-0.004
icks-0.003
apped-0.003
ids-0.003
o-0.003
Token Lucy
Token ID22162
Feature activation+3.11e+00

Pos logit contributions
icked+0.224
icks+0.197
apped+0.184
ids+0.149
bug+0.143
Neg logit contributions
 tune-0.298
 lesson-0.284
 instructions-0.282
izing-0.279
But-0.277
Token asked
Token ID1965
Feature activation-1.52e+00

Pos logit contributions
icked+0.216
icks+0.194
apped+0.187
ids+0.158
bug+0.155
Neg logit contributions
 tune-0.189
 lesson-0.176
 instructions-0.176
izing-0.172
 path-0.171
Token Jack
Token ID3619
Feature activation+6.39e+00

Pos logit contributions
 tune+0.092
 instructions+0.088
 lesson+0.087
 rules+0.085
But+0.084
Neg logit contributions
icked-0.106
icks-0.095
apped-0.092
ids-0.078
bug-0.077
Token,
Token ID11
Feature activation+3.97e+00

Pos logit contributions
icked+0.449
apped+0.399
icks+0.399
ids+0.326
aves+0.315
Neg logit contributions
 tune-0.372
 instructions-0.364
 lesson-0.363
 rules-0.347
But-0.347
Token "
Token ID366
Feature activation-3.69e+00

Pos logit contributions
icked+0.281
icks+0.252
apped+0.248
ids+0.210
bug+0.207
Neg logit contributions
 tune-0.230
 lesson-0.216
izing-0.213
 path-0.213
 instructions-0.212
TokenWhy
Token ID5195
Feature activation-8.12e+00

Pos logit contributions
 lesson+0.159
 path+0.151
 tune+0.150
 instincts+0.146
 gut+0.143
Neg logit contributions
icked-0.298
apped-0.272
icks-0.258
bug-0.256
o-0.253
Token don
Token ID836
Feature activation-4.20e+00

Pos logit contributions
 tune+0.481
 lesson+0.468
 instructions+0.458
 rules+0.446
izing+0.446
Neg logit contributions
icked-0.569
icks-0.504
apped-0.498
bug-0.417
ids-0.414
Token't
Token ID470
Feature activation-1.26e+00

Pos logit contributions
 tune+0.245
 lesson+0.229
 instructions+0.228
izing+0.223
 path+0.221
Neg logit contributions
icked-0.299
icks-0.269
apped-0.262
ids-0.221
bug-0.218
Token you
Token ID345
Feature activation-1.87e+00

Pos logit contributions
 tune+0.083
 lesson+0.082
Years+0.080
 instructions+0.078
 rightful+0.077
Neg logit contributions
icked-0.071
icks-0.065
apped-0.061
bug-0.054
o-0.053
Token lift
Token ID10303
Feature activation-4.12e+00

Pos logit contributions
 tune+0.097
 lesson+0.090
 instructions+0.090
izing+0.088
 path+0.086
Neg logit contributions
icked-0.145
icks-0.132
apped-0.129
ids-0.111
bug-0.109
Token the
Token ID262
Feature activation-2.38e+00

Pos logit contributions
 tune+0.224
 instructions+0.210
 lesson+0.209
izing+0.205
 rules+0.202
Neg logit contributions
icked-0.312
icks-0.281
apped-0.276
ids-0.235
bug-0.232
Token the
Token ID262
Feature activation-2.52e+00

Pos logit contributions
icked+0.014
icks+0.013
apped+0.012
bug+0.010
ids+0.010
Neg logit contributions
 tune-0.012
 instructions-0.012
 lesson-0.011
izing-0.011
 path-0.011
Token bread
Token ID8509
Feature activation+1.27e-02

Pos logit contributions
 tune+0.127
 lesson+0.119
 instructions+0.118
 path+0.114
 rules+0.113
Neg logit contributions
icked-0.204
icks-0.184
apped-0.181
ids-0.158
bug-0.154
Token.
Token ID13
Feature activation-1.78e+00

Pos logit contributions
icked+0.001
icks+0.001
apped+0.001
ids+0.001
bug+0.001
Neg logit contributions
 tune-0.001
 lesson-0.001
 instructions-0.001
izing-0.001
 path-0.001
Token
Token ID198
Feature activation-3.74e-01

Pos logit contributions
But+0.107
 tune+0.104
 lesson+0.103
 instructions+0.099
 rules+0.096
Neg logit contributions
icked-0.123
icks-0.108
apped-0.101
igh-0.095
bug-0.092
Token
Token ID198
Feature activation-3.76e-01

Pos logit contributions
 tune+0.024
 lesson+0.022
 instructions+0.022
izing+0.022
But+0.022
Neg logit contributions
icked-0.025
icks-0.022
apped-0.021
ids-0.018
bug-0.018
TokenBut
Token ID1537
Feature activation-1.34e+01

Pos logit contributions
 tune+0.011
 instructions+0.010
 instincts+0.009
 lesson+0.009
 rightful+0.009
Neg logit contributions
icked-0.037
apped-0.034
icks-0.034
aves-0.031
o-0.031
Token then
Token ID788
Feature activation+7.71e-01

Pos logit contributions
But+0.655
 tune+0.625
 lesson+0.599
izing+0.557
 sound+0.551
Neg logit contributions
icked-1.076
icks-1.045
apped-0.929
aves-0.862
ids-0.851
Token a
Token ID257
Feature activation+6.88e+00

Pos logit contributions
icked+0.061
icks+0.056
umbs+0.056
rol+0.055
apped+0.052
Neg logit contributions
 man-0.036
 lesson-0.036
But-0.035
 thing-0.034
 things-0.032
Token big
Token ID1263
Feature activation-3.66e-01

Pos logit contributions
icked+0.568
icks+0.521
apped+0.512
ids+0.441
bug+0.439
Neg logit contributions
 tune-0.315
 instructions-0.290
izing-0.286
 lesson-0.286
But-0.275
Token wind
Token ID2344
Feature activation-2.02e+00

Pos logit contributions
 tune+0.015
 lesson+0.013
 path+0.013
 instructions+0.013
 rules+0.013
Neg logit contributions
icked-0.033
icks-0.030
apped-0.030
ids-0.027
aves-0.026
Token comes
Token ID2058
Feature activation+1.57e+00

Pos logit contributions
 tune+0.137
 lesson+0.129
 instructions+0.128
But+0.127
izing+0.126
Neg logit contributions
icked-0.121
icks-0.107
apped-0.105
ids-0.088
aves-0.085
Token that
Token ID326
Feature activation+1.41e-01

Pos logit contributions
 tune+0.067
 instructions+0.063
 lesson+0.062
 rules+0.061
izing+0.059
Neg logit contributions
icked-0.065
icks-0.056
apped-0.056
bug-0.047
o-0.046
Token day
Token ID1110
Feature activation-5.15e+00

Pos logit contributions
icked+0.013
icks+0.012
apped+0.012
o+0.010
bug+0.010
Neg logit contributions
 tune-0.005
 instructions-0.005
izing-0.005
 rules-0.004
 rightful-0.004
Token on
Token ID319
Feature activation+3.67e+00

Pos logit contributions
But+0.329
 lesson+0.320
 tune+0.312
 path+0.302
 instructions+0.302
Neg logit contributions
icked-0.292
icks-0.270
apped-0.254
bugs-0.244
bug-0.242
Token,
Token ID11
Feature activation+3.84e+00

Pos logit contributions
icked+0.177
icks+0.150
apped+0.144
ids+0.108
bug+0.106
Neg logit contributions
 tune-0.301
 lesson-0.288
 instructions-0.288
izing-0.284
 path-0.281
Token Betsy
Token ID45775
Feature activation+1.82e+00

Pos logit contributions
icked+0.239
icks+0.209
apped+0.208
bug+0.170
o+0.167
Neg logit contributions
 tune-0.257
 instructions-0.243
 rules-0.236
 lesson-0.236
 path-0.233
Token knew
Token ID2993
Feature activation-8.08e+00

Pos logit contributions
icked+0.121
apped+0.106
icks+0.105
o+0.094
es+0.090
Neg logit contributions
 tune-0.113
 instructions-0.110
 rules-0.109
izing-0.106
 lesson-0.106
Token that
Token ID326
Feature activation+1.67e-01

Pos logit contributions
 tune+0.630
 instructions+0.557
izing+0.548
 rules+0.543
away+0.539
Neg logit contributions
icked-0.445
apped-0.369
icks-0.367
o-0.289
ow-0.284
Token anything
Token ID1997
Feature activation-4.51e-01

Pos logit contributions
icked+0.010
icks+0.009
apped+0.009
bug+0.008
o+0.008
Neg logit contributions
 tune-0.010
izing-0.010
away-0.010
 instructions-0.010
 rightful-0.009
Token was
Token ID373
Feature activation-3.25e+00

Pos logit contributions
 tune+0.031
 instructions+0.028
 instincts+0.027
 path+0.027
 rightful+0.027
Neg logit contributions
icked-0.028
icks-0.024
apped-0.023
bug-0.020
're-0.020
Token possible
Token ID1744
Feature activation-1.81e+00

Pos logit contributions
 tune+0.191
 instructions+0.174
izing+0.166
 path+0.164
 rules+0.163
Neg logit contributions
icked-0.230
icks-0.206
apped-0.193
bug-0.174
ids-0.167
Token if
Token ID611
Feature activation-1.69e+00

Pos logit contributions
 tune+0.127
 instructions+0.116
izing+0.115
 lesson+0.113
 melody+0.113
Neg logit contributions
icked-0.106
icks-0.092
apped-0.090
bug-0.079
o-0.078