TOP ACTIVATIONS
MAX = 20.484

free so he could repl ant his field . Billy was
that said " rest aur ant ". They went inside and
was a girl named Ch ant elle . She was three
always best . Ign or ant people will always be present
it . Ch ant elle smiled and said ,
One day Ch ant elle 's mom put a
colourful and it made Ch ant elle feel special .
It was a cro iss ant . It was a little
special . Ch ant elle went outside to play
the word " ign or ant " and the number "
of him for being observ ant and noticing the cop .
. Ch ant elle stopped in the shade
says , " Ign or ant , zero ". The teacher
It is a cro iss ant . We can share it
were ignorant . Ign or ant meant they did not know
and said , " Pl ant this and watch it grow
That 's so special Ch ant elle . I 'm so
it down . Ign or ant children like you don 't
". Ch ant elle was very happy .
reached for the cro iss ant . He rubbed it on

MIN ACTIVATIONS
MIN = -15.307

She loved to go on pic n ics with her family
so excited ! She loved pic n ics and spending time
near the cliff and have pic n ics . One day
to be careful when having pic n ics and not to
went on adventures , had pic n ics , and even
the park , and have pic n ics outside .
, ready to have more pic n ics in the future
. She loved to have pic n ics with her family
see the bird and have pic n ics . <|endoftext|> Once
They decided to make more pic n ics and looked in
fun . Lily realized that pic n ics were not so
the warm sunshine and having pic n ics with her family
roof . They would have pic n ics and play games
They missed their walks and pic n ics near the cliff
girl named Lily who loved pic n ics . She would
favourite place to go for pic n ics . <|endoftext|> Once
children running around , families pic n icking , couples st
and it was going on pic n ics !
in the garden and have pic n ics together .
even see other people having pic n ics .

INTERVAL 11.536 - 20.484
CONTAINS 0.180%

to drop it on her foot . " That
and helped make food . Everyone was very happy .
a hammer and a ch isel and they began to carefully
Great job designing that sund ae !" Sarah
. The End . <|endoftext|> Once upon a

INTERVAL 2.589 - 11.536
CONTAINS 19.364%

gl owed brighter and brighter each day ! <|endoftext|> Once upon
saw something fragile on the ground . It was the mouse
, I found a yellow bulb flower for you !" Mom
After a few moments , Maggie 's dad joined
smiled and ran outside to see it . The

INTERVAL -6.359 - 2.589
CONTAINS 75.756%

a mess . And that 's the end
, the door opened and a little girl came in .
. Lots of people were there , and they all had
turn to sleep in the big crib !", while the second
happy anymore . He is small and sad . His hat

INTERVAL -15.307 - -6.359
CONTAINS 4.701%

wanted to bounce . She said , " You are not
liked to play in the park . One day , they
soft pillow through his joyful cl apping . <|endoftext|> John and
sheet . They like the tent . They do not know
child thought it would be fun to send their parents on
Token free
Token ID1479
Feature activation+1.98e+00

Pos logit contributions
rob+0.206
 hungry+0.206
 sleeping+0.197
 circus+0.194
 says+0.193
Neg logit contributions
antly-0.312
ant-0.306
ively-0.299
 efforts-0.290
ered-0.289
Token so
Token ID523
Feature activation-5.84e-02

Pos logit contributions
ant+0.120
antly+0.119
ively+0.116
ered+0.116
 efforts+0.116
Neg logit contributions
rob-0.074
 hungry-0.072
 says-0.071
 sleeping-0.069
 ours-0.068
Token he
Token ID339
Feature activation-3.32e+00

Pos logit contributions
 hungry+0.002
 scary+0.001
 sleeping+0.001
 fun+0.001
 says+0.001
Neg logit contributions
ant-0.004
antly-0.004
ively-0.004
ainer-0.004
ered-0.004
Token could
Token ID714
Feature activation+5.53e+00

Pos logit contributions
 says+0.151
 hungry+0.141
rob+0.140
 sleeping+0.133
 wants+0.133
Neg logit contributions
ant-0.182
ively-0.180
 efforts-0.175
 fingertips-0.172
ainer-0.172
Token repl
Token ID2186
Feature activation-1.22e-01

Pos logit contributions
ively+0.313
ant+0.311
antly+0.310
 efforts+0.297
ices+0.290
Neg logit contributions
 says-0.228
 hungry-0.210
 w-0.203
 ch-0.202
 sleeping-0.199
Tokenant
Token ID415
Feature activation+1.96e+01

Pos logit contributions
rob+0.009
P+0.009
ls+0.008
 ours+0.008
 scar+0.008
Neg logit contributions
 fingertips-0.003
 efforts-0.002
 surely-0.002
 End-0.002
 summit-0.002
Token his
Token ID465
Feature activation-3.81e+00

Pos logit contributions
ant+1.085
antly+1.080
 efforts+1.080
ively+1.060
 fingertips+1.058
Neg logit contributions
rob-0.872
 says-0.820
 hungry-0.801
 circus-0.770
 ours-0.765
Token field
Token ID2214
Feature activation-5.81e+00

Pos logit contributions
 hungry+0.159
rob+0.158
 says+0.157
 circus+0.153
 sleeping+0.152
Neg logit contributions
ant-0.215
ively-0.214
antly-0.212
ered-0.209
 efforts-0.201
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
rob+0.220
 says+0.216
 hungry+0.216
 sleeping+0.207
 ours+0.205
Neg logit contributions
ant-0.351
antly-0.344
ively-0.341
 efforts-0.336
 fingertips-0.328
Token Billy
Token ID15890
Feature activation+2.83e+00

Pos logit contributions
ant+0.062
 efforts+0.059
ively+0.059
ered+0.059
antly+0.058
Neg logit contributions
 says-0.053
rob-0.051
 hungry-0.050
 ours-0.047
 circus-0.046
Token was
Token ID373
Feature activation+3.89e+00

Pos logit contributions
ively+0.157
ant+0.156
 fingertips+0.153
 efforts+0.152
ainer+0.150
Neg logit contributions
 says-0.127
rob-0.123
 hungry-0.118
 sees-0.116
 sleeping-0.116
Token that
Token ID326
Feature activation+5.09e+00

Pos logit contributions
rob+0.061
 scar+0.052
 ours+0.052
 hungry+0.052
 circus+0.051
Neg logit contributions
antly-0.048
ively-0.047
ant-0.046
ered-0.045
 efforts-0.044
Token said
Token ID531
Feature activation-8.45e+00

Pos logit contributions
antly+0.371
ered+0.368
ively+0.363
ant+0.362
 efforts+0.356
Neg logit contributions
rob-0.139
 circus-0.111
 ours-0.106
 hungry-0.103
 scar-0.101
Token "
Token ID366
Feature activation+1.92e-01

Pos logit contributions
 hungry+0.303
 says+0.298
 waiting+0.284
 circus+0.278
 sleeping+0.277
Neg logit contributions
ant-0.535
ively-0.528
 efforts-0.525
antly-0.525
 fingertips-0.508
Tokenrest
Token ID2118
Feature activation+9.25e+00

Pos logit contributions
ant+0.011
antly+0.011
ively+0.011
 efforts+0.011
ered+0.010
Neg logit contributions
rob-0.008
 says-0.008
 hungry-0.007
 ours-0.007
 circus-0.007
Tokenaur
Token ID2899
Feature activation+5.06e+00

Pos logit contributions
ant+0.515
antly+0.511
 efforts+0.505
ively+0.503
ered+0.488
Neg logit contributions
rob-0.412
 says-0.383
 hungry-0.374
 circus-0.366
 ours-0.362
Tokenant
Token ID415
Feature activation+1.96e+01

Pos logit contributions
 efforts+0.236
 fingertips+0.227
antly+0.222
 Days+0.218
ively+0.217
Neg logit contributions
 says-0.260
rob-0.258
 ours-0.247
 sleeping-0.240
 hungry-0.240
Token".
Token ID1911
Feature activation+1.13e+00

Pos logit contributions
antly+1.085
 efforts+1.082
ant+1.080
ively+1.069
 fingertips+1.042
Neg logit contributions
rob-0.866
 says-0.821
 hungry-0.790
 circus-0.782
 ours-0.774
Token They
Token ID1119
Feature activation+4.57e-01

Pos logit contributions
ant+0.073
 efforts+0.072
antly+0.072
ively+0.070
 fingertips+0.069
Neg logit contributions
 says-0.042
 hungry-0.039
rob-0.037
 ours-0.036
 sees-0.035
Token went
Token ID1816
Feature activation-2.84e+00

Pos logit contributions
ainer+0.029
antly+0.029
ant+0.029
ively+0.029
 efforts+0.029
Neg logit contributions
 says-0.017
 hungry-0.016
 sees-0.015
 scar-0.015
rob-0.015
Token inside
Token ID2641
Feature activation+1.77e+00

Pos logit contributions
rob+0.152
 says+0.136
ww+0.132
 ours+0.131
P+0.128
Neg logit contributions
antly-0.138
ively-0.134
ered-0.133
 efforts-0.129
ant-0.127
Token and
Token ID290
Feature activation-1.43e+00

Pos logit contributions
ant+0.121
antly+0.120
ively+0.118
 efforts+0.117
ered+0.115
Neg logit contributions
 says-0.054
rob-0.053
 hungry-0.053
 sleeping-0.051
 circus-0.049
Token was
Token ID373
Feature activation+4.08e+00

Pos logit contributions
 says+0.187
rob+0.173
 sees+0.172
 sleeping+0.167
 wants+0.164
Neg logit contributions
 efforts-0.285
ant-0.278
ainer-0.265
ively-0.263
antly-0.263
Token a
Token ID257
Feature activation+1.77e+00

Pos logit contributions
ant+0.192
antly+0.192
 efforts+0.188
ively+0.188
ered+0.181
Neg logit contributions
rob-0.216
 says-0.204
 hungry-0.195
 circus-0.195
 ours-0.193
Token girl
Token ID2576
Feature activation-2.50e-01

Pos logit contributions
 efforts+0.068
antly+0.066
ant+0.065
 fingertips+0.064
 Days+0.061
Neg logit contributions
rob-0.109
 ours-0.097
 says-0.097
 yells-0.094
use-0.089
Token named
Token ID3706
Feature activation-4.04e-01

Pos logit contributions
rob+0.011
 says+0.010
 hungry+0.010
 circus+0.010
 sleeping+0.009
Neg logit contributions
ant-0.014
antly-0.014
 efforts-0.014
ively-0.014
ered-0.014
Token Ch
Token ID609
Feature activation+9.26e-01

Pos logit contributions
rob+0.012
 says+0.012
 hungry+0.012
 ours+0.011
 sleeping+0.011
Neg logit contributions
antly-0.028
ant-0.028
 efforts-0.028
ively-0.027
ered-0.027
Tokenant
Token ID415
Feature activation+1.96e+01

Pos logit contributions
antly+0.046
 efforts+0.045
ered+0.043
 fingertips+0.043
ant+0.042
Neg logit contributions
rob-0.043
 says-0.042
 sleeping-0.042
 barking-0.040
 hungry-0.040
Tokenelle
Token ID13485
Feature activation+2.46e+00

Pos logit contributions
 efforts+1.344
 fingertips+1.335
ered+1.322
ively+1.303
antly+1.295
Neg logit contributions
rob-0.667
 says-0.635
 hungry-0.580
 sees-0.572
 scary-0.568
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
ainer+0.177
 efforts+0.176
ively+0.175
ant+0.174
antly+0.172
Neg logit contributions
 says-0.071
 hungry-0.069
 sleeping-0.067
 sees-0.065
 wants-0.062
Token She
Token ID1375
Feature activation-1.87e+00

Pos logit contributions
ant+0.081
 efforts+0.081
ively+0.080
antly+0.080
ered+0.078
Neg logit contributions
rob-0.044
 says-0.042
 hungry-0.040
 scar-0.038
 sleeping-0.038
Token was
Token ID373
Feature activation+4.05e+00

Pos logit contributions
rob+0.101
 says+0.095
 ours+0.094
 hungry+0.093
 circus+0.092
Neg logit contributions
ant-0.086
antly-0.086
 efforts-0.083
ively-0.083
ered-0.082
Token three
Token ID1115
Feature activation+8.65e+00

Pos logit contributions
ant+0.227
antly+0.225
 efforts+0.224
ively+0.223
ered+0.217
Neg logit contributions
rob-0.180
 says-0.166
 circus-0.157
 ours-0.156
 hungry-0.156
Token always
Token ID1464
Feature activation+2.04e+00

Pos logit contributions
 hungry+0.207
rob+0.206
 says+0.202
 sleeping+0.198
 ours+0.195
Neg logit contributions
ant-0.327
antly-0.321
 efforts-0.318
ively-0.317
 fingertips-0.309
Token best
Token ID1266
Feature activation+2.35e+00

Pos logit contributions
antly+0.123
 efforts+0.121
ant+0.120
ively+0.119
ered+0.119
Neg logit contributions
rob-0.084
 says-0.074
 circus-0.072
 scar-0.068
 ours-0.068
Token.
Token ID13
Feature activation+8.49e-01

Pos logit contributions
ant+0.135
antly+0.134
ively+0.133
ered+0.129
 efforts+0.127
Neg logit contributions
 says-0.096
 hungry-0.095
rob-0.094
 sleeping-0.091
 circus-0.089
Token Ign
Token ID16583
Feature activation+3.83e+00

Pos logit contributions
antly+0.044
ant+0.044
 efforts+0.043
ively+0.043
ered+0.042
Neg logit contributions
rob-0.041
 says-0.037
 hungry-0.037
 circus-0.037
 ours-0.036
Tokenor
Token ID273
Feature activation+1.01e+01

Pos logit contributions
antly+0.170
 efforts+0.168
ant+0.166
ively+0.164
 fingertips+0.161
Neg logit contributions
rob-0.216
 says-0.204
 hungry-0.198
 ours-0.192
 circus-0.192
Tokenant
Token ID415
Feature activation+1.96e+01

Pos logit contributions
 fingertips+0.522
 efforts+0.513
ainer+0.491
 Days+0.483
antly+0.480
Neg logit contributions
rob-0.512
 says-0.504
 wants-0.461
 blows-0.449
 sees-0.446
Token people
Token ID661
Feature activation+5.69e+00

Pos logit contributions
ant+1.182
antly+1.179
 efforts+1.169
ively+1.163
ered+1.132
Neg logit contributions
rob-0.773
 says-0.723
 hungry-0.694
 circus-0.676
 ours-0.663
Token will
Token ID481
Feature activation-2.04e+00

Pos logit contributions
ant+0.321
antly+0.319
 efforts+0.316
ively+0.315
ered+0.306
Neg logit contributions
rob-0.247
 says-0.229
 hungry-0.224
 circus-0.220
 ours-0.217
Token always
Token ID1464
Feature activation+2.12e+00

Pos logit contributions
rob+0.099
 says+0.098
 hungry+0.095
 circus+0.094
 sleeping+0.093
Neg logit contributions
antly-0.101
ant-0.101
ively-0.098
 efforts-0.097
 fingertips-0.095
Token be
Token ID307
Feature activation+1.55e+00

Pos logit contributions
ant+0.098
antly+0.097
 efforts+0.095
ively+0.095
ered+0.092
Neg logit contributions
rob-0.114
 says-0.109
 hungry-0.107
 circus-0.105
 ours-0.104
Token present
Token ID1944
Feature activation+7.78e-01

Pos logit contributions
ant+0.091
antly+0.091
 efforts+0.091
ively+0.090
ered+0.089
Neg logit contributions
rob-0.066
 says-0.059
 circus-0.057
 hungry-0.055
 ours-0.054
Token it
Token ID340
Feature activation+2.06e+00

Pos logit contributions
antly+0.443
 efforts+0.437
ant+0.431
ainer+0.430
 fingertips+0.421
Neg logit contributions
 says-0.255
rob-0.253
 hungry-0.246
 waiting-0.243
 ours-0.242
Token.
Token ID13
Feature activation+6.52e-01

Pos logit contributions
ant+0.119
antly+0.118
 efforts+0.117
ively+0.116
 fingertips+0.113
Neg logit contributions
rob-0.085
 says-0.083
 hungry-0.081
 sleeping-0.077
 circus-0.077
Token
Token ID198
Feature activation-1.84e+00

Pos logit contributions
ant+0.036
antly+0.036
 efforts+0.035
ively+0.035
ered+0.034
Neg logit contributions
rob-0.029
 says-0.027
 hungry-0.027
 circus-0.026
 ours-0.026
Token
Token ID198
Feature activation-1.84e+00

Pos logit contributions
rob+0.079
 says+0.074
 hungry+0.072
 circus+0.070
 ours+0.069
Neg logit contributions
ant-0.105
antly-0.104
 efforts-0.104
ively-0.102
ered-0.100
TokenCh
Token ID1925
Feature activation+4.62e+00

Pos logit contributions
rob+0.090
 says+0.085
 hungry+0.083
 circus+0.082
 ours+0.081
Neg logit contributions
ant-0.094
antly-0.094
 efforts-0.093
ively-0.092
ered-0.089
Tokenant
Token ID415
Feature activation+1.96e+01

Pos logit contributions
antly+0.183
 fingertips+0.182
 efforts+0.173
 Days+0.165
 nobody+0.164
Neg logit contributions
rob-0.264
 says-0.257
 sleeping-0.243
etchup-0.235
 blows-0.233
Tokenelle
Token ID13485
Feature activation+2.44e+00

Pos logit contributions
 efforts+1.079
ant+1.070
antly+1.070
ively+1.066
 fingertips+1.061
Neg logit contributions
rob-0.881
 says-0.837
 hungry-0.785
 sees-0.768
 circus-0.767
Token smiled
Token ID13541
Feature activation+1.27e+00

Pos logit contributions
ively+0.140
ant+0.139
ainer+0.137
 efforts+0.136
antly+0.136
Neg logit contributions
 says-0.104
 sees-0.096
rob-0.096
 hungry-0.095
 wants-0.093
Token and
Token ID290
Feature activation-1.82e+00

Pos logit contributions
ant+0.087
ively+0.085
 efforts+0.084
antly+0.083
ered+0.083
Neg logit contributions
 hungry-0.040
 says-0.039
 sleeping-0.036
rob-0.036
 scary-0.035
Token said
Token ID531
Feature activation-8.99e+00

Pos logit contributions
rob+0.067
 says+0.061
 hungry+0.060
 circus+0.058
 ours+0.058
Neg logit contributions
ant-0.115
antly-0.115
 efforts-0.114
ively-0.113
ered-0.111
Token,
Token ID11
Feature activation+1.48e+00

Pos logit contributions
 hungry+0.339
 says+0.336
 sees+0.300
 scary+0.297
rob+0.296
Neg logit contributions
ively-0.552
ant-0.552
 efforts-0.540
antly-0.538
ainer-0.532
Token
Token ID198
Feature activation-1.28e+00

Pos logit contributions
ant+0.069
 efforts+0.068
antly+0.068
ively+0.067
ered+0.066
Neg logit contributions
rob-0.059
 says-0.056
 hungry-0.054
 circus-0.053
 sleeping-0.052
Token
Token ID198
Feature activation-1.29e+00

Pos logit contributions
rob+0.056
 says+0.052
 hungry+0.051
 circus+0.050
 ours+0.049
Neg logit contributions
ant-0.073
antly-0.072
 efforts-0.071
ively-0.071
ered-0.069
TokenOne
Token ID3198
Feature activation-3.95e+00

Pos logit contributions
rob+0.057
 says+0.053
 ours+0.053
 hungry+0.053
 circus+0.051
Neg logit contributions
antly-0.072
ant-0.072
 efforts-0.069
ively-0.069
ered-0.068
Token day
Token ID1110
Feature activation-1.73e+00

Pos logit contributions
rob+0.153
 says+0.141
 hungry+0.138
 circus+0.134
 ours+0.132
Neg logit contributions
ant-0.244
antly-0.242
 efforts-0.240
ively-0.239
ered-0.233
Token Ch
Token ID609
Feature activation+8.99e-01

Pos logit contributions
rob+0.072
 says+0.069
 hungry+0.067
 sleeping+0.067
 scar+0.066
Neg logit contributions
 efforts-0.102
ively-0.101
ant-0.100
antly-0.098
 fingertips-0.097
Tokenant
Token ID415
Feature activation+1.96e+01

Pos logit contributions
antly+0.042
 efforts+0.039
ered+0.039
ively+0.039
 fingertips+0.039
Neg logit contributions
rob-0.047
 says-0.044
 sleeping-0.043
 sees-0.042
 blows-0.041
Tokenelle
Token ID13485
Feature activation+2.40e+00

Pos logit contributions
 efforts+1.147
antly+1.132
 fingertips+1.129
ant+1.124
ively+1.120
Neg logit contributions
rob-0.833
 says-0.783
 hungry-0.723
 sleeping-0.715
 circus-0.712
Token's
Token ID338
Feature activation-1.69e+00

Pos logit contributions
ant+0.145
antly+0.143
ively+0.142
 efforts+0.142
 fingertips+0.139
Neg logit contributions
 says-0.092
rob-0.091
 hungry-0.088
 sees-0.085
 sleeping-0.084
Token mom
Token ID1995
Feature activation+9.10e+00

Pos logit contributions
 hungry+0.088
 says+0.084
 sleeping+0.084
 scary+0.082
 black+0.081
Neg logit contributions
antly-0.082
ant-0.082
ered-0.082
ively-0.082
vered-0.077
Token put
Token ID1234
Feature activation+9.75e-01

Pos logit contributions
antly+0.548
ant+0.541
ered+0.535
ively+0.528
 efforts+0.527
Neg logit contributions
rob-0.378
 ours-0.338
 circus-0.330
 hungry-0.296
ww-0.284
Token a
Token ID257
Feature activation+1.62e+00

Pos logit contributions
antly+0.053
ant+0.053
ered+0.051
ively+0.051
 efforts+0.050
Neg logit contributions
rob-0.043
 circus-0.041
 says-0.040
 wants-0.039
 ours-0.039
Token colourful
Token ID44799
Feature activation-2.71e+00

Pos logit contributions
rob+0.059
 circus+0.052
 says+0.049
 ours+0.049
 hungry+0.047
Neg logit contributions
antly-0.091
 fingertips-0.088
ered-0.087
 efforts-0.087
ant-0.086
Token and
Token ID290
Feature activation-1.47e+00

Pos logit contributions
rob+0.107
 hungry+0.102
 says+0.101
 sleeping+0.100
 circus+0.098
Neg logit contributions
ant-0.164
antly-0.161
ively-0.161
 efforts-0.160
ered-0.156
Token it
Token ID340
Feature activation+2.11e+00

Pos logit contributions
rob+0.060
 says+0.055
 hungry+0.054
 circus+0.053
 ours+0.052
Neg logit contributions
antly-0.087
ant-0.087
 efforts-0.085
ively-0.085
ered-0.083
Token made
Token ID925
Feature activation-2.75e+00

Pos logit contributions
ant+0.114
antly+0.114
 efforts+0.112
ively+0.111
ered+0.109
Neg logit contributions
rob-0.097
 says-0.090
 hungry-0.089
 circus-0.087
 ours-0.087
Token Ch
Token ID609
Feature activation+8.75e-01

Pos logit contributions
 scar+0.075
 says+0.075
 scary+0.070
 sleeping+0.069
rob+0.066
Neg logit contributions
 efforts-0.194
ant-0.190
 fingertips-0.188
antly-0.187
ved-0.185
Tokenant
Token ID415
Feature activation+1.96e+01

Pos logit contributions
antly+0.041
 efforts+0.037
ered+0.036
 fingertips+0.036
ved+0.035
Neg logit contributions
rob-0.046
 says-0.042
 sleeping-0.041
 barking-0.040
 blows-0.040
Tokenelle
Token ID13485
Feature activation+2.40e+00

Pos logit contributions
ively+1.218
 efforts+1.202
ant+1.197
antly+1.190
ered+1.189
Neg logit contributions
 says-0.715
rob-0.704
 hungry-0.670
 scar-0.669
 scary-0.669
Token feel
Token ID1254
Feature activation+7.67e+00

Pos logit contributions
ively+0.151
ant+0.149
antly+0.143
ainer+0.142
growth+0.141
Neg logit contributions
 says-0.096
 hungry-0.094
 sleeping-0.087
 wants-0.087
 sees-0.086
Token special
Token ID2041
Feature activation-1.27e+00

Pos logit contributions
ant+0.561
ively+0.541
antly+0.541
 efforts+0.537
ered+0.515
Neg logit contributions
 hungry-0.235
 says-0.210
 scary-0.208
 sleeping-0.198
rob-0.195
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
 hungry+0.047
 says+0.046
 sleeping+0.045
rob+0.044
 waiting+0.043
Neg logit contributions
ant-0.079
antly-0.078
ively-0.078
 efforts-0.076
ered-0.075
Token
Token ID198
Feature activation-1.32e+00

Pos logit contributions
 efforts+0.071
ant+0.071
ively+0.070
antly+0.069
ered+0.068
Neg logit contributions
 says-0.051
rob-0.051
 hungry-0.048
 sleeping-0.046
 scar-0.046
Token It
Token ID632
Feature activation+5.10e+00

Pos logit contributions
antly+0.001
ant+0.001
ively+0.001
 efforts+0.001
 fingertips+0.001
Neg logit contributions
rob-0.002
 hungry-0.002
 says-0.002
 circus-0.002
 ours-0.002
Token was
Token ID373
Feature activation+3.23e+00

Pos logit contributions
antly+0.189
ered+0.178
ant+0.159
ively+0.158
 fingertips+0.154
Neg logit contributions
rob-0.319
 ours-0.311
 circus-0.294
 hungry-0.284
 scar-0.284
Token a
Token ID257
Feature activation+1.15e+00

Pos logit contributions
 efforts+0.187
antly+0.186
 fingertips+0.185
ered+0.184
ively+0.183
Neg logit contributions
rob-0.148
use-0.115
 circus-0.108
 says-0.106
P-0.104
Token cro
Token ID6763
Feature activation-1.16e+01

Pos logit contributions
 fingertips+0.060
antly+0.059
 efforts+0.059
 bottom+0.056
 nobody+0.055
Neg logit contributions
rob-0.056
 says-0.044
bat-0.042
 yells-0.041
 ours-0.041
Tokeniss
Token ID747
Feature activation-1.69e+00

Pos logit contributions
rob+0.444
 says+0.424
 circus+0.410
 ours+0.410
 hungry+0.405
Neg logit contributions
ant-0.720
antly-0.710
ively-0.704
ered-0.688
 efforts-0.678
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
rob+0.118
 says+0.117
 ours+0.115
 sleeping+0.112
 hungry+0.112
Neg logit contributions
 efforts-0.042
antly-0.041
 Days-0.041
 fingertips-0.039
growth-0.035
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
ant+1.091
antly+1.084
 efforts+1.071
ively+1.067
ered+1.034
Neg logit contributions
rob-0.861
 says-0.798
 hungry-0.783
 circus-0.766
 ours-0.758
Token It
Token ID632
Feature activation+5.13e+00

Pos logit contributions
antly+0.005
ant+0.005
ively+0.004
 efforts+0.004
ered+0.004
Neg logit contributions
rob-0.006
 says-0.005
 hungry-0.005
 circus-0.005
 sleeping-0.005
Token was
Token ID373
Feature activation+3.29e+00

Pos logit contributions
antly+0.211
ered+0.202
ant+0.190
ively+0.187
 efforts+0.184
Neg logit contributions
rob-0.297
 ours-0.291
 circus-0.268
 hungry-0.261
use-0.256
Token a
Token ID257
Feature activation+1.16e+00

Pos logit contributions
 efforts+0.184
antly+0.181
ered+0.180
ively+0.179
ant+0.179
Neg logit contributions
rob-0.157
use-0.124
 circus-0.120
 says-0.119
P-0.117
Token little
Token ID1310
Feature activation+5.44e+00

Pos logit contributions
 efforts+0.062
antly+0.062
 fingertips+0.062
ant+0.057
 bottom+0.056
Neg logit contributions
rob-0.054
 says-0.045
 ours-0.042
 yells-0.041
bat-0.040
Token special
Token ID2041
Feature activation-1.27e+00

Pos logit contributions
ant+0.561
ively+0.541
antly+0.541
 efforts+0.537
ered+0.515
Neg logit contributions
 hungry-0.235
 says-0.210
 scary-0.208
 sleeping-0.198
rob-0.195
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
 hungry+0.047
 says+0.046
 sleeping+0.045
rob+0.044
 waiting+0.043
Neg logit contributions
ant-0.079
antly-0.078
ively-0.078
 efforts-0.076
ered-0.075
Token
Token ID198
Feature activation-1.32e+00

Pos logit contributions
 efforts+0.071
ant+0.071
ively+0.070
antly+0.069
ered+0.068
Neg logit contributions
 says-0.051
rob-0.051
 hungry-0.048
 sleeping-0.046
 scar-0.046
Token
Token ID198
Feature activation-1.34e+00

Pos logit contributions
rob+0.054
 says+0.051
 hungry+0.048
 circus+0.047
 sleeping+0.046
Neg logit contributions
ant-0.078
 efforts-0.078
antly-0.077
ively-0.076
 fingertips-0.074
TokenCh
Token ID1925
Feature activation+4.58e+00

Pos logit contributions
rob+0.070
 says+0.066
 hungry+0.064
 circus+0.063
 sleeping+0.062
Neg logit contributions
ant-0.064
antly-0.064
 efforts-0.064
ively-0.063
ered-0.061
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
antly+0.212
 fingertips+0.211
 efforts+0.205
 nobody+0.197
mates+0.195
Neg logit contributions
rob-0.248
 says-0.222
 blows-0.207
 sleeping-0.205
etchup-0.205
Tokenelle
Token ID13485
Feature activation+2.39e+00

Pos logit contributions
 efforts+1.175
 fingertips+1.165
ively+1.151
antly+1.147
ant+1.134
Neg logit contributions
rob-0.814
 says-0.768
 sees-0.693
 scar-0.689
 wants-0.686
Token went
Token ID1816
Feature activation-2.84e+00

Pos logit contributions
ively+0.149
ant+0.148
ainer+0.147
antly+0.145
 efforts+0.144
Neg logit contributions
 says-0.093
 hungry-0.083
 sees-0.083
 wants-0.081
 sleeping-0.078
Token outside
Token ID2354
Feature activation-1.01e+00

Pos logit contributions
rob+0.161
 says+0.143
 ours+0.142
ww+0.141
 wants+0.137
Neg logit contributions
antly-0.129
ered-0.127
ively-0.125
 efforts-0.121
ant-0.119
Token to
Token ID284
Feature activation-1.32e+00

Pos logit contributions
rob+0.038
 says+0.035
 hungry+0.034
 circus+0.033
 ours+0.032
Neg logit contributions
ant-0.063
antly-0.063
 efforts-0.062
ively-0.062
ered-0.060
Token play
Token ID711
Feature activation-1.17e+00

Pos logit contributions
rob+0.055
 says+0.052
 hungry+0.051
 circus+0.050
 ours+0.049
Neg logit contributions
ant-0.077
antly-0.077
 efforts-0.076
ively-0.075
ered-0.073
Token the
Token ID262
Feature activation+1.67e+00

Pos logit contributions
rob+0.167
 says+0.155
 hungry+0.153
 ours+0.150
 circus+0.149
Neg logit contributions
ant-0.161
antly-0.159
 efforts-0.157
ively-0.155
ered-0.151
Token word
Token ID1573
Feature activation+1.57e+00

Pos logit contributions
ant+0.085
antly+0.085
 efforts+0.084
ively+0.082
 fingertips+0.081
Neg logit contributions
rob-0.082
 says-0.074
 ours-0.072
 hungry-0.072
 sleeping-0.070
Token "
Token ID366
Feature activation-1.53e-01

Pos logit contributions
ant+0.087
antly+0.087
 efforts+0.086
ively+0.086
ered+0.083
Neg logit contributions
rob-0.069
 says-0.065
 hungry-0.063
 circus-0.062
 ours-0.061
Tokenign
Token ID570
Feature activation+8.57e-02

Pos logit contributions
rob+0.007
 says+0.007
 hungry+0.006
 ours+0.006
 circus+0.006
Neg logit contributions
ant-0.008
antly-0.008
ively-0.008
 efforts-0.008
ered-0.008
Tokenor
Token ID273
Feature activation+1.02e+01

Pos logit contributions
 efforts+0.004
antly+0.004
 fingertips+0.004
ively+0.003
ered+0.003
Neg logit contributions
 says-0.005
rob-0.005
 hungry-0.005
 ours-0.004
 wants-0.004
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
 efforts+0.327
 fingertips+0.307
antly+0.299
ively+0.293
growth+0.283
Neg logit contributions
 says-0.673
rob-0.664
 wants-0.637
 hungry-0.631
 ours-0.624
Token"
Token ID1
Feature activation-8.96e-01

Pos logit contributions
 efforts+1.062
antly+1.044
ively+1.028
 fingertips+1.023
ant+1.019
Neg logit contributions
rob-0.872
 says-0.853
 hungry-0.797
 circus-0.786
 ours-0.781
Token and
Token ID290
Feature activation-1.66e+00

Pos logit contributions
rob+0.040
 ours+0.036
 circus+0.036
 hungry+0.036
 says+0.036
Neg logit contributions
antly-0.050
ant-0.050
ively-0.049
 efforts-0.048
ered-0.046
Token the
Token ID262
Feature activation+1.68e+00

Pos logit contributions
rob+0.053
 hungry+0.046
 says+0.045
 ours+0.044
 circus+0.044
Neg logit contributions
ant-0.114
antly-0.114
 efforts-0.112
ively-0.111
ered-0.110
Token number
Token ID1271
Feature activation+1.84e+00

Pos logit contributions
ant+0.088
antly+0.088
 efforts+0.087
ively+0.087
ered+0.084
Neg logit contributions
rob-0.080
 says-0.075
 hungry-0.073
 circus-0.072
 ours-0.071
Token "
Token ID366
Feature activation-1.49e-01

Pos logit contributions
ant+0.111
antly+0.111
 efforts+0.111
ively+0.110
ered+0.107
Neg logit contributions
rob-0.071
 says-0.067
 hungry-0.064
 circus-0.063
 ours-0.062
Token of
Token ID286
Feature activation+7.01e+00

Pos logit contributions
rob+0.013
 says+0.012
 hungry+0.012
 circus+0.012
 ours+0.012
Neg logit contributions
ant-0.009
antly-0.009
 efforts-0.009
ively-0.009
ered-0.009
Token him
Token ID683
Feature activation-8.01e-01

Pos logit contributions
antly+0.382
ant+0.380
 efforts+0.378
ively+0.370
ered+0.362
Neg logit contributions
rob-0.323
 says-0.303
 hungry-0.296
 circus-0.292
 ours-0.287
Token for
Token ID329
Feature activation-5.43e+00

Pos logit contributions
 says+0.024
 hungry+0.023
 sleeping+0.022
 waiting+0.021
 scary+0.020
Neg logit contributions
ant-0.057
antly-0.056
ively-0.056
 fingertips-0.054
ered-0.054
Token being
Token ID852
Feature activation-4.03e+00

Pos logit contributions
rob+0.184
 hungry+0.173
 says+0.170
 sleeping+0.165
 circus+0.165
Neg logit contributions
antly-0.359
ant-0.356
 efforts-0.349
ively-0.349
 fingertips-0.339
Token observ
Token ID3799
Feature activation+1.37e+00

Pos logit contributions
rob+0.151
 hungry+0.147
 says+0.143
 circus+0.136
 sleeping+0.135
Neg logit contributions
ant-0.251
antly-0.250
 efforts-0.246
ively-0.246
 fingertips-0.240
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
 fingertips+0.025
 surely+0.024
antly+0.024
 efforts+0.024
growth+0.023
Neg logit contributions
rob-0.106
 ours-0.101
 says-0.099
 wants-0.099
 yells-0.098
Token and
Token ID290
Feature activation-1.94e+00

Pos logit contributions
antly+1.190
ant+1.189
 efforts+1.184
 fingertips+1.180
ively+1.174
Neg logit contributions
rob-0.725
 says-0.709
 hungry-0.668
 sleeping-0.648
 circus-0.636
Token noticing
Token ID28107
Feature activation+2.66e+00

Pos logit contributions
rob+0.084
 says+0.075
 circus+0.073
 hungry+0.072
 ours+0.071
Neg logit contributions
ant-0.112
antly-0.111
 efforts-0.111
ively-0.109
ered-0.107
Token the
Token ID262
Feature activation+1.83e+00

Pos logit contributions
antly+0.143
 efforts+0.139
ant+0.139
ively+0.137
ainer+0.137
Neg logit contributions
 says-0.119
rob-0.118
 hungry-0.113
 circus-0.111
 scary-0.111
Token cop
Token ID2243
Feature activation-2.25e+00

Pos logit contributions
ant+0.105
 efforts+0.104
antly+0.104
ively+0.102
ered+0.099
Neg logit contributions
rob-0.079
 says-0.072
 hungry-0.070
 ours-0.068
 circus-0.068
Token.
Token ID13
Feature activation+6.84e-01

Pos logit contributions
rob+0.097
 says+0.094
 hungry+0.091
 circus+0.088
 sleeping+0.088
Neg logit contributions
ant-0.126
antly-0.125
 efforts-0.124
ively-0.123
 fingertips-0.119
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
rob+0.048
 hungry+0.047
 says+0.047
 sleeping+0.046
 ours+0.044
Neg logit contributions
ant-0.075
antly-0.075
ively-0.074
 efforts-0.073
ered-0.071
Token
Token ID220
Feature activation-2.23e-01

Pos logit contributions
ant+0.080
 efforts+0.080
ively+0.080
ered+0.078
antly+0.076
Neg logit contributions
 says-0.046
 hungry-0.042
rob-0.041
 scar-0.040
 sleeping-0.040
Token
Token ID198
Feature activation-1.31e+00

Pos logit contributions
 says+0.007
rob+0.007
 hungry+0.006
 sees+0.006
 sleeping+0.006
Neg logit contributions
 efforts-0.015
ant-0.015
ively-0.015
antly-0.014
 fingertips-0.014
Token
Token ID198
Feature activation-1.32e+00

Pos logit contributions
rob+0.045
 says+0.044
 hungry+0.041
 sleeping+0.040
 sees+0.039
Neg logit contributions
 efforts-0.086
ant-0.085
ively-0.083
 fingertips-0.082
antly-0.082
TokenCh
Token ID1925
Feature activation+4.56e+00

Pos logit contributions
rob+0.060
 says+0.059
 hungry+0.057
 sleeping+0.057
P+0.056
Neg logit contributions
 efforts-0.069
ively-0.068
ant-0.067
 fingertips-0.066
antly-0.065
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
antly+0.208
 fingertips+0.206
 efforts+0.200
 nobody+0.197
mates+0.195
Neg logit contributions
rob-0.246
 says-0.220
 blows-0.208
etchup-0.205
 sleeping-0.202
Tokenelle
Token ID13485
Feature activation+2.37e+00

Pos logit contributions
 efforts+1.181
 fingertips+1.180
ively+1.157
antly+1.152
ainer+1.144
Neg logit contributions
rob-0.797
 says-0.761
 sees-0.685
 scar-0.681
 wants-0.674
Token stopped
Token ID5025
Feature activation+3.33e+00

Pos logit contributions
ively+0.147
ant+0.147
ainer+0.144
antly+0.143
 fingertips+0.142
Neg logit contributions
 says-0.092
 hungry-0.085
 sees-0.083
 wants-0.080
 sleeping-0.079
Token in
Token ID287
Feature activation+5.98e-01

Pos logit contributions
 efforts+0.178
ant+0.178
antly+0.175
ively+0.175
 fingertips+0.171
Neg logit contributions
rob-0.169
 ours-0.151
ww-0.141
 circus-0.140
 says-0.136
Token the
Token ID262
Feature activation+2.06e+00

Pos logit contributions
ant+0.031
antly+0.031
 efforts+0.031
ively+0.031
ered+0.030
Neg logit contributions
rob-0.029
 says-0.027
 hungry-0.027
 circus-0.026
 ours-0.026
Token shade
Token ID17979
Feature activation+1.76e+00

Pos logit contributions
antly+0.130
 efforts+0.126
ant+0.125
ively+0.124
 fingertips+0.123
Neg logit contributions
rob-0.079
 says-0.070
 ours-0.069
 hungry-0.066
 circus-0.066
Token says
Token ID1139
Feature activation-1.36e+01

Pos logit contributions
rob+0.033
 ours+0.029
 hungry+0.029
 circus+0.027
 sleeping+0.027
Neg logit contributions
antly-0.071
ant-0.071
 efforts-0.070
ively-0.069
ered-0.069
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
 hungry+0.475
 says+0.469
 sees+0.419
 circus+0.418
 sleeping+0.417
Neg logit contributions
ively-0.849
 fingertips-0.848
growth-0.837
 efforts-0.835
ant-0.834
Token "
Token ID366
Feature activation-1.44e-01

Pos logit contributions
ant+0.068
antly+0.067
 efforts+0.067
ively+0.066
ered+0.064
Neg logit contributions
rob-0.063
 says-0.060
 hungry-0.058
 circus-0.057
 ours-0.056
TokenIgn
Token ID32916
Feature activation+6.72e-02

Pos logit contributions
rob+0.006
 says+0.006
 hungry+0.006
 ours+0.006
 sleeping+0.006
Neg logit contributions
ant-0.008
antly-0.008
ively-0.008
 efforts-0.008
ered-0.008
Tokenor
Token ID273
Feature activation+1.02e+01

Pos logit contributions
 efforts+0.003
antly+0.003
ant+0.003
ively+0.002
 fingertips+0.002
Neg logit contributions
rob-0.004
 says-0.004
 hungry-0.004
 circus-0.004
 ours-0.004
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
 efforts+0.405
 fingertips+0.395
ively+0.389
antly+0.384
ainer+0.371
Neg logit contributions
 says-0.597
rob-0.584
 wants-0.556
 hungry-0.550
 sees-0.546
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
 efforts+1.099
antly+1.084
ively+1.075
 fingertips+1.069
ant+1.057
Neg logit contributions
rob-0.830
 says-0.819
 hungry-0.757
 circus-0.748
 ours-0.734
Token zero
Token ID6632
Feature activation+6.68e+00

Pos logit contributions
antly+0.077
ant+0.076
 efforts+0.075
ively+0.075
ered+0.073
Neg logit contributions
rob-0.054
 says-0.052
 hungry-0.051
 circus-0.049
 ours-0.048
Token".
Token ID1911
Feature activation+6.70e-01

Pos logit contributions
antly+0.415
ant+0.414
 efforts+0.411
ively+0.410
ered+0.398
Neg logit contributions
rob-0.248
 says-0.232
 hungry-0.222
 circus-0.218
 ours-0.214
Token The
Token ID383
Feature activation+6.56e+00

Pos logit contributions
ant+0.046
 efforts+0.043
ered+0.043
ively+0.043
antly+0.043
Neg logit contributions
 says-0.025
 hungry-0.022
 sees-0.021
rob-0.020
 sleeping-0.020
Token teacher
Token ID4701
Feature activation+6.50e+00

Pos logit contributions
antly+0.331
ant+0.325
ively+0.319
 efforts+0.315
ered+0.314
Neg logit contributions
 says-0.325
rob-0.317
 hungry-0.315
 circus-0.310
 sees-0.302
Token It
Token ID632
Feature activation+5.16e+00

Pos logit contributions
ant+0.014
antly+0.014
 efforts+0.014
ively+0.013
ered+0.013
Neg logit contributions
rob-0.011
 says-0.011
 hungry-0.011
 circus-0.010
 ours-0.010
Token is
Token ID318
Feature activation-5.31e+00

Pos logit contributions
antly+0.303
ered+0.300
ant+0.290
ively+0.282
 efforts+0.271
Neg logit contributions
rob-0.206
 circus-0.175
 ours-0.166
P-0.160
 hungry-0.160
Token a
Token ID257
Feature activation+1.20e+00

Pos logit contributions
rob+0.285
use+0.215
 says+0.213
 circus+0.213
P+0.208
Neg logit contributions
 efforts-0.269
antly-0.266
ered-0.264
 fingertips-0.263
ant-0.259
Token cro
Token ID6763
Feature activation-1.14e+01

Pos logit contributions
 fingertips+0.062
 efforts+0.062
antly+0.062
 nobody+0.058
 bottom+0.058
Neg logit contributions
rob-0.058
 says-0.048
bat-0.045
 yells-0.044
 blows-0.044
Tokeniss
Token ID747
Feature activation-1.64e+00

Pos logit contributions
rob+0.448
 says+0.433
 circus+0.416
 ours+0.415
 hungry+0.413
Neg logit contributions
ant-0.699
ively-0.683
antly-0.683
ered-0.664
 efforts-0.648
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
rob+0.111
 says+0.108
 ours+0.106
 hungry+0.104
 sleeping+0.103
Neg logit contributions
 efforts-0.048
antly-0.047
 Days-0.044
 fingertips-0.044
ively-0.044
Token.
Token ID13
Feature activation+2.61e-01

Pos logit contributions
ant+1.095
antly+1.076
ively+1.060
 efforts+1.049
ered+1.012
Neg logit contributions
rob-0.867
 hungry-0.785
 says-0.780
 circus-0.760
 ours-0.749
Token We
Token ID775
Feature activation-3.19e+00

Pos logit contributions
ant+0.014
antly+0.014
 efforts+0.014
ively+0.014
ered+0.014
Neg logit contributions
rob-0.012
 says-0.011
 hungry-0.011
 circus-0.011
 ours-0.010
Token can
Token ID460
Feature activation-1.09e+00

Pos logit contributions
rob+0.124
 says+0.119
 hungry+0.116
 circus+0.111
 ours+0.111
Neg logit contributions
antly-0.194
ant-0.193
 efforts-0.191
ively-0.191
 fingertips-0.183
Token share
Token ID2648
Feature activation+3.14e+00

Pos logit contributions
 says+0.047
 hungry+0.044
 circus+0.043
 ours+0.043
 sees+0.042
Neg logit contributions
antly-0.064
 efforts-0.063
ant-0.061
ively-0.061
vered-0.058
Token it
Token ID340
Feature activation+1.86e+00

Pos logit contributions
ant+0.155
antly+0.152
ively+0.149
 efforts+0.148
ered+0.146
Neg logit contributions
rob-0.163
 says-0.146
 hungry-0.146
 circus-0.142
 sleeping-0.139
Token were
Token ID547
Feature activation+4.63e+00

Pos logit contributions
ainer+0.153
antly+0.151
ant+0.147
 efforts+0.145
 fingertips+0.144
Neg logit contributions
 says-0.128
 sees-0.116
 wants-0.115
 hungry-0.114
 waiting-0.107
Token ignorant
Token ID20109
Feature activation+4.30e-01

Pos logit contributions
ively+0.253
ered+0.250
ant+0.245
 efforts+0.240
 nobody+0.233
Neg logit contributions
rob-0.223
use-0.192
ww-0.186
iw-0.177
 scar-0.175
Token.
Token ID13
Feature activation+7.95e-01

Pos logit contributions
ant+0.023
antly+0.023
 efforts+0.023
ively+0.023
ered+0.022
Neg logit contributions
rob-0.020
 says-0.019
 hungry-0.018
 circus-0.018
 ours-0.018
Token Ign
Token ID16583
Feature activation+4.07e+00

Pos logit contributions
ant+0.050
antly+0.049
 efforts+0.049
ively+0.048
ered+0.048
Neg logit contributions
 says-0.030
rob-0.029
 hungry-0.029
 circus-0.027
 ours-0.027
Tokenor
Token ID273
Feature activation+1.01e+01

Pos logit contributions
ant+0.133
antly+0.124
ively+0.123
ered+0.121
 efforts+0.117
Neg logit contributions
rob-0.275
 circus-0.267
 hungry-0.266
 says-0.263
 ours-0.260
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
 efforts+0.479
 fingertips+0.471
ainer+0.451
 Days+0.440
mates+0.435
Neg logit contributions
 says-0.543
rob-0.538
 wants-0.493
 sees-0.480
 blows-0.475
Token meant
Token ID4001
Feature activation-7.22e-01

Pos logit contributions
antly+1.168
ant+1.167
 efforts+1.161
ively+1.146
 fingertips+1.126
Neg logit contributions
rob-0.773
 says-0.741
 hungry-0.700
 circus-0.681
 ours-0.672
Token they
Token ID484
Feature activation-2.22e+00

Pos logit contributions
 says+0.031
 hungry+0.030
 sleeping+0.029
 ours+0.029
 waiting+0.028
Neg logit contributions
antly-0.041
ant-0.040
ively-0.039
 efforts-0.039
ainer-0.039
Token did
Token ID750
Feature activation-2.08e+00

Pos logit contributions
 says+0.080
 hungry+0.075
rob+0.072
 wants+0.072
 sees+0.071
Neg logit contributions
antly-0.146
ant-0.142
ainer-0.141
ively-0.141
 efforts-0.140
Token not
Token ID407
Feature activation+2.99e+00

Pos logit contributions
 hungry+0.087
 says+0.085
 ours+0.084
 scary+0.084
 sleeping+0.083
Neg logit contributions
antly-0.120
ant-0.118
 fingertips-0.117
ively-0.116
 efforts-0.114
Token know
Token ID760
Feature activation+5.81e+00

Pos logit contributions
ant+0.161
 efforts+0.159
ered+0.157
ively+0.157
antly+0.154
Neg logit contributions
rob-0.148
 says-0.124
 circus-0.123
 ours-0.122
 hungry-0.121
Token and
Token ID290
Feature activation-1.46e+00

Pos logit contributions
ant+0.158
antly+0.158
 efforts+0.156
ively+0.156
ered+0.151
Neg logit contributions
rob-0.097
 says-0.090
 hungry-0.088
 circus-0.085
 ours-0.084
Token said
Token ID531
Feature activation-8.57e+00

Pos logit contributions
rob+0.045
 says+0.037
 circus+0.037
 hungry+0.037
 ours+0.036
Neg logit contributions
antly-0.102
ant-0.102
 efforts-0.101
ively-0.100
ered-0.099
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
 hungry+0.314
 says+0.311
rob+0.296
 sleeping+0.285
 waiting+0.280
Neg logit contributions
ively-0.525
 efforts-0.519
antly-0.518
ant-0.510
 fingertips-0.507
Token "
Token ID366
Feature activation+1.10e-01

Pos logit contributions
ant+0.082
antly+0.082
 efforts+0.080
ively+0.080
ered+0.077
Neg logit contributions
rob-0.086
 says-0.079
 circus-0.077
 hungry-0.077
 ours-0.077
TokenPl
Token ID3646
Feature activation+1.87e+00

Pos logit contributions
antly+0.006
ant+0.006
ively+0.006
 efforts+0.006
ered+0.006
Neg logit contributions
rob-0.005
 says-0.005
 hungry-0.005
 ours-0.005
 sleeping-0.005
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
 efforts+0.085
 fingertips+0.077
antly+0.073
 unison+0.072
mates+0.071
Neg logit contributions
rob-0.096
 ours-0.096
 says-0.095
iw-0.091
 wants-0.089
Token this
Token ID428
Feature activation-2.10e+00

Pos logit contributions
 efforts+1.169
antly+1.165
ant+1.156
ively+1.154
 fingertips+1.143
Neg logit contributions
rob-0.757
 says-0.734
 hungry-0.694
 ours-0.681
 circus-0.674
Token and
Token ID290
Feature activation-1.44e+00

Pos logit contributions
rob+0.075
 says+0.073
 hungry+0.070
 ours+0.068
 circus+0.067
Neg logit contributions
ant-0.133
antly-0.131
 efforts-0.130
ively-0.130
 fingertips-0.126
Token watch
Token ID2342
Feature activation+1.42e+00

Pos logit contributions
rob+0.060
 says+0.056
 hungry+0.055
 circus+0.054
 ours+0.053
Neg logit contributions
ant-0.084
antly-0.083
 efforts-0.082
ively-0.082
ered-0.080
Token it
Token ID340
Feature activation+2.07e+00

Pos logit contributions
ant+0.079
 efforts+0.079
antly+0.079
ively+0.078
 fingertips+0.076
Neg logit contributions
rob-0.060
 says-0.058
 hungry-0.057
 ours-0.055
 circus-0.055
Token grow
Token ID1663
Feature activation-4.31e+00

Pos logit contributions
antly+0.118
ant+0.117
ively+0.114
ered+0.114
 efforts+0.114
Neg logit contributions
rob-0.094
 circus-0.081
 says-0.080
 hungry-0.079
 ours-0.078
TokenThat
Token ID2504
Feature activation+1.78e+00

Pos logit contributions
rob+0.010
 says+0.009
 hungry+0.009
 circus+0.009
 ours+0.009
Neg logit contributions
antly-0.012
ant-0.012
ively-0.012
 efforts-0.012
ered-0.011
Token's
Token ID338
Feature activation-1.65e+00

Pos logit contributions
ant+0.103
antly+0.102
 efforts+0.101
ively+0.101
ered+0.098
Neg logit contributions
rob-0.076
 says-0.071
 hungry-0.069
 circus-0.068
 ours-0.067
Token so
Token ID523
Feature activation+1.66e-02

Pos logit contributions
 ours+0.058
 says+0.058
 hungry+0.058
 scary+0.057
 fun+0.056
Neg logit contributions
ant-0.104
 efforts-0.104
ively-0.103
 fingertips-0.103
antly-0.103
Token special
Token ID2041
Feature activation-1.08e+00

Pos logit contributions
ant+0.001
ered+0.001
antly+0.001
 efforts+0.001
ively+0.001
Neg logit contributions
rob-0.001
 says-0.001
 circus-0.001
 hungry-0.000
 sleeping-0.000
Token Ch
Token ID609
Feature activation+8.61e-01

Pos logit contributions
rob+0.047
 says+0.044
 hungry+0.043
 circus+0.042
 ours+0.042
Neg logit contributions
ant-0.062
antly-0.061
ively-0.060
 efforts-0.060
ered-0.059
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
antly+0.038
 efforts+0.035
 fingertips+0.035
ered+0.034
ively+0.034
Neg logit contributions
rob-0.046
 says-0.043
 sleeping-0.042
 circus-0.041
 blows-0.041
Tokenelle
Token ID13485
Feature activation+2.35e+00

Pos logit contributions
ant+1.162
antly+1.158
 efforts+1.150
ively+1.147
ered+1.113
Neg logit contributions
rob-0.775
 says-0.733
 hungry-0.701
 circus-0.686
 ours-0.673
Token.
Token ID13
Feature activation+8.46e-01

Pos logit contributions
ively+0.139
ant+0.139
antly+0.139
 efforts+0.137
ainer+0.134
Neg logit contributions
 says-0.092
rob-0.090
 hungry-0.087
 sleeping-0.083
 wants-0.083
Token I
Token ID314
Feature activation-4.57e+00

Pos logit contributions
 efforts+0.050
ant+0.049
ered+0.049
ively+0.049
antly+0.048
Neg logit contributions
 says-0.033
rob-0.032
 hungry-0.030
 wants-0.030
 circus-0.030
Token'm
Token ID1101
Feature activation+4.69e+00

Pos logit contributions
 says+0.208
rob+0.200
 hungry+0.194
 sees+0.193
 wants+0.191
Neg logit contributions
ant-0.249
ively-0.246
antly-0.243
 efforts-0.242
 fingertips-0.240
Token so
Token ID523
Feature activation+5.04e-02

Pos logit contributions
ant+0.254
antly+0.252
 efforts+0.251
ively+0.251
ered+0.247
Neg logit contributions
rob-0.219
 says-0.198
 circus-0.191
 hungry-0.188
 ours-0.186
Token it
Token ID340
Feature activation+2.11e+00

Pos logit contributions
ant+0.056
 efforts+0.056
antly+0.056
ively+0.055
 fingertips+0.053
Neg logit contributions
rob-0.050
 says-0.048
 hungry-0.047
 ours-0.046
 circus-0.045
Token down
Token ID866
Feature activation-4.06e+00

Pos logit contributions
ant+0.115
antly+0.115
 efforts+0.112
ively+0.112
ered+0.111
Neg logit contributions
rob-0.100
 says-0.091
 circus-0.088
 hungry-0.086
 ours-0.082
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
rob+0.159
 says+0.146
 hungry+0.142
 circus+0.139
 ours+0.137
Neg logit contributions
ant-0.246
antly-0.245
 efforts-0.243
ively-0.242
ered-0.235
Token Ign
Token ID16583
Feature activation+4.19e+00

Pos logit contributions
 efforts+0.066
ant+0.064
antly+0.064
ered+0.064
ively+0.063
Neg logit contributions
 says-0.053
 hungry-0.050
rob-0.050
 ours-0.049
 wants-0.048
Tokenor
Token ID273
Feature activation+1.01e+01

Pos logit contributions
ant+0.149
antly+0.146
ively+0.143
 efforts+0.142
ered+0.137
Neg logit contributions
rob-0.272
 says-0.259
 hungry-0.256
 circus-0.254
 ours-0.251
Tokenant
Token ID415
Feature activation+1.95e+01

Pos logit contributions
 fingertips+0.289
 efforts+0.288
ainer+0.267
 Days+0.257
antly+0.248
Neg logit contributions
 says-0.711
rob-0.706
 wants-0.678
 blows-0.658
 sees-0.656
Token children
Token ID1751
Feature activation+2.31e+00

Pos logit contributions
antly+1.195
 efforts+1.193
ant+1.186
ively+1.178
 fingertips+1.163
Neg logit contributions
rob-0.734
 says-0.703
 hungry-0.656
 circus-0.642
 ours-0.639
Token like
Token ID588
Feature activation-7.72e-01

Pos logit contributions
antly+0.138
ant+0.136
 efforts+0.135
ively+0.135
 fingertips+0.131
Neg logit contributions
rob-0.089
 says-0.089
 hungry-0.085
 ours-0.082
 sees-0.081
Token you
Token ID345
Feature activation-8.31e+00

Pos logit contributions
rob+0.032
 says+0.031
 hungry+0.030
 circus+0.029
 ours+0.029
Neg logit contributions
antly-0.045
ant-0.045
ively-0.044
 efforts-0.044
ered-0.043
Token don
Token ID836
Feature activation-1.05e+00

Pos logit contributions
rob+0.340
 says+0.326
 hungry+0.317
 circus+0.306
 ours+0.305
Neg logit contributions
antly-0.484
ant-0.483
 efforts-0.478
ively-0.476
 fingertips-0.460
Token't
Token ID470
Feature activation+4.68e+00

Pos logit contributions
rob+0.064
 says+0.061
 hungry+0.060
 circus+0.060
 ours+0.059
Neg logit contributions
ant-0.041
antly-0.040
ively-0.040
 efforts-0.039
ered-0.038
Token".
Token ID1911
Feature activation+9.62e-01

Pos logit contributions
 hungry+0.267
 says+0.267
 ours+0.252
 sleeping+0.250
rob+0.243
Neg logit contributions
antly-0.556
ively-0.549
 efforts-0.547
ant-0.546
 fingertips-0.535
Token
Token ID220
Feature activation-2.25e-01

Pos logit contributions
ant+0.059
 efforts+0.059
ively+0.058
antly+0.057
ered+0.057
Neg logit contributions
 says-0.038
rob-0.037
 hungry-0.034
 sleeping-0.033
 scar-0.032
Token
Token ID198
Feature activation-1.25e+00

Pos logit contributions
rob+0.008
 says+0.008
 hungry+0.008
 sleeping+0.007
 sees+0.007
Neg logit contributions
 efforts-0.014
ant-0.014
antly-0.014
ively-0.013
 fingertips-0.013
Token
Token ID198
Feature activation-1.26e+00

Pos logit contributions
rob+0.050
 says+0.047
 hungry+0.043
 circus+0.043
 sleeping+0.043
Neg logit contributions
 efforts-0.077
ant-0.076
antly-0.075
ively-0.075
ered-0.073
TokenCh
Token ID1925
Feature activation+4.54e+00

Pos logit contributions
rob+0.066
 says+0.062
 circus+0.059
 hungry+0.059
 sleeping+0.058
Neg logit contributions
 efforts-0.061
ant-0.060
ively-0.060
antly-0.060
 fingertips-0.058
Tokenant
Token ID415
Feature activation+1.94e+01

Pos logit contributions
 fingertips+0.162
antly+0.159
 efforts+0.154
 nobody+0.145
mates+0.144
Neg logit contributions
rob-0.295
 says-0.278
 blows-0.255
 sleeping-0.250
etchup-0.250
Tokenelle
Token ID13485
Feature activation+2.34e+00

Pos logit contributions
 efforts+1.131
 fingertips+1.130
ively+1.121
ainer+1.106
ant+1.104
Neg logit contributions
rob-0.829
 says-0.805
 scar-0.718
 sees-0.706
 wants-0.700
Token was
Token ID373
Feature activation+3.90e+00

Pos logit contributions
ively+0.144
ant+0.142
ainer+0.141
 efforts+0.135
 fingertips+0.134
Neg logit contributions
 says-0.099
 sees-0.085
 wants-0.084
 hungry-0.082
 sleeping-0.080
Token very
Token ID845
Feature activation-8.72e-01

Pos logit contributions
ant+0.278
antly+0.266
ively+0.265
growth+0.262
 efforts+0.260
Neg logit contributions
 hungry-0.140
 waiting-0.124
 sleeping-0.121
 says-0.117
 scary-0.109
Token happy
Token ID3772
Feature activation-2.46e+00

Pos logit contributions
 hungry+0.029
 says+0.028
rob+0.027
 sleeping+0.026
 ours+0.025
Neg logit contributions
ant-0.058
antly-0.058
ively-0.057
 efforts-0.057
ered-0.055
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
rob+0.093
 says+0.090
 hungry+0.088
 sleeping+0.085
 circus+0.085
Neg logit contributions
ant-0.152
antly-0.151
ively-0.149
 efforts-0.147
ered-0.145
Token reached
Token ID4251
Feature activation+8.02e-01

Pos logit contributions
rob+0.050
 says+0.048
 hungry+0.046
 circus+0.044
 sees+0.044
Neg logit contributions
ant-0.081
antly-0.080
ively-0.080
 efforts-0.079
 fingertips-0.077
Token for
Token ID329
Feature activation-5.39e+00

Pos logit contributions
antly+0.047
ant+0.047
ively+0.047
 efforts+0.046
ered+0.046
Neg logit contributions
rob-0.035
 circus-0.030
 hungry-0.029
 says-0.029
 ours-0.028
Token the
Token ID262
Feature activation+1.39e+00

Pos logit contributions
 hungry+0.192
rob+0.186
 says+0.184
 sleeping+0.178
 ours+0.177
Neg logit contributions
antly-0.355
ant-0.350
ively-0.332
 efforts-0.329
vered-0.323
Token cro
Token ID6763
Feature activation-1.14e+01

Pos logit contributions
ant+0.084
 efforts+0.083
antly+0.083
ively+0.082
 fingertips+0.081
Neg logit contributions
rob-0.056
 says-0.048
 hungry-0.046
 circus-0.045
 scar-0.045
Tokeniss
Token ID747
Feature activation-1.70e+00

Pos logit contributions
rob+0.430
 says+0.390
 hungry+0.389
 circus+0.388
 ours+0.385
Neg logit contributions
ant-0.723
antly-0.719
ively-0.706
 efforts-0.694
ered-0.692
Tokenant
Token ID415
Feature activation+1.94e+01

Pos logit contributions
 says+0.101
rob+0.099
 ours+0.095
 hungry+0.093
 sleeping+0.093
Neg logit contributions
 efforts-0.061
antly-0.060
 Days-0.059
 fingertips-0.058
ively-0.056
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
 efforts+1.143
ant+1.139
antly+1.138
ively+1.122
ered+1.115
Neg logit contributions
rob-0.772
 says-0.752
 hungry-0.711
 circus-0.695
 ours-0.681
Token He
Token ID679
Feature activation-1.32e+00

Pos logit contributions
ant+0.009
 efforts+0.008
ively+0.008
antly+0.008
ered+0.008
Neg logit contributions
 says-0.005
rob-0.005
 hungry-0.005
 circus-0.004
 sees-0.004
Token rubbed
Token ID31862
Feature activation+1.32e-01

Pos logit contributions
rob+0.052
 says+0.049
 hungry+0.048
 circus+0.046
 sleeping+0.045
Neg logit contributions
ant-0.080
antly-0.080
ively-0.079
 efforts-0.079
ered-0.076
Token it
Token ID340
Feature activation+1.79e+00

Pos logit contributions
 efforts+0.009
ively+0.009
 fingertips+0.009
ant+0.009
ered+0.008
Neg logit contributions
 says-0.004
rob-0.004
 hungry-0.004
 red-0.004
 brown-0.004
Token on
Token ID319
Feature activation-5.30e+00

Pos logit contributions
ant+0.110
 efforts+0.108
ively+0.107
antly+0.107
 fingertips+0.104
Neg logit contributions
rob-0.068
 says-0.066
 hungry-0.064
 sleeping-0.061
 circus-0.060
Token She
Token ID1375
Feature activation-1.95e+00

Pos logit contributions
ant+0.068
antly+0.068
 efforts+0.067
ively+0.067
 fingertips+0.065
Neg logit contributions
rob-0.046
 says-0.040
 circus-0.039
 ours-0.039
 hungry-0.039
Token loved
Token ID6151
Feature activation+3.52e+00

Pos logit contributions
 ours+0.117
rob+0.117
 says+0.109
 circus+0.109
 scar+0.107
Neg logit contributions
antly-0.074
ant-0.070
ered-0.069
 efforts-0.068
kered-0.067
Token to
Token ID284
Feature activation-1.45e+00

Pos logit contributions
 efforts+0.123
 fingertips+0.108
 everybody+0.105
 Everyone+0.104
ant+0.103
Neg logit contributions
rob-0.221
use-0.200
 says-0.197
 ours-0.191
 yells-0.191
Token go
Token ID467
Feature activation-1.96e+00

Pos logit contributions
rob+0.060
 says+0.055
 hungry+0.054
 circus+0.053
 ours+0.052
Neg logit contributions
ant-0.085
antly-0.085
 efforts-0.084
ively-0.084
ered-0.081
Token on
Token ID319
Feature activation-5.28e+00

Pos logit contributions
rob+0.103
 ours+0.098
 wra+0.097
 yells+0.097
 fru+0.095
Neg logit contributions
ered-0.081
antly-0.077
 Everyone-0.073
ively-0.071
 efforts-0.071
Token pic
Token ID8301
Feature activation-1.53e+01

Pos logit contributions
rob+0.236
 says+0.234
iw+0.224
 wants+0.223
etchup+0.220
Neg logit contributions
 efforts-0.264
ered-0.256
growth-0.256
antly-0.247
ant-0.245
Tokenn
Token ID77
Feature activation+4.47e+00

Pos logit contributions
rob+0.903
 circus+0.863
 says+0.851
 sleeping+0.847
 ours+0.832
Neg logit contributions
antly-0.621
 efforts-0.597
ively-0.594
ant-0.589
 fingertips-0.577
Tokenics
Token ID873
Feature activation+1.03e+00

Pos logit contributions
ant+0.208
antly+0.206
 efforts+0.203
ively+0.203
ered+0.196
Neg logit contributions
rob-0.238
 says-0.225
 hungry-0.221
 circus-0.218
 ours-0.216
Token with
Token ID351
Feature activation-4.93e-01

Pos logit contributions
mates+0.029
ant+0.027
ered+0.026
antly+0.026
ively+0.024
Neg logit contributions
rob-0.068
 yells-0.067
 says-0.066
ww-0.064
 ours-0.064
Token her
Token ID607
Feature activation-4.10e+00

Pos logit contributions
rob+0.024
 says+0.021
 sees+0.020
 circus+0.020
 ours+0.020
Neg logit contributions
 efforts-0.025
ant-0.025
antly-0.025
ered-0.025
ively-0.024
Token family
Token ID1641
Feature activation+4.75e+00

Pos logit contributions
rob+0.291
 says+0.262
 yells+0.262
 ours+0.260
 circus+0.257
Neg logit contributions
 efforts-0.127
antly-0.114
 fingertips-0.114
ant-0.104
mates-0.103
Token so
Token ID523
Feature activation-1.66e-01

Pos logit contributions
ant+0.210
antly+0.210
 efforts+0.209
ively+0.208
ered+0.204
Neg logit contributions
rob-0.185
 says-0.167
 circus-0.162
 ours-0.157
 hungry-0.157
Token excited
Token ID6568
Feature activation-2.07e+00

Pos logit contributions
rob+0.011
 circus+0.009
use+0.009
 scar+0.009
 barking+0.008
Neg logit contributions
ered-0.007
 efforts-0.007
ively-0.007
ant-0.007
kered-0.007
Token!
Token ID0
Feature activation+5.45e-02

Pos logit contributions
rob+0.124
 circus+0.108
 hungry+0.105
 ours+0.105
 scar+0.105
Neg logit contributions
 efforts-0.086
ant-0.085
ively-0.083
ered-0.081
antly-0.080
Token She
Token ID1375
Feature activation-1.92e+00

Pos logit contributions
antly+0.003
ant+0.003
ively+0.003
 efforts+0.003
 fingertips+0.003
Neg logit contributions
rob-0.002
 says-0.002
 circus-0.002
 hungry-0.002
 ours-0.002
Token loved
Token ID6151
Feature activation+3.57e+00

Pos logit contributions
rob+0.096
 ours+0.082
 circus+0.081
 hungry+0.078
 says+0.078
Neg logit contributions
antly-0.101
 efforts-0.100
ered-0.100
ant-0.098
 fingertips-0.096
Token pic
Token ID8301
Feature activation-1.52e+01

Pos logit contributions
 efforts+0.164
zed+0.152
 everybody+0.151
ered+0.149
 fingertips+0.145
Neg logit contributions
rob-0.179
use-0.174
arl-0.163
oky-0.157
P-0.156
Tokenn
Token ID77
Feature activation+4.38e+00

Pos logit contributions
rob+0.844
 circus+0.743
 says+0.742
 sleeping+0.733
 ours+0.706
Neg logit contributions
antly-0.692
 efforts-0.674
ively-0.656
 fingertips-0.650
ant-0.631
Tokenics
Token ID873
Feature activation+1.01e+00

Pos logit contributions
antly+0.220
 efforts+0.217
ant+0.217
ively+0.213
 fingertips+0.208
Neg logit contributions
rob-0.219
 says-0.205
 hungry-0.202
 sleeping-0.197
 ours-0.196
Token and
Token ID290
Feature activation-1.41e+00

Pos logit contributions
ant+0.058
antly+0.058
 efforts+0.058
ively+0.057
ered+0.055
Neg logit contributions
rob-0.043
 says-0.040
 hungry-0.038
 circus-0.037
 ours-0.037
Token spending
Token ID4581
Feature activation-4.94e+00

Pos logit contributions
rob+0.064
 says+0.055
 ours+0.054
 circus+0.053
P+0.052
Neg logit contributions
 efforts-0.077
ered-0.076
antly-0.076
ant-0.075
 fingertips-0.075
Token time
Token ID640
Feature activation-1.44e+00

Pos logit contributions
rob+0.248
 says+0.241
 hungry+0.231
 ours+0.231
 circus+0.223
Neg logit contributions
 efforts-0.234
antly-0.233
ered-0.231
ant-0.229
ively-0.225
Token near
Token ID1474
Feature activation-1.47e+00

Pos logit contributions
rob+0.349
 says+0.324
 hungry+0.319
 circus+0.310
 ours+0.304
Neg logit contributions
ant-0.514
antly-0.509
 efforts-0.507
ively-0.503
ered-0.489
Token the
Token ID262
Feature activation+1.97e+00

Pos logit contributions
 says+0.058
 hungry+0.057
 ours+0.055
 sleeping+0.053
 scary+0.052
Neg logit contributions
antly-0.090
ant-0.090
ively-0.088
 efforts-0.087
ainer-0.085
Token cliff
Token ID19516
Feature activation-9.24e-01

Pos logit contributions
ant+0.120
antly+0.120
ively+0.119
 efforts+0.119
ered+0.116
Neg logit contributions
rob-0.077
 says-0.070
 hungry-0.067
 circus-0.066
 ours-0.065
Token and
Token ID290
Feature activation-1.39e+00

Pos logit contributions
rob+0.042
 says+0.039
 hungry+0.038
 circus+0.037
 ours+0.037
Neg logit contributions
ant-0.050
antly-0.050
 efforts-0.050
ively-0.050
ered-0.048
Token have
Token ID423
Feature activation+2.26e+00

Pos logit contributions
rob+0.069
 says+0.062
 ours+0.060
 hungry+0.060
 circus+0.059
Neg logit contributions
antly-0.070
ant-0.070
 efforts-0.069
ively-0.069
ered-0.068
Token pic
Token ID8301
Feature activation-1.52e+01

Pos logit contributions
 efforts+0.144
antly+0.140
ered+0.139
ively+0.134
ant+0.134
Neg logit contributions
rob-0.080
 says-0.068
use-0.064
 blows-0.063
 yells-0.063
Tokenn
Token ID77
Feature activation+4.38e+00

Pos logit contributions
rob+0.835
 circus+0.779
 says+0.756
 ours+0.755
 sleeping+0.740
Neg logit contributions
antly-0.676
ively-0.655
 fingertips-0.646
ant-0.634
 efforts-0.632
Tokenics
Token ID873
Feature activation+1.05e+00

Pos logit contributions
antly+0.231
 efforts+0.231
ant+0.228
 fingertips+0.226
ively+0.223
Neg logit contributions
rob-0.204
 says-0.193
 hungry-0.187
 sleeping-0.182
 ours-0.182
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
antly+0.057
ant+0.056
 efforts+0.056
ively+0.055
ered+0.054
Neg logit contributions
rob-0.048
 says-0.045
 hungry-0.042
 circus-0.042
 ours-0.042
Token One
Token ID1881
Feature activation-6.02e-01

Pos logit contributions
antly+0.058
ant+0.058
 efforts+0.057
ively+0.057
ered+0.055
Neg logit contributions
rob-0.058
 says-0.054
 hungry-0.053
 ours-0.052
 circus-0.052
Token day
Token ID1110
Feature activation-1.89e+00

Pos logit contributions
rob+0.025
 says+0.022
 hungry+0.022
 circus+0.021
 ours+0.021
Neg logit contributions
ant-0.036
antly-0.036
 efforts-0.036
ively-0.035
ered-0.034
Token to
Token ID284
Feature activation-1.27e+00

Pos logit contributions
 fingertips+0.271
ant+0.269
antly+0.258
ively+0.255
 efforts+0.254
Neg logit contributions
 says-0.176
 hungry-0.168
 scary-0.166
rob-0.164
 sleeping-0.161
Token be
Token ID307
Feature activation+1.51e+00

Pos logit contributions
 says+0.057
rob+0.056
 circus+0.054
 hungry+0.053
 sleeping+0.053
Neg logit contributions
antly-0.069
ant-0.066
 fingertips-0.066
 efforts-0.066
ively-0.065
Token careful
Token ID8161
Feature activation+1.81e+00

Pos logit contributions
ant+0.096
 fingertips+0.093
ively+0.093
antly+0.093
 efforts+0.090
Neg logit contributions
 hungry-0.059
 says-0.054
 ours-0.053
 scary-0.053
 sleeping-0.052
Token when
Token ID618
Feature activation+5.35e+00

Pos logit contributions
growth+0.134
antly+0.133
ant+0.132
 fingertips+0.131
ively+0.129
Neg logit contributions
 sleeping-0.040
 hungry-0.039
 says-0.038
 waiting-0.038
 barking-0.035
Token having
Token ID1719
Feature activation+1.38e+00

Pos logit contributions
ant+0.334
antly+0.333
 efforts+0.332
ively+0.329
ered+0.322
Neg logit contributions
rob-0.205
 says-0.184
 hungry-0.175
 circus-0.175
 ours-0.172
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
ant+0.091
antly+0.091
 efforts+0.091
ively+0.090
ered+0.088
Neg logit contributions
rob-0.047
 says-0.042
 hungry-0.041
 circus-0.040
 ours-0.039
Tokenn
Token ID77
Feature activation+4.46e+00

Pos logit contributions
rob+0.731
 says+0.659
 ours+0.658
use+0.650
 sleeping+0.631
Neg logit contributions
antly-0.745
 fingertips-0.742
mates-0.659
 efforts-0.657
 Days-0.656
Tokenics
Token ID873
Feature activation+1.16e+00

Pos logit contributions
 fingertips+0.235
antly+0.235
 efforts+0.233
ant+0.226
ively+0.221
Neg logit contributions
 says-0.203
rob-0.202
 hungry-0.189
 sleeping-0.185
 wants-0.182
Token and
Token ID290
Feature activation-1.73e+00

Pos logit contributions
ant+0.065
antly+0.065
 efforts+0.064
ively+0.064
ered+0.062
Neg logit contributions
rob-0.051
 says-0.048
 hungry-0.047
 circus-0.046
 ours-0.045
Token not
Token ID407
Feature activation+3.15e+00

Pos logit contributions
rob+0.075
 says+0.069
 hungry+0.067
 circus+0.066
 ours+0.065
Neg logit contributions
ant-0.098
antly-0.098
 efforts-0.098
ively-0.097
ered-0.094
Token to
Token ID284
Feature activation-1.25e+00

Pos logit contributions
 efforts+0.171
ant+0.165
ered+0.162
ively+0.162
antly+0.161
Neg logit contributions
rob-0.160
 says-0.133
 circus-0.130
P-0.124
 ours-0.124
Token went
Token ID1816
Feature activation-2.95e+00

Pos logit contributions
antly+0.008
ant+0.008
ively+0.008
 efforts+0.008
ainer+0.008
Neg logit contributions
rob-0.006
 says-0.006
 hungry-0.006
 circus-0.006
 scar-0.006
Token on
Token ID319
Feature activation-5.09e+00

Pos logit contributions
rob+0.140
 ours+0.138
 says+0.136
Say+0.133
 yells+0.132
Neg logit contributions
antly-0.140
 efforts-0.133
ered-0.131
 Days-0.131
mates-0.130
Token adventures
Token ID17545
Feature activation-5.28e+00

Pos logit contributions
rob+0.213
 says+0.199
 ours+0.188
 hungry+0.184
 circus+0.180
Neg logit contributions
 efforts-0.294
antly-0.290
ant-0.285
ively-0.282
ered-0.279
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
rob+0.202
 says+0.185
 hungry+0.181
 circus+0.177
 sleeping+0.172
Neg logit contributions
ant-0.331
antly-0.328
ively-0.324
 efforts-0.324
ered-0.315
Token had
Token ID550
Feature activation+1.34e-01

Pos logit contributions
 efforts+0.090
ant+0.089
ively+0.087
antly+0.087
ered+0.085
Neg logit contributions
rob-0.071
 says-0.068
 ours-0.067
ww-0.063
 yells-0.063
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
ant+0.007
ively+0.007
antly+0.007
 efforts+0.007
ered+0.006
Neg logit contributions
rob-0.007
 says-0.006
 hungry-0.006
 circus-0.006
 sleeping-0.006
Tokenn
Token ID77
Feature activation+4.52e+00

Pos logit contributions
rob+0.810
 fun+0.741
 circus+0.724
 ours+0.721
 says+0.705
Neg logit contributions
antly-0.704
 fingertips-0.663
ively-0.661
ant-0.659
 efforts-0.631
Tokenics
Token ID873
Feature activation+1.34e+00

Pos logit contributions
antly+0.231
ant+0.228
 efforts+0.226
ively+0.224
 fingertips+0.223
Neg logit contributions
rob-0.222
 says-0.206
 hungry-0.201
 sleeping-0.197
 ours-0.195
Token,
Token ID11
Feature activation+1.66e+00

Pos logit contributions
ant+0.082
antly+0.082
 efforts+0.081
ively+0.081
ered+0.079
Neg logit contributions
rob-0.052
 says-0.048
 hungry-0.047
 circus-0.046
 ours-0.045
Token and
Token ID290
Feature activation-1.48e+00

Pos logit contributions
 efforts+0.092
antly+0.090
ant+0.089
ively+0.088
 fingertips+0.087
Neg logit contributions
rob-0.071
 says-0.067
 ours-0.064
 yells-0.062
 hungry-0.061
Token even
Token ID772
Feature activation+4.08e+00

Pos logit contributions
rob+0.069
 hungry+0.064
 says+0.064
 circus+0.062
 sleeping+0.061
Neg logit contributions
ant-0.081
antly-0.081
ively-0.079
 efforts-0.079
ered-0.076
Token the
Token ID262
Feature activation+1.90e+00

Pos logit contributions
ant+0.030
antly+0.030
 efforts+0.029
ively+0.029
ered+0.028
Neg logit contributions
rob-0.025
 says-0.024
 hungry-0.024
 circus-0.023
 sleeping-0.023
Token park
Token ID3952
Feature activation-7.19e+00

Pos logit contributions
antly+0.126
 efforts+0.120
ively+0.116
ant+0.116
ainer+0.115
Neg logit contributions
rob-0.067
 says-0.059
 ours-0.057
 hungry-0.053
 yells-0.053
Token,
Token ID11
Feature activation+1.68e+00

Pos logit contributions
rob+0.277
 hungry+0.272
 says+0.264
 circus+0.260
 sleeping+0.260
Neg logit contributions
ant-0.438
antly-0.431
ively-0.426
 efforts-0.425
ered-0.417
Token and
Token ID290
Feature activation-1.48e+00

Pos logit contributions
 efforts+0.095
antly+0.094
ant+0.093
ively+0.092
ered+0.091
Neg logit contributions
rob-0.073
 says-0.066
 ours-0.062
 circus-0.062
 hungry-0.062
Token have
Token ID423
Feature activation+2.11e+00

Pos logit contributions
 hungry+0.062
 says+0.061
 sleeping+0.060
rob+0.059
 circus+0.058
Neg logit contributions
ant-0.087
antly-0.085
ively-0.082
 efforts-0.082
ainer-0.081
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
ant+0.142
ively+0.138
antly+0.138
 efforts+0.135
ered+0.132
Neg logit contributions
rob-0.073
 says-0.068
 hungry-0.067
 circus-0.066
 sleeping-0.064
Tokenn
Token ID77
Feature activation+4.51e+00

Pos logit contributions
rob+0.759
 fun+0.694
 ours+0.679
 circus+0.670
 says+0.666
Neg logit contributions
antly-0.722
 fingertips-0.702
ively-0.696
ant-0.675
 efforts-0.661
Tokenics
Token ID873
Feature activation+1.19e+00

Pos logit contributions
 efforts+0.234
 fingertips+0.233
antly+0.233
ant+0.228
ively+0.227
Neg logit contributions
rob-0.209
 says-0.196
 hungry-0.190
 sleeping-0.188
 ours-0.186
Token outside
Token ID2354
Feature activation-1.08e+00

Pos logit contributions
ant+0.069
antly+0.068
ively+0.068
 efforts+0.068
ered+0.066
Neg logit contributions
rob-0.050
 says-0.046
 hungry-0.046
 circus-0.045
 ours-0.044
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 hungry+0.040
rob+0.040
 says+0.038
 sleeping+0.038
 circus+0.037
Neg logit contributions
ant-0.066
 efforts-0.065
antly-0.065
ively-0.064
ered-0.064
Token
Token ID220
Feature activation-2.96e-01

Pos logit contributions
ered+0.073
ant+0.072
 efforts+0.068
antly+0.067
bled+0.067
Neg logit contributions
 says-0.047
 hungry-0.041
 wants-0.040
 circus-0.039
 pic-0.038
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
ant+0.051
antly+0.050
ively+0.049
ered+0.048
 efforts+0.047
Neg logit contributions
 hungry-0.030
 sleeping-0.029
rob-0.029
 says-0.028
 waiting-0.027
Token ready
Token ID3492
Feature activation-8.84e+00

Pos logit contributions
 efforts+0.065
 fingertips+0.060
 Days+0.059
 nobody+0.057
 surely+0.057
Neg logit contributions
rob-0.068
ww-0.057
 says-0.057
P-0.057
 yells-0.054
Token to
Token ID284
Feature activation-1.38e+00

Pos logit contributions
rob+0.326
 says+0.301
 hungry+0.293
 circus+0.285
 ours+0.281
Neg logit contributions
ant-0.560
antly-0.557
 efforts-0.552
ively-0.550
ered-0.536
Token have
Token ID423
Feature activation+2.05e+00

Pos logit contributions
rob+0.058
 says+0.057
 hungry+0.055
 circus+0.055
 sleeping+0.053
Neg logit contributions
ant-0.080
antly-0.079
ively-0.077
 efforts-0.077
ered-0.074
Token more
Token ID517
Feature activation+4.04e+00

Pos logit contributions
antly+0.113
 efforts+0.112
ant+0.111
ively+0.110
ered+0.108
Neg logit contributions
rob-0.091
 says-0.085
 hungry-0.083
 circus-0.081
 ours-0.081
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
 efforts+0.270
ered+0.232
kered+0.231
 fingertips+0.230
antly+0.225
Neg logit contributions
rob-0.152
use-0.115
 barking-0.113
 sees-0.106
 ours-0.106
Tokenn
Token ID77
Feature activation+4.40e+00

Pos logit contributions
rob+0.875
 circus+0.810
 says+0.804
 ours+0.795
 sleeping+0.784
Neg logit contributions
antly-0.645
ant-0.613
ively-0.609
 fingertips-0.607
 efforts-0.576
Tokenics
Token ID873
Feature activation+1.10e+00

Pos logit contributions
antly+0.223
ant+0.221
 efforts+0.219
ively+0.215
 fingertips+0.213
Neg logit contributions
rob-0.217
 says-0.205
 hungry-0.199
 ours-0.195
 sleeping-0.194
Token in
Token ID287
Feature activation+4.37e-01

Pos logit contributions
antly+0.057
 efforts+0.057
ant+0.057
ively+0.056
ered+0.054
Neg logit contributions
rob-0.052
 says-0.049
 hungry-0.047
 ours-0.046
 circus-0.046
Token the
Token ID262
Feature activation+1.69e+00

Pos logit contributions
 efforts+0.019
antly+0.019
ively+0.019
ant+0.019
ered+0.018
Neg logit contributions
rob-0.024
 says-0.022
 circus-0.021
 hungry-0.021
 ours-0.021
Token future
Token ID2003
Feature activation-3.29e+00

Pos logit contributions
antly+0.098
 Days+0.097
 bottom+0.093
 efforts+0.091
 surely+0.088
Neg logit contributions
rob-0.074
 yells-0.065
 says-0.063
 ours-0.061
 blows-0.059
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
rob+0.015
 says+0.014
 hungry+0.014
 circus+0.014
 ours+0.013
Neg logit contributions
ant-0.019
antly-0.019
 efforts-0.018
ively-0.018
ered-0.018
Token She
Token ID1375
Feature activation-1.92e+00

Pos logit contributions
ant+0.078
antly+0.078
 efforts+0.076
ively+0.076
 fingertips+0.074
Neg logit contributions
rob-0.051
 says-0.045
 circus-0.044
 ours-0.043
 hungry-0.043
Token loved
Token ID6151
Feature activation+3.55e+00

Pos logit contributions
rob+0.109
 ours+0.103
 says+0.102
 circus+0.099
 hungry+0.098
Neg logit contributions
antly-0.083
ant-0.081
 efforts-0.079
ered-0.078
ively-0.078
Token to
Token ID284
Feature activation-1.37e+00

Pos logit contributions
 efforts+0.130
 fingertips+0.115
 everybody+0.112
ant+0.112
 Everyone+0.109
Neg logit contributions
rob-0.216
use-0.197
 says-0.196
 ours-0.188
 yells-0.187
Token have
Token ID423
Feature activation+2.12e+00

Pos logit contributions
rob+0.057
 says+0.053
 hungry+0.051
 ours+0.050
 circus+0.050
Neg logit contributions
ant-0.080
antly-0.079
 efforts-0.079
ively-0.078
ered-0.077
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
 efforts+0.119
growth+0.115
 Everyone+0.115
ant+0.112
ered+0.112
Neg logit contributions
 says-0.074
 wants-0.074
rob-0.074
use-0.074
 yells-0.074
Tokenn
Token ID77
Feature activation+4.65e+00

Pos logit contributions
rob+0.856
 circus+0.793
 says+0.775
 ours+0.773
 sleeping+0.759
Neg logit contributions
antly-0.678
ively-0.640
 fingertips-0.625
 efforts-0.620
ant-0.611
Tokenics
Token ID873
Feature activation+1.35e+00

Pos logit contributions
antly+0.229
ant+0.227
 efforts+0.225
ively+0.222
 fingertips+0.216
Neg logit contributions
rob-0.237
 says-0.222
 hungry-0.217
 circus-0.213
 ours-0.211
Token with
Token ID351
Feature activation-4.76e-01

Pos logit contributions
antly+0.055
ant+0.054
 efforts+0.053
mates+0.053
ered+0.053
Neg logit contributions
rob-0.078
 says-0.074
 yells-0.072
 ours-0.070
 blows-0.069
Token her
Token ID607
Feature activation-3.99e+00

Pos logit contributions
rob+0.025
 says+0.021
 sees+0.021
 yells+0.021
 wants+0.020
Neg logit contributions
 efforts-0.023
ant-0.022
antly-0.022
ered-0.022
 fingertips-0.022
Token family
Token ID1641
Feature activation+4.94e+00

Pos logit contributions
rob+0.269
 yells+0.240
 says+0.237
 ours+0.234
 circus+0.234
Neg logit contributions
 efforts-0.136
antly-0.126
 fingertips-0.122
ant-0.116
mates-0.115
Token see
Token ID766
Feature activation+2.47e+00

Pos logit contributions
rob+0.053
 says+0.049
 hungry+0.049
 circus+0.048
 ours+0.046
Neg logit contributions
ant-0.069
antly-0.069
 efforts-0.068
ively-0.068
 fingertips-0.065
Token the
Token ID262
Feature activation+1.87e+00

Pos logit contributions
ant+0.125
antly+0.124
 efforts+0.123
ively+0.122
ered+0.118
Neg logit contributions
rob-0.122
 says-0.115
 hungry-0.113
 circus-0.111
 ours-0.110
Token bird
Token ID6512
Feature activation+9.53e-01

Pos logit contributions
 efforts+0.106
antly+0.105
ively+0.102
ant+0.101
 fingertips+0.098
Neg logit contributions
rob-0.084
 says-0.074
 ours-0.068
 circus-0.068
 hungry-0.067
Token and
Token ID290
Feature activation-1.46e+00

Pos logit contributions
antly+0.050
ant+0.050
 efforts+0.049
ively+0.049
ered+0.048
Neg logit contributions
rob-0.046
 says-0.042
 circus-0.041
 ours-0.040
 hungry-0.040
Token have
Token ID423
Feature activation+2.20e+00

Pos logit contributions
rob+0.060
 says+0.060
 hungry+0.059
 circus+0.056
 sleeping+0.056
Neg logit contributions
ant-0.086
antly-0.084
ively-0.083
 efforts-0.082
ered-0.079
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
antly+0.128
ant+0.128
 efforts+0.127
ively+0.126
ered+0.123
Neg logit contributions
rob-0.092
 says-0.085
 hungry-0.082
 circus-0.081
 ours-0.080
Tokenn
Token ID77
Feature activation+4.55e+00

Pos logit contributions
rob+0.776
 circus+0.722
 ours+0.718
 fun+0.710
 says+0.704
Neg logit contributions
antly-0.695
ively-0.664
 fingertips-0.660
ant-0.657
 Days-0.627
Tokenics
Token ID873
Feature activation+1.30e+00

Pos logit contributions
antly+0.250
 fingertips+0.250
 efforts+0.249
ant+0.245
ively+0.240
Neg logit contributions
rob-0.199
 says-0.184
 sleeping-0.179
 hungry-0.178
 ours-0.176
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
ant+0.073
antly+0.072
 efforts+0.071
ively+0.071
ered+0.069
Neg logit contributions
rob-0.057
 says-0.053
 hungry-0.052
 circus-0.051
 ours-0.050
Token<|endoftext|>
Token ID50256
Feature activation-3.31e-01

Pos logit contributions
ant+0.083
ered+0.080
ively+0.079
 efforts+0.079
antly+0.079
Neg logit contributions
rob-0.043
 says-0.043
 hungry-0.041
 sees-0.038
 circus-0.038
TokenOnce
Token ID7454
Feature activation-3.53e+00

Pos logit contributions
rob+0.013
 says+0.012
 hungry+0.012
 circus+0.011
 sleeping+0.011
Neg logit contributions
ant-0.020
 efforts-0.020
antly-0.020
ively-0.020
ered-0.019
TokenThey
Token ID2990
Feature activation-1.06e+00

Pos logit contributions
rob+0.066
P+0.063
 circus+0.062
 says+0.062
Say+0.058
Neg logit contributions
 efforts-0.070
 fingertips-0.070
antly-0.069
ant-0.067
ively-0.067
Token decided
Token ID3066
Feature activation-5.96e-01

Pos logit contributions
 says+0.045
 hungry+0.043
rob+0.043
 scar+0.041
 sleeping+0.041
Neg logit contributions
ant-0.062
antly-0.060
ively-0.060
 efforts-0.059
ainer-0.059
Token to
Token ID284
Feature activation-1.25e+00

Pos logit contributions
rob+0.021
 says+0.021
 hungry+0.021
 circus+0.020
 ours+0.020
Neg logit contributions
ant-0.038
antly-0.037
ively-0.037
 efforts-0.036
 fingertips-0.036
Token make
Token ID787
Feature activation+6.27e+00

Pos logit contributions
 says+0.049
 circus+0.046
 hungry+0.042
 wants+0.042
 ours+0.042
Neg logit contributions
antly-0.074
ant-0.072
ively-0.071
 efforts-0.071
 fingertips-0.069
Token more
Token ID517
Feature activation+4.18e+00

Pos logit contributions
ant+0.344
antly+0.338
ively+0.335
 efforts+0.334
 fingertips+0.323
Neg logit contributions
rob-0.286
 says-0.277
 hungry-0.269
 circus-0.260
 ours-0.259
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
 efforts+0.235
ered+0.215
ant+0.212
antly+0.212
ively+0.210
Neg logit contributions
rob-0.203
use-0.163
P-0.160
 circus-0.154
 ours-0.154
Tokenn
Token ID77
Feature activation+4.50e+00

Pos logit contributions
rob+0.750
 ours+0.696
 fun+0.684
 says+0.684
 circus+0.674
Neg logit contributions
antly-0.705
 fingertips-0.693
 Days-0.661
ant-0.649
ively-0.640
Tokenics
Token ID873
Feature activation+1.24e+00

Pos logit contributions
antly+0.235
ant+0.230
 efforts+0.228
ively+0.223
 fingertips+0.223
Neg logit contributions
rob-0.215
 says-0.205
 hungry-0.199
 ours-0.194
 sleeping-0.193
Token and
Token ID290
Feature activation-1.53e+00

Pos logit contributions
ant+0.072
antly+0.072
 efforts+0.071
ively+0.071
ered+0.069
Neg logit contributions
rob-0.051
 says-0.048
 hungry-0.047
 circus-0.046
 ours-0.045
Token looked
Token ID3114
Feature activation-1.03e+00

Pos logit contributions
rob+0.067
 says+0.066
 hungry+0.065
 circus+0.063
 ours+0.062
Neg logit contributions
ant-0.086
antly-0.084
ively-0.082
 efforts-0.081
 fingertips-0.079
Token in
Token ID287
Feature activation+6.35e-01

Pos logit contributions
rob+0.044
 says+0.040
 hungry+0.039
 circus+0.038
 ours+0.038
Neg logit contributions
ant-0.060
antly-0.060
 efforts-0.059
ively-0.059
ered-0.057
Token fun
Token ID1257
Feature activation-9.26e+00

Pos logit contributions
 efforts+0.114
 Everyone+0.107
antly+0.106
ively+0.106
ered+0.105
Neg logit contributions
rob-0.046
 ours-0.037
bat-0.035
use-0.033
 sees-0.033
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
rob+0.430
 says+0.402
 hungry+0.391
 circus+0.385
 ours+0.384
Neg logit contributions
ant-0.499
antly-0.494
 efforts-0.491
ively-0.486
ered-0.471
Token Lily
Token ID20037
Feature activation-4.75e-01

Pos logit contributions
ant+0.081
antly+0.080
ered+0.079
ively+0.079
 efforts+0.079
Neg logit contributions
 says-0.042
 hungry-0.040
rob-0.040
 circus-0.038
 ours-0.037
Token realized
Token ID6939
Feature activation+5.74e+00

Pos logit contributions
rob+0.024
 ours+0.022
 says+0.022
 circus+0.022
 hungry+0.022
Neg logit contributions
antly-0.023
ant-0.023
 efforts-0.023
ively-0.022
ered-0.022
Token that
Token ID326
Feature activation+5.13e+00

Pos logit contributions
ant+0.247
antly+0.243
 efforts+0.242
ively+0.241
ered+0.235
Neg logit contributions
rob-0.328
 says-0.306
 hungry-0.300
 circus-0.299
 ours-0.298
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
 efforts+0.250
ant+0.245
ively+0.240
antly+0.235
 fingertips+0.231
Neg logit contributions
rob-0.264
 ours-0.229
 says-0.228
 circus-0.228
use-0.224
Tokenn
Token ID77
Feature activation+4.52e+00

Pos logit contributions
rob+0.846
 says+0.795
 circus+0.750
 ours+0.749
 fun+0.728
Neg logit contributions
antly-0.672
 fingertips-0.660
 efforts-0.645
ant-0.623
ively-0.621
Tokenics
Token ID873
Feature activation+1.29e+00

Pos logit contributions
 fingertips+0.235
 efforts+0.235
antly+0.230
ant+0.224
ively+0.214
Neg logit contributions
rob-0.213
 says-0.211
 sees-0.200
 hungry-0.199
 wants-0.196
Token were
Token ID547
Feature activation+5.15e+00

Pos logit contributions
ant+0.065
antly+0.064
 efforts+0.063
ively+0.063
 fingertips+0.061
Neg logit contributions
rob-0.063
 says-0.061
 hungry-0.059
 circus-0.058
 sleeping-0.057
Token not
Token ID407
Feature activation+3.23e+00

Pos logit contributions
ant+0.292
antly+0.291
 efforts+0.291
ively+0.287
ered+0.280
Neg logit contributions
rob-0.225
 says-0.207
 hungry-0.197
 circus-0.196
 ours-0.192
Token so
Token ID523
Feature activation-6.45e-02

Pos logit contributions
 efforts+0.176
ively+0.169
ant+0.165
ered+0.165
antly+0.159
Neg logit contributions
rob-0.161
 says-0.137
 circus-0.129
 sees-0.129
use-0.125
Token the
Token ID262
Feature activation+1.94e+00

Pos logit contributions
ant+0.026
antly+0.025
 efforts+0.025
ively+0.025
ered+0.024
Neg logit contributions
rob-0.027
 says-0.025
 hungry-0.025
 circus-0.024
 ours-0.024
Token warm
Token ID5814
Feature activation-2.38e+00

Pos logit contributions
antly+0.094
 bottom+0.090
 Days+0.090
 Everyone+0.089
 Everybody+0.088
Neg logit contributions
rob-0.097
 says-0.090
 yells-0.090
 ours-0.086
use-0.086
Token sunshine
Token ID34488
Feature activation+9.39e-01

Pos logit contributions
rob+0.107
 says+0.099
 ours+0.097
 hungry+0.096
 yells+0.092
Neg logit contributions
antly-0.131
 efforts-0.128
ant-0.127
ively-0.126
ered-0.123
Token and
Token ID290
Feature activation-1.52e+00

Pos logit contributions
ant+0.051
antly+0.051
 efforts+0.051
ively+0.050
 fingertips+0.048
Neg logit contributions
rob-0.042
 says-0.039
 ours-0.038
 hungry-0.038
 circus-0.037
Token having
Token ID1719
Feature activation+1.40e+00

Pos logit contributions
rob+0.063
 ours+0.056
 says+0.056
 circus+0.053
P+0.053
Neg logit contributions
antly-0.087
ant-0.087
 efforts-0.086
ively-0.085
ered-0.085
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
 efforts+0.102
antly+0.100
ant+0.099
ively+0.099
ered+0.099
Neg logit contributions
rob-0.035
 says-0.031
 ours-0.028
 sees-0.028
 hungry-0.027
Tokenn
Token ID77
Feature activation+4.56e+00

Pos logit contributions
rob+0.886
 circus+0.802
 says+0.774
 sleeping+0.772
 ours+0.763
Neg logit contributions
antly-0.649
ively-0.612
 efforts-0.609
 fingertips-0.607
ant-0.600
Tokenics
Token ID873
Feature activation+1.26e+00

Pos logit contributions
antly+0.243
 efforts+0.242
 fingertips+0.238
ant+0.236
ively+0.232
Neg logit contributions
rob-0.211
 says-0.198
 hungry-0.190
 sleeping-0.188
 ours-0.183
Token with
Token ID351
Feature activation-4.05e-01

Pos logit contributions
ant+0.067
antly+0.067
 efforts+0.066
ively+0.065
ered+0.063
Neg logit contributions
rob-0.059
 says-0.056
 hungry-0.053
 circus-0.053
 ours-0.053
Token her
Token ID607
Feature activation-4.14e+00

Pos logit contributions
 hungry+0.012
rob+0.012
 says+0.012
 sleeping+0.011
 circus+0.011
Neg logit contributions
antly-0.028
ant-0.028
 efforts-0.027
ively-0.027
ered-0.027
Token family
Token ID1641
Feature activation+4.87e+00

Pos logit contributions
rob+0.203
 says+0.193
 hungry+0.190
 circus+0.186
 sleeping+0.183
Neg logit contributions
ant-0.211
antly-0.210
ively-0.207
 efforts-0.205
ered-0.201
Token roof
Token ID9753
Feature activation+5.25e+00

Pos logit contributions
 efforts+0.114
antly+0.110
ant+0.109
ively+0.109
ered+0.108
Neg logit contributions
rob-0.080
 says-0.068
 scar-0.063
 hungry-0.063
 ours-0.062
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
ant+0.324
antly+0.318
 efforts+0.312
ively+0.311
 fingertips+0.308
Neg logit contributions
rob-0.206
 says-0.195
 hungry-0.195
 sleeping-0.185
 waiting-0.184
Token They
Token ID1119
Feature activation+4.19e-01

Pos logit contributions
ered+0.089
ant+0.084
antly+0.084
bled+0.083
ved+0.081
Neg logit contributions
 says-0.033
 wants-0.030
 hungry-0.028
 circus-0.028
 pic-0.028
Token would
Token ID561
Feature activation+4.13e-01

Pos logit contributions
antly+0.024
ant+0.024
ively+0.023
 efforts+0.023
ainer+0.023
Neg logit contributions
rob-0.018
 says-0.018
 hungry-0.018
 circus-0.017
 sees-0.017
Token have
Token ID423
Feature activation+2.17e+00

Pos logit contributions
antly+0.021
ant+0.021
ively+0.021
 efforts+0.021
ered+0.020
Neg logit contributions
rob-0.020
 says-0.019
 hungry-0.019
 circus-0.019
 ours-0.018
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
ant+0.120
antly+0.119
ively+0.117
 efforts+0.117
ered+0.113
Neg logit contributions
rob-0.099
 says-0.093
 hungry-0.091
 circus-0.089
 ours-0.088
Tokenn
Token ID77
Feature activation+4.54e+00

Pos logit contributions
rob+0.766
 fun+0.700
 circus+0.693
 ours+0.685
 says+0.684
Neg logit contributions
antly-0.712
ively-0.688
 fingertips-0.678
ant-0.668
 efforts-0.653
Tokenics
Token ID873
Feature activation+1.24e+00

Pos logit contributions
antly+0.229
ant+0.227
 efforts+0.224
ively+0.222
 fingertips+0.219
Neg logit contributions
rob-0.225
 says-0.212
 hungry-0.207
 ours-0.201
 circus-0.201
Token and
Token ID290
Feature activation-1.47e+00

Pos logit contributions
ant+0.078
antly+0.077
ively+0.077
 efforts+0.076
ered+0.075
Neg logit contributions
rob-0.046
 says-0.043
 hungry-0.042
 circus-0.041
 sleeping-0.040
Token play
Token ID711
Feature activation-1.21e+00

Pos logit contributions
rob+0.063
 says+0.060
 hungry+0.059
 circus+0.057
 ours+0.056
Neg logit contributions
ant-0.084
antly-0.084
ively-0.082
 efforts-0.082
ered-0.079
Token games
Token ID1830
Feature activation-8.56e-01

Pos logit contributions
rob+0.045
 says+0.041
 circus+0.041
 hungry+0.041
 sleeping+0.039
Neg logit contributions
ant-0.078
 efforts-0.076
antly-0.076
ively-0.076
 fingertips-0.074
Token They
Token ID1119
Feature activation+6.05e-02

Pos logit contributions
ant+0.079
ered+0.077
 efforts+0.075
antly+0.074
 fingertips+0.073
Neg logit contributions
 says-0.048
 hungry-0.042
rob-0.041
 ours-0.040
 late-0.040
Token missed
Token ID6825
Feature activation+2.90e+00

Pos logit contributions
ant+0.004
ainer+0.004
 fingertips+0.003
antly+0.003
ively+0.003
Neg logit contributions
 says-0.003
 hungry-0.003
 sees-0.002
 scar-0.002
 scary-0.002
Token their
Token ID511
Feature activation-3.88e+00

Pos logit contributions
ant+0.165
antly+0.165
 efforts+0.162
ively+0.162
ered+0.157
Neg logit contributions
rob-0.125
 says-0.118
 hungry-0.116
 circus-0.112
 ours-0.111
Token walks
Token ID11114
Feature activation-8.43e+00

Pos logit contributions
 hungry+0.157
 scary+0.153
 sleeping+0.153
 scar+0.148
 fun+0.146
Neg logit contributions
antly-0.235
ant-0.231
ively-0.229
ered-0.221
ainer-0.220
Token and
Token ID290
Feature activation-1.60e+00

Pos logit contributions
rob+0.297
 hungry+0.289
 says+0.280
 sleeping+0.276
 scary+0.273
Neg logit contributions
ant-0.529
 fingertips-0.515
antly-0.513
ively-0.510
ered-0.509
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
 says+0.072
 hungry+0.071
rob+0.071
 sleeping+0.068
 sees+0.067
Neg logit contributions
ant-0.089
antly-0.086
ively-0.085
 efforts-0.082
ainer-0.081
Tokenn
Token ID77
Feature activation+4.52e+00

Pos logit contributions
rob+0.622
use+0.573
 says+0.555
 your+0.555
 ours+0.553
Neg logit contributions
 fingertips-0.835
antly-0.737
growth-0.729
ices-0.726
 Days-0.722
Tokenics
Token ID873
Feature activation+1.21e+00

Pos logit contributions
 fingertips+0.240
 efforts+0.227
ant+0.222
antly+0.219
growth+0.218
Neg logit contributions
 says-0.208
rob-0.207
 sees-0.192
 sleeping-0.189
 ours-0.188
Token near
Token ID1474
Feature activation-1.36e+00

Pos logit contributions
ant+0.075
ively+0.072
 fingertips+0.070
antly+0.070
ered+0.069
Neg logit contributions
 hungry-0.045
 says-0.044
rob-0.044
 sleeping-0.043
 waiting-0.042
Token the
Token ID262
Feature activation+1.83e+00

Pos logit contributions
 hungry+0.048
 says+0.048
 sleeping+0.047
 waiting+0.046
 scary+0.045
Neg logit contributions
ant-0.086
antly-0.083
ered-0.083
 efforts-0.082
ively-0.082
Token cliff
Token ID19516
Feature activation-7.55e-01

Pos logit contributions
antly+0.110
ant+0.110
 efforts+0.109
ively+0.109
ered+0.105
Neg logit contributions
rob-0.074
 says-0.067
 hungry-0.064
 circus-0.063
 ours-0.063
Token girl
Token ID2576
Feature activation-3.00e-01

Pos logit contributions
 efforts+0.268
 mummy+0.253
antly+0.249
ant+0.249
mates+0.246
Neg logit contributions
rob-0.268
 yells-0.242
 ours-0.235
 says-0.228
 shouts-0.222
Token named
Token ID3706
Feature activation-4.76e-01

Pos logit contributions
rob+0.014
 says+0.013
 hungry+0.012
 circus+0.012
 ours+0.012
Neg logit contributions
ant-0.016
antly-0.016
 efforts-0.016
ively-0.016
ered-0.016
Token Lily
Token ID20037
Feature activation-1.71e-01

Pos logit contributions
rob+0.017
 says+0.015
 hungry+0.015
 circus+0.014
 ours+0.014
Neg logit contributions
ant-0.031
antly-0.031
 efforts-0.031
ively-0.031
ered-0.030
Token who
Token ID508
Feature activation+4.08e+00

Pos logit contributions
rob+0.008
 says+0.007
 hungry+0.007
 circus+0.007
 ours+0.007
Neg logit contributions
ant-0.009
antly-0.009
 efforts-0.009
ively-0.009
ered-0.009
Token loved
Token ID6151
Feature activation+3.70e+00

Pos logit contributions
antly+0.173
ant+0.172
 efforts+0.169
ered+0.164
ively+0.162
Neg logit contributions
rob-0.235
 says-0.217
 ours-0.216
 circus-0.212
 scar-0.205
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
 efforts+0.140
 fingertips+0.125
ant+0.118
 Everyone+0.116
 everybody+0.115
Neg logit contributions
rob-0.228
 says-0.204
use-0.199
 ours-0.195
 yells-0.193
Tokenn
Token ID77
Feature activation+4.57e+00

Pos logit contributions
rob+0.839
 circus+0.751
 says+0.741
 sleeping+0.731
 ours+0.722
Neg logit contributions
antly-0.685
 efforts-0.661
ively-0.639
 fingertips-0.633
ant-0.627
Tokenics
Token ID873
Feature activation+1.22e+00

Pos logit contributions
antly+0.257
 efforts+0.255
 fingertips+0.248
ant+0.244
ively+0.238
Neg logit contributions
rob-0.199
 sleeping-0.183
 hungry-0.180
 says-0.180
 waiting-0.173
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
antly+0.056
ant+0.056
ively+0.055
 efforts+0.055
mates+0.052
Neg logit contributions
rob-0.065
 says-0.061
 ours-0.059
 circus-0.058
 yells-0.058
Token She
Token ID1375
Feature activation-1.81e+00

Pos logit contributions
ant+0.075
antly+0.075
 efforts+0.074
ively+0.074
ered+0.072
Neg logit contributions
rob-0.050
 says-0.046
 hungry-0.045
 circus-0.044
 ours-0.043
Token would
Token ID561
Feature activation+6.25e-01

Pos logit contributions
rob+0.099
 ours+0.096
 circus+0.090
 says+0.090
 yells+0.089
Neg logit contributions
antly-0.078
 efforts-0.075
ered-0.073
ant-0.072
zed-0.070
Token favourite
Token ID12507
Feature activation+6.57e+00

Pos logit contributions
rob+0.141
 says+0.141
 hungry+0.141
 circus+0.138
 sleeping+0.135
Neg logit contributions
ant-0.201
antly-0.198
ively-0.196
ered-0.191
 efforts-0.189
Token place
Token ID1295
Feature activation+1.69e+00

Pos logit contributions
ant+0.386
antly+0.380
ively+0.378
 efforts+0.370
ered+0.367
Neg logit contributions
rob-0.270
 says-0.256
 circus-0.252
 hungry-0.251
 sleeping-0.246
Token to
Token ID284
Feature activation-1.12e+00

Pos logit contributions
ant+0.100
antly+0.098
ively+0.097
 efforts+0.097
ered+0.094
Neg logit contributions
rob-0.067
 says-0.065
 hungry-0.064
 sleeping-0.062
 ours-0.062
Token go
Token ID467
Feature activation-1.85e+00

Pos logit contributions
rob+0.049
 says+0.048
 circus+0.048
 hungry+0.047
 sleeping+0.046
Neg logit contributions
ant-0.062
antly-0.062
 efforts-0.061
ively-0.060
 fingertips-0.058
Token for
Token ID329
Feature activation-5.24e+00

Pos logit contributions
rob+0.098
 says+0.091
 circus+0.087
 ours+0.087
 hungry+0.087
Neg logit contributions
antly-0.090
ively-0.086
ered-0.084
 efforts-0.083
ant-0.083
Token pic
Token ID8301
Feature activation-1.51e+01

Pos logit contributions
rob+0.227
 hungry+0.215
 says+0.214
 circus+0.212
 sleeping+0.211
Neg logit contributions
ant-0.303
antly-0.296
ively-0.290
 efforts-0.286
 fingertips-0.279
Tokenn
Token ID77
Feature activation+4.53e+00

Pos logit contributions
rob+0.815
 circus+0.768
 sleeping+0.758
 ours+0.746
 says+0.735
Neg logit contributions
antly-0.655
 fingertips-0.637
ant-0.636
ively-0.629
 efforts-0.617
Tokenics
Token ID873
Feature activation+1.21e+00

Pos logit contributions
ant+0.247
antly+0.246
 efforts+0.243
ively+0.240
 fingertips+0.235
Neg logit contributions
rob-0.203
 says-0.192
 hungry-0.187
 sleeping-0.182
 ours-0.181
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
ant+0.065
antly+0.065
 efforts+0.064
ively+0.064
ered+0.062
Neg logit contributions
rob-0.056
 says-0.052
 hungry-0.051
 circus-0.050
 ours-0.050
Token<|endoftext|>
Token ID50256
Feature activation-1.39e-01

Pos logit contributions
ant+0.080
ered+0.078
 efforts+0.077
ively+0.075
antly+0.075
Neg logit contributions
 says-0.045
rob-0.043
 hungry-0.040
 circus-0.039
 ours-0.037
TokenOnce
Token ID7454
Feature activation-3.44e+00

Pos logit contributions
rob+0.005
 ours+0.004
 says+0.004
 hungry+0.004
 circus+0.004
Neg logit contributions
antly-0.009
ively-0.009
ant-0.009
 efforts-0.009
ered-0.009
Token children
Token ID1751
Feature activation+2.30e+00

Pos logit contributions
ant+0.276
antly+0.274
 efforts+0.269
ively+0.268
ered+0.261
Neg logit contributions
rob-0.224
 says-0.211
 hungry-0.209
 circus-0.205
 ours-0.200
Token running
Token ID2491
Feature activation-6.04e+00

Pos logit contributions
ant+0.144
antly+0.142
 efforts+0.142
ively+0.141
ered+0.138
Neg logit contributions
rob-0.087
 says-0.078
 hungry-0.075
 ours-0.074
 circus-0.074
Token around
Token ID1088
Feature activation+2.95e+00

Pos logit contributions
rob+0.308
 says+0.286
 hungry+0.275
 ours+0.273
 circus+0.272
Neg logit contributions
antly-0.298
ant-0.297
ively-0.294
 efforts-0.292
ered-0.284
Token,
Token ID11
Feature activation+1.69e+00

Pos logit contributions
ant+0.170
antly+0.169
 efforts+0.168
ively+0.167
ered+0.162
Neg logit contributions
rob-0.126
 says-0.116
 hungry-0.113
 ours-0.111
 circus-0.110
Token families
Token ID4172
Feature activation+6.94e+00

Pos logit contributions
 efforts+0.100
antly+0.098
ant+0.098
ively+0.097
 fingertips+0.095
Neg logit contributions
rob-0.071
 says-0.058
 ours-0.058
P-0.058
ww-0.056
Token pic
Token ID8301
Feature activation-1.50e+01

Pos logit contributions
ant+0.381
 efforts+0.377
ively+0.376
antly+0.376
ered+0.363
Neg logit contributions
rob-0.316
 says-0.281
 ours-0.279
 hungry-0.270
 circus-0.269
Tokenn
Token ID77
Feature activation+4.48e+00

Pos logit contributions
rob+0.912
 circus+0.857
 says+0.835
 sleeping+0.820
 ours+0.817
Neg logit contributions
antly-0.596
 efforts-0.579
 fingertips-0.564
ant-0.559
ively-0.549
Tokenicking
Token ID7958
Feature activation+1.34e+01

Pos logit contributions
antly+0.229
 efforts+0.229
ant+0.228
 fingertips+0.223
ively+0.221
Neg logit contributions
rob-0.210
 says-0.202
 hungry-0.197
 sleeping-0.195
 sees-0.189
Token,
Token ID11
Feature activation+1.73e+00

Pos logit contributions
antly+0.749
ant+0.744
 efforts+0.743
ively+0.734
ered+0.720
Neg logit contributions
rob-0.610
 says-0.537
 circus-0.521
 hungry-0.518
 ours-0.516
Token couples
Token ID11886
Feature activation+4.98e+00

Pos logit contributions
 efforts+0.103
ant+0.100
antly+0.099
ively+0.098
 fingertips+0.097
Neg logit contributions
rob-0.073
 says-0.062
 ours-0.060
P-0.059
ww-0.058
Token st
Token ID336
Feature activation-1.17e+01

Pos logit contributions
ant+0.291
antly+0.289
 efforts+0.288
ively+0.285
ered+0.276
Neg logit contributions
rob-0.205
 says-0.184
 ours-0.181
 hungry-0.174
 circus-0.171
Token and
Token ID290
Feature activation-1.46e+00

Pos logit contributions
rob+0.107
 says+0.104
 hungry+0.102
 circus+0.099
 sleeping+0.098
Neg logit contributions
antly-0.157
ant-0.156
 efforts-0.154
ively-0.153
ered-0.149
Token it
Token ID340
Feature activation+2.04e+00

Pos logit contributions
 says+0.063
rob+0.061
 hungry+0.060
 circus+0.058
 wants+0.057
Neg logit contributions
ant-0.083
antly-0.083
ively-0.082
 efforts-0.080
ainer-0.078
Token was
Token ID373
Feature activation+3.93e+00

Pos logit contributions
ant+0.108
antly+0.107
 efforts+0.106
ively+0.106
 fingertips+0.103
Neg logit contributions
rob-0.096
 says-0.093
 hungry-0.090
 circus-0.087
 sleeping-0.086
Token going
Token ID1016
Feature activation-1.25e+01

Pos logit contributions
antly+0.214
ant+0.214
 efforts+0.213
ively+0.210
ered+0.206
Neg logit contributions
rob-0.182
 says-0.166
 circus-0.159
 hungry-0.158
 ours-0.155
Token on
Token ID319
Feature activation-5.24e+00

Pos logit contributions
rob+0.580
 says+0.533
 circus+0.500
 ours+0.483
 sees+0.480
Neg logit contributions
antly-0.704
ively-0.674
ant-0.672
 efforts-0.668
ered-0.662
Token pic
Token ID8301
Feature activation-1.50e+01

Pos logit contributions
rob+0.226
 hungry+0.213
 circus+0.210
 sleeping+0.208
 says+0.207
Neg logit contributions
ant-0.303
 efforts-0.292
antly-0.292
ively-0.290
 fingertips-0.288
Tokenn
Token ID77
Feature activation+4.43e+00

Pos logit contributions
rob+0.821
 circus+0.773
 says+0.749
 ours+0.746
 fun+0.740
Neg logit contributions
antly-0.668
 fingertips-0.664
 efforts-0.649
ant-0.634
ively-0.625
Tokenics
Token ID873
Feature activation+1.16e+00

Pos logit contributions
antly+0.234
 efforts+0.232
ant+0.230
 fingertips+0.226
ively+0.220
Neg logit contributions
rob-0.205
 says-0.198
 hungry-0.189
 sleeping-0.186
 ours-0.182
Token!
Token ID0
Feature activation-8.25e-02

Pos logit contributions
ant+0.062
antly+0.062
 efforts+0.061
ively+0.061
ered+0.059
Neg logit contributions
rob-0.054
 says-0.050
 hungry-0.048
 circus-0.048
 ours-0.047
Token
Token ID198
Feature activation-1.29e+00

Pos logit contributions
rob+0.004
 hungry+0.004
 ours+0.004
 sleeping+0.004
 circus+0.004
Neg logit contributions
antly-0.004
ant-0.004
 efforts-0.004
ively-0.004
 fingertips-0.004
Token
Token ID198
Feature activation-1.28e+00

Pos logit contributions
rob+0.054
 says+0.050
 hungry+0.048
 circus+0.047
 ours+0.046
Neg logit contributions
ant-0.076
antly-0.075
 efforts-0.074
ively-0.074
ered-0.072
Token in
Token ID287
Feature activation+5.68e-01

Pos logit contributions
rob+0.053
 says+0.049
 ours+0.048
 hungry+0.047
 circus+0.046
Neg logit contributions
antly-0.066
ant-0.066
 efforts-0.065
ively-0.065
ered-0.064
Token the
Token ID262
Feature activation+2.11e+00

Pos logit contributions
ant+0.028
antly+0.028
 efforts+0.027
ively+0.027
ered+0.026
Neg logit contributions
rob-0.029
 says-0.027
 hungry-0.027
 circus-0.026
 ours-0.026
Token garden
Token ID11376
Feature activation-1.13e+00

Pos logit contributions
antly+0.104
 Days+0.099
ively+0.099
ered+0.098
 efforts+0.097
Neg logit contributions
rob-0.119
 yells-0.093
 says-0.093
 ours-0.090
 blows-0.090
Token and
Token ID290
Feature activation-1.42e+00

Pos logit contributions
rob+0.055
 says+0.050
 ours+0.049
 circus+0.048
 hungry+0.048
Neg logit contributions
ant-0.058
antly-0.058
 efforts-0.057
ively-0.057
ered-0.055
Token have
Token ID423
Feature activation+2.21e+00

Pos logit contributions
rob+0.068
 says+0.061
 hungry+0.059
 circus+0.059
 ours+0.058
Neg logit contributions
ant-0.075
 efforts-0.074
antly-0.074
ively-0.074
ered-0.072
Token pic
Token ID8301
Feature activation-1.50e+01

Pos logit contributions
 efforts+0.152
ered+0.149
antly+0.147
growth+0.143
 fingertips+0.142
Neg logit contributions
rob-0.064
 says-0.054
use-0.050
 yells-0.050
 ours-0.050
Tokenn
Token ID77
Feature activation+4.61e+00

Pos logit contributions
rob+0.810
 circus+0.744
 says+0.737
 ours+0.735
 fun+0.717
Neg logit contributions
antly-0.685
 fingertips-0.658
ively-0.649
 efforts-0.637
ant-0.620
Tokenics
Token ID873
Feature activation+1.34e+00

Pos logit contributions
antly+0.260
 efforts+0.259
 fingertips+0.257
ant+0.248
ively+0.245
Neg logit contributions
rob-0.199
 says-0.188
 hungry-0.179
 sleeping-0.177
 ours-0.176
Token together
Token ID1978
Feature activation-2.57e+00

Pos logit contributions
antly+0.070
ant+0.070
 efforts+0.069
ively+0.068
ered+0.066
Neg logit contributions
rob-0.064
 says-0.059
 hungry-0.056
 ours-0.056
 circus-0.056
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
rob+0.112
 says+0.105
 hungry+0.103
 circus+0.101
 ours+0.099
Neg logit contributions
ant-0.145
antly-0.144
 efforts-0.142
ively-0.142
ered-0.138
Token
Token ID220
Feature activation-2.13e-01

Pos logit contributions
ant+0.076
antly+0.075
 efforts+0.074
ively+0.074
ered+0.072
Neg logit contributions
rob-0.062
 says-0.059
 hungry-0.057
 circus-0.056
 ours-0.055
Token even
Token ID772
Feature activation+4.02e+00

Pos logit contributions
antly+0.308
ant+0.308
ively+0.299
 efforts+0.296
ainer+0.280
Neg logit contributions
 says-0.226
rob-0.223
 hungry-0.222
 circus-0.213
 ours-0.211
Token see
Token ID766
Feature activation+2.43e+00

Pos logit contributions
ant+0.215
antly+0.214
 efforts+0.212
ively+0.211
ered+0.205
Neg logit contributions
rob-0.188
 says-0.175
 hungry-0.171
 circus-0.168
 ours-0.166
Token other
Token ID584
Feature activation+6.21e+00

Pos logit contributions
ant+0.127
antly+0.126
 efforts+0.125
ively+0.123
ered+0.120
Neg logit contributions
rob-0.117
 says-0.110
 hungry-0.108
 circus-0.106
 ours-0.105
Token people
Token ID661
Feature activation+5.70e+00

Pos logit contributions
 efforts+0.355
ant+0.352
ively+0.346
antly+0.343
ered+0.342
Neg logit contributions
rob-0.268
 says-0.230
 hungry-0.227
P-0.223
 circus-0.220
Token having
Token ID1719
Feature activation+1.39e+00

Pos logit contributions
ively+0.311
ered+0.306
 efforts+0.306
ant+0.304
antly+0.298
Neg logit contributions
rob-0.264
P-0.233
ww-0.223
 says-0.216
 ours-0.214
Token pic
Token ID8301
Feature activation-1.50e+01

Pos logit contributions
 efforts+0.096
ered+0.095
antly+0.093
ively+0.093
ant+0.092
Neg logit contributions
rob-0.042
 says-0.035
 hungry-0.032
 blows-0.032
 ours-0.032
Tokenn
Token ID77
Feature activation+4.49e+00

Pos logit contributions
rob+0.897
 circus+0.824
 says+0.815
 ours+0.812
 sleeping+0.795
Neg logit contributions
antly-0.619
ant-0.586
ively-0.577
 efforts-0.568
 fingertips-0.561
Tokenics
Token ID873
Feature activation+1.30e+00

Pos logit contributions
antly+0.233
 efforts+0.230
ant+0.227
 fingertips+0.222
ively+0.221
Neg logit contributions
rob-0.216
 says-0.204
 hungry-0.196
 ours-0.194
 sleeping-0.191
Token.
Token ID13
Feature activation+8.10e-01

Pos logit contributions
antly+0.068
ant+0.068
 efforts+0.067
ively+0.066
ered+0.064
Neg logit contributions
rob-0.062
 says-0.058
 hungry-0.055
 circus-0.055
 ours-0.054
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
ant+0.052
ered+0.051
 efforts+0.049
ively+0.048
antly+0.048
Neg logit contributions
 says-0.033
 hungry-0.028
 sees-0.028
 wants-0.027
rob-0.027
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
rob+0.060
 says+0.060
 hungry+0.056
 sees+0.054
 sleeping+0.053
Neg logit contributions
ant-0.100
 efforts-0.099
antly-0.096
ively-0.096
 fingertips-0.095
Token to
Token ID284
Feature activation-1.20e+00

Pos logit contributions
rob+0.153
 says+0.144
 hungry+0.142
 circus+0.136
 ours+0.135
Neg logit contributions
ant-0.266
antly-0.263
ively-0.261
 efforts-0.260
ered-0.253
Token drop
Token ID4268
Feature activation+5.22e-01

Pos logit contributions
 says+0.046
 sleeping+0.044
 hungry+0.043
 scary+0.043
 mess+0.043
Neg logit contributions
antly-0.072
 efforts-0.071
 Days-0.069
ant-0.068
ively-0.066
Token it
Token ID340
Feature activation+2.13e+00

Pos logit contributions
 efforts+0.031
ant+0.030
antly+0.030
ively+0.029
 fingertips+0.029
Neg logit contributions
 says-0.020
 hungry-0.020
rob-0.019
 ours-0.019
 scary-0.019
Token on
Token ID319
Feature activation-5.06e+00

Pos logit contributions
antly+0.113
ant+0.113
 efforts+0.111
ively+0.111
ered+0.109
Neg logit contributions
rob-0.102
 says-0.093
 circus-0.091
 hungry-0.090
 ours-0.087
Token her
Token ID607
Feature activation-4.23e+00

Pos logit contributions
 says+0.175
 hungry+0.175
rob+0.173
 sleeping+0.173
 waiting+0.167
Neg logit contributions
ant-0.306
 efforts-0.305
antly-0.302
ered-0.293
ively-0.293
Token foot
Token ID2366
Feature activation+1.35e+01

Pos logit contributions
 says+0.205
 hungry+0.204
 sleeping+0.202
 circus+0.197
 sees+0.195
Neg logit contributions
ively-0.210
ant-0.209
antly-0.205
ainer-0.197
ered-0.196
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
ant+0.758
antly+0.755
 efforts+0.748
ively+0.743
ered+0.722
Neg logit contributions
rob-0.586
 says-0.552
 hungry-0.539
 circus-0.523
 ours-0.521
Token
Token ID198
Feature activation-1.35e+00

Pos logit contributions
ant+0.086
 efforts+0.084
ered+0.083
ively+0.082
ainer+0.081
Neg logit contributions
 says-0.043
 hungry-0.038
 sees-0.036
 wants-0.034
 barking-0.034
Token
Token ID198
Feature activation-1.35e+00

Pos logit contributions
 says+0.040
rob+0.036
 hungry+0.034
P+0.034
 sees+0.034
Neg logit contributions
 efforts-0.096
ant-0.094
 fingertips-0.091
ively-0.090
antly-0.090
Token"
Token ID1
Feature activation-5.84e-01

Pos logit contributions
P+0.057
 says+0.055
rob+0.053
 hungry+0.052
 sees+0.052
Neg logit contributions
 efforts-0.075
 fingertips-0.073
ant-0.072
ively-0.072
antly-0.071
TokenThat
Token ID2504
Feature activation+1.79e+00

Pos logit contributions
rob+0.023
 says+0.022
 hungry+0.021
 circus+0.021
 ours+0.020
Neg logit contributions
ant-0.035
antly-0.035
 efforts-0.035
ively-0.035
ered-0.034
Token and
Token ID290
Feature activation-1.44e+00

Pos logit contributions
ant+0.161
antly+0.158
ively+0.157
 efforts+0.157
 fingertips+0.152
Neg logit contributions
 says-0.095
rob-0.095
 hungry-0.094
 sleeping-0.088
 ours-0.087
Token helped
Token ID4193
Feature activation+1.64e+00

Pos logit contributions
rob+0.059
 says+0.058
 hungry+0.056
 sleeping+0.054
 scar+0.054
Neg logit contributions
ant-0.085
antly-0.084
ively-0.083
 efforts-0.081
 fingertips-0.080
Token make
Token ID787
Feature activation+6.33e+00

Pos logit contributions
 efforts+0.094
antly+0.092
growth+0.091
 fingertips+0.091
ainer+0.089
Neg logit contributions
 says-0.062
 scar-0.059
 barking-0.058
 scary-0.057
 hungry-0.055
Token food
Token ID2057
Feature activation+1.81e-01

Pos logit contributions
ant+0.362
antly+0.361
 efforts+0.355
ively+0.354
 fingertips+0.347
Neg logit contributions
rob-0.263
 says-0.261
 hungry-0.252
 scar-0.243
 sleeping-0.240
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
ant+0.010
antly+0.010
 efforts+0.010
ively+0.010
ered+0.010
Neg logit contributions
rob-0.008
 says-0.007
 hungry-0.007
 circus-0.007
 ours-0.007
Token Everyone
Token ID11075
Feature activation+1.41e+01

Pos logit contributions
 efforts+0.065
ant+0.063
ered+0.061
ively+0.061
antly+0.060
Neg logit contributions
 says-0.049
rob-0.046
 hungry-0.042
 barking-0.040
 yells-0.040
Token was
Token ID373
Feature activation+3.80e+00

Pos logit contributions
ant+0.736
 efforts+0.726
antly+0.720
ively+0.718
ainer+0.703
Neg logit contributions
 says-0.674
rob-0.662
 hungry-0.634
 sees-0.616
 sleeping-0.613
Token very
Token ID845
Feature activation-1.04e+00

Pos logit contributions
ant+0.282
antly+0.273
ively+0.270
growth+0.269
 efforts+0.263
Neg logit contributions
 hungry-0.125
 waiting-0.108
 sleeping-0.106
 says-0.100
 scary-0.098
Token happy
Token ID3772
Feature activation-2.54e+00

Pos logit contributions
rob+0.033
 hungry+0.032
 says+0.032
 ours+0.029
 circus+0.029
Neg logit contributions
ant-0.071
antly-0.070
ively-0.070
 efforts-0.069
ered-0.067
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
rob+0.104
 says+0.099
 hungry+0.096
 circus+0.094
 sleeping+0.092
Neg logit contributions
ant-0.150
antly-0.149
ively-0.147
 efforts-0.147
ered-0.143
Token
Token ID220
Feature activation-2.66e-01

Pos logit contributions
 efforts+0.072
ant+0.071
ered+0.069
ively+0.068
antly+0.068
Neg logit contributions
 says-0.052
rob-0.050
 hungry-0.045
 sees-0.043
 wants-0.043
Token a
Token ID257
Feature activation+1.46e+00

Pos logit contributions
ant+0.013
antly+0.012
ively+0.012
 fingertips+0.012
 efforts+0.012
Neg logit contributions
 hungry-0.009
 says-0.009
rob-0.009
 sleeping-0.009
 scary-0.008
Token hammer
Token ID15554
Feature activation+6.56e+00

Pos logit contributions
antly+0.080
 efforts+0.080
 fingertips+0.076
ant+0.075
ered+0.074
Neg logit contributions
rob-0.071
 says-0.061
 ours-0.060
 circus-0.059
 wants-0.057
Token and
Token ID290
Feature activation-1.68e+00

Pos logit contributions
ant+0.409
antly+0.407
 efforts+0.403
ively+0.401
ered+0.391
Neg logit contributions
rob-0.247
 says-0.227
 hungry-0.221
 circus-0.217
 ours-0.214
Token a
Token ID257
Feature activation+1.45e+00

Pos logit contributions
 says+0.069
rob+0.067
 hungry+0.065
 scar+0.063
 sees+0.062
Neg logit contributions
ant-0.099
ively-0.098
 efforts-0.096
antly-0.094
 fingertips-0.093
Token ch
Token ID442
Feature activation-1.10e+01

Pos logit contributions
ant+0.083
antly+0.083
 efforts+0.082
ively+0.081
ered+0.079
Neg logit contributions
rob-0.062
 says-0.057
 hungry-0.056
 circus-0.055
 ours-0.054
Tokenisel
Token ID36811
Feature activation+1.52e+01

Pos logit contributions
 says+0.671
 hungry+0.654
 sees+0.645
 blows+0.642
 mean+0.637
Neg logit contributions
antly-0.375
ively-0.332
ant-0.328
 Days-0.320
 fingertips-0.315
Token and
Token ID290
Feature activation-1.68e+00

Pos logit contributions
ant+0.906
antly+0.902
ively+0.895
 efforts+0.887
 fingertips+0.869
Neg logit contributions
rob-0.574
 says-0.573
 hungry-0.555
 sleeping-0.538
 sees-0.518
Token they
Token ID484
Feature activation-1.63e+00

Pos logit contributions
 says+0.072
rob+0.070
 hungry+0.069
 sleeping+0.065
 scar+0.065
Neg logit contributions
ant-0.097
ively-0.095
 efforts-0.093
antly-0.092
 fingertips-0.090
Token began
Token ID2540
Feature activation+5.88e+00

Pos logit contributions
 says+0.061
rob+0.057
 scar+0.057
 hungry+0.056
 ch+0.052
Neg logit contributions
ant-0.101
ively-0.100
 efforts-0.098
ainer-0.097
antly-0.095
Token to
Token ID284
Feature activation-1.28e+00

Pos logit contributions
ant+0.384
ively+0.378
antly+0.376
 efforts+0.370
ered+0.361
Neg logit contributions
rob-0.200
 hungry-0.194
 says-0.191
 sleeping-0.184
 circus-0.176
Token carefully
Token ID7773
Feature activation+5.65e+00

Pos logit contributions
rob+0.053
 says+0.053
 hungry+0.051
 scar+0.049
 sleeping+0.048
Neg logit contributions
ant-0.074
ively-0.074
antly-0.073
 efforts-0.072
 fingertips-0.069
Token Great
Token ID3878
Feature activation+1.73e-03

Pos logit contributions
ant+0.012
 efforts+0.012
antly+0.012
ively+0.012
ered+0.012
Neg logit contributions
rob-0.008
 says-0.007
 hungry-0.007
 circus-0.007
 ours-0.007
Token job
Token ID1693
Feature activation+7.48e+00

Pos logit contributions
ant+0.000
antly+0.000
 efforts+0.000
ively+0.000
ered+0.000
Neg logit contributions
rob-0.000
 says-0.000
 hungry-0.000
 circus-0.000
 ours-0.000
Token designing
Token ID18492
Feature activation+3.42e-01

Pos logit contributions
ant+0.499
antly+0.496
 efforts+0.493
ively+0.489
ered+0.480
Neg logit contributions
rob-0.252
 says-0.223
 hungry-0.219
 circus-0.213
 ours-0.208
Token that
Token ID326
Feature activation+5.22e+00

Pos logit contributions
ant+0.020
antly+0.020
 efforts+0.019
ively+0.019
ered+0.019
Neg logit contributions
rob-0.014
 says-0.014
 hungry-0.013
 circus-0.013
 ours-0.013
Token sund
Token ID37437
Feature activation-9.39e+00

Pos logit contributions
antly+0.302
 efforts+0.299
ant+0.294
ered+0.287
 fingertips+0.284
Neg logit contributions
rob-0.225
 scar-0.186
 circus-0.183
 says-0.179
 hungry-0.175
Tokenae
Token ID3609
Feature activation+1.25e+01

Pos logit contributions
rob+0.500
 says+0.486
P+0.483
ww+0.474
 circus+0.474
Neg logit contributions
 efforts-0.410
ant-0.402
ively-0.399
 fingertips-0.394
growth-0.393
Token!"
Token ID2474
Feature activation-5.22e+00

Pos logit contributions
antly+0.793
ant+0.793
 efforts+0.782
ively+0.775
ered+0.765
Neg logit contributions
rob-0.465
 says-0.412
 hungry-0.402
 circus-0.398
 ours-0.388
Token
Token ID220
Feature activation-3.16e-01

Pos logit contributions
rob+0.228
 says+0.219
 hungry+0.211
 circus+0.204
 ours+0.203
Neg logit contributions
ant-0.294
antly-0.290
 efforts-0.288
ively-0.286
ered-0.277
Token
Token ID198
Feature activation-1.38e+00

Pos logit contributions
 says+0.012
rob+0.011
 hungry+0.010
 sees+0.010
 ours+0.010
Neg logit contributions
 efforts-0.020
ant-0.020
antly-0.019
ively-0.019
 fingertips-0.019
Token
Token ID198
Feature activation-1.38e+00

Pos logit contributions
rob+0.055
 says+0.053
 hungry+0.049
 circus+0.048
 sleeping+0.047
Neg logit contributions
ant-0.084
 efforts-0.084
antly-0.082
ively-0.082
ered-0.079
TokenSarah
Token ID29284
Feature activation+7.50e+00

Pos logit contributions
rob+0.064
 says+0.064
 circus+0.060
P+0.060
 hungry+0.060
Neg logit contributions
 efforts-0.072
ant-0.070
antly-0.069
ively-0.069
 fingertips-0.068
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
ant+0.149
 efforts+0.145
antly+0.144
ively+0.144
 fingertips+0.141
Neg logit contributions
rob-0.092
 says-0.088
 hungry-0.085
 sleeping-0.083
 circus-0.082
Token
Token ID220
Feature activation-2.79e-01

Pos logit contributions
ant+0.068
ered+0.068
ively+0.067
 efforts+0.066
antly+0.064
Neg logit contributions
 says-0.033
rob-0.031
 hungry-0.030
 sleeping-0.028
 wants-0.028
Token
Token ID198
Feature activation-1.35e+00

Pos logit contributions
rob+0.010
 says+0.010
 hungry+0.009
 ours+0.009
 sleeping+0.008
Neg logit contributions
 efforts-0.018
ant-0.018
antly-0.018
ively-0.017
 fingertips-0.017
Token
Token ID198
Feature activation-1.36e+00

Pos logit contributions
rob+0.047
 says+0.044
 sleeping+0.042
 hungry+0.041
 circus+0.040
Neg logit contributions
 efforts-0.090
ant-0.089
ively-0.087
antly-0.086
 fingertips-0.086
TokenThe
Token ID464
Feature activation+8.18e+00

Pos logit contributions
rob+0.067
P+0.064
 says+0.062
 circus+0.062
 sleeping+0.061
Neg logit contributions
 efforts-0.067
 fingertips-0.067
ant-0.067
ively-0.066
antly-0.065
Token End
Token ID5268
Feature activation+1.17e+01

Pos logit contributions
ant+0.274
antly+0.270
ively+0.269
 efforts+0.252
ered+0.249
Neg logit contributions
rob-0.546
 says-0.535
 hungry-0.531
 circus-0.524
 sleeping-0.517
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
ant+0.695
ively+0.680
antly+0.678
 efforts+0.676
ered+0.664
Neg logit contributions
rob-0.491
 says-0.467
 hungry-0.445
 circus-0.435
 sleeping-0.429
Token<|endoftext|>
Token ID50256
Feature activation-2.11e-01

Pos logit contributions
ant+0.071
 efforts+0.069
ered+0.069
ively+0.068
antly+0.067
Neg logit contributions
rob-0.034
 says-0.033
 hungry-0.030
 sleeping-0.030
 circus-0.029
TokenOnce
Token ID7454
Feature activation-3.44e+00

Pos logit contributions
rob+0.006
 ours+0.006
 says+0.006
 hungry+0.006
 scar+0.006
Neg logit contributions
antly-0.015
ively-0.014
ant-0.014
 efforts-0.014
ered-0.014
Token upon
Token ID2402
Feature activation-5.26e+00

Pos logit contributions
rob+0.096
 says+0.086
 hungry+0.083
 circus+0.080
 ours+0.078
Neg logit contributions
ant-0.248
antly-0.247
 efforts-0.245
ively-0.244
ered-0.239
Token a
Token ID257
Feature activation+1.63e+00

Pos logit contributions
rob+0.232
 hungry+0.223
 says+0.221
 sleeping+0.216
 circus+0.213
Neg logit contributions
ant-0.297
antly-0.291
 efforts-0.290
ively-0.287
 fingertips-0.277
Token gl
Token ID1278
Feature activation-9.21e+00

Pos logit contributions
 hungry+0.324
 says+0.317
 sleeping+0.316
rob+0.314
 scary+0.311
Neg logit contributions
ant-0.468
antly-0.448
ively-0.446
 efforts-0.428
 fingertips-0.425
Tokenowed
Token ID6972
Feature activation+1.29e+01

Pos logit contributions
rob+0.497
 hungry+0.488
 ours+0.487
 scar+0.485
 sleeping+0.480
Neg logit contributions
growth-0.329
 fingertips-0.329
 efforts-0.323
antly-0.320
ively-0.311
Token brighter
Token ID26083
Feature activation+3.83e+00

Pos logit contributions
ant+0.797
 efforts+0.773
ively+0.768
 fingertips+0.756
ered+0.749
Neg logit contributions
 hungry-0.478
 sleeping-0.448
 scary-0.437
 waiting-0.431
 brown-0.425
Token and
Token ID290
Feature activation-1.55e+00

Pos logit contributions
ant+0.239
antly+0.238
 efforts+0.236
ively+0.234
ered+0.229
Neg logit contributions
rob-0.145
 says-0.133
 hungry-0.128
 circus-0.125
 ours-0.124
Token brighter
Token ID26083
Feature activation+3.84e+00

Pos logit contributions
rob+0.063
 says+0.057
 hungry+0.056
 circus+0.055
 ours+0.054
Neg logit contributions
antly-0.092
ant-0.091
 efforts-0.091
ively-0.090
ered-0.089
Token each
Token ID1123
Feature activation+7.73e+00

Pos logit contributions
antly+0.215
ant+0.215
 efforts+0.213
ively+0.211
ered+0.205
Neg logit contributions
rob-0.170
 says-0.157
 hungry-0.153
 circus-0.150
 ours-0.150
Token day
Token ID1110
Feature activation-1.81e+00

Pos logit contributions
ant+0.458
antly+0.453
 efforts+0.449
ively+0.445
ered+0.433
Neg logit contributions
rob-0.314
 says-0.297
 hungry-0.290
 circus-0.282
 sleeping-0.279
Token!
Token ID0
Feature activation-8.02e-02

Pos logit contributions
rob+0.075
 says+0.072
 hungry+0.071
 sleeping+0.068
 circus+0.068
Neg logit contributions
ant-0.104
antly-0.102
 efforts-0.101
ively-0.101
ered-0.098
Token<|endoftext|>
Token ID50256
Feature activation-4.27e-01

Pos logit contributions
 says+0.003
rob+0.003
 hungry+0.003
 circus+0.002
 sees+0.002
Neg logit contributions
ant-0.005
 efforts-0.005
ively-0.005
ered-0.005
antly-0.005
TokenOnce
Token ID7454
Feature activation-3.59e+00

Pos logit contributions
rob+0.013
 ours+0.012
 hungry+0.012
 says+0.012
 circus+0.011
Neg logit contributions
antly-0.030
ively-0.029
ant-0.029
 efforts-0.028
ered-0.028
Token upon
Token ID2402
Feature activation-5.40e+00

Pos logit contributions
rob+0.086
 says+0.082
 hungry+0.078
 ours+0.072
 sleeping+0.071
Neg logit contributions
ant-0.270
antly-0.268
 efforts-0.265
ively-0.264
ered-0.258
Token saw
Token ID2497
Feature activation-3.70e+00

Pos logit contributions
rob+0.141
 says+0.133
 hungry+0.129
 circus+0.126
 sleeping+0.123
Neg logit contributions
ant-0.199
antly-0.196
 efforts-0.195
ively-0.195
ered-0.188
Token something
Token ID1223
Feature activation+4.77e+00

Pos logit contributions
 hungry+0.141
 says+0.139
rob+0.134
 scary+0.132
 sleeping+0.131
Neg logit contributions
ant-0.231
 fingertips-0.230
 efforts-0.229
antly-0.227
ainer-0.227
Token fragile
Token ID21049
Feature activation-2.82e-01

Pos logit contributions
antly+0.266
 efforts+0.263
ively+0.261
ant+0.260
ered+0.259
Neg logit contributions
rob-0.219
 says-0.187
 circus-0.185
 ours-0.182
 hungry-0.176
Token on
Token ID319
Feature activation-5.19e+00

Pos logit contributions
rob+0.013
 says+0.012
 hungry+0.012
 circus+0.011
 ours+0.011
Neg logit contributions
antly-0.015
ant-0.015
ively-0.015
 efforts-0.015
ered-0.015
Token the
Token ID262
Feature activation+1.88e+00

Pos logit contributions
 hungry+0.174
 scary+0.172
 ours+0.172
 circus+0.169
 says+0.167
Neg logit contributions
ant-0.329
 efforts-0.327
antly-0.326
 fingertips-0.313
ered-0.312
Token ground
Token ID2323
Feature activation+4.88e+00

Pos logit contributions
ant+0.097
 efforts+0.094
antly+0.094
ively+0.093
ered+0.092
Neg logit contributions
rob-0.092
 says-0.087
 hungry-0.087
 circus-0.087
 ours-0.085
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
ant+0.275
antly+0.272
 efforts+0.270
ively+0.267
ered+0.260
Neg logit contributions
rob-0.211
 says-0.199
 hungry-0.195
 circus-0.190
 ours-0.187
Token It
Token ID632
Feature activation+5.31e+00

Pos logit contributions
ant+0.067
 efforts+0.064
ered+0.063
ively+0.063
antly+0.062
Neg logit contributions
 says-0.046
 hungry-0.045
rob-0.044
 circus-0.041
 sleeping-0.041
Token was
Token ID373
Feature activation+3.76e+00

Pos logit contributions
ant+0.272
 efforts+0.265
ively+0.263
antly+0.256
 fingertips+0.252
Neg logit contributions
rob-0.261
 says-0.260
 hungry-0.250
 sleeping-0.243
 scary-0.241
Token the
Token ID262
Feature activation+1.88e+00

Pos logit contributions
ant+0.209
antly+0.205
ively+0.203
 efforts+0.201
ered+0.195
Neg logit contributions
rob-0.165
 hungry-0.160
 says-0.159
 sleeping-0.154
 ours-0.153
Token mouse
Token ID10211
Feature activation+5.49e+00

Pos logit contributions
antly+0.104
 efforts+0.103
ant+0.103
ively+0.101
 fingertips+0.098
Neg logit contributions
rob-0.085
 says-0.078
 hungry-0.074
 circus-0.073
 ours-0.073
Token,
Token ID11
Feature activation+1.24e+00

Pos logit contributions
antly+0.347
ant+0.347
ered+0.337
 efforts+0.332
ively+0.329
Neg logit contributions
rob-0.556
 circus-0.483
 scar-0.466
 hungry-0.461
 ours-0.459
Token I
Token ID314
Feature activation-4.71e+00

Pos logit contributions
 efforts+0.063
ant+0.062
antly+0.061
 fingertips+0.061
ered+0.060
Neg logit contributions
rob-0.063
 says-0.052
 circus-0.051
 hungry-0.050
P-0.049
Token found
Token ID1043
Feature activation+5.71e-01

Pos logit contributions
rob+0.226
 circus+0.199
 ours+0.198
 says+0.195
 hungry+0.190
Neg logit contributions
antly-0.249
ant-0.249
 efforts-0.249
ered-0.243
ively-0.239
Token a
Token ID257
Feature activation+1.23e+00

Pos logit contributions
ant+0.028
antly+0.028
ively+0.028
 efforts+0.027
ered+0.027
Neg logit contributions
rob-0.030
 says-0.027
 circus-0.026
 hungry-0.026
 scar-0.025
Token yellow
Token ID7872
Feature activation-6.87e+00

Pos logit contributions
antly+0.068
 efforts+0.067
 fingertips+0.064
ant+0.063
ively+0.062
Neg logit contributions
rob-0.062
 says-0.050
Say-0.047
 yells-0.046
 circus-0.046
Token bulb
Token ID28287
Feature activation+5.18e+00

Pos logit contributions
rob+0.227
 hungry+0.227
 says+0.226
 sleeping+0.217
 circus+0.215
Neg logit contributions
ant-0.438
 efforts-0.434
ively-0.434
antly-0.428
ered-0.420
Token flower
Token ID15061
Feature activation+4.86e-02

Pos logit contributions
antly+0.246
ant+0.243
ively+0.237
 efforts+0.233
ered+0.226
Neg logit contributions
rob-0.278
 says-0.243
 circus-0.241
 hungry-0.236
 ours-0.234
Token for
Token ID329
Feature activation-5.27e+00

Pos logit contributions
antly+0.002
ant+0.002
ively+0.002
 efforts+0.002
 fingertips+0.002
Neg logit contributions
rob-0.003
 circus-0.002
 says-0.002
 yells-0.002
 scar-0.002
Token you
Token ID345
Feature activation-8.53e+00

Pos logit contributions
rob+0.184
 says+0.159
 circus+0.149
 hungry+0.149
 sees+0.144
Neg logit contributions
ant-0.351
 efforts-0.348
ively-0.345
antly-0.345
ered-0.339
Token!"
Token ID2474
Feature activation-5.80e+00

Pos logit contributions
rob+0.531
 circus+0.429
P+0.420
 says+0.411
 blows+0.410
Neg logit contributions
 efforts-0.385
ant-0.377
 fingertips-0.370
antly-0.367
ively-0.364
Token Mom
Token ID11254
Feature activation+7.20e+00

Pos logit contributions
rob+0.257
 says+0.243
 hungry+0.237
 circus+0.230
 ours+0.227
Neg logit contributions
ant-0.325
antly-0.322
 efforts-0.319
ively-0.318
ered-0.309
Token
Token ID198
Feature activation-1.25e+00

Pos logit contributions
 says+0.006
rob+0.006
 hungry+0.005
 sees+0.005
 sleeping+0.005
Neg logit contributions
 efforts-0.014
ant-0.014
 fingertips-0.013
antly-0.013
ively-0.013
Token
Token ID198
Feature activation-1.24e+00

Pos logit contributions
 says+0.041
rob+0.039
 hungry+0.038
 sleeping+0.036
 sees+0.035
Neg logit contributions
ant-0.083
 efforts-0.083
 fingertips-0.080
ively-0.079
antly-0.079
TokenAfter
Token ID3260
Feature activation+3.91e+00

Pos logit contributions
P+0.049
 says+0.049
 hungry+0.048
rob+0.046
 sleeping+0.045
Neg logit contributions
 efforts-0.071
ant-0.071
 fingertips-0.069
ively-0.068
antly-0.066
Token a
Token ID257
Feature activation+1.59e+00

Pos logit contributions
ant+0.238
antly+0.232
ainer+0.226
ively+0.226
 efforts+0.221
Neg logit contributions
 sleeping-0.159
 hungry-0.158
 waiting-0.156
 circus-0.152
rob-0.152
Token few
Token ID1178
Feature activation+4.53e+00

Pos logit contributions
ant+0.087
antly+0.085
ively+0.085
 efforts+0.084
ered+0.082
Neg logit contributions
rob-0.073
 says-0.069
 hungry-0.068
 circus-0.067
 sleeping-0.065
Token moments
Token ID7188
Feature activation+5.96e+00

Pos logit contributions
ant+0.241
ively+0.233
antly+0.232
ainer+0.224
 fingertips+0.223
Neg logit contributions
rob-0.215
 says-0.212
 hungry-0.206
 sleeping-0.203
 circus-0.203
Token,
Token ID11
Feature activation+1.88e+00

Pos logit contributions
ant+0.346
antly+0.343
 fingertips+0.332
ainer+0.329
 efforts+0.325
Neg logit contributions
 says-0.244
 waiting-0.237
 sleeping-0.233
 hungry-0.228
rob-0.227
Token Maggie
Token ID33014
Feature activation+6.45e+00

Pos logit contributions
ant+0.101
 efforts+0.096
eness+0.095
antly+0.092
ively+0.092
Neg logit contributions
 hungry-0.090
 says-0.087
 waiting-0.080
 sleeping-0.080
 sees-0.078
Token's
Token ID338
Feature activation-1.50e+00

Pos logit contributions
ant+0.369
ively+0.361
 efforts+0.360
antly+0.359
 fingertips+0.346
Neg logit contributions
 says-0.267
rob-0.266
 hungry-0.256
 sleeping-0.246
 sees-0.245
Token dad
Token ID9955
Feature activation+2.94e+00

Pos logit contributions
 hungry+0.100
 says+0.096
 sleeping+0.095
 sees+0.093
 circus+0.093
Neg logit contributions
ant-0.054
ively-0.052
antly-0.050
ved-0.044
ered-0.044
Token joined
Token ID5399
Feature activation-4.96e-01

Pos logit contributions
ant+0.190
ively+0.185
 efforts+0.185
antly+0.185
 fingertips+0.180
Neg logit contributions
 says-0.103
rob-0.101
 hungry-0.097
 sleeping-0.094
 sees-0.092
Token smiled
Token ID13541
Feature activation+1.40e+00

Pos logit contributions
ant+0.201
ively+0.201
 efforts+0.195
antly+0.192
ainer+0.190
Neg logit contributions
 says-0.171
rob-0.162
 hungry-0.162
 sleeping-0.157
 sees-0.157
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
ant+0.099
ively+0.096
ered+0.093
 efforts+0.093
antly+0.092
Neg logit contributions
 hungry-0.045
 says-0.042
 sleeping-0.039
 scary-0.039
 sees-0.037
Token ran
Token ID4966
Feature activation-3.27e+00

Pos logit contributions
rob+0.046
 says+0.042
 hungry+0.041
 circus+0.040
 ours+0.040
Neg logit contributions
ant-0.077
antly-0.077
 efforts-0.076
ively-0.075
ered-0.074
Token outside
Token ID2354
Feature activation-9.81e-01

Pos logit contributions
rob+0.127
 hungry+0.125
 says+0.125
 circus+0.120
 sleeping+0.119
Neg logit contributions
ant-0.198
 efforts-0.192
antly-0.191
ively-0.188
 fingertips-0.186
Token to
Token ID284
Feature activation-1.17e+00

Pos logit contributions
 hungry+0.032
 says+0.031
rob+0.031
 sleeping+0.031
 waiting+0.029
Neg logit contributions
ant-0.066
antly-0.065
ively-0.063
 efforts-0.063
ered-0.062
Token see
Token ID766
Feature activation+2.63e+00

Pos logit contributions
 says+0.044
rob+0.043
 hungry+0.042
 circus+0.042
 sees+0.041
Neg logit contributions
ant-0.073
antly-0.073
 efforts-0.072
ively-0.070
 fingertips-0.068
Token it
Token ID340
Feature activation+2.15e+00

Pos logit contributions
antly+0.144
ant+0.143
 efforts+0.143
ively+0.139
 fingertips+0.138
Neg logit contributions
 says-0.120
rob-0.117
 hungry-0.114
 ours-0.112
 circus-0.110
Token.
Token ID13
Feature activation+1.34e+00

Pos logit contributions
antly+0.108
ant+0.106
ively+0.105
 efforts+0.104
ered+0.103
Neg logit contributions
rob-0.111
 says-0.100
 circus-0.099
 hungry-0.099
 ours-0.098
Token
Token ID198
Feature activation-1.03e+00

Pos logit contributions
ant+0.088
 efforts+0.084
ively+0.084
ered+0.084
antly+0.082
Neg logit contributions
 says-0.049
 hungry-0.047
rob-0.045
 sees-0.043
 circus-0.042
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
 says+0.035
rob+0.034
 hungry+0.033
 sleeping+0.032
 sees+0.031
Neg logit contributions
ant-0.068
 efforts-0.068
antly-0.065
ively-0.065
 fingertips-0.064
TokenThe
Token ID464
Feature activation+8.32e+00

Pos logit contributions
rob+0.046
 hungry+0.045
P+0.045
 says+0.044
 circus+0.043
Neg logit contributions
ant-0.057
 efforts-0.057
antly-0.055
ively-0.055
 fingertips-0.055
Token a
Token ID257
Feature activation+1.42e+00

Pos logit contributions
ant+0.338
ively+0.331
antly+0.325
 fingertips+0.316
ered+0.311
Neg logit contributions
 hungry-0.192
 scary-0.189
 fun-0.183
 trash-0.169
 mean-0.167
Token mess
Token ID2085
Feature activation-9.44e+00

Pos logit contributions
antly+0.091
ant+0.091
 efforts+0.091
ively+0.090
ered+0.088
Neg logit contributions
rob-0.051
 says-0.046
 hungry-0.044
 circus-0.043
 ours-0.043
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
rob+0.417
 says+0.390
 hungry+0.382
 circus+0.372
 ours+0.368
Neg logit contributions
ant-0.524
antly-0.523
 efforts-0.516
ively-0.512
ered-0.499
Token
Token ID220
Feature activation-2.83e-01

Pos logit contributions
ant+0.066
 efforts+0.063
ively+0.062
ered+0.062
antly+0.062
Neg logit contributions
 says-0.045
 hungry-0.043
rob-0.041
 circus-0.039
 wants-0.039
Token
Token ID198
Feature activation-1.30e+00

Pos logit contributions
 says+0.009
rob+0.009
 hungry+0.009
 sleeping+0.008
 ours+0.008
Neg logit contributions
 efforts-0.019
ant-0.018
antly-0.018
 fingertips-0.018
ively-0.018
Token
Token ID198
Feature activation-1.29e+00

Pos logit contributions
rob+0.043
 says+0.042
 hungry+0.039
 sleeping+0.037
 circus+0.036
Neg logit contributions
 efforts-0.089
ant-0.088
 fingertips-0.085
ively-0.084
antly-0.084
TokenAnd
Token ID1870
Feature activation+3.70e+00

Pos logit contributions
rob+0.061
P+0.060
 hungry+0.058
 says+0.057
 circus+0.057
Neg logit contributions
 efforts-0.067
 fingertips-0.067
ant-0.065
ively-0.064
antly-0.062
Token that
Token ID326
Feature activation+5.08e+00

Pos logit contributions
ant+0.240
ainer+0.234
antly+0.228
 efforts+0.224
 fingertips+0.223
Neg logit contributions
 says-0.139
 hungry-0.132
 waiting-0.126
 sleeping-0.124
rob-0.123
Token's
Token ID338
Feature activation-1.62e+00

Pos logit contributions
antly+0.249
ant+0.247
 efforts+0.243
ively+0.239
ered+0.236
Neg logit contributions
rob-0.264
 says-0.240
 circus-0.239
 hungry-0.238
 ours-0.236
Token the
Token ID262
Feature activation+1.89e+00

Pos logit contributions
 hungry+0.083
 ours+0.082
 sleeping+0.082
 says+0.081
 waiting+0.081
Neg logit contributions
 fingertips-0.077
growth-0.075
antly-0.074
ant-0.074
ively-0.074
Token end
Token ID886
Feature activation+1.02e+01

Pos logit contributions
ant+0.107
ively+0.104
antly+0.104
ered+0.102
 fingertips+0.101
Neg logit contributions
rob-0.083
 hungry-0.080
 says-0.080
 circus-0.080
 scary-0.077
Token,
Token ID11
Feature activation+1.69e+00

Pos logit contributions
ant+0.399
ively+0.390
 efforts+0.389
antly+0.389
 fingertips+0.377
Neg logit contributions
rob-0.314
 says-0.306
 hungry-0.299
 sleeping-0.288
 circus-0.286
Token the
Token ID262
Feature activation+1.88e+00

Pos logit contributions
ant+0.104
antly+0.101
 efforts+0.101
ively+0.100
 fingertips+0.098
Neg logit contributions
rob-0.065
 hungry-0.062
 says-0.062
 sleeping-0.059
 circus-0.059
Token door
Token ID3420
Feature activation+6.61e-01

Pos logit contributions
ant+0.118
antly+0.118
 efforts+0.116
ively+0.116
ered+0.113
Neg logit contributions
rob-0.071
 says-0.065
 hungry-0.064
 circus-0.062
 ours-0.061
Token opened
Token ID4721
Feature activation+4.24e-01

Pos logit contributions
ant+0.041
 efforts+0.039
ainer+0.038
ively+0.038
antly+0.037
Neg logit contributions
 says-0.026
 sees-0.024
 scary-0.024
 hungry-0.024
 sleeping-0.023
Token and
Token ID290
Feature activation-1.50e+00

Pos logit contributions
antly+0.025
ant+0.025
ively+0.024
 efforts+0.024
ered+0.024
Neg logit contributions
rob-0.018
 says-0.016
 hungry-0.016
 circus-0.016
 ours-0.015
Token a
Token ID257
Feature activation+1.44e+00

Pos logit contributions
rob+0.062
 says+0.056
 hungry+0.055
 circus+0.055
 ours+0.054
Neg logit contributions
antly-0.088
ant-0.088
ively-0.087
 efforts-0.086
ered-0.085
Token little
Token ID1310
Feature activation+5.64e+00

Pos logit contributions
antly+0.091
 efforts+0.090
ant+0.090
ively+0.089
 fingertips+0.087
Neg logit contributions
rob-0.054
 says-0.048
 hungry-0.046
 ours-0.046
 circus-0.045
Token girl
Token ID2576
Feature activation-3.61e-01

Pos logit contributions
antly+0.333
 efforts+0.330
ively+0.321
ant+0.318
 fingertips+0.316
Neg logit contributions
rob-0.246
 says-0.202
 ours-0.200
 circus-0.194
P-0.191
Token came
Token ID1625
Feature activation-3.66e+00

Pos logit contributions
rob+0.015
 says+0.014
 hungry+0.013
 circus+0.013
 ours+0.013
Neg logit contributions
ant-0.021
antly-0.021
 efforts-0.021
ively-0.021
ered-0.020
Token in
Token ID287
Feature activation+6.05e-01

Pos logit contributions
rob+0.166
 says+0.151
 circus+0.147
 hungry+0.146
 ours+0.144
Neg logit contributions
antly-0.206
ant-0.206
ively-0.203
 efforts-0.201
ered-0.197
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
antly+0.032
 efforts+0.032
ant+0.032
ively+0.032
ered+0.031
Neg logit contributions
rob-0.029
 says-0.026
 hungry-0.026
 circus-0.025
 ours-0.025
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
rob+0.254
 says+0.238
 hungry+0.235
 circus+0.231
 sleeping+0.225
Neg logit contributions
ant-0.353
antly-0.351
 efforts-0.351
ively-0.343
ered-0.337
Token Lots
Token ID25024
Feature activation+1.87e+00

Pos logit contributions
ant+0.049
antly+0.049
 efforts+0.048
ively+0.047
ered+0.046
Neg logit contributions
rob-0.065
 says-0.064
 hungry-0.063
 circus-0.062
 ours-0.061
Token of
Token ID286
Feature activation+7.13e+00

Pos logit contributions
ively+0.047
ant+0.046
 efforts+0.045
antly+0.043
ered+0.042
Neg logit contributions
rob-0.142
 hungry-0.129
 scar-0.127
 ours-0.127
 circus-0.126
Token people
Token ID661
Feature activation+5.67e+00

Pos logit contributions
ively+0.316
ant+0.306
 efforts+0.304
 Everyone+0.303
ered+0.303
Neg logit contributions
rob-0.374
use-0.355
iw-0.343
 says-0.339
 sleeping-0.332
Token were
Token ID547
Feature activation+5.21e+00

Pos logit contributions
ively+0.274
ered+0.273
ant+0.270
antly+0.256
 efforts+0.252
Neg logit contributions
rob-0.290
 scar-0.261
 ours-0.257
 hungry-0.252
P-0.250
Token there
Token ID612
Feature activation-4.68e+00

Pos logit contributions
ered+0.297
ively+0.297
 efforts+0.290
ant+0.289
antly+0.280
Neg logit contributions
rob-0.229
use-0.191
iw-0.190
 scar-0.190
P-0.184
Token,
Token ID11
Feature activation+1.67e+00

Pos logit contributions
rob+0.206
 says+0.191
 hungry+0.185
 circus+0.182
 ours+0.180
Neg logit contributions
ant-0.263
antly-0.262
 efforts-0.259
ively-0.257
ered-0.250
Token and
Token ID290
Feature activation-1.48e+00

Pos logit contributions
ively+0.092
ant+0.092
antly+0.091
 efforts+0.091
ered+0.089
Neg logit contributions
rob-0.074
 says-0.065
 scar-0.061
 circus-0.061
P-0.061
Token they
Token ID484
Feature activation-1.60e+00

Pos logit contributions
rob+0.083
 says+0.080
 hungry+0.078
 circus+0.077
 ours+0.076
Neg logit contributions
ant-0.065
antly-0.064
 efforts-0.063
ively-0.063
ered-0.060
Token all
Token ID477
Feature activation+3.24e+00

Pos logit contributions
rob+0.079
 says+0.074
 hungry+0.072
 circus+0.071
 ours+0.070
Neg logit contributions
ant-0.081
antly-0.081
 efforts-0.080
ively-0.079
ered-0.077
Token had
Token ID550
Feature activation+2.55e-01

Pos logit contributions
ant+0.180
antly+0.179
ively+0.177
 efforts+0.177
ered+0.173
Neg logit contributions
rob-0.143
 says-0.131
 hungry-0.129
 circus-0.127
 ours-0.126
Token turn
Token ID1210
Feature activation+4.64e+00

Pos logit contributions
rob+0.438
 says+0.417
 hungry+0.410
 circus+0.400
 ours+0.394
Neg logit contributions
ant-0.535
antly-0.530
ively-0.525
 efforts-0.520
ered-0.509
Token to
Token ID284
Feature activation-1.36e+00

Pos logit contributions
 efforts+0.251
antly+0.251
ant+0.248
ively+0.239
ered+0.239
Neg logit contributions
rob-0.225
 circus-0.190
 says-0.188
P-0.181
 hungry-0.181
Token sleep
Token ID3993
Feature activation-1.30e+00

Pos logit contributions
rob+0.063
 says+0.060
 hungry+0.058
 circus+0.057
 ours+0.056
Neg logit contributions
ant-0.073
antly-0.072
 efforts-0.071
ively-0.071
ered-0.069
Token in
Token ID287
Feature activation+4.51e-01

Pos logit contributions
rob+0.055
 says+0.051
 hungry+0.049
 circus+0.048
 ours+0.048
Neg logit contributions
ant-0.075
antly-0.075
 efforts-0.073
ively-0.073
ered-0.071
Token the
Token ID262
Feature activation+2.08e+00

Pos logit contributions
ant+0.026
antly+0.026
 efforts+0.026
ively+0.026
ered+0.025
Neg logit contributions
rob-0.019
 says-0.017
 hungry-0.017
 circus-0.016
 ours-0.016
Token big
Token ID1263
Feature activation-3.35e+00

Pos logit contributions
antly+0.112
 efforts+0.106
ively+0.104
ainer+0.104
ant+0.102
Neg logit contributions
rob-0.100
 says-0.090
 ours-0.087
iw-0.083
 circus-0.082
Token crib
Token ID48083
Feature activation-9.68e-01

Pos logit contributions
rob+0.164
 ours+0.144
 says+0.142
P+0.135
iw+0.135
Neg logit contributions
 efforts-0.173
antly-0.172
 fingertips-0.165
ant-0.162
ively-0.161
Token!",
Token ID40754
Feature activation-2.52e+00

Pos logit contributions
rob+0.039
 says+0.036
 hungry+0.035
 circus+0.035
 ours+0.034
Neg logit contributions
ant-0.057
antly-0.057
 efforts-0.057
ively-0.056
ered-0.055
Token while
Token ID981
Feature activation+3.76e+00

Pos logit contributions
rob+0.104
 says+0.101
 hungry+0.095
 circus+0.093
 sleeping+0.091
Neg logit contributions
ant-0.147
antly-0.146
 efforts-0.145
ively-0.144
 fingertips-0.140
Token the
Token ID262
Feature activation+2.12e+00

Pos logit contributions
ant+0.232
antly+0.231
 efforts+0.228
ively+0.228
ered+0.222
Neg logit contributions
rob-0.145
 says-0.133
 hungry-0.128
 circus-0.126
 ours-0.125
Token second
Token ID1218
Feature activation+7.39e+00

Pos logit contributions
ant+0.118
antly+0.117
ively+0.115
 efforts+0.114
 fingertips+0.112
Neg logit contributions
rob-0.094
 hungry-0.091
 says-0.091
 circus-0.088
 sleeping-0.087
Token happy
Token ID3772
Feature activation-2.77e+00

Pos logit contributions
rob+0.066
P+0.055
 circus+0.047
ww+0.045
use+0.045
Neg logit contributions
 efforts-0.114
ered-0.109
 fingertips-0.108
antly-0.108
ant-0.107
Token anymore
Token ID7471
Feature activation+9.49e+00

Pos logit contributions
rob+0.144
 says+0.127
 ours+0.124
 hungry+0.124
 circus+0.123
Neg logit contributions
 efforts-0.139
ant-0.138
antly-0.137
ively-0.137
ered-0.131
Token.
Token ID13
Feature activation+9.53e-01

Pos logit contributions
ant+0.525
antly+0.522
 efforts+0.516
ively+0.514
ered+0.499
Neg logit contributions
rob-0.426
 says-0.398
 hungry-0.389
 circus-0.381
 ours-0.376
Token He
Token ID679
Feature activation-1.03e+00

Pos logit contributions
antly+0.046
ively+0.045
ant+0.044
 Everyone+0.044
 efforts+0.044
Neg logit contributions
rob-0.050
 scar-0.045
 sleeping-0.044
 ours-0.043
 circus-0.043
Token is
Token ID318
Feature activation-5.16e+00

Pos logit contributions
rob+0.040
 ours+0.034
P+0.033
ww+0.033
 hungry+0.028
Neg logit contributions
ered-0.063
antly-0.062
 efforts-0.062
ant-0.059
ively-0.058
Token small
Token ID1402
Feature activation-1.88e+00

Pos logit contributions
rob+0.243
 says+0.204
 circus+0.193
use+0.192
ww+0.188
Neg logit contributions
 efforts-0.289
ively-0.284
ered-0.283
antly-0.283
ant-0.279
Token and
Token ID290
Feature activation-1.71e+00

Pos logit contributions
rob+0.088
 ours+0.075
 says+0.075
P+0.074
 circus+0.073
Neg logit contributions
 efforts-0.105
antly-0.102
ant-0.101
ively-0.100
 fingertips-0.097
Token sad
Token ID6507
Feature activation-6.33e-02

Pos logit contributions
rob+0.069
P+0.054
 circus+0.050
ww+0.045
 says+0.043
Neg logit contributions
 efforts-0.108
 fingertips-0.106
antly-0.104
ered-0.102
 nobody-0.101
Token.
Token ID13
Feature activation+9.57e-01

Pos logit contributions
rob+0.003
 says+0.003
 hungry+0.003
 ours+0.003
 circus+0.003
Neg logit contributions
antly-0.003
ant-0.003
 efforts-0.003
ively-0.003
ered-0.003
Token His
Token ID2399
Feature activation+3.69e+00

Pos logit contributions
antly+0.052
ant+0.051
ively+0.051
 efforts+0.050
 fingertips+0.048
Neg logit contributions
rob-0.045
 says-0.040
 hungry-0.040
 circus-0.039
 ours-0.039
Token hat
Token ID6877
Feature activation-6.31e+00

Pos logit contributions
 efforts+0.247
antly+0.246
ant+0.239
 fingertips+0.234
ered+0.234
Neg logit contributions
rob-0.130
 ours-0.108
 says-0.105
 hungry-0.103
 circus-0.102
Token wanted
Token ID2227
Feature activation-3.79e+00

Pos logit contributions
rob+0.139
 says+0.126
 hungry+0.125
 ours+0.123
 circus+0.123
Neg logit contributions
ant-0.183
antly-0.182
 efforts-0.180
ively-0.179
ered-0.176
Token to
Token ID284
Feature activation-1.44e+00

Pos logit contributions
rob+0.136
 says+0.131
 hungry+0.127
 circus+0.123
 sleeping+0.120
Neg logit contributions
ant-0.242
antly-0.240
ively-0.235
 efforts-0.235
ered-0.230
Token bounce
Token ID20110
Feature activation-3.26e+00

Pos logit contributions
 says+0.061
rob+0.057
 circus+0.056
 hungry+0.055
 ours+0.054
Neg logit contributions
antly-0.086
ant-0.084
ively-0.081
 efforts-0.081
 fingertips-0.078
Token.
Token ID13
Feature activation+2.19e-01

Pos logit contributions
rob+0.163
 circus+0.148
 hungry+0.147
 says+0.146
 ours+0.144
Neg logit contributions
antly-0.161
ively-0.157
ant-0.156
ered-0.154
 efforts-0.152
Token She
Token ID1375
Feature activation-3.18e+00

Pos logit contributions
antly+0.014
ant+0.014
ively+0.013
 efforts+0.013
ered+0.013
Neg logit contributions
rob-0.009
 says-0.008
 hungry-0.008
 circus-0.007
 ours-0.007
Token said
Token ID531
Feature activation-9.28e+00

Pos logit contributions
rob+0.140
 ours+0.128
 circus+0.123
 hungry+0.122
 sleeping+0.119
Neg logit contributions
antly-0.181
ered-0.179
ant-0.179
 efforts-0.178
ively-0.174
Token,
Token ID11
Feature activation+1.04e+00

Pos logit contributions
rob+0.435
 says+0.410
 hungry+0.401
 circus+0.393
 ours+0.388
Neg logit contributions
ant-0.494
antly-0.491
 efforts-0.485
ively-0.483
ered-0.468
Token "
Token ID366
Feature activation-4.44e-01

Pos logit contributions
antly+0.049
ant+0.049
 efforts+0.048
ively+0.047
 fingertips+0.046
Neg logit contributions
rob-0.056
 ours-0.050
 says-0.050
 circus-0.050
 hungry-0.049
TokenYou
Token ID1639
Feature activation-5.77e+00

Pos logit contributions
rob+0.023
 hungry+0.022
 ours+0.022
 sleeping+0.022
 says+0.022
Neg logit contributions
ant-0.021
antly-0.021
ively-0.021
 efforts-0.020
 fingertips-0.019
Token are
Token ID389
Feature activation-3.50e-01

Pos logit contributions
 says+0.218
 hungry+0.201
 ours+0.196
 wants+0.195
 sees+0.194
Neg logit contributions
antly-0.372
ively-0.365
ant-0.362
 efforts-0.354
 fingertips-0.349
Token not
Token ID407
Feature activation+2.99e+00

Pos logit contributions
rob+0.016
 says+0.014
 circus+0.013
 sees+0.012
 hungry+0.012
Neg logit contributions
 efforts-0.020
ant-0.020
ively-0.019
ered-0.019
antly-0.019
Token liked
Token ID8288
Feature activation+1.02e+00

Pos logit contributions
ant+0.186
antly+0.184
ively+0.181
 efforts+0.181
ered+0.177
Neg logit contributions
rob-0.187
 ours-0.172
 says-0.171
 hungry-0.168
 circus-0.167
Token to
Token ID284
Feature activation-1.48e+00

Pos logit contributions
 efforts+0.051
ant+0.048
ively+0.047
 fingertips+0.047
antly+0.047
Neg logit contributions
rob-0.054
 says-0.047
use-0.044
 ours-0.044
ww-0.043
Token play
Token ID711
Feature activation-1.32e+00

Pos logit contributions
 says+0.043
 hungry+0.041
 circus+0.041
rob+0.039
 sees+0.038
Neg logit contributions
antly-0.104
ively-0.104
ant-0.103
 efforts-0.100
bled-0.097
Token in
Token ID287
Feature activation+3.43e-01

Pos logit contributions
rob+0.059
 says+0.056
 ours+0.055
 hungry+0.054
 circus+0.052
Neg logit contributions
ant-0.072
antly-0.071
ively-0.070
 efforts-0.069
ered-0.068
Token the
Token ID262
Feature activation+1.83e+00

Pos logit contributions
ant+0.015
ively+0.015
 efforts+0.015
antly+0.015
ered+0.014
Neg logit contributions
rob-0.019
 says-0.018
 hungry-0.017
 ours-0.017
 circus-0.017
Token park
Token ID3952
Feature activation-7.48e+00

Pos logit contributions
antly+0.094
 bottom+0.093
 Days+0.091
growth+0.090
 Everyone+0.089
Neg logit contributions
rob-0.084
 says-0.072
use-0.070
 yells-0.070
 ours-0.068
Token.
Token ID13
Feature activation+9.09e-01

Pos logit contributions
rob+0.302
 says+0.288
 hungry+0.287
 circus+0.275
 sleeping+0.271
Neg logit contributions
ant-0.444
antly-0.442
 efforts-0.437
ively-0.434
 fingertips-0.423
Token One
Token ID1881
Feature activation-7.31e-01

Pos logit contributions
antly+0.037
 efforts+0.037
ant+0.036
ively+0.036
 fingertips+0.035
Neg logit contributions
rob-0.055
 says-0.049
 hungry-0.049
 ours-0.049
 circus-0.048
Token day
Token ID1110
Feature activation-2.00e+00

Pos logit contributions
rob+0.027
 says+0.026
 hungry+0.025
 circus+0.024
 ours+0.024
Neg logit contributions
ant-0.046
antly-0.046
 efforts-0.045
ively-0.045
ered-0.044
Token,
Token ID11
Feature activation+1.48e+00

Pos logit contributions
rob+0.092
 says+0.088
 hungry+0.086
 circus+0.084
 sleeping+0.082
Neg logit contributions
ant-0.107
antly-0.106
 efforts-0.105
ively-0.104
ered-0.101
Token they
Token ID484
Feature activation-2.25e+00

Pos logit contributions
ant+0.098
antly+0.097
 efforts+0.096
ively+0.096
ered+0.094
Neg logit contributions
rob-0.050
 says-0.047
 hungry-0.047
 circus-0.044
 ours-0.044
Token soft
Token ID2705
Feature activation-2.98e+00

Pos logit contributions
ant+0.102
antly+0.100
ively+0.100
ered+0.099
 efforts+0.098
Neg logit contributions
rob-0.073
 says-0.071
 hungry-0.070
 circus-0.069
 sleeping-0.068
Token pillow
Token ID28774
Feature activation+1.51e-01

Pos logit contributions
 says+0.109
rob+0.108
 sleeping+0.107
 hungry+0.106
 circus+0.105
Neg logit contributions
ant-0.185
ively-0.182
ered-0.179
antly-0.178
 efforts-0.175
Token through
Token ID832
Feature activation+3.70e+00

Pos logit contributions
ant+0.009
antly+0.009
ively+0.009
 efforts+0.009
ered+0.008
Neg logit contributions
rob-0.006
 says-0.006
 hungry-0.006
 sleeping-0.006
 circus-0.005
Token his
Token ID465
Feature activation-3.91e+00

Pos logit contributions
ant+0.232
antly+0.227
 efforts+0.226
ered+0.225
ainer+0.221
Neg logit contributions
 sleeping-0.129
 hungry-0.120
 scary-0.119
 waiting-0.118
 says-0.116
Token joyful
Token ID48977
Feature activation+3.01e+00

Pos logit contributions
 says+0.171
 hungry+0.170
 sleeping+0.167
rob+0.167
 circus+0.164
Neg logit contributions
ant-0.218
ively-0.216
antly-0.210
ered-0.207
 efforts-0.199
Token cl
Token ID537
Feature activation-6.43e+00

Pos logit contributions
ant+0.153
ively+0.151
antly+0.149
ered+0.144
 efforts+0.142
Neg logit contributions
rob-0.142
 says-0.141
 sleeping-0.139
 hungry-0.138
 circus-0.138
Tokenapping
Token ID5912
Feature activation+7.74e+00

Pos logit contributions
rob+0.259
 hungry+0.254
 says+0.253
P+0.251
 sleeping+0.249
Neg logit contributions
ered-0.309
ively-0.307
 resolve-0.302
 summit-0.301
antly-0.300
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
ant+0.483
ively+0.472
antly+0.466
 efforts+0.457
ered+0.456
Neg logit contributions
 says-0.264
 hungry-0.260
rob-0.258
 sleeping-0.257
 circus-0.249
Token<|endoftext|>
Token ID50256
Feature activation-3.23e-01

Pos logit contributions
ant+0.055
 efforts+0.054
antly+0.053
ered+0.053
ively+0.052
Neg logit contributions
rob-0.046
 says-0.046
 hungry-0.043
 circus-0.042
 sleeping-0.041
TokenJohn
Token ID7554
Feature activation+3.52e+00

Pos logit contributions
rob+0.012
 says+0.011
 ours+0.011
 hungry+0.011
 circus+0.011
Neg logit contributions
antly-0.020
ant-0.020
ively-0.020
 efforts-0.019
ered-0.019
Token and
Token ID290
Feature activation-1.50e+00

Pos logit contributions
antly+0.193
ant+0.188
ered+0.183
ively+0.182
 efforts+0.181
Neg logit contributions
rob-0.167
 ours-0.152
 circus-0.147
 hungry-0.143
 says-0.138
Token sheet
Token ID9629
Feature activation-1.17e+00

Pos logit contributions
 efforts+0.080
antly+0.077
ant+0.076
ainer+0.073
ered+0.072
Neg logit contributions
rob-0.069
iw-0.060
 hungry-0.059
 ours-0.058
use-0.057
Token.
Token ID13
Feature activation+8.68e-01

Pos logit contributions
rob+0.053
 says+0.050
 hungry+0.050
 ours+0.048
 circus+0.048
Neg logit contributions
ant-0.063
antly-0.063
 efforts-0.062
ively-0.061
ered-0.059
Token They
Token ID1119
Feature activation-3.48e-01

Pos logit contributions
ant+0.054
 efforts+0.053
antly+0.053
ively+0.052
ered+0.052
Neg logit contributions
rob-0.033
 says-0.033
 hungry-0.031
 circus-0.030
 ours-0.029
Token like
Token ID588
Feature activation-1.08e+00

Pos logit contributions
rob+0.015
 says+0.014
 hungry+0.013
 circus+0.013
 ours+0.013
Neg logit contributions
ant-0.020
antly-0.020
 efforts-0.020
ively-0.020
ered-0.019
Token the
Token ID262
Feature activation+1.45e+00

Pos logit contributions
rob+0.051
 says+0.046
 hungry+0.044
 circus+0.044
 ours+0.043
Neg logit contributions
ant-0.059
 efforts-0.059
antly-0.058
ively-0.057
ered-0.056
Token tent
Token ID11105
Feature activation-1.00e+01

Pos logit contributions
 efforts+0.088
ainer+0.081
antly+0.081
ant+0.079
ered+0.077
Neg logit contributions
rob-0.058
iw-0.052
 ours-0.052
use-0.049
ww-0.048
Token.
Token ID13
Feature activation+8.52e-01

Pos logit contributions
rob+0.488
 says+0.433
 hungry+0.426
 circus+0.425
 ours+0.420
Neg logit contributions
antly-0.521
 efforts-0.520
ant-0.519
ively-0.515
ered-0.501
Token They
Token ID1119
Feature activation-3.88e-01

Pos logit contributions
ant+0.052
 efforts+0.051
antly+0.050
ively+0.050
ered+0.050
Neg logit contributions
rob-0.034
 says-0.033
 hungry-0.032
 circus-0.031
 ours-0.030
Token do
Token ID466
Feature activation+1.89e+00

Pos logit contributions
rob+0.017
 says+0.015
 hungry+0.015
 circus+0.015
 ours+0.014
Neg logit contributions
ant-0.022
 efforts-0.022
antly-0.022
ively-0.022
ered-0.022
Token not
Token ID407
Feature activation+2.95e+00

Pos logit contributions
antly+0.105
ant+0.103
 fingertips+0.102
ively+0.102
 efforts+0.100
Neg logit contributions
 says-0.083
rob-0.082
 hungry-0.081
 circus-0.078
 sleeping-0.078
Token know
Token ID760
Feature activation+5.77e+00

Pos logit contributions
ant+0.176
 efforts+0.174
ively+0.173
ered+0.172
antly+0.167
Neg logit contributions
rob-0.128
 says-0.105
 circus-0.103
 hungry-0.100
 ours-0.099
Token child
Token ID1200
Feature activation+8.19e+00

Pos logit contributions
ant+0.348
antly+0.348
 efforts+0.344
ively+0.333
 fingertips+0.321
Neg logit contributions
rob-0.222
 ours-0.194
 circus-0.181
 says-0.178
 yells-0.172
Token thought
Token ID1807
Feature activation-2.15e-01

Pos logit contributions
antly+0.465
ant+0.465
ively+0.455
 efforts+0.454
ered+0.448
Neg logit contributions
rob-0.362
 ours-0.322
 circus-0.314
 says-0.313
 hungry-0.307
Token it
Token ID340
Feature activation+1.98e+00

Pos logit contributions
 says+0.010
 sleeping+0.009
 hungry+0.009
 scary+0.009
 ours+0.009
Neg logit contributions
antly-0.012
ant-0.012
ively-0.011
 efforts-0.011
 fingertips-0.011
Token would
Token ID561
Feature activation+3.55e-01

Pos logit contributions
ant+0.098
 efforts+0.098
ively+0.096
antly+0.093
 fingertips+0.092
Neg logit contributions
rob-0.100
 says-0.097
 hungry-0.095
 sleeping-0.093
 scary-0.091
Token be
Token ID307
Feature activation+1.51e+00

Pos logit contributions
ant+0.019
antly+0.019
ively+0.018
 efforts+0.018
 fingertips+0.017
Neg logit contributions
 hungry-0.016
 says-0.016
rob-0.016
 scary-0.016
 sleeping-0.016
Token fun
Token ID1257
Feature activation-9.30e+00

Pos logit contributions
 efforts+0.094
ered+0.092
antly+0.092
ively+0.090
ant+0.089
Neg logit contributions
rob-0.062
 says-0.051
 circus-0.048
iw-0.046
ww-0.046
Token to
Token ID284
Feature activation-1.31e+00

Pos logit contributions
rob+0.431
 says+0.380
 circus+0.368
 hungry+0.365
 ours+0.364
Neg logit contributions
 efforts-0.519
ant-0.516
antly-0.511
ively-0.506
ered-0.494
Token send
Token ID3758
Feature activation+7.55e-01

Pos logit contributions
 says+0.045
 circus+0.043
 sleeping+0.042
 hungry+0.042
 sees+0.041
Neg logit contributions
antly-0.086
ant-0.084
ively-0.083
 efforts-0.083
 fingertips-0.080
Token their
Token ID511
Feature activation-3.66e+00

Pos logit contributions
ant+0.040
antly+0.039
 efforts+0.039
ively+0.039
 fingertips+0.037
Neg logit contributions
rob-0.035
 says-0.034
 hungry-0.033
 ours-0.033
 circus-0.032
Token parents
Token ID3397
Feature activation+1.06e+01

Pos logit contributions
rob+0.164
 says+0.140
 hungry+0.137
 circus+0.134
 ours+0.134
Neg logit contributions
 efforts-0.213
ant-0.208
antly-0.207
ively-0.203
 fingertips-0.201
Token on
Token ID319
Feature activation-5.15e+00

Pos logit contributions
ant+0.589
antly+0.587
 efforts+0.580
ively+0.577
ered+0.560
Neg logit contributions
rob-0.467
 says-0.438
 hungry-0.429
 circus-0.420
 ours-0.416