TOP ACTIVATIONS
MAX = 15.439

. She loved to have pic n ics with her family
family were ready to go pic n icking again . But
fun . Lily realized that pic n ics were not so
the warm sunshine and having pic n ics with her family
so excited ! She loved pic n ics and spending time
She loved to go on pic n ics with her family
, ready to have more pic n ics in the future
a family who really enjoyed pic n ics . They loved
be there to support her pic n ics -- no matter
roof . They would have pic n ics and play games
favourite place to go for pic n ics . <|endoftext|> Once
their salad and have happy pic n ics together . <|endoftext|>
children running around , families pic n icking , couples st
to be careful when having pic n ics and not to
They decided to make more pic n ics and looked in
in the garden and have pic n ics together .
went on adventures , had pic n ics , and even
girl named Lily who loved pic n ics . She would
near the cliff and have pic n ics . One day
and it was going on pic n ics !

MIN ACTIVATIONS
MIN = -20.319

One day Ch ant elle 's mom put a
that said " rest aur ant ". They went inside and
reached for the cro iss ant . He rubbed it on
the word " ign or ant " and the number "
was a girl named Ch ant elle . She was three
colourful and it made Ch ant elle feel special .
". Ch ant elle was very happy .
free so he could repl ant his field . Billy was
bite of the cro iss ant . It was a little
special . Ch ant elle went outside to play
were ignorant . Ign or ant meant they did not know
That 's so special Ch ant elle . I 'm so
says , " Ign or ant , zero ". The teacher
and said , " Pl ant this and watch it grow
They ate the cro iss ant . They were happy .
. Ch ant elle stopped in the shade
it down . Ign or ant children like you don 't
It is a cro iss ant . We can share it
always best . Ign or ant people will always be present
It was a cro iss ant . It was a little
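
The four INTERVAL ranges reported in this dump appear to be equal-width bins spanning the MIN and MAX activations. The sketch below checks that reading using only the numbers printed here. The function name `interval_edges` and the four-bin assumption are illustrative, not taken from the tool that produced the dump.

```python
# Minimal sketch: rebuild the interval boundaries from the reported extremes,
# assuming four equal-width bins (an assumption inferred from the printed ranges).

def interval_edges(lo, hi, n_bins=4):
    """Return the n_bins + 1 boundaries of equal-width bins over [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# Extremes as printed under MIN ACTIVATIONS and TOP ACTIVATIONS.
edges = interval_edges(-20.319, 15.439)

# Shares printed under each CONTAINS line, ordered from lowest to highest interval.
shares = [0.213, 19.605, 75.605, 4.578]

for (lo, hi), share in zip(zip(edges, edges[1:]), shares):
    print(f"{lo:8.3f} to {hi:8.3f}: {share:6.3f}% of activations")
```

The recomputed boundaries match the printed ones to within the three-decimal rounding, and the four shares sum to roughly 100%. Both are consistent with a simple equal-width histogram over the activation range.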

INTERVAL 6.499 - 15.439
CONTAINS 4.578%

happy . Mom picks up Tom and puts him
" You are very good at giving mass ages . You
of the rides at the park were blue and they were
. Suddenly , the mouse sc ur ried over to the
The farmer drives and drives . He shows them the

INTERVAL -2.440 - 6.499
CONTAINS 75.605%

who lost her gem in a deep well . " Maybe
it 's time for Tom to learn about science !"
help him . She was very kind , and gave John
It was a deep , booming voice . The voice told
their boats . They have a race . They say :

INTERVAL -11.379 - -2.440
CONTAINS 19.605%

Tom had learned a valuable lesson from his
was a little girl called Lucy who loved to hug .
. After a while , Sarah 's face turned
the park when she spotted something . It was a crocod
's back and they rode through the fields . The pig

INTERVAL -20.319 - -11.379
CONTAINS 0.213%

. He kept trying and eventually was flexible again . He
to the market with his mum . He saw lots of
and saw a pe b ble . He thought it looked
new p aj amas fit perfectly . " I love my
<|endoftext|> One day , the mum saw a pie at the
Token.
Token ID13
Feature activation-7.44e-01

Pos logit contributions
rob+0.023
 says+0.021
 ours+0.020
 circus+0.020
 scar+0.019
Neg logit contributions
ively-0.027
antly-0.027
ant-0.026
 efforts-0.026
 fingertips-0.026
Token She
Token ID1375
Feature activation+2.41e+00

Pos logit contributions
ively+0.047
antly+0.046
ant+0.046
 fingertips+0.045
 efforts+0.045
Neg logit contributions
rob-0.030
 says-0.026
 ours-0.025
 circus-0.025
 scar-0.024
Token loved
Token ID6151
Feature activation-3.76e+00

Pos logit contributions
rob+0.142
 ours+0.130
 says+0.129
 circus+0.125
 scar+0.123
Neg logit contributions
antly-0.103
ively-0.102
ant-0.100
ered-0.097
 fingertips-0.097
Token to
Token ID284
Feature activation+1.57e+00

Pos logit contributions
 efforts+0.128
 fingertips+0.121
 everybody+0.116
 mummy+0.113
 Everyone+0.111
Neg logit contributions
rob-0.239
 says-0.211
use-0.210
 ours-0.203
 yells-0.199
Token have
Token ID423
Feature activation-2.05e+00

Pos logit contributions
rob+0.073
 says+0.064
 ours+0.062
 circus+0.060
 scar+0.059
Neg logit contributions
ively-0.087
antly-0.086
ant-0.086
 fingertips-0.086
 efforts-0.085
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
 efforts+0.111
 Everyone+0.109
growth+0.109
 Everybody+0.107
 Days+0.106
Neg logit contributions
rob-0.076
 says-0.073
 wants-0.072
 yells-0.072
 blows-0.072
Tokenn
Token ID77
Feature activation-4.41e+00

Pos logit contributions
rob+0.893
 circus+0.797
 says+0.787
 ours+0.786
 fun+0.768
Neg logit contributions
antly-0.700
ively-0.686
 fingertips-0.664
 efforts-0.625
ant-0.618
Tokenics
Token ID873
Feature activation-7.32e-01

Pos logit contributions
ively+0.217
antly+0.216
ant+0.210
 fingertips+0.209
 efforts+0.208
Neg logit contributions
rob-0.233
 says-0.211
 ours-0.202
 circus-0.201
 hungry-0.196
Token with
Token ID351
Feature activation+3.34e-01

Pos logit contributions
antly+0.029
ively+0.029
ant+0.028
mates+0.028
ered+0.027
Neg logit contributions
rob-0.044
 says-0.041
 yells-0.040
 ours-0.039
ww-0.039
Token her
Token ID607
Feature activation+4.40e+00

Pos logit contributions
rob+0.018
 says+0.015
ww+0.015
 yells+0.015
 ours+0.014
Neg logit contributions
ively-0.015
 fingertips-0.015
 efforts-0.015
antly-0.015
ered-0.015
Token family
Token ID1641
Feature activation-4.62e+00

Pos logit contributions
rob+0.305
 yells+0.264
 says+0.264
 ours+0.261
ww+0.261
Neg logit contributions
 efforts-0.144
 fingertips-0.138
antly-0.138
mates-0.126
 mummy-0.123
Token family
Token ID1641
Feature activation-4.60e+00

Pos logit contributions
rob+0.250
 says+0.236
 circus+0.226
 scar+0.223
 ours+0.222
Neg logit contributions
ively-0.202
antly-0.196
ant-0.194
ered-0.185
 fingertips-0.185
Token were
Token ID547
Feature activation-4.83e+00

Pos logit contributions
ively+0.253
antly+0.249
ant+0.248
 fingertips+0.242
 efforts+0.241
Neg logit contributions
rob-0.215
 says-0.201
 circus-0.186
 hungry-0.184
 ours-0.183
Token ready
Token ID3492
Feature activation+8.72e+00

Pos logit contributions
ively+0.297
antly+0.294
ant+0.289
 efforts+0.285
 fingertips+0.284
Neg logit contributions
rob-0.198
 says-0.175
 ours-0.164
 circus-0.164
 scar-0.159
Token to
Token ID284
Feature activation+1.57e+00

Pos logit contributions
rob+0.394
 says+0.337
 circus+0.323
 ours+0.321
 scar+0.309
Neg logit contributions
ively-0.510
antly-0.509
 efforts-0.497
ant-0.495
 fingertips-0.493
Token go
Token ID467
Feature activation+2.25e+00

Pos logit contributions
 says+0.062
rob+0.061
 circus+0.058
 ours+0.056
 hungry+0.056
Neg logit contributions
ively-0.094
ant-0.093
antly-0.093
 efforts-0.090
 fingertips-0.089
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
rob+0.120
 ours+0.110
ww+0.108
 says+0.107
 wra+0.106
Neg logit contributions
antly-0.105
ively-0.101
ered-0.100
 efforts-0.094
 fingertips-0.091
Tokenn
Token ID77
Feature activation-4.41e+00

Pos logit contributions
rob+0.887
 circus+0.800
 says+0.790
 ours+0.780
 sleeping+0.767
Neg logit contributions
 fingertips-0.670
ively-0.664
antly-0.661
ant-0.641
 efforts-0.622
Tokenicking
Token ID7958
Feature activation-1.35e+01

Pos logit contributions
ively+0.216
antly+0.214
ant+0.211
 fingertips+0.207
 efforts+0.206
Neg logit contributions
rob-0.233
 says-0.214
 ours-0.203
 circus-0.202
 hungry-0.197
Token again
Token ID757
Feature activation-6.06e+00

Pos logit contributions
ively+0.699
antly+0.690
ant+0.668
 efforts+0.658
ered+0.658
Neg logit contributions
rob-0.694
 says-0.609
 circus-0.597
 ours-0.591
 scar-0.576
Token.
Token ID13
Feature activation-7.48e-01

Pos logit contributions
antly+0.300
ively+0.297
ant+0.289
 efforts+0.288
 fingertips+0.284
Neg logit contributions
rob-0.326
 says-0.286
 ours-0.280
 circus-0.276
ww-0.266
Token But
Token ID887
Feature activation-4.88e+00

Pos logit contributions
ant+0.043
ively+0.042
antly+0.042
ered+0.041
 efforts+0.041
Neg logit contributions
 says-0.032
rob-0.031
 ours-0.030
 circus-0.029
 hungry-0.028
Token fun
Token ID1257
Feature activation+9.46e+00

Pos logit contributions
 efforts+0.080
ively+0.077
 Everyone+0.076
antly+0.076
ered+0.075
Neg logit contributions
rob-0.034
 ours-0.026
 scar-0.023
bat-0.023
use-0.022
Token.
Token ID13
Feature activation-7.45e-01

Pos logit contributions
rob+0.473
 says+0.429
 ours+0.412
 circus+0.407
 scar+0.402
Neg logit contributions
ively-0.494
antly-0.486
ant-0.482
 efforts-0.473
 fingertips-0.469
Token Lily
Token ID20037
Feature activation+4.72e-01

Pos logit contributions
ively+0.049
ant+0.049
antly+0.048
ered+0.048
 efforts+0.047
Neg logit contributions
 says-0.026
rob-0.026
 hungry-0.023
 circus-0.023
 ours-0.023
Token realized
Token ID6939
Feature activation-5.80e+00

Pos logit contributions
rob+0.025
 says+0.022
 ours+0.022
 circus+0.022
 scar+0.021
Neg logit contributions
antly-0.023
ively-0.023
ant-0.022
 efforts-0.022
 fingertips-0.022
Token that
Token ID326
Feature activation-5.06e+00

Pos logit contributions
ively+0.248
ant+0.239
antly+0.239
 efforts+0.233
ered+0.232
Neg logit contributions
rob-0.347
 says-0.314
 ours-0.307
 circus-0.305
 scar-0.295
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
ively+0.240
 efforts+0.236
ant+0.232
 fingertips+0.228
antly+0.226
Neg logit contributions
rob-0.274
 ours-0.231
 says-0.229
 circus-0.227
ww-0.223
Tokenn
Token ID77
Feature activation-4.43e+00

Pos logit contributions
rob+0.874
 says+0.797
 ours+0.752
 circus+0.744
ww+0.741
Neg logit contributions
 fingertips-0.709
antly-0.703
ively-0.675
 efforts-0.657
 Days-0.641
Tokenics
Token ID873
Feature activation-7.45e-01

Pos logit contributions
 fingertips+0.229
 efforts+0.218
antly+0.218
ively+0.209
ant+0.208
Neg logit contributions
rob-0.223
 says-0.214
 ours-0.198
 wants-0.197
 blows-0.195
Token were
Token ID547
Feature activation-4.84e+00

Pos logit contributions
ively+0.037
antly+0.036
ant+0.036
 fingertips+0.035
 efforts+0.035
Neg logit contributions
rob-0.038
 says-0.036
 ours-0.034
 circus-0.034
 hungry-0.033
Token not
Token ID407
Feature activation-3.10e+00

Pos logit contributions
ively+0.275
antly+0.271
ant+0.268
 efforts+0.266
 fingertips+0.261
Neg logit contributions
rob-0.223
 says-0.197
 circus-0.185
 ours-0.183
 scar-0.179
Token so
Token ID523
Feature activation-1.20e-01

Pos logit contributions
ively+0.165
 efforts+0.164
ered+0.156
ant+0.153
antly+0.151
Neg logit contributions
rob-0.162
 says-0.133
 circus-0.124
P-0.121
 yells-0.119
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
ively+0.045
antly+0.044
ant+0.044
 efforts+0.043
 fingertips+0.043
Neg logit contributions
rob-0.050
 says-0.045
 ours-0.043
 circus-0.043
 scar-0.042
Token warm
Token ID5814
Feature activation+2.23e+00

Pos logit contributions
 bottom+0.093
antly+0.091
 Days+0.087
 Everybody+0.086
 top+0.085
Neg logit contributions
rob-0.102
 says-0.092
 yells-0.090
 ours-0.088
use-0.085
Token sunshine
Token ID34488
Feature activation-1.17e+00

Pos logit contributions
rob+0.105
 says+0.094
 ours+0.092
 circus+0.087
 scar+0.086
Neg logit contributions
ively-0.122
antly-0.122
ant-0.117
 efforts-0.117
 fingertips-0.117
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
ively+0.063
antly+0.062
ant+0.061
 efforts+0.060
 fingertips+0.060
Neg logit contributions
rob-0.056
 says-0.050
 ours-0.049
 circus-0.048
 scar-0.047
Token having
Token ID1719
Feature activation-1.59e+00

Pos logit contributions
rob+0.083
 ours+0.072
 says+0.071
P+0.069
ww+0.069
Neg logit contributions
ively-0.106
antly-0.105
 fingertips-0.103
ant-0.103
 efforts-0.102
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
 efforts+0.114
ively+0.114
antly+0.113
 fingertips+0.113
ered+0.111
Neg logit contributions
rob-0.042
 says-0.035
 ours-0.033
 blows-0.030
ww-0.029
Tokenn
Token ID77
Feature activation-4.48e+00

Pos logit contributions
rob+0.923
 circus+0.806
 says+0.786
 ours+0.776
 fun+0.769
Neg logit contributions
antly-0.668
ively-0.656
 fingertips-0.642
 efforts-0.610
ant-0.605
Tokenics
Token ID873
Feature activation-7.84e-01

Pos logit contributions
antly+0.237
 fingertips+0.237
ively+0.234
 efforts+0.231
ant+0.225
Neg logit contributions
rob-0.216
 says-0.196
 ours-0.182
 hungry-0.177
 sleeping-0.176
Token with
Token ID351
Feature activation+2.68e-01

Pos logit contributions
ively+0.040
antly+0.040
ant+0.039
 efforts+0.039
 fingertips+0.038
Neg logit contributions
rob-0.039
 says-0.036
 ours-0.034
 circus-0.034
 scar-0.033
Token her
Token ID607
Feature activation+4.35e+00

Pos logit contributions
rob+0.009
 says+0.008
 hungry+0.008
 scar+0.007
 circus+0.007
Neg logit contributions
antly-0.018
ively-0.018
ant-0.018
 efforts-0.017
ered-0.017
Token family
Token ID1641
Feature activation-4.66e+00

Pos logit contributions
rob+0.221
 says+0.203
 circus+0.194
 ours+0.193
 scar+0.190
Neg logit contributions
ively-0.225
antly-0.221
ant-0.218
 fingertips-0.211
 efforts-0.211
Token so
Token ID523
Feature activation-1.70e-01

Pos logit contributions
ively+0.214
antly+0.210
ant+0.207
 efforts+0.204
 fingertips+0.203
Neg logit contributions
rob-0.189
 says-0.168
 circus-0.161
 ours-0.160
 scar-0.156
Token excited
Token ID6568
Feature activation+1.73e+00

Pos logit contributions
ively+0.007
ered+0.007
 efforts+0.007
 fingertips+0.007
ant+0.007
Neg logit contributions
rob-0.011
 circus-0.009
 scar-0.009
use-0.009
P-0.008
Token!
Token ID0
Feature activation+1.01e-01

Pos logit contributions
rob+0.107
 circus+0.090
P+0.090
 scar+0.089
 ours+0.089
Neg logit contributions
ively-0.072
 efforts-0.070
ant-0.069
ered-0.067
antly-0.066
Token She
Token ID1375
Feature activation+2.32e+00

Pos logit contributions
rob+0.005
 says+0.004
 ours+0.004
 circus+0.004
 scar+0.004
Neg logit contributions
ively-0.006
antly-0.006
ant-0.006
 fingertips-0.005
 efforts-0.005
Token loved
Token ID6151
Feature activation-3.84e+00

Pos logit contributions
rob+0.122
 ours+0.101
 circus+0.099
 says+0.096
ww+0.093
Neg logit contributions
antly-0.120
ered-0.119
 fingertips-0.117
 efforts-0.117
ively-0.117
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
 efforts+0.168
 everybody+0.160
zed+0.158
ered+0.156
 fingertips+0.156
Neg logit contributions
rob-0.202
use-0.187
P-0.175
arl-0.173
oky-0.167
Tokenn
Token ID77
Feature activation-4.49e+00

Pos logit contributions
rob+0.869
 says+0.741
 circus+0.735
ww+0.710
 ours+0.708
Neg logit contributions
antly-0.708
ively-0.697
 fingertips-0.682
 efforts-0.673
ant-0.633
Tokenics
Token ID873
Feature activation-7.93e-01

Pos logit contributions
antly+0.225
ively+0.225
 fingertips+0.218
ant+0.217
 efforts+0.217
Neg logit contributions
rob-0.232
 says-0.211
 ours-0.202
 circus-0.197
 hungry-0.195
Token and
Token ID290
Feature activation+1.84e+00

Pos logit contributions
ively+0.044
antly+0.044
ant+0.043
 efforts+0.043
 fingertips+0.042
Neg logit contributions
rob-0.036
 says-0.032
 ours-0.031
 circus-0.030
 scar-0.030
Token spending
Token ID4581
Feature activation+5.28e+00

Pos logit contributions
rob+0.089
 says+0.073
P+0.072
 ours+0.072
 circus+0.071
Neg logit contributions
 fingertips-0.098
ered-0.098
ively-0.097
antly-0.097
 efforts-0.096
Token time
Token ID640
Feature activation+8.87e-01

Pos logit contributions
rob+0.284
 says+0.269
 ours+0.259
 circus+0.247
 hungry+0.244
Neg logit contributions
ively-0.238
antly-0.238
ered-0.235
 efforts-0.233
ant-0.228
Token She
Token ID1375
Feature activation+2.32e+00

Pos logit contributions
ively+0.050
antly+0.049
ant+0.049
 fingertips+0.048
 efforts+0.048
Neg logit contributions
rob-0.034
 says-0.029
 ours-0.028
 circus-0.028
 scar-0.027
Token loved
Token ID6151
Feature activation-3.82e+00

Pos logit contributions
rob+0.145
 ours+0.142
 says+0.132
 scar+0.130
 circus+0.130
Neg logit contributions
antly-0.087
ered-0.080
ively-0.080
ant-0.080
 fingertips-0.078
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
 efforts+0.124
 fingertips+0.117
 everybody+0.111
 Everyone+0.108
 mummy+0.107
Neg logit contributions
rob-0.250
use-0.218
 says-0.217
 ours-0.211
 yells-0.208
Token go
Token ID467
Feature activation+2.17e+00

Pos logit contributions
rob+0.068
 says+0.061
 ours+0.058
 circus+0.057
 scar+0.056
Neg logit contributions
ively-0.084
antly-0.083
ant-0.082
 fingertips-0.081
 efforts-0.081
Token on
Token ID319
Feature activation+5.19e+00

Pos logit contributions
rob+0.120
 wra+0.112
 ours+0.112
ww+0.109
 fru+0.108
Neg logit contributions
ered-0.087
antly-0.083
ively-0.080
 Everyone-0.078
 Everybody-0.075
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
rob+0.241
 says+0.231
iw+0.218
 ours+0.217
 wants+0.217
Neg logit contributions
 efforts-0.253
ered-0.250
growth-0.249
ively-0.245
 fingertips-0.242
Tokenn
Token ID77
Feature activation-4.47e+00

Pos logit contributions
rob+0.923
 circus+0.851
 says+0.847
 ours+0.829
 sleeping+0.807
Neg logit contributions
antly-0.633
ively-0.631
 fingertips-0.607
 efforts-0.592
ant-0.587
Tokenics
Token ID873
Feature activation-7.84e-01

Pos logit contributions
ively+0.210
antly+0.206
ant+0.204
 efforts+0.198
 fingertips+0.198
Neg logit contributions
rob-0.245
 says-0.225
 ours-0.216
 circus-0.215
 scar-0.211
Token with
Token ID351
Feature activation+2.70e-01

Pos logit contributions
mates+0.021
ively+0.019
ered+0.019
antly+0.019
ant+0.019
Neg logit contributions
rob-0.054
 says-0.051
 yells-0.051
ww-0.051
 ours-0.050
Token her
Token ID607
Feature activation+4.35e+00

Pos logit contributions
rob+0.014
 says+0.012
 ours+0.011
 circus+0.011
 yells+0.011
Neg logit contributions
ively-0.013
 fingertips-0.013
 efforts-0.013
antly-0.013
ant-0.013
Token family
Token ID1641
Feature activation-4.66e+00

Pos logit contributions
rob+0.321
 says+0.282
 ours+0.280
 yells+0.279
 circus+0.275
Neg logit contributions
 efforts-0.126
 fingertips-0.121
antly-0.116
mates-0.106
 mummy-0.105
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
ively+0.070
ant+0.069
antly+0.069
 fingertips+0.067
ered+0.066
Neg logit contributions
rob-0.045
 says-0.043
 hungry-0.043
 sleeping-0.041
 waiting-0.040
Token ready
Token ID3492
Feature activation+8.64e+00

Pos logit contributions
 efforts+0.076
 fingertips+0.073
 Days+0.071
 surely+0.070
ively+0.070
Neg logit contributions
rob-0.086
ww-0.074
P-0.072
 says-0.071
 circus-0.067
Token to
Token ID284
Feature activation+1.51e+00

Pos logit contributions
rob+0.340
 says+0.299
 ours+0.281
 circus+0.280
 scar+0.271
Neg logit contributions
ively-0.546
antly-0.540
ant-0.534
 efforts-0.525
 fingertips-0.524
Token have
Token ID423
Feature activation-2.11e+00

Pos logit contributions
rob+0.068
 says+0.063
 circus+0.060
 ours+0.059
 hungry+0.058
Neg logit contributions
ively-0.086
antly-0.085
ant-0.084
 efforts-0.081
 fingertips-0.081
Token more
Token ID517
Feature activation-3.68e+00

Pos logit contributions
antly+0.114
ively+0.113
 efforts+0.111
ant+0.109
 fingertips+0.109
Neg logit contributions
rob-0.099
 says-0.089
 ours-0.085
 circus-0.084
 wants-0.081
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
 efforts+0.242
 fingertips+0.213
ered+0.210
kered+0.204
antly+0.204
Neg logit contributions
rob-0.145
ww-0.102
use-0.101
 ours-0.097
P-0.097
Tokenn
Token ID77
Feature activation-4.47e+00

Pos logit contributions
rob+0.919
 says+0.824
 circus+0.821
 ours+0.815
ww+0.812
Neg logit contributions
antly-0.652
ively-0.642
 fingertips-0.630
ant-0.608
 efforts-0.570
Tokenics
Token ID873
Feature activation-7.85e-01

Pos logit contributions
antly+0.224
ively+0.223
 fingertips+0.218
ant+0.217
 efforts+0.214
Neg logit contributions
rob-0.231
 says-0.211
 ours-0.202
 circus-0.196
 hungry-0.194
Token in
Token ID287
Feature activation-9.37e-01

Pos logit contributions
ively+0.040
antly+0.039
 efforts+0.038
ant+0.038
 fingertips+0.037
Neg logit contributions
rob-0.040
 says-0.036
 ours-0.035
 circus-0.034
 scar-0.034
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
ively+0.041
antly+0.040
 efforts+0.039
 fingertips+0.038
ered+0.038
Neg logit contributions
rob-0.055
 says-0.049
ww-0.047
 circus-0.047
 ours-0.046
Token future
Token ID2003
Feature activation+3.37e+00

Pos logit contributions
 bottom+0.110
antly+0.109
 Days+0.108
 summit+0.101
 surely+0.101
Neg logit contributions
rob-0.090
 yells-0.074
 says-0.074
 ours-0.072
 blows-0.069
Token a
Token ID257
Feature activation-1.35e+00

Pos logit contributions
ively+0.189
antly+0.186
ant+0.183
 efforts+0.179
 fingertips+0.178
Neg logit contributions
rob-0.211
 says-0.193
 ours-0.183
 circus-0.183
 scar-0.179
Token family
Token ID1641
Feature activation-4.67e+00

Pos logit contributions
antly+0.048
 efforts+0.048
 fingertips+0.047
ively+0.046
ant+0.046
Neg logit contributions
rob-0.088
 ours-0.077
 says-0.077
 yells-0.074
use-0.070
Token who
Token ID508
Feature activation-3.89e+00

Pos logit contributions
ively+0.221
antly+0.212
ant+0.212
 efforts+0.206
 fingertips+0.203
Neg logit contributions
rob-0.260
 says-0.232
 ours-0.227
 scar-0.224
 circus-0.221
Token really
Token ID1107
Feature activation-2.52e+00

Pos logit contributions
ively+0.183
antly+0.181
ant+0.177
 efforts+0.173
 fingertips+0.172
Neg logit contributions
rob-0.214
 says-0.193
 ours-0.188
 circus-0.186
 scar-0.183
Token enjoyed
Token ID8359
Feature activation-4.05e+00

Pos logit contributions
ively+0.143
antly+0.141
ant+0.139
 efforts+0.136
 fingertips+0.135
Neg logit contributions
rob-0.115
 says-0.105
 circus-0.099
 ours-0.098
 hungry-0.097
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
 efforts+0.199
 fingertips+0.191
ively+0.188
ered+0.185
 itself+0.184
Neg logit contributions
rob-0.198
 says-0.185
use-0.180
ww-0.177
 ours-0.168
Tokenn
Token ID77
Feature activation-4.48e+00

Pos logit contributions
rob+0.910
 circus+0.788
 says+0.784
 ours+0.764
ww+0.750
Neg logit contributions
antly-0.686
ively-0.656
 fingertips-0.654
 efforts-0.628
ant-0.607
Tokenics
Token ID873
Feature activation-7.80e-01

Pos logit contributions
antly+0.248
 fingertips+0.244
 efforts+0.240
ively+0.238
ant+0.232
Neg logit contributions
rob-0.208
 says-0.186
 ours-0.175
 sleeping-0.171
 hungry-0.167
Token.
Token ID13
Feature activation-8.04e-01

Pos logit contributions
ively+0.033
antly+0.033
ant+0.032
 efforts+0.031
mates+0.031
Neg logit contributions
rob-0.045
 says-0.041
 ours-0.039
ww-0.039
 circus-0.038
Token They
Token ID1119
Feature activation+4.00e-01

Pos logit contributions
ively+0.042
antly+0.042
ant+0.041
 fingertips+0.040
 efforts+0.040
Neg logit contributions
rob-0.040
 says-0.036
 ours-0.035
 circus-0.034
 scar-0.034
Token loved
Token ID6151
Feature activation-3.83e+00

Pos logit contributions
rob+0.023
 says+0.021
 ours+0.020
 circus+0.020
 scar+0.020
Neg logit contributions
ively-0.017
antly-0.017
ant-0.017
ered-0.017
 efforts-0.016
Token be
Token ID307
Feature activation-1.55e+00

Pos logit contributions
ively+0.107
antly+0.105
ant+0.103
 efforts+0.101
 fingertips+0.101
Neg logit contributions
rob-0.139
 says-0.128
 ours-0.123
 circus-0.122
 scar-0.120
Token there
Token ID612
Feature activation+4.39e+00

Pos logit contributions
ively+0.080
antly+0.079
ant+0.077
 efforts+0.076
 fingertips+0.076
Neg logit contributions
rob-0.080
 says-0.072
 circus-0.069
 ours-0.068
 scar-0.066
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
rob+0.176
 says+0.161
 ours+0.154
 circus+0.151
 hungry+0.148
Neg logit contributions
ively-0.267
antly-0.262
ant-0.261
 fingertips-0.256
 efforts-0.253
Token support
Token ID1104
Feature activation-9.15e+00

Pos logit contributions
rob+0.072
 says+0.068
 circus+0.068
 ours+0.065
 scar+0.065
Neg logit contributions
antly-0.076
ively-0.074
ant-0.074
 fingertips-0.073
 efforts-0.071
Token her
Token ID607
Feature activation+4.32e+00

Pos logit contributions
ively+0.553
antly+0.551
ant+0.541
 efforts+0.535
 fingertips+0.534
Neg logit contributions
rob-0.362
 says-0.336
 ours-0.318
 circus-0.308
 scar-0.305
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
rob+0.181
 says+0.179
 hungry+0.172
 scar+0.170
 sleeping+0.169
Neg logit contributions
ively-0.246
antly-0.238
ant-0.237
ered-0.228
 fingertips-0.228
Tokenn
Token ID77
Feature activation-4.47e+00

Pos logit contributions
rob+0.792
 says+0.723
 circus+0.723
 ours+0.719
 fun+0.714
Neg logit contributions
 fingertips-0.718
antly-0.686
ively-0.679
 Days-0.657
ainer-0.644
Tokenics
Token ID873
Feature activation-7.95e-01

Pos logit contributions
antly+0.227
ively+0.225
 fingertips+0.221
ant+0.220
 efforts+0.217
Neg logit contributions
rob-0.221
 says-0.206
 ours-0.197
 hungry-0.189
 circus-0.189
Token--
Token ID438
Feature activation-1.00e+00

Pos logit contributions
ively+0.043
antly+0.042
ant+0.042
 fingertips+0.041
 efforts+0.041
Neg logit contributions
rob-0.037
 says-0.034
 ours-0.033
 circus-0.032
 hungry-0.032
Tokenno
Token ID3919
Feature activation+5.06e-01

Pos logit contributions
ively+0.051
ant+0.050
antly+0.049
 efforts+0.047
ered+0.047
Neg logit contributions
rob-0.052
 says-0.048
 circus-0.045
 ours-0.045
 scar-0.044
Token matter
Token ID2300
Feature activation-3.56e+00

Pos logit contributions
rob+0.026
 says+0.024
ww+0.022
 circus+0.022
 hungry+0.022
Neg logit contributions
ively-0.025
antly-0.025
 fingertips-0.025
ant-0.024
ainer-0.024
Token roof
Token ID9753
Feature activation-5.13e+00

Pos logit contributions
 efforts+0.113
ively+0.112
antly+0.110
 fingertips+0.109
ered+0.109
Neg logit contributions
rob-0.086
 says-0.072
 scar-0.068
ww-0.066
 ours-0.065
Token.
Token ID13
Feature activation-8.07e-01

Pos logit contributions
ively+0.305
ant+0.305
antly+0.304
 fingertips+0.300
ered+0.293
Neg logit contributions
rob-0.216
 says-0.197
 circus-0.184
 hungry-0.183
 waiting-0.182
Token They
Token ID1119
Feature activation+3.96e-01

Pos logit contributions
ered+0.060
antly+0.056
bled+0.056
ant+0.056
 fingertips+0.055
Neg logit contributions
 says-0.024
rob-0.022
 wants-0.021
 circus-0.020
 pic-0.020
Token would
Token ID561
Feature activation-1.19e-01

Pos logit contributions
rob+0.018
 says+0.017
 scar+0.016
 circus+0.016
 hungry+0.016
Neg logit contributions
ively-0.022
antly-0.022
ant-0.021
ainer-0.021
 fingertips-0.021
Token have
Token ID423
Feature activation-2.11e+00

Pos logit contributions
ively+0.006
antly+0.006
ant+0.006
 efforts+0.006
 fingertips+0.006
Neg logit contributions
rob-0.006
 says-0.006
 circus-0.005
 ours-0.005
 scar-0.005
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
ively+0.113
antly+0.112
ant+0.110
 efforts+0.109
 fingertips+0.108
Neg logit contributions
rob-0.102
 says-0.092
 ours-0.088
 circus-0.087
 scar-0.085
Tokenn
Token ID77
Feature activation-4.48e+00

Pos logit contributions
rob+0.806
 fun+0.717
 ours+0.701
 circus+0.700
 says+0.700
Neg logit contributions
ively-0.729
antly-0.728
 fingertips-0.710
ant-0.670
 Days-0.664
Tokenics
Token ID873
Feature activation-7.81e-01

Pos logit contributions
ively+0.224
antly+0.223
ant+0.217
 fingertips+0.217
 efforts+0.213
Neg logit contributions
rob-0.233
 says-0.212
 ours-0.202
 circus-0.200
 hungry-0.196
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
ively+0.048
antly+0.047
ant+0.047
 fingertips+0.046
 efforts+0.046
Neg logit contributions
rob-0.032
 says-0.028
 circus-0.027
 ours-0.026
 scar-0.026
Token play
Token ID711
Feature activation+9.01e-01

Pos logit contributions
rob+0.086
 says+0.078
 ours+0.074
 circus+0.074
 scar+0.072
Neg logit contributions
ively-0.105
antly-0.104
ant-0.102
 efforts-0.100
 fingertips-0.100
Token games
Token ID1830
Feature activation+1.04e+00

Pos logit contributions
rob+0.036
 says+0.031
 circus+0.031
 ours+0.029
 hungry+0.029
Neg logit contributions
ively-0.057
antly-0.056
ant-0.056
 fingertips-0.055
 efforts-0.055
Token favourite
Token ID12507
Feature activation-6.49e+00

Pos logit contributions
rob+0.176
 says+0.171
 circus+0.167
 ours+0.163
 hungry+0.161
Neg logit contributions
ively-0.245
ant-0.240
antly-0.240
ered-0.232
 fingertips-0.227
Token place
Token ID1295
Feature activation-1.81e+00

Pos logit contributions
ively+0.379
ant+0.370
antly+0.370
 fingertips+0.358
ered+0.356
Neg logit contributions
rob-0.283
 says-0.258
 circus-0.253
 ours-0.242
 scar-0.240
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
ively+0.105
ant+0.103
antly+0.103
 fingertips+0.099
 efforts+0.099
Neg logit contributions
rob-0.078
 says-0.073
 ours-0.069
 circus-0.068
 hungry-0.066
Token go
Token ID467
Feature activation+2.18e+00

Pos logit contributions
rob+0.069
 says+0.065
 circus+0.064
 ours+0.061
 scar+0.060
Neg logit contributions
ively-0.082
antly-0.081
ant-0.080
 efforts-0.078
 fingertips-0.078
Token for
Token ID329
Feature activation+5.47e+00

Pos logit contributions
rob+0.122
 says+0.110
 ours+0.106
 circus+0.105
ww+0.104
Neg logit contributions
antly-0.103
ively-0.102
ered-0.096
ant-0.094
 efforts-0.093
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
rob+0.249
 says+0.226
 circus+0.220
 ours+0.215
 scar+0.213
Neg logit contributions
ively-0.309
ant-0.308
antly-0.307
 fingertips-0.295
 efforts-0.290
Tokenn
Token ID77
Feature activation-4.48e+00

Pos logit contributions
rob+0.848
 circus+0.769
 ours+0.755
ww+0.746
 says+0.743
Neg logit contributions
antly-0.678
 fingertips-0.677
ively-0.677
ant-0.645
 efforts-0.622
Tokenics
Token ID873
Feature activation-7.85e-01

Pos logit contributions
ively+0.240
antly+0.239
ant+0.235
 fingertips+0.232
 efforts+0.230
Neg logit contributions
rob-0.213
 says-0.194
 ours-0.185
 circus-0.180
 hungry-0.178
Token.
Token ID13
Feature activation-8.19e-01

Pos logit contributions
ively+0.041
antly+0.041
ant+0.040
 efforts+0.039
 fingertips+0.039
Neg logit contributions
rob-0.039
 says-0.035
 ours-0.033
 circus-0.033
 scar-0.033
Token<|endoftext|>
Token ID50256
Feature activation+1.91e-01

Pos logit contributions
ant+0.052
ered+0.051
ively+0.051
 efforts+0.050
antly+0.049
Neg logit contributions
rob-0.032
 says-0.031
 circus-0.027
 ours-0.026
 hungry-0.026
TokenOnce
Token ID7454
Feature activation+3.48e+00

Pos logit contributions
rob+0.007
 ours+0.006
 says+0.006
 scar+0.006
 circus+0.006
Neg logit contributions
antly-0.013
ively-0.013
ant-0.012
 fingertips-0.012
ered-0.012
Token their
Token ID511
Feature activation+4.21e+00

Pos logit contributions
ively+0.172
antly+0.170
ant+0.168
 fingertips+0.164
 efforts+0.164
Neg logit contributions
rob-0.119
 says-0.108
 ours-0.102
 circus-0.101
 hungry-0.098
Token salad
Token ID20698
Feature activation+5.59e+00

Pos logit contributions
rob+0.141
 says+0.120
 ours+0.111
 circus+0.111
 scar+0.107
Neg logit contributions
ively-0.291
antly-0.288
ant-0.285
 efforts-0.282
 fingertips-0.281
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
rob+0.226
 says+0.203
 ours+0.190
 circus+0.189
 scar+0.184
Neg logit contributions
ively-0.344
antly-0.338
ant-0.336
 fingertips-0.329
 efforts-0.327
Token have
Token ID423
Feature activation-2.11e+00

Pos logit contributions
rob+0.078
 says+0.075
 circus+0.070
 ours+0.070
 hungry+0.069
Neg logit contributions
ively-0.111
ant-0.110
antly-0.110
 fingertips-0.104
 efforts-0.103
Token happy
Token ID3772
Feature activation+2.22e+00

Pos logit contributions
ively+0.159
ant+0.156
antly+0.155
 fingertips+0.151
ered+0.150
Neg logit contributions
rob-0.060
 says-0.052
 circus-0.048
 ours-0.046
 scar-0.046
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
rob+0.115
 says+0.104
 ours+0.100
 circus+0.099
 scar+0.097
Neg logit contributions
ively-0.113
antly-0.111
ant-0.110
 efforts-0.108
 fingertips-0.107
Tokenn
Token ID77
Feature activation-4.47e+00

Pos logit contributions
rob+0.843
ww+0.723
 circus+0.714
 says+0.708
 fun+0.704
Neg logit contributions
 fingertips-0.738
ively-0.717
antly-0.716
 Days-0.661
 efforts-0.657
Tokenics
Token ID873
Feature activation-7.78e-01

Pos logit contributions
 fingertips+0.258
antly+0.251
 efforts+0.245
ively+0.245
ant+0.240
Neg logit contributions
rob-0.195
 says-0.177
 ours-0.169
 hungry-0.162
 sleeping-0.160
Token together
Token ID1978
Feature activation+2.77e+00

Pos logit contributions
ively+0.051
ant+0.050
 fingertips+0.050
antly+0.050
ered+0.049
Neg logit contributions
rob-0.029
 says-0.025
 circus-0.024
 hungry-0.023
 ours-0.023
Token.
Token ID13
Feature activation-8.17e-01

Pos logit contributions
rob+0.102
 says+0.096
 circus+0.091
 hungry+0.091
 sleeping+0.089
Neg logit contributions
ively-0.175
ant-0.172
antly-0.171
ainer-0.167
 fingertips-0.167
Token<|endoftext|>
Token ID50256
Feature activation+1.89e-01

Pos logit contributions
ively+0.050
ant+0.049
ered+0.049
antly+0.048
 fingertips+0.047
Neg logit contributions
rob-0.035
 says-0.032
 circus-0.029
 ours-0.029
 hungry-0.029
Token children
Token ID1751
Feature activation-2.21e+00

Pos logit contributions
ively+0.272
antly+0.269
ant+0.266
 fingertips+0.259
 efforts+0.259
Neg logit contributions
rob-0.229
 says-0.208
 circus-0.198
 ours-0.197
 scar-0.192
Token running
Token ID2491
Feature activation+5.79e+00

Pos logit contributions
ively+0.137
antly+0.134
ant+0.133
 efforts+0.131
 fingertips+0.130
Neg logit contributions
rob-0.090
 says-0.078
 ours-0.074
 circus-0.073
 scar-0.071
Token around
Token ID1088
Feature activation-3.89e+00

Pos logit contributions
rob+0.314
 says+0.285
 ours+0.273
 circus+0.270
 scar+0.264
Neg logit contributions
ively-0.281
antly-0.275
ant-0.270
 efforts-0.264
ered-0.261
Token,
Token ID11
Feature activation-1.58e+00

Pos logit contributions
ively+0.222
antly+0.219
ant+0.216
 efforts+0.212
 fingertips+0.211
Neg logit contributions
rob-0.176
 says-0.158
 ours-0.150
 circus-0.148
 scar-0.145
Token families
Token ID4172
Feature activation-6.32e+00

Pos logit contributions
ively+0.092
antly+0.090
 efforts+0.090
 fingertips+0.089
ant+0.088
Neg logit contributions
rob-0.070
ww-0.057
P-0.057
 says-0.056
 ours-0.056
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
ively+0.346
antly+0.336
ant+0.334
 efforts+0.330
 fingertips+0.326
Neg logit contributions
rob-0.304
 says-0.264
 ours-0.261
ww-0.252
 circus-0.250
Tokenn
Token ID77
Feature activation-4.48e+00

Pos logit contributions
rob+0.943
 circus+0.856
 says+0.839
 ours+0.823
 scar+0.794
Neg logit contributions
antly-0.623
 fingertips-0.607
ively-0.599
 efforts-0.588
ainer-0.574
Tokenicking
Token ID7958
Feature activation-1.36e+01

Pos logit contributions
antly+0.229
 fingertips+0.228
ively+0.227
 efforts+0.223
ant+0.223
Neg logit contributions
rob-0.218
 says-0.203
 ours-0.188
 sleeping-0.186
 hungry-0.186
Token,
Token ID11
Feature activation-1.60e+00

Pos logit contributions
ively+0.745
antly+0.738
ant+0.721
 efforts+0.716
 fingertips+0.707
Neg logit contributions
rob-0.655
 says-0.563
 ours-0.543
 circus-0.540
P-0.533
Token couples
Token ID11886
Feature activation-4.74e+00

Pos logit contributions
ively+0.091
 efforts+0.091
antly+0.090
 fingertips+0.089
ant+0.089
Neg logit contributions
rob-0.072
 says-0.060
ww-0.059
P-0.058
 ours-0.058
Token st
Token ID336
Feature activation+1.19e+01

Pos logit contributions
ively+0.273
antly+0.269
ant+0.266
 efforts+0.262
 fingertips+0.259
Neg logit contributions
rob-0.209
 says-0.181
 ours-0.179
ww-0.173
P-0.169
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
 fingertips+0.260
ant+0.252
ively+0.251
antly+0.247
ainer+0.238
Neg logit contributions
 says-0.179
rob-0.177
 ours-0.164
 scary-0.162
 hungry-0.160
Token be
Token ID307
Feature activation-1.55e+00

Pos logit contributions
rob+0.068
 says+0.067
 circus+0.063
 scar+0.063
 ours+0.062
Neg logit contributions
antly-0.081
 fingertips-0.079
ively-0.079
ant-0.076
 efforts-0.075
Token careful
Token ID8161
Feature activation-1.72e+00

Pos logit contributions
ively+0.096
 fingertips+0.095
ant+0.095
antly+0.094
 efforts+0.090
Neg logit contributions
rob-0.060
 says-0.057
 hungry-0.057
 ours-0.056
 scar-0.053
Token when
Token ID618
Feature activation-5.83e+00

Pos logit contributions
 fingertips+0.124
growth+0.124
antly+0.123
ively+0.123
ant+0.121
Neg logit contributions
 says-0.039
 waiting-0.038
 sleeping-0.037
 ours-0.036
 hungry-0.035
Token having
Token ID1719
Feature activation-1.60e+00

Pos logit contributions
ively+0.367
antly+0.361
ant+0.357
 efforts+0.355
 fingertips+0.351
Neg logit contributions
rob-0.236
 says-0.202
 circus-0.189
 ours-0.189
 scar-0.186
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
ively+0.105
antly+0.104
 efforts+0.103
ant+0.102
 fingertips+0.102
Neg logit contributions
rob-0.059
 says-0.050
 circus-0.046
 ours-0.046
 scar-0.044
Tokenn
Token ID77
Feature activation-4.49e+00

Pos logit contributions
rob+0.764
 says+0.668
 ours+0.668
use+0.640
 fun+0.636
Neg logit contributions
 fingertips-0.771
antly-0.759
ively-0.677
mates-0.673
 Days-0.673
Tokenics
Token ID873
Feature activation-8.04e-01

Pos logit contributions
 fingertips+0.239
antly+0.234
ively+0.228
 efforts+0.227
ant+0.221
Neg logit contributions
rob-0.212
 says-0.206
 ours-0.182
 wants-0.182
 blows-0.182
Token and
Token ID290
Feature activation+1.86e+00

Pos logit contributions
ively+0.044
antly+0.043
ant+0.043
 fingertips+0.042
 efforts+0.042
Neg logit contributions
rob-0.038
 says-0.035
 ours-0.033
 circus-0.033
 scar-0.032
Token not
Token ID407
Feature activation-3.15e+00

Pos logit contributions
rob+0.086
 says+0.076
 circus+0.071
 ours+0.071
 scar+0.070
Neg logit contributions
ively-0.105
antly-0.104
ant-0.102
 efforts-0.102
 fingertips-0.101
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
 efforts+0.165
ively+0.164
ered+0.159
ant+0.159
 fingertips+0.158
Neg logit contributions
rob-0.168
 says-0.136
 circus-0.131
P-0.130
 ours-0.127
TokenThey
Token ID2990
Feature activation+1.83e+00

Pos logit contributions
rob+0.075
P+0.069
 says+0.068
 circus+0.067
ww+0.065
Neg logit contributions
 fingertips-0.076
ively-0.074
antly-0.073
 efforts-0.073
ant-0.070
Token decided
Token ID3066
Feature activation+7.20e-01

Pos logit contributions
 says+0.079
rob+0.079
 scar+0.073
 hungry+0.071
 circus+0.070
Neg logit contributions
ively-0.104
ant-0.103
antly-0.102
ainer-0.101
 fingertips-0.099
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
rob+0.029
 says+0.026
 ours+0.025
 circus+0.024
 hungry+0.024
Neg logit contributions
ively-0.044
antly-0.044
ant-0.044
 fingertips-0.043
 efforts-0.042
Token make
Token ID787
Feature activation-6.06e+00

Pos logit contributions
 says+0.059
 circus+0.054
 ours+0.051
rob+0.051
 your+0.050
Neg logit contributions
antly-0.087
ively-0.086
ant-0.084
 fingertips-0.083
bled-0.082
Token more
Token ID517
Feature activation-3.68e+00

Pos logit contributions
ively+0.329
antly+0.323
ant+0.322
 fingertips+0.315
 efforts+0.313
Neg logit contributions
rob-0.290
 says-0.269
 ours-0.253
 circus-0.251
 scar-0.247
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
 efforts+0.204
ively+0.191
ered+0.190
antly+0.187
 fingertips+0.186
Neg logit contributions
rob-0.185
P-0.144
ww-0.140
use-0.140
 ours-0.135
Tokenn
Token ID77
Feature activation-4.49e+00

Pos logit contributions
rob+0.795
 ours+0.717
 fun+0.705
 says+0.705
ww+0.696
Neg logit contributions
 fingertips-0.718
antly-0.715
ively-0.676
 Days-0.672
ant-0.647
Tokenics
Token ID873
Feature activation-8.00e-01

Pos logit contributions
antly+0.231
ively+0.227
 fingertips+0.224
ant+0.223
 efforts+0.219
Neg logit contributions
rob-0.225
 says-0.207
 ours-0.197
 circus-0.190
 hungry-0.190
Token and
Token ID290
Feature activation+1.86e+00

Pos logit contributions
ively+0.046
antly+0.045
ant+0.044
 efforts+0.044
 fingertips+0.043
Neg logit contributions
rob-0.036
 says-0.032
 ours-0.031
 circus-0.031
 scar-0.030
Token looked
Token ID3114
Feature activation+1.18e+00

Pos logit contributions
rob+0.087
 says+0.082
 ours+0.078
 circus+0.077
 scar+0.076
Neg logit contributions
ively-0.101
ant-0.100
antly-0.100
 fingertips-0.096
 efforts-0.095
Token in
Token ID287
Feature activation-9.46e-01

Pos logit contributions
rob+0.053
 says+0.048
 ours+0.045
 circus+0.045
 scar+0.044
Neg logit contributions
ively-0.068
antly-0.067
ant-0.066
 efforts-0.065
 fingertips-0.065
Token in
Token ID287
Feature activation-9.51e-01

Pos logit contributions
rob+0.044
 says+0.039
 ours+0.039
ww+0.037
 circus+0.036
Neg logit contributions
ively-0.048
antly-0.048
ant-0.046
ered-0.046
 efforts-0.045
Token the
Token ID262
Feature activation-1.94e+00

Pos logit contributions
ively+0.046
antly+0.045
ant+0.044
 efforts+0.043
 fingertips+0.043
Neg logit contributions
rob-0.052
 says-0.047
 ours-0.045
 circus-0.045
 scar-0.044
Token garden
Token ID11376
Feature activation+6.49e-01

Pos logit contributions
antly+0.094
ively+0.092
 Days+0.090
ered+0.088
 fingertips+0.086
Neg logit contributions
rob-0.115
 says-0.089
 yells-0.087
 ours-0.086
 scar-0.086
Token and
Token ID290
Feature activation+1.86e+00

Pos logit contributions
rob+0.034
 says+0.030
 ours+0.029
 circus+0.028
ww+0.028
Neg logit contributions
ively-0.033
antly-0.032
ant-0.032
 efforts-0.032
 fingertips-0.031
Token have
Token ID423
Feature activation-2.12e+00

Pos logit contributions
rob+0.094
 says+0.082
 ours+0.078
 circus+0.078
ww+0.077
Neg logit contributions
ively-0.098
antly-0.096
 fingertips-0.094
ant-0.094
 efforts-0.094
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
 efforts+0.140
ered+0.140
antly+0.138
 fingertips+0.136
ively+0.136
Neg logit contributions
rob-0.067
 says-0.054
ww-0.050
use-0.050
 blows-0.050
Tokenn
Token ID77
Feature activation-4.48e+00

Pos logit contributions
rob+0.851
 says+0.752
 ours+0.752
 circus+0.751
 fun+0.736
Neg logit contributions
antly-0.705
ively-0.694
 fingertips-0.693
 efforts-0.639
ant-0.626
Tokenics
Token ID873
Feature activation-8.01e-01

Pos logit contributions
 fingertips+0.252
antly+0.250
 efforts+0.244
ively+0.244
ant+0.235
Neg logit contributions
rob-0.203
 says-0.185
 ours-0.174
 hungry-0.164
 sleeping-0.164
Token together
Token ID1978
Feature activation+2.78e+00

Pos logit contributions
ively+0.041
antly+0.040
ant+0.040
 efforts+0.039
 fingertips+0.038
Neg logit contributions
rob-0.041
 says-0.037
 ours-0.035
 circus-0.034
ww-0.034
Token.
Token ID13
Feature activation-8.20e-01

Pos logit contributions
rob+0.131
 says+0.119
 ours+0.113
 circus+0.113
 hungry+0.110
Neg logit contributions
ively-0.153
antly-0.151
ant-0.149
 efforts-0.146
 fingertips-0.145
Token
Token ID220
Feature activation+3.22e-02

Pos logit contributions
ively+0.045
antly+0.045
ant+0.044
 efforts+0.043
 fingertips+0.043
Neg logit contributions
rob-0.039
 says-0.036
 circus-0.034
 ours-0.034
 hungry-0.033
Token went
Token ID1816
Feature activation+3.26e+00

Pos logit contributions
rob+0.020
 says+0.018
 scar+0.017
 circus+0.017
 hungry+0.016
Neg logit contributions
ively-0.023
antly-0.023
ant-0.022
 fingertips-0.022
ainer-0.022
Token on
Token ID319
Feature activation+5.17e+00

Pos logit contributions
rob+0.163
 ours+0.156
 says+0.154
Say+0.148
ww+0.148
Neg logit contributions
antly-0.152
ively-0.145
 Days-0.143
ered-0.142
mates-0.141
Token adventures
Token ID17545
Feature activation+5.59e+00

Pos logit contributions
rob+0.223
 says+0.200
 ours+0.191
ww+0.183
 circus+0.179
Neg logit contributions
ively-0.297
antly-0.296
 efforts-0.295
 fingertips-0.287
ant-0.286
Token,
Token ID11
Feature activation-1.59e+00

Pos logit contributions
rob+0.233
 says+0.206
 circus+0.195
 ours+0.193
 scar+0.188
Neg logit contributions
ively-0.341
antly-0.336
ant-0.333
 fingertips-0.327
 efforts-0.326
Token had
Token ID550
Feature activation-9.34e-02

Pos logit contributions
ively+0.083
 efforts+0.081
ant+0.081
antly+0.080
 fingertips+0.079
Neg logit contributions
rob-0.073
 says-0.067
 ours-0.067
ww-0.066
P-0.063
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
ively+0.005
ant+0.005
antly+0.005
 fingertips+0.005
 efforts+0.004
Neg logit contributions
rob-0.005
 says-0.004
 circus-0.004
 ours-0.004
 scar-0.004
Tokenn
Token ID77
Feature activation-4.48e+00

Pos logit contributions
rob+0.843
 fun+0.753
 ours+0.730
 circus+0.724
ww+0.721
Neg logit contributions
antly-0.719
ively-0.701
 fingertips-0.695
ant-0.660
 Days-0.639
Tokenics
Token ID873
Feature activation-8.01e-01

Pos logit contributions
ively+0.226
antly+0.226
 fingertips+0.223
ant+0.220
 efforts+0.217
Neg logit contributions
rob-0.230
 says-0.207
 ours-0.197
 circus-0.192
 hungry-0.190
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
ively+0.048
antly+0.047
ant+0.047
 fingertips+0.046
 efforts+0.046
Neg logit contributions
rob-0.034
 says-0.030
 circus-0.028
 ours-0.028
 scar-0.028
Token and
Token ID290
Feature activation+1.85e+00

Pos logit contributions
ively+0.083
 efforts+0.082
antly+0.082
 fingertips+0.081
ant+0.080
Neg logit contributions
rob-0.073
 says-0.067
ww-0.064
 ours-0.064
P-0.061
Token even
Token ID772
Feature activation-3.77e+00

Pos logit contributions
rob+0.091
 says+0.083
 circus+0.079
 ours+0.079
 scar+0.078
Neg logit contributions
ively-0.099
antly-0.098
ant-0.097
 fingertips-0.094
 efforts-0.094
Token girl
Token ID2576
Feature activation+1.33e-01

Pos logit contributions
 mummy+0.279
 efforts+0.278
 fingertips+0.265
antly+0.265
mates+0.262
Neg logit contributions
rob-0.310
 yells-0.267
 ours-0.265
 says-0.257
 shouts-0.245
Token named
Token ID3706
Feature activation+4.47e-01

Pos logit contributions
rob+0.006
 says+0.006
 ours+0.005
 circus+0.005
 scar+0.005
Neg logit contributions
ively-0.007
antly-0.007
ant-0.007
 efforts-0.007
 fingertips-0.007
Token Lily
Token ID20037
Feature activation+4.13e-01

Pos logit contributions
rob+0.016
 says+0.015
 ours+0.014
 circus+0.013
 hungry+0.013
Neg logit contributions
ively-0.029
antly-0.029
ant-0.029
 efforts-0.028
 fingertips-0.028
Token who
Token ID508
Feature activation-3.89e+00

Pos logit contributions
rob+0.019
 says+0.018
 ours+0.017
 circus+0.017
 scar+0.016
Neg logit contributions
ively-0.023
antly-0.022
ant-0.022
 efforts-0.022
 fingertips-0.022
Token loved
Token ID6151
Feature activation-3.83e+00

Pos logit contributions
antly+0.162
ively+0.158
ant+0.157
 efforts+0.153
ered+0.152
Neg logit contributions
rob-0.235
 ours-0.211
 says-0.211
 circus-0.204
 scar-0.202
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
 efforts+0.136
 fingertips+0.129
 everybody+0.117
 mummy+0.116
 Everyone+0.115
Neg logit contributions
rob-0.246
 says-0.214
use-0.207
 ours-0.206
 yells-0.202
Tokenn
Token ID77
Feature activation-4.48e+00

Pos logit contributions
rob+0.866
 circus+0.746
 says+0.741
 ours+0.724
ww+0.723
Neg logit contributions
antly-0.714
ively-0.690
 fingertips-0.677
 efforts-0.671
ant-0.639
Tokenics
Token ID873
Feature activation-8.01e-01

Pos logit contributions
antly+0.253
 fingertips+0.249
 efforts+0.245
ively+0.240
ant+0.235
Neg logit contributions
rob-0.202
 says-0.176
 ours-0.170
 sleeping-0.169
 waiting-0.165
Token.
Token ID13
Feature activation-8.01e-01

Pos logit contributions
ively+0.036
antly+0.036
ant+0.035
 efforts+0.034
 fingertips+0.033
Neg logit contributions
rob-0.045
 says-0.041
 ours-0.040
 circus-0.039
ww-0.038
Token She
Token ID1375
Feature activation+2.32e+00

Pos logit contributions
ively+0.048
antly+0.048
ant+0.047
 efforts+0.046
 fingertips+0.046
Neg logit contributions
rob-0.034
 says-0.030
 ours-0.029
 circus-0.028
 scar-0.028
Token would
Token ID561
Feature activation-1.29e-01

Pos logit contributions
rob+0.133
 ours+0.126
 says+0.117
 circus+0.116
 yells+0.114
Neg logit contributions
antly-0.098
 efforts-0.091
ered-0.091
ant-0.089
ively-0.087
Token near
Token ID1474
Feature activation+1.12e+00

Pos logit contributions
rob+0.386
 says+0.345
 circus+0.327
 ours+0.326
 hungry+0.317
Neg logit contributions
ively-0.525
antly-0.517
ant-0.514
 fingertips-0.506
 efforts-0.506
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
 says+0.046
 ours+0.044
rob+0.043
 hungry+0.042
 circus+0.041
Neg logit contributions
antly-0.067
ively-0.067
ant-0.066
 efforts-0.063
ainer-0.063
Token cliff
Token ID19516
Feature activation+4.15e-01

Pos logit contributions
ively+0.119
antly+0.117
ant+0.116
 efforts+0.114
 fingertips+0.114
Neg logit contributions
rob-0.079
 says-0.070
 ours-0.066
 circus-0.065
 scar-0.064
Token and
Token ID290
Feature activation+1.88e+00

Pos logit contributions
rob+0.020
 says+0.018
 ours+0.017
 circus+0.017
 scar+0.017
Neg logit contributions
ively-0.023
antly-0.022
ant-0.022
 efforts-0.022
 fingertips-0.021
Token have
Token ID423
Feature activation-2.11e+00

Pos logit contributions
rob+0.098
 says+0.086
 ours+0.083
ww+0.082
 circus+0.081
Neg logit contributions
ively-0.096
antly-0.093
ant-0.091
 fingertips-0.091
 efforts-0.091
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
 efforts+0.129
antly+0.128
ered+0.127
ively+0.125
 fingertips+0.123
Neg logit contributions
rob-0.081
 says-0.065
 blows-0.062
use-0.061
 yells-0.060
Tokenn
Token ID77
Feature activation-4.48e+00

Pos logit contributions
rob+0.865
 circus+0.776
 ours+0.761
 says+0.761
 fun+0.746
Neg logit contributions
ively-0.689
antly-0.686
 fingertips-0.672
ant-0.631
 efforts-0.625
Tokenics
Token ID873
Feature activation-7.88e-01

Pos logit contributions
antly+0.235
 fingertips+0.234
ively+0.233
 efforts+0.229
ant+0.227
Neg logit contributions
rob-0.218
 says-0.199
 ours-0.188
 hungry-0.181
 circus-0.180
Token.
Token ID13
Feature activation-8.12e-01

Pos logit contributions
ively+0.042
antly+0.042
ant+0.040
 efforts+0.040
ered+0.039
Neg logit contributions
rob-0.038
 says-0.035
 ours-0.033
 circus-0.032
ww-0.032
Token One
Token ID1881
Feature activation+8.90e-01

Pos logit contributions
ively+0.041
antly+0.040
ant+0.039
 efforts+0.039
 fingertips+0.039
Neg logit contributions
rob-0.043
 says-0.039
 ours-0.037
 circus-0.037
 scar-0.036
Token day
Token ID1110
Feature activation+1.30e+00

Pos logit contributions
rob+0.038
 says+0.034
 ours+0.032
 circus+0.032
 scar+0.031
Neg logit contributions
ively-0.053
antly-0.052
ant-0.052
 efforts-0.051
 fingertips-0.051
Token and
Token ID290
Feature activation+1.86e+00

Pos logit contributions
rob+0.119
 says+0.111
 circus+0.104
 ours+0.104
 hungry+0.102
Neg logit contributions
ively-0.158
antly-0.157
ant-0.153
 fingertips-0.151
 efforts-0.151
Token it
Token ID340
Feature activation-1.92e+00

Pos logit contributions
 says+0.082
rob+0.081
 circus+0.074
 hungry+0.073
 wants+0.073
Neg logit contributions
ively-0.105
antly-0.104
ant-0.102
 fingertips-0.099
ainer-0.099
Token was
Token ID373
Feature activation-3.93e+00

Pos logit contributions
ively+0.101
antly+0.099
ant+0.098
 fingertips+0.097
 efforts+0.096
Neg logit contributions
rob-0.095
 says-0.089
 circus-0.083
 hungry-0.082
 ours-0.081
Token going
Token ID1016
Feature activation+1.25e+01

Pos logit contributions
ively+0.217
antly+0.214
ant+0.211
 efforts+0.208
 fingertips+0.207
Neg logit contributions
rob-0.186
 says-0.165
 circus-0.157
 ours-0.156
 scar-0.152
Token on
Token ID319
Feature activation+5.18e+00

Pos logit contributions
rob+0.599
 says+0.533
 circus+0.494
ww+0.487
 ours+0.485
Neg logit contributions
antly-0.699
ively-0.690
ant-0.656
ered-0.655
 efforts-0.650
Token pic
Token ID8301
Feature activation+1.54e+01

Pos logit contributions
rob+0.230
 circus+0.204
 says+0.203
 ours+0.196
 hungry+0.195
Neg logit contributions
ant-0.297
ively-0.297
 fingertips-0.293
antly-0.290
 efforts-0.285
Tokenn
Token ID77
Feature activation-4.48e+00

Pos logit contributions
rob+0.855
 circus+0.775
 says+0.757
 ours+0.756
 fun+0.755
Neg logit contributions
 fingertips-0.706
antly-0.693
ively-0.674
 efforts-0.657
ant-0.644
Tokenics
Token ID873
Feature activation-7.97e-01

Pos logit contributions
antly+0.234
 fingertips+0.231
ively+0.227
 efforts+0.227
ant+0.225
Neg logit contributions
rob-0.218
 says-0.203
 ours-0.188
 hungry-0.182
 circus-0.181
Token!
Token ID0
Feature activation+1.05e-01

Pos logit contributions
ively+0.042
antly+0.042
ant+0.041
 efforts+0.040
 fingertips+0.040
Neg logit contributions
rob-0.039
 says-0.035
 ours-0.034
 circus-0.033
 scar-0.033
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
rob+0.005
 ours+0.005
 says+0.005
 circus+0.005
 scar+0.004
Neg logit contributions
antly-0.005
ively-0.005
ant-0.005
 fingertips-0.005
 efforts-0.005
Token
Token ID198
Feature activation+1.47e+00

Pos logit contributions
rob+0.065
 says+0.059
 circus+0.054
 ours+0.054
 hungry+0.052
Neg logit contributions
ively-0.087
antly-0.085
ant-0.085
 efforts-0.084
 fingertips-0.084
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
ively+0.044
antly+0.042
ant+0.042
 efforts+0.042
 fingertips+0.041
Neg logit contributions
rob-0.041
 says-0.038
 circus-0.035
 scar-0.035
 ours-0.034
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
rob+0.068
 says+0.061
 circus+0.057
 ours+0.057
 scar+0.056
Neg logit contributions
ively-0.083
antly-0.082
ant-0.081
 efforts-0.080
 fingertips-0.080
TokenOne
Token ID3198
Feature activation+4.52e+00

Pos logit contributions
rob+0.069
 ours+0.063
 says+0.063
 circus+0.059
 hungry+0.059
Neg logit contributions
antly-0.081
ively-0.080
ant-0.080
 efforts-0.076
ered-0.076
Token day
Token ID1110
Feature activation+1.29e+00

Pos logit contributions
rob+0.186
 says+0.167
 circus+0.157
 ours+0.156
 hungry+0.152
Neg logit contributions
ively-0.276
antly-0.273
ant-0.270
 fingertips-0.265
 efforts-0.265
Token Ch
Token ID609
Feature activation-7.03e-01

Pos logit contributions
rob+0.058
 says+0.054
 scar+0.052
ww+0.050
 sleeping+0.049
Neg logit contributions
ively-0.074
 efforts-0.071
 fingertips-0.071
ant-0.070
antly-0.070
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
antly+0.033
ively+0.031
 fingertips+0.031
ered+0.030
 efforts+0.030
Neg logit contributions
rob-0.038
 says-0.035
 scar-0.033
 blows-0.033
etchup-0.033
Tokenelle
Token ID13485
Feature activation-2.26e+00

Pos logit contributions
ively+1.110
 fingertips+1.107
antly+1.091
 efforts+1.083
ant+1.065
Neg logit contributions
rob-0.866
 says-0.785
 scar-0.717
 circus-0.709
 wants-0.690
Token's
Token ID338
Feature activation+1.61e+00

Pos logit contributions
ively+0.135
ant+0.132
antly+0.132
 fingertips+0.131
 efforts+0.129
Neg logit contributions
rob-0.092
 says-0.090
 hungry-0.080
 circus-0.079
 wants-0.079
Token mom
Token ID1995
Feature activation-8.78e+00

Pos logit contributions
 says+0.081
 hungry+0.081
 scar+0.079
 sleeping+0.077
 scary+0.077
Neg logit contributions
ively-0.080
antly-0.078
ered-0.077
ant-0.076
ainer-0.072
Token put
Token ID1234
Feature activation-7.48e-01

Pos logit contributions
antly+0.515
ively+0.511
ered+0.501
ant+0.500
 efforts+0.485
Neg logit contributions
rob-0.393
 ours-0.341
 circus-0.329
ww-0.306
 scar-0.292
Token a
Token ID257
Feature activation-1.35e+00

Pos logit contributions
antly+0.040
ively+0.040
ant+0.039
ered+0.039
 efforts+0.036
Neg logit contributions
rob-0.035
 circus-0.032
 says-0.032
 ours-0.030
 wants-0.030
Token that
Token ID326
Feature activation-5.13e+00

Pos logit contributions
rob+0.078
ww+0.066
 scar+0.066
P+0.065
 ours+0.065
Neg logit contributions
ively-0.057
antly-0.057
ered-0.053
ant-0.053
 fingertips-0.052
Token said
Token ID531
Feature activation+8.79e+00

Pos logit contributions
ively+0.368
antly+0.367
ered+0.364
 fingertips+0.354
ant+0.353
Neg logit contributions
rob-0.155
 circus-0.115
 ours-0.113
 scar-0.110
ww-0.109
Token "
Token ID366
Feature activation+3.61e-01

Pos logit contributions
 says+0.326
 hungry+0.306
 circus+0.302
 waiting+0.300
rob+0.297
Neg logit contributions
ively-0.546
ant-0.532
antly-0.530
 fingertips-0.521
 efforts-0.520
Tokenrest
Token ID2118
Feature activation-8.46e+00

Pos logit contributions
rob+0.016
 says+0.015
 ours+0.014
 circus+0.014
 scar+0.013
Neg logit contributions
ively-0.021
antly-0.021
ant-0.021
 fingertips-0.020
 efforts-0.019
Tokenaur
Token ID2899
Feature activation-4.80e+00

Pos logit contributions
ively+0.467
antly+0.462
ant+0.457
 efforts+0.446
 fingertips+0.443
Neg logit contributions
rob-0.398
 says-0.356
 ours-0.338
 circus-0.336
 scar-0.332
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
 fingertips+0.218
 efforts+0.215
ively+0.211
antly+0.209
 Days+0.205
Neg logit contributions
rob-0.255
 says-0.249
 ours-0.238
ww-0.228
 wants-0.226
Token".
Token ID1911
Feature activation-6.73e-01

Pos logit contributions
ively+1.054
antly+1.039
ant+1.023
 fingertips+1.009
 efforts+1.009
Neg logit contributions
rob-0.899
 says-0.816
 ours-0.775
 circus-0.772
 scar-0.747
Token They
Token ID1119
Feature activation+3.74e-01

Pos logit contributions
ively+0.042
antly+0.042
ant+0.042
 efforts+0.041
 fingertips+0.041
Neg logit contributions
 says-0.026
rob-0.024
 ours-0.022
 hungry-0.022
 wants-0.021
Token went
Token ID1816
Feature activation+3.26e+00

Pos logit contributions
 says+0.014
rob+0.013
 scar+0.013
 hungry+0.013
 wants+0.012
Neg logit contributions
ainer-0.024
ively-0.024
antly-0.023
ant-0.023
 efforts-0.022
Token inside
Token ID2641
Feature activation-2.09e+00

Pos logit contributions
rob+0.179
ww+0.156
 says+0.156
 ours+0.151
P+0.148
Neg logit contributions
ively-0.160
antly-0.159
ered-0.152
 efforts-0.146
ant-0.145
Token and
Token ID290
Feature activation+1.86e+00

Pos logit contributions
ively+0.140
antly+0.139
ant+0.139
 fingertips+0.136
ered+0.133
Neg logit contributions
rob-0.069
 says-0.067
 hungry-0.060
 ours-0.060
 circus-0.060
Token reached
Token ID4251
Feature activation-6.57e-01

Pos logit contributions
rob+0.031
 says+0.028
 circus+0.026
 scar+0.025
 ours+0.025
Neg logit contributions
ively-0.046
antly-0.045
ant-0.044
 fingertips-0.044
 efforts-0.043
Token for
Token ID329
Feature activation+5.47e+00

Pos logit contributions
ively+0.039
antly+0.038
ant+0.038
ered+0.037
 efforts+0.037
Neg logit contributions
rob-0.030
ww-0.024
 circus-0.024
 says-0.024
 ours-0.024
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
rob+0.208
 says+0.197
 scar+0.190
 ours+0.190
 hungry+0.189
Neg logit contributions
antly-0.350
ant-0.339
ively-0.336
 fingertips-0.320
 efforts-0.319
Token cro
Token ID6763
Feature activation+1.19e+01

Pos logit contributions
ively+0.116
antly+0.115
ant+0.115
 fingertips+0.114
 efforts+0.114
Neg logit contributions
rob-0.082
 says-0.067
 scar-0.064
 ours-0.063
ww-0.063
Tokeniss
Token ID747
Feature activation+1.88e+00

Pos logit contributions
rob+0.468
 says+0.407
 ours+0.404
 circus+0.399
 scar+0.381
Neg logit contributions
ively-0.752
antly-0.747
ant-0.739
ered-0.716
 efforts-0.708
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
rob+0.114
 says+0.113
 ours+0.107
ww+0.103
 wants+0.103
Neg logit contributions
antly-0.066
 fingertips-0.065
ively-0.065
 Days-0.065
 efforts-0.064
Token.
Token ID13
Feature activation-8.02e-01

Pos logit contributions
ively+1.101
antly+1.086
ant+1.070
 efforts+1.068
 fingertips+1.067
Neg logit contributions
rob-0.834
 says-0.779
 circus-0.716
 ours-0.715
 scar-0.693
Token He
Token ID679
Feature activation+7.47e-01

Pos logit contributions
ively+0.049
ant+0.049
antly+0.048
ered+0.048
 efforts+0.048
Neg logit contributions
rob-0.031
 says-0.031
 circus-0.027
 hungry-0.027
 ours-0.027
Token rubbed
Token ID31862
Feature activation-1.03e-02

Pos logit contributions
rob+0.031
 says+0.028
 circus+0.026
 ours+0.026
 scar+0.026
Neg logit contributions
ively-0.045
antly-0.044
ant-0.044
 efforts-0.043
 fingertips-0.043
Token it
Token ID340
Feature activation-1.91e+00

Pos logit contributions
 efforts+0.001
ively+0.001
 fingertips+0.001
ant+0.001
 doorstep+0.001
Neg logit contributions
rob-0.000
 says-0.000
 red-0.000
 scar-0.000
 hungry-0.000
Token on
Token ID319
Feature activation+5.18e+00

Pos logit contributions
ively+0.115
ant+0.113
antly+0.112
 efforts+0.111
 fingertips+0.110
Neg logit contributions
rob-0.078
 says-0.072
 scar-0.066
 hungry-0.066
 ours-0.066
Token the
Token ID262
Feature activation-1.94e+00

Pos logit contributions
rob+0.205
 says+0.183
 ours+0.179
 circus+0.175
ww+0.175
Neg logit contributions
ively-0.181
antly-0.181
ant-0.179
 efforts-0.173
 fingertips-0.172
Token word
Token ID1573
Feature activation-1.13e+00

Pos logit contributions
antly+0.099
ively+0.098
ant+0.098
 fingertips+0.097
 efforts+0.097
Neg logit contributions
rob-0.098
 says-0.084
 ours-0.084
 scar-0.081
ww-0.081
Token "
Token ID366
Feature activation+3.78e-01

Pos logit contributions
ively+0.062
antly+0.061
ant+0.060
 efforts+0.059
 fingertips+0.059
Neg logit contributions
rob-0.054
 says-0.049
 ours-0.046
 circus-0.046
 scar-0.045
Tokenign
Token ID570
Feature activation+8.55e-01

Pos logit contributions
rob+0.018
 says+0.016
 ours+0.016
 circus+0.015
 scar+0.015
Neg logit contributions
ively-0.021
antly-0.020
ant-0.020
 fingertips-0.019
 efforts-0.019
Tokenor
Token ID273
Feature activation-1.01e+01

Pos logit contributions
rob+0.050
 says+0.049
 ours+0.046
 wants+0.045
ww+0.044
Neg logit contributions
 fingertips-0.035
ively-0.035
antly-0.035
 efforts-0.034
ainer-0.032
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
 fingertips+0.314
 efforts+0.311
ively+0.309
antly+0.298
ainer+0.284
Neg logit contributions
rob-0.682
 says-0.670
 wants-0.628
 ours-0.626
 circus-0.616
Token"
Token ID1
Feature activation+1.40e+00

Pos logit contributions
ively+1.028
antly+1.013
 fingertips+1.006
 efforts+1.002
ered+0.978
Neg logit contributions
rob-0.905
 says-0.847
 ours-0.785
 circus-0.779
ww-0.760
Token and
Token ID290
Feature activation+1.86e+00

Pos logit contributions
rob+0.066
 ours+0.058
 circus+0.057
 says+0.057
 scar+0.056
Neg logit contributions
ively-0.078
antly-0.077
ant-0.076
 efforts-0.071
ered-0.071
Token the
Token ID262
Feature activation-1.94e+00

Pos logit contributions
rob+0.064
 ours+0.052
 says+0.051
 circus+0.050
P+0.049
Neg logit contributions
ively-0.126
antly-0.126
ant-0.125
 efforts-0.122
 fingertips-0.121
Token number
Token ID1271
Feature activation-1.43e+00

Pos logit contributions
ively+0.102
antly+0.101
ant+0.100
 efforts+0.098
 fingertips+0.097
Neg logit contributions
rob-0.096
 says-0.087
 ours-0.083
 circus-0.082
 scar-0.080
Token "
Token ID366
Feature activation+3.73e-01

Pos logit contributions
ively+0.087
antly+0.086
ant+0.085
 efforts+0.083
 fingertips+0.083
Neg logit contributions
rob-0.059
 says-0.052
 ours-0.049
 circus-0.049
 scar-0.048
Token was
Token ID373
Feature activation-3.92e+00

Pos logit contributions
 says+0.181
rob+0.172
 sees+0.158
 wants+0.156
 sleeping+0.151
Neg logit contributions
 efforts-0.275
ant-0.267
ively-0.264
ainer-0.263
 fingertips-0.261
Token a
Token ID257
Feature activation-1.36e+00

Pos logit contributions
ively+0.191
antly+0.188
ant+0.185
 efforts+0.181
 fingertips+0.180
Neg logit contributions
rob-0.209
 says-0.192
 ours-0.182
 circus-0.182
 scar-0.178
Token girl
Token ID2576
Feature activation+1.41e-01

Pos logit contributions
antly+0.050
 efforts+0.050
 fingertips+0.050
ively+0.048
ant+0.048
Neg logit contributions
rob-0.087
 ours-0.076
 says-0.075
 yells-0.072
use-0.068
Token named
Token ID3706
Feature activation+4.50e-01

Pos logit contributions
rob+0.006
 says+0.006
 circus+0.005
 ours+0.005
 hungry+0.005
Neg logit contributions
ively-0.008
antly-0.008
ant-0.008
 efforts-0.008
 fingertips-0.008
Token Ch
Token ID609
Feature activation-6.95e-01

Pos logit contributions
rob+0.014
 says+0.013
 ours+0.013
 hungry+0.012
 circus+0.012
Neg logit contributions
antly-0.031
ively-0.031
ant-0.030
 efforts-0.030
 fingertips-0.030
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
antly+0.034
 fingertips+0.032
 efforts+0.032
ively+0.032
ered+0.031
Neg logit contributions
rob-0.034
 says-0.032
P-0.031
 sleeping-0.030
 scar-0.030
Tokenelle
Token ID13485
Feature activation-2.27e+00

Pos logit contributions
 fingertips+1.285
ively+1.265
 efforts+1.251
ered+1.247
ainer+1.230
Neg logit contributions
rob-0.722
 says-0.658
 wants-0.579
 scar-0.574
 scary-0.568
Token.
Token ID13
Feature activation-8.21e-01

Pos logit contributions
ively+0.162
ainer+0.161
 efforts+0.157
antly+0.155
ant+0.155
Neg logit contributions
 says-0.069
rob-0.062
 hungry-0.062
 sleeping-0.060
 wants-0.059
Token She
Token ID1375
Feature activation+2.32e+00

Pos logit contributions
ively+0.053
antly+0.051
ant+0.051
 efforts+0.051
 fingertips+0.050
Neg logit contributions
rob-0.031
 says-0.029
 scar-0.026
 circus-0.025
 hungry-0.025
Token was
Token ID373
Feature activation-3.93e+00

Pos logit contributions
rob+0.131
 says+0.119
 ours+0.117
 circus+0.114
 scar+0.113
Neg logit contributions
ively-0.106
antly-0.105
ant-0.103
 efforts-0.100
 fingertips-0.100
Token three
Token ID1115
Feature activation-8.57e+00

Pos logit contributions
ively+0.222
antly+0.218
ant+0.216
 efforts+0.212
 fingertips+0.212
Neg logit contributions
rob-0.181
 says-0.162
 ours-0.154
 circus-0.153
 scar-0.148
Token colourful
Token ID44799
Feature activation+2.52e+00

Pos logit contributions
rob+0.078
 circus+0.066
 says+0.063
 ours+0.063
P+0.061
Neg logit contributions
antly-0.111
 fingertips-0.109
ered-0.106
ively-0.105
 efforts-0.104
Token and
Token ID290
Feature activation+1.84e+00

Pos logit contributions
rob+0.109
 says+0.099
 circus+0.095
 hungry+0.093
 scar+0.093
Neg logit contributions
ively-0.148
ant-0.145
antly-0.145
 efforts-0.140
 fingertips-0.139
Token it
Token ID340
Feature activation-1.90e+00

Pos logit contributions
rob+0.081
 says+0.071
 circus+0.069
 ours+0.069
 hungry+0.065
Neg logit contributions
antly-0.106
ively-0.106
ant-0.104
 fingertips-0.103
 efforts-0.102
Token made
Token ID925
Feature activation+2.86e+00

Pos logit contributions
ively+0.101
antly+0.100
ant+0.098
 efforts+0.096
 fingertips+0.096
Neg logit contributions
rob-0.093
 says-0.084
 ours-0.081
 circus-0.080
 scar-0.078
Token Ch
Token ID609
Feature activation-6.89e-01

Pos logit contributions
 scar+0.082
 says+0.080
rob+0.077
 scary+0.072
 sleeping+0.067
Neg logit contributions
 efforts-0.196
 fingertips-0.196
ant-0.192
ively-0.192
antly-0.192
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
antly+0.032
 fingertips+0.028
ively+0.028
 efforts+0.028
ered+0.028
Neg logit contributions
rob-0.038
 says-0.033
P-0.032
ww-0.032
 blows-0.032
Tokenelle
Token ID13485
Feature activation-2.26e+00

Pos logit contributions
ively+1.199
 fingertips+1.146
ainer+1.143
antly+1.143
ered+1.136
Neg logit contributions
rob-0.747
 says-0.724
 scar-0.687
 scary-0.652
 hungry-0.630
Token feel
Token ID1254
Feature activation-8.09e+00

Pos logit contributions
ively+0.143
ant+0.136
ainer+0.133
antly+0.132
growth+0.130
Neg logit contributions
 says-0.093
 hungry-0.085
 wants-0.083
 sleeping-0.079
 sees-0.079
Token special
Token ID2041
Feature activation+1.45e+00

Pos logit contributions
ant+0.578
ively+0.577
antly+0.563
 efforts+0.550
 fingertips+0.543
Neg logit contributions
 hungry-0.237
 says-0.230
rob-0.225
 scary-0.219
 ours-0.212
Token.
Token ID13
Feature activation-8.11e-01

Pos logit contributions
rob+0.055
 says+0.054
 hungry+0.052
 ours+0.051
 sleeping+0.051
Neg logit contributions
ively-0.089
ant-0.087
antly-0.086
 fingertips-0.083
ered-0.083
Token
Token ID198
Feature activation+1.47e+00

Pos logit contributions
ively+0.047
 efforts+0.045
ant+0.045
antly+0.045
 fingertips+0.044
Neg logit contributions
rob-0.036
 says-0.035
 scar-0.031
 circus-0.030
 hungry-0.030
Token".
Token ID1911
Feature activation-6.68e-01

Pos logit contributions
 says+0.278
rob+0.271
 ours+0.264
 hungry+0.256
 circus+0.247
Neg logit contributions
ively-0.545
antly-0.538
 fingertips-0.527
ant-0.522
 efforts-0.520
Token
Token ID220
Feature activation+4.04e-02

Pos logit contributions
ively+0.040
ant+0.039
 efforts+0.039
antly+0.038
ered+0.038
Neg logit contributions
rob-0.028
 says-0.027
 scar-0.024
 circus-0.023
 hungry-0.023
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
rob+0.002
 says+0.002
 ours+0.001
 circus+0.001
 hungry+0.001
Neg logit contributions
ively-0.002
 efforts-0.002
antly-0.002
ant-0.002
 fingertips-0.002
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
rob+0.064
 says+0.058
 circus+0.053
 scar+0.052
P+0.051
Neg logit contributions
ively-0.089
 efforts-0.087
ant-0.087
antly-0.087
 fingertips-0.086
TokenCh
Token ID1925
Feature activation-4.20e+00

Pos logit contributions
rob+0.081
 says+0.074
 circus+0.071
 scar+0.070
 ours+0.069
Neg logit contributions
ively-0.072
antly-0.070
 efforts-0.069
ant-0.069
 fingertips-0.069
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
 fingertips+0.152
antly+0.144
 efforts+0.135
ively+0.131
 Days+0.131
Neg logit contributions
rob-0.283
 says-0.261
ww-0.241
 blows-0.239
P-0.234
Tokenelle
Token ID13485
Feature activation-2.25e+00

Pos logit contributions
ively+1.117
 fingertips+1.115
ainer+1.080
 efforts+1.073
antly+1.066
Neg logit contributions
rob-0.867
 says-0.811
 scar-0.736
 wants-0.697
ww-0.695
Token was
Token ID373
Feature activation-3.92e+00

Pos logit contributions
ively+0.139
ainer+0.134
ant+0.131
 fingertips+0.129
antly+0.126
Neg logit contributions
 says-0.099
rob-0.084
 wants-0.082
 sees-0.081
 scar-0.078
Token very
Token ID845
Feature activation+8.01e-01

Pos logit contributions
ant+0.272
ively+0.269
antly+0.263
growth+0.258
 fingertips+0.254
Neg logit contributions
 hungry-0.136
 waiting-0.126
 says-0.122
 sleeping-0.118
 fun-0.112
Token happy
Token ID3772
Feature activation+2.22e+00

Pos logit contributions
rob+0.027
 says+0.027
 hungry+0.025
 ours+0.024
 circus+0.024
Neg logit contributions
ively-0.053
antly-0.052
ant-0.052
 fingertips-0.050
 efforts-0.050
Token.
Token ID13
Feature activation-8.07e-01

Pos logit contributions
rob+0.091
 says+0.084
 circus+0.079
 hungry+0.077
 ours+0.077
Neg logit contributions
ively-0.135
antly-0.133
ant-0.132
 fingertips-0.128
ered-0.127
Token free
Token ID1479
Feature activation-2.01e+00

Pos logit contributions
rob+0.234
 says+0.211
 ours+0.209
 circus+0.208
 hungry+0.208
Neg logit contributions
antly-0.318
ively-0.314
ant-0.307
 fingertips-0.298
ered-0.294
Token so
Token ID523
Feature activation-1.68e-01

Pos logit contributions
ively+0.118
antly+0.118
ant+0.117
 fingertips+0.115
ered+0.114
Neg logit contributions
rob-0.081
 says-0.075
 ours-0.071
 hungry-0.070
 circus-0.070
Token he
Token ID339
Feature activation+3.52e+00

Pos logit contributions
ively+0.012
ant+0.012
antly+0.012
ainer+0.012
ered+0.012
Neg logit contributions
 hungry-0.004
 scary-0.004
 fun-0.004
 says-0.004
 ours-0.004
Token could
Token ID714
Feature activation-5.56e+00

Pos logit contributions
 says+0.164
rob+0.159
 scar+0.144
 hungry+0.144
 wants+0.142
Neg logit contributions
ively-0.191
ant-0.185
 fingertips-0.182
ainer-0.179
antly-0.178
Token repl
Token ID2186
Feature activation+4.08e-01

Pos logit contributions
ively+0.322
antly+0.311
ant+0.306
 efforts+0.292
 fingertips+0.291
Neg logit contributions
 says-0.231
 fun-0.200
 hungry-0.198
 w-0.195
 ch-0.192
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
rob+0.030
P+0.030
 ours+0.028
 scar+0.028
ls+0.028
Neg logit contributions
 fingertips-0.009
 surely-0.008
 End-0.008
 efforts-0.008
 summit-0.008
Token his
Token ID465
Feature activation+3.99e+00

Pos logit contributions
ively+1.036
antly+1.024
 fingertips+1.020
ant+1.010
 efforts+1.001
Neg logit contributions
rob-0.912
 says-0.829
 ours-0.779
 circus-0.773
 hungry-0.760
Token field
Token ID2214
Feature activation+5.71e+00

Pos logit contributions
rob+0.175
 says+0.167
 circus+0.160
 ours+0.160
 hungry+0.158
Neg logit contributions
ively-0.228
antly-0.220
ant-0.219
ered-0.215
 fingertips-0.207
Token.
Token ID13
Feature activation-7.99e-01

Pos logit contributions
rob+0.234
 says+0.220
 ours+0.210
 hungry+0.205
 circus+0.204
Neg logit contributions
ively-0.337
ant-0.333
antly-0.331
 fingertips-0.322
 efforts-0.316
Token Billy
Token ID15890
Feature activation-2.85e+00

Pos logit contributions
ively+0.042
ant+0.042
ered+0.040
 efforts+0.040
antly+0.040
Neg logit contributions
rob-0.038
 says-0.038
 ours-0.035
 hungry-0.034
 circus-0.034
Token was
Token ID373
Feature activation-3.93e+00

Pos logit contributions
ively+0.159
 fingertips+0.153
ant+0.150
ainer+0.148
 efforts+0.146
Neg logit contributions
rob-0.133
 says-0.132
 wants-0.116
 hungry-0.116
 scar-0.116
Token bite
Token ID13197
Feature activation+3.49e+00

Pos logit contributions
antly+0.071
 fingertips+0.069
ively+0.067
 efforts+0.067
ainer+0.064
Neg logit contributions
rob-0.073
 circus-0.058
 ours-0.058
 says-0.056
 wants-0.054
Token of
Token ID286
Feature activation-7.10e+00

Pos logit contributions
rob+0.175
 says+0.171
 ours+0.155
 hungry+0.155
 scar+0.155
Neg logit contributions
ively-0.173
ant-0.170
antly-0.169
 efforts-0.168
 fingertips-0.166
Token the
Token ID262
Feature activation-1.95e+00

Pos logit contributions
antly+0.396
ively+0.395
ant+0.385
 efforts+0.378
 fingertips+0.373
Neg logit contributions
rob-0.327
 says-0.304
 ours-0.291
 circus-0.284
 scar-0.279
Token cro
Token ID6763
Feature activation+1.19e+01

Pos logit contributions
ant+0.111
ively+0.111
antly+0.111
 efforts+0.110
 fingertips+0.110
Neg logit contributions
rob-0.086
 scar-0.070
 says-0.068
 ours-0.066
 circus-0.066
Tokeniss
Token ID747
Feature activation+1.88e+00

Pos logit contributions
rob+0.477
 says+0.424
 ours+0.415
 circus+0.411
 scar+0.396
Neg logit contributions
ively-0.735
antly-0.728
ant-0.720
ered-0.697
 efforts-0.694
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
rob+0.107
 says+0.104
 ours+0.096
ww+0.095
P+0.094
Neg logit contributions
 fingertips-0.075
ively-0.075
antly-0.075
 Days-0.075
 efforts-0.075
Token.
Token ID13
Feature activation-8.17e-01

Pos logit contributions
ively+1.059
antly+1.043
ant+1.031
 efforts+1.014
 fingertips+1.012
Neg logit contributions
rob-0.894
 says-0.815
 ours-0.766
 circus-0.766
 scar-0.746
Token It
Token ID632
Feature activation-5.25e+00

Pos logit contributions
antly+0.038
ively+0.038
ant+0.037
 fingertips+0.037
 efforts+0.036
Neg logit contributions
rob-0.046
 ours-0.039
 says-0.039
 circus-0.039
 scar-0.039
Token was
Token ID373
Feature activation-3.93e+00

Pos logit contributions
antly+0.240
ively+0.233
ant+0.225
ered+0.223
 fingertips+0.222
Neg logit contributions
rob-0.296
 ours-0.271
 circus-0.263
 says-0.262
 scar-0.253
Token a
Token ID257
Feature activation-1.37e+00

Pos logit contributions
antly+0.238
ively+0.237
 fingertips+0.234
 efforts+0.233
ered+0.228
Neg logit contributions
rob-0.174
 circus-0.135
 says-0.134
ww-0.127
P-0.125
Token little
Token ID1310
Feature activation-6.11e+00

Pos logit contributions
antly+0.077
 fingertips+0.076
 efforts+0.076
 bottom+0.073
ainer+0.071
Neg logit contributions
rob-0.065
 says-0.052
 ours-0.052
 yells-0.048
 wants-0.047
Token special
Token ID2041
Feature activation+1.45e+00

Pos logit contributions
ant+0.578
ively+0.577
antly+0.563
 efforts+0.550
 fingertips+0.543
Neg logit contributions
 hungry-0.237
 says-0.230
rob-0.225
 scary-0.219
 ours-0.212
Token.
Token ID13
Feature activation-8.11e-01

Pos logit contributions
rob+0.055
 says+0.054
 hungry+0.052
 ours+0.051
 sleeping+0.051
Neg logit contributions
ively-0.089
ant-0.087
antly-0.086
 fingertips-0.083
ered-0.083
Token
Token ID198
Feature activation+1.47e+00

Pos logit contributions
ively+0.047
 efforts+0.045
ant+0.045
antly+0.045
 fingertips+0.044
Neg logit contributions
rob-0.036
 says-0.035
 scar-0.031
 circus-0.030
 hungry-0.030
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
rob+0.064
 says+0.058
 circus+0.053
 scar+0.052
 ours+0.052
Neg logit contributions
ively-0.086
ant-0.084
antly-0.084
 efforts-0.083
 fingertips-0.083
TokenCh
Token ID1925
Feature activation-4.21e+00

Pos logit contributions
rob+0.080
 says+0.074
 circus+0.070
 ours+0.070
 scar+0.069
Neg logit contributions
ively-0.072
antly-0.070
ant-0.069
 efforts-0.068
 fingertips-0.068
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
 fingertips+0.196
antly+0.191
 efforts+0.180
mates+0.176
 nobody+0.176
Neg logit contributions
rob-0.238
 says-0.207
ww-0.193
 blows-0.193
P-0.192
Tokenelle
Token ID13485
Feature activation-2.26e+00

Pos logit contributions
 fingertips+1.147
ively+1.144
 efforts+1.114
antly+1.111
ainer+1.104
Neg logit contributions
rob-0.844
 says-0.767
 scar-0.699
 wants-0.674
 circus-0.668
Token went
Token ID1816
Feature activation+3.26e+00

Pos logit contributions
ively+0.143
ainer+0.139
ant+0.136
 fingertips+0.136
antly+0.135
Neg logit contributions
 says-0.090
rob-0.079
 wants-0.078
 sees-0.076
 hungry-0.075
Token outside
Token ID2354
Feature activation+3.48e-01

Pos logit contributions
rob+0.190
ww+0.167
 says+0.165
 ours+0.164
 wants+0.156
Neg logit contributions
ively-0.149
antly-0.148
ered-0.145
 efforts-0.136
ant-0.134
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
rob+0.013
 says+0.012
 circus+0.011
 ours+0.011
 hungry+0.011
Neg logit contributions
ively-0.022
antly-0.022
ant-0.022
 fingertips-0.021
 efforts-0.021
Token play
Token ID711
Feature activation+8.91e-01

Pos logit contributions
rob+0.067
 says+0.061
 circus+0.058
 ours+0.057
 scar+0.056
Neg logit contributions
ively-0.086
antly-0.086
ant-0.084
 efforts-0.083
 fingertips-0.082
Token were
Token ID547
Feature activation-4.90e+00

Pos logit contributions
ainer+0.154
antly+0.151
ively+0.148
 fingertips+0.147
ant+0.145
Neg logit contributions
 says-0.134
 wants-0.118
rob-0.117
 sees-0.115
 hungry-0.112
Token ignorant
Token ID20109
Feature activation-3.66e-01

Pos logit contributions
ively+0.271
ered+0.259
ant+0.250
 efforts+0.244
bled+0.240
Neg logit contributions
rob-0.248
ww-0.211
use-0.202
 scar-0.193
P-0.191
Token.
Token ID13
Feature activation-8.03e-01

Pos logit contributions
ively+0.019
antly+0.019
ant+0.019
 efforts+0.018
 fingertips+0.018
Neg logit contributions
rob-0.018
 says-0.016
 ours-0.016
 circus-0.016
 scar-0.015
Token Ign
Token ID16583
Feature activation-3.70e+00

Pos logit contributions
ively+0.050
ant+0.050
antly+0.049
ered+0.049
 efforts+0.048
Neg logit contributions
 says-0.030
rob-0.030
 ours-0.027
 circus-0.027
 hungry-0.027
Tokenor
Token ID273
Feature activation-1.01e+01

Pos logit contributions
ively+0.118
ant+0.117
antly+0.113
ered+0.109
 efforts+0.104
Neg logit contributions
rob-0.257
 says-0.240
 circus-0.240
 ours-0.237
 scar-0.232
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
 fingertips+0.470
 efforts+0.456
ainer+0.446
 Days+0.431
ively+0.428
Neg logit contributions
rob-0.562
 says-0.549
 wants-0.493
 scar-0.481
 blows-0.481
Token meant
Token ID4001
Feature activation+1.02e+00

Pos logit contributions
ively+1.133
antly+1.122
ant+1.105
 fingertips+1.096
 efforts+1.091
Neg logit contributions
rob-0.819
 says-0.747
 ours-0.692
 circus-0.689
 scar-0.672
Token they
Token ID484
Feature activation+2.52e+00

Pos logit contributions
 says+0.044
rob+0.043
 ours+0.042
 hungry+0.041
 circus+0.040
Neg logit contributions
antly-0.057
ively-0.056
ant-0.055
ainer-0.054
 fingertips-0.054
Token did
Token ID750
Feature activation+1.70e+00

Pos logit contributions
 says+0.094
rob+0.090
 wants+0.082
 hungry+0.081
 ours+0.081
Neg logit contributions
antly-0.162
ively-0.161
ainer-0.158
 fingertips-0.156
ant-0.155
Token not
Token ID407
Feature activation-3.15e+00

Pos logit contributions
 says+0.072
 ours+0.071
rob+0.069
 hungry+0.068
 scary+0.068
Neg logit contributions
ively-0.095
antly-0.095
 fingertips-0.094
ant-0.092
ainer-0.090
Token know
Token ID760
Feature activation-5.78e+00

Pos logit contributions
ively+0.167
ant+0.163
ered+0.163
 efforts+0.161
antly+0.158
Neg logit contributions
rob-0.165
 says-0.133
 ours-0.131
 circus-0.131
ww-0.127
TokenThat
Token ID2504
Feature activation-1.66e+00

Pos logit contributions
rob+0.020
 says+0.018
 ours+0.018
 circus+0.017
 hungry+0.017
Neg logit contributions
ively-0.023
antly-0.023
ant-0.022
 fingertips-0.022
 efforts-0.021
Token's
Token ID338
Feature activation+1.65e+00

Pos logit contributions
ively+0.093
antly+0.093
ant+0.091
 efforts+0.089
 fingertips+0.089
Neg logit contributions
rob-0.077
 says-0.068
 ours-0.066
 circus-0.065
 scar-0.063
Token so
Token ID523
Feature activation-1.44e-01

Pos logit contributions
 ours+0.059
 says+0.059
rob+0.058
 fun+0.057
 scary+0.055
Neg logit contributions
ively-0.105
 fingertips-0.104
antly-0.102
ant-0.102
 efforts-0.102
Token special
Token ID2041
Feature activation+1.50e+00

Pos logit contributions
ively+0.009
ered+0.009
ant+0.009
antly+0.009
 efforts+0.009
Neg logit contributions
rob-0.006
 says-0.004
 scar-0.004
 circus-0.004
 ours-0.004
Token Ch
Token ID609
Feature activation-6.59e-01

Pos logit contributions
rob+0.071
 says+0.065
 ours+0.061
 circus+0.061
 scar+0.060
Neg logit contributions
ively-0.082
antly-0.081
ant-0.080
 efforts-0.078
 fingertips-0.078
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
antly+0.028
 fingertips+0.027
ively+0.026
ered+0.025
 Days+0.025
Neg logit contributions
rob-0.037
 says-0.034
ww-0.032
 blows-0.032
etchup-0.032
Tokenelle
Token ID13485
Feature activation-2.24e+00

Pos logit contributions
ively+1.124
antly+1.108
ant+1.094
 fingertips+1.076
 efforts+1.076
Neg logit contributions
rob-0.828
 says-0.746
 ours-0.700
 circus-0.699
 scar-0.679
Token.
Token ID13
Feature activation-7.89e-01

Pos logit contributions
ively+0.132
antly+0.128
ant+0.126
 fingertips+0.124
ainer+0.124
Neg logit contributions
rob-0.094
 says-0.091
 ours-0.083
 circus-0.082
 hungry-0.081
Token I
Token ID314
Feature activation+4.57e+00

Pos logit contributions
ively+0.046
 efforts+0.045
ered+0.044
antly+0.044
ant+0.044
Neg logit contributions
rob-0.033
 says-0.032
 circus-0.029
 ours-0.028
 wants-0.028
Token'm
Token ID1101
Feature activation-4.74e+00

Pos logit contributions
rob+0.209
 says+0.208
 wants+0.188
 circus+0.185
 sees+0.184
Neg logit contributions
ively-0.252
ant-0.243
 fingertips-0.243
antly-0.241
 efforts-0.235
Token so
Token ID523
Feature activation-1.49e-01

Pos logit contributions
ively+0.256
antly+0.248
ant+0.246
ered+0.243
 efforts+0.243
Neg logit contributions
rob-0.235
 says-0.205
 circus-0.196
 scar-0.194
 ours-0.193
Token says
Token ID1139
Feature activation+1.36e+01

Pos logit contributions
rob+0.025
 ours+0.021
ww+0.020
 circus+0.019
 says+0.019
Neg logit contributions
antly-0.050
ively-0.050
ant-0.049
 efforts-0.049
ered-0.048
Token,
Token ID11
Feature activation-1.59e+00

Pos logit contributions
 says+0.507
 hungry+0.475
rob+0.469
 circus+0.452
 mean+0.442
Neg logit contributions
ively-0.830
 fingertips-0.820
antly-0.791
growth-0.784
ant-0.783
Token "
Token ID366
Feature activation+3.80e-01

Pos logit contributions
ively+0.082
antly+0.081
ant+0.080
 efforts+0.078
 fingertips+0.078
Neg logit contributions
rob-0.080
 says-0.073
 ours-0.069
 circus-0.069
 scar-0.068
TokenIgn
Token ID32916
Feature activation+6.16e-01

Pos logit contributions
rob+0.017
 says+0.016
 ours+0.015
 scar+0.015
 hungry+0.015
Neg logit contributions
ively-0.022
antly-0.022
ant-0.021
 fingertips-0.020
 efforts-0.020
Tokenor
Token ID273
Feature activation-1.01e+01

Pos logit contributions
rob+0.040
 says+0.037
 ours+0.035
 circus+0.035
 scar+0.035
Neg logit contributions
ively-0.023
antly-0.023
 efforts-0.022
 fingertips-0.022
ant-0.022
Tokenant
Token ID415
Feature activation-1.92e+01

Pos logit contributions
ively+0.402
 fingertips+0.400
 efforts+0.389
antly+0.383
ainer+0.372
Neg logit contributions
rob-0.605
 says-0.596
 wants-0.548
 circus-0.542
ww-0.540
Token,
Token ID11
Feature activation-1.58e+00

Pos logit contributions
ively+1.074
antly+1.053
 fingertips+1.050
 efforts+1.040
ant+1.016
Neg logit contributions
rob-0.864
 says-0.812
 circus-0.742
 ours-0.741
ww-0.722
Token zero
Token ID6632
Feature activation-6.27e+00

Pos logit contributions
ively+0.092
antly+0.091
ant+0.089
 fingertips+0.088
 efforts+0.088
Neg logit contributions
rob-0.069
 says-0.064
 circus-0.060
 ours-0.060
 scar-0.058
Token".
Token ID1911
Feature activation-6.67e-01

Pos logit contributions
ively+0.388
antly+0.383
ant+0.379
 efforts+0.372
 fingertips+0.371
Neg logit contributions
rob-0.252
 says-0.223
 ours-0.210
 circus-0.209
 scar-0.203
Token The
Token ID383
Feature activation-6.71e+00

Pos logit contributions
ant+0.045
ively+0.043
 fingertips+0.042
ered+0.042
antly+0.042
Neg logit contributions
 says-0.025
rob-0.021
 hungry-0.021
 sees-0.020
 wants-0.019
Token teacher
Token ID4701
Feature activation-6.66e+00

Pos logit contributions
antly+0.335
ively+0.334
ant+0.323
 fingertips+0.317
ered+0.317
Neg logit contributions
rob-0.340
 says-0.336
 circus-0.317
 hungry-0.308
 ours-0.303
Token and
Token ID290
Feature activation+1.86e+00

Pos logit contributions
ively+0.146
antly+0.144
ant+0.143
 efforts+0.140
 fingertips+0.140
Neg logit contributions
rob-0.100
 says-0.089
 ours-0.084
 circus-0.083
 scar-0.081
Token said
Token ID531
Feature activation+8.80e+00

Pos logit contributions
rob+0.064
 circus+0.050
 ours+0.049
 says+0.049
ww+0.046
Neg logit contributions
antly-0.128
ively-0.128
ant-0.125
 efforts-0.124
ered-0.124
Token,
Token ID11
Feature activation-1.59e+00

Pos logit contributions
rob+0.355
 says+0.350
 hungry+0.328
 ours+0.309
 circus+0.308
Neg logit contributions
ively-0.521
antly-0.502
 fingertips-0.498
 efforts-0.492
ant-0.487
Token "
Token ID366
Feature activation+3.71e-01

Pos logit contributions
ively+0.076
antly+0.076
ant+0.075
 fingertips+0.073
 efforts+0.073
Neg logit contributions
rob-0.087
 says-0.077
 ours-0.075
 circus-0.074
 scar-0.073
TokenPl
Token ID3646
Feature activation-1.27e+00

Pos logit contributions
rob+0.018
 says+0.017
 ours+0.016
 circus+0.016
 hungry+0.016
Neg logit contributions
antly-0.020
ively-0.020
ant-0.019
 fingertips-0.018
 efforts-0.018
Tokenant
Token ID415
Feature activation-1.91e+01

Pos logit contributions
 efforts+0.055
 fingertips+0.053
antly+0.049
 unison+0.049
mates+0.048
Neg logit contributions
rob-0.068
 ours-0.066
 says-0.066
ww-0.063
iw-0.062
Token this
Token ID428
Feature activation+2.20e+00

Pos logit contributions
ively+1.147
antly+1.129
 fingertips+1.114
ant+1.107
 efforts+1.105
Neg logit contributions
rob-0.799
 says-0.732
 ours-0.686
 circus-0.674
 hungry-0.651
Token and
Token ID290
Feature activation+1.86e+00

Pos logit contributions
rob+0.087
 says+0.079
 ours+0.074
 circus+0.073
 scar+0.070
Neg logit contributions
ively-0.137
antly-0.135
ant-0.134
 fingertips-0.132
 efforts-0.132
Token watch
Token ID2342
Feature activation-1.61e+00

Pos logit contributions
rob+0.084
 says+0.073
 circus+0.070
 ours+0.070
 scar+0.068
Neg logit contributions
ively-0.107
antly-0.106
ant-0.105
 efforts-0.103
 fingertips-0.102
Token it
Token ID340
Feature activation-1.90e+00

Pos logit contributions
ively+0.090
antly+0.088
ant+0.087
 efforts+0.087
 fingertips+0.086
Neg logit contributions
rob-0.072
 says-0.068
 ours-0.064
 circus-0.063
 hungry-0.062
Token grow
Token ID1663
Feature activation+4.42e+00

Pos logit contributions
antly+0.106
ively+0.106
ant+0.103
ered+0.103
 efforts+0.100
Neg logit contributions
rob-0.093
 says-0.076
 circus-0.076
 ours-0.075
 scar-0.073
Token They
Token ID1119
Feature activation+3.96e-01

Pos logit contributions
ively+0.043
antly+0.043
ant+0.042
 fingertips+0.041
 efforts+0.041
Neg logit contributions
rob-0.040
 says-0.035
 ours-0.034
 circus-0.033
 scar-0.033
Token ate
Token ID15063
Feature activation+4.72e+00

Pos logit contributions
rob+0.017
 says+0.016
 hungry+0.015
 circus+0.015
 scar+0.014
Neg logit contributions
ively-0.024
antly-0.023
ant-0.023
 efforts-0.023
 fingertips-0.022
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
rob+0.227
 says+0.206
 ours+0.196
 circus+0.195
 scar+0.191
Neg logit contributions
ively-0.256
antly-0.252
ant-0.249
 efforts-0.244
 fingertips-0.244
Token cro
Token ID6763
Feature activation+1.19e+01

Pos logit contributions
antly+0.122
ant+0.120
ively+0.120
 efforts+0.119
 fingertips+0.118
Neg logit contributions
rob-0.072
 says-0.058
 ours-0.057
 scar-0.057
 circus-0.056
Tokeniss
Token ID747
Feature activation+1.87e+00

Pos logit contributions
rob+0.465
 says+0.414
 ours+0.394
 circus+0.392
 scar+0.377
Neg logit contributions
ively-0.748
antly-0.740
ant-0.732
 efforts-0.714
 fingertips-0.710
Tokenant
Token ID415
Feature activation-1.91e+01

Pos logit contributions
rob+0.100
 says+0.096
 ours+0.090
ww+0.088
ls+0.087
Neg logit contributions
ively-0.082
 fingertips-0.081
antly-0.081
 Days-0.081
 efforts-0.081
Token.
Token ID13
Feature activation-8.09e-01

Pos logit contributions
ively+1.075
antly+1.061
ant+1.048
 efforts+1.028
 fingertips+1.026
Neg logit contributions
rob-0.880
 says-0.792
 ours-0.752
 circus-0.750
 scar-0.730
Token They
Token ID1119
Feature activation+4.12e-01

Pos logit contributions
ively+0.048
ant+0.047
antly+0.047
 efforts+0.046
ered+0.046
Neg logit contributions
rob-0.034
 says-0.032
 ours-0.030
 circus-0.030
 hungry-0.029
Token were
Token ID547
Feature activation-4.91e+00

Pos logit contributions
rob+0.018
 says+0.017
 hungry+0.015
 scar+0.015
 circus+0.015
Neg logit contributions
ively-0.025
antly-0.024
ant-0.024
 efforts-0.023
 fingertips-0.023
Token happy
Token ID3772
Feature activation+2.22e+00

Pos logit contributions
ively+0.280
antly+0.278
 efforts+0.275
 fingertips+0.274
ant+0.273
Neg logit contributions
rob-0.227
 says-0.183
 circus-0.180
ww-0.174
P-0.171
Token.
Token ID13
Feature activation-8.02e-01

Pos logit contributions
rob+0.126
 ours+0.106
ww+0.104
P+0.103
 says+0.103
Neg logit contributions
ively-0.106
 efforts-0.106
antly-0.104
 fingertips-0.102
ant-0.099
Token.
Token ID13
Feature activation-8.02e-01

Pos logit contributions
rob+0.033
 says+0.031
 hungry+0.029
 ours+0.029
 sleeping+0.029
Neg logit contributions
ively-0.047
antly-0.047
ant-0.046
 fingertips-0.045
 efforts-0.045
Token
Token ID220
Feature activation+4.41e-02

Pos logit contributions
ively+0.051
ant+0.049
 efforts+0.049
ered+0.048
 fingertips+0.047
Neg logit contributions
 says-0.031
rob-0.029
 scar-0.027
 hungry-0.025
 wants-0.025
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
 says+0.002
rob+0.002
 ours+0.001
 hungry+0.001
 sees+0.001
Neg logit contributions
 efforts-0.003
ively-0.003
ant-0.003
 fingertips-0.003
antly-0.003
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
rob+0.055
 says+0.052
P+0.045
 scar+0.044
 hungry+0.044
Neg logit contributions
ively-0.095
 efforts-0.095
ant-0.093
 fingertips-0.093
antly-0.091
TokenCh
Token ID1925
Feature activation-4.20e+00

Pos logit contributions
rob+0.071
 says+0.067
P+0.065
 scar+0.064
ww+0.064
Neg logit contributions
ively-0.079
 efforts-0.077
 fingertips-0.077
ant-0.074
antly-0.073
Tokenant
Token ID415
Feature activation-1.91e+01

Pos logit contributions
 fingertips+0.191
antly+0.187
mates+0.176
 nobody+0.176
 efforts+0.175
Neg logit contributions
rob-0.237
 says-0.207
 blows-0.195
ww-0.193
etchup-0.192
Tokenelle
Token ID13485
Feature activation-2.26e+00

Pos logit contributions
 fingertips+1.162
ively+1.152
 efforts+1.122
ainer+1.118
antly+1.117
Neg logit contributions
rob-0.827
 says-0.761
 scar-0.691
 wants-0.663
 circus-0.650
Token stopped
Token ID5025
Feature activation-3.67e+00

Pos logit contributions
ively+0.143
ainer+0.137
ant+0.137
 fingertips+0.136
antly+0.135
Neg logit contributions
 says-0.090
rob-0.079
 wants-0.077
 hungry-0.077
 sees-0.076
Token in
Token ID287
Feature activation-9.33e-01

Pos logit contributions
ively+0.197
antly+0.191
ant+0.190
 fingertips+0.190
 efforts+0.190
Neg logit contributions
rob-0.193
 ours-0.168
ww-0.163
 circus-0.155
 says-0.153
Token the
Token ID262
Feature activation-1.92e+00

Pos logit contributions
ively+0.048
antly+0.048
ant+0.047
 efforts+0.046
 fingertips+0.046
Neg logit contributions
rob-0.048
 says-0.043
 ours-0.041
 circus-0.041
 scar-0.040
Token shade
Token ID17979
Feature activation-1.75e+00

Pos logit contributions
antly+0.120
ively+0.119
 fingertips+0.116
ant+0.115
 efforts+0.115
Neg logit contributions
rob-0.077
 says-0.066
 ours-0.065
 circus-0.062
 scar-0.060
Token it
Token ID340
Feature activation-1.86e+00

Pos logit contributions
ively+0.037
antly+0.037
ant+0.036
 efforts+0.036
 fingertips+0.036
Neg logit contributions
rob-0.036
 says-0.033
 ours-0.031
 circus-0.031
 scar-0.030
Token down
Token ID866
Feature activation+3.45e+00

Pos logit contributions
ively+0.098
antly+0.097
ant+0.096
ered+0.093
 efforts+0.093
Neg logit contributions
rob-0.095
 says-0.084
 circus-0.080
 ours-0.077
 scar-0.076
Token.
Token ID13
Feature activation-7.81e-01

Pos logit contributions
rob+0.145
 says+0.129
 circus+0.121
 ours+0.121
 scar+0.118
Neg logit contributions
ively-0.207
antly-0.205
ant-0.202
 efforts-0.198
 fingertips-0.198
Token Ign
Token ID16583
Feature activation-3.68e+00

Pos logit contributions
ively+0.042
 efforts+0.041
antly+0.041
ered+0.041
ant+0.040
Neg logit contributions
 says-0.035
rob-0.035
 ours-0.033
 circus-0.032
 hungry-0.031
Tokenor
Token ID273
Feature activation-1.01e+01

Pos logit contributions
ively+0.129
antly+0.126
ant+0.125
 efforts+0.120
 fingertips+0.119
Neg logit contributions
rob-0.247
 says-0.230
 ours-0.223
 circus-0.223
 scar-0.218
Tokenant
Token ID415
Feature activation-1.91e+01

Pos logit contributions
 fingertips+0.296
 efforts+0.274
ainer+0.269
ively+0.261
 Days+0.255
Neg logit contributions
rob-0.723
 says-0.708
 wants-0.669
ww-0.659
 blows-0.654
Token children
Token ID1751
Feature activation-2.19e+00

Pos logit contributions
ively+1.175
antly+1.161
 fingertips+1.141
ant+1.140
 efforts+1.133
Neg logit contributions
rob-0.770
 says-0.698
 ours-0.646
 circus-0.640
 scar-0.619
Token like
Token ID588
Feature activation+5.26e-01

Pos logit contributions
ively+0.130
antly+0.130
ant+0.126
 fingertips+0.126
 efforts+0.125
Neg logit contributions
rob-0.089
 says-0.085
 ours-0.079
 circus-0.076
 hungry-0.075
Token you
Token ID345
Feature activation+8.35e+00

Pos logit contributions
rob+0.023
 says+0.021
 ours+0.020
 circus+0.020
 scar+0.020
Neg logit contributions
ively-0.030
antly-0.030
ant-0.029
 fingertips-0.029
 efforts-0.029
Token don
Token ID836
Feature activation+8.27e-01

Pos logit contributions
rob+0.362
 says+0.333
 ours+0.314
 circus+0.310
 hungry+0.303
Neg logit contributions
ively-0.486
antly-0.481
ant-0.472
 fingertips-0.466
 efforts-0.465
Token't
Token ID470
Feature activation-4.71e+00

Pos logit contributions
rob+0.053
 says+0.049
 circus+0.048
 ours+0.048
 scar+0.048
Neg logit contributions
ively-0.031
antly-0.030
ant-0.030
ered-0.029
 efforts-0.029
Token It
Token ID632
Feature activation-5.20e+00

Pos logit contributions
ively+0.041
antly+0.040
ant+0.040
 efforts+0.039
 fingertips+0.039
Neg logit contributions
rob-0.038
 says-0.034
 ours-0.033
 circus-0.032
 scar-0.032
Token is
Token ID318
Feature activation+5.19e+00

Pos logit contributions
antly+0.298
ered+0.295
ively+0.284
ant+0.279
 surely+0.269
Neg logit contributions
rob-0.223
 circus-0.181
 ours-0.175
P-0.173
 scar-0.170
Token a
Token ID257
Feature activation-1.33e+00

Pos logit contributions
rob+0.294
 says+0.214
P+0.214
 circus+0.211
use+0.211
Neg logit contributions
 fingertips-0.258
antly-0.254
ively-0.253
ered-0.252
 efforts-0.251
Token cro
Token ID6763
Feature activation+1.19e+01

Pos logit contributions
 fingertips+0.071
 bottom+0.069
antly+0.068
 efforts+0.067
 surely+0.065
Neg logit contributions
rob-0.067
 says-0.053
 blows-0.049
 yells-0.048
bat-0.048
Tokeniss
Token ID747
Feature activation+1.90e+00

Pos logit contributions
rob+0.491
 says+0.455
 ours+0.439
 circus+0.433
 scar+0.426
Neg logit contributions
ively-0.727
ant-0.712
antly-0.707
ered-0.686
ainer-0.659
Tokenant
Token ID415
Feature activation-1.91e+01

Pos logit contributions
rob+0.133
 says+0.127
 ours+0.124
ww+0.121
P+0.119
Neg logit contributions
antly-0.055
ively-0.054
 efforts-0.053
 fingertips-0.053
 Days-0.050
Token.
Token ID13
Feature activation-7.84e-01

Pos logit contributions
ively+1.035
ant+1.024
antly+1.019
 efforts+0.969
 fingertips+0.967
Neg logit contributions
rob-0.918
 says-0.796
 circus-0.772
 ours-0.771
 scar-0.771
Token We
Token ID775
Feature activation+3.32e+00

Pos logit contributions
ively+0.042
antly+0.042
ant+0.041
 fingertips+0.040
 efforts+0.040
Neg logit contributions
rob-0.038
 says-0.034
 circus-0.033
 ours-0.033
 scar-0.032
Token can
Token ID460
Feature activation+1.00e+00

Pos logit contributions
rob+0.138
 says+0.126
 ours+0.118
 circus+0.117
 scar+0.114
Neg logit contributions
ively-0.200
antly-0.198
ant-0.195
 efforts-0.192
 fingertips-0.191
Token share
Token ID2648
Feature activation-2.83e+00

Pos logit contributions
 says+0.043
rob+0.040
 ours+0.040
 circus+0.039
 hungry+0.038
Neg logit contributions
antly-0.058
ively-0.057
 efforts-0.056
ant-0.054
 fingertips-0.053
Token it
Token ID340
Feature activation-1.90e+00

Pos logit contributions
ively+0.136
ant+0.135
antly+0.134
 fingertips+0.128
ered+0.128
Neg logit contributions
rob-0.155
 says-0.133
 circus-0.129
ww-0.128
P-0.127
Token always
Token ID1464
Feature activation-2.40e+00

Pos logit contributions
rob+0.207
 says+0.194
 ours+0.188
 hungry+0.185
 circus+0.181
Neg logit contributions
ively-0.302
ant-0.297
antly-0.297
 fingertips-0.291
 efforts-0.288
Token best
Token ID1266
Feature activation-2.12e+00

Pos logit contributions
ively+0.142
antly+0.142
 efforts+0.137
ered+0.136
ant+0.136
Neg logit contributions
rob-0.106
 says-0.090
 circus-0.086
 scar-0.084
 ours-0.083
Token.
Token ID13
Feature activation-7.85e-01

Pos logit contributions
ively+0.121
antly+0.118
ant+0.117
 fingertips+0.113
ered+0.113
Neg logit contributions
rob-0.093
 says-0.089
 ours-0.084
 circus-0.083
 hungry-0.082
Token Ign
Token ID16583
Feature activation-3.66e+00

Pos logit contributions
ively+0.041
antly+0.040
ant+0.039
 efforts+0.039
 fingertips+0.039
Neg logit contributions
rob-0.040
 says-0.036
 ours-0.034
 circus-0.034
 scar-0.034
Tokenor
Token ID273
Feature activation-1.01e+01

Pos logit contributions
antly+0.158
ively+0.158
 fingertips+0.154
 efforts+0.152
ant+0.150
Neg logit contributions
rob-0.217
 says-0.200
 ours-0.188
 circus-0.186
 scar-0.185
Tokenant
Token ID415
Feature activation-1.91e+01

Pos logit contributions
 fingertips+0.518
 efforts+0.489
ainer+0.483
ively+0.480
 Days+0.472
Neg logit contributions
rob-0.531
 says-0.505
 wants-0.456
 blows-0.449
ww-0.441
Token people
Token ID661
Feature activation-5.43e+00

Pos logit contributions
ively+1.146
antly+1.131
ant+1.118
 fingertips+1.100
 efforts+1.099
Neg logit contributions
rob-0.801
 says-0.717
 ours-0.671
 circus-0.670
 scar-0.652
Token will
Token ID481
Feature activation+2.35e+00

Pos logit contributions
ively+0.308
antly+0.304
ant+0.301
 efforts+0.295
 fingertips+0.294
Neg logit contributions
rob-0.246
 says-0.220
 ours-0.209
 circus-0.209
 scar-0.203
Token always
Token ID1464
Feature activation-2.31e+00

Pos logit contributions
rob+0.121
 says+0.115
 circus+0.108
 ours+0.108
 scar+0.107
Neg logit contributions
antly-0.116
ively-0.115
ant-0.113
 fingertips-0.111
 efforts-0.109
Token be
Token ID307
Feature activation-1.45e+00

Pos logit contributions
ively+0.105
antly+0.104
ant+0.102
 efforts+0.100
 fingertips+0.100
Neg logit contributions
rob-0.130
 says-0.119
 ours-0.115
 circus-0.114
 scar-0.112
Token present
Token ID1944
Feature activation+8.83e-03

Pos logit contributions
ively+0.084
antly+0.083
ant+0.082
 efforts+0.081
ered+0.081
Neg logit contributions
rob-0.066
 says-0.056
 circus-0.054
 ours-0.052
 scar-0.051
Token It
Token ID632
Feature activation-5.17e+00

Pos logit contributions
ively+0.031
antly+0.030
ant+0.029
 fingertips+0.029
 efforts+0.028
Neg logit contributions
rob-0.047
 says-0.041
 scar-0.041
 ours-0.041
 circus-0.041
Token was
Token ID373
Feature activation-3.84e+00

Pos logit contributions
antly+0.184
ered+0.173
ively+0.161
 fingertips+0.154
ant+0.148
Neg logit contributions
rob-0.339
 ours-0.324
 circus-0.304
ww-0.299
 scar-0.298
Token a
Token ID257
Feature activation-1.29e+00

Pos logit contributions
 fingertips+0.220
ively+0.220
antly+0.217
ered+0.215
 efforts+0.214
Neg logit contributions
rob-0.186
use-0.137
ww-0.133
P-0.131
 circus-0.130
Token cro
Token ID6763
Feature activation+1.20e+01

Pos logit contributions
 fingertips+0.069
 bottom+0.067
antly+0.066
 efforts+0.065
 surely+0.062
Neg logit contributions
rob-0.065
 says-0.050
 ours-0.046
 yells-0.046
bat-0.046
Tokeniss
Token ID747
Feature activation+1.96e+00

Pos logit contributions
rob+0.481
 says+0.440
 ours+0.427
 circus+0.420
 scar+0.408
Neg logit contributions
ively-0.744
antly-0.731
ant-0.729
ered-0.706
 efforts-0.686
Tokenant
Token ID415
Feature activation-1.91e+01

Pos logit contributions
rob+0.140
 says+0.136
 ours+0.133
ww+0.130
P+0.129
Neg logit contributions
antly-0.048
 Days-0.047
 fingertips-0.047
 efforts-0.047
ively-0.044
Token.
Token ID13
Feature activation-7.07e-01

Pos logit contributions
ively+1.035
antly+1.021
ant+1.011
 efforts+0.984
 fingertips+0.982
Neg logit contributions
rob-0.912
 says-0.816
 ours-0.782
 circus-0.779
 scar-0.762
Token It
Token ID632
Feature activation-5.14e+00

Pos logit contributions
ively+0.031
antly+0.030
ant+0.029
 fingertips+0.029
 efforts+0.028
Neg logit contributions
rob-0.042
 says-0.037
 circus-0.037
 ours-0.037
 scar-0.037
Token was
Token ID373
Feature activation-3.82e+00

Pos logit contributions
antly+0.204
ered+0.194
ively+0.189
 fingertips+0.181
ant+0.178
Neg logit contributions
rob-0.314
 ours-0.300
 circus-0.273
ww-0.271
 scar-0.267
Token a
Token ID257
Feature activation-1.27e+00

Pos logit contributions
ively+0.211
 fingertips+0.209
antly+0.207
 efforts+0.206
ered+0.205
Neg logit contributions
rob-0.192
ww-0.144
use-0.144
P-0.142
 says-0.142
Token little
Token ID1310
Feature activation-6.03e+00

Pos logit contributions
 fingertips+0.069
antly+0.067
 efforts+0.066
 bottom+0.066
 surely+0.063
Neg logit contributions
rob-0.062
 says-0.050
 ours-0.047
 yells-0.045
 blows-0.044
Token happy
Token ID3772
Feature activation+2.28e+00

Pos logit contributions
 efforts+0.166
ively+0.164
ered+0.161
ant+0.157
 fingertips+0.156
Neg logit contributions
rob-0.158
use-0.123
ww-0.121
 circus-0.121
P-0.117
Token.
Token ID13
Feature activation-7.55e-01

Pos logit contributions
rob+0.143
P+0.119
ww+0.119
 ours+0.118
 scar+0.117
Neg logit contributions
ively-0.103
ant-0.097
 efforts-0.096
antly-0.094
 fingertips-0.094
Token
Token ID198
Feature activation+1.55e+00

Pos logit contributions
ively+0.036
antly+0.036
ant+0.034
 fingertips+0.034
 Everyone+0.034
Neg logit contributions
rob-0.043
 scar-0.038
 ours-0.036
 circus-0.036
ww-0.034
Token
Token ID198
Feature activation+1.54e+00

Pos logit contributions
rob+0.076
 says+0.067
 ours+0.065
 circus+0.065
 scar+0.064
Neg logit contributions
ively-0.084
antly-0.082
ant-0.081
 fingertips-0.079
 efforts-0.078
TokenMom
Token ID29252
Feature activation-5.48e+00

Pos logit contributions
rob+0.084
 scar+0.073
 circus+0.071
 ours+0.071
 says+0.071
Neg logit contributions
ively-0.077
ant-0.076
antly-0.074
ered-0.071
ainer-0.068
Token picks
Token ID11103
Feature activation+6.91e+00

Pos logit contributions
antly+0.358
ively+0.357
ered+0.353
ant+0.340
 efforts+0.334
Neg logit contributions
rob-0.200
 ours-0.191
ww-0.176
P-0.165
 circus-0.149
Token up
Token ID510
Feature activation-7.50e-01

Pos logit contributions
rob+0.391
ww+0.342
 circus+0.336
 ours+0.330
 says+0.330
Neg logit contributions
ively-0.331
ant-0.318
antly-0.315
ered-0.298
ainer-0.287
Token Tom
Token ID4186
Feature activation+1.53e+00

Pos logit contributions
ively+0.044
antly+0.044
ant+0.043
 fingertips+0.043
 efforts+0.042
Neg logit contributions
rob-0.032
 says-0.031
 circus-0.028
 scar-0.028
 ours-0.027
Token and
Token ID290
Feature activation+1.91e+00

Pos logit contributions
rob+0.081
 circus+0.070
ww+0.069
 ours+0.069
P+0.067
Neg logit contributions
ant-0.079
ively-0.079
antly-0.077
ered-0.076
 efforts-0.071
Token puts
Token ID7584
Feature activation+6.31e+00

Pos logit contributions
rob+0.082
ww+0.074
P+0.074
 circus+0.063
 hungry+0.062
Neg logit contributions
ered-0.105
ively-0.103
antly-0.102
ant-0.100
growth-0.098
Token him
Token ID683
Feature activation+6.97e-01

Pos logit contributions
rob+0.349
 circus+0.309
 says+0.307
ww+0.304
 ours+0.299
Neg logit contributions
ively-0.296
antly-0.294
ant-0.283
ered-0.277
 fingertips-0.272
Token "
Token ID366
Feature activation+3.53e-01

Pos logit contributions
antly+0.033
ively+0.032
 Days+0.030
 fingertips+0.030
 efforts+0.030
Neg logit contributions
rob-0.053
 ours-0.046
 circus-0.044
ww-0.043
 scar-0.043
TokenYou
Token ID1639
Feature activation+5.68e+00

Pos logit contributions
rob+0.016
 says+0.015
 ours+0.014
 scar+0.014
 circus+0.014
Neg logit contributions
ively-0.020
antly-0.020
ant-0.019
ainer-0.018
 fingertips-0.018
Token are
Token ID389
Feature activation+1.36e-01

Pos logit contributions
 says+0.211
rob+0.203
 scar+0.181
 hungry+0.180
 wants+0.180
Neg logit contributions
ively-0.372
antly-0.361
ant-0.357
 fingertips-0.357
 efforts-0.349
Token very
Token ID845
Feature activation+7.89e-01

Pos logit contributions
rob+0.007
 says+0.006
 circus+0.005
ww+0.005
 scar+0.005
Neg logit contributions
ively-0.007
antly-0.007
ant-0.007
 efforts-0.007
ered-0.007
Token good
Token ID922
Feature activation+1.84e+00

Pos logit contributions
rob+0.037
 says+0.029
 circus+0.029
 ours+0.028
 scar+0.028
Neg logit contributions
ant-0.046
ered-0.045
ively-0.045
antly-0.044
 efforts-0.043
Token at
Token ID379
Feature activation+8.19e+00

Pos logit contributions
rob+0.099
 ours+0.087
 scar+0.083
 circus+0.081
P+0.080
Neg logit contributions
ant-0.091
ively-0.091
 efforts-0.089
ered-0.079
antly-0.079
Token giving
Token ID3501
Feature activation+3.78e+00

Pos logit contributions
rob+0.350
 says+0.312
 ours+0.295
 circus+0.293
 scar+0.285
Neg logit contributions
ively-0.490
antly-0.484
ant-0.479
 efforts-0.470
 fingertips-0.469
Token mass
Token ID2347
Feature activation+4.38e+00

Pos logit contributions
rob+0.136
 says+0.115
 circus+0.109
 ours+0.108
 scar+0.103
Neg logit contributions
ively-0.253
antly-0.251
ant-0.249
 efforts-0.243
 fingertips-0.243
Tokenages
Token ID1095
Feature activation-6.70e+00

Pos logit contributions
rob+0.232
 says+0.216
 ours+0.206
 circus+0.201
 scar+0.200
Neg logit contributions
ively-0.217
antly-0.211
ant-0.206
 fingertips-0.203
 efforts-0.202
Token.
Token ID13
Feature activation-8.26e-01

Pos logit contributions
ant+0.310
ively+0.310
antly+0.306
 efforts+0.300
ered+0.287
Neg logit contributions
rob-0.374
 circus-0.318
 says-0.313
 scar-0.307
 ours-0.302
Token You
Token ID921
Feature activation+7.26e+00

Pos logit contributions
ively+0.047
antly+0.047
ant+0.046
 fingertips+0.045
 efforts+0.045
Neg logit contributions
rob-0.037
 says-0.033
 ours-0.031
 circus-0.031
 scar-0.031
Token of
Token ID286
Feature activation-7.19e+00

Pos logit contributions
ant+0.457
 fingertips+0.456
ively+0.450
antly+0.447
 efforts+0.431
Neg logit contributions
rob-0.408
 says-0.406
 hungry-0.361
ww-0.359
 ours-0.359
Token the
Token ID262
Feature activation-1.99e+00

Pos logit contributions
ively+0.371
antly+0.366
ant+0.361
 fingertips+0.353
 efforts+0.353
Neg logit contributions
rob-0.365
 says-0.333
 circus-0.317
 ours-0.317
 scar-0.310
Token rides
Token ID17445
Feature activation+7.12e+00

Pos logit contributions
ively+0.103
antly+0.101
 efforts+0.100
ant+0.100
 fingertips+0.097
Neg logit contributions
rob-0.100
 says-0.089
 ours-0.087
 circus-0.083
 hungry-0.082
Token at
Token ID379
Feature activation+8.20e+00

Pos logit contributions
rob+0.301
 says+0.292
 circus+0.266
 hungry+0.260
 sees+0.256
Neg logit contributions
 fingertips-0.408
ively-0.403
ant-0.402
ainer-0.398
antly-0.398
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
rob+0.368
 says+0.344
 circus+0.323
 ours+0.323
 scar+0.304
Neg logit contributions
antly-0.479
ant-0.466
ively-0.464
 fingertips-0.460
 efforts-0.455
Token park
Token ID3952
Feature activation+6.75e+00

Pos logit contributions
ively+0.128
antly+0.127
ant+0.124
 efforts+0.124
 fingertips+0.122
Neg logit contributions
rob-0.065
 says-0.055
 ours-0.051
ww-0.048
 hungry-0.048
Token were
Token ID547
Feature activation-4.84e+00

Pos logit contributions
rob+0.295
 says+0.286
 circus+0.266
 hungry+0.261
 waiting+0.256
Neg logit contributions
ively-0.380
ant-0.376
antly-0.374
 fingertips-0.367
 efforts-0.361
Token blue
Token ID4171
Feature activation+7.58e+00

Pos logit contributions
ively+0.286
antly+0.282
ant+0.279
 efforts+0.273
 fingertips+0.273
Neg logit contributions
rob-0.209
 says-0.188
 ours-0.178
 circus-0.177
 scar-0.172
Token and
Token ID290
Feature activation+1.94e+00

Pos logit contributions
rob+0.310
 says+0.276
 circus+0.260
 ours+0.258
 scar+0.254
Neg logit contributions
ively-0.465
antly-0.459
ant-0.455
 fingertips-0.445
 efforts-0.445
Token they
Token ID484
Feature activation+2.58e+00

Pos logit contributions
rob+0.069
 says+0.059
 ours+0.056
P+0.056
use+0.053
Neg logit contributions
antly-0.123
 fingertips-0.123
ively-0.120
ered-0.118
 efforts-0.118
Token were
Token ID547
Feature activation-4.82e+00

Pos logit contributions
rob+0.120
 says+0.114
 hungry+0.108
 scar+0.106
 circus+0.106
Neg logit contributions
ively-0.143
antly-0.141
ant-0.139
ainer-0.137
 efforts-0.136
Token.
Token ID13
Feature activation-7.63e-01

Pos logit contributions
ively+0.065
antly+0.064
ant+0.063
 fingertips+0.062
 efforts+0.062
Neg logit contributions
rob-0.047
 says-0.043
 ours-0.042
 hungry-0.041
 scar-0.041
Token Suddenly
Token ID24975
Feature activation-8.09e+00

Pos logit contributions
ively+0.041
ant+0.040
antly+0.040
 efforts+0.039
ered+0.039
Neg logit contributions
rob-0.036
 says-0.034
 circus-0.032
 ours-0.031
 hungry-0.031
Token,
Token ID11
Feature activation-1.52e+00

Pos logit contributions
ively+0.468
ant+0.456
antly+0.451
 fingertips+0.451
ainer+0.443
Neg logit contributions
rob-0.336
 says-0.331
 scar-0.306
 hungry-0.305
 sleeping-0.297
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
ant+0.099
ively+0.098
antly+0.098
 fingertips+0.096
ainer+0.095
Neg logit contributions
rob-0.052
 says-0.048
 hungry-0.047
 scar-0.045
 sleeping-0.044
Token mouse
Token ID10211
Feature activation-5.06e+00

Pos logit contributions
ively+0.114
antly+0.112
ant+0.112
 fingertips+0.108
 efforts+0.108
Neg logit contributions
rob-0.075
 says-0.068
 circus-0.065
 ours-0.063
 scar-0.063
Token sc
Token ID629
Feature activation+1.02e+01

Pos logit contributions
ively+0.280
ant+0.273
antly+0.273
 efforts+0.266
 fingertips+0.265
Neg logit contributions
rob-0.232
 says-0.217
 circus-0.201
 ours-0.199
 hungry-0.198
Tokenur
Token ID333
Feature activation-2.32e+00

Pos logit contributions
 says+0.521
rob+0.469
 ours+0.461
 yells+0.454
iw+0.448
Neg logit contributions
antly-0.487
ively-0.482
ainer-0.482
ant-0.461
 efforts-0.456
Tokenried
Token ID2228
Feature activation-8.16e+00

Pos logit contributions
ively+0.087
antly+0.087
 fingertips+0.086
ant+0.085
 efforts+0.085
Neg logit contributions
rob-0.147
 says-0.135
 circus-0.130
 scar-0.129
 ours-0.129
Token over
Token ID625
Feature activation-2.22e+00

Pos logit contributions
ively+0.410
antly+0.404
ant+0.395
 efforts+0.385
 fingertips+0.384
Neg logit contributions
rob-0.433
 says-0.390
 ours-0.373
 circus-0.373
 scar-0.362
Token to
Token ID284
Feature activation+1.60e+00

Pos logit contributions
ively+0.135
antly+0.133
ant+0.132
 efforts+0.129
 fingertips+0.129
Neg logit contributions
rob-0.092
 says-0.083
 circus-0.078
 ours-0.078
 scar-0.076
Token the
Token ID262
Feature activation-1.84e+00

Pos logit contributions
rob+0.074
 says+0.070
 circus+0.067
 ours+0.065
 hungry+0.064
Neg logit contributions
antly-0.087
ively-0.085
ant-0.085
 efforts-0.085
 fingertips-0.083
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
rob+0.065
 says+0.060
 circus+0.055
 ours+0.055
 hungry+0.053
Neg logit contributions
ively-0.087
ant-0.086
antly-0.086
 efforts-0.085
 fingertips-0.085
TokenThe
Token ID464
Feature activation-7.83e+00

Pos logit contributions
rob+0.083
 says+0.077
 ours+0.073
 circus+0.072
 scar+0.072
Neg logit contributions
ively-0.069
antly-0.069
ant-0.067
ered-0.064
 efforts-0.064
Token farmer
Token ID18739
Feature activation+1.38e+00

Pos logit contributions
 efforts+0.421
antly+0.416
ively+0.400
ant+0.391
ainer+0.387
Neg logit contributions
rob-0.390
 ours-0.319
ww-0.308
iw-0.299
 says-0.299
Token drives
Token ID10182
Feature activation+8.42e+00

Pos logit contributions
rob+0.046
 ours+0.039
ww+0.036
P+0.035
 scar+0.034
Neg logit contributions
antly-0.095
ively-0.094
ered-0.092
ant-0.092
 efforts-0.091
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
rob+0.395
 says+0.368
 circus+0.345
 ours+0.344
 scar+0.337
Neg logit contributions
ively-0.457
antly-0.452
ant-0.451
 fingertips-0.446
 efforts-0.442
Token drives
Token ID10182
Feature activation+8.43e+00

Pos logit contributions
rob+0.077
 ours+0.070
P+0.070
ww+0.070
 hungry+0.061
Neg logit contributions
ively-0.106
antly-0.103
ered-0.103
 efforts-0.101
ant-0.100
Token.
Token ID13
Feature activation-8.09e-01

Pos logit contributions
rob+0.359
 says+0.346
 circus+0.317
 ours+0.312
 scar+0.309
Neg logit contributions
 fingertips-0.483
ant-0.482
ively-0.480
 efforts-0.474
antly-0.472
Token He
Token ID679
Feature activation+7.34e-01

Pos logit contributions
ant+0.048
antly+0.048
ively+0.047
 fingertips+0.046
 efforts+0.046
Neg logit contributions
rob-0.034
 says-0.033
 circus-0.030
 hungry-0.030
 ours-0.030
Token shows
Token ID2523
Feature activation+8.51e+00

Pos logit contributions
rob+0.024
 ours+0.022
ww+0.022
P+0.021
 ripe+0.018
Neg logit contributions
antly-0.048
 efforts-0.047
ively-0.047
ered-0.047
ant-0.046
Token them
Token ID606
Feature activation+3.31e-01

Pos logit contributions
 says+0.307
rob+0.278
 ours+0.261
 scary+0.249
 circus+0.248
Neg logit contributions
 fingertips-0.566
antly-0.543
ainer-0.535
 efforts-0.530
ant-0.522
Token the
Token ID262
Feature activation-1.92e+00

Pos logit contributions
 says+0.014
rob+0.014
 circus+0.013
 ours+0.013
 scar+0.013
Neg logit contributions
antly-0.019
ively-0.019
 fingertips-0.019
ainer-0.018
ant-0.018
Token who
Token ID508
Feature activation-3.91e+00

Pos logit contributions
rob+0.043
 says+0.039
 ours+0.037
 circus+0.037
 scar+0.036
Neg logit contributions
ively-0.050
antly-0.049
ant-0.049
 efforts-0.048
 fingertips-0.047
Token lost
Token ID2626
Feature activation+2.47e+00

Pos logit contributions
ively+0.208
antly+0.205
ant+0.203
 efforts+0.198
 fingertips+0.198
Neg logit contributions
rob-0.190
 says-0.171
 ours-0.165
 circus-0.163
 scar-0.160
Token her
Token ID607
Feature activation+4.32e+00

Pos logit contributions
rob+0.090
 says+0.084
 ours+0.081
 scar+0.078
 hungry+0.078
Neg logit contributions
antly-0.156
ively-0.156
ant-0.153
 efforts-0.151
 fingertips-0.149
Token gem
Token ID16840
Feature activation+3.72e-01

Pos logit contributions
rob+0.181
 says+0.168
 ours+0.161
 circus+0.157
 hungry+0.154
Neg logit contributions
ively-0.256
antly-0.252
ant-0.250
 efforts-0.241
 fingertips-0.241
Token in
Token ID287
Feature activation-9.53e-01

Pos logit contributions
rob+0.019
 says+0.017
 circus+0.017
 ours+0.016
ww+0.016
Neg logit contributions
ively-0.019
antly-0.019
ant-0.018
 efforts-0.018
 fingertips-0.018
Token a
Token ID257
Feature activation-1.37e+00

Pos logit contributions
ively+0.052
antly+0.052
ant+0.052
 efforts+0.050
 fingertips+0.050
Neg logit contributions
rob-0.044
 says-0.041
 ours-0.039
 circus-0.038
 hungry-0.038
Token deep
Token ID2769
Feature activation+3.41e+00

Pos logit contributions
antly+0.073
 fingertips+0.072
 efforts+0.071
 bottom+0.069
ively+0.068
Neg logit contributions
rob-0.071
 says-0.058
 ours-0.054
 blows-0.054
 yells-0.051
Token well
Token ID880
Feature activation-3.67e+00

Pos logit contributions
rob+0.162
 says+0.137
 ours+0.133
 circus+0.129
ww+0.124
Neg logit contributions
antly-0.194
ively-0.194
 efforts-0.187
 fingertips-0.186
ered-0.185
Token.
Token ID13
Feature activation-8.17e-01

Pos logit contributions
ively+0.178
antly+0.175
ant+0.170
 efforts+0.169
 fingertips+0.166
Neg logit contributions
rob-0.200
 says-0.173
 ours-0.172
 scar-0.166
ww-0.165
Token "
Token ID366
Feature activation+3.65e-01

Pos logit contributions
ively+0.047
antly+0.047
ant+0.046
 fingertips+0.045
 efforts+0.045
Neg logit contributions
rob-0.037
 says-0.032
 ours-0.031
 circus-0.031
 scar-0.030
TokenMaybe
Token ID13300
Feature activation-2.64e+00

Pos logit contributions
rob+0.016
 says+0.015
 ours+0.014
 circus+0.014
 scar+0.014
Neg logit contributions
ively-0.021
antly-0.021
ant-0.021
 efforts-0.020
 fingertips-0.020
Token it
Token ID340
Feature activation-1.91e+00

Pos logit contributions
rob+0.024
 says+0.022
 ours+0.021
 circus+0.021
 hungry+0.020
Neg logit contributions
ively-0.037
antly-0.037
ant-0.036
 efforts-0.036
 fingertips-0.036
Token's
Token ID338
Feature activation+1.63e+00

Pos logit contributions
ively+0.104
antly+0.103
ant+0.101
 efforts+0.099
ered+0.099
Neg logit contributions
rob-0.092
 says-0.081
 ours-0.079
 circus-0.079
 scar-0.077
Token time
Token ID640
Feature activation+8.97e-01

Pos logit contributions
rob+0.076
 says+0.070
 ours+0.067
 circus+0.066
 scar+0.065
Neg logit contributions
ively-0.090
antly-0.088
ant-0.088
 fingertips-0.086
 efforts-0.085
Token for
Token ID329
Feature activation+5.45e+00

Pos logit contributions
rob+0.044
 circus+0.035
 says+0.035
P+0.035
 scar+0.033
Neg logit contributions
antly-0.050
ively-0.050
ant-0.049
 efforts-0.049
 fingertips-0.048
Token Tom
Token ID4186
Feature activation+1.46e+00

Pos logit contributions
rob+0.154
 says+0.132
 ours+0.122
 circus+0.121
 scar+0.114
Neg logit contributions
ively-0.402
antly-0.400
ant-0.395
 efforts-0.388
 fingertips-0.387
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
rob+0.075
 says+0.062
 circus+0.062
 scar+0.059
 ours+0.059
Neg logit contributions
antly-0.075
ively-0.075
ant-0.075
 efforts-0.074
 fingertips-0.072
Token learn
Token ID2193
Feature activation-6.82e+00

Pos logit contributions
 says+0.059
 hungry+0.053
 mean+0.053
 circus+0.052
 scary+0.052
Neg logit contributions
antly-0.091
ively-0.088
ant-0.086
bled-0.085
 efforts-0.083
Token about
Token ID546
Feature activation-2.40e+00

Pos logit contributions
ively+0.375
antly+0.370
ant+0.365
 efforts+0.358
 fingertips+0.358
Neg logit contributions
rob-0.320
 says-0.291
 ours-0.277
 circus-0.275
 scar-0.268
Token science
Token ID3783
Feature activation+3.31e+00

Pos logit contributions
ively+0.130
antly+0.128
ant+0.127
 efforts+0.124
 fingertips+0.124
Neg logit contributions
rob-0.115
 says-0.103
 ours-0.098
 circus-0.098
 scar-0.096
Token!"
Token ID2474
Feature activation+5.54e+00

Pos logit contributions
rob+0.164
 says+0.145
 ours+0.140
 circus+0.137
 scar+0.136
Neg logit contributions
ively-0.173
antly-0.170
ant-0.170
 efforts-0.167
 fingertips-0.162
Token
Token ID220
Feature activation+2.68e-02

Pos logit contributions
rob+0.250
 says+0.228
 circus+0.213
 ours+0.213
 hungry+0.206
Neg logit contributions
ively-0.317
antly-0.312
ant-0.309
 efforts-0.303
 fingertips-0.301
Token help
Token ID1037
Feature activation-8.75e+00

Pos logit contributions
rob+0.095
 says+0.084
ww+0.084
 ours+0.084
 circus+0.081
Neg logit contributions
ively-0.062
ant-0.059
antly-0.058
ered-0.058
 efforts-0.057
Token him
Token ID683
Feature activation+6.70e-01

Pos logit contributions
ively+0.459
antly+0.454
ant+0.447
 efforts+0.439
 fingertips+0.437
Neg logit contributions
rob-0.433
 says-0.397
 ours-0.377
 circus-0.375
 scar-0.366
Token.
Token ID13
Feature activation-8.12e-01

Pos logit contributions
 says+0.023
 hungry+0.021
 sleeping+0.020
 scary+0.020
 waiting+0.019
Neg logit contributions
antly-0.044
ively-0.043
 fingertips-0.042
ant-0.042
 efforts-0.042
Token She
Token ID1375
Feature activation+2.34e+00

Pos logit contributions
ively+0.048
antly+0.048
ant+0.047
 fingertips+0.046
 efforts+0.046
Neg logit contributions
rob-0.036
 says-0.030
 circus-0.029
 ours-0.029
 scar-0.029
Token was
Token ID373
Feature activation-3.91e+00

Pos logit contributions
rob+0.075
 says+0.065
 ours+0.060
 circus+0.059
 scar+0.057
Neg logit contributions
ively-0.164
antly-0.163
ant-0.161
 efforts-0.158
 fingertips-0.158
Token very
Token ID845
Feature activation+7.82e-01

Pos logit contributions
ively+0.250
antly+0.249
ant+0.247
 fingertips+0.242
 efforts+0.240
Neg logit contributions
rob-0.146
 says-0.133
 hungry-0.127
 ours-0.126
 circus-0.123
Token kind
Token ID1611
Feature activation+2.14e-01

Pos logit contributions
rob+0.041
 circus+0.032
 ours+0.029
 scar+0.028
 pic+0.028
Neg logit contributions
ered-0.045
ively-0.044
 efforts-0.043
antly-0.043
ant-0.042
Token,
Token ID11
Feature activation-1.58e+00

Pos logit contributions
rob+0.007
 says+0.007
 hungry+0.006
 circus+0.006
 ours+0.006
Neg logit contributions
ively-0.014
antly-0.014
ant-0.014
ered-0.014
 fingertips-0.014
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
 efforts+0.073
 fingertips+0.070
ively+0.070
antly+0.070
ered+0.069
Neg logit contributions
rob-0.093
P-0.077
 circus-0.075
ww-0.075
 says-0.072
Token gave
Token ID2921
Feature activation+4.86e+00

Pos logit contributions
rob+0.066
 says+0.056
 circus+0.053
 ours+0.053
 scar+0.051
Neg logit contributions
ively-0.125
antly-0.124
ant-0.123
 efforts-0.121
 fingertips-0.121
Token John
Token ID1757
Feature activation-4.92e+00

Pos logit contributions
rob+0.219
 says+0.206
 ours+0.192
 circus+0.184
 hungry+0.182
Neg logit contributions
ively-0.270
antly-0.269
 fingertips-0.269
 efforts-0.268
ant-0.268
Token It
Token ID632
Feature activation-5.24e+00

Pos logit contributions
ant+0.044
ively+0.044
 efforts+0.043
antly+0.042
ered+0.041
Neg logit contributions
 says-0.039
rob-0.037
 hungry-0.035
 circus-0.034
 ours-0.033
Token was
Token ID373
Feature activation-3.92e+00

Pos logit contributions
antly+0.287
ively+0.284
ered+0.274
ant+0.272
 fingertips+0.268
Neg logit contributions
rob-0.253
 ours-0.224
 circus-0.216
 says-0.215
 scar-0.207
Token a
Token ID257
Feature activation-1.35e+00

Pos logit contributions
ively+0.215
ant+0.213
antly+0.213
 efforts+0.208
 fingertips+0.206
Neg logit contributions
rob-0.180
 says-0.168
 ours-0.160
 circus-0.159
 hungry-0.158
Token deep
Token ID2769
Feature activation+3.40e+00

Pos logit contributions
antly+0.076
 efforts+0.074
ively+0.074
 fingertips+0.074
ant+0.070
Neg logit contributions
rob-0.064
 says-0.054
 ours-0.053
 wants-0.049
P-0.048
Token,
Token ID11
Feature activation-1.60e+00

Pos logit contributions
rob+0.180
 says+0.156
 ours+0.155
 circus+0.146
P+0.143
Neg logit contributions
ively-0.174
antly-0.173
 efforts-0.166
 fingertips-0.165
ered-0.162
Token booming
Token ID32017
Feature activation+4.20e+00

Pos logit contributions
 fingertips+0.092
antly+0.090
 itself+0.088
 efforts+0.087
 surely+0.084
Neg logit contributions
rob-0.075
P-0.057
use-0.050
iw-0.049
ww-0.049
Token voice
Token ID3809
Feature activation-4.53e+00

Pos logit contributions
rob+0.268
 says+0.241
 ours+0.240
 circus+0.231
 scar+0.230
Neg logit contributions
ively-0.169
antly-0.163
ant-0.157
 fingertips-0.155
 efforts-0.155
Token.
Token ID13
Feature activation-8.06e-01

Pos logit contributions
ively+0.250
antly+0.244
ant+0.239
ered+0.235
 fingertips+0.233
Neg logit contributions
rob-0.219
 says-0.192
 ours-0.191
 circus-0.187
 scar-0.183
Token The
Token ID383
Feature activation-6.73e+00

Pos logit contributions
ant+0.038
ively+0.037
 efforts+0.037
antly+0.036
ered+0.035
Neg logit contributions
 says-0.043
rob-0.041
 hungry-0.039
 circus-0.038
 ours-0.038
Token voice
Token ID3809
Feature activation-4.51e+00

Pos logit contributions
ant+0.480
antly+0.465
ively+0.465
 fingertips+0.454
 efforts+0.452
Neg logit contributions
 says-0.213
rob-0.201
 circus-0.196
 hungry-0.194
 scary-0.186
Token told
Token ID1297
Feature activation+5.70e+00

Pos logit contributions
ant+0.322
ainer+0.321
 fingertips+0.321
 efforts+0.320
ively+0.316
Neg logit contributions
 says-0.130
rob-0.124
 scary-0.100
 yells-0.097
 circus-0.097
Token their
Token ID511
Feature activation+4.21e+00

Pos logit contributions
rob+0.078
 circus+0.068
 says+0.068
ww+0.067
 ours+0.067
Neg logit contributions
antly-0.069
ively-0.069
ant-0.064
ered-0.063
 fingertips-0.062
Token boats
Token ID16659
Feature activation+3.40e+00

Pos logit contributions
rob+0.198
 says+0.169
ww+0.159
 ours+0.159
 circus+0.156
Neg logit contributions
antly-0.233
 efforts-0.228
ively-0.228
 fingertips-0.228
ant-0.221
Token.
Token ID13
Feature activation-8.19e-01

Pos logit contributions
rob+0.181
 says+0.161
ww+0.157
 circus+0.156
 ours+0.154
Neg logit contributions
ively-0.165
antly-0.163
ant-0.158
 efforts-0.155
ered-0.154
Token They
Token ID1119
Feature activation+3.82e-01

Pos logit contributions
ively+0.050
antly+0.050
ant+0.049
 efforts+0.048
 fingertips+0.048
Neg logit contributions
rob-0.034
 says-0.030
 ours-0.028
 circus-0.028
 scar-0.027
Token have
Token ID423
Feature activation-2.14e+00

Pos logit contributions
rob+0.020
 says+0.017
 ours+0.017
ww+0.017
 circus+0.017
Neg logit contributions
ively-0.019
antly-0.019
ant-0.019
ered-0.018
 fingertips-0.018
Token a
Token ID257
Feature activation-1.37e+00

Pos logit contributions
 efforts+0.105
antly+0.103
ered+0.102
 fingertips+0.100
ively+0.098
Neg logit contributions
rob-0.113
 says-0.096
ww-0.093
iw-0.090
 circus-0.089
Token race
Token ID3234
Feature activation+3.62e+00

Pos logit contributions
 efforts+0.072
antly+0.070
 fingertips+0.070
 bottom+0.068
 unison+0.066
Neg logit contributions
rob-0.070
 says-0.056
 ours-0.054
 yells-0.052
 wants-0.052
Token.
Token ID13
Feature activation-8.26e-01

Pos logit contributions
rob+0.154
 says+0.139
 ours+0.131
 circus+0.131
 scar+0.126
Neg logit contributions
ively-0.217
antly-0.214
ant-0.213
 fingertips-0.208
 efforts-0.208
Token They
Token ID1119
Feature activation+3.81e-01

Pos logit contributions
ively+0.055
antly+0.055
ant+0.055
 efforts+0.054
ered+0.053
Neg logit contributions
rob-0.028
 says-0.027
 circus-0.024
 ours-0.024
 hungry-0.023
Token say
Token ID910
Feature activation+6.94e+00

Pos logit contributions
rob+0.019
 says+0.016
 ours+0.016
 circus+0.016
ww+0.015
Neg logit contributions
ively-0.020
antly-0.020
ant-0.020
 efforts-0.019
 fingertips-0.019
Token:
Token ID25
Feature activation-4.41e+00

Pos logit contributions
rob+0.321
 says+0.297
 circus+0.280
 ours+0.279
 scar+0.273
Neg logit contributions
ively-0.387
antly-0.380
ant-0.376
 fingertips-0.368
 efforts-0.366
Token
Token ID220
Feature activation+4.40e-02

Pos logit contributions
ively+0.045
ant+0.045
antly+0.044
 efforts+0.043
ered+0.043
Neg logit contributions
rob-0.036
 says-0.035
 circus-0.032
 ours-0.032
 hungry-0.031
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
 says+0.002
rob+0.002
 ours+0.001
 hungry+0.001
 circus+0.001
Neg logit contributions
ively-0.003
 efforts-0.003
ant-0.003
 fingertips-0.003
antly-0.003
Token
Token ID198
Feature activation+1.48e+00

Pos logit contributions
rob+0.059
 says+0.056
 circus+0.049
 ours+0.048
 hungry+0.047
Neg logit contributions
ively-0.092
ant-0.091
 efforts-0.091
 fingertips-0.090
antly-0.090
TokenTom
Token ID13787
Feature activation+2.44e+00

Pos logit contributions
rob+0.072
 says+0.067
 circus+0.064
P+0.063
 ours+0.063
Neg logit contributions
ively-0.078
 fingertips-0.077
ant-0.076
antly-0.075
 efforts-0.075
Token had
Token ID550
Feature activation-1.01e-01

Pos logit contributions
 says+0.107
rob+0.104
 scar+0.095
 wants+0.092
 sees+0.092
Neg logit contributions
ively-0.140
ant-0.135
 fingertips-0.135
ainer-0.132
antly-0.131
Token learned
Token ID4499
Feature activation-4.30e+00

Pos logit contributions
ively+0.005
antly+0.005
ant+0.005
 fingertips+0.005
 efforts+0.005
Neg logit contributions
rob-0.005
 says-0.005
 ours-0.005
 circus-0.005
 scar-0.005
Token a
Token ID257
Feature activation-1.35e+00

Pos logit contributions
ively+0.245
 fingertips+0.245
ant+0.242
antly+0.241
ainer+0.234
Neg logit contributions
rob-0.184
 says-0.181
 ours-0.170
 scar-0.166
 circus-0.165
Token valuable
Token ID8119
Feature activation+6.08e-01

Pos logit contributions
ively+0.082
ant+0.080
antly+0.078
 fingertips+0.078
ered+0.077
Neg logit contributions
rob-0.054
 says-0.052
 circus-0.049
ww-0.049
 scary-0.049
Token lesson
Token ID11483
Feature activation-4.28e+00

Pos logit contributions
rob+0.028
 says+0.026
ww+0.026
 scary+0.026
 circus+0.025
Neg logit contributions
 fingertips-0.033
ively-0.033
ant-0.031
antly-0.031
ainer-0.029
Token from
Token ID422
Feature activation+2.16e+00

Pos logit contributions
 fingertips+0.227
antly+0.225
ively+0.222
ainer+0.221
ant+0.219
Neg logit contributions
rob-0.203
 says-0.193
 circus-0.177
 ours-0.176
 waiting-0.175
Token his
Token ID465
Feature activation+3.97e+00

Pos logit contributions
rob+0.105
 says+0.101
 ours+0.098
 circus+0.098
 hungry+0.096
Neg logit contributions
antly-0.111
ively-0.107
ant-0.107
 fingertips-0.107
 efforts-0.105
Token was
Token ID373
Feature activation-5.30e+00

Pos logit contributions
rob+0.107
 says+0.100
 ours+0.095
 circus+0.094
 hungry+0.093
Neg logit contributions
ively-0.085
antly-0.084
ant-0.083
 efforts-0.081
 fingertips-0.081
Token a
Token ID257
Feature activation-2.08e+00

Pos logit contributions
ively+0.279
antly+0.275
ant+0.275
 fingertips+0.268
 efforts+0.267
Neg logit contributions
rob-0.259
 says-0.238
 ours-0.228
 circus-0.228
 hungry-0.224
Token little
Token ID1310
Feature activation-6.57e+00

Pos logit contributions
antly+0.102
 efforts+0.100
 fingertips+0.100
ively+0.096
 bottom+0.095
Neg logit contributions
rob-0.113
 says-0.093
 ours-0.091
 yells-0.085
 wants-0.083
Token girl
Token ID2576
Feature activation-1.35e-01

Pos logit contributions
antly+0.372
 efforts+0.368
ively+0.365
 fingertips+0.358
ant+0.353
Neg logit contributions
rob-0.299
 ours-0.231
 says-0.224
P-0.216
 circus-0.211
Token called
Token ID1444
Feature activation+2.41e+00

Pos logit contributions
ively+0.007
antly+0.007
ant+0.007
 fingertips+0.007
 efforts+0.007
Neg logit contributions
rob-0.007
 says-0.006
 ours-0.006
 circus-0.006
 scar-0.006
Token Lucy
Token ID22162
Feature activation-5.23e+00

Pos logit contributions
rob+0.107
 says+0.103
 ours+0.097
 hungry+0.097
 sleeping+0.096
Neg logit contributions
 efforts-0.130
antly-0.130
ively-0.127
 fingertips-0.127
ant-0.125
Token who
Token ID508
Feature activation-3.95e+00

Pos logit contributions
ively+0.322
ant+0.314
 efforts+0.313
antly+0.311
ainer+0.311
Neg logit contributions
rob-0.199
 says-0.198
 hungry-0.178
 sleeping-0.176
 scar-0.174
Token loved
Token ID6151
Feature activation-3.84e+00

Pos logit contributions
ively+0.197
antly+0.194
ant+0.191
 efforts+0.187
 fingertips+0.187
Neg logit contributions
rob-0.206
 says-0.190
 circus-0.180
 ours-0.179
 hungry-0.176
Token to
Token ID284
Feature activation+1.51e+00

Pos logit contributions
 efforts+0.192
ively+0.189
 fingertips+0.189
ant+0.182
antly+0.177
Neg logit contributions
rob-0.209
 says-0.173
P-0.169
ww-0.167
 ours-0.161
Token hug
Token ID16225
Feature activation-8.98e-01

Pos logit contributions
rob+0.064
 says+0.059
 circus+0.056
 ours+0.055
 scar+0.054
Neg logit contributions
ively-0.090
antly-0.089
ant-0.087
 efforts-0.085
 fingertips-0.085
Token.
Token ID13
Feature activation-7.63e-01

Pos logit contributions
ively+0.052
antly+0.051
ant+0.050
 fingertips+0.049
 efforts+0.049
Neg logit contributions
rob-0.041
 says-0.036
 ours-0.034
 circus-0.034
 scar-0.033
Token.
Token ID13
Feature activation-8.02e-01

Pos logit contributions
rob+0.425
 says+0.378
 circus+0.361
 ours+0.350
 hungry+0.347
Neg logit contributions
ively-0.541
antly-0.536
ant-0.526
 fingertips-0.519
 efforts-0.512
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
 efforts+0.053
ant+0.053
ively+0.053
ered+0.052
 fingertips+0.052
Neg logit contributions
rob-0.029
 says-0.029
 wants-0.023
 ours-0.023
 hungry-0.023
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
rob+0.057
 says+0.054
 circus+0.045
 sleeping+0.045
 hungry+0.045
Neg logit contributions
 efforts-0.096
ively-0.095
ant-0.095
 fingertips-0.095
antly-0.093
TokenAfter
Token ID3260
Feature activation-3.84e+00

Pos logit contributions
rob+0.073
P+0.066
 says+0.064
 circus+0.063
ww+0.063
Neg logit contributions
 fingertips-0.079
ively-0.078
antly-0.077
 efforts-0.077
ant-0.075
Token a
Token ID257
Feature activation-1.36e+00

Pos logit contributions
ively+0.242
ant+0.240
antly+0.239
 fingertips+0.231
ainer+0.227
Neg logit contributions
rob-0.155
 circus-0.137
 says-0.136
 hungry-0.135
 sleeping-0.133
Token while
Token ID981
Feature activation-3.87e+00

Pos logit contributions
ively+0.075
antly+0.074
ant+0.074
 fingertips+0.071
ered+0.071
Neg logit contributions
rob-0.064
 says-0.058
 circus-0.056
 ours-0.055
 scar-0.055
Token,
Token ID11
Feature activation-1.58e+00

Pos logit contributions
ant+0.223
ively+0.220
antly+0.217
 fingertips+0.211
ered+0.210
Neg logit contributions
rob-0.170
 says-0.160
 hungry-0.156
 sleeping-0.149
 circus-0.148
Token Sarah
Token ID10490
Feature activation-5.78e+00

Pos logit contributions
ant+0.112
antly+0.110
ively+0.109
ainer+0.109
 fingertips+0.108
Neg logit contributions
rob-0.045
 hungry-0.044
 scary-0.042
 says-0.041
 fun-0.041
Token's
Token ID338
Feature activation+1.61e+00

Pos logit contributions
ively+0.321
antly+0.316
ant+0.313
 efforts+0.306
 fingertips+0.306
Neg logit contributions
rob-0.269
 says-0.244
 circus-0.230
 ours-0.230
 scar-0.225
Token face
Token ID1986
Feature activation-8.90e+00

Pos logit contributions
 says+0.079
 hungry+0.078
 circus+0.076
 scar+0.076
 sleeping+0.075
Neg logit contributions
ively-0.085
antly-0.082
ant-0.082
ered-0.078
ved-0.074
Token turned
Token ID2900
Feature activation-3.67e+00

Pos logit contributions
ively+0.504
ant+0.488
 efforts+0.457
 fingertips+0.454
antly+0.439
Neg logit contributions
 says-0.385
 hungry-0.380
 sleeping-0.372
rob-0.370
 scary-0.367
Token the
Token ID262
Feature activation-1.95e+00

Pos logit contributions
ively+0.046
antly+0.045
ant+0.044
 efforts+0.043
 fingertips+0.043
Neg logit contributions
rob-0.052
 says-0.047
 ours-0.046
 circus-0.045
ww-0.044
Token park
Token ID3952
Feature activation+6.70e+00

Pos logit contributions
antly+0.115
 Days+0.112
 bottom+0.111
ively+0.110
growth+0.106
Neg logit contributions
rob-0.084
 ours-0.073
 says-0.071
 yells-0.067
etchup-0.065
Token when
Token ID618
Feature activation-5.83e+00

Pos logit contributions
 hungry+0.294
 waiting+0.285
 says+0.285
rob+0.283
 sleeping+0.283
Neg logit contributions
antly-0.370
ant-0.363
ively-0.363
 fingertips-0.354
 efforts-0.351
Token she
Token ID673
Feature activation+4.58e+00

Pos logit contributions
ively+0.439
ant+0.435
antly+0.434
 efforts+0.427
 fingertips+0.423
Neg logit contributions
rob-0.154
 says-0.129
 circus-0.118
 ours-0.116
 hungry-0.115
Token spotted
Token ID13489
Feature activation-4.06e-01

Pos logit contributions
rob+0.141
 says+0.123
 circus+0.112
 hungry+0.109
 scar+0.107
Neg logit contributions
ively-0.326
ant-0.322
antly-0.320
 efforts-0.316
 fingertips-0.314
Token something
Token ID1223
Feature activation-4.58e+00

Pos logit contributions
 fingertips+0.026
ant+0.026
 efforts+0.026
ively+0.026
antly+0.026
Neg logit contributions
rob-0.015
 says-0.014
 hungry-0.013
 waiting-0.013
 scary-0.013
Token.
Token ID13
Feature activation-8.22e-01

Pos logit contributions
ively+0.271
antly+0.270
 fingertips+0.265
ered+0.264
ant+0.256
Neg logit contributions
rob-0.200
 ours-0.158
 says-0.156
 circus-0.156
ww-0.151
Token It
Token ID632
Feature activation-5.23e+00

Pos logit contributions
ively+0.036
antly+0.035
ant+0.035
 fingertips+0.034
 efforts+0.034
Neg logit contributions
rob-0.048
 says-0.044
 ours-0.042
 circus-0.042
 scar-0.042
Token was
Token ID373
Feature activation-3.93e+00

Pos logit contributions
ively+0.227
ant+0.221
antly+0.221
 efforts+0.217
 fingertips+0.214
Neg logit contributions
rob-0.308
 says-0.286
 circus-0.271
 ours-0.271
 hungry-0.267
Token a
Token ID257
Feature activation-1.36e+00

Pos logit contributions
ively+0.190
antly+0.188
ant+0.182
 fingertips+0.180
 efforts+0.180
Neg logit contributions
rob-0.215
 says-0.192
 ours-0.183
 circus-0.183
ww-0.178
Token crocod
Token ID37565
Feature activation+4.64e+00

Pos logit contributions
 fingertips+0.076
antly+0.075
 bottom+0.074
 efforts+0.071
ively+0.069
Neg logit contributions
rob-0.063
 says-0.053
 ours-0.049
 wants-0.046
 yells-0.045
Token's
Token ID338
Feature activation+1.62e+00

Pos logit contributions
rob+0.034
 says+0.030
 ours+0.029
 circus+0.029
 scar+0.028
Neg logit contributions
ively-0.041
antly-0.040
ant-0.040
 efforts-0.039
 fingertips-0.039
Token back
Token ID736
Feature activation-3.15e+00

Pos logit contributions
rob+0.048
 says+0.040
 ours+0.037
 circus+0.036
 scar+0.034
Neg logit contributions
ively-0.118
antly-0.117
ant-0.116
 efforts-0.115
 fingertips-0.115
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
ively+0.170
antly+0.167
ant+0.163
ered+0.163
 efforts+0.163
Neg logit contributions
rob-0.154
 ours-0.132
 says-0.130
 circus-0.130
ww-0.129
Token they
Token ID484
Feature activation+2.50e+00

Pos logit contributions
rob+0.089
 ours+0.078
 circus+0.076
 says+0.075
ww+0.074
Neg logit contributions
antly-0.102
ively-0.100
ant-0.098
ered-0.097
 efforts-0.097
Token rode
Token ID22075
Feature activation+6.08e+00

Pos logit contributions
rob+0.099
 says+0.090
 scar+0.085
 hungry+0.080
 circus+0.080
Neg logit contributions
ively-0.159
antly-0.155
ant-0.155
 fingertips-0.153
 efforts-0.152
Token through
Token ID832
Feature activation-4.22e+00

Pos logit contributions
rob+0.273
 says+0.244
 ours+0.232
 circus+0.231
 scar+0.224
Neg logit contributions
ively-0.350
antly-0.347
ant-0.341
 efforts-0.335
 fingertips-0.333
Token the
Token ID262
Feature activation-1.95e+00

Pos logit contributions
ant+0.257
ively+0.255
antly+0.254
ainer+0.252
 fingertips+0.244
Neg logit contributions
rob-0.173
 says-0.160
 circus-0.157
 scar-0.155
 ours-0.148
Token fields
Token ID7032
Feature activation+2.67e+00

Pos logit contributions
antly+0.114
 Days+0.108
ively+0.105
 Everyone+0.104
 Everybody+0.104
Neg logit contributions
rob-0.082
 ours-0.073
 says-0.070
etchup-0.068
iw-0.067
Token.
Token ID13
Feature activation-8.14e-01

Pos logit contributions
rob+0.109
 says+0.101
 circus+0.091
 hungry+0.091
 ours+0.090
Neg logit contributions
ant-0.162
ively-0.161
antly-0.160
 fingertips-0.158
 efforts-0.156
Token The
Token ID383
Feature activation-6.72e+00

Pos logit contributions
ant+0.042
ively+0.042
antly+0.042
ered+0.041
 efforts+0.041
Neg logit contributions
rob-0.040
 says-0.038
 circus-0.036
 ours-0.035
 hungry-0.035
Token pig
Token ID12967
Feature activation+5.36e+00

Pos logit contributions
antly+0.415
ively+0.404
 efforts+0.402
ainer+0.386
ant+0.386
Neg logit contributions
rob-0.269
 says-0.225
 ours-0.215
ww-0.205
 circus-0.204
Token.
Token ID13
Feature activation-8.03e-01

Pos logit contributions
ively+0.033
ant+0.033
antly+0.033
 fingertips+0.032
ainer+0.031
Neg logit contributions
 says-0.032
rob-0.032
 hungry-0.031
 ours-0.030
 scary-0.030
Token He
Token ID679
Feature activation+7.50e-01

Pos logit contributions
ant+0.055
ively+0.051
ainer+0.051
ved+0.051
 fingertips+0.051
Neg logit contributions
 says-0.028
 hungry-0.024
 wants-0.022
 fun-0.022
 late-0.021
Token kept
Token ID4030
Feature activation-7.43e+00

Pos logit contributions
 says+0.034
rob+0.032
 scar+0.031
 wants+0.030
 hungry+0.029
Neg logit contributions
ively-0.043
ainer-0.042
ant-0.042
 fingertips-0.039
antly-0.039
Token trying
Token ID2111
Feature activation+3.92e+00

Pos logit contributions
ively+0.449
antly+0.446
ered+0.444
 efforts+0.440
ant+0.439
Neg logit contributions
rob-0.321
 says-0.277
 ours-0.266
 circus-0.260
ww-0.258
Token and
Token ID290
Feature activation+1.87e+00

Pos logit contributions
rob+0.142
 says+0.139
 scar+0.134
 sleeping+0.131
 hungry+0.131
Neg logit contributions
ively-0.244
ant-0.237
antly-0.235
 fingertips-0.229
ainer-0.228
Token eventually
Token ID4191
Feature activation-1.16e+01

Pos logit contributions
 says+0.075
rob+0.075
 waiting+0.072
 sleeping+0.072
 hungry+0.070
Neg logit contributions
ively-0.107
ant-0.107
ainer-0.106
antly-0.105
 fingertips-0.102
Token was
Token ID373
Feature activation-3.90e+00

Pos logit contributions
ively+0.897
ant+0.884
antly+0.863
 fingertips+0.846
ainer+0.835
Neg logit contributions
 says-0.285
 hungry-0.266
rob-0.248
 sleeping-0.236
 scary-0.235
Token flexible
Token ID12846
Feature activation+8.45e-01

Pos logit contributions
ively+0.228
ant+0.226
antly+0.225
 fingertips+0.218
 efforts+0.214
Neg logit contributions
rob-0.157
 says-0.154
 hungry-0.153
 ours-0.149
 sleeping-0.146
Token again
Token ID757
Feature activation-6.14e+00

Pos logit contributions
rob+0.038
 says+0.035
 ours+0.033
 hungry+0.033
 circus+0.033
Neg logit contributions
ively-0.047
antly-0.047
ant-0.046
 fingertips-0.045
ered-0.044
Token.
Token ID13
Feature activation-8.11e-01

Pos logit contributions
ively+0.364
ant+0.356
antly+0.353
 fingertips+0.346
 efforts+0.345
Neg logit contributions
rob-0.251
 says-0.234
 hungry-0.220
 sleeping-0.216
 circus-0.215
Token He
Token ID679
Feature activation+7.41e-01

Pos logit contributions
ant+0.054
ively+0.050
ved+0.050
ainer+0.050
 efforts+0.049
Neg logit contributions
 says-0.030
 hungry-0.027
 wants-0.024
 fun-0.024
rob-0.024
Token to
Token ID284
Feature activation+1.49e+00

Pos logit contributions
rob+0.206
ww+0.189
 ours+0.187
 says+0.184
 wants+0.176
Neg logit contributions
antly-0.118
ively-0.114
ered-0.113
 surely-0.105
 efforts-0.105
Token the
Token ID262
Feature activation-1.95e+00

Pos logit contributions
rob+0.083
 says+0.074
 ours+0.071
ww+0.071
 circus+0.070
Neg logit contributions
ively-0.070
antly-0.068
ant-0.067
 fingertips-0.066
 efforts-0.066
Token market
Token ID1910
Feature activation+5.72e+00

Pos logit contributions
antly+0.127
ively+0.126
 bottom+0.121
ant+0.121
ainer+0.120
Neg logit contributions
rob-0.069
 says-0.060
 ours-0.055
 wants-0.052
 yells-0.051
Token with
Token ID351
Feature activation+2.58e-01

Pos logit contributions
rob+0.233
 says+0.217
 ours+0.205
 hungry+0.204
 circus+0.203
Neg logit contributions
ively-0.346
antly-0.342
ant-0.340
 fingertips-0.333
 efforts-0.331
Token his
Token ID465
Feature activation+3.97e+00

Pos logit contributions
rob+0.010
 says+0.009
 ours+0.008
 hungry+0.008
 circus+0.008
Neg logit contributions
ively-0.017
antly-0.017
ant-0.017
 efforts-0.016
 fingertips-0.016
Token mum
Token ID25682
Feature activation-1.23e+01

Pos logit contributions
rob+0.245
 says+0.229
 ours+0.221
 circus+0.220
 hungry+0.216
Neg logit contributions
ively-0.163
antly-0.159
ant-0.157
 fingertips-0.151
 efforts-0.150
Token.
Token ID13
Feature activation-8.14e-01

Pos logit contributions
ively+0.612
antly+0.607
ant+0.594
 efforts+0.584
 fingertips+0.578
Neg logit contributions
rob-0.663
 ours-0.564
 circus-0.561
ww-0.560
 says-0.559
Token He
Token ID679
Feature activation+7.37e-01

Pos logit contributions
ively+0.043
antly+0.042
ant+0.042
 efforts+0.041
 fingertips+0.040
Neg logit contributions
rob-0.040
 says-0.037
 circus-0.035
 ours-0.035
 hungry-0.035
Token saw
Token ID2497
Feature activation+3.18e+00

Pos logit contributions
rob+0.029
 says+0.025
 ours+0.024
 circus+0.023
 scar+0.022
Neg logit contributions
ively-0.046
antly-0.046
ant-0.045
 efforts-0.045
 fingertips-0.045
Token lots
Token ID6041
Feature activation-2.32e+00

Pos logit contributions
rob+0.163
 says+0.149
 ours+0.142
 circus+0.142
 scar+0.138
Neg logit contributions
ively-0.164
antly-0.162
ant-0.159
 fingertips-0.156
 efforts-0.156
Token of
Token ID286
Feature activation-7.10e+00

Pos logit contributions
ively+0.062
ant+0.057
 efforts+0.053
ered+0.052
antly+0.051
Neg logit contributions
rob-0.175
 says-0.160
 ours-0.159
 circus-0.156
 scar-0.156
Token and
Token ID290
Feature activation+1.85e+00

Pos logit contributions
ively+0.167
ered+0.165
antly+0.163
ant+0.152
 bottom+0.148
Neg logit contributions
rob-0.243
ww-0.217
etchup-0.212
P-0.210
iw-0.203
Token saw
Token ID2497
Feature activation+3.18e+00

Pos logit contributions
 says+0.062
rob+0.061
 hungry+0.056
 circus+0.054
 sees+0.054
Neg logit contributions
ively-0.123
ant-0.123
ainer-0.122
antly-0.121
 fingertips-0.120
Token a
Token ID257
Feature activation-1.37e+00

Pos logit contributions
rob+0.131
 says+0.126
 scary+0.116
 hungry+0.110
 circus+0.109
Neg logit contributions
 efforts-0.201
 fingertips-0.198
ainer-0.194
ant-0.193
ively-0.192
Token pe
Token ID613
Feature activation+8.99e+00

Pos logit contributions
antly+0.078
 fingertips+0.075
ively+0.073
 efforts+0.072
ant+0.071
Neg logit contributions
rob-0.062
 ours-0.053
 says-0.052
 wants-0.047
P-0.046
Tokenb
Token ID65
Feature activation+3.61e+00

Pos logit contributions
rob+0.302
 says+0.260
 ours+0.247
 circus+0.244
 scar+0.232
Neg logit contributions
ively-0.617
antly-0.613
ant-0.607
ered-0.598
 fingertips-0.589
Tokenble
Token ID903
Feature activation-1.16e+01

Pos logit contributions
rob+0.224
 says+0.214
P+0.201
 hungry+0.200
 sees+0.200
Neg logit contributions
 efforts-0.135
ively-0.129
 fingertips-0.124
ainer-0.124
 Days-0.121
Token.
Token ID13
Feature activation-8.14e-01

Pos logit contributions
ively+0.593
antly+0.587
ant+0.570
ered+0.557
 fingertips+0.556
Neg logit contributions
rob-0.596
 says-0.527
 ours-0.524
 scar-0.504
 circus-0.503
Token He
Token ID679
Feature activation+7.37e-01

Pos logit contributions
ively+0.036
antly+0.036
ant+0.035
 efforts+0.034
 fingertips+0.034
Neg logit contributions
rob-0.047
 says-0.044
 circus-0.042
 ours-0.042
 hungry-0.041
Token thought
Token ID1807
Feature activation-3.53e-02

Pos logit contributions
rob+0.032
 says+0.031
 scar+0.029
 hungry+0.028
 circus+0.028
Neg logit contributions
ively-0.043
ant-0.041
antly-0.041
ainer-0.040
 efforts-0.040
Token it
Token ID340
Feature activation-1.92e+00

Pos logit contributions
ively+0.002
antly+0.002
ant+0.002
ainer+0.002
 efforts+0.002
Neg logit contributions
rob-0.001
 says-0.001
 ours-0.001
 hungry-0.001
 scar-0.001
Token looked
Token ID3114
Feature activation+1.18e+00

Pos logit contributions
ively+0.085
antly+0.084
ant+0.082
 efforts+0.080
 fingertips+0.080
Neg logit contributions
rob-0.111
 says-0.102
 ours-0.099
 circus-0.099
 scar-0.096
Token new
Token ID649
Feature activation+1.79e+00

Pos logit contributions
rob+0.273
 says+0.243
 ours+0.240
 circus+0.235
ww+0.230
Neg logit contributions
ively-0.173
antly-0.172
 efforts-0.171
 fingertips-0.169
ant-0.168
Token p
Token ID279
Feature activation+8.73e+00

Pos logit contributions
rob+0.094
 ours+0.084
 says+0.082
 circus+0.078
ww+0.078
Neg logit contributions
 efforts-0.088
ively-0.086
antly-0.086
ant-0.086
 fingertips-0.086
Tokenaj
Token ID1228
Feature activation+7.85e+00

Pos logit contributions
rob+0.381
 says+0.333
 circus+0.322
 ours+0.322
 hungry+0.306
Neg logit contributions
ively-0.511
antly-0.508
ant-0.506
 efforts-0.498
 fingertips-0.495
Tokenamas
Token ID17485
Feature activation-1.97e+00

Pos logit contributions
rob+0.345
 says+0.295
 circus+0.286
ww+0.282
 scar+0.282
Neg logit contributions
ively-0.463
ered-0.438
ant-0.437
 efforts-0.426
antly-0.425
Token fit
Token ID4197
Feature activation-3.19e+00

Pos logit contributions
ively+0.111
ant+0.109
antly+0.109
 efforts+0.107
 fingertips+0.106
Neg logit contributions
rob-0.088
 says-0.081
 circus-0.075
 hungry-0.074
 ours-0.074
Token perfectly
Token ID7138
Feature activation-1.25e+01

Pos logit contributions
ively+0.163
antly+0.160
ant+0.159
 efforts+0.155
 fingertips+0.155
Neg logit contributions
rob-0.162
 says-0.148
 ours-0.141
 circus-0.140
 scar-0.139
Token.
Token ID13
Feature activation-8.15e-01

Pos logit contributions
antly+0.616
ively+0.608
ant+0.602
 efforts+0.596
ered+0.588
Neg logit contributions
rob-0.663
 says-0.607
 circus-0.572
 ours-0.571
ww-0.549
Token "
Token ID366
Feature activation+3.60e-01

Pos logit contributions
ively+0.043
antly+0.043
ant+0.042
 efforts+0.041
 fingertips+0.041
Neg logit contributions
rob-0.040
 says-0.037
 ours-0.035
 circus-0.035
 scar-0.034
TokenI
Token ID40
Feature activation+4.20e-02

Pos logit contributions
rob+0.017
 says+0.016
 ours+0.015
 circus+0.014
 hungry+0.014
Neg logit contributions
antly-0.020
ively-0.020
ant-0.019
 fingertips-0.019
ainer-0.018
Token love
Token ID1842
Feature activation-4.99e-01

Pos logit contributions
rob+0.002
 ours+0.002
 says+0.002
 circus+0.002
P+0.002
Neg logit contributions
antly-0.002
 efforts-0.002
ively-0.002
ered-0.002
ant-0.002
Token my
Token ID616
Feature activation+9.73e+00

Pos logit contributions
 efforts+0.029
 fingertips+0.029
ant+0.028
ively+0.028
antly+0.028
Neg logit contributions
rob-0.021
P-0.017
 says-0.017
 ours-0.016
 trash-0.016
Token<|endoftext|>
Token ID50256
Feature activation+2.54e-01

Pos logit contributions
ant+0.044
ered+0.043
ively+0.043
 efforts+0.042
antly+0.041
Neg logit contributions
 says-0.031
rob-0.029
 hungry-0.027
 ours-0.026
 circus-0.026
TokenOne
Token ID3198
Feature activation+4.59e+00

Pos logit contributions
rob+0.011
 says+0.010
 circus+0.009
 ours+0.009
 hungry+0.009
Neg logit contributions
ively-0.015
antly-0.015
ant-0.015
 efforts-0.015
 fingertips-0.014
Token day
Token ID1110
Feature activation+1.37e+00

Pos logit contributions
rob+0.168
 says+0.158
 circus+0.147
 hungry+0.146
 ours+0.145
Neg logit contributions
ively-0.300
antly-0.296
ant-0.295
 fingertips-0.285
 efforts-0.281
Token,
Token ID11
Feature activation-1.53e+00

Pos logit contributions
rob+0.080
 says+0.071
 ours+0.070
 circus+0.070
 scar+0.069
Neg logit contributions
antly-0.062
ively-0.061
ant-0.060
 efforts-0.058
ered-0.058
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
ively+0.089
 efforts+0.088
ant+0.088
 fingertips+0.087
antly+0.087
Neg logit contributions
rob-0.064
 says-0.060
 ours-0.059
 hungry-0.059
 circus-0.057
Token mum
Token ID25682
Feature activation-1.23e+01

Pos logit contributions
ively+0.091
 efforts+0.088
antly+0.087
ant+0.087
 fingertips+0.085
Neg logit contributions
rob-0.100
 says-0.086
 ours-0.085
 yells-0.081
 scar-0.076
Token saw
Token ID2497
Feature activation+3.24e+00

Pos logit contributions
ered+0.501
antly+0.498
ively+0.496
ant+0.481
 efforts+0.443
Neg logit contributions
rob-0.749
 ours-0.664
 scar-0.652
 circus-0.635
 coll-0.586
Token a
Token ID257
Feature activation-1.31e+00

Pos logit contributions
rob+0.184
 says+0.163
 circus+0.159
 ours+0.159
 scar+0.156
Neg logit contributions
ively-0.149
antly-0.147
ant-0.144
ered-0.140
 efforts-0.139
Token pie
Token ID2508
Feature activation+2.81e+00

Pos logit contributions
 efforts+0.056
 fingertips+0.056
antly+0.056
 bottom+0.055
 mummy+0.052
Neg logit contributions
rob-0.076
 says-0.064
 ours-0.063
 yells-0.060
 wants-0.059
Token at
Token ID379
Feature activation+8.25e+00

Pos logit contributions
rob+0.152
 says+0.135
 scar+0.135
 ours+0.134
 circus+0.130
Neg logit contributions
ively-0.132
antly-0.130
ant-0.127
ered-0.123
 fingertips-0.122
Token the
Token ID262
Feature activation-1.88e+00

Pos logit contributions
rob+0.455
 says+0.413
 ours+0.393
 circus+0.393
 scar+0.391
Neg logit contributions
ively-0.392
antly-0.380
ant-0.376
 efforts-0.365
 fingertips-0.363