TOP ACTIVATIONS
MAX = 0.156

dream , she saw Wh isk ers walking and talking like
, Lucy !" said Wh isk ers , " Do you
was a cat named Wh isk ers . Wh isk ers
, fluffy cat named Wh isk ers , who liked to
clever k itty named Wh isk ers . Wh isk ers
share ." Spot and Wh isk ers agreed , and they
, the cat named Wh isk ers . Spot
a lazy cat named Wh isk ers . Wh isk ers
him outside . But Wh isk ers just wanted to sleep
a naughty cat named Wh isk ers jumped in front of
, fat cat named Wh isk ers . Wh isk ers
that day on , Wh isk ers and Ch ir py
Wh isk ers . Wh isk ers had a fluffy paw
him . Spot urged Wh isk ers to come play with
ball ." Buddy and Wh isk ers stopped fighting . They
mom laughed and helped Wh isk ers get free . They
to be friends with Wh isk ers . B
that day on , Wh isk ers was no longer lazy
a black cat named Wh isk ers . Wh isk ers
. " Hey , Wh isk ers !" Spot called out

MIN ACTIVATIONS
MIN = -0.151

on dry and cozy p aj amas and sn ugg led
and put on their p aj amas . They went to
She puts on her p aj amas . She hugs her
put on her new p aj amas and showed them to
She put on her p aj amas and went to bed
to wear her new p aj amas and read the book
He put on his p aj amas and went to bed
, still in his p aj amas . Everyone in town
and put on your p aj amas . Then we can
put on her favorite p aj amas and sn ugg led
quickly get into his p aj amas , wash his face
time , her new p aj amas fit perfectly . "
He put on his p aj amas but something was missing
They put on their p aj amas and went to their
I love my new p aj amas ," said Lily .
and got into his p aj amas . Then he picked
and put on their p aj amas . They each pick
's go get your p aj amas from the drawer ."
and put on her p aj amas . Mom my read
put on her cozy p aj amas and went to bed

INTERVAL 0.079 - 0.156
CONTAINS 0.145%

's mother said : " If you want to find your
were . She gave Jos ie lots of medicine to help
its nest . The bird ie was very scared and felt
that day on , Ell y and the baby crocod ile
on many more adventurous explor ations with her new tool .

INTERVAL 0.002 - 0.079
CONTAINS 39.503%

'm sorry ," the clerk said , " we don 't
watch cartoons on TV . They sit on the couch and
" O w , ow , ow !" Ben
. He was wearing a red shirt . One day ,
M ia wanted to build a sand castle , so

INTERVAL -0.074 - 0.002
CONTAINS 60.181%

, sm elly dough !" Tim my 's
ruined my dictionary ! Now I can 't see new words
peace . Bob was in shock . He asked
hot . The man had been so kind , and she
to make it green because it was her favorite color .

INTERVAL -0.151 - -0.074
CONTAINS 0.171%

. They learned to be careful with animals . They learned
other stick . He goes home . Mom goes
But we need to be careful not to hurt any bugs
old man . She ran home and told her family all
, he learned to be careful when drawing so he wouldn
Token dream
Token ID4320
Feature activation-1.48e-02

Pos logit contributions
 generation+0.039
ility+0.038
pole+0.037
hip+0.037
cker+0.036
Neg logit contributions
 tr-0.031
 sp-0.029
uddle-0.028
 mo-0.028
 puck-0.027
Token,
Token ID11
Feature activation+8.89e-03

Pos logit contributions
 tr+0.027
 sp+0.026
 mo+0.025
uddle+0.025
 puck+0.024
Neg logit contributions
 generation-0.023
ility-0.023
hip-0.022
pole-0.022
 Him-0.021
Token she
Token ID673
Feature activation-5.31e-03

Pos logit contributions
 generation+0.014
ility+0.014
hip+0.013
pole+0.013
 Him+0.013
Neg logit contributions
 tr-0.016
 sp-0.015
uddle-0.015
 mo-0.015
 puck-0.015
Token saw
Token ID2497
Feature activation+1.99e-03

Pos logit contributions
 tr+0.009
uddle+0.009
 sp+0.008
 puck+0.008
 mo+0.008
Neg logit contributions
 generation-0.009
ility-0.008
pole-0.008
hip-0.008
ie-0.008
Token Wh
Token ID854
Feature activation+1.20e-02

Pos logit contributions
 generation+0.003
ility+0.003
hip+0.003
pole+0.003
 Him+0.003
Neg logit contributions
 tr-0.004
 sp-0.004
uddle-0.003
 mo-0.003
 puck-0.003
Tokenisk
Token ID1984
Feature activation+1.56e-01

Pos logit contributions
 generation+0.012
pole+0.010
cker+0.010
ility+0.010
 Him+0.009
Neg logit contributions
 tr-0.030
uddle-0.029
aj-0.027
 rac-0.027
 sp-0.027
Tokeners
Token ID364
Feature activation+2.14e-02

Pos logit contributions
 generation+0.230
cker+0.218
eday+0.218
ility+0.215
pole+0.210
Neg logit contributions
 tr-0.291
 rac-0.266
 sp-0.265
 mo-0.258
 sk-0.254
Token walking
Token ID6155
Feature activation-4.06e-02

Pos logit contributions
 generation+0.038
ility+0.037
pole+0.036
hip+0.036
cker+0.035
Neg logit contributions
 tr-0.036
 sp-0.034
 mo-0.032
uddle-0.032
 puck-0.031
Token and
Token ID290
Feature activation+9.00e-03

Pos logit contributions
 tr+0.081
 sp+0.078
uddle+0.074
 mo+0.074
 puck+0.073
Neg logit contributions
 generation-0.058
ility-0.056
hip-0.054
pole-0.054
cker-0.052
Token talking
Token ID3375
Feature activation-2.77e-02

Pos logit contributions
 generation+0.014
pole+0.014
ility+0.014
ie+0.014
 Him+0.013
Neg logit contributions
uddle-0.016
 tr-0.015
 puck-0.014
 sp-0.014
omo-0.014
Token like
Token ID588
Feature activation+3.56e-02

Pos logit contributions
 tr+0.057
 sp+0.055
uddle+0.053
 mo+0.053
 puck+0.052
Neg logit contributions
 generation-0.037
ility-0.036
pole-0.035
hip-0.035
 Him-0.033
Token,
Token ID11
Feature activation+1.09e-02

Pos logit contributions
 generation+0.112
ility+0.108
pole+0.107
hip+0.106
 Him+0.102
Neg logit contributions
 tr-0.129
 sp-0.125
 mo-0.121
uddle-0.119
 puck-0.119
Token Lucy
Token ID22162
Feature activation+2.18e-03

Pos logit contributions
pole+0.017
 God+0.017
hip+0.016
 Him+0.016
ie+0.016
Neg logit contributions
 puck-0.019
uddle-0.018
 mo-0.018
 sp-0.018
 tr-0.018
Token!"
Token ID2474
Feature activation-5.00e-03

Pos logit contributions
 generation+0.004
ility+0.004
hip+0.004
pole+0.004
immer+0.004
Neg logit contributions
 tr-0.003
 sp-0.003
uddle-0.003
 mo-0.003
 rac-0.003
Token said
Token ID531
Feature activation+6.86e-03

Pos logit contributions
uddle+0.008
 puck+0.008
 tr+0.008
 sp+0.008
 mo+0.008
Neg logit contributions
 generation-0.008
 Him-0.008
 God-0.008
hip-0.008
pole-0.008
Token Wh
Token ID854
Feature activation+1.13e-02

Pos logit contributions
ility+0.014
 generation+0.014
pole+0.014
hip+0.013
cker+0.013
Neg logit contributions
 tr-0.010
 sp-0.009
 mo-0.008
 rac-0.008
uddle-0.008
Tokenisk
Token ID1984
Feature activation+1.55e-01

Pos logit contributions
 generation+0.016
ility+0.015
pole+0.014
cker+0.014
 Him+0.014
Neg logit contributions
 tr-0.023
uddle-0.021
aj-0.021
omo-0.021
 sp-0.021
Tokeners
Token ID364
Feature activation+2.16e-02

Pos logit contributions
 generation+0.241
cker+0.223
ility+0.221
eday+0.220
immer+0.216
Neg logit contributions
 tr-0.294
 sp-0.268
 rac-0.256
 sk-0.250
uddle-0.249
Token,
Token ID11
Feature activation+9.48e-03

Pos logit contributions
 generation+0.040
ility+0.038
eday+0.037
pole+0.037
cker+0.036
Neg logit contributions
 tr-0.036
 sp-0.034
 mo-0.031
uddle-0.030
 rac-0.030
Token "
Token ID366
Feature activation+9.99e-04

Pos logit contributions
 generation+0.019
ility+0.019
pole+0.019
hip+0.019
 Him+0.018
Neg logit contributions
 tr-0.012
uddle-0.012
 sp-0.011
 puck-0.011
 mo-0.011
TokenDo
Token ID5211
Feature activation+3.91e-02

Pos logit contributions
 generation+0.002
ility+0.002
pole+0.002
hip+0.002
cker+0.002
Neg logit contributions
 tr-0.001
 sp-0.001
 mo-0.001
uddle-0.001
 puck-0.001
Token you
Token ID345
Feature activation+1.67e-02

Pos logit contributions
 generation+0.075
ility+0.073
pole+0.071
cker+0.070
hip+0.070
Neg logit contributions
 tr-0.061
 sp-0.057
 mo-0.052
uddle-0.052
 puck-0.051
Token was
Token ID373
Feature activation+3.22e-03

Pos logit contributions
 tr+0.015
 sp+0.014
uddle+0.013
 mo+0.013
 puck+0.012
Neg logit contributions
 generation-0.026
ility-0.026
pole-0.025
hip-0.025
ie-0.024
Token a
Token ID257
Feature activation+6.35e-03

Pos logit contributions
ility+0.006
 generation+0.006
pole+0.006
hip+0.006
cker+0.005
Neg logit contributions
 tr-0.005
 sp-0.005
 mo-0.005
 puck-0.004
 rac-0.004
Token cat
Token ID3797
Feature activation+7.78e-03

Pos logit contributions
 generation+0.013
pole+0.013
cker+0.013
 God+0.012
ie+0.012
Neg logit contributions
 tr-0.007
uddle-0.006
aj-0.006
 sneakers-0.006
 cones-0.006
Token named
Token ID3706
Feature activation-1.97e-02

Pos logit contributions
ility+0.014
 generation+0.013
immer+0.013
 Him+0.013
eday+0.013
Neg logit contributions
 tr-0.013
 sp-0.013
 mo-0.012
 sk-0.012
 rac-0.012
Token Wh
Token ID854
Feature activation+1.08e-02

Pos logit contributions
 tr+0.030
uddle+0.029
 sp+0.028
 mo+0.028
 puck+0.027
Neg logit contributions
 generation-0.036
ility-0.035
hip-0.035
pole-0.034
 Him-0.033
Tokenisk
Token ID1984
Feature activation+1.55e-01

Pos logit contributions
 generation+0.012
eday+0.011
ility+0.011
 Him+0.010
pole+0.010
Neg logit contributions
 tr-0.024
uddle-0.023
omo-0.023
aj-0.023
 rac-0.023
Tokeners
Token ID364
Feature activation+2.00e-02

Pos logit contributions
eday+0.259
 generation+0.257
ility+0.250
 Him+0.250
immer+0.248
Neg logit contributions
 tr-0.245
 rac-0.235
 puck-0.218
 sp-0.216
 sk-0.215
Token.
Token ID13
Feature activation-3.18e-03

Pos logit contributions
ility+0.045
 generation+0.044
immer+0.043
 Him+0.042
pole+0.042
Neg logit contributions
 tr-0.025
 sp-0.023
 rac-0.021
 mo-0.021
 sk-0.021
Token Wh
Token ID854
Feature activation+9.50e-03

Pos logit contributions
 tr+0.005
 sp+0.005
uddle+0.004
 mo+0.004
 puck+0.004
Neg logit contributions
 generation-0.006
hip-0.006
ility-0.006
pole-0.006
 Him-0.005
Tokenisk
Token ID1984
Feature activation+1.54e-01

Pos logit contributions
 generation+0.010
ility+0.010
 Him+0.009
pole+0.009
 God+0.009
Neg logit contributions
uddle-0.022
 tr-0.022
omo-0.021
aj-0.021
 rac-0.021
Tokeners
Token ID364
Feature activation+1.91e-02

Pos logit contributions
 generation+0.254
 Him+0.248
ility+0.246
eday+0.242
immer+0.238
Neg logit contributions
 tr-0.260
 rac-0.238
 sp-0.236
 puck-0.229
 mo-0.223
Token,
Token ID11
Feature activation+4.46e-03

Pos logit contributions
 tr+0.003
 sp+0.003
uddle+0.003
 mo+0.003
 puck+0.003
Neg logit contributions
 generation-0.004
ility-0.004
hip-0.004
pole-0.004
 Him-0.003
Token fluffy
Token ID39145
Feature activation+2.43e-03

Pos logit contributions
 generation+0.008
pole+0.008
 God+0.008
cker+0.008
 Him+0.008
Neg logit contributions
uddle-0.006
 tr-0.006
 puck-0.006
 mo-0.006
 Oscar-0.006
Token cat
Token ID3797
Feature activation+7.07e-03

Pos logit contributions
 generation+0.005
ie+0.004
hip+0.004
pole+0.004
 Him+0.004
Neg logit contributions
 puck-0.003
 tr-0.003
uddle-0.003
 sp-0.003
 mo-0.003
Token named
Token ID3706
Feature activation-2.22e-02

Pos logit contributions
 generation+0.009
ility+0.009
pole+0.008
hip+0.008
cker+0.008
Neg logit contributions
 tr-0.015
 sp-0.015
uddle-0.014
 mo-0.014
 puck-0.014
Token Wh
Token ID854
Feature activation+8.12e-03

Pos logit contributions
 tr+0.035
 sp+0.034
uddle+0.033
 mo+0.032
 puck+0.032
Neg logit contributions
 generation-0.041
ility-0.039
hip-0.039
pole-0.038
 Him-0.037
Tokenisk
Token ID1984
Feature activation+1.54e-01

Pos logit contributions
 generation+0.008
pole+0.007
cker+0.007
ility+0.007
eday+0.006
Neg logit contributions
 tr-0.020
aj-0.018
uddle-0.018
 sp-0.018
 rac-0.018
Tokeners
Token ID364
Feature activation+1.79e-02

Pos logit contributions
 generation+0.240
cker+0.238
eday+0.233
immer+0.233
ility+0.227
Neg logit contributions
 tr-0.264
 rac-0.237
 sp-0.229
 sk-0.228
 mo-0.220
Token,
Token ID11
Feature activation+4.11e-03

Pos logit contributions
 generation+0.032
ility+0.032
pole+0.030
 Him+0.030
cker+0.030
Neg logit contributions
 tr-0.030
 sp-0.028
 mo-0.026
 rac-0.025
uddle-0.025
Token who
Token ID508
Feature activation-1.54e-02

Pos logit contributions
 generation+0.006
ility+0.006
pole+0.006
hip+0.005
 Him+0.005
Neg logit contributions
uddle-0.008
 tr-0.008
 sp-0.007
 mo-0.007
 puck-0.007
Token liked
Token ID8288
Feature activation+1.95e-02

Pos logit contributions
 tr+0.024
 sp+0.023
uddle+0.022
 mo+0.022
 puck+0.021
Neg logit contributions
 generation-0.028
ility-0.027
pole-0.027
hip-0.027
cker-0.026
Token to
Token ID284
Feature activation+1.82e-03

Pos logit contributions
 generation+0.036
ility+0.034
hip+0.034
pole+0.034
 Him+0.033
Neg logit contributions
 tr-0.030
 sp-0.028
uddle-0.028
 mo-0.027
 puck-0.027
Token clever
Token ID14169
Feature activation-2.17e-02

Pos logit contributions
 tr+0.000
aj+0.000
uddle+0.000
 sneakers+0.000
 cones+0.000
Neg logit contributions
 generation-0.001
pole-0.001
 God-0.001
cker-0.001
ie-0.000
Token k
Token ID479
Feature activation-1.03e-02

Pos logit contributions
 tr+0.025
uddle+0.024
 sp+0.024
 mo+0.023
aj+0.023
Neg logit contributions
 generation-0.046
pole-0.044
ility-0.043
cker-0.043
hip-0.043
Tokenitty
Token ID9760
Feature activation+2.58e-02

Pos logit contributions
 tr+0.018
 sp+0.017
 mo+0.016
uddle+0.016
 puck+0.016
Neg logit contributions
 generation-0.017
pole-0.017
ility-0.017
hip-0.016
cker-0.016
Token named
Token ID3706
Feature activation-2.42e-02

Pos logit contributions
ility+0.040
 generation+0.039
immer+0.038
 Him+0.038
hip+0.037
Neg logit contributions
 tr-0.049
 sp-0.046
 mo-0.046
 rac-0.045
 sk-0.044
Token Wh
Token ID854
Feature activation+6.83e-03

Pos logit contributions
 tr+0.038
 sp+0.037
 mo+0.036
uddle+0.036
 puck+0.035
Neg logit contributions
 generation-0.043
ility-0.041
hip-0.041
pole-0.041
 Him-0.039
Tokenisk
Token ID1984
Feature activation+1.54e-01

Pos logit contributions
 generation+0.008
eday+0.007
ility+0.007
 Him+0.006
pole+0.006
Neg logit contributions
 tr-0.016
uddle-0.015
omo-0.015
aj-0.015
 rac-0.015
Tokeners
Token ID364
Feature activation+1.52e-02

Pos logit contributions
eday+0.254
 generation+0.254
ility+0.247
 Him+0.247
immer+0.244
Neg logit contributions
 tr-0.245
 rac-0.229
 sp-0.214
 sk-0.214
 puck-0.212
Token.
Token ID13
Feature activation-7.17e-03

Pos logit contributions
ility+0.033
 generation+0.032
immer+0.032
 Him+0.031
pole+0.031
Neg logit contributions
 tr-0.020
 sp-0.018
 rac-0.017
 mo-0.017
 sk-0.017
Token Wh
Token ID854
Feature activation+5.14e-03

Pos logit contributions
 tr+0.011
 sp+0.010
uddle+0.010
 mo+0.010
 puck+0.009
Neg logit contributions
 generation-0.013
ility-0.013
hip-0.013
pole-0.013
 Him-0.012
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.006
ility+0.005
 Him+0.005
 God+0.005
pole+0.005
Neg logit contributions
uddle-0.012
 tr-0.012
 rac-0.011
omo-0.011
aj-0.011
Tokeners
Token ID364
Feature activation+1.40e-02

Pos logit contributions
 generation+0.255
 Him+0.250
ility+0.246
eday+0.242
immer+0.239
Neg logit contributions
 tr-0.252
 rac-0.228
 sp-0.225
 puck-0.219
 sk-0.215
Token share
Token ID2648
Feature activation-2.48e-02

Pos logit contributions
 generation+0.008
ility+0.007
pole+0.007
hip+0.007
 Him+0.007
Neg logit contributions
 tr-0.011
 sp-0.011
uddle-0.011
 mo-0.010
 puck-0.010
Token."
Token ID526
Feature activation+8.37e-04

Pos logit contributions
 tr+0.046
 sp+0.045
uddle+0.043
 mo+0.043
 puck+0.042
Neg logit contributions
 generation-0.038
ility-0.037
pole-0.036
hip-0.036
cker-0.034
Token Spot
Token ID15899
Feature activation+2.02e-02

Pos logit contributions
 generation+0.002
hip+0.002
pole+0.002
ility+0.002
eday+0.001
Neg logit contributions
 tr-0.001
 sp-0.001
 mo-0.001
uddle-0.001
 puck-0.001
Token and
Token ID290
Feature activation+4.85e-03

Pos logit contributions
pole+0.035
 generation+0.034
hip+0.034
ility+0.034
ie+0.033
Neg logit contributions
 tr-0.029
uddle-0.028
 mo-0.028
 sp-0.028
 puck-0.027
Token Wh
Token ID854
Feature activation+8.55e-03

Pos logit contributions
 generation+0.010
ility+0.009
hip+0.009
pole+0.009
immer+0.009
Neg logit contributions
 tr-0.007
 sp-0.007
 mo-0.006
 rac-0.006
 puck-0.006
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.010
 Him+0.009
ility+0.008
 God+0.008
 someday+0.007
Neg logit contributions
 tr-0.020
uddle-0.019
omo-0.019
 rac-0.018
 sp-0.018
Tokeners
Token ID364
Feature activation+2.02e-02

Pos logit contributions
 generation+0.274
 Him+0.266
immer+0.258
ility+0.254
cker+0.246
Neg logit contributions
 tr-0.239
 rac-0.214
 sp-0.206
 cones-0.199
 puck-0.195
Token agreed
Token ID4987
Feature activation+1.12e-02

Pos logit contributions
 generation+0.044
ility+0.042
 Him+0.042
hip+0.040
pole+0.040
Neg logit contributions
 tr-0.029
 sp-0.026
 mo-0.024
uddle-0.022
 puck-0.022
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
 generation+0.019
ility+0.019
pole+0.018
hip+0.018
cker+0.017
Neg logit contributions
 tr-0.019
 sp-0.018
uddle-0.017
 mo-0.017
 puck-0.017
Token and
Token ID290
Feature activation+7.97e-03

Pos logit contributions
 generation+0.018
ility+0.017
pole+0.017
hip+0.017
 Him+0.016
Neg logit contributions
 tr-0.017
 sp-0.016
uddle-0.016
 mo-0.015
 puck-0.015
Token they
Token ID484
Feature activation+2.97e-03

Pos logit contributions
 generation+0.013
pole+0.013
ility+0.012
hip+0.012
cker+0.012
Neg logit contributions
 tr-0.014
 sp-0.013
uddle-0.013
 mo-0.012
 puck-0.012
Token,
Token ID11
Feature activation+3.90e-03

Pos logit contributions
 tr+0.004
 sp+0.004
uddle+0.004
 mo+0.004
 puck+0.004
Neg logit contributions
 generation-0.004
ility-0.004
pole-0.004
hip-0.004
cker-0.004
Token the
Token ID262
Feature activation+4.90e-03

Pos logit contributions
 generation+0.006
pole+0.006
ility+0.006
hip+0.006
cker+0.006
Neg logit contributions
 tr-0.007
uddle-0.006
 sp-0.006
 mo-0.006
 puck-0.006
Token cat
Token ID3797
Feature activation+3.98e-03

Pos logit contributions
 generation+0.009
pole+0.008
hip+0.008
cker+0.008
 communicate+0.008
Neg logit contributions
aj-0.008
uddle-0.007
 tr-0.007
 Liam-0.007
omo-0.007
Token named
Token ID3706
Feature activation-2.27e-02

Pos logit contributions
 generation+0.007
ility+0.007
hip+0.007
 Him+0.007
immer+0.007
Neg logit contributions
 tr-0.007
 sp-0.006
 mo-0.006
uddle-0.006
 rac-0.006
Token Wh
Token ID854
Feature activation+6.71e-03

Pos logit contributions
uddle+0.038
 tr+0.037
 mo+0.036
 sp+0.036
aj+0.034
Neg logit contributions
 generation-0.038
hip-0.037
pole-0.036
ility-0.036
ow-0.035
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.007
eday+0.006
 Him+0.006
ility+0.006
pole+0.006
Neg logit contributions
 tr-0.016
uddle-0.015
 rac-0.015
 sp-0.014
omo-0.014
Tokeners
Token ID364
Feature activation+1.71e-02

Pos logit contributions
 generation+0.251
eday+0.245
immer+0.239
ility+0.236
 Him+0.233
Neg logit contributions
 tr-0.251
 rac-0.227
 sp-0.222
 sk-0.219
 cones-0.217
Token.
Token ID13
Feature activation-7.11e-03

Pos logit contributions
 generation+0.035
ility+0.033
 Him+0.033
cker+0.033
immer+0.032
Neg logit contributions
 tr-0.025
 sp-0.023
 rac-0.021
 mo-0.021
 sk-0.020
Token
Token ID198
Feature activation-5.16e-03

Pos logit contributions
 tr+0.011
 sp+0.010
 mo+0.010
uddle+0.010
 puck+0.009
Neg logit contributions
pole-0.013
hip-0.012
 generation-0.012
ow-0.012
eday-0.012
Token
Token ID198
Feature activation-5.14e-03

Pos logit contributions
 tr+0.005
 sp+0.005
uddle+0.004
 mo+0.004
 puck+0.004
Neg logit contributions
 generation-0.013
ility-0.012
pole-0.012
hip-0.012
cker-0.012
TokenSpot
Token ID32565
Feature activation+1.45e-02

Pos logit contributions
 tr+0.009
 sp+0.009
 mo+0.008
uddle+0.008
 puck+0.008
Neg logit contributions
pole-0.008
hip-0.008
 generation-0.008
cker-0.008
eday-0.008
Token a
Token ID257
Feature activation+2.17e-04

Pos logit contributions
 tr+0.005
 sp+0.005
 mo+0.005
uddle+0.005
 puck+0.005
Neg logit contributions
 generation-0.005
ility-0.005
pole-0.005
hip-0.005
cker-0.005
Token lazy
Token ID16931
Feature activation-2.18e-02

Pos logit contributions
 generation+0.000
pole+0.000
cker+0.000
 God+0.000
ie+0.000
Neg logit contributions
 tr-0.000
uddle-0.000
aj-0.000
 sneakers-0.000
 puck-0.000
Token cat
Token ID3797
Feature activation+4.32e-03

Pos logit contributions
 tr+0.029
uddle+0.027
 sp+0.027
aj+0.027
 mo+0.026
Neg logit contributions
 generation-0.041
ie-0.040
pole-0.039
cker-0.038
 God-0.038
Token named
Token ID3706
Feature activation-2.51e-02

Pos logit contributions
ility+0.007
 generation+0.007
immer+0.007
 Him+0.007
hip+0.007
Neg logit contributions
 tr-0.008
 sp-0.007
 mo-0.007
 sk-0.007
 rac-0.007
Token Wh
Token ID854
Feature activation+4.54e-03

Pos logit contributions
 tr+0.039
uddle+0.039
 mo+0.038
 sp+0.037
 puck+0.036
Neg logit contributions
 generation-0.044
ility-0.043
hip-0.043
pole-0.041
 Him-0.041
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.005
eday+0.005
ility+0.004
 Him+0.004
pole+0.004
Neg logit contributions
 tr-0.010
uddle-0.010
omo-0.010
aj-0.010
 rac-0.009
Tokeners
Token ID364
Feature activation+1.64e-02

Pos logit contributions
eday+0.255
 generation+0.252
 Him+0.247
ility+0.244
immer+0.244
Neg logit contributions
 tr-0.241
 rac-0.229
 sp-0.213
 puck-0.212
 sk-0.211
Token.
Token ID13
Feature activation-6.35e-03

Pos logit contributions
ility+0.035
 generation+0.035
immer+0.034
 Him+0.034
pole+0.033
Neg logit contributions
 tr-0.022
 sp-0.020
 rac-0.018
 mo-0.018
 sk-0.018
Token Wh
Token ID854
Feature activation+2.82e-03

Pos logit contributions
 tr+0.010
 sp+0.009
uddle+0.009
 mo+0.009
 puck+0.008
Neg logit contributions
 generation-0.012
ility-0.011
hip-0.011
pole-0.011
 Him-0.011
Tokenisk
Token ID1984
Feature activation+1.51e-01

Pos logit contributions
 generation+0.003
ility+0.003
 Him+0.003
pole+0.003
 God+0.003
Neg logit contributions
uddle-0.007
 tr-0.006
omo-0.006
 rac-0.006
aj-0.006
Tokeners
Token ID364
Feature activation+1.48e-02

Pos logit contributions
 generation+0.254
 Him+0.249
ility+0.246
eday+0.244
immer+0.239
Neg logit contributions
 tr-0.247
 rac-0.225
 sp-0.222
 puck-0.216
 sk-0.210
Token him
Token ID683
Feature activation-8.26e-03

Pos logit contributions
 tr+0.042
 sp+0.040
uddle+0.038
 mo+0.037
 puck+0.037
Neg logit contributions
 generation-0.041
ility-0.041
pole-0.040
hip-0.039
cker-0.038
Token outside
Token ID2354
Feature activation-7.13e-03

Pos logit contributions
 tr+0.013
 sp+0.013
uddle+0.011
 rac+0.011
 puck+0.011
Neg logit contributions
 generation-0.016
ility-0.015
hip-0.015
pole-0.015
cker-0.015
Token.
Token ID13
Feature activation-2.01e-03

Pos logit contributions
 tr+0.012
 sp+0.012
 mo+0.011
uddle+0.011
 puck+0.011
Neg logit contributions
 generation-0.012
ility-0.012
pole-0.011
hip-0.011
immer-0.011
Token But
Token ID887
Feature activation-8.88e-03

Pos logit contributions
 mo+0.003
 tr+0.003
 sp+0.003
uddle+0.003
iden+0.003
Neg logit contributions
pole-0.003
hip-0.003
 generation-0.003
ow-0.003
eday-0.003
Token Wh
Token ID854
Feature activation+6.23e-03

Pos logit contributions
 tr+0.013
 sp+0.012
 puck+0.011
 rac+0.011
 mo+0.011
Neg logit contributions
 generation-0.018
ility-0.018
hip-0.017
pole-0.017
 Him-0.016
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.007
 Him+0.006
ility+0.006
 God+0.006
pole+0.006
Neg logit contributions
uddle-0.015
 tr-0.014
omo-0.014
 rac-0.014
 sp-0.014
Tokeners
Token ID364
Feature activation+1.91e-02

Pos logit contributions
 generation+0.278
 Him+0.266
immer+0.252
eday+0.251
ility+0.248
Neg logit contributions
 tr-0.241
 sp-0.208
 rac-0.201
 puck-0.197
 sk-0.191
Token just
Token ID655
Feature activation-6.25e-02

Pos logit contributions
 generation+0.038
ility+0.037
hip+0.036
pole+0.036
 Him+0.035
Neg logit contributions
 tr-0.028
 sp-0.027
uddle-0.025
 mo-0.025
 puck-0.024
Token wanted
Token ID2227
Feature activation+1.02e-02

Pos logit contributions
 tr+0.099
 sp+0.094
uddle+0.092
 mo+0.091
 puck+0.088
Neg logit contributions
 generation-0.112
ility-0.110
pole-0.107
hip-0.107
cker-0.104
Token to
Token ID284
Feature activation+3.88e-03

Pos logit contributions
 generation+0.018
ility+0.017
hip+0.017
pole+0.017
 Him+0.016
Neg logit contributions
 tr-0.016
uddle-0.016
 sp-0.015
 mo-0.015
 puck-0.014
Token sleep
Token ID3993
Feature activation-1.73e-02

Pos logit contributions
 generation+0.006
ility+0.006
hip+0.006
pole+0.006
cker+0.005
Neg logit contributions
 tr-0.007
 sp-0.007
uddle-0.007
 mo-0.007
 puck-0.006
Token a
Token ID257
Feature activation-2.90e-03

Pos logit contributions
 generation+0.008
ility+0.008
pole+0.008
hip+0.008
cker+0.007
Neg logit contributions
 tr-0.008
 sp-0.008
uddle-0.008
 mo-0.008
 puck-0.008
Token naughty
Token ID45128
Feature activation-3.19e-02

Pos logit contributions
aj+0.004
 tr+0.004
uddle+0.004
 Oscar+0.004
 sp+0.004
Neg logit contributions
 generation-0.005
pole-0.005
fly-0.005
cker-0.005
ility-0.005
Token cat
Token ID3797
Feature activation+6.24e-03

Pos logit contributions
 tr+0.036
 sp+0.033
 mo+0.032
uddle+0.032
aj+0.031
Neg logit contributions
 generation-0.070
ility-0.068
hip-0.068
pole-0.067
ie-0.066
Token named
Token ID3706
Feature activation-1.73e-02

Pos logit contributions
 generation+0.012
ility+0.011
hip+0.011
pole+0.011
 Him+0.011
Neg logit contributions
 tr-0.010
 sp-0.009
 mo-0.009
uddle-0.009
 puck-0.008
Token Wh
Token ID854
Feature activation+7.99e-03

Pos logit contributions
 tr+0.025
 sp+0.024
uddle+0.023
 mo+0.022
 puck+0.022
Neg logit contributions
 generation-0.034
ility-0.033
pole-0.032
hip-0.032
cker-0.031
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.009
 Him+0.008
ility+0.007
pole+0.007
 God+0.007
Neg logit contributions
 tr-0.019
uddle-0.018
 sp-0.017
 rac-0.017
 sk-0.017
Tokeners
Token ID364
Feature activation+1.69e-02

Pos logit contributions
 generation+0.259
 Him+0.244
immer+0.243
eday+0.242
ility+0.241
Neg logit contributions
 tr-0.244
 rac-0.221
 sk-0.213
 sp-0.211
 cones-0.208
Token jumped
Token ID11687
Feature activation-8.80e-04

Pos logit contributions
 generation+0.031
ility+0.030
hip+0.030
pole+0.029
 Him+0.029
Neg logit contributions
 tr-0.027
 sp-0.025
 mo-0.024
uddle-0.024
 puck-0.023
Token in
Token ID287
Feature activation+1.24e-02

Pos logit contributions
 tr+0.002
 sp+0.002
 mo+0.002
uddle+0.002
 puck+0.002
Neg logit contributions
 generation-0.001
ility-0.001
pole-0.001
hip-0.001
immer-0.001
Token front
Token ID2166
Feature activation-5.76e-02

Pos logit contributions
 generation+0.019
ility+0.018
hip+0.018
pole+0.018
eday+0.017
Neg logit contributions
 tr-0.022
uddle-0.022
 sp-0.021
 mo-0.021
 puck-0.020
Token of
Token ID286
Feature activation-7.73e-03

Pos logit contributions
 tr+0.076
 sp+0.071
uddle+0.067
 mo+0.067
 puck+0.065
Neg logit contributions
 generation-0.120
ility-0.117
pole-0.115
hip-0.115
cker-0.111
Token,
Token ID11
Feature activation+7.10e-03

Pos logit contributions
 generation+0.005
ility+0.005
pole+0.005
hip+0.005
cker+0.005
Neg logit contributions
 tr-0.003
 sp-0.003
uddle-0.003
 mo-0.003
 puck-0.003
Token fat
Token ID3735
Feature activation+3.95e-03

Pos logit contributions
pole+0.011
 generation+0.011
cker+0.011
 God+0.011
et+0.010
Neg logit contributions
uddle-0.011
 tr-0.010
aj-0.010
 puck-0.010
 sneakers-0.009
Token cat
Token ID3797
Feature activation+6.03e-03

Pos logit contributions
 generation+0.007
pole+0.007
ie+0.007
hip+0.007
cker+0.007
Neg logit contributions
 tr-0.005
 sp-0.005
uddle-0.005
aj-0.005
 puck-0.005
Token named
Token ID3706
Feature activation-2.32e-02

Pos logit contributions
ility+0.010
 generation+0.010
immer+0.010
 Him+0.010
hip+0.009
Neg logit contributions
 tr-0.011
 sp-0.010
 mo-0.010
 sk-0.010
 rac-0.010
Token Wh
Token ID854
Feature activation+8.86e-03

Pos logit contributions
 tr+0.033
 sp+0.031
uddle+0.031
 mo+0.031
 puck+0.030
Neg logit contributions
 generation-0.045
ility-0.044
hip-0.043
pole-0.043
 Him-0.042
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.010
eday+0.009
ility+0.008
 Him+0.008
pole+0.008
Neg logit contributions
 tr-0.021
uddle-0.020
aj-0.019
omo-0.019
 rac-0.019
Tokeners
Token ID364
Feature activation+1.88e-02

Pos logit contributions
 generation+0.252
eday+0.251
 Him+0.243
immer+0.242
ility+0.239
Neg logit contributions
 tr-0.249
 rac-0.236
 sp-0.225
 mo-0.220
 sk-0.218
Token.
Token ID13
Feature activation-4.58e-03

Pos logit contributions
ility+0.042
 generation+0.041
immer+0.040
 Him+0.040
pole+0.039
Neg logit contributions
 tr-0.024
 sp-0.022
 rac-0.020
 mo-0.020
 sk-0.020
Token Wh
Token ID854
Feature activation+7.28e-03

Pos logit contributions
 tr+0.007
 sp+0.007
uddle+0.006
 mo+0.006
 puck+0.006
Neg logit contributions
 generation-0.009
ility-0.008
hip-0.008
pole-0.008
 Him-0.008
Tokenisk
Token ID1984
Feature activation+1.52e-01

Pos logit contributions
 generation+0.008
ility+0.008
 Him+0.007
pole+0.007
 God+0.007
Neg logit contributions
uddle-0.017
 tr-0.017
 rac-0.016
omo-0.016
 sp-0.016
Tokeners
Token ID364
Feature activation+1.74e-02

Pos logit contributions
 generation+0.254
 Him+0.246
ility+0.243
eday+0.241
immer+0.236
Neg logit contributions
 tr-0.253
 rac-0.233
 sp-0.232
 puck-0.221
 mo-0.218
Token that
Token ID326
Feature activation+1.82e-02

Pos logit contributions
 tr+0.008
 sp+0.007
 mo+0.007
uddle+0.007
 Oscar+0.007
Neg logit contributions
 generation-0.007
ility-0.007
hip-0.006
pole-0.006
immer-0.006
Token day
Token ID1110
Feature activation-8.79e-04

Pos logit contributions
ility+0.032
 generation+0.032
immer+0.031
hip+0.030
cker+0.030
Neg logit contributions
 tr-0.031
 sp-0.029
 rac-0.028
 sk-0.027
 mo-0.027
Token on
Token ID319
Feature activation+3.22e-02

Pos logit contributions
 tr+0.001
 sp+0.001
 rac+0.001
 mo+0.001
uddle+0.001
Neg logit contributions
 generation-0.002
ility-0.002
cker-0.002
pole-0.002
hip-0.002
Token,
Token ID11
Feature activation+7.59e-03

Pos logit contributions
 generation+0.044
ility+0.043
hip+0.042
pole+0.042
ie+0.041
Neg logit contributions
 tr-0.063
 sp-0.061
uddle-0.059
 mo-0.059
 puck-0.058
Token Wh
Token ID854
Feature activation+5.95e-03

Pos logit contributions
 generation+0.013
ility+0.013
hip+0.013
pole+0.012
 Him+0.012
Neg logit contributions
 tr-0.012
 sp-0.012
uddle-0.012
 mo-0.011
 puck-0.011
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.005
ility+0.005
pole+0.005
eday+0.004
 Him+0.004
Neg logit contributions
 tr-0.014
 rac-0.014
uddle-0.014
aj-0.014
omo-0.014
Tokeners
Token ID364
Feature activation+1.95e-02

Pos logit contributions
 generation+0.234
ility+0.217
 Him+0.216
immer+0.214
cker+0.210
Neg logit contributions
 tr-0.286
 rac-0.269
 sp-0.256
 sk-0.249
 mo-0.248
Token and
Token ID290
Feature activation+6.40e-03

Pos logit contributions
 generation+0.032
ility+0.031
immer+0.029
hip+0.029
 Him+0.029
Neg logit contributions
 tr-0.036
 sp-0.034
 mo-0.033
 rac-0.032
uddle-0.031
Token Ch
Token ID609
Feature activation-1.93e-03

Pos logit contributions
 generation+0.012
ility+0.011
hip+0.011
ie+0.011
pole+0.011
Neg logit contributions
uddle-0.010
 tr-0.009
 sp-0.009
 mo-0.009
 puck-0.008
Tokenir
Token ID343
Feature activation+6.65e-02

Pos logit contributions
 tr+0.004
uddle+0.004
 sp+0.004
 rac+0.004
 mo+0.004
Neg logit contributions
 generation-0.002
pole-0.002
ility-0.002
hip-0.002
 Him-0.002
Tokenpy
Token ID9078
Feature activation+4.80e-02

Pos logit contributions
 generation+0.044
ility+0.036
hip+0.032
eday+0.031
 Him+0.031
Neg logit contributions
 tr-0.185
 sp-0.179
 rac-0.175
 mo-0.173
uddle-0.172
Token Wh
Token ID854
Feature activation+6.83e-03

Pos logit contributions
 tr+0.038
 sp+0.037
 mo+0.036
uddle+0.036
 puck+0.035
Neg logit contributions
 generation-0.043
ility-0.041
hip-0.041
pole-0.041
 Him-0.039
Tokenisk
Token ID1984
Feature activation+1.54e-01

Pos logit contributions
 generation+0.008
eday+0.007
ility+0.007
 Him+0.006
pole+0.006
Neg logit contributions
 tr-0.016
uddle-0.015
omo-0.015
aj-0.015
 rac-0.015
Tokeners
Token ID364
Feature activation+1.52e-02

Pos logit contributions
eday+0.254
 generation+0.254
ility+0.247
 Him+0.247
immer+0.244
Neg logit contributions
 tr-0.245
 rac-0.229
 sp-0.214
 sk-0.214
 puck-0.212
Token.
Token ID13
Feature activation-7.17e-03

Pos logit contributions
ility+0.033
 generation+0.032
immer+0.032
 Him+0.031
pole+0.031
Neg logit contributions
 tr-0.020
 sp-0.018
 rac-0.017
 mo-0.017
 sk-0.017
Token Wh
Token ID854
Feature activation+5.14e-03

Pos logit contributions
 tr+0.011
 sp+0.010
uddle+0.010
 mo+0.010
 puck+0.009
Neg logit contributions
 generation-0.013
ility-0.013
hip-0.013
pole-0.013
 Him-0.012
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.006
ility+0.005
 Him+0.005
 God+0.005
pole+0.005
Neg logit contributions
uddle-0.012
 tr-0.012
 rac-0.011
omo-0.011
aj-0.011
Tokeners
Token ID364
Feature activation+1.40e-02

Pos logit contributions
 generation+0.255
 Him+0.250
ility+0.246
eday+0.242
immer+0.239
Neg logit contributions
 tr-0.252
 rac-0.228
 sp-0.225
 puck-0.219
 sk-0.215
Token had
Token ID550
Feature activation-1.01e-02

Pos logit contributions
ility+0.030
 generation+0.029
 Him+0.028
immer+0.028
pole+0.027
Neg logit contributions
 tr-0.021
 sp-0.020
 mo-0.017
 sk-0.017
 rac-0.016
Token a
Token ID257
Feature activation-1.45e-03

Pos logit contributions
uddle+0.020
 tr+0.020
 sp+0.019
 mo+0.018
 puck+0.018
Neg logit contributions
 generation-0.013
pole-0.012
hip-0.012
ow-0.012
ility-0.011
Token fluffy
Token ID39145
Feature activation+1.11e-03

Pos logit contributions
 tr+0.002
 mo+0.002
uddle+0.002
 sp+0.002
 puck+0.002
Neg logit contributions
 generation-0.003
pole-0.003
cker-0.003
hip-0.003
fly-0.003
Token paw
Token ID41827
Feature activation-2.15e-02

Pos logit contributions
 generation+0.002
pole+0.002
hip+0.002
ie+0.002
cker+0.002
Neg logit contributions
 tr-0.002
 sp-0.002
 mo-0.002
uddle-0.002
 puck-0.002
Token him
Token ID683
Feature activation-8.84e-03

Pos logit contributions
ility+0.073
pole+0.071
immer+0.069
 generation+0.069
cker+0.067
Neg logit contributions
 tr-0.054
 sp-0.049
 rac-0.046
 Oscar-0.046
 M-0.045
Token.
Token ID13
Feature activation-6.86e-03

Pos logit contributions
 tr+0.013
 sp+0.012
uddle+0.011
 rac+0.011
 mo+0.011
Neg logit contributions
ility-0.017
 generation-0.017
pole-0.016
cker-0.016
 Him-0.016
Token Spot
Token ID15899
Feature activation+1.88e-02

Pos logit contributions
 mo+0.012
uddle+0.012
 tr+0.012
 sp+0.012
Danny+0.011
Neg logit contributions
ow-0.010
hip-0.010
 generation-0.010
pole-0.009
eday-0.009
Token urged
Token ID11643
Feature activation+2.35e-02

Pos logit contributions
 generation+0.035
pole+0.034
ility+0.034
hip+0.034
cker+0.033
Neg logit contributions
 tr-0.026
 mo-0.025
 sp-0.025
uddle-0.024
 puck-0.024
Token Wh
Token ID854
Feature activation+5.67e-03

Pos logit contributions
ility+0.056
 generation+0.056
pole+0.054
immer+0.053
cker+0.053
Neg logit contributions
 tr-0.027
 sp-0.024
 rac-0.023
 Oscar-0.022
 sk-0.022
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.006
ility+0.005
 Him+0.005
pole+0.005
eday+0.004
Neg logit contributions
 tr-0.014
uddle-0.014
 rac-0.013
omo-0.013
 sp-0.013
Tokeners
Token ID364
Feature activation+1.85e-02

Pos logit contributions
 generation+0.220
ility+0.207
eday+0.207
immer+0.206
cker+0.206
Neg logit contributions
 tr-0.287
 sp-0.264
 puck-0.258
 rac-0.258
 sk-0.246
Token to
Token ID284
Feature activation+2.88e-03

Pos logit contributions
 generation+0.039
ility+0.037
pole+0.036
cker+0.036
immer+0.036
Neg logit contributions
 tr-0.026
 sp-0.024
 rac-0.021
 mo-0.021
uddle-0.021
Token come
Token ID1282
Feature activation+1.29e-02

Pos logit contributions
 generation+0.004
ility+0.004
pole+0.004
hip+0.004
 Him+0.004
Neg logit contributions
 tr-0.005
 sp-0.005
 mo-0.005
 puck-0.005
uddle-0.005
Token play
Token ID711
Feature activation-1.38e-02

Pos logit contributions
ility+0.025
 generation+0.025
pole+0.024
hip+0.024
eday+0.023
Neg logit contributions
 tr-0.019
 sp-0.018
 rac-0.016
 mo-0.016
uddle-0.015
Token with
Token ID351
Feature activation-2.43e-02

Pos logit contributions
 tr+0.022
 sp+0.021
 rac+0.020
uddle+0.020
 puck+0.020
Neg logit contributions
 generation-0.025
ility-0.025
cker-0.024
pole-0.024
 Him-0.023
Token ball
Token ID2613
Feature activation+3.99e-02

Pos logit contributions
hip+0.010
 generation+0.010
pole+0.010
 belong+0.010
eday+0.009
Neg logit contributions
 mo-0.008
 tr-0.007
iden-0.007
 Liam-0.007
Danny-0.007
Token."
Token ID526
Feature activation+3.92e-03

Pos logit contributions
ility+0.076
 generation+0.076
 Him+0.074
hip+0.071
cker+0.070
Neg logit contributions
 tr-0.062
 sp-0.059
uddle-0.056
 mo-0.054
 puck-0.054
Token Buddy
Token ID36896
Feature activation+2.77e-02

Pos logit contributions
 generation+0.006
pole+0.006
ility+0.006
hip+0.006
 Him+0.006
Neg logit contributions
 tr-0.007
 sp-0.007
 mo-0.007
 puck-0.006
uddle-0.006
Token and
Token ID290
Feature activation+6.29e-03

Pos logit contributions
pole+0.040
ow+0.039
ie+0.038
hip+0.038
 generation+0.038
Neg logit contributions
 tr-0.047
 mo-0.046
uddle-0.045
 puck-0.044
 sp-0.044
Token Wh
Token ID854
Feature activation+9.01e-03

Pos logit contributions
 generation+0.013
ility+0.013
hip+0.013
pole+0.012
immer+0.012
Neg logit contributions
 tr-0.009
 sp-0.008
 mo-0.008
 rac-0.008
 puck-0.007
Tokenisk
Token ID1984
Feature activation+1.53e-01

Pos logit contributions
 generation+0.009
ility+0.007
 Him+0.007
eday+0.007
 God+0.006
Neg logit contributions
 tr-0.023
uddle-0.022
 sp-0.022
omo-0.022
 rac-0.021
Tokeners
Token ID364
Feature activation+1.97e-02

Pos logit contributions
 generation+0.270
 Him+0.256
immer+0.250
ility+0.247
eday+0.239
Neg logit contributions
 tr-0.246
 rac-0.225
 sp-0.218
 cones-0.206
 sk-0.206
Token stopped
Token ID5025
Feature activation+2.67e-02

Pos logit contributions
 generation+0.044
ility+0.042
 Him+0.042
hip+0.040
pole+0.040
Neg logit contributions
 tr-0.026
 sp-0.025
 mo-0.022
uddle-0.021
 puck-0.021
Token fighting
Token ID4330
Feature activation-2.87e-02

Pos logit contributions
 generation+0.050
ility+0.048
pole+0.047
hip+0.047
immer+0.046
Neg logit contributions
 tr-0.043
 sp-0.041
 mo-0.039
uddle-0.038
 puck-0.037
Token.
Token ID13
Feature activation+6.52e-03

Pos logit contributions
 tr+0.034
 sp+0.032
 puck+0.027
 mo+0.027
 rac+0.026
Neg logit contributions
 generation-0.068
ility-0.064
cker-0.064
 Him-0.064
eday-0.063
Token They
Token ID1119
Feature activation+1.59e-02

Pos logit contributions
 generation+0.010
pole+0.009
hip+0.009
 Him+0.009
ow+0.009
Neg logit contributions
 tr-0.012
 sp-0.012
 mo-0.011
 puck-0.011
uddle-0.011
Token mom
Token ID1995
Feature activation+3.35e-03

Pos logit contributions
 generation+0.034
pole+0.033
ie+0.032
cker+0.032
hip+0.032
Neg logit contributions
 tr-0.022
 sp-0.020
 puck-0.019
iden-0.019
 mo-0.019
Token laughed
Token ID13818
Feature activation-3.63e-02

Pos logit contributions
ility+0.006
 generation+0.006
pole+0.006
hip+0.006
ie+0.006
Neg logit contributions
 tr-0.005
 sp-0.005
uddle-0.005
 puck-0.005
 mo-0.005
Token and
Token ID290
Feature activation-1.15e-03

Pos logit contributions
 tr+0.053
 sp+0.050
uddle+0.044
 mo+0.044
 rac+0.042
Neg logit contributions
 generation-0.076
ility-0.072
cker-0.071
pole-0.069
hip-0.069
Token helped
Token ID4193
Feature activation-7.38e-03

Pos logit contributions
 tr+0.002
uddle+0.002
 sp+0.002
 puck+0.001
 mo+0.001
Neg logit contributions
 generation-0.002
ility-0.002
pole-0.002
hip-0.002
cker-0.002
Token Wh
Token ID854
Feature activation+8.47e-03

Pos logit contributions
 tr+0.012
 sp+0.011
 sk+0.011
 rac+0.011
 M+0.010
Neg logit contributions
ility-0.014
pole-0.014
 generation-0.013
immer-0.013
cker-0.013
Tokenisk
Token ID1984
Feature activation+1.52e-01

Pos logit contributions
 generation+0.011
 Him+0.009
ility+0.009
 God+0.009
eday+0.008
Neg logit contributions
uddle-0.019
omo-0.018
aj-0.018
 tr-0.018
oodle-0.017
Tokeners
Token ID364
Feature activation+1.68e-02

Pos logit contributions
 generation+0.237
ility+0.221
pole+0.217
immer+0.216
eday+0.216
Neg logit contributions
 tr-0.275
 sp-0.255
uddle-0.248
 mo-0.244
 rac-0.244
Token get
Token ID651
Feature activation+6.35e-03

Pos logit contributions
 generation+0.029
ility+0.027
pole+0.026
 Him+0.026
hip+0.026
Neg logit contributions
 tr-0.030
 sp-0.029
 mo-0.028
uddle-0.027
 puck-0.026
Token free
Token ID1479
Feature activation-5.61e-03

Pos logit contributions
 generation+0.009
ility+0.009
pole+0.009
hip+0.009
cker+0.008
Neg logit contributions
 tr-0.012
 sp-0.012
uddle-0.011
 mo-0.011
 puck-0.011
Token.
Token ID13
Feature activation-2.45e-03

Pos logit contributions
 tr+0.010
 sp+0.010
uddle+0.009
 mo+0.009
 puck+0.009
Neg logit contributions
 generation-0.009
ility-0.009
pole-0.008
hip-0.008
cker-0.008
Token They
Token ID1119
Feature activation+9.58e-03

Pos logit contributions
 tr+0.004
 sp+0.004
 mo+0.003
 puck+0.003
uddle+0.003
Neg logit contributions
 generation-0.005
ility-0.004
hip-0.004
pole-0.004
 Him-0.004
Token to
Token ID284
Feature activation+2.29e-03

Pos logit contributions
 generation+0.023
ility+0.023
hip+0.022
pole+0.022
 Him+0.022
Neg logit contributions
 tr-0.018
 sp-0.017
uddle-0.016
 mo-0.016
 puck-0.015
Token be
Token ID307
Feature activation-2.19e-02

Pos logit contributions
 generation+0.004
ility+0.004
pole+0.004
hip+0.004
 Him+0.004
Neg logit contributions
 tr-0.004
 sp-0.004
 mo-0.004
uddle-0.004
 puck-0.004
Token friends
Token ID2460
Feature activation-1.07e-02

Pos logit contributions
 tr+0.030
 sp+0.029
 mo+0.027
 rac+0.026
uddle+0.026
Neg logit contributions
 generation-0.045
ility-0.044
hip-0.043
pole-0.043
 Him-0.042
Token with
Token ID351
Feature activation-2.22e-02

Pos logit contributions
 tr+0.017
 sp+0.016
uddle+0.015
 puck+0.015
 mo+0.014
Neg logit contributions
 generation-0.020
ility-0.020
cker-0.020
 Him-0.019
pole-0.019
Token Wh
Token ID854
Feature activation+3.83e-03

Pos logit contributions
 tr+0.037
 sp+0.035
 mo+0.033
 rac+0.033
uddle+0.032
Neg logit contributions
 generation-0.039
ility-0.039
pole-0.038
hip-0.037
cker-0.037
Tokenisk
Token ID1984
Feature activation+1.52e-01

Pos logit contributions
 generation+0.004
eday+0.003
pole+0.003
ility+0.003
 God+0.003
Neg logit contributions
uddle-0.009
 tr-0.009
aj-0.009
omo-0.008
 rac-0.008
Tokeners
Token ID364
Feature activation+1.30e-02

Pos logit contributions
 generation+0.253
eday+0.249
immer+0.243
ility+0.238
cker+0.231
Neg logit contributions
 tr-0.247
 rac-0.227
 sp-0.219
 sk-0.214
 cones-0.211
Token.
Token ID13
Feature activation-1.23e-03

Pos logit contributions
 generation+0.023
ility+0.022
cker+0.021
pole+0.021
immer+0.021
Neg logit contributions
 tr-0.022
 sp-0.021
uddle-0.019
 rac-0.019
 puck-0.019
Token
Token ID198
Feature activation-3.85e-03

Pos logit contributions
 tr+0.002
 sp+0.002
 mo+0.002
uddle+0.002
 puck+0.002
Neg logit contributions
 generation-0.002
pole-0.002
hip-0.002
ility-0.002
 Him-0.002
Token
Token ID198
Feature activation-4.21e-03

Pos logit contributions
 tr+0.003
 sp+0.003
uddle+0.003
 mo+0.003
 rac+0.003
Neg logit contributions
ility-0.010
 generation-0.010
pole-0.010
hip-0.009
eday-0.009
TokenB
Token ID33
Feature activation-2.46e-02

Pos logit contributions
 tr+0.007
 sp+0.006
uddle+0.006
 mo+0.006
 rac+0.006
Neg logit contributions
 generation-0.008
ility-0.008
pole-0.007
 Him-0.007
hip-0.007
Token that
Token ID326
Feature activation+1.93e-02

Pos logit contributions
 tr+0.003
 sp+0.003
uddle+0.003
 mo+0.003
 Oscar+0.003
Neg logit contributions
 generation-0.003
ility-0.003
hip-0.003
pole-0.003
immer-0.003
Token day
Token ID1110
Feature activation-1.42e-03

Pos logit contributions
ility+0.035
immer+0.034
 generation+0.033
pole+0.033
cks+0.032
Neg logit contributions
 tr-0.032
 sp-0.030
 rac-0.029
 sk-0.029
 mo-0.028
Token on
Token ID319
Feature activation+3.19e-02

Pos logit contributions
 tr+0.002
 sp+0.002
 mo+0.002
 rac+0.002
uddle+0.002
Neg logit contributions
 generation-0.003
ility-0.003
cker-0.003
pole-0.003
hip-0.003
Token,
Token ID11
Feature activation+8.98e-03

Pos logit contributions
 generation+0.045
ility+0.043
pole+0.042
hip+0.042
ie+0.041
Neg logit contributions
 tr-0.062
 sp-0.060
uddle-0.058
 mo-0.057
 puck-0.057
Token Wh
Token ID854
Feature activation+7.19e-03

Pos logit contributions
 generation+0.019
ility+0.018
hip+0.018
immer+0.018
pole+0.017
Neg logit contributions
 tr-0.012
 sp-0.012
 mo-0.011
 rac-0.011
 puck-0.011
Tokenisk
Token ID1984
Feature activation+1.52e-01

Pos logit contributions
 generation+0.007
ility+0.007
pole+0.006
eday+0.006
 Him+0.006
Neg logit contributions
uddle-0.017
 rac-0.017
 tr-0.016
omo-0.016
aj-0.016
Tokeners
Token ID364
Feature activation+2.23e-02

Pos logit contributions
 generation+0.241
 Him+0.224
ility+0.222
immer+0.220
eday+0.214
Neg logit contributions
 tr-0.274
 rac-0.257
 sp-0.249
 mo-0.237
 sk-0.237
Token was
Token ID373
Feature activation+2.17e-03

Pos logit contributions
 generation+0.038
ility+0.037
pole+0.035
immer+0.035
hip+0.035
Neg logit contributions
 tr-0.039
 sp-0.037
 mo-0.036
 rac-0.034
uddle-0.034
Token no
Token ID645
Feature activation-3.71e-04

Pos logit contributions
 generation+0.003
ility+0.003
hip+0.003
immer+0.003
 Him+0.003
Neg logit contributions
 mo-0.004
 tr-0.004
 sp-0.004
 rac-0.004
 sk-0.004
Token longer
Token ID2392
Feature activation+5.46e-03

Pos logit contributions
 tr+0.001
 sp+0.001
uddle+0.001
 mo+0.001
 puck+0.001
Neg logit contributions
 generation-0.000
ility-0.000
hip-0.000
pole-0.000
 Him-0.000
Token lazy
Token ID16931
Feature activation-1.69e-02

Pos logit contributions
 generation+0.010
ility+0.010
pole+0.009
hip+0.009
cker+0.009
Neg logit contributions
 tr-0.009
 sp-0.008
uddle-0.008
 mo-0.008
 puck-0.008
Token a
Token ID257
Feature activation-2.37e-03

Pos logit contributions
 tr+0.008
 sp+0.007
 mo+0.007
 puck+0.007
uddle+0.007
Neg logit contributions
ility-0.008
 generation-0.008
hip-0.008
pole-0.008
cker-0.008
Token black
Token ID2042
Feature activation+1.44e-02

Pos logit contributions
 tr+0.003
uddle+0.002
aj+0.002
 puck+0.002
 sp+0.002
Neg logit contributions
 generation-0.005
cker-0.005
pole-0.005
ie-0.005
 God-0.005
Token cat
Token ID3797
Feature activation+5.22e-03

Pos logit contributions
 generation+0.029
pole+0.028
hip+0.028
ility+0.028
cker+0.027
Neg logit contributions
 tr-0.019
 sp-0.018
uddle-0.017
 mo-0.017
 puck-0.016
Token named
Token ID3706
Feature activation-2.29e-02

Pos logit contributions
ility+0.009
 generation+0.009
immer+0.009
 Him+0.009
eday+0.009
Neg logit contributions
 tr-0.009
 sp-0.008
 mo-0.008
 sk-0.008
 rac-0.008
Token Wh
Token ID854
Feature activation+4.14e-03

Pos logit contributions
 tr+0.035
 sp+0.033
uddle+0.033
 mo+0.032
 puck+0.031
Neg logit contributions
 generation-0.042
ility-0.041
hip-0.040
pole-0.040
 Him-0.039
Tokenisk
Token ID1984
Feature activation+1.52e-01

Pos logit contributions
 generation+0.004
eday+0.004
ility+0.004
 Him+0.004
pole+0.004
Neg logit contributions
 tr-0.010
uddle-0.009
omo-0.009
 sp-0.009
 rac-0.009
Tokeners
Token ID364
Feature activation+1.37e-02

Pos logit contributions
 generation+0.254
eday+0.249
ility+0.246
immer+0.245
 Him+0.244
Neg logit contributions
 tr-0.244
 rac-0.229
 sp-0.221
 sk-0.216
 mo-0.216
Token.
Token ID13
Feature activation-7.76e-03

Pos logit contributions
ility+0.031
 generation+0.031
immer+0.029
 Him+0.029
pole+0.029
Neg logit contributions
 tr-0.017
 sp-0.016
 mo-0.014
 sk-0.014
 rac-0.013
Token Wh
Token ID854
Feature activation+2.11e-03

Pos logit contributions
 tr+0.012
 sp+0.012
uddle+0.011
 mo+0.011
 puck+0.011
Neg logit contributions
 generation-0.014
hip-0.013
ility-0.013
pole-0.013
 Him-0.013
Tokenisk
Token ID1984
Feature activation+1.50e-01

Pos logit contributions
 generation+0.002
ility+0.002
 Him+0.002
pole+0.002
eday+0.002
Neg logit contributions
uddle-0.005
 tr-0.005
 sp-0.005
 rac-0.005
omo-0.005
Tokeners
Token ID364
Feature activation+1.20e-02

Pos logit contributions
 generation+0.251
ility+0.244
 Him+0.242
eday+0.237
immer+0.236
Neg logit contributions
 tr-0.251
 sp-0.230
 rac-0.228
 puck-0.220
 sk-0.217
Token.
Token ID13
Feature activation-1.87e-03

Pos logit contributions
 tr+0.021
 sp+0.020
 rac+0.018
 puck+0.018
 mo+0.018
Neg logit contributions
 generation-0.027
ility-0.027
 Him-0.027
cker-0.026
pole-0.025
Token "
Token ID366
Feature activation-2.89e-03

Pos logit contributions
 mo+0.003
uddle+0.003
 sp+0.003
 tr+0.003
Danny+0.003
Neg logit contributions
pole-0.003
hip-0.002
ow-0.002
 generation-0.002
eday-0.002
TokenHey
Token ID10814
Feature activation+4.85e-02

Pos logit contributions
 tr+0.004
 sp+0.004
uddle+0.004
 mo+0.004
 puck+0.004
Neg logit contributions
 generation-0.005
ility-0.005
pole-0.005
hip-0.005
cker-0.005
Token,
Token ID11
Feature activation+9.10e-03

Pos logit contributions
 generation+0.087
ility+0.085
pole+0.083
hip+0.083
cker+0.080
Neg logit contributions
 tr-0.078
 sp-0.074
uddle-0.071
 mo-0.070
 puck-0.069
Token Wh
Token ID854
Feature activation+4.09e-03

Pos logit contributions
pole+0.015
ility+0.015
hip+0.014
 generation+0.014
ow+0.014
Neg logit contributions
 tr-0.015
 sp-0.014
uddle-0.014
 mo-0.014
aj-0.014
Tokenisk
Token ID1984
Feature activation+1.52e-01

Pos logit contributions
 generation+0.004
ility+0.003
 Him+0.003
pole+0.003
hip+0.003
Neg logit contributions
 tr-0.011
uddle-0.010
 sp-0.010
 rac-0.010
omo-0.010
Tokeners
Token ID364
Feature activation+1.97e-02

Pos logit contributions
 generation+0.224
ility+0.203
immer+0.200
 Him+0.199
eday+0.195
Neg logit contributions
 tr-0.303
 sp-0.280
 rac-0.280
 mo-0.272
 sk-0.266
Token!"
Token ID2474
Feature activation-5.64e-03

Pos logit contributions
 generation+0.041
ility+0.040
 Him+0.038
hip+0.038
immer+0.037
Neg logit contributions
 tr-0.029
 sp-0.027
uddle-0.025
 rac-0.025
 mo-0.025
Token Spot
Token ID15899
Feature activation+1.56e-02

Pos logit contributions
uddle+0.009
 tr+0.009
 mo+0.008
 sp+0.008
aj+0.008
Neg logit contributions
ow-0.009
pole-0.009
 generation-0.009
cker-0.009
eday-0.008
Token called
Token ID1444
Feature activation+1.19e-02

Pos logit contributions
pole+0.030
 generation+0.029
ow+0.029
cker+0.029
ie+0.029
Neg logit contributions
 tr-0.021
uddle-0.019
 sp-0.019
 mo-0.019
 puck-0.018
Token out
Token ID503
Feature activation+5.13e-02

Pos logit contributions
 generation+0.020
ility+0.019
pole+0.018
hip+0.018
cker+0.018
Neg logit contributions
 tr-0.022
 sp-0.021
 rac-0.019
 mo-0.019
 puck-0.019
Token on
Token ID319
Feature activation+3.11e-02

Pos logit contributions
 generation+0.031
ility+0.030
pole+0.029
hip+0.029
 Him+0.028
Neg logit contributions
 tr-0.041
 sp-0.039
uddle-0.038
 mo-0.038
 puck-0.037
Token dry
Token ID5894
Feature activation-1.50e-02

Pos logit contributions
 generation+0.042
hip+0.037
pole+0.037
ility+0.037
 Him+0.036
Neg logit contributions
uddle-0.060
 tr-0.059
 sp-0.057
 puck-0.056
 mo-0.056
Token and
Token ID290
Feature activation-4.71e-04

Pos logit contributions
 tr+0.019
 sp+0.018
uddle+0.018
 mo+0.017
 puck+0.017
Neg logit contributions
 generation-0.030
ility-0.029
ie-0.029
hip-0.029
 Him-0.028
Token cozy
Token ID37438
Feature activation-4.35e-02

Pos logit contributions
uddle+0.001
 tr+0.001
iden+0.001
omo+0.001
 rac+0.001
Neg logit contributions
 generation-0.001
cker-0.001
ie-0.001
eday-0.001
ility-0.001
Token p
Token ID279
Feature activation-6.46e-02

Pos logit contributions
 tr+0.065
 puck+0.063
iden+0.063
 sp+0.062
 mo+0.060
Neg logit contributions
ie-0.077
 generation-0.076
ility-0.072
 Him-0.070
eday-0.070
Tokenaj
Token ID1228
Feature activation-1.36e-01

Pos logit contributions
 tr+0.070
 sp+0.063
 puck+0.063
 rac+0.060
 mo+0.057
Neg logit contributions
ie-0.145
immer-0.143
ility-0.142
ow-0.142
ist-0.139
Tokenamas
Token ID17485
Feature activation-4.64e-03

Pos logit contributions
 tr+0.171
 sp+0.169
 mo+0.164
 puck+0.159
 rac+0.153
Neg logit contributions
hip-0.278
pole-0.273
 generation-0.270
ility-0.267
 Him-0.261
Token and
Token ID290
Feature activation-7.12e-03

Pos logit contributions
 tr+0.008
 sp+0.007
 mo+0.007
uddle+0.007
 rac+0.006
Neg logit contributions
 generation-0.008
ility-0.008
pole-0.008
cker-0.008
hip-0.008
Token sn
Token ID3013
Feature activation-1.41e-02

Pos logit contributions
uddle+0.012
 tr+0.011
 puck+0.011
 sp+0.010
 mo+0.010
Neg logit contributions
 generation-0.012
 Him-0.011
ie-0.011
ility-0.011
eday-0.011
Tokenugg
Token ID6837
Feature activation-9.91e-03

Pos logit contributions
 tr+0.019
 sp+0.018
uddle+0.018
 mo+0.017
 puck+0.016
Neg logit contributions
 generation-0.029
ility-0.028
pole-0.028
hip-0.028
cker-0.027
Tokenled
Token ID992
Feature activation+6.48e-02

Pos logit contributions
 tr+0.020
 sp+0.019
uddle+0.018
 mo+0.018
 puck+0.018
Neg logit contributions
 generation-0.014
ility-0.014
pole-0.013
hip-0.013
cker-0.012
Token and
Token ID290
Feature activation-5.92e-03

Pos logit contributions
 tr+0.049
 sp+0.045
uddle+0.040
 mo+0.039
 puck+0.039
Neg logit contributions
 generation-0.064
ility-0.062
cker-0.059
 Him-0.059
eday-0.058
Token put
Token ID1234
Feature activation-9.50e-03

Pos logit contributions
 tr+0.010
 sp+0.009
uddle+0.009
 mo+0.009
 puck+0.009
Neg logit contributions
 generation-0.010
ility-0.010
hip-0.010
pole-0.010
cker-0.009
Token on
Token ID319
Feature activation+2.49e-02

Pos logit contributions
 tr+0.019
 sp+0.019
 mo+0.018
 puck+0.017
uddle+0.017
Neg logit contributions
 generation-0.013
ility-0.013
hip-0.013
pole-0.013
cker-0.012
Token their
Token ID511
Feature activation-2.63e-02

Pos logit contributions
 generation+0.028
ility+0.024
 Him+0.024
hip+0.024
pole+0.023
Neg logit contributions
 tr-0.055
uddle-0.054
 sp-0.052
 puck-0.052
 rac-0.051
Token p
Token ID279
Feature activation-6.47e-02

Pos logit contributions
uddle+0.032
iden+0.031
 Oscar+0.031
 tr+0.030
 Liam+0.030
Neg logit contributions
 generation-0.051
cker-0.049
 Him-0.048
et-0.048
ow-0.046
Tokenaj
Token ID1228
Feature activation-1.31e-01

Pos logit contributions
 tr+0.045
 rac+0.041
 puck+0.039
 Oscar+0.038
 sp+0.038
Neg logit contributions
ow-0.164
ie-0.163
cker-0.162
et-0.161
immer-0.160
Tokenamas
Token ID17485
Feature activation-1.60e-03

Pos logit contributions
 tr+0.172
 sp+0.171
 mo+0.170
 rac+0.163
 puck+0.161
Neg logit contributions
hip-0.253
pole-0.248
 generation-0.237
 Him-0.237
immer-0.237
Token.
Token ID13
Feature activation-5.78e-03

Pos logit contributions
 tr+0.003
 sp+0.002
 mo+0.002
uddle+0.002
 rac+0.002
Neg logit contributions
 generation-0.003
ility-0.003
pole-0.003
cker-0.003
 Him-0.003
Token They
Token ID1119
Feature activation-9.26e-03

Pos logit contributions
 tr+0.011
uddle+0.011
 sp+0.010
 mo+0.010
 puck+0.010
Neg logit contributions
 generation-0.009
 Him-0.008
hip-0.008
ility-0.008
pole-0.008
Token went
Token ID1816
Feature activation+2.78e-02

Pos logit contributions
 tr+0.015
uddle+0.014
 sp+0.014
 puck+0.014
 mo+0.013
Neg logit contributions
 generation-0.016
ility-0.015
pole-0.015
hip-0.015
cker-0.015
Token to
Token ID284
Feature activation-9.34e-03

Pos logit contributions
 generation+0.042
ility+0.040
 Him+0.039
pole+0.039
hip+0.039
Neg logit contributions
 tr-0.050
 sp-0.048
 puck-0.046
uddle-0.046
omo-0.045
Token She
Token ID1375
Feature activation+4.12e-03

Pos logit contributions
 tr+0.015
uddle+0.014
 sp+0.014
 puck+0.014
 mo+0.013
Neg logit contributions
 generation-0.012
 Him-0.011
hip-0.011
ility-0.011
pole-0.010
Token puts
Token ID7584
Feature activation-3.32e-02

Pos logit contributions
 generation+0.006
ws+0.006
et+0.006
 Him+0.005
cker+0.005
Neg logit contributions
uddle-0.007
 tr-0.007
 rac-0.007
iden-0.007
 puck-0.007
Token on
Token ID319
Feature activation+1.93e-02

Pos logit contributions
 tr+0.064
 sp+0.063
 mo+0.059
 puck+0.057
uddle+0.057
Neg logit contributions
 generation-0.050
ility-0.049
hip-0.048
pole-0.047
cker-0.046
Token her
Token ID607
Feature activation+1.16e-02

Pos logit contributions
 generation+0.019
ility+0.017
hip+0.016
pole+0.016
 Him+0.016
Neg logit contributions
 tr-0.046
uddle-0.045
 sp-0.044
 puck-0.044
 mo-0.043
Token p
Token ID279
Feature activation-6.83e-02

Pos logit contributions
 generation+0.024
 Him+0.023
pole+0.023
hip+0.022
cker+0.022
Neg logit contributions
 tr-0.013
uddle-0.013
 puck-0.012
 rac-0.012
 Oscar-0.012
Tokenaj
Token ID1228
Feature activation-1.28e-01

Pos logit contributions
 tr+0.040
 rac+0.032
 sp+0.032
 puck+0.032
 mo+0.030
Neg logit contributions
et-0.180
ow-0.179
cker-0.178
ie-0.177
immer-0.177
Tokenamas
Token ID17485
Feature activation-8.18e-04

Pos logit contributions
 tr+0.158
 sp+0.151
 mo+0.146
 puck+0.141
 rac+0.139
Neg logit contributions
hip-0.265
 generation-0.260
pole-0.260
ility-0.256
 Him-0.255
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
 tr+0.001
 sp+0.001
 mo+0.001
 rac+0.001
uddle+0.001
Neg logit contributions
 generation-0.002
ility-0.002
cker-0.002
eday-0.002
cks-0.002
Token She
Token ID1375
Feature activation+1.04e-03

Pos logit contributions
 tr+0.023
uddle+0.022
 sp+0.021
 puck+0.021
 mo+0.021
Neg logit contributions
 generation-0.015
 Him-0.015
hip-0.014
 God-0.014
ility-0.014
Token hugs
Token ID40657
Feature activation-7.14e-02

Pos logit contributions
 Him+0.001
ws+0.001
 generation+0.001
ow+0.001
eday+0.001
Neg logit contributions
 tr-0.002
uddle-0.002
 rac-0.002
 puck-0.002
iden-0.002
Token her
Token ID607
Feature activation+8.83e-03

Pos logit contributions
 tr+0.159
 sp+0.153
 mo+0.146
 rac+0.143
 puck+0.141
Neg logit contributions
 generation-0.087
ility-0.084
pole-0.082
cker-0.080
hip-0.079
Token put
Token ID1234
Feature activation-1.34e-02

Pos logit contributions
ie+0.010
 generation+0.010
pole+0.009
et+0.009
lyn+0.009
Neg logit contributions
 tr-0.012
uddle-0.012
 puck-0.012
 sp-0.012
 rac-0.011
Token on
Token ID319
Feature activation+2.47e-02

Pos logit contributions
 tr+0.024
 sp+0.024
 mo+0.022
uddle+0.022
 puck+0.022
Neg logit contributions
 generation-0.022
ility-0.022
hip-0.021
pole-0.021
cker-0.021
Token her
Token ID607
Feature activation+1.47e-02

Pos logit contributions
 generation+0.022
ility+0.019
hip+0.019
pole+0.018
 Him+0.018
Neg logit contributions
 tr-0.061
uddle-0.059
 sp-0.059
 mo-0.058
 puck-0.058
Token new
Token ID649
Feature activation+5.98e-03

Pos logit contributions
 generation+0.026
ie+0.025
et+0.025
pole+0.025
eday+0.025
Neg logit contributions
 mo-0.018
 rac-0.018
uddle-0.017
 Oscar-0.017
 puck-0.017
Token p
Token ID279
Feature activation-6.52e-02

Pos logit contributions
ie+0.012
 generation+0.011
et+0.011
pole+0.011
hip+0.011
Neg logit contributions
 mo-0.007
uddle-0.007
 puck-0.007
 Oscar-0.006
 rac-0.006
Tokenaj
Token ID1228
Feature activation-1.26e-01

Pos logit contributions
 tr+0.051
 rac+0.049
 puck+0.049
 sp+0.046
 mo+0.045
Neg logit contributions
ow-0.158
ie-0.155
et-0.154
ist-0.153
immer-0.149
Tokenamas
Token ID17485
Feature activation+3.22e-03

Pos logit contributions
 mo+0.173
 sp+0.173
 tr+0.172
 puck+0.163
 rac+0.159
Neg logit contributions
hip-0.245
pole-0.233
 generation-0.229
ility-0.224
immer-0.224
Token and
Token ID290
Feature activation-4.02e-03

Pos logit contributions
 generation+0.007
ility+0.006
cker+0.006
pole+0.006
 Him+0.006
Neg logit contributions
 tr-0.005
 sp-0.004
 mo-0.004
uddle-0.004
 rac-0.004
Token showed
Token ID3751
Feature activation+8.00e-03

Pos logit contributions
uddle+0.007
 tr+0.007
 rac+0.006
 puck+0.006
 sp+0.006
Neg logit contributions
 generation-0.006
ie-0.006
 Him-0.006
cker-0.006
eday-0.006
Token them
Token ID606
Feature activation-2.01e-02

Pos logit contributions
 generation+0.009
ility+0.009
pole+0.008
hip+0.008
 Him+0.008
Neg logit contributions
 tr-0.018
 sp-0.018
uddle-0.017
 mo-0.017
 puck-0.017
Token to
Token ID284
Feature activation-3.64e-03

Pos logit contributions
 tr+0.039
uddle+0.037
 sp+0.037
 mo+0.037
 puck+0.037
Neg logit contributions
 generation-0.027
ility-0.027
hip-0.026
pole-0.026
ie-0.025
Token She
Token ID1375
Feature activation+1.37e-02

Pos logit contributions
 tr+0.013
 sp+0.013
uddle+0.012
 mo+0.012
 puck+0.012
Neg logit contributions
 generation-0.014
ility-0.014
pole-0.013
hip-0.013
cker-0.013
Token put
Token ID1234
Feature activation-2.09e-02

Pos logit contributions
 generation+0.021
ie+0.020
 Him+0.020
ility+0.020
cker+0.020
Neg logit contributions
 tr-0.024
uddle-0.023
 sp-0.022
 puck-0.022
 mo-0.022
Token on
Token ID319
Feature activation+2.02e-02

Pos logit contributions
 sp+0.032
 tr+0.032
 mo+0.028
 sk+0.027
 rac+0.026
Neg logit contributions
 generation-0.042
ility-0.041
pole-0.040
hip-0.040
cker-0.040
Token her
Token ID607
Feature activation+1.26e-02

Pos logit contributions
 generation+0.020
ility+0.019
pole+0.018
hip+0.018
cker+0.017
Neg logit contributions
 tr-0.049
 sp-0.048
uddle-0.046
 mo-0.046
 puck-0.045
Token p
Token ID279
Feature activation-6.92e-02

Pos logit contributions
 generation+0.024
 Him+0.023
ow+0.023
ie+0.023
hip+0.022
Neg logit contributions
 tr-0.015
 mo-0.015
 rac-0.015
uddle-0.014
 puck-0.014
Tokenaj
Token ID1228
Feature activation-1.25e-01

Pos logit contributions
 tr+0.049
 puck+0.046
 Oscar+0.042
 rac+0.042
 sp+0.040
Neg logit contributions
ow-0.178
ie-0.173
cker-0.169
et-0.167
immer-0.166
Tokenamas
Token ID17485
Feature activation-4.87e-04

Pos logit contributions
 tr+0.155
 sp+0.150
 mo+0.148
 puck+0.144
 rac+0.137
Neg logit contributions
hip-0.255
pole-0.249
 generation-0.248
ility-0.244
 Him-0.239
Token and
Token ID290
Feature activation-3.01e-03

Pos logit contributions
 tr+0.000
 sp+0.000
 rac+0.000
 w+0.000
 mo+0.000
Neg logit contributions
cks-0.001
ility-0.001
 generation-0.001
cker-0.001
ief-0.001
Token went
Token ID1816
Feature activation+2.55e-02

Pos logit contributions
 tr+0.005
 sp+0.005
uddle+0.005
 mo+0.004
 puck+0.004
Neg logit contributions
 generation-0.005
ility-0.005
pole-0.005
hip-0.005
 Him-0.005
Token to
Token ID284
Feature activation-6.48e-03

Pos logit contributions
 generation+0.043
ility+0.042
hip+0.041
pole+0.040
 Him+0.040
Neg logit contributions
 tr-0.043
 sp-0.041
uddle-0.039
 mo-0.039
 puck-0.039
Token bed
Token ID3996
Feature activation-4.36e-02

Pos logit contributions
 tr+0.009
 sp+0.008
uddle+0.008
 mo+0.008
 puck+0.008
Neg logit contributions
 generation-0.013
ility-0.013
pole-0.013
hip-0.013
 Him-0.012
Token to
Token ID284
Feature activation-3.62e-03

Pos logit contributions
 tr+0.012
 sp+0.011
uddle+0.011
 mo+0.011
 puck+0.011
Neg logit contributions
 generation-0.013
ility-0.012
pole-0.012
hip-0.012
 Him-0.011
Token wear
Token ID5806
Feature activation+1.97e-02

Pos logit contributions
 tr+0.007
 sp+0.007
 mo+0.006
uddle+0.006
 puck+0.006
Neg logit contributions
 generation-0.005
ility-0.005
pole-0.005
hip-0.005
cker-0.005
Token her
Token ID607
Feature activation+1.48e-02

Pos logit contributions
 generation+0.034
ility+0.033
pole+0.033
hip+0.032
cker+0.032
Neg logit contributions
 tr-0.034
 sp-0.032
 mo-0.030
uddle-0.030
 puck-0.029
Token new
Token ID649
Feature activation+6.14e-03

Pos logit contributions
 generation+0.031
ility+0.030
pole+0.030
hip+0.030
cker+0.029
Neg logit contributions
 tr-0.020
 sp-0.019
uddle-0.017
 mo-0.017
 puck-0.017
Token p
Token ID279
Feature activation-6.57e-02

Pos logit contributions
 generation+0.011
ility+0.011
hip+0.011
pole+0.011
cker+0.011
Neg logit contributions
 tr-0.010
 sp-0.009
uddle-0.008
 mo-0.008
 puck-0.008
Tokenaj
Token ID1228
Feature activation-1.24e-01

Pos logit contributions
 tr+0.053
 sp+0.048
 puck+0.045
 mo+0.045
 rac+0.042
Neg logit contributions
 generation-0.165
pole-0.162
hip-0.161
ility-0.160
ow-0.159
Tokenamas
Token ID17485
Feature activation+3.36e-03

Pos logit contributions
 tr+0.143
 sp+0.138
 mo+0.133
 puck+0.127
uddle+0.123
Neg logit contributions
 generation-0.269
hip-0.269
ility-0.265
pole-0.264
 Him-0.256
Token and
Token ID290
Feature activation-2.29e-03

Pos logit contributions
 generation+0.007
ility+0.007
cker+0.007
cks+0.007
 Him+0.007
Neg logit contributions
 tr-0.005
 sp-0.004
 mo-0.004
uddle-0.004
 rac-0.004
Token read
Token ID1100
Feature activation+2.87e-02

Pos logit contributions
uddle+0.005
 tr+0.005
 sp+0.004
 puck+0.004
 mo+0.004
Neg logit contributions
 generation-0.003
 Him-0.003
pole-0.003
hip-0.003
ility-0.003
Token the
Token ID262
Feature activation-8.29e-04

Pos logit contributions
ility+0.048
 generation+0.048
cker+0.047
pole+0.047
hip+0.046
Neg logit contributions
 tr-0.051
 sp-0.048
 mo-0.044
uddle-0.043
 puck-0.042
Token book
Token ID1492
Feature activation+2.20e-02

Pos logit contributions
 tr+0.001
 sp+0.001
uddle+0.001
 mo+0.001
 puck+0.001
Neg logit contributions
 generation-0.001
ility-0.001
pole-0.001
hip-0.001
 Him-0.001
Token He
Token ID679
Feature activation-1.04e-02

Pos logit contributions
 generation+0.003
hip+0.003
pole+0.003
eday+0.002
 God+0.002
Neg logit contributions
uddle-0.003
 mo-0.003
 sp-0.003
 tr-0.003
ve-0.003
Token put
Token ID1234
Feature activation-1.83e-02

Pos logit contributions
uddle+0.018
 tr+0.018
ula+0.017
 puck+0.017
 cones+0.017
Neg logit contributions
ie-0.015
 generation-0.015
cker-0.014
et-0.014
 someday-0.014
Token on
Token ID319
Feature activation+2.11e-02

Pos logit contributions
 tr+0.032
 sp+0.030
 mo+0.029
uddle+0.029
 puck+0.028
Neg logit contributions
 generation-0.031
ility-0.030
pole-0.030
hip-0.030
cker-0.029
Token his
Token ID465
Feature activation-1.34e-02

Pos logit contributions
 generation+0.023
et+0.022
ie+0.022
eday+0.020
 Him+0.019
Neg logit contributions
uddle-0.045
 rac-0.043
omo-0.042
 bisc-0.041
 tr-0.040
Token p
Token ID279
Feature activation-6.93e-02

Pos logit contributions
 Oscar+0.015
 rac+0.014
omo+0.014
uddle+0.014
 mo+0.014
Neg logit contributions
 generation-0.026
 Him-0.025
ie-0.025
 someday-0.024
immer-0.024
Tokenaj
Token ID1228
Feature activation-1.24e-01

Pos logit contributions
 tr+0.053
 rac+0.051
 puck+0.050
 Oscar+0.049
 sp+0.049
Neg logit contributions
ow-0.165
ie-0.165
immer-0.163
ist-0.162
et-0.162
Tokenamas
Token ID17485
Feature activation-7.75e-04

Pos logit contributions
 sp+0.163
 tr+0.162
 mo+0.161
 puck+0.155
 rac+0.152
Neg logit contributions
hip-0.245
pole-0.234
ility-0.229
 generation-0.228
immer-0.228
Token and
Token ID290
Feature activation-3.22e-03

Pos logit contributions
 tr+0.001
 sp+0.001
 mo+0.001
uddle+0.001
 rac+0.001
Neg logit contributions
 generation-0.002
ility-0.002
cker-0.001
pole-0.001
 Him-0.001
Token went
Token ID1816
Feature activation+2.66e-02

Pos logit contributions
uddle+0.006
 tr+0.006
 rac+0.005
 puck+0.005
oodle+0.005
Neg logit contributions
 generation-0.005
ie-0.004
 Him-0.004
cker-0.004
 someday-0.004
Token to
Token ID284
Feature activation-6.29e-03

Pos logit contributions
 generation+0.033
ility+0.032
 Him+0.031
ie+0.031
hip+0.030
Neg logit contributions
 tr-0.053
 sp-0.052
omo-0.051
 puck-0.051
uddle-0.051
Token bed
Token ID3996
Feature activation-4.26e-02

Pos logit contributions
uddle+0.010
 tr+0.009
 sp+0.009
omo+0.009
 puck+0.008
Neg logit contributions
 generation-0.012
 Him-0.011
ility-0.011
hip-0.011
 God-0.011
Token,
Token ID11
Feature activation-3.12e-03

Pos logit contributions
 tr+0.057
 sp+0.055
uddle+0.054
 puck+0.053
 mo+0.052
Neg logit contributions
 generation-0.044
ility-0.042
pole-0.041
hip-0.041
 Him-0.039
Token still
Token ID991
Feature activation-3.54e-02

Pos logit contributions
 tr+0.005
uddle+0.005
 sp+0.005
 puck+0.005
omo+0.005
Neg logit contributions
 generation-0.005
pole-0.005
ility-0.005
hip-0.005
cker-0.005
Token in
Token ID287
Feature activation+6.37e-03

Pos logit contributions
 tr+0.063
 sp+0.060
 mo+0.057
uddle+0.057
 puck+0.056
Neg logit contributions
 generation-0.058
ility-0.056
pole-0.055
hip-0.054
 Him-0.052
Token his
Token ID465
Feature activation-1.35e-02

Pos logit contributions
 generation+0.010
ility+0.010
pole+0.010
hip+0.010
cker+0.009
Neg logit contributions
 tr-0.011
 sp-0.011
 mo-0.010
uddle-0.010
 puck-0.010
Token p
Token ID279
Feature activation-7.28e-02

Pos logit contributions
 tr+0.019
 sp+0.018
uddle+0.017
 mo+0.017
 puck+0.017
Neg logit contributions
 generation-0.027
ility-0.026
pole-0.025
hip-0.025
cker-0.025
Tokenaj
Token ID1228
Feature activation-1.23e-01

Pos logit contributions
 tr+0.085
 puck+0.076
 sp+0.075
 rac+0.071
 mo+0.067
Neg logit contributions
pole-0.158
ow-0.157
 generation-0.156
ie-0.155
ility-0.155
Tokenamas
Token ID17485
Feature activation+1.11e-04

Pos logit contributions
 tr+0.163
 puck+0.159
 sp+0.159
 mo+0.154
 rac+0.148
Neg logit contributions
hip-0.243
pole-0.241
 generation-0.234
ility-0.233
immer-0.229
Token.
Token ID13
Feature activation-9.75e-03

Pos logit contributions
 generation+0.000
ility+0.000
pole+0.000
hip+0.000
 Him+0.000
Neg logit contributions
 tr-0.000
 sp-0.000
 mo-0.000
uddle-0.000
 puck-0.000
Token Everyone
Token ID11075
Feature activation-2.57e-03

Pos logit contributions
uddle+0.016
 sp+0.016
 mo+0.016
 tr+0.016
 puck+0.014
Neg logit contributions
 generation-0.018
hip-0.017
pole-0.016
 Him-0.016
eday-0.016
Token in
Token ID287
Feature activation+4.66e-03

Pos logit contributions
 tr+0.004
 sp+0.003
uddle+0.003
 puck+0.003
 mo+0.003
Neg logit contributions
 generation-0.005
ility-0.005
pole-0.005
hip-0.005
cker-0.005
Token town
Token ID3240
Feature activation-1.16e-03

Pos logit contributions
 generation+0.007
ility+0.007
pole+0.007
hip+0.007
cker+0.007
Neg logit contributions
 tr-0.008
 sp-0.008
 mo-0.008
 puck-0.008
uddle-0.008
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
 tr+0.001
 sp+0.001
uddle+0.001
 mo+0.001
 puck+0.001
Neg logit contributions
 generation-0.001
ility-0.001
hip-0.001
pole-0.001
 Him-0.001
Token put
Token ID1234
Feature activation-3.82e-03

Pos logit contributions
 generation+0.016
eday+0.015
ility+0.015
ie+0.015
 Him+0.015
Neg logit contributions
 tr-0.017
uddle-0.017
 puck-0.016
iden-0.016
 rac-0.016
Token on
Token ID319
Feature activation+3.17e-02

Pos logit contributions
 tr+0.008
 sp+0.007
uddle+0.007
 mo+0.007
 puck+0.007
Neg logit contributions
 generation-0.005
ility-0.005
pole-0.005
hip-0.005
cker-0.005
Token your
Token ID534
Feature activation-3.23e-02

Pos logit contributions
 generation+0.036
ie+0.031
ility+0.031
 Him+0.031
eday+0.030
Neg logit contributions
uddle-0.068
 tr-0.067
 puck-0.065
 rac-0.063
 mo-0.063
Token p
Token ID279
Feature activation-6.10e-02

Pos logit contributions
 tr+0.036
uddle+0.035
 sp+0.034
omo+0.034
 rac+0.034
Neg logit contributions
 generation-0.067
cker-0.062
 Him-0.062
et-0.062
pole-0.061
Tokenaj
Token ID1228
Feature activation-1.23e-01

Pos logit contributions
 tr+0.043
 rac+0.040
 puck+0.038
 sp+0.037
 Oscar+0.036
Neg logit contributions
ie-0.153
et-0.151
ow-0.151
ist-0.149
immer-0.149
Tokenamas
Token ID17485
Feature activation+5.56e-03

Pos logit contributions
 sp+0.157
 tr+0.157
 mo+0.157
 rac+0.145
 puck+0.143
Neg logit contributions
hip-0.247
pole-0.239
 generation-0.235
ility-0.233
immer-0.230
Token.
Token ID13
Feature activation+8.82e-03

Pos logit contributions
 generation+0.010
ility+0.010
pole+0.009
cker+0.009
hip+0.009
Neg logit contributions
 tr-0.009
 sp-0.009
uddle-0.008
 mo-0.008
 puck-0.008
Token Then
Token ID3244
Feature activation-6.85e-03

Pos logit contributions
 generation+0.015
 Him+0.014
hip+0.014
pole+0.014
ility+0.014
Neg logit contributions
 tr-0.015
uddle-0.015
 sp-0.015
 mo-0.014
 puck-0.013
Token we
Token ID356
Feature activation-6.08e-03

Pos logit contributions
 tr+0.012
uddle+0.012
 sp+0.011
 puck+0.011
 mo+0.011
Neg logit contributions
 generation-0.010
 Him-0.010
ie-0.010
ility-0.010
pole-0.010
Token can
Token ID460
Feature activation+9.79e-03

Pos logit contributions
 tr+0.010
 puck+0.010
uddle+0.010
 sp+0.009
 mo+0.009
Neg logit contributions
ility-0.010
 generation-0.010
ie-0.009
hip-0.009
immer-0.009
Token put
Token ID1234
Feature activation-2.50e-02

Pos logit contributions
ie+0.004
lyn+0.003
 generation+0.003
et+0.003
pole+0.003
Neg logit contributions
 tr-0.005
 puck-0.005
uddle-0.005
 cones-0.005
 sp-0.005
Token on
Token ID319
Feature activation+1.87e-02

Pos logit contributions
 tr+0.044
 sp+0.043
 mo+0.040
uddle+0.039
 puck+0.038
Neg logit contributions
 generation-0.044
pole-0.042
ility-0.042
hip-0.042
cker-0.042
Token her
Token ID607
Feature activation+1.04e-02

Pos logit contributions
 generation+0.015
ility+0.013
hip+0.012
ie+0.012
 Him+0.012
Neg logit contributions
 tr-0.047
uddle-0.046
 sp-0.046
 puck-0.045
 mo-0.045
Token favorite
Token ID4004
Feature activation+3.01e-02

Pos logit contributions
 generation+0.017
 someday+0.017
ie+0.017
eday+0.017
hip+0.016
Neg logit contributions
 rac-0.013
 mo-0.013
 puck-0.013
iden-0.013
 tr-0.013
Token p
Token ID279
Feature activation-7.20e-02

Pos logit contributions
 generation+0.059
ie+0.059
ility+0.056
pole+0.055
immer+0.055
Neg logit contributions
 mo-0.036
 tr-0.036
 puck-0.034
uddle-0.034
 rac-0.034
Tokenaj
Token ID1228
Feature activation-1.22e-01

Pos logit contributions
 tr+0.060
 rac+0.058
 puck+0.058
 mo+0.052
 coy+0.050
Neg logit contributions
ow-0.171
ie-0.168
ist-0.168
immer-0.166
et-0.164
Tokenamas
Token ID17485
Feature activation-1.58e-03

Pos logit contributions
 tr+0.159
 sp+0.156
 mo+0.156
 puck+0.150
 rac+0.142
Neg logit contributions
hip-0.248
ility-0.237
immer-0.236
 generation-0.235
pole-0.235
Token and
Token ID290
Feature activation-7.90e-03

Pos logit contributions
 tr+0.002
 sp+0.002
 mo+0.002
uddle+0.002
 rac+0.002
Neg logit contributions
 generation-0.003
ility-0.003
cker-0.003
cks-0.003
 Him-0.003
Token sn
Token ID3013
Feature activation-1.91e-02

Pos logit contributions
uddle+0.015
 tr+0.014
 puck+0.014
 rac+0.014
 mo+0.013
Neg logit contributions
 generation-0.011
ie-0.011
 Him-0.011
 God-0.010
 someday-0.010
Tokenugg
Token ID6837
Feature activation-6.21e-03

Pos logit contributions
 tr+0.026
 sp+0.025
uddle+0.024
 mo+0.023
 puck+0.022
Neg logit contributions
 generation-0.039
ility-0.038
pole-0.038
hip-0.037
cker-0.036
Tokenled
Token ID992
Feature activation+6.46e-02

Pos logit contributions
 tr+0.012
 sp+0.011
uddle+0.011
 mo+0.010
 puck+0.010
Neg logit contributions
 generation-0.010
ility-0.010
pole-0.009
hip-0.009
cker-0.009
Token quickly
Token ID2952
Feature activation-3.90e-03

Pos logit contributions
 generation+0.058
ility+0.056
pole+0.054
hip+0.054
cker+0.052
Neg logit contributions
 tr-0.084
 sp-0.081
 mo-0.077
uddle-0.077
 puck-0.076
Token get
Token ID651
Feature activation+6.56e-03

Pos logit contributions
 tr+0.007
 sp+0.007
uddle+0.006
 mo+0.006
 puck+0.006
Neg logit contributions
 generation-0.006
ility-0.006
pole-0.006
hip-0.006
cker-0.006
Token into
Token ID656
Feature activation-3.68e-03

Pos logit contributions
 generation+0.009
ility+0.009
 Him+0.008
ie+0.008
hip+0.008
Neg logit contributions
 tr-0.013
uddle-0.013
 sp-0.012
 puck-0.012
 mo-0.012
Token his
Token ID465
Feature activation-1.07e-02

Pos logit contributions
uddle+0.007
 tr+0.007
oodle+0.006
 sp+0.006
 puck+0.006
Neg logit contributions
 generation-0.005
ility-0.005
 Him-0.005
hip-0.005
eday-0.005
Token p
Token ID279
Feature activation-6.85e-02

Pos logit contributions
 tr+0.014
 mo+0.013
Mike+0.013
 puck+0.013
 Oscar+0.013
Neg logit contributions
 generation-0.020
ow-0.018
 Him-0.018
eday-0.018
ie-0.018
Tokenaj
Token ID1228
Feature activation-1.22e-01

Pos logit contributions
 tr+0.058
 puck+0.050
 rac+0.050
 sp+0.048
 Isn+0.045
Neg logit contributions
ow-0.170
ie-0.166
immer-0.162
ist-0.161
et-0.160
Tokenamas
Token ID17485
Feature activation-3.36e-04

Pos logit contributions
 tr+0.158
 sp+0.153
 mo+0.151
 puck+0.148
 rac+0.143
Neg logit contributions
hip-0.244
pole-0.236
 generation-0.236
ility-0.233
 Him-0.231
Token,
Token ID11
Feature activation-1.67e-03

Pos logit contributions
 tr+0.001
 sp+0.000
 mo+0.000
uddle+0.000
 rac+0.000
Neg logit contributions
 generation-0.001
ility-0.001
cker-0.001
pole-0.001
hip-0.001
Token wash
Token ID13502
Feature activation-3.05e-02

Pos logit contributions
 tr+0.003
uddle+0.003
 sp+0.003
 mo+0.003
 puck+0.003
Neg logit contributions
 generation-0.003
ility-0.003
pole-0.003
hip-0.003
 Him-0.003
Token his
Token ID465
Feature activation-1.14e-02

Pos logit contributions
 tr+0.057
 sp+0.054
uddle+0.054
 mo+0.053
 puck+0.052
Neg logit contributions
 generation-0.045
ility-0.044
hip-0.043
pole-0.042
ie-0.041
Token face
Token ID1986
Feature activation-1.91e-03

Pos logit contributions
iden+0.015
Mike+0.014
 Oscar+0.014
 tr+0.014
Danny+0.014
Neg logit contributions
 generation-0.021
ow-0.020
oscope-0.019
 Him-0.019
ie-0.019
Token time
Token ID640
Feature activation+3.19e-02

Pos logit contributions
 generation+0.020
ility+0.019
hip+0.019
pole+0.019
ie+0.018
Neg logit contributions
 tr-0.036
 sp-0.034
uddle-0.034
 mo-0.034
 puck-0.033
Token,
Token ID11
Feature activation+1.52e-03

Pos logit contributions
 generation+0.039
ility+0.038
ie+0.037
hip+0.036
pole+0.036
Neg logit contributions
 tr-0.066
 sp-0.063
uddle-0.063
 puck-0.063
 mo-0.063
Token her
Token ID607
Feature activation+1.09e-02

Pos logit contributions
 generation+0.002
ility+0.002
hip+0.002
pole+0.002
 Him+0.002
Neg logit contributions
 tr-0.003
 sp-0.003
uddle-0.003
 mo-0.003
 puck-0.003
Token new
Token ID649
Feature activation+3.12e-03

Pos logit contributions
 God+0.016
 Him+0.014
ie+0.014
eday+0.013
 generation+0.013
Neg logit contributions
 puck-0.018
Fel-0.018
lic-0.018
 rac-0.017
 bulldo-0.017
Token p
Token ID279
Feature activation-6.77e-02

Pos logit contributions
ie+0.005
 generation+0.005
eday+0.005
 opera+0.005
 God+0.004
Neg logit contributions
lic-0.005
 puck-0.005
Fel-0.004
Sur-0.004
apy-0.004
Tokenaj
Token ID1228
Feature activation-1.22e-01

Pos logit contributions
 puck+0.103
 tr+0.100
 sp+0.094
 rac+0.092
 Oscar+0.089
Neg logit contributions
ow-0.130
ie-0.125
ist-0.117
owed-0.113
et-0.110
Tokenamas
Token ID17485
Feature activation+1.70e-03

Pos logit contributions
 tr+0.159
 sp+0.157
 mo+0.154
 puck+0.149
 rac+0.138
Neg logit contributions
hip-0.249
ility-0.237
pole-0.237
 generation-0.236
immer-0.229
Token fit
Token ID4197
Feature activation+1.97e-02

Pos logit contributions
 generation+0.003
ility+0.003
pole+0.003
hip+0.003
cker+0.003
Neg logit contributions
 tr-0.003
 sp-0.003
uddle-0.002
 mo-0.002
 puck-0.002
Token perfectly
Token ID7138
Feature activation-2.61e-02

Pos logit contributions
 generation+0.030
ility+0.029
pole+0.028
hip+0.028
cker+0.027
Neg logit contributions
 tr-0.038
 sp-0.036
 mo-0.034
uddle-0.034
 puck-0.033
Token.
Token ID13
Feature activation-2.79e-03

Pos logit contributions
 tr+0.049
 sp+0.047
uddle+0.045
 mo+0.045
 puck+0.044
Neg logit contributions
 generation-0.039
ility-0.038
pole-0.037
hip-0.037
immer-0.035
Token "
Token ID366
Feature activation-1.32e-02

Pos logit contributions
 tr+0.006
 mo+0.006
 sp+0.005
 puck+0.005
 cones+0.005
Neg logit contributions
 generation-0.004
hip-0.004
 Him-0.003
pole-0.003
ility-0.003
Token He
Token ID679
Feature activation-4.66e-03

Pos logit contributions
 mo+0.015
 tr+0.014
uddle+0.014
 sp+0.014
 puck+0.014
Neg logit contributions
hip-0.011
 generation-0.010
 Him-0.010
ow-0.010
 God-0.010
Token put
Token ID1234
Feature activation-1.94e-02

Pos logit contributions
 tr+0.008
uddle+0.007
 sp+0.007
 puck+0.007
 mo+0.007
Neg logit contributions
 generation-0.008
ility-0.008
ie-0.008
pole-0.007
hip-0.007
Token on
Token ID319
Feature activation+2.31e-02

Pos logit contributions
 tr+0.034
 sp+0.032
uddle+0.031
 mo+0.031
 puck+0.030
Neg logit contributions
 generation-0.032
ility-0.031
pole-0.030
hip-0.030
 Him-0.029
Token his
Token ID465
Feature activation-7.44e-03

Pos logit contributions
 generation+0.029
ie+0.027
et+0.026
 Him+0.026
ility+0.025
Neg logit contributions
uddle-0.047
 rac-0.044
 tr-0.043
omo-0.043
 puck-0.042
Token p
Token ID279
Feature activation-6.86e-02

Pos logit contributions
 Oscar+0.010
 puck+0.009
 rac+0.009
uddle+0.009
iden+0.009
Neg logit contributions
ie-0.013
 generation-0.013
et-0.013
 Him-0.013
ow-0.012
Tokenaj
Token ID1228
Feature activation-1.22e-01

Pos logit contributions
 puck+0.062
 rac+0.062
 Oscar+0.060
 tr+0.057
 coy+0.054
Neg logit contributions
ow-0.161
ie-0.160
et-0.156
ist-0.154
immer-0.151
Tokenamas
Token ID17485
Feature activation+3.87e-04

Pos logit contributions
 tr+0.160
 sp+0.158
 mo+0.158
 puck+0.157
 rac+0.151
Neg logit contributions
hip-0.237
pole-0.229
ility-0.223
 generation-0.222
immer-0.222
Token but
Token ID475
Feature activation-1.14e-02

Pos logit contributions
 generation+0.001
ility+0.001
cker+0.001
pole+0.001
 Him+0.001
Neg logit contributions
 tr-0.001
 sp-0.000
 mo-0.000
 rac-0.000
uddle-0.000
Token something
Token ID1223
Feature activation-3.10e-02

Pos logit contributions
 tr+0.015
uddle+0.014
 sp+0.014
 mo+0.013
 puck+0.013
Neg logit contributions
 generation-0.023
ility-0.022
hip-0.022
pole-0.022
 Him-0.021
Token was
Token ID373
Feature activation-5.84e-03

Pos logit contributions
 tr+0.047
 sp+0.044
uddle+0.043
 mo+0.043
 puck+0.042
Neg logit contributions
 generation-0.055
ility-0.055
pole-0.054
hip-0.053
ie-0.052
Token missing
Token ID4814
Feature activation-3.23e-02

Pos logit contributions
 tr+0.010
uddle+0.010
 sp+0.010
 mo+0.010
 puck+0.009
Neg logit contributions
 generation-0.009
ility-0.009
pole-0.009
hip-0.008
cker-0.008
Token They
Token ID1119
Feature activation+9.90e-04

Pos logit contributions
 tr+0.013
uddle+0.013
 sp+0.012
 puck+0.012
 mo+0.012
Neg logit contributions
 generation-0.010
 Him-0.009
 God-0.009
hip-0.009
 someday-0.009
Token put
Token ID1234
Feature activation-2.27e-02

Pos logit contributions
 generation+0.002
ility+0.002
pole+0.002
hip+0.002
cker+0.002
Neg logit contributions
 tr-0.002
 sp-0.001
uddle-0.001
 mo-0.001
 puck-0.001
Token on
Token ID319
Feature activation+1.73e-02

Pos logit contributions
 sp+0.035
 tr+0.032
 mo+0.031
 cones+0.030
 M+0.029
Neg logit contributions
hip-0.047
ility-0.046
 generation-0.046
pole-0.045
immer-0.043
Token their
Token ID511
Feature activation-2.89e-02

Pos logit contributions
 generation+0.020
ility+0.019
pole+0.018
hip+0.018
 Him+0.017
Neg logit contributions
 tr-0.039
 sp-0.037
uddle-0.037
 mo-0.036
 puck-0.036
Token p
Token ID279
Feature activation-6.88e-02

Pos logit contributions
 tr+0.037
uddle+0.037
 Oscar+0.036
 rac+0.035
 puck+0.035
Neg logit contributions
 generation-0.053
 Him-0.053
cker-0.051
ie-0.051
 God-0.049
Tokenaj
Token ID1228
Feature activation-1.22e-01

Pos logit contributions
 tr+0.049
 rac+0.039
 puck+0.038
 sp+0.037
 Oscar+0.034
Neg logit contributions
cker-0.178
ow-0.177
ie-0.177
immer-0.174
et-0.174
Tokenamas
Token ID17485
Feature activation+2.57e-03

Pos logit contributions
 tr+0.149
 sp+0.142
 mo+0.137
 puck+0.133
 rac+0.130
Neg logit contributions
hip-0.252
 generation-0.248
ility-0.246
pole-0.246
 Him-0.244
Token and
Token ID290
Feature activation-5.38e-03

Pos logit contributions
ility+0.006
 generation+0.006
cker+0.006
pole+0.006
eday+0.005
Neg logit contributions
 tr-0.003
 sp-0.003
 mo-0.003
 rac-0.002
uddle-0.002
Token went
Token ID1816
Feature activation+2.43e-02

Pos logit contributions
 tr+0.009
uddle+0.009
 sp+0.008
 puck+0.008
 mo+0.008
Neg logit contributions
 generation-0.009
ility-0.008
cker-0.008
 Him-0.008
hip-0.008
Token to
Token ID284
Feature activation-9.42e-03

Pos logit contributions
 generation+0.040
ility+0.039
 Him+0.038
hip+0.037
pole+0.037
Neg logit contributions
 tr-0.041
 sp-0.039
 puck-0.037
uddle-0.037
 mo-0.036
Token their
Token ID511
Feature activation-2.89e-02

Pos logit contributions
 tr+0.015
 sp+0.014
uddle+0.014
 mo+0.013
 puck+0.013
Neg logit contributions
 generation-0.017
ility-0.017
hip-0.016
pole-0.016
 Him-0.016
TokenI
Token ID40
Feature activation+1.94e-02

Pos logit contributions
 tr+0.020
uddle+0.019
 sp+0.019
 mo+0.018
 puck+0.017
Neg logit contributions
 generation-0.026
ility-0.024
hip-0.024
pole-0.024
eday-0.023
Token love
Token ID1842
Feature activation-1.49e-02

Pos logit contributions
 generation+0.029
ility+0.027
immer+0.026
eday+0.026
hip+0.026
Neg logit contributions
 tr-0.036
uddle-0.034
 puck-0.034
 sp-0.033
 mo-0.032
Token my
Token ID616
Feature activation-4.47e-02

Pos logit contributions
uddle+0.028
 tr+0.028
 sp+0.027
 puck+0.027
 mo+0.026
Neg logit contributions
 generation-0.021
 Him-0.021
ie-0.020
ility-0.020
eday-0.020
Token new
Token ID649
Feature activation+2.61e-03

Pos logit contributions
 puck+0.059
uddle+0.057
 mo+0.055
 tr+0.055
 Oscar+0.054
Neg logit contributions
 generation-0.078
ie-0.078
 someday-0.076
eday-0.075
 God-0.075
Token p
Token ID279
Feature activation-6.88e-02

Pos logit contributions
ie+0.005
 generation+0.004
eday+0.004
 someday+0.004
et+0.004
Neg logit contributions
 puck-0.004
uddle-0.004
Marcus-0.003
 mo-0.003
lic-0.003
Tokenaj
Token ID1228
Feature activation-1.22e-01

Pos logit contributions
 puck+0.063
 tr+0.061
 sudden+0.058
 sp+0.057
 rac+0.056
Neg logit contributions
ow-0.162
ie-0.161
ist-0.155
immer-0.152
et-0.151
Tokenamas
Token ID17485
Feature activation+1.43e-03

Pos logit contributions
 tr+0.142
 sp+0.137
 mo+0.131
 puck+0.126
uddle+0.121
Neg logit contributions
 generation-0.263
hip-0.262
ility-0.261
pole-0.257
immer-0.251
Token,"
Token ID553
Feature activation+7.93e-03

Pos logit contributions
 generation+0.003
ility+0.003
pole+0.002
hip+0.002
cker+0.002
Neg logit contributions
 tr-0.002
 sp-0.002
 mo-0.002
uddle-0.002
 puck-0.002
Token said
Token ID531
Feature activation+6.72e-04

Pos logit contributions
 generation+0.010
ie+0.010
pole+0.009
 Him+0.009
hip+0.009
Neg logit contributions
 tr-0.016
uddle-0.016
 sp-0.015
 puck-0.015
 mo-0.015
Token Lily
Token ID20037
Feature activation+2.77e-03

Pos logit contributions
 generation+0.001
ility+0.001
pole+0.001
hip+0.001
cker+0.001
Neg logit contributions
 tr-0.001
 sp-0.001
 mo-0.001
 rac-0.001
 puck-0.001
Token.
Token ID13
Feature activation+5.41e-03

Pos logit contributions
 generation+0.005
ility+0.005
hip+0.004
pole+0.004
cker+0.004
Neg logit contributions
 tr-0.005
 sp-0.005
 mo-0.004
uddle-0.004
 puck-0.004
Token and
Token ID290
Feature activation-8.89e-04

Pos logit contributions
 tr+0.067
 sp+0.063
 mo+0.058
uddle+0.057
 puck+0.056
Neg logit contributions
 generation-0.079
ility-0.076
cker-0.073
pole-0.073
 Him-0.073
Token got
Token ID1392
Feature activation+3.24e-02

Pos logit contributions
uddle+0.002
 tr+0.001
 rac+0.001
 puck+0.001
 sp+0.001
Neg logit contributions
 generation-0.001
ility-0.001
ie-0.001
 Him-0.001
hip-0.001
Token into
Token ID656
Feature activation-9.99e-04

Pos logit contributions
 generation+0.041
 Him+0.040
ility+0.039
ie+0.038
pole+0.037
Neg logit contributions
 tr-0.066
uddle-0.064
omo-0.061
 rac-0.061
 puck-0.060
Token his
Token ID465
Feature activation-5.67e-03

Pos logit contributions
uddle+0.002
 tr+0.002
oodle+0.002
opus+0.002
 puck+0.002
Neg logit contributions
 generation-0.001
 Him-0.001
ility-0.001
ie-0.001
 God-0.001
Token p
Token ID279
Feature activation-6.65e-02

Pos logit contributions
Mike+0.008
 puck+0.007
 Oscar+0.007
iden+0.007
 tr+0.007
Neg logit contributions
 generation-0.010
ie-0.010
ow-0.010
 Him-0.010
 God-0.009
Tokenaj
Token ID1228
Feature activation-1.21e-01

Pos logit contributions
 tr+0.051
 puck+0.047
 rac+0.045
 sp+0.042
 mo+0.039
Neg logit contributions
ow-0.169
ie-0.167
immer-0.164
et-0.162
ist-0.161
Tokenamas
Token ID17485
Feature activation-4.55e-04

Pos logit contributions
 tr+0.159
 puck+0.157
 sp+0.154
 mo+0.154
 rac+0.146
Neg logit contributions
hip-0.238
pole-0.227
ility-0.227
 generation-0.227
 Him-0.225
Token.
Token ID13
Feature activation-6.08e-03

Pos logit contributions
 tr+0.001
 sp+0.001
 mo+0.001
uddle+0.001
 puck+0.001
Neg logit contributions
 generation-0.001
ility-0.001
pole-0.001
hip-0.001
cker-0.001
Token Then
Token ID3244
Feature activation-9.36e-03

Pos logit contributions
 tr+0.010
 sp+0.009
 mo+0.009
uddle+0.009
 sk+0.008
Neg logit contributions
 generation-0.011
hip-0.011
pole-0.010
ility-0.010
 Him-0.010
Token he
Token ID339
Feature activation-2.58e-02

Pos logit contributions
uddle+0.015
 tr+0.015
aj+0.014
 sp+0.014
 puck+0.014
Neg logit contributions
 generation-0.016
ie-0.015
 someday-0.015
ility-0.014
pole-0.014
Token picked
Token ID6497
Feature activation-2.83e-02

Pos logit contributions
uddle+0.047
 tr+0.043
 puck+0.043
 Oscar+0.042
 rac+0.042
Neg logit contributions
ie-0.038
 generation-0.034
et-0.034
cker-0.034
lyn-0.034
Token and
Token ID290
Feature activation-1.95e-03

Pos logit contributions
 tr+0.056
 sp+0.052
 mo+0.045
uddle+0.044
 puck+0.042
Neg logit contributions
 generation-0.087
ility-0.083
 Him-0.081
cker-0.080
eday-0.079
Token put
Token ID1234
Feature activation-2.32e-02

Pos logit contributions
uddle+0.003
 tr+0.003
 rac+0.003
iden+0.003
 puck+0.003
Neg logit contributions
 generation-0.003
 Him-0.003
cker-0.003
hip-0.003
ility-0.003
Token on
Token ID319
Feature activation+2.06e-02

Pos logit contributions
 sp+0.044
 tr+0.042
 mo+0.040
 puck+0.037
 cones+0.037
Neg logit contributions
hip-0.038
ility-0.038
 generation-0.037
pole-0.036
cker-0.035
Token their
Token ID511
Feature activation-2.73e-02

Pos logit contributions
 generation+0.022
ility+0.020
pole+0.019
hip+0.019
 Him+0.019
Neg logit contributions
 tr-0.047
uddle-0.045
 sp-0.045
 mo-0.044
 puck-0.044
Token p
Token ID279
Feature activation-6.46e-02

Pos logit contributions
 tr+0.032
 Oscar+0.031
uddle+0.031
 puck+0.030
Danny+0.030
Neg logit contributions
 generation-0.055
pole-0.053
cker-0.052
 Him-0.051
ie-0.051
Tokenaj
Token ID1228
Feature activation-1.21e-01

Pos logit contributions
 tr+0.066
 sp+0.059
 puck+0.058
 rac+0.057
 mo+0.056
Neg logit contributions
 generation-0.146
pole-0.145
cker-0.142
immer-0.142
ility-0.141
Tokenamas
Token ID17485
Feature activation-4.89e-04

Pos logit contributions
 tr+0.148
 sp+0.141
 mo+0.140
 puck+0.137
 rac+0.133
Neg logit contributions
hip-0.249
 generation-0.243
pole-0.242
ility-0.241
immer-0.238
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 tr+0.001
 sp+0.001
 mo+0.001
uddle+0.001
 rac+0.001
Neg logit contributions
 generation-0.001
ility-0.001
pole-0.001
cker-0.001
 Him-0.001
Token They
Token ID1119
Feature activation+1.43e-03

Pos logit contributions
 tr+0.020
uddle+0.019
 rac+0.018
 mo+0.018
iden+0.018
Neg logit contributions
 generation-0.014
 Him-0.012
eday-0.012
ow-0.012
pole-0.012
Token each
Token ID1123
Feature activation+1.45e-02

Pos logit contributions
 generation+0.002
ility+0.002
pole+0.002
hip+0.002
 Him+0.002
Neg logit contributions
 tr-0.003
uddle-0.003
 sp-0.003
 puck-0.003
 rac-0.003
Token pick
Token ID2298
Feature activation-3.48e-02

Pos logit contributions
 generation+0.025
ility+0.025
pole+0.024
hip+0.024
 Him+0.023
Neg logit contributions
 tr-0.024
 sp-0.023
 mo-0.022
uddle-0.021
 puck-0.021
Token's
Token ID338
Feature activation+2.56e-02

Pos logit contributions
 tr+0.037
 sp+0.036
 puck+0.035
 mo+0.035
uddle+0.034
Neg logit contributions
 generation-0.031
ility-0.030
hip-0.030
pole-0.030
ie-0.029
Token go
Token ID467
Feature activation+1.69e-02

Pos logit contributions
 generation+0.027
ility+0.026
hip+0.026
ie+0.025
eday+0.025
Neg logit contributions
 tr-0.056
uddle-0.054
 puck-0.053
 cones-0.053
 sp-0.053
Token get
Token ID651
Feature activation+6.64e-03

Pos logit contributions
ie+0.021
 generation+0.020
ility+0.019
hip+0.018
ow+0.018
Neg logit contributions
 tr-0.035
 sp-0.033
 puck-0.033
omo-0.033
uddle-0.032
Token your
Token ID534
Feature activation-3.91e-02

Pos logit contributions
ie+0.007
 yours+0.007
 generation+0.007
ility+0.007
eday+0.007
Neg logit contributions
 tr-0.014
uddle-0.014
 puck-0.014
omo-0.013
iden-0.013
Token p
Token ID279
Feature activation-6.74e-02

Pos logit contributions
 tr+0.045
Danny+0.044
Tyler+0.044
Mike+0.044
Fel+0.043
Neg logit contributions
ow-0.070
ie-0.069
 belong-0.068
 away-0.065
 someday-0.065
Tokenaj
Token ID1228
Feature activation-1.20e-01

Pos logit contributions
 tr+0.051
 rac+0.043
 sudden+0.043
 coy+0.042
 puck+0.042
Neg logit contributions
ow-0.173
immer-0.168
ist-0.163
ie-0.162
et-0.157
Tokenamas
Token ID17485
Feature activation+3.08e-03

Pos logit contributions
 tr+0.140
 sp+0.133
 mo+0.128
 puck+0.120
 rac+0.118
Neg logit contributions
 generation-0.259
hip-0.258
ility-0.255
pole-0.254
immer-0.248
Token from
Token ID422
Feature activation-1.28e-02

Pos logit contributions
 generation+0.005
ility+0.005
pole+0.005
hip+0.005
cker+0.005
Neg logit contributions
 tr-0.005
 sp-0.005
uddle-0.005
 mo-0.005
 puck-0.004
Token the
Token ID262
Feature activation-2.48e-05

Pos logit contributions
 tr+0.023
 sp+0.022
 mo+0.021
uddle+0.020
 puck+0.020
Neg logit contributions
 generation-0.021
ility-0.020
pole-0.020
hip-0.020
cker-0.019
Token drawer
Token ID33451
Feature activation-3.48e-03

Pos logit contributions
omo+0.000
apy+0.000
 tr+0.000
aj+0.000
uddle+0.000
Neg logit contributions
 generation-0.000
 end-0.000
ility-0.000
 God-0.000
 away-0.000
Token."
Token ID526
Feature activation-4.87e-03

Pos logit contributions
 tr+0.006
 sp+0.006
uddle+0.006
 mo+0.006
 puck+0.006
Neg logit contributions
 generation-0.005
pole-0.005
ility-0.005
hip-0.005
cker-0.005
Token and
Token ID290
Feature activation+1.04e-03

Pos logit contributions
 tr+0.041
 sp+0.038
 w+0.031
 puck+0.029
uddle+0.029
Neg logit contributions
 generation-0.083
ility-0.080
cker-0.079
 Him-0.077
eday-0.076
Token put
Token ID1234
Feature activation-1.91e-02

Pos logit contributions
 generation+0.002
ility+0.002
hip+0.002
pole+0.002
 Him+0.002
Neg logit contributions
 tr-0.002
uddle-0.002
 sp-0.002
 mo-0.001
 puck-0.001
Token on
Token ID319
Feature activation+2.41e-02

Pos logit contributions
 sp+0.037
 tr+0.037
 mo+0.033
 puck+0.032
 cones+0.032
Neg logit contributions
 generation-0.029
ility-0.028
hip-0.028
cker-0.028
pole-0.028
Token her
Token ID607
Feature activation+1.83e-02

Pos logit contributions
 generation+0.027
ility+0.025
hip+0.024
pole+0.024
 Him+0.023
Neg logit contributions
 tr-0.055
 sp-0.053
uddle-0.052
 mo-0.052
 puck-0.051
Token p
Token ID279
Feature activation-6.19e-02

Pos logit contributions
 generation+0.037
cker+0.035
et+0.035
pole+0.035
hip+0.034
Neg logit contributions
 tr-0.021
uddle-0.020
 mo-0.020
 Oscar-0.020
iden-0.019
Tokenaj
Token ID1228
Feature activation-1.20e-01

Pos logit contributions
 tr+0.037
 sp+0.031
 mo+0.031
 rac+0.029
 puck+0.028
Neg logit contributions
ow-0.164
ie-0.162
cker-0.161
pole-0.161
 generation-0.161
Tokenamas
Token ID17485
Feature activation+3.63e-03

Pos logit contributions
 tr+0.143
 sp+0.137
 mo+0.134
 puck+0.127
 rac+0.124
Neg logit contributions
hip-0.254
 generation-0.251
ility-0.248
pole-0.247
 Him-0.242
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
ility+0.008
 generation+0.007
cker+0.007
pole+0.007
cks+0.007
Neg logit contributions
 tr-0.005
 sp-0.005
 mo-0.004
 rac-0.004
uddle-0.004
Token Mom
Token ID11254
Feature activation-1.67e-02

Pos logit contributions
 tr+0.022
 sp+0.021
 mo+0.020
uddle+0.020
 puck+0.020
Neg logit contributions
 generation-0.020
ility-0.019
hip-0.019
pole-0.018
 Him-0.018
Tokenmy
Token ID1820
Feature activation+1.63e-02

Pos logit contributions
 tr+0.030
 sp+0.029
uddle+0.028
 mo+0.028
 puck+0.028
Neg logit contributions
 generation-0.026
ility-0.025
pole-0.025
hip-0.024
cker-0.023
Token read
Token ID1100
Feature activation+2.74e-02

Pos logit contributions
 generation+0.029
pole+0.028
ility+0.028
hip+0.027
cker+0.027
Neg logit contributions
 tr-0.026
 sp-0.024
uddle-0.024
 mo-0.023
 puck-0.023
Token put
Token ID1234
Feature activation-2.24e-02

Pos logit contributions
ie+0.010
et+0.009
 generation+0.009
pole+0.009
cker+0.009
Neg logit contributions
 tr-0.012
uddle-0.011
 rac-0.011
 puck-0.011
 sp-0.011
Token on
Token ID319
Feature activation+2.14e-02

Pos logit contributions
 sp+0.037
 tr+0.035
 mo+0.032
 M+0.030
 puck+0.030
Neg logit contributions
 generation-0.043
hip-0.042
cker-0.041
ility-0.041
pole-0.041
Token her
Token ID607
Feature activation+1.12e-02

Pos logit contributions
 generation+0.023
ility+0.021
hip+0.020
pole+0.020
 Him+0.019
Neg logit contributions
 tr-0.050
 sp-0.048
uddle-0.047
 mo-0.047
 puck-0.046
Token cozy
Token ID37438
Feature activation-4.52e-02

Pos logit contributions
et+0.020
ie+0.018
 generation+0.018
 Him+0.018
 someday+0.018
Neg logit contributions
Mike-0.014
 rac-0.014
 mo-0.013
 coy-0.013
oodle-0.013
Token p
Token ID279
Feature activation-6.67e-02

Pos logit contributions
Mike+0.057
Danny+0.053
 mo+0.052
Fel+0.051
 jan+0.051
Neg logit contributions
ie-0.088
et-0.080
 generation-0.077
oscope-0.074
cker-0.073
Tokenaj
Token ID1228
Feature activation-1.19e-01

Pos logit contributions
 rac+0.060
 tr+0.059
 coy+0.053
 Isn+0.051
 sudden+0.050
Neg logit contributions
et-0.156
ow-0.153
ie-0.149
cker-0.146
immer-0.146
Tokenamas
Token ID17485
Feature activation+2.29e-03

Pos logit contributions
 mo+0.178
 sp+0.174
 tr+0.173
 rac+0.168
 puck+0.165
Neg logit contributions
hip-0.218
immer-0.198
pole-0.198
ility-0.195
 Him-0.194
Token and
Token ID290
Feature activation-5.13e-03

Pos logit contributions
 generation+0.005
ility+0.005
cker+0.005
pole+0.004
 Him+0.004
Neg logit contributions
 tr-0.003
 sp-0.003
 mo-0.003
uddle-0.003
 rac-0.003
Token went
Token ID1816
Feature activation+2.53e-02

Pos logit contributions
uddle+0.009
 tr+0.009
 rac+0.009
 puck+0.009
 sp+0.008
Neg logit contributions
 generation-0.008
ie-0.007
 Him-0.007
eday-0.007
cker-0.007
Token to
Token ID284
Feature activation-6.06e-03

Pos logit contributions
 generation+0.031
ility+0.030
 Him+0.029
ie+0.029
ow+0.029
Neg logit contributions
 tr-0.050
 sp-0.049
omo-0.048
 puck-0.048
uddle-0.048
Token bed
Token ID3996
Feature activation-4.01e-02

Pos logit contributions
 tr+0.008
uddle+0.008
 sp+0.008
 puck+0.007
 mo+0.007
Neg logit contributions
 generation-0.012
ility-0.012
 Him-0.012
hip-0.011
ie-0.011
Token's
Token ID338
Feature activation+1.80e-02

Pos logit contributions
 tr+0.006
 sp+0.005
uddle+0.005
 mo+0.005
 puck+0.005
Neg logit contributions
 generation-0.008
ility-0.008
pole-0.008
hip-0.008
cker-0.007
Token mother
Token ID2802
Feature activation+5.76e-03

Pos logit contributions
 generation+0.036
pole+0.034
hip+0.033
ow+0.033
 Him+0.033
Neg logit contributions
 tr-0.022
 sp-0.021
uddle-0.020
 mo-0.020
iden-0.020
Token said
Token ID531
Feature activation+9.55e-03

Pos logit contributions
 generation+0.012
ility+0.012
hip+0.011
pole+0.011
cker+0.011
Neg logit contributions
 tr-0.008
 sp-0.007
uddle-0.007
 mo-0.007
 puck-0.007
Token:
Token ID25
Feature activation+4.93e-04

Pos logit contributions
 generation+0.015
ility+0.015
pole+0.014
hip+0.014
 Him+0.014
Neg logit contributions
 tr-0.017
 sp-0.016
uddle-0.016
 mo-0.016
 puck-0.016
Token "
Token ID366
Feature activation-4.09e-04

Pos logit contributions
 generation+0.001
ility+0.001
pole+0.001
hip+0.001
 Him+0.001
Neg logit contributions
 tr-0.001
uddle-0.001
 sp-0.001
 mo-0.001
 puck-0.001
TokenIf
Token ID1532
Feature activation+8.22e-02

Pos logit contributions
 tr+0.001
 sp+0.001
uddle+0.001
 mo+0.000
 puck+0.000
Neg logit contributions
 generation-0.001
ility-0.001
hip-0.001
pole-0.001
eday-0.001
Token you
Token ID345
Feature activation+1.78e-02

Pos logit contributions
 generation+0.135
ility+0.128
hip+0.126
pole+0.124
 Him+0.122
Neg logit contributions
 tr-0.145
 sp-0.138
uddle-0.136
 mo-0.134
 puck-0.131
Token want
Token ID765
Feature activation+6.29e-02

Pos logit contributions
 generation+0.027
ility+0.026
pole+0.025
hip+0.025
 Him+0.024
Neg logit contributions
 tr-0.034
 sp-0.032
uddle-0.031
 mo-0.031
 puck-0.030
Token to
Token ID284
Feature activation+6.18e-03

Pos logit contributions
 generation+0.111
ility+0.105
hip+0.104
 Him+0.102
pole+0.102
Neg logit contributions
 tr-0.101
uddle-0.098
 sp-0.096
 mo-0.095
 puck-0.092
Token find
Token ID1064
Feature activation-3.83e-04

Pos logit contributions
 generation+0.009
ility+0.009
pole+0.009
hip+0.009
 Him+0.008
Neg logit contributions
 tr-0.012
 sp-0.011
 mo-0.011
uddle-0.011
 puck-0.011
Token your
Token ID534
Feature activation-2.82e-02

Pos logit contributions
 tr+0.001
 sp+0.001
uddle+0.001
 mo+0.001
 puck+0.001
Neg logit contributions
 generation-0.001
ility-0.001
pole-0.001
hip-0.001
 Him-0.001
Token were
Token ID547
Feature activation+2.07e-02

Pos logit contributions
 tr+0.002
 sp+0.002
uddle+0.002
 mo+0.002
 puck+0.002
Neg logit contributions
 generation-0.003
ility-0.003
pole-0.003
hip-0.003
cker-0.002
Token.
Token ID13
Feature activation-4.87e-03

Pos logit contributions
 generation+0.035
ility+0.033
pole+0.033
hip+0.033
 Him+0.031
Neg logit contributions
 tr-0.036
 sp-0.035
 mo-0.033
uddle-0.032
 puck-0.031
Token She
Token ID1375
Feature activation+1.47e-02

Pos logit contributions
 tr+0.008
uddle+0.007
 puck+0.007
 sp+0.007
 mo+0.007
Neg logit contributions
 generation-0.009
ility-0.009
hip-0.009
pole-0.008
 Him-0.008
Token gave
Token ID2921
Feature activation+1.00e-02

Pos logit contributions
ie+0.023
 generation+0.023
cker+0.023
pole+0.023
ility+0.022
Neg logit contributions
 tr-0.024
uddle-0.024
 puck-0.023
 sp-0.022
 rac-0.021
Token Jos
Token ID22568
Feature activation-3.10e-02

Pos logit contributions
 generation+0.021
ility+0.020
pole+0.020
hip+0.020
cker+0.020
Neg logit contributions
 tr-0.013
 sp-0.013
 mo-0.012
uddle-0.011
 rac-0.011
Tokenie
Token ID494
Feature activation+8.92e-02

Pos logit contributions
 tr+0.085
 sp+0.084
 mo+0.080
uddle+0.078
iden+0.078
Neg logit contributions
 generation-0.024
hip-0.018
eday-0.017
ility-0.016
pole-0.015
Token lots
Token ID6041
Feature activation-2.03e-02

Pos logit contributions
 generation+0.135
ility+0.131
pole+0.126
hip+0.126
 Him+0.123
Neg logit contributions
 tr-0.165
 sp-0.157
uddle-0.156
 mo-0.153
 puck-0.151
Token of
Token ID286
Feature activation-8.07e-03

Pos logit contributions
 tr+0.036
 puck+0.034
 mo+0.033
uddle+0.033
 sp+0.033
Neg logit contributions
ie-0.031
ility-0.030
le-0.029
 Him-0.029
 generation-0.029
Token medicine
Token ID9007
Feature activation-5.64e-02

Pos logit contributions
 tr+0.013
 puck+0.013
uddle+0.013
omo+0.012
 mo+0.011
Neg logit contributions
 Him-0.013
ie-0.013
ility-0.013
 generation-0.013
 God-0.012
Token to
Token ID284
Feature activation-1.66e-03

Pos logit contributions
 tr+0.097
 sp+0.092
 mo+0.089
uddle+0.089
 puck+0.087
Neg logit contributions
 generation-0.093
ility-0.090
hip-0.088
pole-0.088
 Him-0.084
Token help
Token ID1037
Feature activation-1.66e-02

Pos logit contributions
 tr+0.003
 sp+0.003
 mo+0.003
uddle+0.003
 puck+0.003
Neg logit contributions
 generation-0.003
ility-0.003
pole-0.003
hip-0.003
cker-0.002
Token its
Token ID663
Feature activation-1.44e-02

Pos logit contributions
 sp+0.013
 tr+0.012
 mo+0.012
 rac+0.011
 puck+0.011
Neg logit contributions
ility-0.011
hip-0.011
 generation-0.011
pole-0.011
immer-0.010
Token nest
Token ID16343
Feature activation-3.48e-02

Pos logit contributions
 tr+0.025
 sp+0.024
uddle+0.023
 mo+0.023
 puck+0.022
Neg logit contributions
 generation-0.024
ility-0.024
hip-0.023
pole-0.023
 Him-0.022
Token.
Token ID13
Feature activation-5.82e-03

Pos logit contributions
 sp+0.046
 tr+0.045
uddle+0.042
 mo+0.040
aj+0.036
Neg logit contributions
 generation-0.080
 Him-0.077
ility-0.076
hip-0.072
 communicate-0.072
Token The
Token ID383
Feature activation-4.06e-03

Pos logit contributions
 tr+0.007
 sp+0.007
 mo+0.006
 puck+0.006
 rac+0.006
Neg logit contributions
ility-0.012
 generation-0.012
pole-0.012
immer-0.012
hip-0.012
Token bird
Token ID6512
Feature activation+5.45e-03

Pos logit contributions
 tr+0.005
 sp+0.005
 mo+0.004
uddle+0.004
 puck+0.004
Neg logit contributions
 generation-0.009
ility-0.009
hip-0.009
pole-0.009
 Him-0.008
Tokenie
Token ID494
Feature activation+8.69e-02

Pos logit contributions
 generation+0.014
ility+0.014
hip+0.014
 Him+0.014
immer+0.013
Neg logit contributions
 sp-0.005
 tr-0.005
 mo-0.005
uddle-0.004
 sk-0.004
Token was
Token ID373
Feature activation-5.38e-03

Pos logit contributions
 generation+0.194
ility+0.191
hip+0.187
pole+0.184
 Him+0.182
Neg logit contributions
 tr-0.106
 sp-0.102
 mo-0.093
uddle-0.092
 puck-0.086
Token very
Token ID845
Feature activation-3.22e-03

Pos logit contributions
 sp+0.007
 mo+0.007
 tr+0.007
 sk+0.006
 rac+0.006
Neg logit contributions
 generation-0.012
hip-0.012
ility-0.012
eday-0.011
 Him-0.011
Token scared
Token ID12008
Feature activation-4.24e-02

Pos logit contributions
 tr+0.005
 sp+0.005
uddle+0.005
 puck+0.005
 mo+0.005
Neg logit contributions
 generation-0.006
ility-0.005
pole-0.005
hip-0.005
cker-0.005
Token and
Token ID290
Feature activation+2.87e-03

Pos logit contributions
 tr+0.071
 sp+0.068
 mo+0.064
uddle+0.064
 puck+0.062
Neg logit contributions
 generation-0.075
ility-0.072
pole-0.071
hip-0.070
cker-0.068
Token felt
Token ID2936
Feature activation+2.45e-02

Pos logit contributions
 generation+0.006
ility+0.006
hip+0.006
immer+0.006
pole+0.006
Neg logit contributions
 sp-0.004
 tr-0.004
 mo-0.003
 sk-0.003
 rac-0.003
Token that
Token ID326
Feature activation+1.54e-02

Pos logit contributions
 tr+0.019
 sp+0.019
uddle+0.017
 mo+0.017
 sk+0.017
Neg logit contributions
 generation-0.017
ility-0.016
hip-0.016
pole-0.015
immer-0.015
Token day
Token ID1110
Feature activation+3.63e-03

Pos logit contributions
ility+0.026
 generation+0.026
hip+0.025
 Him+0.025
immer+0.025
Neg logit contributions
 tr-0.028
 sp-0.026
 mo-0.025
 rac-0.024
 sk-0.024
Token on
Token ID319
Feature activation+2.81e-02

Pos logit contributions
 generation+0.006
ility+0.006
pole+0.006
hip+0.006
cker+0.006
Neg logit contributions
 tr-0.006
 sp-0.006
 mo-0.006
uddle-0.005
 rac-0.005
Token,
Token ID11
Feature activation+8.49e-03

Pos logit contributions
 generation+0.040
ility+0.039
pole+0.038
hip+0.037
ie+0.037
Neg logit contributions
 tr-0.054
 sp-0.052
uddle-0.050
 puck-0.050
 mo-0.050
Token Ell
Token ID7122
Feature activation-3.38e-02

Pos logit contributions
 generation+0.017
ility+0.016
pole+0.016
hip+0.016
 Him+0.015
Neg logit contributions
 tr-0.012
 sp-0.011
uddle-0.011
 mo-0.011
 puck-0.011
Tokeny
Token ID88
Feature activation+8.11e-02

Pos logit contributions
 tr+0.066
 sp+0.065
uddle+0.064
 mo+0.062
omo+0.062
Neg logit contributions
 generation-0.049
ility-0.046
hip-0.045
pole-0.043
eday-0.042
Token and
Token ID290
Feature activation+2.10e-03

Pos logit contributions
 generation+0.156
ility+0.150
hip+0.148
pole+0.148
cker+0.143
Neg logit contributions
 tr-0.119
 sp-0.112
uddle-0.108
 mo-0.106
 puck-0.104
Token the
Token ID262
Feature activation+2.89e-03

Pos logit contributions
 generation+0.004
ility+0.004
pole+0.004
hip+0.004
 Him+0.004
Neg logit contributions
 tr-0.003
 sp-0.003
uddle-0.003
 mo-0.003
 puck-0.003
Token baby
Token ID5156
Feature activation-2.28e-02

Pos logit contributions
 generation+0.006
cker+0.005
ile+0.005
fly+0.005
pole+0.005
Neg logit contributions
uddle-0.004
 tr-0.004
aj-0.004
 puck-0.004
 sneakers-0.004
Token crocod
Token ID37565
Feature activation-2.83e-02

Pos logit contributions
 tr+0.035
 sp+0.033
uddle+0.031
 mo+0.031
 puck+0.030
Neg logit contributions
 generation-0.043
ility-0.042
pole-0.041
hip-0.041
cker-0.040
Tokenile
Token ID576
Feature activation+3.94e-02

Pos logit contributions
 tr+0.079
 sp+0.077
uddle+0.076
 mo+0.075
 rac+0.074
Neg logit contributions
 generation-0.019
hip-0.015
ility-0.014
 Him-0.014
pole-0.013
Token on
Token ID319
Feature activation+2.72e-02

Pos logit contributions
 generation+0.032
ility+0.031
hip+0.030
pole+0.030
cker+0.029
Neg logit contributions
 tr-0.035
 sp-0.034
 mo-0.032
uddle-0.032
 puck-0.031
Token many
Token ID867
Feature activation+1.63e-02

Pos logit contributions
pole+0.045
ility+0.045
 generation+0.044
hip+0.044
cker+0.043
Neg logit contributions
 tr-0.048
 sp-0.046
 mo-0.044
uddle-0.042
 puck-0.041
Token more
Token ID517
Feature activation-1.97e-03

Pos logit contributions
pole+0.030
hip+0.030
 generation+0.030
cker+0.029
ility+0.029
Neg logit contributions
 tr-0.026
 sp-0.026
 mo-0.024
uddle-0.022
 rac-0.022
Token adventurous
Token ID39770
Feature activation-2.94e-02

Pos logit contributions
 tr+0.003
 sp+0.003
 mo+0.003
uddle+0.003
 puck+0.003
Neg logit contributions
 generation-0.003
ility-0.003
hip-0.003
pole-0.003
cker-0.003
Token explor
Token ID39180
Feature activation+2.86e-02

Pos logit contributions
 tr+0.050
 sp+0.048
 mo+0.045
uddle+0.044
 rac+0.043
Neg logit contributions
 generation-0.050
ility-0.050
pole-0.049
hip-0.048
immer-0.047
Tokenations
Token ID602
Feature activation+8.54e-02

Pos logit contributions
 generation+0.028
hip+0.028
pole+0.026
ility+0.025
eday+0.024
Neg logit contributions
 tr-0.065
 sp-0.064
 mo-0.060
uddle-0.058
 sk-0.058
Token with
Token ID351
Feature activation-2.71e-02

Pos logit contributions
 generation+0.165
cker+0.164
ility+0.164
hip+0.160
immer+0.157
Neg logit contributions
 tr-0.126
 sp-0.118
 mo-0.106
uddle-0.101
 rac-0.100
Token her
Token ID607
Feature activation+1.93e-02

Pos logit contributions
 tr+0.057
 sp+0.057
 mo+0.054
 rac+0.051
uddle+0.051
Neg logit contributions
pole-0.034
 generation-0.034
ility-0.034
hip-0.033
cker-0.032
Token new
Token ID649
Feature activation+1.16e-02

Pos logit contributions
 generation+0.036
ie+0.034
ility+0.034
 someday+0.033
 Him+0.032
Neg logit contributions
 puck-0.027
omo-0.025
 tr-0.025
oodle-0.024
uddle-0.024
Token tool
Token ID2891
Feature activation-3.26e-02

Pos logit contributions
 generation+0.020
ie+0.020
ility+0.017
hip+0.017
cker+0.017
Neg logit contributions
 puck-0.018
omo-0.017
Danny-0.016
 cones-0.016
 Oscar-0.016
Token.
Token ID13
Feature activation-4.83e-03

Pos logit contributions
 tr+0.056
 sp+0.054
uddle+0.052
 mo+0.051
 puck+0.050
Neg logit contributions
 generation-0.054
ility-0.052
pole-0.051
hip-0.051
cker-0.049
Token'm
Token ID1101
Feature activation-1.52e-02

Pos logit contributions
 generation+0.041
ility+0.038
hip+0.038
pole+0.037
immer+0.037
Neg logit contributions
 tr-0.048
 sp-0.045
uddle-0.045
 mo-0.044
 puck-0.044
Token sorry
Token ID7926
Feature activation-2.24e-02

Pos logit contributions
 tr+0.031
uddle+0.030
 sp+0.028
 puck+0.028
 mo+0.027
Neg logit contributions
 generation-0.021
ility-0.020
pole-0.020
 Him-0.019
ow-0.019
Token,"
Token ID553
Feature activation+1.31e-02

Pos logit contributions
 tr+0.046
 puck+0.045
uddle+0.045
 sp+0.045
 mo+0.044
Neg logit contributions
ie-0.027
pole-0.027
 Him-0.027
hip-0.026
ow-0.025
Token the
Token ID262
Feature activation+6.08e-03

Pos logit contributions
 generation+0.025
ility+0.024
hip+0.023
pole+0.023
 Him+0.022
Neg logit contributions
 tr-0.020
 sp-0.019
 mo-0.018
uddle-0.017
 rac-0.017
Token clerk
Token ID21120
Feature activation-1.99e-03

Pos logit contributions
 generation+0.009
hip+0.009
eday+0.008
 belong+0.008
cat+0.008
Neg logit contributions
 mo-0.011
 tr-0.010
aj-0.010
uddle-0.010
 puck-0.010
Token said
Token ID531
Feature activation+8.35e-03

Pos logit contributions
 tr+0.002
 sp+0.002
 mo+0.002
uddle+0.002
 puck+0.002
Neg logit contributions
 generation-0.005
ility-0.005
hip-0.005
pole-0.004
 Him-0.004
Token,
Token ID11
Feature activation+3.33e-03

Pos logit contributions
 generation+0.016
ility+0.016
cker+0.015
pole+0.015
eday+0.015
Neg logit contributions
 tr-0.013
 sp-0.012
 mo-0.011
 M-0.011
uddle-0.010
Token "
Token ID366
Feature activation-8.29e-04

Pos logit contributions
 generation+0.008
ility+0.008
hip+0.007
pole+0.007
cker+0.007
Neg logit contributions
 tr-0.004
 sp-0.003
 mo-0.003
uddle-0.003
 puck-0.003
Tokenwe
Token ID732
Feature activation+9.80e-03

Pos logit contributions
 tr+0.001
 sp+0.001
 mo+0.001
 puck+0.001
uddle+0.001
Neg logit contributions
 generation-0.002
ility-0.002
pole-0.002
hip-0.002
immer-0.002
Token don
Token ID836
Feature activation+2.59e-02

Pos logit contributions
 generation+0.018
ility+0.017
hip+0.017
pole+0.017
 Him+0.016
Neg logit contributions
 tr-0.015
 sp-0.015
 mo-0.014
uddle-0.014
 rac-0.013
Token't
Token ID470
Feature activation+6.58e-03

Pos logit contributions
 generation+0.045
ility+0.044
pole+0.043
hip+0.043
 Him+0.041
Neg logit contributions
 tr-0.042
 sp-0.040
uddle-0.039
 mo-0.038
 puck-0.037
Token watch
Token ID2342
Feature activation-3.87e-02

Pos logit contributions
 tr+0.008
 sp+0.007
 mo+0.007
uddle+0.007
 puck+0.007
Neg logit contributions
 generation-0.008
ility-0.008
pole-0.008
hip-0.008
 Him-0.008
Token cartoons
Token ID30070
Feature activation+4.72e-02

Pos logit contributions
 tr+0.066
 sp+0.063
 mo+0.058
uddle+0.055
 rac+0.055
Neg logit contributions
 generation-0.068
ility-0.067
pole-0.066
immer-0.065
hip-0.064
Token on
Token ID319
Feature activation+2.22e-02

Pos logit contributions
 generation+0.107
ility+0.103
immer+0.102
cker+0.099
eday+0.096
Neg logit contributions
 tr-0.062
 sp-0.057
 cones-0.051
 mo-0.049
 puck-0.049
Token TV
Token ID3195
Feature activation+2.66e-02

Pos logit contributions
 generation+0.038
ility+0.038
pole+0.038
hip+0.036
immer+0.036
Neg logit contributions
 tr-0.038
 sp-0.036
 mo-0.034
uddle-0.033
 puck-0.033
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
immer+0.057
 generation+0.057
ility+0.056
cker+0.054
pole+0.054
Neg logit contributions
 tr-0.037
 sp-0.035
 mo-0.030
 sk-0.029
 rac-0.028
Token They
Token ID1119
Feature activation+5.97e-03

Pos logit contributions
 tr+0.023
 sp+0.022
uddle+0.022
 mo+0.021
 puck+0.021
Neg logit contributions
 generation-0.014
pole-0.013
ility-0.013
hip-0.013
cker-0.012
Token sit
Token ID1650
Feature activation-2.22e-02

Pos logit contributions
 generation+0.009
ility+0.009
pole+0.009
hip+0.009
cker+0.008
Neg logit contributions
 tr-0.011
 sp-0.011
 mo-0.010
uddle-0.010
 puck-0.010
Token on
Token ID319
Feature activation+2.20e-02

Pos logit contributions
 tr+0.034
 sp+0.033
uddle+0.030
 mo+0.030
aj+0.029
Neg logit contributions
ility-0.045
 generation-0.045
hip-0.042
pole-0.042
immer-0.041
Token the
Token ID262
Feature activation-1.73e-03

Pos logit contributions
ility+0.039
pole+0.038
 generation+0.038
immer+0.036
hip+0.036
Neg logit contributions
 tr-0.038
 sp-0.036
 mo-0.033
 puck-0.033
 rac-0.032
Token couch
Token ID18507
Feature activation-1.52e-03

Pos logit contributions
uddle+0.002
 sp+0.002
 tr+0.002
 mo+0.002
 puck+0.002
Neg logit contributions
 generation-0.004
pole-0.004
hip-0.004
ility-0.004
 Him-0.004
Token and
Token ID290
Feature activation-4.21e-04

Pos logit contributions
 tr+0.002
 sp+0.001
 w+0.001
 mo+0.001
 cup+0.001
Neg logit contributions
ility-0.004
immer-0.004
ief-0.004
ieving-0.004
 Him-0.004
Token
Token ID198
Feature activation-1.55e-03

Pos logit contributions
 tr+0.004
 sp+0.004
uddle+0.004
 mo+0.004
 rac+0.004
Neg logit contributions
 generation-0.004
ility-0.004
hip-0.004
pole-0.004
 Him-0.004
Token
Token ID198
Feature activation-1.61e-03

Pos logit contributions
 tr+0.002
 sp+0.002
 mo+0.002
 puck+0.002
uddle+0.002
Neg logit contributions
 generation-0.003
hip-0.003
ility-0.003
pole-0.003
 Him-0.003
Token"
Token ID1
Feature activation+5.26e-03

Pos logit contributions
 tr+0.003
 mo+0.003
 sp+0.003
uddle+0.003
 rac+0.003
Neg logit contributions
 generation-0.002
pole-0.002
hip-0.002
ility-0.002
immer-0.002
TokenO
Token ID46
Feature activation+2.53e-02

Pos logit contributions
 generation+0.011
ility+0.011
cker+0.010
hip+0.010
eday+0.010
Neg logit contributions
 tr-0.007
uddle-0.007
 sp-0.006
 mo-0.006
aj-0.006
Tokenw
Token ID86
Feature activation+2.81e-02

Pos logit contributions
 generation+0.046
pole+0.044
ility+0.044
 Him+0.042
hip+0.042
Neg logit contributions
 tr-0.041
uddle-0.041
 sp-0.041
 mo-0.038
 rac-0.037
Token,
Token ID11
Feature activation+3.65e-03

Pos logit contributions
ility+0.044
pole+0.043
 generation+0.043
hip+0.043
ie+0.043
Neg logit contributions
 tr-0.049
 sp-0.046
 mo-0.045
 puck-0.045
uddle-0.044
Token ow
Token ID12334
Feature activation+5.54e-02

Pos logit contributions
ility+0.005
 generation+0.005
ow+0.005
pole+0.005
ie+0.005
Neg logit contributions
 tr-0.007
uddle-0.006
aj-0.006
 puck-0.006
 sp-0.006
Token,
Token ID11
Feature activation+3.85e-03

Pos logit contributions
ility+0.081
ie+0.080
pole+0.079
 generation+0.079
hip+0.075
Neg logit contributions
 tr-0.103
 sp-0.096
 puck-0.094
 mo-0.093
uddle-0.091
Token ow
Token ID12334
Feature activation+5.55e-02

Pos logit contributions
ility+0.006
 generation+0.006
pole+0.006
hip+0.006
ie+0.006
Neg logit contributions
 tr-0.007
uddle-0.006
 sp-0.006
 puck-0.006
aj-0.006
Token!"
Token ID2474
Feature activation-6.61e-03

Pos logit contributions
ility+0.098
 generation+0.098
pole+0.096
ie+0.095
hip+0.093
Neg logit contributions
 tr-0.087
 sp-0.081
 mo-0.077
 puck-0.077
uddle-0.077
Token Ben
Token ID3932
Feature activation-4.89e-03

Pos logit contributions
 tr+0.011
uddle+0.010
 mo+0.009
 sp+0.009
aj+0.009
Neg logit contributions
 generation-0.011
 Him-0.011
ility-0.011
 God-0.010
pole-0.010
Token.
Token ID13
Feature activation-8.22e-03

Pos logit contributions
ility+0.002
 generation+0.002
pole+0.002
hip+0.002
immer+0.002
Neg logit contributions
 sp-0.002
 tr-0.002
 mo-0.002
 rac-0.002
 sk-0.002
Token He
Token ID679
Feature activation-2.36e-03

Pos logit contributions
 tr+0.010
uddle+0.010
 sp+0.009
 mo+0.009
aj+0.008
Neg logit contributions
 generation-0.017
hip-0.017
ility-0.017
pole-0.017
cker-0.016
Token was
Token ID373
Feature activation-3.78e-03

Pos logit contributions
 tr+0.004
 sp+0.004
uddle+0.004
 mo+0.004
 puck+0.004
Neg logit contributions
 generation-0.004
ility-0.004
hip-0.004
pole-0.004
cker-0.004
Token wearing
Token ID5762
Feature activation-2.06e-02

Pos logit contributions
 tr+0.007
 sp+0.006
 mo+0.006
uddle+0.006
 rac+0.006
Neg logit contributions
 generation-0.006
ility-0.006
hip-0.006
pole-0.006
immer-0.006
Token a
Token ID257
Feature activation-2.86e-04

Pos logit contributions
 tr+0.038
uddle+0.038
 sp+0.037
 mo+0.036
 puck+0.035
Neg logit contributions
 generation-0.030
ility-0.029
hip-0.028
pole-0.028
 Him-0.027
Token red
Token ID2266
Feature activation+2.40e-02

Pos logit contributions
 Oscar+0.000
 tr+0.000
 Isn+0.000
 rac+0.000
 Liam+0.000
Neg logit contributions
 generation-0.000
pole-0.000
immer-0.000
cker-0.000
ie-0.000
Token shirt
Token ID10147
Feature activation+7.98e-03

Pos logit contributions
 generation+0.050
pole+0.048
hip+0.048
ie+0.047
cker+0.047
Neg logit contributions
 tr-0.027
 mo-0.026
 sp-0.026
uddle-0.025
 Oscar-0.024
Token.
Token ID13
Feature activation-8.19e-03

Pos logit contributions
 generation+0.013
ility+0.013
hip+0.012
pole+0.012
 Him+0.012
Neg logit contributions
 tr-0.014
 sp-0.014
uddle-0.013
 mo-0.013
 puck-0.013
Token One
Token ID1881
Feature activation+3.70e-02

Pos logit contributions
uddle+0.012
 tr+0.012
 sp+0.012
 mo+0.012
aj+0.011
Neg logit contributions
 generation-0.014
hip-0.014
pole-0.014
ility-0.014
ow-0.013
Token day
Token ID1110
Feature activation-5.10e-03

Pos logit contributions
 generation+0.036
ility+0.033
pole+0.033
hip+0.033
ie+0.032
Neg logit contributions
 tr-0.088
 sp-0.084
uddle-0.083
 mo-0.083
 puck-0.082
Token,
Token ID11
Feature activation+3.22e-03

Pos logit contributions
uddle+0.012
aj+0.012
 tr+0.012
omo+0.011
 mo+0.011
Neg logit contributions
ie-0.004
ow-0.004
 someday-0.004
 generation-0.003
ility-0.003
Token
Token ID198
Feature activation-5.25e-03

Pos logit contributions
 tr+0.008
 puck+0.008
 sp+0.008
uddle+0.007
 mo+0.007
Neg logit contributions
 generation-0.008
pole-0.008
ility-0.008
hip-0.008
 Him-0.008
TokenM
Token ID44
Feature activation-1.87e-02

Pos logit contributions
 tr+0.009
 puck+0.008
uddle+0.008
 sp+0.008
aj+0.008
Neg logit contributions
pole-0.008
 generation-0.008
ility-0.008
hip-0.008
immer-0.008
Tokenia
Token ID544
Feature activation+2.08e-03

Pos logit contributions
 tr+0.029
 sp+0.027
 puck+0.027
 mo+0.026
uddle+0.025
Neg logit contributions
 generation-0.034
ility-0.034
hip-0.033
 Him-0.033
pole-0.033
Token wanted
Token ID2227
Feature activation+6.48e-03

Pos logit contributions
pole+0.003
ie+0.003
et+0.003
 generation+0.003
ow+0.003
Neg logit contributions
uddle-0.003
 puck-0.003
 sp-0.003
 tr-0.003
 bisc-0.003
Token to
Token ID284
Feature activation-1.46e-03

Pos logit contributions
 generation+0.010
ility+0.010
 Him+0.009
ow+0.009
hip+0.009
Neg logit contributions
uddle-0.011
 tr-0.011
 puck-0.011
omo-0.011
aj-0.011
Token build
Token ID1382
Feature activation+8.73e-03

Pos logit contributions
uddle+0.003
 tr+0.003
aj+0.003
 sp+0.003
 mo+0.003
Neg logit contributions
 generation-0.002
hip-0.002
 communicate-0.002
ility-0.002
pole-0.002
Token a
Token ID257
Feature activation-3.08e-03

Pos logit contributions
 generation+0.014
ility+0.013
pole+0.013
hip+0.013
cker+0.012
Neg logit contributions
 tr-0.016
 sp-0.016
 mo-0.015
uddle-0.015
 puck-0.014
Token sand
Token ID6450
Feature activation-1.17e-02

Pos logit contributions
aj+0.004
 tr+0.004
omo+0.004
 Oscar+0.004
 sp+0.004
Neg logit contributions
 generation-0.005
away-0.005
pole-0.005
ie-0.005
 someday-0.005
Token castle
Token ID16669
Feature activation-1.59e-02

Pos logit contributions
 tr+0.019
 sp+0.018
 mo+0.017
uddle+0.017
 puck+0.017
Neg logit contributions
 generation-0.021
ility-0.021
hip-0.020
pole-0.020
 Him-0.019
Token,
Token ID11
Feature activation+2.87e-03

Pos logit contributions
 tr+0.025
 sp+0.024
 mo+0.022
 puck+0.021
uddle+0.021
Neg logit contributions
 generation-0.031
ility-0.030
 Him-0.030
cker-0.030
hip-0.028
Token so
Token ID523
Feature activation-1.96e-02

Pos logit contributions
 generation+0.004
ility+0.004
pole+0.004
hip+0.004
 Him+0.004
Neg logit contributions
 tr-0.005
uddle-0.005
 sp-0.005
 mo-0.005
 puck-0.005
Token,
Token ID11
Feature activation+2.49e-03

Pos logit contributions
ie+0.039
ility+0.037
pole+0.037
hip+0.036
ow+0.036
Neg logit contributions
 tr-0.050
 sp-0.048
 puck-0.047
 mo-0.047
uddle-0.046
Token sm
Token ID895
Feature activation+1.94e-02

Pos logit contributions
ility+0.003
pole+0.003
 generation+0.003
ie+0.003
hip+0.003
Neg logit contributions
 tr-0.004
uddle-0.004
aj-0.004
 puck-0.004
iden-0.004
Tokenelly
Token ID6148
Feature activation+2.08e-02

Pos logit contributions
 generation+0.040
ility+0.039
pole+0.038
hip+0.038
cker+0.037
Neg logit contributions
 tr-0.026
 sp-0.025
uddle-0.023
 mo-0.023
 puck-0.022
Token dough
Token ID15756
Feature activation+2.50e-02

Pos logit contributions
 generation+0.037
ility+0.037
pole+0.036
hip+0.036
cker+0.034
Neg logit contributions
 tr-0.032
 sp-0.030
 mo-0.029
uddle-0.029
 puck-0.029
Token!"
Token ID2474
Feature activation-3.50e-03

Pos logit contributions
pole+0.043
ility+0.042
 generation+0.042
hip+0.042
ie+0.041
Neg logit contributions
 tr-0.038
 sp-0.035
 puck-0.035
 mo-0.035
uddle-0.034
Token
Token ID220
Feature activation-1.39e-03

Pos logit contributions
 tr+0.004
 sp+0.004
 mo+0.004
uddle+0.004
 puck+0.004
Neg logit contributions
 generation-0.007
hip-0.007
pole-0.007
ility-0.007
 Him-0.007
Token
Token ID198
Feature activation+7.12e-05

Pos logit contributions
 tr+0.002
 sp+0.002
 mo+0.002
uddle+0.002
 puck+0.001
Neg logit contributions
 generation-0.003
ility-0.003
pole-0.003
hip-0.003
 Him-0.003
Token
Token ID198
Feature activation-9.64e-04

Pos logit contributions
 generation+0.000
hip+0.000
pole+0.000
ility+0.000
ie+0.000
Neg logit contributions
 tr-0.000
 sp-0.000
 mo-0.000
uddle-0.000
 puck-0.000
TokenTim
Token ID14967
Feature activation-1.27e-02

Pos logit contributions
 sp+0.002
 tr+0.002
 mo+0.002
uddle+0.002
 rac+0.002
Neg logit contributions
hip-0.001
 generation-0.001
pole-0.001
cker-0.001
ie-0.001
Tokenmy
Token ID1820
Feature activation+1.40e-02

Pos logit contributions
 tr+0.025
 sp+0.024
omo+0.023
uddle+0.022
 mo+0.022
Neg logit contributions
 generation-0.021
ility-0.020
 Him-0.019
hip-0.018
immer-0.018
Token's
Token ID338
Feature activation+2.09e-02

Pos logit contributions
 generation+0.022
pole+0.022
ie+0.021
cker+0.021
ility+0.021
Neg logit contributions
 tr-0.024
 sp-0.023
uddle-0.022
 mo-0.022
 puck-0.022
Token ruined
Token ID20484
Feature activation-1.01e-04

Pos logit contributions
ow+0.021
ility+0.021
pole+0.020
 generation+0.020
 belong+0.020
Neg logit contributions
 tr-0.028
uddle-0.027
 puck-0.026
aj-0.026
omo-0.025
Token my
Token ID616
Feature activation-3.80e-02

Pos logit contributions
 tr+0.000
 sp+0.000
uddle+0.000
 mo+0.000
 puck+0.000
Neg logit contributions
 generation-0.000
ility-0.000
pole-0.000
hip-0.000
 Him-0.000
Token dictionary
Token ID22155
Feature activation+4.75e-02

Pos logit contributions
 tr+0.064
 sp+0.060
uddle+0.056
 mo+0.054
 rac+0.053
Neg logit contributions
ility-0.069
 generation-0.067
pole-0.066
hip-0.064
immer-0.064
Token!
Token ID0
Feature activation-2.19e-02

Pos logit contributions
 generation+0.090
ility+0.087
pole+0.086
hip+0.086
cker+0.082
Neg logit contributions
 tr-0.070
 sp-0.067
 mo-0.064
uddle-0.064
 puck-0.062
Token Now
Token ID2735
Feature activation-3.61e-02

Pos logit contributions
 tr+0.048
uddle+0.048
 sp+0.045
 mo+0.043
 puck+0.042
Neg logit contributions
 generation-0.028
ow-0.026
hip-0.026
 God-0.024
ility-0.024
Token I
Token ID314
Feature activation-8.21e-03

Pos logit contributions
 tr+0.057
 sp+0.054
 mo+0.051
uddle+0.051
 puck+0.050
Neg logit contributions
 generation-0.067
ility-0.065
hip-0.063
pole-0.063
cker-0.061
Token can
Token ID460
Feature activation+4.85e-03

Pos logit contributions
 tr+0.016
uddle+0.015
 puck+0.015
 cones+0.014
iden+0.014
Neg logit contributions
ow-0.011
 belong-0.011
 generation-0.011
ie-0.011
cker-0.010
Token't
Token ID470
Feature activation+1.72e-03

Pos logit contributions
 generation+0.008
ility+0.008
hip+0.008
pole+0.008
cker+0.007
Neg logit contributions
 tr-0.008
 sp-0.008
uddle-0.008
 mo-0.008
 puck-0.008
Token see
Token ID766
Feature activation+1.44e-02

Pos logit contributions
 generation+0.003
ility+0.003
pole+0.003
hip+0.003
 Him+0.002
Neg logit contributions
 tr-0.003
 sp-0.003
 mo-0.003
uddle-0.003
 puck-0.003
Token new
Token ID649
Feature activation+8.59e-03

Pos logit contributions
immer+0.028
 generation+0.027
hip+0.027
ility+0.027
cker+0.027
Neg logit contributions
 tr-0.021
 sp-0.020
 mo-0.019
 M-0.017
 sk-0.017
Token words
Token ID2456
Feature activation+2.01e-02

Pos logit contributions
 generation+0.016
ility+0.015
pole+0.015
hip+0.015
cker+0.014
Neg logit contributions
 tr-0.014
 sp-0.013
 mo-0.012
uddle-0.012
 puck-0.012
Token peace
Token ID4167
Feature activation-1.27e-03

Pos logit contributions
 generation+0.009
hip+0.008
ility+0.008
pole+0.008
cker+0.008
Neg logit contributions
 tr-0.009
 sp-0.008
uddle-0.008
 mo-0.008
 puck-0.008
Token.
Token ID13
Feature activation+7.39e-03

Pos logit contributions
 tr+0.002
 sp+0.002
uddle+0.002
 mo+0.002
 puck+0.002
Neg logit contributions
 generation-0.002
ility-0.002
pole-0.002
hip-0.002
cker-0.002
Token
Token ID198
Feature activation+2.69e-03

Pos logit contributions
 generation+0.013
ow+0.012
pole+0.012
hip+0.012
 Him+0.011
Neg logit contributions
 tr-0.012
 sp-0.012
uddle-0.012
 puck-0.011
 mo-0.011
Token
Token ID198
Feature activation+1.13e-03

Pos logit contributions
 generation+0.005
hip+0.005
pole+0.005
ow+0.004
ility+0.004
Neg logit contributions
 tr-0.004
 sp-0.004
uddle-0.004
 puck-0.004
 mo-0.004
TokenBob
Token ID18861
Feature activation+1.30e-02

Pos logit contributions
 generation+0.002
pole+0.002
hip+0.002
eday+0.002
cker+0.002
Neg logit contributions
 tr-0.002
 sp-0.002
uddle-0.002
aj-0.002
 mo-0.002
Token was
Token ID373
Feature activation-5.53e-03

Pos logit contributions
 generation+0.025
ility+0.025
pole+0.024
hip+0.024
cker+0.024
Neg logit contributions
 tr-0.019
 sp-0.018
uddle-0.017
 mo-0.017
 puck-0.016
Token in
Token ID287
Feature activation+1.09e-02

Pos logit contributions
 tr+0.009
 sp+0.009
 mo+0.009
 rac+0.008
uddle+0.008
Neg logit contributions
 generation-0.010
ility-0.010
hip-0.009
pole-0.009
 Him-0.009
Token shock
Token ID6380
Feature activation-1.00e-02

Pos logit contributions
 generation+0.019
ility+0.018
hip+0.018
pole+0.018
 Him+0.017
Neg logit contributions
 tr-0.018
 sp-0.017
uddle-0.017
 mo-0.016
 puck-0.016
Token.
Token ID13
Feature activation-1.93e-03

Pos logit contributions
 tr+0.019
 sp+0.018
uddle+0.018
 mo+0.017
 puck+0.017
Neg logit contributions
 generation-0.015
ility-0.014
hip-0.014
pole-0.014
ie-0.013
Token He
Token ID679
Feature activation-2.41e-03

Pos logit contributions
 tr+0.003
uddle+0.003
 sp+0.003
 mo+0.003
 puck+0.002
Neg logit contributions
 generation-0.004
hip-0.004
ow-0.004
ility-0.003
pole-0.003
Token asked
Token ID1965
Feature activation+1.76e-02

Pos logit contributions
 tr+0.004
uddle+0.004
 sp+0.003
 puck+0.003
 mo+0.003
Neg logit contributions
 generation-0.004
pole-0.004
ility-0.004
cker-0.004
hip-0.004
Token hot
Token ID3024
Feature activation-6.31e-03

Pos logit contributions
 generation+0.047
ility+0.046
pole+0.045
hip+0.045
cker+0.043
Neg logit contributions
 tr-0.045
 sp-0.043
uddle-0.041
 mo-0.041
 puck-0.040
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
 tr+0.013
 sp+0.012
uddle+0.012
 mo+0.012
 puck+0.012
Neg logit contributions
 generation-0.008
ility-0.008
hip-0.008
pole-0.008
eday-0.007
Token The
Token ID383
Feature activation-7.04e-03

Pos logit contributions
 tr+0.019
 sp+0.018
uddle+0.017
 mo+0.017
 puck+0.017
Neg logit contributions
 generation-0.020
ility-0.019
hip-0.019
pole-0.019
 Him-0.018
Token man
Token ID582
Feature activation-1.49e-02

Pos logit contributions
 tr+0.010
 sp+0.010
uddle+0.009
 mo+0.009
 puck+0.009
Neg logit contributions
 generation-0.013
ility-0.013
hip-0.013
pole-0.013
cker-0.012
Token had
Token ID550
Feature activation-1.78e-02

Pos logit contributions
 tr+0.022
 sp+0.021
 mo+0.020
uddle+0.020
 puck+0.019
Neg logit contributions
 generation-0.029
ility-0.028
hip-0.027
pole-0.027
 Him-0.026
Token been
Token ID587
Feature activation-2.02e-02

Pos logit contributions
uddle+0.034
 tr+0.034
 sp+0.031
 puck+0.031
 mo+0.031
Neg logit contributions
 generation-0.025
ility-0.024
hip-0.023
pole-0.023
eday-0.022
Token so
Token ID523
Feature activation-2.49e-02

Pos logit contributions
 tr+0.035
 sp+0.034
 mo+0.033
uddle+0.032
 puck+0.031
Neg logit contributions
 generation-0.034
ility-0.033
hip-0.032
pole-0.032
 Him-0.031
Token kind
Token ID1611
Feature activation-4.93e-03

Pos logit contributions
 tr+0.040
uddle+0.037
 sp+0.036
 puck+0.036
 mo+0.035
Neg logit contributions
 generation-0.044
ility-0.044
pole-0.042
hip-0.042
ow-0.042
Token,
Token ID11
Feature activation-4.34e-03

Pos logit contributions
 tr+0.008
 sp+0.008
uddle+0.007
 mo+0.007
 puck+0.007
Neg logit contributions
 generation-0.009
ility-0.008
pole-0.008
hip-0.008
 Him-0.008
Token and
Token ID290
Feature activation-5.01e-03

Pos logit contributions
uddle+0.007
 tr+0.007
 sp+0.007
 puck+0.007
 mo+0.007
Neg logit contributions
 generation-0.007
pole-0.007
ility-0.007
hip-0.006
 Him-0.006
Token she
Token ID673
Feature activation-1.26e-02

Pos logit contributions
 tr+0.009
uddle+0.008
 sp+0.008
 puck+0.008
 mo+0.008
Neg logit contributions
 generation-0.008
pole-0.008
ility-0.008
hip-0.008
ie-0.007
Token to
Token ID284
Feature activation-1.95e-03

Pos logit contributions
 tr+0.030
 sp+0.029
uddle+0.028
 mo+0.027
 puck+0.027
Neg logit contributions
 generation-0.038
ility-0.037
hip-0.036
pole-0.036
 Him-0.035
Token make
Token ID787
Feature activation+1.25e-02

Pos logit contributions
 tr+0.004
 sp+0.003
uddle+0.003
 mo+0.003
 puck+0.003
Neg logit contributions
 generation-0.003
ility-0.003
pole-0.003
hip-0.003
cker-0.003
Token it
Token ID340
Feature activation-1.06e-02

Pos logit contributions
 generation+0.019
ility+0.019
pole+0.018
hip+0.018
 Him+0.017
Neg logit contributions
 tr-0.023
 sp-0.022
uddle-0.022
 mo-0.021
 puck-0.021
Token green
Token ID4077
Feature activation+1.01e-02

Pos logit contributions
 tr+0.018
 sp+0.018
 mo+0.016
uddle+0.016
 puck+0.016
Neg logit contributions
 generation-0.019
ility-0.019
hip-0.018
pole-0.017
eday-0.017
Token because
Token ID780
Feature activation-6.88e-03

Pos logit contributions
 generation+0.019
ility+0.019
hip+0.018
pole+0.018
cker+0.017
Neg logit contributions
 tr-0.017
 sp-0.016
 mo-0.014
uddle-0.014
 puck-0.014
Token it
Token ID340
Feature activation-1.03e-02

Pos logit contributions
 tr+0.014
uddle+0.013
 sp+0.013
 puck+0.012
 mo+0.012
Neg logit contributions
 generation-0.010
ility-0.009
pole-0.009
 Him-0.009
hip-0.009
Token was
Token ID373
Feature activation-7.04e-03

Pos logit contributions
 tr+0.015
 sp+0.014
uddle+0.014
 mo+0.014
 puck+0.014
Neg logit contributions
 generation-0.019
ility-0.019
pole-0.019
hip-0.018
cker-0.018
Token her
Token ID607
Feature activation+1.79e-02

Pos logit contributions
 tr+0.011
 sp+0.011
 mo+0.010
uddle+0.010
 puck+0.010
Neg logit contributions
 generation-0.013
ility-0.013
hip-0.012
pole-0.012
 Him-0.012
Token favorite
Token ID4004
Feature activation+3.55e-02

Pos logit contributions
 generation+0.035
ility+0.034
pole+0.033
hip+0.033
 Him+0.032
Neg logit contributions
 tr-0.025
 sp-0.023
uddle-0.023
 mo-0.023
 puck-0.022
Token color
Token ID3124
Feature activation+1.24e-02

Pos logit contributions
 generation+0.060
ility+0.057
hip+0.056
pole+0.056
cker+0.054
Neg logit contributions
 tr-0.061
 sp-0.057
uddle-0.056
 mo-0.056
 puck-0.054
Token.
Token ID13
Feature activation-3.21e-03

Pos logit contributions
 generation+0.020
ility+0.019
pole+0.019
hip+0.019
cker+0.018
Neg logit contributions
 tr-0.022
 sp-0.021
uddle-0.020
 mo-0.020
 puck-0.020
Token.
Token ID13
Feature activation-1.43e-03

Pos logit contributions
 tr+0.007
 sp+0.006
 rac+0.006
 cones+0.006
 mo+0.006
Neg logit contributions
ility-0.010
 generation-0.010
pole-0.010
immer-0.009
cker-0.009
Token They
Token ID1119
Feature activation+8.73e-03

Pos logit contributions
 tr+0.002
 sp+0.002
uddle+0.002
 mo+0.002
 puck+0.002
Neg logit contributions
 generation-0.003
ility-0.003
pole-0.003
hip-0.003
cker-0.003
Token learned
Token ID4499
Feature activation-3.39e-02

Pos logit contributions
 generation+0.015
ility+0.015
pole+0.014
hip+0.014
immer+0.014
Neg logit contributions
 tr-0.015
 sp-0.015
 mo-0.014
uddle-0.014
 rac-0.013
Token to
Token ID284
Feature activation-3.57e-03

Pos logit contributions
 tr+0.061
 sp+0.059
uddle+0.055
 mo+0.055
 puck+0.054
Neg logit contributions
 generation-0.055
ility-0.055
pole-0.053
hip-0.052
immer-0.051
Token be
Token ID307
Feature activation-2.23e-02

Pos logit contributions
 tr+0.006
 sp+0.005
 sk+0.005
omo+0.005
 rac+0.005
Neg logit contributions
ility-0.007
 generation-0.006
 Him-0.006
pole-0.006
immer-0.006
Token careful
Token ID8161
Feature activation-9.04e-02

Pos logit contributions
 tr+0.029
 sp+0.029
 mo+0.026
 rac+0.026
 puck+0.026
Neg logit contributions
 generation-0.048
ility-0.047
hip-0.045
 Him-0.045
pole-0.045
Token with
Token ID351
Feature activation-2.80e-02

Pos logit contributions
 tr+0.101
 sp+0.095
uddle+0.086
 mo+0.079
 rac+0.076
Neg logit contributions
 generation-0.204
ility-0.196
eday-0.190
pole-0.186
cker-0.184
Token animals
Token ID4695
Feature activation-4.18e-03

Pos logit contributions
 tr+0.047
 sp+0.046
 mo+0.042
 rac+0.041
uddle+0.041
Neg logit contributions
 generation-0.048
ility-0.048
pole-0.048
immer-0.047
hip-0.046
Token.
Token ID13
Feature activation+1.41e-03

Pos logit contributions
 tr+0.007
 sp+0.007
uddle+0.006
 mo+0.006
 rac+0.006
Neg logit contributions
 generation-0.008
ility-0.007
pole-0.007
hip-0.007
immer-0.007
Token They
Token ID1119
Feature activation+1.00e-02

Pos logit contributions
 generation+0.003
ility+0.003
pole+0.003
hip+0.003
cker+0.002
Neg logit contributions
 tr-0.002
 sp-0.002
uddle-0.002
 mo-0.002
 puck-0.002
Token learned
Token ID4499
Feature activation-3.19e-02

Pos logit contributions
 generation+0.017
ility+0.017
pole+0.017
hip+0.016
immer+0.016
Neg logit contributions
 tr-0.017
 sp-0.017
 mo-0.016
uddle-0.015
 rac-0.015
Token other
Token ID584
Feature activation+3.99e-05

Pos logit contributions
 tr+0.008
uddle+0.008
iden+0.008
 Oscar+0.007
 sp+0.007
Neg logit contributions
 generation-0.012
cker-0.012
pole-0.012
hip-0.011
ility-0.011
Token stick
Token ID4859
Feature activation+1.89e-04

Pos logit contributions
 generation+0.000
ility+0.000
hip+0.000
pole+0.000
cker+0.000
Neg logit contributions
 tr-0.000
 sp-0.000
uddle-0.000
 mo-0.000
 puck-0.000
Token.
Token ID13
Feature activation-5.18e-03

Pos logit contributions
 generation+0.000
ility+0.000
 Him+0.000
hip+0.000
eday+0.000
Neg logit contributions
 tr-0.000
 sp-0.000
 mo-0.000
 puck-0.000
uddle-0.000
Token He
Token ID679
Feature activation-7.77e-03

Pos logit contributions
 tr+0.009
uddle+0.009
 mo+0.008
 sp+0.008
ve+0.007
Neg logit contributions
 generation-0.008
hip-0.008
 Him-0.008
 God-0.008
 someday-0.008
Token goes
Token ID2925
Feature activation-1.86e-03

Pos logit contributions
 tr+0.013
iden+0.012
uddle+0.012
 rac+0.012
 sp+0.011
Neg logit contributions
 generation-0.012
pole-0.012
ws-0.011
eday-0.011
ility-0.011
Token home
Token ID1363
Feature activation-7.81e-02

Pos logit contributions
 tr+0.003
 sp+0.003
 mo+0.003
uddle+0.003
 puck+0.003
Neg logit contributions
 generation-0.003
ility-0.003
hip-0.003
pole-0.003
 Him-0.003
Token.
Token ID13
Feature activation-7.55e-03

Pos logit contributions
 tr+0.133
 sp+0.129
 mo+0.123
uddle+0.120
 puck+0.117
Neg logit contributions
 generation-0.135
ility-0.132
pole-0.129
hip-0.128
cker-0.123
Token
Token ID198
Feature activation-7.76e-03

Pos logit contributions
uddle+0.014
 tr+0.014
 mo+0.013
 sp+0.013
ve+0.012
Neg logit contributions
 God-0.011
 generation-0.010
 Him-0.010
hip-0.010
 someday-0.010
Token
Token ID198
Feature activation-7.21e-03

Pos logit contributions
 tr+0.012
 sp+0.011
uddle+0.011
 mo+0.010
 puck+0.010
Neg logit contributions
 generation-0.014
hip-0.014
ility-0.014
 Him-0.014
pole-0.014
TokenMom
Token ID29252
Feature activation-3.30e-02

Pos logit contributions
 tr+0.013
 sp+0.013
uddle+0.012
 mo+0.012
 cones+0.011
Neg logit contributions
 generation-0.011
hip-0.011
pole-0.011
ility-0.011
 someday-0.010
Token goes
Token ID2925
Feature activation-3.10e-03

Pos logit contributions
 tr+0.054
 sp+0.051
uddle+0.049
 puck+0.048
 mo+0.048
Neg logit contributions
 generation-0.056
ility-0.056
pole-0.055
hip-0.054
 Him-0.053
TokenBut
Token ID1537
Feature activation-1.96e-02

Pos logit contributions
 tr+0.008
 sp+0.008
 mo+0.007
uddle+0.007
 puck+0.007
Neg logit contributions
 generation-0.011
ility-0.011
pole-0.011
hip-0.010
 Him-0.010
Token we
Token ID356
Feature activation-1.24e-03

Pos logit contributions
 tr+0.036
 sp+0.035
 mo+0.033
uddle+0.032
 puck+0.032
Neg logit contributions
 generation-0.031
ility-0.029
hip-0.029
pole-0.029
cker-0.028
Token need
Token ID761
Feature activation+4.57e-02

Pos logit contributions
 tr+0.002
uddle+0.002
 puck+0.002
 sp+0.002
aj+0.002
Neg logit contributions
ility-0.002
 generation-0.002
pole-0.002
cker-0.002
hip-0.002
Token to
Token ID284
Feature activation+1.21e-03

Pos logit contributions
 generation+0.074
ility+0.071
 Him+0.068
hip+0.067
eday+0.065
Neg logit contributions
 tr-0.079
uddle-0.078
 puck-0.073
omo-0.073
 sp-0.073
Token be
Token ID307
Feature activation-1.93e-02

Pos logit contributions
 generation+0.002
ility+0.002
pole+0.002
 Him+0.002
hip+0.002
Neg logit contributions
 tr-0.002
 sp-0.002
 mo-0.002
 rac-0.002
uddle-0.002
Token careful
Token ID8161
Feature activation-8.63e-02

Pos logit contributions
 tr+0.028
 sp+0.027
 mo+0.026
uddle+0.024
 puck+0.024
Neg logit contributions
 generation-0.039
ility-0.037
hip-0.037
pole-0.036
 Him-0.036
Token not
Token ID407
Feature activation+1.17e-02

Pos logit contributions
 tr+0.133
 sp+0.127
 mo+0.119
uddle+0.118
 rac+0.112
Neg logit contributions
 generation-0.160
ility-0.154
pole-0.147
 Him-0.144
cker-0.143
Token to
Token ID284
Feature activation+4.25e-03

Pos logit contributions
 generation+0.023
ility+0.022
pole+0.021
hip+0.021
cker+0.021
Neg logit contributions
 tr-0.017
 sp-0.017
uddle-0.016
 mo-0.016
 puck-0.015
Token hurt
Token ID5938
Feature activation-1.61e-02

Pos logit contributions
 generation+0.007
ility+0.007
pole+0.007
 Him+0.007
hip+0.007
Neg logit contributions
 tr-0.007
 sp-0.007
 mo-0.007
 rac-0.006
 puck-0.006
Token any
Token ID597
Feature activation+6.20e-03

Pos logit contributions
 tr+0.027
 sp+0.027
 mo+0.025
 rac+0.024
uddle+0.024
Neg logit contributions
 generation-0.027
ility-0.027
pole-0.026
hip-0.026
immer-0.026
Token bugs
Token ID11316
Feature activation+2.69e-03

Pos logit contributions
 generation+0.010
ility+0.010
pole+0.010
hip+0.009
ie+0.009
Neg logit contributions
 tr-0.011
 sp-0.010
uddle-0.010
 mo-0.010
 puck-0.010
Token old
Token ID1468
Feature activation+2.50e-02

Pos logit contributions
 generation+0.000
ility+0.000
hip+0.000
pole+0.000
cker+0.000
Neg logit contributions
 tr-0.000
 sp-0.000
uddle-0.000
 mo-0.000
 puck-0.000
Token man
Token ID582
Feature activation-1.33e-02

Pos logit contributions
 generation+0.058
ility+0.057
hip+0.056
pole+0.056
cker+0.055
Neg logit contributions
 tr-0.027
 sp-0.025
uddle-0.023
 mo-0.023
 puck-0.022
Token.
Token ID13
Feature activation-2.99e-03

Pos logit contributions
 tr+0.022
 sp+0.020
 mo+0.018
 rac+0.018
 puck+0.018
Neg logit contributions
 generation-0.024
ility-0.024
eday-0.023
hip-0.023
pole-0.022
Token She
Token ID1375
Feature activation+1.14e-02

Pos logit contributions
 tr+0.005
 sp+0.005
 mo+0.005
uddle+0.005
 puck+0.005
Neg logit contributions
 generation-0.005
hip-0.005
pole-0.005
ility-0.004
 Him-0.004
Token ran
Token ID4966
Feature activation-2.41e-02

Pos logit contributions
pole+0.017
 generation+0.017
cker+0.016
ie+0.016
et+0.015
Neg logit contributions
uddle-0.019
 puck-0.018
 tr-0.018
aj-0.018
 rac-0.018
Token home
Token ID1363
Feature activation-7.68e-02

Pos logit contributions
 tr+0.051
 sp+0.051
uddle+0.050
 mo+0.048
omo+0.048
Neg logit contributions
 generation-0.030
pole-0.027
hip-0.027
ility-0.027
eday-0.025
Token and
Token ID290
Feature activation-6.89e-03

Pos logit contributions
 tr+0.140
 sp+0.134
uddle+0.133
 puck+0.130
 mo+0.128
Neg logit contributions
 generation-0.117
ility-0.111
hip-0.110
pole-0.109
 Him-0.103
Token told
Token ID1297
Feature activation-1.80e-02

Pos logit contributions
uddle+0.012
omo+0.011
 tr+0.011
 puck+0.011
oodle+0.011
Neg logit contributions
 generation-0.011
pole-0.010
eday-0.010
ie-0.010
ility-0.009
Token her
Token ID607
Feature activation+1.64e-02

Pos logit contributions
 tr+0.036
 sp+0.035
 mo+0.034
 rac+0.032
 M+0.032
Neg logit contributions
 generation-0.025
ility-0.025
pole-0.025
cker-0.024
immer-0.024
Token family
Token ID1641
Feature activation-2.34e-02

Pos logit contributions
 Him+0.031
 God+0.031
 generation+0.031
ie+0.029
hip+0.029
Neg logit contributions
uddle-0.022
 tr-0.022
 puck-0.021
 sp-0.021
aj-0.020
Token all
Token ID477
Feature activation-1.46e-02

Pos logit contributions
 tr+0.053
 sp+0.051
 mo+0.049
uddle+0.049
 puck+0.048
Neg logit contributions
 generation-0.027
ility-0.026
pole-0.025
hip-0.025
cker-0.024
Token,
Token ID11
Feature activation+1.15e-02

Pos logit contributions
 generation+0.044
ility+0.042
pole+0.041
hip+0.041
ie+0.040
Neg logit contributions
 tr-0.059
 sp-0.057
uddle-0.056
 mo-0.055
 puck-0.054
Token he
Token ID339
Feature activation-1.89e-02

Pos logit contributions
 generation+0.022
ility+0.022
pole+0.021
hip+0.021
 Him+0.020
Neg logit contributions
 tr-0.017
 sp-0.016
uddle-0.015
 mo-0.015
 puck-0.014
Token learned
Token ID4499
Feature activation-2.34e-02

Pos logit contributions
 tr+0.033
 sp+0.032
 mo+0.031
 rac+0.029
 sk+0.029
Neg logit contributions
 generation-0.033
ility-0.032
immer-0.031
pole-0.031
hip-0.031
Token to
Token ID284
Feature activation+9.59e-03

Pos logit contributions
 tr+0.043
 sp+0.041
uddle+0.041
 mo+0.040
 puck+0.039
Neg logit contributions
 generation-0.035
hip-0.034
ility-0.034
pole-0.033
 Him-0.032
Token be
Token ID307
Feature activation-7.40e-03

Pos logit contributions
ility+0.015
 generation+0.015
pole+0.015
 Him+0.014
immer+0.014
Neg logit contributions
 tr-0.018
 sp-0.017
 mo-0.016
 sk-0.015
 rac-0.015
Token careful
Token ID8161
Feature activation-7.91e-02

Pos logit contributions
 tr+0.010
 sp+0.010
 mo+0.010
 rac+0.009
 puck+0.009
Neg logit contributions
 generation-0.015
ility-0.015
pole-0.014
 Him-0.014
hip-0.014
Token when
Token ID618
Feature activation-1.29e-03

Pos logit contributions
 tr+0.132
 sp+0.122
uddle+0.115
 mo+0.114
 rac+0.112
Neg logit contributions
 generation-0.139
ility-0.133
pole-0.127
cker-0.126
 Him-0.125
Token drawing
Token ID8263
Feature activation+1.03e-02

Pos logit contributions
uddle+0.002
 tr+0.002
 sp+0.002
 bisc+0.002
omo+0.002
Neg logit contributions
 generation-0.002
ility-0.002
 Him-0.002
hip-0.002
ie-0.002
Token so
Token ID523
Feature activation-2.67e-03

Pos logit contributions
 generation+0.016
ility+0.016
hip+0.016
pole+0.016
 Him+0.015
Neg logit contributions
 tr-0.018
 sp-0.017
uddle-0.017
 mo-0.017
 puck-0.016
Token he
Token ID339
Feature activation-1.33e-02

Pos logit contributions
 tr+0.005
 sp+0.005
uddle+0.004
 mo+0.004
 puck+0.004
Neg logit contributions
 generation-0.004
ility-0.004
hip-0.004
pole-0.004
 Him-0.004
Token wouldn
Token ID3636
Feature activation+1.85e-02

Pos logit contributions
 tr+0.020
 sp+0.020
 mo+0.018
uddle+0.018
 puck+0.018
Neg logit contributions
 generation-0.025
ility-0.024
hip-0.024
pole-0.024
immer-0.023