TOP ACTIVATIONS
MAX = 0.263

a small table , a te ap ot , some cups
them tea from her toy te ap ot and they all
She reached for her toy te ap ot , but it
the nice lady at the te ash op . He said
, Lily accidentally dropped the te ap ot and it broke
playing . The te ap ot was very sad
higher than Anna . He te ases her and says she
poured some water from the te ap ot and pretended it
anything . She knew the te ap ot and the cups
honest and hiding the broken te ap ot .
But the teaspoon ignored the te ap ot and kept playing
. He designed a beautiful te ap ot out of clay
poured some water from the te ap ot and handed the
cups and plates and a te ap ot .
and his friend found the te ash op . <|endoftext|> M
his friend was at the te ash op . They all
poured some water from the te ap ot . She was
From then on , the te ap ot and teaspoon explored
oons . The te ap ot shouted , "
could go exploring with the te ap ot .

MIN ACTIVATIONS
MIN = -0.302

found a magical o int ment . He touched it and
: " Measure ment complete . Height : 90
it and the o int ment began to glow . Fred
full of happiness and content ment . The wise old
The o int ment worked like magic and Fred
a deep sense of content ment . He was glad he
that the magic o int ment was his gift and that
of the magical o int ment , Fred was able to
He kept the o int ment in his drawer and used
place of peace and content ment . One
rub the magical o int ment on her forehead . She
Anna picked the word " ment ion ". She asked Ben
sense of joy and content ment . After a few moments
'll put some o int ment on it ," said his
other and sighed in content ment . What a lovely day
The o int ment made the itch go away
an incredible sense of content ment . She said goodbye to
gave him some o int ment and a hug .
The princess watched with content ment , but then she noticed
and enjoyed the cool refresh ment . The traveller was happy

INTERVAL 0.122 - 0.263
CONTAINS 3.052%

. L ily 's mom my came outside and
. She loved to play outside with her toys . One
, " Thank you , Lily ! You are a good
" Come , my children . I have soup
hello to the elephant and show it that we 're friendly

INTERVAL -0.019 - 0.122
CONTAINS 62.911%

From that day on , Mia and her dad read
It was very expensive and much bigger than the old one
much that it stopped roaring and ate it all . It
named Lucy . She loved to play with her toy figures
his cart , but as he rode away , he could

INTERVAL -0.161 - -0.019
CONTAINS 33.688%

a big party with her friends . She wanted to invite
in trouble and need their help . Be kind and generous
out of the freezer and cook the cookies . Lily helped
the cooler because it kept everything cold . While
play with you ? I don 't have any toys here

INTERVAL -0.302 - -0.161
CONTAINS 0.348%

but sometimes she didn 't understand other people 's language .
some strange noises in the distance . The
. It was hard and steep . They had to use
M ira and R avi were scared and shouted ,
go home . Mary put the purse back in
Token a
Token ID257
Feature activation-1.39e-02

Pos logit contributions
Mark+0.067
Rex+0.065
By+0.063
 distance+0.063
Time+0.062
Neg logit contributions
ott-0.143
 Majesty-0.138
 te-0.133
annah-0.131
oppy-0.131
Token small
Token ID1402
Feature activation+7.87e-02

Pos logit contributions
 moment+0.015
 distance+0.015
Time+0.014
By+0.014
Rex+0.014
Neg logit contributions
ott-0.028
annah-0.027
 Majesty-0.027
angers-0.025
igg-0.025
Token table
Token ID3084
Feature activation+8.48e-02

Pos logit contributions
ott+0.116
 Majesty+0.112
annah+0.105
 Av+0.096
 Flor+0.094
Neg logit contributions
 distance-0.127
 moment-0.126
Rex-0.121
Time-0.119
Mark-0.117
Token,
Token ID11
Feature activation+6.42e-02

Pos logit contributions
ott+0.166
 Majesty+0.165
 te+0.150
annah+0.150
oppy+0.147
Neg logit contributions
Rex-0.094
Mark-0.093
 distance-0.093
By-0.089
Kim-0.084
Token a
Token ID257
Feature activation-1.29e-02

Pos logit contributions
ott+0.129
 Majesty+0.125
 te+0.124
 cabinet+0.118
annah+0.116
Neg logit contributions
Mark-0.071
Rex-0.070
By-0.067
Big-0.066
 distance-0.066
Token te
Token ID573
Feature activation+2.63e-01

Pos logit contributions
 distance+0.019
Rex+0.019
 moment+0.018
Time+0.018
Mark+0.018
Neg logit contributions
ott-0.021
 Majesty-0.020
annah-0.019
igg-0.019
act-0.018
Tokenap
Token ID499
Feature activation-4.34e-02

Pos logit contributions
ott+0.394
 Majesty+0.371
 cabinet+0.362
annah+0.360
 wardrobe+0.331
Neg logit contributions
Rex-0.388
By-0.362
Time-0.348
Kim-0.338
Mark-0.334
Tokenot
Token ID313
Feature activation-9.41e-02

Pos logit contributions
Mark+0.073
Rex+0.072
By+0.071
Big+0.070
 distance+0.070
Neg logit contributions
ott-0.061
 te-0.060
 Majesty-0.058
 cabinet-0.055
annah-0.053
Token,
Token ID11
Feature activation+6.50e-02

Pos logit contributions
Mark+0.126
Rex+0.125
By+0.120
Big+0.119
 distance+0.118
Neg logit contributions
ott-0.167
 te-0.160
 Majesty-0.159
 cabinet-0.151
annah-0.147
Token some
Token ID617
Feature activation-2.57e-03

Pos logit contributions
ott+0.131
 Majesty+0.126
 te+0.122
 cabinet+0.119
annah+0.117
Neg logit contributions
Mark-0.072
Rex-0.072
By-0.069
Big-0.068
 distance-0.067
Token cups
Token ID14180
Feature activation+4.69e-02

Pos logit contributions
Rex+0.004
Mark+0.004
 distance+0.004
By+0.004
Time+0.004
Neg logit contributions
ott-0.004
 Majesty-0.003
 te-0.003
annah-0.003
 cabinet-0.003
Token them
Token ID606
Feature activation-2.69e-02

Pos logit contributions
Mark+0.012
Rex+0.012
By+0.012
Big+0.012
 distance+0.012
Neg logit contributions
ott-0.016
 te-0.015
 Majesty-0.015
 cabinet-0.014
annah-0.014
Token tea
Token ID8887
Feature activation+9.51e-02

Pos logit contributions
Rex+0.026
Mark+0.026
 distance+0.026
By+0.025
Big+0.025
Neg logit contributions
ott-0.057
 Majesty-0.056
 te-0.052
annah-0.051
 cabinet-0.050
Token from
Token ID422
Feature activation-1.83e-02

Pos logit contributions
ott+0.191
 Majesty+0.188
 te+0.175
annah+0.168
 cabinet+0.168
Neg logit contributions
Rex-0.105
Mark-0.104
By-0.101
 distance-0.099
Big-0.096
Token her
Token ID607
Feature activation+7.64e-02

Pos logit contributions
 distance+0.014
Mark+0.014
Rex+0.013
Big+0.013
What+0.012
Neg logit contributions
ott-0.044
 Majesty-0.043
 te-0.041
annah-0.040
anda-0.040
Token toy
Token ID13373
Feature activation+1.39e-01

Pos logit contributions
ott+0.131
igg+0.118
 Majesty+0.116
annah+0.116
osing+0.114
Neg logit contributions
Rex-0.110
 distance-0.108
Big-0.101
By-0.094
Mark-0.094
Token te
Token ID573
Feature activation+2.62e-01

Pos logit contributions
 Majesty+0.236
ott+0.235
annah+0.226
igg+0.215
 Av+0.201
Neg logit contributions
Rex-0.207
 distance-0.191
Mark-0.176
Big-0.176
By-0.174
Tokenap
Token ID499
Feature activation-4.70e-02

Pos logit contributions
ott+0.414
 cabinet+0.396
 Majesty+0.386
annah+0.370
 Flor+0.368
Neg logit contributions
Rex-0.369
By-0.329
Time-0.328
 injuries-0.326
Mark-0.317
Tokenot
Token ID313
Feature activation-9.82e-02

Pos logit contributions
Mark+0.082
Rex+0.081
By+0.079
Big+0.078
 distance+0.078
Neg logit contributions
ott-0.064
 te-0.063
 Majesty-0.061
 cabinet-0.057
annah-0.055
Token and
Token ID290
Feature activation-1.89e-02

Pos logit contributions
Mark+0.131
Rex+0.128
By+0.124
Big+0.123
 distance+0.123
Neg logit contributions
ott-0.174
 te-0.170
 Majesty-0.166
 cabinet-0.159
annah-0.155
Token they
Token ID484
Feature activation-1.92e-02

Pos logit contributions
Mark+0.027
Rex+0.026
By+0.026
Big+0.025
 distance+0.025
Neg logit contributions
ott-0.032
 te-0.031
 Majesty-0.031
 cabinet-0.029
annah-0.028
Token all
Token ID477
Feature activation+1.08e-01

Pos logit contributions
Mark+0.026
Rex+0.026
Time+0.025
Rick+0.025
Kim+0.025
Neg logit contributions
 te-0.035
ott-0.033
 Majesty-0.032
 cabinet-0.031
 wardrobe-0.031
TokenShe
Token ID3347
Feature activation-9.58e-03

Pos logit contributions
Mark+0.010
Rex+0.010
By+0.010
What+0.009
Time+0.009
Neg logit contributions
ott-0.014
 Majesty-0.014
 te-0.013
 cabinet-0.013
oppy-0.013
Token reached
Token ID4251
Feature activation-1.21e-02

Pos logit contributions
By+0.008
Rex+0.008
Mark+0.008
 rounded+0.007
Run+0.007
Neg logit contributions
 Majesty-0.022
ott-0.021
oppy-0.019
 cabinet-0.019
 majesty-0.019
Token for
Token ID329
Feature activation-3.99e-02

Pos logit contributions
 distance+0.011
Mark+0.011
 against+0.010
Time+0.010
 moment+0.010
Neg logit contributions
ott-0.027
 Majesty-0.026
act-0.023
 majesty-0.023
oppy-0.023
Token her
Token ID607
Feature activation+7.94e-02

Pos logit contributions
Mark+0.061
Rex+0.060
Big+0.059
Kim+0.059
Rick+0.059
Neg logit contributions
 te-0.063
ott-0.059
 Flor-0.059
 Av-0.058
 cabinet-0.056
Token toy
Token ID13373
Feature activation+1.40e-01

Pos logit contributions
ott+0.110
 Majesty+0.103
annah+0.091
act+0.086
igg+0.085
Neg logit contributions
Rex-0.135
 distance-0.133
Mark-0.129
 moment-0.129
What-0.125
Token te
Token ID573
Feature activation+2.61e-01

Pos logit contributions
 Majesty+0.234
ott+0.227
annah+0.206
igg+0.201
 majesty+0.199
Neg logit contributions
Rex-0.213
Mark-0.196
 distance-0.193
Kim-0.185
By-0.181
Tokenap
Token ID499
Feature activation-4.41e-02

Pos logit contributions
ott+0.424
 Majesty+0.404
 cabinet+0.394
annah+0.366
 Flor+0.363
Neg logit contributions
Rex-0.334
Time-0.313
 response-0.306
 injuries-0.302
 volume-0.301
Tokenot
Token ID313
Feature activation-9.42e-02

Pos logit contributions
Mark+0.076
Rex+0.075
By+0.073
Big+0.073
 distance+0.072
Neg logit contributions
ott-0.061
 te-0.060
 Majesty-0.058
 cabinet-0.055
annah-0.053
Token,
Token ID11
Feature activation+6.25e-02

Pos logit contributions
Mark+0.125
Rex+0.123
By+0.119
Big+0.118
 distance+0.117
Neg logit contributions
ott-0.168
 te-0.163
 Majesty-0.161
 cabinet-0.153
annah-0.149
Token but
Token ID475
Feature activation-3.04e-02

Pos logit contributions
ott+0.117
 Majesty+0.112
 te+0.107
 cabinet+0.104
oppy+0.103
Neg logit contributions
Mark-0.079
Rex-0.078
By-0.076
Big-0.075
 distance-0.075
Token it
Token ID340
Feature activation+8.04e-03

Pos logit contributions
Mark+0.050
Rex+0.049
By+0.048
Big+0.047
Time+0.047
Neg logit contributions
ott-0.045
 te-0.043
 Majesty-0.043
 cabinet-0.040
annah-0.039
Token the
Token ID262
Feature activation-4.16e-02

Pos logit contributions
 distance+0.048
Rex+0.048
Mark+0.047
Time+0.047
By+0.047
Neg logit contributions
ott-0.072
 te-0.070
 Flor-0.069
 Av-0.069
 Majesty-0.067
Token nice
Token ID3621
Feature activation+3.63e-02

Pos logit contributions
Mark+0.070
Rex+0.067
Big+0.066
Rick+0.066
Kim+0.065
Neg logit contributions
 te-0.065
ott-0.061
 Majesty-0.059
 cabinet-0.056
 Flor-0.054
Token lady
Token ID10846
Feature activation-5.87e-03

Pos logit contributions
 te+0.062
ott+0.060
 cabinet+0.055
 Majesty+0.055
 Av+0.053
Neg logit contributions
Mark-0.055
Rex-0.054
Big-0.053
By-0.052
Rick-0.051
Token at
Token ID379
Feature activation+2.01e-02

Pos logit contributions
Mark+0.008
Rex+0.008
By+0.008
Big+0.008
Time+0.008
Neg logit contributions
ott-0.010
 te-0.010
 Majesty-0.009
 cabinet-0.009
 Flor-0.009
Token the
Token ID262
Feature activation-4.11e-02

Pos logit contributions
ott+0.045
 te+0.044
 Majesty+0.044
 cabinet+0.042
annah+0.041
Neg logit contributions
Mark-0.017
Rex-0.017
By-0.016
Big-0.016
 distance-0.016
Token te
Token ID573
Feature activation+2.61e-01

Pos logit contributions
Mark+0.070
Rex+0.070
 distance+0.069
By+0.068
Time+0.068
Neg logit contributions
ott-0.058
 Majesty-0.054
 te-0.052
 cabinet-0.049
annah-0.049
Tokenash
Token ID1077
Feature activation-1.40e-02

Pos logit contributions
ott+0.357
 cabinet+0.352
 Majesty+0.342
 wardrobe+0.333
 Flor+0.325
Neg logit contributions
Rex-0.397
Time-0.373
Mark-0.373
By-0.367
Thunder-0.365
Tokenop
Token ID404
Feature activation-3.31e-03

Pos logit contributions
Mark+0.021
Rex+0.021
By+0.020
Big+0.020
 distance+0.019
Neg logit contributions
ott-0.023
 te-0.022
 Majesty-0.022
 cabinet-0.020
annah-0.020
Token.
Token ID13
Feature activation-1.55e-02

Pos logit contributions
Mark+0.005
Rex+0.004
Big+0.004
By+0.004
 distance+0.004
Neg logit contributions
ott-0.006
 te-0.006
 Majesty-0.006
 cabinet-0.005
annah-0.005
Token He
Token ID679
Feature activation-6.16e-02

Pos logit contributions
Mark+0.020
By+0.019
Big+0.019
Rex+0.019
Kim+0.018
Neg logit contributions
 te-0.029
ott-0.029
 Majesty-0.028
 cabinet-0.027
 wardrobe-0.027
Token said
Token ID531
Feature activation+3.03e-02

Pos logit contributions
Rex+0.063
Mark+0.062
By+0.062
Big+0.059
 distance+0.057
Neg logit contributions
ott-0.127
 Majesty-0.124
 te-0.121
 cabinet-0.120
annah-0.117
Token,
Token ID11
Feature activation+5.70e-02

Pos logit contributions
Mark+0.075
Rex+0.073
By+0.072
Big+0.071
Time+0.070
Neg logit contributions
ott-0.082
 te-0.080
 Majesty-0.078
 cabinet-0.073
annah-0.073
Token Lily
Token ID20037
Feature activation+1.47e-01

Pos logit contributions
ott+0.091
 te+0.089
 Majesty+0.084
 cabinet+0.084
 Av+0.082
Neg logit contributions
Mark-0.086
Rex-0.086
Big-0.082
 distance-0.081
By-0.081
Token accidentally
Token ID14716
Feature activation+1.65e-01

Pos logit contributions
ott+0.238
 Majesty+0.233
 te+0.228
 cabinet+0.217
 Flor+0.215
Neg logit contributions
Mark-0.213
Rex-0.212
Big-0.205
By-0.204
 distance-0.199
Token dropped
Token ID5710
Feature activation+4.09e-02

Pos logit contributions
ott+0.257
 Majesty+0.249
 te+0.247
 cabinet+0.232
annah+0.228
Neg logit contributions
Mark-0.256
Rex-0.253
By-0.245
Big-0.243
Time-0.240
Token the
Token ID262
Feature activation-4.08e-02

Pos logit contributions
ott+0.082
 te+0.080
 cabinet+0.076
 Majesty+0.074
 Av+0.073
Neg logit contributions
Mark-0.048
Rex-0.046
By-0.046
Big-0.043
Rick-0.043
Token te
Token ID573
Feature activation+2.61e-01

Pos logit contributions
 distance+0.077
Rex+0.075
Time+0.075
 moment+0.074
What+0.073
Neg logit contributions
ott-0.049
 Majesty-0.049
annah-0.041
oppy-0.041
 majesty-0.038
Tokenap
Token ID499
Feature activation-4.76e-02

Pos logit contributions
ott+0.365
 Majesty+0.363
 cabinet+0.341
 wardrobe+0.328
 Flor+0.325
Neg logit contributions
Rex-0.365
Time-0.365
 injuries-0.356
By-0.354
 volume-0.334
Tokenot
Token ID313
Feature activation-9.95e-02

Pos logit contributions
Mark+0.085
Rex+0.083
Big+0.082
By+0.082
Rick+0.082
Neg logit contributions
 te-0.066
ott-0.064
 cabinet-0.058
 Majesty-0.057
 Av-0.055
Token and
Token ID290
Feature activation-1.48e-02

Pos logit contributions
Mark+0.121
Rex+0.116
 distance+0.113
Big+0.113
By+0.112
Neg logit contributions
 te-0.189
ott-0.188
 Majesty-0.174
 cabinet-0.173
annah-0.169
Token it
Token ID340
Feature activation+7.82e-03

Pos logit contributions
Mark+0.019
Rex+0.018
By+0.018
Big+0.018
 distance+0.018
Neg logit contributions
ott-0.027
 te-0.026
 Majesty-0.026
 cabinet-0.025
annah-0.024
Token broke
Token ID6265
Feature activation+4.64e-02

Pos logit contributions
ott+0.013
 te+0.013
 Majesty+0.013
 cabinet+0.012
annah+0.012
Neg logit contributions
Mark-0.011
Rex-0.011
By-0.010
Big-0.010
 distance-0.010
Token playing
Token ID2712
Feature activation+4.38e-02

Pos logit contributions
Mark+0.023
Rex+0.022
By+0.022
Big+0.021
 distance+0.021
Neg logit contributions
ott-0.031
 te-0.030
 Majesty-0.030
 cabinet-0.029
annah-0.028
Token.
Token ID13
Feature activation-5.60e-03

Pos logit contributions
ott+0.086
 Majesty+0.081
 te+0.077
oppy+0.076
annah+0.076
Neg logit contributions
Rex-0.050
Mark-0.050
By-0.050
 distance-0.049
What-0.048
Token
Token ID198
Feature activation-9.75e-03

Pos logit contributions
Mark+0.007
By+0.006
Big+0.006
Rex+0.006
Thunder+0.006
Neg logit contributions
 Majesty-0.011
ott-0.011
 te-0.011
 wardrobe-0.011
 cabinet-0.010
Token
Token ID198
Feature activation-1.05e-02

Pos logit contributions
 distance+0.014
 brake+0.013
Rex+0.013
Mark+0.013
Time+0.013
Neg logit contributions
ott-0.016
 te-0.015
 Majesty-0.014
 Flor-0.014
annah-0.014
TokenThe
Token ID464
Feature activation-1.06e-01

Pos logit contributions
 distance+0.010
 brake+0.009
Rex+0.009
 levers+0.009
 breath+0.009
Neg logit contributions
ott-0.021
 te-0.020
annah-0.019
 Majesty-0.019
 Av-0.019
Token te
Token ID573
Feature activation+2.60e-01

Pos logit contributions
Mark+0.186
Rex+0.184
By+0.179
Big+0.178
 distance+0.177
Neg logit contributions
ott-0.145
 te-0.141
 Majesty-0.137
 cabinet-0.129
annah-0.124
Tokenap
Token ID499
Feature activation-4.90e-02

Pos logit contributions
 Majesty+0.421
annah+0.394
ott+0.390
 cabinet+0.385
 wardrobe+0.380
Neg logit contributions
Time-0.315
By-0.302
Rex-0.301
Thunder-0.287
Mark-0.282
Tokenot
Token ID313
Feature activation-1.02e-01

Pos logit contributions
Mark+0.067
Rex+0.066
By+0.065
Big+0.063
 distance+0.063
Neg logit contributions
ott-0.085
 Majesty-0.081
 te-0.081
 cabinet-0.077
annah-0.074
Token was
Token ID373
Feature activation+2.62e-02

Pos logit contributions
Mark+0.131
Big+0.125
Rex+0.124
 distance+0.124
Rick+0.124
Neg logit contributions
 te-0.183
ott-0.179
 Majesty-0.168
annah-0.163
 cabinet-0.162
Token very
Token ID845
Feature activation+7.32e-02

Pos logit contributions
 te+0.038
ott+0.036
 Majesty+0.035
 Flor+0.033
 cabinet+0.033
Neg logit contributions
Mark-0.043
Rex-0.043
Rick-0.042
Big-0.041
By-0.041
Token sad
Token ID6507
Feature activation+4.22e-03

Pos logit contributions
ott+0.119
 Majesty+0.114
 te+0.113
 cabinet+0.109
annah+0.104
Neg logit contributions
Mark-0.108
Rex-0.108
By-0.108
Big-0.105
 distance-0.105
Token higher
Token ID2440
Feature activation-3.31e-02

Pos logit contributions
Mark+0.011
By+0.010
Rex+0.010
Time+0.010
Big+0.010
Neg logit contributions
ott-0.020
 te-0.020
 Majesty-0.019
 cabinet-0.019
 Flor-0.019
Token than
Token ID621
Feature activation-1.14e-01

Pos logit contributions
Mark+0.049
By+0.048
Rex+0.048
Time+0.047
Big+0.047
Neg logit contributions
 te-0.052
ott-0.052
 Majesty-0.050
 cabinet-0.049
 Flor-0.048
Token Anna
Token ID11735
Feature activation+8.63e-02

Pos logit contributions
Rex+0.126
Mark+0.126
 distance+0.118
By+0.116
Big+0.114
Neg logit contributions
ott-0.234
 Majesty-0.231
 te-0.225
 cabinet-0.211
annah-0.210
Token.
Token ID13
Feature activation-1.09e-02

Pos logit contributions
ott+0.175
 Majesty+0.173
 te+0.163
 majesty+0.156
 cabinet+0.155
Neg logit contributions
Rex-0.097
Mark-0.094
 distance-0.086
By-0.084
Big-0.084
Token He
Token ID679
Feature activation-6.06e-02

Pos logit contributions
Rex+0.012
Mark+0.012
By+0.012
Time+0.011
What+0.011
Neg logit contributions
 Majesty-0.022
ott-0.022
 te-0.021
 wardrobe-0.020
 cabinet-0.020
Token te
Token ID573
Feature activation+2.60e-01

Pos logit contributions
Rex+0.074
Mark+0.073
 brake+0.067
By+0.067
Rick+0.067
Neg logit contributions
ott-0.116
 Majesty-0.107
 te-0.104
annah-0.104
oppy-0.102
Tokenases
Token ID1386
Feature activation-5.51e-02

Pos logit contributions
ott+0.463
 te+0.445
 Majesty+0.444
annah+0.440
 cabinet+0.436
Neg logit contributions
Rex-0.304
Mark-0.286
By-0.270
Thunder-0.270
Time-0.269
Token her
Token ID607
Feature activation+7.97e-02

Pos logit contributions
Rex+0.090
Mark+0.089
Big+0.084
By+0.083
Time+0.083
Neg logit contributions
ott-0.085
 Majesty-0.083
 te-0.078
 cabinet-0.074
oppy-0.072
Token and
Token ID290
Feature activation-1.94e-02

Pos logit contributions
ott+0.138
 te+0.134
 Majesty+0.132
 cabinet+0.125
annah+0.122
Neg logit contributions
Mark-0.110
Rex-0.109
By-0.105
Big-0.104
 distance-0.104
Token says
Token ID1139
Feature activation+4.79e-02

Pos logit contributions
Rex+0.024
Mark+0.023
What+0.022
 brake+0.022
 steer+0.022
Neg logit contributions
ott-0.039
oppy-0.033
 Majesty-0.033
ingu-0.032
annah-0.032
Token she
Token ID673
Feature activation+8.79e-02

Pos logit contributions
ott+0.082
 te+0.079
 Majesty+0.078
 cabinet+0.074
annah+0.073
Neg logit contributions
Mark-0.068
Rex-0.066
By-0.065
Big-0.063
Time-0.062
Token poured
Token ID19036
Feature activation+4.69e-02

Pos logit contributions
Mark+0.161
By+0.160
Rex+0.159
Big+0.153
 distance+0.150
Neg logit contributions
ott-0.281
 Majesty-0.279
 te-0.265
 cabinet-0.260
annah-0.258
Token some
Token ID617
Feature activation-5.85e-03

Pos logit contributions
 te+0.084
ott+0.082
 Flor+0.076
 cabinet+0.076
 Av+0.075
Neg logit contributions
Mark-0.066
Rex-0.063
By-0.062
Rick-0.062
Big-0.061
Token water
Token ID1660
Feature activation+2.72e-02

Pos logit contributions
Rex+0.010
Mark+0.010
 distance+0.010
Time+0.010
By+0.010
Neg logit contributions
ott-0.008
 Majesty-0.008
 te-0.007
 cabinet-0.007
annah-0.007
Token from
Token ID422
Feature activation-1.74e-02

Pos logit contributions
 te+0.055
ott+0.054
 cabinet+0.050
 Majesty+0.049
 Flor+0.049
Neg logit contributions
Mark-0.032
Rex-0.030
By-0.029
Big-0.029
Rick-0.029
Token the
Token ID262
Feature activation-4.04e-02

Pos logit contributions
Mark+0.014
Rex+0.014
Big+0.013
 distance+0.013
What+0.013
Neg logit contributions
ott-0.040
 Majesty-0.039
 te-0.038
annah-0.037
 cabinet-0.037
Token te
Token ID573
Feature activation+2.60e-01

Pos logit contributions
 distance+0.077
Rex+0.074
Time+0.073
What+0.071
By+0.070
Neg logit contributions
ott-0.051
 Majesty-0.048
oppy-0.043
annah-0.041
igg-0.040
Tokenap
Token ID499
Feature activation-4.71e-02

Pos logit contributions
ott+0.382
 Majesty+0.350
 cabinet+0.341
annah+0.318
 wardrobe+0.317
Neg logit contributions
Rex-0.375
Time-0.368
By-0.349
 injuries-0.341
 signals-0.341
Tokenot
Token ID313
Feature activation-9.78e-02

Pos logit contributions
Mark+0.090
Rex+0.088
By+0.086
Rick+0.086
Big+0.086
Neg logit contributions
 te-0.059
ott-0.057
 Majesty-0.052
 cabinet-0.051
 Av-0.049
Token and
Token ID290
Feature activation-1.55e-02

Pos logit contributions
Mark+0.114
Rex+0.113
By+0.108
Big+0.107
 distance+0.106
Neg logit contributions
ott-0.190
 Majesty-0.184
 te-0.183
 cabinet-0.174
annah-0.169
Token pretended
Token ID36766
Feature activation-3.21e-02

Pos logit contributions
Mark+0.022
Rex+0.022
By+0.021
Big+0.021
 distance+0.021
Neg logit contributions
ott-0.026
 te-0.026
 Majesty-0.025
 cabinet-0.024
 Flor-0.023
Token it
Token ID340
Feature activation+8.28e-03

Pos logit contributions
Rex+0.046
Mark+0.046
By+0.045
Big+0.045
Rick+0.044
Neg logit contributions
 te-0.053
ott-0.052
 cabinet-0.049
 Majesty-0.049
 Flor-0.047
Token anything
Token ID1997
Feature activation-6.62e-02

Pos logit contributions
ott+0.064
 Majesty+0.063
 te+0.061
 cabinet+0.058
annah+0.058
Neg logit contributions
Rex-0.037
Mark-0.037
What-0.035
Big-0.035
Time-0.034
Token.
Token ID13
Feature activation+1.31e-03

Pos logit contributions
 distance+0.072
What+0.070
By+0.070
Rex+0.070
Time+0.069
Neg logit contributions
 Majesty-0.132
ott-0.130
 majesty-0.115
annah-0.115
oppy-0.114
Token She
Token ID1375
Feature activation+5.15e-02

Pos logit contributions
 te+0.002
 Majesty+0.002
ott+0.002
anda+0.002
 wardrobe+0.002
Neg logit contributions
What-0.002
Big-0.002
Mark-0.002
By-0.002
Kim-0.002
Token knew
Token ID2993
Feature activation+9.75e-03

Pos logit contributions
 Majesty+0.110
ott+0.108
annah+0.104
 cabinet+0.104
 wardrobe+0.100
Neg logit contributions
Rex-0.047
By-0.046
Mark-0.045
 breathed-0.041
Big-0.041
Token the
Token ID262
Feature activation-4.03e-02

Pos logit contributions
 Majesty+0.020
ott+0.019
 te+0.019
annah+0.019
 cabinet+0.018
Neg logit contributions
What-0.010
Mark-0.010
Rex-0.010
Big-0.010
Time-0.009
Token te
Token ID573
Feature activation+2.60e-01

Pos logit contributions
Rex+0.069
Mark+0.069
 distance+0.068
Time+0.068
By+0.068
Neg logit contributions
ott-0.055
 Majesty-0.051
 te-0.050
annah-0.048
 cabinet-0.047
Tokenap
Token ID499
Feature activation-4.85e-02

Pos logit contributions
ott+0.398
 Majesty+0.394
 wardrobe+0.392
annah+0.383
 cabinet+0.381
Neg logit contributions
Rex-0.313
Time-0.296
 injuries-0.291
Thunder-0.285
By-0.283
Tokenot
Token ID313
Feature activation-9.83e-02

Pos logit contributions
Mark+0.070
Rex+0.069
By+0.067
Big+0.066
 distance+0.066
Neg logit contributions
ott-0.080
 te-0.078
 Majesty-0.077
 cabinet-0.072
annah-0.070
Token and
Token ID290
Feature activation-1.23e-02

Pos logit contributions
Mark+0.130
Rex+0.127
By+0.124
Big+0.123
 distance+0.122
Neg logit contributions
ott-0.175
 te-0.171
 Majesty-0.167
 cabinet-0.160
annah-0.155
Token the
Token ID262
Feature activation-3.96e-02

Pos logit contributions
Mark+0.015
Rex+0.015
By+0.015
Big+0.015
 distance+0.014
Neg logit contributions
ott-0.023
 te-0.022
 Majesty-0.022
 cabinet-0.021
 Flor-0.020
Token cups
Token ID14180
Feature activation+4.33e-02

Pos logit contributions
Mark+0.072
Rex+0.071
By+0.070
Big+0.069
 distance+0.068
Neg logit contributions
ott-0.051
 te-0.050
 Majesty-0.048
 cabinet-0.045
 Flor-0.043
Token honest
Token ID5508
Feature activation+1.21e-02

Pos logit contributions
ott+0.045
 Majesty+0.044
 te+0.043
 cabinet+0.040
 Flor+0.039
Neg logit contributions
Mark-0.040
Rex-0.040
Big-0.038
By-0.037
Time-0.037
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
ott+0.020
 te+0.020
 Majesty+0.019
 cabinet+0.019
annah+0.018
Neg logit contributions
Mark-0.017
Rex-0.017
By-0.016
Big-0.016
 distance-0.016
Token hiding
Token ID11816
Feature activation+1.78e-01

Pos logit contributions
Mark+0.014
Rex+0.014
By+0.013
Big+0.013
Time+0.013
Neg logit contributions
ott-0.019
 te-0.019
 Majesty-0.018
 cabinet-0.017
annah-0.017
Token the
Token ID262
Feature activation-4.05e-02

Pos logit contributions
ott+0.327
 te+0.317
 Majesty+0.314
 cabinet+0.290
annah+0.290
Neg logit contributions
Mark-0.227
Rex-0.222
 distance-0.215
Big-0.211
By-0.209
Token broken
Token ID5445
Feature activation+2.69e-02

Pos logit contributions
Mark+0.065
Rex+0.064
By+0.063
Big+0.062
 distance+0.062
Neg logit contributions
ott-0.061
 te-0.059
 Majesty-0.058
 cabinet-0.055
annah-0.053
Token te
Token ID573
Feature activation+2.59e-01

Pos logit contributions
ott+0.049
 Majesty+0.047
 te+0.046
annah+0.045
 Flor+0.044
Neg logit contributions
 distance-0.034
Mark-0.033
Rex-0.032
By-0.031
Time-0.031
Tokenap
Token ID499
Feature activation-4.72e-02

Pos logit contributions
ott+0.405
 Flor+0.390
 te+0.386
 cabinet+0.385
 wardrobe+0.380
Neg logit contributions
Rex-0.325
 injuries-0.319
By-0.299
Mark-0.292
Time-0.286
Tokenot
Token ID313
Feature activation-9.98e-02

Pos logit contributions
Mark+0.071
Rex+0.070
By+0.068
Big+0.068
 distance+0.067
Neg logit contributions
ott-0.075
 te-0.073
 Majesty-0.071
 cabinet-0.068
annah-0.066
Token.
Token ID13
Feature activation-1.41e-03

Pos logit contributions
Mark+0.141
Rex+0.137
By+0.133
Big+0.133
 distance+0.132
Neg logit contributions
ott-0.168
 te-0.167
 Majesty-0.160
 cabinet-0.154
annah-0.150
Token
Token ID198
Feature activation-4.46e-03

Pos logit contributions
What+0.001
Mark+0.001
Kim+0.001
By+0.001
Big+0.001
Neg logit contributions
 Majesty-0.003
 te-0.003
 collection-0.003
 wardrobe-0.003
 penn-0.003
Token
Token ID198
Feature activation-6.11e-03

Pos logit contributions
Mark+0.005
Rex+0.005
 distance+0.005
By+0.005
Big+0.005
Neg logit contributions
ott-0.008
 te-0.008
 Majesty-0.008
 cabinet-0.008
annah-0.007
Token But
Token ID887
Feature activation+3.34e-02

Pos logit contributions
ott+0.019
 Majesty+0.019
 te+0.018
 cabinet+0.018
 wardrobe+0.017
Neg logit contributions
Mark-0.014
Rex-0.014
By-0.013
Big-0.013
 distance-0.012
Token the
Token ID262
Feature activation-4.62e-02

Pos logit contributions
 te+0.070
ott+0.070
 Majesty+0.068
 Av+0.067
 Flor+0.067
Neg logit contributions
Mark-0.033
Rex-0.032
Time-0.030
Rick-0.030
Kim-0.030
Token teaspoon
Token ID22326
Feature activation+1.37e-01

Pos logit contributions
Mark+0.094
Rex+0.094
 distance+0.094
Time+0.093
By+0.093
Neg logit contributions
ott-0.047
 Majesty-0.045
annah-0.038
 te-0.037
oppy-0.037
Token ignored
Token ID9514
Feature activation-1.03e-01

Pos logit contributions
ott+0.280
 Majesty+0.276
 te+0.269
 cabinet+0.256
 Flor+0.254
Neg logit contributions
Mark-0.143
Rex-0.141
 distance-0.134
By-0.134
Big-0.130
Token the
Token ID262
Feature activation-4.54e-02

Pos logit contributions
Mark+0.099
Rex+0.096
By+0.094
 distance+0.092
Time+0.090
Neg logit contributions
ott-0.222
 te-0.218
 Majesty-0.214
 cabinet-0.207
 Flor-0.203
Token te
Token ID573
Feature activation+2.59e-01

Pos logit contributions
Mark+0.082
Rex+0.082
 distance+0.081
Time+0.081
By+0.081
Neg logit contributions
ott-0.057
 Majesty-0.054
 te-0.050
annah-0.048
oppy-0.047
Tokenap
Token ID499
Feature activation-4.94e-02

Pos logit contributions
 Majesty+0.399
 wardrobe+0.395
ott+0.393
 cabinet+0.389
annah+0.376
Neg logit contributions
Rex-0.285
 breath-0.275
Time-0.272
By-0.271
Thunder-0.266
Tokenot
Token ID313
Feature activation-1.02e-01

Pos logit contributions
Mark+0.074
Rex+0.073
By+0.071
Big+0.071
 distance+0.070
Neg logit contributions
ott-0.079
 te-0.078
 Majesty-0.075
 cabinet-0.072
annah-0.069
Token and
Token ID290
Feature activation-2.13e-02

Pos logit contributions
Mark+0.130
Rex+0.127
By+0.123
Big+0.122
 distance+0.121
Neg logit contributions
ott-0.187
 te-0.182
 Majesty-0.179
 cabinet-0.171
annah-0.166
Token kept
Token ID4030
Feature activation-1.73e-02

Pos logit contributions
Mark+0.026
Rex+0.026
By+0.025
Big+0.024
 distance+0.024
Neg logit contributions
ott-0.040
 te-0.039
 Majesty-0.039
 cabinet-0.037
annah-0.036
Token playing
Token ID2712
Feature activation+4.38e-02

Pos logit contributions
Mark+0.023
Rex+0.022
By+0.022
Big+0.021
 distance+0.021
Neg logit contributions
ott-0.031
 te-0.030
 Majesty-0.030
 cabinet-0.029
annah-0.028
Token.
Token ID13
Feature activation-6.37e-03

Pos logit contributions
ott+0.021
 Majesty+0.021
 te+0.021
 Flor+0.019
annah+0.019
Neg logit contributions
Mark-0.014
Rex-0.014
 distance-0.014
Big-0.013
By-0.013
Token He
Token ID679
Feature activation-5.46e-02

Pos logit contributions
Mark+0.009
Rex+0.009
Big+0.009
 distance+0.009
By+0.009
Neg logit contributions
ott-0.011
 Majesty-0.011
 te-0.011
annah-0.010
 cabinet-0.010
Token designed
Token ID3562
Feature activation+3.97e-02

Pos logit contributions
Rex+0.045
Mark+0.044
Big+0.042
By+0.042
Rick+0.037
Neg logit contributions
 Majesty-0.124
annah-0.121
ott-0.120
 te-0.116
 cabinet-0.115
Token a
Token ID257
Feature activation-1.42e-02

Pos logit contributions
ott+0.074
 te+0.072
 Majesty+0.071
 cabinet+0.067
annah+0.066
Neg logit contributions
Mark-0.050
Rex-0.049
By-0.047
Big-0.047
 distance-0.046
Token beautiful
Token ID4950
Feature activation+2.21e-02

Pos logit contributions
 distance+0.017
Mark+0.016
Rex+0.016
Big+0.016
By+0.016
Neg logit contributions
ott-0.027
annah-0.026
 te-0.025
 Majesty-0.025
 Flor-0.024
Token te
Token ID573
Feature activation+2.59e-01

Pos logit contributions
ott+0.038
annah+0.034
 te+0.034
oppy+0.033
 Majesty+0.033
Neg logit contributions
 distance-0.031
Mark-0.030
 moment-0.029
Rex-0.029
Big-0.028
Tokenap
Token ID499
Feature activation-4.78e-02

Pos logit contributions
annah+0.373
ott+0.354
 cabinet+0.350
 Majesty+0.339
 te+0.333
Neg logit contributions
Rex-0.371
By-0.350
Mark-0.336
Kim-0.332
Thunder-0.331
Tokenot
Token ID313
Feature activation-9.99e-02

Pos logit contributions
Mark+0.073
Rex+0.071
By+0.070
Big+0.069
Time+0.068
Neg logit contributions
 te-0.076
ott-0.076
 Majesty-0.072
 cabinet-0.070
 Av-0.067
Token out
Token ID503
Feature activation+9.34e-02

Pos logit contributions
Mark+0.139
Rex+0.134
 distance+0.132
By+0.131
Big+0.131
Neg logit contributions
 te-0.171
ott-0.169
 Majesty-0.163
 cabinet-0.156
 Av-0.152
Token of
Token ID286
Feature activation-5.15e-02

Pos logit contributions
 te+0.154
ott+0.144
 cabinet+0.139
 Majesty+0.139
 Flor+0.134
Neg logit contributions
Rex-0.141
Mark-0.140
Josh-0.134
Rick-0.133
Time-0.131
Token clay
Token ID21558
Feature activation+8.23e-02

Pos logit contributions
Mark+0.053
Rex+0.052
 distance+0.050
Big+0.049
By+0.049
Neg logit contributions
ott-0.107
 Majesty-0.103
 te-0.103
 cabinet-0.098
annah-0.097
Token poured
Token ID19036
Feature activation+4.84e-02

Pos logit contributions
ott+0.095
 Majesty+0.095
 cabinet+0.089
annah+0.089
 te+0.088
Neg logit contributions
Mark-0.053
By-0.051
Rex-0.051
Big-0.049
Rick-0.047
Token some
Token ID617
Feature activation-4.18e-03

Pos logit contributions
ott+0.090
 te+0.089
 Majesty+0.083
 cabinet+0.083
 Flor+0.082
Neg logit contributions
Mark-0.062
Rex-0.059
By-0.058
Big-0.057
Rick-0.057
Token water
Token ID1660
Feature activation+2.72e-02

Pos logit contributions
Rex+0.006
Mark+0.006
 distance+0.006
By+0.006
Time+0.006
Neg logit contributions
ott-0.007
 Majesty-0.007
 te-0.006
 cabinet-0.006
annah-0.006
Token from
Token ID422
Feature activation-1.57e-02

Pos logit contributions
 te+0.054
ott+0.053
 cabinet+0.050
 Majesty+0.049
 Flor+0.049
Neg logit contributions
Mark-0.032
Rex-0.030
By-0.029
Big-0.029
Rick-0.029
Token the
Token ID262
Feature activation-3.91e-02

Pos logit contributions
Mark+0.016
Rex+0.015
Big+0.015
 distance+0.015
What+0.014
Neg logit contributions
ott-0.033
 Majesty-0.033
 te-0.032
 majesty-0.030
oppy-0.030
Token te
Token ID573
Feature activation+2.59e-01

Pos logit contributions
 distance+0.073
Rex+0.072
Time+0.069
Mark+0.069
What+0.068
Neg logit contributions
ott-0.049
 Majesty-0.048
oppy-0.042
annah-0.041
igg-0.039
Tokenap
Token ID499
Feature activation-4.83e-02

Pos logit contributions
ott+0.386
 Majesty+0.363
 cabinet+0.353
 Flor+0.338
 wardrobe+0.337
Neg logit contributions
Rex-0.360
Time-0.344
By-0.331
 injuries-0.329
 signals-0.325
Tokenot
Token ID313
Feature activation-9.86e-02

Pos logit contributions
Mark+0.087
Rex+0.086
By+0.084
Big+0.083
Rick+0.082
Neg logit contributions
ott-0.063
 te-0.063
 Majesty-0.059
 cabinet-0.056
annah-0.054
Token and
Token ID290
Feature activation-1.38e-02

Pos logit contributions
Mark+0.121
Rex+0.119
By+0.115
Big+0.114
 distance+0.113
Neg logit contributions
ott-0.185
 te-0.179
 Majesty-0.178
 cabinet-0.170
annah-0.165
Token handed
Token ID10158
Feature activation+6.37e-02

Pos logit contributions
Mark+0.018
Rex+0.018
By+0.017
Big+0.017
 distance+0.017
Neg logit contributions
ott-0.025
 Majesty-0.024
 te-0.024
 cabinet-0.022
annah-0.022
Token the
Token ID262
Feature activation-3.80e-02

Pos logit contributions
ott+0.115
 te+0.112
 Majesty+0.111
 cabinet+0.106
annah+0.103
Neg logit contributions
Mark-0.083
Rex-0.081
By-0.078
Big-0.078
 distance-0.077
Token cups
Token ID14180
Feature activation+4.36e-02

Pos logit contributions
ott+0.177
 Majesty+0.170
 te+0.166
 cabinet+0.158
annah+0.157
Neg logit contributions
Mark-0.171
Rex-0.169
Big-0.164
By-0.163
 distance-0.163
Token and
Token ID290
Feature activation-1.90e-02

Pos logit contributions
ott+0.085
 Majesty+0.085
 te+0.082
 cabinet+0.078
 Flor+0.077
Neg logit contributions
Rex-0.048
Mark-0.048
 distance-0.047
By-0.046
Big-0.045
Token plates
Token ID13375
Feature activation+9.66e-03

Pos logit contributions
Mark+0.034
Rex+0.034
By+0.033
Big+0.033
 distance+0.033
Neg logit contributions
ott-0.025
 te-0.024
 Majesty-0.023
 cabinet-0.022
annah-0.021
Token and
Token ID290
Feature activation-1.86e-02

Pos logit contributions
ott+0.017
 te+0.016
 Majesty+0.016
 cabinet+0.015
annah+0.015
Neg logit contributions
Mark-0.013
Rex-0.013
By-0.012
Big-0.012
 distance-0.012
Token a
Token ID257
Feature activation-1.35e-02

Pos logit contributions
Mark+0.033
Rex+0.032
By+0.032
Big+0.031
 distance+0.031
Neg logit contributions
ott-0.025
 te-0.024
 Majesty-0.024
 cabinet-0.022
annah-0.021
Token te
Token ID573
Feature activation+2.58e-01

Pos logit contributions
Rex+0.022
Mark+0.022
 distance+0.022
By+0.022
Time+0.021
Neg logit contributions
ott-0.020
 Majesty-0.018
annah-0.017
 te-0.017
oppy-0.016
Tokenap
Token ID499
Feature activation-4.72e-02

Pos logit contributions
ott+0.383
 cabinet+0.348
 Majesty+0.342
annah+0.320
 wardrobe+0.315
Neg logit contributions
By-0.381
Rex-0.378
Time-0.368
 signals-0.349
 response-0.343
Tokenot
Token ID313
Feature activation-9.70e-02

Pos logit contributions
Mark+0.090
Rex+0.088
Big+0.087
Rick+0.086
By+0.086
Neg logit contributions
 te-0.062
ott-0.056
 Majesty-0.054
 cabinet-0.052
annah-0.050
Token.
Token ID13
Feature activation-5.33e-03

Pos logit contributions
Mark+0.134
Rex+0.130
Big+0.127
By+0.126
 distance+0.126
Neg logit contributions
 te-0.167
ott-0.166
 Majesty-0.159
 cabinet-0.152
annah-0.149
Token
Token ID198
Feature activation-6.60e-03

Pos logit contributions
What+0.005
Mark+0.005
By+0.005
Time+0.005
Rex+0.005
Neg logit contributions
 Majesty-0.012
ott-0.011
 wardrobe-0.011
 majesty-0.010
oppy-0.010
Token
Token ID198
Feature activation-8.21e-03

Pos logit contributions
Mark+0.008
Rex+0.008
 distance+0.008
By+0.008
Big+0.008
Neg logit contributions
ott-0.012
 te-0.011
 Majesty-0.011
 cabinet-0.011
 Flor-0.010
Token and
Token ID290
Feature activation-1.98e-02

Pos logit contributions
Mark+0.064
Rex+0.063
By+0.060
Big+0.060
 distance+0.059
Neg logit contributions
ott-0.093
 te-0.090
 Majesty-0.089
 cabinet-0.085
annah-0.083
Token his
Token ID465
Feature activation-1.43e-02

Pos logit contributions
Mark+0.022
Rex+0.022
By+0.022
Big+0.021
Time+0.021
Neg logit contributions
ott-0.039
 te-0.038
 Majesty-0.038
 cabinet-0.036
 Flor-0.035
Token friend
Token ID1545
Feature activation+1.68e-02

Pos logit contributions
Mark+0.022
Rex+0.022
By+0.021
Big+0.021
 distance+0.021
Neg logit contributions
ott-0.022
 te-0.021
 Majesty-0.021
 cabinet-0.020
annah-0.019
Token found
Token ID1043
Feature activation+8.60e-02

Pos logit contributions
ott+0.032
 te+0.031
 Majesty+0.031
 cabinet+0.030
annah+0.029
Neg logit contributions
Mark-0.020
Rex-0.019
By-0.019
Big-0.019
 distance-0.018
Token the
Token ID262
Feature activation-4.89e-02

Pos logit contributions
ott+0.184
 te+0.179
 Majesty+0.176
 cabinet+0.169
annah+0.168
Neg logit contributions
Mark-0.085
Rex-0.082
Big-0.078
 distance-0.077
By-0.076
Token te
Token ID573
Feature activation+2.58e-01

Pos logit contributions
Mark+0.080
Rex+0.078
By+0.076
Big+0.075
Kim+0.074
Neg logit contributions
ott-0.073
 te-0.072
 Majesty-0.071
 cabinet-0.068
 wardrobe-0.064
Tokenash
Token ID1077
Feature activation-2.10e-02

Pos logit contributions
ott+0.366
 cabinet+0.343
 Majesty+0.328
 te+0.325
annah+0.323
Neg logit contributions
Rex-0.386
By-0.374
Mark-0.371
Time-0.363
Thunder-0.352
Tokenop
Token ID404
Feature activation-8.01e-03

Pos logit contributions
Mark+0.031
Rex+0.030
By+0.029
Big+0.029
Time+0.029
Neg logit contributions
ott-0.035
 Majesty-0.033
 te-0.033
 cabinet-0.031
annah-0.030
Token.
Token ID13
Feature activation-1.36e-02

Pos logit contributions
Mark+0.011
Rex+0.010
Big+0.010
 distance+0.010
By+0.010
Neg logit contributions
ott-0.014
 Majesty-0.014
 te-0.014
 cabinet-0.013
annah-0.012
Token<|endoftext|>
Token ID50256
Feature activation-1.30e-01

Pos logit contributions
Mark+0.017
Rex+0.016
Big+0.016
By+0.016
Kim+0.015
Neg logit contributions
ott-0.026
 Majesty-0.025
 te-0.025
 wardrobe-0.024
 cabinet-0.024
TokenM
Token ID44
Feature activation-3.66e-02

Pos logit contributions
Mark+0.168
Rex+0.165
Big+0.162
By+0.162
 distance+0.159
Neg logit contributions
ott-0.233
 te-0.224
 cabinet-0.213
 Majesty-0.212
 Flor-0.207
Token his
Token ID465
Feature activation-8.75e-03

Pos logit contributions
Mark+0.101
Rex+0.098
By+0.097
 distance+0.094
Big+0.092
Neg logit contributions
ott-0.206
 te-0.200
 Majesty-0.197
 cabinet-0.191
annah-0.187
Token friend
Token ID1545
Feature activation+2.43e-02

Pos logit contributions
Rex+0.014
Mark+0.014
By+0.014
Big+0.014
 distance+0.014
Neg logit contributions
ott-0.013
 Majesty-0.012
 te-0.012
annah-0.011
 cabinet-0.011
Token was
Token ID373
Feature activation+2.65e-02

Pos logit contributions
ott+0.047
 Majesty+0.045
 te+0.045
 cabinet+0.043
annah+0.043
Neg logit contributions
Mark-0.029
Rex-0.028
By-0.027
Big-0.026
 distance-0.026
Token at
Token ID379
Feature activation+1.76e-02

Pos logit contributions
ott+0.048
 te+0.047
 Majesty+0.046
 cabinet+0.044
annah+0.043
Neg logit contributions
Mark-0.034
Rex-0.034
By-0.033
 distance-0.032
Big-0.032
Token the
Token ID262
Feature activation-4.30e-02

Pos logit contributions
ott+0.037
 te+0.036
 Majesty+0.036
 cabinet+0.034
annah+0.034
Neg logit contributions
Mark-0.018
Rex-0.017
 distance-0.016
Big-0.016
By-0.016
Token te
Token ID573
Feature activation+2.58e-01

Pos logit contributions
 distance+0.078
Time+0.076
Rex+0.073
By+0.073
Mark+0.073
Neg logit contributions
ott-0.056
 Majesty-0.052
oppy-0.050
annah-0.049
 majesty-0.046
Tokenash
Token ID1077
Feature activation-1.61e-02

Pos logit contributions
ott+0.356
 Majesty+0.352
 cabinet+0.351
 wardrobe+0.334
 Flor+0.329
Neg logit contributions
Rex-0.363
Time-0.354
Thunder-0.345
Mark-0.345
 injuries-0.335
Tokenop
Token ID404
Feature activation-4.35e-03

Pos logit contributions
Mark+0.023
Rex+0.022
Big+0.021
Time+0.021
By+0.020
Neg logit contributions
 Majesty-0.028
ott-0.027
 te-0.026
 majesty-0.025
annah-0.024
Token.
Token ID13
Feature activation-1.57e-02

Pos logit contributions
Mark+0.005
Rex+0.005
 distance+0.005
Big+0.005
Kim+0.005
Neg logit contributions
ott-0.008
 Majesty-0.008
 te-0.008
 majesty-0.007
 Flor-0.007
Token They
Token ID1119
Feature activation-5.24e-02

Pos logit contributions
Mark+0.014
Big+0.013
Rex+0.012
By+0.011
What+0.011
Neg logit contributions
ott-0.036
 Majesty-0.035
oppy-0.034
 wardrobe-0.034
anda-0.033
Token all
Token ID477
Feature activation+1.08e-01

Pos logit contributions
Mark+0.069
Rex+0.067
By+0.064
Time+0.064
Big+0.064
Neg logit contributions
ott-0.095
 te-0.093
 Majesty-0.091
 cabinet-0.087
 Flor-0.085
Token poured
Token ID19036
Feature activation+4.61e-02

Pos logit contributions
ott+0.296
 Majesty+0.291
 cabinet+0.271
oppy+0.270
annah+0.268
Neg logit contributions
Rex-0.141
Mark-0.136
By-0.135
Big-0.125
 distance-0.124
Token some
Token ID617
Feature activation-6.43e-03

Pos logit contributions
 te+0.085
ott+0.085
 Av+0.078
 cabinet+0.078
 Flor+0.077
Neg logit contributions
Mark-0.061
Rex-0.057
By-0.056
Rick-0.056
Big-0.056
Token water
Token ID1660
Feature activation+2.74e-02

Pos logit contributions
Mark+0.010
Rex+0.010
By+0.009
 distance+0.009
Big+0.009
Neg logit contributions
ott-0.010
 Majesty-0.010
 te-0.010
 cabinet-0.009
annah-0.009
Token from
Token ID422
Feature activation-1.66e-02

Pos logit contributions
 te+0.055
ott+0.054
 cabinet+0.051
 Flor+0.049
 Majesty+0.049
Neg logit contributions
Mark-0.033
Rex-0.030
Rick-0.029
Big-0.029
By-0.029
Token the
Token ID262
Feature activation-4.06e-02

Pos logit contributions
Mark+0.019
Rex+0.019
Big+0.018
By+0.018
 distance+0.017
Neg logit contributions
ott-0.033
 te-0.032
 Majesty-0.032
 cabinet-0.030
annah-0.030
Token te
Token ID573
Feature activation+2.58e-01

Pos logit contributions
 distance+0.078
Rex+0.078
Mark+0.075
Time+0.075
By+0.074
Neg logit contributions
ott-0.049
 Majesty-0.046
oppy-0.040
annah-0.040
 te-0.039
Tokenap
Token ID499
Feature activation-4.91e-02

Pos logit contributions
ott+0.380
 Majesty+0.351
 cabinet+0.343
 wardrobe+0.327
 Flor+0.325
Neg logit contributions
Rex-0.383
Time-0.367
By-0.352
 injuries-0.350
Thunder-0.347
Tokenot
Token ID313
Feature activation-9.75e-02

Pos logit contributions
Mark+0.092
Rex+0.090
By+0.089
Big+0.088
Rick+0.088
Neg logit contributions
 te-0.062
ott-0.060
 Majesty-0.056
 cabinet-0.054
 Av-0.052
Token.
Token ID13
Feature activation-3.65e-03

Pos logit contributions
Mark+0.125
Rex+0.121
By+0.118
Big+0.117
 distance+0.117
Neg logit contributions
ott-0.177
 te-0.175
 Majesty-0.169
 cabinet-0.163
annah-0.159
Token She
Token ID1375
Feature activation+5.01e-02

Pos logit contributions
Mark+0.005
Rex+0.004
By+0.004
Big+0.004
What+0.004
Neg logit contributions
ott-0.007
 Majesty-0.007
 te-0.007
 wardrobe-0.006
oppy-0.006
Token was
Token ID373
Feature activation+3.34e-02

Pos logit contributions
ott+0.100
 Majesty+0.100
 cabinet+0.094
annah+0.093
 te+0.091
Neg logit contributions
Rex-0.054
Mark-0.054
By-0.052
Big-0.050
Time-0.048
Token From
Token ID3574
Feature activation+6.45e-02

Pos logit contributions
By+0.011
Big+0.011
Mark+0.011
Rex+0.011
What+0.010
Neg logit contributions
ott-0.020
 Majesty-0.020
 wardrobe-0.018
anda-0.018
 cabinet-0.018
Token then
Token ID788
Feature activation-5.97e-02

Pos logit contributions
ott+0.125
 te+0.123
 Majesty+0.120
 cabinet+0.115
 Flor+0.114
Neg logit contributions
Mark-0.076
Rex-0.073
By-0.071
Time-0.070
Rick-0.069
Token on
Token ID319
Feature activation-6.68e-02

Pos logit contributions
Mark+0.052
Rex+0.050
 distance+0.049
Thunder+0.046
Big+0.046
Neg logit contributions
 te-0.131
ott-0.130
 Majesty-0.125
 cabinet-0.124
annah-0.122
Token,
Token ID11
Feature activation+4.78e-02

Pos logit contributions
Mark+0.104
Rex+0.102
By+0.099
Big+0.099
 distance+0.098
Neg logit contributions
ott-0.105
 te-0.101
 Majesty-0.100
 cabinet-0.094
annah-0.091
Token the
Token ID262
Feature activation-4.32e-02

Pos logit contributions
ott+0.110
 te+0.108
 Majesty+0.106
 cabinet+0.103
 Flor+0.101
Neg logit contributions
Mark-0.039
Rex-0.038
By-0.035
 distance-0.035
Big-0.035
Token te
Token ID573
Feature activation+2.58e-01

Pos logit contributions
 distance+0.091
What+0.090
Rex+0.089
Time+0.089
By+0.089
Neg logit contributions
ott-0.040
 Majesty-0.038
annah-0.032
oppy-0.031
 te-0.030
Tokenap
Token ID499
Feature activation-5.05e-02

Pos logit contributions
 cabinet+0.371
 Majesty+0.368
ott+0.368
 wardrobe+0.351
annah+0.346
Neg logit contributions
By-0.329
Time-0.327
Rex-0.316
Thunder-0.302
med-0.301
Tokenot
Token ID313
Feature activation-1.03e-01

Pos logit contributions
Mark+0.068
Rex+0.067
By+0.065
Big+0.065
 distance+0.064
Neg logit contributions
ott-0.088
 te-0.086
 Majesty-0.084
 cabinet-0.080
annah-0.078
Token and
Token ID290
Feature activation-2.20e-02

Pos logit contributions
Mark+0.132
Rex+0.128
Big+0.124
 distance+0.124
By+0.124
Neg logit contributions
ott-0.185
 te-0.183
 Majesty-0.177
 cabinet-0.169
annah-0.166
Token teaspoon
Token ID22326
Feature activation+1.38e-01

Pos logit contributions
Mark+0.024
Rex+0.024
By+0.023
Big+0.023
 distance+0.023
Neg logit contributions
ott-0.044
 te-0.043
 Majesty-0.043
 cabinet-0.041
annah-0.040
Token explored
Token ID18782
Feature activation-5.03e-02

Pos logit contributions
ott+0.255
 te+0.250
 Majesty+0.244
 cabinet+0.234
annah+0.226
Neg logit contributions
Mark-0.174
Rex-0.170
By-0.164
Big-0.163
 distance-0.161
Tokenoons
Token ID13022
Feature activation+5.04e-02

Pos logit contributions
 Majesty+0.140
 Flor+0.128
 Av+0.120
 te+0.118
 majesty+0.118
Neg logit contributions
Mark-0.075
Time-0.075
 brewing-0.073
lights-0.072
Little-0.071
Token.
Token ID13
Feature activation-7.99e-03

Pos logit contributions
ott+0.097
 Majesty+0.097
 te+0.091
 majesty+0.087
annah+0.087
Neg logit contributions
 distance-0.057
Rex-0.057
Mark-0.057
By-0.056
Big-0.054
Token
Token ID198
Feature activation-9.64e-03

Pos logit contributions
Mark+0.008
By+0.008
Rex+0.008
Big+0.008
What+0.007
Neg logit contributions
 Majesty-0.017
ott-0.017
 te-0.016
 wardrobe-0.016
 cabinet-0.016
Token
Token ID198
Feature activation-1.03e-02

Pos logit contributions
 distance+0.013
Rex+0.013
Mark+0.013
 brake+0.013
By+0.013
Neg logit contributions
ott-0.016
 te-0.015
 Majesty-0.014
 Flor-0.014
annah-0.014
TokenThe
Token ID464
Feature activation-1.07e-01

Pos logit contributions
 distance+0.009
 injuries+0.009
Rex+0.009
 brake+0.009
Rick+0.009
Neg logit contributions
ott-0.020
 te-0.020
 Av-0.019
annah-0.019
 Majesty-0.018
Token te
Token ID573
Feature activation+2.58e-01

Pos logit contributions
Mark+0.203
Rex+0.202
By+0.199
Time+0.197
 distance+0.197
Neg logit contributions
ott-0.127
 Majesty-0.121
 te-0.116
annah-0.107
 cabinet-0.107
Tokenap
Token ID499
Feature activation-5.03e-02

Pos logit contributions
 Majesty+0.446
annah+0.424
ott+0.413
 cabinet+0.411
 wardrobe+0.407
Neg logit contributions
Time-0.280
Rex-0.265
By-0.262
Thunder-0.250
Mark-0.245
Tokenot
Token ID313
Feature activation-1.02e-01

Pos logit contributions
Mark+0.071
Rex+0.070
By+0.069
Big+0.067
 distance+0.067
Neg logit contributions
ott-0.085
 Majesty-0.081
 te-0.081
 cabinet-0.077
annah-0.074
Token shouted
Token ID17293
Feature activation-1.14e-02

Pos logit contributions
Mark+0.126
Rex+0.123
Big+0.119
By+0.119
 distance+0.118
Neg logit contributions
ott-0.190
 te-0.187
 Majesty-0.182
 cabinet-0.174
annah-0.170
Token,
Token ID11
Feature activation+5.37e-02

Pos logit contributions
By+0.014
Mark+0.013
What+0.013
Rex+0.013
 louder+0.013
Neg logit contributions
ott-0.022
 Majesty-0.020
 te-0.019
annah-0.019
 cabinet-0.019
Token "
Token ID366
Feature activation+2.11e-02

Pos logit contributions
ott+0.085
 te+0.082
 Majesty+0.081
 cabinet+0.076
annah+0.074
Neg logit contributions
Mark-0.082
Rex-0.081
By-0.079
Big-0.078
 distance-0.077
Token could
Token ID714
Feature activation-1.34e-02

Pos logit contributions
ott+0.016
 Majesty+0.015
 te+0.015
 cabinet+0.014
annah+0.014
Neg logit contributions
Mark-0.008
Rex-0.008
By-0.008
 distance-0.008
Big-0.008
Token go
Token ID467
Feature activation-2.51e-02

Pos logit contributions
Mark+0.015
Rex+0.015
By+0.014
 distance+0.014
Big+0.014
Neg logit contributions
ott-0.027
 te-0.026
 Majesty-0.026
 cabinet-0.025
annah-0.024
Token exploring
Token ID13504
Feature activation-5.13e-03

Pos logit contributions
Mark+0.031
Rex+0.031
 distance+0.030
By+0.029
Big+0.029
Neg logit contributions
ott-0.047
 te-0.046
 Majesty-0.045
 cabinet-0.043
annah-0.042
Token with
Token ID351
Feature activation+1.17e-01

Pos logit contributions
Mark+0.006
Rex+0.006
 distance+0.005
Big+0.005
By+0.005
Neg logit contributions
ott-0.010
 te-0.010
 Majesty-0.010
 cabinet-0.009
annah-0.009
Token the
Token ID262
Feature activation-3.73e-02

Pos logit contributions
ott+0.258
 te+0.246
 Majesty+0.245
annah+0.236
 cabinet+0.235
Neg logit contributions
Mark-0.106
Rex-0.104
 distance-0.101
Big-0.101
By-0.096
Token te
Token ID573
Feature activation+2.57e-01

Pos logit contributions
Mark+0.058
Rex+0.058
 distance+0.058
By+0.056
Big+0.056
Neg logit contributions
ott-0.057
 Majesty-0.053
 te-0.052
annah-0.049
 cabinet-0.048
Tokenap
Token ID499
Feature activation-4.88e-02

Pos logit contributions
ott+0.385
 cabinet+0.366
 Majesty+0.344
 wardrobe+0.339
 Flor+0.339
Neg logit contributions
Time-0.320
By-0.312
 injuries-0.306
Rex-0.304
 response-0.299
Tokenot
Token ID313
Feature activation-1.01e-01

Pos logit contributions
Mark+0.069
Rex+0.068
By+0.066
Big+0.066
 distance+0.065
Neg logit contributions
ott-0.082
 te-0.080
 Majesty-0.078
 cabinet-0.075
annah-0.072
Token.
Token ID13
Feature activation-2.26e-03

Pos logit contributions
Mark+0.131
Rex+0.129
By+0.125
Big+0.123
 distance+0.122
Neg logit contributions
ott-0.184
 te-0.176
 Majesty-0.176
 cabinet-0.167
 Flor-0.161
Token
Token ID198
Feature activation-7.97e-03

Pos logit contributions
Mark+0.002
Big+0.002
Rex+0.002
By+0.002
What+0.002
Neg logit contributions
 Majesty-0.005
ott-0.005
 te-0.005
 wardrobe-0.005
 cabinet-0.004
Token
Token ID198
Feature activation-9.88e-03

Pos logit contributions
Mark+0.010
Rex+0.010
 distance+0.010
By+0.009
Time+0.009
Neg logit contributions
ott-0.015
 te-0.014
 Majesty-0.014
 cabinet-0.013
annah-0.013
Token found
Token ID1043
Feature activation+8.63e-02

Pos logit contributions
Mark+0.004
Rex+0.004
By+0.003
Big+0.003
 distance+0.003
Neg logit contributions
ott-0.007
 te-0.007
 Majesty-0.007
 cabinet-0.006
annah-0.006
Token a
Token ID257
Feature activation-1.33e-02

Pos logit contributions
ott+0.188
 te+0.184
 Majesty+0.183
annah+0.172
 cabinet+0.171
Neg logit contributions
Mark-0.079
Rex-0.078
 distance-0.074
Big-0.072
What-0.071
Token magical
Token ID10883
Feature activation+7.15e-02

Pos logit contributions
 distance+0.020
Mark+0.019
Rex+0.019
Time+0.018
What+0.018
Neg logit contributions
ott-0.022
 Majesty-0.020
 te-0.020
annah-0.020
 Flor-0.019
Token o
Token ID267
Feature activation+1.76e-02

Pos logit contributions
ott+0.106
 Majesty+0.100
 te+0.097
annah+0.093
 Flor+0.091
Neg logit contributions
 distance-0.115
Mark-0.110
Rex-0.108
Time-0.108
 moment-0.107
Tokenint
Token ID600
Feature activation-8.83e-02

Pos logit contributions
ott+0.027
 te+0.026
 Majesty+0.025
 cabinet+0.025
annah+0.024
Neg logit contributions
Mark-0.029
Rex-0.028
By-0.028
Big-0.027
What-0.027
Tokenment
Token ID434
Feature activation-3.02e-01

Pos logit contributions
Mark+0.125
Rex+0.121
By+0.119
Big+0.118
Time+0.116
Neg logit contributions
ott-0.153
 te-0.149
 Majesty-0.144
 cabinet-0.141
 wardrobe-0.135
Token.
Token ID13
Feature activation-4.39e-03

Pos logit contributions
Mark+0.393
Rex+0.387
By+0.375
 distance+0.371
Big+0.370
Neg logit contributions
ott-0.544
 te-0.527
 Majesty-0.523
 cabinet-0.495
annah-0.482
Token He
Token ID679
Feature activation-5.78e-02

Pos logit contributions
Mark+0.005
Rex+0.004
What+0.004
Big+0.004
Andy+0.004
Neg logit contributions
ott-0.009
 te-0.009
 Majesty-0.009
act-0.009
igg-0.009
Token touched
Token ID12615
Feature activation+8.47e-03

Pos logit contributions
Mark+0.066
Rex+0.063
By+0.063
Big+0.062
 distance+0.060
Neg logit contributions
ott-0.113
 Majesty-0.109
 te-0.109
 cabinet-0.105
annah-0.104
Token it
Token ID340
Feature activation-3.43e-03

Pos logit contributions
ott+0.016
 te+0.016
 Majesty+0.015
 Flor+0.015
 cabinet+0.015
Neg logit contributions
Mark-0.010
By-0.010
Rex-0.010
Time-0.010
Kim-0.009
Token and
Token ID290
Feature activation-1.74e-02

Pos logit contributions
Mark+0.005
Rex+0.005
By+0.004
 distance+0.004
Big+0.004
Neg logit contributions
ott-0.006
 te-0.006
 Majesty-0.006
 cabinet-0.005
annah-0.005
Token:
Token ID25
Feature activation+1.50e-02

Pos logit contributions
ott+0.096
 te+0.095
 Majesty+0.091
 cabinet+0.090
 Av+0.088
Neg logit contributions
Mark-0.057
Rex-0.056
Big-0.054
By-0.054
Rick-0.053
Token
Token ID198
Feature activation-3.26e-04

Pos logit contributions
ott+0.028
 te+0.027
 Majesty+0.027
 cabinet+0.026
annah+0.025
Neg logit contributions
Mark-0.019
Rex-0.018
What-0.018
Big-0.017
Time-0.017
Token
Token ID198
Feature activation-3.52e-03

Pos logit contributions
Mark+0.000
Rex+0.000
By+0.000
Big+0.000
 distance+0.000
Neg logit contributions
ott-0.001
 te-0.001
 Majesty-0.001
 cabinet-0.001
annah-0.001
Token"
Token ID1
Feature activation-3.52e-02

Pos logit contributions
Mark+0.004
Rex+0.003
By+0.003
Big+0.003
 distance+0.003
Neg logit contributions
ott-0.007
 te-0.007
 Majesty-0.007
 cabinet-0.007
annah-0.007
TokenMeasure
Token ID47384
Feature activation+9.06e-03

Pos logit contributions
Mark+0.032
Rex+0.031
 distance+0.031
Big+0.030
By+0.029
Neg logit contributions
ott-0.076
 te-0.075
 Majesty-0.072
 Flor-0.069
 Av-0.069
Tokenment
Token ID434
Feature activation-3.00e-01

Pos logit contributions
ott+0.015
 te+0.015
 Majesty+0.015
 cabinet+0.014
annah+0.013
Neg logit contributions
Mark-0.013
Rex-0.013
Big-0.012
By-0.012
Time-0.012
Token complete
Token ID1844
Feature activation-5.85e-03

Pos logit contributions
Mark+0.412
Rex+0.404
By+0.391
Big+0.391
 distance+0.388
Neg logit contributions
ott-0.519
 te-0.500
 Majesty-0.496
 cabinet-0.467
annah-0.454
Token.
Token ID13
Feature activation+6.98e-03

Pos logit contributions
Mark+0.008
Rex+0.007
Big+0.007
 distance+0.007
Time+0.007
Neg logit contributions
ott-0.011
 Majesty-0.010
 te-0.010
 cabinet-0.009
oppy-0.009
Token Height
Token ID27280
Feature activation-2.75e-02

Pos logit contributions
ott+0.011
 te+0.011
 Majesty+0.010
annah+0.010
 wardrobe+0.010
Neg logit contributions
Mark-0.010
Rex-0.010
Big-0.010
Kim-0.010
What-0.010
Token:
Token ID25
Feature activation+3.41e-03

Pos logit contributions
Mark+0.045
Rex+0.043
Big+0.043
Rick+0.042
Kim+0.042
Neg logit contributions
ott-0.041
 te-0.039
 Majesty-0.037
 cabinet-0.036
 Flor-0.034
Token 90
Token ID4101
Feature activation-7.14e-04

Pos logit contributions
ott+0.006
 te+0.006
 Majesty+0.006
 cabinet+0.005
annah+0.005
Neg logit contributions
Mark-0.005
Rex-0.005
Big-0.004
By-0.004
Time-0.004
Token it
Token ID340
Feature activation-3.43e-03

Pos logit contributions
ott+0.016
 te+0.016
 Majesty+0.015
 Flor+0.015
 cabinet+0.015
Neg logit contributions
Mark-0.010
By-0.010
Rex-0.010
Time-0.010
Kim-0.009
Token and
Token ID290
Feature activation-1.74e-02

Pos logit contributions
Mark+0.005
Rex+0.005
By+0.004
 distance+0.004
Big+0.004
Neg logit contributions
ott-0.006
 te-0.006
 Majesty-0.006
 cabinet-0.005
annah-0.005
Token the
Token ID262
Feature activation-4.54e-02

Pos logit contributions
Mark+0.021
Rex+0.020
By+0.020
Big+0.019
 distance+0.019
Neg logit contributions
ott-0.033
 te-0.032
 Majesty-0.032
 cabinet-0.031
annah-0.030
Token o
Token ID267
Feature activation+2.25e-02

Pos logit contributions
Mark+0.076
Rex+0.074
By+0.073
Big+0.072
Time+0.071
Neg logit contributions
ott-0.066
 te-0.065
 Majesty-0.064
 cabinet-0.061
 Flor-0.058
Tokenint
Token ID600
Feature activation-8.81e-02

Pos logit contributions
ott+0.034
 Majesty+0.032
 te+0.031
 cabinet+0.030
annah+0.029
Neg logit contributions
Mark-0.037
Rex-0.036
 distance-0.036
By-0.035
Big-0.035
Tokenment
Token ID434
Feature activation-3.00e-01

Pos logit contributions
Mark+0.119
Rex+0.117
By+0.113
 distance+0.112
Big+0.112
Neg logit contributions
ott-0.156
 te-0.152
 Majesty-0.149
 cabinet-0.141
 Flor-0.138
Token began
Token ID2540
Feature activation+7.62e-03

Pos logit contributions
Mark+0.396
Rex+0.388
By+0.374
Big+0.372
Time+0.368
Neg logit contributions
ott-0.539
 te-0.527
 Majesty-0.512
 cabinet-0.491
annah-0.478
Token to
Token ID284
Feature activation-2.11e-02

Pos logit contributions
ott+0.013
 te+0.013
 Majesty+0.013
 cabinet+0.012
 Flor+0.012
Neg logit contributions
Mark-0.011
Rex-0.010
By-0.010
Big-0.010
Time-0.010
Token glow
Token ID19634
Feature activation-4.21e-02

Pos logit contributions
Mark+0.030
Rex+0.030
By+0.028
Big+0.028
Time+0.028
Neg logit contributions
ott-0.036
 te-0.036
 Majesty-0.035
 cabinet-0.033
 Flor-0.032
Token.
Token ID13
Feature activation-7.35e-03

Pos logit contributions
Mark+0.065
Rex+0.064
By+0.062
Kim+0.061
Rick+0.061
Neg logit contributions
ott-0.067
 te-0.066
 Majesty-0.063
 cabinet-0.062
 Flor-0.060
Token Fred
Token ID8559
Feature activation-3.30e-02

Pos logit contributions
Mark+0.008
Rex+0.008
Big+0.007
By+0.007
What+0.007
Neg logit contributions
ott-0.015
 Majesty-0.015
 te-0.015
 wardrobe-0.014
 cabinet-0.014
Token full
Token ID1336
Feature activation-5.22e-02

Pos logit contributions
Mark+0.096
Rex+0.096
By+0.090
Big+0.089
Kim+0.089
Neg logit contributions
 te-0.117
ott-0.116
 Majesty-0.114
 Flor-0.106
 cabinet-0.106
Token of
Token ID286
Feature activation-6.55e-02

Pos logit contributions
Mark+0.064
Rex+0.063
Big+0.059
By+0.059
Rick+0.059
Neg logit contributions
ott-0.098
 te-0.098
 Majesty-0.094
 cabinet-0.091
 Av-0.088
Token happiness
Token ID12157
Feature activation-1.14e-01

Pos logit contributions
Mark+0.081
Rex+0.080
By+0.077
Big+0.076
Time+0.074
Neg logit contributions
ott-0.122
 te-0.120
 Majesty-0.118
 cabinet-0.113
 Flor-0.110
Token and
Token ID290
Feature activation-2.54e-02

Pos logit contributions
Mark+0.178
Rex+0.175
Kim+0.167
By+0.166
Thunder+0.165
Neg logit contributions
 te-0.176
ott-0.174
 Majesty-0.170
 cabinet-0.163
 wardrobe-0.159
Token content
Token ID2695
Feature activation-9.94e-02

Pos logit contributions
Mark+0.033
Rex+0.032
Rick+0.031
Kim+0.030
Big+0.030
Neg logit contributions
 te-0.045
ott-0.045
 Majesty-0.044
 Av-0.042
 Flor-0.042
Tokenment
Token ID434
Feature activation-2.98e-01

Pos logit contributions
Mark+0.140
Rex+0.138
Big+0.131
Rick+0.129
Kim+0.129
Neg logit contributions
ott-0.169
 Majesty-0.169
 te-0.167
 cabinet-0.156
 Flor-0.154
Token.
Token ID13
Feature activation-1.73e-02

Pos logit contributions
Mark+0.445
Rex+0.437
Big+0.420
Time+0.417
By+0.415
Neg logit contributions
ott-0.477
 te-0.472
 Majesty-0.455
 cabinet-0.433
annah-0.424
Token The
Token ID383
Feature activation-5.26e-02

Pos logit contributions
Mark+0.019
Rex+0.019
Big+0.018
By+0.018
Time+0.017
Neg logit contributions
ott-0.035
 te-0.035
 Majesty-0.035
 cabinet-0.033
 wardrobe-0.032
Token
Token ID220
Feature activation+2.28e-03

Pos logit contributions
Mark+0.079
Rex+0.077
Big+0.073
Rick+0.073
By+0.072
Neg logit contributions
ott-0.087
 Majesty-0.085
 te-0.084
 cabinet-0.080
 Av-0.077
Token wise
Token ID10787
Feature activation-2.20e-02

Pos logit contributions
ott+0.004
 te+0.004
 Majesty+0.004
 cabinet+0.004
annah+0.004
Neg logit contributions
Mark-0.003
Rex-0.003
By-0.003
Big-0.003
 distance-0.003
Token old
Token ID1468
Feature activation+8.04e-02

Pos logit contributions
 distance+0.028
Mark+0.028
By+0.028
Rex+0.027
Big+0.026
Neg logit contributions
ott-0.040
 te-0.039
 Majesty-0.036
oppy-0.036
 cabinet-0.036
Token
Token ID198
Feature activation-1.46e-02

Pos logit contributions
Mark+0.012
Rex+0.011
By+0.011
Big+0.011
What+0.011
Neg logit contributions
 te-0.023
ott-0.023
 Majesty-0.023
 wardrobe-0.022
 cabinet-0.021
Token
Token ID198
Feature activation-1.64e-02

Pos logit contributions
Rex+0.019
Mark+0.019
 distance+0.019
 brake+0.018
By+0.018
Neg logit contributions
ott-0.025
 te-0.024
 Majesty-0.023
annah-0.022
 cabinet-0.022
TokenThe
Token ID464
Feature activation-1.13e-01

Pos logit contributions
 brake+0.015
Rex+0.014
 injuries+0.014
 distance+0.014
 levers+0.014
Neg logit contributions
ott-0.033
 te-0.031
 Majesty-0.030
annah-0.030
 Av-0.029
Token o
Token ID267
Feature activation+2.31e-02

Pos logit contributions
Mark+0.184
Rex+0.173
Kim+0.170
Rick+0.168
By+0.168
Neg logit contributions
 te-0.179
ott-0.176
 Majesty-0.175
 cabinet-0.165
 wardrobe-0.162
Tokenint
Token ID600
Feature activation-8.72e-02

Pos logit contributions
ott+0.034
 Majesty+0.031
 te+0.028
annah+0.028
 cabinet+0.027
Neg logit contributions
 brake-0.040
Mark-0.039
 levers-0.038
Rex-0.038
Run-0.038
Tokenment
Token ID434
Feature activation-2.97e-01

Pos logit contributions
Mark+0.110
Rex+0.107
By+0.104
 distance+0.102
Big+0.102
Neg logit contributions
ott-0.166
 te-0.160
 Majesty-0.156
oppy-0.148
 cabinet-0.147
Token worked
Token ID3111
Feature activation+1.62e-02

Pos logit contributions
Mark+0.382
Rex+0.371
Rick+0.360
Time+0.358
Kim+0.358
Neg logit contributions
 te-0.548
ott-0.541
 cabinet-0.480
 Flor-0.480
 Av-0.477
Token like
Token ID588
Feature activation-9.73e-02

Pos logit contributions
ott+0.025
 te+0.024
 cabinet+0.023
 Majesty+0.022
 Av+0.022
Neg logit contributions
Rex-0.026
Mark-0.026
By-0.025
Big-0.024
Kim-0.024
Token magic
Token ID5536
Feature activation+9.56e-02

Pos logit contributions
Mark+0.109
Rex+0.108
By+0.103
Big+0.102
 distance+0.101
Neg logit contributions
ott-0.192
 te-0.188
 Majesty-0.186
 cabinet-0.178
 Flor-0.174
Token and
Token ID290
Feature activation-2.01e-02

Pos logit contributions
ott+0.159
 te+0.157
 Majesty+0.153
 cabinet+0.146
 Flor+0.141
Neg logit contributions
Mark-0.138
Rex-0.136
By-0.132
Big-0.131
Time-0.129
Token Fred
Token ID8559
Feature activation-3.33e-02

Pos logit contributions
Rex+0.026
Mark+0.025
Kim+0.024
Rick+0.024
By+0.024
Neg logit contributions
 Majesty-0.036
ott-0.035
 te-0.035
 cabinet-0.034
 Av-0.033
Token a
Token ID257
Feature activation-1.83e-02

Pos logit contributions
Mark+0.081
Rex+0.080
By+0.078
Big+0.077
 distance+0.077
Neg logit contributions
ott-0.096
 te-0.094
 Majesty-0.092
 cabinet-0.088
annah-0.085
Token deep
Token ID2769
Feature activation-1.06e-02

Pos logit contributions
Mark+0.028
Rex+0.028
By+0.027
Big+0.026
 distance+0.026
Neg logit contributions
ott-0.029
 te-0.028
 Majesty-0.028
 cabinet-0.026
annah-0.025
Token sense
Token ID2565
Feature activation-1.23e-01

Pos logit contributions
Mark+0.015
Rex+0.014
By+0.013
Big+0.013
Rick+0.013
Neg logit contributions
 te-0.019
ott-0.019
 Majesty-0.018
 cabinet-0.017
 wardrobe-0.017
Token of
Token ID286
Feature activation-6.38e-02

Pos logit contributions
Mark+0.160
Rex+0.158
Rick+0.149
Big+0.148
Kim+0.145
Neg logit contributions
 te-0.229
ott-0.229
 cabinet-0.211
 wardrobe-0.204
 Flor-0.202
Token content
Token ID2695
Feature activation-9.19e-02

Pos logit contributions
Mark+0.082
Rex+0.080
Rick+0.077
Big+0.076
Kim+0.074
Neg logit contributions
 te-0.117
ott-0.115
 Majesty-0.112
 cabinet-0.109
 wardrobe-0.108
Tokenment
Token ID434
Feature activation-2.95e-01

Pos logit contributions
Mark+0.140
Rex+0.137
Big+0.131
Rick+0.129
Andy+0.127
Neg logit contributions
 te-0.152
ott-0.149
 Majesty-0.148
 cabinet-0.142
 wardrobe-0.141
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
Mark+0.443
Rex+0.436
Big+0.417
Time+0.414
Rick+0.412
Neg logit contributions
ott-0.475
 te-0.470
 Majesty-0.447
 cabinet-0.433
annah-0.422
Token He
Token ID679
Feature activation-6.06e-02

Pos logit contributions
Mark+0.011
By+0.010
Rex+0.010
Time+0.010
Big+0.010
Neg logit contributions
 te-0.022
ott-0.022
 Majesty-0.021
oppy-0.020
 cabinet-0.020
Token was
Token ID373
Feature activation+2.00e-02

Pos logit contributions
Mark+0.073
Rex+0.072
By+0.069
Big+0.068
 distance+0.068
Neg logit contributions
ott-0.115
 te-0.112
 Majesty-0.111
 cabinet-0.106
annah-0.103
Token glad
Token ID9675
Feature activation-9.78e-02

Pos logit contributions
ott+0.034
 te+0.033
 Majesty+0.032
 cabinet+0.031
annah+0.030
Neg logit contributions
Mark-0.029
Rex-0.028
By-0.027
Big-0.027
 distance-0.027
Token he
Token ID339
Feature activation-1.19e-02

Pos logit contributions
Mark+0.138
Rex+0.137
Big+0.128
By+0.127
Kim+0.127
Neg logit contributions
ott-0.168
 Majesty-0.164
 te-0.162
 Av-0.154
 Flor-0.154
Token that
Token ID326
Feature activation+4.26e-02

Pos logit contributions
Mark+0.022
Rex+0.022
By+0.021
 distance+0.021
Big+0.021
Neg logit contributions
ott-0.027
 te-0.026
 Majesty-0.026
 cabinet-0.025
 Flor-0.024
Token the
Token ID262
Feature activation-4.87e-02

Pos logit contributions
ott+0.077
 te+0.076
 Majesty+0.074
 cabinet+0.072
 Flor+0.070
Neg logit contributions
Mark-0.055
Rex-0.055
By-0.052
Kim-0.051
Big-0.051
Token magic
Token ID5536
Feature activation+9.54e-02

Pos logit contributions
Mark+0.083
Rex+0.080
Kim+0.077
By+0.076
Big+0.075
Neg logit contributions
 te-0.074
ott-0.074
 Majesty-0.073
 cabinet-0.070
 wardrobe-0.069
Token o
Token ID267
Feature activation+2.62e-02

Pos logit contributions
ott+0.165
 te+0.163
 Majesty+0.159
 cabinet+0.153
 wardrobe+0.146
Neg logit contributions
Mark-0.134
Rex-0.131
By-0.126
Big-0.125
Kim-0.123
Tokenint
Token ID600
Feature activation-8.37e-02

Pos logit contributions
ott+0.039
 Majesty+0.037
 te+0.034
 cabinet+0.034
annah+0.033
Neg logit contributions
Mark-0.043
Rex-0.043
 distance-0.041
 brake-0.041
Big-0.041
Tokenment
Token ID434
Feature activation-2.94e-01

Pos logit contributions
Mark+0.110
Rex+0.108
By+0.105
Big+0.103
 distance+0.102
Neg logit contributions
ott-0.154
 te-0.147
 Majesty-0.146
 cabinet-0.141
oppy-0.136
Token was
Token ID373
Feature activation+2.46e-02

Pos logit contributions
Mark+0.389
Rex+0.380
Big+0.367
Rick+0.362
By+0.362
Neg logit contributions
ott-0.526
 te-0.521
 cabinet-0.487
 Majesty-0.483
 Flor-0.466
Token his
Token ID465
Feature activation-1.11e-02

Pos logit contributions
ott+0.039
 te+0.039
 Majesty+0.038
 cabinet+0.037
 Flor+0.036
Neg logit contributions
Rex-0.038
Mark-0.037
By-0.034
Kim-0.034
Rick-0.034
Token gift
Token ID6979
Feature activation-1.95e-02

Pos logit contributions
Mark+0.017
Rex+0.017
By+0.017
 distance+0.016
Big+0.016
Neg logit contributions
ott-0.017
 te-0.017
 Majesty-0.016
 cabinet-0.015
annah-0.015
Token and
Token ID290
Feature activation-2.17e-02

Pos logit contributions
Mark+0.027
Rex+0.027
By+0.026
Big+0.025
Time+0.025
Neg logit contributions
ott-0.034
 te-0.033
 Majesty-0.032
 cabinet-0.031
 wardrobe-0.030
Token that
Token ID326
Feature activation+4.43e-02

Pos logit contributions
Mark+0.026
Rex+0.025
By+0.024
Big+0.024
Time+0.023
Neg logit contributions
ott-0.042
 te-0.041
 Majesty-0.041
 cabinet-0.039
 Flor-0.038
Token of
Token ID286
Feature activation-5.99e-02

Pos logit contributions
Mark+0.058
Rex+0.057
What+0.055
By+0.054
Big+0.054
Neg logit contributions
ott-0.084
 te-0.084
 Majesty-0.079
 cabinet-0.079
 Flor-0.076
Token the
Token ID262
Feature activation-4.67e-02

Pos logit contributions
Rex+0.081
Mark+0.079
Kim+0.078
By+0.077
Rick+0.075
Neg logit contributions
ott-0.100
 te-0.099
 cabinet-0.097
 Majesty-0.096
 Av-0.094
Token magical
Token ID10883
Feature activation+7.62e-02

Pos logit contributions
Mark+0.080
Rex+0.077
By+0.074
Big+0.074
Kim+0.074
Neg logit contributions
ott-0.069
 te-0.068
 Majesty-0.067
 cabinet-0.063
 wardrobe-0.061
Token o
Token ID267
Feature activation+2.90e-02

Pos logit contributions
 te+0.123
ott+0.122
 Majesty+0.120
 cabinet+0.114
 wardrobe+0.109
Neg logit contributions
Mark-0.118
Rex-0.115
By-0.111
Big-0.110
Kim-0.108
Tokenint
Token ID600
Feature activation-7.93e-02

Pos logit contributions
ott+0.043
 Majesty+0.041
 te+0.039
 cabinet+0.038
annah+0.038
Neg logit contributions
Mark-0.050
Rex-0.048
By-0.046
Big-0.046
What-0.045
Tokenment
Token ID434
Feature activation-2.93e-01

Pos logit contributions
Mark+0.085
Rex+0.082
By+0.077
Big+0.075
Time+0.074
Neg logit contributions
ott-0.167
 te-0.164
 Majesty-0.162
 cabinet-0.156
 Flor-0.152
Token,
Token ID11
Feature activation+5.02e-02

Pos logit contributions
Mark+0.450
Rex+0.439
Big+0.424
Time+0.422
By+0.421
Neg logit contributions
ott-0.463
 te-0.458
 Majesty-0.431
 cabinet-0.422
 Flor-0.406
Token Fred
Token ID8559
Feature activation-2.77e-02

Pos logit contributions
 Majesty+0.064
 te+0.064
 Av+0.062
ott+0.062
 Flor+0.062
Neg logit contributions
Rex-0.085
Kim-0.083
Mark-0.083
Rick-0.082
Thunder-0.078
Token was
Token ID373
Feature activation+2.60e-02

Pos logit contributions
Mark+0.036
Rex+0.036
Time+0.034
Kim+0.034
By+0.034
Neg logit contributions
ott-0.050
 te-0.049
 Majesty-0.048
 cabinet-0.045
 Flor-0.045
Token able
Token ID1498
Feature activation-1.49e-02

Pos logit contributions
ott+0.040
 te+0.039
 Majesty+0.039
 Flor+0.036
 cabinet+0.035
Neg logit contributions
Rex-0.042
Mark-0.042
Kim-0.039
Big-0.039
By-0.038
Token to
Token ID284
Feature activation-1.92e-02

Pos logit contributions
Rex+0.025
Mark+0.025
Kim+0.023
Big+0.023
Rick+0.023
Neg logit contributions
ott-0.022
 te-0.021
 Majesty-0.021
 cabinet-0.019
 Av-0.019
TokenHe
Token ID1544
Feature activation-1.34e-01

Pos logit contributions
 brake+0.022
 distance+0.021
Rex+0.021
Mark+0.020
 levers+0.020
Neg logit contributions
ott-0.024
 te-0.021
 Majesty-0.020
 Flor-0.020
 Av-0.020
Token kept
Token ID4030
Feature activation-1.86e-02

Pos logit contributions
Mark+0.154
Rex+0.153
By+0.150
Big+0.146
 distance+0.144
Neg logit contributions
ott-0.260
 te-0.252
 Majesty-0.251
 cabinet-0.242
annah-0.235
Token the
Token ID262
Feature activation-4.66e-02

Pos logit contributions
Mark+0.027
Rex+0.027
By+0.025
 distance+0.025
Kim+0.025
Neg logit contributions
ott-0.031
 te-0.031
 Majesty-0.030
 cabinet-0.028
annah-0.028
Token o
Token ID267
Feature activation+2.90e-02

Pos logit contributions
Mark+0.078
Rex+0.074
Kim+0.071
Big+0.070
By+0.070
Neg logit contributions
 te-0.073
ott-0.072
 Majesty-0.071
 cabinet-0.067
 wardrobe-0.066
Tokenint
Token ID600
Feature activation-8.02e-02

Pos logit contributions
ott+0.041
 Majesty+0.039
annah+0.037
 te+0.036
oppy+0.035
Neg logit contributions
Mark-0.052
Rex-0.050
Run-0.049
What-0.047
Time-0.047
Tokenment
Token ID434
Feature activation-2.92e-01

Pos logit contributions
Mark+0.097
Rex+0.093
By+0.089
Time+0.087
Big+0.085
Neg logit contributions
ott-0.157
 te-0.152
 Majesty-0.151
 cabinet-0.144
oppy-0.142
Token in
Token ID287
Feature activation+5.63e-02

Pos logit contributions
Mark+0.387
Rex+0.377
By+0.360
Time+0.359
Big+0.358
Neg logit contributions
ott-0.524
 te-0.513
 Majesty-0.496
 cabinet-0.480
annah-0.469
Token his
Token ID465
Feature activation-9.22e-03

Pos logit contributions
 te+0.099
ott+0.097
 Majesty+0.093
 cabinet+0.093
 Av+0.090
Neg logit contributions
Rex-0.076
Mark-0.076
By-0.071
Kim-0.071
Thunder-0.070
Token drawer
Token ID33451
Feature activation+1.55e-01

Pos logit contributions
 distance+0.016
Rex+0.015
Mark+0.015
By+0.015
Big+0.015
Neg logit contributions
ott-0.013
 te-0.012
 Majesty-0.012
 Av-0.011
 cabinet-0.011
Token and
Token ID290
Feature activation-1.87e-02

Pos logit contributions
ott+0.280
 te+0.270
 Majesty+0.267
 cabinet+0.252
 Flor+0.248
Neg logit contributions
Mark-0.199
Rex-0.194
 distance-0.193
By-0.191
Big-0.188
Token used
Token ID973
Feature activation+9.32e-02

Pos logit contributions
Mark+0.022
Rex+0.022
By+0.021
Big+0.021
Time+0.020
Neg logit contributions
ott-0.036
 te-0.035
 Majesty-0.035
 cabinet-0.033
 Flor-0.032
Token place
Token ID1295
Feature activation+1.62e-02

Pos logit contributions
Mark+0.027
Rex+0.027
By+0.026
Big+0.026
 distance+0.026
Neg logit contributions
ott-0.030
 te-0.028
 Majesty-0.028
 cabinet-0.026
annah-0.026
Token of
Token ID286
Feature activation-5.73e-02

Pos logit contributions
ott+0.026
 te+0.025
 cabinet+0.024
 Majesty+0.024
 wardrobe+0.023
Neg logit contributions
Mark-0.024
Rex-0.024
By-0.024
Big-0.023
Time-0.023
Token peace
Token ID4167
Feature activation-5.54e-02

Pos logit contributions
Mark+0.082
Rex+0.079
By+0.079
Big+0.076
Kim+0.074
Neg logit contributions
ott-0.096
 te-0.096
 Majesty-0.092
 cabinet-0.090
 wardrobe-0.086
Token and
Token ID290
Feature activation-1.66e-02

Pos logit contributions
Mark+0.080
Rex+0.079
By+0.077
Big+0.076
Time+0.075
Neg logit contributions
ott-0.092
 te-0.090
 Majesty-0.088
 cabinet-0.084
 Flor-0.081
Token content
Token ID2695
Feature activation-9.26e-02

Pos logit contributions
Mark+0.025
Rex+0.025
Rick+0.024
Kim+0.023
Josh+0.023
Neg logit contributions
 Majesty-0.027
 te-0.026
 cabinet-0.025
ott-0.024
 wardrobe-0.024
Tokenment
Token ID434
Feature activation-2.92e-01

Pos logit contributions
Mark+0.141
Rex+0.139
Big+0.132
Rick+0.131
By+0.131
Neg logit contributions
 Majesty-0.149
ott-0.147
 te-0.146
 cabinet-0.138
 Flor-0.134
Token.
Token ID13
Feature activation-3.27e-03

Pos logit contributions
Mark+0.422
Rex+0.414
By+0.400
Big+0.398
Time+0.395
Neg logit contributions
ott-0.485
 te-0.473
 Majesty-0.463
 cabinet-0.441
annah-0.427
Token
Token ID220
Feature activation+1.50e-02

Pos logit contributions
Mark+0.003
Rex+0.003
Big+0.003
By+0.003
Rick+0.003
Neg logit contributions
 te-0.007
ott-0.007
 Majesty-0.007
annah-0.007
 cabinet-0.007
Token
Token ID198
Feature activation-8.57e-03

Pos logit contributions
ott+0.029
 te+0.029
 Majesty+0.028
 cabinet+0.027
annah+0.026
Neg logit contributions
Mark-0.017
Rex-0.017
By-0.016
Big-0.016
 distance-0.016
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
Rex+0.011
Mark+0.011
By+0.011
 distance+0.011
Time+0.011
Neg logit contributions
ott-0.015
 te-0.014
 Majesty-0.014
 cabinet-0.013
 Flor-0.013
TokenOne
Token ID3198
Feature activation+6.95e-03

Pos logit contributions
 distance+0.013
Rex+0.013
Mark+0.013
 brake+0.013
Rick+0.013
Neg logit contributions
ott-0.018
 te-0.017
 Majesty-0.016
 cabinet-0.016
 Flor-0.016
Token rub
Token ID6437
Feature activation+4.70e-02

Pos logit contributions
Mark+0.011
Rex+0.010
By+0.010
Rick+0.010
Big+0.010
Neg logit contributions
 te-0.013
 Majesty-0.012
ott-0.012
 cabinet-0.012
 Flor-0.011
Token the
Token ID262
Feature activation-3.97e-02

Pos logit contributions
ott+0.083
 te+0.082
 Majesty+0.079
 cabinet+0.076
 Flor+0.075
Neg logit contributions
Mark-0.064
Rex-0.061
By-0.061
 distance-0.059
Time-0.059
Token magical
Token ID10883
Feature activation+7.96e-02

Pos logit contributions
Mark+0.059
Rex+0.057
By+0.055
Big+0.055
Rick+0.054
Neg logit contributions
 te-0.065
ott-0.065
 Majesty-0.063
 cabinet-0.060
 wardrobe-0.058
Token o
Token ID267
Feature activation+2.62e-02

Pos logit contributions
ott+0.129
 te+0.126
 Majesty+0.124
 cabinet+0.118
 Flor+0.113
Neg logit contributions
Mark-0.118
Rex-0.116
By-0.113
Big-0.112
Time-0.110
Tokenint
Token ID600
Feature activation-7.91e-02

Pos logit contributions
ott+0.040
 te+0.039
 Majesty+0.038
 cabinet+0.036
annah+0.035
Neg logit contributions
Mark-0.043
Rex-0.041
By-0.040
What-0.040
Big-0.039
Tokenment
Token ID434
Feature activation-2.92e-01

Pos logit contributions
Mark+0.106
Rex+0.103
By+0.099
 distance+0.097
Big+0.097
Neg logit contributions
ott-0.142
 te-0.139
 Majesty-0.136
 cabinet-0.130
 Flor-0.126
Token on
Token ID319
Feature activation-6.49e-02

Pos logit contributions
Mark+0.376
Rex+0.373
By+0.367
 distance+0.364
Big+0.360
Neg logit contributions
ott-0.526
 Majesty-0.508
 te-0.501
 cabinet-0.475
annah-0.463
Token her
Token ID607
Feature activation+8.24e-02

Pos logit contributions
Mark+0.078
Rex+0.075
Big+0.075
 distance+0.075
By+0.073
Neg logit contributions
ott-0.125
 Majesty-0.119
 te-0.118
 cabinet-0.111
oppy-0.110
Token forehead
Token ID22645
Feature activation-1.28e-01

Pos logit contributions
ott+0.125
 te+0.120
 Majesty+0.118
 cabinet+0.111
annah+0.108
Neg logit contributions
Mark-0.131
Rex-0.130
By-0.126
Big-0.125
 distance-0.125
Token.
Token ID13
Feature activation+4.17e-05

Pos logit contributions
Mark+0.186
Rex+0.183
By+0.177
Big+0.176
 distance+0.174
Neg logit contributions
ott-0.210
 te-0.205
 Majesty-0.201
 cabinet-0.191
annah-0.185
Token She
Token ID1375
Feature activation+5.45e-02

Pos logit contributions
ott+0.000
 te+0.000
 Majesty+0.000
 wardrobe+0.000
 cabinet+0.000
Neg logit contributions
Mark-0.000
Big-0.000
By-0.000
Rex-0.000
Time-0.000
TokenAnna
Token ID31160
Feature activation+5.60e-02

Pos logit contributions
 distance+0.003
Mark+0.003
Rex+0.003
By+0.003
Big+0.003
Neg logit contributions
ott-0.007
 te-0.007
 Majesty-0.007
annah-0.006
 cabinet-0.006
Token picked
Token ID6497
Feature activation+6.85e-02

Pos logit contributions
ott+0.113
 Majesty+0.110
 te+0.108
 cabinet+0.105
oppy+0.101
Neg logit contributions
Mark-0.061
Rex-0.060
By-0.057
 distance-0.056
Big-0.055
Token the
Token ID262
Feature activation-4.11e-02

Pos logit contributions
ott+0.118
 Majesty+0.114
 te+0.109
 cabinet+0.106
 majesty+0.106
Neg logit contributions
Mark-0.092
Rex-0.091
What-0.090
Big-0.085
 distance-0.085
Token word
Token ID1573
Feature activation-1.48e-01

Pos logit contributions
Mark+0.065
Rex+0.063
Big+0.062
Kim+0.062
Time+0.062
Neg logit contributions
 te-0.063
 Majesty-0.062
ott-0.061
 cabinet-0.058
 Flor-0.056
Token "
Token ID366
Feature activation+2.07e-02

Pos logit contributions
Mark+0.186
Rex+0.182
By+0.175
 distance+0.173
Big+0.172
Neg logit contributions
ott-0.275
 Majesty-0.267
 te-0.262
 cabinet-0.247
annah-0.245
Tokenment
Token ID434
Feature activation-2.90e-01

Pos logit contributions
ott+0.037
 te+0.035
 Av+0.033
 Flor+0.032
 Majesty+0.031
Neg logit contributions
Mark-0.025
 distance-0.024
Josh-0.024
Rex-0.024
Time-0.023
Tokenion
Token ID295
Feature activation+4.86e-02

Pos logit contributions
Mark+0.419
Rex+0.412
By+0.399
Big+0.395
 distance+0.394
Neg logit contributions
ott-0.480
 te-0.469
 Majesty-0.459
 cabinet-0.437
annah-0.425
Token".
Token ID1911
Feature activation+1.56e-02

Pos logit contributions
ott+0.083
 Majesty+0.080
 te+0.079
 cabinet+0.074
 Flor+0.072
Neg logit contributions
Mark-0.067
Rex-0.066
By-0.064
Big-0.064
 distance-0.063
Token She
Token ID1375
Feature activation+5.41e-02

Pos logit contributions
 Majesty+0.026
ott+0.025
anda+0.023
 majesty+0.023
oppy+0.023
Neg logit contributions
What-0.023
Mark-0.022
The-0.021
When-0.021
Before-0.021
Token asked
Token ID1965
Feature activation+8.21e-02

Pos logit contributions
ott+0.104
 Majesty+0.103
 te+0.097
 cabinet+0.097
annah+0.095
Neg logit contributions
Mark-0.064
Rex-0.063
By-0.060
Big-0.060
Rick-0.059
Token Ben
Token ID3932
Feature activation+4.12e-02

Pos logit contributions
ott+0.144
 te+0.135
 Majesty+0.134
 cabinet+0.125
oppy+0.124
Neg logit contributions
Mark-0.115
Rex-0.111
Big-0.107
What-0.106
By-0.105
Token sense
Token ID2565
Feature activation-1.25e-01

Pos logit contributions
Mark+0.020
Rex+0.019
Big+0.018
Kim+0.018
Rick+0.018
Neg logit contributions
ott-0.019
 te-0.018
 Majesty-0.018
 cabinet-0.017
 Flor-0.016
Token of
Token ID286
Feature activation-5.76e-02

Pos logit contributions
Mark+0.124
Rex+0.120
By+0.115
Big+0.114
 distance+0.112
Neg logit contributions
ott-0.266
 te-0.261
 Majesty-0.256
 cabinet-0.247
annah-0.240
Token joy
Token ID8716
Feature activation-1.35e-01

Pos logit contributions
Mark+0.064
Rex+0.063
Big+0.060
By+0.059
Rick+0.059
Neg logit contributions
ott-0.116
 te-0.114
 Majesty-0.112
 cabinet-0.107
 Flor-0.105
Token and
Token ID290
Feature activation-1.70e-02

Pos logit contributions
Mark+0.226
Rex+0.223
Big+0.214
Rick+0.210
Kim+0.210
Neg logit contributions
ott-0.201
 te-0.200
 Majesty-0.189
 cabinet-0.186
 wardrobe-0.178
Token content
Token ID2695
Feature activation-9.16e-02

Pos logit contributions
Mark+0.023
Rex+0.023
Big+0.022
Rick+0.021
By+0.021
Neg logit contributions
 Majesty-0.030
ott-0.030
 te-0.030
 cabinet-0.028
 Av-0.027
Tokenment
Token ID434
Feature activation-2.90e-01

Pos logit contributions
Mark+0.122
Rex+0.121
Big+0.115
Rick+0.113
By+0.112
Neg logit contributions
ott-0.165
 te-0.162
 Majesty-0.161
 cabinet-0.152
 Flor-0.148
Token.
Token ID13
Feature activation-9.18e-03

Pos logit contributions
Mark+0.450
Rex+0.441
Big+0.426
Time+0.423
Rick+0.421
Neg logit contributions
ott-0.454
 te-0.448
 Majesty-0.426
 cabinet-0.410
annah-0.399
Token After
Token ID2293
Feature activation-6.06e-02

Pos logit contributions
Mark+0.009
Rex+0.009
By+0.009
Big+0.009
Time+0.008
Neg logit contributions
 te-0.020
ott-0.019
 Majesty-0.018
 wardrobe-0.018
 cabinet-0.018
Token a
Token ID257
Feature activation-1.49e-02

Pos logit contributions
Mark+0.079
Rex+0.078
 distance+0.077
Big+0.075
By+0.074
Neg logit contributions
ott-0.108
 te-0.106
 Majesty-0.103
 Av-0.099
 cabinet-0.099
Token few
Token ID1178
Feature activation-3.17e-02

Pos logit contributions
Rex+0.023
Mark+0.023
Kim+0.022
Rick+0.022
Josh+0.022
Neg logit contributions
 te-0.024
ott-0.021
 Av-0.021
 Flor-0.020
 Majesty-0.020
Token moments
Token ID7188
Feature activation-1.56e-01

Pos logit contributions
Mark+0.035
Rex+0.034
Big+0.034
Rick+0.033
Kim+0.033
Neg logit contributions
 te-0.064
ott-0.062
 Majesty-0.060
 cabinet-0.059
annah-0.057
Token'll
Token ID1183
Feature activation-2.81e-02

Pos logit contributions
ott+0.257
 te+0.253
 Majesty+0.251
 cabinet+0.240
annah+0.238
Neg logit contributions
Mark-0.156
Rex-0.155
By-0.153
 distance-0.148
Time-0.148
Token put
Token ID1234
Feature activation+4.48e-02

Pos logit contributions
Mark+0.024
Rex+0.024
By+0.024
Big+0.024
 distance+0.024
Neg logit contributions
ott-0.061
 Majesty-0.059
 te-0.058
 cabinet-0.057
annah-0.057
Token some
Token ID617
Feature activation-6.03e-03

Pos logit contributions
ott+0.073
 te+0.071
 Majesty+0.070
 cabinet+0.066
annah+0.064
Neg logit contributions
Mark-0.066
Rex-0.066
By-0.063
Big-0.063
 distance-0.062
Token o
Token ID267
Feature activation+2.81e-02

Pos logit contributions
Mark+0.009
Rex+0.008
By+0.008
Big+0.008
Rick+0.008
Neg logit contributions
ott-0.010
 te-0.010
 Majesty-0.010
 cabinet-0.009
 Flor-0.009
Tokenint
Token ID600
Feature activation-7.41e-02

Pos logit contributions
ott+0.035
 te+0.035
 Majesty+0.033
oppy+0.032
anda+0.031
Neg logit contributions
Mark-0.052
 distance-0.051
Big-0.048
What-0.048
By-0.047
Tokenment
Token ID434
Feature activation-2.89e-01

Pos logit contributions
Mark+0.106
Rex+0.101
By+0.100
 distance+0.099
Time+0.099
Neg logit contributions
ott-0.126
 te-0.124
 Majesty-0.121
 cabinet-0.114
 Flor-0.112
Token on
Token ID319
Feature activation-6.85e-02

Pos logit contributions
Mark+0.323
Rex+0.315
By+0.302
Big+0.299
 distance+0.296
Neg logit contributions
ott-0.577
 te-0.565
 Majesty-0.554
 cabinet-0.532
annah-0.519
Token it
Token ID340
Feature activation+3.64e-03

Pos logit contributions
Mark+0.106
Rex+0.105
Big+0.104
What+0.098
By+0.098
Neg logit contributions
ott-0.111
 Majesty-0.104
 te-0.102
 cabinet-0.096
 wardrobe-0.093
Token,"
Token ID553
Feature activation-3.01e-02

Pos logit contributions
ott+0.007
 Majesty+0.007
 te+0.006
 majesty+0.006
oppy+0.006
Neg logit contributions
 distance-0.004
Mark-0.004
Big-0.004
Rex-0.004
By-0.004
Token said
Token ID531
Feature activation+3.71e-02

Pos logit contributions
Mark+0.048
Rex+0.047
By+0.046
Big+0.046
 distance+0.046
Neg logit contributions
ott-0.046
 te-0.044
 Majesty-0.043
 cabinet-0.041
annah-0.039
Token his
Token ID465
Feature activation-6.51e-03

Pos logit contributions
ott+0.068
 Majesty+0.067
 te+0.065
 cabinet+0.062
annah+0.061
Neg logit contributions
Mark-0.048
Rex-0.046
Big-0.045
By-0.045
 distance-0.044
Token other
Token ID584
Feature activation+5.24e-02

Pos logit contributions
Mark+0.118
Rex+0.115
Kim+0.114
By+0.114
Rick+0.113
Neg logit contributions
 te-0.085
ott-0.078
annah-0.074
 Majesty-0.073
 cabinet-0.073
Token and
Token ID290
Feature activation-2.44e-02

Pos logit contributions
ott+0.084
 te+0.084
 Majesty+0.082
 cabinet+0.078
annah+0.076
Neg logit contributions
Mark-0.078
Rex-0.077
By-0.074
Time-0.073
Big-0.073
Token sighed
Token ID21893
Feature activation-1.03e-01

Pos logit contributions
Mark+0.032
Rex+0.032
Kim+0.030
Rick+0.030
Big+0.030
Neg logit contributions
 te-0.043
ott-0.043
 Majesty-0.042
 Flor-0.041
 Av-0.040
Token in
Token ID287
Feature activation+5.68e-02

Pos logit contributions
Rex+0.166
Mark+0.165
Rick+0.156
Big+0.155
Kim+0.154
Neg logit contributions
 te-0.155
ott-0.154
 Majesty-0.151
 cabinet-0.142
 Flor-0.141
Token content
Token ID2695
Feature activation-9.33e-02

Pos logit contributions
 te+0.092
 fashion+0.087
 Flor+0.085
 Av+0.084
 Majesty+0.084
Neg logit contributions
Rex-0.089
Mark-0.089
Rick-0.087
Kim-0.086
Andy-0.083
Tokenment
Token ID434
Feature activation-2.89e-01

Pos logit contributions
Mark+0.119
Rex+0.119
Big+0.110
Rick+0.110
Kim+0.108
Neg logit contributions
ott-0.174
 te-0.174
 Majesty-0.171
 cabinet-0.160
 Flor-0.160
Token.
Token ID13
Feature activation-1.96e-02

Pos logit contributions
Mark+0.421
Rex+0.414
By+0.400
Big+0.398
 distance+0.394
Neg logit contributions
ott-0.478
 te-0.466
 Majesty-0.456
 cabinet-0.433
annah-0.421
Token What
Token ID1867
Feature activation-8.27e-02

Pos logit contributions
Time+0.020
By+0.020
What+0.020
Mark+0.019
Big+0.019
Neg logit contributions
 te-0.040
ott-0.040
 Majesty-0.038
 cabinet-0.038
 wardrobe-0.037
Token a
Token ID257
Feature activation-1.54e-02

Pos logit contributions
Mark+0.100
Rex+0.100
 distance+0.094
Big+0.094
By+0.094
Neg logit contributions
ott-0.158
 te-0.154
 Majesty-0.150
 cabinet-0.144
 Flor-0.141
Token lovely
Token ID14081
Feature activation+2.09e-02

Pos logit contributions
Mark+0.023
Rex+0.023
Big+0.021
Kim+0.021
Rick+0.021
Neg logit contributions
ott-0.025
 te-0.025
 Majesty-0.025
 cabinet-0.023
 wardrobe-0.022
Token day
Token ID1110
Feature activation-6.44e-02

Pos logit contributions
 te+0.037
 Majesty+0.036
ott+0.036
 cabinet+0.035
 wardrobe+0.034
Neg logit contributions
Rex-0.030
Mark-0.028
Rick-0.027
Big-0.026
Kim-0.026
Token
Token ID198
Feature activation-1.13e-02

Pos logit contributions
Mark+0.010
Big+0.010
Kim+0.009
Rick+0.009
What+0.009
Neg logit contributions
 te-0.016
 Majesty-0.016
ott-0.015
anda-0.015
sie-0.014
Token
Token ID198
Feature activation-1.19e-02

Pos logit contributions
Mark+0.014
Rex+0.014
 distance+0.014
By+0.014
Time+0.013
Neg logit contributions
ott-0.020
 te-0.020
 Majesty-0.019
 cabinet-0.018
annah-0.018
TokenThe
Token ID464
Feature activation-1.05e-01

Pos logit contributions
 distance+0.010
Rex+0.009
Mark+0.009
 brake+0.009
 injuries+0.009
Neg logit contributions
ott-0.026
 te-0.025
 Majesty-0.024
annah-0.024
 cabinet-0.024
Token o
Token ID267
Feature activation+2.94e-02

Pos logit contributions
Mark+0.183
Big+0.173
Rick+0.172
Rex+0.170
Kim+0.169
Neg logit contributions
 te-0.155
 cabinet-0.153
ott-0.148
 Majesty-0.146
 wardrobe-0.142
Tokenint
Token ID600
Feature activation-7.51e-02

Pos logit contributions
ott+0.041
 Majesty+0.035
 te+0.034
 cabinet+0.034
annah+0.033
Neg logit contributions
 levers-0.053
 crashing-0.051
 distance-0.051
 brake-0.051
Run-0.049
Tokenment
Token ID434
Feature activation-2.87e-01

Pos logit contributions
Mark+0.097
Rex+0.095
 distance+0.094
By+0.094
Big+0.093
Neg logit contributions
ott-0.139
 te-0.134
 Majesty-0.131
 cabinet-0.126
oppy-0.123
Token made
Token ID925
Feature activation-6.38e-02

Pos logit contributions
Mark+0.380
Rex+0.375
Big+0.369
Rick+0.362
 distance+0.361
Neg logit contributions
 te-0.519
ott-0.514
 cabinet-0.463
 Av-0.448
 Flor-0.445
Token the
Token ID262
Feature activation-4.27e-02

Pos logit contributions
Mark+0.074
Rex+0.073
By+0.070
Big+0.069
 distance+0.069
Neg logit contributions
ott-0.124
 te-0.121
 Majesty-0.120
 cabinet-0.114
annah-0.111
Token itch
Token ID49700
Feature activation-1.29e-01

Pos logit contributions
Mark+0.080
Rex+0.073
Kim+0.073
Big+0.072
Andy+0.072
Neg logit contributions
 te-0.061
 Majesty-0.060
ott-0.057
 cabinet-0.056
 drawer-0.053
Token go
Token ID467
Feature activation-3.25e-02

Pos logit contributions
Rex+0.186
Mark+0.185
Big+0.177
Rick+0.174
By+0.172
Neg logit contributions
 te-0.217
ott-0.214
 Majesty-0.205
 cabinet-0.204
 Flor-0.191
Token away
Token ID1497
Feature activation-1.72e-02

Pos logit contributions
Mark+0.047
Rex+0.046
By+0.045
Josh+0.044
Big+0.044
Neg logit contributions
ott-0.055
 te-0.054
 cabinet-0.051
 Majesty-0.051
 Flor-0.048
Token an
Token ID281
Feature activation+2.87e-02

Pos logit contributions
ott+0.205
 te+0.203
 Majesty+0.193
 cabinet+0.191
 wardrobe+0.185
Neg logit contributions
Mark-0.145
Rex-0.143
By-0.136
Big-0.134
Rick-0.134
Token incredible
Token ID8082
Feature activation+5.43e-03

Pos logit contributions
ott+0.048
 Majesty+0.047
 te+0.047
 Flor+0.045
oppy+0.044
Neg logit contributions
 distance-0.039
Mark-0.038
Time-0.037
Big-0.037
Rex-0.037
Token sense
Token ID2565
Feature activation-1.23e-01

Pos logit contributions
ott+0.009
 te+0.009
 Majesty+0.009
 cabinet+0.008
 wardrobe+0.008
Neg logit contributions
Mark-0.008
Rex-0.008
By-0.008
Big-0.007
Rick-0.007
Token of
Token ID286
Feature activation-5.64e-02

Pos logit contributions
Mark+0.113
Rex+0.110
By+0.105
 distance+0.103
Big+0.103
Neg logit contributions
ott-0.269
 Majesty-0.263
 te-0.263
 cabinet-0.249
annah-0.245
Token content
Token ID2695
Feature activation-8.98e-02

Pos logit contributions
Mark+0.063
Rex+0.063
Big+0.060
By+0.059
Rick+0.058
Neg logit contributions
ott-0.113
 te-0.112
 Majesty-0.109
 cabinet-0.106
 Flor-0.103
Tokenment
Token ID434
Feature activation-2.87e-01

Pos logit contributions
Mark+0.113
Rex+0.113
Big+0.106
Rick+0.103
By+0.100
Neg logit contributions
 te-0.172
ott-0.171
 Majesty-0.169
 cabinet-0.163
 wardrobe-0.160
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
Mark+0.417
Rex+0.410
By+0.394
Big+0.394
Time+0.389
Neg logit contributions
ott-0.477
 te-0.467
 Majesty-0.451
 cabinet-0.434
annah-0.422
Token She
Token ID1375
Feature activation+4.63e-02

Pos logit contributions
What+0.015
Time+0.014
By+0.014
Kim+0.014
 distance+0.014
Neg logit contributions
 te-0.025
 Majesty-0.024
 wardrobe-0.022
ott-0.022
anda-0.022
Token said
Token ID531
Feature activation+2.98e-02

Pos logit contributions
ott+0.094
 Majesty+0.093
 te+0.090
 cabinet+0.090
annah+0.086
Neg logit contributions
By-0.048
Mark-0.047
Rex-0.046
Big-0.045
 distance-0.044
Token goodbye
Token ID24829
Feature activation-9.08e-02

Pos logit contributions
ott+0.059
 te+0.055
 Majesty+0.054
 cabinet+0.053
act+0.051
Neg logit contributions
By-0.034
Mark-0.031
Big-0.030
Time-0.029
What-0.029
Token to
Token ID284
Feature activation-1.99e-02

Pos logit contributions
Mark+0.102
Rex+0.100
By+0.100
 distance+0.098
Big+0.097
Neg logit contributions
ott-0.178
 te-0.173
 Majesty-0.170
 cabinet-0.165
oppy-0.158
Token gave
Token ID2921
Feature activation+7.01e-02

Pos logit contributions
ott+0.164
 Majesty+0.161
 te+0.158
annah+0.154
 cabinet+0.153
Neg logit contributions
Mark-0.090
By-0.087
Rex-0.086
Big-0.083
Rick-0.082
Token him
Token ID683
Feature activation-2.24e-02

Pos logit contributions
ott+0.141
 te+0.135
 Majesty+0.132
 cabinet+0.128
 Av+0.128
Neg logit contributions
Mark-0.080
Rex-0.078
By-0.077
Time-0.076
 distance-0.074
Token some
Token ID617
Feature activation-3.74e-03

Pos logit contributions
Mark+0.027
Rex+0.026
By+0.025
 distance+0.025
Time+0.025
Neg logit contributions
ott-0.043
 te-0.042
 Majesty-0.041
 cabinet-0.040
 Av-0.039
Token o
Token ID267
Feature activation+2.84e-02

Pos logit contributions
Mark+0.006
Rex+0.006
By+0.005
Big+0.005
 distance+0.005
Neg logit contributions
ott-0.006
 te-0.006
 Majesty-0.006
 cabinet-0.006
annah-0.005
Tokenint
Token ID600
Feature activation-7.54e-02

Pos logit contributions
ott+0.037
 Majesty+0.033
 te+0.033
oppy+0.032
annah+0.031
Neg logit contributions
 distance-0.055
Mark-0.053
What-0.052
Big-0.051
Time-0.051
Tokenment
Token ID434
Feature activation-2.87e-01

Pos logit contributions
Mark+0.090
Rex+0.087
By+0.085
 distance+0.084
Big+0.084
Neg logit contributions
ott-0.149
 te-0.144
 Majesty-0.142
 cabinet-0.136
 Flor-0.132
Token and
Token ID290
Feature activation-2.00e-02

Pos logit contributions
Mark+0.379
Rex+0.370
By+0.356
Big+0.355
Time+0.352
Neg logit contributions
ott-0.516
 te-0.505
 Majesty-0.489
 cabinet-0.471
annah-0.457
Token a
Token ID257
Feature activation-1.48e-02

Pos logit contributions
Mark+0.024
Rex+0.024
By+0.023
Big+0.023
 distance+0.022
Neg logit contributions
ott-0.038
 te-0.037
 Majesty-0.036
 cabinet-0.035
annah-0.034
Token hug
Token ID16225
Feature activation-3.18e-02

Pos logit contributions
Mark+0.023
Rex+0.022
By+0.021
Big+0.021
Time+0.021
Neg logit contributions
ott-0.024
 te-0.024
 Majesty-0.023
 cabinet-0.022
 Flor-0.020
Token.
Token ID13
Feature activation-8.21e-03

Pos logit contributions
Mark+0.055
Rex+0.054
Big+0.053
Time+0.053
By+0.052
Neg logit contributions
ott-0.042
 te-0.042
 cabinet-0.040
 Majesty-0.039
 Av-0.038
Token
Token ID198
Feature activation-1.13e-02

Pos logit contributions
Mark+0.010
Big+0.010
Kim+0.009
Rick+0.009
What+0.009
Neg logit contributions
 te-0.016
 Majesty-0.016
ott-0.015
anda-0.015
sie-0.014
TokenThe
Token ID464
Feature activation-1.04e-01

Pos logit contributions
Rex+0.008
Mark+0.008
 distance+0.008
Rick+0.007
 brake+0.007
Neg logit contributions
ott-0.031
 te-0.030
 Majesty-0.029
annah-0.028
 cabinet-0.028
Token princess
Token ID21752
Feature activation+8.61e-02

Pos logit contributions
Mark+0.175
Rex+0.172
By+0.168
Big+0.166
 distance+0.166
Neg logit contributions
ott-0.151
 te-0.145
 Majesty-0.142
 cabinet-0.134
annah-0.129
Token watched
Token ID7342
Feature activation-3.47e-02

Pos logit contributions
ott+0.173
 Majesty+0.168
 te+0.167
 cabinet+0.161
annah+0.157
Neg logit contributions
Mark-0.094
Rex-0.091
By-0.089
Big-0.087
 distance-0.085
Token with
Token ID351
Feature activation+1.13e-01

Pos logit contributions
Mark+0.042
Rex+0.041
By+0.039
Big+0.039
 distance+0.038
Neg logit contributions
ott-0.066
 te-0.066
 Majesty-0.064
 cabinet-0.061
 Flor-0.060
Token content
Token ID2695
Feature activation-8.87e-02

Pos logit contributions
 te+0.192
 Flor+0.183
ott+0.183
 Majesty+0.181
 cabinet+0.177
Neg logit contributions
Rex-0.160
Mark-0.155
Kim-0.153
Rick-0.149
By-0.149
Tokenment
Token ID434
Feature activation-2.86e-01

Pos logit contributions
Mark+0.086
Rex+0.081
Big+0.077
Rick+0.076
Kim+0.076
Neg logit contributions
 Majesty-0.193
 te-0.193
ott-0.191
 Flor-0.180
 majesty-0.180
Token,
Token ID11
Feature activation+5.37e-02

Pos logit contributions
Mark+0.415
Rex+0.408
By+0.394
Big+0.392
Time+0.389
Neg logit contributions
ott-0.476
 te-0.464
 Majesty-0.451
 cabinet-0.431
annah-0.419
Token but
Token ID475
Feature activation-3.53e-02

Pos logit contributions
 te+0.092
ott+0.092
 Majesty+0.089
 cabinet+0.084
 Flor+0.083
Neg logit contributions
Mark-0.075
Rex-0.074
By-0.070
Big-0.070
Rick-0.069
Token then
Token ID788
Feature activation-5.48e-02

Pos logit contributions
Mark+0.038
Rex+0.037
By+0.035
Big+0.035
 distance+0.035
Neg logit contributions
ott-0.072
 te-0.071
 Majesty-0.070
 cabinet-0.067
annah-0.065
Token she
Token ID673
Feature activation+8.42e-02

Pos logit contributions
Mark+0.080
Rex+0.078
By+0.076
Big+0.075
 distance+0.075
Neg logit contributions
ott-0.091
 te-0.088
 Majesty-0.086
 cabinet-0.082
annah-0.080
Token noticed
Token ID6810
Feature activation-1.34e-02

Pos logit contributions
ott+0.162
 te+0.159
 Majesty+0.154
 cabinet+0.147
 Av+0.144
Neg logit contributions
Mark-0.101
Rex-0.098
Big-0.093
Time-0.093
 distance-0.092
Token and
Token ID290
Feature activation-1.90e-02

Pos logit contributions
ott+0.044
 te+0.043
 Majesty+0.041
 cabinet+0.040
 Flor+0.039
Neg logit contributions
Mark-0.034
Rex-0.033
By-0.032
Big-0.032
Time-0.032
Token enjoyed
Token ID8359
Feature activation-7.01e-02

Pos logit contributions
Mark+0.026
Rex+0.025
By+0.024
Big+0.024
Time+0.024
Neg logit contributions
ott-0.033
 te-0.033
 Majesty-0.032
 cabinet-0.031
 Flor-0.030
Token the
Token ID262
Feature activation-4.32e-02

Pos logit contributions
Mark+0.083
Rex+0.082
Big+0.075
 distance+0.075
By+0.075
Neg logit contributions
ott-0.135
 Majesty-0.134
 te-0.134
 cabinet-0.125
 Av-0.124
Token cool
Token ID3608
Feature activation+4.75e-02

Pos logit contributions
Mark+0.077
Rex+0.074
Rick+0.071
Big+0.071
Josh+0.070
Neg logit contributions
 te-0.065
ott-0.063
 Majesty-0.060
 cabinet-0.057
 Flor-0.055
Token refresh
Token ID14976
Feature activation-3.90e-02

Pos logit contributions
 te+0.078
ott+0.076
 Majesty+0.075
 cabinet+0.072
 Flor+0.068
Neg logit contributions
Mark-0.074
Rex-0.072
Big-0.069
By-0.068
Rick-0.068
Tokenment
Token ID434
Feature activation-2.86e-01

Pos logit contributions
Mark+0.030
Rex+0.028
By+0.026
Big+0.026
 distance+0.025
Neg logit contributions
ott-0.093
 te-0.091
 Majesty-0.091
 cabinet-0.087
annah-0.085
Token.
Token ID13
Feature activation-1.12e-02

Pos logit contributions
Mark+0.400
Rex+0.392
By+0.378
Big+0.376
Time+0.373
Neg logit contributions
ott-0.490
 te-0.478
 Majesty-0.466
 cabinet-0.445
annah-0.433
Token The
Token ID383
Feature activation-4.01e-02

Pos logit contributions
Mark+0.014
Rex+0.014
By+0.013
Big+0.013
Time+0.013
Neg logit contributions
 te-0.021
ott-0.021
 Majesty-0.020
 cabinet-0.020
 wardrobe-0.019
Token traveller
Token ID49130
Feature activation+1.10e-02

Pos logit contributions
Mark+0.078
Josh+0.070
Kim+0.068
Rick+0.068
Rex+0.067
Neg logit contributions
ott-0.059
 te-0.057
 Majesty-0.055
 wardrobe-0.052
 cabinet-0.051
Token was
Token ID373
Feature activation+2.82e-02

Pos logit contributions
ott+0.021
 te+0.021
 Majesty+0.020
 cabinet+0.019
annah+0.019
Neg logit contributions
Mark-0.013
Rex-0.013
Big-0.012
By-0.012
Time-0.012
Token happy
Token ID3772
Feature activation-2.46e-02

Pos logit contributions
 te+0.037
ott+0.037
 Majesty+0.036
 Flor+0.034
 Av+0.033
Neg logit contributions
Rex-0.051
Mark-0.050
Run-0.047
Big-0.047
Rick-0.047
Token.
Token ID13
Feature activation-6.80e-03

Pos logit contributions
Mark+0.066
Rex+0.065
By+0.063
Big+0.063
 distance+0.062
Neg logit contributions
ott-0.082
 te-0.080
 Majesty-0.079
 cabinet-0.074
annah-0.072
Token
Token ID198
Feature activation-6.48e-03

Pos logit contributions
What+0.006
Mark+0.006
Kim+0.005
Big+0.005
By+0.005
Neg logit contributions
 Majesty-0.016
 te-0.016
ott-0.015
 majesty-0.015
oppy-0.014
Token
Token ID198
Feature activation-7.54e-03

Pos logit contributions
Mark+0.008
Rex+0.008
By+0.008
 distance+0.008
Big+0.008
Neg logit contributions
ott-0.012
 te-0.011
 Majesty-0.011
 cabinet-0.011
annah-0.010
TokenL
Token ID43
Feature activation+1.15e-01

Pos logit contributions
Mark+0.008
What+0.008
Rex+0.008
By+0.008
Big+0.008
Neg logit contributions
 Majesty-0.016
ott-0.016
 te-0.016
 cabinet-0.015
 Flor-0.015
Tokenily
Token ID813
Feature activation+8.08e-02

Pos logit contributions
 Majesty+0.189
 te+0.171
 majesty+0.158
 collection+0.155
 bur+0.154
Neg logit contributions
Time-0.187
Mark-0.170
Kim-0.168
What-0.168
By-0.164
Token's
Token ID338
Feature activation+1.45e-01

Pos logit contributions
ott+0.151
 Majesty+0.150
 te+0.148
 cabinet+0.143
 Flor+0.139
Neg logit contributions
Mark-0.099
Rex-0.098
Big-0.096
By-0.095
 distance-0.093
Token mom
Token ID1995
Feature activation-5.99e-02

Pos logit contributions
ott+0.221
 te+0.212
 Majesty+0.210
 cabinet+0.202
 wardrobe+0.197
Neg logit contributions
Mark-0.233
Rex-0.228
Time-0.222
By-0.222
Big-0.220
Tokenmy
Token ID1820
Feature activation-2.81e-02

Pos logit contributions
Rex+0.089
Mark+0.088
By+0.088
 distance+0.085
Big+0.083
Neg logit contributions
ott-0.099
 te-0.093
 cabinet-0.087
 wardrobe-0.085
 Majesty-0.084
Token came
Token ID1625
Feature activation+2.85e-02

Pos logit contributions
Mark+0.041
Rex+0.040
By+0.039
 distance+0.038
Time+0.038
Neg logit contributions
ott-0.047
 te-0.045
 Majesty-0.042
 cabinet-0.041
 wardrobe-0.041
Token outside
Token ID2354
Feature activation+1.29e-01

Pos logit contributions
ott+0.042
 te+0.041
 cabinet+0.038
 Majesty+0.037
 wardrobe+0.037
Neg logit contributions
Mark-0.046
By-0.046
Rex-0.046
Big-0.044
Rick-0.043
Token and
Token ID290
Feature activation-1.85e-02

Pos logit contributions
 te+0.206
ott+0.204
 cabinet+0.190
 Av+0.182
 Flor+0.181
Neg logit contributions
Rex-0.199
Mark-0.197
Rick-0.191
Big-0.189
By-0.189
Token.
Token ID13
Feature activation-8.75e-04

Pos logit contributions
 Majesty+0.264
ott+0.263
 te+0.262
 cabinet+0.245
annah+0.241
Neg logit contributions
Mark-0.196
Rex-0.193
Big-0.186
 distance-0.185
By-0.184
Token She
Token ID1375
Feature activation+5.15e-02

Pos logit contributions
Mark+0.001
What+0.001
Time+0.001
By+0.001
Kim+0.001
Neg logit contributions
 Majesty-0.001
ott-0.001
 majesty-0.001
 te-0.001
annah-0.001
Token loved
Token ID6151
Feature activation+4.73e-02

Pos logit contributions
 Majesty+0.090
ott+0.088
 te+0.086
 cabinet+0.082
annah+0.082
Neg logit contributions
Mark-0.071
Rex-0.071
By-0.069
Big-0.068
Time-0.066
Token to
Token ID284
Feature activation-1.76e-02

Pos logit contributions
ott+0.089
 Majesty+0.087
 te+0.087
annah+0.082
 cabinet+0.081
Neg logit contributions
Mark-0.057
Rex-0.056
Big-0.054
By-0.053
Time-0.053
Token play
Token ID711
Feature activation+4.29e-02

Pos logit contributions
Mark+0.021
Rex+0.020
Big+0.020
By+0.020
 distance+0.020
Neg logit contributions
ott-0.033
 Majesty-0.032
 te-0.032
annah-0.030
 cabinet-0.030
Token outside
Token ID2354
Feature activation+1.31e-01

Pos logit contributions
 Majesty+0.079
ott+0.076
annah+0.073
 majesty+0.071
anda+0.071
Neg logit contributions
Rex-0.056
What-0.056
Mark-0.055
By-0.055
Thunder-0.051
Token with
Token ID351
Feature activation+1.13e-01

Pos logit contributions
 Majesty+0.272
ott+0.266
 majesty+0.249
annah+0.247
 te+0.246
Neg logit contributions
Mark-0.142
Rex-0.137
 distance-0.136
Big-0.134
What-0.134
Token her
Token ID607
Feature activation+7.72e-02

Pos logit contributions
 Majesty+0.209
ott+0.197
annah+0.193
 te+0.183
 majesty+0.179
Neg logit contributions
Big-0.155
Mark-0.155
What-0.154
Rex-0.151
Kim-0.149
Token toys
Token ID14958
Feature activation+1.48e-02

Pos logit contributions
 Majesty+0.119
ott+0.117
 te+0.114
annah+0.112
 Av+0.108
Neg logit contributions
Rex-0.122
Mark-0.120
Big-0.117
By-0.116
Kim-0.113
Token.
Token ID13
Feature activation-2.71e-03

Pos logit contributions
 Majesty+0.032
ott+0.030
 majesty+0.029
 te+0.029
annah+0.028
Neg logit contributions
 distance-0.015
Rex-0.014
By-0.014
Mark-0.014
Big-0.013
Token One
Token ID1881
Feature activation+4.55e-02

Pos logit contributions
What+0.003
Mark+0.003
Kim+0.003
 moment+0.003
Time+0.003
Neg logit contributions
 Majesty-0.006
ott-0.005
 majesty-0.005
onel-0.005
annah-0.005
Token,
Token ID11
Feature activation+5.99e-02

Pos logit contributions
ott+0.069
annah+0.062
 te+0.061
 Majesty+0.061
 cabinet+0.061
Neg logit contributions
By-0.041
Rex-0.041
Mark-0.041
Big-0.037
Time-0.037
Token "
Token ID366
Feature activation+2.08e-02

Pos logit contributions
 te+0.070
ott+0.067
 Majesty+0.064
 Flor+0.063
 cabinet+0.062
Neg logit contributions
Mark-0.115
 distance-0.114
Rex-0.111
Time-0.109
Rick-0.109
TokenThank
Token ID10449
Feature activation-8.37e-02

Pos logit contributions
ott+0.037
 te+0.036
 Av+0.035
 Flor+0.033
 Majesty+0.033
Neg logit contributions
 injuries-0.025
 breath-0.024
 brewing-0.024
 distance-0.024
 pounding-0.023
Token you
Token ID345
Feature activation+1.41e-01

Pos logit contributions
 distance+0.172
Mark+0.169
Big+0.165
Rick+0.165
Rex+0.164
Neg logit contributions
ott-0.087
 te-0.082
 Majesty-0.074
 Av-0.073
 Flor-0.071
Token,
Token ID11
Feature activation+5.81e-02

Pos logit contributions
ott+0.188
 te+0.184
 Majesty+0.180
 cabinet+0.172
 Av+0.164
Neg logit contributions
Mark-0.248
Rex-0.243
 distance-0.239
By-0.238
Time-0.237
Token Lily
Token ID20037
Feature activation+1.47e-01

Pos logit contributions
ott+0.062
 Majesty+0.057
 te+0.055
 cabinet+0.050
oppy+0.049
Neg logit contributions
Mark-0.122
Rex-0.122
By-0.119
Time-0.117
Kim-0.116
Token!
Token ID0
Feature activation-1.34e-02

Pos logit contributions
ott+0.213
 te+0.210
 Majesty+0.204
 cabinet+0.197
 Av+0.185
Neg logit contributions
Mark-0.237
Rex-0.233
 distance-0.232
Big-0.232
By-0.231
Token You
Token ID921
Feature activation+7.05e-02

Pos logit contributions
Rex+0.023
Mark+0.023
By+0.022
Time+0.022
Big+0.022
Neg logit contributions
ott-0.019
 te-0.018
 Majesty-0.018
 cabinet-0.017
annah-0.017
Token are
Token ID389
Feature activation-4.01e-02

Pos logit contributions
 te+0.113
ott+0.110
 Majesty+0.105
 Av+0.101
 Flor+0.096
Neg logit contributions
Mark-0.107
Kim-0.105
Big-0.104
 distance-0.104
Rex-0.103
Token a
Token ID257
Feature activation-1.42e-02

Pos logit contributions
Mark+0.060
Rex+0.060
 distance+0.059
Time+0.058
Rick+0.058
Neg logit contributions
 te-0.063
ott-0.061
 Majesty-0.061
 Av-0.059
 Flor-0.058
Token good
Token ID922
Feature activation+2.22e-02

Pos logit contributions
Rex+0.020
By+0.020
Mark+0.020
Big+0.019
Time+0.019
Neg logit contributions
ott-0.025
 te-0.024
annah-0.023
 Majesty-0.023
 cabinet-0.022
Token
Token ID198
Feature activation+1.33e-03

Pos logit contributions
ott+0.021
 te+0.019
 Majesty+0.019
 wardrobe+0.019
act+0.019
Neg logit contributions
Mark-0.013
Rex-0.013
Big-0.013
By-0.013
What-0.013
Token
Token ID198
Feature activation-1.61e-03

Pos logit contributions
ott+0.003
 te+0.003
 Majesty+0.003
 cabinet+0.002
annah+0.002
Neg logit contributions
Mark-0.001
Rex-0.001
By-0.001
 distance-0.001
Big-0.001
Token"
Token ID1
Feature activation-3.53e-02

Pos logit contributions
Mark+0.002
Rex+0.002
By+0.001
Big+0.001
Time+0.001
Neg logit contributions
ott-0.003
 te-0.003
 Majesty-0.003
 cabinet-0.003
 Flor-0.003
TokenCome
Token ID16773
Feature activation-1.01e-01

Pos logit contributions
Mark+0.033
 distance+0.031
Rex+0.031
Rick+0.030
Big+0.030
Neg logit contributions
ott-0.075
 te-0.074
 Majesty-0.069
 Flor-0.069
 Av-0.069
Token,
Token ID11
Feature activation+6.26e-02

Pos logit contributions
Mark+0.120
Big+0.112
 distance+0.111
By+0.108
Thunder+0.107
Neg logit contributions
 te-0.197
ott-0.191
 Majesty-0.184
 cabinet-0.182
 Av-0.180
Token my
Token ID616
Feature activation+1.52e-01

Pos logit contributions
ott+0.092
 te+0.087
 Majesty+0.086
 cabinet+0.081
annah+0.079
Neg logit contributions
Mark-0.104
Rex-0.103
By-0.100
Big-0.099
 distance-0.098
Token children
Token ID1751
Feature activation-8.32e-02

Pos logit contributions
ott+0.248
 te+0.220
annah+0.218
 Majesty+0.218
 cabinet+0.205
Neg logit contributions
Rex-0.226
Mark-0.219
By-0.217
Time-0.208
What-0.206
Token.
Token ID13
Feature activation+7.87e-03

Pos logit contributions
Mark+0.115
Rex+0.115
By+0.110
 distance+0.109
Big+0.107
Neg logit contributions
ott-0.145
 Majesty-0.137
 te-0.135
 cabinet-0.128
 Flor-0.125
Token I
Token ID314
Feature activation+1.37e-01

Pos logit contributions
ott+0.012
 te+0.011
 Majesty+0.011
annah+0.011
 cabinet+0.011
Neg logit contributions
Rex-0.013
Mark-0.012
By-0.012
Big-0.012
What-0.012
Token have
Token ID423
Feature activation-7.76e-02

Pos logit contributions
ott+0.250
 te+0.247
 Majesty+0.237
 cabinet+0.227
 Flor+0.222
Neg logit contributions
Mark-0.178
Rex-0.172
Big-0.167
By-0.163
Rick-0.162
Token soup
Token ID17141
Feature activation+1.62e-06

Pos logit contributions
Rex+0.088
Mark+0.088
By+0.084
Big+0.083
Time+0.083
Neg logit contributions
ott-0.154
 Majesty-0.146
 te-0.145
 cabinet-0.141
annah-0.138
Token hello
Token ID23748
Feature activation-4.29e-02

Pos logit contributions
Mark+0.111
Rex+0.109
By+0.106
 distance+0.106
Big+0.105
Neg logit contributions
ott-0.110
 te-0.108
 Majesty-0.107
 cabinet-0.100
 Flor-0.097
Token to
Token ID284
Feature activation-1.73e-02

Pos logit contributions
Mark+0.061
Rex+0.060
By+0.058
Big+0.058
 distance+0.058
Neg logit contributions
ott-0.071
 te-0.070
 Majesty-0.069
 cabinet-0.065
 Flor-0.063
Token the
Token ID262
Feature activation-3.93e-02

Pos logit contributions
Mark+0.017
Rex+0.016
Big+0.015
By+0.015
What+0.015
Neg logit contributions
ott-0.038
 te-0.037
 Majesty-0.036
 cabinet-0.035
annah-0.035
Token elephant
Token ID20950
Feature activation+4.25e-02

Pos logit contributions
Mark+0.052
 distance+0.051
Rex+0.050
Big+0.049
By+0.048
Neg logit contributions
ott-0.072
 te-0.069
annah-0.067
oppy-0.065
 Majesty-0.065
Token and
Token ID290
Feature activation-2.06e-02

Pos logit contributions
ott+0.080
 te+0.073
 Majesty+0.073
annah+0.069
 cabinet+0.069
Neg logit contributions
Mark-0.053
Rex-0.052
 distance-0.050
By-0.050
Big-0.049
Token show
Token ID905
Feature activation+1.41e-01

Pos logit contributions
Mark+0.026
Rex+0.026
By+0.025
Big+0.025
 distance+0.025
Neg logit contributions
ott-0.038
 te-0.036
 Majesty-0.036
 cabinet-0.034
annah-0.034
Token it
Token ID340
Feature activation+4.48e-03

Pos logit contributions
ott+0.265
 te+0.248
 Majesty+0.240
 cabinet+0.237
oppy+0.226
Neg logit contributions
Mark-0.185
Rex-0.179
Big-0.174
By-0.170
Time-0.167
Token that
Token ID326
Feature activation+4.92e-02

Pos logit contributions
ott+0.008
 te+0.008
 Majesty+0.008
 cabinet+0.008
annah+0.007
Neg logit contributions
Mark-0.006
Rex-0.006
By-0.005
Big-0.005
 distance-0.005
Token we
Token ID356
Feature activation+6.95e-02

Pos logit contributions
ott+0.088
 te+0.084
 Majesty+0.083
annah+0.079
 cabinet+0.078
Neg logit contributions
Mark-0.067
Rex-0.065
Big-0.063
By-0.063
 distance-0.062
Token're
Token ID821
Feature activation-5.54e-02

Pos logit contributions
ott+0.140
 te+0.136
 Majesty+0.134
annah+0.129
 cabinet+0.128
Neg logit contributions
Mark-0.074
By-0.073
Rex-0.073
Big-0.070
 distance-0.070
Token friendly
Token ID8030
Feature activation-5.01e-02

Pos logit contributions
Mark+0.066
 distance+0.064
Rex+0.064
By+0.063
Big+0.063
Neg logit contributions
ott-0.108
 te-0.100
 Majesty-0.099
 cabinet-0.096
oppy-0.094
Token
Token ID198
Feature activation-6.35e-03

Pos logit contributions
Mark+0.006
Rex+0.006
By+0.006
 distance+0.006
 brake+0.006
Neg logit contributions
ott-0.011
 te-0.011
 Majesty-0.010
 cabinet-0.010
 Flor-0.010
TokenFrom
Token ID4863
Feature activation-3.92e-04

Pos logit contributions
 brake+0.007
Rex+0.006
 distance+0.006
Mark+0.006
Rick+0.006
Neg logit contributions
ott-0.013
 te-0.012
 Majesty-0.011
 cabinet-0.011
 Av-0.011
Token that
Token ID326
Feature activation+4.65e-02

Pos logit contributions
Mark+0.001
Rex+0.001
By+0.001
Time+0.001
Rick+0.001
Neg logit contributions
ott-0.001
 te-0.001
 Majesty-0.001
 cabinet-0.001
 Flor-0.001
Token day
Token ID1110
Feature activation-6.26e-02

Pos logit contributions
ott+0.076
 te+0.076
 cabinet+0.074
 wardrobe+0.069
 Av+0.069
Neg logit contributions
Rex-0.068
Mark-0.067
Rick-0.065
By-0.064
Kim-0.064
Token on
Token ID319
Feature activation-6.83e-02

Pos logit contributions
Mark+0.063
Rex+0.061
 distance+0.058
Thunder+0.057
Time+0.056
Neg logit contributions
ott-0.129
 te-0.127
 cabinet-0.123
oppy-0.121
 Majesty-0.121
Token,
Token ID11
Feature activation+5.34e-02

Pos logit contributions
Mark+0.091
Rex+0.087
Big+0.085
 distance+0.083
By+0.083
Neg logit contributions
ott-0.123
 Majesty-0.119
 te-0.114
annah-0.105
 majesty-0.104
Token Mia
Token ID32189
Feature activation+5.50e-05

Pos logit contributions
ott+0.079
 Majesty+0.075
 te+0.074
annah+0.069
 cabinet+0.069
Neg logit contributions
Mark-0.089
Rex-0.086
By-0.085
Big-0.084
Time-0.083
Token and
Token ID290
Feature activation-2.13e-02

Pos logit contributions
ott+0.000
 te+0.000
 Majesty+0.000
 cabinet+0.000
annah+0.000
Neg logit contributions
Mark-0.000
Rex-0.000
By-0.000
Big-0.000
 distance-0.000
Token her
Token ID607
Feature activation+7.72e-02

Pos logit contributions
Mark+0.043
Rex+0.043
By+0.042
Big+0.042
Time+0.041
Neg logit contributions
ott-0.023
 te-0.022
 Majesty-0.021
 cabinet-0.020
 Av-0.020
Token dad
Token ID9955
Feature activation-8.79e-02

Pos logit contributions
ott+0.124
 te+0.121
 Majesty+0.119
 cabinet+0.112
annah+0.109
Neg logit contributions
Mark-0.116
Rex-0.115
By-0.112
 distance-0.110
Big-0.110
Token read
Token ID1100
Feature activation-5.65e-03

Pos logit contributions
Mark+0.111
Rex+0.111
Time+0.105
Big+0.105
By+0.105
Neg logit contributions
ott-0.161
 te-0.155
 Majesty-0.153
 cabinet-0.147
 Flor-0.142
Token It
Token ID632
Feature activation-4.47e-02

Pos logit contributions
Time+0.011
What+0.011
Mark+0.010
Kim+0.010
When+0.009
Neg logit contributions
oppy-0.020
 Majesty-0.020
ott-0.019
anda-0.019
act-0.018
Token was
Token ID373
Feature activation+2.54e-02

Pos logit contributions
Mark+0.047
Rex+0.044
Big+0.043
By+0.043
Time+0.042
Neg logit contributions
ott-0.093
 Majesty-0.091
 te-0.089
annah-0.086
 cabinet-0.085
Token very
Token ID845
Feature activation+7.07e-02

Pos logit contributions
 te+0.040
ott+0.038
 cabinet+0.037
 Majesty+0.035
 Av+0.034
Neg logit contributions
Rex-0.041
Mark-0.040
By-0.039
Rick-0.038
Big-0.037
Token expensive
Token ID5789
Feature activation+3.20e-02

Pos logit contributions
ott+0.128
 Majesty+0.126
 te+0.121
 Flor+0.115
oppy+0.115
Neg logit contributions
Mark-0.090
By-0.089
Time-0.088
Big-0.088
 distance-0.086
Token and
Token ID290
Feature activation-2.07e-02

Pos logit contributions
ott+0.069
 Majesty+0.067
 te+0.064
oppy+0.063
 majesty+0.062
Neg logit contributions
 distance-0.029
Mark-0.029
Rex-0.027
What-0.026
Time-0.026
Token much
Token ID881
Feature activation-1.07e-02

Pos logit contributions
Mark+0.028
Rex+0.027
By+0.027
Big+0.026
 distance+0.026
Neg logit contributions
ott-0.036
 te-0.035
 Majesty-0.035
 cabinet-0.033
annah-0.032
Token bigger
Token ID5749
Feature activation+5.81e-02

Pos logit contributions
Mark+0.014
Rex+0.014
By+0.013
 distance+0.013
Time+0.013
Neg logit contributions
ott-0.019
 Majesty-0.019
 te-0.018
annah-0.017
 cabinet-0.017
Token than
Token ID621
Feature activation-1.18e-01

Pos logit contributions
ott+0.115
 te+0.109
 Majesty+0.107
 cabinet+0.102
annah+0.102
Neg logit contributions
Mark-0.067
Rex-0.066
By-0.063
 distance-0.063
Time-0.061
Token the
Token ID262
Feature activation-4.60e-02

Pos logit contributions
Mark+0.161
 distance+0.152
Rex+0.151
Time+0.148
By+0.147
Neg logit contributions
ott-0.209
 Majesty-0.201
 te-0.200
oppy-0.184
 cabinet-0.182
Token old
Token ID1468
Feature activation+9.07e-02

Pos logit contributions
Mark+0.073
Rex+0.072
By+0.070
Big+0.069
Time+0.069
Neg logit contributions
ott-0.069
 te-0.068
 Majesty-0.066
 cabinet-0.063
annah-0.060
Token one
Token ID530
Feature activation+3.51e-02

Pos logit contributions
ott+0.146
 Majesty+0.138
 te+0.136
oppy+0.130
annah+0.127
Neg logit contributions
Mark-0.137
 distance-0.132
Rex-0.131
By-0.129
Time-0.129
Token much
Token ID881
Feature activation-6.12e-04

Pos logit contributions
ott+0.152
 te+0.148
 Majesty+0.146
 cabinet+0.140
annah+0.136
Neg logit contributions
Mark-0.091
Rex-0.089
By-0.087
Big-0.085
 distance-0.085
Token that
Token ID326
Feature activation+5.67e-02

Pos logit contributions
Mark+0.001
Rex+0.001
By+0.001
 distance+0.001
Big+0.001
Neg logit contributions
ott-0.001
 te-0.001
 Majesty-0.001
 cabinet-0.001
annah-0.001
Token it
Token ID340
Feature activation+8.11e-03

Pos logit contributions
ott+0.122
 te+0.118
 Majesty+0.116
 cabinet+0.111
annah+0.111
Neg logit contributions
Mark-0.054
Rex-0.052
By-0.051
Big-0.050
Time-0.050
Token stopped
Token ID5025
Feature activation-3.32e-02

Pos logit contributions
ott+0.017
 Majesty+0.016
 te+0.016
annah+0.016
 cabinet+0.015
Neg logit contributions
Mark-0.007
By-0.007
Rex-0.007
 distance-0.007
Big-0.007
Token roaring
Token ID43170
Feature activation-1.57e-01

Pos logit contributions
Mark+0.046
Rex+0.045
By+0.044
Big+0.043
 distance+0.043
Neg logit contributions
ott-0.058
 te-0.056
 Majesty-0.055
 cabinet-0.052
annah-0.051
Token and
Token ID290
Feature activation-4.87e-03

Pos logit contributions
Mark+0.236
Rex+0.232
Rick+0.225
Kim+0.224
By+0.224
Neg logit contributions
 te-0.247
ott-0.245
 Majesty-0.237
 cabinet-0.229
 Flor-0.220
Token ate
Token ID15063
Feature activation+8.24e-03

Pos logit contributions
Mark+0.006
Rex+0.006
By+0.006
Big+0.006
Time+0.006
Neg logit contributions
ott-0.009
 te-0.009
 Majesty-0.009
 cabinet-0.008
 Flor-0.008
Token it
Token ID340
Feature activation+8.05e-03

Pos logit contributions
ott+0.019
 Majesty+0.018
 te+0.018
 cabinet+0.017
annah+0.017
Neg logit contributions
Rex-0.007
Mark-0.007
By-0.006
 distance-0.006
Big-0.006
Token all
Token ID477
Feature activation+1.24e-01

Pos logit contributions
ott+0.015
 Majesty+0.014
 te+0.014
 cabinet+0.013
annah+0.013
Neg logit contributions
Mark-0.010
 distance-0.010
Rex-0.010
By-0.010
Big-0.010
Token.
Token ID13
Feature activation+2.79e-03

Pos logit contributions
ott+0.208
 te+0.201
 cabinet+0.189
 Majesty+0.188
 Av+0.183
Neg logit contributions
Rex-0.181
Mark-0.175
Rick-0.171
By-0.169
Big-0.167
Token It
Token ID632
Feature activation-3.60e-02

Pos logit contributions
ott+0.006
 te+0.005
 Majesty+0.005
 wardrobe+0.005
 cabinet+0.005
Neg logit contributions
Mark-0.003
Big-0.003
By-0.003
Rex-0.003
What-0.003
Token named
Token ID3706
Feature activation+1.15e-01

Pos logit contributions
 Majesty+0.049
ott+0.043
annah+0.041
 majesty+0.041
 te+0.041
Neg logit contributions
Rex-0.050
Mark-0.048
 distance-0.047
Big-0.046
Kim-0.046
Token Lucy
Token ID22162
Feature activation+7.18e-02

Pos logit contributions
 Majesty+0.203
ott+0.189
act+0.181
 te+0.179
 penn+0.171
Neg logit contributions
Mark-0.175
Kim-0.166
By-0.164
What-0.156
Rex-0.155
Token.
Token ID13
Feature activation+2.92e-03

Pos logit contributions
ott+0.119
 te+0.116
 Majesty+0.113
 cabinet+0.108
annah+0.104
Neg logit contributions
Mark-0.104
Rex-0.103
By-0.100
Big-0.099
 distance-0.098
Token She
Token ID1375
Feature activation+5.23e-02

Pos logit contributions
 Majesty+0.004
ott+0.004
 te+0.003
oppy+0.003
annah+0.003
Neg logit contributions
Mark-0.006
By-0.005
Rex-0.005
Time-0.005
What-0.005
Token loved
Token ID6151
Feature activation+5.24e-02

Pos logit contributions
 Majesty+0.093
ott+0.091
 te+0.088
annah+0.084
 cabinet+0.083
Neg logit contributions
Mark-0.072
Rex-0.070
By-0.069
Big-0.067
Time-0.065
Token to
Token ID284
Feature activation-1.29e-02

Pos logit contributions
ott+0.098
 Majesty+0.095
 te+0.095
 cabinet+0.089
annah+0.089
Neg logit contributions
Mark-0.065
Rex-0.064
Big-0.062
By-0.061
Time-0.061
Token play
Token ID711
Feature activation+4.50e-02

Pos logit contributions
Mark+0.015
Rex+0.014
Big+0.014
 distance+0.014
By+0.014
Neg logit contributions
ott-0.025
 Majesty-0.024
 te-0.024
annah-0.023
anda-0.023
Token with
Token ID351
Feature activation+1.16e-01

Pos logit contributions
 Majesty+0.086
ott+0.084
anda+0.080
annah+0.080
 majesty+0.079
Neg logit contributions
What-0.056
Rex-0.055
Mark-0.055
By-0.055
Thunder-0.051
Token her
Token ID607
Feature activation+7.98e-02

Pos logit contributions
 Majesty+0.223
ott+0.217
annah+0.204
anda+0.196
 te+0.195
Neg logit contributions
Mark-0.152
Big-0.142
Rex-0.140
What-0.136
Kim-0.135
Token toy
Token ID13373
Feature activation+1.40e-01

Pos logit contributions
 Majesty+0.121
ott+0.118
annah+0.111
 te+0.110
 Av+0.108
Neg logit contributions
Rex-0.129
Mark-0.127
Big-0.123
By-0.121
 distance-0.119
Token figures
Token ID5538
Feature activation+1.23e-03

Pos logit contributions
 Majesty+0.284
annah+0.282
ott+0.273
 majesty+0.258
 Av+0.257
Neg logit contributions
Rex-0.155
Big-0.146
 end-0.136
 clouds-0.134
Mark-0.134
Token his
Token ID465
Feature activation-7.83e-03

Pos logit contributions
Mark+0.161
Rex+0.160
Time+0.157
By+0.154
Big+0.149
Neg logit contributions
ott-0.299
 te-0.298
 Majesty-0.292
 Flor-0.285
 Av-0.281
Token cart
Token ID6383
Feature activation-6.49e-03

Pos logit contributions
 distance+0.012
Rex+0.012
Mark+0.012
Big+0.012
By+0.012
Neg logit contributions
ott-0.012
 te-0.011
 Majesty-0.011
annah-0.010
 Av-0.010
Token,
Token ID11
Feature activation+5.29e-02

Pos logit contributions
Mark+0.009
Rex+0.009
By+0.008
Big+0.008
 distance+0.008
Neg logit contributions
ott-0.011
 te-0.011
 Majesty-0.011
 cabinet-0.010
annah-0.010
Token but
Token ID475
Feature activation-3.89e-02

Pos logit contributions
ott+0.091
 te+0.090
 Majesty+0.087
 cabinet+0.083
 Flor+0.081
Neg logit contributions
Mark-0.073
Rex-0.072
By-0.069
Big-0.068
Time-0.068
Token as
Token ID355
Feature activation-1.07e-01

Pos logit contributions
Mark+0.046
Rex+0.045
Big+0.043
By+0.043
Time+0.042
Neg logit contributions
ott-0.075
 te-0.073
 Majesty-0.072
 cabinet-0.069
 Flor-0.068
Token he
Token ID339
Feature activation-3.10e-03

Pos logit contributions
Mark+0.137
Rex+0.137
By+0.130
Time+0.129
 distance+0.129
Neg logit contributions
ott-0.193
 te-0.188
 Majesty-0.186
 cabinet-0.178
 Flor-0.177
Token rode
Token ID22075
Feature activation-1.46e-02

Pos logit contributions
Mark+0.004
Rex+0.004
Big+0.004
Time+0.004
 distance+0.004
Neg logit contributions
ott-0.006
 te-0.005
 Majesty-0.005
 cabinet-0.005
 Flor-0.005
Token away
Token ID1497
Feature activation-1.93e-02

Pos logit contributions
Time+0.020
Mark+0.020
Rex+0.020
Thunder+0.019
By+0.019
Neg logit contributions
ott-0.023
 te-0.023
 Majesty-0.023
 Flor-0.023
 cabinet-0.022
Token,
Token ID11
Feature activation+5.43e-02

Pos logit contributions
Mark+0.028
Rex+0.028
By+0.027
Big+0.027
 distance+0.027
Neg logit contributions
ott-0.032
 te-0.031
 Majesty-0.030
 cabinet-0.029
annah-0.028
Token he
Token ID339
Feature activation-1.43e-03

Pos logit contributions
ott+0.094
 te+0.092
 Majesty+0.091
 Flor+0.088
 cabinet+0.088
Neg logit contributions
Rex-0.072
Mark-0.072
By-0.068
Big-0.067
Time-0.067
Token could
Token ID714
Feature activation-2.04e-02

Pos logit contributions
Mark+0.002
Rex+0.002
Time+0.002
Big+0.002
Kim+0.002
Neg logit contributions
ott-0.003
 te-0.003
 Majesty-0.003
 cabinet-0.002
 Flor-0.002
Token a
Token ID257
Feature activation-1.48e-02

Pos logit contributions
Mark+0.086
Rex+0.083
By+0.081
Big+0.080
Time+0.079
Neg logit contributions
ott-0.171
 Majesty-0.167
 te-0.166
 cabinet-0.159
annah-0.157
Token big
Token ID1263
Feature activation+1.24e-02

Pos logit contributions
 moment+0.019
 distance+0.018
Time+0.017
Big+0.016
 loud+0.016
Neg logit contributions
annah-0.026
ott-0.026
 Majesty-0.025
 Flor-0.024
 Av-0.024
Token party
Token ID2151
Feature activation+1.32e-01

Pos logit contributions
ott+0.021
 Flor+0.019
 Majesty+0.019
annah+0.019
 Av+0.018
Neg logit contributions
 distance-0.017
 moment-0.017
 end-0.016
 flash-0.016
 breath-0.016
Token with
Token ID351
Feature activation+1.13e-01

Pos logit contributions
ott+0.220
 te+0.215
 Majesty+0.209
 cabinet+0.201
annah+0.193
Neg logit contributions
Mark-0.190
Rex-0.187
By-0.181
Big-0.180
 distance-0.178
Token her
Token ID607
Feature activation+7.93e-02

Pos logit contributions
 Majesty+0.213
ott+0.205
oppy+0.195
igg+0.189
annah+0.189
Neg logit contributions
Mark-0.148
What-0.141
Rex-0.139
Kim-0.138
Big-0.135
Token friends
Token ID2460
Feature activation-2.47e-02

Pos logit contributions
ott+0.138
 Majesty+0.132
annah+0.127
 te+0.126
igg+0.120
Neg logit contributions
Mark-0.105
Rex-0.105
Big-0.100
By-0.100
 distance-0.099
Token.
Token ID13
Feature activation-3.07e-03

Pos logit contributions
Mark+0.022
Rex+0.021
What+0.021
Big+0.021
 against+0.020
Neg logit contributions
 Majesty-0.055
ott-0.053
 majesty-0.051
 te-0.050
annah-0.048
Token She
Token ID1375
Feature activation+4.83e-02

Pos logit contributions
What+0.004
Mark+0.003
Kim+0.003
Time+0.003
By+0.003
Neg logit contributions
 Majesty-0.007
ott-0.006
 majesty-0.006
igg-0.006
oppy-0.006
Token wanted
Token ID2227
Feature activation+3.12e-02

Pos logit contributions
 Majesty+0.108
ott+0.108
annah+0.101
 cabinet+0.101
ucky+0.100
Neg logit contributions
By-0.038
Mark-0.037
Rex-0.037
 drove-0.035
Big-0.034
Token to
Token ID284
Feature activation-1.70e-02

Pos logit contributions
 Majesty+0.066
ott+0.065
 te+0.063
oppy+0.062
annah+0.061
Neg logit contributions
Mark-0.032
Rex-0.030
Time-0.030
 distance-0.029
Big-0.029
Token invite
Token ID14037
Feature activation+6.69e-02

Pos logit contributions
Mark+0.020
 distance+0.019
By+0.019
Rex+0.019
Kim+0.018
Neg logit contributions
ott-0.034
 Majesty-0.032
 te-0.031
 cabinet-0.030
annah-0.030
Token in
Token ID287
Feature activation+5.66e-02

Pos logit contributions
ott+0.202
 te+0.196
 Majesty+0.193
 cabinet+0.184
annah+0.179
Neg logit contributions
Mark-0.157
Rex-0.154
By-0.149
Big-0.148
 distance-0.146
Token trouble
Token ID5876
Feature activation-7.93e-02

Pos logit contributions
ott+0.109
 te+0.106
 Majesty+0.105
 cabinet+0.100
annah+0.097
Neg logit contributions
Mark-0.067
Rex-0.066
By-0.064
Big-0.063
 distance-0.063
Token and
Token ID290
Feature activation-1.44e-02

Pos logit contributions
Mark+0.107
Rex+0.106
By+0.102
 distance+0.102
Big+0.100
Neg logit contributions
ott-0.140
 te-0.133
 Majesty-0.132
 cabinet-0.126
 Flor-0.121
Token need
Token ID761
Feature activation-4.95e-02

Pos logit contributions
Mark+0.019
Rex+0.019
By+0.018
 distance+0.018
Big+0.018
Neg logit contributions
ott-0.026
 te-0.025
 Majesty-0.025
 cabinet-0.023
annah-0.023
Token their
Token ID511
Feature activation-4.57e-02

Pos logit contributions
Mark+0.056
Rex+0.055
By+0.053
Big+0.052
 distance+0.052
Neg logit contributions
ott-0.099
 te-0.096
 Majesty-0.095
 cabinet-0.091
annah-0.089
Token help
Token ID1037
Feature activation-4.37e-02

Pos logit contributions
Mark+0.072
Rex+0.071
By+0.069
Big+0.069
 distance+0.069
Neg logit contributions
ott-0.070
 te-0.067
 Majesty-0.066
 cabinet-0.062
annah-0.061
Token.
Token ID13
Feature activation-4.65e-03

Pos logit contributions
Mark+0.060
Rex+0.059
By+0.058
Big+0.057
 distance+0.057
Neg logit contributions
ott-0.075
 te-0.073
 Majesty-0.072
 cabinet-0.069
annah-0.067
Token Be
Token ID1355
Feature activation-8.40e-03

Pos logit contributions
Mark+0.006
Rex+0.006
By+0.006
Big+0.006
What+0.006
Neg logit contributions
 Majesty-0.008
 te-0.008
ott-0.008
 cabinet-0.008
 wardrobe-0.008
Token kind
Token ID1611
Feature activation-1.02e-02

Pos logit contributions
Mark+0.017
Rick+0.016
Kim+0.016
Rex+0.016
Josh+0.016
Neg logit contributions
 te-0.009
 Majesty-0.009
ott-0.008
 Av-0.008
 Flor-0.008
Token and
Token ID290
Feature activation-1.40e-02

Pos logit contributions
Mark+0.013
Rex+0.013
By+0.012
Big+0.012
 distance+0.012
Neg logit contributions
ott-0.019
 te-0.018
 Majesty-0.018
 cabinet-0.017
annah-0.017
Token generous
Token ID14431
Feature activation+1.41e-02

Pos logit contributions
Mark+0.021
Rex+0.020
By+0.020
Big+0.019
 distance+0.019
Neg logit contributions
ott-0.023
 te-0.022
 Majesty-0.022
 cabinet-0.021
annah-0.020
Token out
Token ID503
Feature activation+8.84e-02

Pos logit contributions
ott+0.002
 Majesty+0.002
 te+0.002
 cabinet+0.002
 Flor+0.002
Neg logit contributions
Mark-0.002
 distance-0.002
Rex-0.002
By-0.002
Big-0.002
Token of
Token ID286
Feature activation-5.48e-02

Pos logit contributions
ott+0.162
 te+0.160
 Majesty+0.153
 cabinet+0.150
annah+0.145
Neg logit contributions
Mark-0.115
Rex-0.113
By-0.107
Big-0.106
Rick-0.105
Token the
Token ID262
Feature activation-4.66e-02

Pos logit contributions
Mark+0.056
Rex+0.055
By+0.053
Big+0.052
 distance+0.052
Neg logit contributions
ott-0.115
 te-0.112
 Majesty-0.111
 cabinet-0.106
annah-0.104
Token freezer
Token ID30967
Feature activation+6.19e-02

Pos logit contributions
 distance+0.090
Time+0.088
Mark+0.087
Rex+0.086
What+0.086
Neg logit contributions
ott-0.054
 Majesty-0.053
 te-0.049
 Flor-0.048
oppy-0.047
Token and
Token ID290
Feature activation-1.89e-02

Pos logit contributions
ott+0.115
 Majesty+0.113
 te+0.112
 Flor+0.106
oppy+0.103
Neg logit contributions
 distance-0.075
Mark-0.073
Rex-0.071
By-0.069
Big-0.069
Token cook
Token ID4255
Feature activation-3.27e-02

Pos logit contributions
Mark+0.023
Rex+0.022
 distance+0.022
By+0.022
Time+0.022
Neg logit contributions
ott-0.036
 Majesty-0.034
 te-0.033
annah-0.032
 cabinet-0.031
Token the
Token ID262
Feature activation-4.68e-02

Pos logit contributions
Mark+0.044
Rex+0.044
Big+0.042
By+0.041
Rick+0.041
Neg logit contributions
 te-0.059
ott-0.058
 Majesty-0.056
 cabinet-0.055
 Av-0.053
Token cookies
Token ID14746
Feature activation-5.83e-02

Pos logit contributions
Mark+0.080
Rex+0.078
By+0.076
Big+0.075
Rick+0.075
Neg logit contributions
 te-0.069
ott-0.067
 Majesty-0.064
 cabinet-0.063
 wardrobe-0.058
Token.
Token ID13
Feature activation-9.37e-03

Pos logit contributions
Mark+0.082
Rex+0.081
By+0.078
Big+0.077
 distance+0.076
Neg logit contributions
ott-0.099
 te-0.097
 Majesty-0.095
 cabinet-0.091
annah-0.088
Token Lily
Token ID20037
Feature activation+1.45e-01

Pos logit contributions
When+0.011
What+0.011
By+0.011
The+0.010
Before+0.010
Neg logit contributions
 Majesty-0.019
anda-0.017
 majesty-0.017
ott-0.016
act-0.016
Token helped
Token ID4193
Feature activation+4.92e-02

Pos logit contributions
 Majesty+0.298
ott+0.297
 te+0.281
 Flor+0.277
 cabinet+0.277
Neg logit contributions
Mark-0.142
Rex-0.138
By-0.134
Big-0.133
Time-0.127
Token the
Token ID262
Feature activation-4.76e-02

Pos logit contributions
ott+0.087
 te+0.086
 Majesty+0.083
 cabinet+0.080
 Flor+0.078
Neg logit contributions
Mark-0.056
Rex-0.054
By-0.053
Big-0.052
 distance-0.051
Token cooler
Token ID19346
Feature activation+3.84e-02

Pos logit contributions
Mark+0.082
Rex+0.080
Big+0.078
By+0.078
Rick+0.077
Neg logit contributions
ott-0.067
 te-0.066
 Majesty-0.063
 cabinet-0.061
 Flor-0.058
Token because
Token ID780
Feature activation-1.90e-02

Pos logit contributions
 Majesty+0.072
ott+0.071
 te+0.069
 majesty+0.064
annah+0.064
Neg logit contributions
Mark-0.046
 distance-0.046
Rex-0.046
By-0.043
Big-0.042
Token it
Token ID340
Feature activation+4.87e-04

Pos logit contributions
Mark+0.024
Rex+0.023
By+0.022
Rick+0.022
Big+0.022
Neg logit contributions
ott-0.035
 te-0.034
 cabinet-0.033
 Majesty-0.032
 wardrobe-0.031
Token kept
Token ID4030
Feature activation-1.57e-02

Pos logit contributions
 Majesty+0.001
ott+0.001
 te+0.001
annah+0.001
 cabinet+0.001
Neg logit contributions
Rex-0.000
Mark-0.000
 distance-0.000
Big-0.000
By-0.000
Token everything
Token ID2279
Feature activation-6.82e-02

Pos logit contributions
Mark+0.020
Rex+0.019
By+0.019
Big+0.018
 distance+0.018
Neg logit contributions
ott-0.029
 te-0.028
 Majesty-0.027
 cabinet-0.027
 Flor-0.026
Token cold
Token ID4692
Feature activation-4.92e-02

Pos logit contributions
Mark+0.102
Rex+0.100
By+0.097
Big+0.096
 distance+0.096
Neg logit contributions
ott-0.110
 te-0.107
 Majesty-0.105
 cabinet-0.099
annah-0.096
Token.
Token ID13
Feature activation-4.87e-03

Pos logit contributions
Mark+0.062
Rex+0.061
 distance+0.059
By+0.059
Big+0.058
Neg logit contributions
ott-0.091
 te-0.089
 Majesty-0.088
 cabinet-0.083
annah-0.081
Token
Token ID198
Feature activation-5.73e-03

Pos logit contributions
Mark+0.005
Rex+0.005
Big+0.005
By+0.005
Kim+0.005
Neg logit contributions
ott-0.010
 te-0.010
 Majesty-0.010
 cabinet-0.009
 Flor-0.009
Token
Token ID198
Feature activation-7.48e-03

Pos logit contributions
Rex+0.008
Mark+0.008
By+0.008
 brake+0.008
 distance+0.008
Neg logit contributions
ott-0.010
 te-0.009
 cabinet-0.008
oppy-0.008
 Majesty-0.008
TokenWhile
Token ID3633
Feature activation-3.52e-02

Pos logit contributions
Rex+0.008
Mark+0.008
 distance+0.008
By+0.008
 brake+0.008
Neg logit contributions
ott-0.014
 te-0.014
 Majesty-0.013
 cabinet-0.013
annah-0.013
Token play
Token ID711
Feature activation+4.60e-02

Pos logit contributions
 Majesty+0.317
ott+0.305
 te+0.295
annah+0.286
 majesty+0.278
Neg logit contributions
Mark-0.139
Rex-0.136
 distance-0.132
Kim-0.130
By-0.130
Token with
Token ID351
Feature activation+1.16e-01

Pos logit contributions
ott+0.087
 Majesty+0.085
 te+0.080
annah+0.079
 majesty+0.077
Neg logit contributions
Rex-0.056
Mark-0.056
What-0.055
By-0.054
Time-0.052
Token you
Token ID345
Feature activation+1.48e-01

Pos logit contributions
ott+0.203
 Majesty+0.197
 te+0.184
 cabinet+0.176
annah+0.174
Neg logit contributions
Rex-0.166
Mark-0.163
Big-0.157
What-0.155
By-0.148
Token?
Token ID30
Feature activation+4.86e-02

Pos logit contributions
ott+0.201
 te+0.193
 Majesty+0.190
 cabinet+0.176
annah+0.171
Neg logit contributions
Mark-0.260
Rex-0.256
By-0.250
Big-0.248
 distance-0.247
Token I
Token ID314
Feature activation+1.44e-01

Pos logit contributions
ott+0.083
 Majesty+0.082
 te+0.076
act+0.074
igg+0.073
Neg logit contributions
Rex-0.071
Mark-0.071
What-0.068
By-0.067
Big-0.066
Token don
Token ID836
Feature activation-9.53e-02

Pos logit contributions
ott+0.281
 Majesty+0.281
 te+0.274
 cabinet+0.262
annah+0.260
Neg logit contributions
Mark-0.160
Rex-0.159
By-0.157
Time-0.152
 distance-0.151
Token't
Token ID470
Feature activation+2.14e-02

Pos logit contributions
Mark+0.115
Rex+0.114
 distance+0.112
By+0.109
Big+0.108
Neg logit contributions
ott-0.179
 te-0.175
 Majesty-0.167
 cabinet-0.165
annah-0.160
Token have
Token ID423
Feature activation-7.45e-02

Pos logit contributions
ott+0.044
 te+0.044
 Majesty+0.044
 cabinet+0.041
 Flor+0.040
Neg logit contributions
Mark-0.022
 distance-0.021
Rex-0.021
By-0.020
Time-0.020
Token any
Token ID597
Feature activation-2.07e-02

Pos logit contributions
Mark+0.075
Rex+0.073
By+0.071
 distance+0.070
Time+0.070
Neg logit contributions
ott-0.156
 Majesty-0.146
 te-0.146
 cabinet-0.143
annah-0.141
Token toys
Token ID14958
Feature activation+1.77e-02

Pos logit contributions
Rex+0.028
Mark+0.027
By+0.027
 distance+0.027
 brake+0.026
Neg logit contributions
ott-0.038
 Majesty-0.035
annah-0.033
 te-0.033
iana-0.032
Token here
Token ID994
Feature activation+6.99e-02

Pos logit contributions
ott+0.034
 Majesty+0.033
 te+0.032
 Flor+0.030
oppy+0.030
Neg logit contributions
Rex-0.021
Mark-0.021
By-0.020
 distance-0.020
Time-0.019
Token but
Token ID475
Feature activation-3.09e-02

Pos logit contributions
ott+0.098
 te+0.097
 Majesty+0.090
 cabinet+0.089
 Flor+0.086
Neg logit contributions
Mark-0.096
Rex-0.094
By-0.093
 distance-0.090
Time-0.090
Token sometimes
Token ID3360
Feature activation-4.30e-02

Pos logit contributions
Mark+0.050
Rex+0.049
By+0.048
Time+0.048
Big+0.047
Neg logit contributions
ott-0.046
 Majesty-0.045
 te-0.044
 cabinet-0.041
annah-0.040
Token she
Token ID673
Feature activation+8.82e-02

Pos logit contributions
Rex+0.078
Mark+0.078
By+0.075
Big+0.074
Rick+0.073
Neg logit contributions
 te-0.055
ott-0.055
 Majesty-0.053
 cabinet-0.051
 wardrobe-0.049
Token didn
Token ID1422
Feature activation-3.77e-02

Pos logit contributions
ott+0.173
 Majesty+0.171
 te+0.166
 cabinet+0.161
annah+0.160
Neg logit contributions
Mark-0.098
Rex-0.097
Big-0.091
By-0.091
 distance-0.089
Token't
Token ID470
Feature activation+2.04e-02

Pos logit contributions
Mark+0.039
Rex+0.037
Big+0.036
By+0.036
Time+0.035
Neg logit contributions
ott-0.078
 Majesty-0.078
 te-0.077
 cabinet-0.073
annah-0.071
Token understand
Token ID1833
Feature activation-2.28e-01

Pos logit contributions
ott+0.039
 te+0.038
 Majesty+0.037
 cabinet+0.035
annah+0.034
Neg logit contributions
Mark-0.025
Rex-0.024
By-0.023
Big-0.023
 distance-0.023
Token other
Token ID584
Feature activation+6.04e-02

Pos logit contributions
Mark+0.294
Rex+0.289
By+0.279
Big+0.277
 distance+0.275
Neg logit contributions
ott-0.416
 te-0.405
 Majesty-0.399
 cabinet-0.380
annah-0.370
Token people
Token ID661
Feature activation-1.02e-01

Pos logit contributions
ott+0.107
 te+0.106
 Majesty+0.104
 cabinet+0.101
 wardrobe+0.097
Neg logit contributions
Mark-0.080
By-0.078
Rex-0.078
Big-0.075
Rick-0.074
Token's
Token ID338
Feature activation+1.48e-01

Pos logit contributions
Mark+0.162
Rex+0.159
By+0.157
Big+0.154
Time+0.153
Neg logit contributions
ott-0.155
 te-0.152
 Majesty-0.145
 cabinet-0.141
 Flor-0.135
Token language
Token ID3303
Feature activation-6.08e-02

Pos logit contributions
 te+0.247
 Majesty+0.246
ott+0.245
 cabinet+0.228
 wardrobe+0.224
Neg logit contributions
Mark-0.220
Rex-0.214
By-0.208
Rick-0.208
Kim-0.204
Token.
Token ID13
Feature activation-1.21e-03

Pos logit contributions
Mark+0.079
Rex+0.076
By+0.074
 distance+0.073
Big+0.073
Neg logit contributions
ott-0.111
 Majesty-0.107
 te-0.106
annah-0.099
 cabinet-0.098
Token some
Token ID617
Feature activation-9.33e-04

Pos logit contributions
Mark+0.253
Rex+0.251
By+0.245
Time+0.234
Big+0.234
Neg logit contributions
ott-0.389
 te-0.384
 Majesty-0.382
 cabinet-0.365
 Flor-0.359
Token strange
Token ID6283
Feature activation-1.30e-02

Pos logit contributions
Mark+0.001
Rex+0.001
Rick+0.001
By+0.001
Josh+0.001
Neg logit contributions
 te-0.002
ott-0.002
 Majesty-0.002
 cabinet-0.002
 Flor-0.002
Token noises
Token ID26782
Feature activation-1.42e-01

Pos logit contributions
Rex+0.014
Mark+0.014
By+0.014
Rick+0.013
Big+0.013
Neg logit contributions
 te-0.027
ott-0.026
 Majesty-0.026
 cabinet-0.025
 Flor-0.024
Token in
Token ID287
Feature activation+6.01e-02

Pos logit contributions
Mark+0.233
Rex+0.229
Rick+0.227
Big+0.223
Kim+0.222
Neg logit contributions
 te-0.208
ott-0.200
 cabinet-0.187
 Majesty-0.178
 Flor-0.177
Token the
Token ID262
Feature activation-3.89e-02

Pos logit contributions
 te+0.108
 cabinet+0.107
ott+0.106
 wardrobe+0.102
 Flor+0.100
Neg logit contributions
By-0.075
Rex-0.072
Mark-0.070
Rick-0.068
Thunder-0.068
Token distance
Token ID5253
Feature activation-1.84e-01

Pos logit contributions
Mark+0.066
Rex+0.064
By+0.062
Rick+0.061
Big+0.061
Neg logit contributions
ott-0.057
 te-0.055
 Majesty-0.053
 cabinet-0.052
 wardrobe-0.050
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
Mark+0.327
Rex+0.326
By+0.325
Rick+0.317
Big+0.313
Neg logit contributions
 te-0.238
ott-0.232
 cabinet-0.221
 Majesty-0.212
 Flor-0.207
Token
Token ID220
Feature activation+2.01e-02

Pos logit contributions
Mark+0.011
Big+0.011
By+0.011
Andy+0.011
Rex+0.011
Neg logit contributions
 te-0.021
 Majesty-0.021
ott-0.021
 majesty-0.019
 cabinet-0.019
Token
Token ID198
Feature activation-1.08e-02

Pos logit contributions
ott+0.041
 te+0.041
 Majesty+0.040
 cabinet+0.038
 wardrobe+0.037
Neg logit contributions
Mark-0.022
Rex-0.021
By-0.020
Big-0.020
Kim-0.020
Token
Token ID198
Feature activation-1.21e-02

Pos logit contributions
 distance+0.015
Mark+0.015
Rex+0.015
 brake+0.015
Time+0.014
Neg logit contributions
ott-0.018
 te-0.016
 Majesty-0.015
 Flor-0.015
oppy-0.015
TokenThe
Token ID464
Feature activation-1.01e-01

Pos logit contributions
 distance+0.012
Mark+0.012
Rex+0.012
 brake+0.012
Josh+0.011
Neg logit contributions
ott-0.024
 te-0.022
 Majesty-0.021
oppy-0.021
 cabinet-0.021
Token.
Token ID13
Feature activation-5.18e-03

Pos logit contributions
ott+0.041
 Majesty+0.040
 te+0.040
 cabinet+0.037
annah+0.036
Neg logit contributions
Mark-0.032
Rex-0.031
 distance-0.030
By-0.030
Big-0.030
Token It
Token ID632
Feature activation-3.41e-02

Pos logit contributions
Mark+0.006
What+0.006
Time+0.006
By+0.006
 distance+0.006
Neg logit contributions
ott-0.010
 Majesty-0.010
oppy-0.010
 te-0.009
 wardrobe-0.009
Token was
Token ID373
Feature activation+3.09e-02

Pos logit contributions
Rex+0.048
Mark+0.048
Big+0.047
By+0.046
Rick+0.046
Neg logit contributions
ott-0.057
 te-0.056
 cabinet-0.051
 Flor-0.049
 Majesty-0.049
Token hard
Token ID1327
Feature activation-4.04e-02

Pos logit contributions
 te+0.035
 cabinet+0.035
 wardrobe+0.031
ott+0.030
 Flor+0.030
Neg logit contributions
Rex-0.057
hing-0.056
Mark-0.055
cle-0.054
By-0.054
Token and
Token ID290
Feature activation-1.81e-02

Pos logit contributions
Mark+0.052
Rex+0.051
By+0.050
Big+0.049
 distance+0.049
Neg logit contributions
ott-0.074
 te-0.072
 Majesty-0.070
 cabinet-0.067
annah-0.065
Token steep
Token ID14559
Feature activation-1.68e-01

Pos logit contributions
Mark+0.027
Rex+0.026
Rick+0.026
By+0.026
Josh+0.025
Neg logit contributions
 te-0.028
ott-0.027
 cabinet-0.026
 Flor-0.025
 Av-0.025
Token.
Token ID13
Feature activation-5.87e-03

Pos logit contributions
Rex+0.275
Mark+0.275
By+0.273
Big+0.270
Time+0.262
Neg logit contributions
 te-0.240
ott-0.237
 Majesty-0.229
 cabinet-0.227
annah-0.212
Token They
Token ID1119
Feature activation-4.33e-02

Pos logit contributions
Mark+0.008
By+0.007
What+0.007
Time+0.007
Rex+0.007
Neg logit contributions
ott-0.011
 Majesty-0.010
 te-0.010
oppy-0.010
 wardrobe-0.010
Token had
Token ID550
Feature activation-6.64e-02

Pos logit contributions
Mark+0.056
Rex+0.056
Time+0.054
By+0.054
Big+0.053
Neg logit contributions
ott-0.078
 te-0.075
 Majesty-0.071
 cabinet-0.071
 Flor-0.068
Token to
Token ID284
Feature activation-1.49e-02

Pos logit contributions
Rex+0.111
Mark+0.107
Big+0.107
hing+0.107
Little+0.104
Neg logit contributions
 te-0.087
ott-0.087
 Majesty-0.083
 cabinet-0.080
 wardrobe-0.076
Token use
Token ID779
Feature activation+1.88e-02

Pos logit contributions
Rex+0.021
Mark+0.021
By+0.021
Big+0.021
Time+0.020
Neg logit contributions
 Majesty-0.023
 te-0.023
ott-0.023
 cabinet-0.022
annah-0.022
Token
Token ID198
Feature activation-3.91e-03

Pos logit contributions
Mark+0.004
Rex+0.004
 distance+0.004
By+0.004
Time+0.004
Neg logit contributions
ott-0.005
 te-0.005
 Majesty-0.005
 Flor-0.005
 cabinet-0.005
TokenM
Token ID44
Feature activation-3.29e-02

Pos logit contributions
 distance+0.004
 brake+0.004
Mark+0.004
hing+0.004
 response+0.004
Neg logit contributions
ott-0.008
 te-0.007
 Flor-0.007
 Majesty-0.007
 Av-0.007
Tokenira
Token ID8704
Feature activation-1.26e-01

Pos logit contributions
Mark+0.059
 distance+0.059
What+0.058
Rex+0.058
By+0.057
Neg logit contributions
 Majesty-0.042
ott-0.041
 te-0.041
anda-0.038
annah-0.037
Token and
Token ID290
Feature activation-1.51e-02

Pos logit contributions
Mark+0.154
Rex+0.151
By+0.146
Big+0.145
 distance+0.144
Neg logit contributions
ott-0.237
 te-0.231
 Majesty-0.227
 cabinet-0.217
annah-0.211
Token R
Token ID371
Feature activation-4.63e-02

Pos logit contributions
Mark+0.025
Rex+0.025
By+0.025
 distance+0.025
Time+0.024
Neg logit contributions
ott-0.021
 te-0.021
 Majesty-0.020
 Flor-0.019
 Av-0.019
Tokenavi
Token ID15820
Feature activation-1.77e-01

Pos logit contributions
Mark+0.054
Rex+0.053
By+0.051
Big+0.051
 distance+0.050
Neg logit contributions
ott-0.090
 te-0.088
 Majesty-0.087
 cabinet-0.083
annah-0.081
Token were
Token ID547
Feature activation-5.27e-02

Pos logit contributions
Mark+0.215
Time+0.214
Rex+0.209
Big+0.205
Kim+0.203
Neg logit contributions
ott-0.328
 te-0.327
 Majesty-0.306
 Flor-0.293
 cabinet-0.291
Token scared
Token ID12008
Feature activation-3.66e-02

Pos logit contributions
Mark+0.082
Rex+0.081
By+0.079
Big+0.079
Time+0.078
Neg logit contributions
 te-0.081
ott-0.079
 Majesty-0.078
 cabinet-0.073
 Flor-0.072
Token and
Token ID290
Feature activation-1.41e-02

Pos logit contributions
Mark+0.056
Rex+0.055
By+0.053
Big+0.053
Rick+0.052
Neg logit contributions
ott-0.058
 te-0.058
 Majesty-0.054
 cabinet-0.053
 Flor-0.052
Token shouted
Token ID17293
Feature activation-8.39e-03

Pos logit contributions
Mark+0.020
Rex+0.019
By+0.019
Big+0.019
 distance+0.019
Neg logit contributions
ott-0.024
 te-0.023
 Majesty-0.023
 cabinet-0.022
annah-0.021
Token,
Token ID11
Feature activation+5.90e-02

Pos logit contributions
Rex+0.010
By+0.010
Mark+0.010
 louder+0.009
What+0.009
Neg logit contributions
ott-0.016
 Majesty-0.015
 te-0.014
annah-0.014
 cabinet-0.014
Token go
Token ID467
Feature activation-2.48e-02

Pos logit contributions
Mark+0.016
Rex+0.016
By+0.015
Big+0.015
 distance+0.015
Neg logit contributions
ott-0.026
 te-0.026
 Majesty-0.025
 cabinet-0.024
annah-0.024
Token home
Token ID1363
Feature activation+5.07e-02

Pos logit contributions
Mark+0.033
Rex+0.033
 distance+0.032
By+0.031
Big+0.031
Neg logit contributions
ott-0.044
 Majesty-0.043
 te-0.042
 cabinet-0.040
 Flor-0.039
Token.
Token ID13
Feature activation+4.12e-03

Pos logit contributions
ott+0.086
 te+0.083
 Majesty+0.082
 cabinet+0.078
annah+0.075
Neg logit contributions
Mark-0.072
Rex-0.071
By-0.069
Big-0.068
 distance-0.068
Token
Token ID220
Feature activation+1.99e-02

Pos logit contributions
 Majesty+0.009
ott+0.009
 te+0.008
 cabinet+0.008
 wardrobe+0.008
Neg logit contributions
Mark-0.005
Rex-0.004
Big-0.004
By-0.004
Kim-0.004
Token
Token ID198
Feature activation-6.60e-03

Pos logit contributions
ott+0.041
 te+0.041
 Majesty+0.041
 cabinet+0.039
 wardrobe+0.037
Neg logit contributions
Mark-0.021
Rex-0.020
Big-0.019
By-0.019
Kim-0.019
TokenMary
Token ID24119
Feature activation-1.83e-01

Pos logit contributions
Mark+0.008
Rex+0.008
 distance+0.008
By+0.008
Big+0.008
Neg logit contributions
ott-0.012
 te-0.012
 Majesty-0.012
 cabinet-0.011
annah-0.011
Token put
Token ID1234
Feature activation+4.39e-02

Pos logit contributions
Mark+0.204
Rex+0.202
By+0.192
Big+0.191
 distance+0.188
Neg logit contributions
ott-0.362
 Majesty-0.353
 te-0.343
 cabinet-0.337
 Flor-0.321
Token the
Token ID262
Feature activation-4.71e-02

Pos logit contributions
 te+0.091
ott+0.090
 cabinet+0.086
annah+0.083
 wardrobe+0.083
Neg logit contributions
Mark-0.046
By-0.045
Rex-0.043
Thunder-0.042
Big-0.042
Token purse
Token ID27517
Feature activation+3.44e-02

Pos logit contributions
Mark+0.082
Rex+0.080
By+0.079
Big+0.078
Time+0.077
Neg logit contributions
 te-0.065
ott-0.065
 Majesty-0.062
 cabinet-0.060
 wardrobe-0.057
Token back
Token ID736
Feature activation+2.44e-02

Pos logit contributions
ott+0.058
 te+0.057
 cabinet+0.055
 Majesty+0.054
 wardrobe+0.053
Neg logit contributions
Mark-0.048
Rex-0.047
Time-0.046
By-0.046
Big-0.045
Token in
Token ID287
Feature activation+5.84e-02

Pos logit contributions
ott+0.049
 Majesty+0.048
 te+0.048
 cabinet+0.044
 Av+0.044
Neg logit contributions
Mark-0.027
Rex-0.026
 distance-0.026
Big-0.025
By-0.025