TOP ACTIVATIONS
MAX = 4.852

are going to a far away place !" Tom said ,
, going to a far away place . "
She had never seen a math book before , so she
red . It was a yoga mat ! Ell a was
take us to a far away place ." L
. She had a big math book in her bag .
a story about a far away land that was very distant
take you to a far away place . You have to
can find animals in the math book . Maybe I can
it was from a far away place . She said it
was a nightmare about a scary monster that wouldn 't go
a map of a far away land . "
away . She has a math book with many pictures and
, she went to a yoga class with her mom .
to travel to a far away place and brought Spot along
. Lily has a pink yoga mat . She puts it
. Can we play together again ?" Ben smiles
were going to a far away land . "
. Can we play together again ?" he said .
. Can we play together again ?" L ila

MIN ACTIVATIONS
MIN = -5.273

for you ." Bill gave Bob the bread , and Bob
ily and Ben gave the man some bread . They taped
agreed and they gave the man some of their breakfast .
! The old elephant gave Bob a hug . Bob was
, and they gave the man some food and water .
she loved to give the boy lots of kisses .
The elderly man was giving daddy lots of advice .
iz thanked him , gave John a big hug , and
moment and then gave the deer a big , juicy apple
agreed , and gave the farmer some of their eggs ,
Tom gave the man a new blanket and wished
away . He gave the children some corn . He told
. Lily and Tom give him some treats . They pet
you ." He gave Tim my a sandwich and some juice
nodded eagerly . Mum gave them some food and they sprinkled
So , she gave the bear a sandwich .
Mom nodded and gave them some bread cr umbs from
others . They gave each person a medal to show that
smiled , giving his grand pa a big hug . <|endoftext|>
rescue . They gave the family food , clothes and a

INTERVAL 2.320 - 4.852
CONTAINS 0.493%

fun . Can we be friends ?" Lily forg ives Ben
She fl aps her wings and flies away . She feels
danced and danced until the rain stopped . She went back
study yet because he wanted to have fun .
agreed and they both fell asleep with Teddy between them .

INTERVAL -0.211 - 2.320
CONTAINS 59.919%

. â € It was the l ily !
The girl said , " No , it is not .
day , Lily was playing in the park when she saw
started to cry . They did not have any money .
knew how special it was for her daughter .

INTERVAL -2.742 - -0.211
CONTAINS 39.259%

at the squirrel and asked if he had any money .
felt bad too . But it was too late to say
it left a trail of glitter ing spark les !
poke their home again . I will be nice to the
voice . " Who a there , little one

INTERVAL -5.273 - -2.742
CONTAINS 0.329%

the old woman for giving him this magical spray . He
. So , he gave them all a magical emer ald
territory again . They remembered Nem o and how he loved
two best friends . They were both three years old .
the sea and saw all sorts of wonderful things . Then
Token are
Token ID389
Feature activation-4.53e-01

Pos logit contributions
umbs+0.002
beh+0.002
lette+0.002
igators+0.002
izza+0.002
Neg logit contributions
 notice-0.002
 stage-0.002
But-0.002
 woke-0.002
Who-0.002
Token going
Token ID1016
Feature activation+1.16e+00

Pos logit contributions
umbs+0.019
beh+0.018
lette+0.017
izza+0.017
igators+0.016
Neg logit contributions
 notice-0.016
 stage-0.015
But-0.015
 woke-0.015
 suddenly-0.014
Token to
Token ID284
Feature activation+1.80e+00

Pos logit contributions
Who+0.037
 woke+0.037
 notice+0.037
Stop+0.036
 stage+0.035
Neg logit contributions
umbs-0.045
lette-0.045
o-0.044
beh-0.043
igators-0.042
Token a
Token ID257
Feature activation+1.68e+00

Pos logit contributions
 woke+0.053
But+0.053
Stop+0.053
icol+0.053
Exc+0.052
Neg logit contributions
 equal-0.066
umbs-0.066
igators-0.065
lette-0.063
izza-0.062
Token far
Token ID1290
Feature activation+2.55e+00

Pos logit contributions
 notice+0.059
Stop+0.058
 imagined+0.057
 woke+0.057
But+0.056
Neg logit contributions
 dollar-0.054
ocado-0.052
izza-0.052
igators-0.051
lette-0.051
Tokenaway
Token ID8272
Feature activation+4.85e+00

Pos logit contributions
La+0.077
Who+0.076
icol+0.074
 woke+0.073
 Ach+0.073
Neg logit contributions
igator-0.101
igators-0.100
ole-0.098
 dollar-0.096
melon-0.091
Token place
Token ID1295
Feature activation+1.35e+00

Pos logit contributions
 Ach+0.203
Stop+0.187
Are+0.186
 blush+0.179
Her+0.178
Neg logit contributions
igators-0.144
igator-0.120
 kinds-0.120
 ours-0.119
 equal-0.114
Token!"
Token ID2474
Feature activation+1.36e+00

Pos logit contributions
Who+0.037
 notice+0.036
La+0.035
 woke+0.035
Stop+0.035
Neg logit contributions
umbs-0.061
igators-0.061
lette-0.060
 dollar-0.057
izza-0.057
Token Tom
Token ID4186
Feature activation-3.29e-01

Pos logit contributions
 woke+0.050
 notice+0.049
 seek+0.049
 ti+0.048
Raw+0.048
Neg logit contributions
izza-0.042
umbs-0.041
lette-0.039
o-0.038
 equal-0.038
Token said
Token ID531
Feature activation-8.84e-03

Pos logit contributions
umbs+0.018
beh+0.017
lette+0.017
izza+0.017
igators+0.016
Neg logit contributions
 notice-0.007
 stage-0.006
But-0.006
 woke-0.006
 suddenly-0.006
Token,
Token ID11
Feature activation-1.64e-01

Pos logit contributions
umbs+0.001
beh+0.000
lette+0.000
izza+0.000
igators+0.000
Neg logit contributions
 notice-0.000
 stage-0.000
But-0.000
 woke-0.000
Who-0.000
Token,
Token ID11
Feature activation+6.07e-02

Pos logit contributions
 notice+0.000
 stage+0.000
But+0.000
 woke+0.000
Who+0.000
Neg logit contributions
umbs-0.001
beh-0.001
lette-0.001
izza-0.001
igators-0.001
Token going
Token ID1016
Feature activation+5.97e-01

Pos logit contributions
 notice+0.002
 stage+0.002
 woke+0.002
But+0.002
Who+0.001
Neg logit contributions
umbs-0.003
beh-0.003
lette-0.003
igators-0.003
izza-0.003
Token to
Token ID284
Feature activation+1.59e+00

Pos logit contributions
 notice+0.020
 woke+0.020
Who+0.019
La+0.019
 stage+0.018
Neg logit contributions
umbs-0.023
beh-0.022
igators-0.022
lette-0.022
o-0.021
Token a
Token ID257
Feature activation+1.25e+00

Pos logit contributions
 woke+0.054
icol+0.053
Stop+0.052
 imagined+0.051
�+0.051
Neg logit contributions
 equal-0.053
igators-0.053
umbs-0.052
beh-0.050
 dollar-0.049
Token far
Token ID1290
Feature activation+1.93e+00

Pos logit contributions
 imagined+0.044
 notice+0.044
 woke+0.043
Stop+0.043
 suddenly+0.042
Neg logit contributions
 dollar-0.040
ocado-0.038
lette-0.038
umbs-0.038
igators-0.038
Tokenaway
Token ID8272
Feature activation+4.56e+00

Pos logit contributions
La+0.054
Who+0.051
 woke+0.051
icol+0.050
 ti+0.050
Neg logit contributions
 dollar-0.079
ole-0.079
igators-0.078
igator-0.077
melon-0.074
Token place
Token ID1295
Feature activation+1.13e+00

Pos logit contributions
 Ach+0.190
 blush+0.186
Stop+0.181
Are+0.171
 grade+0.168
Neg logit contributions
igators-0.127
 kinds-0.109
 tank-0.107
 strongest-0.100
igator-0.099
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
Who+0.032
Stop+0.032
 notice+0.031
La+0.031
 woke+0.030
Neg logit contributions
umbs-0.048
lette-0.047
igators-0.047
beh-0.046
 dollar-0.046
Token
Token ID198
Feature activation+3.28e-01

Pos logit contributions
 notice+0.009
 stage+0.008
 woke+0.008
But+0.008
 imagined+0.008
Neg logit contributions
umbs-0.009
lette-0.008
beh-0.008
izza-0.008
ocado-0.007
Token
Token ID198
Feature activation+1.46e-01

Pos logit contributions
 notice+0.010
 stage+0.009
 woke+0.009
Who+0.009
 imagined+0.009
Neg logit contributions
umbs-0.015
lette-0.014
beh-0.014
izza-0.014
igators-0.013
Token"
Token ID1
Feature activation+7.53e-01

Pos logit contributions
 notice+0.005
 stage+0.005
 woke+0.005
 imagined+0.004
Who+0.004
Neg logit contributions
umbs-0.006
beh-0.006
lette-0.006
izza-0.005
ocado-0.005
Token She
Token ID1375
Feature activation+1.43e+00

Pos logit contributions
 notice+0.008
 stage+0.007
But+0.007
 imagined+0.007
 woke+0.007
Neg logit contributions
umbs-0.009
lette-0.008
beh-0.008
izza-0.008
igators-0.008
Token had
Token ID550
Feature activation+2.82e-01

Pos logit contributions
 notice+0.048
 stage+0.045
 avail+0.045
 recess+0.044
 concert+0.042
Neg logit contributions
o-0.056
beh-0.056
lette-0.055
umbs-0.052
ocado-0.052
Token never
Token ID1239
Feature activation+7.58e-01

Pos logit contributions
 notice+0.011
 stage+0.010
But+0.010
 woke+0.010
Who+0.010
Neg logit contributions
umbs-0.010
lette-0.010
beh-0.010
izza-0.009
igators-0.009
Token seen
Token ID1775
Feature activation+5.12e-01

Pos logit contributions
 stage+0.032
But+0.031
 notice+0.030
La+0.030
�+0.030
Neg logit contributions
beh-0.024
umbs-0.023
lette-0.023
 supposed-0.022
known-0.021
Token a
Token ID257
Feature activation+2.35e+00

Pos logit contributions
 notice+0.017
But+0.015
 woke+0.015
 stage+0.015
Who+0.015
Neg logit contributions
umbs-0.022
lette-0.021
beh-0.020
igators-0.019
izza-0.019
Token math
Token ID10688
Feature activation+4.55e+00

Pos logit contributions
 notice+0.088
 imagined+0.085
La+0.085
Stop+0.085
 woke+0.084
Neg logit contributions
 dollar-0.076
ocado-0.070
 supposed-0.070
 equal-0.068
 tast-0.067
Token book
Token ID1492
Feature activation+1.89e+00

Pos logit contributions
icol+0.157
La+0.154
AAAA+0.154
Sh+0.150
Stop+0.149
Neg logit contributions
 dollar-0.155
 dollars-0.144
 strongest-0.143
 tast-0.139
 supposed-0.138
Token before
Token ID878
Feature activation+1.64e+00

Pos logit contributions
Stop+0.061
Who+0.060
 notice+0.060
AAAA+0.059
La+0.059
Neg logit contributions
umbs-0.076
lette-0.070
icking-0.069
asting-0.067
 supposed-0.066
Token,
Token ID11
Feature activation+7.89e-01

Pos logit contributions
 notice+0.050
La+0.047
Who+0.047
Stop+0.046
But+0.044
Neg logit contributions
umbs-0.072
icking-0.064
 supposed-0.064
lette-0.064
allows-0.064
Token so
Token ID523
Feature activation+1.19e+00

Pos logit contributions
 notice+0.023
 stage+0.022
La+0.022
 woke+0.021
 concert+0.021
Neg logit contributions
umbs-0.034
lette-0.033
beh-0.032
 equal-0.030
igators-0.030
Token she
Token ID673
Feature activation+1.12e+00

Pos logit contributions
 notice+0.032
La+0.031
 concert+0.030
Stop+0.030
 speaker+0.029
Neg logit contributions
umbs-0.053
allows-0.051
lette-0.050
beh-0.050
igators-0.049
Token red
Token ID2266
Feature activation+1.51e+00

Pos logit contributions
 notice+0.007
 woke+0.006
But+0.006
 stage+0.006
Who+0.006
Neg logit contributions
umbs-0.006
beh-0.006
lette-0.006
igators-0.006
izza-0.006
Token.
Token ID13
Feature activation+3.69e-01

Pos logit contributions
 notice+0.056
La+0.053
But+0.051
 seek+0.050
 stage+0.050
Neg logit contributions
umbs-0.059
lette-0.058
icking-0.057
melon-0.056
beh-0.053
Token It
Token ID632
Feature activation+1.34e+00

Pos logit contributions
 notice+0.014
 stage+0.013
But+0.013
 imagined+0.013
 woke+0.012
Neg logit contributions
umbs-0.014
lette-0.013
beh-0.013
izza-0.013
oun-0.012
Token was
Token ID373
Feature activation-3.21e-01

Pos logit contributions
 notice+0.043
 concert+0.040
 recess+0.039
 stage+0.039
 grade+0.037
Neg logit contributions
o-0.053
umbs-0.052
igators-0.050
lette-0.048
ocado-0.048
Token a
Token ID257
Feature activation+1.76e+00

Pos logit contributions
umbs+0.014
beh+0.013
lette+0.013
izza+0.013
igators+0.013
Neg logit contributions
 notice-0.010
 stage-0.010
But-0.010
 woke-0.010
 suddenly-0.009
Token yoga
Token ID20351
Feature activation+4.54e+00

Pos logit contributions
 notice+0.075
 imagined+0.069
Stop+0.068
But+0.067
 woke+0.066
Neg logit contributions
 dollar-0.053
ocado-0.050
lette-0.049
umbs-0.049
melon-0.048
Token mat
Token ID2603
Feature activation+7.29e-01

Pos logit contributions
 notice+0.148
Exc+0.136
 inspection+0.134
Who+0.131
Stop+0.125
Neg logit contributions
 dollar-0.164
ery-0.151
ocado-0.151
 best-0.150
lette-0.149
Token!
Token ID0
Feature activation+2.67e-02

Pos logit contributions
 notice+0.020
Who+0.018
But+0.018
La+0.018
 stage+0.017
Neg logit contributions
umbs-0.035
lette-0.033
beh-0.033
igators-0.032
melon-0.032
Token Ell
Token ID7122
Feature activation-8.66e-01

Pos logit contributions
 notice+0.001
 stage+0.001
But+0.001
 woke+0.001
 imagined+0.001
Neg logit contributions
umbs-0.001
beh-0.001
lette-0.001
izza-0.001
igators-0.001
Tokena
Token ID64
Feature activation+7.96e-01

Pos logit contributions
umbs+0.027
igators+0.025
beh+0.025
lette+0.024
izza+0.024
Neg logit contributions
 woke-0.038
 notice-0.038
 imagined-0.037
 heard-0.037
But-0.037
Token was
Token ID373
Feature activation-5.21e-01

Pos logit contributions
 notice+0.022
 stage+0.020
 avail+0.019
Who+0.018
Stop+0.018
Neg logit contributions
umbs-0.037
lette-0.036
beh-0.035
izza-0.034
ocado-0.034
Token take
Token ID1011
Feature activation+2.61e-01

Pos logit contributions
 notice+0.017
 stage+0.017
Who+0.016
La+0.016
But+0.016
Neg logit contributions
umbs-0.027
lette-0.026
beh-0.025
igators-0.025
izza-0.024
Token us
Token ID514
Feature activation-5.79e-01

Pos logit contributions
 notice+0.008
But+0.007
 stage+0.007
Who+0.007
 woke+0.007
Neg logit contributions
umbs-0.012
lette-0.011
beh-0.011
izza-0.011
igators-0.011
Token to
Token ID284
Feature activation+1.64e+00

Pos logit contributions
umbs+0.029
beh+0.028
lette+0.027
izza+0.026
igators+0.026
Neg logit contributions
 notice-0.015
 stage-0.014
But-0.014
 woke-0.013
 suddenly-0.013
Token a
Token ID257
Feature activation+6.35e-01

Pos logit contributions
Stop+0.057
 woke+0.056
icol+0.056
 bumped+0.056
La+0.056
Neg logit contributions
umbs-0.055
 equal-0.053
igators-0.053
lette-0.053
izza-0.053
Token far
Token ID1290
Feature activation+2.03e+00

Pos logit contributions
 notice+0.022
But+0.021
 woke+0.021
 imagined+0.020
Stop+0.020
Neg logit contributions
umbs-0.024
lette-0.023
izza-0.022
igators-0.022
 dollar-0.022
Tokenaway
Token ID8272
Feature activation+4.48e+00

Pos logit contributions
La+0.061
Who+0.060
icol+0.057
Stop+0.055
 ti+0.054
Neg logit contributions
igators-0.082
ole-0.082
igator-0.082
 dollar-0.081
melon-0.078
Token place
Token ID1295
Feature activation+8.12e-01

Pos logit contributions
 Ach+0.184
Are+0.178
Stop+0.177
 blush+0.169
Exc+0.167
Neg logit contributions
igators-0.135
 ours-0.115
 kinds-0.111
igator-0.110
 tank-0.108
Token."
Token ID526
Feature activation+4.70e-01

Pos logit contributions
 notice+0.022
Who+0.021
Stop+0.021
La+0.021
 woke+0.021
Neg logit contributions
umbs-0.037
lette-0.036
igators-0.036
beh-0.035
izza-0.034
Token
Token ID198
Feature activation+5.82e-01

Pos logit contributions
 notice+0.014
 stage+0.014
 woke+0.013
 imagined+0.013
Stop+0.013
Neg logit contributions
umbs-0.021
izza-0.019
lette-0.019
beh-0.019
o-0.019
Token
Token ID198
Feature activation-9.17e-02

Pos logit contributions
 notice+0.018
 stage+0.017
 imagined+0.016
 woke+0.016
Who+0.016
Neg logit contributions
umbs-0.026
izza-0.024
lette-0.024
beh-0.023
o-0.023
TokenL
Token ID43
Feature activation+4.85e-01

Pos logit contributions
umbs+0.004
beh+0.004
lette+0.004
izza+0.004
igators+0.004
Neg logit contributions
 notice-0.003
 stage-0.002
 woke-0.002
But-0.002
Who-0.002
Token.
Token ID13
Feature activation+4.81e-01

Pos logit contributions
 notice+0.036
Stop+0.034
 woke+0.033
La+0.033
 avail+0.033
Neg logit contributions
beh-0.050
illing-0.049
umbs-0.048
lette-0.047
 dollar-0.047
Token She
Token ID1375
Feature activation+2.22e+00

Pos logit contributions
 notice+0.016
 stage+0.015
But+0.015
La+0.015
 imagined+0.015
Neg logit contributions
umbs-0.020
lette-0.018
beh-0.018
igators-0.018
izza-0.018
Token had
Token ID550
Feature activation+9.75e-01

Pos logit contributions
 avail+0.088
 conclusions+0.073
icol+0.073
 notice+0.069
 stage+0.068
Neg logit contributions
o-0.086
 supposed-0.072
ocado-0.070
illing-0.069
known-0.068
Token a
Token ID257
Feature activation+1.63e+00

Pos logit contributions
�+0.040
 notice+0.040
 avail+0.040
 stage+0.039
chool+0.039
Neg logit contributions
 supposed-0.027
lette-0.027
beh-0.027
umbs-0.026
o-0.024
Token big
Token ID1263
Feature activation+1.64e+00

Pos logit contributions
 notice+0.071
 imagined+0.069
But+0.068
Who+0.067
 woke+0.067
Neg logit contributions
 dollar-0.044
ocado-0.039
 best-0.039
lette-0.038
oun-0.036
Token math
Token ID10688
Feature activation+4.45e+00

Pos logit contributions
Who+0.058
But+0.057
 notice+0.054
 suddenly+0.054
Stop+0.053
Neg logit contributions
 dollar-0.063
melon-0.056
ish-0.056
 brothers-0.056
 best-0.055
Token book
Token ID1492
Feature activation+1.37e+00

Pos logit contributions
icol+0.171
 notice+0.168
La+0.167
Stop+0.164
 avail+0.162
Neg logit contributions
 dollar-0.145
 best-0.128
 dollars-0.122
 brothers-0.112
 strongest-0.112
Token in
Token ID287
Feature activation+1.15e+00

Pos logit contributions
 notice+0.053
Who+0.052
Stop+0.052
 avail+0.050
La+0.050
Neg logit contributions
umbs-0.048
 dollar-0.045
lette-0.045
o-0.044
illing-0.043
Token her
Token ID607
Feature activation+8.30e-01

Pos logit contributions
icol+0.032
 concert+0.030
 woke+0.030
�+0.030
 notice+0.030
Neg logit contributions
umbs-0.048
lette-0.047
ady-0.047
 equal-0.046
 dollar-0.046
Token bag
Token ID6131
Feature activation+4.61e-01

Pos logit contributions
 notice+0.025
Stop+0.023
 suddenly+0.023
Who+0.023
 woke+0.022
Neg logit contributions
lette-0.035
umbs-0.034
ady-0.034
ocado-0.034
 dollar-0.034
Token.
Token ID13
Feature activation+5.34e-01

Pos logit contributions
 notice+0.012
Who+0.011
 stage+0.010
Stop+0.010
La+0.010
Neg logit contributions
umbs-0.023
lette-0.022
beh-0.022
icking-0.022
izza-0.021
Token a
Token ID257
Feature activation-4.08e-01

Pos logit contributions
umbs+0.068
beh+0.065
igators+0.062
ady+0.062
lette+0.062
Neg logit contributions
 notice-0.025
 woke-0.025
 stage-0.024
 suddenly-0.023
 heard-0.022
Token story
Token ID1621
Feature activation+7.37e-01

Pos logit contributions
umbs+0.017
beh+0.017
lette+0.016
igators+0.015
izza+0.015
Neg logit contributions
 notice-0.014
 stage-0.013
 woke-0.013
But-0.013
Who-0.013
Token about
Token ID546
Feature activation-7.26e-01

Pos logit contributions
 notice+0.032
Who+0.030
But+0.030
Stop+0.030
 stage+0.030
Neg logit contributions
umbs-0.024
lette-0.022
beh-0.021
izza-0.021
 dollar-0.020
Token a
Token ID257
Feature activation+1.16e+00

Pos logit contributions
umbs+0.035
beh+0.033
lette+0.032
izza+0.031
igators+0.031
Neg logit contributions
 notice-0.021
 stage-0.020
But-0.020
 woke-0.019
Who-0.019
Token far
Token ID1290
Feature activation+1.20e+00

Pos logit contributions
 notice+0.044
Stop+0.044
But+0.042
La+0.042
 imagined+0.041
Neg logit contributions
ocado-0.036
 dollar-0.036
lette-0.035
izza-0.035
umbs-0.034
Tokenaway
Token ID8272
Feature activation+4.44e+00

Pos logit contributions
La+0.035
 imagined+0.033
Who+0.032
 stage+0.032
 woke+0.032
Neg logit contributions
 dollar-0.050
melon-0.049
igators-0.048
beh-0.048
igator-0.047
Token land
Token ID1956
Feature activation+7.13e-01

Pos logit contributions
Stop+0.175
 Ach+0.172
La+0.166
Her+0.157
J+0.154
Neg logit contributions
ishes-0.116
 chicks-0.115
bi-0.111
igators-0.110
 dollars-0.106
Token that
Token ID326
Feature activation-2.10e-01

Pos logit contributions
 notice+0.021
Who+0.020
 stage+0.019
 woke+0.019
Stop+0.019
Neg logit contributions
umbs-0.032
lette-0.031
beh-0.031
igators-0.030
icking-0.029
Token was
Token ID373
Feature activation-4.25e-01

Pos logit contributions
umbs+0.010
beh+0.010
lette+0.010
izza+0.010
igators+0.009
Neg logit contributions
 notice-0.005
 stage-0.005
But-0.005
 woke-0.005
Who-0.005
Token very
Token ID845
Feature activation-3.92e-01

Pos logit contributions
umbs+0.017
beh+0.016
lette+0.016
izza+0.016
igators+0.015
Neg logit contributions
 notice-0.015
 stage-0.014
But-0.014
 woke-0.014
 suddenly-0.014
Token distant
Token ID12899
Feature activation+6.30e-01

Pos logit contributions
umbs+0.014
beh+0.012
lette+0.012
izza+0.012
igators+0.011
Neg logit contributions
 notice-0.017
But-0.016
 stage-0.016
 woke-0.016
 suddenly-0.015
Token take
Token ID1011
Feature activation+1.13e-02

Pos logit contributions
 stage+0.017
 notice+0.017
 woke+0.016
Who+0.016
But+0.016
Neg logit contributions
umbs-0.025
lette-0.023
beh-0.023
igators-0.023
 equal-0.022
Token you
Token ID345
Feature activation-5.08e-01

Pos logit contributions
 notice+0.000
 stage+0.000
But+0.000
 woke+0.000
Who+0.000
Neg logit contributions
umbs-0.000
beh-0.000
lette-0.000
izza-0.000
igators-0.000
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
umbs+0.025
beh+0.024
lette+0.023
izza+0.023
igators+0.022
Neg logit contributions
 notice-0.014
 stage-0.013
But-0.012
 woke-0.012
 suddenly-0.012
Token a
Token ID257
Feature activation+6.72e-01

Pos logit contributions
 woke+0.046
Who+0.045
Stop+0.045
icol+0.044
But+0.044
Neg logit contributions
umbs-0.050
 equal-0.049
lette-0.049
izza-0.048
beh-0.047
Token far
Token ID1290
Feature activation+2.04e+00

Pos logit contributions
 notice+0.022
But+0.021
 woke+0.021
Stop+0.021
 imagined+0.021
Neg logit contributions
umbs-0.026
lette-0.025
beh-0.025
izza-0.025
igators-0.024
Tokenaway
Token ID8272
Feature activation+4.43e+00

Pos logit contributions
La+0.062
Who+0.060
Stop+0.056
icol+0.055
 stage+0.054
Neg logit contributions
ole-0.083
igators-0.081
 dollar-0.081
igator-0.080
melon-0.077
Token place
Token ID1295
Feature activation+8.23e-01

Pos logit contributions
 Ach+0.176
Stop+0.167
Are+0.167
La+0.157
 blush+0.156
Neg logit contributions
igators-0.143
 equal-0.121
 ours-0.118
igator-0.116
ishes-0.115
Token.
Token ID13
Feature activation-3.16e-01

Pos logit contributions
 notice+0.022
Who+0.022
 woke+0.021
Stop+0.021
La+0.021
Neg logit contributions
umbs-0.038
lette-0.037
beh-0.036
igators-0.036
izza-0.035
Token You
Token ID921
Feature activation-1.03e+00

Pos logit contributions
umbs+0.013
beh+0.012
lette+0.012
izza+0.012
igators+0.012
Neg logit contributions
 notice-0.011
 stage-0.010
But-0.010
 woke-0.010
Who-0.010
Token have
Token ID423
Feature activation-1.67e-01

Pos logit contributions
umbs+0.049
beh+0.047
lette+0.046
izza+0.044
illing+0.044
Neg logit contributions
 notice-0.032
 woke-0.029
 stage-0.029
But-0.028
 imagined-0.028
Token to
Token ID284
Feature activation+5.64e-01

Pos logit contributions
umbs+0.008
beh+0.008
lette+0.008
izza+0.007
igators+0.007
Neg logit contributions
 notice-0.005
 stage-0.004
But-0.004
 woke-0.004
Who-0.004
Token can
Token ID460
Feature activation+1.19e+00

Pos logit contributions
 stage+0.009
 notice+0.009
But+0.008
 woke+0.008
La+0.008
Neg logit contributions
umbs-0.011
beh-0.011
lette-0.011
ocado-0.010
izza-0.010
Token find
Token ID1064
Feature activation+1.17e+00

Pos logit contributions
icol+0.035
 woke+0.035
 stage+0.035
Stop+0.034
AAAA+0.034
Neg logit contributions
lette-0.047
umbs-0.047
 equal-0.045
beh-0.043
izza-0.043
Token animals
Token ID4695
Feature activation+9.41e-01

Pos logit contributions
 notice+0.040
 woke+0.038
But+0.038
 stage+0.037
Who+0.037
Neg logit contributions
umbs-0.045
lette-0.044
izza-0.042
 equal-0.041
beh-0.041
Token in
Token ID287
Feature activation+2.70e-01

Pos logit contributions
 notice+0.026
 woke+0.024
icol+0.024
 stage+0.024
Who+0.023
Neg logit contributions
umbs-0.043
lette-0.042
igators-0.041
beh-0.040
 dollar-0.040
Token the
Token ID262
Feature activation+7.36e-01

Pos logit contributions
 notice+0.008
 woke+0.008
But+0.008
 stage+0.008
Who+0.007
Neg logit contributions
umbs-0.012
lette-0.011
beh-0.011
igators-0.011
izza-0.011
Token math
Token ID10688
Feature activation+4.41e+00

Pos logit contributions
 notice+0.023
 suddenly+0.021
 imagined+0.020
 woke+0.020
But+0.020
Neg logit contributions
lette-0.030
 dollar-0.030
umbs-0.029
ocado-0.029
izza-0.029
Token book
Token ID1492
Feature activation+2.58e+00

Pos logit contributions
icol+0.168
 inspection+0.144
 notice+0.141
AAAA+0.139
Stop+0.137
Neg logit contributions
 dollar-0.159
 dollars-0.152
igators-0.150
ocado-0.143
 equal-0.137
Token.
Token ID13
Feature activation+6.36e-01

Pos logit contributions
icol+0.087
 notice+0.081
Who+0.080
Stop+0.079
 avail+0.077
Neg logit contributions
umbs-0.108
igators-0.101
 dollar-0.098
ocado-0.096
lette-0.096
Token Maybe
Token ID6674
Feature activation+2.28e+00

Pos logit contributions
 notice+0.019
 stage+0.018
 woke+0.017
lf+0.016
 suddenly+0.016
Neg logit contributions
umbs-0.029
lette-0.028
izza-0.027
beh-0.027
ocado-0.026
Token I
Token ID314
Feature activation+1.87e-01

Pos logit contributions
icol+0.087
 notice+0.085
Stop+0.082
 woke+0.081
 bumped+0.080
Neg logit contributions
 dollar-0.072
lette-0.070
 ours-0.069
izza-0.069
igators-0.068
Token can
Token ID460
Feature activation+1.04e+00

Pos logit contributions
 stage+0.006
 notice+0.006
But+0.006
 woke+0.005
La+0.005
Neg logit contributions
umbs-0.008
beh-0.008
lette-0.008
izza-0.007
ocado-0.007
Token it
Token ID340
Feature activation-7.95e-01

Pos logit contributions
umbs+0.018
beh+0.017
lette+0.017
izza+0.016
igators+0.016
Neg logit contributions
 notice-0.006
 stage-0.006
 woke-0.006
But-0.006
 suddenly-0.005
Token was
Token ID373
Feature activation-6.95e-01

Pos logit contributions
umbs+0.049
beh+0.048
lette+0.047
izza+0.046
igators+0.045
Neg logit contributions
 woke-0.011
 notice-0.011
 suddenly-0.010
 stage-0.010
But-0.009
Token from
Token ID422
Feature activation+4.22e-01

Pos logit contributions
umbs+0.032
beh+0.030
lette+0.029
izza+0.029
igators+0.028
Neg logit contributions
 notice-0.021
 stage-0.020
But-0.020
 woke-0.020
 suddenly-0.019
Token a
Token ID257
Feature activation+6.35e-01

Pos logit contributions
 notice+0.013
 woke+0.012
But+0.012
 stage+0.012
Who+0.012
Neg logit contributions
umbs-0.018
lette-0.018
beh-0.018
igators-0.017
izza-0.017
Token far
Token ID1290
Feature activation+1.59e+00

Pos logit contributions
 notice+0.021
But+0.021
 stage+0.020
La+0.020
Who+0.020
Neg logit contributions
umbs-0.025
lette-0.024
 dollar-0.023
beh-0.023
izza-0.023
Tokenaway
Token ID8272
Feature activation+4.40e+00

Pos logit contributions
La+0.058
Who+0.055
 imagined+0.054
lf+0.053
 stage+0.053
Neg logit contributions
 dollar-0.055
igator-0.054
melon-0.054
igators-0.054
ole-0.052
Token place
Token ID1295
Feature activation+8.04e-01

Pos logit contributions
 Ach+0.162
La+0.153
Ben+0.153
Ba+0.153
Stop+0.150
Neg logit contributions
igators-0.141
bi-0.123
igator-0.119
 chicks-0.119
 kinds-0.119
Token.
Token ID13
Feature activation+8.41e-01

Pos logit contributions
 notice+0.021
Who+0.020
La+0.019
 stage+0.019
But+0.019
Neg logit contributions
umbs-0.038
lette-0.037
beh-0.037
igators-0.036
icking-0.035
Token She
Token ID1375
Feature activation+8.44e-01

Pos logit contributions
 notice+0.029
 imagined+0.028
 seek+0.028
 stage+0.028
lf+0.027
Neg logit contributions
umbs-0.034
izza-0.030
lette-0.030
ocado-0.029
 dollar-0.028
Token said
Token ID531
Feature activation-2.18e-01

Pos logit contributions
 notice+0.027
 stage+0.025
But+0.024
Stop+0.023
La+0.023
Neg logit contributions
umbs-0.035
lette-0.034
beh-0.034
igators-0.033
izza-0.033
Token it
Token ID340
Feature activation-7.30e-01

Pos logit contributions
umbs+0.012
beh+0.011
lette+0.011
izza+0.011
igators+0.011
Neg logit contributions
 notice-0.005
 stage-0.004
But-0.004
 woke-0.004
Who-0.004
Token was
Token ID373
Feature activation+3.27e-01

Pos logit contributions
 notice+0.025
 stage+0.024
But+0.023
La+0.022
Who+0.022
Neg logit contributions
beh-0.043
umbs-0.042
igators-0.041
o-0.039
lette-0.039
Token a
Token ID257
Feature activation+1.61e+00

Pos logit contributions
 notice+0.012
But+0.011
Who+0.011
 stage+0.011
Stop+0.011
Neg logit contributions
umbs-0.012
beh-0.012
lette-0.012
igators-0.011
icking-0.011
Token nightmare
Token ID17123
Feature activation+2.60e+00

Pos logit contributions
Stop+0.067
 notice+0.067
Who+0.064
 imagined+0.064
But+0.064
Neg logit contributions
 dollar-0.043
ocado-0.041
 tast-0.040
melon-0.039
lette-0.039
Token about
Token ID546
Feature activation+1.72e+00

Pos logit contributions
Who+0.093
La+0.092
Stop+0.092
 avail+0.090
 notice+0.090
Neg logit contributions
umbs-0.093
icking-0.086
o-0.084
lette-0.084
 supposed-0.083
Token a
Token ID257
Feature activation+3.40e+00

Pos logit contributions
 notice+0.069
But+0.067
�+0.067
La+0.063
Exc+0.063
Neg logit contributions
beh-0.052
 supposed-0.052
igators-0.052
allows-0.050
umbs-0.049
Token scary
Token ID14343
Feature activation+4.38e+00

Pos logit contributions
Stop+0.129
zes+0.128
ulk+0.128
chool+0.127
Fine+0.124
Neg logit contributions
 valuable-0.097
 best-0.090
 lucky-0.090
 important-0.089
 strongest-0.089
Token monster
Token ID9234
Feature activation+2.47e+00

Pos logit contributions
Sorry+0.189
Exc+0.182
Who+0.180
Stop+0.180
But+0.178
Neg logit contributions
 important-0.098
 best-0.095
 tank-0.093
 adorable-0.092
 cub-0.091
Token that
Token ID326
Feature activation+1.98e+00

Pos logit contributions
 avail+0.084
 notice+0.081
Stop+0.080
But+0.078
Who+0.078
Neg logit contributions
o-0.093
umbs-0.088
igators-0.087
beh-0.086
lette-0.084
Token wouldn
Token ID3636
Feature activation+8.09e-01

Pos logit contributions
 notice+0.063
 avail+0.060
But+0.060
chool+0.058
 stage+0.057
Neg logit contributions
igators-0.079
beh-0.078
igator-0.074
o-0.072
umbs-0.070
Token't
Token ID470
Feature activation+1.54e+00

Pos logit contributions
 notice+0.030
 stage+0.029
 imagined+0.029
But+0.028
Who+0.028
Neg logit contributions
umbs-0.031
beh-0.028
lette-0.026
oun-0.026
igators-0.026
Token go
Token ID467
Feature activation+1.29e+00

Pos logit contributions
icol+0.053
chool+0.052
 stage+0.051
Really+0.049
La+0.049
Neg logit contributions
umbs-0.057
beh-0.056
igators-0.054
 supposed-0.053
asting-0.052
Token a
Token ID257
Feature activation+1.77e+00

Pos logit contributions
umbs+0.049
beh+0.047
lette+0.045
izza+0.044
oun+0.043
Neg logit contributions
 notice-0.029
 stage-0.028
 suddenly-0.027
 school-0.026
But-0.026
Token map
Token ID3975
Feature activation-1.25e+00

Pos logit contributions
 notice+0.065
Stop+0.062
But+0.060
 woke+0.059
 imagined+0.059
Neg logit contributions
 dollar-0.057
ocado-0.056
melon-0.054
lette-0.054
igators-0.053
Token of
Token ID286
Feature activation-8.79e-03

Pos logit contributions
beh+0.075
umbs+0.074
lette+0.071
izza+0.069
allows+0.069
Neg logit contributions
 stage-0.021
 notice-0.020
 woke-0.018
 suddenly-0.017
But-0.017
Token a
Token ID257
Feature activation+1.66e+00

Pos logit contributions
umbs+0.000
beh+0.000
lette+0.000
igators+0.000
izza+0.000
Neg logit contributions
 notice-0.000
But-0.000
 stage-0.000
 woke-0.000
Who-0.000
Token far
Token ID1290
Feature activation+1.09e+00

Pos logit contributions
 notice+0.058
Stop+0.054
But+0.053
 woke+0.053
Who+0.052
Neg logit contributions
 dollar-0.059
ocado-0.057
umbs-0.057
igators-0.056
lette-0.056
Tokenaway
Token ID8272
Feature activation+4.37e+00

Pos logit contributions
La+0.034
But+0.033
 notice+0.033
Who+0.032
 woke+0.032
Neg logit contributions
igators-0.042
umbs-0.041
 dollar-0.041
beh-0.041
melon-0.040
Token land
Token ID1956
Feature activation+7.00e-02

Pos logit contributions
Exc+0.161
 Ach+0.160
Stop+0.157
Sorry+0.154
Ba+0.154
Neg logit contributions
igators-0.131
 kinds-0.120
 tank-0.112
 sizes-0.112
ishes-0.108
Token.
Token ID13
Feature activation-5.58e-01

Pos logit contributions
 notice+0.002
 stage+0.001
But+0.001
 woke+0.001
Who+0.001
Neg logit contributions
umbs-0.004
lette-0.004
beh-0.004
igators-0.003
izza-0.003
Token
Token ID198
Feature activation+6.56e-01

Pos logit contributions
beh+0.027
umbs+0.027
lette+0.027
icking+0.025
igators+0.025
Neg logit contributions
 notice-0.015
But-0.014
 stage-0.014
 woke-0.013
Who-0.013
Token
Token ID198
Feature activation-6.08e-01

Pos logit contributions
 notice+0.019
 stage+0.019
 woke+0.018
 imagined+0.018
Who+0.018
Neg logit contributions
umbs-0.029
lette-0.026
izza-0.026
beh-0.026
o-0.026
Token"
Token ID1
Feature activation+4.09e-01

Pos logit contributions
umbs+0.032
beh+0.031
lette+0.031
izza+0.030
icking+0.029
Neg logit contributions
 notice-0.015
But-0.015
 stage-0.013
 suddenly-0.013
 woke-0.013
Token away
Token ID1497
Feature activation-2.05e-01

Pos logit contributions
 notice+0.003
 stage+0.003
But+0.003
 woke+0.003
Who+0.003
Neg logit contributions
umbs-0.004
beh-0.003
lette-0.003
izza-0.003
igators-0.003
Token.
Token ID13
Feature activation+3.18e-01

Pos logit contributions
umbs+0.010
beh+0.009
lette+0.009
izza+0.009
igators+0.009
Neg logit contributions
 notice-0.006
 stage-0.005
But-0.005
 woke-0.005
Who-0.005
Token She
Token ID1375
Feature activation+3.52e-03

Pos logit contributions
 notice+0.012
 stage+0.011
 woke+0.011
But+0.011
 imagined+0.011
Neg logit contributions
umbs-0.012
lette-0.011
beh-0.011
izza-0.011
igators-0.011
Token has
Token ID468
Feature activation+3.59e-01

Pos logit contributions
 notice+0.000
 stage+0.000
But+0.000
 woke+0.000
Who+0.000
Neg logit contributions
umbs-0.000
beh-0.000
lette-0.000
igators-0.000
izza-0.000
Token a
Token ID257
Feature activation+1.04e+00

Pos logit contributions
 notice+0.014
 stage+0.013
 woke+0.013
But+0.013
Who+0.013
Neg logit contributions
umbs-0.013
lette-0.012
beh-0.012
igators-0.011
izza-0.011
Token math
Token ID10688
Feature activation+4.35e+00

Pos logit contributions
 notice+0.042
 woke+0.041
 imagined+0.040
Stop+0.039
But+0.039
Neg logit contributions
 dollar-0.033
umbs-0.031
ocado-0.030
izza-0.030
lette-0.030
Token book
Token ID1492
Feature activation+1.32e+00

Pos logit contributions
 woke+0.165
icol+0.165
AAAA+0.160
Hi+0.158
Stop+0.158
Neg logit contributions
 dollar-0.143
 best-0.132
 dollars-0.128
 equal-0.121
 tast-0.120
Token with
Token ID351
Feature activation+2.81e-03

Pos logit contributions
 notice+0.045
Who+0.044
Stop+0.044
 woke+0.042
 stage+0.041
Neg logit contributions
umbs-0.052
lette-0.048
 dollar-0.048
beh-0.046
igators-0.046
Token many
Token ID867
Feature activation+6.41e-01

Pos logit contributions
 notice+0.000
 stage+0.000
 woke+0.000
But+0.000
Who+0.000
Neg logit contributions
umbs-0.000
beh-0.000
lette-0.000
izza-0.000
igators-0.000
Token pictures
Token ID5986
Feature activation+9.29e-02

Pos logit contributions
 notice+0.024
 woke+0.023
 stage+0.022
 suddenly+0.022
But+0.022
Neg logit contributions
umbs-0.024
beh-0.022
lette-0.022
 equal-0.021
igators-0.021
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
 notice+0.003
 stage+0.002
 woke+0.002
But+0.002
Who+0.002
Neg logit contributions
umbs-0.004
lette-0.004
beh-0.004
igators-0.004
izza-0.004
Token,
Token ID11
Feature activation+1.60e+00

Pos logit contributions
 notice+0.029
Who+0.027
 stage+0.027
 imagined+0.026
La+0.026
Neg logit contributions
umbs-0.048
o-0.046
izza-0.045
 dollar-0.045
itt-0.045
Token she
Token ID673
Feature activation+1.48e+00

Pos logit contributions
La+0.053
icol+0.052
 notice+0.052
 concert+0.051
 imagined+0.051
Neg logit contributions
melon-0.052
oun-0.049
o-0.049
allows-0.048
 dollar-0.048
Token went
Token ID1816
Feature activation+3.93e-01

Pos logit contributions
 stage+0.047
 avail+0.045
chool+0.041
 speaker+0.040
 notice+0.040
Neg logit contributions
o-0.062
ocado-0.059
lette-0.058
beh-0.057
ady-0.056
Token to
Token ID284
Feature activation+1.39e+00

Pos logit contributions
 notice+0.012
Who+0.011
 stage+0.011
But+0.011
 woke+0.011
Neg logit contributions
umbs-0.017
beh-0.017
lette-0.016
igators-0.016
icking-0.015
Token a
Token ID257
Feature activation+7.82e-01

Pos logit contributions
But+0.051
 ti+0.049
icol+0.049
lf+0.048
Stop+0.048
Neg logit contributions
igators-0.040
beh-0.039
o-0.038
umbs-0.038
 supposed-0.038
Token yoga
Token ID20351
Feature activation+4.34e+00

Pos logit contributions
 notice+0.027
 imagined+0.025
But+0.025
Stop+0.025
 woke+0.024
Neg logit contributions
umbs-0.027
 dollar-0.027
lette-0.027
beh-0.027
izza-0.027
Token class
Token ID1398
Feature activation+1.04e+00

Pos logit contributions
 notice+0.147
 inspection+0.147
 avail+0.143
Who+0.141
 widened+0.138
Neg logit contributions
 dollar-0.155
melon-0.143
 tank-0.132
ocado-0.132
itt-0.131
Token with
Token ID351
Feature activation+3.02e-01

Pos logit contributions
 notice+0.031
Who+0.029
Stop+0.029
La+0.028
 woke+0.028
Neg logit contributions
beh-0.044
umbs-0.044
lette-0.043
illing-0.043
 dollar-0.042
Token her
Token ID607
Feature activation+2.40e-01

Pos logit contributions
 notice+0.007
But+0.007
 woke+0.006
 stage+0.006
Who+0.006
Neg logit contributions
umbs-0.015
beh-0.015
lette-0.015
igators-0.014
izza-0.014
Token mom
Token ID1995
Feature activation-1.67e+00

Pos logit contributions
 notice+0.008
 woke+0.008
But+0.008
Who+0.008
 stage+0.007
Neg logit contributions
umbs-0.010
lette-0.009
beh-0.009
izza-0.009
igators-0.009
Token.
Token ID13
Feature activation+6.75e-01

Pos logit contributions
beh+0.079
umbs+0.077
igators+0.075
izza+0.073
oun+0.071
Neg logit contributions
 woke-0.049
 heard-0.049
 notice-0.049
But-0.048
 saw-0.047
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
La+0.047
 notice+0.045
 stage+0.044
Stop+0.042
Ba+0.042
Neg logit contributions
beh-0.038
lette-0.037
 supposed-0.036
o-0.036
izza-0.035
Token travel
Token ID3067
Feature activation+1.65e+00

Pos logit contributions
umbs+0.052
beh+0.049
lette+0.049
ocado+0.048
izza+0.047
Neg logit contributions
 notice-0.027
 stage-0.025
 woke-0.025
 dress-0.024
But-0.024
Token to
Token ID284
Feature activation+8.06e-01

Pos logit contributions
icol+0.069
Who+0.069
AAAA+0.068
 stage+0.068
Raw+0.067
Neg logit contributions
o-0.047
beh-0.046
asting-0.043
igators-0.042
 dollar-0.041
Token a
Token ID257
Feature activation+1.05e+00

Pos logit contributions
But+0.026
 notice+0.026
 woke+0.026
 suddenly+0.025
 imagined+0.025
Neg logit contributions
beh-0.028
umbs-0.028
lette-0.027
igators-0.027
izza-0.027
Token far
Token ID1290
Feature activation+1.48e+00

Pos logit contributions
 notice+0.039
 imagined+0.037
 woke+0.037
Stop+0.037
But+0.036
Neg logit contributions
 dollar-0.033
umbs-0.032
ocado-0.032
lette-0.032
izza-0.032
Tokenaway
Token ID8272
Feature activation+4.33e+00

Pos logit contributions
La+0.035
Who+0.035
Stop+0.034
 woke+0.034
 notice+0.033
Neg logit contributions
igators-0.066
 dollar-0.066
ole-0.064
igator-0.063
beh-0.063
Token place
Token ID1295
Feature activation+1.45e-01

Pos logit contributions
 Ach+0.179
Stop+0.178
 blush+0.177
ged+0.171
La+0.169
Neg logit contributions
igators-0.111
 tank-0.098
 ours-0.096
 dollar-0.095
 strongest-0.092
Token and
Token ID290
Feature activation+1.54e+00

Pos logit contributions
 notice+0.003
 stage+0.003
But+0.003
 woke+0.003
Who+0.003
Neg logit contributions
umbs-0.007
beh-0.007
lette-0.007
igators-0.007
izza-0.007
Token brought
Token ID3181
Feature activation-1.15e+00

Pos logit contributions
La+0.052
Stop+0.050
 stage+0.048
ara+0.048
 notice+0.047
Neg logit contributions
beh-0.057
igators-0.055
 tast-0.052
 dollar-0.052
lette-0.052
Token Spot
Token ID15899
Feature activation-2.88e+00

Pos logit contributions
umbs+0.059
lette+0.056
beh+0.055
ocado+0.054
illing+0.054
Neg logit contributions
 notice-0.028
 woke-0.027
 stage-0.026
But-0.025
 heard-0.025
Token along
Token ID1863
Feature activation+5.89e-01

Pos logit contributions
umbs+0.174
itt+0.166
omp+0.165
lette+0.164
elling+0.161
Neg logit contributions
 again-0.047
 dress-0.044
 woke-0.042
 heard-0.041
 school-0.041
Token.
Token ID13
Feature activation-6.85e-01

Pos logit contributions
umbs+0.024
beh+0.023
lette+0.023
izza+0.022
igators+0.022
Neg logit contributions
 notice-0.008
 stage-0.007
But-0.007
 woke-0.007
 suddenly-0.007
Token Lily
Token ID20037
Feature activation+1.51e-01

Pos logit contributions
umbs+0.030
beh+0.030
lette+0.030
icking+0.028
izza+0.028
Neg logit contributions
 notice-0.021
But-0.020
 stage-0.020
 woke-0.019
Who-0.019
Token has
Token ID468
Feature activation+6.04e-01

Pos logit contributions
 notice+0.005
 stage+0.004
But+0.004
Who+0.004
 woke+0.004
Neg logit contributions
umbs-0.007
beh-0.006
lette-0.006
igators-0.006
izza-0.006
Token a
Token ID257
Feature activation+1.70e+00

Pos logit contributions
 notice+0.022
 stage+0.021
Who+0.021
But+0.020
 woke+0.020
Neg logit contributions
umbs-0.023
lette-0.021
beh-0.021
igators-0.020
ocado-0.020
Token pink
Token ID11398
Feature activation+3.40e+00

Pos logit contributions
Who+0.073
 notice+0.072
Stop+0.071
 woke+0.070
Exc+0.069
Neg logit contributions
 dollar-0.046
ocado-0.042
 best-0.041
 equal-0.041
umbs-0.040
Token yoga
Token ID20351
Feature activation+4.31e+00

Pos logit contributions
 notice+0.127
Stop+0.125
Who+0.124
Exc+0.122
 inspection+0.117
Neg logit contributions
ish-0.110
 brothers-0.109
 dollar-0.108
 best-0.107
une-0.105
Token mat
Token ID2603
Feature activation+8.00e-01

Pos logit contributions
 notice+0.137
Who+0.133
 avail+0.129
Stop+0.127
Exc+0.125
Neg logit contributions
 dollar-0.167
 best-0.158
ery-0.151
ocado-0.149
melon-0.149
Token.
Token ID13
Feature activation+7.98e-02

Pos logit contributions
 notice+0.023
Who+0.021
 stage+0.020
Stop+0.020
But+0.020
Neg logit contributions
umbs-0.037
lette-0.036
beh-0.035
igators-0.034
 dollar-0.034
Token She
Token ID1375
Feature activation+6.90e-01

Pos logit contributions
 notice+0.002
 stage+0.002
But+0.002
 woke+0.002
 imagined+0.002
Neg logit contributions
umbs-0.004
beh-0.003
lette-0.003
izza-0.003
igators-0.003
Token puts
Token ID7584
Feature activation+7.48e-01

Pos logit contributions
 notice+0.024
 stage+0.022
But+0.022
Who+0.022
 suddenly+0.021
Neg logit contributions
umbs-0.026
beh-0.025
igators-0.025
lette-0.025
ocado-0.024
Token it
Token ID340
Feature activation+2.10e-01

Pos logit contributions
 notice+0.023
 woke+0.021
Who+0.021
But+0.020
 stage+0.020
Neg logit contributions
umbs-0.032
lette-0.030
beh-0.030
igators-0.029
allows-0.029
Token.
Token ID13
Feature activation-5.81e-01

Pos logit contributions
umbs+0.021
beh+0.021
lette+0.021
izza+0.020
igators+0.020
Neg logit contributions
 notice-0.009
 stage-0.008
But-0.008
 woke-0.008
 suddenly-0.008
Token Can
Token ID1680
Feature activation+4.90e-01

Pos logit contributions
umbs+0.024
beh+0.023
lette+0.023
igators+0.022
icking+0.022
Neg logit contributions
 notice-0.019
But-0.019
 stage-0.018
 woke-0.018
Who-0.018
Token we
Token ID356
Feature activation+1.53e-01

Pos logit contributions
 notice+0.017
 stage+0.015
 imagined+0.015
 woke+0.015
La+0.015
Neg logit contributions
umbs-0.019
beh-0.019
lette-0.019
izza-0.018
igators-0.018
Token play
Token ID711
Feature activation+2.54e+00

Pos logit contributions
 notice+0.004
 stage+0.004
 woke+0.004
But+0.004
Who+0.004
Neg logit contributions
umbs-0.007
lette-0.007
beh-0.007
izza-0.006
igators-0.006
Token together
Token ID1978
Feature activation+3.17e+00

Pos logit contributions
icol+0.106
 notice+0.103
 inspection+0.100
yt+0.100
 avail+0.098
Neg logit contributions
 equally-0.077
 dollar-0.074
 equal-0.071
lette-0.069
o-0.069
Token again
Token ID757
Feature activation+4.23e+00

Pos logit contributions
icol+0.112
Who+0.096
Raw+0.095
AAAA+0.093
Stop+0.092
Neg logit contributions
o-0.132
icking-0.126
 dollar-0.124
ac-0.121
ipp-0.120
Token?"
Token ID1701
Feature activation+1.46e+00

Pos logit contributions
icol+0.166
 robe+0.159
atively+0.158
Stop+0.158
 speaker+0.152
Neg logit contributions
o-0.134
 brothers-0.122
 dollar-0.120
 best-0.117
icking-0.114
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
lf+0.061
 imagined+0.061
 stage+0.060
 woke+0.058
 concert+0.057
Neg logit contributions
umbs-0.047
o-0.040
izza-0.039
itt-0.039
imes-0.038
Token
Token ID198
Feature activation+4.57e-01

Pos logit contributions
 notice+0.036
 imagined+0.036
 stage+0.035
chool+0.033
 woke+0.033
Neg logit contributions
umbs-0.039
izza-0.035
o-0.035
itt-0.034
ocado-0.033
TokenBen
Token ID11696
Feature activation+4.72e-01

Pos logit contributions
 stage+0.008
 notice+0.007
 woke+0.007
 imagined+0.006
chool+0.006
Neg logit contributions
umbs-0.026
ocado-0.025
lette-0.025
beh-0.025
izza-0.024
Token smiles
Token ID21845
Feature activation+1.06e+00

Pos logit contributions
 notice+0.015
 stage+0.014
But+0.014
Who+0.013
 suddenly+0.013
Neg logit contributions
umbs-0.020
beh-0.019
lette-0.019
izza-0.019
igators-0.018
Token were
Token ID547
Feature activation-1.65e+00

Pos logit contributions
umbs+0.009
beh+0.009
lette+0.009
izza+0.008
igators+0.008
Neg logit contributions
 notice-0.008
 stage-0.007
But-0.007
 woke-0.007
Who-0.007
Token going
Token ID1016
Feature activation-2.80e-01

Pos logit contributions
umbs+0.076
beh+0.069
izza+0.069
lette+0.068
oun+0.067
Neg logit contributions
 notice-0.053
 stage-0.052
 school-0.050
 suddenly-0.050
But-0.049
Token to
Token ID284
Feature activation+8.28e-01

Pos logit contributions
umbs+0.014
beh+0.014
lette+0.013
izza+0.013
igators+0.013
Neg logit contributions
 notice-0.007
 stage-0.007
But-0.007
 woke-0.006
 suddenly-0.006
Token a
Token ID257
Feature activation+8.10e-01

Pos logit contributions
 woke+0.025
 notice+0.025
But+0.025
Stop+0.024
lf+0.024
Neg logit contributions
umbs-0.032
igators-0.032
beh-0.032
lette-0.031
 equal-0.030
Token far
Token ID1290
Feature activation+1.76e+00

Pos logit contributions
 notice+0.027
 woke+0.026
Stop+0.026
 imagined+0.026
Who+0.025
Neg logit contributions
umbs-0.028
lette-0.028
igators-0.027
ocado-0.027
izza-0.027
Tokenaway
Token ID8272
Feature activation+4.23e+00

Pos logit contributions
La+0.049
 woke+0.046
Who+0.046
 ti+0.045
Stop+0.045
Neg logit contributions
igators-0.073
ole-0.072
 dollar-0.071
igator-0.069
melon-0.069
Token land
Token ID1956
Feature activation+6.35e-01

Pos logit contributions
 Ach+0.178
Stop+0.175
 blush+0.171
Ba+0.171
Exc+0.165
Neg logit contributions
igators-0.113
 tank-0.099
 kinds-0.095
ishes-0.093
neys-0.091
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
 notice+0.016
Who+0.015
 woke+0.015
 stage+0.015
Stop+0.015
Neg logit contributions
umbs-0.031
lette-0.030
beh-0.029
igators-0.029
icking-0.028
Token
Token ID198
Feature activation+5.79e-01

Pos logit contributions
 notice+0.009
 stage+0.009
 woke+0.009
 imagined+0.008
But+0.008
Neg logit contributions
umbs-0.011
lette-0.010
izza-0.010
beh-0.010
ocado-0.010
Token
Token ID198
Feature activation+3.40e-01

Pos logit contributions
 notice+0.018
 stage+0.017
Who+0.016
Stop+0.016
 imagined+0.016
Neg logit contributions
umbs-0.025
izza-0.023
lette-0.023
o-0.022
ocado-0.022
Token"
Token ID1
Feature activation+6.92e-01

Pos logit contributions
 notice+0.011
 stage+0.010
 woke+0.010
Who+0.009
 imagined+0.009
Neg logit contributions
umbs-0.014
lette-0.014
ocado-0.014
izza-0.014
beh-0.014
Token.
Token ID13
Feature activation-5.64e-01

Pos logit contributions
umbs+0.006
beh+0.006
lette+0.006
igators+0.006
izza+0.006
Neg logit contributions
 notice-0.003
 stage-0.003
But-0.002
 woke-0.002
Who-0.002
Token Can
Token ID1680
Feature activation+1.15e-01

Pos logit contributions
umbs+0.024
beh+0.023
lette+0.022
icking+0.021
izza+0.021
Neg logit contributions
 notice-0.018
But-0.018
 stage-0.017
 woke-0.017
Who-0.017
Token we
Token ID356
Feature activation+1.16e+00

Pos logit contributions
 notice+0.003
 stage+0.003
 woke+0.003
But+0.003
 imagined+0.003
Neg logit contributions
umbs-0.005
beh-0.005
lette-0.005
izza-0.005
igators-0.005
Token play
Token ID711
Feature activation+3.11e+00

Pos logit contributions
 stage+0.034
Who+0.034
 notice+0.034
La+0.033
 woke+0.033
Neg logit contributions
lette-0.046
umbs-0.046
beh-0.045
melon-0.044
izza-0.043
Token together
Token ID1978
Feature activation+3.28e+00

Pos logit contributions
icol+0.137
 inspection+0.131
 notice+0.129
 avail+0.129
yt+0.127
Neg logit contributions
 equally-0.093
o-0.083
 equal-0.080
 dollar-0.076
ery-0.076
Token again
Token ID757
Feature activation+4.23e+00

Pos logit contributions
icol+0.121
Who+0.102
AAAA+0.100
Raw+0.096
Stop+0.094
Neg logit contributions
o-0.135
 dollar-0.124
ac-0.122
icking-0.121
umbs-0.121
Token?"
Token ID1701
Feature activation+1.06e+00

Pos logit contributions
icol+0.175
 robe+0.164
Stop+0.163
atively+0.158
 bumped+0.154
Neg logit contributions
o-0.136
 brothers-0.113
 dollar-0.109
 best-0.109
ac-0.106
Token he
Token ID339
Feature activation+1.49e+00

Pos logit contributions
 notice+0.035
 stage+0.035
 woke+0.034
 imagined+0.034
 concert+0.033
Neg logit contributions
umbs-0.039
o-0.038
izza-0.037
beh-0.037
lette-0.037
Token said
Token ID531
Feature activation-1.37e-02

Pos logit contributions
 notice+0.053
 recess+0.052
 stage+0.051
icol+0.050
 seek+0.049
Neg logit contributions
o-0.049
igators-0.049
beh-0.048
 dollar-0.047
 brothers-0.047
Token.
Token ID13
Feature activation+1.21e+00

Pos logit contributions
umbs+0.001
beh+0.001
lette+0.001
izza+0.001
igators+0.001
Neg logit contributions
 notice-0.000
 stage-0.000
But-0.000
 woke-0.000
Who-0.000
Token
Token ID198
Feature activation+7.51e-01

Pos logit contributions
 notice+0.044
 stage+0.043
 imagined+0.042
 concert+0.041
 woke+0.040
Neg logit contributions
umbs-0.047
o-0.043
izza-0.042
oun-0.041
 dollar-0.040
Token.
Token ID13
Feature activation+3.53e-02

Pos logit contributions
 notice+0.010
 stage+0.010
 woke+0.009
Who+0.009
But+0.009
Neg logit contributions
umbs-0.016
lette-0.016
beh-0.016
igators-0.015
 dollar-0.015
Token Can
Token ID1680
Feature activation+3.94e-01

Pos logit contributions
 notice+0.001
 stage+0.001
 woke+0.001
But+0.001
 suddenly+0.001
Neg logit contributions
umbs-0.002
beh-0.002
lette-0.002
izza-0.001
igators-0.001
Token we
Token ID356
Feature activation+6.91e-01

Pos logit contributions
 notice+0.013
 stage+0.012
 imagined+0.012
 woke+0.011
La+0.011
Neg logit contributions
umbs-0.016
beh-0.016
lette-0.016
izza-0.015
igators-0.015
Token play
Token ID711
Feature activation+2.79e+00

Pos logit contributions
 stage+0.020
 notice+0.020
Who+0.020
 woke+0.019
La+0.019
Neg logit contributions
umbs-0.029
lette-0.028
beh-0.027
icking-0.027
melon-0.027
Token together
Token ID1978
Feature activation+3.30e+00

Pos logit contributions
icol+0.131
 inspection+0.129
 notice+0.125
yt+0.125
 avail+0.123
Neg logit contributions
 equally-0.071
 equal-0.066
ery-0.064
o-0.062
ocado-0.062
Token again
Token ID757
Feature activation+4.22e+00

Pos logit contributions
icol+0.115
Who+0.096
 inspection+0.096
Raw+0.093
 avail+0.091
Neg logit contributions
o-0.136
icking-0.132
 dollar-0.129
ac-0.128
asp-0.124
Token?"
Token ID1701
Feature activation+1.29e+00

Pos logit contributions
icol+0.158
 robe+0.156
atively+0.151
 inspection+0.145
Stop+0.145
Neg logit contributions
o-0.137
 brothers-0.127
 dollar-0.126
ac-0.123
icking-0.120
Token
Token ID198
Feature activation+1.14e+00

Pos logit contributions
 stage+0.052
 concert+0.049
 imagined+0.049
 notice+0.048
lf+0.048
Neg logit contributions
umbs-0.044
o-0.043
izza-0.038
oops-0.037
 dollar-0.037
Token
Token ID198
Feature activation+4.81e-01

Pos logit contributions
 stage+0.042
 notice+0.041
 imagined+0.039
 concert+0.038
 avail+0.038
Neg logit contributions
umbs-0.042
o-0.042
izza-0.039
beh-0.038
oun-0.036
TokenL
Token ID43
Feature activation+5.64e-01

Pos logit contributions
 stage+0.010
 notice+0.009
 woke+0.008
 imagined+0.008
 concert+0.007
Neg logit contributions
umbs-0.026
beh-0.026
lette-0.025
o-0.025
izza-0.024
Tokenila
Token ID10102
Feature activation+7.38e-01

Pos logit contributions
 notice+0.019
 stage+0.019
 suddenly+0.018
 woke+0.017
But+0.017
Neg logit contributions
umbs-0.024
beh-0.023
lette-0.022
oops-0.021
icking-0.021
Token for
Token ID329
Feature activation-5.42e-01

Pos logit contributions
beh+0.048
umbs+0.048
lette+0.045
izza+0.044
known+0.044
Neg logit contributions
 notice-0.022
But-0.021
 stage-0.021
 woke-0.021
 suddenly-0.020
Token you
Token ID345
Feature activation-4.06e-01

Pos logit contributions
umbs+0.026
beh+0.025
lette+0.025
izza+0.024
ocado+0.024
Neg logit contributions
 notice-0.015
 stage-0.014
But-0.014
 woke-0.013
 suddenly-0.013
Token."
Token ID526
Feature activation-1.31e+00

Pos logit contributions
umbs+0.022
beh+0.021
lette+0.021
izza+0.020
igators+0.020
Neg logit contributions
 notice-0.009
 stage-0.008
But-0.008
 woke-0.008
 suddenly-0.008
Token Bill
Token ID3941
Feature activation-1.75e+00

Pos logit contributions
beh+0.069
umbs+0.067
igators+0.066
lette+0.066
ocado+0.064
Neg logit contributions
But-0.029
 notice-0.028
 woke-0.027
Who-0.026
 suddenly-0.026
Token gave
Token ID2921
Feature activation-1.13e+00

Pos logit contributions
umbs+0.100
beh+0.100
lette+0.094
igators+0.092
 supposed+0.091
Neg logit contributions
 woke-0.037
 suddenly-0.033
 heard-0.032
But-0.031
 imagined-0.031
Token Bob
Token ID5811
Feature activation-5.27e+00

Pos logit contributions
umbs+0.055
beh+0.052
ocado+0.049
izza+0.049
igators+0.049
Neg logit contributions
 notice-0.033
 suddenly-0.032
But-0.031
 heard-0.031
 woke-0.030
Token the
Token ID262
Feature activation-1.81e+00

Pos logit contributions
beh+0.374
ini+0.368
rol+0.362
anes+0.355
igators+0.354
Neg logit contributions
 again-0.055
 heard-0.048
 saw-0.043
 her-0.041
,-0.038
Token bread
Token ID8509
Feature activation-3.42e+00

Pos logit contributions
beh+0.083
umbs+0.072
rol+0.067
igators+0.067
illing+0.066
Neg logit contributions
 notice-0.065
 stage-0.064
 dress-0.063
 heard-0.062
But-0.061
Token,
Token ID11
Feature activation-1.50e+00

Pos logit contributions
beh+0.245
known+0.219
igators+0.218
 supposed+0.216
oun+0.216
Neg logit contributions
 again-0.034
,-0.034
 and-0.027
 dress-0.025
 heard-0.024
Token and
Token ID290
Feature activation-1.01e+00

Pos logit contributions
beh+0.081
umbs+0.078
lette+0.074
igators+0.073
oun+0.073
Neg logit contributions
 notice-0.037
But-0.036
 woke-0.036
 suddenly-0.034
 heard-0.034
Token Bob
Token ID5811
Feature activation-2.62e+00

Pos logit contributions
umbs+0.052
beh+0.049
lette+0.048
izza+0.047
igators+0.047
Neg logit contributions
 notice-0.025
 suddenly-0.024
 woke-0.024
But-0.023
 stage-0.023
Tokenily
Token ID813
Feature activation+1.60e-02

Pos logit contributions
 notice+0.009
 stage+0.008
But+0.008
 woke+0.008
 suddenly+0.008
Neg logit contributions
umbs-0.008
beh-0.008
lette-0.008
izza-0.008
ocado-0.007
Token and
Token ID290
Feature activation-3.68e-01

Pos logit contributions
 notice+0.000
 stage+0.000
But+0.000
Who+0.000
 woke+0.000
Neg logit contributions
umbs-0.001
beh-0.001
lette-0.001
izza-0.001
igators-0.001
Token Ben
Token ID3932
Feature activation-4.17e-01

Pos logit contributions
umbs+0.020
beh+0.020
lette+0.019
izza+0.019
igators+0.018
Neg logit contributions
 notice-0.008
 stage-0.007
 woke-0.007
But-0.007
Who-0.007
Token gave
Token ID2921
Feature activation-1.39e+00

Pos logit contributions
umbs+0.021
beh+0.021
lette+0.020
igators+0.020
izza+0.020
Neg logit contributions
 notice-0.010
 woke-0.009
 stage-0.009
But-0.009
 suddenly-0.009
Token the
Token ID262
Feature activation-6.28e-01

Pos logit contributions
umbs+0.085
lette+0.081
beh+0.081
illing+0.080
izza+0.077
Neg logit contributions
 stage-0.024
 notice-0.023
 again-0.023
 dress-0.022
 school-0.021
Token man
Token ID582
Feature activation-5.19e+00

Pos logit contributions
umbs+0.036
beh+0.035
lette+0.033
igators+0.033
rol+0.031
Neg logit contributions
 notice-0.013
 stage-0.012
But-0.012
 woke-0.012
Who-0.011
Token some
Token ID617
Feature activation-1.46e+00

Pos logit contributions
umbs+0.338
 supposed+0.333
rol+0.331
istent+0.326
ini+0.326
Neg logit contributions
 again-0.065
 her-0.065
 dress-0.059
 heard-0.058
,-0.056
Token bread
Token ID8509
Feature activation-2.77e+00

Pos logit contributions
beh+0.066
umbs+0.066
lette+0.062
oun+0.058
rol+0.057
Neg logit contributions
 notice-0.046
 stage-0.045
But-0.043
 dress-0.043
 school-0.042
Token.
Token ID13
Feature activation-4.31e-01

Pos logit contributions
beh+0.202
known+0.188
 highlight+0.184
 supposed+0.183
umbs+0.181
Neg logit contributions
 dress-0.016
 again-0.016
,-0.014
 and-0.009
 heard-0.008
Token They
Token ID1119
Feature activation-9.29e-01

Pos logit contributions
umbs+0.016
beh+0.016
lette+0.015
igators+0.014
izza+0.014
Neg logit contributions
 notice-0.017
 stage-0.016
But-0.016
 woke-0.015
Who-0.015
Token taped
Token ID36114
Feature activation-2.17e+00

Pos logit contributions
umbs+0.047
beh+0.046
lette+0.046
izza+0.043
igators+0.043
Neg logit contributions
 notice-0.025
 woke-0.023
 imagined-0.022
 suddenly-0.022
But-0.022
Token agreed
Token ID4987
Feature activation+3.19e-01

Pos logit contributions
umbs+0.095
beh+0.091
igators+0.089
lette+0.086
izza+0.085
Neg logit contributions
 woke-0.028
 heard-0.027
 suddenly-0.026
 saw-0.025
But-0.023
Token and
Token ID290
Feature activation-8.85e-01

Pos logit contributions
 notice+0.007
 stage+0.007
Who+0.007
La+0.007
Stop+0.007
Neg logit contributions
umbs-0.016
lette-0.016
beh-0.015
izza-0.015
icking-0.015
Token they
Token ID484
Feature activation-1.29e+00

Pos logit contributions
umbs+0.048
lette+0.045
beh+0.044
ady+0.043
igators+0.043
Neg logit contributions
 notice-0.021
 woke-0.020
 suddenly-0.020
 stage-0.019
But-0.019
Token gave
Token ID2921
Feature activation-9.03e-01

Pos logit contributions
umbs+0.069
beh+0.066
lette+0.065
illing+0.062
igators+0.062
Neg logit contributions
 woke-0.032
 notice-0.031
 suddenly-0.029
 heard-0.029
 imagined-0.028
Token the
Token ID262
Feature activation-4.41e-01

Pos logit contributions
umbs+0.052
beh+0.049
lette+0.049
illing+0.047
izza+0.047
Neg logit contributions
 notice-0.018
 woke-0.016
 suddenly-0.016
 stage-0.015
But-0.015
Token man
Token ID582
Feature activation-5.17e+00

Pos logit contributions
umbs+0.028
beh+0.027
lette+0.026
igators+0.026
izza+0.025
Neg logit contributions
 notice-0.006
 woke-0.005
 stage-0.005
But-0.005
Who-0.005
Token some
Token ID617
Feature activation-1.81e+00

Pos logit contributions
omp+0.333
umbs+0.330
istent+0.322
ini+0.315
umb+0.315
Neg logit contributions
 again-0.067
 her-0.060
 heard-0.058
 saw-0.049
 woke-0.048
Token of
Token ID286
Feature activation-1.08e+00

Pos logit contributions
beh+0.070
umbs+0.068
lette+0.063
oun+0.059
icking+0.059
Neg logit contributions
 notice-0.069
 stage-0.066
 woke-0.064
 heard-0.064
 dress-0.064
Token their
Token ID511
Feature activation-1.53e+00

Pos logit contributions
umbs+0.061
beh+0.060
lette+0.058
ocado+0.057
izza+0.056
Neg logit contributions
 notice-0.023
 stage-0.022
 school-0.020
But-0.019
Who-0.019
Token breakfast
Token ID12607
Feature activation-9.64e-01

Pos logit contributions
umbs+0.063
beh+0.060
lette+0.054
umb+0.054
rol+0.054
Neg logit contributions
 notice-0.058
 stage-0.055
 saw-0.054
 woke-0.054
 heard-0.054
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
beh+0.056
umbs+0.056
lette+0.054
igators+0.054
ocado+0.052
Neg logit contributions
 notice-0.015
 stage-0.014
 woke-0.014
But-0.014
 suddenly-0.013
Token!
Token ID0
Feature activation-1.95e+00

Pos logit contributions
beh+0.190
umbs+0.189
lette+0.178
wy+0.177
ocado+0.174
Neg logit contributions
 again-0.034
 stage-0.026
,-0.025
 suddenly-0.025
 school-0.021
Token The
Token ID383
Feature activation-1.97e+00

Pos logit contributions
beh+0.098
lette+0.093
icking+0.091
wy+0.091
gary+0.090
Neg logit contributions
But-0.052
 But-0.048
 notice-0.047
 suddenly-0.046
 woke-0.045
Token old
Token ID1468
Feature activation-1.15e+00

Pos logit contributions
umbs+0.106
beh+0.101
rol+0.092
lette+0.091
ipp+0.088
Neg logit contributions
 stage-0.059
 school-0.058
 saw-0.057
 heard-0.054
 notice-0.053
Token elephant
Token ID20950
Feature activation-1.62e+00

Pos logit contributions
umbs+0.050
beh+0.046
lette+0.043
izza+0.041
ocado+0.041
Neg logit contributions
 stage-0.044
 woke-0.043
 notice-0.042
 school-0.041
 heard-0.041
Token gave
Token ID2921
Feature activation-1.99e+00

Pos logit contributions
umbs+0.094
beh+0.093
lette+0.088
izza+0.086
igators+0.085
Neg logit contributions
 woke-0.031
 suddenly-0.030
 notice-0.029
 heard-0.029
 imagined-0.027
Token Bob
Token ID5811
Feature activation-5.10e+00

Pos logit contributions
umbs+0.115
beh+0.111
illing+0.109
izza+0.107
ocado+0.103
Neg logit contributions
 suddenly-0.040
 heard-0.039
 notice-0.038
 again-0.038
But-0.036
Token a
Token ID257
Feature activation-1.55e+00

Pos logit contributions
beh+0.359
umbs+0.336
oun+0.332
oke+0.330
rol+0.328
Neg logit contributions
 and-0.051
 again-0.047
 heard-0.039
,-0.038
 saw-0.029
Token hug
Token ID16225
Feature activation-4.89e-01

Pos logit contributions
beh+0.078
umbs+0.074
lette+0.070
igators+0.068
icking+0.068
Neg logit contributions
 notice-0.045
 woke-0.044
 stage-0.044
 heard-0.041
 dress-0.040
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
umbs+0.028
beh+0.028
lette+0.027
igators+0.026
izza+0.026
Neg logit contributions
 notice-0.009
 stage-0.008
 woke-0.008
But-0.008
 suddenly-0.007
Token Bob
Token ID5811
Feature activation-1.80e+00

Pos logit contributions
beh+0.077
lette+0.071
umbs+0.071
icking+0.069
known+0.068
Neg logit contributions
But-0.048
 notice-0.044
 stage-0.044
 suddenly-0.043
Who-0.043
Token was
Token ID373
Feature activation-1.88e+00

Pos logit contributions
umbs+0.111
beh+0.110
lette+0.104
igators+0.103
izza+0.102
Neg logit contributions
 woke-0.029
 suddenly-0.026
 heard-0.025
 imagined-0.024
But-0.024
Token,
Token ID11
Feature activation-2.84e-01

Pos logit contributions
 notice+0.004
 stage+0.004
Who+0.004
La+0.004
But+0.004
Neg logit contributions
umbs-0.011
lette-0.011
beh-0.011
izza-0.010
icking-0.010
Token and
Token ID290
Feature activation-3.46e-01

Pos logit contributions
umbs+0.016
beh+0.015
lette+0.015
izza+0.015
igators+0.015
Neg logit contributions
 notice-0.006
 stage-0.005
But-0.005
 woke-0.005
Who-0.005
Token they
Token ID484
Feature activation-1.33e+00

Pos logit contributions
umbs+0.018
beh+0.017
lette+0.017
izza+0.016
igators+0.016
Neg logit contributions
 notice-0.008
 stage-0.008
But-0.008
 woke-0.007
 suddenly-0.007
Token gave
Token ID2921
Feature activation-1.14e+00

Pos logit contributions
umbs+0.075
beh+0.073
lette+0.071
izza+0.069
igators+0.069
Neg logit contributions
 woke-0.031
 notice-0.028
 suddenly-0.027
 heard-0.027
 imagined-0.026
Token the
Token ID262
Feature activation-5.92e-01

Pos logit contributions
umbs+0.064
beh+0.061
lette+0.061
illing+0.060
igators+0.058
Neg logit contributions
 notice-0.024
 woke-0.023
 suddenly-0.022
 heard-0.021
 again-0.021
Token man
Token ID582
Feature activation-5.07e+00

Pos logit contributions
umbs+0.030
beh+0.029
lette+0.028
igators+0.027
izza+0.026
Neg logit contributions
 notice-0.016
 woke-0.015
 stage-0.015
But-0.014
Who-0.014
Token some
Token ID617
Feature activation-1.74e+00

Pos logit contributions
omp+0.319
umbs+0.317
istent+0.310
illing+0.303
behavior+0.300
Neg logit contributions
 again-0.061
 her-0.059
 heard-0.057
 woke-0.052
 dress-0.052
Token food
Token ID2057
Feature activation-1.76e+00

Pos logit contributions
beh+0.066
umbs+0.064
lette+0.059
oun+0.057
icking+0.055
Neg logit contributions
 notice-0.066
 stage-0.064
 woke-0.063
 heard-0.062
 dress-0.062
Token and
Token ID290
Feature activation-3.77e-01

Pos logit contributions
beh+0.106
umbs+0.102
lette+0.099
known+0.097
igators+0.096
Neg logit contributions
 notice-0.025
 woke-0.025
 stage-0.023
 again-0.023
But-0.022
Token water
Token ID1660
Feature activation-1.48e+00

Pos logit contributions
umbs+0.017
beh+0.017
lette+0.016
izza+0.016
igators+0.016
Neg logit contributions
 notice-0.011
 stage-0.011
 woke-0.010
But-0.010
 suddenly-0.010
Token.
Token ID13
Feature activation-5.43e-01

Pos logit contributions
beh+0.085
umbs+0.084
lette+0.082
ocado+0.079
izza+0.079
Neg logit contributions
 woke-0.025
 notice-0.025
 stage-0.023
 suddenly-0.022
 heard-0.021
Token she
Token ID673
Feature activation-9.06e-02

Pos logit contributions
umbs+0.068
beh+0.064
lette+0.063
izza+0.062
icking+0.060
Neg logit contributions
 woke-0.018
But-0.018
 notice-0.018
 suddenly-0.017
 stage-0.017
Token loved
Token ID6151
Feature activation-3.48e-01

Pos logit contributions
umbs+0.004
beh+0.004
lette+0.004
izza+0.004
igators+0.003
Neg logit contributions
 notice-0.003
 stage-0.003
But-0.003
 woke-0.003
Who-0.003
Token to
Token ID284
Feature activation+7.08e-01

Pos logit contributions
umbs+0.017
beh+0.017
lette+0.017
izza+0.016
igators+0.016
Neg logit contributions
 notice-0.009
 stage-0.008
But-0.008
 woke-0.008
Who-0.008
Token give
Token ID1577
Feature activation-1.77e+00

Pos logit contributions
But+0.020
 notice+0.019
Stop+0.019
Who+0.019
 suddenly+0.018
Neg logit contributions
umbs-0.031
lette-0.030
beh-0.030
izza-0.029
igators-0.029
Token the
Token ID262
Feature activation-1.50e+00

Pos logit contributions
umbs+0.103
igators+0.094
lette+0.093
umb+0.093
icking+0.093
Neg logit contributions
 notice-0.034
 school-0.031
 dress-0.030
 woke-0.030
 stage-0.030
Token boy
Token ID2933
Feature activation-5.04e+00

Pos logit contributions
beh+0.077
umbs+0.077
lette+0.070
ady+0.067
rol+0.067
Neg logit contributions
 stage-0.042
 school-0.041
 dress-0.040
 woke-0.039
 notice-0.038
Token lots
Token ID6041
Feature activation-2.43e+00

Pos logit contributions
umbs+0.328
icking+0.309
omp+0.304
oun+0.299
ini+0.299
Neg logit contributions
 her-0.062
 school-0.055
 heard-0.055
 dress-0.055
 woke-0.051
Token of
Token ID286
Feature activation-1.61e+00

Pos logit contributions
lette+0.096
beh+0.093
icking+0.090
umbs+0.088
bi+0.086
Neg logit contributions
 again-0.084
 stage-0.081
 woke-0.080
But-0.079
 notice-0.079
Token kisses
Token ID36544
Feature activation-6.11e-01

Pos logit contributions
umbs+0.073
lette+0.069
beh+0.067
icking+0.067
ocado+0.064
Neg logit contributions
 stage-0.052
 notice-0.052
 school-0.050
 dress-0.049
 woke-0.048
Token.
Token ID13
Feature activation-8.88e-01

Pos logit contributions
umbs+0.035
beh+0.034
lette+0.034
igators+0.033
izza+0.033
Neg logit contributions
 notice-0.011
 stage-0.010
 woke-0.010
But-0.010
 suddenly-0.009
Token
Token ID198
Feature activation+9.28e-01

Pos logit contributions
beh+0.049
umbs+0.048
lette+0.047
icking+0.045
ocado+0.045
Neg logit contributions
 notice-0.019
But-0.018
 woke-0.017
 stage-0.017
Who-0.016
Token The
Token ID383
Feature activation-9.45e-01

Pos logit contributions
beh+0.047
umbs+0.044
lette+0.043
icking+0.041
igators+0.041
Neg logit contributions
 notice-0.033
But-0.031
 stage-0.031
 suddenly-0.030
Who-0.030
Token elderly
Token ID13830
Feature activation-5.44e-01

Pos logit contributions
beh+0.044
umbs+0.043
lette+0.040
izza+0.038
igators+0.038
Neg logit contributions
 stage-0.031
 notice-0.030
 woke-0.029
 suddenly-0.029
But-0.028
Token man
Token ID582
Feature activation-5.81e-01

Pos logit contributions
umbs+0.035
beh+0.035
lette+0.034
izza+0.033
igators+0.033
Neg logit contributions
 notice-0.006
 stage-0.006
But-0.005
 woke-0.005
 suddenly-0.005
Token was
Token ID373
Feature activation-1.62e+00

Pos logit contributions
umbs+0.032
beh+0.031
lette+0.030
izza+0.029
igators+0.029
Neg logit contributions
 notice-0.012
 woke-0.011
 stage-0.011
But-0.011
 suddenly-0.011
Token giving
Token ID3501
Feature activation-1.54e+00

Pos logit contributions
umbs+0.073
izza+0.071
beh+0.069
ishes+0.065
ocado+0.065
Neg logit contributions
 notice-0.055
 suddenly-0.054
 stage-0.053
But-0.052
 heard-0.051
Token daddy
Token ID35822
Feature activation-4.99e+00

Pos logit contributions
umbs+0.080
beh+0.075
lette+0.073
izza+0.073
illing+0.070
Neg logit contributions
 notice-0.039
 suddenly-0.038
 woke-0.037
 heard-0.036
 stage-0.036
Token lots
Token ID6041
Feature activation-2.50e+00

Pos logit contributions
beh+0.300
umbs+0.296
icking+0.293
oun+0.286
known+0.284
Neg logit contributions
 heard-0.081
 woke-0.067
 dress-0.063
 again-0.062
 saw-0.059
Token of
Token ID286
Feature activation-1.64e+00

Pos logit contributions
lette+0.102
beh+0.102
icking+0.096
izza+0.094
umbs+0.093
Neg logit contributions
 stage-0.081
 again-0.080
 woke-0.079
 notice-0.079
But-0.078
Token advice
Token ID5608
Feature activation-1.40e+00

Pos logit contributions
umbs+0.072
beh+0.068
lette+0.067
icking+0.064
ocado+0.063
Neg logit contributions
 stage-0.055
 notice-0.054
 dress-0.051
 woke-0.050
 heard-0.050
Token.
Token ID13
Feature activation-9.64e-01

Pos logit contributions
umbs+0.085
beh+0.082
lette+0.080
izza+0.078
igators+0.078
Neg logit contributions
 notice-0.021
But-0.020
 stage-0.020
 suddenly-0.018
 woke-0.018
Token
Token ID198
Feature activation-1.52e-01

Pos logit contributions
beh+0.052
umbs+0.050
lette+0.049
igators+0.047
icking+0.047
Neg logit contributions
 notice-0.023
But-0.022
 stage-0.021
 suddenly-0.021
Who-0.021
Tokeniz
Token ID528
Feature activation-8.88e-01

Pos logit contributions
umbs+0.005
beh+0.004
lette+0.004
izza+0.004
igators+0.004
Neg logit contributions
 notice-0.009
 stage-0.009
But-0.009
 woke-0.009
Who-0.009
Token thanked
Token ID26280
Feature activation+1.82e-01

Pos logit contributions
umbs+0.048
beh+0.047
lette+0.045
igators+0.044
izza+0.044
Neg logit contributions
 woke-0.019
 notice-0.019
 suddenly-0.018
But-0.018
 heard-0.017
Token him
Token ID683
Feature activation-1.30e+00

Pos logit contributions
 notice+0.006
 woke+0.005
But+0.005
 stage+0.005
 imagined+0.005
Neg logit contributions
umbs-0.008
beh-0.007
lette-0.007
izza-0.007
igators-0.007
Token,
Token ID11
Feature activation-3.66e-01

Pos logit contributions
umbs+0.086
beh+0.084
lette+0.083
igators+0.083
ocado+0.081
Neg logit contributions
 again-0.014
But-0.013
 notice-0.013
 stage-0.011
 suddenly-0.010
Token gave
Token ID2921
Feature activation-6.80e-01

Pos logit contributions
umbs+0.020
beh+0.019
lette+0.019
igators+0.019
izza+0.019
Neg logit contributions
 notice-0.008
 stage-0.007
But-0.007
 woke-0.007
Who-0.006
Token John
Token ID1757
Feature activation-4.93e+00

Pos logit contributions
umbs+0.035
beh+0.034
lette+0.033
igators+0.032
ocado+0.032
Neg logit contributions
 notice-0.017
 stage-0.016
But-0.016
 suddenly-0.015
 woke-0.015
Token a
Token ID257
Feature activation-1.64e+00

Pos logit contributions
oke+0.339
ini+0.332
beh+0.330
rol+0.326
igators+0.326
Neg logit contributions
 again-0.066
 her-0.046
 and-0.038
,-0.027
 heard-0.026
Token big
Token ID1263
Feature activation-1.15e+00

Pos logit contributions
beh+0.083
umbs+0.075
lette+0.075
igators+0.075
icking+0.071
Neg logit contributions
 woke-0.048
 notice-0.048
 stage-0.044
 heard-0.043
 dress-0.042
Token hug
Token ID16225
Feature activation-5.09e-01

Pos logit contributions
beh+0.061
umbs+0.059
lette+0.058
igators+0.056
known+0.054
Neg logit contributions
 notice-0.028
 woke-0.027
 stage-0.027
But-0.024
 dress-0.024
Token,
Token ID11
Feature activation-4.91e-01

Pos logit contributions
umbs+0.029
beh+0.029
lette+0.028
igators+0.028
izza+0.027
Neg logit contributions
 notice-0.009
 stage-0.008
 woke-0.008
But-0.008
 suddenly-0.007
Token and
Token ID290
Feature activation-1.57e-01

Pos logit contributions
umbs+0.027
beh+0.027
lette+0.026
igators+0.025
izza+0.025
Neg logit contributions
 notice-0.010
But-0.009
 stage-0.009
 woke-0.009
 suddenly-0.009
Token moment
Token ID2589
Feature activation+1.21e-01

Pos logit contributions
umbs+0.017
beh+0.017
lette+0.016
igators+0.016
izza+0.016
Neg logit contributions
 notice-0.008
 stage-0.007
But-0.007
 woke-0.007
Who-0.006
Token and
Token ID290
Feature activation+5.80e-01

Pos logit contributions
 notice+0.003
 stage+0.002
Who+0.002
But+0.002
 woke+0.002
Neg logit contributions
umbs-0.006
lette-0.006
beh-0.006
izza-0.006
igators-0.006
Token then
Token ID788
Feature activation+4.64e-01

Pos logit contributions
 notice+0.014
 stage+0.013
La+0.012
Stop+0.012
But+0.011
Neg logit contributions
lette-0.028
umbs-0.028
beh-0.028
izza-0.027
melon-0.027
Token gave
Token ID2921
Feature activation-9.92e-01

Pos logit contributions
 notice+0.013
 stage+0.013
But+0.012
Stop+0.012
La+0.012
Neg logit contributions
beh-0.021
umbs-0.020
lette-0.020
izza-0.020
igators-0.019
Token the
Token ID262
Feature activation-9.79e-01

Pos logit contributions
umbs+0.056
beh+0.053
lette+0.052
illing+0.051
ady+0.050
Neg logit contributions
 notice-0.021
 suddenly-0.020
 woke-0.019
 stage-0.019
 again-0.019
Token deer
Token ID20096
Feature activation-4.91e+00

Pos logit contributions
umbs+0.050
beh+0.049
izza+0.045
lette+0.045
icking+0.044
Neg logit contributions
 woke-0.026
 notice-0.026
But-0.026
 stage-0.025
Who-0.025
Token a
Token ID257
Feature activation-1.18e+00

Pos logit contributions
oke+0.328
umbs+0.321
oun+0.317
rol+0.316
omp+0.313
Neg logit contributions
 again-0.062
 her-0.053
.-0.048
 something-0.045
,-0.045
Token big
Token ID1263
Feature activation-9.86e-01

Pos logit contributions
beh+0.052
umbs+0.049
lette+0.047
igators+0.046
icking+0.045
Neg logit contributions
 notice-0.041
 woke-0.039
 stage-0.038
But-0.036
 heard-0.036
Token,
Token ID11
Feature activation-5.88e-01

Pos logit contributions
beh+0.044
umbs+0.043
lette+0.041
igators+0.039
icking+0.038
Neg logit contributions
 notice-0.031
 woke-0.030
 stage-0.030
But-0.028
 heard-0.028
Token juicy
Token ID35075
Feature activation-2.55e-01

Pos logit contributions
umbs+0.016
beh+0.016
lette+0.015
izza+0.014
igators+0.014
Neg logit contributions
 notice-0.028
 stage-0.027
But-0.027
 woke-0.027
Who-0.026
Token apple
Token ID17180
Feature activation-2.11e+00

Pos logit contributions
umbs+0.008
beh+0.008
lette+0.007
izza+0.007
igators+0.007
Neg logit contributions
 notice-0.011
 stage-0.011
But-0.011
 woke-0.011
Who-0.010
Token agreed
Token ID4987
Feature activation-1.33e-01

Pos logit contributions
umbs+0.071
beh+0.068
lette+0.067
izza+0.064
ocado+0.063
Neg logit contributions
 notice-0.031
 woke-0.031
 heard-0.030
 suddenly-0.030
 imagined-0.029
Token,
Token ID11
Feature activation-1.56e+00

Pos logit contributions
umbs+0.007
beh+0.007
lette+0.007
izza+0.007
igators+0.007
Neg logit contributions
 notice-0.003
 stage-0.003
But-0.003
 woke-0.003
Who-0.002
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
umbs+0.094
beh+0.093
lette+0.090
igators+0.089
ocado+0.089
Neg logit contributions
 notice-0.025
But-0.024
 stage-0.023
 suddenly-0.022
 woke-0.022
Token gave
Token ID2921
Feature activation-1.64e+00

Pos logit contributions
umbs+0.065
beh+0.061
lette+0.061
izza+0.059
ady+0.059
Neg logit contributions
 notice-0.027
 suddenly-0.027
 woke-0.026
 stage-0.025
But-0.025
Token the
Token ID262
Feature activation-1.48e+00

Pos logit contributions
umbs+0.097
beh+0.091
illing+0.088
lette+0.088
ady+0.087
Neg logit contributions
 notice-0.029
 suddenly-0.029
 woke-0.027
 heard-0.026
 stage-0.026
Token farmer
Token ID18739
Feature activation-4.90e+00

Pos logit contributions
beh+0.072
umbs+0.068
rol+0.061
izza+0.060
icking+0.059
Neg logit contributions
 notice-0.046
 stage-0.045
 woke-0.044
 heard-0.044
But-0.044
Token some
Token ID617
Feature activation-2.11e+00

Pos logit contributions
umbs+0.320
beh+0.314
illing+0.301
 supposed+0.295
unch+0.293
Neg logit contributions
 again-0.058
 heard-0.049
,-0.047
 woke-0.043
 her-0.042
Token of
Token ID286
Feature activation-2.22e+00

Pos logit contributions
beh+0.084
umbs+0.080
lette+0.073
icking+0.072
oun+0.071
Neg logit contributions
 notice-0.078
 woke-0.074
 stage-0.074
 heard-0.073
 dress-0.072
Token their
Token ID511
Feature activation-1.81e+00

Pos logit contributions
umbs+0.141
beh+0.138
ocado+0.133
lette+0.132
izza+0.130
Neg logit contributions
 notice-0.033
 stage-0.033
 school-0.030
But-0.028
 suddenly-0.025
Token eggs
Token ID9653
Feature activation-3.08e+00

Pos logit contributions
umbs+0.074
beh+0.072
rol+0.064
umb+0.064
lette+0.063
Neg logit contributions
 notice-0.067
 stage-0.065
 saw-0.063
 woke-0.063
 school-0.062
Token,
Token ID11
Feature activation-1.75e+00

Pos logit contributions
beh+0.215
igators+0.206
umbs+0.201
ocado+0.199
known+0.199
Neg logit contributions
 again-0.018
,-0.016
 suddenly-0.014
 dress-0.013
 notice-0.010
Token
Token ID198
Feature activation+5.23e-01

Pos logit contributions
umbs+0.000
beh+0.000
lette+0.000
izza+0.000
igators+0.000
Neg logit contributions
 notice-0.000
 stage-0.000
 woke-0.000
But-0.000
Who-0.000
Token
Token ID198
Feature activation-5.30e-01

Pos logit contributions
 notice+0.016
 stage+0.015
 imagined+0.014
Who+0.014
 woke+0.014
Neg logit contributions
umbs-0.024
izza-0.022
beh-0.022
lette-0.022
o-0.022
TokenTom
Token ID13787
Feature activation-1.31e+00

Pos logit contributions
umbs+0.031
lette+0.030
beh+0.030
igators+0.029
icking+0.028
Neg logit contributions
 notice-0.010
But-0.010
 stage-0.008
Who-0.008
 woke-0.008
Token gave
Token ID2921
Feature activation-9.73e-01

Pos logit contributions
umbs+0.080
beh+0.078
lette+0.074
igators+0.071
izza+0.071
Neg logit contributions
 woke-0.023
 heard-0.020
 imagined-0.019
 notice-0.019
 suddenly-0.019
Token the
Token ID262
Feature activation-1.79e+00

Pos logit contributions
umbs+0.055
beh+0.052
lette+0.052
igators+0.050
illing+0.049
Neg logit contributions
 notice-0.020
But-0.019
 stage-0.018
 suddenly-0.018
 woke-0.018
Token man
Token ID582
Feature activation-4.89e+00

Pos logit contributions
beh+0.115
lette+0.098
umbs+0.094
illing+0.092
rol+0.091
Neg logit contributions
 woke-0.044
 stage-0.042
 dress-0.040
 saw-0.039
 man-0.038
Token a
Token ID257
Feature activation-1.33e+00

Pos logit contributions
umbs+0.332
rol+0.317
beh+0.317
ini+0.316
oun+0.316
Neg logit contributions
 again-0.052
 heard-0.047
 woke-0.043
,-0.042
 her-0.041
Token new
Token ID649
Feature activation-6.28e-01

Pos logit contributions
beh+0.066
umbs+0.062
lette+0.060
igators+0.058
icking+0.057
Neg logit contributions
 woke-0.038
 notice-0.038
 stage-0.038
 dress-0.035
 heard-0.034
Token blanket
Token ID18447
Feature activation-1.85e+00

Pos logit contributions
umbs+0.032
beh+0.032
lette+0.030
igators+0.029
izza+0.029
Neg logit contributions
 notice-0.016
 stage-0.015
 woke-0.015
But-0.014
 suddenly-0.014
Token and
Token ID290
Feature activation+4.12e-01

Pos logit contributions
beh+0.119
umbs+0.113
igators+0.110
lette+0.110
izza+0.108
Neg logit contributions
 woke-0.024
 again-0.020
 stage-0.020
 suddenly-0.020
 notice-0.019
Token wished
Token ID16555
Feature activation+4.24e-01

Pos logit contributions
 notice+0.012
But+0.011
 stage+0.011
Who+0.011
Stop+0.011
Neg logit contributions
umbs-0.018
beh-0.018
lette-0.018
izza-0.017
igators-0.017
Token away
Token ID1497
Feature activation-1.36e-01

Pos logit contributions
umbs+0.198
gary+0.197
beh+0.197
izza+0.192
wy+0.191
Neg logit contributions
 woke-0.031
 again-0.030
 heard-0.030
 notice-0.029
 dress-0.027
Token.
Token ID13
Feature activation+8.21e-02

Pos logit contributions
umbs+0.007
beh+0.007
lette+0.007
izza+0.006
igators+0.006
Neg logit contributions
 notice-0.003
 stage-0.003
But-0.003
 woke-0.003
Who-0.003
Token He
Token ID679
Feature activation+3.04e-01

Pos logit contributions
 notice+0.003
 stage+0.003
 woke+0.003
But+0.003
 imagined+0.003
Neg logit contributions
umbs-0.003
beh-0.003
lette-0.003
izza-0.003
igators-0.003
Token gave
Token ID2921
Feature activation-1.02e+00

Pos logit contributions
 notice+0.009
 stage+0.008
But+0.008
Who+0.008
La+0.008
Neg logit contributions
umbs-0.014
beh-0.013
lette-0.013
igators-0.013
izza-0.013
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
umbs+0.059
beh+0.057
lette+0.056
illing+0.055
izza+0.054
Neg logit contributions
 notice-0.019
 woke-0.018
 stage-0.018
 suddenly-0.018
But-0.017
Token children
Token ID1751
Feature activation-4.89e+00

Pos logit contributions
beh+0.048
umbs+0.046
lette+0.042
igators+0.042
izza+0.041
Neg logit contributions
 notice-0.042
 woke-0.041
 stage-0.041
But-0.041
 dress-0.040
Token some
Token ID617
Feature activation-1.41e+00

Pos logit contributions
umbs+0.326
ini+0.314
umm+0.310
oun+0.302
omp+0.299
Neg logit contributions
 her-0.065
 dress-0.058
 heard-0.048
 school-0.047
,-0.045
Token corn
Token ID11676
Feature activation-1.99e+00

Pos logit contributions
umbs+0.053
beh+0.052
lette+0.050
ocado+0.046
oun+0.045
Neg logit contributions
 notice-0.053
 stage-0.052
 woke-0.051
 dress-0.050
 heard-0.049
Token.
Token ID13
Feature activation+4.14e-01

Pos logit contributions
beh+0.126
umbs+0.120
known+0.116
lette+0.113
gary+0.113
Neg logit contributions
 woke-0.024
 again-0.023
 stage-0.023
 notice-0.022
 suddenly-0.021
Token He
Token ID679
Feature activation+1.18e-01

Pos logit contributions
 notice+0.017
 stage+0.016
 woke+0.016
 imagined+0.016
But+0.015
Neg logit contributions
umbs-0.014
izza-0.013
lette-0.013
beh-0.013
igators-0.012
Token told
Token ID1297
Feature activation+6.60e-02

Pos logit contributions
 notice+0.003
 stage+0.003
But+0.003
Who+0.003
 woke+0.003
Neg logit contributions
umbs-0.005
beh-0.005
lette-0.005
izza-0.005
igators-0.005
Token.
Token ID13
Feature activation+3.03e-01

Pos logit contributions
beh+0.095
lette+0.093
umbs+0.092
izza+0.089
known+0.089
Neg logit contributions
 notice-0.018
 woke-0.016
 stage-0.015
But-0.014
 again-0.014
Token Lily
Token ID20037
Feature activation-5.07e-01

Pos logit contributions
 notice+0.011
 stage+0.011
 woke+0.010
 imagined+0.010
But+0.010
Neg logit contributions
umbs-0.012
beh-0.011
lette-0.011
izza-0.011
igators-0.010
Token and
Token ID290
Feature activation-2.59e-01

Pos logit contributions
umbs+0.029
beh+0.029
lette+0.028
izza+0.027
igators+0.027
Neg logit contributions
 notice-0.009
 stage-0.008
But-0.008
 woke-0.008
 imagined-0.007
Token Tom
Token ID4186
Feature activation-1.92e+00

Pos logit contributions
umbs+0.012
beh+0.012
lette+0.012
izza+0.011
igators+0.011
Neg logit contributions
 notice-0.007
 stage-0.007
But-0.007
 woke-0.007
Who-0.006
Token give
Token ID1577
Feature activation-1.34e+00

Pos logit contributions
beh+0.100
umbs+0.100
lette+0.095
izza+0.093
icking+0.092
Neg logit contributions
 notice-0.056
 woke-0.048
 heard-0.045
 hear-0.045
 stage-0.045
Token him
Token ID683
Feature activation-4.86e+00

Pos logit contributions
umbs+0.074
lette+0.071
beh+0.071
illing+0.071
igators+0.070
Neg logit contributions
 woke-0.028
 notice-0.027
 suddenly-0.027
 heard-0.024
 stage-0.024
Token some
Token ID617
Feature activation-1.04e+00

Pos logit contributions
ini+0.276
istent+0.272
illing+0.269
icking+0.267
oke+0.267
Neg logit contributions
 dress-0.086
 woke-0.086
 heard-0.083
 shoes-0.079
 her-0.078
Token treats
Token ID18432
Feature activation-1.70e+00

Pos logit contributions
beh+0.034
umbs+0.033
lette+0.032
icking+0.030
ocado+0.029
Neg logit contributions
 notice-0.044
 woke-0.042
 stage-0.042
But-0.042
 suddenly-0.040
Token.
Token ID13
Feature activation+7.69e-02

Pos logit contributions
beh+0.108
umbs+0.105
known+0.101
lette+0.101
ady+0.099
Neg logit contributions
 again-0.018
 notice-0.018
But-0.018
 woke-0.018
 suddenly-0.016
Token They
Token ID1119
Feature activation-1.37e+00

Pos logit contributions
 notice+0.003
 stage+0.003
 woke+0.003
But+0.003
Who+0.003
Neg logit contributions
umbs-0.003
beh-0.003
lette-0.003
izza-0.002
igators-0.002
Token pet
Token ID4273
Feature activation-9.09e-01

Pos logit contributions
beh+0.071
umbs+0.071
lette+0.068
icking+0.066
izza+0.065
Neg logit contributions
 notice-0.037
 stage-0.033
 woke-0.032
 imagined-0.030
But-0.030
Token you
Token ID345
Feature activation-1.77e+00

Pos logit contributions
beh+0.066
umbs+0.065
lette+0.062
ady+0.061
illing+0.061
Neg logit contributions
 notice-0.025
But-0.025
 stage-0.024
 woke-0.024
 suddenly-0.022
Token."
Token ID526
Feature activation-1.27e+00

Pos logit contributions
beh+0.092
umbs+0.089
known+0.084
lette+0.084
ady+0.082
Neg logit contributions
But-0.041
 woke-0.040
 notice-0.039
 stage-0.038
 suddenly-0.037
Token He
Token ID679
Feature activation-7.90e-01

Pos logit contributions
lette+0.066
beh+0.066
umbs+0.064
igators+0.063
ocado+0.062
Neg logit contributions
But-0.029
 notice-0.028
 woke-0.026
 suddenly-0.026
�-0.025
Token gave
Token ID2921
Feature activation-1.21e+00

Pos logit contributions
umbs+0.038
beh+0.038
lette+0.036
igators+0.035
izza+0.035
Neg logit contributions
 notice-0.021
 woke-0.021
But-0.020
 suddenly-0.020
 stage-0.020
Token Tim
Token ID5045
Feature activation-3.08e+00

Pos logit contributions
umbs+0.067
lette+0.063
beh+0.063
illing+0.063
ocado+0.062
Neg logit contributions
 suddenly-0.026
 notice-0.025
 again-0.024
 woke-0.024
But-0.024
Tokenmy
Token ID1820
Feature activation-4.86e+00

Pos logit contributions
beh+0.137
umbs+0.127
igators+0.125
illing+0.123
itt+0.123
Neg logit contributions
 again-0.105
 heard-0.100
 woke-0.100
 saw-0.096
 dress-0.086
Token a
Token ID257
Feature activation-1.46e+00

Pos logit contributions
beh+0.317
oke+0.302
rol+0.297
 supposed+0.296
behavior+0.294
Neg logit contributions
 again-0.063
 woke-0.063
 heard-0.059
 her-0.056
 dress-0.052
Token sandwich
Token ID20433
Feature activation-1.51e+00

Pos logit contributions
beh+0.071
umbs+0.065
lette+0.062
igators+0.062
icking+0.061
Neg logit contributions
 notice-0.045
 woke-0.045
 stage-0.043
 heard-0.041
 dress-0.041
Token and
Token ID290
Feature activation-2.79e-02

Pos logit contributions
beh+0.097
umbs+0.090
igators+0.090
lette+0.088
known+0.088
Neg logit contributions
 woke-0.019
But-0.018
 notice-0.018
 stage-0.017
 dress-0.016
Token some
Token ID617
Feature activation-1.36e+00

Pos logit contributions
umbs+0.001
beh+0.001
lette+0.001
izza+0.001
igators+0.001
Neg logit contributions
 notice-0.001
 stage-0.001
But-0.001
 woke-0.001
Who-0.001
Token juice
Token ID13135
Feature activation-1.30e+00

Pos logit contributions
beh+0.046
umbs+0.044
lette+0.042
icking+0.040
oun+0.039
Neg logit contributions
 notice-0.057
 woke-0.055
But-0.053
 stage-0.053
 suddenly-0.053
Token nodded
Token ID14464
Feature activation+4.93e-01

Pos logit contributions
umbs+0.071
beh+0.070
lette+0.066
ocado+0.065
izza+0.065
Neg logit contributions
 notice-0.024
 woke-0.023
 suddenly-0.023
But-0.022
 stage-0.022
Token eagerly
Token ID30130
Feature activation-5.33e-01

Pos logit contributions
 notice+0.012
 stage+0.012
Stop+0.011
But+0.010
 woke+0.010
Neg logit contributions
umbs-0.024
lette-0.024
beh-0.023
icking-0.023
allows-0.022
Token.
Token ID13
Feature activation-4.01e-01

Pos logit contributions
umbs+0.031
beh+0.030
lette+0.030
igators+0.029
izza+0.029
Neg logit contributions
 notice-0.009
But-0.008
 stage-0.008
 woke-0.008
 suddenly-0.008
Token Mum
Token ID38367
Feature activation-2.30e+00

Pos logit contributions
umbs+0.018
beh+0.017
lette+0.017
igators+0.016
izza+0.016
Neg logit contributions
 notice-0.013
But-0.012
 stage-0.012
 woke-0.012
Who-0.011
Token gave
Token ID2921
Feature activation-1.66e+00

Pos logit contributions
beh+0.145
umbs+0.145
 ours+0.135
igators+0.132
lette+0.131
Neg logit contributions
 woke-0.038
 heard-0.035
 suddenly-0.034
 saw-0.033
 imagined-0.031
Token them
Token ID606
Feature activation-4.85e+00

Pos logit contributions
beh+0.096
umbs+0.095
illing+0.088
ocado+0.087
lette+0.087
Neg logit contributions
 notice-0.031
 heard-0.031
 suddenly-0.031
 stage-0.030
 woke-0.029
Token some
Token ID617
Feature activation-2.09e+00

Pos logit contributions
beh+0.313
oke+0.312
umbs+0.308
 supposed+0.302
ini+0.297
Neg logit contributions
 her-0.062
 dress-0.058
 again-0.055
 school-0.048
 shoes-0.048
Token food
Token ID2057
Feature activation-2.41e+00

Pos logit contributions
beh+0.088
umbs+0.082
lette+0.076
oun+0.074
icking+0.072
Neg logit contributions
 notice-0.072
 heard-0.070
 woke-0.070
 stage-0.070
But-0.069
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
beh+0.164
umbs+0.154
known+0.150
igators+0.147
lette+0.147
Neg logit contributions
 woke-0.020
 again-0.018
 stage-0.018
 dress-0.018
 suddenly-0.018
Token they
Token ID484
Feature activation-2.04e+00

Pos logit contributions
umbs+0.049
beh+0.048
lette+0.046
igators+0.044
oun+0.044
Neg logit contributions
 notice-0.027
 woke-0.026
 suddenly-0.026
 stage-0.026
 imagined-0.025
Token sprinkled
Token ID49692
Feature activation-1.86e+00

Pos logit contributions
beh+0.110
umbs+0.109
lette+0.101
igators+0.098
oun+0.097
Neg logit contributions
 woke-0.053
 heard-0.049
 suddenly-0.048
 imagined-0.048
 saw-0.047
Token So
Token ID1406
Feature activation+7.21e-01

Pos logit contributions
 notice+0.008
 stage+0.008
 woke+0.008
Who+0.007
But+0.007
Neg logit contributions
umbs-0.010
lette-0.009
beh-0.009
izza-0.009
icking-0.009
Token,
Token ID11
Feature activation+1.31e-01

Pos logit contributions
 notice+0.018
La+0.017
 stage+0.017
Who+0.017
Stop+0.016
Neg logit contributions
umbs-0.033
lette-0.032
beh-0.032
icking-0.032
melon-0.031
Token she
Token ID673
Feature activation-8.21e-02

Pos logit contributions
 notice+0.003
 stage+0.003
 woke+0.003
But+0.003
 imagined+0.002
Neg logit contributions
umbs-0.007
beh-0.007
lette-0.007
igators-0.006
izza-0.006
Token gave
Token ID2921
Feature activation-9.83e-01

Pos logit contributions
umbs+0.004
beh+0.004
lette+0.004
izza+0.004
igators+0.004
Neg logit contributions
 notice-0.002
 stage-0.002
But-0.002
 woke-0.002
Who-0.002
Token the
Token ID262
Feature activation-7.74e-01

Pos logit contributions
umbs+0.057
beh+0.055
lette+0.055
illing+0.054
ocado+0.053
Neg logit contributions
 notice-0.018
 suddenly-0.017
 stage-0.017
 woke-0.017
But-0.016
Token bear
Token ID6842
Feature activation-4.85e+00

Pos logit contributions
beh+0.039
umbs+0.038
lette+0.037
izza+0.034
illing+0.034
Neg logit contributions
 stage-0.021
 woke-0.021
 notice-0.021
But-0.019
 dress-0.019
Token a
Token ID257
Feature activation-1.08e+00

Pos logit contributions
beh+0.312
 fastest+0.299
omp+0.299
oun+0.297
rol+0.297
Neg logit contributions
 her-0.055
 again-0.054
 woke-0.054
 heard-0.054
 dress-0.052
Token sandwich
Token ID20433
Feature activation-1.31e+00

Pos logit contributions
beh+0.049
umbs+0.047
lette+0.046
igators+0.043
icking+0.043
Neg logit contributions
 notice-0.035
 woke-0.034
 stage-0.033
 heard-0.031
 dress-0.031
Token.
Token ID13
Feature activation-6.11e-01

Pos logit contributions
beh+0.080
umbs+0.076
igators+0.075
lette+0.074
known+0.073
Neg logit contributions
 woke-0.019
 notice-0.018
But-0.017
 stage-0.016
 dress-0.016
Token
Token ID198
Feature activation+6.93e-01

Pos logit contributions
beh+0.027
umbs+0.027
lette+0.027
igators+0.025
izza+0.025
Neg logit contributions
 notice-0.018
But-0.018
 stage-0.017
 woke-0.017
Who-0.017
Token
Token ID198
Feature activation-3.18e-01

Pos logit contributions
 notice+0.021
 stage+0.021
 imagined+0.020
Who+0.019
 woke+0.019
Neg logit contributions
umbs-0.031
o-0.027
lette-0.027
izza-0.027
beh-0.027
Token
Token ID198
Feature activation-1.86e-02

Pos logit contributions
 stage+0.020
 notice+0.020
 imagined+0.019
Who+0.018
 woke+0.018
Neg logit contributions
umbs-0.029
izza-0.026
o-0.026
lette-0.026
beh-0.025
TokenMom
Token ID29252
Feature activation-5.80e-01

Pos logit contributions
umbs+0.001
beh+0.001
lette+0.001
izza+0.001
igators+0.001
Neg logit contributions
 notice-0.001
 stage-0.000
 woke-0.000
But-0.000
Who-0.000
Token nodded
Token ID14464
Feature activation+9.20e-01

Pos logit contributions
umbs+0.033
beh+0.032
lette+0.031
igators+0.030
izza+0.030
Neg logit contributions
 notice-0.011
 woke-0.010
But-0.010
 stage-0.010
 suddenly-0.010
Token and
Token ID290
Feature activation-1.08e-01

Pos logit contributions
 stage+0.025
 notice+0.025
Stop+0.023
 concert+0.023
 woke+0.021
Neg logit contributions
umbs-0.042
lette-0.042
icking-0.039
allows-0.039
beh-0.039
Token gave
Token ID2921
Feature activation-1.18e+00

Pos logit contributions
umbs+0.005
beh+0.005
lette+0.005
izza+0.005
igators+0.005
Neg logit contributions
 notice-0.003
 stage-0.002
But-0.002
 woke-0.002
Who-0.002
Token them
Token ID606
Feature activation-4.85e+00

Pos logit contributions
beh+0.070
umbs+0.070
lette+0.068
illing+0.066
ady+0.065
Neg logit contributions
 notice-0.020
 suddenly-0.019
 again-0.019
 stage-0.018
 woke-0.018
Token some
Token ID617
Feature activation-2.00e+00

Pos logit contributions
beh+0.337
ini+0.333
 supposed+0.326
umbs+0.322
oke+0.321
Neg logit contributions
 her-0.058
 again-0.048
 dress-0.044
 it-0.036
.-0.034
Token bread
Token ID8509
Feature activation-2.68e+00

Pos logit contributions
beh+0.080
umbs+0.074
lette+0.068
known+0.066
icking+0.066
Neg logit contributions
 notice-0.073
But-0.071
 dress-0.071
 stage-0.069
 heard-0.069
Token cr
Token ID1067
Feature activation-3.62e+00

Pos logit contributions
beh+0.183
known+0.173
umbs+0.166
igators+0.165
 highlight+0.164
Neg logit contributions
 dress-0.023
,-0.021
But-0.019
 suddenly-0.019
 again-0.018
Tokenumbs
Token ID18146
Feature activation-1.88e+00

Pos logit contributions
beh+0.094
igators+0.090
 highlight+0.090
 strongest+0.083
known+0.082
Neg logit contributions
But-0.154
ro-0.153
 ti-0.152
 dress-0.151
 woke-0.151
Token from
Token ID422
Feature activation+2.59e-01

Pos logit contributions
beh+0.127
lette+0.119
known+0.118
umbs+0.118
izza+0.115
Neg logit contributions
 notice-0.018
 woke-0.016
 again-0.016
 suddenly-0.014
 heard-0.014
Token others
Token ID1854
Feature activation-1.54e+00

Pos logit contributions
umbs+0.008
beh+0.007
lette+0.007
izza+0.007
igators+0.007
Neg logit contributions
 notice-0.006
 stage-0.005
But-0.005
 woke-0.005
Who-0.005
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
umbs+0.099
beh+0.094
lette+0.092
izza+0.091
igators+0.089
Neg logit contributions
 notice-0.020
 stage-0.017
But-0.017
 woke-0.017
 imagined-0.016
Token They
Token ID1119
Feature activation-7.79e-01

Pos logit contributions
beh+0.054
umbs+0.051
lette+0.051
icking+0.047
 dollars+0.046
Neg logit contributions
 notice-0.038
But-0.035
 stage-0.034
 woke-0.033
 suddenly-0.033
Token gave
Token ID2921
Feature activation-1.76e+00

Pos logit contributions
umbs+0.040
beh+0.037
lette+0.037
izza+0.035
igators+0.034
Neg logit contributions
 notice-0.021
 woke-0.020
 stage-0.019
 imagined-0.019
 suddenly-0.019
Token each
Token ID1123
Feature activation-1.86e+00

Pos logit contributions
umbs+0.103
beh+0.097
lette+0.093
illing+0.092
oun+0.091
Neg logit contributions
 notice-0.035
 suddenly-0.032
 heard-0.032
 stage-0.032
 again-0.031
Token person
Token ID1048
Feature activation-4.84e+00

Pos logit contributions
beh+0.101
icking+0.095
umbs+0.095
lette+0.091
izza+0.089
Neg logit contributions
 notice-0.047
 heard-0.046
 woke-0.045
But-0.044
 school-0.043
Token a
Token ID257
Feature activation-7.90e-01

Pos logit contributions
umbs+0.298
unch+0.274
omp+0.267
beh+0.267
umb+0.266
Neg logit contributions
 heard-0.085
 saw-0.078
 dress-0.076
 woke-0.072
 school-0.068
Token medal
Token ID18279
Feature activation-1.58e+00

Pos logit contributions
umbs+0.035
beh+0.035
lette+0.033
igators+0.032
icking+0.031
Neg logit contributions
 notice-0.026
 stage-0.024
 woke-0.024
But-0.023
 school-0.022
Token to
Token ID284
Feature activation+8.74e-01

Pos logit contributions
umbs+0.092
beh+0.091
lette+0.086
igators+0.084
allows+0.083
Neg logit contributions
 stage-0.027
 woke-0.026
 notice-0.025
 imagined-0.024
But-0.024
Token show
Token ID905
Feature activation-2.60e-01

Pos logit contributions
 notice+0.027
 woke+0.027
 stage+0.026
But+0.026
Who+0.026
Neg logit contributions
umbs-0.034
beh-0.034
lette-0.034
igators-0.033
izza-0.032
Token that
Token ID326
Feature activation-8.52e-01

Pos logit contributions
umbs+0.012
beh+0.012
lette+0.012
izza+0.011
igators+0.011
Neg logit contributions
 notice-0.007
 stage-0.007
But-0.007
 woke-0.007
Who-0.006
Token smiled
Token ID13541
Feature activation+4.94e-01

Pos logit contributions
 notice+0.024
 stage+0.023
Stop+0.021
 recess+0.020
Who+0.020
Neg logit contributions
lette-0.037
umbs-0.036
beh-0.035
ocado-0.035
izza-0.034
Token,
Token ID11
Feature activation+1.91e-01

Pos logit contributions
 notice+0.013
 stage+0.012
Stop+0.012
 woke+0.011
Who+0.011
Neg logit contributions
umbs-0.023
lette-0.023
beh-0.022
icking-0.022
ocado-0.022
Token giving
Token ID3501
Feature activation-6.30e-01

Pos logit contributions
 notice+0.006
 stage+0.006
 woke+0.005
Stop+0.005
But+0.005
Neg logit contributions
umbs-0.008
lette-0.008
beh-0.008
izza-0.008
ocado-0.008
Token his
Token ID465
Feature activation-6.86e-01

Pos logit contributions
umbs+0.030
beh+0.029
lette+0.028
igators+0.027
izza+0.027
Neg logit contributions
 notice-0.018
 stage-0.017
But-0.017
 woke-0.017
 suddenly-0.016
Token grand
Token ID4490
Feature activation-1.26e+00

Pos logit contributions
umbs+0.029
beh+0.027
lette+0.026
igators+0.025
izza+0.024
Neg logit contributions
 notice-0.024
 stage-0.024
 woke-0.023
But-0.023
 school-0.022
Tokenpa
Token ID8957
Feature activation-4.84e+00

Pos logit contributions
umbs+0.058
beh+0.053
igators+0.050
ocado+0.050
lette+0.049
Neg logit contributions
 notice-0.038
 stage-0.038
 woke-0.037
 school-0.035
 heard-0.035
Token a
Token ID257
Feature activation-1.30e+00

Pos logit contributions
umbs+0.295
oun+0.281
rol+0.277
beh+0.273
umb+0.271
Neg logit contributions
 heard-0.092
 again-0.085
 woke-0.076
 saw-0.072
,-0.068
Token big
Token ID1263
Feature activation-6.04e-01

Pos logit contributions
beh+0.062
umbs+0.061
lette+0.057
igators+0.056
icking+0.055
Neg logit contributions
 notice-0.040
 woke-0.038
 stage-0.037
 heard-0.035
But-0.035
Token hug
Token ID16225
Feature activation-4.89e-01

Pos logit contributions
umbs+0.030
beh+0.029
lette+0.029
igators+0.027
izza+0.027
Neg logit contributions
 notice-0.016
 stage-0.015
 woke-0.015
But-0.014
 suddenly-0.014
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
umbs+0.028
beh+0.028
lette+0.027
igators+0.026
izza+0.026
Neg logit contributions
 notice-0.009
 stage-0.008
 woke-0.008
But-0.008
 suddenly-0.007
Token<|endoftext|>
Token ID50256
Feature activation+7.86e-01

Pos logit contributions
umbs+0.005
beh+0.005
lette+0.005
izza+0.005
igators+0.005
Neg logit contributions
 notice-0.004
 stage-0.004
But-0.004
 woke-0.003
Who-0.003
Token rescue
Token ID9992
Feature activation-5.28e-01

Pos logit contributions
umbs+0.010
beh+0.010
lette+0.010
izza+0.010
igators+0.009
Neg logit contributions
 notice-0.007
 stage-0.006
But-0.006
 woke-0.006
Who-0.006
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
umbs+0.030
beh+0.029
lette+0.029
izza+0.028
igators+0.028
Neg logit contributions
 notice-0.010
 stage-0.009
 woke-0.009
But-0.009
 suddenly-0.008
Token They
Token ID1119
Feature activation-9.23e-01

Pos logit contributions
beh+0.056
umbs+0.055
lette+0.054
wy+0.052
icking+0.051
Neg logit contributions
 notice-0.039
 stage-0.039
 woke-0.038
But-0.038
Who-0.037
Token gave
Token ID2921
Feature activation-1.92e+00

Pos logit contributions
umbs+0.050
beh+0.048
lette+0.047
izza+0.045
oun+0.044
Neg logit contributions
 notice-0.022
 woke-0.022
 heard-0.021
 suddenly-0.020
 imagined-0.020
Token the
Token ID262
Feature activation-1.37e+00

Pos logit contributions
umbs+0.130
illing+0.118
beh+0.117
ady+0.116
umb+0.115
Neg logit contributions
 notice-0.027
 heard-0.025
 saw-0.024
 it-0.024
 woke-0.024
Token family
Token ID1641
Feature activation-4.82e+00

Pos logit contributions
beh+0.066
umbs+0.064
lette+0.059
rol+0.058
izza+0.058
Neg logit contributions
 stage-0.044
 notice-0.043
 woke-0.040
 heard-0.040
 school-0.039
Token food
Token ID2057
Feature activation-2.11e+00

Pos logit contributions
umbs+0.301
beh+0.291
omp+0.287
icking+0.284
unch+0.280
Neg logit contributions
 heard-0.065
 her-0.064
 school-0.064
 dress-0.062
 woke-0.060
Token,
Token ID11
Feature activation-1.34e+00

Pos logit contributions
beh+0.130
umbs+0.127
lette+0.122
oun+0.119
known+0.118
Neg logit contributions
 woke-0.030
 notice-0.029
 stage-0.028
 heard-0.027
 again-0.026
Token clothes
Token ID8242
Feature activation-1.77e+00

Pos logit contributions
umbs+0.070
beh+0.070
lette+0.066
oun+0.065
ocado+0.064
Neg logit contributions
 notice-0.033
 woke-0.032
But-0.031
 stage-0.031
 heard-0.030
Token and
Token ID290
Feature activation-6.99e-01

Pos logit contributions
beh+0.117
umbs+0.114
igators+0.111
lette+0.111
ocado+0.109
Neg logit contributions
 notice-0.018
 woke-0.017
 stage-0.016
 suddenly-0.015
 dress-0.014
Token a
Token ID257
Feature activation-4.60e-01

Pos logit contributions
umbs+0.031
beh+0.029
lette+0.028
izza+0.027
oun+0.027
Neg logit contributions
 notice-0.023
 woke-0.022
 stage-0.022
But-0.021
 suddenly-0.021
Token fun
Token ID1257
Feature activation+1.14e-01

Pos logit contributions
umbs+0.075
lette+0.063
rol+0.062
izza+0.060
ady+0.060
Neg logit contributions
 notice-0.055
 seek-0.054
 stage-0.053
 school-0.053
But-0.052
Token.
Token ID13
Feature activation-2.03e-01

Pos logit contributions
 notice+0.003
 stage+0.002
 woke+0.002
But+0.002
Who+0.002
Neg logit contributions
umbs-0.006
beh-0.006
lette-0.006
igators-0.006
izza-0.005
Token Can
Token ID1680
Feature activation+3.59e-01

Pos logit contributions
umbs+0.008
beh+0.008
lette+0.008
izza+0.008
igators+0.008
Neg logit contributions
 notice-0.007
 stage-0.006
But-0.006
 woke-0.006
Who-0.006
Token we
Token ID356
Feature activation+4.04e-01

Pos logit contributions
 notice+0.012
 stage+0.011
 woke+0.010
 imagined+0.010
La+0.010
Neg logit contributions
umbs-0.015
beh-0.015
lette-0.014
izza-0.014
igators-0.014
Token be
Token ID307
Feature activation+8.28e-02

Pos logit contributions
 notice+0.011
 stage+0.011
Who+0.011
 woke+0.011
La+0.011
Neg logit contributions
umbs-0.018
lette-0.018
beh-0.017
izza-0.017
icking-0.016
Token friends
Token ID2460
Feature activation+2.83e+00

Pos logit contributions
 notice+0.004
 stage+0.004
 woke+0.004
Who+0.004
But+0.004
Neg logit contributions
umbs-0.002
beh-0.002
lette-0.002
igators-0.002
izza-0.002
Token?"
Token ID1701
Feature activation+1.09e+00

Pos logit contributions
icol+0.108
 inspection+0.095
 staircase+0.086
 stage+0.085
 avail+0.083
Neg logit contributions
 dollar-0.108
o-0.104
lette-0.101
 equally-0.100
 equal-0.100
Token Lily
Token ID20037
Feature activation+8.82e-01

Pos logit contributions
 stage+0.041
 imagined+0.041
 notice+0.039
 woke+0.039
 concert+0.038
Neg logit contributions
umbs-0.039
o-0.036
izza-0.036
melon-0.033
 dollar-0.032
Token forg
Token ID5784
Feature activation+6.92e-01

Pos logit contributions
 notice+0.032
 seek+0.029
 stage+0.029
 imagined+0.029
But+0.029
Neg logit contributions
umbs-0.032
beh-0.031
lette-0.030
igators-0.030
izza-0.030
Tokenives
Token ID1083
Feature activation+4.13e-01

Pos logit contributions
 stage+0.038
 notice+0.036
 suddenly+0.036
But+0.036
Who+0.036
Neg logit contributions
umbs-0.016
beh-0.015
ocado-0.013
lette-0.013
izza-0.012
Token Ben
Token ID3932
Feature activation-3.10e-01

Pos logit contributions
 notice+0.012
 stage+0.011
 imagined+0.011
Who+0.010
But+0.010
Neg logit contributions
umbs-0.020
beh-0.018
lette-0.018
icking-0.018
izza-0.018
Token She
Token ID1375
Feature activation+1.04e+00

Pos logit contributions
 notice+0.002
 stage+0.002
 woke+0.002
But+0.002
Who+0.002
Neg logit contributions
umbs-0.003
beh-0.003
lette-0.003
izza-0.003
igators-0.003
Token fl
Token ID781
Feature activation-8.68e-01

Pos logit contributions
 notice+0.033
But+0.031
Who+0.031
 stage+0.030
Stop+0.030
Neg logit contributions
umbs-0.043
beh-0.041
lette-0.041
izza-0.040
igators-0.040
Tokenaps
Token ID1686
Feature activation+1.91e+00

Pos logit contributions
umbs+0.028
beh+0.027
lette+0.027
igators+0.026
izza+0.025
Neg logit contributions
 notice-0.035
But-0.035
 stage-0.034
 woke-0.034
 suddenly-0.033
Token her
Token ID607
Feature activation+5.64e-01

Pos logit contributions
Who+0.065
icol+0.065
 stage+0.064
lf+0.061
 concert+0.061
Neg logit contributions
umbs-0.072
 equal-0.065
igators-0.064
lette-0.064
allows-0.063
Token wings
Token ID12098
Feature activation+5.69e-01

Pos logit contributions
 notice+0.022
But+0.020
Who+0.020
 woke+0.020
Stop+0.019
Neg logit contributions
umbs-0.021
lette-0.020
beh-0.019
igators-0.019
izza-0.019
Token and
Token ID290
Feature activation+2.37e+00

Pos logit contributions
 notice+0.014
Who+0.013
 stage+0.013
But+0.012
Stop+0.012
Neg logit contributions
umbs-0.029
lette-0.027
beh-0.027
igators-0.026
icking-0.026
Token flies
Token ID17607
Feature activation+1.04e+00

Pos logit contributions
But+0.074
chool+0.072
Stop+0.072
 notice+0.071
Who+0.071
Neg logit contributions
igators-0.084
 tast-0.081
ishes-0.081
oops-0.080
beh-0.080
Token away
Token ID1497
Feature activation+9.31e-01

Pos logit contributions
 notice+0.035
Who+0.034
 stage+0.034
 concert+0.033
lf+0.033
Neg logit contributions
umbs-0.041
beh-0.039
lette-0.038
igators-0.038
o-0.037
Token.
Token ID13
Feature activation+9.08e-01

Pos logit contributions
Who+0.025
 notice+0.024
Stop+0.024
La+0.023
But+0.023
Neg logit contributions
umbs-0.043
beh-0.042
lette-0.040
igators-0.040
 equal-0.039
Token She
Token ID1375
Feature activation+1.16e+00

Pos logit contributions
 notice+0.030
 stage+0.029
 woke+0.028
 imagined+0.028
 concert+0.027
Neg logit contributions
umbs-0.038
izza-0.034
lette-0.034
beh-0.033
o-0.033
Token feels
Token ID5300
Feature activation+5.69e-02

Pos logit contributions
 notice+0.038
But+0.036
Who+0.036
 avail+0.035
 stage+0.035
Neg logit contributions
umbs-0.045
igators-0.043
beh-0.043
lette-0.041
ocado-0.041
Token danced
Token ID39480
Feature activation+1.92e+00

Pos logit contributions
 avail+0.037
 notice+0.037
Who+0.035
 stage+0.034
 recess+0.033
Neg logit contributions
umbs-0.046
o-0.045
beh-0.045
lette-0.045
allows-0.043
Token and
Token ID290
Feature activation+2.08e+00

Pos logit contributions
 avail+0.087
Who+0.083
Stop+0.075
Raw+0.074
 notice+0.073
Neg logit contributions
umbs-0.053
allows-0.050
 equally-0.049
o-0.049
sy-0.048
Token danced
Token ID39480
Feature activation+2.23e+00

Pos logit contributions
 avail+0.074
Who+0.074
Stop+0.073
 grade+0.073
�+0.072
Neg logit contributions
allows-0.065
o-0.065
beh-0.062
igators-0.058
lings-0.058
Token until
Token ID1566
Feature activation+1.86e+00

Pos logit contributions
Who+0.106
 avail+0.103
Stop+0.096
AAAA+0.093
Raw+0.092
Neg logit contributions
umbs-0.055
sy-0.053
 equally-0.052
allows-0.051
asting-0.051
Token the
Token ID262
Feature activation+1.09e+00

Pos logit contributions
 imagined+0.063
Who+0.062
 robe+0.062
Raw+0.062
chool+0.061
Neg logit contributions
melon-0.055
umbs-0.054
 highlight-0.053
 much-0.052
allows-0.052
Token rain
Token ID6290
Feature activation+2.39e+00

Pos logit contributions
 notice+0.038
 imagined+0.036
Who+0.035
But+0.034
 inspection+0.033
Neg logit contributions
umbs-0.041
lette-0.038
allows-0.037
igators-0.037
icking-0.037
Token stopped
Token ID5025
Feature activation+1.63e+00

Pos logit contributions
 avail+0.080
Who+0.074
 robe+0.073
 stage+0.073
 notice+0.072
Neg logit contributions
umbs-0.100
icking-0.087
beh-0.085
asting-0.085
o-0.083
Token.
Token ID13
Feature activation+1.49e+00

Pos logit contributions
lf+0.054
 robe+0.051
 stage+0.051
 woke+0.050
Who+0.050
Neg logit contributions
icking-0.063
allows-0.061
beh-0.059
lette-0.058
o-0.057
Token She
Token ID1375
Feature activation+2.43e+00

Pos logit contributions
 notice+0.065
 imagined+0.063
 seek+0.062
lf+0.061
 stage+0.061
Neg logit contributions
umbs-0.045
izza-0.039
o-0.038
oun-0.038
lette-0.037
Token went
Token ID1816
Feature activation+1.80e+00

Pos logit contributions
 avail+0.097
 recess+0.081
Who+0.080
 notice+0.079
chool+0.079
Neg logit contributions
o-0.079
ished-0.075
allows-0.074
beh-0.071
ipp-0.071
Token back
Token ID736
Feature activation+2.13e+00

Pos logit contributions
 avail+0.082
Who+0.081
lf+0.076
�+0.076
Stop+0.076
Neg logit contributions
o-0.047
icking-0.044
melon-0.044
beh-0.043
rol-0.042
Token study
Token ID2050
Feature activation+1.93e+00

Pos logit contributions
icol+0.056
AAAA+0.055
Raw+0.052
 stage+0.052
Who+0.051
Neg logit contributions
beh-0.063
lette-0.062
ough-0.058
 dollar-0.058
 equal-0.057
Token yet
Token ID1865
Feature activation+9.53e-01

Pos logit contributions
Who+0.062
 notice+0.060
icol+0.059
Raw+0.059
 woke+0.058
Neg logit contributions
umbs-0.070
 equal-0.069
 dollars-0.069
beh-0.069
 dollar-0.068
Token because
Token ID780
Feature activation-6.61e-02

Pos logit contributions
La+0.022
 notice+0.022
Who+0.022
Stop+0.021
 stage+0.021
Neg logit contributions
umbs-0.045
beh-0.044
lette-0.043
 equal-0.042
izza-0.042
Token he
Token ID339
Feature activation+6.52e-01

Pos logit contributions
umbs+0.003
beh+0.003
lette+0.003
igators+0.003
izza+0.003
Neg logit contributions
 notice-0.002
 stage-0.002
But-0.001
 woke-0.001
Who-0.001
Token wanted
Token ID2227
Feature activation+6.67e-01

Pos logit contributions
 notice+0.019
 stage+0.018
But+0.017
Who+0.016
La+0.016
Neg logit contributions
beh-0.030
umbs-0.029
lette-0.029
o-0.028
ocado-0.027
Token to
Token ID284
Feature activation+2.47e+00

Pos logit contributions
 notice+0.020
But+0.019
 stage+0.019
 woke+0.019
Who+0.018
Neg logit contributions
umbs-0.028
beh-0.027
lette-0.027
izza-0.026
 dollar-0.026
Token have
Token ID423
Feature activation+9.80e-01

Pos logit contributions
icol+0.102
AAAA+0.096
Exc+0.090
chool+0.090
 inspection+0.088
Neg logit contributions
lette-0.072
beh-0.072
ough-0.071
 equal-0.070
 supposed-0.068
Token fun
Token ID1257
Feature activation+1.99e+00

Pos logit contributions
 notice+0.036
But+0.036
 woke+0.036
 stage+0.035
Who+0.035
Neg logit contributions
beh-0.032
umbs-0.032
lette-0.031
 equal-0.030
izza-0.030
Token.
Token ID13
Feature activation-5.70e-02

Pos logit contributions
icol+0.069
AAAA+0.068
 woke+0.068
Raw+0.066
Stop+0.066
Neg logit contributions
beh-0.069
 brothers-0.066
 dollars-0.065
 supposed-0.065
 equal-0.065
Token
Token ID198
Feature activation+9.67e-01

Pos logit contributions
umbs+0.003
beh+0.003
lette+0.003
izza+0.003
igators+0.003
Neg logit contributions
 notice-0.002
 stage-0.001
 woke-0.001
But-0.001
Who-0.001
Token
Token ID198
Feature activation-4.26e-01

Pos logit contributions
 notice+0.030
 stage+0.030
 imagined+0.028
Who+0.027
 woke+0.027
Neg logit contributions
umbs-0.042
o-0.039
lette-0.038
beh-0.038
izza-0.038
Token agreed
Token ID4987
Feature activation+1.69e-01

Pos logit contributions
umbs+0.028
beh+0.028
lette+0.027
igators+0.026
izza+0.026
Neg logit contributions
 notice-0.011
 woke-0.011
 stage-0.011
But-0.010
 suddenly-0.010
Token and
Token ID290
Feature activation-7.59e-01

Pos logit contributions
 notice+0.004
 stage+0.003
Who+0.003
La+0.003
But+0.003
Neg logit contributions
umbs-0.009
lette-0.009
beh-0.009
izza-0.008
icking-0.008
Token they
Token ID484
Feature activation-1.32e+00

Pos logit contributions
umbs+0.040
lette+0.038
beh+0.038
igators+0.036
izza+0.036
Neg logit contributions
 notice-0.018
 woke-0.018
 stage-0.017
 suddenly-0.017
But-0.016
Token both
Token ID1111
Feature activation-1.03e+00

Pos logit contributions
umbs+0.064
beh+0.063
lette+0.061
igators+0.059
izza+0.058
Neg logit contributions
 woke-0.039
 notice-0.036
 suddenly-0.035
 imagined-0.034
 heard-0.034
Token fell
Token ID3214
Feature activation+7.59e-01

Pos logit contributions
umbs+0.056
beh+0.055
lette+0.054
izza+0.052
igators+0.051
Neg logit contributions
 woke-0.023
 notice-0.022
 imagined-0.020
 suddenly-0.019
 heard-0.019
Token asleep
Token ID16039
Feature activation+2.71e+00

Pos logit contributions
 notice+0.024
Who+0.024
But+0.023
Stop+0.023
 stage+0.023
Neg logit contributions
umbs-0.032
beh-0.030
icking-0.029
o-0.028
allows-0.028
Token with
Token ID351
Feature activation-1.36e+00

Pos logit contributions
Stop+0.113
Who+0.102
Raw+0.101
La+0.098
Exc+0.097
Neg logit contributions
icking-0.082
 equal-0.080
umbs-0.080
known-0.078
ages-0.078
Token Teddy
Token ID29345
Feature activation-2.23e+00

Pos logit contributions
beh+0.071
umbs+0.069
lette+0.069
illing+0.065
izza+0.065
Neg logit contributions
 notice-0.037
 stage-0.036
 suddenly-0.034
 school-0.033
 dress-0.033
Token between
Token ID1022
Feature activation-1.03e+00

Pos logit contributions
igators+0.142
beh+0.142
umbs+0.140
lette+0.136
izza+0.133
Neg logit contributions
 again-0.034
 woke-0.032
 dress-0.029
 stage-0.028
 heard-0.028
Token them
Token ID606
Feature activation-1.69e+00

Pos logit contributions
beh+0.051
lette+0.051
umbs+0.049
ocado+0.048
izza+0.047
Neg logit contributions
 notice-0.029
 stage-0.028
 suddenly-0.026
But-0.026
 school-0.025
Token.
Token ID13
Feature activation-5.56e-01

Pos logit contributions
beh+0.107
umbs+0.106
lette+0.105
igators+0.104
izza+0.102
Neg logit contributions
 notice-0.021
 woke-0.020
 again-0.019
 suddenly-0.019
 stage-0.019
Token.
Token ID13
Feature activation-9.39e-01

Pos logit contributions
 notice+0.000
 stage+0.000
 woke+0.000
But+0.000
Who+0.000
Neg logit contributions
umbs-0.000
beh-0.000
lette-0.000
izza-0.000
igators-0.000
Tokenâ
Token ID22940
Feature activation-2.59e-02

Pos logit contributions
umbs+0.043
beh+0.042
lette+0.041
igators+0.040
icking+0.039
Neg logit contributions
 notice-0.027
But-0.026
Who-0.025
 stage-0.025
 woke-0.025
Token€
Token ID26391
Feature activation-1.77e-01

Pos logit contributions
umbs+0.001
beh+0.001
lette+0.001
izza+0.001
igators+0.001
Neg logit contributions
 notice-0.001
 stage-0.001
But-0.001
 woke-0.001
Who-0.001
Token It
Token ID632
Feature activation-3.01e-01

Pos logit contributions
umbs+0.008
beh+0.008
lette+0.008
izza+0.008
igators+0.008
Neg logit contributions
 notice-0.005
 stage-0.005
But-0.004
 woke-0.004
Who-0.004
Token was
Token ID373
Feature activation+9.57e-02

Pos logit contributions
umbs+0.014
beh+0.013
lette+0.013
izza+0.013
igators+0.012
Neg logit contributions
 notice-0.009
 stage-0.008
 woke-0.008
But-0.008
 suddenly-0.008
Token the
Token ID262
Feature activation-5.27e-02

Pos logit contributions
 notice+0.003
 stage+0.003
But+0.003
 woke+0.003
Who+0.003
Neg logit contributions
umbs-0.004
beh-0.004
lette-0.004
igators-0.003
izza-0.003
Token l
Token ID300
Feature activation+2.32e-01

Pos logit contributions
umbs+0.002
beh+0.002
lette+0.002
igators+0.002
izza+0.002
Neg logit contributions
 notice-0.002
 stage-0.002
But-0.002
 woke-0.002
Who-0.002
Tokenily
Token ID813
Feature activation-5.16e-01

Pos logit contributions
 notice+0.010
 stage+0.010
But+0.009
 woke+0.009
Who+0.009
Neg logit contributions
umbs-0.008
beh-0.007
lette-0.007
ocado-0.007
igators-0.006
Token!
Token ID0
Feature activation-5.12e-01

Pos logit contributions
umbs+0.026
beh+0.025
lette+0.025
izza+0.024
igators+0.024
Neg logit contributions
 notice-0.013
 stage-0.012
 woke-0.012
But-0.012
 suddenly-0.011
Token
Token ID220
Feature activation-1.23e+00

Pos logit contributions
umbs+0.023
beh+0.022
lette+0.022
igators+0.021
izza+0.021
Neg logit contributions
 notice-0.015
But-0.015
 stage-0.014
 woke-0.014
Who-0.014
Token
Token ID198
Feature activation+1.98e-01

Pos logit contributions
umbs+0.068
beh+0.068
lette+0.068
igators+0.066
melon+0.064
Neg logit contributions
 notice-0.022
But-0.021
 stage-0.019
 woke-0.018
 suddenly-0.018
Token The
Token ID383
Feature activation+3.22e-01

Pos logit contributions
 notice+0.037
 imagined+0.036
Stop+0.036
 seek+0.035
 stage+0.035
Neg logit contributions
umbs-0.037
izza-0.033
o-0.032
lette-0.032
allows-0.031
Token girl
Token ID2576
Feature activation-1.96e-01

Pos logit contributions
 notice+0.009
But+0.009
 stage+0.009
 imagined+0.009
 woke+0.008
Neg logit contributions
umbs-0.014
lette-0.014
igators-0.013
beh-0.013
izza-0.013
Token said
Token ID531
Feature activation+2.35e-01

Pos logit contributions
umbs+0.010
beh+0.010
lette+0.010
izza+0.010
igators+0.010
Neg logit contributions
 notice-0.004
 stage-0.004
But-0.004
 woke-0.004
Who-0.004
Token,
Token ID11
Feature activation+5.49e-01

Pos logit contributions
 notice+0.004
 stage+0.004
But+0.004
Who+0.004
 imagined+0.004
Neg logit contributions
umbs-0.013
lette-0.013
beh-0.012
izza-0.012
icking-0.012
Token "
Token ID366
Feature activation-1.27e-01

Pos logit contributions
 notice+0.018
 stage+0.016
 imagined+0.016
But+0.015
Stop+0.015
Neg logit contributions
umbs-0.023
lette-0.022
melon-0.021
beh-0.021
izza-0.021
TokenNo
Token ID2949
Feature activation+4.11e-01

Pos logit contributions
umbs+0.006
beh+0.006
lette+0.006
izza+0.006
igators+0.006
Neg logit contributions
 notice-0.003
 stage-0.003
But-0.003
 woke-0.003
Who-0.003
Token,
Token ID11
Feature activation-4.22e-01

Pos logit contributions
 notice+0.009
But+0.008
Who+0.008
 imagined+0.008
La+0.008
Neg logit contributions
umbs-0.021
izza-0.020
beh-0.020
lette-0.020
 dollar-0.020
Token it
Token ID340
Feature activation-2.35e+00

Pos logit contributions
umbs+0.023
beh+0.023
lette+0.022
izza+0.021
igators+0.021
Neg logit contributions
 notice-0.009
 stage-0.008
 woke-0.008
But-0.008
Who-0.008
Token is
Token ID318
Feature activation-1.11e+00

Pos logit contributions
umbs+0.137
lette+0.127
beh+0.125
gary+0.122
illing+0.122
Neg logit contributions
 woke-0.054
�-0.048
?"-0.048
 saw-0.048
 heard-0.048
Token not
Token ID407
Feature activation-1.50e+00

Pos logit contributions
umbs+0.057
lette+0.053
beh+0.052
ady+0.050
ocado+0.049
Neg logit contributions
 notice-0.031
 stage-0.031
But-0.029
 suddenly-0.029
 school-0.028
Token.
Token ID13
Feature activation-2.35e+00

Pos logit contributions
umbs+0.079
beh+0.072
lette+0.071
itt+0.071
izza+0.068
Neg logit contributions
 notice-0.042
 school-0.039
 stage-0.037
?"-0.037
 imagined-0.036
Token day
Token ID1110
Feature activation+1.18e+00

Pos logit contributions
umbs+0.066
beh+0.066
lette+0.062
izza+0.061
ishes+0.061
Neg logit contributions
 woke-0.033
But-0.031
 notice-0.031
 school-0.031
 suddenly-0.029
Token,
Token ID11
Feature activation+1.83e+00

Pos logit contributions
 notice+0.032
Who+0.031
 stage+0.031
La+0.031
 imagined+0.030
Neg logit contributions
umbs-0.052
o-0.049
izza-0.049
 dollar-0.049
itt-0.048
Token Lily
Token ID20037
Feature activation+5.16e-01

Pos logit contributions
La+0.065
icol+0.063
 imagined+0.061
 portal+0.060
 campus+0.060
Neg logit contributions
melon-0.059
umbs-0.052
 dollar-0.051
oun-0.051
o-0.050
Token was
Token ID373
Feature activation-7.59e-01

Pos logit contributions
 notice+0.021
 stage+0.021
But+0.020
Who+0.020
La+0.020
Neg logit contributions
umbs-0.016
lette-0.016
beh-0.016
ocado-0.015
icking-0.015
Token playing
Token ID2712
Feature activation+1.74e+00

Pos logit contributions
umbs+0.035
izza+0.033
lette+0.032
beh+0.032
igators+0.032
Neg logit contributions
 notice-0.025
 stage-0.023
But-0.023
 suddenly-0.023
 woke-0.022
Token in
Token ID287
Feature activation+8.06e-01

Pos logit contributions
icol+0.064
 avail+0.062
La+0.062
 notice+0.061
 woke+0.060
Neg logit contributions
beh-0.053
umbs-0.053
asting-0.052
aking-0.051
o-0.051
Token the
Token ID262
Feature activation-2.75e-02

Pos logit contributions
 notice+0.024
 woke+0.024
�+0.023
La+0.022
 imagined+0.022
Neg logit contributions
umbs-0.032
beh-0.031
igators-0.031
 equal-0.030
lette-0.029
Token park
Token ID3952
Feature activation+4.29e-01

Pos logit contributions
umbs+0.001
beh+0.001
lette+0.001
igators+0.001
izza+0.001
Neg logit contributions
 notice-0.001
 stage-0.001
 woke-0.001
But-0.001
Who-0.000
Token when
Token ID618
Feature activation+1.95e+00

Pos logit contributions
 notice+0.012
 stage+0.011
 woke+0.011
Who+0.011
La+0.011
Neg logit contributions
umbs-0.020
lette-0.019
beh-0.019
illing-0.018
icking-0.018
Token she
Token ID673
Feature activation+1.67e+00

Pos logit contributions
 imagined+0.057
 notice+0.056
icol+0.055
 woke+0.053
 speaker+0.052
Neg logit contributions
o-0.068
umbs-0.067
itt-0.067
 brothers-0.065
melon-0.065
Token saw
Token ID2497
Feature activation+3.93e-01

Pos logit contributions
 stage+0.040
 avail+0.037
chool+0.035
 speaker+0.034
Stop+0.034
Neg logit contributions
o-0.084
umbs-0.079
lette-0.076
ocado-0.076
known-0.074
Token started
Token ID2067
Feature activation-7.83e-01

Pos logit contributions
 stage+0.023
 notice+0.023
La+0.022
Who+0.021
 avail+0.021
Neg logit contributions
umbs-0.035
lette-0.033
icking-0.032
beh-0.032
izza-0.030
Token to
Token ID284
Feature activation-8.39e-02

Pos logit contributions
umbs+0.045
lette+0.043
ocado+0.042
beh+0.042
izza+0.042
Neg logit contributions
 notice-0.017
 stage-0.014
But-0.014
 woke-0.014
 suddenly-0.014
Token cry
Token ID3960
Feature activation+1.04e+00

Pos logit contributions
umbs+0.004
beh+0.004
lette+0.004
igators+0.004
izza+0.004
Neg logit contributions
 notice-0.002
 stage-0.002
But-0.002
 woke-0.002
Who-0.002
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
 stage+0.031
 notice+0.030
La+0.029
 seek+0.029
Stop+0.029
Neg logit contributions
umbs-0.045
lette-0.042
beh-0.042
icking-0.042
illing-0.040
Token They
Token ID1119
Feature activation+1.06e+00

Pos logit contributions
 seek+0.056
 woke+0.055
lf+0.055
 notice+0.055
 concert+0.055
Neg logit contributions
umbs-0.018
izza-0.016
oun-0.014
oops-0.014
melon-0.014
Token did
Token ID750
Feature activation+2.22e-01

Pos logit contributions
 stage+0.031
 avail+0.030
La+0.029
 notice+0.029
 concert+0.028
Neg logit contributions
umbs-0.044
beh-0.043
lette-0.042
 equal-0.040
igators-0.040
Token not
Token ID407
Feature activation+1.57e-01

Pos logit contributions
 notice+0.008
 stage+0.008
 woke+0.008
Who+0.008
La+0.008
Neg logit contributions
beh-0.008
umbs-0.008
lette-0.008
izza-0.007
igators-0.007
Token have
Token ID423
Feature activation-1.02e+00

Pos logit contributions
 stage+0.006
 notice+0.006
But+0.006
 suddenly+0.005
 woke+0.005
Neg logit contributions
umbs-0.006
beh-0.006
lette-0.006
igators-0.005
ocado-0.005
Token any
Token ID597
Feature activation-1.64e-01

Pos logit contributions
umbs+0.049
lette+0.045
beh+0.043
ocado+0.043
izza+0.043
Neg logit contributions
 notice-0.034
 woke-0.029
 heard-0.029
 imagined-0.029
 stage-0.028
Token money
Token ID1637
Feature activation-1.19e+00

Pos logit contributions
umbs+0.005
beh+0.005
lette+0.005
izza+0.005
igators+0.005
Neg logit contributions
 notice-0.007
 stage-0.007
But-0.007
 woke-0.007
Who-0.007
Token.
Token ID13
Feature activation+8.44e-01

Pos logit contributions
umbs+0.071
beh+0.068
lette+0.068
izza+0.065
igators+0.065
Neg logit contributions
 notice-0.021
 stage-0.018
 woke-0.017
 suddenly-0.017
But-0.017
Token knew
Token ID2993
Feature activation+2.23e-01

Pos logit contributions
 notice+0.043
 stage+0.041
 avail+0.041
Stop+0.040
Who+0.039
Neg logit contributions
lette-0.055
umbs-0.054
o-0.053
ocado-0.052
beh-0.052
Token how
Token ID703
Feature activation-9.74e-01

Pos logit contributions
 notice+0.006
 stage+0.006
 woke+0.006
But+0.006
Who+0.005
Neg logit contributions
umbs-0.010
beh-0.010
lette-0.010
igators-0.010
izza-0.010
Token special
Token ID2041
Feature activation+8.97e-01

Pos logit contributions
umbs+0.045
lette+0.041
beh+0.040
izza+0.039
icking+0.039
Neg logit contributions
 notice-0.031
But-0.029
 stage-0.029
 woke-0.029
 suddenly-0.029
Token it
Token ID340
Feature activation+2.51e-01

Pos logit contributions
 notice+0.023
La+0.020
 stage+0.020
Who+0.020
But+0.020
Neg logit contributions
umbs-0.043
beh-0.042
igators-0.042
lette-0.042
izza-0.039
Token was
Token ID373
Feature activation-3.00e-01

Pos logit contributions
 notice+0.005
 stage+0.005
But+0.005
Who+0.005
La+0.005
Neg logit contributions
umbs-0.013
beh-0.013
lette-0.013
igators-0.012
izza-0.012
Token for
Token ID329
Feature activation+8.62e-01

Pos logit contributions
umbs+0.016
beh+0.016
lette+0.016
izza+0.015
igators+0.015
Neg logit contributions
 notice-0.006
 stage-0.006
But-0.006
 woke-0.005
 suddenly-0.005
Token her
Token ID607
Feature activation-6.72e-02

Pos logit contributions
 notice+0.022
 woke+0.021
La+0.021
Who+0.021
But+0.020
Neg logit contributions
umbs-0.040
lette-0.039
beh-0.039
allows-0.037
 equal-0.037
Token daughter
Token ID4957
Feature activation-7.76e-01

Pos logit contributions
umbs+0.003
beh+0.003
lette+0.003
izza+0.003
igators+0.003
Neg logit contributions
 notice-0.002
 stage-0.002
But-0.002
 woke-0.002
Who-0.001
Token.
Token ID13
Feature activation-3.88e-01

Pos logit contributions
umbs+0.042
beh+0.041
lette+0.040
igators+0.039
izza+0.039
Neg logit contributions
 notice-0.017
 stage-0.015
 woke-0.015
But-0.014
 suddenly-0.014
Token
Token ID220
Feature activation-1.30e+00

Pos logit contributions
umbs+0.018
beh+0.018
lette+0.017
igators+0.017
izza+0.017
Neg logit contributions
 notice-0.011
 stage-0.010
But-0.010
 woke-0.010
Who-0.010
Token
Token ID198
Feature activation+2.42e-02

Pos logit contributions
beh+0.074
umbs+0.073
lette+0.072
igators+0.071
melon+0.068
Neg logit contributions
 notice-0.022
But-0.021
 stage-0.019
 woke-0.019
 suddenly-0.018
Token at
Token ID379
Feature activation-1.03e+00

Pos logit contributions
umbs+0.030
beh+0.028
lette+0.028
izza+0.027
igators+0.026
Neg logit contributions
 notice-0.019
 stage-0.018
 woke-0.017
But-0.017
 suddenly-0.017
Token the
Token ID262
Feature activation-9.44e-01

Pos logit contributions
umbs+0.058
beh+0.057
lette+0.056
ocado+0.054
illing+0.053
Neg logit contributions
 notice-0.021
 stage-0.021
 suddenly-0.020
But-0.020
 woke-0.019
Token squirrel
Token ID33039
Feature activation-3.37e+00

Pos logit contributions
beh+0.046
umbs+0.046
lette+0.043
izza+0.041
icking+0.040
Neg logit contributions
 stage-0.028
 notice-0.027
Who-0.026
 woke-0.026
But-0.026
Token and
Token ID290
Feature activation+5.64e-01

Pos logit contributions
beh+0.214
umbs+0.213
igators+0.204
wy+0.201
asp+0.199
Neg logit contributions
 again-0.059
 suddenly-0.052
 stage-0.049
,-0.046
 woke-0.040
Token asked
Token ID1965
Feature activation-2.75e-02

Pos logit contributions
 notice+0.015
 stage+0.014
La+0.013
But+0.013
Stop+0.013
Neg logit contributions
umbs-0.026
lette-0.026
beh-0.025
izza-0.025
igators-0.024
Token if
Token ID611
Feature activation-4.13e-01

Pos logit contributions
umbs+0.001
beh+0.001
lette+0.001
izza+0.001
igators+0.001
Neg logit contributions
 notice-0.001
 stage-0.001
But-0.001
 woke-0.001
Who-0.001
Token he
Token ID339
Feature activation-3.67e-02

Pos logit contributions
umbs+0.022
beh+0.021
lette+0.021
izza+0.020
igators+0.020
Neg logit contributions
 notice-0.009
 stage-0.009
But-0.009
 woke-0.008
 suddenly-0.008
Token had
Token ID550
Feature activation+1.47e-01

Pos logit contributions
umbs+0.002
beh+0.002
lette+0.002
izza+0.002
igators+0.002
Neg logit contributions
 notice-0.001
 stage-0.001
But-0.001
 woke-0.001
Who-0.001
Token any
Token ID597
Feature activation-9.65e-01

Pos logit contributions
 notice+0.005
 stage+0.005
But+0.005
Who+0.005
 woke+0.005
Neg logit contributions
umbs-0.006
beh-0.006
lette-0.006
izza-0.005
igators-0.005
Token money
Token ID1637
Feature activation-8.64e-01

Pos logit contributions
umbs+0.039
beh+0.037
lette+0.035
ocado+0.034
izza+0.033
Neg logit contributions
 notice-0.036
 stage-0.035
 school-0.033
But-0.033
 suddenly-0.033
Token.
Token ID13
Feature activation-9.21e-01

Pos logit contributions
umbs+0.048
beh+0.046
lette+0.045
igators+0.044
izza+0.043
Neg logit contributions
 notice-0.018
 stage-0.016
But-0.016
 woke-0.016
 suddenly-0.016
Token felt
Token ID2936
Feature activation-1.04e+00

Pos logit contributions
umbs+0.025
beh+0.025
lette+0.024
izza+0.023
igators+0.023
Neg logit contributions
 notice-0.011
 woke-0.010
 stage-0.010
But-0.010
 suddenly-0.010
Token bad
Token ID2089
Feature activation+3.58e-01

Pos logit contributions
umbs+0.047
lette+0.043
beh+0.043
izza+0.043
icking+0.041
Neg logit contributions
 notice-0.036
But-0.033
 woke-0.033
 suddenly-0.032
 stage-0.031
Token too
Token ID1165
Feature activation-6.37e-01

Pos logit contributions
 notice+0.009
 stage+0.009
La+0.008
But+0.008
Who+0.008
Neg logit contributions
umbs-0.017
beh-0.017
lette-0.017
izza-0.016
icking-0.016
Token.
Token ID13
Feature activation-7.09e-01

Pos logit contributions
umbs+0.039
beh+0.037
lette+0.037
igators+0.036
izza+0.036
Neg logit contributions
 notice-0.010
 stage-0.009
 woke-0.009
But-0.009
 suddenly-0.008
Token But
Token ID887
Feature activation+6.60e-01

Pos logit contributions
umbs+0.033
beh+0.032
lette+0.032
igators+0.030
icking+0.030
Neg logit contributions
 notice-0.021
But-0.020
 woke-0.020
 stage-0.019
Who-0.019
Token it
Token ID340
Feature activation-8.77e-01

Pos logit contributions
 notice+0.019
 stage+0.019
La+0.019
Stop+0.018
Who+0.017
Neg logit contributions
beh-0.027
lette-0.027
umbs-0.026
melon-0.026
icking-0.026
Token was
Token ID373
Feature activation-1.60e+00

Pos logit contributions
umbs+0.055
beh+0.052
lette+0.051
izza+0.050
igators+0.049
Neg logit contributions
 woke-0.015
 suddenly-0.014
 notice-0.014
 heard-0.013
But-0.013
Token too
Token ID1165
Feature activation-1.17e+00

Pos logit contributions
umbs+0.082
lette+0.073
izza+0.073
beh+0.073
itt+0.071
Neg logit contributions
 suddenly-0.048
 notice-0.048
But-0.047
 again-0.045
 heard-0.043
Token late
Token ID2739
Feature activation+1.71e+00

Pos logit contributions
umbs+0.053
lette+0.052
beh+0.052
igators+0.049
rol+0.049
Neg logit contributions
 notice-0.039
But-0.035
 woke-0.035
 heard-0.033
 suddenly-0.033
Token to
Token ID284
Feature activation+5.86e-01

Pos logit contributions
La+0.061
Who+0.060
 notice+0.059
Raw+0.059
Stop+0.057
Neg logit contributions
icking-0.063
lette-0.061
izza-0.059
beh-0.059
o-0.058
Token say
Token ID910
Feature activation+4.84e-01

Pos logit contributions
 stage+0.018
But+0.018
 notice+0.018
Who+0.017
La+0.017
Neg logit contributions
umbs-0.023
beh-0.023
lette-0.023
igators-0.022
izza-0.021
Token it
Token ID340
Feature activation+6.64e-01

Pos logit contributions
umbs+0.029
lette+0.028
beh+0.028
izza+0.026
ocado+0.026
Neg logit contributions
 notice-0.016
 stage-0.015
But-0.014
 woke-0.014
 suddenly-0.014
Token left
Token ID1364
Feature activation+5.80e-01

Pos logit contributions
 notice+0.017
 stage+0.017
But+0.016
Who+0.015
Stop+0.015
Neg logit contributions
umbs-0.030
beh-0.030
igators-0.029
lette-0.029
o-0.028
Token a
Token ID257
Feature activation+1.47e+00

Pos logit contributions
 notice+0.016
Who+0.015
La+0.014
 stage+0.014
But+0.014
Neg logit contributions
umbs-0.027
lette-0.026
beh-0.026
igators-0.025
izza-0.025
Token trail
Token ID8025
Feature activation+3.09e-01

Pos logit contributions
 notice+0.054
Stop+0.054
 woke+0.054
Who+0.053
La+0.053
Neg logit contributions
 dollar-0.046
 equal-0.044
ocado-0.044
umbs-0.043
lette-0.043
Token of
Token ID286
Feature activation+3.66e-01

Pos logit contributions
 notice+0.010
Who+0.009
 stage+0.009
But+0.009
 woke+0.009
Neg logit contributions
umbs-0.013
beh-0.013
lette-0.013
igators-0.012
icking-0.012
Token glitter
Token ID31133
Feature activation-8.90e-01

Pos logit contributions
 notice+0.013
But+0.012
 woke+0.012
 stage+0.012
 suddenly+0.012
Neg logit contributions
umbs-0.014
beh-0.014
igators-0.013
lette-0.013
izza-0.013
Tokening
Token ID278
Feature activation+9.12e-01

Pos logit contributions
umbs+0.048
beh+0.047
lette+0.046
izza+0.045
igators+0.044
Neg logit contributions
 notice-0.019
 stage-0.018
 woke-0.018
 suddenly-0.017
But-0.017
Token spark
Token ID9009
Feature activation+5.85e-01

Pos logit contributions
 notice+0.029
Who+0.028
But+0.027
La+0.027
 woke+0.026
Neg logit contributions
umbs-0.037
igators-0.037
beh-0.036
lette-0.036
melon-0.035
Tokenles
Token ID829
Feature activation-9.46e-01

Pos logit contributions
 notice+0.018
But+0.017
Who+0.016
 stage+0.016
La+0.016
Neg logit contributions
umbs-0.026
beh-0.025
lette-0.025
igators-0.024
ishes-0.024
Token!
Token ID0
Feature activation-1.32e+00

Pos logit contributions
umbs+0.050
beh+0.049
lette+0.048
izza+0.047
igators+0.046
Neg logit contributions
 notice-0.021
 stage-0.020
But-0.019
 woke-0.019
 suddenly-0.019
Token
Token ID220
Feature activation-1.36e+00

Pos logit contributions
beh+0.066
lette+0.064
umbs+0.061
icking+0.060
 supposed+0.059
Neg logit contributions
 stage-0.035
But-0.035
 notice-0.035
 woke-0.034
�-0.033
Token poke
Token ID22620
Feature activation+1.08e-01

Pos logit contributions
 stage+0.008
 notice+0.008
 woke+0.007
But+0.007
La+0.007
Neg logit contributions
umbs-0.014
beh-0.013
lette-0.013
igators-0.012
izza-0.012
Token their
Token ID511
Feature activation-1.00e-01

Pos logit contributions
 notice+0.003
 stage+0.003
 woke+0.003
But+0.003
Who+0.003
Neg logit contributions
umbs-0.005
beh-0.005
lette-0.005
igators-0.005
izza-0.004
Token home
Token ID1363
Feature activation+1.19e-01

Pos logit contributions
umbs+0.004
beh+0.004
lette+0.004
izza+0.004
igators+0.004
Neg logit contributions
 notice-0.003
 stage-0.003
But-0.003
 woke-0.003
Who-0.003
Token again
Token ID757
Feature activation+1.32e+00

Pos logit contributions
 notice+0.002
 stage+0.002
 woke+0.002
But+0.002
Who+0.002
Neg logit contributions
umbs-0.007
beh-0.006
lette-0.006
igators-0.006
izza-0.006
Token.
Token ID13
Feature activation-5.23e-01

Pos logit contributions
 stage+0.042
icol+0.042
 woke+0.040
 notice+0.040
Stop+0.040
Neg logit contributions
umbs-0.054
beh-0.051
 equal-0.050
lette-0.050
igators-0.049
Token I
Token ID314
Feature activation-3.32e-01

Pos logit contributions
umbs+0.023
beh+0.022
lette+0.022
igators+0.021
icking+0.021
Neg logit contributions
 notice-0.016
But-0.016
 woke-0.015
 stage-0.015
Who-0.015
Token will
Token ID481
Feature activation+7.66e-01

Pos logit contributions
umbs+0.014
beh+0.014
lette+0.014
izza+0.013
igators+0.013
Neg logit contributions
 notice-0.011
But-0.010
 woke-0.010
 stage-0.010
 suddenly-0.010
Token be
Token ID307
Feature activation-9.73e-01

Pos logit contributions
 stage+0.025
La+0.025
Who+0.025
chool+0.024
 woke+0.024
Neg logit contributions
lette-0.029
umbs-0.028
beh-0.028
 equal-0.026
icking-0.026
Token nice
Token ID3621
Feature activation+1.12e-01

Pos logit contributions
umbs+0.042
izza+0.038
oun+0.036
lette+0.036
beh+0.036
Neg logit contributions
 notice-0.036
But-0.035
 heard-0.032
 suddenly-0.032
 stage-0.032
Token to
Token ID284
Feature activation+1.10e+00

Pos logit contributions
 notice+0.003
 stage+0.003
But+0.003
 woke+0.003
Who+0.003
Neg logit contributions
umbs-0.006
beh-0.005
lette-0.005
izza-0.005
igators-0.005
Token the
Token ID262
Feature activation-2.81e-01

Pos logit contributions
 imagined+0.039
 notice+0.039
 ti+0.039
 woke+0.038
 stage+0.038
Neg logit contributions
igators-0.038
beh-0.037
lette-0.036
umbs-0.036
melon-0.034
Token voice
Token ID3809
Feature activation+1.68e+00

Pos logit contributions
 notice+0.024
 imagined+0.023
But+0.022
 stage+0.022
 recess+0.022
Neg logit contributions
umbs-0.038
lette-0.037
beh-0.037
izza-0.036
igators-0.035
Token.
Token ID13
Feature activation+2.61e-01

Pos logit contributions
But+0.053
Stop+0.053
 notice+0.052
La+0.052
Raw+0.052
Neg logit contributions
lette-0.064
beh-0.063
umbs-0.061
icking-0.059
igators-0.059
Token
Token ID198
Feature activation+4.89e-01

Pos logit contributions
 notice+0.009
 stage+0.008
But+0.008
 woke+0.008
 imagined+0.008
Neg logit contributions
umbs-0.011
lette-0.010
beh-0.010
izza-0.010
igators-0.010
Token
Token ID198
Feature activation-2.13e-01

Pos logit contributions
 notice+0.014
 stage+0.013
 imagined+0.013
 woke+0.013
 suddenly+0.012
Neg logit contributions
umbs-0.022
lette-0.021
beh-0.021
izza-0.020
igators-0.020
Token"
Token ID1
Feature activation+6.34e-01

Pos logit contributions
umbs+0.010
beh+0.009
lette+0.009
izza+0.009
igators+0.009
Neg logit contributions
 notice-0.006
 stage-0.006
But-0.006
 woke-0.006
Who-0.006
TokenWho
Token ID8241
Feature activation-1.18e+00

Pos logit contributions
 stage+0.014
 notice+0.013
 woke+0.011
 imagined+0.011
 school+0.011
Neg logit contributions
umbs-0.034
beh-0.033
lette-0.033
igators-0.032
ocado-0.031
Tokena
Token ID64
Feature activation+2.58e-01

Pos logit contributions
umbs+0.052
beh+0.050
lette+0.048
oun+0.046
melon+0.046
Neg logit contributions
 woke-0.037
 notice-0.037
 heard-0.035
 stage-0.034
 imagined-0.034
Token there
Token ID612
Feature activation+4.55e-01

Pos logit contributions
 notice+0.005
 stage+0.005
But+0.005
Who+0.005
La+0.004
Neg logit contributions
umbs-0.014
lette-0.014
beh-0.014
izza-0.013
igators-0.013
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
 notice+0.008
But+0.007
 stage+0.007
La+0.007
Who+0.007
Neg logit contributions
umbs-0.026
lette-0.025
beh-0.024
igators-0.024
izza-0.024
Token little
Token ID1310
Feature activation+4.76e-01

Pos logit contributions
umbs+0.005
beh+0.005
lette+0.005
izza+0.004
igators+0.004
Neg logit contributions
 notice-0.003
 stage-0.003
But-0.003
 woke-0.003
Who-0.003
Token one
Token ID530
Feature activation+7.31e-01

Pos logit contributions
 notice+0.015
But+0.014
 stage+0.014
 woke+0.013
La+0.013
Neg logit contributions
umbs-0.020
lette-0.020
beh-0.020
igators-0.019
izza-0.019
Token the
Token ID262
Feature activation-6.33e-01

Pos logit contributions
 notice+0.018
 stage+0.018
But+0.018
 woke+0.017
 imagined+0.017
Neg logit contributions
umbs-0.024
beh-0.024
lette-0.023
igators-0.023
izza-0.022
Token old
Token ID1468
Feature activation+3.13e-01

Pos logit contributions
umbs+0.033
beh+0.032
lette+0.030
igators+0.029
izza+0.029
Neg logit contributions
 notice-0.016
 stage-0.015
But-0.014
 woke-0.014
Who-0.014
Token woman
Token ID2415
Feature activation-1.58e+00

Pos logit contributions
 notice+0.006
But+0.005
 stage+0.005
 woke+0.005
La+0.005
Neg logit contributions
umbs-0.017
lette-0.017
beh-0.017
izza-0.017
igators-0.016
Token for
Token ID329
Feature activation-8.14e-02

Pos logit contributions
umbs+0.093
beh+0.089
lette+0.085
igators+0.085
izza+0.083
Neg logit contributions
 notice-0.028
 woke-0.027
 suddenly-0.027
 stage-0.026
 again-0.026
Token giving
Token ID3501
Feature activation-6.63e-01

Pos logit contributions
umbs+0.003
beh+0.003
lette+0.003
izza+0.003
igators+0.003
Neg logit contributions
 notice-0.003
 stage-0.003
But-0.003
 woke-0.003
Who-0.002
Token him
Token ID683
Feature activation-3.03e+00

Pos logit contributions
umbs+0.034
beh+0.032
lette+0.032
izza+0.031
igators+0.031
Neg logit contributions
 notice-0.017
 stage-0.015
But-0.015
 woke-0.015
 suddenly-0.015
Token this
Token ID428
Feature activation-8.71e-01

Pos logit contributions
umbs+0.177
beh+0.161
illing+0.159
lette+0.159
oun+0.158
Neg logit contributions
 again-0.055
 notice-0.054
 school-0.054
 stage-0.052
 heard-0.052
Token magical
Token ID10883
Feature activation+8.41e-01

Pos logit contributions
umbs+0.036
beh+0.035
lette+0.033
igators+0.032
izza+0.032
Neg logit contributions
 notice-0.030
 stage-0.029
 woke-0.028
But-0.028
 suddenly-0.027
Token spray
Token ID11662
Feature activation-3.17e-01

Pos logit contributions
 notice+0.029
But+0.027
Who+0.027
Stop+0.026
La+0.026
Neg logit contributions
lette-0.033
umbs-0.033
beh-0.031
izza-0.031
 dollar-0.031
Token.
Token ID13
Feature activation-9.76e-01

Pos logit contributions
umbs+0.017
beh+0.016
lette+0.016
izza+0.015
igators+0.015
Neg logit contributions
 notice-0.007
 stage-0.007
But-0.006
 woke-0.006
Who-0.006
Token He
Token ID679
Feature activation-1.81e-02

Pos logit contributions
umbs+0.042
beh+0.042
lette+0.041
icking+0.039
igators+0.039
Neg logit contributions
 notice-0.031
But-0.030
 stage-0.029
 woke-0.028
Who-0.028
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
umbs+0.014
beh+0.014
lette+0.014
izza+0.013
igators+0.013
Neg logit contributions
 notice-0.005
 stage-0.005
But-0.005
 woke-0.005
Who-0.004
Token So
Token ID1406
Feature activation-1.29e-01

Pos logit contributions
beh+0.054
umbs+0.053
lette+0.052
icking+0.050
allows+0.049
Neg logit contributions
But-0.038
 notice-0.038
 stage-0.036
 woke-0.036
Who-0.036
Token,
Token ID11
Feature activation-8.02e-01

Pos logit contributions
umbs+0.007
beh+0.006
lette+0.006
izza+0.006
igators+0.006
Neg logit contributions
 notice-0.003
 stage-0.003
But-0.003
 woke-0.003
Who-0.003
Token he
Token ID339
Feature activation+1.78e-02

Pos logit contributions
umbs+0.043
beh+0.041
lette+0.041
izza+0.039
igators+0.038
Neg logit contributions
 notice-0.019
 stage-0.018
But-0.018
 suddenly-0.018
 woke-0.017
Token gave
Token ID2921
Feature activation-9.90e-01

Pos logit contributions
 notice+0.001
 stage+0.000
But+0.000
 woke+0.000
Who+0.000
Neg logit contributions
umbs-0.001
beh-0.001
lette-0.001
izza-0.001
igators-0.001
Token them
Token ID606
Feature activation-4.13e+00

Pos logit contributions
umbs+0.053
beh+0.051
lette+0.049
izza+0.048
illing+0.048
Neg logit contributions
 notice-0.023
 stage-0.022
 suddenly-0.021
 woke-0.021
But-0.021
Token all
Token ID477
Feature activation-3.04e+00

Pos logit contributions
umbs+0.253
beh+0.244
illing+0.238
umb+0.234
omp+0.233
Neg logit contributions
 school-0.068
 again-0.066
 dress-0.063
 stage-0.063
 heard-0.056
Token a
Token ID257
Feature activation-1.92e+00

Pos logit contributions
umbs+0.179
beh+0.178
lette+0.170
icking+0.168
ocado+0.166
Neg logit contributions
 woke-0.048
 heard-0.046
 again-0.045
 school-0.044
 notice-0.044
Token magical
Token ID10883
Feature activation+6.24e-02

Pos logit contributions
beh+0.100
umbs+0.092
igators+0.090
icking+0.090
lette+0.088
Neg logit contributions
 notice-0.054
 woke-0.052
 stage-0.052
 heard-0.049
 dress-0.049
Token emer
Token ID3165
Feature activation-8.95e-01

Pos logit contributions
 notice+0.002
But+0.002
 stage+0.002
 woke+0.002
Who+0.002
Neg logit contributions
umbs-0.002
lette-0.002
beh-0.002
izza-0.002
igators-0.002
Tokenald
Token ID1940
Feature activation-2.42e+00

Pos logit contributions
umbs+0.035
igators+0.032
beh+0.032
lette+0.030
illing+0.030
Neg logit contributions
 notice-0.033
But-0.031
 stage-0.031
 woke-0.031
 suddenly-0.030
Token territory
Token ID7674
Feature activation+1.46e+00

Pos logit contributions
umbs+0.001
beh+0.001
lette+0.001
igators+0.001
izza+0.001
Neg logit contributions
 notice-0.001
But-0.001
 stage-0.001
 woke-0.001
Who-0.001
Token again
Token ID757
Feature activation+1.43e+00

Pos logit contributions
 notice+0.025
Who+0.025
La+0.024
icol+0.024
 stage+0.024
Neg logit contributions
umbs-0.083
beh-0.077
lette-0.076
igators-0.076
icking-0.074
Token.
Token ID13
Feature activation-2.02e+00

Pos logit contributions
 stage+0.048
icol+0.045
 ti+0.044
Stop+0.044
La+0.043
Neg logit contributions
beh-0.056
o-0.055
umbs-0.055
lette-0.054
 equal-0.052
Token They
Token ID1119
Feature activation-1.50e+00

Pos logit contributions
beh+0.101
lette+0.098
ady+0.094
umbs+0.093
wy+0.091
Neg logit contributions
 woke-0.055
But-0.053
 notice-0.053
 again-0.051
 But-0.050
Token remembered
Token ID12086
Feature activation-5.92e-01

Pos logit contributions
umbs+0.081
lette+0.075
beh+0.075
izza+0.073
ady+0.071
Neg logit contributions
 woke-0.042
 notice-0.041
 heard-0.039
 imagined-0.038
 suddenly-0.037
Token Nem
Token ID22547
Feature activation-2.88e+00

Pos logit contributions
umbs+0.031
beh+0.029
lette+0.028
izza+0.027
igators+0.027
Neg logit contributions
 notice-0.016
 woke-0.014
 stage-0.014
But-0.014
 suddenly-0.014
Tokeno
Token ID78
Feature activation-2.97e+00

Pos logit contributions
igators+0.077
 equal+0.074
melon+0.072
izza+0.070
umbs+0.068
Neg logit contributions
 heard-0.137
 woke-0.133
 saw-0.130
 imagined-0.128
But-0.128
Token and
Token ID290
Feature activation-5.88e-01

Pos logit contributions
umbs+0.180
beh+0.174
oun+0.170
lette+0.168
izza+0.166
Neg logit contributions
 woke-0.057
 again-0.054
 heard-0.052
 saw-0.045
 suddenly-0.043
Token how
Token ID703
Feature activation-2.07e+00

Pos logit contributions
umbs+0.030
lette+0.028
beh+0.028
izza+0.027
ocado+0.026
Neg logit contributions
 notice-0.016
 woke-0.015
 stage-0.014
But-0.014
 suddenly-0.014
Token he
Token ID339
Feature activation-8.88e-01

Pos logit contributions
umbs+0.119
lette+0.107
ady+0.107
ction+0.105
umb+0.103
Neg logit contributions
 notice-0.051
 woke-0.049
 suddenly-0.048
But-0.047
 saw-0.046
Token loved
Token ID6151
Feature activation-4.57e-01

Pos logit contributions
umbs+0.047
lette+0.045
beh+0.044
izza+0.043
igators+0.042
Neg logit contributions
 notice-0.021
 woke-0.021
 imagined-0.019
 heard-0.019
 suddenly-0.019
Token two
Token ID734
Feature activation-3.07e+00

Pos logit contributions
beh+0.124
lette+0.109
igators+0.109
icking+0.109
ocado+0.107
Neg logit contributions
 suddenly-0.083
 school-0.082
 notice-0.082
 stage-0.081
 woke-0.081
Token best
Token ID1266
Feature activation-1.83e+00

Pos logit contributions
beh+0.148
umbs+0.139
cream+0.131
lette+0.131
izza+0.126
Neg logit contributions
 saw-0.099
 heard-0.099
 dress-0.092
 stage-0.091
 notice-0.090
Token friends
Token ID2460
Feature activation-1.02e+00

Pos logit contributions
umbs+0.075
lette+0.073
beh+0.072
igators+0.066
melon+0.065
Neg logit contributions
 dress-0.062
 stage-0.061
 notice-0.059
 school-0.059
 woke-0.057
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
umbs+0.065
beh+0.062
izza+0.061
lette+0.060
ocado+0.059
Neg logit contributions
 notice-0.015
 woke-0.014
But-0.014
 heard-0.014
 suddenly-0.013
Token They
Token ID1119
Feature activation-1.24e+00

Pos logit contributions
beh+0.049
umbs+0.047
lette+0.047
icking+0.045
known+0.043
Neg logit contributions
 notice-0.038
 woke-0.036
But-0.036
Who-0.035
 stage-0.035
Token were
Token ID547
Feature activation-3.68e+00

Pos logit contributions
umbs+0.070
beh+0.066
lette+0.064
izza+0.064
icking+0.061
Neg logit contributions
 notice-0.030
 woke-0.029
 heard-0.028
 imagined-0.027
 saw-0.027
Token both
Token ID1111
Feature activation-3.02e+00

Pos logit contributions
umbs+0.224
izza+0.204
beh+0.190
ction+0.186
uns+0.185
Neg logit contributions
 school-0.097
 dress-0.089
 saw-0.089
 stage-0.087
,-0.083
Token three
Token ID1115
Feature activation-2.08e+00

Pos logit contributions
umbs+0.180
izza+0.168
ishes+0.157
plings+0.152
igators+0.151
Neg logit contributions
 dress-0.075
 school-0.071
 stage-0.070
But-0.063
 saw-0.063
Token years
Token ID812
Feature activation-1.58e+00

Pos logit contributions
umbs+0.104
beh+0.104
lette+0.100
melon+0.099
izza+0.098
Neg logit contributions
 heard-0.051
 woke-0.050
 stage-0.050
 notice-0.049
 saw-0.047
Token old
Token ID1468
Feature activation+1.85e-02

Pos logit contributions
umbs+0.079
beh+0.078
lette+0.074
rol+0.070
 highlight+0.069
Neg logit contributions
 stage-0.048
 school-0.045
 notice-0.042
 woke-0.039
lf-0.039
Token.
Token ID13
Feature activation-9.65e-01

Pos logit contributions
 notice+0.000
 stage+0.000
But+0.000
 woke+0.000
Who+0.000
Neg logit contributions
umbs-0.001
lette-0.001
beh-0.001
izza-0.001
igators-0.001
Token the
Token ID262
Feature activation-6.45e-01

Pos logit contributions
 notice+0.047
Who+0.044
But+0.043
 woke+0.042
Stop+0.042
Neg logit contributions
umbs-0.050
igators-0.048
o-0.047
lette-0.046
 dollar-0.046
Token sea
Token ID5417
Feature activation-7.29e-01

Pos logit contributions
umbs+0.031
beh+0.031
lette+0.029
izza+0.028
igators+0.027
Neg logit contributions
 notice-0.019
 stage-0.019
But-0.018
 woke-0.017
 suddenly-0.017
Token and
Token ID290
Feature activation+1.20e+00

Pos logit contributions
umbs+0.044
beh+0.043
lette+0.042
izza+0.041
igators+0.041
Neg logit contributions
 notice-0.012
 stage-0.011
But-0.010
 woke-0.010
 suddenly-0.009
Token saw
Token ID2497
Feature activation-6.45e-01

Pos logit contributions
 notice+0.032
Stop+0.031
But+0.031
Who+0.031
 stage+0.030
Neg logit contributions
o-0.051
icking-0.049
lette-0.049
beh-0.049
known-0.048
Token all
Token ID477
Feature activation-2.96e+00

Pos logit contributions
umbs+0.028
beh+0.027
lette+0.026
izza+0.025
igators+0.025
Neg logit contributions
 notice-0.021
 stage-0.021
But-0.020
 woke-0.020
 suddenly-0.020
Token sorts
Token ID10524
Feature activation-2.84e+00

Pos logit contributions
izza+0.208
lette+0.192
inky+0.192
umbs+0.189
ocado+0.187
Neg logit contributions
 stage-0.045
 her-0.039
 school-0.038
 she-0.038
 it-0.035
Token of
Token ID286
Feature activation-1.17e+00

Pos logit contributions
izza+0.130
lette+0.130
ocado+0.125
umbs+0.124
ction+0.123
Neg logit contributions
 stage-0.077
 again-0.073
,-0.073
But-0.069
 she-0.069
Token wonderful
Token ID7932
Feature activation-2.07e-02

Pos logit contributions
umbs+0.054
lette+0.050
beh+0.050
izza+0.048
icking+0.048
Neg logit contributions
 notice-0.037
 stage-0.037
But-0.034
 woke-0.034
 imagined-0.034
Token things
Token ID1243
Feature activation-4.86e-01

Pos logit contributions
umbs+0.001
beh+0.001
lette+0.001
igators+0.001
izza+0.001
Neg logit contributions
 notice-0.001
 stage-0.001
But-0.001
 woke-0.001
Who-0.001
Token.
Token ID13
Feature activation-1.30e-01

Pos logit contributions
umbs+0.027
beh+0.026
lette+0.026
izza+0.025
igators+0.025
Neg logit contributions
 notice-0.010
 stage-0.009
 woke-0.009
But-0.009
 suddenly-0.008
Token Then
Token ID3244
Feature activation+1.52e+00

Pos logit contributions
umbs+0.006
beh+0.006
lette+0.006
izza+0.005
igators+0.005
Neg logit contributions
 notice-0.004
 stage-0.004
But-0.004
 woke-0.004
Who-0.004