TOP ACTIVATIONS
MAX = 5.687

ed her . Mum took a new , brilliant bucket ,
Her mom takes a piece of meat from the
." Sam took a step forward and bowed .
puppy , so she took a photo of it and showed
so he went to take a look . He saw a
so excited as she brought a big present into the living
So , her mom took a needle and thread and se
was excited and quickly took a bath . After he got
." Tim took a deep breath and said ,
he walked closer to take a better look . He was
sleepy . Can I take a nap now ?" asked Walter
so he decided to take a nap instead . He curled
his book and his bin ocular s in his backpack .
tired . So it took a nap in the shade of
, Lily 's mom took a photo of her with her
she ran over to take a closer look . She wanted
, she decided to take a deep breath and ask Max
said , as he snatched a red ball from Lily .
mom asked her to take a break and read a story
playing . Let me take a look and make it all

MIN ACTIVATIONS
MIN = -6.935

see ?" Mom my said , guiding Lily 's brush on
for cooking ," Mom said , smiling . She blew out
She ate popcorn and roasted marsh m allows .
it too ," she said , kissing them on their cheeks
a pig !" she said , holding up her paper .
be ep !" Tom says , moving his truck around the
good friend ," Tom said , smiling . S
each other ?" Mom said , smiling . L
be ep !" Ben says , pushing the buttons . He
, okay ?" mom says , hugging Anna .
it !" Mom my said , cl apping her hands .
, Ben ," she said , smiling . " I know
, wait !" he said , running after her . "
the swings ?" she said , feeling better .
go zoom ," Sara says , pointing to her blue car
try this !" she said , handing him the cup .
a car !" Ben said , showing his picture . "
for you !" Tom said , holding up his creation .
than you !" Tom said , scoop ing up a lot
this stuff !" Ben said , picking up a hat and

INTERVAL 2.532 - 5.687
CONTAINS 2.723%

" Look , Ben , look !" Jill shouted . "
brother came and tried to take the ball away . "
. They get a m itt and a spat
ran over to him , giving him a big hug .
She tried to reach the key , but she could not

INTERVAL -0.624 - 2.532
CONTAINS 66.292%

my couldn 't help but smile a little . Even though
that would stretch and stretch and stretch . He asked Sally
! <|endoftext|> Once upon a time , a boy wanted to
. They told her what happened . They said they were
an elephant ." Ma ia smiled and spent some time looking

INTERVAL -3.779 - -0.624
CONTAINS 30.612%

could do . He took the lot ion and
them . He said , " No , they are mine
sandbox . Sometimes they pretend to be animals and run and
Dave waited patiently for them to finish . After
. You are taller than me ." Ben said

INTERVAL -6.935 - -3.779
CONTAINS 0.373%

to stand up for ourselves , but not to be mean
she heard a loud scree ch . A big bird flew
. The car went backward , and Spot me owed louder
toys , books , clothes , and food . Ben is
pain . He spent the night in hospital , and he
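
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX, and the CONTAINS shares add up to 100%. The Python sketch below checks both against the printed values. The function name `equal_width_edges` is hypothetical, and the binning rule is inferred from the numbers rather than stated by the dashboard.

```python
# Values copied from the dashboard text above.
MIN_ACT = -6.935
MAX_ACT = 5.687
SHARES = [2.723, 66.292, 30.612, 0.373]  # CONTAINS percentages, top bin first


def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi (assumed rule)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Pair neighbouring edges into (low, high) intervals, highest bin first,
# to match the order in which the dashboard lists them.
intervals = list(zip(edges[:-1], edges[1:]))[::-1]

for (low, high), share in zip(intervals, SHARES):
    print(f"INTERVAL {low:.3f} - {high:.3f}  CONTAINS {share:.3f}%")

print(f"total share: {sum(SHARES):.3f}%")
```

Because MIN and MAX are themselves rounded to three decimals, the recomputed edges can differ from the printed ones in the last digit.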
Tokened
Token ID276
Feature activation+1.54e+00

Pos logit contributions
kin+0.003
ena+0.003
 scalp+0.003
 unison+0.003
 End+0.003
Neg logit contributions
 closer-0.002
ls-0.002
 back-0.001
s-0.001
 home-0.001
Token her
Token ID607
Feature activation+2.27e-01

Pos logit contributions
ls+0.028
 closer+0.027
 back+0.023
 groceries+0.023
s+0.022
Neg logit contributions
ena-0.043
 scalp-0.041
kin-0.041
 unison-0.040
 End-0.040
Token.
Token ID13
Feature activation+9.23e-01

Pos logit contributions
 closer+0.004
ls+0.004
 back+0.004
 home+0.003
 off+0.003
Neg logit contributions
ena-0.007
kin-0.007
 unison-0.007
 scalp-0.006
 End-0.006
Token Mum
Token ID38367
Feature activation+2.47e+00

Pos logit contributions
 closer+0.019
ls+0.018
s+0.017
 to+0.016
 back+0.016
Neg logit contributions
kin-0.027
 scalp-0.025
 unison-0.025
ities-0.024
ena-0.023
Token took
Token ID1718
Feature activation+2.36e+00

Pos logit contributions
 closer+0.050
ls+0.049
 back+0.045
s+0.042
 to+0.040
Neg logit contributions
kin-0.066
 scalp-0.065
 unison-0.064
ena-0.063
 End-0.061
Token a
Token ID257
Feature activation+5.69e+00

Pos logit contributions
ls+0.057
 closer+0.051
 groceries+0.051
s+0.049
erry+0.048
Neg logit contributions
ena-0.053
 scalp-0.043
era-0.042
 unison-0.042
uff-0.040
Token new
Token ID649
Feature activation+2.68e+00

Pos logit contributions
ls+0.122
elling+0.106
lers+0.105
beh+0.105
ishes+0.102
Neg logit contributions
kin-0.117
pron-0.109
 unison-0.104
 cou-0.103
 hallway-0.101
Token,
Token ID11
Feature activation-2.86e+00

Pos logit contributions
ls+0.066
s+0.055
 closer+0.054
lers+0.051
 unnecessary+0.051
Neg logit contributions
kin-0.063
 sensation-0.055
 scalp-0.053
 unison-0.053
 End-0.052
Token brilliant
Token ID10457
Feature activation+2.15e+00

Pos logit contributions
 scalp+0.073
ena+0.073
kin+0.072
 unison+0.069
 rarely+0.068
Neg logit contributions
ls-0.059
 closer-0.054
s-0.049
 back-0.046
erry-0.045
Token bucket
Token ID19236
Feature activation+1.34e+00

Pos logit contributions
ls+0.041
 closer+0.041
 back+0.035
s+0.034
 off+0.032
Neg logit contributions
kin-0.058
ena-0.057
 scalp-0.057
 unison-0.056
 End-0.055
Token,
Token ID11
Feature activation-3.64e+00

Pos logit contributions
 closer+0.024
ls+0.024
 back+0.021
s+0.020
 home+0.019
Neg logit contributions
kin-0.037
ena-0.037
 scalp-0.037
 unison-0.036
 End-0.035
Token\n
Token ID198
Feature activation+3.91e-01

Pos logit contributions
kin+0.016
ena+0.016
 scalp+0.016
 unison+0.016
 End+0.015
Neg logit contributions
 closer-0.022
ls-0.021
 back-0.019
 groceries-0.018
s-0.018
Token\n
Token ID198
Feature activation+8.24e-01

Pos logit contributions
 closer+0.010
ls+0.009
 back+0.009
s+0.008
 home+0.008
Neg logit contributions
kin-0.008
ena-0.008
 scalp-0.008
 unison-0.008
 End-0.008
TokenHer
Token ID9360
Feature activation-9.62e-01

Pos logit contributions
 closer+0.023
ls+0.022
 back+0.020
s+0.020
 off+0.019
Neg logit contributions
kin-0.016
ena-0.015
 scalp-0.015
 unison-0.015
ric-0.014
Token mom
Token ID1995
Feature activation+2.64e+00

Pos logit contributions
ena+0.017
kin+0.017
 scalp+0.016
 End+0.016
 unison+0.015
Neg logit contributions
ls-0.028
 closer-0.028
s-0.025
lers-0.024
 groceries-0.024
Token takes
Token ID2753
Feature activation+1.87e+00

Pos logit contributions
 closer+0.047
ls+0.046
 back+0.040
s+0.038
 off+0.035
Neg logit contributions
kin-0.077
 scalp-0.076
ena-0.075
 unison-0.074
 End-0.072
Token a
Token ID257
Feature activation+5.63e+00

Pos logit contributions
ls+0.051
erry+0.046
 closer+0.043
earned+0.043
s+0.042
Neg logit contributions
ena-0.029
 unison-0.026
 scalp-0.025
 End-0.025
kin-0.024
Token piece
Token ID3704
Feature activation+1.64e+00

Pos logit contributions
beh+0.114
ls+0.114
elling+0.114
lers+0.103
off+0.101
Neg logit contributions
kin-0.118
 hallway-0.105
 beard-0.102
 unison-0.101
 cou-0.100
Token of
Token ID286
Feature activation+1.61e-01

Pos logit contributions
ls+0.026
 closer+0.022
 groceries+0.020
 unnecessary+0.019
s+0.018
Neg logit contributions
kin-0.050
ena-0.049
 scalp-0.046
 End-0.045
 sensation-0.044
Token meat
Token ID6174
Feature activation-5.19e-01

Pos logit contributions
ls+0.004
 closer+0.004
 back+0.003
s+0.003
 nearer+0.003
Neg logit contributions
ena-0.004
 scalp-0.003
kin-0.003
 beard-0.003
 unison-0.003
Token from
Token ID422
Feature activation-6.38e-01

Pos logit contributions
ena+0.015
 unison+0.015
 scalp+0.015
kin+0.015
 End+0.015
Neg logit contributions
 closer-0.009
ls-0.009
 back-0.008
s-0.007
 home-0.007
Token the
Token ID262
Feature activation+7.45e-01

Pos logit contributions
ena+0.017
kin+0.017
 scalp+0.017
 unison+0.017
 End+0.017
Neg logit contributions
ls-0.012
 closer-0.012
 back-0.010
s-0.010
 home-0.009
Token."
Token ID526
Feature activation-8.90e-01

Pos logit contributions
ls+0.007
 closer+0.006
 groceries+0.006
 back+0.005
s+0.005
Neg logit contributions
ena-0.009
 End-0.009
 unison-0.009
 rarely-0.009
kin-0.009
Token\n
Token ID198
Feature activation+2.35e-01

Pos logit contributions
kin+0.020
ena+0.020
 scalp+0.020
 unison+0.019
 End+0.019
Neg logit contributions
 closer-0.021
ls-0.021
 back-0.019
s-0.018
 home-0.017
Token\n
Token ID198
Feature activation+5.07e-01

Pos logit contributions
 closer+0.006
ls+0.006
 back+0.005
 groceries+0.005
s+0.005
Neg logit contributions
ena-0.005
kin-0.005
 unison-0.005
 scalp-0.005
 End-0.004
TokenSam
Token ID16305
Feature activation+8.37e-01

Pos logit contributions
 closer+0.015
ls+0.015
 back+0.014
 groceries+0.013
 home+0.013
Neg logit contributions
ena-0.009
kin-0.008
 unison-0.008
 scalp-0.008
 End-0.008
Token took
Token ID1718
Feature activation+2.73e+00

Pos logit contributions
 closer+0.019
ls+0.018
 back+0.016
s+0.016
 off+0.015
Neg logit contributions
kin-0.020
 scalp-0.020
ena-0.020
 unison-0.020
 End-0.019
Token a
Token ID257
Feature activation+5.58e+00

Pos logit contributions
ls+0.078
erry+0.073
s+0.072
 groceries+0.068
use+0.066
Neg logit contributions
ena-0.053
 pretending-0.037
kin-0.037
 End-0.036
era-0.035
Token step
Token ID2239
Feature activation+2.30e+00

Pos logit contributions
ls+0.113
 groceries+0.106
beh+0.103
elling+0.102
ished+0.096
Neg logit contributions
kin-0.122
 beard-0.111
uren-0.110
 unison-0.110
 hallway-0.110
Token forward
Token ID2651
Feature activation-1.45e-01

Pos logit contributions
ls+0.037
 groceries+0.032
 unnecessary+0.032
 peanuts+0.028
 av+0.027
Neg logit contributions
ena-0.062
kin-0.056
uff-0.056
 lear-0.055
 End-0.054
Token and
Token ID290
Feature activation+2.30e+00

Pos logit contributions
ena+0.003
kin+0.003
 scalp+0.003
 unison+0.003
 End+0.003
Neg logit contributions
ls-0.003
 closer-0.003
 back-0.003
s-0.003
 groceries-0.003
Token bowed
Token ID35737
Feature activation+6.53e-01

Pos logit contributions
ls+0.053
 groceries+0.048
 closer+0.047
erry+0.047
s+0.044
Neg logit contributions
ena-0.051
kin-0.046
 unison-0.046
 scalp-0.046
 rarely-0.046
Token.
Token ID13
Feature activation+8.79e-01

Pos logit contributions
ls+0.010
 closer+0.008
 groceries+0.008
erry+0.007
s+0.006
Neg logit contributions
ena-0.021
 scalp-0.020
kin-0.020
 unison-0.019
 End-0.019
Token puppy
Token ID26188
Feature activation+2.43e+00

Pos logit contributions
ls+0.025
 closer+0.024
lers+0.020
erry+0.020
 nearer+0.019
Neg logit contributions
 End-0.034
 unison-0.034
kin-0.033
 scalp-0.033
ena-0.033
Token,
Token ID11
Feature activation-1.71e+00

Pos logit contributions
ls+0.067
 groceries+0.055
war+0.053
 closer+0.053
lers+0.053
Neg logit contributions
 always-0.041
 rarely-0.040
kin-0.039
 pretending-0.039
 scalp-0.038
Token so
Token ID523
Feature activation+2.38e+00

Pos logit contributions
ena+0.033
 scalp+0.032
 unison+0.031
 rarely+0.031
kin+0.031
Neg logit contributions
ls-0.046
 closer-0.043
 back-0.038
s-0.038
 groceries-0.037
Token she
Token ID673
Feature activation+1.30e+00

Pos logit contributions
 closer+0.051
ls+0.051
 back+0.045
s+0.043
 home+0.041
Neg logit contributions
kin-0.058
ena-0.058
 scalp-0.057
 unison-0.056
 End-0.055
Token took
Token ID1718
Feature activation+2.79e+00

Pos logit contributions
ls+0.033
 closer+0.031
 groceries+0.031
s+0.029
 attention+0.029
Neg logit contributions
ena-0.025
kin-0.024
 rarely-0.022
 scalp-0.022
 End-0.022
Token a
Token ID257
Feature activation+5.56e+00

Pos logit contributions
ls+0.082
erry+0.076
s+0.071
 closer+0.070
ooters+0.069
Neg logit contributions
ena-0.034
 End-0.032
 scalp-0.031
 only-0.029
 pretending-0.028
Token photo
Token ID4590
Feature activation+2.34e+00

Pos logit contributions
elling+0.119
ls+0.117
beh+0.112
lers+0.101
ished+0.101
Neg logit contributions
kin-0.113
 beard-0.102
pron-0.101
 hallway-0.095
 cou-0.094
Token of
Token ID286
Feature activation-2.37e-01

Pos logit contributions
ls+0.044
 unnecessary+0.036
 closer+0.036
 nearer+0.030
lers+0.030
Neg logit contributions
ena-0.063
kin-0.061
 scalp-0.059
 lear-0.057
ana-0.056
Token it
Token ID340
Feature activation+1.64e+00

Pos logit contributions
ena+0.006
 scalp+0.006
kin+0.006
 unison+0.006
 End+0.006
Neg logit contributions
ls-0.005
 closer-0.005
 back-0.004
s-0.004
 groceries-0.004
Token and
Token ID290
Feature activation+2.01e+00

Pos logit contributions
ls+0.039
 closer+0.035
 groceries+0.031
erry+0.031
s+0.030
Neg logit contributions
kin-0.035
ena-0.032
 scalp-0.032
 unison-0.031
 End-0.030
Token showed
Token ID3751
Feature activation+2.90e+00

Pos logit contributions
ls+0.050
 closer+0.047
erry+0.043
s+0.042
 back+0.042
Neg logit contributions
ena-0.041
 scalp-0.040
kin-0.039
 unison-0.039
 rarely-0.038
Token so
Token ID523
Feature activation+1.47e+00

Pos logit contributions
kin+0.070
 scalp+0.067
ena+0.067
 unison+0.066
 End+0.065
Neg logit contributions
 closer-0.058
ls-0.054
 back-0.050
s-0.046
 to-0.046
Token he
Token ID339
Feature activation+2.13e-01

Pos logit contributions
 closer+0.025
ls+0.024
 back+0.021
s+0.019
 home+0.018
Neg logit contributions
kin-0.044
 scalp-0.044
ena-0.044
 unison-0.043
 End-0.042
Token went
Token ID1816
Feature activation+1.80e+00

Pos logit contributions
ls+0.005
 closer+0.005
 groceries+0.005
 back+0.005
s+0.004
Neg logit contributions
ena-0.004
kin-0.004
 scalp-0.004
 rarely-0.004
 unison-0.004
Token to
Token ID284
Feature activation+5.88e-01

Pos logit contributions
ls+0.024
lers+0.018
 groceries+0.017
seed+0.016
erry+0.016
Neg logit contributions
ena-0.057
 End-0.055
 pretending-0.054
kin-0.052
 barely-0.051
Token take
Token ID1011
Feature activation+3.14e+00

Pos logit contributions
ls+0.014
lers+0.011
erry+0.011
s+0.011
 closer+0.010
Neg logit contributions
ena-0.014
 unison-0.013
 rarely-0.012
 End-0.012
 pretending-0.012
Token a
Token ID257
Feature activation+5.56e+00

Pos logit contributions
ls+0.095
s+0.086
erry+0.081
lers+0.076
 closer+0.075
Neg logit contributions
ena-0.045
 pretending-0.038
kin-0.038
 End-0.035
 scalp-0.034
Token look
Token ID804
Feature activation+3.03e+00

Pos logit contributions
elling+0.084
beh+0.083
ls+0.081
ishes+0.076
ups+0.072
Neg logit contributions
kin-0.126
pron-0.123
 pretending-0.120
 beard-0.115
OM-0.114
Token.
Token ID13
Feature activation+6.28e-01

Pos logit contributions
 closer+0.071
ls+0.071
Lost+0.070
 unnecessary+0.070
 groceries+0.069
Neg logit contributions
ena-0.045
kin-0.044
 ago-0.042
 End-0.040
aron-0.037
Token He
Token ID679
Feature activation-2.49e-01

Pos logit contributions
 closer+0.014
ls+0.013
 back+0.012
s+0.012
 to+0.012
Neg logit contributions
kin-0.016
 scalp-0.015
ena-0.015
 unison-0.015
 End-0.014
Token saw
Token ID2497
Feature activation+9.03e-01

Pos logit contributions
ena+0.006
kin+0.005
 unison+0.005
 scalp+0.005
 rarely+0.005
Neg logit contributions
ls-0.006
 closer-0.006
 back-0.005
 groceries-0.005
s-0.005
Token a
Token ID257
Feature activation+2.25e+00

Pos logit contributions
ls+0.021
 closer+0.020
s+0.018
 back+0.018
erry+0.018
Neg logit contributions
ena-0.020
kin-0.018
 unison-0.018
 End-0.018
 scalp-0.017
Token so
Token ID523
Feature activation+1.43e+00

Pos logit contributions
kin+0.020
 unison+0.019
 scalp+0.019
ities+0.018
ena+0.018
Neg logit contributions
 closer-0.016
 back-0.015
 home-0.014
 to-0.014
 out-0.014
Token excited
Token ID6568
Feature activation+1.16e+00

Pos logit contributions
 closer+0.029
ls+0.027
 back+0.024
 to+0.024
s+0.023
Neg logit contributions
 unison-0.040
 scalp-0.040
kin-0.039
ities-0.039
ena-0.039
Token as
Token ID355
Feature activation+1.68e+00

Pos logit contributions
 closer+0.017
ls+0.017
 back+0.014
s+0.012
 groceries+0.012
Neg logit contributions
ena-0.036
kin-0.035
 scalp-0.035
 unison-0.035
 End-0.034
Token she
Token ID673
Feature activation-4.21e-01

Pos logit contributions
 closer+0.037
ls+0.036
 back+0.032
s+0.031
 groceries+0.030
Neg logit contributions
ena-0.040
kin-0.040
 scalp-0.039
 unison-0.039
 End-0.038
Token brought
Token ID3181
Feature activation+2.94e+00

Pos logit contributions
 scalp+0.010
kin+0.010
ena+0.010
 unison+0.010
 End+0.010
Neg logit contributions
 closer-0.009
ls-0.009
 back-0.008
s-0.008
 home-0.007
Token a
Token ID257
Feature activation+5.53e+00

Pos logit contributions
ls+0.078
 closer+0.073
 nearer+0.066
 unnecessary+0.063
erry+0.061
Neg logit contributions
ena-0.049
 unison-0.045
ana-0.043
 creating-0.042
 always-0.042
Token big
Token ID1263
Feature activation+4.62e+00

Pos logit contributions
ls+0.120
 aback+0.108
 unnecessary+0.101
war+0.100
beh+0.099
Neg logit contributions
 unison-0.125
kin-0.117
pron-0.114
�-0.108
lf-0.104
Token present
Token ID1944
Feature activation+2.37e+00

Pos logit contributions
ls+0.106
 closer+0.100
s+0.087
 off+0.086
 nearer+0.085
Neg logit contributions
kin-0.104
 unison-0.104
 End-0.096
 scalp-0.093
ena-0.092
Token into
Token ID656
Feature activation-3.61e-01

Pos logit contributions
ls+0.047
 closer+0.045
 back+0.037
 groceries+0.037
s+0.036
Neg logit contributions
ena-0.062
kin-0.062
 scalp-0.059
 unison-0.059
 End-0.055
Token the
Token ID262
Feature activation+1.43e+00

Pos logit contributions
ena+0.009
 unison+0.009
kin+0.008
 scalp+0.008
 End+0.008
Neg logit contributions
ls-0.008
 closer-0.008
 back-0.007
s-0.007
erry-0.006
Token living
Token ID2877
Feature activation-3.84e+00

Pos logit contributions
ls+0.039
 closer+0.036
s+0.032
 nearer+0.031
 off+0.030
Neg logit contributions
 unison-0.028
 End-0.028
ena-0.027
kin-0.026
 scalp-0.026
Token So
Token ID1406
Feature activation+1.53e-01

Pos logit contributions
 closer+0.030
ls+0.029
 back+0.028
s+0.027
 off+0.026
Neg logit contributions
kin-0.031
ena-0.031
 unison-0.030
 scalp-0.030
 End-0.029
Token,
Token ID11
Feature activation+5.95e-01

Pos logit contributions
 closer+0.003
ls+0.003
 back+0.002
s+0.002
 off+0.002
Neg logit contributions
ena-0.004
 unison-0.004
kin-0.004
 scalp-0.004
ric-0.004
Token her
Token ID607
Feature activation-1.95e+00

Pos logit contributions
 closer+0.011
ls+0.011
 back+0.010
s+0.010
 off+0.009
Neg logit contributions
ena-0.016
 unison-0.016
kin-0.016
 scalp-0.016
ric-0.016
Token mom
Token ID1995
Feature activation+3.04e+00

Pos logit contributions
kin+0.043
ena+0.043
 scalp+0.043
 unison+0.041
 End+0.041
Neg logit contributions
 closer-0.047
ls-0.047
 back-0.041
s-0.040
 off-0.039
Token took
Token ID1718
Feature activation+2.48e+00

Pos logit contributions
ls+0.065
 closer+0.062
 back+0.054
 groceries+0.053
s+0.052
Neg logit contributions
ena-0.077
kin-0.074
 scalp-0.072
 unison-0.071
 End-0.071
Token a
Token ID257
Feature activation+5.52e+00

Pos logit contributions
ls+0.068
 closer+0.062
erry+0.058
 nearer+0.058
earned+0.055
Neg logit contributions
 scalp-0.038
ena-0.038
 End-0.035
 unison-0.034
 lear-0.033
Token needle
Token ID17598
Feature activation+1.14e+00

Pos logit contributions
ls+0.122
beh+0.107
ished+0.106
 aback+0.102
elling+0.102
Neg logit contributions
kin-0.115
pron-0.111
 unison-0.104
 hallway-0.102
lf-0.102
Token and
Token ID290
Feature activation+2.85e+00

Pos logit contributions
ls+0.028
 closer+0.026
 groceries+0.023
s+0.023
 back+0.022
Neg logit contributions
ena-0.025
kin-0.025
 scalp-0.024
 unison-0.023
 End-0.022
Token thread
Token ID4704
Feature activation+6.81e-01

Pos logit contributions
ls+0.072
erry+0.068
 closer+0.066
away+0.064
off+0.064
Neg logit contributions
 scalp-0.050
kin-0.048
ena-0.048
 pretending-0.043
 cou-0.043
Token and
Token ID290
Feature activation+1.93e+00

Pos logit contributions
 closer+0.011
ls+0.011
 back+0.008
 groceries+0.008
s+0.008
Neg logit contributions
kin-0.020
ena-0.020
 scalp-0.020
 unison-0.020
 End-0.019
Token se
Token ID384
Feature activation+3.29e+00

Pos logit contributions
ls+0.050
 closer+0.046
erry+0.044
s+0.042
 back+0.041
Neg logit contributions
 scalp-0.037
kin-0.037
ena-0.036
 rarely-0.032
 unison-0.032
Token was
Token ID373
Feature activation-2.81e+00

Pos logit contributions
 closer+0.002
s+0.002
 back+0.002
ls+0.002
 away+0.002
Neg logit contributions
kin-0.003
 scalp-0.003
ities-0.003
 unison-0.003
ric-0.003
Token excited
Token ID6568
Feature activation+1.17e+00

Pos logit contributions
kin+0.088
OM+0.087
 scalp+0.085
 unison+0.084
ities+0.084
Neg logit contributions
 closer-0.049
 back-0.046
 up-0.038
 to-0.037
 off-0.037
Token and
Token ID290
Feature activation+2.32e+00

Pos logit contributions
 closer+0.022
ls+0.021
 back+0.020
s+0.019
 to+0.019
Neg logit contributions
kin-0.033
 scalp-0.032
ena-0.032
 unison-0.032
 End-0.031
Token quickly
Token ID2952
Feature activation+2.09e+00

Pos logit contributions
 closer+0.050
ls+0.049
 back+0.043
s+0.041
 home+0.040
Neg logit contributions
kin-0.057
ena-0.057
 scalp-0.056
 unison-0.055
 End-0.054
Token took
Token ID1718
Feature activation+2.64e+00

Pos logit contributions
 closer+0.046
ls+0.046
 back+0.041
s+0.039
 home+0.038
Neg logit contributions
kin-0.050
ena-0.050
 scalp-0.049
 unison-0.049
 End-0.047
Token a
Token ID257
Feature activation+5.50e+00

Pos logit contributions
ls+0.074
s+0.068
erry+0.064
 closer+0.062
lers+0.058
Neg logit contributions
ena-0.046
kin-0.046
 scalp-0.041
 unison-0.039
 End-0.039
Token bath
Token ID7837
Feature activation-5.48e-01

Pos logit contributions
ls+0.126
beh+0.121
earned+0.118
elling+0.114
 av+0.114
Neg logit contributions
kin-0.118
 unison-0.098
pron-0.096
 cou-0.086
 hallway-0.086
Token.
Token ID13
Feature activation+9.82e-01

Pos logit contributions
ena+0.017
 unison+0.016
 rarely+0.016
 End+0.016
 scalp+0.016
Neg logit contributions
 closer-0.009
 back-0.009
ls-0.008
 to-0.008
 home-0.008
Token After
Token ID2293
Feature activation-1.21e+00

Pos logit contributions
 closer+0.021
ls+0.020
 back+0.019
s+0.018
 home+0.017
Neg logit contributions
kin-0.025
ena-0.025
 scalp-0.024
 unison-0.024
 End-0.023
Token he
Token ID339
Feature activation-7.88e-03

Pos logit contributions
 scalp+0.038
ena+0.038
kin+0.037
 unison+0.037
ric+0.036
Neg logit contributions
 closer-0.019
ls-0.018
 back-0.017
 to-0.016
 home-0.015
Token got
Token ID1392
Feature activation+4.03e+00

Pos logit contributions
 scalp+0.000
kin+0.000
ena+0.000
 unison+0.000
 End+0.000
Neg logit contributions
 closer-0.000
ls-0.000
 back-0.000
s-0.000
 off-0.000
Token."
Token ID526
Feature activation-4.37e-01

Pos logit contributions
ls+0.007
 closer+0.007
 groceries+0.006
erry+0.006
 back+0.006
Neg logit contributions
ena-0.008
 scalp-0.008
 rarely-0.008
 unison-0.007
ric-0.007
Token\n
Token ID198
Feature activation+1.35e-01

Pos logit contributions
kin+0.012
ena+0.012
 scalp+0.012
 unison+0.012
 End+0.011
Neg logit contributions
 closer-0.008
ls-0.008
s-0.008
 back-0.008
 to-0.008
Token\n
Token ID198
Feature activation+5.08e-01

Pos logit contributions
 closer+0.003
ls+0.003
 back+0.003
 groceries+0.003
s+0.003
Neg logit contributions
ena-0.003
kin-0.003
 scalp-0.003
 unison-0.003
 End-0.003
TokenTim
Token ID14967
Feature activation+1.40e+00

Pos logit contributions
 closer+0.012
ls+0.011
s+0.011
 back+0.010
 off+0.010
Neg logit contributions
 scalp-0.012
kin-0.012
ena-0.011
 unison-0.011
ric-0.011
Token took
Token ID1718
Feature activation+2.31e+00

Pos logit contributions
 closer+0.029
ls+0.028
 back+0.025
s+0.023
 home+0.023
Neg logit contributions
ena-0.036
kin-0.036
 scalp-0.035
 unison-0.035
 End-0.034
Token a
Token ID257
Feature activation+5.50e+00

Pos logit contributions
ls+0.063
s+0.058
erry+0.054
 closer+0.053
use+0.052
Neg logit contributions
ena-0.043
kin-0.038
 End-0.037
 scalp-0.036
 unison-0.034
Token deep
Token ID2769
Feature activation+1.55e+00

Pos logit contributions
ls+0.116
lers+0.100
elling+0.100
beh+0.098
erry+0.096
Neg logit contributions
kin-0.114
 lear-0.104
pron-0.102
ena-0.102
 hallway-0.102
Token breath
Token ID8033
Feature activation+7.57e-01

Pos logit contributions
ls+0.031
 closer+0.030
 back+0.026
s+0.026
 groceries+0.025
Neg logit contributions
kin-0.041
ena-0.040
 scalp-0.039
 unison-0.038
 End-0.038
Token and
Token ID290
Feature activation+2.37e+00

Pos logit contributions
ls+0.018
 closer+0.018
 back+0.016
 groceries+0.015
s+0.015
Neg logit contributions
ena-0.016
kin-0.016
 scalp-0.016
 unison-0.015
 End-0.015
Token said
Token ID531
Feature activation+8.25e-01

Pos logit contributions
ls+0.058
 closer+0.054
erry+0.054
 groceries+0.051
 back+0.049
Neg logit contributions
ena-0.049
 scalp-0.045
kin-0.044
 rarely-0.043
 unison-0.042
Token,
Token ID11
Feature activation-2.60e+00

Pos logit contributions
ls+0.021
 closer+0.021
 groceries+0.020
needed+0.018
erry+0.018
Neg logit contributions
kin-0.016
 scalp-0.014
 End-0.014
ena-0.014
 rarely-0.013
Token he
Token ID339
Feature activation+5.12e-01

Pos logit contributions
kin+0.006
ena+0.006
 unison+0.006
 End+0.006
 scalp+0.006
Neg logit contributions
ls-0.005
 closer-0.005
 back-0.004
s-0.004
 groceries-0.004
Token walked
Token ID6807
Feature activation+4.25e-01

Pos logit contributions
ls+0.012
 closer+0.012
 groceries+0.011
 back+0.011
 home+0.011
Neg logit contributions
ena-0.011
kin-0.010
 unison-0.010
 scalp-0.010
 rarely-0.010
Token closer
Token ID5699
Feature activation+1.13e+00

Pos logit contributions
ls+0.007
 groceries+0.006
lers+0.006
 unnecessary+0.005
erry+0.005
Neg logit contributions
ena-0.012
 End-0.012
But-0.011
Now-0.011
kin-0.011
Token to
Token ID284
Feature activation-3.02e-02

Pos logit contributions
ls+0.020
 closer+0.019
 back+0.016
 groceries+0.016
s+0.015
Neg logit contributions
ena-0.031
kin-0.030
 scalp-0.030
 unison-0.030
 End-0.029
Token take
Token ID1011
Feature activation+2.82e+00

Pos logit contributions
ena+0.001
 unison+0.001
 End+0.001
ric+0.001
 rarely+0.001
Neg logit contributions
ls-0.001
lers-0.001
s-0.001
 groceries-0.001
erry-0.001
Token a
Token ID257
Feature activation+5.50e+00

Pos logit contributions
ls+0.088
s+0.078
erry+0.073
 groceries+0.073
lers+0.072
Neg logit contributions
ena-0.038
kin-0.032
 pretending-0.030
 End-0.029
 unison-0.028
Token better
Token ID1365
Feature activation+2.54e+00

Pos logit contributions
ls+0.071
ishes+0.070
 groceries+0.067
beh+0.067
ups+0.066
Neg logit contributions
kin-0.146
 sensation-0.135
 unison-0.134
pron-0.130
 pretending-0.129
Token look
Token ID804
Feature activation+2.64e+00

Pos logit contributions
ls+0.058
 groceries+0.053
s+0.048
 closer+0.046
lers+0.046
Neg logit contributions
kin-0.061
 sensation-0.053
 scalp-0.053
 unison-0.052
ena-0.051
Token.
Token ID13
Feature activation+7.80e-01

Pos logit contributions
 closer+0.061
ls+0.058
 groceries+0.056
 back+0.053
 home+0.051
Neg logit contributions
kin-0.054
ena-0.053
 scalp-0.048
 End-0.048
 unison-0.047
Token He
Token ID679
Feature activation-7.57e-02

Pos logit contributions
ls+0.020
 closer+0.020
 back+0.018
s+0.017
 home+0.016
Neg logit contributions
ena-0.016
kin-0.016
 scalp-0.016
 unison-0.016
 End-0.015
Token was
Token ID373
Feature activation-2.34e+00

Pos logit contributions
ena+0.002
kin+0.002
 scalp+0.002
 unison+0.002
 rarely+0.002
Neg logit contributions
ls-0.002
 closer-0.002
 groceries-0.002
 back-0.002
 home-0.001
Token sleepy
Token ID42023
Feature activation-1.65e+00

Pos logit contributions
 closer+0.030
ls+0.029
 back+0.026
s+0.024
 home+0.024
Neg logit contributions
 unison-0.042
ena-0.042
 scalp-0.041
kin-0.041
 End-0.039
Token.
Token ID13
Feature activation+1.32e+00

Pos logit contributions
kin+0.049
ena+0.049
 scalp+0.048
 unison+0.048
 End+0.047
Neg logit contributions
 closer-0.026
ls-0.026
 back-0.022
s-0.021
 home-0.019
Token Can
Token ID1680
Feature activation+1.43e-01

Pos logit contributions
 closer+0.030
 back+0.027
ls+0.027
 to+0.024
 home+0.024
Neg logit contributions
kin-0.033
ena-0.032
 scalp-0.031
 unison-0.030
 End-0.030
Token I
Token ID314
Feature activation+3.36e-01

Pos logit contributions
 closer+0.003
ls+0.003
 back+0.003
s+0.002
 home+0.002
Neg logit contributions
kin-0.004
ena-0.004
 scalp-0.004
 unison-0.004
 End-0.003
Token take
Token ID1011
Feature activation+3.39e+00

Pos logit contributions
ls+0.008
 closer+0.007
 groceries+0.006
 back+0.006
s+0.006
Neg logit contributions
kin-0.008
 scalp-0.008
ena-0.008
 unison-0.008
 End-0.007
Token a
Token ID257
Feature activation+5.49e+00

Pos logit contributions
ls+0.082
 closer+0.077
s+0.070
erry+0.065
 nearer+0.064
Neg logit contributions
ena-0.065
kin-0.065
 scalp-0.064
 unison-0.064
 End-0.063
Token nap
Token ID25422
Feature activation-2.16e+00

Pos logit contributions
ls+0.157
seed+0.138
elling+0.137
beh+0.137
ishes+0.134
Neg logit contributions
kin-0.085
 unison-0.071
pron-0.068
 scalp-0.065
uren-0.065
Token now
Token ID783
Feature activation+1.28e+00

Pos logit contributions
ena+0.059
 scalp+0.058
kin+0.058
 unison+0.058
 End+0.056
Neg logit contributions
 closer-0.040
ls-0.040
 back-0.035
s-0.033
 home-0.032
Token?"
Token ID1701
Feature activation-3.62e+00

Pos logit contributions
 closer+0.024
ls+0.023
 back+0.021
s+0.019
 home+0.019
Neg logit contributions
ena-0.035
kin-0.035
 scalp-0.035
 unison-0.035
 End-0.034
Token asked
Token ID1965
Feature activation-1.63e+00

Pos logit contributions
 End+0.068
 rarely+0.066
 unison+0.066
kin+0.062
ena+0.062
Neg logit contributions
ls-0.088
 closer-0.084
 back-0.077
erry-0.076
 groceries-0.076
Token Walter
Token ID15819
Feature activation+2.21e+00

Pos logit contributions
kin+0.034
 End+0.034
 unison+0.033
ena+0.032
 rarely+0.031
Neg logit contributions
ls-0.037
 closer-0.035
s-0.032
war-0.031
 back-0.030
Token so
Token ID523
Feature activation+1.65e+00

Pos logit contributions
kin+0.038
ena+0.037
 unison+0.037
 scalp+0.037
ric+0.035
Neg logit contributions
 closer-0.016
ls-0.014
 back-0.014
 to-0.013
 off-0.012
Token he
Token ID339
Feature activation+7.93e-02

Pos logit contributions
 closer+0.027
ls+0.027
 back+0.023
s+0.021
 off+0.020
Neg logit contributions
kin-0.051
ena-0.051
 scalp-0.050
 unison-0.050
ric-0.048
Token decided
Token ID3066
Feature activation+2.80e+00

Pos logit contributions
ls+0.002
 closer+0.002
 back+0.002
s+0.002
 groceries+0.002
Neg logit contributions
kin-0.002
ena-0.002
 scalp-0.002
 unison-0.002
 End-0.002
Token to
Token ID284
Feature activation-1.02e+00

Pos logit contributions
ls+0.027
 closer+0.022
s+0.016
erry+0.015
 groceries+0.015
Neg logit contributions
ena-0.100
 scalp-0.097
 rarely-0.096
kin-0.096
 End-0.095
Token take
Token ID1011
Feature activation+2.67e+00

Pos logit contributions
kin+0.029
ena+0.029
 scalp+0.029
 unison+0.029
 End+0.028
Neg logit contributions
 closer-0.018
ls-0.017
 back-0.016
 off-0.014
 home-0.014
Token a
Token ID257
Feature activation+5.48e+00

Pos logit contributions
ls+0.072
s+0.064
 closer+0.064
erry+0.062
 groceries+0.057
Neg logit contributions
ena-0.047
 End-0.045
kin-0.043
 scalp-0.043
 unison-0.042
Token nap
Token ID25422
Feature activation-2.71e+00

Pos logit contributions
ls+0.140
elling+0.134
ishes+0.126
 groceries+0.123
ished+0.122
Neg logit contributions
kin-0.088
uren-0.082
pron-0.079
 unison-0.075
 barely-0.074
Token instead
Token ID2427
Feature activation-8.83e-02

Pos logit contributions
ric+0.081
 unison+0.081
ena+0.081
 scalp+0.080
 End+0.079
Neg logit contributions
 closer-0.045
ls-0.042
 back-0.041
 home-0.036
 off-0.036
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
kin+0.002
ena+0.002
 scalp+0.002
 unison+0.002
 End+0.002
Neg logit contributions
ls-0.002
 closer-0.002
 back-0.001
s-0.001
 groceries-0.001
Token He
Token ID679
Feature activation-1.06e+00

Pos logit contributions
 closer+0.024
ls+0.024
 back+0.022
s+0.021
 to+0.021
Neg logit contributions
kin-0.027
ena-0.026
 scalp-0.026
 unison-0.026
 End-0.025
Token curled
Token ID41108
Feature activation-9.68e-01

Pos logit contributions
kin+0.024
ena+0.024
 scalp+0.024
 unison+0.024
 End+0.023
Neg logit contributions
 closer-0.025
ls-0.024
 back-0.022
s-0.021
 home-0.020
Token his
Token ID465
Feature activation-6.51e-01

Pos logit contributions
kin+0.014
ena+0.013
 scalp+0.013
 End+0.013
 unison+0.012
Neg logit contributions
ls-0.010
 closer-0.010
s-0.009
 back-0.008
erry-0.008
Token book
Token ID1492
Feature activation+1.04e+00

Pos logit contributions
kin+0.017
ena+0.017
 scalp+0.017
 unison+0.016
 End+0.016
Neg logit contributions
ls-0.013
 closer-0.013
s-0.011
 back-0.011
 off-0.010
Token and
Token ID290
Feature activation+1.91e+00

Pos logit contributions
ls+0.016
 closer+0.014
erry+0.013
 groceries+0.013
s+0.012
Neg logit contributions
kin-0.032
ena-0.031
 scalp-0.029
 unison-0.029
 End-0.029
Token his
Token ID465
Feature activation-1.28e+00

Pos logit contributions
 closer+0.038
ls+0.037
 back+0.033
s+0.031
 home+0.030
Neg logit contributions
ena-0.050
kin-0.049
 scalp-0.049
 unison-0.048
 End-0.047
Token bin
Token ID9874
Feature activation+2.04e+00

Pos logit contributions
ena+0.033
kin+0.033
 scalp+0.032
 unison+0.032
 End+0.031
Neg logit contributions
 closer-0.026
ls-0.026
 back-0.023
s-0.022
 home-0.021
Tokenocular
Token ID37320
Feature activation+5.48e+00

Pos logit contributions
ls+0.037
 closer+0.036
 groceries+0.029
erry+0.029
s+0.028
Neg logit contributions
kin-0.055
ena-0.055
 scalp-0.053
 unison-0.051
 End-0.049
Tokens
Token ID82
Feature activation-4.30e-02

Pos logit contributions
 closer+0.055
ls+0.044
 groceries+0.041
erry+0.036
 nearer+0.036
Neg logit contributions
ena-0.176
ric-0.172
 scalp-0.171
kin-0.169
 rarely-0.166
Token in
Token ID287
Feature activation-1.41e+00

Pos logit contributions
ena+0.001
kin+0.001
 scalp+0.001
 unison+0.001
 End+0.001
Neg logit contributions
 closer-0.001
ls-0.001
 back-0.001
s-0.001
 home-0.001
Token his
Token ID465
Feature activation-1.44e+00

Pos logit contributions
 scalp+0.043
kin+0.043
ena+0.042
 unison+0.041
 End+0.041
Neg logit contributions
 closer-0.023
ls-0.021
 back-0.019
 home-0.018
 off-0.017
Token backpack
Token ID22526
Feature activation-5.67e-01

Pos logit contributions
kin+0.035
 unison+0.034
 scalp+0.034
ena+0.033
 End+0.032
Neg logit contributions
 closer-0.033
ls-0.033
s-0.027
 nearer-0.026
 off-0.026
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
 scalp+0.017
 unison+0.016
ena+0.016
kin+0.016
 End+0.016
Neg logit contributions
 closer-0.010
ls-0.009
 back-0.009
s-0.008
 to-0.008
Token tired
Token ID10032
Feature activation-9.38e-01

Pos logit contributions
kin+0.073
 unison+0.073
ena+0.072
 scalp+0.072
OM+0.068
Neg logit contributions
 closer-0.053
 back-0.049
ls-0.047
 home-0.044
 off-0.043
Token.
Token ID13
Feature activation+7.10e-01

Pos logit contributions
kin+0.021
ena+0.021
 scalp+0.021
 unison+0.020
 End+0.020
Neg logit contributions
ls-0.022
 closer-0.022
 back-0.019
s-0.019
 groceries-0.018
Token So
Token ID1406
Feature activation-6.96e-02

Pos logit contributions
 closer+0.013
 back+0.012
 to+0.012
ls+0.011
 off+0.011
Neg logit contributions
kin-0.021
ena-0.019
 unison-0.019
 End-0.019
 scalp-0.019
Token it
Token ID340
Feature activation+1.58e+00

Pos logit contributions
 scalp+0.002
ena+0.002
kin+0.002
 unison+0.002
ric+0.002
Neg logit contributions
 closer-0.001
ls-0.001
 back-0.001
 off-0.001
 to-0.001
Token took
Token ID1718
Feature activation+2.49e+00

Pos logit contributions
ls+0.037
 closer+0.036
 back+0.032
 groceries+0.031
s+0.031
Neg logit contributions
kin-0.035
ena-0.035
 scalp-0.034
 unison-0.034
 End-0.033
Token a
Token ID257
Feature activation+5.47e+00

Pos logit contributions
ls+0.065
 closer+0.060
s+0.060
erry+0.054
 groceries+0.053
Neg logit contributions
kin-0.045
 scalp-0.041
 End-0.041
ena-0.041
 unison-0.039
Token nap
Token ID25422
Feature activation-3.30e+00

Pos logit contributions
ls+0.133
elling+0.127
beh+0.120
lers+0.117
ishes+0.116
Neg logit contributions
kin-0.106
pron-0.089
 cou-0.082
 sensation-0.080
 hallway-0.079
Token in
Token ID287
Feature activation-2.51e-01

Pos logit contributions
ric+0.098
 unison+0.096
ena+0.095
 scalp+0.092
 End+0.092
Neg logit contributions
 closer-0.058
 back-0.057
ls-0.056
 to-0.052
 home-0.051
Token the
Token ID262
Feature activation+8.35e-01

Pos logit contributions
ena+0.008
kin+0.008
 scalp+0.008
 unison+0.008
 End+0.007
Neg logit contributions
 closer-0.004
ls-0.004
 back-0.003
 home-0.003
 off-0.003
Token shade
Token ID17979
Feature activation-2.55e+00

Pos logit contributions
ls+0.020
 closer+0.018
s+0.016
 groceries+0.015
 off+0.015
Neg logit contributions
 End-0.020
 scalp-0.019
kin-0.019
 unison-0.019
 sensation-0.018
Token of
Token ID286
Feature activation-1.63e+00

Pos logit contributions
 unison+0.079
 scalp+0.078
ena+0.077
kin+0.077
 End+0.075
Neg logit contributions
 closer-0.041
ls-0.039
 back-0.037
s-0.034
 to-0.033
Token,
Token ID11
Feature activation+4.54e-01

Pos logit contributions
kin+0.013
 scalp+0.012
ena+0.012
 End+0.012
 unison+0.012
Neg logit contributions
 closer-0.012
ls-0.011
 back-0.010
 groceries-0.010
 home-0.009
Token Lily
Token ID20037
Feature activation+1.52e-01

Pos logit contributions
 closer+0.009
ls+0.008
 back+0.007
 to+0.007
s+0.007
Neg logit contributions
ena-0.013
ric-0.013
 unison-0.013
 scalp-0.013
kin-0.013
Token's
Token ID338
Feature activation-1.25e+00

Pos logit contributions
 closer+0.002
ls+0.002
 back+0.002
 groceries+0.002
s+0.002
Neg logit contributions
ena-0.005
kin-0.005
 scalp-0.005
 unison-0.004
 End-0.004
Token mom
Token ID1995
Feature activation+2.64e+00

Pos logit contributions
kin+0.027
ena+0.027
 scalp+0.026
 unison+0.025
 End+0.025
Neg logit contributions
ls-0.031
 closer-0.030
s-0.026
 back-0.026
 off-0.025
Token took
Token ID1718
Feature activation+2.17e+00

Pos logit contributions
 closer+0.053
ls+0.053
 back+0.046
s+0.044
 home+0.042
Neg logit contributions
ena-0.068
kin-0.068
 scalp-0.067
 unison-0.066
 End-0.065
Token a
Token ID257
Feature activation+5.45e+00

Pos logit contributions
ls+0.064
 closer+0.058
 nearer+0.054
s+0.054
erry+0.054
Neg logit contributions
ena-0.036
 scalp-0.030
ana-0.028
 pretending-0.027
 lear-0.026
Token photo
Token ID4590
Feature activation+1.82e+00

Pos logit contributions
ls+0.117
beh+0.105
 aback+0.102
elling+0.099
ished+0.097
Neg logit contributions
pron-0.114
kin-0.102
 laptop-0.100
lf-0.099
 hallway-0.099
Token of
Token ID286
Feature activation-1.50e+00

Pos logit contributions
ls+0.028
 closer+0.027
 unnecessary+0.023
 nearer+0.022
lers+0.020
Neg logit contributions
ena-0.056
kin-0.052
 scalp-0.050
 lear-0.050
ana-0.049
Token her
Token ID607
Feature activation-8.88e-01

Pos logit contributions
kin+0.042
ena+0.042
 scalp+0.041
 unison+0.041
 End+0.040
Neg logit contributions
 closer-0.027
ls-0.026
 back-0.023
s-0.022
 home-0.021
Token with
Token ID351
Feature activation+1.45e-01

Pos logit contributions
kin+0.023
ena+0.022
 scalp+0.022
 unison+0.022
 End+0.022
Neg logit contributions
ls-0.018
 closer-0.018
 back-0.017
s-0.016
 home-0.016
Token her
Token ID607
Feature activation-6.95e-01

Pos logit contributions
ls+0.003
 closer+0.003
erry+0.003
s+0.003
 back+0.003
Neg logit contributions
ena-0.003
 unison-0.003
 scalp-0.003
kin-0.003
 End-0.003
Token she
Token ID673
Feature activation+2.42e-02

Pos logit contributions
 closer+0.049
ls+0.048
 back+0.044
s+0.042
 off+0.041
Neg logit contributions
kin-0.053
ena-0.053
 unison-0.053
 scalp-0.051
 End-0.050
Token ran
Token ID4966
Feature activation+1.50e+00

Pos logit contributions
 closer+0.001
ls+0.001
s+0.000
 back+0.000
 to+0.000
Neg logit contributions
kin-0.001
 unison-0.001
 End-0.001
ena-0.001
 scalp-0.001
Token over
Token ID625
Feature activation+1.04e+00

Pos logit contributions
ls+0.022
 groceries+0.017
lers+0.016
erry+0.016
seed+0.016
Neg logit contributions
ena-0.043
 End-0.042
 lear-0.041
 scalp-0.041
 pretending-0.041
Token to
Token ID284
Feature activation+4.66e-01

Pos logit contributions
ls+0.013
lers+0.008
 closer+0.008
 groceries+0.008
erry+0.007
Neg logit contributions
ena-0.035
 unison-0.033
 scalp-0.033
 End-0.033
era-0.032
Token take
Token ID1011
Feature activation+2.90e+00

Pos logit contributions
ls+0.010
 closer+0.009
s+0.008
 groceries+0.008
erry+0.008
Neg logit contributions
ena-0.011
 scalp-0.011
 End-0.011
 unison-0.011
kin-0.011
Token a
Token ID257
Feature activation+5.44e+00

Pos logit contributions
ls+0.081
 closer+0.069
s+0.069
lers+0.064
erry+0.063
Neg logit contributions
ena-0.046
 scalp-0.045
 lear-0.039
kin-0.038
ana-0.038
Token closer
Token ID5699
Feature activation+2.45e+00

Pos logit contributions
ls+0.084
 groceries+0.078
beh+0.078
elling+0.076
ished+0.073
Neg logit contributions
pron-0.125
kin-0.119
 violet-0.115
ric-0.112
 scalp-0.112
Token look
Token ID804
Feature activation+2.42e+00

Pos logit contributions
ls+0.062
 groceries+0.057
 unnecessary+0.049
 closer+0.049
lers+0.047
Neg logit contributions
ric-0.044
 scalp-0.043
kin-0.043
 sensation-0.043
ena-0.042
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
 closer+0.058
 groceries+0.056
ls+0.055
 aback+0.051
war+0.051
Neg logit contributions
kin-0.044
ena-0.044
 End-0.041
 scalp-0.041
 unison-0.038
Token She
Token ID1375
Feature activation-6.28e-01

Pos logit contributions
 closer+0.028
ls+0.027
 back+0.025
s+0.024
 to+0.023
Neg logit contributions
kin-0.026
ena-0.025
 unison-0.025
 scalp-0.025
 End-0.024
Token wanted
Token ID2227
Feature activation+1.85e+00

Pos logit contributions
ena+0.014
kin+0.014
 scalp+0.014
 unison+0.014
 End+0.013
Neg logit contributions
ls-0.015
 closer-0.015
 back-0.013
 groceries-0.012
s-0.012
Token,
Token ID11
Feature activation+6.49e-01

Pos logit contributions
kin+0.012
ena+0.012
 scalp+0.011
 unison+0.011
 End+0.011
Neg logit contributions
 closer-0.009
ls-0.009
 back-0.008
s-0.007
 home-0.007
Token she
Token ID673
Feature activation+4.80e-01

Pos logit contributions
 closer+0.014
ls+0.014
 back+0.013
s+0.012
 home+0.012
Neg logit contributions
kin-0.016
ena-0.016
 scalp-0.015
 unison-0.015
 End-0.015
Token decided
Token ID3066
Feature activation+2.60e+00

Pos logit contributions
 closer+0.011
ls+0.011
 groceries+0.010
 back+0.010
 home+0.010
Neg logit contributions
ena-0.011
kin-0.011
 scalp-0.010
 unison-0.010
 End-0.010
Token to
Token ID284
Feature activation-9.98e-01

Pos logit contributions
ls+0.021
 closer+0.019
s+0.012
 back+0.010
erry+0.010
Neg logit contributions
ena-0.097
 scalp-0.096
kin-0.095
 End-0.093
 rarely-0.093
Token take
Token ID1011
Feature activation+2.73e+00

Pos logit contributions
ena+0.030
kin+0.030
 scalp+0.030
 End+0.029
 unison+0.029
Neg logit contributions
 closer-0.017
 back-0.015
ls-0.014
 off-0.014
 home-0.014
Token a
Token ID257
Feature activation+5.44e+00

Pos logit contributions
ls+0.069
 closer+0.065
erry+0.062
s+0.056
 nearer+0.055
Neg logit contributions
ena-0.051
 End-0.051
 scalp-0.051
kin-0.050
 unison-0.048
Token deep
Token ID2769
Feature activation+1.63e+00

Pos logit contributions
ls+0.116
elling+0.099
lers+0.099
beh+0.099
ishes+0.093
Neg logit contributions
pron-0.115
kin-0.113
 pretending-0.101
 lear-0.098
 hallway-0.097
Token breath
Token ID8033
Feature activation+1.21e+00

Pos logit contributions
ls+0.032
 closer+0.031
s+0.027
 back+0.027
 groceries+0.026
Neg logit contributions
kin-0.044
ena-0.042
 scalp-0.041
 unison-0.041
 End-0.040
Token and
Token ID290
Feature activation+2.81e+00

Pos logit contributions
ls+0.029
 closer+0.028
 groceries+0.025
 back+0.024
s+0.024
Neg logit contributions
ena-0.026
kin-0.026
 scalp-0.024
 unison-0.024
 End-0.023
Token ask
Token ID1265
Feature activation+1.13e+00

Pos logit contributions
ls+0.067
 closer+0.060
s+0.057
erry+0.055
 back+0.054
Neg logit contributions
 scalp-0.061
ena-0.061
kin-0.061
 unison-0.059
 rarely-0.057
Token Max
Token ID5436
Feature activation+2.61e+00

Pos logit contributions
ls+0.029
 closer+0.027
erry+0.024
s+0.023
lers+0.023
Neg logit contributions
ena-0.025
kin-0.023
 scalp-0.023
 End-0.023
 unison-0.022
Token said
Token ID531
Feature activation-8.82e-01

Pos logit contributions
kin+0.025
 End+0.024
 scalp+0.024
 unison+0.024
ric+0.023
Neg logit contributions
 closer-0.014
ls-0.014
s-0.012
 back-0.012
 to-0.011
Token,
Token ID11
Feature activation-6.05e+00

Pos logit contributions
kin+0.023
ena+0.023
 scalp+0.023
 unison+0.023
 End+0.022
Neg logit contributions
ls-0.017
 closer-0.017
 back-0.015
s-0.014
 groceries-0.014
Token as
Token ID355
Feature activation+4.06e-01

Pos logit contributions
kin+0.161
ena+0.152
OM+0.151
 unison+0.145
 End+0.145
Neg logit contributions
 closer-0.156
 back-0.142
 down-0.132
 to-0.128
 up-0.127
Token he
Token ID339
Feature activation-6.96e-01

Pos logit contributions
 closer+0.006
ls+0.006
 back+0.005
 to+0.005
s+0.004
Neg logit contributions
ric-0.014
 scalp-0.013
 unison-0.013
ities-0.013
OM-0.013
Token snatched
Token ID43276
Feature activation+2.67e+00

Pos logit contributions
ric+0.018
 End+0.018
kin+0.017
 scalp+0.017
 unison+0.017
Neg logit contributions
 closer-0.015
ls-0.014
s-0.013
 back-0.013
 to-0.012
Token a
Token ID257
Feature activation+5.43e+00

Pos logit contributions
ls+0.062
 closer+0.060
erry+0.055
needed+0.054
s+0.054
Neg logit contributions
ena-0.056
kin-0.054
 unison-0.053
 scalp-0.053
 End-0.052
Token red
Token ID2266
Feature activation+2.82e+00

Pos logit contributions
 groceries+0.115
ls+0.112
beh+0.106
earned+0.106
needed+0.105
Neg logit contributions
kin-0.118
 rectangle-0.102
 unison-0.102
 scalp-0.101
 loser-0.099
Token ball
Token ID2613
Feature activation+2.93e+00

Pos logit contributions
ls+0.059
 closer+0.055
 groceries+0.050
 unnecessary+0.047
needed+0.046
Neg logit contributions
kin-0.070
 scalp-0.067
 unison-0.066
 End-0.063
ena-0.062
Token from
Token ID422
Feature activation+6.65e-01

Pos logit contributions
ls+0.062
 groceries+0.059
 unnecessary+0.056
 closer+0.055
seed+0.052
Neg logit contributions
kin-0.066
ena-0.062
 scalp-0.061
uff-0.057
 unison-0.057
Token Lily
Token ID20037
Feature activation+9.74e-01

Pos logit contributions
 closer+0.012
ls+0.011
 back+0.010
 home+0.009
 off+0.009
Neg logit contributions
kin-0.019
 scalp-0.019
ena-0.019
 unison-0.019
 End-0.018
Token.
Token ID13
Feature activation+5.21e-02

Pos logit contributions
 closer+0.018
ls+0.018
 back+0.015
s+0.014
 groceries+0.014
Neg logit contributions
kin-0.027
ena-0.027
 scalp-0.026
 unison-0.026
 End-0.025
Token mom
Token ID1995
Feature activation+2.67e+00

Pos logit contributions
kin+0.013
 scalp+0.013
ena+0.013
 unison+0.013
 End+0.013
Neg logit contributions
ls-0.019
 closer-0.019
 back-0.016
s-0.016
 away-0.016
Token asked
Token ID1965
Feature activation+1.38e-01

Pos logit contributions
 closer+0.053
ls+0.052
 back+0.046
s+0.043
 home+0.041
Neg logit contributions
kin-0.071
ena-0.070
 scalp-0.070
 unison-0.069
 End-0.067
Token her
Token ID607
Feature activation+9.21e-01

Pos logit contributions
ls+0.003
 closer+0.003
 back+0.002
s+0.002
 groceries+0.002
Neg logit contributions
ena-0.004
kin-0.004
 scalp-0.004
 End-0.004
 unison-0.004
Token to
Token ID284
Feature activation-6.61e-01

Pos logit contributions
ls+0.013
 closer+0.011
s+0.009
lers+0.009
 back+0.009
Neg logit contributions
ena-0.030
kin-0.029
 End-0.028
 scalp-0.028
 unison-0.027
Token take
Token ID1011
Feature activation+2.38e+00

Pos logit contributions
kin+0.019
ena+0.019
 scalp+0.018
 unison+0.018
 End+0.018
Neg logit contributions
ls-0.012
 closer-0.012
 back-0.010
s-0.010
 home-0.009
Token a
Token ID257
Feature activation+5.42e+00

Pos logit contributions
ls+0.069
 closer+0.061
s+0.059
 nearer+0.054
erry+0.053
Neg logit contributions
ena-0.044
 scalp-0.038
ana-0.036
kin-0.034
 lear-0.033
Token break
Token ID2270
Feature activation+1.69e+00

Pos logit contributions
ls+0.131
beh+0.109
elling+0.108
lers+0.106
s+0.105
Neg logit contributions
pron-0.107
kin-0.097
�-0.089
 laptop-0.088
 pretending-0.088
Token and
Token ID290
Feature activation+2.64e+00

Pos logit contributions
ls+0.039
 closer+0.038
 back+0.032
 nearer+0.031
s+0.031
Neg logit contributions
ena-0.039
kin-0.038
 scalp-0.036
 unison-0.034
ities-0.033
Token read
Token ID1100
Feature activation+5.12e-03

Pos logit contributions
ls+0.065
 closer+0.056
s+0.054
erry+0.053
 back+0.051
Neg logit contributions
ena-0.059
kin-0.057
 scalp-0.057
 unison-0.054
 rarely-0.053
Token a
Token ID257
Feature activation+2.84e+00

Pos logit contributions
ls+0.000
 closer+0.000
 back+0.000
s+0.000
erry+0.000
Neg logit contributions
ena-0.000
kin-0.000
 unison-0.000
 scalp-0.000
 End-0.000
Token story
Token ID1621
Feature activation-1.30e-01

Pos logit contributions
ls+0.073
 closer+0.064
 back+0.061
lers+0.057
s+0.057
Neg logit contributions
kin-0.057
 unison-0.056
ena-0.056
 scalp-0.055
 End-0.052
Token playing
Token ID2712
Feature activation-1.32e+00

Pos logit contributions
ena+0.019
kin+0.019
 unison+0.019
 scalp+0.019
 End+0.018
Neg logit contributions
ls-0.014
 closer-0.013
 back-0.012
s-0.011
 home-0.011
Token.
Token ID13
Feature activation+2.50e+00

Pos logit contributions
ena+0.036
kin+0.035
 unison+0.035
 scalp+0.035
 End+0.034
Neg logit contributions
 closer-0.025
ls-0.025
 back-0.023
s-0.021
 home-0.021
Token Let
Token ID3914
Feature activation+1.26e+00

Pos logit contributions
 closer+0.056
ls+0.052
 back+0.050
s+0.049
 to+0.047
Neg logit contributions
kin-0.062
ena-0.057
 scalp-0.057
 unison-0.056
 End-0.055
Token me
Token ID502
Feature activation+1.14e+00

Pos logit contributions
ls+0.026
 closer+0.025
 nearer+0.022
erry+0.021
 groceries+0.019
Neg logit contributions
 always-0.030
 End-0.029
 rarely-0.029
ena-0.028
 accident-0.028
Token take
Token ID1011
Feature activation+3.27e+00

Pos logit contributions
ls+0.026
 closer+0.022
 groceries+0.020
s+0.019
erry+0.018
Neg logit contributions
ena-0.027
 scalp-0.027
 unison-0.027
kin-0.026
ric-0.026
Token a
Token ID257
Feature activation+5.42e+00

Pos logit contributions
ls+0.082
 closer+0.076
erry+0.071
 nearer+0.067
s+0.067
Neg logit contributions
ena-0.058
 scalp-0.058
 End-0.057
kin-0.056
Now-0.056
Token look
Token ID804
Feature activation+3.55e+00

Pos logit contributions
ls+0.124
beh+0.105
ishes+0.104
elling+0.104
lers+0.103
Neg logit contributions
pron-0.103
kin-0.095
stand-0.090
 lear-0.087
uren-0.086
Token and
Token ID290
Feature activation+3.17e+00

Pos logit contributions
 closer+0.078
ls+0.076
 groceries+0.066
 nearer+0.065
erry+0.064
Neg logit contributions
kin-0.076
ena-0.076
 scalp-0.072
 End-0.071
 rarely-0.070
Token make
Token ID787
Feature activation+5.63e-01

Pos logit contributions
ls+0.075
 closer+0.066
s+0.063
erry+0.061
 back+0.057
Neg logit contributions
 scalp-0.070
kin-0.069
ena-0.068
 unison-0.068
 rarely-0.067
Token it
Token ID340
Feature activation+1.78e+00

Pos logit contributions
ls+0.013
 closer+0.013
erry+0.011
s+0.011
 nearer+0.011
Neg logit contributions
 scalp-0.012
ena-0.012
kin-0.011
 End-0.011
 unison-0.011
Token all
Token ID477
Feature activation+3.93e+00

Pos logit contributions
ls+0.032
 closer+0.027
erry+0.027
 groceries+0.025
s+0.023
Neg logit contributions
kin-0.047
 scalp-0.046
 rarely-0.044
ena-0.044
 always-0.042
Token see
Token ID766
Feature activation+1.56e+00

Pos logit contributions
kin+0.060
ena+0.060
 scalp+0.059
 unison+0.059
 End+0.057
Neg logit contributions
 closer-0.056
ls-0.055
 back-0.049
s-0.047
 off-0.046
Token?"
Token ID1701
Feature activation-2.05e+00

Pos logit contributions
 closer+0.032
ls+0.031
 back+0.028
 home+0.026
s+0.026
Neg logit contributions
ena-0.039
kin-0.038
 unison-0.038
 scalp-0.038
 End-0.036
Token Mom
Token ID11254
Feature activation-2.69e-01

Pos logit contributions
ena+0.032
 scalp+0.030
 End+0.030
 unison+0.028
kin+0.028
Neg logit contributions
ls-0.060
 closer-0.059
 back-0.054
erry-0.053
 away-0.053
Tokenmy
Token ID1820
Feature activation-7.48e-01

Pos logit contributions
ena+0.008
 scalp+0.008
kin+0.008
 unison+0.007
 End+0.007
Neg logit contributions
ls-0.005
 closer-0.004
 back-0.004
 groceries-0.004
 home-0.003
Token said
Token ID531
Feature activation-1.32e+00

Pos logit contributions
kin+0.020
ena+0.020
 scalp+0.020
 unison+0.020
 End+0.019
Neg logit contributions
 closer-0.014
ls-0.014
 back-0.012
s-0.012
 home-0.011
Token,
Token ID11
Feature activation-6.93e+00

Pos logit contributions
 scalp+0.032
ena+0.032
kin+0.032
 unison+0.031
 End+0.030
Neg logit contributions
ls-0.028
 closer-0.027
 groceries-0.023
 back-0.023
s-0.022
Token guiding
Token ID26727
Feature activation+5.63e-01

Pos logit contributions
OM+0.166
ric+0.165
kin+0.164
uders+0.155
ena+0.155
Neg logit contributions
 closer-0.183
 back-0.168
 down-0.151
 to-0.145
 off-0.144
Token Lily
Token ID20037
Feature activation+1.00e+00

Pos logit contributions
ls+0.013
 closer+0.013
erry+0.011
s+0.011
 back+0.011
Neg logit contributions
ena-0.013
 unison-0.012
 scalp-0.012
kin-0.012
 End-0.012
Token's
Token ID338
Feature activation+1.10e-01

Pos logit contributions
ls+0.016
 closer+0.015
 back+0.012
 groceries+0.012
s+0.011
Neg logit contributions
ena-0.031
kin-0.030
 scalp-0.030
 unison-0.029
 End-0.029
Token brush
Token ID14093
Feature activation-4.43e-01

Pos logit contributions
ls+0.003
 closer+0.003
s+0.002
 back+0.002
 groceries+0.002
Neg logit contributions
ena-0.002
 scalp-0.002
kin-0.002
 unison-0.002
 End-0.002
Token on
Token ID319
Feature activation+7.95e-01

Pos logit contributions
kin+0.012
ena+0.012
 scalp+0.012
 unison+0.012
 End+0.011
Neg logit contributions
 closer-0.008
ls-0.008
 back-0.007
s-0.007
 home-0.007
Token for
Token ID329
Feature activation-2.09e+00

Pos logit contributions
 unison+0.124
ena+0.124
 scalp+0.123
kin+0.121
ric+0.117
Neg logit contributions
 closer-0.061
 back-0.054
ls-0.052
 to-0.048
 home-0.046
Token cooking
Token ID10801
Feature activation-2.40e+00

Pos logit contributions
 unison+0.050
 scalp+0.050
ena+0.048
 rarely+0.048
kin+0.048
Neg logit contributions
 closer-0.046
 back-0.046
 home-0.043
 to-0.042
 off-0.041
Token,"
Token ID553
Feature activation-2.22e+00

Pos logit contributions
kin+0.067
ena+0.065
 scalp+0.065
 unison+0.064
 End+0.063
Neg logit contributions
 closer-0.044
ls-0.044
 back-0.037
s-0.036
 home-0.034
Token Mom
Token ID11254
Feature activation+8.23e-01

Pos logit contributions
kin+0.053
ena+0.052
 scalp+0.052
 unison+0.050
 End+0.050
Neg logit contributions
 closer-0.050
ls-0.050
 back-0.044
s-0.042
 home-0.041
Token said
Token ID531
Feature activation-1.46e+00

Pos logit contributions
ls+0.016
 closer+0.016
 back+0.014
 groceries+0.013
 home+0.013
Neg logit contributions
ena-0.022
kin-0.021
 scalp-0.021
 unison-0.021
 End-0.020
Token,
Token ID11
Feature activation-6.83e+00

Pos logit contributions
kin+0.037
 scalp+0.036
ena+0.036
 unison+0.035
 End+0.034
Neg logit contributions
ls-0.030
 closer-0.030
 back-0.025
s-0.025
 groceries-0.024
Token smiling
Token ID16755
Feature activation+3.38e-01

Pos logit contributions
OM+0.170
ric+0.167
uders+0.166
 rarely+0.164
ena+0.160
Neg logit contributions
 closer-0.169
 back-0.162
 down-0.146
 to-0.136
 out-0.131
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
ls+0.007
 closer+0.007
 groceries+0.006
erry+0.006
s+0.006
Neg logit contributions
 scalp-0.008
kin-0.008
ena-0.008
 unison-0.007
 End-0.007
Token She
Token ID1375
Feature activation-1.79e+00

Pos logit contributions
kin+0.025
ena+0.024
 unison+0.024
 scalp+0.024
ric+0.024
Neg logit contributions
 closer-0.025
ls-0.024
 back-0.023
s-0.022
 to-0.022
Token blew
Token ID17948
Feature activation-4.77e-01

Pos logit contributions
ric+0.053
ities+0.051
 scalp+0.051
 End+0.050
 unison+0.049
Neg logit contributions
 closer-0.032
ls-0.031
 back-0.029
s-0.027
 to-0.026
Token out
Token ID503
Feature activation+5.76e-01

Pos logit contributions
 scalp+0.011
kin+0.010
 End+0.010
ena+0.010
 unison+0.010
Neg logit contributions
ls-0.010
 closer-0.010
earned-0.009
erry-0.009
s-0.009
Token She
Token ID1375
Feature activation-1.81e+00

Pos logit contributions
ls+0.012
 closer+0.012
s+0.012
 back+0.011
 to+0.011
Neg logit contributions
kin-0.017
 unison-0.016
 scalp-0.016
ena-0.015
ities-0.015
Token ate
Token ID15063
Feature activation-1.18e+00

Pos logit contributions
 unison+0.045
 scalp+0.045
kin+0.045
ities+0.043
ric+0.041
Neg logit contributions
 closer-0.040
ls-0.039
 back-0.038
s-0.036
 to-0.036
Token popcorn
Token ID38914
Feature activation-2.31e+00

Pos logit contributions
kin+0.032
ena+0.032
 scalp+0.031
 unison+0.031
 End+0.030
Neg logit contributions
ls-0.022
 closer-0.022
 back-0.019
s-0.018
 home-0.017
Token and
Token ID290
Feature activation-7.54e-01

Pos logit contributions
ena+0.058
 scalp+0.057
kin+0.057
 unison+0.057
 End+0.056
Neg logit contributions
 closer-0.049
ls-0.049
 back-0.044
s-0.042
 home-0.041
Token roasted
Token ID30490
Feature activation-4.04e+00

Pos logit contributions
ena+0.018
 unison+0.018
uff+0.018
kin+0.018
ider+0.018
Neg logit contributions
 closer-0.015
ls-0.015
 back-0.015
 to-0.015
 off-0.014
Token marsh
Token ID22397
Feature activation-6.83e+00

Pos logit contributions
ena+0.074
kin+0.072
 scalp+0.071
 unison+0.069
 End+0.068
Neg logit contributions
ls-0.116
 closer-0.115
 back-0.105
s-0.103
 off-0.102
Tokenm
Token ID76
Feature activation-1.87e+00

Pos logit contributions
 unison+0.201
 rarely+0.199
 End+0.199
 scalp+0.195
kin+0.193
Neg logit contributions
s-0.117
ls-0.115
 closer-0.111
 to-0.103
 back-0.101
Tokenallows
Token ID47205
Feature activation-3.43e+00

Pos logit contributions
 unison+0.038
ena+0.036
 End+0.034
 scalp+0.034
kin+0.034
Neg logit contributions
ls-0.052
s-0.048
 closer-0.048
 off-0.047
 back-0.047
Token.
Token ID13
Feature activation+9.80e-03

Pos logit contributions
ena+0.100
kin+0.099
 scalp+0.099
 unison+0.098
 End+0.097
Neg logit contributions
 closer-0.061
ls-0.058
 back-0.055
s-0.051
 to-0.050
Token\n
Token ID198
Feature activation-2.39e-01

Pos logit contributions
 closer+0.000
ls+0.000
 back+0.000
s+0.000
 home+0.000
Neg logit contributions
kin-0.000
 unison-0.000
ena-0.000
 scalp-0.000
ities-0.000
Token\n
Token ID198
Feature activation-7.27e-02

Pos logit contributions
 scalp+0.006
kin+0.006
 unison+0.006
ena+0.006
ities+0.006
Neg logit contributions
 closer-0.005
ls-0.005
s-0.004
 back-0.004
 to-0.004
Token it
Token ID340
Feature activation-3.08e-01

Pos logit contributions
ena+0.007
kin+0.007
 scalp+0.007
 unison+0.007
 End+0.007
Neg logit contributions
ls-0.004
 closer-0.004
 back-0.003
s-0.003
 off-0.003
Token too
Token ID1165
Feature activation+7.94e-01

Pos logit contributions
kin+0.008
ena+0.008
 scalp+0.007
 unison+0.007
 End+0.007
Neg logit contributions
ls-0.006
 closer-0.006
 back-0.005
 groceries-0.005
s-0.005
Token,"
Token ID553
Feature activation-2.60e+00

Pos logit contributions
 closer+0.013
ls+0.013
 back+0.011
s+0.010
 home+0.010
Neg logit contributions
kin-0.024
ena-0.024
 scalp-0.023
 unison-0.023
 End-0.023
Token she
Token ID673
Feature activation-1.21e+00

Pos logit contributions
ena+0.053
 scalp+0.051
kin+0.051
 unison+0.050
 End+0.050
Neg logit contributions
 closer-0.065
ls-0.065
 back-0.059
 away-0.056
s-0.055
Token said
Token ID531
Feature activation-6.93e-01

Pos logit contributions
ena+0.033
kin+0.032
 scalp+0.032
 unison+0.032
 End+0.031
Neg logit contributions
 closer-0.023
ls-0.023
 back-0.020
s-0.019
 home-0.018
Token,
Token ID11
Feature activation-6.76e+00

Pos logit contributions
kin+0.018
ena+0.018
 scalp+0.018
 unison+0.017
 End+0.017
Neg logit contributions
ls-0.014
 closer-0.014
 back-0.012
s-0.011
 groceries-0.011
Token kissing
Token ID25847
Feature activation-1.61e+00

Pos logit contributions
OM+0.183
kin+0.183
uders+0.179
ric+0.175
ena+0.170
Neg logit contributions
 closer-0.158
 back-0.154
 to-0.138
 down-0.131
 off-0.125
Token them
Token ID606
Feature activation-7.79e-01

Pos logit contributions
 unison+0.046
ena+0.046
kin+0.046
 scalp+0.046
 End+0.045
Neg logit contributions
 closer-0.028
ls-0.028
 back-0.025
s-0.023
 off-0.022
Token on
Token ID319
Feature activation-1.61e-01

Pos logit contributions
kin+0.022
ena+0.021
 scalp+0.021
 unison+0.021
 End+0.021
Neg logit contributions
 closer-0.015
ls-0.014
 back-0.013
s-0.012
 off-0.012
Token their
Token ID511
Feature activation-1.55e+00

Pos logit contributions
kin+0.005
ric+0.005
OM+0.005
ena+0.005
 End+0.005
Neg logit contributions
 back-0.002
 closer-0.002
 off-0.002
ls-0.002
 home-0.002
Token cheeks
Token ID25839
Feature activation-1.25e+00

Pos logit contributions
kin+0.032
 scalp+0.031
ena+0.031
 unison+0.030
 End+0.030
Neg logit contributions
 closer-0.040
ls-0.040
 back-0.035
s-0.035
 off-0.033
Token a
Token ID257
Feature activation+1.21e+00

Pos logit contributions
ena+0.002
kin+0.001
 scalp+0.001
 unison+0.001
ric+0.001
Neg logit contributions
ls-0.001
 closer-0.001
 back-0.001
s-0.001
 groceries-0.001
Token pig
Token ID12967
Feature activation-1.82e-01

Pos logit contributions
ls+0.025
 closer+0.024
 back+0.021
s+0.020
 groceries+0.020
Neg logit contributions
kin-0.030
 scalp-0.030
ena-0.030
 unison-0.030
 End-0.028
Token!"
Token ID2474
Feature activation-2.47e+00

Pos logit contributions
kin+0.005
ena+0.005
 scalp+0.005
 unison+0.005
 End+0.005
Neg logit contributions
 closer-0.003
ls-0.003
 back-0.003
s-0.003
 groceries-0.003
Token she
Token ID673
Feature activation-1.49e+00

Pos logit contributions
kin+0.054
ena+0.054
 scalp+0.053
 unison+0.052
 End+0.051
Neg logit contributions
 closer-0.060
ls-0.059
 back-0.053
s-0.050
 home-0.049
Token said
Token ID531
Feature activation-1.01e+00

Pos logit contributions
kin+0.041
ena+0.041
 scalp+0.040
 unison+0.040
 End+0.039
Neg logit contributions
 closer-0.028
ls-0.028
 back-0.024
s-0.023
 home-0.022
Token,
Token ID11
Feature activation-6.76e+00

Pos logit contributions
kin+0.027
ena+0.027
 scalp+0.027
 unison+0.027
 End+0.026
Neg logit contributions
 closer-0.019
ls-0.019
 back-0.017
s-0.016
 home-0.015
Token holding
Token ID4769
Feature activation-6.19e-01

Pos logit contributions
ities+0.165
kin+0.164
 habitats+0.162
OM+0.162
 narrative+0.161
Neg logit contributions
 closer-0.171
 back-0.157
 down-0.153
 to-0.147
 up-0.145
Token up
Token ID510
Feature activation-6.20e-01

Pos logit contributions
 scalp+0.015
ena+0.015
kin+0.015
 unison+0.014
 End+0.014
Neg logit contributions
ls-0.014
 closer-0.012
s-0.011
erry-0.011
 nearer-0.010
Token her
Token ID607
Feature activation-7.15e-01

Pos logit contributions
kin+0.018
ena+0.018
 scalp+0.018
 unison+0.018
 End+0.017
Neg logit contributions
 closer-0.011
ls-0.010
 back-0.009
s-0.008
 home-0.008
Token paper
Token ID3348
Feature activation+4.83e-01

Pos logit contributions
kin+0.018
 scalp+0.017
ena+0.017
 unison+0.016
 End+0.016
Neg logit contributions
ls-0.016
 closer-0.015
s-0.013
 off-0.013
 back-0.012
Token.
Token ID13
Feature activation-6.06e-02

Pos logit contributions
 closer+0.010
ls+0.010
 back+0.008
s+0.008
 home+0.008
Neg logit contributions
kin-0.013
ena-0.012
 scalp-0.012
 unison-0.012
 End-0.012
Token be
Token ID307
Feature activation-9.50e-01

Pos logit contributions
kin+0.117
 unison+0.115
 scalp+0.114
ena+0.112
 End+0.111
Neg logit contributions
 closer-0.094
ls-0.086
 back-0.084
 to-0.076
s-0.075
Tokenep
Token ID538
Feature activation-5.76e-01

Pos logit contributions
 scalp+0.020
kin+0.020
 unison+0.019
�+0.019
ities+0.019
Neg logit contributions
 closer-0.024
 back-0.024
 home-0.022
 away-0.021
 off-0.021
Token!"
Token ID2474
Feature activation-2.96e+00

Pos logit contributions
 unison+0.019
 scalp+0.019
kin+0.019
 End+0.018
�+0.018
Neg logit contributions
 closer-0.008
 back-0.008
ls-0.008
s-0.007
 to-0.007
Token Tom
Token ID4186
Feature activation-2.91e-01

Pos logit contributions
kin+0.072
 scalp+0.068
ric+0.065
 unison+0.065
 End+0.064
Neg logit contributions
ls-0.072
 closer-0.072
s-0.066
 to-0.062
 off-0.060
Token says
Token ID1139
Feature activation-1.23e+00

Pos logit contributions
kin+0.010
 scalp+0.009
 unison+0.009
 End+0.009
ena+0.009
Neg logit contributions
 closer-0.004
ls-0.004
 back-0.003
s-0.003
 off-0.003
Token,
Token ID11
Feature activation-6.72e+00

Pos logit contributions
kin+0.033
ena+0.033
 scalp+0.032
 unison+0.032
 End+0.031
Neg logit contributions
 closer-0.024
ls-0.023
 back-0.021
s-0.019
 home-0.019
Token moving
Token ID3867
Feature activation-2.29e-01

Pos logit contributions
ities+0.174
kin+0.171
OM+0.166
ric+0.160
uders+0.160
Neg logit contributions
 closer-0.170
 back-0.154
 down-0.146
 to-0.142
 up-0.138
Token his
Token ID465
Feature activation-5.98e-01

Pos logit contributions
kin+0.007
 unison+0.007
 scalp+0.007
ena+0.007
 End+0.007
Neg logit contributions
 closer-0.003
ls-0.003
 back-0.003
s-0.002
 off-0.002
Token truck
Token ID7779
Feature activation+4.26e-01

Pos logit contributions
kin+0.015
 scalp+0.014
ena+0.014
 unison+0.014
 End+0.013
Neg logit contributions
ls-0.013
 closer-0.013
s-0.011
 back-0.011
 off-0.011
Token around
Token ID1088
Feature activation-2.39e+00

Pos logit contributions
 closer+0.008
ls+0.007
 back+0.007
s+0.006
 home+0.006
Neg logit contributions
kin-0.012
ena-0.012
 scalp-0.012
 unison-0.012
 End-0.011
Token the
Token ID262
Feature activation-5.71e-01

Pos logit contributions
kin+0.086
ric+0.084
 scalp+0.082
 unison+0.082
OM+0.082
Neg logit contributions
 closer-0.033
 back-0.031
 to-0.031
 home-0.027
 away-0.025
Token good
Token ID922
Feature activation-1.25e-01

Pos logit contributions
 closer+0.005
ls+0.004
 back+0.004
 home+0.004
 to+0.004
Neg logit contributions
ena-0.009
 scalp-0.008
kin-0.008
 unison-0.008
ities-0.008
Token friend
Token ID1545
Feature activation+8.88e-01

Pos logit contributions
 unison+0.004
kin+0.004
 scalp+0.004
ena+0.004
 End+0.004
Neg logit contributions
 closer-0.002
ls-0.002
 back-0.002
 home-0.002
 to-0.001
Token,"
Token ID553
Feature activation-2.86e+00

Pos logit contributions
ls+0.018
 closer+0.017
 back+0.014
 groceries+0.014
s+0.014
Neg logit contributions
ena-0.023
kin-0.023
 unison-0.022
 scalp-0.022
 End-0.021
Token Tom
Token ID4186
Feature activation-2.24e-01

Pos logit contributions
kin+0.062
 unison+0.060
 scalp+0.060
ena+0.057
ric+0.055
Neg logit contributions
 closer-0.074
ls-0.074
 back-0.067
s-0.065
 to-0.062
Token said
Token ID531
Feature activation-4.21e-01

Pos logit contributions
ena+0.006
kin+0.006
 scalp+0.006
 unison+0.006
 End+0.006
Neg logit contributions
ls-0.004
 closer-0.004
 back-0.004
 groceries-0.004
s-0.004
Token,
Token ID11
Feature activation-6.71e+00

Pos logit contributions
kin+0.009
 scalp+0.009
ena+0.009
 unison+0.008
 End+0.008
Neg logit contributions
ls-0.010
 closer-0.010
 groceries-0.009
erry-0.008
needed-0.008
Token smiling
Token ID16755
Feature activation+4.48e-01

Pos logit contributions
kin+0.176
ena+0.170
OM+0.168
uders+0.168
ric+0.164
Neg logit contributions
 closer-0.167
 back-0.157
 down-0.138
 to-0.133
 up-0.128
Token.
Token ID13
Feature activation-9.67e-01

Pos logit contributions
ls+0.009
 closer+0.009
 groceries+0.007
 back+0.007
s+0.007
Neg logit contributions
ena-0.011
kin-0.011
 scalp-0.011
 unison-0.011
 End-0.011
Token\n
Token ID198
Feature activation+4.25e-01

Pos logit contributions
kin+0.027
 scalp+0.027
 unison+0.026
ena+0.026
ities+0.025
Neg logit contributions
 closer-0.019
ls-0.018
 back-0.017
s-0.017
 to-0.016
Token\n
Token ID198
Feature activation+4.46e-01

Pos logit contributions
 closer+0.011
ls+0.010
 back+0.009
s+0.009
 groceries+0.009
Neg logit contributions
kin-0.009
ena-0.009
 scalp-0.009
 unison-0.009
 End-0.008
TokenS
Token ID50
Feature activation+1.81e-01

Pos logit contributions
 closer+0.009
ls+0.009
s+0.009
 back+0.009
 off+0.008
Neg logit contributions
 scalp-0.011
 unison-0.011
kin-0.011
ric-0.011
ena-0.011
Token each
Token ID1123
Feature activation+1.39e+00

Pos logit contributions
 closer+0.014
ls+0.014
 back+0.012
s+0.011
 home+0.011
Neg logit contributions
ena-0.020
kin-0.020
 scalp-0.020
 unison-0.019
 End-0.019
Token other
Token ID584
Feature activation+1.40e+00

Pos logit contributions
ls+0.029
 closer+0.027
 nearer+0.022
 back+0.021
s+0.021
Neg logit contributions
ena-0.037
kin-0.035
 End-0.034
 scalp-0.034
 unison-0.033
Token?"
Token ID1701
Feature activation-2.18e+00

Pos logit contributions
ls+0.031
 closer+0.030
s+0.026
 back+0.025
erry+0.025
Neg logit contributions
kin-0.033
ena-0.033
 scalp-0.032
 End-0.032
 unison-0.032
Token Mom
Token ID11254
Feature activation-2.27e-02

Pos logit contributions
ena+0.041
 scalp+0.039
 End+0.039
kin+0.039
 unison+0.037
Neg logit contributions
ls-0.060
 closer-0.059
 back-0.053
 off-0.051
s-0.051
Token said
Token ID531
Feature activation-6.09e-01

Pos logit contributions
ena+0.001
 scalp+0.001
kin+0.001
 unison+0.001
 End+0.001
Neg logit contributions
ls-0.000
 closer-0.000
 back-0.000
 groceries-0.000
 home-0.000
Token,
Token ID11
Feature activation-6.67e+00

Pos logit contributions
kin+0.014
 scalp+0.014
ena+0.014
 unison+0.013
 End+0.013
Neg logit contributions
ls-0.013
 closer-0.013
 groceries-0.011
 back-0.011
s-0.011
Token smiling
Token ID16755
Feature activation+1.66e-01

Pos logit contributions
OM+0.171
ric+0.170
uders+0.166
kin+0.165
ena+0.164
Neg logit contributions
 closer-0.169
 back-0.160
 down-0.138
 to-0.136
 out-0.130
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
ls+0.003
 closer+0.003
 back+0.003
s+0.003
 groceries+0.003
Neg logit contributions
kin-0.004
ena-0.004
 scalp-0.004
 unison-0.004
 End-0.004
Token\n
Token ID198
Feature activation+3.29e-01

Pos logit contributions
kin+0.036
 scalp+0.035
ena+0.035
 unison+0.034
ric+0.034
Neg logit contributions
 closer-0.031
ls-0.030
 back-0.027
s-0.026
 to-0.026
Token\n
Token ID198
Feature activation+7.36e-01

Pos logit contributions
 closer+0.009
ls+0.008
 back+0.008
 groceries+0.007
 off+0.007
Neg logit contributions
ena-0.007
kin-0.006
 unison-0.006
 scalp-0.006
 End-0.006
TokenL
Token ID43
Feature activation-1.54e+00

Pos logit contributions
 closer+0.018
ls+0.018
 back+0.016
s+0.016
 home+0.015
Neg logit contributions
 scalp-0.016
kin-0.016
ena-0.016
 unison-0.016
ric-0.015
Token be
Token ID307
Feature activation-1.39e+00

Pos logit contributions
kin+0.130
 unison+0.129
 scalp+0.127
 End+0.124
ena+0.122
Neg logit contributions
 closer-0.082
 back-0.074
ls-0.073
 to-0.064
 off-0.063
Tokenep
Token ID538
Feature activation-8.46e-01

Pos logit contributions
 scalp+0.027
kin+0.026
 unison+0.026
�+0.025
 End+0.025
Neg logit contributions
 closer-0.039
 back-0.037
ls-0.035
 home-0.034
 off-0.034
Token!"
Token ID2474
Feature activation-2.43e+00

Pos logit contributions
 scalp+0.026
 unison+0.026
kin+0.026
 End+0.025
ena+0.025
Neg logit contributions
 closer-0.013
 back-0.012
ls-0.012
s-0.011
 off-0.011
Token Ben
Token ID3932
Feature activation-2.87e-01

Pos logit contributions
kin+0.064
 End+0.056
 unison+0.055
 scalp+0.055
 sensation+0.054
Neg logit contributions
 closer-0.056
ls-0.053
 back-0.050
 to-0.050
s-0.049
Token says
Token ID1139
Feature activation-1.17e+00

Pos logit contributions
kin+0.009
 scalp+0.009
ena+0.009
 unison+0.009
 End+0.009
Neg logit contributions
 closer-0.004
ls-0.004
 back-0.003
s-0.003
 off-0.003
Token,
Token ID11
Feature activation-6.67e+00

Pos logit contributions
kin+0.031
ena+0.031
 scalp+0.031
 unison+0.030
 End+0.030
Neg logit contributions
ls-0.023
 closer-0.023
 back-0.019
s-0.019
 groceries-0.018
Token pushing
Token ID7796
Feature activation-6.60e-01

Pos logit contributions
kin+0.173
ities+0.165
OM+0.163
ena+0.159
 End+0.159
Neg logit contributions
 closer-0.178
 back-0.160
 down-0.145
 to-0.143
 out-0.139
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
ena+0.017
 scalp+0.016
kin+0.016
 unison+0.015
 End+0.015
Neg logit contributions
ls-0.014
 closer-0.013
 groceries-0.011
s-0.011
erry-0.011
Token buttons
Token ID12163
Feature activation+9.52e-01

Pos logit contributions
 closer+0.020
ls+0.020
 back+0.018
s+0.017
 home+0.016
Neg logit contributions
kin-0.026
ena-0.026
 scalp-0.026
 unison-0.025
 End-0.025
Token.
Token ID13
Feature activation-7.03e-01

Pos logit contributions
ls+0.018
 closer+0.017
 groceries+0.015
 back+0.015
erry+0.014
Neg logit contributions
ena-0.025
kin-0.025
 scalp-0.025
 unison-0.024
 End-0.023
Token He
Token ID679
Feature activation-6.61e-01

Pos logit contributions
kin+0.021
 unison+0.020
 End+0.020
ities+0.019
 scalp+0.019
Neg logit contributions
 closer-0.013
 to-0.012
 back-0.012
ls-0.012
s-0.011
Token,
Token ID11
Feature activation-3.52e+00

Pos logit contributions
 closer+0.011
ls+0.011
 back+0.010
s+0.009
 to+0.009
Neg logit contributions
kin-0.018
ena-0.018
 unison-0.018
 scalp-0.018
 End-0.017
Token okay
Token ID8788
Feature activation+1.23e+00

Pos logit contributions
kin+0.090
 unison+0.090
ric+0.089
�+0.089
OM+0.089
Neg logit contributions
 closer-0.076
 back-0.071
 to-0.065
 home-0.060
ls-0.059
Token?"
Token ID1701
Feature activation-2.29e+00

Pos logit contributions
 closer+0.024
ls+0.024
 back+0.021
s+0.020
 home+0.019
Neg logit contributions
kin-0.033
ena-0.033
 scalp-0.032
 unison-0.032
 End-0.031
Token mom
Token ID1995
Feature activation+1.52e+00

Pos logit contributions
kin+0.053
 unison+0.051
 scalp+0.051
ena+0.051
ric+0.049
Neg logit contributions
 closer-0.054
ls-0.053
 back-0.049
s-0.047
 to-0.045
Token says
Token ID1139
Feature activation-8.45e-01

Pos logit contributions
ls+0.028
 closer+0.027
 back+0.023
 home+0.023
 groceries+0.023
Neg logit contributions
ena-0.044
 scalp-0.040
 End-0.040
 unison-0.039
kin-0.039
Token,
Token ID11
Feature activation-6.67e+00

Pos logit contributions
kin+0.021
ena+0.021
 scalp+0.021
 unison+0.021
 End+0.020
Neg logit contributions
ls-0.017
 closer-0.017
 back-0.015
s-0.014
 groceries-0.014
Token hugging
Token ID46292
Feature activation+3.24e-01

Pos logit contributions
kin+0.174
ric+0.171
OM+0.162
ena+0.159
uders+0.159
Neg logit contributions
 closer-0.169
 back-0.163
 down-0.144
 to-0.142
 out-0.132
Token Anna
Token ID11735
Feature activation+6.97e-01

Pos logit contributions
 closer+0.008
ls+0.008
 back+0.007
s+0.007
 home+0.007
Neg logit contributions
kin-0.007
ena-0.007
 scalp-0.007
 unison-0.006
 End-0.006
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
ls+0.014
 closer+0.014
 back+0.012
s+0.011
 groceries+0.011
Neg logit contributions
ena-0.018
kin-0.018
 scalp-0.018
 unison-0.017
 End-0.017
Token\n
Token ID198
Feature activation+3.43e-01

Pos logit contributions
ric+0.042
kin+0.042
 unison+0.040
hers+0.039
 scalp+0.039
Neg logit contributions
 closer-0.024
 to-0.022
ls-0.022
s-0.022
 back-0.021
Token\n
Token ID198
Feature activation+8.16e-01

Pos logit contributions
ls+0.007
 closer+0.007
 back+0.007
s+0.007
 to+0.006
Neg logit contributions
kin-0.009
 scalp-0.008
ena-0.008
 unison-0.008
 rarely-0.008
Token it
Token ID340
Feature activation+1.50e+00

Pos logit contributions
ena+0.020
kin+0.020
 scalp+0.019
 unison+0.019
 End+0.018
Neg logit contributions
ls-0.014
 closer-0.014
 back-0.012
s-0.012
 groceries-0.011
Token!"
Token ID2474
Feature activation-1.96e+00

Pos logit contributions
ls+0.030
 closer+0.029
 groceries+0.025
s+0.024
 back+0.024
Neg logit contributions
ena-0.039
kin-0.038
 scalp-0.036
 End-0.035
 unison-0.035
Token Mom
Token ID11254
Feature activation+7.21e-01

Pos logit contributions
kin+0.043
 unison+0.042
ena+0.042
 scalp+0.042
 End+0.040
Neg logit contributions
ls-0.049
 closer-0.049
 back-0.043
s-0.043
 off-0.041
Tokenmy
Token ID1820
Feature activation+3.87e-01

Pos logit contributions
ls+0.012
 closer+0.012
 back+0.010
s+0.009
 groceries+0.009
Neg logit contributions
ena-0.022
kin-0.021
 scalp-0.021
 unison-0.021
 End-0.020
Token said
Token ID531
Feature activation-1.21e+00

Pos logit contributions
 closer+0.007
ls+0.007
 back+0.006
s+0.006
 off+0.005
Neg logit contributions
kin-0.011
 scalp-0.011
 unison-0.011
ena-0.011
 End-0.011
Token,
Token ID11
Feature activation-6.65e+00

Pos logit contributions
kin+0.033
ena+0.032
 scalp+0.032
 unison+0.032
 End+0.031
Neg logit contributions
 closer-0.023
ls-0.023
 back-0.020
s-0.019
 home-0.018
Token cl
Token ID537
Feature activation-2.61e-01

Pos logit contributions
kin+0.169
OM+0.162
 habitats+0.159
ric+0.158
ities+0.158
Neg logit contributions
 closer-0.170
 back-0.156
 down-0.144
 to-0.137
 off-0.134
Tokenapping
Token ID5912
Feature activation-7.83e-01

Pos logit contributions
kin+0.004
 unison+0.004
 scalp+0.004
ena+0.004
 End+0.004
Neg logit contributions
 closer-0.008
ls-0.008
 back-0.007
s-0.007
 off-0.007
Token her
Token ID607
Feature activation-1.10e-01

Pos logit contributions
kin+0.024
 unison+0.024
ena+0.024
 scalp+0.023
 End+0.023
Neg logit contributions
 closer-0.013
ls-0.012
 back-0.011
 off-0.010
s-0.010
Token hands
Token ID2832
Feature activation-3.01e-01

Pos logit contributions
 scalp+0.003
ena+0.003
kin+0.003
 unison+0.003
 End+0.002
Neg logit contributions
ls-0.002
 closer-0.002
 groceries-0.002
s-0.002
erry-0.002
Token.
Token ID13
Feature activation-8.94e-01

Pos logit contributions
ena+0.008
kin+0.008
 scalp+0.008
 unison+0.007
 End+0.007
Neg logit contributions
ls-0.006
 closer-0.006
 back-0.005
s-0.005
 groceries-0.005
Token,
Token ID11
Feature activation-3.01e+00

Pos logit contributions
ls+0.017
 closer+0.017
 back+0.015
 groceries+0.014
s+0.014
Neg logit contributions
ena-0.017
kin-0.017
 scalp-0.017
 End-0.016
 unison-0.016
Token Ben
Token ID3932
Feature activation+9.76e-01

Pos logit contributions
ric+0.068
 unison+0.067
ena+0.066
bes+0.065
 scalp+0.065
Neg logit contributions
 closer-0.071
ls-0.071
 back-0.069
 off-0.061
 home-0.059
Token,"
Token ID553
Feature activation-2.61e+00

Pos logit contributions
 closer+0.016
ls+0.016
 back+0.014
s+0.013
 home+0.012
Neg logit contributions
 unison-0.030
 scalp-0.029
kin-0.029
ena-0.029
 End-0.028
Token she
Token ID673
Feature activation-9.34e-01

Pos logit contributions
kin+0.061
ena+0.060
 scalp+0.060
 unison+0.059
 End+0.057
Neg logit contributions
 closer-0.060
ls-0.060
 back-0.053
s-0.051
 home-0.049
Token said
Token ID531
Feature activation-1.02e+00

Pos logit contributions
ena+0.024
kin+0.023
 scalp+0.023
 unison+0.022
 End+0.021
Neg logit contributions
ls-0.019
 closer-0.019
 back-0.017
 groceries-0.016
 home-0.016
Token,
Token ID11
Feature activation-6.64e+00

Pos logit contributions
kin+0.025
 scalp+0.025
ena+0.025
 unison+0.024
 End+0.023
Neg logit contributions
ls-0.022
 closer-0.021
 groceries-0.018
 back-0.018
s-0.017
Token smiling
Token ID16755
Feature activation+5.86e-01

Pos logit contributions
kin+0.179
OM+0.175
uders+0.174
ena+0.168
 unison+0.167
Neg logit contributions
 closer-0.163
 back-0.158
 down-0.138
 to-0.137
 out-0.126
Token.
Token ID13
Feature activation-1.56e+00

Pos logit contributions
ls+0.012
 closer+0.012
 groceries+0.010
 back+0.010
s+0.010
Neg logit contributions
 scalp-0.014
ena-0.014
kin-0.014
 unison-0.014
 End-0.013
Token "
Token ID366
Feature activation-3.52e+00

Pos logit contributions
kin+0.040
 unison+0.039
ena+0.036
ric+0.036
hers+0.036
Neg logit contributions
 to-0.036
 closer-0.036
 back-0.035
ls-0.033
s-0.033
TokenI
Token ID40
Feature activation-5.93e-01

Pos logit contributions
 unison+0.090
uders+0.084
kin+0.080
ena+0.080
ric+0.079
Neg logit contributions
 closer-0.073
 back-0.063
s-0.063
ls-0.061
 to-0.059
Token know
Token ID760
Feature activation+1.19e+00

Pos logit contributions
kin+0.015
 scalp+0.014
 unison+0.014
ena+0.014
 End+0.014
Neg logit contributions
 closer-0.013
ls-0.013
 back-0.012
s-0.011
 home-0.011
Token,
Token ID11
Feature activation-3.69e+00

Pos logit contributions
ls+0.027
 closer+0.027
 back+0.024
s+0.023
 to+0.022
Neg logit contributions
 scalp-0.045
kin-0.045
 unison-0.045
ena-0.044
 End-0.043
Token wait
Token ID4043
Feature activation+2.36e-01

Pos logit contributions
 unison+0.105
 scalp+0.103
kin+0.100
ena+0.099
ric+0.098
Neg logit contributions
 closer-0.071
 back-0.068
ls-0.060
 to-0.059
 away-0.055
Token!"
Token ID2474
Feature activation-2.86e+00

Pos logit contributions
 closer+0.003
 back+0.003
ls+0.003
 to+0.003
 home+0.003
Neg logit contributions
kin-0.008
ena-0.008
 unison-0.008
 scalp-0.008
ric-0.007
Token he
Token ID339
Feature activation-1.27e+00

Pos logit contributions
kin+0.083
 scalp+0.081
 unison+0.081
ric+0.081
 End+0.078
Neg logit contributions
 closer-0.053
ls-0.053
s-0.047
 back-0.047
 to-0.046
Token said
Token ID531
Feature activation-9.75e-01

Pos logit contributions
kin+0.035
ena+0.035
 scalp+0.034
 unison+0.034
 End+0.033
Neg logit contributions
 closer-0.024
ls-0.024
 back-0.021
s-0.019
 home-0.019
Token,
Token ID11
Feature activation-6.64e+00

Pos logit contributions
kin+0.023
 scalp+0.023
ena+0.023
 unison+0.022
 End+0.021
Neg logit contributions
ls-0.021
 closer-0.021
 groceries-0.018
 back-0.017
s-0.017
Token running
Token ID2491
Feature activation-3.61e-01

Pos logit contributions
kin+0.168
ities+0.160
mm+0.158
ena+0.158
OM+0.157
Neg logit contributions
 closer-0.172
 back-0.162
 down-0.153
 up-0.143
 to-0.143
Token after
Token ID706
Feature activation-1.25e+00

Pos logit contributions
ena+0.012
kin+0.012
 scalp+0.011
 unison+0.011
 End+0.011
Neg logit contributions
ls-0.004
 closer-0.003
erry-0.003
 groceries-0.003
lers-0.003
Token her
Token ID607
Feature activation+6.48e-01

Pos logit contributions
kin+0.038
 scalp+0.038
ena+0.037
 unison+0.037
ric+0.036
Neg logit contributions
 closer-0.021
ls-0.020
 back-0.018
 home-0.016
 off-0.016
Token.
Token ID13
Feature activation-8.65e-01

Pos logit contributions
ls+0.015
 closer+0.015
 groceries+0.012
 nearer+0.012
s+0.012
Neg logit contributions
kin-0.015
 scalp-0.014
ena-0.014
 unison-0.014
 End-0.013
Token "
Token ID366
Feature activation-2.36e+00

Pos logit contributions
kin+0.026
ric+0.026
 unison+0.026
 scalp+0.025
 End+0.025
Neg logit contributions
 to-0.017
 back-0.015
 closer-0.015
s-0.013
 down-0.013
Token the
Token ID262
Feature activation+2.43e-01

Pos logit contributions
 closer+0.015
ls+0.015
 back+0.013
s+0.012
erry+0.012
Neg logit contributions
ena-0.019
kin-0.019
 unison-0.019
 scalp-0.019
 End-0.018
Token swings
Token ID26728
Feature activation-2.90e+00

Pos logit contributions
 closer+0.006
ls+0.006
 back+0.006
s+0.005
 home+0.005
Neg logit contributions
kin-0.005
ena-0.005
 scalp-0.005
 unison-0.005
 End-0.005
Token?"
Token ID1701
Feature activation-2.96e+00

Pos logit contributions
kin+0.076
ena+0.075
 scalp+0.075
 unison+0.074
 End+0.072
Neg logit contributions
 closer-0.058
ls-0.057
 back-0.050
s-0.047
 home-0.046
Token she
Token ID673
Feature activation-1.90e+00

Pos logit contributions
kin+0.064
ena+0.063
 scalp+0.062
 unison+0.060
 End+0.060
Neg logit contributions
 closer-0.072
ls-0.072
 back-0.065
 groceries-0.061
s-0.061
Token said
Token ID531
Feature activation-9.96e-01

Pos logit contributions
kin+0.053
 scalp+0.052
 unison+0.052
ena+0.052
 End+0.051
Neg logit contributions
 closer-0.035
ls-0.034
 back-0.030
s-0.029
 off-0.027
Token,
Token ID11
Feature activation-6.64e+00

Pos logit contributions
kin+0.024
 scalp+0.024
ena+0.024
 unison+0.022
 End+0.021
Neg logit contributions
ls-0.022
 closer-0.021
 groceries-0.018
 back-0.018
s-0.017
Token feeling
Token ID4203
Feature activation-7.51e-01

Pos logit contributions
kin+0.165
OM+0.160
ric+0.155
uders+0.154
ena+0.153
Neg logit contributions
 closer-0.177
 back-0.162
 down-0.149
 to-0.142
 up-0.139
Token better
Token ID1365
Feature activation-3.40e-01

Pos logit contributions
ena+0.020
kin+0.020
 scalp+0.019
 unison+0.019
OM+0.019
Neg logit contributions
 closer-0.015
ls-0.015
 back-0.013
s-0.012
 off-0.012
Token.
Token ID13
Feature activation-1.69e+00

Pos logit contributions
 unison+0.012
kin+0.012
 rarely+0.012
ric+0.012
OM+0.011
Neg logit contributions
 closer-0.004
 back-0.004
 to-0.004
 home-0.004
ls-0.003
Token\n
Token ID198
Feature activation-5.97e-02

Pos logit contributions
kin+0.047
 unison+0.046
 scalp+0.045
ena+0.045
 End+0.045
Neg logit contributions
 closer-0.033
ls-0.032
s-0.029
 back-0.029
 to-0.028
Token\n
Token ID198
Feature activation+1.45e-01

Pos logit contributions
ena+0.001
kin+0.001
 scalp+0.001
 unison+0.001
 End+0.001
Neg logit contributions
 closer-0.002
ls-0.002
 back-0.001
 groceries-0.001
s-0.001
Token go
Token ID467
Feature activation-6.44e-01

Pos logit contributions
kin+0.004
ena+0.004
 scalp+0.004
 rarely+0.004
 unison+0.004
Neg logit contributions
ls-0.004
 closer-0.003
 groceries-0.003
erry-0.003
 back-0.003
Token zoom
Token ID19792
Feature activation-1.22e+00

Pos logit contributions
kin+0.016
ena+0.016
 rarely+0.015
 scalp+0.015
 unison+0.015
Neg logit contributions
ls-0.012
 closer-0.011
erry-0.010
 groceries-0.010
seed-0.009
Token,"
Token ID553
Feature activation-2.90e+00

Pos logit contributions
ena+0.032
kin+0.031
 scalp+0.030
 unison+0.030
 End+0.029
Neg logit contributions
ls-0.023
 closer-0.023
 groceries-0.020
 back-0.019
erry-0.019
Token Sara
Token ID24799
Feature activation+7.63e-02

Pos logit contributions
 unison+0.059
ena+0.059
kin+0.058
 scalp+0.058
ric+0.057
Neg logit contributions
 closer-0.077
ls-0.075
 back-0.068
s-0.066
 home-0.064
Token says
Token ID1139
Feature activation-3.87e-01

Pos logit contributions
 closer+0.001
ls+0.001
 back+0.001
s+0.001
 home+0.001
Neg logit contributions
ena-0.002
kin-0.002
 scalp-0.002
 unison-0.002
 End-0.002
Token,
Token ID11
Feature activation-6.62e+00

Pos logit contributions
 scalp+0.009
kin+0.009
ena+0.009
 unison+0.009
 End+0.008
Neg logit contributions
ls-0.009
 closer-0.008
 groceries-0.007
 back-0.007
s-0.007
Token pointing
Token ID10609
Feature activation+1.66e+00

Pos logit contributions
kin+0.172
ric+0.154
OM+0.154
 unison+0.151
uders+0.150
Neg logit contributions
 closer-0.175
 back-0.158
 down-0.150
 up-0.143
 to-0.143
Token to
Token ID284
Feature activation-1.21e+00

Pos logit contributions
ls+0.027
 groceries+0.022
 closer+0.022
erry+0.021
lers+0.019
Neg logit contributions
ena-0.050
 scalp-0.048
kin-0.045
 unison-0.044
 rarely-0.044
Token her
Token ID607
Feature activation-4.49e-01

Pos logit contributions
kin+0.039
 scalp+0.036
 End+0.036
ena+0.036
 unison+0.036
Neg logit contributions
 closer-0.020
 back-0.017
ls-0.017
 home-0.015
 to-0.015
Token blue
Token ID4171
Feature activation-3.20e-01

Pos logit contributions
kin+0.012
ena+0.012
 scalp+0.012
 unison+0.012
 End+0.011
Neg logit contributions
 closer-0.009
ls-0.009
 back-0.008
s-0.007
 home-0.007
Token car
Token ID1097
Feature activation+6.82e-01

Pos logit contributions
kin+0.010
ena+0.010
 scalp+0.009
 unison+0.009
 End+0.009
Neg logit contributions
 closer-0.005
ls-0.005
 back-0.004
s-0.004
 home-0.004
Token try
Token ID1949
Feature activation+2.19e+00

Pos logit contributions
ena+0.034
 scalp+0.034
kin+0.034
 unison+0.034
 End+0.033
Neg logit contributions
 closer-0.023
ls-0.022
 back-0.020
s-0.018
 home-0.018
Token this
Token ID428
Feature activation+6.17e-01

Pos logit contributions
ls+0.045
 closer+0.045
 back+0.039
 groceries+0.039
 home+0.037
Neg logit contributions
kin-0.050
 scalp-0.050
ena-0.049
 unison-0.048
ric-0.048
Token!"
Token ID2474
Feature activation-2.16e+00

Pos logit contributions
 closer+0.012
ls+0.012
 back+0.010
s+0.009
 groceries+0.009
Neg logit contributions
kin-0.017
ena-0.016
 scalp-0.016
 unison-0.016
 End-0.016
Token she
Token ID673
Feature activation-1.63e+00

Pos logit contributions
kin+0.054
ena+0.053
 scalp+0.053
 unison+0.052
 End+0.051
Neg logit contributions
 closer-0.047
ls-0.047
 back-0.041
s-0.039
 off-0.038
Token said
Token ID531
Feature activation-7.36e-01

Pos logit contributions
kin+0.046
 scalp+0.046
ena+0.046
 unison+0.046
 End+0.045
Neg logit contributions
 closer-0.029
ls-0.029
 back-0.024
s-0.024
 off-0.022
Token,
Token ID11
Feature activation-6.62e+00

Pos logit contributions
kin+0.018
 scalp+0.018
ena+0.018
 unison+0.018
 End+0.017
Neg logit contributions
ls-0.015
 closer-0.015
 back-0.013
 groceries-0.013
s-0.013
Token handing
Token ID22786
Feature activation+1.99e+00

Pos logit contributions
kin+0.167
OM+0.166
 End+0.160
ena+0.158
ric+0.157
Neg logit contributions
 closer-0.170
 back-0.154
 down-0.143
 to-0.142
 off-0.140
Token him
Token ID683
Feature activation+2.05e+00

Pos logit contributions
ls+0.045
 closer+0.038
erry+0.038
needed+0.037
earned+0.037
Neg logit contributions
ena-0.045
 scalp-0.042
kin-0.042
 End-0.042
era-0.038
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
ls+0.049
 closer+0.045
erry+0.039
 nearer+0.039
s+0.039
Neg logit contributions
ena-0.043
kin-0.040
 scalp-0.039
 End-0.037
era-0.036
Token cup
Token ID6508
Feature activation+1.27e+00

Pos logit contributions
ls+0.023
 closer+0.023
 back+0.020
s+0.019
 off+0.019
Neg logit contributions
kin-0.033
 scalp-0.033
ena-0.032
 unison-0.032
 End-0.032
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
ls+0.026
 closer+0.026
 unnecessary+0.022
 groceries+0.022
 nearer+0.021
Neg logit contributions
kin-0.032
 scalp-0.030
ena-0.029
 End-0.027
 unison-0.027
Token a
Token ID257
Feature activation+1.08e+00

Pos logit contributions
ena+0.003
kin+0.003
 scalp+0.003
 unison+0.003
 End+0.003
Neg logit contributions
ls-0.003
 closer-0.003
 back-0.003
s-0.003
 groceries-0.003
Token car
Token ID1097
Feature activation-1.17e-02

Pos logit contributions
ls+0.022
 closer+0.022
 back+0.019
s+0.018
 groceries+0.018
Neg logit contributions
kin-0.027
ena-0.027
 scalp-0.027
 unison-0.027
 End-0.026
Token!"
Token ID2474
Feature activation-2.78e+00

Pos logit contributions
ena+0.000
kin+0.000
 scalp+0.000
 unison+0.000
 End+0.000
Neg logit contributions
ls-0.000
 closer-0.000
 back-0.000
s-0.000
 groceries-0.000
Token Ben
Token ID3932
Feature activation-1.10e+00

Pos logit contributions
kin+0.043
ena+0.043
 scalp+0.042
 unison+0.041
 End+0.039
Neg logit contributions
 closer-0.085
ls-0.085
 back-0.078
s-0.076
 home-0.074
Token said
Token ID531
Feature activation-5.55e-01

Pos logit contributions
kin+0.034
 scalp+0.033
 End+0.033
 unison+0.033
ric+0.032
Neg logit contributions
 closer-0.018
ls-0.017
 back-0.015
s-0.015
 to-0.014
Token,
Token ID11
Feature activation-6.62e+00

Pos logit contributions
kin+0.014
 scalp+0.014
ena+0.014
 unison+0.013
 End+0.013
Neg logit contributions
ls-0.012
 closer-0.011
 back-0.010
 groceries-0.009
s-0.009
Token showing
Token ID4478
Feature activation+2.35e+00

Pos logit contributions
OM+0.157
kin+0.156
ric+0.156
ities+0.156
ena+0.154
Neg logit contributions
 closer-0.171
 back-0.157
 down-0.152
 to-0.145
 up-0.145
Token his
Token ID465
Feature activation+4.83e-01

Pos logit contributions
ls+0.059
 closer+0.056
erry+0.055
 groceries+0.051
needed+0.051
Neg logit contributions
ena-0.047
 scalp-0.044
kin-0.042
 unison-0.039
era-0.039
Token picture
Token ID4286
Feature activation+1.29e+00

Pos logit contributions
ls+0.011
 closer+0.011
 back+0.009
s+0.009
 off+0.009
Neg logit contributions
ena-0.011
kin-0.011
 scalp-0.011
 unison-0.011
 End-0.010
Token.
Token ID13
Feature activation-6.04e-01

Pos logit contributions
ls+0.022
 closer+0.021
 groceries+0.018
erry+0.018
 unnecessary+0.016
Neg logit contributions
ena-0.037
kin-0.036
 scalp-0.035
 unison-0.032
 sensation-0.032
Token "
Token ID366
Feature activation-2.57e+00

Pos logit contributions
ena+0.015
 unison+0.014
kin+0.014
ric+0.014
 scalp+0.014
Neg logit contributions
 closer-0.014
ls-0.013
 back-0.013
s-0.012
 to-0.012
Token for
Token ID329
Feature activation-1.54e+00

Pos logit contributions
 closer+0.015
ls+0.015
 groceries+0.014
 back+0.013
s+0.012
Neg logit contributions
kin-0.019
ena-0.019
 scalp-0.017
 unison-0.017
 End-0.016
Token you
Token ID345
Feature activation-2.44e-01

Pos logit contributions
kin+0.046
ena+0.046
 scalp+0.045
 unison+0.045
 End+0.044
Neg logit contributions
 closer-0.025
ls-0.025
 back-0.021
s-0.019
 home-0.019
Token!"
Token ID2474
Feature activation-1.89e+00

Pos logit contributions
ena+0.006
kin+0.006
 scalp+0.006
 unison+0.006
 End+0.005
Neg logit contributions
ls-0.005
 closer-0.005
 groceries-0.004
 back-0.004
s-0.004
Token Tom
Token ID4186
Feature activation-4.60e-01

Pos logit contributions
ena+0.029
kin+0.028
 scalp+0.027
 unison+0.027
 End+0.025
Neg logit contributions
 closer-0.059
ls-0.059
 back-0.054
s-0.052
 home-0.051
Token said
Token ID531
Feature activation-7.28e-01

Pos logit contributions
ena+0.012
kin+0.012
 scalp+0.012
 unison+0.011
 End+0.011
Neg logit contributions
ls-0.009
 closer-0.009
 back-0.008
 groceries-0.008
 home-0.007
Token,
Token ID11
Feature activation-6.62e+00

Pos logit contributions
kin+0.017
 scalp+0.017
ena+0.017
 unison+0.016
 End+0.015
Neg logit contributions
ls-0.016
 closer-0.016
 groceries-0.014
erry-0.013
 back-0.013
Token holding
Token ID4769
Feature activation-6.03e-01

Pos logit contributions
OM+0.152
ric+0.151
kin+0.151
ena+0.150
ities+0.146
Neg logit contributions
 closer-0.180
 back-0.163
 down-0.152
 up-0.146
 to-0.145
Token up
Token ID510
Feature activation-7.16e-01

Pos logit contributions
kin+0.017
ena+0.017
 scalp+0.016
 unison+0.016
 End+0.015
Neg logit contributions
ls-0.011
 closer-0.011
s-0.009
erry-0.009
 groceries-0.009
Token his
Token ID465
Feature activation-3.16e-01

Pos logit contributions
kin+0.022
 scalp+0.021
ena+0.021
 End+0.021
 unison+0.021
Neg logit contributions
 closer-0.012
ls-0.011
 back-0.010
 to-0.009
 home-0.009
Token creation
Token ID6282
Feature activation+1.06e+00

Pos logit contributions
kin+0.008
 scalp+0.008
ena+0.008
 End+0.008
 unison+0.007
Neg logit contributions
ls-0.007
 closer-0.007
s-0.006
 off-0.005
 back-0.005
Token.
Token ID13
Feature activation-5.08e-01

Pos logit contributions
 closer+0.021
ls+0.020
 back+0.018
s+0.017
 home+0.016
Neg logit contributions
kin-0.028
ena-0.028
 scalp-0.028
 unison-0.027
 End-0.027
Token than
Token ID621
Feature activation+1.29e+00

Pos logit contributions
kin+0.029
ena+0.028
 scalp+0.027
uff+0.025
pit+0.025
Neg logit contributions
 closer-0.025
ls-0.023
 back-0.021
 groceries-0.020
needed-0.020
Token you
Token ID345
Feature activation-1.54e+00

Pos logit contributions
ls+0.025
 closer+0.024
s+0.020
 back+0.020
erry+0.019
Neg logit contributions
ena-0.035
kin-0.034
 scalp-0.033
 unison-0.032
 End-0.032
Token!"
Token ID2474
Feature activation-2.81e+00

Pos logit contributions
ena+0.044
kin+0.044
 scalp+0.043
 unison+0.043
 End+0.042
Neg logit contributions
ls-0.026
 closer-0.026
 back-0.022
s-0.020
 groceries-0.020
Token Tom
Token ID4186
Feature activation-5.22e-01

Pos logit contributions
ena+0.046
kin+0.044
 scalp+0.043
 unison+0.042
 End+0.041
Neg logit contributions
 closer-0.084
ls-0.083
 back-0.077
 home-0.073
s-0.072
Token said
Token ID531
Feature activation-7.08e-01

Pos logit contributions
ena+0.014
kin+0.013
 scalp+0.013
 unison+0.013
 End+0.012
Neg logit contributions
ls-0.011
 closer-0.011
 back-0.009
 groceries-0.009
 home-0.009
Token,
Token ID11
Feature activation-6.61e+00

Pos logit contributions
kin+0.018
ena+0.017
 scalp+0.017
 unison+0.017
 End+0.016
Neg logit contributions
ls-0.015
 closer-0.014
 groceries-0.012
 back-0.012
s-0.012
Token scoop
Token ID23529
Feature activation-6.49e-01

Pos logit contributions
ities+0.162
ric+0.162
OM+0.158
kin+0.157
ena+0.156
Neg logit contributions
 closer-0.171
 back-0.154
 down-0.147
 to-0.142
 up-0.141
Tokening
Token ID278
Feature activation-7.21e-01

Pos logit contributions
ena+0.017
kin+0.017
 scalp+0.016
 End+0.015
uff+0.015
Neg logit contributions
 closer-0.012
ls-0.012
 groceries-0.010
erry-0.010
needed-0.009
Token up
Token ID510
Feature activation-1.17e-01

Pos logit contributions
ena+0.020
kin+0.020
 scalp+0.019
 unison+0.019
 End+0.018
Neg logit contributions
ls-0.013
 closer-0.012
 groceries-0.010
erry-0.010
 back-0.010
Token a
Token ID257
Feature activation+2.50e+00

Pos logit contributions
ena+0.003
kin+0.003
 scalp+0.003
 unison+0.003
 End+0.003
Neg logit contributions
ls-0.002
 closer-0.002
 back-0.002
s-0.002
 home-0.002
Token lot
Token ID1256
Feature activation+1.99e+00

Pos logit contributions
ls+0.055
 closer+0.050
 groceries+0.045
 back+0.045
 off+0.043
Neg logit contributions
kin-0.061
 unison-0.057
 scalp-0.056
ena-0.055
 End-0.053
Token this
Token ID428
Feature activation-1.02e-01

Pos logit contributions
 closer+0.047
ls+0.046
 back+0.039
s+0.037
 groceries+0.035
Neg logit contributions
ena-0.059
kin-0.056
 unison-0.055
 End-0.055
 scalp-0.054
Token stuff
Token ID3404
Feature activation-4.00e-01

Pos logit contributions
kin+0.003
ena+0.003
 scalp+0.003
 unison+0.003
 End+0.003
Neg logit contributions
 closer-0.002
ls-0.002
 back-0.002
s-0.002
 home-0.001
Token!"
Token ID2474
Feature activation-2.52e+00

Pos logit contributions
kin+0.011
ena+0.011
 scalp+0.011
 unison+0.011
 End+0.010
Neg logit contributions
 closer-0.007
ls-0.007
 back-0.006
s-0.006
 home-0.006
Token Ben
Token ID3932
Feature activation-1.29e+00

Pos logit contributions
kin+0.047
ena+0.046
 scalp+0.046
ric+0.045
 unison+0.044
Neg logit contributions
ls-0.072
 closer-0.071
 back-0.064
s-0.063
 off-0.061
Token said
Token ID531
Feature activation-5.95e-01

Pos logit contributions
kin+0.040
 End+0.039
 scalp+0.039
 unison+0.038
ric+0.038
Neg logit contributions
 closer-0.021
ls-0.020
s-0.017
 back-0.017
 to-0.016
Token,
Token ID11
Feature activation-6.61e+00

Pos logit contributions
kin+0.013
 scalp+0.013
ena+0.012
 unison+0.012
 End+0.011
Neg logit contributions
ls-0.014
 closer-0.013
 groceries-0.013
erry-0.012
s-0.011
Token picking
Token ID10868
Feature activation+4.71e-01

Pos logit contributions
kin+0.169
OM+0.159
ric+0.152
ena+0.151
ities+0.149
Neg logit contributions
 closer-0.182
 back-0.160
 down-0.151
 up-0.145
 to-0.141
Token up
Token ID510
Feature activation+8.11e-01

Pos logit contributions
ls+0.005
 closer+0.005
s+0.003
 back+0.003
erry+0.003
Neg logit contributions
ena-0.016
kin-0.016
 scalp-0.016
 unison-0.015
 End-0.015
Token a
Token ID257
Feature activation+3.11e+00

Pos logit contributions
 closer+0.015
ls+0.015
 back+0.013
s+0.012
 home+0.012
Neg logit contributions
kin-0.022
ena-0.022
 scalp-0.022
 unison-0.021
 End-0.021
Token hat
Token ID6877
Feature activation-1.95e-01

Pos logit contributions
ls+0.066
 closer+0.061
 back+0.054
s+0.052
 groceries+0.052
Neg logit contributions
kin-0.078
 scalp-0.074
 unison-0.073
ena-0.072
 End-0.069
Token and
Token ID290
Feature activation+5.78e-01

Pos logit contributions
kin+0.005
 unison+0.005
 scalp+0.005
ena+0.005
 End+0.005
Neg logit contributions
 closer-0.004
ls-0.004
 back-0.003
s-0.003
 off-0.003
Token"
Token ID1
Feature activation+1.92e-01

Pos logit contributions
 closer+0.019
ls+0.018
s+0.017
 back+0.016
 to+0.015
Neg logit contributions
 scalp-0.015
kin-0.014
ena-0.013
ric-0.013
 unison-0.013
TokenLook
Token ID8567
Feature activation+3.10e+00

Pos logit contributions
 closer+0.005
ls+0.004
 back+0.004
s+0.004
 groceries+0.004
Neg logit contributions
kin-0.004
 scalp-0.004
ric-0.004
 rarely-0.004
ena-0.004
Token,
Token ID11
Feature activation-6.47e-01

Pos logit contributions
ls+0.068
 closer+0.068
 groceries+0.062
needed+0.061
earned+0.060
Neg logit contributions
ena-0.068
kin-0.067
 scalp-0.062
 unison-0.060
uff-0.060
Token Ben
Token ID3932
Feature activation+9.81e-01

Pos logit contributions
kin+0.014
ena+0.014
 scalp+0.014
 unison+0.014
 End+0.013
Neg logit contributions
 closer-0.016
ls-0.016
 back-0.014
s-0.013
 home-0.013
Token,
Token ID11
Feature activation-1.51e+00

Pos logit contributions
 closer+0.017
ls+0.016
 back+0.015
s+0.013
 home+0.013
Neg logit contributions
 scalp-0.029
 unison-0.029
kin-0.029
ena-0.028
 End-0.028
Token look
Token ID804
Feature activation+2.75e+00

Pos logit contributions
ena+0.039
kin+0.039
 scalp+0.038
 unison+0.037
 End+0.036
Neg logit contributions
ls-0.031
 closer-0.030
 back-0.026
s-0.026
 home-0.025
Token!"
Token ID2474
Feature activation-2.85e+00

Pos logit contributions
ls+0.054
 closer+0.052
 groceries+0.048
s+0.047
 back+0.046
Neg logit contributions
ena-0.067
kin-0.066
 unison-0.064
 scalp-0.063
 End-0.060
Token Jill
Token ID22772
Feature activation+2.35e-01

Pos logit contributions
kin+0.065
 scalp+0.064
 unison+0.063
ena+0.063
 End+0.062
Neg logit contributions
 closer-0.068
ls-0.066
 back-0.060
s-0.058
 off-0.056
Token shouted
Token ID17293
Feature activation-7.02e-01

Pos logit contributions
 closer+0.004
ls+0.004
 back+0.004
s+0.003
 off+0.003
Neg logit contributions
kin-0.007
 scalp-0.007
 unison-0.007
ena-0.007
 End-0.006
Token.
Token ID13
Feature activation-1.58e+00

Pos logit contributions
 scalp+0.016
ena+0.016
kin+0.016
 unison+0.015
 End+0.014
Neg logit contributions
ls-0.016
 closer-0.015
 groceries-0.014
erry-0.013
s-0.013
Token "
Token ID366
Feature activation-3.25e+00

Pos logit contributions
kin+0.040
 End+0.039
ric+0.037
 unison+0.037
 scalp+0.036
Neg logit contributions
 closer-0.037
 to-0.036
 back-0.033
ls-0.033
s-0.033
Token brother
Token ID3956
Feature activation+1.41e+00

Pos logit contributions
 closer+0.026
ls+0.026
 back+0.022
s+0.022
 off+0.021
Neg logit contributions
 scalp-0.019
kin-0.019
ena-0.019
 unison-0.018
 End-0.017
Token came
Token ID1625
Feature activation+1.34e+00

Pos logit contributions
 closer+0.029
ls+0.029
 back+0.027
s+0.025
 to+0.024
Neg logit contributions
kin-0.036
 scalp-0.035
 unison-0.035
ena-0.034
 End-0.034
Token and
Token ID290
Feature activation+3.10e+00

Pos logit contributions
ls+0.026
 groceries+0.020
 closer+0.019
erry+0.018
s+0.018
Neg logit contributions
ena-0.037
 pretending-0.034
uff-0.034
 scalp-0.034
 unison-0.033
Token tried
Token ID3088
Feature activation+2.35e+00

Pos logit contributions
ls+0.074
 closer+0.071
 back+0.065
 home+0.065
 groceries+0.065
Neg logit contributions
ena-0.066
kin-0.061
 scalp-0.061
 rarely-0.060
 unison-0.060
Token to
Token ID284
Feature activation+1.68e-01

Pos logit contributions
ls+0.022
 closer+0.017
s+0.011
erry+0.011
 back+0.010
Neg logit contributions
ena-0.086
 scalp-0.085
kin-0.083
 unison-0.082
 End-0.082
Token take
Token ID1011
Feature activation+2.88e+00

Pos logit contributions
ls+0.003
 closer+0.003
 back+0.003
s+0.003
 home+0.003
Neg logit contributions
kin-0.004
ena-0.004
 scalp-0.004
 unison-0.004
 End-0.004
Token the
Token ID262
Feature activation+2.34e+00

Pos logit contributions
ls+0.070
 closer+0.065
s+0.058
erry+0.057
 nearer+0.053
Neg logit contributions
ena-0.061
kin-0.060
 scalp-0.060
 unison-0.058
 End-0.055
Token ball
Token ID2613
Feature activation+3.12e+00

Pos logit contributions
ls+0.049
 closer+0.045
s+0.040
 back+0.039
 off+0.038
Neg logit contributions
kin-0.061
 scalp-0.056
 unison-0.055
 End-0.055
 sensation-0.052
Token away
Token ID1497
Feature activation+1.24e+00

Pos logit contributions
ls+0.049
lers+0.035
 closer+0.035
 unnecessary+0.034
 groceries+0.034
Neg logit contributions
kin-0.091
ena-0.082
 scalp-0.080
 sensation-0.078
 size-0.078
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
 closer+0.022
ls+0.022
 back+0.019
s+0.018
 off+0.018
Neg logit contributions
ena-0.035
kin-0.035
 scalp-0.035
 unison-0.034
 End-0.034
Token "
Token ID366
Feature activation-9.26e-01

Pos logit contributions
 closer+0.029
ls+0.028
 back+0.026
s+0.026
 to+0.025
Neg logit contributions
kin-0.032
ena-0.031
 unison-0.031
 scalp-0.030
 End-0.030
Token.
Token ID13
Feature activation+2.46e-01

Pos logit contributions
kin+0.030
ena+0.030
 scalp+0.030
 unison+0.030
 End+0.029
Neg logit contributions
 closer-0.018
ls-0.018
 back-0.016
s-0.015
 home-0.014
Token\n
Token ID198
Feature activation+6.94e-01

Pos logit contributions
 closer+0.006
ls+0.005
s+0.005
 back+0.005
 to+0.005
Neg logit contributions
kin-0.006
 scalp-0.006
ena-0.006
 End-0.006
 unison-0.006
Token\n
Token ID198
Feature activation+1.04e+00

Pos logit contributions
 closer+0.015
ls+0.015
s+0.013
 back+0.013
 to+0.012
Neg logit contributions
 scalp-0.018
kin-0.018
ena-0.017
 unison-0.017
ities-0.016
TokenThey
Token ID2990
Feature activation-3.57e-01

Pos logit contributions
 closer+0.024
ls+0.024
s+0.022
 back+0.020
 to+0.019
Neg logit contributions
 scalp-0.026
kin-0.024
ities-0.024
 rarely-0.024
ena-0.024
Token get
Token ID651
Feature activation+2.85e+00

Pos logit contributions
kin+0.010
ena+0.010
 scalp+0.010
 End+0.010
 unison+0.010
Neg logit contributions
 closer-0.007
ls-0.006
 back-0.006
 away-0.005
 off-0.005
Token a
Token ID257
Feature activation+4.01e+00

Pos logit contributions
ls+0.046
earned+0.040
away+0.038
erry+0.038
ches+0.035
Neg logit contributions
 scalp-0.076
 End-0.074
ena-0.073
kin-0.072
 making-0.070
Token m
Token ID285
Feature activation+5.32e-01

Pos logit contributions
ls+0.095
lers+0.082
 closer+0.080
 off+0.079
 aback+0.078
Neg logit contributions
kin-0.089
 scalp-0.080
 unison-0.078
 End-0.073
ena-0.073
Tokenitt
Token ID715
Feature activation+3.55e+00

Pos logit contributions
 closer+0.015
ls+0.015
 back+0.014
 off+0.013
 groceries+0.013
Neg logit contributions
kin-0.009
ena-0.009
 scalp-0.009
 unison-0.009
 End-0.009
Token and
Token ID290
Feature activation+2.42e+00

Pos logit contributions
 closer+0.059
ls+0.056
 back+0.051
 groceries+0.049
 home+0.047
Neg logit contributions
kin-0.100
ena-0.099
 scalp-0.098
 unison-0.094
 End-0.092
Token a
Token ID257
Feature activation+2.23e+00

Pos logit contributions
ls+0.052
 closer+0.049
 back+0.044
s+0.044
 home+0.042
Neg logit contributions
 scalp-0.059
ena-0.058
kin-0.058
 unison-0.058
 End-0.055
Token spat
Token ID15246
Feature activation+1.73e-01

Pos logit contributions
ls+0.053
 closer+0.050
 back+0.045
 off+0.043
s+0.043
Neg logit contributions
kin-0.051
 scalp-0.049
 unison-0.048
ena-0.047
 End-0.046
Token ran
Token ID4966
Feature activation+1.05e+00

Pos logit contributions
 closer+0.032
ls+0.031
 back+0.029
s+0.028
 off+0.026
Neg logit contributions
kin-0.037
 scalp-0.036
ena-0.036
 unison-0.035
 End-0.035
Token over
Token ID625
Feature activation+4.10e-01

Pos logit contributions
ls+0.016
erry+0.013
lers+0.012
seed+0.011
 groceries+0.011
Neg logit contributions
ena-0.032
 unison-0.030
 lear-0.030
 scalp-0.030
 pretending-0.029
Token to
Token ID284
Feature activation-7.43e-03

Pos logit contributions
ls+0.004
 closer+0.003
 groceries+0.002
s+0.002
erry+0.002
Neg logit contributions
ena-0.015
 unison-0.015
kin-0.015
 scalp-0.014
 End-0.014
Token him
Token ID683
Feature activation+9.25e-01

Pos logit contributions
ena+0.000
kin+0.000
 unison+0.000
 scalp+0.000
 End+0.000
Neg logit contributions
ls-0.000
 closer-0.000
s-0.000
 back-0.000
 groceries-0.000
Token,
Token ID11
Feature activation-2.62e+00

Pos logit contributions
ls+0.019
 closer+0.019
 back+0.016
 groceries+0.016
s+0.016
Neg logit contributions
ena-0.023
kin-0.023
 scalp-0.023
 unison-0.022
 End-0.021
Token giving
Token ID3501
Feature activation+3.25e+00

Pos logit contributions
kin+0.074
ities+0.066
ena+0.065
 unison+0.065
 scalp+0.064
Neg logit contributions
 closer-0.067
 back-0.061
 to-0.058
 down-0.056
 out-0.053
Token him
Token ID683
Feature activation+2.52e+00

Pos logit contributions
ls+0.084
erry+0.082
 closer+0.069
earned+0.068
needed+0.068
Neg logit contributions
ena-0.061
 End-0.055
 unison-0.052
 scalp-0.052
kin-0.050
Token a
Token ID257
Feature activation+4.08e+00

Pos logit contributions
ls+0.073
 closer+0.064
s+0.060
erry+0.057
 nearer+0.056
Neg logit contributions
ena-0.044
 unison-0.039
kin-0.035
 scalp-0.034
 End-0.033
Token big
Token ID1263
Feature activation+4.04e+00

Pos logit contributions
ls+0.097
lers+0.078
 groceries+0.073
erry+0.071
s+0.067
Neg logit contributions
kin-0.091
 unison-0.089
ena-0.085
 scalp-0.084
OM-0.083
Token hug
Token ID16225
Feature activation+3.91e-01

Pos logit contributions
ls+0.085
 closer+0.070
s+0.068
 groceries+0.065
lers+0.062
Neg logit contributions
kin-0.104
 unison-0.102
 scalp-0.101
ena-0.099
 End-0.094
Token.
Token ID13
Feature activation+1.40e+00

Pos logit contributions
 closer+0.007
ls+0.006
 back+0.006
s+0.006
 to+0.005
Neg logit contributions
kin-0.011
 scalp-0.011
ena-0.011
 unison-0.011
 End-0.011
Token She
Token ID1375
Feature activation-7.83e-01

Pos logit contributions
 closer+0.007
ls+0.007
 back+0.007
s+0.006
 to+0.006
Neg logit contributions
kin-0.008
ena-0.008
ric-0.008
 unison-0.007
 scalp-0.007
Token tried
Token ID3088
Feature activation+1.99e+00

Pos logit contributions
kin+0.018
 unison+0.018
 scalp+0.018
ena+0.018
 End+0.017
Neg logit contributions
 closer-0.018
ls-0.018
 back-0.016
s-0.015
 to-0.015
Token to
Token ID284
Feature activation-7.13e-01

Pos logit contributions
ls+0.018
 closer+0.016
s+0.010
 back+0.010
 groceries+0.009
Neg logit contributions
ena-0.074
 scalp-0.072
kin-0.072
 unison-0.070
 End-0.070
Token reach
Token ID3151
Feature activation+1.62e+00

Pos logit contributions
ena+0.021
kin+0.021
 scalp+0.021
 unison+0.020
 rarely+0.020
Neg logit contributions
ls-0.013
 closer-0.011
s-0.010
lers-0.009
 back-0.009
Token the
Token ID262
Feature activation+1.20e+00

Pos logit contributions
ls+0.034
 closer+0.032
s+0.029
lers+0.027
erry+0.027
Neg logit contributions
ena-0.039
kin-0.038
 scalp-0.037
 End-0.036
 unison-0.035
Token key
Token ID1994
Feature activation+2.94e+00

Pos logit contributions
 closer+0.025
ls+0.024
 back+0.022
s+0.020
 home+0.020
Neg logit contributions
kin-0.031
ena-0.031
 scalp-0.030
 unison-0.030
 End-0.029
Token,
Token ID11
Feature activation-2.88e+00

Pos logit contributions
ls+0.065
 closer+0.061
 groceries+0.054
 back+0.052
erry+0.051
Neg logit contributions
ena-0.070
kin-0.069
 scalp-0.066
 unison-0.064
 End-0.061
Token but
Token ID475
Feature activation+4.01e-01

Pos logit contributions
kin+0.070
 unison+0.068
ena+0.065
 scalp+0.064
 End+0.064
Neg logit contributions
 closer-0.073
 back-0.065
ls-0.063
 to-0.062
 off-0.059
Token she
Token ID673
Feature activation-1.02e+00

Pos logit contributions
 closer+0.008
ls+0.008
 back+0.007
s+0.006
 home+0.006
Neg logit contributions
ena-0.011
kin-0.011
 unison-0.011
 scalp-0.011
 End-0.010
Token could
Token ID714
Feature activation-2.47e+00

Pos logit contributions
 unison+0.026
 scalp+0.024
 End+0.023
ric+0.023
kin+0.023
Neg logit contributions
ls-0.022
 closer-0.022
 back-0.020
s-0.020
 to-0.018
Token not
Token ID407
Feature activation+2.31e+00

Pos logit contributions
 unison+0.036
kin+0.033
ena+0.033
 scalp+0.033
 End+0.033
Neg logit contributions
 closer-0.083
ls-0.080
 back-0.077
 to-0.074
 off-0.073
Tokenmy
Token ID1820
Feature activation+8.60e-01

Pos logit contributions
 closer+0.007
ls+0.006
 back+0.006
s+0.005
 home+0.005
Neg logit contributions
kin-0.013
 unison-0.013
 scalp-0.013
ena-0.012
 End-0.012
Token couldn
Token ID3521
Feature activation-4.46e-01

Pos logit contributions
 closer+0.018
ls+0.017
 back+0.016
s+0.015
 home+0.015
Neg logit contributions
kin-0.022
 scalp-0.022
 unison-0.022
ena-0.022
 End-0.021
Token't
Token ID470
Feature activation+1.73e+00

Pos logit contributions
kin+0.008
ena+0.008
 scalp+0.008
 unison+0.007
 End+0.007
Neg logit contributions
 closer-0.013
ls-0.013
 back-0.012
s-0.011
 home-0.011
Token help
Token ID1037
Feature activation+1.17e+00

Pos logit contributions
ls+0.036
 closer+0.035
 back+0.030
s+0.030
 groceries+0.028
Neg logit contributions
kin-0.044
ena-0.044
 scalp-0.043
 unison-0.043
 End-0.042
Token but
Token ID475
Feature activation+7.28e-01

Pos logit contributions
ls+0.035
 closer+0.034
 back+0.031
s+0.031
 groceries+0.029
Neg logit contributions
ena-0.019
kin-0.019
 scalp-0.019
 unison-0.018
 End-0.018
Token smile
Token ID8212
Feature activation+9.61e-01

Pos logit contributions
ls+0.017
 closer+0.016
s+0.014
 groceries+0.014
 back+0.014
Neg logit contributions
kin-0.017
 scalp-0.016
 unison-0.016
ena-0.016
 End-0.016
Token a
Token ID257
Feature activation+5.58e-01

Pos logit contributions
 closer+0.017
ls+0.016
 back+0.015
s+0.014
 off+0.013
Neg logit contributions
kin-0.028
ena-0.028
 scalp-0.027
 unison-0.027
 End-0.027
Token little
Token ID1310
Feature activation+8.74e-01

Pos logit contributions
 closer+0.011
 back+0.010
ls+0.010
s+0.009
 home+0.009
Neg logit contributions
ena-0.016
kin-0.015
 scalp-0.015
ric-0.014
ities-0.014
Token.
Token ID13
Feature activation+9.72e-01

Pos logit contributions
ls+0.018
 closer+0.016
 groceries+0.014
s+0.014
 back+0.014
Neg logit contributions
 scalp-0.022
 unison-0.022
kin-0.022
ena-0.022
 End-0.021
Token Even
Token ID3412
Feature activation+1.61e+00

Pos logit contributions
 closer+0.017
 back+0.016
ls+0.016
 to+0.015
 home+0.015
Neg logit contributions
kin-0.028
ena-0.027
 unison-0.027
 scalp-0.027
 End-0.025
Token though
Token ID996
Feature activation+4.23e-01

Pos logit contributions
 closer+0.035
ls+0.035
 back+0.031
s+0.029
 home+0.028
Neg logit contributions
kin-0.040
ena-0.039
 scalp-0.039
 unison-0.039
 End-0.037
Token that
Token ID326
Feature activation-1.74e-01

Pos logit contributions
 closer+0.004
ls+0.004
 back+0.003
s+0.003
 home+0.003
Neg logit contributions
kin-0.005
ena-0.005
 scalp-0.005
 unison-0.005
 End-0.005
Token would
Token ID561
Feature activation-3.22e+00

Pos logit contributions
ena+0.004
kin+0.004
 scalp+0.004
 unison+0.004
 End+0.004
Neg logit contributions
 closer-0.004
ls-0.004
 back-0.003
s-0.003
 home-0.003
Token stretch
Token ID7539
Feature activation-1.85e+00

Pos logit contributions
ena+0.085
 scalp+0.083
kin+0.083
 unison+0.080
 End+0.079
Neg logit contributions
 closer-0.069
ls-0.062
 back-0.060
 to-0.055
 off-0.055
Token and
Token ID290
Feature activation+1.23e+00

Pos logit contributions
kin+0.051
ena+0.050
 scalp+0.049
 unison+0.049
 End+0.048
Neg logit contributions
ls-0.034
 closer-0.034
 back-0.028
s-0.027
 groceries-0.027
Token stretch
Token ID7539
Feature activation-1.53e+00

Pos logit contributions
ls+0.026
 closer+0.025
 back+0.022
s+0.021
 groceries+0.020
Neg logit contributions
kin-0.031
 scalp-0.031
ena-0.031
 unison-0.030
 End-0.029
Token and
Token ID290
Feature activation+1.04e+00

Pos logit contributions
kin+0.043
ena+0.043
 scalp+0.042
 unison+0.042
 End+0.041
Neg logit contributions
 closer-0.027
ls-0.027
 back-0.023
s-0.022
 groceries-0.021
Token stretch
Token ID7539
Feature activation-1.42e+00

Pos logit contributions
 closer+0.021
ls+0.021
 back+0.018
s+0.018
 home+0.017
Neg logit contributions
kin-0.027
ena-0.027
 scalp-0.026
 unison-0.026
 End-0.025
Token.
Token ID13
Feature activation-2.54e-02

Pos logit contributions
kin+0.039
ena+0.039
 scalp+0.039
 unison+0.038
 End+0.037
Neg logit contributions
 closer-0.026
ls-0.026
 back-0.023
s-0.021
 home-0.020
Token He
Token ID679
Feature activation-1.21e+00

Pos logit contributions
kin+0.001
ena+0.001
 scalp+0.001
 unison+0.001
 End+0.001
Neg logit contributions
 closer-0.001
ls-0.001
 back-0.000
s-0.000
 home-0.000
Token asked
Token ID1965
Feature activation+4.48e-02

Pos logit contributions
 scalp+0.028
kin+0.028
ena+0.028
 unison+0.028
 End+0.027
Neg logit contributions
 closer-0.028
ls-0.027
 back-0.025
s-0.024
 off-0.023
Token Sally
Token ID25737
Feature activation+3.57e+00

Pos logit contributions
ls+0.001
 closer+0.001
s+0.001
lers+0.001
 groceries+0.001
Neg logit contributions
kin-0.001
 End-0.001
ena-0.001
 rarely-0.001
 scalp-0.001
Token!
Token ID0
Feature activation+5.39e-01

Pos logit contributions
ena+0.027
kin+0.027
 scalp+0.027
 unison+0.026
 End+0.026
Neg logit contributions
 closer-0.019
ls-0.018
 back-0.016
s-0.015
 home-0.014
Token<|endoftext|>
Token ID50256
Feature activation-3.22e+00

Pos logit contributions
 closer+0.010
s+0.010
ls+0.009
 back+0.009
 to+0.009
Neg logit contributions
kin-0.015
 scalp-0.014
ena-0.014
ities-0.014
 unison-0.014
TokenOnce
Token ID7454
Feature activation+1.74e+00

Pos logit contributions
 scalp+0.086
kin+0.084
ena+0.082
 unison+0.082
 rarely+0.080
Neg logit contributions
 closer-0.065
ls-0.065
 back-0.058
s-0.057
 home-0.054
Token upon
Token ID2402
Feature activation-2.59e+00

Pos logit contributions
 closer+0.041
ls+0.040
 back+0.036
 groceries+0.035
 off+0.034
Neg logit contributions
kin-0.039
ena-0.038
 End-0.037
 scalp-0.036
 unison-0.035
Token a
Token ID257
Feature activation-1.21e-01

Pos logit contributions
kin+0.078
 scalp+0.077
 unison+0.076
ena+0.075
 End+0.074
Neg logit contributions
 closer-0.043
ls-0.039
 back-0.037
 to-0.033
s-0.032
Token time
Token ID640
Feature activation-1.86e-01

Pos logit contributions
 scalp+0.003
ric+0.003
kin+0.003
ena+0.003
stack+0.002
Neg logit contributions
 closer-0.003
ls-0.003
 back-0.003
s-0.002
 home-0.002
Token,
Token ID11
Feature activation-4.27e-01

Pos logit contributions
 scalp+0.005
 unison+0.005
ena+0.005
kin+0.005
 End+0.005
Neg logit contributions
 closer-0.003
ls-0.003
 back-0.003
s-0.003
 home-0.003
Token a
Token ID257
Feature activation-6.45e-01

Pos logit contributions
 scalp+0.014
 unison+0.014
ena+0.013
ities+0.013
kin+0.013
Neg logit contributions
 closer-0.007
ls-0.006
 back-0.006
 home-0.005
s-0.005
Token boy
Token ID2933
Feature activation-7.27e-01

Pos logit contributions
 unison+0.018
 scalp+0.018
ric+0.018
ities+0.018
ena+0.018
Neg logit contributions
 closer-0.011
ls-0.011
 back-0.010
s-0.009
 home-0.009
Token wanted
Token ID2227
Feature activation+1.62e+00

Pos logit contributions
kin+0.015
 scalp+0.015
 unison+0.015
ena+0.014
ities+0.014
Neg logit contributions
 closer-0.019
ls-0.018
 back-0.017
s-0.017
 to-0.017
Token to
Token ID284
Feature activation+1.52e-01

Pos logit contributions
ls+0.028
 closer+0.021
lers+0.019
s+0.018
erry+0.018
Neg logit contributions
ena-0.050
 scalp-0.047
 pretending-0.046
 creating-0.045
 always-0.045
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
kin+0.002
ena+0.002
 scalp+0.002
 unison+0.002
 End+0.002
Neg logit contributions
ls-0.001
 closer-0.001
 back-0.001
s-0.001
 groceries-0.001
Token They
Token ID1119
Feature activation-3.78e-01

Pos logit contributions
 closer+0.028
 back+0.027
ls+0.027
 to+0.027
s+0.026
Neg logit contributions
kin-0.028
ena-0.026
ric-0.026
 End-0.025
 unison-0.025
Token told
Token ID1297
Feature activation+7.15e-01

Pos logit contributions
kin+0.010
 scalp+0.010
 End+0.010
ena+0.010
 unison+0.010
Neg logit contributions
 closer-0.008
ls-0.008
 back-0.007
s-0.007
 off-0.006
Token her
Token ID607
Feature activation+1.10e+00

Pos logit contributions
ls+0.016
 closer+0.015
s+0.014
erry+0.013
lers+0.013
Neg logit contributions
ena-0.017
 scalp-0.017
kin-0.016
 End-0.016
 unison-0.016
Token what
Token ID644
Feature activation+6.26e-01

Pos logit contributions
 closer+0.025
ls+0.025
 back+0.022
s+0.022
 groceries+0.021
Neg logit contributions
kin-0.025
ena-0.025
 scalp-0.024
 unison-0.024
 End-0.023
Token happened
Token ID3022
Feature activation+2.30e-01

Pos logit contributions
 closer+0.014
ls+0.014
 back+0.013
s+0.012
 home+0.012
Neg logit contributions
kin-0.015
ena-0.015
 scalp-0.015
 unison-0.014
 End-0.014
Token.
Token ID13
Feature activation+7.54e-01

Pos logit contributions
 closer+0.004
 back+0.003
ls+0.003
 to+0.003
s+0.003
Neg logit contributions
kin-0.008
 scalp-0.008
 unison-0.008
ena-0.007
 rarely-0.007
Token They
Token ID1119
Feature activation-3.01e-01

Pos logit contributions
 closer+0.016
ls+0.016
 to+0.015
 back+0.015
s+0.015
Neg logit contributions
kin-0.019
ric-0.018
ena-0.018
 End-0.018
 scalp-0.018
Token said
Token ID531
Feature activation+4.93e-02

Pos logit contributions
kin+0.008
 scalp+0.008
ena+0.008
 unison+0.007
 End+0.007
Neg logit contributions
 closer-0.006
ls-0.006
 back-0.006
s-0.005
 off-0.005
Token they
Token ID484
Feature activation+2.16e-01

Pos logit contributions
 closer+0.001
ls+0.001
 back+0.001
s+0.001
 home+0.001
Neg logit contributions
kin-0.001
ena-0.001
 scalp-0.001
 unison-0.001
 End-0.001
Token were
Token ID547
Feature activation-1.96e+00

Pos logit contributions
 closer+0.006
ls+0.006
 back+0.005
s+0.005
 home+0.005
Neg logit contributions
kin-0.004
 scalp-0.004
ena-0.004
 unison-0.004
 End-0.004
Token an
Token ID281
Feature activation+1.22e+00

Pos logit contributions
kin+0.038
ena+0.038
 scalp+0.038
 unison+0.037
 End+0.036
Neg logit contributions
 closer-0.027
ls-0.027
 back-0.023
s-0.022
 home-0.021
Token elephant
Token ID20950
Feature activation+2.10e-01

Pos logit contributions
ls+0.027
 closer+0.025
s+0.023
 back+0.022
 off+0.021
Neg logit contributions
ena-0.030
kin-0.030
 scalp-0.029
 unison-0.029
 End-0.029
Token."
Token ID526
Feature activation-1.21e+00

Pos logit contributions
 closer+0.004
ls+0.004
 back+0.003
s+0.003
 home+0.003
Neg logit contributions
kin-0.006
 scalp-0.006
ena-0.006
 unison-0.006
 End-0.006
Token Ma
Token ID6669
Feature activation-5.91e-01

Pos logit contributions
kin+0.035
 unison+0.032
 scalp+0.032
ities+0.031
 End+0.031
Neg logit contributions
 closer-0.024
ls-0.022
 to-0.022
s-0.022
 back-0.021
Tokenia
Token ID544
Feature activation+3.87e-01

Pos logit contributions
 unison+0.011
 scalp+0.010
kin+0.010
 End+0.010
 rarely+0.010
Neg logit contributions
ls-0.017
 closer-0.017
 back-0.015
s-0.015
 to-0.015
Token smiled
Token ID13541
Feature activation-1.98e-01

Pos logit contributions
 closer+0.008
ls+0.008
 back+0.007
s+0.006
 off+0.006
Neg logit contributions
kin-0.010
 scalp-0.010
ena-0.010
 unison-0.010
 End-0.010
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
kin+0.005
ena+0.005
 scalp+0.005
 unison+0.005
 End+0.005
Neg logit contributions
 closer-0.004
ls-0.004
 back-0.004
s-0.004
 home-0.003
Token spent
Token ID3377
Feature activation-2.04e+00

Pos logit contributions
 closer+0.020
ls+0.019
 back+0.017
s+0.016
 off+0.016
Neg logit contributions
kin-0.031
ena-0.031
 unison-0.031
 scalp-0.031
 End-0.030
Token some
Token ID617
Feature activation+1.95e-01

Pos logit contributions
 scalp+0.062
kin+0.062
ena+0.061
 unison+0.060
OM+0.059
Neg logit contributions
 closer-0.033
ls-0.032
 back-0.029
 home-0.026
 off-0.026
Token time
Token ID640
Feature activation-8.58e-01

Pos logit contributions
 closer+0.005
ls+0.005
 back+0.004
s+0.004
 home+0.004
Neg logit contributions
kin-0.004
ena-0.004
 scalp-0.004
 unison-0.004
 End-0.004
Token looking
Token ID2045
Feature activation+1.60e+00

Pos logit contributions
kin+0.020
 unison+0.020
ena+0.020
 scalp+0.020
 End+0.019
Neg logit contributions
 closer-0.020
ls-0.019
 back-0.018
 home-0.016
s-0.016
Token could
Token ID714
Feature activation-1.86e+00

Pos logit contributions
kin+0.027
ena+0.027
 scalp+0.027
 unison+0.026
 End+0.026
Neg logit contributions
 closer-0.034
ls-0.033
 back-0.030
s-0.029
 home-0.028
Token do
Token ID466
Feature activation-1.60e+00

Pos logit contributions
kin+0.047
ena+0.047
 scalp+0.047
 unison+0.047
 End+0.046
Neg logit contributions
 closer-0.039
ls-0.037
 back-0.034
s-0.031
 home-0.031
Token.
Token ID13
Feature activation+5.18e-01

Pos logit contributions
 scalp+0.047
 unison+0.047
ena+0.046
kin+0.046
 End+0.045
Neg logit contributions
 closer-0.028
ls-0.027
 back-0.025
 to-0.023
 home-0.022
Token\n
Token ID198
Feature activation+3.22e-01

Pos logit contributions
 closer+0.012
ls+0.012
 back+0.011
s+0.010
 home+0.010
Neg logit contributions
kin-0.013
 scalp-0.012
ena-0.012
 unison-0.012
 End-0.011
Token\n
Token ID198
Feature activation+2.39e-01

Pos logit contributions
 closer+0.008
ls+0.008
 back+0.007
s+0.007
 home+0.006
Neg logit contributions
kin-0.007
ena-0.007
 scalp-0.007
 unison-0.007
 End-0.007
TokenHe
Token ID1544
Feature activation-1.15e+00

Pos logit contributions
 closer+0.006
ls+0.006
 back+0.005
s+0.005
 off+0.005
Neg logit contributions
 scalp-0.005
kin-0.005
ena-0.005
 unison-0.005
ities-0.004
Token took
Token ID1718
Feature activation+2.08e+00

Pos logit contributions
kin+0.028
 scalp+0.028
ena+0.027
 unison+0.027
 End+0.026
Neg logit contributions
 closer-0.026
ls-0.025
 back-0.023
s-0.022
 off-0.021
Token the
Token ID262
Feature activation+1.61e+00

Pos logit contributions
ls+0.050
s+0.045
 closer+0.042
erry+0.041
 groceries+0.039
Neg logit contributions
ena-0.044
kin-0.041
 unison-0.040
 End-0.039
 sensation-0.038
Token lot
Token ID1256
Feature activation+2.78e+00

Pos logit contributions
ls+0.039
 closer+0.038
 back+0.035
s+0.034
 off+0.033
Neg logit contributions
kin-0.035
 scalp-0.033
ena-0.033
 End-0.032
 unison-0.032
Tokenion
Token ID295
Feature activation+1.32e+00

Pos logit contributions
ls+0.045
 closer+0.044
 back+0.038
 groceries+0.037
 home+0.035
Neg logit contributions
ena-0.079
kin-0.078
 End-0.073
 scalp-0.072
 unison-0.071
Token and
Token ID290
Feature activation+1.37e+00

Pos logit contributions
ls+0.024
 closer+0.022
 groceries+0.019
erry+0.018
earned+0.018
Neg logit contributions
kin-0.034
 scalp-0.032
ena-0.031
 unison-0.030
 End-0.030
Token them
Token ID606
Feature activation+1.22e+00

Pos logit contributions
ls+0.038
 closer+0.038
erry+0.034
s+0.033
 groceries+0.031
Neg logit contributions
ena-0.043
kin-0.042
 scalp-0.041
 unison-0.041
 rarely-0.039
Token.
Token ID13
Feature activation+9.34e-01

Pos logit contributions
ls+0.029
 closer+0.027
erry+0.024
 groceries+0.024
s+0.023
Neg logit contributions
kin-0.026
ena-0.026
 scalp-0.025
 unison-0.024
 End-0.024
Token He
Token ID679
Feature activation-7.39e-01

Pos logit contributions
 closer+0.016
ls+0.015
 back+0.015
 to+0.013
s+0.013
Neg logit contributions
kin-0.028
 unison-0.027
 scalp-0.027
ena-0.026
ric-0.026
Token said
Token ID531
Feature activation+6.25e-01

Pos logit contributions
kin+0.017
 scalp+0.016
 unison+0.016
 End+0.016
ena+0.016
Neg logit contributions
 closer-0.018
ls-0.018
 back-0.016
s-0.015
 to-0.014
Token,
Token ID11
Feature activation-2.86e+00

Pos logit contributions
 closer+0.016
ls+0.015
 groceries+0.015
needed+0.014
erry+0.014
Neg logit contributions
kin-0.012
ena-0.011
 scalp-0.010
 End-0.010
 rarely-0.010
Token "
Token ID366
Feature activation-1.02e+00

Pos logit contributions
 unison+0.069
kin+0.067
ena+0.064
 scalp+0.062
ric+0.062
Neg logit contributions
 closer-0.072
 to-0.070
 back-0.070
ls-0.063
s-0.062
TokenNo
Token ID2949
Feature activation-3.50e-01

Pos logit contributions
kin+0.020
 unison+0.019
ena+0.019
 scalp+0.019
ric+0.018
Neg logit contributions
 closer-0.028
ls-0.027
 back-0.025
s-0.025
 away-0.024
Token,
Token ID11
Feature activation-2.29e+00

Pos logit contributions
ena+0.009
kin+0.009
 scalp+0.009
 unison+0.009
 End+0.009
Neg logit contributions
 closer-0.007
ls-0.007
 back-0.006
s-0.006
 home-0.006
Token they
Token ID484
Feature activation-1.10e+00

Pos logit contributions
kin+0.051
ena+0.051
 scalp+0.050
 unison+0.049
 End+0.048
Neg logit contributions
 closer-0.054
ls-0.054
 back-0.048
s-0.047
 home-0.045
Token are
Token ID389
Feature activation-1.18e+00

Pos logit contributions
kin+0.023
ena+0.022
 scalp+0.022
 unison+0.022
 End+0.021
Neg logit contributions
 closer-0.028
ls-0.028
 back-0.025
 home-0.023
s-0.023
Token mine
Token ID6164
Feature activation+1.98e+00

Pos logit contributions
ena+0.028
kin+0.028
 scalp+0.027
 unison+0.026
 End+0.026
Neg logit contributions
ls-0.026
 closer-0.026
s-0.023
 back-0.022
 groceries-0.022
Token sandbox
Token ID35204
Feature activation-3.57e+00

Pos logit contributions
ena+0.008
kin+0.007
 scalp+0.007
 unison+0.007
 End+0.007
Neg logit contributions
 closer-0.009
ls-0.009
 back-0.008
s-0.008
 home-0.008
Token.
Token ID13
Feature activation-3.23e-02

Pos logit contributions
 rarely+0.130
 unison+0.127
ric+0.126
oyd+0.126
uren+0.126
Neg logit contributions
 back-0.037
ls-0.035
 to-0.034
 closer-0.033
s-0.030
Token Sometimes
Token ID8975
Feature activation-4.00e-01

Pos logit contributions
kin+0.001
 scalp+0.001
ena+0.001
ities+0.001
 rarely+0.001
Neg logit contributions
 closer-0.001
ls-0.001
s-0.001
 back-0.001
 to-0.001
Token they
Token ID484
Feature activation-1.13e+00

Pos logit contributions
kin+0.009
ena+0.009
 scalp+0.009
 unison+0.009
 End+0.009
Neg logit contributions
 closer-0.009
ls-0.009
 back-0.008
s-0.008
 home-0.008
Token pretend
Token ID16614
Feature activation-8.71e-01

Pos logit contributions
kin+0.028
ena+0.028
 scalp+0.027
 unison+0.027
 End+0.026
Neg logit contributions
 closer-0.024
ls-0.024
 back-0.021
s-0.020
 home-0.020
Token to
Token ID284
Feature activation-1.05e+00

Pos logit contributions
ena+0.025
kin+0.025
 scalp+0.024
 unison+0.024
 rarely+0.024
Neg logit contributions
 closer-0.014
ls-0.013
 back-0.010
erry-0.010
 groceries-0.010
Token be
Token ID307
Feature activation+1.58e-01

Pos logit contributions
kin+0.026
ena+0.026
 scalp+0.026
 unison+0.025
 End+0.025
Neg logit contributions
 closer-0.023
ls-0.022
 back-0.020
s-0.019
 home-0.019
Token animals
Token ID4695
Feature activation-9.94e-01

Pos logit contributions
 closer+0.003
ls+0.003
 back+0.003
 home+0.002
 to+0.002
Neg logit contributions
kin-0.004
 scalp-0.004
 unison-0.004
 End-0.004
ena-0.004
Token and
Token ID290
Feature activation+6.14e-01

Pos logit contributions
kin+0.025
ena+0.024
 scalp+0.024
 unison+0.024
 End+0.023
Neg logit contributions
 closer-0.021
ls-0.021
 back-0.018
s-0.018
 home-0.017
Token run
Token ID1057
Feature activation-2.16e-01

Pos logit contributions
 closer+0.013
ls+0.012
 back+0.011
s+0.010
 home+0.010
Neg logit contributions
kin-0.016
ena-0.016
 scalp-0.016
 unison-0.015
 End-0.015
Token and
Token ID290
Feature activation+1.28e+00

Pos logit contributions
ena+0.006
kin+0.006
 unison+0.006
 scalp+0.006
 rarely+0.006
Neg logit contributions
ls-0.003
 closer-0.003
 groceries-0.002
erry-0.002
 back-0.002
Token Dave
Token ID9935
Feature activation+1.19e+00

Pos logit contributions
 closer+0.011
ls+0.011
 back+0.010
s+0.010
 to+0.010
Neg logit contributions
kin-0.009
 scalp-0.009
ena-0.009
 unison-0.008
 End-0.008
Token waited
Token ID13488
Feature activation-1.17e+00

Pos logit contributions
 closer+0.026
ls+0.026
s+0.023
 back+0.023
 to+0.021
Neg logit contributions
 scalp-0.030
kin-0.029
 unison-0.029
ena-0.028
 End-0.028
Token patiently
Token ID37220
Feature activation-1.13e+00

Pos logit contributions
 unison+0.034
ric+0.034
 scalp+0.034
ena+0.034
kin+0.034
Neg logit contributions
 closer-0.024
 back-0.021
 to-0.020
ls-0.020
 out-0.018
Token for
Token ID329
Feature activation-1.22e-01

Pos logit contributions
kin+0.037
 unison+0.036
 scalp+0.036
ena+0.036
ric+0.035
Neg logit contributions
 closer-0.020
 to-0.018
 back-0.017
 off-0.016
 away-0.015
Token them
Token ID606
Feature activation+1.17e+00

Pos logit contributions
kin+0.003
ena+0.003
 scalp+0.003
 unison+0.003
 End+0.003
Neg logit contributions
ls-0.002
 closer-0.002
 back-0.002
s-0.002
 home-0.002
Token to
Token ID284
Feature activation-2.76e+00

Pos logit contributions
ls+0.011
 closer+0.010
 back+0.006
s+0.005
 groceries+0.005
Neg logit contributions
ena-0.043
kin-0.042
 scalp-0.041
 unison-0.041
 End-0.040
Token finish
Token ID5461
Feature activation-1.16e+00

Pos logit contributions
 scalp+0.082
ena+0.082
kin+0.080
 End+0.078
uff+0.078
Neg logit contributions
 closer-0.048
 back-0.042
 home-0.039
 off-0.037
 away-0.037
Token.
Token ID13
Feature activation+3.42e-01

Pos logit contributions
 scalp+0.035
ena+0.034
kin+0.034
 End+0.034
 unison+0.033
Neg logit contributions
 closer-0.020
ls-0.018
 back-0.017
 to-0.017
 home-0.017
Token\n
Token ID198
Feature activation+1.15e-01

Pos logit contributions
 closer+0.008
ls+0.007
 back+0.007
 to+0.007
s+0.007
Neg logit contributions
kin-0.009
 scalp-0.008
ena-0.008
 unison-0.008
ities-0.008
Token\n
Token ID198
Feature activation-3.81e-03

Pos logit contributions
 closer+0.003
ls+0.003
 back+0.003
s+0.002
 groceries+0.002
Neg logit contributions
kin-0.002
ena-0.002
 scalp-0.002
 unison-0.002
 End-0.002
TokenAfter
Token ID3260
Feature activation-1.88e+00

Pos logit contributions
 scalp+0.000
kin+0.000
ena+0.000
 unison+0.000
ric+0.000
Neg logit contributions
 closer-0.000
ls-0.000
 back-0.000
s-0.000
 off-0.000
Token.
Token ID13
Feature activation+4.54e-01

Pos logit contributions
kin+0.045
 unison+0.043
 scalp+0.043
ena+0.043
 End+0.042
Neg logit contributions
ls-0.030
 closer-0.028
 back-0.026
 to-0.024
s-0.024
Token You
Token ID921
Feature activation-1.18e+00

Pos logit contributions
 closer+0.011
ls+0.010
 back+0.010
 to+0.009
 off+0.009
Neg logit contributions
kin-0.010
 scalp-0.009
 End-0.009
 unison-0.009
ena-0.009
Token are
Token ID389
Feature activation-2.20e+00

Pos logit contributions
kin+0.030
 scalp+0.030
ena+0.029
 unison+0.029
 End+0.029
Neg logit contributions
 closer-0.025
ls-0.024
 back-0.022
s-0.021
 off-0.020
Token taller
Token ID25242
Feature activation-1.93e+00

Pos logit contributions
kin+0.061
 scalp+0.060
 unison+0.060
ena+0.059
 End+0.057
Neg logit contributions
 closer-0.042
ls-0.039
 back-0.037
 off-0.033
 home-0.033
Token than
Token ID621
Feature activation+8.14e-02

Pos logit contributions
 unison+0.041
kin+0.040
 scalp+0.040
 End+0.039
ric+0.038
Neg logit contributions
ls-0.050
 closer-0.049
 back-0.046
 to-0.046
 home-0.044
Token me
Token ID502
Feature activation-1.58e+00

Pos logit contributions
 closer+0.002
ls+0.001
 back+0.001
 home+0.001
 to+0.001
Neg logit contributions
kin-0.002
 unison-0.002
 scalp-0.002
ena-0.002
 End-0.002
Token."
Token ID526
Feature activation-1.45e+00

Pos logit contributions
kin+0.042
 scalp+0.041
ena+0.041
 unison+0.041
 End+0.040
Neg logit contributions
 closer-0.031
ls-0.031
 back-0.027
s-0.026
 home-0.025
Token\n
Token ID198
Feature activation-1.52e-01

Pos logit contributions
kin+0.040
 unison+0.038
 scalp+0.038
ena+0.037
 rarely+0.037
Neg logit contributions
 closer-0.029
ls-0.029
s-0.026
 back-0.026
 to-0.026
Token\n
Token ID198
Feature activation+3.44e-01

Pos logit contributions
kin+0.004
 scalp+0.004
 unison+0.004
ena+0.004
 End+0.003
Neg logit contributions
ls-0.003
 closer-0.003
 back-0.003
s-0.003
 home-0.003
TokenBen
Token ID11696
Feature activation-5.62e-01

Pos logit contributions
 closer+0.008
ls+0.008
s+0.008
 back+0.007
 to+0.007
Neg logit contributions
 scalp-0.007
kin-0.007
ric-0.007
 rarely-0.007
 unison-0.007
Token said
Token ID531
Feature activation+2.46e-01

Pos logit contributions
kin+0.015
ric+0.015
 scalp+0.014
 unison+0.014
 End+0.014
Neg logit contributions
 closer-0.011
ls-0.011
s-0.011
 back-0.010
 to-0.010
Token to
Token ID284
Feature activation-2.38e+00

Pos logit contributions
kin+0.039
ena+0.038
 scalp+0.038
 unison+0.038
 End+0.037
Neg logit contributions
 closer-0.005
ls-0.004
 back-0.002
s-0.001
 home-0.001
Token stand
Token ID1302
Feature activation-1.02e+00

Pos logit contributions
kin+0.063
 scalp+0.062
ena+0.062
 unison+0.061
 End+0.059
Neg logit contributions
 closer-0.048
ls-0.043
 back-0.042
 home-0.040
 to-0.038
Token up
Token ID510
Feature activation+3.41e-02

Pos logit contributions
kin+0.031
ena+0.030
 scalp+0.030
 unison+0.029
 End+0.029
Neg logit contributions
ls-0.016
 closer-0.016
 back-0.013
s-0.012
 groceries-0.012
Token for
Token ID329
Feature activation-1.66e+00

Pos logit contributions
ls+0.001
 closer+0.001
 back+0.001
s+0.001
 groceries+0.001
Neg logit contributions
kin-0.001
ena-0.001
 scalp-0.001
 unison-0.001
 End-0.001
Token ourselves
Token ID6731
Feature activation+1.04e+00

Pos logit contributions
kin+0.046
ena+0.046
 scalp+0.045
 unison+0.045
 End+0.043
Neg logit contributions
 closer-0.031
ls-0.030
 back-0.027
s-0.025
 home-0.025
Token,
Token ID11
Feature activation-3.93e+00

Pos logit contributions
 closer+0.020
ls+0.020
 back+0.018
s+0.017
 home+0.016
Neg logit contributions
ena-0.028
kin-0.028
 scalp-0.028
 unison-0.027
 End-0.027
Token but
Token ID475
Feature activation+4.39e-01

Pos logit contributions
kin+0.090
 scalp+0.087
ena+0.087
 unison+0.086
 End+0.085
Neg logit contributions
 closer-0.097
ls-0.088
 back-0.087
 to-0.083
 home-0.080
Token not
Token ID407
Feature activation+1.54e+00

Pos logit contributions
 closer+0.011
ls+0.010
 back+0.009
s+0.009
 home+0.009
Neg logit contributions
kin-0.010
ena-0.010
 scalp-0.009
 unison-0.009
 End-0.009
Token to
Token ID284
Feature activation-1.91e+00

Pos logit contributions
 closer+0.033
ls+0.033
 back+0.029
s+0.028
 home+0.027
Neg logit contributions
kin-0.038
ena-0.038
 scalp-0.037
 unison-0.037
 End-0.036
Token be
Token ID307
Feature activation+2.82e-01

Pos logit contributions
kin+0.050
ena+0.049
 scalp+0.048
 unison+0.048
 End+0.047
Neg logit contributions
 closer-0.040
ls-0.037
 back-0.035
 home-0.033
 off-0.032
Token mean
Token ID1612
Feature activation+1.91e+00

Pos logit contributions
ls+0.006
 closer+0.006
s+0.005
 back+0.005
 groceries+0.005
Neg logit contributions
ena-0.007
kin-0.007
 scalp-0.007
 unison-0.007
 rarely-0.006
Token she
Token ID673
Feature activation-4.16e-01

Pos logit contributions
 closer+0.000
ls+0.000
 back+0.000
 home+0.000
 off+0.000
Neg logit contributions
kin-0.000
ena-0.000
 scalp-0.000
 unison-0.000
 End-0.000
Token heard
Token ID2982
Feature activation+7.87e-01

Pos logit contributions
kin+0.010
ena+0.010
 scalp+0.010
 unison+0.009
 End+0.009
Neg logit contributions
 closer-0.009
ls-0.009
 back-0.008
s-0.008
 home-0.008
Token a
Token ID257
Feature activation+2.44e+00

Pos logit contributions
ls+0.016
 closer+0.016
 back+0.014
s+0.013
 groceries+0.013
Neg logit contributions
ena-0.020
kin-0.020
 scalp-0.020
 unison-0.020
 End-0.019
Token loud
Token ID7812
Feature activation+1.64e-01

Pos logit contributions
ls+0.056
 closer+0.056
 back+0.050
s+0.048
 groceries+0.047
Neg logit contributions
kin-0.055
ena-0.055
 scalp-0.055
 unison-0.054
 End-0.053
Token scree
Token ID43465
Feature activation+7.64e-01

Pos logit contributions
 closer+0.004
ls+0.003
 back+0.003
s+0.003
 to+0.003
Neg logit contributions
kin-0.004
ena-0.004
 scalp-0.004
 unison-0.004
 End-0.004
Tokench
Token ID354
Feature activation-4.32e+00

Pos logit contributions
ls+0.014
 closer+0.013
 groceries+0.012
 home+0.011
 back+0.011
Neg logit contributions
ena-0.021
 unison-0.020
 End-0.020
 scalp-0.020
kin-0.020
Token.
Token ID13
Feature activation+6.48e-01

Pos logit contributions
 scalp+0.126
 unison+0.124
 End+0.124
 rarely+0.121
kin+0.120
Neg logit contributions
 closer-0.076
ls-0.075
 back-0.072
 off-0.070
 to-0.069
Token A
Token ID317
Feature activation+1.47e+00

Pos logit contributions
 closer+0.013
ls+0.012
 back+0.012
 to+0.011
 off+0.011
Neg logit contributions
kin-0.018
ena-0.017
 End-0.016
 unison-0.016
 scalp-0.016
Token big
Token ID1263
Feature activation+1.24e+00

Pos logit contributions
 closer+0.027
ls+0.026
 back+0.023
s+0.021
 home+0.020
Neg logit contributions
kin-0.041
ena-0.041
 scalp-0.041
 unison-0.040
 End-0.039
Token bird
Token ID6512
Feature activation+8.85e-01

Pos logit contributions
 closer+0.025
ls+0.024
 back+0.022
 home+0.020
s+0.020
Neg logit contributions
kin-0.033
ena-0.032
 scalp-0.032
 unison-0.031
 End-0.031
Token flew
Token ID13112
Feature activation+1.07e+00

Pos logit contributions
 closer+0.020
ls+0.020
 back+0.017
s+0.017
 groceries+0.016
Neg logit contributions
kin-0.021
ena-0.021
 scalp-0.020
 unison-0.020
 End-0.020
Token.
Token ID13
Feature activation+6.99e-02

Pos logit contributions
 closer+0.009
ls+0.008
 back+0.007
s+0.007
 home+0.007
Neg logit contributions
kin-0.013
ena-0.013
 scalp-0.013
 unison-0.013
 End-0.013
Token The
Token ID383
Feature activation-2.30e-01

Pos logit contributions
ls+0.001
 closer+0.001
 back+0.001
s+0.001
 to+0.001
Neg logit contributions
kin-0.002
ena-0.002
 unison-0.002
 scalp-0.002
 End-0.002
Token car
Token ID1097
Feature activation+9.53e-01

Pos logit contributions
ena+0.008
 scalp+0.007
kin+0.007
 unison+0.007
ric+0.007
Neg logit contributions
 closer-0.003
 back-0.003
ls-0.003
 home-0.002
s-0.002
Token went
Token ID1816
Feature activation+6.73e-01

Pos logit contributions
 closer+0.020
ls+0.019
 back+0.019
s+0.017
 off+0.017
Neg logit contributions
 unison-0.025
kin-0.025
 scalp-0.025
ric-0.024
 End-0.024
Token backward
Token ID19528
Feature activation-1.60e+00

Pos logit contributions
ls+0.010
 closer+0.010
 back+0.008
s+0.008
 groceries+0.008
Neg logit contributions
ena-0.020
kin-0.020
 scalp-0.020
 unison-0.019
 End-0.019
Token,
Token ID11
Feature activation-3.97e+00

Pos logit contributions
 unison+0.052
kin+0.051
ric+0.051
 End+0.050
 scalp+0.050
Neg logit contributions
 back-0.027
 closer-0.027
 to-0.025
ls-0.024
 off-0.024
Token and
Token ID290
Feature activation+3.27e-01

Pos logit contributions
kin+0.128
 unison+0.123
ities+0.120
ena+0.112
 scalp+0.112
Neg logit contributions
 closer-0.087
 back-0.082
 down-0.075
 to-0.072
 away-0.070
Token Spot
Token ID15899
Feature activation+2.42e-01

Pos logit contributions
 closer+0.007
ls+0.006
 back+0.006
s+0.005
 off+0.005
Neg logit contributions
kin-0.009
ena-0.009
 scalp-0.009
 unison-0.009
 End-0.008
Token me
Token ID502
Feature activation-2.12e+00

Pos logit contributions
 closer+0.005
 back+0.005
s+0.005
ls+0.005
 to+0.004
Neg logit contributions
 unison-0.007
kin-0.007
 End-0.007
 scalp-0.006
ities-0.006
Tokenowed
Token ID6972
Feature activation-7.65e-01

Pos logit contributions
 End+0.060
 unison+0.059
 scalp+0.056
kin+0.055
 rarely+0.054
Neg logit contributions
 closer-0.040
ls-0.036
 back-0.036
s-0.034
 to-0.033
Token louder
Token ID27089
Feature activation-2.56e+00

Pos logit contributions
kin+0.021
ena+0.021
 scalp+0.021
 unison+0.021
 End+0.020
Neg logit contributions
 closer-0.014
ls-0.014
 back-0.013
s-0.011
 off-0.011
Token toys
Token ID14958
Feature activation-7.23e-02

Pos logit contributions
 closer+0.009
ls+0.009
 back+0.008
s+0.007
 home+0.007
Neg logit contributions
kin-0.012
ena-0.012
 scalp-0.012
 unison-0.012
 End-0.012
Token,
Token ID11
Feature activation-3.02e+00

Pos logit contributions
kin+0.002
ena+0.002
 scalp+0.002
 unison+0.002
 End+0.002
Neg logit contributions
 closer-0.002
ls-0.001
 back-0.001
s-0.001
 home-0.001
Token books
Token ID3835
Feature activation-2.06e+00

Pos logit contributions
ena+0.073
kin+0.072
 scalp+0.072
 unison+0.071
 End+0.068
Neg logit contributions
ls-0.065
 closer-0.065
 back-0.057
s-0.055
 home-0.054
Token,
Token ID11
Feature activation-3.82e+00

Pos logit contributions
 scalp+0.062
 unison+0.061
ena+0.060
kin+0.060
ric+0.059
Neg logit contributions
ls-0.036
 closer-0.035
 back-0.032
s-0.031
 to-0.029
Token clothes
Token ID8242
Feature activation-1.26e+00

Pos logit contributions
kin+0.097
ena+0.097
 scalp+0.096
 unison+0.095
 End+0.092
Neg logit contributions
 closer-0.079
ls-0.078
 back-0.069
s-0.065
 home-0.063
Token,
Token ID11
Feature activation-4.21e+00

Pos logit contributions
 unison+0.038
 scalp+0.038
 End+0.037
kin+0.037
 rarely+0.037
Neg logit contributions
ls-0.021
 closer-0.020
 back-0.019
s-0.019
 to-0.017
Token and
Token ID290
Feature activation-2.08e-01

Pos logit contributions
kin+0.107
ena+0.105
 scalp+0.105
 unison+0.104
 End+0.102
Neg logit contributions
 closer-0.088
ls-0.087
 back-0.077
s-0.073
 groceries-0.070
Token food
Token ID2057
Feature activation-2.30e+00

Pos logit contributions
ena+0.005
 scalp+0.005
 unison+0.005
kin+0.005
 End+0.005
Neg logit contributions
ls-0.005
 closer-0.004
 back-0.004
s-0.004
 home-0.004
Token.
Token ID13
Feature activation-3.29e-01

Pos logit contributions
ena+0.063
kin+0.063
 scalp+0.063
 unison+0.062
 End+0.061
Neg logit contributions
 closer-0.043
ls-0.042
 back-0.037
s-0.035
 home-0.034
Token Ben
Token ID3932
Feature activation+8.80e-01

Pos logit contributions
kin+0.008
ric+0.007
 End+0.007
 scalp+0.007
ena+0.007
Neg logit contributions
 closer-0.008
ls-0.007
 back-0.007
 to-0.007
s-0.007
Token is
Token ID318
Feature activation-2.85e+00

Pos logit contributions
ls+0.017
 closer+0.016
 back+0.015
s+0.014
 to+0.014
Neg logit contributions
kin-0.026
 unison-0.024
 scalp-0.024
OM-0.023
 End-0.023
Token pain
Token ID2356
Feature activation-1.05e+00

Pos logit contributions
kin+0.015
ena+0.015
 unison+0.015
 scalp+0.014
 End+0.014
Neg logit contributions
 closer-0.007
 back-0.006
ls-0.006
 to-0.006
 off-0.006
Token.
Token ID13
Feature activation+4.42e-01

Pos logit contributions
ena+0.032
 unison+0.032
 scalp+0.032
 End+0.032
kin+0.031
Neg logit contributions
 closer-0.017
ls-0.016
 back-0.016
 to-0.014
s-0.014
Token He
Token ID679
Feature activation-7.73e-01

Pos logit contributions
 closer+0.008
ls+0.008
 back+0.008
s+0.007
 to+0.007
Neg logit contributions
kin-0.012
 scalp-0.012
ena-0.011
 unison-0.011
ities-0.011
Token spent
Token ID3377
Feature activation-2.45e+00

Pos logit contributions
kin+0.016
ena+0.016
 scalp+0.015
 unison+0.015
 End+0.015
Neg logit contributions
 closer-0.020
ls-0.020
 back-0.018
s-0.018
 off-0.017
Token the
Token ID262
Feature activation-1.17e+00

Pos logit contributions
 scalp+0.086
OM+0.084
ric+0.084
ider+0.082
kin+0.081
Neg logit contributions
 back-0.031
 away-0.029
 home-0.028
 closer-0.028
 off-0.026
Token night
Token ID1755
Feature activation-3.88e+00

Pos logit contributions
ena+0.036
 unison+0.036
 scalp+0.036
kin+0.035
ric+0.034
Neg logit contributions
 closer-0.019
ls-0.017
 back-0.017
 home-0.015
 groceries-0.014
Token in
Token ID287
Feature activation-1.98e+00

Pos logit contributions
ena+0.090
 unison+0.088
kin+0.086
 scalp+0.085
ities+0.081
Neg logit contributions
 closer-0.092
ls-0.090
 back-0.090
s-0.086
 to-0.085
Token hospital
Token ID4436
Feature activation-1.56e+00

Pos logit contributions
 scalp+0.074
kin+0.072
ena+0.071
OM+0.071
ities+0.071
Neg logit contributions
 home-0.023
 to-0.020
 closer-0.020
 back-0.020
 away-0.017
Token,
Token ID11
Feature activation-3.38e+00

Pos logit contributions
kin+0.047
 unison+0.046
 scalp+0.045
ena+0.044
ric+0.043
Neg logit contributions
 back-0.027
 to-0.027
 closer-0.027
 home-0.025
ls-0.024
Token and
Token ID290
Feature activation+2.75e-01

Pos logit contributions
kin+0.085
 unison+0.083
ena+0.081
 scalp+0.080
OM+0.078
Neg logit contributions
 closer-0.080
 back-0.079
 to-0.074
 home-0.072
 away-0.071
Token he
Token ID339
Feature activation-9.56e-01

Pos logit contributions
 closer+0.006
ls+0.005
 back+0.005
 to+0.005
 off+0.005
Neg logit contributions
 scalp-0.007
kin-0.007
 unison-0.007
ena-0.007
ric-0.007