TOP ACTIVATIONS
MAX = 2.628

of animals that lived long ago . They saw shiny rocks
inating the flowers in their me adow . But one day
silence Paul said , " Please ." M olly
Max and said , " Please give us our toys back
filled with treasures from long ago ! The adventurers
and he said , " Please help me eat them ."
tree . " Please don 't move too much
the perfect spot in a me adow . The staff acted
it to me for safe keeping . He told me to
. She said , " Please let Max heal and play
, but there were many mosquitoes that would bite him .
books , clothes from long ago , and special family keeps
counter and said , " Please mail this to my grandma
note that said , " Please send my toys to the
girl in a grass y me adow . She was playing
's house was built long ago . L ily
squirrel . " Please help me , Mister Squirrel
sentence on it . ' Please remove your coats '.
called out loudly , " Please come back , tooth !
brother and said , " Please give me back my er

MIN ACTIVATIONS
MIN = -3.959

too close . The ost ric hes might pe ck you
should not tease the ost ric hes . They are not
you count how many ost ric hes there are ?"
shouted . " The ost ric hes are so funny !"
but be careful . Ost ric hes can be gr umpy
mom and to the ost ric hes . They learned their
cried . " The ost ric hes are mean ! They
wanted to touch the ost ric hes . They reached out
They do not pack ost ric hes anymore . They pack
ostr ich pen . Ost ric hes are big birds with
ostr ich pen . Ost ric hes are big birds with
and run to the ost ric hes . "
But the ost ric hes did not like that
and looked at the ost ric hes . They saw one
. They were called ost ric hes . Lily and Ben
did not know what ost ric hes were . They looked
. One of the ost ric hes looked different from the
" O st ric hes do not like people
Yes , they are ost ric hes ," mom said .
ie . Unlike other ost ric hes , his feathers gl

INTERVAL 0.981 - 2.628
CONTAINS 2.204%

man 's plate . All the food went everywhere ! The
a mysterious box in the bushes . L
the proud cub at the university and went home . She
m op . “ Hello ! ”
the guitar over and it broke again . Tim my was

INTERVAL -0.666 - 0.981
CONTAINS 89.245%

had taken the invitation . <|endoftext|> Once there was a lady
put it on his wall . But then the
, " Where is my car ? I want to play
fl apped its wings and flew all around . One day
<|endoftext|> Jimmy wanted to climb to the top of the tree

INTERVAL -2.312 - -0.666
CONTAINS 8.547%

will make you happy . It is fun to catch a
the water . They run to the shore and see a
for lunch . " Mom , what are you
. She had a pet ham ster named Spark y .
. They liked to pretend they were chefs and make y

INTERVAL -3.959 - -2.312
CONTAINS 0.005%

'll put a band - aid on your hand and then
. They were called ost ric hes . Lily and Ben
asked her m ama and pap a if she could go
. He was a happy 3 year old who loved playing
if they both closed their eyes and said a magic spell
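
NOTE ON INTERVALS
The four interval boundaries listed above are consistent with splitting the range from MIN = -3.959 to MAX = 2.628 into four equal-width bins. This is inferred from the numbers shown here, not from documented behavior of the tool that produced this dump. The sketch below reproduces the edges under that assumption; the function name equal_width_bins is hypothetical.

```python
def equal_width_bins(lo, hi, n_bins=4):
    """Split [lo, hi] into n_bins equal-width intervals (assumed binning scheme)."""
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    edges[-1] = hi  # pin the last edge to hi to avoid floating-point drift
    return list(zip(edges[:-1], edges[1:]))


# MIN and MAX as reported in this dump
bins = equal_width_bins(-3.959, 2.628)

# Print from the highest bin down, matching the order used above
for left, right in reversed(bins):
    print(f"INTERVAL {left:.3f} - {right:.3f}")
```

Each bin is about 1.647 wide. The printed edges may differ from the listed ones in the last decimal place, since the dump's own rounding rule is not known.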
Token of
Token ID286
Feature activation+5.48e-01

Pos logit contributions
 ships+0.005
 shark+0.005
washer+0.005
 armor+0.005
 tanks+0.005
Neg logit contributions
uct-0.005
 Samantha-0.005
 Heidi-0.004
aman-0.004
 Lib-0.004
Token animals
Token ID4695
Feature activation+9.38e-01

Pos logit contributions
 saying+0.017
 Heidi+0.017
 Samantha+0.017
 Lib+0.016
uct+0.016
Neg logit contributions
 ships-0.018
 shark-0.017
washer-0.016
 armor-0.016
zo-0.016
Token that
Token ID326
Feature activation+2.02e-01

Pos logit contributions
uct+0.030
 Samantha+0.028
 Heidi+0.028
aman+0.027
 Lib+0.026
Neg logit contributions
 ships-0.034
 shark-0.031
 tanks-0.030
washer-0.030
 armor-0.029
Token lived
Token ID5615
Feature activation+9.46e-01

Pos logit contributions
uct+0.006
 Heidi+0.006
aman+0.006
 Samantha+0.006
 saying+0.006
Neg logit contributions
 ships-0.007
 shark-0.007
 armor-0.007
 tanks-0.006
washer-0.006
Token long
Token ID890
Feature activation+4.56e-01

Pos logit contributions
uct+0.025
 Heidi+0.024
 Samantha+0.024
 saying+0.024
 Lib+0.024
Neg logit contributions
 ships-0.036
 shark-0.034
 armor-0.033
 tanks-0.032
washer-0.032
Token ago
Token ID2084
Feature activation+2.63e+00

Pos logit contributions
uct+0.013
 Samantha+0.013
 Heidi+0.012
aman+0.012
 Lib+0.012
Neg logit contributions
 ships-0.018
 armor-0.016
 shark-0.016
 tanks-0.016
 armour-0.016
Token.
Token ID13
Feature activation-1.79e-01

Pos logit contributions
uct+0.078
 Heidi+0.070
aman+0.068
 Samantha+0.068
 Lib+0.065
Neg logit contributions
 ships-0.103
 armor-0.094
 tanks-0.093
 ship-0.093
bel-0.092
Token They
Token ID1119
Feature activation+1.95e-02

Pos logit contributions
 ships+0.006
 shark+0.006
 tanks+0.006
 armor+0.006
washer+0.006
Neg logit contributions
uct-0.005
 saying-0.005
 mistake-0.005
 Heidi-0.005
aman-0.005
Token saw
Token ID2497
Feature activation+6.80e-02

Pos logit contributions
 saying+0.000
 Lib+0.000
 Samantha+0.000
 right+0.000
 Heidi+0.000
Neg logit contributions
 shark-0.001
 ships-0.001
bel-0.001
 armor-0.001
zo-0.001
Token shiny
Token ID22441
Feature activation-3.61e-02

Pos logit contributions
 saying+0.002
 right+0.002
 Lib+0.002
 Samantha+0.002
 Heidi+0.002
Neg logit contributions
bel-0.002
zo-0.002
 ships-0.002
 shark-0.002
washer-0.002
Token rocks
Token ID12586
Feature activation+5.23e-01

Pos logit contributions
 ships+0.001
washer+0.001
bel+0.001
zo+0.001
 shark+0.001
Neg logit contributions
 saying-0.001
 right-0.001
 Lib-0.001
 mistake-0.001
 Samantha-0.001
Tokeninating
Token ID6010
Feature activation-1.65e-01

Pos logit contributions
uct+0.015
 Heidi+0.015
 Samantha+0.015
 saying+0.014
 Lib+0.014
Neg logit contributions
 ships-0.017
 shark-0.016
 armor-0.015
washer-0.015
 tanks-0.015
Token the
Token ID262
Feature activation-1.14e-01

Pos logit contributions
 ships+0.006
 shark+0.005
 tanks+0.005
 armor+0.005
bel+0.005
Neg logit contributions
uct-0.005
 Heidi-0.005
 saying-0.005
 Samantha-0.005
 Lib-0.005
Token flowers
Token ID12734
Feature activation+5.83e-01

Pos logit contributions
 ships+0.003
 shark+0.003
 coral+0.003
 armor+0.003
bel+0.002
Neg logit contributions
uct-0.005
 Samantha-0.004
 Lib-0.004
 Heidi-0.004
aman-0.004
Token in
Token ID287
Feature activation+3.08e-01

Pos logit contributions
uct+0.013
 Heidi+0.012
 Samantha+0.012
aman+0.012
 Lib+0.011
Neg logit contributions
 ships-0.026
 shark-0.024
bel-0.024
washer-0.024
 tanks-0.023
Token their
Token ID511
Feature activation+6.15e-01

Pos logit contributions
uct+0.011
 Heidi+0.010
aman+0.010
 Lib+0.010
 Samantha+0.010
Neg logit contributions
 ships-0.010
 shark-0.009
 coral-0.009
 tanks-0.009
 armor-0.008
Token me
Token ID502
Feature activation+2.59e+00

Pos logit contributions
uct+0.020
 Heidi+0.020
 saying+0.020
 Samantha+0.020
 Lib+0.019
Neg logit contributions
 ships-0.020
 shark-0.018
 armor-0.018
bel-0.017
 tanks-0.017
Tokenadow
Token ID4584
Feature activation+1.39e+00

Pos logit contributions
uct+0.108
 Heidi+0.108
 Lib+0.107
aman+0.106
 Samantha+0.104
Neg logit contributions
 ships-0.054
bel-0.050
zo-0.050
washer-0.049
 armor-0.048
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
uct+0.027
 Heidi+0.026
 Samantha+0.026
aman+0.025
 Lib+0.024
Neg logit contributions
 ships-0.065
bel-0.061
washer-0.060
 shark-0.060
 armor-0.059
Token But
Token ID887
Feature activation+3.37e-01

Pos logit contributions
 ships+0.006
 tanks+0.006
 shark+0.006
zo+0.005
 Lions+0.005
Neg logit contributions
 mistake-0.006
uct-0.006
 saying-0.005
 Heidi-0.005
aman-0.005
Token one
Token ID530
Feature activation-5.58e-01

Pos logit contributions
uct+0.010
 Heidi+0.009
 Samantha+0.009
 Lib+0.009
 saying+0.009
Neg logit contributions
 ships-0.013
 shark-0.012
 tanks-0.011
 armor-0.011
 ship-0.011
Token day
Token ID1110
Feature activation-5.96e-01

Pos logit contributions
 ships+0.026
 armor+0.024
bel+0.024
 tanks+0.024
washer+0.023
Neg logit contributions
 Heidi-0.010
 saying-0.010
 right-0.010
 Samantha-0.010
uct-0.010
Token silence
Token ID9550
Feature activation-3.81e-02

Pos logit contributions
uct+0.009
 Heidi+0.009
 Samantha+0.009
 Lib+0.009
 saying+0.009
Neg logit contributions
 ships-0.012
 shark-0.011
 armor-0.011
 tanks-0.011
bel-0.011
Token Paul
Token ID3362
Feature activation-2.30e-01

Pos logit contributions
 ships+0.002
 shark+0.001
washer+0.001
 tanks+0.001
bel+0.001
Neg logit contributions
uct-0.001
 Heidi-0.001
 Samantha-0.001
aman-0.001
 Lib-0.001
Token said
Token ID531
Feature activation-2.08e-01

Pos logit contributions
 ships+0.008
zo+0.007
washer+0.007
 tanks+0.007
 shark+0.007
Neg logit contributions
 Heidi-0.007
uct-0.007
 Lib-0.007
 Samantha-0.007
aman-0.007
Token,
Token ID11
Feature activation+1.38e-01

Pos logit contributions
 ships+0.007
 shark+0.006
washer+0.006
 armor+0.006
bel+0.006
Neg logit contributions
uct-0.007
aman-0.007
 Heidi-0.007
 Samantha-0.007
 Lib-0.006
Token "
Token ID366
Feature activation+5.23e-01

Pos logit contributions
uct+0.005
 mistake+0.005
aman+0.005
 Heidi+0.004
 policy+0.004
Neg logit contributions
 ships-0.004
 shark-0.004
 armor-0.004
washer-0.004
 tanks-0.003
TokenPlease
Token ID5492
Feature activation+2.54e+00

Pos logit contributions
 mistake+0.018
 Heidi+0.015
 OK+0.015
 okay+0.015
 saying+0.015
Neg logit contributions
zo-0.016
bel-0.016
 ships-0.015
Circ-0.013
 armour-0.013
Token."
Token ID526
Feature activation+5.91e-01

Pos logit contributions
uct+0.071
 Heidi+0.070
 Lib+0.069
 Samantha+0.068
 saying+0.068
Neg logit contributions
 ships-0.093
 shark-0.088
washer-0.085
 armor-0.084
 tanks-0.084
Token
Token ID198
Feature activation+3.21e-01

Pos logit contributions
uct+0.018
 Heidi+0.017
 mistake+0.017
 Samantha+0.016
 saying+0.016
Neg logit contributions
 ships-0.021
 shark-0.018
 armor-0.018
 tanks-0.018
bel-0.018
Token
Token ID198
Feature activation+6.89e-01

Pos logit contributions
uct+0.009
 mistake+0.008
 Heidi+0.008
 saying+0.008
 Samantha+0.008
Neg logit contributions
 ships-0.012
zo-0.011
 shark-0.011
washer-0.011
bel-0.011
TokenM
Token ID44
Feature activation+5.58e-01

Pos logit contributions
uct+0.021
 Heidi+0.021
 saying+0.020
 Samantha+0.020
 Lib+0.020
Neg logit contributions
 ships-0.023
 shark-0.021
zo-0.021
bel-0.021
washer-0.021
Tokenolly
Token ID5098
Feature activation+2.36e-01

Pos logit contributions
 Heidi+0.019
uct+0.019
 Samantha+0.019
 Lib+0.018
 saying+0.018
Neg logit contributions
 ships-0.018
washer-0.016
zo-0.016
 shark-0.016
 armor-0.016
Token Max
Token ID5436
Feature activation+3.84e-01

Pos logit contributions
 ships+0.005
 shark+0.005
 armor+0.005
 tanks+0.005
washer+0.005
Neg logit contributions
uct-0.004
 Heidi-0.004
 Samantha-0.004
 saying-0.004
 Lib-0.004
Token and
Token ID290
Feature activation-4.83e-02

Pos logit contributions
 Samantha+0.012
uct+0.012
 Heidi+0.011
 Lib+0.011
aman+0.011
Neg logit contributions
 ships-0.013
 tanks-0.012
 shark-0.012
washer-0.012
 ship-0.012
Token said
Token ID531
Feature activation-3.01e-01

Pos logit contributions
 ships+0.002
 shark+0.002
 armor+0.002
washer+0.002
bel+0.002
Neg logit contributions
uct-0.001
 Heidi-0.001
 Samantha-0.001
 saying-0.001
 Lib-0.001
Token,
Token ID11
Feature activation-2.16e-01

Pos logit contributions
 ships+0.009
 shark+0.009
 armor+0.008
washer+0.008
bel+0.008
Neg logit contributions
uct-0.010
aman-0.010
 Samantha-0.010
 Heidi-0.010
 saying-0.010
Token "
Token ID366
Feature activation+7.16e-01

Pos logit contributions
 ships+0.006
 shark+0.005
washer+0.005
bel+0.005
 tanks+0.005
Neg logit contributions
uct-0.008
aman-0.008
 mistake-0.008
 Samantha-0.008
 Heidi-0.008
TokenPlease
Token ID5492
Feature activation+2.53e+00

Pos logit contributions
 mistake+0.023
 Lib+0.023
uct+0.022
 Samantha+0.022
 saying+0.022
Neg logit contributions
 ships-0.021
zo-0.021
bel-0.021
 armor-0.021
 shark-0.020
Token give
Token ID1577
Feature activation+1.33e+00

Pos logit contributions
uct+0.065
 Heidi+0.065
 Samantha+0.064
 saying+0.064
 Lib+0.063
Neg logit contributions
 ships-0.098
 shark-0.093
 armor-0.090
 tanks-0.088
washer-0.088
Token us
Token ID514
Feature activation+5.23e-01

Pos logit contributions
 Heidi+0.029
 saying+0.028
 Samantha+0.028
uct+0.028
 right+0.028
Neg logit contributions
 ships-0.057
bel-0.055
 shark-0.054
 armor-0.053
zo-0.052
Token our
Token ID674
Feature activation+3.40e-01

Pos logit contributions
 right+0.012
 OK+0.011
 Samantha+0.010
 herself+0.010
 okay+0.010
Neg logit contributions
 ships-0.022
 shark-0.021
bel-0.021
zo-0.021
 armor-0.021
Token toys
Token ID14958
Feature activation+5.23e-01

Pos logit contributions
uct+0.008
 Heidi+0.008
 saying+0.008
 Samantha+0.008
 Lib+0.008
Neg logit contributions
 ships-0.013
 shark-0.013
 armor-0.013
zo-0.012
washer-0.012
Token back
Token ID736
Feature activation+7.48e-01

Pos logit contributions
uct+0.014
 Heidi+0.014
 Samantha+0.013
 saying+0.013
 Lib+0.013
Neg logit contributions
 ships-0.020
 shark-0.019
 armor-0.018
 tanks-0.018
washer-0.018
Token filled
Token ID5901
Feature activation-5.34e-01

Pos logit contributions
uct+0.009
 Heidi+0.008
 OK+0.008
 Samantha+0.008
aman+0.008
Neg logit contributions
 ships-0.008
washer-0.008
 shark-0.007
bel-0.007
 ship-0.007
Token with
Token ID351
Feature activation+4.52e-01

Pos logit contributions
 ships+0.020
 shark+0.019
 armor+0.018
bel+0.018
 tanks+0.018
Neg logit contributions
uct-0.014
 Heidi-0.014
 Samantha-0.014
 saying-0.014
 OK-0.014
Token treasures
Token ID29561
Feature activation+1.14e+00

Pos logit contributions
uct+0.013
 Heidi+0.013
 saying+0.013
 Samantha+0.013
 Lib+0.012
Neg logit contributions
 ships-0.016
 shark-0.015
 armor-0.015
zo-0.014
washer-0.014
Token from
Token ID422
Feature activation+2.44e-01

Pos logit contributions
uct+0.032
 Heidi+0.031
 Samantha+0.031
aman+0.030
 OK+0.028
Neg logit contributions
 ships-0.047
 ship-0.042
 shark-0.042
washer-0.041
 armor-0.041
Token long
Token ID890
Feature activation+4.00e-01

Pos logit contributions
uct+0.008
 Heidi+0.008
aman+0.007
 OK+0.007
 Samantha+0.007
Neg logit contributions
 ships-0.008
 shark-0.008
 armor-0.008
 tanks-0.007
 armour-0.007
Token ago
Token ID2084
Feature activation+2.53e+00

Pos logit contributions
uct+0.013
 Samantha+0.012
 Heidi+0.012
 Lib+0.012
aman+0.012
Neg logit contributions
 ships-0.014
 armor-0.013
 shark-0.013
 tanks-0.013
 armour-0.013
Token!
Token ID0
Feature activation+4.23e-01

Pos logit contributions
uct+0.072
 Heidi+0.065
 Samantha+0.062
aman+0.060
 Lib+0.060
Neg logit contributions
 ships-0.101
 armor-0.093
 ship-0.092
 tanks-0.090
washer-0.090
Token
Token ID198
Feature activation+7.84e-02

Pos logit contributions
uct+0.013
 Heidi+0.013
 Samantha+0.013
 saying+0.013
 Lib+0.013
Neg logit contributions
 ships-0.014
 shark-0.013
 armor-0.013
bel-0.013
washer-0.013
Token
Token ID198
Feature activation+2.57e-01

Pos logit contributions
uct+0.002
 Heidi+0.001
 saying+0.001
 Samantha+0.001
 Lib+0.001
Neg logit contributions
 ships-0.003
 shark-0.003
 armor-0.003
bel-0.003
washer-0.003
TokenThe
Token ID464
Feature activation+4.14e-01

Pos logit contributions
 Heidi+0.006
uct+0.006
 Samantha+0.005
 Lib+0.005
 saying+0.005
Neg logit contributions
 ships-0.011
 shark-0.011
 armor-0.010
 tanks-0.010
 armour-0.010
Token adventurers
Token ID36843
Feature activation+3.27e-01

Pos logit contributions
 right+0.012
 Heidi+0.011
 saying+0.010
 okay+0.010
 OK+0.010
Neg logit contributions
bel-0.014
 ships-0.014
zo-0.014
os-0.014
washer-0.013
Token and
Token ID290
Feature activation+8.09e-01

Pos logit contributions
 Heidi+0.019
 Samantha+0.019
uct+0.019
 Lib+0.018
aman+0.017
Neg logit contributions
 ships-0.040
washer-0.038
 shark-0.037
 tanks-0.037
 ship-0.037
Token he
Token ID339
Feature activation+6.14e-01

Pos logit contributions
uct+0.023
 Heidi+0.023
 Samantha+0.022
 saying+0.022
 Lib+0.022
Neg logit contributions
 ships-0.029
 shark-0.027
 armor-0.027
 tanks-0.026
washer-0.026
Token said
Token ID531
Feature activation-8.08e-02

Pos logit contributions
 Heidi+0.018
uct+0.017
 Samantha+0.017
 saying+0.017
 Lib+0.017
Neg logit contributions
 ships-0.022
 shark-0.021
 armor-0.020
washer-0.020
 tanks-0.020
Token,
Token ID11
Feature activation+2.56e-01

Pos logit contributions
 ships+0.003
 shark+0.003
 armor+0.002
washer+0.002
 tanks+0.002
Neg logit contributions
uct-0.003
aman-0.003
 Heidi-0.002
 Samantha-0.002
 saying-0.002
Token "
Token ID366
Feature activation+4.36e-01

Pos logit contributions
aman+0.009
 Heidi+0.009
uct+0.009
 Samantha+0.009
 Lib+0.008
Neg logit contributions
 ships-0.007
 shark-0.007
washer-0.006
 tanks-0.006
 armor-0.006
TokenPlease
Token ID5492
Feature activation+2.52e+00

Pos logit contributions
 saying+0.014
 mistake+0.014
 Heidi+0.014
 Lib+0.014
 Samantha+0.013
Neg logit contributions
bel-0.013
zo-0.012
 shark-0.011
 ships-0.011
washer-0.011
Token help
Token ID1037
Feature activation-1.66e-01

Pos logit contributions
uct+0.071
 Heidi+0.070
 Lib+0.070
 Samantha+0.069
 saying+0.068
Neg logit contributions
 ships-0.088
 shark-0.087
washer-0.082
 tanks-0.081
 armor-0.080
Token me
Token ID502
Feature activation+7.36e-01

Pos logit contributions
 ships+0.006
 shark+0.006
washer+0.006
 armor+0.006
 tanks+0.006
Neg logit contributions
uct-0.004
 Heidi-0.004
 Lib-0.004
 saying-0.004
 Samantha-0.004
Token eat
Token ID4483
Feature activation-1.68e-01

Pos logit contributions
uct+0.021
 Heidi+0.021
 Samantha+0.020
 Lib+0.020
 saying+0.020
Neg logit contributions
 ships-0.026
 shark-0.025
 armor-0.024
washer-0.024
bel-0.024
Token them
Token ID606
Feature activation-2.57e-02

Pos logit contributions
 ships+0.006
 shark+0.005
 tanks+0.005
 armor+0.005
 coral+0.005
Neg logit contributions
 Heidi-0.005
uct-0.005
 Samantha-0.005
 Lib-0.005
 saying-0.005
Token."
Token ID526
Feature activation+1.34e-01

Pos logit contributions
 ships+0.001
 shark+0.001
washer+0.001
 tanks+0.001
bel+0.001
Neg logit contributions
 Heidi-0.001
 Samantha-0.001
 saying-0.001
uct-0.001
 Lib-0.001
Token tree
Token ID5509
Feature activation+7.93e-01

Pos logit contributions
 Heidi+0.003
 Samantha+0.003
 saying+0.003
 right+0.002
 Lib+0.002
Neg logit contributions
 ships-0.004
bel-0.004
zo-0.004
 shark-0.004
washer-0.004
Token.
Token ID13
Feature activation+6.70e-01

Pos logit contributions
 Heidi+0.020
uct+0.020
 Samantha+0.020
 Lib+0.019
 Stella+0.017
Neg logit contributions
 ships-0.031
bel-0.030
 ship-0.030
washer-0.029
 armor-0.029
Token
Token ID220
Feature activation+5.79e-02

Pos logit contributions
 mistake+0.027
 saying+0.023
uct+0.023
 excuse+0.022
aman+0.022
Neg logit contributions
 ships-0.016
 tanks-0.016
bel-0.015
 tank-0.015
 While-0.014
Token
Token ID198
Feature activation+1.75e-01

Pos logit contributions
 mistake+0.002
 saying+0.002
 fuss+0.002
 policy+0.002
aman+0.002
Neg logit contributions
bel-0.001
os-0.001
washer-0.001
 ships-0.001
tub-0.001
Token"
Token ID1
Feature activation+6.20e-02

Pos logit contributions
 mistake+0.005
 saying+0.005
uct+0.005
aman+0.005
 Heidi+0.005
Neg logit contributions
bel-0.006
 ships-0.005
washer-0.005
 shark-0.005
 tanks-0.005
TokenPlease
Token ID5492
Feature activation+2.52e+00

Pos logit contributions
 mistake+0.002
 saying+0.002
 policy+0.002
 either+0.002
 Heidi+0.002
Neg logit contributions
bel-0.002
zo-0.002
Circ-0.002
 ships-0.002
 shark-0.002
Token don
Token ID836
Feature activation+4.97e-01

Pos logit contributions
uct+0.076
 Heidi+0.074
 saying+0.073
 Lib+0.073
 Samantha+0.071
Neg logit contributions
 ships-0.084
 shark-0.082
 armor-0.077
washer-0.077
 ship-0.076
Token't
Token ID470
Feature activation-3.10e-02

Pos logit contributions
 Heidi+0.015
 Samantha+0.015
uct+0.015
 saying+0.015
 Lib+0.014
Neg logit contributions
 ships-0.016
 shark-0.016
washer-0.015
 armor-0.015
 tanks-0.015
Token move
Token ID1445
Feature activation-5.96e-01

Pos logit contributions
 shark+0.001
 ships+0.001
zo+0.001
 ship+0.001
washer+0.001
Neg logit contributions
uct-0.001
 Heidi-0.001
 Lib-0.001
aman-0.001
 Samantha-0.001
Token too
Token ID1165
Feature activation-1.58e+00

Pos logit contributions
 ships+0.023
 shark+0.022
 armor+0.021
 tanks+0.021
bel+0.021
Neg logit contributions
uct-0.016
 Heidi-0.015
aman-0.014
 saying-0.014
 Samantha-0.014
Token much
Token ID881
Feature activation-5.91e-01

Pos logit contributions
 ships+0.048
 shark+0.047
 armor+0.044
zo+0.042
bel+0.042
Neg logit contributions
uct-0.053
 saying-0.052
 Heidi-0.049
 Samantha-0.049
aman-0.049
Token the
Token ID262
Feature activation+3.93e-01

Pos logit contributions
uct+0.008
 Heidi+0.008
 saying+0.008
 Samantha+0.008
 Lib+0.008
Neg logit contributions
 ships-0.011
 shark-0.011
 armor-0.011
 tanks-0.010
washer-0.010
Token perfect
Token ID2818
Feature activation+2.93e-01

Pos logit contributions
uct+0.014
 Heidi+0.013
 Samantha+0.013
 Lib+0.012
aman+0.012
Neg logit contributions
 ships-0.013
 shark-0.012
 armor-0.012
 tanks-0.012
 ship-0.012
Token spot
Token ID4136
Feature activation+9.05e-01

Pos logit contributions
uct+0.011
 saying+0.010
 Heidi+0.010
 Samantha+0.010
 Lib+0.010
Neg logit contributions
 ships-0.009
 shark-0.009
 armor-0.008
 tanks-0.008
 ship-0.008
Token in
Token ID287
Feature activation+5.71e-01

Pos logit contributions
uct+0.032
 Heidi+0.030
aman+0.030
 Samantha+0.029
 Lib+0.028
Neg logit contributions
 ships-0.029
washer-0.027
 shark-0.027
bel-0.026
 tanks-0.026
Token a
Token ID257
Feature activation+1.54e-01

Pos logit contributions
uct+0.019
 Heidi+0.018
 saying+0.017
aman+0.017
 Lib+0.017
Neg logit contributions
 ships-0.019
 shark-0.018
 tanks-0.017
 armor-0.017
 ship-0.016
Token me
Token ID502
Feature activation+2.51e+00

Pos logit contributions
 Heidi+0.005
 saying+0.005
 Samantha+0.005
uct+0.005
 Lib+0.004
Neg logit contributions
 ships-0.005
 shark-0.005
 armor-0.005
zo-0.005
bel-0.005
Tokenadow
Token ID4584
Feature activation+1.02e+00

Pos logit contributions
uct+0.094
 Heidi+0.093
 Lib+0.091
 Samantha+0.091
 saying+0.089
Neg logit contributions
 ships-0.068
washer-0.064
 armor-0.062
bel-0.061
zo-0.061
Token.
Token ID13
Feature activation-7.06e-02

Pos logit contributions
uct+0.024
 Heidi+0.023
 Samantha+0.022
aman+0.022
 Lib+0.020
Neg logit contributions
 ships-0.044
washer-0.043
bel-0.041
 shark-0.041
 ship-0.041
Token The
Token ID383
Feature activation+5.44e-03

Pos logit contributions
 ships+0.002
 shark+0.002
 tanks+0.002
 armor+0.002
washer+0.002
Neg logit contributions
uct-0.002
 saying-0.002
 Heidi-0.002
 Samantha-0.002
 mistake-0.002
Token staff
Token ID3085
Feature activation-1.43e-01

Pos logit contributions
uct+0.000
 Heidi+0.000
 Samantha+0.000
 Lib+0.000
 saying+0.000
Neg logit contributions
 ships-0.000
 shark-0.000
 armor-0.000
 tanks-0.000
washer-0.000
Token acted
Token ID13134
Feature activation+1.11e-01

Pos logit contributions
 ships+0.005
 shark+0.005
 armor+0.005
 tanks+0.005
zo+0.005
Neg logit contributions
uct-0.004
 Heidi-0.004
 Samantha-0.004
 saying-0.004
 Lib-0.004
Token it
Token ID340
Feature activation+2.71e-01

Pos logit contributions
uct+0.015
 Heidi+0.014
 saying+0.014
aman+0.014
 Samantha+0.014
Neg logit contributions
 ships-0.021
 shark-0.020
 armor-0.020
 tanks-0.019
 armour-0.019
Token to
Token ID284
Feature activation+2.84e-01

Pos logit contributions
 right+0.008
 herself+0.007
 saying+0.007
 OK+0.007
 Heidi+0.007
Neg logit contributions
 ships-0.010
 shark-0.009
zo-0.009
washer-0.009
bel-0.009
Token me
Token ID502
Feature activation+6.06e-01

Pos logit contributions
uct+0.009
 Heidi+0.009
aman+0.009
 Lib+0.009
 Samantha+0.009
Neg logit contributions
 ships-0.009
 shark-0.009
 armor-0.009
 ship-0.008
 tanks-0.008
Token for
Token ID329
Feature activation+1.81e-01

Pos logit contributions
uct+0.014
 Heidi+0.014
 Samantha+0.014
 Lib+0.013
aman+0.013
Neg logit contributions
 ships-0.026
 armor-0.024
 shark-0.024
washer-0.024
 tanks-0.023
Token safe
Token ID3338
Feature activation+7.44e-01

Pos logit contributions
aman+0.006
uct+0.006
 Heidi+0.006
 Lib+0.006
 Samantha+0.006
Neg logit contributions
 ships-0.006
 armor-0.006
 shark-0.006
 tanks-0.006
 ship-0.005
Tokenkeeping
Token ID19934
Feature activation+2.50e+00

Pos logit contributions
uct+0.019
 Heidi+0.018
 Samantha+0.018
aman+0.018
 Lib+0.017
Neg logit contributions
 ships-0.031
 shark-0.029
 armor-0.029
 tanks-0.028
bel-0.027
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
uct+0.078
 Heidi+0.073
aman+0.072
 Samantha+0.070
Looks+0.066
Neg logit contributions
 ships-0.083
washer-0.082
bel-0.081
 ship-0.080
 armor-0.077
Token He
Token ID679
Feature activation+2.01e-01

Pos logit contributions
 ships+0.003
 shark+0.003
 armor+0.003
 tanks+0.003
washer+0.003
Neg logit contributions
uct-0.003
 Samantha-0.003
 Heidi-0.003
 saying-0.003
aman-0.003
Token told
Token ID1297
Feature activation-2.11e-01

Pos logit contributions
uct+0.004
 Heidi+0.004
 Samantha+0.004
 Lib+0.004
 saying+0.004
Neg logit contributions
 ships-0.009
 shark-0.009
 armor-0.009
washer-0.009
 tanks-0.008
Token me
Token ID502
Feature activation+1.11e+00

Pos logit contributions
 ships+0.007
 shark+0.007
 armor+0.007
 tanks+0.006
 ship+0.006
Neg logit contributions
uct-0.007
 Heidi-0.006
aman-0.006
 Samantha-0.006
 Lib-0.006
Token to
Token ID284
Feature activation+5.38e-01

Pos logit contributions
 saying+0.028
 right+0.027
 Samantha+0.027
 Heidi+0.027
 OK+0.027
Neg logit contributions
 ships-0.042
 shark-0.040
 armor-0.039
zo-0.039
bel-0.038
Token.
Token ID13
Feature activation+4.72e-01

Pos logit contributions
uct+0.001
 Samantha+0.001
aman+0.001
 Heidi+0.001
 Lib+0.001
Neg logit contributions
 ships-0.001
zo-0.001
 shark-0.001
 armor-0.001
washer-0.001
Token She
Token ID1375
Feature activation+2.01e-01

Pos logit contributions
uct+0.011
 Samantha+0.011
 Heidi+0.011
 Lib+0.011
 saying+0.011
Neg logit contributions
 ships-0.019
 armor-0.018
 shark-0.018
 tanks-0.017
zo-0.017
Token said
Token ID531
Feature activation-4.13e-01

Pos logit contributions
uct+0.005
 saying+0.005
 right+0.005
 Samantha+0.005
 Heidi+0.005
Neg logit contributions
 ships-0.008
 shark-0.008
 armor-0.007
bel-0.007
 tanks-0.007
Token,
Token ID11
Feature activation-3.32e-01

Pos logit contributions
 ships+0.016
 shark+0.014
 tanks+0.014
zo+0.014
 armor+0.014
Neg logit contributions
 Heidi-0.011
 saying-0.011
 right-0.011
 Samantha-0.011
 OK-0.011
Token "
Token ID366
Feature activation+6.74e-01

Pos logit contributions
 ships+0.010
 armor+0.009
 shark+0.008
bel+0.008
 armour+0.008
Neg logit contributions
aman-0.012
 mistake-0.012
 Samantha-0.012
uct-0.012
 Heidi-0.011
TokenPlease
Token ID5492
Feature activation+2.49e+00

Pos logit contributions
 mistake+0.022
 Lib+0.021
 saying+0.021
uct+0.021
 Samantha+0.021
Neg logit contributions
zo-0.020
bel-0.020
 armor-0.020
 armour-0.019
 ships-0.018
Token let
Token ID1309
Feature activation+5.74e-01

Pos logit contributions
uct+0.066
 Heidi+0.065
 Samantha+0.065
 saying+0.065
 Lib+0.064
Neg logit contributions
 ships-0.094
 shark-0.089
 armor-0.087
washer-0.084
 tanks-0.084
Token Max
Token ID5436
Feature activation-1.89e-01

Pos logit contributions
 Heidi+0.013
uct+0.013
 Samantha+0.012
 saying+0.012
 right+0.012
Neg logit contributions
 ships-0.025
 shark-0.023
 armor-0.023
bel-0.023
 tanks-0.022
Token heal
Token ID12035
Feature activation-1.15e+00

Pos logit contributions
 ships+0.006
 shark+0.006
 armor+0.006
 tanks+0.006
washer+0.006
Neg logit contributions
 saying-0.006
 Heidi-0.006
 Samantha-0.006
uct-0.006
 Lib-0.006
Token and
Token ID290
Feature activation-7.71e-02

Pos logit contributions
 ships+0.044
 shark+0.042
 armor+0.041
zo+0.040
 tanks+0.039
Neg logit contributions
uct-0.031
 Samantha-0.030
 Heidi-0.030
 Lib-0.030
 saying-0.029
Token play
Token ID711
Feature activation-2.03e-01

Pos logit contributions
 ships+0.003
 shark+0.003
 armor+0.003
bel+0.002
washer+0.002
Neg logit contributions
uct-0.002
 Heidi-0.002
 Samantha-0.002
 saying-0.002
 Lib-0.002
Token,
Token ID11
Feature activation-7.76e-02

Pos logit contributions
 ships+0.017
 armor+0.016
 shark+0.016
bel+0.016
 armour+0.016
Neg logit contributions
 right-0.003
 saying-0.003
 herself-0.002
 Lib-0.002
 Heidi-0.002
Token but
Token ID475
Feature activation+6.04e-01

Pos logit contributions
zo+0.004
 ships+0.004
bel+0.004
 armor+0.004
washer+0.004
Neg logit contributions
 right-0.001
 saying-0.001
 Heidi-0.001
 Samantha-0.001
uct-0.001
Token there
Token ID612
Feature activation-7.31e-02

Pos logit contributions
uct+0.019
 Heidi+0.019
 Samantha+0.019
 saying+0.019
 Lib+0.019
Neg logit contributions
 ships-0.020
 shark-0.019
 armor-0.018
 tanks-0.017
washer-0.017
Token were
Token ID547
Feature activation-3.89e-01

Pos logit contributions
 ships+0.003
 shark+0.003
 armor+0.003
bel+0.003
zo+0.003
Neg logit contributions
uct-0.002
 Heidi-0.002
 saying-0.002
 Samantha-0.002
 Lib-0.002
Token many
Token ID867
Feature activation+5.75e-02

Pos logit contributions
bel+0.017
zo+0.017
 ships+0.017
 armor+0.017
 shark+0.016
Neg logit contributions
 right-0.009
 okay-0.007
 herself-0.007
 OK-0.007
 saying-0.007
Token mosquitoes
Token ID35573
Feature activation+2.48e+00

Pos logit contributions
uct+0.001
 saying+0.001
 Heidi+0.001
 Samantha+0.001
 right+0.001
Neg logit contributions
 ships-0.002
 shark-0.002
 armor-0.002
bel-0.002
zo-0.002
Token that
Token ID326
Feature activation-5.84e-02

Pos logit contributions
 saying+0.054
uct+0.053
 Heidi+0.052
 Samantha+0.051
 right+0.051
Neg logit contributions
 ships-0.105
 shark-0.099
 armor-0.098
washer-0.095
bel-0.095
Token would
Token ID561
Feature activation-5.35e-01

Pos logit contributions
 ships+0.002
 shark+0.002
bel+0.002
 tanks+0.002
zo+0.002
Neg logit contributions
uct-0.002
 Heidi-0.002
 Lib-0.002
aman-0.002
 Samantha-0.002
Token bite
Token ID13197
Feature activation+5.32e-02

Pos logit contributions
 ships+0.019
 shark+0.018
 armor+0.017
bel+0.017
 tanks+0.017
Neg logit contributions
uct-0.015
 Heidi-0.015
 Samantha-0.015
 Lib-0.015
 saying-0.015
Token him
Token ID683
Feature activation+5.07e-01

Pos logit contributions
uct+0.001
 Heidi+0.001
 Samantha+0.001
 saying+0.001
 Lib+0.001
Neg logit contributions
 ships-0.002
 shark-0.002
 armor-0.002
 tanks-0.002
zo-0.002
Token.
Token ID13
Feature activation-4.25e-01

Pos logit contributions
 Samantha+0.010
uct+0.010
 Heidi+0.010
 Lib+0.009
aman+0.009
Neg logit contributions
 ships-0.023
 shark-0.022
 armor-0.022
 tanks-0.021
zo-0.021
Token books
Token ID3835
Feature activation+1.64e-01

Pos logit contributions
 saying+0.016
 Samantha+0.016
 right+0.016
 Heidi+0.016
 Lib+0.015
Neg logit contributions
 shark-0.019
 ships-0.019
zo-0.018
 armor-0.018
washer-0.017
Token,
Token ID11
Feature activation+8.42e-01

Pos logit contributions
uct+0.004
 Heidi+0.004
 Samantha+0.004
aman+0.004
 mistake+0.004
Neg logit contributions
 ships-0.007
washer-0.006
 ship-0.006
bel-0.006
 shark-0.006
Token clothes
Token ID8242
Feature activation+1.07e+00

Pos logit contributions
uct+0.023
 Heidi+0.022
 saying+0.022
 Samantha+0.022
 Lib+0.022
Neg logit contributions
 ships-0.031
 shark-0.030
 armor-0.029
washer-0.028
zo-0.028
Token from
Token ID422
Feature activation-3.62e-03

Pos logit contributions
uct+0.033
 Heidi+0.032
 Samantha+0.030
aman+0.029
 Lib+0.029
Neg logit contributions
 ships-0.040
washer-0.036
 tanks-0.035
 ship-0.035
bel-0.035
Token long
Token ID890
Feature activation+2.14e-01

Pos logit contributions
 ships+0.000
 shark+0.000
 armor+0.000
 tanks+0.000
 armour+0.000
Neg logit contributions
uct-0.000
 Heidi-0.000
 Samantha-0.000
 Lib-0.000
 saying-0.000
Token ago
Token ID2084
Feature activation+2.48e+00

Pos logit contributions
uct+0.007
 Heidi+0.006
 Samantha+0.006
 Lib+0.006
 saying+0.006
Neg logit contributions
 ships-0.008
 armor-0.007
 tanks-0.007
 shark-0.007
 armour-0.007
Token,
Token ID11
Feature activation+8.82e-01

Pos logit contributions
uct+0.069
 Heidi+0.063
 Samantha+0.059
aman+0.059
 Lib+0.058
Neg logit contributions
 ships-0.100
 armor-0.091
bel-0.090
 ship-0.090
 tanks-0.090
Token and
Token ID290
Feature activation+5.51e-01

Pos logit contributions
uct+0.027
 Heidi+0.026
 Samantha+0.026
 Lib+0.025
 saying+0.025
Neg logit contributions
 ships-0.031
 shark-0.029
 armor-0.029
 tanks-0.028
bel-0.027
Token special
Token ID2041
Feature activation-1.60e-01

Pos logit contributions
uct+0.017
 Heidi+0.016
 Samantha+0.016
 saying+0.016
 Lib+0.016
Neg logit contributions
 ships-0.019
 shark-0.018
 armor-0.017
 tanks-0.017
bel-0.017
Token family
Token ID1641
Feature activation+3.11e-01

Pos logit contributions
 ships+0.006
 armor+0.005
 shark+0.005
 tanks+0.005
bel+0.005
Neg logit contributions
uct-0.005
 Heidi-0.005
 Samantha-0.005
 Lib-0.005
aman-0.005
Token keeps
Token ID7622
Feature activation+1.18e+00

Pos logit contributions
uct+0.005
 Heidi+0.005
 Samantha+0.005
aman+0.005
 Lib+0.004
Neg logit contributions
 ships-0.016
 shark-0.015
washer-0.014
 tanks-0.014
 armor-0.014
Token counter
Token ID3753
Feature activation-2.33e-01

Pos logit contributions
 ships+0.006
washer+0.006
 ship+0.006
 shark+0.005
 armor+0.005
Neg logit contributions
uct-0.008
 Heidi-0.007
 OK-0.007
 Samantha-0.007
shadow-0.007
Token and
Token ID290
Feature activation+3.31e-02

Pos logit contributions
 ships+0.009
washer+0.008
bel+0.008
 armor+0.008
 ship+0.008
Neg logit contributions
uct-0.007
 Heidi-0.007
 Samantha-0.006
 Lib-0.006
 Stella-0.006
Token said
Token ID531
Feature activation-4.52e-02

Pos logit contributions
uct+0.001
 Heidi+0.001
 saying+0.001
 Samantha+0.001
 Lib+0.001
Neg logit contributions
 ships-0.001
 shark-0.001
 armor-0.001
bel-0.001
washer-0.001
Token,
Token ID11
Feature activation+5.65e-02

Pos logit contributions
 ships+0.002
 shark+0.002
 armor+0.002
 tanks+0.002
zo+0.001
Neg logit contributions
uct-0.001
 Heidi-0.001
 Samantha-0.001
 saying-0.001
 Lib-0.001
Token "
Token ID366
Feature activation+3.97e-01

Pos logit contributions
uct+0.002
 Heidi+0.002
 mistake+0.002
 Samantha+0.002
aman+0.002
Neg logit contributions
 ships-0.002
 shark-0.002
bel-0.002
washer-0.002
 armor-0.002
TokenPlease
Token ID5492
Feature activation+2.46e+00

Pos logit contributions
 mistake+0.013
 saying+0.011
 Heidi+0.011
uct+0.011
 OK+0.011
Neg logit contributions
 ships-0.013
bel-0.013
zo-0.012
 armor-0.012
 armour-0.012
Token mail
Token ID6920
Feature activation-6.89e-01

Pos logit contributions
uct+0.079
 Heidi+0.075
 saying+0.074
 Lib+0.073
aman+0.070
Neg logit contributions
 ships-0.079
 shark-0.077
washer-0.073
 ship-0.073
 armor-0.072
Token this
Token ID428
Feature activation-9.31e-02

Pos logit contributions
 ships+0.026
 armor+0.024
 shark+0.024
washer+0.023
 tanks+0.023
Neg logit contributions
uct-0.019
 Heidi-0.018
 Lib-0.017
 saying-0.017
aman-0.017
Token to
Token ID284
Feature activation+1.85e-01

Pos logit contributions
washer+0.003
 ships+0.003
 ship+0.002
 port+0.002
 shark+0.002
Neg logit contributions
uct-0.003
 Heidi-0.003
 saying-0.003
 OK-0.003
Ste-0.003
Token my
Token ID616
Feature activation+1.71e-01

Pos logit contributions
uct+0.005
 Heidi+0.005
 Samantha+0.005
 saying+0.004
 Lib+0.004
Neg logit contributions
 ships-0.007
 shark-0.007
 armor-0.007
washer-0.006
 tanks-0.006
Token grandma
Token ID49890
Feature activation-5.08e-01

Pos logit contributions
uct+0.006
 Heidi+0.006
 Samantha+0.005
 Lib+0.005
shadow+0.005
Neg logit contributions
 ships-0.006
 ship-0.005
 armor-0.005
 shark-0.005
washer-0.005
Token note
Token ID3465
Feature activation+7.43e-02

Pos logit contributions
 ships+0.001
 shark+0.001
 armor+0.001
bel+0.001
washer+0.001
Neg logit contributions
uct-0.001
 Heidi-0.001
 Samantha-0.001
 Lib-0.001
 saying-0.001
Token that
Token ID326
Feature activation+1.00e+00

Pos logit contributions
 Samantha+0.003
 Heidi+0.002
uct+0.002
aman+0.002
 Lib+0.002
Neg logit contributions
 ships-0.002
washer-0.002
 armor-0.002
bel-0.002
 ship-0.002
Token said
Token ID531
Feature activation+4.04e-01

Pos logit contributions
uct+0.032
 Heidi+0.031
aman+0.031
 Samantha+0.031
 Lib+0.030
Neg logit contributions
 ships-0.034
 shark-0.032
washer-0.031
 armor-0.030
 tanks-0.030
Token,
Token ID11
Feature activation+8.48e-01

Pos logit contributions
 saying+0.009
 Heidi+0.009
 Samantha+0.009
uct+0.008
 Lib+0.008
Neg logit contributions
 ships-0.017
 shark-0.016
 armor-0.016
 tanks-0.016
zo-0.016
Token "
Token ID366
Feature activation+4.02e-01

Pos logit contributions
uct+0.025
 Heidi+0.025
 Samantha+0.025
aman+0.024
 Lib+0.024
Neg logit contributions
 ships-0.030
 shark-0.028
 armor-0.027
 tanks-0.026
washer-0.026
TokenPlease
Token ID5492
Feature activation+2.45e+00

Pos logit contributions
 mistake+0.013
 Heidi+0.012
 saying+0.011
 Samantha+0.011
uct+0.011
Neg logit contributions
bel-0.014
zo-0.014
 ships-0.013
 armor-0.012
 armour-0.012
Token send
Token ID3758
Feature activation+1.28e-01

Pos logit contributions
uct+0.071
 Heidi+0.070
 Lib+0.068
 Samantha+0.068
 saying+0.068
Neg logit contributions
 ships-0.087
 shark-0.083
 armor-0.081
washer-0.079
zo-0.078
Token my
Token ID616
Feature activation+3.64e-01

Pos logit contributions
uct+0.003
 Heidi+0.003
 Samantha+0.003
 Lib+0.003
 saying+0.003
Neg logit contributions
 ships-0.005
 shark-0.005
 armor-0.005
 tanks-0.005
zo-0.005
Token toys
Token ID14958
Feature activation+1.66e-01

Pos logit contributions
uct+0.011
 Heidi+0.011
 Samantha+0.011
 Lib+0.011
aman+0.011
Neg logit contributions
 ships-0.013
 shark-0.012
 armor-0.012
 tanks-0.011
washer-0.011
Token to
Token ID284
Feature activation-1.22e-01

Pos logit contributions
uct+0.005
 Heidi+0.005
 Samantha+0.005
 saying+0.005
 Lib+0.005
Neg logit contributions
 ships-0.006
 shark-0.005
 armor-0.005
washer-0.005
bel-0.005
Token the
Token ID262
Feature activation+8.05e-03

Pos logit contributions
 ships+0.005
 shark+0.004
 armor+0.004
 tanks+0.004
zo+0.004
Neg logit contributions
uct-0.003
 Heidi-0.003
 Samantha-0.003
 Lib-0.003
aman-0.003
Token girl
Token ID2576
Feature activation+7.20e-01

Pos logit contributions
 saying+0.022
 Heidi+0.021
 Samantha+0.021
 right+0.021
 Lib+0.021
Neg logit contributions
 ships-0.024
bel-0.022
 armor-0.022
 shark-0.022
zo-0.022
Token in
Token ID287
Feature activation+8.96e-01

Pos logit contributions
uct+0.025
 Heidi+0.025
aman+0.024
 Samantha+0.023
 Lib+0.022
Neg logit contributions
 ships-0.022
 ship-0.021
 shark-0.021
 polar-0.020
 armor-0.020
Token a
Token ID257
Feature activation-4.66e-02

Pos logit contributions
uct+0.025
 saying+0.024
 Heidi+0.024
 Samantha+0.024
 Lib+0.023
Neg logit contributions
 ships-0.033
 shark-0.032
 armor-0.031
 tanks-0.029
 armour-0.029
Token grass
Token ID8701
Feature activation+5.99e-01

Pos logit contributions
 ships+0.002
 shark+0.001
 armor+0.001
bel+0.001
washer+0.001
Neg logit contributions
uct-0.001
 Heidi-0.001
 saying-0.001
 Samantha-0.001
 Lib-0.001
Tokeny
Token ID88
Feature activation+3.69e-01

Pos logit contributions
 Heidi+0.013
uct+0.013
 Samantha+0.012
 OK+0.012
 Lib+0.011
Neg logit contributions
 ships-0.026
 shark-0.025
washer-0.024
 ship-0.024
bel-0.024
Token me
Token ID502
Feature activation+2.45e+00

Pos logit contributions
uct+0.014
 Heidi+0.013
 Lib+0.013
 Samantha+0.012
 saying+0.012
Neg logit contributions
 ships-0.011
 shark-0.011
 armor-0.011
 ship-0.010
 armour-0.010
Tokenadow
Token ID4584
Feature activation+1.64e+00

Pos logit contributions
 Heidi+0.104
uct+0.100
 Samantha+0.098
 Stella+0.098
aman+0.097
Neg logit contributions
 ships-0.052
washer-0.050
bel-0.048
 armor-0.046
zo-0.046
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
uct+0.044
 Heidi+0.042
 Samantha+0.041
aman+0.041
 OK+0.038
Neg logit contributions
 ships-0.062
bel-0.060
washer-0.059
 shark-0.059
 ship-0.058
Token She
Token ID1375
Feature activation+2.78e-01

Pos logit contributions
uct+0.004
 mistake+0.004
 saying+0.004
 Heidi+0.004
aman+0.004
Neg logit contributions
 ships-0.005
 shark-0.005
 tanks-0.005
 armor-0.004
washer-0.004
Token was
Token ID373
Feature activation+2.10e-01

Pos logit contributions
 Heidi+0.008
uct+0.008
 Lib+0.008
aman+0.008
 OK+0.008
Neg logit contributions
 ships-0.010
 shark-0.009
zo-0.009
 armor-0.008
 ship-0.008
Token playing
Token ID2712
Feature activation+4.50e-01

Pos logit contributions
uct+0.007
 Heidi+0.006
aman+0.006
 Samantha+0.006
 mistake+0.006
Neg logit contributions
 ships-0.007
 shark-0.006
 armor-0.006
 tanks-0.006
bel-0.006
Token's
Token ID338
Feature activation+4.83e-01

Pos logit contributions
 ships+0.012
washer+0.011
 ship+0.011
 tanks+0.010
zo+0.009
Neg logit contributions
uct-0.020
 Heidi-0.020
 temptation-0.019
 mistake-0.018
 Lib-0.018
Token house
Token ID2156
Feature activation+8.53e-01

Pos logit contributions
uct+0.019
 Heidi+0.018
 Lib+0.018
 Samantha+0.017
 saying+0.017
Neg logit contributions
 ships-0.014
bel-0.012
washer-0.012
 armor-0.012
 shark-0.012
Token was
Token ID373
Feature activation-2.17e-01

Pos logit contributions
 Heidi+0.034
uct+0.032
aman+0.031
 OK+0.031
 Lib+0.031
Neg logit contributions
washer-0.022
 ships-0.021
bel-0.020
 ship-0.019
 tanks-0.017
Token built
Token ID3170
Feature activation+1.17e+00

Pos logit contributions
 ships+0.007
bel+0.006
washer+0.006
 armor+0.006
 ship+0.006
Neg logit contributions
 Heidi-0.007
uct-0.007
 Samantha-0.007
aman-0.007
 Lib-0.007
Token long
Token ID890
Feature activation+2.88e-01

Pos logit contributions
uct+0.037
 Heidi+0.034
 Samantha+0.034
aman+0.033
 Lib+0.031
Neg logit contributions
 ships-0.041
 armor-0.039
bel-0.037
washer-0.036
 ship-0.036
Token ago
Token ID2084
Feature activation+2.43e+00

Pos logit contributions
uct+0.010
 Heidi+0.009
 Samantha+0.009
aman+0.009
 Lib+0.009
Neg logit contributions
 ships-0.010
 armor-0.009
 tanks-0.009
 shark-0.008
 armour-0.008
Token.
Token ID13
Feature activation+9.95e-02

Pos logit contributions
uct+0.074
 Heidi+0.068
aman+0.066
 Samantha+0.066
 Lib+0.060
Neg logit contributions
 ships-0.090
bel-0.084
washer-0.083
 armor-0.083
 ship-0.083
Token\n
Token ID198
Feature activation-2.20e-01

Pos logit contributions
 mistake+0.003
 saying+0.003
uct+0.003
aman+0.003
 Heidi+0.002
Neg logit contributions
 ships-0.003
 shark-0.003
washer-0.003
 tanks-0.003
bel-0.003
Token\n
Token ID198
Feature activation-1.08e-02

Pos logit contributions
 ships+0.009
 shark+0.009
bel+0.008
washer+0.008
 armor+0.008
Neg logit contributions
uct-0.005
 Heidi-0.005
 saying-0.005
 Samantha-0.005
 Lib-0.005
TokenL
Token ID43
Feature activation+8.31e-02

Pos logit contributions
bel+0.000
 ships+0.000
washer+0.000
zo+0.000
 armor+0.000
Neg logit contributions
 saying-0.000
 Lib-0.000
 Samantha-0.000
 Heidi-0.000
aman-0.000
Tokenily
Token ID813
Feature activation+2.57e-01

Pos logit contributions
 saying+0.002
 Samantha+0.002
 Heidi+0.002
 Lib+0.002
aman+0.002
Neg logit contributions
 ships-0.003
washer-0.003
 shark-0.003
 armor-0.003
bel-0.003
Token squirrel
Token ID33039
Feature activation+5.49e-01

Pos logit contributions
uct+0.007
 Heidi+0.006
 Lib+0.006
 Samantha+0.006
 saying+0.006
Neg logit contributions
 ships-0.005
 shark-0.005
 armor-0.004
 ship-0.004
 armour-0.004
Token.
Token ID13
Feature activation+2.80e-01

Pos logit contributions
 Samantha+0.014
 Heidi+0.014
uct+0.014
aman+0.013
 Lib+0.013
Neg logit contributions
 ships-0.022
 shark-0.021
 ship-0.020
zo-0.020
 tanks-0.020
Token\n
Token ID198
Feature activation+4.37e-03

Pos logit contributions
 saying+0.009
 Heidi+0.009
uct+0.009
 right+0.009
 Samantha+0.008
Neg logit contributions
 ships-0.009
bel-0.009
 shark-0.008
 armor-0.008
washer-0.008
Token\n
Token ID198
Feature activation+3.23e-01

Pos logit contributions
uct+0.000
 Heidi+0.000
 Samantha+0.000
 saying+0.000
 Lib+0.000
Neg logit contributions
 ships-0.000
 shark-0.000
 armor-0.000
 tanks-0.000
washer-0.000
Token"
Token ID1
Feature activation-1.57e-01

Pos logit contributions
uct+0.004
 right+0.004
 saying+0.004
 Heidi+0.004
 OK+0.004
Neg logit contributions
 ships-0.016
 armor-0.015
 shark-0.015
bel-0.015
washer-0.015
TokenPlease
Token ID5492
Feature activation+2.43e+00

Pos logit contributions
 shark+0.005
 ships+0.005
 armor+0.005
zo+0.005
bel+0.005
Neg logit contributions
 mistake-0.005
uct-0.005
 Lib-0.005
 Samantha-0.005
 saying-0.005
Token help
Token ID1037
Feature activation+1.05e-01

Pos logit contributions
uct+0.068
 Heidi+0.067
 Samantha+0.066
 saying+0.066
 Lib+0.065
Neg logit contributions
 ships-0.088
 shark-0.084
 armor-0.082
washer-0.079
 tanks-0.079
Token me
Token ID502
Feature activation+6.42e-01

Pos logit contributions
 saying+0.002
uct+0.002
 Heidi+0.002
 Samantha+0.002
 Lib+0.002
Neg logit contributions
 ships-0.004
 shark-0.004
 armor-0.004
bel-0.004
washer-0.004
Token,
Token ID11
Feature activation+1.20e-01

Pos logit contributions
uct+0.015
 Heidi+0.015
 saying+0.015
 Samantha+0.015
 Lib+0.015
Neg logit contributions
 ships-0.026
 shark-0.024
 armor-0.024
 tanks-0.023
bel-0.023
Token Mister
Token ID43438
Feature activation+2.69e-01

Pos logit contributions
uct+0.004
 Heidi+0.004
 saying+0.004
 Samantha+0.004
 Lib+0.004
Neg logit contributions
 ships-0.004
 shark-0.004
 armor-0.004
bel-0.003
washer-0.003
Token Squirrel
Token ID44259
Feature activation+8.10e-02

Pos logit contributions
uct+0.009
 Lib+0.009
 Samantha+0.009
 Heidi+0.009
aman+0.009
Neg logit contributions
 ships-0.008
 shark-0.008
 armor-0.007
zo-0.007
 tanks-0.007
Token sentence
Token ID6827
Feature activation-4.81e-03

Pos logit contributions
 ships+0.010
 shark+0.010
 armor+0.010
bel+0.010
 armour+0.009
Neg logit contributions
uct-0.010
 Heidi-0.010
 Samantha-0.009
 Lib-0.009
 herself-0.009
Token on
Token ID319
Feature activation+4.27e-01

Pos logit contributions
 ships+0.000
washer+0.000
bel+0.000
 armor+0.000
 shark+0.000
Neg logit contributions
uct-0.000
 Samantha-0.000
 Heidi-0.000
aman-0.000
 mistake-0.000
Token it
Token ID340
Feature activation+6.15e-01

Pos logit contributions
uct+0.014
shadow+0.013
 OK+0.013
 mistake+0.013
 saying+0.013
Neg logit contributions
 ships-0.013
 shark-0.012
 armor-0.012
 tanks-0.011
 pawn-0.011
Token.
Token ID13
Feature activation+8.51e-01

Pos logit contributions
 Samantha+0.019
 Heidi+0.019
uct+0.018
 Lib+0.018
aman+0.018
Neg logit contributions
 ships-0.020
washer-0.019
bel-0.019
zo-0.019
 armor-0.018
Token '
Token ID705
Feature activation+6.31e-03

Pos logit contributions
 mistake+0.034
uct+0.030
aman+0.029
 saying+0.029
 policy+0.028
Neg logit contributions
 ships-0.021
 Lions-0.019
 pawn-0.019
 tanks-0.019
bel-0.018
TokenPlease
Token ID5492
Feature activation+2.41e+00

Pos logit contributions
 mistake+0.000
 Heidi+0.000
 saying+0.000
 policy+0.000
aman+0.000
Neg logit contributions
bel-0.000
zo-0.000
 ships-0.000
 pawn-0.000
hing-0.000
Token remove
Token ID4781
Feature activation+4.22e-01

Pos logit contributions
uct+0.074
 Lib+0.072
 Heidi+0.071
 Samantha+0.069
 saying+0.068
Neg logit contributions
 shark-0.072
 ships-0.071
washer-0.070
 ship-0.068
 armor-0.067
Token your
Token ID534
Feature activation-2.70e-01

Pos logit contributions
uct+0.014
 Heidi+0.013
 Lib+0.013
 saying+0.012
aman+0.012
Neg logit contributions
 ships-0.013
 shark-0.012
 armor-0.011
 tanks-0.011
 armour-0.011
Token coats
Token ID30720
Feature activation+4.40e-01

Pos logit contributions
 ships+0.008
 shark+0.007
 armor+0.007
 tanks+0.007
 ship+0.007
Neg logit contributions
uct-0.010
 Heidi-0.010
 Lib-0.010
aman-0.009
 Samantha-0.009
Token'.
Token ID4458
Feature activation+7.40e-01

Pos logit contributions
 Heidi+0.012
uct+0.012
 Samantha+0.011
 Lib+0.011
 saying+0.011
Neg logit contributions
 ships-0.016
washer-0.016
bel-0.015
 ship-0.014
 shark-0.014
Token\n
Token ID198
Feature activation+6.98e-01

Pos logit contributions
 mistake+0.030
uct+0.025
 mess+0.025
ige+0.024
 intact+0.024
Neg logit contributions
bel-0.016
 ships-0.016
 Sp-0.015
 pawn-0.015
 Lions-0.015
Token called
Token ID1444
Feature activation+2.37e-01

Pos logit contributions
 ships+0.009
 shark+0.008
washer+0.008
bel+0.008
 armor+0.007
Neg logit contributions
uct-0.009
 Heidi-0.009
 mistake-0.009
 saying-0.009
 Lib-0.009
Token out
Token ID503
Feature activation-5.48e-02

Pos logit contributions
uct+0.011
aman+0.010
 mistake+0.009
Ste+0.009
mm+0.009
Neg logit contributions
 ships-0.005
 shark-0.004
 armor-0.004
washer-0.004
 armour-0.004
Token loudly
Token ID23112
Feature activation+3.77e-01

Pos logit contributions
 ships+0.002
 shark+0.002
 armor+0.002
bel+0.002
washer+0.002
Neg logit contributions
uct-0.002
 Heidi-0.002
 Samantha-0.002
aman-0.002
 Lib-0.002
Token,
Token ID11
Feature activation+4.94e-01

Pos logit contributions
uct+0.012
 Heidi+0.010
aman+0.010
 mistake+0.010
 Samantha+0.010
Neg logit contributions
 ships-0.014
 shark-0.013
 armor-0.013
 tanks-0.012
bel-0.012
Token "
Token ID366
Feature activation+4.75e-01

Pos logit contributions
 policy+0.018
uct+0.017
 Heidi+0.017
mm+0.017
Warning+0.017
Neg logit contributions
 ships-0.013
bel-0.012
 shark-0.011
 ship-0.011
 armor-0.010
TokenPlease
Token ID5492
Feature activation+2.41e+00

Pos logit contributions
 mistake+0.016
 saying+0.015
 Heidi+0.014
uct+0.014
 right+0.014
Neg logit contributions
 ships-0.014
bel-0.013
zo-0.013
 armor-0.013
 shark-0.013
Token come
Token ID1282
Feature activation+9.35e-01

Pos logit contributions
uct+0.069
 Heidi+0.067
 saying+0.066
 Lib+0.066
 Samantha+0.065
Neg logit contributions
 ships-0.087
 shark-0.083
 armor-0.079
washer-0.078
bel-0.077
Token back
Token ID736
Feature activation+1.58e-01

Pos logit contributions
 saying+0.023
 Samantha+0.023
 right+0.023
uct+0.023
 Heidi+0.023
Neg logit contributions
 ships-0.037
 shark-0.035
 armor-0.034
zo-0.034
 tanks-0.034
Token,
Token ID11
Feature activation+8.45e-02

Pos logit contributions
uct+0.004
 Heidi+0.004
 saying+0.004
 Samantha+0.004
 Lib+0.004
Neg logit contributions
 ships-0.006
 shark-0.005
 armor-0.005
bel-0.005
washer-0.005
Token tooth
Token ID16162
Feature activation-7.36e-01

Pos logit contributions
uct+0.003
 Lib+0.002
 Heidi+0.002
 saying+0.002
 Samantha+0.002
Neg logit contributions
 ships-0.003
 shark-0.003
 armor-0.003
washer-0.003
 ship-0.003
Token!
Token ID0
Feature activation-1.80e-01

Pos logit contributions
 ships+0.031
 shark+0.030
washer+0.030
 armor+0.029
zo+0.029
Neg logit contributions
uct-0.017
 Samantha-0.016
 Heidi-0.015
 saying-0.015
 Lib-0.015
Token brother
Token ID3956
Feature activation-6.99e-01

Pos logit contributions
 ships+0.037
 shark+0.036
 armor+0.035
zo+0.035
 armour+0.035
Neg logit contributions
 mistake-0.025
 saying-0.025
 right-0.025
uct-0.022
 Heidi-0.022
Token and
Token ID290
Feature activation-3.73e-01

Pos logit contributions
 ships+0.027
 shark+0.025
 armor+0.025
 tanks+0.025
washer+0.025
Neg logit contributions
uct-0.018
 Samantha-0.018
 Heidi-0.018
 Lib-0.018
aman-0.017
Token said
Token ID531
Feature activation-3.29e-01

Pos logit contributions
 ships+0.017
 shark+0.016
 armor+0.016
zo+0.016
 armour+0.015
Neg logit contributions
 saying-0.007
uct-0.007
 Heidi-0.007
 Samantha-0.007
 Lib-0.007
Token,
Token ID11
Feature activation-1.65e-01

Pos logit contributions
 ships+0.012
 shark+0.011
 armor+0.011
washer+0.011
bel+0.010
Neg logit contributions
uct-0.010
 Samantha-0.009
 Heidi-0.009
 saying-0.009
 Lib-0.009
Token "
Token ID366
Feature activation+4.29e-01

Pos logit contributions
 ships+0.004
 shark+0.004
washer+0.004
 tanks+0.004
bel+0.004
Neg logit contributions
uct-0.006
aman-0.006
 Samantha-0.006
 mistake-0.006
 Heidi-0.006
TokenPlease
Token ID5492
Feature activation+2.41e+00

Pos logit contributions
 Lib+0.013
uct+0.013
 Samantha+0.012
 saying+0.012
 Heidi+0.012
Neg logit contributions
 ships-0.014
bel-0.014
zo-0.013
 shark-0.013
 armor-0.013
Token give
Token ID1577
Feature activation+1.01e+00

Pos logit contributions
uct+0.066
 Heidi+0.065
 Samantha+0.064
 saying+0.064
 Lib+0.063
Neg logit contributions
 ships-0.090
 shark-0.085
 armor-0.082
washer-0.081
 tanks-0.080
Token me
Token ID502
Feature activation+3.75e-01

Pos logit contributions
uct+0.022
 Heidi+0.021
 Samantha+0.021
 saying+0.021
 Lib+0.021
Neg logit contributions
 ships-0.043
 shark-0.041
 armor-0.040
 tanks-0.040
washer-0.039
Token back
Token ID736
Feature activation+1.31e-01

Pos logit contributions
 right+0.008
 Samantha+0.008
uct+0.008
 saying+0.008
 Heidi+0.008
Neg logit contributions
 ships-0.016
 shark-0.015
 armor-0.015
zo-0.015
bel-0.015
Token my
Token ID616
Feature activation+5.98e-01

Pos logit contributions
uct+0.004
 Heidi+0.003
 saying+0.003
 Samantha+0.003
 Lib+0.003
Neg logit contributions
 ships-0.005
 shark-0.005
 armor-0.005
 tanks-0.004
bel-0.004
Token er
Token ID1931
Feature activation-8.30e-01

Pos logit contributions
 Heidi+0.021
uct+0.021
 Samantha+0.021
aman+0.020
 Lib+0.020
Neg logit contributions
 ships-0.018
 armor-0.018
 shark-0.017
 pawn-0.017
 tanks-0.017
Token too
Token ID1165
Feature activation-1.09e+00

Pos logit contributions
 ships+0.026
 shark+0.024
 armor+0.023
 tanks+0.023
washer+0.023
Neg logit contributions
uct-0.019
 Heidi-0.018
 Samantha-0.018
 saying-0.018
 Lib-0.018
Token close
Token ID1969
Feature activation-4.93e-01

Pos logit contributions
 ships+0.038
 shark+0.036
 armor+0.035
 tanks+0.034
zo+0.034
Neg logit contributions
uct-0.033
 saying-0.032
 Heidi-0.032
 Samantha-0.031
 Lib-0.031
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
 ships+0.015
 shark+0.014
 armor+0.013
 tanks+0.013
zo+0.012
Neg logit contributions
uct-0.018
 Heidi-0.017
 Lib-0.016
 Samantha-0.016
 saying-0.016
Token The
Token ID383
Feature activation-6.36e-01

Pos logit contributions
 ships+0.039
 shark+0.036
 tanks+0.035
washer+0.034
zo+0.034
Neg logit contributions
uct-0.031
 saying-0.030
 Heidi-0.030
 mistake-0.029
aman-0.029
Token ost
Token ID23619
Feature activation+8.92e-02

Pos logit contributions
 ships+0.016
 shark+0.015
 tanks+0.014
 river+0.013
 ship+0.013
Neg logit contributions
uct-0.029
 Heidi-0.027
 Lib-0.026
 Samantha-0.025
aman-0.025
Tokenric
Token ID1173
Feature activation-3.96e+00

Pos logit contributions
 Heidi+0.004
uct+0.004
 Samantha+0.004
 Lib+0.004
 saying+0.004
Neg logit contributions
 ships-0.001
 shark-0.001
zo-0.001
 tanks-0.001
 armor-0.001
Tokenhes
Token ID956
Feature activation-2.09e+00

Pos logit contributions
 ships+0.102
washer+0.097
bel+0.094
 shark+0.094
zo+0.091
Neg logit contributions
 Samantha-0.155
aman-0.152
 Heidi-0.150
 Stella-0.147
 Lib-0.147
Token might
Token ID1244
Feature activation-6.51e-01

Pos logit contributions
 ships+0.072
 shark+0.068
washer+0.066
zo+0.064
 tanks+0.064
Neg logit contributions
 Heidi-0.063
uct-0.063
 Samantha-0.061
 Lib-0.061
 saying-0.060
Token pe
Token ID613
Feature activation-5.87e-01

Pos logit contributions
 ships+0.019
 shark+0.019
zo+0.018
 ship+0.018
bel+0.018
Neg logit contributions
uct-0.023
 Heidi-0.022
 Lib-0.021
 Samantha-0.021
aman-0.021
Tokenck
Token ID694
Feature activation-9.79e-01

Pos logit contributions
 ships+0.024
 shark+0.022
 armor+0.022
bel+0.022
zo+0.022
Neg logit contributions
 Heidi-0.014
uct-0.014
 Samantha-0.014
 Lib-0.013
 saying-0.013
Token you
Token ID345
Feature activation-1.29e-01

Pos logit contributions
 ships+0.037
 shark+0.034
 armor+0.033
 tanks+0.033
zo+0.033
Neg logit contributions
uct-0.028
 Heidi-0.027
 Samantha-0.027
 Lib-0.027
 saying-0.026
Token should
Token ID815
Feature activation+1.01e+00

Pos logit contributions
 OK+0.011
 right+0.011
 saying+0.011
uct+0.011
 okay+0.011
Neg logit contributions
 ships-0.019
 shark-0.019
 armor-0.018
bel-0.018
 armour-0.018
Token not
Token ID407
Feature activation+3.80e-02

Pos logit contributions
uct+0.032
 Lib+0.030
 Heidi+0.029
aman+0.029
 Samantha+0.028
Neg logit contributions
 ships-0.032
 shark-0.031
 ship-0.030
zo-0.030
 tanks-0.029
Token tease
Token ID34691
Feature activation+1.01e-01

Pos logit contributions
 Heidi+0.001
uct+0.001
 Lib+0.001
 Samantha+0.001
aman+0.001
Neg logit contributions
 ships-0.001
 shark-0.001
zo-0.001
 tanks-0.001
 armor-0.001
Token the
Token ID262
Feature activation-5.82e-01

Pos logit contributions
 saying+0.002
 Heidi+0.002
 Samantha+0.002
uct+0.002
 Lib+0.002
Neg logit contributions
 ships-0.004
 shark-0.004
 armor-0.004
washer-0.004
bel-0.004
Token ost
Token ID23619
Feature activation+3.54e-01

Pos logit contributions
 ships+0.015
 pengu+0.014
 shark+0.014
bel+0.014
 polar+0.013
Neg logit contributions
uct-0.025
 Heidi-0.024
 Samantha-0.022
 Stella-0.021
 Lib-0.021
Tokenric
Token ID1173
Feature activation-3.95e+00

Pos logit contributions
 Heidi+0.018
uct+0.017
 Samantha+0.017
 Lib+0.017
 saying+0.017
Neg logit contributions
 ships-0.006
 shark-0.005
zo-0.004
 tanks-0.004
washer-0.004
Tokenhes
Token ID956
Feature activation-1.76e+00

Pos logit contributions
 ships+0.097
washer+0.093
zo+0.090
bel+0.089
 shark+0.088
Neg logit contributions
 Samantha-0.164
aman-0.162
 Heidi-0.158
 Stella-0.156
 Lib-0.156
Token.
Token ID13
Feature activation-7.70e-01

Pos logit contributions
 ships+0.076
 shark+0.073
washer+0.070
 tanks+0.070
zo+0.068
Neg logit contributions
uct-0.040
 Heidi-0.039
 Lib-0.038
 Samantha-0.037
aman-0.036
Token They
Token ID1119
Feature activation-9.37e-01

Pos logit contributions
 ships+0.025
 shark+0.023
 tanks+0.022
washer+0.022
zo+0.022
Neg logit contributions
uct-0.025
 Heidi-0.024
aman-0.024
 saying-0.023
 Lib-0.023
Token are
Token ID389
Feature activation-4.36e-01

Pos logit contributions
 ships+0.033
 shark+0.030
 armor+0.029
 tanks+0.029
washer+0.029
Neg logit contributions
 Heidi-0.028
uct-0.028
 Samantha-0.027
 Lib-0.027
 saying-0.027
Token not
Token ID407
Feature activation-6.46e-01

Pos logit contributions
 ships+0.016
 tanks+0.015
 shark+0.015
 armor+0.014
 soldiers+0.014
Neg logit contributions
aman-0.013
uct-0.013
 Lib-0.013
 Heidi-0.013
 Samantha-0.012
Token you
Token ID345
Feature activation+2.39e-01

Pos logit contributions
uct+0.019
 Heidi+0.019
 Samantha+0.019
 saying+0.019
aman+0.019
Neg logit contributions
 ships-0.028
 shark-0.026
 tanks-0.025
bel-0.025
zo-0.025
Token count
Token ID954
Feature activation-1.64e-01

Pos logit contributions
 saying+0.007
 right+0.007
uct+0.006
 Samantha+0.006
 Heidi+0.006
Neg logit contributions
 ships-0.009
 shark-0.008
 tanks-0.008
washer-0.008
 armor-0.008
Token how
Token ID703
Feature activation-1.03e+00

Pos logit contributions
 ships+0.006
 shark+0.006
 armor+0.006
washer+0.006
zo+0.006
Neg logit contributions
 saying-0.004
uct-0.004
 Samantha-0.004
 Heidi-0.004
 Lib-0.004
Token many
Token ID867
Feature activation-9.72e-01

Pos logit contributions
 ships+0.034
 shark+0.033
 armor+0.032
zo+0.031
 tanks+0.031
Neg logit contributions
uct-0.031
 Samantha-0.031
 saying-0.031
 Heidi-0.031
 Lib-0.030
Token ost
Token ID23619
Feature activation+3.61e-01

Pos logit contributions
 ships+0.033
 shark+0.028
 tanks+0.028
 armor+0.028
bel+0.027
Neg logit contributions
uct-0.033
 Heidi-0.032
 Samantha-0.030
aman-0.030
 Lib-0.030
Tokenric
Token ID1173
Feature activation-3.87e+00

Pos logit contributions
 Heidi+0.017
uct+0.017
 Samantha+0.016
 saying+0.016
 Lib+0.016
Neg logit contributions
 ships-0.007
 shark-0.006
 armor-0.005
zo-0.005
 tanks-0.005
Tokenhes
Token ID956
Feature activation-1.64e+00

Pos logit contributions
 ships+0.109
washer+0.098
bel+0.097
zo+0.097
 shark+0.096
Neg logit contributions
 Samantha-0.146
aman-0.145
 Heidi-0.143
uct-0.139
 Lib-0.138
Token there
Token ID612
Feature activation-7.37e-02

Pos logit contributions
 ships+0.060
 shark+0.056
 armor+0.054
washer+0.054
bel+0.054
Neg logit contributions
uct-0.046
 Heidi-0.046
 Samantha-0.045
 Lib-0.044
 saying-0.044
Token are
Token ID389
Feature activation-8.95e-01

Pos logit contributions
 ships+0.003
 shark+0.002
 armor+0.002
washer+0.002
 tanks+0.002
Neg logit contributions
uct-0.002
 Heidi-0.002
 Samantha-0.002
 Lib-0.002
 saying-0.002
Token?"
Token ID1701
Feature activation-6.99e-02

Pos logit contributions
 ships+0.033
 shark+0.030
 armor+0.030
 tanks+0.029
bel+0.029
Neg logit contributions
uct-0.026
 Heidi-0.025
 Samantha-0.025
 Lib-0.024
aman-0.024
Token\n
Token ID198
Feature activation-3.25e-01

Pos logit contributions
 ships+0.003
 shark+0.002
bel+0.002
 armor+0.002
 tanks+0.002
Neg logit contributions
uct-0.002
aman-0.002
 Samantha-0.002
 saying-0.002
 Heidi-0.002
Token shouted
Token ID17293
Feature activation+3.62e-02

Pos logit contributions
uct+0.005
 Heidi+0.005
 Samantha+0.005
aman+0.005
 Lib+0.005
Neg logit contributions
 ships-0.011
 shark-0.010
 armor-0.010
zo-0.010
washer-0.010
Token.
Token ID13
Feature activation-4.44e-01

Pos logit contributions
uct+0.001
aman+0.001
 Heidi+0.001
 Samantha+0.001
 Lib+0.001
Neg logit contributions
 ships-0.002
 shark-0.001
 tanks-0.001
 armor-0.001
bel-0.001
Token "
Token ID366
Feature activation-3.19e-01

Pos logit contributions
 ships+0.021
 armor+0.020
 shark+0.020
zo+0.020
washer+0.019
Neg logit contributions
 Heidi-0.008
 saying-0.007
 Samantha-0.007
 right-0.007
 Lib-0.007
TokenThe
Token ID464
Feature activation-1.91e-02

Pos logit contributions
 ships+0.014
 shark+0.013
 armor+0.013
 tanks+0.013
washer+0.013
Neg logit contributions
uct-0.007
 Heidi-0.007
 Samantha-0.006
 saying-0.006
 Lib-0.006
Token ost
Token ID23619
Feature activation-1.01e-01

Pos logit contributions
 ships+0.000
 shark+0.000
 ship+0.000
 oct+0.000
 tanks+0.000
Neg logit contributions
uct-0.001
 Heidi-0.001
 Samantha-0.001
 Lib-0.001
 Stella-0.001
Tokenric
Token ID1173
Feature activation-3.81e+00

Pos logit contributions
 ships+0.002
 shark+0.002
 armor+0.002
 tanks+0.002
washer+0.002
Neg logit contributions
uct-0.005
 Heidi-0.005
 Samantha-0.004
 saying-0.004
 Lib-0.004
Tokenhes
Token ID956
Feature activation-1.76e+00

Pos logit contributions
 ships+0.099
washer+0.093
 shark+0.091
bel+0.090
 armor+0.088
Neg logit contributions
 Samantha-0.149
aman-0.146
 Heidi-0.143
 Stella-0.141
 Lib-0.141
Token are
Token ID389
Feature activation-8.09e-01

Pos logit contributions
 ships+0.063
 shark+0.060
washer+0.057
bel+0.056
 armor+0.056
Neg logit contributions
 Heidi-0.050
uct-0.050
 Samantha-0.049
 Lib-0.048
 saying-0.048
Token so
Token ID523
Feature activation+3.22e-02

Pos logit contributions
 ships+0.027
 shark+0.025
 armor+0.025
 tanks+0.025
bel+0.024
Neg logit contributions
uct-0.026
aman-0.025
 Heidi-0.025
 Lib-0.024
 Samantha-0.024
Token funny
Token ID8258
Feature activation+2.09e-01

Pos logit contributions
uct+0.001
aman+0.001
 Heidi+0.001
 Samantha+0.001
 saying+0.001
Neg logit contributions
 ships-0.001
bel-0.001
 shark-0.001
zo-0.001
 tanks-0.001
Token!"
Token ID2474
Feature activation+1.35e-01

Pos logit contributions
uct+0.004
 Heidi+0.004
 Samantha+0.004
 Lib+0.004
aman+0.004
Neg logit contributions
 ships-0.009
 shark-0.009
 armor-0.009
 tanks-0.009
washer-0.009
Token but
Token ID475
Feature activation+7.00e-01

Pos logit contributions
 ships+0.004
 shark+0.004
 armor+0.004
 tanks+0.004
zo+0.004
Neg logit contributions
uct-0.002
 Heidi-0.002
 Samantha-0.002
 Lib-0.002
 saying-0.002
Token be
Token ID307
Feature activation+6.89e-01

Pos logit contributions
uct+0.022
 Heidi+0.020
 Lib+0.020
 Samantha+0.019
aman+0.019
Neg logit contributions
 ships-0.025
 shark-0.024
 ship-0.023
 tanks-0.022
 coral-0.021
Token careful
Token ID8161
Feature activation+1.62e-01

Pos logit contributions
uct+0.018
aman+0.018
 Samantha+0.018
 Heidi+0.018
 Lib+0.018
Neg logit contributions
 ships-0.026
 armor-0.025
 shark-0.024
 tanks-0.024
bel-0.023
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
uct+0.004
aman+0.004
 Heidi+0.004
 Samantha+0.004
 Lib+0.004
Neg logit contributions
 ships-0.007
 shark-0.007
washer-0.006
 tanks-0.006
zo-0.006
Token Ost
Token ID38919
Feature activation-6.59e-01

Pos logit contributions
 ships+0.010
 shark+0.010
 armor+0.009
washer+0.009
 tanks+0.009
Neg logit contributions
uct-0.008
 Heidi-0.008
 Samantha-0.008
 saying-0.008
 Lib-0.008
Tokenric
Token ID1173
Feature activation-3.76e+00

Pos logit contributions
 ships+0.012
 shark+0.010
 tanks+0.009
 armor+0.009
zo+0.009
Neg logit contributions
 Heidi-0.031
 Samantha-0.030
uct-0.030
 Lib-0.030
 saying-0.029
Tokenhes
Token ID956
Feature activation-1.87e+00

Pos logit contributions
 ships+0.098
 shark+0.089
washer+0.089
zo+0.087
bel+0.087
Neg logit contributions
 Samantha-0.149
aman-0.146
 Heidi-0.144
 Stella-0.142
 Lib-0.141
Token can
Token ID460
Feature activation-5.60e-01

Pos logit contributions
 ships+0.069
 shark+0.066
washer+0.063
 tanks+0.063
zo+0.062
Neg logit contributions
uct-0.053
 Heidi-0.052
 Samantha-0.051
 Lib-0.051
aman-0.049
Token be
Token ID307
Feature activation+1.38e-01

Pos logit contributions
 ships+0.019
 shark+0.018
 armor+0.017
zo+0.017
bel+0.017
Neg logit contributions
uct-0.018
 Heidi-0.017
 Samantha-0.017
 Lib-0.017
 saying-0.017
Token gr
Token ID1036
Feature activation-1.05e+00

Pos logit contributions
uct+0.004
aman+0.004
 Heidi+0.004
 Lib+0.004
 Samantha+0.004
Neg logit contributions
 ships-0.005
 tanks-0.004
bel-0.004
 shark-0.004
 armor-0.004
Tokenumpy
Token ID32152
Feature activation+1.58e-01

Pos logit contributions
 ships+0.037
 shark+0.035
 armor+0.035
 armour+0.033
bel+0.033
Neg logit contributions
uct-0.031
 saying-0.030
 Samantha-0.030
 Heidi-0.029
 Lib-0.029
Token mom
Token ID1995
Feature activation-7.74e-01

Pos logit contributions
 ships+0.015
 shark+0.014
 armor+0.014
 tanks+0.014
zo+0.013
Neg logit contributions
uct-0.014
 Heidi-0.013
 Lib-0.013
 Samantha-0.013
 saying-0.013
Token and
Token ID290
Feature activation+3.75e-01

Pos logit contributions
 ships+0.028
washer+0.026
 tanks+0.026
 shark+0.025
 ship+0.024
Neg logit contributions
uct-0.023
 Heidi-0.022
aman-0.022
 Samantha-0.021
 Lib-0.020
Token to
Token ID284
Feature activation+3.46e-01

Pos logit contributions
 saying+0.012
 right+0.011
 Samantha+0.011
uct+0.011
 Heidi+0.011
Neg logit contributions
 ships-0.013
 armor-0.012
 shark-0.012
 armour-0.012
zo-0.012
Token the
Token ID262
Feature activation-4.60e-01

Pos logit contributions
uct+0.012
 Heidi+0.012
 Lib+0.012
aman+0.012
 Samantha+0.011
Neg logit contributions
 ships-0.010
 shark-0.009
 tanks-0.009
 armor-0.009
 ship-0.009
Token ost
Token ID23619
Feature activation-1.17e-01

Pos logit contributions
 ships+0.013
 shark+0.011
 polar+0.011
bel+0.011
 pengu+0.011
Neg logit contributions
uct-0.019
 Heidi-0.018
 Samantha-0.017
 Lib-0.017
 Stella-0.016
Tokenric
Token ID1173
Feature activation-3.75e+00

Pos logit contributions
 ships+0.002
 shark+0.002
 armor+0.002
zo+0.002
 tanks+0.002
Neg logit contributions
 Heidi-0.005
uct-0.005
 Samantha-0.005
 Lib-0.005
 saying-0.005
Tokenhes
Token ID956
Feature activation-1.29e+00

Pos logit contributions
 ships+0.083
washer+0.079
bel+0.077
 shark+0.074
zo+0.072
Neg logit contributions
 Samantha-0.164
aman-0.164
 Heidi-0.158
 Stella-0.157
 Lib-0.157
Token.
Token ID13
Feature activation-4.36e-01

Pos logit contributions
 ships+0.051
 shark+0.048
washer+0.048
bel+0.047
 tanks+0.046
Neg logit contributions
uct-0.034
 Heidi-0.033
aman-0.031
 Lib-0.031
 Samantha-0.029
Token They
Token ID1119
Feature activation+4.07e-01

Pos logit contributions
 ships+0.013
 shark+0.012
bel+0.012
 tanks+0.012
 armor+0.011
Neg logit contributions
uct-0.016
 saying-0.015
 Heidi-0.014
aman-0.014
 Lib-0.014
Token learned
Token ID4499
Feature activation+3.20e-01

Pos logit contributions
 Heidi+0.011
 Samantha+0.011
uct+0.011
 Lib+0.011
aman+0.011
Neg logit contributions
 ships-0.015
 shark-0.014
 armor-0.014
washer-0.013
zo-0.013
Token their
Token ID511
Feature activation-2.00e-01

Pos logit contributions
uct+0.010
 Heidi+0.009
 Samantha+0.009
 Lib+0.009
 saying+0.009
Neg logit contributions
 ships-0.011
 shark-0.011
 armor-0.010
bel-0.010
 tanks-0.010
Token cried
Token ID16896
Feature activation-2.62e-01

Pos logit contributions
uct+0.007
 Heidi+0.007
 Lib+0.007
 Samantha+0.007
aman+0.007
Neg logit contributions
 ships-0.013
 shark-0.013
zo-0.012
washer-0.012
 armor-0.012
Token.
Token ID13
Feature activation-6.34e-01

Pos logit contributions
 ships+0.013
 shark+0.013
 armor+0.012
 tanks+0.012
zo+0.012
Neg logit contributions
uct-0.004
 Heidi-0.004
 Samantha-0.004
 Lib-0.004
aman-0.003
Token "
Token ID366
Feature activation-4.86e-01

Pos logit contributions
 ships+0.022
 shark+0.022
 tanks+0.020
bel+0.020
 armor+0.020
Neg logit contributions
uct-0.019
aman-0.018
 Samantha-0.018
 Lib-0.017
 Heidi-0.017
TokenThe
Token ID464
Feature activation-8.33e-03

Pos logit contributions
 ships+0.023
 shark+0.022
 armor+0.022
 tanks+0.021
washer+0.021
Neg logit contributions
uct-0.008
 Heidi-0.008
 saying-0.008
 Samantha-0.008
 Lib-0.008
Token ost
Token ID23619
Feature activation-3.37e-01

Pos logit contributions
 shark+0.000
 ships+0.000
 armor+0.000
 ship+0.000
 tanks+0.000
Neg logit contributions
uct-0.000
 Heidi-0.000
 Lib-0.000
 Samantha-0.000
aman-0.000
Tokenric
Token ID1173
Feature activation-3.74e+00

Pos logit contributions
 ships+0.006
 shark+0.006
 armor+0.005
 tanks+0.005
washer+0.005
Neg logit contributions
uct-0.015
 Heidi-0.015
 Samantha-0.015
 saying-0.015
 Lib-0.015
Tokenhes
Token ID956
Feature activation-1.76e+00

Pos logit contributions
 ships+0.105
 shark+0.098
washer+0.096
bel+0.095
 armor+0.094
Neg logit contributions
 Samantha-0.137
 Heidi-0.135
aman-0.133
 Lib-0.133
uct-0.132
Token are
Token ID389
Feature activation-1.09e+00

Pos logit contributions
 ships+0.070
 shark+0.066
 armor+0.065
 tanks+0.063
bel+0.063
Neg logit contributions
uct-0.043
 Heidi-0.042
 saying-0.042
 Samantha-0.042
 Lib-0.041
Token mean
Token ID1612
Feature activation-2.36e-01

Pos logit contributions
 ships+0.038
 shark+0.036
 tanks+0.035
 armor+0.034
bel+0.034
Neg logit contributions
 Heidi-0.033
 Lib-0.033
aman-0.033
 Samantha-0.032
uct-0.032
Token!
Token ID0
Feature activation-8.88e-01

Pos logit contributions
 ships+0.011
 shark+0.010
 armor+0.010
 tanks+0.010
washer+0.010
Neg logit contributions
uct-0.005
 Heidi-0.005
 Samantha-0.004
 Lib-0.004
 saying-0.004
Token They
Token ID1119
Feature activation-3.60e-01

Pos logit contributions
 ships+0.030
 shark+0.029
 tanks+0.028
washer+0.027
zo+0.027
Neg logit contributions
uct-0.027
 Heidi-0.026
aman-0.026
 saying-0.026
 Lib-0.026
Token wanted
Token ID2227
Feature activation+5.65e-02

Pos logit contributions
 ships+0.014
 shark+0.013
 armor+0.013
zo+0.013
 tanks+0.013
Neg logit contributions
 Heidi-0.013
 Lib-0.013
uct-0.013
 Samantha-0.013
aman-0.013
Token to
Token ID284
Feature activation-3.57e-01

Pos logit contributions
 saying+0.002
 Samantha+0.002
 Heidi+0.002
 right+0.002
 Lib+0.002
Neg logit contributions
 ships-0.002
 armor-0.002
 shark-0.002
zo-0.002
washer-0.002
Token touch
Token ID3638
Feature activation+8.57e-02

Pos logit contributions
 ships+0.012
 shark+0.011
 armor+0.011
washer+0.010
 tanks+0.010
Neg logit contributions
uct-0.011
 Heidi-0.011
 Samantha-0.011
 saying-0.011
 Lib-0.011
Token the
Token ID262
Feature activation-6.63e-01

Pos logit contributions
uct+0.002
 Heidi+0.002
 saying+0.002
 Samantha+0.002
 Lib+0.002
Neg logit contributions
 ships-0.003
 shark-0.003
 armor-0.003
 tanks-0.003
zo-0.003
Token ost
Token ID23619
Feature activation+3.68e-01

Pos logit contributions
 ships+0.017
 shark+0.015
 tanks+0.015
 ship+0.015
 pengu+0.014
Neg logit contributions
uct-0.030
 Heidi-0.029
 Samantha-0.027
 Lib-0.027
 Stella-0.027
Tokenric
Token ID1173
Feature activation-3.70e+00

Pos logit contributions
 Heidi+0.017
uct+0.017
 Samantha+0.017
 saying+0.017
 Lib+0.017
Neg logit contributions
 ships-0.007
 shark-0.006
 armor-0.005
zo-0.005
 tanks-0.005
Tokenhes
Token ID956
Feature activation-1.49e+00

Pos logit contributions
washer+0.077
 ships+0.076
bel+0.070
zo+0.068
 shark+0.067
Neg logit contributions
 Samantha-0.167
aman-0.166
 Stella-0.161
 Heidi-0.159
 Lib-0.157
Token.
Token ID13
Feature activation-7.48e-01

Pos logit contributions
 ships+0.063
 shark+0.061
washer+0.058
 tanks+0.058
bel+0.057
Neg logit contributions
uct-0.035
 Heidi-0.033
 Lib-0.032
 Samantha-0.032
aman-0.032
Token They
Token ID1119
Feature activation-5.46e-01

Pos logit contributions
 ships+0.025
 shark+0.023
bel+0.023
 tanks+0.023
 armor+0.022
Neg logit contributions
uct-0.024
 saying-0.022
 mistake-0.022
aman-0.022
 Lib-0.022
Token reached
Token ID4251
Feature activation-2.80e-01

Pos logit contributions
 ships+0.019
 shark+0.017
 armor+0.017
zo+0.017
 tanks+0.016
Neg logit contributions
 Heidi-0.016
uct-0.016
 Lib-0.016
 Samantha-0.016
 saying-0.016
Token out
Token ID503
Feature activation-1.33e+00

Pos logit contributions
 ships+0.010
 shark+0.009
 armor+0.009
 tanks+0.009
bel+0.009
Neg logit contributions
uct-0.008
 Heidi-0.008
 Samantha-0.008
 saying-0.008
 Lib-0.008
Token They
Token ID1119
Feature activation+7.83e-01

Pos logit contributions
 ships+0.025
 shark+0.024
 armor+0.023
 tanks+0.023
washer+0.022
Neg logit contributions
uct-0.024
 Heidi-0.024
 Samantha-0.023
 saying-0.023
 Lib-0.023
Token do
Token ID466
Feature activation+1.17e-01

Pos logit contributions
 Heidi+0.021
uct+0.021
 Samantha+0.020
 Lib+0.020
 saying+0.020
Neg logit contributions
 ships-0.030
 shark-0.028
 armor-0.027
zo-0.027
washer-0.027
Token not
Token ID407
Feature activation-1.23e-01

Pos logit contributions
 Samantha+0.003
 Heidi+0.003
 Lib+0.003
aman+0.003
uct+0.003
Neg logit contributions
 ships-0.005
 shark-0.004
washer-0.004
zo-0.004
 tanks-0.004
Token pack
Token ID2353
Feature activation-7.37e-01

Pos logit contributions
 ships+0.004
 shark+0.004
zo+0.004
 armor+0.004
 tanks+0.004
Neg logit contributions
uct-0.004
 Heidi-0.004
 Samantha-0.004
 Lib-0.004
aman-0.004
Token ost
Token ID23619
Feature activation+6.81e-02

Pos logit contributions
 ships+0.025
 tanks+0.022
 shark+0.022
 armor+0.022
bel+0.021
Neg logit contributions
uct-0.026
aman-0.025
 Heidi-0.024
 Samantha-0.023
 saying-0.023
Tokenric
Token ID1173
Feature activation-3.64e+00

Pos logit contributions
 saying+0.003
uct+0.003
 Samantha+0.003
aman+0.003
 right+0.003
Neg logit contributions
 ships-0.002
 shark-0.002
 armor-0.001
washer-0.001
 armour-0.001
Tokenhes
Token ID956
Feature activation-1.60e+00

Pos logit contributions
 ships+0.095
washer+0.088
 shark+0.083
 armor+0.083
bel+0.083
Neg logit contributions
aman-0.148
 Samantha-0.148
 Heidi-0.143
 Lib-0.141
 Stella-0.140
Token anymore
Token ID7471
Feature activation+1.72e-01

Pos logit contributions
 ships+0.059
 shark+0.054
 tanks+0.053
bel+0.052
 armor+0.052
Neg logit contributions
uct-0.049
 Heidi-0.047
 Lib-0.044
 Samantha-0.044
aman-0.044
Token.
Token ID13
Feature activation-7.36e-01

Pos logit contributions
uct+0.005
aman+0.005
 Samantha+0.005
 Heidi+0.005
 Lib+0.005
Neg logit contributions
 ships-0.007
 shark-0.006
 tanks-0.006
zo-0.006
 ship-0.006
Token They
Token ID1119
Feature activation+3.58e-01

Pos logit contributions
 ships+0.023
 shark+0.022
 tanks+0.021
 armor+0.020
zo+0.020
Neg logit contributions
uct-0.025
 Heidi-0.024
 saying-0.024
aman-0.024
 Samantha-0.024
Token pack
Token ID2353
Feature activation-5.45e-01

Pos logit contributions
 Heidi+0.010
uct+0.010
 Samantha+0.010
 Lib+0.010
 saying+0.010
Neg logit contributions
 ships-0.013
 shark-0.012
 armor-0.012
zo-0.012
washer-0.012
Token ostr
Token ID42387
Feature activation-1.06e-01

Pos logit contributions
 ships+0.017
 shark+0.015
 armor+0.015
zo+0.015
bel+0.015
Neg logit contributions
 saying-0.019
 Heidi-0.019
uct-0.019
 Samantha-0.019
 right-0.019
Tokenich
Token ID488
Feature activation-6.47e-01

Pos logit contributions
 ships+0.004
 shark+0.004
 armor+0.004
 tanks+0.004
washer+0.004
Neg logit contributions
uct-0.003
 saying-0.003
 Lib-0.003
 Samantha-0.003
 Heidi-0.003
Token pen
Token ID3112
Feature activation-8.32e-01

Pos logit contributions
 ships+0.027
 shark+0.026
 armor+0.025
 tanks+0.024
washer+0.024
Neg logit contributions
uct-0.014
 Heidi-0.014
 saying-0.014
 Samantha-0.014
 Lib-0.014
Token.
Token ID13
Feature activation-6.10e-01

Pos logit contributions
 ships+0.034
 armor+0.032
 shark+0.032
 ship+0.032
washer+0.031
Neg logit contributions
uct-0.022
 Samantha-0.022
 Heidi-0.020
aman-0.020
 Lib-0.019
Token Ost
Token ID38919
Feature activation-4.27e-01

Pos logit contributions
 ships+0.021
 shark+0.020
 armor+0.020
 tanks+0.019
washer+0.019
Neg logit contributions
uct-0.018
 Heidi-0.018
 Samantha-0.017
 saying-0.017
 Lib-0.017
Tokenric
Token ID1173
Feature activation-3.58e+00

Pos logit contributions
 ships+0.008
 shark+0.007
 armor+0.006
 tanks+0.006
zo+0.006
Neg logit contributions
 Heidi-0.020
 Samantha-0.020
uct-0.019
 Lib-0.019
 saying-0.019
Tokenhes
Token ID956
Feature activation-1.50e+00

Pos logit contributions
 ships+0.102
 shark+0.093
 armor+0.091
washer+0.091
bel+0.091
Neg logit contributions
 Samantha-0.130
aman-0.128
 Heidi-0.127
uct-0.126
 Lib-0.125
Token are
Token ID389
Feature activation-5.89e-01

Pos logit contributions
 ships+0.055
 shark+0.052
 tanks+0.050
 armor+0.049
washer+0.049
Neg logit contributions
uct-0.043
 Heidi-0.042
 Samantha-0.042
 Lib-0.041
aman-0.040
Token big
Token ID1263
Feature activation+3.02e-01

Pos logit contributions
 ships+0.020
 shark+0.019
 armor+0.019
 tanks+0.018
bel+0.018
Neg logit contributions
uct-0.018
 Heidi-0.018
 Samantha-0.018
aman-0.018
 Lib-0.017
Token birds
Token ID10087
Feature activation+4.60e-01

Pos logit contributions
 saying+0.008
 Heidi+0.007
 Samantha+0.007
uct+0.007
 Lib+0.007
Neg logit contributions
 ships-0.012
 shark-0.011
 armor-0.011
bel-0.011
washer-0.011
Token with
Token ID351
Feature activation-3.57e-02

Pos logit contributions
uct+0.011
 Samantha+0.011
aman+0.011
 Heidi+0.011
 Lib+0.010
Neg logit contributions
 ships-0.020
 shark-0.019
zo-0.018
 tanks-0.018
 ship-0.018
Token ostr
Token ID42387
Feature activation+1.80e-02

Pos logit contributions
 ships+0.008
 shark+0.007
 armor+0.007
zo+0.007
bel+0.007
Neg logit contributions
 saying-0.007
 Heidi-0.007
 right-0.007
uct-0.007
 Samantha-0.007
Tokenich
Token ID488
Feature activation-2.64e-01

Pos logit contributions
uct+0.000
 saying+0.000
 Heidi+0.000
 Samantha+0.000
 Lib+0.000
Neg logit contributions
 ships-0.001
 shark-0.001
 armor-0.001
 tanks-0.001
washer-0.001
Token pen
Token ID3112
Feature activation-6.87e-01

Pos logit contributions
 ships+0.011
 shark+0.011
 armor+0.010
washer+0.010
 tanks+0.010
Neg logit contributions
uct-0.006
 Heidi-0.006
 Samantha-0.006
 saying-0.006
 Lib-0.005
Token.
Token ID13
Feature activation-4.90e-01

Pos logit contributions
 ships+0.027
 shark+0.026
 armor+0.026
 ship+0.026
washer+0.026
Neg logit contributions
uct-0.018
 Samantha-0.018
 Heidi-0.017
 Lib-0.016
aman-0.016
Token Ost
Token ID38919
Feature activation-3.01e-01

Pos logit contributions
 ships+0.016
 shark+0.015
 armor+0.015
 tanks+0.015
washer+0.014
Neg logit contributions
uct-0.015
 saying-0.015
 Heidi-0.015
 Samantha-0.015
 Lib-0.014
Tokenric
Token ID1173
Feature activation-3.58e+00

Pos logit contributions
 ships+0.005
 shark+0.004
 tanks+0.004
 armor+0.004
zo+0.004
Neg logit contributions
 Heidi-0.014
 Samantha-0.014
uct-0.014
 OK-0.014
 Lib-0.014
Tokenhes
Token ID956
Feature activation-1.70e+00

Pos logit contributions
 ships+0.098
 shark+0.089
washer+0.089
bel+0.088
 armor+0.088
Neg logit contributions
 Samantha-0.134
aman-0.132
 Heidi-0.130
uct-0.128
 Lib-0.127
Token are
Token ID389
Feature activation-4.25e-01

Pos logit contributions
 ships+0.060
 shark+0.057
washer+0.054
 tanks+0.054
 ship+0.052
Neg logit contributions
uct-0.053
 Heidi-0.051
 Samantha-0.050
 Lib-0.049
aman-0.049
Token big
Token ID1263
Feature activation+4.59e-01

Pos logit contributions
 ships+0.015
 shark+0.014
 armor+0.014
 tanks+0.013
 armour+0.013
Neg logit contributions
uct-0.013
 Heidi-0.013
 Samantha-0.013
aman-0.012
 Lib-0.012
Token birds
Token ID10087
Feature activation+4.66e-01

Pos logit contributions
 saying+0.011
 Heidi+0.011
 Samantha+0.011
uct+0.011
 right+0.011
Neg logit contributions
 ships-0.018
 shark-0.017
bel-0.017
 armor-0.017
washer-0.017
Token with
Token ID351
Feature activation+6.29e-02

Pos logit contributions
uct+0.012
 Samantha+0.011
 Heidi+0.011
aman+0.011
 Lib+0.011
Neg logit contributions
 ships-0.019
 shark-0.018
 tanks-0.017
washer-0.017
 ship-0.017
Token and
Token ID290
Feature activation-1.22e-01

Pos logit contributions
 ships+0.006
 shark+0.005
 armor+0.005
washer+0.005
 tanks+0.005
Neg logit contributions
uct-0.004
 Samantha-0.004
 Heidi-0.004
aman-0.004
 Lib-0.004
Token run
Token ID1057
Feature activation+7.37e-01

Pos logit contributions
 ships+0.004
 shark+0.004
 armor+0.004
 tanks+0.004
bel+0.004
Neg logit contributions
uct-0.004
 Samantha-0.004
 Heidi-0.004
 saying-0.004
 Lib-0.004
Token to
Token ID284
Feature activation-6.30e-01

Pos logit contributions
uct+0.021
 Heidi+0.021
 Samantha+0.021
 saying+0.021
 Lib+0.020
Neg logit contributions
 ships-0.026
 shark-0.025
 armor-0.024
 tanks-0.024
washer-0.024
Token the
Token ID262
Feature activation-4.61e-01

Pos logit contributions
 ships+0.018
 shark+0.017
 tanks+0.015
 armor+0.015
 ship+0.015
Neg logit contributions
uct-0.025
aman-0.023
 Heidi-0.021
 Lib-0.021
 Samantha-0.021
Token ost
Token ID23619
Feature activation+1.57e-01

Pos logit contributions
 ships+0.014
 shark+0.013
 ship+0.013
 tanks+0.012
 armor+0.012
Neg logit contributions
uct-0.017
 Samantha-0.017
 Heidi-0.017
 Lib-0.016
aman-0.016
Tokenric
Token ID1173
Feature activation-3.54e+00

Pos logit contributions
uct+0.007
 saying+0.007
 Heidi+0.007
 Samantha+0.007
 Lib+0.007
Neg logit contributions
 ships-0.003
 shark-0.003
 armor-0.003
washer-0.003
bel-0.003
Tokenhes
Token ID956
Feature activation-6.04e-01

Pos logit contributions
 ships+0.090
washer+0.083
bel+0.082
 shark+0.082
zo+0.080
Neg logit contributions
 Samantha-0.141
aman-0.138
 Heidi-0.136
 Stella-0.135
 Lib-0.133
Token.
Token ID13
Feature activation-2.79e-01

Pos logit contributions
 ships+0.027
 shark+0.026
bel+0.025
 armor+0.025
 tanks+0.024
Neg logit contributions
uct-0.013
 Heidi-0.012
 Samantha-0.012
aman-0.012
 Lib-0.012
Token\n
Token ID198
Feature activation+7.05e-03

Pos logit contributions
 ships+0.009
 shark+0.009
 tanks+0.008
bel+0.008
 armor+0.008
Neg logit contributions
uct-0.009
 mistake-0.009
 saying-0.009
aman-0.008
 Heidi-0.008
Token\n
Token ID198
Feature activation+1.12e-01

Pos logit contributions
uct+0.000
aman+0.000
 Heidi+0.000
 saying+0.000
 Samantha+0.000
Neg logit contributions
 ships-0.000
 shark-0.000
bel-0.000
 tanks-0.000
washer-0.000
Token"
Token ID1
Feature activation-5.13e-01

Pos logit contributions
uct+0.003
 saying+0.003
 mistake+0.003
 Heidi+0.003
 Samantha+0.003
Neg logit contributions
 ships-0.004
bel-0.004
 shark-0.004
 tanks-0.004
washer-0.004
Token\n
Token ID198
Feature activation-1.55e-01

Pos logit contributions
 ships+0.015
 shark+0.014
bel+0.014
 tanks+0.013
washer+0.013
Neg logit contributions
uct-0.015
 mistake-0.014
 saying-0.014
aman-0.014
 Heidi-0.013
Token\n
Token ID198
Feature activation-1.07e-01

Pos logit contributions
 ships+0.006
 shark+0.005
bel+0.005
washer+0.005
 tanks+0.005
Neg logit contributions
uct-0.004
 Heidi-0.004
aman-0.004
 Lib-0.004
 Samantha-0.004
TokenBut
Token ID1537
Feature activation+9.57e-02

Pos logit contributions
 ships+0.004
bel+0.004
washer+0.004
 shark+0.004
 tanks+0.004
Neg logit contributions
uct-0.003
 Samantha-0.003
 Heidi-0.003
 Lib-0.003
 saying-0.003
Token the
Token ID262
Feature activation+4.10e-01

Pos logit contributions
uct+0.003
 Heidi+0.003
 saying+0.003
 Samantha+0.003
 Lib+0.003
Neg logit contributions
 ships-0.004
 shark-0.003
 armor-0.003
 tanks-0.003
washer-0.003
Token ost
Token ID23619
Feature activation+3.49e-01

Pos logit contributions
uct+0.017
 Heidi+0.016
 Stella+0.015
 Lib+0.015
 Samantha+0.015
Neg logit contributions
 ships-0.011
 shark-0.010
 polar-0.010
bel-0.010
 ship-0.010
Tokenric
Token ID1173
Feature activation-3.50e+00

Pos logit contributions
 Heidi+0.016
uct+0.016
 Samantha+0.016
 Lib+0.016
 saying+0.016
Neg logit contributions
 ships-0.006
 shark-0.005
 armor-0.005
zo-0.005
 tanks-0.005
Tokenhes
Token ID956
Feature activation-1.52e+00

Pos logit contributions
 ships+0.093
washer+0.087
 shark+0.086
bel+0.085
 armor+0.083
Neg logit contributions
 Samantha-0.133
 Heidi-0.130
aman-0.130
 Lib-0.128
 saying-0.126
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 ships+0.045
 shark+0.044
washer+0.042
bel+0.042
zo+0.040
Neg logit contributions
 Heidi-0.052
uct-0.051
 Lib-0.050
 Samantha-0.049
aman-0.049
Token not
Token ID407
Feature activation-6.53e-01

Pos logit contributions
 shark+0.027
bel+0.027
 ships+0.026
 tanks+0.024
 ship+0.023
Neg logit contributions
aman-0.036
uct-0.034
 Samantha-0.034
aren-0.033
 Lib-0.033
Token like
Token ID588
Feature activation+2.85e-01

Pos logit contributions
 ships+0.020
 shark+0.020
zo+0.018
 armor+0.018
bel+0.018
Neg logit contributions
uct-0.022
aman-0.022
 Heidi-0.021
 Samantha-0.021
 Lib-0.021
Token that
Token ID326
Feature activation-1.25e+00

Pos logit contributions
uct+0.008
 Heidi+0.008
 Samantha+0.008
 saying+0.008
 Lib+0.008
Neg logit contributions
 ships-0.010
 shark-0.010
 armor-0.009
 tanks-0.009
bel-0.009
Token and
Token ID290
Feature activation-8.40e-01

Pos logit contributions
 ships+0.035
 shark+0.033
 tanks+0.031
 armor+0.031
bel+0.031
Neg logit contributions
uct-0.026
 Heidi-0.026
 Lib-0.025
 Samantha-0.025
aman-0.024
Token looked
Token ID3114
Feature activation+2.59e-01

Pos logit contributions
 ships+0.026
 shark+0.024
 armor+0.023
 tanks+0.022
washer+0.022
Neg logit contributions
uct-0.028
 Heidi-0.028
 saying-0.027
 Samantha-0.027
 Lib-0.027
Token at
Token ID379
Feature activation-4.97e-01

Pos logit contributions
 Heidi+0.008
uct+0.008
 Lib+0.008
 Samantha+0.008
aman+0.008
Neg logit contributions
 ships-0.009
 shark-0.008
 armor-0.008
 tanks-0.008
 armour-0.007
Token the
Token ID262
Feature activation-2.11e-01

Pos logit contributions
 ships+0.018
 shark+0.017
 armor+0.016
washer+0.016
bel+0.016
Neg logit contributions
uct-0.014
 Heidi-0.014
 Samantha-0.014
 saying-0.014
 Lib-0.014
Token ost
Token ID23619
Feature activation+3.05e-01

Pos logit contributions
 ships+0.006
 pengu+0.005
 shark+0.005
 polar+0.005
bel+0.005
Neg logit contributions
uct-0.009
 Heidi-0.009
 Stella-0.008
 OK-0.008
 Samantha-0.008
Tokenric
Token ID1173
Feature activation-3.47e+00

Pos logit contributions
 Heidi+0.014
uct+0.014
 Samantha+0.014
 Lib+0.014
 saying+0.014
Neg logit contributions
 ships-0.005
 shark-0.005
 armor-0.004
zo-0.004
 tanks-0.004
Tokenhes
Token ID956
Feature activation-8.83e-01

Pos logit contributions
 ships+0.083
washer+0.079
bel+0.076
 shark+0.076
zo+0.073
Neg logit contributions
 Samantha-0.143
aman-0.141
 Stella-0.138
 Heidi-0.138
 Lib-0.136
Token.
Token ID13
Feature activation-8.72e-01

Pos logit contributions
 ships+0.037
 shark+0.035
washer+0.034
 tanks+0.033
bel+0.033
Neg logit contributions
uct-0.022
 Heidi-0.021
 Lib-0.020
aman-0.020
 Samantha-0.020
Token They
Token ID1119
Feature activation-2.97e-01

Pos logit contributions
 ships+0.027
 shark+0.024
 tanks+0.024
 armor+0.022
washer+0.022
Neg logit contributions
uct-0.031
 saying-0.030
 Heidi-0.029
aman-0.029
 Lib-0.029
Token saw
Token ID2497
Feature activation-4.20e-01

Pos logit contributions
 ships+0.009
 shark+0.009
 armor+0.008
 tanks+0.008
zo+0.008
Neg logit contributions
uct-0.010
 Heidi-0.010
 Samantha-0.010
 Lib-0.010
 saying-0.010
Token one
Token ID530
Feature activation-8.60e-01

Pos logit contributions
 ships+0.017
 armor+0.016
 shark+0.016
zo+0.016
washer+0.015
Neg logit contributions
 saying-0.010
 Samantha-0.010
 Heidi-0.010
 Lib-0.010
uct-0.010
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
uct+0.002
 Heidi+0.002
 Samantha+0.002
aman+0.002
 mistake+0.002
Neg logit contributions
 ships-0.003
 shark-0.003
washer-0.003
 tanks-0.003
 armor-0.003
Token They
Token ID1119
Feature activation-3.32e-01

Pos logit contributions
 ships+0.006
 tanks+0.005
 shark+0.005
 armor+0.005
 Lions+0.005
Neg logit contributions
 mistake-0.006
uct-0.006
 saying-0.006
aman-0.006
 Heidi-0.006
Token were
Token ID547
Feature activation-3.04e-01

Pos logit contributions
 ships+0.012
 shark+0.011
 armor+0.011
washer+0.011
bel+0.011
Neg logit contributions
 saying-0.009
 Samantha-0.009
uct-0.009
 Heidi-0.009
 Lib-0.009
Token called
Token ID1444
Feature activation-1.58e-01

Pos logit contributions
 ships+0.011
 shark+0.010
 armor+0.010
 tanks+0.010
zo+0.009
Neg logit contributions
uct-0.009
 Heidi-0.009
 Samantha-0.009
aman-0.009
 Lib-0.008
Token ost
Token ID23619
Feature activation+2.84e-01

Pos logit contributions
 ships+0.006
 shark+0.005
 armor+0.005
 tanks+0.005
 ship+0.005
Neg logit contributions
uct-0.005
aman-0.005
 mistake-0.005
 Heidi-0.005
 Samantha-0.005
Tokenric
Token ID1173
Feature activation-3.43e+00

Pos logit contributions
 Heidi+0.013
uct+0.013
 Samantha+0.013
 Lib+0.013
 saying+0.013
Neg logit contributions
 ships-0.005
 shark-0.004
 armor-0.004
zo-0.004
 tanks-0.004
Tokenhes
Token ID956
Feature activation-1.00e+00

Pos logit contributions
 ships+0.094
 shark+0.084
washer+0.083
zo+0.083
bel+0.082
Neg logit contributions
aman-0.133
 Samantha-0.131
 Heidi-0.128
uct-0.127
 mistake-0.125
Token.
Token ID13
Feature activation-2.67e-01

Pos logit contributions
 ships+0.039
 shark+0.036
 tanks+0.035
zo+0.035
 armor+0.035
Neg logit contributions
uct-0.027
 Heidi-0.026
 Samantha-0.026
 Lib-0.025
aman-0.025
Token Lily
Token ID20037
Feature activation+6.80e-02

Pos logit contributions
 ships+0.008
 tanks+0.007
 Lions+0.007
 shark+0.007
 polar+0.006
Neg logit contributions
 mistake-0.010
uct-0.010
 saying-0.009
aman-0.009
 Heidi-0.008
Token and
Token ID290
Feature activation+2.13e-01

Pos logit contributions
uct+0.002
 Heidi+0.002
 Samantha+0.002
 saying+0.002
 Lib+0.002
Neg logit contributions
 ships-0.003
 shark-0.002
 armor-0.002
zo-0.002
washer-0.002
Token Ben
Token ID3932
Feature activation-9.38e-01

Pos logit contributions
uct+0.006
 mistake+0.005
aman+0.005
 saying+0.005
 OK+0.005
Neg logit contributions
 ships-0.008
 shark-0.008
 tanks-0.008
 armor-0.007
zo-0.007
Token did
Token ID750
Feature activation-5.49e-01

Pos logit contributions
 ships+0.007
 shark+0.006
washer+0.006
 armor+0.006
 tanks+0.006
Neg logit contributions
 Heidi-0.006
uct-0.006
 Samantha-0.006
 Lib-0.006
 saying-0.006
Token not
Token ID407
Feature activation-1.58e-01

Pos logit contributions
 ships+0.018
bel+0.016
 shark+0.016
 ship+0.016
 tanks+0.015
Neg logit contributions
aman-0.017
uct-0.017
 Heidi-0.016
 Samantha-0.016
aren-0.015
Token know
Token ID760
Feature activation-8.83e-02

Pos logit contributions
 ships+0.005
zo+0.005
 shark+0.004
 ship+0.004
 armor+0.004
Neg logit contributions
uct-0.006
aman-0.006
 Heidi-0.005
 Lib-0.005
 Samantha-0.005
Token what
Token ID644
Feature activation-1.78e-01

Pos logit contributions
 ships+0.003
 shark+0.003
 tanks+0.002
washer+0.002
 armor+0.002
Neg logit contributions
uct-0.003
 Heidi-0.003
aman-0.003
 Samantha-0.003
 Lib-0.003
Token ost
Token ID23619
Feature activation+3.48e-01

Pos logit contributions
 ships+0.007
 shark+0.006
 tanks+0.006
washer+0.006
 armor+0.006
Neg logit contributions
uct-0.005
aman-0.005
 Heidi-0.005
 Samantha-0.005
 saying-0.005
Tokenric
Token ID1173
Feature activation-3.39e+00

Pos logit contributions
 Heidi+0.018
 Samantha+0.017
uct+0.017
 Lib+0.017
 saying+0.017
Neg logit contributions
 ships-0.005
zo-0.004
 shark-0.004
 tanks-0.004
washer-0.004
Tokenhes
Token ID956
Feature activation-1.64e+00

Pos logit contributions
 ships+0.093
washer+0.086
 shark+0.084
zo+0.083
bel+0.081
Neg logit contributions
 Samantha-0.129
aman-0.129
 Heidi-0.128
 Lib-0.124
uct-0.123
Token were
Token ID547
Feature activation-1.08e+00

Pos logit contributions
 ships+0.055
 shark+0.053
washer+0.050
 tanks+0.049
 ship+0.047
Neg logit contributions
uct-0.053
 Heidi-0.050
aman-0.049
 Lib-0.048
 Samantha-0.046
Token.
Token ID13
Feature activation-4.11e-01

Pos logit contributions
 ships+0.040
 tanks+0.035
washer+0.035
 ship+0.034
zo+0.034
Neg logit contributions
 Heidi-0.034
uct-0.033
 Samantha-0.031
aman-0.031
 Lib-0.030
Token They
Token ID1119
Feature activation-2.34e-01

Pos logit contributions
 ships+0.012
 tanks+0.011
 shark+0.011
bel+0.010
 armor+0.010
Neg logit contributions
 saying-0.014
 mistake-0.014
uct-0.014
 Heidi-0.013
aman-0.013
Token looked
Token ID3114
Feature activation-1.89e-01

Pos logit contributions
 ships+0.009
 shark+0.008
 armor+0.008
 tanks+0.008
washer+0.008
Neg logit contributions
uct-0.006
 Heidi-0.006
 Samantha-0.006
 saying-0.006
 Lib-0.006
Token.
Token ID13
Feature activation+8.97e-02

Pos logit contributions
aman+0.017
uct+0.017
 Samantha+0.016
 Heidi+0.016
 Lib+0.016
Neg logit contributions
 ships-0.015
 shark-0.015
bel-0.014
washer-0.014
 armor-0.014
Token One
Token ID1881
Feature activation-1.82e-01

Pos logit contributions
uct+0.003
 saying+0.003
 Heidi+0.003
 Samantha+0.003
 mistake+0.003
Neg logit contributions
 ships-0.003
 shark-0.003
 armor-0.003
 tanks-0.003
washer-0.003
Token of
Token ID286
Feature activation-5.49e-02

Pos logit contributions
 ships+0.005
 shark+0.005
washer+0.004
bel+0.004
 tanks+0.004
Neg logit contributions
uct-0.007
 Heidi-0.007
 Samantha-0.007
 Lib-0.006
aman-0.006
Token the
Token ID262
Feature activation+5.97e-01

Pos logit contributions
 ships+0.002
 shark+0.002
 armor+0.002
 tanks+0.002
zo+0.002
Neg logit contributions
uct-0.002
 Heidi-0.001
 Samantha-0.001
 saying-0.001
 Lib-0.001
Token ost
Token ID23619
Feature activation+3.90e-01

Pos logit contributions
uct+0.025
 Heidi+0.024
 Samantha+0.022
 Stella+0.021
aman+0.021
Neg logit contributions
 ships-0.017
 pengu-0.015
 tanks-0.014
 polar-0.014
 ship-0.014
Tokenric
Token ID1173
Feature activation-3.31e+00

Pos logit contributions
uct+0.017
 saying+0.017
 Heidi+0.017
 Samantha+0.017
 Lib+0.017
Neg logit contributions
 ships-0.008
 shark-0.007
 armor-0.007
washer-0.006
bel-0.006
Tokenhes
Token ID956
Feature activation-1.38e+00

Pos logit contributions
 ships+0.078
washer+0.075
bel+0.071
 shark+0.068
zo+0.067
Neg logit contributions
 Samantha-0.139
aman-0.138
 Stella-0.133
 Heidi-0.132
 Lib-0.130
Token looked
Token ID3114
Feature activation+1.43e-01

Pos logit contributions
 ships+0.039
 shark+0.036
bel+0.034
washer+0.034
zo+0.033
Neg logit contributions
 Heidi-0.051
uct-0.051
 Lib-0.049
 Samantha-0.049
aman-0.048
Token different
Token ID1180
Feature activation-6.11e-01

Pos logit contributions
uct+0.004
aman+0.004
 Samantha+0.004
 Heidi+0.004
 Lib+0.004
Neg logit contributions
 ships-0.005
 shark-0.005
 armor-0.005
 tanks-0.004
 coral-0.004
Token from
Token ID422
Feature activation+2.00e-02

Pos logit contributions
 ships+0.020
bel+0.018
 tanks+0.017
 shark+0.017
 armor+0.017
Neg logit contributions
uct-0.021
 Heidi-0.021
 Samantha-0.020
 Lib-0.019
aman-0.019
Token the
Token ID262
Feature activation+1.97e-01

Pos logit contributions
uct+0.001
aman+0.001
 Heidi+0.001
shadow+0.001
 OK+0.001
Neg logit contributions
 ships-0.001
 shark-0.001
 armor-0.001
 armour-0.001
 tanks-0.001
Token\n
Token ID198
Feature activation+3.32e-01

Pos logit contributions
 ships+0.012
bel+0.011
 shark+0.011
 Lions+0.011
 tanks+0.011
Neg logit contributions
uct-0.012
 mistake-0.011
aman-0.011
 saying-0.011
 Lib-0.010
Token\n
Token ID198
Feature activation+4.81e-01

Pos logit contributions
uct+0.011
aman+0.010
 Lib+0.009
 Heidi+0.009
 saying+0.009
Neg logit contributions
 ships-0.010
bel-0.009
 shark-0.009
washer-0.009
zo-0.009
Token"
Token ID1
Feature activation-7.37e-01

Pos logit contributions
uct+0.014
 saying+0.013
 mistake+0.013
 Lib+0.013
 OK+0.013
Neg logit contributions
bel-0.015
 ships-0.014
washer-0.014
 tanks-0.013
zo-0.013
TokenO
Token ID46
Feature activation+4.76e-01

Pos logit contributions
bel+0.024
 ships+0.023
 shark+0.023
zo+0.022
washer+0.021
Neg logit contributions
 mistake-0.021
 Heidi-0.021
uct-0.021
aman-0.021
 saying-0.021
Tokenst
Token ID301
Feature activation-3.85e-01

Pos logit contributions
uct+0.014
 Heidi+0.013
 Samantha+0.013
 saying+0.013
 Stella+0.013
Neg logit contributions
 ships-0.016
zo-0.016
washer-0.016
bel-0.016
 shark-0.015
Tokenric
Token ID1173
Feature activation-3.23e+00

Pos logit contributions
 ships+0.012
 shark+0.011
zo+0.010
washer+0.010
 tanks+0.010
Neg logit contributions
uct-0.014
 Heidi-0.013
 saying-0.013
 Lib-0.013
 Samantha-0.013
Tokenhes
Token ID956
Feature activation-1.79e+00

Pos logit contributions
 ships+0.083
washer+0.080
 shark+0.076
 armor+0.075
bel+0.074
Neg logit contributions
 Samantha-0.129
aman-0.128
 Heidi-0.123
 Lib-0.122
 Stella-0.122
Token do
Token ID466
Feature activation-4.05e-01

Pos logit contributions
 ships+0.059
 shark+0.057
washer+0.057
bel+0.054
 ship+0.053
Neg logit contributions
 Heidi-0.055
uct-0.055
 Lib-0.054
 Samantha-0.053
aman-0.052
Token not
Token ID407
Feature activation-1.53e-01

Pos logit contributions
 ships+0.014
zo+0.013
 shark+0.013
 tanks+0.013
washer+0.013
Neg logit contributions
 Heidi-0.012
 Samantha-0.012
 Lib-0.012
aman-0.012
uct-0.011
Token like
Token ID588
Feature activation+2.62e-02

Pos logit contributions
 ships+0.004
zo+0.004
 shark+0.004
 ship+0.004
 armor+0.004
Neg logit contributions
uct-0.006
 Heidi-0.005
 Lib-0.005
aman-0.005
 Samantha-0.005
Token people
Token ID661
Feature activation-1.74e-01

Pos logit contributions
uct+0.001
 Heidi+0.001
 Lib+0.001
 Samantha+0.001
aman+0.001
Neg logit contributions
 ships-0.001
 shark-0.001
 tanks-0.001
 armor-0.001
washer-0.001
TokenYes
Token ID5297
Feature activation+9.03e-01

Pos logit contributions
uct+0.002
 Heidi+0.002
 saying+0.001
 Samantha+0.001
 Lib+0.001
Neg logit contributions
 ships-0.003
 shark-0.003
 armor-0.003
washer-0.003
 tanks-0.003
Token,
Token ID11
Feature activation+1.14e-01

Pos logit contributions
uct+0.029
 Heidi+0.028
 saying+0.028
 Samantha+0.027
 Lib+0.027
Neg logit contributions
 ships-0.030
 shark-0.028
 armor-0.026
 tanks-0.026
 ship-0.025
Token they
Token ID484
Feature activation+4.93e-01

Pos logit contributions
uct+0.003
 Heidi+0.003
 saying+0.003
 Samantha+0.003
 Lib+0.003
Neg logit contributions
 ships-0.005
 shark-0.004
 armor-0.004
zo-0.004
 tanks-0.004
Token are
Token ID389
Feature activation-2.24e-01

Pos logit contributions
uct+0.012
 saying+0.012
 right+0.012
 Heidi+0.012
 Samantha+0.012
Neg logit contributions
 ships-0.019
 shark-0.018
 armor-0.018
bel-0.018
 tanks-0.017
Token ost
Token ID23619
Feature activation+3.57e-01

Pos logit contributions
 ships+0.009
 ship+0.007
 shark+0.007
 tanks+0.007
 armor+0.007
Neg logit contributions
uct-0.007
aman-0.007
 Heidi-0.006
 Samantha-0.006
 mistake-0.006
Tokenric
Token ID1173
Feature activation-3.16e+00

Pos logit contributions
 Heidi+0.016
uct+0.016
 Samantha+0.016
 saying+0.016
 Lib+0.016
Neg logit contributions
 ships-0.007
 shark-0.006
 armor-0.006
zo-0.006
 tanks-0.005
Tokenhes
Token ID956
Feature activation-9.54e-01

Pos logit contributions
 ships+0.077
washer+0.068
zo+0.067
bel+0.065
 shark+0.064
Neg logit contributions
aman-0.139
 Samantha-0.134
 Heidi-0.127
 mistake-0.126
 Stella-0.126
Token,"
Token ID553
Feature activation-4.55e-01

Pos logit contributions
 ships+0.037
 shark+0.035
zo+0.034
washer+0.033
 tanks+0.033
Neg logit contributions
uct-0.026
 Heidi-0.025
 Samantha-0.024
 Lib-0.024
aman-0.024
Token mom
Token ID1995
Feature activation-7.92e-01

Pos logit contributions
 ships+0.019
 shark+0.018
 armor+0.018
 tanks+0.018
washer+0.018
Neg logit contributions
uct-0.010
 Heidi-0.010
 saying-0.010
 Samantha-0.010
 Lib-0.010
Token said
Token ID531
Feature activation-1.15e-01

Pos logit contributions
 ships+0.037
washer+0.035
 shark+0.034
zo+0.034
 ship+0.033
Neg logit contributions
 Heidi-0.015
uct-0.014
 Samantha-0.014
 mistake-0.014
aman-0.014
Token.
Token ID13
Feature activation+7.90e-02

Pos logit contributions
 ships+0.005
 shark+0.004
bel+0.004
washer+0.004
 tanks+0.004
Neg logit contributions
uct-0.003
aman-0.003
 Heidi-0.003
 Samantha-0.002
 mistake-0.002
Tokenie
Token ID494
Feature activation-9.98e-04

Pos logit contributions
 ships+0.003
 shark+0.003
 armor+0.003
 tanks+0.002
washer+0.002
Neg logit contributions
uct-0.003
 Heidi-0.003
 saying-0.003
 Samantha-0.003
 Lib-0.003
Token.
Token ID13
Feature activation-4.45e-01

Pos logit contributions
 ships+0.000
 shark+0.000
 armor+0.000
zo+0.000
 tanks+0.000
Neg logit contributions
 Heidi-0.000
uct-0.000
 Samantha-0.000
 Lib-0.000
 saying-0.000
Token Unlike
Token ID12101
Feature activation+5.01e-01

Pos logit contributions
 ships+0.015
bel+0.015
 armor+0.015
 shark+0.015
washer+0.015
Neg logit contributions
 Lib-0.013
 Samantha-0.013
 right-0.012
uct-0.012
 Heidi-0.012
Token other
Token ID584
Feature activation-5.51e-01

Pos logit contributions
uct+0.015
 Heidi+0.015
 Samantha+0.015
aman+0.014
 Lib+0.014
Neg logit contributions
 ships-0.018
 shark-0.017
 armor-0.016
 tanks-0.016
zo-0.016
Token ost
Token ID23619
Feature activation+6.44e-01

Pos logit contributions
 ships+0.017
 shark+0.015
 armor+0.015
 tanks+0.015
zo+0.015
Neg logit contributions
uct-0.019
 Heidi-0.019
 Samantha-0.018
 saying-0.017
 Lib-0.017
Tokenric
Token ID1173
Feature activation-3.15e+00

Pos logit contributions
 Heidi+0.030
uct+0.030
 Samantha+0.029
 saying+0.029
 Lib+0.029
Neg logit contributions
 ships-0.012
 shark-0.010
 armor-0.010
zo-0.009
 tanks-0.009
Tokenhes
Token ID956
Feature activation-1.42e+00

Pos logit contributions
 ships+0.083
washer+0.077
zo+0.075
 armor+0.075
 shark+0.074
Neg logit contributions
 Samantha-0.124
aman-0.123
 Heidi-0.119
 mistake-0.116
 Stella-0.116
Token,
Token ID11
Feature activation-3.06e-01

Pos logit contributions
 ships+0.053
 shark+0.050
 armor+0.049
bel+0.048
 tanks+0.048
Neg logit contributions
uct-0.037
 Heidi-0.037
 saying-0.036
 Samantha-0.036
 Lib-0.036
Token his
Token ID465
Feature activation+1.47e-01

Pos logit contributions
 ships+0.011
 shark+0.010
 armor+0.009
 tanks+0.009
bel+0.009
Neg logit contributions
uct-0.010
 Heidi-0.009
aman-0.009
 saying-0.009
 Samantha-0.009
Token feathers
Token ID27400
Feature activation+4.11e-01

Pos logit contributions
uct+0.005
 Heidi+0.005
 Samantha+0.005
 Lib+0.005
 saying+0.004
Neg logit contributions
 ships-0.005
 shark-0.004
 armor-0.004
zo-0.004
bel-0.004
Token gl
Token ID1278
Feature activation-9.51e-01

Pos logit contributions
uct+0.012
 Heidi+0.011
 Samantha+0.011
 Lib+0.011
 saying+0.011
Neg logit contributions
 ships-0.015
 shark-0.014
 armor-0.014
zo-0.013
washer-0.013
Token man
Token ID582
Feature activation+7.10e-02

Pos logit contributions
 shark+0.014
 ships+0.014
washer+0.013
 ship+0.013
 armor+0.013
Neg logit contributions
uct-0.015
 OK-0.014
 okay-0.013
aman-0.013
 Samantha-0.013
Token's
Token ID338
Feature activation-4.45e-01

Pos logit contributions
 Heidi+0.002
 Samantha+0.002
uct+0.002
aman+0.002
 Stella+0.002
Neg logit contributions
washer-0.002
 ships-0.002
 tank-0.002
 ship-0.002
 tanks-0.002
Token plate
Token ID7480
Feature activation+4.39e-01

Pos logit contributions
 ships+0.013
 shark+0.013
 armor+0.012
washer+0.012
 tanks+0.012
Neg logit contributions
uct-0.016
 Samantha-0.016
 Lib-0.015
 Heidi-0.015
aman-0.015
Token.
Token ID13
Feature activation+2.10e-01

Pos logit contributions
 Samantha+0.011
uct+0.010
 Heidi+0.010
 Lib+0.010
aman+0.010
Neg logit contributions
washer-0.018
 shark-0.017
 ships-0.017
 armor-0.017
 ship-0.017
Token All
Token ID1439
Feature activation+2.83e-01

Pos logit contributions
uct+0.007
 Heidi+0.007
 Samantha+0.007
 saying+0.007
 Lib+0.007
Neg logit contributions
 ships-0.007
 shark-0.006
 armor-0.006
bel-0.006
zo-0.006
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
 right+0.004
 saying+0.004
 Samantha+0.004
 Sue+0.003
 said+0.003
Neg logit contributions
 armor-0.014
 armour-0.014
 rook-0.013
 crew-0.013
bel-0.013
Token food
Token ID2057
Feature activation+6.40e-01

Pos logit contributions
uct+0.023
 Heidi+0.022
 Samantha+0.022
 saying+0.022
 Lib+0.022
Neg logit contributions
 ships-0.044
 shark-0.042
 armor-0.040
 tanks-0.040
washer-0.040
Token went
Token ID1816
Feature activation+6.14e-01

Pos logit contributions
uct+0.019
 Heidi+0.019
 Samantha+0.018
 saying+0.018
 Lib+0.018
Neg logit contributions
 ships-0.022
 shark-0.021
 armor-0.020
washer-0.020
 tanks-0.020
Token everywhere
Token ID8347
Feature activation+4.50e-01

Pos logit contributions
uct+0.019
 Heidi+0.018
 Samantha+0.018
 OK+0.017
 Lib+0.017
Neg logit contributions
 ships-0.021
 shark-0.020
 tanks-0.020
 armor-0.019
washer-0.019
Token!
Token ID0
Feature activation+5.19e-01

Pos logit contributions
uct+0.014
 Heidi+0.013
aman+0.013
 Samantha+0.013
 Lib+0.013
Neg logit contributions
 ships-0.016
 shark-0.016
washer-0.015
 tanks-0.014
 ship-0.014
Token The
Token ID383
Feature activation+6.82e-01

Pos logit contributions
uct+0.018
 saying+0.018
 Heidi+0.018
 mistake+0.018
 Samantha+0.018
Neg logit contributions
 ships-0.015
 shark-0.014
 tanks-0.013
 armor-0.013
washer-0.013
Token a
Token ID257
Feature activation-1.24e-01

Pos logit contributions
 ships+0.002
 shark+0.002
 armor+0.002
 tanks+0.002
bel+0.001
Neg logit contributions
uct-0.001
 Heidi-0.001
aman-0.001
 Lib-0.001
 OK-0.001
Token mysterious
Token ID11428
Feature activation-1.41e-02

Pos logit contributions
 ships+0.004
 shark+0.004
 armor+0.004
washer+0.003
 tanks+0.003
Neg logit contributions
uct-0.004
 Heidi-0.004
 Samantha-0.004
 Lib-0.004
 saying-0.004
Token box
Token ID3091
Feature activation-1.38e-01

Pos logit contributions
 ships+0.000
 shark+0.000
 armor+0.000
 ship+0.000
 tanks+0.000
Neg logit contributions
uct-0.001
 Heidi-0.001
 Samantha-0.000
 Lib-0.000
 OK-0.000
Token in
Token ID287
Feature activation+4.15e-01

Pos logit contributions
 ships+0.005
washer+0.005
 ship+0.005
 shark+0.005
bel+0.004
Neg logit contributions
 Heidi-0.004
 Samantha-0.004
uct-0.004
aman-0.004
 OK-0.004
Token the
Token ID262
Feature activation+9.41e-01

Pos logit contributions
uct+0.016
 OK+0.015
 okay+0.015
 Heidi+0.014
aman+0.014
Neg logit contributions
 ships-0.012
 shark-0.011
 tanks-0.010
 armor-0.010
 coral-0.010
Token bushes
Token ID37413
Feature activation+1.25e+00

Pos logit contributions
uct+0.033
 Samantha+0.031
 Heidi+0.031
 Lib+0.031
 saying+0.031
Neg logit contributions
 ships-0.030
 shark-0.028
 armor-0.027
washer-0.026
 tanks-0.026
Token.
Token ID13
Feature activation+4.34e-01

Pos logit contributions
 Heidi+0.035
uct+0.035
 Samantha+0.035
aman+0.034
 mistake+0.032
Neg logit contributions
 ships-0.049
washer-0.045
 ship-0.044
 shark-0.043
 tanks-0.043
Token
Token ID220
Feature activation-2.34e-01

Pos logit contributions
 mistake+0.015
 saying+0.014
uct+0.013
aman+0.013
 possible+0.013
Neg logit contributions
 ships-0.014
 tanks-0.012
 shark-0.012
washer-0.012
 pawn-0.011
Token
Token ID198
Feature activation+2.53e-01

Pos logit contributions
 ships+0.007
washer+0.006
bel+0.006
os+0.006
 tanks+0.006
Neg logit contributions
 mistake-0.008
uct-0.008
 saying-0.008
aman-0.008
 Lib-0.007
Token
Token ID198
Feature activation+3.52e-01

Pos logit contributions
 mistake+0.008
 saying+0.008
uct+0.008
 OK+0.007
aman+0.007
Neg logit contributions
 ships-0.008
bel-0.007
washer-0.007
zo-0.007
 tanks-0.007
TokenL
Token ID43
Feature activation-1.77e-02

Pos logit contributions
 saying+0.010
 mistake+0.010
 Heidi+0.009
uct+0.009
 Lib+0.009
Neg logit contributions
 ships-0.012
bel-0.012
washer-0.012
zo-0.011
 shark-0.011
Token the
Token ID262
Feature activation+1.09e+00

Pos logit contributions
 ships+0.021
zo+0.021
 shark+0.020
washer+0.020
 armor+0.019
Neg logit contributions
 Heidi-0.019
aman-0.019
 Samantha-0.019
 saying-0.019
 Lib-0.019
Token proud
Token ID6613
Feature activation-7.49e-01

Pos logit contributions
uct+0.039
 Heidi+0.037
 Lib+0.037
 Samantha+0.037
aman+0.034
Neg logit contributions
 ships-0.035
 shark-0.033
 armor-0.031
 tanks-0.030
 ship-0.030
Token cub
Token ID13617
Feature activation-9.69e-02

Pos logit contributions
 ships+0.025
 shark+0.025
washer+0.023
 ship+0.023
 armor+0.022
Neg logit contributions
uct-0.025
 Heidi-0.023
 Lib-0.023
 Samantha-0.022
aman-0.022
Token at
Token ID379
Feature activation+6.60e-01

Pos logit contributions
 ships+0.003
 shark+0.003
washer+0.003
zo+0.003
 tanks+0.003
Neg logit contributions
uct-0.003
 Heidi-0.003
 Samantha-0.003
aman-0.003
 Lib-0.003
Token the
Token ID262
Feature activation-1.56e-02

Pos logit contributions
uct+0.023
 Heidi+0.022
 Lib+0.021
 Samantha+0.021
aman+0.021
Neg logit contributions
 ships-0.021
 shark-0.020
 armor-0.018
 tanks-0.018
 ship-0.018
Token university
Token ID6403
Feature activation+1.10e+00

Pos logit contributions
 ships+0.001
 shark+0.000
 armor+0.000
 ship+0.000
 tanks+0.000
Neg logit contributions
uct-0.001
 Samantha-0.001
 Heidi-0.001
 Lib-0.000
 saying-0.000
Token and
Token ID290
Feature activation+2.73e-01

Pos logit contributions
 Heidi+0.029
uct+0.029
 Samantha+0.028
aman+0.027
 OK+0.027
Neg logit contributions
 ships-0.043
washer-0.040
bel-0.039
 ship-0.038
 tanks-0.038
Token went
Token ID1816
Feature activation+1.18e-01

Pos logit contributions
uct+0.008
 Heidi+0.008
 Samantha+0.008
aman+0.008
 Lib+0.008
Neg logit contributions
 ships-0.009
 shark-0.009
 armor-0.008
bel-0.008
washer-0.008
Token home
Token ID1363
Feature activation+5.10e-01

Pos logit contributions
uct+0.004
 Heidi+0.004
 Samantha+0.004
aman+0.004
 Lib+0.004
Neg logit contributions
 ships-0.004
 armor-0.003
 shark-0.003
washer-0.003
bel-0.003
Token.
Token ID13
Feature activation+2.46e-01

Pos logit contributions
uct+0.015
 Heidi+0.014
aman+0.014
 Samantha+0.013
 Lib+0.013
Neg logit contributions
bel-0.019
 ships-0.019
washer-0.018
 shark-0.017
 ship-0.017
Token She
Token ID1375
Feature activation-1.52e-01

Pos logit contributions
 mistake+0.008
 saying+0.007
uct+0.007
aman+0.007
 right+0.007
Neg logit contributions
 ships-0.008
 shark-0.008
 tanks-0.007
 armor-0.007
zo-0.007
Token m
Token ID285
Feature activation-1.28e-01

Pos logit contributions
uct+0.010
 Heidi+0.010
 Samantha+0.010
 Lib+0.010
 saying+0.009
Neg logit contributions
 ships-0.010
 shark-0.010
 armor-0.009
washer-0.009
 tanks-0.009
Tokenop
Token ID404
Feature activation+1.34e-01

Pos logit contributions
 ships+0.003
 shark+0.003
 armor+0.003
washer+0.002
 tanks+0.002
Neg logit contributions
uct-0.005
 Heidi-0.005
 Samantha-0.005
 saying-0.005
 Lib-0.005
Token.
Token ID13
Feature activation+6.77e-01

Pos logit contributions
 Heidi+0.003
uct+0.003
 Samantha+0.003
 Lib+0.003
aman+0.003
Neg logit contributions
 ships-0.006
washer-0.006
 shark-0.006
 ship-0.005
 tanks-0.005
Token �
Token ID6184
Feature activation+1.57e+00

Pos logit contributions
 mistake+0.026
uct+0.025
aman+0.024
 saying+0.023
ige+0.022
Neg logit contributions
 ships-0.018
 tanks-0.016
 shark-0.016
 armor-0.016
washer-0.015
Token�
Token ID95
Feature activation+9.07e-01

Pos logit contributions
 saying+0.050
 Heidi+0.049
uct+0.049
 Samantha+0.049
 Lib+0.048
Neg logit contributions
 ships-0.049
 shark-0.046
 armor-0.045
washer-0.045
zo-0.045
Token€
Token ID26391
Feature activation+1.34e+00

Pos logit contributions
 Samantha+0.029
 Lib+0.029
 Heidi+0.029
 saying+0.029
 Stella+0.028
Neg logit contributions
washer-0.025
zo-0.025
 ships-0.025
 armor-0.024
 shark-0.023
Token�
Token ID129
Feature activation+1.66e-01

Pos logit contributions
uct+0.034
 Heidi+0.033
 saying+0.033
 Samantha+0.032
 Lib+0.032
Neg logit contributions
 ships-0.052
 shark-0.049
bel-0.048
washer-0.048
 armor-0.048
Token�
Token ID241
Feature activation+1.05e-01

Pos logit contributions
 Samantha+0.004
 Heidi+0.004
 OK+0.004
 Lib+0.004
uct+0.004
Neg logit contributions
 ships-0.006
zo-0.006
 armor-0.006
 shark-0.005
 armour-0.005
TokenHello
Token ID15496
Feature activation+9.50e-01

Pos logit contributions
 mistake+0.004
uct+0.003
aman+0.003
 saying+0.003
ige+0.003
Neg logit contributions
bel-0.003
 shark-0.003
zo-0.003
 ships-0.003
 polar-0.003
Token!
Token ID0
Feature activation+2.55e-01

Pos logit contributions
uct+0.033
 Heidi+0.030
 mistake+0.030
 saying+0.030
 herself+0.029
Neg logit contributions
 shark-0.028
 ships-0.028
 armor-0.026
washer-0.025
 tanks-0.025
Tokenâ
Token ID22940
Feature activation+6.55e-01

Pos logit contributions
 mistake+0.009
aman+0.009
uct+0.008
 saying+0.008
ige+0.008
Neg logit contributions
 shark-0.007
 ships-0.007
 polar-0.006
 tanks-0.006
 Lions-0.006
Token the
Token ID262
Feature activation-4.69e-02

Pos logit contributions
 Heidi+0.036
uct+0.034
 Lib+0.034
aman+0.033
 Samantha+0.033
Neg logit contributions
 ships-0.028
 shark-0.026
 tanks-0.025
washer-0.025
 armor-0.024
Token guitar
Token ID10047
Feature activation+3.89e-02

Pos logit contributions
 ships+0.001
 shark+0.001
 armor+0.001
washer+0.001
 tanks+0.001
Neg logit contributions
uct-0.002
 Samantha-0.002
 Heidi-0.002
 Lib-0.002
 saying-0.002
Token over
Token ID625
Feature activation+1.48e-01

Pos logit contributions
 Samantha+0.001
 Heidi+0.001
uct+0.001
 Lib+0.001
aman+0.001
Neg logit contributions
 ships-0.001
 shark-0.001
washer-0.001
 armor-0.001
bel-0.001
Token and
Token ID290
Feature activation+3.70e-01

Pos logit contributions
uct+0.005
 Heidi+0.004
 Lib+0.004
aman+0.004
 Samantha+0.004
Neg logit contributions
 ships-0.005
 shark-0.005
 armor-0.005
 tanks-0.005
 ship-0.005
Token it
Token ID340
Feature activation+9.86e-01

Pos logit contributions
uct+0.010
 Heidi+0.010
 Lib+0.010
 Samantha+0.010
aman+0.010
Neg logit contributions
 ships-0.014
 shark-0.013
 armor-0.013
washer-0.013
 tanks-0.012
Token broke
Token ID6265
Feature activation+1.67e+00

Pos logit contributions
uct+0.026
 saying+0.025
 Heidi+0.025
 Samantha+0.025
 Lib+0.025
Neg logit contributions
 ships-0.037
 shark-0.035
 armor-0.035
 tanks-0.034
zo-0.033
Token again
Token ID757
Feature activation+1.72e+00

Pos logit contributions
 Heidi+0.047
uct+0.044
 Samantha+0.042
 Lib+0.042
aman+0.041
Neg logit contributions
 ships-0.067
 shark-0.062
 tanks-0.060
 armor-0.060
washer-0.059
Token.
Token ID13
Feature activation+6.50e-01

Pos logit contributions
 Heidi+0.056
 Lib+0.052
uct+0.051
 Samantha+0.050
aman+0.050
Neg logit contributions
 ships-0.057
 tanks-0.055
bel-0.054
washer-0.052
 ship-0.051
Token Tim
Token ID5045
Feature activation+9.26e-01

Pos logit contributions
 Heidi+0.022
 saying+0.022
 mistake+0.022
aman+0.021
uct+0.021
Neg logit contributions
 ships-0.019
 tanks-0.017
 shark-0.017
washer-0.016
 armor-0.016
Tokenmy
Token ID1820
Feature activation+1.52e-01

Pos logit contributions
uct+0.016
 saying+0.016
 mistake+0.016
 right+0.015
 Samantha+0.015
Neg logit contributions
 ships-0.044
 shark-0.042
 armor-0.041
bel-0.041
 armour-0.040
Token was
Token ID373
Feature activation+3.77e-01

Pos logit contributions
 saying+0.003
 mistake+0.003
uct+0.003
 right+0.003
 Heidi+0.003
Neg logit contributions
 ships-0.007
 armor-0.007
bel-0.006
 armour-0.006
 shark-0.006
Token had
Token ID550
Feature activation+9.64e-02

Pos logit contributions
 ships+0.002
 shark+0.002
 armor+0.002
 tanks+0.002
bel+0.002
Neg logit contributions
uct-0.002
 saying-0.002
 Heidi-0.002
 Samantha-0.002
 Lib-0.002
Token taken
Token ID2077
Feature activation+4.92e-01

Pos logit contributions
aman+0.003
 Lib+0.003
 Samantha+0.003
 Heidi+0.003
uct+0.003
Neg logit contributions
 ships-0.003
 armor-0.003
 shark-0.003
zo-0.003
 tanks-0.003
Token the
Token ID262
Feature activation-3.16e-01

Pos logit contributions
uct+0.016
 Heidi+0.015
 saying+0.015
 Lib+0.015
 Samantha+0.015
Neg logit contributions
 ships-0.016
 shark-0.015
 armor-0.014
 tanks-0.014
zo-0.014
Token invitation
Token ID17023
Feature activation+7.15e-01

Pos logit contributions
 shark+0.011
 ships+0.011
 armor+0.010
 ship+0.010
washer+0.009
Neg logit contributions
uct-0.010
 Samantha-0.010
 Heidi-0.010
 Lib-0.010
aman-0.010
Token.
Token ID13
Feature activation+7.07e-01

Pos logit contributions
uct+0.022
 Samantha+0.022
 Heidi+0.022
 Lib+0.021
aman+0.021
Neg logit contributions
 ships-0.024
 armor-0.022
washer-0.022
 shark-0.022
 ship-0.022
Token<|endoftext|>
Token ID50256
Feature activation-6.15e-01

Pos logit contributions
uct+0.020
 Samantha+0.020
aman+0.020
 saying+0.020
 mistake+0.020
Neg logit contributions
 ships-0.024
 shark-0.023
 tanks-0.023
 armor-0.023
zo-0.022
TokenOnce
Token ID7454
Feature activation+4.90e-01

Pos logit contributions
 ships+0.020
 shark+0.020
washer+0.019
 tanks+0.019
 armor+0.018
Neg logit contributions
 mistake-0.019
uct-0.019
aman-0.018
 saying-0.018
 Samantha-0.017
Token there
Token ID612
Feature activation+2.99e-01

Pos logit contributions
uct+0.012
 Heidi+0.012
 saying+0.012
 Samantha+0.012
 Lib+0.012
Neg logit contributions
 ships-0.019
 shark-0.018
washer-0.018
 armor-0.018
 tanks-0.017
Token was
Token ID373
Feature activation+5.04e-01

Pos logit contributions
uct+0.009
 saying+0.008
 Heidi+0.008
 Samantha+0.008
 Lib+0.008
Neg logit contributions
 ships-0.011
 shark-0.010
washer-0.010
 armor-0.009
 tanks-0.009
Token a
Token ID257
Feature activation-4.65e-01

Pos logit contributions
 Heidi+0.011
 Samantha+0.011
 Lib+0.010
 saying+0.010
 right+0.010
Neg logit contributions
 ships-0.022
 shark-0.020
bel-0.020
washer-0.020
 armor-0.020
Token lady
Token ID10846
Feature activation-2.72e-01

Pos logit contributions
 ships+0.018
 armor+0.017
 shark+0.017
bel+0.016
 tanks+0.016
Neg logit contributions
 Heidi-0.011
 Samantha-0.011
 saying-0.011
uct-0.011
 Lib-0.011
Token put
Token ID1234
Feature activation+2.90e-01

Pos logit contributions
uct+0.001
 Heidi+0.001
 Samantha+0.001
aman+0.001
 saying+0.001
Neg logit contributions
 ships-0.001
 shark-0.001
 armor-0.001
bel-0.001
washer-0.001
Token it
Token ID340
Feature activation-2.34e-01

Pos logit contributions
aman+0.009
uct+0.009
 mistake+0.009
 Heidi+0.008
 Samantha+0.008
Neg logit contributions
 armor-0.010
 ships-0.010
 armour-0.010
 shark-0.009
 sword-0.009
Token on
Token ID319
Feature activation-2.10e-01

Pos logit contributions
 ships+0.008
 armor+0.007
bel+0.007
 armour+0.007
 shark+0.007
Neg logit contributions
 Samantha-0.008
 Heidi-0.008
aman-0.008
uct-0.007
 Lib-0.007
Token his
Token ID465
Feature activation-1.10e-01

Pos logit contributions
 ships+0.006
 armor+0.006
 shark+0.006
 armour+0.006
 tanks+0.006
Neg logit contributions
uct-0.007
 Samantha-0.007
 Heidi-0.007
aman-0.007
 Lib-0.007
Token wall
Token ID3355
Feature activation+5.38e-01

Pos logit contributions
 ships+0.004
 shark+0.004
 armor+0.004
bel+0.004
zo+0.004
Neg logit contributions
uct-0.003
 saying-0.003
 Heidi-0.003
 Samantha-0.003
 Lib-0.003
Token.
Token ID13
Feature activation+2.73e-01

Pos logit contributions
 Heidi+0.010
 Samantha+0.010
uct+0.010
 Lib+0.010
 saying+0.010
Neg logit contributions
 ships-0.025
 armor-0.023
 shark-0.023
bel-0.023
washer-0.023
Token
Token ID198
Feature activation-1.76e-02

Pos logit contributions
uct+0.010
 saying+0.010
 Heidi+0.010
 Samantha+0.010
 mistake+0.010
Neg logit contributions
 ships-0.008
 shark-0.007
 armor-0.007
 tanks-0.007
washer-0.007
Token
Token ID198
Feature activation+2.87e-02

Pos logit contributions
 ships+0.001
 shark+0.001
 armor+0.001
zo+0.001
washer+0.001
Neg logit contributions
uct-0.000
 Samantha-0.000
 saying-0.000
 Heidi-0.000
aman-0.000
TokenBut
Token ID1537
Feature activation+3.77e-01

Pos logit contributions
uct+0.001
 Heidi+0.001
 saying+0.001
 Samantha+0.001
 Lib+0.001
Neg logit contributions
 ships-0.001
 shark-0.001
 armor-0.001
 tanks-0.001
washer-0.001
Token then
Token ID788
Feature activation+1.27e-02

Pos logit contributions
uct+0.011
 Heidi+0.011
 Samantha+0.011
 saying+0.011
 Lib+0.011
Neg logit contributions
 ships-0.013
 shark-0.012
 armor-0.012
 tanks-0.011
washer-0.011
Token the
Token ID262
Feature activation+8.03e-01

Pos logit contributions
 Heidi+0.000
 saying+0.000
 Samantha+0.000
 Lib+0.000
 right+0.000
Neg logit contributions
 ships-0.001
bel-0.000
washer-0.000
 shark-0.000
zo-0.000
Token,
Token ID11
Feature activation-1.84e-01

Pos logit contributions
 Heidi+0.003
uct+0.003
 saying+0.003
 Samantha+0.003
 Lib+0.002
Neg logit contributions
 ships-0.004
 shark-0.003
 armor-0.003
zo-0.003
bel-0.003
Token "
Token ID366
Feature activation-1.57e-01

Pos logit contributions
 ships+0.005
 shark+0.005
 armor+0.005
washer+0.005
 tanks+0.005
Neg logit contributions
uct-0.007
aman-0.006
 mistake-0.006
 Samantha-0.006
 Heidi-0.006
TokenWhere
Token ID8496
Feature activation+1.11e+00

Pos logit contributions
 ships+0.005
 shark+0.005
 armor+0.005
bel+0.005
washer+0.005
Neg logit contributions
 Samantha-0.005
 saying-0.005
 Lib-0.005
 Heidi-0.005
uct-0.005
Token is
Token ID318
Feature activation-5.81e-01

Pos logit contributions
 Heidi+0.024
 right+0.024
 OK+0.024
uct+0.024
 saying+0.023
Neg logit contributions
 ships-0.047
 armor-0.044
 shark-0.044
bel-0.043
 armour-0.043
Token my
Token ID616
Feature activation+1.14e+00

Pos logit contributions
 ships+0.024
bel+0.023
zo+0.023
 shark+0.023
 armor+0.023
Neg logit contributions
 saying-0.013
 Samantha-0.013
 Heidi-0.013
 Lib-0.012
uct-0.012
Token car
Token ID1097
Feature activation+1.73e-02

Pos logit contributions
uct+0.043
 Heidi+0.042
 Samantha+0.042
 Lib+0.041
 saying+0.041
Neg logit contributions
 ships-0.032
 shark-0.030
 armor-0.029
 tanks-0.028
washer-0.027
Token?
Token ID30
Feature activation-4.43e-01

Pos logit contributions
 saying+0.000
 right+0.000
uct+0.000
 OK+0.000
 Heidi+0.000
Neg logit contributions
 ships-0.001
 shark-0.001
 armor-0.001
zo-0.001
 tanks-0.001
Token I
Token ID314
Feature activation+1.00e+00

Pos logit contributions
 ships+0.017
 shark+0.016
 armor+0.016
zo+0.015
 tanks+0.015
Neg logit contributions
uct-0.011
 Heidi-0.011
 saying-0.011
 Samantha-0.011
 Lib-0.011
Token want
Token ID765
Feature activation+4.01e-01

Pos logit contributions
 saying+0.024
 right+0.024
 Samantha+0.023
 OK+0.023
uct+0.023
Neg logit contributions
 ships-0.039
 shark-0.037
 armor-0.037
 tanks-0.036
 armour-0.036
Token to
Token ID284
Feature activation-1.27e-01

Pos logit contributions
uct+0.011
 Heidi+0.011
 saying+0.011
 Samantha+0.011
 Lib+0.011
Neg logit contributions
 ships-0.015
 shark-0.014
 armor-0.014
bel-0.013
washer-0.013
Token play
Token ID711
Feature activation-3.68e-01

Pos logit contributions
 ships+0.004
 shark+0.004
washer+0.004
 armor+0.004
 tanks+0.004
Neg logit contributions
 saying-0.004
 right-0.004
 Samantha-0.004
 Lib-0.004
uct-0.004
Token fl
Token ID781
Feature activation-2.58e-01

Pos logit contributions
 saying+0.007
uct+0.006
 right+0.006
 mistake+0.006
 Lib+0.006
Neg logit contributions
 ships-0.012
 shark-0.012
 armor-0.012
 pawn-0.012
 armour-0.012
Tokenapped
Token ID6320
Feature activation-8.66e-01

Pos logit contributions
 ships+0.008
 shark+0.008
 tanks+0.008
 armor+0.008
 pawn+0.007
Neg logit contributions
uct-0.008
 Samantha-0.008
 Lib-0.008
 OK-0.008
 saying-0.008
Token its
Token ID663
Feature activation-8.09e-01

Pos logit contributions
 ships+0.040
washer+0.040
 shark+0.039
 tanks+0.038
zo+0.038
Neg logit contributions
 right-0.015
 saying-0.015
 herself-0.014
 OK-0.013
 mistake-0.013
Token wings
Token ID12098
Feature activation-3.50e-01

Pos logit contributions
 ships+0.023
 shark+0.022
washer+0.021
 armor+0.020
 tanks+0.020
Neg logit contributions
 saying-0.028
 right-0.028
uct-0.028
 Heidi-0.028
 OK-0.027
Token and
Token ID290
Feature activation-2.83e-01

Pos logit contributions
 ships+0.015
 shark+0.015
 armor+0.014
washer+0.014
 tanks+0.014
Neg logit contributions
 saying-0.007
 right-0.007
uct-0.007
 Heidi-0.007
 Lib-0.007
Token flew
Token ID13112
Feature activation+3.15e-01

Pos logit contributions
 ships+0.010
 shark+0.009
 armor+0.009
 tanks+0.009
washer+0.009
Neg logit contributions
uct-0.008
 Heidi-0.008
 Samantha-0.008
 saying-0.008
 Lib-0.008
Token all
Token ID477
Feature activation-7.12e-01

Pos logit contributions
uct+0.010
 Heidi+0.009
 saying+0.009
 Samantha+0.009
 Lib+0.009
Neg logit contributions
 ships-0.011
 shark-0.010
 armor-0.010
washer-0.010
 tanks-0.010
Token around
Token ID1088
Feature activation-1.36e-01

Pos logit contributions
 armor+0.030
 ships+0.029
 shark+0.029
 armour+0.029
 pawn+0.028
Neg logit contributions
 right-0.017
 saying-0.016
 herself-0.015
uct-0.015
 Lib-0.015
Token.
Token ID13
Feature activation-3.82e-01

Pos logit contributions
 ships+0.006
 shark+0.006
 armor+0.005
 tanks+0.005
washer+0.005
Neg logit contributions
uct-0.003
 Heidi-0.003
 Samantha-0.003
 saying-0.003
 Lib-0.003
Token One
Token ID1881
Feature activation+3.32e-02

Pos logit contributions
 ships+0.011
 armor+0.010
 shark+0.010
 armour+0.009
 tanks+0.009
Neg logit contributions
 mistake-0.014
uct-0.014
aman-0.013
 saying-0.013
 Samantha-0.013
Token day
Token ID1110
Feature activation-9.24e-01

Pos logit contributions
 right+0.001
 saying+0.001
 Samantha+0.001
uct+0.001
 Heidi+0.001
Neg logit contributions
 ships-0.002
 armor-0.001
bel-0.001
 shark-0.001
 tanks-0.001
Token<|endoftext|>
Token ID50256
Feature activation-2.33e-01

Pos logit contributions
 mistake+0.023
uct+0.021
 Heidi+0.019
 Lib+0.019
 saying+0.019
Neg logit contributions
 ships-0.027
 shark-0.026
 armor-0.024
washer-0.024
 tanks-0.024
TokenJimmy
Token ID40335
Feature activation-1.84e-01

Pos logit contributions
 ships+0.007
 shark+0.007
 tanks+0.006
washer+0.006
zo+0.006
Neg logit contributions
 mistake-0.008
uct-0.008
aman-0.008
 saying-0.008
 Heidi-0.007
Token wanted
Token ID2227
Feature activation+8.22e-01

Pos logit contributions
 ships+0.008
 shark+0.007
 armor+0.007
bel+0.007
 tanks+0.007
Neg logit contributions
 saying-0.004
 Samantha-0.004
uct-0.004
 Heidi-0.004
 Lib-0.004
Token to
Token ID284
Feature activation+7.04e-01

Pos logit contributions
uct+0.029
 Heidi+0.029
 saying+0.029
 Samantha+0.029
 Lib+0.029
Neg logit contributions
 ships-0.024
 shark-0.022
 armor-0.021
bel-0.020
zo-0.020
Token climb
Token ID12080
Feature activation+6.48e-01

Pos logit contributions
 Samantha+0.021
 saying+0.020
 Heidi+0.020
uct+0.020
 right+0.020
Neg logit contributions
 ships-0.025
 shark-0.023
 armor-0.023
bel-0.023
zo-0.022
Token to
Token ID284
Feature activation-4.83e-02

Pos logit contributions
uct+0.019
 Heidi+0.018
 Samantha+0.018
 saying+0.017
 Lib+0.017
Neg logit contributions
 ships-0.024
 shark-0.022
 armor-0.022
 tanks-0.021
 armour-0.021
Token the
Token ID262
Feature activation-4.85e-01

Pos logit contributions
 ships+0.002
 shark+0.002
 armor+0.001
 tanks+0.001
 armour+0.001
Neg logit contributions
uct-0.001
 Heidi-0.001
 saying-0.001
 Samantha-0.001
 Lib-0.001
Token top
Token ID1353
Feature activation-6.19e-01

Pos logit contributions
 ships+0.019
bel+0.018
 armor+0.018
zo+0.018
 shark+0.018
Neg logit contributions
 right-0.012
 saying-0.012
aman-0.011
 Lib-0.011
 Samantha-0.011
Token of
Token ID286
Feature activation-4.54e-01

Pos logit contributions
 ships+0.019
 shark+0.018
 armor+0.017
washer+0.017
bel+0.017
Neg logit contributions
uct-0.020
 Heidi-0.020
 Samantha-0.020
 saying-0.020
 Lib-0.020
Token the
Token ID262
Feature activation-1.57e-01

Pos logit contributions
 ships+0.015
 shark+0.014
 armor+0.014
 tanks+0.013
 armour+0.013
Neg logit contributions
uct-0.015
 Heidi-0.014
 Lib-0.014
aman-0.014
 Samantha-0.014
Token tree
Token ID5509
Feature activation+1.02e-01

Pos logit contributions
 ships+0.006
zo+0.005
 armor+0.005
bel+0.005
 shark+0.005
Neg logit contributions
 right-0.005
 Lib-0.004
uct-0.004
 saying-0.004
 mistake-0.004
Token will
Token ID481
Feature activation-6.82e-01

Pos logit contributions
 ships+0.039
 armor+0.037
 shark+0.037
bel+0.036
 armour+0.036
Neg logit contributions
uct-0.023
 saying-0.023
 Heidi-0.023
 right-0.023
 Samantha-0.022
Token make
Token ID787
Feature activation-5.40e-01

Pos logit contributions
 ships+0.023
 shark+0.022
 armor+0.021
zo+0.021
 tanks+0.020
Neg logit contributions
uct-0.021
 Heidi-0.021
 Samantha-0.021
 Lib-0.020
 saying-0.020
Token you
Token ID345
Feature activation+5.25e-01

Pos logit contributions
 ships+0.022
 shark+0.021
 tanks+0.020
 armor+0.020
zo+0.020
Neg logit contributions
uct-0.013
 Lib-0.013
 Heidi-0.013
 Samantha-0.013
aman-0.013
Token happy
Token ID3772
Feature activation-6.82e-01

Pos logit contributions
 saying+0.014
uct+0.014
 Heidi+0.014
 Samantha+0.014
 right+0.014
Neg logit contributions
 ships-0.019
 shark-0.018
 armor-0.018
 tanks-0.017
washer-0.017
Token.
Token ID13
Feature activation-6.04e-01

Pos logit contributions
 ships+0.030
 shark+0.029
 armor+0.028
 tanks+0.027
washer+0.027
Neg logit contributions
uct-0.015
 Heidi-0.014
 Samantha-0.014
 Lib-0.014
aman-0.013
Token It
Token ID632
Feature activation-8.77e-01

Pos logit contributions
 ships+0.023
 shark+0.022
 armor+0.022
 tanks+0.021
zo+0.021
Neg logit contributions
uct-0.015
 Heidi-0.015
 saying-0.015
 Samantha-0.015
 Lib-0.015
Token is
Token ID318
Feature activation-7.11e-01

Pos logit contributions
 ships+0.033
 shark+0.031
 armor+0.031
bel+0.030
 tanks+0.030
Neg logit contributions
uct-0.023
 saying-0.023
 Heidi-0.022
 Samantha-0.022
 right-0.022
Token fun
Token ID1257
Feature activation+4.83e-01

Pos logit contributions
 ships+0.026
 shark+0.025
 armor+0.024
 tanks+0.024
 armour+0.022
Neg logit contributions
uct-0.021
 Lib-0.021
aman-0.021
 Samantha-0.020
aren-0.020
Token to
Token ID284
Feature activation+2.33e-01

Pos logit contributions
 saying+0.011
 Heidi+0.010
uct+0.010
 Samantha+0.010
 right+0.010
Neg logit contributions
 ships-0.020
 shark-0.019
 armor-0.019
bel-0.019
zo-0.018
Token catch
Token ID4929
Feature activation+5.93e-01

Pos logit contributions
 saying+0.008
 Heidi+0.008
 Samantha+0.008
uct+0.008
 right+0.008
Neg logit contributions
 ships-0.007
 shark-0.007
 armor-0.006
washer-0.006
bel-0.006
Token a
Token ID257
Feature activation-5.48e-01

Pos logit contributions
uct+0.017
 Samantha+0.017
 Heidi+0.017
 Lib+0.016
aman+0.016
Neg logit contributions
 ships-0.022
 shark-0.021
 armor-0.020
 tanks-0.020
zo-0.020
Token the
Token ID262
Feature activation-2.57e-01

Pos logit contributions
 ships+0.013
washer+0.013
zo+0.013
 pawn+0.012
 armor+0.012
Neg logit contributions
 saying-0.008
 Heidi-0.007
 right-0.007
 Samantha-0.007
 Lib-0.007
Token water
Token ID1660
Feature activation+1.28e-01

Pos logit contributions
bel+0.007
zo+0.007
 ships+0.007
 armor+0.007
os+0.007
Neg logit contributions
 right-0.009
 saying-0.009
 mistake-0.008
 Heidi-0.008
 Samantha-0.007
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
 saying+0.002
 right+0.002
uct+0.002
 Samantha+0.002
 Heidi+0.002
Neg logit contributions
 ships-0.006
 armor-0.006
 shark-0.006
bel-0.006
 armour-0.006
Token They
Token ID1119
Feature activation+3.91e-01

Pos logit contributions
 ships+0.017
 shark+0.016
 armor+0.015
 tanks+0.015
washer+0.015
Neg logit contributions
uct-0.013
 Heidi-0.013
 saying-0.013
 Samantha-0.013
 Lib-0.013
Token run
Token ID1057
Feature activation+7.15e-01

Pos logit contributions
 saying+0.010
 right+0.010
 Samantha+0.010
 Heidi+0.010
uct+0.010
Neg logit contributions
 ships-0.014
bel-0.013
 armor-0.013
 shark-0.013
washer-0.013
Token to
Token ID284
Feature activation-9.76e-01

Pos logit contributions
 right+0.018
 saying+0.014
 Lib+0.012
 herself+0.011
 Heidi+0.011
Neg logit contributions
 armor-0.032
zo-0.031
 armour-0.031
 ships-0.030
 rook-0.030
Token the
Token ID262
Feature activation-4.57e-01

Pos logit contributions
 ships+0.033
 shark+0.032
 armor+0.030
 tanks+0.030
 ship+0.029
Neg logit contributions
uct-0.030
 Heidi-0.029
 Lib-0.029
 Samantha-0.029
aman-0.028
Token shore
Token ID15191
Feature activation+1.69e-01

Pos logit contributions
zo+0.014
bel+0.014
 armor+0.014
 ships+0.014
 armour+0.014
Neg logit contributions
 right-0.015
 saying-0.014
 mistake-0.013
 Heidi-0.012
 Samantha-0.012
Token and
Token ID290
Feature activation+5.71e-02

Pos logit contributions
 saying+0.004
 Samantha+0.004
uct+0.004
 Heidi+0.004
 right+0.003
Neg logit contributions
 ships-0.007
 shark-0.007
 armor-0.007
zo-0.007
bel-0.006
Token see
Token ID766
Feature activation+7.15e-01

Pos logit contributions
 saying+0.002
 Heidi+0.002
 right+0.002
 Samantha+0.002
 Lib+0.002
Neg logit contributions
 ships-0.002
 armor-0.002
zo-0.002
bel-0.002
 shark-0.002
Token a
Token ID257
Feature activation-1.76e-01

Pos logit contributions
 saying+0.015
 Heidi+0.015
 Samantha+0.015
 right+0.014
 Lib+0.014
Neg logit contributions
 ships-0.030
bel-0.029
 armor-0.029
zo-0.029
 shark-0.029
Token for
Token ID329
Feature activation+1.10e-01

Pos logit contributions
uct+0.014
 Samantha+0.013
 Heidi+0.013
aman+0.013
 Lib+0.013
Neg logit contributions
 ships-0.026
 shark-0.024
washer-0.024
 armor-0.024
 tanks-0.024
Token lunch
Token ID9965
Feature activation+6.46e-01

Pos logit contributions
uct+0.004
 Heidi+0.004
 Lib+0.003
aman+0.003
 saying+0.003
Neg logit contributions
 ships-0.004
 tanks-0.003
 shark-0.003
 coral-0.003
 armor-0.003
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
uct+0.014
 Samantha+0.014
 Heidi+0.013
aman+0.013
 Lib+0.012
Neg logit contributions
 ships-0.028
washer-0.027
 shark-0.026
 armor-0.026
 tanks-0.026
Token\n
Token ID198
Feature activation+7.36e-02

Pos logit contributions
 mistake+0.005
uct+0.004
ve+0.004
shadow+0.004
 possible+0.004
Neg logit contributions
 ships-0.004
 tanks-0.003
 shark-0.003
washer-0.003
 Lions-0.003
Token\n
Token ID198
Feature activation+1.03e-01

Pos logit contributions
uct+0.002
 saying+0.002
 Heidi+0.002
 Samantha+0.002
aman+0.002
Neg logit contributions
 ships-0.003
bel-0.003
 shark-0.003
washer-0.003
 tanks-0.002
Token"
Token ID1
Feature activation-8.85e-01

Pos logit contributions
 mistake+0.003
uct+0.003
 saying+0.003
 Heidi+0.003
 Samantha+0.003
Neg logit contributions
bel-0.003
 ships-0.003
washer-0.003
 tanks-0.003
zo-0.003
TokenMom
Token ID29252
Feature activation-1.61e-01

Pos logit contributions
 ships+0.025
 shark+0.024
washer+0.023
 tanks+0.023
 pawn+0.023
Neg logit contributions
 mistake-0.030
uct-0.028
 saying-0.027
 Heidi-0.027
 Samantha-0.026
Token,
Token ID11
Feature activation-1.52e-01

Pos logit contributions
 ships+0.005
washer+0.005
 shark+0.005
 tanks+0.004
 armor+0.004
Neg logit contributions
uct-0.005
 Heidi-0.005
 Samantha-0.005
 Lib-0.005
aman-0.005
Token what
Token ID644
Feature activation+4.70e-01

Pos logit contributions
zo+0.006
bel+0.006
 ships+0.005
 armor+0.005
 armour+0.005
Neg logit contributions
 right-0.004
 OK-0.004
 okay-0.004
 Samantha-0.004
 saying-0.004
Token are
Token ID389
Feature activation-1.27e+00

Pos logit contributions
 Heidi+0.010
uct+0.010
 saying+0.010
 Samantha+0.010
 Lib+0.010
Neg logit contributions
 ships-0.020
 shark-0.019
 armor-0.019
bel-0.018
zo-0.018
Token you
Token ID345
Feature activation+7.72e-01

Pos logit contributions
 ships+0.054
 shark+0.051
 armor+0.050
washer+0.049
 tanks+0.049
Neg logit contributions
uct-0.028
 Heidi-0.027
 Samantha-0.027
 saying-0.027
 Lib-0.026
Token.
Token ID13
Feature activation-1.89e-01

Pos logit contributions
 ships+0.020
zo+0.019
washer+0.018
 ship+0.018
 armor+0.018
Neg logit contributions
uct-0.014
 Heidi-0.013
 Lib-0.013
 Samantha-0.013
aman-0.012
Token She
Token ID1375
Feature activation-6.39e-01

Pos logit contributions
 ships+0.007
 shark+0.007
 tanks+0.007
 armor+0.006
 pawn+0.006
Neg logit contributions
 saying-0.005
uct-0.005
 mistake-0.005
aman-0.005
 Samantha-0.005
Token had
Token ID550
Feature activation-2.05e-02

Pos logit contributions
 ships+0.019
 shark+0.018
 armor+0.017
zo+0.017
washer+0.017
Neg logit contributions
uct-0.022
 Heidi-0.022
 Samantha-0.022
 saying-0.022
 Lib-0.021
Token a
Token ID257
Feature activation+3.13e-02

Pos logit contributions
 ships+0.001
 shark+0.001
 armor+0.001
bel+0.001
 tanks+0.001
Neg logit contributions
uct-0.001
 Heidi-0.001
 saying-0.001
 Samantha-0.001
 Lib-0.001
Token pet
Token ID4273
Feature activation-3.60e-01

Pos logit contributions
 Heidi+0.001
 saying+0.001
 Samantha+0.001
 Lib+0.001
 right+0.001
Neg logit contributions
 ships-0.001
 shark-0.001
 armor-0.001
zo-0.001
bel-0.001
Token ham
Token ID8891
Feature activation-7.16e-01

Pos logit contributions
 ships+0.012
 shark+0.011
 armor+0.011
 tanks+0.010
washer+0.010
Neg logit contributions
uct-0.012
 Heidi-0.011
 Samantha-0.011
 saying-0.011
 Lib-0.011
Tokenster
Token ID1706
Feature activation-1.94e-01

Pos logit contributions
 ships+0.021
 shark+0.020
 armor+0.019
 tanks+0.019
washer+0.018
Neg logit contributions
 Heidi-0.024
 saying-0.024
uct-0.024
 Samantha-0.024
 Lib-0.024
Token named
Token ID3706
Feature activation-3.86e-01

Pos logit contributions
 ships+0.007
 shark+0.007
 armor+0.007
bel+0.007
 tanks+0.007
Neg logit contributions
 saying-0.005
uct-0.005
 Heidi-0.005
 Samantha-0.005
 Lib-0.005
Token Spark
Token ID17732
Feature activation-1.15e+00

Pos logit contributions
 ships+0.011
 shark+0.011
 armor+0.010
zo+0.010
 tanks+0.010
Neg logit contributions
uct-0.014
 mistake-0.014
aman-0.013
 saying-0.012
 OK-0.012
Tokeny
Token ID88
Feature activation-3.60e-01

Pos logit contributions
 ships+0.025
zo+0.023
 shark+0.022
 armor+0.021
 tanks+0.020
Neg logit contributions
 Heidi-0.051
uct-0.051
 Samantha-0.051
 Lib-0.050
 OK-0.050
Token.
Token ID13
Feature activation-1.34e-02

Pos logit contributions
 ships+0.016
 shark+0.015
 armor+0.015
zo+0.015
 tanks+0.015
Neg logit contributions
uct-0.008
 Heidi-0.007
 Samantha-0.007
 Lib-0.007
 saying-0.007
Token.
Token ID13
Feature activation-6.21e-01

Pos logit contributions
uct+0.015
 Samantha+0.015
 Heidi+0.015
 Lib+0.014
 Stella+0.013
Neg logit contributions
 ships-0.032
washer-0.029
 tanks-0.029
 shark-0.029
 armor-0.029
Token They
Token ID1119
Feature activation-5.16e-01

Pos logit contributions
 ships+0.023
 shark+0.021
 armor+0.020
 tanks+0.020
bel+0.020
Neg logit contributions
uct-0.018
 saying-0.017
 Heidi-0.017
 Samantha-0.017
 Lib-0.017
Token liked
Token ID8288
Feature activation-1.61e-02

Pos logit contributions
 shark+0.020
 ships+0.020
 armor+0.020
zo+0.020
 armour+0.019
Neg logit contributions
 saying-0.012
 right-0.012
 Samantha-0.012
 Heidi-0.011
 Lib-0.011
Token to
Token ID284
Feature activation+3.48e-01

Pos logit contributions
zo+0.001
 armour+0.001
 armor+0.001
hing+0.001
ozo+0.001
Neg logit contributions
 right-0.000
 saying-0.000
 herself-0.000
 Samantha-0.000
 cher-0.000
Token pretend
Token ID16614
Feature activation-2.05e-01

Pos logit contributions
 right+0.010
 herself+0.009
 saying+0.009
 Samantha+0.009
 Heidi+0.009
Neg logit contributions
 ships-0.012
zo-0.012
 shark-0.012
 armor-0.011
 tanks-0.011
Token they
Token ID484
Feature activation-1.05e+00

Pos logit contributions
 ships+0.008
 shark+0.008
 armor+0.008
zo+0.008
bel+0.008
Neg logit contributions
 saying-0.005
 Heidi-0.005
 Samantha-0.005
uct-0.005
 Lib-0.005
Token were
Token ID547
Feature activation-1.48e+00

Pos logit contributions
 ships+0.037
 shark+0.036
 armor+0.035
 armour+0.033
zo+0.033
Neg logit contributions
 saying-0.029
 Samantha-0.029
uct-0.029
 Lib-0.029
 Heidi-0.029
Token chefs
Token ID36236
Feature activation+4.59e-01

Pos logit contributions
 ships+0.044
 shark+0.043
 armor+0.042
zo+0.041
 armour+0.040
Neg logit contributions
 saying-0.049
 right-0.048
 Samantha-0.047
 Heidi-0.047
 Lib-0.046
Token and
Token ID290
Feature activation+3.54e-03

Pos logit contributions
 saying+0.010
 Heidi+0.010
uct+0.010
 Samantha+0.009
 Lib+0.009
Neg logit contributions
 ships-0.020
 shark-0.019
 armor-0.018
bel-0.018
zo-0.018
Token make
Token ID787
Feature activation+3.38e-02

Pos logit contributions
 saying+0.000
 Samantha+0.000
 right+0.000
 Heidi+0.000
 mistake+0.000
Neg logit contributions
zo-0.000
 ships-0.000
 shark-0.000
 armor-0.000
 armour-0.000
Token y
Token ID331
Feature activation+7.37e-01

Pos logit contributions
uct+0.001
 Heidi+0.001
 Samantha+0.001
 saying+0.001
 Lib+0.001
Neg logit contributions
 ships-0.001
 shark-0.001
 armor-0.001
 tanks-0.001
bel-0.001
Token'll
Token ID1183
Feature activation-5.04e-02

Pos logit contributions
uct+0.014
 Heidi+0.014
 Lib+0.014
 Samantha+0.014
 saying+0.014
Neg logit contributions
 ships-0.013
zo-0.012
 shark-0.012
bel-0.012
washer-0.012
Token put
Token ID1234
Feature activation-6.83e-01

Pos logit contributions
 ships+0.002
 shark+0.001
 armor+0.001
zo+0.001
 ship+0.001
Neg logit contributions
uct-0.002
 Heidi-0.002
 Lib-0.002
 Samantha-0.002
aman-0.002
Token a
Token ID257
Feature activation-1.38e+00

Pos logit contributions
 ships+0.026
 armor+0.025
 shark+0.025
 armour+0.024
 tanks+0.024
Neg logit contributions
uct-0.019
 Heidi-0.018
 Samantha-0.018
aman-0.018
 Lib-0.017
Token band
Token ID4097
Feature activation-1.43e+00

Pos logit contributions
 ships+0.047
 shark+0.044
 armor+0.043
 tanks+0.041
washer+0.041
Neg logit contributions
uct-0.042
 Heidi-0.041
 Samantha-0.041
 saying-0.041
 Lib-0.040
Token-
Token ID12
Feature activation-9.00e-01

Pos logit contributions
 ships+0.050
 shark+0.047
 armor+0.046
 tanks+0.045
washer+0.044
Neg logit contributions
 saying-0.041
uct-0.041
 Heidi-0.041
 Samantha-0.040
 Lib-0.040
Tokenaid
Token ID1698
Feature activation-2.33e+00

Pos logit contributions
 ships+0.029
 shark+0.027
 armor+0.026
 tanks+0.025
washer+0.025
Neg logit contributions
uct-0.029
 saying-0.029
 Samantha-0.029
 Heidi-0.029
 Lib-0.028
Token on
Token ID319
Feature activation-1.03e+00

Pos logit contributions
 ships+0.090
 shark+0.085
 armor+0.083
washer+0.083
bel+0.082
Neg logit contributions
uct-0.061
 Heidi-0.060
 Samantha-0.060
 Lib-0.058
 saying-0.057
Token your
Token ID534
Feature activation+2.27e-01

Pos logit contributions
 ships+0.039
 shark+0.038
 armor+0.037
zo+0.035
 armour+0.035
Neg logit contributions
uct-0.027
 Samantha-0.026
 Heidi-0.026
 Lib-0.026
 saying-0.026
Token hand
Token ID1021
Feature activation-4.49e-01

Pos logit contributions
uct+0.007
 Heidi+0.007
 Samantha+0.007
 Lib+0.007
 saying+0.007
Neg logit contributions
 ships-0.008
 shark-0.007
 armor-0.007
 armour-0.007
 tanks-0.007
Token and
Token ID290
Feature activation-2.36e-01

Pos logit contributions
 ships+0.016
washer+0.015
 armor+0.015
 shark+0.015
zo+0.015
Neg logit contributions
uct-0.013
 Heidi-0.013
 Samantha-0.013
 mistake-0.012
 Lib-0.012
Token then
Token ID788
Feature activation-5.25e-01

Pos logit contributions
 ships+0.008
 shark+0.008
 armor+0.008
bel+0.007
 tanks+0.007
Neg logit contributions
uct-0.007
 Heidi-0.007
 Samantha-0.007
 Lib-0.007
 saying-0.007
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
uct+0.002
 Heidi+0.002
 Samantha+0.002
aman+0.002
 mistake+0.002
Neg logit contributions
 ships-0.003
 shark-0.003
washer-0.003
 tanks-0.003
 armor-0.003
Token They
Token ID1119
Feature activation-3.32e-01

Pos logit contributions
 ships+0.006
 tanks+0.005
 shark+0.005
 armor+0.005
 Lions+0.005
Neg logit contributions
 mistake-0.006
uct-0.006
 saying-0.006
aman-0.006
 Heidi-0.006
Token were
Token ID547
Feature activation-3.04e-01

Pos logit contributions
 ships+0.012
 shark+0.011
 armor+0.011
washer+0.011
bel+0.011
Neg logit contributions
 saying-0.009
 Samantha-0.009
uct-0.009
 Heidi-0.009
 Lib-0.009
Token called
Token ID1444
Feature activation-1.58e-01

Pos logit contributions
 ships+0.011
 shark+0.010
 armor+0.010
 tanks+0.010
zo+0.009
Neg logit contributions
uct-0.009
 Heidi-0.009
 Samantha-0.009
aman-0.009
 Lib-0.008
Token ost
Token ID23619
Feature activation+2.84e-01

Pos logit contributions
 ships+0.006
 shark+0.005
 armor+0.005
 tanks+0.005
 ship+0.005
Neg logit contributions
uct-0.005
aman-0.005
 mistake-0.005
 Heidi-0.005
 Samantha-0.005
Tokenric
Token ID1173
Feature activation-3.43e+00

Pos logit contributions
 Heidi+0.013
uct+0.013
 Samantha+0.013
 Lib+0.013
 saying+0.013
Neg logit contributions
 ships-0.005
 shark-0.004
 armor-0.004
zo-0.004
 tanks-0.004
Tokenhes
Token ID956
Feature activation-1.00e+00

Pos logit contributions
 ships+0.094
 shark+0.084
washer+0.083
zo+0.083
bel+0.082
Neg logit contributions
aman-0.133
 Samantha-0.131
 Heidi-0.128
uct-0.127
 mistake-0.125
Token.
Token ID13
Feature activation-2.67e-01

Pos logit contributions
 ships+0.039
 shark+0.036
 tanks+0.035
zo+0.035
 armor+0.035
Neg logit contributions
uct-0.027
 Heidi-0.026
 Samantha-0.026
 Lib-0.025
aman-0.025
Token Lily
Token ID20037
Feature activation+6.80e-02

Pos logit contributions
 ships+0.008
 tanks+0.007
 Lions+0.007
 shark+0.007
 polar+0.006
Neg logit contributions
 mistake-0.010
uct-0.010
 saying-0.009
aman-0.009
 Heidi-0.008
Token and
Token ID290
Feature activation+2.13e-01

Pos logit contributions
uct+0.002
 Heidi+0.002
 Samantha+0.002
 saying+0.002
 Lib+0.002
Neg logit contributions
 ships-0.003
 shark-0.002
 armor-0.002
zo-0.002
washer-0.002
Token Ben
Token ID3932
Feature activation-9.38e-01

Pos logit contributions
uct+0.006
 mistake+0.005
aman+0.005
 saying+0.005
 OK+0.005
Neg logit contributions
 ships-0.008
 shark-0.008
 tanks-0.008
 armor-0.007
zo-0.007
Token asked
Token ID1965
Feature activation-5.52e-02

Pos logit contributions
 ships+0.010
 shark+0.009
 armor+0.009
zo+0.009
washer+0.009
Neg logit contributions
uct-0.009
 Heidi-0.009
 Samantha-0.009
 saying-0.009
 Lib-0.008
Token her
Token ID607
Feature activation-7.82e-01

Pos logit contributions
 ships+0.002
 armor+0.002
 shark+0.002
 armour+0.002
bel+0.002
Neg logit contributions
uct-0.001
aman-0.001
 Lib-0.001
 Samantha-0.001
 Heidi-0.001
Token m
Token ID285
Feature activation-2.38e-01

Pos logit contributions
 ships+0.034
zo+0.032
 shark+0.032
bel+0.032
 armor+0.032
Neg logit contributions
 saying-0.016
 right-0.016
 Samantha-0.016
 Heidi-0.016
 mistake-0.016
Tokenama
Token ID1689
Feature activation-1.51e+00

Pos logit contributions
 ships+0.011
 shark+0.011
 tanks+0.011
 armor+0.010
 pawn+0.010
Neg logit contributions
aman-0.004
 Samantha-0.004
uct-0.004
 Lib-0.004
 saying-0.004
Token and
Token ID290
Feature activation-2.86e-01

Pos logit contributions
 ships+0.060
 shark+0.057
 armor+0.056
bel+0.055
zo+0.054
Neg logit contributions
 saying-0.036
 Heidi-0.036
 Samantha-0.035
uct-0.035
 Lib-0.035
Token pap
Token ID20461
Feature activation-2.81e+00

Pos logit contributions
 ships+0.009
 shark+0.009
 armor+0.009
zo+0.008
 tanks+0.008
Neg logit contributions
 saying-0.009
 Samantha-0.009
 Heidi-0.009
uct-0.009
 Lib-0.009
Tokena
Token ID64
Feature activation-1.20e+00

Pos logit contributions
 ships+0.104
 shark+0.098
 armor+0.096
washer+0.095
zo+0.095
Neg logit contributions
uct-0.076
 Heidi-0.075
 Samantha-0.074
 saying-0.074
 Lib-0.073
Token if
Token ID611
Feature activation-5.85e-01

Pos logit contributions
 ships+0.045
 shark+0.042
 armor+0.041
 tanks+0.040
washer+0.040
Neg logit contributions
uct-0.033
 Heidi-0.032
 Samantha-0.032
 saying-0.031
 Lib-0.031
Token she
Token ID673
Feature activation-9.10e-01

Pos logit contributions
 ships+0.024
 shark+0.023
washer+0.022
bel+0.022
 armor+0.021
Neg logit contributions
uct-0.014
 mistake-0.013
 Heidi-0.012
aman-0.012
 saying-0.012
Token could
Token ID714
Feature activation-9.03e-01

Pos logit contributions
 ships+0.022
 shark+0.020
 armor+0.019
washer+0.019
zo+0.019
Neg logit contributions
uct-0.036
 Heidi-0.036
 saying-0.036
 Samantha-0.036
 Lib-0.035
Token go
Token ID467
Feature activation-3.35e-01

Pos logit contributions
 ships+0.028
 shark+0.026
 armor+0.025
washer+0.024
zo+0.024
Neg logit contributions
uct-0.031
 Heidi-0.030
 Samantha-0.030
 Lib-0.029
 saying-0.029
Token.
Token ID13
Feature activation+3.51e-01

Pos logit contributions
 ships+0.039
 shark+0.037
zo+0.036
 armor+0.036
washer+0.036
Neg logit contributions
uct-0.015
 Heidi-0.015
 Samantha-0.014
 Lib-0.014
 saying-0.013
Token He
Token ID679
Feature activation-3.41e-01

Pos logit contributions
 Samantha+0.011
uct+0.011
 Heidi+0.011
 Lib+0.011
 saying+0.011
Neg logit contributions
 ships-0.011
bel-0.010
 shark-0.010
 armor-0.010
zo-0.010
Token was
Token ID373
Feature activation+1.38e-01

Pos logit contributions
 ships+0.011
 shark+0.011
 armor+0.010
zo+0.010
 tanks+0.010
Neg logit contributions
uct-0.011
 Heidi-0.011
 Samantha-0.010
 saying-0.010
 Lib-0.010
Token a
Token ID257
Feature activation-5.68e-01

Pos logit contributions
 saying+0.004
 right+0.004
 Samantha+0.004
 Heidi+0.004
 Lib+0.004
Neg logit contributions
 ships-0.005
 shark-0.005
zo-0.005
bel-0.005
 armor-0.005
Token happy
Token ID3772
Feature activation-3.97e-01

Pos logit contributions
 ships+0.024
ests+0.023
bel+0.023
os+0.022
 armor+0.022
Neg logit contributions
 right-0.013
 okay-0.011
 mistake-0.011
 saying-0.011
 OK-0.011
Token 3
Token ID513
Feature activation-2.49e+00

Pos logit contributions
 ships+0.014
 shark+0.013
 armor+0.013
 tanks+0.012
washer+0.012
Neg logit contributions
uct-0.012
 Heidi-0.012
 Samantha-0.012
 saying-0.012
 Lib-0.011
Token year
Token ID614
Feature activation-9.40e-01

Pos logit contributions
 ships+0.088
 shark+0.083
 armor+0.081
washer+0.079
 tanks+0.079
Neg logit contributions
uct-0.073
 Heidi-0.071
 saying-0.070
 Samantha-0.070
 Lib-0.069
Token old
Token ID1468
Feature activation-1.22e+00

Pos logit contributions
 ships+0.035
washer+0.031
 shark+0.030
 tanks+0.030
 ship+0.030
Neg logit contributions
 Heidi-0.029
uct-0.028
 Samantha-0.028
 OK-0.027
 Lib-0.026
Token who
Token ID508
Feature activation-6.82e-01

Pos logit contributions
 ships+0.047
 shark+0.044
 armor+0.043
 tanks+0.042
washer+0.041
Neg logit contributions
uct-0.035
 Heidi-0.033
 Samantha-0.032
aman-0.031
 Lib-0.031
Token loved
Token ID6151
Feature activation-1.17e-01

Pos logit contributions
 ships+0.023
 shark+0.022
 armor+0.021
 tanks+0.021
 armour+0.020
Neg logit contributions
 saying-0.020
 Samantha-0.020
uct-0.020
 Heidi-0.020
 right-0.020
Token playing
Token ID2712
Feature activation-7.21e-02

Pos logit contributions
 ships+0.004
 shark+0.004
zo+0.004
washer+0.004
 armor+0.003
Neg logit contributions
 saying-0.004
 Samantha-0.004
 right-0.003
 Lib-0.003
 Heidi-0.003
Token if
Token ID611
Feature activation-1.75e-01

Pos logit contributions
 ships+0.007
 shark+0.007
 armor+0.007
washer+0.007
 tanks+0.006
Neg logit contributions
uct-0.004
 Heidi-0.004
 Samantha-0.004
 Lib-0.004
aman-0.004
Token they
Token ID484
Feature activation-1.16e+00

Pos logit contributions
 ships+0.007
 shark+0.007
washer+0.007
 tanks+0.007
 armor+0.006
Neg logit contributions
uct-0.004
aman-0.004
 Heidi-0.004
 Lib-0.004
 Samantha-0.004
Token both
Token ID1111
Feature activation-3.42e-01

Pos logit contributions
 ships+0.039
 shark+0.036
 armor+0.036
bel+0.035
 armour+0.034
Neg logit contributions
uct-0.035
 saying-0.034
 Samantha-0.034
 right-0.034
 Heidi-0.034
Token closed
Token ID4838
Feature activation-7.43e-01

Pos logit contributions
 ships+0.012
 shark+0.012
 armor+0.011
bel+0.011
 tanks+0.011
Neg logit contributions
uct-0.010
 Heidi-0.009
 saying-0.009
 Samantha-0.009
 Lib-0.009
Token their
Token ID511
Feature activation-1.69e+00

Pos logit contributions
 ships+0.027
 shark+0.025
 armor+0.025
 tanks+0.024
bel+0.023
Neg logit contributions
uct-0.022
 Heidi-0.021
 saying-0.020
 Samantha-0.020
 Lib-0.020
Token eyes
Token ID2951
Feature activation-2.40e+00

Pos logit contributions
 ships+0.053
 shark+0.050
 armor+0.049
zo+0.048
bel+0.048
Neg logit contributions
uct-0.054
 saying-0.053
 Heidi-0.053
 right-0.052
 Samantha-0.052
Token and
Token ID290
Feature activation-5.33e-01

Pos logit contributions
 ships+0.082
washer+0.075
 shark+0.075
 armor+0.074
bel+0.073
Neg logit contributions
 Heidi-0.072
uct-0.072
 Samantha-0.072
 Lib-0.070
 saying-0.069
Token said
Token ID531
Feature activation-5.05e-01

Pos logit contributions
 ships+0.019
 shark+0.017
 armor+0.017
washer+0.017
bel+0.016
Neg logit contributions
uct-0.016
 Heidi-0.016
 Lib-0.016
 Samantha-0.016
aman-0.016
Token a
Token ID257
Feature activation-9.57e-02

Pos logit contributions
 ships+0.019
 shark+0.018
 armor+0.017
washer+0.017
bel+0.017
Neg logit contributions
uct-0.014
 Heidi-0.014
 Samantha-0.014
 saying-0.014
 Lib-0.013
Token magic
Token ID5536
Feature activation-6.17e-01

Pos logit contributions
 shark+0.003
 ships+0.003
washer+0.003
 armor+0.003
bel+0.003
Neg logit contributions
uct-0.003
 Lib-0.003
 Heidi-0.003
 Samantha-0.003
 saying-0.003
Token spell
Token ID4822
Feature activation-8.31e-01

Pos logit contributions
 ships+0.020
 shark+0.019
 armor+0.018
washer+0.018
 tanks+0.017
Neg logit contributions
uct-0.021
 Samantha-0.020
 Heidi-0.020
 Lib-0.020
 saying-0.019