TOP ACTIVATIONS
MAX = 0.092

He announced that the wood pe cker was going to visit
bad hunter took the re ind eer to a place far
He will never forget the nod between them , his reminder
I play with your re ind eer ?" Ben asks Lily
She picks up the re ind eer . She looks at
to have the best pe b ble . "
. Sam wanted the pe b ble for himself . He
sorry for throwing the pe b ble at you . Will
big . The wood pe cker was sad . Then
very proud of his beautiful whist ling , but he wanted
boy understood that the chip m unk was boss y ,
They had lost the pe b ble because they did not
to play with the re ind eer . They played together
Tim shared the pe b ble with his friends ,
looked for the best pe b bles and took them home
finally understood why the chip m unk had been so boss
frog had found the perfect mate . <|endoftext|> Once upon a
not everyone likes Tom 's whist ling . One day ,
. Max had a w agging tail and a shiny collar
to play with the re ind eer , you have to

MIN ACTIVATIONS
MIN = -0.084

shoot hay and eat cup cakes . It was a good
it to me for safe keeping . He told me to
out and it looked so y ummy ! The wild pig
until it was so com fy it felt like a cozy
the sun came out and dried up all the rain .
lunch and it was so y ummy and sweet .
and Sarah 's kitchen was clean again . Sarah
nut and it looked so y ummy . She couldn 't
inside the cave , and curled up into a tight ball
hole and it was so cozy inside . It was his
but it felt so com fy and warm . She decided
some soap and began to clean his motorcycle . He spread
set up a table and chairs . She invited her other
she was safe and pl opped down in her lap .
fish and it was so y ummy ! Tim my said
a towel and started to clean the floor . Anna helped
her blanket and felt so cozy . She said to her
in his cardboard box and curled up for a nap .
it , it was so y ummy ! The
mom and went to their rooms . They cried and wished

INTERVAL 0.048 - 0.092
CONTAINS 0.227%

. It belonged to their grand pa , who did not
troubled . He had a pistol , and he wanted to
." The curious girl thought about this for
the man had for them every day ! <|endoftext|> Once upon
squirrel couldn 't believe his luck ! He filled his tum

INTERVAL 0.004 - 0.048
CONTAINS 39.774%

something bad . " We are sorry , Mom
He asked his mom what was wrong and his mom said
and saw the elephant taking his apples . He was very
was crying . She gave him a big hug and told
and mold y and ranc id and expired and best before

INTERVAL -0.040 - 0.004
CONTAINS 59.389%

, and her mom said yes . L ily
L ily and Ben felt better . They held hands
important it is to save energy and money . Robbie also
Their mom said , " You will see . But you
share because he wanted to keep it all for himself .

INTERVAL -0.084 - -0.040
CONTAINS 0.610%

over to a cool and shady spot . There were big
, my dear . Do you want some ?" Mom says
- aid on it and kisses it . "
a small box . He opens the box . Inside ,
deep hills and treasures and castles and dragons . They are
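
The four INTERVAL headings above are consistent with splitting the range from MIN (-0.084) to MAX (0.092) into four equal-width bins of 0.044, and the four CONTAINS percentages sum to 100%. A minimal Python sketch of that reading follows. Equal-width binning is an assumption inferred from the numbers shown, not something this listing states, and `bin_edges` is a hypothetical helper name.

```python
def bin_edges(lo, hi, n_bins):
    """Return n_bins + 1 equally spaced edges from lo to hi.

    Assumption: the dashboard uses equal-width bins; this is inferred
    from the interval headings, not confirmed by the listing.
    """
    width = (hi - lo) / n_bins
    # Round to three decimals to match the precision of the headings.
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


# MIN and MAX values copied from the listing.
edges = bin_edges(-0.084, 0.092, 4)

# Share of tokens per interval, copied from the CONTAINS lines
# (highest interval first).
shares = [0.227, 39.774, 59.389, 0.610]
total = round(sum(shares), 3)

print(edges)
print(total)
```

Under this reading, almost all tokens (about 99.2%) fall in the two middle intervals, so the TOP and MIN ACTIVATIONS examples come from thin tails of the distribution.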
Token He
Token ID679
Feature activation+1.17e-02

Pos logit contributions
working+0.005
sing+0.005
work+0.005
 warriors+0.005
 rook+0.004
Neg logit contributions
See-0.003
So-0.003
Ann-0.003
 else-0.003
Lu-0.003
Token announced
Token ID3414
Feature activation-9.41e-03

Pos logit contributions
working+0.034
sing+0.032
work+0.031
 guys+0.030
 warriors+0.029
Neg logit contributions
See-0.010
 -0.010
Lu-0.008
So-0.008
 much-0.008
Token that
Token ID326
Feature activation-6.14e-03

Pos logit contributions
 +0.010
See+0.010
So+0.009
Lu+0.009
+0.009
Neg logit contributions
working-0.025
sing-0.024
work-0.023
 guys-0.022
 warriors-0.021
Token the
Token ID262
Feature activation-3.32e-03

Pos logit contributions
See+0.007
So+0.006
 +0.006
uct+0.006
Lu+0.006
Neg logit contributions
working-0.016
work-0.015
sing-0.015
 guys-0.014
 warriors-0.014
Token wood
Token ID4898
Feature activation+3.81e-02

Pos logit contributions
See+0.006
 +0.006
So+0.005
 was+0.005
+0.005
Neg logit contributions
working-0.007
sing-0.006
work-0.006
 guys-0.006
zo-0.006
Tokenpe
Token ID431
Feature activation+9.19e-02

Pos logit contributions
working+0.097
sing+0.092
work+0.086
 guys+0.085
 warriors+0.083
Neg logit contributions
 -0.045
See-0.044
So-0.040
Lu-0.039
 was-0.039
Tokencker
Token ID15280
Feature activation+3.12e-02

Pos logit contributions
working+0.124
sing+0.112
work+0.098
 guys+0.097
 warriors+0.093
Neg logit contributions
 -0.214
See-0.212
 was-0.206
So-0.204
-0.203
Token was
Token ID373
Feature activation+7.80e-03

Pos logit contributions
working+0.088
sing+0.082
work+0.081
 guys+0.077
zo+0.076
Neg logit contributions
See-0.029
 -0.028
Lu-0.026
So-0.025
Ph-0.024
Token going
Token ID1016
Feature activation+6.70e-03

Pos logit contributions
working+0.018
sing+0.016
work+0.016
 guys+0.015
 warriors+0.015
Neg logit contributions
See-0.012
 -0.011
So-0.011
Lu-0.011
uct-0.010
Token to
Token ID284
Feature activation-6.74e-03

Pos logit contributions
working+0.017
sing+0.017
work+0.016
 photographer+0.016
 warrior+0.016
Neg logit contributions
 -0.007
 was-0.007
 much-0.007
-0.007
-0.006
Token visit
Token ID3187
Feature activation-5.55e-03

Pos logit contributions
See+0.011
 +0.011
So+0.010
Lu+0.010
uct+0.010
Neg logit contributions
working-0.014
sing-0.013
work-0.012
 guys-0.012
 warriors-0.012
Token bad
Token ID2089
Feature activation+3.37e-02

Pos logit contributions
 +0.015
See+0.015
 was+0.014
So+0.014
+0.014
Neg logit contributions
working-0.021
sing-0.019
work-0.018
 guys-0.017
 warriors-0.017
Token hunter
Token ID19177
Feature activation+5.66e-02

Pos logit contributions
sing+0.070
working+0.067
 rook+0.067
work+0.064
 bishop+0.063
Neg logit contributions
 was-0.052
 -0.049
 much-0.047
See-0.046
 p-0.046
Token took
Token ID1718
Feature activation+1.54e-02

Pos logit contributions
working+0.160
sing+0.151
work+0.147
 guys+0.142
zo+0.142
Neg logit contributions
See-0.050
 -0.046
Lu-0.043
So-0.041
uct-0.039
Token the
Token ID262
Feature activation+2.64e-02

Pos logit contributions
working+0.040
sing+0.037
work+0.036
 guys+0.036
 warriors+0.034
Neg logit contributions
See-0.017
 -0.016
So-0.016
uct-0.015
Lu-0.015
Token re
Token ID302
Feature activation+5.37e-02

Pos logit contributions
working+0.056
sing+0.052
work+0.048
 warriors+0.046
 photographer+0.046
Neg logit contributions
 -0.044
See-0.043
 was-0.040
So-0.040
 much-0.039
Tokenind
Token ID521
Feature activation+9.09e-02

Pos logit contributions
working+0.086
sing+0.080
 guys+0.075
work+0.069
 rook+0.068
Neg logit contributions
 -0.109
See-0.108
So-0.107
Lu-0.105
 was-0.105
Tokeneer
Token ID28153
Feature activation+4.29e-02

Pos logit contributions
sing+0.104
working+0.095
 rook+0.081
 photographer+0.080
 guys+0.078
Neg logit contributions
 -0.221
 was-0.218
See-0.215
-0.214
 much-0.214
Token to
Token ID284
Feature activation+1.51e-03

Pos logit contributions
working+0.104
sing+0.097
work+0.091
 guys+0.090
 photographer+0.088
Neg logit contributions
 -0.056
-0.052
So-0.051
See-0.051
 much-0.051
Token a
Token ID257
Feature activation-7.84e-03

Pos logit contributions
working+0.004
sing+0.004
work+0.004
 guys+0.004
 warriors+0.003
Neg logit contributions
See-0.002
 -0.002
So-0.001
Lu-0.001
-0.001
Token place
Token ID1295
Feature activation-3.82e-02

Pos logit contributions
 +0.011
See+0.011
So+0.010
 much+0.010
+0.010
Neg logit contributions
working-0.018
sing-0.017
work-0.016
 guys-0.016
 warriors-0.015
Token far
Token ID1290
Feature activation+1.85e-02

Pos logit contributions
See+0.050
 +0.047
So+0.045
Lu+0.045
uct+0.043
Neg logit contributions
working-0.094
sing-0.089
work-0.085
 guys-0.079
 warriors-0.079
Token He
Token ID679
Feature activation+4.78e-03

Pos logit contributions
up+0.015
 else+0.015
Lu+0.015
uct+0.014
See+0.014
Neg logit contributions
working-0.026
work-0.025
sing-0.024
zo-0.024
 warrior-0.024
Token will
Token ID481
Feature activation-5.06e-03

Pos logit contributions
working+0.012
sing+0.011
work+0.011
zo+0.011
 crew+0.011
Neg logit contributions
See-0.005
Lu-0.005
 -0.004
Ph-0.004
S-0.004
Token never
Token ID1239
Feature activation+2.22e-03

Pos logit contributions
See+0.006
 +0.006
Lu+0.006
So+0.006
uct+0.006
Neg logit contributions
working-0.012
sing-0.011
work-0.011
 guys-0.011
zo-0.010
Token forget
Token ID6044
Feature activation-2.14e-02

Pos logit contributions
working+0.005
sing+0.004
work+0.004
 guys+0.004
 warriors+0.004
Neg logit contributions
 -0.004
See-0.004
So-0.004
-0.004
 was-0.003
Token the
Token ID262
Feature activation+4.25e-03

Pos logit contributions
See+0.030
Ph+0.026
Ann+0.026
S+0.026
Lu+0.025
Neg logit contributions
working-0.048
work-0.045
 guys-0.044
 warriors-0.042
zo-0.042
Token nod
Token ID18666
Feature activation+8.85e-02

Pos logit contributions
working+0.010
sing+0.009
work+0.009
 guys+0.008
 warriors+0.008
Neg logit contributions
 -0.006
See-0.006
So-0.006
Lu-0.006
-0.006
Token between
Token ID1022
Feature activation+2.11e-02

Pos logit contributions
working+0.249
sing+0.232
work+0.228
 guys+0.218
zo+0.212
Neg logit contributions
See-0.082
 -0.079
So-0.073
Lu-0.071
uct-0.066
Token them
Token ID606
Feature activation+1.35e-02

Pos logit contributions
working+0.054
sing+0.050
work+0.048
 guys+0.047
 warriors+0.046
Neg logit contributions
See-0.025
 -0.024
So-0.022
Lu-0.022
-0.021
Token,
Token ID11
Feature activation-5.76e-04

Pos logit contributions
working+0.036
sing+0.034
work+0.033
 guys+0.032
 warriors+0.031
Neg logit contributions
See-0.014
 -0.013
Lu-0.013
So-0.012
Ann-0.012
Token his
Token ID465
Feature activation-1.40e-02

Pos logit contributions
See+0.001
 +0.001
Lu+0.001
So+0.001
Ann+0.001
Neg logit contributions
working-0.001
sing-0.001
work-0.001
 guys-0.001
 warriors-0.001
Token reminder
Token ID15438
Feature activation-1.40e-02

Pos logit contributions
 +0.021
See+0.021
So+0.019
Lu+0.019
+0.018
Neg logit contributions
working-0.032
sing-0.029
work-0.028
 guys-0.027
 warriors-0.026
Token I
Token ID314
Feature activation-2.58e-02

Pos logit contributions
See+0.019
 +0.018
So+0.017
Lu+0.016
uct+0.016
Neg logit contributions
working-0.039
sing-0.037
work-0.036
 guys-0.035
 warriors-0.033
Token play
Token ID711
Feature activation-5.21e-03

Pos logit contributions
See+0.041
 +0.039
Lu+0.037
So+0.037
uct+0.036
Neg logit contributions
working-0.056
sing-0.051
work-0.049
 guys-0.048
 warriors-0.045
Token with
Token ID351
Feature activation+9.86e-03

Pos logit contributions
 +0.005
 was+0.004
See+0.004
+0.004
+0.004
Neg logit contributions
working-0.016
sing-0.015
work-0.014
 spine-0.013
zo-0.013
Token your
Token ID534
Feature activation+1.02e-02

Pos logit contributions
working+0.027
sing+0.025
work+0.025
 guys+0.024
 warriors+0.023
Neg logit contributions
See-0.010
 -0.009
So-0.009
Lu-0.008
uct-0.008
Token re
Token ID302
Feature activation+5.41e-02

Pos logit contributions
working+0.024
sing+0.022
work+0.021
zo+0.020
 warriors+0.020
Neg logit contributions
 -0.014
See-0.014
 was-0.013
 p-0.013
So-0.013
Tokenind
Token ID521
Feature activation+8.64e-02

Pos logit contributions
working+0.105
sing+0.098
work+0.087
 guys+0.087
 warriors+0.087
Neg logit contributions
 -0.086
So-0.084
aw-0.083
S-0.083
 was-0.082
Tokeneer
Token ID28153
Feature activation+4.30e-02

Pos logit contributions
sing+0.102
working+0.093
work+0.075
 warriors+0.072
 photographer+0.072
Neg logit contributions
 -0.202
 was-0.200
-0.199
aw-0.198
Lu-0.197
Token?"
Token ID1701
Feature activation+1.15e-02

Pos logit contributions
working+0.120
sing+0.114
work+0.108
 guys+0.105
 warriors+0.105
Neg logit contributions
 -0.039
See-0.035
So-0.034
 was-0.034
-0.034
Token Ben
Token ID3932
Feature activation+1.17e-03

Pos logit contributions
working+0.029
sing+0.026
work+0.025
 guys+0.025
 warriors+0.024
Neg logit contributions
See-0.014
So-0.012
Lu-0.012
uct-0.012
 much-0.011
Token asks
Token ID7893
Feature activation+9.96e-03

Pos logit contributions
working+0.003
sing+0.003
work+0.003
zo+0.003
 guys+0.003
Neg logit contributions
See-0.001
Lu-0.001
Ph-0.001
 -0.001
Ann-0.001
Token Lily
Token ID20037
Feature activation-3.34e-02

Pos logit contributions
working+0.029
sing+0.028
work+0.027
 guys+0.026
 warriors+0.026
Neg logit contributions
See-0.008
 -0.007
So-0.007
Lu-0.006
uct-0.006
Token She
Token ID1375
Feature activation-2.22e-02

Pos logit contributions
working+0.004
sing+0.004
work+0.004
 warriors+0.004
 soldier+0.004
Neg logit contributions
uct-0.003
See-0.003
Lu-0.003
Ph-0.003
up-0.003
Token picks
Token ID11103
Feature activation-1.19e-02

Pos logit contributions
Lu+0.027
See+0.027
Ph+0.026
Ab+0.025
Su+0.025
Neg logit contributions
working-0.051
sing-0.048
work-0.046
 warriors-0.045
zo-0.045
Token up
Token ID510
Feature activation+1.02e-03

Pos logit contributions
See+0.014
 +0.013
So+0.013
Lu+0.012
uct+0.012
Neg logit contributions
working-0.030
sing-0.028
work-0.027
 guys-0.027
 warriors-0.026
Token the
Token ID262
Feature activation+1.75e-02

Pos logit contributions
working+0.003
sing+0.003
work+0.003
 guys+0.002
 warriors+0.002
Neg logit contributions
See-0.001
 -0.001
So-0.001
Lu-0.001
uct-0.001
Token re
Token ID302
Feature activation+4.78e-02

Pos logit contributions
working+0.036
sing+0.034
work+0.032
 guys+0.031
 warriors+0.030
Neg logit contributions
See-0.029
 -0.029
So-0.027
Lu-0.026
-0.026
Tokenind
Token ID521
Feature activation+8.57e-02

Pos logit contributions
working+0.082
sing+0.075
 guys+0.073
 warriors+0.067
 photographer+0.066
Neg logit contributions
 -0.091
See-0.089
aw-0.087
 was-0.086
Lu-0.086
Tokeneer
Token ID28153
Feature activation+4.17e-02

Pos logit contributions
sing+0.091
working+0.090
 guys+0.073
 warriors+0.069
 photographer+0.069
Neg logit contributions
 -0.214
See-0.209
 was-0.209
Lu-0.207
-0.204
Token.
Token ID13
Feature activation+1.28e-02

Pos logit contributions
working+0.099
sing+0.094
work+0.092
 guys+0.085
 warriors+0.084
Neg logit contributions
See-0.057
 -0.053
Lu-0.051
So-0.050
Ph-0.050
Token She
Token ID1375
Feature activation-2.02e-02

Pos logit contributions
working+0.028
sing+0.027
 rook+0.025
 warriors+0.025
 soldier+0.024
Neg logit contributions
uct-0.015
ight-0.014
So-0.014
Lu-0.014
up-0.013
Token looks
Token ID3073
Feature activation+9.51e-03

Pos logit contributions
Lu+0.026
See+0.026
 +0.024
So+0.024
Ph+0.024
Neg logit contributions
working-0.047
sing-0.045
work-0.042
 warriors-0.042
 guys-0.041
Token at
Token ID379
Feature activation-2.15e-02

Pos logit contributions
working+0.020
sing+0.019
work+0.017
 guys+0.017
 warriors+0.017
Neg logit contributions
See-0.015
Lu-0.015
So-0.014
Ann-0.014
 -0.013
Token to
Token ID284
Feature activation+4.14e-03

Pos logit contributions
working+0.041
sing+0.039
work+0.037
 guys+0.036
zo+0.035
Neg logit contributions
 -0.019
See-0.019
So-0.017
 much-0.017
Lu-0.017
Token have
Token ID423
Feature activation+8.31e-03

Pos logit contributions
working+0.008
sing+0.008
work+0.007
 guys+0.007
 warriors+0.007
Neg logit contributions
See-0.007
 -0.007
So-0.007
Lu-0.007
uct-0.006
Token the
Token ID262
Feature activation+1.44e-02

Pos logit contributions
working+0.017
 guys+0.016
sing+0.015
work+0.015
 rook+0.015
Neg logit contributions
So-0.012
uct-0.011
Lu-0.011
-0.011
Conf-0.011
Token best
Token ID1266
Feature activation+4.46e-02

Pos logit contributions
working+0.033
sing+0.031
work+0.030
 guys+0.029
 warriors+0.028
Neg logit contributions
See-0.020
 -0.020
Lu-0.019
So-0.019
-0.018
Token pe
Token ID613
Feature activation+5.99e-02

Pos logit contributions
working+0.103
sing+0.097
work+0.092
 guys+0.090
 warriors+0.086
Neg logit contributions
See-0.062
 -0.060
Lu-0.059
So-0.058
uct-0.057
Tokenb
Token ID65
Feature activation+8.42e-02

Pos logit contributions
working+0.141
sing+0.134
work+0.126
 guys+0.123
 warriors+0.119
Neg logit contributions
 -0.081
See-0.081
So-0.075
 was-0.072
Lu-0.072
Tokenble
Token ID903
Feature activation+5.35e-02

Pos logit contributions
working+0.140
sing+0.127
work+0.118
 guys+0.112
 warriors+0.108
Neg logit contributions
See-0.173
 -0.173
So-0.164
Lu-0.163
-0.160
Token.
Token ID13
Feature activation+1.75e-02

Pos logit contributions
working+0.148
sing+0.136
work+0.134
 guys+0.128
 warriors+0.125
Neg logit contributions
See-0.056
 -0.053
So-0.048
uct-0.047
Lu-0.046
Token
Token ID198
Feature activation+5.28e-03

Pos logit contributions
working+0.050
sing+0.048
 warriors+0.044
work+0.044
 guys+0.044
Neg logit contributions
uct-0.011
See-0.010
Lu-0.010
Ph-0.009
Ab-0.009
Token
Token ID198
Feature activation+1.53e-02

Pos logit contributions
working+0.014
sing+0.013
zo+0.013
work+0.012
 warriors+0.012
Neg logit contributions
uct-0.004
See-0.004
ird-0.004
Lu-0.004
Ab-0.003
Token"
Token ID1
Feature activation+1.87e-02

Pos logit contributions
working+0.042
sing+0.038
 warriors+0.037
zo+0.037
 guys+0.037
Neg logit contributions
 -0.014
See-0.013
uct-0.013
Lu-0.011
 Lucy-0.011
Token.
Token ID13
Feature activation+3.50e-03

Pos logit contributions
See+0.005
 +0.005
Lu+0.004
So+0.004
uct+0.004
Neg logit contributions
working-0.012
sing-0.012
work-0.011
 guys-0.011
 warriors-0.011
Token Sam
Token ID3409
Feature activation+7.91e-03

Pos logit contributions
working+0.009
sing+0.008
work+0.008
 guys+0.007
 warriors+0.007
Neg logit contributions
See-0.004
Lu-0.004
 -0.004
So-0.004
uct-0.004
Token wanted
Token ID2227
Feature activation+2.09e-02

Pos logit contributions
working+0.021
sing+0.020
 guys+0.019
work+0.019
zo+0.018
Neg logit contributions
See-0.008
 -0.007
Lu-0.007
So-0.007
Ab-0.007
Token the
Token ID262
Feature activation+1.48e-02

Pos logit contributions
working+0.060
sing+0.057
work+0.054
 spine+0.052
 warrior+0.052
Neg logit contributions
 -0.021
 was-0.020
-0.019
 much-0.019
See-0.019
Token pe
Token ID613
Feature activation+5.73e-02

Pos logit contributions
working+0.032
sing+0.030
work+0.028
 guys+0.028
 warrior+0.026
Neg logit contributions
Lu-0.022
See-0.021
So-0.021
 -0.021
-0.020
Tokenb
Token ID65
Feature activation+8.41e-02

Pos logit contributions
working+0.129
sing+0.120
work+0.114
 guys+0.110
 warriors+0.107
Neg logit contributions
 -0.084
See-0.084
So-0.078
Lu-0.077
-0.075
Tokenble
Token ID903
Feature activation+4.73e-02

Pos logit contributions
working+0.144
sing+0.130
work+0.123
 guys+0.116
 warriors+0.111
Neg logit contributions
See-0.169
 -0.168
Lu-0.160
So-0.160
-0.156
Token for
Token ID329
Feature activation+2.24e-02

Pos logit contributions
working+0.124
sing+0.116
work+0.111
 guys+0.108
 warriors+0.106
Neg logit contributions
 -0.053
See-0.053
So-0.048
Lu-0.047
-0.045
Token himself
Token ID2241
Feature activation+3.86e-02

Pos logit contributions
working+0.064
sing+0.060
work+0.058
zo+0.055
 photographer+0.054
Neg logit contributions
 -0.022
See-0.021
 much-0.019
So-0.018
 was-0.018
Token.
Token ID13
Feature activation+8.63e-04

Pos logit contributions
working+0.113
sing+0.107
work+0.102
 guys+0.100
 warriors+0.098
Neg logit contributions
See-0.032
 -0.031
Lu-0.027
So-0.027
-0.024
Token He
Token ID679
Feature activation+1.26e-02

Pos logit contributions
working+0.002
sing+0.002
 guys+0.002
work+0.002
 warriors+0.002
Neg logit contributions
See-0.001
Lu-0.001
uct-0.001
Ab-0.001
 much-0.001
Token sorry
Token ID7926
Feature activation+9.96e-04

Pos logit contributions
working+0.004
sing+0.004
work+0.004
 guys+0.004
 warriors+0.004
Neg logit contributions
See-0.002
 -0.002
So-0.002
Lu-0.002
-0.002
Token for
Token ID329
Feature activation+1.35e-02

Pos logit contributions
working+0.003
sing+0.002
 guys+0.002
work+0.002
 warriors+0.002
Neg logit contributions
See-0.001
 -0.001
So-0.001
Lu-0.001
uct-0.001
Token throwing
Token ID9644
Feature activation+1.92e-02

Pos logit contributions
working+0.031
sing+0.029
work+0.027
 guys+0.026
 warriors+0.026
Neg logit contributions
See-0.020
 -0.020
So-0.018
Lu-0.018
-0.017
Token the
Token ID262
Feature activation+2.83e-02

Pos logit contributions
 guys+0.039
working+0.038
sing+0.036
 rook+0.035
work+0.035
Neg logit contributions
Lu-0.028
uct-0.027
See-0.027
So-0.026
Ann-0.025
Token pe
Token ID613
Feature activation+5.38e-02

Pos logit contributions
working+0.061
sing+0.056
 guys+0.053
work+0.053
 warrior+0.050
Neg logit contributions
 -0.042
Lu-0.042
See-0.041
So-0.041
-0.039
Tokenb
Token ID65
Feature activation+8.37e-02

Pos logit contributions
working+0.119
sing+0.111
work+0.105
 guys+0.101
 warriors+0.099
Neg logit contributions
 -0.081
See-0.081
So-0.075
Lu-0.074
-0.072
Tokenble
Token ID903
Feature activation+5.13e-02

Pos logit contributions
working+0.142
sing+0.128
work+0.121
 guys+0.114
 warriors+0.109
Neg logit contributions
See-0.170
 -0.170
So-0.161
Lu-0.161
-0.156
Token at
Token ID379
Feature activation-2.45e-03

Pos logit contributions
working+0.128
sing+0.117
work+0.113
 guys+0.111
 warriors+0.106
Neg logit contributions
See-0.068
 -0.065
So-0.060
Lu-0.060
uct-0.057
Token you
Token ID345
Feature activation-1.85e-02

Pos logit contributions
 +0.002
See+0.002
So+0.002
Lu+0.002
+0.002
Neg logit contributions
working-0.007
sing-0.007
work-0.007
 guys-0.006
 warriors-0.006
Token.
Token ID13
Feature activation-8.80e-03

Pos logit contributions
See+0.020
 +0.017
Lu+0.017
So+0.016
uct+0.015
Neg logit contributions
working-0.051
sing-0.049
work-0.046
 guys-0.046
 warriors-0.044
Token Will
Token ID2561
Feature activation+3.31e-02

Pos logit contributions
See+0.011
 +0.010
Lu+0.010
So+0.010
Ph+0.010
Neg logit contributions
working-0.021
sing-0.020
work-0.019
 guys-0.018
 warriors-0.018
Token big
Token ID1263
Feature activation-4.42e-02

Pos logit contributions
working+0.009
sing+0.009
work+0.008
 guys+0.008
 warriors+0.008
Neg logit contributions
See-0.006
 -0.006
So-0.005
Lu-0.005
-0.005
Token.
Token ID13
Feature activation-7.78e-03

Pos logit contributions
 +0.065
See+0.063
So+0.060
+0.059
 was+0.059
Neg logit contributions
working-0.098
sing-0.093
work-0.087
 guys-0.083
 warriors-0.082
Token
Token ID198
Feature activation-6.79e-03

Pos logit contributions
See+0.011
Lu+0.011
Ab+0.010
She+0.010
 much+0.010
Neg logit contributions
working-0.016
work-0.015
sing-0.014
zo-0.014
 warriors-0.013
Token The
Token ID383
Feature activation-5.05e-03

Pos logit contributions
See+0.005
uct+0.005
 kisses+0.004
 p+0.004
 cal+0.004
Neg logit contributions
working-0.018
zo-0.017
sing-0.016
work-0.016
 guys-0.015
Token wood
Token ID4898
Feature activation+2.95e-02

Pos logit contributions
See+0.007
 +0.007
 was+0.007
+0.006
So+0.006
Neg logit contributions
working-0.012
sing-0.011
work-0.011
zo-0.010
 guys-0.010
Tokenpe
Token ID431
Feature activation+8.36e-02

Pos logit contributions
working+0.051
sing+0.047
work+0.043
 guys+0.041
 warriors+0.040
Neg logit contributions
 -0.059
See-0.059
So-0.056
Lu-0.055
 was-0.054
Tokencker
Token ID15280
Feature activation+1.87e-02

Pos logit contributions
working+0.063
sing+0.052
work+0.042
 guys+0.039
 warriors+0.036
Neg logit contributions
 -0.244
See-0.243
So-0.237
 was-0.235
Lu-0.232
Token was
Token ID373
Feature activation+8.66e-03

Pos logit contributions
working+0.054
sing+0.051
work+0.049
 guys+0.047
zo+0.047
Neg logit contributions
See-0.016
 -0.015
Lu-0.014
So-0.013
 much-0.013
Token sad
Token ID6507
Feature activation+8.33e-03

Pos logit contributions
working+0.023
sing+0.021
work+0.021
 guys+0.020
 photographer+0.019
Neg logit contributions
 -0.010
See-0.009
 much-0.009
So-0.008
-0.008
Token.
Token ID13
Feature activation+9.97e-03

Pos logit contributions
working+0.023
sing+0.021
work+0.021
 photographer+0.020
 guys+0.020
Neg logit contributions
 -0.008
 was-0.008
 much-0.008
-0.008
So-0.008
Token Then
Token ID3244
Feature activation-9.43e-03

Pos logit contributions
working+0.025
sing+0.024
work+0.023
 guys+0.022
 warriors+0.022
Neg logit contributions
See-0.012
 -0.011
Lu-0.011
So-0.011
-0.010
Token very
Token ID845
Feature activation+1.38e-02

Pos logit contributions
working+0.025
sing+0.024
work+0.023
 guys+0.023
 warriors+0.022
Neg logit contributions
 -0.009
See-0.008
 much-0.008
 was-0.007
-0.007
Token proud
Token ID6613
Feature activation+2.01e-02

Pos logit contributions
working+0.036
sing+0.035
work+0.034
 guys+0.033
 warriors+0.032
Neg logit contributions
 -0.015
See-0.014
 much-0.014
-0.012
So-0.012
Token of
Token ID286
Feature activation-1.92e-02

Pos logit contributions
working+0.059
sing+0.055
work+0.053
 guys+0.052
 warriors+0.051
Neg logit contributions
See-0.017
 -0.016
Lu-0.015
So-0.014
uct-0.014
Token his
Token ID465
Feature activation+1.25e-02

Pos logit contributions
See+0.026
So+0.025
 +0.025
Ph+0.024
Lu+0.024
Neg logit contributions
working-0.045
sing-0.040
 guys-0.039
work-0.038
 warriors-0.038
Token beautiful
Token ID4950
Feature activation+3.00e-02

Pos logit contributions
working+0.028
sing+0.025
work+0.024
 spine+0.023
 warriors+0.023
Neg logit contributions
 -0.018
 was-0.017
-0.017
 much-0.017
 p-0.017
Token whist
Token ID32262
Feature activation+8.35e-02

Pos logit contributions
working+0.057
sing+0.055
work+0.051
 warriors+0.048
 spine+0.048
Neg logit contributions
 -0.052
 was-0.051
 p-0.049
See-0.048
-0.047
Tokenling
Token ID1359
Feature activation+6.26e-02

Pos logit contributions
sing+0.179
working+0.158
 spine+0.150
 rook+0.147
 warriors+0.145
Neg logit contributions
 was-0.135
 much-0.126
 -0.125
So-0.121
aw-0.120
Token,
Token ID11
Feature activation+8.86e-03

Pos logit contributions
working+0.168
sing+0.164
work+0.152
 warriors+0.148
 photographer+0.147
Neg logit contributions
 -0.062
 was-0.059
See-0.056
So-0.054
 much-0.053
Token but
Token ID475
Feature activation+4.21e-03

Pos logit contributions
working+0.022
sing+0.020
work+0.020
 guys+0.019
 warriors+0.018
Neg logit contributions
 -0.012
See-0.011
So-0.010
 was-0.010
 much-0.010
Token he
Token ID339
Feature activation+6.54e-03

Pos logit contributions
working+0.011
sing+0.010
work+0.009
 guys+0.009
zo+0.009
Neg logit contributions
See-0.005
 -0.005
So-0.005
Lu-0.005
uct-0.005
Token wanted
Token ID2227
Feature activation+1.00e-02

Pos logit contributions
working+0.019
sing+0.018
work+0.017
 guys+0.017
zo+0.016
Neg logit contributions
See-0.006
 -0.006
So-0.005
Lu-0.005
-0.004
Token boy
Token ID2933
Feature activation+2.35e-02

Pos logit contributions
 +0.011
See+0.011
So+0.010
 was+0.010
Lu+0.010
Neg logit contributions
working-0.018
sing-0.016
work-0.016
 guys-0.015
zo-0.015
Token understood
Token ID7247
Feature activation+6.87e-03

Pos logit contributions
working+0.065
sing+0.060
work+0.059
zo+0.057
 guys+0.057
Neg logit contributions
See-0.023
 -0.021
Lu-0.020
Ph-0.018
So-0.018
Token that
Token ID326
Feature activation+7.35e-04

Pos logit contributions
working+0.017
sing+0.015
work+0.015
 guys+0.014
 warriors+0.014
Neg logit contributions
See-0.009
 -0.008
Lu-0.008
Ann-0.007
Ph-0.007
Token the
Token ID262
Feature activation+7.71e-05

Pos logit contributions
working+0.002
sing+0.002
work+0.002
 guys+0.002
 warriors+0.002
Neg logit contributions
See-0.001
 -0.001
Lu-0.001
So-0.001
Ph-0.001
Token chip
Token ID11594
Feature activation+6.07e-02

Pos logit contributions
working+0.000
sing+0.000
work+0.000
 guys+0.000
zo+0.000
Neg logit contributions
 -0.000
See-0.000
So-0.000
 was-0.000
-0.000
Tokenm
Token ID76
Feature activation+8.35e-02

Pos logit contributions
working+0.189
sing+0.182
work+0.173
 guys+0.169
 warriors+0.167
Neg logit contributions
 -0.039
See-0.037
So-0.032
 was-0.031
Lu-0.030
Tokenunk
Token ID2954
Feature activation+3.69e-02

Pos logit contributions
working+0.148
sing+0.136
work+0.128
 guys+0.123
 warriors+0.121
Neg logit contributions
 -0.159
See-0.155
 was-0.151
So-0.149
Lu-0.147
Token was
Token ID373
Feature activation+4.68e-03

Pos logit contributions
working+0.106
sing+0.098
work+0.097
 guys+0.093
zo+0.091
Neg logit contributions
See-0.032
 -0.031
So-0.028
Lu-0.028
Ph-0.026
Token boss
Token ID6478
Feature activation+2.44e-02

Pos logit contributions
working+0.011
sing+0.010
work+0.010
 guys+0.009
 warriors+0.009
Neg logit contributions
See-0.006
 -0.006
So-0.006
Lu-0.006
Ann-0.006
Tokeny
Token ID88
Feature activation+9.42e-03

Pos logit contributions
working+0.036
work+0.030
sing+0.029
 guys+0.026
zo+0.026
Neg logit contributions
See-0.057
 -0.057
So-0.054
Lu-0.054
Ph-0.053
Token,
Token ID11
Feature activation+2.06e-03

Pos logit contributions
working+0.025
sing+0.023
work+0.022
 guys+0.022
 warriors+0.021
Neg logit contributions
See-0.011
 -0.010
Lu-0.009
So-0.009
uct-0.009
Token They
Token ID1119
Feature activation+5.21e-03

Pos logit contributions
working+0.043
sing+0.040
work+0.039
 guys+0.037
 warriors+0.036
Neg logit contributions
See-0.026
 -0.025
Lu-0.025
So-0.024
uct-0.024
Token had
Token ID550
Feature activation+1.50e-02

Pos logit contributions
working+0.012
sing+0.012
 guys+0.011
zo+0.011
 warriors+0.011
Neg logit contributions
Lu-0.006
See-0.006
Ann-0.006
 -0.006
uct-0.006
Token lost
Token ID2626
Feature activation+1.54e-02

Pos logit contributions
working+0.036
sing+0.034
work+0.032
 guys+0.031
 warriors+0.031
Neg logit contributions
See-0.020
 -0.020
So-0.018
Lu-0.018
-0.017
Token the
Token ID262
Feature activation+1.74e-02

Pos logit contributions
working+0.040
sing+0.037
work+0.035
 guys+0.034
 warriors+0.033
Neg logit contributions
 -0.018
See-0.018
So-0.016
Lu-0.016
 was-0.016
Token pe
Token ID613
Feature activation+5.51e-02

Pos logit contributions
working+0.042
sing+0.040
work+0.038
 guys+0.036
 warriors+0.036
Neg logit contributions
 -0.022
See-0.022
So-0.021
Lu-0.021
-0.020
Tokenb
Token ID65
Feature activation+8.30e-02

Pos logit contributions
working+0.126
sing+0.118
work+0.112
 guys+0.109
 warriors+0.106
Neg logit contributions
 -0.078
See-0.078
So-0.073
Lu-0.071
 was-0.070
Tokenble
Token ID903
Feature activation+4.94e-02

Pos logit contributions
working+0.159
sing+0.145
work+0.134
 guys+0.131
 warriors+0.127
Neg logit contributions
 -0.151
See-0.149
So-0.142
 was-0.141
 much-0.137
Token because
Token ID780
Feature activation-9.73e-03

Pos logit contributions
working+0.143
sing+0.135
work+0.129
 guys+0.127
 warriors+0.124
Neg logit contributions
 -0.040
See-0.038
So-0.035
-0.034
 much-0.034
Token they
Token ID484
Feature activation+5.25e-03

Pos logit contributions
 +0.011
See+0.011
So+0.011
Lu+0.011
uct+0.011
Neg logit contributions
working-0.024
sing-0.023
work-0.023
 guys-0.021
 warriors-0.021
Token did
Token ID750
Feature activation+2.10e-02

Pos logit contributions
working+0.014
sing+0.014
work+0.013
 guys+0.013
 warriors+0.012
Neg logit contributions
See-0.005
 -0.005
So-0.005
Lu-0.005
-0.004
Token not
Token ID407
Feature activation+1.69e-02

Pos logit contributions
working+0.047
sing+0.044
work+0.041
 guys+0.041
 warriors+0.039
Neg logit contributions
See-0.030
 -0.030
Lu-0.029
So-0.028
Ann-0.027
Token to
Token ID284
Feature activation+5.82e-03

Pos logit contributions
working+0.028
sing+0.026
work+0.025
 guys+0.024
zo+0.024
Neg logit contributions
 -0.014
See-0.013
 was-0.013
-0.012
So-0.012
Token play
Token ID711
Feature activation-3.71e-03

Pos logit contributions
working+0.015
sing+0.015
work+0.014
 spine+0.013
 warriors+0.013
Neg logit contributions
 -0.008
-0.007
 was-0.007
See-0.006
 Sue-0.006
Token with
Token ID351
Feature activation+7.02e-03

Pos logit contributions
 +0.002
+0.002
 was+0.002
 much+0.002
So+0.002
Neg logit contributions
working-0.012
sing-0.012
work-0.011
 spine-0.011
cker-0.011
Token the
Token ID262
Feature activation+1.89e-02

Pos logit contributions
working+0.017
sing+0.016
 guys+0.015
work+0.015
 warriors+0.015
Neg logit contributions
See-0.009
 -0.008
So-0.008
uct-0.008
Lu-0.008
Token re
Token ID302
Feature activation+5.35e-02

Pos logit contributions
working+0.041
sing+0.039
work+0.036
 warriors+0.033
 rook+0.033
Neg logit contributions
 -0.031
See-0.031
 was-0.029
So-0.029
-0.028
Tokenind
Token ID521
Feature activation+8.27e-02

Pos logit contributions
working+0.087
sing+0.084
 guys+0.075
 warriors+0.072
work+0.071
Neg logit contributions
 -0.104
So-0.104
Lu-0.104
See-0.102
S-0.102
Tokeneer
Token ID28153
Feature activation+3.39e-02

Pos logit contributions
sing+0.103
working+0.091
 rook+0.078
 warriors+0.074
work+0.074
Neg logit contributions
 was-0.194
Lu-0.192
-0.192
 -0.191
S-0.186
Token.
Token ID13
Feature activation-5.43e-03

Pos logit contributions
working+0.100
sing+0.095
work+0.090
 guys+0.089
 photographer+0.087
Neg logit contributions
 -0.027
See-0.023
 much-0.022
-0.022
So-0.022
Token They
Token ID1119
Feature activation+8.47e-03

Pos logit contributions
See+0.005
Lu+0.005
ird+0.004
uct+0.004
Ann+0.004
Neg logit contributions
working-0.014
sing-0.013
work-0.013
zo-0.013
 warriors-0.013
Token played
Token ID2826
Feature activation-9.88e-03

Pos logit contributions
working+0.023
sing+0.022
work+0.020
 spine+0.020
 warriors+0.019
Neg logit contributions
 -0.009
See-0.009
 was-0.008
So-0.008
-0.008
Token together
Token ID1978
Feature activation+7.88e-03

Pos logit contributions
 +0.008
+0.007
 was+0.007
So+0.007
See+0.006
Neg logit contributions
working-0.031
sing-0.029
work-0.028
 spine-0.027
cker-0.026
Token\n
Token ID198
Feature activation+1.04e-03

Pos logit contributions
See+0.013
uct+0.011
 cal+0.011
ird+0.010
aw+0.010
Neg logit contributions
working-0.033
sing-0.031
zo-0.030
work-0.028
 guys-0.028
TokenTim
Token ID14967
Feature activation-5.30e-03

Pos logit contributions
working+0.003
sing+0.002
 guys+0.002
zo+0.002
work+0.002
Neg logit contributions
 -0.001
See-0.001
 kisses-0.001
 cal-0.001
uct-0.001
Token shared
Token ID4888
Feature activation+1.76e-02

Pos logit contributions
See+0.007
 +0.006
Lu+0.006
Ph+0.006
 much+0.006
Neg logit contributions
working-0.013
sing-0.012
zo-0.012
 guys-0.012
work-0.011
Token the
Token ID262
Feature activation+2.81e-02

Pos logit contributions
working+0.040
sing+0.038
work+0.036
 guys+0.036
 warriors+0.035
Neg logit contributions
See-0.023
Lu-0.023
So-0.022
-0.022
Ann-0.021
Token pe
Token ID613
Feature activation+6.10e-02

Pos logit contributions
working+0.061
sing+0.055
work+0.051
 guys+0.051
 warrior+0.049
Neg logit contributions
See-0.041
Lu-0.041
So-0.041
 -0.041
-0.039
Tokenb
Token ID65
Feature activation+8.27e-02

Pos logit contributions
working+0.141
sing+0.132
work+0.125
 guys+0.121
 warriors+0.118
Neg logit contributions
 -0.086
See-0.086
So-0.080
Lu-0.078
-0.076
Tokenble
Token ID903
Feature activation+4.11e-02

Pos logit contributions
working+0.148
sing+0.135
work+0.126
 guys+0.120
 warriors+0.116
Neg logit contributions
 -0.161
See-0.160
So-0.152
Lu-0.150
-0.147
Token with
Token ID351
Feature activation+8.17e-03

Pos logit contributions
working+0.114
sing+0.108
work+0.103
 guys+0.101
 warriors+0.099
Neg logit contributions
 -0.040
See-0.036
So-0.035
 much-0.035
-0.034
Token his
Token ID465
Feature activation+9.30e-03

Pos logit contributions
working+0.018
sing+0.017
 guys+0.016
work+0.016
 warriors+0.016
Neg logit contributions
See-0.012
-0.011
So-0.011
 -0.011
Lu-0.011
Token friends
Token ID2460
Feature activation+3.13e-03

Pos logit contributions
working+0.026
sing+0.024
work+0.023
 guys+0.022
 warriors+0.022
Neg logit contributions
 -0.010
See-0.009
So-0.008
Lu-0.008
 was-0.008
Token,
Token ID11
Feature activation+2.04e-02

Pos logit contributions
working+0.009
sing+0.009
work+0.008
 guys+0.008
 warriors+0.008
Neg logit contributions
 -0.003
See-0.002
So-0.002
-0.002
 was-0.002
Token looked
Token ID3114
Feature activation+1.19e-02

Pos logit contributions
working+0.009
sing+0.008
work+0.008
 guys+0.008
 warriors+0.008
Neg logit contributions
See-0.003
 -0.003
So-0.003
Lu-0.003
-0.003
Token for
Token ID329
Feature activation-1.17e-03

Pos logit contributions
working+0.030
sing+0.028
work+0.027
 guys+0.026
zo+0.025
Neg logit contributions
 -0.015
See-0.014
So-0.013
 much-0.013
-0.013
Token the
Token ID262
Feature activation-3.20e-03

Pos logit contributions
 +0.002
See+0.002
So+0.001
Lu+0.001
 was+0.001
Neg logit contributions
working-0.003
sing-0.003
work-0.003
 guys-0.002
zo-0.002
Token best
Token ID1266
Feature activation+4.05e-02

Pos logit contributions
See+0.004
 +0.004
Lu+0.004
So+0.004
uct+0.004
Neg logit contributions
working-0.007
sing-0.007
work-0.006
 guys-0.006
 warriors-0.006
Token pe
Token ID613
Feature activation+5.49e-02

Pos logit contributions
working+0.092
sing+0.085
work+0.080
 guys+0.079
 warriors+0.076
Neg logit contributions
See-0.059
 -0.058
Lu-0.055
So-0.055
uct-0.054
Tokenb
Token ID65
Feature activation+8.22e-02

Pos logit contributions
working+0.124
sing+0.116
work+0.110
 guys+0.106
 warriors+0.104
Neg logit contributions
See-0.080
 -0.080
So-0.074
Lu-0.073
-0.071
Tokenbles
Token ID7689
Feature activation+3.58e-02

Pos logit contributions
working+0.139
sing+0.126
work+0.118
 guys+0.111
 warriors+0.108
Neg logit contributions
See-0.167
 -0.167
So-0.159
Lu-0.158
-0.154
Token and
Token ID290
Feature activation+1.12e-02

Pos logit contributions
working+0.089
sing+0.082
work+0.080
 guys+0.077
 warriors+0.076
Neg logit contributions
See-0.045
 -0.044
So-0.041
Lu-0.040
uct-0.039
Token took
Token ID1718
Feature activation-1.47e-03

Pos logit contributions
working+0.029
sing+0.027
work+0.026
 guys+0.025
 warriors+0.025
Neg logit contributions
See-0.013
 -0.013
So-0.012
Lu-0.011
-0.011
Token them
Token ID606
Feature activation-4.60e-04

Pos logit contributions
See+0.002
 +0.002
So+0.002
Lu+0.002
uct+0.002
Neg logit contributions
working-0.004
sing-0.003
work-0.003
 guys-0.003
 warriors-0.003
Token home
Token ID1363
Feature activation-2.77e-02

Pos logit contributions
See+0.001
 +0.001
So+0.001
Lu+0.001
+0.001
Neg logit contributions
working-0.001
sing-0.001
work-0.001
 guys-0.001
 warriors-0.001
Token finally
Token ID3443
Feature activation+7.04e-03

Pos logit contributions
working+0.023
sing+0.022
work+0.022
zo+0.021
 warriors+0.021
Neg logit contributions
See-0.009
Lu-0.008
 -0.008
Care-0.008
So-0.008
Token understood
Token ID7247
Feature activation-2.01e-03

Pos logit contributions
working+0.016
work+0.014
 warriors+0.014
 warrior+0.014
zo+0.014
Neg logit contributions
See-0.010
Lu-0.010
 -0.009
So-0.009
Ann-0.009
Token why
Token ID1521
Feature activation+8.00e-03

Pos logit contributions
See+0.002
Lu+0.002
 +0.002
Ann+0.002
So+0.002
Neg logit contributions
working-0.005
work-0.005
sing-0.005
 guys-0.004
 warriors-0.004
Token the
Token ID262
Feature activation-3.69e-03

Pos logit contributions
working+0.022
sing+0.021
work+0.020
 guys+0.019
 warriors+0.019
Neg logit contributions
See-0.008
 -0.008
So-0.007
Lu-0.007
uct-0.006
Token chip
Token ID11594
Feature activation+5.16e-02

Pos logit contributions
 +0.006
See+0.006
So+0.005
 was+0.005
Lu+0.005
Neg logit contributions
working-0.008
sing-0.007
work-0.007
 guys-0.007
zo-0.007
Tokenm
Token ID76
Feature activation+8.20e-02

Pos logit contributions
working+0.162
sing+0.155
work+0.148
 guys+0.145
 warriors+0.144
Neg logit contributions
 -0.031
See-0.030
So-0.025
Lu-0.024
 was-0.023
Tokenunk
Token ID2954
Feature activation+3.34e-02

Pos logit contributions
working+0.146
sing+0.133
work+0.125
 guys+0.124
 warriors+0.121
Neg logit contributions
 -0.156
See-0.153
 was-0.148
So-0.145
Lu-0.144
Token had
Token ID550
Feature activation+2.25e-02

Pos logit contributions
working+0.098
sing+0.092
work+0.090
 guys+0.086
zo+0.085
Neg logit contributions
See-0.027
 -0.025
Lu-0.022
So-0.022
Ph-0.021
Token been
Token ID587
Feature activation+3.09e-02

Pos logit contributions
working+0.053
sing+0.048
work+0.048
 guys+0.047
 warriors+0.046
Neg logit contributions
See-0.029
 -0.029
Lu-0.028
So-0.028
Ann-0.028
Token so
Token ID523
Feature activation-1.11e-02

Pos logit contributions
working+0.075
sing+0.069
work+0.067
 warriors+0.064
 guys+0.064
Neg logit contributions
See-0.040
 -0.039
So-0.037
Lu-0.037
Ann-0.035
Token boss
Token ID6478
Feature activation+1.64e-02

Pos logit contributions
 +0.016
See+0.016
Lu+0.015
So+0.015
Ann+0.015
Neg logit contributions
working-0.026
sing-0.023
work-0.022
 warriors-0.021
 guys-0.021
Token frog
Token ID21264
Feature activation+2.35e-02

Pos logit contributions
 +0.006
See+0.006
So+0.006
Lu+0.005
 was+0.005
Neg logit contributions
working-0.010
sing-0.009
work-0.009
 guys-0.008
zo-0.008
Token had
Token ID550
Feature activation+1.99e-02

Pos logit contributions
working+0.059
sing+0.055
work+0.053
 guys+0.051
 warriors+0.051
Neg logit contributions
See-0.029
 -0.028
So-0.025
Lu-0.025
uct-0.024
Token found
Token ID1043
Feature activation-6.58e-03

Pos logit contributions
working+0.046
work+0.041
sing+0.040
 warriors+0.040
 guys+0.040
Neg logit contributions
See-0.026
 -0.025
Lu-0.025
Ann-0.025
So-0.024
Token the
Token ID262
Feature activation+8.51e-03

Pos logit contributions
See+0.008
 +0.007
uct+0.007
So+0.007
Lu+0.007
Neg logit contributions
working-0.016
work-0.015
sing-0.015
 guys-0.014
 warriors-0.014
Token perfect
Token ID2818
Feature activation+1.91e-02

Pos logit contributions
working+0.020
sing+0.019
work+0.018
 guys+0.017
zo+0.017
Neg logit contributions
 -0.012
See-0.011
So-0.011
 was-0.010
-0.010
Token mate
Token ID16133
Feature activation+8.20e-02

Pos logit contributions
working+0.041
sing+0.038
work+0.036
 guys+0.034
 warriors+0.034
Neg logit contributions
 -0.030
See-0.030
So-0.028
Lu-0.028
 was-0.027
Token.
Token ID13
Feature activation-5.69e-03

Pos logit contributions
working+0.239
sing+0.227
work+0.214
 guys+0.213
 warriors+0.208
Neg logit contributions
 -0.066
See-0.063
So-0.057
Lu-0.055
 was-0.054
Token<|endoftext|>
Token ID50256
Feature activation-9.93e-03

Pos logit contributions
See+0.006
up+0.006
Ph+0.006
uct+0.006
 else+0.006
Neg logit contributions
working-0.013
work-0.013
sing-0.012
zo-0.012
 photographer-0.011
TokenOnce
Token ID7454
Feature activation+8.12e-03

Pos logit contributions
uct+0.013
So+0.011
See+0.011
Ph+0.011
 else+0.011
Neg logit contributions
working-0.022
work-0.020
zo-0.019
 guys-0.019
 warriors-0.019
Token upon
Token ID2402
Feature activation-2.03e-02

Pos logit contributions
working+0.017
sing+0.015
work+0.014
 guys+0.013
zo+0.013
Neg logit contributions
See-0.014
So-0.013
Lu-0.012
Care-0.012
uct-0.012
Token a
Token ID257
Feature activation-2.45e-02

Pos logit contributions
See+0.031
uct+0.031
Ph+0.028
 +0.028
Lu+0.028
Neg logit contributions
working-0.040
sing-0.039
 guys-0.036
work-0.034
 warriors-0.033
Token not
Token ID407
Feature activation+2.02e-02

Pos logit contributions
working+0.004
sing+0.004
work+0.004
 guys+0.003
 warriors+0.003
Neg logit contributions
See-0.002
 -0.002
So-0.002
Lu-0.002
-0.002
Token everyone
Token ID2506
Feature activation+3.14e-03

Pos logit contributions
working+0.051
sing+0.048
work+0.046
 guys+0.045
 warriors+0.044
Neg logit contributions
 -0.024
See-0.023
So-0.021
 much-0.021
-0.021
Token likes
Token ID7832
Feature activation+1.67e-02

Pos logit contributions
working+0.008
sing+0.008
work+0.008
 guys+0.007
zo+0.007
Neg logit contributions
See-0.003
 -0.003
So-0.003
Lu-0.003
Ph-0.003
Token Tom
Token ID4186
Feature activation+3.62e-02

Pos logit contributions
working+0.042
sing+0.040
work+0.038
 guys+0.035
 warriors+0.035
Neg logit contributions
 -0.021
See-0.020
 was-0.018
 much-0.018
So-0.018
Token's
Token ID338
Feature activation+3.87e-02

Pos logit contributions
working+0.102
sing+0.096
work+0.093
 guys+0.091
zo+0.089
Neg logit contributions
See-0.035
Lu-0.030
 -0.030
So-0.029
Ph-0.027
Token whist
Token ID32262
Feature activation+8.19e-02

Pos logit contributions
working+0.076
sing+0.067
work+0.064
 warriors+0.063
 photographer+0.061
Neg logit contributions
 -0.071
 was-0.066
See-0.066
So-0.064
-0.064
Tokenling
Token ID1359
Feature activation+5.68e-02

Pos logit contributions
sing+0.163
working+0.162
 spine+0.140
work+0.140
 warriors+0.140
Neg logit contributions
 -0.134
 was-0.132
See-0.128
So-0.128
 much-0.128
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
working+0.168
sing+0.158
work+0.154
 guys+0.151
zo+0.146
Neg logit contributions
See-0.045
 -0.042
Lu-0.038
So-0.037
uct-0.035
Token One
Token ID1881
Feature activation+3.42e-02

Pos logit contributions
working+0.039
sing+0.037
 guys+0.035
work+0.035
 warriors+0.033
Neg logit contributions
See-0.021
Lu-0.020
So-0.019
 -0.018
Ab-0.018
Token day
Token ID1110
Feature activation+3.58e-03

Pos logit contributions
working+0.070
sing+0.065
work+0.061
 guys+0.059
zo+0.057
Neg logit contributions
See-0.058
 -0.058
So-0.054
Lu-0.054
uct-0.052
Token,
Token ID11
Feature activation+1.97e-02

Pos logit contributions
working+0.008
 guys+0.007
sing+0.006
work+0.006
 warriors+0.006
Neg logit contributions
uct-0.006
See-0.006
So-0.005
Lu-0.005
 -0.005
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
working+0.035
sing+0.033
work+0.032
 guys+0.031
zo+0.031
Neg logit contributions
See-0.010
 -0.009
So-0.008
Lu-0.008
uct-0.008
Token Max
Token ID5436
Feature activation+1.10e-02

Pos logit contributions
working+0.035
sing+0.033
work+0.032
 guys+0.030
 warriors+0.030
Neg logit contributions
 -0.017
See-0.016
So-0.015
 So-0.015
-0.015
Token had
Token ID550
Feature activation+1.65e-02

Pos logit contributions
working+0.033
sing+0.031
work+0.030
 guys+0.029
 warriors+0.028
Neg logit contributions
See-0.008
 -0.008
So-0.007
Lu-0.007
uct-0.006
Token a
Token ID257
Feature activation+1.06e-02

Pos logit contributions
working+0.043
sing+0.040
work+0.037
 guys+0.037
 warriors+0.037
Neg logit contributions
See-0.018
 -0.018
Lu-0.017
So-0.017
uct-0.016
Token w
Token ID266
Feature activation+5.24e-02

Pos logit contributions
working+0.029
work+0.028
sing+0.026
 warriors+0.026
zo+0.026
Neg logit contributions
 -0.012
 much-0.010
See-0.010
 was-0.010
 p-0.010
Tokenagging
Token ID16406
Feature activation+8.19e-02

Pos logit contributions
working+0.111
work+0.105
sing+0.100
fly+0.092
 photographer+0.091
Neg logit contributions
See-0.079
aw-0.078
 -0.077
 much-0.076
So-0.074
Token tail
Token ID7894
Feature activation+1.85e-02

Pos logit contributions
working+0.138
work+0.125
sing+0.121
 guys+0.113
 warriors+0.112
Neg logit contributions
 -0.171
 was-0.162
See-0.159
-0.156
 much-0.152
Token and
Token ID290
Feature activation+4.43e-03

Pos logit contributions
working+0.048
sing+0.045
work+0.043
 guys+0.042
 warriors+0.041
Neg logit contributions
 -0.021
See-0.020
So-0.019
Lu-0.018
-0.018
Token a
Token ID257
Feature activation+2.35e-03

Pos logit contributions
working+0.010
sing+0.010
 warriors+0.009
 guys+0.009
work+0.009
Neg logit contributions
uct-0.006
See-0.006
So-0.006
Lu-0.006
Ph-0.006
Token shiny
Token ID22441
Feature activation+2.05e-02

Pos logit contributions
working+0.005
sing+0.005
work+0.005
 guys+0.005
 warriors+0.005
Neg logit contributions
 -0.003
See-0.003
So-0.003
 much-0.003
 was-0.003
Token collar
Token ID19908
Feature activation-6.59e-03

Pos logit contributions
working+0.045
work+0.041
sing+0.040
zo+0.038
 warriors+0.038
Neg logit contributions
 -0.031
 p-0.030
 was-0.029
See-0.029
 much-0.027
Token to
Token ID284
Feature activation+1.49e-02

Pos logit contributions
working+0.056
sing+0.051
work+0.050
 guys+0.048
 warriors+0.046
Neg logit contributions
See-0.035
 -0.034
Lu-0.032
So-0.032
uct-0.031
Token play
Token ID711
Feature activation-3.42e-03

Pos logit contributions
working+0.034
sing+0.032
work+0.030
 guys+0.029
 warriors+0.028
Neg logit contributions
 -0.022
See-0.021
So-0.020
-0.019
Lu-0.019
Token with
Token ID351
Feature activation+8.98e-03

Pos logit contributions
 +0.003
See+0.003
So+0.003
 was+0.003
 much+0.003
Neg logit contributions
working-0.010
sing-0.009
work-0.009
zo-0.009
 warriors-0.009
Token the
Token ID262
Feature activation+2.31e-02

Pos logit contributions
working+0.024
sing+0.022
work+0.022
 guys+0.021
 warriors+0.020
Neg logit contributions
See-0.010
 -0.009
So-0.009
Lu-0.008
uct-0.008
Token re
Token ID302
Feature activation+5.30e-02

Pos logit contributions
working+0.049
sing+0.046
work+0.043
 photographer+0.040
zo+0.040
Neg logit contributions
See-0.037
 -0.037
 was-0.034
 p-0.033
So-0.033
Tokenind
Token ID521
Feature activation+8.16e-02

Pos logit contributions
working+0.091
sing+0.087
 guys+0.083
 warriors+0.077
 photographer+0.077
Neg logit contributions
 -0.094
 was-0.092
So-0.092
Lu-0.090
-0.090
Tokeneer
Token ID28153
Feature activation+3.82e-02

Pos logit contributions
sing+0.097
working+0.089
 rook+0.075
 photographer+0.075
 bishop+0.072
Neg logit contributions
 was-0.190
 -0.190
-0.189
 much-0.185
Lu-0.185
Token,
Token ID11
Feature activation+3.57e-03

Pos logit contributions
working+0.106
sing+0.099
work+0.096
 guys+0.093
 warriors+0.091
Neg logit contributions
See-0.038
 -0.036
So-0.032
Lu-0.032
uct-0.030
Token you
Token ID345
Feature activation-3.48e-02

Pos logit contributions
working+0.010
sing+0.009
work+0.009
 guys+0.009
 warriors+0.008
Neg logit contributions
See-0.004
 -0.004
So-0.003
Lu-0.003
-0.003
Token have
Token ID423
Feature activation+7.21e-03

Pos logit contributions
 +0.036
See+0.036
So+0.033
+0.032
 was+0.031
Neg logit contributions
working-0.094
sing-0.088
work-0.084
 guys-0.081
 warriors-0.080
Token to
Token ID284
Feature activation+1.60e-02

Pos logit contributions
working+0.014
sing+0.013
work+0.012
 guys+0.012
 warriors+0.012
Neg logit contributions
Lu-0.012
See-0.012
 -0.012
So-0.012
Ann-0.012
Token shoot
Token ID2686
Feature activation+1.74e-02

Pos logit contributions
See+0.006
 +0.006
So+0.006
Lu+0.006
uct+0.006
Neg logit contributions
working-0.007
sing-0.006
work-0.006
 guys-0.006
 warriors-0.005
Token hay
Token ID27678
Feature activation-3.52e-02

Pos logit contributions
working+0.043
sing+0.040
work+0.039
 guys+0.038
 warriors+0.037
Neg logit contributions
See-0.021
 -0.021
So-0.020
Lu-0.019
uct-0.018
Token and
Token ID290
Feature activation+2.17e-03

Pos logit contributions
 +0.032
 was+0.029
+0.028
So+0.027
See+0.027
Neg logit contributions
working-0.098
sing-0.093
work-0.086
 guys-0.085
 photographer-0.084
Token eat
Token ID4483
Feature activation-3.45e-02

Pos logit contributions
working+0.005
sing+0.005
work+0.004
 guys+0.004
 warriors+0.004
Neg logit contributions
See-0.003
 -0.003
So-0.003
Lu-0.003
uct-0.003
Token cup
Token ID6508
Feature activation-4.59e-02

Pos logit contributions
See+0.048
So+0.044
uct+0.044
 +0.043
Ph+0.042
Neg logit contributions
working-0.079
 guys-0.072
sing-0.071
work-0.070
 warriors-0.066
Tokencakes
Token ID37263
Feature activation-7.46e-02

Pos logit contributions
See+0.059
 +0.056
So+0.050
uct+0.049
+0.048
Neg logit contributions
working-0.116
work-0.105
sing-0.104
 guys-0.101
zo-0.096
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
See+0.062
 +0.059
So+0.051
Lu+0.050
+0.046
Neg logit contributions
working-0.219
sing-0.205
work-0.199
 guys-0.194
 warriors-0.189
Token It
Token ID632
Feature activation-5.44e-03

Pos logit contributions
Ann+0.015
Ab+0.015
aw+0.015
See+0.015
So+0.015
Neg logit contributions
working-0.031
sing-0.028
work-0.027
 guys-0.026
 spine-0.025
Token was
Token ID373
Feature activation-2.68e-03

Pos logit contributions
See+0.003
 +0.003
uct+0.002
So+0.002
Ph+0.002
Neg logit contributions
working-0.017
sing-0.016
work-0.016
 guys-0.015
zo-0.015
Token a
Token ID257
Feature activation-1.94e-02

Pos logit contributions
See+0.004
So+0.004
 +0.003
 So+0.003
Conf+0.003
Neg logit contributions
working-0.006
sing-0.006
 guys-0.005
work-0.005
 warriors-0.005
Token good
Token ID922
Feature activation-7.46e-03

Pos logit contributions
 +0.021
See+0.021
So+0.019
Lu+0.018
 much+0.018
Neg logit contributions
working-0.052
sing-0.048
work-0.047
 guys-0.045
 warriors-0.044
Token it
Token ID340
Feature activation+9.67e-04

Pos logit contributions
So+0.003
See+0.003
 +0.003
uct+0.003
Lu+0.003
Neg logit contributions
working-0.008
work-0.008
 guys-0.008
sing-0.007
zo-0.007
Token to
Token ID284
Feature activation-1.27e-03

Pos logit contributions
working+0.002
sing+0.002
work+0.002
 guys+0.002
 warriors+0.002
Neg logit contributions
See-0.001
 -0.001
So-0.001
Lu-0.001
-0.001
Token me
Token ID502
Feature activation-3.03e-02

Pos logit contributions
See+0.001
 +0.001
So+0.001
Lu+0.001
+0.001
Neg logit contributions
working-0.003
sing-0.003
work-0.003
 guys-0.003
 warriors-0.003
Token for
Token ID329
Feature activation+2.60e-02

Pos logit contributions
See+0.033
 +0.030
So+0.027
Lu+0.027
Ph+0.026
Neg logit contributions
working-0.082
sing-0.077
work-0.075
 guys-0.072
 warriors-0.070
Token safe
Token ID3338
Feature activation-1.30e-02

Pos logit contributions
working+0.064
sing+0.060
work+0.057
 guys+0.056
 warriors+0.055
Neg logit contributions
See-0.033
 -0.032
So-0.030
Lu-0.030
-0.029
Tokenkeeping
Token ID19934
Feature activation-7.42e-02

Pos logit contributions
 +0.017
See+0.017
So+0.016
Lu+0.015
 was+0.015
Neg logit contributions
working-0.031
sing-0.029
work-0.028
 guys-0.027
 warriors-0.026
Token.
Token ID13
Feature activation+5.25e-03

Pos logit contributions
See+0.088
So+0.076
Care+0.075
Lu+0.074
Conf+0.072
Neg logit contributions
working-0.199
sing-0.185
work-0.181
 guys-0.175
 spine-0.167
Token He
Token ID679
Feature activation+8.41e-03

Pos logit contributions
working+0.010
sing+0.009
work+0.009
 soldier+0.009
 warriors+0.009
Neg logit contributions
uct-0.008
Lu-0.008
Ph-0.008
Ab-0.007
So-0.007
Token told
Token ID1297
Feature activation+3.32e-03

Pos logit contributions
working+0.023
work+0.021
sing+0.021
 guys+0.021
zo+0.020
Neg logit contributions
See-0.008
 -0.008
Lu-0.008
Ph-0.007
uct-0.007
Token me
Token ID502
Feature activation-2.69e-02

Pos logit contributions
working+0.008
sing+0.008
 guys+0.008
work+0.008
 warriors+0.007
Neg logit contributions
See-0.004
So-0.004
 -0.004
Lu-0.003
-0.003
Token to
Token ID284
Feature activation+5.38e-03

Pos logit contributions
See+0.037
 +0.032
Lu+0.032
uct+0.032
So+0.032
Neg logit contributions
working-0.066
sing-0.061
work-0.060
 guys-0.058
 warriors-0.056
Token out
Token ID503
Feature activation+1.40e-02

Pos logit contributions
working+0.002
sing+0.001
work+0.001
 guys+0.001
 warriors+0.001
Neg logit contributions
See-0.001
 -0.001
So-0.001
Lu-0.001
-0.001
Token and
Token ID290
Feature activation-4.02e-03

Pos logit contributions
working+0.036
work+0.033
sing+0.033
 warriors+0.031
 guys+0.030
Neg logit contributions
See-0.016
 -0.015
Lu-0.015
Ph-0.015
Ann-0.014
Token it
Token ID340
Feature activation-6.09e-03

Pos logit contributions
So+0.005
See+0.005
 +0.005
Conf+0.005
Lu+0.005
Neg logit contributions
working-0.010
sing-0.009
work-0.009
 guys-0.008
 warriors-0.008
Token looked
Token ID3114
Feature activation+1.55e-02

Pos logit contributions
See+0.008
So+0.007
 +0.007
uct+0.007
Ph+0.007
Neg logit contributions
working-0.015
work-0.014
sing-0.013
 guys-0.013
zo-0.012
Token so
Token ID523
Feature activation-2.04e-02

Pos logit contributions
working+0.037
sing+0.035
 guys+0.034
work+0.033
 warriors+0.031
Neg logit contributions
See-0.020
Ann-0.020
Lu-0.019
So-0.019
 -0.019
Token y
Token ID331
Feature activation-7.42e-02

Pos logit contributions
Ann+0.028
See+0.027
So+0.026
Conf+0.026
Lu+0.025
Neg logit contributions
working-0.047
sing-0.042
 guys-0.041
work-0.040
zo-0.036
Tokenummy
Token ID13513
Feature activation-3.75e-02

Pos logit contributions
See+0.148
 +0.148
So+0.137
Lu+0.136
+0.133
Neg logit contributions
working-0.131
sing-0.117
work-0.112
 guys-0.106
 warriors-0.100
Token!
Token ID0
Feature activation-1.28e-02

Pos logit contributions
See+0.043
 +0.040
So+0.038
Ann+0.037
Conf+0.036
Neg logit contributions
working-0.100
work-0.092
 guys-0.089
sing-0.088
 warriors-0.084
Token The
Token ID383
Feature activation-1.80e-02

Pos logit contributions
 else+0.022
up+0.021
Ann+0.021
Lu+0.020
ight+0.020
Neg logit contributions
working-0.022
sing-0.020
work-0.020
 Whale-0.018
 battlefield-0.018
Token wild
Token ID4295
Feature activation+3.16e-02

Pos logit contributions
 +0.032
See+0.032
So+0.030
Lu+0.029
+0.029
Neg logit contributions
working-0.036
sing-0.033
work-0.031
 guys-0.030
 warriors-0.029
Token pig
Token ID12967
Feature activation+2.04e-02

Pos logit contributions
working+0.069
sing+0.067
work+0.061
 rook+0.060
 warriors+0.060
Neg logit contributions
 -0.048
See-0.048
 was-0.048
 p-0.044
-0.043
Token until
Token ID1566
Feature activation-1.96e-02

Pos logit contributions
See+0.023
 +0.023
So+0.020
Lu+0.020
+0.019
Neg logit contributions
working-0.079
sing-0.074
work-0.071
 guys-0.069
 warriors-0.068
Token it
Token ID340
Feature activation-4.03e-03

Pos logit contributions
So+0.026
 So+0.023
Conf+0.022
old+0.022
Lu+0.022
Neg logit contributions
working-0.049
 guys-0.044
work-0.043
sing-0.042
 soldier-0.038
Token was
Token ID373
Feature activation-8.57e-03

Pos logit contributions
 +0.004
See+0.004
old+0.004
uct+0.004
So+0.004
Neg logit contributions
working-0.010
sing-0.010
 guys-0.009
zo-0.009
work-0.009
Token so
Token ID523
Feature activation-2.01e-02

Pos logit contributions
So+0.013
See+0.013
 So+0.013
old+0.013
Ann+0.012
Neg logit contributions
working-0.018
sing-0.018
 guys-0.015
work-0.015
 spine-0.015
Token com
Token ID401
Feature activation-5.13e-02

Pos logit contributions
So+0.033
 So+0.032
Ann+0.032
ird+0.032
old+0.032
Neg logit contributions
working-0.040
sing-0.037
work-0.033
 guys-0.028
 spine-0.028
Tokenfy
Token ID24928
Feature activation-7.39e-02

Pos logit contributions
See+0.064
 +0.063
So+0.060
 much+0.058
+0.057
Neg logit contributions
working-0.125
sing-0.118
work-0.113
 guys-0.110
 warriors-0.109
Token it
Token ID340
Feature activation-6.91e-04

Pos logit contributions
See+0.079
 +0.077
So+0.073
Lu+0.068
uct+0.066
Neg logit contributions
working-0.199
sing-0.184
work-0.180
 guys-0.173
 warriors-0.168
Token felt
Token ID2936
Feature activation+7.13e-03

Pos logit contributions
See+0.001
 +0.001
So+0.001
 So+0.001
Ab+0.001
Neg logit contributions
working-0.002
sing-0.001
work-0.001
 guys-0.001
zo-0.001
Token like
Token ID588
Feature activation-2.21e-02

Pos logit contributions
working+0.014
sing+0.012
 guys+0.012
work+0.012
 soldier+0.011
Neg logit contributions
Ann-0.012
So-0.011
 So-0.011
See-0.011
 -0.010
Token a
Token ID257
Feature activation-1.61e-02

Pos logit contributions
 So+0.031
So+0.031
Conf+0.029
uct+0.029
 +0.028
Neg logit contributions
working-0.047
 guys-0.044
work-0.043
sing-0.042
 warriors-0.040
Token cozy
Token ID37438
Feature activation-3.12e-02

Pos logit contributions
uct+0.027
 +0.027
See+0.026
So+0.026
Lu+0.026
Neg logit contributions
working-0.031
sing-0.030
 warrior-0.027
 guys-0.027
work-0.027
Token the
Token ID262
Feature activation-2.21e-02

Pos logit contributions
So+0.026
 So+0.023
See+0.023
Conf+0.022
aw+0.021
Neg logit contributions
working-0.063
sing-0.057
work-0.056
 guys-0.054
 spine-0.050
Token sun
Token ID4252
Feature activation-1.04e-02

Pos logit contributions
 +0.035
See+0.035
So+0.033
Lu+0.032
uct+0.031
Neg logit contributions
working-0.048
sing-0.044
work-0.042
 guys-0.041
 warriors-0.039
Token came
Token ID1625
Feature activation+1.24e-02

Pos logit contributions
So+0.013
See+0.013
uct+0.012
 +0.012
Lu+0.012
Neg logit contributions
working-0.026
work-0.024
sing-0.024
 guys-0.022
fly-0.021
Token out
Token ID503
Feature activation+5.83e-03

Pos logit contributions
working+0.026
sing+0.024
work+0.023
 guys+0.022
 warriors+0.021
Neg logit contributions
See-0.020
 -0.019
So-0.019
Lu-0.018
uct-0.018
Token and
Token ID290
Feature activation-2.00e-02

Pos logit contributions
working+0.015
work+0.014
sing+0.014
 warriors+0.012
 guys+0.012
Neg logit contributions
See-0.006
Ann-0.006
Lu-0.006
 -0.006
uct-0.006
Token dried
Token ID16577
Feature activation-7.34e-02

Pos logit contributions
So+0.027
 So+0.026
See+0.025
aw+0.024
Conf+0.024
Neg logit contributions
working-0.046
sing-0.044
work-0.043
 guys-0.039
 soldier-0.038
Token up
Token ID510
Feature activation-3.73e-03

Pos logit contributions
See+0.094
So+0.088
 +0.082
Conf+0.082
uct+0.079
Neg logit contributions
working-0.176
sing-0.162
work-0.162
 guys-0.158
gy-0.152
Token all
Token ID477
Feature activation-7.53e-04

Pos logit contributions
See+0.005
So+0.004
 +0.004
Ann+0.004
Ph+0.004
Neg logit contributions
working-0.009
sing-0.008
work-0.008
 guys-0.008
 warriors-0.007
Token the
Token ID262
Feature activation-1.74e-03

Pos logit contributions
So+0.001
ird+0.001
Conf+0.001
 So+0.001
Ph+0.001
Neg logit contributions
working-0.001
work-0.001
ky-0.001
mer-0.001
fly-0.001
Token rain
Token ID6290
Feature activation-3.45e-02

Pos logit contributions
 +0.003
See+0.003
So+0.002
uct+0.002
Ph+0.002
Neg logit contributions
working-0.004
 guys-0.003
sing-0.003
work-0.003
 warriors-0.003
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
See+0.044
 +0.040
So+0.039
uct+0.038
Care+0.037
Neg logit contributions
working-0.089
work-0.084
sing-0.083
 guys-0.075
 spine-0.074
Token lunch
Token ID9965
Feature activation-6.82e-03

Pos logit contributions
See+0.004
 +0.004
So+0.003
Lu+0.003
+0.003
Neg logit contributions
working-0.005
sing-0.005
work-0.005
 guys-0.004
 warriors-0.004
Token and
Token ID290
Feature activation-1.83e-03

Pos logit contributions
See+0.009
 +0.009
So+0.008
Lu+0.008
uct+0.008
Neg logit contributions
working-0.016
sing-0.015
work-0.015
 guys-0.014
 warriors-0.014
Token it
Token ID340
Feature activation-3.53e-03

Pos logit contributions
So+0.002
Conf+0.002
See+0.002
Lu+0.002
Ph+0.002
Neg logit contributions
working-0.004
sing-0.004
work-0.004
 guys-0.004
 warriors-0.004
Token was
Token ID373
Feature activation-9.90e-03

Pos logit contributions
See+0.003
 +0.003
Ab+0.003
Ph+0.003
So+0.003
Neg logit contributions
working-0.009
work-0.009
 guys-0.008
sing-0.008
zo-0.008
Token so
Token ID523
Feature activation-2.43e-02

Pos logit contributions
old+0.015
Ann+0.015
Conf+0.015
aw+0.014
So+0.014
Neg logit contributions
working-0.019
sing-0.019
work-0.018
 guys-0.016
gy-0.014
Token y
Token ID331
Feature activation-7.31e-02

Pos logit contributions
Ann+0.031
Conf+0.027
See+0.027
ird+0.027
aw+0.027
Neg logit contributions
working-0.056
sing-0.051
work-0.048
 guys-0.042
zo-0.040
Tokenummy
Token ID13513
Feature activation-4.91e-02

Pos logit contributions
 +0.142
See+0.141
So+0.131
Lu+0.130
+0.127
Neg logit contributions
working-0.133
sing-0.120
work-0.114
 guys-0.107
 warriors-0.103
Token and
Token ID290
Feature activation-1.64e-02

Pos logit contributions
See+0.059
 +0.057
So+0.052
uct+0.051
Ph+0.050
Neg logit contributions
working-0.127
work-0.117
sing-0.112
 guys-0.109
 warriors-0.105
Token sweet
Token ID6029
Feature activation-3.24e-02

Pos logit contributions
So+0.022
Conf+0.021
aw+0.021
Ann+0.020
Ph+0.020
Neg logit contributions
working-0.038
work-0.035
sing-0.034
 guys-0.032
 crew-0.031
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
 +0.034
See+0.034
So+0.030
Ann+0.030
Ph+0.029
Neg logit contributions
working-0.087
work-0.081
sing-0.078
 guys-0.078
 photographer-0.072
Token\n
Token ID198
Feature activation-1.27e-02

Pos logit contributions
aw+0.014
uct+0.012
Ann+0.012
Jane+0.011
Lu+0.011
Neg logit contributions
working-0.037
work-0.035
sing-0.033
 crew-0.032
zo-0.032
Token and
Token ID290
Feature activation-2.45e-02

Pos logit contributions
See+0.011
 +0.011
Lu+0.010
So+0.010
uct+0.009
Neg logit contributions
working-0.030
sing-0.029
work-0.027
 guys-0.027
 warriors-0.026
Token Sarah
Token ID10490
Feature activation-3.23e-02

Pos logit contributions
So+0.031
Lu+0.028
Ab+0.028
S+0.028
See+0.028
Neg logit contributions
working-0.059
sing-0.056
work-0.055
 guys-0.052
 warriors-0.049
Token's
Token ID338
Feature activation-3.18e-03

Pos logit contributions
See+0.036
Care+0.033
Lu+0.033
Ab+0.032
uct+0.032
Neg logit contributions
working-0.075
sing-0.072
zo-0.071
work-0.069
gy-0.068
Token kitchen
Token ID9592
Feature activation-4.54e-02

Pos logit contributions
 +0.004
See+0.004
 was+0.004
So+0.004
 much+0.004
Neg logit contributions
working-0.008
sing-0.007
work-0.007
 guys-0.006
zo-0.006
Token was
Token ID373
Feature activation-1.56e-02

Pos logit contributions
See+0.047
uct+0.042
So+0.040
Ab+0.039
Ph+0.039
Neg logit contributions
working-0.120
sing-0.118
 guys-0.111
work-0.109
gy-0.102
Token clean
Token ID3424
Feature activation-7.28e-02

Pos logit contributions
 So+0.025
So+0.024
See+0.023
Ann+0.023
Lu+0.022
Neg logit contributions
sing-0.033
working-0.030
work-0.028
 crew-0.026
 guys-0.025
Token again
Token ID757
Feature activation-2.08e-03

Pos logit contributions
See+0.080
 +0.072
So+0.068
Lu+0.067
Care+0.066
Neg logit contributions
working-0.195
sing-0.180
work-0.178
 guys-0.173
 warriors-0.165
Token.
Token ID13
Feature activation-1.43e-02

Pos logit contributions
See+0.002
 +0.002
Lu+0.002
So+0.002
Ann+0.002
Neg logit contributions
working-0.006
sing-0.006
work-0.005
 guys-0.005
 warriors-0.005
Token
Token ID198
Feature activation-1.32e-02

Pos logit contributions
Ab+0.016
aw+0.016
Ph+0.015
S+0.015
See+0.015
Neg logit contributions
working-0.035
sing-0.032
work-0.030
 spine-0.029
 photographer-0.029
Token
Token ID198
Feature activation+6.38e-04

Pos logit contributions
aw+0.012
See+0.012
 cal+0.011
uct+0.011
ird+0.011
Neg logit contributions
working-0.032
sing-0.028
 tablet-0.028
zo-0.027
 battlefield-0.026
TokenSarah
Token ID29284
Feature activation-1.63e-02

Pos logit contributions
working+0.002
 photographer+0.001
sing+0.001
 guys+0.001
zo+0.001
Neg logit contributions
 -0.001
 Megan-0.001
 Sara-0.001
 p-0.001
 kisses-0.001
Tokennut
Token ID14930
Feature activation+6.79e-03

Pos logit contributions
See+0.008
 +0.008
So+0.007
Lu+0.007
+0.007
Neg logit contributions
working-0.018
sing-0.017
work-0.016
 guys-0.016
 warriors-0.015
Token and
Token ID290
Feature activation+1.85e-03

Pos logit contributions
working+0.017
sing+0.016
work+0.016
 guys+0.015
 warriors+0.015
Neg logit contributions
See-0.008
 -0.008
Lu-0.007
So-0.007
Ph-0.007
Token it
Token ID340
Feature activation-2.15e-04

Pos logit contributions
working+0.005
work+0.004
sing+0.004
 guys+0.004
 warriors+0.004
Neg logit contributions
Ph-0.002
See-0.002
So-0.002
 -0.002
Lu-0.002
Token looked
Token ID3114
Feature activation+1.65e-02

Pos logit contributions
See+0.000
 +0.000
So+0.000
Ph+0.000
uct+0.000
Neg logit contributions
working-0.001
work-0.001
sing-0.000
 guys-0.000
zo-0.000
Token so
Token ID523
Feature activation-2.02e-02

Pos logit contributions
working+0.036
sing+0.032
 guys+0.032
work+0.031
 warriors+0.029
Neg logit contributions
Ann-0.024
See-0.024
Lu-0.023
 -0.022
So-0.022
Token y
Token ID331
Feature activation-7.24e-02

Pos logit contributions
Ann+0.025
See+0.024
 +0.023
So+0.023
Conf+0.022
Neg logit contributions
working-0.049
sing-0.041
work-0.041
 guys-0.040
zo-0.037
Tokenummy
Token ID13513
Feature activation-3.99e-02

Pos logit contributions
See+0.145
 +0.145
So+0.134
Lu+0.133
Ann+0.131
Neg logit contributions
working-0.127
sing-0.113
work-0.108
 guys-0.102
 warriors-0.097
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
See+0.048
 +0.044
So+0.041
Ann+0.041
Ph+0.041
Neg logit contributions
working-0.104
work-0.094
 guys-0.089
sing-0.088
 warriors-0.086
Token She
Token ID1375
Feature activation-2.59e-02

Pos logit contributions
Ph+0.012
 else+0.012
Ann+0.011
See+0.011
Lu+0.011
Neg logit contributions
working-0.024
work-0.021
sing-0.020
 rook-0.020
 warriors-0.020
Token couldn
Token ID3521
Feature activation-3.83e-02

Pos logit contributions
See+0.031
 +0.029
Ph+0.029
Lu+0.027
Ann+0.027
Neg logit contributions
working-0.063
sing-0.057
work-0.056
zo-0.054
 warriors-0.053
Token't
Token ID470
Feature activation+8.18e-03

Pos logit contributions
See+0.069
So+0.064
Lu+0.063
 +0.063
aw+0.061
Neg logit contributions
working-0.073
work-0.062
sing-0.062
 guys-0.057
zo-0.054
Token inside
Token ID2641
Feature activation-1.52e-02

Pos logit contributions
working+0.008
sing+0.008
work+0.007
 guys+0.007
 warriors+0.007
Neg logit contributions
 -0.004
See-0.004
So-0.004
Lu-0.004
-0.004
Token the
Token ID262
Feature activation+6.12e-03

Pos logit contributions
See+0.023
Ph+0.021
So+0.021
Lu+0.020
uct+0.020
Neg logit contributions
working-0.032
sing-0.028
work-0.028
 warriors-0.026
 guys-0.026
Token cave
Token ID11527
Feature activation+2.17e-02

Pos logit contributions
working+0.010
sing+0.009
work+0.008
 guys+0.008
 warriors+0.008
Neg logit contributions
See-0.013
 -0.013
So-0.012
Lu-0.012
uct-0.012
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
working+0.054
sing+0.051
work+0.050
 warriors+0.046
 spine+0.046
Neg logit contributions
See-0.028
Lu-0.025
So-0.025
 -0.024
Ph-0.024
Token and
Token ID290
Feature activation+6.56e-03

Pos logit contributions
working+0.028
sing+0.028
work+0.025
 spine+0.024
 warriors+0.024
Neg logit contributions
See-0.019
So-0.019
Lu-0.019
Ph-0.018
Ann-0.018
Token curled
Token ID41108
Feature activation-7.18e-02

Pos logit contributions
working+0.016
sing+0.015
work+0.015
 guys+0.014
 warriors+0.014
Neg logit contributions
See-0.008
 -0.008
So-0.008
Lu-0.007
uct-0.007
Token up
Token ID510
Feature activation+7.21e-04

Pos logit contributions
See+0.088
 +0.087
So+0.081
Lu+0.078
+0.077
Neg logit contributions
working-0.180
sing-0.170
work-0.162
 guys-0.156
 warriors-0.153
Token into
Token ID656
Feature activation-1.71e-02

Pos logit contributions
working+0.001
sing+0.001
work+0.001
 spine+0.001
 guys+0.001
Neg logit contributions
See-0.001
So-0.001
 -0.001
Ann-0.001
Lu-0.001
Token a
Token ID257
Feature activation-8.64e-03

Pos logit contributions
So+0.023
uct+0.022
See+0.022
 So+0.021
aw+0.021
Neg logit contributions
working-0.038
sing-0.035
work-0.034
 guys-0.034
 warriors-0.033
Token tight
Token ID5381
Feature activation-4.20e-03

Pos logit contributions
uct+0.014
See+0.014
Ann+0.013
So+0.013
 +0.013
Neg logit contributions
working-0.017
sing-0.017
 warrior-0.015
 soldier-0.015
work-0.015
Token ball
Token ID2613
Feature activation-2.80e-02

Pos logit contributions
See+0.006
 +0.006
So+0.005
Lu+0.005
uct+0.005
Neg logit contributions
working-0.010
sing-0.009
work-0.009
 guys-0.009
 warriors-0.008
Token hole
Token ID7604
Feature activation+1.10e-02

Pos logit contributions
 +0.006
See+0.006
So+0.005
 was+0.005
 much+0.005
Neg logit contributions
working-0.014
sing-0.013
work-0.012
 guys-0.012
 photographer-0.012
Token and
Token ID290
Feature activation-2.76e-04

Pos logit contributions
working+0.028
sing+0.026
work+0.025
 guys+0.024
 warriors+0.024
Neg logit contributions
See-0.013
 -0.013
So-0.012
Lu-0.012
-0.011
Token it
Token ID340
Feature activation-3.17e-03

Pos logit contributions
See+0.000
 +0.000
So+0.000
Lu+0.000
uct+0.000
Neg logit contributions
working-0.001
sing-0.001
work-0.001
 guys-0.001
 warriors-0.001
Token was
Token ID373
Feature activation-7.24e-03

Pos logit contributions
See+0.003
 +0.003
Lu+0.003
S+0.002
So+0.002
Neg logit contributions
working-0.009
sing-0.008
work-0.008
 guys-0.008
 warriors-0.008
Token so
Token ID523
Feature activation-1.75e-02

Pos logit contributions
So+0.010
See+0.010
Lu+0.010
Ann+0.010
S+0.010
Neg logit contributions
sing-0.016
working-0.016
work-0.014
 warriors-0.014
 guys-0.014
Token cozy
Token ID37438
Feature activation-7.18e-02

Pos logit contributions
Lu+0.029
ird+0.028
Ann+0.028
See+0.028
So+0.027
Neg logit contributions
sing-0.035
working-0.035
work-0.030
 warriors-0.030
 crew-0.028
Token inside
Token ID2641
Feature activation-1.45e-02

Pos logit contributions
 +0.066
See+0.066
So+0.058
Lu+0.057
+0.055
Neg logit contributions
working-0.201
sing-0.190
work-0.182
 guys-0.177
 warriors-0.174
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
So+0.019
See+0.019
Ph+0.019
Lu+0.018
S+0.018
Neg logit contributions
working-0.033
work-0.031
sing-0.030
 guys-0.028
 ship-0.028
Token It
Token ID632
Feature activation-1.33e-03

Pos logit contributions
See+0.018
S+0.018
So+0.018
Lu+0.018
Ab+0.018
Neg logit contributions
working-0.023
sing-0.021
 warriors-0.020
work-0.020
 rook-0.019
Token was
Token ID373
Feature activation-2.08e-03

Pos logit contributions
See+0.001
 +0.001
Lu+0.001
S+0.001
uct+0.001
Neg logit contributions
working-0.004
sing-0.004
work-0.003
zo-0.003
 warriors-0.003
Token his
Token ID465
Feature activation-1.30e-02

Pos logit contributions
See+0.003
So+0.003
Lu+0.003
Ann+0.003
 +0.003
Neg logit contributions
working-0.005
sing-0.005
work-0.004
 guys-0.004
 crew-0.004
Token but
Token ID475
Feature activation-5.01e-03

Pos logit contributions
See+0.008
Lu+0.008
So+0.007
Ann+0.007
 +0.007
Neg logit contributions
working-0.010
sing-0.010
work-0.009
 spine-0.008
 warriors-0.008
Token it
Token ID340
Feature activation-1.22e-02

Pos logit contributions
See+0.007
Lu+0.006
So+0.006
S+0.006
 +0.006
Neg logit contributions
working-0.012
sing-0.011
work-0.010
 guys-0.010
 warrior-0.010
Token felt
Token ID2936
Feature activation-5.94e-03

Pos logit contributions
See+0.010
 +0.009
Ph+0.008
S+0.007
So+0.007
Neg logit contributions
working-0.034
sing-0.033
work-0.031
zo-0.030
 guys-0.030
Token so
Token ID523
Feature activation-1.84e-02

Pos logit contributions
Ann+0.010
Lu+0.009
See+0.009
So+0.009
uct+0.008
Neg logit contributions
working-0.012
sing-0.011
 guys-0.010
work-0.010
 warriors-0.009
Token com
Token ID401
Feature activation-4.71e-02

Pos logit contributions
Ann+0.031
See+0.028
Lu+0.028
ird+0.027
So+0.027
Neg logit contributions
working-0.037
sing-0.033
 guys-0.028
 spine-0.027
work-0.026
Tokenfy
Token ID24928
Feature activation-7.16e-02

Pos logit contributions
See+0.059
 +0.059
So+0.055
 much+0.054
+0.053
Neg logit contributions
working-0.114
sing-0.108
work-0.103
 guys-0.101
 warriors-0.099
Token and
Token ID290
Feature activation-1.96e-02

Pos logit contributions
See+0.078
 +0.074
So+0.071
Lu+0.068
uct+0.064
Neg logit contributions
working-0.191
sing-0.178
work-0.172
 guys-0.165
 warriors-0.162
Token warm
Token ID5814
Feature activation-5.65e-02

Pos logit contributions
So+0.029
Conf+0.028
aw+0.028
See+0.027
Ann+0.027
Neg logit contributions
working-0.041
sing-0.038
 guys-0.033
 spine-0.032
work-0.032
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
See+0.059
So+0.054
 +0.054
Conf+0.049
Ann+0.049
Neg logit contributions
working-0.155
work-0.143
sing-0.142
 guys-0.135
 spine-0.129
Token She
Token ID1375
Feature activation-2.89e-02

Pos logit contributions
Lu+0.011
Ab+0.010
See+0.009
uct+0.009
Ann+0.009
Neg logit contributions
working-0.044
sing-0.040
work-0.038
 guys-0.038
zo-0.038
Token decided
Token ID3066
Feature activation-2.81e-02

Pos logit contributions
See+0.041
 +0.037
Ph+0.037
 Instead+0.036
Lu+0.036
Neg logit contributions
working-0.061
sing-0.059
zo-0.055
 crew-0.053
 guys-0.052
Token some
Token ID617
Feature activation-1.06e-02

Pos logit contributions
working+0.003
 guys+0.002
sing+0.002
zo+0.002
 soldier+0.002
Neg logit contributions
See-0.001
 -0.001
uct-0.001
Ab-0.001
Lu-0.001
Token soap
Token ID19533
Feature activation-2.62e-02

Pos logit contributions
 +0.017
uct+0.017
See+0.016
Lu+0.016
Ann+0.016
Neg logit contributions
working-0.022
sing-0.019
 guys-0.019
work-0.018
 warriors-0.018
Token and
Token ID290
Feature activation+1.01e-02

Pos logit contributions
See+0.035
 +0.033
Lu+0.032
So+0.032
uct+0.031
Neg logit contributions
working-0.064
sing-0.059
work-0.058
 guys-0.055
 warriors-0.053
Token began
Token ID2540
Feature activation-4.94e-03

Pos logit contributions
working+0.024
 guys+0.021
sing+0.021
work+0.020
zo+0.019
Neg logit contributions
See-0.013
 -0.013
uct-0.013
Lu-0.013
Ab-0.012
Token to
Token ID284
Feature activation-1.14e-02

Pos logit contributions
See+0.008
So+0.008
 +0.008
Lu+0.008
uct+0.008
Neg logit contributions
working-0.010
sing-0.009
work-0.009
 guys-0.008
 warriors-0.008
Token clean
Token ID3424
Feature activation-7.13e-02

Pos logit contributions
See+0.022
 +0.021
Lu+0.020
So+0.020
S+0.020
Neg logit contributions
working-0.021
sing-0.019
work-0.017
 guys-0.017
zo-0.016
Token his
Token ID465
Feature activation-7.25e-03

Pos logit contributions
See+0.078
 +0.072
So+0.072
Lu+0.068
+0.065
Neg logit contributions
working-0.190
sing-0.176
work-0.170
 guys-0.168
 warriors-0.160
Token motorcycle
Token ID18757
Feature activation+8.92e-03

Pos logit contributions
 +0.012
See+0.012
So+0.011
 was+0.011
 much+0.011
Neg logit contributions
working-0.015
sing-0.014
work-0.013
 guys-0.012
 warriors-0.012
Token.
Token ID13
Feature activation-4.81e-03

Pos logit contributions
working+0.027
sing+0.025
work+0.024
 guys+0.024
 warriors+0.023
Neg logit contributions
See-0.007
 -0.007
So-0.006
Lu-0.006
-0.005
Token He
Token ID679
Feature activation+5.31e-03

Pos logit contributions
See+0.007
Lu+0.007
So+0.007
Ann+0.007
Ab+0.006
Neg logit contributions
working-0.011
sing-0.010
 guys-0.009
work-0.009
 spine-0.009
Token spread
Token ID4104
Feature activation-1.30e-02

Pos logit contributions
working+0.013
sing+0.011
zo+0.011
 guys+0.011
work+0.011
Neg logit contributions
See-0.007
Ab-0.006
Lu-0.006
 -0.006
S-0.006
Token set
Token ID900
Feature activation-3.08e-02

Pos logit contributions
working+0.005
sing+0.005
work+0.005
 guys+0.005
 warriors+0.005
Neg logit contributions
See-0.003
So-0.003
 -0.003
Lu-0.003
Ph-0.002
Token up
Token ID510
Feature activation-5.04e-03

Pos logit contributions
See+0.028
So+0.027
 +0.025
Lu+0.025
uct+0.024
Neg logit contributions
working-0.086
sing-0.081
work-0.079
 guys-0.077
 warriors-0.075
Token a
Token ID257
Feature activation-1.41e-02

Pos logit contributions
 +0.005
See+0.005
So+0.005
Lu+0.005
+0.004
Neg logit contributions
working-0.014
sing-0.013
work-0.012
 guys-0.012
 warriors-0.012
Token table
Token ID3084
Feature activation-5.05e-02

Pos logit contributions
See+0.024
uct+0.023
 +0.023
So+0.022
Ph+0.021
Neg logit contributions
working-0.028
sing-0.027
work-0.025
 guys-0.024
 rook-0.023
Token and
Token ID290
Feature activation-7.66e-03

Pos logit contributions
See+0.051
 +0.047
So+0.044
Lu+0.043
uct+0.042
Neg logit contributions
working-0.140
sing-0.132
work-0.127
 guys-0.122
 warriors-0.121
Token chairs
Token ID18791
Feature activation-7.13e-02

Pos logit contributions
See+0.010
 +0.010
So+0.010
Lu+0.009
Ph+0.009
Neg logit contributions
working-0.018
sing-0.017
work-0.016
 guys-0.016
 warriors-0.015
Token.
Token ID13
Feature activation+5.40e-03

Pos logit contributions
See+0.088
 +0.080
So+0.080
Lu+0.078
uct+0.074
Neg logit contributions
working-0.182
sing-0.170
work-0.166
 guys-0.156
 warriors-0.153
Token She
Token ID1375
Feature activation-2.71e-02

Pos logit contributions
working+0.013
sing+0.013
 warriors+0.011
 photographer+0.011
 rook+0.011
Neg logit contributions
uct-0.005
aw-0.005
So-0.005
Lu-0.005
See-0.005
Token invited
Token ID9392
Feature activation-1.67e-02

Pos logit contributions
See+0.034
 +0.032
So+0.031
Ph+0.030
uct+0.030
Neg logit contributions
working-0.066
sing-0.063
work-0.060
 guys-0.058
 warriors-0.057
Token her
Token ID607
Feature activation-2.68e-02

Pos logit contributions
See+0.015
 +0.013
So+0.013
Lu+0.012
+0.012
Neg logit contributions
working-0.047
sing-0.045
work-0.044
 guys-0.043
 warriors-0.042
Token other
Token ID584
Feature activation+1.12e-02

Pos logit contributions
See+0.040
 +0.039
So+0.037
Lu+0.037
uct+0.035
Neg logit contributions
working-0.060
sing-0.056
work-0.054
 guys-0.052
 warriors-0.050
Token she
Token ID673
Feature activation-1.92e-02

Pos logit contributions
See+0.028
 +0.026
So+0.026
Ann+0.026
Care+0.025
Neg logit contributions
working-0.040
work-0.037
sing-0.035
 crew-0.035
 guys-0.034
Token was
Token ID373
Feature activation-4.38e-04

Pos logit contributions
See+0.020
 +0.019
So+0.018
Lu+0.017
Ph+0.017
Neg logit contributions
working-0.052
sing-0.048
work-0.047
 guys-0.046
zo-0.044
Token safe
Token ID3338
Feature activation-1.98e-02

Pos logit contributions
See+0.001
So+0.001
 +0.001
Lu+0.001
Ann+0.001
Neg logit contributions
working-0.001
sing-0.001
work-0.001
 guys-0.001
 warriors-0.001
Token and
Token ID290
Feature activation-1.05e-02

Pos logit contributions
 +0.022
See+0.021
So+0.019
Lu+0.019
+0.019
Neg logit contributions
working-0.052
sing-0.049
work-0.047
 guys-0.046
 warriors-0.045
Token pl
Token ID458
Feature activation-8.08e-03

Pos logit contributions
See+0.013
Ann+0.012
So+0.012
Lu+0.012
 +0.012
Neg logit contributions
working-0.025
sing-0.023
work-0.021
 guys-0.021
 warriors-0.021
Tokenopped
Token ID38333
Feature activation-7.10e-02

Pos logit contributions
 +0.012
See+0.012
 was+0.011
 much+0.011
Lu+0.011
Neg logit contributions
working-0.017
sing-0.016
work-0.016
zo-0.014
 warriors-0.014
Token down
Token ID866
Feature activation+2.96e-03

Pos logit contributions
See+0.116
So+0.113
 +0.113
+0.107
Lu+0.104
Neg logit contributions
working-0.148
sing-0.132
 guys-0.128
work-0.127
 warriors-0.119
Token in
Token ID287
Feature activation+1.12e-02

Pos logit contributions
working+0.007
sing+0.006
work+0.006
 warriors+0.005
 guys+0.005
Neg logit contributions
See-0.004
So-0.004
 -0.004
Lu-0.004
Ann-0.004
Token her
Token ID607
Feature activation-1.82e-02

Pos logit contributions
working+0.026
sing+0.025
work+0.024
 warriors+0.023
 guys+0.023
Neg logit contributions
So-0.012
See-0.012
aw-0.012
uct-0.012
Ph-0.012
Token lap
Token ID14779
Feature activation-3.77e-02

Pos logit contributions
See+0.030
 +0.029
So+0.028
Lu+0.027
uct+0.026
Neg logit contributions
working-0.039
sing-0.037
work-0.034
 guys-0.033
 spine-0.032
Token.
Token ID13
Feature activation-5.01e-03

Pos logit contributions
See+0.040
 +0.032
So+0.031
Lu+0.031
uct+0.031
Neg logit contributions
working-0.104
sing-0.097
work-0.095
 guys-0.089
 spine-0.088
Token fish
Token ID5916
Feature activation+2.38e-02

Pos logit contributions
working+0.034
sing+0.034
work+0.031
 spine+0.030
 warriors+0.029
Neg logit contributions
 -0.024
 was-0.023
See-0.022
-0.022
 much-0.021
Token and
Token ID290
Feature activation+8.47e-03

Pos logit contributions
working+0.057
sing+0.054
work+0.050
 guys+0.049
 warriors+0.048
Neg logit contributions
 -0.032
See-0.030
So-0.029
 much-0.028
 was-0.028
Token it
Token ID340
Feature activation-5.79e-03

Pos logit contributions
working+0.020
sing+0.019
work+0.018
 guys+0.017
 warriors+0.017
Neg logit contributions
See-0.011
 -0.011
So-0.011
Lu-0.011
uct-0.010
Token was
Token ID373
Feature activation-6.69e-03

Pos logit contributions
See+0.005
 +0.004
So+0.004
Lu+0.004
uct+0.004
Neg logit contributions
working-0.017
sing-0.016
work-0.016
 guys-0.015
zo-0.015
Token so
Token ID523
Feature activation-1.92e-02

Pos logit contributions
See+0.010
Ann+0.009
Conf+0.009
Lu+0.009
Start+0.009
Neg logit contributions
working-0.015
sing-0.015
work-0.014
 guys-0.013
 crew-0.012
Token y
Token ID331
Feature activation-7.08e-02

Pos logit contributions
See+0.023
Ann+0.022
Lu+0.021
 +0.021
So+0.020
Neg logit contributions
working-0.049
sing-0.046
work-0.043
 guys-0.041
zo-0.039
Tokenummy
Token ID13513
Feature activation-4.77e-02

Pos logit contributions
 +0.124
See+0.123
So+0.118
Lu+0.116
 much+0.114
Neg logit contributions
working-0.139
sing-0.129
work-0.121
 guys-0.116
 warriors-0.114
Token!
Token ID0
Feature activation-8.69e-03

Pos logit contributions
See+0.040
 +0.039
So+0.034
Lu+0.033
uct+0.031
Neg logit contributions
working-0.139
sing-0.130
work-0.127
 guys-0.123
 warriors-0.121
Token Tim
Token ID5045
Feature activation-1.36e-02

Pos logit contributions
 else+0.013
See+0.013
Ann+0.013
up+0.013
Lu+0.013
Neg logit contributions
sing-0.017
working-0.016
work-0.015
gy-0.014
 guys-0.014
Tokenmy
Token ID1820
Feature activation-4.41e-03

Pos logit contributions
See+0.016
 +0.015
Lu+0.014
Ph+0.014
 much+0.013
Neg logit contributions
working-0.035
sing-0.033
work-0.032
zo-0.031
 guys-0.030
Token said
Token ID531
Feature activation-1.50e-02

Pos logit contributions
See+0.004
Lu+0.004
Ph+0.004
 +0.004
uct+0.004
Neg logit contributions
working-0.012
sing-0.011
work-0.011
zo-0.011
 guys-0.010
Token a
Token ID257
Feature activation-8.36e-03

Pos logit contributions
 +0.007
uct+0.007
See+0.007
Ph+0.006
Lu+0.006
Neg logit contributions
working-0.012
sing-0.010
work-0.010
 guys-0.010
 warriors-0.010
Token towel
Token ID24808
Feature activation-5.43e-02

Pos logit contributions
uct+0.013
See+0.013
 +0.013
So+0.012
Lu+0.012
Neg logit contributions
working-0.018
sing-0.017
work-0.015
 guys-0.015
 rook-0.015
Token and
Token ID290
Feature activation-1.17e-03

Pos logit contributions
See+0.072
 +0.067
uct+0.065
Lu+0.065
So+0.064
Neg logit contributions
working-0.130
sing-0.125
work-0.121
 guys-0.111
 warriors-0.108
Token started
Token ID2067
Feature activation-8.73e-03

Pos logit contributions
 +0.002
See+0.001
uct+0.001
So+0.001
Ph+0.001
Neg logit contributions
working-0.003
sing-0.002
 guys-0.002
work-0.002
 warriors-0.002
Token to
Token ID284
Feature activation-1.11e-02

Pos logit contributions
See+0.015
So+0.015
 +0.015
Lu+0.014
uct+0.014
Neg logit contributions
working-0.018
sing-0.016
work-0.015
 guys-0.013
 warriors-0.013
Token clean
Token ID3424
Feature activation-7.08e-02

Pos logit contributions
See+0.018
 +0.017
aw+0.016
So+0.016
uct+0.016
Neg logit contributions
working-0.024
sing-0.022
work-0.021
 guys-0.020
zo-0.019
Token the
Token ID262
Feature activation-6.50e-03

Pos logit contributions
See+0.093
So+0.090
 +0.081
Conf+0.080
uct+0.079
Neg logit contributions
working-0.171
sing-0.158
work-0.153
 guys-0.149
gy-0.139
Token floor
Token ID4314
Feature activation-3.77e-02

Pos logit contributions
See+0.010
 +0.010
So+0.010
Lu+0.009
 much+0.009
Neg logit contributions
working-0.014
sing-0.013
work-0.012
 guys-0.012
 warriors-0.012
Token.
Token ID13
Feature activation+8.60e-03

Pos logit contributions
See+0.042
Lu+0.035
 +0.035
So+0.035
Ph+0.034
Neg logit contributions
working-0.102
sing-0.097
work-0.093
 warriors-0.085
 spine-0.085
Token Anna
Token ID11735
Feature activation-2.52e-02

Pos logit contributions
sing+0.017
working+0.017
 tablet+0.015
 spine+0.015
 warriors+0.015
Neg logit contributions
Su-0.012
Ab-0.011
uct-0.011
Ph-0.011
ird-0.011
Token helped
Token ID4193
Feature activation+2.56e-03

Pos logit contributions
See+0.031
Ab+0.030
uct+0.030
Ph+0.030
 much+0.029
Neg logit contributions
working-0.056
sing-0.054
work-0.051
 warriors-0.049
zo-0.049
Token her
Token ID607
Feature activation-1.72e-02

Pos logit contributions
working+0.000
 guys+0.000
sing+0.000
work+0.000
 warriors+0.000
Neg logit contributions
uct-0.000
So-0.000
Conf-0.000
See-0.000
Lu-0.000
Token blanket
Token ID18447
Feature activation-2.89e-02

Pos logit contributions
 +0.025
See+0.025
So+0.023
Lu+0.022
+0.022
Neg logit contributions
working-0.040
sing-0.037
work-0.035
 guys-0.034
 warriors-0.033
Token and
Token ID290
Feature activation+5.03e-03

Pos logit contributions
See+0.036
 +0.035
So+0.032
Lu+0.032
+0.031
Neg logit contributions
working-0.073
sing-0.068
work-0.065
 guys-0.062
 warriors-0.061
Token felt
Token ID2936
Feature activation-1.38e-02

Pos logit contributions
working+0.012
sing+0.011
 guys+0.011
work+0.011
 warriors+0.010
Neg logit contributions
See-0.007
So-0.006
Lu-0.006
 -0.006
uct-0.006
Token so
Token ID523
Feature activation-1.58e-02

Pos logit contributions
Ann+0.026
Lu+0.023
So+0.022
ue+0.022
uct+0.022
Neg logit contributions
working-0.024
 guys-0.022
sing-0.021
 soldier-0.021
 crew-0.020
Token cozy
Token ID37438
Feature activation-7.07e-02

Pos logit contributions
Ann+0.025
Lu+0.025
See+0.025
ion+0.024
So+0.023
Neg logit contributions
working-0.032
sing-0.028
 guys-0.025
 spine-0.024
zo-0.023
Token.
Token ID13
Feature activation-7.81e-03

Pos logit contributions
 +0.056
See+0.055
So+0.047
Lu+0.046
+0.044
Neg logit contributions
working-0.208
sing-0.197
work-0.189
 guys-0.184
 warriors-0.181
Token She
Token ID1375
Feature activation-2.23e-02

Pos logit contributions
See+0.009
So+0.009
Lu+0.009
Ann+0.008
aw+0.008
Neg logit contributions
working-0.020
sing-0.018
 guys-0.017
 spine-0.016
work-0.016
Token said
Token ID531
Feature activation-1.10e-02

Pos logit contributions
See+0.033
Lu+0.031
 +0.030
uct+0.029
Ann+0.029
Neg logit contributions
working-0.048
sing-0.046
zo-0.043
 guys-0.042
 crew-0.041
Token to
Token ID284
Feature activation+1.47e-03

Pos logit contributions
See+0.017
uct+0.016
So+0.016
Lu+0.016
Ann+0.016
Neg logit contributions
working-0.024
 guys-0.021
sing-0.021
work-0.020
 warriors-0.019
Token her
Token ID607
Feature activation-4.01e-02

Pos logit contributions
working+0.004
sing+0.004
work+0.004
 guys+0.004
 warriors+0.004
Neg logit contributions
See-0.001
 -0.001
So-0.001
Lu-0.001
-0.001
Token in
Token ID287
Feature activation+2.15e-02

Pos logit contributions
working+0.025
sing+0.024
work+0.023
 warriors+0.021
 guys+0.021
Neg logit contributions
See-0.010
 -0.009
Lu-0.009
So-0.008
Ph-0.008
Token his
Token ID465
Feature activation+9.01e-03

Pos logit contributions
working+0.053
sing+0.050
work+0.048
 warriors+0.046
 guys+0.046
Neg logit contributions
So-0.024
See-0.024
Lu-0.023
uct-0.023
Ph-0.023
Token cardboard
Token ID26819
Feature activation-1.14e-02

Pos logit contributions
working+0.022
sing+0.020
work+0.019
 guys+0.018
zo+0.018
Neg logit contributions
 -0.013
See-0.012
So-0.011
 was-0.011
Lu-0.011
Token box
Token ID3091
Feature activation-2.35e-02

Pos logit contributions
See+0.018
 +0.017
Lu+0.017
+0.016
uct+0.016
Neg logit contributions
working-0.024
sing-0.023
work-0.022
 warriors-0.021
 guys-0.020
Token and
Token ID290
Feature activation+3.38e-03

Pos logit contributions
See+0.025
Lu+0.022
 +0.021
Ann+0.020
Ph+0.020
Neg logit contributions
working-0.064
sing-0.061
work-0.058
 spine-0.055
 warriors-0.055
Token curled
Token ID41108
Feature activation-7.07e-02

Pos logit contributions
working+0.008
sing+0.008
work+0.007
 guys+0.007
 warriors+0.007
Neg logit contributions
See-0.004
 -0.004
So-0.004
Lu-0.004
uct-0.004
Token up
Token ID510
Feature activation+2.47e-03

Pos logit contributions
See+0.086
 +0.085
So+0.079
Lu+0.077
+0.075
Neg logit contributions
working-0.178
sing-0.168
work-0.160
 guys-0.154
 warriors-0.151
Token for
Token ID329
Feature activation+7.57e-03

Pos logit contributions
working+0.005
sing+0.005
work+0.005
 guys+0.004
 spine+0.004
Neg logit contributions
See-0.004
So-0.004
Ann-0.004
 -0.004
Lu-0.004
Token a
Token ID257
Feature activation-2.47e-03

Pos logit contributions
working+0.018
sing+0.017
 guys+0.017
work+0.016
 warriors+0.016
Neg logit contributions
See-0.009
So-0.009
Lu-0.008
 -0.008
-0.008
Token nap
Token ID25422
Feature activation+3.48e-03

Pos logit contributions
See+0.004
 +0.004
So+0.004
Lu+0.004
uct+0.004
Neg logit contributions
working-0.005
sing-0.005
work-0.005
 guys-0.004
 warriors-0.004
Token.
Token ID13
Feature activation-3.58e-03

Pos logit contributions
working+0.010
sing+0.010
work+0.009
 guys+0.009
 warriors+0.009
Neg logit contributions
See-0.003
 -0.003
So-0.002
Lu-0.002
-0.002
Token it
Token ID340
Feature activation+1.16e-03

Pos logit contributions
Ann+0.061
Lu+0.057
See+0.056
ird+0.054
Ph+0.053
Neg logit contributions
working-0.110
 guys-0.100
work-0.100
sing-0.098
 warriors-0.093
Token,
Token ID11
Feature activation-6.00e-03

Pos logit contributions
working+0.003
work+0.003
sing+0.003
 guys+0.003
zo+0.002
Neg logit contributions
See-0.001
 -0.001
Ph-0.001
Ann-0.001
Lu-0.001
Token it
Token ID340
Feature activation-8.19e-03

Pos logit contributions
See+0.008
 +0.008
Lu+0.008
So+0.008
Ann+0.007
Neg logit contributions
working-0.014
sing-0.013
work-0.012
 guys-0.012
 warriors-0.012
Token was
Token ID373
Feature activation-9.64e-03

Pos logit contributions
See+0.008
 +0.008
Ph+0.007
So+0.007
Lu+0.007
Neg logit contributions
working-0.023
work-0.021
sing-0.021
 guys-0.020
zo-0.019
Token so
Token ID523
Feature activation-1.76e-02

Pos logit contributions
Ann+0.013
See+0.013
So+0.013
Ph+0.012
 So+0.012
Neg logit contributions
working-0.022
sing-0.021
work-0.020
 guys-0.018
 miner-0.017
Token y
Token ID331
Feature activation-7.06e-02

Pos logit contributions
Ann+0.024
See+0.023
Lu+0.022
So+0.022
 +0.022
Neg logit contributions
working-0.041
sing-0.038
work-0.035
 guys-0.033
 miner-0.031
Tokenummy
Token ID13513
Feature activation-4.49e-02

Pos logit contributions
See+0.130
 +0.130
So+0.123
Lu+0.122
+0.119
Neg logit contributions
working-0.133
sing-0.122
work-0.114
 guys-0.109
 warriors-0.106
Token!
Token ID0
Feature activation-1.58e-02

Pos logit contributions
See+0.049
 +0.047
So+0.043
Ph+0.041
uct+0.041
Neg logit contributions
working-0.122
work-0.111
sing-0.109
 guys-0.106
 warriors-0.103
Token
Token ID198
Feature activation-1.60e-02

Pos logit contributions
up+0.018
 cal+0.018
 else+0.017
aw+0.017
 calmed+0.016
Neg logit contributions
working-0.033
sing-0.031
work-0.030
 spine-0.028
 photographer-0.028
Token
Token ID198
Feature activation-2.07e-03

Pos logit contributions
aw+0.018
 cal+0.018
 p+0.017
up+0.017
ird+0.017
Neg logit contributions
working-0.029
zo-0.027
sing-0.026
ico-0.025
 tablet-0.025
TokenThe
Token ID464
Feature activation-1.64e-02

Pos logit contributions
 kisses+0.003
 +0.003
 says+0.002
 sits+0.002
 Instead+0.002
Neg logit contributions
working-0.004
 photographer-0.004
 warrior-0.004
zo-0.004
 warriors-0.003
Token mom
Token ID1995
Feature activation-1.64e-02

Pos logit contributions
working+0.025
sing+0.023
work+0.022
 guys+0.022
 warriors+0.021
Neg logit contributions
See-0.012
 -0.012
So-0.011
Lu-0.011
-0.011
Token and
Token ID290
Feature activation+1.83e-02

Pos logit contributions
See+0.024
 +0.021
uct+0.020
Ph+0.020
So+0.020
Neg logit contributions
working-0.039
work-0.035
sing-0.035
 guys-0.033
 warriors-0.032
Token went
Token ID1816
Feature activation-1.78e-02

Pos logit contributions
working+0.046
sing+0.042
work+0.041
 guys+0.041
 warriors+0.039
Neg logit contributions
See-0.023
 -0.022
Lu-0.020
So-0.020
Ann-0.019
Token to
Token ID284
Feature activation-9.69e-03

Pos logit contributions
 +0.025
See+0.025
So+0.023
Lu+0.023
+0.022
Neg logit contributions
working-0.041
sing-0.038
work-0.036
 guys-0.035
 warriors-0.034
Token their
Token ID511
Feature activation-1.71e-02

Pos logit contributions
See+0.011
 +0.011
So+0.010
Lu+0.010
+0.010
Neg logit contributions
working-0.025
sing-0.024
work-0.022
 guys-0.022
 warriors-0.021
Token rooms
Token ID9519
Feature activation-7.05e-02

Pos logit contributions
 +0.021
See+0.020
So+0.019
Lu+0.018
 was+0.018
Neg logit contributions
working-0.043
sing-0.040
work-0.038
 guys-0.037
zo-0.037
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
See+0.076
 +0.066
Lu+0.064
So+0.063
uct+0.060
Neg logit contributions
working-0.192
sing-0.181
work-0.175
 guys-0.166
 warriors-0.164
Token They
Token ID1119
Feature activation-1.89e-04

Pos logit contributions
working+0.027
sing+0.026
work+0.023
 photographer+0.023
 guys+0.022
Neg logit contributions
uct-0.013
See-0.012
Lu-0.012
So-0.012
-0.011
Token cried
Token ID16896
Feature activation+4.51e-03

Pos logit contributions
See+0.000
 +0.000
Lu+0.000
So+0.000
Care+0.000
Neg logit contributions
working-0.000
sing-0.000
work-0.000
 guys-0.000
zo-0.000
Token and
Token ID290
Feature activation-3.80e-03

Pos logit contributions
working+0.012
sing+0.012
work+0.011
 photographer+0.011
 warrior+0.011
Neg logit contributions
 -0.005
See-0.004
 much-0.004
So-0.004
 was-0.004
Token wished
Token ID16555
Feature activation-2.46e-02

Pos logit contributions
See+0.004
Lu+0.004
+0.003
Ab+0.003
So+0.003
Neg logit contributions
working-0.010
sing-0.010
 guys-0.009
work-0.009
 warriors-0.008
Token.
Token ID13
Feature activation+2.39e-02

Pos logit contributions
 +0.011
See+0.011
So+0.009
 was+0.009
 much+0.009
Neg logit contributions
working-0.051
sing-0.048
work-0.046
 guys-0.045
zo-0.045
Token It
Token ID632
Feature activation+8.81e-03

Pos logit contributions
working+0.059
sing+0.058
work+0.055
 guys+0.054
 warriors+0.053
Neg logit contributions
See-0.027
So-0.025
Lu-0.025
 -0.025
uct-0.025
Token belonged
Token ID19611
Feature activation-5.24e-04

Pos logit contributions
working+0.022
sing+0.021
work+0.021
zo+0.020
 guys+0.020
Neg logit contributions
See-0.010
So-0.009
 -0.009
Lu-0.009
uct-0.009
Token to
Token ID284
Feature activation-5.89e-03

Pos logit contributions
 +0.001
See+0.001
So+0.001
Lu+0.001
+0.001
Neg logit contributions
working-0.001
sing-0.001
work-0.001
 guys-0.001
 warriors-0.001
Token their
Token ID511
Feature activation-9.97e-03

Pos logit contributions
 +0.006
See+0.006
So+0.005
Lu+0.005
 was+0.005
Neg logit contributions
working-0.016
sing-0.015
work-0.015
 guys-0.014
 warriors-0.014
Token grand
Token ID4490
Feature activation+5.00e-02

Pos logit contributions
 +0.013
See+0.012
 was+0.012
So+0.012
 much+0.011
Neg logit contributions
working-0.025
sing-0.023
work-0.022
 guys-0.021
zo-0.021
Tokenpa
Token ID8957
Feature activation+4.04e-02

Pos logit contributions
working+0.110
sing+0.100
work+0.090
 warrior+0.088
fly+0.086
Neg logit contributions
 was-0.075
 -0.071
 p-0.071
 is-0.070
 kisses-0.069
Token,
Token ID11
Feature activation+2.06e-02

Pos logit contributions
working+0.110
sing+0.104
work+0.102
 guys+0.097
 warriors+0.095
Neg logit contributions
See-0.043
 -0.037
Lu-0.036
So-0.035
Ph-0.034
Token who
Token ID508
Feature activation+2.05e-02

Pos logit contributions
working+0.049
sing+0.047
work+0.044
 guys+0.042
 warriors+0.042
Neg logit contributions
See-0.027
 -0.027
So-0.025
Lu-0.025
uct-0.024
Token did
Token ID750
Feature activation+2.58e-02

Pos logit contributions
working+0.059
work+0.053
sing+0.053
 guys+0.051
zo+0.050
Neg logit contributions
See-0.018
 -0.017
So-0.016
Lu-0.015
uct-0.015
Token not
Token ID407
Feature activation+2.24e-02

Pos logit contributions
working+0.057
sing+0.052
work+0.049
 guys+0.047
 warriors+0.046
Neg logit contributions
See-0.040
 -0.038
Lu-0.037
So-0.036
 So-0.035
Token troubled
Token ID17840
Feature activation+5.50e-03

Pos logit contributions
working+0.047
sing+0.046
work+0.045
 guys+0.043
 warriors+0.042
Neg logit contributions
 -0.025
 much-0.024
See-0.024
 was-0.022
Lu-0.022
Token.
Token ID13
Feature activation+6.34e-03

Pos logit contributions
working+0.016
sing+0.015
work+0.014
 guys+0.014
 warriors+0.013
Neg logit contributions
See-0.006
 -0.005
So-0.005
uct-0.004
Lu-0.004
Token He
Token ID679
Feature activation+2.79e-02

Pos logit contributions
working+0.016
sing+0.015
work+0.014
 guys+0.013
 warriors+0.013
Neg logit contributions
See-0.008
So-0.008
 -0.008
Lu-0.007
-0.007
Token had
Token ID550
Feature activation+2.46e-02

Pos logit contributions
working+0.079
sing+0.075
work+0.072
 guys+0.070
 warriors+0.068
Neg logit contributions
See-0.025
 -0.025
So-0.022
Lu-0.022
-0.020
Token a
Token ID257
Feature activation+1.06e-02

Pos logit contributions
working+0.064
sing+0.058
work+0.056
 guys+0.055
 warriors+0.054
Neg logit contributions
 -0.028
See-0.028
So-0.027
Lu-0.026
uct-0.025
Token pistol
Token ID16790
Feature activation+5.13e-02

Pos logit contributions
working+0.028
work+0.027
sing+0.026
 warriors+0.025
zo+0.025
Neg logit contributions
 -0.011
 much-0.011
 p-0.010
 was-0.010
See-0.010
Token,
Token ID11
Feature activation+2.19e-02

Pos logit contributions
working+0.142
sing+0.141
work+0.131
 photographer+0.129
 guys+0.128
Neg logit contributions
 was-0.044
 -0.041
 much-0.038
-0.036
-0.035
Token and
Token ID290
Feature activation+1.28e-02

Pos logit contributions
working+0.056
sing+0.052
work+0.050
 guys+0.048
 warriors+0.048
Neg logit contributions
 -0.026
See-0.026
So-0.023
Lu-0.023
 was-0.023
Token he
Token ID339
Feature activation+1.85e-02

Pos logit contributions
working+0.032
sing+0.029
 guys+0.028
work+0.028
 warriors+0.027
Neg logit contributions
See-0.016
 -0.015
So-0.015
Lu-0.014
uct-0.014
Token wanted
Token ID2227
Feature activation+1.22e-02

Pos logit contributions
working+0.050
sing+0.046
work+0.044
 guys+0.043
zo+0.043
Neg logit contributions
See-0.020
 -0.019
So-0.018
Lu-0.018
uct-0.017
Token to
Token ID284
Feature activation+2.99e-03

Pos logit contributions
working+0.028
sing+0.027
work+0.025
 guys+0.024
 warriors+0.024
Neg logit contributions
 -0.017
See-0.017
So-0.016
Lu-0.016
-0.015
Token."
Token ID526
Feature activation-2.21e-02

Pos logit contributions
See+0.034
 +0.034
So+0.031
Lu+0.031
uct+0.031
Neg logit contributions
working-0.052
sing-0.048
work-0.046
 guys-0.044
 warriors-0.043
Token
Token ID220
Feature activation-1.38e-02

Pos logit contributions
See+0.014
Ph+0.012
Lu+0.012
Care+0.012
Ann+0.012
Neg logit contributions
working-0.062
work-0.061
sing-0.061
 spine-0.056
 photographer-0.056
Token
Token ID198
Feature activation-1.69e-02

Pos logit contributions
See+0.015
 p+0.013
aw+0.013
Care+0.013
 cal+0.013
Neg logit contributions
working-0.032
sing-0.031
work-0.029
gy-0.028
zo-0.027
Token
Token ID198
Feature activation-2.23e-03

Pos logit contributions
See+0.012
uct+0.011
aw+0.011
 cal+0.010
 p+0.010
Neg logit contributions
working-0.046
sing-0.043
work-0.042
zo-0.041
 photographer-0.040
TokenThe
Token ID464
Feature activation-1.85e-02

Pos logit contributions
 +0.002
See+0.002
 Sue+0.002
uct+0.002
Ph+0.002
Neg logit contributions
working-0.006
sing-0.005
work-0.005
 photographer-0.005
zo-0.005
Token curious
Token ID11040
Feature activation+5.00e-02

Pos logit contributions
 +0.021
See+0.020
 was+0.020
So+0.019
 much+0.019
Neg logit contributions
working-0.048
sing-0.047
work-0.044
zo-0.042
 spine-0.041
Token girl
Token ID2576
Feature activation-9.93e-03

Pos logit contributions
working+0.128
sing+0.123
work+0.115
zo+0.109
 guys+0.109
Neg logit contributions
 -0.057
See-0.054
 was-0.052
Lu-0.051
So-0.051
Token thought
Token ID1807
Feature activation+1.01e-02

Pos logit contributions
See+0.010
 +0.009
Lu+0.009
 much+0.008
Ph+0.008
Neg logit contributions
working-0.027
sing-0.025
work-0.025
 guys-0.024
zo-0.023
Token about
Token ID546
Feature activation-4.94e-03

Pos logit contributions
working+0.025
sing+0.023
work+0.022
 guys+0.022
 warriors+0.021
Neg logit contributions
See-0.013
 -0.012
So-0.012
Lu-0.011
S-0.011
Token this
Token ID428
Feature activation+2.30e-02

Pos logit contributions
See+0.005
 +0.004
So+0.004
Lu+0.004
Ph+0.004
Neg logit contributions
working-0.014
sing-0.013
work-0.012
 guys-0.012
 warriors-0.012
Token for
Token ID329
Feature activation+1.45e-02

Pos logit contributions
working+0.054
sing+0.050
work+0.048
 guys+0.046
 warriors+0.045
Neg logit contributions
 -0.032
See-0.032
So-0.029
Lu-0.029
-0.028
Token the
Token ID262
Feature activation-5.31e-03

Pos logit contributions
working+0.005
 guys+0.005
work+0.005
 warriors+0.005
zo+0.004
Neg logit contributions
See-0.002
 -0.002
Ph-0.002
uct-0.002
So-0.002
Token man
Token ID582
Feature activation+6.15e-02

Pos logit contributions
 +0.008
See+0.007
So+0.007
 was+0.007
 much+0.007
Neg logit contributions
working-0.013
sing-0.012
work-0.011
zo-0.010
 guys-0.010
Token had
Token ID550
Feature activation+3.17e-02

Pos logit contributions
working+0.171
sing+0.159
work+0.155
 guys+0.149
 warriors+0.147
Neg logit contributions
See-0.059
 -0.058
So-0.052
Lu-0.051
 much-0.048
Token for
Token ID329
Feature activation+2.25e-02

Pos logit contributions
working+0.078
sing+0.070
work+0.070
 guys+0.068
 warriors+0.066
Neg logit contributions
See-0.040
 -0.040
Lu-0.038
So-0.037
Ann-0.036
Token them
Token ID606
Feature activation+8.10e-03

Pos logit contributions
working+0.061
sing+0.058
work+0.055
 guys+0.053
 warriors+0.052
Neg logit contributions
 -0.023
See-0.022
So-0.020
 was-0.020
 much-0.020
Token every
Token ID790
Feature activation+5.33e-02

Pos logit contributions
working+0.023
sing+0.021
work+0.021
 guys+0.020
 warriors+0.019
Neg logit contributions
See-0.008
 -0.007
Lu-0.007
So-0.007
Ann-0.006
Token day
Token ID1110
Feature activation+6.64e-03

Pos logit contributions
working+0.115
sing+0.107
work+0.101
 guys+0.097
 warriors+0.095
Neg logit contributions
 -0.084
See-0.084
So-0.078
Lu-0.077
-0.076
Token!
Token ID0
Feature activation-1.18e-02

Pos logit contributions
working+0.020
sing+0.019
work+0.018
 guys+0.018
 warriors+0.017
Neg logit contributions
 -0.005
See-0.005
So-0.004
Lu-0.004
-0.004
Token<|endoftext|>
Token ID50256
Feature activation-1.19e-02

Pos logit contributions
Lu+0.010
See+0.010
uct+0.010
Ph+0.009
Ann+0.009
Neg logit contributions
working-0.031
work-0.028
zo-0.028
sing-0.028
 warrior-0.027
TokenOnce
Token ID7454
Feature activation+6.24e-03

Pos logit contributions
uct+0.016
So+0.015
See+0.014
Ph+0.014
 else+0.014
Neg logit contributions
working-0.024
work-0.023
 guys-0.022
zo-0.021
 warriors-0.021
Token upon
Token ID2402
Feature activation-1.97e-02

Pos logit contributions
working+0.013
sing+0.011
work+0.010
 guys+0.010
zo+0.010
Neg logit contributions
See-0.011
So-0.010
Lu-0.010
Care-0.010
aw-0.010
Token squirrel
Token ID33039
Feature activation+2.58e-02

Pos logit contributions
See+0.017
 +0.017
Lu+0.016
So+0.016
uct+0.016
Neg logit contributions
working-0.022
sing-0.021
 guys-0.019
work-0.019
 warriors-0.019
Token couldn
Token ID3521
Feature activation-3.20e-02

Pos logit contributions
working+0.069
sing+0.061
zo+0.060
work+0.060
fly+0.058
Neg logit contributions
See-0.027
Ph-0.024
Care-0.022
Lu-0.021
uct-0.021
Token't
Token ID470
Feature activation+1.54e-02

Pos logit contributions
See+0.054
 +0.051
So+0.050
Lu+0.050
uct+0.049
Neg logit contributions
working-0.065
sing-0.058
work-0.056
 guys-0.053
zo-0.051
Token believe
Token ID1975
Feature activation-5.99e-03

Pos logit contributions
working+0.035
work+0.032
sing+0.032
 photographer+0.029
 battlefield+0.029
Neg logit contributions
-0.022
 -0.022
 was-0.021
So-0.021
 much-0.020
Token his
Token ID465
Feature activation-1.53e-02

Pos logit contributions
 +0.005
See+0.005
So+0.005
Lu+0.005
+0.004
Neg logit contributions
working-0.017
sing-0.016
work-0.015
 guys-0.015
 warriors-0.015
Token luck
Token ID8458
Feature activation+5.36e-02

Pos logit contributions
See+0.026
 +0.025
Lu+0.024
So+0.024
Ann+0.023
Neg logit contributions
working-0.031
sing-0.030
work-0.027
 guys-0.027
 warriors-0.026
Token!
Token ID0
Feature activation-7.49e-03

Pos logit contributions
working+0.159
sing+0.148
work+0.143
 guys+0.139
 warriors+0.137
Neg logit contributions
See-0.044
 -0.042
So-0.037
Lu-0.036
uct-0.033
Token He
Token ID679
Feature activation+1.25e-02

Pos logit contributions
See+0.010
Lu+0.009
 else+0.009
Ann+0.009
up+0.009
Neg logit contributions
working-0.015
sing-0.015
 warriors-0.015
 warrior-0.014
 rook-0.014
Token filled
Token ID5901
Feature activation-1.52e-02

Pos logit contributions
working+0.029
sing+0.027
zo+0.026
 warrior+0.026
 warriors+0.026
Neg logit contributions
See-0.015
Lu-0.014
 -0.014
Ph-0.014
S-0.014
Token his
Token ID465
Feature activation+1.89e-02

Pos logit contributions
See+0.015
 +0.015
So+0.014
Lu+0.013
+0.013
Neg logit contributions
working-0.041
sing-0.039
work-0.037
 guys-0.036
 warriors-0.035
Token tum
Token ID11814
Feature activation-9.34e-03

Pos logit contributions
working+0.040
sing+0.039
work+0.035
 guys+0.035
 warriors+0.034
Neg logit contributions
See-0.030
 -0.028
So-0.028
Lu-0.028
Ann-0.027
Token something
Token ID1223
Feature activation+9.21e-03

Pos logit contributions
working+0.071
sing+0.066
work+0.063
 guys+0.062
 warriors+0.060
Neg logit contributions
See-0.034
 -0.033
So-0.031
Lu-0.031
-0.029
Token bad
Token ID2089
Feature activation+3.52e-03

Pos logit contributions
working+0.023
sing+0.022
work+0.021
 guys+0.020
 warriors+0.019
Neg logit contributions
See-0.012
 -0.011
So-0.010
Lu-0.010
uct-0.010
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
working+0.011
sing+0.010
work+0.010
 guys+0.010
 warriors+0.010
Neg logit contributions
 -0.002
See-0.002
So-0.002
Lu-0.002
-0.002
Token
Token ID198
Feature activation+6.87e-03

Pos logit contributions
working+0.034
sing+0.034
work+0.031
 photographer+0.030
 warriors+0.030
Neg logit contributions
uct-0.018
See-0.016
Lu-0.015
ight-0.015
ird-0.015
Token
Token ID198
Feature activation+1.72e-02

Pos logit contributions
working+0.015
sing+0.015
zo+0.014
 tablet+0.013
work+0.013
Neg logit contributions
uct-0.008
ird-0.007
See-0.007
rol-0.006
aw-0.006
Token"
Token ID1
Feature activation+2.24e-02

Pos logit contributions
working+0.043
sing+0.038
 photographer+0.038
work+0.036
zo+0.036
Neg logit contributions
uct-0.019
See-0.018
 -0.018
 Instead-0.016
 kisses-0.016
TokenWe
Token ID1135
Feature activation-2.36e-03

Pos logit contributions
working+0.061
sing+0.058
work+0.056
 guys+0.054
 warriors+0.053
Neg logit contributions
 -0.022
See-0.022
So-0.020
Lu-0.019
-0.019
Token are
Token ID389
Feature activation+7.33e-03

Pos logit contributions
See+0.003
Lu+0.003
 +0.003
uct+0.002
So+0.002
Neg logit contributions
working-0.006
sing-0.006
 guys-0.006
work-0.005
 photographer-0.005
Token sorry
Token ID7926
Feature activation+1.05e-02

Pos logit contributions
working+0.017
sing+0.016
work+0.015
 guys+0.015
 warriors+0.014
Neg logit contributions
See-0.011
So-0.010
 -0.010
Lu-0.010
Ann-0.009
Token,
Token ID11
Feature activation+2.90e-03

Pos logit contributions
working+0.026
sing+0.025
 guys+0.024
work+0.024
 warriors+0.023
Neg logit contributions
See-0.014
 -0.012
uct-0.012
Lu-0.012
So-0.011
Token Mom
Token ID11254
Feature activation-1.94e-02

Pos logit contributions
sing+0.007
working+0.007
work+0.007
 guys+0.007
 warriors+0.007
Neg logit contributions
See-0.003
 -0.003
Lu-0.003
So-0.003
Ph-0.003
Token He
Token ID679
Feature activation+1.46e-02

Pos logit contributions
See+0.011
Lu+0.010
 cal+0.010
 kisses+0.010
Ann+0.010
Neg logit contributions
working-0.012
sing-0.010
 guys-0.010
 spine-0.009
 battlefield-0.009
Token asked
Token ID1965
Feature activation+1.33e-02

Pos logit contributions
working+0.035
sing+0.033
 guys+0.032
zo+0.031
work+0.030
Neg logit contributions
See-0.019
 -0.017
Lu-0.017
So-0.016
uct-0.016
Token his
Token ID465
Feature activation-8.45e-03

Pos logit contributions
working+0.036
sing+0.034
work+0.032
 guys+0.031
 warriors+0.030
Neg logit contributions
 -0.014
See-0.014
So-0.013
Lu-0.013
 was-0.012
Token mom
Token ID1995
Feature activation-2.26e-03

Pos logit contributions
 +0.010
See+0.010
So+0.009
 was+0.009
+0.009
Neg logit contributions
working-0.022
sing-0.020
work-0.019
 guys-0.018
zo-0.018
Token what
Token ID644
Feature activation+1.03e-02

Pos logit contributions
 +0.002
See+0.002
So+0.002
+0.002
 was+0.002
Neg logit contributions
working-0.006
sing-0.006
work-0.005
 guys-0.005
 warriors-0.005
Token was
Token ID373
Feature activation+2.20e-02

Pos logit contributions
working+0.029
 guys+0.026
work+0.025
sing+0.025
zo+0.025
Neg logit contributions
See-0.009
uct-0.009
 -0.009
So-0.008
Ph-0.008
Token wrong
Token ID2642
Feature activation+2.11e-02

Pos logit contributions
working+0.057
sing+0.054
work+0.052
 guys+0.050
 warriors+0.050
Neg logit contributions
 -0.024
See-0.023
Lu-0.021
So-0.021
 much-0.021
Token and
Token ID290
Feature activation-1.37e-02

Pos logit contributions
working+0.058
sing+0.054
work+0.052
 guys+0.050
 warriors+0.049
Neg logit contributions
See-0.021
 -0.021
So-0.019
Lu-0.019
-0.018
Token his
Token ID465
Feature activation-1.11e-02

Pos logit contributions
See+0.018
So+0.016
 +0.016
Lu+0.015
Ann+0.015
Neg logit contributions
working-0.035
work-0.031
sing-0.030
 guys-0.030
 warriors-0.029
Token mom
Token ID1995
Feature activation-2.53e-03

Pos logit contributions
See+0.015
 +0.015
So+0.014
Lu+0.014
+0.013
Neg logit contributions
working-0.027
sing-0.025
work-0.024
 guys-0.023
 warriors-0.022
Token said
Token ID531
Feature activation-5.83e-03

Pos logit contributions
See+0.003
Lu+0.002
Ph+0.002
Care+0.002
Ann+0.002
Neg logit contributions
working-0.007
work-0.006
sing-0.006
zo-0.006
 guys-0.006
Token and
Token ID290
Feature activation-1.56e-03

Pos logit contributions
working+0.038
 photographer+0.038
 rook+0.038
sing+0.037
 warrior+0.037
Neg logit contributions
 -0.011
 was-0.010
-0.010
.-0.010
 she-0.008
Token saw
Token ID2497
Feature activation-3.22e-03

Pos logit contributions
See+0.002
 +0.002
So+0.002
Lu+0.002
+0.002
Neg logit contributions
working-0.004
sing-0.004
work-0.004
 guys-0.004
 warriors-0.003
Token the
Token ID262
Feature activation-4.03e-03

Pos logit contributions
 +0.003
See+0.003
So+0.002
Lu+0.002
 much+0.002
Neg logit contributions
working-0.009
sing-0.009
work-0.009
 guys-0.008
 warriors-0.008
Token elephant
Token ID20950
Feature activation+2.16e-02

Pos logit contributions
See+0.006
 +0.006
 much+0.005
So+0.005
 was+0.005
Neg logit contributions
working-0.009
sing-0.009
work-0.009
zo-0.008
 guys-0.008
Token taking
Token ID2263
Feature activation+3.16e-02

Pos logit contributions
working+0.058
sing+0.054
work+0.052
 guys+0.050
 warriors+0.049
Neg logit contributions
See-0.024
 -0.023
Lu-0.021
So-0.021
Ph-0.020
Token his
Token ID465
Feature activation+2.95e-02

Pos logit contributions
working+0.080
sing+0.072
 guys+0.071
work+0.070
 warriors+0.067
Neg logit contributions
See-0.038
 -0.036
So-0.036
Lu-0.034
uct-0.034
Token apples
Token ID22514
Feature activation-8.60e-03

Pos logit contributions
working+0.071
sing+0.068
work+0.066
zo+0.062
 warriors+0.061
Neg logit contributions
 -0.038
See-0.036
So-0.034
 was-0.034
 much-0.034
Token.
Token ID13
Feature activation+7.03e-03

Pos logit contributions
See+0.008
 +0.008
So+0.007
Lu+0.007
Ann+0.006
Neg logit contributions
working-0.024
sing-0.023
work-0.022
 guys-0.021
 warriors-0.021
Token He
Token ID679
Feature activation+1.28e-02

Pos logit contributions
working+0.017
sing+0.016
work+0.015
 guys+0.015
 warriors+0.015
Neg logit contributions
See-0.009
 -0.008
So-0.008
Lu-0.008
uct-0.008
Token was
Token ID373
Feature activation-1.94e-03

Pos logit contributions
working+0.035
sing+0.033
work+0.031
 guys+0.031
zo+0.030
Neg logit contributions
See-0.013
 -0.012
Lu-0.011
So-0.011
Ph-0.010
Token very
Token ID845
Feature activation+5.18e-03

Pos logit contributions
See+0.002
 +0.002
So+0.002
Lu+0.002
Ann+0.002
Neg logit contributions
working-0.005
sing-0.005
work-0.004
 warriors-0.004
 guys-0.004
Token was
Token ID373
Feature activation-2.58e-03

Pos logit contributions
working+0.030
sing+0.028
work+0.027
zo+0.027
 guys+0.026
Neg logit contributions
See-0.008
 -0.007
Ab-0.007
Lu-0.007
Ph-0.007
Token crying
Token ID13774
Feature activation+7.80e-03

Pos logit contributions
See+0.004
So+0.004
 +0.004
 So+0.004
Lu+0.004
Neg logit contributions
working-0.005
sing-0.005
work-0.005
 guys-0.005
 warriors-0.004
Token.
Token ID13
Feature activation-3.55e-03

Pos logit contributions
working+0.022
sing+0.020
work+0.020
 guys+0.019
 warriors+0.019
Neg logit contributions
See-0.008
 -0.007
So-0.007
Lu-0.007
-0.006
Token She
Token ID1375
Feature activation-2.57e-02

Pos logit contributions
See+0.004
So+0.003
Ann+0.003
uct+0.003
Ab+0.003
Neg logit contributions
working-0.009
sing-0.009
work-0.008
 guys-0.008
 spine-0.008
Token gave
Token ID2921
Feature activation-1.33e-02

Pos logit contributions
See+0.029
 +0.025
Ph+0.024
Lu+0.024
uct+0.024
Neg logit contributions
working-0.062
sing-0.057
work-0.056
zo-0.055
 crew-0.054
Token him
Token ID683
Feature activation+8.17e-03

Pos logit contributions
See+0.023
So+0.022
 +0.022
Lu+0.021
Conf+0.021
Neg logit contributions
working-0.026
sing-0.023
 guys-0.023
work-0.023
 warriors-0.020
Token a
Token ID257
Feature activation-1.25e-03

Pos logit contributions
working+0.022
sing+0.021
work+0.020
 guys+0.019
 warriors+0.019
Neg logit contributions
See-0.009
 -0.008
So-0.008
Lu-0.007
-0.007
Token big
Token ID1263
Feature activation-1.80e-02

Pos logit contributions
See+0.002
 +0.002
uct+0.002
Lu+0.002
So+0.002
Neg logit contributions
working-0.003
sing-0.003
work-0.002
 guys-0.002
 soldier-0.002
Token hug
Token ID16225
Feature activation-1.12e-02

Pos logit contributions
 +0.020
See+0.020
So+0.018
Lu+0.018
+0.017
Neg logit contributions
working-0.047
sing-0.044
work-0.042
 guys-0.041
 warriors-0.040
Token and
Token ID290
Feature activation-9.76e-03

Pos logit contributions
See+0.015
 +0.014
So+0.013
Lu+0.013
uct+0.013
Neg logit contributions
working-0.027
sing-0.025
work-0.025
 guys-0.024
 warriors-0.023
Token told
Token ID1297
Feature activation-1.74e-02

Pos logit contributions
See+0.010
Lu+0.010
Ann+0.010
 +0.010
So+0.009
Neg logit contributions
working-0.025
sing-0.023
work-0.022
 guys-0.022
 crew-0.022
Token and
Token ID290
Feature activation-1.67e-02

Pos logit contributions
See+0.035
 +0.034
So+0.031
uct+0.031
Lu+0.029
Neg logit contributions
working-0.091
work-0.084
sing-0.083
 guys-0.083
 warriors-0.078
Token mold
Token ID15936
Feature activation+5.30e-03

Pos logit contributions
See+0.022
 +0.022
So+0.021
Lu+0.021
Ph+0.020
Neg logit contributions
working-0.040
sing-0.037
work-0.036
 guys-0.034
 warriors-0.033
Tokeny
Token ID88
Feature activation+7.09e-03

Pos logit contributions
working+0.012
work+0.011
 guys+0.010
sing+0.010
zo+0.010
Neg logit contributions
See-0.008
 -0.008
uct-0.008
So-0.008
Ph-0.007
Token and
Token ID290
Feature activation-1.11e-02

Pos logit contributions
working+0.017
sing+0.016
work+0.015
 guys+0.015
 warriors+0.014
Neg logit contributions
See-0.009
 -0.009
uct-0.009
So-0.009
Ph-0.009
Token ranc
Token ID32205
Feature activation-3.02e-03

Pos logit contributions
 +0.016
See+0.016
So+0.015
Lu+0.015
uct+0.015
Neg logit contributions
working-0.025
sing-0.023
work-0.022
 guys-0.021
 warriors-0.021
Tokenid
Token ID312
Feature activation+3.27e-02

Pos logit contributions
See+0.004
 +0.004
So+0.004
Lu+0.004
+0.004
Neg logit contributions
working-0.007
sing-0.007
work-0.006
zo-0.006
 guys-0.006
Token and
Token ID290
Feature activation+4.92e-04

Pos logit contributions
working+0.072
work+0.063
sing+0.062
zo+0.059
 guys+0.058
Neg logit contributions
See-0.052
 -0.050
So-0.048
uct-0.048
Lu-0.047
Token expired
Token ID21350
Feature activation-1.48e-02

Pos logit contributions
working+0.001
sing+0.001
work+0.001
 guys+0.001
 warriors+0.001
Neg logit contributions
See-0.001
 -0.001
So-0.001
Lu-0.001
-0.001
Token and
Token ID290
Feature activation-9.58e-03

Pos logit contributions
See+0.014
 +0.014
So+0.012
uct+0.012
Lu+0.012
Neg logit contributions
working-0.042
work-0.039
sing-0.039
 guys-0.038
 warriors-0.036
Token best
Token ID1266
Feature activation-1.75e-02

Pos logit contributions
See+0.013
 +0.013
So+0.012
Lu+0.012
uct+0.012
Neg logit contributions
working-0.023
sing-0.021
work-0.020
 guys-0.019
 warriors-0.019
Token before
Token ID878
Feature activation-7.89e-03

Pos logit contributions
 +0.027
See+0.027
uct+0.026
So+0.026
Lu+0.025
Neg logit contributions
working-0.039
sing-0.034
work-0.034
 guys-0.033
 warriors-0.032
Token,
Token ID11
Feature activation-1.98e-02

Pos logit contributions
See+0.015
So+0.013
 +0.013
Lu+0.013
uct+0.012
Neg logit contributions
working-0.034
sing-0.032
work-0.031
 guys-0.030
 warriors-0.029
Token and
Token ID290
Feature activation-1.24e-02

Pos logit contributions
See+0.032
 +0.030
Lu+0.030
So+0.029
Ph+0.028
Neg logit contributions
working-0.040
sing-0.039
work-0.036
 guys-0.035
 warriors-0.034
Token her
Token ID607
Feature activation-3.59e-02

Pos logit contributions
See+0.014
Lu+0.013
Ph+0.013
So+0.013
 +0.012
Neg logit contributions
working-0.032
sing-0.030
work-0.029
 warriors-0.027
 guys-0.027
Token mom
Token ID1995
Feature activation-2.70e-05

Pos logit contributions
See+0.063
 +0.060
Lu+0.060
So+0.058
Ph+0.057
Neg logit contributions
working-0.071
sing-0.067
work-0.062
 guys-0.059
 warrior-0.058
Token said
Token ID531
Feature activation-5.04e-03

Pos logit contributions
See+0.000
Ph+0.000
Ab+0.000
So+0.000
Care+0.000
Neg logit contributions
working-0.000
work-0.000
sing-0.000
zo-0.000
fly-0.000
Token yes
Token ID3763
Feature activation-2.45e-02

Pos logit contributions
See+0.007
So+0.007
Lu+0.006
Ann+0.006
uct+0.006
Neg logit contributions
working-0.012
sing-0.011
work-0.010
 guys-0.010
 warriors-0.009
Token.
Token ID13
Feature activation-1.49e-03

Pos logit contributions
See+0.029
 +0.023
Lu+0.023
Care+0.022
Ab+0.021
Neg logit contributions
working-0.065
sing-0.060
 guys-0.059
work-0.059
zo-0.056
Token
Token ID198
Feature activation-8.59e-05

Pos logit contributions
up+0.002
See+0.002
ered+0.002
aw+0.002
 else+0.001
Neg logit contributions
sing-0.003
working-0.003
work-0.003
 photographer-0.003
 rook-0.003
Token
Token ID198
Feature activation+1.24e-02

Pos logit contributions
 calmed+0.000
uct+0.000
 cal+0.000
See+0.000
 p+0.000
Neg logit contributions
working-0.000
zo-0.000
 tablet-0.000
sing-0.000
ico-0.000
TokenL
Token ID43
Feature activation+1.56e-04

Pos logit contributions
working+0.028
 photographer+0.025
zo+0.025
 warrior+0.024
 warriors+0.023
Neg logit contributions
 kisses-0.014
 -0.014
 calmed-0.013
 Instead-0.013
See-0.013
Tokenily
Token ID813
Feature activation-1.13e-02

Pos logit contributions
working+0.000
work+0.000
 guys+0.000
sing+0.000
zo+0.000
Neg logit contributions
 -0.000
See-0.000
 Instead-0.000
So-0.000
Care-0.000
Token
Token ID198
Feature activation+9.58e-03

Pos logit contributions
uct+0.001
ird+0.001
See+0.001
Lu+0.001
Ab+0.001
Neg logit contributions
working-0.002
sing-0.002
zo-0.002
 tablet-0.002
work-0.002
TokenL
Token ID43
Feature activation-1.07e-02

Pos logit contributions
working+0.022
zo+0.020
sing+0.020
 photographer+0.020
 warriors+0.019
Neg logit contributions
 -0.010
uct-0.010
See-0.010
 Lucy-0.010
 Instead-0.009
Tokenily
Token ID813
Feature activation-2.46e-02

Pos logit contributions
 +0.015
See+0.013
So+0.012
+0.012
 Instead+0.011
Neg logit contributions
working-0.026
 guys-0.024
sing-0.023
work-0.023
 warriors-0.022
Token and
Token ID290
Feature activation+2.04e-02

Pos logit contributions
Ab+0.038
See+0.038
Lu+0.037
Su+0.035
Care+0.035
Neg logit contributions
working-0.044
sing-0.043
zo-0.043
 Claus-0.041
 soldier-0.041
Token Ben
Token ID3932
Feature activation+2.31e-02

Pos logit contributions
working+0.053
sing+0.050
work+0.048
 guys+0.045
 warriors+0.045
Neg logit contributions
 -0.025
See-0.024
 was-0.021
Lu-0.021
-0.021
Token felt
Token ID2936
Feature activation-6.06e-03

Pos logit contributions
working+0.052
sing+0.051
work+0.049
 guys+0.049
zo+0.048
Neg logit contributions
See-0.029
Lu-0.028
uct-0.026
Ab-0.026
Care-0.026
Token better
Token ID1365
Feature activation-1.42e-02

Pos logit contributions
Ann+0.010
Lu+0.009
uct+0.009
Su+0.009
liv+0.008
Neg logit contributions
working-0.012
 soldier-0.011
sing-0.011
 guys-0.011
 warriors-0.010
Token.
Token ID13
Feature activation+1.80e-02

Pos logit contributions
See+0.010
 +0.010
Lu+0.009
So+0.008
uct+0.008
Neg logit contributions
working-0.043
sing-0.041
work-0.039
 guys-0.038
 warriors-0.038
Token They
Token ID1119
Feature activation+7.40e-03

Pos logit contributions
working+0.040
sing+0.037
work+0.035
 guys+0.034
 warriors+0.034
Neg logit contributions
See-0.027
 -0.026
Lu-0.025
So-0.025
uct-0.025
Token held
Token ID2714
Feature activation+1.27e-02

Pos logit contributions
working+0.017
sing+0.016
zo+0.016
 guys+0.015
work+0.015
Neg logit contributions
See-0.009
Lu-0.008
Care-0.008
Ann-0.008
uct-0.008
Token hands
Token ID2832
Feature activation-1.35e-02

Pos logit contributions
working+0.031
sing+0.029
work+0.028
 guys+0.027
 warriors+0.026
Neg logit contributions
See-0.016
 -0.016
So-0.015
Lu-0.015
uct-0.014
Token important
Token ID1593
Feature activation+2.16e-02

Pos logit contributions
working+0.026
work+0.021
sing+0.021
 warriors+0.020
 guys+0.020
Neg logit contributions
See-0.018
 -0.017
So-0.017
Ann-0.017
uct-0.016
Token it
Token ID340
Feature activation+3.40e-02

Pos logit contributions
working+0.057
work+0.051
 guys+0.050
sing+0.050
 warriors+0.048
Neg logit contributions
See-0.023
 -0.022
uct-0.021
So-0.020
Ph-0.020
Token is
Token ID318
Feature activation+2.34e-02

Pos logit contributions
working+0.106
work+0.097
sing+0.097
 guys+0.096
 warriors+0.093
Neg logit contributions
See-0.022
 -0.020
Lu-0.017
Ph-0.016
So-0.016
Token to
Token ID284
Feature activation+7.93e-04

Pos logit contributions
working+0.063
sing+0.056
work+0.055
 guys+0.053
 warriors+0.052
Neg logit contributions
See-0.027
 -0.025
Lu-0.024
Ann-0.024
So-0.023
Token save
Token ID3613
Feature activation+9.30e-03

Pos logit contributions
working+0.002
sing+0.001
work+0.001
 guys+0.001
 warriors+0.001
Neg logit contributions
See-0.001
 -0.001
Lu-0.001
So-0.001
uct-0.001
Token energy
Token ID2568
Feature activation-1.34e-02

Pos logit contributions
working+0.023
sing+0.021
work+0.021
 guys+0.020
 warriors+0.020
Neg logit contributions
See-0.012
 -0.011
So-0.010
Lu-0.010
uct-0.010
Token and
Token ID290
Feature activation-6.05e-03

Pos logit contributions
See+0.013
 +0.013
So+0.011
Lu+0.011
uct+0.010
Neg logit contributions
working-0.038
sing-0.035
work-0.034
 guys-0.033
 warriors-0.032
Token money
Token ID1637
Feature activation-1.62e-02

Pos logit contributions
See+0.009
 +0.009
Lu+0.009
Ann+0.009
So+0.009
Neg logit contributions
working-0.013
sing-0.012
work-0.011
 guys-0.011
 warriors-0.011
Token.
Token ID13
Feature activation-9.09e-03

Pos logit contributions
See+0.019
 +0.018
Lu+0.017
So+0.017
Ann+0.017
Neg logit contributions
working-0.042
work-0.038
sing-0.037
 guys-0.036
 warriors-0.035
Token Robbie
Token ID34855
Feature activation+1.17e-02

Pos logit contributions
See+0.012
uct+0.012
Jane+0.012
aw+0.011
 cheered+0.011
Neg logit contributions
working-0.020
work-0.017
sing-0.017
zo-0.017
 guys-0.017
Token also
Token ID635
Feature activation+1.52e-02

Pos logit contributions
working+0.034
sing+0.032
work+0.031
 guys+0.030
zo+0.030
Neg logit contributions
See-0.010
 -0.009
Lu-0.008
So-0.008
uct-0.008
TokenTheir
Token ID14574
Feature activation-1.83e-03

Pos logit contributions
working+0.019
sing+0.018
work+0.017
 photographer+0.017
zo+0.016
Neg logit contributions
uct-0.009
See-0.009
 -0.008
 Instead-0.007
aw-0.007
Token mom
Token ID1995
Feature activation-1.21e-02

Pos logit contributions
 +0.002
See+0.002
So+0.002
Lu+0.002
 was+0.002
Neg logit contributions
working-0.005
sing-0.004
work-0.004
 guys-0.004
 warriors-0.004
Token said
Token ID531
Feature activation+2.47e-03

Pos logit contributions
See+0.011
 +0.009
Ph+0.009
 much+0.009
Lu+0.009
Neg logit contributions
working-0.032
work-0.031
sing-0.031
zo-0.029
 guys-0.029
Token,
Token ID11
Feature activation+1.11e-02

Pos logit contributions
working+0.006
sing+0.006
work+0.006
 guys+0.006
 warriors+0.005
Neg logit contributions
See-0.003
 -0.003
So-0.003
Lu-0.003
uct-0.002
Token "
Token ID366
Feature activation+8.56e-03

Pos logit contributions
working+0.029
sing+0.027
work+0.026
 guys+0.025
 warriors+0.024
Neg logit contributions
See-0.013
 -0.012
So-0.012
Lu-0.011
uct-0.011
TokenYou
Token ID1639
Feature activation-1.91e-02

Pos logit contributions
working+0.025
sing+0.024
work+0.023
 guys+0.022
 warriors+0.022
Neg logit contributions
 -0.007
See-0.006
So-0.006
Lu-0.006
 much-0.005
Token will
Token ID481
Feature activation+2.43e-03

Pos logit contributions
 +0.019
See+0.019
So+0.017
+0.017
 was+0.016
Neg logit contributions
working-0.052
sing-0.050
work-0.047
 guys-0.045
 warriors-0.045
Token see
Token ID766
Feature activation-3.54e-03

Pos logit contributions
working+0.006
sing+0.006
work+0.005
 guys+0.005
 warriors+0.005
Neg logit contributions
 -0.003
See-0.003
So-0.003
-0.003
Lu-0.003
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
See+0.004
 +0.004
Lu+0.004
So+0.004
Ph+0.004
Neg logit contributions
working-0.009
 guys-0.008
sing-0.008
work-0.008
 warriors-0.007
Token But
Token ID887
Feature activation-2.46e-03

Pos logit contributions
working+0.036
sing+0.034
work+0.033
 guys+0.032
 warriors+0.031
Neg logit contributions
See-0.019
 -0.018
So-0.017
Lu-0.017
uct-0.016
Token you
Token ID345
Feature activation-3.54e-02

Pos logit contributions
 +0.003
See+0.003
So+0.002
 was+0.002
+0.002
Neg logit contributions
working-0.007
sing-0.006
work-0.006
 guys-0.006
zo-0.006
Token share
Token ID2648
Feature activation+9.08e-03

Pos logit contributions
working+0.020
sing+0.019
work+0.018
 guys+0.017
 warriors+0.017
Neg logit contributions
 -0.014
See-0.014
So-0.013
-0.013
Lu-0.013
Token because
Token ID780
Feature activation-2.00e-02

Pos logit contributions
working+0.023
sing+0.021
 guys+0.021
work+0.020
 warriors+0.019
Neg logit contributions
See-0.011
 -0.010
So-0.009
Lu-0.009
uct-0.009
Token he
Token ID339
Feature activation+1.70e-02

Pos logit contributions
See+0.023
So+0.022
 +0.022
uct+0.020
Conf+0.020
Neg logit contributions
working-0.052
sing-0.045
work-0.045
 guys-0.044
 warriors-0.042
Token wanted
Token ID2227
Feature activation+1.32e-02

Pos logit contributions
working+0.046
sing+0.043
work+0.041
 guys+0.040
zo+0.039
Neg logit contributions
See-0.018
 -0.017
So-0.015
Lu-0.015
-0.014
Token to
Token ID284
Feature activation+1.38e-03

Pos logit contributions
working+0.032
sing+0.029
work+0.028
 guys+0.027
 warriors+0.026
Neg logit contributions
See-0.018
 -0.017
So-0.016
Lu-0.016
-0.015
Token keep
Token ID1394
Feature activation+3.73e-03

Pos logit contributions
working+0.003
sing+0.002
 guys+0.002
work+0.002
 rook+0.002
Neg logit contributions
See-0.003
 -0.002
Lu-0.002
So-0.002
Conf-0.002
Token it
Token ID340
Feature activation+1.23e-02

Pos logit contributions
working+0.011
sing+0.010
work+0.010
 warriors+0.009
 warrior+0.009
Neg logit contributions
 -0.004
See-0.003
 much-0.003
 was-0.003
-0.003
Token all
Token ID477
Feature activation+9.18e-03

Pos logit contributions
working+0.034
sing+0.032
work+0.031
 guys+0.030
 warriors+0.029
Neg logit contributions
See-0.013
 -0.012
Lu-0.011
So-0.011
Ann-0.010
Token for
Token ID329
Feature activation+2.50e-02

Pos logit contributions
working+0.021
work+0.018
sing+0.018
 guys+0.017
zo+0.016
Neg logit contributions
See-0.015
So-0.014
Care-0.014
Conf-0.013
Ann-0.013
Token himself
Token ID2241
Feature activation+2.12e-02

Pos logit contributions
working+0.068
sing+0.064
work+0.062
 guys+0.059
 warriors+0.059
Neg logit contributions
 -0.026
See-0.025
So-0.023
Lu-0.023
 much-0.022
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
working+0.061
sing+0.056
work+0.055
 guys+0.053
 warriors+0.052
Neg logit contributions
See-0.021
 -0.018
Lu-0.017
Ann-0.016
Care-0.016
Token over
Token ID625
Feature activation+2.00e-02

Pos logit contributions
working+0.060
sing+0.056
work+0.053
 guys+0.051
 warriors+0.050
Neg logit contributions
 -0.038
See-0.038
So-0.035
Lu-0.035
-0.034
Token to
Token ID284
Feature activation-1.32e-02

Pos logit contributions
working+0.053
sing+0.049
work+0.047
 guys+0.046
zo+0.045
Neg logit contributions
 -0.024
See-0.023
 was-0.022
 much-0.021
-0.021
Token a
Token ID257
Feature activation-8.10e-03

Pos logit contributions
 +0.012
See+0.010
+0.010
 was+0.009
 much+0.009
Neg logit contributions
working-0.040
sing-0.037
work-0.036
zo-0.035
 guys-0.034
Token cool
Token ID3608
Feature activation-7.95e-03

Pos logit contributions
See+0.012
 +0.011
So+0.011
Lu+0.010
uct+0.010
Neg logit contributions
working-0.019
sing-0.017
work-0.016
 guys-0.016
 warriors-0.016
Token and
Token ID290
Feature activation-1.42e-02

Pos logit contributions
See+0.011
 +0.011
So+0.011
Lu+0.010
+0.010
Neg logit contributions
working-0.018
sing-0.017
work-0.016
 guys-0.016
 warriors-0.015
Token shady
Token ID36135
Feature activation-4.08e-02

Pos logit contributions
See+0.018
 +0.018
Lu+0.017
So+0.017
Ann+0.016
Neg logit contributions
working-0.034
sing-0.032
work-0.030
 warriors-0.029
 guys-0.029
Token spot
Token ID4136
Feature activation-4.94e-02

Pos logit contributions
See+0.068
 +0.066
So+0.064
+0.062
uct+0.061
Neg logit contributions
working-0.084
sing-0.078
work-0.072
 warriors-0.071
 guys-0.070
Token.
Token ID13
Feature activation-1.29e-03

Pos logit contributions
See+0.074
So+0.070
 +0.069
Lu+0.067
 So+0.065
Neg logit contributions
working-0.111
sing-0.107
work-0.102
 warriors-0.093
 guys-0.093
Token There
Token ID1318
Feature activation-4.48e-03

Pos logit contributions
See+0.002
 +0.001
So+0.001
Lu+0.001
uct+0.001
Neg logit contributions
working-0.003
sing-0.003
work-0.003
 guys-0.003
 warriors-0.003
Token were
Token ID547
Feature activation-1.78e-02

Pos logit contributions
See+0.004
Lu+0.004
 +0.004
uct+0.003
So+0.003
Neg logit contributions
working-0.013
sing-0.012
work-0.011
 guys-0.011
 warriors-0.011
Token big
Token ID1263
Feature activation-4.19e-02

Pos logit contributions
See+0.021
 +0.021
uct+0.021
Lu+0.020
So+0.020
Neg logit contributions
working-0.043
sing-0.041
 guys-0.038
 warriors-0.038
work-0.038
Token,
Token ID11
Feature activation+6.52e-03

Pos logit contributions
working+0.045
sing+0.044
work+0.039
 spine+0.039
 photographer+0.038
Neg logit contributions
 was-0.023
-0.023
 -0.023
 much-0.022
So-0.022
Token my
Token ID616
Feature activation-4.00e-02

Pos logit contributions
working+0.018
sing+0.016
work+0.016
 guys+0.015
 warriors+0.015
Neg logit contributions
 -0.007
See-0.007
So-0.006
-0.006
Lu-0.006
Token dear
Token ID13674
Feature activation-1.19e-02

Pos logit contributions
 +0.066
 was+0.063
See+0.061
 much+0.060
+0.060
Neg logit contributions
working-0.086
sing-0.076
work-0.074
 photographer-0.070
 warriors-0.069
Token.
Token ID13
Feature activation+1.13e-02

Pos logit contributions
See+0.012
 +0.011
uct+0.010
Lu+0.009
So+0.009
Neg logit contributions
working-0.033
sing-0.031
work-0.030
 guys-0.030
 warriors-0.029
Token Do
Token ID2141
Feature activation+3.30e-03

Pos logit contributions
working+0.030
sing+0.028
work+0.027
 guys+0.026
 warriors+0.026
Neg logit contributions
 -0.013
See-0.012
-0.011
So-0.011
 So-0.011
Token you
Token ID345
Feature activation-4.22e-02

Pos logit contributions
working+0.009
sing+0.008
 guys+0.008
work+0.008
zo+0.007
Neg logit contributions
See-0.004
 -0.003
Lu-0.003
uct-0.003
So-0.003
Token want
Token ID765
Feature activation+1.04e-02

Pos logit contributions
 +0.065
+0.062
See+0.062
 much+0.060
So+0.059
Neg logit contributions
working-0.093
sing-0.088
work-0.085
 warriors-0.077
zo-0.077
Token some
Token ID617
Feature activation+3.54e-03

Pos logit contributions
working+0.027
sing+0.026
 spine+0.024
work+0.024
 warrior+0.023
Neg logit contributions
 -0.013
 was-0.012
-0.012
 much-0.011
-0.011
Token?"
Token ID1701
Feature activation+6.72e-03

Pos logit contributions
working+0.007
sing+0.007
work+0.006
 warriors+0.006
 spine+0.006
Neg logit contributions
 -0.006
See-0.006
 was-0.006
 much-0.006
So-0.006
Token Mom
Token ID11254
Feature activation-4.07e-02

Pos logit contributions
working+0.015
sing+0.014
 soldier+0.012
 guys+0.012
work+0.012
Neg logit contributions
See-0.009
Ph-0.008
uct-0.008
aw-0.008
Lu-0.008
Token says
Token ID1139
Feature activation-2.63e-03

Pos logit contributions
See+0.052
Ph+0.049
 much+0.047
Lu+0.046
So+0.045
Neg logit contributions
working-0.097
work-0.090
sing-0.087
 warriors-0.084
zo-0.083
Token-
Token ID12
Feature activation+1.06e-02

Pos logit contributions
See+0.005
 +0.005
So+0.005
Lu+0.005
+0.005
Neg logit contributions
working-0.009
sing-0.009
work-0.008
 guys-0.008
 warriors-0.008
Tokenaid
Token ID1698
Feature activation-7.85e-03

Pos logit contributions
working+0.030
work+0.026
sing+0.026
zo+0.025
 guys+0.025
Neg logit contributions
 -0.010
See-0.009
uct-0.009
So-0.008
 So-0.008
Token on
Token ID319
Feature activation-8.27e-03

Pos logit contributions
 +0.009
See+0.009
So+0.008
Lu+0.008
+0.008
Neg logit contributions
working-0.020
sing-0.019
work-0.018
 guys-0.018
 warriors-0.018
Token it
Token ID340
Feature activation+1.51e-03

Pos logit contributions
So+0.008
See+0.007
Lu+0.007
uct+0.007
 So+0.006
Neg logit contributions
working-0.022
sing-0.021
 guys-0.019
work-0.019
 rook-0.019
Token and
Token ID290
Feature activation+7.59e-04

Pos logit contributions
working+0.004
sing+0.003
work+0.003
 guys+0.003
 spine+0.003
Neg logit contributions
See-0.002
Lu-0.002
Ab-0.002
Ph-0.002
Ann-0.002
Token kisses
Token ID36544
Feature activation-4.13e-02

Pos logit contributions
working+0.002
sing+0.002
work+0.001
 guys+0.001
 warriors+0.001
Neg logit contributions
Lu-0.001
So-0.001
 -0.001
Ph-0.001
See-0.001
Token it
Token ID340
Feature activation-2.93e-03

Pos logit contributions
uct+0.053
So+0.049
Conf+0.048
See+0.047
Ph+0.046
Neg logit contributions
working-0.091
 guys-0.085
work-0.083
sing-0.082
fly-0.076
Token.
Token ID13
Feature activation+9.47e-03

Pos logit contributions
See+0.003
Ab+0.003
Lu+0.003
Su+0.003
Ph+0.003
Neg logit contributions
working-0.008
sing-0.007
work-0.007
 guys-0.007
zo-0.006
Token
Token ID198
Feature activation+1.06e-04

Pos logit contributions
working+0.021
sing+0.020
 spine+0.017
ses+0.017
 warriors+0.017
Neg logit contributions
uct-0.012
ight-0.011
ird-0.010
up-0.010
So-0.010
Token
Token ID198
Feature activation+1.14e-02

Pos logit contributions
working+0.000
sing+0.000
zo+0.000
 tablet+0.000
gy+0.000
Neg logit contributions
uct-0.000
ird-0.000
aw-0.000
ight-0.000
See-0.000
Token"
Token ID1
Feature activation+1.86e-02

Pos logit contributions
working+0.024
sing+0.019
 photographer+0.019
 warriors+0.018
king+0.018
Neg logit contributions
uct-0.014
 kisses-0.013
See-0.013
 cal-0.013
rol-0.013
Token a
Token ID257
Feature activation-3.77e-03

Pos logit contributions
working+0.036
sing+0.033
work+0.033
 guys+0.032
 warriors+0.032
Neg logit contributions
See-0.017
 -0.016
So-0.016
Lu-0.016
uct-0.015
Token small
Token ID1402
Feature activation+1.39e-02

Pos logit contributions
See+0.007
 +0.006
So+0.006
uct+0.006
Lu+0.006
Neg logit contributions
working-0.007
sing-0.007
work-0.006
 guys-0.006
 warrior-0.006
Token box
Token ID3091
Feature activation-2.57e-02

Pos logit contributions
working+0.031
sing+0.029
work+0.028
zo+0.026
 guys+0.026
Neg logit contributions
 -0.021
See-0.020
-0.019
 was-0.019
So-0.019
Token.
Token ID13
Feature activation+1.81e-02

Pos logit contributions
See+0.021
 +0.020
So+0.018
Lu+0.017
+0.016
Neg logit contributions
working-0.075
sing-0.071
work-0.068
 guys-0.067
 warriors-0.065
Token He
Token ID679
Feature activation+1.54e-02

Pos logit contributions
working+0.037
sing+0.035
 rook+0.034
 warriors+0.034
 soldier+0.033
Neg logit contributions
uct-0.026
Lu-0.025
So-0.024
See-0.024
liv-0.023
Token opens
Token ID9808
Feature activation-4.03e-02

Pos logit contributions
working+0.036
sing+0.034
work+0.032
 warriors+0.032
 guys+0.031
Neg logit contributions
See-0.020
 -0.020
Lu-0.020
So-0.019
Ph-0.019
Token the
Token ID262
Feature activation-5.16e-04

Pos logit contributions
See+0.036
So+0.034
Lu+0.032
 +0.032
uct+0.030
Neg logit contributions
working-0.112
sing-0.103
work-0.102
 guys-0.098
 warriors-0.097
Token box
Token ID3091
Feature activation-1.41e-02

Pos logit contributions
Lu+0.001
See+0.001
Jane+0.001
Ph+0.001
So+0.001
Neg logit contributions
working-0.001
sing-0.001
work-0.001
 guys-0.001
 rook-0.001
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
See+0.018
Lu+0.017
Ph+0.016
 +0.015
So+0.015
Neg logit contributions
working-0.034
sing-0.033
work-0.032
 warriors-0.030
 rook-0.029
Token Inside
Token ID14384
Feature activation-1.66e-02

Pos logit contributions
working+0.030
sing+0.028
 rook+0.028
 warriors+0.027
 soldier+0.026
Neg logit contributions
uct-0.022
Lu-0.021
So-0.021
See-0.020
liv-0.020
Token,
Token ID11
Feature activation+3.41e-03

Pos logit contributions
See+0.018
Lu+0.017
uct+0.016
So+0.016
Ph+0.015
Neg logit contributions
working-0.043
work-0.039
 guys-0.038
sing-0.038
 warriors-0.037
Token deep
Token ID2769
Feature activation+9.15e-03

Pos logit contributions
working+0.041
sing+0.038
work+0.036
 guys+0.035
 warriors+0.034
Neg logit contributions
 -0.021
See-0.021
So-0.019
Lu-0.019
-0.019
Token hills
Token ID18639
Feature activation-3.15e-02

Pos logit contributions
working+0.023
sing+0.022
work+0.021
 guys+0.020
 photographer+0.020
Neg logit contributions
 -0.011
See-0.011
 was-0.010
Lu-0.010
So-0.009
Token and
Token ID290
Feature activation+5.21e-03

Pos logit contributions
See+0.032
 +0.032
So+0.029
Lu+0.028
+0.027
Neg logit contributions
working-0.085
sing-0.080
work-0.077
 guys-0.074
 warriors-0.073
Token treasures
Token ID29561
Feature activation-3.66e-02

Pos logit contributions
working+0.013
sing+0.012
work+0.011
 guys+0.011
 photographer+0.011
Neg logit contributions
 -0.007
See-0.007
 was-0.006
 much-0.006
So-0.006
Token and
Token ID290
Feature activation+1.06e-02

Pos logit contributions
See+0.031
 +0.031
So+0.027
Lu+0.026
+0.025
Neg logit contributions
working-0.106
sing-0.099
work-0.096
 guys-0.093
 warriors-0.092
Token castles
Token ID49203
Feature activation-4.28e-02

Pos logit contributions
working+0.023
sing+0.022
work+0.021
 guys+0.020
 warriors+0.020
Neg logit contributions
See-0.016
 -0.016
So-0.015
Lu-0.015
-0.014
Token and
Token ID290
Feature activation+9.04e-03

Pos logit contributions
See+0.037
 +0.036
So+0.032
Lu+0.031
uct+0.030
Neg logit contributions
working-0.123
sing-0.115
work-0.112
 guys-0.109
 warriors-0.107
Token dragons
Token ID20308
Feature activation-3.10e-02

Pos logit contributions
working+0.020
sing+0.018
work+0.017
 guys+0.017
 warriors+0.016
Neg logit contributions
 -0.014
See-0.014
So-0.013
Lu-0.013
-0.012
Token.
Token ID13
Feature activation+1.63e-02

Pos logit contributions
See+0.026
 +0.025
So+0.022
Lu+0.022
uct+0.021
Neg logit contributions
working-0.090
sing-0.084
work-0.082
 guys-0.080
 warriors-0.079
Token They
Token ID1119
Feature activation+5.48e-03

Pos logit contributions
working+0.035
sing+0.033
work+0.030
 guys+0.029
 warriors+0.029
Neg logit contributions
uct-0.024
See-0.023
So-0.023
Lu-0.022
 -0.022
Token are
Token ID389
Feature activation-1.11e-02

Pos logit contributions
working+0.014
sing+0.013
work+0.013
 guys+0.012
 warriors+0.012
Neg logit contributions
 -0.007
See-0.006
So-0.006
Lu-0.006
-0.006