TOP ACTIVATIONS
MAX = 0.099

, gir aff es and ze br as . They took
of animals like lions , ze br as , and gir
aff es , and fast ze br as . Tim was
animals , like lions , ze br as , and gir
animals , like lions , ze br as , and gir
, gir aff es , ze br as , and more
animals , like lions , ze br as , and gir
many animals like lions , ze br as and elephants .
animals , like lions , ze br as , gir aff
lions , monkeys , and ze br as . Sara liked
animals , like lions , ze br as , and gir
lions , monkeys , and ze br as . But the
, tigers , monkeys and ze br as . He was
of animals like lions , ze br as , and gir
animals , like lions , ze br as , and gir
lions , monkeys , and ze br as . But their
did not see any baby ze br as .
animals , like monkeys , ze br as and gir aff
lions , monkeys , and ze br as . But their
Lily was looking at the ze br as , a rude

MIN ACTIVATIONS
MIN = -0.075

! He smiled and gave Mom my and Daddy big hugs
a desk . She gives Mom and Dad a key .
mum asked him to put all the toys in the toy
remind me to bring our bucket back to the car ?"
Her mom brought out all the ingredients and they started
excited ! She takes off all the paper , and she
present for their mom . At the store , they looked
and helped her mom put all the paper in a box
! Where did you get it ?" Jack told
a cake . Mary put all the y ummy ingredients in
to the store to buy all the things they needed .
idea . They filled up buckets with water and ran outside
. I have to bring all my things with me ."
. Where did you get it ?" Tom says " Dad
. Where did you get it ?" L ily
clean his room and put all his toys in the bin
your room now ? Put all your toys away and make
her room and put away all of her dress - up
. Lisa put all her dolls on the new
you want to buy some laundry ? I have a shirt

INTERVAL 0.055 - 0.099
CONTAINS 0.021%

pulled and tug ged and shouted . They did not want
see lions , elephants , gir aff es and more .
walked on a path through a big bush with lots of
Ben and Lily liked the ze br as the best .
he reached out his tiny cater pillar hand and grasped the

INTERVAL 0.012 - 0.055
CONTAINS 19.793%

all looked around , but they could not find it .
share it ." Bill was happy because Dee
the waves and spl ashed around for hours ! In the
is shocked . She says , " Hey , that 's
ball and it flew through the air . He was so

INTERVAL -0.032 - 0.012
CONTAINS 78.588%

okay , Tom . We are glad you are safe .
big bear who wanted to take a bath . He w
you create ?" Lily smiled and said , " I can
the right places and he found the ancient victory he was
As night started to fall , the animals said goodbye and

INTERVAL -0.075 - -0.032
CONTAINS 1.599%

hand . She felt so safe and happy that they were
too . Let 's put the top back on the box
of fun . <|endoftext|> Once upon a time , there was
Mom my was very pleased , and said that from now
her to help with the laundry . She showed Lily how
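
The per-token records that follow fuse each label with its value (for example `Token ID11` or `Feature activation+1.45e-02`), and a leading space on a token line is part of the token itself (`Token gir` is the token " gir", while `Tokenaff` is "aff"). Below is a minimal sketch of one way to read these records back into structured form. The field layout is inferred from this dump, not from any documented export format, and the names `TokenRecord`, `parse_records`, and `CONTRIB_RE` are hypothetical.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

# A contribution line is a token string (leading space preserved)
# immediately followed by a signed decimal, e.g. " conclusions+0.122".
CONTRIB_RE = re.compile(r"^(.*?)([+-]\d+\.\d+)$")


@dataclass
class TokenRecord:
    # Hypothetical container; fields mirror the labels seen in the dump.
    token: str
    token_id: Optional[int] = None
    activation: Optional[float] = None
    pos: List[Tuple[str, float]] = field(default_factory=list)
    neg: List[Tuple[str, float]] = field(default_factory=list)


def parse_records(lines):
    """Parse fused 'Token… / Token ID… / Feature activation…' records."""
    records = []
    current = None   # record being filled
    bucket = None    # list currently receiving contribution lines
    for raw in lines:
        line = raw.rstrip("\n")
        if current is None and not line.startswith("Token"):
            continue  # skip anything before the first record
        if current is not None and current.token_id is None \
                and line.startswith("Token ID"):
            current.token_id = int(line[len("Token ID"):])
        elif line.startswith("Feature activation"):
            current.activation = float(line[len("Feature activation"):])
        elif line == "Pos logit contributions":
            bucket = current.pos
        elif line == "Neg logit contributions":
            bucket = current.neg
        elif line.startswith("Token"):
            # Everything after the label is the token, spaces included.
            current = TokenRecord(token=line[len("Token"):])
            records.append(current)
            bucket = None
        elif bucket is not None:
            match = CONTRIB_RE.match(line)
            if match:
                bucket.append((match.group(1), float(match.group(2))))
    return records


# Sample copied from the first record of the dump, plus the start of the second.
SAMPLE = """Token,
Token ID11
Feature activation+1.45e-02

Pos logit contributions
 conclusions+0.122
 Majesty+0.114
alks+0.113
Neg logit contributions
my-0.045
 next-0.041
Token gir
Token ID37370
Feature activation+6.71e-02
"""

records = parse_records(SAMPLE.splitlines())
for rec in records:
    print(repr(rec.token), rec.token_id, rec.activation, rec.pos, rec.neg)
```

The parser tracks state rather than matching on prefixes alone because both the token line and the ID line begin with `Token`. It treats the first such line as the token and the next `Token ID…` line as its ID.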

Token,
Token ID11
Feature activation+1.45e-02

Pos logit contributions
 conclusions+0.122
 Majesty+0.114
alks+0.113
ocol+0.111
 fingertips+0.110
Neg logit contributions
my-0.045
 next-0.041
 with-0.037
 over-0.036
 also-0.032
Token gir
Token ID37370
Feature activation+6.71e-02

Pos logit contributions
 conclusions+0.031
alks+0.030
oles+0.029
ocol+0.029
 Majesty+0.028
Neg logit contributions
my-0.019
 next-0.018
 tidy-0.017
 over-0.016
 packed-0.016
Tokenaff
Token ID2001
Feature activation+6.67e-02

Pos logit contributions
 conclusions+0.094
 fingertips+0.077
 Majesty+0.076
 perspective+0.075
 colony+0.073
Neg logit contributions
my-0.149
 next-0.136
 over-0.130
 with-0.126
 packed-0.126
Tokenes
Token ID274
Feature activation+3.55e-02

Pos logit contributions
 conclusions+0.137
 fingertips+0.116
joy+0.112
 Majesty+0.112
 mischief+0.112
Neg logit contributions
my-0.120
 over-0.106
 next-0.106
 with-0.103
 also-0.099
Token and
Token ID290
Feature activation+2.96e-02

Pos logit contributions
 conclusions+0.108
ocol+0.102
alks+0.101
 Majesty+0.101
 fingertips+0.100
Neg logit contributions
 next-0.026
 with-0.024
my-0.024
 over-0.023
 from-0.020
Token ze
Token ID41271
Feature activation+9.88e-02

Pos logit contributions
alks+0.051
 conclusions+0.051
ocol+0.049
 perspective+0.048
oles+0.048
Neg logit contributions
my-0.052
 next-0.049
 over-0.046
 made-0.045
 tidy-0.045
Tokenbr
Token ID1671
Feature activation+3.74e-02

Pos logit contributions
 conclusions+0.236
 perspective+0.225
 fingertips+0.220
 Majesty+0.220
 anth+0.217
Neg logit contributions
my-0.106
 next-0.098
 over-0.095
 ride-0.091
go-0.091
Tokenas
Token ID292
Feature activation+2.60e-02

Pos logit contributions
 conclusions+0.053
alks+0.049
 Majesty+0.046
ocol+0.046
oles+0.046
Neg logit contributions
my-0.077
 next-0.072
 tidy-0.070
 over-0.069
 packed-0.068
Token.
Token ID13
Feature activation-1.06e-03

Pos logit contributions
 conclusions+0.064
 Majesty+0.060
ocol+0.059
alks+0.059
 fingertips+0.058
Neg logit contributions
my-0.032
 next-0.030
 over-0.030
 with-0.027
 from-0.025
Token They
Token ID1119
Feature activation+1.33e-02

Pos logit contributions
my+0.001
 next+0.001
 over+0.001
 also+0.001
 tidy+0.001
Neg logit contributions
 conclusions-0.003
 Majesty-0.003
ocol-0.003
alks-0.003
 predators-0.002
Token took
Token ID1718
Feature activation+7.71e-03

Pos logit contributions
 conclusions+0.033
alks+0.032
ocol+0.032
oles+0.031
 Majesty+0.031
Neg logit contributions
my-0.014
 next-0.012
 packed-0.012
 tidy-0.012
 over-0.012
Token of
Token ID286
Feature activation+2.10e-03

Pos logit contributions
 conclusions+0.010
alks+0.010
 Majesty+0.010
oles+0.010
ocol+0.010
Neg logit contributions
my-0.002
 next-0.002
 tidy-0.002
 over-0.001
 clean-0.001
Token animals
Token ID4695
Feature activation+1.86e-02

Pos logit contributions
alks+0.005
 Majesty+0.004
 conclusions+0.004
 fingertips+0.004
oles+0.004
Neg logit contributions
my-0.003
 tidy-0.003
 clean-0.002
 next-0.002
 organized-0.002
Token like
Token ID588
Feature activation+3.53e-02

Pos logit contributions
 conclusions+0.046
alks+0.045
 Majesty+0.045
ocol+0.044
 fingertips+0.043
Neg logit contributions
my-0.020
 next-0.020
 over-0.018
 with-0.017
 tidy-0.016
Token lions
Token ID31420
Feature activation+5.07e-02

Pos logit contributions
 conclusions+0.062
alks+0.060
ocol+0.058
 fingertips+0.057
 Majesty+0.055
Neg logit contributions
 next-0.062
my-0.058
 over-0.058
 tidy-0.055
 clean-0.055
Token,
Token ID11
Feature activation+9.84e-03

Pos logit contributions
 conclusions+0.144
alks+0.140
 Majesty+0.139
ocol+0.136
 fingertips+0.134
Neg logit contributions
my-0.044
 next-0.044
 with-0.040
 over-0.038
 made-0.035
Token ze
Token ID41271
Feature activation+9.83e-02

Pos logit contributions
 conclusions+0.014
alks+0.014
ocol+0.013
 perspective+0.013
 fingertips+0.013
Neg logit contributions
my-0.020
 next-0.020
 over-0.018
 tidy-0.018
 also-0.018
Tokenbr
Token ID1671
Feature activation+3.06e-02

Pos logit contributions
 conclusions+0.248
 perspective+0.236
 Majesty+0.235
 fingertips+0.232
 anth+0.225
Neg logit contributions
my-0.091
 next-0.088
 over-0.079
go-0.077
 with-0.076
Tokenas
Token ID292
Feature activation+3.14e-02

Pos logit contributions
 conclusions+0.042
alks+0.039
oles+0.036
ocol+0.036
 Majesty+0.036
Neg logit contributions
my-0.064
 next-0.061
 tidy-0.059
 over-0.058
 packed-0.057
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
 conclusions+0.085
alks+0.078
 Majesty+0.078
ocol+0.077
 fingertips+0.076
Neg logit contributions
my-0.033
 next-0.029
 over-0.028
 with-0.024
 packed-0.023
Token and
Token ID290
Feature activation+1.72e-02

Pos logit contributions
 conclusions+0.026
alks+0.025
ocol+0.024
oles+0.024
 Majesty+0.024
Neg logit contributions
my-0.020
 next-0.019
 over-0.018
 tidy-0.017
 packed-0.017
Token gir
Token ID37370
Feature activation+6.86e-02

Pos logit contributions
 conclusions+0.026
alks+0.026
ocol+0.025
oles+0.024
 fingertips+0.024
Neg logit contributions
my-0.034
 next-0.033
 over-0.031
 tidy-0.030
 made-0.030
Tokenaff
Token ID2001
Feature activation+7.20e-02

Pos logit contributions
 conclusions+0.100
 Majesty+0.082
 fingertips+0.081
 perspective+0.079
joy+0.077
Neg logit contributions
my-0.151
 next-0.138
 over-0.130
 with-0.129
 packed-0.128
Tokenes
Token ID274
Feature activation+3.26e-02

Pos logit contributions
 conclusions+0.151
 fingertips+0.127
 mischief+0.125
 sidelines+0.124
 anth+0.123
Neg logit contributions
my-0.129
 next-0.113
 over-0.112
 with-0.109
o-0.107
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
 conclusions+0.097
ocol+0.093
alks+0.092
 Majesty+0.091
 fingertips+0.090
Neg logit contributions
 next-0.029
 with-0.027
my-0.026
 over-0.025
 from-0.022
Token and
Token ID290
Feature activation+1.71e-02

Pos logit contributions
 conclusions+0.041
alks+0.040
ocol+0.038
oles+0.038
 Majesty+0.038
Neg logit contributions
my-0.014
 next-0.012
 tidy-0.011
 over-0.011
 packed-0.010
Token fast
Token ID3049
Feature activation+1.08e-02

Pos logit contributions
 conclusions+0.033
alks+0.033
ocol+0.031
oles+0.031
 fingertips+0.030
Neg logit contributions
my-0.027
 next-0.026
 over-0.024
 tidy-0.024
 packed-0.023
Token ze
Token ID41271
Feature activation+9.65e-02

Pos logit contributions
 conclusions+0.024
oles+0.024
 fingertips+0.024
apa+0.023
ocol+0.023
Neg logit contributions
 next-0.013
my-0.013
 with-0.013
 over-0.012
 set-0.012
Tokenbr
Token ID1671
Feature activation+3.88e-02

Pos logit contributions
 conclusions+0.246
 perspective+0.234
 Majesty+0.228
 anth+0.227
 fingertips+0.227
Neg logit contributions
 next-0.087
my-0.086
 over-0.080
 ride-0.077
go-0.077
Tokenas
Token ID292
Feature activation+2.96e-02

Pos logit contributions
 conclusions+0.055
alks+0.050
ocol+0.047
oles+0.047
 Majesty+0.047
Neg logit contributions
my-0.080
 next-0.076
 tidy-0.073
 over-0.072
 packed-0.071
Token.
Token ID13
Feature activation+9.57e-03

Pos logit contributions
 conclusions+0.072
 Majesty+0.068
alks+0.067
ocol+0.067
 fingertips+0.066
Neg logit contributions
my-0.037
 over-0.035
 next-0.035
 with-0.030
 from-0.029
Token Tim
Token ID5045
Feature activation+2.71e-03

Pos logit contributions
 conclusions+0.023
alks+0.022
ocol+0.022
 Majesty+0.022
oles+0.021
Neg logit contributions
my-0.011
 next-0.010
 over-0.009
 tidy-0.009
 clean-0.008
Token was
Token ID373
Feature activation+1.14e-02

Pos logit contributions
 conclusions+0.007
alks+0.006
oles+0.006
 Majesty+0.006
 perspective+0.006
Neg logit contributions
my-0.002
 next-0.002
 tidy-0.002
 over-0.002
 clean-0.002
Token animals
Token ID4695
Feature activation+2.30e-02

Pos logit contributions
 conclusions+0.013
alks+0.012
oles+0.012
ocol+0.012
 Majesty+0.012
Neg logit contributions
my-0.011
 next-0.010
 tidy-0.010
 over-0.009
 clean-0.009
Token,
Token ID11
Feature activation+9.72e-03

Pos logit contributions
 conclusions+0.054
alks+0.052
 Majesty+0.050
ocol+0.050
oles+0.049
Neg logit contributions
my-0.026
 next-0.024
 tidy-0.022
 over-0.022
 clean-0.021
Token like
Token ID588
Feature activation+3.86e-02

Pos logit contributions
 conclusions+0.021
alks+0.020
oles+0.019
 predators+0.019
ocol+0.019
Neg logit contributions
my-0.012
 next-0.011
 tidy-0.011
 over-0.011
 clean-0.010
Token lions
Token ID31420
Feature activation+5.04e-02

Pos logit contributions
 conclusions+0.061
alks+0.059
ocol+0.056
oles+0.054
 fingertips+0.054
Neg logit contributions
my-0.074
 next-0.071
 tidy-0.067
 over-0.067
 packed-0.066
Token,
Token ID11
Feature activation+1.38e-02

Pos logit contributions
 conclusions+0.133
alks+0.126
 Majesty+0.123
ocol+0.122
 perspective+0.121
Neg logit contributions
my-0.055
 next-0.052
 with-0.046
 over-0.044
 made-0.041
Token ze
Token ID41271
Feature activation+9.50e-02

Pos logit contributions
 conclusions+0.017
alks+0.016
oles+0.015
ocol+0.015
 Majesty+0.015
Neg logit contributions
my-0.030
 next-0.029
 tidy-0.028
 over-0.028
 packed-0.027
Tokenbr
Token ID1671
Feature activation+3.10e-02

Pos logit contributions
 conclusions+0.234
 perspective+0.221
 Majesty+0.220
 fingertips+0.216
 anth+0.212
Neg logit contributions
my-0.100
 next-0.092
 over-0.082
 with-0.079
 packed-0.078
Tokenas
Token ID292
Feature activation+3.80e-02

Pos logit contributions
 conclusions+0.040
alks+0.038
oles+0.036
ocol+0.035
 Majesty+0.034
Neg logit contributions
my-0.066
 next-0.063
 tidy-0.062
 over-0.060
 clean-0.060
Token,
Token ID11
Feature activation+1.36e-02

Pos logit contributions
 conclusions+0.103
alks+0.095
 Majesty+0.095
ocol+0.093
 fingertips+0.093
Neg logit contributions
my-0.040
 next-0.036
 over-0.033
 with-0.031
 from-0.028
Token and
Token ID290
Feature activation+2.24e-02

Pos logit contributions
 conclusions+0.033
alks+0.032
oles+0.031
 Majesty+0.031
ocol+0.031
Neg logit contributions
my-0.013
 next-0.012
 tidy-0.012
 clean-0.011
 over-0.011
Token gir
Token ID37370
Feature activation+6.54e-02

Pos logit contributions
alks+0.033
 conclusions+0.032
ocol+0.031
oles+0.030
 fingertips+0.029
Neg logit contributions
my-0.045
 next-0.043
 over-0.041
 tidy-0.041
 packed-0.040
Token animals
Token ID4695
Feature activation+2.24e-02

Pos logit contributions
 conclusions+0.016
alks+0.015
oles+0.014
ocol+0.014
 Majesty+0.014
Neg logit contributions
my-0.012
 next-0.012
 tidy-0.011
 over-0.011
 clean-0.011
Token,
Token ID11
Feature activation+1.17e-02

Pos logit contributions
 conclusions+0.051
alks+0.049
 Majesty+0.047
ocol+0.047
oles+0.046
Neg logit contributions
my-0.028
 next-0.026
 tidy-0.024
 over-0.024
 clean-0.023
Token like
Token ID588
Feature activation+3.97e-02

Pos logit contributions
 conclusions+0.025
alks+0.023
oles+0.023
 predators+0.023
ocol+0.022
Neg logit contributions
my-0.015
 next-0.013
 tidy-0.013
 over-0.013
pack-0.012
Token lions
Token ID31420
Feature activation+4.85e-02

Pos logit contributions
 conclusions+0.061
alks+0.059
ocol+0.056
 fingertips+0.055
 Majesty+0.054
Neg logit contributions
my-0.077
 next-0.075
 over-0.070
 tidy-0.070
 packed-0.070
Token,
Token ID11
Feature activation+1.53e-02

Pos logit contributions
 conclusions+0.127
alks+0.120
 Majesty+0.118
ocol+0.117
 perspective+0.116
Neg logit contributions
my-0.052
 next-0.050
 with-0.045
 over-0.043
 made-0.039
Token ze
Token ID41271
Feature activation+9.40e-02

Pos logit contributions
 conclusions+0.019
alks+0.018
ocol+0.016
oles+0.016
 Majesty+0.016
Neg logit contributions
my-0.034
 next-0.033
 tidy-0.032
 over-0.031
 packed-0.031
Tokenbr
Token ID1671
Feature activation+3.32e-02

Pos logit contributions
 conclusions+0.232
 perspective+0.219
 Majesty+0.219
 fingertips+0.214
joy+0.209
Neg logit contributions
my-0.099
 next-0.091
 over-0.081
 with-0.079
 packed-0.077
Tokenas
Token ID292
Feature activation+3.81e-02

Pos logit contributions
 conclusions+0.044
alks+0.042
oles+0.039
ocol+0.039
 Majesty+0.038
Neg logit contributions
my-0.070
 next-0.067
 tidy-0.065
 over-0.064
 clean-0.063
Token,
Token ID11
Feature activation+1.50e-02

Pos logit contributions
 conclusions+0.103
 Majesty+0.096
alks+0.095
 fingertips+0.094
ocol+0.093
Neg logit contributions
my-0.040
 next-0.035
 over-0.033
 with-0.031
 from-0.028
Token and
Token ID290
Feature activation+2.24e-02

Pos logit contributions
 conclusions+0.037
alks+0.036
oles+0.035
ocol+0.034
 Majesty+0.034
Neg logit contributions
my-0.015
 next-0.013
 tidy-0.013
 over-0.012
 clean-0.012
Token gir
Token ID37370
Feature activation+6.10e-02

Pos logit contributions
alks+0.033
 conclusions+0.032
ocol+0.031
oles+0.030
 fingertips+0.030
Neg logit contributions
my-0.045
 next-0.043
 over-0.040
 tidy-0.040
 packed-0.040
Token,
Token ID11
Feature activation+8.10e-03

Pos logit contributions
 conclusions+0.056
alks+0.054
 Majesty+0.053
 perspective+0.052
ocol+0.052
Neg logit contributions
my-0.023
 next-0.023
 over-0.020
 tidy-0.019
 clean-0.018
Token gir
Token ID37370
Feature activation+5.32e-02

Pos logit contributions
alks+0.012
 conclusions+0.012
 perspective+0.012
 Majesty+0.011
ocol+0.011
Neg logit contributions
my-0.016
 next-0.016
 over-0.014
 tidy-0.014
 clean-0.014
Tokenaff
Token ID2001
Feature activation+5.29e-02

Pos logit contributions
 conclusions+0.070
 fingertips+0.060
 perspective+0.060
 Majesty+0.059
 colony+0.058
Neg logit contributions
my-0.117
 next-0.110
 over-0.104
 with-0.102
 from-0.099
Tokenes
Token ID274
Feature activation+2.12e-02

Pos logit contributions
 conclusions+0.098
 fingertips+0.087
 anth+0.086
 perspective+0.084
joy+0.082
Neg logit contributions
my-0.100
 next-0.091
 over-0.088
 with-0.084
 also-0.084
Token,
Token ID11
Feature activation+1.42e-02

Pos logit contributions
 conclusions+0.061
alks+0.058
 Majesty+0.058
ocol+0.057
 fingertips+0.057
Neg logit contributions
 next-0.017
 over-0.016
my-0.016
 with-0.015
 from-0.013
Token ze
Token ID41271
Feature activation+9.19e-02

Pos logit contributions
 conclusions+0.027
alks+0.026
 perspective+0.025
oles+0.024
ocol+0.024
Neg logit contributions
my-0.023
 next-0.023
 over-0.021
 tidy-0.020
 clean-0.020
Tokenbr
Token ID1671
Feature activation+3.81e-02

Pos logit contributions
 conclusions+0.225
 perspective+0.219
 fingertips+0.214
 anth+0.213
 Majesty+0.211
Neg logit contributions
my-0.090
 next-0.087
 over-0.077
 with-0.077
 have-0.071
Tokenas
Token ID292
Feature activation+3.51e-02

Pos logit contributions
 conclusions+0.058
alks+0.051
 Majesty+0.049
oles+0.048
ocol+0.048
Neg logit contributions
my-0.076
 next-0.071
 over-0.068
 tidy-0.068
 packed-0.067
Token,
Token ID11
Feature activation+1.82e-02

Pos logit contributions
 conclusions+0.095
 Majesty+0.089
alks+0.088
 anth+0.086
 fingertips+0.086
Neg logit contributions
my-0.035
 next-0.032
 over-0.031
 with-0.028
 from-0.026
Token and
Token ID290
Feature activation+2.64e-02

Pos logit contributions
 conclusions+0.043
alks+0.042
ocol+0.040
oles+0.040
 perspective+0.040
Neg logit contributions
my-0.021
 next-0.020
 over-0.017
 tidy-0.017
 clean-0.017
Token more
Token ID517
Feature activation+2.74e-02

Pos logit contributions
alks+0.044
 conclusions+0.042
oles+0.041
ocol+0.041
 perspective+0.040
Neg logit contributions
my-0.048
 next-0.047
 over-0.044
 tidy-0.043
 clean-0.042
Token animals
Token ID4695
Feature activation+2.17e-02

Pos logit contributions
 conclusions+0.016
alks+0.016
oles+0.015
ocol+0.015
 Majesty+0.015
Neg logit contributions
my-0.013
 next-0.012
 tidy-0.012
 over-0.011
 clean-0.011
Token,
Token ID11
Feature activation+1.01e-02

Pos logit contributions
 conclusions+0.050
alks+0.048
 Majesty+0.046
ocol+0.046
oles+0.046
Neg logit contributions
my-0.026
 next-0.024
 tidy-0.022
 over-0.022
 clean-0.021
Token like
Token ID588
Feature activation+3.72e-02

Pos logit contributions
 conclusions+0.022
alks+0.021
oles+0.020
 predators+0.020
ocol+0.020
Neg logit contributions
my-0.012
 next-0.011
 tidy-0.011
 over-0.011
 clean-0.010
Token lions
Token ID31420
Feature activation+4.72e-02

Pos logit contributions
 conclusions+0.057
alks+0.055
ocol+0.053
 fingertips+0.051
oles+0.051
Neg logit contributions
my-0.072
 next-0.070
 tidy-0.066
 over-0.066
 packed-0.065
Token,
Token ID11
Feature activation+1.43e-02

Pos logit contributions
 conclusions+0.122
alks+0.115
 Majesty+0.113
ocol+0.112
 perspective+0.111
Neg logit contributions
my-0.052
 next-0.050
 with-0.044
 over-0.043
 tidy-0.039
Token ze
Token ID41271
Feature activation+8.92e-02

Pos logit contributions
 conclusions+0.018
alks+0.017
ocol+0.016
oles+0.016
 Majesty+0.015
Neg logit contributions
my-0.032
 next-0.030
 tidy-0.029
 over-0.029
 packed-0.028
Tokenbr
Token ID1671
Feature activation+3.19e-02

Pos logit contributions
 conclusions+0.219
 perspective+0.206
 Majesty+0.205
 fingertips+0.201
joy+0.197
Neg logit contributions
my-0.096
 next-0.090
 over-0.079
 with-0.077
 packed-0.077
Tokenas
Token ID292
Feature activation+3.50e-02

Pos logit contributions
 conclusions+0.043
alks+0.041
oles+0.038
ocol+0.038
 Majesty+0.037
Neg logit contributions
my-0.067
 next-0.063
 tidy-0.062
 over-0.061
 packed-0.060
Token,
Token ID11
Feature activation+1.45e-02

Pos logit contributions
 conclusions+0.094
 Majesty+0.087
alks+0.087
 fingertips+0.086
ocol+0.085
Neg logit contributions
my-0.037
 next-0.032
 over-0.030
 with-0.028
 from-0.025
Token and
Token ID290
Feature activation+2.18e-02

Pos logit contributions
 conclusions+0.036
alks+0.035
oles+0.034
ocol+0.034
 Majesty+0.033
Neg logit contributions
my-0.014
 next-0.013
 tidy-0.012
 over-0.011
 packed-0.011
Token gir
Token ID37370
Feature activation+5.62e-02

Pos logit contributions
alks+0.033
 conclusions+0.032
ocol+0.031
oles+0.030
 fingertips+0.030
Neg logit contributions
my-0.043
 next-0.041
 over-0.039
 tidy-0.038
 packed-0.038
Token many
Token ID867
Feature activation+4.62e-03

Pos logit contributions
alks+0.004
 conclusions+0.004
 predators+0.004
 Majesty+0.004
oles+0.004
Neg logit contributions
my-0.003
 next-0.003
 tidy-0.003
pack-0.003
 clean-0.003
Token animals
Token ID4695
Feature activation+1.56e-02

Pos logit contributions
 conclusions+0.009
alks+0.009
oles+0.008
ocol+0.008
 Majesty+0.008
Neg logit contributions
my-0.007
 next-0.007
 tidy-0.006
 packed-0.006
 over-0.006
Token like
Token ID588
Feature activation+3.12e-02

Pos logit contributions
 conclusions+0.040
alks+0.038
 Majesty+0.037
ocol+0.037
oles+0.036
Neg logit contributions
my-0.015
 next-0.013
 over-0.013
 tidy-0.012
 packed-0.012
Token lions
Token ID31420
Feature activation+3.78e-02

Pos logit contributions
 conclusions+0.054
alks+0.050
ocol+0.049
 fingertips+0.048
 perspective+0.047
Neg logit contributions
 next-0.058
my-0.057
 packed-0.054
 over-0.053
 set-0.053
Token,
Token ID11
Feature activation+9.20e-03

Pos logit contributions
 conclusions+0.106
 Majesty+0.100
alks+0.099
ocol+0.099
 perspective+0.098
Neg logit contributions
my-0.035
 next-0.035
 with-0.032
 over-0.030
 made-0.028
Token ze
Token ID41271
Feature activation+8.90e-02

Pos logit contributions
 conclusions+0.012
alks+0.012
ocol+0.011
oles+0.011
 Majesty+0.010
Neg logit contributions
my-0.020
 next-0.019
 tidy-0.018
 over-0.018
 packed-0.018
Tokenbr
Token ID1671
Feature activation+2.76e-02

Pos logit contributions
 conclusions+0.224
 perspective+0.211
 Majesty+0.208
 fingertips+0.206
 anth+0.202
Neg logit contributions
my-0.090
 next-0.087
 packed-0.076
 ride-0.075
go-0.075
Tokenas
Token ID292
Feature activation+2.76e-02

Pos logit contributions
 conclusions+0.038
alks+0.035
oles+0.033
ocol+0.033
 Majesty+0.032
Neg logit contributions
my-0.058
 next-0.055
 tidy-0.053
 over-0.052
 packed-0.052
Token and
Token ID290
Feature activation+2.04e-02

Pos logit contributions
 conclusions+0.074
 fingertips+0.068
 Majesty+0.068
ocol+0.068
alks+0.067
Neg logit contributions
my-0.031
 next-0.027
 over-0.027
 with-0.025
 from-0.022
Token elephants
Token ID25368
Feature activation+3.61e-02

Pos logit contributions
 conclusions+0.029
alks+0.029
ocol+0.028
oles+0.027
 fingertips+0.026
Neg logit contributions
my-0.042
 next-0.040
 over-0.038
 packed-0.038
 tidy-0.038
Token.
Token ID13
Feature activation+8.04e-03

Pos logit contributions
 conclusions+0.100
 Majesty+0.093
ocol+0.093
 fingertips+0.092
joy+0.090
Neg logit contributions
my-0.038
 next-0.035
 with-0.034
 over-0.034
 on-0.029
Token animals
Token ID4695
Feature activation+2.42e-02

Pos logit contributions
 conclusions+0.053
alks+0.052
 Majesty+0.051
oles+0.050
 fingertips+0.050
Neg logit contributions
my-0.038
 next-0.033
 over-0.031
 tidy-0.030
 with-0.030
Token,
Token ID11
Feature activation+8.84e-03

Pos logit contributions
 conclusions+0.057
alks+0.054
 Majesty+0.054
ocol+0.053
 fingertips+0.052
Neg logit contributions
my-0.030
 next-0.029
 over-0.028
 with-0.025
 packed-0.025
Token like
Token ID588
Feature activation+3.75e-02

Pos logit contributions
 conclusions+0.019
alks+0.019
oles+0.018
 predators+0.018
ocol+0.018
Neg logit contributions
my-0.010
 next-0.009
 tidy-0.009
 over-0.009
 clean-0.009
Token lions
Token ID31420
Feature activation+4.41e-02

Pos logit contributions
 conclusions+0.060
alks+0.058
ocol+0.055
oles+0.054
 Majesty+0.053
Neg logit contributions
my-0.071
 next-0.068
 tidy-0.065
 over-0.064
 packed-0.064
Token,
Token ID11
Feature activation+1.25e-02

Pos logit contributions
 conclusions+0.118
alks+0.114
 Majesty+0.110
 fingertips+0.110
ocol+0.109
Neg logit contributions
my-0.045
 next-0.044
 with-0.040
 over-0.038
 made-0.034
Token ze
Token ID41271
Feature activation+8.89e-02

Pos logit contributions
 conclusions+0.017
alks+0.016
ocol+0.015
oles+0.015
 Majesty+0.014
Neg logit contributions
my-0.027
 next-0.026
 over-0.024
 tidy-0.024
 packed-0.024
Tokenbr
Token ID1671
Feature activation+2.95e-02

Pos logit contributions
 conclusions+0.223
 perspective+0.214
 Majesty+0.212
 fingertips+0.210
 anth+0.205
Neg logit contributions
my-0.085
 next-0.082
 over-0.070
 ride-0.070
 with-0.069
Tokenas
Token ID292
Feature activation+3.52e-02

Pos logit contributions
 conclusions+0.041
alks+0.038
oles+0.035
ocol+0.035
 Majesty+0.035
Neg logit contributions
my-0.061
 next-0.058
 tidy-0.056
 over-0.056
 packed-0.055
Token,
Token ID11
Feature activation+1.31e-02

Pos logit contributions
 conclusions+0.096
alks+0.090
 fingertips+0.089
 Majesty+0.089
ocol+0.088
Neg logit contributions
my-0.036
 next-0.032
 over-0.030
 with-0.028
 from-0.025
Token gir
Token ID37370
Feature activation+6.32e-02

Pos logit contributions
 conclusions+0.033
alks+0.032
ocol+0.031
oles+0.031
 Majesty+0.030
Neg logit contributions
my-0.013
 next-0.012
 tidy-0.010
 over-0.010
 packed-0.010
Tokenaff
Token ID2001
Feature activation+6.33e-02

Pos logit contributions
 conclusions+0.093
 fingertips+0.078
 Majesty+0.075
 perspective+0.074
 colony+0.074
Neg logit contributions
my-0.134
 next-0.123
 over-0.116
 with-0.115
 packed-0.113
Token lions
Token ID31420
Feature activation+4.94e-02

Pos logit contributions
 conclusions+0.068
alks+0.067
ocol+0.063
 fingertips+0.062
 Majesty+0.060
Neg logit contributions
my-0.087
 next-0.085
 over-0.080
 tidy-0.080
 packed-0.079
Token,
Token ID11
Feature activation+1.47e-02

Pos logit contributions
 conclusions+0.131
alks+0.124
 Majesty+0.122
ocol+0.120
 perspective+0.119
Neg logit contributions
my-0.053
 next-0.050
 with-0.045
 over-0.044
 made-0.040
Token monkeys
Token ID26697
Feature activation+4.55e-02

Pos logit contributions
 conclusions+0.021
alks+0.020
ocol+0.019
oles+0.018
 Majesty+0.018
Neg logit contributions
my-0.030
 next-0.029
 tidy-0.027
 over-0.027
 packed-0.027
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
 conclusions+0.123
alks+0.114
 Majesty+0.112
ocol+0.112
 fingertips+0.109
Neg logit contributions
my-0.051
 next-0.044
 over-0.040
 with-0.040
 clean-0.035
Token and
Token ID290
Feature activation+1.82e-02

Pos logit contributions
 conclusions+0.030
alks+0.029
ocol+0.028
oles+0.028
 Majesty+0.028
Neg logit contributions
my-0.010
 next-0.009
 over-0.008
 tidy-0.008
 packed-0.007
Token ze
Token ID41271
Feature activation+8.81e-02

Pos logit contributions
alks+0.030
 conclusions+0.029
ocol+0.028
oles+0.028
 fingertips+0.027
Neg logit contributions
my-0.033
 next-0.032
 tidy-0.030
 over-0.030
 packed-0.029
Tokenbr
Token ID1671
Feature activation+3.15e-02

Pos logit contributions
 conclusions+0.217
 perspective+0.208
 Majesty+0.205
 fingertips+0.204
 anth+0.201
Neg logit contributions
my-0.089
 next-0.084
 over-0.077
 ride-0.075
 with-0.075
Tokenas
Token ID292
Feature activation+1.99e-02

Pos logit contributions
 conclusions+0.043
alks+0.041
oles+0.038
ocol+0.038
 Majesty+0.037
Neg logit contributions
my-0.065
 next-0.062
 tidy-0.060
 over-0.059
 packed-0.059
Token.
Token ID13
Feature activation-5.80e-03

Pos logit contributions
 conclusions+0.048
 Majesty+0.046
alks+0.045
 fingertips+0.045
ocol+0.044
Neg logit contributions
my-0.025
 next-0.023
 over-0.023
 with-0.020
 set-0.019
Token Sara
Token ID24799
Feature activation+4.90e-03

Pos logit contributions
my+0.007
 next+0.007
 tidy+0.006
 over+0.006
 packed+0.006
Neg logit contributions
 conclusions-0.013
alks-0.012
oles-0.012
ocol-0.012
 Majesty-0.012
Token liked
Token ID8288
Feature activation+2.40e-02

Pos logit contributions
 conclusions+0.013
alks+0.012
ocol+0.012
oles+0.012
 Majesty+0.012
Neg logit contributions
my-0.005
 next-0.004
 tidy-0.004
 over-0.004
 packed-0.003
Token animals
Token ID4695
Feature activation+2.08e-02

Pos logit contributions
 conclusions+0.010
alks+0.010
oles+0.010
ocol+0.010
 Majesty+0.010
Neg logit contributions
my-0.008
 next-0.007
 tidy-0.007
 over-0.007
 clean-0.007
Token,
Token ID11
Feature activation+8.14e-03

Pos logit contributions
 conclusions+0.050
alks+0.048
 Majesty+0.047
ocol+0.046
 fingertips+0.045
Neg logit contributions
my-0.023
 next-0.022
 over-0.020
 tidy-0.020
 clean-0.019
Token like
Token ID588
Feature activation+3.65e-02

Pos logit contributions
 conclusions+0.019
alks+0.018
oles+0.017
ocol+0.017
 Majesty+0.017
Neg logit contributions
my-0.009
 next-0.008
 tidy-0.008
 over-0.008
 clean-0.008
Token lions
Token ID31420
Feature activation+4.58e-02

Pos logit contributions
 conclusions+0.059
alks+0.057
ocol+0.055
 fingertips+0.053
 Majesty+0.052
Neg logit contributions
my-0.068
 next-0.066
 over-0.062
 tidy-0.062
 packed-0.061
Token,
Token ID11
Feature activation+1.44e-02

Pos logit contributions
 conclusions+0.120
alks+0.114
 Majesty+0.111
ocol+0.110
 perspective+0.110
Neg logit contributions
my-0.049
 next-0.048
 with-0.042
 over-0.040
 made-0.037
Token ze
Token ID41271
Feature activation+8.80e-02

Pos logit contributions
 conclusions+0.019
alks+0.018
ocol+0.017
oles+0.016
 Majesty+0.016
Neg logit contributions
my-0.031
 next-0.030
 tidy-0.029
 over-0.028
 packed-0.028
Tokenbr
Token ID1671
Feature activation+3.20e-02

Pos logit contributions
 conclusions+0.216
 perspective+0.206
 Majesty+0.203
 fingertips+0.201
 anth+0.197
Neg logit contributions
my-0.092
 next-0.087
 over-0.075
 packed-0.073
 with-0.073
Tokenas
Token ID292
Feature activation+3.72e-02

Pos logit contributions
 conclusions+0.046
alks+0.042
 Majesty+0.040
oles+0.039
ocol+0.039
Neg logit contributions
my-0.065
 next-0.062
 tidy-0.059
 over-0.059
 packed-0.058
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
 conclusions+0.100
 Majesty+0.093
alks+0.093
 fingertips+0.092
ocol+0.091
Neg logit contributions
my-0.039
 next-0.035
 over-0.032
 with-0.030
 from-0.027
Token and
Token ID290
Feature activation+2.17e-02

Pos logit contributions
 conclusions+0.040
alks+0.039
ocol+0.037
oles+0.037
 Majesty+0.037
Neg logit contributions
my-0.015
 next-0.014
 tidy-0.012
 over-0.012
 packed-0.012
Token gir
Token ID37370
Feature activation+5.76e-02

Pos logit contributions
alks+0.034
 conclusions+0.033
ocol+0.032
oles+0.031
 fingertips+0.030
Neg logit contributions
my-0.042
 next-0.041
 over-0.038
 tidy-0.037
 packed-0.037
Token lions
Token ID31420
Feature activation+4.43e-02

Pos logit contributions
 conclusions+0.074
alks+0.072
 fingertips+0.070
ocol+0.067
 perspective+0.066
Neg logit contributions
my-0.076
 next-0.075
 over-0.071
 tidy-0.070
 clean-0.069
Token,
Token ID11
Feature activation+1.04e-02

Pos logit contributions
 conclusions+0.120
alks+0.115
 Majesty+0.114
ocol+0.110
 fingertips+0.110
Neg logit contributions
my-0.046
 next-0.042
 with-0.039
 over-0.037
 made-0.034
Token monkeys
Token ID26697
Feature activation+3.75e-02

Pos logit contributions
 conclusions+0.016
alks+0.016
ocol+0.015
 Majesty+0.014
 fingertips+0.014
Neg logit contributions
my-0.021
 next-0.020
 over-0.019
 tidy-0.019
 clean-0.018
Token,
Token ID11
Feature activation+8.12e-03

Pos logit contributions
 conclusions+0.106
alks+0.098
ocol+0.097
 Majesty+0.097
 fingertips+0.095
Neg logit contributions
my-0.039
 next-0.031
 over-0.028
 with-0.028
 made-0.025
Token and
Token ID290
Feature activation+1.12e-02

Pos logit contributions
 conclusions+0.018
alks+0.017
ocol+0.017
oles+0.016
 fingertips+0.016
Neg logit contributions
my-0.011
 next-0.011
 over-0.010
 tidy-0.009
 packed-0.009
Token ze
Token ID41271
Feature activation+8.73e-02

Pos logit contributions
alks+0.019
 conclusions+0.019
ocol+0.018
oles+0.018
 fingertips+0.017
Neg logit contributions
my-0.020
 next-0.019
 over-0.018
 tidy-0.018
 clean-0.018
Tokenbr
Token ID1671
Feature activation+3.22e-02

Pos logit contributions
 conclusions+0.204
 perspective+0.195
 fingertips+0.192
 Majesty+0.190
 anth+0.188
Neg logit contributions
my-0.099
 next-0.093
 over-0.088
go-0.085
 with-0.083
Tokenas
Token ID292
Feature activation+2.00e-02

Pos logit contributions
 conclusions+0.045
alks+0.041
oles+0.039
ocol+0.039
 Majesty+0.038
Neg logit contributions
my-0.067
 next-0.063
 tidy-0.061
 over-0.060
 packed-0.059
Token.
Token ID13
Feature activation+1.34e-02

Pos logit contributions
 conclusions+0.047
alks+0.044
ocol+0.043
 Majesty+0.043
 fingertips+0.043
Neg logit contributions
my-0.027
 next-0.024
 over-0.023
 with-0.020
 packed-0.020
Token But
Token ID887
Feature activation+2.76e-04

Pos logit contributions
 conclusions+0.035
ocol+0.033
alks+0.033
 perspective+0.033
oles+0.032
Neg logit contributions
my-0.014
 next-0.011
 over-0.011
 tidy-0.010
 clean-0.009
Token the
Token ID262
Feature activation+4.67e-03

Pos logit contributions
 conclusions+0.001
alks+0.001
oles+0.001
ocol+0.001
 Majesty+0.001
Neg logit contributions
my-0.000
 next-0.000
 tidy-0.000
 over-0.000
 clean-0.000
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
 conclusions+0.099
alks+0.095
 Majesty+0.092
ocol+0.091
 perspective+0.090
Neg logit contributions
my-0.033
 next-0.032
 over-0.028
 with-0.026
 packed-0.024
Token tigers
Token ID43114
Feature activation+4.59e-02

Pos logit contributions
 conclusions+0.021
alks+0.021
ocol+0.020
 Majesty+0.019
 perspective+0.019
Neg logit contributions
my-0.025
 next-0.024
 over-0.023
 tidy-0.023
 packed-0.023
Token,
Token ID11
Feature activation+1.39e-02

Pos logit contributions
 conclusions+0.125
 Majesty+0.118
alks+0.118
ocol+0.116
 fingertips+0.113
Neg logit contributions
my-0.044
 next-0.040
 with-0.037
 over-0.037
 made-0.033
Token monkeys
Token ID26697
Feature activation+4.19e-02

Pos logit contributions
 conclusions+0.036
alks+0.035
ocol+0.034
oles+0.033
 Majesty+0.033
Neg logit contributions
my-0.014
 next-0.012
 over-0.011
 tidy-0.010
 packed-0.010
Token and
Token ID290
Feature activation+2.44e-02

Pos logit contributions
 conclusions+0.118
alks+0.108
 Majesty+0.107
ocol+0.106
 fingertips+0.103
Neg logit contributions
my-0.043
 next-0.036
 with-0.034
 over-0.033
 made-0.029
Token ze
Token ID41271
Feature activation+8.67e-02

Pos logit contributions
alks+0.045
 conclusions+0.044
ocol+0.043
oles+0.041
 fingertips+0.041
Neg logit contributions
my-0.040
 next-0.038
 over-0.036
 tidy-0.036
 made-0.035
Tokenbr
Token ID1671
Feature activation+3.73e-02

Pos logit contributions
 conclusions+0.219
 Majesty+0.208
 perspective+0.206
 fingertips+0.202
 anth+0.200
Neg logit contributions
my-0.084
 next-0.075
 over-0.074
 ride-0.070
 with-0.069
Tokenas
Token ID292
Feature activation+2.21e-02

Pos logit contributions
 conclusions+0.058
alks+0.051
 Majesty+0.050
ocol+0.048
joy+0.048
Neg logit contributions
my-0.074
 next-0.068
 over-0.066
 tidy-0.066
 packed-0.065
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
 conclusions+0.054
 Majesty+0.051
alks+0.050
ocol+0.050
 fingertips+0.049
Neg logit contributions
my-0.028
 over-0.026
 next-0.025
 with-0.023
 from-0.021
Token He
Token ID679
Feature activation+8.47e-03

Pos logit contributions
 conclusions+0.030
ocol+0.029
oles+0.029
 perspective+0.028
alks+0.027
Neg logit contributions
my-0.011
 over-0.011
 next-0.011
 tidy-0.010
 clean-0.010
Token was
Token ID373
Feature activation+1.06e-02

Pos logit contributions
 conclusions+0.023
alks+0.022
ocol+0.021
 perspective+0.020
oles+0.020
Neg logit contributions
my-0.008
 next-0.007
 packed-0.007
 over-0.007
 tidy-0.007
Token of
Token ID286
Feature activation+2.10e-02

Pos logit contributions
my+0.018
 pack+0.014
 tidy+0.014
 next+0.014
 packed+0.014
Neg logit contributions
alks-0.039
oles-0.038
 conclusions-0.038
ocol-0.038
 predators-0.036
Token animals
Token ID4695
Feature activation+2.76e-02

Pos logit contributions
 conclusions+0.044
alks+0.043
oles+0.041
 Majesty+0.041
ocol+0.040
Neg logit contributions
my-0.029
 next-0.027
 tidy-0.026
 over-0.025
 clean-0.025
Token like
Token ID588
Feature activation+3.37e-02

Pos logit contributions
 conclusions+0.068
alks+0.066
ocol+0.064
 Majesty+0.064
oles+0.063
Neg logit contributions
my-0.029
 next-0.027
 over-0.024
 tidy-0.024
 clean-0.023
Token lions
Token ID31420
Feature activation+4.38e-02

Pos logit contributions
 conclusions+0.056
alks+0.052
ocol+0.050
 fingertips+0.050
 Majesty+0.049
Neg logit contributions
my-0.063
 next-0.061
 tidy-0.057
 over-0.057
 packed-0.056
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
 conclusions+0.120
alks+0.115
ocol+0.111
 Majesty+0.111
 fingertips+0.110
Neg logit contributions
my-0.044
 next-0.041
 with-0.037
 over-0.035
 made-0.032
Token ze
Token ID41271
Feature activation+8.60e-02

Pos logit contributions
 conclusions+0.014
alks+0.013
ocol+0.012
oles+0.012
 Majesty+0.012
Neg logit contributions
my-0.026
 next-0.024
 tidy-0.023
 over-0.023
 packed-0.023
Tokenbr
Token ID1671
Feature activation+2.83e-02

Pos logit contributions
 conclusions+0.218
 perspective+0.204
 Majesty+0.204
 fingertips+0.200
alks+0.197
Neg logit contributions
my-0.086
 next-0.080
 over-0.070
 with-0.069
 ride-0.068
Tokenas
Token ID292
Feature activation+3.14e-02

Pos logit contributions
 conclusions+0.038
alks+0.036
oles+0.033
ocol+0.033
 Majesty+0.033
Neg logit contributions
my-0.059
 next-0.056
 tidy-0.055
 over-0.054
 packed-0.053
Token,
Token ID11
Feature activation+9.28e-03

Pos logit contributions
 conclusions+0.085
alks+0.080
 fingertips+0.078
ocol+0.078
 Majesty+0.077
Neg logit contributions
my-0.034
 next-0.028
 over-0.027
 with-0.024
 packed-0.022
Token and
Token ID290
Feature activation+1.38e-02

Pos logit contributions
 conclusions+0.019
alks+0.019
oles+0.018
ocol+0.018
 Majesty+0.018
Neg logit contributions
my-0.013
 next-0.012
 tidy-0.011
 over-0.011
 clean-0.011
Token gir
Token ID37370
Feature activation+4.79e-02

Pos logit contributions
 conclusions+0.021
alks+0.020
ocol+0.019
oles+0.019
 fingertips+0.018
Neg logit contributions
my-0.028
 next-0.026
 over-0.025
 tidy-0.025
 packed-0.024
Token animals
Token ID4695
Feature activation+2.24e-02

Pos logit contributions
alks+0.018
 conclusions+0.017
oles+0.016
 Majesty+0.016
 fingertips+0.016
Neg logit contributions
my-0.011
 next-0.010
 tidy-0.010
 clean-0.009
 organized-0.009
Token,
Token ID11
Feature activation+7.27e-03

Pos logit contributions
 conclusions+0.057
alks+0.056
 Majesty+0.054
ocol+0.054
 fingertips+0.053
Neg logit contributions
my-0.023
 next-0.022
 over-0.021
 tidy-0.019
 clean-0.018
Token like
Token ID588
Feature activation+3.36e-02

Pos logit contributions
 conclusions+0.016
alks+0.016
oles+0.015
ocol+0.015
 Majesty+0.015
Neg logit contributions
my-0.009
 next-0.008
 tidy-0.008
 over-0.007
 clean-0.007
Token lions
Token ID31420
Feature activation+3.84e-02

Pos logit contributions
 conclusions+0.065
alks+0.062
ocol+0.060
 fingertips+0.058
oles+0.057
Neg logit contributions
my-0.054
 next-0.053
 tidy-0.050
 over-0.050
 clean-0.049
Token,
Token ID11
Feature activation+6.70e-03

Pos logit contributions
 conclusions+0.107
alks+0.102
ocol+0.099
 Majesty+0.098
 perspective+0.098
Neg logit contributions
my-0.038
 next-0.035
 with-0.031
 over-0.030
 made-0.029
Token ze
Token ID41271
Feature activation+8.57e-02

Pos logit contributions
 conclusions+0.010
alks+0.010
ocol+0.009
oles+0.009
 Majesty+0.009
Neg logit contributions
my-0.013
 next-0.013
 tidy-0.012
 over-0.012
 packed-0.012
Tokenbr
Token ID1671
Feature activation+3.13e-02

Pos logit contributions
 conclusions+0.214
 perspective+0.202
 Majesty+0.198
 fingertips+0.198
alks+0.197
Neg logit contributions
my-0.090
 next-0.082
 over-0.073
 with-0.069
 packed-0.069
Tokenas
Token ID292
Feature activation+2.80e-02

Pos logit contributions
 conclusions+0.043
alks+0.040
oles+0.037
ocol+0.037
 Majesty+0.037
Neg logit contributions
my-0.065
 next-0.062
 tidy-0.060
 over-0.059
 packed-0.059
Token,
Token ID11
Feature activation+5.69e-03

Pos logit contributions
 conclusions+0.074
alks+0.070
 Majesty+0.068
 fingertips+0.068
ocol+0.068
Neg logit contributions
my-0.033
 over-0.027
 next-0.026
 with-0.023
 from-0.022
Token and
Token ID290
Feature activation+1.20e-02

Pos logit contributions
 conclusions+0.014
alks+0.014
ocol+0.013
oles+0.013
 Majesty+0.013
Neg logit contributions
my-0.005
 next-0.005
 tidy-0.005
 over-0.004
 packed-0.004
Token gir
Token ID37370
Feature activation+4.97e-02

Pos logit contributions
 conclusions+0.018
alks+0.018
ocol+0.017
oles+0.017
 perspective+0.016
Neg logit contributions
my-0.023
 next-0.022
 tidy-0.021
 over-0.021
 packed-0.020
Token lions
Token ID31420
Feature activation+4.53e-02

Pos logit contributions
 conclusions+0.062
alks+0.060
ocol+0.057
 fingertips+0.056
 Majesty+0.054
Neg logit contributions
my-0.081
 next-0.079
 over-0.075
 tidy-0.075
 packed-0.073
Token,
Token ID11
Feature activation+1.45e-02

Pos logit contributions
 conclusions+0.118
alks+0.111
 Majesty+0.109
ocol+0.107
 perspective+0.107
Neg logit contributions
my-0.049
 next-0.048
 with-0.042
 over-0.042
 tidy-0.038
Token monkeys
Token ID26697
Feature activation+4.13e-02

Pos logit contributions
 conclusions+0.019
alks+0.019
ocol+0.018
 Majesty+0.017
oles+0.017
Neg logit contributions
my-0.031
 next-0.030
 over-0.028
 tidy-0.028
 packed-0.028
Token,
Token ID11
Feature activation+1.09e-02

Pos logit contributions
 conclusions+0.110
alks+0.102
 Majesty+0.101
ocol+0.100
 fingertips+0.098
Neg logit contributions
my-0.046
 next-0.041
 over-0.037
 with-0.037
 clean-0.033
Token and
Token ID290
Feature activation+1.86e-02

Pos logit contributions
 conclusions+0.028
alks+0.028
ocol+0.027
oles+0.027
 Majesty+0.026
Neg logit contributions
my-0.010
 next-0.009
 over-0.008
 tidy-0.008
 packed-0.007
Token ze
Token ID41271
Feature activation+8.54e-02

Pos logit contributions
alks+0.029
 conclusions+0.028
ocol+0.027
oles+0.027
 fingertips+0.026
Neg logit contributions
my-0.035
 next-0.034
 tidy-0.032
 over-0.031
 packed-0.031
Tokenbr
Token ID1671
Feature activation+3.22e-02

Pos logit contributions
 conclusions+0.212
 perspective+0.206
 Majesty+0.204
 fingertips+0.202
 anth+0.196
Neg logit contributions
my-0.083
 next-0.079
 over-0.072
 with-0.070
 ride-0.067
Tokenas
Token ID292
Feature activation+2.03e-02

Pos logit contributions
 conclusions+0.049
alks+0.043
 Majesty+0.042
ocol+0.041
oles+0.041
Neg logit contributions
my-0.064
 next-0.060
 over-0.058
 tidy-0.057
 packed-0.057
Token.
Token ID13
Feature activation-6.28e-03

Pos logit contributions
 conclusions+0.050
 Majesty+0.048
 fingertips+0.047
alks+0.047
ocol+0.046
Neg logit contributions
my-0.025
 next-0.023
 over-0.023
 with-0.020
 from-0.019
Token But
Token ID887
Feature activation-2.00e-02

Pos logit contributions
my+0.008
 next+0.007
 tidy+0.006
 over+0.006
 clean+0.006
Neg logit contributions
 conclusions-0.014
alks-0.014
 Majesty-0.013
ocol-0.013
oles-0.013
Token their
Token ID511
Feature activation-1.94e-03

Pos logit contributions
my+0.024
 tidy+0.022
 next+0.021
pack+0.021
Close+0.021
Neg logit contributions
 conclusions-0.042
alks-0.040
ocol-0.039
oles-0.039
 Majesty-0.039
Token did
Token ID750
Feature activation+3.61e-03

Pos logit contributions
 conclusions+0.065
alks+0.063
 Majesty+0.062
ocol+0.062
 perspective+0.060
Neg logit contributions
my-0.021
 also-0.019
 next-0.018
 packed-0.018
 made-0.017
Token not
Token ID407
Feature activation+2.42e-05

Pos logit contributions
 conclusions+0.011
alks+0.011
ocol+0.010
oles+0.010
 perspective+0.010
Neg logit contributions
 next-0.002
my-0.002
 tidy-0.001
 over-0.001
 clean-0.001
Token see
Token ID766
Feature activation+3.03e-02

Pos logit contributions
 conclusions+0.000
alks+0.000
oles+0.000
ocol+0.000
 Majesty+0.000
Neg logit contributions
my-0.000
 next-0.000
 tidy-0.000
 over-0.000
 packed-0.000
Token any
Token ID597
Feature activation+1.34e-02

Pos logit contributions
 perspective+0.090
oles+0.089
 colony+0.089
alks+0.088
ocol+0.088
Neg logit contributions
 next-0.019
 a-0.018
 over-0.018
 made-0.013
 up-0.013
Token baby
Token ID5156
Feature activation+2.65e-02

Pos logit contributions
alks+0.028
 Majesty+0.028
 conclusions+0.027
oles+0.027
 colony+0.026
Neg logit contributions
my-0.019
 next-0.018
 over-0.016
 made-0.015
 also-0.015
Token ze
Token ID41271
Feature activation+8.49e-02

Pos logit contributions
 conclusions+0.044
alks+0.042
 perspective+0.040
 Majesty+0.039
 fingertips+0.038
Neg logit contributions
my-0.051
 next-0.050
 over-0.046
 made-0.044
 with-0.043
Tokenbr
Token ID1671
Feature activation+3.15e-02

Pos logit contributions
 conclusions+0.209
 perspective+0.206
 Majesty+0.203
 anth+0.199
alks+0.196
Neg logit contributions
 next-0.080
my-0.078
 over-0.071
go-0.067
 on-0.067
Tokenas
Token ID292
Feature activation+2.28e-02

Pos logit contributions
 conclusions+0.044
alks+0.041
oles+0.038
 Majesty+0.038
ocol+0.038
Neg logit contributions
my-0.065
 next-0.062
 tidy-0.060
 over-0.059
 packed-0.058
Token.
Token ID13
Feature activation+5.91e-03

Pos logit contributions
 Majesty+0.055
alks+0.055
 conclusions+0.054
 anth+0.051
 fingertips+0.051
Neg logit contributions
my-0.027
 over-0.026
 next-0.026
 with-0.021
 from-0.021
Token\n
Token ID198
Feature activation-2.48e-03

Pos logit contributions
 Majesty+0.016
ocol+0.016
 conclusions+0.016
 perspective+0.015
alks+0.015
Neg logit contributions
 next-0.004
my-0.004
 over-0.004
 with-0.004
 late-0.003
Token\n
Token ID198
Feature activation+1.26e-03

Pos logit contributions
my+0.002
 next+0.002
 over+0.002
 tidy+0.002
 clean+0.002
Neg logit contributions
 conclusions-0.006
ocol-0.006
alks-0.006
 Majesty-0.006
 fingertips-0.006
Token animals
Token ID4695
Feature activation+1.47e-02

Pos logit contributions
 conclusions+0.016
alks+0.016
 Majesty+0.016
 fingertips+0.015
oles+0.015
Neg logit contributions
my-0.011
 next-0.011
 tidy-0.010
 over-0.010
 clean-0.010
Token,
Token ID11
Feature activation+1.77e-03

Pos logit contributions
 conclusions+0.036
alks+0.035
 Majesty+0.033
ocol+0.033
oles+0.033
Neg logit contributions
my-0.016
 next-0.015
 over-0.013
 tidy-0.013
 clean-0.013
Token like
Token ID588
Feature activation+3.32e-02

Pos logit contributions
 conclusions+0.004
alks+0.004
oles+0.004
ocol+0.004
 predators+0.004
Neg logit contributions
my-0.002
 next-0.002
 tidy-0.002
 over-0.002
 clean-0.002
Token monkeys
Token ID26697
Feature activation+3.33e-02

Pos logit contributions
 conclusions+0.052
alks+0.051
ocol+0.048
 fingertips+0.046
 Majesty+0.046
Neg logit contributions
my-0.064
 next-0.062
 tidy-0.059
 over-0.059
 packed-0.058
Token,
Token ID11
Feature activation+6.85e-03

Pos logit contributions
 conclusions+0.082
alks+0.078
 Majesty+0.076
ocol+0.075
 perspective+0.074
Neg logit contributions
my-0.040
 next-0.037
 over-0.034
 tidy-0.032
 with-0.032
Token ze
Token ID41271
Feature activation+8.47e-02

Pos logit contributions
 conclusions+0.013
alks+0.013
ocol+0.013
 Majesty+0.012
oles+0.012
Neg logit contributions
my-0.011
 next-0.010
 over-0.009
 tidy-0.009
 packed-0.009
Tokenbr
Token ID1671
Feature activation+2.41e-02

Pos logit contributions
 conclusions+0.206
 perspective+0.198
 Majesty+0.196
 fingertips+0.191
 anth+0.186
Neg logit contributions
my-0.090
 next-0.085
 over-0.074
 packed-0.072
 with-0.072
Tokenas
Token ID292
Feature activation+3.12e-02

Pos logit contributions
 conclusions+0.033
alks+0.031
oles+0.029
ocol+0.029
 Majesty+0.029
Neg logit contributions
my-0.050
 next-0.047
 tidy-0.046
 over-0.045
 packed-0.045
Token and
Token ID290
Feature activation+2.28e-02

Pos logit contributions
 conclusions+0.083
alks+0.077
 Majesty+0.076
 fingertips+0.074
ocol+0.074
Neg logit contributions
my-0.034
 next-0.031
 over-0.029
 with-0.027
 clean-0.024
Token gir
Token ID37370
Feature activation+5.86e-02

Pos logit contributions
alks+0.036
 conclusions+0.034
ocol+0.034
oles+0.032
 fingertips+0.032
Neg logit contributions
my-0.044
 next-0.043
 over-0.039
 tidy-0.039
 packed-0.039
Tokenaff
Token ID2001
Feature activation+5.85e-02

Pos logit contributions
 conclusions+0.087
 perspective+0.073
 Majesty+0.072
 fingertips+0.072
 anth+0.070
Neg logit contributions
my-0.121
 next-0.110
 over-0.104
 with-0.102
 packed-0.101
Token lions
Token ID31420
Feature activation+4.43e-02

Pos logit contributions
 conclusions+0.065
alks+0.064
ocol+0.061
 fingertips+0.059
 Majesty+0.058
Neg logit contributions
my-0.081
 next-0.079
 tidy-0.075
 over-0.075
 packed-0.073
Token,
Token ID11
Feature activation+1.40e-02

Pos logit contributions
 conclusions+0.115
alks+0.109
 Majesty+0.107
 perspective+0.105
ocol+0.104
Neg logit contributions
my-0.049
 next-0.047
 with-0.041
 over-0.041
 tidy-0.039
Token monkeys
Token ID26697
Feature activation+4.11e-02

Pos logit contributions
 conclusions+0.019
alks+0.019
ocol+0.018
 Majesty+0.017
oles+0.017
Neg logit contributions
my-0.029
 next-0.028
 tidy-0.026
 over-0.026
 packed-0.026
Token,
Token ID11
Feature activation+1.15e-02

Pos logit contributions
 conclusions+0.109
alks+0.101
 Majesty+0.100
ocol+0.099
 perspective+0.098
Neg logit contributions
my-0.046
 next-0.041
 over-0.037
 with-0.037
 clean-0.034
Token and
Token ID290
Feature activation+2.07e-02

Pos logit contributions
 conclusions+0.030
alks+0.029
ocol+0.028
oles+0.028
 Majesty+0.028
Neg logit contributions
my-0.011
 next-0.010
 over-0.008
 tidy-0.008
 packed-0.008
Token ze
Token ID41271
Feature activation+8.43e-02

Pos logit contributions
alks+0.034
 conclusions+0.032
ocol+0.032
oles+0.031
 fingertips+0.030
Neg logit contributions
my-0.038
 next-0.037
 tidy-0.035
 over-0.034
 clean-0.034
Tokenbr
Token ID1671
Feature activation+2.94e-02

Pos logit contributions
 conclusions+0.210
 perspective+0.207
 Majesty+0.204
 fingertips+0.202
 anth+0.196
Neg logit contributions
my-0.079
 next-0.075
 with-0.067
 over-0.066
 ride-0.065
Tokenas
Token ID292
Feature activation+2.16e-02

Pos logit contributions
 conclusions+0.044
alks+0.040
 Majesty+0.038
ocol+0.037
oles+0.037
Neg logit contributions
my-0.059
 next-0.055
 tidy-0.053
 over-0.053
 packed-0.052
Token.
Token ID13
Feature activation-1.05e-03

Pos logit contributions
 conclusions+0.053
 Majesty+0.052
 fingertips+0.051
alks+0.051
joy+0.049
Neg logit contributions
my-0.026
 next-0.024
 over-0.023
 with-0.021
 from-0.020
Token But
Token ID887
Feature activation-1.49e-02

Pos logit contributions
my+0.001
 next+0.001
 over+0.001
 tidy+0.001
 clean+0.001
Neg logit contributions
 conclusions-0.003
alks-0.003
 Majesty-0.003
ocol-0.003
oles-0.003
Token their
Token ID511
Feature activation+1.03e-03

Pos logit contributions
my+0.017
 next+0.015
 tidy+0.015
 clean+0.014
 over+0.014
Neg logit contributions
 conclusions-0.033
alks-0.032
ocol-0.031
oles-0.031
 Majesty-0.031
Token Lily
Token ID20037
Feature activation+1.46e-02

Pos logit contributions
 next+0.004
my+0.004
 over+0.003
 packed+0.003
 with+0.003
Neg logit contributions
 conclusions-0.012
 Majesty-0.011
 perspective-0.011
alks-0.011
 fingertips-0.011
Token was
Token ID373
Feature activation+2.08e-02

Pos logit contributions
 conclusions+0.039
alks+0.038
ocol+0.037
oles+0.037
 Majesty+0.036
Neg logit contributions
my-0.012
 next-0.010
 tidy-0.009
 packed-0.009
 over-0.009
Token looking
Token ID2045
Feature activation+1.86e-02

Pos logit contributions
 conclusions+0.039
alks+0.037
oles+0.035
ocol+0.035
 perspective+0.034
Neg logit contributions
my-0.034
 next-0.032
 tidy-0.031
 over-0.030
 packed-0.030
Token at
Token ID379
Feature activation-7.27e-03

Pos logit contributions
 conclusions+0.047
alks+0.046
oles+0.044
 fingertips+0.044
ocol+0.043
Neg logit contributions
my-0.020
 next-0.018
 over-0.017
 tidy-0.016
 clean-0.016
Token the
Token ID262
Feature activation+5.11e-03

Pos logit contributions
my+0.005
 over+0.004
 next+0.004
 clean+0.003
 tidy+0.003
Neg logit contributions
 conclusions-0.021
oles-0.021
alks-0.020
ocol-0.020
 Majesty-0.020
Token ze
Token ID41271
Feature activation+8.35e-02

Pos logit contributions
 conclusions+0.011
alks+0.011
oles+0.011
ocol+0.010
 Majesty+0.010
Neg logit contributions
my-0.007
 next-0.006
 tidy-0.006
 over-0.006
 packed-0.005
Tokenbr
Token ID1671
Feature activation+2.08e-02

Pos logit contributions
 conclusions+0.211
 Majesty+0.210
 perspective+0.202
alks+0.198
 anth+0.198
Neg logit contributions
 next-0.080
my-0.079
 with-0.071
 made-0.070
 over-0.070
Tokenas
Token ID292
Feature activation+3.29e-02

Pos logit contributions
alks+0.026
 conclusions+0.025
oles+0.024
ocol+0.024
 perspective+0.022
Neg logit contributions
my-0.044
 next-0.044
 tidy-0.043
 clean-0.042
 over-0.041
Token,
Token ID11
Feature activation-2.95e-04

Pos logit contributions
 conclusions+0.088
 Majesty+0.088
alks+0.087
ocol+0.084
 anth+0.082
Neg logit contributions
my-0.034
 next-0.032
 over-0.031
 with-0.028
 on-0.026
Token a
Token ID257
Feature activation+4.11e-02

Pos logit contributions
my+0.000
 next+0.000
 tidy+0.000
 over+0.000
 clean+0.000
Neg logit contributions
alks-0.001
 conclusions-0.001
ocol-0.001
 Majesty-0.001
 perspective-0.001
Token rude
Token ID22066
Feature activation+3.28e-02

Pos logit contributions
 conclusions+0.120
ocol+0.117
alks+0.115
ighters+0.112
ants+0.111
Neg logit contributions
 tidy-0.029
 clean-0.027
 next-0.027
my-0.026
 pack-0.023
Token!
Token ID0
Feature activation-1.86e-02

Pos logit contributions
my+0.019
 next+0.018
 over+0.016
 tidy+0.016
 clean+0.015
Neg logit contributions
 conclusions-0.041
alks-0.039
ocol-0.038
oles-0.037
 Majesty-0.037
Token He
Token ID679
Feature activation-2.19e-02

Pos logit contributions
my+0.023
 next+0.022
 tidy+0.021
 over+0.020
 packed+0.020
Neg logit contributions
 conclusions-0.040
alks-0.039
oles-0.037
ocol-0.037
 Majesty-0.037
Token smiled
Token ID13541
Feature activation-2.96e-02

Pos logit contributions
my+0.024
 next+0.022
 tidy+0.020
 over+0.019
 clean+0.019
Neg logit contributions
alks-0.049
 conclusions-0.048
oles-0.047
 Majesty-0.046
 fingertips-0.046
Token and
Token ID290
Feature activation-1.39e-02

Pos logit contributions
my+0.035
 tidy+0.030
pack+0.030
 next+0.029
Close+0.029
Neg logit contributions
joy-0.062
alks-0.061
 Majesty-0.060
 conclusions-0.059
oles-0.058
Token gave
Token ID2921
Feature activation-2.43e-02

Pos logit contributions
my+0.017
 next+0.015
 tidy+0.013
pack+0.013
 over+0.012
Neg logit contributions
alks-0.030
 Majesty-0.029
oles-0.028
 conclusions-0.028
 fingertips-0.028
Token Mom
Token ID11254
Feature activation-7.52e-02

Pos logit contributions
my+0.026
 next+0.024
 tidy+0.023
 packed+0.022
 clean+0.021
Neg logit contributions
 conclusions-0.055
alks-0.054
ocol-0.052
 Majesty-0.051
oles-0.051
Tokenmy
Token ID1820
Feature activation-5.52e-02

Pos logit contributions
Close+0.042
 fit+0.033
pack+0.033
 tidy+0.033
 next+0.032
Neg logit contributions
joy-0.178
alks-0.174
 conclusions-0.173
 pad-0.171
lee-0.168
Token and
Token ID290
Feature activation-3.60e-02

Pos logit contributions
my+0.048
 next+0.048
Close+0.048
 tidy+0.046
pack+0.045
Neg logit contributions
alks-0.115
 conclusions-0.113
joy-0.110
umps-0.107
 colony-0.106
Token Daddy
Token ID26926
Feature activation-4.14e-02

Pos logit contributions
my+0.068
 next+0.064
 tidy+0.062
pack+0.061
 over+0.060
Neg logit contributions
alks-0.047
 Majesty-0.043
 fingertips-0.043
joy-0.043
 conclusions-0.041
Token big
Token ID1263
Feature activation-2.30e-02

Pos logit contributions
my+0.039
 next+0.038
 tidy+0.037
Close+0.036
pack+0.032
Neg logit contributions
alks-0.087
 conclusions-0.084
joy-0.082
umps-0.082
oles-0.082
Token hugs
Token ID40657
Feature activation-1.79e-02

Pos logit contributions
my+0.026
 next+0.023
 tidy+0.022
 over+0.021
 clean+0.021
Neg logit contributions
 conclusions-0.054
alks-0.052
oles-0.050
ocol-0.050
 Majesty-0.049
Token a
Token ID257
Feature activation+3.02e-02

Pos logit contributions
my+0.014
 next+0.013
 tidy+0.012
 over+0.011
 clean+0.011
Neg logit contributions
 conclusions-0.041
alks-0.039
oles-0.039
 Majesty-0.038
ocol-0.038
Token desk
Token ID6915
Feature activation-1.45e-03

Pos logit contributions
alks+0.070
 conclusions+0.067
oles+0.067
ocol+0.065
 fingertips+0.064
Neg logit contributions
 tidy-0.038
 next-0.038
my-0.037
 clean-0.036
 pack-0.035
Token.
Token ID13
Feature activation-1.32e-02

Pos logit contributions
my+0.002
 next+0.002
 tidy+0.002
 over+0.002
 packed+0.002
Neg logit contributions
 conclusions-0.003
alks-0.003
oles-0.003
ocol-0.003
 Majesty-0.003
Token She
Token ID1375
Feature activation-1.95e-02

Pos logit contributions
my+0.012
 next+0.010
 over+0.010
 tidy+0.010
 clean+0.010
Neg logit contributions
 conclusions-0.035
 perspective-0.033
 Majesty-0.033
oles-0.033
alks-0.033
Token gives
Token ID3607
Feature activation-1.85e-02

Pos logit contributions
my+0.023
 next+0.021
 tidy+0.021
 clean+0.019
 over+0.019
Neg logit contributions
 conclusions-0.041
alks-0.040
 Majesty-0.039
ocol-0.039
oles-0.038
Token Mom
Token ID11254
Feature activation-7.25e-02

Pos logit contributions
my+0.027
 next+0.025
 over+0.024
 tidy+0.023
 clean+0.022
Neg logit contributions
 conclusions-0.039
alks-0.038
oles-0.037
 Majesty-0.036
 perspective-0.036
Token and
Token ID290
Feature activation-2.89e-02

Pos logit contributions
 next+0.079
 tidy+0.079
 fit+0.079
Close+0.078
pack+0.078
Neg logit contributions
alks-0.123
 conclusions-0.122
umps-0.119
joy-0.115
oles-0.115
Token Dad
Token ID17415
Feature activation-4.14e-02

Pos logit contributions
my+0.047
 next+0.044
 tidy+0.042
 over+0.041
 packed+0.040
Neg logit contributions
 conclusions-0.052
alks-0.051
oles-0.048
ocol-0.047
 Majesty-0.047
Token a
Token ID257
Feature activation-7.56e-03

Pos logit contributions
my+0.036
 next+0.035
 tidy+0.033
Close+0.031
 over+0.030
Neg logit contributions
 conclusions-0.090
alks-0.089
ocol-0.085
oles-0.085
umps-0.084
Token key
Token ID1994
Feature activation-3.73e-02

Pos logit contributions
my+0.010
 next+0.009
 over+0.008
 tidy+0.008
 packed+0.008
Neg logit contributions
 conclusions-0.016
alks-0.015
oles-0.015
ocol-0.015
 Majesty-0.014
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
my+0.053
 tidy+0.051
Close+0.047
 clean+0.046
 next+0.045
Neg logit contributions
alks-0.068
oles-0.064
 conclusions-0.062
ocol-0.062
 ago-0.060
Token mum
Token ID25682
Feature activation-1.16e-02

Pos logit contributions
 next+0.015
 tidy+0.014
 clean+0.014
my+0.014
 pack+0.013
Neg logit contributions
 conclusions-0.022
ocol-0.020
alks-0.020
 predators-0.020
oles-0.019
Token asked
Token ID1965
Feature activation+2.74e-02

Pos logit contributions
my+0.010
 next+0.010
 tidy+0.010
Close+0.009
 clean+0.009
Neg logit contributions
alks-0.025
oles-0.025
 colony-0.024
 conclusions-0.024
joy-0.024
Token him
Token ID683
Feature activation-5.73e-03

Pos logit contributions
 conclusions+0.067
alks+0.067
 Majesty+0.064
oles+0.063
 fingertips+0.062
Neg logit contributions
my-0.037
 next-0.033
 over-0.030
 clean-0.028
 with-0.028
Token to
Token ID284
Feature activation-1.04e-02

Pos logit contributions
my+0.006
 next+0.005
 tidy+0.005
 over+0.005
 clean+0.005
Neg logit contributions
 conclusions-0.014
alks-0.014
ocol-0.013
oles-0.013
 perspective-0.013
Token put
Token ID1234
Feature activation-3.71e-02

Pos logit contributions
my+0.015
 packed+0.013
 next+0.013
Close+0.013
 over+0.013
Neg logit contributions
 conclusions-0.018
 fingertips-0.017
ocol-0.017
alks-0.017
 Majesty-0.016
Token all
Token ID477
Feature activation-7.12e-02

Pos logit contributions
my+0.035
 packed+0.030
 next+0.030
 tidy+0.029
Close+0.029
Neg logit contributions
alks-0.085
ocol-0.085
 conclusions-0.083
joy-0.078
 predators-0.077
Token the
Token ID262
Feature activation-1.55e-02

Pos logit contributions
mar+0.096
 packed+0.093
Close+0.087
 closed+0.087
 next+0.086
Neg logit contributions
ocol-0.123
alks-0.123
umps-0.106
 conclusions-0.103
bers-0.095
Token toys
Token ID14958
Feature activation-3.93e-02

Pos logit contributions
my+0.023
 next+0.021
 over+0.021
 packed+0.020
 tidy+0.020
Neg logit contributions
 conclusions-0.029
alks-0.028
ocol-0.027
oles-0.026
 Majesty-0.026
Token in
Token ID287
Feature activation-1.61e-02

Pos logit contributions
my+0.040
Close+0.038
 packed+0.033
 pack+0.032
 tidy+0.031
Neg logit contributions
alks-0.082
 conclusions-0.080
ocol-0.077
oles-0.077
joy-0.074
Token the
Token ID262
Feature activation-3.21e-03

Pos logit contributions
my+0.015
 next+0.013
 tidy+0.012
 over+0.011
 packed+0.011
Neg logit contributions
 conclusions-0.040
alks-0.039
ocol-0.038
oles-0.037
 perspective-0.037
Token toy
Token ID13373
Feature activation+1.60e-02

Pos logit contributions
my+0.004
 next+0.004
 tidy+0.004
 over+0.004
 packed+0.004
Neg logit contributions
 conclusions-0.007
alks-0.006
ocol-0.006
 Majesty-0.006
oles-0.006
Token remind
Token ID7101
Feature activation-1.15e-02

Pos logit contributions
my+0.020
 next+0.018
 packed+0.017
 over+0.016
Close+0.016
Neg logit contributions
 conclusions-0.032
alks-0.030
ocol-0.028
oles-0.028
 Majesty-0.028
Token me
Token ID502
Feature activation-3.02e-02

Pos logit contributions
my+0.017
 next+0.015
 tidy+0.015
 packed+0.015
 over+0.015
Neg logit contributions
 conclusions-0.022
alks-0.021
ocol-0.020
 Majesty-0.020
oles-0.020
Token to
Token ID284
Feature activation-2.71e-02

Pos logit contributions
my+0.030
 next+0.026
 tidy+0.024
 packed+0.024
Close+0.023
Neg logit contributions
 conclusions-0.072
alks-0.069
oles-0.066
ocol-0.066
 Majesty-0.066
Token bring
Token ID2222
Feature activation-3.46e-02

Pos logit contributions
my+0.026
 next+0.022
Close+0.020
 over+0.020
 packed+0.018
Neg logit contributions
 conclusions-0.064
alks-0.060
 Majesty-0.059
ocol-0.059
 perspective-0.058
Token our
Token ID674
Feature activation-1.31e-03

Pos logit contributions
my+0.033
 next+0.028
 tidy+0.027
 packed+0.026
 over+0.025
Neg logit contributions
 conclusions-0.086
alks-0.084
ocol-0.080
 Majesty-0.080
oles-0.079
Token bucket
Token ID19236
Feature activation-6.99e-02

Pos logit contributions
my+0.002
 next+0.002
 tidy+0.002
 clean+0.002
 packed+0.002
Neg logit contributions
 conclusions-0.002
alks-0.002
oles-0.002
ocol-0.002
 Majesty-0.002
Token back
Token ID736
Feature activation-4.53e-02

Pos logit contributions
Close+0.079
my+0.076
 packed+0.070
 also+0.068
That+0.065
Neg logit contributions
alks-0.135
 conclusions-0.130
ocol-0.119
oles-0.118
 colony-0.117
Token to
Token ID284
Feature activation-4.12e-03

Pos logit contributions
my+0.046
 tidy+0.041
 next+0.040
Close+0.040
 packed+0.039
Neg logit contributions
 conclusions-0.099
alks-0.096
 perspective-0.092
 Majesty-0.090
oles-0.090
Token the
Token ID262
Feature activation-2.61e-03

Pos logit contributions
my+0.003
 next+0.003
 tidy+0.003
 clean+0.003
 over+0.003
Neg logit contributions
alks-0.011
 conclusions-0.011
oles-0.011
ocol-0.010
 perspective-0.010
Token car
Token ID1097
Feature activation-1.98e-02

Pos logit contributions
my+0.005
 next+0.005
 tidy+0.005
 clean+0.005
 packed+0.005
Neg logit contributions
 conclusions-0.004
alks-0.004
oles-0.004
ocol-0.004
 Majesty-0.003
Token?"
Token ID1701
Feature activation+1.06e-02

Pos logit contributions
my+0.018
 next+0.016
 tidy+0.015
 packed+0.014
 over+0.014
Neg logit contributions
 conclusions-0.049
alks-0.047
oles-0.045
 Majesty-0.045
ocol-0.045
Token\n
Token ID198
Feature activation-4.38e-03

Pos logit contributions
my+0.023
 next+0.021
 tidy+0.020
 over+0.020
 clean+0.019
Neg logit contributions
 conclusions-0.046
alks-0.045
ocol-0.043
oles-0.043
 Majesty-0.043
TokenHer
Token ID9360
Feature activation-3.15e-02

Pos logit contributions
my+0.006
 over+0.005
 next+0.005
 packed+0.005
 tidy+0.005
Neg logit contributions
 conclusions-0.009
 perspective-0.008
oles-0.008
 anth-0.007
alks-0.007
Token mom
Token ID1995
Feature activation-1.46e-03

Pos logit contributions
my+0.052
 over+0.048
Close+0.047
 packed+0.047
pack+0.046
Neg logit contributions
alks-0.049
oles-0.048
 conclusions-0.048
 Majesty-0.047
 fingertips-0.047
Token brought
Token ID3181
Feature activation-2.17e-02

Pos logit contributions
 next+0.001
 tidy+0.001
my+0.001
 pack+0.001
Close+0.001
Neg logit contributions
alks-0.003
oles-0.003
 conclusions-0.003
 colony-0.003
 Majesty-0.003
Token out
Token ID503
Feature activation-4.94e-02

Pos logit contributions
my+0.020
 next+0.019
 tidy+0.017
 packed+0.017
 pack+0.016
Neg logit contributions
 conclusions-0.051
alks-0.050
ocol-0.049
oles-0.047
 predators-0.047
Token all
Token ID477
Feature activation-6.98e-02

Pos logit contributions
my+0.049
 next+0.044
pack+0.041
 tidy+0.041
 packed+0.039
Neg logit contributions
alks-0.107
 conclusions-0.105
ocol-0.103
joy-0.100
umps-0.097
Token the
Token ID262
Feature activation-3.34e-02

Pos logit contributions
mar+0.105
 cost+0.103
 packed+0.099
 next+0.099
man+0.098
Neg logit contributions
alks-0.106
ocol-0.100
umps-0.092
 conclusions-0.087
 colony-0.085
Token ingredients
Token ID9391
Feature activation-3.82e-02

Pos logit contributions
my+0.053
 over+0.052
Close+0.049
 packed+0.048
 next+0.047
Neg logit contributions
ocol-0.050
 conclusions-0.050
alks-0.048
oles-0.046
 Majesty-0.045
Token and
Token ID290
Feature activation-6.62e-03

Pos logit contributions
my+0.042
 next+0.037
 tidy+0.037
 packed+0.035
 pack+0.034
Neg logit contributions
alks-0.085
 conclusions-0.083
oles-0.083
ocol-0.079
 fingertips-0.077
Token they
Token ID484
Feature activation-1.34e-02

Pos logit contributions
my+0.006
 next+0.005
 over+0.004
 tidy+0.004
pack+0.004
Neg logit contributions
alks-0.015
 conclusions-0.015
oles-0.014
ocol-0.014
 fingertips-0.014
Token started
Token ID2067
Feature activation-8.83e-03

Pos logit contributions
my+0.011
 next+0.010
 tidy+0.009
 over+0.008
 clean+0.008
Neg logit contributions
alks-0.033
 conclusions-0.033
oles-0.032
ocol-0.031
 fingertips-0.031
Token excited
Token ID6568
Feature activation-1.65e-02

Pos logit contributions
my+0.013
 next+0.011
 over+0.010
 packed+0.010
pack+0.010
Neg logit contributions
oles-0.013
alks-0.013
 fingertips-0.012
 Majesty-0.012
 conclusions-0.012
Token!
Token ID0
Feature activation+2.22e-03

Pos logit contributions
my+0.023
 next+0.022
 tidy+0.021
 over+0.020
 clean+0.020
Neg logit contributions
 conclusions-0.033
alks-0.032
oles-0.031
ocol-0.030
 Majesty-0.030
Token She
Token ID1375
Feature activation-1.91e-02

Pos logit contributions
 conclusions+0.006
ocol+0.006
alks+0.006
 predators+0.006
 Majesty+0.006
Neg logit contributions
my-0.002
 over-0.001
 next-0.001
 tidy-0.001
 clean-0.001
Token takes
Token ID2753
Feature activation-1.18e-02

Pos logit contributions
my+0.021
 next+0.020
 tidy+0.018
 over+0.017
 clean+0.016
Neg logit contributions
alks-0.041
 conclusions-0.040
 Majesty-0.040
oles-0.039
 fingertips-0.039
Token off
Token ID572
Feature activation-2.82e-02

Pos logit contributions
my+0.013
 next+0.012
 tidy+0.011
 packed+0.010
 clean+0.010
Neg logit contributions
alks-0.027
 conclusions-0.026
ocol-0.026
oles-0.025
 fingertips-0.025
Token all
Token ID477
Feature activation-6.97e-02

Pos logit contributions
my+0.032
 next+0.028
 tidy+0.027
 packed+0.025
 pack+0.024
Neg logit contributions
alks-0.063
 conclusions-0.059
joy-0.058
oles-0.057
ocol-0.057
Token the
Token ID262
Feature activation-2.84e-02

Pos logit contributions
mar+0.101
 next+0.100
 cost+0.099
tr+0.098
 pack+0.097
Neg logit contributions
alks-0.111
ocol-0.103
lee-0.096
 angles-0.094
umps-0.093
Token paper
Token ID3348
Feature activation-3.08e-02

Pos logit contributions
my+0.037
 over+0.036
Close+0.035
 also+0.035
burn+0.035
Neg logit contributions
ocol-0.051
 conclusions-0.046
alks-0.045
 fingertips-0.044
 sudden-0.043
Token,
Token ID11
Feature activation-8.91e-03

Pos logit contributions
my+0.031
 next+0.028
 tidy+0.026
 over+0.025
 packed+0.025
Neg logit contributions
 conclusions-0.073
alks-0.073
oles-0.070
ocol-0.068
 Majesty-0.068
Token and
Token ID290
Feature activation-3.91e-03

Pos logit contributions
my+0.010
 next+0.009
pack+0.009
Close+0.009
 tidy+0.008
Neg logit contributions
alks-0.018
 fingertips-0.017
 conclusions-0.017
ocol-0.017
joy-0.017
Token she
Token ID673
Feature activation+7.50e-04

Pos logit contributions
my+0.004
 next+0.004
pack+0.003
 tidy+0.003
 packed+0.003
Neg logit contributions
 conclusions-0.008
alks-0.008
 Majesty-0.008
 fingertips-0.008
 predators-0.008
Token present
Token ID1944
Feature activation-2.47e-02

Pos logit contributions
my+0.009
 next+0.008
 tidy+0.007
 over+0.007
 packed+0.007
Neg logit contributions
 conclusions-0.017
alks-0.016
oles-0.016
ocol-0.016
 Majesty-0.016
Token for
Token ID329
Feature activation+3.65e-03

Pos logit contributions
my+0.036
 tidy+0.033
 next+0.033
Close+0.032
 packed+0.031
Neg logit contributions
alks-0.043
 conclusions-0.043
oles-0.041
 fingertips-0.040
ocol-0.040
Token their
Token ID511
Feature activation-7.11e-03

Pos logit contributions
 conclusions+0.010
alks+0.010
oles+0.010
 perspective+0.009
 Majesty+0.009
Neg logit contributions
 next-0.003
my-0.003
 tidy-0.002
 clean-0.002
 organized-0.002
Token mom
Token ID1995
Feature activation-3.06e-02

Pos logit contributions
 next+0.009
my+0.009
 tidy+0.008
 clean+0.008
 packed+0.007
Neg logit contributions
 conclusions-0.016
alks-0.016
ocol-0.015
oles-0.015
 Majesty-0.014
Token.
Token ID13
Feature activation-1.70e-02

Pos logit contributions
Close+0.037
 next+0.037
 tidy+0.036
my+0.035
pack+0.034
Neg logit contributions
alks-0.055
 conclusions-0.054
oles-0.053
 pad-0.052
ocol-0.051
Token At
Token ID1629
Feature activation-6.96e-02

Pos logit contributions
my+0.023
 next+0.021
 tidy+0.020
 over+0.020
 clean+0.020
Neg logit contributions
 conclusions-0.037
alks-0.035
ocol-0.034
oles-0.034
 Majesty-0.033
Token the
Token ID262
Feature activation-1.48e-02

Pos logit contributions
my+0.094
 tidy+0.091
Close+0.084
go+0.084
 included+0.082
Neg logit contributions
alks-0.124
joy-0.120
 conclusions-0.117
 heights-0.114
 perspective-0.114
Token store
Token ID3650
Feature activation-2.58e-02

Pos logit contributions
my+0.021
 over+0.018
 next+0.018
 tidy+0.018
 packed+0.018
Neg logit contributions
 conclusions-0.027
alks-0.025
 colony-0.025
 Majesty-0.025
 anth-0.025
Token,
Token ID11
Feature activation-2.51e-02

Pos logit contributions
my+0.042
 tidy+0.039
 next+0.039
Close+0.038
 over+0.038
Neg logit contributions
alks-0.041
oles-0.040
 conclusions-0.039
joy-0.038
 Majesty-0.037
Token they
Token ID484
Feature activation-1.20e-02

Pos logit contributions
my+0.033
 next+0.030
 tidy+0.029
 packed+0.028
 clean+0.027
Neg logit contributions
 conclusions-0.051
alks-0.050
oles-0.049
 perspective-0.047
 Majesty-0.047
Token looked
Token ID3114
Feature activation+4.91e-03

Pos logit contributions
my+0.011
 next+0.010
 tidy+0.009
 over+0.008
 clean+0.008
Neg logit contributions
 conclusions-0.030
alks-0.029
oles-0.028
ocol-0.028
 Majesty-0.028
Token and
Token ID290
Feature activation-2.39e-02

Pos logit contributions
my+0.024
 next+0.022
 tidy+0.021
 over+0.020
 packed+0.020
Neg logit contributions
 conclusions-0.038
alks-0.037
oles-0.035
ocol-0.034
 Majesty-0.034
Token helped
Token ID4193
Feature activation+2.62e-03

Pos logit contributions
my+0.032
 next+0.026
 tidy+0.026
Close+0.025
 over+0.025
Neg logit contributions
alks-0.046
oles-0.045
 fingertips-0.044
 conclusions-0.043
 Majesty-0.043
Token her
Token ID607
Feature activation-2.19e-02

Pos logit contributions
 conclusions+0.006
alks+0.005
oles+0.005
ocol+0.005
 fingertips+0.005
Neg logit contributions
my-0.003
 next-0.003
 packed-0.003
Close-0.002
 over-0.002
Token mom
Token ID1995
Feature activation-3.56e-02

Pos logit contributions
my+0.036
Close+0.032
 packed+0.032
 next+0.031
 late+0.030
Neg logit contributions
alks-0.034
 fingertips-0.034
 Majesty-0.032
 anth-0.032
 conclusions-0.031
Token put
Token ID1234
Feature activation-3.81e-02

Pos logit contributions
Close+0.032
Listen+0.031
Thank+0.030
Sleep+0.030
Good+0.030
Neg logit contributions
umps-0.069
alks-0.068
 fingertips-0.067
pler-0.065
 toes-0.065
Token all
Token ID477
Feature activation-6.90e-02

Pos logit contributions
my+0.044
 next+0.038
 packed+0.038
 tidy+0.037
Close+0.036
Neg logit contributions
alks-0.083
ocol-0.081
 conclusions-0.080
 predators-0.074
joy-0.073
Token the
Token ID262
Feature activation-1.74e-02

Pos logit contributions
mar+0.109
 packed+0.102
my+0.101
man+0.100
 cost+0.100
Neg logit contributions
alks-0.105
ocol-0.101
umps-0.090
 webs-0.084
ighters-0.083
Token paper
Token ID3348
Feature activation-3.24e-02

Pos logit contributions
my+0.030
 next+0.027
 over+0.027
 packed+0.027
 tidy+0.026
Neg logit contributions
 conclusions-0.028
alks-0.027
ocol-0.026
 perspective-0.025
oles-0.025
Token in
Token ID287
Feature activation-2.17e-02

Pos logit contributions
my+0.031
Close+0.025
 next+0.025
 tidy+0.024
 packed+0.024
Neg logit contributions
alks-0.077
 conclusions-0.074
oles-0.073
ocol-0.071
 Majesty-0.071
Token a
Token ID257
Feature activation+2.79e-02

Pos logit contributions
my+0.020
 next+0.018
 tidy+0.017
 over+0.016
 packed+0.016
Neg logit contributions
 conclusions-0.054
alks-0.053
oles-0.051
ocol-0.051
 Majesty-0.050
Token box
Token ID3091
Feature activation-1.59e-02

Pos logit contributions
 conclusions+0.074
oles+0.073
alks+0.073
ocol+0.070
 Majesty+0.069
Neg logit contributions
 tidy-0.028
 clean-0.027
 next-0.025
 organized-0.024
my-0.024
Token!
Token ID0
Feature activation+6.90e-03

Pos logit contributions
 conclusions+0.019
alks+0.018
ocol+0.017
oles+0.017
 Majesty+0.017
Neg logit contributions
my-0.015
 next-0.014
 tidy-0.014
 clean-0.013
 over-0.013
Token Where
Token ID6350
Feature activation+2.15e-02

Pos logit contributions
 conclusions+0.013
alks+0.013
oles+0.012
ocol+0.012
 Majesty+0.012
Neg logit contributions
my-0.010
 next-0.010
 tidy-0.010
 clean-0.009
 packed-0.009
Token did
Token ID750
Feature activation+7.46e-03

Pos logit contributions
oles+0.035
umps+0.035
orry+0.033
 conclusions+0.032
 Majesty+0.032
Neg logit contributions
 next-0.031
 over-0.029
my-0.028
 late-0.028
Close-0.027
Token you
Token ID345
Feature activation-1.32e-02

Pos logit contributions
 conclusions+0.016
alks+0.015
ocol+0.014
oles+0.014
 Majesty+0.014
Neg logit contributions
my-0.010
 next-0.009
 over-0.009
 tidy-0.008
 packed-0.008
Token get
Token ID651
Feature activation-2.96e-02

Pos logit contributions
my+0.015
 next+0.014
 tidy+0.013
 packed+0.013
 over+0.013
Neg logit contributions
 conclusions-0.030
alks-0.028
oles-0.027
ocol-0.027
 Majesty-0.026
Token it
Token ID340
Feature activation-6.78e-02

Pos logit contributions
my+0.035
 next+0.032
 tidy+0.031
 packed+0.030
 pack+0.029
Neg logit contributions
alks-0.063
 conclusions-0.063
ocol-0.060
oles-0.058
 Majesty-0.058
Token?"
Token ID1701
Feature activation+4.07e-03

Pos logit contributions
Close+0.096
my+0.089
Yes+0.088
Hold+0.084
 packed+0.083
Neg logit contributions
alks-0.099
 conclusions-0.090
 anymore-0.090
 Majesty-0.089
cient-0.086
Token
Token ID198
Feature activation+8.46e-04

Pos logit contributions
 conclusions+0.010
 predators+0.010
ocol+0.010
 Majesty+0.010
alks+0.010
Neg logit contributions
my-0.005
 next-0.004
 over-0.004
 clean-0.004
 tidy-0.004
Token
Token ID198
Feature activation+1.01e-02

Pos logit contributions
 conclusions+0.002
alks+0.002
ocol+0.002
 Majesty+0.002
 predators+0.002
Neg logit contributions
my-0.001
 next-0.001
 tidy-0.001
 over-0.001
 clean-0.001
TokenJack
Token ID14295
Feature activation-3.32e-03

Pos logit contributions
 conclusions+0.020
alks+0.019
oles+0.018
ocol+0.018
 Majesty+0.018
Neg logit contributions
my-0.015
 next-0.014
 tidy-0.013
 over-0.013
 packed-0.013
Token told
Token ID1297
Feature activation-1.14e-02

Pos logit contributions
my+0.004
 next+0.003
 tidy+0.003
 clean+0.003
 over+0.003
Neg logit contributions
alks-0.007
 conclusions-0.007
oles-0.007
 Majesty-0.007
 fingertips-0.007
Token a
Token ID257
Feature activation-3.20e-03

Pos logit contributions
my+0.026
 next+0.022
 tidy+0.021
 packed+0.020
 over+0.019
Neg logit contributions
 conclusions-0.077
alks-0.076
ocol-0.073
oles-0.073
 Majesty-0.072
Token cake
Token ID12187
Feature activation-3.30e-02

Pos logit contributions
 tidy+0.003
 next+0.003
my+0.003
 clean+0.003
 organized+0.003
Neg logit contributions
 conclusions-0.008
alks-0.008
ocol-0.007
oles-0.007
 predators-0.007
Token.
Token ID13
Feature activation-5.04e-03

Pos logit contributions
my+0.043
 packed+0.037
 tidy+0.036
Close+0.036
 next+0.034
Neg logit contributions
alks-0.063
 conclusions-0.061
oles-0.060
ocol-0.057
 colony-0.057
Token Mary
Token ID5335
Feature activation-6.98e-03

Pos logit contributions
my+0.006
 next+0.006
 tidy+0.005
 over+0.005
 clean+0.005
Neg logit contributions
 conclusions-0.011
alks-0.011
ocol-0.010
oles-0.010
 Majesty-0.010
Token put
Token ID1234
Feature activation-3.61e-02

Pos logit contributions
my+0.006
 next+0.005
 tidy+0.005
 over+0.005
 clean+0.005
Neg logit contributions
 conclusions-0.018
alks-0.017
oles-0.017
ocol-0.017
 Majesty-0.017
Token all
Token ID477
Feature activation-6.75e-02

Pos logit contributions
my+0.036
 tidy+0.032
 next+0.032
 packed+0.031
pack+0.029
Neg logit contributions
alks-0.083
 conclusions-0.081
ocol-0.081
oles-0.077
 fingertips-0.075
Token the
Token ID262
Feature activation-1.86e-02

Pos logit contributions
 packed+0.093
 pack+0.093
mar+0.088
 next+0.087
my+0.086
Neg logit contributions
alks-0.115
ocol-0.109
umps-0.102
ir-0.091
 conclusions-0.090
Token y
Token ID331
Feature activation+1.27e-02

Pos logit contributions
my+0.029
 next+0.026
 over+0.025
 tidy+0.025
 packed+0.025
Neg logit contributions
 conclusions-0.034
alks-0.033
ocol-0.032
oles-0.031
 Majesty-0.031
Tokenummy
Token ID13513
Feature activation-1.63e-02

Pos logit contributions
 conclusions+0.020
alks+0.020
ocol+0.019
oles+0.019
 Majesty+0.018
Neg logit contributions
my-0.023
 next-0.022
 tidy-0.021
 over-0.021
 packed-0.021
Token ingredients
Token ID9391
Feature activation-4.29e-02

Pos logit contributions
my+0.025
 next+0.023
 packed+0.023
 tidy+0.022
pack+0.022
Neg logit contributions
ocol-0.028
 conclusions-0.027
alks-0.027
oles-0.026
 fingertips-0.025
Token in
Token ID287
Feature activation-1.19e-02

Pos logit contributions
my+0.044
Close+0.039
 tidy+0.038
 next+0.037
 packed+0.037
Neg logit contributions
alks-0.096
oles-0.093
 conclusions-0.090
ocol-0.088
 fingertips-0.086
Token to
Token ID284
Feature activation-5.77e-03

Pos logit contributions
my+0.027
Close+0.024
 tidy+0.023
 next+0.022
 cost+0.022
Neg logit contributions
alks-0.034
joy-0.032
 colony-0.030
 conclusions-0.030
 anth-0.029
Token the
Token ID262
Feature activation-7.73e-03

Pos logit contributions
my+0.005
 next+0.005
 tidy+0.005
 over+0.004
 packed+0.004
Neg logit contributions
 conclusions-0.014
alks-0.014
oles-0.013
ocol-0.013
 Majesty-0.013
Token store
Token ID3650
Feature activation-2.88e-02

Pos logit contributions
my+0.012
 next+0.011
 over+0.011
 tidy+0.011
 packed+0.010
Neg logit contributions
 conclusions-0.014
alks-0.013
 colony-0.012
 Majesty-0.012
 anth-0.012
Token to
Token ID284
Feature activation-1.09e-02

Pos logit contributions
my+0.030
 tidy+0.027
 next+0.026
Close+0.025
 clean+0.024
Neg logit contributions
alks-0.063
 conclusions-0.062
joy-0.061
oles-0.060
 perspective-0.060
Token buy
Token ID2822
Feature activation-3.00e-02

Pos logit contributions
my+0.014
 next+0.012
man+0.012
go+0.011
eddy+0.011
Neg logit contributions
 conclusions-0.022
alks-0.020
ocol-0.019
 Majesty-0.019
 colony-0.019
Token all
Token ID477
Feature activation-6.70e-02

Pos logit contributions
my+0.027
 next+0.023
 tidy+0.022
 over+0.021
 packed+0.020
Neg logit contributions
 conclusions-0.076
alks-0.074
oles-0.071
ocol-0.071
 Majesty-0.070
Token the
Token ID262
Feature activation-1.63e-02

Pos logit contributions
 next+0.103
my+0.097
mar+0.097
 closed+0.093
 packed+0.091
Neg logit contributions
alks-0.108
 conclusions-0.101
ocol-0.098
umps-0.094
 colony-0.092
Token things
Token ID1243
Feature activation-4.12e-02

Pos logit contributions
my+0.022
 next+0.019
 over+0.019
 tidy+0.018
 packed+0.018
Neg logit contributions
 conclusions-0.033
alks-0.031
ocol-0.031
oles-0.030
 Majesty-0.030
Token they
Token ID484
Feature activation+1.39e-03

Pos logit contributions
my+0.066
Close+0.059
 packed+0.053
pack+0.053
 next+0.053
Neg logit contributions
alks-0.064
oles-0.062
 conclusions-0.060
 fingertips-0.059
ocol-0.056
Token needed
Token ID2622
Feature activation+9.87e-03

Pos logit contributions
 conclusions+0.003
alks+0.003
oles+0.003
ocol+0.003
 Majesty+0.003
Neg logit contributions
my-0.002
 next-0.002
 tidy-0.001
 over-0.001
 clean-0.001
Token.
Token ID13
Feature activation-4.77e-03

Pos logit contributions
 conclusions+0.024
alks+0.023
 Majesty+0.023
oles+0.022
ocol+0.022
Neg logit contributions
 next-0.012
my-0.011
 over-0.010
 tidy-0.010
 clean-0.010
Token idea
Token ID2126
Feature activation-1.08e-02

Pos logit contributions
my+0.045
 next+0.040
 tidy+0.038
 over+0.038
 packed+0.037
Neg logit contributions
 conclusions-0.060
alks-0.058
ocol-0.056
 Majesty-0.055
oles-0.055
Token.
Token ID13
Feature activation-4.85e-03

Pos logit contributions
my+0.017
 next+0.016
 tidy+0.016
 over+0.015
 clean+0.015
Neg logit contributions
 conclusions-0.020
alks-0.019
oles-0.018
ocol-0.018
 Majesty-0.018
Token They
Token ID1119
Feature activation-2.05e-02

Pos logit contributions
my+0.007
 next+0.007
 tidy+0.006
 over+0.006
 clean+0.006
Neg logit contributions
 conclusions-0.009
alks-0.009
oles-0.009
ocol-0.009
 Majesty-0.009
Token filled
Token ID5901
Feature activation-3.69e-02

Pos logit contributions
 next+0.020
my+0.019
 tidy+0.017
Close+0.016
pack+0.015
Neg logit contributions
alks-0.044
oles-0.042
 conclusions-0.042
umps-0.041
 anth-0.041
Token up
Token ID510
Feature activation-3.30e-02

Pos logit contributions
my+0.048
 packed+0.043
 tidy+0.040
 pack+0.040
 next+0.039
Neg logit contributions
 conclusions-0.073
alks-0.073
oles-0.070
ocol-0.065
 predators-0.065
Token buckets
Token ID38674
Feature activation-6.70e-02

Pos logit contributions
my+0.042
 packed+0.037
 pack+0.035
 tidy+0.035
 next+0.034
Neg logit contributions
 conclusions-0.067
alks-0.067
joy-0.063
oles-0.062
ocol-0.061
Token with
Token ID351
Feature activation-1.63e-04

Pos logit contributions
my+0.096
Close+0.093
 cost+0.090
Good+0.085
 true+0.083
Neg logit contributions
alks-0.118
oles-0.115
umps-0.106
 vines-0.100
 webs-0.094
Token water
Token ID1660
Feature activation-3.43e-02

Pos logit contributions
my+0.000
 clean+0.000
 tidy+0.000
 next+0.000
 over+0.000
Neg logit contributions
 conclusions-0.000
 Majesty-0.000
alks-0.000
oles-0.000
 perspective-0.000
Token and
Token ID290
Feature activation-1.48e-03

Pos logit contributions
my+0.031
 next+0.028
 tidy+0.026
 over+0.025
 clean+0.024
Neg logit contributions
 conclusions-0.087
alks-0.084
oles-0.081
ocol-0.081
 Majesty-0.080
Token ran
Token ID4966
Feature activation-4.02e-03

Pos logit contributions
my+0.002
 next+0.002
 tidy+0.001
 over+0.001
pack+0.001
Neg logit contributions
 conclusions-0.003
alks-0.003
oles-0.003
ocol-0.003
 fingertips-0.003
Token outside
Token ID2354
Feature activation-7.78e-03

Pos logit contributions
my+0.004
 next+0.004
 tidy+0.004
 packed+0.003
 pack+0.003
Neg logit contributions
 conclusions-0.009
alks-0.009
oles-0.008
ocol-0.008
joy-0.008
Token.
Token ID13
Feature activation+3.41e-03

Pos logit contributions
my+0.041
 tidy+0.037
Close+0.036
 packed+0.035
 next+0.035
Neg logit contributions
 conclusions-0.067
alks-0.067
 perspective-0.064
 Majesty-0.063
oles-0.063
Token I
Token ID314
Feature activation-2.66e-03

Pos logit contributions
 conclusions+0.006
alks+0.006
 Majesty+0.006
oles+0.006
ocol+0.006
Neg logit contributions
my-0.005
 next-0.005
 tidy-0.005
 packed-0.004
 clean-0.004
Token have
Token ID423
Feature activation+3.02e-03

Pos logit contributions
my+0.003
 next+0.003
 over+0.003
 tidy+0.003
Close+0.003
Neg logit contributions
 conclusions-0.005
alks-0.005
ocol-0.005
 Majesty-0.005
 perspective-0.005
Token to
Token ID284
Feature activation-3.56e-02

Pos logit contributions
 conclusions+0.008
alks+0.008
oles+0.007
ocol+0.007
 Majesty+0.007
Neg logit contributions
my-0.003
 next-0.002
 tidy-0.002
 over-0.002
 packed-0.002
Token bring
Token ID2222
Feature activation-2.74e-02

Pos logit contributions
my+0.036
 over+0.029
 next+0.028
Close+0.026
 tidy+0.024
Neg logit contributions
 conclusions-0.086
alks-0.080
 Majesty-0.079
ocol-0.079
 predators-0.078
Token all
Token ID477
Feature activation-6.70e-02

Pos logit contributions
my+0.029
 next+0.025
 tidy+0.025
 packed+0.023
 over+0.023
Neg logit contributions
 conclusions-0.064
alks-0.062
ocol-0.060
 Majesty-0.059
oles-0.058
Token my
Token ID616
Feature activation+1.17e-02

Pos logit contributions
my+0.085
mar+0.084
 next+0.080
 cost+0.080
organ+0.080
Neg logit contributions
ocol-0.114
alks-0.114
 conclusions-0.104
 predators-0.101
umps-0.097
Token things
Token ID1243
Feature activation-3.90e-02

Pos logit contributions
 conclusions+0.020
alks+0.019
ocol+0.019
oles+0.018
 Majesty+0.018
Neg logit contributions
my-0.019
 next-0.018
 over-0.017
 tidy-0.017
 clean-0.017
Token with
Token ID351
Feature activation-3.30e-03

Pos logit contributions
my+0.055
Close+0.048
 tidy+0.045
 packed+0.044
pack+0.044
Neg logit contributions
alks-0.069
 conclusions-0.068
oles-0.065
 Majesty-0.064
ocol-0.063
Token me
Token ID502
Feature activation+3.97e-03

Pos logit contributions
my+0.003
 next+0.003
 tidy+0.003
 clean+0.003
 over+0.003
Neg logit contributions
 conclusions-0.008
alks-0.008
oles-0.008
 Majesty-0.008
joy-0.008
Token."
Token ID526
Feature activation-5.10e-03

Pos logit contributions
 conclusions+0.009
alks+0.008
ocol+0.008
oles+0.008
 perspective+0.008
Neg logit contributions
my-0.005
 next-0.005
 tidy-0.005
 over-0.005
 packed-0.005
Token.
Token ID13
Feature activation-5.27e-03

Pos logit contributions
 conclusions+0.047
alks+0.045
oles+0.043
ocol+0.043
 Majesty+0.042
Neg logit contributions
my-0.034
 next-0.032
 tidy-0.030
 over-0.030
 clean-0.029
Token Where
Token ID6350
Feature activation+1.61e-02

Pos logit contributions
my+0.009
 next+0.008
 tidy+0.008
 packed+0.008
 clean+0.007
Neg logit contributions
 conclusions-0.009
alks-0.009
 Majesty-0.009
ocol-0.009
oles-0.008
Token did
Token ID750
Feature activation+5.56e-03

Pos logit contributions
oles+0.028
 conclusions+0.028
umps+0.027
 Majesty+0.027
orry+0.026
Neg logit contributions
 next-0.021
my-0.021
 over-0.020
 tidy-0.019
 late-0.019
Token you
Token ID345
Feature activation-9.21e-03

Pos logit contributions
 conclusions+0.013
alks+0.012
oles+0.012
ocol+0.012
 Majesty+0.012
Neg logit contributions
my-0.006
 next-0.006
 tidy-0.006
 over-0.005
 clean-0.005
Token get
Token ID651
Feature activation-2.70e-02

Pos logit contributions
my+0.010
 next+0.009
 tidy+0.009
 packed+0.009
 over+0.009
Neg logit contributions
 conclusions-0.021
alks-0.020
oles-0.020
ocol-0.019
 perspective-0.019
Token it
Token ID340
Feature activation-6.68e-02

Pos logit contributions
my+0.037
 next+0.033
 tidy+0.033
 packed+0.031
 clean+0.030
Neg logit contributions
 conclusions-0.055
alks-0.054
ocol-0.051
oles-0.051
 Majesty-0.050
Token?"
Token ID1701
Feature activation-1.73e-02

Pos logit contributions
Close+0.082
my+0.079
 packed+0.076
 tidy+0.074
pack+0.068
Neg logit contributions
alks-0.117
 conclusions-0.112
oles-0.106
 Majesty-0.105
der-0.102
Token Tom
Token ID4186
Feature activation-2.76e-02

Pos logit contributions
my+0.021
 next+0.019
 tidy+0.017
 over+0.017
 clean+0.017
Neg logit contributions
 conclusions-0.040
alks-0.039
 Majesty-0.038
ocol-0.038
oles-0.037
Token says
Token ID1139
Feature activation+2.07e-03

Pos logit contributions
my+0.037
 next+0.036
 tidy+0.035
 packed+0.035
 clean+0.033
Neg logit contributions
 conclusions-0.052
alks-0.051
 Majesty-0.049
oles-0.048
joy-0.047
Token "
Token ID366
Feature activation-8.60e-03

Pos logit contributions
 conclusions+0.004
alks+0.004
ocol+0.004
oles+0.004
 Majesty+0.004
Neg logit contributions
my-0.003
 next-0.003
 over-0.003
 tidy-0.003
 clean-0.003
TokenDad
Token ID46270
Feature activation-1.87e-03

Pos logit contributions
my+0.009
 tidy+0.008
 clean+0.008
 packed+0.008
 pack+0.007
Neg logit contributions
 conclusions-0.020
alks-0.018
oles-0.018
 perspective-0.018
 fingertips-0.018
Token.
Token ID13
Feature activation+8.32e-03

Pos logit contributions
my+0.034
 tidy+0.031
 packed+0.031
 pack+0.030
pack+0.030
Neg logit contributions
 conclusions-0.029
alks-0.028
oles-0.028
 Majesty-0.027
joy-0.026
Token Where
Token ID6350
Feature activation+2.16e-02

Pos logit contributions
 conclusions+0.013
alks+0.013
 Majesty+0.012
ocol+0.012
 fingertips+0.012
Neg logit contributions
my-0.014
 next-0.014
 packed-0.013
 tidy-0.013
 clean-0.013
Token did
Token ID750
Feature activation+1.31e-02

Pos logit contributions
oles+0.036
 conclusions+0.036
umps+0.035
orry+0.035
 Majesty+0.034
Neg logit contributions
 next-0.029
 late-0.027
my-0.027
 over-0.026
pack-0.025
Token you
Token ID345
Feature activation-1.09e-02

Pos logit contributions
 conclusions+0.027
alks+0.026
oles+0.025
ocol+0.025
 Majesty+0.025
Neg logit contributions
my-0.018
 next-0.016
 tidy-0.015
 over-0.015
 packed-0.014
Token get
Token ID651
Feature activation-3.03e-02

Pos logit contributions
my+0.012
 next+0.011
 tidy+0.011
 packed+0.010
 over+0.010
Neg logit contributions
 conclusions-0.025
alks-0.023
 perspective-0.022
ocol-0.022
oles-0.022
Token it
Token ID340
Feature activation-6.67e-02

Pos logit contributions
my+0.051
 tidy+0.047
 next+0.047
 packed+0.045
 pack+0.045
Neg logit contributions
 conclusions-0.050
alks-0.049
ocol-0.046
 Majesty-0.044
oles-0.044
Token?"
Token ID1701
Feature activation+2.57e-03

Pos logit contributions
Close+0.095
Yes+0.084
 packed+0.083
my+0.082
Hold+0.081
Neg logit contributions
alks-0.095
 anymore-0.094
 conclusions-0.090
 pad-0.088
 Majesty-0.086
Token
Token ID198
Feature activation-5.02e-03

Pos logit contributions
 conclusions+0.007
alks+0.006
ocol+0.006
oles+0.006
 predators+0.006
Neg logit contributions
my-0.003
 next-0.002
 tidy-0.002
 clean-0.002
 over-0.002
Token
Token ID198
Feature activation+7.40e-03

Pos logit contributions
my+0.005
 next+0.005
 tidy+0.005
 over+0.005
 clean+0.004
Neg logit contributions
 conclusions-0.012
alks-0.012
ocol-0.012
oles-0.011
 Majesty-0.011
TokenL
Token ID43
Feature activation-1.53e-02

Pos logit contributions
 conclusions+0.017
alks+0.016
 Majesty+0.015
oles+0.015
 perspective+0.015
Neg logit contributions
my-0.008
 next-0.008
 packed-0.007
 over-0.007
 tidy-0.007
Tokenily
Token ID813
Feature activation-3.28e-03

Pos logit contributions
my+0.027
 pack+0.026
 packed+0.026
 tidy+0.026
 next+0.025
Neg logit contributions
 conclusions-0.023
ocol-0.021
oles-0.021
alks-0.021
der-0.020
Token clean
Token ID3424
Feature activation-3.18e-02

Pos logit contributions
my+0.016
 late+0.013
 next+0.013
Close+0.013
eddy+0.013
Neg logit contributions
 conclusions-0.029
 fingertips-0.028
alks-0.026
ocol-0.026
 anth-0.025
Token his
Token ID465
Feature activation-3.85e-03

Pos logit contributions
my+0.028
 packed+0.025
 next+0.024
Close+0.023
 tidy+0.022
Neg logit contributions
 conclusions-0.078
alks-0.076
ocol-0.074
 Majesty-0.072
 anth-0.071
Token room
Token ID2119
Feature activation-2.23e-02

Pos logit contributions
my+0.005
 packed+0.004
Close+0.004
 next+0.004
 over+0.004
Neg logit contributions
 conclusions-0.008
alks-0.007
 Majesty-0.007
 anth-0.007
 fingertips-0.007
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
my+0.035
Close+0.032
 packed+0.031
 next+0.030
 tidy+0.030
Neg logit contributions
 conclusions-0.037
alks-0.035
oles-0.033
ocol-0.033
 perspective-0.033
Token put
Token ID1234
Feature activation-3.12e-02

Pos logit contributions
my+0.010
Close+0.008
eddy+0.008
 next+0.008
go+0.008
Neg logit contributions
 conclusions-0.023
 fingertips-0.022
 Majesty-0.022
ocol-0.022
 mouths-0.021
Token all
Token ID477
Feature activation-6.65e-02

Pos logit contributions
my+0.036
 packed+0.034
Close+0.034
 next+0.033
 tidy+0.033
Neg logit contributions
ocol-0.062
alks-0.060
 conclusions-0.059
 predators-0.054
ighters-0.054
Token his
Token ID465
Feature activation+3.18e-03

Pos logit contributions
 packed+0.093
mar+0.091
man+0.090
 read+0.086
 true+0.086
Neg logit contributions
ocol-0.103
alks-0.096
umps-0.089
 conclusions-0.085
ighters-0.078
Token toys
Token ID14958
Feature activation-3.41e-02

Pos logit contributions
 conclusions+0.007
alks+0.007
ocol+0.007
oles+0.007
 Majesty+0.007
Neg logit contributions
my-0.004
 next-0.003
 tidy-0.003
 over-0.003
 packed-0.003
Token in
Token ID287
Feature activation-1.31e-02

Pos logit contributions
Close+0.038
my+0.038
 packed+0.032
 pack+0.031
 bring+0.030
Neg logit contributions
 conclusions-0.062
alks-0.061
ighters-0.059
ocol-0.058
 pad-0.057
Token the
Token ID262
Feature activation-3.14e-03

Pos logit contributions
my+0.014
 next+0.013
 packed+0.012
Close+0.012
 tidy+0.012
Neg logit contributions
 conclusions-0.030
alks-0.029
ocol-0.027
 perspective-0.027
joy-0.027
Token bin
Token ID9874
Feature activation-1.80e-03

Pos logit contributions
my+0.005
 packed+0.004
 next+0.004
 over+0.004
 bring+0.004
Neg logit contributions
 conclusions-0.006
alks-0.005
 colony-0.005
 anth-0.005
ocol-0.005
Token your
Token ID534
Feature activation+4.39e-03

Pos logit contributions
my+0.015
 next+0.014
 tidy+0.013
 over+0.012
 clean+0.012
Neg logit contributions
 conclusions-0.045
alks-0.044
oles-0.043
ocol-0.043
 Majesty-0.042
Token room
Token ID2119
Feature activation-3.30e-02

Pos logit contributions
 conclusions+0.010
alks+0.009
oles+0.009
ocol+0.009
 Majesty+0.009
Neg logit contributions
my-0.005
 next-0.005
 tidy-0.005
 over-0.005
 packed-0.004
Token now
Token ID783
Feature activation-3.38e-02

Pos logit contributions
my+0.033
 next+0.028
 packed+0.028
 tidy+0.028
Close+0.027
Neg logit contributions
 conclusions-0.074
alks-0.072
 Majesty-0.070
 perspective-0.069
joy-0.068
Token?
Token ID30
Feature activation+8.26e-03

Pos logit contributions
my+0.038
Close+0.033
 packed+0.032
 tidy+0.031
 next+0.031
Neg logit contributions
 conclusions-0.067
 Majesty-0.067
alks-0.064
joy-0.062
ocol-0.062
Token Put
Token ID5930
Feature activation-2.51e-02

Pos logit contributions
 conclusions+0.016
alks+0.016
ocol+0.015
oles+0.015
 Majesty+0.015
Neg logit contributions
my-0.012
 next-0.011
 tidy-0.011
 over-0.010
 packed-0.010
Token all
Token ID477
Feature activation-6.62e-02

Pos logit contributions
my+0.032
 next+0.030
 tidy+0.029
 packed+0.028
Close+0.027
Neg logit contributions
ocol-0.049
 conclusions-0.049
alks-0.048
 Majesty-0.044
 predators-0.044
Token your
Token ID534
Feature activation+7.81e-03

Pos logit contributions
 packed+0.082
man+0.080
mar+0.078
 pack+0.077
Close+0.074
Neg logit contributions
ocol-0.117
 conclusions-0.108
alks-0.107
umps-0.096
ir-0.092
Token toys
Token ID14958
Feature activation-3.71e-02

Pos logit contributions
 conclusions+0.019
alks+0.018
oles+0.017
ocol+0.017
 Majesty+0.017
Neg logit contributions
my-0.008
 next-0.007
 tidy-0.007
 over-0.007
 packed-0.007
Token away
Token ID1497
Feature activation-2.53e-02

Pos logit contributions
my+0.037
Close+0.034
 packed+0.031
 next+0.031
 tidy+0.030
Neg logit contributions
 conclusions-0.080
alks-0.078
ocol-0.074
 perspective-0.074
oles-0.073
Token and
Token ID290
Feature activation-1.85e-02

Pos logit contributions
my+0.035
 next+0.032
 tidy+0.031
 packed+0.030
 over+0.030
Neg logit contributions
 conclusions-0.051
alks-0.049
oles-0.047
ocol-0.046
 Majesty-0.046
Token make
Token ID787
Feature activation-4.09e-03

Pos logit contributions
my+0.017
 next+0.015
 over+0.014
 packed+0.013
Close+0.013
Neg logit contributions
 conclusions-0.044
alks-0.041
 Majesty-0.041
 fingertips-0.041
ocol-0.041
Token her
Token ID607
Feature activation-6.10e-03

Pos logit contributions
my+0.021
 next+0.017
 packed+0.017
 tidy+0.017
 over+0.016
Neg logit contributions
 conclusions-0.046
alks-0.045
ocol-0.043
oles-0.042
 fingertips-0.041
Token room
Token ID2119
Feature activation-1.94e-02

Pos logit contributions
my+0.008
 over+0.007
 next+0.007
 tidy+0.007
Close+0.007
Neg logit contributions
 conclusions-0.012
alks-0.012
ocol-0.011
 Majesty-0.011
oles-0.011
Token and
Token ID290
Feature activation-1.67e-02

Pos logit contributions
my+0.028
 next+0.026
 tidy+0.025
 over+0.024
 clean+0.024
Neg logit contributions
 conclusions-0.039
alks-0.038
oles-0.036
ocol-0.036
 Majesty-0.036
Token put
Token ID1234
Feature activation-3.54e-02

Pos logit contributions
my+0.016
eddy+0.015
man+0.014
go+0.014
 next+0.013
Neg logit contributions
 conclusions-0.031
 fingertips-0.031
 Majesty-0.031
ocol-0.029
 mouths-0.029
Token away
Token ID1497
Feature activation-1.49e-02

Pos logit contributions
my+0.038
 packed+0.034
 next+0.033
 tidy+0.033
Close+0.033
Neg logit contributions
ocol-0.074
alks-0.073
 conclusions-0.071
ighters-0.067
 webs-0.066
Token all
Token ID477
Feature activation-6.62e-02

Pos logit contributions
my+0.020
 next+0.018
Close+0.017
go+0.017
 packed+0.017
Neg logit contributions
 conclusions-0.028
alks-0.027
ocol-0.027
 Majesty-0.025
umps-0.025
Token of
Token ID286
Feature activation-2.16e-03

Pos logit contributions
man+0.095
mar+0.092
 packed+0.092
 pack+0.092
 Claus+0.090
Neg logit contributions
ocol-0.095
alks-0.094
umps-0.084
 conclusions-0.083
lee-0.078
Token her
Token ID607
Feature activation+2.12e-03

Pos logit contributions
my+0.002
 next+0.002
 tidy+0.002
 clean+0.002
 over+0.001
Neg logit contributions
 conclusions-0.006
alks-0.005
oles-0.005
 Majesty-0.005
 perspective-0.005
Token dress
Token ID6576
Feature activation-1.42e-02

Pos logit contributions
 conclusions+0.004
alks+0.004
ocol+0.004
oles+0.004
 Majesty+0.004
Neg logit contributions
my-0.003
 next-0.003
 over-0.003
 tidy-0.003
 packed-0.002
Token-
Token ID12
Feature activation-2.69e-02

Pos logit contributions
my+0.023
 next+0.021
 tidy+0.021
 over+0.020
 packed+0.020
Neg logit contributions
 conclusions-0.025
alks-0.024
oles-0.023
ocol-0.022
 Majesty-0.022
Tokenup
Token ID929
Feature activation-8.80e-03

Pos logit contributions
 next+0.028
 pack+0.027
my+0.027
 tidy+0.027
 also+0.027
Neg logit contributions
 conclusions-0.057
ocol-0.055
alks-0.053
oles-0.051
umps-0.050
Token.
Token ID13
Feature activation-6.92e-03

Pos logit contributions
my+0.044
 next+0.041
 tidy+0.039
 packed+0.039
Close+0.038
Neg logit contributions
 conclusions-0.054
alks-0.053
oles-0.050
 perspective-0.050
joy-0.049
Token
Token ID198
Feature activation-7.49e-03

Pos logit contributions
my+0.007
 next+0.006
 over+0.006
 tidy+0.005
 clean+0.004
Neg logit contributions
 conclusions-0.019
ocol-0.018
 predators-0.017
alks-0.017
oles-0.017
Token
Token ID198
Feature activation-7.99e-04

Pos logit contributions
my+0.007
 next+0.006
 over+0.006
 tidy+0.005
 clean+0.005
Neg logit contributions
 conclusions-0.020
ocol-0.019
alks-0.019
 Majesty-0.019
 predators-0.018
TokenLisa
Token ID44203
Feature activation-4.63e-03

Pos logit contributions
my+0.001
 next+0.001
 tidy+0.001
 packed+0.001
 over+0.001
Neg logit contributions
 conclusions-0.002
alks-0.002
oles-0.002
 Majesty-0.002
 perspective-0.002
Token put
Token ID1234
Feature activation-3.63e-02

Pos logit contributions
my+0.005
 next+0.004
 tidy+0.004
 over+0.004
 clean+0.004
Neg logit contributions
 conclusions-0.011
alks-0.011
oles-0.010
 Majesty-0.010
 colony-0.010
Token all
Token ID477
Feature activation-6.62e-02

Pos logit contributions
my+0.038
 packed+0.037
 tidy+0.035
 next+0.035
Close+0.034
Neg logit contributions
alks-0.075
ocol-0.075
 conclusions-0.073
 webs-0.070
joy-0.068
Token her
Token ID607
Feature activation-1.55e-03

Pos logit contributions
 pack+0.096
mar+0.095
 packed+0.094
 next+0.094
 cost+0.093
Neg logit contributions
alks-0.102
ocol-0.098
umps-0.092
 webs-0.090
 conclusions-0.085
Token dolls
Token ID36062
Feature activation-2.86e-02

Pos logit contributions
my+0.002
 next+0.002
 tidy+0.002
 over+0.002
 clean+0.002
Neg logit contributions
 conclusions-0.003
alks-0.003
oles-0.003
ocol-0.003
 Majesty-0.003
Token on
Token ID319
Feature activation+1.17e-02

Pos logit contributions
Close+0.029
my+0.028
 packed+0.024
 pack+0.024
 bring+0.023
Neg logit contributions
alks-0.058
oles-0.055
 pad-0.054
 conclusions-0.054
 webs-0.054
Token the
Token ID262
Feature activation+1.41e-03

Pos logit contributions
 Majesty+0.035
 conclusions+0.035
oles+0.035
alks+0.034
ocol+0.034
Neg logit contributions
 next-0.007
my-0.006
 tidy-0.006
 clean-0.006
 over-0.006
Token new
Token ID649
Feature activation+4.57e-03

Pos logit contributions
 conclusions+0.002
alks+0.002
oles+0.002
ocol+0.002
 Majesty+0.002
Neg logit contributions
my-0.002
 next-0.002
 tidy-0.002
 clean-0.002
 over-0.002
Token you
Token ID345
Feature activation-5.59e-03

Pos logit contributions
 conclusions+0.031
alks+0.029
oles+0.029
 Majesty+0.029
ocol+0.028
Neg logit contributions
my-0.020
 tidy-0.018
 over-0.018
 next-0.018
 packed-0.018
Token want
Token ID765
Feature activation-1.38e-02

Pos logit contributions
my+0.007
 next+0.007
 tidy+0.007
 over+0.006
 packed+0.006
Neg logit contributions
 conclusions-0.011
alks-0.011
ocol-0.011
 Majesty-0.010
oles-0.010
Token to
Token ID284
Feature activation-3.10e-02

Pos logit contributions
my+0.015
 tidy+0.014
 next+0.013
 packed+0.013
 pack+0.013
Neg logit contributions
 conclusions-0.031
alks-0.029
ocol-0.028
oles-0.028
 perspective-0.027
Token buy
Token ID2822
Feature activation-3.44e-02

Pos logit contributions
my+0.035
 next+0.031
 packed+0.031
Close+0.029
 tidy+0.028
Neg logit contributions
 conclusions-0.069
alks-0.065
 Majesty-0.061
ocol-0.061
 fingertips-0.060
Token some
Token ID617
Feature activation-2.39e-02

Pos logit contributions
my+0.033
 tidy+0.029
 next+0.029
 packed+0.028
 clean+0.026
Neg logit contributions
 conclusions-0.083
alks-0.081
ocol-0.079
 Majesty-0.077
oles-0.076
Token laundry
Token ID25724
Feature activation-6.62e-02

Pos logit contributions
my+0.030
 next+0.026
 tidy+0.025
 packed+0.025
 over+0.024
Neg logit contributions
 conclusions-0.051
ocol-0.049
alks-0.049
oles-0.047
 perspective-0.045
Token?
Token ID30
Feature activation+9.25e-04

Pos logit contributions
my+0.081
Close+0.078
 packed+0.070
Open+0.067
 tidy+0.066
Neg logit contributions
alks-0.115
 conclusions-0.114
ocol-0.113
bers-0.109
joy-0.107
Token I
Token ID314
Feature activation-1.14e-02

Pos logit contributions
 conclusions+0.002
alks+0.002
ocol+0.002
oles+0.002
 Majesty+0.002
Neg logit contributions
my-0.001
 next-0.001
 tidy-0.001
 clean-0.001
 packed-0.001
Token have
Token ID423
Feature activation-2.07e-03

Pos logit contributions
my+0.016
 next+0.016
 tidy+0.014
 over+0.014
Close+0.014
Neg logit contributions
 conclusions-0.020
ocol-0.019
alks-0.019
 perspective-0.018
oles-0.018
Token a
Token ID257
Feature activation+3.12e-04

Pos logit contributions
my+0.002
 next+0.001
 tidy+0.001
 clean+0.001
 packed+0.001
Neg logit contributions
 conclusions-0.006
alks-0.005
oles-0.005
ocol-0.005
 perspective-0.005
Token shirt
Token ID10147
Feature activation-2.79e-02

Pos logit contributions
 conclusions+0.001
alks+0.001
oles+0.001
ocol+0.001
 Majesty+0.001
Neg logit contributions
my-0.000
 next-0.000
 tidy-0.000
 over-0.000
 clean-0.000
Token pulled
Token ID5954
Feature activation+1.14e-03

Pos logit contributions
 conclusions+0.014
alks+0.013
ocol+0.013
oles+0.013
 Majesty+0.013
Neg logit contributions
my-0.006
 tidy-0.005
 next-0.005
 packed-0.005
 clean-0.005
Token and
Token ID290
Feature activation+8.94e-03

Pos logit contributions
 conclusions+0.003
alks+0.003
 Majesty+0.003
oles+0.003
ocol+0.003
Neg logit contributions
my-0.001
 next-0.001
 over-0.001
 tidy-0.001
 clean-0.001
Token tug
Token ID27762
Feature activation+3.09e-02

Pos logit contributions
 conclusions+0.017
alks+0.016
oles+0.015
ocol+0.015
 fingertips+0.015
Neg logit contributions
my-0.014
 next-0.013
 tidy-0.012
 over-0.012
 packed-0.012
Tokenged
Token ID2004
Feature activation+4.07e-02

Pos logit contributions
 conclusions+0.046
 Majesty+0.045
alks+0.045
 colony+0.042
oles+0.042
Neg logit contributions
my-0.062
 next-0.061
 over-0.059
 tidy-0.058
 clean-0.057
Token and
Token ID290
Feature activation+2.61e-02

Pos logit contributions
 Majesty+0.121
alks+0.114
 conclusions+0.113
 colony+0.110
ocol+0.110
Neg logit contributions
my-0.030
 over-0.029
 next-0.028
 with-0.027
 clean-0.025
Token shouted
Token ID17293
Feature activation+5.68e-02

Pos logit contributions
 Majesty+0.063
 conclusions+0.062
ocol+0.060
alks+0.060
oles+0.058
Neg logit contributions
my-0.032
 made-0.031
 over-0.031
 clean-0.030
 tidy-0.030
Token.
Token ID13
Feature activation+3.66e-02

Pos logit contributions
 conclusions+0.167
 colony+0.164
 perspective+0.161
 predators+0.159
alks+0.159
Neg logit contributions
 with-0.050
 over-0.049
 next-0.042
 clean-0.041
 right-0.038
Token They
Token ID1119
Feature activation+1.86e-02

Pos logit contributions
ocol+0.100
 conclusions+0.099
 perspective+0.094
 Majesty+0.092
oles+0.092
Neg logit contributions
 over-0.037
 next-0.034
 clean-0.032
 with-0.032
 tidy-0.031
Token did
Token ID750
Feature activation+1.65e-02

Pos logit contributions
 conclusions+0.048
alks+0.047
ocol+0.046
 Majesty+0.046
oles+0.044
Neg logit contributions
my-0.020
 tidy-0.017
 clean-0.017
 over-0.016
 made-0.016
Token not
Token ID407
Feature activation+3.39e-03

Pos logit contributions
 conclusions+0.050
alks+0.050
ocol+0.049
oles+0.048
 Majesty+0.048
Neg logit contributions
 next-0.007
my-0.006
 tidy-0.006
 over-0.005
 clean-0.005
Token want
Token ID765
Feature activation+1.55e-02

Pos logit contributions
 conclusions+0.006
alks+0.006
ocol+0.005
 perspective+0.005
 Majesty+0.005
Neg logit contributions
my-0.005
 next-0.005
Close-0.005
 packed-0.005
 tidy-0.004
Token see
Token ID766
Feature activation+1.68e-02

Pos logit contributions
my+0.034
 next+0.030
 over+0.027
 tidy+0.026
Close+0.026
Neg logit contributions
 conclusions-0.066
alks-0.063
 Majesty-0.061
 perspective-0.060
 predators-0.059
Token lions
Token ID31420
Feature activation+3.14e-02

Pos logit contributions
 conclusions+0.040
alks+0.040
oles+0.039
ocol+0.038
 Majesty+0.038
Neg logit contributions
my-0.018
 next-0.016
 tidy-0.015
 over-0.015
 clean-0.015
Token,
Token ID11
Feature activation+9.56e-03

Pos logit contributions
 conclusions+0.079
alks+0.078
ocol+0.075
 Majesty+0.073
 perspective+0.073
Neg logit contributions
 next-0.033
my-0.032
 over-0.029
 with-0.027
 tidy-0.027
Token elephants
Token ID25368
Feature activation+3.79e-02

Pos logit contributions
 conclusions+0.013
alks+0.012
ocol+0.011
oles+0.011
 Majesty+0.011
Neg logit contributions
my-0.020
 next-0.020
 tidy-0.019
 over-0.019
 packed-0.018
Token,
Token ID11
Feature activation+4.72e-03

Pos logit contributions
 conclusions+0.098
alks+0.094
 Majesty+0.091
ocol+0.090
 fingertips+0.088
Neg logit contributions
my-0.040
 next-0.038
 over-0.034
 with-0.032
 packed-0.031
Token gir
Token ID37370
Feature activation+5.55e-02

Pos logit contributions
 conclusions+0.009
alks+0.009
oles+0.009
ocol+0.009
 Majesty+0.008
Neg logit contributions
my-0.007
 next-0.006
 tidy-0.006
 over-0.006
 clean-0.006
Tokenaff
Token ID2001
Feature activation+5.64e-02

Pos logit contributions
 conclusions+0.072
 fingertips+0.058
 Majesty+0.058
 colony+0.056
 perspective+0.056
Neg logit contributions
my-0.128
 next-0.119
 over-0.113
 packed-0.112
 with-0.109
Tokenes
Token ID274
Feature activation+2.88e-02

Pos logit contributions
 conclusions+0.113
 fingertips+0.096
 Majesty+0.095
 anth+0.094
 mischief+0.094
Neg logit contributions
my-0.103
 next-0.093
 over-0.090
 with-0.089
 on-0.085
Token and
Token ID290
Feature activation+2.45e-02

Pos logit contributions
 conclusions+0.084
alks+0.081
ocol+0.080
 Majesty+0.078
 fingertips+0.078
Neg logit contributions
 next-0.023
my-0.021
 over-0.020
 with-0.019
 from-0.018
Token more
Token ID517
Feature activation+1.91e-02

Pos logit contributions
alks+0.045
 conclusions+0.044
ocol+0.043
oles+0.043
 fingertips+0.041
Neg logit contributions
my-0.040
 next-0.038
 tidy-0.036
 over-0.035
 packed-0.035
Token.
Token ID13
Feature activation-4.80e-03

Pos logit contributions
 conclusions+0.044
alks+0.043
 fingertips+0.041
 Majesty+0.041
ocol+0.040
Neg logit contributions
 next-0.025
my-0.023
 over-0.021
 with-0.021
 tidy-0.021
Token walked
Token ID6807
Feature activation-1.65e-03

Pos logit contributions
my+0.000
 next+0.000
 tidy+0.000
 over+0.000
 clean+0.000
Neg logit contributions
 conclusions-0.001
alks-0.001
oles-0.001
 colony-0.001
 perspective-0.001
Token on
Token ID319
Feature activation+1.29e-02

Pos logit contributions
my+0.002
 tidy+0.002
Close+0.002
 bring+0.002
 pack+0.002
Neg logit contributions
alks-0.003
joy-0.003
 perspective-0.003
 conclusions-0.003
 pad-0.003
Token a
Token ID257
Feature activation+3.17e-02

Pos logit contributions
alks+0.037
 conclusions+0.036
oles+0.036
ocol+0.036
 Majesty+0.035
Neg logit contributions
my-0.008
 next-0.008
 tidy-0.007
 clean-0.007
 over-0.006
Token path
Token ID3108
Feature activation+1.39e-03

Pos logit contributions
 conclusions+0.069
alks+0.066
oles+0.064
 Majesty+0.063
 perspective+0.063
Neg logit contributions
my-0.039
 next-0.035
 over-0.033
 tidy-0.032
 packed-0.031
Token through
Token ID832
Feature activation+3.98e-02

Pos logit contributions
 conclusions+0.003
alks+0.003
oles+0.003
 perspective+0.003
joy+0.003
Neg logit contributions
my-0.002
 tidy-0.001
 next-0.001
 clean-0.001
 pack-0.001
Token a
Token ID257
Feature activation+5.66e-02

Pos logit contributions
alks+0.111
ocol+0.108
 conclusions+0.107
oles+0.107
 Majesty+0.107
Neg logit contributions
my-0.027
 next-0.026
 tidy-0.023
 over-0.022
 clean-0.021
Token big
Token ID1263
Feature activation+1.25e-02

Pos logit contributions
ocol+0.154
alks+0.153
 conclusions+0.147
oles+0.145
 fingertips+0.145
Neg logit contributions
my-0.049
 tidy-0.048
 clean-0.043
 next-0.043
 organized-0.038
Token bush
Token ID24484
Feature activation+2.20e-02

Pos logit contributions
 conclusions+0.023
alks+0.022
oles+0.022
ocol+0.021
 Majesty+0.021
Neg logit contributions
my-0.021
 next-0.019
 tidy-0.019
 clean-0.018
 over-0.018
Token with
Token ID351
Feature activation+1.28e-02

Pos logit contributions
 conclusions+0.044
alks+0.043
ocol+0.041
oles+0.041
 Majesty+0.041
Neg logit contributions
my-0.031
 next-0.030
 tidy-0.028
 over-0.028
 clean-0.027
Token lots
Token ID6041
Feature activation+1.17e-02

Pos logit contributions
 conclusions+0.031
alks+0.030
oles+0.029
ocol+0.029
 Majesty+0.029
Neg logit contributions
my-0.013
 next-0.012
 tidy-0.012
 over-0.011
 clean-0.011
Token of
Token ID286
Feature activation+1.13e-02

Pos logit contributions
 conclusions+0.036
 Majesty+0.036
alks+0.035
 fingertips+0.034
ocol+0.034
Neg logit contributions
my-0.004
 next-0.004
 tidy-0.003
 clean-0.003
 over-0.003
Token Ben
Token ID3932
Feature activation+5.40e-03

Pos logit contributions
my+0.028
 next+0.025
 tidy+0.024
 over+0.023
 clean+0.023
Neg logit contributions
 conclusions-0.057
alks-0.056
oles-0.053
ocol-0.053
 Majesty-0.053
Token and
Token ID290
Feature activation-2.42e-02

Pos logit contributions
 conclusions+0.014
alks+0.013
ocol+0.013
oles+0.013
 Majesty+0.013
Neg logit contributions
my-0.005
 next-0.004
 tidy-0.004
 over-0.004
 packed-0.004
Token Lily
Token ID20037
Feature activation+2.88e-02

Pos logit contributions
pack+0.036
my+0.035
 next+0.032
man+0.032
Close+0.031
Neg logit contributions
 conclusions-0.041
joy-0.038
alks-0.038
ocol-0.037
 colony-0.037
Token liked
Token ID8288
Feature activation+2.29e-02

Pos logit contributions
 conclusions+0.078
alks+0.077
ocol+0.077
 Majesty+0.073
 predators+0.071
Neg logit contributions
my-0.029
 also-0.024
 tidy-0.024
 clean-0.024
 next-0.024
Token the
Token ID262
Feature activation-9.18e-03

Pos logit contributions
 conclusions+0.054
ocol+0.052
alks+0.052
 fingertips+0.050
oles+0.050
Neg logit contributions
 next-0.027
my-0.027
 over-0.025
 tidy-0.024
 clean-0.024
Token ze
Token ID41271
Feature activation+7.09e-02

Pos logit contributions
my+0.017
 next+0.016
 tidy+0.014
 over+0.014
 packed+0.014
Neg logit contributions
 conclusions-0.016
alks-0.015
ocol-0.015
 fingertips-0.015
oles-0.015
Tokenbr
Token ID1671
Feature activation+1.42e-02

Pos logit contributions
 conclusions+0.156
 Majesty+0.150
 perspective+0.148
alks+0.147
 fingertips+0.143
Neg logit contributions
my-0.092
 next-0.086
 over-0.079
 tidy-0.077
 packed-0.076
Tokenas
Token ID292
Feature activation+5.34e-03

Pos logit contributions
alks+0.018
 conclusions+0.018
oles+0.017
ocol+0.016
 Majesty+0.015
Neg logit contributions
my-0.030
 next-0.029
 tidy-0.029
 clean-0.028
 over-0.028
Token the
Token ID262
Feature activation+8.07e-03

Pos logit contributions
 conclusions+0.012
alks+0.012
 Majesty+0.011
ocol+0.011
 perspective+0.011
Neg logit contributions
my-0.007
 next-0.007
 over-0.006
 tidy-0.006
 clean-0.006
Token best
Token ID1266
Feature activation+1.27e-02

Pos logit contributions
 conclusions+0.016
alks+0.015
ocol+0.015
oles+0.015
 fingertips+0.015
Neg logit contributions
my-0.014
 next-0.013
 set-0.011
 made-0.011
 right-0.011
Token.
Token ID13
Feature activation-1.93e-02

Pos logit contributions
 conclusions+0.028
alks+0.027
 Majesty+0.027
ocol+0.027
 fingertips+0.026
Neg logit contributions
my-0.016
 next-0.016
 over-0.015
 with-0.014
 packed-0.013
Token he
Token ID339
Feature activation+1.45e-02

Pos logit contributions
 conclusions+0.027
alks+0.025
 Majesty+0.025
oles+0.025
ocol+0.024
Neg logit contributions
 next-0.010
my-0.009
 over-0.008
 clean-0.008
 tidy-0.008
Token reached
Token ID4251
Feature activation-1.63e-02

Pos logit contributions
 conclusions+0.037
alks+0.035
ocol+0.034
 perspective+0.034
oles+0.033
Neg logit contributions
my-0.015
 next-0.014
 packed-0.013
 over-0.013
 took-0.012
Token out
Token ID503
Feature activation-1.67e-03

Pos logit contributions
my+0.013
 next+0.012
 over+0.011
 tidy+0.011
 clean+0.010
Neg logit contributions
 conclusions-0.043
alks-0.042
oles-0.041
ocol-0.040
 Majesty-0.040
Token his
Token ID465
Feature activation+1.63e-02

Pos logit contributions
my+0.001
 next+0.001
 over+0.001
 tidy+0.001
 clean+0.001
Neg logit contributions
 conclusions-0.004
alks-0.004
 Majesty-0.004
oles-0.004
ocol-0.004
Token tiny
Token ID7009
Feature activation+2.08e-02

Pos logit contributions
 conclusions+0.033
alks+0.031
oles+0.030
ocol+0.030
 perspective+0.029
Neg logit contributions
my-0.025
 next-0.024
 tidy-0.022
 clean-0.022
 over-0.022
Token cater
Token ID20825
Feature activation+6.79e-02

Pos logit contributions
 conclusions+0.040
alks+0.038
oles+0.038
ighters+0.038
ants+0.036
Neg logit contributions
my-0.033
 next-0.031
 tidy-0.030
 clean-0.029
 pack-0.028
Tokenpillar
Token ID41643
Feature activation+3.03e-02

Pos logit contributions
 Majesty+0.094
 conclusions+0.094
alks+0.090
 fingertips+0.082
 mouths+0.080
Neg logit contributions
my-0.148
 next-0.125
 over-0.120
 made-0.118
 with-0.116
Token hand
Token ID1021
Feature activation-4.41e-03

Pos logit contributions
 conclusions+0.073
alks+0.070
ocol+0.068
 Majesty+0.065
 perspective+0.064
Neg logit contributions
my-0.039
 next-0.038
 over-0.033
 with-0.032
 closed-0.031
Token and
Token ID290
Feature activation+1.26e-02

Pos logit contributions
my+0.005
 next+0.005
 over+0.005
 tidy+0.004
 clean+0.004
Neg logit contributions
 conclusions-0.010
alks-0.010
ocol-0.010
 Majesty-0.010
 perspective-0.009
Token grasped
Token ID45569
Feature activation+2.93e-03

Pos logit contributions
 conclusions+0.030
alks+0.029
ocol+0.028
 perspective+0.028
oles+0.028
Neg logit contributions
my-0.014
 next-0.013
 over-0.012
 tidy-0.012
 clean-0.012
Token the
Token ID262
Feature activation+2.36e-04

Pos logit contributions
 conclusions+0.007
alks+0.007
oles+0.007
 perspective+0.007
 Majesty+0.007
Neg logit contributions
 next-0.003
my-0.003
 over-0.003
 tidy-0.002
 clean-0.002
Token all
Token ID477
Feature activation-1.12e-02

Pos logit contributions
my+0.002
 next+0.002
 tidy+0.002
 over+0.002
 packed+0.002
Neg logit contributions
 conclusions-0.006
alks-0.006
ocol-0.005
oles-0.005
 Majesty-0.005
Token looked
Token ID3114
Feature activation+1.06e-02

Pos logit contributions
 next+0.012
my+0.012
 tidy+0.010
pack+0.010
 pack+0.010
Neg logit contributions
 conclusions-0.024
alks-0.023
oles-0.023
 perspective-0.023
ocol-0.023
Token around
Token ID1088
Feature activation+1.51e-02

Pos logit contributions
 conclusions+0.026
alks+0.025
oles+0.024
 Majesty+0.024
ocol+0.023
Neg logit contributions
my-0.012
 next-0.011
 over-0.011
 tidy-0.011
 clean-0.010
Token,
Token ID11
Feature activation+8.59e-03

Pos logit contributions
 Majesty+0.048
oles+0.047
 conclusions+0.047
alks+0.046
ants+0.045
Neg logit contributions
 next-0.011
 over-0.009
 with-0.008
my-0.007
 clean-0.006
Token but
Token ID475
Feature activation+3.98e-03

Pos logit contributions
 conclusions+0.017
alks+0.016
oles+0.016
ocol+0.015
 Majesty+0.015
Neg logit contributions
my-0.013
 next-0.012
 tidy-0.011
 over-0.011
 clean-0.011
Token they
Token ID484
Feature activation+2.21e-02

Pos logit contributions
 conclusions+0.009
alks+0.009
oles+0.008
ocol+0.008
 Majesty+0.008
Neg logit contributions
my-0.005
 next-0.004
 tidy-0.004
 over-0.004
 clean-0.004
Token could
Token ID714
Feature activation+1.00e-02

Pos logit contributions
 conclusions+0.051
alks+0.049
ocol+0.047
 Majesty+0.047
 perspective+0.045
Neg logit contributions
my-0.028
 next-0.026
 tidy-0.025
 packed-0.024
 over-0.024
Token not
Token ID407
Feature activation+3.82e-03

Pos logit contributions
 conclusions+0.022
oles+0.021
alks+0.021
 fingertips+0.020
 Majesty+0.020
Neg logit contributions
my-0.012
 packed-0.009
Close-0.009
 next-0.009
 tidy-0.009
Token find
Token ID1064
Feature activation+2.37e-02

Pos logit contributions
 conclusions+0.008
alks+0.008
oles+0.007
ocol+0.007
 Majesty+0.007
Neg logit contributions
my-0.005
 next-0.005
 tidy-0.004
 over-0.004
 packed-0.004
Token it
Token ID340
Feature activation+3.93e-04

Pos logit contributions
oles+0.059
 conclusions+0.058
alks+0.058
 Majesty+0.057
 perspective+0.057
Neg logit contributions
my-0.022
 over-0.021
 next-0.020
 clean-0.019
 tidy-0.019
Token.
Token ID13
Feature activation-4.06e-04

Pos logit contributions
 conclusions+0.001
alks+0.001
ocol+0.001
 Majesty+0.001
oles+0.001
Neg logit contributions
my-0.001
 next-0.001
 over-0.001
 tidy-0.001
 clean-0.001
Token share
Token ID2648
Feature activation+5.29e-03

Pos logit contributions
my+0.009
 next+0.008
 packed+0.008
 tidy+0.007
 over+0.007
Neg logit contributions
 conclusions-0.023
alks-0.021
 Majesty-0.021
oles-0.021
 perspective-0.020
Token it
Token ID340
Feature activation-2.33e-02

Pos logit contributions
alks+0.014
 conclusions+0.014
oles+0.014
ocol+0.013
 Majesty+0.013
Neg logit contributions
my-0.004
 next-0.004
 over-0.004
 tidy-0.004
 clean-0.003
Token."
Token ID526
Feature activation+1.05e-02

Pos logit contributions
my+0.021
 next+0.017
 tidy+0.017
 packed+0.017
 pack+0.015
Neg logit contributions
 conclusions-0.058
alks-0.056
oles-0.055
 Majesty-0.054
ocol-0.053
Token
Token ID220
Feature activation+2.18e-02

Pos logit contributions
ocol+0.026
 conclusions+0.025
 predators+0.024
 Majesty+0.024
ighters+0.024
Neg logit contributions
my-0.014
 next-0.011
 over-0.010
 with-0.009
 took-0.009
Token
Token ID198
Feature activation+1.15e-02

Pos logit contributions
 conclusions+0.060
ocol+0.057
 predators+0.056
 Majesty+0.055
alks+0.055
Neg logit contributions
my-0.019
 next-0.016
 over-0.015
 clean-0.012
 tidy-0.012
Token
Token ID198
Feature activation+1.29e-02

Pos logit contributions
 conclusions+0.031
ocol+0.030
alks+0.029
 Majesty+0.029
 predators+0.029
Neg logit contributions
my-0.010
 next-0.009
 over-0.008
 tidy-0.007
 clean-0.007
TokenBill
Token ID17798
Feature activation-2.40e-03

Pos logit contributions
 conclusions+0.027
alks+0.026
ocol+0.025
 Majesty+0.025
oles+0.024
Neg logit contributions
my-0.018
 next-0.017
 tidy-0.016
 over-0.015
 packed-0.015
Token was
Token ID373
Feature activation+7.98e-04

Pos logit contributions
my+0.002
 next+0.002
 tidy+0.002
 clean+0.002
 over+0.002
Neg logit contributions
 conclusions-0.005
alks-0.005
oles-0.005
 Majesty-0.005
ocol-0.005
Token happy
Token ID3772
Feature activation-1.84e-02

Pos logit contributions
 conclusions+0.001
alks+0.001
 Majesty+0.001
joy+0.001
 colony+0.001
Neg logit contributions
my-0.001
 next-0.001
pack-0.001
 tidy-0.001
Close-0.001
Token because
Token ID780
Feature activation-3.83e-02

Pos logit contributions
my+0.020
 next+0.019
 tidy+0.018
 over+0.017
 clean+0.016
Neg logit contributions
 conclusions-0.043
alks-0.042
oles-0.040
ocol-0.040
 Majesty-0.040
Token Dee
Token ID29195
Feature activation+2.15e-02

Pos logit contributions
my+0.052
pack+0.049
 tidy+0.049
 pack+0.048
 packed+0.046
Neg logit contributions
 conclusions-0.072
alks-0.071
oles-0.070
ocol-0.067
joy-0.067
Token the
Token ID262
Feature activation-2.40e-03

Pos logit contributions
my+0.002
 next+0.002
 over+0.002
 tidy+0.002
 clean+0.002
Neg logit contributions
alks-0.009
 conclusions-0.009
oles-0.008
 Majesty-0.008
ocol-0.008
Token waves
Token ID9813
Feature activation+1.57e-02

Pos logit contributions
my+0.004
 next+0.004
 tidy+0.004
 over+0.004
 packed+0.004
Neg logit contributions
 conclusions-0.004
alks-0.004
ocol-0.004
 Majesty-0.004
oles-0.004
Token and
Token ID290
Feature activation+9.02e-03

Pos logit contributions
 conclusions+0.043
alks+0.041
ocol+0.040
 Majesty+0.040
oles+0.040
Neg logit contributions
my-0.013
 next-0.012
 over-0.012
 tidy-0.010
 clean-0.010
Token spl
Token ID4328
Feature activation+1.64e-02

Pos logit contributions
 conclusions+0.021
alks+0.020
ocol+0.019
oles+0.019
 Majesty+0.019
Neg logit contributions
my-0.011
 next-0.010
 tidy-0.009
 over-0.009
 packed-0.009
Tokenashed
Token ID5263
Feature activation+3.42e-03

Pos logit contributions
 conclusions+0.026
alks+0.025
oles+0.024
ocol+0.023
 Majesty+0.023
Neg logit contributions
my-0.030
 next-0.029
 tidy-0.028
 over-0.027
 clean-0.027
Token around
Token ID1088
Feature activation+3.08e-02

Pos logit contributions
 conclusions+0.009
alks+0.009
 Majesty+0.009
oles+0.008
ocol+0.008
Neg logit contributions
my-0.003
 next-0.003
 over-0.003
 tidy-0.003
 clean-0.003
Token for
Token ID329
Feature activation+2.01e-02

Pos logit contributions
 conclusions+0.096
alks+0.096
 Majesty+0.095
oles+0.094
ocol+0.092
Neg logit contributions
 next-0.022
 with-0.021
 over-0.019
 in-0.015
 on-0.015
Token hours
Token ID2250
Feature activation+3.27e-02

Pos logit contributions
oles+0.041
alks+0.040
ants+0.037
 Majesty+0.036
 perspective+0.036
Neg logit contributions
 next-0.033
 a-0.031
 clean-0.030
 tidy-0.030
 put-0.029
Token!
Token ID0
Feature activation+9.00e-03

Pos logit contributions
 conclusions+0.087
 Majesty+0.082
alks+0.079
ocol+0.078
 unnecessary+0.075
Neg logit contributions
 next-0.039
 with-0.035
my-0.033
 over-0.033
 on-0.029
Token In
Token ID554
Feature activation-2.00e-02

Pos logit contributions
 conclusions+0.021
ocol+0.020
 perspective+0.020
oles+0.020
 predators+0.019
Neg logit contributions
my-0.011
 over-0.010
 next-0.010
 late-0.009
 clean-0.009
Token the
Token ID262
Feature activation-3.57e-03

Pos logit contributions
my+0.020
 next+0.018
 tidy+0.017
 over+0.016
 packed+0.016
Neg logit contributions
 conclusions-0.048
alks-0.047
ocol-0.045
oles-0.045
 Majesty-0.045
Token is
Token ID318
Feature activation+8.81e-03

Pos logit contributions
my+0.001
 next+0.001
 tidy+0.001
 over+0.001
 clean+0.001
Neg logit contributions
 conclusions-0.003
alks-0.003
oles-0.003
ocol-0.003
 Majesty-0.003
Token shocked
Token ID11472
Feature activation-2.09e-02

Pos logit contributions
 conclusions+0.015
alks+0.014
 Majesty+0.014
joy+0.014
 colony+0.013
Neg logit contributions
my-0.014
 next-0.013
 packed-0.012
Close-0.012
 tidy-0.012
Token.
Token ID13
Feature activation+3.48e-03

Pos logit contributions
my+0.027
 next+0.025
 tidy+0.024
 over+0.023
 packed+0.023
Neg logit contributions
 conclusions-0.045
alks-0.043
oles-0.042
ocol-0.041
 Majesty-0.041
Token She
Token ID1375
Feature activation+2.47e-03

Pos logit contributions
 conclusions+0.010
 predators+0.010
ocol+0.010
 perspective+0.009
 Majesty+0.009
Neg logit contributions
my-0.002
 next-0.002
 over-0.002
 clean-0.002
 tidy-0.001
Token says
Token ID1139
Feature activation+3.81e-02

Pos logit contributions
 conclusions+0.006
alks+0.005
oles+0.005
ocol+0.005
 perspective+0.005
Neg logit contributions
my-0.003
 next-0.003
 tidy-0.003
 over-0.003
 clean-0.003
Token,
Token ID11
Feature activation+2.57e-02

Pos logit contributions
 conclusions+0.099
 perspective+0.093
alks+0.092
 predators+0.091
ocol+0.090
Neg logit contributions
 next-0.038
my-0.035
 over-0.034
 clean-0.032
 with-0.032
Token "
Token ID366
Feature activation+4.59e-03

Pos logit contributions
 conclusions+0.072
 perspective+0.067
ocol+0.066
 predators+0.065
alks+0.064
Neg logit contributions
 next-0.028
 over-0.026
my-0.024
 with-0.023
 took-0.022
TokenHey
Token ID10814
Feature activation+3.41e-02

Pos logit contributions
 conclusions+0.011
alks+0.010
oles+0.010
 perspective+0.010
ocol+0.010
Neg logit contributions
my-0.005
 tidy-0.005
 next-0.005
 over-0.005
 packed-0.004
Token,
Token ID11
Feature activation+2.49e-02

Pos logit contributions
 conclusions+0.082
alks+0.080
oles+0.074
ocol+0.074
 colony+0.074
Neg logit contributions
my-0.040
 next-0.039
 tidy-0.034
 clean-0.034
 pack-0.032
Token that
Token ID326
Feature activation+1.62e-02

Pos logit contributions
 conclusions+0.057
alks+0.057
oles+0.054
 perspective+0.053
ocol+0.053
Neg logit contributions
my-0.029
 next-0.027
 tidy-0.026
 clean-0.025
 over-0.024
Token's
Token ID338
Feature activation-1.35e-02

Pos logit contributions
 conclusions+0.040
alks+0.039
ocol+0.038
oles+0.038
 Majesty+0.037
Neg logit contributions
my-0.016
 next-0.014
 tidy-0.013
 over-0.013
 clean-0.012
Token ball
Token ID2613
Feature activation+2.47e-03

Pos logit contributions
 conclusions+0.020
ocol+0.020
oles+0.019
alks+0.019
 Majesty+0.019
Neg logit contributions
 next-0.014
my-0.014
 tidy-0.013
 clean-0.013
 over-0.012
Token and
Token ID290
Feature activation+1.66e-02

Pos logit contributions
 conclusions+0.006
alks+0.006
oles+0.006
ocol+0.006
 Majesty+0.006
Neg logit contributions
my-0.002
 next-0.002
 tidy-0.002
 packed-0.002
 clean-0.002
Token it
Token ID340
Feature activation+3.41e-03

Pos logit contributions
 conclusions+0.041
alks+0.040
ocol+0.040
oles+0.039
 Majesty+0.038
Neg logit contributions
my-0.017
 next-0.015
 over-0.015
 tidy-0.014
 clean-0.014
Token flew
Token ID13112
Feature activation-5.46e-03

Pos logit contributions
 conclusions+0.009
alks+0.009
oles+0.008
ocol+0.008
 Majesty+0.008
Neg logit contributions
my-0.003
 next-0.002
 tidy-0.002
 over-0.002
 packed-0.002
Token through
Token ID832
Feature activation+3.00e-02

Pos logit contributions
my+0.005
 next+0.005
 over+0.004
 tidy+0.004
 clean+0.004
Neg logit contributions
 conclusions-0.014
alks-0.014
oles-0.013
ocol-0.013
 Majesty-0.013
Token the
Token ID262
Feature activation+1.40e-02

Pos logit contributions
oles+0.100
 Majesty+0.097
ocol+0.096
ighters+0.094
alks+0.094
Neg logit contributions
 next-0.012
 with-0.009
 a-0.008
 over-0.008
my-0.007
Token air
Token ID1633
Feature activation+1.24e-02

Pos logit contributions
 conclusions+0.028
oles+0.027
ocol+0.027
alks+0.027
 Majesty+0.026
Neg logit contributions
my-0.020
 next-0.019
 tidy-0.019
 clean-0.018
 over-0.018
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
 conclusions+0.032
ocol+0.031
alks+0.030
 Majesty+0.030
oles+0.029
Neg logit contributions
 next-0.015
 over-0.014
 with-0.013
my-0.013
 made-0.012
Token He
Token ID679
Feature activation+1.06e-02

Pos logit contributions
 conclusions+0.056
ocol+0.054
 predators+0.052
oles+0.052
 perspective+0.052
Neg logit contributions
my-0.019
 over-0.017
 next-0.016
 clean-0.014
 tidy-0.014
Token was
Token ID373
Feature activation+1.31e-02

Pos logit contributions
 conclusions+0.030
ocol+0.029
alks+0.029
 Majesty+0.028
oles+0.028
Neg logit contributions
my-0.008
 next-0.007
 over-0.007
 made-0.007
 tidy-0.006
Token so
Token ID523
Feature activation+1.76e-04

Pos logit contributions
alks+0.027
 conclusions+0.027
 Majesty+0.026
oles+0.026
joy+0.025
Neg logit contributions
my-0.016
 next-0.015
 tidy-0.014
 packed-0.013
pack-0.013
Token okay
Token ID8788
Feature activation+2.39e-03

Pos logit contributions
my+0.012
 next+0.012
 packed+0.011
 tidy+0.011
pack+0.011
Neg logit contributions
 conclusions-0.021
alks-0.020
 Majesty-0.020
ocol-0.020
 predators-0.020
Token,
Token ID11
Feature activation+8.82e-03

Pos logit contributions
 conclusions+0.005
alks+0.005
ocol+0.005
oles+0.005
 Majesty+0.005
Neg logit contributions
my-0.003
 next-0.003
 tidy-0.003
 over-0.003
 clean-0.003
Token Tom
Token ID4186
Feature activation-2.98e-03

Pos logit contributions
 conclusions+0.019
alks+0.019
oles+0.018
ocol+0.018
 perspective+0.018
Neg logit contributions
my-0.011
 next-0.011
 tidy-0.010
 over-0.009
 clean-0.009
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
my+0.004
 next+0.004
 tidy+0.004
 over+0.004
 clean+0.004
Neg logit contributions
 conclusions-0.006
alks-0.006
ocol-0.006
oles-0.006
 fingertips-0.006
Token We
Token ID775
Feature activation+1.84e-02

Pos logit contributions
 conclusions+0.029
alks+0.028
oles+0.027
ocol+0.026
 perspective+0.026
Neg logit contributions
my-0.017
 next-0.015
 tidy-0.015
 over-0.014
 clean-0.014
Token are
Token ID389
Feature activation+6.82e-03

Pos logit contributions
 conclusions+0.043
alks+0.043
ocol+0.041
oles+0.041
 Majesty+0.040
Neg logit contributions
my-0.022
 next-0.019
 tidy-0.018
 over-0.018
 clean-0.017
Token glad
Token ID9675
Feature activation-4.54e-03

Pos logit contributions
 conclusions+0.015
alks+0.014
oles+0.014
 perspective+0.013
ocol+0.013
Neg logit contributions
my-0.009
 next-0.009
 tidy-0.009
 clean-0.008
 over-0.008
Token you
Token ID345
Feature activation+1.88e-02

Pos logit contributions
 next+0.004
my+0.004
 over+0.003
 clean+0.003
 tidy+0.003
Neg logit contributions
 conclusions-0.012
alks-0.012
ocol-0.012
oles-0.011
 perspective-0.011
Token are
Token ID389
Feature activation-7.62e-04

Pos logit contributions
 conclusions+0.041
alks+0.040
ocol+0.038
oles+0.038
 Majesty+0.037
Neg logit contributions
my-0.024
 next-0.022
 tidy-0.021
 over-0.021
 clean-0.021
Token safe
Token ID3338
Feature activation-1.72e-02

Pos logit contributions
my+0.001
 next+0.001
 tidy+0.001
 packed+0.001
 over+0.001
Neg logit contributions
 conclusions-0.002
alks-0.002
 Majesty-0.002
oles-0.002
ocol-0.002
Token.
Token ID13
Feature activation-6.38e-03

Pos logit contributions
 next+0.018
my+0.017
 over+0.016
 tidy+0.015
 clean+0.014
Neg logit contributions
 conclusions-0.043
alks-0.043
ocol-0.042
oles-0.040
 fingertips-0.040
Token big
Token ID1263
Feature activation-2.68e-02

Pos logit contributions
 conclusions+0.000
alks+0.000
ocol+0.000
oles+0.000
 Majesty+0.000
Neg logit contributions
my-0.000
 next-0.000
 tidy-0.000
 clean-0.000
 pack-0.000
Token bear
Token ID6842
Feature activation+1.07e-02

Pos logit contributions
my+0.039
 next+0.037
 tidy+0.035
 clean+0.034
 over+0.034
Neg logit contributions
 conclusions-0.053
alks-0.052
oles-0.049
ocol-0.049
 Majesty-0.048
Token who
Token ID508
Feature activation+3.39e-03

Pos logit contributions
 conclusions+0.025
ocol+0.024
alks+0.024
 Majesty+0.024
 fingertips+0.023
Neg logit contributions
my-0.014
 next-0.013
 with-0.012
 over-0.012
 clean-0.012
Token wanted
Token ID2227
Feature activation+2.15e-03

Pos logit contributions
 conclusions+0.007
alks+0.007
oles+0.006
 Majesty+0.006
 colony+0.006
Neg logit contributions
my-0.004
 next-0.004
 over-0.004
 tidy-0.004
 clean-0.004
Token to
Token ID284
Feature activation-2.50e-02

Pos logit contributions
 conclusions+0.005
alks+0.005
oles+0.005
ocol+0.005
 Majesty+0.005
Neg logit contributions
my-0.002
 next-0.002
 tidy-0.002
 over-0.002
 clean-0.002
Token take
Token ID1011
Feature activation-3.32e-03

Pos logit contributions
my+0.025
 next+0.022
 over+0.020
 tidy+0.019
 packed+0.019
Neg logit contributions
 conclusions-0.062
alks-0.059
oles-0.057
 Majesty-0.057
ocol-0.056
Token a
Token ID257
Feature activation+5.66e-03

Pos logit contributions
my+0.002
 next+0.002
 tidy+0.002
 over+0.002
 clean+0.002
Neg logit contributions
 conclusions-0.009
alks-0.009
oles-0.009
ocol-0.009
 Majesty-0.009
Token bath
Token ID7837
Feature activation-2.38e-02

Pos logit contributions
 conclusions+0.013
alks+0.012
oles+0.012
ocol+0.012
 Majesty+0.012
Neg logit contributions
my-0.007
 next-0.006
 tidy-0.006
 over-0.006
 packed-0.006
Token.
Token ID13
Feature activation-8.95e-04

Pos logit contributions
my+0.036
 next+0.034
 tidy+0.032
 over+0.031
 clean+0.031
Neg logit contributions
 conclusions-0.046
alks-0.044
oles-0.042
ocol-0.042
 Majesty-0.042
Token He
Token ID679
Feature activation-1.43e-02

Pos logit contributions
my+0.001
 next+0.001
 tidy+0.001
 over+0.001
 clean+0.001
Neg logit contributions
 conclusions-0.002
alks-0.002
ocol-0.002
oles-0.002
 Majesty-0.002
Token w
Token ID266
Feature activation+3.31e-02

Pos logit contributions
my+0.015
 next+0.013
 over+0.013
 tidy+0.012
pack+0.012
Neg logit contributions
 conclusions-0.030
alks-0.030
oles-0.030
 predators-0.030
 colony-0.029
Token you
Token ID345
Feature activation-6.51e-03

Pos logit contributions
my+0.001
 tidy+0.001
 next+0.001
 over+0.001
 packed+0.001
Neg logit contributions
alks-0.001
 conclusions-0.001
oles-0.001
ocol-0.001
 fingertips-0.001
Token create
Token ID2251
Feature activation-1.42e-02

Pos logit contributions
my+0.007
 next+0.006
 tidy+0.006
 over+0.006
 packed+0.006
Neg logit contributions
 conclusions-0.015
alks-0.015
oles-0.014
ocol-0.014
 Majesty-0.014
Token?"
Token ID1701
Feature activation-4.88e-03

Pos logit contributions
my+0.016
 packed+0.014
Close+0.014
 next+0.013
 tidy+0.013
Neg logit contributions
 conclusions-0.031
alks-0.030
ocol-0.029
oles-0.028
 Majesty-0.028
Token Lily
Token ID20037
Feature activation-9.93e-03

Pos logit contributions
my+0.005
 next+0.005
 tidy+0.005
 packed+0.005
 over+0.005
Neg logit contributions
 conclusions-0.011
alks-0.011
oles-0.010
 Majesty-0.010
ocol-0.010
Token smiled
Token ID13541
Feature activation-3.09e-02

Pos logit contributions
my+0.010
 next+0.009
 tidy+0.009
 over+0.008
 clean+0.008
Neg logit contributions
 conclusions-0.024
alks-0.023
oles-0.022
 Majesty-0.022
ocol-0.022
Token and
Token ID290
Feature activation-3.09e-03

Pos logit contributions
my+0.034
 packed+0.032
 tidy+0.031
pack+0.031
 next+0.030
Neg logit contributions
joy-0.065
alks-0.064
 conclusions-0.064
 Majesty-0.061
oles-0.061
Token said
Token ID531
Feature activation+3.43e-02

Pos logit contributions
my+0.003
 next+0.003
 tidy+0.003
 over+0.002
 clean+0.002
Neg logit contributions
alks-0.007
 conclusions-0.007
oles-0.007
 Majesty-0.007
ocol-0.007
Token,
Token ID11
Feature activation+7.02e-04

Pos logit contributions
 conclusions+0.088
alks+0.081
 predators+0.079
ocol+0.078
 perspective+0.078
Neg logit contributions
 next-0.037
my-0.037
 over-0.034
 with-0.032
 tidy-0.032
Token "
Token ID366
Feature activation-7.32e-03

Pos logit contributions
 conclusions+0.002
ocol+0.001
alks+0.001
 perspective+0.001
 predators+0.001
Neg logit contributions
 next-0.001
my-0.001
 over-0.001
 tidy-0.001
 clean-0.001
TokenI
Token ID40
Feature activation-1.95e-03

Pos logit contributions
my+0.007
 tidy+0.006
 next+0.006
 clean+0.006
 packed+0.006
Neg logit contributions
 conclusions-0.018
alks-0.017
oles-0.016
 perspective-0.016
 colony-0.016
Token can
Token ID460
Feature activation-8.16e-03

Pos logit contributions
my+0.003
 next+0.002
 tidy+0.002
 over+0.002
 clean+0.002
Neg logit contributions
 conclusions-0.004
alks-0.004
oles-0.004
ocol-0.004
 Majesty-0.004
Token the
Token ID262
Feature activation+9.81e-03

Pos logit contributions
my+0.034
pack+0.034
 next+0.034
 packed+0.034
 tidy+0.033
Neg logit contributions
 conclusions-0.059
alks-0.058
 perspective-0.057
 predators-0.055
 colony-0.054
Token right
Token ID826
Feature activation-6.34e-03

Pos logit contributions
 conclusions+0.018
alks+0.018
oles+0.017
ocol+0.017
 Majesty+0.016
Neg logit contributions
my-0.016
 next-0.015
 tidy-0.014
 over-0.014
 clean-0.014
Token places
Token ID4113
Feature activation+3.68e-03

Pos logit contributions
my+0.010
 next+0.009
 tidy+0.009
 clean+0.009
 packed+0.009
Neg logit contributions
 conclusions-0.012
alks-0.011
oles-0.010
 perspective-0.010
ocol-0.010
Token and
Token ID290
Feature activation+1.18e-02

Pos logit contributions
 conclusions+0.009
alks+0.009
 Majesty+0.008
ocol+0.008
oles+0.008
Neg logit contributions
my-0.004
 next-0.004
 over-0.004
 tidy-0.004
 clean-0.003
Token he
Token ID339
Feature activation+1.54e-02

Pos logit contributions
 conclusions+0.029
alks+0.029
ocol+0.028
oles+0.028
 Majesty+0.027
Neg logit contributions
my-0.012
 next-0.012
 over-0.011
 tidy-0.011
 packed-0.010
Token found
Token ID1043
Feature activation+1.08e-02

Pos logit contributions
 conclusions+0.039
alks+0.037
ocol+0.037
oles+0.034
 perspective+0.034
Neg logit contributions
my-0.018
 next-0.017
 packed-0.016
 tidy-0.015
 over-0.015
Token the
Token ID262
Feature activation+1.50e-03

Pos logit contributions
 conclusions+0.030
alks+0.029
oles+0.028
ocol+0.028
 Majesty+0.028
Neg logit contributions
 next-0.008
my-0.008
 over-0.008
 tidy-0.007
 clean-0.006
Token ancient
Token ID6156
Feature activation+2.08e-02

Pos logit contributions
 conclusions+0.003
alks+0.003
oles+0.003
ocol+0.003
 Majesty+0.003
Neg logit contributions
 next-0.002
my-0.002
 tidy-0.002
 over-0.001
 clean-0.001
Token victory
Token ID5373
Feature activation+1.57e-02

Pos logit contributions
 conclusions+0.040
alks+0.040
oles+0.039
ocol+0.039
ighters+0.037
Neg logit contributions
 next-0.032
my-0.029
 tidy-0.028
 over-0.027
 clean-0.027
Token he
Token ID339
Feature activation+5.91e-03

Pos logit contributions
 conclusions+0.036
alks+0.034
ocol+0.034
 Majesty+0.034
oles+0.032
Neg logit contributions
 next-0.019
my-0.019
 over-0.017
 tidy-0.017
 packed-0.016
Token was
Token ID373
Feature activation+1.44e-02

Pos logit contributions
 conclusions+0.016
alks+0.015
oles+0.015
ocol+0.015
 Majesty+0.014
Neg logit contributions
my-0.005
 next-0.005
 tidy-0.004
 packed-0.004
 over-0.004
Token As
Token ID1081
Feature activation-2.43e-02

Pos logit contributions
 conclusions+0.027
ocol+0.025
 Majesty+0.025
alks+0.025
oles+0.024
Neg logit contributions
my-0.013
 next-0.012
 tidy-0.012
 over-0.012
 clean-0.011
Token night
Token ID1755
Feature activation+6.64e-03

Pos logit contributions
 next+0.022
my+0.021
 tidy+0.019
 clean+0.018
 over+0.017
Neg logit contributions
 conclusions-0.065
alks-0.064
ocol-0.063
 Majesty-0.061
oles-0.061
Token started
Token ID2067
Feature activation-6.29e-03

Pos logit contributions
 conclusions+0.013
alks+0.013
oles+0.013
ocol+0.012
 perspective+0.012
Neg logit contributions
my-0.009
 next-0.008
 tidy-0.008
 clean-0.007
 over-0.007
Token to
Token ID284
Feature activation-2.35e-02

Pos logit contributions
 next+0.005
my+0.005
 over+0.005
 clean+0.004
 tidy+0.004
Neg logit contributions
 conclusions-0.017
ocol-0.016
 Majesty-0.016
alks-0.016
 perspective-0.016
Token fall
Token ID2121
Feature activation-1.37e-02

Pos logit contributions
my+0.026
 next+0.024
 tidy+0.023
 over+0.022
 packed+0.021
Neg logit contributions
 conclusions-0.055
alks-0.053
oles-0.051
ocol-0.051
 Majesty-0.050
Token,
Token ID11
Feature activation-2.37e-03

Pos logit contributions
my+0.017
 next+0.016
 over+0.016
 tidy+0.015
 clean+0.015
Neg logit contributions
 conclusions-0.030
alks-0.029
ocol-0.028
oles-0.028
 Majesty-0.028
Token the
Token ID262
Feature activation-2.31e-03

Pos logit contributions
 next+0.001
 a+0.001
 with+0.001
 clean+0.001
 made+0.001
Neg logit contributions
ocol-0.008
 conclusions-0.008
alks-0.007
 Majesty-0.007
 perspective-0.007
Token animals
Token ID4695
Feature activation+2.63e-02

Pos logit contributions
 next+0.003
my+0.002
 tidy+0.002
 clean+0.002
 over+0.002
Neg logit contributions
 conclusions-0.006
alks-0.006
ocol-0.006
oles-0.005
 Majesty-0.005
Token said
Token ID531
Feature activation+3.55e-02

Pos logit contributions
alks+0.071
ocol+0.070
 Majesty+0.068
 conclusions+0.068
joy+0.063
Neg logit contributions
my-0.027
 over-0.025
 took-0.025
 next-0.025
 made-0.024
Token goodbye
Token ID24829
Feature activation-9.32e-03

Pos logit contributions
 conclusions+0.093
alks+0.092
ocol+0.089
 perspective+0.086
 anth+0.085
Neg logit contributions
 next-0.035
 over-0.033
 with-0.030
 clean-0.030
 tidy-0.030
Token and
Token ID290
Feature activation+2.49e-03

Pos logit contributions
my+0.008
 next+0.008
 over+0.007
 tidy+0.006
 clean+0.006
Neg logit contributions
 conclusions-0.025
ocol-0.024
alks-0.024
oles-0.023
 Majesty-0.023
Token hand
Token ID1021
Feature activation-7.50e-03

Pos logit contributions
 conclusions+0.001
alks+0.001
ocol+0.001
 Majesty+0.001
oles+0.001
Neg logit contributions
my-0.001
 next-0.001
 over-0.001
 tidy-0.001
 packed-0.001
Token.
Token ID13
Feature activation+5.57e-03

Pos logit contributions
my+0.010
 next+0.010
 over+0.009
 tidy+0.009
 clean+0.009
Neg logit contributions
 conclusions-0.017
alks-0.016
ocol-0.016
 Majesty-0.016
oles-0.015
Token She
Token ID1375
Feature activation-6.09e-03

Pos logit contributions
 conclusions+0.014
ocol+0.013
oles+0.013
 Majesty+0.013
alks+0.013
Neg logit contributions
my-0.006
 over-0.006
 next-0.005
 tidy-0.005
 clean-0.005
Token felt
Token ID2936
Feature activation+3.45e-03

Pos logit contributions
my+0.006
 next+0.006
 tidy+0.005
 over+0.005
 clean+0.005
Neg logit contributions
 conclusions-0.013
alks-0.013
oles-0.013
 Majesty-0.013
 colony-0.013
Token so
Token ID523
Feature activation-7.14e-04

Pos logit contributions
 conclusions+0.007
alks+0.007
ocol+0.007
oles+0.007
 Majesty+0.007
Neg logit contributions
my-0.004
 next-0.004
 tidy-0.003
 packed-0.003
pack-0.003
Token safe
Token ID3338
Feature activation-3.29e-02

Pos logit contributions
my+0.001
 over+0.001
 next+0.001
man+0.001
Close+0.001
Neg logit contributions
alks-0.001
 conclusions-0.001
oles-0.001
 Majesty-0.001
ocol-0.001
Token and
Token ID290
Feature activation-1.20e-02

Pos logit contributions
my+0.036
 tidy+0.031
 next+0.030
 packed+0.029
 clean+0.028
Neg logit contributions
 conclusions-0.074
alks-0.073
oles-0.071
 Majesty-0.069
 perspective-0.069
Token happy
Token ID3772
Feature activation-1.02e-02

Pos logit contributions
my+0.016
pack+0.015
 over+0.014
Close+0.014
 next+0.014
Neg logit contributions
 conclusions-0.021
 fingertips-0.021
 Majesty-0.020
joy-0.020
 predators-0.020
Token that
Token ID326
Feature activation-7.58e-03

Pos logit contributions
 next+0.014
my+0.014
 over+0.013
 tidy+0.013
 clean+0.012
Neg logit contributions
 conclusions-0.022
alks-0.021
ocol-0.020
oles-0.020
 Majesty-0.020
Token they
Token ID484
Feature activation-5.17e-03

Pos logit contributions
my+0.008
 next+0.008
 tidy+0.007
 over+0.007
 clean+0.007
Neg logit contributions
 conclusions-0.018
alks-0.017
oles-0.017
ocol-0.017
 Majesty-0.016
Token were
Token ID547
Feature activation+2.92e-03

Pos logit contributions
my+0.007
 next+0.007
Close+0.006
 tidy+0.006
pack+0.006
Neg logit contributions
 conclusions-0.009
alks-0.009
 predators-0.009
oles-0.008
ighters-0.008
Token too
Token ID1165
Feature activation+2.67e-02

Pos logit contributions
my+0.009
 next+0.008
 tidy+0.008
 over+0.007
 clean+0.007
Neg logit contributions
 conclusions-0.016
alks-0.015
oles-0.015
ocol-0.014
 Majesty-0.014
Token.
Token ID13
Feature activation+1.08e-03

Pos logit contributions
 conclusions+0.075
growth+0.073
ocol+0.073
alks+0.072
 fingertips+0.069
Neg logit contributions
 next-0.024
 with-0.021
 clean-0.021
 tidy-0.021
my-0.019
Token Let
Token ID3914
Feature activation+6.79e-03

Pos logit contributions
 conclusions+0.002
alks+0.002
 Majesty+0.002
ocol+0.002
oles+0.002
Neg logit contributions
my-0.001
 next-0.001
 tidy-0.001
 packed-0.001
 over-0.001
Token's
Token ID338
Feature activation-3.94e-02

Pos logit contributions
 conclusions+0.017
alks+0.016
ocol+0.016
oles+0.016
 Majesty+0.016
Neg logit contributions
my-0.007
 next-0.006
 tidy-0.006
 over-0.006
 clean-0.005
Token put
Token ID1234
Feature activation-4.56e-02

Pos logit contributions
my+0.028
 next+0.027
 packed+0.024
Close+0.023
 over+0.022
Neg logit contributions
 conclusions-0.100
 Majesty-0.095
alks-0.093
 perspective-0.091
ocol-0.090
Token the
Token ID262
Feature activation-3.19e-02

Pos logit contributions
my+0.056
 packed+0.052
 tidy+0.051
 next+0.048
Close+0.047
Neg logit contributions
 conclusions-0.093
alks-0.090
ocol-0.090
 Majesty-0.085
joy-0.083
Token top
Token ID1353
Feature activation-2.97e-02

Pos logit contributions
my+0.048
 over+0.042
 tidy+0.042
 packed+0.042
 next+0.041
Neg logit contributions
 conclusions-0.059
alks-0.055
 Majesty-0.054
ocol-0.054
joy-0.053
Token back
Token ID736
Feature activation-2.95e-02

Pos logit contributions
my+0.027
Close+0.024
 tidy+0.024
 packed+0.023
 pack+0.023
Neg logit contributions
alks-0.066
 conclusions-0.065
oles-0.062
ocol-0.061
 fingertips-0.060
Token on
Token ID319
Feature activation+8.43e-04

Pos logit contributions
my+0.025
 tidy+0.021
 next+0.021
 packed+0.020
 clean+0.019
Neg logit contributions
 conclusions-0.073
alks-0.072
oles-0.069
ocol-0.069
 Majesty-0.068
Token the
Token ID262
Feature activation-1.19e-02

Pos logit contributions
oles+0.003
 conclusions+0.003
alks+0.002
 Majesty+0.002
 colony+0.002
Neg logit contributions
 next-0.001
 my-0.000
my-0.000
 over-0.000
 clean-0.000
Token box
Token ID3091
Feature activation-6.03e-03

Pos logit contributions
my+0.021
 next+0.019
 tidy+0.018
 over+0.018
 packed+0.018
Neg logit contributions
 conclusions-0.020
alks-0.019
oles-0.018
 Majesty-0.018
ocol-0.018
Token of
Token ID286
Feature activation-1.54e-02

Pos logit contributions
my+0.006
 next+0.005
 tidy+0.004
 over+0.004
 packed+0.004
Neg logit contributions
 conclusions-0.019
alks-0.018
oles-0.018
ocol-0.018
 perspective-0.017
Token fun
Token ID1257
Feature activation+6.04e-03

Pos logit contributions
my+0.011
 next+0.011
 tidy+0.010
 clean+0.010
 over+0.008
Neg logit contributions
alks-0.042
 conclusions-0.042
oles-0.041
 Majesty-0.040
ocol-0.040
Token.
Token ID13
Feature activation-6.28e-03

Pos logit contributions
alks+0.015
 Majesty+0.015
 conclusions+0.015
ocol+0.014
 fingertips+0.014
Neg logit contributions
 next-0.006
 with-0.006
my-0.006
 clean-0.005
 tidy-0.005
Token<|endoftext|>
Token ID50256
Feature activation-1.23e-02

Pos logit contributions
my+0.008
 next+0.008
 tidy+0.007
 over+0.007
 clean+0.007
Neg logit contributions
 conclusions-0.013
alks-0.013
oles-0.012
ocol-0.012
 Majesty-0.012
TokenOnce
Token ID7454
Feature activation-1.69e-02

Pos logit contributions
my+0.014
 next+0.013
 tidy+0.012
 over+0.012
 clean+0.011
Neg logit contributions
 conclusions-0.028
alks-0.027
oles-0.026
ocol-0.026
 Majesty-0.026
Token upon
Token ID2402
Feature activation-3.62e-02

Pos logit contributions
my+0.022
 next+0.020
 tidy+0.019
 over+0.019
 clean+0.018
Neg logit contributions
 conclusions-0.036
alks-0.035
oles-0.034
ocol-0.033
 Majesty-0.033
Token a
Token ID257
Feature activation+3.38e-03

Pos logit contributions
my+0.030
 next+0.026
 tidy+0.025
 packed+0.024
 over+0.024
Neg logit contributions
 conclusions-0.091
alks-0.083
oles-0.083
joy-0.082
 perspective-0.081
Token time
Token ID640
Feature activation+5.51e-03

Pos logit contributions
 conclusions+0.007
alks+0.007
oles+0.006
ocol+0.006
 Majesty+0.006
Neg logit contributions
my-0.005
 next-0.004
 tidy-0.004
 over-0.004
 packed-0.004
Token,
Token ID11
Feature activation-2.05e-02

Pos logit contributions
 conclusions+0.011
alks+0.011
ocol+0.010
oles+0.010
 Majesty+0.010
Neg logit contributions
my-0.008
 next-0.008
 tidy-0.007
 over-0.007
 clean-0.007
Token there
Token ID612
Feature activation+3.08e-03

Pos logit contributions
my+0.022
 packed+0.021
 tidy+0.021
Close+0.021
pack+0.021
Neg logit contributions
oles-0.040
 conclusions-0.039
joy-0.037
 perspective-0.036
alks-0.036
Token was
Token ID373
Feature activation-3.12e-02

Pos logit contributions
 conclusions+0.008
alks+0.008
oles+0.008
ocol+0.008
 perspective+0.008
Neg logit contributions
my-0.002
 next-0.002
 tidy-0.002
 packed-0.002
 over-0.002
TokenMom
Token ID29252
Feature activation-2.47e-02

Pos logit contributions
my+0.006
 next+0.006
 tidy+0.005
 over+0.005
 packed+0.005
Neg logit contributions
 conclusions-0.013
alks-0.012
oles-0.012
ocol-0.012
 perspective-0.012
Tokenmy
Token ID1820
Feature activation-6.11e-03

Pos logit contributions
 next+0.005
 tidy+0.005
 pack+0.004
Close+0.003
pack+0.002
Neg logit contributions
alks-0.070
oles-0.069
 Majesty-0.068
 conclusions-0.067
 colony-0.067
Token was
Token ID373
Feature activation+1.67e-03

Pos logit contributions
my+0.006
 next+0.006
 tidy+0.005
 pack+0.005
 clean+0.005
Neg logit contributions
alks-0.014
oles-0.013
 conclusions-0.013
 Majesty-0.013
 colony-0.013
Token very
Token ID845
Feature activation-5.33e-03

Pos logit contributions
 conclusions+0.003
alks+0.003
 Majesty+0.003
joy+0.003
ocol+0.003
Neg logit contributions
my-0.002
 next-0.002
Close-0.002
 tidy-0.002
 pack-0.002
Token pleased
Token ID10607
Feature activation-2.96e-02

Pos logit contributions
my+0.008
 next+0.007
 over+0.007
 packed+0.007
Close+0.006
Neg logit contributions
alks-0.010
 conclusions-0.010
oles-0.010
 Majesty-0.009
ocol-0.009
Token,
Token ID11
Feature activation-3.47e-02

Pos logit contributions
my+0.025
 tidy+0.021
 packed+0.021
 pack+0.020
 next+0.019
Neg logit contributions
alks-0.073
 conclusions-0.073
oles-0.072
joy-0.070
 perspective-0.069
Token and
Token ID290
Feature activation-2.39e-02

Pos logit contributions
my+0.038
 next+0.034
 tidy+0.033
pack+0.033
 packed+0.031
Neg logit contributions
 conclusions-0.076
alks-0.076
joy-0.073
oles-0.073
ocol-0.071
Token said
Token ID531
Feature activation+2.00e-02

Pos logit contributions
my+0.029
 next+0.026
pack+0.024
 tidy+0.024
 pack+0.024
Neg logit contributions
alks-0.049
oles-0.048
 conclusions-0.048
 Majesty-0.047
 colony-0.046
Token that
Token ID326
Feature activation-5.70e-03

Pos logit contributions
 conclusions+0.054
alks+0.050
 perspective+0.050
 predators+0.049
ocol+0.048
Neg logit contributions
 next-0.018
 clean-0.016
 tidy-0.016
 over-0.016
my-0.016
Token from
Token ID422
Feature activation-1.65e-02

Pos logit contributions
my+0.006
 next+0.006
 tidy+0.006
 over+0.005
 clean+0.005
Neg logit contributions
 conclusions-0.013
alks-0.013
ocol-0.013
 Majesty-0.013
oles-0.013
Token now
Token ID783
Feature activation-1.30e-03

Pos logit contributions
my+0.017
 next+0.015
 tidy+0.014
 over+0.014
 packed+0.014
Neg logit contributions
 conclusions-0.040
alks-0.039
ocol-0.037
oles-0.037
 Majesty-0.037
Token her
Token ID607
Feature activation-1.57e-02

Pos logit contributions
alks+0.081
oles+0.077
 conclusions+0.076
 Majesty+0.075
 fingertips+0.074
Neg logit contributions
my-0.013
 next-0.011
 over-0.010
 clean-0.007
 with-0.007
Token to
Token ID284
Feature activation-1.01e-02

Pos logit contributions
my+0.018
 next+0.016
 tidy+0.015
 over+0.015
 packed+0.015
Neg logit contributions
 conclusions-0.036
alks-0.035
oles-0.034
ocol-0.034
 Majesty-0.034
Token help
Token ID1037
Feature activation-1.97e-03

Pos logit contributions
my+0.012
 next+0.011
 packed+0.010
 over+0.010
 tidy+0.010
Neg logit contributions
 conclusions-0.021
alks-0.020
ocol-0.020
 fingertips-0.020
 Majesty-0.019
Token with
Token ID351
Feature activation-5.31e-03

Pos logit contributions
my+0.002
 next+0.002
 tidy+0.002
 clean+0.002
 over+0.002
Neg logit contributions
 conclusions-0.005
 Majesty-0.005
alks-0.005
joy-0.005
oles-0.005
Token the
Token ID262
Feature activation-1.48e-02

Pos logit contributions
my+0.004
 next+0.004
 tidy+0.003
 clean+0.003
 over+0.003
Neg logit contributions
 conclusions-0.015
oles-0.015
alks-0.015
 Majesty-0.014
 perspective-0.014
Token laundry
Token ID25724
Feature activation-3.92e-02

Pos logit contributions
my+0.022
 next+0.021
 tidy+0.020
 clean+0.019
 over+0.019
Neg logit contributions
 conclusions-0.030
alks-0.029
oles-0.028
 Majesty-0.028
ocol-0.027
Token.
Token ID13
Feature activation-2.02e-03

Pos logit contributions
my+0.058
 next+0.055
 tidy+0.053
 over+0.052
 clean+0.051
Neg logit contributions
 conclusions-0.078
alks-0.074
ocol-0.071
oles-0.071
 Majesty-0.071
Token She
Token ID1375
Feature activation-2.58e-02

Pos logit contributions
my+0.002
 tidy+0.002
 next+0.002
 clean+0.001
 over+0.001
Neg logit contributions
 conclusions-0.006
oles-0.005
alks-0.005
ocol-0.005
 predators-0.005
Token showed
Token ID3751
Feature activation-1.71e-03

Pos logit contributions
my+0.027
 next+0.025
 tidy+0.025
 pack+0.025
 over+0.024
Neg logit contributions
 colony-0.049
oles-0.049
alks-0.048
 anth-0.047
 Majesty-0.046
Token Lily
Token ID20037
Feature activation-3.43e-02

Pos logit contributions
my+0.002
 next+0.001
 tidy+0.001
 over+0.001
 clean+0.001
Neg logit contributions
 conclusions-0.004
alks-0.004
oles-0.004
ocol-0.004
 Majesty-0.004
Token how
Token ID703
Feature activation+1.02e-02

Pos logit contributions
my+0.043
 next+0.040
 packed+0.039
 tidy+0.038
Close+0.037
Neg logit contributions
 conclusions-0.066
alks-0.065
oles-0.063
 colony-0.062
 Majesty-0.061