TOP ACTIVATIONS
MAX = 6.867

on , whenever they wanted av oc ados , they would
something new ! She thought av oc ados were the most
something new ! She thought av oc ados were the most
girl who didn 't like av oc ados . She always
the tree if they were av oc ados , and they
away . She learned that av oc ados are good for
said , " I love av oc ados ! They are
tree could see lots of av oc ados that the supplier
B ongo loved to eat av oc ados . One day
ongo didn 't like wet av oc ados , so he
. She loved to eat av oc ados because they were
. Did you know that av oc ados are very healthy
says , " They are av oc ados . My mom
the tree to its once former beauty . He was happy
happy that it produced more av oc ados than ever before
, gir aff es and ze br as . They took
Tim my loved to eat av oc ados , they were
like monkeys and elephants and gir aff es . They had
day on , Lily loved av oc ados and would ask
continued to enjoy eating her av oc ados with a smile

MIN ACTIVATIONS
MIN = -5.232

. He put the soft end into the lock and pushed
the plank and put one end on the barrel . He
him the ice and the ice cream . She put the
. But I dropped the dirty dish . I 'm sorry
lady gave her a delicious ice cream . Sandy was so
hit him with the wet end . Sam was startled and
Mia a slice and a glass of milk . They sit
let 's clean up the ice and make a new cake
cream . She put the ice on his toe .
stick and put the other end near the food . When
He put the point y end in his mouth and l
and Tom . He shares his other snacks with them .
they could share the y ummy pasta together ! <|endoftext|> Once
. They sit on each end . They push with their
happy . She wrapped the rest of the mel on in
Tom says . He fills the pan with water and puts
naughty . You wasted the expensive cream ." Anna
. She picks up a clean dish and puts it in
a man with a big ice cream truck . Lily 's
Then she put a special cleaner on it and scrub bed

INTERVAL 3.842 - 6.867
CONTAINS 0.003%

. They make rings and brace lets and neck l aces
were always together hopping and ch ir ping .
like monkeys and elephants and gir aff es . They had
imes . He thought l imes were the perfect fruit .
continue exploring . He saw ze br as and gir aff

INTERVAL 0.818 - 3.842
CONTAINS 19.038%

" Tom , are you okay ?" Sue said .
went deeper into the woods , she began to hear some
the east . <|endoftext|> Once upon a time , there was
. Suddenly , Pat sne ez ed really loudly and really
had a lot of fun . Sometimes ,

INTERVAL -2.207 - 0.818
CONTAINS 79.735%

, he found a big tree in the park . He
say they hope her leg will heal soon .
ducks bit her face . It hurt a lot . Sara
waited for it to steam . Then , her mom asked
Tim my was so excited to hear that he would have

INTERVAL -5.232 - -2.207
CONTAINS 1.223%

ily liked to cook . She had a small kitchen with
there was a little boy named Tim my . Tim my
. They agreed to take turns and play nicely .
I can share a way to play with the other
cup of juice . Tom drinks the juice . He says
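
Note on the INTERVAL ranges above: the four ranges look like equal-width bins spanning MIN = -5.232 to MAX = 6.867, and the four CONTAINS percentages sum to roughly 100%. The sketch below reproduces the bin edges under that assumption. The helper name `equal_width_edges` is hypothetical and not part of the dashboard, and because the displayed MIN and MAX are already rounded, recomputed edges can differ from the displayed ones in the last digit.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi (assumed binning scheme)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as displayed in the dump (rounded to three decimals).
edges = equal_width_edges(-5.232, 6.867, 4)

# CONTAINS percentages from the dump, ordered from the lowest bin to the highest.
shares = [1.223, 79.735, 19.038, 0.003]

# Pair each consecutive (low, high) edge with its share of activations.
for (low, high), share in zip(zip(edges, edges[1:]), shares):
    print(f"{low:.3f} to {high:.3f}: {share}%")

total = sum(shares)
print(f"total: {total:.3f}%")
```

Read this way, about 80% of activations fall in the bin around zero, and only 0.003% reach the top bin.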
Token on
Token ID319
Feature activation+2.75e+00

Pos logit contributions
ishly+0.017
 argued+0.014
 course+0.014
 practice+0.013
 advantage+0.013
Neg logit contributions
eddy-0.009
 pans-0.009
ucer-0.008
loo-0.008
izz-0.008
Token,
Token ID11
Feature activation-3.26e-01

Pos logit contributions
ishly+0.075
 argued+0.067
 echoing+0.067
vious+0.064
 advantage+0.064
Neg logit contributions
 pans-0.060
eddy-0.056
oe-0.055
 flour-0.052
kins-0.051
Token whenever
Token ID8797
Feature activation+1.05e+00

Pos logit contributions
eddy+0.005
 pans+0.005
oe+0.004
emo+0.004
 glass+0.004
Neg logit contributions
ishly-0.012
 argued-0.010
 strides-0.010
 echoing-0.010
 advantage-0.010
Token they
Token ID484
Feature activation+1.26e+00

Pos logit contributions
ishly+0.040
 argued+0.031
 echoing+0.031
 advantage+0.030
 course+0.030
Neg logit contributions
 pans-0.016
eddy-0.015
 candle-0.014
 glass-0.013
 salt-0.013
Token wanted
Token ID2227
Feature activation+1.26e+00

Pos logit contributions
ishly+0.044
 echoing+0.035
 advantage+0.033
 antics+0.032
 strides+0.032
Neg logit contributions
 pans-0.022
 lad-0.022
loo-0.020
 glass-0.019
izz-0.019
Token av
Token ID1196
Feature activation+6.87e+00

Pos logit contributions
ishly+0.044
 echoing+0.036
 lear+0.036
 argued+0.034
rified+0.034
Neg logit contributions
 pans-0.022
 flour-0.021
 salt-0.020
 glass-0.019
eddy-0.019
Tokenoc
Token ID420
Feature activation+2.74e+00

Pos logit contributions
ishly+0.160
 lear+0.116
 echoing+0.116
 argued+0.114
 compete+0.107
Neg logit contributions
eddy-0.206
 pans-0.195
ucer-0.175
emo-0.173
 flour-0.168
Tokenados
Token ID22484
Feature activation+4.11e+00

Pos logit contributions
 echoing+0.076
ishly+0.076
 rather+0.071
 practice+0.070
 lear+0.069
Neg logit contributions
eddy-0.056
 pans-0.052
ble-0.045
ucks-0.045
 flour-0.045
Token,
Token ID11
Feature activation+1.18e-01

Pos logit contributions
ishly+0.137
 lear+0.125
rified+0.116
 antics+0.114
 echoing+0.113
Neg logit contributions
oe-0.062
 pans-0.060
tub-0.059
eddy-0.058
kins-0.056
Token they
Token ID484
Feature activation+4.77e-01

Pos logit contributions
ishly+0.005
 advantage+0.004
 argued+0.004
 course+0.004
 strides+0.004
Neg logit contributions
 pans-0.002
eddy-0.001
 flour-0.001
 salt-0.001
 glass-0.001
Token would
Token ID561
Feature activation-1.12e-01

Pos logit contributions
ishly+0.015
 strides+0.013
 advantage+0.013
 antics+0.013
 unnecessary+0.013
Neg logit contributions
 pans-0.008
 lad-0.008
 candle-0.008
emo-0.007
ull-0.007
Token something
Token ID1223
Feature activation+1.18e+00

Pos logit contributions
eddy+0.004
 flour+0.004
 salt+0.004
 pans+0.004
ble+0.004
Neg logit contributions
ishly-0.011
 argued-0.010
 lear-0.009
 antics-0.009
nered-0.009
Token new
Token ID649
Feature activation+2.00e-01

Pos logit contributions
ishly+0.042
 argued+0.038
 lear+0.035
 strides+0.034
 course+0.033
Neg logit contributions
eddy-0.020
 pans-0.020
ble-0.016
 flour-0.016
 candle-0.015
Token!
Token ID0
Feature activation+7.43e-01

Pos logit contributions
ishly+0.007
 argued+0.006
nered+0.006
cient+0.006
 strides+0.006
Neg logit contributions
eddy-0.003
 room-0.003
ble-0.002
loo-0.002
ucer-0.002
Token She
Token ID1375
Feature activation-8.72e-01

Pos logit contributions
ishly+0.021
 course+0.017
 argued+0.017
 practice+0.017
 advantage+0.016
Neg logit contributions
eddy-0.017
 pans-0.016
ucer-0.015
emo-0.015
izz-0.014
Token thought
Token ID1807
Feature activation-3.08e-01

Pos logit contributions
eddy+0.017
 pans+0.016
 lad+0.016
emo+0.015
onna+0.015
Neg logit contributions
ishly-0.026
 advantage-0.023
 strides-0.022
rified-0.022
 unnecessary-0.021
Token av
Token ID1196
Feature activation+6.77e+00

Pos logit contributions
eddy+0.005
 pans+0.005
 flour+0.005
 salt+0.005
onna+0.005
Neg logit contributions
ishly-0.011
 argued-0.009
 course-0.008
 advantage-0.008
nered-0.008
Tokenoc
Token ID420
Feature activation+2.90e+00

Pos logit contributions
ishly+0.150
 argued+0.102
 lear+0.101
 echoing+0.100
 course+0.097
Neg logit contributions
eddy-0.209
 pans-0.202
ucer-0.187
emo-0.181
izz-0.178
Tokenados
Token ID22484
Feature activation+3.87e+00

Pos logit contributions
ishly+0.074
 echoing+0.068
 practice+0.067
 argued+0.065
 rather+0.064
Neg logit contributions
eddy-0.071
 pans-0.062
ble-0.061
emo-0.058
loo-0.056
Token were
Token ID547
Feature activation+5.49e-01

Pos logit contributions
ishly+0.116
 lear+0.107
 advantage+0.102
 strides+0.101
 antics+0.099
Neg logit contributions
 pans-0.067
kins-0.066
ucer-0.065
eddy-0.065
ble-0.064
Token the
Token ID262
Feature activation-1.71e-01

Pos logit contributions
ishly+0.017
 lear+0.014
 argued+0.014
 strides+0.013
rified+0.013
Neg logit contributions
 pans-0.012
 flour-0.011
eddy-0.010
 salt-0.010
ble-0.010
Token most
Token ID749
Feature activation+1.14e+00

Pos logit contributions
 pans+0.003
 flour+0.003
eddy+0.003
 candle+0.003
 salt+0.003
Neg logit contributions
ishly-0.006
 argued-0.005
ted-0.004
 strides-0.004
vered-0.004
Token something
Token ID1223
Feature activation+1.08e+00

Pos logit contributions
eddy+0.008
ble+0.007
 pans+0.007
 flour+0.007
 salt+0.007
Neg logit contributions
ishly-0.020
 argued-0.017
 lear-0.016
 antics-0.016
nered-0.016
Token new
Token ID649
Feature activation-1.65e-01

Pos logit contributions
ishly+0.038
 argued+0.034
 lear+0.031
 strides+0.030
 course+0.030
Neg logit contributions
eddy-0.019
 pans-0.019
ble-0.016
 flour-0.015
 candle-0.015
Token!
Token ID0
Feature activation+5.66e-01

Pos logit contributions
eddy+0.003
 room+0.002
ble+0.002
loo+0.002
ucer+0.002
Neg logit contributions
ishly-0.006
 argued-0.005
nered-0.005
 strides-0.005
cient-0.005
Token She
Token ID1375
Feature activation-1.06e+00

Pos logit contributions
ishly+0.016
 course+0.013
 argued+0.013
 practice+0.012
 advantage+0.012
Neg logit contributions
eddy-0.014
 pans-0.013
ucer-0.012
emo-0.012
izz-0.012
Token thought
Token ID1807
Feature activation-4.51e-01

Pos logit contributions
eddy+0.022
 pans+0.021
 lad+0.020
emo+0.020
onna+0.019
Neg logit contributions
ishly-0.032
 advantage-0.027
 strides-0.026
rified-0.025
 echoing-0.025
Token av
Token ID1196
Feature activation+6.51e+00

Pos logit contributions
eddy+0.008
 pans+0.008
 flour+0.008
ble+0.008
onna+0.007
Neg logit contributions
ishly-0.015
 argued-0.013
 course-0.011
 advantage-0.011
nered-0.011
Tokenoc
Token ID420
Feature activation+2.45e+00

Pos logit contributions
ishly+0.137
 argued+0.092
 course+0.091
 lear+0.091
 echoing+0.089
Neg logit contributions
eddy-0.208
 pans-0.201
ucer-0.186
emo-0.182
izz-0.178
Tokenados
Token ID22484
Feature activation+3.60e+00

Pos logit contributions
ishly+0.060
 practice+0.051
 echoing+0.051
 argued+0.050
 course+0.049
Neg logit contributions
eddy-0.064
 pans-0.059
ble-0.056
emo-0.055
loo-0.053
Token were
Token ID547
Feature activation+3.04e-01

Pos logit contributions
ishly+0.109
 lear+0.100
 advantage+0.097
 strides+0.095
 antics+0.093
Neg logit contributions
 pans-0.061
kins-0.060
eddy-0.059
ucer-0.059
tub-0.058
Token the
Token ID262
Feature activation-3.55e-01

Pos logit contributions
ishly+0.010
 lear+0.008
 argued+0.008
 strides+0.007
rified+0.007
Neg logit contributions
 pans-0.007
 flour-0.006
eddy-0.006
 salt-0.006
ble-0.006
Token most
Token ID749
Feature activation+9.93e-01

Pos logit contributions
 pans+0.007
eddy+0.007
 flour+0.006
 candle+0.006
ble+0.006
Neg logit contributions
ishly-0.012
 argued-0.009
 strides-0.009
ted-0.009
vered-0.009
Token girl
Token ID2576
Feature activation-1.46e+00

Pos logit contributions
ishly+0.004
 course+0.004
 lear+0.003
 practice+0.003
 competition+0.003
Neg logit contributions
 pans-0.003
eddy-0.003
izz-0.003
emo-0.003
ucer-0.003
Token who
Token ID508
Feature activation-8.01e-01

Pos logit contributions
 pans+0.047
eddy+0.046
ucer+0.043
emo+0.043
ble+0.042
Neg logit contributions
ishly-0.028
 course-0.021
 practice-0.020
 argued-0.019
 lear-0.019
Token didn
Token ID1422
Feature activation+2.50e-01

Pos logit contributions
eddy+0.026
 pans+0.026
ucer+0.023
emo+0.023
ble+0.023
Neg logit contributions
ishly-0.016
 course-0.012
 practice-0.011
 argued-0.011
 lear-0.011
Token't
Token ID470
Feature activation-6.46e-01

Pos logit contributions
ishly+0.006
 practice+0.004
 course+0.004
 rather+0.004
 advantage+0.004
Neg logit contributions
 pans-0.008
eddy-0.007
ucer-0.007
emo-0.006
loo-0.006
Token like
Token ID588
Feature activation+6.99e-01

Pos logit contributions
eddy+0.022
 pans+0.021
ucer+0.020
loo+0.019
emo+0.019
Neg logit contributions
ishly-0.011
 practice-0.009
 course-0.009
 argued-0.009
 lear-0.008
Token av
Token ID1196
Feature activation+6.36e+00

Pos logit contributions
ishly+0.022
 course+0.017
 advantage+0.017
 argued+0.017
 lear+0.016
Neg logit contributions
 pans-0.015
eddy-0.015
 flour-0.013
 candle-0.013
 glass-0.013
Tokenoc
Token ID420
Feature activation+2.27e+00

Pos logit contributions
ishly+0.134
 course+0.093
 argued+0.091
 lear+0.089
 practice+0.087
Neg logit contributions
eddy-0.199
 pans-0.197
ucer-0.178
emo-0.175
ble-0.172
Tokenados
Token ID22484
Feature activation+2.90e+00

Pos logit contributions
ishly+0.064
 echoing+0.060
 rather+0.058
 believe+0.056
 weren+0.056
Neg logit contributions
eddy-0.046
 pans-0.042
loo-0.038
ble-0.038
 flour-0.037
Token.
Token ID13
Feature activation+5.44e-02

Pos logit contributions
ishly+0.115
 lear+0.100
 advantage+0.095
 strides+0.094
alled+0.094
Neg logit contributions
 pans-0.033
ble-0.032
 glass-0.032
 room-0.031
eddy-0.030
Token She
Token ID1375
Feature activation-1.11e+00

Pos logit contributions
ishly+0.001
 course+0.001
 argued+0.001
 practice+0.001
 lear+0.001
Neg logit contributions
eddy-0.001
 pans-0.001
ucer-0.001
emo-0.001
izz-0.001
Token always
Token ID1464
Feature activation-2.64e-01

Pos logit contributions
 pans+0.026
eddy+0.025
emo+0.023
izz+0.022
 lad+0.022
Neg logit contributions
ishly-0.032
 course-0.025
 advantage-0.025
 strides-0.024
 echoing-0.024
Token the
Token ID262
Feature activation+2.01e+00

Pos logit contributions
ishly+0.039
vious+0.037
 echoing+0.037
beat+0.036
ited+0.036
Neg logit contributions
 flour-0.014
eddy-0.014
ble-0.014
 Sara-0.013
 candle-0.013
Token tree
Token ID5509
Feature activation+2.92e+00

Pos logit contributions
ishly+0.065
 argued+0.054
 echoing+0.052
 practice+0.050
 advantage+0.050
Neg logit contributions
 pans-0.042
eddy-0.036
 flour-0.035
ble-0.034
 lad-0.034
Token if
Token ID611
Feature activation+9.56e-01

Pos logit contributions
ishly+0.079
 argued+0.071
 course+0.071
 lear+0.068
 practice+0.067
Neg logit contributions
eddy-0.062
 pans-0.059
loo-0.057
tub-0.055
ble-0.054
Token they
Token ID484
Feature activation+1.89e+00

Pos logit contributions
ishly+0.032
 echoing+0.028
 advantage+0.027
 antics+0.027
 strides+0.027
Neg logit contributions
 pans-0.016
 salt-0.014
 flour-0.014
 candle-0.013
eddy-0.013
Token were
Token ID547
Feature activation+1.60e+00

Pos logit contributions
ishly+0.064
 advantage+0.050
 course+0.050
 antics+0.050
 lear+0.050
Neg logit contributions
 pans-0.033
 lad-0.032
izz-0.031
 glass-0.029
 salt-0.029
Token av
Token ID1196
Feature activation+5.97e+00

Pos logit contributions
ishly+0.054
 course+0.045
 lear+0.044
 argued+0.042
 practice+0.041
Neg logit contributions
 pans-0.032
ble-0.027
eddy-0.026
 flour-0.026
izz-0.026
Tokenoc
Token ID420
Feature activation+2.59e+00

Pos logit contributions
ishly+0.133
 echoing+0.094
 argued+0.092
 course+0.091
 lear+0.091
Neg logit contributions
eddy-0.182
 pans-0.177
ucer-0.161
emo-0.153
izz-0.151
Tokenados
Token ID22484
Feature activation+3.54e+00

Pos logit contributions
ishly+0.071
 echoing+0.071
 practice+0.068
 rather+0.067
 believe+0.065
Neg logit contributions
eddy-0.053
 pans-0.050
loo-0.043
ucks-0.043
ble-0.042
Token,
Token ID11
Feature activation+6.16e-01

Pos logit contributions
ishly+0.106
 lear+0.104
 advantage+0.101
 antics+0.099
 strides+0.098
Neg logit contributions
 pans-0.056
eddy-0.053
tub-0.052
ble-0.051
oe-0.050
Token and
Token ID290
Feature activation+7.63e-01

Pos logit contributions
ishly+0.019
 advantage+0.016
 argued+0.016
 lear+0.016
ued+0.016
Neg logit contributions
 pans-0.012
 flour-0.011
 salt-0.010
 lad-0.010
 glass-0.010
Token they
Token ID484
Feature activation+9.15e-01

Pos logit contributions
ishly+0.025
opl+0.023
 advantage+0.023
istent+0.022
 strides+0.022
Neg logit contributions
 pans-0.013
 flour-0.011
 lad-0.010
 salt-0.010
 glass-0.010
Token away
Token ID1497
Feature activation-8.46e-01

Pos logit contributions
eddy+0.024
 pans+0.024
oe+0.022
ucer+0.021
emo+0.021
Neg logit contributions
ishly-0.026
 course-0.020
 argued-0.020
 lear-0.019
 weren-0.019
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
onna+0.013
 pans+0.013
eddy+0.013
ble+0.013
 late+0.012
Neg logit contributions
ishly-0.028
 course-0.026
ercise-0.025
 argued-0.024
 practice-0.023
Token She
Token ID1375
Feature activation-2.61e-01

Pos logit contributions
ishly+0.004
 course+0.003
 argued+0.003
 practice+0.003
 lear+0.003
Neg logit contributions
eddy-0.004
 pans-0.004
ucer-0.003
emo-0.003
izz-0.003
Token learned
Token ID4499
Feature activation-4.77e-01

Pos logit contributions
eddy+0.005
 pans+0.005
emo+0.005
 lad+0.005
izz+0.004
Neg logit contributions
ishly-0.008
 advantage-0.007
 strides-0.007
 compete-0.006
 course-0.006
Token that
Token ID326
Feature activation-5.69e-01

Pos logit contributions
eddy+0.011
 pans+0.010
ble+0.009
emo+0.009
 flour+0.009
Neg logit contributions
ishly-0.014
 argued-0.012
 strides-0.011
 compete-0.011
 lear-0.011
Token av
Token ID1196
Feature activation+5.84e+00

Pos logit contributions
eddy+0.014
 pans+0.014
emo+0.012
ble+0.012
 flour+0.012
Neg logit contributions
ishly-0.016
 course-0.012
 argued-0.012
 lear-0.012
 advantage-0.011
Tokenoc
Token ID420
Feature activation+1.31e+00

Pos logit contributions
ishly+0.120
 argued+0.081
 course+0.081
 lear+0.079
 echoing+0.077
Neg logit contributions
eddy-0.188
 pans-0.183
ucer-0.170
emo-0.167
izz-0.163
Tokenados
Token ID22484
Feature activation+2.89e+00

Pos logit contributions
ishly+0.032
 echoing+0.027
 practice+0.026
 argued+0.026
 lear+0.026
Neg logit contributions
eddy-0.035
 pans-0.033
ble-0.030
emo-0.030
loo-0.029
Token are
Token ID389
Feature activation+5.67e-01

Pos logit contributions
ishly+0.086
 lear+0.073
 echoing+0.067
 antics+0.066
 strides+0.066
Neg logit contributions
 pans-0.060
ble-0.056
eddy-0.055
ucer-0.053
tub-0.053
Token good
Token ID922
Feature activation+2.13e-01

Pos logit contributions
ishly+0.018
 argued+0.014
 lear+0.014
 course+0.014
 strides+0.014
Neg logit contributions
 pans-0.012
eddy-0.011
 flour-0.010
ble-0.010
emo-0.010
Token for
Token ID329
Feature activation+7.27e-01

Pos logit contributions
ishly+0.007
 course+0.006
 echoing+0.005
 compete+0.005
 argued+0.005
Neg logit contributions
eddy-0.004
 pans-0.004
emo-0.004
 salt-0.003
oys-0.003
Token said
Token ID531
Feature activation+1.28e+00

Pos logit contributions
ishly+0.031
 advantage+0.029
 strides+0.028
 lear+0.028
 unnecessary+0.027
Neg logit contributions
 lad-0.021
 pans-0.020
emo-0.020
eddy-0.019
ble-0.018
Token,
Token ID11
Feature activation+1.56e+00

Pos logit contributions
ishly+0.037
 course+0.034
opl+0.034
 strides+0.034
 habitats+0.033
Neg logit contributions
ble-0.021
 pans-0.020
emo-0.020
eddy-0.019
 lad-0.019
Token "
Token ID366
Feature activation+1.14e+00

Pos logit contributions
ishly+0.053
 course+0.049
 advantage+0.049
vious+0.047
 practice+0.047
Neg logit contributions
 pans-0.021
eddy-0.019
 lad-0.018
ble-0.017
emo-0.017
TokenI
Token ID40
Feature activation+4.03e-01

Pos logit contributions
ishly+0.037
 course+0.030
 argued+0.029
 practice+0.029
 lear+0.029
Neg logit contributions
eddy-0.023
 pans-0.023
emo-0.020
ucer-0.020
izz-0.019
Token love
Token ID1842
Feature activation-1.75e+00

Pos logit contributions
ishly+0.012
 strides+0.009
 course+0.009
 echoing+0.009
alled+0.009
Neg logit contributions
onna-0.009
emo-0.009
eddy-0.009
 lad-0.008
izz-0.008
Token av
Token ID1196
Feature activation+5.74e+00

Pos logit contributions
 pans+0.036
eddy+0.036
emo+0.031
 flour+0.030
ble+0.030
Neg logit contributions
ishly-0.057
 echoing-0.045
 argued-0.044
 lear-0.044
 advantage-0.043
Tokenoc
Token ID420
Feature activation+1.83e+00

Pos logit contributions
ishly+0.132
 echoing+0.094
 argued+0.091
 lear+0.088
 course+0.087
Neg logit contributions
eddy-0.172
 pans-0.166
ucer-0.152
emo-0.149
 flour-0.143
Tokenados
Token ID22484
Feature activation+2.79e+00

Pos logit contributions
ishly+0.050
 echoing+0.047
 practice+0.044
 rather+0.043
 argued+0.042
Neg logit contributions
eddy-0.041
 pans-0.038
ble-0.033
emo-0.033
loo-0.033
Token!
Token ID0
Feature activation+1.15e+00

Pos logit contributions
ishly+0.093
 argued+0.082
 strides+0.079
 echoing+0.079
alled+0.078
Neg logit contributions
 pans-0.050
eddy-0.049
onna-0.040
tub-0.040
 lad-0.040
Token They
Token ID1119
Feature activation+3.36e-01

Pos logit contributions
ishly+0.037
 argued+0.032
 course+0.031
 practice+0.030
 advantage+0.030
Neg logit contributions
eddy-0.023
 pans-0.021
emo-0.018
 lad-0.017
loo-0.017
Token are
Token ID389
Feature activation+8.48e-03

Pos logit contributions
ishly+0.009
 echoing+0.008
 lear+0.008
 strides+0.008
rified+0.008
Neg logit contributions
 pans-0.007
 lad-0.006
 flour-0.006
 candle-0.006
 glass-0.006
Token tree
Token ID5509
Feature activation+8.57e-01

Pos logit contributions
ishly+0.021
 lear+0.018
 practice+0.018
 advantage+0.018
 strides+0.017
Neg logit contributions
 pans-0.012
ble-0.011
kins-0.011
eddy-0.011
 candle-0.010
Token could
Token ID714
Feature activation-9.57e-01

Pos logit contributions
ishly+0.026
 lear+0.024
 practice+0.023
 advantage+0.023
 strides+0.023
Neg logit contributions
 pans-0.015
ble-0.014
eddy-0.014
kins-0.014
oe-0.014
Token see
Token ID766
Feature activation+9.31e-01

Pos logit contributions
ble+0.019
onna+0.019
 pans+0.018
eddy+0.018
 candle+0.017
Neg logit contributions
ishly-0.029
 lear-0.024
 argued-0.024
rified-0.023
 strides-0.023
Token lots
Token ID6041
Feature activation+4.83e-01

Pos logit contributions
ishly+0.032
 lear+0.027
 argued+0.027
 practice+0.026
 strides+0.025
Neg logit contributions
eddy-0.017
 pans-0.016
 flour-0.015
ble-0.015
ull-0.014
Token of
Token ID286
Feature activation+1.16e+00

Pos logit contributions
ishly+0.018
 argued+0.015
 advantage+0.014
 lear+0.014
 practice+0.014
Neg logit contributions
eddy-0.008
 pans-0.008
 flour-0.006
loo-0.006
 lad-0.006
Token av
Token ID1196
Feature activation+5.71e+00

Pos logit contributions
ishly+0.040
 lear+0.030
 argued+0.030
 course+0.029
 echoing+0.028
Neg logit contributions
 pans-0.024
eddy-0.023
 flour-0.020
emo-0.020
 candle-0.019
Tokenoc
Token ID420
Feature activation+2.41e+00

Pos logit contributions
ishly+0.117
 course+0.081
 argued+0.080
 lear+0.077
 practice+0.076
Neg logit contributions
eddy-0.183
 pans-0.181
ucer-0.164
emo-0.161
ble-0.159
Tokenados
Token ID22484
Feature activation+2.15e+00

Pos logit contributions
ishly+0.068
 echoing+0.060
 practice+0.058
 rather+0.056
 reassured+0.056
Neg logit contributions
eddy-0.053
 pans-0.051
 flour-0.044
loo-0.044
ble-0.043
Token that
Token ID326
Feature activation-1.24e+00

Pos logit contributions
ishly+0.073
 lear+0.060
 advantage+0.060
 practice+0.058
 strides+0.057
Neg logit contributions
 pans-0.039
ble-0.037
eddy-0.036
onna-0.033
kins-0.033
Token the
Token ID262
Feature activation-1.04e-02

Pos logit contributions
 pans+0.023
ull+0.020
emo+0.020
kins+0.020
tub+0.020
Neg logit contributions
ishly-0.041
 lear-0.035
 strides-0.034
 antics-0.033
 practice-0.032
Token supplier
Token ID22693
Feature activation+5.62e-02

Pos logit contributions
 pans+0.000
eddy+0.000
 flour+0.000
ble+0.000
 candle+0.000
Neg logit contributions
ishly-0.000
 argued-0.000
 echoing-0.000
 reassured-0.000
 course-0.000
Token B
Token ID347
Feature activation-3.98e-01

Pos logit contributions
ishly+0.008
 course+0.006
 weren+0.005
 practice+0.005
 argued+0.005
Neg logit contributions
 pans-0.013
ucer-0.012
eddy-0.012
ble-0.012
emo-0.012
Tokenongo
Token ID25162
Feature activation-1.93e-01

Pos logit contributions
 pans+0.011
eddy+0.010
ucer+0.010
 candle+0.010
 flour+0.009
Neg logit contributions
ishly-0.010
 course-0.008
 practice-0.007
 argued-0.007
 lear-0.007
Token loved
Token ID6151
Feature activation-3.77e-01

Pos logit contributions
 pans+0.006
eddy+0.006
ucer+0.005
izz+0.005
emo+0.005
Neg logit contributions
ishly-0.004
 course-0.003
 argued-0.003
 practice-0.003
 lear-0.003
Token to
Token ID284
Feature activation+8.85e-01

Pos logit contributions
 pans+0.009
eddy+0.009
emo+0.008
loo+0.008
 flour+0.008
Neg logit contributions
ishly-0.011
 course-0.008
 argued-0.008
 advantage-0.008
 lear-0.008
Token eat
Token ID4483
Feature activation+1.11e+00

Pos logit contributions
ishly+0.025
rified+0.019
 argued+0.018
 advantage+0.018
 course+0.018
Neg logit contributions
 pans-0.022
eddy-0.021
emo-0.019
ble-0.019
onna-0.019
Token av
Token ID1196
Feature activation+5.56e+00

Pos logit contributions
ishly+0.034
 argued+0.027
 lear+0.026
 advantage+0.026
 practice+0.025
Neg logit contributions
 pans-0.026
eddy-0.025
 flour-0.022
emo-0.021
ble-0.021
Tokenoc
Token ID420
Feature activation+2.39e+00

Pos logit contributions
ishly+0.107
 course+0.076
 argued+0.072
 practice+0.070
 lear+0.068
Neg logit contributions
eddy-0.183
 pans-0.181
ucer-0.166
emo-0.164
izz-0.163
Tokenados
Token ID22484
Feature activation+3.00e+00

Pos logit contributions
ishly+0.068
 echoing+0.064
 rather+0.060
 believe+0.058
 weren+0.058
Neg logit contributions
eddy-0.052
 pans-0.046
loo-0.042
ble-0.042
emo-0.041
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
ishly+0.102
 lear+0.085
 argued+0.081
 strides+0.079
 echoing+0.079
Neg logit contributions
 pans-0.059
eddy-0.058
ble-0.051
 flour-0.049
tub-0.048
Token One
Token ID1881
Feature activation+1.62e+00

Pos logit contributions
ishly+0.005
 course+0.003
 practice+0.003
 weren+0.003
 advantage+0.003
Neg logit contributions
 pans-0.008
eddy-0.008
ucer-0.008
izz-0.007
ble-0.007
Token day
Token ID1110
Feature activation+1.14e-01

Pos logit contributions
ishly+0.026
 course+0.023
 practice+0.021
 argued+0.019
 weren+0.018
Neg logit contributions
 pans-0.054
eddy-0.052
ucer-0.050
ucks-0.049
izz-0.048
Tokenongo
Token ID25162
Feature activation+3.38e-01

Pos logit contributions
ishly+0.005
 course+0.004
 argued+0.003
 practice+0.003
 advantage+0.003
Neg logit contributions
 pans-0.006
eddy-0.005
 candle-0.005
ucer-0.005
loo-0.005
Token didn
Token ID1422
Feature activation+5.81e-01

Pos logit contributions
ishly+0.009
 course+0.007
 advantage+0.007
 lear+0.007
 practice+0.007
Neg logit contributions
 pans-0.009
eddy-0.008
emo-0.008
ble-0.008
ucer-0.007
Token't
Token ID470
Feature activation-4.91e-02

Pos logit contributions
ishly+0.012
 course+0.010
 practice+0.010
 rather+0.009
 advantage+0.009
Neg logit contributions
 pans-0.018
eddy-0.017
ucer-0.016
loo-0.015
emo-0.015
Token like
Token ID588
Feature activation+5.31e-01

Pos logit contributions
eddy+0.002
 pans+0.002
loo+0.002
ucer+0.002
ble+0.002
Neg logit contributions
ishly-0.001
 practice-0.000
 argued-0.000
 advantage-0.000
 lear-0.000
Token wet
Token ID9583
Feature activation-2.36e-01

Pos logit contributions
ishly+0.017
 lear+0.013
 advantage+0.013
 argued+0.013
 course+0.013
Neg logit contributions
 pans-0.011
eddy-0.011
emo-0.009
 flour-0.009
 glass-0.009
Token av
Token ID1196
Feature activation+5.54e+00

Pos logit contributions
 pans+0.005
eddy+0.005
emo+0.005
ble+0.005
loo+0.004
Neg logit contributions
ishly-0.007
 course-0.006
 practice-0.006
 lear-0.005
 echoing-0.005
Tokenoc
Token ID420
Feature activation+2.28e+00

Pos logit contributions
ishly+0.098
 course+0.069
 argued+0.065
 practice+0.063
 lear+0.061
Neg logit contributions
eddy-0.189
 pans-0.188
ucer-0.172
emo-0.171
izz-0.170
Tokenados
Token ID22484
Feature activation+2.72e+00

Pos logit contributions
ishly+0.062
 echoing+0.054
 practice+0.052
 rather+0.051
 argued+0.050
Neg logit contributions
eddy-0.053
 pans-0.048
ble-0.046
emo-0.044
loo-0.043
Token,
Token ID11
Feature activation+3.97e-01

Pos logit contributions
ishly+0.094
 lear+0.081
 advantage+0.080
alled+0.078
ercise+0.077
Neg logit contributions
 glass-0.041
 pans-0.040
ble-0.037
eddy-0.037
 dirty-0.037
Token so
Token ID523
Feature activation+3.47e-01

Pos logit contributions
ishly+0.010
 lear+0.008
 course+0.008
 advantage+0.007
 practice+0.007
Neg logit contributions
 pans-0.011
eddy-0.010
 flour-0.009
ble-0.009
emo-0.009
Token he
Token ID339
Feature activation+2.38e-01

Pos logit contributions
ishly+0.010
 course+0.008
 argued+0.008
 practice+0.008
 lear+0.008
Neg logit contributions
eddy-0.008
 pans-0.008
ucer-0.007
emo-0.007
izz-0.007
Token.
Token ID13
Feature activation-1.36e-01

Pos logit contributions
 pans+0.067
eddy+0.063
ucer+0.060
loo+0.059
emo+0.059
Neg logit contributions
ishly-0.043
 course-0.034
 argued-0.032
 practice-0.031
 lear-0.031
Token She
Token ID1375
Feature activation-1.77e+00

Pos logit contributions
 pans+0.004
eddy+0.004
ucer+0.004
ble+0.004
emo+0.004
Neg logit contributions
ishly-0.003
 course-0.002
 practice-0.002
 argued-0.002
 weren-0.002
Token loved
Token ID6151
Feature activation-5.12e-01

Pos logit contributions
 pans+0.070
ucks+0.069
eddy+0.068
ucer+0.067
loo+0.065
Neg logit contributions
ishly-0.016
 practice-0.012
 chased-0.012
 course-0.011
 argued-0.011
Token to
Token ID284
Feature activation+9.32e-01

Pos logit contributions
eddy+0.014
 pans+0.014
ucer+0.013
emo+0.013
izz+0.012
Neg logit contributions
ishly-0.012
 course-0.010
 argued-0.009
 practice-0.009
 lear-0.009
Token eat
Token ID4483
Feature activation+5.73e-01

Pos logit contributions
ishly+0.021
 course+0.016
 argued+0.015
 advantage+0.015
 lear+0.014
Neg logit contributions
 pans-0.028
eddy-0.027
emo-0.024
ucer-0.024
izz-0.024
Token av
Token ID1196
Feature activation+5.54e+00

Pos logit contributions
ishly+0.017
 course+0.014
 argued+0.014
 practice+0.014
 lear+0.013
Neg logit contributions
eddy-0.012
 pans-0.012
ucer-0.011
emo-0.011
izz-0.010
Tokenoc
Token ID420
Feature activation+1.85e+00

Pos logit contributions
ishly+0.095
 course+0.069
 argued+0.062
 practice+0.062
 advantage+0.059
Neg logit contributions
eddy-0.189
 pans-0.189
ucer-0.174
emo-0.172
izz-0.172
Tokenados
Token ID22484
Feature activation+2.83e+00

Pos logit contributions
ishly+0.048
 echoing+0.042
 rather+0.040
 practice+0.040
 argued+0.039
Neg logit contributions
eddy-0.046
 pans-0.043
emo-0.037
ble-0.037
loo-0.037
Token because
Token ID780
Feature activation-3.70e-02

Pos logit contributions
ishly+0.093
 lear+0.077
 argued+0.074
 strides+0.072
 advantage+0.072
Neg logit contributions
 pans-0.059
eddy-0.058
ble-0.050
 flour-0.049
tub-0.048
Token they
Token ID484
Feature activation+8.72e-02

Pos logit contributions
eddy+0.001
 pans+0.001
ucer+0.001
emo+0.001
izz+0.001
Neg logit contributions
ishly-0.001
 course-0.001
 argued-0.001
 practice-0.001
 lear-0.001
Token were
Token ID547
Feature activation+9.35e-02

Pos logit contributions
ishly+0.002
 course+0.001
 practice+0.001
 argued+0.001
 lear+0.001
Neg logit contributions
 pans-0.003
eddy-0.003
ucer-0.002
izz-0.002
emo-0.002
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
eddy+0.019
 pans+0.018
ucer+0.016
emo+0.016
ble+0.016
Neg logit contributions
ishly-0.021
 course-0.017
 argued-0.017
 advantage-0.016
 practice-0.016
Token Did
Token ID7731
Feature activation+6.48e-01

Pos logit contributions
ishly+0.034
 course+0.028
 argued+0.027
 practice+0.027
 lear+0.026
Neg logit contributions
eddy-0.028
 pans-0.028
ucer-0.024
emo-0.024
izz-0.024
Token you
Token ID345
Feature activation+2.08e+00

Pos logit contributions
ishly+0.022
 lear+0.019
 echoing+0.018
 argued+0.018
 advantage+0.018
Neg logit contributions
 pans-0.011
eddy-0.010
ble-0.010
 flour-0.009
emo-0.009
Token know
Token ID760
Feature activation+1.24e+00

Pos logit contributions
ishly+0.067
 course+0.053
rified+0.052
 lear+0.050
 strides+0.050
Neg logit contributions
ble-0.042
emo-0.038
onna-0.038
eddy-0.038
 pans-0.037
Token that
Token ID326
Feature activation+1.77e-01

Pos logit contributions
ishly+0.038
 lear+0.032
 argued+0.032
 antics+0.029
 advantage+0.029
Neg logit contributions
 pans-0.027
eddy-0.026
 flour-0.025
 candle-0.025
ble-0.025
Token av
Token ID1196
Feature activation+5.39e+00

Pos logit contributions
ishly+0.007
 argued+0.006
 lear+0.006
 advantage+0.005
nered+0.005
Neg logit contributions
 pans-0.003
 candle-0.003
 glass-0.003
 flour-0.003
 room-0.002
Tokenoc
Token ID420
Feature activation+2.47e+00

Pos logit contributions
ishly+0.139
 echoing+0.099
 lear+0.091
 strides+0.091
 argued+0.091
Neg logit contributions
eddy-0.152
 pans-0.145
ucer-0.135
emo-0.135
iflower-0.128
Tokenados
Token ID22484
Feature activation+3.05e+00

Pos logit contributions
ishly+0.071
 echoing+0.070
 practice+0.063
 argued+0.061
 rather+0.060
Neg logit contributions
eddy-0.054
 pans-0.048
loo-0.044
ble-0.043
ucks-0.041
Token are
Token ID389
Feature activation+8.10e-01

Pos logit contributions
ishly+0.101
 lear+0.085
 strides+0.079
 advantage+0.079
 echoing+0.077
Neg logit contributions
 pans-0.057
ble-0.056
loo-0.054
tub-0.052
eddy-0.052
Token very
Token ID845
Feature activation-1.78e-01

Pos logit contributions
ishly+0.028
 argued+0.022
 lear+0.022
 course+0.021
rified+0.021
Neg logit contributions
 pans-0.017
eddy-0.014
 flour-0.014
emo-0.014
ble-0.014
Token healthy
Token ID5448
Feature activation-2.92e-01

Pos logit contributions
 pans+0.004
eddy+0.004
emo+0.004
izz+0.003
ble+0.003
Neg logit contributions
ishly-0.005
 course-0.004
 argued-0.004
 lear-0.004
 practice-0.004
Token says
Token ID1139
Feature activation+1.35e+00

Pos logit contributions
ishly+0.004
 advantage+0.003
 course+0.003
 lear+0.003
 echoing+0.003
Neg logit contributions
 pans-0.003
eddy-0.002
 lad-0.002
emo-0.002
ara-0.002
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
ishly+0.034
 course+0.029
 argued+0.028
 practice+0.027
 echoing+0.027
Neg logit contributions
 pans-0.033
eddy-0.032
oe-0.030
ble-0.029
loo-0.029
Token "
Token ID366
Feature activation+9.79e-02

Pos logit contributions
ishly+0.032
 course+0.027
 argued+0.026
 practice+0.026
 advantage+0.025
Neg logit contributions
 pans-0.026
eddy-0.026
ucer-0.022
emo-0.022
ble-0.022
TokenThey
Token ID2990
Feature activation-3.34e-01

Pos logit contributions
ishly+0.003
 course+0.002
 argued+0.002
 lear+0.002
 practice+0.002
Neg logit contributions
 pans-0.002
eddy-0.002
loo-0.002
izz-0.002
ucer-0.002
Token are
Token ID389
Feature activation+7.03e-01

Pos logit contributions
 pans+0.007
 flour+0.006
 glass+0.006
 lad+0.006
ucer+0.006
Neg logit contributions
ishly-0.010
 lear-0.008
 echoing-0.008
 advantage-0.008
rified-0.007
Token av
Token ID1196
Feature activation+5.36e+00

Pos logit contributions
ishly+0.020
 course+0.016
 argued+0.015
 practice+0.015
 lear+0.015
Neg logit contributions
 pans-0.018
eddy-0.017
emo-0.015
ucer-0.015
izz-0.014
Tokenoc
Token ID420
Feature activation+1.95e+00

Pos logit contributions
ishly+0.094
 course+0.066
 argued+0.061
 practice+0.060
 lear+0.057
Neg logit contributions
eddy-0.183
 pans-0.183
ucer-0.168
emo-0.166
izz-0.165
Tokenados
Token ID22484
Feature activation+2.23e+00

Pos logit contributions
ishly+0.049
 practice+0.046
 rather+0.045
 echoing+0.044
 weren+0.044
Neg logit contributions
eddy-0.046
 pans-0.044
loo-0.038
ucks-0.037
emo-0.037
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
ishly+0.068
 advantage+0.058
 argued+0.058
 course+0.057
 lear+0.056
Neg logit contributions
 pans-0.049
eddy-0.047
ble-0.040
ucer-0.039
 lad-0.038
Token My
Token ID2011
Feature activation+2.16e+00

Pos logit contributions
ishly+0.051
 course+0.043
 argued+0.041
 practice+0.041
 lear+0.040
Neg logit contributions
eddy-0.032
 pans-0.032
ucer-0.027
emo-0.027
izz-0.026
Token mom
Token ID1995
Feature activation+6.09e-02

Pos logit contributions
ishly+0.080
 echoing+0.069
 reassured+0.068
 course+0.066
rified+0.066
Neg logit contributions
 pans-0.035
eddy-0.031
 flour-0.028
 glass-0.028
 room-0.027
Token the
Token ID262
Feature activation+3.36e-01

Pos logit contributions
 pans+0.012
eddy+0.011
 flour+0.010
 salt+0.010
izz+0.009
Neg logit contributions
ishly-0.024
 lear-0.021
 argued-0.020
 echoing-0.020
 antics-0.019
Token tree
Token ID5509
Feature activation+1.47e+00

Pos logit contributions
ishly+0.012
 argued+0.010
 echoing+0.010
 reassured+0.009
 lear+0.009
Neg logit contributions
 pans-0.005
 glass-0.005
 room-0.005
 flour-0.005
 candle-0.005
Token to
Token ID284
Feature activation+5.96e-01

Pos logit contributions
ishly+0.044
 lear+0.040
 argued+0.039
 course+0.039
rified+0.038
Neg logit contributions
 pans-0.026
eddy-0.026
 room-0.025
tub-0.025
 glass-0.024
Token its
Token ID663
Feature activation+2.92e+00

Pos logit contributions
ishly+0.019
 argued+0.016
vious+0.016
rified+0.016
 lear+0.015
Neg logit contributions
 pans-0.011
ble-0.010
eddy-0.010
izz-0.010
 candle-0.010
Token once
Token ID1752
Feature activation+4.00e-01

Pos logit contributions
ishly+0.102
 argued+0.091
rified+0.088
vious+0.088
istent+0.087
Neg logit contributions
 room-0.051
 pans-0.044
 candle-0.043
 glass-0.043
 door-0.041
Token former
Token ID1966
Feature activation+5.35e+00

Pos logit contributions
ishly+0.012
 argued+0.009
 lear+0.009
 compete+0.008
 strides+0.008
Neg logit contributions
eddy-0.009
 pans-0.009
ble-0.008
 candle-0.008
loo-0.008
Token beauty
Token ID8737
Feature activation+4.38e-01

Pos logit contributions
ishly+0.174
 lear+0.138
rified+0.134
 argued+0.134
 course+0.132
Neg logit contributions
 pans-0.114
eddy-0.103
ooks-0.096
 flour-0.094
 room-0.093
Token.
Token ID13
Feature activation+1.83e-01

Pos logit contributions
ishly+0.015
 argued+0.014
 advantage+0.013
 lear+0.013
 practice+0.013
Neg logit contributions
eddy-0.007
tub-0.006
 pans-0.006
loo-0.006
oe-0.006
Token He
Token ID679
Feature activation+6.09e-01

Pos logit contributions
ishly+0.006
 argued+0.005
 course+0.005
 practice+0.004
 lear+0.004
Neg logit contributions
eddy-0.004
 pans-0.004
emo-0.003
loo-0.003
izz-0.003
Token was
Token ID373
Feature activation+5.45e-01

Pos logit contributions
ishly+0.020
 strides+0.017
 anymore+0.017
 advantage+0.017
 compete+0.017
Neg logit contributions
eddy-0.009
 pans-0.008
 lad-0.008
 room-0.008
ble-0.008
Token happy
Token ID3772
Feature activation+2.29e-01

Pos logit contributions
ishly+0.015
 course+0.013
 strides+0.012
 argued+0.012
 lear+0.012
Neg logit contributions
eddy-0.011
 pans-0.011
ble-0.011
izz-0.011
 room-0.010
Token happy
Token ID3772
Feature activation-3.29e-01

Pos logit contributions
 pans+0.012
eddy+0.012
emo+0.010
izz+0.010
ble+0.010
Neg logit contributions
ishly-0.016
 course-0.013
 argued-0.013
 lear-0.012
 practice-0.012
Token that
Token ID326
Feature activation-1.36e+00

Pos logit contributions
eddy+0.007
 pans+0.006
ble+0.006
emo+0.006
loo+0.005
Neg logit contributions
ishly-0.011
 argued-0.009
 advantage-0.009
 practice-0.008
 course-0.008
Token it
Token ID340
Feature activation-1.05e+00

Pos logit contributions
 pans+0.016
ble+0.015
 candle+0.014
 salt+0.014
ull+0.014
Neg logit contributions
ishly-0.053
 argued-0.044
 antics-0.044
 lear-0.044
vious-0.043
Token produced
Token ID4635
Feature activation+1.43e-01

Pos logit contributions
 pans+0.020
eddy+0.018
 lad+0.018
emo+0.017
ucer+0.016
Neg logit contributions
ishly-0.034
 advantage-0.029
 strides-0.028
 antics-0.028
 lear-0.027
Token more
Token ID517
Feature activation-1.61e+00

Pos logit contributions
ishly+0.005
 lear+0.004
 argued+0.004
 echoing+0.004
 strides+0.004
Neg logit contributions
 flour-0.002
 pans-0.002
 salt-0.002
eddy-0.002
emo-0.002
Token av
Token ID1196
Feature activation+5.31e+00

Pos logit contributions
 pans+0.033
eddy+0.031
 flour+0.029
emo+0.028
 salt+0.027
Neg logit contributions
ishly-0.057
 lear-0.043
 argued-0.043
 echoing-0.042
rified-0.041
Tokenoc
Token ID420
Feature activation+2.02e+00

Pos logit contributions
ishly+0.117
 argued+0.084
 course+0.081
 echoing+0.079
 lear+0.079
Neg logit contributions
eddy-0.165
 pans-0.161
ucer-0.147
emo-0.143
ble-0.139
Tokenados
Token ID22484
Feature activation+1.74e+00

Pos logit contributions
ishly+0.054
 echoing+0.049
 practice+0.048
 argued+0.046
 lear+0.045
Neg logit contributions
eddy-0.047
 pans-0.044
ble-0.040
emo-0.039
loo-0.038
Token than
Token ID621
Feature activation-3.84e-01

Pos logit contributions
ishly+0.058
 lear+0.050
 argued+0.049
 strides+0.048
 advantage+0.048
Neg logit contributions
 pans-0.030
eddy-0.029
ble-0.027
ucer-0.026
kins-0.026
Token ever
Token ID1683
Feature activation+1.76e-01

Pos logit contributions
 pans+0.008
 salt+0.007
 flour+0.007
emo+0.007
 lad+0.007
Neg logit contributions
ishly-0.012
 echoing-0.009
 strides-0.009
 lear-0.009
 antics-0.009
Token before
Token ID878
Feature activation-7.34e-01

Pos logit contributions
ishly+0.006
beat+0.004
 strides+0.004
 lear+0.004
 course+0.004
Neg logit contributions
 pans-0.004
 salt-0.003
eddy-0.003
 lad-0.003
emo-0.003
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
ishly+0.109
 advantage+0.101
 argued+0.095
 persever+0.092
 lear+0.091
Neg logit contributions
 pans-0.055
eddy-0.051
 flour-0.048
 twins-0.047
 block-0.047
Token gir
Token ID37370
Feature activation+4.82e+00

Pos logit contributions
ishly+0.040
 advantage+0.032
 argued+0.032
 practice+0.031
 course+0.031
Neg logit contributions
 pans-0.031
eddy-0.029
 flour-0.025
loo-0.025
emo-0.025
Tokenaff
Token ID2001
Feature activation+2.59e+00

Pos logit contributions
ishly+0.157
 argued+0.135
 advantage+0.133
 practice+0.132
 reassured+0.131
Neg logit contributions
eddy-0.088
 pans-0.079
loo-0.072
onna-0.068
 flour-0.065
Tokenes
Token ID274
Feature activation+2.23e+00

Pos logit contributions
ishly+0.079
 advantage+0.067
 practice+0.066
 course+0.063
 competition+0.063
Neg logit contributions
eddy-0.053
 pans-0.053
ble-0.048
ucer-0.047
izz-0.047
Token and
Token ID290
Feature activation+1.81e+00

Pos logit contributions
ishly+0.078
 advantage+0.078
 persever+0.075
 argued+0.074
 unnecessary+0.073
Neg logit contributions
 pans-0.027
eddy-0.027
ble-0.022
 twins-0.021
 flour-0.020
Token ze
Token ID41271
Feature activation+5.31e+00

Pos logit contributions
ishly+0.055
 advantage+0.044
 argued+0.043
date+0.042
istent+0.042
Neg logit contributions
 pans-0.042
 flour-0.034
 glass-0.033
 salt-0.033
eddy-0.032
Tokenbr
Token ID1671
Feature activation+2.99e+00

Pos logit contributions
ishly+0.180
 argued+0.156
 advantage+0.149
 reassured+0.142
 former+0.138
Neg logit contributions
 pans-0.081
loo-0.075
oons-0.074
eddy-0.073
onna-0.072
Tokenas
Token ID292
Feature activation+2.93e-01

Pos logit contributions
ishly+0.091
 course+0.076
 argued+0.073
 practice+0.073
 lear+0.071
Neg logit contributions
eddy-0.063
 pans-0.063
ucer-0.054
emo-0.053
izz-0.053
Token.
Token ID13
Feature activation+4.82e-02

Pos logit contributions
ishly+0.010
 advantage+0.009
 persever+0.009
 practice+0.008
 course+0.008
Neg logit contributions
eddy-0.005
 pans-0.004
ble-0.004
oe-0.004
tub-0.004
Token They
Token ID1119
Feature activation+2.88e-01

Pos logit contributions
ishly+0.001
 course+0.001
 argued+0.001
 practice+0.001
 advantage+0.001
Neg logit contributions
eddy-0.001
 pans-0.001
ucer-0.001
emo-0.001
ble-0.001
Token took
Token ID1718
Feature activation+4.50e-01

Pos logit contributions
ishly+0.009
 advantage+0.008
 unnecessary+0.007
 course+0.007
date+0.007
Neg logit contributions
 pans-0.006
 lad-0.005
emo-0.005
ull-0.004
 block-0.004
Token Tim
Token ID5045
Feature activation-1.37e+00

Pos logit contributions
ishly+0.004
 course+0.003
 argued+0.002
 practice+0.002
 weren+0.002
Neg logit contributions
 pans-0.008
eddy-0.007
ucer-0.007
ble-0.007
izz-0.007
Tokenmy
Token ID1820
Feature activation-1.63e+00

Pos logit contributions
 pans+0.044
ucer+0.039
ucks+0.038
onna+0.037
 lad+0.037
Neg logit contributions
 course-0.024
ishly-0.023
 argued-0.021
 practice-0.021
 weren-0.020
Token loved
Token ID6151
Feature activation-8.83e-01

Pos logit contributions
 pans+0.067
ucks+0.065
ucer+0.062
izz+0.061
eddy+0.060
Neg logit contributions
ishly-0.014
 practice-0.009
 course-0.009
 rather-0.009
 argued-0.008
Token to
Token ID284
Feature activation+8.63e-01

Pos logit contributions
eddy+0.024
 pans+0.024
ucer+0.023
izz+0.022
emo+0.022
Neg logit contributions
ishly-0.020
 course-0.017
 practice-0.016
 argued-0.016
 lear-0.015
Token eat
Token ID4483
Feature activation+3.99e-01

Pos logit contributions
ishly+0.019
 course+0.014
 argued+0.013
 advantage+0.013
 lear+0.013
Neg logit contributions
 pans-0.026
eddy-0.026
emo-0.023
ucer-0.023
izz-0.023
Token av
Token ID1196
Feature activation+5.28e+00

Pos logit contributions
ishly+0.012
 course+0.011
 practice+0.010
 argued+0.010
 advantage+0.010
Neg logit contributions
eddy-0.008
 pans-0.008
ucer-0.007
izz-0.007
emo-0.007
Tokenoc
Token ID420
Feature activation+1.64e+00

Pos logit contributions
ishly+0.091
 course+0.065
 argued+0.059
 practice+0.059
 advantage+0.056
Neg logit contributions
eddy-0.181
 pans-0.181
ucer-0.166
emo-0.165
izz-0.164
Tokenados
Token ID22484
Feature activation+2.61e+00

Pos logit contributions
ishly+0.042
 echoing+0.036
 rather+0.035
 practice+0.035
 lear+0.035
Neg logit contributions
eddy-0.042
 pans-0.039
emo-0.035
ble-0.034
loo-0.034
Token,
Token ID11
Feature activation-4.17e-01

Pos logit contributions
ishly+0.086
 lear+0.071
 argued+0.068
 course+0.066
 strides+0.066
Neg logit contributions
eddy-0.054
 pans-0.054
ble-0.047
tub-0.044
 flour-0.044
Token they
Token ID484
Feature activation-5.81e-01

Pos logit contributions
 pans+0.013
eddy+0.013
ucer+0.012
emo+0.012
izz+0.011
Neg logit contributions
ishly-0.009
 course-0.007
 argued-0.006
 practice-0.006
 lear-0.006
Token were
Token ID547
Feature activation-1.54e-01

Pos logit contributions
 pans+0.017
eddy+0.017
ucer+0.015
emo+0.015
izz+0.015
Neg logit contributions
ishly-0.013
 course-0.010
 advantage-0.009
 practice-0.009
 argued-0.009
Token like
Token ID588
Feature activation+8.47e-01

Pos logit contributions
ishly+0.011
 advantage+0.008
 course+0.008
 lear+0.008
 argued+0.008
Neg logit contributions
 pans-0.010
 flour-0.008
eddy-0.008
 salt-0.008
 glass-0.008
Token monkeys
Token ID26697
Feature activation+3.30e+00

Pos logit contributions
ishly+0.028
 advantage+0.023
istent+0.022
 reassured+0.022
 argued+0.022
Neg logit contributions
 pans-0.018
 flour-0.015
 salt-0.014
eddy-0.014
 glass-0.014
Token and
Token ID290
Feature activation+2.11e+00

Pos logit contributions
ishly+0.107
 advantage+0.104
 lear+0.103
 persever+0.102
 practice+0.098
Neg logit contributions
 pans-0.046
eddy-0.043
onna-0.042
oe-0.042
 flour-0.040
Token elephants
Token ID25368
Feature activation+3.07e+00

Pos logit contributions
ishly+0.065
 advantage+0.056
istent+0.055
 reassured+0.053
date+0.053
Neg logit contributions
 pans-0.047
 salt-0.037
 glass-0.037
 flour-0.036
 lad-0.035
Token and
Token ID290
Feature activation+2.04e+00

Pos logit contributions
ishly+0.118
 persever+0.109
 advantage+0.108
 practice+0.104
 practise+0.103
Neg logit contributions
 pans-0.029
eddy-0.028
 twins-0.026
 block-0.024
onna-0.023
Token gir
Token ID37370
Feature activation+5.23e+00

Pos logit contributions
ishly+0.066
istent+0.057
 advantage+0.055
date+0.053
 reassured+0.053
Neg logit contributions
 pans-0.043
 glass-0.035
 salt-0.034
 flour-0.033
 twins-0.032
Tokenaff
Token ID2001
Feature activation+2.70e+00

Pos logit contributions
ishly+0.193
 reassured+0.175
 practice+0.171
date+0.170
 practise+0.170
Neg logit contributions
eddy-0.061
onna-0.050
loo-0.049
 pans-0.047
ucks-0.046
Tokenes
Token ID274
Feature activation+2.21e+00

Pos logit contributions
ishly+0.087
 practice+0.072
 advantage+0.070
 competition+0.069
 course+0.068
Neg logit contributions
eddy-0.053
 pans-0.052
emo-0.050
ble-0.050
ull-0.050
Token.
Token ID13
Feature activation+4.08e-01

Pos logit contributions
ishly+0.088
 persever+0.087
 advantage+0.083
 practice+0.082
istent+0.082
Neg logit contributions
 pans-0.012
eddy-0.011
tub-0.010
kins-0.010
ull-0.009
Token They
Token ID1119
Feature activation+1.48e-02

Pos logit contributions
ishly+0.014
 course+0.012
 argued+0.011
 practice+0.011
 advantage+0.011
Neg logit contributions
eddy-0.007
 pans-0.007
emo-0.006
izz-0.006
ucer-0.006
Token had
Token ID550
Feature activation+6.90e-01

Pos logit contributions
ishly+0.001
 advantage+0.000
 unnecessary+0.000
 strides+0.000
 course+0.000
Neg logit contributions
 pans-0.000
 lad-0.000
emo-0.000
 glass-0.000
kins-0.000
Token day
Token ID1110
Feature activation-1.57e-01

Pos logit contributions
 pans+0.010
eddy+0.010
ucer+0.009
izz+0.009
emo+0.009
Neg logit contributions
ishly-0.008
 course-0.007
 practice-0.006
 argued-0.006
 lear-0.006
Token on
Token ID319
Feature activation+2.19e+00

Pos logit contributions
 pans+0.004
eddy+0.004
emo+0.003
ucer+0.003
izz+0.003
Neg logit contributions
ishly-0.004
 course-0.004
 practice-0.004
 argued-0.003
 lear-0.003
Token,
Token ID11
Feature activation-8.93e-01

Pos logit contributions
ishly+0.048
 course+0.036
 argued+0.035
 practice+0.034
 advantage+0.033
Neg logit contributions
eddy-0.066
 pans-0.066
ucer-0.059
emo-0.059
izz-0.058
Token Lily
Token ID20037
Feature activation+4.22e-01

Pos logit contributions
eddy+0.013
 pans+0.013
emo+0.011
izz+0.010
ucer+0.010
Neg logit contributions
ishly-0.034
 argued-0.028
 course-0.027
 advantage-0.027
 strides-0.027
Token loved
Token ID6151
Feature activation-8.07e-01

Pos logit contributions
ishly+0.011
 course+0.008
 advantage+0.008
 argued+0.008
 strides+0.008
Neg logit contributions
eddy-0.011
 pans-0.011
emo-0.010
izz-0.010
ble-0.010
Token av
Token ID1196
Feature activation+5.20e+00

Pos logit contributions
eddy+0.020
 pans+0.020
emo+0.018
ucer+0.018
izz+0.017
Neg logit contributions
ishly-0.022
 course-0.018
 argued-0.017
 practice-0.017
 advantage-0.016
Tokenoc
Token ID420
Feature activation+8.09e-01

Pos logit contributions
ishly+0.087
 course+0.062
 argued+0.056
 practice+0.056
 advantage+0.053
Neg logit contributions
eddy-0.181
 pans-0.181
ucer-0.166
emo-0.165
izz-0.164
Tokenados
Token ID22484
Feature activation+2.26e+00

Pos logit contributions
ishly+0.019
 course+0.015
 argued+0.014
 practice+0.014
 lear+0.014
Neg logit contributions
eddy-0.023
 pans-0.022
ucer-0.020
emo-0.020
ble-0.020
Token and
Token ID290
Feature activation+3.02e-01

Pos logit contributions
ishly+0.084
 argued+0.072
ued+0.070
 strides+0.070
alled+0.069
Neg logit contributions
eddy-0.034
 room-0.029
 pans-0.027
 glass-0.027
tub-0.027
Token would
Token ID561
Feature activation-5.35e-01

Pos logit contributions
ishly+0.008
 advantage+0.006
 strides+0.006
 course+0.006
 echoing+0.006
Neg logit contributions
 pans-0.007
eddy-0.007
emo-0.007
 flour-0.007
 salt-0.006
Token ask
Token ID1265
Feature activation+7.18e-01

Pos logit contributions
 pans+0.015
onna+0.014
eddy+0.014
 lad+0.013
emo+0.013
Neg logit contributions
ishly-0.012
rified-0.009
 strides-0.009
 advantage-0.009
 argued-0.009
Token continued
Token ID3767
Feature activation-6.33e-01

Pos logit contributions
ishly+0.037
 advantage+0.033
 echoing+0.033
 unnecessary+0.033
 antics+0.033
Neg logit contributions
eddy-0.021
ble-0.021
emo-0.021
 lad-0.021
ate-0.019
Token to
Token ID284
Feature activation+2.56e+00

Pos logit contributions
 pans+0.010
eddy+0.009
onna+0.008
oe+0.008
 room+0.008
Neg logit contributions
ishly-0.021
rified-0.020
 weren-0.019
 course-0.019
Deal-0.018
Token enjoy
Token ID2883
Feature activation+1.02e+00

Pos logit contributions
ishly+0.073
rified+0.073
ted+0.066
ified+0.065
vious+0.065
Neg logit contributions
ble-0.043
 room-0.042
 pans-0.042
 candle-0.042
onna-0.041
Token eating
Token ID6600
Feature activation+6.68e-01

Pos logit contributions
ishly+0.039
ave+0.036
vious+0.034
 argued+0.033
rified+0.033
Neg logit contributions
 pans-0.016
 salt-0.013
 flour-0.013
 room-0.012
ble-0.011
Token her
Token ID607
Feature activation-7.80e-01

Pos logit contributions
ishly+0.024
 argued+0.020
ave+0.020
 lear+0.020
istent+0.020
Neg logit contributions
 pans-0.012
 flour-0.011
 salt-0.011
 glass-0.010
 candle-0.009
Token av
Token ID1196
Feature activation+5.17e+00

Pos logit contributions
 pans+0.015
 glass+0.013
 room+0.013
 salt+0.012
 flour+0.012
Neg logit contributions
ishly-0.028
 lear-0.023
 argued-0.023
rified-0.022
istent-0.022
Tokenoc
Token ID420
Feature activation+1.82e+00

Pos logit contributions
ishly+0.121
 echoing+0.087
 lear+0.087
 argued+0.087
 course+0.082
Neg logit contributions
eddy-0.154
 pans-0.151
ucer-0.136
emo-0.132
 flour-0.127
Tokenados
Token ID22484
Feature activation+2.98e+00

Pos logit contributions
 echoing+0.049
ishly+0.048
 practice+0.046
 rather+0.045
 lear+0.045
Neg logit contributions
eddy-0.041
 pans-0.037
loo-0.032
ble-0.031
ucks-0.031
Token with
Token ID351
Feature activation-7.64e-02

Pos logit contributions
ishly+0.106
 lear+0.097
rified+0.094
 argued+0.092
 antics+0.092
Neg logit contributions
 pans-0.043
kins-0.041
ble-0.039
oe-0.035
tub-0.035
Token a
Token ID257
Feature activation+4.46e-01

Pos logit contributions
 flour+0.001
 salt+0.001
 glass+0.001
 pans+0.001
 candle+0.001
Neg logit contributions
ishly-0.003
vious-0.002
 lear-0.002
 echoing-0.002
istent-0.002
Token smile
Token ID8212
Feature activation-7.77e-01

Pos logit contributions
ishly+0.016
alled+0.013
 argued+0.013
vious+0.013
ted+0.013
Neg logit contributions
 pans-0.007
 candle-0.007
 glass-0.007
pron-0.006
 bowl-0.006
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
oe+0.007
onna+0.007
 block+0.007
 room+0.007
ble+0.007
Neg logit contributions
ishly-0.019
 course-0.016
 Barkley-0.015
itivity-0.015
cient-0.015
Token He
Token ID679
Feature activation-1.46e+00

Pos logit contributions
 pans+0.033
eddy+0.033
ucer+0.030
izz+0.028
loo+0.028
Neg logit contributions
ishly-0.038
 course-0.031
 practice-0.030
 lear-0.030
 argued-0.029
Token put
Token ID1234
Feature activation-2.92e+00

Pos logit contributions
 pans+0.032
eddy+0.031
 lad+0.028
izz+0.027
emo+0.027
Neg logit contributions
ishly-0.045
 advantage-0.035
 course-0.035
 practice-0.033
 strides-0.033
Token the
Token ID262
Feature activation-2.63e+00

Pos logit contributions
eddy+0.073
 pans+0.071
emo+0.064
ucer+0.063
loo+0.063
Neg logit contributions
ishly-0.078
 course-0.066
 practice-0.062
 argued-0.061
 advantage-0.060
Token soft
Token ID2705
Feature activation-3.26e+00

Pos logit contributions
eddy+0.038
izz+0.033
emo+0.033
 pans+0.032
ucer+0.027
Neg logit contributions
ishly-0.095
 course-0.087
 advantage-0.083
 practice-0.083
 lear-0.081
Token end
Token ID886
Feature activation-5.23e+00

Pos logit contributions
izz+0.062
emo+0.061
eddy+0.061
 pans+0.057
ucker+0.055
Neg logit contributions
ishly-0.097
 course-0.085
 advantage-0.080
 practice-0.079
 lear-0.078
Token into
Token ID656
Feature activation-1.34e+00

Pos logit contributions
 pans+0.150
eddy+0.147
emo+0.142
ucer+0.135
izz+0.134
Neg logit contributions
ishly-0.101
 course-0.090
 practice-0.081
 rather-0.077
 advantage-0.075
Token the
Token ID262
Feature activation-2.05e+00

Pos logit contributions
eddy+0.036
 pans+0.034
loo+0.032
emo+0.032
ucer+0.032
Neg logit contributions
ishly-0.030
 course-0.027
 practice-0.026
 lear-0.024
 argued-0.024
Token lock
Token ID5793
Feature activation-1.39e+00

Pos logit contributions
eddy+0.036
 pans+0.033
emo+0.033
izz+0.032
ucer+0.029
Neg logit contributions
ishly-0.069
 course-0.062
 practice-0.058
 advantage-0.057
 lear-0.056
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
eddy+0.022
 pans+0.020
oe+0.017
 room+0.017
tub+0.017
Neg logit contributions
ishly-0.050
 advantage-0.042
 compete-0.042
 argued-0.042
 course-0.041
Token pushed
Token ID7121
Feature activation-1.40e+00

Pos logit contributions
 pans+0.024
eddy+0.021
ucer+0.019
 lad+0.019
 flour+0.019
Neg logit contributions
ishly-0.035
 course-0.029
 advantage-0.028
 lear-0.027
 argued-0.027
Token the
Token ID262
Feature activation+1.23e-01

Pos logit contributions
eddy+0.023
 pans+0.021
ucer+0.020
loo+0.019
emo+0.019
Neg logit contributions
ishly-0.020
 course-0.017
 practice-0.016
 argued-0.015
 compete-0.015
Token plank
Token ID39599
Feature activation-3.33e-01

Pos logit contributions
ishly+0.004
 course+0.003
 practice+0.003
 argued+0.003
 competition+0.003
Neg logit contributions
eddy-0.003
izz-0.002
 pans-0.002
emo-0.002
ucer-0.002
Token and
Token ID290
Feature activation-1.99e-01

Pos logit contributions
eddy+0.007
 pans+0.007
ucer+0.006
loo+0.006
ble+0.006
Neg logit contributions
ishly-0.010
 argued-0.008
 course-0.008
 practice-0.008
 lear-0.008
Token put
Token ID1234
Feature activation-2.46e+00

Pos logit contributions
 pans+0.005
eddy+0.005
ucer+0.005
emo+0.005
izz+0.005
Neg logit contributions
ishly-0.005
 course-0.004
 argued-0.004
 practice-0.004
 advantage-0.004
Token one
Token ID530
Feature activation-3.20e+00

Pos logit contributions
 pans+0.055
eddy+0.054
ucer+0.048
izz+0.047
emo+0.046
Neg logit contributions
ishly-0.074
 course-0.059
 argued-0.059
 practice-0.057
 lear-0.057
Token end
Token ID886
Feature activation-4.92e+00

Pos logit contributions
eddy+0.070
 pans+0.069
ucer+0.060
emo+0.059
izz+0.059
Neg logit contributions
ishly-0.095
 course-0.080
 practice-0.077
 argued-0.076
 advantage-0.075
Token on
Token ID319
Feature activation-1.09e+00

Pos logit contributions
eddy+0.138
 pans+0.135
emo+0.128
ucer+0.124
tub+0.122
Neg logit contributions
ishly-0.094
 course-0.085
 practice-0.082
 advantage-0.073
 rather-0.071
Token the
Token ID262
Feature activation-7.48e-01

Pos logit contributions
eddy+0.027
 pans+0.026
ucer+0.024
emo+0.023
izz+0.023
Neg logit contributions
ishly-0.030
 course-0.025
 argued-0.023
 practice-0.023
 advantage-0.023
Token barrel
Token ID9036
Feature activation-5.67e-01

Pos logit contributions
eddy+0.015
 pans+0.014
izz+0.012
emo+0.012
ucer+0.012
Neg logit contributions
ishly-0.024
 course-0.021
 argued-0.020
 practice-0.020
 advantage-0.019
Token.
Token ID13
Feature activation-8.62e-01

Pos logit contributions
eddy+0.009
tub+0.008
 pans+0.008
 lad+0.008
 glass+0.008
Neg logit contributions
ishly-0.019
 practice-0.016
 unnecessary-0.016
 advantage-0.016
itivity-0.016
Token He
Token ID679
Feature activation-1.14e+00

Pos logit contributions
eddy+0.021
 pans+0.020
ucer+0.018
izz+0.017
ble+0.017
Neg logit contributions
ishly-0.024
 course-0.020
 practice-0.019
 argued-0.019
 lear-0.019
Token him
Token ID683
Feature activation-5.36e-01

Pos logit contributions
eddy+0.081
 pans+0.080
emo+0.073
ucer+0.072
izz+0.072
Neg logit contributions
ishly-0.047
 course-0.032
 argued-0.032
 lear-0.031
 echoing-0.031
Token the
Token ID262
Feature activation-2.72e+00

Pos logit contributions
eddy+0.013
 pans+0.013
ucer+0.011
emo+0.011
izz+0.011
Neg logit contributions
ishly-0.015
 course-0.012
 argued-0.012
 practice-0.012
 lear-0.012
Token ice
Token ID4771
Feature activation-4.49e+00

Pos logit contributions
eddy+0.056
 pans+0.055
ucer+0.049
izz+0.047
emo+0.047
Neg logit contributions
ishly-0.083
 course-0.072
 practice-0.069
 argued-0.068
 advantage-0.067
Token and
Token ID290
Feature activation-1.75e+00

Pos logit contributions
 pans+0.083
eddy+0.075
ucer+0.072
izz+0.071
ooks+0.070
Neg logit contributions
ishly-0.131
 course-0.119
 advantage-0.109
 argued-0.107
 lear-0.105
Token the
Token ID262
Feature activation-2.22e+00

Pos logit contributions
 pans+0.040
eddy+0.038
emo+0.034
 flour+0.033
ucer+0.033
Neg logit contributions
ishly-0.052
 advantage-0.042
 course-0.041
 argued-0.040
 compete-0.040
Token ice
Token ID4771
Feature activation-4.89e+00

Pos logit contributions
eddy+0.042
 pans+0.041
ucer+0.035
emo+0.035
izz+0.034
Neg logit contributions
ishly-0.073
 course-0.062
 argued-0.060
 practice-0.059
 lear-0.058
Token cream
Token ID8566
Feature activation-3.03e+00

Pos logit contributions
 pans+0.089
eddy+0.082
ucer+0.076
izz+0.076
ooks+0.074
Neg logit contributions
ishly-0.147
 course-0.131
 argued-0.123
 advantage-0.120
 lear-0.118
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
eddy+0.070
 pans+0.069
ucer+0.061
emo+0.060
loo+0.059
Neg logit contributions
ishly-0.088
 course-0.072
 argued-0.069
 practice-0.069
 advantage-0.067
Token She
Token ID1375
Feature activation-1.65e+00

Pos logit contributions
eddy+0.033
 pans+0.032
ucer+0.028
emo+0.028
izz+0.028
Neg logit contributions
ishly-0.041
 course-0.034
 argued-0.032
 practice-0.032
 lear-0.031
Token put
Token ID1234
Feature activation-2.89e+00

Pos logit contributions
eddy+0.039
 pans+0.039
emo+0.035
izz+0.034
 lad+0.034
Neg logit contributions
ishly-0.046
 advantage-0.037
 course-0.037
 practice-0.036
 argued-0.035
Token the
Token ID262
Feature activation-3.26e+00

Pos logit contributions
eddy+0.072
 pans+0.071
ucer+0.063
izz+0.062
emo+0.062
Neg logit contributions
ishly-0.079
 argued-0.061
 course-0.061
 practice-0.060
 lear-0.060
Token.
Token ID13
Feature activation-4.79e-01

Pos logit contributions
eddy+0.038
 pans+0.038
ucer+0.035
emo+0.034
loo+0.034
Neg logit contributions
ishly-0.036
 course-0.029
 argued-0.027
 practice-0.027
 advantage-0.026
Token But
Token ID887
Feature activation-7.53e-02

Pos logit contributions
eddy+0.011
 pans+0.011
ucer+0.010
emo+0.010
izz+0.010
Neg logit contributions
ishly-0.013
 course-0.011
 argued-0.010
 practice-0.010
 lear-0.010
Token I
Token ID314
Feature activation-8.25e-01

Pos logit contributions
 pans+0.002
eddy+0.001
ble+0.001
loo+0.001
emo+0.001
Neg logit contributions
ishly-0.002
 argued-0.002
 course-0.002
 lear-0.002
 advantage-0.002
Token dropped
Token ID5710
Feature activation-1.73e+00

Pos logit contributions
eddy+0.020
 pans+0.020
izz+0.019
emo+0.019
 lad+0.018
Neg logit contributions
ishly-0.023
 course-0.019
 advantage-0.017
 strides-0.017
 competition-0.016
Token the
Token ID262
Feature activation-3.42e+00

Pos logit contributions
 pans+0.035
eddy+0.032
 flour+0.028
ucer+0.027
 salt+0.027
Neg logit contributions
ishly-0.058
 echoing-0.048
 lear-0.047
 argued-0.046
 compete-0.046
Token dirty
Token ID11841
Feature activation-4.78e+00

Pos logit contributions
ooks+0.070
eddy+0.070
emo+0.070
ucks+0.069
igi+0.068
Neg logit contributions
ishly-0.090
 course-0.077
 weren-0.075
 advantage-0.074
 anymore-0.070
Token dish
Token ID9433
Feature activation-2.58e+00

Pos logit contributions
onna+0.098
izz+0.094
igi+0.093
emo+0.092
ucker+0.091
Neg logit contributions
ishly-0.125
 weren-0.101
 course-0.098
 after-0.093
 argued-0.092
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
eddy+0.047
 pans+0.046
emo+0.040
ucer+0.039
izz+0.039
Neg logit contributions
ishly-0.085
 course-0.072
 argued-0.069
 practice-0.068
 advantage-0.068
Token I
Token ID314
Feature activation-1.65e+00

Pos logit contributions
 pans+0.028
eddy+0.028
ucer+0.026
ble+0.025
oe+0.025
Neg logit contributions
ishly-0.025
 course-0.019
 practice-0.019
 rather-0.018
 advantage-0.018
Token'm
Token ID1101
Feature activation-1.40e+00

Pos logit contributions
eddy+0.038
 pans+0.036
izz+0.036
emo+0.035
ucer+0.033
Neg logit contributions
ishly-0.049
 course-0.040
 strides-0.038
 advantage-0.037
 competition-0.036
Token sorry
Token ID7926
Feature activation-1.69e+00

Pos logit contributions
eddy+0.040
 pans+0.040
ucer+0.036
emo+0.036
loo+0.035
Neg logit contributions
ishly-0.033
 course-0.025
 argued-0.024
 practice-0.024
 advantage-0.023
Token lady
Token ID10846
Feature activation-5.42e-01

Pos logit contributions
 pans+0.026
eddy+0.026
ucer+0.022
emo+0.021
loo+0.021
Neg logit contributions
ishly-0.044
 course-0.037
 argued-0.036
 practice-0.035
 lear-0.035
Token gave
Token ID2921
Feature activation-2.15e+00

Pos logit contributions
eddy+0.011
 pans+0.010
 lad+0.010
oe+0.009
izz+0.009
Neg logit contributions
ishly-0.017
 advantage-0.015
 strides-0.014
 course-0.014
 unnecessary-0.014
Token her
Token ID607
Feature activation-1.01e+00

Pos logit contributions
eddy+0.047
 pans+0.046
izz+0.045
emo+0.043
ble+0.042
Neg logit contributions
ishly-0.063
 lear-0.049
 echoing-0.048
 course-0.048
 strides-0.047
Token a
Token ID257
Feature activation-2.45e+00

Pos logit contributions
eddy+0.020
 pans+0.020
 flour+0.017
ucer+0.016
 glass+0.016
Neg logit contributions
ishly-0.033
 argued-0.027
 course-0.027
 echoing-0.026
 strides-0.026
Token delicious
Token ID12625
Feature activation-2.25e+00

Pos logit contributions
eddy+0.050
 pans+0.049
ucer+0.043
emo+0.042
izz+0.042
Neg logit contributions
ishly-0.076
 course-0.064
 practice-0.062
 argued-0.061
 lear-0.060
Token ice
Token ID4771
Feature activation-4.75e+00

Pos logit contributions
 pans+0.055
eddy+0.055
ucer+0.050
emo+0.049
izz+0.048
Neg logit contributions
ishly-0.058
 course-0.052
 practice-0.049
 argued-0.047
 advantage-0.047
Token cream
Token ID8566
Feature activation-3.19e+00

Pos logit contributions
 pans+0.080
eddy+0.075
ucer+0.065
izz+0.065
emo+0.065
Neg logit contributions
ishly-0.151
 course-0.136
 advantage-0.127
 argued-0.126
 lear-0.126
Token.
Token ID13
Feature activation-4.08e-01

Pos logit contributions
eddy+0.063
 pans+0.059
ucer+0.052
loo+0.049
ble+0.049
Neg logit contributions
ishly-0.106
 course-0.089
 argued-0.088
 practice-0.088
 advantage-0.086
Token Sandy
Token ID17581
Feature activation-6.02e-01

Pos logit contributions
eddy+0.009
 pans+0.008
izz+0.008
emo+0.008
tub+0.008
Neg logit contributions
ishly-0.012
 course-0.010
 argued-0.009
 practice-0.009
 advantage-0.009
Token was
Token ID373
Feature activation-3.57e-01

Pos logit contributions
 pans+0.010
eddy+0.010
 lad+0.009
oe+0.008
izz+0.008
Neg logit contributions
ishly-0.020
 advantage-0.018
 echoing-0.017
 strides-0.017
 course-0.017
Token so
Token ID523
Feature activation-6.71e-01

Pos logit contributions
eddy+0.009
 pans+0.009
ble+0.008
izz+0.008
emo+0.008
Neg logit contributions
ishly-0.009
 course-0.008
 lear-0.007
 argued-0.007
 strides-0.007
Token hit
Token ID2277
Feature activation-1.34e+00

Pos logit contributions
ishly+0.015
 advantage+0.014
 antics+0.013
 upcoming+0.013
istent+0.013
Neg logit contributions
 pans-0.010
 lad-0.009
 salt-0.008
 glass-0.008
ucer-0.008
Token him
Token ID683
Feature activation+1.68e-01

Pos logit contributions
 pans+0.029
eddy+0.028
ble+0.025
 flour+0.025
izz+0.024
Neg logit contributions
ishly-0.040
 lear-0.032
 argued-0.032
 compete-0.031
 echoing-0.031
Token with
Token ID351
Feature activation-1.84e+00

Pos logit contributions
ishly+0.006
 course+0.005
 argued+0.005
 discussed+0.005
 competition+0.005
Neg logit contributions
ble-0.003
eddy-0.003
 pans-0.003
 block-0.002
 candle-0.002
Token the
Token ID262
Feature activation-1.76e+00

Pos logit contributions
 pans+0.029
 glass+0.027
 flour+0.027
 salt+0.026
eddy+0.023
Neg logit contributions
ishly-0.068
 compete-0.053
 antics-0.053
cient-0.053
Deal-0.052
Token wet
Token ID9583
Feature activation-3.16e+00

Pos logit contributions
 pans+0.039
eddy+0.038
ucer+0.034
ble+0.032
izz+0.032
Neg logit contributions
ishly-0.053
 course-0.044
 argued-0.042
 practice-0.041
 advantage-0.041
Token end
Token ID886
Feature activation-4.65e+00

Pos logit contributions
eddy+0.068
 pans+0.065
emo+0.061
izz+0.060
ucer+0.058
Neg logit contributions
ishly-0.094
 course-0.077
 argued-0.074
 practice-0.074
 lear-0.074
Token.
Token ID13
Feature activation-3.75e-01

Pos logit contributions
eddy+0.128
 pans+0.127
emo+0.118
ucer+0.117
izz+0.112
Neg logit contributions
ishly-0.103
 course-0.085
 practice-0.079
 argued-0.075
 advantage-0.074
Token Sam
Token ID3409
Feature activation-1.45e-01

Pos logit contributions
 pans+0.007
eddy+0.006
emo+0.006
onna+0.006
tub+0.006
Neg logit contributions
ishly-0.012
 course-0.010
 argued-0.010
 advantage-0.010
nered-0.010
Token was
Token ID373
Feature activation-9.62e-01

Pos logit contributions
 pans+0.003
eddy+0.002
 lad+0.002
 glass+0.002
Anna+0.002
Neg logit contributions
ishly-0.005
 advantage-0.004
 compete-0.004
 upcoming-0.004
 course-0.004
Token startled
Token ID36249
Feature activation+3.96e-01

Pos logit contributions
 pans+0.022
eddy+0.021
ble+0.019
izz+0.018
emo+0.018
Neg logit contributions
ishly-0.028
 course-0.024
 lear-0.022
 argued-0.022
 practice-0.022
Token and
Token ID290
Feature activation-3.44e-01

Pos logit contributions
ishly+0.012
 course+0.009
 argued+0.009
 practice+0.009
 lear+0.009
Neg logit contributions
eddy-0.009
 pans-0.009
izz-0.007
emo-0.007
ble-0.007
Token Mia
Token ID32189
Feature activation-2.22e+00

Pos logit contributions
eddy+0.054
 pans+0.052
emo+0.047
ucer+0.047
loo+0.046
Neg logit contributions
ishly-0.056
 course-0.049
 argued-0.046
 practice-0.045
 advantage-0.044
Token a
Token ID257
Feature activation-2.94e+00

Pos logit contributions
eddy+0.056
 pans+0.056
ucer+0.051
emo+0.050
loo+0.050
Neg logit contributions
ishly-0.058
 course-0.047
 argued-0.045
 practice-0.045
 advantage-0.043
Token slice
Token ID16416
Feature activation-3.24e+00

Pos logit contributions
eddy+0.062
 pans+0.053
ble+0.053
izz+0.053
emo+0.051
Neg logit contributions
ishly-0.080
 practice-0.076
 course-0.076
 lear-0.072
 competition-0.072
Token and
Token ID290
Feature activation-2.02e+00

Pos logit contributions
 pans+0.088
eddy+0.086
emo+0.078
ucer+0.078
izz+0.077
Neg logit contributions
ishly-0.076
 course-0.064
 argued-0.058
 practice-0.058
 lear-0.057
Token a
Token ID257
Feature activation-3.17e+00

Pos logit contributions
 pans+0.044
eddy+0.043
ucer+0.037
izz+0.037
emo+0.037
Neg logit contributions
ishly-0.061
 course-0.050
 argued-0.049
 advantage-0.047
 practice-0.047
Token glass
Token ID5405
Feature activation-4.65e+00

Pos logit contributions
eddy+0.064
izz+0.056
emo+0.055
 pans+0.055
ble+0.054
Neg logit contributions
ishly-0.092
 course-0.086
 practice-0.084
 lear-0.081
 competition-0.079
Token of
Token ID286
Feature activation-2.97e+00

Pos logit contributions
 pans+0.137
emo+0.131
eddy+0.131
izz+0.126
ucer+0.125
Neg logit contributions
ishly-0.083
 course-0.073
 lear-0.066
 practice-0.059
 advantage-0.059
Token milk
Token ID7545
Feature activation-3.88e+00

Pos logit contributions
eddy+0.055
 pans+0.050
ucer+0.047
izz+0.045
emo+0.045
Neg logit contributions
ishly-0.097
 course-0.088
 practice-0.083
 argued-0.080
 advantage-0.079
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
 pans+0.112
eddy+0.111
emo+0.104
izz+0.103
ucer+0.102
Neg logit contributions
ishly-0.083
 course-0.064
 argued-0.060
 lear-0.057
 practice-0.057
Token They
Token ID1119
Feature activation-2.65e+00

Pos logit contributions
eddy+0.030
 pans+0.030
ucer+0.027
emo+0.026
loo+0.026
Neg logit contributions
ishly-0.035
 course-0.029
 practice-0.028
 argued-0.027
 lear-0.027
Token sit
Token ID1650
Feature activation-1.74e+00

Pos logit contributions
 pans+0.066
eddy+0.062
izz+0.056
 lad+0.056
ucer+0.055
Neg logit contributions
ishly-0.073
 course-0.057
 advantage-0.056
 argued-0.056
 echoing-0.054
Token let
Token ID1309
Feature activation-1.14e-02

Pos logit contributions
 pans+0.011
eddy+0.011
ble+0.010
ucer+0.010
 flour+0.010
Neg logit contributions
ishly-0.012
 course-0.009
 lear-0.009
 argued-0.009
 advantage-0.009
Token's
Token ID338
Feature activation+5.06e-01

Pos logit contributions
eddy+0.000
 pans+0.000
izz+0.000
 flour+0.000
emo+0.000
Neg logit contributions
ishly-0.000
 lear-0.000
 course-0.000
 argued-0.000
 echoing-0.000
Token clean
Token ID3424
Feature activation-1.50e+00

Pos logit contributions
ishly+0.014
 course+0.011
 lear+0.011
 argued+0.010
 echoing+0.010
Neg logit contributions
 pans-0.012
eddy-0.012
ble-0.011
izz-0.011
 candle-0.011
Token up
Token ID510
Feature activation-1.55e+00

Pos logit contributions
eddy+0.038
 pans+0.037
ucer+0.035
emo+0.034
loo+0.034
Neg logit contributions
ishly-0.039
 course-0.032
 practice-0.030
 argued-0.029
 advantage-0.029
Token the
Token ID262
Feature activation-2.95e+00

Pos logit contributions
 pans+0.033
eddy+0.030
 flour+0.027
ble+0.026
izz+0.026
Neg logit contributions
ishly-0.052
 argued-0.041
 course-0.040
 echoing-0.040
 lear-0.039
Token ice
Token ID4771
Feature activation-4.59e+00

Pos logit contributions
ucer+0.067
emo+0.066
eddy+0.065
izz+0.064
ooks+0.061
Neg logit contributions
 course-0.070
ishly-0.063
 anymore-0.062
 practice-0.061
 argued-0.060
Token and
Token ID290
Feature activation-9.55e-01

Pos logit contributions
eddy+0.087
 pans+0.085
izz+0.079
emo+0.078
ooks+0.074
Neg logit contributions
ishly-0.127
 course-0.114
 anymore-0.106
 argued-0.105
 advantage-0.102
Token make
Token ID787
Feature activation-1.22e+00

Pos logit contributions
 pans+0.023
eddy+0.023
ucer+0.020
emo+0.020
ble+0.019
Neg logit contributions
ishly-0.027
 course-0.022
 argued-0.021
 lear-0.020
 practice-0.020
Token a
Token ID257
Feature activation-6.74e-01

Pos logit contributions
eddy+0.029
 pans+0.028
ucer+0.025
emo+0.025
izz+0.024
Neg logit contributions
ishly-0.035
 course-0.029
 argued-0.027
 practice-0.027
 lear-0.027
Token new
Token ID649
Feature activation-2.01e+00

Pos logit contributions
 pans+0.017
eddy+0.017
izz+0.016
ucer+0.016
emo+0.016
Neg logit contributions
ishly-0.015
 course-0.014
 practice-0.014
 rather-0.013
 advantage-0.013
Token cake
Token ID12187
Feature activation-1.15e+00

Pos logit contributions
 pans+0.050
izz+0.045
eddy+0.045
ucer+0.044
emo+0.042
Neg logit contributions
ishly-0.052
 course-0.045
 practice-0.042
 advantage-0.042
 lear-0.042
Token cream
Token ID8566
Feature activation-3.03e+00

Pos logit contributions
 pans+0.089
eddy+0.082
ucer+0.076
izz+0.076
ooks+0.074
Neg logit contributions
ishly-0.147
 course-0.131
 argued-0.123
 advantage-0.120
 lear-0.118
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
eddy+0.070
 pans+0.069
ucer+0.061
emo+0.060
loo+0.059
Neg logit contributions
ishly-0.088
 course-0.072
 argued-0.069
 practice-0.069
 advantage-0.067
Token She
Token ID1375
Feature activation-1.65e+00

Pos logit contributions
eddy+0.033
 pans+0.032
ucer+0.028
emo+0.028
izz+0.028
Neg logit contributions
ishly-0.041
 course-0.034
 argued-0.032
 practice-0.032
 lear-0.031
Token put
Token ID1234
Feature activation-2.89e+00

Pos logit contributions
eddy+0.039
 pans+0.039
emo+0.035
izz+0.034
 lad+0.034
Neg logit contributions
ishly-0.046
 advantage-0.037
 course-0.037
 practice-0.036
 argued-0.035
Token the
Token ID262
Feature activation-3.26e+00

Pos logit contributions
eddy+0.072
 pans+0.071
ucer+0.063
izz+0.062
emo+0.062
Neg logit contributions
ishly-0.079
 argued-0.061
 course-0.061
 practice-0.060
 lear-0.060
Token ice
Token ID4771
Feature activation-4.58e+00

Pos logit contributions
eddy+0.075
 pans+0.074
ucer+0.069
izz+0.068
emo+0.068
Neg logit contributions
ishly-0.087
 course-0.079
 practice-0.071
 advantage-0.071
 argued-0.070
Token on
Token ID319
Feature activation-1.44e+00

Pos logit contributions
 pans+0.090
eddy+0.089
ucer+0.080
ooks+0.080
emo+0.079
Neg logit contributions
ishly-0.125
 course-0.114
 argued-0.102
 advantage-0.101
 every-0.099
Token his
Token ID465
Feature activation-2.06e+00

Pos logit contributions
eddy+0.032
 pans+0.031
emo+0.027
ble+0.027
ucer+0.026
Neg logit contributions
ishly-0.043
 argued-0.035
 course-0.034
 echoing-0.033
 lear-0.033
Token toe
Token ID21189
Feature activation-1.52e+00

Pos logit contributions
eddy+0.041
 pans+0.041
ucer+0.035
emo+0.034
izz+0.034
Neg logit contributions
ishly-0.066
 course-0.055
 argued-0.053
 practice-0.053
 lear-0.052
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
eddy+0.028
 pans+0.027
ble+0.023
loo+0.022
ucer+0.022
Neg logit contributions
ishly-0.051
 compete-0.044
 practice-0.043
 echoing-0.043
 course-0.043
Token\n
Token ID198
Feature activation-9.25e-01

Pos logit contributions
eddy+0.035
 pans+0.035
ucer+0.031
emo+0.030
izz+0.030
Neg logit contributions
ishly-0.042
 course-0.035
 practice-0.033
 argued-0.033
 lear-0.032
Token stick
Token ID4859
Feature activation-8.27e-01

Pos logit contributions
ishly+0.022
 argued+0.018
 advantage+0.017
 echoing+0.017
 course+0.017
Neg logit contributions
 pans-0.012
eddy-0.012
 lad-0.010
 glass-0.010
 candle-0.010
Token and
Token ID290
Feature activation-1.67e-01

Pos logit contributions
oe+0.011
eddy+0.011
ble+0.010
onna+0.010
 block+0.010
Neg logit contributions
ishly-0.028
 unnecessary-0.026
itivity-0.026
 meanings-0.025
 antics-0.025
Token put
Token ID1234
Feature activation-2.42e+00

Pos logit contributions
 pans+0.004
 lad+0.003
 salt+0.003
 flour+0.003
 glass+0.003
Neg logit contributions
ishly-0.005
 advantage-0.004
 antics-0.004
 anymore-0.004
 echoing-0.004
Token the
Token ID262
Feature activation-1.44e+00

Pos logit contributions
 pans+0.043
eddy+0.039
izz+0.035
ucer+0.035
 flour+0.035
Neg logit contributions
ishly-0.085
 argued-0.070
 echoing-0.070
 lear-0.069
 rather-0.068
Token other
Token ID584
Feature activation-1.58e+00

Pos logit contributions
 pans+0.028
eddy+0.025
 flour+0.023
ucer+0.023
 salt+0.022
Neg logit contributions
ishly-0.050
 argued-0.040
 echoing-0.039
 course-0.039
 compete-0.038
Token end
Token ID886
Feature activation-4.58e+00

Pos logit contributions
 pans+0.032
eddy+0.029
 flour+0.026
ucer+0.026
loo+0.024
Neg logit contributions
ishly-0.053
 argued-0.042
 course-0.042
 lear-0.042
 echoing-0.041
Token near
Token ID1474
Feature activation-1.84e+00

Pos logit contributions
eddy+0.114
 pans+0.113
ucer+0.100
emo+0.099
izz+0.098
Neg logit contributions
ishly-0.122
 course-0.099
 practice-0.094
 argued-0.094
 advantage-0.091
Token the
Token ID262
Feature activation+3.16e-01

Pos logit contributions
 pans+0.035
eddy+0.033
ble+0.029
ucer+0.029
 flour+0.029
Neg logit contributions
ishly-0.061
 lear-0.049
 argued-0.049
 course-0.048
 echoing-0.048
Token food
Token ID2057
Feature activation-1.17e+00

Pos logit contributions
ishly+0.010
 course+0.008
 argued+0.008
 practice+0.008
 lear+0.008
Neg logit contributions
eddy-0.007
 pans-0.007
ucer-0.006
emo-0.006
loo-0.006
Token.
Token ID13
Feature activation-7.78e-01

Pos logit contributions
 pans+0.020
tub+0.019
ble+0.018
ucer+0.018
eddy+0.018
Neg logit contributions
ishly-0.038
 lear-0.033
 advantage-0.033
 practice-0.033
 argued-0.032
Token When
Token ID1649
Feature activation-1.16e+00

Pos logit contributions
eddy+0.018
 pans+0.018
ucer+0.016
emo+0.016
izz+0.015
Neg logit contributions
ishly-0.023
 course-0.018
 argued-0.018
 practice-0.017
 advantage-0.017
Token He
Token ID679
Feature activation-7.22e-01

Pos logit contributions
 pans+0.015
eddy+0.015
ucer+0.013
izz+0.013
emo+0.013
Neg logit contributions
ishly-0.021
 course-0.018
 argued-0.017
 practice-0.017
 lear-0.017
Token put
Token ID1234
Feature activation-2.26e+00

Pos logit contributions
 pans+0.016
eddy+0.015
 lad+0.014
emo+0.014
ble+0.013
Neg logit contributions
ishly-0.022
 advantage-0.018
 course-0.018
 strides-0.017
 practice-0.017
Token the
Token ID262
Feature activation-2.24e+00

Pos logit contributions
 pans+0.049
eddy+0.049
ucer+0.043
emo+0.042
izz+0.042
Neg logit contributions
ishly-0.068
 course-0.055
 argued-0.054
 practice-0.053
 lear-0.053
Token point
Token ID966
Feature activation-1.83e+00

Pos logit contributions
eddy+0.052
 pans+0.044
izz+0.044
loo+0.042
ucks+0.040
Neg logit contributions
ishly-0.064
 course-0.059
 practice-0.056
 advantage-0.056
 argued-0.055
Tokeny
Token ID88
Feature activation-3.08e+00

Pos logit contributions
eddy+0.039
 pans+0.038
ble+0.034
ucer+0.034
emo+0.033
Neg logit contributions
ishly-0.056
 course-0.046
 argued-0.045
 practice-0.044
 advantage-0.044
Token end
Token ID886
Feature activation-4.56e+00

Pos logit contributions
eddy+0.050
 pans+0.049
ucer+0.040
emo+0.039
izz+0.039
Neg logit contributions
ishly-0.110
 course-0.094
 argued-0.092
 practice-0.091
 lear-0.089
Token in
Token ID287
Feature activation-8.28e-01

Pos logit contributions
eddy+0.125
 pans+0.124
emo+0.113
ucer+0.111
izz+0.110
Neg logit contributions
ishly-0.101
 course-0.087
 practice-0.080
 advantage-0.076
 rather-0.075
Token his
Token ID465
Feature activation-6.08e-01

Pos logit contributions
 pans+0.018
eddy+0.017
ucer+0.015
emo+0.015
izz+0.014
Neg logit contributions
ishly-0.026
 argued-0.020
 course-0.020
 practice-0.020
 lear-0.020
Token mouth
Token ID5422
Feature activation-2.73e-01

Pos logit contributions
 pans+0.011
eddy+0.011
ucer+0.009
emo+0.009
loo+0.009
Neg logit contributions
ishly-0.020
 course-0.017
 argued-0.017
 practice-0.016
 lear-0.016
Token and
Token ID290
Feature activation-4.61e-01

Pos logit contributions
eddy+0.004
 pans+0.004
ble+0.004
tub+0.004
 lad+0.004
Neg logit contributions
ishly-0.010
 course-0.008
 compete-0.008
 lear-0.008
 advantage-0.008
Token l
Token ID300
Feature activation+1.03e+00

Pos logit contributions
 pans+0.010
 lad+0.009
eddy+0.009
 salt+0.008
 glass+0.008
Neg logit contributions
ishly-0.014
 advantage-0.012
 course-0.012
 rather-0.011
 compete-0.011
Token and
Token ID290
Feature activation-6.80e-01

Pos logit contributions
eddy+0.028
 pans+0.025
emo+0.023
ucer+0.022
ble+0.022
Neg logit contributions
ishly-0.042
 course-0.036
 argued-0.035
 echoing-0.034
 practice-0.034
Token Tom
Token ID4186
Feature activation-3.59e-01

Pos logit contributions
 pans+0.010
eddy+0.010
 lad+0.009
 glass+0.009
 flour+0.009
Neg logit contributions
ishly-0.023
 advantage-0.020
date-0.020
 weren-0.020
dates-0.020
Token.
Token ID13
Feature activation-7.77e-01

Pos logit contributions
eddy+0.007
 pans+0.007
emo+0.006
ucer+0.006
 lad+0.006
Neg logit contributions
ishly-0.011
 course-0.009
 echoing-0.009
 lear-0.009
 strides-0.009
Token He
Token ID679
Feature activation-1.26e+00

Pos logit contributions
 pans+0.016
eddy+0.016
ucer+0.014
izz+0.014
emo+0.014
Neg logit contributions
ishly-0.024
 course-0.020
 argued-0.019
 practice-0.019
 lear-0.019
Token shares
Token ID7303
Feature activation-3.22e+00

Pos logit contributions
eddy+0.032
 pans+0.031
emo+0.028
ucer+0.027
loo+0.027
Neg logit contributions
ishly-0.034
 course-0.027
 argued-0.026
 practice-0.025
 advantage-0.025
Token his
Token ID465
Feature activation-4.56e+00

Pos logit contributions
eddy+0.085
 pans+0.085
ucer+0.078
loo+0.077
izz+0.076
Neg logit contributions
ishly-0.078
 course-0.067
 argued-0.062
 practice-0.062
 advantage-0.059
Token other
Token ID584
Feature activation-4.33e+00

Pos logit contributions
eddy+0.086
 pans+0.079
ucer+0.074
izz+0.073
emo+0.070
Neg logit contributions
ishly-0.147
 course-0.132
 argued-0.123
 practice-0.123
 advantage-0.121
Token snacks
Token ID28487
Feature activation-2.71e+00

Pos logit contributions
eddy+0.089
 pans+0.085
ucer+0.080
izz+0.079
emo+0.078
Neg logit contributions
ishly-0.130
 course-0.113
 argued-0.111
 practice-0.106
 advantage-0.106
Token with
Token ID351
Feature activation-1.45e+00

Pos logit contributions
eddy+0.057
 pans+0.055
ucer+0.048
emo+0.048
ble+0.047
Neg logit contributions
ishly-0.084
 course-0.070
 argued-0.069
 practice-0.068
 lear-0.067
Token them
Token ID606
Feature activation-1.32e+00

Pos logit contributions
eddy+0.030
 pans+0.030
 flour+0.027
emo+0.027
izz+0.026
Neg logit contributions
ishly-0.045
 lear-0.036
 echoing-0.035
 argued-0.035
 strides-0.034
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
eddy+0.024
 pans+0.022
oe+0.019
ble+0.019
emo+0.019
Neg logit contributions
ishly-0.044
 course-0.037
 argued-0.037
 strides-0.037
 echoing-0.036
Token they
Token ID484
Feature activation+5.28e-01

Pos logit contributions
eddy+0.036
 pans+0.031
ble+0.029
loo+0.029
ucer+0.028
Neg logit contributions
ishly-0.059
 argued-0.050
 compete-0.047
 strides-0.046
 course-0.045
Token could
Token ID714
Feature activation-1.61e+00

Pos logit contributions
ishly+0.016
 heights+0.016
 antics+0.016
 advantage+0.016
 strides+0.015
Neg logit contributions
izz-0.008
ucer-0.008
 pans-0.007
hes-0.007
eddy-0.006
Token share
Token ID2648
Feature activation-1.48e+00

Pos logit contributions
eddy+0.036
ble+0.036
 pans+0.035
emo+0.034
onna+0.033
Neg logit contributions
ishly-0.044
rified-0.036
 argued-0.036
 lear-0.036
 course-0.034
Token the
Token ID262
Feature activation-3.26e+00

Pos logit contributions
 pans+0.028
eddy+0.027
 flour+0.023
emo+0.023
 salt+0.022
Neg logit contributions
ishly-0.053
 argued-0.042
 lear-0.042
 echoing-0.040
 strides-0.040
Token y
Token ID331
Feature activation-1.11e+00

Pos logit contributions
eddy+0.056
 pans+0.050
ucer+0.048
izz+0.048
ble+0.046
Neg logit contributions
ishly-0.105
 course-0.100
 practice-0.094
 competition-0.090
 advantage-0.090
Tokenummy
Token ID13513
Feature activation-4.56e+00

Pos logit contributions
 pans+0.023
ucer+0.022
eddy+0.022
ble+0.021
izz+0.020
Neg logit contributions
ishly-0.031
 course-0.027
 practice-0.026
 advantage-0.026
 argued-0.025
Token pasta
Token ID26296
Feature activation-1.32e+00

Pos logit contributions
eddy+0.091
izz+0.084
ble+0.082
ucer+0.080
emo+0.078
Neg logit contributions
 course-0.122
ishly-0.120
 practice-0.110
 believe-0.107
 argued-0.106
Token together
Token ID1978
Feature activation-1.23e+00

Pos logit contributions
 pans+0.025
eddy+0.024
tub+0.022
loo+0.021
ucer+0.020
Neg logit contributions
ishly-0.045
 argued-0.038
 practice-0.036
 lear-0.036
 strides-0.035
Token!
Token ID0
Feature activation+4.94e-01

Pos logit contributions
 room+0.013
kins+0.013
eddy+0.012
 pans+0.011
ble+0.011
Neg logit contributions
ishly-0.048
cient-0.041
vious-0.041
ercise-0.040
rified-0.039
Token<|endoftext|>
Token ID50256
Feature activation-1.10e+00

Pos logit contributions
ishly+0.015
 course+0.013
 argued+0.012
 practice+0.012
 lear+0.012
Neg logit contributions
eddy-0.011
 pans-0.011
ucer-0.009
emo-0.009
izz-0.009
TokenOnce
Token ID7454
Feature activation-1.21e+00

Pos logit contributions
 pans+0.026
eddy+0.025
ucer+0.022
emo+0.022
ble+0.021
Neg logit contributions
ishly-0.030
 course-0.027
 practice-0.024
 argued-0.023
 weren-0.023
Token.
Token ID13
Feature activation+4.02e-01

Pos logit contributions
eddy+0.006
 pans+0.006
ucer+0.005
ble+0.005
emo+0.005
Neg logit contributions
ishly-0.006
 course-0.005
 argued-0.005
 practice-0.005
 advantage-0.004
Token They
Token ID1119
Feature activation-1.61e+00

Pos logit contributions
ishly+0.012
 course+0.010
 argued+0.010
 practice+0.010
 lear+0.010
Neg logit contributions
eddy-0.009
 pans-0.009
ucer-0.008
izz-0.007
emo-0.007
Token sit
Token ID1650
Feature activation-1.11e+00

Pos logit contributions
 pans+0.031
 lad+0.028
onna+0.026
 block+0.026
 glass+0.026
Neg logit contributions
ishly-0.050
 echoing-0.040
dates-0.040
 course-0.040
 advantage-0.039
Token on
Token ID319
Feature activation+8.86e-01

Pos logit contributions
 pans+0.031
emo+0.031
onna+0.031
eddy+0.031
ucer+0.029
Neg logit contributions
ishly-0.024
 at-0.017
 course-0.016
 argued-0.016
 forward-0.016
Token each
Token ID1123
Feature activation-2.24e+00

Pos logit contributions
ishly+0.026
 course+0.021
 argued+0.020
 practice+0.020
 lear+0.020
Neg logit contributions
eddy-0.020
 pans-0.020
ucer-0.017
emo-0.017
izz-0.017
Token end
Token ID886
Feature activation-4.56e+00

Pos logit contributions
eddy+0.066
 pans+0.066
ucer+0.059
emo+0.059
izz+0.058
Neg logit contributions
ishly-0.050
 course-0.039
 argued-0.037
 practice-0.036
 lear-0.035
Token.
Token ID13
Feature activation-4.25e-02

Pos logit contributions
eddy+0.160
ucer+0.151
 pans+0.149
icer+0.148
emo+0.142
Neg logit contributions
ishly-0.046
 practice-0.044
 compete-0.042
 course-0.042
 anymore-0.035
Token They
Token ID1119
Feature activation-1.84e+00

Pos logit contributions
 pans+0.001
eddy+0.001
ucer+0.001
ble+0.001
loo+0.001
Neg logit contributions
ishly-0.001
 practice-0.001
 argued-0.001
 lear-0.001
 advantage-0.001
Token push
Token ID4574
Feature activation+6.58e-02

Pos logit contributions
 pans+0.035
 lad+0.032
 glass+0.031
onna+0.031
ble+0.029
Neg logit contributions
ishly-0.057
 echoing-0.046
dates-0.046
 course-0.045
 advantage-0.044
Token with
Token ID351
Feature activation-5.68e-01

Pos logit contributions
ishly+0.002
 course+0.001
 argued+0.001
 lear+0.001
 echoing+0.001
Neg logit contributions
eddy-0.002
 pans-0.002
ucer-0.001
ble-0.001
izz-0.001
Token their
Token ID511
Feature activation-2.14e-01

Pos logit contributions
 pans+0.014
eddy+0.012
 flour+0.011
 salt+0.011
 glass+0.011
Neg logit contributions
ishly-0.016
 lear-0.013
 course-0.013
 argued-0.012
 echoing-0.012
Token happy
Token ID3772
Feature activation-1.24e+00

Pos logit contributions
eddy+0.043
 pans+0.041
ucer+0.040
loo+0.039
emo+0.037
Neg logit contributions
ishly-0.027
 course-0.021
 rather-0.020
 practice-0.020
 reassured-0.020
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 pans+0.038
eddy+0.038
ucer+0.036
izz+0.035
emo+0.034
Neg logit contributions
ishly-0.025
 course-0.020
 argued-0.018
 practice-0.018
 lear-0.017
Token She
Token ID1375
Feature activation-2.17e+00

Pos logit contributions
 pans+0.028
eddy+0.027
ucer+0.025
ble+0.025
emo+0.024
Neg logit contributions
ishly-0.025
 course-0.020
 practice-0.019
 lear-0.019
 argued-0.018
Token wrapped
Token ID12908
Feature activation-2.23e+00

Pos logit contributions
eddy+0.053
 pans+0.053
emo+0.046
izz+0.046
ucer+0.046
Neg logit contributions
ishly-0.060
 course-0.047
 advantage-0.047
 argued-0.046
 practice-0.046
Token the
Token ID262
Feature activation-3.54e+00

Pos logit contributions
eddy+0.056
 pans+0.055
loo+0.050
ucer+0.050
emo+0.048
Neg logit contributions
ishly-0.059
 course-0.049
 argued-0.045
 practice-0.044
 advantage-0.044
Token rest
Token ID1334
Feature activation-4.55e+00

Pos logit contributions
eddy+0.089
 pans+0.084
izz+0.083
emo+0.081
ucks+0.080
Neg logit contributions
ishly-0.086
 course-0.078
 argued-0.069
 advantage-0.068
 practice-0.067
Token of
Token ID286
Feature activation-2.38e+00

Pos logit contributions
 pans+0.119
eddy+0.115
ucer+0.103
emo+0.103
izz+0.101
Neg logit contributions
ishly-0.113
 course-0.094
 argued-0.086
 practice-0.085
 lear-0.084
Token the
Token ID262
Feature activation-2.81e+00

Pos logit contributions
eddy+0.059
 pans+0.057
ucer+0.053
loo+0.051
izz+0.051
Neg logit contributions
ishly-0.062
 course-0.055
 practice-0.050
 argued-0.049
 advantage-0.047
Token mel
Token ID7758
Feature activation-1.08e+00

Pos logit contributions
eddy+0.069
 pans+0.066
izz+0.063
emo+0.062
ucer+0.062
Neg logit contributions
ishly-0.071
 course-0.063
 argued-0.058
 practice-0.057
 advantage-0.056
Tokenon
Token ID261
Feature activation-1.75e+00

Pos logit contributions
 pans+0.023
eddy+0.022
emo+0.021
tub+0.020
ucks+0.019
Neg logit contributions
ishly-0.030
 course-0.028
 practice-0.024
 argued-0.023
 weren-0.023
Token in
Token ID287
Feature activation-1.41e+00

Pos logit contributions
eddy+0.040
 pans+0.039
ucer+0.035
ble+0.034
emo+0.034
Neg logit contributions
ishly-0.052
 course-0.043
 argued-0.041
 practice-0.041
 lear-0.040
Token Tom
Token ID4186
Feature activation-6.77e-01

Pos logit contributions
 pans+0.002
eddy+0.002
izz+0.001
ucer+0.001
emo+0.001
Neg logit contributions
ishly-0.003
 course-0.002
 argued-0.002
 advantage-0.002
 practice-0.002
Token says
Token ID1139
Feature activation+2.55e-01

Pos logit contributions
 pans+0.012
eddy+0.011
 lad+0.011
ucks+0.010
Anna+0.010
Neg logit contributions
ishly-0.022
 course-0.019
 lear-0.018
 advantage-0.018
nered-0.018
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
ishly+0.007
 course+0.006
 practice+0.006
 argued+0.006
 lear+0.005
Neg logit contributions
 pans-0.006
eddy-0.006
ble-0.005
oe-0.005
ucer-0.005
Token He
Token ID679
Feature activation-1.47e+00

Pos logit contributions
eddy+0.027
 pans+0.026
ucer+0.024
ble+0.023
loo+0.023
Neg logit contributions
ishly-0.034
 course-0.028
 practice-0.028
 argued-0.028
 lear-0.027
Token fills
Token ID23816
Feature activation-3.65e+00

Pos logit contributions
eddy+0.034
 pans+0.033
ucer+0.030
emo+0.029
loo+0.029
Neg logit contributions
ishly-0.043
 course-0.035
 practice-0.034
 argued-0.034
 lear-0.033
Token the
Token ID262
Feature activation-4.54e+00

Pos logit contributions
loo+0.137
eddy+0.136
 outage+0.123
ucer+0.123
onna+0.120
Neg logit contributions
 every-0.035
 all-0.035
 as-0.035
ishly-0.033
 after-0.031
Token pan
Token ID3425
Feature activation-1.55e+00

Pos logit contributions
emo+0.094
eddy+0.090
igi+0.088
onna+0.082
ucer+0.081
Neg logit contributions
ishly-0.105
 at-0.101
 course-0.097
 competition-0.090
 advantage-0.089
Token with
Token ID351
Feature activation-2.50e+00

Pos logit contributions
eddy+0.041
emo+0.040
 pans+0.040
izz+0.039
loo+0.038
Neg logit contributions
ishly-0.038
 argued-0.027
 practice-0.026
 course-0.026
 rather-0.026
Token water
Token ID1660
Feature activation-3.14e+00

Pos logit contributions
 pans+0.042
eddy+0.041
ucer+0.034
emo+0.033
izz+0.033
Neg logit contributions
ishly-0.088
 course-0.075
 argued-0.073
 practice-0.072
 lear-0.071
Token and
Token ID290
Feature activation-2.13e+00

Pos logit contributions
eddy+0.079
 pans+0.078
ucer+0.069
emo+0.068
izz+0.068
Neg logit contributions
ishly-0.084
 course-0.068
 argued-0.065
 practice-0.064
 lear-0.063
Token puts
Token ID7584
Feature activation-2.77e+00

Pos logit contributions
 pans+0.042
eddy+0.040
ucer+0.034
emo+0.034
izz+0.034
Neg logit contributions
ishly-0.069
 course-0.058
 argued-0.056
 advantage-0.055
 practice-0.055
Token naughty
Token ID45128
Feature activation-9.38e-01

Pos logit contributions
eddy+0.014
 pans+0.013
ucer+0.012
loo+0.012
emo+0.012
Neg logit contributions
ishly-0.012
 course-0.009
 practice-0.009
 argued-0.009
 advantage-0.009
Token.
Token ID13
Feature activation+1.88e-01

Pos logit contributions
eddy+0.019
 pans+0.017
loo+0.014
 twins+0.014
 lad+0.014
Neg logit contributions
ishly-0.032
 course-0.027
 practice-0.025
 advantage-0.025
 argued-0.025
Token You
Token ID921
Feature activation+7.89e-01

Pos logit contributions
ishly+0.005
 course+0.004
 practice+0.004
 argued+0.004
 lear+0.004
Neg logit contributions
 pans-0.005
eddy-0.005
ucer-0.004
loo-0.004
ble-0.004
Token wasted
Token ID18359
Feature activation-2.26e+00

Pos logit contributions
ishly+0.023
 course+0.018
 echoing+0.018
 lear+0.018
 strides+0.017
Neg logit contributions
 pans-0.018
eddy-0.018
izz-0.016
ucer-0.016
emo-0.016
Token the
Token ID262
Feature activation-3.82e+00

Pos logit contributions
 pans+0.042
eddy+0.039
 flour+0.036
 salt+0.034
 glass+0.034
Neg logit contributions
ishly-0.080
 echoing-0.064
 compete-0.063
 argued-0.063
 lear-0.062
Token expensive
Token ID5789
Feature activation-4.54e+00

Pos logit contributions
eddy+0.064
 pans+0.058
ucer+0.052
izz+0.050
emo+0.049
Neg logit contributions
ishly-0.131
 course-0.120
 practice-0.114
 argued-0.112
 advantage-0.112
Token cream
Token ID8566
Feature activation-2.77e+00

Pos logit contributions
eddy+0.066
 pans+0.065
ucer+0.053
emo+0.052
izz+0.051
Neg logit contributions
ishly-0.169
 course-0.146
 argued-0.141
 practice-0.141
 lear-0.138
Token."
Token ID526
Feature activation-1.32e-01

Pos logit contributions
eddy+0.058
 pans+0.058
ucer+0.050
loo+0.048
ble+0.048
Neg logit contributions
ishly-0.088
 echoing-0.075
 practice-0.074
 argued-0.073
 strides-0.072
Token\n
Token ID198
Feature activation-4.48e-01

Pos logit contributions
 pans+0.003
eddy+0.003
ucer+0.003
ble+0.003
loo+0.003
Neg logit contributions
ishly-0.003
 course-0.003
 practice-0.003
 argued-0.003
 lear-0.003
Token\n
Token ID198
Feature activation-8.64e-01

Pos logit contributions
 pans+0.014
eddy+0.013
ucer+0.012
loo+0.012
emo+0.011
Neg logit contributions
ishly-0.009
 course-0.008
 practice-0.008
 rather-0.007
 advantage-0.006
TokenAnna
Token ID31160
Feature activation-5.40e-01

Pos logit contributions
eddy+0.012
 pans+0.011
loo+0.010
emo+0.010
ucer+0.009
Neg logit contributions
ishly-0.032
 course-0.030
 practice-0.029
 argued-0.027
 lear-0.027
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
 pans+0.013
eddy+0.012
 lad+0.011
ucer+0.011
izz+0.011
Neg logit contributions
ishly-0.019
 course-0.016
 lear-0.016
 strides-0.015
 advantage-0.015
Token She
Token ID1375
Feature activation-1.68e+00

Pos logit contributions
eddy+0.023
 pans+0.023
ucer+0.020
emo+0.020
izz+0.020
Neg logit contributions
ishly-0.032
 course-0.027
 argued-0.026
 practice-0.026
 lear-0.025
Token picks
Token ID11103
Feature activation-1.09e+00

Pos logit contributions
 pans+0.039
eddy+0.039
emo+0.034
ucer+0.034
izz+0.033
Neg logit contributions
ishly-0.048
 course-0.039
 argued-0.037
 practice-0.037
 advantage-0.037
Token up
Token ID510
Feature activation-3.87e-01

Pos logit contributions
eddy+0.027
 pans+0.026
ucer+0.023
emo+0.023
izz+0.023
Neg logit contributions
ishly-0.030
 course-0.024
 argued-0.023
 practice-0.023
 lear-0.023
Token a
Token ID257
Feature activation-3.36e+00

Pos logit contributions
 pans+0.009
eddy+0.008
 flour+0.007
izz+0.007
ucer+0.007
Neg logit contributions
ishly-0.012
 course-0.009
 argued-0.009
 lear-0.009
 practice-0.009
Token clean
Token ID3424
Feature activation-4.54e+00

Pos logit contributions
eddy+0.066
izz+0.059
emo+0.058
ucks+0.057
 pans+0.056
Neg logit contributions
ishly-0.099
 practice-0.086
 course-0.086
 competition-0.085
 lear-0.081
Token dish
Token ID9433
Feature activation-1.94e+00

Pos logit contributions
eddy+0.097
izz+0.088
emo+0.087
ble+0.087
ucer+0.083
Neg logit contributions
ishly-0.132
 course-0.115
 practice-0.111
 competition-0.109
 argued-0.105
Token and
Token ID290
Feature activation-1.51e+00

Pos logit contributions
 pans+0.043
eddy+0.043
emo+0.039
izz+0.038
ucer+0.038
Neg logit contributions
ishly-0.055
 course-0.045
 argued-0.043
 practice-0.042
 lear-0.042
Token puts
Token ID7584
Feature activation-2.78e+00

Pos logit contributions
 pans+0.030
eddy+0.028
 flour+0.024
izz+0.024
 lad+0.023
Neg logit contributions
ishly-0.050
 course-0.042
 advantage-0.040
 argued-0.040
 lear-0.039
Token it
Token ID340
Feature activation-1.22e+00

Pos logit contributions
eddy+0.063
 pans+0.063
ucer+0.055
emo+0.054
loo+0.054
Neg logit contributions
ishly-0.081
 course-0.067
 argued-0.064
 practice-0.064
 advantage-0.062
Token in
Token ID287
Feature activation-1.89e+00

Pos logit contributions
 pans+0.025
eddy+0.024
ucer+0.021
ble+0.020
izz+0.020
Neg logit contributions
ishly-0.040
 course-0.033
 lear-0.032
 practice-0.032
 argued-0.032
Token a
Token ID257
Feature activation+3.21e-01

Pos logit contributions
eddy+0.008
 pans+0.008
 flour+0.007
ucer+0.007
izz+0.007
Neg logit contributions
ishly-0.012
 course-0.010
 argued-0.010
 practice-0.010
 advantage-0.010
Token man
Token ID582
Feature activation-3.50e-01

Pos logit contributions
ishly+0.009
 course+0.008
 competition+0.007
 practice+0.007
 lear+0.007
Neg logit contributions
 pans-0.007
eddy-0.007
izz-0.006
emo-0.006
ucks-0.006
Token with
Token ID351
Feature activation-1.62e+00

Pos logit contributions
 pans+0.009
eddy+0.008
ucer+0.008
emo+0.008
izz+0.007
Neg logit contributions
ishly-0.010
 course-0.008
 argued-0.007
 practice-0.007
 lear-0.007
Token a
Token ID257
Feature activation-1.57e+00

Pos logit contributions
 pans+0.032
eddy+0.032
 flour+0.027
ucer+0.026
emo+0.025
Neg logit contributions
ishly-0.053
 course-0.043
 argued-0.042
 lear-0.042
 advantage-0.041
Token big
Token ID1263
Feature activation-9.23e-01

Pos logit contributions
eddy+0.029
 pans+0.028
ucer+0.024
emo+0.024
izz+0.024
Neg logit contributions
ishly-0.052
 course-0.044
 argued-0.042
 practice-0.042
 lear-0.041
Token ice
Token ID4771
Feature activation-4.50e+00

Pos logit contributions
eddy+0.019
 pans+0.019
ucer+0.016
emo+0.015
loo+0.015
Neg logit contributions
ishly-0.029
 course-0.024
 argued-0.024
 practice-0.023
 lear-0.023
Token cream
Token ID8566
Feature activation-3.05e+00

Pos logit contributions
 pans+0.094
ooks+0.091
emo+0.091
stals+0.088
izz+0.087
Neg logit contributions
ishly-0.107
 course-0.102
 at-0.090
 anymore-0.083
 competition-0.083
Token truck
Token ID7779
Feature activation-1.23e+00

Pos logit contributions
 pans+0.065
eddy+0.064
emo+0.059
izz+0.058
ucer+0.056
Neg logit contributions
ishly-0.091
 course-0.077
 argued-0.072
 lear-0.072
 practice-0.070
Token.
Token ID13
Feature activation-9.23e-01

Pos logit contributions
eddy+0.031
 pans+0.030
ucer+0.027
loo+0.026
ble+0.026
Neg logit contributions
ishly-0.033
 course-0.027
 practice-0.026
 argued-0.026
 advantage-0.025
Token Lily
Token ID20037
Feature activation-3.78e-01

Pos logit contributions
 pans+0.024
eddy+0.024
ble+0.024
ucer+0.023
loo+0.022
Neg logit contributions
ishly-0.023
 course-0.018
 argued-0.017
 practice-0.017
 weren-0.016
Token's
Token ID338
Feature activation+4.95e-02

Pos logit contributions
 pans+0.008
eddy+0.008
emo+0.007
 lad+0.007
 room+0.007
Neg logit contributions
ishly-0.011
 course-0.009
 advantage-0.009
 lear-0.009
 competition-0.008
Token Then
Token ID3244
Feature activation+1.48e-01

Pos logit contributions
eddy+0.002
 pans+0.002
izz+0.002
emo+0.002
tub+0.002
Neg logit contributions
ishly-0.002
 course-0.002
 argued-0.002
 advantage-0.002
 practice-0.002
Token she
Token ID673
Feature activation+2.54e-01

Pos logit contributions
ishly+0.004
 course+0.004
vious+0.004
 advantage+0.004
 rather+0.004
Neg logit contributions
 pans-0.003
eddy-0.003
ucer-0.003
 flour-0.002
ble-0.002
Token put
Token ID1234
Feature activation-2.02e+00

Pos logit contributions
ishly+0.008
 course+0.007
 advantage+0.007
 compete+0.006
 strides+0.006
Neg logit contributions
 pans-0.005
eddy-0.005
izz-0.005
 lad-0.004
ucer-0.004
Token a
Token ID257
Feature activation-2.48e+00

Pos logit contributions
 pans+0.043
eddy+0.043
ucer+0.038
izz+0.037
emo+0.036
Neg logit contributions
ishly-0.062
 course-0.050
 argued-0.049
 practice-0.048
 lear-0.048
Token special
Token ID2041
Feature activation-3.60e+00

Pos logit contributions
eddy+0.047
 pans+0.044
emo+0.040
izz+0.040
loo+0.039
Neg logit contributions
ishly-0.079
 course-0.068
 practice-0.067
 lear-0.065
 argued-0.064
Token cleaner
Token ID21723
Feature activation-4.50e+00

Pos logit contributions
eddy+0.065
 pans+0.064
ucer+0.055
emo+0.054
izz+0.054
Neg logit contributions
ishly-0.120
 course-0.103
 practice-0.099
 argued-0.098
 lear-0.097
Token on
Token ID319
Feature activation-5.07e-01

Pos logit contributions
eddy+0.096
 pans+0.096
ucer+0.082
loo+0.079
ble+0.078
Neg logit contributions
ishly-0.141
 course-0.113
 argued-0.112
 practice-0.109
 advantage-0.108
Token it
Token ID340
Feature activation-2.35e-01

Pos logit contributions
 pans+0.009
eddy+0.009
ble+0.008
 flour+0.008
izz+0.007
Neg logit contributions
ishly-0.017
 argued-0.015
 lear-0.014
vious-0.014
 reassured-0.014
Token and
Token ID290
Feature activation-7.62e-01

Pos logit contributions
eddy+0.003
 room+0.003
 pans+0.003
tub+0.003
oe+0.003
Neg logit contributions
ishly-0.008
 lear-0.008
 antics-0.007
 argued-0.007
 compete-0.007
Token scrub
Token ID27268
Feature activation-6.26e-01

Pos logit contributions
 pans+0.017
eddy+0.016
ucer+0.014
izz+0.014
 flour+0.014
Neg logit contributions
ishly-0.024
 course-0.020
 advantage-0.019
 argued-0.019
 lear-0.019
Tokenbed
Token ID3077
Feature activation-3.92e-01

Pos logit contributions
eddy+0.019
 pans+0.019
ucer+0.017
loo+0.017
emo+0.017
Neg logit contributions
ishly-0.014
 course-0.010
 argued-0.010
 practice-0.010
 lear-0.009
Token.
Token ID13
Feature activation+3.65e-01

Pos logit contributions
ishly+0.028
 lear+0.024
rified+0.024
 reassured+0.024
 strides+0.024
Neg logit contributions
 pans-0.011
 room-0.011
tub-0.010
eddy-0.010
 flour-0.009
Token They
Token ID1119
Feature activation-3.36e-01

Pos logit contributions
ishly+0.008
 course+0.006
 practice+0.006
 argued+0.006
 lear+0.006
Neg logit contributions
 pans-0.011
eddy-0.011
ucer-0.010
ble-0.010
emo-0.009
Token make
Token ID787
Feature activation+8.35e-01

Pos logit contributions
 pans+0.008
 lad+0.007
eddy+0.007
 candle+0.006
emo+0.006
Neg logit contributions
ishly-0.009
 course-0.007
 echoing-0.007
 advantage-0.007
 strides-0.007
Token rings
Token ID13917
Feature activation+2.41e+00

Pos logit contributions
ishly+0.026
 course+0.021
 argued+0.020
 practice+0.020
 lear+0.020
Neg logit contributions
eddy-0.018
 pans-0.018
emo-0.015
ucer-0.015
ble-0.015
Token and
Token ID290
Feature activation+6.20e-01

Pos logit contributions
ishly+0.063
 course+0.051
 argued+0.049
 practice+0.048
 lear+0.047
Neg logit contributions
eddy-0.062
 pans-0.061
ucer-0.054
emo-0.054
izz-0.053
Token brace
Token ID22581
Feature activation+4.02e+00

Pos logit contributions
ishly+0.019
 course+0.015
 advantage+0.014
rified+0.014
 argued+0.014
Neg logit contributions
 pans-0.015
eddy-0.013
 lad-0.011
ucer-0.011
ble-0.011
Tokenlets
Token ID5289
Feature activation+1.48e+00

Pos logit contributions
ishly+0.080
 course+0.059
 argued+0.056
 practice+0.055
 advantage+0.053
Neg logit contributions
eddy-0.128
 pans-0.128
ucer-0.116
emo-0.115
izz-0.115
Token and
Token ID290
Feature activation+9.97e-01

Pos logit contributions
ishly+0.047
 course+0.039
 argued+0.038
 advantage+0.037
 practice+0.037
Neg logit contributions
eddy-0.029
 pans-0.028
ble-0.027
ucer-0.026
tub-0.025
Token neck
Token ID7393
Feature activation+1.16e+00

Pos logit contributions
ishly+0.031
 course+0.025
rified+0.025
 advantage+0.024
 echoing+0.024
Neg logit contributions
 pans-0.023
eddy-0.018
 glass-0.017
 flour-0.017
 salt-0.017
Tokenl
Token ID75
Feature activation+2.97e+00

Pos logit contributions
ishly+0.039
 course+0.032
 argued+0.031
 lear+0.031
 practice+0.031
Neg logit contributions
eddy-0.022
 pans-0.022
ucer-0.018
ble-0.018
emo-0.018
Tokenaces
Token ID2114
Feature activation+2.54e+00

Pos logit contributions
ishly+0.094
 competition+0.078
 course+0.077
 reassured+0.075
 lear+0.074
Neg logit contributions
eddy-0.060
ble-0.051
 pans-0.050
hes-0.048
ak-0.048
Token were
Token ID547
Feature activation+1.22e+00

Pos logit contributions
 unnecessary+0.037
 outcomes+0.036
 advantage+0.036
ishly+0.034
 antics+0.034
Neg logit contributions
 pans-0.018
ull-0.017
 candle-0.017
 pot-0.016
ucer-0.016
Token always
Token ID1464
Feature activation+2.81e-02

Pos logit contributions
ishly+0.033
 course+0.028
rified+0.028
 lear+0.027
 argued+0.027
Neg logit contributions
 pans-0.027
eddy-0.026
ble-0.024
 twins-0.023
emo-0.023
Token together
Token ID1978
Feature activation+1.10e+00

Pos logit contributions
ishly+0.001
 course+0.001
 lear+0.001
rified+0.001
 weren+0.001
Neg logit contributions
 pans-0.001
eddy-0.000
 room-0.000
 twins-0.000
emo-0.000
Token hopping
Token ID47153
Feature activation+2.22e+00

Pos logit contributions
ishly+0.039
rified+0.035
vious+0.034
 argued+0.034
 advantage+0.033
Neg logit contributions
 twins-0.012
 pans-0.012
onna-0.011
oe-0.011
eddy-0.011
Token and
Token ID290
Feature activation+2.01e+00

Pos logit contributions
ishly+0.071
 argued+0.065
istent+0.065
iable+0.064
ird+0.064
Neg logit contributions
tub-0.032
eddy-0.031
oe-0.031
ull-0.029
 pans-0.029
Token ch
Token ID442
Feature activation+3.95e+00

Pos logit contributions
ishly+0.058
vious+0.056
 advantage+0.055
rified+0.053
 course+0.051
Neg logit contributions
 pans-0.036
emo-0.031
onna-0.030
ull-0.030
ucer-0.029
Tokenir
Token ID343
Feature activation+2.21e+00

Pos logit contributions
ishly+0.127
 course+0.104
 competition+0.104
 argued+0.099
 reassured+0.098
Neg logit contributions
emo-0.074
eddy-0.073
ull-0.070
izz-0.069
 pans-0.067
Tokenping
Token ID13886
Feature activation+2.92e+00

Pos logit contributions
ishly+0.071
 competition+0.067
 practice+0.064
 course+0.063
 compete+0.063
Neg logit contributions
eddy-0.036
emo-0.032
ull-0.031
ble-0.030
oe-0.027
Token.
Token ID13
Feature activation+2.52e+00

Pos logit contributions
ishly+0.096
iable+0.088
eday+0.086
ird+0.085
 argued+0.085
Neg logit contributions
eddy-0.039
ull-0.039
 pans-0.037
 twins-0.035
tub-0.034
Token
Token ID220
Feature activation-5.31e-02

Pos logit contributions
ishly+0.087
 argued+0.078
 competition+0.074
 course+0.074
 lear+0.073
Neg logit contributions
eddy-0.041
 pans-0.034
emo-0.031
ucks-0.030
izz-0.029
Token
Token ID198
Feature activation+1.29e+00

Pos logit contributions
eddy+0.001
 pans+0.001
emo+0.001
ucer+0.001
izz+0.001
Neg logit contributions
ishly-0.002
 course-0.001
 argued-0.001
 lear-0.001
 practice-0.001
Token like
Token ID588
Feature activation+8.47e-01

Pos logit contributions
ishly+0.011
 advantage+0.008
 course+0.008
 lear+0.008
 argued+0.008
Neg logit contributions
 pans-0.010
 flour-0.008
eddy-0.008
 salt-0.008
 glass-0.008
Token monkeys
Token ID26697
Feature activation+3.30e+00

Pos logit contributions
ishly+0.028
 advantage+0.023
istent+0.022
 reassured+0.022
 argued+0.022
Neg logit contributions
 pans-0.018
 flour-0.015
 salt-0.014
eddy-0.014
 glass-0.014
Token and
Token ID290
Feature activation+2.11e+00

Pos logit contributions
ishly+0.107
 advantage+0.104
 lear+0.103
 persever+0.102
 practice+0.098
Neg logit contributions
 pans-0.046
eddy-0.043
onna-0.042
oe-0.042
 flour-0.040
Token elephants
Token ID25368
Feature activation+3.07e+00

Pos logit contributions
ishly+0.065
 advantage+0.056
istent+0.055
 reassured+0.053
date+0.053
Neg logit contributions
 pans-0.047
 salt-0.037
 glass-0.037
 flour-0.036
 lad-0.035
Token and
Token ID290
Feature activation+2.04e+00

Pos logit contributions
ishly+0.118
 persever+0.109
 advantage+0.108
 practice+0.104
 practise+0.103
Neg logit contributions
 pans-0.029
eddy-0.028
 twins-0.026
 block-0.024
onna-0.023
Token gir
Token ID37370
Feature activation+5.23e+00

Pos logit contributions
ishly+0.066
istent+0.057
 advantage+0.055
date+0.053
 reassured+0.053
Neg logit contributions
 pans-0.043
 glass-0.035
 salt-0.034
 flour-0.033
 twins-0.032
Tokenaff
Token ID2001
Feature activation+2.70e+00

Pos logit contributions
ishly+0.193
 reassured+0.175
 practice+0.171
date+0.170
 practise+0.170
Neg logit contributions
eddy-0.061
onna-0.050
loo-0.049
 pans-0.047
ucks-0.046
Tokenes
Token ID274
Feature activation+2.21e+00

Pos logit contributions
ishly+0.087
 practice+0.072
 advantage+0.070
 competition+0.069
 course+0.068
Neg logit contributions
eddy-0.053
 pans-0.052
emo-0.050
ble-0.050
ull-0.050
Token.
Token ID13
Feature activation+4.08e-01

Pos logit contributions
ishly+0.088
 persever+0.087
 advantage+0.083
 practice+0.082
istent+0.082
Neg logit contributions
 pans-0.012
eddy-0.011
tub-0.010
kins-0.010
ull-0.009
Token They
Token ID1119
Feature activation+1.48e-02

Pos logit contributions
ishly+0.014
 course+0.012
 argued+0.011
 practice+0.011
 advantage+0.011
Neg logit contributions
eddy-0.007
 pans-0.007
emo-0.006
izz-0.006
ucer-0.006
Token had
Token ID550
Feature activation+6.90e-01

Pos logit contributions
ishly+0.001
 advantage+0.000
 unnecessary+0.000
 strides+0.000
 course+0.000
Neg logit contributions
 pans-0.000
 lad-0.000
emo-0.000
 glass-0.000
kins-0.000
Tokenimes
Token ID999
Feature activation+2.87e+00

Pos logit contributions
ishly+0.052
 course+0.040
 argued+0.039
 practice+0.038
 lear+0.037
Neg logit contributions
eddy-0.055
 pans-0.054
izz-0.049
emo-0.049
ucer-0.049
Token.
Token ID13
Feature activation+2.64e-01

Pos logit contributions
ishly+0.100
 lear+0.087
 argued+0.083
 echoing+0.082
 strides+0.081
Neg logit contributions
 pans-0.050
eddy-0.049
ble-0.045
 flour-0.043
oons-0.041
Token He
Token ID679
Feature activation-3.82e-01

Pos logit contributions
ishly+0.005
 course+0.004
 practice+0.004
 compete+0.004
 advantage+0.004
Neg logit contributions
 pans-0.008
eddy-0.008
ucer-0.008
izz-0.007
ble-0.007
Token thought
Token ID1807
Feature activation+1.72e-01

Pos logit contributions
 pans+0.010
eddy+0.010
emo+0.009
izz+0.009
onna+0.009
Neg logit contributions
ishly-0.010
 advantage-0.007
 course-0.007
 echoing-0.007
 strides-0.007
Token l
Token ID300
Feature activation+3.62e+00

Pos logit contributions
ishly+0.005
 course+0.004
 argued+0.004
 practice+0.004
 advantage+0.004
Neg logit contributions
 pans-0.004
eddy-0.004
emo-0.003
izz-0.003
ucer-0.003
Tokenimes
Token ID999
Feature activation+4.59e+00

Pos logit contributions
ishly+0.103
 argued+0.080
 echoing+0.080
 course+0.079
 practice+0.079
Neg logit contributions
eddy-0.082
 pans-0.081
izz-0.081
emo-0.077
ble-0.073
Token were
Token ID547
Feature activation+1.03e+00

Pos logit contributions
ishly+0.132
 lear+0.122
 strides+0.109
 echoing+0.108
 antics+0.106
Neg logit contributions
 pans-0.098
ble-0.095
kins-0.091
ucer-0.091
eddy-0.090
Token the
Token ID262
Feature activation-1.94e-01

Pos logit contributions
ishly+0.029
 course+0.023
 argued+0.023
 lear+0.022
 practice+0.022
Neg logit contributions
 pans-0.025
eddy-0.025
emo-0.022
ucer-0.021
ble-0.021
Token perfect
Token ID2818
Feature activation-6.54e-01

Pos logit contributions
 pans+0.005
eddy+0.005
 flour+0.004
ble+0.004
emo+0.004
Neg logit contributions
ishly-0.006
 argued-0.004
vered-0.004
 course-0.004
 strides-0.004
Token fruit
Token ID8234
Feature activation+1.83e+00

Pos logit contributions
 pans+0.013
eddy+0.013
 flour+0.012
 salt+0.012
onna+0.012
Neg logit contributions
ishly-0.021
 echoing-0.016
 argued-0.016
 reassured-0.016
 strides-0.015
Token.
Token ID13
Feature activation+8.25e-01

Pos logit contributions
ishly+0.058
 practice+0.049
 lear+0.049
 advantage+0.048
 argued+0.048
Neg logit contributions
 pans-0.038
eddy-0.037
loo-0.032
 flour-0.032
onna-0.031
Token continue
Token ID2555
Feature activation-1.26e+00

Pos logit contributions
ishly+0.015
rified+0.013
vious+0.013
 argued+0.013
 lear+0.013
Neg logit contributions
ble-0.011
eddy-0.011
 pans-0.011
emo-0.011
 lad-0.010
Token exploring
Token ID13504
Feature activation+1.77e+00

Pos logit contributions
 pans+0.021
eddy+0.019
 room+0.018
 boiling+0.018
ble+0.018
Neg logit contributions
ishly-0.042
rified-0.038
vious-0.035
affe-0.035
 argued-0.034
Token.
Token ID13
Feature activation+5.88e-02

Pos logit contributions
ishly+0.065
 argued+0.058
istent+0.053
 meanings+0.053
 lear+0.052
Neg logit contributions
 pans-0.029
ble-0.023
eddy-0.023
 salt-0.023
 boiling-0.021
Token He
Token ID679
Feature activation+2.37e-01

Pos logit contributions
ishly+0.002
 course+0.002
 argued+0.002
 practice+0.002
 lear+0.002
Neg logit contributions
eddy-0.001
 pans-0.001
emo-0.001
loo-0.001
ucer-0.001
Token saw
Token ID2497
Feature activation+6.96e-01

Pos logit contributions
ishly+0.007
 course+0.006
 advantage+0.006
 anymore+0.006
 lear+0.006
Neg logit contributions
 pans-0.005
 lad-0.004
emo-0.004
izz-0.004
eddy-0.004
Token ze
Token ID41271
Feature activation+4.00e+00

Pos logit contributions
ishly+0.023
istent+0.019
 argued+0.019
 compete+0.019
 lear+0.018
Neg logit contributions
 pans-0.013
 flour-0.012
eddy-0.011
 salt-0.011
ull-0.011
Tokenbr
Token ID1671
Feature activation+1.86e+00

Pos logit contributions
ishly+0.135
 argued+0.120
 advantage+0.118
 former+0.108
 rather+0.105
Neg logit contributions
 pans-0.064
izz-0.054
loo-0.053
oons-0.053
oe-0.051
Tokenas
Token ID292
Feature activation+1.25e+00

Pos logit contributions
ishly+0.053
 course+0.044
 argued+0.043
 practice+0.041
 lear+0.041
Neg logit contributions
 pans-0.043
eddy-0.042
ucer-0.037
emo-0.036
ble-0.036
Token and
Token ID290
Feature activation+9.72e-01

Pos logit contributions
 advantage+0.033
ishly+0.032
 meanings+0.032
Deal+0.031
 argued+0.030
Neg logit contributions
 pans-0.023
ble-0.023
eddy-0.023
tub-0.021
oe-0.020
Token gir
Token ID37370
Feature activation+4.28e+00

Pos logit contributions
ishly+0.030
 advantage+0.026
 course+0.023
date+0.023
 anymore+0.023
Neg logit contributions
 pans-0.022
 lad-0.017
emo-0.017
 salt-0.017
onna-0.016
Tokenaff
Token ID2001
Feature activation+2.06e+00

Pos logit contributions
ishly+0.136
date+0.125
 competition+0.121
 advantage+0.121
 practice+0.119
Neg logit contributions
eddy-0.065
 pans-0.055
onna-0.053
ucker-0.052
us-0.051
Token
Token ID198
Feature activation-1.16e+00

Pos logit contributions
 pans+0.016
eddy+0.015
ucer+0.014
loo+0.013
tub+0.013
Neg logit contributions
ishly-0.016
 course-0.013
 practice-0.013
 argued-0.012
 advantage-0.012
Token"
Token ID1
Feature activation-9.47e-02

Pos logit contributions
eddy+0.027
 pans+0.025
emo+0.023
ucer+0.022
loo+0.022
Neg logit contributions
ishly-0.034
 course-0.029
 practice-0.027
 argued-0.027
 lear-0.026
TokenTom
Token ID13787
Feature activation-1.03e+00

Pos logit contributions
eddy+0.002
 pans+0.002
ucer+0.002
loo+0.002
emo+0.002
Neg logit contributions
ishly-0.003
 course-0.002
 argued-0.002
 practice-0.002
 lear-0.002
Token,
Token ID11
Feature activation+8.47e-02

Pos logit contributions
eddy+0.031
 pans+0.031
ucer+0.028
emo+0.028
izz+0.027
Neg logit contributions
ishly-0.022
 course-0.017
 argued-0.016
 practice-0.016
 advantage-0.016
Token are
Token ID389
Feature activation-1.39e+00

Pos logit contributions
ishly+0.002
 course+0.002
 argued+0.002
 practice+0.002
 lear+0.002
Neg logit contributions
 pans-0.002
eddy-0.002
ucer-0.002
emo-0.002
izz-0.002
Token you
Token ID345
Feature activation+1.02e+00

Pos logit contributions
 pans+0.029
eddy+0.027
izz+0.024
emo+0.024
 flour+0.024
Neg logit contributions
ishly-0.043
 lear-0.035
 argued-0.034
 echoing-0.034
 course-0.034
Token okay
Token ID8788
Feature activation+6.62e-02

Pos logit contributions
ishly+0.022
 course+0.016
 argued+0.016
 lear+0.015
 practice+0.015
Neg logit contributions
 pans-0.031
eddy-0.031
ble-0.029
ucer-0.028
izz-0.028
Token?"
Token ID1701
Feature activation-5.53e-02

Pos logit contributions
ishly+0.002
 course+0.002
 practice+0.001
 anymore+0.001
 argued+0.001
Neg logit contributions
 pans-0.002
ucer-0.001
eddy-0.001
emo-0.001
tub-0.001
Token Sue
Token ID26089
Feature activation+4.16e-01

Pos logit contributions
 pans+0.001
eddy+0.001
izz+0.001
emo+0.001
loo+0.001
Neg logit contributions
ishly-0.002
 course-0.001
 argued-0.001
 lear-0.001
 advantage-0.001
Token said
Token ID531
Feature activation+2.22e-01

Pos logit contributions
ishly+0.011
 course+0.009
 advantage+0.009
 lear+0.009
 practice+0.009
Neg logit contributions
 pans-0.010
eddy-0.010
 lad-0.009
izz-0.009
ble-0.008
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
ishly+0.006
 course+0.004
 argued+0.004
 practice+0.004
 lear+0.004
Neg logit contributions
eddy-0.006
 pans-0.006
ucer-0.005
emo-0.005
izz-0.005
Token went
Token ID1816
Feature activation+1.61e+00

Pos logit contributions
ishly+0.033
 advantage+0.027
 rather+0.025
 lear+0.025
 course+0.024
Neg logit contributions
eddy-0.025
 pans-0.024
izz-0.023
oe-0.022
 lad-0.021
Token deeper
Token ID9211
Feature activation+1.30e+00

Pos logit contributions
ishly+0.042
 argued+0.036
 course+0.033
 lear+0.032
 echoing+0.032
Neg logit contributions
eddy-0.040
 pans-0.039
loo-0.036
ucer-0.036
oe-0.035
Token into
Token ID656
Feature activation+1.70e+00

Pos logit contributions
ishly+0.037
 course+0.031
 argued+0.029
 practice+0.029
 lear+0.029
Neg logit contributions
 pans-0.031
eddy-0.029
ble-0.026
oe-0.026
ucer-0.025
Token the
Token ID262
Feature activation+9.99e-01

Pos logit contributions
ishly+0.054
 argued+0.043
 course+0.043
 advantage+0.041
 lear+0.041
Neg logit contributions
 pans-0.036
eddy-0.034
izz-0.029
loo-0.029
ucer-0.029
Token woods
Token ID16479
Feature activation+1.94e+00

Pos logit contributions
ishly+0.025
 course+0.021
 practice+0.021
 advantage+0.019
 competition+0.019
Neg logit contributions
eddy-0.026
 pans-0.026
emo-0.024
ucer-0.023
izz-0.023
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
ishly+0.057
ercise+0.050
 argued+0.050
 advantage+0.048
 course+0.047
Neg logit contributions
tub-0.037
 pans-0.037
oe-0.034
loo-0.034
 room-0.034
Token she
Token ID673
Feature activation+7.50e-01

Pos logit contributions
ishly+0.034
 course+0.027
 argued+0.027
 advantage+0.026
 lear+0.025
Neg logit contributions
 pans-0.025
eddy-0.023
ucer-0.021
izz-0.021
ble-0.021
Token began
Token ID2540
Feature activation-1.52e+00

Pos logit contributions
ishly+0.025
 course+0.021
 advantage+0.021
 lear+0.020
 strides+0.019
Neg logit contributions
 pans-0.013
 lad-0.012
 room-0.012
ble-0.012
tub-0.012
Token to
Token ID284
Feature activation+1.89e+00

Pos logit contributions
 pans+0.030
eddy+0.028
 lad+0.025
onna+0.025
tub+0.024
Neg logit contributions
ishly-0.049
 course-0.042
 advantage-0.038
Deal-0.038
 argued-0.038
Token hear
Token ID3285
Feature activation+2.93e+00

Pos logit contributions
ishly+0.061
 course+0.051
 lear+0.050
ted+0.049
 advantage+0.048
Neg logit contributions
 pans-0.032
 glass-0.030
 lad-0.030
 candle-0.029
 room-0.029
Token some
Token ID617
Feature activation+1.99e+00

Pos logit contributions
ishly+0.114
 argued+0.095
 lear+0.092
Deal+0.092
istent+0.091
Neg logit contributions
 pans-0.036
 glass-0.033
eddy-0.033
 candle-0.032
 flour-0.031
Token the
Token ID262
Feature activation+1.92e+00

Pos logit contributions
ishly+0.043
 argued+0.035
rified+0.035
ave+0.034
 antics+0.034
Neg logit contributions
 pans-0.009
 late-0.008
ull-0.008
eddy-0.007
 room-0.007
Token east
Token ID7627
Feature activation+1.24e+00

Pos logit contributions
ishly+0.073
 argued+0.059
 lear+0.058
 echoing+0.055
 compete+0.054
Neg logit contributions
eddy-0.027
 pans-0.027
 candle-0.023
 room-0.023
loo-0.022
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
ishly+0.046
 argued+0.040
ercise+0.039
 lear+0.037
itivity+0.037
Neg logit contributions
eddy-0.016
 room-0.016
loo-0.015
ney-0.015
oe-0.014
Token<|endoftext|>
Token ID50256
Feature activation-1.33e-01

Pos logit contributions
ishly+0.030
 course+0.025
 argued+0.024
 practice+0.024
 lear+0.023
Neg logit contributions
 pans-0.022
eddy-0.022
ucer-0.019
emo-0.019
izz-0.019
TokenOnce
Token ID7454
Feature activation-7.28e-01

Pos logit contributions
 pans+0.004
eddy+0.003
ucer+0.003
ble+0.003
emo+0.003
Neg logit contributions
ishly-0.003
 course-0.003
 practice-0.002
 compete-0.002
 weren-0.002
Token upon
Token ID2402
Feature activation+1.13e+00

Pos logit contributions
 pans+0.014
eddy+0.013
ucer+0.011
 lad+0.010
izz+0.010
Neg logit contributions
ishly-0.023
 course-0.021
 practice-0.020
 weren-0.020
 argued-0.019
Token a
Token ID257
Feature activation+2.20e+00

Pos logit contributions
ishly+0.022
 course+0.020
 practice+0.020
 rather+0.018
 argued+0.017
Neg logit contributions
 pans-0.034
eddy-0.033
ucer-0.031
loo-0.031
izz-0.030
Token time
Token ID640
Feature activation-6.60e-01

Pos logit contributions
ishly+0.053
 course+0.042
 argued+0.040
 practice+0.039
 lear+0.038
Neg logit contributions
eddy-0.061
 pans-0.060
ucer-0.054
emo-0.054
izz-0.053
Token,
Token ID11
Feature activation-7.81e-01

Pos logit contributions
eddy+0.020
 pans+0.020
ucer+0.018
emo+0.018
izz+0.018
Neg logit contributions
ishly-0.014
 course-0.010
 argued-0.010
 practice-0.010
 lear-0.010
Token there
Token ID612
Feature activation-3.19e-01

Pos logit contributions
 pans+0.019
eddy+0.019
ucks+0.017
ucer+0.017
loo+0.016
Neg logit contributions
ishly-0.020
 course-0.018
 practice-0.017
 weren-0.017
 argued-0.016
Token was
Token ID373
Feature activation+6.59e-01

Pos logit contributions
 pans+0.008
eddy+0.008
ucer+0.007
emo+0.007
izz+0.007
Neg logit contributions
ishly-0.008
 course-0.007
 argued-0.006
 practice-0.006
 weren-0.006
Token.
Token ID13
Feature activation+2.09e+00

Pos logit contributions
ishly+0.012
 advantage+0.011
 meanings+0.011
 course+0.011
 persever+0.011
Neg logit contributions
eddy-0.005
 pans-0.005
 room-0.005
tub-0.005
 pot-0.005
Token Suddenly
Token ID24975
Feature activation+1.99e+00

Pos logit contributions
ishly+0.067
 course+0.061
 advantage+0.056
nered+0.056
 lear+0.056
Neg logit contributions
 pans-0.033
Jen-0.030
eddy-0.030
emo-0.030
 Blood-0.029
Token,
Token ID11
Feature activation+9.70e-01

Pos logit contributions
ishly+0.054
 course+0.045
 argued+0.044
 advantage+0.042
 lear+0.042
Neg logit contributions
 pans-0.046
ble-0.042
eddy-0.040
ucer-0.039
oe-0.039
Token Pat
Token ID3208
Feature activation+2.07e+00

Pos logit contributions
ishly+0.029
 course+0.025
 argued+0.024
 advantage+0.024
 rather+0.023
Neg logit contributions
 pans-0.019
ble-0.018
ucer-0.017
izz-0.016
Jen-0.016
Token sne
Token ID10505
Feature activation+3.33e+00

Pos logit contributions
ishly+0.070
 advantage+0.059
 course+0.059
 lear+0.058
Deal+0.057
Neg logit contributions
 pans-0.036
ble-0.033
oe-0.032
izz-0.032
eddy-0.031
Tokenez
Token ID8471
Feature activation+1.71e+00

Pos logit contributions
ishly+0.096
 competition+0.075
 compete+0.073
 course+0.072
 lear+0.071
Neg logit contributions
izz-0.074
eddy-0.074
emo-0.070
ull-0.068
 pans-0.068
Tokened
Token ID276
Feature activation+2.11e+00

Pos logit contributions
ishly+0.052
 course+0.047
 compete+0.045
 advantage+0.044
 practice+0.044
Neg logit contributions
eddy-0.037
ble-0.034
 pans-0.033
ucer-0.033
izz-0.032
Token really
Token ID1107
Feature activation+4.00e-01

Pos logit contributions
ishly+0.064
 compete+0.058
 course+0.057
pieces+0.056
 lear+0.056
Neg logit contributions
eddy-0.035
 pans-0.035
ble-0.031
 glass-0.031
 dirty-0.030
Token loudly
Token ID23112
Feature activation+1.39e+00

Pos logit contributions
ishly+0.015
 argued+0.012
pieces+0.012
 compete+0.012
date+0.012
Neg logit contributions
 pans-0.005
ble-0.005
 late-0.005
 glass-0.005
 salt-0.004
Token and
Token ID290
Feature activation+9.38e-01

Pos logit contributions
ishly+0.045
 course+0.041
 practice+0.040
Deal+0.040
pieces+0.040
Neg logit contributions
 pans-0.021
eddy-0.021
tub-0.018
ble-0.017
onna-0.017
Token really
Token ID1107
Feature activation+4.35e-01

Pos logit contributions
ishly+0.029
 course+0.026
dates+0.025
 advantage+0.025
date+0.024
Neg logit contributions
 pans-0.017
ble-0.014
 glass-0.013
 Blood-0.013
 salt-0.013
Token had
Token ID550
Feature activation+1.58e+00

Pos logit contributions
ishly+0.015
 course+0.012
 advantage+0.012
 lear+0.011
 argued+0.011
Neg logit contributions
 pans-0.011
eddy-0.010
ucer-0.009
emo-0.009
ble-0.009
Token a
Token ID257
Feature activation+1.15e+00

Pos logit contributions
ishly+0.045
 lear+0.034
 course+0.033
 advantage+0.032
 argued+0.031
Neg logit contributions
 pans-0.038
eddy-0.035
 flour-0.034
emo-0.033
 glass-0.032
Token lot
Token ID1256
Feature activation-8.13e-01

Pos logit contributions
ishly+0.032
 argued+0.024
 course+0.024
 advantage+0.022
 reassured+0.022
Neg logit contributions
 pans-0.029
eddy-0.027
ucer-0.025
oe-0.025
emo-0.025
Token of
Token ID286
Feature activation+9.33e-01

Pos logit contributions
eddy+0.015
 pans+0.014
loo+0.013
izz+0.012
oe+0.012
Neg logit contributions
ishly-0.029
 argued-0.023
 lear-0.022
 course-0.022
 advantage-0.022
Token fun
Token ID1257
Feature activation-1.05e+00

Pos logit contributions
ishly+0.027
 argued+0.018
 lear+0.018
rified+0.018
 echoing+0.018
Neg logit contributions
 pans-0.024
eddy-0.021
 flour-0.021
emo-0.020
 room-0.020
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
 pans+0.023
eddy+0.023
ble+0.020
emo+0.020
tub+0.020
Neg logit contributions
ishly-0.033
 argued-0.024
 course-0.024
 advantage-0.024
rified-0.023
Token
Token ID220
Feature activation-5.59e-01

Pos logit contributions
ishly+0.036
 course+0.030
 argued+0.029
 lear+0.029
 practice+0.028
Neg logit contributions
eddy-0.026
 pans-0.026
ucer-0.023
emo-0.023
izz-0.023
Token
Token ID198
Feature activation+5.45e-01

Pos logit contributions
eddy+0.011
 pans+0.011
emo+0.009
ucer+0.009
izz+0.009
Neg logit contributions
ishly-0.018
 course-0.015
 argued-0.015
 practice-0.014
 lear-0.014
Token
Token ID198
Feature activation+4.68e-01

Pos logit contributions
ishly+0.015
 course+0.013
 practice+0.012
 argued+0.012
 lear+0.012
Neg logit contributions
 pans-0.013
eddy-0.013
ucer-0.011
loo-0.011
emo-0.011
TokenSometimes
Token ID15468
Feature activation+1.43e+00

Pos logit contributions
ishly+0.016
 course+0.013
 argued+0.013
 advantage+0.012
 practice+0.012
Neg logit contributions
 pans-0.009
eddy-0.009
ucer-0.008
emo-0.007
izz-0.007
Token,
Token ID11
Feature activation+3.14e-01

Pos logit contributions
ishly+0.045
 course+0.037
 argued+0.034
 advantage+0.034
 lear+0.034
Neg logit contributions
 pans-0.028
eddy-0.027
emo-0.025
ble-0.024
 salt-0.024
Token,
Token ID11
Feature activation-6.08e-01

Pos logit contributions
eddy+0.010
 pans+0.010
ucer+0.009
izz+0.009
emo+0.009
Neg logit contributions
ishly-0.004
 course-0.003
 practice-0.003
 argued-0.003
 weren-0.002
Token he
Token ID339
Feature activation-6.71e-01

Pos logit contributions
eddy+0.019
 pans+0.018
ucer+0.017
izz+0.017
ucks+0.016
Neg logit contributions
ishly-0.012
 course-0.010
 practice-0.009
 argued-0.009
 weren-0.009
Token found
Token ID1043
Feature activation+8.38e-02

Pos logit contributions
eddy+0.020
 pans+0.019
ucer+0.018
ble+0.017
emo+0.017
Neg logit contributions
ishly-0.014
 course-0.012
 practice-0.012
 argued-0.012
 lear-0.011
Token a
Token ID257
Feature activation+8.94e-01

Pos logit contributions
ishly+0.002
 course+0.001
 practice+0.001
 advantage+0.001
 encouraged+0.001
Neg logit contributions
 pans-0.003
eddy-0.003
ucer-0.002
oe-0.002
onna-0.002
Token big
Token ID1263
Feature activation+1.25e+00

Pos logit contributions
ishly+0.025
 course+0.022
 practice+0.022
 competition+0.021
 rather+0.021
Neg logit contributions
eddy-0.019
 pans-0.018
izz-0.018
ucks-0.017
emo-0.016
Token tree
Token ID5509
Feature activation+3.79e-01

Pos logit contributions
ishly+0.035
 course+0.029
 practice+0.028
 argued+0.027
 lear+0.027
Neg logit contributions
eddy-0.029
 pans-0.029
izz-0.026
emo-0.026
ucer-0.026
Token in
Token ID287
Feature activation-3.15e-01

Pos logit contributions
ishly+0.009
 course+0.007
 rather+0.006
 practice+0.006
 argued+0.006
Neg logit contributions
 pans-0.011
eddy-0.011
emo-0.010
izz-0.010
ucer-0.010
Token the
Token ID262
Feature activation+8.39e-01

Pos logit contributions
eddy+0.011
ucker+0.011
ucer+0.011
ouse+0.010
 pans+0.010
Neg logit contributions
 course-0.005
 practice-0.003
 every-0.003
ishly-0.003
 argued-0.003
Token park
Token ID3952
Feature activation-1.12e+00

Pos logit contributions
ishly+0.021
 course+0.018
 practice+0.017
 argued+0.017
 advantage+0.016
Neg logit contributions
eddy-0.022
 pans-0.022
emo-0.019
ucer-0.019
izz-0.019
Token.
Token ID13
Feature activation-5.87e-01

Pos logit contributions
eddy+0.030
 pans+0.030
ucer+0.027
emo+0.027
izz+0.026
Neg logit contributions
ishly-0.028
 course-0.022
 argued-0.021
 practice-0.021
 lear-0.020
Token He
Token ID679
Feature activation-9.25e-01

Pos logit contributions
 pans+0.018
eddy+0.018
ucer+0.017
ble+0.016
izz+0.016
Neg logit contributions
ishly-0.011
 course-0.009
 practice-0.009
 weren-0.009
 compete-0.009
Token say
Token ID910
Feature activation+4.81e-01

Pos logit contributions
 pans+0.029
 lad+0.028
 glass+0.027
eddy+0.026
Anna+0.026
Neg logit contributions
ishly-0.045
 echoing-0.036
rified-0.035
 advantage-0.035
 strides-0.035
Token they
Token ID484
Feature activation-7.79e-01

Pos logit contributions
ishly+0.015
 course+0.013
 argued+0.013
 advantage+0.013
 practice+0.012
Neg logit contributions
eddy-0.009
 pans-0.009
emo-0.008
 candle-0.007
 flour-0.007
Token hope
Token ID2911
Feature activation-9.58e-01

Pos logit contributions
eddy+0.015
 pans+0.014
 flour+0.013
ucer+0.013
 glass+0.013
Neg logit contributions
ishly-0.025
 course-0.020
 echoing-0.019
 strides-0.019
 competition-0.018
Token her
Token ID607
Feature activation+1.24e+00

Pos logit contributions
eddy+0.017
 pans+0.017
onna+0.014
 flour+0.014
emo+0.014
Neg logit contributions
ishly-0.034
 course-0.027
 strides-0.026
rified-0.026
 argued-0.025
Token leg
Token ID1232
Feature activation+8.74e-01

Pos logit contributions
ishly+0.037
 course+0.031
 argued+0.030
 lear+0.028
 practice+0.028
Neg logit contributions
eddy-0.028
 pans-0.027
ucer-0.024
emo-0.024
ble-0.024
Token will
Token ID481
Feature activation-5.50e-01

Pos logit contributions
ishly+0.030
 course+0.024
 strides+0.024
 lear+0.024
 echoing+0.024
Neg logit contributions
eddy-0.016
 pans-0.014
 glass-0.013
 candle-0.013
 sticker-0.013
Token heal
Token ID12035
Feature activation+4.60e-01

Pos logit contributions
eddy+0.011
 pans+0.010
onna+0.010
ble+0.010
oe+0.009
Neg logit contributions
ishly-0.018
 course-0.014
rified-0.014
 argued-0.014
 strides-0.014
Token soon
Token ID2582
Feature activation-7.71e-01

Pos logit contributions
ishly+0.013
 course+0.010
 argued+0.010
 lear+0.010
 strides+0.010
Neg logit contributions
eddy-0.012
 pans-0.011
emo-0.010
ble-0.010
ucer-0.009
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
eddy+0.012
 pans+0.011
 room+0.009
oe+0.009
 candle+0.009
Neg logit contributions
ishly-0.029
 course-0.023
 strides-0.023
rified-0.023
 practise-0.022
Token\n
Token ID198
Feature activation-1.01e-01

Pos logit contributions
 pans+0.003
eddy+0.003
ucer+0.003
ble+0.003
loo+0.003
Neg logit contributions
ishly-0.004
 practice-0.003
 course-0.003
 argued-0.003
 rather-0.003
Token\n
Token ID198
Feature activation-7.83e-01

Pos logit contributions
 pans+0.003
eddy+0.003
ucer+0.003
loo+0.002
 lad+0.002
Neg logit contributions
ishly-0.002
 course-0.002
 practice-0.002
 rather-0.002
 advantage-0.002
Token ducks
Token ID39694
Feature activation+1.75e-01

Pos logit contributions
eddy+0.000
 pans+0.000
ucer+0.000
emo+0.000
izz+0.000
Neg logit contributions
ishly-0.000
 course-0.000
 argued-0.000
 practice-0.000
 advantage-0.000
Token bit
Token ID1643
Feature activation-1.87e+00

Pos logit contributions
ishly+0.006
 advantage+0.005
 lear+0.005
 course+0.005
 practice+0.005
Neg logit contributions
ble-0.003
ull-0.003
 pans-0.003
emo-0.003
tub-0.003
Token her
Token ID607
Feature activation-9.72e-01

Pos logit contributions
eddy+0.048
 pans+0.046
ucer+0.044
emo+0.043
loo+0.042
Neg logit contributions
ishly-0.048
 argued-0.039
 course-0.038
 practice-0.036
 advantage-0.035
Token face
Token ID1986
Feature activation+4.52e-01

Pos logit contributions
 pans+0.017
eddy+0.016
ble+0.015
ucer+0.014
emo+0.014
Neg logit contributions
ishly-0.034
 course-0.029
 practice-0.028
 lear-0.028
 strides-0.027
Token.
Token ID13
Feature activation-9.30e-01

Pos logit contributions
ishly+0.015
 advantage+0.013
 lear+0.013
 course+0.013
 compete+0.012
Neg logit contributions
 pans-0.008
eddy-0.008
ble-0.007
 glass-0.007
 flour-0.007
Token It
Token ID632
Feature activation-8.50e-01

Pos logit contributions
eddy+0.016
 pans+0.016
ucer+0.014
loo+0.013
emo+0.013
Neg logit contributions
ishly-0.032
 course-0.027
 argued-0.027
 practice-0.026
 lear-0.026
Token hurt
Token ID5938
Feature activation-1.01e+00

Pos logit contributions
 pans+0.015
eddy+0.014
emo+0.013
ble+0.013
 glass+0.012
Neg logit contributions
ishly-0.028
 course-0.024
 lear-0.024
 advantage-0.024
 practice-0.024
Token a
Token ID257
Feature activation-7.80e-01

Pos logit contributions
eddy+0.024
 pans+0.024
izz+0.021
ucer+0.021
ble+0.021
Neg logit contributions
ishly-0.028
 course-0.023
 argued-0.022
 practice-0.022
 lear-0.022
Token lot
Token ID1256
Feature activation-1.74e+00

Pos logit contributions
eddy+0.019
 pans+0.019
ucer+0.017
emo+0.017
izz+0.017
Neg logit contributions
ishly-0.021
 course-0.017
 argued-0.016
 practice-0.016
 advantage-0.016
Token.
Token ID13
Feature activation-8.88e-01

Pos logit contributions
eddy+0.036
 pans+0.033
loo+0.031
oe+0.030
izz+0.030
Neg logit contributions
ishly-0.056
 argued-0.048
 compete-0.045
 antics-0.045
 practice-0.045
Token Sara
Token ID24799
Feature activation-9.97e-01

Pos logit contributions
eddy+0.015
 pans+0.015
ucer+0.013
loo+0.012
emo+0.012
Neg logit contributions
ishly-0.031
 course-0.026
 argued-0.026
 practice-0.025
 advantage-0.025
Token waited
Token ID13488
Feature activation-1.93e-01

Pos logit contributions
ishly+0.010
 course+0.008
 advantage+0.008
 echoing+0.008
 strides+0.008
Neg logit contributions
 pans-0.006
eddy-0.006
 flour-0.005
 lad-0.005
izz-0.005
Token for
Token ID329
Feature activation+6.52e-01

Pos logit contributions
eddy+0.005
 pans+0.005
ucer+0.005
emo+0.005
izz+0.005
Neg logit contributions
ishly-0.005
 course-0.004
 argued-0.003
 practice-0.003
 advantage-0.003
Token it
Token ID340
Feature activation+8.13e-01

Pos logit contributions
ishly+0.019
 course+0.015
 argued+0.015
 lear+0.015
 practice+0.014
Neg logit contributions
 pans-0.015
eddy-0.015
izz-0.013
ucer-0.013
emo-0.013
Token to
Token ID284
Feature activation-4.20e-02

Pos logit contributions
ishly+0.028
 course+0.026
 lear+0.024
 practice+0.023
 advantage+0.023
Neg logit contributions
 pans-0.014
oe-0.013
eddy-0.013
ucer-0.012
tub-0.012
Token steam
Token ID13324
Feature activation+7.60e-01

Pos logit contributions
 pans+0.001
eddy+0.001
ucer+0.001
izz+0.001
emo+0.001
Neg logit contributions
ishly-0.001
 course-0.001
 argued-0.001
 practice-0.001
 lear-0.001
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
ishly+0.025
 course+0.022
 persever+0.022
 reassured+0.021
 advantage+0.021
Neg logit contributions
eddy-0.014
tub-0.013
 pans-0.013
oe-0.011
ble-0.011
Token Then
Token ID3244
Feature activation+7.44e-01

Pos logit contributions
ishly+0.004
 course+0.003
 argued+0.003
 practice+0.003
 advantage+0.003
Neg logit contributions
 pans-0.004
eddy-0.004
ucer-0.004
loo-0.004
ble-0.004
Token,
Token ID11
Feature activation-2.41e-01

Pos logit contributions
ishly+0.016
 course+0.012
 argued+0.011
 practice+0.011
 lear+0.011
Neg logit contributions
eddy-0.023
 pans-0.023
ucer-0.021
emo-0.021
izz-0.021
Token her
Token ID607
Feature activation+9.14e-01

Pos logit contributions
 pans+0.005
eddy+0.004
ucer+0.004
izz+0.004
ble+0.004
Neg logit contributions
ishly-0.008
 course-0.007
 argued-0.006
 advantage-0.006
 strides-0.006
Token mom
Token ID1995
Feature activation+7.18e-01

Pos logit contributions
ishly+0.031
 course+0.026
 argued+0.025
 practice+0.025
 lear+0.025
Neg logit contributions
eddy-0.017
 pans-0.017
ucer-0.014
emo-0.014
izz-0.014
Token asked
Token ID1965
Feature activation-1.44e-01

Pos logit contributions
ishly+0.018
 course+0.015
 argued+0.014
 practice+0.014
 advantage+0.014
Neg logit contributions
eddy-0.019
 pans-0.019
ucer-0.017
emo-0.016
izz-0.016
TokenTim
Token ID14967
Feature activation+1.89e-01

Pos logit contributions
eddy+0.007
 pans+0.006
emo+0.005
ucer+0.005
izz+0.005
Neg logit contributions
ishly-0.016
 course-0.013
 argued-0.013
 advantage-0.013
 practice-0.013
Tokenmy
Token ID1820
Feature activation+2.15e-01

Pos logit contributions
ishly+0.006
 course+0.005
 argued+0.005
 practice+0.005
 lear+0.005
Neg logit contributions
 pans-0.004
eddy-0.004
ucer-0.003
emo-0.003
izz-0.003
Token was
Token ID373
Feature activation+1.69e-01

Pos logit contributions
ishly+0.006
 course+0.005
 advantage+0.005
 lear+0.005
 strides+0.005
Neg logit contributions
eddy-0.005
 pans-0.005
emo-0.004
ble-0.004
 lad-0.004
Token so
Token ID523
Feature activation-2.16e-01

Pos logit contributions
ishly+0.005
 course+0.004
 lear+0.004
 argued+0.004
 practice+0.004
Neg logit contributions
eddy-0.004
 pans-0.004
ble-0.004
izz-0.003
emo-0.003
Token excited
Token ID6568
Feature activation-2.68e-01

Pos logit contributions
eddy+0.005
 pans+0.005
ucer+0.005
emo+0.005
izz+0.005
Neg logit contributions
ishly-0.006
 course-0.005
 argued-0.005
 practice-0.004
 lear-0.004
Token to
Token ID284
Feature activation-8.71e-01

Pos logit contributions
eddy+0.006
 pans+0.005
loo+0.005
oe+0.005
emo+0.005
Neg logit contributions
ishly-0.008
 advantage-0.006
 course-0.006
 lear-0.006
 practice-0.006
Token hear
Token ID3285
Feature activation+4.25e-01

Pos logit contributions
 pans+0.019
eddy+0.019
ble+0.018
izz+0.017
emo+0.017
Neg logit contributions
ishly-0.026
 course-0.020
rified-0.020
 argued-0.020
nered-0.019
Token that
Token ID326
Feature activation-1.46e+00

Pos logit contributions
ishly+0.014
 argued+0.011
 lear+0.011
rified+0.011
Deal+0.010
Neg logit contributions
eddy-0.009
 pans-0.008
 candle-0.008
 room-0.007
ble-0.007
Token he
Token ID339
Feature activation-1.88e-02

Pos logit contributions
eddy+0.034
 pans+0.033
ucer+0.028
ble+0.028
emo+0.028
Neg logit contributions
ishly-0.044
 course-0.035
 argued-0.034
 advantage-0.032
 strides-0.032
Token would
Token ID561
Feature activation-1.49e+00

Pos logit contributions
 pans+0.000
 lad+0.000
eddy+0.000
 room+0.000
ble+0.000
Neg logit contributions
ishly-0.001
 course-0.001
 advantage-0.001
opl-0.000
 strides-0.000
Token have
Token ID423
Feature activation-6.96e-02

Pos logit contributions
onna+0.029
ble+0.027
 candle+0.026
eddy+0.025
 glass+0.025
Neg logit contributions
ishly-0.045
rified-0.039
vious-0.036
 unnecessary-0.036
vered-0.035
Tokenily
Token ID813
Feature activation-2.56e+00

Pos logit contributions
 pans+0.055
eddy+0.046
ble+0.045
ucer+0.045
stals+0.044
Neg logit contributions
 course-0.041
ishly-0.033
 argued-0.033
 him-0.032
 practice-0.031
Token liked
Token ID8288
Feature activation-1.15e+00

Pos logit contributions
eddy+0.059
 pans+0.056
emo+0.050
izz+0.049
loo+0.048
Neg logit contributions
ishly-0.076
 advantage-0.060
 course-0.059
 argued-0.057
 echoing-0.057
Token to
Token ID284
Feature activation-2.55e-02

Pos logit contributions
eddy+0.031
 pans+0.031
ucer+0.028
emo+0.027
izz+0.027
Neg logit contributions
ishly-0.028
 course-0.022
 argued-0.021
 practice-0.021
 lear-0.021
Token cook
Token ID4255
Feature activation-8.86e-01

Pos logit contributions
 pans+0.001
eddy+0.001
emo+0.001
 lad+0.001
 candle+0.001
Neg logit contributions
ishly-0.001
rified-0.001
 course-0.001
 advantage-0.001
 argued-0.001
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
eddy+0.022
 pans+0.022
ucer+0.020
emo+0.020
loo+0.020
Neg logit contributions
ishly-0.023
 course-0.019
 practice-0.018
 argued-0.018
 advantage-0.017
Token She
Token ID1375
Feature activation-2.83e+00

Pos logit contributions
 pans+0.004
eddy+0.004
ucer+0.004
ble+0.004
izz+0.004
Neg logit contributions
ishly-0.003
 course-0.003
 practice-0.003
 argued-0.003
 lear-0.003
Token had
Token ID550
Feature activation-1.16e+00

Pos logit contributions
eddy+0.093
ucer+0.087
ucks+0.086
ble+0.085
 pans+0.084
Neg logit contributions
ishly-0.043
 argued-0.039
 course-0.039
 practice-0.037
 lear-0.037
Token a
Token ID257
Feature activation-9.23e-01

Pos logit contributions
eddy+0.035
ucer+0.033
ble+0.032
ucks+0.031
 pans+0.031
Neg logit contributions
ishly-0.020
 course-0.020
 practice-0.019
 argued-0.019
 rather-0.018
Token small
Token ID1402
Feature activation-2.00e+00

Pos logit contributions
eddy+0.020
 pans+0.018
izz+0.018
ucer+0.018
ucks+0.017
Neg logit contributions
ishly-0.026
 course-0.023
 practice-0.023
 competition-0.022
 lear-0.022
Token kitchen
Token ID9592
Feature activation-7.15e-01

Pos logit contributions
eddy+0.031
 pans+0.029
izz+0.026
ucer+0.026
emo+0.026
Neg logit contributions
ishly-0.072
 course-0.062
 practice-0.061
 lear-0.060
 argued-0.059
Token with
Token ID351
Feature activation-1.86e+00

Pos logit contributions
eddy+0.019
 pans+0.018
ucer+0.017
emo+0.017
izz+0.017
Neg logit contributions
ishly-0.018
 course-0.014
 argued-0.013
 practice-0.013
 lear-0.013
Token there
Token ID612
Feature activation-8.20e-01

Pos logit contributions
 pans+0.039
eddy+0.038
ucks+0.035
ucer+0.034
loo+0.033
Neg logit contributions
ishly-0.032
 course-0.029
 at-0.028
 weren-0.028
 practice-0.027
Token was
Token ID373
Feature activation+8.91e-02

Pos logit contributions
 pans+0.021
eddy+0.020
ucer+0.018
emo+0.018
izz+0.018
Neg logit contributions
ishly-0.022
 course-0.018
 argued-0.017
 practice-0.017
 lear-0.016
Token a
Token ID257
Feature activation+5.51e-01

Pos logit contributions
ishly+0.002
 course+0.002
 practice+0.002
 rather+0.001
 lear+0.001
Neg logit contributions
 pans-0.003
ucer-0.003
eddy-0.003
loo-0.002
ble-0.002
Token little
Token ID1310
Feature activation-5.34e-01

Pos logit contributions
ishly+0.010
 lear+0.010
 practice+0.009
 course+0.009
 competition+0.009
Neg logit contributions
ucks-0.015
eddy-0.015
ucer-0.015
izz-0.015
 pans-0.015
Token boy
Token ID2933
Feature activation-1.91e+00

Pos logit contributions
emo+0.011
izz+0.011
 pans+0.011
onna+0.011
ucer+0.011
Neg logit contributions
ishly-0.014
 course-0.013
 lear-0.012
 practice-0.012
 competition-0.011
Token named
Token ID3706
Feature activation-2.62e+00

Pos logit contributions
 pans+0.071
eddy+0.068
ucer+0.067
ble+0.065
emo+0.065
Neg logit contributions
ishly-0.027
 course-0.022
 practice-0.018
 lear-0.017
 argued-0.017
Token Tim
Token ID5045
Feature activation-2.96e+00

Pos logit contributions
eddy+0.095
 pans+0.095
ucer+0.092
ble+0.091
izz+0.088
Neg logit contributions
ishly-0.035
 course-0.025
 practice-0.023
 argued-0.022
 compete-0.019
Tokenmy
Token ID1820
Feature activation-2.46e+00

Pos logit contributions
 pans+0.084
eddy+0.074
ucer+0.073
tub+0.071
ble+0.070
Neg logit contributions
ishly-0.065
 course-0.063
 practice-0.057
 argued-0.057
 rather-0.053
Token.
Token ID13
Feature activation-1.29e-01

Pos logit contributions
 pans+0.081
eddy+0.074
izz+0.074
ucer+0.073
loo+0.072
Neg logit contributions
ishly-0.045
 course-0.036
 lear-0.034
 practice-0.033
 argued-0.032
Token Tim
Token ID5045
Feature activation-1.86e+00

Pos logit contributions
 pans+0.004
eddy+0.004
ucer+0.004
ble+0.004
izz+0.004
Neg logit contributions
ishly-0.002
 course-0.002
 argued-0.002
 practice-0.001
 weren-0.001
Tokenmy
Token ID1820
Feature activation-2.30e+00

Pos logit contributions
 pans+0.059
ucer+0.053
ucks+0.051
eddy+0.050
ble+0.050
Neg logit contributions
 course-0.034
ishly-0.033
 practice-0.030
 argued-0.030
 weren-0.028
Token.
Token ID13
Feature activation-6.38e-01

Pos logit contributions
eddy+0.030
 pans+0.030
izz+0.025
oe+0.025
 flour+0.025
Neg logit contributions
ishly-0.039
 course-0.032
 lear-0.031
 advantage-0.031
 practice-0.030
Token They
Token ID1119
Feature activation-1.10e+00

Pos logit contributions
 pans+0.016
eddy+0.015
ucer+0.015
loo+0.014
ble+0.014
Neg logit contributions
ishly-0.017
 course-0.014
 argued-0.013
 practice-0.013
 rather-0.013
Token agreed
Token ID4987
Feature activation-1.55e+00

Pos logit contributions
 pans+0.025
eddy+0.024
izz+0.023
 lad+0.022
ucer+0.022
Neg logit contributions
ishly-0.029
 strides-0.024
 advantage-0.024
 practice-0.023
 course-0.023
Token to
Token ID284
Feature activation-3.78e-01

Pos logit contributions
eddy+0.041
 pans+0.041
ucer+0.036
emo+0.036
izz+0.036
Neg logit contributions
ishly-0.040
 course-0.032
 argued-0.030
 practice-0.030
 lear-0.029
Token take
Token ID1011
Feature activation-8.71e-01

Pos logit contributions
 pans+0.009
eddy+0.009
emo+0.008
ble+0.008
izz+0.008
Neg logit contributions
ishly-0.010
 course-0.008
 echoing-0.007
rified-0.007
 strides-0.007
Token turns
Token ID4962
Feature activation-3.14e+00

Pos logit contributions
eddy+0.022
 pans+0.022
ucer+0.019
izz+0.019
emo+0.019
Neg logit contributions
ishly-0.024
 course-0.019
 argued-0.018
 lear-0.018
 echoing-0.018
Token and
Token ID290
Feature activation-8.57e-03

Pos logit contributions
eddy+0.051
 pans+0.045
oe+0.042
 room+0.040
onna+0.039
Neg logit contributions
ishly-0.116
rified-0.095
 course-0.093
 advantage-0.092
cient-0.091
Token play
Token ID711
Feature activation-1.18e-01

Pos logit contributions
 pans+0.000
eddy+0.000
 glass+0.000
 lad+0.000
emo+0.000
Neg logit contributions
ishly-0.000
rified-0.000
 course-0.000
 echoing-0.000
 advantage-0.000
Token nicely
Token ID16576
Feature activation-1.91e+00

Pos logit contributions
eddy+0.001
 pans+0.001
ble+0.001
 room+0.001
tub+0.001
Neg logit contributions
ishly-0.005
 argued-0.004
 weren-0.004
istent-0.004
 echoing-0.004
Token.
Token ID13
Feature activation-6.62e-01

Pos logit contributions
eddy+0.036
 pans+0.035
oe+0.029
ble+0.028
ucer+0.028
Neg logit contributions
ishly-0.069
 course-0.057
 argued-0.053
 weren-0.053
 echoing-0.053
Token\n
Token ID198
Feature activation-6.93e-01

Pos logit contributions
 pans+0.018
eddy+0.017
ucer+0.017
ble+0.016
loo+0.016
Neg logit contributions
ishly-0.016
 practice-0.013
 rather-0.013
 course-0.012
 lear-0.012
Token
Token ID241
Feature activation-2.72e+00

Pos logit contributions
ishly+0.026
 course+0.021
 lear+0.020
 weren+0.020
 argued+0.020
Neg logit contributions
 pans-0.031
eddy-0.031
emo-0.030
ble-0.030
izz-0.028
TokenI
Token ID40
Feature activation-5.18e-01

Pos logit contributions
 pans+0.062
eddy+0.062
ucer+0.054
emo+0.053
loo+0.053
Neg logit contributions
ishly-0.078
 course-0.065
 argued-0.062
 practice-0.062
 lear-0.061
Token can
Token ID460
Feature activation-3.38e-01

Pos logit contributions
izz+0.010
emo+0.010
onna+0.009
oe+0.009
eddy+0.009
Neg logit contributions
ishly-0.018
 argued-0.014
 course-0.013
rified-0.013
ting-0.013
Token share
Token ID2648
Feature activation-9.02e-01

Pos logit contributions
ble+0.007
eddy+0.007
onna+0.006
 pans+0.006
 block+0.006
Neg logit contributions
ishly-0.010
rified-0.009
 argued-0.009
 unnecessary-0.008
 lear-0.008
Token a
Token ID257
Feature activation-1.72e+00

Pos logit contributions
 pans+0.012
eddy+0.010
 flour+0.009
 yourselves+0.009
 glass+0.008
Neg logit contributions
ishly-0.036
 argued-0.031
 echoing-0.030
 lear-0.030
istent-0.029
Token way
Token ID835
Feature activation-2.67e+00

Pos logit contributions
eddy+0.037
 pans+0.036
ucer+0.032
emo+0.032
loo+0.031
Neg logit contributions
ishly-0.054
 argued-0.044
 course-0.042
 weren-0.040
 advantage-0.040
Token to
Token ID284
Feature activation-2.44e-01

Pos logit contributions
eddy+0.062
 pans+0.059
ble+0.052
ucer+0.052
loo+0.051
Neg logit contributions
ishly-0.076
 argued-0.065
 course-0.063
 practice-0.060
 lear-0.059
Token play
Token ID711
Feature activation+4.95e-01

Pos logit contributions
emo+0.004
 pans+0.004
ble+0.004
onna+0.004
 glass+0.004
Neg logit contributions
ishly-0.008
rified-0.007
 echoing-0.006
storms-0.006
 argued-0.006
Token with
Token ID351
Feature activation-8.37e-01

Pos logit contributions
ishly+0.020
 argued+0.019
 echoing+0.017
 discussed+0.017
 meanings+0.017
Neg logit contributions
 pans-0.005
eddy-0.005
ble-0.005
tub-0.004
 room-0.004
Token the
Token ID262
Feature activation+5.38e-01

Pos logit contributions
 pans+0.014
eddy+0.013
emo+0.012
 salt+0.011
ble+0.011
Neg logit contributions
ishly-0.030
 echoing-0.026
 lear-0.026
 argued-0.025
 reassured-0.024
Token other
Token ID584
Feature activation+5.36e-01

Pos logit contributions
ishly+0.018
 argued+0.014
 lear+0.013
 course+0.013
 reassured+0.013
Neg logit contributions
eddy-0.011
 pans-0.011
emo-0.010
loo-0.010
ble-0.009
Token cup
Token ID6508
Feature activation-1.83e+00

Pos logit contributions
eddy+0.061
 pans+0.057
ble+0.055
izz+0.055
loo+0.053
Neg logit contributions
ishly-0.070
 practice-0.066
 course-0.063
 lear-0.059
 competition-0.058
Token of
Token ID286
Feature activation-2.32e+00

Pos logit contributions
 pans+0.050
eddy+0.047
emo+0.045
ucer+0.044
izz+0.043
Neg logit contributions
ishly-0.044
 course-0.035
 lear-0.032
 practice-0.032
 argued-0.031
Token juice
Token ID13135
Feature activation-2.96e+00

Pos logit contributions
 pans+0.039
eddy+0.039
ucer+0.032
emo+0.031
izz+0.031
Neg logit contributions
ishly-0.081
 course-0.068
 argued-0.067
 lear-0.066
 practice-0.066
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
eddy+0.079
 pans+0.079
ucer+0.070
emo+0.069
izz+0.069
Neg logit contributions
ishly-0.074
 course-0.059
 argued-0.056
 practice-0.055
 lear-0.054
Token Tom
Token ID4186
Feature activation-1.34e+00

Pos logit contributions
eddy+0.027
 pans+0.027
ucer+0.023
emo+0.023
izz+0.023
Neg logit contributions
ishly-0.031
 course-0.025
 argued-0.024
 practice-0.024
 lear-0.023
Token drinks
Token ID11758
Feature activation-2.38e+00

Pos logit contributions
eddy+0.033
 pans+0.032
ucer+0.029
loo+0.028
izz+0.028
Neg logit contributions
ishly-0.037
 course-0.030
 practice-0.029
 argued-0.029
 lear-0.028
Token the
Token ID262
Feature activation-3.10e+00

Pos logit contributions
eddy+0.066
 pans+0.064
ucer+0.059
izz+0.057
loo+0.057
Neg logit contributions
ishly-0.056
 course-0.048
 practice-0.044
 argued-0.043
 advantage-0.041
Token juice
Token ID13135
Feature activation-2.62e+00

Pos logit contributions
eddy+0.062
 pans+0.059
ucer+0.052
izz+0.052
emo+0.051
Neg logit contributions
ishly-0.098
 course-0.083
 practice-0.080
 argued-0.080
 advantage-0.078
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
eddy+0.063
 pans+0.063
ucer+0.056
emo+0.055
loo+0.054
Neg logit contributions
ishly-0.073
 course-0.059
 argued-0.057
 practice-0.056
 advantage-0.055
Token He
Token ID679
Feature activation-1.80e+00

Pos logit contributions
 pans+0.025
eddy+0.025
ucer+0.024
izz+0.023
ble+0.022
Neg logit contributions
ishly-0.025
 course-0.021
 practice-0.021
 lear-0.019
 argued-0.019
Token says
Token ID1139
Feature activation+6.75e-01

Pos logit contributions
 pans+0.039
eddy+0.038
emo+0.033
 lad+0.033
izz+0.032
Neg logit contributions
ishly-0.055
 course-0.044
 argued-0.043
 advantage-0.043
 lear-0.042