TOP ACTIVATIONS
MAX = 5.472

But then , Benny and Ros ie started to quarrel .
went to the store . J od ie saw something colorful
day on , Benny and Ros ie played together every day
it . So J od ie 's mom bought
. Her mom and dad told her that she was going
! <|endoftext|> S ully and J ody were good friends .
she saw something colourful pe eking out from the grass .
named Cr ust y . Cr ust y lived in a
L ily and Ros ie went to the kitchen
the movie for her . J od ie and her mom
Soon enough , all of S ully 's friends started arriving
two pieces ! Max and Ros ie were so happy they
girl named Jenny was running gigg ling through a me adow
watched it . J od ie was amazed by
there was a girl named J illy . J illy was
play . Bel le was proud of her
One day , J od ie decided to gather
S ully tried to warn J ody to be careful ,
play . Her friend saw Ros ie and became en vious
soft copper coin . E sm ie wanted to pick up

MIN ACTIVATIONS
MIN = -5.662

Go away . Leave our hockey alone ." Max
Max , that 's our hockey . Give it back .
but we must return the hockey stick if we find the
be with you . The operation is not bad . It
a little accident . Your motorcycle is okay too . It
" L ily , your leg is bad . We need
hockey . Mia brought her hockey stick and John brought his
, Tim my ! Your tire is flat . We need
you very much . The operation will be over soon and
dad . I have a hockey stick and a ball .
good son and a good hockey player . You don 't
hot cocoa in their bath ro bes . <|endoftext|> Once upon
room clean and used his muscle to help his mom around
, " No , my muscle hurts ." The
They say , " Our tower is gone ! The dog
" Tim my , your clock is not honest . It
are right . The new hockey stick is not important .
. " Your tower was high and colorful !"
stayed together ! That â s how people help
. Good nap . Good hockey . Good friends ." <|endoftext|>

INTERVAL 2.688 - 5.472
CONTAINS 0.557%

on his truck . He showed them how to hold the
a voice . It was an old man . He said
said , " I will search the sky for you ."
It was so pretty that Sus ie wanted to keep it
. They had never seen anything like it before ! They

INTERVAL -0.095 - 2.688
CONTAINS 52.845%

careful . Don 't touch the nest or the eggs .
full of lettuce . Next , Lucy 's mom
They are y ummy . They are happy . They are
are sorry . But you can still play with us .
of herself for following her mom 's order and gaining a

INTERVAL -2.878 - -0.095
CONTAINS 46.387%

arm . Tim my said , " O uch ! Why
I will give you some of my milk if you bat
. He says he was playing with it outside and a
, feeling much better than before . <|endoftext|> Once upon a
gasped . She tried to fix the cord , but it

INTERVAL -5.662 - -2.878
CONTAINS 0.211%

't your coat . â J enna was
even like you . â Sam â
very tired . No more bubbles
girl asked , ˜ What can I do
Let â s go inside now
TokenBut
Token ID1537
Feature activation+9.48e-01

Pos logit contributions
ills+0.083
ares+0.080
ax+0.079
elling+0.078
ada+0.078
Neg logit contributions
Spot-0.029
ggy-0.029
Fl-0.028
 Wh-0.028
 surgery-0.025
Token then
Token ID788
Feature activation-9.06e-01

Pos logit contributions
ills+0.021
ares+0.019
ada+0.019
lee+0.019
ly+0.019
Neg logit contributions
ggy-0.030
rob-0.028
bes-0.027
 Maple-0.027
 cookie-0.027
Token,
Token ID11
Feature activation+1.26e+00

Pos logit contributions
ggy+0.022
rob+0.021
bes+0.021
 surgery+0.021
 clinic+0.021
Neg logit contributions
ills-0.027
lee-0.024
ares-0.024
ada-0.024
ly-0.023
Token Benny
Token ID44275
Feature activation+1.65e-01

Pos logit contributions
ills+0.039
ares+0.036
ada+0.035
hing+0.035
elling+0.034
Neg logit contributions
ggy-0.030
 surgery-0.028
rob-0.027
bes-0.027
 Maple-0.027
Token and
Token ID290
Feature activation+2.40e+00

Pos logit contributions
ills+0.005
ares+0.004
ada+0.004
elling+0.004
hing+0.004
Neg logit contributions
ggy-0.004
 surgery-0.004
 Fl-0.004
 Wh-0.004
 chip-0.004
Token Ros
Token ID10018
Feature activation+5.47e+00

Pos logit contributions
ills+0.065
ares+0.059
ada+0.057
elling+0.056
hing+0.056
Neg logit contributions
ggy-0.067
 surgery-0.064
 chip-0.062
Spot-0.061
 Fl-0.061
Tokenie
Token ID494
Feature activation+1.10e+00

Pos logit contributions
 contain+0.121
leaf+0.121
ivated+0.118
ares+0.116
elling+0.116
Neg logit contributions
 Wh-0.146
ggy-0.141
Fl-0.141
rob-0.140
gy-0.140
Token started
Token ID2067
Feature activation-1.59e+00

Pos logit contributions
ills+0.034
ares+0.032
elling+0.031
ax+0.030
ada+0.030
Neg logit contributions
ggy-0.027
 Wh-0.026
 surgery-0.026
 Fl-0.025
Spot-0.025
Token to
Token ID284
Feature activation-8.91e-01

Pos logit contributions
ggy+0.044
rob+0.044
Spot+0.042
 surgery+0.042
Fl+0.042
Neg logit contributions
ills-0.042
arted-0.038
ares-0.037
lee-0.036
ada-0.035
Token quarrel
Token ID41998
Feature activation+7.53e-01

Pos logit contributions
ggy+0.026
 surgery+0.025
 Fl+0.024
 Wh+0.024
 chip+0.024
Neg logit contributions
ills-0.023
ares-0.021
ada-0.021
elling-0.020
ax-0.020
Token.
Token ID13
Feature activation+2.42e+00

Pos logit contributions
ills+0.023
ares+0.021
ada+0.020
hing+0.020
elling+0.020
Neg logit contributions
ggy-0.019
 surgery-0.018
rob-0.018
 chip-0.017
 Fl-0.017
Token went
Token ID1816
Feature activation+1.84e-01

Pos logit contributions
ares+0.048
ills+0.048
idges+0.045
elling+0.044
ivated+0.044
Neg logit contributions
 Wh-0.026
K-0.024
tr-0.023
 Wednesday-0.023
 Sunday-0.023
Token to
Token ID284
Feature activation-5.32e-01

Pos logit contributions
ills+0.006
ares+0.006
ada+0.006
elling+0.005
hing+0.005
Neg logit contributions
ggy-0.004
 surgery-0.004
 Wh-0.004
 Fl-0.004
 Maple-0.004
Token the
Token ID262
Feature activation+1.51e+00

Pos logit contributions
ggy+0.010
 surgery+0.010
 Fl+0.009
 Wh+0.009
 chip+0.009
Neg logit contributions
ills-0.019
ares-0.018
ada-0.017
elling-0.017
hing-0.017
Token store
Token ID3650
Feature activation-1.10e-01

Pos logit contributions
ills+0.052
ares+0.048
ada+0.047
elling+0.047
ax+0.046
Neg logit contributions
ggy-0.031
 surgery-0.029
 Fl-0.027
 Wh-0.027
 chip-0.027
Token.
Token ID13
Feature activation+2.41e+00

Pos logit contributions
ggy+0.002
 surgery+0.002
 Wh+0.002
 chip+0.002
 Fl+0.002
Neg logit contributions
ills-0.004
ares-0.003
ada-0.003
elling-0.003
hing-0.003
Token J
Token ID449
Feature activation+5.29e+00

Pos logit contributions
ills+0.090
hing+0.086
ares+0.082
idges+0.082
elling+0.081
Neg logit contributions
 Wh-0.040
 Fl-0.040
 chip-0.037
 surgery-0.036
 Wednesday-0.036
Tokenod
Token ID375
Feature activation+3.81e+00

Pos logit contributions
hing+0.125
ivated+0.114
 contain+0.113
 wanna+0.111
arted+0.109
Neg logit contributions
ggy-0.142
ork-0.134
urse-0.127
S-0.126
 chip-0.126
Tokenie
Token ID494
Feature activation-3.53e-01

Pos logit contributions
ares+0.062
ivated+0.061
ills+0.060
 contain+0.057
ada+0.056
Neg logit contributions
ggy-0.141
ves-0.132
rob-0.129
elope-0.121
Fl-0.121
Token saw
Token ID2497
Feature activation+4.62e-01

Pos logit contributions
ggy+0.010
 surgery+0.010
 chip+0.010
 Wh+0.010
 Fl+0.009
Neg logit contributions
ills-0.009
ares-0.008
ada-0.008
elling-0.008
ax-0.008
Token something
Token ID1223
Feature activation+3.16e+00

Pos logit contributions
ills+0.018
ares+0.017
ada+0.016
hing+0.016
elling+0.016
Neg logit contributions
ggy-0.008
 surgery-0.007
 Fl-0.007
 Wh-0.007
 chip-0.007
Token colorful
Token ID20239
Feature activation+2.24e+00

Pos logit contributions
ills+0.101
ares+0.094
arted+0.093
hing+0.093
elling+0.092
Neg logit contributions
ggy-0.081
 chip-0.072
 Fl-0.072
 cookie-0.070
 surgery-0.070
Token day
Token ID1110
Feature activation-7.47e-01

Pos logit contributions
ggy+0.033
bes+0.032
 surgery+0.031
 clinic+0.030
rob+0.030
Neg logit contributions
ills-0.050
hing-0.047
elling-0.045
 Jackie-0.045
ie-0.045
Token on
Token ID319
Feature activation-6.20e-01

Pos logit contributions
ggy+0.015
 chip+0.014
bes+0.014
 clinic+0.014
 surgery+0.014
Neg logit contributions
ills-0.025
ares-0.024
hing-0.023
ada-0.023
arted-0.022
Token,
Token ID11
Feature activation+1.40e+00

Pos logit contributions
rob+0.017
ggy+0.017
tr+0.017
 clinic+0.017
 surgery+0.017
Neg logit contributions
ills-0.015
lee-0.015
ada-0.014
 Jackie-0.014
ares-0.014
Token Benny
Token ID44275
Feature activation-6.48e-02

Pos logit contributions
ills+0.048
ares+0.044
ada+0.043
elling+0.042
hing+0.042
Neg logit contributions
ggy-0.030
 surgery-0.028
 chip-0.027
 Fl-0.026
 Wh-0.026
Token and
Token ID290
Feature activation+2.78e+00

Pos logit contributions
ggy+0.002
 surgery+0.002
 chip+0.002
 Wh+0.002
 Fl+0.002
Neg logit contributions
ills-0.002
ada-0.001
ares-0.001
hing-0.001
elling-0.001
Token Ros
Token ID10018
Feature activation+5.26e+00

Pos logit contributions
ills+0.076
ares+0.069
ada+0.067
elling+0.066
ax+0.065
Neg logit contributions
ggy-0.077
 surgery-0.074
 Wh-0.072
 Fl-0.072
 chip-0.071
Tokenie
Token ID494
Feature activation+7.30e-01

Pos logit contributions
 contain+0.108
ares+0.108
ivated+0.107
ax+0.107
oured+0.102
Neg logit contributions
 Wh-0.152
Fl-0.151
ggy-0.149
Spot-0.148
 Fl-0.147
Token played
Token ID2826
Feature activation-1.59e+00

Pos logit contributions
ills+0.021
ares+0.020
ada+0.018
elling+0.018
ax+0.018
Neg logit contributions
ggy-0.020
 surgery-0.018
 Wh-0.018
 Fl-0.018
 chip-0.018
Token together
Token ID1978
Feature activation-5.85e-01

Pos logit contributions
 surgery+0.041
ggy+0.040
 chip+0.039
 Wh+0.038
Fl+0.038
Neg logit contributions
ills-0.044
ares-0.039
hing-0.039
ada-0.038
arted-0.037
Token every
Token ID790
Feature activation-1.21e+00

Pos logit contributions
ggy+0.017
 surgery+0.017
rob+0.016
 Wh+0.016
 chip+0.016
Neg logit contributions
ills-0.015
ares-0.013
hing-0.013
arted-0.012
ada-0.012
Token day
Token ID1110
Feature activation-2.91e-01

Pos logit contributions
ggy+0.024
bes+0.024
 Wh+0.024
rob+0.024
Pe+0.021
Neg logit contributions
ills-0.038
ly-0.037
 contain-0.037
arted-0.036
ares-0.036
Token it
Token ID340
Feature activation+2.09e+00

Pos logit contributions
ills+0.049
ares+0.046
ada+0.044
hing+0.044
elling+0.043
Neg logit contributions
ggy-0.039
 surgery-0.037
Spot-0.036
rob-0.036
 chip-0.035
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
ills+0.067
ares+0.061
ada+0.061
elling+0.060
ax+0.059
Neg logit contributions
ggy-0.048
 surgery-0.045
 Fl-0.044
 Wh-0.044
 chip-0.044
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
ills+0.045
hing+0.043
ares+0.042
elling+0.042
idges+0.041
Neg logit contributions
ggy-0.013
 Fl-0.013
 Wh-0.013
 surgery-0.012
 chip-0.012
Token
Token ID198
Feature activation+2.64e+00

Pos logit contributions
ills+0.045
ares+0.043
elling+0.042
ada+0.042
hing+0.042
Neg logit contributions
ggy-0.010
 surgery-0.008
 Fl-0.008
 Wh-0.008
Spot-0.008
TokenSo
Token ID2396
Feature activation+1.53e+00

Pos logit contributions
ills+0.102
elling+0.102
hing+0.098
rant+0.097
ares+0.096
Neg logit contributions
ggy-0.041
S-0.039
K-0.034
Pe-0.034
Spot-0.034
Token J
Token ID449
Feature activation+5.15e+00

Pos logit contributions
ills+0.059
hing+0.054
ares+0.054
elling+0.053
ax+0.052
Neg logit contributions
ggy-0.026
 surgery-0.023
 Fl-0.023
 Wh-0.023
 chip-0.022
Tokenod
Token ID375
Feature activation+4.05e+00

Pos logit contributions
hing+0.121
ivated+0.110
 contain+0.104
 wanna+0.103
arted+0.103
Neg logit contributions
ggy-0.145
ork-0.137
S-0.132
 chip-0.131
urse-0.131
Tokenie
Token ID494
Feature activation-1.42e-01

Pos logit contributions
ills+0.067
ivated+0.067
ares+0.064
ada+0.061
oured+0.061
Neg logit contributions
ggy-0.149
ves-0.137
rob-0.132
elope-0.130
Fl-0.127
Token's
Token ID338
Feature activation-4.64e-01

Pos logit contributions
ggy+0.004
 surgery+0.004
 Fl+0.004
 Wh+0.004
 chip+0.004
Neg logit contributions
ills-0.004
ares-0.004
ada-0.003
elling-0.003
ax-0.003
Token mom
Token ID1995
Feature activation+3.51e-01

Pos logit contributions
ggy+0.008
 surgery+0.007
 Fl+0.007
 chip+0.007
 Wh+0.007
Neg logit contributions
ills-0.018
ares-0.017
ada-0.016
hing-0.016
elling-0.016
Token bought
Token ID5839
Feature activation+1.46e+00

Pos logit contributions
ills+0.012
ares+0.011
elling+0.011
ivated+0.011
hing+0.010
Neg logit contributions
ggy-0.007
 chip-0.007
 Wh-0.007
 kit-0.007
 Fl-0.007
Token.
Token ID13
Feature activation+1.71e+00

Pos logit contributions
ggy+0.003
 surgery+0.003
 Wh+0.003
 Fl+0.003
 chip+0.003
Neg logit contributions
ills-0.005
ares-0.005
ada-0.004
elling-0.004
ax-0.004
Token Her
Token ID2332
Feature activation-2.44e-01

Pos logit contributions
ills+0.078
hing+0.076
ares+0.074
elling+0.074
idges+0.072
Neg logit contributions
 Wh-0.013
 Fl-0.013
ggy-0.010
 Velvet-0.010
 Wednesday-0.010
Token mom
Token ID1995
Feature activation+6.57e-01

Pos logit contributions
 surgery+0.006
ggy+0.006
rob+0.006
Fl+0.006
 Wh+0.005
Neg logit contributions
ills-0.007
ares-0.007
ada-0.006
lee-0.006
elling-0.006
Token and
Token ID290
Feature activation+5.07e-02

Pos logit contributions
ills+0.019
ares+0.019
arted+0.017
elling+0.017
vell+0.017
Neg logit contributions
 Fl-0.016
 chip-0.016
ggy-0.016
 Wh-0.016
 cookie-0.015
Token dad
Token ID9955
Feature activation+1.47e+00

Pos logit contributions
ills+0.002
ares+0.001
elling+0.001
hing+0.001
ada+0.001
Neg logit contributions
ggy-0.001
 surgery-0.001
 Fl-0.001
 Wh-0.001
 chip-0.001
Token told
Token ID1297
Feature activation+5.14e+00

Pos logit contributions
ills+0.053
ares+0.052
elling+0.050
arted+0.049
idges+0.047
Neg logit contributions
 cookie-0.025
 Fl-0.024
ggy-0.024
 kit-0.024
 ha-0.024
Token her
Token ID607
Feature activation+1.16e-01

Pos logit contributions
hing+0.186
vell+0.185
ills+0.182
elling+0.180
ares+0.177
Neg logit contributions
 Wh-0.080
 Wednesday-0.077
 Tuesday-0.076
 Fl-0.073
 Sunday-0.071
Token that
Token ID326
Feature activation-1.02e+00

Pos logit contributions
ills+0.003
ares+0.003
ada+0.003
lee+0.003
ly+0.003
Neg logit contributions
rob-0.003
Spot-0.003
ggy-0.003
Fl-0.003
 surgery-0.003
Token she
Token ID673
Feature activation+2.36e-01

Pos logit contributions
ggy+0.025
bes+0.024
rob+0.024
 surgery+0.023
Spot+0.023
Neg logit contributions
ills-0.030
idges-0.026
hing-0.025
 closely-0.025
ares-0.025
Token was
Token ID373
Feature activation-2.65e-01

Pos logit contributions
ills+0.008
ares+0.007
ada+0.007
hing+0.007
elling+0.007
Neg logit contributions
ggy-0.005
 surgery-0.005
 Fl-0.005
 Wh-0.005
 chip-0.005
Token going
Token ID1016
Feature activation-3.45e-01

Pos logit contributions
ggy+0.009
 surgery+0.009
rob+0.009
bes+0.009
Fl+0.009
Neg logit contributions
ills-0.005
ada-0.005
ares-0.005
ax-0.004
elling-0.004
Token!
Token ID0
Feature activation-5.77e-01

Pos logit contributions
ggy+0.036
 surgery+0.035
 Wh+0.034
 chip+0.034
 Fl+0.034
Neg logit contributions
ills-0.038
ares-0.035
ada-0.034
ax-0.033
hing-0.033
Token<|endoftext|>
Token ID50256
Feature activation-2.26e-01

Pos logit contributions
ggy+0.012
 surgery+0.011
rob+0.011
bes+0.011
 clinic+0.011
Neg logit contributions
ills-0.019
ax-0.018
ada-0.018
ares-0.018
lee-0.018
TokenS
Token ID50
Feature activation+3.45e+00

Pos logit contributions
ggy+0.005
 surgery+0.005
 chip+0.005
bes+0.005
 clinic+0.005
Neg logit contributions
ills-0.007
ada-0.007
hing-0.006
ax-0.006
ares-0.006
Tokenully
Token ID2132
Feature activation+1.25e+00

Pos logit contributions
ills+0.113
hing+0.107
ares+0.107
ivated+0.103
arted+0.103
Neg logit contributions
ggy-0.070
 surgery-0.068
 Wh-0.066
 chip-0.066
 Fl-0.065
Token and
Token ID290
Feature activation+1.19e+00

Pos logit contributions
ills+0.037
ares+0.035
ada+0.034
elling+0.033
hing+0.033
Neg logit contributions
ggy-0.031
 surgery-0.029
 Fl-0.028
 Wh-0.028
 chip-0.028
Token J
Token ID449
Feature activation+5.13e+00

Pos logit contributions
ills+0.036
ada+0.033
ares+0.033
ax+0.032
hing+0.032
Neg logit contributions
ggy-0.029
 surgery-0.028
bes-0.027
 chip-0.027
 cookie-0.026
Tokenody
Token ID1118
Feature activation+2.31e+00

Pos logit contributions
ivated+0.097
ares+0.094
ills+0.091
rant+0.089
elling+0.086
Neg logit contributions
rob-0.156
 Wh-0.151
K-0.150
ggy-0.149
Spot-0.143
Token were
Token ID547
Feature activation+1.05e+00

Pos logit contributions
ills+0.074
ares+0.071
elling+0.067
idges+0.065
ivated+0.065
Neg logit contributions
ggy-0.052
 Fl-0.049
 Wh-0.047
Spot-0.047
rob-0.046
Token good
Token ID922
Feature activation+2.57e-01

Pos logit contributions
ills+0.033
ares+0.030
ada+0.029
elling+0.029
hing+0.029
Neg logit contributions
ggy-0.025
 surgery-0.024
 Fl-0.023
 Wh-0.023
 chip-0.023
Token friends
Token ID2460
Feature activation+1.28e+00

Pos logit contributions
ills+0.008
ares+0.008
hing+0.008
elling+0.008
ada+0.007
Neg logit contributions
ggy-0.006
 surgery-0.005
 Wh-0.005
 chip-0.005
 Fl-0.005
Token.
Token ID13
Feature activation+3.01e+00

Pos logit contributions
ills+0.045
ares+0.043
elling+0.042
ada+0.041
hing+0.041
Neg logit contributions
ggy-0.025
 Fl-0.022
 surgery-0.022
 Wh-0.022
Spot-0.022
Token she
Token ID673
Feature activation+6.38e-01

Pos logit contributions
rob+0.024
ggy+0.023
bes+0.023
 surgery+0.023
 clinic+0.022
Neg logit contributions
ly-0.022
ills-0.021
ivated-0.021
 curiously-0.020
 contain-0.020
Token saw
Token ID2497
Feature activation+4.14e-01

Pos logit contributions
ills+0.015
ares+0.013
ada+0.013
elling+0.012
ax+0.012
Neg logit contributions
ggy-0.020
 surgery-0.020
 chip-0.019
 Fl-0.019
rob-0.019
Token something
Token ID1223
Feature activation+3.01e+00

Pos logit contributions
ills+0.015
ares+0.015
ada+0.014
elling+0.014
hing+0.014
Neg logit contributions
ggy-0.008
 surgery-0.007
 Wh-0.007
 Fl-0.007
 chip-0.007
Token colourful
Token ID44799
Feature activation+2.15e+00

Pos logit contributions
ills+0.090
ares+0.082
hing+0.080
ada+0.079
elling+0.079
Neg logit contributions
ggy-0.082
 Fl-0.074
 surgery-0.073
 chip-0.073
 cookie-0.072
Token pe
Token ID613
Feature activation+1.44e+00

Pos logit contributions
ills+0.065
ares+0.060
ada+0.058
elling+0.057
ax+0.057
Neg logit contributions
ggy-0.053
 surgery-0.050
 Fl-0.049
 chip-0.049
 Wh-0.048
Tokeneking
Token ID18754
Feature activation+5.10e+00

Pos logit contributions
ills+0.043
ares+0.040
ada+0.040
hing+0.040
arted+0.039
Neg logit contributions
ggy-0.036
ves-0.031
rob-0.031
 surgery-0.031
bes-0.031
Token out
Token ID503
Feature activation+3.24e+00

Pos logit contributions
ills+0.150
idges+0.139
ody+0.134
elling+0.133
ax+0.132
Neg logit contributions
 Fl-0.110
ggy-0.107
 Wh-0.103
Fl-0.103
tr-0.101
Token from
Token ID422
Feature activation-3.85e-02

Pos logit contributions
ills+0.104
ada+0.100
ody+0.099
ares+0.098
ax+0.097
Neg logit contributions
 Fl-0.066
 surgery-0.064
 chip-0.062
ggy-0.061
 Wednesday-0.061
Token the
Token ID262
Feature activation+1.79e+00

Pos logit contributions
ggy+0.001
 surgery+0.001
 chip+0.001
 Wh+0.001
 Fl+0.001
Neg logit contributions
ills-0.001
ares-0.001
ada-0.001
elling-0.001
hing-0.001
Token grass
Token ID8701
Feature activation+8.33e-01

Pos logit contributions
ills+0.062
ares+0.059
ada+0.055
hing+0.055
ax+0.055
Neg logit contributions
ggy-0.037
 surgery-0.035
 Fl-0.035
 chip-0.034
 Wh-0.034
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
ills+0.028
ares+0.026
ada+0.025
elling+0.025
hing+0.025
Neg logit contributions
ggy-0.018
 surgery-0.017
 Fl-0.016
 Wh-0.016
 chip-0.016
Token named
Token ID3706
Feature activation+2.05e+00

Pos logit contributions
ills+0.010
ares+0.009
hing+0.009
ada+0.009
elling+0.009
Neg logit contributions
 surgery-0.006
ggy-0.006
 Maple-0.006
 chip-0.006
rob-0.006
Token Cr
Token ID3864
Feature activation+4.92e+00

Pos logit contributions
ills+0.060
ares+0.055
ada+0.052
elling+0.052
hing+0.052
Neg logit contributions
ggy-0.054
 Fl-0.052
 surgery-0.051
 Wh-0.051
 chip-0.050
Tokenust
Token ID436
Feature activation+1.68e+00

Pos logit contributions
hing+0.091
ivated+0.088
ills+0.086
elling+0.086
arted+0.085
Neg logit contributions
ggy-0.162
S-0.150
Spot-0.147
ork-0.146
urse-0.145
Tokeny
Token ID88
Feature activation+1.17e+00

Pos logit contributions
ills+0.038
ares+0.034
ada+0.033
ax+0.032
elling+0.032
Neg logit contributions
ggy-0.053
 surgery-0.051
 Fl-0.050
 Wh-0.050
 chip-0.049
Token.
Token ID13
Feature activation+3.11e+00

Pos logit contributions
ills+0.038
ares+0.035
ada+0.034
elling+0.034
ax+0.033
Neg logit contributions
ggy-0.026
 surgery-0.025
 Wh-0.023
rob-0.023
 chip-0.023
Token Cr
Token ID3864
Feature activation+5.06e+00

Pos logit contributions
ills+0.095
ares+0.085
hing+0.083
elling+0.083
ada+0.082
Neg logit contributions
 Wh-0.075
 Fl-0.073
 chip-0.073
ggy-0.072
 surgery-0.070
Tokenust
Token ID436
Feature activation+1.70e+00

Pos logit contributions
hing+0.092
ivated+0.089
ills+0.085
 anticipation+0.084
 contain+0.084
Neg logit contributions
ggy-0.160
ork-0.150
S-0.149
Spot-0.147
urse-0.144
Tokeny
Token ID88
Feature activation+9.67e-01

Pos logit contributions
ills+0.042
ada+0.038
ares+0.038
ax+0.037
idges+0.037
Neg logit contributions
ggy-0.051
 Fl-0.047
 surgery-0.047
Spot-0.046
 Maple-0.046
Token lived
Token ID5615
Feature activation+4.60e-01

Pos logit contributions
ills+0.026
lee+0.023
ly+0.023
ada+0.023
hing+0.022
Neg logit contributions
 surgery-0.027
 chip-0.024
 cookie-0.024
rob-0.024
ggy-0.024
Token in
Token ID287
Feature activation+1.45e-01

Pos logit contributions
ills+0.014
ares+0.013
ada+0.012
elling+0.012
hing+0.012
Neg logit contributions
ggy-0.012
 surgery-0.011
 Fl-0.011
 Wh-0.011
Spot-0.011
Token a
Token ID257
Feature activation+4.23e-01

Pos logit contributions
ills+0.005
ares+0.004
hing+0.004
ada+0.004
elling+0.004
Neg logit contributions
ggy-0.003
 surgery-0.003
bes-0.003
ves-0.003
rob-0.003
Token
Token ID198
Feature activation-4.33e-01

Pos logit contributions
ggy+0.008
 surgery+0.007
rob+0.007
 chip+0.006
 Maple+0.006
Neg logit contributions
ills-0.021
ares-0.020
ada-0.020
ax-0.019
elling-0.019
Token
Token ID198
Feature activation+6.52e-01

Pos logit contributions
ggy+0.007
 surgery+0.007
rob+0.006
 chip+0.006
 cookie+0.006
Neg logit contributions
ills-0.017
ares-0.016
ada-0.015
ax-0.015
elling-0.015
TokenL
Token ID43
Feature activation+2.92e+00

Pos logit contributions
ills+0.024
ares+0.023
ada+0.022
elling+0.022
hing+0.022
Neg logit contributions
ggy-0.011
 surgery-0.011
 Fl-0.010
 chip-0.010
 Wh-0.010
Tokenily
Token ID813
Feature activation-8.45e-01

Pos logit contributions
ills+0.093
hing+0.090
 contain+0.087
ivated+0.086
ax+0.084
Neg logit contributions
ggy-0.061
 surgery-0.054
Fl-0.053
 Velvet-0.053
 Wh-0.053
Token and
Token ID290
Feature activation+2.15e+00

Pos logit contributions
ggy+0.023
 surgery+0.021
 chip+0.021
 Fl+0.021
 Wh+0.021
Neg logit contributions
ills-0.024
ares-0.023
ada-0.021
elling-0.021
hing-0.021
Token Ros
Token ID10018
Feature activation+5.05e+00

Pos logit contributions
ills+0.065
ares+0.060
elling+0.058
ax+0.057
hing+0.057
Neg logit contributions
 Wh-0.054
 Fl-0.053
ggy-0.052
 surgery-0.051
 chip-0.051
Tokenie
Token ID494
Feature activation+2.29e-01

Pos logit contributions
ares+0.111
 contain+0.109
ax+0.109
 wanna+0.107
ivated+0.107
Neg logit contributions
 kit-0.140
Fl-0.138
rob-0.136
 Wh-0.136
 Fl-0.132
Token went
Token ID1816
Feature activation-1.15e+00

Pos logit contributions
ills+0.007
ares+0.007
ax+0.007
elling+0.007
ada+0.007
Neg logit contributions
ggy-0.005
 Wh-0.005
 surgery-0.005
 Fl-0.005
 chip-0.005
Token to
Token ID284
Feature activation-1.10e+00

Pos logit contributions
ggy+0.028
 surgery+0.027
 Wh+0.026
 Fl+0.026
 chip+0.026
Neg logit contributions
ills-0.035
ares-0.032
ada-0.031
elling-0.031
hing-0.031
Token the
Token ID262
Feature activation+5.97e-01

Pos logit contributions
ggy+0.025
 surgery+0.024
 Wh+0.024
 Fl+0.024
 chip+0.023
Neg logit contributions
ills-0.036
ares-0.033
elling-0.032
ax-0.032
hing-0.032
Token kitchen
Token ID9592
Feature activation-9.94e-01

Pos logit contributions
ills+0.017
ada+0.016
elling+0.016
ares+0.016
ivated+0.015
Neg logit contributions
ggy-0.014
rob-0.013
ves-0.013
bes-0.013
 surgery-0.013
Token the
Token ID262
Feature activation+2.22e-01

Pos logit contributions
ills+0.053
ares+0.051
hing+0.049
elling+0.048
arted+0.048
Neg logit contributions
ggy-0.026
 surgery-0.025
 Fl-0.025
 Wh-0.025
 chip-0.025
Token movie
Token ID3807
Feature activation-1.11e+00

Pos logit contributions
ills+0.009
ares+0.009
ada+0.008
elling+0.008
hing+0.008
Neg logit contributions
ggy-0.003
 surgery-0.003
 Wh-0.003
 Fl-0.003
 chip-0.003
Token for
Token ID329
Feature activation+4.95e-01

Pos logit contributions
ggy+0.028
 surgery+0.028
 Wh+0.027
Spot+0.027
rob+0.026
Neg logit contributions
ills-0.031
ares-0.029
ada-0.028
ax-0.028
hing-0.027
Token her
Token ID607
Feature activation-4.33e-01

Pos logit contributions
ills+0.015
ares+0.014
ada+0.014
ax+0.013
lee+0.013
Neg logit contributions
ggy-0.012
 surgery-0.011
Spot-0.011
 chip-0.011
rob-0.011
Token.
Token ID13
Feature activation+1.76e+00

Pos logit contributions
ggy+0.010
 surgery+0.010
 Wh+0.010
rob+0.010
Spot+0.010
Neg logit contributions
ills-0.013
ares-0.012
ada-0.012
elling-0.012
ax-0.012
Token J
Token ID449
Feature activation+5.05e+00

Pos logit contributions
ills+0.062
hing+0.058
ares+0.057
idges+0.056
elling+0.055
Neg logit contributions
 Wh-0.032
 Fl-0.031
 chip-0.031
 surgery-0.029
 Maple-0.029
Tokenod
Token ID375
Feature activation+3.19e+00

Pos logit contributions
hing+0.123
ivated+0.113
 contain+0.113
 wanna+0.110
renched+0.106
Neg logit contributions
ggy-0.133
ork-0.122
S-0.116
urse-0.115
 chip-0.115
Tokenie
Token ID494
Feature activation-8.10e-01

Pos logit contributions
ivated+0.049
ills+0.048
ares+0.046
ada+0.043
 contain+0.043
Neg logit contributions
ggy-0.123
ves-0.114
rob-0.111
Fl-0.106
elope-0.106
Token and
Token ID290
Feature activation+1.11e+00

Pos logit contributions
 surgery+0.020
ggy+0.020
rob+0.019
 Wh+0.019
Spot+0.019
Neg logit contributions
ills-0.023
lee-0.021
ada-0.021
ares-0.021
ax-0.021
Token her
Token ID607
Feature activation-4.18e-01

Pos logit contributions
ills+0.026
ada+0.025
ares+0.024
lee+0.024
arted+0.023
Neg logit contributions
ggy-0.034
 surgery-0.033
rob-0.033
Spot-0.032
 Maple-0.032
Token mom
Token ID1995
Feature activation+7.52e-01

Pos logit contributions
ggy+0.009
 surgery+0.009
rob+0.008
 Wh+0.008
 Maple+0.008
Neg logit contributions
ills-0.014
ares-0.013
ada-0.013
elling-0.012
ax-0.012
TokenSoon
Token ID28093
Feature activation+3.44e-01

Pos logit contributions
ills+0.061
ares+0.056
ada+0.055
ax+0.054
hing+0.054
Neg logit contributions
ggy-0.033
 surgery-0.032
 Fl-0.030
 Wh-0.030
 chip-0.030
Token enough
Token ID1576
Feature activation+8.62e-01

Pos logit contributions
ills+0.011
ares+0.010
ada+0.010
ax+0.010
lee+0.010
Neg logit contributions
ggy-0.008
 surgery-0.008
bes-0.008
Fl-0.007
rob-0.007
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
ills+0.027
ares+0.024
ada+0.024
ax+0.024
lee+0.024
Neg logit contributions
ggy-0.020
bes-0.019
rob-0.019
Fl-0.019
 surgery-0.019
Token all
Token ID477
Feature activation+2.16e+00

Pos logit contributions
ills+0.036
ares+0.033
ada+0.032
elling+0.032
hing+0.032
Neg logit contributions
ggy-0.025
 surgery-0.024
 Wh-0.023
 Fl-0.023
 chip-0.023
Token of
Token ID286
Feature activation-1.85e-01

Pos logit contributions
ills+0.064
ares+0.059
ada+0.057
elling+0.056
ax+0.055
Neg logit contributions
ggy-0.055
 surgery-0.052
 Fl-0.051
 Wh-0.051
 chip-0.051
Token S
Token ID311
Feature activation+5.03e+00

Pos logit contributions
ggy+0.004
 surgery+0.004
 chip+0.004
Spot+0.004
 cookie+0.004
Neg logit contributions
ills-0.006
ares-0.005
ada-0.005
hing-0.005
ax-0.005
Tokenully
Token ID2132
Feature activation+1.11e+00

Pos logit contributions
ivated+0.100
ares+0.096
ills+0.095
 contain+0.094
 curiously+0.089
Neg logit contributions
 Wh-0.153
ggy-0.150
rob-0.150
Spot-0.144
 surgery-0.143
Token's
Token ID338
Feature activation+2.05e-01

Pos logit contributions
ills+0.031
ares+0.029
ada+0.028
elling+0.028
hing+0.027
Neg logit contributions
ggy-0.029
 surgery-0.028
 Fl-0.027
 Wh-0.027
 chip-0.027
Token friends
Token ID2460
Feature activation+1.29e+00

Pos logit contributions
ills+0.006
ares+0.006
ada+0.006
elling+0.006
hing+0.006
Neg logit contributions
ggy-0.005
 surgery-0.005
 Fl-0.005
 chip-0.005
 Wh-0.005
Token started
Token ID2067
Feature activation-6.48e-01

Pos logit contributions
ills+0.038
ares+0.035
elling+0.035
ada+0.032
idges+0.032
Neg logit contributions
ggy-0.033
 Fl-0.031
 surgery-0.030
 Wh-0.030
 cookie-0.030
Token arriving
Token ID14820
Feature activation+4.80e-01

Pos logit contributions
ggy+0.018
 surgery+0.018
Spot+0.017
rob+0.017
Fl+0.017
Neg logit contributions
ills-0.017
ares-0.015
ada-0.015
arted-0.015
lee-0.015
Token two
Token ID734
Feature activation+4.91e-01

Pos logit contributions
ggy+0.018
 surgery+0.017
 Fl+0.016
 Wh+0.016
 chip+0.016
Neg logit contributions
ills-0.029
ares-0.027
ada-0.026
elling-0.026
hing-0.026
Token pieces
Token ID5207
Feature activation+1.06e+00

Pos logit contributions
ills+0.016
ares+0.016
ada+0.015
elling+0.015
hing+0.015
Neg logit contributions
ggy-0.010
 chip-0.010
 cookie-0.010
 Fl-0.010
 surgery-0.010
Token!
Token ID0
Feature activation-7.48e-02

Pos logit contributions
ills+0.035
elling+0.035
ada+0.034
ares+0.034
idget+0.033
Neg logit contributions
ggy-0.021
 chip-0.018
 cookie-0.018
 surgery-0.018
 Fl-0.018
Token Max
Token ID5436
Feature activation-3.80e-01

Pos logit contributions
ggy+0.002
 surgery+0.002
rob+0.001
 clinic+0.001
 chip+0.001
Neg logit contributions
ills-0.002
ares-0.002
ada-0.002
arted-0.002
elling-0.002
Token and
Token ID290
Feature activation+2.47e+00

Pos logit contributions
ggy+0.008
 surgery+0.008
 Fl+0.008
 Wh+0.008
 chip+0.008
Neg logit contributions
ills-0.013
ares-0.012
ada-0.011
elling-0.011
hing-0.011
Token Ros
Token ID10018
Feature activation+5.03e+00

Pos logit contributions
ills+0.064
ares+0.058
ada+0.057
elling+0.055
hing+0.055
Neg logit contributions
ggy-0.072
 surgery-0.069
 chip-0.066
 Fl-0.066
Spot-0.065
Tokenie
Token ID494
Feature activation+4.38e-01

Pos logit contributions
ivated+0.105
ares+0.104
 contain+0.097
elling+0.094
 wanna+0.094
Neg logit contributions
Fl-0.148
ggy-0.146
 Wh-0.146
K-0.144
 Fl-0.140
Token were
Token ID547
Feature activation+4.98e-01

Pos logit contributions
ills+0.013
ares+0.013
elling+0.012
ada+0.012
ax+0.012
Neg logit contributions
ggy-0.011
 chip-0.010
 Fl-0.010
 Wh-0.010
 cookie-0.010
Token so
Token ID523
Feature activation-2.52e-02

Pos logit contributions
ills+0.013
ares+0.012
ada+0.012
elling+0.012
hing+0.011
Neg logit contributions
ggy-0.014
 Fl-0.013
 surgery-0.013
 chip-0.013
 Wh-0.013
Token happy
Token ID3772
Feature activation-2.51e-01

Pos logit contributions
ggy+0.001
 chip+0.001
 cookie+0.001
 Fl+0.001
 Wh+0.000
Neg logit contributions
ills-0.001
ares-0.001
elling-0.001
ada-0.001
ly-0.001
Token they
Token ID484
Feature activation+2.29e+00

Pos logit contributions
ggy+0.006
 surgery+0.006
 Fl+0.006
 Wh+0.006
 chip+0.006
Neg logit contributions
ills-0.007
ares-0.007
ada-0.007
elling-0.006
ax-0.006
Token girl
Token ID2576
Feature activation+1.31e-01

Pos logit contributions
ggy+0.014
bes+0.013
 surgery+0.013
ves+0.012
rob+0.011
Neg logit contributions
ills-0.017
ada-0.016
elling-0.015
ly-0.015
ory-0.015
Token named
Token ID3706
Feature activation+2.60e+00

Pos logit contributions
ills+0.004
lee+0.003
ada+0.003
elling+0.003
ax+0.003
Neg logit contributions
ggy-0.003
 surgery-0.003
 chip-0.003
rob-0.003
Spot-0.003
Token Jenny
Token ID28437
Feature activation+6.84e-01

Pos logit contributions
ills+0.098
ares+0.092
hing+0.089
elling+0.088
ax+0.087
Neg logit contributions
ggy-0.045
 Fl-0.043
 Wh-0.042
 surgery-0.042
 Maple-0.040
Token was
Token ID373
Feature activation+1.21e+00

Pos logit contributions
ills+0.022
ares+0.020
ada+0.020
elling+0.020
ax+0.020
Neg logit contributions
ggy-0.015
 surgery-0.015
 chip-0.014
 Wh-0.014
Spot-0.014
Token running
Token ID2491
Feature activation+4.07e-01

Pos logit contributions
ills+0.031
ada+0.028
ivated+0.028
ares+0.027
arted+0.027
Neg logit contributions
ggy-0.034
rob-0.034
 surgery-0.033
ves-0.033
Spot-0.032
Token gigg
Token ID30442
Feature activation+5.02e+00

Pos logit contributions
ills+0.009
ares+0.008
ada+0.007
elling+0.007
hing+0.007
Neg logit contributions
ggy-0.014
 surgery-0.013
 Fl-0.013
 Wh-0.013
Spot-0.013
Tokenling
Token ID1359
Feature activation+9.00e-01

Pos logit contributions
ills+0.097
enna+0.090
 recognise+0.089
 contain+0.087
idges+0.083
Neg logit contributions
ggy-0.151
 fle-0.138
Spot-0.138
 Maple-0.138
Fl-0.137
Token through
Token ID832
Feature activation-6.92e-01

Pos logit contributions
ills+0.026
ares+0.024
ada+0.024
hing+0.023
arted+0.023
Neg logit contributions
ggy-0.023
 surgery-0.022
rob-0.021
 chip-0.021
 Wh-0.021
Token a
Token ID257
Feature activation-1.97e-01

Pos logit contributions
ggy+0.016
 surgery+0.015
rob+0.015
 chip+0.015
Spot+0.015
Neg logit contributions
ills-0.022
ares-0.020
ada-0.019
lee-0.019
arted-0.019
Token me
Token ID502
Feature activation-4.32e-01

Pos logit contributions
ggy+0.005
 surgery+0.005
rob+0.005
 Wh+0.005
 Fl+0.005
Neg logit contributions
ills-0.006
ares-0.005
ada-0.005
elling-0.005
hing-0.005
Tokenadow
Token ID4584
Feature activation+4.30e-01

Pos logit contributions
ggy+0.015
 surgery+0.014
 chip+0.014
 Fl+0.014
 Wh+0.014
Neg logit contributions
ills-0.009
ares-0.008
ada-0.008
ax-0.007
hing-0.007
Token watched
Token ID7342
Feature activation+2.47e+00

Pos logit contributions
ills+0.009
ares+0.008
ada+0.008
ax+0.008
hing+0.007
Neg logit contributions
ggy-0.007
 surgery-0.007
rob-0.007
Spot-0.007
bes-0.007
Token it
Token ID340
Feature activation+1.06e+00

Pos logit contributions
ills+0.083
ares+0.077
ada+0.076
elling+0.076
ax+0.074
Neg logit contributions
ggy-0.053
 Fl-0.049
 surgery-0.049
 Wh-0.048
 chip-0.048
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
ills+0.026
ares+0.024
ada+0.023
elling+0.023
ax+0.022
Neg logit contributions
ggy-0.032
 surgery-0.030
 Fl-0.030
 Wh-0.029
 chip-0.029
Token
Token ID198
Feature activation+1.16e+00

Pos logit contributions
ills+0.052
hing+0.048
ares+0.048
elling+0.046
idges+0.046
Neg logit contributions
 Wh-0.031
ggy-0.030
 Fl-0.030
 surgery-0.030
 chip-0.029
Token
Token ID198
Feature activation+2.66e+00

Pos logit contributions
ills+0.051
ares+0.048
ada+0.047
elling+0.047
hing+0.047
Neg logit contributions
ggy-0.013
 surgery-0.011
 Fl-0.010
 Wh-0.010
 chip-0.010
TokenJ
Token ID41
Feature activation+5.02e+00

Pos logit contributions
ills+0.104
elling+0.104
ares+0.101
ly+0.100
hing+0.098
Neg logit contributions
ggy-0.040
S-0.034
ves-0.033
Spot-0.033
K-0.032
Tokenod
Token ID375
Feature activation+3.95e+00

Pos logit contributions
ivated+0.113
hing+0.107
 contain+0.103
 wanna+0.094
renched+0.093
Neg logit contributions
ggy-0.141
ork-0.138
tr-0.132
S-0.131
urse-0.129
Tokenie
Token ID494
Feature activation+1.63e-01

Pos logit contributions
ills+0.061
ares+0.060
ivated+0.059
ada+0.056
ax+0.054
Neg logit contributions
ggy-0.151
ves-0.138
rob-0.137
elope-0.134
Fl-0.134
Token was
Token ID373
Feature activation+8.30e-01

Pos logit contributions
ills+0.005
ares+0.004
ada+0.004
ax+0.004
lee+0.004
Neg logit contributions
ggy-0.004
 surgery-0.004
 Wh-0.004
 chip-0.004
 Fl-0.004
Token amazed
Token ID24562
Feature activation+2.89e+00

Pos logit contributions
ills+0.018
ares+0.016
ada+0.016
ax+0.016
arted+0.015
Neg logit contributions
 surgery-0.027
ggy-0.026
rob-0.026
Spot-0.025
 chip-0.025
Token by
Token ID416
Feature activation+5.23e-01

Pos logit contributions
hing+0.110
ares+0.106
ills+0.105
ada+0.104
idget+0.099
Neg logit contributions
 Wednesday-0.041
ggy-0.040
 Sunday-0.039
 Tuesday-0.039
 Fl-0.038
Token there
Token ID612
Feature activation+1.30e+00

Pos logit contributions
ills+0.001
ax+0.001
ada+0.001
 Jackie+0.001
 curiously+0.001
Neg logit contributions
rob-0.002
 surgery-0.002
ggy-0.002
 chip-0.001
bes-0.001
Token was
Token ID373
Feature activation-1.21e-01

Pos logit contributions
ills+0.045
ares+0.042
ada+0.041
ax+0.041
hing+0.040
Neg logit contributions
ggy-0.025
 surgery-0.025
 chip-0.023
rob-0.023
 Wh-0.023
Token a
Token ID257
Feature activation-2.78e-01

Pos logit contributions
rob+0.003
ggy+0.003
bes+0.003
 chip+0.003
 surgery+0.003
Neg logit contributions
ills-0.003
ax-0.003
arted-0.003
years-0.003
ada-0.003
Token girl
Token ID2576
Feature activation+4.31e-01

Pos logit contributions
rob+0.008
 surgery+0.008
bes+0.008
ggy+0.008
 Wh+0.007
Neg logit contributions
ills-0.006
ada-0.006
ory-0.006
ax-0.006
ares-0.006
Token named
Token ID3706
Feature activation+1.46e+00

Pos logit contributions
ills+0.013
ada+0.012
lee+0.012
ares+0.012
ly+0.011
Neg logit contributions
 surgery-0.010
rob-0.010
ggy-0.010
Spot-0.010
 chip-0.009
Token J
Token ID449
Feature activation+5.02e+00

Pos logit contributions
ills+0.046
ada+0.045
ares+0.043
arted+0.043
lee+0.042
Neg logit contributions
ggy-0.034
 surgery-0.032
rob-0.032
bes-0.031
 chip-0.030
Tokenilly
Token ID6548
Feature activation+1.30e-01

Pos logit contributions
ills+0.064
hing+0.060
ares+0.057
ivated+0.056
 contain+0.053
Neg logit contributions
ggy-0.190
rob-0.182
 Wh-0.179
 Maple-0.176
S-0.174
Token.
Token ID13
Feature activation+2.83e+00

Pos logit contributions
ills+0.004
ada+0.003
lee+0.003
ax+0.003
ly+0.003
Neg logit contributions
 surgery-0.004
rob-0.003
ggy-0.003
bes-0.003
Spot-0.003
Token J
Token ID449
Feature activation+4.17e+00

Pos logit contributions
ills+0.079
hing+0.074
ares+0.073
elling+0.071
ly+0.071
Neg logit contributions
 Wh-0.071
 Fl-0.069
 chip-0.065
 Maple-0.065
ggy-0.065
Tokenilly
Token ID6548
Feature activation+8.77e-02

Pos logit contributions
hing+0.049
ares+0.047
ills+0.046
ivated+0.045
 contain+0.045
Neg logit contributions
ggy-0.159
rob-0.153
S-0.150
 Wh-0.150
 surgery-0.150
Token was
Token ID373
Feature activation+1.07e+00

Pos logit contributions
ills+0.003
ada+0.002
lee+0.002
ax+0.002
ares+0.002
Neg logit contributions
 surgery-0.002
ggy-0.002
rob-0.002
 chip-0.002
 cookie-0.002
Token play
Token ID711
Feature activation-5.54e-01

Pos logit contributions
ills+0.030
ares+0.027
ada+0.027
ax+0.026
elling+0.026
Neg logit contributions
ggy-0.029
 surgery-0.028
bes-0.027
rob-0.027
Spot-0.027
Token.
Token ID13
Feature activation+7.76e-01

Pos logit contributions
 surgery+0.016
 chip+0.015
elope+0.014
Pe+0.014
 Maple+0.014
Neg logit contributions
ills-0.013
ares-0.012
hing-0.012
ly-0.012
ie-0.011
Token
Token ID220
Feature activation+2.73e-01

Pos logit contributions
ills+0.021
ares+0.021
ada+0.020
ax+0.019
arted+0.019
Neg logit contributions
ggy-0.021
 surgery-0.020
rob-0.020
bes-0.018
ves-0.018
Token
Token ID198
Feature activation+3.36e-01

Pos logit contributions
ills+0.011
ares+0.010
ada+0.010
ax+0.010
elling+0.010
Neg logit contributions
ggy-0.004
 surgery-0.004
 chip-0.004
rob-0.004
 cookie-0.004
Token
Token ID198
Feature activation+6.49e-01

Pos logit contributions
ills+0.012
ares+0.011
ax+0.011
ada+0.011
arted+0.011
Neg logit contributions
 surgery-0.006
ggy-0.006
rob-0.005
 chip-0.005
 cookie-0.005
TokenBel
Token ID12193
Feature activation+5.01e+00

Pos logit contributions
ills+0.021
ares+0.019
ada+0.019
ax+0.019
hing+0.018
Neg logit contributions
 surgery-0.014
ggy-0.014
rob-0.014
 chip-0.013
 cookie-0.013
Tokenle
Token ID293
Feature activation+1.58e+00

Pos logit contributions
ills+0.084
ares+0.074
ada+0.071
ax+0.066
elling+0.064
Neg logit contributions
ggy-0.190
 surgery-0.180
rob-0.176
Fl-0.176
 Maple-0.175
Token was
Token ID373
Feature activation+5.47e-01

Pos logit contributions
ills+0.054
ares+0.051
ada+0.049
elling+0.049
ax+0.048
Neg logit contributions
ggy-0.033
 surgery-0.030
 Fl-0.029
 Wh-0.029
 chip-0.029
Token proud
Token ID6613
Feature activation+1.20e+00

Pos logit contributions
ills+0.013
ares+0.011
arted+0.011
ada+0.011
ax+0.011
Neg logit contributions
 surgery-0.017
ggy-0.017
rob-0.017
 chip-0.016
Spot-0.016
Token of
Token ID286
Feature activation-4.48e-01

Pos logit contributions
ills+0.027
ares+0.025
ada+0.024
hing+0.023
elling+0.023
Neg logit contributions
ggy-0.038
 surgery-0.037
 Fl-0.036
 chip-0.036
 Wh-0.036
Token her
Token ID607
Feature activation+4.17e-01

Pos logit contributions
ggy+0.013
 surgery+0.013
 chip+0.013
Spot+0.013
tr+0.012
Neg logit contributions
ills-0.011
ares-0.010
arted-0.010
lee-0.010
ada-0.009
Token
Token ID198
Feature activation+9.58e-01

Pos logit contributions
ills+0.051
hing+0.048
ares+0.048
elling+0.046
ada+0.046
Neg logit contributions
 Wh-0.028
 Fl-0.027
 surgery-0.027
ggy-0.026
 Maple-0.026
Token
Token ID198
Feature activation+2.39e+00

Pos logit contributions
ills+0.041
ares+0.039
ada+0.038
elling+0.037
ax+0.037
Neg logit contributions
ggy-0.012
 surgery-0.011
 Fl-0.010
 chip-0.010
 Wh-0.010
TokenOne
Token ID3198
Feature activation+5.22e-01

Pos logit contributions
ills+0.102
elling+0.098
ares+0.098
hing+0.097
ada+0.096
Neg logit contributions
ggy-0.027
 Wh-0.024
K-0.022
Spot-0.022
S-0.022
Token day
Token ID1110
Feature activation+1.13e+00

Pos logit contributions
ills+0.021
ares+0.019
ada+0.019
elling+0.019
ax+0.019
Neg logit contributions
ggy-0.008
 surgery-0.007
rob-0.007
bes-0.007
 Wh-0.007
Token,
Token ID11
Feature activation+2.51e+00

Pos logit contributions
ills+0.043
ares+0.040
elling+0.039
hing+0.039
ada+0.039
Neg logit contributions
ggy-0.020
 surgery-0.018
 Wh-0.018
 Fl-0.018
 chip-0.017
Token J
Token ID449
Feature activation+4.96e+00

Pos logit contributions
ills+0.088
hing+0.084
elling+0.082
ares+0.080
idges+0.077
Neg logit contributions
 Wh-0.049
ggy-0.047
 Fl-0.046
 Wednesday-0.043
 surgery-0.043
Tokenod
Token ID375
Feature activation+3.89e+00

Pos logit contributions
hing+0.114
ivated+0.101
 contain+0.098
arted+0.098
 wanna+0.094
Neg logit contributions
ggy-0.142
ork-0.137
S-0.129
urse-0.127
 chip-0.125
Tokenie
Token ID494
Feature activation-1.22e-01

Pos logit contributions
ares+0.062
ivated+0.062
ills+0.062
ada+0.059
oured+0.056
Neg logit contributions
ggy-0.147
ves-0.133
rob-0.129
elope-0.128
Fl-0.126
Token decided
Token ID3066
Feature activation-5.23e-01

Pos logit contributions
ggy+0.004
 surgery+0.003
 chip+0.003
rob+0.003
 Fl+0.003
Neg logit contributions
ills-0.003
ares-0.003
ada-0.003
elling-0.003
ax-0.003
Token to
Token ID284
Feature activation+6.29e-01

Pos logit contributions
ggy+0.013
 surgery+0.012
 Fl+0.012
 chip+0.012
 Wh+0.011
Neg logit contributions
ills-0.016
ares-0.015
ada-0.014
elling-0.014
ax-0.014
Token gather
Token ID6431
Feature activation+1.13e+00

Pos logit contributions
ills+0.016
ares+0.015
elling+0.014
arted+0.014
ax+0.014
Neg logit contributions
ggy-0.019
 surgery-0.018
 cookie-0.017
 chip-0.017
 Fl-0.017
Token S
Token ID311
Feature activation+4.43e+00

Pos logit contributions
ills+0.020
ares+0.019
ada+0.019
ax+0.019
arted+0.018
Neg logit contributions
ggy-0.009
 surgery-0.009
rob-0.008
 chip-0.008
 cookie-0.008
Tokenully
Token ID2132
Feature activation+3.01e-01

Pos logit contributions
ills+0.049
ares+0.041
hing+0.039
ivated+0.036
arted+0.035
Neg logit contributions
ggy-0.185
rob-0.177
 Wh-0.173
 surgery-0.172
Spot-0.172
Token tried
Token ID3088
Feature activation-5.62e-01

Pos logit contributions
ills+0.009
ares+0.008
ada+0.008
elling+0.008
hing+0.008
Neg logit contributions
ggy-0.007
 surgery-0.007
 Fl-0.007
 Wh-0.007
 chip-0.007
Token to
Token ID284
Feature activation+1.94e-01

Pos logit contributions
ggy+0.013
 surgery+0.012
 Fl+0.012
 Wh+0.012
 chip+0.012
Neg logit contributions
ills-0.018
ares-0.017
ada-0.017
elling-0.016
hing-0.016
Token warn
Token ID9828
Feature activation+2.70e+00

Pos logit contributions
ills+0.006
elling+0.006
ares+0.006
ada+0.005
lee+0.005
Neg logit contributions
ggy-0.005
 Fl-0.005
 Wh-0.005
 surgery-0.005
Spot-0.004
Token J
Token ID449
Feature activation+4.95e+00

Pos logit contributions
ills+0.062
ares+0.056
ada+0.054
elling+0.053
hing+0.052
Neg logit contributions
ggy-0.086
 surgery-0.083
 Fl-0.081
 Wh-0.081
 chip-0.080
Tokenody
Token ID1118
Feature activation+8.26e-01

Pos logit contributions
ivated+0.072
hing+0.071
ills+0.070
 contain+0.070
 curiously+0.065
Neg logit contributions
ggy-0.176
rob-0.168
 Wh-0.162
ork-0.159
ves-0.159
Token to
Token ID284
Feature activation+7.00e-01

Pos logit contributions
ills+0.027
ares+0.025
ada+0.024
ax+0.024
elling+0.024
Neg logit contributions
ggy-0.018
 surgery-0.017
rob-0.017
 chip-0.017
Spot-0.017
Token be
Token ID307
Feature activation-5.56e-01

Pos logit contributions
ills+0.019
ares+0.018
elling+0.017
ada+0.017
arted+0.016
Neg logit contributions
ggy-0.020
 surgery-0.019
 Fl-0.018
 Wh-0.018
Spot-0.018
Token careful
Token ID8161
Feature activation+1.02e-02

Pos logit contributions
rob+0.015
ggy+0.015
 surgery+0.015
 chip+0.015
 cookie+0.014
Neg logit contributions
ills-0.015
ares-0.013
arted-0.013
ada-0.013
ax-0.013
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
ills+0.000
ares+0.000
ada+0.000
elling+0.000
hing+0.000
Neg logit contributions
ggy-0.000
 surgery-0.000
 chip-0.000
 Fl-0.000
 Wh-0.000
Token play
Token ID711
Feature activation+9.77e-01

Pos logit contributions
ills+0.010
ares+0.009
elling+0.009
ax+0.009
ada+0.009
Neg logit contributions
ggy-0.010
 surgery-0.010
 Wh-0.009
 Fl-0.009
 cookie-0.009
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
ills+0.031
ares+0.029
ada+0.028
hing+0.028
elling+0.027
Neg logit contributions
ggy-0.022
 surgery-0.021
 chip-0.020
rob-0.020
 Maple-0.020
Token Her
Token ID2332
Feature activation+5.52e-01

Pos logit contributions
ills+0.042
ares+0.039
hing+0.038
elling+0.038
ax+0.038
Neg logit contributions
ggy-0.020
 Wh-0.019
 surgery-0.019
 Fl-0.019
 chip-0.018
Token friend
Token ID1545
Feature activation+4.63e-01

Pos logit contributions
ills+0.018
ares+0.017
ada+0.017
elling+0.016
hing+0.016
Neg logit contributions
ggy-0.012
 surgery-0.011
 Fl-0.011
 Wh-0.011
 chip-0.011
Token saw
Token ID2497
Feature activation+2.00e+00

Pos logit contributions
ills+0.015
ares+0.014
elling+0.013
hing+0.013
ada+0.013
Neg logit contributions
ggy-0.011
 Wh-0.010
 Fl-0.010
 surgery-0.010
 chip-0.010
Token Ros
Token ID10018
Feature activation+4.95e+00

Pos logit contributions
ills+0.075
 contain+0.072
ares+0.070
hing+0.069
vell+0.069
Neg logit contributions
 Fl-0.037
 Wh-0.037
 Velvet-0.034
 cookie-0.032
 surgery-0.031
Tokenie
Token ID494
Feature activation+8.23e-01

Pos logit contributions
 contain+0.117
arted+0.108
ax+0.108
 wanna+0.107
leaf+0.106
Neg logit contributions
 kit-0.137
 cookie-0.134
bes-0.132
 book-0.130
house-0.129
Token and
Token ID290
Feature activation+3.43e-01

Pos logit contributions
ills+0.025
ares+0.023
ada+0.023
elling+0.023
ax+0.022
Neg logit contributions
ggy-0.020
 surgery-0.019
 Fl-0.018
 chip-0.018
 Wh-0.018
Token became
Token ID2627
Feature activation-8.55e-01

Pos logit contributions
ills+0.011
ares+0.011
elling+0.010
hing+0.010
ax+0.010
Neg logit contributions
ggy-0.008
 Wh-0.008
 Fl-0.008
 chip-0.007
 surgery-0.007
Token en
Token ID551
Feature activation+3.84e+00

Pos logit contributions
ggy+0.024
 surgery+0.023
 Fl+0.022
 chip+0.022
 Wh+0.022
Neg logit contributions
ills-0.023
ares-0.021
ada-0.020
elling-0.019
hing-0.019
Tokenvious
Token ID1442
Feature activation+1.51e+00

Pos logit contributions
ills+0.060
idges+0.052
lee+0.049
 curiously+0.048
 contain+0.048
Neg logit contributions
 Wh-0.132
 cookie-0.132
 surgery-0.131
S-0.131
 fle-0.130
Token soft
Token ID2705
Feature activation+4.94e-01

Pos logit contributions
ills+0.002
ada+0.002
ares+0.002
ax+0.002
hing+0.002
Neg logit contributions
ggy-0.002
 surgery-0.002
rob-0.002
bes-0.002
 Wh-0.002
Token copper
Token ID15317
Feature activation-6.67e-01

Pos logit contributions
ills+0.015
ada+0.014
ares+0.014
hing+0.014
elling+0.013
Neg logit contributions
ggy-0.012
 surgery-0.011
rob-0.010
 Wh-0.010
Spot-0.010
Token coin
Token ID10752
Feature activation-3.17e-01

Pos logit contributions
ggy+0.015
 surgery+0.014
 Wh+0.014
Spot+0.014
bes+0.014
Neg logit contributions
ills-0.021
ada-0.019
ly-0.019
ares-0.019
elling-0.018
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
ggy+0.008
 surgery+0.008
 Wh+0.007
 Fl+0.007
 chip+0.007
Neg logit contributions
ills-0.009
ares-0.009
ada-0.008
ax-0.008
elling-0.008
Token E
Token ID412
Feature activation+4.33e-01

Pos logit contributions
ills+0.003
ada+0.003
ax+0.003
ares+0.003
lee+0.003
Neg logit contributions
ggy-0.003
 surgery-0.003
bes-0.003
rob-0.003
ves-0.003
Tokensm
Token ID5796
Feature activation+4.95e+00

Pos logit contributions
ills+0.013
ares+0.012
ada+0.012
elling+0.012
hing+0.011
Neg logit contributions
ggy-0.011
 surgery-0.010
 Fl-0.010
 Wh-0.010
 chip-0.010
Tokenie
Token ID494
Feature activation-4.22e-01

Pos logit contributions
ills+0.087
 contain+0.084
vell+0.083
ares+0.082
ax+0.079
Neg logit contributions
 Fl-0.161
 Wh-0.159
Fl-0.159
Spot-0.156
 Maple-0.148
Token wanted
Token ID2227
Feature activation+1.35e+00

Pos logit contributions
ggy+0.009
 surgery+0.009
 Fl+0.009
 Wh+0.008
 chip+0.008
Neg logit contributions
ills-0.014
ares-0.013
ada-0.013
elling-0.012
hing-0.012
Token to
Token ID284
Feature activation+8.84e-01

Pos logit contributions
ills+0.043
ares+0.040
ada+0.039
elling+0.038
ax+0.038
Neg logit contributions
ggy-0.032
 surgery-0.030
 Fl-0.029
 Wh-0.029
 chip-0.029
Token pick
Token ID2298
Feature activation+1.32e+00

Pos logit contributions
ills+0.026
ares+0.024
elling+0.024
arted+0.023
ada+0.022
Neg logit contributions
ggy-0.025
 surgery-0.023
 Wh-0.023
 Fl-0.022
 chip-0.022
Token up
Token ID510
Feature activation+1.00e+00

Pos logit contributions
ills+0.047
ares+0.045
hing+0.043
ada+0.043
arted+0.042
Neg logit contributions
ggy-0.025
 Fl-0.024
 surgery-0.024
 Wh-0.024
 chip-0.023
Token Go
Token ID1514
Feature activation-1.75e+00

Pos logit contributions
ggy+0.028
 surgery+0.026
rob+0.026
bes+0.026
ves+0.026
Neg logit contributions
ares-0.024
ills-0.024
ada-0.023
elling-0.023
ax-0.022
Token away
Token ID1497
Feature activation-1.27e+00

Pos logit contributions
ggy+0.052
 surgery+0.050
 Fl+0.049
 Wh+0.049
 chip+0.048
Neg logit contributions
ills-0.044
ares-0.040
ada-0.039
elling-0.038
hing-0.038
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
rob+0.037
ggy+0.036
 surgery+0.036
bes+0.035
 chip+0.034
Neg logit contributions
ills-0.032
ares-0.028
ly-0.027
ada-0.026
elling-0.026
Token Leave
Token ID17446
Feature activation-4.94e-01

Pos logit contributions
ggy+0.041
rob+0.039
ves+0.038
bes+0.038
 surgery+0.038
Neg logit contributions
ares-0.034
ills-0.033
elling-0.032
ada-0.032
hing-0.030
Token our
Token ID674
Feature activation-1.07e+00

Pos logit contributions
ggy+0.015
 surgery+0.014
 chip+0.014
 Fl+0.014
 Wh+0.013
Neg logit contributions
ills-0.012
ares-0.011
ada-0.011
elling-0.011
hing-0.011
Token hockey
Token ID12217
Feature activation-5.66e+00

Pos logit contributions
ggy+0.026
rob+0.025
 surgery+0.025
K+0.024
Fl+0.024
Neg logit contributions
ills-0.031
ares-0.027
arted-0.027
ory-0.027
elling-0.027
Token alone
Token ID3436
Feature activation-1.73e+00

Pos logit contributions
 surgery+0.151
 Wh+0.145
K+0.145
Pe+0.144
rob+0.143
Neg logit contributions
ills-0.138
ly-0.132
ares-0.130
ished-0.125
ory-0.122
Token."
Token ID526
Feature activation-1.67e+00

Pos logit contributions
ggy+0.046
 surgery+0.045
rob+0.045
 chip+0.043
 Wh+0.042
Neg logit contributions
ills-0.049
ares-0.044
hing-0.042
ly-0.042
elling-0.042
Token
Token ID198
Feature activation-1.19e+00

Pos logit contributions
ggy+0.042
rob+0.041
 surgery+0.038
 bidder+0.038
bes+0.038
Neg logit contributions
ills-0.042
 Jackie-0.041
ares-0.040
ada-0.040
lee-0.039
Token
Token ID198
Feature activation-1.11e+00

Pos logit contributions
 surgery+0.022
ggy+0.021
rob+0.021
 bidder+0.021
 clinic+0.020
Neg logit contributions
ills-0.040
ares-0.037
hing-0.037
ada-0.036
lee-0.035
TokenMax
Token ID11518
Feature activation-2.37e+00

Pos logit contributions
 bidder+0.027
 surgery+0.026
 clinic+0.026
rob+0.026
 Raven+0.026
Neg logit contributions
ills-0.028
hing-0.027
idges-0.026
 both-0.025
ada-0.024
Token Max
Token ID5436
Feature activation-2.20e+00

Pos logit contributions
ills+0.024
elling+0.021
lee+0.021
arted+0.020
ares+0.020
Neg logit contributions
ggy-0.015
 Fl-0.015
 Wh-0.015
 surgery-0.014
Spot-0.014
Token,
Token ID11
Feature activation+4.70e-01

Pos logit contributions
 surgery+0.058
ggy+0.057
rob+0.056
bes+0.055
 Maple+0.054
Neg logit contributions
ills-0.061
ada-0.055
ares-0.054
ax-0.053
elling-0.052
Token that
Token ID326
Feature activation-2.71e+00

Pos logit contributions
ills+0.013
elling+0.011
ares+0.011
ada+0.011
arted+0.011
Neg logit contributions
ggy-0.014
 Fl-0.013
 surgery-0.013
 Wh-0.013
 cookie-0.013
Token's
Token ID338
Feature activation-1.19e+00

Pos logit contributions
 surgery+0.059
ggy+0.059
elope+0.056
 Wh+0.056
bes+0.056
Neg logit contributions
ills-0.084
ares-0.078
hing-0.077
ax-0.074
elling-0.072
Token our
Token ID674
Feature activation-6.98e-01

Pos logit contributions
rob+0.033
 surgery+0.032
bes+0.031
ggy+0.031
 clinic+0.031
Neg logit contributions
ills-0.031
ares-0.030
ax-0.027
elling-0.026
arted-0.026
Token hockey
Token ID12217
Feature activation-5.25e+00

Pos logit contributions
ggy+0.016
 surgery+0.015
rob+0.015
Fl+0.014
 Wh+0.014
Neg logit contributions
ills-0.022
ares-0.020
ax-0.020
ada-0.020
elling-0.019
Token.
Token ID13
Feature activation-8.78e-01

Pos logit contributions
 surgery+0.133
 Wh+0.132
Pe+0.131
K+0.130
 Maple+0.123
Neg logit contributions
ills-0.142
ares-0.129
ly-0.128
hing-0.122
ory-0.122
Token Give
Token ID13786
Feature activation+3.65e-01

Pos logit contributions
ggy+0.022
 surgery+0.021
rob+0.020
bes+0.020
 chip+0.020
Neg logit contributions
ills-0.025
ares-0.025
ada-0.024
elling-0.023
hing-0.023
Token it
Token ID340
Feature activation-1.00e+00

Pos logit contributions
ills+0.009
ares+0.008
ada+0.008
hing+0.008
 contain+0.007
Neg logit contributions
ggy-0.011
 surgery-0.011
Spot-0.011
 Fl-0.011
 Wh-0.011
Token back
Token ID736
Feature activation-1.47e+00

Pos logit contributions
ggy+0.029
 surgery+0.028
 Wh+0.027
 Fl+0.027
Spot+0.027
Neg logit contributions
ills-0.026
ares-0.024
ada-0.023
hing-0.023
elling-0.022
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
ggy+0.038
 surgery+0.037
rob+0.036
 chip+0.036
bes+0.035
Neg logit contributions
ills-0.043
ares-0.038
elling-0.038
ada-0.037
ax-0.036
Token but
Token ID475
Feature activation-1.12e+00

Pos logit contributions
ills+0.055
ivated+0.052
hing+0.052
arted+0.052
ada+0.051
Neg logit contributions
ggy-0.033
 Wh-0.030
 Fl-0.030
 chip-0.030
 cookie-0.030
Token we
Token ID356
Feature activation+1.12e+00

Pos logit contributions
ggy+0.028
 surgery+0.026
rob+0.026
 Maple+0.025
bes+0.025
Neg logit contributions
ills-0.033
ares-0.031
ada-0.030
elling-0.029
ax-0.029
Token must
Token ID1276
Feature activation-7.03e-01

Pos logit contributions
ills+0.042
elling+0.041
arted+0.040
lee+0.039
ares+0.039
Neg logit contributions
 cookie-0.020
ggy-0.020
 surgery-0.020
 Wh-0.020
Spot-0.019
Token return
Token ID1441
Feature activation-1.45e+00

Pos logit contributions
ggy+0.020
 surgery+0.019
 Wh+0.018
 Fl+0.018
Spot+0.018
Neg logit contributions
ills-0.019
ares-0.018
elling-0.017
ada-0.017
arted-0.017
Token the
Token ID262
Feature activation-1.15e+00

Pos logit contributions
ggy+0.041
elope+0.038
bes+0.037
 surgery+0.037
rob+0.037
Neg logit contributions
ills-0.039
elling-0.035
ares-0.035
ax-0.034
arted-0.034
Token hockey
Token ID12217
Feature activation-5.16e+00

Pos logit contributions
ggy+0.022
elope+0.021
bes+0.020
 Wh+0.019
Fl+0.019
Neg logit contributions
ills-0.040
ares-0.038
ax-0.037
elling-0.036
hing-0.036
Token stick
Token ID4859
Feature activation-6.24e-01

Pos logit contributions
Pe+0.132
K+0.132
 surgery+0.130
 Wh+0.130
elope+0.128
Neg logit contributions
ills-0.128
ares-0.127
ly-0.126
ets-0.119
ory-0.119
Token if
Token ID611
Feature activation-1.16e+00

Pos logit contributions
ggy+0.016
 surgery+0.015
rob+0.015
 chip+0.015
 Wh+0.014
Neg logit contributions
ills-0.018
ares-0.017
ada-0.016
ax-0.016
elling-0.016
Token we
Token ID356
Feature activation+1.31e+00

Pos logit contributions
rob+0.037
tr+0.035
bes+0.035
elope+0.034
K+0.034
Neg logit contributions
ills-0.026
ares-0.022
idges-0.022
ly-0.022
ax-0.021
Token find
Token ID1064
Feature activation-1.86e+00

Pos logit contributions
ills+0.039
elling+0.036
lee+0.034
ares+0.034
arted+0.034
Neg logit contributions
 surgery-0.033
ggy-0.033
 Fl-0.033
 cookie-0.033
 Wh-0.033
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
ggy+0.044
 surgery+0.040
 chip+0.040
bes+0.039
 Wh+0.039
Neg logit contributions
ills-0.058
ares-0.054
elling-0.052
ada-0.051
ax-0.051
Token be
Token ID307
Feature activation-1.32e+00

Pos logit contributions
ggy+0.019
 surgery+0.018
 Fl+0.017
 Wh+0.017
 chip+0.017
Neg logit contributions
ills-0.017
ares-0.015
ada-0.015
elling-0.014
hing-0.014
Token with
Token ID351
Feature activation-7.54e-01

Pos logit contributions
ggy+0.035
 chip+0.034
rob+0.034
 surgery+0.034
 Fl+0.033
Neg logit contributions
ills-0.036
ares-0.033
ax-0.032
ada-0.032
elling-0.031
Token you
Token ID345
Feature activation+4.83e-02

Pos logit contributions
ggy+0.020
 surgery+0.020
 chip+0.019
 Wh+0.019
 Fl+0.019
Neg logit contributions
ills-0.021
ares-0.019
ada-0.019
elling-0.018
ax-0.018
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
ills+0.001
ares+0.001
ly+0.001
elling+0.001
ax+0.001
Neg logit contributions
rob-0.001
ggy-0.001
 surgery-0.001
 chip-0.001
bes-0.001
Token The
Token ID383
Feature activation-5.18e-01

Pos logit contributions
ggy+0.043
rob+0.041
ves+0.040
bes+0.038
 chip+0.038
Neg logit contributions
elling-0.039
ills-0.038
ares-0.036
ada-0.036
arted-0.034
Token operation
Token ID4905
Feature activation-5.09e+00

Pos logit contributions
ggy+0.011
rob+0.011
bes+0.010
elope+0.010
ves+0.010
Neg logit contributions
ills-0.016
elling-0.015
ada-0.015
hing-0.014
ares-0.014
Token is
Token ID318
Feature activation-1.50e+00

Pos logit contributions
Pe+0.144
rob+0.135
Br+0.132
 chip+0.132
Spot+0.128
Neg logit contributions
ills-0.121
ly-0.114
hing-0.112
ets-0.111
 want-0.111
Token not
Token ID407
Feature activation-5.48e-01

Pos logit contributions
rob+0.043
ggy+0.041
bes+0.040
 chip+0.038
 Maple+0.037
Neg logit contributions
ills-0.039
ares-0.035
elling-0.034
hing-0.033
idges-0.033
Token bad
Token ID2089
Feature activation+2.67e-01

Pos logit contributions
rob+0.016
ggy+0.016
bes+0.015
 chip+0.014
 surgery+0.014
Neg logit contributions
ills-0.014
ares-0.012
hing-0.012
elling-0.012
ly-0.012
Token.
Token ID13
Feature activation-6.51e-01

Pos logit contributions
ills+0.008
ares+0.008
ly+0.007
ada+0.007
elling+0.007
Neg logit contributions
ggy-0.007
rob-0.006
 surgery-0.006
Fl-0.006
bes-0.006
Token It
Token ID632
Feature activation-1.56e+00

Pos logit contributions
ggy+0.019
rob+0.018
bes+0.017
 surgery+0.017
 chip+0.017
Neg logit contributions
ills-0.017
ares-0.016
elling-0.016
ada-0.015
ax-0.014
Token a
Token ID257
Feature activation-1.85e+00

Pos logit contributions
rob+0.079
elope+0.077
bes+0.076
ggy+0.075
Fl+0.073
Neg logit contributions
ills-0.072
ax-0.062
 both-0.060
arted-0.060
're-0.059
Token little
Token ID1310
Feature activation-2.17e+00

Pos logit contributions
ggy+0.048
bes+0.047
rob+0.046
Fl+0.045
 Fl+0.045
Neg logit contributions
ills-0.050
ares-0.046
ada-0.046
hing-0.045
ax-0.045
Token accident
Token ID5778
Feature activation-6.90e-02

Pos logit contributions
ggy+0.058
bes+0.055
rob+0.054
Fl+0.053
 Wh+0.053
Neg logit contributions
ills-0.058
ares-0.057
ly-0.055
elling-0.054
hing-0.053
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
ggy+0.002
elope+0.002
 surgery+0.002
rob+0.002
Spot+0.002
Neg logit contributions
ills-0.002
ly-0.002
ares-0.002
hing-0.002
elling-0.002
Token Your
Token ID3406
Feature activation-8.19e-01

Pos logit contributions
ggy+0.052
ves+0.050
rob+0.049
bes+0.047
 cookie+0.045
Neg logit contributions
elling-0.040
ills-0.039
ares-0.038
ada-0.038
hing-0.034
Token motorcycle
Token ID18757
Feature activation-5.04e+00

Pos logit contributions
ggy+0.021
 Fl+0.019
 surgery+0.019
Fl+0.019
 Wh+0.019
Neg logit contributions
ills-0.024
ares-0.022
ada-0.021
elling-0.021
hing-0.021
Token is
Token ID318
Feature activation-1.29e+00

Pos logit contributions
ggy+0.109
 surgery+0.105
 Maple+0.103
Pe+0.102
rob+0.102
Neg logit contributions
ills-0.154
ares-0.145
hing-0.144
elling-0.142
ly-0.141
Token okay
Token ID8788
Feature activation-3.97e-01

Pos logit contributions
ggy+0.034
rob+0.034
 surgery+0.033
bes+0.032
 Maple+0.032
Neg logit contributions
ills-0.035
ares-0.032
elling-0.031
hing-0.030
ax-0.030
Token too
Token ID1165
Feature activation-7.30e-01

Pos logit contributions
rob+0.012
ggy+0.011
bes+0.011
 surgery+0.010
 chip+0.010
Neg logit contributions
ills-0.011
ares-0.009
ly-0.009
elling-0.009
ada-0.009
Token.
Token ID13
Feature activation-1.86e+00

Pos logit contributions
rob+0.018
ggy+0.018
 surgery+0.017
bes+0.017
Fl+0.017
Neg logit contributions
ills-0.022
ares-0.020
elling-0.019
ada-0.019
ly-0.019
Token It
Token ID632
Feature activation-1.92e+00

Pos logit contributions
ggy+0.057
rob+0.054
ves+0.053
bes+0.052
 surgery+0.050
Neg logit contributions
elling-0.042
ares-0.042
ills-0.042
ada-0.039
hing-0.038
Token "
Token ID366
Feature activation-7.09e-01

Pos logit contributions
rob+0.005
ggy+0.004
 surgery+0.004
bes+0.004
Spot+0.004
Neg logit contributions
ills-0.005
ares-0.005
ada-0.004
ax-0.004
lee-0.004
TokenL
Token ID43
Feature activation+1.10e+00

Pos logit contributions
ggy+0.020
 clinic+0.018
bes+0.018
ves+0.018
rob+0.018
Neg logit contributions
ills-0.017
ares-0.017
ax-0.017
hing-0.017
ada-0.016
Tokenily
Token ID813
Feature activation-2.24e+00

Pos logit contributions
ills+0.026
ares+0.023
ada+0.022
ax+0.022
elling+0.022
Neg logit contributions
ggy-0.035
 chip-0.034
 Fl-0.034
 surgery-0.034
 Maple-0.033
Token,
Token ID11
Feature activation-4.72e-01

Pos logit contributions
rob+0.060
bes+0.056
ggy+0.056
 surgery+0.055
 Maple+0.055
Neg logit contributions
ills-0.061
ada-0.055
ly-0.054
ares-0.053
ax-0.053
Token your
Token ID534
Feature activation-1.00e+00

Pos logit contributions
ggy+0.013
rob+0.012
 surgery+0.012
bes+0.012
 Maple+0.012
Neg logit contributions
ills-0.013
ares-0.012
ada-0.011
ax-0.011
hing-0.011
Token leg
Token ID1232
Feature activation-4.89e+00

Pos logit contributions
rob+0.022
ggy+0.021
 surgery+0.020
Fl+0.020
 Maple+0.020
Neg logit contributions
ills-0.031
ares-0.029
ada-0.029
ax-0.028
elling-0.028
Token is
Token ID318
Feature activation-1.93e+00

Pos logit contributions
rob+0.114
 Maple+0.107
ggy+0.106
 surgery+0.104
Pe+0.104
Neg logit contributions
ills-0.141
ares-0.132
ly-0.131
hing-0.129
elling-0.125
Token bad
Token ID2089
Feature activation-3.95e-01

Pos logit contributions
rob+0.058
ggy+0.054
bes+0.053
elope+0.052
 Maple+0.051
Neg logit contributions
ills-0.048
ares-0.043
elling-0.042
arted-0.040
hing-0.040
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
ggy+0.010
 surgery+0.010
rob+0.010
bes+0.009
Fl+0.009
Neg logit contributions
ills-0.012
ares-0.011
ly-0.010
ada-0.010
elling-0.010
Token We
Token ID775
Feature activation-6.88e-01

Pos logit contributions
rob+0.037
ggy+0.035
ves+0.035
bes+0.034
 clinic+0.032
Neg logit contributions
ares-0.026
elling-0.026
ills-0.024
ada-0.024
ly-0.022
Token need
Token ID761
Feature activation-2.14e+00

Pos logit contributions
ggy+0.018
 surgery+0.018
 Wh+0.017
 Fl+0.017
 cookie+0.017
Neg logit contributions
ills-0.020
ares-0.018
elling-0.018
arted-0.018
ada-0.018
Token hockey
Token ID12217
Feature activation-2.70e+00

Pos logit contributions
 chip+0.017
 surgery+0.017
 Raven+0.016
ggy+0.016
 Maple+0.016
Neg logit contributions
ills-0.018
hing-0.017
ares-0.016
ly-0.016
ada-0.015
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
 surgery+0.069
 Wh+0.066
 chip+0.065
 Raven+0.065
 Fl+0.064
Neg logit contributions
ills-0.075
ares-0.068
hing-0.067
ly-0.066
ax-0.064
Token Mia
Token ID32189
Feature activation-1.11e+00

Pos logit contributions
ggy+0.011
 surgery+0.010
rob+0.010
bes+0.010
 chip+0.009
Neg logit contributions
ills-0.016
ares-0.015
ax-0.015
ada-0.015
arted-0.014
Token brought
Token ID3181
Feature activation-1.23e+00

Pos logit contributions
ggy+0.028
 surgery+0.027
 Wh+0.026
 chip+0.026
 Fl+0.026
Neg logit contributions
ills-0.032
ares-0.030
ada-0.029
ax-0.028
elling-0.028
Token her
Token ID607
Feature activation-8.24e-01

Pos logit contributions
ggy+0.032
 surgery+0.030
 clinic+0.029
elope+0.029
 bidder+0.029
Neg logit contributions
ills-0.034
ares-0.032
ada-0.032
arted-0.031
ax-0.031
Token hockey
Token ID12217
Feature activation-4.82e+00

Pos logit contributions
ggy+0.018
 surgery+0.017
rob+0.017
elope+0.016
 Maple+0.016
Neg logit contributions
ills-0.026
ares-0.025
ax-0.024
ada-0.024
arted-0.023
Token stick
Token ID4859
Feature activation-6.18e-01

Pos logit contributions
 surgery+0.118
 Wh+0.115
K+0.111
ggy+0.111
Pe+0.110
Neg logit contributions
ills-0.128
ly-0.125
ares-0.122
 contain-0.118
ory-0.117
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
ggy+0.016
 surgery+0.015
 Fl+0.015
 Wh+0.015
 chip+0.015
Neg logit contributions
ills-0.018
ares-0.016
ada-0.016
elling-0.016
hing-0.016
Token John
Token ID1757
Feature activation-1.49e+00

Pos logit contributions
ggy+0.039
 surgery+0.038
 Maple+0.037
 clinic+0.036
Fl+0.036
Neg logit contributions
ills-0.044
ada-0.040
ares-0.040
ax-0.040
hing-0.039
Token brought
Token ID3181
Feature activation-7.55e-01

Pos logit contributions
ggy+0.037
 surgery+0.035
 Wh+0.033
 Fl+0.033
 chip+0.033
Neg logit contributions
ills-0.045
ares-0.042
ada-0.040
elling-0.040
hing-0.040
Token his
Token ID465
Feature activation-1.55e+00

Pos logit contributions
ggy+0.018
 surgery+0.017
 Fl+0.016
 Wh+0.016
 chip+0.016
Neg logit contributions
ills-0.024
ares-0.022
ada-0.021
elling-0.021
hing-0.021
Token,
Token ID11
Feature activation+5.60e-01

Pos logit contributions
ggy+0.018
bes+0.018
 surgery+0.017
Spot+0.017
Fl+0.017
Neg logit contributions
ills-0.017
ares-0.016
ax-0.015
ada-0.015
ly-0.015
Token Tim
Token ID5045
Feature activation-5.22e-01

Pos logit contributions
ills+0.020
ares+0.019
arted+0.019
hing+0.018
ada+0.018
Neg logit contributions
 Wh-0.011
 surgery-0.011
 Fl-0.011
ggy-0.011
 chip-0.011
Tokenmy
Token ID1820
Feature activation-3.00e+00

Pos logit contributions
ggy+0.019
 surgery+0.019
 Wh+0.018
 Fl+0.018
 chip+0.018
Neg logit contributions
ills-0.010
ares-0.008
ada-0.008
elling-0.008
hing-0.008
Token!
Token ID0
Feature activation-6.75e-01

Pos logit contributions
rob+0.097
 surgery+0.095
bes+0.095
ggy+0.095
 Maple+0.093
Neg logit contributions
ills-0.064
ly-0.055
ares-0.054
ada-0.054
ax-0.053
Token Your
Token ID3406
Feature activation-1.14e+00

Pos logit contributions
ggy+0.018
rob+0.016
 surgery+0.016
bes+0.016
ves+0.015
Neg logit contributions
ills-0.020
ares-0.019
ada-0.018
elling-0.018
ax-0.017
Token tire
Token ID15867
Feature activation-4.81e+00

Pos logit contributions
ggy+0.019
 Fl+0.018
 surgery+0.018
rob+0.018
 Maple+0.017
Neg logit contributions
ills-0.043
ares-0.039
ada-0.039
ax-0.038
elling-0.038
Token is
Token ID318
Feature activation-1.04e+00

Pos logit contributions
ggy+0.099
 surgery+0.095
 Maple+0.094
 Fl+0.094
 chip+0.092
Neg logit contributions
ills-0.156
ares-0.145
ly-0.142
elling-0.141
hing-0.140
Token flat
Token ID6228
Feature activation+2.55e-01

Pos logit contributions
ggy+0.027
rob+0.026
 surgery+0.025
 Fl+0.025
 chip+0.025
Neg logit contributions
ills-0.030
ares-0.027
elling-0.026
ada-0.026
hing-0.025
Token.
Token ID13
Feature activation-7.46e-01

Pos logit contributions
ills+0.007
ares+0.007
ada+0.007
elling+0.007
hing+0.006
Neg logit contributions
ggy-0.007
 surgery-0.006
 Fl-0.006
 Wh-0.006
 chip-0.006
Token We
Token ID775
Feature activation+7.84e-01

Pos logit contributions
ggy+0.020
 surgery+0.019
rob+0.018
bes+0.018
 chip+0.017
Neg logit contributions
ills-0.021
ares-0.020
elling-0.019
ada-0.019
hing-0.018
Token need
Token ID761
Feature activation-7.24e-01

Pos logit contributions
ills+0.027
arted+0.025
lee+0.024
elling+0.024
idges+0.024
Neg logit contributions
 Wh-0.017
 surgery-0.017
 ha-0.015
 kit-0.015
 cookie-0.015
Token you
Token ID345
Feature activation+4.90e-01

Pos logit contributions
ggy+0.006
rob+0.006
bes+0.006
 surgery+0.006
elope+0.006
Neg logit contributions
ills-0.004
elling-0.004
idges-0.003
ly-0.003
ax-0.003
Token very
Token ID845
Feature activation-1.19e+00

Pos logit contributions
ills+0.016
ares+0.014
ada+0.014
hing+0.014
elling+0.014
Neg logit contributions
ggy-0.011
 surgery-0.011
rob-0.010
 chip-0.010
 Wh-0.010
Token much
Token ID881
Feature activation-9.16e-01

Pos logit contributions
ggy+0.025
 surgery+0.025
rob+0.025
 Wh+0.024
 chip+0.024
Neg logit contributions
ills-0.039
ares-0.036
ly-0.035
elling-0.034
hing-0.034
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
ggy+0.026
 Wh+0.024
Fl+0.024
rob+0.024
bes+0.024
Neg logit contributions
ills-0.024
ly-0.023
ares-0.021
elling-0.021
ax-0.021
Token The
Token ID383
Feature activation-9.24e-01

Pos logit contributions
ggy+0.041
rob+0.038
ves+0.037
bes+0.036
 surgery+0.034
Neg logit contributions
ares-0.037
elling-0.036
ills-0.036
ada-0.035
hing-0.033
Token operation
Token ID4905
Feature activation-4.80e+00

Pos logit contributions
ggy+0.019
elope+0.019
bes+0.019
rob+0.018
 Fl+0.017
Neg logit contributions
ills-0.028
hing-0.027
ada-0.026
elling-0.026
ares-0.026
Token will
Token ID481
Feature activation-1.11e-01

Pos logit contributions
Pe+0.135
 Wh+0.123
Br+0.123
rob+0.122
 chip+0.121
Neg logit contributions
ills-0.114
hing-0.109
ly-0.109
elling-0.105
ets-0.104
Token be
Token ID307
Feature activation-1.04e+00

Pos logit contributions
ggy+0.003
 surgery+0.003
rob+0.003
 chip+0.003
 Fl+0.003
Neg logit contributions
ills-0.003
ares-0.003
ada-0.003
hing-0.003
elling-0.003
Token over
Token ID625
Feature activation-6.34e-01

Pos logit contributions
rob+0.032
ggy+0.031
bes+0.031
 chip+0.029
 clinic+0.029
Neg logit contributions
ills-0.025
ares-0.022
elling-0.021
hing-0.021
ax-0.021
Token soon
Token ID2582
Feature activation-5.41e-01

Pos logit contributions
ggy+0.017
rob+0.016
 surgery+0.016
bes+0.016
 chip+0.016
Neg logit contributions
ills-0.019
elling-0.016
ares-0.016
ada-0.016
ax-0.015
Token and
Token ID290
Feature activation-1.96e+00

Pos logit contributions
ggy+0.017
rob+0.016
Fl+0.016
 Wh+0.015
Spot+0.015
Neg logit contributions
ills-0.013
ly-0.012
ares-0.012
ax-0.011
elling-0.011
Token dad
Token ID9955
Feature activation-6.68e-01

Pos logit contributions
ggy+0.021
 surgery+0.020
K+0.020
rob+0.020
Fl+0.020
Neg logit contributions
ills-0.027
ares-0.025
ory-0.024
ax-0.024
ada-0.024
Token.
Token ID13
Feature activation-4.75e-01

Pos logit contributions
 surgery+0.019
ggy+0.019
rob+0.018
Pe+0.018
bes+0.017
Neg logit contributions
ills-0.017
ares-0.016
ly-0.016
ax-0.016
elling-0.015
Token I
Token ID314
Feature activation-1.06e+00

Pos logit contributions
ggy+0.013
 surgery+0.012
rob+0.012
bes+0.012
elope+0.011
Neg logit contributions
ills-0.013
ares-0.012
elling-0.012
ada-0.011
hing-0.011
Token have
Token ID423
Feature activation-2.38e+00

Pos logit contributions
ggy+0.039
 surgery+0.038
 Fl+0.037
 Wh+0.037
 chip+0.037
Neg logit contributions
ills-0.019
ares-0.016
ada-0.016
elling-0.015
arted-0.015
Token a
Token ID257
Feature activation-1.75e+00

Pos logit contributions
ggy+0.062
elope+0.060
rob+0.057
bes+0.055
 Raven+0.055
Neg logit contributions
ills-0.069
ax-0.060
lee-0.058
ares-0.057
elling-0.057
Token hockey
Token ID12217
Feature activation-4.72e+00

Pos logit contributions
bes+0.044
ggy+0.043
rob+0.043
elope+0.042
Fl+0.041
Neg logit contributions
ills-0.046
ares-0.044
ory-0.044
ada-0.042
ax-0.042
Token stick
Token ID4859
Feature activation+7.88e-02

Pos logit contributions
K+0.127
elope+0.126
 surgery+0.125
 Wh+0.124
Pe+0.122
Neg logit contributions
ills-0.112
ly-0.112
ory-0.108
ares-0.105
ish-0.102
Token and
Token ID290
Feature activation-2.07e+00

Pos logit contributions
ills+0.002
ares+0.002
ada+0.002
elling+0.002
ax+0.002
Neg logit contributions
ggy-0.002
 surgery-0.002
rob-0.002
 Wh-0.002
 Fl-0.002
Token a
Token ID257
Feature activation-1.89e+00

Pos logit contributions
ggy+0.054
elope+0.051
 surgery+0.049
rob+0.049
Fl+0.048
Neg logit contributions
ills-0.054
ares-0.051
ax-0.049
hing-0.048
ly-0.048
Token ball
Token ID2613
Feature activation-1.13e+00

Pos logit contributions
ggy+0.048
rob+0.047
elope+0.046
bes+0.046
 surgery+0.045
Neg logit contributions
ills-0.049
ares-0.047
ada-0.044
hing-0.044
ory-0.044
Token.
Token ID13
Feature activation-6.98e-01

Pos logit contributions
ggy+0.027
 surgery+0.026
rob+0.024
bes+0.024
 Wh+0.024
Neg logit contributions
ills-0.035
ares-0.033
ada-0.031
elling-0.031
hing-0.031
Token good
Token ID922
Feature activation-7.10e-01

Pos logit contributions
ggy+0.025
 surgery+0.025
bes+0.024
 Wh+0.023
rob+0.023
Neg logit contributions
ills-0.041
ada-0.038
ares-0.038
hing-0.038
ax-0.037
Token son
Token ID3367
Feature activation-1.41e+00

Pos logit contributions
ggy+0.016
 surgery+0.015
 Wh+0.015
 Fl+0.015
bes+0.015
Neg logit contributions
ills-0.023
ares-0.021
ada-0.021
ax-0.020
elling-0.020
Token and
Token ID290
Feature activation-1.59e+00

Pos logit contributions
 surgery+0.038
ggy+0.037
bes+0.037
rob+0.037
 Wh+0.036
Neg logit contributions
ills-0.036
ly-0.034
ax-0.033
ares-0.032
ada-0.031
Token a
Token ID257
Feature activation-1.63e+00

Pos logit contributions
ggy+0.042
 surgery+0.042
bes+0.041
 clinic+0.041
rob+0.041
Neg logit contributions
ills-0.040
ares-0.036
elling-0.036
ax-0.036
ada-0.035
Token good
Token ID922
Feature activation-1.25e+00

Pos logit contributions
bes+0.043
rob+0.042
ggy+0.040
 surgery+0.039
ves+0.038
Neg logit contributions
ills-0.041
ada-0.040
ory-0.039
 want-0.037
hing-0.037
Token hockey
Token ID12217
Feature activation-4.70e+00

Pos logit contributions
ggy+0.030
 surgery+0.029
bes+0.029
rob+0.028
Fl+0.028
Neg logit contributions
ills-0.038
ada-0.035
ares-0.035
ory-0.034
ax-0.034
Token player
Token ID2137
Feature activation-1.65e+00

Pos logit contributions
Pe+0.120
 surgery+0.119
 Wh+0.119
K+0.118
elope+0.112
Neg logit contributions
ly-0.119
ills-0.112
ory-0.109
 want-0.106
hing-0.106
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
ggy+0.041
 surgery+0.041
 Wh+0.040
rob+0.039
Pe+0.039
Neg logit contributions
ills-0.045
ly-0.042
ares-0.042
elling-0.040
ada-0.040
Token You
Token ID921
Feature activation-4.67e-01

Pos logit contributions
ggy+0.031
ves+0.029
 surgery+0.029
bes+0.029
rob+0.028
Neg logit contributions
ills-0.024
elling-0.024
ares-0.022
ada-0.022
ax-0.021
Token don
Token ID836
Feature activation+1.11e+00

Pos logit contributions
ggy+0.011
 Wh+0.011
 cookie+0.011
 Fl+0.011
 surgery+0.011
Neg logit contributions
ills-0.015
arted-0.014
ares-0.014
elling-0.013
lee-0.013
Token't
Token ID470
Feature activation+9.02e-01

Pos logit contributions
ills+0.037
ares+0.034
ada+0.033
elling+0.033
ax+0.033
Neg logit contributions
ggy-0.025
 surgery-0.023
 chip-0.022
 Wh-0.022
 Fl-0.022
Token hot
Token ID3024
Feature activation-7.36e-01

Pos logit contributions
ggy+0.005
 surgery+0.005
 Maple+0.004
 Wh+0.004
rob+0.004
Neg logit contributions
ills-0.007
ares-0.006
ada-0.006
elling-0.006
ax-0.006
Token cocoa
Token ID35845
Feature activation-1.81e+00

Pos logit contributions
ggy+0.016
 surgery+0.015
 Wh+0.015
 Fl+0.015
 chip+0.015
Neg logit contributions
ills-0.024
ares-0.022
ada-0.022
elling-0.022
hing-0.021
Token in
Token ID287
Feature activation-1.28e+00

Pos logit contributions
ggy+0.050
 surgery+0.049
 Maple+0.047
Pe+0.046
 clinic+0.046
Neg logit contributions
ills-0.047
ly-0.043
ada-0.042
hing-0.042
ax-0.041
Token their
Token ID511
Feature activation+3.72e-01

Pos logit contributions
ggy+0.034
rob+0.033
 surgery+0.031
Spot+0.030
bes+0.030
Neg logit contributions
ills-0.037
ada-0.033
lee-0.031
elling-0.031
 Jackie-0.031
Token bath
Token ID7837
Feature activation-1.60e+00

Pos logit contributions
ills+0.011
lee+0.010
elling+0.010
ada+0.010
ares+0.010
Neg logit contributions
ggy-0.009
rob-0.009
 surgery-0.009
 Maple-0.008
 Raven-0.008
Tokenro
Token ID305
Feature activation-4.67e+00

Pos logit contributions
 Raven+0.046
ggy+0.045
 Maple+0.044
 surgery+0.044
 bidder+0.043
Neg logit contributions
ills-0.044
lee-0.038
ada-0.038
ly-0.037
ory-0.036
Tokenbes
Token ID12636
Feature activation+9.61e-01

Pos logit contributions
 surgery+0.081
 Maple+0.070
 Wh+0.070
 Raven+0.070
 veterin+0.067
Neg logit contributions
ills-0.165
lee-0.164
ly-0.157
iggle-0.148
ets-0.142
Token.
Token ID13
Feature activation-4.68e-01

Pos logit contributions
ills+0.029
ares+0.027
ada+0.026
ax+0.026
ly+0.026
Neg logit contributions
ggy-0.024
 surgery-0.023
 Wh-0.021
rob-0.021
Fl-0.021
Token<|endoftext|>
Token ID50256
Feature activation-1.35e+00

Pos logit contributions
ggy+0.009
rob+0.009
 surgery+0.009
 chip+0.008
 clinic+0.008
Neg logit contributions
ills-0.016
elling-0.015
ares-0.015
ada-0.015
lee-0.015
TokenOnce
Token ID7454
Feature activation+9.18e-01

Pos logit contributions
 surgery+0.037
ggy+0.036
 chip+0.034
bes+0.034
 clinic+0.033
Neg logit contributions
ills-0.032
ax-0.031
ada-0.031
 him-0.030
hing-0.030
Token upon
Token ID2402
Feature activation-1.56e+00

Pos logit contributions
ills+0.018
ares+0.015
ada+0.015
elling+0.014
hing+0.014
Neg logit contributions
ggy-0.033
 surgery-0.032
 Fl-0.031
 Wh-0.031
 chip-0.031
Token room
Token ID2119
Feature activation-5.44e-01

Pos logit contributions
ggy+0.010
rob+0.010
 surgery+0.009
Fl+0.009
Spot+0.009
Neg logit contributions
ills-0.014
ares-0.013
elling-0.012
ada-0.012
hing-0.012
Token clean
Token ID3424
Feature activation-5.37e-01

Pos logit contributions
ggy+0.015
 surgery+0.015
 cookie+0.014
 Fl+0.014
 Wh+0.014
Neg logit contributions
ills-0.015
ares-0.014
ada-0.014
elling-0.014
arted-0.013
Token and
Token ID290
Feature activation-1.99e+00

Pos logit contributions
ggy+0.016
 surgery+0.014
Spot+0.014
 chip+0.014
 Maple+0.014
Neg logit contributions
ills-0.014
ly-0.012
hing-0.012
ares-0.012
elling-0.012
Token used
Token ID973
Feature activation-1.20e+00

Pos logit contributions
ggy+0.065
 Maple+0.060
 surgery+0.060
rob+0.060
 chip+0.060
Neg logit contributions
ills-0.044
hing-0.037
ares-0.037
elling-0.036
ly-0.036
Token his
Token ID465
Feature activation-1.32e+00

Pos logit contributions
ggy+0.030
 surgery+0.027
rob+0.026
 chip+0.026
Spot+0.026
Neg logit contributions
ills-0.037
ares-0.034
ada-0.033
hing-0.032
elling-0.032
Token muscle
Token ID8280
Feature activation-4.67e+00

Pos logit contributions
ggy+0.036
rob+0.035
Pe+0.033
K+0.033
Fl+0.033
Neg logit contributions
ills-0.034
elling-0.031
ares-0.030
ly-0.030
hing-0.030
Token to
Token ID284
Feature activation-6.79e-01

Pos logit contributions
ggy+0.130
rob+0.129
 bidder+0.124
Pe+0.124
 Raven+0.122
Neg logit contributions
ills-0.116
ly-0.108
hing-0.103
ares-0.098
ada-0.097
Token help
Token ID1037
Feature activation+9.81e-01

Pos logit contributions
ggy+0.023
 surgery+0.022
rob+0.021
 chip+0.021
Spot+0.021
Neg logit contributions
ills-0.015
ares-0.013
hing-0.012
ada-0.012
elling-0.012
Token his
Token ID465
Feature activation-6.32e-01

Pos logit contributions
ills+0.025
ares+0.024
elling+0.023
ada+0.022
arted+0.022
Neg logit contributions
ggy-0.029
 surgery-0.028
 Wh-0.027
 Fl-0.027
 chip-0.027
Token mom
Token ID1995
Feature activation-5.51e-02

Pos logit contributions
rob+0.018
Fl+0.016
ggy+0.016
K+0.016
Pe+0.015
Neg logit contributions
ills-0.017
lee-0.015
ly-0.015
 closely-0.015
ares-0.015
Token around
Token ID1088
Feature activation-1.98e-01

Pos logit contributions
ggy+0.001
 surgery+0.001
 Fl+0.001
 chip+0.001
 Wh+0.001
Neg logit contributions
ills-0.002
ares-0.002
ada-0.002
elling-0.001
ax-0.001
Token,
Token ID11
Feature activation-6.16e-01

Pos logit contributions
rob+0.024
 Maple+0.023
ggy+0.023
bes+0.023
 surgery+0.023
Neg logit contributions
ills-0.028
ares-0.028
ada-0.027
ly-0.026
ivated-0.025
Token "
Token ID366
Feature activation+4.77e-01

Pos logit contributions
ggy+0.015
rob+0.014
 surgery+0.014
bes+0.013
 Maple+0.013
Neg logit contributions
ills-0.019
ada-0.018
ares-0.018
lee-0.017
arted-0.017
TokenNo
Token ID2949
Feature activation-6.77e-01

Pos logit contributions
ills+0.017
ares+0.016
ada+0.015
elling+0.015
hing+0.015
Neg logit contributions
ggy-0.010
 surgery-0.009
 chip-0.009
 Wh-0.009
 Fl-0.009
Token,
Token ID11
Feature activation+2.87e-01

Pos logit contributions
rob+0.016
ggy+0.016
bes+0.016
Spot+0.016
Fl+0.015
Neg logit contributions
ills-0.021
ares-0.019
ada-0.019
ly-0.018
elling-0.018
Token my
Token ID616
Feature activation-8.80e-01

Pos logit contributions
ills+0.009
ares+0.008
ada+0.008
elling+0.008
ax+0.008
Neg logit contributions
ggy-0.007
 surgery-0.006
rob-0.006
 Maple-0.006
Spot-0.006
Token muscle
Token ID8280
Feature activation-4.66e+00

Pos logit contributions
ggy+0.019
rob+0.019
 surgery+0.019
 Maple+0.018
Spot+0.018
Neg logit contributions
ills-0.028
ares-0.025
ada-0.025
elling-0.024
ax-0.024
Token hurts
Token ID20406
Feature activation+1.85e-01

Pos logit contributions
rob+0.104
 surgery+0.102
Pe+0.102
 Maple+0.102
ggy+0.100
Neg logit contributions
ills-0.135
ly-0.128
ares-0.123
hing-0.123
ada-0.121
Token."
Token ID526
Feature activation-2.45e-01

Pos logit contributions
ills+0.005
ares+0.005
ada+0.005
hing+0.005
elling+0.005
Neg logit contributions
ggy-0.005
rob-0.005
 surgery-0.005
bes-0.004
 clinic-0.004
Token
Token ID198
Feature activation+3.04e-01

Pos logit contributions
ggy+0.006
rob+0.006
 surgery+0.006
 Maple+0.006
 chip+0.006
Neg logit contributions
ills-0.007
ada-0.007
lee-0.007
ares-0.007
arted-0.006
Token
Token ID198
Feature activation+9.57e-01

Pos logit contributions
ills+0.012
ares+0.011
ada+0.011
ax+0.011
lee+0.011
Neg logit contributions
 surgery-0.005
ggy-0.005
rob-0.004
 cookie-0.004
 chip-0.004
TokenThe
Token ID464
Feature activation-2.38e-01

Pos logit contributions
ills+0.029
ares+0.026
ada+0.026
hing+0.025
elling+0.025
Neg logit contributions
ggy-0.023
 surgery-0.023
rob-0.022
 Fl-0.022
 chip-0.022
Token They
Token ID1119
Feature activation+9.33e-01

Pos logit contributions
rob+0.008
ggy+0.007
bes+0.007
 kit+0.007
 surgery+0.007
Neg logit contributions
ills-0.005
elling-0.005
lee-0.005
arted-0.005
ada-0.005
Token say
Token ID910
Feature activation-2.23e-01

Pos logit contributions
ills+0.035
elling+0.032
arted+0.031
heart+0.030
idges+0.030
Neg logit contributions
 Fl-0.021
 Spot-0.020
 Wh-0.020
ggy-0.020
 Snow-0.018
Token,
Token ID11
Feature activation-3.36e-01

Pos logit contributions
rob+0.007
bes+0.006
elope+0.006
 bidder+0.006
Fl+0.006
Neg logit contributions
ares-0.005
ills-0.005
lee-0.005
ly-0.005
ada-0.005
Token "
Token ID366
Feature activation-1.54e+00

Pos logit contributions
rob+0.010
ggy+0.010
 surgery+0.009
bes+0.009
 bidder+0.009
Neg logit contributions
ills-0.008
lee-0.008
ares-0.007
ada-0.007
hing-0.007
TokenOur
Token ID5122
Feature activation-5.50e-01

Pos logit contributions
ggy+0.039
 clinic+0.039
 surgery+0.038
bes+0.037
 kit+0.037
Neg logit contributions
ills-0.041
hing-0.038
ivated-0.037
ada-0.037
ares-0.036
Token tower
Token ID10580
Feature activation-4.64e+00

Pos logit contributions
ggy+0.010
 cookie+0.010
 Wh+0.009
 surgery+0.009
 Fl+0.009
Neg logit contributions
ills-0.020
ares-0.019
ada-0.019
ax-0.019
hing-0.018
Token is
Token ID318
Feature activation-8.13e-01

Pos logit contributions
ggy+0.084
 surgery+0.082
 Wh+0.077
rob+0.077
 Fl+0.076
Neg logit contributions
ills-0.167
ares-0.158
elling-0.151
hing-0.150
ada-0.150
Token gone
Token ID3750
Feature activation-3.62e-01

Pos logit contributions
ggy+0.021
 surgery+0.020
 Fl+0.019
 chip+0.019
 Wh+0.019
Neg logit contributions
ills-0.024
ares-0.022
ada-0.021
elling-0.021
hing-0.020
Token!
Token ID0
Feature activation-1.55e+00

Pos logit contributions
ggy+0.010
 surgery+0.009
 Wh+0.009
 chip+0.009
 Fl+0.009
Neg logit contributions
ills-0.010
ares-0.009
ada-0.009
elling-0.009
ax-0.009
Token The
Token ID383
Feature activation-6.98e-01

Pos logit contributions
rob+0.044
ggy+0.042
bes+0.040
ves+0.039
 surgery+0.038
Neg logit contributions
ares-0.041
ills-0.040
elling-0.039
ada-0.039
lee-0.036
Token dog
Token ID3290
Feature activation-2.81e+00

Pos logit contributions
ggy+0.016
 surgery+0.015
 Fl+0.014
 Wh+0.014
 chip+0.014
Neg logit contributions
ills-0.023
ares-0.021
ada-0.021
elling-0.020
hing-0.020
Token "
Token ID366
Feature activation+2.87e-01

Pos logit contributions
ggy+0.010
rob+0.010
bes+0.010
 surgery+0.010
 clinic+0.009
Neg logit contributions
ills-0.013
ada-0.013
ares-0.013
ax-0.012
lee-0.012
TokenTim
Token ID14967
Feature activation-4.06e-01

Pos logit contributions
ills+0.009
ares+0.009
ada+0.008
elling+0.008
hing+0.008
Neg logit contributions
ggy-0.006
 surgery-0.006
 Fl-0.006
 Wh-0.006
 chip-0.006
Tokenmy
Token ID1820
Feature activation-2.35e+00

Pos logit contributions
ggy+0.014
 surgery+0.014
 Wh+0.014
 chip+0.014
 Fl+0.014
Neg logit contributions
ills-0.008
ares-0.007
ada-0.007
hing-0.006
elling-0.006
Token,
Token ID11
Feature activation-3.95e-01

Pos logit contributions
ggy+0.046
 surgery+0.043
 Wh+0.041
 Fl+0.041
 chip+0.041
Neg logit contributions
ills-0.084
ares-0.079
ada-0.076
elling-0.075
hing-0.075
Token your
Token ID534
Feature activation-1.26e+00

Pos logit contributions
ggy+0.009
 surgery+0.009
 Fl+0.009
 Wh+0.009
 chip+0.008
Neg logit contributions
ills-0.013
ares-0.012
arted-0.012
ada-0.012
elling-0.011
Token clock
Token ID8801
Feature activation-4.62e+00

Pos logit contributions
rob+0.029
ggy+0.027
Fl+0.027
Spot+0.027
bes+0.027
Neg logit contributions
ills-0.039
ares-0.035
elling-0.035
ada-0.034
ax-0.033
Token is
Token ID318
Feature activation-7.25e-01

Pos logit contributions
ggy+0.093
 surgery+0.089
Spot+0.087
rob+0.087
 Maple+0.086
Neg logit contributions
ills-0.156
ares-0.146
elling-0.141
hing-0.141
ada-0.139
Token not
Token ID407
Feature activation-1.52e-01

Pos logit contributions
ggy+0.019
 surgery+0.018
rob+0.018
Spot+0.017
 Fl+0.017
Neg logit contributions
ills-0.021
ares-0.019
elling-0.018
ada-0.018
hing-0.018
Token honest
Token ID5508
Feature activation+8.74e-01

Pos logit contributions
ggy+0.004
rob+0.004
 surgery+0.004
bes+0.004
Spot+0.004
Neg logit contributions
ills-0.004
ares-0.004
hing-0.004
ada-0.004
elling-0.004
Token.
Token ID13
Feature activation-5.30e-01

Pos logit contributions
ills+0.028
ares+0.026
ada+0.025
elling+0.024
ax+0.024
Neg logit contributions
ggy-0.021
 surgery-0.020
rob-0.019
 Wh-0.019
 Fl-0.019
Token It
Token ID632
Feature activation-1.64e+00

Pos logit contributions
ggy+0.015
 surgery+0.014
rob+0.013
Spot+0.013
 chip+0.013
Neg logit contributions
ills-0.014
ares-0.013
ada-0.013
elling-0.013
hing-0.013
Token are
Token ID389
Feature activation-1.06e+00

Pos logit contributions
ggy+0.010
 Wh+0.010
 Fl+0.010
 cookie+0.010
 surgery+0.010
Neg logit contributions
ills-0.013
arted-0.012
elling-0.012
ares-0.012
ada-0.011
Token right
Token ID826
Feature activation-3.22e-01

Pos logit contributions
ggy+0.025
 surgery+0.023
 Wh+0.023
 Fl+0.023
 chip+0.022
Neg logit contributions
ills-0.034
ares-0.031
ada-0.030
elling-0.030
hing-0.030
Token.
Token ID13
Feature activation-1.45e+00

Pos logit contributions
ggy+0.008
 surgery+0.008
rob+0.007
bes+0.007
Spot+0.007
Neg logit contributions
ills-0.010
ares-0.009
ada-0.009
ax-0.009
elling-0.009
Token The
Token ID383
Feature activation-3.39e-01

Pos logit contributions
ggy+0.045
rob+0.043
ves+0.042
bes+0.041
 surgery+0.041
Neg logit contributions
elling-0.034
ares-0.033
ills-0.033
ada-0.032
lee-0.029
Token new
Token ID649
Feature activation-2.03e+00

Pos logit contributions
ggy+0.007
elope+0.007
rob+0.007
 surgery+0.007
bes+0.007
Neg logit contributions
ills-0.011
ares-0.010
hing-0.010
ada-0.010
elling-0.010
Token hockey
Token ID12217
Feature activation-4.62e+00

Pos logit contributions
ggy+0.044
K+0.042
elope+0.042
Fl+0.041
 surgery+0.041
Neg logit contributions
ills-0.063
ares-0.058
ory-0.058
ly-0.056
ada-0.056
Token stick
Token ID4859
Feature activation-1.35e+00

Pos logit contributions
 Wh+0.111
K+0.110
Pe+0.107
 surgery+0.107
 Fl+0.103
Neg logit contributions
ills-0.128
ares-0.123
ly-0.121
hing-0.120
 want-0.113
Token is
Token ID318
Feature activation-1.05e+00

Pos logit contributions
ggy+0.025
 surgery+0.024
 Fl+0.023
 chip+0.022
 Wh+0.022
Neg logit contributions
ills-0.049
ares-0.046
ada-0.045
elling-0.044
ax-0.044
Token not
Token ID407
Feature activation+3.11e-01

Pos logit contributions
ggy+0.026
rob+0.025
 surgery+0.025
 Maple+0.024
bes+0.024
Neg logit contributions
ills-0.032
ares-0.029
elling-0.027
ax-0.027
ada-0.027
Token important
Token ID1593
Feature activation-2.04e-01

Pos logit contributions
ills+0.009
ares+0.008
ada+0.008
ax+0.008
elling+0.008
Neg logit contributions
ggy-0.008
 surgery-0.008
rob-0.008
bes-0.008
Spot-0.007
Token.
Token ID13
Feature activation-4.06e-01

Pos logit contributions
ggy+0.005
Fl+0.005
rob+0.005
Spot+0.005
elope+0.005
Neg logit contributions
ills-0.006
ares-0.005
ly-0.005
elling-0.005
ax-0.005
Token.
Token ID13
Feature activation-5.63e-01

Pos logit contributions
ggy+0.041
rob+0.040
 Wh+0.038
 surgery+0.038
Spot+0.037
Neg logit contributions
ills-0.045
ares-0.043
hing-0.041
ly-0.040
ada-0.040
Token
Token ID198
Feature activation-5.91e-01

Pos logit contributions
rob+0.018
ggy+0.018
 surgery+0.016
bes+0.016
elope+0.016
Neg logit contributions
ills-0.012
ares-0.012
lee-0.011
ada-0.011
 both-0.011
Token
Token ID198
Feature activation-2.18e-01

Pos logit contributions
ggy+0.009
 surgery+0.009
rob+0.009
 Maple+0.008
 Wh+0.008
Neg logit contributions
ills-0.023
ares-0.021
ada-0.021
elling-0.020
hing-0.020
Token"
Token ID1
Feature activation-1.26e+00

Pos logit contributions
 surgery+0.005
rob+0.005
ggy+0.005
 Raven+0.005
 Fl+0.005
Neg logit contributions
ills-0.006
ares-0.006
ada-0.006
hing-0.006
elling-0.005
TokenYour
Token ID7120
Feature activation-3.92e-01

Pos logit contributions
ggy+0.034
elope+0.033
 Maple+0.032
 Fl+0.032
 clinic+0.032
Neg logit contributions
ills-0.032
hing-0.030
ares-0.030
ada-0.030
elling-0.029
Token tower
Token ID10580
Feature activation-4.61e+00

Pos logit contributions
ggy+0.008
 surgery+0.008
 Wh+0.007
 chip+0.007
 Fl+0.007
Neg logit contributions
ills-0.013
ares-0.013
ada-0.012
elling-0.012
ax-0.012
Token was
Token ID373
Feature activation-1.10e+00

Pos logit contributions
ggy+0.088
 surgery+0.084
 Wh+0.081
 Fl+0.080
Spot+0.080
Neg logit contributions
ills-0.162
ares-0.153
elling-0.146
hing-0.146
ada-0.144
Token high
Token ID1029
Feature activation-6.59e-01

Pos logit contributions
ggy+0.028
 surgery+0.027
 Fl+0.026
 Wh+0.026
 chip+0.026
Neg logit contributions
ills-0.033
ares-0.030
ada-0.029
elling-0.029
hing-0.028
Token and
Token ID290
Feature activation-1.66e+00

Pos logit contributions
ggy+0.021
 surgery+0.020
Fl+0.019
 Wh+0.019
Spot+0.019
Neg logit contributions
ills-0.015
ly-0.015
elling-0.014
ares-0.014
hing-0.013
Token colorful
Token ID20239
Feature activation-3.79e-01

Pos logit contributions
ggy+0.047
 surgery+0.044
rob+0.043
bes+0.042
 Maple+0.042
Neg logit contributions
ills-0.044
ares-0.040
elling-0.039
hing-0.038
ada-0.038
Token!"
Token ID2474
Feature activation+4.27e-01

Pos logit contributions
ggy+0.010
 surgery+0.009
 Wh+0.009
Fl+0.009
Spot+0.009
Neg logit contributions
ills-0.011
ares-0.010
ly-0.010
hing-0.010
elling-0.010
Token stayed
Token ID9658
Feature activation-9.99e-01

Pos logit contributions
 chip+0.036
ggy+0.036
rob+0.036
bes+0.035
 Wh+0.034
Neg logit contributions
ills-0.033
ax-0.029
lee-0.028
arted-0.028
ares-0.028
Token together
Token ID1978
Feature activation+8.96e-01

Pos logit contributions
ggy+0.028
 surgery+0.027
 chip+0.027
rob+0.027
 Wh+0.026
Neg logit contributions
ills-0.026
ares-0.023
ada-0.023
elling-0.022
ax-0.022
Token!
Token ID0
Feature activation-1.47e+00

Pos logit contributions
ills+0.024
ares+0.021
ax+0.020
ada+0.020
hing+0.020
Neg logit contributions
ggy-0.025
 surgery-0.025
rob-0.024
 Wh-0.024
 chip-0.023
Token That
Token ID1320
Feature activation-1.10e+00

Pos logit contributions
ggy+0.041
rob+0.039
ves+0.038
bes+0.038
 surgery+0.037
Neg logit contributions
ares-0.038
ada-0.037
ax-0.036
 wanna-0.034
arted-0.034
Tokenâ
Token ID22940
Feature activation-8.09e-01

Pos logit contributions
ggy+0.025
bes+0.024
 Wh+0.024
 surgery+0.024
Fl+0.023
Neg logit contributions
ills-0.034
hing-0.031
elling-0.030
heart-0.030
ax-0.029
Token
Token ID26391
Feature activation-4.61e+00

Pos logit contributions
 Fl+0.020
ggy+0.020
 clinic+0.019
 Maple+0.019
 chip+0.019
Neg logit contributions
elling-0.023
ly-0.022
hing-0.022
ax-0.022
ills-0.022
Token
Token ID8151
Feature activation-1.23e+00

Pos logit contributions
 surgery+0.103
 chip+0.097
rob+0.093
Fl+0.092
tr+0.092
Neg logit contributions
ills-0.133
 wanna-0.129
ax-0.128
're-0.125
 want-0.120
Tokens
Token ID82
Feature activation+4.91e-03

Pos logit contributions
 surgery+0.035
bes+0.033
ggy+0.033
 clinic+0.032
rob+0.031
Neg logit contributions
ills-0.034
ax-0.031
ares-0.031
ada-0.031
arted-0.028
Token how
Token ID703
Feature activation-1.31e+00

Pos logit contributions
ills+0.000
ares+0.000
hing+0.000
elling+0.000
ax+0.000
Neg logit contributions
rob-0.000
 surgery-0.000
ggy-0.000
bes-0.000
 chip-0.000
Token people
Token ID661
Feature activation+6.20e-01

Pos logit contributions
rob+0.035
bes+0.035
ggy+0.035
 surgery+0.034
 clinic+0.034
Neg logit contributions
ills-0.035
ax-0.031
ada-0.031
ares-0.031
elling-0.030
Token help
Token ID1037
Feature activation+9.20e-01

Pos logit contributions
ills+0.018
ares+0.017
ada+0.017
arted+0.017
lee+0.016
Neg logit contributions
ggy-0.015
 cookie-0.014
 Fl-0.014
 Maple-0.014
 surgery-0.014
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
ggy+0.035
 surgery+0.033
rob+0.032
 chip+0.032
bes+0.031
Neg logit contributions
ills-0.049
ares-0.045
ada-0.044
ax-0.043
elling-0.043
Token Good
Token ID4599
Feature activation-1.59e+00

Pos logit contributions
ggy+0.053
ves+0.050
rob+0.048
bes+0.048
 surgery+0.047
Neg logit contributions
elling-0.039
ills-0.038
ada-0.036
ares-0.034
lee-0.033
Token nap
Token ID25422
Feature activation-1.19e+00

Pos logit contributions
ggy+0.039
bes+0.038
Fl+0.038
rob+0.038
ves+0.037
Neg logit contributions
ills-0.046
elling-0.044
ly-0.044
ares-0.043
ory-0.042
Token.
Token ID13
Feature activation-1.67e+00

Pos logit contributions
 surgery+0.032
rob+0.031
 Wh+0.031
Pe+0.030
 chip+0.030
Neg logit contributions
ills-0.034
ly-0.029
ares-0.029
elling-0.028
ada-0.028
Token Good
Token ID4599
Feature activation-1.93e+00

Pos logit contributions
ggy+0.050
ves+0.047
bes+0.045
 surgery+0.044
rob+0.044
Neg logit contributions
ills-0.040
elling-0.040
ada-0.039
ares-0.039
hing-0.037
Token hockey
Token ID12217
Feature activation-4.60e+00

Pos logit contributions
ggy+0.049
Fl+0.048
bes+0.047
ves+0.047
rob+0.047
Neg logit contributions
ills-0.053
ly-0.052
elling-0.052
ory-0.049
ares-0.049
Token.
Token ID13
Feature activation-1.76e+00

Pos logit contributions
Pe+0.127
 surgery+0.123
 Wh+0.122
K+0.121
Br+0.114
Neg logit contributions
ills-0.115
ly-0.107
hing-0.101
 want-0.100
ets-0.099
Token Good
Token ID4599
Feature activation-2.07e+00

Pos logit contributions
ggy+0.050
ves+0.047
bes+0.046
 surgery+0.044
rob+0.044
Neg logit contributions
ills-0.044
ares-0.044
elling-0.043
ada-0.043
hing-0.041
Token friends
Token ID2460
Feature activation-4.78e-01

Pos logit contributions
ggy+0.052
Fl+0.051
bes+0.050
rob+0.050
ves+0.050
Neg logit contributions
ills-0.058
ly-0.057
elling-0.056
ory-0.054
ares-0.053
Token."
Token ID526
Feature activation-8.60e-01

Pos logit contributions
ggy+0.012
 surgery+0.011
 chip+0.011
 Wh+0.011
rob+0.011
Neg logit contributions
ills-0.014
ares-0.013
ada-0.013
hing-0.013
ax-0.013
Token<|endoftext|>
Token ID50256
Feature activation-1.10e+00

Pos logit contributions
ggy+0.028
rob+0.026
ves+0.025
 surgery+0.024
bes+0.024
Neg logit contributions
ills-0.019
ares-0.018
lee-0.018
ada-0.018
elling-0.017
Token on
Token ID319
Feature activation-1.60e+00

Pos logit contributions
ggy+0.063
bes+0.059
 surgery+0.058
elope+0.056
 clinic+0.056
Neg logit contributions
ills-0.031
arted-0.029
 closely-0.028
hing-0.028
ares-0.028
Token his
Token ID465
Feature activation-5.74e-01

Pos logit contributions
ggy+0.045
rob+0.043
bes+0.042
 surgery+0.040
 clinic+0.040
Neg logit contributions
ills-0.041
hing-0.035
 him-0.035
 closely-0.035
 both-0.035
Token truck
Token ID7779
Feature activation-1.97e+00

Pos logit contributions
ggy+0.011
 surgery+0.011
rob+0.011
Fl+0.011
 Wh+0.011
Neg logit contributions
ills-0.019
lee-0.018
ares-0.018
ax-0.017
ada-0.017
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
ggy+0.054
rob+0.052
 surgery+0.052
Spot+0.050
 Velvet+0.050
Neg logit contributions
ills-0.051
ares-0.049
hing-0.047
ly-0.046
arted-0.045
Token He
Token ID679
Feature activation-6.62e-01

Pos logit contributions
ills+0.030
ares+0.030
ada+0.028
ax+0.027
arted+0.027
Neg logit contributions
ggy-0.029
rob-0.027
 surgery-0.026
bes-0.026
elope-0.025
Token showed
Token ID3751
Feature activation+2.85e+00

Pos logit contributions
 surgery+0.019
ggy+0.019
bes+0.018
 Wh+0.018
 Maple+0.018
Neg logit contributions
ills-0.016
ares-0.014
ax-0.014
ada-0.014
arted-0.014
Token them
Token ID606
Feature activation+1.51e-01

Pos logit contributions
ills+0.071
ares+0.063
ivated+0.063
hing+0.062
 contain+0.061
Neg logit contributions
 Fl-0.087
 Wh-0.086
ggy-0.084
 Spot-0.081
 Velvet-0.081
Token how
Token ID703
Feature activation-1.87e-01

Pos logit contributions
ills+0.005
ares+0.004
ada+0.004
elling+0.004
hing+0.004
Neg logit contributions
ggy-0.004
 Fl-0.004
 chip-0.003
 surgery-0.003
 Wh-0.003
Token to
Token ID284
Feature activation-3.26e-01

Pos logit contributions
ggy+0.004
 surgery+0.004
 Fl+0.004
 Wh+0.004
 chip+0.004
Neg logit contributions
ills-0.006
ares-0.005
ada-0.005
ax-0.005
elling-0.005
Token hold
Token ID1745
Feature activation-1.57e-01

Pos logit contributions
ggy+0.007
 surgery+0.006
 Fl+0.006
 Wh+0.006
 chip+0.006
Neg logit contributions
ills-0.011
ares-0.010
ada-0.010
elling-0.010
hing-0.010
Token the
Token ID262
Feature activation-7.79e-01

Pos logit contributions
ggy+0.004
 surgery+0.004
rob+0.004
bes+0.004
 Wh+0.004
Neg logit contributions
ills-0.005
ares-0.004
ada-0.004
ax-0.004
elling-0.004
Token a
Token ID257
Feature activation+3.82e-01

Pos logit contributions
ills+0.047
ares+0.043
ada+0.042
elling+0.042
hing+0.042
Neg logit contributions
ggy-0.026
 surgery-0.024
rob-0.023
 chip-0.023
Spot-0.023
Token voice
Token ID3809
Feature activation+3.11e+00

Pos logit contributions
ills+0.011
ares+0.010
ada+0.010
elling+0.009
ax+0.009
Neg logit contributions
ggy-0.010
 surgery-0.010
 Fl-0.009
 chip-0.009
 Wh-0.009
Token.
Token ID13
Feature activation+1.61e+00

Pos logit contributions
ills+0.103
ada+0.090
ares+0.089
elling+0.089
arted+0.088
Neg logit contributions
ggy-0.073
 surgery-0.067
 Fl-0.065
 Wh-0.064
 chip-0.064
Token It
Token ID632
Feature activation+1.57e+00

Pos logit contributions
ills+0.042
ares+0.039
ada+0.038
elling+0.037
ax+0.037
Neg logit contributions
ggy-0.046
 surgery-0.044
rob-0.043
bes-0.042
Spot-0.042
Token was
Token ID373
Feature activation+1.31e+00

Pos logit contributions
ills+0.047
ares+0.045
ada+0.043
hing+0.043
elling+0.042
Neg logit contributions
 surgery-0.036
ggy-0.036
 cookie-0.034
 chip-0.034
bes-0.034
Token an
Token ID281
Feature activation+3.81e+00

Pos logit contributions
ills+0.041
ares+0.037
ada+0.037
ax+0.036
elling+0.036
Neg logit contributions
ggy-0.030
 surgery-0.029
rob-0.029
bes-0.028
Spot-0.027
Token old
Token ID1468
Feature activation-9.65e-01

Pos logit contributions
ills+0.125
ares+0.116
idges+0.114
arted+0.112
oured+0.110
Neg logit contributions
ggy-0.085
 chip-0.079
 surgery-0.077
 Wh-0.075
 Fl-0.073
Token man
Token ID582
Feature activation+1.18e+00

Pos logit contributions
Fl+0.020
Spot+0.020
 surgery+0.020
bes+0.020
ggy+0.020
Neg logit contributions
ills-0.029
ada-0.028
ly-0.028
heart-0.027
ax-0.027
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
ills+0.036
ares+0.034
ada+0.033
ax+0.032
hing+0.032
Neg logit contributions
ggy-0.028
 surgery-0.027
 Fl-0.026
 Wh-0.026
 chip-0.026
Token He
Token ID679
Feature activation+9.71e-01

Pos logit contributions
ills+0.042
ares+0.040
ada+0.040
ax+0.039
lee+0.039
Neg logit contributions
ggy-0.037
 surgery-0.035
rob-0.034
bes-0.034
Spot-0.032
Token said
Token ID531
Feature activation+3.70e-01

Pos logit contributions
ills+0.028
ares+0.026
ada+0.025
elling+0.024
hing+0.024
Neg logit contributions
ggy-0.026
 surgery-0.024
 Fl-0.024
 Wh-0.023
 chip-0.023
Token said
Token ID531
Feature activation-2.33e-01

Pos logit contributions
ggy+0.031
 surgery+0.031
rob+0.030
 Wh+0.030
 Fl+0.030
Neg logit contributions
ills-0.027
ares-0.025
ly-0.024
ada-0.024
elling-0.023
Token,
Token ID11
Feature activation-1.66e-01

Pos logit contributions
ggy+0.005
rob+0.005
 surgery+0.005
Spot+0.005
 Maple+0.005
Neg logit contributions
ills-0.008
ares-0.007
ada-0.007
ly-0.007
hing-0.007
Token "
Token ID366
Feature activation+1.46e-02

Pos logit contributions
ggy+0.003
 surgery+0.003
rob+0.003
bes+0.003
Spot+0.003
Neg logit contributions
ills-0.006
ares-0.005
ada-0.005
arted-0.005
ax-0.005
TokenI
Token ID40
Feature activation-4.68e-01

Pos logit contributions
ills+0.000
ares+0.000
ada+0.000
elling+0.000
hing+0.000
Neg logit contributions
ggy-0.000
 surgery-0.000
 Fl-0.000
 Wh-0.000
 chip-0.000
Token will
Token ID481
Feature activation+4.64e-01

Pos logit contributions
ggy+0.012
 surgery+0.012
 Fl+0.011
 Wh+0.011
 cookie+0.011
Neg logit contributions
ills-0.014
arted-0.013
ares-0.013
elling-0.013
ada-0.013
Token search
Token ID2989
Feature activation+2.85e+00

Pos logit contributions
ills+0.015
arted+0.014
elling+0.014
ares+0.014
ax+0.014
Neg logit contributions
ggy-0.011
 surgery-0.011
 Fl-0.010
 Wh-0.010
 Maple-0.010
Token the
Token ID262
Feature activation-3.50e-02

Pos logit contributions
ills+0.087
ada+0.083
ares+0.082
hing+0.079
lee+0.078
Neg logit contributions
 Fl-0.067
 surgery-0.066
 Maple-0.064
ggy-0.064
 Wh-0.064
Token sky
Token ID6766
Feature activation+3.61e-01

Pos logit contributions
ggy+0.001
 surgery+0.001
 Fl+0.001
 Wh+0.001
 chip+0.001
Neg logit contributions
ills-0.001
ares-0.001
ada-0.001
elling-0.001
hing-0.001
Token for
Token ID329
Feature activation-3.26e-01

Pos logit contributions
ills+0.011
ares+0.010
ada+0.009
elling+0.009
hing+0.009
Neg logit contributions
ggy-0.009
 surgery-0.009
 Fl-0.008
 Wh-0.008
 chip-0.008
Token you
Token ID345
Feature activation+1.04e+00

Pos logit contributions
ggy+0.009
 surgery+0.008
 chip+0.008
rob+0.008
 Wh+0.008
Neg logit contributions
ills-0.009
ares-0.008
elling-0.008
ada-0.008
ax-0.008
Token."
Token ID526
Feature activation-8.60e-02

Pos logit contributions
ills+0.036
ares+0.034
ada+0.033
elling+0.033
hing+0.033
Neg logit contributions
ggy-0.021
 surgery-0.020
 Fl-0.020
 Maple-0.019
 Wh-0.019
Token It
Token ID632
Feature activation+1.04e+00

Pos logit contributions
ills+0.005
ares+0.004
ada+0.004
elling+0.004
ax+0.004
Neg logit contributions
ggy-0.004
 surgery-0.004
rob-0.004
Spot-0.004
 chip-0.004
Token was
Token ID373
Feature activation+1.18e+00

Pos logit contributions
ills+0.038
ares+0.036
ada+0.035
ax+0.034
elling+0.034
Neg logit contributions
ggy-0.020
 Fl-0.019
 surgery-0.018
 Wh-0.018
 chip-0.018
Token so
Token ID523
Feature activation+1.24e-01

Pos logit contributions
ills+0.034
ares+0.032
ada+0.031
elling+0.030
hing+0.030
Neg logit contributions
ggy-0.031
 surgery-0.029
 Fl-0.029
 Wh-0.028
 chip-0.028
Token pretty
Token ID2495
Feature activation+6.93e-01

Pos logit contributions
ares+0.004
ills+0.004
ada+0.003
hing+0.003
arted+0.003
Neg logit contributions
ggy-0.003
 Fl-0.003
 chip-0.003
 cookie-0.003
 Wednesday-0.003
Token that
Token ID326
Feature activation+3.66e-01

Pos logit contributions
ills+0.019
ares+0.017
ada+0.017
elling+0.016
hing+0.016
Neg logit contributions
ggy-0.019
 surgery-0.018
 Fl-0.018
 chip-0.018
 Wh-0.018
Token Sus
Token ID8932
Feature activation+3.43e+00

Pos logit contributions
ills+0.009
ares+0.008
ada+0.008
elling+0.008
hing+0.007
Neg logit contributions
ggy-0.011
 surgery-0.011
Spot-0.010
rob-0.010
bes-0.010
Tokenie
Token ID494
Feature activation+5.86e-01

Pos logit contributions
ares+0.062
 contain+0.061
ivated+0.059
hing+0.054
ills+0.054
Neg logit contributions
ggy-0.107
Fl-0.106
rob-0.105
 Fl-0.104
elope-0.102
Token wanted
Token ID2227
Feature activation+1.03e+00

Pos logit contributions
ills+0.018
ares+0.017
ada+0.016
elling+0.016
hing+0.016
Neg logit contributions
ggy-0.015
 Fl-0.013
 surgery-0.013
 chip-0.013
bes-0.013
Token to
Token ID284
Feature activation+1.34e+00

Pos logit contributions
ills+0.035
ares+0.034
hing+0.033
ada+0.033
elling+0.032
Neg logit contributions
ggy-0.021
 surgery-0.021
 Fl-0.020
 Wh-0.019
 chip-0.019
Token keep
Token ID1394
Feature activation-3.51e-01

Pos logit contributions
ills+0.041
arted+0.039
ares+0.039
elling+0.039
years+0.036
Neg logit contributions
ggy-0.036
 Fl-0.033
 cookie-0.031
 surgery-0.031
 ha-0.030
Token it
Token ID340
Feature activation+1.45e+00

Pos logit contributions
ggy+0.007
 Fl+0.007
 surgery+0.007
 Wh+0.007
 chip+0.006
Neg logit contributions
ills-0.012
ares-0.012
hing-0.011
ada-0.011
elling-0.011
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
ills+0.088
ares+0.087
ada+0.087
hing+0.086
elling+0.083
Neg logit contributions
ggy-0.043
 Fl-0.039
 Wh-0.037
 chip-0.036
 Maple-0.036
Token They
Token ID1119
Feature activation+2.81e+00

Pos logit contributions
ggy+0.008
 surgery+0.008
rob+0.007
 cookie+0.007
 chip+0.007
Neg logit contributions
ills-0.012
ares-0.011
ada-0.011
lee-0.011
arted-0.011
Token had
Token ID550
Feature activation-8.04e-01

Pos logit contributions
ares+0.109
elling+0.108
ills+0.101
ada+0.096
idget+0.095
Neg logit contributions
 Fl-0.046
 Snow-0.040
 k-0.039
 Maple-0.039
 Wh-0.038
Token never
Token ID1239
Feature activation+1.44e+00

Pos logit contributions
ggy+0.015
 surgery+0.014
 Fl+0.013
 Wh+0.013
 Maple+0.013
Neg logit contributions
ills-0.029
ares-0.029
ada-0.028
elling-0.027
hing-0.027
Token seen
Token ID1775
Feature activation+1.26e+00

Pos logit contributions
ills+0.033
ares+0.030
ada+0.029
elling+0.028
hing+0.027
Neg logit contributions
ggy-0.046
 surgery-0.044
 Fl-0.043
 Wh-0.043
 chip-0.043
Token anything
Token ID1997
Feature activation+3.00e+00

Pos logit contributions
ills+0.042
ares+0.040
ada+0.039
elling+0.038
hing+0.038
Neg logit contributions
ggy-0.027
 surgery-0.025
 Fl-0.025
 Wh-0.024
 chip-0.024
Token like
Token ID588
Feature activation+8.71e-01

Pos logit contributions
ills+0.068
ada+0.061
ares+0.061
elling+0.060
arted+0.058
Neg logit contributions
ggy-0.098
 Fl-0.094
 surgery-0.093
 cookie-0.090
 chip-0.090
Token it
Token ID340
Feature activation+3.41e+00

Pos logit contributions
ills+0.028
ares+0.027
ada+0.026
elling+0.025
hing+0.025
Neg logit contributions
ggy-0.019
 surgery-0.018
 Fl-0.018
 chip-0.017
 Wh-0.017
Token before
Token ID878
Feature activation+3.28e-01

Pos logit contributions
ills+0.111
ares+0.104
ada+0.102
elling+0.100
arted+0.099
Neg logit contributions
ggy-0.077
 surgery-0.072
 Fl-0.071
 Wh-0.069
 chip-0.068
Token!
Token ID0
Feature activation-1.33e-02

Pos logit contributions
ills+0.010
ares+0.009
ada+0.009
hing+0.009
elling+0.009
Neg logit contributions
ggy-0.008
 surgery-0.008
Spot-0.007
rob-0.007
 Wh-0.007
Token They
Token ID1119
Feature activation+2.74e+00

Pos logit contributions
ggy+0.000
 surgery+0.000
rob+0.000
bes+0.000
 cookie+0.000
Neg logit contributions
ills-0.000
ada-0.000
ares-0.000
lee-0.000
elling-0.000
Token careful
Token ID8161
Feature activation-8.08e-01

Pos logit contributions
rob+0.076
 surgery+0.075
ggy+0.073
 chip+0.071
 bidder+0.070
Neg logit contributions
ills-0.064
ax-0.054
ares-0.054
arted-0.053
ivated-0.052
Token.
Token ID13
Feature activation-9.79e-02

Pos logit contributions
 surgery+0.018
ggy+0.018
rob+0.017
 Wh+0.017
 Fl+0.017
Neg logit contributions
ills-0.026
ares-0.024
ada-0.023
elling-0.023
ax-0.023
Token Don
Token ID2094
Feature activation+6.47e-01

Pos logit contributions
ggy+0.003
rob+0.003
 surgery+0.002
bes+0.002
ves+0.002
Neg logit contributions
ills-0.003
ares-0.003
elling-0.003
ada-0.003
arted-0.002
Token't
Token ID470
Feature activation-2.01e-02

Pos logit contributions
ares+0.013
ills+0.013
hing+0.012
elling+0.012
idges+0.011
Neg logit contributions
rob-0.021
bes-0.020
 clinic-0.019
elope-0.019
 surgery-0.019
Token touch
Token ID3638
Feature activation+5.77e-01

Pos logit contributions
ggy+0.001
 surgery+0.001
 Fl+0.001
 Wh+0.001
 chip+0.001
Neg logit contributions
ills-0.000
ares-0.000
ada-0.000
elling-0.000
hing-0.000
Token the
Token ID262
Feature activation+1.01e-01

Pos logit contributions
ills+0.017
elling+0.015
ares+0.015
ada+0.014
hing+0.014
Neg logit contributions
ggy-0.016
 surgery-0.014
bes-0.014
rob-0.014
Spot-0.014
Token nest
Token ID16343
Feature activation+1.20e-01

Pos logit contributions
ills+0.003
ada+0.003
ares+0.003
elling+0.003
hing+0.003
Neg logit contributions
ggy-0.002
 surgery-0.002
rob-0.002
 Wh-0.002
Spot-0.002
Token or
Token ID393
Feature activation-3.31e-01

Pos logit contributions
ills+0.004
ares+0.003
elling+0.003
ada+0.003
ly+0.003
Neg logit contributions
ggy-0.003
 surgery-0.003
rob-0.003
 Wh-0.003
Spot-0.003
Token the
Token ID262
Feature activation-3.37e-01

Pos logit contributions
ggy+0.009
 surgery+0.009
 Wh+0.008
 chip+0.008
 Fl+0.008
Neg logit contributions
ills-0.009
ares-0.008
ada-0.008
elling-0.008
hing-0.008
Token eggs
Token ID9653
Feature activation-4.62e-01

Pos logit contributions
ggy+0.008
rob+0.008
 surgery+0.007
bes+0.007
 Wh+0.007
Neg logit contributions
ills-0.010
ada-0.009
elling-0.009
ares-0.009
ly-0.009
Token.
Token ID13
Feature activation-1.35e-03

Pos logit contributions
ggy+0.010
 surgery+0.010
 Fl+0.009
 Wh+0.009
 chip+0.009
Neg logit contributions
ills-0.015
ares-0.014
ada-0.014
elling-0.013
hing-0.013
Token full
Token ID1336
Feature activation+1.65e+00

Pos logit contributions
ggy+0.010
 surgery+0.010
 Fl+0.010
 chip+0.010
 Wh+0.010
Neg logit contributions
ills-0.011
ares-0.010
ada-0.010
elling-0.010
hing-0.009
Token of
Token ID286
Feature activation-1.03e+00

Pos logit contributions
ills+0.047
ares+0.044
ada+0.042
hing+0.042
elling+0.041
Neg logit contributions
ggy-0.043
 surgery-0.041
 chip-0.041
 Fl-0.041
 cookie-0.041
Token lettuce
Token ID39406
Feature activation-7.14e-01

Pos logit contributions
 cookie+0.024
 chip+0.023
 Fl+0.023
 surgery+0.022
ggy+0.022
Neg logit contributions
ares-0.033
ills-0.031
ada-0.031
idget-0.031
hing-0.030
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
ggy+0.016
 surgery+0.015
 chip+0.014
 Fl+0.014
 cookie+0.014
Neg logit contributions
ills-0.023
ares-0.022
ada-0.021
elling-0.021
hing-0.021
Token
Token ID198
Feature activation-3.82e-01

Pos logit contributions
rob+0.033
ggy+0.030
 surgery+0.029
 kit+0.027
ves+0.027
Neg logit contributions
ada-0.028
 Jackie-0.027
arted-0.027
lee-0.027
 curiously-0.027
Token
Token ID198
Feature activation+2.31e-01

Pos logit contributions
 surgery+0.007
rob+0.007
ggy+0.007
 chip+0.007
 kit+0.007
Neg logit contributions
ills-0.013
ada-0.012
ares-0.012
ax-0.012
 curiously-0.011
TokenNext
Token ID10019
Feature activation+7.82e-02

Pos logit contributions
ills+0.008
ada+0.007
ivated+0.007
ares+0.007
idges+0.007
Neg logit contributions
rob-0.005
 surgery-0.004
 kit-0.004
 Wh-0.004
 chip-0.004
Token,
Token ID11
Feature activation-6.57e-01

Pos logit contributions
ills+0.002
ly+0.002
ada+0.002
illy+0.002
ares+0.002
Neg logit contributions
bes-0.002
rob-0.002
tr-0.002
ggy-0.002
Fl-0.002
Token Lucy
Token ID22162
Feature activation+2.18e-01

Pos logit contributions
rob+0.019
 surgery+0.017
ggy+0.017
 clinic+0.016
bes+0.016
Neg logit contributions
 both-0.016
 curiously-0.016
 Jackie-0.016
ada-0.016
ills-0.016
Token's
Token ID338
Feature activation-8.72e-01

Pos logit contributions
ills+0.006
ares+0.006
elling+0.006
ada+0.006
hing+0.006
Neg logit contributions
ggy-0.006
 surgery-0.005
 Fl-0.005
 chip-0.005
 Wh-0.005
Token mom
Token ID1995
Feature activation+1.90e-01

Pos logit contributions
ggy+0.015
 surgery+0.014
 chip+0.014
 cookie+0.014
 Fl+0.013
Neg logit contributions
ills-0.033
ares-0.032
hing-0.031
ada-0.031
elling-0.030
Token They
Token ID1119
Feature activation+1.23e-01

Pos logit contributions
ggy+0.007
rob+0.006
 surgery+0.006
bes+0.006
 clinic+0.006
Neg logit contributions
ills-0.006
ada-0.006
ares-0.005
elling-0.005
lee-0.005
Token are
Token ID389
Feature activation+1.04e-01

Pos logit contributions
ills+0.003
ares+0.003
arted+0.003
elling+0.003
ada+0.003
Neg logit contributions
 cookie-0.003
 Fl-0.003
ggy-0.003
 chip-0.003
 surgery-0.003
Token y
Token ID331
Feature activation+1.55e+00

Pos logit contributions
ills+0.003
ares+0.003
ada+0.002
elling+0.002
ax+0.002
Neg logit contributions
ggy-0.003
 surgery-0.003
 Wh-0.003
rob-0.003
 Maple-0.003
Tokenummy
Token ID13513
Feature activation-1.53e+00

Pos logit contributions
ills+0.029
hing+0.029
ares+0.028
ivated+0.027
 contain+0.027
Neg logit contributions
ggy-0.051
 chip-0.050
 cookie-0.049
 surgery-0.048
 Fl-0.048
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
ggy+0.039
 surgery+0.038
 Wh+0.036
rob+0.036
Spot+0.036
Neg logit contributions
ills-0.045
ares-0.040
ax-0.039
ly-0.039
ada-0.039
Token They
Token ID1119
Feature activation+1.78e-01

Pos logit contributions
ggy+0.005
rob+0.005
 surgery+0.005
bes+0.005
 clinic+0.004
Neg logit contributions
ills-0.005
ada-0.005
ares-0.004
elling-0.004
lee-0.004
Token are
Token ID389
Feature activation+2.53e-01

Pos logit contributions
ills+0.005
ares+0.005
arted+0.005
elling+0.004
ada+0.004
Neg logit contributions
 cookie-0.005
 Fl-0.005
ggy-0.005
 chip-0.005
 Wh-0.005
Token happy
Token ID3772
Feature activation-6.87e-01

Pos logit contributions
ills+0.007
ares+0.006
ada+0.006
ax+0.006
elling+0.006
Neg logit contributions
ggy-0.007
 surgery-0.007
rob-0.007
 Maple-0.007
 Wh-0.006
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
ggy+0.018
 surgery+0.017
rob+0.017
Fl+0.016
Spot+0.016
Neg logit contributions
ills-0.020
ares-0.018
ada-0.017
elling-0.017
lee-0.017
Token They
Token ID1119
Feature activation+3.18e-01

Pos logit contributions
ggy+0.007
rob+0.006
 surgery+0.006
bes+0.006
ves+0.006
Neg logit contributions
ills-0.008
ada-0.007
elling-0.007
lee-0.007
ares-0.007
Token are
Token ID389
Feature activation+3.90e-01

Pos logit contributions
ills+0.009
ares+0.009
arted+0.008
elling+0.008
idges+0.008
Neg logit contributions
 Fl-0.009
 cookie-0.009
ggy-0.009
 Wh-0.008
 chip-0.008
Token are
Token ID389
Feature activation-7.36e-01

Pos logit contributions
ggy+0.002
 Wh+0.002
 ha+0.002
 Fl+0.002
 cookie+0.002
Neg logit contributions
ills-0.002
arted-0.002
elling-0.002
lee-0.002
ares-0.002
Token sorry
Token ID7926
Feature activation+9.94e-01

Pos logit contributions
ggy+0.019
 surgery+0.019
 chip+0.018
rob+0.018
bes+0.018
Neg logit contributions
ills-0.021
ares-0.019
ada-0.019
ax-0.018
elling-0.018
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
ills+0.036
ares+0.034
ada+0.033
hing+0.033
elling+0.033
Neg logit contributions
ggy-0.019
 Fl-0.017
 surgery-0.017
 Wh-0.017
 chip-0.017
Token But
Token ID887
Feature activation-1.73e+00

Pos logit contributions
ggy+0.044
rob+0.043
 surgery+0.041
bes+0.041
ves+0.040
Neg logit contributions
ares-0.044
elling-0.044
ills-0.044
ada-0.043
hing-0.040
Token you
Token ID345
Feature activation+5.80e-01

Pos logit contributions
ggy+0.037
 surgery+0.035
rob+0.034
bes+0.033
Spot+0.033
Neg logit contributions
ills-0.057
ares-0.053
ada-0.051
elling-0.051
hing-0.050
Token can
Token ID460
Feature activation+4.02e-01

Pos logit contributions
ills+0.022
arted+0.021
ada+0.020
ares+0.020
elling+0.020
Neg logit contributions
 Wh-0.011
ggy-0.010
 Fl-0.010
 cookie-0.010
 Maple-0.009
Token still
Token ID991
Feature activation-1.79e-01

Pos logit contributions
ills+0.015
elling+0.013
ares+0.013
arted+0.013
ada+0.013
Neg logit contributions
ggy-0.009
 Fl-0.008
 Wh-0.008
 surgery-0.008
Spot-0.007
Token play
Token ID711
Feature activation-2.76e-01

Pos logit contributions
ggy+0.005
 surgery+0.005
 chip+0.005
 Fl+0.005
 Wh+0.005
Neg logit contributions
ills-0.005
ares-0.004
ada-0.004
hing-0.004
elling-0.004
Token with
Token ID351
Feature activation+7.36e-01

Pos logit contributions
 chip+0.007
 surgery+0.007
ggy+0.007
elope+0.007
rob+0.007
Neg logit contributions
ills-0.007
ares-0.007
hing-0.007
ly-0.007
ada-0.006
Token us
Token ID514
Feature activation+1.03e+00

Pos logit contributions
ills+0.019
ares+0.018
ada+0.017
hing+0.017
ax+0.017
Neg logit contributions
ggy-0.021
 surgery-0.020
 Fl-0.020
 Wh-0.020
Spot-0.020
Token.
Token ID13
Feature activation-4.29e-01

Pos logit contributions
ills+0.033
ares+0.030
ada+0.029
elling+0.029
hing+0.029
Neg logit contributions
ggy-0.024
 surgery-0.023
 chip-0.022
rob-0.022
 Wh-0.022
Token of
Token ID286
Feature activation-4.82e-01

Pos logit contributions
ills+0.026
ares+0.025
hing+0.025
ada+0.024
elling+0.023
Neg logit contributions
ggy-0.025
 surgery-0.024
 Fl-0.024
 chip-0.024
 cookie-0.023
Token herself
Token ID5223
Feature activation+1.21e+00

Pos logit contributions
ggy+0.016
 surgery+0.015
 chip+0.015
rob+0.015
Spot+0.015
Neg logit contributions
ills-0.011
ares-0.009
ada-0.009
elling-0.009
arted-0.009
Token for
Token ID329
Feature activation-1.08e+00

Pos logit contributions
ills+0.027
ares+0.024
ada+0.023
ly+0.022
ax+0.022
Neg logit contributions
ggy-0.039
rob-0.038
 surgery-0.038
Spot-0.037
 Wh-0.037
Token following
Token ID1708
Feature activation-5.30e-02

Pos logit contributions
ggy+0.030
rob+0.030
Spot+0.029
 Maple+0.029
tr+0.028
Neg logit contributions
ills-0.028
ares-0.024
idges-0.024
ivated-0.024
lee-0.024
Token her
Token ID607
Feature activation-3.44e-01

Pos logit contributions
ggy+0.001
rob+0.001
 surgery+0.001
Spot+0.001
 chip+0.001
Neg logit contributions
ills-0.001
ada-0.001
ares-0.001
lee-0.001
ax-0.001
Token mom
Token ID1995
Feature activation+2.13e-01

Pos logit contributions
ggy+0.008
rob+0.008
 surgery+0.008
Spot+0.008
 Wh+0.008
Neg logit contributions
ills-0.010
ares-0.009
ada-0.009
lee-0.009
elling-0.009
Token's
Token ID338
Feature activation-1.36e+00

Pos logit contributions
ills+0.006
ada+0.005
ares+0.005
ax+0.005
elling+0.005
Neg logit contributions
ggy-0.006
rob-0.006
 surgery-0.006
Spot-0.006
 Maple-0.006
Token order
Token ID1502
Feature activation+2.05e-01

Pos logit contributions
ggy+0.038
rob+0.037
 Wh+0.036
 surgery+0.036
 Fl+0.036
Neg logit contributions
ills-0.036
ares-0.032
ada-0.032
elling-0.031
hing-0.031
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
ills+0.006
ares+0.006
ada+0.006
ax+0.005
elling+0.005
Neg logit contributions
ggy-0.005
 surgery-0.005
rob-0.005
 Wh-0.005
 Fl-0.005
Token gaining
Token ID13977
Feature activation-1.38e+00

Pos logit contributions
ggy+0.038
rob+0.037
Spot+0.036
 Maple+0.036
Fl+0.036
Neg logit contributions
ills-0.035
ada-0.030
lee-0.030
ares-0.030
arted-0.029
Token a
Token ID257
Feature activation-1.77e+00

Pos logit contributions
ggy+0.035
 surgery+0.032
rob+0.032
 Wh+0.032
Spot+0.032
Neg logit contributions
ills-0.041
ares-0.037
ada-0.037
elling-0.036
ax-0.036
Token arm
Token ID3211
Feature activation-2.05e+00

Pos logit contributions
rob+0.031
 Maple+0.029
ggy+0.028
 surgery+0.028
 Wh+0.028
Neg logit contributions
ills-0.026
ares-0.025
ly-0.025
ada-0.024
lee-0.023
Token.
Token ID13
Feature activation+2.20e-01

Pos logit contributions
rob+0.054
ggy+0.054
 clinic+0.052
 Maple+0.051
 surgery+0.051
Neg logit contributions
ills-0.053
ly-0.053
ares-0.051
hing-0.049
elling-0.048
Token Tim
Token ID5045
Feature activation+6.61e-01

Pos logit contributions
ills+0.006
ares+0.006
ada+0.006
lee+0.006
arted+0.006
Neg logit contributions
ggy-0.006
rob-0.005
 surgery-0.005
 Maple-0.005
 clinic-0.005
Tokenmy
Token ID1820
Feature activation-4.22e-01

Pos logit contributions
ills+0.016
ares+0.014
ivated+0.013
ada+0.013
 contain+0.013
Neg logit contributions
ggy-0.020
 Wh-0.020
 Fl-0.020
Spot-0.020
 chip-0.019
Token said
Token ID531
Feature activation-3.16e-01

Pos logit contributions
ggy+0.011
 surgery+0.011
rob+0.011
 Maple+0.011
 chip+0.011
Neg logit contributions
ills-0.011
ares-0.010
ada-0.010
ax-0.010
hing-0.010
Token,
Token ID11
Feature activation-4.64e-01

Pos logit contributions
rob+0.007
ggy+0.007
 surgery+0.007
bes+0.007
 Maple+0.007
Neg logit contributions
ills-0.010
ares-0.009
ada-0.009
ly-0.009
ax-0.009
Token "
Token ID366
Feature activation+5.11e-01

Pos logit contributions
ggy+0.011
rob+0.010
 surgery+0.010
bes+0.010
 clinic+0.010
Neg logit contributions
ills-0.014
ares-0.013
ada-0.013
lee-0.013
ax-0.013
TokenO
Token ID46
Feature activation-9.38e-01

Pos logit contributions
ills+0.018
ares+0.017
ada+0.017
elling+0.016
ax+0.016
Neg logit contributions
ggy-0.010
 surgery-0.009
Spot-0.009
 Wh-0.009
 chip-0.009
Tokenuch
Token ID794
Feature activation-7.97e-01

Pos logit contributions
ggy+0.020
 surgery+0.020
 Fl+0.020
 Maple+0.019
 Wh+0.019
Neg logit contributions
ills-0.031
ares-0.029
ada-0.028
elling-0.028
ax-0.028
Token!
Token ID0
Feature activation-4.91e-01

Pos logit contributions
ggy+0.021
 Fl+0.019
 chip+0.019
 Wh+0.019
 surgery+0.018
Neg logit contributions
ills-0.024
ares-0.022
ada-0.021
elling-0.021
arted-0.021
Token Why
Token ID4162
Feature activation-3.20e-01

Pos logit contributions
rob+0.013
ggy+0.013
 surgery+0.012
bes+0.012
ves+0.012
Neg logit contributions
ills-0.013
ares-0.013
ada-0.013
elling-0.013
lee-0.012
TokenI
Token ID40
Feature activation-1.79e-01

Pos logit contributions
 surgery+0.025
 clinic+0.024
bes+0.024
ggy+0.024
 kit+0.024
Neg logit contributions
ills-0.029
ax-0.027
ada-0.026
 wanna-0.025
ares-0.025
Token will
Token ID481
Feature activation+1.86e-01

Pos logit contributions
ggy+0.004
 surgery+0.004
 Fl+0.004
 Wh+0.004
 chip+0.004
Neg logit contributions
ills-0.006
ares-0.005
ada-0.005
elling-0.005
arted-0.005
Token give
Token ID1577
Feature activation+2.06e-01

Pos logit contributions
ills+0.005
ares+0.004
ada+0.004
elling+0.004
hing+0.004
Neg logit contributions
ggy-0.005
 surgery-0.005
 Fl-0.005
 Wh-0.005
 chip-0.005
Token you
Token ID345
Feature activation-1.14e-01

Pos logit contributions
ills+0.005
ares+0.005
ada+0.005
ax+0.005
elling+0.005
Neg logit contributions
ggy-0.006
 surgery-0.005
rob-0.005
bes-0.005
 clinic-0.005
Token some
Token ID617
Feature activation+9.14e-01

Pos logit contributions
ggy+0.002
 surgery+0.002
 Fl+0.002
 Wh+0.002
 chip+0.002
Neg logit contributions
ills-0.004
ares-0.004
ada-0.004
elling-0.004
hing-0.004
Token of
Token ID286
Feature activation-1.99e-01

Pos logit contributions
ills+0.029
ares+0.027
ada+0.026
hing+0.026
arted+0.026
Neg logit contributions
ggy-0.021
 surgery-0.020
 cookie-0.019
 Fl-0.019
 chip-0.019
Token my
Token ID616
Feature activation-1.41e-01

Pos logit contributions
ggy+0.005
 surgery+0.005
rob+0.005
 Maple+0.004
tr+0.004
Neg logit contributions
ills-0.006
ada-0.005
ares-0.005
ax-0.005
elling-0.005
Token milk
Token ID7545
Feature activation-1.15e+00

Pos logit contributions
ggy+0.003
rob+0.003
 Maple+0.003
 surgery+0.003
Spot+0.003
Neg logit contributions
ills-0.004
ares-0.004
elling-0.004
ada-0.004
ly-0.004
Token if
Token ID611
Feature activation-1.04e+00

Pos logit contributions
ggy+0.029
 surgery+0.027
 Wh+0.026
rob+0.026
 Maple+0.026
Neg logit contributions
ills-0.034
ares-0.031
ada-0.031
elling-0.030
ax-0.030
Token you
Token ID345
Feature activation+2.14e+00

Pos logit contributions
rob+0.034
bes+0.032
tr+0.032
ggy+0.031
Spot+0.030
Neg logit contributions
ills-0.023
ly-0.020
ax-0.019
ada-0.019
elling-0.019
Token bat
Token ID7365
Feature activation+2.37e-01

Pos logit contributions
ills+0.055
arted+0.049
ares+0.048
elling+0.048
lee+0.046
Neg logit contributions
ggy-0.062
 cookie-0.061
 Fl-0.061
 surgery-0.060
 Wh-0.058
Token.
Token ID13
Feature activation-6.17e-01

Pos logit contributions
ggy+0.034
 surgery+0.033
rob+0.032
 Wh+0.032
 chip+0.032
Neg logit contributions
ills-0.040
ares-0.038
hing-0.036
ada-0.036
ax-0.035
Token He
Token ID679
Feature activation-4.27e-01

Pos logit contributions
rob+0.020
ggy+0.019
ves+0.018
 surgery+0.018
bes+0.017
Neg logit contributions
ares-0.013
ills-0.013
ada-0.012
lee-0.012
ax-0.012
Token says
Token ID1139
Feature activation-2.89e-01

Pos logit contributions
ggy+0.010
 surgery+0.009
 Fl+0.009
 Wh+0.009
 Maple+0.009
Neg logit contributions
ills-0.014
ares-0.013
ada-0.013
elling-0.012
hing-0.012
Token he
Token ID339
Feature activation-1.07e+00

Pos logit contributions
rob+0.009
bes+0.008
elope+0.008
 clinic+0.008
tr+0.008
Neg logit contributions
ills-0.007
lee-0.007
ly-0.007
ares-0.006
ax-0.006
Token was
Token ID373
Feature activation-1.04e+00

Pos logit contributions
ggy+0.025
 surgery+0.024
 Fl+0.023
 Wh+0.023
 chip+0.022
Neg logit contributions
ills-0.034
ares-0.032
ada-0.031
elling-0.030
hing-0.030
Token playing
Token ID2712
Feature activation-8.60e-01

Pos logit contributions
ggy+0.027
 surgery+0.026
 chip+0.026
rob+0.026
Spot+0.025
Neg logit contributions
ills-0.029
ares-0.027
ada-0.026
ax-0.026
elling-0.025
Token with
Token ID351
Feature activation-7.90e-01

Pos logit contributions
rob+0.021
ggy+0.021
 chip+0.020
 surgery+0.020
elope+0.019
Neg logit contributions
ills-0.024
ares-0.022
hing-0.022
ly-0.022
ax-0.021
Token it
Token ID340
Feature activation+7.68e-01

Pos logit contributions
ggy+0.019
 surgery+0.018
 chip+0.018
rob+0.018
Spot+0.017
Neg logit contributions
ills-0.024
ares-0.021
elling-0.021
ada-0.021
ax-0.021
Token outside
Token ID2354
Feature activation+1.04e-01

Pos logit contributions
ills+0.022
ares+0.020
hing+0.019
ly+0.019
ax+0.019
Neg logit contributions
ggy-0.020
 surgery-0.019
rob-0.019
 chip-0.019
 Wh-0.018
Token and
Token ID290
Feature activation-1.27e+00

Pos logit contributions
ills+0.003
ly+0.002
ares+0.002
hing+0.002
ax+0.002
Neg logit contributions
ggy-0.003
rob-0.003
 surgery-0.003
 chip-0.003
bes-0.003
Token a
Token ID257
Feature activation-1.33e+00

Pos logit contributions
ggy+0.034
 surgery+0.032
 chip+0.032
Spot+0.031
rob+0.031
Neg logit contributions
ills-0.035
ares-0.032
arted-0.031
hing-0.031
elling-0.030
Token,
Token ID11
Feature activation+9.68e-02

Pos logit contributions
ills+0.026
ares+0.024
ada+0.023
elling+0.023
hing+0.022
Neg logit contributions
ggy-0.022
 surgery-0.020
 Fl-0.020
 chip-0.020
 Wh-0.020
Token feeling
Token ID4203
Feature activation+7.92e-02

Pos logit contributions
ills+0.003
ares+0.002
ada+0.002
ax+0.002
hing+0.002
Neg logit contributions
ggy-0.003
 surgery-0.003
 Wh-0.003
 chip-0.003
 Fl-0.003
Token much
Token ID881
Feature activation-7.38e-01

Pos logit contributions
ills+0.002
ares+0.002
ada+0.002
hing+0.002
elling+0.002
Neg logit contributions
ggy-0.002
 surgery-0.002
 Wh-0.002
 chip-0.002
rob-0.002
Token better
Token ID1365
Feature activation-1.01e+00

Pos logit contributions
ggy+0.015
 surgery+0.015
 Fl+0.014
 chip+0.014
 cookie+0.014
Neg logit contributions
ills-0.025
ares-0.023
ada-0.023
hing-0.022
elling-0.022
Token than
Token ID621
Feature activation+9.29e-01

Pos logit contributions
ggy+0.028
 surgery+0.027
 Wh+0.027
rob+0.027
Fl+0.026
Neg logit contributions
ills-0.027
ly-0.025
ares-0.024
ax-0.024
ada-0.023
Token before
Token ID878
Feature activation-4.90e-01

Pos logit contributions
ills+0.026
ares+0.023
ax+0.023
ada+0.023
arted+0.022
Neg logit contributions
ggy-0.026
 surgery-0.024
rob-0.024
Spot-0.024
bes-0.023
Token.
Token ID13
Feature activation-4.30e-02

Pos logit contributions
ggy+0.013
rob+0.013
tr+0.013
Spot+0.013
Fl+0.013
Neg logit contributions
ills-0.014
lee-0.012
ares-0.012
ada-0.011
arted-0.011
Token<|endoftext|>
Token ID50256
Feature activation-8.28e-01

Pos logit contributions
ggy+0.001
 surgery+0.001
rob+0.001
ves+0.001
bes+0.001
Neg logit contributions
ada-0.001
ills-0.001
lee-0.001
arted-0.001
ares-0.001
TokenOnce
Token ID7454
Feature activation+1.18e+00

Pos logit contributions
ggy+0.021
 surgery+0.021
 chip+0.020
bes+0.019
 clinic+0.018
Neg logit contributions
ills-0.023
ada-0.022
hing-0.022
ax-0.021
ares-0.020
Token upon
Token ID2402
Feature activation-1.35e+00

Pos logit contributions
ills+0.022
ares+0.019
hing+0.018
elling+0.018
ada+0.018
Neg logit contributions
ggy-0.043
 surgery-0.041
 Fl-0.041
 Wh-0.040
 chip-0.040
Token a
Token ID257
Feature activation-1.66e+00

Pos logit contributions
 surgery+0.030
 chip+0.029
ggy+0.029
rob+0.029
 Wh+0.028
Neg logit contributions
ills-0.041
ares-0.037
ax-0.037
hing-0.037
ada-0.036
Token gasped
Token ID45236
Feature activation+1.25e+00

Pos logit contributions
ggy+0.015
 surgery+0.014
 chip+0.014
 Wh+0.014
 Fl+0.013
Neg logit contributions
ills-0.019
ares-0.018
ada-0.017
elling-0.017
hing-0.017
Token.
Token ID13
Feature activation-5.54e-01

Pos logit contributions
ills+0.042
ares+0.039
ada+0.039
ax+0.038
elling+0.038
Neg logit contributions
ggy-0.026
 Fl-0.025
 Wh-0.024
 surgery-0.023
 chip-0.023
Token She
Token ID1375
Feature activation+4.99e-01

Pos logit contributions
ggy+0.020
rob+0.019
 surgery+0.018
ves+0.018
bes+0.017
Neg logit contributions
ills-0.010
ares-0.010
ada-0.010
lee-0.010
arted-0.010
Token tried
Token ID3088
Feature activation-1.09e+00

Pos logit contributions
ills+0.017
ares+0.016
ada+0.015
elling+0.015
hing+0.015
Neg logit contributions
ggy-0.012
 Fl-0.011
 Wh-0.010
Spot-0.010
 surgery-0.010
Token to
Token ID284
Feature activation-6.13e-02

Pos logit contributions
ggy+0.032
rob+0.032
bes+0.030
elope+0.030
Fl+0.029
Neg logit contributions
ills-0.027
ares-0.024
ax-0.023
arted-0.023
ada-0.023
Token fix
Token ID4259
Feature activation-6.71e-01

Pos logit contributions
ggy+0.002
 Fl+0.002
 Wh+0.002
Spot+0.002
 surgery+0.002
Neg logit contributions
ills-0.002
elling-0.002
ares-0.002
years-0.002
lee-0.002
Token the
Token ID262
Feature activation-1.83e-01

Pos logit contributions
ggy+0.018
 surgery+0.017
tr+0.017
rob+0.016
 clinic+0.016
Neg logit contributions
ills-0.019
ares-0.017
ada-0.016
elling-0.016
ax-0.016
Token cord
Token ID15050
Feature activation-1.35e+00

Pos logit contributions
ggy+0.004
 surgery+0.004
rob+0.004
 Fl+0.004
 Maple+0.004
Neg logit contributions
ills-0.006
ares-0.005
ada-0.005
elling-0.005
hing-0.005
Token,
Token ID11
Feature activation+5.85e-01

Pos logit contributions
ggy+0.034
rob+0.033
 surgery+0.032
Spot+0.031
 Maple+0.031
Neg logit contributions
ills-0.039
ares-0.037
ada-0.035
ly-0.034
hing-0.034
Token but
Token ID475
Feature activation-4.56e-01

Pos logit contributions
ills+0.015
ares+0.014
ada+0.013
elling+0.013
ax+0.013
Neg logit contributions
ggy-0.017
 surgery-0.016
 chip-0.016
 Fl-0.016
 Wh-0.016
Token it
Token ID340
Feature activation-4.24e-01

Pos logit contributions
ggy+0.012
rob+0.011
 surgery+0.011
 chip+0.011
bes+0.011
Neg logit contributions
ills-0.013
ares-0.012
ada-0.012
ax-0.011
elling-0.011
Token't
Token ID470
Feature activation+3.39e-01

Pos logit contributions
ills+0.029
ares+0.027
ada+0.026
elling+0.026
hing+0.026
Neg logit contributions
ggy-0.023
 surgery-0.022
 Fl-0.021
 Wh-0.021
 chip-0.021
Token your
Token ID534
Feature activation+3.10e-01

Pos logit contributions
ills+0.010
ax+0.009
ada+0.009
ares+0.008
hing+0.008
Neg logit contributions
ggy-0.009
bes-0.009
 surgery-0.009
 clinic-0.008
 Fl-0.008
Token coat
Token ID13209
Feature activation+2.67e-01

Pos logit contributions
ills+0.008
ares+0.008
ada+0.007
ax+0.007
elling+0.007
Neg logit contributions
rob-0.008
ggy-0.008
Fl-0.008
Spot-0.008
 surgery-0.008
Token.
Token ID13
Feature activation-2.99e-02

Pos logit contributions
ills+0.008
ares+0.007
ada+0.007
hing+0.007
ax+0.007
Neg logit contributions
ggy-0.007
 surgery-0.006
Spot-0.006
 Fl-0.006
 Maple-0.006
Tokenâ
Token ID22940
Feature activation-7.35e-02

Pos logit contributions
ggy+0.001
rob+0.001
 surgery+0.001
bes+0.001
ves+0.001
Neg logit contributions
ills-0.001
ares-0.001
ada-0.001
ax-0.001
elling-0.001
Token
Token ID26391
Feature activation-3.40e+00

Pos logit contributions
 Fl+0.002
ggy+0.002
 surgery+0.002
 cookie+0.002
 Maple+0.002
Neg logit contributions
ills-0.002
hing-0.002
elling-0.002
ax-0.002
ares-0.002
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
 surgery+0.076
 chip+0.067
 cookie+0.066
rob+0.066
tr+0.065
Neg logit contributions
ills-0.107
ax-0.104
 wanna-0.099
're-0.094
ada-0.094
Token
Token ID198
Feature activation+1.69e-01

Pos logit contributions
 surgery+0.003
 cookie+0.003
 clinic+0.003
 kit+0.003
tr+0.003
Neg logit contributions
ills-0.005
ax-0.005
ares-0.005
ada-0.005
idges-0.005
TokenJ
Token ID41
Feature activation+2.94e+00

Pos logit contributions
ills+0.005
ax+0.005
ada+0.005
ares+0.005
hing+0.005
Neg logit contributions
 surgery-0.004
 cookie-0.004
tr-0.004
ggy-0.004
 clinic-0.003
Tokenenna
Token ID13713
Feature activation+6.90e-01

Pos logit contributions
ills+0.051
ares+0.044
hing+0.042
ada+0.042
elling+0.042
Neg logit contributions
ggy-0.109
 surgery-0.104
 Wh-0.102
 Fl-0.102
 chip-0.101
Token was
Token ID373
Feature activation+3.38e-01

Pos logit contributions
ills+0.019
ares+0.017
ada+0.017
ax+0.016
hing+0.016
Neg logit contributions
ggy-0.019
 surgery-0.019
 Wh-0.018
 chip-0.018
 Fl-0.018
Token even
Token ID772
Feature activation+1.51e+00

Pos logit contributions
ills+0.020
ares+0.017
elling+0.017
arted+0.015
idges+0.015
Neg logit contributions
ggy-0.049
 surgery-0.047
 Wh-0.046
 Fl-0.046
Spot-0.045
Token like
Token ID588
Feature activation+3.02e-01

Pos logit contributions
ills+0.035
ares+0.031
ada+0.030
elling+0.029
hing+0.029
Neg logit contributions
ggy-0.049
 surgery-0.047
 Fl-0.045
 chip-0.045
 Wh-0.045
Token you
Token ID345
Feature activation+1.31e+00

Pos logit contributions
ills+0.009
ax+0.008
ares+0.008
ada+0.007
elling+0.007
Neg logit contributions
ggy-0.008
 surgery-0.008
rob-0.008
bes-0.007
Spot-0.007
Token.
Token ID13
Feature activation-1.32e+00

Pos logit contributions
ills+0.038
ares+0.034
ada+0.033
elling+0.033
hing+0.032
Neg logit contributions
ggy-0.035
 surgery-0.033
 Fl-0.032
 Wh-0.032
 chip-0.032
Tokenâ
Token ID22940
Feature activation-4.59e-01

Pos logit contributions
ggy+0.038
 surgery+0.035
rob+0.035
ves+0.035
bes+0.034
Neg logit contributions
ills-0.034
ares-0.033
ada-0.033
ax-0.032
elling-0.031
Token
Token ID26391
Feature activation-3.81e+00

Pos logit contributions
ggy+0.010
 Fl+0.009
 surgery+0.009
 Wh+0.009
 Maple+0.009
Neg logit contributions
ills-0.015
ares-0.014
hing-0.014
elling-0.014
ax-0.014
Token
Token ID220
Feature activation-4.56e-01

Pos logit contributions
 surgery+0.089
tr+0.076
 chip+0.076
 veterin+0.075
rob+0.075
Neg logit contributions
ills-0.119
ax-0.114
 wanna-0.107
lee-0.106
ody-0.103
Token
Token ID198
Feature activation-4.23e-01

Pos logit contributions
ggy+0.009
 surgery+0.009
rob+0.008
 clinic+0.008
 cookie+0.008
Neg logit contributions
ills-0.017
ares-0.016
ax-0.016
ada-0.015
elling-0.015
Token
Token ID198
Feature activation+4.73e-01

Pos logit contributions
 surgery+0.008
ggy+0.007
rob+0.007
 kit+0.007
 cookie+0.007
Neg logit contributions
ills-0.015
ax-0.014
ares-0.014
ada-0.013
lee-0.013
TokenSam
Token ID16305
Feature activation+1.50e-02

Pos logit contributions
ills+0.015
ax+0.014
ares+0.013
ada+0.013
idges+0.013
Neg logit contributions
 surgery-0.010
ggy-0.010
rob-0.010
bes-0.010
 Fl-0.010
Tokenâ
Token ID22940
Feature activation-2.69e-01

Pos logit contributions
ills+0.000
ax+0.000
ada+0.000
ares+0.000
lee+0.000
Neg logit contributions
ggy-0.000
 surgery-0.000
rob-0.000
 chip-0.000
 Wh-0.000
Token very
Token ID845
Feature activation-4.00e-01

Pos logit contributions
 surgery+0.020
 Wh+0.019
 Fl+0.019
 chip+0.019
ggy+0.018
Neg logit contributions
ills-0.027
ares-0.024
ada-0.024
elling-0.024
hing-0.024
Token tired
Token ID10032
Feature activation+1.26e+00

Pos logit contributions
ggy+0.009
 surgery+0.009
 chip+0.009
 Fl+0.008
 cookie+0.008
Neg logit contributions
ills-0.013
ada-0.012
hing-0.012
ares-0.011
elling-0.011
Token.
Token ID13
Feature activation+9.54e-01

Pos logit contributions
ills+0.036
ares+0.033
ada+0.032
ax+0.031
elling+0.031
Neg logit contributions
ggy-0.034
 surgery-0.032
bes-0.031
rob-0.031
 Wh-0.030
Token
Token ID6184
Feature activation-2.56e-01

Pos logit contributions
ills+0.027
ares+0.025
arted+0.025
ada+0.024
elling+0.024
Neg logit contributions
ggy-0.026
rob-0.024
 surgery-0.023
bes-0.023
Spot-0.023
Token
Token ID95
Feature activation-1.23e+00

Pos logit contributions
 Fl+0.006
 chip+0.005
rob+0.005
 surgery+0.005
ggy+0.005
Neg logit contributions
ills-0.008
ares-0.007
elling-0.007
hing-0.007
ivated-0.007
Token
Token ID26391
Feature activation-3.71e+00

Pos logit contributions
 Fl+0.029
 chip+0.028
rob+0.028
 Maple+0.027
 cookie+0.027
Neg logit contributions
ills-0.034
elling-0.032
hing-0.031
ares-0.031
eting-0.029
Token
Token ID129
Feature activation-1.75e-01

Pos logit contributions
 surgery+0.076
rob+0.072
 Fl+0.071
 chip+0.069
 cookie+0.067
Neg logit contributions
ills-0.121
hing-0.104
lee-0.104
idges-0.104
're-0.103
Token
Token ID241
Feature activation-1.40e+00

Pos logit contributions
ggy+0.005
 surgery+0.005
 Fl+0.005
 chip+0.005
 Wh+0.005
Neg logit contributions
ills-0.005
ares-0.004
ada-0.004
elling-0.004
ax-0.004
TokenNo
Token ID2949
Feature activation-5.17e-01

Pos logit contributions
 kit+0.033
 clinic+0.033
bes+0.033
rob+0.032
 cookie+0.032
Neg logit contributions
ills-0.039
ax-0.037
 wanna-0.037
 want-0.035
ada-0.034
Token more
Token ID517
Feature activation+4.61e-02

Pos logit contributions
bes+0.013
Spot+0.013
Fl+0.012
rob+0.012
tr+0.012
Neg logit contributions
ills-0.015
ly-0.014
ares-0.014
heart-0.013
ada-0.013
Token bubbles
Token ID25037
Feature activation-3.77e-01

Pos logit contributions
ills+0.001
ares+0.001
lee+0.001
ada+0.001
elling+0.001
Neg logit contributions
rob-0.001
 surgery-0.001
ggy-0.001
 Wh-0.001
Spot-0.001
Token girl
Token ID2576
Feature activation+1.35e-01

Pos logit contributions
ggy+0.028
 surgery+0.026
bes+0.026
rob+0.026
ves+0.025
Neg logit contributions
ills-0.029
ada-0.028
ly-0.027
heart-0.027
elling-0.026
Token asked
Token ID1965
Feature activation+1.02e+00

Pos logit contributions
ills+0.004
ares+0.003
ada+0.003
ax+0.003
elling+0.003
Neg logit contributions
ggy-0.004
 surgery-0.004
 chip-0.003
 Wh-0.003
rob-0.003
Token,
Token ID11
Feature activation+3.49e-01

Pos logit contributions
ills+0.026
ada+0.024
ax+0.023
ly+0.023
ares+0.023
Neg logit contributions
ggy-0.030
rob-0.029
bes-0.029
 surgery-0.028
Spot-0.028
Token
Token ID6184
Feature activation+5.24e-01

Pos logit contributions
ills+0.010
ada+0.010
ares+0.010
ax+0.009
arted+0.009
Neg logit contributions
ggy-0.009
rob-0.009
bes-0.008
 surgery-0.008
Spot-0.008
Token
Token ID95
Feature activation-6.06e-01

Pos logit contributions
ills+0.015
ares+0.014
ada+0.014
elling+0.013
hing+0.013
Neg logit contributions
ggy-0.013
 surgery-0.013
 Fl-0.012
 Wh-0.012
 chip-0.012
Token
Token ID26391
Feature activation-3.18e+00

Pos logit contributions
ggy+0.015
 surgery+0.015
 Fl+0.015
 cookie+0.015
rob+0.015
Neg logit contributions
ills-0.017
ada-0.015
elling-0.015
ax-0.015
ares-0.015
Token˜
Token ID41185
Feature activation-2.05e-01

Pos logit contributions
 surgery+0.067
rob+0.064
 cookie+0.057
 chip+0.057
ggy+0.055
Neg logit contributions
ills-0.108
lee-0.100
ax-0.098
ada-0.096
ody-0.094
TokenWhat
Token ID2061
Feature activation-1.06e+00

Pos logit contributions
 surgery+0.005
ggy+0.005
rob+0.005
Spot+0.004
 Maple+0.004
Neg logit contributions
ills-0.006
ada-0.006
ares-0.006
ax-0.006
hing-0.006
Token can
Token ID460
Feature activation+7.83e-01

Pos logit contributions
ggy+0.029
 surgery+0.027
 Fl+0.027
 Wh+0.026
 chip+0.026
Neg logit contributions
ills-0.029
ares-0.027
ada-0.026
elling-0.026
hing-0.025
Token I
Token ID314
Feature activation+1.09e+00

Pos logit contributions
ills+0.023
ares+0.021
ada+0.021
ax+0.020
lee+0.020
Neg logit contributions
 surgery-0.018
rob-0.018
ggy-0.018
Spot-0.018
 clinic-0.018
Token do
Token ID466
Feature activation+4.73e-01

Pos logit contributions
ills+0.029
ares+0.026
elling+0.026
ada+0.026
arted+0.026
Neg logit contributions
ggy-0.032
 surgery-0.030
 Fl-0.029
 Wh-0.029
 cookie-0.029
Token
Token ID26391
Feature activation-3.34e+00

Pos logit contributions
ggy+0.009
 Fl+0.009
 surgery+0.009
 cookie+0.009
 chip+0.009
Neg logit contributions
ills-0.012
ada-0.011
elling-0.011
hing-0.011
ares-0.011
Token
Token ID129
Feature activation+3.02e-01

Pos logit contributions
 surgery+0.066
rob+0.062
ggy+0.062
 chip+0.060
 cookie+0.059
Neg logit contributions
ills-0.117
lee-0.104
ada-0.102
ax-0.101
ares-0.101
Token
Token ID241
Feature activation-1.04e+00

Pos logit contributions
ills+0.008
ares+0.007
ada+0.007
elling+0.006
hing+0.006
Neg logit contributions
ggy-0.009
 surgery-0.009
 Wh-0.008
 Fl-0.008
 chip-0.008
TokenLet
Token ID5756
Feature activation+1.08e+00

Pos logit contributions
ggy+0.026
 surgery+0.026
 clinic+0.025
bes+0.025
 cookie+0.025
Neg logit contributions
ills-0.030
ada-0.027
ax-0.027
ares-0.026
 wanna-0.026
Tokenâ
Token ID22940
Feature activation+3.88e-01

Pos logit contributions
ills+0.033
ares+0.031
ada+0.030
elling+0.029
ax+0.029
Neg logit contributions
ggy-0.026
 surgery-0.025
 Fl-0.024
 Wh-0.024
 chip-0.024
Token
Token ID26391
Feature activation-3.02e+00

Pos logit contributions
ills+0.012
hing+0.012
ares+0.012
ada+0.012
ax+0.011
Neg logit contributions
ggy-0.008
 Fl-0.008
 Maple-0.008
 surgery-0.008
 chip-0.008
Token
Token ID8151
Feature activation+2.68e-01

Pos logit contributions
 surgery+0.053
rob+0.050
 chip+0.047
 Raven+0.046
Fl+0.045
Neg logit contributions
ills-0.108
ax-0.101
 wanna-0.101
ada-0.100
're-0.100
Tokens
Token ID82
Feature activation-1.53e-01

Pos logit contributions
ills+0.007
ares+0.007
ada+0.007
ax+0.007
arted+0.006
Neg logit contributions
ggy-0.007
 surgery-0.007
bes-0.007
rob-0.007
 Maple-0.007
Token go
Token ID467
Feature activation-5.49e-01

Pos logit contributions
ggy+0.004
 surgery+0.004
 Fl+0.004
 Wh+0.004
 ha+0.004
Neg logit contributions
ills-0.004
ares-0.004
elling-0.004
arted-0.004
years-0.004
Token inside
Token ID2641
Feature activation+1.09e+00

Pos logit contributions
ggy+0.014
 surgery+0.013
 Fl+0.012
 Wh+0.012
 cookie+0.012
Neg logit contributions
ills-0.017
ares-0.015
elling-0.015
hing-0.015
arted-0.015
Token now
Token ID783
Feature activation-2.93e-01

Pos logit contributions
ills+0.031
ares+0.028
ada+0.028
ax+0.027
elling+0.027
Neg logit contributions
ggy-0.029
 surgery-0.028
 Wh-0.026
rob-0.026
Spot-0.026