TOP ACTIVATIONS
MAX = 0.177

He was here a minute ago . He was chasing the
our country a long time ago . He helped many people
born only a few days ago . The deer felt sorry
are from a long time ago , before we were born
lived here a long time ago ." " Can
story . A long time ago , there was an adventurous
they lived a long time ago . Lily listened carefully and
people did a long time ago . They lived in castles
Daisy had planted it weeks ago , but now she could
were from a long time ago and they help us remember
is from a long time ago , from when dinosaurs lived
he had seen a year ago . He went home ,
lost it a long time ago , and you found it
Max swapped their bubble w ands and blew more bubbles .
a ship had wrecked long ago , and now it was
found the truck many years ago and worked hard to fix
But it was so long ago that everything was a bit
it away a long time ago . It is not good
the park a long time ago and gave me the key
had been made many years ago , but it still looked

MIN ACTIVATIONS
MIN = -0.219

try new things . Bro ccoli is not that bad .
Mom says , " Bro ccoli is good for you .
. " Bro ccoli is not our friend ,
. " Bro ccoli is what we have for
Oh , Mike ! Bro ccoli is a vegetable ! It
ruined the broccoli ! Bro ccoli is good for you .
to eat broccoli . Bro ccoli was green and bitter and
and stay well . Bro ccoli is your friend , not
happy . Everyone had enough beef to eat .
. " Bro ccoli is y ucky !" Ben
mom . He saw some beef there . He wanted to
" More beef please ," Little Jack asked
, Betsy ! I love beef ." Betsy and Lily became
. He enjoyed the soft beef and finished all the pieces
He noticed a piece of beef on the ground , it
! She ate all the beef and felt very full .
So , he took his beef and shared it with everyone
garden , and picked the ju ic iest carrots he could
rewarded with the biggest , ju ic iest cher ries !
. " Bro ccoli is not y ucky ,

INTERVAL 0.078 - 0.177
CONTAINS 2.054%

On the way back to school , they saw a p
for dessert ." L ily p outed and pushed
there was a little girl named Lily who loved to play
him . But Benny didn 't want carrots , he wanted
" Come on , don 't be a baby !" Tim

INTERVAL -0.021 - 0.078
CONTAINS 75.887%

Lily felt happy . She said , " I love you
on top of it and looked at their shadows . They
, but eventually she managed to copy the signs down .
friends , Amy and Sarah . They liked to play together
, " E w , this toy is disgusting . I

INTERVAL -0.120 - -0.021
CONTAINS 21.727%

mom came back to check on him . She saw that
The mole was happy because he sold all his vegetables and
. " It is a trout . Do you want to
said Lily . " My ball doesn 't taste nice at
. They are sorry for making a mess . They are

INTERVAL -0.219 - -0.120
CONTAINS 0.331%

having fun . They picked apples until they were tired and
When they returned , the turkey had been able to laugh
's useful , but the best things in life are free
and the rabbit were the best of friends . <|endoftext|> Once
. It tastes sweet and juicy . Do you want some
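
The four INTERVAL headings above appear to split the range from MIN (-0.219) to MAX (0.177) into equal-width bins, with the CONTAINS figures giving the share of activations in each bin. A minimal sketch to check this reading follows. The equal-width assumption and all names (`bin_edges`, `CONTAINS_PERCENT`) are illustrative and not part of the dashboard; only the numbers are copied from it.

```python
# Values copied from the dashboard above. Equal-width binning is an assumption.
MIN_ACT = -0.219
MAX_ACT = 0.177
N_BINS = 4

# Percentages listed under each INTERVAL heading, highest interval first.
CONTAINS_PERCENT = [2.054, 75.887, 21.727, 0.331]


def bin_edges(lo, hi, n):
    """Return n + 1 equally spaced edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n
    return [round(lo + i * width, 3) for i in range(n + 1)]


edges = bin_edges(MIN_ACT, MAX_ACT, N_BINS)
total_percent = round(sum(CONTAINS_PERCENT), 3)

print(edges)          # compare against the INTERVAL boundaries listed above
print(total_percent)  # should be close to 100 if the bins cover every activation
```

Under these assumptions the computed edges match the listed boundaries (-0.219, -0.120, -0.021, 0.078, 0.177). The percentages sum to just under 100, which is consistent with rounding in the displayed figures.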
Token He
Token ID679
Feature activation-2.14e-02

Pos logit contributions
ands+0.032
 ago+0.028
 pattern+0.025
 space+0.025
 tunnel+0.025
Neg logit contributions
aleb-0.038
 butcher-0.037
 Junior-0.035
Which-0.034
 beef-0.034
Token was
Token ID373
Feature activation+1.11e-02

Pos logit contributions
aleb+0.035
 butcher+0.034
 Junior+0.032
 beef+0.031
 steak+0.030
Neg logit contributions
ands-0.039
 ago-0.034
 pattern-0.031
 tunnel-0.031
 space-0.031
Token here
Token ID994
Feature activation+2.94e-02

Pos logit contributions
ands+0.017
 ago+0.013
 tunnel+0.012
 pattern+0.012
 disl+0.012
Neg logit contributions
 butcher-0.022
aleb-0.021
 beef-0.021
 Junior-0.021
 steak-0.021
Token a
Token ID257
Feature activation-2.55e-03

Pos logit contributions
ands+0.042
 ago+0.033
 pattern+0.031
 tunnel+0.031
 disl+0.030
Neg logit contributions
 butcher-0.059
aleb-0.058
 beef-0.056
 steak-0.055
 Junior-0.055
Token minute
Token ID5664
Feature activation+1.31e-02

Pos logit contributions
aleb+0.005
 butcher+0.005
 Junior+0.005
Which+0.005
 beef+0.005
Neg logit contributions
ands-0.004
 ago-0.003
 space-0.003
 pattern-0.003
 tunnel-0.003
Token ago
Token ID2084
Feature activation+1.76e-01

Pos logit contributions
ands+0.016
 disl+0.012
 tunnel+0.011
 pattern+0.011
 common+0.011
Neg logit contributions
 butcher-0.028
aleb-0.028
 beef-0.027
 Junior-0.026
 steak-0.026
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
ands+0.203
 tunnel+0.141
 pattern+0.140
 disl+0.133
 ago+0.132
Neg logit contributions
aleb-0.395
 butcher-0.391
 beef-0.375
 steak-0.375
 Junior-0.369
Token He
Token ID679
Feature activation-2.11e-02

Pos logit contributions
ands+0.031
 ago+0.027
 pattern+0.025
 space+0.024
 tunnel+0.024
Neg logit contributions
aleb-0.033
 butcher-0.032
 Junior-0.030
Which-0.030
 beef-0.030
Token was
Token ID373
Feature activation+1.09e-02

Pos logit contributions
aleb+0.037
 butcher+0.037
 Junior+0.035
 beef+0.034
 steak+0.033
Neg logit contributions
ands-0.036
 ago-0.030
 pattern-0.027
 tunnel-0.027
 space-0.027
Token chasing
Token ID20023
Feature activation-6.73e-02

Pos logit contributions
ands+0.017
 ago+0.013
 tunnel+0.012
 disl+0.012
 pattern+0.012
Neg logit contributions
 butcher-0.021
aleb-0.021
 beef-0.020
 Junior-0.020
 steak-0.020
Token the
Token ID262
Feature activation+4.44e-03

Pos logit contributions
aleb+0.132
 butcher+0.129
 Junior+0.120
Which+0.119
 beef+0.119
Neg logit contributions
ands-0.098
 ago-0.087
 pattern-0.076
 space-0.076
 tunnel-0.076
Token our
Token ID674
Feature activation-1.85e-02

Pos logit contributions
aleb+0.029
 butcher+0.028
 beef+0.026
 Junior+0.026
Which+0.026
Neg logit contributions
ands-0.020
 ago-0.017
 pattern-0.016
 tunnel-0.016
 space-0.015
Token country
Token ID1499
Feature activation+6.85e-02

Pos logit contributions
aleb+0.035
 butcher+0.034
 beef+0.032
 Junior+0.031
 steak+0.031
Neg logit contributions
ands-0.028
 ago-0.024
 pattern-0.022
 tunnel-0.021
 disl-0.021
Token a
Token ID257
Feature activation-6.90e-03

Pos logit contributions
ands+0.089
 ago+0.064
 disl+0.061
 pattern+0.058
 common+0.057
Neg logit contributions
 butcher-0.147
aleb-0.146
 beef-0.138
 steak-0.137
 Junior-0.136
Token long
Token ID890
Feature activation+2.05e-02

Pos logit contributions
 butcher+0.013
aleb+0.013
 beef+0.012
 steak+0.012
 Junior+0.012
Neg logit contributions
ands-0.011
 ago-0.009
 disl-0.008
 pattern-0.008
 tunnel-0.007
Token time
Token ID640
Feature activation+2.31e-02

Pos logit contributions
ands+0.031
 disl+0.024
chie+0.022
 ago+0.022
 pattern+0.022
Neg logit contributions
aleb-0.038
 butcher-0.038
 beef-0.036
 steak-0.036
 Junior-0.035
Token ago
Token ID2084
Feature activation+1.75e-01

Pos logit contributions
ands+0.030
 disl+0.021
 pattern+0.021
 tunnel+0.021
chie+0.020
Neg logit contributions
 butcher-0.046
aleb-0.046
 Junior-0.044
 beef-0.044
 steak-0.043
Token.
Token ID13
Feature activation+3.76e-03

Pos logit contributions
ands+0.199
 pattern+0.135
 tunnel+0.134
 disl+0.131
 common+0.127
Neg logit contributions
aleb-0.395
 butcher-0.392
 beef-0.374
 steak-0.373
 Junior-0.372
Token He
Token ID679
Feature activation-2.92e-02

Pos logit contributions
ands+0.006
 ago+0.005
 pattern+0.005
 tunnel+0.005
 space+0.004
Neg logit contributions
aleb-0.007
 butcher-0.007
 Junior-0.007
Which-0.006
 beef-0.006
Token helped
Token ID4193
Feature activation-1.35e-02

Pos logit contributions
aleb+0.048
 butcher+0.046
 Junior+0.044
Which+0.042
 beef+0.041
Neg logit contributions
ands-0.055
 ago-0.045
 disl-0.041
 pattern-0.041
 tunnel-0.040
Token many
Token ID867
Feature activation+2.56e-02

Pos logit contributions
aleb+0.027
 butcher+0.026
Which+0.024
 Junior+0.024
 beef+0.024
Neg logit contributions
ands-0.019
 ago-0.017
 space-0.015
 pattern-0.015
 tunnel-0.015
Token people
Token ID661
Feature activation+3.44e-02

Pos logit contributions
ands+0.036
 ago+0.032
 disl+0.028
 pattern+0.027
 tunnel+0.027
Neg logit contributions
aleb-0.050
 butcher-0.050
 beef-0.047
 steak-0.046
 Junior-0.046
Token born
Token ID4642
Feature activation+1.45e-02

Pos logit contributions
aleb+0.015
 butcher+0.015
 Junior+0.014
 beef+0.013
Which+0.013
Neg logit contributions
ands-0.012
 ago-0.010
 pattern-0.009
 tunnel-0.009
 space-0.009
Token only
Token ID691
Feature activation-1.10e-02

Pos logit contributions
ands+0.018
 ago+0.015
 pattern+0.013
 tunnel+0.013
 space+0.013
Neg logit contributions
aleb-0.031
 butcher-0.031
 Junior-0.029
 beef-0.029
Which-0.029
Token a
Token ID257
Feature activation-1.25e-02

Pos logit contributions
aleb+0.021
 butcher+0.020
 beef+0.019
Which+0.019
 Junior+0.019
Neg logit contributions
ands-0.017
 ago-0.014
 tunnel-0.013
 pattern-0.012
 space-0.012
Token few
Token ID1178
Feature activation+3.14e-03

Pos logit contributions
 butcher+0.024
aleb+0.024
 beef+0.022
 Junior+0.022
Which+0.022
Neg logit contributions
ands-0.019
 ago-0.016
 disl-0.014
 space-0.014
 tunnel-0.014
Token days
Token ID1528
Feature activation+5.29e-02

Pos logit contributions
ands+0.005
 ago+0.004
 tunnel+0.004
 disl+0.004
 pattern+0.004
Neg logit contributions
 butcher-0.005
aleb-0.005
 beef-0.005
 steak-0.005
Which-0.005
Token ago
Token ID2084
Feature activation+1.73e-01

Pos logit contributions
ands+0.069
 tunnel+0.052
 pattern+0.052
 common+0.051
 ago+0.050
Neg logit contributions
aleb-0.111
 butcher-0.110
 Junior-0.105
 beef-0.103
 steak-0.101
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
ands+0.199
 tunnel+0.135
 ago+0.135
 pattern+0.133
 common+0.129
Neg logit contributions
aleb-0.390
 butcher-0.386
 Junior-0.367
 beef-0.367
 steak-0.365
Token The
Token ID383
Feature activation+3.25e-02

Pos logit contributions
aleb+0.020
 butcher+0.020
Which+0.018
 Junior+0.018
 beef+0.018
Neg logit contributions
ands-0.015
 ago-0.013
 space-0.012
 pattern-0.011
 tunnel-0.011
Token deer
Token ID20096
Feature activation-5.16e-02

Pos logit contributions
ands+0.051
 ago+0.045
 space+0.040
 pattern+0.040
 tunnel+0.039
Neg logit contributions
aleb-0.060
 butcher-0.059
 Junior-0.055
 beef-0.054
Which-0.054
Token felt
Token ID2936
Feature activation+1.25e-02

Pos logit contributions
aleb+0.093
 butcher+0.090
 Junior+0.084
Which+0.083
 beef+0.083
Neg logit contributions
ands-0.085
 ago-0.074
 space-0.066
 tunnel-0.065
 pattern-0.065
Token sorry
Token ID7926
Feature activation-2.95e-02

Pos logit contributions
ands+0.020
 ago+0.017
 tunnel+0.016
 hall+0.015
 pattern+0.015
Neg logit contributions
 butcher-0.022
aleb-0.022
 beef-0.022
 Junior-0.022
 steak-0.021
Token are
Token ID389
Feature activation+4.30e-03

Pos logit contributions
ands+0.024
 ago+0.022
 tunnel+0.020
chie+0.020
 hall+0.020
Neg logit contributions
aleb-0.019
 beef-0.019
 butcher-0.019
 Junior-0.018
 steak-0.018
Token from
Token ID422
Feature activation-2.37e-02

Pos logit contributions
ands+0.006
 ago+0.005
 hall+0.004
 disl+0.004
 tunnel+0.004
Neg logit contributions
 butcher-0.009
 beef-0.009
aleb-0.009
 Junior-0.008
 steak-0.008
Token a
Token ID257
Feature activation-4.46e-04

Pos logit contributions
 butcher+0.044
aleb+0.044
 beef+0.043
 Junior+0.042
 steak+0.042
Neg logit contributions
ands-0.035
 ago-0.028
 pattern-0.028
 tunnel-0.027
 disl-0.026
Token long
Token ID890
Feature activation+2.55e-02

Pos logit contributions
 butcher+0.001
aleb+0.001
 beef+0.001
 Junior+0.001
Which+0.001
Neg logit contributions
ands-0.001
 ago-0.001
 pattern-0.000
 tunnel-0.000
 disl-0.000
Token time
Token ID640
Feature activation+2.68e-02

Pos logit contributions
ands+0.037
 disl+0.029
chie+0.028
 ago+0.028
 pattern+0.027
Neg logit contributions
 butcher-0.049
aleb-0.048
 beef-0.046
 steak-0.046
 Junior-0.045
Token ago
Token ID2084
Feature activation+1.72e-01

Pos logit contributions
ands+0.031
chie+0.026
 pattern+0.025
 disl+0.024
 tunnel+0.024
Neg logit contributions
 butcher-0.050
 beef-0.049
aleb-0.049
 steak-0.048
 Junior-0.047
Token,
Token ID11
Feature activation+2.60e-02

Pos logit contributions
ands+0.214
chie+0.161
 tunnel+0.160
 pattern+0.158
 disl+0.157
Neg logit contributions
 butcher-0.349
aleb-0.348
 beef-0.336
 steak-0.334
 Junior-0.320
Token before
Token ID878
Feature activation-2.13e-02

Pos logit contributions
ands+0.039
 ago+0.032
 pattern+0.029
 tunnel+0.029
chie+0.028
Neg logit contributions
 butcher-0.049
aleb-0.049
 beef-0.047
Which-0.046
 Junior-0.046
Token we
Token ID356
Feature activation+1.97e-02

Pos logit contributions
 beef+0.038
 butcher+0.037
 Junior+0.037
 steak+0.036
Which+0.034
Neg logit contributions
ands-0.033
chie-0.029
 pattern-0.027
 disl-0.027
ro-0.026
Token were
Token ID547
Feature activation-4.13e-03

Pos logit contributions
ands+0.029
 ago+0.025
 disl+0.022
 tunnel+0.022
 pattern+0.022
Neg logit contributions
aleb-0.037
 butcher-0.037
 Junior-0.035
 beef-0.034
 steak-0.034
Token born
Token ID4642
Feature activation+2.26e-02

Pos logit contributions
 butcher+0.008
aleb+0.007
 beef+0.007
 steak+0.007
 Junior+0.007
Neg logit contributions
ands-0.006
 tunnel-0.005
 ago-0.005
 pattern-0.005
 disl-0.005
Token lived
Token ID5615
Feature activation+2.24e-02

Pos logit contributions
aleb+0.003
 butcher+0.003
 Junior+0.003
Which+0.003
 beef+0.003
Neg logit contributions
ands-0.003
 ago-0.003
 space-0.002
 pattern-0.002
 tunnel-0.002
Token here
Token ID994
Feature activation+2.55e-02

Pos logit contributions
ands+0.032
 ago+0.029
 space+0.025
 pattern+0.025
 tunnel+0.025
Neg logit contributions
aleb-0.045
 butcher-0.044
 Junior-0.041
Which-0.040
 beef-0.040
Token a
Token ID257
Feature activation-6.53e-03

Pos logit contributions
ands+0.031
 ago+0.027
 pattern+0.023
 tunnel+0.023
 space+0.022
Neg logit contributions
aleb-0.056
 butcher-0.055
Which-0.051
 beef-0.051
 Junior-0.051
Token long
Token ID890
Feature activation+2.16e-02

Pos logit contributions
aleb+0.013
 butcher+0.013
 Junior+0.012
 beef+0.012
Which+0.012
Neg logit contributions
ands-0.010
 ago-0.008
 pattern-0.007
 space-0.007
 tunnel-0.007
Token time
Token ID640
Feature activation+2.28e-02

Pos logit contributions
ands+0.031
 ago+0.025
 disl+0.024
 pattern+0.023
chie+0.022
Neg logit contributions
aleb-0.043
 butcher-0.042
 beef-0.039
 steak-0.039
 Junior-0.039
Token ago
Token ID2084
Feature activation+1.72e-01

Pos logit contributions
ands+0.030
 disl+0.022
chie+0.021
 pattern+0.021
 common+0.020
Neg logit contributions
 butcher-0.044
aleb-0.044
 steak-0.043
 beef-0.043
 Junior-0.042
Token."
Token ID526
Feature activation+4.00e-03

Pos logit contributions
ands+0.198
 pattern+0.135
 disl+0.134
 tunnel+0.132
 ago+0.130
Neg logit contributions
aleb-0.387
 butcher-0.383
 steak-0.367
 beef-0.367
 Junior-0.362
Token
Token ID198
Feature activation-3.89e-03

Pos logit contributions
ands+0.006
 ago+0.005
 pattern+0.005
 tunnel+0.005
 space+0.005
Neg logit contributions
aleb-0.007
 butcher-0.007
 Junior-0.007
 beef-0.007
Which-0.007
Token
Token ID198
Feature activation-4.88e-03

Pos logit contributions
aleb+0.008
 butcher+0.008
 Junior+0.007
Which+0.007
 beef+0.007
Neg logit contributions
ands-0.005
 ago-0.005
 pattern-0.004
 space-0.004
 tunnel-0.004
Token"
Token ID1
Feature activation+2.97e-03

Pos logit contributions
aleb+0.008
 butcher+0.007
Which+0.007
 Junior+0.007
 beef+0.006
Neg logit contributions
ands-0.009
 ago-0.009
 pattern-0.008
 space-0.008
 tunnel-0.008
TokenCan
Token ID6090
Feature activation-6.77e-02

Pos logit contributions
ands+0.006
 ago+0.005
 space+0.005
 common+0.005
 tunnel+0.005
Neg logit contributions
aleb-0.004
 butcher-0.004
 Junior-0.004
 steak-0.003
 beef-0.003
Token story
Token ID1621
Feature activation+9.86e-03

Pos logit contributions
aleb+0.016
 butcher+0.016
 beef+0.015
Which+0.015
 Junior+0.015
Neg logit contributions
ands-0.012
 ago-0.011
 space-0.009
 pattern-0.009
 tunnel-0.009
Token.
Token ID13
Feature activation+5.47e-03

Pos logit contributions
ands+0.014
 disl+0.011
 tunnel+0.011
 ago+0.011
 pattern+0.010
Neg logit contributions
 butcher-0.019
aleb-0.018
 beef-0.018
 steak-0.018
 Turkey-0.017
Token A
Token ID317
Feature activation+2.34e-02

Pos logit contributions
ands+0.010
 ago+0.008
 pattern+0.008
 tunnel+0.008
 space+0.008
Neg logit contributions
aleb-0.009
 butcher-0.009
 Junior-0.008
Which-0.008
 beef-0.008
Token long
Token ID890
Feature activation+1.93e-02

Pos logit contributions
ands+0.037
 ago+0.031
 disl+0.027
chie+0.025
ro+0.025
Neg logit contributions
 butcher-0.045
aleb-0.042
 beef-0.042
 steak-0.042
 turkey-0.041
Token time
Token ID640
Feature activation+2.06e-02

Pos logit contributions
ands+0.031
 ago+0.024
chie+0.023
 disl+0.023
 common+0.022
Neg logit contributions
 butcher-0.034
 steak-0.033
 beef-0.033
aleb-0.033
 turkey-0.033
Token ago
Token ID2084
Feature activation+1.72e-01

Pos logit contributions
ands+0.032
chie+0.025
ro+0.025
 pattern+0.023
 tunnel+0.023
Neg logit contributions
 beef-0.031
 steak-0.030
 Turkey-0.029
 butcher-0.029
 turkey-0.029
Token,
Token ID11
Feature activation+1.90e-02

Pos logit contributions
ands+0.218
 pattern+0.159
 tunnel+0.157
chie+0.152
 disl+0.149
Neg logit contributions
aleb-0.360
 butcher-0.356
 steak-0.344
 beef-0.343
 Junior-0.336
Token there
Token ID612
Feature activation-6.05e-04

Pos logit contributions
ands+0.029
 ago+0.025
 pattern+0.022
 tunnel+0.022
 space+0.022
Neg logit contributions
aleb-0.036
 butcher-0.035
 Junior-0.033
Which-0.033
 beef-0.033
Token was
Token ID373
Feature activation+5.50e-03

Pos logit contributions
aleb+0.001
 butcher+0.001
 Junior+0.001
 beef+0.001
Which+0.001
Neg logit contributions
ands-0.001
 ago-0.001
 pattern-0.001
 space-0.001
 tunnel-0.001
Token an
Token ID281
Feature activation+2.43e-02

Pos logit contributions
ands+0.007
 ago+0.006
 pattern+0.005
 space+0.005
 tunnel+0.005
Neg logit contributions
aleb-0.012
 butcher-0.012
 Junior-0.011
Which-0.011
 beef-0.011
Token adventurous
Token ID39770
Feature activation+2.63e-02

Pos logit contributions
ands+0.033
 ago+0.028
 common+0.025
ro+0.024
 disl+0.024
Neg logit contributions
aleb-0.051
 butcher-0.050
 beef-0.047
 steak-0.046
 turkey-0.045
Token they
Token ID484
Feature activation-1.31e-02

Pos logit contributions
aleb+0.126
 butcher+0.125
 beef+0.120
 steak+0.118
Which+0.116
Neg logit contributions
ands-0.118
 ago-0.101
 tunnel-0.094
 pattern-0.092
 hall-0.091
Token lived
Token ID5615
Feature activation+2.58e-02

Pos logit contributions
aleb+0.022
 butcher+0.021
 Junior+0.019
Which+0.019
 beef+0.019
Neg logit contributions
ands-0.023
 ago-0.021
 tunnel-0.019
 space-0.018
 pattern-0.018
Token a
Token ID257
Feature activation-4.73e-03

Pos logit contributions
ands+0.039
 ago+0.034
 disl+0.030
 pattern+0.030
 tunnel+0.030
Neg logit contributions
aleb-0.048
 butcher-0.048
 beef-0.044
 steak-0.043
Which-0.043
Token long
Token ID890
Feature activation+2.47e-02

Pos logit contributions
 butcher+0.009
aleb+0.009
 beef+0.008
 Junior+0.008
 steak+0.008
Neg logit contributions
ands-0.007
 ago-0.006
 disl-0.005
 tunnel-0.005
 pattern-0.005
Token time
Token ID640
Feature activation+2.38e-02

Pos logit contributions
ands+0.038
 disl+0.030
 ago+0.030
chie+0.028
 outage+0.028
Neg logit contributions
aleb-0.046
 butcher-0.045
 steak-0.043
 beef-0.042
 Junior-0.042
Token ago
Token ID2084
Feature activation+1.72e-01

Pos logit contributions
ands+0.033
 disl+0.026
chie+0.025
 common+0.025
 tunnel+0.025
Neg logit contributions
 butcher-0.043
aleb-0.043
 beef-0.041
 steak-0.041
 Junior-0.040
Token.
Token ID13
Feature activation+2.77e-03

Pos logit contributions
ands+0.188
 tunnel+0.128
 pattern+0.125
 disl+0.123
 space+0.119
Neg logit contributions
aleb-0.393
 butcher-0.388
 beef-0.371
 steak-0.369
 Junior-0.369
Token Lily
Token ID20037
Feature activation+5.25e-02

Pos logit contributions
ands+0.004
 ago+0.004
 tunnel+0.003
 pattern+0.003
 space+0.003
Neg logit contributions
aleb-0.005
 butcher-0.005
Which-0.005
 Junior-0.005
 beef-0.005
Token listened
Token ID16399
Feature activation-3.44e-02

Pos logit contributions
ands+0.088
 ago+0.078
 space+0.069
 tunnel+0.068
 common+0.067
Neg logit contributions
aleb-0.095
 butcher-0.090
 Junior-0.084
Which-0.082
 beef-0.081
Token carefully
Token ID7773
Feature activation+3.23e-02

Pos logit contributions
aleb+0.053
 butcher+0.051
 beef+0.050
 steak+0.050
 Junior+0.050
Neg logit contributions
ands-0.062
 ago-0.053
 tunnel-0.052
 disl-0.050
 hall-0.049
Token and
Token ID290
Feature activation-1.50e-02

Pos logit contributions
ands+0.047
 ago+0.038
 tunnel+0.036
 disl+0.035
 pattern+0.035
Neg logit contributions
 butcher-0.062
aleb-0.062
 beef-0.058
 steak-0.058
 Junior-0.058
Token people
Token ID661
Feature activation+3.62e-02

Pos logit contributions
aleb+0.193
 butcher+0.192
 beef+0.182
 steak+0.179
Which+0.179
Neg logit contributions
ands-0.121
 ago-0.100
 pattern-0.089
 tunnel-0.088
 space-0.084
Token did
Token ID750
Feature activation+1.66e-02

Pos logit contributions
ands+0.054
 tunnel+0.046
 ago+0.044
 pattern+0.044
 hall+0.042
Neg logit contributions
aleb-0.066
 butcher-0.064
 beef-0.062
 Junior-0.061
 steak-0.061
Token a
Token ID257
Feature activation-8.52e-04

Pos logit contributions
ands+0.026
chie+0.023
 tunnel+0.022
ro+0.021
 pattern+0.020
Neg logit contributions
 beef-0.026
 steak-0.025
 butcher-0.025
 turkey-0.024
aleb-0.024
Token long
Token ID890
Feature activation+2.58e-02

Pos logit contributions
aleb+0.002
 butcher+0.002
 Junior+0.002
Which+0.002
 beef+0.002
Neg logit contributions
ands-0.001
 ago-0.001
 space-0.001
 pattern-0.001
 tunnel-0.001
Token time
Token ID640
Feature activation+2.55e-02

Pos logit contributions
ands+0.037
chie+0.029
 ago+0.028
 tunnel+0.028
 pattern+0.027
Neg logit contributions
 butcher-0.049
aleb-0.047
 steak-0.046
 beef-0.046
 turkey-0.044
Token ago
Token ID2084
Feature activation+1.71e-01

Pos logit contributions
ands+0.030
chie+0.025
 tunnel+0.025
 pattern+0.023
 hall+0.022
Neg logit contributions
 beef-0.049
 butcher-0.048
 steak-0.048
aleb-0.046
 Junior-0.045
Token.
Token ID13
Feature activation+2.71e-02

Pos logit contributions
ands+0.213
 tunnel+0.167
 pattern+0.161
chie+0.157
 disl+0.155
Neg logit contributions
aleb-0.345
 butcher-0.339
 steak-0.335
 beef-0.333
 Junior-0.314
Token They
Token ID1119
Feature activation+2.43e-02

Pos logit contributions
ands+0.045
 ago+0.038
 pattern+0.036
 tunnel+0.035
 space+0.034
Neg logit contributions
 butcher-0.048
aleb-0.047
Which-0.045
 Junior-0.044
 beef-0.043
Token lived
Token ID5615
Feature activation+2.83e-02

Pos logit contributions
ands+0.041
 ago+0.034
 tunnel+0.033
ro+0.032
chie+0.032
Neg logit contributions
aleb-0.038
 butcher-0.038
 Junior-0.036
 beef-0.035
 Turkey-0.035
Token in
Token ID287
Feature activation+2.54e-02

Pos logit contributions
ands+0.045
 ago+0.039
 pattern+0.035
 tunnel+0.035
 space+0.035
Neg logit contributions
aleb-0.052
 butcher-0.051
 beef-0.047
Which-0.047
 Junior-0.047
Token castles
Token ID49203
Feature activation+5.74e-02

Pos logit contributions
ands+0.042
 ago+0.032
 pattern+0.032
 disl+0.032
ooters+0.031
Neg logit contributions
 butcher-0.045
aleb-0.045
 beef-0.044
 steak-0.044
 Junior-0.043
Token Daisy
Token ID40355
Feature activation-4.47e-02

Pos logit contributions
aleb+0.011
 butcher+0.011
 Junior+0.010
Which+0.010
 beef+0.010
Neg logit contributions
ands-0.009
 ago-0.007
 tunnel-0.007
 pattern-0.006
 space-0.006
Token had
Token ID550
Feature activation+1.22e-02

Pos logit contributions
aleb+0.080
 butcher+0.076
 Junior+0.072
Which+0.070
 beef+0.069
Neg logit contributions
ands-0.076
 ago-0.065
 tunnel-0.059
 space-0.058
 common-0.057
Token planted
Token ID18969
Feature activation+7.16e-03

Pos logit contributions
ands+0.021
 ago+0.017
 tunnel+0.016
 disl+0.016
 outage+0.016
Neg logit contributions
aleb-0.021
Which-0.020
 butcher-0.020
 Junior-0.019
 ju-0.019
Token it
Token ID340
Feature activation-6.00e-03

Pos logit contributions
ands+0.009
 ago+0.008
 space+0.007
 pattern+0.007
 tunnel+0.007
Neg logit contributions
aleb-0.015
 butcher-0.015
 Junior-0.014
Which-0.014
 beef-0.014
Token weeks
Token ID2745
Feature activation+6.07e-02

Pos logit contributions
aleb+0.011
 butcher+0.011
Which+0.010
 beef+0.010
 Junior+0.010
Neg logit contributions
ands-0.009
 ago-0.008
 space-0.007
 tunnel-0.007
 pattern-0.007
Token ago
Token ID2084
Feature activation+1.71e-01

Pos logit contributions
ands+0.090
 disl+0.078
 tunnel+0.075
 common+0.074
 pattern+0.072
Neg logit contributions
aleb-0.110
 butcher-0.107
 Junior-0.101
 beef-0.100
avored-0.099
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
ands+0.228
 tunnel+0.170
 ago+0.166
 disl+0.164
 pattern+0.164
Neg logit contributions
aleb-0.349
 butcher-0.345
 beef-0.330
 steak-0.327
 Junior-0.326
Token but
Token ID475
Feature activation-6.28e-02

Pos logit contributions
ands+0.017
 ago+0.014
 tunnel+0.013
 pattern+0.012
 space+0.012
Neg logit contributions
aleb-0.022
 butcher-0.022
 Junior-0.021
Which-0.021
 beef-0.020
Token now
Token ID783
Feature activation+3.97e-02

Pos logit contributions
aleb+0.127
 butcher+0.124
 Junior+0.115
Which+0.115
 beef+0.114
Neg logit contributions
ands-0.087
 ago-0.078
 space-0.068
 pattern-0.068
 tunnel-0.067
Token she
Token ID673
Feature activation+8.33e-03

Pos logit contributions
ands+0.040
 ago+0.036
 space+0.030
 pattern+0.029
 tunnel+0.028
Neg logit contributions
aleb-0.094
 butcher-0.093
Which-0.086
 beef-0.086
 Junior-0.086
Token could
Token ID714
Feature activation-1.12e-02

Pos logit contributions
ands+0.013
 ago+0.011
 pattern+0.010
 space+0.010
 tunnel+0.010
Neg logit contributions
aleb-0.016
 butcher-0.015
 Junior-0.014
Which-0.014
 beef-0.014
Token were
Token ID547
Feature activation-1.20e-02

Pos logit contributions
aleb+0.021
 butcher+0.021
 Junior+0.019
Which+0.019
 beef+0.019
Neg logit contributions
ands-0.026
 ago-0.023
 tunnel-0.021
 space-0.021
 pattern-0.021
Token from
Token ID422
Feature activation-2.84e-02

Pos logit contributions
 butcher+0.022
aleb+0.022
 beef+0.021
 Junior+0.021
 steak+0.020
Neg logit contributions
ands-0.019
 ago-0.015
 tunnel-0.015
 pattern-0.015
ro-0.015
Token a
Token ID257
Feature activation-6.50e-03

Pos logit contributions
aleb+0.054
 butcher+0.053
 beef+0.051
 Junior+0.050
 steak+0.049
Neg logit contributions
ands-0.043
 ago-0.035
 pattern-0.034
 tunnel-0.034
 outage-0.031
Token long
Token ID890
Feature activation+2.13e-02

Pos logit contributions
 butcher+0.013
aleb+0.013
 beef+0.012
Which+0.012
 Junior+0.012
Neg logit contributions
ands-0.009
 ago-0.008
 pattern-0.007
 space-0.007
 tunnel-0.007
Token time
Token ID640
Feature activation+2.23e-02

Pos logit contributions
ands+0.032
 disl+0.023
 outage+0.022
 ago+0.021
chie+0.021
Neg logit contributions
aleb-0.041
 butcher-0.041
 beef-0.038
 steak-0.038
 turkey-0.038
Token ago
Token ID2084
Feature activation+1.71e-01

Pos logit contributions
ands+0.034
 tunnel+0.027
 pattern+0.027
chie+0.026
 disl+0.025
Neg logit contributions
 butcher-0.038
aleb-0.038
 beef-0.036
Which-0.036
 Junior-0.036
Token and
Token ID290
Feature activation-1.91e-02

Pos logit contributions
ands+0.197
 tunnel+0.138
 pattern+0.136
 ago+0.130
 space+0.129
Neg logit contributions
aleb-0.383
 butcher-0.380
 beef-0.364
 steak-0.360
 Junior-0.360
Token they
Token ID484
Feature activation-1.27e-02

Pos logit contributions
 butcher+0.034
aleb+0.033
Which+0.032
 beef+0.032
 Junior+0.032
Neg logit contributions
ands-0.032
 ago-0.025
 tunnel-0.024
 pattern-0.024
 space-0.023
Token help
Token ID1037
Feature activation+2.71e-02

Pos logit contributions
aleb+0.019
 butcher+0.019
 Junior+0.018
Which+0.017
 beef+0.017
Neg logit contributions
ands-0.025
 ago-0.021
 tunnel-0.019
 pattern-0.019
 space-0.019
Token us
Token ID514
Feature activation+6.45e-02

Pos logit contributions
ands+0.039
 ago+0.034
 space+0.031
 pattern+0.030
 tunnel+0.030
Neg logit contributions
aleb-0.054
 butcher-0.053
Which-0.049
 Junior-0.049
 beef-0.048
Token remember
Token ID3505
Feature activation+1.01e-02

Pos logit contributions
ands+0.096
 ago+0.081
 tunnel+0.072
 pattern+0.069
 disl+0.069
Neg logit contributions
 butcher-0.117
aleb-0.117
 beef-0.113
 Junior-0.112
 steak-0.109
Token is
Token ID318
Feature activation+2.40e-02

Pos logit contributions
ands+0.106
 ago+0.092
 disl+0.080
 space+0.080
 pattern+0.080
Neg logit contributions
aleb-0.096
 butcher-0.093
 beef-0.088
 steak-0.087
rified-0.085
Token from
Token ID422
Feature activation-2.73e-02

Pos logit contributions
ands+0.036
 ago+0.029
 disl+0.026
 pattern+0.025
 hall+0.025
Neg logit contributions
 butcher-0.046
aleb-0.046
 beef-0.045
 Junior-0.044
Which-0.044
Token a
Token ID257
Feature activation-3.31e-03

Pos logit contributions
aleb+0.052
 butcher+0.051
 beef+0.049
 steak+0.049
 Junior+0.049
Neg logit contributions
ands-0.040
 ago-0.033
 pattern-0.031
 disl-0.030
 tunnel-0.029
Token long
Token ID890
Feature activation+2.26e-02

Pos logit contributions
aleb+0.007
 butcher+0.007
 beef+0.006
 Junior+0.006
Which+0.006
Neg logit contributions
ands-0.005
 ago-0.004
 pattern-0.003
 space-0.003
 disl-0.003
Token time
Token ID640
Feature activation+2.16e-02

Pos logit contributions
ands+0.034
 disl+0.027
 ago+0.025
 common+0.023
 pattern+0.023
Neg logit contributions
aleb-0.042
 butcher-0.042
 steak-0.040
 beef-0.040
 Junior-0.039
Token ago
Token ID2084
Feature activation+1.71e-01

Pos logit contributions
ands+0.029
 disl+0.021
 pattern+0.020
 common+0.020
 tunnel+0.020
Neg logit contributions
 butcher-0.042
aleb-0.041
 beef-0.040
 Junior-0.040
 steak-0.039
Token,
Token ID11
Feature activation+2.03e-02

Pos logit contributions
ands+0.201
 tunnel+0.138
 pattern+0.137
 disl+0.134
 ago+0.133
Neg logit contributions
aleb-0.379
 butcher-0.376
 beef-0.359
 steak-0.357
 Junior-0.351
Token from
Token ID422
Feature activation-2.59e-02

Pos logit contributions
ands+0.032
 ago+0.026
 tunnel+0.024
 pattern+0.024
 space+0.023
Neg logit contributions
aleb-0.038
 butcher-0.038
 Junior-0.036
Which-0.035
 beef-0.035
Token when
Token ID618
Feature activation-4.32e-02

Pos logit contributions
aleb+0.051
 butcher+0.050
 beef+0.047
 steak+0.047
 Junior+0.047
Neg logit contributions
ands-0.038
 ago-0.031
 pattern-0.029
 disl-0.028
 tunnel-0.028
Token dinosaurs
Token ID27199
Feature activation-2.72e-02

Pos logit contributions
aleb+0.083
 butcher+0.083
 Junior+0.079
 beef+0.079
Which+0.079
Neg logit contributions
ands-0.064
 ago-0.052
 pattern-0.047
 disl-0.047
 tunnel-0.046
Token lived
Token ID5615
Feature activation+2.70e-02

Pos logit contributions
aleb+0.049
 butcher+0.048
 Junior+0.046
 steak+0.045
 beef+0.045
Neg logit contributions
ands-0.042
 ago-0.035
 disl-0.033
 pattern-0.032
 common-0.031
Token he
Token ID339
Feature activation-4.12e-02

Pos logit contributions
aleb+0.056
 butcher+0.055
 Junior+0.051
Which+0.051
 beef+0.051
Neg logit contributions
ands-0.033
 ago-0.029
 space-0.025
 pattern-0.025
 tunnel-0.024
Token had
Token ID550
Feature activation+1.40e-02

Pos logit contributions
aleb+0.072
 butcher+0.070
 Junior+0.064
Which+0.064
 beef+0.062
Neg logit contributions
ands-0.070
 ago-0.059
 tunnel-0.056
 space-0.055
 pattern-0.054
Token seen
Token ID1775
Feature activation+3.86e-02

Pos logit contributions
ands+0.023
 tunnel+0.020
 disl+0.019
 hall+0.019
 pattern+0.019
Neg logit contributions
aleb-0.022
 butcher-0.020
Which-0.020
 steak-0.019
eled-0.019
Token a
Token ID257
Feature activation-6.76e-03

Pos logit contributions
ands+0.042
 ago+0.036
 space+0.030
 pattern+0.030
 tunnel+0.029
Neg logit contributions
aleb-0.090
 butcher-0.088
 Junior-0.083
Which-0.082
 beef-0.082
Token year
Token ID614
Feature activation+1.18e-01

Pos logit contributions
 butcher+0.013
aleb+0.013
Which+0.012
 beef+0.012
 Junior+0.012
Neg logit contributions
ands-0.011
 ago-0.009
 disl-0.008
 tunnel-0.008
 space-0.008
Token ago
Token ID2084
Feature activation+1.71e-01

Pos logit contributions
ands+0.155
 tunnel+0.108
 pattern+0.106
 disl+0.103
 ago+0.103
Neg logit contributions
aleb-0.248
 butcher-0.246
 steak-0.232
 beef-0.230
 Junior-0.226
Token.
Token ID13
Feature activation+1.63e-03

Pos logit contributions
ands+0.193
 tunnel+0.132
 disl+0.128
 pattern+0.126
 common+0.123
Neg logit contributions
aleb-0.387
 butcher-0.383
 steak-0.367
 beef-0.366
 Junior-0.363
Token He
Token ID679
Feature activation-3.08e-02

Pos logit contributions
ands+0.002
 ago+0.002
 pattern+0.002
 space+0.002
 tunnel+0.002
Neg logit contributions
aleb-0.003
 butcher-0.003
 Junior-0.003
Which-0.003
 beef-0.003
Token went
Token ID1816
Feature activation-1.38e-02

Pos logit contributions
aleb+0.054
 butcher+0.052
 Junior+0.049
Which+0.048
eled+0.047
Neg logit contributions
ands-0.054
 ago-0.046
 disl-0.041
 space-0.041
 tunnel-0.040
Token home
Token ID1363
Feature activation+9.70e-02

Pos logit contributions
 butcher+0.021
aleb+0.020
Which+0.018
 Junior+0.018
 beef+0.017
Neg logit contributions
ands-0.027
 disl-0.024
 ago-0.024
chie-0.021
 common-0.021
Token,
Token ID11
Feature activation+1.79e-02

Pos logit contributions
ands+0.210
 disl+0.170
chie+0.158
 pattern+0.156
 tunnel+0.155
Neg logit contributions
 butcher-0.126
 Junior-0.122
 steak-0.111
aleb-0.110
 toddler-0.108
Token lost
Token ID2626
Feature activation+5.37e-02

Pos logit contributions
aleb+0.040
 butcher+0.039
 Junior+0.037
 beef+0.036
Which+0.035
Neg logit contributions
ands-0.041
 ago-0.035
 pattern-0.033
 tunnel-0.033
 space-0.031
Token it
Token ID340
Feature activation+5.92e-04

Pos logit contributions
ands+0.089
 ago+0.076
 pattern+0.070
 disl+0.069
 tunnel+0.069
Neg logit contributions
aleb-0.094
 butcher-0.092
 beef-0.088
 Junior-0.087
 steak-0.087
Token a
Token ID257
Feature activation-2.80e-03

Pos logit contributions
ands+0.001
 disl+0.001
 ago+0.001
 pattern+0.001
 tunnel+0.001
Neg logit contributions
 butcher-0.001
aleb-0.001
 beef-0.001
 steak-0.001
 Junior-0.001
Token long
Token ID890
Feature activation+2.42e-02

Pos logit contributions
 butcher+0.005
aleb+0.005
 beef+0.005
 steak+0.005
 Junior+0.005
Neg logit contributions
ands-0.005
 ago-0.004
 disl-0.003
 pattern-0.003
 space-0.003
Token time
Token ID640
Feature activation+2.51e-02

Pos logit contributions
ands+0.037
 ago+0.028
 disl+0.028
chie+0.027
 tunnel+0.027
Neg logit contributions
aleb-0.045
 butcher-0.044
 steak-0.043
 beef-0.042
 Junior-0.041
Token ago
Token ID2084
Feature activation+1.71e-01

Pos logit contributions
ands+0.023
chie+0.016
 disl+0.015
 tunnel+0.015
 pattern+0.014
Neg logit contributions
 steak-0.055
 beef-0.055
 butcher-0.054
aleb-0.052
 Junior-0.052
Token,
Token ID11
Feature activation+2.52e-02

Pos logit contributions
ands+0.215
 tunnel+0.161
 pattern+0.160
 disl+0.159
chie+0.153
Neg logit contributions
aleb-0.348
 butcher-0.347
 beef-0.340
 steak-0.340
 Junior-0.325
Token and
Token ID290
Feature activation-9.89e-03

Pos logit contributions
ands+0.039
 ago+0.030
 pattern+0.029
 tunnel+0.028
chie+0.027
Neg logit contributions
 butcher-0.047
aleb-0.047
 beef-0.045
 Junior-0.045
Which-0.045
Token you
Token ID345
Feature activation+6.19e-02

Pos logit contributions
aleb+0.018
 butcher+0.018
 Junior+0.017
Which+0.017
 beef+0.017
Neg logit contributions
ands-0.015
 ago-0.013
 pattern-0.012
 tunnel-0.011
 space-0.011
Token found
Token ID1043
Feature activation+4.26e-02

Pos logit contributions
ands+0.108
 ago+0.093
 disl+0.087
 hall+0.085
 pattern+0.083
Neg logit contributions
aleb-0.099
 butcher-0.095
 Junior-0.094
 beef-0.089
 steak-0.089
Token it
Token ID340
Feature activation+3.92e-04

Pos logit contributions
ands+0.063
 ago+0.056
 space+0.050
 pattern+0.049
 tunnel+0.049
Neg logit contributions
aleb-0.083
 butcher-0.081
 Junior-0.075
Which-0.075
 beef-0.074
Token Max
Token ID5436
Feature activation-2.90e-02

Pos logit contributions
aleb+0.025
 butcher+0.024
Which+0.023
 beef+0.022
 Junior+0.022
Neg logit contributions
ands-0.020
 ago-0.019
 space-0.017
 rain-0.017
 pattern-0.017
Token swapped
Token ID37245
Feature activation-3.03e-02

Pos logit contributions
aleb+0.057
 butcher+0.056
Which+0.052
 Junior+0.052
 beef+0.052
Neg logit contributions
ands-0.042
 ago-0.037
 space-0.033
 pattern-0.033
 tunnel-0.032
Token their
Token ID511
Feature activation+5.07e-03

Pos logit contributions
aleb+0.051
 steak+0.049
 beef+0.048
 butcher+0.047
Which+0.047
Neg logit contributions
ands-0.052
 ago-0.046
chie-0.041
 outage-0.040
 tunnel-0.039
Token bubble
Token ID14310
Feature activation-1.86e-02

Pos logit contributions
ands+0.007
 ago+0.007
 outage+0.006
 disl+0.006
forest+0.006
Neg logit contributions
aleb-0.009
 beef-0.009
 turkey-0.009
 steak-0.009
 butcher-0.008
Token w
Token ID266
Feature activation+9.48e-03

Pos logit contributions
aleb+0.035
 butcher+0.034
 steak+0.034
 beef+0.034
 turkey+0.033
Neg logit contributions
ands-0.025
 ago-0.024
 disl-0.021
chie-0.019
 common-0.019
Tokenands
Token ID1746
Feature activation+1.71e-01

Pos logit contributions
 ago+0.013
 hall+0.012
 tunnel+0.011
 disl+0.011
 hallway+0.011
Neg logit contributions
aleb-0.019
 beef-0.017
avored-0.017
 steak-0.016
 butcher-0.016
Token and
Token ID290
Feature activation-1.36e-02

Pos logit contributions
ands+0.243
 disl+0.208
 tunnel+0.196
 rain+0.190
 ago+0.187
Neg logit contributions
aleb-0.318
 butcher-0.300
 steak-0.293
 beef-0.291
 Junior-0.283
Token blew
Token ID17948
Feature activation-5.24e-02

Pos logit contributions
aleb+0.026
 butcher+0.025
 Junior+0.024
Which+0.024
 beef+0.024
Neg logit contributions
ands-0.020
 ago-0.018
 tunnel-0.015
 space-0.015
 pattern-0.015
Token more
Token ID517
Feature activation+2.31e-02

Pos logit contributions
aleb+0.104
 butcher+0.101
 Junior+0.094
Which+0.093
 beef+0.093
Neg logit contributions
ands-0.077
 ago-0.068
 space-0.060
 pattern-0.060
 tunnel-0.059
Token bubbles
Token ID25037
Feature activation+1.42e-02

Pos logit contributions
 ago+0.034
ands+0.033
chie+0.032
 hall+0.031
 disl+0.031
Neg logit contributions
 beef-0.037
 steak-0.036
 turkey-0.035
aleb-0.035
 butter-0.033
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
ands+0.018
 ago+0.016
 disl+0.015
 common+0.015
 tunnel+0.015
Neg logit contributions
aleb-0.027
 beef-0.026
 butcher-0.026
 steak-0.026
 turkey-0.025
Token a
Token ID257
Feature activation-6.42e-03

Pos logit contributions
aleb+0.040
 butcher+0.038
Which+0.035
 Junior+0.035
rified+0.034
Neg logit contributions
ands-0.022
 ago-0.021
 space-0.019
 tunnel-0.019
 pattern-0.018
Token ship
Token ID4074
Feature activation+3.60e-02

Pos logit contributions
 butcher+0.012
aleb+0.012
 steak+0.011
 beef+0.011
 Junior+0.011
Neg logit contributions
ands-0.010
 ago-0.008
 disl-0.007
 pattern-0.007
ro-0.007
Token had
Token ID550
Feature activation+1.29e-02

Pos logit contributions
ands+0.068
 ago+0.055
 disl+0.050
 space+0.050
 common+0.050
Neg logit contributions
 butcher-0.054
aleb-0.053
 beef-0.051
 steak-0.050
 Junior-0.050
Token wrecked
Token ID44770
Feature activation+4.54e-02

Pos logit contributions
ands+0.022
 disl+0.018
 hall+0.017
 ago+0.017
 pattern+0.016
Neg logit contributions
 steak-0.020
 beef-0.020
 butcher-0.019
 Junior-0.019
aleb-0.019
Token long
Token ID890
Feature activation+2.24e-02

Pos logit contributions
ands+0.068
 ago+0.056
 disl+0.050
 pattern+0.050
 tunnel+0.048
Neg logit contributions
aleb-0.085
 butcher-0.084
 beef-0.082
 steak-0.081
 Junior-0.080
Token ago
Token ID2084
Feature activation+1.70e-01

Pos logit contributions
ands+0.027
chie+0.021
 disl+0.020
 hall+0.018
 Ag+0.018
Neg logit contributions
 steak-0.046
 beef-0.045
 butcher-0.044
aleb-0.043
 Junior-0.042
Token,
Token ID11
Feature activation+1.48e-02

Pos logit contributions
ands+0.203
 tunnel+0.141
 pattern+0.138
 disl+0.138
chie+0.137
Neg logit contributions
 butcher-0.366
aleb-0.364
 steak-0.356
 beef-0.355
 Junior-0.344
Token and
Token ID290
Feature activation-2.30e-02

Pos logit contributions
ands+0.022
 ago+0.017
 pattern+0.015
 tunnel+0.015
chie+0.015
Neg logit contributions
 butcher-0.029
aleb-0.029
 beef-0.028
 Junior-0.028
Which-0.027
Token now
Token ID783
Feature activation+4.11e-02

Pos logit contributions
aleb+0.045
 butcher+0.044
 Junior+0.041
Which+0.041
 beef+0.040
Neg logit contributions
ands-0.034
 ago-0.030
 space-0.027
 pattern-0.026
 tunnel-0.026
Token it
Token ID340
Feature activation-4.41e-03

Pos logit contributions
ands+0.054
 ago+0.050
 space+0.044
 pattern+0.043
 tunnel+0.043
Neg logit contributions
aleb-0.085
 butcher-0.082
Which-0.077
 Junior-0.076
 beef-0.075
Token was
Token ID373
Feature activation+3.98e-03

Pos logit contributions
aleb+0.007
 butcher+0.007
 Junior+0.007
Which+0.007
 beef+0.007
Neg logit contributions
ands-0.008
 ago-0.007
 space-0.006
 disl-0.006
 pattern-0.006
Token found
Token ID1043
Feature activation+3.92e-02

Pos logit contributions
ands+0.017
 ago+0.014
 tunnel+0.013
 pattern+0.013
 hall+0.012
Neg logit contributions
aleb-0.022
 butcher-0.022
 beef-0.021
 Junior-0.021
 steak-0.021
Token the
Token ID262
Feature activation-4.31e-03

Pos logit contributions
ands+0.045
 ago+0.041
 space+0.037
 pattern+0.034
 tunnel+0.034
Neg logit contributions
aleb-0.088
 butcher-0.084
Which-0.079
 Junior-0.078
avored-0.076
Token truck
Token ID7779
Feature activation-2.60e-02

Pos logit contributions
aleb+0.009
 butcher+0.009
Which+0.008
 Junior+0.008
rified+0.008
Neg logit contributions
ands-0.005
 ago-0.005
 space-0.004
 pattern-0.004
 tunnel-0.004
Token many
Token ID867
Feature activation+2.44e-02

Pos logit contributions
aleb+0.056
 butcher+0.055
 Junior+0.051
Which+0.051
 beef+0.051
Neg logit contributions
ands-0.033
 ago-0.029
 space-0.025
 pattern-0.025
 tunnel-0.024
Token years
Token ID812
Feature activation+1.25e-01

Pos logit contributions
ands+0.032
 ago+0.027
 pattern+0.024
 tunnel+0.023
 space+0.023
Neg logit contributions
aleb-0.052
 butcher-0.051
 beef-0.048
 Junior-0.047
Which-0.047
Token ago
Token ID2084
Feature activation+1.70e-01

Pos logit contributions
ands+0.125
 pattern+0.088
 tunnel+0.083
chie+0.082
 disl+0.082
Neg logit contributions
 butcher-0.298
aleb-0.295
 beef-0.289
 steak-0.285
 Junior-0.278
Token and
Token ID290
Feature activation-1.94e-02

Pos logit contributions
ands+0.192
 tunnel+0.131
 pattern+0.129
 space+0.125
 disl+0.121
Neg logit contributions
aleb-0.386
 butcher-0.383
 beef-0.368
 steak-0.367
 Junior-0.361
Token worked
Token ID3111
Feature activation+1.73e-02

Pos logit contributions
 butcher+0.036
aleb+0.036
 beef+0.034
 Junior+0.034
 steak+0.033
Neg logit contributions
ands-0.031
 ago-0.025
 tunnel-0.023
 pattern-0.023
chie-0.023
Token hard
Token ID1327
Feature activation-1.11e-02

Pos logit contributions
ands+0.027
 ago+0.024
 hall+0.022
 disl+0.021
 tunnel+0.021
Neg logit contributions
aleb-0.032
 butcher-0.030
 beef-0.030
 steak-0.030
 Junior-0.029
Token to
Token ID284
Feature activation+1.77e-02

Pos logit contributions
 beef+0.018
 butcher+0.018
 steak+0.018
aleb+0.017
 Junior+0.017
Neg logit contributions
ands-0.017
 tunnel-0.015
chie-0.015
 pattern-0.015
 disl-0.014
Token fix
Token ID4259
Feature activation+2.96e-02

Pos logit contributions
ands+0.028
 ago+0.025
 disl+0.020
 tunnel+0.020
 hall+0.020
Neg logit contributions
aleb-0.032
 butcher-0.031
 Junior-0.030
 beef-0.029
 steak-0.029
Token But
Token ID887
Feature activation+1.59e-02

Pos logit contributions
aleb+0.024
 butcher+0.023
 Junior+0.022
Which+0.022
 beef+0.021
Neg logit contributions
ands-0.019
 ago-0.016
 pattern-0.015
 tunnel-0.015
 space-0.015
Token it
Token ID340
Feature activation-6.64e-03

Pos logit contributions
ands+0.024
 ago+0.022
 space+0.019
 pattern+0.019
 tunnel+0.019
Neg logit contributions
aleb-0.030
 butcher-0.029
Which-0.027
 Junior-0.027
 beef-0.027
Token was
Token ID373
Feature activation-1.21e-04

Pos logit contributions
aleb+0.013
 butcher+0.012
Which+0.011
 Junior+0.011
 beef+0.011
Neg logit contributions
ands-0.010
 ago-0.009
 pattern-0.008
 space-0.008
 tunnel-0.008
Token so
Token ID523
Feature activation-4.76e-05

Pos logit contributions
 butcher+0.000
 beef+0.000
aleb+0.000
 steak+0.000
 Junior+0.000
Neg logit contributions
ands-0.000
 disl-0.000
 ago-0.000
 hall-0.000
 pattern-0.000
Token long
Token ID890
Feature activation+1.93e-02

Pos logit contributions
aleb+0.000
 beef+0.000
 butcher+0.000
 steak+0.000
 Junior+0.000
Neg logit contributions
ands-0.000
 speaker-0.000
 figures-0.000
ro-0.000
 outage-0.000
Token ago
Token ID2084
Feature activation+1.70e-01

Pos logit contributions
ands+0.030
chie+0.023
 common+0.023
 tunnel+0.023
 pattern+0.023
Neg logit contributions
aleb-0.034
 butcher-0.034
 beef-0.033
 Junior-0.033
 steak-0.032
Token that
Token ID326
Feature activation-1.97e-02

Pos logit contributions
ands+0.215
 ago+0.155
 tunnel+0.149
 pattern+0.147
 space+0.146
Neg logit contributions
aleb-0.366
 butcher-0.363
 beef-0.346
 steak-0.345
 Junior-0.344
Token everything
Token ID2279
Feature activation-9.53e-03

Pos logit contributions
aleb+0.039
 butcher+0.038
 Junior+0.035
Which+0.035
 beef+0.035
Neg logit contributions
ands-0.029
 ago-0.025
 pattern-0.022
 space-0.022
 tunnel-0.022
Token was
Token ID373
Feature activation+3.07e-03

Pos logit contributions
aleb+0.016
 butcher+0.015
 Junior+0.014
 beef+0.014
Which+0.014
Neg logit contributions
ands-0.017
 ago-0.015
 space-0.013
 pattern-0.013
 tunnel-0.013
Token a
Token ID257
Feature activation-8.83e-03

Pos logit contributions
ands+0.005
 ago+0.004
 disl+0.004
ro+0.004
 hall+0.004
Neg logit contributions
 butcher-0.005
 beef-0.005
aleb-0.005
 Junior-0.005
 steak-0.005
Token bit
Token ID1643
Feature activation+2.61e-03

Pos logit contributions
 butcher+0.018
aleb+0.018
 beef+0.017
 steak+0.017
Which+0.016
Neg logit contributions
ands-0.013
 ago-0.011
 disl-0.009
 space-0.009
 pattern-0.009
Token it
Token ID340
Feature activation+3.21e-03

Pos logit contributions
aleb+0.031
 butcher+0.030
Which+0.028
 beef+0.028
 Junior+0.028
Neg logit contributions
ands-0.027
 ago-0.025
 tunnel-0.022
 pattern-0.022
 space-0.021
Token away
Token ID1497
Feature activation+5.90e-02

Pos logit contributions
ands+0.005
 ago+0.005
 disl+0.005
 pattern+0.004
 tunnel+0.004
Neg logit contributions
aleb-0.005
 butcher-0.005
 beef-0.005
 steak-0.005
rified-0.005
Token a
Token ID257
Feature activation-1.13e-03

Pos logit contributions
ands+0.072
 ago+0.058
 pattern+0.053
 disl+0.052
 tunnel+0.051
Neg logit contributions
aleb-0.126
 butcher-0.125
 beef-0.119
 Junior-0.118
 steak-0.117
Token long
Token ID890
Feature activation+2.79e-02

Pos logit contributions
 butcher+0.002
aleb+0.002
 beef+0.002
Which+0.002
 steak+0.002
Neg logit contributions
ands-0.002
 ago-0.002
 disl-0.001
 space-0.001
 pattern-0.001
Token time
Token ID640
Feature activation+2.60e-02

Pos logit contributions
ands+0.042
 ago+0.033
 disl+0.032
chie+0.031
 space+0.031
Neg logit contributions
aleb-0.051
 butcher-0.051
 beef-0.049
 steak-0.048
 Junior-0.047
Token ago
Token ID2084
Feature activation+1.70e-01

Pos logit contributions
ands+0.030
 tunnel+0.024
chie+0.023
 pattern+0.022
 disl+0.022
Neg logit contributions
 butcher-0.051
aleb-0.051
 beef-0.049
 steak-0.049
 Junior-0.048
Token.
Token ID13
Feature activation+2.61e-02

Pos logit contributions
ands+0.197
 tunnel+0.148
 pattern+0.143
 disl+0.142
chie+0.138
Neg logit contributions
 butcher-0.359
aleb-0.359
 beef-0.346
 steak-0.346
 Junior-0.332
Token It
Token ID632
Feature activation+2.60e-02

Pos logit contributions
ands+0.043
 ago+0.038
 space+0.034
 pattern+0.034
 tunnel+0.034
Neg logit contributions
aleb-0.047
 butcher-0.045
 Junior-0.042
Which-0.042
 beef-0.041
Token is
Token ID318
Feature activation+2.45e-02

Pos logit contributions
ands+0.050
 ago+0.046
 space+0.039
 pattern+0.038
 tunnel+0.038
Neg logit contributions
aleb-0.039
 butcher-0.038
 beef-0.036
 Junior-0.035
Which-0.035
Token not
Token ID407
Feature activation+4.38e-02

Pos logit contributions
ands+0.032
 ago+0.026
 tunnel+0.023
 hall+0.023
 disl+0.022
Neg logit contributions
 butcher-0.051
 beef-0.050
aleb-0.050
 steak-0.048
 Junior-0.048
Token good
Token ID922
Feature activation-5.56e-02

Pos logit contributions
ands+0.068
 ago+0.059
 hall+0.054
 tunnel+0.052
 disl+0.051
Neg logit contributions
 beef-0.082
 steak-0.079
 butcher-0.078
aleb-0.077
 Junior-0.076
Token the
Token ID262
Feature activation+2.00e-04

Pos logit contributions
ands+0.037
 ago+0.031
 tunnel+0.029
 disl+0.028
 pattern+0.028
Neg logit contributions
 butcher-0.045
aleb-0.044
 beef-0.044
 Junior-0.042
 steak-0.042
Token park
Token ID3952
Feature activation+1.02e-01

Pos logit contributions
ands+0.000
 ago+0.000
 pattern+0.000
 disl+0.000
 space+0.000
Neg logit contributions
aleb-0.000
 butcher-0.000
 beef-0.000
Which-0.000
 steak-0.000
Token a
Token ID257
Feature activation-6.85e-03

Pos logit contributions
ands+0.177
 disl+0.136
 ago+0.136
chie+0.136
 common+0.128
Neg logit contributions
aleb-0.173
 butcher-0.171
 beef-0.162
 steak-0.160
 turkey-0.155
Token long
Token ID890
Feature activation+2.02e-02

Pos logit contributions
 butcher+0.014
aleb+0.014
 beef+0.013
 steak+0.013
Which+0.013
Neg logit contributions
ands-0.010
 ago-0.009
 disl-0.007
chie-0.007
 pattern-0.007
Token time
Token ID640
Feature activation+2.27e-02

Pos logit contributions
ands+0.032
chie+0.027
 disl+0.026
 ago+0.023
 outage+0.022
Neg logit contributions
 butcher-0.036
aleb-0.035
 steak-0.034
 beef-0.034
 turkey-0.033
Token ago
Token ID2084
Feature activation+1.69e-01

Pos logit contributions
ands+0.035
chie+0.029
 disl+0.028
 common+0.028
 tunnel+0.025
Neg logit contributions
aleb-0.035
 butcher-0.035
 steak-0.034
 beef-0.033
eled-0.033
Token and
Token ID290
Feature activation-1.38e-02

Pos logit contributions
ands+0.215
 disl+0.150
 pattern+0.148
 tunnel+0.148
chie+0.148
Neg logit contributions
aleb-0.358
 butcher-0.357
 beef-0.342
 steak-0.340
 Junior-0.334
Token gave
Token ID2921
Feature activation-3.49e-02

Pos logit contributions
 butcher+0.023
aleb+0.022
Which+0.022
 Junior+0.021
 steak+0.021
Neg logit contributions
ands-0.025
 ago-0.020
chie-0.019
 pattern-0.019
 tunnel-0.018
Token me
Token ID502
Feature activation+6.58e-02

Pos logit contributions
aleb+0.086
 butcher+0.084
Which+0.079
avored+0.077
 Junior+0.077
Neg logit contributions
ands-0.030
 space-0.026
 rain-0.025
 work-0.024
 ago-0.023
Token the
Token ID262
Feature activation+2.02e-03

Pos logit contributions
ands+0.087
 ago+0.076
 space+0.066
 pattern+0.066
 tunnel+0.065
Neg logit contributions
aleb-0.138
 butcher-0.135
 Junior-0.126
Which-0.126
 beef-0.125
Token key
Token ID1994
Feature activation-4.47e-03

Pos logit contributions
ands+0.003
 ago+0.003
 pattern+0.003
 disl+0.003
 tunnel+0.003
Neg logit contributions
aleb-0.004
 butcher-0.004
 beef-0.003
 steak-0.003
Which-0.003
Token had
Token ID550
Feature activation+1.47e-02

Pos logit contributions
ands+0.054
 ago+0.046
 disl+0.043
 common+0.041
 recess+0.041
Neg logit contributions
aleb-0.022
Which-0.022
 butcher-0.022
 Junior-0.021
 beef-0.019
Token been
Token ID587
Feature activation+1.54e-02

Pos logit contributions
ands+0.026
 disl+0.023
 ago+0.022
 common+0.020
 outage+0.019
Neg logit contributions
Which-0.023
 beef-0.023
 butcher-0.023
 steak-0.022
aleb-0.022
Token made
Token ID925
Feature activation-1.32e-02

Pos logit contributions
 disl+0.029
ands+0.027
alk+0.023
 tunnel+0.022
ro+0.022
Neg logit contributions
 Junior-0.021
 beef-0.020
 butcher-0.020
aleb-0.020
eled-0.019
Token many
Token ID867
Feature activation+2.66e-02

Pos logit contributions
 beef+0.018
 butcher+0.017
aleb+0.017
 steak+0.017
 Junior+0.017
Neg logit contributions
 disl-0.023
ands-0.023
 ago-0.021
 tunnel-0.020
 common-0.019
Token years
Token ID812
Feature activation+1.28e-01

Pos logit contributions
ands+0.031
 ago+0.027
 pattern+0.022
 space+0.022
 tunnel+0.022
Neg logit contributions
aleb-0.061
 butcher-0.059
 Junior-0.056
Which-0.056
 beef-0.056
Token ago
Token ID2084
Feature activation+1.69e-01

Pos logit contributions
ands+0.148
 disl+0.146
 seeking+0.131
 common+0.123
 seek+0.123
Neg logit contributions
aleb-0.248
 butcher-0.241
 beef-0.235
 steak-0.228
 Junior-0.219
Token,
Token ID11
Feature activation+2.48e-02

Pos logit contributions
ands+0.209
chie+0.159
 tunnel+0.157
 disl+0.155
 common+0.153
Neg logit contributions
 butcher-0.344
aleb-0.342
 beef-0.339
 steak-0.337
 Junior-0.321
Token but
Token ID475
Feature activation-5.42e-02

Pos logit contributions
ands+0.045
 ago+0.035
chie+0.035
 tunnel+0.033
 common+0.033
Neg logit contributions
 beef-0.039
 butcher-0.039
 steak-0.039
aleb-0.038
 Junior-0.037
Token it
Token ID340
Feature activation-1.00e-03

Pos logit contributions
Which+0.076
aleb+0.073
 beef+0.072
 steak+0.070
 Junior+0.069
Neg logit contributions
ands-0.116
 ago-0.088
ro-0.085
 disl-0.083
chie-0.083
Token still
Token ID991
Feature activation+5.36e-02

Pos logit contributions
aleb+0.002
 butcher+0.001
 Junior+0.001
Which+0.001
 beef+0.001
Neg logit contributions
ands-0.002
 ago-0.002
 disl-0.002
 common-0.002
 space-0.002
Token looked
Token ID3114
Feature activation+3.37e-02

Pos logit contributions
ands+0.115
 ago+0.092
chie+0.088
 common+0.084
ooters+0.083
Neg logit contributions
aleb-0.066
 Junior-0.065
 butcher-0.063
 steak-0.062
 beef-0.061
Token try
Token ID1949
Feature activation-1.37e-02

Pos logit contributions
ands+0.026
 ago+0.024
 tunnel+0.021
 hall+0.021
 pattern+0.021
Neg logit contributions
aleb-0.034
 butcher-0.033
 beef-0.031
 Junior-0.031
 steak-0.031
Token new
Token ID649
Feature activation+5.03e-04

Pos logit contributions
aleb+0.030
 butcher+0.028
rified+0.028
 Junior+0.027
eled+0.026
Neg logit contributions
ands-0.017
 ago-0.015
 space-0.014
 pattern-0.013
 rain-0.013
Token things
Token ID1243
Feature activation+6.02e-03

Pos logit contributions
ands+0.001
 ago+0.001
 space+0.001
 work+0.000
 common+0.000
Neg logit contributions
aleb-0.001
 butcher-0.001
rified-0.001
 Junior-0.001
Which-0.001
Token.
Token ID13
Feature activation+2.61e-02

Pos logit contributions
ands+0.008
 ago+0.007
 tunnel+0.006
 pattern+0.006
 hall+0.006
Neg logit contributions
aleb-0.012
 butcher-0.012
 beef-0.012
 Junior-0.011
 steak-0.011
Token Bro
Token ID2806
Feature activation-9.58e-02

Pos logit contributions
ands+0.041
 ago+0.038
 space+0.033
 pattern+0.032
 tunnel+0.032
Neg logit contributions
aleb-0.048
 butcher-0.046
 Junior-0.043
Which-0.042
 beef-0.042
Tokenccoli
Token ID34544
Feature activation-2.05e-01

Pos logit contributions
aleb+0.084
Which+0.076
 butcher+0.075
eled+0.067
rified+0.065
Neg logit contributions
ands-0.231
 ago-0.229
 disl-0.209
 pattern-0.206
ro-0.206
Token is
Token ID318
Feature activation+2.22e-02

Pos logit contributions
aleb+0.456
 butcher+0.438
Which+0.417
 Junior+0.414
eled+0.402
Neg logit contributions
ands-0.250
 ago-0.212
 space-0.194
 pattern-0.190
 tunnel-0.182
Token not
Token ID407
Feature activation+4.08e-02

Pos logit contributions
ands+0.022
 ago+0.019
 space+0.017
 pattern+0.016
 common+0.015
Neg logit contributions
aleb-0.054
 butcher-0.051
rified-0.050
 Junior-0.049
Which-0.048
Token that
Token ID326
Feature activation-1.50e-02

Pos logit contributions
ands+0.053
 ago+0.047
 space+0.042
 pattern+0.040
 tunnel+0.039
Neg logit contributions
aleb-0.087
 butcher-0.084
Which-0.079
 Junior-0.078
rified-0.077
Token bad
Token ID2089
Feature activation-3.21e-02

Pos logit contributions
aleb+0.031
 butcher+0.029
eled+0.028
 Junior+0.028
rified+0.027
Neg logit contributions
ands-0.020
 ago-0.018
 space-0.017
 work-0.016
 rain-0.016
Token.
Token ID13
Feature activation+2.51e-02

Pos logit contributions
aleb+0.082
 butcher+0.080
 Junior+0.076
Which+0.076
 beef+0.075
Neg logit contributions
ands-0.028
 ago-0.023
 space-0.018
 pattern-0.018
 tunnel-0.017
TokenMom
Token ID29252
Feature activation-3.62e-02

Pos logit contributions
ands+0.004
 ago+0.003
 space+0.003
 pattern+0.003
 tunnel+0.003
Neg logit contributions
aleb-0.002
 butcher-0.002
 Junior-0.002
 beef-0.002
 steak-0.002
Token says
Token ID1139
Feature activation+5.76e-05

Pos logit contributions
aleb+0.070
 butcher+0.068
Which+0.063
 Junior+0.063
 beef+0.063
Neg logit contributions
ands-0.054
 ago-0.048
 space-0.043
 pattern-0.043
 tunnel-0.042
Token,
Token ID11
Feature activation+2.21e-02

Pos logit contributions
ands+0.000
 ago+0.000
 space+0.000
 pattern+0.000
 tunnel+0.000
Neg logit contributions
aleb-0.000
 butcher-0.000
 Junior-0.000
Which-0.000
 beef-0.000
Token "
Token ID366
Feature activation-4.84e-02

Pos logit contributions
ands+0.034
 ago+0.030
 space+0.027
 pattern+0.027
 tunnel+0.026
Neg logit contributions
aleb-0.042
 butcher-0.041
 Junior-0.038
Which-0.038
 beef-0.038
TokenBro
Token ID15783
Feature activation-2.38e-02

Pos logit contributions
aleb+0.061
 butcher+0.058
Which+0.052
 Junior+0.051
 beef+0.051
Neg logit contributions
ands-0.106
 ago-0.099
 space-0.091
 pattern-0.091
 tunnel-0.090
Tokenccoli
Token ID34544
Feature activation-2.05e-01

Pos logit contributions
aleb+0.015
 butcher+0.014
Which+0.011
 Junior+0.011
rified+0.010
Neg logit contributions
ands-0.065
 ago-0.063
 space-0.058
 pattern-0.058
 disl-0.057
Token is
Token ID318
Feature activation+2.13e-02

Pos logit contributions
aleb+0.482
 butcher+0.465
Which+0.452
eled+0.445
 Junior+0.443
Neg logit contributions
ands-0.200
 pattern-0.163
 ago-0.162
 grow-0.159
 space-0.158
Token good
Token ID922
Feature activation-5.65e-02

Pos logit contributions
ands+0.022
 ago+0.019
 space+0.016
 pattern+0.016
 tunnel+0.015
Neg logit contributions
aleb-0.051
 butcher-0.049
 Junior-0.046
Which-0.046
rified-0.046
Token for
Token ID329
Feature activation-3.46e-02

Pos logit contributions
aleb+0.124
 butcher+0.123
 beef+0.117
 steak+0.115
 Junior+0.114
Neg logit contributions
ands-0.069
 ago-0.058
 tunnel-0.051
 pattern-0.050
 space-0.049
Token you
Token ID345
Feature activation+6.29e-02

Pos logit contributions
aleb+0.075
 butcher+0.073
Which+0.070
 Junior+0.068
rified+0.068
Neg logit contributions
ands-0.044
 ago-0.038
 space-0.033
 pattern-0.032
 work-0.031
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
ands+0.073
 ago+0.053
 disl+0.050
 pattern+0.049
 tunnel+0.048
Neg logit contributions
 butcher-0.141
aleb-0.141
 beef-0.135
 steak-0.135
 Junior-0.132
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
aleb+0.277
 butcher+0.268
Which+0.263
 Junior+0.259
rified+0.253
Neg logit contributions
ands-0.053
 ago-0.043
 space-0.029
 pattern-0.024
 tunnel-0.022
Token\n
Token ID198
Feature activation+6.44e-03

Pos logit contributions
ands+0.012
 ago+0.011
 space+0.009
 pattern+0.009
 tunnel+0.009
Neg logit contributions
aleb-0.026
 butcher-0.025
 Junior-0.023
 beef-0.023
 steak-0.023
Token\n
Token ID198
Feature activation+6.20e-03

Pos logit contributions
ands+0.010
 ago+0.009
 pattern+0.008
 space+0.007
 tunnel+0.007
Neg logit contributions
aleb-0.012
 butcher-0.012
Which-0.011
 Junior-0.011
 beef-0.011
Token"
Token ID1
Feature activation+7.44e-03

Pos logit contributions
ands+0.010
 ago+0.009
 pattern+0.008
 space+0.008
 tunnel+0.008
Neg logit contributions
aleb-0.012
 butcher-0.011
Which-0.010
 Junior-0.010
 beef-0.010
TokenBro
Token ID15783
Feature activation-2.45e-02

Pos logit contributions
ands+0.015
 common+0.013
 ago+0.013
 space+0.012
 pattern+0.012
Neg logit contributions
aleb-0.011
 butcher-0.010
 Junior-0.010
rified-0.009
 steak-0.009
Tokenccoli
Token ID34544
Feature activation-2.05e-01

Pos logit contributions
aleb+0.015
 butcher+0.014
 Junior+0.011
 beef+0.010
Which+0.010
Neg logit contributions
ands-0.069
 ago-0.063
 space-0.062
 pattern-0.061
 tunnel-0.061
Token is
Token ID318
Feature activation+2.01e-02

Pos logit contributions
aleb+0.459
 butcher+0.443
Which+0.427
 Junior+0.416
rified+0.409
Neg logit contributions
ands-0.238
 ago-0.207
 pattern-0.189
 space-0.186
 tunnel-0.183
Token not
Token ID407
Feature activation+3.80e-02

Pos logit contributions
ands+0.018
 ago+0.015
 space+0.013
 pattern+0.013
 rain+0.012
Neg logit contributions
aleb-0.052
 butcher-0.049
rified-0.048
eled-0.047
 Junior-0.046
Token our
Token ID674
Feature activation-2.30e-02

Pos logit contributions
ands+0.046
 ago+0.041
 space+0.038
 pattern+0.036
 work+0.035
Neg logit contributions
aleb-0.084
 butcher-0.080
Which-0.076
rified-0.076
 Junior-0.075
Token friend
Token ID1545
Feature activation+6.93e-03

Pos logit contributions
aleb+0.043
 butcher+0.041
Which+0.041
rified+0.040
 Junior+0.040
Neg logit contributions
ands-0.034
 ago-0.031
 space-0.030
 work-0.029
 pattern-0.029
Token,
Token ID11
Feature activation+2.24e-02

Pos logit contributions
ands+0.006
 ago+0.006
 space+0.005
 pattern+0.004
 tunnel+0.004
Neg logit contributions
aleb-0.017
 butcher-0.017
Which-0.016
 Junior-0.016
 beef-0.015
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
ands+0.039
 ago+0.029
 pattern+0.025
 tunnel+0.025
 space+0.024
Neg logit contributions
aleb-0.107
 butcher-0.106
 beef-0.100
 Junior-0.100
 steak-0.099
Token\n
Token ID198
Feature activation+6.13e-03

Pos logit contributions
ands+0.020
 ago+0.018
 space+0.015
 pattern+0.015
 tunnel+0.015
Neg logit contributions
aleb-0.039
 butcher-0.037
 Junior-0.035
 beef-0.035
Which-0.035
Token\n
Token ID198
Feature activation+5.42e-03

Pos logit contributions
ands+0.008
 ago+0.007
 space+0.006
 pattern+0.006
 tunnel+0.006
Neg logit contributions
aleb-0.013
 butcher-0.012
 Junior-0.012
Which-0.012
 beef-0.012
Token"
Token ID1
Feature activation+9.04e-03

Pos logit contributions
ands+0.009
 ago+0.008
 pattern+0.007
 space+0.007
 tunnel+0.007
Neg logit contributions
aleb-0.010
 butcher-0.010
Which-0.009
 Junior-0.009
 beef-0.009
TokenBro
Token ID15783
Feature activation-2.48e-02

Pos logit contributions
ands+0.017
 ago+0.015
 common+0.014
 space+0.014
 rain+0.014
Neg logit contributions
aleb-0.013
 butcher-0.013
 Junior-0.012
rified-0.012
urger-0.012
Tokenccoli
Token ID34544
Feature activation-2.02e-01

Pos logit contributions
aleb+0.031
 butcher+0.031
 Junior+0.027
 beef+0.027
Which+0.026
Neg logit contributions
ands-0.054
 ago-0.048
 pattern-0.046
 space-0.046
 tunnel-0.046
Token is
Token ID318
Feature activation+2.51e-02

Pos logit contributions
aleb+0.459
 butcher+0.440
Which+0.431
 Junior+0.414
eled+0.410
Neg logit contributions
ands-0.218
 ago-0.193
 pattern-0.175
 space-0.173
 work-0.173
Token what
Token ID644
Feature activation-9.44e-02

Pos logit contributions
ands+0.023
 ago+0.020
 space+0.017
 pattern+0.016
 tunnel+0.015
Neg logit contributions
aleb-0.063
 butcher-0.060
rified-0.058
 Junior-0.057
Which-0.057
Token we
Token ID356
Feature activation+1.78e-02

Pos logit contributions
aleb+0.230
 butcher+0.221
rified+0.209
 Junior+0.208
eled+0.207
Neg logit contributions
ands-0.091
 ago-0.085
 space-0.072
 pattern-0.066
 rain-0.066
Token have
Token ID423
Feature activation+3.52e-02

Pos logit contributions
ands+0.022
 ago+0.019
 space+0.016
 pattern+0.016
 tunnel+0.016
Neg logit contributions
aleb-0.039
 butcher-0.038
 Junior-0.036
Which-0.036
 beef-0.036
Token for
Token ID329
Feature activation-3.15e-02

Pos logit contributions
ands+0.048
 ago+0.040
 tunnel+0.038
 pattern+0.038
 hall+0.036
Neg logit contributions
aleb-0.072
 butcher-0.070
 beef-0.066
 steak-0.066
 Junior-0.065
TokenOh
Token ID5812
Feature activation-6.70e-02

Pos logit contributions
aleb+0.069
 butcher+0.066
 Junior+0.059
 beef+0.058
Which+0.058
Neg logit contributions
ands-0.117
 ago-0.109
 pattern-0.100
 space-0.100
 tunnel-0.099
Token,
Token ID11
Feature activation+1.83e-02

Pos logit contributions
aleb+0.121
 Junior+0.117
 butcher+0.116
 steak+0.115
 beef+0.114
Neg logit contributions
ands-0.117
 ago-0.089
 pattern-0.079
ro-0.079
 tunnel-0.078
Token Mike
Token ID4995
Feature activation+3.89e-02

Pos logit contributions
ands+0.029
 ago+0.024
 pattern+0.022
 tunnel+0.021
 space+0.021
Neg logit contributions
aleb-0.035
 butcher-0.034
 Junior-0.032
Which-0.032
 beef-0.032
Token!
Token ID0
Feature activation-6.38e-03

Pos logit contributions
ands+0.046
 ago+0.036
 pattern+0.032
 space+0.031
 tunnel+0.031
Neg logit contributions
aleb-0.090
 butcher-0.087
 Junior-0.083
 beef-0.082
 steak-0.081
Token Bro
Token ID2806
Feature activation-9.70e-02

Pos logit contributions
aleb+0.012
 butcher+0.011
 Junior+0.011
 beef+0.011
Which+0.011
Neg logit contributions
ands-0.010
 ago-0.009
 space-0.008
 pattern-0.008
 tunnel-0.008
Tokenccoli
Token ID34544
Feature activation-2.00e-01

Pos logit contributions
aleb+0.100
 butcher+0.094
Which+0.086
 Junior+0.080
 beef+0.079
Neg logit contributions
ands-0.228
 ago-0.217
 pattern-0.200
 space-0.198
 tunnel-0.197
Token is
Token ID318
Feature activation+1.69e-02

Pos logit contributions
aleb+0.393
 butcher+0.384
 Junior+0.356
Which+0.356
 beef+0.353
Neg logit contributions
ands-0.291
 ago-0.258
 space-0.227
 pattern-0.226
 tunnel-0.224
Token a
Token ID257
Feature activation-3.79e-03

Pos logit contributions
ands+0.023
 ago+0.019
 tunnel+0.017
 pattern+0.016
 space+0.016
Neg logit contributions
aleb-0.035
 butcher-0.035
 beef-0.033
 Junior-0.033
 steak-0.033
Token vegetable
Token ID20236
Feature activation-1.41e-01

Pos logit contributions
aleb+0.008
 butcher+0.008
 Junior+0.008
Which+0.007
rified+0.007
Neg logit contributions
ands-0.005
 ago-0.004
 pattern-0.004
 space-0.004
 tunnel-0.004
Token!
Token ID0
Feature activation-6.14e-03

Pos logit contributions
aleb+0.315
 butcher+0.308
 Junior+0.289
Which+0.288
 beef+0.286
Neg logit contributions
ands-0.167
 ago-0.144
 space-0.122
 pattern-0.122
 tunnel-0.120
Token It
Token ID632
Feature activation+2.32e-02

Pos logit contributions
aleb+0.012
 butcher+0.011
Which+0.010
 beef+0.010
 Junior+0.010
Neg logit contributions
ands-0.009
 ago-0.009
 space-0.007
 pattern-0.007
 disl-0.007
Token ruined
Token ID20484
Feature activation+1.75e-02

Pos logit contributions
aleb+0.020
 butcher+0.019
 Junior+0.018
Which+0.018
 beef+0.017
Neg logit contributions
ands-0.017
 ago-0.015
 space-0.013
 pattern-0.013
 tunnel-0.013
Token the
Token ID262
Feature activation+7.60e-03

Pos logit contributions
ands+0.015
 ago+0.012
 space+0.011
 rain+0.010
 disl+0.010
Neg logit contributions
aleb-0.044
 butcher-0.043
Which-0.041
rified-0.040
eled-0.040
Token broccoli
Token ID44653
Feature activation-1.47e-01

Pos logit contributions
ands+0.013
 ago+0.013
 space+0.013
 pattern+0.012
 work+0.012
Neg logit contributions
aleb-0.012
 butcher-0.011
Which-0.011
 Junior-0.011
eled-0.011
Token!
Token ID0
Feature activation+3.35e-03

Pos logit contributions
aleb+0.329
 butcher+0.325
 beef+0.304
 Junior+0.302
 steak+0.301
Neg logit contributions
ands-0.173
 ago-0.146
 pattern-0.121
 space-0.121
 tunnel-0.119
Token Bro
Token ID2806
Feature activation-9.31e-02

Pos logit contributions
ands+0.006
 ago+0.005
 pattern+0.005
 space+0.005
 tunnel+0.005
Neg logit contributions
aleb-0.006
 butcher-0.006
 Junior-0.005
Which-0.005
 beef-0.005
Tokenccoli
Token ID34544
Feature activation-2.00e-01

Pos logit contributions
aleb+0.102
Which+0.101
 butcher+0.097
eled+0.087
rified+0.085
Neg logit contributions
 ago-0.201
ands-0.198
 pattern-0.184
 disl-0.181
 tunnel-0.178
Token is
Token ID318
Feature activation+2.51e-02

Pos logit contributions
aleb+0.436
 butcher+0.424
Which+0.405
 Junior+0.397
 beef+0.387
Neg logit contributions
ands-0.242
 ago-0.213
 pattern-0.193
 tunnel-0.189
 space-0.189
Token good
Token ID922
Feature activation-5.69e-02

Pos logit contributions
ands+0.026
 ago+0.022
 space+0.018
 pattern+0.018
 tunnel+0.018
Neg logit contributions
aleb-0.060
 butcher-0.059
 Junior-0.055
Which-0.055
 beef-0.055
Token for
Token ID329
Feature activation-3.24e-02

Pos logit contributions
aleb+0.117
 butcher+0.116
 beef+0.109
 steak+0.108
 Junior+0.107
Neg logit contributions
ands-0.077
 ago-0.063
 pattern-0.057
 tunnel-0.057
 space-0.056
Token you
Token ID345
Feature activation+6.31e-02

Pos logit contributions
aleb+0.072
 butcher+0.070
Which+0.066
 Junior+0.066
 beef+0.065
Neg logit contributions
ands-0.039
 ago-0.034
 space-0.029
 pattern-0.029
 tunnel-0.029
Token.
Token ID13
Feature activation+2.68e-02

Pos logit contributions
ands+0.089
 ago+0.070
 disl+0.065
 pattern+0.064
 space+0.063
Neg logit contributions
aleb-0.126
 butcher-0.126
 steak-0.120
 beef-0.120
 Junior-0.117
Token to
Token ID284
Feature activation+2.20e-02

Pos logit contributions
aleb+0.044
 butcher+0.042
Which+0.039
 Junior+0.039
rified+0.039
Neg logit contributions
ands-0.031
 ago-0.029
 space-0.026
 pattern-0.025
 rain-0.025
Token eat
Token ID4483
Feature activation-7.25e-02

Pos logit contributions
ands+0.042
 ago+0.038
 tunnel+0.036
 outage+0.034
 hall+0.034
Neg logit contributions
aleb-0.033
 butcher-0.032
 beef-0.032
 steak-0.031
 Junior-0.030
Token broccoli
Token ID44653
Feature activation-1.49e-01

Pos logit contributions
aleb+0.152
 butcher+0.143
Which+0.135
 Junior+0.134
rified+0.134
Neg logit contributions
ands-0.098
 ago-0.088
 space-0.081
 rain-0.079
 pattern-0.077
Token.
Token ID13
Feature activation+9.83e-03

Pos logit contributions
aleb+0.397
 butcher+0.388
 Junior+0.369
Which+0.368
 beef+0.364
Neg logit contributions
ands-0.115
 ago-0.091
 space-0.068
 pattern-0.067
 tunnel-0.065
Token Bro
Token ID2806
Feature activation-9.87e-02

Pos logit contributions
ands+0.011
 ago+0.010
 space+0.008
 pattern+0.008
 tunnel+0.008
Neg logit contributions
aleb-0.022
 butcher-0.022
 Junior-0.020
Which-0.020
 beef-0.020
Tokenccoli
Token ID34544
Feature activation-2.00e-01

Pos logit contributions
aleb+0.105
 butcher+0.103
Which+0.099
rified+0.088
 Junior+0.087
Neg logit contributions
ands-0.222
 ago-0.210
 pattern-0.196
 disl-0.196
 grow-0.193
Token was
Token ID373
Feature activation+9.58e-03

Pos logit contributions
aleb+0.446
 butcher+0.430
Which+0.413
 Junior+0.403
Stan+0.388
Neg logit contributions
ands-0.227
 ago-0.190
 pattern-0.188
 grow-0.185
 space-0.182
Token green
Token ID4077
Feature activation+7.15e-04

Pos logit contributions
ands+0.011
 ago+0.010
 pattern+0.009
 space+0.009
 tunnel+0.008
Neg logit contributions
aleb-0.022
 butcher-0.021
 Junior-0.020
Which-0.020
rified-0.019
Token and
Token ID290
Feature activation-1.26e-02

Pos logit contributions
ands+0.001
 ago+0.001
 space+0.001
 pattern+0.001
 tunnel+0.001
Neg logit contributions
aleb-0.002
 butcher-0.002
Which-0.002
 Junior-0.002
 beef-0.002
Token bitter
Token ID12922
Feature activation-4.51e-02

Pos logit contributions
aleb+0.025
 butcher+0.024
 Junior+0.022
Which+0.022
 beef+0.022
Neg logit contributions
ands-0.018
 ago-0.016
 space-0.014
 pattern-0.014
 tunnel-0.014
Token and
Token ID290
Feature activation-1.23e-02

Pos logit contributions
aleb+0.127
 butcher+0.124
Which+0.119
 Junior+0.119
 beef+0.116
Neg logit contributions
ands-0.028
 ago-0.021
 space-0.014
 pattern-0.014
 tunnel-0.013
Token and
Token ID290
Feature activation-1.16e-02

Pos logit contributions
ands+0.058
 ago+0.050
 pattern+0.044
 tunnel+0.043
 space+0.043
Neg logit contributions
aleb-0.101
 butcher-0.099
 Junior-0.094
 beef-0.093
Which-0.092
Token stay
Token ID2652
Feature activation-7.49e-03

Pos logit contributions
 butcher+0.021
aleb+0.021
 beef+0.020
 steak+0.020
Which+0.020
Neg logit contributions
ands-0.018
 ago-0.015
 tunnel-0.014
 pattern-0.014
 hall-0.014
Token well
Token ID880
Feature activation+3.44e-02

Pos logit contributions
aleb+0.014
 butcher+0.014
 beef+0.013
 Junior+0.013
 steak+0.013
Neg logit contributions
ands-0.011
 ago-0.010
 pattern-0.009
 tunnel-0.009
 hall-0.009
Token.
Token ID13
Feature activation+1.96e-02

Pos logit contributions
ands+0.041
 pattern+0.032
 ago+0.032
 tunnel+0.031
 hall+0.030
Neg logit contributions
 butcher-0.073
 beef-0.072
aleb-0.071
 steak-0.071
 Junior-0.069
Token Bro
Token ID2806
Feature activation-9.14e-02

Pos logit contributions
ands+0.030
 ago+0.028
 space+0.024
 pattern+0.024
 tunnel+0.023
Neg logit contributions
aleb-0.037
 butcher-0.036
 Junior-0.033
Which-0.032
 beef-0.032
Tokenccoli
Token ID34544
Feature activation-1.99e-01

Pos logit contributions
aleb+0.095
 butcher+0.092
Which+0.087
 Junior+0.079
 beef+0.078
Neg logit contributions
ands-0.210
 ago-0.204
 disl-0.185
 pattern-0.184
ro-0.182
Token is
Token ID318
Feature activation+2.11e-02

Pos logit contributions
aleb+0.421
 butcher+0.408
Which+0.384
 Junior+0.381
 beef+0.369
Neg logit contributions
ands-0.263
 ago-0.227
 space-0.204
 pattern-0.204
 tunnel-0.199
Token your
Token ID534
Feature activation+2.18e-02

Pos logit contributions
ands+0.023
 ago+0.020
 space+0.017
 pattern+0.016
 tunnel+0.016
Neg logit contributions
aleb-0.050
 butcher-0.048
 Junior-0.045
Which-0.045
 beef-0.045
Token friend
Token ID1545
Feature activation+7.99e-03

Pos logit contributions
ands+0.035
 ago+0.032
 space+0.029
 pattern+0.028
 tunnel+0.028
Neg logit contributions
aleb-0.040
 butcher-0.038
Which-0.036
 Junior-0.036
rified-0.035
Token,
Token ID11
Feature activation+2.27e-02

Pos logit contributions
ands+0.008
 ago+0.007
 pattern+0.005
 space+0.005
 tunnel+0.005
Neg logit contributions
aleb-0.019
 butcher-0.019
 Junior-0.018
Which-0.018
 beef-0.018
Token not
Token ID407
Feature activation+4.01e-02

Pos logit contributions
ands+0.030
 ago+0.027
 space+0.024
 pattern+0.024
 tunnel+0.023
Neg logit contributions
aleb-0.047
 butcher-0.045
Which-0.042
 Junior-0.042
rified-0.042
Token happy
Token ID3772
Feature activation+1.86e-02

Pos logit contributions
ands+0.034
 ago+0.028
 tunnel+0.027
 pattern+0.025
 space+0.025
Neg logit contributions
 butcher-0.038
aleb-0.037
 beef-0.035
 Junior-0.034
 steak-0.034
Token.
Token ID13
Feature activation-3.78e-03

Pos logit contributions
ands+0.027
chie+0.021
 ago+0.020
 pattern+0.020
 disl+0.020
Neg logit contributions
aleb-0.036
 butcher-0.036
 steak-0.035
 beef-0.034
 Junior-0.033
Token Everyone
Token ID11075
Feature activation+5.52e-02

Pos logit contributions
aleb+0.007
 butcher+0.007
 Junior+0.007
 beef+0.006
Which+0.006
Neg logit contributions
ands-0.006
 ago-0.005
 pattern-0.005
 tunnel-0.005
 common-0.005
Token had
Token ID550
Feature activation+3.94e-03

Pos logit contributions
ands+0.080
 ago+0.071
 space+0.063
 pattern+0.063
 tunnel+0.062
Neg logit contributions
aleb-0.108
 butcher-0.106
 Junior-0.098
Which-0.098
 beef-0.097
Token enough
Token ID1576
Feature activation-9.20e-02

Pos logit contributions
ands+0.005
 ago+0.004
 space+0.004
 pattern+0.004
 tunnel+0.003
Neg logit contributions
aleb-0.009
 butcher-0.009
Which-0.008
 Junior-0.008
 beef-0.008
Token beef
Token ID12023
Feature activation-1.99e-01

Pos logit contributions
aleb+0.190
rified+0.183
Which+0.182
 butcher+0.176
eled+0.169
Neg logit contributions
ands-0.118
 ago-0.111
 space-0.107
 work-0.105
 old-0.095
Token to
Token ID284
Feature activation+8.30e-03

Pos logit contributions
aleb+0.446
 butcher+0.433
Which+0.418
 Junior+0.407
 beef+0.399
Neg logit contributions
ands-0.229
 ago-0.206
 space-0.175
 tunnel-0.168
 pattern-0.167
Token eat
Token ID4483
Feature activation-8.21e-02

Pos logit contributions
ands+0.013
 ago+0.012
 tunnel+0.010
 pattern+0.010
 space+0.010
Neg logit contributions
aleb-0.015
 butcher-0.015
 Junior-0.014
 beef-0.014
 steak-0.014
Token.
Token ID13
Feature activation-4.22e-03

Pos logit contributions
aleb+0.208
 butcher+0.198
Which+0.193
rified+0.190
eled+0.186
Neg logit contributions
ands-0.071
 ago-0.060
 space-0.055
 work-0.048
 rain-0.048
Token
Token ID220
Feature activation+1.44e-02

Pos logit contributions
aleb+0.008
 butcher+0.008
 Junior+0.007
Which+0.007
 beef+0.007
Neg logit contributions
ands-0.007
 ago-0.006
 pattern-0.005
 tunnel-0.005
 common-0.005
Token
Token ID198
Feature activation-9.31e-03

Pos logit contributions
ands+0.017
 ago+0.015
 space+0.012
 pattern+0.012
 tunnel+0.012
Neg logit contributions
aleb-0.033
 butcher-0.032
Which-0.030
 Junior-0.029
 beef-0.029
Token.
Token ID13
Feature activation-2.09e-02

Pos logit contributions
aleb+0.264
 butcher+0.258
Which+0.243
 Junior+0.242
 beef+0.239
Neg logit contributions
ands-0.125
 ago-0.108
 space-0.089
 pattern-0.089
 tunnel-0.087
Token
Token ID198
Feature activation-1.69e-02

Pos logit contributions
aleb+0.040
 butcher+0.040
 Junior+0.037
Which+0.037
 beef+0.037
Neg logit contributions
ands-0.032
 ago-0.027
 pattern-0.024
 tunnel-0.024
 space-0.024
Token
Token ID198
Feature activation-1.43e-02

Pos logit contributions
aleb+0.029
 butcher+0.029
Which+0.027
 Junior+0.027
 beef+0.027
Neg logit contributions
ands-0.028
 ago-0.025
 pattern-0.023
 space-0.022
 tunnel-0.022
Token"
Token ID1
Feature activation-7.58e-03

Pos logit contributions
aleb+0.023
 butcher+0.023
Which+0.021
 Junior+0.020
 beef+0.020
Neg logit contributions
ands-0.026
 ago-0.024
 pattern-0.022
 space-0.021
 tunnel-0.021
TokenBro
Token ID15783
Feature activation-3.24e-02

Pos logit contributions
aleb+0.011
 butcher+0.010
 Junior+0.010
rified+0.009
 steak+0.009
Neg logit contributions
ands-0.014
 ago-0.013
 space-0.012
 rain-0.012
ro-0.012
Tokenccoli
Token ID34544
Feature activation-1.98e-01

Pos logit contributions
aleb+0.047
 butcher+0.046
 Junior+0.042
 beef+0.041
Which+0.041
Neg logit contributions
ands-0.064
 ago-0.057
 space-0.053
 pattern-0.053
 tunnel-0.053
Token is
Token ID318
Feature activation+2.15e-02

Pos logit contributions
aleb+0.452
 butcher+0.434
Which+0.418
 Junior+0.406
eled+0.399
Neg logit contributions
ands-0.213
 ago-0.189
 pattern-0.176
 space-0.172
 work-0.170
Token y
Token ID331
Feature activation-1.17e-01

Pos logit contributions
ands+0.026
 ago+0.024
 space+0.022
 pattern+0.021
 rain+0.021
Neg logit contributions
aleb-0.048
 butcher-0.045
rified-0.044
Which-0.042
eled-0.042
Tokenucky
Token ID5309
Feature activation-5.74e-02

Pos logit contributions
Which+0.194
 butcher+0.188
aleb+0.185
 Junior+0.175
eled+0.163
Neg logit contributions
ands-0.209
 ago-0.188
 tunnel-0.168
 pattern-0.168
 space-0.165
Token!"
Token ID2474
Feature activation-2.02e-02

Pos logit contributions
aleb+0.144
 butcher+0.138
Which+0.136
 Junior+0.132
rified+0.129
Neg logit contributions
ands-0.052
 ago-0.044
 space-0.036
 pattern-0.036
 tunnel-0.035
Token Ben
Token ID3932
Feature activation-2.88e-02

Pos logit contributions
 butcher+0.032
aleb+0.032
 Junior+0.031
 beef+0.030
 steak+0.029
Neg logit contributions
ands-0.038
 ago-0.032
 pattern-0.031
 tunnel-0.031
 space-0.029
Token mom
Token ID1995
Feature activation-3.27e-02

Pos logit contributions
aleb+0.029
 butcher+0.027
rified+0.027
Which+0.026
 Junior+0.026
Neg logit contributions
ands-0.025
 ago-0.023
 space-0.022
 tunnel-0.021
 pattern-0.021
Token.
Token ID13
Feature activation-3.11e-03

Pos logit contributions
aleb+0.085
 butcher+0.083
rified+0.080
Which+0.079
 Junior+0.078
Neg logit contributions
ands-0.024
 ago-0.022
 space-0.017
 pattern-0.015
 hall-0.015
Token He
Token ID679
Feature activation-3.18e-02

Pos logit contributions
aleb+0.007
 butcher+0.007
 beef+0.007
avored+0.006
 Junior+0.006
Neg logit contributions
ands-0.003
 ago-0.003
 space-0.003
 pattern-0.002
 hall-0.002
Token saw
Token ID2497
Feature activation+1.34e-02

Pos logit contributions
aleb+0.063
 butcher+0.061
Which+0.058
 beef+0.057
 Junior+0.057
Neg logit contributions
ands-0.043
 ago-0.040
 space-0.035
 pattern-0.035
 tunnel-0.034
Token some
Token ID617
Feature activation-3.36e-02

Pos logit contributions
ands+0.009
 space+0.009
 old+0.008
 ago+0.008
 school+0.008
Neg logit contributions
aleb-0.035
 butcher-0.032
rified-0.031
 Junior-0.031
Stan-0.031
Token beef
Token ID12023
Feature activation-1.98e-01

Pos logit contributions
aleb+0.074
 butcher+0.071
 Junior+0.068
Which+0.067
rified+0.066
Neg logit contributions
ands-0.041
 ago-0.035
 space-0.032
 pattern-0.031
 tunnel-0.031
Token there
Token ID612
Feature activation+2.53e-03

Pos logit contributions
aleb+0.499
Which+0.476
rified+0.463
 butcher+0.459
eled+0.445
Neg logit contributions
 ago-0.160
ands-0.148
 work-0.133
 space-0.132
 grow-0.126
Token.
Token ID13
Feature activation-4.28e-03

Pos logit contributions
ands+0.003
 ago+0.002
 pattern+0.002
 tunnel+0.002
 disl+0.002
Neg logit contributions
aleb-0.006
 butcher-0.006
 beef-0.006
 Junior-0.006
Which-0.006
Token He
Token ID679
Feature activation-3.36e-02

Pos logit contributions
aleb+0.010
 butcher+0.010
avored+0.010
 beef+0.009
eled+0.009
Neg logit contributions
 ago-0.003
ands-0.003
 space-0.003
ro-0.003
 pattern-0.003
Token wanted
Token ID2227
Feature activation-3.30e-02

Pos logit contributions
aleb+0.077
 butcher+0.074
rified+0.073
avored+0.072
Which+0.071
Neg logit contributions
ands-0.031
 ago-0.029
 grow-0.028
 pattern-0.027
 grown-0.027
Token to
Token ID284
Feature activation+8.91e-03

Pos logit contributions
aleb+0.088
rified+0.088
eled+0.082
avored+0.081
umm+0.079
Neg logit contributions
 space-0.020
 work-0.018
 old-0.018
.-0.018
 grown-0.017
Token
Token ID220
Feature activation+1.47e-02

Pos logit contributions
aleb+0.017
 butcher+0.016
Which+0.015
 beef+0.015
avored+0.015
Neg logit contributions
 ago-0.008
ands-0.008
 space-0.007
 pattern-0.006
 rain-0.006
Token
Token ID198
Feature activation-1.16e-02

Pos logit contributions
ands+0.013
 ago+0.012
 space+0.010
 pattern+0.009
 disl+0.008
Neg logit contributions
aleb-0.036
 butcher-0.035
Which-0.033
rified-0.033
 steak-0.033
Token
Token ID198
Feature activation-1.18e-02

Pos logit contributions
aleb+0.027
 butcher+0.026
rified+0.025
 beef+0.025
Which+0.025
Neg logit contributions
ands-0.012
 ago-0.010
 space-0.009
 pattern-0.008
 tunnel-0.008
Token"
Token ID1
Feature activation-1.74e-03

Pos logit contributions
aleb+0.019
rified+0.019
 butcher+0.019
ishes+0.019
avored+0.018
Neg logit contributions
ands-0.018
 tunnel-0.016
 rain-0.016
 grown-0.016
 pattern-0.016
TokenMore
Token ID5167
Feature activation-7.09e-02

Pos logit contributions
ishes+0.003
aleb+0.003
ish+0.003
 butcher+0.003
icking+0.003
Neg logit contributions
ands-0.002
 rain-0.002
 light-0.002
 grown-0.002
 fun-0.002
Token beef
Token ID12023
Feature activation-1.98e-01

Pos logit contributions
aleb+0.131
rified+0.122
 butcher+0.121
Which+0.118
eled+0.112
Neg logit contributions
ands-0.107
 ago-0.103
 work-0.096
 space-0.094
 rain-0.088
Token please
Token ID3387
Feature activation-8.90e-03

Pos logit contributions
aleb+0.484
 butcher+0.464
Which+0.455
rified+0.442
 Junior+0.439
Neg logit contributions
ands-0.188
 ago-0.165
 work-0.135
 space-0.135
 pattern-0.124
Token,"
Token ID553
Feature activation-2.38e-02

Pos logit contributions
 butcher+0.018
aleb+0.018
 beef+0.017
 steak+0.017
 Junior+0.017
Neg logit contributions
ands-0.012
 ago-0.010
 tunnel-0.009
 pattern-0.009
 disl-0.009
Token Little
Token ID7703
Feature activation-5.92e-02

Pos logit contributions
aleb+0.051
 butcher+0.050
Which+0.047
 beef+0.046
avored+0.046
Neg logit contributions
ands-0.027
 ago-0.024
 space-0.023
 rain-0.022
 pattern-0.022
Token Jack
Token ID3619
Feature activation-3.37e-02

Pos logit contributions
aleb+0.111
 butcher+0.109
Which+0.101
 Junior+0.100
 beef+0.100
Neg logit contributions
ands-0.088
 ago-0.081
 pattern-0.072
 space-0.071
 tunnel-0.071
Token asked
Token ID1965
Feature activation+4.16e-04

Pos logit contributions
aleb+0.071
 butcher+0.070
Which+0.065
 Junior+0.065
 beef+0.064
Neg logit contributions
ands-0.044
 ago-0.039
 pattern-0.034
 space-0.034
 tunnel-0.033
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
ands+0.059
 ago+0.050
 space+0.041
 pattern+0.041
 tunnel+0.040
Neg logit contributions
aleb-0.133
 butcher-0.130
 Junior-0.123
Which-0.122
 beef-0.122
Token Betsy
Token ID45775
Feature activation-2.34e-02

Pos logit contributions
ands+0.022
 ago+0.020
 space+0.019
 pattern+0.019
 tunnel+0.018
Neg logit contributions
aleb-0.024
 butcher-0.024
Which-0.022
 Junior-0.021
 beef-0.021
Token!
Token ID0
Feature activation-4.73e-03

Pos logit contributions
aleb+0.055
 butcher+0.055
 beef+0.052
 Junior+0.052
 steak+0.051
Neg logit contributions
ands-0.025
 ago-0.021
 tunnel-0.016
 pattern-0.016
 common-0.015
Token I
Token ID314
Feature activation-6.65e-03

Pos logit contributions
aleb+0.010
 butcher+0.010
 beef+0.009
 steak+0.009
Which+0.009
Neg logit contributions
ands-0.006
 ago-0.006
 space-0.005
 pattern-0.005
 disl-0.005
Token love
Token ID1842
Feature activation-7.87e-04

Pos logit contributions
aleb+0.012
 butcher+0.012
 Junior+0.011
 beef+0.011
Which+0.010
Neg logit contributions
ands-0.011
 ago-0.010
 tunnel-0.009
 hall-0.008
 space-0.008
Token beef
Token ID12023
Feature activation-1.97e-01

Pos logit contributions
aleb+0.001
rified+0.001
 butcher+0.001
eled+0.001
Which+0.001
Neg logit contributions
ands-0.001
 work-0.001
 space-0.001
 ago-0.001
 rain-0.001
Token."
Token ID526
Feature activation+2.64e-02

Pos logit contributions
aleb+0.506
 butcher+0.491
Which+0.472
 Junior+0.468
 beef+0.457
Neg logit contributions
ands-0.169
 ago-0.138
 space-0.112
 pattern-0.107
 tunnel-0.104
Token Betsy
Token ID45775
Feature activation-2.48e-02

Pos logit contributions
ands+0.041
 ago+0.036
 space+0.035
 rain+0.033
 pattern+0.033
Neg logit contributions
aleb-0.049
 butcher-0.047
Which-0.043
rified-0.042
 Junior-0.041
Token and
Token ID290
Feature activation-2.70e-02

Pos logit contributions
aleb+0.049
 butcher+0.048
Which+0.045
 Junior+0.044
 beef+0.044
Neg logit contributions
ands-0.035
 ago-0.031
 pattern-0.028
 space-0.027
 tunnel-0.027
Token Lily
Token ID20037
Feature activation+4.45e-02

Pos logit contributions
aleb+0.058
 butcher+0.056
Which+0.052
avored+0.051
 steak+0.050
Neg logit contributions
ands-0.030
 rain-0.030
 work-0.029
 space-0.028
 grow-0.028
Token became
Token ID2627
Feature activation+2.01e-02

Pos logit contributions
ands+0.053
 pattern+0.044
 work+0.044
 ago+0.044
 rain+0.043
Neg logit contributions
 butcher-0.095
aleb-0.092
Which-0.091
 beef-0.087
rified-0.086
Token.
Token ID13
Feature activation-7.55e-03

Pos logit contributions
ands+0.029
 ago+0.027
 space+0.027
 work+0.026
 common+0.025
Neg logit contributions
aleb-0.021
rified-0.019
 butcher-0.019
Which-0.018
 Junior-0.017
Token He
Token ID679
Feature activation-3.49e-02

Pos logit contributions
aleb+0.015
 butcher+0.014
Which+0.014
 beef+0.013
 steak+0.013
Neg logit contributions
ands-0.010
 ago-0.009
 space-0.008
 pattern-0.008
 rain-0.008
Token enjoyed
Token ID8359
Feature activation-3.63e-02

Pos logit contributions
aleb+0.077
Which+0.076
rified+0.076
 butcher+0.074
avored+0.073
Neg logit contributions
 grow-0.031
ands-0.030
 work-0.030
 be-0.029
 grown-0.027
Token the
Token ID262
Feature activation-7.81e-03

Pos logit contributions
aleb+0.079
 butcher+0.076
rified+0.072
Which+0.072
 Junior+0.070
Neg logit contributions
ands-0.044
 ago-0.040
 space-0.035
 work-0.033
 pattern-0.032
Token soft
Token ID2705
Feature activation-1.82e-02

Pos logit contributions
aleb+0.010
rified+0.009
Which+0.009
eled+0.008
 butcher+0.007
Neg logit contributions
 work-0.017
 old-0.016
 space-0.016
 light-0.016
 fun-0.015
Token beef
Token ID12023
Feature activation-1.97e-01

Pos logit contributions
aleb+0.021
Which+0.019
 butcher+0.019
rified+0.018
Stan+0.016
Neg logit contributions
ands-0.040
 work-0.038
 space-0.037
 ago-0.037
 rain-0.036
Token and
Token ID290
Feature activation-2.66e-02

Pos logit contributions
aleb+0.511
Which+0.499
 butcher+0.488
rified+0.472
eled+0.461
Neg logit contributions
ands-0.138
 ago-0.115
 work-0.108
 space-0.107
 grow-0.104
Token finished
Token ID5201
Feature activation+1.86e-02

Pos logit contributions
aleb+0.054
 butcher+0.051
rified+0.048
Which+0.048
avored+0.047
Neg logit contributions
ands-0.035
 ago-0.034
 space-0.031
 pattern-0.030
 work-0.030
Token all
Token ID477
Feature activation+3.04e-02

Pos logit contributions
ands+0.017
 work+0.016
 ago+0.015
 space+0.015
 grow+0.014
Neg logit contributions
aleb-0.044
rified-0.044
 butcher-0.042
Which-0.041
avored-0.039
Token the
Token ID262
Feature activation-8.29e-03

Pos logit contributions
ands+0.030
 ago+0.026
 space+0.025
 pattern+0.023
 work+0.023
Neg logit contributions
aleb-0.071
 butcher-0.070
Which-0.066
 Junior-0.065
rified-0.065
Token pieces
Token ID5207
Feature activation-1.39e-02

Pos logit contributions
rified+0.012
aleb+0.012
Which+0.011
eled+0.011
 butcher+0.010
Neg logit contributions
 work-0.016
ands-0.015
 old-0.015
 space-0.015
 fun-0.014
Token He
Token ID679
Feature activation-2.93e-02

Pos logit contributions
aleb+0.005
 butcher+0.005
rified+0.005
Which+0.005
avored+0.005
Neg logit contributions
 ago-0.003
ands-0.002
 space-0.002
 pattern-0.002
ro-0.002
Token noticed
Token ID6810
Feature activation-3.05e-02

Pos logit contributions
aleb+0.056
 butcher+0.055
 Junior+0.050
Which+0.050
 beef+0.050
Neg logit contributions
ands-0.044
 ago-0.039
 pattern-0.035
 space-0.035
 tunnel-0.035
Token a
Token ID257
Feature activation-7.86e-03

Pos logit contributions
aleb+0.070
 butcher+0.068
Which+0.062
 Junior+0.062
avored+0.062
Neg logit contributions
ands-0.031
 ago-0.028
 space-0.027
 rain-0.025
 tunnel-0.024
Token piece
Token ID3704
Feature activation-7.75e-02

Pos logit contributions
aleb+0.017
 butcher+0.016
 Junior+0.016
Which+0.016
rified+0.015
Neg logit contributions
ands-0.009
 ago-0.008
 pattern-0.008
 tunnel-0.008
 space-0.008
Token of
Token ID286
Feature activation-8.06e-03

Pos logit contributions
aleb+0.162
 butcher+0.159
 beef+0.149
Which+0.148
 Junior+0.147
Neg logit contributions
ands-0.103
 ago-0.087
 space-0.075
 disl-0.075
 pattern-0.074
Token beef
Token ID12023
Feature activation-1.97e-01

Pos logit contributions
aleb+0.014
 butcher+0.013
Which+0.012
 Junior+0.012
rified+0.012
Neg logit contributions
ands-0.014
 ago-0.013
 space-0.012
 pattern-0.012
 tunnel-0.012
Token on
Token ID319
Feature activation-4.01e-02

Pos logit contributions
aleb+0.429
Which+0.402
 butcher+0.399
 Junior+0.380
rified+0.378
Neg logit contributions
 ago-0.228
ands-0.221
 tunnel-0.196
 space-0.193
 pattern-0.188
Token the
Token ID262
Feature activation-1.60e-03

Pos logit contributions
aleb+0.094
 butcher+0.091
Which+0.086
rified+0.085
avored+0.083
Neg logit contributions
ands-0.041
 ago-0.036
 space-0.034
 pattern-0.030
 work-0.030
Token ground
Token ID2323
Feature activation+1.84e-02

Pos logit contributions
aleb+0.004
Which+0.003
eled+0.003
rified+0.003
 butcher+0.003
Neg logit contributions
ands-0.002
 ago-0.002
 space-0.002
 rain-0.002
 pattern-0.001
Token,
Token ID11
Feature activation+1.56e-02

Pos logit contributions
ands+0.026
 disl+0.019
chie+0.018
 common+0.018
 ago+0.017
Neg logit contributions
aleb-0.037
 butcher-0.037
 steak-0.035
 beef-0.035
rified-0.034
Token it
Token ID340
Feature activation-9.58e-03

Pos logit contributions
ands+0.021
 ago+0.018
 pattern+0.016
 space+0.016
 tunnel+0.015
Neg logit contributions
aleb-0.033
 butcher-0.032
 Junior-0.030
Which-0.030
 beef-0.030
Token!
Token ID0
Feature activation-2.10e-02

Pos logit contributions
ands+0.029
 ago+0.023
chie+0.022
 pattern+0.021
 common+0.021
Neg logit contributions
aleb-0.032
 butcher-0.032
 steak-0.030
 beef-0.030
 Junior-0.029
Token She
Token ID1375
Feature activation+1.17e-02

Pos logit contributions
aleb+0.044
 butcher+0.044
 Junior+0.041
 beef+0.041
Which+0.040
Neg logit contributions
ands-0.028
 ago-0.024
 pattern-0.021
 tunnel-0.021
 space-0.020
Token ate
Token ID15063
Feature activation-9.47e-02

Pos logit contributions
ands+0.019
 ago+0.017
 space+0.015
 pattern+0.015
 tunnel+0.015
Neg logit contributions
aleb-0.022
 butcher-0.021
 Junior-0.020
Which-0.019
 beef-0.019
Token all
Token ID477
Feature activation+3.13e-02

Pos logit contributions
aleb+0.267
 butcher+0.238
rified+0.235
urger+0.233
Which+0.232
Neg logit contributions
 grow-0.054
 work-0.053
.-0.051
 rain-0.050
 light-0.048
Token the
Token ID262
Feature activation-4.96e-03

Pos logit contributions
ands+0.023
 space+0.020
 old+0.017
 ago+0.016
 work+0.016
Neg logit contributions
 butcher-0.078
aleb-0.077
Which-0.074
rified-0.072
urger-0.072
Token beef
Token ID12023
Feature activation-1.97e-01

Pos logit contributions
aleb+0.007
rified+0.007
Which+0.006
eled+0.006
 butcher+0.006
Neg logit contributions
 work-0.009
ands-0.009
 old-0.009
 space-0.009
 grown-0.009
Token and
Token ID290
Feature activation-2.69e-02

Pos logit contributions
aleb+0.478
 butcher+0.464
Which+0.464
 Junior+0.433
rified+0.431
Neg logit contributions
ands-0.182
 ago-0.158
 space-0.139
 work-0.133
 pattern-0.131
Token felt
Token ID2936
Feature activation+6.96e-03

Pos logit contributions
aleb+0.053
 butcher+0.050
Which+0.048
avored+0.046
rified+0.046
Neg logit contributions
ands-0.037
 ago-0.035
 space-0.033
 rain-0.032
 grow-0.032
Token very
Token ID845
Feature activation+1.90e-02

Pos logit contributions
ands+0.006
 ago+0.005
 space+0.005
 grow+0.004
 rain+0.004
Neg logit contributions
aleb-0.018
 butcher-0.017
Which-0.016
 Junior-0.016
rified-0.016
Token full
Token ID1336
Feature activation-3.46e-02

Pos logit contributions
ands+0.027
 ago+0.024
 tunnel+0.021
 pattern+0.021
 space+0.020
Neg logit contributions
aleb-0.038
 butcher-0.037
 beef-0.035
 Junior-0.035
 steak-0.035
Token.
Token ID13
Feature activation-8.00e-03

Pos logit contributions
aleb+0.088
 butcher+0.087
 beef+0.082
 Junior+0.082
 steak+0.082
Neg logit contributions
ands-0.031
 ago-0.023
 tunnel-0.019
 pattern-0.019
 disl-0.018
TokenSo
Token ID2396
Feature activation-5.50e-02

Pos logit contributions
aleb+0.009
 butcher+0.009
Which+0.008
 Junior+0.008
 beef+0.008
Neg logit contributions
ands-0.008
 ago-0.007
 pattern-0.006
 space-0.006
 tunnel-0.006
Token,
Token ID11
Feature activation+1.61e-02

Pos logit contributions
aleb+0.139
 butcher+0.136
Which+0.127
avored+0.126
 Junior+0.124
Neg logit contributions
ands-0.044
 ago-0.040
 space-0.038
 tunnel-0.034
 work-0.033
Token he
Token ID339
Feature activation-4.17e-02

Pos logit contributions
ands+0.012
 space+0.012
 ago+0.011
 rain+0.011
 work+0.011
Neg logit contributions
aleb-0.041
 butcher-0.038
Which-0.036
eled-0.036
avored-0.036
Token took
Token ID1718
Feature activation-5.19e-02

Pos logit contributions
aleb+0.077
 butcher+0.075
 Junior+0.069
Which+0.069
 beef+0.069
Neg logit contributions
ands-0.066
 ago-0.059
 space-0.052
 pattern-0.052
 tunnel-0.052
Token his
Token ID465
Feature activation-1.90e-02

Pos logit contributions
aleb+0.130
 butcher+0.125
Which+0.119
rified+0.118
avored+0.117
Neg logit contributions
ands-0.046
 space-0.040
 work-0.038
 ago-0.034
 rain-0.033
Token beef
Token ID12023
Feature activation-1.97e-01

Pos logit contributions
rified+0.023
aleb+0.021
oph+0.021
Which+0.019
eled+0.018
Neg logit contributions
 work-0.041
 old-0.040
 space-0.039
 voice-0.037
 be-0.037
Token and
Token ID290
Feature activation-2.38e-02

Pos logit contributions
aleb+0.466
Which+0.457
 butcher+0.445
 Junior+0.431
rified+0.430
Neg logit contributions
ands-0.196
 ago-0.165
 space-0.161
 work-0.154
 tunnel-0.145
Token shared
Token ID4888
Feature activation-4.57e-02

Pos logit contributions
aleb+0.052
Which+0.049
 butcher+0.049
rified+0.048
avored+0.047
Neg logit contributions
ands-0.025
 work-0.025
 space-0.024
 rain-0.023
 grow-0.023
Token it
Token ID340
Feature activation-1.09e-02

Pos logit contributions
aleb+0.118
rified+0.114
 butcher+0.110
avored+0.109
eled+0.104
Neg logit contributions
 space-0.034
.-0.032
ands-0.031
 work-0.030
 grow-0.029
Token with
Token ID351
Feature activation+1.43e-02

Pos logit contributions
aleb+0.022
 butcher+0.021
 beef+0.020
 steak+0.020
 Junior+0.020
Neg logit contributions
ands-0.016
 ago-0.014
 tunnel-0.012
 disl-0.012
 pattern-0.012
Token everyone
Token ID2506
Feature activation+5.53e-03

Pos logit contributions
ands+0.023
 ago+0.021
 tunnel+0.019
 pattern+0.018
 space+0.018
Neg logit contributions
aleb-0.026
 butcher-0.025
 beef-0.024
 Junior-0.024
 steak-0.023
Token garden
Token ID11376
Feature activation+4.75e-02

Pos logit contributions
aleb+0.007
 butcher+0.006
 Junior+0.006
eled+0.006
Which+0.006
Neg logit contributions
ands-0.004
 ago-0.004
 space-0.004
 pattern-0.004
 tunnel-0.004
Token,
Token ID11
Feature activation+1.61e-02

Pos logit contributions
ands+0.065
 ago+0.051
 disl+0.048
 common+0.048
chie+0.045
Neg logit contributions
 butcher-0.099
aleb-0.097
 beef-0.092
 steak-0.092
Which-0.089
Token and
Token ID290
Feature activation-2.50e-02

Pos logit contributions
ands+0.020
 ago+0.017
 space+0.015
 pattern+0.015
 tunnel+0.015
Neg logit contributions
aleb-0.035
 butcher-0.035
 Junior-0.032
Which-0.032
 beef-0.032
Token picked
Token ID6497
Feature activation-2.79e-02

Pos logit contributions
aleb+0.057
 butcher+0.053
avored+0.051
Which+0.050
 beef+0.048
Neg logit contributions
ands-0.026
 ago-0.025
 space-0.024
 grow-0.024
 rain-0.023
Token the
Token ID262
Feature activation-3.85e-03

Pos logit contributions
aleb+0.072
 butcher+0.066
avored+0.064
mallow+0.063
rified+0.063
Neg logit contributions
 old-0.018
 grow-0.018
.-0.018
 space-0.018
 work-0.016
Token ju
Token ID7544
Feature activation-1.97e-01

Pos logit contributions
aleb+0.006
eled+0.006
rified+0.005
 butcher+0.005
 Junior+0.005
Neg logit contributions
 old-0.006
 space-0.006
ands-0.006
 grow-0.006
 pattern-0.006
Tokenic
Token ID291
Feature activation-1.75e-02

Pos logit contributions
 butcher+0.327
Which+0.317
 Junior+0.311
aleb+0.301
 Stan+0.290
Neg logit contributions
ands-0.340
 pattern-0.285
 tunnel-0.270
 space-0.264
 grow-0.261
Tokeniest
Token ID6386
Feature activation-6.57e-02

Pos logit contributions
aleb+0.032
 butcher+0.031
Which+0.029
 Junior+0.029
 beef+0.028
Neg logit contributions
ands-0.028
 ago-0.025
 pattern-0.022
 space-0.022
 tunnel-0.022
Token carrots
Token ID34397
Feature activation-1.50e-01

Pos logit contributions
aleb+0.084
 butcher+0.073
rified+0.071
 Junior+0.070
Which+0.067
Neg logit contributions
ands-0.138
 space-0.133
 pattern-0.131
 work-0.127
 ago-0.126
Token he
Token ID339
Feature activation-3.96e-02

Pos logit contributions
aleb+0.352
 butcher+0.346
 Junior+0.325
 beef+0.325
Which+0.322
Neg logit contributions
ands-0.161
 ago-0.131
 pattern-0.111
 space-0.110
 tunnel-0.109
Token could
Token ID714
Feature activation-1.45e-02

Pos logit contributions
aleb+0.084
 butcher+0.082
 Junior+0.078
 beef+0.077
Which+0.076
Neg logit contributions
ands-0.051
 ago-0.046
 pattern-0.039
 space-0.039
 tunnel-0.038
Token rewarded
Token ID20945
Feature activation-2.38e-02

Pos logit contributions
aleb+0.008
 butcher+0.007
 Junior+0.007
 beef+0.007
Which+0.007
Neg logit contributions
ands-0.006
 ago-0.005
 tunnel-0.004
 space-0.004
 pattern-0.004
Token with
Token ID351
Feature activation+1.38e-02

Pos logit contributions
aleb+0.048
 butcher+0.047
 beef+0.044
 Junior+0.044
Which+0.043
Neg logit contributions
ands-0.033
 ago-0.029
 tunnel-0.026
 space-0.025
 pattern-0.025
Token the
Token ID262
Feature activation-5.06e-03

Pos logit contributions
ands+0.018
 ago+0.016
 space+0.014
 pattern+0.014
 tunnel+0.014
Neg logit contributions
aleb-0.029
 butcher-0.028
 Junior-0.027
Which-0.026
 beef-0.026
Token biggest
Token ID4094
Feature activation-1.13e-01

Pos logit contributions
aleb+0.007
 butcher+0.007
 Junior+0.006
Which+0.006
 beef+0.006
Neg logit contributions
ands-0.010
 ago-0.010
 pattern-0.009
 space-0.009
 tunnel-0.009
Token,
Token ID11
Feature activation+1.12e-02

Pos logit contributions
aleb+0.171
 butcher+0.167
 beef+0.151
Which+0.150
 Junior+0.150
Neg logit contributions
ands-0.216
 ago-0.194
 space-0.175
 pattern-0.175
 tunnel-0.175
Token ju
Token ID7544
Feature activation-1.96e-01

Pos logit contributions
ands+0.017
 ago+0.016
 space+0.014
 pattern+0.014
 tunnel+0.014
Neg logit contributions
aleb-0.021
 butcher-0.020
 Junior-0.019
Which-0.019
 beef-0.018
Tokenic
Token ID291
Feature activation-1.40e-02

Pos logit contributions
 butcher+0.343
Which+0.326
 Junior+0.318
aleb+0.311
 Stanley+0.296
Neg logit contributions
ands-0.341
 pattern-0.275
 tunnel-0.265
 space-0.254
 ago-0.254
Tokeniest
Token ID6386
Feature activation-6.53e-02

Pos logit contributions
aleb+0.022
 butcher+0.022
 beef+0.020
 Junior+0.020
 steak+0.019
Neg logit contributions
ands-0.026
 ago-0.023
 space-0.021
 tunnel-0.021
 pattern-0.021
Token cher
Token ID21382
Feature activation-1.10e-01

Pos logit contributions
aleb+0.086
 butcher+0.082
 Junior+0.075
Which+0.073
 beef+0.071
Neg logit contributions
ands-0.137
 ago-0.128
 pattern-0.119
 space-0.119
 tunnel-0.117
Tokenries
Token ID1678
Feature activation-1.13e-01

Pos logit contributions
aleb+0.138
 butcher+0.137
 Junior+0.128
Which+0.121
rified+0.117
Neg logit contributions
ands-0.232
 ago-0.217
chie-0.197
 work-0.193
 space-0.193
Token!
Token ID0
Feature activation-2.50e-02

Pos logit contributions
aleb+0.249
 butcher+0.243
Which+0.228
 Junior+0.228
 beef+0.225
Neg logit contributions
ands-0.138
 ago-0.121
 space-0.103
 pattern-0.102
 tunnel-0.101
Token.
Token ID13
Feature activation+1.78e-02

Pos logit contributions
ands+0.036
 ago+0.028
 pattern+0.022
 tunnel+0.022
 space+0.022
Neg logit contributions
aleb-0.109
 butcher-0.107
 Junior-0.102
 beef-0.101
Which-0.100
Token\n
Token ID198
Feature activation+7.72e-03

Pos logit contributions
ands+0.021
 ago+0.019
 space+0.016
 pattern+0.015
 tunnel+0.015
Neg logit contributions
aleb-0.040
 butcher-0.038
 Junior-0.036
 beef-0.036
Which-0.036
Token\n
Token ID198
Feature activation+6.85e-03

Pos logit contributions
ands+0.012
 ago+0.011
 pattern+0.010
 tunnel+0.010
 space+0.010
Neg logit contributions
aleb-0.014
 butcher-0.014
 Junior-0.013
Which-0.013
 beef-0.013
Token"
Token ID1
Feature activation+1.01e-02

Pos logit contributions
ands+0.012
 ago+0.011
 pattern+0.010
 space+0.010
 disl+0.009
Neg logit contributions
aleb-0.012
 butcher-0.012
Which-0.011
 Junior-0.010
Stan-0.010
TokenBro
Token ID15783
Feature activation-1.86e-02

Pos logit contributions
ands+0.020
 ago+0.017
 common+0.017
 space+0.016
 rain+0.016
Neg logit contributions
aleb-0.015
 butcher-0.014
 Junior-0.013
rified-0.013
 steak-0.013
Tokenccoli
Token ID34544
Feature activation-1.96e-01

Pos logit contributions
aleb+0.020
 butcher+0.019
 Junior+0.016
 beef+0.016
Which+0.016
Neg logit contributions
ands-0.044
 ago-0.039
 space-0.039
 pattern-0.039
 tunnel-0.039
Token is
Token ID318
Feature activation+2.44e-02

Pos logit contributions
aleb+0.437
 butcher+0.420
Which+0.407
 Junior+0.393
rified+0.389
Neg logit contributions
ands-0.225
 ago-0.198
 pattern-0.182
 space-0.177
 tunnel-0.175
Token not
Token ID407
Feature activation+4.26e-02

Pos logit contributions
ands+0.027
 ago+0.023
 space+0.020
 pattern+0.020
 tunnel+0.019
Neg logit contributions
aleb-0.057
 butcher-0.055
 Junior-0.051
Which-0.051
rified-0.051
Token y
Token ID331
Feature activation-1.15e-01

Pos logit contributions
ands+0.057
 ago+0.051
 space+0.046
 pattern+0.044
 tunnel+0.043
Neg logit contributions
aleb-0.089
 butcher-0.085
Which-0.080
 Junior-0.079
rified-0.079
Tokenucky
Token ID5309
Feature activation-4.93e-02

Pos logit contributions
Which+0.200
 butcher+0.193
aleb+0.187
 Junior+0.180
 Turkey+0.172
Neg logit contributions
ands-0.197
 ago-0.178
 work-0.157
 space-0.155
 pattern-0.155
Token,
Token ID11
Feature activation+2.80e-02

Pos logit contributions
aleb+0.130
 butcher+0.124
Which+0.123
 Junior+0.120
eled+0.118
Neg logit contributions
ands-0.040
 ago-0.034
 space-0.025
 work-0.025
 pattern-0.023
Token On
Token ID1550
Feature activation+2.68e-03

Pos logit contributions
ands+0.001
 ago+0.001
 tunnel+0.001
 pattern+0.001
chie+0.000
Neg logit contributions
 butcher-0.001
aleb-0.001
 Junior-0.001
 beef-0.001
Which-0.001
Token the
Token ID262
Feature activation+1.87e-04

Pos logit contributions
ands+0.004
 ago+0.004
 disl+0.003
 pattern+0.003
chie+0.003
Neg logit contributions
aleb-0.005
 butcher-0.005
 Junior-0.005
 beef-0.005
 steak-0.005
Token way
Token ID835
Feature activation-3.55e-02

Pos logit contributions
ands+0.000
 ago+0.000
 disl+0.000
chie+0.000
 common+0.000
Neg logit contributions
 butcher-0.000
 beef-0.000
aleb-0.000
 steak-0.000
 Junior-0.000
Token back
Token ID736
Feature activation-3.74e-02

Pos logit contributions
aleb+0.081
 butcher+0.079
 Junior+0.074
Which+0.074
 beef+0.073
Neg logit contributions
ands-0.041
 ago-0.035
 pattern-0.029
 space-0.029
 tunnel-0.029
Token to
Token ID284
Feature activation+1.85e-02

Pos logit contributions
 butcher+0.072
aleb+0.071
 Junior+0.067
 beef+0.067
 steak+0.065
Neg logit contributions
ands-0.056
 disl-0.045
 ago-0.043
chie-0.040
 pattern-0.039
Token school
Token ID1524
Feature activation+1.31e-01

Pos logit contributions
ands+0.031
 ago+0.026
chie+0.024
 pattern+0.023
ooters+0.022
Neg logit contributions
 butcher-0.033
aleb-0.033
 Junior-0.032
 beef-0.031
 steak-0.031
Token,
Token ID11
Feature activation+1.46e-02

Pos logit contributions
ands+0.196
 disl+0.143
 common+0.139
chie+0.138
 ago+0.132
Neg logit contributions
 butcher-0.255
aleb-0.253
 Junior-0.243
 steak-0.237
 beef-0.235
Token they
Token ID484
Feature activation-1.12e-02

Pos logit contributions
ands+0.020
 ago+0.018
 space+0.016
 pattern+0.016
 tunnel+0.015
Neg logit contributions
aleb-0.030
 butcher-0.029
Which-0.027
 Junior-0.027
 beef-0.027
Token saw
Token ID2497
Feature activation+1.86e-02

Pos logit contributions
aleb+0.021
 butcher+0.021
 Junior+0.019
Which+0.019
 beef+0.019
Neg logit contributions
ands-0.017
 ago-0.015
 space-0.014
 pattern-0.014
 tunnel-0.013
Token a
Token ID257
Feature activation-6.70e-03

Pos logit contributions
ands+0.017
 ago+0.016
 space+0.015
 tunnel+0.014
 rain+0.014
Neg logit contributions
aleb-0.045
 butcher-0.043
 Junior-0.040
Stan-0.040
Which-0.040
Token p
Token ID279
Feature activation+1.01e-01

Pos logit contributions
aleb+0.015
 butcher+0.014
 Junior+0.014
Which+0.014
 beef+0.013
Neg logit contributions
ands-0.008
 ago-0.007
 pattern-0.006
 space-0.006
 tunnel-0.006
Token for
Token ID329
Feature activation-2.86e-02

Pos logit contributions
ands+0.006
 ago+0.005
 tunnel+0.005
 pattern+0.004
 space+0.004
Neg logit contributions
aleb-0.008
 butcher-0.008
 beef-0.008
 Junior-0.008
 steak-0.007
Token dessert
Token ID23084
Feature activation-9.13e-02

Pos logit contributions
aleb+0.056
 butcher+0.055
 Junior+0.051
 beef+0.051
Which+0.051
Neg logit contributions
ands-0.042
 ago-0.037
 pattern-0.033
 tunnel-0.032
 space-0.032
Token."
Token ID526
Feature activation+3.77e-02

Pos logit contributions
aleb+0.231
 butcher+0.228
 beef+0.214
 Junior+0.214
Which+0.213
Neg logit contributions
ands-0.081
 ago-0.065
 pattern-0.052
 tunnel-0.052
 space-0.051
Token\n
Token ID198
Feature activation+6.42e-03

Pos logit contributions
ands+0.063
 ago+0.054
 tunnel+0.051
 pattern+0.050
chie+0.049
Neg logit contributions
aleb-0.067
 butcher-0.066
 Junior-0.062
Which-0.061
 beef-0.060
Token\n
Token ID198
Feature activation+5.28e-03

Pos logit contributions
ands+0.009
 ago+0.008
 pattern+0.007
 space+0.007
 tunnel+0.007
Neg logit contributions
aleb-0.013
 butcher-0.013
 Junior-0.012
Which-0.012
 beef-0.012
TokenL
Token ID43
Feature activation+9.94e-02

Pos logit contributions
ands+0.009
 ago+0.008
 pattern+0.008
 space+0.007
 tunnel+0.007
Neg logit contributions
aleb-0.009
 butcher-0.009
Which-0.008
 Junior-0.008
 beef-0.008
Tokenily
Token ID813
Feature activation+4.35e-02

Pos logit contributions
ands+0.140
 ago+0.129
 tunnel+0.125
 pattern+0.123
 space+0.121
Neg logit contributions
aleb-0.194
 butcher-0.180
 beef-0.163
rified-0.162
Which-0.161
Token p
Token ID279
Feature activation+1.04e-01

Pos logit contributions
ands+0.045
 ago+0.042
 pattern+0.038
 grow+0.036
 hall+0.035
Neg logit contributions
aleb-0.097
 butcher-0.096
Which-0.094
 Junior-0.090
 beef-0.089
Tokenouted
Token ID18534
Feature activation+2.41e-02

Pos logit contributions
ands+0.195
 ago+0.178
 space+0.162
 pattern+0.162
 tunnel+0.161
Neg logit contributions
aleb-0.164
 butcher-0.158
 Junior-0.144
Which-0.143
 beef-0.142
Token and
Token ID290
Feature activation-1.45e-02

Pos logit contributions
ands+0.029
 ago+0.024
 tunnel+0.020
 pattern+0.020
 space+0.019
Neg logit contributions
aleb-0.055
 butcher-0.054
 beef-0.051
 Junior-0.051
 steak-0.050
Token pushed
Token ID7121
Feature activation-2.63e-02

Pos logit contributions
aleb+0.026
 butcher+0.025
 Junior+0.024
 beef+0.023
 steak+0.023
Neg logit contributions
ands-0.025
 ago-0.021
 tunnel-0.020
 pattern-0.019
 space-0.019
Token there
Token ID612
Feature activation+7.65e-03

Pos logit contributions
 ago+0.023
ands+0.020
 space+0.019
 tunnel+0.019
 pattern+0.018
Neg logit contributions
aleb-0.055
 butcher-0.054
 french-0.049
 Junior-0.049
 beef-0.048
Token was
Token ID373
Feature activation+7.34e-03

Pos logit contributions
 be+0.005
 pattern+0.005
 ago+0.005
 tunnel+0.005
 space+0.005
Neg logit contributions
aleb-0.021
 butcher-0.021
Which-0.018
 beef-0.018
Stan-0.018
Token a
Token ID257
Feature activation-2.89e-03

Pos logit contributions
ands+0.005
 ago+0.004
 space+0.004
 rain+0.004
 pattern+0.003
Neg logit contributions
aleb-0.020
 butcher-0.019
rified-0.018
Which-0.018
 Junior-0.018
Token little
Token ID1310
Feature activation-4.80e-03

Pos logit contributions
aleb+0.007
 butcher+0.006
rified+0.006
Which+0.006
 Junior+0.006
Neg logit contributions
ands-0.003
 space-0.002
 ago-0.002
 pattern-0.002
 tunnel-0.002
Token girl
Token ID2576
Feature activation+2.47e-03

Pos logit contributions
aleb+0.008
 butcher+0.008
Which+0.007
 Junior+0.007
 beef+0.007
Neg logit contributions
ands-0.008
 ago-0.008
 space-0.007
 tunnel-0.007
 pattern-0.007
Token named
Token ID3706
Feature activation+8.54e-02

Pos logit contributions
ands+0.004
 ago+0.003
 pattern+0.003
 disl+0.003
 tunnel+0.003
Neg logit contributions
aleb-0.005
 butcher-0.005
 Junior-0.004
Which-0.004
 beef-0.004
Token Lily
Token ID20037
Feature activation+5.10e-02

Pos logit contributions
ands+0.122
 ago+0.095
 disl+0.087
 tunnel+0.087
 outage+0.084
Neg logit contributions
aleb-0.169
 butcher-0.166
Which-0.162
 Junior-0.158
 beef-0.157
Token who
Token ID508
Feature activation-2.40e-02

Pos logit contributions
ands+0.040
 ago+0.030
 pattern+0.022
 tunnel+0.022
 space+0.022
Neg logit contributions
aleb-0.136
 butcher-0.133
 Junior-0.127
Which-0.126
 beef-0.126
Token loved
Token ID6151
Feature activation+1.92e-02

Pos logit contributions
aleb+0.048
 butcher+0.047
Which+0.044
 Junior+0.044
 beef+0.043
Neg logit contributions
ands-0.034
 ago-0.030
 space-0.027
 pattern-0.027
 tunnel-0.027
Token to
Token ID284
Feature activation+1.90e-02

Pos logit contributions
ands+0.023
 ago+0.020
 pattern+0.017
 space+0.017
 tunnel+0.017
Neg logit contributions
aleb-0.042
 butcher-0.042
 Junior-0.039
Which-0.039
 beef-0.039
Token play
Token ID711
Feature activation+8.04e-02

Pos logit contributions
ands+0.031
 ago+0.029
chie+0.026
 tunnel+0.024
 outage+0.023
Neg logit contributions
 butcher-0.033
 beef-0.032
 steak-0.032
aleb-0.032
 french-0.031
Token him
Token ID683
Feature activation-2.66e-02

Pos logit contributions
aleb+0.060
 butcher+0.058
Which+0.053
 Junior+0.053
 beef+0.052
Neg logit contributions
ands-0.070
 ago-0.063
 space-0.058
 pattern-0.057
 tunnel-0.057
Token.
Token ID13
Feature activation-5.69e-03

Pos logit contributions
aleb+0.069
 butcher+0.068
Which+0.064
 Junior+0.064
 beef+0.064
Neg logit contributions
ands-0.022
 ago-0.017
 space-0.013
 pattern-0.013
 tunnel-0.013
Token But
Token ID887
Feature activation+2.03e-02

Pos logit contributions
aleb+0.011
 butcher+0.011
Which+0.010
 Junior+0.010
 beef+0.010
Neg logit contributions
ands-0.009
 ago-0.008
 space-0.007
 pattern-0.007
 tunnel-0.007
Token Benny
Token ID44275
Feature activation-3.28e-02

Pos logit contributions
ands+0.033
 ago+0.030
 space+0.028
 pattern+0.027
 tunnel+0.027
Neg logit contributions
aleb-0.036
 butcher-0.035
Which-0.032
 Junior-0.032
 beef-0.032
Token didn
Token ID1422
Feature activation+2.03e-02

Pos logit contributions
aleb+0.071
 butcher+0.069
Which+0.068
avored+0.065
 beef+0.064
Neg logit contributions
ands-0.036
 ago-0.034
 grow-0.032
 pattern-0.030
 tunnel-0.030
Token't
Token ID470
Feature activation+8.56e-02

Pos logit contributions
ands+0.024
 ago+0.021
 space+0.017
 disl+0.017
 tunnel+0.017
Neg logit contributions
aleb-0.045
 butcher-0.045
 Junior-0.043
 beef-0.042
Which-0.042
Token want
Token ID765
Feature activation-4.35e-02

Pos logit contributions
ands+0.135
 ago+0.121
 tunnel+0.109
 disl+0.106
 hall+0.103
Neg logit contributions
 butcher-0.158
aleb-0.150
Stan-0.142
 Junior-0.141
 beef-0.141
Token carrots
Token ID34397
Feature activation-1.50e-01

Pos logit contributions
aleb+0.117
 butcher+0.111
eled+0.108
avored+0.107
rified+0.107
Neg logit contributions
ands-0.030
 space-0.024
 ago-0.023
 rain-0.021
 work-0.020
Token,
Token ID11
Feature activation+1.54e-02

Pos logit contributions
aleb+0.355
 butcher+0.349
 Junior+0.328
 beef+0.327
Which+0.324
Neg logit contributions
ands-0.159
 ago-0.131
 pattern-0.109
 space-0.108
 tunnel-0.108
Token he
Token ID339
Feature activation-3.91e-02

Pos logit contributions
ands+0.019
 ago+0.016
 space+0.014
 pattern+0.014
 tunnel+0.014
Neg logit contributions
aleb-0.034
 butcher-0.033
 Junior-0.031
Which-0.031
 beef-0.031
Token wanted
Token ID2227
Feature activation-3.17e-02

Pos logit contributions
aleb+0.086
 butcher+0.083
Which+0.082
 Junior+0.078
avored+0.078
Neg logit contributions
ands-0.042
 ago-0.039
 grow-0.038
 pattern-0.037
 tunnel-0.035
Token"
Token ID1
Feature activation+9.89e-03

Pos logit contributions
ands+0.018
 ago+0.016
 pattern+0.015
 space+0.014
 disl+0.014
Neg logit contributions
aleb-0.016
 butcher-0.016
Which-0.015
 Junior-0.014
 beef-0.014
TokenCome
Token ID16773
Feature activation-1.05e-01

Pos logit contributions
ands+0.020
 ago+0.018
 space+0.017
 common+0.017
 tunnel+0.017
Neg logit contributions
aleb-0.014
 butcher-0.013
 Junior-0.012
 steak-0.012
 beef-0.012
Token on
Token ID319
Feature activation-3.24e-02

Pos logit contributions
aleb+0.197
 butcher+0.190
 Junior+0.174
 beef+0.173
rified+0.173
Neg logit contributions
ands-0.160
 ago-0.143
 space-0.137
 tunnel-0.135
 pattern-0.132
Token,
Token ID11
Feature activation+2.76e-02

Pos logit contributions
aleb+0.081
 butcher+0.080
 Junior+0.076
 beef+0.075
Which+0.075
Neg logit contributions
ands-0.030
 ago-0.024
 pattern-0.019
 space-0.019
 tunnel-0.019
Token don
Token ID836
Feature activation+2.09e-02

Pos logit contributions
ands+0.044
 ago+0.039
 pattern+0.033
 common+0.033
 space+0.033
Neg logit contributions
aleb-0.051
 butcher-0.050
 Junior-0.047
 beef-0.047
Which-0.046
Token't
Token ID470
Feature activation+9.40e-02

Pos logit contributions
ands+0.018
 ago+0.015
 space+0.012
 pattern+0.011
 tunnel+0.011
Neg logit contributions
aleb-0.053
 butcher-0.052
 Junior-0.049
Which-0.049
 beef-0.049
Token be
Token ID307
Feature activation+5.62e-02

Pos logit contributions
ands+0.132
 ago+0.115
 pattern+0.090
 disl+0.089
 space+0.088
Neg logit contributions
aleb-0.195
 butcher-0.186
 Junior-0.180
 beef-0.177
 steak-0.176
Token a
Token ID257
Feature activation-3.28e-04

Pos logit contributions
ands+0.097
 ago+0.081
 pattern+0.072
alk+0.072
ro+0.070
Neg logit contributions
 beef-0.090
aleb-0.089
 butcher-0.089
 Junior-0.087
 steak-0.086
Token baby
Token ID5156
Feature activation-8.57e-02

Pos logit contributions
 butcher+0.001
 beef+0.001
 turkey+0.001
aleb+0.001
 Junior+0.001
Neg logit contributions
ands-0.001
 ago-0.000
 disl-0.000
alk-0.000
ro-0.000
Token!"
Token ID2474
Feature activation-8.86e-03

Pos logit contributions
aleb+0.198
 butcher+0.196
 Junior+0.184
 beef+0.184
Which+0.183
Neg logit contributions
ands-0.095
 ago-0.078
 pattern-0.066
 space-0.064
 tunnel-0.064
Token Tim
Token ID5045
Feature activation+1.42e-02

Pos logit contributions
 butcher+0.010
 beef+0.010
 Stan+0.010
 Junior+0.010
Which+0.010
Neg logit contributions
ands-0.021
 ago-0.017
chie-0.017
 outage-0.016
forest-0.016
Token Lily
Token ID20037
Feature activation+6.01e-02

Pos logit contributions
ands+0.030
 ago+0.025
chie+0.025
 tunnel+0.024
 pattern+0.023
Neg logit contributions
 butcher-0.032
aleb-0.031
Which-0.030
 Junior-0.030
 beef-0.030
Token felt
Token ID2936
Feature activation+1.79e-02

Pos logit contributions
ands+0.085
 ago+0.075
 pattern+0.067
 space+0.067
 tunnel+0.066
Neg logit contributions
aleb-0.120
 butcher-0.117
Which-0.109
 Junior-0.109
 beef-0.108
Token happy
Token ID3772
Feature activation+2.46e-02

Pos logit contributions
ands+0.025
 ago+0.023
 tunnel+0.019
 pattern+0.019
 space+0.019
Neg logit contributions
aleb-0.036
 butcher-0.035
Which-0.033
 Junior-0.033
 beef-0.033
Token.
Token ID13
Feature activation+1.67e-02

Pos logit contributions
ands+0.044
chie+0.040
 ago+0.038
 common+0.037
 disl+0.034
Neg logit contributions
 steak-0.041
 beef-0.038
 butcher-0.037
 turkey-0.036
aleb-0.035
Token She
Token ID1375
Feature activation+3.01e-02

Pos logit contributions
ands+0.023
 ago+0.020
 tunnel+0.018
 pattern+0.017
chie+0.017
Neg logit contributions
aleb-0.034
 butcher-0.033
Which-0.031
 Junior-0.031
 beef-0.031
Token said
Token ID531
Feature activation-2.09e-02

Pos logit contributions
ands+0.050
 ago+0.046
 tunnel+0.039
 space+0.038
chie+0.038
Neg logit contributions
aleb-0.054
 butcher-0.052
 Junior-0.049
Which-0.048
 beef-0.047
Token,
Token ID11
Feature activation+2.70e-02

Pos logit contributions
aleb+0.051
 butcher+0.050
 Junior+0.047
 beef+0.047
Which+0.047
Neg logit contributions
ands-0.021
 ago-0.017
 tunnel-0.014
 pattern-0.014
 space-0.014
Token "
Token ID366
Feature activation-4.54e-02

Pos logit contributions
ands+0.038
 ago+0.035
 space+0.030
 pattern+0.030
 tunnel+0.029
Neg logit contributions
aleb-0.054
 butcher-0.052
 Junior-0.048
Which-0.048
 beef-0.048
TokenI
Token ID40
Feature activation-2.18e-02

Pos logit contributions
aleb+0.067
 butcher+0.066
 Junior+0.059
 beef+0.058
 steak+0.057
Neg logit contributions
ands-0.088
 ago-0.078
 space-0.073
 pattern-0.073
 rain-0.071
Token love
Token ID1842
Feature activation+6.24e-03

Pos logit contributions
aleb+0.042
 butcher+0.041
Which+0.039
 Junior+0.038
 beef+0.038
Neg logit contributions
ands-0.033
 ago-0.029
 tunnel-0.025
 space-0.025
 pattern-0.025
Token you
Token ID345
Feature activation+6.34e-02

Pos logit contributions
ands+0.010
 ago+0.009
 tunnel+0.008
ro+0.008
 outage+0.008
Neg logit contributions
 steak-0.011
 beef-0.011
aleb-0.010
 butcher-0.010
 turkey-0.010
Token on
Token ID319
Feature activation-3.23e-02

Pos logit contributions
aleb+0.075
 butcher+0.073
 Junior+0.066
Which+0.066
 beef+0.065
Neg logit contributions
ands-0.070
 ago-0.064
 space-0.059
 tunnel-0.058
 pattern-0.058
Token top
Token ID1353
Feature activation+9.98e-03

Pos logit contributions
aleb+0.066
 butcher+0.065
 Junior+0.060
Which+0.060
 beef+0.060
Neg logit contributions
ands-0.044
 ago-0.039
 space-0.034
 pattern-0.034
 tunnel-0.034
Token of
Token ID286
Feature activation+1.21e-03

Pos logit contributions
ands+0.017
 disl+0.015
 ago+0.014
chie+0.013
 talent+0.013
Neg logit contributions
aleb-0.016
 butcher-0.016
 beef-0.015
 steak-0.015
rified-0.014
Token it
Token ID340
Feature activation+1.48e-03

Pos logit contributions
ands+0.002
 ago+0.002
 pattern+0.001
 disl+0.001
 tunnel+0.001
Neg logit contributions
aleb-0.002
 butcher-0.002
 beef-0.002
Which-0.002
 Junior-0.002
Token and
Token ID290
Feature activation-1.64e-02

Pos logit contributions
ands+0.002
 disl+0.002
chie+0.002
 common+0.002
 ago+0.002
Neg logit contributions
aleb-0.003
 butcher-0.003
 steak-0.003
 beef-0.003
rified-0.003
Token looked
Token ID3114
Feature activation+3.34e-02

Pos logit contributions
aleb+0.037
 butcher+0.037
 beef+0.034
Which+0.034
avored+0.034
Neg logit contributions
ands-0.017
 ago-0.016
 space-0.015
 pattern-0.015
 grow-0.015
Token at
Token ID379
Feature activation-2.39e-03

Pos logit contributions
ands+0.052
 ago+0.048
 space+0.044
 pattern+0.043
 tunnel+0.042
Neg logit contributions
aleb-0.062
 butcher-0.061
 Junior-0.056
Which-0.055
 beef-0.054
Token their
Token ID511
Feature activation+5.30e-03

Pos logit contributions
aleb+0.005
 butcher+0.005
 Junior+0.004
Which+0.004
 beef+0.004
Neg logit contributions
ands-0.003
 ago-0.003
 space-0.003
 pattern-0.003
 tunnel-0.003
Token shadows
Token ID16187
Feature activation+6.50e-02

Pos logit contributions
ands+0.008
 ago+0.007
 disl+0.006
 outage+0.006
chie+0.006
Neg logit contributions
aleb-0.010
 butcher-0.009
 steak-0.009
 beef-0.009
Which-0.009
Token.
Token ID13
Feature activation+5.78e-03

Pos logit contributions
ands+0.066
 ago+0.049
 disl+0.044
 common+0.043
chie+0.038
Neg logit contributions
aleb-0.153
 butcher-0.151
 beef-0.146
 steak-0.145
rified-0.142
Token They
Token ID1119
Feature activation+1.30e-02

Pos logit contributions
ands+0.008
 ago+0.007
 tunnel+0.006
 pattern+0.006
 space+0.006
Neg logit contributions
aleb-0.012
 butcher-0.012
 Junior-0.011
Which-0.011
 beef-0.011
Token,
Token ID11
Feature activation+2.04e-02

Pos logit contributions
ands+0.016
 ago+0.013
 disl+0.013
 tunnel+0.012
chie+0.012
Neg logit contributions
 butcher-0.016
aleb-0.016
 beef-0.016
 Junior-0.015
 steak-0.015
Token but
Token ID475
Feature activation-5.90e-02

Pos logit contributions
ands+0.030
 ago+0.025
 tunnel+0.022
 space+0.022
 pattern+0.022
Neg logit contributions
aleb-0.040
 butcher-0.040
 Junior-0.037
Which-0.037
 beef-0.037
Token eventually
Token ID4191
Feature activation+2.05e-02

Pos logit contributions
aleb+0.136
 butcher+0.133
Which+0.123
 Junior+0.123
 beef+0.123
Neg logit contributions
ands-0.062
 ago-0.059
 space-0.048
 pattern-0.048
 tunnel-0.047
Token she
Token ID673
Feature activation+6.77e-03

Pos logit contributions
ands+0.027
 ago+0.022
 tunnel+0.019
 pattern+0.019
 space+0.019
Neg logit contributions
aleb-0.044
 butcher-0.043
 Junior-0.041
Which-0.041
 beef-0.040
Token managed
Token ID5257
Feature activation-4.34e-02

Pos logit contributions
ands+0.009
 ago+0.009
 pattern+0.008
 space+0.007
 tunnel+0.007
Neg logit contributions
aleb-0.014
 butcher-0.013
Which-0.012
 Junior-0.012
 beef-0.012
Token to
Token ID284
Feature activation+1.73e-02

Pos logit contributions
aleb+0.083
 butcher+0.082
 beef+0.078
 steak+0.077
 Junior+0.077
Neg logit contributions
ands-0.066
 ago-0.055
 tunnel-0.051
 pattern-0.048
 disl-0.048
Token copy
Token ID4866
Feature activation-1.17e-02

Pos logit contributions
ands+0.026
 ago+0.024
 disl+0.020
 tunnel+0.020
 hall+0.020
Neg logit contributions
aleb-0.033
 butcher-0.031
 Junior-0.030
 steak-0.029
 beef-0.029
Token the
Token ID262
Feature activation-2.60e-03

Pos logit contributions
aleb+0.023
 butcher+0.023
 Junior+0.021
 beef+0.021
Which+0.021
Neg logit contributions
ands-0.017
 ago-0.015
 tunnel-0.013
 pattern-0.012
 space-0.012
Token signs
Token ID5895
Feature activation+6.35e-02

Pos logit contributions
 butcher+0.005
aleb+0.005
 beef+0.005
 steak+0.005
 turkey+0.005
Neg logit contributions
ands-0.004
 ago-0.003
 disl-0.003
 tunnel-0.003
ro-0.003
Token down
Token ID866
Feature activation-7.29e-02

Pos logit contributions
ands+0.076
 ago+0.061
 tunnel+0.054
 disl+0.052
 space+0.052
Neg logit contributions
aleb-0.141
 butcher-0.139
 Junior-0.130
 beef-0.130
Which-0.129
Token.
Token ID13
Feature activation-5.67e-03

Pos logit contributions
 butcher+0.150
aleb+0.149
Which+0.141
 beef+0.141
rified+0.137
Neg logit contributions
ands-0.095
 ago-0.074
 disl-0.073
 tunnel-0.068
 pattern-0.066
Token friends
Token ID2460
Feature activation+3.41e-02

Pos logit contributions
aleb+0.250
 butcher+0.244
 beef+0.228
 Junior+0.224
Which+0.224
Neg logit contributions
ands-0.184
 ago-0.165
 pattern-0.142
 space-0.141
 tunnel-0.139
Token,
Token ID11
Feature activation+2.41e-02

Pos logit contributions
ands+0.050
 ago+0.040
 disl+0.037
 pattern+0.036
ro+0.036
Neg logit contributions
aleb-0.069
 butcher-0.067
 Junior-0.064
 beef-0.063
Which-0.062
Token Amy
Token ID14235
Feature activation-1.83e-02

Pos logit contributions
ands+0.038
 ago+0.033
 pattern+0.029
 tunnel+0.029
 space+0.028
Neg logit contributions
aleb-0.046
 butcher-0.044
 Junior-0.042
 beef-0.041
Which-0.041
Token and
Token ID290
Feature activation-1.56e-02

Pos logit contributions
aleb+0.041
 butcher+0.040
 Junior+0.038
Which+0.038
 beef+0.037
Neg logit contributions
ands-0.022
 ago-0.018
 pattern-0.016
 space-0.016
 tunnel-0.015
Token Sarah
Token ID10490
Feature activation+4.81e-02

Pos logit contributions
aleb+0.027
 butcher+0.027
 Junior+0.025
Which+0.024
 beef+0.024
Neg logit contributions
ands-0.026
 ago-0.024
 space-0.021
 pattern-0.021
 tunnel-0.021
Token.
Token ID13
Feature activation+5.59e-04

Pos logit contributions
ands+0.042
 ago+0.032
 pattern+0.025
 tunnel+0.024
 space+0.024
Neg logit contributions
aleb-0.124
 butcher-0.120
 Junior-0.115
 beef-0.115
Which-0.113
Token They
Token ID1119
Feature activation+1.18e-02

Pos logit contributions
ands+0.001
 ago+0.001
 tunnel+0.001
 pattern+0.001
 space+0.001
Neg logit contributions
aleb-0.001
 butcher-0.001
 Junior-0.001
Which-0.001
 beef-0.001
Token liked
Token ID8288
Feature activation+6.68e-03

Pos logit contributions
ands+0.021
 ago+0.019
 tunnel+0.017
 outage+0.017
 disl+0.016
Neg logit contributions
aleb-0.019
 butcher-0.018
 Junior-0.017
Which-0.017
rified-0.016
Token to
Token ID284
Feature activation+1.68e-02

Pos logit contributions
ands+0.006
 ago+0.005
 space+0.005
 pattern+0.004
 rain+0.004
Neg logit contributions
aleb-0.017
 butcher-0.016
rified-0.015
 Junior-0.015
Which-0.015
Token play
Token ID711
Feature activation+7.82e-02

Pos logit contributions
ands+0.027
 ago+0.025
 outage+0.022
chie+0.021
post+0.019
Neg logit contributions
aleb-0.027
 butcher-0.026
 turkey-0.025
 beef-0.025
 apple-0.025
Token together
Token ID1978
Feature activation+5.63e-02

Pos logit contributions
ands+0.103
 ago+0.086
 space+0.081
 pattern+0.079
 tunnel+0.079
Neg logit contributions
aleb-0.168
 butcher-0.164
 Junior-0.154
Which-0.151
 beef-0.149
Token,
Token ID11
Feature activation+2.09e-02

Pos logit contributions
 butcher+0.055
aleb+0.055
Which+0.052
 steak+0.052
 beef+0.052
Neg logit contributions
ands-0.037
 ago-0.030
 tunnel-0.026
 space-0.026
 pattern-0.026
Token "
Token ID366
Feature activation-5.19e-02

Pos logit contributions
ands+0.037
 ago+0.032
 tunnel+0.029
 pattern+0.029
 space+0.029
Neg logit contributions
 butcher-0.035
aleb-0.035
 Junior-0.033
Which-0.033
 beef-0.032
TokenE
Token ID36
Feature activation+3.38e-03

Pos logit contributions
aleb+0.089
 butcher+0.088
 steak+0.084
 beef+0.084
 turkey+0.083
Neg logit contributions
ands-0.071
 rain-0.069
ro-0.066
 be-0.066
 tunnel-0.066
Tokenw
Token ID86
Feature activation-5.04e-02

Pos logit contributions
ands+0.007
 ago+0.007
 space+0.006
 pattern+0.006
 tunnel+0.006
Neg logit contributions
aleb-0.004
 butcher-0.004
 Junior-0.004
Which-0.004
 beef-0.004
Token,
Token ID11
Feature activation+2.26e-02

Pos logit contributions
aleb+0.129
 butcher+0.128
Which+0.121
 beef+0.119
 Junior+0.119
Neg logit contributions
ands-0.041
 ago-0.034
 pattern-0.029
 space-0.027
 tunnel-0.027
Token this
Token ID428
Feature activation+3.36e-02

Pos logit contributions
ands+0.026
 ago+0.023
 space+0.020
 tunnel+0.020
 pattern+0.020
Neg logit contributions
aleb-0.051
 butcher-0.049
Which-0.046
 Junior-0.046
 beef-0.045
Token toy
Token ID13373
Feature activation-3.48e-02

Pos logit contributions
ands+0.057
 ago+0.049
 disl+0.042
 pattern+0.041
 tunnel+0.041
Neg logit contributions
aleb-0.060
 butcher-0.059
 beef-0.055
Which-0.055
 Junior-0.055
Token is
Token ID318
Feature activation+2.45e-02

Pos logit contributions
aleb+0.065
 butcher+0.064
 Junior+0.059
Which+0.059
 beef+0.059
Neg logit contributions
ands-0.055
 ago-0.048
 pattern-0.041
 disl-0.041
 space-0.041
Token disgusting
Token ID23374
Feature activation-4.43e-02

Pos logit contributions
ands+0.038
 ago+0.031
 disl+0.027
 hall+0.026
ro+0.025
Neg logit contributions
aleb-0.046
 butcher-0.046
 beef-0.046
 steak-0.045
Which-0.044
Token.
Token ID13
Feature activation+2.27e-02

Pos logit contributions
aleb+0.099
 butcher+0.098
 beef+0.093
 Junior+0.092
 steak+0.091
Neg logit contributions
ands-0.053
 ago-0.044
 pattern-0.036
 space-0.036
 tunnel-0.036
Token I
Token ID314
Feature activation-4.16e-03

Pos logit contributions
ands+0.038
 ago+0.033
 space+0.030
 pattern+0.030
 tunnel+0.030
Neg logit contributions
aleb-0.040
 butcher-0.040
 Junior-0.037
Which-0.037
 beef-0.036
Token mom
Token ID1995
Feature activation-3.17e-02

Pos logit contributions
Which+0.095
aleb+0.095
rified+0.092
Stan+0.088
 Junior+0.088
Neg logit contributions
 space-0.044
 old-0.044
 work-0.041
 voice-0.040
 rain-0.040
Token came
Token ID1625
Feature activation+3.63e-02

Pos logit contributions
aleb+0.073
Which+0.070
 butcher+0.068
avored+0.068
rified+0.067
Neg logit contributions
 ago-0.029
ands-0.028
 grown-0.026
 grow-0.026
 work-0.025
Token back
Token ID736
Feature activation-4.41e-02

Pos logit contributions
ands+0.050
 ago+0.044
 space+0.038
 pattern+0.038
 tunnel+0.038
Neg logit contributions
aleb-0.075
 butcher-0.073
 Junior-0.068
Which-0.068
 beef-0.068
Token to
Token ID284
Feature activation+1.72e-02

Pos logit contributions
aleb+0.087
 butcher+0.087
 beef+0.080
 Junior+0.080
 steak+0.078
Neg logit contributions
ands-0.065
 ago-0.054
 pattern-0.049
 disl-0.049
 tunnel-0.048
Token check
Token ID2198
Feature activation+2.78e-02

Pos logit contributions
ands+0.027
 ago+0.023
 tunnel+0.021
 pattern+0.021
 outage+0.021
Neg logit contributions
 butcher-0.032
aleb-0.032
 Junior-0.031
 beef-0.030
Which-0.030
Token on
Token ID319
Feature activation-3.50e-02

Pos logit contributions
ands+0.049
 ago+0.044
 pattern+0.040
 space+0.040
 tunnel+0.039
Neg logit contributions
aleb-0.047
 butcher-0.046
 Junior-0.042
Which-0.042
 beef-0.041
Token him
Token ID683
Feature activation-2.27e-02

Pos logit contributions
aleb+0.067
 butcher+0.065
Which+0.060
 Junior+0.060
 beef+0.058
Neg logit contributions
ands-0.053
 ago-0.048
 space-0.043
 pattern-0.042
 tunnel-0.042
Token.
Token ID13
Feature activation+1.18e-03

Pos logit contributions
aleb+0.062
 butcher+0.061
Which+0.058
 Junior+0.058
 beef+0.057
Neg logit contributions
ands-0.016
 ago-0.012
 space-0.009
 pattern-0.008
 tunnel-0.008
Token She
Token ID1375
Feature activation+2.09e-02

Pos logit contributions
ands+0.002
 ago+0.002
 common+0.002
chie+0.002
 tunnel+0.002
Neg logit contributions
 butcher-0.002
aleb-0.002
 beef-0.002
 Junior-0.002
Which-0.002
Token saw
Token ID2497
Feature activation+1.76e-02

Pos logit contributions
ands+0.019
 ago+0.018
 grow+0.017
 space+0.017
 work+0.017
Neg logit contributions
aleb-0.048
 butcher-0.046
rified-0.045
avored-0.045
Which-0.044
Token that
Token ID326
Feature activation-1.67e-02

Pos logit contributions
ands+0.019
 ago+0.017
 space+0.016
 rain+0.016
 tunnel+0.015
Neg logit contributions
aleb-0.041
 butcher-0.039
Which-0.036
 Junior-0.036
rified-0.036
Token The
Token ID383
Feature activation+3.82e-02

Pos logit contributions
aleb+0.004
 butcher+0.004
 Junior+0.004
Which+0.004
 beef+0.004
Neg logit contributions
ands-0.003
 ago-0.003
 pattern-0.002
 space-0.002
 tunnel-0.002
Token mole
Token ID9411
Feature activation-9.88e-02

Pos logit contributions
ands+0.081
 ago+0.075
 space+0.072
 tunnel+0.072
 rain+0.072
Neg logit contributions
aleb-0.050
 butcher-0.044
 Junior-0.042
Which-0.042
avored-0.039
Token was
Token ID373
Feature activation+2.37e-03

Pos logit contributions
aleb+0.200
 butcher+0.194
Which+0.182
 Junior+0.180
 beef+0.178
Neg logit contributions
ands-0.135
 ago-0.121
 pattern-0.110
 space-0.108
 tunnel-0.107
Token happy
Token ID3772
Feature activation+2.28e-02

Pos logit contributions
ands+0.003
 ago+0.002
 space+0.002
 pattern+0.002
 tunnel+0.002
Neg logit contributions
aleb-0.005
 butcher-0.005
 Junior-0.005
Which-0.005
 beef-0.005
Token because
Token ID780
Feature activation-2.84e-03

Pos logit contributions
ands+0.031
 ago+0.026
 pattern+0.022
 tunnel+0.022
 space+0.022
Neg logit contributions
aleb-0.048
 butcher-0.047
 beef-0.044
 Junior-0.044
 steak-0.044
Token he
Token ID339
Feature activation-3.72e-02

Pos logit contributions
aleb+0.006
 butcher+0.006
 Junior+0.005
Which+0.005
 beef+0.005
Neg logit contributions
ands-0.004
 ago-0.003
 space-0.003
 pattern-0.003
 tunnel-0.003
Token sold
Token ID2702
Feature activation-2.06e-02

Pos logit contributions
aleb+0.072
 butcher+0.070
Which+0.065
 Junior+0.065
 beef+0.065
Neg logit contributions
ands-0.055
 ago-0.049
 space-0.044
 pattern-0.044
 tunnel-0.043
Token all
Token ID477
Feature activation+3.74e-02

Pos logit contributions
aleb+0.046
 butcher+0.044
rified+0.042
Which+0.042
 Junior+0.041
Neg logit contributions
ands-0.023
 space-0.020
 ago-0.020
 work-0.018
 old-0.018
Token his
Token ID465
Feature activation-1.41e-02

Pos logit contributions
ands+0.041
 ago+0.035
 space+0.033
 pattern+0.030
 tunnel+0.029
Neg logit contributions
aleb-0.085
 butcher-0.083
Which-0.079
 Junior-0.078
rified-0.077
Token vegetables
Token ID13701
Feature activation-1.27e-01

Pos logit contributions
aleb+0.021
 butcher+0.020
Which+0.018
 Junior+0.018
rified+0.017
Neg logit contributions
ands-0.028
 ago-0.026
 space-0.024
 pattern-0.023
 tunnel-0.023
Token and
Token ID290
Feature activation-2.09e-02

Pos logit contributions
aleb+0.291
 butcher+0.286
 beef+0.269
 Junior+0.268
 steak+0.265
Neg logit contributions
ands-0.144
 ago-0.120
 pattern-0.102
 tunnel-0.101
 space-0.100
Token.
Token ID13
Feature activation-1.58e-02

Pos logit contributions
ands+0.010
 ago+0.008
 tunnel+0.007
 disl+0.007
 space+0.007
Neg logit contributions
aleb-0.015
 butcher-0.015
 beef-0.014
 steak-0.014
 Junior-0.014
Token "
Token ID366
Feature activation-6.20e-02

Pos logit contributions
aleb+0.029
 butcher+0.029
 Junior+0.027
Which+0.027
 beef+0.027
Neg logit contributions
ands-0.025
 ago-0.021
 pattern-0.020
 tunnel-0.019
 space-0.019
TokenIt
Token ID1026
Feature activation-1.86e-02

Pos logit contributions
aleb+0.084
 butcher+0.084
 Junior+0.075
 beef+0.075
 steak+0.074
Neg logit contributions
ands-0.122
 ago-0.114
 pattern-0.104
 space-0.103
ro-0.102
Token is
Token ID318
Feature activation+2.38e-02

Pos logit contributions
aleb+0.024
 butcher+0.023
Which+0.021
 Junior+0.021
 beef+0.021
Neg logit contributions
ands-0.039
 ago-0.036
 space-0.032
 tunnel-0.032
 pattern-0.032
Token a
Token ID257
Feature activation-2.89e-03

Pos logit contributions
ands+0.033
 ago+0.027
 pattern+0.024
 tunnel+0.024
 hall+0.024
Neg logit contributions
aleb-0.047
 butcher-0.047
 beef-0.046
 Junior-0.045
 steak-0.045
Token trout
Token ID49411
Feature activation-7.48e-02

Pos logit contributions
aleb+0.006
 butcher+0.006
 beef+0.006
Which+0.005
 steak+0.005
Neg logit contributions
ands-0.004
 ago-0.003
 pattern-0.003
 space-0.003
 disl-0.003
Token.
Token ID13
Feature activation+2.12e-02

Pos logit contributions
aleb+0.156
 butcher+0.153
 Junior+0.142
 beef+0.142
Which+0.142
Neg logit contributions
ands-0.100
 ago-0.086
 space-0.075
 pattern-0.075
 tunnel-0.074
Token Do
Token ID2141
Feature activation-2.70e-02

Pos logit contributions
ands+0.035
 ago+0.031
 pattern+0.028
 space+0.028
 tunnel+0.028
Neg logit contributions
aleb-0.038
 butcher-0.037
 Junior-0.034
Which-0.034
 beef-0.034
Token you
Token ID345
Feature activation+5.51e-02

Pos logit contributions
aleb+0.052
 steak+0.051
 butcher+0.051
 beef+0.051
Which+0.050
Neg logit contributions
ands-0.038
 ago-0.034
 hall-0.029
 outage-0.028
 tunnel-0.028
Token want
Token ID765
Feature activation-3.57e-02

Pos logit contributions
ands+0.096
 hall+0.081
 tunnel+0.080
 stairs+0.076
 hallway+0.075
Neg logit contributions
aleb-0.083
 butcher-0.078
 steak-0.076
 french-0.075
 Junior-0.074
Token to
Token ID284
Feature activation+2.00e-02

Pos logit contributions
aleb+0.075
 butcher+0.074
 beef+0.070
Which+0.069
 steak+0.069
Neg logit contributions
ands-0.047
 ago-0.041
 tunnel-0.035
 pattern-0.035
 space-0.034
Token said
Token ID531
Feature activation-1.97e-02

Pos logit contributions
rified+0.023
 beef+0.022
aleb+0.022
Which+0.022
 butcher+0.022
Neg logit contributions
ands-0.029
 ago-0.026
chie-0.024
 outage-0.024
 tunnel-0.022
Token Lily
Token ID20037
Feature activation+4.85e-02

Pos logit contributions
aleb+0.038
 butcher+0.037
 Junior+0.036
 beef+0.036
 turkey+0.035
Neg logit contributions
ands-0.030
 ago-0.025
 tunnel-0.022
 outage-0.020
 disl-0.020
Token.
Token ID13
Feature activation+4.73e-03

Pos logit contributions
ands+0.054
 ago+0.041
 tunnel+0.034
 common+0.034
 space+0.034
Neg logit contributions
aleb-0.114
 butcher-0.112
 Junior-0.107
 beef-0.105
 steak-0.105
Token "
Token ID366
Feature activation-5.16e-02

Pos logit contributions
ands+0.009
 ago+0.008
 tunnel+0.008
chie+0.007
 common+0.007
Neg logit contributions
 butcher-0.007
aleb-0.007
 Junior-0.007
 beef-0.006
 Turkey-0.006
TokenMy
Token ID3666
Feature activation-3.88e-02

Pos logit contributions
aleb+0.085
 butcher+0.080
 Junior+0.079
rified+0.073
 beef+0.072
Neg logit contributions
ands-0.093
 ago-0.078
 pattern-0.077
ro-0.075
 disl-0.074
Token ball
Token ID2613
Feature activation-5.09e-02

Pos logit contributions
aleb+0.071
 butcher+0.070
 Junior+0.065
Which+0.063
 beef+0.063
Neg logit contributions
ands-0.061
 ago-0.056
 pattern-0.052
 space-0.052
 tunnel-0.050
Token doesn
Token ID1595
Feature activation-4.89e-02

Pos logit contributions
aleb+0.092
 butcher+0.090
 beef+0.085
 steak+0.083
Which+0.083
Neg logit contributions
ands-0.081
 ago-0.072
 space-0.061
 tunnel-0.060
 common-0.060
Token't
Token ID470
Feature activation+8.77e-02

Pos logit contributions
aleb+0.114
 butcher+0.113
 Junior+0.107
 beef+0.106
Which+0.106
Neg logit contributions
ands-0.052
 ago-0.045
 tunnel-0.036
 space-0.036
 disl-0.034
Token taste
Token ID6938
Feature activation-6.11e-02

Pos logit contributions
 ago+0.149
ands+0.148
 hall+0.126
 tunnel+0.125
 disl+0.124
Neg logit contributions
 apple-0.130
 steak-0.129
 beef-0.129
 ju-0.128
aleb-0.126
Token nice
Token ID3621
Feature activation-6.81e-03

Pos logit contributions
aleb+0.136
 butcher+0.133
 Junior+0.124
Which+0.124
 beef+0.122
Neg logit contributions
ands-0.074
 ago-0.064
 space-0.055
 pattern-0.055
 tunnel-0.054
Token at
Token ID379
Feature activation-6.34e-04

Pos logit contributions
aleb+0.018
 butcher+0.017
 Junior+0.016
Which+0.016
 beef+0.016
Neg logit contributions
ands-0.006
 ago-0.005
 space-0.004
 pattern-0.004
 tunnel-0.003
Token.
Token ID13
Feature activation+9.32e-03

Pos logit contributions
 butcher+0.054
aleb+0.053
 beef+0.052
 Junior+0.052
 steak+0.051
Neg logit contributions
ands-0.025
 ago-0.019
 tunnel-0.018
 pattern-0.017
chie-0.016
Token They
Token ID1119
Feature activation+1.88e-02

Pos logit contributions
ands+0.015
 ago+0.012
 pattern+0.011
 tunnel+0.011
 space+0.011
Neg logit contributions
 butcher-0.018
aleb-0.018
 Junior-0.017
Which-0.017
 beef-0.016
Token are
Token ID389
Feature activation+3.48e-03

Pos logit contributions
ands+0.029
 ago+0.025
 tunnel+0.021
 pattern+0.021
 space+0.021
Neg logit contributions
aleb-0.036
 butcher-0.035
 Junior-0.033
 beef-0.032
rified-0.032
Token sorry
Token ID7926
Feature activation-2.53e-02

Pos logit contributions
ands+0.005
 ago+0.004
 pattern+0.004
 tunnel+0.004
ro+0.004
Neg logit contributions
 butcher-0.007
aleb-0.007
 beef-0.006
 Junior-0.006
 steak-0.006
Token for
Token ID329
Feature activation-3.17e-02

Pos logit contributions
 beef+0.049
 steak+0.049
 butcher+0.048
Which+0.048
aleb+0.047
Neg logit contributions
ands-0.035
 ago-0.029
chie-0.027
ro-0.025
 common-0.024
Token making
Token ID1642
Feature activation-2.89e-02

Pos logit contributions
 beef+0.054
 butcher+0.054
aleb+0.053
 steak+0.053
 Junior+0.051
Neg logit contributions
ands-0.052
 ago-0.045
ro-0.042
 pattern-0.041
chie-0.040
Token a
Token ID257
Feature activation+4.31e-04

Pos logit contributions
aleb+0.060
 butcher+0.058
 Junior+0.055
 beef+0.055
Which+0.054
Neg logit contributions
ands-0.040
 ago-0.035
 pattern-0.030
 tunnel-0.030
 space-0.030
Token mess
Token ID2085
Feature activation+5.51e-02

Pos logit contributions
ands+0.001
 ago+0.000
 space+0.000
 pattern+0.000
 tunnel+0.000
Neg logit contributions
aleb-0.001
 butcher-0.001
 Junior-0.001
Which-0.001
 beef-0.001
Token.
Token ID13
Feature activation+1.94e-02

Pos logit contributions
ands+0.073
 ago+0.056
 tunnel+0.055
chie+0.054
 pattern+0.054
Neg logit contributions
 beef-0.105
 butcher-0.105
 steak-0.101
aleb-0.100
 Junior-0.100
Token They
Token ID1119
Feature activation+2.51e-02

Pos logit contributions
ands+0.028
 ago+0.024
 pattern+0.021
 tunnel+0.021
 space+0.021
Neg logit contributions
aleb-0.039
 butcher-0.038
Which-0.036
 Junior-0.036
 beef-0.036
Token are
Token ID389
Feature activation+7.15e-03

Pos logit contributions
ands+0.040
 ago+0.035
 tunnel+0.029
 space+0.029
 pattern+0.029
Neg logit contributions
aleb-0.047
 butcher-0.045
 Junior-0.042
 beef-0.042
 steak-0.041
Token having
Token ID1719
Feature activation-1.71e-02

Pos logit contributions
ands+0.016
 ago+0.013
 tunnel+0.011
 space+0.011
 pattern+0.011
Neg logit contributions
aleb-0.024
 butcher-0.023
 beef-0.022
 Junior-0.022
 steak-0.022
Token fun
Token ID1257
Feature activation+3.62e-02

Pos logit contributions
aleb+0.035
 butcher+0.034
 beef+0.034
 steak+0.033
 Junior+0.033
Neg logit contributions
ands-0.023
 ago-0.020
 tunnel-0.017
 pattern-0.017
 hall-0.017
Token.
Token ID13
Feature activation-2.03e-03

Pos logit contributions
ands+0.040
 ago+0.029
 tunnel+0.027
 pattern+0.026
chie+0.026
Neg logit contributions
aleb-0.081
 butcher-0.079
 beef-0.079
 steak-0.077
Which-0.075
Token They
Token ID1119
Feature activation+4.33e-03

Pos logit contributions
aleb+0.004
 butcher+0.004
Which+0.004
 steak+0.004
 beef+0.004
Neg logit contributions
ands-0.003
 ago-0.002
 space-0.002
 pattern-0.002
 tunnel-0.002
Token picked
Token ID6497
Feature activation-2.86e-02

Pos logit contributions
ands+0.006
 ago+0.005
 pattern+0.005
 space+0.005
 tunnel+0.005
Neg logit contributions
 butcher-0.009
aleb-0.009
Which-0.008
 beef-0.008
 Junior-0.008
Token apples
Token ID22514
Feature activation-1.49e-01

Pos logit contributions
aleb+0.064
 butcher+0.062
rified+0.059
mallow+0.059
eled+0.058
Neg logit contributions
 work-0.025
 old-0.024
.-0.023
 fun-0.023
 space-0.022
Token until
Token ID1566
Feature activation-1.99e-02

Pos logit contributions
aleb+0.378
 butcher+0.368
Which+0.351
 Junior+0.347
rified+0.342
Neg logit contributions
ands-0.126
 ago-0.108
 space-0.092
 pattern-0.087
 tunnel-0.084
Token they
Token ID484
Feature activation-1.25e-02

Pos logit contributions
aleb+0.051
 butcher+0.048
Which+0.045
rified+0.045
avored+0.045
Neg logit contributions
 ago-0.016
 rain-0.015
ands-0.014
 work-0.013
 space-0.013
Token were
Token ID547
Feature activation-1.84e-02

Pos logit contributions
aleb+0.027
 butcher+0.027
Which+0.026
 beef+0.026
 Junior+0.025
Neg logit contributions
ands-0.014
 ago-0.013
 grow-0.012
 pattern-0.012
 work-0.011
Token tired
Token ID10032
Feature activation+3.29e-04

Pos logit contributions
aleb+0.037
 butcher+0.036
 Junior+0.034
Which+0.034
 beef+0.033
Neg logit contributions
ands-0.026
 ago-0.023
 space-0.020
 pattern-0.020
 tunnel-0.020
Token and
Token ID290
Feature activation-2.48e-02

Pos logit contributions
ands+0.000
 ago+0.000
 pattern+0.000
 disl+0.000
 tunnel+0.000
Neg logit contributions
aleb-0.001
 butcher-0.001
 Junior-0.001
 beef-0.001
 steak-0.001
TokenWhen
Token ID2215
Feature activation-2.45e-02

Pos logit contributions
aleb+0.014
 butcher+0.014
rified+0.013
 Junior+0.013
 steak+0.012
Neg logit contributions
ands-0.012
 ago-0.010
 space-0.010
 pattern-0.009
 tunnel-0.009
Token they
Token ID484
Feature activation-1.57e-02

Pos logit contributions
aleb+0.057
 butcher+0.054
Which+0.051
rified+0.049
eled+0.049
Neg logit contributions
ands-0.025
 ago-0.023
 space-0.023
 rain-0.022
 work-0.021
Token returned
Token ID4504
Feature activation-2.22e-02

Pos logit contributions
aleb+0.029
 butcher+0.028
 Junior+0.026
Which+0.026
 beef+0.026
Neg logit contributions
ands-0.025
 ago-0.022
 space-0.020
 pattern-0.020
 tunnel-0.020
Token,
Token ID11
Feature activation+9.64e-03

Pos logit contributions
aleb+0.053
 butcher+0.052
 Junior+0.049
Which+0.049
 beef+0.049
Neg logit contributions
ands-0.023
 ago-0.019
 space-0.016
 pattern-0.016
 tunnel-0.016
Token the
Token ID262
Feature activation-7.85e-03

Pos logit contributions
 space+0.009
 rain+0.009
 ago+0.008
ands+0.008
 hall+0.008
Neg logit contributions
aleb-0.023
 butcher-0.022
Which-0.020
urger-0.020
eled-0.020
Token turkey
Token ID26876
Feature activation-1.41e-01

Pos logit contributions
aleb+0.009
Which+0.008
eled+0.008
avored+0.007
rified+0.007
Neg logit contributions
 old-0.017
 rain-0.017
 space-0.017
 voice-0.017
 work-0.016
Token had
Token ID550
Feature activation+2.79e-03

Pos logit contributions
aleb+0.278
 butcher+0.271
Which+0.253
 Junior+0.250
 beef+0.248
Neg logit contributions
ands-0.201
 ago-0.180
 pattern-0.162
 space-0.162
 tunnel-0.160
Token been
Token ID587
Feature activation+5.91e-03

Pos logit contributions
ands+0.005
 ago+0.004
 tunnel+0.004
 hall+0.003
 pattern+0.003
Neg logit contributions
aleb-0.005
 butcher-0.005
 Junior-0.005
 steak-0.005
 beef-0.005
Token able
Token ID1498
Feature activation-6.53e-02

Pos logit contributions
ands+0.009
 ago+0.008
 tunnel+0.007
 disl+0.007
 pattern+0.007
Neg logit contributions
aleb-0.011
 butcher-0.011
 Junior-0.010
 beef-0.010
 steak-0.010
Token to
Token ID284
Feature activation+9.59e-03

Pos logit contributions
aleb+0.116
 butcher+0.113
 beef+0.110
 steak+0.110
 Junior+0.106
Neg logit contributions
ands-0.109
 ago-0.089
 disl-0.083
 tunnel-0.083
 hall-0.082
Token laugh
Token ID6487
Feature activation-1.75e-02

Pos logit contributions
ands+0.016
 ago+0.015
 tunnel+0.012
 hall+0.012
 outage+0.012
Neg logit contributions
aleb-0.017
 Junior-0.015
 butcher-0.015
 Goose-0.015
Stan-0.014
Token's
Token ID338
Feature activation-3.85e-02

Pos logit contributions
ands+0.045
 ago+0.041
 space+0.037
 tunnel+0.037
 pattern+0.037
Neg logit contributions
aleb-0.026
 butcher-0.025
 Junior-0.023
Which-0.023
 beef-0.022
Token useful
Token ID4465
Feature activation-2.51e-03

Pos logit contributions
aleb+0.078
 butcher+0.076
 beef+0.073
 Junior+0.073
Which+0.072
Neg logit contributions
ands-0.054
 ago-0.046
 tunnel-0.039
 pattern-0.039
 disl-0.039
Token,
Token ID11
Feature activation+1.67e-02

Pos logit contributions
aleb+0.005
 butcher+0.005
 beef+0.005
 steak+0.004
 Junior+0.004
Neg logit contributions
ands-0.004
 ago-0.003
 disl-0.003
 tunnel-0.003
 hall-0.003
Token but
Token ID475
Feature activation-5.77e-02

Pos logit contributions
ands+0.028
 ago+0.023
 tunnel+0.020
 space+0.020
 common+0.020
Neg logit contributions
aleb-0.029
 butcher-0.028
 Junior-0.027
Which-0.027
 beef-0.027
Token the
Token ID262
Feature activation-1.78e-04

Pos logit contributions
aleb+0.105
 butcher+0.102
Which+0.099
 Junior+0.098
 beef+0.096
Neg logit contributions
ands-0.093
 ago-0.076
 tunnel-0.069
 pattern-0.068
 hall-0.067
Token best
Token ID1266
Feature activation-1.31e-01

Pos logit contributions
aleb+0.000
 butcher+0.000
 Junior+0.000
Which+0.000
 beef+0.000
Neg logit contributions
ands-0.000
 ago-0.000
 space-0.000
 pattern-0.000
 tunnel-0.000
Token things
Token ID1243
Feature activation+1.25e-02

Pos logit contributions
aleb+0.256
 butcher+0.242
 beef+0.232
Which+0.230
 steak+0.230
Neg logit contributions
ands-0.196
 ago-0.168
 disl-0.147
 tunnel-0.141
 hall-0.141
Token in
Token ID287
Feature activation+2.43e-02

Pos logit contributions
ands+0.020
 ago+0.018
 tunnel+0.017
 pattern+0.016
 hall+0.016
Neg logit contributions
aleb-0.022
 butcher-0.021
 beef-0.020
 Junior-0.020
 steak-0.020
Token life
Token ID1204
Feature activation+5.11e-02

Pos logit contributions
ands+0.038
 ago+0.030
 tunnel+0.030
 pattern+0.029
 disl+0.029
Neg logit contributions
aleb-0.043
 butcher-0.041
 Junior-0.041
 beef-0.041
 steak-0.039
Token are
Token ID389
Feature activation-2.80e-03

Pos logit contributions
ands+0.067
 ago+0.053
 tunnel+0.050
 pattern+0.049
 disl+0.047
Neg logit contributions
aleb-0.103
 butcher-0.099
 beef-0.097
 Junior-0.096
 steak-0.095
Token free
Token ID1479
Feature activation+6.80e-02

Pos logit contributions
aleb+0.005
 butcher+0.005
 beef+0.005
 Junior+0.005
 steak+0.005
Neg logit contributions
ands-0.004
 ago-0.003
 tunnel-0.003
 hall-0.003
ro-0.003
Token and
Token ID290
Feature activation-2.43e-02

Pos logit contributions
aleb+0.065
 butcher+0.064
 Junior+0.059
Which+0.059
 beef+0.059
Neg logit contributions
ands-0.043
 ago-0.037
 space-0.032
 pattern-0.032
 tunnel-0.032
Token the
Token ID262
Feature activation-5.01e-03

Pos logit contributions
aleb+0.063
 butcher+0.061
avored+0.059
Which+0.058
 Junior+0.056
Neg logit contributions
ands-0.017
 space-0.017
 work-0.016
 ago-0.015
 tunnel-0.015
Token rabbit
Token ID22746
Feature activation-7.45e-02

Pos logit contributions
aleb+0.008
 butcher+0.008
 Junior+0.007
Which+0.007
 beef+0.007
Neg logit contributions
ands-0.009
 ago-0.008
 tunnel-0.008
 space-0.008
 pattern-0.008
Token were
Token ID547
Feature activation-1.99e-02

Pos logit contributions
aleb+0.131
 butcher+0.127
 Junior+0.118
Which+0.116
 beef+0.116
Neg logit contributions
ands-0.124
 ago-0.111
 space-0.099
 pattern-0.098
 tunnel-0.098
Token the
Token ID262
Feature activation-4.18e-03

Pos logit contributions
aleb+0.045
 butcher+0.044
rified+0.042
Which+0.041
avored+0.040
Neg logit contributions
ands-0.020
 space-0.018
 ago-0.017
 pattern-0.017
 rain-0.017
Token best
Token ID1266
Feature activation-1.36e-01

Pos logit contributions
aleb+0.007
eled+0.007
 butcher+0.007
 Junior+0.007
Which+0.007
Neg logit contributions
ands-0.006
 space-0.006
 pattern-0.006
 tunnel-0.006
 work-0.006
Token of
Token ID286
Feature activation-6.84e-03

Pos logit contributions
aleb+0.318
 butcher+0.309
Which+0.300
 Junior+0.292
rified+0.285
Neg logit contributions
ands-0.149
 ago-0.129
 space-0.114
 tunnel-0.110
 pattern-0.109
Token friends
Token ID2460
Feature activation+2.99e-02

Pos logit contributions
aleb+0.014
 butcher+0.013
 beef+0.012
 Junior+0.012
Which+0.012
Neg logit contributions
ands-0.010
 ago-0.009
 pattern-0.008
 tunnel-0.008
 space-0.008
Token.
Token ID13
Feature activation-6.84e-03

Pos logit contributions
ands+0.030
 ago+0.025
 pattern+0.020
 space+0.020
 tunnel+0.020
Neg logit contributions
aleb-0.072
 butcher-0.071
 Junior-0.067
Which-0.066
 beef-0.066
Token<|endoftext|>
Token ID50256
Feature activation-9.32e-03

Pos logit contributions
aleb+0.016
 butcher+0.015
Which+0.014
 beef+0.014
 steak+0.014
Neg logit contributions
ands-0.008
 ago-0.007
 space-0.006
 pattern-0.005
 tunnel-0.005
TokenOnce
Token ID7454
Feature activation+1.44e-02

Pos logit contributions
aleb+0.020
 butcher+0.020
Which+0.018
 Junior+0.018
 beef+0.018
Neg logit contributions
ands-0.011
 ago-0.011
 space-0.009
 tunnel-0.009
 pattern-0.009
Token.
Token ID13
Feature activation+2.27e-02

Pos logit contributions
aleb+0.300
 butcher+0.291
Which+0.278
 Junior+0.275
 beef+0.270
Neg logit contributions
ands-0.118
 ago-0.101
 space-0.082
 pattern-0.080
 tunnel-0.080
Token It
Token ID632
Feature activation+1.83e-02

Pos logit contributions
ands+0.031
 ago+0.030
 space+0.025
 disl+0.023
 pattern+0.023
Neg logit contributions
aleb-0.047
 butcher-0.044
Which-0.041
 Junior-0.041
eled-0.040
Token tastes
Token ID18221
Feature activation-7.22e-02

Pos logit contributions
ands+0.029
 ago+0.026
 space+0.023
 pattern+0.023
 tunnel+0.023
Neg logit contributions
aleb-0.034
 butcher-0.033
 Junior-0.030
Which-0.030
 beef-0.030
Token sweet
Token ID6029
Feature activation-7.30e-02

Pos logit contributions
aleb+0.145
 butcher+0.141
 Junior+0.131
Which+0.131
 beef+0.130
Neg logit contributions
ands-0.103
 ago-0.091
 space-0.079
 pattern-0.079
 tunnel-0.078
Token and
Token ID290
Feature activation-1.64e-02

Pos logit contributions
aleb+0.190
 butcher+0.185
 Junior+0.176
Which+0.175
rified+0.171
Neg logit contributions
ands-0.062
 ago-0.052
 space-0.041
 pattern-0.038
 tunnel-0.037
Token juicy
Token ID35075
Feature activation-1.25e-01

Pos logit contributions
aleb+0.029
 butcher+0.028
 Junior+0.026
Which+0.026
 beef+0.026
Neg logit contributions
ands-0.027
 ago-0.024
 space-0.022
 pattern-0.022
 tunnel-0.022
Token.
Token ID13
Feature activation+2.10e-02

Pos logit contributions
aleb+0.345
 butcher+0.336
 Junior+0.322
Which+0.321
rified+0.314
Neg logit contributions
ands-0.085
 ago-0.068
 space-0.049
 pattern-0.043
 tunnel-0.040
Token Do
Token ID2141
Feature activation-3.23e-02

Pos logit contributions
ands+0.030
 ago+0.028
 space+0.024
 disl+0.023
 pattern+0.023
Neg logit contributions
aleb-0.042
 butcher-0.039
 Junior-0.037
Which-0.036
 steak-0.036
Token you
Token ID345
Feature activation+5.67e-02

Pos logit contributions
aleb+0.077
 butcher+0.076
 Junior+0.071
Which+0.071
 beef+0.071
Neg logit contributions
ands-0.034
 ago-0.029
 space-0.024
 pattern-0.023
 tunnel-0.023
Token want
Token ID765
Feature activation-4.58e-02

Pos logit contributions
ands+0.090
 tunnel+0.076
 hall+0.074
 ago+0.074
 space+0.070
Neg logit contributions
aleb-0.098
 butcher-0.094
Which-0.090
 beef-0.088
 steak-0.088
Token some
Token ID617
Feature activation-3.40e-02

Pos logit contributions
aleb+0.104
 butcher+0.098
rified+0.093
 Junior+0.093
avored+0.089
Neg logit contributions
ands-0.055
 ago-0.046
 space-0.044
 rain-0.041
 pattern-0.041