TOP ACTIVATIONS
MAX = 5.008

and she could go as long as she promised to look
to let him go as long as he promised to be
He wanted to delay as long as he could . He
take them with you as long as you don 't eat
his chain was twice as long as before ! He was
chain was now twice as long as it was before .
said it was ok as long as he trusted the vegetable
said it was okay as long as they didn 't make
he 'd be OK as long as he stayed close to
gift every year for as long as she is three .
and watched it for as long as they could . They
you will be safe as long as you stay near me
he could keep it as long as he promised to take
okay to make mistakes as long as you learn from them
okay to make mistakes as long as we learn from them
join them again , as long as he kept his promise
said it was okay as long as he was careful .
stretched out his legs as long as he could and he
hold his breath for as long as he could . One
until his walks were as long as he wanted them to

MIN ACTIVATIONS
MIN = -6.927

Did you know that av oc ados are very healthy for
, " The rh in oc eros made a big mess
turned out that the av oc ados were bad and made
celebrated . The rh in oc eros learned that humans can
fun . The rh in oc eros was happy to help
hurt . The rh in oc eros kept sn orting and
" The rh in oc eros is in there ,
playing ! The rh in oc eros was very worried ,
away . The rh in oc eros felt che ery and
relief . The rh in oc eros was so happy to
gentle . The rh in oc eros is scared of new
he would take the av oc ados and scatter them around
left behind . The av oc ados were round and green
The bad rh in oc eros was so happy .
mess . The rh in oc eros had destroyed her room
The rh in oc eros smiled and flew away
animals . The rh in oc eros was so excited to
The rh in oc eros looked up , hopeful
he ." The rh in oc eros felt a little better
The rh in oc eros was so excited .

INTERVAL 2.024 - 5.008
CONTAINS 0.300%

year old and the shoe turned left and continued on their
the pool and got all wet . When she got out
The mixer went round and round , and the batter became
pped on his stick and fell down .
on the monkey bars and spun on the sees aw .

INTERVAL -0.960 - 2.024
CONTAINS 92.041%

so happy and not embarrassed to ask for help . Together
them all . Mom said they were here to adopt one
the neighbor . She hugs Tom and takes him inside .
heard a loud noise . It was his little sister playing
L ily 's mom my explained that the shrimp

INTERVAL -3.944 - -0.960
CONTAINS 7.655%

inside - a big pile of warm , shiny gold !
" No , you can 't !" Lily said
. Grace was always very careful when playing in the garden
from the snack stand to celebrate and ran around the stadium
dog was the second fastest , but he didn 't complete

INTERVAL -6.927 - -3.944
CONTAINS 0.003%

arrival . The rh in oc eros was no longer hungry
The bad rh in oc eros was so happy .
elephant saw the rh in oc eros z ooming and said
give to the rh in oc eros . Lily walked to
and took the rh in oc eros home with her .
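
Note on the interval boundaries: the four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. The dump does not say how they were computed, so the sketch below is an assumption, and the helper name `equal_width_edges` is hypothetical. Because it starts from the rounded MIN and MAX shown above, its edges agree with the listed boundaries only to within about 0.001.

```python
# Minimal sketch, assuming the dashboard splits [MIN, MAX] into four
# equal-width bins. This is an assumption: the dump does not document it.

def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 boundaries of equal-width bins over [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# MIN and MAX as printed in the dump (already rounded to three decimals).
edges = equal_width_edges(-6.927, 5.008, 4)

# Fraction of activations per interval as listed in the dump, highest first.
percentages = [0.300, 92.041, 7.655, 0.003]
total = sum(percentages)

print([round(e, 3) for e in edges])
print(round(total, 3))
```

The listed percentages sum to roughly 100, which is consistent with the four intervals covering the whole activation range.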
Token and
Token ID290
Feature activation-6.08e-01

Pos logit contributions
bek+0.020
ved+0.018
 Exhibit+0.018
isdom+0.017
rave+0.017
Neg logit contributions
mo-0.020
 camps-0.019
bo-0.018
 numb-0.018
izzy-0.018
Token she
Token ID673
Feature activation+6.19e-01

Pos logit contributions
OO+0.017
bek+0.017
anish+0.016
 Exhibit+0.016
ved+0.016
Neg logit contributions
 then-0.008
 sometimes-0.007
 pour-0.007
 stream-0.006
 dip-0.006
Token could
Token ID714
Feature activation+3.71e-01

Pos logit contributions
bo+0.012
mo+0.012
izzy+0.012
 camps+0.011
Sn+0.011
Neg logit contributions
bek-0.013
ved-0.012
 Exhibit-0.012
adoes-0.011
isdom-0.011
Token go
Token ID467
Feature activation-5.04e-02

Pos logit contributions
mo+0.007
 camps+0.006
bo+0.006
izzy+0.006
 numb+0.006
Neg logit contributions
bek-0.008
ved-0.008
 Exhibit-0.008
 refused-0.007
 won-0.007
Token as
Token ID355
Feature activation-5.96e-01

Pos logit contributions
 won+0.001
bek+0.001
 goodbye+0.001
 refused+0.001
mare+0.001
Neg logit contributions
mo-0.001
 tul-0.001
 camps-0.001
ucky-0.001
Crunch-0.001
Token long
Token ID890
Feature activation+5.01e+00

Pos logit contributions
bek+0.013
ved+0.012
 Exhibit+0.012
 refused+0.011
 Susan+0.011
Neg logit contributions
mo-0.012
 camps-0.011
bo-0.011
 Arctic-0.010
 numb-0.010
Token as
Token ID355
Feature activation+6.90e-01

Pos logit contributions
mo+0.148
 Arctic+0.144
bo+0.142
 camps+0.139
izzy+0.139
Neg logit contributions
 goodbye-0.055
ved-0.046
 Susan-0.045
 refused-0.044
 Steve-0.044
Token she
Token ID673
Feature activation+1.94e+00

Pos logit contributions
mo+0.017
 camps+0.016
bo+0.016
 numb+0.015
 Arctic+0.015
Neg logit contributions
bek-0.011
 Exhibit-0.009
 Susan-0.009
ved-0.009
 Steve-0.009
Token promised
Token ID8072
Feature activation-8.12e-02

Pos logit contributions
mo+0.048
 camps+0.046
 numb+0.045
bo+0.043
 Arctic+0.043
Neg logit contributions
bek-0.031
 refused-0.028
 won-0.028
ved-0.027
 Exhibit-0.027
Token to
Token ID284
Feature activation+8.07e-02

Pos logit contributions
bek+0.003
 Exhibit+0.002
mare+0.002
ved+0.002
 Sorry+0.002
Neg logit contributions
mo-0.001
 camps-0.001
bo-0.001
 end-0.001
izzy-0.001
Token look
Token ID804
Feature activation+1.16e+00

Pos logit contributions
mo+0.001
 camps+0.001
bo+0.001
 numb+0.001
 Arctic+0.001
Neg logit contributions
bek-0.002
 Exhibit-0.002
ved-0.002
isdom-0.002
mare-0.002
Token to
Token ID284
Feature activation+2.07e-02

Pos logit contributions
bek+0.011
 Exhibit+0.011
mare+0.010
isdom+0.010
ved+0.010
Neg logit contributions
 end-0.006
 numb-0.005
bo-0.005
mo-0.005
 chilly-0.005
Token let
Token ID1309
Feature activation-5.46e-02

Pos logit contributions
mo+0.000
 toe+0.000
 camps+0.000
bo+0.000
 bounce+0.000
Neg logit contributions
bek-0.001
 Exhibit-0.001
ved-0.001
isdom-0.000
mare-0.000
Token him
Token ID683
Feature activation+1.23e+00

Pos logit contributions
bek+0.001
ved+0.001
 Exhibit+0.001
mare+0.001
isdom+0.001
Neg logit contributions
mo-0.001
 camps-0.001
bo-0.001
izzy-0.001
 numb-0.001
Token go
Token ID467
Feature activation+1.73e-01

Pos logit contributions
mo+0.020
 camps+0.019
bo+0.018
 numb+0.017
izzy+0.017
Neg logit contributions
bek-0.031
 Exhibit-0.027
ved-0.027
mare-0.027
isdom-0.027
Token as
Token ID355
Feature activation-8.34e-01

Pos logit contributions
mo+0.004
 camps+0.004
bo+0.004
Crunch+0.004
izzy+0.003
Neg logit contributions
bek-0.003
 goodbye-0.003
 won-0.003
mare-0.003
 Susan-0.003
Token long
Token ID890
Feature activation+4.92e+00

Pos logit contributions
bek+0.019
ved+0.017
 Exhibit+0.017
mare+0.017
isdom+0.016
Neg logit contributions
 camps-0.015
mo-0.015
bo-0.014
 numb-0.014
izzy-0.013
Token as
Token ID355
Feature activation+4.75e-01

Pos logit contributions
mo+0.135
izzy+0.131
bo+0.130
Crunch+0.128
 Arctic+0.128
Neg logit contributions
 goodbye-0.062
ved-0.054
 Steve-0.053
 Susan-0.052
bek-0.051
Token he
Token ID339
Feature activation+2.13e+00

Pos logit contributions
mo+0.010
 camps+0.009
bo+0.009
 numb+0.009
izzy+0.009
Neg logit contributions
bek-0.010
ved-0.009
 Exhibit-0.009
mare-0.008
isdom-0.008
Token promised
Token ID8072
Feature activation-2.96e-02

Pos logit contributions
mo+0.048
 camps+0.046
bo+0.044
 numb+0.044
izzy+0.043
Neg logit contributions
bek-0.039
ved-0.035
 Exhibit-0.034
 refused-0.033
mare-0.033
Token to
Token ID284
Feature activation+3.39e-01

Pos logit contributions
bek+0.001
 Exhibit+0.001
mare+0.001
ved+0.001
 vows+0.001
Neg logit contributions
mo-0.000
 camps-0.000
bo-0.000
 end-0.000
 numb-0.000
Token be
Token ID307
Feature activation+1.14e-01

Pos logit contributions
mo+0.005
bo+0.005
 camps+0.005
 toe+0.004
 numb+0.004
Neg logit contributions
bek-0.009
 Exhibit-0.008
ved-0.008
isdom-0.008
mare-0.008
Token He
Token ID679
Feature activation-1.65e-01

Pos logit contributions
mo+0.003
 camps+0.003
bo+0.003
 numb+0.003
izzy+0.003
Neg logit contributions
bek-0.005
ved-0.004
 Exhibit-0.004
mare-0.004
isdom-0.004
Token wanted
Token ID2227
Feature activation+7.05e-01

Pos logit contributions
bek+0.003
 Exhibit+0.003
mare+0.002
 Susan+0.002
ved+0.002
Neg logit contributions
mo-0.004
bo-0.004
 camps-0.004
Sn-0.004
izzy-0.004
Token to
Token ID284
Feature activation+4.15e-01

Pos logit contributions
mo+0.020
 camps+0.019
bo+0.019
 numb+0.018
izzy+0.018
Neg logit contributions
bek-0.009
ved-0.008
 Exhibit-0.008
mare-0.007
isdom-0.007
Token delay
Token ID5711
Feature activation-3.72e-01

Pos logit contributions
mo+0.009
 camps+0.008
izzy+0.008
bo+0.007
 Arctic+0.007
Neg logit contributions
bek-0.008
 refused-0.008
 won-0.008
 Susan-0.008
ved-0.007
Token as
Token ID355
Feature activation+2.65e-02

Pos logit contributions
bek+0.009
 Exhibit+0.008
mare+0.007
ved+0.007
 vows+0.007
Neg logit contributions
 camps-0.007
mo-0.007
 numb-0.007
bo-0.006
 toe-0.006
Token long
Token ID890
Feature activation+4.65e+00

Pos logit contributions
mo+0.001
bo+0.001
 camps+0.001
bath+0.001
 Arctic+0.001
Neg logit contributions
bek-0.000
 Steve-0.000
 refused-0.000
isdom-0.000
 Exhibit-0.000
Token as
Token ID355
Feature activation+6.58e-01

Pos logit contributions
mo+0.120
izzy+0.116
 camps+0.114
bo+0.113
 Arctic+0.112
Neg logit contributions
 goodbye-0.062
bek-0.061
ved-0.061
 refused-0.060
 Steve-0.058
Token he
Token ID339
Feature activation+1.73e+00

Pos logit contributions
mo+0.017
bo+0.015
bath+0.015
 camps+0.015
Crunch+0.015
Neg logit contributions
bek-0.010
 Steve-0.009
 Susan-0.009
 goodbye-0.009
 refused-0.009
Token could
Token ID714
Feature activation+1.78e+00

Pos logit contributions
mo+0.042
 camps+0.041
bo+0.039
 numb+0.038
izzy+0.038
Neg logit contributions
bek-0.029
ved-0.025
 Exhibit-0.025
mare-0.024
isdom-0.024
Token.
Token ID13
Feature activation+1.63e-01

Pos logit contributions
mo+0.029
 camps+0.028
bo+0.026
izzy+0.025
 numb+0.025
Neg logit contributions
bek-0.043
ved-0.040
 refused-0.039
 Exhibit-0.039
mare-0.038
Token He
Token ID679
Feature activation+1.89e-02

Pos logit contributions
mo+0.004
 camps+0.003
bath+0.003
bo+0.003
 numb+0.003
Neg logit contributions
bek-0.003
 won-0.003
 refused-0.003
 Steve-0.003
mare-0.003
Token take
Token ID1011
Feature activation-3.80e-01

Pos logit contributions
bek+0.000
 won+0.000
 vows+0.000
ved+0.000
 refused+0.000
Neg logit contributions
mo-0.000
 camps-0.000
 colder-0.000
 numb-0.000
 Arctic-0.000
Token them
Token ID606
Feature activation+1.46e-01

Pos logit contributions
bek+0.009
 Exhibit+0.008
mare+0.008
ved+0.008
isdom+0.007
Neg logit contributions
bo-0.007
mo-0.007
 camps-0.007
Sn-0.006
bath-0.006
Token with
Token ID351
Feature activation+4.50e-01

Pos logit contributions
 camps+0.003
mo+0.003
bo+0.003
izzy+0.003
bath+0.003
Neg logit contributions
bek-0.003
 Exhibit-0.003
isdom-0.003
mare-0.003
 vows-0.003
Token you
Token ID345
Feature activation-3.11e-01

Pos logit contributions
bo+0.007
 camps+0.007
mo+0.007
 end+0.007
ucky+0.007
Neg logit contributions
bek-0.012
ved-0.010
 Exhibit-0.010
mare-0.010
rave-0.009
Token as
Token ID355
Feature activation-5.19e-01

Pos logit contributions
bek+0.008
 Steve+0.007
 Susan+0.007
 Exhibit+0.007
ved+0.007
Neg logit contributions
 camps-0.005
mo-0.005
bo-0.005
 chilly-0.005
One-0.004
Token long
Token ID890
Feature activation+4.51e+00

Pos logit contributions
bek+0.012
ved+0.011
mare+0.011
 Exhibit+0.011
 refused+0.010
Neg logit contributions
 camps-0.009
 chilly-0.008
mo-0.008
 numb-0.008
ucky-0.008
Token as
Token ID355
Feature activation+3.87e-01

Pos logit contributions
mo+0.080
 camps+0.079
bo+0.074
 numb+0.070
bath+0.069
Neg logit contributions
bek-0.110
 Exhibit-0.095
ved-0.091
rave-0.090
mare-0.089
Token you
Token ID345
Feature activation-2.56e-02

Pos logit contributions
 camps+0.008
mo+0.007
 chilly+0.007
ucky+0.006
foot+0.006
Neg logit contributions
bek-0.009
ved-0.008
mare-0.007
 Exhibit-0.007
rave-0.007
Token don
Token ID836
Feature activation+1.24e+00

Pos logit contributions
bek+0.001
ved+0.001
 Exhibit+0.000
mare+0.000
 refused+0.000
Neg logit contributions
mo-0.000
 camps-0.000
bo-0.000
 numb-0.000
izzy-0.000
Token't
Token ID470
Feature activation+4.79e-01

Pos logit contributions
mo+0.030
 camps+0.029
bo+0.027
 numb+0.027
 Arctic+0.026
Neg logit contributions
bek-0.021
 Exhibit-0.018
ved-0.018
mare-0.018
isdom-0.018
Token eat
Token ID4483
Feature activation+9.20e-01

Pos logit contributions
mo+0.007
bo+0.007
 camps+0.007
 numb+0.007
izzy+0.006
Neg logit contributions
bek-0.012
ved-0.011
 Exhibit-0.011
mare-0.011
 vows-0.011
Token his
Token ID465
Feature activation+1.19e+00

Pos logit contributions
 camps+0.023
mo+0.021
 colder+0.020
bo+0.019
 polar+0.019
Neg logit contributions
bek-0.042
isdom-0.037
rave-0.037
 Exhibit-0.036
mare-0.036
Token chain
Token ID6333
Feature activation+1.42e+00

Pos logit contributions
mo+0.023
 camps+0.023
bo+0.022
 numb+0.021
izzy+0.021
Neg logit contributions
bek-0.025
 Exhibit-0.022
ved-0.022
mare-0.022
isdom-0.021
Token was
Token ID373
Feature activation+1.54e+00

Pos logit contributions
mo+0.027
bo+0.026
 camps+0.026
 numb+0.025
izzy+0.024
Neg logit contributions
bek-0.032
isdom-0.029
 Exhibit-0.028
mare-0.028
 Susan-0.026
Token twice
Token ID5403
Feature activation+7.29e-01

Pos logit contributions
mo+0.031
 camps+0.029
bo+0.028
izzy+0.027
 Arctic+0.027
Neg logit contributions
bek-0.032
ved-0.029
 Exhibit-0.028
 refused-0.028
mare-0.028
Token as
Token ID355
Feature activation-7.68e-01

Pos logit contributions
mo+0.017
 camps+0.016
bo+0.016
izzy+0.016
 Arctic+0.015
Neg logit contributions
bek-0.012
ved-0.011
 Exhibit-0.011
 goodbye-0.011
 Susan-0.011
Token long
Token ID890
Feature activation+4.46e+00

Pos logit contributions
bek+0.020
mare+0.019
ved+0.019
isdom+0.019
 Exhibit+0.018
Neg logit contributions
 camps-0.011
 colder-0.010
 numb-0.010
mo-0.010
 chilly-0.010
Token as
Token ID355
Feature activation+3.58e-01

Pos logit contributions
mo+0.116
izzy+0.113
bo+0.113
 camps+0.111
 Arctic+0.110
Neg logit contributions
 goodbye-0.059
ved-0.057
bek-0.056
 refused-0.055
 won-0.054
Token before
Token ID878
Feature activation+2.42e+00

Pos logit contributions
 camps+0.006
mo+0.006
bo+0.005
 numb+0.005
 Arctic+0.005
Neg logit contributions
bek-0.009
mare-0.008
ved-0.008
 Exhibit-0.008
isdom-0.008
Token!
Token ID0
Feature activation+2.79e-01

Pos logit contributions
mo+0.049
 camps+0.044
bo+0.044
izzy+0.043
 Arctic+0.041
Neg logit contributions
bek-0.050
 Susan-0.046
 Steve-0.046
 Exhibit-0.045
 refused-0.045
Token He
Token ID679
Feature activation-5.75e-01

Pos logit contributions
mo+0.004
 camps+0.004
bo+0.004
 Arctic+0.004
ucky+0.004
Neg logit contributions
bek-0.007
ved-0.006
mare-0.006
 Exhibit-0.006
isdom-0.006
Token was
Token ID373
Feature activation-1.99e-02

Pos logit contributions
bek+0.013
isdom+0.012
mare+0.012
 Exhibit+0.012
 hi+0.012
Neg logit contributions
mo-0.010
bo-0.010
 camps-0.010
 toe-0.009
izzy-0.009
Token chain
Token ID6333
Feature activation+5.39e-02

Pos logit contributions
bek+0.004
 won+0.003
 refused+0.003
ved+0.003
 Susan+0.003
Neg logit contributions
mo-0.005
 camps-0.005
bo-0.004
 numb-0.004
izzy-0.004
Token was
Token ID373
Feature activation+9.12e-01

Pos logit contributions
mo+0.001
 camps+0.001
bo+0.001
 numb+0.001
izzy+0.001
Neg logit contributions
bek-0.001
 Exhibit-0.001
mare-0.001
isdom-0.001
ved-0.001
Token now
Token ID783
Feature activation+2.93e-02

Pos logit contributions
mo+0.017
 camps+0.016
bo+0.016
izzy+0.015
 numb+0.015
Neg logit contributions
bek-0.020
ved-0.018
 Exhibit-0.018
mare-0.018
 refused-0.018
Token twice
Token ID5403
Feature activation+3.54e-01

Pos logit contributions
mo+0.000
 camps+0.000
bo+0.000
 numb+0.000
izzy+0.000
Neg logit contributions
bek-0.001
 Exhibit-0.001
mare-0.001
ved-0.001
isdom-0.001
Token as
Token ID355
Feature activation-7.12e-01

Pos logit contributions
mo+0.007
 camps+0.007
bo+0.007
izzy+0.007
 numb+0.007
Neg logit contributions
bek-0.007
ved-0.006
 Exhibit-0.006
isdom-0.006
mare-0.006
Token long
Token ID890
Feature activation+4.41e+00

Pos logit contributions
bek+0.018
mare+0.017
ved+0.017
 Exhibit+0.017
isdom+0.016
Neg logit contributions
 camps-0.011
 colder-0.010
mo-0.010
 chilly-0.010
 numb-0.010
Token as
Token ID355
Feature activation+3.84e-01

Pos logit contributions
mo+0.108
 camps+0.104
bo+0.103
izzy+0.101
 Arctic+0.100
Neg logit contributions
bek-0.067
ved-0.063
 goodbye-0.063
 Susan-0.061
 refused-0.061
Token it
Token ID340
Feature activation+2.07e+00

Pos logit contributions
 camps+0.006
mo+0.006
 colder+0.006
 chilly+0.006
Where+0.006
Neg logit contributions
bek-0.009
mare-0.009
ved-0.009
 Exhibit-0.008
isdom-0.008
Token was
Token ID373
Feature activation+1.87e+00

Pos logit contributions
mo+0.044
 camps+0.043
bo+0.042
izzy+0.040
 numb+0.040
Neg logit contributions
bek-0.041
 Exhibit-0.036
ved-0.035
mare-0.035
isdom-0.035
Token before
Token ID878
Feature activation+1.89e+00

Pos logit contributions
mo+0.035
 camps+0.034
bo+0.033
 numb+0.032
izzy+0.031
Neg logit contributions
bek-0.041
 Exhibit-0.037
mare-0.036
ved-0.036
isdom-0.035
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
 camps+0.023
mo+0.022
 numb+0.021
bo+0.020
 chilly+0.019
Neg logit contributions
bek-0.056
ved-0.051
 Exhibit-0.050
mare-0.050
isdom-0.049
Token said
Token ID531
Feature activation-9.67e-02

Pos logit contributions
bek+0.014
 Exhibit+0.013
adoes+0.012
mare+0.012
 culture+0.012
Neg logit contributions
bo-0.011
mo-0.011
izzy-0.009
 polar-0.009
 camps-0.009
Token it
Token ID340
Feature activation-2.75e-01

Pos logit contributions
bek+0.002
 Exhibit+0.002
rave+0.002
mare+0.002
adoes+0.002
Neg logit contributions
mo-0.002
bo-0.001
Sn-0.001
 end-0.001
ucky-0.001
Token was
Token ID373
Feature activation+6.84e-01

Pos logit contributions
bek+0.008
ved+0.007
isdom+0.007
mare+0.007
 Exhibit+0.006
Neg logit contributions
bo-0.004
chy-0.004
mo-0.004
 chilly-0.004
�-0.003
Token ok
Token ID12876
Feature activation-1.18e+00

Pos logit contributions
 camps+0.012
 chilly+0.011
mo+0.011
 numb+0.011
bo+0.011
Neg logit contributions
bek-0.017
 Exhibit-0.015
ved-0.015
 vows-0.014
mare-0.014
Token as
Token ID355
Feature activation-9.97e-01

Pos logit contributions
bek+0.028
ved+0.026
 Exhibit+0.026
isdom+0.024
 vows+0.024
Neg logit contributions
mo-0.021
 camps-0.019
bo-0.019
 chilly-0.018
 numb-0.017
Token long
Token ID890
Feature activation+4.40e+00

Pos logit contributions
ved+0.026
bek+0.026
rave+0.024
mare+0.024
 Exhibit+0.023
Neg logit contributions
 camps-0.015
 chilly-0.014
 polar-0.014
 Arctic-0.013
 colder-0.013
Token as
Token ID355
Feature activation+2.46e-01

Pos logit contributions
mo+0.086
 camps+0.084
bo+0.079
 numb+0.076
 Arctic+0.074
Neg logit contributions
bek-0.097
 Exhibit-0.085
ved-0.082
mare-0.082
isdom-0.080
Token he
Token ID339
Feature activation+1.47e+00

Pos logit contributions
 camps+0.004
mo+0.004
 chilly+0.004
 colder+0.004
 Arctic+0.004
Neg logit contributions
bek-0.006
ved-0.006
mare-0.006
rave-0.005
 Exhibit-0.005
Token trusted
Token ID13467
Feature activation+1.93e-01

Pos logit contributions
mo+0.033
 camps+0.032
bo+0.031
 numb+0.030
izzy+0.030
Neg logit contributions
bek-0.027
ved-0.023
 Exhibit-0.023
mare-0.023
isdom-0.022
Token the
Token ID262
Feature activation+2.51e-01

Pos logit contributions
mo+0.003
 camps+0.003
 end+0.002
bo+0.002
One+0.002
Neg logit contributions
bek-0.005
mare-0.005
 Exhibit-0.005
ved-0.005
rave-0.005
Token vegetable
Token ID20236
Feature activation-1.56e-01

Pos logit contributions
mo+0.005
 camps+0.005
bo+0.005
 Arctic+0.005
 numb+0.005
Neg logit contributions
bek-0.005
ved-0.005
 Exhibit-0.004
mare-0.004
isdom-0.004
Token said
Token ID531
Feature activation+1.57e-01

Pos logit contributions
mo+0.003
bo+0.003
 camps+0.003
izzy+0.003
bath+0.002
Neg logit contributions
bek-0.002
 Exhibit-0.002
mare-0.002
ved-0.002
isdom-0.002
Token it
Token ID340
Feature activation-5.35e-02

Pos logit contributions
mo+0.003
ucky+0.003
Sn+0.003
 tul+0.003
izzy+0.003
Neg logit contributions
bek-0.004
 Exhibit-0.003
mare-0.003
rave-0.003
adoes-0.003
Token was
Token ID373
Feature activation+7.26e-01

Pos logit contributions
bek+0.001
isdom+0.001
mare+0.001
 Exhibit+0.001
ved+0.001
Neg logit contributions
bo-0.001
chy-0.001
mo-0.001
izzy-0.001
Sn-0.001
Token okay
Token ID8788
Feature activation-3.95e-01

Pos logit contributions
mo+0.015
 camps+0.014
bo+0.014
 numb+0.013
izzy+0.013
Neg logit contributions
bek-0.015
 Exhibit-0.014
ved-0.013
mare-0.013
isdom-0.013
Token as
Token ID355
Feature activation-7.88e-01

Pos logit contributions
bek+0.008
 Exhibit+0.007
ved+0.007
 vows+0.007
rave+0.007
Neg logit contributions
mo-0.008
 camps-0.008
izzy-0.007
bo-0.007
 numb-0.007
Token long
Token ID890
Feature activation+4.36e+00

Pos logit contributions
bek+0.021
ved+0.020
 Exhibit+0.019
mare+0.018
rave+0.018
Neg logit contributions
 camps-0.011
 tul-0.011
 chilly-0.011
mo-0.011
izzy-0.010
Token as
Token ID355
Feature activation+4.01e-01

Pos logit contributions
mo+0.097
 camps+0.094
bo+0.091
izzy+0.087
 numb+0.087
Neg logit contributions
bek-0.080
ved-0.071
 Exhibit-0.070
 refused-0.069
mare-0.068
Token they
Token ID484
Feature activation+9.22e-01

Pos logit contributions
 camps+0.007
mo+0.007
 tul+0.007
izzy+0.007
 numb+0.007
Neg logit contributions
bek-0.009
ved-0.008
 Exhibit-0.008
mare-0.008
rave-0.008
Token didn
Token ID1422
Feature activation+8.41e-01

Pos logit contributions
mo+0.019
 camps+0.018
bo+0.017
izzy+0.017
 numb+0.016
Neg logit contributions
bek-0.019
 Exhibit-0.017
ved-0.017
mare-0.017
isdom-0.016
Token't
Token ID470
Feature activation+3.02e-02

Pos logit contributions
mo+0.023
 camps+0.023
 Arctic+0.021
 polar+0.021
 numb+0.021
Neg logit contributions
bek-0.010
ved-0.009
rave-0.009
 Exhibit-0.009
 Susan-0.009
Token make
Token ID787
Feature activation+3.74e-01

Pos logit contributions
mo+0.000
bo+0.000
 camps+0.000
izzy+0.000
 numb+0.000
Neg logit contributions
bek-0.001
 Exhibit-0.001
mare-0.001
ved-0.001
 vows-0.001
Token he
Token ID339
Feature activation+3.33e-01

Pos logit contributions
mo+0.002
 camps+0.002
 end+0.002
 chilly+0.002
 toe+0.002
Neg logit contributions
bek-0.003
mare-0.003
 Exhibit-0.003
 Sorry-0.002
 vows-0.002
Token'd
Token ID1549
Feature activation-8.09e-01

Pos logit contributions
mo+0.007
bo+0.006
 camps+0.006
izzy+0.006
 Arctic+0.005
Neg logit contributions
bek-0.007
mare-0.007
 Exhibit-0.006
ved-0.006
isdom-0.006
Token be
Token ID307
Feature activation+3.06e-01

Pos logit contributions
bek+0.020
 Exhibit+0.018
adoes+0.017
rave+0.017
mare+0.017
Neg logit contributions
mo-0.013
 end-0.012
 camps-0.012
bo-0.012
 toe-0.012
Token OK
Token ID7477
Feature activation+4.64e-01

Pos logit contributions
mo+0.005
bo+0.005
 camps+0.005
Where+0.005
 chilly+0.005
Neg logit contributions
bek-0.008
 Exhibit-0.007
isdom-0.007
 vows-0.007
 Sale-0.007
Token as
Token ID355
Feature activation-7.10e-01

Pos logit contributions
mo+0.008
 camps+0.008
bo+0.007
izzy+0.007
 numb+0.007
Neg logit contributions
bek-0.011
 Exhibit-0.010
ved-0.010
mare-0.010
 vows-0.010
Token long
Token ID890
Feature activation+4.35e+00

Pos logit contributions
bek+0.018
ved+0.016
mare+0.016
 Exhibit+0.016
 vows+0.015
Neg logit contributions
mo-0.011
 camps-0.011
 chilly-0.011
 polar-0.010
 Arctic-0.010
Token as
Token ID355
Feature activation+3.86e-01

Pos logit contributions
mo+0.098
 camps+0.095
bo+0.092
izzy+0.090
 numb+0.089
Neg logit contributions
bek-0.077
ved-0.069
 Exhibit-0.068
 refused-0.067
 goodbye-0.066
Token he
Token ID339
Feature activation+1.83e+00

Pos logit contributions
 camps+0.006
mo+0.006
bo+0.005
 chilly+0.005
 Arctic+0.005
Neg logit contributions
bek-0.010
mare-0.009
ved-0.009
 Exhibit-0.008
 vows-0.008
Token stayed
Token ID9658
Feature activation-2.38e-01

Pos logit contributions
mo+0.040
 camps+0.039
bo+0.038
izzy+0.036
 Arctic+0.035
Neg logit contributions
bek-0.035
 Exhibit-0.030
ved-0.030
mare-0.030
isdom-0.029
Token close
Token ID1969
Feature activation+7.48e-01

Pos logit contributions
bek+0.006
 Exhibit+0.005
 Sale+0.005
 vows+0.005
ved+0.005
Neg logit contributions
 camps-0.004
mo-0.004
bo-0.004
 numb-0.004
 chilly-0.004
Token to
Token ID284
Feature activation-4.63e-01

Pos logit contributions
mo+0.016
 camps+0.016
bo+0.015
 numb+0.015
 chilly+0.014
Neg logit contributions
bek-0.015
 Exhibit-0.013
ved-0.013
mare-0.013
isdom-0.012
Token gift
Token ID6979
Feature activation-2.98e-01

Pos logit contributions
 camps+0.007
 end+0.006
 chilly+0.006
 bath+0.006
 tul+0.006
Neg logit contributions
bek-0.007
ved-0.007
adoes-0.006
 Susan-0.006
isdom-0.006
Token every
Token ID790
Feature activation+1.16e+00

Pos logit contributions
bek+0.008
 Exhibit+0.007
rave+0.006
ved+0.006
isdom+0.006
Neg logit contributions
 end-0.004
 camps-0.004
mo-0.004
izzy-0.004
bo-0.004
Token year
Token ID614
Feature activation-1.03e+00

Pos logit contributions
mo+0.018
 camps+0.017
bo+0.017
 numb+0.016
izzy+0.016
Neg logit contributions
bek-0.029
ved-0.027
 Exhibit-0.026
isdom-0.026
mare-0.025
Token for
Token ID329
Feature activation+9.29e-02

Pos logit contributions
bek+0.027
ved+0.024
 Exhibit+0.024
mare+0.024
rave+0.023
Neg logit contributions
mo-0.015
bo-0.015
 camps-0.014
ucky-0.013
izzy-0.013
Token as
Token ID355
Feature activation+7.39e-01

Pos logit contributions
 end+0.001
ucky+0.001
 End+0.001
 camps+0.001
 toe+0.001
Neg logit contributions
bek-0.002
ved-0.002
 Exhibit-0.002
 won-0.002
rave-0.002
Token long
Token ID890
Feature activation+4.34e+00

Pos logit contributions
 camps+0.012
 chilly+0.011
mo+0.011
 tul+0.011
izzy+0.011
Neg logit contributions
bek-0.018
ved-0.017
mare-0.016
 Exhibit-0.016
rave-0.016
Token as
Token ID355
Feature activation+1.07e+00

Pos logit contributions
mo+0.081
 camps+0.080
bo+0.074
 numb+0.073
bath+0.071
Neg logit contributions
bek-0.100
 Exhibit-0.087
ved-0.084
mare-0.083
rave-0.082
Token she
Token ID673
Feature activation+2.00e+00

Pos logit contributions
 camps+0.018
 chilly+0.017
mo+0.016
 tul+0.016
Where+0.016
Neg logit contributions
bek-0.026
ved-0.024
 Exhibit-0.022
mare-0.022
rave-0.022
Token is
Token ID318
Feature activation+1.67e+00

Pos logit contributions
mo+0.042
bo+0.041
 camps+0.041
izzy+0.039
bath+0.037
Neg logit contributions
bek-0.040
 Exhibit-0.034
 Susan-0.033
ved-0.033
mare-0.033
Token three
Token ID1115
Feature activation+1.31e+00

Pos logit contributions
mo+0.027
bo+0.025
 camps+0.025
 chilly+0.025
 end+0.024
Neg logit contributions
bek-0.042
 Exhibit-0.036
adoes-0.034
mare-0.034
rav-0.034
Token.
Token ID13
Feature activation+4.44e-01

Pos logit contributions
mo+0.025
 camps+0.024
bo+0.023
 numb+0.022
 Arctic+0.021
Neg logit contributions
bek-0.029
 Exhibit-0.026
ved-0.026
mare-0.026
 Susan-0.025
Token and
Token ID290
Feature activation-4.59e-01

Pos logit contributions
bek+0.008
 Exhibit+0.007
mare+0.007
isdom+0.007
rave+0.007
Neg logit contributions
mo-0.007
 camps-0.006
bo-0.006
bath-0.006
 end-0.005
Token watched
Token ID7342
Feature activation+1.52e+00

Pos logit contributions
isdom+0.013
 Exhibit+0.012
anish+0.012
rave+0.012
adoes+0.012
Neg logit contributions
 polar-0.006
 then-0.006
 stream-0.006
bo-0.006
 camps-0.005
Token it
Token ID340
Feature activation+1.28e+00

Pos logit contributions
mo+0.030
 camps+0.029
bo+0.027
 numb+0.027
 chilly+0.027
Neg logit contributions
bek-0.033
 Exhibit-0.029
mare-0.028
ved-0.027
 refused-0.027
Token for
Token ID329
Feature activation-2.27e-01

Pos logit contributions
mo+0.021
 camps+0.020
bo+0.019
 numb+0.018
izzy+0.018
Neg logit contributions
bek-0.032
ved-0.028
 Exhibit-0.028
mare-0.028
isdom-0.028
Token as
Token ID355
Feature activation-1.76e-02

Pos logit contributions
bek+0.006
 Exhibit+0.005
ved+0.005
 refused+0.005
mare+0.005
Neg logit contributions
 camps-0.004
mo-0.003
 numb-0.003
 polar-0.003
Where-0.003
Token long
Token ID890
Feature activation+4.32e+00

Pos logit contributions
bek+0.000
ved+0.000
mare+0.000
isdom+0.000
rave+0.000
Neg logit contributions
 camps-0.000
 chilly-0.000
 colder-0.000
 polar-0.000
mo-0.000
Token as
Token ID355
Feature activation+7.05e-01

Pos logit contributions
mo+0.110
bo+0.104
izzy+0.104
 camps+0.103
 Arctic+0.101
Neg logit contributions
 goodbye-0.060
bek-0.060
ved-0.058
 refused-0.056
 Susan-0.055
Token they
Token ID484
Feature activation+1.18e+00

Pos logit contributions
 camps+0.015
mo+0.014
 colder+0.014
 chilly+0.014
 numb+0.014
Neg logit contributions
bek-0.014
ved-0.012
mare-0.012
isdom-0.012
 Exhibit-0.012
Token could
Token ID714
Feature activation+1.52e+00

Pos logit contributions
mo+0.023
 camps+0.023
bo+0.022
izzy+0.020
 numb+0.020
Neg logit contributions
bek-0.025
 Susan-0.022
isdom-0.022
mare-0.022
 Exhibit-0.022
Token.
Token ID13
Feature activation+4.37e-01

Pos logit contributions
mo+0.025
 camps+0.024
bo+0.022
izzy+0.022
 numb+0.021
Neg logit contributions
bek-0.037
ved-0.034
 Exhibit-0.034
 refused-0.033
mare-0.033
Token They
Token ID1119
Feature activation-6.63e-01

Pos logit contributions
 camps+0.008
mo+0.008
 Arctic+0.007
bo+0.007
izzy+0.007
Neg logit contributions
bek-0.010
ved-0.009
 Exhibit-0.009
isdom-0.009
rave-0.008
Token you
Token ID345
Feature activation-9.95e-01

Pos logit contributions
mo+0.012
bo+0.012
 camps+0.012
 chilly+0.012
 numb+0.011
Neg logit contributions
bek-0.014
 Exhibit-0.013
ved-0.012
isdom-0.012
mare-0.012
Token will
Token ID481
Feature activation-5.29e-01

Pos logit contributions
bek+0.023
isdom+0.019
 Exhibit+0.019
 Susan+0.019
 Steve+0.019
Neg logit contributions
mo-0.019
bo-0.018
 camps-0.018
 numb-0.017
 chilly-0.017
Token be
Token ID307
Feature activation-2.49e-02

Pos logit contributions
bek+0.013
 Exhibit+0.011
ved+0.011
isdom+0.011
 refused+0.011
Neg logit contributions
mo-0.009
 camps-0.009
bo-0.009
 numb-0.009
izzy-0.008
Token safe
Token ID3338
Feature activation-4.28e-01

Pos logit contributions
bek+0.001
 Exhibit+0.001
ved+0.001
 vows+0.001
isdom+0.001
Neg logit contributions
mo-0.000
bo-0.000
 numb-0.000
 camps-0.000
 chilly-0.000
Token as
Token ID355
Feature activation+1.37e-01

Pos logit contributions
bek+0.011
 Exhibit+0.009
ved+0.009
isdom+0.009
mare+0.009
Neg logit contributions
 camps-0.008
mo-0.008
bo-0.007
 numb-0.007
 chilly-0.007
Token long
Token ID890
Feature activation+4.32e+00

Pos logit contributions
mo+0.003
 camps+0.003
bo+0.003
 numb+0.003
izzy+0.003
Neg logit contributions
bek-0.003
ved-0.003
 Exhibit-0.002
mare-0.002
isdom-0.002
Token as
Token ID355
Feature activation+1.02e+00

Pos logit contributions
mo+0.091
 camps+0.088
bo+0.084
 numb+0.082
izzy+0.080
Neg logit contributions
bek-0.087
 Exhibit-0.076
ved-0.076
mare-0.073
isdom-0.073
Token you
Token ID345
Feature activation+1.09e-01

Pos logit contributions
 camps+0.022
mo+0.021
bo+0.020
 chilly+0.020
izzy+0.020
Neg logit contributions
bek-0.020
ved-0.018
 Exhibit-0.017
isdom-0.017
mare-0.017
Token stay
Token ID2652
Feature activation-1.95e-01

Pos logit contributions
mo+0.002
 camps+0.002
bo+0.002
izzy+0.002
 numb+0.002
Neg logit contributions
bek-0.003
ved-0.002
 Exhibit-0.002
mare-0.002
isdom-0.002
Token near
Token ID1474
Feature activation+2.71e-01

Pos logit contributions
bek+0.004
 Exhibit+0.004
ved+0.004
isdom+0.004
mare+0.004
Neg logit contributions
mo-0.004
 camps-0.004
bo-0.003
 numb-0.003
 Arctic-0.003
Token me
Token ID502
Feature activation+1.56e-01

Pos logit contributions
mo+0.006
 camps+0.006
bo+0.005
 numb+0.005
 Arctic+0.005
Neg logit contributions
bek-0.006
ved-0.005
 Exhibit-0.005
mare-0.005
 refused-0.005
Token he
Token ID339
Feature activation-2.50e-01

Pos logit contributions
bek+0.004
 Exhibit+0.003
rave+0.003
mare+0.003
ved+0.003
Neg logit contributions
 end-0.001
 numb-0.001
mo-0.001
 chilly-0.001
Sn-0.001
Token could
Token ID714
Feature activation+7.76e-01

Pos logit contributions
bek+0.006
mare+0.006
adoes+0.005
ved+0.005
 Exhibit+0.005
Neg logit contributions
bo-0.004
mo-0.004
izzy-0.004
chy-0.003
 camps-0.003
Token keep
Token ID1394
Feature activation-2.60e-01

Pos logit contributions
mo+0.016
 camps+0.015
izzy+0.014
bo+0.014
 numb+0.014
Neg logit contributions
bek-0.016
ved-0.014
 won-0.014
isdom-0.014
 refused-0.014
Token it
Token ID340
Feature activation+1.19e-01

Pos logit contributions
bek+0.006
ved+0.005
mare+0.005
 vows+0.005
 Exhibit+0.005
Neg logit contributions
mo-0.005
 camps-0.005
bo-0.005
 polar-0.004
 numb-0.004
Token as
Token ID355
Feature activation-4.08e-01

Pos logit contributions
mo+0.003
 camps+0.003
bo+0.003
 numb+0.003
izzy+0.002
Neg logit contributions
bek-0.002
 Exhibit-0.002
ved-0.002
mare-0.002
isdom-0.002
Token long
Token ID890
Feature activation+4.32e+00

Pos logit contributions
bek+0.009
ved+0.009
 Exhibit+0.008
mare+0.008
rave+0.008
Neg logit contributions
 camps-0.007
mo-0.007
 numb-0.006
 chilly-0.006
bo-0.006
Token as
Token ID355
Feature activation+5.44e-01

Pos logit contributions
mo+0.104
 camps+0.100
bo+0.099
izzy+0.098
 Arctic+0.096
Neg logit contributions
bek-0.067
ved-0.062
 goodbye-0.062
 refused-0.060
 won-0.059
Token he
Token ID339
Feature activation+1.71e+00

Pos logit contributions
 camps+0.010
mo+0.010
bo+0.009
 numb+0.009
izzy+0.009
Neg logit contributions
bek-0.012
ved-0.011
mare-0.011
 Exhibit-0.011
 refused-0.010
Token promised
Token ID8072
Feature activation-1.30e-02

Pos logit contributions
mo+0.040
 camps+0.039
bo+0.037
 numb+0.036
izzy+0.036
Neg logit contributions
bek-0.030
ved-0.026
 Exhibit-0.026
 refused-0.025
mare-0.025
Token to
Token ID284
Feature activation+1.08e-01

Pos logit contributions
bek+0.000
mare+0.000
 Exhibit+0.000
ved+0.000
 vows+0.000
Neg logit contributions
mo-0.000
 camps-0.000
bo-0.000
One-0.000
foot-0.000
Token take
Token ID1011
Feature activation-3.18e-01

Pos logit contributions
mo+0.002
 camps+0.002
bo+0.002
 numb+0.002
 Arctic+0.002
Neg logit contributions
bek-0.003
 Exhibit-0.002
ved-0.002
mare-0.002
 refused-0.002
Token okay
Token ID8788
Feature activation-1.88e-02

Pos logit contributions
mo+0.026
 camps+0.025
bo+0.024
 numb+0.023
izzy+0.023
Neg logit contributions
bek-0.020
 Exhibit-0.017
ved-0.017
mare-0.017
isdom-0.016
Token to
Token ID284
Feature activation-3.22e-01

Pos logit contributions
bek+0.000
 Exhibit+0.000
ved+0.000
mare+0.000
 vows+0.000
Neg logit contributions
mo-0.000
 camps-0.000
bo-0.000
izzy-0.000
 numb-0.000
Token make
Token ID787
Feature activation+2.19e-01

Pos logit contributions
bek+0.007
ved+0.006
 Exhibit+0.006
 refused+0.006
isdom+0.006
Neg logit contributions
mo-0.007
 camps-0.007
bo-0.006
 numb-0.006
izzy-0.006
Token mistakes
Token ID10135
Feature activation+7.49e-01

Pos logit contributions
mo+0.005
 camps+0.005
bo+0.004
izzy+0.004
 numb+0.004
Neg logit contributions
bek-0.004
 Exhibit-0.004
ved-0.004
isdom-0.003
 Susan-0.003
Token as
Token ID355
Feature activation-4.94e-01

Pos logit contributions
mo+0.010
 camps+0.009
bo+0.009
 numb+0.009
izzy+0.008
Neg logit contributions
bek-0.021
 Exhibit-0.019
mare-0.019
ved-0.019
isdom-0.018
Token long
Token ID890
Feature activation+4.28e+00

Pos logit contributions
bek+0.010
ved+0.009
mare+0.009
 Exhibit+0.009
 refused+0.009
Neg logit contributions
mo-0.010
 camps-0.010
bo-0.009
 numb-0.009
izzy-0.009
Token as
Token ID355
Feature activation+4.55e-01

Pos logit contributions
mo+0.090
 camps+0.088
bo+0.084
 numb+0.081
izzy+0.080
Neg logit contributions
bek-0.085
ved-0.074
 Exhibit-0.074
mare-0.073
isdom-0.072
Token you
Token ID345
Feature activation+4.56e-02

Pos logit contributions
mo+0.010
 camps+0.009
bo+0.009
 numb+0.009
izzy+0.009
Neg logit contributions
bek-0.009
ved-0.008
 Exhibit-0.008
mare-0.008
isdom-0.008
Token learn
Token ID2193
Feature activation+4.41e-01

Pos logit contributions
mo+0.001
 camps+0.001
bo+0.001
izzy+0.001
 numb+0.001
Neg logit contributions
bek-0.001
 Exhibit-0.001
ved-0.001
mare-0.001
isdom-0.001
Token from
Token ID422
Feature activation-2.97e-01

Pos logit contributions
 camps+0.003
 smaller+0.003
 end+0.003
One+0.003
mo+0.003
Neg logit contributions
bek-0.015
mare-0.014
 Exhibit-0.013
lder-0.013
ved-0.013
Token them
Token ID606
Feature activation+2.80e-01

Pos logit contributions
bek+0.006
mare+0.005
ved+0.005
 Exhibit+0.005
 refused+0.005
Neg logit contributions
mo-0.006
 camps-0.006
bo-0.006
 numb-0.005
izzy-0.005
Token okay
Token ID8788
Feature activation-1.73e-01

Pos logit contributions
mo+0.014
 camps+0.014
bo+0.014
 chilly+0.013
Sn+0.013
Neg logit contributions
bek-0.013
 Exhibit-0.012
 vows-0.011
ved-0.011
mare-0.011
Token to
Token ID284
Feature activation+1.50e-01

Pos logit contributions
bek+0.004
ved+0.003
 Exhibit+0.003
 vows+0.003
mare+0.003
Neg logit contributions
mo-0.004
bo-0.003
 camps-0.003
ucky-0.003
 toe-0.003
Token make
Token ID787
Feature activation-5.17e-03

Pos logit contributions
mo+0.003
 camps+0.003
bo+0.003
izzy+0.003
 Arctic+0.003
Neg logit contributions
bek-0.003
 won-0.003
 refused-0.003
isdom-0.003
 Susan-0.003
Token mistakes
Token ID10135
Feature activation+7.80e-01

Pos logit contributions
bek+0.000
 Exhibit+0.000
ved+0.000
mare+0.000
 refused+0.000
Neg logit contributions
mo-0.000
 camps-0.000
bo-0.000
ucky-0.000
 numb-0.000
Token as
Token ID355
Feature activation-1.32e-01

Pos logit contributions
mo+0.011
 camps+0.010
bo+0.010
 numb+0.010
bath+0.010
Neg logit contributions
bek-0.022
 Exhibit-0.019
ved-0.019
mare-0.019
 vows-0.019
Token long
Token ID890
Feature activation+4.26e+00

Pos logit contributions
bek+0.003
ved+0.003
mare+0.003
 Exhibit+0.003
 vows+0.002
Neg logit contributions
 camps-0.002
 numb-0.002
mo-0.002
ucky-0.002
 chilly-0.002
Token as
Token ID355
Feature activation+6.12e-01

Pos logit contributions
mo+0.072
 camps+0.072
 numb+0.068
bo+0.065
bath+0.063
Neg logit contributions
bek-0.109
 Exhibit-0.092
ved-0.091
mare-0.089
rave-0.087
Token we
Token ID356
Feature activation+7.16e-01

Pos logit contributions
 camps+0.012
mo+0.011
 tul+0.011
 chilly+0.011
 numb+0.011
Neg logit contributions
bek-0.014
ved-0.012
mare-0.012
 Exhibit-0.012
 refused-0.011
Token learn
Token ID2193
Feature activation+5.18e-01

Pos logit contributions
mo+0.012
 camps+0.012
bo+0.012
izzy+0.011
 numb+0.011
Neg logit contributions
bek-0.017
ved-0.015
 Exhibit-0.015
mare-0.015
isdom-0.015
Token from
Token ID422
Feature activation-3.20e-01

Pos logit contributions
 camps+0.005
 colder+0.005
 end+0.005
 smaller+0.005
 chilly+0.004
Neg logit contributions
bek-0.017
mare-0.016
 Exhibit-0.015
ved-0.014
 Steve-0.014
Token them
Token ID606
Feature activation+7.26e-01

Pos logit contributions
bek+0.006
ved+0.006
 Exhibit+0.006
mare+0.005
 refused+0.005
Neg logit contributions
mo-0.007
 camps-0.007
bo-0.006
 tul-0.006
 numb-0.006
Token join
Token ID4654
Feature activation-3.66e-01

Pos logit contributions
 camps+0.005
mo+0.005
 end+0.004
 polar+0.004
 bounce+0.004
Neg logit contributions
bek-0.013
ved-0.011
 Exhibit-0.011
isdom-0.011
 refused-0.011
Token them
Token ID606
Feature activation+2.73e-01

Pos logit contributions
bek+0.009
 Exhibit+0.008
ved+0.007
 refused+0.007
mare+0.007
Neg logit contributions
mo-0.007
 camps-0.006
bo-0.006
izzy-0.006
 numb-0.006
Token again
Token ID757
Feature activation+9.23e-03

Pos logit contributions
 camps+0.005
mo+0.005
bo+0.005
bath+0.004
 numb+0.004
Neg logit contributions
bek-0.007
isdom-0.006
mare-0.006
ved-0.006
 Exhibit-0.006
Token,
Token ID11
Feature activation+1.57e-01

Pos logit contributions
 camps+0.000
mo+0.000
One+0.000
bo+0.000
 chilly+0.000
Neg logit contributions
bek-0.000
 Exhibit-0.000
ved-0.000
rave-0.000
 vows-0.000
Token as
Token ID355
Feature activation+3.11e-01

Pos logit contributions
 camps+0.003
mo+0.003
 numb+0.002
 chilly+0.002
 toe+0.002
Neg logit contributions
bek-0.004
 Exhibit-0.004
isdom-0.004
 vows-0.004
 Sale-0.004
Token long
Token ID890
Feature activation+4.25e+00

Pos logit contributions
 camps+0.006
mo+0.006
 numb+0.005
 polar+0.005
bo+0.005
Neg logit contributions
bek-0.007
ved-0.006
mare-0.006
 Exhibit-0.006
rave-0.006
Token as
Token ID355
Feature activation+7.81e-01

Pos logit contributions
mo+0.100
 camps+0.095
bo+0.094
izzy+0.092
 numb+0.091
Neg logit contributions
bek-0.070
ved-0.064
 goodbye-0.062
 refused-0.062
 Exhibit-0.062
Token he
Token ID339
Feature activation+1.72e+00

Pos logit contributions
mo+0.018
 camps+0.018
bo+0.017
 numb+0.016
izzy+0.016
Neg logit contributions
bek-0.014
ved-0.012
 Exhibit-0.012
mare-0.012
isdom-0.012
Token kept
Token ID4030
Feature activation+1.35e+00

Pos logit contributions
mo+0.038
 camps+0.037
bo+0.036
izzy+0.034
 numb+0.034
Neg logit contributions
bek-0.033
ved-0.028
 Exhibit-0.028
isdom-0.028
mare-0.028
Token his
Token ID465
Feature activation+1.05e+00

Pos logit contributions
mo+0.029
 camps+0.029
bo+0.027
 numb+0.027
 Arctic+0.026
Neg logit contributions
bek-0.027
ved-0.023
 Exhibit-0.023
mare-0.022
isdom-0.022
Token promise
Token ID6991
Feature activation+2.40e-01

Pos logit contributions
mo+0.020
 camps+0.020
 numb+0.017
bo+0.017
 Arctic+0.017
Neg logit contributions
bek-0.024
ved-0.021
 Exhibit-0.021
isdom-0.020
mare-0.020
Token said
Token ID531
Feature activation+1.36e-01

Pos logit contributions
mo+0.015
bo+0.014
 camps+0.013
izzy+0.013
 Polar+0.012
Neg logit contributions
bek-0.011
 Exhibit-0.010
ved-0.009
mare-0.009
adoes-0.009
Token it
Token ID340
Feature activation+2.12e-01

Pos logit contributions
mo+0.003
bo+0.003
 camps+0.002
ucky+0.002
Sn+0.002
Neg logit contributions
bek-0.003
 Exhibit-0.003
mare-0.002
rave-0.002
ved-0.002
Token was
Token ID373
Feature activation+8.78e-01

Pos logit contributions
bo+0.004
mo+0.004
chy+0.003
 chilly+0.003
 camps+0.003
Neg logit contributions
bek-0.005
ved-0.005
mare-0.005
isdom-0.005
 Exhibit-0.005
Token okay
Token ID8788
Feature activation-5.99e-01

Pos logit contributions
mo+0.017
 camps+0.017
bo+0.016
 numb+0.015
 chilly+0.015
Neg logit contributions
bek-0.019
ved-0.017
 Exhibit-0.017
mare-0.017
isdom-0.016
Token as
Token ID355
Feature activation-1.08e+00

Pos logit contributions
bek+0.011
ved+0.010
 Exhibit+0.010
mare+0.010
 vows+0.010
Neg logit contributions
mo-0.013
 camps-0.013
bo-0.012
izzy-0.012
 numb-0.012
Token long
Token ID890
Feature activation+4.25e+00

Pos logit contributions
ved+0.027
bek+0.027
mare+0.025
 Exhibit+0.024
 vows+0.024
Neg logit contributions
 camps-0.017
 chilly-0.016
 polar-0.015
 colder-0.015
 Arctic-0.015
Token as
Token ID355
Feature activation+1.96e-01

Pos logit contributions
mo+0.097
 camps+0.093
bo+0.091
izzy+0.088
 numb+0.087
Neg logit contributions
bek-0.075
ved-0.067
 Exhibit-0.066
 refused-0.065
isdom-0.064
Token he
Token ID339
Feature activation+1.52e+00

Pos logit contributions
 camps+0.004
mo+0.003
 chilly+0.003
 Arctic+0.003
foot+0.003
Neg logit contributions
bek-0.004
ved-0.004
mare-0.004
 Exhibit-0.004
 vows-0.004
Token was
Token ID373
Feature activation+1.08e+00

Pos logit contributions
mo+0.037
 camps+0.037
 numb+0.035
bo+0.034
 Arctic+0.034
Neg logit contributions
bek-0.024
 refused-0.021
 Exhibit-0.021
ved-0.021
 won-0.021
Token careful
Token ID8161
Feature activation+4.50e-03

Pos logit contributions
mo+0.020
 camps+0.019
bo+0.019
 numb+0.018
 chilly+0.018
Neg logit contributions
bek-0.025
 Exhibit-0.022
ved-0.022
mare-0.022
rave-0.021
Token.
Token ID13
Feature activation-1.59e-01

Pos logit contributions
mo+0.000
 camps+0.000
bo+0.000
 numb+0.000
izzy+0.000
Neg logit contributions
bek-0.000
 Exhibit-0.000
ved-0.000
mare-0.000
isdom-0.000
Token stretched
Token ID19110
Feature activation+1.20e+00

Pos logit contributions
bo+0.010
mo+0.009
 stream+0.008
 camps+0.008
 polar+0.008
Neg logit contributions
bek-0.016
 Exhibit-0.014
rave-0.014
 Sale-0.013
adoes-0.013
Token out
Token ID503
Feature activation-5.65e-01

Pos logit contributions
mo+0.021
 camps+0.021
bo+0.020
 numb+0.019
 toe+0.019
Neg logit contributions
bek-0.032
 Exhibit-0.028
isdom-0.027
 Sale-0.026
mare-0.025
Token his
Token ID465
Feature activation+1.41e+00

Pos logit contributions
bek+0.013
 Exhibit+0.011
mare+0.011
ved+0.011
isdom+0.010
Neg logit contributions
mo-0.011
 camps-0.011
bo-0.010
 numb-0.010
 polar-0.010
Token legs
Token ID7405
Feature activation+1.01e+00

Pos logit contributions
mo+0.028
 camps+0.027
bo+0.026
 numb+0.025
 toe+0.025
Neg logit contributions
bek-0.032
ved-0.027
 Exhibit-0.027
isdom-0.027
mare-0.026
Token as
Token ID355
Feature activation-4.91e-02

Pos logit contributions
mo+0.023
 camps+0.022
bo+0.022
 numb+0.021
bath+0.021
Neg logit contributions
bek-0.019
 Exhibit-0.016
mare-0.016
isdom-0.016
 vows-0.015
Token long
Token ID890
Feature activation+4.25e+00

Pos logit contributions
bek+0.001
 Exhibit+0.001
mare+0.001
ved+0.001
isdom+0.001
Neg logit contributions
mo-0.001
 camps-0.001
bo-0.001
 numb-0.001
izzy-0.001
Token as
Token ID355
Feature activation+7.17e-01

Pos logit contributions
mo+0.105
izzy+0.101
 camps+0.100
bo+0.100
 Arctic+0.097
Neg logit contributions
bek-0.061
 goodbye-0.060
ved-0.060
 refused-0.057
 Steve-0.055
Token he
Token ID339
Feature activation+1.66e+00

Pos logit contributions
mo+0.015
 camps+0.014
bo+0.014
izzy+0.013
 numb+0.013
Neg logit contributions
bek-0.015
ved-0.013
 Exhibit-0.013
 Susan-0.013
 refused-0.012
Token could
Token ID714
Feature activation+1.55e+00

Pos logit contributions
mo+0.037
 camps+0.036
bo+0.035
 numb+0.034
izzy+0.033
Neg logit contributions
bek-0.031
ved-0.027
 Exhibit-0.027
mare-0.026
isdom-0.026
Token and
Token ID290
Feature activation+1.67e-02

Pos logit contributions
mo+0.029
 camps+0.028
bo+0.027
 numb+0.026
izzy+0.025
Neg logit contributions
bek-0.036
 Exhibit-0.031
ved-0.031
isdom-0.030
mare-0.030
Token he
Token ID339
Feature activation+6.44e-01

Pos logit contributions
mo+0.000
bo+0.000
 camps+0.000
 colder+0.000
 toe+0.000
Neg logit contributions
bek-0.000
 Exhibit-0.000
isdom-0.000
 Sale-0.000
rave-0.000
Token hold
Token ID1745
Feature activation+7.90e-01

Pos logit contributions
mo+0.004
 camps+0.004
bo+0.004
 numb+0.004
izzy+0.003
Neg logit contributions
bek-0.010
ved-0.009
 Exhibit-0.009
isdom-0.009
mare-0.009
Token his
Token ID465
Feature activation+6.98e-01

Pos logit contributions
mo+0.023
 camps+0.022
 Arctic+0.020
bo+0.020
bath+0.020
Neg logit contributions
 goodbye-0.009
 hello-0.008
ved-0.008
bek-0.008
 its-0.008
Token breath
Token ID8033
Feature activation-8.64e-02

Pos logit contributions
mo+0.015
 camps+0.015
bo+0.014
 numb+0.014
izzy+0.014
Neg logit contributions
bek-0.014
ved-0.012
 Exhibit-0.012
isdom-0.011
mare-0.011
Token for
Token ID329
Feature activation-3.21e-01

Pos logit contributions
bek+0.002
 Exhibit+0.002
ved+0.002
mare+0.002
isdom+0.002
Neg logit contributions
mo-0.001
 camps-0.001
bo-0.001
 numb-0.001
izzy-0.001
Token as
Token ID355
Feature activation+1.17e-01

Pos logit contributions
bek+0.005
 Exhibit+0.005
ved+0.005
 goodbye+0.005
 refused+0.004
Neg logit contributions
mo-0.008
 camps-0.007
bo-0.007
izzy-0.007
bath-0.007
Token long
Token ID890
Feature activation+4.24e+00

Pos logit contributions
mo+0.002
 camps+0.002
bo+0.002
 numb+0.002
izzy+0.002
Neg logit contributions
bek-0.002
ved-0.002
 Exhibit-0.002
mare-0.002
isdom-0.002
Token as
Token ID355
Feature activation+9.87e-01

Pos logit contributions
mo+0.130
izzy+0.128
bo+0.125
 Arctic+0.124
idget+0.122
Neg logit contributions
 goodbye-0.044
 its-0.034
 hello-0.033
 bye-0.031
 refused-0.030
Token he
Token ID339
Feature activation+2.13e+00

Pos logit contributions
mo+0.021
 camps+0.020
bo+0.020
 numb+0.019
bath+0.019
Neg logit contributions
bek-0.019
 Exhibit-0.017
ved-0.017
 Steve-0.017
 Susan-0.016
Token could
Token ID714
Feature activation+1.95e+00

Pos logit contributions
mo+0.055
 numb+0.053
 camps+0.052
ucky+0.051
 tul+0.050
Neg logit contributions
 won-0.029
 refused-0.027
 Exhibit-0.026
ved-0.026
bek-0.026
Token.
Token ID13
Feature activation-5.39e-02

Pos logit contributions
mo+0.038
bo+0.035
 camps+0.035
Where+0.034
 Arctic+0.033
Neg logit contributions
 goodbye-0.038
 won-0.038
ved-0.037
 refused-0.036
 hello-0.036
Token One
Token ID1881
Feature activation-1.24e-01

Pos logit contributions
 won+0.001
 Susan+0.001
 Steve+0.001
bek+0.001
 Exhibit+0.001
Neg logit contributions
 numb-0.001
mo-0.001
 colder-0.001
bo-0.001
 tul-0.001
Token until
Token ID1566
Feature activation+1.43e+00

Pos logit contributions
bek+0.008
 vows+0.008
isdom+0.008
 Exhibit+0.007
mare+0.007
Neg logit contributions
mo-0.005
 camps-0.005
 chilly-0.004
 numb-0.004
 colder-0.004
Token his
Token ID465
Feature activation+1.94e+00

Pos logit contributions
mo+0.030
 camps+0.029
bo+0.028
 numb+0.027
izzy+0.027
Neg logit contributions
bek-0.028
ved-0.025
 Exhibit-0.025
mare-0.024
 refused-0.024
Token walks
Token ID11114
Feature activation+9.67e-02

Pos logit contributions
mo+0.049
 camps+0.046
bo+0.046
izzy+0.045
 numb+0.044
Neg logit contributions
bek-0.030
 Exhibit-0.026
ved-0.026
 goodbye-0.026
 vows-0.025
Token were
Token ID547
Feature activation+1.77e+00

Pos logit contributions
bo+0.002
 camps+0.001
mo+0.001
 stream+0.001
B+0.001
Neg logit contributions
bek-0.002
isdom-0.002
 Exhibit-0.002
 meanings-0.002
 Sorry-0.002
Token as
Token ID355
Feature activation-4.68e-01

Pos logit contributions
mo+0.033
 camps+0.032
bo+0.031
 numb+0.031
 chilly+0.029
Neg logit contributions
bek-0.041
 Exhibit-0.037
isdom-0.036
mare-0.035
ved-0.035
Token long
Token ID890
Feature activation+4.23e+00

Pos logit contributions
bek+0.012
ved+0.012
isdom+0.012
mare+0.011
 Exhibit+0.011
Neg logit contributions
 camps-0.007
 numb-0.006
mo-0.006
 colder-0.006
izzy-0.006
Token as
Token ID355
Feature activation+4.44e-01

Pos logit contributions
mo+0.111
bo+0.107
izzy+0.106
 camps+0.104
 Arctic+0.102
Neg logit contributions
 goodbye-0.056
ved-0.054
bek-0.053
 refused-0.052
 won-0.052
Token he
Token ID339
Feature activation+1.91e+00

Pos logit contributions
mo+0.009
 camps+0.008
bo+0.008
 numb+0.008
izzy+0.008
Neg logit contributions
bek-0.009
 Exhibit-0.008
ved-0.008
 refused-0.008
mare-0.008
Token wanted
Token ID2227
Feature activation+5.10e-01

Pos logit contributions
mo+0.042
 camps+0.041
bo+0.040
izzy+0.038
 numb+0.038
Neg logit contributions
bek-0.036
 Exhibit-0.031
ved-0.031
mare-0.030
isdom-0.030
Token them
Token ID606
Feature activation+3.62e-01

Pos logit contributions
mo+0.012
 camps+0.012
bo+0.011
 numb+0.011
izzy+0.011
Neg logit contributions
bek-0.009
mare-0.008
 Exhibit-0.008
ved-0.008
isdom-0.007
Token to
Token ID284
Feature activation+2.48e-01

Pos logit contributions
 camps+0.007
mo+0.007
 numb+0.007
bo+0.007
 chilly+0.007
Neg logit contributions
bek-0.008
mare-0.006
 Exhibit-0.006
isdom-0.006
 vows-0.006
Token Did
Token ID7731
Feature activation+5.02e-01

Pos logit contributions
bek+0.007
ved+0.006
 Exhibit+0.006
mare+0.005
isdom+0.005
Neg logit contributions
mo-0.009
 camps-0.009
bo-0.009
 numb-0.008
izzy-0.008
Token you
Token ID345
Feature activation-4.41e-01

Pos logit contributions
mo+0.012
 camps+0.012
bo+0.011
 numb+0.011
izzy+0.011
Neg logit contributions
bek-0.009
ved-0.008
 Exhibit-0.007
mare-0.007
isdom-0.007
Token know
Token ID760
Feature activation+1.83e-03

Pos logit contributions
bek+0.012
 Exhibit+0.010
ved+0.010
mare+0.010
 vows+0.010
Neg logit contributions
mo-0.007
bo-0.007
 numb-0.006
 camps-0.006
 toe-0.006
Token that
Token ID326
Feature activation+2.30e-01

Pos logit contributions
mo+0.000
 camps+0.000
 toe+0.000
bo+0.000
 tul+0.000
Neg logit contributions
bek-0.000
mare-0.000
 Exhibit-0.000
ved-0.000
rave-0.000
Token av
Token ID1196
Feature activation-3.31e+00

Pos logit contributions
 toe+0.004
 camps+0.004
 chilly+0.004
mo+0.004
 bounce+0.004
Neg logit contributions
bek-0.006
ved-0.005
 Exhibit-0.005
mare-0.005
rave-0.005
Tokenoc
Token ID420
Feature activation-5.99e+00

Pos logit contributions
bek+0.072
 goodbye+0.067
mare+0.067
 vows+0.067
 Exhibit+0.065
Neg logit contributions
bo-0.063
mo-0.061
 camps-0.057
bath-0.053
One-0.053
Tokenados
Token ID22484
Feature activation+2.17e-01

Pos logit contributions
mare+0.125
bek+0.124
ved+0.110
 Closed+0.107
 Exhibit+0.106
Neg logit contributions
mo-0.121
bo-0.119
ucky-0.106
 camps-0.103
 tul-0.095
Token are
Token ID389
Feature activation+1.05e+00

Pos logit contributions
bo+0.004
izzy+0.003
mo+0.003
chy+0.003
 end+0.003
Neg logit contributions
bek-0.006
 goodbye-0.005
mare-0.005
ved-0.005
 Susan-0.005
Token very
Token ID845
Feature activation-3.70e-01

Pos logit contributions
 camps+0.024
mo+0.023
bo+0.022
 numb+0.022
One+0.021
Neg logit contributions
bek-0.021
ved-0.017
mare-0.017
 Exhibit-0.017
 goodbye-0.016
Token healthy
Token ID5448
Feature activation+1.50e-01

Pos logit contributions
bek+0.009
ved+0.009
mare+0.009
 Exhibit+0.008
 Susan+0.008
Neg logit contributions
 camps-0.006
mo-0.006
bo-0.006
 chilly-0.005
One-0.005
Token for
Token ID329
Feature activation+9.52e-03

Pos logit contributions
 camps+0.003
mo+0.003
bo+0.003
ucky+0.003
izzy+0.003
Neg logit contributions
bek-0.003
 Exhibit-0.003
ved-0.003
 vows-0.003
rave-0.003
Token,
Token ID11
Feature activation-1.02e-01

Pos logit contributions
bek+0.008
 Exhibit+0.007
isdom+0.007
rave+0.007
adoes+0.007
Neg logit contributions
mo-0.004
 gn-0.004
 tul-0.003
bo-0.003
ucky-0.003
Token "
Token ID366
Feature activation+8.66e-01

Pos logit contributions
bek+0.002
ved+0.002
isdom+0.002
 vows+0.002
 Exhibit+0.002
Neg logit contributions
mo-0.002
Where-0.002
 tul-0.002
 camps-0.002
Sn-0.002
TokenThe
Token ID464
Feature activation-3.77e-01

Pos logit contributions
mo+0.014
 camps+0.013
bo+0.013
 numb+0.012
izzy+0.012
Neg logit contributions
bek-0.022
ved-0.020
 Exhibit-0.020
mare-0.019
isdom-0.019
Token rh
Token ID9529
Feature activation-2.50e+00

Pos logit contributions
bek+0.008
 Exhibit+0.008
ved+0.007
mare+0.007
isdom+0.007
Neg logit contributions
mo-0.007
 camps-0.007
bo-0.006
izzy-0.006
 numb-0.006
Tokenin
Token ID259
Feature activation-1.57e+00

Pos logit contributions
 Susan+0.093
bek+0.093
 goodbye+0.091
mare+0.090
 vows+0.090
Neg logit contributions
mo-0.009
bo-0.007
 camps-0.005
izzy-0.002
ucky-0.001
Tokenoc
Token ID420
Feature activation-5.91e+00

Pos logit contributions
bek+0.041
ved+0.041
 Katy+0.040
 Susan+0.039
adoes+0.039
Neg logit contributions
bo-0.024
mo-0.023
 polar-0.021
 camps-0.019
 Arctic-0.019
Tokeneros
Token ID27498
Feature activation-6.86e-01

Pos logit contributions
bek+0.147
mare+0.144
 Susan+0.138
 refused+0.137
ved+0.136
Neg logit contributions
mo-0.100
bo-0.091
 camps-0.086
 polar-0.079
 Arctic-0.077
Token made
Token ID925
Feature activation+2.03e-02

Pos logit contributions
bek+0.016
mare+0.013
ved+0.013
rave+0.013
adoes+0.013
Neg logit contributions
mo-0.014
bo-0.013
 camps-0.012
 polar-0.012
 Arctic-0.011
Token a
Token ID257
Feature activation-3.40e-01

Pos logit contributions
bo+0.000
 camps+0.000
mo+0.000
 polar+0.000
ucky+0.000
Neg logit contributions
bek-0.000
ved-0.000
mare-0.000
 refused-0.000
isdom-0.000
Token big
Token ID1263
Feature activation+1.28e-02

Pos logit contributions
bek+0.007
ved+0.007
isdom+0.007
mare+0.007
 refused+0.007
Neg logit contributions
 camps-0.006
mo-0.006
bo-0.006
 polar-0.006
 numb-0.006
Token mess
Token ID2085
Feature activation+2.36e-01

Pos logit contributions
 camps+0.000
mo+0.000
 polar+0.000
bo+0.000
 Arctic+0.000
Neg logit contributions
bek-0.000
ved-0.000
 Exhibit-0.000
mare-0.000
 refused-0.000
Token turned
Token ID2900
Feature activation+1.05e+00

Pos logit contributions
bek+0.002
 Exhibit+0.002
ved+0.002
mare+0.002
isdom+0.002
Neg logit contributions
mo-0.003
 camps-0.003
bo-0.002
 numb-0.002
izzy-0.002
Token out
Token ID503
Feature activation-2.07e+00

Pos logit contributions
mo+0.020
bo+0.020
 camps+0.020
 numb+0.019
 chilly+0.019
Neg logit contributions
bek-0.024
 Exhibit-0.022
isdom-0.021
ved-0.021
mare-0.021
Token that
Token ID326
Feature activation-6.28e-01

Pos logit contributions
bek+0.050
 Exhibit+0.049
mare+0.044
 vows+0.042
isdom+0.042
Neg logit contributions
 camps-0.037
mo-0.036
 chilly-0.036
 numb-0.036
 colder-0.035
Token the
Token ID262
Feature activation+5.41e-01

Pos logit contributions
bek+0.018
 Exhibit+0.016
ved+0.016
rave+0.015
isdom+0.015
Neg logit contributions
 colder-0.010
 chilly-0.010
 stream-0.009
 toe-0.009
 bubble-0.009
Token av
Token ID1196
Feature activation-3.45e+00

Pos logit contributions
mo+0.011
 camps+0.010
izzy+0.010
bo+0.010
 numb+0.010
Neg logit contributions
bek-0.011
 Exhibit-0.010
 refused-0.010
ved-0.010
 won-0.009
Tokenoc
Token ID420
Feature activation-5.89e+00

Pos logit contributions
bek+0.071
 vows+0.065
ved+0.065
 Exhibit+0.065
mare+0.064
Neg logit contributions
mo-0.069
bo-0.068
 camps-0.067
bath-0.061
 numb-0.060
Tokenados
Token ID22484
Feature activation-4.26e-01

Pos logit contributions
mare+0.118
bek+0.116
ved+0.110
 Exhibit+0.108
 won+0.107
Neg logit contributions
mo-0.125
bo-0.122
 camps-0.108
ucky-0.107
 ig-0.099
Token were
Token ID547
Feature activation+8.50e-01

Pos logit contributions
bek+0.011
isdom+0.009
mare+0.009
 Exhibit+0.009
rave+0.009
Neg logit contributions
mo-0.007
bo-0.007
 camps-0.007
chy-0.007
 numb-0.007
Token bad
Token ID2089
Feature activation+4.35e-01

Pos logit contributions
mo+0.015
 numb+0.014
 camps+0.014
bo+0.014
 chilly+0.013
Neg logit contributions
bek-0.021
 Exhibit-0.019
mare-0.018
ved-0.018
isdom-0.018
Token and
Token ID290
Feature activation+2.58e-01

Pos logit contributions
mo+0.009
 camps+0.008
bo+0.008
 numb+0.008
ucky+0.008
Neg logit contributions
bek-0.010
ved-0.008
 Exhibit-0.008
mare-0.008
rave-0.008
Token made
Token ID925
Feature activation+3.58e-01

Pos logit contributions
 numb+0.004
 colder+0.004
 chilly+0.004
 polar+0.004
mo+0.004
Neg logit contributions
bek-0.006
ved-0.006
 Exhibit-0.006
rave-0.006
mare-0.006
Token celebrated
Token ID13943
Feature activation-1.46e+00

Pos logit contributions
bek+0.033
 Sale+0.029
isdom+0.029
mare+0.028
rave+0.028
Neg logit contributions
mo-0.019
 polar-0.016
 stream-0.016
aved-0.016
bo-0.016
Token.
Token ID13
Feature activation-6.76e-01

Pos logit contributions
bek+0.042
mare+0.042
 Exhibit+0.040
 contract+0.039
ved+0.038
Neg logit contributions
 end-0.017
 next-0.015
 sometimes-0.014
 particular-0.014
 camps-0.013
Token The
Token ID383
Feature activation-6.36e-02

Pos logit contributions
bek+0.013
ved+0.011
 Exhibit+0.011
mare+0.011
isdom+0.011
Neg logit contributions
mo-0.015
 camps-0.015
bo-0.014
 numb-0.014
izzy-0.013
Token rh
Token ID9529
Feature activation-2.96e+00

Pos logit contributions
bek+0.001
 Exhibit+0.001
 vows+0.001
isdom+0.001
 won+0.001
Neg logit contributions
mo-0.001
izzy-0.001
bo-0.001
 numb-0.001
 camps-0.001
Tokenin
Token ID259
Feature activation-2.55e+00

Pos logit contributions
bek+0.103
mare+0.097
 Susan+0.097
 goodbye+0.097
 vows+0.097
Neg logit contributions
mo-0.020
 camps-0.016
bo-0.016
izzy-0.011
ucky-0.011
Tokenoc
Token ID420
Feature activation-5.86e+00

Pos logit contributions
bek+0.066
ved+0.064
 Katy+0.064
 vows+0.062
 Susan+0.062
Neg logit contributions
mo-0.042
bo-0.040
 polar-0.037
 camps-0.036
 Arctic-0.035
Tokeneros
Token ID27498
Feature activation-1.38e+00

Pos logit contributions
bek+0.149
mare+0.142
 refused+0.139
 Katy+0.137
 Susan+0.136
Neg logit contributions
mo-0.099
bo-0.088
 camps-0.085
 polar-0.081
 Arctic-0.079
Token learned
Token ID4499
Feature activation-5.16e-01

Pos logit contributions
bek+0.035
 Exhibit+0.031
isdom+0.030
 Sorry+0.029
mare+0.029
Neg logit contributions
mo-0.026
bo-0.025
 camps-0.023
 polar-0.021
 Arctic-0.021
Token that
Token ID326
Feature activation-5.93e-02

Pos logit contributions
bek+0.017
 Exhibit+0.016
mare+0.016
ved+0.015
rav+0.015
Neg logit contributions
One-0.004
 camps-0.004
 sometimes-0.004
 stream-0.004
 ig-0.003
Token humans
Token ID5384
Feature activation-1.44e+00

Pos logit contributions
bek+0.001
ved+0.001
mare+0.001
 vows+0.001
 Exhibit+0.001
Neg logit contributions
 camps-0.001
mo-0.001
 chilly-0.001
 toe-0.001
 Arctic-0.001
Token can
Token ID460
Feature activation-1.12e-01

Pos logit contributions
bek+0.038
mare+0.033
 Exhibit+0.031
ved+0.030
 Susan+0.030
Neg logit contributions
mo-0.024
bo-0.023
 end-0.022
chy-0.021
 chilly-0.021
Token fun
Token ID1257
Feature activation+1.13e+00

Pos logit contributions
bek+0.010
ved+0.009
 Exhibit+0.009
mare+0.009
isdom+0.009
Neg logit contributions
mo-0.008
 camps-0.008
bo-0.007
 numb-0.007
 Arctic-0.007
Token.
Token ID13
Feature activation-3.60e-01

Pos logit contributions
mo+0.018
 camps+0.017
bo+0.016
izzy+0.016
 numb+0.016
Neg logit contributions
bek-0.028
ved-0.026
 Exhibit-0.026
isdom-0.025
 refused-0.025
Token The
Token ID383
Feature activation+3.06e-01

Pos logit contributions
bek+0.009
ved+0.008
 Exhibit+0.008
 refused+0.007
mare+0.007
Neg logit contributions
mo-0.006
 camps-0.006
bo-0.006
 Arctic-0.006
izzy-0.006
Token rh
Token ID9529
Feature activation-2.85e+00

Pos logit contributions
izzy+0.007
mo+0.006
 numb+0.006
bath+0.006
foot+0.006
Neg logit contributions
bek-0.006
 Exhibit-0.005
 letter-0.005
 won-0.005
 refused-0.005
Tokenin
Token ID259
Feature activation-2.09e+00

Pos logit contributions
bek+0.099
mare+0.094
 Susan+0.094
 vows+0.093
 goodbye+0.093
Neg logit contributions
mo-0.018
 camps-0.015
bo-0.015
izzy-0.011
ucky-0.010
Tokenoc
Token ID420
Feature activation-5.81e+00

Pos logit contributions
bek+0.051
 Katy+0.049
ved+0.049
 Susan+0.049
 vows+0.048
Neg logit contributions
mo-0.035
bo-0.035
 camps-0.032
 polar-0.030
 Arctic-0.029
Tokeneros
Token ID27498
Feature activation-9.33e-01

Pos logit contributions
bek+0.140
mare+0.135
 refused+0.130
 Susan+0.129
ved+0.129
Neg logit contributions
mo-0.099
bo-0.093
 camps-0.089
 polar-0.082
ucky-0.081
Token was
Token ID373
Feature activation+7.05e-01

Pos logit contributions
bek+0.024
 Exhibit+0.021
mare+0.021
isdom+0.020
 Katy+0.020
Neg logit contributions
bo-0.017
mo-0.017
 camps-0.015
 end-0.013
 toe-0.013
Token happy
Token ID3772
Feature activation-3.32e-01

Pos logit contributions
mo+0.015
 camps+0.015
bo+0.014
 numb+0.014
 Arctic+0.013
Neg logit contributions
bek-0.014
 Exhibit-0.013
ved-0.012
isdom-0.012
mare-0.012
Token to
Token ID284
Feature activation+7.66e-01

Pos logit contributions
bek+0.008
 Exhibit+0.007
ved+0.007
 vows+0.006
isdom+0.006
Neg logit contributions
mo-0.006
 camps-0.006
bo-0.006
 numb-0.006
 toe-0.006
Token help
Token ID1037
Feature activation-2.68e-01

Pos logit contributions
mo+0.016
 camps+0.015
bo+0.014
 numb+0.014
izzy+0.014
Neg logit contributions
bek-0.016
ved-0.014
 Exhibit-0.014
mare-0.014
isdom-0.014
Token hurt
Token ID5938
Feature activation+1.72e-01

Pos logit contributions
bek+0.006
ved+0.006
 refused+0.006
 Exhibit+0.005
 goodbye+0.005
Neg logit contributions
mo-0.007
 camps-0.007
izzy-0.007
bo-0.007
 Arctic-0.007
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
mo+0.003
 camps+0.003
bo+0.002
izzy+0.002
 numb+0.002
Neg logit contributions
bek-0.004
ved-0.004
 Exhibit-0.004
 refused-0.004
 Susan-0.004
Token The
Token ID383
Feature activation+2.33e-01

Pos logit contributions
bek+0.003
ved+0.002
 Exhibit+0.002
mare+0.002
isdom+0.002
Neg logit contributions
mo-0.002
 camps-0.002
bo-0.002
 numb-0.002
izzy-0.002
Token rh
Token ID9529
Feature activation-2.64e+00

Pos logit contributions
mo+0.005
bo+0.004
izzy+0.004
 camps+0.004
 numb+0.004
Neg logit contributions
bek-0.005
 goodbye-0.004
 vows-0.004
 refused-0.004
mare-0.004
Tokenin
Token ID259
Feature activation-1.90e+00

Pos logit contributions
bek+0.097
 Susan+0.096
mare+0.095
 goodbye+0.094
 vows+0.093
Neg logit contributions
mo-0.012
bo-0.010
 camps-0.007
 polar-0.003
izzy-0.002
Tokenoc
Token ID420
Feature activation-5.81e+00

Pos logit contributions
bek+0.040
ved+0.036
 Susan+0.036
 Exhibit+0.036
mare+0.035
Neg logit contributions
mo-0.040
bo-0.038
 camps-0.037
 Arctic-0.034
 polar-0.034
Tokeneros
Token ID27498
Feature activation-1.68e+00

Pos logit contributions
bek+0.148
mare+0.143
 refused+0.139
 Susan+0.138
ved+0.136
Neg logit contributions
mo-0.095
bo-0.089
 camps-0.081
 polar-0.075
 ig-0.073
Token kept
Token ID4030
Feature activation+3.08e-01

Pos logit contributions
bek+0.038
mare+0.034
 Exhibit+0.034
isdom+0.032
rave+0.031
Neg logit contributions
mo-0.037
bo-0.034
 camps-0.031
 polar-0.028
chy-0.028
Token sn
Token ID3013
Feature activation-2.36e-01

Pos logit contributions
mo+0.006
 camps+0.006
bo+0.006
 numb+0.006
 polar+0.005
Neg logit contributions
bek-0.007
 Exhibit-0.006
ved-0.006
isdom-0.006
mare-0.006
Tokenorting
Token ID24707
Feature activation-7.54e-02

Pos logit contributions
bek+0.005
ved+0.004
isdom+0.004
 Exhibit+0.004
mare+0.004
Neg logit contributions
 camps-0.005
mo-0.005
 numb-0.004
 Arctic-0.004
izzy-0.004
Token and
Token ID290
Feature activation-1.27e-01

Pos logit contributions
bek+0.002
mare+0.002
isdom+0.002
 Exhibit+0.002
ved+0.001
Neg logit contributions
 camps-0.001
mo-0.001
 numb-0.001
bo-0.001
 chilly-0.001
Token\n
Token ID198
Feature activation+4.70e-01

Pos logit contributions
mo+0.000
 numb+0.000
 camps+0.000
 colder+0.000
chy+0.000
Neg logit contributions
bek-0.000
 Susan-0.000
 Steve-0.000
 Exhibit-0.000
isdom-0.000
Token"
Token ID1
Feature activation+1.42e+00

Pos logit contributions
mo+0.010
 camps+0.010
 numb+0.009
 colder+0.009
bo+0.009
Neg logit contributions
bek-0.010
 Exhibit-0.008
 Susan-0.008
 Steve-0.008
isdom-0.008
TokenThe
Token ID464
Feature activation+3.03e-01

Pos logit contributions
mo+0.021
 camps+0.020
bo+0.019
 numb+0.018
izzy+0.018
Neg logit contributions
bek-0.037
ved-0.034
 Exhibit-0.034
mare-0.033
 refused-0.033
Token rh
Token ID9529
Feature activation-2.30e+00

Pos logit contributions
 camps+0.006
mo+0.006
 polar+0.005
 Arctic+0.005
 tul+0.005
Neg logit contributions
bek-0.006
ved-0.006
isdom-0.006
 refused-0.006
mare-0.006
Tokenin
Token ID259
Feature activation-1.25e+00

Pos logit contributions
 Susan+0.084
bek+0.082
mare+0.081
 goodbye+0.081
 vows+0.080
Neg logit contributions
mo-0.010
bo-0.009
 camps-0.006
in-0.003
 end-0.003
Tokenoc
Token ID420
Feature activation-5.75e+00

Pos logit contributions
bek+0.029
ved+0.029
 Katy+0.028
adoes+0.027
 Susan+0.027
Neg logit contributions
bo-0.021
mo-0.021
 polar-0.018
 camps-0.018
 ig-0.018
Tokeneros
Token ID27498
Feature activation-7.73e-01

Pos logit contributions
bek+0.140
mare+0.137
 Susan+0.134
 refused+0.131
 Katy+0.130
Neg logit contributions
mo-0.101
bo-0.091
 camps-0.086
 polar-0.079
 ig-0.079
Token is
Token ID318
Feature activation+8.24e-01

Pos logit contributions
bek+0.018
mare+0.015
ved+0.015
isdom+0.014
 vows+0.014
Neg logit contributions
mo-0.015
bo-0.015
 camps-0.013
 polar-0.013
 ig-0.013
Token in
Token ID287
Feature activation-6.83e-01

Pos logit contributions
 camps+0.014
mo+0.014
 chilly+0.013
 polar+0.013
bo+0.013
Neg logit contributions
bek-0.021
ved-0.018
mare-0.018
anish-0.018
isdom-0.018
Token there
Token ID612
Feature activation-5.46e-01

Pos logit contributions
bek+0.013
ved+0.012
 Exhibit+0.011
 refused+0.011
isdom+0.011
Neg logit contributions
 camps-0.015
mo-0.015
bo-0.014
 numb-0.014
 Arctic-0.013
Token,
Token ID11
Feature activation+1.88e-01

Pos logit contributions
bek+0.011
ved+0.010
 Exhibit+0.010
mare+0.009
isdom+0.009
Neg logit contributions
mo-0.011
 camps-0.011
bo-0.011
 Arctic-0.010
izzy-0.010
Token playing
Token ID2712
Feature activation-5.65e-01

Pos logit contributions
bek+0.003
 Exhibit+0.003
mare+0.003
ved+0.003
isdom+0.003
Neg logit contributions
mo-0.002
 camps-0.002
bo-0.002
izzy-0.002
bath-0.002
Token!
Token ID0
Feature activation-2.63e-02

Pos logit contributions
bek+0.015
mare+0.014
isdom+0.014
 Exhibit+0.014
 vows+0.013
Neg logit contributions
 camps-0.008
 toe-0.008
mo-0.007
bath-0.007
 bounce-0.007
Token The
Token ID383
Feature activation+3.54e-01

Pos logit contributions
bek+0.001
ved+0.001
 Exhibit+0.001
mare+0.000
isdom+0.000
Neg logit contributions
mo-0.001
 camps-0.000
bo-0.000
izzy-0.000
 Arctic-0.000
Token rh
Token ID9529
Feature activation-2.77e+00

Pos logit contributions
mo+0.008
 camps+0.008
bo+0.007
 numb+0.007
izzy+0.007
Neg logit contributions
bek-0.007
ved-0.006
 Exhibit-0.006
mare-0.006
 refused-0.006
Tokenin
Token ID259
Feature activation-2.21e+00

Pos logit contributions
bek+0.103
 Susan+0.101
 goodbye+0.101
mare+0.100
 vows+0.099
Neg logit contributions
mo-0.011
bo-0.008
 camps-0.005
izzy-0.003
ucky-0.003
Tokenoc
Token ID420
Feature activation-5.75e+00

Pos logit contributions
bek+0.050
ved+0.047
 Susan+0.046
mare+0.045
isdom+0.044
Neg logit contributions
mo-0.043
bo-0.042
 camps-0.038
 polar-0.037
 Arctic-0.036
Tokeneros
Token ID27498
Feature activation-1.72e+00

Pos logit contributions
bek+0.144
mare+0.140
 Susan+0.134
 refused+0.132
ved+0.129
Neg logit contributions
mo-0.097
bo-0.088
 camps-0.083
 polar-0.076
 ig-0.076
Token was
Token ID373
Feature activation+4.98e-01

Pos logit contributions
bek+0.043
mare+0.038
 Exhibit+0.038
isdom+0.037
 Sale+0.036
Neg logit contributions
mo-0.033
bo-0.032
 camps-0.027
 polar-0.026
 toe-0.026
Token very
Token ID845
Feature activation-9.82e-01

Pos logit contributions
mo+0.010
 camps+0.009
bo+0.009
 numb+0.009
 Arctic+0.009
Neg logit contributions
bek-0.011
 Exhibit-0.010
isdom-0.010
ved-0.010
mare-0.010
Token worried
Token ID7960
Feature activation-3.02e-01

Pos logit contributions
bek+0.023
ved+0.021
mare+0.021
 Exhibit+0.020
isdom+0.020
Neg logit contributions
mo-0.018
bo-0.017
 camps-0.017
 numb-0.017
 chilly-0.017
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
bek+0.008
 Exhibit+0.007
ved+0.007
mare+0.007
rave+0.006
Neg logit contributions
mo-0.005
 camps-0.005
 numb-0.005
bo-0.005
 chilly-0.005
Token away
Token ID1497
Feature activation-7.65e-01

Pos logit contributions
mo+0.011
 camps+0.011
bo+0.010
 numb+0.010
izzy+0.010
Neg logit contributions
bek-0.010
 Exhibit-0.009
isdom-0.009
ved-0.009
mare-0.009
Token.
Token ID13
Feature activation-4.07e-01

Pos logit contributions
bek+0.025
isdom+0.023
 Exhibit+0.023
mare+0.022
 Sale+0.022
Neg logit contributions
 camps-0.008
 numb-0.007
mo-0.007
 chilly-0.006
 colder-0.006
Token The
Token ID383
Feature activation+7.97e-02

Pos logit contributions
bek+0.011
ved+0.010
 Exhibit+0.010
 vows+0.010
mare+0.009
Neg logit contributions
 camps-0.006
bo-0.005
mo-0.005
 Arctic-0.005
 Polar-0.005
Token rh
Token ID9529
Feature activation-2.94e+00

Pos logit contributions
mo+0.002
izzy+0.002
 numb+0.002
 camps+0.002
bo+0.002
Neg logit contributions
bek-0.002
 Exhibit-0.001
 letter-0.001
 goodbye-0.001
isdom-0.001
Tokenin
Token ID259
Feature activation-2.53e+00

Pos logit contributions
bek+0.104
 goodbye+0.100
mare+0.099
 vows+0.099
 Susan+0.099
Neg logit contributions
mo-0.017
bo-0.013
 camps-0.012
izzy-0.009
ucky-0.008
Tokenoc
Token ID420
Feature activation-5.73e+00

Pos logit contributions
bek+0.060
ved+0.058
 vows+0.057
 Katy+0.056
 Susan+0.056
Neg logit contributions
bo-0.044
mo-0.044
 camps-0.039
 polar-0.038
 Arctic-0.037
Tokeneros
Token ID27498
Feature activation-1.35e+00

Pos logit contributions
bek+0.138
mare+0.130
 refused+0.128
 Susan+0.124
ved+0.123
Neg logit contributions
mo-0.098
bo-0.092
 camps-0.088
 polar-0.081
 Arctic-0.080
Token felt
Token ID2936
Feature activation+9.06e-01

Pos logit contributions
bek+0.034
 Exhibit+0.032
isdom+0.031
 ours+0.031
 Sale+0.030
Neg logit contributions
bo-0.024
mo-0.023
 camps-0.020
 end-0.020
 toe-0.020
Token che
Token ID1125
Feature activation-2.29e-01

Pos logit contributions
mo+0.021
 camps+0.020
bo+0.019
izzy+0.019
 Arctic+0.018
Neg logit contributions
bek-0.016
ved-0.014
 goodbye-0.014
 refused-0.014
 won-0.014
Tokenery
Token ID1924
Feature activation-1.15e+00

Pos logit contributions
bek+0.005
ved+0.005
 refused+0.005
isdom+0.005
mare+0.004
Neg logit contributions
 camps-0.004
mo-0.004
 numb-0.004
 Arctic-0.004
 Polar-0.004
Token and
Token ID290
Feature activation-6.75e-01

Pos logit contributions
bek+0.024
 Exhibit+0.022
ved+0.021
isdom+0.021
mare+0.021
Neg logit contributions
mo-0.023
 camps-0.022
bo-0.021
 numb-0.021
 chilly-0.020
Token relief
Token ID8259
Feature activation-3.79e-01

Pos logit contributions
bek+0.011
 goodbye+0.009
ved+0.009
 Susan+0.009
mare+0.009
Neg logit contributions
mo-0.016
 camps-0.015
bo-0.015
izzy-0.014
bath-0.014
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
bek+0.012
 Exhibit+0.010
mare+0.010
rave+0.010
isdom+0.010
Neg logit contributions
 numb-0.004
mo-0.004
 camps-0.004
 polar-0.004
bo-0.004
Token The
Token ID383
Feature activation+1.96e-01

Pos logit contributions
bek+0.005
ved+0.005
 Exhibit+0.005
 vows+0.005
mare+0.005
Neg logit contributions
mo-0.003
bo-0.003
 camps-0.003
 Arctic-0.003
 Polar-0.003
Token rh
Token ID9529
Feature activation-2.57e+00

Pos logit contributions
mo+0.004
izzy+0.004
 numb+0.004
 camps+0.004
bo+0.004
Neg logit contributions
bek-0.004
 Exhibit-0.003
 vows-0.003
 goodbye-0.003
 refused-0.003
Tokenin
Token ID259
Feature activation-1.95e+00

Pos logit contributions
bek+0.092
mare+0.089
 goodbye+0.088
 Susan+0.088
 vows+0.087
Neg logit contributions
mo-0.015
bo-0.011
 camps-0.010
izzy-0.007
ucky-0.006
Tokenoc
Token ID420
Feature activation-5.71e+00

Pos logit contributions
bek+0.047
ved+0.047
 Katy+0.046
 Susan+0.046
mare+0.046
Neg logit contributions
mo-0.034
bo-0.033
 polar-0.028
 camps-0.028
 Arctic-0.027
Tokeneros
Token ID27498
Feature activation-1.18e+00

Pos logit contributions
bek+0.144
mare+0.143
 refused+0.134
 Susan+0.133
 won+0.131
Neg logit contributions
mo-0.094
bo-0.085
 camps-0.077
 polar-0.073
 Arctic-0.072
Token was
Token ID373
Feature activation+2.62e-01

Pos logit contributions
bek+0.028
 Exhibit+0.026
mare+0.025
 ours+0.025
isdom+0.025
Neg logit contributions
mo-0.023
bo-0.023
 camps-0.019
 polar-0.019
 toe-0.018
Token so
Token ID523
Feature activation-9.41e-01

Pos logit contributions
mo+0.006
 camps+0.005
bo+0.005
 numb+0.005
 Arctic+0.005
Neg logit contributions
bek-0.005
 Exhibit-0.005
ved-0.005
isdom-0.005
mare-0.004
Token happy
Token ID3772
Feature activation-9.04e-01

Pos logit contributions
bek+0.019
 Exhibit+0.016
ved+0.016
mare+0.016
isdom+0.015
Neg logit contributions
mo-0.020
bo-0.020
 camps-0.019
 numb-0.019
izzy-0.018
Token to
Token ID284
Feature activation+3.48e-01

Pos logit contributions
bek+0.024
 Exhibit+0.022
ved+0.021
 vows+0.021
mare+0.021
Neg logit contributions
mo-0.013
 camps-0.013
foot-0.012
bo-0.012
 polar-0.012
Token gentle
Token ID10296
Feature activation+1.72e-01

Pos logit contributions
 chilly+0.016
 numb+0.016
 colder+0.015
 bounce+0.015
bo+0.015
Neg logit contributions
bek-0.028
ved-0.026
 Exhibit-0.025
anish-0.025
 vows-0.024
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
mo+0.002
bo+0.002
 camps+0.002
 numb+0.002
foot+0.002
Neg logit contributions
bek-0.005
 Exhibit-0.005
ved-0.004
 refused-0.004
isdom-0.004
Token The
Token ID383
Feature activation+1.30e-02

Pos logit contributions
bek+0.003
ved+0.002
 Exhibit+0.002
mare+0.002
 refused+0.002
Neg logit contributions
mo-0.003
 camps-0.003
bo-0.003
 numb-0.003
izzy-0.003
Token rh
Token ID9529
Feature activation-2.66e+00

Pos logit contributions
mo+0.000
 camps+0.000
bo+0.000
izzy+0.000
 numb+0.000
Neg logit contributions
bek-0.000
 Exhibit-0.000
ved-0.000
mare-0.000
isdom-0.000
Tokenin
Token ID259
Feature activation-1.95e+00

Pos logit contributions
bek+0.095
 Susan+0.092
 goodbye+0.091
ved+0.090
mare+0.090
Neg logit contributions
mo-0.015
bo-0.013
 camps-0.011
izzy-0.009
ucky-0.007
Tokenoc
Token ID420
Feature activation-5.70e+00

Pos logit contributions
bek+0.052
 Susan+0.051
 Katy+0.051
ved+0.050
 Steve+0.047
Neg logit contributions
bo-0.031
mo-0.029
 camps-0.026
 polar-0.025
 Arctic-0.024
Tokeneros
Token ID27498
Feature activation-1.10e+00

Pos logit contributions
bek+0.141
mare+0.135
 refused+0.133
 Susan+0.133
ved+0.130
Neg logit contributions
mo-0.094
bo-0.090
 camps-0.082
 polar-0.076
 Arctic-0.075
Token is
Token ID318
Feature activation+7.87e-01

Pos logit contributions
bek+0.026
 Susan+0.022
 Katy+0.022
mare+0.021
ved+0.021
Neg logit contributions
mo-0.023
bo-0.022
 camps-0.020
 end-0.018
chy-0.018
Token scared
Token ID12008
Feature activation-6.84e-01

Pos logit contributions
mo+0.015
 numb+0.014
bo+0.014
 chilly+0.014
 camps+0.014
Neg logit contributions
bek-0.020
 Exhibit-0.017
ved-0.016
anish-0.016
 vows-0.016
Token of
Token ID286
Feature activation-4.37e-01

Pos logit contributions
bek+0.016
 Exhibit+0.014
ved+0.014
 vows+0.013
mare+0.013
Neg logit contributions
mo-0.013
 camps-0.013
bo-0.013
 numb-0.012
ucky-0.012
Token new
Token ID649
Feature activation-5.57e-01

Pos logit contributions
bek+0.010
ved+0.008
mare+0.008
 Exhibit+0.008
 vows+0.008
Neg logit contributions
mo-0.009
 camps-0.009
bo-0.008
 Arctic-0.008
 polar-0.008
Token he
Token ID339
Feature activation+2.34e-01

Pos logit contributions
bek+0.004
ved+0.004
 Exhibit+0.004
rave+0.004
isdom+0.004
Neg logit contributions
mo-0.003
 camps-0.003
bo-0.003
 Arctic-0.002
 polar-0.002
Token would
Token ID561
Feature activation-8.51e-01

Pos logit contributions
mo+0.005
 camps+0.005
bo+0.005
izzy+0.005
 Arctic+0.005
Neg logit contributions
bek-0.004
 Exhibit-0.004
ved-0.004
mare-0.004
isdom-0.004
Token take
Token ID1011
Feature activation+5.55e-02

Pos logit contributions
bek+0.022
 Exhibit+0.020
ved+0.019
mare+0.019
isdom+0.019
Neg logit contributions
mo-0.013
 camps-0.013
bo-0.012
 numb-0.011
 toe-0.011
Token the
Token ID262
Feature activation+5.79e-01

Pos logit contributions
mo+0.001
bo+0.001
 camps+0.001
 numb+0.001
foot+0.001
Neg logit contributions
bek-0.001
 Exhibit-0.001
mare-0.001
ved-0.001
 refused-0.001
Token av
Token ID1196
Feature activation-2.92e+00

Pos logit contributions
mo+0.011
 camps+0.011
bo+0.011
izzy+0.010
 numb+0.010
Neg logit contributions
bek-0.012
 Exhibit-0.011
ved-0.011
 goodbye-0.010
 refused-0.010
Tokenoc
Token ID420
Feature activation-5.69e+00

Pos logit contributions
bek+0.055
ved+0.049
 Exhibit+0.049
mare+0.048
 refused+0.047
Neg logit contributions
mo-0.064
 camps-0.062
bo-0.060
izzy-0.057
 numb-0.057
Tokenados
Token ID22484
Feature activation-2.67e-01

Pos logit contributions
mare+0.109
bek+0.104
 refused+0.101
 won+0.099
ved+0.098
Neg logit contributions
mo-0.127
bo-0.120
 camps-0.111
ucky-0.103
 Arctic-0.101
Token and
Token ID290
Feature activation+1.20e-01

Pos logit contributions
bek+0.005
mare+0.005
 vows+0.005
 Exhibit+0.005
isdom+0.005
Neg logit contributions
 camps-0.006
mo-0.006
bo-0.005
izzy-0.005
bath-0.005
Token scatter
Token ID41058
Feature activation+8.48e-01

Pos logit contributions
 camps+0.002
mo+0.002
bo+0.002
 tul+0.002
 stream+0.002
Neg logit contributions
bek-0.003
mare-0.003
ved-0.003
 Exhibit-0.003
rave-0.003
Token them
Token ID606
Feature activation-3.17e-01

Pos logit contributions
mo+0.016
 camps+0.016
bo+0.015
 numb+0.014
izzy+0.014
Neg logit contributions
bek-0.019
mare-0.017
 Exhibit-0.017
ved-0.017
isdom-0.016
Token around
Token ID1088
Feature activation-4.98e-01

Pos logit contributions
bek+0.007
mare+0.007
 Exhibit+0.006
isdom+0.006
ved+0.006
Neg logit contributions
mo-0.006
 camps-0.006
bo-0.006
 numb-0.005
bath-0.005
Token left
Token ID1364
Feature activation-6.54e-01

Pos logit contributions
mo+0.006
bo+0.006
 camps+0.006
 numb+0.005
bath+0.005
Neg logit contributions
bek-0.007
mare-0.006
 Exhibit-0.006
isdom-0.005
adoes-0.005
Token behind
Token ID2157
Feature activation+5.21e-01

Pos logit contributions
bek+0.019
mare+0.017
 Exhibit+0.017
rav+0.017
 Sorry+0.017
Neg logit contributions
mo-0.008
 camps-0.007
bo-0.007
ucky-0.006
 end-0.006
Token.
Token ID13
Feature activation-2.48e-01

Pos logit contributions
mo+0.007
 camps+0.007
bo+0.006
 numb+0.006
 Arctic+0.006
Neg logit contributions
bek-0.015
 Exhibit-0.013
mare-0.013
ved-0.013
isdom-0.013
Token The
Token ID383
Feature activation+5.02e-01

Pos logit contributions
bek+0.005
ved+0.005
 Exhibit+0.005
mare+0.005
 vows+0.005
Neg logit contributions
mo-0.005
 camps-0.005
bo-0.004
izzy-0.004
 Arctic-0.004
Token av
Token ID1196
Feature activation-3.46e+00

Pos logit contributions
mo+0.009
 camps+0.009
bo+0.008
 End+0.008
 Arctic+0.008
Neg logit contributions
bek-0.012
ved-0.011
 Exhibit-0.011
mare-0.011
isdom-0.010
Tokenoc
Token ID420
Feature activation-5.69e+00

Pos logit contributions
bek+0.069
ved+0.063
 Exhibit+0.062
mare+0.062
 vows+0.061
Neg logit contributions
mo-0.072
bo-0.069
 camps-0.069
bath-0.062
 Arctic-0.062
Tokenados
Token ID22484
Feature activation-6.86e-01

Pos logit contributions
mare+0.121
bek+0.120
 refused+0.109
 won+0.107
ved+0.107
Neg logit contributions
mo-0.110
bo-0.105
ucky-0.096
 camps-0.094
 End-0.093
Token were
Token ID547
Feature activation+3.69e-01

Pos logit contributions
bek+0.016
mare+0.015
 Exhibit+0.015
isdom+0.014
 hi+0.014
Neg logit contributions
mo-0.012
bo-0.012
 camps-0.011
 end-0.011
bath-0.011
Token round
Token ID2835
Feature activation+9.32e-01

Pos logit contributions
mo+0.007
 camps+0.007
bo+0.007
 numb+0.007
 Arctic+0.006
Neg logit contributions
bek-0.008
 Exhibit-0.007
mare-0.007
ved-0.007
isdom-0.007
Token and
Token ID290
Feature activation+7.17e-01

Pos logit contributions
mo+0.022
 camps+0.022
bo+0.020
 numb+0.020
 Arctic+0.019
Neg logit contributions
bek-0.017
 Exhibit-0.014
mare-0.014
ved-0.014
isdom-0.014
Token green
Token ID4077
Feature activation+1.44e+00

Pos logit contributions
mo+0.009
 chilly+0.009
 tul+0.009
 camps+0.009
 numb+0.009
Neg logit contributions
bek-0.019
ved-0.019
mare-0.018
 Exhibit-0.018
isdom-0.017
Token\n
Token ID198
Feature activation+5.34e-03

Pos logit contributions
bek+0.008
 Exhibit+0.007
ved+0.007
 Susan+0.007
 Steve+0.007
Neg logit contributions
mo-0.008
 camps-0.007
 numb-0.007
bo-0.007
 Arctic-0.007
TokenThe
Token ID464
Feature activation-2.76e-01

Pos logit contributions
mo+0.000
 camps+0.000
bo+0.000
 numb+0.000
izzy+0.000
Neg logit contributions
bek-0.000
ved-0.000
 Exhibit-0.000
mare-0.000
isdom-0.000
Token bad
Token ID2089
Feature activation-1.70e+00

Pos logit contributions
bek+0.005
 Exhibit+0.005
 letter+0.004
 hello+0.004
 Sale+0.004
Neg logit contributions
mo-0.006
izzy-0.006
 camps-0.006
bath-0.006
foot-0.006
Token rh
Token ID9529
Feature activation-2.91e+00

Pos logit contributions
bek+0.040
ved+0.036
 Exhibit+0.036
mare+0.035
isdom+0.035
Neg logit contributions
mo-0.030
 camps-0.029
bo-0.027
 numb-0.026
izzy-0.026
Tokenin
Token ID259
Feature activation-2.68e+00

Pos logit contributions
bek+0.104
 goodbye+0.101
 Susan+0.101
 vows+0.101
mare+0.100
Neg logit contributions
mo-0.014
bo-0.012
 camps-0.009
izzy-0.007
ucky-0.006
Tokenoc
Token ID420
Feature activation-5.67e+00

Pos logit contributions
 Katy+0.074
 vows+0.072
 Susan+0.071
bek+0.071
 hi+0.070
Neg logit contributions
bo-0.039
mo-0.037
 camps-0.030
 polar-0.029
 end-0.029
Tokeneros
Token ID27498
Feature activation-9.51e-01

Pos logit contributions
bek+0.132
mare+0.125
 refused+0.124
 Susan+0.123
ved+0.120
Neg logit contributions
mo-0.100
bo-0.096
 camps-0.090
 polar-0.082
 Arctic-0.081
Token was
Token ID373
Feature activation+5.87e-01

Pos logit contributions
bek+0.024
 Exhibit+0.022
 ours+0.021
mare+0.021
 Sale+0.020
Neg logit contributions
bo-0.019
mo-0.017
 end-0.015
 toe-0.014
 ac-0.013
Token so
Token ID523
Feature activation-6.67e-01

Pos logit contributions
mo+0.011
bo+0.011
 numb+0.010
 camps+0.010
 chilly+0.010
Neg logit contributions
bek-0.014
 Exhibit-0.013
mare-0.012
 vows-0.012
isdom-0.012
Token happy
Token ID3772
Feature activation-4.67e-01

Pos logit contributions
bek+0.017
mare+0.015
 Exhibit+0.015
ved+0.015
 vows+0.015
Neg logit contributions
bo-0.011
mo-0.011
 chilly-0.010
 numb-0.010
 polar-0.010
Token.
Token ID13
Feature activation-7.21e-01

Pos logit contributions
bek+0.013
 vows+0.012
 Exhibit+0.012
mare+0.011
ved+0.011
Neg logit contributions
 camps-0.006
mo-0.006
foot-0.006
ucky-0.006
bo-0.006
Token mess
Token ID2085
Feature activation+2.98e-01

Pos logit contributions
mo+0.000
 camps+0.000
bo+0.000
 polar+0.000
 Arctic+0.000
Neg logit contributions
bek-0.000
ved-0.000
 Exhibit-0.000
 refused-0.000
rave-0.000
Token.
Token ID13
Feature activation-6.65e-02

Pos logit contributions
mo+0.004
bo+0.004
 camps+0.004
izzy+0.004
 numb+0.004
Neg logit contributions
bek-0.008
 Exhibit-0.007
ved-0.007
isdom-0.007
 vows-0.007
Token The
Token ID383
Feature activation+6.14e-01

Pos logit contributions
bek+0.002
ved+0.001
 Exhibit+0.001
mare+0.001
 refused+0.001
Neg logit contributions
mo-0.001
 camps-0.001
bo-0.001
izzy-0.001
 numb-0.001
Token rh
Token ID9529
Feature activation-2.66e+00

Pos logit contributions
mo+0.012
 camps+0.011
bo+0.011
 numb+0.011
izzy+0.011
Neg logit contributions
bek-0.013
ved-0.011
 Exhibit-0.011
 refused-0.011
mare-0.011
Tokenin
Token ID259
Feature activation-2.21e+00

Pos logit contributions
bek+0.096
 vows+0.092
 goodbye+0.092
 Susan+0.092
 Exhibit+0.091
Neg logit contributions
mo-0.014
bo-0.011
 camps-0.009
izzy-0.007
ucky-0.006
Tokenoc
Token ID420
Feature activation-5.64e+00

Pos logit contributions
bek+0.045
ved+0.041
 Exhibit+0.039
mare+0.039
 Susan+0.039
Neg logit contributions
mo-0.047
bo-0.045
 camps-0.045
 Arctic-0.041
 polar-0.041
Tokeneros
Token ID27498
Feature activation-1.73e+00

Pos logit contributions
bek+0.136
mare+0.130
 refused+0.127
ved+0.127
 Susan+0.126
Neg logit contributions
mo-0.098
bo-0.091
 camps-0.087
 polar-0.080
 ig-0.079
Token had
Token ID550
Feature activation-3.72e-01

Pos logit contributions
bek+0.041
mare+0.036
 Exhibit+0.036
isdom+0.035
ved+0.034
Neg logit contributions
mo-0.035
bo-0.033
 camps-0.029
 polar-0.028
 toe-0.027
Token destroyed
Token ID6572
Feature activation-4.53e-02

Pos logit contributions
bek+0.010
 Exhibit+0.009
ved+0.008
isdom+0.008
mare+0.008
Neg logit contributions
bo-0.006
 numb-0.006
mo-0.006
 camps-0.006
 chilly-0.006
Token her
Token ID607
Feature activation-3.81e-01

Pos logit contributions
bek+0.001
ved+0.001
mare+0.001
 Exhibit+0.001
isdom+0.001
Neg logit contributions
 camps-0.001
mo-0.001
 numb-0.001
 ig-0.001
 toe-0.001
Token room
Token ID2119
Feature activation-1.33e+00

Pos logit contributions
bek+0.008
 Exhibit+0.007
ved+0.007
 refused+0.007
 vows+0.007
Neg logit contributions
mo-0.008
 camps-0.007
bo-0.007
izzy-0.007
 numb-0.007
Token\n
Token ID198
Feature activation-1.98e-01

Pos logit contributions
bek+0.010
ved+0.009
 Exhibit+0.009
mare+0.008
 refused+0.008
Neg logit contributions
mo-0.009
 camps-0.009
bo-0.008
izzy-0.008
 numb-0.008
Token\n
Token ID198
Feature activation+1.84e-01

Pos logit contributions
 Steve+0.003
bek+0.003
 Susan+0.003
 Exhibit+0.003
 won+0.003
Neg logit contributions
mo-0.005
 camps-0.005
 numb-0.005
 colder-0.004
bo-0.004
TokenThe
Token ID464
Feature activation+1.53e-01

Pos logit contributions
mo+0.004
 camps+0.004
 numb+0.003
bo+0.003
izzy+0.003
Neg logit contributions
bek-0.004
ved-0.003
 Exhibit-0.003
 Steve-0.003
rave-0.003
Token rh
Token ID9529
Feature activation-2.85e+00

Pos logit contributions
izzy+0.004
mo+0.003
 numb+0.003
 camps+0.003
bath+0.003
Neg logit contributions
bek-0.003
 Exhibit-0.002
 won-0.002
isdom-0.002
 goodbye-0.002
Tokenin
Token ID259
Feature activation-2.42e+00

Pos logit contributions
bek+0.097
mare+0.093
 Susan+0.093
 vows+0.092
 goodbye+0.092
Neg logit contributions
mo-0.020
bo-0.016
 camps-0.016
izzy-0.011
 Arctic-0.011
Tokenoc
Token ID420
Feature activation-5.64e+00

Pos logit contributions
bek+0.057
ved+0.056
mare+0.055
 Katy+0.055
 vows+0.054
Neg logit contributions
mo-0.043
bo-0.042
 camps-0.038
 polar-0.036
 Arctic-0.035
Tokeneros
Token ID27498
Feature activation-1.17e+00

Pos logit contributions
bek+0.139
mare+0.136
 refused+0.129
ved+0.127
 Susan+0.126
Neg logit contributions
mo-0.094
bo-0.089
 camps-0.082
 polar-0.076
 Arctic-0.074
Token smiled
Token ID13541
Feature activation-9.45e-02

Pos logit contributions
bek+0.027
 Exhibit+0.026
mare+0.025
adoes+0.024
isdom+0.024
Neg logit contributions
bo-0.022
mo-0.022
 camps-0.019
 end-0.018
 toe-0.018
Token and
Token ID290
Feature activation-2.63e-01

Pos logit contributions
bek+0.002
 Exhibit+0.002
mare+0.002
ved+0.002
isdom+0.002
Neg logit contributions
mo-0.002
 camps-0.002
bo-0.002
 numb-0.002
 chilly-0.002
Token flew
Token ID13112
Feature activation+7.54e-01

Pos logit contributions
bek+0.007
 Exhibit+0.006
anish+0.006
isdom+0.006
adoes+0.006
Neg logit contributions
bo-0.004
 polar-0.004
 stream-0.004
 then-0.004
 bounce-0.004
Token away
Token ID1497
Feature activation-7.09e-01

Pos logit contributions
mo+0.013
bo+0.012
 camps+0.012
 numb+0.011
 chilly+0.011
Neg logit contributions
bek-0.019
 Exhibit-0.018
isdom-0.018
mare-0.017
 refused-0.016
Token animals
Token ID4695
Feature activation+1.05e-01

Pos logit contributions
bek+0.013
ved+0.012
mare+0.011
asure+0.011
iversity+0.011
Neg logit contributions
 camps-0.008
 ig-0.007
mo-0.007
 Arctic-0.007
 polar-0.007
Token.
Token ID13
Feature activation-1.60e-01

Pos logit contributions
mo+0.001
 camps+0.001
bo+0.001
bath+0.001
 numb+0.001
Neg logit contributions
bek-0.003
mare-0.003
 Exhibit-0.003
ved-0.003
isdom-0.003
Token The
Token ID383
Feature activation+7.24e-01

Pos logit contributions
bek+0.003
 Exhibit+0.003
ved+0.003
 Susan+0.003
mare+0.003
Neg logit contributions
mo-0.003
 camps-0.003
bo-0.003
 numb-0.003
izzy-0.003
Token rh
Token ID9529
Feature activation-2.45e+00

Pos logit contributions
mo+0.015
bo+0.014
 camps+0.014
 numb+0.014
izzy+0.014
Neg logit contributions
bek-0.014
 Exhibit-0.013
ved-0.013
 vows-0.013
 refused-0.012
Tokenin
Token ID259
Feature activation-1.81e+00

Pos logit contributions
 goodbye+0.091
bek+0.090
mare+0.089
 Susan+0.088
 vows+0.086
Neg logit contributions
mo-0.012
bo-0.007
izzy-0.005
 camps-0.005
ucky-0.004
Tokenoc
Token ID420
Feature activation-5.64e+00

Pos logit contributions
bek+0.043
ved+0.042
 Katy+0.041
 Susan+0.041
 hi+0.041
Neg logit contributions
mo-0.034
bo-0.032
 camps-0.029
 polar-0.028
 Arctic-0.028
Tokeneros
Token ID27498
Feature activation-1.34e+00

Pos logit contributions
bek+0.139
mare+0.137
 refused+0.132
 Susan+0.129
 Katy+0.127
Neg logit contributions
mo-0.096
bo-0.086
 camps-0.078
 ig-0.074
 Arctic-0.074
Token was
Token ID373
Feature activation+3.69e-01

Pos logit contributions
bek+0.032
 Exhibit+0.028
mare+0.027
 Sorry+0.027
 hi+0.027
Neg logit contributions
mo-0.028
bo-0.026
 camps-0.024
 Arctic-0.021
 polar-0.021
Token so
Token ID523
Feature activation-6.93e-01

Pos logit contributions
mo+0.007
 camps+0.007
bo+0.007
 numb+0.006
 Arctic+0.006
Neg logit contributions
bek-0.009
 Exhibit-0.008
isdom-0.007
ved-0.007
mare-0.007
Token excited
Token ID6568
Feature activation-1.49e+00

Pos logit contributions
bek+0.016
ved+0.014
mare+0.014
 Exhibit+0.014
isdom+0.014
Neg logit contributions
mo-0.013
bo-0.012
 chilly-0.012
 camps-0.012
 Arctic-0.012
Token to
Token ID284
Feature activation+8.16e-02

Pos logit contributions
bek+0.041
ved+0.035
 Exhibit+0.035
mare+0.034
 vows+0.033
Neg logit contributions
mo-0.022
 camps-0.022
ucky-0.021
bo-0.021
foot-0.020
Token\n
Token ID198
Feature activation-6.01e-01

Pos logit contributions
bek+0.046
 vows+0.042
ved+0.040
 refused+0.040
isdom+0.040
Neg logit contributions
bo-0.012
Sn-0.010
bath-0.010
 Polar-0.010
 stream-0.009
Token\n
Token ID198
Feature activation-3.32e-01

Pos logit contributions
bek+0.014
 Exhibit+0.012
ved+0.012
mare+0.012
isdom+0.012
Neg logit contributions
mo-0.011
 camps-0.011
 numb-0.010
bo-0.010
izzy-0.009
TokenThe
Token ID464
Feature activation-4.75e-01

Pos logit contributions
bek+0.009
ved+0.008
 refused+0.008
isdom+0.008
mare+0.008
Neg logit contributions
 camps-0.005
mo-0.005
bo-0.004
Sn-0.004
One-0.004
Token rh
Token ID9529
Feature activation-2.81e+00

Pos logit contributions
bek+0.008
 Exhibit+0.008
 letter+0.007
 hello+0.007
 letters+0.007
Neg logit contributions
mo-0.012
izzy-0.011
 numb-0.011
inn-0.010
bath-0.010
Tokenin
Token ID259
Feature activation-2.55e+00

Pos logit contributions
bek+0.095
 Susan+0.089
mare+0.089
 goodbye+0.089
ved+0.089
Neg logit contributions
mo-0.020
 camps-0.017
bo-0.017
izzy-0.014
Sn-0.012
Tokenoc
Token ID420
Feature activation-5.64e+00

Pos logit contributions
bek+0.057
ved+0.053
 Susan+0.052
 vows+0.051
 Katy+0.051
Neg logit contributions
mo-0.048
bo-0.047
 camps-0.045
 polar-0.042
 Arctic-0.040
Tokeneros
Token ID27498
Feature activation-1.30e+00

Pos logit contributions
bek+0.124
 refused+0.114
 Susan+0.114
mare+0.114
ved+0.112
Neg logit contributions
mo-0.107
bo-0.102
 camps-0.100
 polar-0.090
 Arctic-0.089
Token looked
Token ID3114
Feature activation-4.80e-01

Pos logit contributions
bek+0.030
 Sorry+0.028
 ours+0.028
 Exhibit+0.027
mare+0.027
Neg logit contributions
mo-0.024
bo-0.024
 camps-0.021
 gn-0.020
 toe-0.020
Token up
Token ID510
Feature activation-1.21e+00

Pos logit contributions
bek+0.011
 Exhibit+0.009
mare+0.009
isdom+0.009
 vows+0.009
Neg logit contributions
 camps-0.010
mo-0.010
bo-0.008
 chilly-0.008
 numb-0.008
Token,
Token ID11
Feature activation-1.19e+00

Pos logit contributions
bek+0.022
 Exhibit+0.019
mare+0.018
ved+0.018
isdom+0.018
Neg logit contributions
 camps-0.028
mo-0.028
bo-0.026
 numb-0.026
 chilly-0.025
Token hopeful
Token ID17836
Feature activation-4.50e-01

Pos logit contributions
bek+0.033
 Sale+0.030
 Exhibit+0.030
 vows+0.029
anish+0.029
Neg logit contributions
 camps-0.018
mo-0.017
 chilly-0.017
Where-0.017
 numb-0.016
Tokenhe
Token ID258
Feature activation-1.11e+00

Pos logit contributions
mo+0.005
bo+0.005
 camps+0.005
izzy+0.004
ucky+0.004
Neg logit contributions
bek-0.010
 Exhibit-0.009
isdom-0.009
ved-0.009
 Susan-0.009
Token."
Token ID526
Feature activation-8.14e-01

Pos logit contributions
bek+0.029
 Exhibit+0.025
 Susan+0.025
isdom+0.024
ved+0.024
Neg logit contributions
mo-0.018
 camps-0.017
bo-0.017
 toe-0.016
 numb-0.016
Token The
Token ID383
Feature activation-2.51e-01

Pos logit contributions
bek+0.023
ved+0.021
 vows+0.020
isdom+0.020
 Exhibit+0.020
Neg logit contributions
bo-0.011
mo-0.010
 Polar-0.010
ucky-0.009
Sn-0.009
Token rh
Token ID9529
Feature activation-2.67e+00

Pos logit contributions
bek+0.004
 Exhibit+0.004
 letter+0.004
 hello+0.004
 goodbye+0.004
Neg logit contributions
mo-0.006
izzy-0.006
 camps-0.005
 numb-0.005
inn-0.005
Tokenin
Token ID259
Feature activation-2.09e+00

Pos logit contributions
bek+0.092
 Susan+0.087
mare+0.087
 goodbye+0.086
 vows+0.086
Neg logit contributions
mo-0.017
bo-0.015
 camps-0.014
izzy-0.010
ucky-0.010
Tokenoc
Token ID420
Feature activation-5.63e+00

Pos logit contributions
bek+0.047
 Katy+0.043
ved+0.043
 Susan+0.043
 vows+0.042
Neg logit contributions
bo-0.040
mo-0.039
 camps-0.036
 polar-0.033
 Arctic-0.033
Tokeneros
Token ID27498
Feature activation-1.34e+00

Pos logit contributions
bek+0.134
mare+0.126
 refused+0.125
 Susan+0.125
 Katy+0.123
Neg logit contributions
mo-0.097
bo-0.094
 camps-0.087
 polar-0.081
 Arctic-0.080
Token felt
Token ID2936
Feature activation+1.46e+00

Pos logit contributions
bek+0.031
 Exhibit+0.028
 Katy+0.027
 Susan+0.027
isdom+0.027
Neg logit contributions
bo-0.028
mo-0.027
 camps-0.023
Sn-0.023
 toe-0.023
Token a
Token ID257
Feature activation-8.32e-01

Pos logit contributions
mo+0.038
 camps+0.036
izzy+0.034
bo+0.032
 Arctic+0.032
Neg logit contributions
 goodbye-0.023
 won-0.022
 refused-0.022
 hello-0.022
 its-0.021
Token little
Token ID1310
Feature activation+2.50e-01

Pos logit contributions
bek+0.015
 refused+0.013
 goodbye+0.013
ved+0.013
mare+0.013
Neg logit contributions
mo-0.020
 camps-0.018
bo-0.018
izzy-0.018
 Arctic-0.017
Token better
Token ID1365
Feature activation-7.14e-01

Pos logit contributions
mo+0.006
 camps+0.006
izzy+0.006
 Arctic+0.005
bo+0.005
Neg logit contributions
 goodbye-0.004
 refused-0.004
 won-0.004
 hello-0.004
bek-0.004
Token\n
Token ID198
Feature activation-1.99e-01

Pos logit contributions
bek+0.012
 Exhibit+0.011
ved+0.010
mare+0.010
 Susan+0.010
Neg logit contributions
mo-0.012
 camps-0.011
bo-0.011
 numb-0.011
izzy-0.010
Token\n
Token ID198
Feature activation+1.08e-02

Pos logit contributions
bek+0.004
 Steve+0.003
 Susan+0.003
 won+0.003
 Exhibit+0.003
Neg logit contributions
mo-0.005
 numb-0.004
 camps-0.004
 colder-0.004
 tul-0.004
TokenThe
Token ID464
Feature activation+5.52e-01

Pos logit contributions
mo+0.000
 camps+0.000
 numb+0.000
bo+0.000
izzy+0.000
Neg logit contributions
bek-0.000
 Exhibit-0.000
ved-0.000
 Steve-0.000
 Susan-0.000
Token rh
Token ID9529
Feature activation-2.36e+00

Pos logit contributions
mo+0.012
 numb+0.011
bo+0.011
izzy+0.011
 camps+0.011
Neg logit contributions
bek-0.010
 goodbye-0.009
 Exhibit-0.009
 vows-0.009
 refused-0.009
Tokenin
Token ID259
Feature activation-1.80e+00

Pos logit contributions
bek+0.082
mare+0.080
 Susan+0.080
 vows+0.079
 goodbye+0.078
Neg logit contributions
mo-0.015
bo-0.012
 camps-0.011
 polar-0.007
 End-0.006
Tokenoc
Token ID420
Feature activation-5.63e+00

Pos logit contributions
bek+0.039
ved+0.035
mare+0.035
 Susan+0.035
 Exhibit+0.034
Neg logit contributions
mo-0.036
bo-0.034
 camps-0.034
 Arctic-0.031
 polar-0.031
Tokeneros
Token ID27498
Feature activation-1.21e+00

Pos logit contributions
bek+0.136
mare+0.135
 refused+0.127
 Susan+0.127
ved+0.127
Neg logit contributions
mo-0.098
bo-0.089
 camps-0.086
 polar-0.079
 ig-0.079
Token was
Token ID373
Feature activation+7.52e-01

Pos logit contributions
bek+0.028
mare+0.026
 Exhibit+0.025
adoes+0.024
 Sorry+0.024
Neg logit contributions
mo-0.023
bo-0.023
 camps-0.021
 polar-0.019
 Arctic-0.019
Token so
Token ID523
Feature activation-4.04e-01

Pos logit contributions
mo+0.015
 camps+0.015
bo+0.014
 numb+0.014
 Arctic+0.014
Neg logit contributions
bek-0.016
 Exhibit-0.014
ved-0.014
mare-0.013
isdom-0.013
Token excited
Token ID6568
Feature activation-9.36e-01

Pos logit contributions
bek+0.008
ved+0.007
mare+0.007
 Exhibit+0.007
isdom+0.007
Neg logit contributions
mo-0.008
 camps-0.008
bo-0.008
 numb-0.008
 Arctic-0.008
Token.
Token ID13
Feature activation-3.85e-01

Pos logit contributions
bek+0.027
ved+0.023
 Exhibit+0.023
mare+0.023
rave+0.022
Neg logit contributions
 camps-0.013
mo-0.013
bo-0.012
ucky-0.012
foot-0.011
Token year
Token ID614
Feature activation-1.16e+00

Pos logit contributions
bek+0.029
 Exhibit+0.026
ved+0.026
isdom+0.026
 Susan+0.025
Neg logit contributions
mo-0.034
bo-0.032
 camps-0.032
izzy-0.030
 Arctic-0.029
Token old
Token ID1468
Feature activation-5.22e-02

Pos logit contributions
bek+0.029
ved+0.026
 Exhibit+0.026
mare+0.026
isdom+0.025
Neg logit contributions
mo-0.019
bo-0.018
 camps-0.018
izzy-0.017
 Arctic-0.016
Token and
Token ID290
Feature activation-2.93e-01

Pos logit contributions
bek+0.001
ved+0.001
 Exhibit+0.001
isdom+0.001
mare+0.001
Neg logit contributions
mo-0.001
 camps-0.001
bo-0.001
izzy-0.001
 Arctic-0.001
Token the
Token ID262
Feature activation+6.05e-01

Pos logit contributions
bek+0.007
ved+0.006
 Exhibit+0.006
isdom+0.006
 vows+0.006
Neg logit contributions
mo-0.005
 camps-0.005
bo-0.005
 chilly-0.005
 polar-0.005
Token shoe
Token ID17292
Feature activation+1.85e-01

Pos logit contributions
mo+0.013
 camps+0.013
 numb+0.012
bo+0.012
bath+0.012
Neg logit contributions
bek-0.012
 refused-0.010
 goodbye-0.010
 won-0.010
 Exhibit-0.010
Token turned
Token ID2900
Feature activation+2.10e+00

Pos logit contributions
mo+0.004
 camps+0.003
bo+0.003
izzy+0.003
bath+0.003
Neg logit contributions
bek-0.004
isdom-0.004
mare-0.004
 Exhibit-0.004
ved-0.003
Token left
Token ID1364
Feature activation-4.61e-01

Pos logit contributions
mo+0.042
 camps+0.041
bo+0.040
 numb+0.038
 chilly+0.038
Neg logit contributions
bek-0.046
 Exhibit-0.040
isdom-0.040
ved-0.040
mare-0.040
Token and
Token ID290
Feature activation+6.79e-01

Pos logit contributions
bek+0.015
isdom+0.014
 Exhibit+0.014
mare+0.013
ved+0.013
Neg logit contributions
 camps-0.004
mo-0.004
 chilly-0.003
 numb-0.003
bo-0.003
Token continued
Token ID3767
Feature activation+1.78e+00

Pos logit contributions
mo+0.013
 camps+0.013
bo+0.012
 numb+0.012
 chilly+0.012
Neg logit contributions
bek-0.015
isdom-0.014
ved-0.014
 Exhibit-0.013
mare-0.013
Token on
Token ID319
Feature activation+1.33e+00

Pos logit contributions
mo+0.041
 camps+0.041
 numb+0.039
bo+0.039
bath+0.037
Neg logit contributions
bek-0.032
ved-0.027
isdom-0.027
 Exhibit-0.027
mare-0.027
Token their
Token ID511
Feature activation+1.27e+00

Pos logit contributions
mo+0.031
bo+0.029
 camps+0.029
izzy+0.028
 numb+0.028
Neg logit contributions
bek-0.023
 Exhibit-0.020
ved-0.020
 Susan-0.019
isdom-0.019
Token the
Token ID262
Feature activation-6.70e-01

Pos logit contributions
bek+0.001
 Susan+0.001
mare+0.001
 Steve+0.001
 Exhibit+0.001
Neg logit contributions
mo-0.002
bo-0.002
 camps-0.002
 numb-0.002
izzy-0.002
Token pool
Token ID5933
Feature activation-8.48e-01

Pos logit contributions
 our+0.012
 won+0.012
 bye+0.012
mare+0.012
 hello+0.011
Neg logit contributions
mo-0.015
izzy-0.012
 numb-0.012
inn-0.012
oll-0.011
Token and
Token ID290
Feature activation+2.09e-01

Pos logit contributions
bek+0.016
 Exhibit+0.014
ved+0.014
isdom+0.014
 vows+0.013
Neg logit contributions
mo-0.018
 camps-0.018
bo-0.018
izzy-0.017
 numb-0.017
Token got
Token ID1392
Feature activation+1.23e+00

Pos logit contributions
mo+0.004
 camps+0.004
bo+0.004
izzy+0.004
 chilly+0.003
Neg logit contributions
bek-0.005
isdom-0.004
 Exhibit-0.004
ved-0.004
rave-0.004
Token all
Token ID477
Feature activation+9.10e-01

Pos logit contributions
mo+0.026
 camps+0.024
bo+0.023
izzy+0.021
 Arctic+0.021
Neg logit contributions
bek-0.024
 refused-0.022
ved-0.022
 goodbye-0.022
 won-0.022
Token wet
Token ID9583
Feature activation+2.62e+00

Pos logit contributions
mo+0.020
 camps+0.020
bo+0.018
 numb+0.018
Sn+0.017
Neg logit contributions
bek-0.016
mare-0.015
 Susan-0.015
 won-0.014
 refused-0.014
Token.
Token ID13
Feature activation+6.35e-01

Pos logit contributions
mo+0.053
 camps+0.048
izzy+0.047
bo+0.045
bath+0.044
Neg logit contributions
bek-0.048
adoes-0.047
 goodbye-0.046
ved-0.046
 won-0.046
Token When
Token ID1649
Feature activation+1.10e+00

Pos logit contributions
mo+0.013
 camps+0.012
bo+0.012
 numb+0.011
izzy+0.011
Neg logit contributions
bek-0.013
ved-0.012
 Exhibit-0.012
mare-0.012
 refused-0.012
Token she
Token ID673
Feature activation+2.01e+00

Pos logit contributions
mo+0.021
 camps+0.021
bo+0.020
 numb+0.019
izzy+0.019
Neg logit contributions
bek-0.024
ved-0.022
 Exhibit-0.021
isdom-0.021
mare-0.020
Token got
Token ID1392
Feature activation-2.27e-01

Pos logit contributions
mo+0.054
 camps+0.051
 numb+0.050
bo+0.049
Crunch+0.049
Neg logit contributions
bek-0.026
 refused-0.025
ved-0.024
 won-0.024
mare-0.024
Token out
Token ID503
Feature activation+1.27e-02

Pos logit contributions
 goodbye+0.002
bek+0.002
 won+0.002
 refused+0.002
 Susan+0.002
Neg logit contributions
mo-0.007
 camps-0.006
bo-0.006
Sn-0.006
Crunch-0.006
Token The
Token ID383
Feature activation+1.10e-01

Pos logit contributions
mo+0.005
 camps+0.005
bo+0.005
izzy+0.005
 Arctic+0.005
Neg logit contributions
bek-0.008
ved-0.007
 Exhibit-0.007
mare-0.007
 vows-0.007
Token mixer
Token ID33938
Feature activation-1.06e+00

Pos logit contributions
mo+0.002
 camps+0.002
bo+0.002
 numb+0.002
 chilly+0.002
Neg logit contributions
bek-0.003
ved-0.002
 Exhibit-0.002
isdom-0.002
mare-0.002
Token went
Token ID1816
Feature activation+2.85e-01

Pos logit contributions
bek+0.026
mare+0.024
isdom+0.024
 Exhibit+0.024
 Susan+0.022
Neg logit contributions
bo-0.019
mo-0.018
 toe-0.017
 camps-0.016
 end-0.016
Token round
Token ID2835
Feature activation+1.28e+00

Pos logit contributions
 camps+0.005
mo+0.005
 numb+0.005
bo+0.005
 chilly+0.005
Neg logit contributions
bek-0.007
isdom-0.007
 Exhibit-0.006
mare-0.006
 vows-0.006
Token and
Token ID290
Feature activation+6.70e-01

Pos logit contributions
mo+0.030
 camps+0.030
bo+0.028
 numb+0.027
izzy+0.027
Neg logit contributions
bek-0.023
 Exhibit-0.020
mare-0.019
ved-0.019
isdom-0.019
Token round
Token ID2835
Feature activation+2.09e+00

Pos logit contributions
mo+0.009
bo+0.008
 camps+0.008
 chilly+0.008
 numb+0.008
Neg logit contributions
bek-0.019
isdom-0.019
ved-0.018
 Exhibit-0.017
mare-0.017
Token,
Token ID11
Feature activation-5.84e-01

Pos logit contributions
mo+0.043
 camps+0.041
bo+0.040
izzy+0.038
 numb+0.038
Neg logit contributions
bek-0.043
ved-0.038
 Exhibit-0.038
 refused-0.037
mare-0.036
Token and
Token ID290
Feature activation-4.47e-01

Pos logit contributions
bek+0.016
isdom+0.015
 Exhibit+0.015
mare+0.015
 Sale+0.014
Neg logit contributions
 camps-0.008
mo-0.008
 numb-0.008
 toe-0.008
 chilly-0.008
Token the
Token ID262
Feature activation-3.22e-01

Pos logit contributions
bek+0.011
isdom+0.010
mare+0.010
 Exhibit+0.010
ved+0.010
Neg logit contributions
mo-0.008
 camps-0.007
bo-0.007
 toe-0.007
 colder-0.007
Token batter
Token ID4984
Feature activation+1.84e-01

Pos logit contributions
bek+0.008
ved+0.007
isdom+0.007
 Exhibit+0.007
mare+0.007
Neg logit contributions
mo-0.005
 camps-0.005
bo-0.005
 toe-0.005
 numb-0.005
Token became
Token ID2627
Feature activation+1.16e+00

Pos logit contributions
mo+0.004
 camps+0.003
bo+0.003
 numb+0.003
izzy+0.003
Neg logit contributions
bek-0.004
 Exhibit-0.004
isdom-0.004
mare-0.004
ved-0.003
Tokenpped
Token ID1496
Feature activation+1.02e+00

Pos logit contributions
 camps+0.017
mo+0.016
 Arctic+0.016
 tul+0.015
 particular+0.015
Neg logit contributions
ved-0.019
bek-0.017
 won-0.016
 refused-0.016
rave-0.016
Token on
Token ID319
Feature activation-9.87e-01

Pos logit contributions
mo+0.031
 camps+0.029
 Arctic+0.028
 tul+0.028
izzy+0.027
Neg logit contributions
 won-0.010
 its-0.009
bek-0.009
 refused-0.009
 goodbye-0.009
Token his
Token ID465
Feature activation+1.43e+00

Pos logit contributions
bek+0.015
ved+0.013
 goodbye+0.013
 Susan+0.013
 refused+0.013
Neg logit contributions
mo-0.025
 camps-0.024
izzy-0.023
bo-0.023
 Arctic-0.022
Token stick
Token ID4859
Feature activation+4.51e-01

Pos logit contributions
mo+0.029
 camps+0.028
bo+0.026
izzy+0.026
 Arctic+0.025
Neg logit contributions
bek-0.029
ved-0.027
 Exhibit-0.026
mare-0.026
 refused-0.026
Token and
Token ID290
Feature activation+1.26e-01

Pos logit contributions
mo+0.010
 camps+0.009
 Arctic+0.009
izzy+0.009
 polar+0.009
Neg logit contributions
bek-0.008
ved-0.008
 refused-0.008
 won-0.008
 goodbye-0.007
Token fell
Token ID3214
Feature activation+2.63e+00

Pos logit contributions
mo+0.003
 camps+0.003
bo+0.003
izzy+0.002
 numb+0.002
Neg logit contributions
bek-0.002
ved-0.002
 Exhibit-0.002
 refused-0.002
mare-0.002
Token down
Token ID866
Feature activation+7.84e-01

Pos logit contributions
mo+0.070
 camps+0.066
 Arctic+0.065
 End+0.063
 tul+0.061
Neg logit contributions
 won-0.034
 goodbye-0.031
bek-0.031
ved-0.031
 its-0.031
Token.
Token ID13
Feature activation+1.61e-01

Pos logit contributions
mo+0.015
 camps+0.014
bo+0.013
 Arctic+0.013
izzy+0.012
Neg logit contributions
bek-0.017
ved-0.016
 won-0.016
 refused-0.016
 goodbye-0.016
Token
Token ID220
Feature activation-5.19e-01

Pos logit contributions
mo+0.003
 camps+0.003
bo+0.003
 numb+0.002
izzy+0.002
Neg logit contributions
bek-0.004
ved-0.003
 Exhibit-0.003
mare-0.003
 vows-0.003
Token\n
Token ID198
Feature activation+8.65e-02

Pos logit contributions
bek+0.010
ved+0.009
 won+0.009
 Exhibit+0.009
 Susan+0.009
Neg logit contributions
mo-0.011
 camps-0.010
 numb-0.010
 tul-0.010
bo-0.009
Token\n
Token ID198
Feature activation+8.51e-01

Pos logit contributions
mo+0.002
 camps+0.002
 numb+0.002
 tul+0.002
bo+0.002
Neg logit contributions
bek-0.002
 Susan-0.001
 Steve-0.001
 won-0.001
 Exhibit-0.001
Token on
Token ID319
Feature activation+2.38e-01

Pos logit contributions
mo+0.041
 camps+0.039
bo+0.038
izzy+0.037
 numb+0.037
Neg logit contributions
bek-0.033
ved-0.030
 Exhibit-0.029
 won-0.028
 Susan-0.028
Token the
Token ID262
Feature activation+7.05e-02

Pos logit contributions
 camps+0.005
mo+0.005
bo+0.004
 numb+0.004
 Arctic+0.004
Neg logit contributions
bek-0.005
mare-0.004
 Exhibit-0.004
ved-0.004
isdom-0.004
Token monkey
Token ID21657
Feature activation-3.09e-01

Pos logit contributions
mo+0.001
bo+0.001
 camps+0.001
 numb+0.001
izzy+0.001
Neg logit contributions
bek-0.002
 goodbye-0.001
 Exhibit-0.001
 won-0.001
 vows-0.001
Token bars
Token ID9210
Feature activation-3.52e-01

Pos logit contributions
bek+0.007
ved+0.006
 Exhibit+0.006
mare+0.006
isdom+0.006
Neg logit contributions
mo-0.006
 camps-0.006
bo-0.006
 numb-0.005
izzy-0.005
Token and
Token ID290
Feature activation+1.12e-01

Pos logit contributions
bek+0.010
 Exhibit+0.009
isdom+0.009
mare+0.009
 vows+0.009
Neg logit contributions
 camps-0.004
mo-0.004
izzy-0.004
bo-0.004
bath-0.004
Token spun
Token ID26843
Feature activation+2.14e+00

Pos logit contributions
mo+0.002
 camps+0.002
 toe+0.002
 bounce+0.002
 stream+0.002
Neg logit contributions
bek-0.003
isdom-0.003
rave-0.003
 Exhibit-0.003
anish-0.003
Token on
Token ID319
Feature activation+3.62e-01

Pos logit contributions
mo+0.041
 camps+0.039
bo+0.037
izzy+0.037
 numb+0.036
Neg logit contributions
bek-0.045
ved-0.041
 Exhibit-0.040
 goodbye-0.040
 won-0.040
Token the
Token ID262
Feature activation+2.85e-01

Pos logit contributions
mo+0.008
 camps+0.008
bo+0.008
 numb+0.007
izzy+0.007
Neg logit contributions
bek-0.007
ved-0.006
 Exhibit-0.006
mare-0.006
isdom-0.006
Token sees
Token ID7224
Feature activation+2.81e-01

Pos logit contributions
mo+0.005
 camps+0.004
bo+0.004
 numb+0.004
izzy+0.004
Neg logit contributions
bek-0.007
 Exhibit-0.006
ved-0.006
 goodbye-0.006
 vows-0.006
Tokenaw
Token ID707
Feature activation-3.59e-03

Pos logit contributions
mo+0.007
 camps+0.007
bo+0.006
 numb+0.006
izzy+0.006
Neg logit contributions
bek-0.005
 Exhibit-0.004
mare-0.004
ved-0.004
isdom-0.004
Token.
Token ID13
Feature activation+3.51e-01

Pos logit contributions
bek+0.000
isdom+0.000
mare+0.000
 Exhibit+0.000
 Susan+0.000
Neg logit contributions
mo-0.000
 camps-0.000
bo-0.000
 numb-0.000
izzy-0.000
Token so
Token ID523
Feature activation-1.00e+00

Pos logit contributions
 numb+0.001
mo+0.000
 polar+0.000
 chilly+0.000
bo+0.000
Neg logit contributions
bek-0.001
isdom-0.001
rav-0.001
anish-0.001
 Exhibit-0.001
Token happy
Token ID3772
Feature activation-8.36e-01

Pos logit contributions
bek+0.026
ved+0.023
mare+0.023
isdom+0.022
 Exhibit+0.022
Neg logit contributions
 numb-0.017
bo-0.016
mo-0.016
 polar-0.016
 chilly-0.016
Token and
Token ID290
Feature activation-8.19e-01

Pos logit contributions
bek+0.024
ved+0.021
mare+0.020
 vows+0.020
 Exhibit+0.020
Neg logit contributions
 camps-0.012
mo-0.011
foot-0.011
 stream-0.010
ucky-0.010
Token not
Token ID407
Feature activation+5.87e-02

Pos logit contributions
anish+0.023
bek+0.022
isdom+0.022
OO+0.022
ved+0.021
Neg logit contributions
 polar-0.012
 stream-0.012
 then-0.011
 chilly-0.011
 colder-0.010
Token embarrassed
Token ID21100
Feature activation+1.86e-01

Pos logit contributions
 numb+0.001
mo+0.001
 chilly+0.001
 polar+0.001
 colder+0.001
Neg logit contributions
bek-0.001
isdom-0.001
ved-0.001
 Sale-0.001
 Exhibit-0.001
Token to
Token ID284
Feature activation-4.62e-01

Pos logit contributions
mo+0.002
ucky+0.002
 camps+0.002
 numb+0.002
bo+0.002
Neg logit contributions
bek-0.006
mare-0.005
 Exhibit-0.005
isdom-0.005
ved-0.005
Token ask
Token ID1265
Feature activation-6.17e-01

Pos logit contributions
bek+0.011
 Exhibit+0.010
ved+0.010
mare+0.009
isdom+0.009
Neg logit contributions
mo-0.008
 camps-0.008
bo-0.008
 numb-0.007
izzy-0.007
Token for
Token ID329
Feature activation-1.03e+00

Pos logit contributions
bek+0.015
mare+0.013
 Exhibit+0.013
ved+0.013
 vows+0.013
Neg logit contributions
mo-0.011
 camps-0.010
 end-0.010
Where-0.009
ucky-0.009
Token help
Token ID1037
Feature activation-8.52e-01

Pos logit contributions
bek+0.034
ved+0.031
 Exhibit+0.031
mare+0.031
rave+0.030
Neg logit contributions
 numb-0.009
 camps-0.009
 polar-0.008
 end-0.007
mo-0.007
Token.
Token ID13
Feature activation-2.54e-01

Pos logit contributions
bek+0.023
 Exhibit+0.021
ved+0.021
mare+0.021
 vows+0.021
Neg logit contributions
mo-0.012
 camps-0.011
bo-0.011
 numb-0.010
ucky-0.010
Token Together
Token ID17083
Feature activation+3.68e-01

Pos logit contributions
bek+0.007
ved+0.006
 vows+0.006
isdom+0.006
mare+0.006
Neg logit contributions
mo-0.003
 camps-0.003
bo-0.003
ucky-0.003
 Polar-0.003
Token them
Token ID606
Feature activation-2.98e-01

Pos logit contributions
bek+0.027
isdom+0.022
 Exhibit+0.022
mare+0.022
ved+0.022
Neg logit contributions
mo-0.016
 camps-0.014
bo-0.014
Where-0.013
Sn-0.013
Token all
Token ID477
Feature activation-6.94e-01

Pos logit contributions
bek+0.009
mare+0.008
isdom+0.008
 Exhibit+0.007
ved+0.007
Neg logit contributions
 camps-0.004
mo-0.004
bo-0.003
 numb-0.003
One-0.003
Token.
Token ID13
Feature activation+7.80e-02

Pos logit contributions
bek+0.022
 Exhibit+0.020
ved+0.020
mare+0.020
isdom+0.020
Neg logit contributions
mo-0.007
 camps-0.007
bo-0.006
 end-0.006
 Arctic-0.006
Token Mom
Token ID11254
Feature activation+3.30e-01

Pos logit contributions
mo+0.002
 camps+0.002
bo+0.002
 numb+0.002
 colder+0.002
Neg logit contributions
bek-0.001
 Susan-0.001
 Steve-0.001
 hello-0.001
 won-0.001
Token said
Token ID531
Feature activation-9.38e-01

Pos logit contributions
mo+0.008
 camps+0.008
bo+0.007
izzy+0.007
 Arctic+0.007
Neg logit contributions
bek-0.006
ved-0.005
 Exhibit-0.005
isdom-0.005
mare-0.005
Token they
Token ID484
Feature activation-3.02e-01

Pos logit contributions
bek+0.023
rave+0.018
isdom+0.018
ved+0.018
 Exhibit+0.018
Neg logit contributions
mo-0.019
 tul-0.018
 camps-0.018
Sn-0.017
 polar-0.017
Token were
Token ID547
Feature activation+4.25e-01

Pos logit contributions
bek+0.006
ved+0.005
 Exhibit+0.005
mare+0.005
isdom+0.005
Neg logit contributions
mo-0.007
 camps-0.006
bo-0.006
izzy-0.006
 numb-0.006
Token here
Token ID994
Feature activation-1.38e+00

Pos logit contributions
mo+0.008
 camps+0.008
 numb+0.008
 polar+0.008
bo+0.007
Neg logit contributions
bek-0.010
ved-0.008
isdom-0.008
 Exhibit-0.008
mare-0.008
Token to
Token ID284
Feature activation-8.01e-01

Pos logit contributions
bek+0.029
ved+0.026
 Exhibit+0.025
isdom+0.025
mare+0.025
Neg logit contributions
 camps-0.028
mo-0.026
 chilly-0.026
One-0.026
 numb-0.025
Token adopt
Token ID11206
Feature activation-1.77e+00

Pos logit contributions
bek+0.021
 Exhibit+0.018
ved+0.018
 refused+0.018
mare+0.017
Neg logit contributions
mo-0.013
 camps-0.012
bo-0.012
 numb-0.011
 Arctic-0.011
Token one
Token ID530
Feature activation+5.54e-02

Pos logit contributions
bek+0.057
 refused+0.047
 Exhibit+0.047
ved+0.047
isdom+0.046
Neg logit contributions
bo-0.022
 camps-0.019
mo-0.019
Where-0.018
One-0.018
Token the
Token ID262
Feature activation+2.16e-02

Pos logit contributions
mo+0.003
 camps+0.003
bo+0.003
 numb+0.002
 tul+0.002
Neg logit contributions
bek-0.004
ved-0.004
 Exhibit-0.004
mare-0.004
isdom-0.004
Token neighbor
Token ID4780
Feature activation+4.94e-01

Pos logit contributions
mo+0.000
 camps+0.000
bo+0.000
 numb+0.000
 Arctic+0.000
Neg logit contributions
bek-0.000
ved-0.000
 Exhibit-0.000
isdom-0.000
mare-0.000
Token.
Token ID13
Feature activation+3.88e-01

Pos logit contributions
mo+0.008
 camps+0.008
bo+0.007
 numb+0.007
izzy+0.007
Neg logit contributions
bek-0.012
ved-0.011
 Exhibit-0.011
 refused-0.011
 won-0.011
Token She
Token ID1375
Feature activation-2.85e-01

Pos logit contributions
bo+0.004
Where+0.004
 numb+0.004
One+0.004
Sn+0.004
Neg logit contributions
bek-0.012
ved-0.011
 Exhibit-0.011
 vows-0.011
isdom-0.010
Token hugs
Token ID40657
Feature activation-1.90e-01

Pos logit contributions
bek+0.005
ved+0.005
mare+0.004
 Exhibit+0.004
isdom+0.004
Neg logit contributions
mo-0.007
 camps-0.006
bo-0.006
izzy-0.006
 numb-0.006
Token Tom
Token ID4186
Feature activation+5.05e-01

Pos logit contributions
bek+0.003
ved+0.003
 Exhibit+0.002
 Susan+0.002
mare+0.002
Neg logit contributions
mo-0.005
 camps-0.005
bo-0.004
 numb-0.004
izzy-0.004
Token and
Token ID290
Feature activation-2.94e-01

Pos logit contributions
mo+0.011
bo+0.010
 camps+0.010
 numb+0.009
ucky+0.009
Neg logit contributions
bek-0.011
 Exhibit-0.010
mare-0.009
ved-0.009
isdom-0.009
Token takes
Token ID2753
Feature activation+6.30e-01

Pos logit contributions
bek+0.006
ved+0.005
 Exhibit+0.005
mare+0.005
isdom+0.004
Neg logit contributions
mo-0.007
 camps-0.007
bo-0.006
 numb-0.006
izzy-0.006
Token him
Token ID683
Feature activation+5.93e-01

Pos logit contributions
mo+0.013
 camps+0.013
bo+0.012
 numb+0.012
izzy+0.012
Neg logit contributions
bek-0.012
ved-0.011
 Exhibit-0.011
mare-0.011
 refused-0.011
Token inside
Token ID2641
Feature activation-1.60e-01

Pos logit contributions
mo+0.013
 camps+0.013
bo+0.012
 numb+0.012
bath+0.012
Neg logit contributions
bek-0.012
 Exhibit-0.010
mare-0.010
isdom-0.010
 vows-0.010
Token.
Token ID13
Feature activation+4.64e-01

Pos logit contributions
bek+0.005
 Exhibit+0.005
ved+0.005
isdom+0.005
mare+0.005
Neg logit contributions
 camps-0.002
mo-0.001
 numb-0.001
 chilly-0.001
 toe-0.001
Token heard
Token ID2982
Feature activation+6.38e-02

Pos logit contributions
mo+0.020
 camps+0.019
bo+0.019
 numb+0.018
izzy+0.018
Neg logit contributions
bek-0.015
 Exhibit-0.012
ved-0.012
mare-0.012
isdom-0.012
Token a
Token ID257
Feature activation-6.58e-01

Pos logit contributions
 camps+0.001
bo+0.001
 polar+0.001
mo+0.001
 toe+0.001
Neg logit contributions
bek-0.002
ved-0.002
 Exhibit-0.002
mare-0.002
 vows-0.001
Token loud
Token ID7812
Feature activation-2.16e-01

Pos logit contributions
bek+0.011
 Exhibit+0.009
ved+0.009
mare+0.009
 refused+0.009
Neg logit contributions
mo-0.016
 camps-0.016
bo-0.015
izzy-0.015
 numb-0.015
Token noise
Token ID7838
Feature activation-1.43e-01

Pos logit contributions
bek+0.005
ved+0.004
 Exhibit+0.004
 vows+0.004
 refused+0.004
Neg logit contributions
mo-0.004
bo-0.004
 camps-0.004
 polar-0.004
 numb-0.004
Token.
Token ID13
Feature activation-6.48e-02

Pos logit contributions
 about+0.002
 Steve+0.002
 hello+0.002
 Susan+0.002
bek+0.002
Neg logit contributions
mo-0.004
 camps-0.004
Crunch-0.003
 Arctic-0.003
bo-0.003
Token It
Token ID632
Feature activation-6.00e-01

Pos logit contributions
bek+0.002
ved+0.002
 vows+0.002
isdom+0.002
 culture+0.002
Neg logit contributions
 Polar-0.001
 One-0.001
 Arctic-0.001
bo-0.001
 polar-0.001
Token was
Token ID373
Feature activation-1.63e-01

Pos logit contributions
bek+0.014
isdom+0.013
ved+0.012
 Exhibit+0.012
 goodbye+0.012
Neg logit contributions
bo-0.011
mo-0.011
 camps-0.010
 polar-0.010
Where-0.010
Token his
Token ID465
Feature activation-9.55e-01

Pos logit contributions
bek+0.003
ved+0.003
 Exhibit+0.003
mare+0.003
isdom+0.003
Neg logit contributions
mo-0.003
 camps-0.003
bo-0.003
 numb-0.003
 Arctic-0.003
Token little
Token ID1310
Feature activation+3.36e-01

Pos logit contributions
bek+0.019
ved+0.016
 Exhibit+0.016
mare+0.016
isdom+0.016
Neg logit contributions
mo-0.021
 camps-0.020
bo-0.019
 numb-0.019
izzy-0.018
Token sister
Token ID6621
Feature activation-3.44e-01

Pos logit contributions
mo+0.007
 camps+0.007
bo+0.007
Sn+0.007
izzy+0.006
Neg logit contributions
bek-0.006
 refused-0.006
 Exhibit-0.005
 Steve-0.005
 goodbye-0.005
Token playing
Token ID2712
Feature activation-5.03e-01

Pos logit contributions
bek+0.007
ved+0.006
 Exhibit+0.006
mare+0.006
isdom+0.006
Neg logit contributions
mo-0.007
 camps-0.007
bo-0.007
 numb-0.006
izzy-0.006
Token\n
Token ID198
Feature activation+2.06e-01

Pos logit contributions
bek+0.010
ved+0.009
 Susan+0.009
 Exhibit+0.009
 won+0.008
Neg logit contributions
mo-0.009
 camps-0.009
 numb-0.008
bo-0.008
 Arctic-0.008
Token\n
Token ID198
Feature activation+9.78e-01

Pos logit contributions
mo+0.004
 camps+0.004
 numb+0.004
bo+0.004
 Arctic+0.004
Neg logit contributions
bek-0.004
 Susan-0.004
 won-0.004
 Exhibit-0.004
ved-0.004
TokenL
Token ID43
Feature activation+9.06e-01

Pos logit contributions
mo+0.021
 camps+0.020
 numb+0.018
 polar+0.018
 Arctic+0.018
Neg logit contributions
bek-0.021
 refused-0.018
ved-0.018
 Susan-0.018
 won-0.018
Tokenily
Token ID813
Feature activation-4.30e-02

Pos logit contributions
 numb+0.016
 camps+0.016
mo+0.016
bo+0.015
 Arctic+0.014
Neg logit contributions
bek-0.021
ved-0.020
 refused-0.019
mare-0.019
isdom-0.019
Token's
Token ID338
Feature activation+1.60e-01

Pos logit contributions
bek+0.001
 Exhibit+0.001
mare+0.001
ved+0.001
 vows+0.001
Neg logit contributions
bo-0.001
mo-0.001
 camps-0.001
Sn-0.001
One-0.001
Token mom
Token ID1995
Feature activation+4.11e-01

Pos logit contributions
mo+0.004
 camps+0.004
bo+0.004
bath+0.004
izzy+0.004
Neg logit contributions
bek-0.002
 goodbye-0.002
 refused-0.002
 vows-0.002
ved-0.002
Tokenmy
Token ID1820
Feature activation+4.34e-01

Pos logit contributions
 camps+0.006
mo+0.006
 numb+0.005
bo+0.005
 Arctic+0.005
Neg logit contributions
bek-0.011
 refused-0.010
ved-0.010
 won-0.010
 Susan-0.010
Token explained
Token ID4893
Feature activation-1.03e+00

Pos logit contributions
mo+0.010
bo+0.010
 camps+0.009
izzy+0.009
 polar+0.009
Neg logit contributions
bek-0.008
 Exhibit-0.007
ved-0.007
 vows-0.007
mare-0.007
Token that
Token ID326
Feature activation+2.98e-01

Pos logit contributions
bek+0.034
 Exhibit+0.030
mare+0.029
ved+0.029
boxes+0.028
Neg logit contributions
ucky-0.010
 tul-0.010
 end-0.010
mo-0.009
 sometimes-0.009
Token the
Token ID262
Feature activation+6.64e-02

Pos logit contributions
 tul+0.005
 camps+0.005
mo+0.005
 chilly+0.004
 numb+0.004
Neg logit contributions
bek-0.008
ved-0.007
 Exhibit-0.007
mare-0.007
rave-0.007
Token shrimp
Token ID30060
Feature activation-1.10e+00

Pos logit contributions
mo+0.001
 camps+0.001
bo+0.001
izzy+0.001
 numb+0.001
Neg logit contributions
bek-0.001
 Exhibit-0.001
 vows-0.001
 refused-0.001
ved-0.001
Token inside
Token ID2641
Feature activation-9.15e-01

Pos logit contributions
bek+0.011
ved+0.010
 Exhibit+0.010
isdom+0.010
mare+0.010
Neg logit contributions
mo-0.011
 camps-0.011
bo-0.010
 numb-0.010
izzy-0.010
Token -
Token ID532
Feature activation-7.03e-01

Pos logit contributions
bek+0.027
ved+0.025
 vows+0.025
 Exhibit+0.025
mare+0.024
Neg logit contributions
 camps-0.010
mo-0.010
 Arctic-0.009
bo-0.009
 ig-0.008
Token a
Token ID257
Feature activation-1.31e+00

Pos logit contributions
bek+0.018
isdom+0.016
mare+0.016
ved+0.016
 vows+0.016
Neg logit contributions
 camps-0.012
mo-0.010
Where-0.010
 polar-0.010
 chilly-0.010
Token big
Token ID1263
Feature activation-8.79e-01

Pos logit contributions
bek+0.030
ved+0.027
 Exhibit+0.027
isdom+0.026
mare+0.026
Neg logit contributions
mo-0.023
 camps-0.023
bo-0.021
 numb-0.021
izzy-0.020
Token pile
Token ID14540
Feature activation-1.70e-01

Pos logit contributions
bek+0.020
ved+0.019
 Exhibit+0.018
isdom+0.017
 vows+0.017
Neg logit contributions
 camps-0.016
mo-0.016
 polar-0.015
bo-0.014
 Arctic-0.014
Token of
Token ID286
Feature activation-2.08e+00

Pos logit contributions
bek+0.004
ved+0.003
 Exhibit+0.003
mare+0.003
 refused+0.003
Neg logit contributions
mo-0.003
 camps-0.003
bo-0.003
 numb-0.003
izzy-0.003
Token warm
Token ID5814
Feature activation-2.80e-01

Pos logit contributions
bek+0.044
ved+0.040
 Exhibit+0.039
mare+0.038
isdom+0.038
Neg logit contributions
mo-0.041
 camps-0.040
bo-0.038
 numb-0.036
izzy-0.036
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
bek+0.007
ved+0.006
 Exhibit+0.006
 refused+0.006
mare+0.006
Neg logit contributions
mo-0.005
 camps-0.005
bo-0.004
 numb-0.004
 Arctic-0.004
Token shiny
Token ID22441
Feature activation+3.79e-01

Pos logit contributions
bek+0.029
ved+0.028
 Exhibit+0.027
isdom+0.027
mare+0.027
Neg logit contributions
 camps-0.015
mo-0.014
 chilly-0.013
 numb-0.013
 polar-0.013
Token gold
Token ID3869
Feature activation+3.45e-01

Pos logit contributions
mo+0.007
 camps+0.007
bo+0.007
 numb+0.006
izzy+0.006
Neg logit contributions
bek-0.008
ved-0.008
 Exhibit-0.007
mare-0.007
isdom-0.007
Token!
Token ID0
Feature activation-7.37e-01

Pos logit contributions
mo+0.007
 camps+0.007
bo+0.007
 numb+0.006
izzy+0.006
Neg logit contributions
bek-0.007
ved-0.006
 Exhibit-0.006
mare-0.006
isdom-0.006
Token\n
Token ID198
Feature activation+2.18e-01

Pos logit contributions
bek+0.011
mare+0.010
 goodbye+0.010
 Susan+0.010
 won+0.010
Neg logit contributions
mo-0.012
 camps-0.011
bo-0.010
 numb-0.010
 Arctic-0.010
Token\n
Token ID198
Feature activation+6.45e-01

Pos logit contributions
mo+0.006
 camps+0.006
 numb+0.006
 tul+0.006
 colder+0.006
Neg logit contributions
bek-0.003
 its-0.003
 won-0.002
 Steve-0.002
 Susan-0.002
Token"
Token ID1
Feature activation+8.82e-01

Pos logit contributions
mo+0.018
 camps+0.017
 numb+0.017
 Arctic+0.016
 tul+0.016
Neg logit contributions
bek-0.010
ved-0.007
mare-0.007
 won-0.007
 Exhibit-0.007
TokenNo
Token ID2949
Feature activation-1.56e+00

Pos logit contributions
mo+0.024
 camps+0.023
 Arctic+0.021
 polar+0.021
 ig+0.021
Neg logit contributions
bek-0.015
ved-0.012
 refused-0.012
 hello-0.011
rave-0.011
Token,
Token ID11
Feature activation+1.16e-01

Pos logit contributions
bek+0.027
ved+0.024
 Exhibit+0.023
mare+0.023
isdom+0.022
Neg logit contributions
mo-0.037
 camps-0.037
bo-0.035
 numb-0.034
izzy-0.034
Token you
Token ID345
Feature activation-1.34e+00

Pos logit contributions
mo+0.002
 camps+0.002
bo+0.002
 numb+0.002
 toe+0.002
Neg logit contributions
bek-0.002
 Exhibit-0.002
ved-0.002
rave-0.002
mare-0.002
Token can
Token ID460
Feature activation-7.34e-02

Pos logit contributions
bek+0.033
mare+0.029
ved+0.029
 vows+0.029
 Exhibit+0.028
Neg logit contributions
mo-0.022
 camps-0.022
bo-0.021
izzy-0.020
ucky-0.019
Token't
Token ID470
Feature activation+9.27e-01

Pos logit contributions
 won+0.001
bek+0.001
 refused+0.001
ved+0.001
 vows+0.001
Neg logit contributions
mo-0.002
 camps-0.002
 Arctic-0.002
 colder-0.002
 polar-0.002
Token!"
Token ID2474
Feature activation+2.45e-01

Pos logit contributions
mo+0.023
 camps+0.022
 Arctic+0.021
 colder+0.020
 numb+0.020
Neg logit contributions
bek-0.015
 won-0.015
ved-0.014
isdom-0.014
 Steve-0.014
Token Lily
Token ID20037
Feature activation+5.96e-01

Pos logit contributions
mo+0.004
 camps+0.004
bo+0.004
izzy+0.004
 numb+0.004
Neg logit contributions
bek-0.006
 Exhibit-0.005
ved-0.005
isdom-0.005
mare-0.005
Token said
Token ID531
Feature activation-2.26e-01

Pos logit contributions
bo+0.010
Sn+0.009
mo+0.009
izzy+0.008
Crunch+0.008
Neg logit contributions
bek-0.016
 Exhibit-0.014
isdom-0.014
mare-0.014
 culture-0.014
Token.
Token ID13
Feature activation-2.38e-01

Pos logit contributions
mo+0.015
 camps+0.014
 numb+0.014
 colder+0.014
Crunch+0.014
Neg logit contributions
bek-0.016
mare-0.014
 Exhibit-0.014
ess-0.014
ved-0.014
Token Grace
Token ID16156
Feature activation-6.51e-01

Pos logit contributions
mare+0.003
 goodbye+0.003
 Susan+0.003
 hello+0.003
 Katy+0.003
Neg logit contributions
mo-0.006
bo-0.006
 camps-0.005
 numb-0.005
ipples-0.005
Token was
Token ID373
Feature activation+3.42e-01

Pos logit contributions
bek+0.010
ved+0.009
 Exhibit+0.009
mare+0.008
isdom+0.008
Neg logit contributions
mo-0.016
 camps-0.016
bo-0.016
izzy-0.015
 Arctic-0.015
Token always
Token ID1464
Feature activation-5.37e-01

Pos logit contributions
mo+0.009
 camps+0.008
bo+0.008
bath+0.008
 numb+0.008
Neg logit contributions
bek-0.005
mare-0.005
 Susan-0.005
 refused-0.004
 Exhibit-0.004
Token very
Token ID845
Feature activation-4.37e-01

Pos logit contributions
bek+0.014
 vows+0.013
ved+0.013
isdom+0.013
 Exhibit+0.012
Neg logit contributions
 chilly-0.009
 numb-0.008
mo-0.008
 toe-0.008
bo-0.008
Token careful
Token ID8161
Feature activation-1.25e+00

Pos logit contributions
bek+0.010
ved+0.009
 Exhibit+0.009
isdom+0.008
mare+0.008
Neg logit contributions
mo-0.008
 camps-0.008
bo-0.008
 numb-0.008
izzy-0.007
Token when
Token ID618
Feature activation+7.08e-01

Pos logit contributions
bek+0.029
ved+0.025
 Exhibit+0.024
 vows+0.024
mare+0.023
Neg logit contributions
mo-0.023
bo-0.023
 numb-0.022
 chilly-0.022
 camps-0.022
Token playing
Token ID2712
Feature activation+8.82e-02

Pos logit contributions
mo+0.013
 camps+0.013
 numb+0.012
bo+0.012
 Arctic+0.012
Neg logit contributions
bek-0.016
ved-0.014
 Exhibit-0.014
isdom-0.013
mare-0.013
Token in
Token ID287
Feature activation-4.35e-01

Pos logit contributions
mo+0.002
 camps+0.002
bo+0.001
 numb+0.001
izzy+0.001
Neg logit contributions
bek-0.002
ved-0.002
 Exhibit-0.002
mare-0.002
isdom-0.002
Token the
Token ID262
Feature activation-4.31e-01

Pos logit contributions
 its+0.003
 your+0.003
 our+0.003
 Susan+0.003
 Sarah+0.002
Neg logit contributions
mo-0.016
bo-0.015
oll-0.014
 camps-0.014
inn-0.014
Token garden
Token ID11376
Feature activation+4.08e-01

Pos logit contributions
 your+0.006
mare+0.006
 won+0.006
 bye+0.006
 hello+0.006
Neg logit contributions
mo-0.010
izzy-0.009
inn-0.009
 numb-0.009
bo-0.008
Token from
Token ID422
Feature activation-1.82e-01

Pos logit contributions
bek+0.005
 Exhibit+0.004
ved+0.004
isdom+0.004
mare+0.004
Neg logit contributions
mo-0.004
 camps-0.004
bo-0.004
bath-0.004
izzy-0.004
Token the
Token ID262
Feature activation-2.92e-01

Pos logit contributions
bek+0.004
 Exhibit+0.003
mare+0.003
ved+0.003
isdom+0.003
Neg logit contributions
mo-0.004
 camps-0.004
bo-0.003
 numb-0.003
 toe-0.003
Token snack
Token ID26906
Feature activation-6.97e-01

Pos logit contributions
bek+0.008
ved+0.007
isdom+0.007
mare+0.007
 Susan+0.007
Neg logit contributions
 camps-0.005
 end-0.004
 chilly-0.004
 End-0.004
 bottom-0.004
Token stand
Token ID1302
Feature activation+3.69e-01

Pos logit contributions
bek+0.017
 Exhibit+0.015
isdom+0.015
ved+0.015
rave+0.015
Neg logit contributions
 camps-0.012
ucky-0.011
izzy-0.011
mo-0.011
bo-0.011
Token to
Token ID284
Feature activation-1.68e-01

Pos logit contributions
mo+0.006
bo+0.006
 camps+0.006
bath+0.006
 numb+0.005
Neg logit contributions
bek-0.009
 Exhibit-0.008
mare-0.008
isdom-0.008
 Sale-0.008
Token celebrate
Token ID10648
Feature activation-1.06e+00

Pos logit contributions
bek+0.005
 Exhibit+0.004
ved+0.004
rave+0.004
mare+0.004
Neg logit contributions
 toe-0.002
 end-0.002
 dip-0.002
 bounce-0.002
 chew-0.002
Token and
Token ID290
Feature activation-1.07e-01

Pos logit contributions
bek+0.032
 Exhibit+0.031
ved+0.030
mare+0.029
 refused+0.029
Neg logit contributions
 end-0.011
 particular-0.010
Where-0.009
 camps-0.009
 toe-0.009
Token ran
Token ID4966
Feature activation+1.06e+00

Pos logit contributions
bek+0.003
 Exhibit+0.002
rave+0.002
isdom+0.002
ved+0.002
Neg logit contributions
 toe-0.002
 bounce-0.002
mo-0.002
bo-0.002
 dip-0.002
Token around
Token ID1088
Feature activation-6.75e-01

Pos logit contributions
 camps+0.019
bo+0.019
mo+0.018
foot+0.017
inn+0.017
Neg logit contributions
bek-0.025
isdom-0.023
 Exhibit-0.022
mare-0.021
 refused-0.021
Token the
Token ID262
Feature activation-3.66e-01

Pos logit contributions
bek+0.017
isdom+0.015
 Exhibit+0.015
mare+0.015
ved+0.014
Neg logit contributions
 camps-0.011
mo-0.011
foot-0.010
 toe-0.010
Where-0.010
Token stadium
Token ID10308
Feature activation-1.65e+00

Pos logit contributions
bek+0.009
ved+0.008
isdom+0.008
 Susan+0.008
 refused+0.008
Neg logit contributions
 camps-0.006
 end-0.005
 Arctic-0.005
One-0.005
mo-0.005
Token dog
Token ID3290
Feature activation-1.43e+00

Pos logit contributions
mo+0.006
 camps+0.006
bo+0.006
 numb+0.006
izzy+0.006
Neg logit contributions
bek-0.007
ved-0.007
 Exhibit-0.007
mare-0.006
isdom-0.006
Token was
Token ID373
Feature activation+4.05e-01

Pos logit contributions
bek+0.036
isdom+0.031
mare+0.031
boxes+0.029
 Susan+0.029
Neg logit contributions
mo-0.028
bo-0.026
 camps-0.025
 bounce-0.023
chy-0.022
Token the
Token ID262
Feature activation-3.54e-01

Pos logit contributions
mo+0.007
bo+0.007
 camps+0.007
 chilly+0.006
 numb+0.006
Neg logit contributions
bek-0.010
 Exhibit-0.009
isdom-0.009
ved-0.009
mare-0.009
Token second
Token ID1218
Feature activation+1.17e-01

Pos logit contributions
bek+0.008
ved+0.007
 Exhibit+0.007
 refused+0.007
mare+0.007
Neg logit contributions
mo-0.006
 camps-0.006
bo-0.006
izzy-0.006
 numb-0.006
Token fastest
Token ID14162
Feature activation-6.87e-01

Pos logit contributions
mo+0.002
 camps+0.002
bo+0.002
 numb+0.002
izzy+0.002
Neg logit contributions
bek-0.003
ved-0.002
 Exhibit-0.002
mare-0.002
 refused-0.002
Token,
Token ID11
Feature activation-1.14e+00

Pos logit contributions
bek+0.016
 Exhibit+0.014
ved+0.014
mare+0.014
isdom+0.014
Neg logit contributions
mo-0.013
 camps-0.012
bo-0.012
izzy-0.011
 numb-0.011
Token but
Token ID475
Feature activation+6.89e-01

Pos logit contributions
bek+0.028
 Exhibit+0.026
ved+0.025
 vows+0.025
mare+0.025
Neg logit contributions
mo-0.019
 camps-0.018
 chilly-0.018
 bounce-0.018
bo-0.018
Token he
Token ID339
Feature activation+2.49e-01

Pos logit contributions
mo+0.010
 chilly+0.009
 bounce+0.009
 toe+0.008
bo+0.008
Neg logit contributions
 Exhibit-0.018
bek-0.018
isdom-0.018
 vows-0.017
mare-0.017
Token didn
Token ID1422
Feature activation-2.98e-01

Pos logit contributions
mo+0.005
bo+0.005
 camps+0.004
 toe+0.004
 bounce+0.004
Neg logit contributions
bek-0.006
 Exhibit-0.005
mare-0.005
isdom-0.005
adoes-0.005
Token't
Token ID470
Feature activation-1.32e-01

Pos logit contributions
bek+0.004
 won+0.004
 Steve+0.004
ved+0.004
 Susan+0.004
Neg logit contributions
mo-0.008
 camps-0.008
 Arctic-0.007
 polar-0.007
 numb-0.007
Token complete
Token ID1844
Feature activation-1.73e-01

Pos logit contributions
bek+0.003
ved+0.003
 Exhibit+0.003
 refused+0.003
mare+0.002
Neg logit contributions
mo-0.003
 camps-0.002
bo-0.002
 numb-0.002
izzy-0.002
Token arrival
Token ID10325
Feature activation+6.70e-01

Pos logit contributions
mo+0.027
 camps+0.026
bo+0.025
 numb+0.024
 Arctic+0.024
Neg logit contributions
bek-0.020
ved-0.017
 Exhibit-0.017
mare-0.016
isdom-0.016
Token.
Token ID13
Feature activation-4.29e-01

Pos logit contributions
 camps+0.006
mo+0.006
bo+0.006
ucky+0.005
 numb+0.005
Neg logit contributions
bek-0.021
ved-0.019
rave-0.019
 Exhibit-0.019
mare-0.019
Token The
Token ID383
Feature activation+1.33e-01

Pos logit contributions
bek+0.010
ved+0.009
 Exhibit+0.008
 refused+0.008
mare+0.008
Neg logit contributions
mo-0.008
 camps-0.008
bo-0.007
izzy-0.007
 Arctic-0.007
Token rh
Token ID9529
Feature activation-2.73e+00

Pos logit contributions
mo+0.002
izzy+0.002
 numb+0.002
foot+0.002
bath+0.002
Neg logit contributions
bek-0.003
 Exhibit-0.003
 letter-0.003
isdom-0.003
 won-0.003
Tokenin
Token ID259
Feature activation-2.29e+00

Pos logit contributions
bek+0.093
mare+0.088
 Susan+0.087
 vows+0.087
 Exhibit+0.087
Neg logit contributions
mo-0.019
 camps-0.016
bo-0.016
izzy-0.012
ucky-0.011
Tokenoc
Token ID420
Feature activation-5.59e+00

Pos logit contributions
bek+0.058
ved+0.057
 Katy+0.057
 vows+0.056
 Susan+0.055
Neg logit contributions
mo-0.037
bo-0.036
 camps-0.033
 polar-0.032
 Arctic-0.031
Tokeneros
Token ID27498
Feature activation-1.13e+00

Pos logit contributions
bek+0.131
mare+0.123
 refused+0.120
ved+0.118
 Susan+0.118
Neg logit contributions
mo-0.099
bo-0.094
 camps-0.090
 polar-0.083
 Arctic-0.083
Token was
Token ID373
Feature activation+6.85e-01

Pos logit contributions
bek+0.028
 Exhibit+0.025
 hi+0.025
 hello+0.024
 ours+0.024
Neg logit contributions
mo-0.021
bo-0.021
 camps-0.019
 end-0.018
 Arctic-0.017
Token no
Token ID645
Feature activation+7.61e-01

Pos logit contributions
mo+0.013
bo+0.013
 camps+0.013
 numb+0.012
 chilly+0.012
Neg logit contributions
bek-0.015
 Exhibit-0.014
ved-0.013
isdom-0.013
 vows-0.013
Token longer
Token ID2392
Feature activation-1.45e+00

Pos logit contributions
mo+0.011
izzy+0.011
bo+0.010
 camps+0.010
bath+0.010
Neg logit contributions
 refused-0.019
bek-0.018
 goodbye-0.017
 won-0.017
ved-0.017
Token hungry
Token ID14720
Feature activation+1.50e-01

Pos logit contributions
bek+0.044
 Exhibit+0.041
 Sale+0.039
 vows+0.039
ved+0.038
Neg logit contributions
mo-0.019
 chilly-0.019
 numb-0.018
bo-0.017
 colder-0.017
Token\n
Token ID198
Feature activation+5.34e-03

Pos logit contributions
bek+0.008
 Exhibit+0.007
ved+0.007
 Susan+0.007
 Steve+0.007
Neg logit contributions
mo-0.008
 camps-0.007
 numb-0.007
bo-0.007
 Arctic-0.007
TokenThe
Token ID464
Feature activation-2.76e-01

Pos logit contributions
mo+0.000
 camps+0.000
bo+0.000
 numb+0.000
izzy+0.000
Neg logit contributions
bek-0.000
ved-0.000
 Exhibit-0.000
mare-0.000
isdom-0.000
Token bad
Token ID2089
Feature activation-1.70e+00

Pos logit contributions
bek+0.005
 Exhibit+0.005
 letter+0.004
 hello+0.004
 Sale+0.004
Neg logit contributions
mo-0.006
izzy-0.006
 camps-0.006
bath-0.006
foot-0.006
Token rh
Token ID9529
Feature activation-2.91e+00

Pos logit contributions
bek+0.040
ved+0.036
 Exhibit+0.036
mare+0.035
isdom+0.035
Neg logit contributions
mo-0.030
 camps-0.029
bo-0.027
 numb-0.026
izzy-0.026
Tokenin
Token ID259
Feature activation-2.68e+00

Pos logit contributions
bek+0.104
 goodbye+0.101
 Susan+0.101
 vows+0.101
mare+0.100
Neg logit contributions
mo-0.014
bo-0.012
 camps-0.009
izzy-0.007
ucky-0.006
Tokenoc
Token ID420
Feature activation-5.67e+00

Pos logit contributions
 Katy+0.074
 vows+0.072
 Susan+0.071
bek+0.071
 hi+0.070
Neg logit contributions
bo-0.039
mo-0.037
 camps-0.030
 polar-0.029
 end-0.029
Tokeneros
Token ID27498
Feature activation-9.51e-01

Pos logit contributions
bek+0.132
mare+0.125
 refused+0.124
 Susan+0.123
ved+0.120
Neg logit contributions
mo-0.100
bo-0.096
 camps-0.090
 polar-0.082
 Arctic-0.081
Token was
Token ID373
Feature activation+5.87e-01

Pos logit contributions
bek+0.024
 Exhibit+0.022
 ours+0.021
mare+0.021
 Sale+0.020
Neg logit contributions
bo-0.019
mo-0.017
 end-0.015
 toe-0.014
 ac-0.013
Token so
Token ID523
Feature activation-6.67e-01

Pos logit contributions
mo+0.011
bo+0.011
 numb+0.010
 camps+0.010
 chilly+0.010
Neg logit contributions
bek-0.014
 Exhibit-0.013
mare-0.012
 vows-0.012
isdom-0.012
Token happy
Token ID3772
Feature activation-4.67e-01

Pos logit contributions
bek+0.017
mare+0.015
 Exhibit+0.015
ved+0.015
 vows+0.015
Neg logit contributions
bo-0.011
mo-0.011
 chilly-0.010
 numb-0.010
 polar-0.010
Token.
Token ID13
Feature activation-7.21e-01

Pos logit contributions
bek+0.013
 vows+0.012
 Exhibit+0.012
mare+0.011
ved+0.011
Neg logit contributions
 camps-0.006
mo-0.006
foot-0.006
ucky-0.006
bo-0.006
Token elephant
Token ID20950
Feature activation-6.78e-01

Pos logit contributions
bek+0.014
 Susan+0.013
isdom+0.013
 Steve+0.013
 Exhibit+0.013
Neg logit contributions
 numb-0.020
izzy-0.019
mo-0.019
 camps-0.019
bath-0.019
Token saw
Token ID2497
Feature activation-2.30e-01

Pos logit contributions
bek+0.015
 vows+0.012
 Exhibit+0.012
ved+0.012
rave+0.012
Neg logit contributions
mo-0.014
 camps-0.013
bo-0.013
 Arctic-0.012
izzy-0.012
Token the
Token ID262
Feature activation-3.46e-01

Pos logit contributions
bek+0.005
ved+0.004
 Exhibit+0.004
mare+0.004
 vows+0.004
Neg logit contributions
mo-0.005
 camps-0.005
bo-0.004
 Arctic-0.004
izzy-0.004
Token rh
Token ID9529
Feature activation-1.88e+00

Pos logit contributions
bek+0.008
ved+0.007
 Exhibit+0.007
mare+0.007
isdom+0.007
Neg logit contributions
mo-0.006
 camps-0.006
bo-0.006
 numb-0.006
izzy-0.006
Tokenin
Token ID259
Feature activation-1.62e+00

Pos logit contributions
bek+0.063
mare+0.059
 goodbye+0.059
ved+0.059
 vows+0.059
Neg logit contributions
mo-0.014
 camps-0.012
bo-0.012
izzy-0.010
ucky-0.009
Tokenoc
Token ID420
Feature activation-4.27e+00

Pos logit contributions
bek+0.036
ved+0.033
 refused+0.032
isdom+0.031
mare+0.031
Neg logit contributions
mo-0.031
 camps-0.031
bo-0.030
 Arctic-0.028
 polar-0.028
Tokeneros
Token ID27498
Feature activation-4.80e-01

Pos logit contributions
bek+0.101
mare+0.091
 refused+0.090
ved+0.090
 Susan+0.087
Neg logit contributions
mo-0.078
 camps-0.075
bo-0.074
 Arctic-0.068
 polar-0.067
Token z
Token ID1976
Feature activation+1.97e-01

Pos logit contributions
bek+0.010
ved+0.009
mare+0.009
 Exhibit+0.009
isdom+0.008
Neg logit contributions
mo-0.010
 camps-0.010
bo-0.009
 Arctic-0.009
 numb-0.009
Tokenooming
Token ID30602
Feature activation+5.71e-01

Pos logit contributions
mo+0.003
 camps+0.003
bo+0.003
izzy+0.002
 numb+0.002
Neg logit contributions
bek-0.005
 Exhibit-0.005
 Susan-0.005
mare-0.005
 refused-0.005
Token and
Token ID290
Feature activation-2.23e-01

Pos logit contributions
bo+0.007
 camps+0.007
mo+0.007
 chilly+0.006
 numb+0.006
Neg logit contributions
bek-0.017
isdom-0.016
mare-0.016
 vows-0.015
 Exhibit-0.015
Token said
Token ID531
Feature activation-1.96e-01

Pos logit contributions
bek+0.006
isdom+0.005
 vows+0.005
mare+0.005
ved+0.005
Neg logit contributions
mo-0.004
 stream-0.004
 camps-0.003
bo-0.003
 polar-0.003
Token give
Token ID1577
Feature activation-7.73e-01

Pos logit contributions
bek+0.015
 Exhibit+0.013
ved+0.013
mare+0.013
isdom+0.013
Neg logit contributions
mo-0.010
 camps-0.010
bo-0.009
 numb-0.009
izzy-0.009
Token to
Token ID284
Feature activation-6.17e-01

Pos logit contributions
bek+0.016
 Exhibit+0.014
mare+0.014
ved+0.014
isdom+0.013
Neg logit contributions
mo-0.016
bo-0.016
 camps-0.015
 numb-0.014
bath-0.014
Token the
Token ID262
Feature activation+5.27e-02

Pos logit contributions
bek+0.014
 Exhibit+0.012
ved+0.012
mare+0.012
rave+0.012
Neg logit contributions
mo-0.012
 camps-0.012
bo-0.011
 toe-0.011
 numb-0.011
Token rh
Token ID9529
Feature activation-2.21e+00

Pos logit contributions
mo+0.001
 camps+0.001
bo+0.001
 numb+0.001
izzy+0.001
Neg logit contributions
bek-0.001
ved-0.001
 Exhibit-0.001
mare-0.001
isdom-0.001
Tokenin
Token ID259
Feature activation-1.19e+00

Pos logit contributions
bek+0.075
 Susan+0.073
mare+0.072
ved+0.071
 refused+0.071
Neg logit contributions
mo-0.015
bo-0.013
 camps-0.012
izzy-0.008
ucky-0.008
Tokenoc
Token ID420
Feature activation-5.02e+00

Pos logit contributions
bek+0.025
ved+0.022
 Exhibit+0.021
 refused+0.021
mare+0.021
Neg logit contributions
mo-0.024
 camps-0.024
bo-0.023
 Arctic-0.022
bath-0.021
Tokeneros
Token ID27498
Feature activation-3.28e-01

Pos logit contributions
bek+0.113
mare+0.109
 Susan+0.105
 refused+0.104
ved+0.103
Neg logit contributions
mo-0.095
bo-0.088
 camps-0.087
 Arctic-0.078
 ig-0.077
Token.
Token ID13
Feature activation-3.91e-01

Pos logit contributions
bek+0.008
 Exhibit+0.007
mare+0.007
ved+0.007
rave+0.007
Neg logit contributions
mo-0.006
 camps-0.005
bo-0.005
bath-0.005
 Arctic-0.005
Token Lily
Token ID20037
Feature activation-1.62e-01

Pos logit contributions
bek+0.009
ved+0.008
 Exhibit+0.008
 refused+0.008
mare+0.008
Neg logit contributions
mo-0.007
 camps-0.007
bo-0.007
izzy-0.006
 Arctic-0.006
Token walked
Token ID6807
Feature activation-7.37e-02

Pos logit contributions
bek+0.004
 Exhibit+0.003
ved+0.003
mare+0.003
 Susan+0.003
Neg logit contributions
mo-0.003
bo-0.003
 camps-0.003
izzy-0.003
 Arctic-0.003
Token to
Token ID284
Feature activation-1.13e+00

Pos logit contributions
bek+0.002
isdom+0.002
 Exhibit+0.002
 Sorry+0.002
 vows+0.002
Neg logit contributions
 camps-0.001
mo-0.001
One-0.001
bo-0.001
inn-0.001
Token and
Token ID290
Feature activation-3.29e-01

Pos logit contributions
 camps+0.002
bo+0.002
mo+0.001
 numb+0.001
 Arctic+0.001
Neg logit contributions
bek-0.002
ved-0.002
isdom-0.002
 Exhibit-0.002
rave-0.002
Token took
Token ID1718
Feature activation+2.63e-01

Pos logit contributions
bek+0.009
isdom+0.009
 Exhibit+0.009
anish+0.009
rave+0.009
Neg logit contributions
 then-0.004
 colder-0.004
 toe-0.004
 chilly-0.004
 least-0.004
Token the
Token ID262
Feature activation+9.14e-01

Pos logit contributions
mo+0.005
bo+0.005
 camps+0.005
 numb+0.005
 chilly+0.005
Neg logit contributions
bek-0.006
 Exhibit-0.005
isdom-0.005
ved-0.005
mare-0.005
Token rh
Token ID9529
Feature activation-1.93e+00

Pos logit contributions
mo+0.017
 camps+0.016
bo+0.016
izzy+0.015
 numb+0.015
Neg logit contributions
bek-0.020
 Exhibit-0.018
ved-0.018
mare-0.018
 refused-0.018
Tokenin
Token ID259
Feature activation-8.72e-01

Pos logit contributions
bek+0.066
 Susan+0.063
mare+0.062
ved+0.062
 refused+0.062
Neg logit contributions
mo-0.012
bo-0.011
 camps-0.011
izzy-0.008
 numb-0.007
Tokenoc
Token ID420
Feature activation-4.01e+00

Pos logit contributions
bek+0.017
ved+0.015
 refused+0.015
isdom+0.015
 Exhibit+0.015
Neg logit contributions
mo-0.019
 camps-0.018
bo-0.018
 Arctic-0.017
 numb-0.017
Tokeneros
Token ID27498
Feature activation+2.51e-02

Pos logit contributions
bek+0.081
ved+0.071
mare+0.071
 refused+0.071
 Susan+0.070
Neg logit contributions
mo-0.083
 camps-0.080
bo-0.079
 numb-0.073
 Arctic-0.073
Token home
Token ID1363
Feature activation-2.45e-01

Pos logit contributions
mo+0.001
 camps+0.001
bo+0.001
 numb+0.001
izzy+0.001
Neg logit contributions
bek-0.000
ved-0.000
 Exhibit-0.000
mare-0.000
 refused-0.000
Token with
Token ID351
Feature activation+1.28e+00

Pos logit contributions
bek+0.008
 Exhibit+0.008
isdom+0.008
 Steve+0.008
mare+0.008
Neg logit contributions
 numb-0.002
 camps-0.001
bo-0.001
 chilly-0.001
bath-0.001
Token her
Token ID607
Feature activation+4.66e-01

Pos logit contributions
mo+0.028
 camps+0.027
bo+0.026
 numb+0.025
izzy+0.025
Neg logit contributions
bek-0.025
ved-0.021
 Exhibit-0.021
mare-0.021
isdom-0.021
Token.
Token ID13
Feature activation+8.01e-02

Pos logit contributions
mo+0.008
 camps+0.008
bo+0.008
 numb+0.007
izzy+0.007
Neg logit contributions
bek-0.011
ved-0.010
 Exhibit-0.010
mare-0.010
isdom-0.009