TOP ACTIVATIONS
MAX = 4.745

coal melted and made the room warm and cozy .
the box and they pulled out a new toy car model
soon he made the book case taller ! His family cheered
table and the daddy got out a score . Everyone was
all colorful and made the house look nice . Billy was
. The decorations made the room look so fun and colourful
worked hard to make the house look pretty , but they
, Lily ! Your room looks great ! You 're so
saw that his room was indeed tidy . He smiled and
parked the car and got out . They took their bags
, and it made the room warm and cozy . Grand
colorful books that made the room look beautiful .
it ! You made your house successful ." Joe
our stuff and make our room nice ," Tim said .
candles because they made his room smell nice . One day
s artwork made the house grow brighter .
they wanted to make the house even more beautiful . So
the office , they filled out some forms . Then they
light . It made the room shine with stars and moons
been clever and made the room tidy . Her mom my

MIN ACTIVATIONS
MIN = -6.891

animals , like lions , ze br as , and gir
, gir aff es and ze br as . They took
, gir aff es , ze br as , and more
of animals like lions , ze br as , and gir
animals , like lions , ze br as , and gir
animals , like lions , ze br as , and gir
lions , monkeys , and ze br as . Sara liked
, tigers , monkeys and ze br as . He was
lions , monkeys , and ze br as . But their
animals , like lions , ze br as , gir aff
animals , like monkeys , ze br as and gir aff
animals , like lions , ze br as , and gir
lions , monkeys , and ze br as . But their
of animals like lions , ze br as , and gir
lions , monkeys , and ze br as . But the
animals , like lions , ze br as , and gir
butterflies and lady bugs and cater pill ars ."
lions , monkeys , and ze br as .
aff es , and fast ze br as . Tim was
like monkeys and elephants and gir aff es . They had

INTERVAL 1.836 - 4.745
CONTAINS 2.618%

" I liked it very much , mom ,"
and had a great time at the picnic . In the
and tell it how pretty it was . And even though
Lily and her family went on a trip to a far
took time and patience to grow . She learned that sometimes

INTERVAL -1.073 - 1.836
CONTAINS 85.788%

was warm and the rain was soft . The seed became
find a treasure . As she walked , she saw a
He shouted out loud , " Tom , let 's ignore
went back and sat in her chair , happy that the
smiled at Lucy , asking her to go first . She

INTERVAL -3.982 - -1.073
CONTAINS 11.587%

No , we can â
! <|endoftext|> Once upon a time , there was a little
had so much energy to spare . She loved to run
" It 's not safe to go near the z ebra
listen to her mom and trust her decisions . <|endoftext|> Once

INTERVAL -6.891 - -3.982
CONTAINS 0.007%

teeth . " Ro ar !" the le opard
took his hat from the ostr ich 's be ak and
. One day , while Mitt ens was eating her pasta
garden . She saw a cater pillar crawling on a leaf
that ducks were better than ge ese . They looked for
Token coal
Token ID5655
Feature activation+1.45e+00

Pos logit contributions
 entire+0.023
 un+0.022
 pack+0.021
 open+0.020
 opened+0.019
Neg logit contributions
orting-0.068
 conclusions-0.067
 mosquitoes-0.066
aro-0.065
ican-0.064
Token melted
Token ID24178
Feature activation+1.33e+00

Pos logit contributions
 entire+0.023
 un+0.022
 pack+0.021
pack+0.019
 open+0.018
Neg logit contributions
 mosquitoes-0.049
 hawk-0.049
orting-0.048
aro-0.047
 conclusions-0.047
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
 entire+0.024
 un+0.023
 pack+0.022
 opened+0.021
 open+0.020
Neg logit contributions
orting-0.045
 conclusions-0.044
 mosquitoes-0.044
 course-0.042
 hawk-0.042
Token made
Token ID925
Feature activation+2.05e+00

Pos logit contributions
 entire+0.026
 pack+0.023
 un+0.022
pack+0.022
 whole+0.021
Neg logit contributions
 mosquitoes-0.046
 conclusions-0.044
 hawk-0.043
aro-0.043
orting-0.043
Token the
Token ID262
Feature activation+2.13e+00

Pos logit contributions
 entire+0.033
 pack+0.033
 un+0.031
pack+0.029
 opened+0.028
Neg logit contributions
 mosquitoes-0.072
 conclusions-0.070
orting-0.068
 course-0.067
 hawk-0.065
Token room
Token ID2119
Feature activation+4.74e+00

Pos logit contributions
 entire+0.037
 un+0.036
 pack+0.035
 open+0.033
 opened+0.033
Neg logit contributions
orting-0.073
 conclusions-0.073
 mosquitoes-0.071
aro-0.070
ican-0.068
Token warm
Token ID5814
Feature activation+2.85e+00

Pos logit contributions
ta+0.090
 entire+0.089
 Riding+0.087
pack+0.083
ate+0.082
Neg logit contributions
bs-0.129
 conclusions-0.125
 anymore-0.120
orting-0.120
 injuries-0.119
Token and
Token ID290
Feature activation+2.15e+00

Pos logit contributions
 un+0.072
 pack+0.070
pack+0.070
 unlocked+0.069
 entire+0.068
Neg logit contributions
orting-0.074
 mosquitoes-0.070
 conclusions-0.069
 course-0.068
 anymore-0.067
Token cozy
Token ID37438
Feature activation+1.72e+00

Pos logit contributions
 entire+0.042
pack+0.039
 pack+0.037
ate+0.036
 un+0.035
Neg logit contributions
 mosquitoes-0.065
 conclusions-0.063
 caution-0.060
 hawk-0.060
 injuries-0.060
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
 un+0.039
 pack+0.038
 entire+0.038
 unlocked+0.037
 opened+0.036
Neg logit contributions
 conclusions-0.047
orting-0.046
 mosquitoes-0.045
 hawk-0.043
 course-0.042
Token
Token ID220
Feature activation+5.60e-01

Pos logit contributions
orting+0.005
 course+0.005
 conclusions+0.005
aro+0.005
crow+0.004
Neg logit contributions
 entire-0.004
 un-0.004
 pack-0.004
 open-0.004
 whole-0.004
Token the
Token ID262
Feature activation+1.79e+00

Pos logit contributions
 entire+0.039
 un+0.034
 pack+0.033
 whole+0.030
 opened+0.029
Neg logit contributions
orting-0.089
 conclusions-0.089
 course-0.087
 mosquitoes-0.087
aro-0.086
Token box
Token ID3091
Feature activation+1.86e+00

Pos logit contributions
 entire+0.041
 un+0.040
 pack+0.040
 opened+0.037
 was+0.037
Neg logit contributions
orting-0.047
 conclusions-0.046
 course-0.046
 hawk-0.043
 mosquitoes-0.043
Token and
Token ID290
Feature activation+1.81e+00

Pos logit contributions
 entire+0.041
 un+0.038
 pack+0.038
Y+0.036
 whole+0.035
Neg logit contributions
orting-0.052
 course-0.047
bs-0.043
 anymore-0.043
ion-0.043
Token they
Token ID484
Feature activation+1.84e+00

Pos logit contributions
 entire+0.037
 whole+0.029
 pack+0.024
 un+0.022
aming+0.022
Neg logit contributions
orting-0.056
bs-0.054
 course-0.054
aro-0.054
 conclusions-0.052
Token pulled
Token ID5954
Feature activation+1.26e+00

Pos logit contributions
 entire+0.038
 whole+0.028
 pack+0.025
 smells+0.025
 contents+0.024
Neg logit contributions
bs-0.057
 course-0.052
orting-0.052
aro-0.052
 caution-0.048
Token out
Token ID503
Feature activation+4.74e+00

Pos logit contributions
 entire+0.025
 pack+0.022
 un+0.021
 opened+0.020
 whole+0.020
Neg logit contributions
orting-0.039
 conclusions-0.038
 mosquitoes-0.037
 course-0.037
 caution-0.036
Token a
Token ID257
Feature activation+2.12e-01

Pos logit contributions
 entire+0.099
 pack+0.082
ta+0.080
boxing+0.079
Y+0.078
Neg logit contributions
orting-0.132
 conclusions-0.128
 course-0.127
ion-0.125
bs-0.125
Token new
Token ID649
Feature activation+8.92e-01

Pos logit contributions
 entire+0.004
 un+0.004
 pack+0.003
 opened+0.003
ate+0.003
Neg logit contributions
 conclusions-0.006
orting-0.006
 hawk-0.006
 course-0.006
 mosquitoes-0.006
Token toy
Token ID13373
Feature activation+8.47e-02

Pos logit contributions
 entire+0.018
 un+0.017
 opened+0.016
 pack+0.015
 unlocked+0.015
Neg logit contributions
 conclusions-0.026
orting-0.026
 course-0.026
 hawk-0.025
 mosquitoes-0.025
Token car
Token ID1097
Feature activation+1.87e-01

Pos logit contributions
 entire+0.002
 un+0.002
 pack+0.002
 open+0.002
 was+0.002
Neg logit contributions
orting-0.003
 conclusions-0.003
 mosquitoes-0.003
aro-0.003
ican-0.003
Token model
Token ID2746
Feature activation+2.84e-01

Pos logit contributions
 entire+0.004
 un+0.004
 pack+0.004
 whole+0.003
 opened+0.003
Neg logit contributions
orting-0.006
 conclusions-0.005
aro-0.005
 course-0.005
 mosquitoes-0.005
Token soon
Token ID2582
Feature activation+5.38e-01

Pos logit contributions
orting+0.007
 conclusions+0.007
 mosquitoes+0.007
 course+0.007
aro+0.007
Neg logit contributions
 entire-0.004
 un-0.004
 pack-0.004
 open-0.004
 opened-0.004
Token he
Token ID339
Feature activation+1.49e+00

Pos logit contributions
 entire+0.013
 pack+0.012
 un+0.012
pack+0.012
load+0.011
Neg logit contributions
orting-0.015
 course-0.014
 mosquitoes-0.014
 conclusions-0.014
aro-0.014
Token made
Token ID925
Feature activation+1.75e+00

Pos logit contributions
 pack+0.020
 entire+0.019
 un+0.017
pack+0.015
load+0.015
Neg logit contributions
 course-0.054
ede-0.051
orting-0.050
 mosquitoes-0.050
bs-0.050
Token the
Token ID262
Feature activation+2.27e+00

Pos logit contributions
 entire+0.032
 pack+0.031
 un+0.029
 opened+0.028
 packed+0.028
Neg logit contributions
 mosquitoes-0.055
 conclusions-0.054
 course-0.052
 caution-0.052
orting-0.051
Token book
Token ID1492
Feature activation+2.54e+00

Pos logit contributions
 un+0.046
 entire+0.044
 pack+0.044
 opened+0.042
 open+0.041
Neg logit contributions
 conclusions-0.069
orting-0.068
 mosquitoes-0.067
 course-0.066
 hawk-0.064
Tokencase
Token ID7442
Feature activation+4.66e+00

Pos logit contributions
 entire+0.046
 pack+0.042
 packed+0.041
ta+0.041
load+0.041
Neg logit contributions
orting-0.072
 conclusions-0.069
 anymore-0.069
bs-0.067
 course-0.066
Token taller
Token ID25242
Feature activation+5.98e-01

Pos logit contributions
ta+0.088
 entire+0.087
load+0.087
 Riding+0.082
 pack+0.081
Neg logit contributions
 anymore-0.124
 conclusions-0.115
orting-0.111
 course-0.110
 injuries-0.106
Token!
Token ID0
Feature activation+8.97e-01

Pos logit contributions
 entire+0.012
 pack+0.012
 un+0.011
 opened+0.011
 open+0.010
Neg logit contributions
orting-0.019
 conclusions-0.018
 mosquitoes-0.018
 course-0.018
aro-0.017
Token His
Token ID2399
Feature activation+1.64e+00

Pos logit contributions
 entire+0.017
 un+0.017
 pack+0.017
 open+0.016
 opened+0.015
Neg logit contributions
orting-0.029
 conclusions-0.028
 course-0.027
aro-0.026
 mosquitoes-0.026
Token family
Token ID1641
Feature activation+1.39e+00

Pos logit contributions
 un+0.035
 entire+0.035
 pack+0.034
load+0.032
 opened+0.032
Neg logit contributions
 conclusions-0.045
 mosquitoes-0.045
 course-0.044
orting-0.043
 hawk-0.042
Token cheered
Token ID34767
Feature activation+1.83e+00

Pos logit contributions
 entire+0.021
 pack+0.020
 un+0.018
 whole+0.016
 open+0.016
Neg logit contributions
 course-0.051
orting-0.046
aro-0.045
 mosquitoes-0.044
bs-0.044
Token table
Token ID3084
Feature activation+1.03e+00

Pos logit contributions
 entire+0.011
 un+0.011
 pack+0.010
 opened+0.010
 open+0.010
Neg logit contributions
 conclusions-0.025
orting-0.024
 mosquitoes-0.024
aro-0.023
 course-0.023
Token and
Token ID290
Feature activation+3.95e-01

Pos logit contributions
 entire+0.019
 un+0.018
 pack+0.017
 whole+0.016
 open+0.015
Neg logit contributions
orting-0.033
 course-0.031
 mosquitoes-0.031
bs-0.030
aro-0.030
Token the
Token ID262
Feature activation+1.73e+00

Pos logit contributions
 entire+0.007
 whole+0.006
 pack+0.006
 un+0.006
load+0.005
Neg logit contributions
 mosquitoes-0.013
bs-0.012
aro-0.012
orting-0.012
 conclusions-0.012
Token daddy
Token ID35822
Feature activation+1.73e+00

Pos logit contributions
 entire+0.033
 un+0.033
 pack+0.031
 opened+0.030
 unlocked+0.030
Neg logit contributions
 mosquitoes-0.055
 conclusions-0.055
orting-0.053
 course-0.051
 hawk-0.051
Token got
Token ID1392
Feature activation+2.11e+00

Pos logit contributions
 entire+0.036
 whole+0.028
 pack+0.027
ate+0.026
 un+0.026
Neg logit contributions
bs-0.051
orting-0.047
 course-0.046
ip-0.046
aro-0.045
Token out
Token ID503
Feature activation+4.58e+00

Pos logit contributions
 entire+0.044
 Riding+0.037
 pack+0.036
 unlocked+0.035
 un+0.034
Neg logit contributions
 mosquitoes-0.061
bs-0.059
orting-0.057
aro-0.056
 caution-0.055
Token a
Token ID257
Feature activation+1.14e+00

Pos logit contributions
 entire+0.093
 pack+0.087
 un+0.085
pack+0.081
load+0.080
Neg logit contributions
 mosquitoes-0.128
orting-0.127
 course-0.126
ion-0.125
aro-0.122
Token score
Token ID4776
Feature activation+4.59e-01

Pos logit contributions
 entire+0.019
 un+0.018
 pack+0.017
 open+0.016
 opened+0.016
Neg logit contributions
 conclusions-0.039
orting-0.039
 mosquitoes-0.038
 hawk-0.037
 course-0.037
Token.
Token ID13
Feature activation+8.14e-01

Pos logit contributions
 entire+0.008
 un+0.008
 pack+0.008
 open+0.007
 opened+0.007
Neg logit contributions
orting-0.015
 conclusions-0.015
 mosquitoes-0.015
aro-0.014
 course-0.014
Token Everyone
Token ID11075
Feature activation+1.68e+00

Pos logit contributions
 entire+0.018
 un+0.015
 pack+0.015
 whole+0.015
 unlocked+0.015
Neg logit contributions
orting-0.024
 conclusions-0.022
 course-0.022
crow-0.021
bs-0.021
Token was
Token ID373
Feature activation+2.50e-01

Pos logit contributions
 entire+0.029
 pack+0.026
 contents+0.024
 whole+0.023
 un+0.022
Neg logit contributions
aro-0.055
 course-0.055
bs-0.054
 mosquitoes-0.052
ion-0.050
Token all
Token ID477
Feature activation+2.19e+00

Pos logit contributions
 entire+0.035
pack+0.034
 pack+0.033
load+0.032
 un+0.031
Neg logit contributions
 mosquitoes-0.060
 caution-0.058
 course-0.057
orting-0.056
 hawk-0.055
Token colorful
Token ID20239
Feature activation+9.35e-01

Pos logit contributions
 pack+0.039
 entire+0.038
load+0.036
 un+0.035
pack+0.033
Neg logit contributions
 course-0.064
orting-0.062
 mosquitoes-0.060
 fault-0.058
 caution-0.058
Token and
Token ID290
Feature activation+1.89e+00

Pos logit contributions
 entire+0.016
 pack+0.015
 un+0.015
 opened+0.014
 open+0.014
Neg logit contributions
orting-0.032
 mosquitoes-0.031
 conclusions-0.031
 course-0.031
aro-0.030
Token made
Token ID925
Feature activation+1.47e+00

Pos logit contributions
 entire+0.035
 pack+0.032
pack+0.031
load+0.030
 un+0.029
Neg logit contributions
 course-0.057
 mosquitoes-0.056
orting-0.055
 caution-0.054
 conclusions-0.054
Token the
Token ID262
Feature activation+1.77e+00

Pos logit contributions
 entire+0.030
 pack+0.029
 un+0.028
 opened+0.027
 open+0.026
Neg logit contributions
orting-0.044
 course-0.043
 conclusions-0.043
 mosquitoes-0.043
 caution-0.040
Token house
Token ID2156
Feature activation+4.56e+00

Pos logit contributions
 entire+0.035
 un+0.032
 pack+0.031
 open+0.031
 was+0.030
Neg logit contributions
orting-0.059
 conclusions-0.058
 mosquitoes-0.057
aro-0.057
ican-0.055
Token look
Token ID804
Feature activation+3.48e+00

Pos logit contributions
 Riding+0.081
 entire+0.079
ta+0.078
load+0.076
ipt+0.074
Neg logit contributions
 conclusions-0.123
 anymore-0.122
orting-0.119
bs-0.118
 course-0.115
Token nice
Token ID3621
Feature activation+2.25e+00

Pos logit contributions
pack+0.072
 Riding+0.067
ipt+0.067
load+0.066
 pack+0.065
Neg logit contributions
 anymore-0.093
bs-0.088
orting-0.086
 unnecessary-0.086
 conclusions-0.083
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
 pack+0.055
load+0.054
 un+0.053
 entire+0.053
pack+0.052
Neg logit contributions
orting-0.056
 anymore-0.053
 course-0.051
 conclusions-0.051
 mosquitoes-0.051
Token Billy
Token ID15890
Feature activation+1.62e+00

Pos logit contributions
 entire+0.023
 pack+0.021
 un+0.021
 open+0.020
 opened+0.020
Neg logit contributions
orting-0.032
 conclusions-0.031
 course-0.030
 mosquitoes-0.029
bs-0.029
Token was
Token ID373
Feature activation-2.14e-02

Pos logit contributions
 entire+0.026
 pack+0.024
 un+0.022
 whole+0.020
pack+0.020
Neg logit contributions
 course-0.052
orting-0.051
bs-0.050
 mosquitoes-0.050
aro-0.049
Token.
Token ID13
Feature activation-2.35e-01

Pos logit contributions
 entire+0.022
 pack+0.020
 un+0.020
 whole+0.019
 open+0.018
Neg logit contributions
orting-0.037
 conclusions-0.036
aro-0.036
 mosquitoes-0.035
 course-0.035
Token The
Token ID383
Feature activation+2.05e+00

Pos logit contributions
orting+0.008
 conclusions+0.008
 mosquitoes+0.008
aro+0.008
ican+0.008
Neg logit contributions
 entire-0.004
 un-0.004
 pack-0.004
 open-0.004
 opened-0.003
Token decorations
Token ID39011
Feature activation+1.97e+00

Pos logit contributions
 entire+0.038
 un+0.038
 pack+0.037
 open+0.035
 opened+0.035
Neg logit contributions
 conclusions-0.065
 mosquitoes-0.065
orting-0.064
aro-0.062
 course-0.061
Token made
Token ID925
Feature activation+1.31e+00

Pos logit contributions
 entire+0.038
 un+0.035
 pack+0.034
 open+0.031
 whole+0.030
Neg logit contributions
orting-0.061
 course-0.061
aro-0.060
 mosquitoes-0.059
bs-0.059
Token the
Token ID262
Feature activation+2.36e+00

Pos logit contributions
 entire+0.023
 pack+0.021
 un+0.021
 opened+0.018
 unlocked+0.018
Neg logit contributions
 mosquitoes-0.044
orting-0.044
 conclusions-0.043
 course-0.042
aro-0.041
Token room
Token ID2119
Feature activation+4.53e+00

Pos logit contributions
 un+0.049
 entire+0.048
 pack+0.047
 opened+0.045
 unlocked+0.044
Neg logit contributions
 conclusions-0.071
 mosquitoes-0.070
orting-0.070
 course-0.066
ican-0.066
Token look
Token ID804
Feature activation+3.58e+00

Pos logit contributions
 entire+0.082
load+0.076
ta+0.075
 Riding+0.074
pack+0.070
Neg logit contributions
bs-0.125
orting-0.119
 conclusions-0.119
 anymore-0.116
 course-0.115
Token so
Token ID523
Feature activation+1.76e+00

Pos logit contributions
pack+0.076
 entire+0.069
 Riding+0.069
load+0.068
 pack+0.066
Neg logit contributions
 anymore-0.092
bs-0.092
orting-0.091
 conclusions-0.086
 mosquitoes-0.086
Token fun
Token ID1257
Feature activation+2.00e+00

Pos logit contributions
pack+0.035
 entire+0.033
 pack+0.033
 Riding+0.032
load+0.032
Neg logit contributions
bs-0.052
obb-0.047
orting-0.045
 mosquitoes-0.044
 caution-0.044
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
 entire+0.044
 pack+0.043
 un+0.043
load+0.042
 opened+0.039
Neg logit contributions
orting-0.055
 course-0.053
 mosquitoes-0.052
 conclusions-0.051
aro-0.049
Token colourful
Token ID44799
Feature activation+2.50e+00

Pos logit contributions
 entire+0.033
pack+0.030
 pack+0.029
load+0.028
 whole+0.026
Neg logit contributions
 mosquitoes-0.046
bs-0.045
 caution-0.044
orting-0.044
 conclusions-0.044
Token worked
Token ID3111
Feature activation+8.77e-01

Pos logit contributions
 entire+0.026
 pack+0.021
 whole+0.020
 un+0.020
pack+0.018
Neg logit contributions
bs-0.043
 course-0.042
aro-0.042
orting-0.041
 conclusions-0.040
Token hard
Token ID1327
Feature activation-1.10e-01

Pos logit contributions
 entire+0.020
 pack+0.019
 un+0.018
 opened+0.017
 open+0.017
Neg logit contributions
orting-0.025
 course-0.023
 mosquitoes-0.022
bs-0.022
aro-0.022
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
 conclusions+0.004
orting+0.004
 mosquitoes+0.004
aro+0.004
ican+0.004
Neg logit contributions
 entire-0.002
 un-0.002
 pack-0.002
 open-0.002
 was-0.002
Token make
Token ID787
Feature activation+1.49e+00

Pos logit contributions
 entire+0.022
 was+0.019
 opened+0.018
 whole+0.018
 pack+0.018
Neg logit contributions
 conclusions-0.043
orting-0.039
 course-0.039
aro-0.037
 mosquitoes-0.037
Token the
Token ID262
Feature activation+2.09e+00

Pos logit contributions
 entire+0.029
 pack+0.026
 opened+0.025
 un+0.025
 unlocked+0.024
Neg logit contributions
 conclusions-0.047
 mosquitoes-0.046
orting-0.045
 course-0.044
 caution-0.042
Token house
Token ID2156
Feature activation+4.52e+00

Pos logit contributions
 entire+0.043
 un+0.043
 pack+0.041
 open+0.040
 opened+0.039
Neg logit contributions
orting-0.064
 conclusions-0.064
 mosquitoes-0.063
aro-0.060
 course-0.060
Token look
Token ID804
Feature activation+3.84e+00

Pos logit contributions
 entire+0.081
ta+0.079
 Riding+0.078
athan+0.074
rev+0.072
Neg logit contributions
 conclusions-0.122
orting-0.121
 anymore-0.120
bs-0.117
 course-0.113
Token pretty
Token ID2495
Feature activation+3.76e+00

Pos logit contributions
pack+0.079
 Riding+0.074
ipt+0.073
 entire+0.073
boxing+0.072
Neg logit contributions
 anymore-0.104
orting-0.101
bs-0.100
 unnecessary-0.095
 conclusions-0.095
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
pack+0.095
load+0.092
 pack+0.090
 unlocked+0.090
boxing+0.087
Neg logit contributions
 anymore-0.086
orting-0.079
 conclusions-0.079
 mosquitoes-0.076
 nightmares-0.073
Token but
Token ID475
Feature activation-4.70e-02

Pos logit contributions
 entire+0.027
 pack+0.025
 un+0.025
 whole+0.023
pack+0.023
Neg logit contributions
orting-0.044
 conclusions-0.044
 mosquitoes-0.044
 course-0.042
aro-0.041
Token they
Token ID484
Feature activation+8.67e-01

Pos logit contributions
orting+0.001
 conclusions+0.001
 mosquitoes+0.001
 course+0.001
aro+0.001
Neg logit contributions
 entire-0.001
 un-0.001
 pack-0.001
 open-0.001
 whole-0.001
Token,
Token ID11
Feature activation-4.97e-01

Pos logit contributions
 entire+0.007
 un+0.007
 pack+0.007
 opened+0.006
 open+0.006
Neg logit contributions
orting-0.006
 mosquitoes-0.006
 conclusions-0.006
 course-0.006
aro-0.006
Token Lily
Token ID20037
Feature activation+7.24e-01

Pos logit contributions
orting+0.016
 conclusions+0.016
 mosquitoes+0.015
aro+0.015
ican+0.015
Neg logit contributions
 entire-0.010
 un-0.010
 pack-0.010
 open-0.009
 was-0.009
Token!
Token ID0
Feature activation+1.67e+00

Pos logit contributions
 entire+0.016
 un+0.016
 pack+0.015
 opened+0.014
 open+0.014
Neg logit contributions
orting-0.021
 mosquitoes-0.020
 conclusions-0.020
 course-0.020
 hawk-0.020
Token Your
Token ID3406
Feature activation+1.86e+00

Pos logit contributions
 entire+0.032
 un+0.031
 pack+0.030
 open+0.029
 opened+0.028
Neg logit contributions
orting-0.054
 conclusions-0.054
 mosquitoes-0.053
aro-0.051
ican-0.050
Token room
Token ID2119
Feature activation+4.01e+00

Pos logit contributions
 entire+0.028
 un+0.027
load+0.026
 unlocked+0.026
 opened+0.026
Neg logit contributions
 mosquitoes-0.062
 conclusions-0.060
orting-0.059
 nightmares-0.058
 fault-0.056
Token looks
Token ID3073
Feature activation+4.52e+00

Pos logit contributions
load+0.081
 pack+0.076
rily+0.074
 entire+0.073
 himself+0.073
Neg logit contributions
 course-0.109
orting-0.100
 mosquitoes-0.097
 wrong-0.097
chairs-0.097
Token great
Token ID1049
Feature activation+2.50e+00

Pos logit contributions
load+0.091
ipt+0.087
pack+0.086
 entire+0.085
 pack+0.084
Neg logit contributions
obb-0.118
orting-0.115
 anymore-0.114
 wrong-0.114
 mosquitoes-0.110
Token!
Token ID0
Feature activation+2.16e+00

Pos logit contributions
 un+0.057
 pack+0.057
 entire+0.056
load+0.054
 opened+0.051
Neg logit contributions
orting-0.066
 mosquitoes-0.062
 course-0.061
 anymore-0.059
bs-0.058
Token You
Token ID921
Feature activation+1.27e+00

Pos logit contributions
 entire+0.047
 un+0.044
 pack+0.044
 whole+0.042
 opened+0.041
Neg logit contributions
orting-0.065
 conclusions-0.062
 mosquitoes-0.061
aro-0.059
 course-0.058
Token're
Token ID821
Feature activation+1.32e+00

Pos logit contributions
 entire+0.029
 pack+0.027
load+0.026
rily+0.026
 un+0.026
Neg logit contributions
orting-0.032
 course-0.031
bs-0.031
 mosquitoes-0.029
 conclusions-0.029
Token so
Token ID523
Feature activation+1.39e+00

Pos logit contributions
load+0.030
rily+0.028
 pack+0.027
acked+0.025
athan+0.025
Neg logit contributions
 mosquitoes-0.034
 earthquake-0.033
 nightmares-0.031
 wrong-0.031
 course-0.030
Token saw
Token ID2497
Feature activation+7.98e-01

Pos logit contributions
 entire+0.003
 pack+0.003
 un+0.003
 whole+0.003
 open+0.003
Neg logit contributions
orting-0.006
 conclusions-0.005
bs-0.005
 mosquitoes-0.005
 course-0.005
Token that
Token ID326
Feature activation+1.28e+00

Pos logit contributions
 un+0.011
 was+0.011
 entire+0.011
 open+0.010
 pack+0.010
Neg logit contributions
orting-0.031
 conclusions-0.030
 mosquitoes-0.029
ican-0.029
aro-0.029
Token his
Token ID465
Feature activation+1.06e+00

Pos logit contributions
 entire+0.018
 un+0.016
 pack+0.016
 opened+0.015
 open+0.015
Neg logit contributions
 conclusions-0.048
orting-0.048
 mosquitoes-0.048
aro-0.046
 course-0.046
Token room
Token ID2119
Feature activation+3.63e+00

Pos logit contributions
 entire+0.017
 un+0.017
 pack+0.016
 open+0.015
 was+0.015
Neg logit contributions
orting-0.038
 conclusions-0.038
 mosquitoes-0.037
aro-0.037
ican-0.036
Token was
Token ID373
Feature activation+3.58e+00

Pos logit contributions
 entire+0.047
load+0.046
 pack+0.043
efully+0.042
boxing+0.037
Neg logit contributions
 course-0.121
 anymore-0.118
orting-0.118
chairs-0.114
 mosquitoes-0.110
Token indeed
Token ID5600
Feature activation+4.52e+00

Pos logit contributions
 entire+0.059
load+0.058
 pack+0.055
 smells+0.053
 Riding+0.052
Neg logit contributions
 injuries-0.104
 mosquitoes-0.101
orting-0.099
obb-0.099
 course-0.098
Token tidy
Token ID43044
Feature activation+3.14e+00

Pos logit contributions
 Riding+0.084
load+0.084
boxing+0.081
efully+0.079
 entire+0.077
Neg logit contributions
 injuries-0.115
 anymore-0.115
obb-0.113
 wrong-0.113
 earthquake-0.108
Token.
Token ID13
Feature activation+3.10e-01

Pos logit contributions
 entire+0.075
load+0.073
 un+0.072
 pack+0.071
boxing+0.070
Neg logit contributions
 anymore-0.075
orting-0.075
 course-0.067
 fault-0.065
 conclusions-0.065
Token He
Token ID679
Feature activation+1.87e+00

Pos logit contributions
 entire+0.005
 un+0.005
 pack+0.004
 open+0.004
 opened+0.004
Neg logit contributions
orting-0.011
 conclusions-0.011
 mosquitoes-0.011
aro-0.011
 course-0.011
Token smiled
Token ID13541
Feature activation+2.90e+00

Pos logit contributions
 entire+0.042
 pack+0.038
 whole+0.037
pack+0.034
load+0.034
Neg logit contributions
bs-0.048
 course-0.048
orting-0.044
 mosquitoes-0.044
 caution-0.044
Token and
Token ID290
Feature activation-2.62e-01

Pos logit contributions
load+0.069
 entire+0.069
pack+0.066
 pack+0.063
boxing+0.062
Neg logit contributions
orting-0.071
 anymore-0.063
der-0.061
bs-0.060
 nightmares-0.059
Token parked
Token ID19584
Feature activation+1.57e+00

Pos logit contributions
 entire+0.019
 whole+0.015
 pack+0.015
 un+0.014
pack+0.013
Neg logit contributions
bs-0.027
 mosquitoes-0.027
 conclusions-0.026
 course-0.026
aro-0.026
Token the
Token ID262
Feature activation+1.86e+00

Pos logit contributions
 entire+0.028
 un+0.025
 pack+0.024
 whole+0.021
 opened+0.020
Neg logit contributions
 mosquitoes-0.052
orting-0.052
 conclusions-0.051
aro-0.051
 course-0.051
Token car
Token ID1097
Feature activation+7.32e-01

Pos logit contributions
 entire+0.042
 un+0.041
 pack+0.040
 was+0.039
 opened+0.038
Neg logit contributions
 conclusions-0.049
orting-0.049
aro-0.049
 mosquitoes-0.049
crow-0.046
Token and
Token ID290
Feature activation+1.35e+00

Pos logit contributions
 entire+0.013
 un+0.011
 pack+0.011
 whole+0.010
 open+0.009
Neg logit contributions
orting-0.025
 course-0.023
aro-0.023
 conclusions-0.023
 mosquitoes-0.023
Token got
Token ID1392
Feature activation+2.33e+00

Pos logit contributions
 entire+0.029
 whole+0.023
 pack+0.020
ate+0.019
pack+0.019
Neg logit contributions
bs-0.041
 mosquitoes-0.040
aro-0.039
 conclusions-0.039
 course-0.037
Token out
Token ID503
Feature activation+4.52e+00

Pos logit contributions
 entire+0.048
 pack+0.041
 whole+0.037
 Riding+0.037
 un+0.037
Neg logit contributions
 mosquitoes-0.072
aro-0.071
bs-0.068
 conclusions-0.065
orting-0.064
Token.
Token ID13
Feature activation+7.68e-01

Pos logit contributions
 entire+0.102
 pack+0.094
 un+0.090
pack+0.089
load+0.086
Neg logit contributions
aro-0.123
 mosquitoes-0.121
orting-0.117
 conclusions-0.117
ion-0.116
Token They
Token ID1119
Feature activation+8.58e-01

Pos logit contributions
 entire+0.009
 un+0.009
 pack+0.008
 open+0.008
 was+0.007
Neg logit contributions
orting-0.031
 conclusions-0.031
 mosquitoes-0.030
aro-0.030
ican-0.029
Token took
Token ID1718
Feature activation+8.60e-01

Pos logit contributions
 entire+0.017
 whole+0.013
 pack+0.012
ate+0.011
 un+0.011
Neg logit contributions
bs-0.028
aro-0.026
 mosquitoes-0.026
 course-0.026
 conclusions-0.026
Token their
Token ID511
Feature activation+7.80e-01

Pos logit contributions
 entire+0.018
 pack+0.016
 un+0.015
pack+0.015
 Riding+0.014
Neg logit contributions
bs-0.025
 mosquitoes-0.025
 conclusions-0.025
aro-0.024
crow-0.024
Token bags
Token ID11668
Feature activation+1.82e+00

Pos logit contributions
 entire+0.018
 un+0.017
 pack+0.017
 was+0.016
 open+0.016
Neg logit contributions
 conclusions-0.021
 mosquitoes-0.021
orting-0.021
aro-0.021
 hawk-0.020
Token,
Token ID11
Feature activation+4.80e-01

Pos logit contributions
 entire+0.008
 un+0.008
 pack+0.008
 open+0.007
 was+0.007
Neg logit contributions
orting-0.023
 conclusions-0.023
 mosquitoes-0.022
aro-0.022
ican-0.021
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
 entire+0.009
 un+0.008
 pack+0.008
 whole+0.008
 opened+0.008
Neg logit contributions
 mosquitoes-0.016
orting-0.015
 conclusions-0.015
 course-0.015
aro-0.014
Token it
Token ID340
Feature activation+1.75e+00

Pos logit contributions
 entire+0.024
 pack+0.021
 un+0.021
pack+0.020
 open+0.019
Neg logit contributions
 mosquitoes-0.048
 conclusions-0.047
orting-0.046
aro-0.046
 course-0.045
Token made
Token ID925
Feature activation+1.69e+00

Pos logit contributions
 entire+0.027
 pack+0.025
pack+0.023
 un+0.023
 whole+0.022
Neg logit contributions
 mosquitoes-0.061
aro-0.057
 hawk-0.056
bs-0.056
 course-0.055
Token the
Token ID262
Feature activation+1.19e+00

Pos logit contributions
 entire+0.034
 pack+0.031
 un+0.030
 opened+0.029
 unlocked+0.029
Neg logit contributions
 mosquitoes-0.052
 conclusions-0.052
orting-0.052
 course-0.050
aro-0.048
Token room
Token ID2119
Feature activation+4.51e+00

Pos logit contributions
 entire+0.027
 un+0.026
 pack+0.025
 open+0.025
 was+0.024
Neg logit contributions
orting-0.036
 conclusions-0.035
aro-0.034
 mosquitoes-0.034
ican-0.033
Token warm
Token ID5814
Feature activation+2.58e+00

Pos logit contributions
 entire+0.082
ta+0.079
 Riding+0.074
pack+0.072
load+0.072
Neg logit contributions
 conclusions-0.127
bs-0.124
 mosquitoes-0.123
orting-0.119
obb-0.118
Token and
Token ID290
Feature activation+1.75e+00

Pos logit contributions
 unlocked+0.061
 pack+0.061
pack+0.061
 un+0.060
 entire+0.060
Neg logit contributions
orting-0.068
 mosquitoes-0.065
 conclusions-0.064
 course-0.062
 anymore-0.060
Token cozy
Token ID37438
Feature activation+1.58e+00

Pos logit contributions
 entire+0.034
pack+0.030
 pack+0.028
ate+0.027
load+0.026
Neg logit contributions
 mosquitoes-0.056
 conclusions-0.052
bs-0.050
aro-0.050
 caution-0.050
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
 un+0.036
 entire+0.036
 pack+0.035
 unlocked+0.034
 opened+0.034
Neg logit contributions
 conclusions-0.043
orting-0.042
 mosquitoes-0.042
 hawk-0.038
 course-0.038
Token Grand
Token ID5675
Feature activation+1.18e+00

Pos logit contributions
 entire+0.026
 un+0.023
 pack+0.023
 open+0.022
 opened+0.022
Neg logit contributions
orting-0.047
 conclusions-0.045
 course-0.044
 mosquitoes-0.043
ittens-0.042
Token colorful
Token ID20239
Feature activation-2.80e-01

Pos logit contributions
 entire+0.010
 un+0.009
 pack+0.009
 open+0.009
 opened+0.009
Neg logit contributions
 conclusions-0.012
orting-0.012
 mosquitoes-0.012
aro-0.011
 course-0.011
Token books
Token ID3835
Feature activation+1.48e+00

Pos logit contributions
orting+0.008
 conclusions+0.008
 mosquitoes+0.008
aro+0.007
 course+0.007
Neg logit contributions
 entire-0.006
 un-0.006
 pack-0.006
 open-0.006
 opened-0.006
Token that
Token ID326
Feature activation+6.83e-01

Pos logit contributions
 entire+0.034
 pack+0.030
 un+0.030
 3+0.029
 whole+0.028
Neg logit contributions
orting-0.042
 course-0.038
aro-0.037
bs-0.037
ion-0.037
Token made
Token ID925
Feature activation+1.08e+00

Pos logit contributions
 entire+0.014
 pack+0.012
 un+0.012
 whole+0.011
pack+0.010
Neg logit contributions
orting-0.021
 mosquitoes-0.021
aro-0.021
 conclusions-0.020
 course-0.020
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
 entire+0.019
 un+0.017
 pack+0.017
 opened+0.016
 whole+0.015
Neg logit contributions
orting-0.037
 mosquitoes-0.036
 conclusions-0.036
aro-0.035
 course-0.035
Token room
Token ID2119
Feature activation+4.50e+00

Pos logit contributions
 un+0.034
 entire+0.033
 pack+0.032
 opened+0.031
 open+0.030
Neg logit contributions
 conclusions-0.054
orting-0.054
 mosquitoes-0.054
 course-0.051
 hawk-0.050
Token look
Token ID804
Feature activation+3.37e+00

Pos logit contributions
 entire+0.080
efully+0.073
load+0.069
ta+0.068
die+0.068
Neg logit contributions
bs-0.127
 anymore-0.124
 conclusions-0.122
orting-0.118
 mosquitoes-0.118
Token beautiful
Token ID4950
Feature activation+3.18e+00

Pos logit contributions
pack+0.063
 entire+0.063
load+0.056
 pack+0.056
ipt+0.055
Neg logit contributions
 mosquitoes-0.092
bs-0.092
 anymore-0.092
orting-0.091
 conclusions-0.089
Token.
Token ID13
Feature activation+1.34e-01

Pos logit contributions
 pack+0.084
load+0.082
pack+0.082
 entire+0.081
 un+0.080
Neg logit contributions
 anymore-0.070
 mosquitoes-0.066
orting-0.064
 nightmares-0.061
 conclusions-0.060
Token
Token ID198
Feature activation+3.43e-01

Pos logit contributions
 entire+0.002
 un+0.002
 pack+0.002
 open+0.002
 whole+0.002
Neg logit contributions
orting-0.005
 conclusions-0.004
 mosquitoes-0.004
aro-0.004
 course-0.004
Token
Token ID198
Feature activation+6.57e-01

Pos logit contributions
 entire+0.005
 un+0.005
 pack+0.004
 was+0.004
 open+0.004
Neg logit contributions
orting-0.013
 conclusions-0.013
 mosquitoes-0.013
aro-0.012
ican-0.012
Token it
Token ID340
Feature activation+1.92e+00

Pos logit contributions
 entire+0.018
 pack+0.017
load+0.016
 un+0.016
pack+0.015
Neg logit contributions
 conclusions-0.041
 course-0.041
 mosquitoes-0.039
orting-0.039
 violence-0.039
Token!
Token ID0
Feature activation+8.05e-01

Pos logit contributions
 un+0.045
 pack+0.044
 entire+0.044
load+0.041
 opened+0.041
Neg logit contributions
orting-0.050
 course-0.047
 anymore-0.045
 conclusions-0.043
 mosquitoes-0.043
Token You
Token ID921
Feature activation+5.53e-01

Pos logit contributions
 entire+0.017
 un+0.016
 pack+0.016
 open+0.015
 whole+0.015
Neg logit contributions
orting-0.025
 conclusions-0.024
aro-0.024
 course-0.023
 mosquitoes-0.023
Token made
Token ID925
Feature activation+8.47e-01

Pos logit contributions
 entire+0.012
 pack+0.011
 un+0.011
load+0.010
 open+0.010
Neg logit contributions
 course-0.016
orting-0.015
bs-0.015
 conclusions-0.015
 mosquitoes-0.014
Token your
Token ID534
Feature activation+1.17e+00

Pos logit contributions
 entire+0.016
 pack+0.015
 opened+0.015
 un+0.014
 packed+0.014
Neg logit contributions
 conclusions-0.027
 mosquitoes-0.027
 course-0.026
orting-0.026
 caution-0.025
Token house
Token ID2156
Feature activation+4.48e+00

Pos logit contributions
 un+0.025
 entire+0.025
 pack+0.025
 opened+0.023
 open+0.023
Neg logit contributions
 conclusions-0.033
orting-0.033
 mosquitoes-0.032
 course-0.031
ican-0.030
Token successful
Token ID4388
Feature activation+1.30e+00

Pos logit contributions
 Riding+0.073
load+0.072
ta+0.072
 pack+0.071
athan+0.071
Neg logit contributions
 anymore-0.124
 course-0.123
 conclusions-0.119
orting-0.119
bs-0.113
Token."
Token ID526
Feature activation-5.52e-01

Pos logit contributions
 un+0.025
 entire+0.025
 pack+0.025
 opened+0.023
 open+0.023
Neg logit contributions
orting-0.039
 conclusions-0.039
 mosquitoes-0.039
 course-0.038
aro-0.036
Token
Token ID198
Feature activation+5.87e-02

Pos logit contributions
 conclusions+0.020
orting+0.020
 mosquitoes+0.020
aro+0.019
 hawk+0.019
Neg logit contributions
 un-0.008
 entire-0.008
 pack-0.008
 was-0.008
 open-0.007
Token
Token ID198
Feature activation-8.54e-02

Pos logit contributions
 entire+0.001
 un+0.001
 was+0.001
 pack+0.001
 open+0.001
Neg logit contributions
orting-0.002
 conclusions-0.002
 mosquitoes-0.002
ican-0.002
aro-0.002
TokenJoe
Token ID19585
Feature activation+1.62e+00

Pos logit contributions
orting+0.003
 conclusions+0.003
 mosquitoes+0.003
aro+0.003
 course+0.002
Neg logit contributions
 entire-0.002
 un-0.002
 pack-0.002
 open-0.002
 opened-0.002
Token our
Token ID674
Feature activation+1.17e+00

Pos logit contributions
 entire+0.051
 un+0.046
 pack+0.046
 opened+0.045
 unlocked+0.043
Neg logit contributions
orting-0.086
 conclusions-0.084
 course-0.083
 mosquitoes-0.081
aro-0.080
Token stuff
Token ID3404
Feature activation+1.62e+00

Pos logit contributions
 entire+0.022
 pack+0.020
 un+0.019
 opened+0.019
 was+0.019
Neg logit contributions
 conclusions-0.036
orting-0.036
 mosquitoes-0.035
 course-0.033
aro-0.033
Token and
Token ID290
Feature activation+2.22e+00

Pos logit contributions
 entire+0.042
 un+0.040
 pack+0.039
 opened+0.037
 whole+0.037
Neg logit contributions
orting-0.039
 course-0.035
 conclusions-0.035
aro-0.035
 mosquitoes-0.034
Token make
Token ID787
Feature activation+1.19e+00

Pos logit contributions
 entire+0.045
 whole+0.035
rily+0.033
 3+0.032
acked+0.032
Neg logit contributions
 conclusions-0.067
bs-0.063
 mosquitoes-0.062
 caution-0.062
orting-0.061
Token our
Token ID674
Feature activation+1.60e+00

Pos logit contributions
 entire+0.022
 opened+0.020
 pack+0.019
 unlocked+0.019
 packed+0.018
Neg logit contributions
 conclusions-0.039
 mosquitoes-0.039
orting-0.037
 course-0.036
aro-0.035
Token room
Token ID2119
Feature activation+4.47e+00

Pos logit contributions
 entire+0.030
 opened+0.028
 un+0.027
 unlocked+0.027
 pack+0.027
Neg logit contributions
 conclusions-0.050
 mosquitoes-0.049
orting-0.048
aro-0.045
 course-0.045
Token nice
Token ID3621
Feature activation+2.50e+00

Pos logit contributions
 Riding+0.092
 entire+0.087
ta+0.082
ipt+0.080
boxing+0.079
Neg logit contributions
bs-0.109
orting-0.109
 anymore-0.108
 conclusions-0.108
 fault-0.104
Token,"
Token ID553
Feature activation+1.21e+00

Pos logit contributions
 entire+0.064
 un+0.061
 pack+0.060
 unlocked+0.059
 opened+0.059
Neg logit contributions
orting-0.058
 mosquitoes-0.058
 conclusions-0.057
 course-0.052
 nightmares-0.052
Token Tim
Token ID5045
Feature activation+2.92e-02

Pos logit contributions
 entire+0.027
 pack+0.025
 un+0.025
 whole+0.023
 open+0.023
Neg logit contributions
orting-0.036
 conclusions-0.036
 mosquitoes-0.036
aro-0.035
 course-0.034
Token said
Token ID531
Feature activation+3.52e-01

Pos logit contributions
 entire+0.001
 whole+0.001
 pack+0.001
 un+0.001
 open+0.001
Neg logit contributions
orting-0.001
bs-0.001
 course-0.001
 mosquitoes-0.001
crow-0.001
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
 entire+0.008
 un+0.007
 pack+0.007
 whole+0.007
 was+0.007
Neg logit contributions
orting-0.010
 conclusions-0.010
bs-0.010
aro-0.010
 mosquitoes-0.010
Token candles
Token ID32268
Feature activation+1.45e+00

Pos logit contributions
orting+0.035
 conclusions+0.034
aro+0.033
ican+0.032
 hawk+0.032
Neg logit contributions
 un-0.013
 entire-0.012
 pack-0.012
 was-0.011
 open-0.011
Token because
Token ID780
Feature activation+1.98e+00

Pos logit contributions
 entire+0.033
 un+0.031
 pack+0.029
 unlocked+0.029
 opened+0.029
Neg logit contributions
orting-0.042
 course-0.039
 mosquitoes-0.038
 conclusions-0.037
orns-0.036
Token they
Token ID484
Feature activation+2.16e+00

Pos logit contributions
 entire+0.056
pack+0.050
 pack+0.049
 whole+0.047
 un+0.047
Neg logit contributions
 mosquitoes-0.046
aro-0.045
 course-0.043
 hawk-0.041
ion-0.040
Token made
Token ID925
Feature activation+2.05e+00

Pos logit contributions
 entire+0.051
pack+0.045
 pack+0.045
load+0.044
 un+0.043
Neg logit contributions
bs-0.053
 course-0.052
 conclusions-0.047
 mosquitoes-0.047
leness-0.046
Token his
Token ID465
Feature activation+1.39e+00

Pos logit contributions
 entire+0.046
 unlocked+0.042
 pack+0.040
 un+0.040
 opened+0.039
Neg logit contributions
 mosquitoes-0.057
 conclusions-0.056
orting-0.054
 course-0.053
aro-0.049
Token room
Token ID2119
Feature activation+4.46e+00

Pos logit contributions
 entire+0.025
 un+0.025
 pack+0.024
 open+0.023
 opened+0.022
Neg logit contributions
 conclusions-0.046
orting-0.046
 mosquitoes-0.045
aro-0.043
 course-0.043
Token smell
Token ID8508
Feature activation+2.20e+00

Pos logit contributions
 entire+0.088
ta+0.088
ipt+0.083
 Riding+0.082
efully+0.081
Neg logit contributions
 anymore-0.110
bs-0.105
orting-0.102
 nightmares-0.100
 chores-0.100
Token nice
Token ID3621
Feature activation+1.66e+00

Pos logit contributions
 entire+0.043
pack+0.039
 un+0.039
 unlocked+0.039
 pack+0.039
Neg logit contributions
 mosquitoes-0.068
orting-0.068
 conclusions-0.067
 course-0.065
bs-0.062
Token.
Token ID13
Feature activation-8.38e-01

Pos logit contributions
 entire+0.037
 un+0.035
 pack+0.035
 unlocked+0.034
 opened+0.033
Neg logit contributions
orting-0.047
 mosquitoes-0.044
 conclusions-0.043
 course-0.042
bs-0.042
Token One
Token ID1881
Feature activation+1.05e+00

Pos logit contributions
 conclusions+0.031
 mosquitoes+0.030
orting+0.030
aro+0.029
 hawk+0.029
Neg logit contributions
 un-0.013
 was-0.013
 entire-0.012
 pack-0.012
 open-0.011
Token day
Token ID1110
Feature activation-7.10e-01

Pos logit contributions
 un+0.033
 entire+0.033
 unlocked+0.032
 pack+0.032
load+0.031
Neg logit contributions
 mosquitoes-0.019
 conclusions-0.017
 course-0.017
orting-0.016
bs-0.016
Token
Token ID8151
Feature activation-2.44e+00

Pos logit contributions
orting+0.018
 conclusions+0.017
 mosquitoes+0.017
bs+0.016
 course+0.016
Neg logit contributions
 entire-0.013
 un-0.012
 pack-0.012
 open-0.012
 opened-0.011
Tokens
Token ID82
Feature activation+3.20e-02

Pos logit contributions
 conclusions+0.084
 mosquitoes+0.081
orting+0.080
 hawk+0.078
ican+0.077
Neg logit contributions
 entire-0.044
 was-0.044
 un-0.043
 pack-0.041
 opened-0.041
Token artwork
Token ID16257
Feature activation+1.32e+00

Pos logit contributions
 entire+0.001
 un+0.001
 was+0.001
 open+0.001
 pack+0.001
Neg logit contributions
orting-0.001
 conclusions-0.001
aro-0.001
 mosquitoes-0.001
ican-0.001
Token made
Token ID925
Feature activation+9.62e-01

Pos logit contributions
 entire+0.016
 un+0.014
 pack+0.014
 open+0.012
 whole+0.011
Neg logit contributions
orting-0.052
 mosquitoes-0.051
 conclusions-0.050
 course-0.050
aro-0.049
Token the
Token ID262
Feature activation+1.55e+00

Pos logit contributions
 entire+0.020
 pack+0.018
 un+0.018
 opened+0.017
 open+0.017
Neg logit contributions
orting-0.030
 mosquitoes-0.029
 conclusions-0.029
 course-0.028
aro-0.027
Token house
Token ID2156
Feature activation+4.45e+00

Pos logit contributions
 entire+0.031
 un+0.029
 pack+0.028
 open+0.028
 whole+0.027
Neg logit contributions
orting-0.051
 conclusions-0.051
aro-0.050
 mosquitoes-0.049
 hawk-0.048
Token grow
Token ID1663
Feature activation+2.89e+00

Pos logit contributions
 entire+0.080
ta+0.076
load+0.073
 Riding+0.071
pack+0.070
Neg logit contributions
orting-0.124
 conclusions-0.124
bs-0.122
 course-0.120
 mosquitoes-0.118
Token brighter
Token ID26083
Feature activation+9.84e-01

Pos logit contributions
 pack+0.055
 un+0.053
 entire+0.052
pack+0.051
load+0.047
Neg logit contributions
orting-0.090
 mosquitoes-0.086
 conclusions-0.084
bs-0.081
obb-0.081
Token.
Token ID13
Feature activation-8.45e-01

Pos logit contributions
 entire+0.018
 un+0.017
 pack+0.017
 unlocked+0.016
pack+0.016
Neg logit contributions
 conclusions-0.031
orting-0.031
 mosquitoes-0.030
bs-0.028
 hawk-0.028
Token
Token ID198
Feature activation-6.61e-01

Pos logit contributions
 conclusions+0.025
orting+0.025
 mosquitoes+0.024
aro+0.024
ican+0.023
Neg logit contributions
 entire-0.019
 un-0.018
 pack-0.018
 open-0.017
 was-0.017
Token
Token ID198
Feature activation-4.50e-01

Pos logit contributions
 conclusions+0.026
 mosquitoes+0.026
orting+0.026
ican+0.025
aro+0.025
Neg logit contributions
 un-0.008
 entire-0.008
 was-0.008
 pack-0.008
 open-0.007
Token they
Token ID484
Feature activation+2.12e+00

Pos logit contributions
 entire+0.012
 un+0.010
 pack+0.010
 whole+0.010
 opened+0.010
Neg logit contributions
 mosquitoes-0.019
 conclusions-0.019
orting-0.018
aro-0.017
 hawk-0.017
Token wanted
Token ID2227
Feature activation+2.68e-01

Pos logit contributions
 entire+0.045
 pack+0.042
ate+0.041
load+0.041
 un+0.040
Neg logit contributions
bs-0.059
 mosquitoes-0.055
orting-0.054
 conclusions-0.052
 course-0.052
Token to
Token ID284
Feature activation+1.93e+00

Pos logit contributions
 entire+0.006
 pack+0.005
 un+0.005
 whole+0.005
 opened+0.005
Neg logit contributions
 conclusions-0.008
 mosquitoes-0.008
orting-0.008
 course-0.007
aro-0.007
Token make
Token ID787
Feature activation+1.01e+00

Pos logit contributions
 entire+0.038
 whole+0.031
 un+0.030
 unlocked+0.030
 opened+0.030
Neg logit contributions
 conclusions-0.061
orting-0.057
 course-0.057
 mosquitoes-0.056
 hawk-0.053
Token the
Token ID262
Feature activation+2.54e+00

Pos logit contributions
 entire+0.018
 pack+0.016
 opened+0.016
 unlocked+0.015
 un+0.015
Neg logit contributions
 conclusions-0.035
 mosquitoes-0.035
orting-0.034
 course-0.032
 caution-0.031
Token house
Token ID2156
Feature activation+4.43e+00

Pos logit contributions
 entire+0.045
 un+0.042
 pack+0.041
 open+0.040
 whole+0.038
Neg logit contributions
orting-0.090
 conclusions-0.089
aro-0.087
 mosquitoes-0.087
ican-0.084
Token even
Token ID772
Feature activation+2.28e+00

Pos logit contributions
athan+0.085
ta+0.084
die+0.083
 Riding+0.083
 entire+0.080
Neg logit contributions
orting-0.115
 anymore-0.115
 conclusions-0.113
bs-0.110
 mosquitoes-0.109
Token more
Token ID517
Feature activation+1.88e+00

Pos logit contributions
 entire+0.054
 pack+0.048
 was+0.047
 whole+0.047
 un+0.046
Neg logit contributions
orting-0.059
 mosquitoes-0.059
 conclusions-0.058
bs-0.054
 unnecessary-0.052
Token beautiful
Token ID4950
Feature activation+2.15e+00

Pos logit contributions
 entire+0.033
 pack+0.032
 opened+0.028
 whole+0.028
load+0.028
Neg logit contributions
 mosquitoes-0.061
orting-0.059
 conclusions-0.058
bs-0.056
aro-0.055
Token.
Token ID13
Feature activation-2.76e-01

Pos logit contributions
 pack+0.048
 entire+0.047
 un+0.045
 unlocked+0.044
 opened+0.044
Neg logit contributions
 mosquitoes-0.060
 conclusions-0.059
orting-0.057
aro-0.054
 course-0.054
Token So
Token ID1406
Feature activation+3.49e-01

Pos logit contributions
orting+0.009
 conclusions+0.009
crow+0.008
ican+0.008
 course+0.008
Neg logit contributions
 entire-0.005
 un-0.005
 pack-0.005
 whole-0.005
 open-0.004
Token the
Token ID262
Feature activation+9.57e-01

Pos logit contributions
 entire+0.018
 un+0.017
 pack+0.016
 open+0.015
 opened+0.014
Neg logit contributions
orting-0.055
 conclusions-0.054
 mosquitoes-0.053
aro-0.052
 course-0.052
Token office
Token ID2607
Feature activation-3.02e-01

Pos logit contributions
 entire+0.024
 un+0.023
 pack+0.023
 opened+0.022
 open+0.022
Neg logit contributions
orting-0.025
 conclusions-0.024
 mosquitoes-0.023
 course-0.023
ican-0.022
Token,
Token ID11
Feature activation+2.38e+00

Pos logit contributions
orting+0.008
 conclusions+0.008
 mosquitoes+0.008
aro+0.007
 course+0.007
Neg logit contributions
 entire-0.008
 un-0.007
 pack-0.007
 open-0.007
 whole-0.007
Token they
Token ID484
Feature activation+7.57e-01

Pos logit contributions
 entire+0.060
pack+0.047
 un+0.046
boxing+0.046
 whole+0.045
Neg logit contributions
 mosquitoes-0.063
orting-0.060
 hawk-0.059
aro-0.058
bs-0.057
Token filled
Token ID5901
Feature activation+3.07e+00

Pos logit contributions
 entire+0.016
 whole+0.012
 contents+0.012
 pack+0.012
 un+0.012
Neg logit contributions
bs-0.023
 course-0.022
orting-0.022
 mosquitoes-0.021
aro-0.021
Token out
Token ID503
Feature activation+4.42e+00

Pos logit contributions
 entire+0.079
'd+0.064
 pack+0.063
 Riding+0.062
 unlocked+0.062
Neg logit contributions
 mosquitoes-0.070
 course-0.065
aro-0.063
ion-0.063
orting-0.062
Token some
Token ID617
Feature activation+1.47e+00

Pos logit contributions
 entire+0.102
 Riding+0.094
 pack+0.093
boxing+0.090
pack+0.087
Neg logit contributions
ion-0.107
 mosquitoes-0.107
 conclusions-0.107
orting-0.105
aro-0.103
Token forms
Token ID5107
Feature activation+2.56e+00

Pos logit contributions
 entire+0.030
 un+0.028
 pack+0.028
 opened+0.027
 was+0.026
Neg logit contributions
 conclusions-0.046
orting-0.045
 mosquitoes-0.045
 course-0.042
aro-0.042
Token.
Token ID13
Feature activation+1.02e+00

Pos logit contributions
 entire+0.053
 pack+0.053
 un+0.050
Y+0.046
boxing+0.045
Neg logit contributions
orting-0.070
ion-0.067
 course-0.063
aro-0.062
bs-0.061
Token Then
Token ID3244
Feature activation+1.09e+00

Pos logit contributions
 entire+0.023
 whole+0.021
 pack+0.020
 un+0.019
 was+0.019
Neg logit contributions
orting-0.028
 conclusions-0.027
 mosquitoes-0.026
ican-0.026
crow-0.025
Token they
Token ID484
Feature activation+8.24e-01

Pos logit contributions
 entire+0.033
 un+0.029
 pack+0.029
 whole+0.028
pack+0.027
Neg logit contributions
orting-0.025
 mosquitoes-0.024
 conclusions-0.023
aro-0.023
 course-0.023
Token light
Token ID1657
Feature activation+1.91e+00

Pos logit contributions
 entire+0.016
 un+0.016
 pack+0.015
 opened+0.014
 open+0.014
Neg logit contributions
orting-0.026
 conclusions-0.025
 mosquitoes-0.025
 course-0.024
 hawk-0.024
Token.
Token ID13
Feature activation+9.93e-01

Pos logit contributions
 entire+0.035
 un+0.034
 pack+0.032
 unlocked+0.029
 open+0.029
Neg logit contributions
orting-0.062
 mosquitoes-0.060
 conclusions-0.060
 course-0.059
 hawk-0.058
Token It
Token ID632
Feature activation+2.78e+00

Pos logit contributions
 entire+0.024
 un+0.021
 pack+0.021
 whole+0.020
 open+0.020
Neg logit contributions
orting-0.029
 conclusions-0.026
 mosquitoes-0.026
bs-0.025
ion-0.025
Token made
Token ID925
Feature activation+1.54e+00

Pos logit contributions
pack+0.045
load+0.045
 entire+0.042
 pack+0.042
boxing+0.039
Neg logit contributions
 course-0.084
 nightmares-0.079
bs-0.079
 hawk-0.078
 mosquitoes-0.078
Token the
Token ID262
Feature activation+1.93e+00

Pos logit contributions
 entire+0.030
 pack+0.028
 un+0.027
pack+0.026
 unlocked+0.026
Neg logit contributions
 mosquitoes-0.048
orting-0.047
 conclusions-0.047
 course-0.046
ion-0.044
Token room
Token ID2119
Feature activation+4.41e+00

Pos logit contributions
 un+0.034
 entire+0.033
 pack+0.032
 opened+0.031
 unlocked+0.030
Neg logit contributions
 conclusions-0.064
orting-0.063
 mosquitoes-0.062
 course-0.061
 hawk-0.060
Token shine
Token ID18340
Feature activation+3.24e+00

Pos logit contributions
load+0.085
 entire+0.082
ta+0.082
 Riding+0.079
rev+0.078
Neg logit contributions
bs-0.111
 anymore-0.109
orting-0.107
 nightmares-0.106
 conclusions-0.106
Token with
Token ID351
Feature activation+1.67e+00

Pos logit contributions
pack+0.070
load+0.069
 pack+0.067
 un+0.067
boxing+0.066
Neg logit contributions
orting-0.086
 anymore-0.085
 mosquitoes-0.079
 course-0.079
bs-0.076
Token stars
Token ID5788
Feature activation+1.46e+00

Pos logit contributions
 entire+0.028
 pack+0.026
 un+0.026
pack+0.024
 unlocked+0.023
Neg logit contributions
 conclusions-0.057
 mosquitoes-0.055
orting-0.055
 course-0.054
 caution-0.052
Token and
Token ID290
Feature activation+7.05e-01

Pos logit contributions
 entire+0.029
 un+0.029
 pack+0.028
pack+0.026
load+0.026
Neg logit contributions
orting-0.044
 mosquitoes-0.042
 conclusions-0.042
 course-0.041
 hawk-0.041
Token moons
Token ID31093
Feature activation+5.14e-01

Pos logit contributions
 entire+0.015
 un+0.015
 pack+0.014
 open+0.013
pack+0.013
Neg logit contributions
 conclusions-0.021
 mosquitoes-0.021
orting-0.020
aro-0.019
 course-0.019
Token been
Token ID587
Feature activation+4.09e-02

Pos logit contributions
load+0.004
 pack+0.004
 entire+0.004
rily+0.004
pack+0.004
Neg logit contributions
 course-0.007
 caution-0.007
 mosquitoes-0.007
 nightmares-0.006
 violence-0.006
Token clever
Token ID14169
Feature activation+6.66e-02

Pos logit contributions
 entire+0.001
 pack+0.001
load+0.001
 was+0.001
 un+0.001
Neg logit contributions
 mosquitoes-0.001
aro-0.001
 course-0.001
orting-0.001
 hesitation-0.001
Token and
Token ID290
Feature activation+1.02e+00

Pos logit contributions
 entire+0.001
 un+0.001
 pack+0.001
 opened+0.001
 whole+0.001
Neg logit contributions
orting-0.002
 mosquitoes-0.002
 conclusions-0.002
 course-0.002
 hawk-0.002
Token made
Token ID925
Feature activation+1.05e+00

Pos logit contributions
 entire+0.021
 pack+0.019
load+0.018
 whole+0.017
rily+0.016
Neg logit contributions
bs-0.030
 caution-0.027
 mosquitoes-0.027
 course-0.027
orting-0.026
Token the
Token ID262
Feature activation+1.66e+00

Pos logit contributions
 entire+0.020
 pack+0.019
 un+0.017
 packed+0.017
 opened+0.016
Neg logit contributions
 mosquitoes-0.034
 conclusions-0.032
orting-0.032
 course-0.031
 caution-0.031
Token room
Token ID2119
Feature activation+4.38e+00

Pos logit contributions
 entire+0.033
 un+0.032
 pack+0.031
 open+0.030
 was+0.029
Neg logit contributions
orting-0.054
 conclusions-0.054
 mosquitoes-0.052
aro-0.052
ican-0.051
Token tidy
Token ID43044
Feature activation+3.20e+00

Pos logit contributions
ta+0.079
load+0.077
 entire+0.077
athan+0.071
 Riding+0.071
Neg logit contributions
bs-0.120
orting-0.114
 anymore-0.112
 conclusions-0.111
 course-0.111
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
 pack+0.073
 entire+0.073
load+0.072
 un+0.071
 opened+0.069
Neg logit contributions
orting-0.079
 anymore-0.079
 conclusions-0.075
 course-0.075
 mosquitoes-0.072
Token Her
Token ID2332
Feature activation+1.37e+00

Pos logit contributions
orting+0.037
 conclusions+0.037
 mosquitoes+0.036
 hawk+0.035
ican+0.035
Neg logit contributions
 un-0.018
 entire-0.017
 was-0.017
 pack-0.016
 opened-0.016
Token mom
Token ID1995
Feature activation+1.09e+00

Pos logit contributions
 entire+0.030
 un+0.029
 pack+0.028
 open+0.026
 opened+0.026
Neg logit contributions
 mosquitoes-0.038
 conclusions-0.038
orting-0.038
 course-0.036
 hawk-0.036
Tokenmy
Token ID1820
Feature activation+7.67e-01

Pos logit contributions
 entire+0.033
 pack+0.030
 whole+0.029
 un+0.028
 open+0.027
Neg logit contributions
 course-0.024
orting-0.022
bs-0.019
orns-0.018
aro-0.018
Token animals
Token ID4695
Feature activation-1.49e+00

Pos logit contributions
 entire+0.016
 pack+0.015
 un+0.015
 opened+0.014
 whole+0.014
Neg logit contributions
orting-0.017
 conclusions-0.017
 mosquitoes-0.017
 course-0.017
orns-0.016
Token,
Token ID11
Feature activation-1.45e+00

Pos logit contributions
 conclusions+0.052
orting+0.052
 mosquitoes+0.051
aro+0.049
ican+0.049
Neg logit contributions
 un-0.025
 entire-0.025
 pack-0.024
 open-0.023
 was-0.023
Token like
Token ID588
Feature activation-1.06e+00

Pos logit contributions
orting+0.045
 conclusions+0.044
 mosquitoes+0.043
aro+0.042
ican+0.042
Neg logit contributions
 entire-0.031
 un-0.030
 pack-0.029
 open-0.028
 was-0.028
Token lions
Token ID31420
Feature activation-2.88e+00

Pos logit contributions
 mosquitoes+0.020
orting+0.020
 conclusions+0.020
aro+0.019
 course+0.019
Neg logit contributions
 entire-0.034
 un-0.033
 pack-0.032
 open-0.031
 opened-0.031
Token,
Token ID11
Feature activation-1.52e+00

Pos logit contributions
 conclusions+0.094
orting+0.090
 mosquitoes+0.089
aro+0.086
ican+0.086
Neg logit contributions
 un-0.058
 was-0.056
 entire-0.056
 open-0.055
 pack-0.054
Token ze
Token ID41271
Feature activation-6.08e+00

Pos logit contributions
orting+0.032
 conclusions+0.031
 mosquitoes+0.029
aro+0.028
ican+0.028
Neg logit contributions
 entire-0.047
 un-0.047
 pack-0.046
 was-0.045
 open-0.045
Tokenbr
Token ID1671
Feature activation-2.50e+00

Pos logit contributions
 conclusions+0.160
 narrative+0.141
 violence+0.135
 fault+0.135
 elbows+0.133
Neg logit contributions
 un-0.164
 open-0.160
 was-0.160
 opened-0.160
 with-0.155
Tokenas
Token ID292
Feature activation-2.27e+00

Pos logit contributions
 conclusions+0.052
orting+0.052
 mosquitoes+0.050
aro+0.047
ican+0.046
Neg logit contributions
 entire-0.078
 un-0.077
 pack-0.075
 open-0.073
 opened-0.072
Token,
Token ID11
Feature activation-1.50e+00

Pos logit contributions
 conclusions+0.085
 mosquitoes+0.081
orting+0.079
 unnecessary+0.077
 narrative+0.077
Neg logit contributions
 un-0.037
 was-0.035
 open-0.035
 pack-0.034
 entire-0.034
Token and
Token ID290
Feature activation-1.52e+00

Pos logit contributions
orting+0.058
 conclusions+0.058
 mosquitoes+0.057
aro+0.056
 course+0.055
Neg logit contributions
 entire-0.020
 un-0.019
 pack-0.018
 open-0.016
 opened-0.016
Token gir
Token ID37370
Feature activation-4.98e+00

Pos logit contributions
orting+0.034
 conclusions+0.032
 mosquitoes+0.031
aro+0.030
ican+0.030
Neg logit contributions
 un-0.045
 entire-0.045
 pack-0.044
 was-0.044
 open-0.043
Token,
Token ID11
Feature activation-9.35e-01

Pos logit contributions
 conclusions+0.092
orting+0.089
 mosquitoes+0.087
ican+0.085
aro+0.084
Neg logit contributions
 un-0.047
 was-0.046
 open-0.045
 pack-0.044
 entire-0.044
Token gir
Token ID37370
Feature activation-5.05e+00

Pos logit contributions
orting+0.031
 conclusions+0.031
 mosquitoes+0.030
aro+0.030
ican+0.029
Neg logit contributions
 entire-0.017
 un-0.017
 pack-0.016
 open-0.016
 opened-0.015
Tokenaff
Token ID2001
Feature activation-2.50e+00

Pos logit contributions
 conclusions+0.136
 narrative+0.123
 violence+0.119
 mosquitoes+0.113
 unnecessary+0.112
Neg logit contributions
 open-0.128
 was-0.125
 opened-0.124
 un-0.123
 entire-0.122
Tokenes
Token ID274
Feature activation-1.53e+00

Pos logit contributions
 conclusions+0.086
orting+0.083
 mosquitoes+0.079
ican+0.078
 narrative+0.076
Neg logit contributions
 un-0.045
 was-0.045
 entire-0.043
 opened-0.043
 open-0.042
Token and
Token ID290
Feature activation-1.58e+00

Pos logit contributions
 conclusions+0.059
orting+0.057
 mosquitoes+0.056
ican+0.055
aro+0.054
Neg logit contributions
 un-0.022
 was-0.022
 open-0.021
 entire-0.021
 pack-0.020
Token ze
Token ID41271
Feature activation-5.92e+00

Pos logit contributions
orting+0.042
 conclusions+0.039
ican+0.037
 mosquitoes+0.037
aro+0.037
Neg logit contributions
 un-0.042
 entire-0.042
 was-0.041
 pack-0.040
 open-0.040
Tokenbr
Token ID1671
Feature activation-2.37e+00

Pos logit contributions
 conclusions+0.146
 narrative+0.129
 injuries+0.124
 elbows+0.123
 violence+0.123
Neg logit contributions
 un-0.163
 open-0.160
 opened-0.159
 was-0.158
 with-0.153
Tokenas
Token ID292
Feature activation-8.39e-01

Pos logit contributions
orting+0.049
 conclusions+0.049
 mosquitoes+0.047
aro+0.044
ican+0.044
Neg logit contributions
 entire-0.074
 un-0.073
 pack-0.071
 open-0.069
 opened-0.069
Token.
Token ID13
Feature activation+2.01e-02

Pos logit contributions
 conclusions+0.031
orting+0.030
 mosquitoes+0.030
 hawk+0.029
aro+0.029
Neg logit contributions
 un-0.013
 entire-0.013
 pack-0.012
 open-0.012
 was-0.011
Token They
Token ID1119
Feature activation-4.83e-01

Pos logit contributions
 un+0.000
 entire+0.000
 pack+0.000
 was+0.000
 open+0.000
Neg logit contributions
orting-0.001
 conclusions-0.001
 mosquitoes-0.001
aro-0.001
 hawk-0.001
Token took
Token ID1718
Feature activation-6.37e-01

Pos logit contributions
orting+0.017
 conclusions+0.017
 mosquitoes+0.017
aro+0.017
 course+0.017
Neg logit contributions
 entire-0.008
 un-0.007
 pack-0.007
 open-0.006
 whole-0.006
Token,
Token ID11
Feature activation-1.12e+00

Pos logit contributions
 conclusions+0.074
orting+0.071
 mosquitoes+0.069
aro+0.068
ican+0.068
Neg logit contributions
 un-0.032
 was-0.031
 open-0.031
 pack-0.030
 opened-0.029
Token gir
Token ID37370
Feature activation-4.34e+00

Pos logit contributions
orting+0.026
 conclusions+0.025
aro+0.023
ican+0.023
arling+0.022
Neg logit contributions
 un-0.033
 entire-0.032
 open-0.032
 pack-0.032
 was-0.032
Tokenaff
Token ID2001
Feature activation-2.02e+00

Pos logit contributions
 conclusions+0.114
 narrative+0.105
 violence+0.100
 caution+0.097
 unnecessary+0.096
Neg logit contributions
 open-0.108
 with-0.104
 was-0.103
 un-0.102
 opened-0.102
Tokenes
Token ID274
Feature activation-1.45e+00

Pos logit contributions
 conclusions+0.067
orting+0.065
 mosquitoes+0.062
ican+0.061
 hawk+0.060
Neg logit contributions
 un-0.038
 was-0.036
 entire-0.036
 opened-0.035
 pack-0.035
Token,
Token ID11
Feature activation-1.62e+00

Pos logit contributions
 conclusions+0.057
orting+0.055
 mosquitoes+0.054
ican+0.053
 narrative+0.053
Neg logit contributions
 was-0.020
 un-0.020
 open-0.019
 pack-0.018
 with-0.018
Token ze
Token ID41271
Feature activation-5.85e+00

Pos logit contributions
orting+0.048
 conclusions+0.047
aro+0.044
ican+0.044
 mosquitoes+0.044
Neg logit contributions
 un-0.037
 entire-0.035
 was-0.035
 open-0.035
 pack-0.035
Tokenbr
Token ID1671
Feature activation-2.63e+00

Pos logit contributions
 conclusions+0.149
 narrative+0.134
 elbows+0.134
 injuries+0.130
 violence+0.127
Neg logit contributions
 un-0.153
 open-0.150
 with-0.150
 opened-0.148
 was-0.147
Tokenas
Token ID292
Feature activation-2.61e+00

Pos logit contributions
 conclusions+0.058
orting+0.057
 mosquitoes+0.056
ican+0.053
 hawk+0.053
Neg logit contributions
 entire-0.078
 un-0.077
 pack-0.075
 open-0.073
 opened-0.073
Token,
Token ID11
Feature activation-1.80e+00

Pos logit contributions
 conclusions+0.097
 mosquitoes+0.092
orting+0.092
 narrative+0.091
 unnecessary+0.090
Neg logit contributions
 un-0.041
 open-0.039
 was-0.039
 pack-0.038
 with-0.037
Token and
Token ID290
Feature activation-1.74e+00

Pos logit contributions
orting+0.067
 conclusions+0.067
aro+0.064
 mosquitoes+0.064
ican+0.063
Neg logit contributions
 un-0.026
 entire-0.026
 was-0.025
 pack-0.024
 open-0.024
Token more
Token ID517
Feature activation-1.53e+00

Pos logit contributions
orting+0.045
 conclusions+0.042
ican+0.041
aro+0.040
 mosquitoes+0.040
Neg logit contributions
 un-0.047
 entire-0.045
 was-0.045
 open-0.045
 pack-0.045
Token of
Token ID286
Feature activation+4.65e-01

Pos logit contributions
 entire+0.005
 un+0.004
 pack+0.004
 open+0.004
 opened+0.004
Neg logit contributions
orting-0.007
 conclusions-0.007
 mosquitoes-0.007
aro-0.006
ican-0.006
Token animals
Token ID4695
Feature activation-6.58e-01

Pos logit contributions
 entire+0.010
 un+0.009
 pack+0.009
 open+0.009
 was+0.009
Neg logit contributions
orting-0.015
 conclusions-0.014
 mosquitoes-0.014
aro-0.014
ican-0.013
Token like
Token ID588
Feature activation-6.66e-01

Pos logit contributions
 conclusions+0.023
orting+0.023
 mosquitoes+0.023
aro+0.022
ican+0.022
Neg logit contributions
 entire-0.011
 un-0.011
 pack-0.010
 open-0.010
 was-0.010
Token lions
Token ID31420
Feature activation-2.33e+00

Pos logit contributions
orting+0.015
 conclusions+0.014
 mosquitoes+0.014
aro+0.013
ican+0.013
Neg logit contributions
 entire-0.020
 un-0.020
 pack-0.019
 open-0.019
 was-0.019
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
 conclusions+0.087
orting+0.084
 mosquitoes+0.081
aro+0.080
ican+0.080
Neg logit contributions
 un-0.036
 was-0.035
 entire-0.035
 open-0.033
 opened-0.032
Token ze
Token ID41271
Feature activation-5.80e+00

Pos logit contributions
orting+0.023
 conclusions+0.022
 mosquitoes+0.020
aro+0.020
ican+0.019
Neg logit contributions
 un-0.034
 entire-0.034
 pack-0.033
 was-0.032
 open-0.032
Tokenbr
Token ID1671
Feature activation-2.27e+00

Pos logit contributions
 conclusions+0.147
 narrative+0.132
 elbows+0.131
 violence+0.128
 fault+0.125
Neg logit contributions
 was-0.154
 un-0.154
 opened-0.153
 open-0.151
 entire-0.150
Tokenas
Token ID292
Feature activation-1.41e+00

Pos logit contributions
 conclusions+0.047
orting+0.047
 mosquitoes+0.045
aro+0.042
ican+0.042
Neg logit contributions
 entire-0.071
 un-0.070
 pack-0.068
 open-0.066
 opened-0.066
Token,
Token ID11
Feature activation-1.39e+00

Pos logit contributions
 conclusions+0.053
orting+0.051
 mosquitoes+0.051
 hawk+0.049
aro+0.048
Neg logit contributions
 un-0.021
 entire-0.020
 was-0.020
 pack-0.020
 open-0.019
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
orting+0.041
 conclusions+0.041
 mosquitoes+0.040
aro+0.038
ican+0.038
Neg logit contributions
 entire-0.031
 un-0.031
 pack-0.030
 open-0.029
 was-0.028
Token gir
Token ID37370
Feature activation-4.83e+00

Pos logit contributions
orting+0.023
 conclusions+0.022
 mosquitoes+0.021
aro+0.020
ican+0.020
Neg logit contributions
 entire-0.034
 un-0.034
 pack-0.033
 open-0.032
 was-0.032
Token animals
Token ID4695
Feature activation-1.32e+00

Pos logit contributions
 entire+0.024
 pack+0.022
 un+0.022
 opened+0.021
 whole+0.021
Neg logit contributions
orting-0.025
 course-0.025
 mosquitoes-0.024
 conclusions-0.024
orns-0.024
Token,
Token ID11
Feature activation-1.51e+00

Pos logit contributions
orting+0.043
 conclusions+0.043
 mosquitoes+0.042
aro+0.041
ican+0.040
Neg logit contributions
 entire-0.025
 un-0.024
 pack-0.023
 open-0.022
 opened-0.022
Token like
Token ID588
Feature activation-9.31e-01

Pos logit contributions
orting+0.046
 conclusions+0.046
 mosquitoes+0.045
aro+0.044
ican+0.043
Neg logit contributions
 entire-0.032
 un-0.031
 pack-0.030
 open-0.029
 was-0.029
Token lions
Token ID31420
Feature activation-2.55e+00

Pos logit contributions
 mosquitoes+0.017
orting+0.017
 conclusions+0.017
aro+0.016
 course+0.016
Neg logit contributions
 entire-0.031
 un-0.030
 pack-0.030
 open-0.029
 opened-0.028
Token,
Token ID11
Feature activation-1.62e+00

Pos logit contributions
 conclusions+0.079
orting+0.077
 mosquitoes+0.076
aro+0.074
ican+0.073
Neg logit contributions
 un-0.054
 entire-0.053
 was-0.051
 pack-0.051
 open-0.050
Token ze
Token ID41271
Feature activation-5.76e+00

Pos logit contributions
orting+0.033
 conclusions+0.032
 mosquitoes+0.030
aro+0.029
ican+0.029
Neg logit contributions
 entire-0.051
 un-0.051
 pack-0.050
 was-0.049
 open-0.049
Tokenbr
Token ID1671
Feature activation-2.51e+00

Pos logit contributions
 conclusions+0.148
 narrative+0.130
 violence+0.125
 fault+0.124
 elbows+0.124
Neg logit contributions
 un-0.159
 was-0.155
 open-0.155
 opened-0.154
 with-0.150
Tokenas
Token ID292
Feature activation-2.17e+00

Pos logit contributions
 conclusions+0.052
orting+0.052
 mosquitoes+0.050
aro+0.047
ican+0.046
Neg logit contributions
 entire-0.078
 un-0.077
 pack-0.075
 open-0.073
 opened-0.073
Token,
Token ID11
Feature activation-1.62e+00

Pos logit contributions
 conclusions+0.079
 mosquitoes+0.076
orting+0.075
 hawk+0.072
 unnecessary+0.072
Neg logit contributions
 un-0.036
 was-0.034
 entire-0.034
 open-0.033
 pack-0.033
Token and
Token ID290
Feature activation-1.69e+00

Pos logit contributions
orting+0.063
 conclusions+0.063
 mosquitoes+0.061
aro+0.060
ican+0.059
Neg logit contributions
 entire-0.021
 un-0.020
 pack-0.019
 open-0.018
 opened-0.017
Token gir
Token ID37370
Feature activation-4.68e+00

Pos logit contributions
orting+0.038
 conclusions+0.036
 mosquitoes+0.034
aro+0.033
ican+0.033
Neg logit contributions
 un-0.051
 entire-0.050
 was-0.049
 pack-0.049
 open-0.048
Token animals
Token ID4695
Feature activation-1.48e+00

Pos logit contributions
 entire+0.017
 pack+0.016
 un+0.016
 opened+0.016
 whole+0.015
Neg logit contributions
orting-0.020
 conclusions-0.020
 mosquitoes-0.020
 course-0.020
aro-0.019
Token,
Token ID11
Feature activation-1.78e+00

Pos logit contributions
 conclusions+0.052
orting+0.051
 mosquitoes+0.050
aro+0.048
ican+0.048
Neg logit contributions
 un-0.025
 entire-0.025
 pack-0.024
 open-0.023
 was-0.023
Token like
Token ID588
Feature activation-1.16e+00

Pos logit contributions
orting+0.057
 conclusions+0.057
 mosquitoes+0.055
aro+0.054
ican+0.054
Neg logit contributions
 entire-0.035
 un-0.035
 pack-0.033
 open-0.033
 was-0.032
Token lions
Token ID31420
Feature activation-2.78e+00

Pos logit contributions
orting+0.024
 conclusions+0.023
 mosquitoes+0.023
aro+0.022
 course+0.021
Neg logit contributions
 entire-0.036
 un-0.035
 pack-0.035
 open-0.034
 opened-0.033
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
 conclusions+0.091
orting+0.087
 mosquitoes+0.086
aro+0.083
ican+0.083
Neg logit contributions
 un-0.056
 was-0.054
 entire-0.054
 open-0.053
 pack-0.053
Token ze
Token ID41271
Feature activation-5.73e+00

Pos logit contributions
orting+0.046
 conclusions+0.045
 mosquitoes+0.041
aro+0.040
ican+0.040
Neg logit contributions
 un-0.063
 entire-0.062
 was-0.062
 pack-0.062
 open-0.061
Tokenbr
Token ID1671
Feature activation-2.71e+00

Pos logit contributions
 conclusions+0.147
 narrative+0.129
 fault+0.125
 violence+0.124
 elbows+0.123
Neg logit contributions
 un-0.158
 open-0.153
 opened-0.152
 was-0.151
 with-0.147
Tokenas
Token ID292
Feature activation-2.46e+00

Pos logit contributions
 conclusions+0.062
orting+0.060
 mosquitoes+0.059
ican+0.056
 hawk+0.055
Neg logit contributions
 entire-0.080
 un-0.079
 pack-0.077
 open-0.075
 opened-0.075
Token,
Token ID11
Feature activation-2.12e+00

Pos logit contributions
 conclusions+0.091
 mosquitoes+0.087
orting+0.085
 unnecessary+0.084
 narrative+0.083
Neg logit contributions
 un-0.040
 was-0.039
 open-0.038
 pack-0.037
 entire-0.036
Token and
Token ID290
Feature activation-2.09e+00

Pos logit contributions
 conclusions+0.086
orting+0.086
 mosquitoes+0.083
aro+0.082
ican+0.081
Neg logit contributions
 entire-0.024
 un-0.024
 pack-0.022
 was-0.022
 open-0.022
Token gir
Token ID37370
Feature activation-4.61e+00

Pos logit contributions
orting+0.050
 conclusions+0.047
ican+0.045
aro+0.044
 mosquitoes+0.044
Neg logit contributions
 un-0.060
 was-0.059
 entire-0.058
 pack-0.058
 open-0.058
Token lions
Token ID31420
Feature activation-2.91e+00

Pos logit contributions
orting+0.025
 conclusions+0.025
 mosquitoes+0.024
aro+0.023
 course+0.022
Neg logit contributions
 entire-0.045
 un-0.043
 pack-0.043
 open-0.041
 opened-0.041
Token,
Token ID11
Feature activation-1.75e+00

Pos logit contributions
 conclusions+0.094
orting+0.090
 mosquitoes+0.088
aro+0.086
ican+0.086
Neg logit contributions
 un-0.059
 was-0.058
 entire-0.057
 open-0.056
 pack-0.056
Token monkeys
Token ID26697
Feature activation-3.42e+00

Pos logit contributions
orting+0.040
 conclusions+0.040
 mosquitoes+0.037
aro+0.036
ican+0.036
Neg logit contributions
 un-0.050
 entire-0.050
 was-0.049
 pack-0.049
 open-0.048
Token,
Token ID11
Feature activation-1.67e+00

Pos logit contributions
 conclusions+0.131
 mosquitoes+0.120
orting+0.120
 narrative+0.117
ican+0.117
Neg logit contributions
 was-0.058
 un-0.056
 open-0.054
 with-0.053
 opened-0.052
Token and
Token ID290
Feature activation-1.61e+00

Pos logit contributions
 conclusions+0.071
orting+0.070
aro+0.067
 mosquitoes+0.067
ican+0.066
Neg logit contributions
 un-0.016
 entire-0.016
 was-0.015
 pack-0.015
 open-0.015
Token ze
Token ID41271
Feature activation-5.73e+00

Pos logit contributions
orting+0.042
 conclusions+0.040
aro+0.038
ican+0.037
 mosquitoes+0.037
Neg logit contributions
 un-0.043
 entire-0.042
 was-0.041
 pack-0.041
 open-0.041
Tokenbr
Token ID1671
Feature activation-2.48e+00

Pos logit contributions
 conclusions+0.148
 narrative+0.132
 elbows+0.127
 violence+0.126
 injuries+0.124
Neg logit contributions
 un-0.155
 was-0.152
 opened-0.150
 open-0.149
 with-0.146
Tokenas
Token ID292
Feature activation-1.11e+00

Pos logit contributions
 conclusions+0.053
orting+0.053
 mosquitoes+0.051
aro+0.048
ican+0.048
Neg logit contributions
 entire-0.076
 un-0.075
 pack-0.073
 open-0.071
 opened-0.071
Token.
Token ID13
Feature activation+1.77e-01

Pos logit contributions
 conclusions+0.042
orting+0.041
 mosquitoes+0.041
 hawk+0.039
 unnecessary+0.039
Neg logit contributions
 un-0.016
 entire-0.016
 pack-0.015
 open-0.015
 was-0.015
Token Sara
Token ID24799
Feature activation-2.65e-02

Pos logit contributions
 entire+0.004
 un+0.004
 unlocked+0.004
 pack+0.004
 whole+0.003
Neg logit contributions
orting-0.005
 conclusions-0.005
 mosquitoes-0.005
aro-0.005
crow-0.005
Token liked
Token ID8288
Feature activation-1.43e+00

Pos logit contributions
orting+0.001
 mosquitoes+0.001
bs+0.001
 course+0.001
aro+0.001
Neg logit contributions
 entire-0.000
 pack-0.000
 un-0.000
 whole-0.000
 open-0.000
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
orting+0.063
 conclusions+0.062
 mosquitoes+0.061
aro+0.060
ican+0.059
Neg logit contributions
 entire-0.033
 un-0.032
 pack-0.030
 open-0.029
 opened-0.028
Token tigers
Token ID43114
Feature activation-2.27e+00

Pos logit contributions
orting+0.026
 conclusions+0.026
 mosquitoes+0.024
aro+0.024
ican+0.023
Neg logit contributions
 entire-0.031
 un-0.031
 pack-0.030
 open-0.029
 was-0.029
Token,
Token ID11
Feature activation-1.12e+00

Pos logit contributions
 conclusions+0.074
orting+0.073
 mosquitoes+0.071
aro+0.069
ican+0.069
Neg logit contributions
 un-0.043
 entire-0.043
 pack-0.042
 was-0.041
 open-0.040
Token monkeys
Token ID26697
Feature activation-3.13e+00

Pos logit contributions
orting+0.042
 conclusions+0.042
 mosquitoes+0.041
aro+0.040
ican+0.040
Neg logit contributions
 entire-0.016
 un-0.015
 pack-0.015
 open-0.014
 was-0.014
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
 conclusions+0.119
orting+0.111
 mosquitoes+0.110
 narrative+0.108
ican+0.107
Neg logit contributions
 was-0.050
 un-0.050
 open-0.047
 pack-0.047
 with-0.046
Token ze
Token ID41271
Feature activation-5.64e+00

Pos logit contributions
orting+0.030
 conclusions+0.029
 mosquitoes+0.028
aro+0.027
ican+0.027
Neg logit contributions
 entire-0.028
 un-0.028
 pack-0.027
 was-0.027
 open-0.027
Tokenbr
Token ID1671
Feature activation-2.53e+00

Pos logit contributions
 conclusions+0.145
 narrative+0.130
 elbows+0.120
 injuries+0.119
 violence+0.118
Neg logit contributions
 un-0.154
 was-0.150
 opened-0.148
 open-0.147
 with-0.144
Tokenas
Token ID292
Feature activation-1.06e+00

Pos logit contributions
 conclusions+0.053
orting+0.053
 mosquitoes+0.051
aro+0.048
ican+0.048
Neg logit contributions
 entire-0.078
 un-0.077
 pack-0.075
 open-0.073
 opened-0.073
Token.
Token ID13
Feature activation+5.12e-01

Pos logit contributions
 conclusions+0.040
orting+0.039
 mosquitoes+0.038
 hawk+0.037
 narrative+0.036
Neg logit contributions
 un-0.016
 entire-0.015
 pack-0.015
 was-0.015
 open-0.014
Token He
Token ID679
Feature activation+8.12e-02

Pos logit contributions
 entire+0.010
 un+0.009
 pack+0.009
 whole+0.008
 opened+0.008
Neg logit contributions
orting-0.017
 conclusions-0.016
 mosquitoes-0.016
 course-0.016
ican-0.015
Token was
Token ID373
Feature activation-6.87e-01

Pos logit contributions
 entire+0.001
 pack+0.001
 un+0.001
 whole+0.001
 open+0.001
Neg logit contributions
 mosquitoes-0.003
aro-0.003
orting-0.003
 hawk-0.003
 conclusions-0.003
Token lions
Token ID31420
Feature activation-2.76e+00

Pos logit contributions
orting+0.023
 conclusions+0.023
 mosquitoes+0.023
aro+0.021
 course+0.021
Neg logit contributions
 entire-0.043
 un-0.042
 pack-0.041
 open-0.040
 opened-0.040
Token,
Token ID11
Feature activation-2.09e+00

Pos logit contributions
 conclusions+0.087
orting+0.084
 mosquitoes+0.083
aro+0.080
ican+0.080
Neg logit contributions
 un-0.057
 entire-0.056
 was-0.055
 pack-0.054
 open-0.054
Token monkeys
Token ID26697
Feature activation-3.29e+00

Pos logit contributions
orting+0.046
 conclusions+0.045
 mosquitoes+0.041
aro+0.041
ican+0.040
Neg logit contributions
 un-0.062
 was-0.062
 entire-0.062
 pack-0.060
 open-0.060
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
 conclusions+0.125
 mosquitoes+0.115
orting+0.114
 unnecessary+0.113
 narrative+0.112
Neg logit contributions
 was-0.055
 un-0.054
 open-0.052
 with-0.051
 opened-0.051
Token and
Token ID290
Feature activation-1.85e+00

Pos logit contributions
 conclusions+0.085
orting+0.085
aro+0.081
ican+0.080
 mosquitoes+0.080
Neg logit contributions
 was-0.021
 un-0.021
 entire-0.020
 open-0.019
 pack-0.019
Token ze
Token ID41271
Feature activation-5.63e+00

Pos logit contributions
orting+0.046
 conclusions+0.043
aro+0.041
ican+0.041
 mosquitoes+0.040
Neg logit contributions
 un-0.052
 entire-0.051
 was-0.050
 pack-0.050
 open-0.050
Tokenbr
Token ID1671
Feature activation-2.57e+00

Pos logit contributions
 conclusions+0.145
 narrative+0.131
 elbows+0.124
 violence+0.124
 injuries+0.124
Neg logit contributions
 un-0.153
 was-0.149
 open-0.146
 opened-0.146
 with-0.145
Tokenas
Token ID292
Feature activation-1.13e+00

Pos logit contributions
 conclusions+0.058
orting+0.056
 mosquitoes+0.055
ican+0.052
 hawk+0.052
Neg logit contributions
 entire-0.076
 un-0.075
 pack-0.073
 open-0.071
 opened-0.071
Token.
Token ID13
Feature activation+3.04e-01

Pos logit contributions
 conclusions+0.043
orting+0.042
 mosquitoes+0.042
 hawk+0.041
 unnecessary+0.040
Neg logit contributions
 un-0.016
 entire-0.015
 pack-0.015
 open-0.015
 was-0.014
Token But
Token ID887
Feature activation-6.76e-01

Pos logit contributions
 entire+0.005
 un+0.005
 pack+0.005
 open+0.005
 was+0.005
Neg logit contributions
orting-0.011
 conclusions-0.011
 mosquitoes-0.010
aro-0.010
 hawk-0.010
Token their
Token ID511
Feature activation+8.91e-01

Pos logit contributions
orting+0.021
 conclusions+0.021
 mosquitoes+0.020
aro+0.020
ican+0.019
Neg logit contributions
 entire-0.014
 un-0.014
 pack-0.013
 open-0.013
 opened-0.012
Token animals
Token ID4695
Feature activation-1.25e+00

Pos logit contributions
orting+0.010
 conclusions+0.010
 mosquitoes+0.010
aro+0.010
 course+0.010
Neg logit contributions
 entire-0.009
 un-0.009
 pack-0.009
 open-0.008
 opened-0.008
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
 conclusions+0.046
orting+0.046
 mosquitoes+0.045
 hawk+0.043
aro+0.043
Neg logit contributions
 un-0.019
 entire-0.018
 pack-0.018
 open-0.018
 was-0.018
Token like
Token ID588
Feature activation-3.13e-01

Pos logit contributions
orting+0.037
 conclusions+0.037
 mosquitoes+0.036
aro+0.035
ican+0.034
Neg logit contributions
 entire-0.025
 un-0.025
 pack-0.024
 open-0.023
 opened-0.023
Token lions
Token ID31420
Feature activation-2.55e+00

Pos logit contributions
 mosquitoes+0.006
orting+0.006
 conclusions+0.006
aro+0.006
 course+0.006
Neg logit contributions
 entire-0.010
 un-0.009
 pack-0.009
 opened-0.009
 open-0.009
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
 conclusions+0.086
orting+0.082
 mosquitoes+0.081
aro+0.078
ican+0.078
Neg logit contributions
 un-0.049
 was-0.048
 entire-0.046
 open-0.046
 pack-0.046
Token ze
Token ID41271
Feature activation-5.61e+00

Pos logit contributions
orting+0.034
 conclusions+0.034
 mosquitoes+0.031
aro+0.030
ican+0.030
Neg logit contributions
 un-0.048
 entire-0.047
 was-0.047
 pack-0.047
 open-0.046
Tokenbr
Token ID1671
Feature activation-2.25e+00

Pos logit contributions
 conclusions+0.150
 narrative+0.134
 elbows+0.131
 fault+0.130
 violence+0.127
Neg logit contributions
 un-0.148
 open-0.145
 opened-0.143
 was-0.142
 with-0.139
Tokenas
Token ID292
Feature activation-2.34e+00

Pos logit contributions
 conclusions+0.049
orting+0.048
 mosquitoes+0.047
aro+0.044
ican+0.044
Neg logit contributions
 entire-0.068
 un-0.067
 pack-0.066
 open-0.064
 opened-0.064
Token,
Token ID11
Feature activation-1.67e+00

Pos logit contributions
 conclusions+0.090
 mosquitoes+0.086
orting+0.084
 unnecessary+0.083
 narrative+0.083
Neg logit contributions
 un-0.035
 was-0.033
 open-0.032
 pack-0.032
 entire-0.031
Token gir
Token ID37370
Feature activation-4.74e+00

Pos logit contributions
orting+0.067
 conclusions+0.067
 mosquitoes+0.064
aro+0.064
ican+0.063
Neg logit contributions
 entire-0.020
 un-0.020
 pack-0.019
 was-0.018
 open-0.018
Tokenaff
Token ID2001
Feature activation-2.61e+00

Pos logit contributions
 conclusions+0.132
 narrative+0.123
 violence+0.116
 mosquitoes+0.112
 unnecessary+0.111
Neg logit contributions
 open-0.114
 un-0.111
 was-0.111
 opened-0.109
 with-0.108
Token animals
Token ID4695
Feature activation-9.85e-01

Pos logit contributions
 entire+0.018
 un+0.017
 pack+0.017
 opened+0.016
 open+0.016
Neg logit contributions
 mosquitoes-0.020
 conclusions-0.020
orting-0.020
 course-0.019
aro-0.019
Token,
Token ID11
Feature activation-1.63e+00

Pos logit contributions
 conclusions+0.035
orting+0.035
 mosquitoes+0.034
aro+0.033
ican+0.032
Neg logit contributions
 entire-0.016
 un-0.016
 pack-0.015
 open-0.015
 was-0.014
Token like
Token ID588
Feature activation-6.72e-01

Pos logit contributions
orting+0.052
 conclusions+0.051
 mosquitoes+0.049
aro+0.049
ican+0.048
Neg logit contributions
 entire-0.033
 un-0.033
 pack-0.032
 open-0.031
 was-0.030
Token monkeys
Token ID26697
Feature activation-2.94e+00

Pos logit contributions
 mosquitoes+0.013
 conclusions+0.013
orting+0.013
aro+0.012
 course+0.012
Neg logit contributions
 entire-0.022
 un-0.021
 pack-0.021
 open-0.020
 opened-0.020
Token,
Token ID11
Feature activation-2.01e+00

Pos logit contributions
 conclusions+0.108
orting+0.099
 mosquitoes+0.099
ican+0.096
 unnecessary+0.096
Neg logit contributions
 un-0.053
 was-0.052
 open-0.050
 opened-0.049
 with-0.048
Token ze
Token ID41271
Feature activation-5.58e+00

Pos logit contributions
orting+0.060
 conclusions+0.059
 mosquitoes+0.055
aro+0.055
ican+0.054
Neg logit contributions
 un-0.045
 was-0.045
 pack-0.044
 open-0.043
 opened-0.043
Tokenbr
Token ID1671
Feature activation-2.47e+00

Pos logit contributions
 conclusions+0.144
 narrative+0.129
 violence+0.124
 elbows+0.123
 fault+0.122
Neg logit contributions
 un-0.152
 open-0.148
 opened-0.147
 was-0.146
 with-0.143
Tokenas
Token ID292
Feature activation-2.11e+00

Pos logit contributions
 conclusions+0.056
orting+0.054
 mosquitoes+0.053
ican+0.050
aro+0.050
Neg logit contributions
 entire-0.073
 un-0.072
 pack-0.071
 open-0.069
 opened-0.069
Token and
Token ID290
Feature activation-1.74e+00

Pos logit contributions
 conclusions+0.075
 mosquitoes+0.072
orting+0.071
 unnecessary+0.069
 narrative+0.068
Neg logit contributions
 un-0.037
 was-0.036
 open-0.035
 pack-0.035
 entire-0.034
Token gir
Token ID37370
Feature activation-4.73e+00

Pos logit contributions
orting+0.040
 conclusions+0.038
 mosquitoes+0.036
aro+0.035
ican+0.035
Neg logit contributions
 un-0.051
 entire-0.050
 was-0.050
 pack-0.049
 open-0.049
Tokenaff
Token ID2001
Feature activation-2.13e+00

Pos logit contributions
 conclusions+0.126
 narrative+0.120
 violence+0.112
 mosquitoes+0.109
 hawk+0.106
Neg logit contributions
 open-0.116
 un-0.114
 was-0.113
 opened-0.112
 entire-0.110
Token animals
Token ID4695
Feature activation-1.34e+00

Pos logit contributions
 entire+0.021
 pack+0.020
 un+0.019
 opened+0.019
 whole+0.018
Neg logit contributions
orting-0.023
 course-0.022
 conclusions-0.022
 mosquitoes-0.022
orns-0.021
Token,
Token ID11
Feature activation-1.62e+00

Pos logit contributions
orting+0.044
 conclusions+0.044
 mosquitoes+0.043
aro+0.042
ican+0.041
Neg logit contributions
 entire-0.025
 un-0.024
 pack-0.024
 open-0.023
 opened-0.022
Token like
Token ID588
Feature activation-1.02e+00

Pos logit contributions
orting+0.050
 conclusions+0.050
 mosquitoes+0.048
aro+0.047
ican+0.047
Neg logit contributions
 entire-0.034
 un-0.033
 pack-0.032
 open-0.031
 was-0.031
Token lions
Token ID31420
Feature activation-2.61e+00

Pos logit contributions
orting+0.019
 mosquitoes+0.019
 conclusions+0.019
aro+0.018
 course+0.018
Neg logit contributions
 entire-0.033
 un-0.032
 pack-0.032
 open-0.031
 opened-0.031
Token,
Token ID11
Feature activation-1.77e+00

Pos logit contributions
 conclusions+0.081
orting+0.079
 mosquitoes+0.078
aro+0.075
ican+0.075
Neg logit contributions
 un-0.055
 entire-0.055
 pack-0.052
 was-0.052
 open-0.052
Token ze
Token ID41271
Feature activation-5.57e+00

Pos logit contributions
orting+0.036
 conclusions+0.036
 mosquitoes+0.034
aro+0.032
ican+0.032
Neg logit contributions
 un-0.055
 entire-0.055
 pack-0.054
 was-0.054
 open-0.053
Tokenbr
Token ID1671
Feature activation-2.69e+00

Pos logit contributions
 conclusions+0.140
 narrative+0.122
 fault+0.117
 violence+0.116
 elbows+0.115
Neg logit contributions
 un-0.159
 open-0.154
 was-0.154
 opened-0.152
 with-0.149
Tokenas
Token ID292
Feature activation-2.14e+00

Pos logit contributions
 conclusions+0.057
orting+0.056
 mosquitoes+0.055
aro+0.052
ican+0.051
Neg logit contributions
 entire-0.083
 un-0.081
 pack-0.080
 open-0.078
 opened-0.077
Token,
Token ID11
Feature activation-1.83e+00

Pos logit contributions
 conclusions+0.077
 mosquitoes+0.074
orting+0.073
 hawk+0.070
 narrative+0.070
Neg logit contributions
 un-0.036
 entire-0.034
 was-0.034
 pack-0.034
 open-0.034
Token and
Token ID290
Feature activation-1.91e+00

Pos logit contributions
orting+0.072
 conclusions+0.072
 mosquitoes+0.070
aro+0.068
ican+0.068
Neg logit contributions
 entire-0.023
 un-0.022
 pack-0.021
 open-0.020
 was-0.020
Token gir
Token ID37370
Feature activation-4.55e+00

Pos logit contributions
orting+0.044
 conclusions+0.042
 mosquitoes+0.039
ican+0.039
aro+0.039
Neg logit contributions
 un-0.056
 entire-0.055
 was-0.055
 pack-0.054
 open-0.054
Token lions
Token ID31420
Feature activation-2.73e+00

Pos logit contributions
orting+0.025
 conclusions+0.024
 mosquitoes+0.024
aro+0.023
 course+0.022
Neg logit contributions
 entire-0.042
 un-0.040
 pack-0.040
 open-0.039
 opened-0.038
Token,
Token ID11
Feature activation-2.10e+00

Pos logit contributions
 conclusions+0.088
orting+0.084
 mosquitoes+0.082
aro+0.081
ican+0.080
Neg logit contributions
 un-0.056
 was-0.054
 entire-0.054
 open-0.053
 pack-0.053
Token monkeys
Token ID26697
Feature activation-3.31e+00

Pos logit contributions
orting+0.049
 conclusions+0.048
 mosquitoes+0.044
ican+0.043
aro+0.043
Neg logit contributions
 un-0.061
 was-0.060
 entire-0.059
 pack-0.059
 open-0.059
Token,
Token ID11
Feature activation-1.98e+00

Pos logit contributions
 conclusions+0.128
 mosquitoes+0.116
orting+0.116
 unnecessary+0.115
ican+0.114
Neg logit contributions
 was-0.056
 un-0.055
 open-0.052
 with-0.052
 opened-0.051
Token and
Token ID290
Feature activation-2.02e+00

Pos logit contributions
 conclusions+0.084
orting+0.084
aro+0.080
ican+0.079
 mosquitoes+0.078
Neg logit contributions
 was-0.020
 un-0.020
 open-0.018
 pack-0.018
 opened-0.018
Token ze
Token ID41271
Feature activation-5.56e+00

Pos logit contributions
orting+0.053
 conclusions+0.049
ican+0.047
aro+0.047
ittens+0.046
Neg logit contributions
 un-0.055
 was-0.053
 open-0.053
 entire-0.052
 pack-0.052
Tokenbr
Token ID1671
Feature activation-2.36e+00

Pos logit contributions
 conclusions+0.142
 narrative+0.130
 violence+0.122
 elbows+0.121
 fault+0.120
Neg logit contributions
 un-0.151
 was-0.145
 open-0.143
 opened-0.142
 with-0.141
Tokenas
Token ID292
Feature activation-1.27e+00

Pos logit contributions
 conclusions+0.051
orting+0.051
 mosquitoes+0.049
ican+0.047
aro+0.046
Neg logit contributions
 entire-0.072
 un-0.071
 pack-0.069
 open-0.067
 opened-0.067
Token.
Token ID13
Feature activation+1.43e-01

Pos logit contributions
 conclusions+0.049
 mosquitoes+0.048
orting+0.048
 hawk+0.047
 unnecessary+0.047
Neg logit contributions
 un-0.017
 pack-0.016
 entire-0.016
 open-0.016
 was-0.015
Token But
Token ID887
Feature activation-9.68e-01

Pos logit contributions
 un+0.002
 entire+0.002
 pack+0.002
 was+0.002
 open+0.002
Neg logit contributions
orting-0.005
 conclusions-0.005
aro-0.005
 mosquitoes-0.005
 hawk-0.005
Token their
Token ID511
Feature activation+6.46e-01

Pos logit contributions
orting+0.031
 conclusions+0.031
 mosquitoes+0.030
aro+0.029
ican+0.029
Neg logit contributions
 entire-0.019
 un-0.019
 pack-0.018
 open-0.018
 was-0.018
Token of
Token ID286
Feature activation+4.79e-01

Pos logit contributions
 entire+0.008
 un+0.007
 pack+0.007
 open+0.007
 opened+0.007
Neg logit contributions
orting-0.011
 conclusions-0.011
 mosquitoes-0.011
aro-0.011
 course-0.010
Token animals
Token ID4695
Feature activation-1.12e+00

Pos logit contributions
 entire+0.011
 pack+0.011
 un+0.010
 opened+0.010
 unlocked+0.010
Neg logit contributions
orting-0.013
 conclusions-0.013
 mosquitoes-0.013
 course-0.012
orns-0.012
Token like
Token ID588
Feature activation-6.59e-01

Pos logit contributions
orting+0.038
 conclusions+0.037
 mosquitoes+0.037
aro+0.036
 course+0.036
Neg logit contributions
 entire-0.020
 un-0.019
 pack-0.018
 open-0.017
 opened-0.017
Token lions
Token ID31420
Feature activation-2.35e+00

Pos logit contributions
orting+0.012
 conclusions+0.012
 mosquitoes+0.012
aro+0.012
 course+0.012
Neg logit contributions
 entire-0.022
 un-0.020
 pack-0.020
 open-0.020
 opened-0.020
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
 conclusions+0.080
orting+0.078
 mosquitoes+0.076
aro+0.074
ican+0.074
Neg logit contributions
 un-0.043
 entire-0.042
 was-0.040
 pack-0.040
 open-0.039
Token ze
Token ID41271
Feature activation-5.49e+00

Pos logit contributions
orting+0.023
 conclusions+0.023
 mosquitoes+0.021
aro+0.021
ican+0.020
Neg logit contributions
 entire-0.040
 un-0.040
 pack-0.039
 open-0.038
 was-0.038
Tokenbr
Token ID1671
Feature activation-2.40e+00

Pos logit contributions
 conclusions+0.143
 narrative+0.128
 elbows+0.123
 violence+0.120
 fault+0.120
Neg logit contributions
 un-0.150
 was-0.147
 open-0.144
 with-0.143
 opened-0.142
Tokenas
Token ID292
Feature activation-1.74e+00

Pos logit contributions
 conclusions+0.050
orting+0.049
 mosquitoes+0.048
aro+0.045
ican+0.044
Neg logit contributions
 entire-0.075
 un-0.074
 pack-0.072
 open-0.070
 opened-0.070
Token,
Token ID11
Feature activation-1.26e+00

Pos logit contributions
 conclusions+0.068
 mosquitoes+0.065
orting+0.062
 narrative+0.062
 unnecessary+0.062
Neg logit contributions
 un-0.026
 was-0.025
 pack-0.024
 open-0.023
 with-0.023
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
orting+0.040
 conclusions+0.040
 mosquitoes+0.039
aro+0.038
 course+0.037
Neg logit contributions
 entire-0.025
 un-0.024
 pack-0.023
 open-0.022
 opened-0.022
Token gir
Token ID37370
Feature activation-3.99e+00

Pos logit contributions
orting+0.023
 conclusions+0.023
 mosquitoes+0.021
aro+0.021
ican+0.020
Neg logit contributions
 entire-0.034
 un-0.034
 pack-0.033
 open-0.032
 was-0.032
Token lions
Token ID31420
Feature activation-2.38e+00

Pos logit contributions
orting+0.032
 conclusions+0.031
 mosquitoes+0.029
aro+0.028
ican+0.028
Neg logit contributions
 un-0.045
 entire-0.045
 pack-0.043
 was-0.043
 open-0.043
Token,
Token ID11
Feature activation-8.88e-01

Pos logit contributions
 conclusions+0.085
orting+0.083
 mosquitoes+0.080
aro+0.078
ican+0.078
Neg logit contributions
 un-0.040
 was-0.039
 entire-0.039
 open-0.037
 pack-0.036
Token monkeys
Token ID26697
Feature activation-3.00e+00

Pos logit contributions
orting+0.020
 conclusions+0.019
 mosquitoes+0.018
aro+0.017
ican+0.017
Neg logit contributions
 un-0.027
 entire-0.027
 pack-0.026
 was-0.026
 open-0.026
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
 conclusions+0.123
orting+0.114
 narrative+0.113
 mosquitoes+0.113
 unnecessary+0.112
Neg logit contributions
 was-0.044
 un-0.040
 with-0.038
 opened-0.037
 open-0.037
Token and
Token ID290
Feature activation-9.20e-01

Pos logit contributions
orting+0.033
 conclusions+0.033
 mosquitoes+0.031
aro+0.031
ican+0.030
Neg logit contributions
 entire-0.021
 un-0.021
 pack-0.020
 was-0.020
 open-0.020
Token ze
Token ID41271
Feature activation-5.48e+00

Pos logit contributions
orting+0.023
 conclusions+0.022
 mosquitoes+0.021
aro+0.021
ican+0.021
Neg logit contributions
 entire-0.025
 un-0.025
 pack-0.024
 open-0.024
 was-0.024
Tokenbr
Token ID1671
Feature activation-2.38e+00

Pos logit contributions
 conclusions+0.138
 narrative+0.122
 elbows+0.122
 violence+0.119
 injuries+0.117
Neg logit contributions
 un-0.150
 was-0.148
 opened-0.146
 open-0.142
 with-0.140
Tokenas
Token ID292
Feature activation-1.01e+00

Pos logit contributions
 conclusions+0.050
orting+0.049
 mosquitoes+0.048
aro+0.045
ican+0.045
Neg logit contributions
 entire-0.074
 un-0.072
 pack-0.071
 open-0.069
 opened-0.068
Token.
Token ID13
Feature activation-4.45e-01

Pos logit contributions
 conclusions+0.036
orting+0.036
 mosquitoes+0.035
 hawk+0.034
aro+0.034
Neg logit contributions
 un-0.016
 entire-0.016
 pack-0.015
 was-0.015
 open-0.015
Token But
Token ID887
Feature activation-1.23e+00

Pos logit contributions
 conclusions+0.018
orting+0.017
 mosquitoes+0.017
aro+0.017
 hawk+0.017
Neg logit contributions
 entire-0.006
 un-0.005
 pack-0.005
 was-0.005
 open-0.005
Token the
Token ID262
Feature activation-6.30e-02

Pos logit contributions
 conclusions+0.042
orting+0.042
 mosquitoes+0.041
ican+0.039
aro+0.039
Neg logit contributions
 un-0.022
 entire-0.021
 was-0.021
 pack-0.020
 open-0.020
Token animals
Token ID4695
Feature activation-1.21e+00

Pos logit contributions
 entire+0.016
 un+0.016
 pack+0.016
 opened+0.015
 open+0.014
Neg logit contributions
 conclusions-0.021
orting-0.021
 mosquitoes-0.021
 course-0.020
aro-0.020
Token,
Token ID11
Feature activation-5.09e-01

Pos logit contributions
 conclusions+0.044
orting+0.043
 mosquitoes+0.043
aro+0.041
 hawk+0.041
Neg logit contributions
 un-0.019
 entire-0.018
 was-0.018
 pack-0.018
 open-0.018
Token like
Token ID588
Feature activation-9.17e-01

Pos logit contributions
orting+0.016
 conclusions+0.016
 mosquitoes+0.016
aro+0.015
 course+0.015
Neg logit contributions
 entire-0.010
 un-0.010
 pack-0.010
 open-0.009
 opened-0.009
Token lions
Token ID31420
Feature activation-2.20e+00

Pos logit contributions
orting+0.024
 conclusions+0.023
 mosquitoes+0.022
aro+0.022
ican+0.021
Neg logit contributions
 entire-0.024
 un-0.024
 pack-0.023
 open-0.022
 was-0.022
Token,
Token ID11
Feature activation-9.71e-01

Pos logit contributions
 conclusions+0.076
orting+0.074
 mosquitoes+0.072
aro+0.070
ican+0.070
Neg logit contributions
 un-0.040
 was-0.039
 entire-0.038
 open-0.037
 pack-0.037
Token ze
Token ID41271
Feature activation-5.46e+00

Pos logit contributions
orting+0.023
 conclusions+0.022
 mosquitoes+0.021
aro+0.021
ican+0.020
Neg logit contributions
 entire-0.028
 un-0.028
 pack-0.027
 open-0.027
 was-0.027
Tokenbr
Token ID1671
Feature activation-2.35e+00

Pos logit contributions
 conclusions+0.145
 narrative+0.129
 elbows+0.124
 violence+0.120
 fault+0.119
Neg logit contributions
 un-0.147
 was-0.143
 opened-0.143
 open-0.142
 with-0.139
Tokenas
Token ID292
Feature activation-1.86e+00

Pos logit contributions
 conclusions+0.048
orting+0.048
 mosquitoes+0.046
aro+0.044
ican+0.043
Neg logit contributions
 entire-0.074
 un-0.073
 pack-0.071
 open-0.069
 opened-0.069
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
 conclusions+0.070
 mosquitoes+0.067
orting+0.066
 narrative+0.065
 unnecessary+0.064
Neg logit contributions
 un-0.029
 was-0.028
 pack-0.027
 open-0.027
 entire-0.026
Token and
Token ID290
Feature activation-1.16e+00

Pos logit contributions
orting+0.043
 conclusions+0.043
 mosquitoes+0.041
aro+0.041
ican+0.040
Neg logit contributions
 entire-0.014
 un-0.014
 pack-0.013
 open-0.012
 was-0.012
Token gir
Token ID37370
Feature activation-3.82e+00

Pos logit contributions
orting+0.027
 conclusions+0.026
 mosquitoes+0.025
aro+0.024
ican+0.024
Neg logit contributions
 un-0.033
 entire-0.033
 pack-0.032
 open-0.032
 was-0.032
Token butterflies
Token ID44571
Feature activation-2.99e+00

Pos logit contributions
orting+0.119
ican+0.114
 narrative+0.113
robat+0.113
crow+0.112
Neg logit contributions
 un-0.055
 a-0.054
 open-0.054
 with-0.054
 put-0.052
Token and
Token ID290
Feature activation-2.34e+00

Pos logit contributions
 conclusions+0.125
ican+0.122
 narrative+0.121
orting+0.120
aro+0.119
Neg logit contributions
 with-0.034
 was-0.033
 open-0.030
 un-0.030
 put-0.028
Token lady
Token ID10846
Feature activation-1.86e+00

Pos logit contributions
orting+0.070
ican+0.063
arling+0.063
ittens+0.063
aro+0.062
Neg logit contributions
 un-0.057
 was-0.056
 open-0.055
 put-0.055
 opened-0.054
Tokenbugs
Token ID32965
Feature activation-1.93e+00

Pos logit contributions
 conclusions+0.058
orting+0.057
 hawk+0.055
 mosquitoes+0.054
ican+0.054
Neg logit contributions
 un-0.039
 was-0.038
 entire-0.037
 pack-0.037
 open-0.037
Token and
Token ID290
Feature activation-2.21e+00

Pos logit contributions
 conclusions+0.077
ican+0.076
orting+0.074
aro+0.074
 mosquitoes+0.073
Neg logit contributions
 was-0.024
 with-0.024
 un-0.023
 open-0.023
 pack-0.021
Token cater
Token ID20825
Feature activation-5.46e+00

Pos logit contributions
orting+0.066
arling+0.060
aro+0.059
ican+0.059
ittens+0.059
Neg logit contributions
 un-0.053
 was-0.053
 open-0.052
 put-0.052
 opened-0.051
Tokenpill
Token ID27215
Feature activation-4.73e+00

Pos logit contributions
 narrative+0.155
 hurricane+0.138
 inconvenience+0.134
 injuries+0.131
 earthquake+0.129
Neg logit contributions
 entire-0.112
 with-0.110
 put-0.109
 a-0.107
 smells-0.107
Tokenars
Token ID945
Feature activation-5.90e-01

Pos logit contributions
ican+0.141
 conclusions+0.129
 hawk+0.128
aro+0.127
 Cle+0.125
Neg logit contributions
 entire-0.109
 whole-0.105
 un-0.103
 put-0.100
 fit-0.100
Token."
Token ID526
Feature activation-1.21e+00

Pos logit contributions
 conclusions+0.023
ican+0.023
orting+0.022
 mosquitoes+0.022
 hawk+0.022
Neg logit contributions
 un-0.008
 was-0.008
 entire-0.008
 open-0.007
 pack-0.007
Token
Token ID198
Feature activation-6.22e-01

Pos logit contributions
 conclusions+0.058
orting+0.057
leness+0.057
 hawk+0.057
 unnecessary+0.057
Neg logit contributions
 was-0.007
 un-0.005
 with-0.005
 He-0.004
 excited-0.003
Token
Token ID198
Feature activation-5.99e-01

Pos logit contributions
 conclusions+0.027
orting+0.027
 mosquitoes+0.027
ican+0.027
 hawk+0.026
Neg logit contributions
 was-0.005
 un-0.005
 entire-0.004
 open-0.004
 opened-0.003
Token lions
Token ID31420
Feature activation-1.39e+00

Pos logit contributions
 entire+0.013
 un+0.012
 pack+0.012
 open+0.012
 opened+0.011
Neg logit contributions
 conclusions-0.014
orting-0.014
 mosquitoes-0.014
aro-0.013
 hawk-0.013
Token,
Token ID11
Feature activation-9.00e-01

Pos logit contributions
orting+0.047
 conclusions+0.047
 mosquitoes+0.045
aro+0.044
ican+0.044
Neg logit contributions
 entire-0.025
 un-0.025
 pack-0.024
 open-0.023
 was-0.023
Token monkeys
Token ID26697
Feature activation-2.43e+00

Pos logit contributions
orting+0.019
 conclusions+0.018
 mosquitoes+0.017
ican+0.016
aro+0.016
Neg logit contributions
 entire-0.028
 un-0.028
 pack-0.027
 open-0.027
 was-0.027
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
 conclusions+0.090
orting+0.085
 mosquitoes+0.083
 narrative+0.083
ican+0.082
Neg logit contributions
 was-0.041
 un-0.040
 open-0.038
 opened-0.038
 with-0.037
Token and
Token ID290
Feature activation-1.42e+00

Pos logit contributions
orting+0.036
 conclusions+0.035
aro+0.032
ican+0.032
 mosquitoes+0.032
Neg logit contributions
 entire-0.031
 un-0.031
 open-0.030
 was-0.030
 pack-0.030
Token ze
Token ID41271
Feature activation-5.46e+00

Pos logit contributions
orting+0.034
 conclusions+0.033
 mosquitoes+0.031
aro+0.031
ican+0.031
Neg logit contributions
 entire-0.040
 un-0.039
 pack-0.038
 open-0.038
 was-0.037
Tokenbr
Token ID1671
Feature activation-2.73e+00

Pos logit contributions
 conclusions+0.138
 narrative+0.126
 fault+0.122
 injuries+0.121
 elbows+0.120
Neg logit contributions
 un-0.143
 was-0.141
 open-0.140
 with-0.140
 opened-0.133
Tokenas
Token ID292
Feature activation-1.60e+00

Pos logit contributions
 conclusions+0.070
 mosquitoes+0.067
orting+0.066
ican+0.064
 hawk+0.063
Neg logit contributions
 entire-0.073
 un-0.072
 pack-0.070
 opened-0.068
 open-0.068
Token.
Token ID13
Feature activation-5.78e-01

Pos logit contributions
 conclusions+0.061
orting+0.059
 mosquitoes+0.059
 narrative+0.057
 hawk+0.056
Neg logit contributions
 un-0.023
 open-0.021
 was-0.021
 entire-0.021
 pack-0.021
Token
Token ID198
Feature activation+3.05e-01

Pos logit contributions
 conclusions+0.024
orting+0.024
 mosquitoes+0.023
aro+0.023
ican+0.023
Neg logit contributions
 un-0.006
 entire-0.006
 pack-0.005
 open-0.005
 was-0.005
Token
Token ID198
Feature activation+7.03e-01

Pos logit contributions
 un+0.004
 entire+0.004
 was+0.003
 pack+0.003
 open+0.003
Neg logit contributions
orting-0.012
 conclusions-0.012
 mosquitoes-0.012
ican-0.012
aro-0.012
Tokenaff
Token ID2001
Feature activation-2.52e+00

Pos logit contributions
 conclusions+0.128
 narrative+0.114
 violence+0.113
 mosquitoes+0.111
 unnecessary+0.110
Neg logit contributions
 open-0.113
 entire-0.110
 was-0.110
 opened-0.109
 un-0.108
Tokenes
Token ID274
Feature activation-1.56e+00

Pos logit contributions
 conclusions+0.087
orting+0.084
 mosquitoes+0.081
ican+0.078
 narrative+0.077
Neg logit contributions
 un-0.045
 was-0.045
 entire-0.043
 opened-0.043
 open-0.041
Token,
Token ID11
Feature activation-1.36e+00

Pos logit contributions
 conclusions+0.066
 mosquitoes+0.062
 unnecessary+0.061
orting+0.061
obb+0.060
Neg logit contributions
 was-0.021
 with-0.019
 un-0.018
 open-0.018
 opened-0.017
Token and
Token ID290
Feature activation-8.96e-01

Pos logit contributions
orting+0.053
 conclusions+0.053
 mosquitoes+0.052
aro+0.051
 course+0.050
Neg logit contributions
 entire-0.017
 un-0.016
 pack-0.016
 open-0.014
 opened-0.014
Token fast
Token ID3049
Feature activation-4.71e-01

Pos logit contributions
orting+0.026
 conclusions+0.025
 mosquitoes+0.024
ican+0.024
aro+0.024
Neg logit contributions
 un-0.021
 entire-0.021
 pack-0.020
 was-0.020
 open-0.020
Token ze
Token ID41271
Feature activation-5.34e+00

Pos logit contributions
orting+0.014
 conclusions+0.014
 mosquitoes+0.014
aro+0.013
ican+0.013
Neg logit contributions
 entire-0.010
 un-0.010
 pack-0.010
 open-0.010
 was-0.009
Tokenbr
Token ID1671
Feature activation-2.50e+00

Pos logit contributions
 conclusions+0.132
 narrative+0.114
 elbows+0.112
 violence+0.110
 caution+0.109
Neg logit contributions
 un-0.149
 open-0.147
 opened-0.147
 was-0.144
 entire-0.142
Tokenas
Token ID292
Feature activation-1.10e+00

Pos logit contributions
 conclusions+0.054
orting+0.053
 mosquitoes+0.052
ican+0.049
aro+0.048
Neg logit contributions
 entire-0.076
 un-0.074
 pack-0.072
 open-0.071
 opened-0.070
Token.
Token ID13
Feature activation-9.49e-01

Pos logit contributions
 conclusions+0.043
 mosquitoes+0.042
orting+0.041
 hawk+0.040
ican+0.039
Neg logit contributions
 un-0.015
 entire-0.014
 open-0.014
 was-0.014
 pack-0.014
Token Tim
Token ID5045
Feature activation+9.22e-02

Pos logit contributions
 conclusions+0.036
orting+0.035
 mosquitoes+0.035
aro+0.034
 hawk+0.034
Neg logit contributions
 un-0.013
 entire-0.013
 was-0.013
 pack-0.012
 open-0.012
Token was
Token ID373
Feature activation-7.46e-01

Pos logit contributions
 entire+0.001
 pack+0.001
 un+0.001
 open+0.001
 whole+0.001
Neg logit contributions
orting-0.003
 course-0.003
orns-0.003
 mosquitoes-0.003
ican-0.003
Token like
Token ID588
Feature activation-1.73e+00

Pos logit contributions
orting+0.036
 conclusions+0.035
ican+0.033
aro+0.033
 hawk+0.033
Neg logit contributions
 un-0.019
 entire-0.019
 open-0.018
 pack-0.018
 was-0.018
Token monkeys
Token ID26697
Feature activation-3.48e+00

Pos logit contributions
orting+0.050
 conclusions+0.048
ican+0.045
 mosquitoes+0.044
aro+0.044
Neg logit contributions
 un-0.042
 entire-0.040
 was-0.039
 pack-0.039
 open-0.039
Token and
Token ID290
Feature activation-2.19e+00

Pos logit contributions
 conclusions+0.135
orting+0.124
 narrative+0.122
ican+0.122
 mosquitoes+0.122
Neg logit contributions
 was-0.058
 un-0.054
 with-0.053
 open-0.052
 opened-0.051
Token elephants
Token ID25368
Feature activation-2.57e+00

Pos logit contributions
orting+0.057
 conclusions+0.050
arling+0.048
ittens+0.048
ican+0.048
Neg logit contributions
 un-0.062
 was-0.062
 open-0.060
 opened-0.060
 pack-0.059
Token and
Token ID290
Feature activation-1.91e+00

Pos logit contributions
 conclusions+0.108
orting+0.104
 mosquitoes+0.102
ican+0.102
 unnecessary+0.102
Neg logit contributions
 was-0.029
 un-0.028
 with-0.027
 open-0.027
 opened-0.025
Token gir
Token ID37370
Feature activation-5.32e+00

Pos logit contributions
orting+0.052
 conclusions+0.046
ican+0.045
ittens+0.045
arling+0.045
Neg logit contributions
 un-0.051
 was-0.051
 open-0.050
 opened-0.049
 pack-0.049
Tokenaff
Token ID2001
Feature activation-2.61e+00

Pos logit contributions
 conclusions+0.146
 narrative+0.135
 violence+0.134
 mosquitoes+0.127
orting+0.127
Neg logit contributions
 open-0.121
 was-0.121
 opened-0.118
 entire-0.116
 un-0.116
Tokenes
Token ID274
Feature activation-1.16e+00

Pos logit contributions
 conclusions+0.081
orting+0.081
 mosquitoes+0.076
ican+0.074
 hawk+0.073
Neg logit contributions
 entire-0.053
 un-0.053
 was-0.053
 opened-0.051
 pack-0.049
Token.
Token ID13
Feature activation-6.63e-01

Pos logit contributions
 conclusions+0.050
orting+0.048
 mosquitoes+0.048
ican+0.048
 unnecessary+0.048
Neg logit contributions
 was-0.013
 with-0.012
 open-0.011
 un-0.011
 opened-0.010
Token They
Token ID1119
Feature activation-9.82e-01

Pos logit contributions
orting+0.030
 conclusions+0.030
 mosquitoes+0.030
 hawk+0.029
ede+0.029
Neg logit contributions
 un-0.004
 was-0.004
 open-0.003
 entire-0.003
 with-0.003
Token had
Token ID550
Feature activation-1.07e+00

Pos logit contributions
orting+0.037
 conclusions+0.037
 mosquitoes+0.037
aro+0.036
ican+0.035
Neg logit contributions
 entire-0.013
 un-0.013
 pack-0.012
 open-0.011
 opened-0.011
Token
Token ID198
Feature activation+2.46e-01

Pos logit contributions
 entire+0.005
 un+0.005
 pack+0.004
 open+0.004
 was+0.004
Neg logit contributions
orting-0.016
 conclusions-0.016
 mosquitoes-0.016
aro-0.015
ican-0.015
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
 un+0.003
 entire+0.003
 was+0.003
 pack+0.002
 open+0.002
Neg logit contributions
orting-0.010
 conclusions-0.010
 mosquitoes-0.010
ican-0.010
 hawk-0.009
Token"
Token ID1
Feature activation-1.85e-01

Pos logit contributions
 entire+0.002
 un+0.002
 pack+0.002
 open+0.002
 was+0.002
Neg logit contributions
orting-0.004
 conclusions-0.004
 mosquitoes-0.004
aro-0.003
ican-0.003
TokenI
Token ID40
Feature activation-7.50e-01

Pos logit contributions
orting+0.006
 conclusions+0.006
 mosquitoes+0.006
aro+0.006
ican+0.006
Neg logit contributions
 entire-0.004
 un-0.003
 pack-0.003
 open-0.003
 was-0.003
Token liked
Token ID8288
Feature activation-3.71e-01

Pos logit contributions
orting+0.025
 conclusions+0.025
 mosquitoes+0.024
aro+0.024
ican+0.023
Neg logit contributions
 entire-0.014
 un-0.014
 pack-0.013
 open-0.013
 opened-0.012
Token it
Token ID340
Feature activation+2.04e+00

Pos logit contributions
orting+0.013
 conclusions+0.013
ican+0.012
aro+0.012
crow+0.012
Neg logit contributions
 un-0.006
 entire-0.006
 open-0.006
 was-0.006
 pack-0.006
Token very
Token ID845
Feature activation+2.25e+00

Pos logit contributions
 un+0.047
 entire+0.047
 pack+0.046
 opened+0.044
 unlocked+0.043
Neg logit contributions
orting-0.055
 mosquitoes-0.054
 course-0.053
 conclusions-0.052
aro-0.050
Token much
Token ID881
Feature activation+4.51e-01

Pos logit contributions
 pack+0.058
 entire+0.057
 un+0.057
 opened+0.055
 unlocked+0.054
Neg logit contributions
 conclusions-0.053
orting-0.053
 course-0.052
 mosquitoes-0.052
bs-0.049
Token,
Token ID11
Feature activation+2.43e-01

Pos logit contributions
 entire+0.011
 un+0.011
 pack+0.011
 open+0.010
 was+0.010
Neg logit contributions
orting-0.012
 conclusions-0.012
 mosquitoes-0.012
aro-0.012
ican-0.011
Token mom
Token ID1995
Feature activation+9.80e-01

Pos logit contributions
 entire+0.005
 un+0.005
 pack+0.005
 open+0.005
 opened+0.005
Neg logit contributions
orting-0.007
 conclusions-0.007
 mosquitoes-0.007
aro-0.007
ican-0.007
Token,"
Token ID553
Feature activation-6.83e-01

Pos logit contributions
 entire+0.023
 un+0.022
 pack+0.021
 open+0.021
 opened+0.020
Neg logit contributions
orting-0.028
 conclusions-0.027
 mosquitoes-0.027
 course-0.026
aro-0.025
Token and
Token ID290
Feature activation+9.09e-02

Pos logit contributions
 entire+0.032
 un+0.030
 pack+0.030
 unlocked+0.027
 opened+0.027
Neg logit contributions
orting-0.058
 mosquitoes-0.056
 conclusions-0.056
bs-0.056
 course-0.055
Token had
Token ID550
Feature activation+1.64e-01

Pos logit contributions
 entire+0.002
 pack+0.001
 un+0.001
 whole+0.001
pack+0.001
Neg logit contributions
 mosquitoes-0.003
 conclusions-0.003
 course-0.003
orting-0.003
aro-0.003
Token a
Token ID257
Feature activation+3.97e-01

Pos logit contributions
 entire+0.003
 pack+0.003
 un+0.003
pack+0.003
load+0.003
Neg logit contributions
 course-0.005
 mosquitoes-0.005
orting-0.005
 conclusions-0.004
orns-0.004
Token great
Token ID1049
Feature activation+1.41e+00

Pos logit contributions
 entire+0.008
 un+0.008
 pack+0.007
 opened+0.007
 unlocked+0.007
Neg logit contributions
 mosquitoes-0.012
 conclusions-0.012
orting-0.012
 course-0.012
 hawk-0.011
Token time
Token ID640
Feature activation-7.28e-01

Pos logit contributions
 un+0.046
 unlocked+0.044
 opened+0.044
 entire+0.044
load+0.042
Neg logit contributions
 course-0.024
orting-0.024
 mosquitoes-0.023
 conclusions-0.022
ittens-0.020
Token at
Token ID379
Feature activation+2.03e+00

Pos logit contributions
orting+0.025
 conclusions+0.025
 mosquitoes+0.024
aro+0.024
ican+0.023
Neg logit contributions
 entire-0.013
 un-0.012
 pack-0.012
 open-0.011
 was-0.011
Token the
Token ID262
Feature activation+3.82e-01

Pos logit contributions
 entire+0.037
 un+0.034
 pack+0.034
 opened+0.033
 unlocked+0.033
Neg logit contributions
 course-0.064
 conclusions-0.063
 mosquitoes-0.061
orting-0.060
aro-0.058
Token picnic
Token ID35715
Feature activation+9.01e-01

Pos logit contributions
 entire+0.009
 un+0.009
 pack+0.009
 opened+0.009
 open+0.009
Neg logit contributions
 conclusions-0.010
orting-0.010
 mosquitoes-0.010
 course-0.010
ican-0.009
Token.
Token ID13
Feature activation+5.34e-03

Pos logit contributions
 entire+0.018
 un+0.017
 pack+0.017
 opened+0.016
 open+0.016
Neg logit contributions
orting-0.028
 conclusions-0.027
 mosquitoes-0.027
 course-0.027
aro-0.026
Token In
Token ID554
Feature activation+1.28e+00

Pos logit contributions
 entire+0.000
 un+0.000
 pack+0.000
 open+0.000
 opened+0.000
Neg logit contributions
orting-0.000
 conclusions-0.000
 mosquitoes-0.000
aro-0.000
 course-0.000
Token the
Token ID262
Feature activation+4.06e-01

Pos logit contributions
 entire+0.016
 un+0.015
 pack+0.014
 open+0.013
 opened+0.013
Neg logit contributions
orting-0.050
 conclusions-0.049
 mosquitoes-0.049
 course-0.048
aro-0.047
Token and
Token ID290
Feature activation+2.16e-01

Pos logit contributions
 entire+0.003
 un+0.003
 pack+0.003
 open+0.003
 was+0.003
Neg logit contributions
orting-0.008
 conclusions-0.008
 mosquitoes-0.007
aro-0.007
ican-0.007
Token tell
Token ID1560
Feature activation-1.46e+00

Pos logit contributions
 entire+0.004
 un+0.003
 pack+0.003
 whole+0.003
 open+0.003
Neg logit contributions
 conclusions-0.007
 mosquitoes-0.007
orting-0.007
aro-0.007
 course-0.007
Token it
Token ID340
Feature activation+5.49e-01

Pos logit contributions
orting+0.061
 hawk+0.060
ican+0.060
 conclusions+0.060
orns+0.060
Neg logit contributions
 un-0.016
 open-0.015
 was-0.015
 his-0.014
 put-0.014
Token how
Token ID703
Feature activation+2.12e-01

Pos logit contributions
 entire+0.014
 pack+0.012
 un+0.012
load+0.012
pack+0.011
Neg logit contributions
orting-0.015
 conclusions-0.015
 mosquitoes-0.014
 course-0.014
bs-0.014
Token pretty
Token ID2495
Feature activation+3.74e-02

Pos logit contributions
 entire+0.004
 pack+0.004
 un+0.004
 open+0.003
 opened+0.003
Neg logit contributions
 mosquitoes-0.007
 conclusions-0.007
orting-0.007
bs-0.007
aro-0.007
Token it
Token ID340
Feature activation+2.16e+00

Pos logit contributions
 entire+0.001
 un+0.001
 pack+0.001
 open+0.001
 opened+0.001
Neg logit contributions
orting-0.001
 conclusions-0.001
 mosquitoes-0.001
aro-0.001
ican-0.001
Token was
Token ID373
Feature activation+4.43e-01

Pos logit contributions
 entire+0.028
 pack+0.027
load+0.025
pack+0.025
 3+0.023
Neg logit contributions
 course-0.079
 mosquitoes-0.079
ede-0.077
orting-0.076
orns-0.074
Token.
Token ID13
Feature activation-6.38e-01

Pos logit contributions
 entire+0.008
 un+0.008
 pack+0.007
 open+0.007
 opened+0.007
Neg logit contributions
orting-0.015
 conclusions-0.015
 mosquitoes-0.015
aro-0.014
ican-0.014
Token And
Token ID843
Feature activation-2.17e-01

Pos logit contributions
 conclusions+0.023
orting+0.023
 mosquitoes+0.023
aro+0.022
 hawk+0.022
Neg logit contributions
 entire-0.010
 un-0.010
 pack-0.009
 was-0.009
 open-0.009
Token even
Token ID772
Feature activation-1.01e+00

Pos logit contributions
orting+0.006
 conclusions+0.006
 mosquitoes+0.006
aro+0.006
 course+0.006
Neg logit contributions
 entire-0.005
 un-0.005
 pack-0.005
 open-0.004
 opened-0.004
Token though
Token ID996
Feature activation-6.80e-02

Pos logit contributions
orting+0.026
 conclusions+0.026
 mosquitoes+0.025
aro+0.024
ican+0.024
Neg logit contributions
 entire-0.027
 un-0.027
 pack-0.026
 open-0.025
 was-0.025
Token Lily
Token ID20037
Feature activation+8.85e-01

Pos logit contributions
 entire+0.023
 un+0.021
 pack+0.020
 open+0.019
 unlocked+0.019
Neg logit contributions
 mosquitoes-0.025
 conclusions-0.024
orting-0.024
aro-0.023
 course-0.023
Token and
Token ID290
Feature activation-4.44e-01

Pos logit contributions
 entire+0.016
 whole+0.013
 pack+0.013
 un+0.012
load+0.012
Neg logit contributions
 mosquitoes-0.028
orting-0.027
bs-0.027
crow-0.027
 conclusions-0.027
Token her
Token ID607
Feature activation+4.12e-01

Pos logit contributions
 mosquitoes+0.013
orting+0.013
 conclusions+0.013
 hawk+0.012
bs+0.012
Neg logit contributions
 entire-0.010
 un-0.009
 whole-0.009
 pack-0.008
 opened-0.008
Token family
Token ID1641
Feature activation+1.08e-01

Pos logit contributions
 entire+0.007
 un+0.007
 pack+0.007
 open+0.006
 was+0.006
Neg logit contributions
orting-0.015
 conclusions-0.015
 mosquitoes-0.014
aro-0.014
ican-0.014
Token went
Token ID1816
Feature activation-4.90e-01

Pos logit contributions
 entire+0.002
 un+0.001
 pack+0.001
 whole+0.001
load+0.001
Neg logit contributions
orting-0.004
 course-0.003
crow-0.003
bs-0.003
 mosquitoes-0.003
Token on
Token ID319
Feature activation+2.25e+00

Pos logit contributions
orting+0.016
 mosquitoes+0.015
 conclusions+0.015
aro+0.014
ican+0.014
Neg logit contributions
 entire-0.010
 un-0.009
 pack-0.009
 opened-0.009
 was-0.009
Token a
Token ID257
Feature activation-6.63e-01

Pos logit contributions
 entire+0.024
 un+0.023
 pack+0.021
 opened+0.019
 was+0.018
Neg logit contributions
 mosquitoes-0.089
 conclusions-0.089
orting-0.088
 course-0.085
 hawk-0.084
Token trip
Token ID5296
Feature activation+7.39e-01

Pos logit contributions
orting+0.019
 conclusions+0.019
 mosquitoes+0.019
 course+0.018
aro+0.018
Neg logit contributions
 entire-0.014
 un-0.014
 pack-0.013
 opened-0.013
 open-0.013
Token to
Token ID284
Feature activation+2.16e+00

Pos logit contributions
 entire+0.015
 un+0.015
 pack+0.014
 opened+0.013
 gle+0.013
Neg logit contributions
orting-0.022
 course-0.019
 conclusions-0.019
bs-0.019
aro-0.019
Token a
Token ID257
Feature activation-5.05e-02

Pos logit contributions
 entire+0.040
acked+0.034
 contents+0.032
 opened+0.031
 unlocked+0.031
Neg logit contributions
 conclusions-0.073
 mosquitoes-0.064
 strangers-0.064
 course-0.063
orting-0.062
Token far
Token ID1290
Feature activation+2.10e-01

Pos logit contributions
 conclusions+0.002
orting+0.002
 mosquitoes+0.002
 hawk+0.002
 course+0.002
Neg logit contributions
 entire-0.001
 un-0.001
 pack-0.001
 opened-0.001
 was-0.001
Token took
Token ID1718
Feature activation-4.07e-01

Pos logit contributions
 entire+0.042
 pack+0.039
 un+0.038
 whole+0.035
load+0.035
Neg logit contributions
orting-0.060
 conclusions-0.059
 mosquitoes-0.058
bs-0.058
 course-0.058
Token time
Token ID640
Feature activation+1.26e-01

Pos logit contributions
orting+0.014
 conclusions+0.014
 mosquitoes+0.014
aro+0.014
 course+0.013
Neg logit contributions
 entire-0.007
 un-0.007
 pack-0.006
 open-0.006
 opened-0.006
Token and
Token ID290
Feature activation-6.70e-01

Pos logit contributions
 entire+0.002
 un+0.002
 pack+0.002
 open+0.002
 opened+0.002
Neg logit contributions
orting-0.004
 conclusions-0.004
 mosquitoes-0.004
aro-0.004
 course-0.004
Token patience
Token ID16336
Feature activation-1.08e+00

Pos logit contributions
orting+0.024
 conclusions+0.023
 mosquitoes+0.023
 course+0.022
aro+0.022
Neg logit contributions
 entire-0.011
 un-0.010
 pack-0.010
 whole-0.009
 open-0.009
Token to
Token ID284
Feature activation-5.15e-01

Pos logit contributions
 conclusions+0.042
 mosquitoes+0.040
ican+0.040
orting+0.040
 hawk+0.040
Neg logit contributions
 was-0.015
 un-0.015
 entire-0.015
 opened-0.014
 open-0.014
Token grow
Token ID1663
Feature activation+2.01e+00

Pos logit contributions
 conclusions+0.018
orting+0.018
 mosquitoes+0.017
 course+0.017
aro+0.017
Neg logit contributions
 entire-0.009
 un-0.007
 was-0.007
 whole-0.007
 pack-0.007
Token.
Token ID13
Feature activation-7.71e-01

Pos logit contributions
 entire+0.039
 un+0.038
 pack+0.037
 opened+0.034
 open+0.034
Neg logit contributions
orting-0.065
 conclusions-0.064
 mosquitoes-0.063
aro-0.061
 course-0.059
Token She
Token ID1375
Feature activation+7.33e-01

Pos logit contributions
 conclusions+0.027
orting+0.026
 mosquitoes+0.026
aro+0.026
 hawk+0.026
Neg logit contributions
 entire-0.013
 un-0.013
 was-0.012
 pack-0.012
 open-0.012
Token learned
Token ID4499
Feature activation-1.19e+00

Pos logit contributions
 entire+0.015
 pack+0.014
 whole+0.013
 un+0.013
 open+0.012
Neg logit contributions
 course-0.022
orting-0.022
 mosquitoes-0.022
 conclusions-0.021
 hawk-0.021
Token that
Token ID326
Feature activation-9.42e-01

Pos logit contributions
 conclusions+0.039
aro+0.038
orting+0.038
ican+0.037
 mosquitoes+0.037
Neg logit contributions
 was-0.024
 un-0.023
 a-0.023
 open-0.023
 entire-0.022
Token sometimes
Token ID3360
Feature activation-1.15e+00

Pos logit contributions
orting+0.032
 conclusions+0.031
ican+0.030
aro+0.030
 hawk+0.029
Neg logit contributions
 un-0.018
 was-0.018
 open-0.017
 entire-0.017
 pack-0.017
Token was
Token ID373
Feature activation+8.35e-01

Pos logit contributions
 entire+0.011
 pack+0.011
 un+0.011
pack+0.010
load+0.009
Neg logit contributions
 mosquitoes-0.025
 hawk-0.024
orting-0.024
bs-0.024
 course-0.024
Token warm
Token ID5814
Feature activation+1.82e+00

Pos logit contributions
 entire+0.013
 pack+0.013
 un+0.013
pack+0.012
 was+0.011
Neg logit contributions
 mosquitoes-0.028
orting-0.027
 hawk-0.027
 conclusions-0.026
 course-0.026
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
 un+0.033
 entire+0.032
 pack+0.032
pack+0.030
 unlocked+0.030
Neg logit contributions
orting-0.061
 mosquitoes-0.058
 course-0.057
 conclusions-0.056
 hawk-0.055
Token the
Token ID262
Feature activation+7.94e-01

Pos logit contributions
 entire+0.024
pack+0.023
 pack+0.022
 un+0.021
load+0.020
Neg logit contributions
 mosquitoes-0.045
 hawk-0.042
 course-0.042
 conclusions-0.041
aro-0.040
Token rain
Token ID6290
Feature activation+7.15e-01

Pos logit contributions
 entire+0.019
 un+0.019
 pack+0.018
 open+0.018
 opened+0.017
Neg logit contributions
orting-0.022
 conclusions-0.022
 mosquitoes-0.021
aro-0.020
ican-0.020
Token was
Token ID373
Feature activation+3.97e-01

Pos logit contributions
 un+0.010
 pack+0.010
 entire+0.010
pack+0.009
 open+0.008
Neg logit contributions
 mosquitoes-0.025
orting-0.025
 conclusions-0.024
bs-0.024
 hawk-0.024
Token soft
Token ID2705
Feature activation+1.31e+00

Pos logit contributions
 entire+0.006
 un+0.006
 pack+0.006
 was+0.005
 open+0.005
Neg logit contributions
 mosquitoes-0.014
orting-0.014
 conclusions-0.013
 hawk-0.013
aro-0.013
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
 un+0.026
 pack+0.026
 entire+0.025
pack+0.024
 unlocked+0.024
Neg logit contributions
 mosquitoes-0.040
orting-0.040
 conclusions-0.039
 course-0.037
 hawk-0.037
Token The
Token ID383
Feature activation+8.89e-01

Pos logit contributions
 entire+0.002
 un+0.002
 pack+0.002
 open+0.002
 whole+0.002
Neg logit contributions
orting-0.004
 conclusions-0.004
 mosquitoes-0.004
ican-0.004
aro-0.004
Token seed
Token ID9403
Feature activation+5.87e-01

Pos logit contributions
 entire+0.019
 un+0.018
 pack+0.017
 was+0.017
 open+0.017
Neg logit contributions
orting-0.027
 conclusions-0.027
 mosquitoes-0.027
 hawk-0.026
 course-0.025
Token became
Token ID2627
Feature activation+9.43e-01

Pos logit contributions
 entire+0.006
 un+0.006
 pack+0.006
 open+0.005
 whole+0.005
Neg logit contributions
orting-0.023
 conclusions-0.023
 mosquitoes-0.023
aro-0.023
 course-0.022
Token find
Token ID1064
Feature activation-7.84e-01

Pos logit contributions
 entire+0.034
rily+0.029
nie+0.028
 whole+0.028
 was+0.027
Neg logit contributions
 conclusions-0.046
 course-0.041
 hawk-0.040
 mosquitoes-0.040
 strangers-0.039
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
orting+0.032
 conclusions+0.032
 mosquitoes+0.031
aro+0.031
ican+0.031
Neg logit contributions
 entire-0.009
 un-0.009
 open-0.008
 pack-0.008
 opened-0.007
Token treasure
Token ID14068
Feature activation-1.24e+00

Pos logit contributions
orting+0.043
 conclusions+0.042
 mosquitoes+0.041
aro+0.041
ican+0.040
Neg logit contributions
 entire-0.013
 un-0.013
 pack-0.013
 open-0.013
 was-0.012
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 conclusions+0.044
orting+0.043
 mosquitoes+0.043
aro+0.041
ican+0.041
Neg logit contributions
 un-0.021
 entire-0.021
 open-0.020
 pack-0.020
 was-0.020
Token As
Token ID1081
Feature activation-3.13e-01

Pos logit contributions
orting+0.032
 conclusions+0.032
 mosquitoes+0.031
 course+0.030
aro+0.030
Neg logit contributions
 entire-0.021
 un-0.020
 pack-0.019
 open-0.018
 opened-0.018
Token she
Token ID673
Feature activation+9.27e-02

Pos logit contributions
orting+0.010
 conclusions+0.010
ican+0.010
 mosquitoes+0.009
aro+0.009
Neg logit contributions
 was-0.007
 un-0.007
 open-0.007
 opened-0.006
 entire-0.006
Token walked
Token ID6807
Feature activation-1.09e+00

Pos logit contributions
 entire+0.002
 whole+0.002
 pack+0.001
 alive+0.001
 un+0.001
Neg logit contributions
 course-0.003
orting-0.003
aro-0.003
orns-0.003
 hawk-0.003
Token,
Token ID11
Feature activation-5.19e-01

Pos logit contributions
 conclusions+0.035
 mosquitoes+0.033
orting+0.033
aro+0.032
ican+0.032
Neg logit contributions
 entire-0.023
 un-0.023
 open-0.022
 pack-0.022
 was-0.022
Token she
Token ID673
Feature activation-3.80e-01

Pos logit contributions
orting+0.016
 conclusions+0.015
aro+0.014
ican+0.014
 mosquitoes+0.014
Neg logit contributions
 was-0.012
 entire-0.012
 un-0.012
 opened-0.011
 open-0.011
Token saw
Token ID2497
Feature activation-1.01e-01

Pos logit contributions
orting+0.015
 conclusions+0.015
 mosquitoes+0.014
aro+0.014
ican+0.014
Neg logit contributions
 entire-0.005
 un-0.005
 pack-0.005
 was-0.004
 opened-0.004
Token a
Token ID257
Feature activation-9.50e-01

Pos logit contributions
orting+0.005
 conclusions+0.005
aro+0.005
leness+0.005
ican+0.005
Neg logit contributions
 was-0.000
 open-0.000
 un-0.000
 a-0.000
 entire-0.000
Token He
Token ID679
Feature activation+8.67e-01

Pos logit contributions
 entire+0.015
 pack+0.013
 un+0.012
 whole+0.012
 open+0.011
Neg logit contributions
orting-0.040
 conclusions-0.038
 mosquitoes-0.037
crow-0.037
aro-0.037
Token shouted
Token ID17293
Feature activation-7.68e-02

Pos logit contributions
 entire+0.016
 pack+0.014
 whole+0.014
 un+0.013
pack+0.012
Neg logit contributions
bs-0.027
orting-0.027
 hawk-0.027
aro-0.027
 mosquitoes-0.026
Token out
Token ID503
Feature activation+2.83e+00

Pos logit contributions
orting+0.002
 conclusions+0.002
 mosquitoes+0.002
aro+0.002
ican+0.002
Neg logit contributions
 entire-0.002
 un-0.002
 pack-0.002
 opened-0.001
 whole-0.001
Token loud
Token ID7812
Feature activation-2.71e-01

Pos logit contributions
 entire+0.061
 pack+0.055
 un+0.053
pack+0.051
 whole+0.050
Neg logit contributions
 mosquitoes-0.083
orting-0.081
 conclusions-0.079
aro-0.079
 hawk-0.078
Token,
Token ID11
Feature activation-5.74e-01

Pos logit contributions
orting+0.009
 conclusions+0.009
 mosquitoes+0.009
aro+0.009
ican+0.009
Neg logit contributions
 entire-0.005
 un-0.005
 pack-0.004
 open-0.004
 opened-0.004
Token "
Token ID366
Feature activation-7.00e-02

Pos logit contributions
orting+0.020
 conclusions+0.020
 mosquitoes+0.019
aro+0.019
 hawk+0.019
Neg logit contributions
 entire-0.010
 un-0.009
 pack-0.009
 open-0.008
 whole-0.008
TokenTom
Token ID13787
Feature activation+6.56e-01

Pos logit contributions
 conclusions+0.002
orting+0.002
 mosquitoes+0.002
 course+0.002
ican+0.002
Neg logit contributions
 entire-0.001
 un-0.001
 pack-0.001
 open-0.001
 opened-0.001
Token,
Token ID11
Feature activation+1.90e-01

Pos logit contributions
 entire+0.018
 un+0.016
 pack+0.016
 whole+0.015
 open+0.015
Neg logit contributions
orting-0.016
 course-0.015
 mosquitoes-0.015
 hawk-0.015
aro-0.014
Token let
Token ID1309
Feature activation-9.33e-01

Pos logit contributions
 entire+0.004
 un+0.003
 pack+0.003
 opened+0.003
 open+0.003
Neg logit contributions
orting-0.006
 conclusions-0.006
 mosquitoes-0.006
aro-0.006
 course-0.006
Token's
Token ID338
Feature activation+5.88e-01

Pos logit contributions
 mosquitoes+0.029
orting+0.029
 course+0.029
 conclusions+0.028
crow+0.028
Neg logit contributions
 entire-0.019
 pack-0.016
 un-0.016
 opened-0.016
 unlocked-0.015
Token ignore
Token ID8856
Feature activation-8.65e-01

Pos logit contributions
 entire+0.010
 un+0.009
 pack+0.009
 opened+0.009
 was+0.009
Neg logit contributions
 conclusions-0.020
orting-0.020
 mosquitoes-0.019
aro-0.019
 hawk-0.019
Token went
Token ID1816
Feature activation-6.45e-01

Pos logit contributions
 entire+0.003
 pack+0.002
 un+0.002
 whole+0.002
 open+0.002
Neg logit contributions
 mosquitoes-0.006
 conclusions-0.006
orting-0.006
aro-0.006
 course-0.006
Token back
Token ID736
Feature activation-2.07e-01

Pos logit contributions
orting+0.024
 conclusions+0.023
 mosquitoes+0.023
aro+0.022
 course+0.022
Neg logit contributions
 entire-0.010
 un-0.009
 pack-0.009
 was-0.008
 opened-0.008
Token and
Token ID290
Feature activation-1.32e+00

Pos logit contributions
orting+0.006
 mosquitoes+0.006
 conclusions+0.006
aro+0.006
bs+0.006
Neg logit contributions
 entire-0.005
 un-0.004
 pack-0.004
 whole-0.004
 opened-0.004
Token sat
Token ID3332
Feature activation+6.45e-01

Pos logit contributions
orting+0.044
 conclusions+0.042
 mosquitoes+0.042
aro+0.041
 course+0.040
Neg logit contributions
 entire-0.026
 un-0.023
 pack-0.022
 whole-0.021
 open-0.021
Token in
Token ID287
Feature activation+1.78e+00

Pos logit contributions
 entire+0.010
 un+0.010
 pack+0.010
 opened+0.009
 open+0.009
Neg logit contributions
orting-0.023
 conclusions-0.022
 mosquitoes-0.022
aro-0.021
 course-0.021
Token her
Token ID607
Feature activation-6.62e-01

Pos logit contributions
 entire+0.023
 un+0.022
 pack+0.021
 open+0.019
 opened+0.018
Neg logit contributions
 conclusions-0.069
orting-0.068
 mosquitoes-0.067
 course-0.066
aro-0.066
Token chair
Token ID5118
Feature activation-5.10e-01

Pos logit contributions
orting+0.021
 conclusions+0.021
 mosquitoes+0.021
aro+0.020
ican+0.020
Neg logit contributions
 entire-0.013
 un-0.013
 pack-0.012
 open-0.012
 opened-0.012
Token,
Token ID11
Feature activation-1.40e+00

Pos logit contributions
 conclusions+0.021
 mosquitoes+0.020
orting+0.020
aro+0.020
ican+0.019
Neg logit contributions
 entire-0.006
 un-0.006
 open-0.006
 pack-0.006
 opened-0.006
Token happy
Token ID3772
Feature activation+1.43e-01

Pos logit contributions
orting+0.044
 mosquitoes+0.043
 conclusions+0.043
aro+0.042
 course+0.041
Neg logit contributions
 entire-0.027
 un-0.025
 pack-0.024
 whole-0.022
 open-0.022
Token that
Token ID326
Feature activation-3.94e-01

Pos logit contributions
 entire+0.002
 un+0.002
 pack+0.002
 open+0.002
 opened+0.002
Neg logit contributions
 conclusions-0.005
orting-0.005
 mosquitoes-0.005
aro-0.005
 course-0.005
Token the
Token ID262
Feature activation+5.71e-01

Pos logit contributions
orting+0.012
 conclusions+0.012
 mosquitoes+0.012
ican+0.012
aro+0.011
Neg logit contributions
 un-0.008
 entire-0.008
 was-0.008
 pack-0.008
 open-0.008
Token smiled
Token ID13541
Feature activation+2.38e+00

Pos logit contributions
orting+0.001
 course+0.001
 mosquitoes+0.001
bs+0.001
 conclusions+0.001
Neg logit contributions
 entire-0.001
 pack-0.001
 whole-0.001
 un-0.001
 open-0.001
Token at
Token ID379
Feature activation+1.19e+00

Pos logit contributions
 pack+0.060
pack+0.060
 entire+0.060
load+0.059
 unlocked+0.058
Neg logit contributions
orting-0.055
 anymore-0.050
 mosquitoes-0.046
 conclusions-0.046
bs-0.046
Token Lucy
Token ID22162
Feature activation+1.47e+00

Pos logit contributions
 entire+0.020
 pack+0.020
 un+0.019
 opened+0.017
 open+0.017
Neg logit contributions
 course-0.041
 conclusions-0.041
aro-0.040
 mosquitoes-0.040
orting-0.039
Token,
Token ID11
Feature activation-4.85e-01

Pos logit contributions
 entire+0.032
 un+0.029
 pack+0.028
 whole+0.027
 unlocked+0.027
Neg logit contributions
orting-0.044
 mosquitoes-0.039
 conclusions-0.039
 course-0.039
bs-0.039
Token asking
Token ID4737
Feature activation-5.31e-01

Pos logit contributions
orting+0.016
 mosquitoes+0.016
 conclusions+0.015
 course+0.015
aro+0.015
Neg logit contributions
 entire-0.009
 pack-0.008
 un-0.008
 whole-0.008
 open-0.008
Token her
Token ID607
Feature activation+6.05e-01

Pos logit contributions
 conclusions+0.019
ican+0.019
orting+0.019
 mosquitoes+0.018
 hawk+0.018
Neg logit contributions
 un-0.009
 was-0.009
 open-0.009
 pack-0.008
 entire-0.008
Token to
Token ID284
Feature activation+1.42e-01

Pos logit contributions
 un+0.011
 pack+0.010
 was+0.010
 open+0.010
 entire+0.010
Neg logit contributions
orting-0.021
 conclusions-0.021
 mosquitoes-0.020
ican-0.020
 hawk-0.020
Token go
Token ID467
Feature activation+5.96e-01

Pos logit contributions
 entire+0.002
 un+0.002
 pack+0.002
 open+0.002
 opened+0.002
Neg logit contributions
 conclusions-0.005
orting-0.005
 mosquitoes-0.005
aro-0.005
 course-0.005
Token first
Token ID717
Feature activation+5.12e-01

Pos logit contributions
 entire+0.012
 un+0.010
 opened+0.010
 pack+0.010
 whole+0.010
Neg logit contributions
orting-0.019
 conclusions-0.018
 course-0.017
aro-0.017
 mosquitoes-0.017
Token.
Token ID13
Feature activation+7.33e-01

Pos logit contributions
 entire+0.010
 un+0.010
 pack+0.009
 opened+0.009
 open+0.009
Neg logit contributions
orting-0.016
 conclusions-0.016
 mosquitoes-0.015
 course-0.015
aro-0.015
Token She
Token ID1375
Feature activation+3.44e-01

Pos logit contributions
 entire+0.016
 pack+0.014
 whole+0.013
 un+0.013
 packed+0.013
Neg logit contributions
orting-0.020
 mosquitoes-0.019
crow-0.019
aro-0.019
 conclusions-0.019
Token
Token ID6184
Feature activation-1.08e+00

Pos logit contributions
 conclusions+0.051
orting+0.050
orns+0.048
ican+0.047
 mosquitoes+0.047
Neg logit contributions
 was-0.029
 un-0.029
 open-0.027
 opened-0.027
 pack-0.027
Token
Token ID95
Feature activation+6.01e-01

Pos logit contributions
orting+0.021
 conclusions+0.020
 mosquitoes+0.019
ican+0.019
aro+0.019
Neg logit contributions
 entire-0.035
 un-0.034
 pack-0.034
 was-0.033
 opened-0.033
Token
Token ID26391
Feature activation-3.28e-01

Pos logit contributions
 un+0.014
 opened+0.013
 pack+0.013
 entire+0.013
 open+0.013
Neg logit contributions
orting-0.018
 course-0.016
ican-0.016
 conclusions-0.015
ede-0.015
Token
Token ID129
Feature activation-4.84e-01

Pos logit contributions
orting+0.008
 mosquitoes+0.007
bs+0.007
 course+0.007
ittens+0.007
Neg logit contributions
 entire-0.009
 un-0.009
 open-0.008
 pack-0.008
 opened-0.008
Token
Token ID241
Feature activation+4.16e-01

Pos logit contributions
orting+0.014
 conclusions+0.014
 mosquitoes+0.013
ican+0.013
aro+0.013
Neg logit contributions
 entire-0.011
 un-0.011
 pack-0.010
 opened-0.010
 was-0.010
TokenNo
Token ID2949
Feature activation-1.34e+00

Pos logit contributions
 un+0.009
 entire+0.008
 pack+0.008
 open+0.008
 opened+0.008
Neg logit contributions
 conclusions-0.013
 mosquitoes-0.013
orting-0.012
aro-0.012
 course-0.012
Token,
Token ID11
Feature activation-1.67e-01

Pos logit contributions
 conclusions+0.044
orting+0.043
 mosquitoes+0.042
aro+0.041
ican+0.041
Neg logit contributions
 entire-0.026
 un-0.026
 pack-0.025
 was-0.025
 open-0.024
Token we
Token ID356
Feature activation-1.10e+00

Pos logit contributions
orting+0.006
 conclusions+0.006
 mosquitoes+0.006
aro+0.005
ican+0.005
Neg logit contributions
 entire-0.003
 un-0.003
 pack-0.003
 open-0.003
 was-0.003
Token can
Token ID460
Feature activation+1.16e-01

Pos logit contributions
 conclusions+0.035
orting+0.035
 mosquitoes+0.034
aro+0.034
ican+0.033
Neg logit contributions
 entire-0.022
 un-0.022
 pack-0.021
 open-0.020
 was-0.020
Tokenâ
Token ID22940
Feature activation-5.01e-01

Pos logit contributions
 entire+0.003
 un+0.003
 pack+0.003
 open+0.003
 was+0.003
Neg logit contributions
orting-0.003
 conclusions-0.003
 mosquitoes-0.003
aro-0.003
ican-0.003
Token
Token ID26391
Feature activation-5.65e-01

Pos logit contributions
orting+0.013
 conclusions+0.013
 mosquitoes+0.012
 course+0.012
aro+0.012
Neg logit contributions
 entire-0.013
 un-0.013
 pack-0.012
 opened-0.012
 open-0.012
Token!
Token ID0
Feature activation+4.67e-01

Pos logit contributions
orting+0.006
 conclusions+0.006
 mosquitoes+0.006
aro+0.005
 course+0.005
Neg logit contributions
 entire-0.004
 un-0.004
 pack-0.004
 opened-0.003
 open-0.003
Token<|endoftext|>
Token ID50256
Feature activation+1.19e+00

Pos logit contributions
 entire+0.009
 un+0.008
 pack+0.008
 open+0.008
 opened+0.008
Neg logit contributions
orting-0.016
 conclusions-0.015
 course-0.015
aro-0.015
 mosquitoes-0.014
TokenOnce
Token ID7454
Feature activation-1.89e-01

Pos logit contributions
 entire+0.030
 un+0.030
 pack+0.030
 open+0.029
 opened+0.028
Neg logit contributions
orting-0.030
 course-0.028
 conclusions-0.028
orns-0.028
crow-0.028
Token upon
Token ID2402
Feature activation+2.27e+00

Pos logit contributions
 conclusions+0.007
orting+0.007
 mosquitoes+0.007
aro+0.007
ican+0.007
Neg logit contributions
 entire-0.002
 un-0.002
 pack-0.002
 was-0.002
 open-0.002
Token a
Token ID257
Feature activation-8.54e-01

Pos logit contributions
load+0.043
boxing+0.042
 entire+0.040
 un+0.039
loads+0.039
Neg logit contributions
 mosquitoes-0.067
 conclusions-0.067
 course-0.065
 nightmares-0.061
 hawk-0.058
Token time
Token ID640
Feature activation-1.54e+00

Pos logit contributions
orting+0.022
 conclusions+0.022
 mosquitoes+0.021
aro+0.021
ican+0.020
Neg logit contributions
 entire-0.022
 un-0.022
 pack-0.021
 open-0.021
 was-0.020
Token,
Token ID11
Feature activation-5.94e-02

Pos logit contributions
 conclusions+0.046
orting+0.046
 mosquitoes+0.044
ican+0.042
aro+0.042
Neg logit contributions
 un-0.034
 entire-0.034
 was-0.032
 pack-0.032
 open-0.032
Token there
Token ID612
Feature activation+4.41e-01

Pos logit contributions
orting+0.002
 mosquitoes+0.002
 conclusions+0.002
aro+0.002
 course+0.002
Neg logit contributions
 entire-0.001
 pack-0.001
 un-0.001
load-0.001
 open-0.001
Token was
Token ID373
Feature activation+3.45e-01

Pos logit contributions
 entire+0.006
 pack+0.006
load+0.005
 un+0.005
pack+0.005
Neg logit contributions
 course-0.017
 conclusions-0.016
 mosquitoes-0.015
crow-0.015
orting-0.015
Token a
Token ID257
Feature activation+9.61e-02

Pos logit contributions
 un+0.003
 entire+0.003
 pack+0.003
 opened+0.003
 unlocked+0.003
Neg logit contributions
 mosquitoes-0.014
 conclusions-0.014
orting-0.014
 course-0.014
 hawk-0.013
Token little
Token ID1310
Feature activation+1.01e-02

Pos logit contributions
 entire+0.001
 un+0.001
 pack+0.001
 opened+0.001
 unlocked+0.001
Neg logit contributions
 mosquitoes-0.004
 conclusions-0.004
orting-0.004
 hawk-0.004
ican-0.003
Token had
Token ID550
Feature activation-4.24e-01

Pos logit contributions
orting+0.061
 conclusions+0.060
 mosquitoes+0.057
ican+0.056
aro+0.056
Neg logit contributions
 un-0.031
 entire-0.030
 was-0.030
 pack-0.029
 opened-0.028
Token so
Token ID523
Feature activation-1.05e+00

Pos logit contributions
 mosquitoes+0.016
 course+0.015
 conclusions+0.014
orting+0.014
 hawk+0.014
Neg logit contributions
 entire-0.006
 pack-0.006
 un-0.006
load-0.005
pack-0.005
Token much
Token ID881
Feature activation-1.50e+00

Pos logit contributions
orting+0.035
 conclusions+0.035
 mosquitoes+0.034
ican+0.033
aro+0.033
Neg logit contributions
 entire-0.020
 un-0.019
 pack-0.018
 was-0.018
 open-0.018
Token energy
Token ID2568
Feature activation-6.00e-01

Pos logit contributions
aro+0.056
 conclusions+0.056
ican+0.056
orting+0.055
 hawk+0.054
Neg logit contributions
 un-0.024
 was-0.023
 entire-0.023
 open-0.023
 whole-0.021
Token to
Token ID284
Feature activation-6.68e-01

Pos logit contributions
 conclusions+0.023
orting+0.022
 mosquitoes+0.021
ican+0.021
aro+0.021
Neg logit contributions
 un-0.009
 entire-0.009
 was-0.009
 pack-0.009
 open-0.008
Token spare
Token ID13952
Feature activation-1.15e+00

Pos logit contributions
orting+0.025
 mosquitoes+0.024
 conclusions+0.024
aro+0.024
 hawk+0.023
Neg logit contributions
 un-0.011
 pack-0.010
 entire-0.010
 open-0.010
 put-0.009
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
orting+0.042
ican+0.041
 conclusions+0.040
aro+0.040
 hawk+0.040
Neg logit contributions
 un-0.018
 entire-0.017
 open-0.017
 pack-0.017
 was-0.017
Token She
Token ID1375
Feature activation-1.89e-02

Pos logit contributions
 mosquitoes+0.060
 conclusions+0.059
orting+0.059
 hawk+0.058
aro+0.058
Neg logit contributions
 was-0.016
 un-0.015
 entire-0.014
 pack-0.013
 open-0.013
Token loved
Token ID6151
Feature activation-1.47e+00

Pos logit contributions
orting+0.001
 conclusions+0.001
 mosquitoes+0.001
 course+0.001
aro+0.001
Neg logit contributions
 entire-0.000
 un-0.000
 pack-0.000
 open-0.000
 whole-0.000
Token to
Token ID284
Feature activation-5.66e-01

Pos logit contributions
orting+0.054
 conclusions+0.054
aro+0.051
ican+0.051
 hawk+0.050
Neg logit contributions
 un-0.023
 entire-0.022
 pack-0.021
 was-0.021
 open-0.021
Token run
Token ID1057
Feature activation-7.00e-01

Pos logit contributions
orting+0.020
 conclusions+0.020
 mosquitoes+0.020
aro+0.019
 hawk+0.019
Neg logit contributions
 un-0.010
 pack-0.009
 entire-0.009
 open-0.009
 was-0.009
Token "
Token ID366
Feature activation-4.61e-01

Pos logit contributions
 un+0.007
 entire+0.007
 was+0.007
 pack+0.006
 opened+0.006
Neg logit contributions
 conclusions-0.031
orting-0.031
 mosquitoes-0.030
ican-0.029
aro-0.029
TokenIt
Token ID1026
Feature activation+3.34e-01

Pos logit contributions
orting+0.015
 conclusions+0.015
 mosquitoes+0.014
aro+0.014
 course+0.014
Neg logit contributions
 entire-0.009
 un-0.009
 was-0.009
 pack-0.008
 open-0.008
Token's
Token ID338
Feature activation+3.46e-01

Pos logit contributions
 entire+0.005
 un+0.005
 pack+0.005
 whole+0.004
 open+0.004
Neg logit contributions
orting-0.012
 mosquitoes-0.012
 conclusions-0.012
aro-0.012
 hawk-0.011
Token not
Token ID407
Feature activation-2.87e-01

Pos logit contributions
 entire+0.005
 un+0.005
 pack+0.005
 open+0.005
 was+0.005
Neg logit contributions
 conclusions-0.013
orting-0.013
 mosquitoes-0.012
aro-0.012
ican-0.012
Token safe
Token ID3338
Feature activation+8.33e-01

Pos logit contributions
 mosquitoes+0.010
orting+0.010
 conclusions+0.009
 violence+0.009
 hawk+0.009
Neg logit contributions
 entire-0.005
 un-0.004
 opened-0.004
 pack-0.004
 was-0.004
Token to
Token ID284
Feature activation-2.19e+00

Pos logit contributions
 entire+0.016
 un+0.016
 pack+0.015
 opened+0.014
 open+0.014
Neg logit contributions
orting-0.027
 conclusions-0.026
 mosquitoes-0.026
aro-0.025
 hawk-0.024
Token go
Token ID467
Feature activation-6.94e-01

Pos logit contributions
orting+0.088
 narrative+0.084
aro+0.083
orns+0.082
ican+0.081
Neg logit contributions
 open-0.035
 un-0.035
 pack-0.035
 put-0.032
 was-0.029
Token near
Token ID1474
Feature activation-1.88e-01

Pos logit contributions
orting+0.025
 conclusions+0.025
 mosquitoes+0.024
 course+0.024
aro+0.024
Neg logit contributions
 un-0.012
 entire-0.011
 pack-0.011
 open-0.011
 was-0.010
Token the
Token ID262
Feature activation-9.73e-01

Pos logit contributions
orting+0.008
 conclusions+0.008
 mosquitoes+0.007
 narrative+0.007
ican+0.007
Neg logit contributions
 un-0.002
 entire-0.002
 open-0.002
 pack-0.002
 was-0.002
Token z
Token ID1976
Feature activation-2.66e+00

Pos logit contributions
orting+0.024
 conclusions+0.024
 mosquitoes+0.023
aro+0.022
ican+0.022
Neg logit contributions
 entire-0.027
 un-0.026
 pack-0.025
 open-0.025
 was-0.025
Tokenebra
Token ID37052
Feature activation-1.99e+00

Pos logit contributions
 conclusions+0.047
orting+0.046
 mosquitoes+0.046
 violence+0.041
 caution+0.041
Neg logit contributions
 entire-0.092
 un-0.088
 pack-0.088
 open-0.085
 whole-0.084
Token listen
Token ID6004
Feature activation-7.49e-01

Pos logit contributions
orting+0.059
 conclusions+0.058
 mosquitoes+0.056
 course+0.055
 hawk+0.054
Neg logit contributions
 entire-0.033
 un-0.030
 pack-0.028
 opened-0.028
 whole-0.027
Token to
Token ID284
Feature activation-1.03e+00

Pos logit contributions
 conclusions+0.029
orting+0.028
ican+0.028
aro+0.028
 hawk+0.028
Neg logit contributions
 entire-0.011
 un-0.010
 was-0.010
 open-0.010
 pack-0.009
Token her
Token ID607
Feature activation+4.07e-01

Pos logit contributions
ican+0.044
orting+0.043
aro+0.043
 hawk+0.042
 conclusions+0.041
Neg logit contributions
 un-0.013
 open-0.012
 pack-0.011
 put-0.011
 entire-0.010
Token mom
Token ID1995
Feature activation-5.38e-01

Pos logit contributions
 un+0.005
 open+0.005
 pack+0.005
 entire+0.005
 was+0.005
Neg logit contributions
ican-0.017
orting-0.017
 course-0.017
aro-0.017
 conclusions-0.017
Token and
Token ID290
Feature activation-1.35e+00

Pos logit contributions
 conclusions+0.022
aro+0.022
 mosquitoes+0.022
ican+0.021
 hawk+0.021
Neg logit contributions
 un-0.006
 entire-0.006
 was-0.006
 pack-0.006
 open-0.006
Token trust
Token ID3774
Feature activation-1.99e+00

Pos logit contributions
orting+0.046
 conclusions+0.046
 mosquitoes+0.045
aro+0.044
ican+0.043
Neg logit contributions
 entire-0.024
 un-0.024
 pack-0.023
 open-0.022
 was-0.021
Token her
Token ID607
Feature activation+2.49e-01

Pos logit contributions
ican+0.084
orting+0.083
 hawk+0.080
 inconvenience+0.079
aro+0.078
Neg logit contributions
 was-0.025
 un-0.023
 open-0.021
 with-0.020
 his-0.019
Token decisions
Token ID5370
Feature activation-1.36e+00

Pos logit contributions
 un+0.004
 entire+0.004
 open+0.004
 pack+0.004
 was+0.004
Neg logit contributions
ican-0.010
orting-0.010
aro-0.010
 conclusions-0.010
orns-0.009
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
 conclusions+0.054
ican+0.054
 hawk+0.052
 mosquitoes+0.052
aro+0.051
Neg logit contributions
 un-0.019
 was-0.019
 open-0.017
 entire-0.017
 with-0.017
Token<|endoftext|>
Token ID50256
Feature activation+7.50e-01

Pos logit contributions
orting+0.044
 conclusions+0.043
 mosquitoes+0.042
aro+0.041
ican+0.041
Neg logit contributions
 entire-0.018
 un-0.017
 pack-0.016
 open-0.016
 opened-0.015
TokenOnce
Token ID7454
Feature activation-6.48e-01

Pos logit contributions
 entire+0.014
 un+0.014
 pack+0.013
 open+0.012
 opened+0.012
Neg logit contributions
orting-0.024
 conclusions-0.024
 mosquitoes-0.023
 course-0.023
orns-0.023
Token teeth
Token ID9941
Feature activation-5.79e-01

Pos logit contributions
orting+0.030
 conclusions+0.029
 mosquitoes+0.028
 course+0.028
ican+0.028
Neg logit contributions
 entire-0.014
 un-0.014
 pack-0.013
 was-0.013
 open-0.013
Token.
Token ID13
Feature activation+1.21e-01

Pos logit contributions
 conclusions+0.022
orting+0.022
ican+0.022
 mosquitoes+0.022
 hawk+0.022
Neg logit contributions
 un-0.008
 was-0.008
 open-0.008
 entire-0.007
 opened-0.007
Token
Token ID198
Feature activation+3.72e-01

Pos logit contributions
 un+0.001
 entire+0.001
 pack+0.001
 was+0.001
 open+0.001
Neg logit contributions
orting-0.005
 conclusions-0.005
 mosquitoes-0.005
ican-0.005
aro-0.005
Token
Token ID198
Feature activation+7.61e-01

Pos logit contributions
 un+0.004
 entire+0.004
 was+0.004
 open+0.004
 pack+0.004
Neg logit contributions
orting-0.015
 conclusions-0.015
 mosquitoes-0.015
ican-0.015
 hawk-0.014
Token"
Token ID1
Feature activation-2.33e-01

Pos logit contributions
 entire+0.013
 pack+0.012
 un+0.012
 open+0.011
 opened+0.011
Neg logit contributions
 conclusions-0.026
orting-0.026
 mosquitoes-0.026
aro-0.025
 course-0.025
TokenRo
Token ID15450
Feature activation-4.56e+00

Pos logit contributions
 conclusions+0.006
orting+0.006
 mosquitoes+0.006
aro+0.005
ican+0.005
Neg logit contributions
 entire-0.006
 un-0.006
 pack-0.006
 open-0.006
 opened-0.006
Tokenar
Token ID283
Feature activation-5.88e-01

Pos logit contributions
 narrative+0.114
 conclusions+0.112
 mosquitoes+0.109
 violence+0.109
 hawk+0.106
Neg logit contributions
 was-0.108
 un-0.108
 pack-0.108
 open-0.106
 with-0.105
Token!"
Token ID2474
Feature activation-5.89e-02

Pos logit contributions
orting+0.021
 conclusions+0.020
 mosquitoes+0.020
aro+0.020
ican+0.019
Neg logit contributions
 entire-0.010
 un-0.010
 pack-0.009
 open-0.009
 opened-0.009
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
orting+0.002
 conclusions+0.002
 mosquitoes+0.002
ican+0.002
aro+0.002
Neg logit contributions
 un-0.001
 entire-0.001
 was-0.001
 open-0.001
 opened-0.001
Token le
Token ID443
Feature activation-3.73e+00

Pos logit contributions
orting+0.049
 conclusions+0.049
 mosquitoes+0.046
aro+0.045
bs+0.044
Neg logit contributions
 entire-0.051
 un-0.050
 pack-0.049
 was-0.048
 open-0.048
Tokenopard
Token ID15478
Feature activation-1.45e+00

Pos logit contributions
 conclusions+0.083
orting+0.074
 mosquitoes+0.074
 caution+0.073
 nightmares+0.072
Neg logit contributions
 was-0.109
 entire-0.108
 un-0.105
 with-0.103
 pack-0.102
Token took
Token ID1718
Feature activation-2.06e-01

Pos logit contributions
orting+0.041
 conclusions+0.041
 mosquitoes+0.039
ican+0.038
aro+0.038
Neg logit contributions
 un-0.017
 entire-0.016
 pack-0.016
 opened-0.016
 open-0.016
Token his
Token ID465
Feature activation-6.20e-01

Pos logit contributions
 conclusions+0.007
orting+0.007
 mosquitoes+0.007
aro+0.007
 course+0.007
Neg logit contributions
 entire-0.004
 un-0.003
 pack-0.003
 opened-0.003
 whole-0.003
Token hat
Token ID6877
Feature activation-1.35e+00

Pos logit contributions
orting+0.020
 conclusions+0.020
 mosquitoes+0.019
aro+0.019
ican+0.019
Neg logit contributions
 entire-0.012
 un-0.012
 pack-0.012
 open-0.011
 opened-0.011
Token from
Token ID422
Feature activation+9.91e-02

Pos logit contributions
 conclusions+0.058
 mosquitoes+0.056
orting+0.056
aro+0.055
ican+0.054
Neg logit contributions
 un-0.014
 entire-0.013
 open-0.013
 pack-0.013
 opened-0.012
Token the
Token ID262
Feature activation-1.85e+00

Pos logit contributions
 un+0.001
 open+0.001
 entire+0.001
 pack+0.001
 was+0.001
Neg logit contributions
orting-0.004
 conclusions-0.004
unn-0.004
 mosquitoes-0.004
aro-0.004
Token ostr
Token ID42387
Feature activation-4.37e+00

Pos logit contributions
orting+0.059
 conclusions+0.054
aro+0.052
orns+0.052
unn+0.052
Neg logit contributions
 entire-0.043
 open-0.042
 un-0.041
 pack-0.040
 whole-0.040
Tokenich
Token ID488
Feature activation-2.40e+00

Pos logit contributions
 conclusions+0.135
 mosquitoes+0.130
orting+0.130
 violence+0.126
 course+0.122
Neg logit contributions
 entire-0.091
 un-0.082
 was-0.082
 whole-0.080
 opened-0.078
Token's
Token ID338
Feature activation-2.62e+00

Pos logit contributions
 conclusions+0.106
 mosquitoes+0.104
 narrative+0.103
orting+0.102
obb+0.101
Neg logit contributions
 open-0.024
 un-0.023
 was-0.022
 with-0.021
 opened-0.020
Token be
Token ID307
Feature activation-8.36e-01

Pos logit contributions
orting+0.101
 conclusions+0.099
 mosquitoes+0.098
ican+0.096
aro+0.095
Neg logit contributions
 entire-0.038
 un-0.038
 open-0.037
 pack-0.037
 was-0.033
Tokenak
Token ID461
Feature activation-5.77e-01

Pos logit contributions
 conclusions+0.029
orting+0.028
 mosquitoes+0.027
ican+0.027
aro+0.027
Neg logit contributions
 entire-0.016
 un-0.015
 pack-0.014
 open-0.014
 opened-0.014
Token and
Token ID290
Feature activation-1.03e+00

Pos logit contributions
orting+0.022
 conclusions+0.022
 mosquitoes+0.022
aro+0.021
ican+0.021
Neg logit contributions
 entire-0.007
 un-0.007
 pack-0.007
 open-0.006
 opened-0.006
Token.
Token ID13
Feature activation-2.57e+00

Pos logit contributions
orting+0.026
 conclusions+0.026
 mosquitoes+0.025
aro+0.024
ican+0.024
Neg logit contributions
 entire-0.015
 un-0.014
 pack-0.014
 open-0.013
 opened-0.013
Token One
Token ID1881
Feature activation+3.90e-01

Pos logit contributions
aro+0.097
 conclusions+0.097
 mosquitoes+0.093
 narrative+0.093
 hawk+0.092
Neg logit contributions
 was-0.042
 pack-0.041
 un-0.040
 He-0.039
 open-0.036
Token day
Token ID1110
Feature activation-2.61e+00

Pos logit contributions
 un+0.012
 entire+0.011
 pack+0.011
 unlocked+0.011
 opened+0.011
Neg logit contributions
 mosquitoes-0.008
orting-0.008
 conclusions-0.008
 course-0.007
 hawk-0.007
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
 conclusions+0.109
ican+0.102
 narrative+0.100
orting+0.099
aro+0.095
Neg logit contributions
 was-0.041
 with-0.037
 a-0.035
 pack-0.034
 opened-0.034
Token while
Token ID981
Feature activation-2.44e+00

Pos logit contributions
 conclusions+0.038
orting+0.036
aro+0.034
 narrative+0.034
ican+0.034
Neg logit contributions
 was-0.047
 pack-0.047
 un-0.046
 open-0.045
 opened-0.045
Token Mitt
Token ID16627
Feature activation-4.52e+00

Pos logit contributions
orting+0.092
 conclusions+0.091
 narrative+0.086
ican+0.085
obb+0.085
Neg logit contributions
 was-0.042
 a-0.041
 un-0.040
 his-0.039
 with-0.038
Tokenens
Token ID641
Feature activation-4.24e-01

Pos logit contributions
aro+0.083
 conclusions+0.080
 violence+0.067
 prope+0.067
 hawk+0.066
Neg logit contributions
 was-0.159
 entire-0.153
 pack-0.151
 whole-0.147
 put-0.147
Token was
Token ID373
Feature activation-1.94e+00

Pos logit contributions
 mosquitoes+0.016
ede+0.016
 course+0.016
orting+0.015
 conclusions+0.015
Neg logit contributions
 entire-0.005
 pack-0.004
 un-0.004
 whole-0.004
pack-0.004
Token eating
Token ID6600
Feature activation-1.31e+00

Pos logit contributions
orting+0.062
 conclusions+0.062
 mosquitoes+0.060
aro+0.059
ican+0.058
Neg logit contributions
 entire-0.038
 un-0.037
 pack-0.036
 open-0.035
 opened-0.034
Token her
Token ID607
Feature activation+1.04e+00

Pos logit contributions
 conclusions+0.054
orting+0.053
aro+0.053
ican+0.053
unn+0.052
Neg logit contributions
 open-0.015
 un-0.015
 was-0.015
 with-0.014
 his-0.014
Token pasta
Token ID26296
Feature activation-1.14e+00

Pos logit contributions
 un+0.028
 entire+0.027
 unlocked+0.027
 pack+0.026
 opened+0.025
Neg logit contributions
 mosquitoes-0.023
orting-0.022
 course-0.022
 conclusions-0.021
bs-0.021
Token garden
Token ID11376
Feature activation-1.13e+00

Pos logit contributions
aro+0.020
orting+0.020
 conclusions+0.019
bs+0.018
ican+0.018
Neg logit contributions
 entire-0.014
 open-0.013
 un-0.013
 whole-0.013
 pack-0.012
Token.
Token ID13
Feature activation-9.63e-01

Pos logit contributions
 conclusions+0.042
orting+0.042
 mosquitoes+0.041
aro+0.041
 hawk+0.040
Neg logit contributions
 entire-0.017
 un-0.016
 pack-0.015
 open-0.015
 was-0.015
Token She
Token ID1375
Feature activation-3.04e-01

Pos logit contributions
orting+0.034
 conclusions+0.034
 mosquitoes+0.033
aro+0.032
 hawk+0.032
Neg logit contributions
 entire-0.016
 un-0.016
 pack-0.015
 was-0.015
 open-0.015
Token saw
Token ID2497
Feature activation-8.60e-01

Pos logit contributions
orting+0.012
 conclusions+0.012
 mosquitoes+0.012
aro+0.011
 course+0.011
Neg logit contributions
 entire-0.004
 un-0.004
 pack-0.004
 open-0.003
 whole-0.003
Token a
Token ID257
Feature activation-4.13e-01

Pos logit contributions
orting+0.044
 conclusions+0.042
leness+0.041
 narrative+0.041
ede+0.041
Neg logit contributions
 was-0.005
 a-0.004
 open-0.003
 opened-0.002
 un-0.002
Token cater
Token ID20825
Feature activation-4.37e+00

Pos logit contributions
orting+0.014
 conclusions+0.014
 mosquitoes+0.014
aro+0.014
ican+0.013
Neg logit contributions
 entire-0.007
 un-0.007
 pack-0.007
 open-0.006
 was-0.006
Tokenpillar
Token ID41643
Feature activation-1.85e+00

Pos logit contributions
 supervision+0.118
 injuries+0.118
 elbows+0.113
 fault+0.112
 violence+0.108
Neg logit contributions
 entire-0.115
 with-0.111
 was-0.111
 filled-0.109
 open-0.107
Token crawling
Token ID34499
Feature activation-8.05e-01

Pos logit contributions
orting+0.065
 conclusions+0.065
aro+0.061
 mosquitoes+0.060
unn+0.060
Neg logit contributions
 was-0.034
 with-0.032
 open-0.032
 entire-0.031
 un-0.031
Token on
Token ID319
Feature activation-1.14e+00

Pos logit contributions
 conclusions+0.035
orting+0.034
unn+0.033
ittens+0.033
 inconvenience+0.032
Neg logit contributions
 was-0.009
 open-0.008
 entire-0.008
 with-0.008
 opened-0.007
Token a
Token ID257
Feature activation-3.86e+00

Pos logit contributions
orting+0.059
aro+0.057
unn+0.057
 narrative+0.055
ittens+0.055
Neg logit contributions
 open-0.006
 a-0.005
 was-0.004
 entire-0.004
 un-0.003
Token leaf
Token ID12835
Feature activation-2.85e+00

Pos logit contributions
orting+0.112
aro+0.105
 narrative+0.101
 conclusions+0.101
ican+0.098
Neg logit contributions
 entire-0.102
 open-0.100
 un-0.096
 whole-0.095
 pack-0.093
Token that
Token ID326
Feature activation-7.71e-01

Pos logit contributions
 entire+0.001
 un+0.001
 pack+0.001
 open+0.001
 opened+0.001
Neg logit contributions
orting-0.001
 conclusions-0.001
 mosquitoes-0.001
aro-0.001
 course-0.001
Token ducks
Token ID39694
Feature activation-1.21e+00

Pos logit contributions
orting+0.027
 conclusions+0.026
ican+0.026
aro+0.025
 mosquitoes+0.025
Neg logit contributions
 un-0.013
 was-0.013
 entire-0.013
 pack-0.012
 open-0.012
Token were
Token ID547
Feature activation-1.07e+00

Pos logit contributions
 conclusions+0.044
orting+0.043
 mosquitoes+0.042
aro+0.041
ican+0.041
Neg logit contributions
 un-0.019
 entire-0.019
 was-0.018
 pack-0.018
 open-0.017
Token better
Token ID1365
Feature activation-6.48e-01

Pos logit contributions
orting+0.038
 conclusions+0.038
 mosquitoes+0.036
aro+0.036
ican+0.035
Neg logit contributions
 entire-0.018
 un-0.018
 pack-0.017
 open-0.016
 was-0.016
Token than
Token ID621
Feature activation-1.41e+00

Pos logit contributions
orting+0.023
 conclusions+0.023
 mosquitoes+0.022
aro+0.022
ican+0.021
Neg logit contributions
 entire-0.011
 un-0.011
 pack-0.010
 open-0.010
 was-0.010
Token ge
Token ID4903
Feature activation-5.15e+00

Pos logit contributions
orting+0.049
 conclusions+0.048
ican+0.046
ittens+0.045
ede+0.045
Neg logit contributions
 un-0.025
 was-0.024
 open-0.024
 entire-0.023
 opened-0.023
Tokenese
Token ID2771
Feature activation-2.22e+00

Pos logit contributions
 conclusions+0.153
 narrative+0.149
ion+0.145
aro+0.143
 mosquitoes+0.141
Neg logit contributions
 entire-0.106
 whole-0.098
 un-0.095
 was-0.095
 pack-0.092
Token.
Token ID13
Feature activation-3.69e-01

Pos logit contributions
 conclusions+0.084
orting+0.082
ican+0.080
 mosquitoes+0.079
aro+0.079
Neg logit contributions
 was-0.033
 un-0.032
 entire-0.030
 open-0.029
 pack-0.029
Token They
Token ID1119
Feature activation-1.33e+00

Pos logit contributions
orting+0.016
 conclusions+0.016
 mosquitoes+0.016
aro+0.015
ican+0.015
Neg logit contributions
 un-0.003
 entire-0.003
 pack-0.003
 was-0.003
 open-0.003
Token looked
Token ID3114
Feature activation-3.60e-01

Pos logit contributions
 conclusions+0.051
orting+0.051
 mosquitoes+0.049
ican+0.048
aro+0.048
Neg logit contributions
 un-0.019
 was-0.018
 entire-0.018
 pack-0.018
 opened-0.018
Token for
Token ID329
Feature activation-1.36e+00

Pos logit contributions
 conclusions+0.013
orting+0.012
 mosquitoes+0.012
aro+0.012
ican+0.012
Neg logit contributions
 entire-0.006
 un-0.006
 pack-0.006
 open-0.006
 was-0.005