TOP ACTIVATIONS
MAX = 0.073

After they finished cleaning the chim ney , her dad gave
" The carpet is ripped ," said Mama .
The hockey stick was broken ! The little
games to play with the chim ney . Kids had a
need to decor ate the chim ney ," Ben said ,
't know where this box came from and it could belong
was definitely coming from the chim ney ! G ently she
day , they began to quarrel . The one
. I won 't be mad . I just want to
, but it was too torn . Tim my was sad
everyone was happy about the chim ney . The farmer who
, but it was too torn . So , she decided
help push himself onto the chim ney . He made it
. I won 't be mad at you ".
, but it was too broken to play with again .
but it was still all messed up . Mama got really
Melody was swimming around the tank , when suddenly she noticed
mom my didn 't get mad , but she did order
wanted to look inside the chim ney , but it was
of animal the wing had belonged to . The

MIN ACTIVATIONS
MIN = -0.085

M ummy pe ered down at the ground
crying . He pe ered out to the hallway
the couch . She said n aps were good for him
. Mom my pe ered behind the hedge and
" Anna ? Ben ? Where are you ?" Mom calls
Ab by pe ep ed through the window
beautiful princess dress . She tw irled and danced around the
dad 's room . She pe ered inside and saw lots
Spot 's nose . He n uzz led her and made
Why are you bald ? Where is your hair ?" Tom
me ." Tom pe eks from under the covers
a magic wand . She tw irled around in her princess
behind a tree . She pe ep ed around the corner
. The cam els were so happy to
a show for us . Where is your menu ?"
" Who is he ? Where is he ?" Lily asked
pick it up . She tw irled it around and around
am your family . I do not care about the honey
him . He pe eks from behind the tree
again ?" Spot n uzz led Tom . He

INTERVAL 0.034 - 0.073
CONTAINS 0.670%

there was a little girl named L izzy . She was
days , he was very sad and very hungry . He
basket . He was very proud of his work .
it was or where it came from . He began to
a big gust of wind came by and scattered all the

INTERVAL -0.006 - 0.034
CONTAINS 69.021%

had poison inside ! The poison se ep ed
new here ." The old j agu ar was
fits the cup board ," he said . Mary
cup was old and cracked . It was not a good
said he was ok and they both were soon smiling and

INTERVAL -0.046 - -0.006
CONTAINS 30.242%

. Sally was not foolish like the other people in the
ice - cream truck ," mom said . She took their
't you try it on ?" Mom said , showing her
the park . But her Mom said no . She said
Tom . " Wow , that is a pretty

INTERVAL -0.085 - -0.046
CONTAINS 0.066%

the movies . Ann abel le was very excited and said
see a pony . She kne eled down and looked out
you want to play with me ?" Tom nods
. Maybe we should ask someone for help ."
One day his mum took him to the park
Token After
Token ID2293
Feature activation-1.21e-02

Pos logit contributions
urt+0.005
lock+0.005
 contents+0.005
gy+0.005
ucer+0.005
Neg logit contributions
 films-0.007
 concert-0.007
 movies-0.007
 crowds-0.006
 pian-0.006
Token they
Token ID484
Feature activation+1.73e-02

Pos logit contributions
 concert+0.020
 movies+0.020
 films+0.020
 crowds+0.018
 pian+0.018
Neg logit contributions
urt-0.022
ometer-0.022
 contents-0.021
lock-0.021
gy-0.021
Token finished
Token ID5201
Feature activation+1.14e-02

Pos logit contributions
ometer+0.033
urt+0.033
gy+0.032
 contents+0.032
ucer+0.031
Neg logit contributions
 concert-0.026
 movies-0.024
 pian-0.023
 films-0.022
 crowds-0.022
Token cleaning
Token ID12724
Feature activation+2.01e-02

Pos logit contributions
urt+0.019
lock+0.019
 contents+0.018
ometer+0.018
gy+0.018
Neg logit contributions
 concert-0.020
 films-0.019
 movies-0.019
 crowds-0.019
 pian-0.019
Token the
Token ID262
Feature activation+1.68e-02

Pos logit contributions
urt+0.034
lock+0.033
 contents+0.033
gy+0.032
ometer+0.032
Neg logit contributions
 concert-0.034
 films-0.034
 movies-0.033
 crowds-0.033
 pian-0.033
Token chim
Token ID18205
Feature activation+6.52e-02

Pos logit contributions
urt+0.030
ucer+0.029
lock+0.028
gy+0.027
inky+0.027
Neg logit contributions
 movies-0.028
 films-0.028
 pian-0.027
 concert-0.027
 crowds-0.027
Tokenney
Token ID1681
Feature activation+2.32e-02

Pos logit contributions
 contents+0.079
lock+0.077
ucer+0.075
urt+0.073
 token+0.072
Neg logit contributions
 movies-0.117
aw-0.117
 films-0.114
 pastor-0.110
 pian-0.110
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
urt+0.044
lock+0.043
 contents+0.042
ucer+0.042
ometer+0.041
Neg logit contributions
 concert-0.034
 movies-0.033
 films-0.033
 crowds-0.032
 pian-0.032
Token her
Token ID607
Feature activation-1.43e-02

Pos logit contributions
urt+0.021
gy+0.020
lock+0.020
 contents+0.019
 Finally+0.019
Neg logit contributions
 concert-0.024
 films-0.023
 crowds-0.023
 pian-0.023
 movies-0.023
Token dad
Token ID9955
Feature activation-1.58e-02

Pos logit contributions
 concert+0.028
 films+0.028
 movies+0.027
 crowds+0.027
 pian+0.027
Neg logit contributions
urt-0.021
lock-0.020
 contents-0.020
ometer-0.019
gy-0.019
Token gave
Token ID2921
Feature activation+5.45e-03

Pos logit contributions
 concert+0.029
 films+0.029
 movies+0.028
 pian+0.028
 crowds+0.028
Neg logit contributions
urt-0.024
lock-0.023
 contents-0.023
ometer-0.023
gy-0.023
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
urt+0.020
 contents+0.019
lock+0.019
gy+0.019
ometer+0.018
Neg logit contributions
 concert-0.024
 films-0.023
 movies-0.023
 pian-0.023
 crowds-0.022
Token"
Token ID1
Feature activation+1.66e-02

Pos logit contributions
urt+0.016
 contents+0.016
gy+0.015
ometer+0.015
lock+0.015
Neg logit contributions
 concert-0.023
 films-0.023
 movies-0.023
 pian-0.023
 crowds-0.022
TokenThe
Token ID464
Feature activation-3.47e-03

Pos logit contributions
urt+0.023
lock+0.023
 contents+0.022
ometer+0.022
ucer+0.021
Neg logit contributions
 concert-0.034
 films-0.033
 crowds-0.032
 movies-0.032
 pian-0.031
Token carpet
Token ID20710
Feature activation+1.79e-02

Pos logit contributions
 concert+0.006
 movies+0.006
 crowds+0.006
 films+0.006
 pian+0.006
Neg logit contributions
urt-0.006
lock-0.006
ucer-0.006
gy-0.006
ometer-0.005
Token is
Token ID318
Feature activation+1.17e-02

Pos logit contributions
urt+0.031
lock+0.030
 contents+0.029
gy+0.029
ometer+0.029
Neg logit contributions
 concert-0.030
 films-0.029
 movies-0.029
 crowds-0.028
 pian-0.028
Token ripped
Token ID19551
Feature activation+6.42e-02

Pos logit contributions
urt+0.016
lock+0.015
 contents+0.015
gy+0.015
ometer+0.015
Neg logit contributions
 concert-0.024
 films-0.023
 movies-0.023
 crowds-0.023
 pian-0.023
Token,"
Token ID553
Feature activation+1.69e-02

Pos logit contributions
urt+0.107
lock+0.104
 contents+0.101
 Finally+0.101
ometer+0.100
Neg logit contributions
 films-0.108
 concert-0.106
 movies-0.105
 crowds-0.104
 pian-0.104
Token said
Token ID531
Feature activation+2.09e-02

Pos logit contributions
urt+0.027
lock+0.026
 contents+0.026
gy+0.026
ometer+0.025
Neg logit contributions
 concert-0.030
 films-0.030
 movies-0.030
 pian-0.029
 crowds-0.029
Token Mama
Token ID40084
Feature activation-2.45e-02

Pos logit contributions
urt+0.029
lock+0.027
 contents+0.026
gy+0.026
ometer+0.026
Neg logit contributions
 concert-0.043
 films-0.042
 movies-0.042
 crowds-0.041
 pian-0.041
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
 concert+0.038
 films+0.038
 movies+0.038
 pian+0.037
 crowds+0.036
Neg logit contributions
 contents-0.043
urt-0.041
lock-0.041
ucer-0.041
 virtue-0.040
Token
Token ID198
Feature activation+1.58e-02

Pos logit contributions
urt+0.015
lock+0.014
 contents+0.014
gy+0.014
ometer+0.014
Neg logit contributions
 concert-0.022
 films-0.022
 movies-0.022
 crowds-0.022
 pian-0.022
Token
Token ID198
Feature activation-7.90e-04

Pos logit contributions
 concert+0.005
 movies+0.005
 films+0.005
 pian+0.005
 crowds+0.005
Neg logit contributions
urt-0.005
 contents-0.005
gy-0.005
lock-0.005
ometer-0.005
TokenThe
Token ID464
Feature activation+7.61e-03

Pos logit contributions
 concert+0.001
 movies+0.001
 films+0.001
 pian+0.001
La+0.001
Neg logit contributions
urt-0.001
gy-0.001
 contents-0.001
ometer-0.001
ucer-0.001
Token hockey
Token ID12217
Feature activation+2.01e-04

Pos logit contributions
urt+0.011
lock+0.011
gy+0.010
ometer+0.010
 contents+0.010
Neg logit contributions
 concert-0.015
 films-0.015
 movies-0.015
 crowds-0.015
 pian-0.014
Token stick
Token ID4859
Feature activation+7.75e-03

Pos logit contributions
urt+0.000
lock+0.000
 contents+0.000
gy+0.000
ometer+0.000
Neg logit contributions
 concert-0.000
 films-0.000
 movies-0.000
 crowds-0.000
 pian-0.000
Token was
Token ID373
Feature activation+1.34e-02

Pos logit contributions
urt+0.012
lock+0.011
 contents+0.011
gy+0.011
ometer+0.011
Neg logit contributions
 concert-0.014
 films-0.014
 movies-0.014
 crowds-0.014
 pian-0.014
Token broken
Token ID5445
Feature activation+6.39e-02

Pos logit contributions
urt+0.022
lock+0.021
ometer+0.020
 contents+0.020
gy+0.020
Neg logit contributions
 concert-0.024
 films-0.023
 movies-0.023
 crowds-0.023
 pian-0.023
Token!
Token ID0
Feature activation+7.57e-03

Pos logit contributions
urt+0.118
lock+0.115
 contents+0.113
ometer+0.111
gy+0.110
Neg logit contributions
 concert-0.098
 films-0.096
 movies-0.095
 crowds-0.094
 pian-0.093
Token
Token ID198
Feature activation+5.04e-03

Pos logit contributions
urt+0.010
lock+0.010
 contents+0.010
ucer+0.010
gy+0.010
Neg logit contributions
 concert-0.015
 films-0.015
 movies-0.015
 crowds-0.015
 pian-0.014
Token
Token ID198
Feature activation+3.20e-03

Pos logit contributions
urt+0.008
 contents+0.008
gy+0.008
lock+0.008
ometer+0.008
Neg logit contributions
 concert-0.009
 movies-0.008
 films-0.008
 pian-0.008
La-0.008
TokenThe
Token ID464
Feature activation+2.57e-03

Pos logit contributions
urt+0.005
 contents+0.005
gy+0.005
ometer+0.005
lock+0.005
Neg logit contributions
 concert-0.006
 movies-0.005
 films-0.005
 pian-0.005
La-0.005
Token little
Token ID1310
Feature activation+1.14e-02

Pos logit contributions
urt+0.004
lock+0.004
gy+0.003
 contents+0.003
ometer+0.003
Neg logit contributions
 concert-0.005
 films-0.005
 movies-0.005
 crowds-0.005
 pian-0.005
Token games
Token ID1830
Feature activation+1.30e-02

Pos logit contributions
 contents+0.035
urt+0.032
 job+0.031
lock+0.030
 hammer+0.030
Neg logit contributions
 concert-0.056
La-0.052
 films-0.051
 crowds-0.051
 pastor-0.050
Token to
Token ID284
Feature activation+8.50e-03

Pos logit contributions
urt+0.019
lock+0.018
 contents+0.018
gy+0.017
ucer+0.017
Neg logit contributions
 films-0.025
 concert-0.025
 movies-0.024
 crowds-0.024
 pian-0.024
Token play
Token ID711
Feature activation+1.36e-02

Pos logit contributions
urt+0.011
lock+0.011
 contents+0.011
gy+0.011
 Finally+0.010
Neg logit contributions
 films-0.017
 concert-0.017
 pian-0.017
 crowds-0.016
 movies-0.016
Token with
Token ID351
Feature activation+2.42e-02

Pos logit contributions
urt+0.018
 contents+0.018
lock+0.018
gy+0.018
 Finally+0.017
Neg logit contributions
 concert-0.027
 crowds-0.026
 pastor-0.026
 films-0.026
 pian-0.026
Token the
Token ID262
Feature activation+7.27e-03

Pos logit contributions
urt+0.036
lock+0.035
 contents+0.035
gy+0.034
ometer+0.033
Neg logit contributions
 concert-0.046
 films-0.045
 crowds-0.044
 movies-0.044
 pian-0.044
Token chim
Token ID18205
Feature activation+6.36e-02

Pos logit contributions
urt+0.013
lock+0.012
ucer+0.012
gy+0.011
ometer+0.011
Neg logit contributions
 movies-0.013
 crowds-0.013
 films-0.013
 pian-0.012
 concert-0.012
Tokenney
Token ID1681
Feature activation+1.56e-02

Pos logit contributions
lock+0.069
 contents+0.069
urt+0.067
ucer+0.066
 virtue+0.062
Neg logit contributions
 movies-0.131
aw-0.130
 films-0.129
 pastor-0.126
 pian-0.126
Token.
Token ID13
Feature activation-1.19e-03

Pos logit contributions
urt+0.024
lock+0.023
 contents+0.022
gy+0.022
ometer+0.022
Neg logit contributions
 concert-0.030
 films-0.029
 movies-0.028
 crowds-0.028
 pian-0.028
Token Kids
Token ID17476
Feature activation+3.48e-03

Pos logit contributions
 concert+0.003
 films+0.002
 movies+0.002
 crowds+0.002
 pian+0.002
Neg logit contributions
urt-0.002
lock-0.002
 contents-0.001
gy-0.001
ometer-0.001
Token had
Token ID550
Feature activation+1.03e-02

Pos logit contributions
urt+0.006
lock+0.005
 contents+0.005
gy+0.005
ometer+0.005
Neg logit contributions
 concert-0.006
 films-0.006
 movies-0.006
 crowds-0.006
 pian-0.006
Token a
Token ID257
Feature activation+3.54e-04

Pos logit contributions
urt+0.014
 contents+0.014
lock+0.013
gy+0.013
 Finally+0.013
Neg logit contributions
 concert-0.021
 films-0.021
 movies-0.020
 pian-0.020
 crowds-0.020
Token need
Token ID761
Feature activation-2.86e-03

Pos logit contributions
urt+0.019
 contents+0.018
lock+0.018
ometer+0.018
gy+0.017
Neg logit contributions
 concert-0.022
 films-0.021
 movies-0.021
 crowds-0.021
 pian-0.021
Token to
Token ID284
Feature activation+7.04e-03

Pos logit contributions
 concert+0.006
 films+0.006
 crowds+0.006
 pian+0.006
 movies+0.006
Neg logit contributions
 contents-0.004
urt-0.004
lock-0.004
 Finally-0.003
ometer-0.003
Token decor
Token ID11705
Feature activation+3.66e-03

Pos logit contributions
urt+0.011
ometer+0.011
ucer+0.010
lock+0.010
 contents+0.010
Neg logit contributions
 concert-0.014
 movies-0.013
 films-0.013
 pian-0.013
 crowds-0.012
Tokenate
Token ID378
Feature activation+1.55e-03

Pos logit contributions
urt+0.003
 contents+0.003
gy+0.003
lock+0.002
 job+0.002
Neg logit contributions
 crowds-0.009
 concert-0.009
 pian-0.009
 strangers-0.009
 pastor-0.009
Token the
Token ID262
Feature activation+2.49e-03

Pos logit contributions
urt+0.002
 contents+0.002
lock+0.002
gy+0.002
ometer+0.002
Neg logit contributions
 concert-0.003
 crowds-0.003
 pian-0.003
 films-0.003
 movies-0.003
Token chim
Token ID18205
Feature activation+6.26e-02

Pos logit contributions
urt+0.004
gy+0.004
lock+0.004
ometer+0.004
ucer+0.004
Neg logit contributions
 concert-0.005
 movies-0.005
 crowds-0.005
 films-0.005
 pian-0.005
Tokenney
Token ID1681
Feature activation+2.64e-02

Pos logit contributions
urt+0.072
 contents+0.067
ucer+0.065
lock+0.065
inky+0.061
Neg logit contributions
 movies-0.129
 films-0.125
 concert-0.121
aw-0.121
 pastor-0.120
Token,"
Token ID553
Feature activation+3.62e-03

Pos logit contributions
urt+0.041
lock+0.039
 contents+0.038
ucer+0.037
ometer+0.037
Neg logit contributions
 films-0.048
 concert-0.048
 movies-0.048
 crowds-0.046
 pian-0.046
Token Ben
Token ID3932
Feature activation-1.41e-02

Pos logit contributions
urt+0.005
lock+0.005
 contents+0.005
 Finally+0.005
ometer+0.004
Neg logit contributions
 concert-0.007
 films-0.007
 crowds-0.007
 pian-0.007
 movies-0.007
Token said
Token ID531
Feature activation+9.24e-03

Pos logit contributions
 concert+0.025
 films+0.024
 movies+0.024
 pian+0.024
 crowds+0.024
Neg logit contributions
urt-0.023
lock-0.022
 contents-0.022
gy-0.022
ucer-0.022
Token,
Token ID11
Feature activation-1.84e-02

Pos logit contributions
urt+0.016
lock+0.016
 contents+0.016
ucer+0.015
ometer+0.015
Neg logit contributions
 concert-0.015
 films-0.015
 movies-0.015
 crowds-0.015
 pian-0.014
Token't
Token ID470
Feature activation+1.90e-02

Pos logit contributions
urt+0.002
lock+0.002
 contents+0.002
gy+0.002
ometer+0.002
Neg logit contributions
 concert-0.004
 films-0.004
 movies-0.004
 crowds-0.004
 pian-0.004
Token know
Token ID760
Feature activation-2.00e-02

Pos logit contributions
urt+0.030
 contents+0.029
ucer+0.029
ometer+0.029
lock+0.029
Neg logit contributions
 concert-0.035
 crowds-0.034
 movies-0.033
 pian-0.033
 films-0.033
Token where
Token ID810
Feature activation-2.93e-02

Pos logit contributions
 concert+0.038
 films+0.037
 crowds+0.036
 pian+0.036
La+0.035
Neg logit contributions
urt-0.030
lock-0.028
ometer-0.028
gy-0.027
 contents-0.027
Token this
Token ID428
Feature activation-3.81e-03

Pos logit contributions
 concert+0.060
 pian+0.058
 films+0.057
 crowds+0.057
La+0.057
Neg logit contributions
urt-0.037
lock-0.037
ometer-0.036
 contents-0.035
gy-0.035
Token box
Token ID3091
Feature activation+2.18e-02

Pos logit contributions
 concert+0.007
 films+0.007
 movies+0.007
 crowds+0.007
 pian+0.007
Neg logit contributions
urt-0.006
lock-0.006
ucer-0.006
ometer-0.006
gy-0.006
Token came
Token ID1625
Feature activation+6.24e-02

Pos logit contributions
urt+0.037
lock+0.036
 contents+0.035
ometer+0.034
gy+0.034
Neg logit contributions
 concert-0.037
 films-0.036
 movies-0.035
 crowds-0.035
 pian-0.035
Token from
Token ID422
Feature activation+1.74e-02

Pos logit contributions
urt+0.107
lock+0.103
 contents+0.101
ometer+0.100
gy+0.099
Neg logit contributions
 concert-0.106
 films-0.104
 movies-0.102
 crowds-0.101
 pian-0.100
Token and
Token ID290
Feature activation+1.37e-04

Pos logit contributions
urt+0.029
gy+0.029
lock+0.028
 contents+0.028
 Finally+0.027
Neg logit contributions
 pian-0.030
 concert-0.030
 films-0.029
 crowds-0.029
 movies-0.029
Token it
Token ID340
Feature activation-5.33e-03

Pos logit contributions
urt+0.000
lock+0.000
 contents+0.000
gy+0.000
ometer+0.000
Neg logit contributions
 concert-0.000
 films-0.000
 movies-0.000
 crowds-0.000
 pian-0.000
Token could
Token ID714
Feature activation-2.56e-03

Pos logit contributions
 concert+0.009
 films+0.008
 movies+0.008
 crowds+0.008
 pian+0.008
Neg logit contributions
urt-0.010
lock-0.009
ucer-0.009
 contents-0.009
ometer-0.009
Token belong
Token ID5594
Feature activation+2.40e-02

Pos logit contributions
 concert+0.004
 movies+0.004
 strangers+0.004
 films+0.004
 crowds+0.004
Neg logit contributions
urt-0.004
ucer-0.004
ometer-0.004
lock-0.004
 Finally-0.004
Token was
Token ID373
Feature activation+2.02e-02

Pos logit contributions
 concert+0.003
 films+0.003
 movies+0.003
 pian+0.003
 crowds+0.003
Neg logit contributions
urt-0.003
lock-0.003
 contents-0.003
ometer-0.003
ucer-0.003
Token definitely
Token ID4753
Feature activation+1.50e-02

Pos logit contributions
urt+0.028
gy+0.027
 contents+0.027
lock+0.026
 job+0.026
Neg logit contributions
 concert-0.041
 films-0.039
 crowds-0.039
 pian-0.038
 movies-0.038
Token coming
Token ID2406
Feature activation+4.08e-02

Pos logit contributions
urt+0.021
lock+0.020
 contents+0.020
gy+0.020
ometer+0.019
Neg logit contributions
 concert-0.030
 films-0.030
 movies-0.029
 crowds-0.029
 pian-0.029
Token from
Token ID422
Feature activation+1.64e-02

Pos logit contributions
urt+0.073
 contents+0.069
lock+0.068
ucer+0.068
ometer+0.067
Neg logit contributions
 movies-0.064
 concert-0.064
 films-0.063
 crowds-0.062
 pian-0.061
Token the
Token ID262
Feature activation+1.51e-02

Pos logit contributions
urt+0.021
gy+0.021
 contents+0.021
lock+0.021
 Finally+0.020
Neg logit contributions
 concert-0.034
 pian-0.033
 films-0.033
 crowds-0.032
 movies-0.032
Token chim
Token ID18205
Feature activation+6.21e-02

Pos logit contributions
urt+0.026
lock+0.024
ucer+0.023
 contents+0.023
ages+0.023
Neg logit contributions
 movies-0.027
 concert-0.026
 crowds-0.026
 films-0.026
 pian-0.026
Tokenney
Token ID1681
Feature activation+2.19e-02

Pos logit contributions
 contents+0.066
urt+0.065
lock+0.062
ucer+0.061
 Finally+0.058
Neg logit contributions
 movies-0.133
aw-0.133
 concert-0.130
 films-0.129
 crowds-0.128
Token!
Token ID0
Feature activation+3.03e-03

Pos logit contributions
urt+0.034
lock+0.032
 contents+0.032
gy+0.031
ometer+0.031
Neg logit contributions
 concert-0.041
 films-0.040
 movies-0.040
 crowds-0.039
 pian-0.039
Token G
Token ID402
Feature activation-3.50e-02

Pos logit contributions
 contents+0.005
urt+0.005
ucer+0.005
gy+0.005
lock+0.005
Neg logit contributions
 movies-0.005
 concert-0.005
 crowds-0.005
 films-0.005
 pian-0.005
Tokenently
Token ID1473
Feature activation-2.14e-02

Pos logit contributions
 concert+0.063
 films+0.061
 movies+0.059
 pian+0.059
 pastor+0.059
Neg logit contributions
 contents-0.051
 Finally-0.050
urt-0.050
lock-0.050
ucer-0.049
Token she
Token ID673
Feature activation-7.95e-03

Pos logit contributions
 films+0.042
 concert+0.041
 movies+0.040
 crowds+0.040
 pian+0.040
Neg logit contributions
urt-0.032
lock-0.031
gy-0.030
 contents-0.029
ometer-0.029
Token day
Token ID1110
Feature activation-1.08e-03

Pos logit contributions
urt+0.025
lock+0.024
 contents+0.024
ucer+0.024
 Finally+0.023
Neg logit contributions
 movies-0.029
 films-0.029
 concert-0.029
 dolphins-0.027
 pian-0.027
Token,
Token ID11
Feature activation+5.90e-03

Pos logit contributions
 films+0.002
 concert+0.002
 movies+0.002
 pian+0.001
 crowds+0.001
Neg logit contributions
urt-0.002
 contents-0.002
lock-0.002
ometer-0.002
gy-0.002
Token they
Token ID484
Feature activation+6.59e-03

Pos logit contributions
 contents+0.013
urt+0.013
 virtue+0.013
lock+0.013
urd+0.012
Neg logit contributions
 films-0.008
 movies-0.007
 dolphins-0.007
 pian-0.007
 concert-0.007
Token began
Token ID2540
Feature activation+1.46e-02

Pos logit contributions
urt+0.010
lock+0.010
 contents+0.009
gy+0.009
ometer+0.009
Neg logit contributions
 concert-0.013
 films-0.012
 movies-0.012
 crowds-0.012
 pian-0.012
Token to
Token ID284
Feature activation+5.57e-03

Pos logit contributions
urt+0.021
lock+0.020
 contents+0.020
gy+0.020
ometer+0.020
Neg logit contributions
 concert-0.029
 films-0.028
 crowds-0.027
 movies-0.027
 pian-0.027
Token quarrel
Token ID41998
Feature activation+6.20e-02

Pos logit contributions
urt+0.008
lock+0.008
 contents+0.008
gy+0.008
ometer+0.008
Neg logit contributions
 concert-0.011
 films-0.010
 movies-0.010
 crowds-0.010
 pian-0.010
Token.
Token ID13
Feature activation+5.00e-03

Pos logit contributions
urt+0.105
lock+0.102
 contents+0.100
gy+0.099
ometer+0.098
Neg logit contributions
 concert-0.107
 films-0.105
 movies-0.103
 crowds-0.102
 pian-0.101
Token
Token ID198
Feature activation-6.31e-03

Pos logit contributions
urt+0.007
lock+0.007
 contents+0.006
ucer+0.006
ometer+0.006
Neg logit contributions
 films-0.010
 concert-0.010
 movies-0.010
 crowds-0.010
 pian-0.010
Token
Token ID198
Feature activation+8.09e-04

Pos logit contributions
 concert+0.013
 films+0.013
 crowds+0.013
 movies+0.013
 pian+0.013
Neg logit contributions
urt-0.008
lock-0.008
gy-0.008
 contents-0.008
 Finally-0.008
TokenThe
Token ID464
Feature activation+1.25e-03

Pos logit contributions
urt+0.001
 contents+0.001
lock+0.001
ometer+0.001
ucer+0.001
Neg logit contributions
 films-0.002
 concert-0.002
 movies-0.001
 pian-0.001
 crowds-0.001
Token one
Token ID530
Feature activation-2.21e-02

Pos logit contributions
urt+0.002
ucer+0.002
lock+0.002
 Finally+0.002
ometer+0.002
Neg logit contributions
 concert-0.002
 films-0.002
 movies-0.002
 dolphins-0.002
 opera-0.002
Token.
Token ID13
Feature activation-3.69e-03

Pos logit contributions
 films+0.043
 concert+0.042
 pian+0.042
 crowds+0.041
 movies+0.041
Neg logit contributions
urt-0.024
gy-0.024
lock-0.023
 contents-0.023
ometer-0.022
Token I
Token ID314
Feature activation-1.24e-02

Pos logit contributions
 concert+0.006
 films+0.006
 movies+0.006
 crowds+0.006
 pian+0.006
Neg logit contributions
urt-0.006
 contents-0.006
lock-0.006
ucer-0.006
gy-0.006
Token won
Token ID1839
Feature activation-6.70e-04

Pos logit contributions
 concert+0.028
 films+0.027
 movies+0.027
 crowds+0.027
 pian+0.027
Neg logit contributions
urt-0.015
lock-0.014
 contents-0.014
gy-0.013
ometer-0.013
Token't
Token ID470
Feature activation+1.62e-02

Pos logit contributions
 concert+0.001
 films+0.001
 movies+0.001
 crowds+0.001
 pian+0.001
Neg logit contributions
urt-0.001
lock-0.001
gy-0.001
 contents-0.001
ometer-0.001
Token be
Token ID307
Feature activation+2.43e-02

Pos logit contributions
urt+0.022
lock+0.021
gy+0.021
 contents+0.021
ometer+0.020
Neg logit contributions
 films-0.033
 concert-0.033
 movies-0.033
 pian-0.032
 crowds-0.032
Token mad
Token ID8805
Feature activation+6.19e-02

Pos logit contributions
urt+0.031
gy+0.031
 contents+0.029
 job+0.029
lock+0.028
Neg logit contributions
 concert-0.049
 films-0.049
 movies-0.049
 pian-0.047
 pastor-0.047
Token.
Token ID13
Feature activation-3.56e-03

Pos logit contributions
urt+0.104
lock+0.099
ometer+0.097
gy+0.094
 job+0.094
Neg logit contributions
 concert-0.108
 films-0.104
 movies-0.101
 pian-0.101
 announcements-0.097
Token I
Token ID314
Feature activation-1.20e-02

Pos logit contributions
 concert+0.006
 films+0.006
 movies+0.006
 crowds+0.006
 pian+0.006
Neg logit contributions
urt-0.006
lock-0.005
 contents-0.005
gy-0.005
ometer-0.005
Token just
Token ID655
Feature activation-1.41e-02

Pos logit contributions
 concert+0.026
 films+0.026
 movies+0.026
 crowds+0.025
 pian+0.025
Neg logit contributions
urt-0.015
lock-0.014
 contents-0.014
gy-0.013
ometer-0.013
Token want
Token ID765
Feature activation-1.16e-02

Pos logit contributions
 concert+0.028
 films+0.027
 movies+0.027
 crowds+0.027
 pian+0.027
Neg logit contributions
urt-0.020
lock-0.019
 contents-0.019
gy-0.019
ometer-0.019
Token to
Token ID284
Feature activation-5.52e-04

Pos logit contributions
 concert+0.024
 films+0.023
 pian+0.023
 crowds+0.023
 announcements+0.023
Neg logit contributions
urt-0.014
 contents-0.014
lock-0.013
 job-0.013
ometer-0.012
Token,
Token ID11
Feature activation+1.17e-02

Pos logit contributions
urt+0.048
 contents+0.048
ucer+0.047
lock+0.047
gy+0.045
Neg logit contributions
 concert-0.028
 movies-0.027
 crowds-0.026
 films-0.026
 pian-0.025
Token but
Token ID475
Feature activation-1.52e-02

Pos logit contributions
urt+0.022
lock+0.021
 contents+0.021
gy+0.020
ometer+0.020
Neg logit contributions
 concert-0.018
 films-0.018
 movies-0.018
 crowds-0.017
 pian-0.017
Token it
Token ID340
Feature activation-6.09e-03

Pos logit contributions
 concert+0.019
 movies+0.018
 pian+0.017
 strangers+0.017
 crowds+0.017
Neg logit contributions
urt-0.032
 contents-0.031
lock-0.030
inky-0.030
ometer-0.030
Token was
Token ID373
Feature activation+1.13e-02

Pos logit contributions
 concert+0.009
 pian+0.008
 movies+0.008
 films+0.008
 strangers+0.008
Neg logit contributions
urt-0.011
lock-0.011
ucer-0.011
 contents-0.011
 Finally-0.010
Token too
Token ID1165
Feature activation+2.41e-02

Pos logit contributions
urt+0.025
lock+0.025
ucer+0.025
 contents+0.024
ometer+0.024
Neg logit contributions
 concert-0.012
 pian-0.011
 strangers-0.011
 films-0.011
 crowds-0.011
Token torn
Token ID12445
Feature activation+6.18e-02

Pos logit contributions
stack+0.041
gy+0.041
 Finally+0.040
 contents+0.040
lock+0.039
Neg logit contributions
 concert-0.035
 pian-0.034
 crowds-0.033
 strangers-0.032
aw-0.032
Token.
Token ID13
Feature activation-1.55e-03

Pos logit contributions
lock+0.118
 contents+0.118
urt+0.118
 Finally+0.115
ucer+0.114
Neg logit contributions
 crowds-0.075
 movies-0.074
aw-0.074
 pian-0.073
 films-0.073
Token Tim
Token ID5045
Feature activation+5.95e-03

Pos logit contributions
 movies+0.002
 crowds+0.002
 films+0.002
 pian+0.002
aw+0.002
Neg logit contributions
ucer-0.003
lock-0.003
 contents-0.003
urt-0.003
sy-0.003
Tokenmy
Token ID1820
Feature activation+5.37e-03

Pos logit contributions
lock+0.009
urt+0.009
 contents+0.009
ometer+0.008
 Finally+0.008
Neg logit contributions
 concert-0.011
 crowds-0.011
 films-0.011
 movies-0.011
 pian-0.010
Token was
Token ID373
Feature activation-7.08e-03

Pos logit contributions
urt+0.009
lock+0.009
 contents+0.009
ucer+0.009
ometer+0.009
Neg logit contributions
 concert-0.009
 films-0.008
 movies-0.008
 pian-0.008
 crowds-0.008
Token sad
Token ID6507
Feature activation+3.13e-02

Pos logit contributions
 concert+0.012
 films+0.012
 movies+0.011
 crowds+0.011
 pian+0.011
Neg logit contributions
urt-0.012
lock-0.012
 contents-0.012
ometer-0.012
ucer-0.012
Token everyone
Token ID2506
Feature activation-2.20e-02

Pos logit contributions
urt+0.032
lock+0.031
 contents+0.030
gy+0.030
ometer+0.030
Neg logit contributions
 concert-0.036
 films-0.036
 movies-0.035
 crowds-0.035
 pian-0.034
Token was
Token ID373
Feature activation+3.97e-03

Pos logit contributions
 films+0.050
 concert+0.047
 pian+0.047
 movies+0.047
La+0.047
Neg logit contributions
urt-0.025
lock-0.025
gy-0.024
ometer-0.023
 contents-0.021
Token happy
Token ID3772
Feature activation+2.91e-02

Pos logit contributions
 job+0.005
gy+0.005
urt+0.005
 contents+0.005
lock+0.005
Neg logit contributions
 films-0.008
 movies-0.008
 concert-0.008
 crowds-0.007
 pian-0.007
Token about
Token ID546
Feature activation+3.29e-05

Pos logit contributions
urt+0.034
 job+0.030
 contents+0.030
gy+0.030
lock+0.029
Neg logit contributions
 concert-0.064
La-0.064
 films-0.062
 announcements-0.062
 pian-0.062
Token the
Token ID262
Feature activation+1.70e-02

Pos logit contributions
urt+0.000
gy+0.000
lock+0.000
 contents+0.000
 job+0.000
Neg logit contributions
 concert-0.000
 pian-0.000
La-0.000
 films-0.000
 crowds-0.000
Token chim
Token ID18205
Feature activation+6.18e-02

Pos logit contributions
urt+0.026
lock+0.025
ucer+0.025
ometer+0.024
gy+0.024
Neg logit contributions
 films-0.032
 concert-0.032
 movies-0.032
 crowds-0.031
 pian-0.031
Tokenney
Token ID1681
Feature activation+2.28e-02

Pos logit contributions
 contents+0.062
urt+0.060
lock+0.059
ucer+0.059
inky+0.055
Neg logit contributions
 movies-0.133
 films-0.132
 pastor-0.132
aw-0.131
 concert-0.130
Token.
Token ID13
Feature activation+2.31e-03

Pos logit contributions
urt+0.036
lock+0.036
 contents+0.035
gy+0.035
ometer+0.033
Neg logit contributions
 concert-0.041
 films-0.040
 pian-0.039
 crowds-0.038
 movies-0.038
Token The
Token ID383
Feature activation+4.83e-03

Pos logit contributions
urt+0.003
lock+0.003
 contents+0.003
ucer+0.003
gy+0.003
Neg logit contributions
 concert-0.005
 films-0.005
 movies-0.005
 crowds-0.005
 pian-0.005
Token farmer
Token ID18739
Feature activation-1.09e-02

Pos logit contributions
urt+0.008
lock+0.007
ucer+0.007
ometer+0.007
gy+0.007
Neg logit contributions
 concert-0.009
 films-0.009
 movies-0.009
 crowds-0.009
 pian-0.009
Token who
Token ID508
Feature activation-2.68e-03

Pos logit contributions
 films+0.023
 concert+0.022
 pian+0.022
 movies+0.022
 crowds+0.022
Neg logit contributions
urt-0.014
lock-0.014
gy-0.014
 contents-0.013
ometer-0.013
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
ucer+0.023
urt+0.022
lock+0.022
 contents+0.022
ages+0.021
Neg logit contributions
 concert-0.010
 movies-0.010
 films-0.009
 pian-0.009
 or-0.009
Token but
Token ID475
Feature activation-8.47e-03

Pos logit contributions
urt+0.020
lock+0.019
 contents+0.019
 Finally+0.019
gy+0.018
Neg logit contributions
 concert-0.019
 films-0.018
 movies-0.018
 crowds-0.018
 pian-0.017
Token it
Token ID340
Feature activation-6.76e-04

Pos logit contributions
 concert+0.012
 films+0.012
 movies+0.012
 pian+0.012
 crowds+0.011
Neg logit contributions
urt-0.017
 contents-0.016
lock-0.016
gy-0.016
ometer-0.016
Token was
Token ID373
Feature activation+1.58e-02

Pos logit contributions
 concert+0.001
 pian+0.001
 movies+0.001
 films+0.001
 strangers+0.001
Neg logit contributions
urt-0.001
lock-0.001
ucer-0.001
 contents-0.001
 Finally-0.001
Token too
Token ID1165
Feature activation+2.95e-02

Pos logit contributions
urt+0.036
lock+0.034
ucer+0.034
ages+0.033
ometer+0.033
Neg logit contributions
 concert-0.018
 pian-0.018
 crowds-0.017
 films-0.016
 strangers-0.016
Token torn
Token ID12445
Feature activation+6.17e-02

Pos logit contributions
stack+0.051
 contents+0.048
gy+0.048
lock+0.047
urt+0.047
Neg logit contributions
 concert-0.044
 pian-0.044
 crowds-0.041
 films-0.039
 movies-0.039
Token.
Token ID13
Feature activation+8.66e-03

Pos logit contributions
urt+0.123
lock+0.123
 contents+0.122
 Finally+0.121
ucer+0.119
Neg logit contributions
 crowds-0.068
 pian-0.068
 concert-0.067
aw-0.067
 movies-0.066
Token So
Token ID1406
Feature activation+2.19e-02

Pos logit contributions
urt+0.012
lock+0.012
 contents+0.012
ucer+0.012
gy+0.011
Neg logit contributions
 concert-0.017
 films-0.017
 movies-0.017
 crowds-0.016
 pian-0.016
Token,
Token ID11
Feature activation+9.81e-03

Pos logit contributions
urt+0.044
lock+0.042
 contents+0.042
ometer+0.041
ucer+0.041
Neg logit contributions
 concert-0.032
 films-0.030
 movies-0.030
 pian-0.030
 crowds-0.029
Token she
Token ID673
Feature activation-6.66e-04

Pos logit contributions
urt+0.018
lock+0.017
 contents+0.017
ometer+0.017
gy+0.017
Neg logit contributions
 concert-0.016
 films-0.016
 movies-0.015
 crowds-0.015
 pian-0.015
Token decided
Token ID3066
Feature activation+5.70e-03

Pos logit contributions
 concert+0.001
 pian+0.001
 films+0.001
 movies+0.001
 crowds+0.001
Neg logit contributions
urt-0.001
 contents-0.001
lock-0.001
ucer-0.001
ometer-0.001
Token help
Token ID1037
Feature activation+1.19e-03

Pos logit contributions
urt+0.020
ometer+0.020
ucer+0.019
 contents+0.019
inky+0.019
Neg logit contributions
 concert-0.023
 movies-0.022
 crowds-0.022
 films-0.022
 strangers-0.022
Token push
Token ID4574
Feature activation+1.95e-03

Pos logit contributions
urt+0.002
lock+0.002
 contents+0.002
gy+0.002
ometer+0.002
Neg logit contributions
 concert-0.002
 films-0.002
 movies-0.002
 crowds-0.002
 pian-0.002
Token himself
Token ID2241
Feature activation-4.16e-03

Pos logit contributions
urt+0.003
lock+0.003
 contents+0.003
gy+0.003
ometer+0.003
Neg logit contributions
 concert-0.003
 films-0.003
 movies-0.003
 crowds-0.003
 pian-0.003
Token onto
Token ID4291
Feature activation-3.64e-03

Pos logit contributions
 movies+0.006
 films+0.006
 concert+0.006
 crowds+0.006
 pian+0.006
Neg logit contributions
urt-0.007
 Finally-0.007
ucer-0.007
 contents-0.007
lock-0.007
Token the
Token ID262
Feature activation+1.22e-02

Pos logit contributions
 films+0.005
 concert+0.005
 movies+0.005
 crowds+0.005
 pian+0.005
Neg logit contributions
urt-0.007
lock-0.007
 contents-0.007
ucer-0.007
ometer-0.007
Token chim
Token ID18205
Feature activation+6.13e-02

Pos logit contributions
urt+0.023
ucer+0.021
lock+0.020
ages+0.020
iane+0.019
Neg logit contributions
 movies-0.019
 crowds-0.019
 films-0.018
 opera-0.018
 pastor-0.018
Tokenney
Token ID1681
Feature activation+1.57e-02

Pos logit contributions
 contents+0.078
ucer+0.074
lock+0.072
 reward+0.072
 token+0.070
Neg logit contributions
aw-0.110
 movies-0.108
 films-0.105
 pastor-0.104
 pian-0.102
Token.
Token ID13
Feature activation+5.19e-03

Pos logit contributions
urt+0.028
ucer+0.027
 contents+0.027
lock+0.027
 Finally+0.026
Neg logit contributions
 movies-0.024
 films-0.024
 concert-0.023
 crowds-0.023
 pian-0.023
Token He
Token ID679
Feature activation+1.10e-02

Pos logit contributions
ucer+0.008
 contents+0.008
urt+0.008
gy+0.007
lock+0.007
Neg logit contributions
 movies-0.010
 films-0.009
 crowds-0.009
 concert-0.009
 pian-0.009
Token made
Token ID925
Feature activation+6.30e-03

Pos logit contributions
urt+0.016
lock+0.016
 contents+0.015
gy+0.015
ometer+0.015
Neg logit contributions
 concert-0.021
 films-0.021
 movies-0.021
 crowds-0.021
 pian-0.020
Token it
Token ID340
Feature activation+5.94e-03

Pos logit contributions
urt+0.009
lock+0.009
gy+0.009
ometer+0.009
 contents+0.009
Neg logit contributions
 films-0.012
 concert-0.012
 movies-0.012
 crowds-0.012
 pian-0.012
Token.
Token ID13
Feature activation-2.98e-03

Pos logit contributions
urt+0.013
lock+0.012
gy+0.012
ometer+0.012
 contents+0.012
Neg logit contributions
 concert-0.018
 films-0.018
 crowds-0.017
 movies-0.017
 pian-0.017
Token I
Token ID314
Feature activation-5.85e-03

Pos logit contributions
 concert+0.006
 films+0.006
 movies+0.006
 pian+0.006
 crowds+0.006
Neg logit contributions
urt-0.004
 contents-0.004
lock-0.004
gy-0.004
ucer-0.004
Token won
Token ID1839
Feature activation+6.60e-03

Pos logit contributions
 concert+0.013
 films+0.012
 movies+0.012
 pian+0.012
 crowds+0.012
Neg logit contributions
urt-0.007
lock-0.007
 contents-0.007
ucer-0.006
ometer-0.006
Token't
Token ID470
Feature activation+1.96e-02

Pos logit contributions
urt+0.008
lock+0.007
 contents+0.007
gy+0.007
ometer+0.007
Neg logit contributions
 concert-0.015
 films-0.015
 movies-0.014
 crowds-0.014
 pian-0.014
Token be
Token ID307
Feature activation+2.47e-02

Pos logit contributions
urt+0.028
lock+0.027
gy+0.026
 contents+0.026
ometer+0.025
Neg logit contributions
 films-0.039
 concert-0.039
 movies-0.038
 crowds-0.038
 pian-0.038
Token mad
Token ID8805
Feature activation+6.12e-02

Pos logit contributions
urt+0.031
gy+0.031
 contents+0.029
 job+0.029
lock+0.028
Neg logit contributions
 concert-0.051
 films-0.050
 movies-0.050
 pian-0.048
 crowds-0.048
Token at
Token ID379
Feature activation+7.37e-04

Pos logit contributions
urt+0.104
lock+0.097
ometer+0.097
gy+0.094
 Finally+0.093
Neg logit contributions
 concert-0.107
 films-0.102
 pian-0.100
 movies-0.099
 announcements-0.096
Token you
Token ID345
Feature activation+1.08e-02

Pos logit contributions
urt+0.001
gy+0.001
 contents+0.001
lock+0.001
 job+0.001
Neg logit contributions
 concert-0.001
 pian-0.001
 films-0.001
 announcements-0.001
 crowds-0.001
Token".
Token ID1911
Feature activation+1.25e-02

Pos logit contributions
urt+0.017
lock+0.016
 contents+0.016
gy+0.015
ometer+0.015
Neg logit contributions
 concert-0.020
 films-0.020
 movies-0.020
 crowds-0.020
 pian-0.019
Token
Token ID220
Feature activation+2.61e-02

Pos logit contributions
urt+0.018
lock+0.017
 contents+0.017
ucer+0.016
gy+0.016
Neg logit contributions
 films-0.024
 concert-0.024
 movies-0.024
 crowds-0.024
 pian-0.023
Token
Token ID198
Feature activation+4.06e-04

Pos logit contributions
 contents+0.047
urt+0.046
lock+0.046
ucer+0.045
gy+0.044
Neg logit contributions
 films-0.039
 movies-0.039
 concert-0.038
 crowds-0.037
aw-0.036
Token,
Token ID11
Feature activation+9.29e-03

Pos logit contributions
ucer+0.017
urt+0.017
lock+0.017
 contents+0.017
ometer+0.017
Neg logit contributions
 concert-0.010
 movies-0.010
 films-0.009
 crowds-0.009
 pian-0.009
Token but
Token ID475
Feature activation-6.69e-03

Pos logit contributions
urt+0.015
lock+0.015
 contents+0.014
 Finally+0.014
gy+0.014
Neg logit contributions
 concert-0.017
 films-0.016
 movies-0.016
 crowds-0.016
 pian-0.016
Token it
Token ID340
Feature activation+1.85e-03

Pos logit contributions
 concert+0.009
 movies+0.008
 films+0.008
 crowds+0.008
 pian+0.008
Neg logit contributions
urt-0.014
 contents-0.013
lock-0.013
gy-0.013
inky-0.013
Token was
Token ID373
Feature activation+1.73e-02

Pos logit contributions
urt+0.003
lock+0.003
ucer+0.003
 contents+0.003
 Finally+0.003
Neg logit contributions
 concert-0.003
 pian-0.003
 movies-0.003
 films-0.003
 strangers-0.003
Token too
Token ID1165
Feature activation+3.14e-02

Pos logit contributions
urt+0.037
lock+0.036
ucer+0.035
ometer+0.034
 contents+0.034
Neg logit contributions
 concert-0.021
 pian-0.020
 films-0.020
 crowds-0.020
 movies-0.020
Token broken
Token ID5445
Feature activation+6.12e-02

Pos logit contributions
gy+0.054
stack+0.053
 contents+0.052
urt+0.052
 Finally+0.052
Neg logit contributions
 concert-0.046
 pian-0.045
 crowds-0.043
 pastor-0.042
 movies-0.042
Token to
Token ID284
Feature activation+2.65e-02

Pos logit contributions
urt+0.105
lock+0.103
 contents+0.102
ucer+0.099
 Finally+0.099
Neg logit contributions
 concert-0.097
 films-0.095
 movies-0.095
 crowds-0.094
 pian-0.093
Token play
Token ID711
Feature activation+9.29e-03

Pos logit contributions
urt+0.041
ometer+0.041
 contents+0.040
 Finally+0.039
ages+0.039
Neg logit contributions
 concert-0.048
 movies-0.046
 films-0.044
 crowds-0.044
 strangers-0.043
Token with
Token ID351
Feature activation+3.01e-02

Pos logit contributions
urt+0.015
lock+0.014
 contents+0.014
gy+0.014
ometer+0.013
Neg logit contributions
 concert-0.017
 films-0.017
 crowds-0.016
 movies-0.016
 pian-0.016
Token again
Token ID757
Feature activation-1.00e-02

Pos logit contributions
urt+0.060
lock+0.059
 contents+0.057
ometer+0.057
ucer+0.056
Neg logit contributions
 concert-0.041
 films-0.041
 movies-0.041
 crowds-0.040
 pian-0.039
Token.
Token ID13
Feature activation+1.25e-03

Pos logit contributions
 concert+0.019
 films+0.019
 movies+0.019
 crowds+0.018
 pian+0.018
Neg logit contributions
urt-0.015
lock-0.014
 contents-0.014
gy-0.014
ometer-0.014
Token but
Token ID475
Feature activation-3.16e-04

Pos logit contributions
urt+0.014
lock+0.014
 Finally+0.013
 contents+0.013
gy+0.013
Neg logit contributions
 films-0.014
 concert-0.014
 movies-0.013
 pian-0.013
 crowds-0.013
Token it
Token ID340
Feature activation+2.53e-03

Pos logit contributions
 concert+0.001
 films+0.001
 movies+0.001
 crowds+0.001
 pian+0.001
Neg logit contributions
urt-0.001
lock-0.001
 contents-0.000
gy-0.000
ometer-0.000
Token was
Token ID373
Feature activation+1.57e-02

Pos logit contributions
urt+0.004
lock+0.004
 contents+0.004
gy+0.004
ometer+0.004
Neg logit contributions
 concert-0.005
 films-0.005
 movies-0.005
 pian-0.004
 crowds-0.004
Token still
Token ID991
Feature activation+2.10e-02

Pos logit contributions
urt+0.030
lock+0.029
 contents+0.029
gy+0.028
ometer+0.028
Neg logit contributions
 concert-0.024
 films-0.023
 movies-0.023
 crowds-0.023
 pian-0.023
Token all
Token ID477
Feature activation+1.14e-02

Pos logit contributions
urt+0.036
lock+0.035
 contents+0.035
gy+0.034
ometer+0.034
Neg logit contributions
 concert-0.035
 films-0.035
 movies-0.034
 crowds-0.034
 pian-0.034
Token messed
Token ID32621
Feature activation+6.11e-02

Pos logit contributions
urt+0.018
lock+0.018
 contents+0.017
gy+0.017
ometer+0.017
Neg logit contributions
 concert-0.021
 films-0.020
 movies-0.020
 pian-0.020
 crowds-0.020
Token up
Token ID510
Feature activation-3.77e-06

Pos logit contributions
lock+0.106
urt+0.103
 Finally+0.101
ucer+0.099
 contents+0.099
Neg logit contributions
 films-0.094
 movies-0.094
 concert-0.092
 pian-0.090
 crowds-0.090
Token.
Token ID13
Feature activation+1.66e-02

Feature not active on prev token;
no predictions were affected.
Token Mama
Token ID40084
Feature activation-2.64e-02

Pos logit contributions
 contents+0.025
urt+0.025
lock+0.025
ucer+0.024
gy+0.024
Neg logit contributions
 crowds-0.030
 movies-0.030
 films-0.029
 concert-0.029
 pian-0.029
Token got
Token ID1392
Feature activation+5.78e-03

Pos logit contributions
 films+0.050
 concert+0.050
 crowds+0.048
 movies+0.048
 pian+0.048
Neg logit contributions
urt-0.040
lock-0.039
gy-0.038
 contents-0.037
ometer-0.037
Token really
Token ID1107
Feature activation+4.04e-03

Pos logit contributions
urt+0.011
lock+0.011
 contents+0.010
gy+0.010
ometer+0.010
Neg logit contributions
 concert-0.009
 films-0.009
 movies-0.009
 crowds-0.008
 pian-0.008
Token Melody
Token ID38906
Feature activation-1.19e-02

Pos logit contributions
 concert+0.000
 films+0.000
 crowds+0.000
 movies+0.000
 pian+0.000
Neg logit contributions
urt-0.000
lock-0.000
 contents-0.000
gy-0.000
 Finally-0.000
Token was
Token ID373
Feature activation-4.02e-03

Pos logit contributions
 films+0.027
 crowds+0.026
 concert+0.026
 pian+0.026
 movies+0.025
Neg logit contributions
gy-0.013
urt-0.013
 contents-0.013
lock-0.013
sy-0.012
Token swimming
Token ID14899
Feature activation+3.62e-03

Pos logit contributions
 concert+0.008
 crowds+0.008
 films+0.008
 pian+0.007
 movies+0.007
Neg logit contributions
gy-0.005
 Finally-0.005
urt-0.005
 contents-0.005
sy-0.005
Token around
Token ID1088
Feature activation+2.24e-03

Pos logit contributions
urt+0.006
 contents+0.006
gy+0.006
lock+0.006
sy+0.005
Neg logit contributions
 concert-0.007
 films-0.006
 pian-0.006
 crowds-0.006
 announcements-0.006
Token the
Token ID262
Feature activation+8.02e-03

Pos logit contributions
gy+0.003
 job+0.003
 Finally+0.003
 contents+0.003
urt+0.003
Neg logit contributions
 announcements-0.005
 concert-0.004
 pian-0.004
 films-0.004
aw-0.004
Token tank
Token ID6873
Feature activation+6.10e-02

Pos logit contributions
 contents+0.013
urt+0.013
gy+0.013
lock+0.012
 Finally+0.012
Neg logit contributions
 pian-0.014
 concert-0.014
 crowds-0.014
 announcements-0.014
 films-0.013
Token,
Token ID11
Feature activation+7.46e-03

Pos logit contributions
urt+0.105
lock+0.101
 contents+0.100
ometer+0.098
gy+0.098
Neg logit contributions
 concert-0.102
 films-0.101
 movies-0.100
 crowds-0.098
 pian-0.096
Token when
Token ID618
Feature activation-1.50e-02

Pos logit contributions
urt+0.012
gy+0.011
lock+0.011
 contents+0.011
 Finally+0.011
Neg logit contributions
 concert-0.014
 films-0.014
 pian-0.013
 crowds-0.013
 movies-0.013
Token suddenly
Token ID6451
Feature activation+1.47e-03

Pos logit contributions
 concert+0.026
 crowds+0.025
 movies+0.025
 films+0.024
 pian+0.024
Neg logit contributions
urt-0.026
 contents-0.025
gy-0.025
lock-0.025
ometer-0.024
Token she
Token ID673
Feature activation-4.04e-03

Pos logit contributions
urt+0.002
lock+0.002
 contents+0.002
gy+0.002
ometer+0.002
Neg logit contributions
 films-0.003
 concert-0.003
 movies-0.003
 crowds-0.003
 pian-0.003
Token noticed
Token ID6810
Feature activation-4.98e-04

Pos logit contributions
 films+0.008
 concert+0.008
 pian+0.008
 movies+0.008
 crowds+0.007
Neg logit contributions
 contents-0.006
lock-0.006
gy-0.006
urt-0.006
ucer-0.005
Token mom
Token ID1995
Feature activation-6.47e-03

Pos logit contributions
 concert+0.001
 films+0.001
 movies+0.001
 pian+0.001
 crowds+0.001
Neg logit contributions
urt-0.001
ucer-0.001
lock-0.001
gy-0.001
 Finally-0.001
Tokenmy
Token ID1820
Feature activation-3.81e-03

Pos logit contributions
 concert+0.012
 movies+0.012
 films+0.012
 pian+0.012
 crowds+0.011
Neg logit contributions
lock-0.010
ometer-0.010
 contents-0.010
urt-0.010
ucer-0.009
Token didn
Token ID1422
Feature activation-7.84e-03

Pos logit contributions
 concert+0.007
 films+0.006
 movies+0.006
 crowds+0.006
 pian+0.006
Neg logit contributions
urt-0.007
lock-0.006
 contents-0.006
gy-0.006
ometer-0.006
Token't
Token ID470
Feature activation+2.45e-02

Pos logit contributions
 concert+0.016
 films+0.016
 movies+0.015
 pian+0.015
 crowds+0.015
Neg logit contributions
urt-0.011
 contents-0.010
lock-0.010
gy-0.010
ometer-0.010
Token get
Token ID651
Feature activation+1.66e-02

Pos logit contributions
urt+0.042
ometer+0.041
 contents+0.040
gy+0.039
lock+0.039
Neg logit contributions
 concert-0.043
 crowds-0.041
 films-0.041
 pian-0.040
 movies-0.040
Token mad
Token ID8805
Feature activation+6.10e-02

Pos logit contributions
urt+0.028
lock+0.027
 contents+0.027
gy+0.026
ometer+0.026
Neg logit contributions
 concert-0.028
 films-0.028
 movies-0.027
 crowds-0.027
 pian-0.027
Token,
Token ID11
Feature activation+1.75e-02

Pos logit contributions
urt+0.107
lock+0.098
ometer+0.097
 job+0.096
 contents+0.095
Neg logit contributions
 concert-0.104
 films-0.102
 pian-0.098
La-0.097
 movies-0.097
Token but
Token ID475
Feature activation-7.71e-03

Pos logit contributions
urt+0.029
 Finally+0.028
lock+0.027
 contents+0.027
gy+0.027
Neg logit contributions
 concert-0.031
 films-0.030
 movies-0.029
 pian-0.029
 crowds-0.028
Token she
Token ID673
Feature activation-5.10e-03

Pos logit contributions
 concert+0.014
 films+0.013
 movies+0.013
 pian+0.013
 crowds+0.013
Neg logit contributions
urt-0.013
lock-0.013
 contents-0.012
ometer-0.012
gy-0.012
Token did
Token ID750
Feature activation-9.25e-03

Pos logit contributions
 concert+0.009
 movies+0.009
 pian+0.009
 films+0.009
 crowds+0.008
Neg logit contributions
urt-0.008
lock-0.008
 contents-0.008
ucer-0.008
gy-0.008
Token order
Token ID1502
Feature activation+5.97e-03

Pos logit contributions
 concert+0.018
 movies+0.018
 films+0.017
 crowds+0.017
 pian+0.017
Neg logit contributions
urt-0.014
ometer-0.013
ucer-0.013
lock-0.013
gy-0.013
Token wanted
Token ID2227
Feature activation+1.13e-02

Pos logit contributions
gy+0.005
urt+0.004
lock+0.004
ometer+0.004
gen+0.004
Neg logit contributions
 crowds-0.010
 concert-0.010
 films-0.010
 pian-0.010
 announcements-0.010
Token to
Token ID284
Feature activation+8.45e-04

Pos logit contributions
urt+0.016
lock+0.015
 contents+0.015
gy+0.014
ometer+0.014
Neg logit contributions
 concert-0.023
 films-0.022
 pian-0.022
 crowds-0.022
 movies-0.022
Token look
Token ID804
Feature activation-8.24e-03

Pos logit contributions
urt+0.001
ometer+0.001
ucer+0.001
 contents+0.001
 Finally+0.001
Neg logit contributions
 concert-0.002
 movies-0.002
 films-0.001
 crowds-0.001
 strangers-0.001
Token inside
Token ID2641
Feature activation+1.13e-02

Pos logit contributions
 concert+0.016
 films+0.016
 pian+0.015
 movies+0.015
 crowds+0.015
Neg logit contributions
urt-0.012
lock-0.011
 contents-0.011
ometer-0.011
 Finally-0.011
Token the
Token ID262
Feature activation+1.48e-02

Pos logit contributions
urt+0.017
lock+0.016
 contents+0.016
 Finally+0.015
 job+0.015
Neg logit contributions
 concert-0.022
 pian-0.021
 announcements-0.020
La-0.020
 films-0.020
Token chim
Token ID18205
Feature activation+6.10e-02

Pos logit contributions
urt+0.023
lock+0.022
ucer+0.021
gy+0.021
 contents+0.021
Neg logit contributions
 movies-0.028
 films-0.028
 concert-0.028
 crowds-0.027
 pian-0.027
Tokenney
Token ID1681
Feature activation+2.26e-02

Pos logit contributions
 contents+0.069
urt+0.064
lock+0.063
ucer+0.061
 virtue+0.060
Neg logit contributions
 movies-0.128
 films-0.126
aw-0.123
 pastor-0.122
 concert-0.121
Token,
Token ID11
Feature activation+4.98e-03

Pos logit contributions
urt+0.038
lock+0.036
 contents+0.036
gy+0.036
ucer+0.036
Neg logit contributions
 films-0.038
 movies-0.038
 concert-0.038
 crowds-0.037
 pian-0.036
Token but
Token ID475
Feature activation-1.00e-02

Pos logit contributions
urt+0.007
lock+0.007
 contents+0.007
 Finally+0.007
 hammer+0.007
Neg logit contributions
 concert-0.010
 pian-0.009
 films-0.009
La-0.009
 pastor-0.009
Token it
Token ID340
Feature activation-2.81e-03

Pos logit contributions
 concert+0.016
 movies+0.016
 films+0.016
 crowds+0.015
 pian+0.015
Neg logit contributions
urt-0.018
 contents-0.017
lock-0.017
ucer-0.017
gy-0.017
Token was
Token ID373
Feature activation+9.89e-03

Pos logit contributions
 concert+0.004
 movies+0.004
 films+0.004
 pian+0.004
 crowds+0.004
Neg logit contributions
urt-0.005
ucer-0.005
lock-0.005
 contents-0.005
inky-0.005
Token of
Token ID286
Feature activation+5.90e-03

Pos logit contributions
urt+0.028
lock+0.027
 contents+0.026
ometer+0.026
ucer+0.026
Neg logit contributions
 concert-0.036
 films-0.036
 crowds-0.035
 movies-0.035
 pian-0.034
Token animal
Token ID5044
Feature activation-1.14e-02

Pos logit contributions
urt+0.009
lock+0.009
 contents+0.008
gy+0.008
ometer+0.008
Neg logit contributions
 concert-0.011
 films-0.011
 movies-0.011
 crowds-0.011
 pian-0.011
Token the
Token ID262
Feature activation+1.76e-03

Pos logit contributions
 concert+0.021
 films+0.021
 movies+0.020
 crowds+0.020
 pian+0.020
Neg logit contributions
urt-0.018
lock-0.017
 contents-0.017
gy-0.017
ometer-0.017
Token wing
Token ID8539
Feature activation+1.02e-02

Pos logit contributions
urt+0.003
ucer+0.003
 Finally+0.003
lock+0.003
ometer+0.003
Neg logit contributions
 concert-0.003
 strangers-0.003
 films-0.003
 crowds-0.003
 pian-0.003
Token had
Token ID550
Feature activation+3.69e-02

Pos logit contributions
urt+0.016
lock+0.015
 contents+0.015
ometer+0.015
gy+0.015
Neg logit contributions
 concert-0.019
 films-0.018
 movies-0.018
 crowds-0.018
 pian-0.018
Token belonged
Token ID19611
Feature activation+6.09e-02

Pos logit contributions
urt+0.057
lock+0.055
 contents+0.054
ometer+0.054
gy+0.053
Neg logit contributions
 concert-0.069
 films-0.067
 movies-0.066
 crowds-0.066
 pian-0.065
Token to
Token ID284
Feature activation+1.25e-02

Pos logit contributions
urt+0.104
lock+0.099
ometer+0.098
 contents+0.098
ucer+0.097
Neg logit contributions
 concert-0.102
 films-0.099
 movies-0.098
 crowds-0.098
 pian-0.097
Token.
Token ID13
Feature activation+1.93e-03

Pos logit contributions
urt+0.017
lock+0.017
 contents+0.016
gy+0.016
 Finally+0.016
Neg logit contributions
 concert-0.025
 films-0.025
 pian-0.024
 movies-0.024
 crowds-0.024
Token
Token ID198
Feature activation+2.93e-03

Pos logit contributions
 contents+0.004
lock+0.004
ucer+0.003
urt+0.003
gy+0.003
Neg logit contributions
 movies-0.003
 films-0.003
 concert-0.003
La-0.003
 crowds-0.003
Token
Token ID198
Feature activation+2.03e-03

Pos logit contributions
urt+0.005
lock+0.005
 contents+0.005
gy+0.005
ometer+0.005
Neg logit contributions
 concert-0.005
 films-0.005
 movies-0.005
 pian-0.005
 crowds-0.005
TokenThe
Token ID464
Feature activation-5.36e-03

Pos logit contributions
urt+0.003
gy+0.003
ometer+0.003
lock+0.003
 contents+0.003
Neg logit contributions
 concert-0.004
 films-0.004
 movies-0.004
 pian-0.004
La-0.004
Token
Token ID220
Feature activation+3.02e-02

Pos logit contributions
ucer+0.010
urt+0.010
gy+0.010
 contents+0.010
lock+0.010
Neg logit contributions
 movies-0.013
 films-0.013
 crowds-0.012
 concert-0.012
 dolphins-0.012
Token
Token ID198
Feature activation+6.15e-03

Pos logit contributions
urt+0.049
lock+0.048
 contents+0.048
gy+0.047
ucer+0.046
Neg logit contributions
 films-0.051
 movies-0.051
 concert-0.051
 crowds-0.049
 pian-0.048
Token
Token ID198
Feature activation+9.54e-03

Pos logit contributions
urt+0.009
lock+0.009
 contents+0.009
gy+0.009
ometer+0.009
Neg logit contributions
 concert-0.011
 films-0.011
 movies-0.011
 crowds-0.011
 pian-0.011
TokenM
Token ID44
Feature activation-1.78e-02

Pos logit contributions
urt+0.015
gy+0.015
ometer+0.015
 contents+0.014
lock+0.014
Neg logit contributions
 movies-0.017
 films-0.017
 concert-0.017
 crowds-0.016
La-0.016
Tokenummy
Token ID13513
Feature activation-3.63e-02

Pos logit contributions
 concert+0.040
 films+0.040
 movies+0.039
 crowds+0.039
 pian+0.039
Neg logit contributions
urt-0.020
lock-0.019
 contents-0.019
gy-0.018
ometer-0.018
Token pe
Token ID613
Feature activation-7.60e-02

Pos logit contributions
 films+0.074
 crowds+0.074
 concert+0.074
 pian+0.072
 movies+0.072
Neg logit contributions
urt-0.051
lock-0.047
gy-0.046
 contents-0.045
ometer-0.044
Tokenered
Token ID1068
Feature activation-1.68e-02

Pos logit contributions
 concert+0.129
 films+0.127
 movies+0.124
 crowds+0.123
 pian+0.122
Neg logit contributions
urt-0.132
lock-0.127
 contents-0.125
gy-0.123
ometer-0.122
Token down
Token ID866
Feature activation-1.36e-02

Pos logit contributions
 concert+0.037
 crowds+0.034
 announcements+0.034
 pian+0.034
La+0.033
Neg logit contributions
urt-0.021
 contents-0.020
lock-0.019
 seed-0.019
gy-0.019
Token at
Token ID379
Feature activation-1.06e-02

Pos logit contributions
 concert+0.026
 films+0.026
 pian+0.025
 crowds+0.025
 movies+0.025
Neg logit contributions
urt-0.020
lock-0.020
 Finally-0.019
 contents-0.019
ometer-0.019
Token the
Token ID262
Feature activation-3.68e-03

Pos logit contributions
 concert+0.016
 films+0.016
 movies+0.016
 crowds+0.015
 pian+0.015
Neg logit contributions
urt-0.020
lock-0.019
 contents-0.019
gy-0.019
ometer-0.019
Token ground
Token ID2323
Feature activation+3.65e-03

Pos logit contributions
 concert+0.006
 films+0.006
 pian+0.006
 crowds+0.006
 movies+0.006
Neg logit contributions
urt-0.006
lock-0.006
 contents-0.006
ometer-0.006
gy-0.005
Token crying
Token ID13774
Feature activation+1.94e-02

Pos logit contributions
La+0.057
 dolphins+0.056
 films+0.056
 pian+0.056
 concert+0.055
Neg logit contributions
urt-0.045
gy-0.044
lock-0.043
 hammer-0.042
 contents-0.041
Token.
Token ID13
Feature activation-4.48e-03

Pos logit contributions
urt+0.020
gy+0.020
lock+0.019
ometer+0.018
 job+0.017
Neg logit contributions
 concert-0.045
 films-0.044
 pian-0.044
 pastor-0.042
 dolphins-0.042
Token
Token ID198
Feature activation-1.19e-02

Pos logit contributions
 concert+0.010
 pian+0.010
 films+0.010
 movies+0.010
 crowds+0.010
Neg logit contributions
urt-0.005
 Finally-0.005
lock-0.005
 contents-0.004
ometer-0.004
Token
Token ID198
Feature activation-6.14e-03

Pos logit contributions
 concert+0.029
 crowds+0.029
 pian+0.029
 films+0.028
 movies+0.028
Neg logit contributions
 Finally-0.012
urt-0.012
lock-0.011
ometer-0.011
 contents-0.011
TokenHe
Token ID1544
Feature activation+3.65e-03

Pos logit contributions
 films+0.012
 concert+0.012
 pian+0.012
 crowds+0.012
 movies+0.012
Neg logit contributions
urt-0.008
lock-0.008
 contents-0.008
 Finally-0.008
ucer-0.008
Token pe
Token ID613
Feature activation-7.04e-02

Pos logit contributions
gy+0.005
lock+0.005
urt+0.004
 contents+0.004
sy+0.004
Neg logit contributions
 films-0.008
 concert-0.008
 crowds-0.008
 movies-0.008
 pian-0.007
Tokenered
Token ID1068
Feature activation-2.01e-03

Pos logit contributions
 concert+0.120
 films+0.119
 movies+0.117
 crowds+0.117
 pian+0.114
Neg logit contributions
urt-0.122
lock-0.117
gy-0.116
 contents-0.116
ucer-0.112
Token out
Token ID503
Feature activation-2.32e-02

Pos logit contributions
 concert+0.004
 pian+0.004
 films+0.004
La+0.004
 crowds+0.004
Neg logit contributions
urt-0.003
gy-0.003
 contents-0.003
lock-0.002
 tight-0.002
Token to
Token ID284
Feature activation+5.95e-03

Pos logit contributions
 concert+0.049
aw+0.047
La+0.046
 pastor+0.046
 pian+0.046
Neg logit contributions
urt-0.026
 tight-0.025
lock-0.025
 contents-0.025
 tightly-0.024
Token the
Token ID262
Feature activation-1.22e-02

Pos logit contributions
lock+0.008
urt+0.008
 contents+0.008
gy+0.008
 job+0.008
Neg logit contributions
 pian-0.012
 concert-0.011
 films-0.011
aw-0.011
 crowds-0.011
Token hallway
Token ID23959
Feature activation+9.69e-03

Pos logit contributions
 concert+0.022
 movies+0.021
 films+0.021
 crowds+0.021
 pian+0.021
Neg logit contributions
urt-0.020
lock-0.019
 contents-0.019
gy-0.019
ometer-0.019
Token the
Token ID262
Feature activation-1.21e-02

Pos logit contributions
urt+0.011
lock+0.011
 contents+0.011
gy+0.011
ometer+0.010
Neg logit contributions
 concert-0.012
 films-0.012
 pian-0.011
 crowds-0.011
 movies-0.011
Token couch
Token ID18507
Feature activation+5.68e-03

Pos logit contributions
 concert+0.021
 movies+0.021
 films+0.020
 crowds+0.020
 pian+0.019
Neg logit contributions
urt-0.023
lock-0.021
gy-0.020
ucer-0.020
ometer-0.020
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
urt+0.010
 contents+0.009
lock+0.009
ucer+0.009
ometer+0.009
Neg logit contributions
 concert-0.010
 movies-0.010
 films-0.010
 crowds-0.009
 pian-0.009
Token She
Token ID1375
Feature activation-1.04e-02

Pos logit contributions
 concert+0.031
 films+0.030
 movies+0.030
 crowds+0.030
 pian+0.030
Neg logit contributions
urt-0.027
lock-0.026
 contents-0.025
gy-0.025
ometer-0.025
Token said
Token ID531
Feature activation-8.08e-03

Pos logit contributions
 films+0.021
 concert+0.021
 crowds+0.021
 movies+0.020
 pian+0.020
Neg logit contributions
urt-0.014
lock-0.014
gy-0.014
 contents-0.013
ometer-0.013
Token n
Token ID299
Feature activation-7.01e-02

Pos logit contributions
 concert+0.017
 films+0.016
 crowds+0.016
 movies+0.016
 pian+0.015
Neg logit contributions
urt-0.011
lock-0.011
ometer-0.010
 Finally-0.010
gy-0.010
Tokenaps
Token ID1686
Feature activation-3.79e-02

Pos logit contributions
 concert+0.146
 films+0.144
 movies+0.142
 crowds+0.141
 pian+0.140
Neg logit contributions
urt-0.097
lock-0.092
gy-0.090
 contents-0.089
ometer-0.088
Token were
Token ID547
Feature activation+1.12e-03

Pos logit contributions
 concert+0.071
 films+0.069
 movies+0.068
 pian+0.067
 crowds+0.067
Neg logit contributions
urt-0.059
lock-0.057
 contents-0.056
ometer-0.055
 Finally-0.054
Token good
Token ID922
Feature activation+1.08e-02

Pos logit contributions
urt+0.002
lock+0.001
 contents+0.001
gy+0.001
ometer+0.001
Neg logit contributions
 concert-0.002
 films-0.002
 movies-0.002
 crowds-0.002
 pian-0.002
Token for
Token ID329
Feature activation-1.05e-02

Pos logit contributions
 contents+0.015
urt+0.015
 job+0.015
ages+0.014
lock+0.014
Neg logit contributions
 concert-0.021
La-0.020
 pian-0.020
 films-0.020
 movies-0.020
Token him
Token ID683
Feature activation-1.81e-03

Pos logit contributions
 concert+0.015
 films+0.014
 pian+0.014
 crowds+0.014
 movies+0.014
Neg logit contributions
urt-0.021
lock-0.020
 contents-0.019
gy-0.019
 Finally-0.019
Token.
Token ID13
Feature activation+8.12e-03

Pos logit contributions
 concert+0.011
 films+0.011
 movies+0.011
 crowds+0.010
 pian+0.010
Neg logit contributions
urt-0.009
lock-0.008
gy-0.008
 contents-0.008
ometer-0.008
Token
Token ID198
Feature activation+1.31e-02

Pos logit contributions
 contents+0.013
urt+0.013
lock+0.013
ucer+0.013
gy+0.013
Neg logit contributions
 concert-0.014
 movies-0.014
 films-0.014
 pian-0.013
 crowds-0.013
Token
Token ID198
Feature activation+9.71e-03

Pos logit contributions
 contents+0.024
urt+0.024
lock+0.023
gy+0.023
ucer+0.022
Neg logit contributions
 films-0.020
 concert-0.020
 movies-0.020
La-0.019
 pian-0.019
TokenMom
Token ID29252
Feature activation-5.69e-03

Pos logit contributions
urt+0.016
gy+0.016
 contents+0.016
ometer+0.015
ucer+0.015
Neg logit contributions
 concert-0.017
 movies-0.016
La-0.016
 films-0.016
 pian-0.016
Tokenmy
Token ID1820
Feature activation-1.18e-02

Pos logit contributions
 concert+0.010
 movies+0.010
 pian+0.010
 films+0.009
La+0.009
Neg logit contributions
 contents-0.009
lock-0.009
 job-0.008
ometer-0.008
 virtue-0.008
Token pe
Token ID613
Feature activation-6.91e-02

Pos logit contributions
 films+0.021
 concert+0.021
 crowds+0.020
 movies+0.020
 pian+0.020
Neg logit contributions
urt-0.020
lock-0.019
gy-0.019
 contents-0.018
ometer-0.018
Tokenered
Token ID1068
Feature activation-1.07e-02

Pos logit contributions
 concert+0.112
 films+0.110
 movies+0.108
 crowds+0.108
 pian+0.105
Neg logit contributions
urt-0.127
lock-0.122
 contents-0.120
gy-0.119
ometer-0.117
Token behind
Token ID2157
Feature activation-1.47e-03

Pos logit contributions
 concert+0.022
 films+0.021
 crowds+0.021
 movies+0.021
 pian+0.021
Neg logit contributions
urt-0.015
 contents-0.014
lock-0.014
gy-0.014
 Finally-0.014
Token the
Token ID262
Feature activation+1.67e-03

Pos logit contributions
 concert+0.003
 pian+0.003
 films+0.003
 pastor+0.003
 announcements+0.003
Neg logit contributions
 contents-0.002
 Finally-0.002
urt-0.002
gy-0.002
lock-0.002
Token hedge
Token ID19859
Feature activation+2.21e-02

Pos logit contributions
urt+0.003
lock+0.003
ometer+0.003
gy+0.003
ages+0.003
Neg logit contributions
 concert-0.003
 crowds-0.003
 films-0.003
 movies-0.003
 strangers-0.003
Token and
Token ID290
Feature activation-1.02e-02

Pos logit contributions
urt+0.044
 contents+0.042
ucer+0.042
lock+0.042
ometer+0.041
Neg logit contributions
 concert-0.031
 films-0.029
 movies-0.029
 crowds-0.028
 pian-0.028
Token"
Token ID1
Feature activation+1.63e-02

Pos logit contributions
 concert+0.004
 films+0.004
 movies+0.004
 pian+0.004
 crowds+0.004
Neg logit contributions
urt-0.003
 contents-0.003
gy-0.003
lock-0.003
ometer-0.002
TokenAnna
Token ID31160
Feature activation-1.46e-02

Pos logit contributions
lock+0.022
 contents+0.022
urt+0.020
ucer+0.020
ometer+0.020
Neg logit contributions
 concert-0.034
 films-0.034
 crowds-0.033
 movies-0.032
 oper-0.031
Token?
Token ID30
Feature activation-1.78e-02

Pos logit contributions
 concert+0.025
 films+0.025
 movies+0.024
 crowds+0.024
 pian+0.024
Neg logit contributions
urt-0.025
lock-0.024
 contents-0.024
gy-0.023
ometer-0.023
Token Ben
Token ID3932
Feature activation-1.73e-02

Pos logit contributions
 concert+0.033
 films+0.033
 movies+0.033
 crowds+0.032
 pian+0.032
Neg logit contributions
urt-0.028
 contents-0.026
lock-0.026
gy-0.025
ucer-0.025
Token?
Token ID30
Feature activation-2.01e-02

Pos logit contributions
 concert+0.032
 films+0.032
 movies+0.032
 pian+0.031
 crowds+0.031
Neg logit contributions
urt-0.027
lock-0.026
gy-0.025
 contents-0.025
ometer-0.025
Token Where
Token ID6350
Feature activation-6.86e-02

Pos logit contributions
 films+0.036
 concert+0.036
 movies+0.035
 crowds+0.035
 pian+0.034
Neg logit contributions
urt-0.032
 contents-0.031
lock-0.031
gy-0.030
ucer-0.029
Token are
Token ID389
Feature activation+1.09e-02

Pos logit contributions
 concert+0.130
 films+0.129
 crowds+0.126
 movies+0.125
 pian+0.124
Neg logit contributions
urt-0.103
lock-0.102
 contents-0.097
ometer-0.097
gy-0.097
Token you
Token ID345
Feature activation-6.72e-03

Pos logit contributions
urt+0.021
lock+0.021
 contents+0.020
gy+0.020
ometer+0.020
Neg logit contributions
 concert-0.016
 films-0.016
 movies-0.016
 crowds-0.015
 pian-0.015
Token?"
Token ID1701
Feature activation+1.60e-03

Pos logit contributions
 pian+0.011
 films+0.011
 announcements+0.010
 crowds+0.010
 concert+0.010
Neg logit contributions
urt-0.012
lock-0.012
gy-0.012
 job-0.011
 contents-0.011
Token Mom
Token ID11254
Feature activation-4.01e-02

Pos logit contributions
 Finally+0.001
ometer+0.001
 job+0.001
 tight+0.001
lock+0.001
Neg logit contributions
 concert-0.004
aw-0.004
 pian-0.004
 crowds-0.004
 films-0.004
Token calls
Token ID3848
Feature activation+8.77e-04

Pos logit contributions
 concert+0.081
 dolphins+0.077
 pian+0.074
 films+0.074
 movies+0.073
Neg logit contributions
 trunk-0.048
urt-0.048
ometer-0.047
lock-0.047
 tight-0.046
Token
Token ID220
Feature activation+3.12e-02

Pos logit contributions
urt+0.012
lock+0.012
 contents+0.012
 Finally+0.011
gy+0.011
Neg logit contributions
 concert-0.020
 films-0.019
 movies-0.019
 pian-0.019
 crowds-0.019
Token
Token ID198
Feature activation+9.70e-03

Pos logit contributions
urt+0.051
lock+0.049
 contents+0.049
gy+0.048
ucer+0.047
Neg logit contributions
 films-0.054
 concert-0.054
 movies-0.053
 crowds-0.052
 pian-0.051
Token
Token ID198
Feature activation+1.07e-02

Pos logit contributions
urt+0.012
lock+0.012
 contents+0.012
 Finally+0.012
gy+0.011
Neg logit contributions
 concert-0.021
 films-0.021
 pian-0.020
 crowds-0.020
 movies-0.020
TokenAb
Token ID4826
Feature activation-1.07e-02

Pos logit contributions
urt+0.017
lock+0.017
 contents+0.017
 Finally+0.016
gy+0.016
Neg logit contributions
 concert-0.019
 films-0.019
 movies-0.019
 pian-0.019
 crowds-0.018
Tokenby
Token ID1525
Feature activation-2.92e-02

Pos logit contributions
 concert+0.025
 films+0.025
 movies+0.025
 crowds+0.024
 pian+0.024
Neg logit contributions
urt-0.012
lock-0.011
 contents-0.011
gy-0.011
ometer-0.010
Token pe
Token ID613
Feature activation-6.86e-02

Pos logit contributions
 concert+0.053
 films+0.052
 movies+0.051
 crowds+0.051
 pian+0.051
Neg logit contributions
urt-0.047
lock-0.045
 contents-0.044
gy-0.044
ometer-0.043
Tokenep
Token ID538
Feature activation+1.10e-02

Pos logit contributions
 concert+0.121
 films+0.120
 movies+0.118
 crowds+0.118
 pian+0.115
Neg logit contributions
urt-0.115
lock-0.111
gy-0.109
 contents-0.109
ometer-0.105
Tokened
Token ID276
Feature activation-1.77e-03

Pos logit contributions
urt+0.018
ometer+0.018
 contents+0.018
lock+0.018
 Finally+0.017
Neg logit contributions
 concert-0.018
aw-0.017
 movies-0.017
 crowds-0.017
 films-0.017
Token through
Token ID832
Feature activation-1.53e-02

Pos logit contributions
 concert+0.003
 films+0.003
 pian+0.003
 crowds+0.003
 movies+0.003
Neg logit contributions
urt-0.003
gy-0.003
lock-0.003
 contents-0.003
 Finally-0.003
Token the
Token ID262
Feature activation+7.29e-03

Pos logit contributions
 concert+0.032
 pian+0.031
 films+0.030
La+0.029
aw+0.029
Neg logit contributions
 contents-0.020
lock-0.019
gy-0.019
urt-0.017
 shovel-0.017
Token window
Token ID4324
Feature activation+6.06e-03

Pos logit contributions
urt+0.011
lock+0.011
 contents+0.010
gy+0.010
ometer+0.010
Neg logit contributions
 concert-0.014
 films-0.014
 movies-0.014
 crowds-0.013
 pian-0.013
Token beautiful
Token ID4950
Feature activation-6.82e-04

Pos logit contributions
urt+0.003
lock+0.003
gy+0.003
 contents+0.003
ometer+0.003
Neg logit contributions
 concert-0.004
 films-0.004
 movies-0.004
 pian-0.004
 crowds-0.004
Token princess
Token ID21752
Feature activation-1.85e-02

Pos logit contributions
 films+0.001
 concert+0.001
 crowds+0.001
 movies+0.001
 pian+0.001
Neg logit contributions
urt-0.001
 contents-0.001
gy-0.001
lock-0.001
ometer-0.001
Token dress
Token ID6576
Feature activation+1.77e-03

Pos logit contributions
 concert+0.033
 films+0.032
 movies+0.031
 pian+0.031
 crowds+0.031
Neg logit contributions
urt-0.031
lock-0.029
 contents-0.029
 Finally-0.029
ometer-0.029
Token.
Token ID13
Feature activation+2.66e-03

Pos logit contributions
urt+0.003
lock+0.003
 contents+0.003
 Finally+0.003
gy+0.003
Neg logit contributions
 concert-0.003
 films-0.003
 movies-0.003
 pian-0.003
 crowds-0.003
Token She
Token ID1375
Feature activation+1.01e-02

Pos logit contributions
lock+0.006
 contents+0.005
urt+0.005
ucer+0.005
sy+0.005
Neg logit contributions
 movies-0.003
 films-0.003
 concert-0.003
 crowds-0.003
 them-0.003
Token tw
Token ID665
Feature activation-6.86e-02

Pos logit contributions
urt+0.015
lock+0.015
 contents+0.015
gy+0.014
ometer+0.014
Neg logit contributions
 concert-0.019
 films-0.019
 movies-0.019
 crowds-0.018
 pian-0.018
Tokenirled
Token ID49376
Feature activation-4.62e-03

Pos logit contributions
 concert+0.117
 films+0.116
 crowds+0.116
 movies+0.115
 pian+0.111
Neg logit contributions
urt-0.119
 contents-0.112
gy-0.111
lock-0.111
ucer-0.108
Token and
Token ID290
Feature activation+2.95e-03

Pos logit contributions
 pastor+0.009
 concert+0.009
 films+0.008
 strangers+0.008
 announcements+0.008
Neg logit contributions
 Finally-0.007
urt-0.007
 contents-0.007
gy-0.006
 tight-0.006
Token danced
Token ID39480
Feature activation-6.41e-03

Pos logit contributions
urt+0.006
lock+0.006
 contents+0.006
ometer+0.006
gy+0.006
Neg logit contributions
 concert-0.004
 films-0.004
 movies-0.004
 pian-0.004
 crowds-0.004
Token around
Token ID1088
Feature activation-1.58e-02

Pos logit contributions
 concert+0.011
 films+0.011
 movies+0.010
 crowds+0.010
 pian+0.010
Neg logit contributions
urt-0.011
lock-0.010
 contents-0.010
gy-0.010
 Finally-0.010
Token the
Token ID262
Feature activation+1.37e-03

Pos logit contributions
 concert+0.030
 films+0.029
 crowds+0.028
 movies+0.028
 pian+0.028
Neg logit contributions
urt-0.023
gy-0.023
 Finally-0.022
 contents-0.022
lock-0.022
Token dad
Token ID9955
Feature activation-1.34e-02

Pos logit contributions
 concert+0.012
 dolphins+0.011
 movies+0.011
 crowds+0.011
 films+0.011
Neg logit contributions
urt-0.017
 contents-0.016
ometer-0.016
inky-0.016
lock-0.016
Token's
Token ID338
Feature activation-2.11e-02

Pos logit contributions
 concert+0.022
 films+0.022
 movies+0.022
 crowds+0.021
 pian+0.021
Neg logit contributions
urt-0.023
ometer-0.022
 contents-0.022
lock-0.022
ucer-0.022
Token room
Token ID2119
Feature activation+8.40e-04

Pos logit contributions
 concert+0.038
 films+0.038
 movies+0.037
 crowds+0.037
 pian+0.036
Neg logit contributions
urt-0.035
lock-0.032
ucer-0.032
gy-0.032
ometer-0.031
Token.
Token ID13
Feature activation+4.21e-03

Pos logit contributions
urt+0.001
ometer+0.001
lock+0.001
gy+0.001
ucer+0.001
Neg logit contributions
 concert-0.001
 films-0.001
 movies-0.001
 crowds-0.001
 pian-0.001
Token She
Token ID1375
Feature activation+3.60e-03

Pos logit contributions
ucer+0.007
urt+0.007
lock+0.006
gy+0.006
 contents+0.006
Neg logit contributions
 movies-0.008
 films-0.007
 crowds-0.007
 concert-0.007
 pastor-0.007
Token pe
Token ID613
Feature activation-6.84e-02

Pos logit contributions
urt+0.005
lock+0.005
 contents+0.005
gy+0.005
 Finally+0.005
Neg logit contributions
 concert-0.007
 films-0.007
 crowds-0.007
 movies-0.007
 pian-0.007
Tokenered
Token ID1068
Feature activation-5.19e-03

Pos logit contributions
 concert+0.103
 films+0.101
 movies+0.099
 pian+0.098
 crowds+0.097
Neg logit contributions
urt-0.129
lock-0.125
 contents-0.122
ometer-0.121
gy-0.120
Token inside
Token ID2641
Feature activation+2.14e-03

Pos logit contributions
 concert+0.011
 films+0.010
 movies+0.010
 pian+0.010
 crowds+0.010
Neg logit contributions
 contents-0.007
urt-0.007
lock-0.007
gy-0.007
 Finally-0.006
Token and
Token ID290
Feature activation-5.41e-03

Pos logit contributions
urt+0.004
lock+0.004
 contents+0.004
 Finally+0.003
gy+0.003
Neg logit contributions
 concert-0.004
 films-0.004
 pian-0.004
 crowds-0.003
 movies-0.003
Token saw
Token ID2497
Feature activation+7.60e-03

Pos logit contributions
 concert+0.009
 films+0.009
 movies+0.009
 crowds+0.009
 pian+0.009
Neg logit contributions
urt-0.009
lock-0.009
 contents-0.009
gy-0.008
ometer-0.008
Token lots
Token ID6041
Feature activation+3.67e-04

Pos logit contributions
urt+0.011
lock+0.011
gy+0.011
ometer+0.010
 contents+0.010
Neg logit contributions
 movies-0.015
 films-0.015
 concert-0.015
 crowds-0.014
 pian-0.014
Token Spot
Token ID15899
Feature activation-1.09e-04

Pos logit contributions
 concert+0.028
 films+0.027
 pian+0.026
 crowds+0.026
 movies+0.026
Neg logit contributions
urt-0.025
lock-0.024
gy-0.024
ometer-0.024
 contents-0.023
Token's
Token ID338
Feature activation-6.34e-03

Pos logit contributions
 films+0.000
 movies+0.000
 concert+0.000
 crowds+0.000
 pian+0.000
Neg logit contributions
urt-0.000
 contents-0.000
lock-0.000
ucer-0.000
 Finally-0.000
Token nose
Token ID9686
Feature activation-1.51e-02

Pos logit contributions
 concert+0.012
 films+0.012
 movies+0.012
 crowds+0.012
 pian+0.012
Neg logit contributions
urt-0.009
lock-0.009
ucer-0.008
 contents-0.008
gy-0.008
Token.
Token ID13
Feature activation-7.68e-03

Pos logit contributions
 concert+0.027
 films+0.027
 movies+0.027
 crowds+0.026
 pian+0.026
Neg logit contributions
urt-0.024
lock-0.023
 contents-0.023
gy-0.022
ometer-0.022
Token He
Token ID679
Feature activation+2.70e-03

Pos logit contributions
 concert+0.018
 films+0.017
 pian+0.017
 dolphins+0.017
 crowds+0.017
Neg logit contributions
 Finally-0.009
urt-0.008
ometer-0.008
lock-0.008
gy-0.007
Token n
Token ID299
Feature activation-6.83e-02

Pos logit contributions
urt+0.003
lock+0.003
 contents+0.003
gy+0.003
 Finally+0.003
Neg logit contributions
 films-0.006
 concert-0.006
 crowds-0.005
 movies-0.005
 pian-0.005
Tokenuzz
Token ID4715
Feature activation-2.05e-02

Pos logit contributions
 concert+0.139
 films+0.137
 movies+0.135
 crowds+0.134
 pian+0.133
Neg logit contributions
urt-0.095
lock-0.091
 contents-0.089
gy-0.088
ometer-0.087
Tokenled
Token ID992
Feature activation-7.26e-03

Pos logit contributions
 concert+0.041
 films+0.040
 movies+0.038
 pian+0.038
 crowds+0.037
Neg logit contributions
urt-0.029
gy-0.029
 contents-0.027
ucer-0.027
ometer-0.027
Token her
Token ID607
Feature activation-2.19e-02

Pos logit contributions
 concert+0.017
 pian+0.015
 films+0.015
 crowds+0.014
 pastor+0.014
Neg logit contributions
urt-0.009
 tight-0.009
gy-0.008
 contents-0.008
 tightly-0.008
Token and
Token ID290
Feature activation-2.77e-03

Pos logit contributions
 concert+0.043
 films+0.042
 movies+0.041
 crowds+0.041
 pian+0.040
Neg logit contributions
urt-0.033
lock-0.032
 contents-0.031
gy-0.031
ometer-0.031
Token made
Token ID925
Feature activation-1.14e-02

Pos logit contributions
 concert+0.005
 films+0.005
 movies+0.005
 crowds+0.005
 pian+0.005
Neg logit contributions
urt-0.004
lock-0.004
gy-0.004
 contents-0.004
ometer-0.004
TokenWhy
Token ID5195
Feature activation-2.91e-02

Pos logit contributions
urt+0.006
lock+0.006
 contents+0.006
ometer+0.006
 Finally+0.006
Neg logit contributions
 concert-0.009
 films-0.009
 movies-0.008
 crowds-0.008
 pian-0.008
Token are
Token ID389
Feature activation+2.38e-03

Pos logit contributions
 concert+0.054
 films+0.053
 movies+0.052
 crowds+0.051
 pian+0.051
Neg logit contributions
urt-0.046
lock-0.044
 contents-0.044
gy-0.043
ometer-0.042
Token you
Token ID345
Feature activation-8.22e-03

Pos logit contributions
urt+0.004
lock+0.004
 contents+0.004
ometer+0.004
 Finally+0.004
Neg logit contributions
 concert-0.004
 films-0.004
 crowds-0.004
 movies-0.004
 pian-0.004
Token bald
Token ID28273
Feature activation-9.73e-03

Pos logit contributions
 concert+0.016
 crowds+0.015
 movies+0.015
 pian+0.015
aw+0.015
Neg logit contributions
urt-0.012
gy-0.012
ometer-0.010
 hammer-0.010
lock-0.010
Token?
Token ID30
Feature activation-2.34e-02

Pos logit contributions
 concert+0.017
 films+0.017
 pian+0.017
La+0.017
 announcements+0.016
Neg logit contributions
urt-0.016
gy-0.015
 job-0.015
lock-0.015
ometer-0.014
Token Where
Token ID6350
Feature activation-6.83e-02

Pos logit contributions
 concert+0.046
 films+0.044
 crowds+0.043
 movies+0.043
 pian+0.043
Neg logit contributions
urt-0.034
lock-0.033
ometer-0.032
gy-0.032
 Finally-0.032
Token is
Token ID318
Feature activation-1.46e-04

Pos logit contributions
 concert+0.125
 films+0.123
 movies+0.121
 crowds+0.120
 pian+0.118
Neg logit contributions
urt-0.108
lock-0.106
 contents-0.101
ometer-0.101
gy-0.101
Token your
Token ID534
Feature activation-1.84e-02

Pos logit contributions
 concert+0.000
 crowds+0.000
 films+0.000
 announcements+0.000
 pian+0.000
Neg logit contributions
urt-0.000
ometer-0.000
gy-0.000
lock-0.000
 job-0.000
Token hair
Token ID4190
Feature activation-7.92e-03

Pos logit contributions
 concert+0.037
 films+0.037
 movies+0.036
 crowds+0.036
 pian+0.036
Neg logit contributions
urt-0.025
lock-0.024
 contents-0.024
ometer-0.024
gy-0.023
Token?"
Token ID1701
Feature activation-1.83e-02

Pos logit contributions
 films+0.012
 concert+0.012
 movies+0.011
 crowds+0.011
 pian+0.011
Neg logit contributions
urt-0.015
lock-0.015
gy-0.015
 contents-0.015
ometer-0.014
Token Tom
Token ID4186
Feature activation-2.14e-02

Pos logit contributions
 concert+0.042
 films+0.041
 pian+0.039
 movies+0.039
 crowds+0.039
Neg logit contributions
urt-0.021
lock-0.021
 Finally-0.020
ometer-0.019
gy-0.019
Token me
Token ID502
Feature activation-2.96e-02

Pos logit contributions
urt+0.010
lock+0.010
 contents+0.010
gy+0.010
 Finally+0.009
Neg logit contributions
 concert-0.008
 pian-0.008
 films-0.008
 crowds-0.007
 announcements-0.007
Token."
Token ID526
Feature activation+9.50e-03

Pos logit contributions
 concert+0.055
 films+0.054
 movies+0.054
 crowds+0.053
 pian+0.052
Neg logit contributions
urt-0.046
lock-0.044
 contents-0.044
ometer-0.042
gy-0.042
Token
Token ID198
Feature activation+8.94e-04

Pos logit contributions
 Finally+0.010
urt+0.010
lock+0.010
ometer+0.010
gy+0.009
Neg logit contributions
 concert-0.023
 films-0.022
 pian-0.022
 dolphins-0.022
 movies-0.022
Token
Token ID198
Feature activation-3.81e-03

Pos logit contributions
urt+0.001
lock+0.001
 contents+0.001
gy+0.001
ometer+0.001
Neg logit contributions
 concert-0.002
 films-0.002
 movies-0.002
 crowds-0.002
 pian-0.002
TokenTom
Token ID13787
Feature activation-1.14e-03

Pos logit contributions
 concert+0.006
 movies+0.005
 films+0.005
 pian+0.005
 crowds+0.005
Neg logit contributions
urt-0.007
 contents-0.007
lock-0.007
gy-0.007
ometer-0.007
Token pe
Token ID613
Feature activation-6.82e-02

Pos logit contributions
 films+0.002
 concert+0.002
 pian+0.002
 oper+0.002
 movies+0.002
Neg logit contributions
gy-0.002
 trunk-0.002
urt-0.002
 contents-0.002
lock-0.001
Tokeneks
Token ID2573
Feature activation-1.86e-02

Pos logit contributions
 concert+0.107
 crowds+0.107
 movies+0.106
 films+0.106
 pian+0.103
Neg logit contributions
urt-0.124
gy-0.122
lock-0.122
 contents-0.117
ucer-0.117
Token from
Token ID422
Feature activation-7.97e-03

Pos logit contributions
 concert+0.042
 pian+0.040
 pastor+0.038
 films+0.038
 crowds+0.038
Neg logit contributions
lock-0.017
gy-0.016
 Finally-0.016
 contents-0.016
 hinges-0.016
Token under
Token ID739
Feature activation-1.30e-02

Pos logit contributions
 concert+0.013
 pian+0.013
 films+0.013
 movies+0.012
 crowds+0.012
Neg logit contributions
gy-0.014
lock-0.013
urt-0.013
 contents-0.013
 Finally-0.013
Token the
Token ID262
Feature activation-5.76e-03

Pos logit contributions
 pian+0.027
 announcements+0.027
 concert+0.027
 films+0.026
aw+0.026
Neg logit contributions
gy-0.016
lock-0.015
urt-0.015
 contents-0.014
 job-0.014
Token covers
Token ID8698
Feature activation-2.05e-02

Pos logit contributions
 films+0.012
 concert+0.012
 pian+0.012
 crowds+0.012
 movies+0.012
Neg logit contributions
lock-0.007
 contents-0.007
urt-0.007
ucer-0.006
gy-0.006
Token a
Token ID257
Feature activation+4.80e-03

Pos logit contributions
urt+0.011
lock+0.010
 contents+0.010
gy+0.010
ometer+0.010
Neg logit contributions
 concert-0.015
 films-0.015
 movies-0.014
 crowds-0.014
 pian-0.014
Token magic
Token ID5536
Feature activation-1.66e-02

Pos logit contributions
urt+0.008
lock+0.007
gy+0.007
 contents+0.007
 Finally+0.007
Neg logit contributions
 concert-0.009
 pian-0.009
 movies-0.009
 films-0.009
 crowds-0.008
Token wand
Token ID11569
Feature activation-1.08e-02

Pos logit contributions
 films+0.032
 concert+0.032
 crowds+0.031
 movies+0.031
 pian+0.030
Neg logit contributions
urt-0.024
lock-0.024
 contents-0.024
ometer-0.023
gy-0.022
Token.
Token ID13
Feature activation+3.05e-03

Pos logit contributions
 concert+0.021
 films+0.020
 movies+0.020
 crowds+0.020
 pian+0.020
Neg logit contributions
urt-0.016
lock-0.016
 contents-0.015
gy-0.015
ometer-0.015
Token She
Token ID1375
Feature activation+4.27e-03

Pos logit contributions
ucer+0.005
urt+0.005
lock+0.005
 contents+0.005
gy+0.005
Neg logit contributions
 movies-0.005
 crowds-0.005
 films-0.005
 pian-0.005
 concert-0.005
Token tw
Token ID665
Feature activation-6.79e-02

Pos logit contributions
urt+0.005
gy+0.005
 contents+0.005
lock+0.005
 Finally+0.005
Neg logit contributions
 films-0.009
 crowds-0.009
 concert-0.009
 movies-0.009
 strangers-0.008
Tokenirled
Token ID49376
Feature activation-5.92e-03

Pos logit contributions
 concert+0.112
 films+0.110
 movies+0.109
 crowds+0.108
 pian+0.105
Neg logit contributions
urt-0.122
lock-0.116
 contents-0.116
gy-0.115
ometer-0.112
Token around
Token ID1088
Feature activation-1.55e-02

Pos logit contributions
 films+0.011
 concert+0.011
 pastor+0.011
 strangers+0.011
 movies+0.010
Neg logit contributions
urt-0.009
 contents-0.008
 Finally-0.008
gy-0.008
 tight-0.008
Token in
Token ID287
Feature activation-8.23e-04

Pos logit contributions
 concert+0.027
 films+0.027
 pastor+0.027
 announcements+0.027
 crowds+0.027
Neg logit contributions
gy-0.023
urt-0.023
 Finally-0.023
lock-0.023
 contents-0.022
Token her
Token ID607
Feature activation-8.10e-03

Pos logit contributions
 concert+0.001
 pastor+0.001
 films+0.001
 announcements+0.001
 crowds+0.001
Neg logit contributions
 contents-0.001
urt-0.001
lock-0.001
 hammer-0.001
 job-0.001
Token princess
Token ID21752
Feature activation-2.45e-02

Pos logit contributions
 concert+0.016
 films+0.016
 movies+0.015
 crowds+0.015
 announcements+0.015
Neg logit contributions
urt-0.012
 contents-0.012
gy-0.011
lock-0.011
ometer-0.011
Token behind
Token ID2157
Feature activation+5.96e-03

Pos logit contributions
 concert+0.001
 pian+0.001
 films+0.001
 movies+0.001
 crowds+0.001
Neg logit contributions
gy-0.001
lock-0.001
 contents-0.001
urt-0.001
ucer-0.001
Token a
Token ID257
Feature activation-1.40e-02

Pos logit contributions
urt+0.009
lock+0.009
 contents+0.009
gy+0.008
 Finally+0.008
Neg logit contributions
 concert-0.011
 films-0.011
 pian-0.011
 movies-0.011
 crowds-0.011
Token tree
Token ID5509
Feature activation-7.03e-03

Pos logit contributions
 concert+0.031
 films+0.031
 movies+0.030
 pian+0.030
 crowds+0.030
Neg logit contributions
urt-0.017
lock-0.016
 contents-0.016
gy-0.015
ometer-0.015
Token.
Token ID13
Feature activation+2.20e-03

Pos logit contributions
 concert+0.014
 films+0.014
 movies+0.013
 crowds+0.013
 pian+0.013
Neg logit contributions
urt-0.010
lock-0.010
 contents-0.010
gy-0.009
ometer-0.009
Token She
Token ID1375
Feature activation+4.47e-03

Pos logit contributions
urt+0.004
ucer+0.004
 contents+0.003
gy+0.003
lock+0.003
Neg logit contributions
 crowds-0.004
 concert-0.004
 movies-0.004
aw-0.004
 films-0.004
Token pe
Token ID613
Feature activation-6.77e-02

Pos logit contributions
lock+0.006
urt+0.006
gy+0.006
 contents+0.006
ometer+0.005
Neg logit contributions
 films-0.010
 concert-0.009
 movies-0.009
 crowds-0.009
 pian-0.009
Tokenep
Token ID538
Feature activation+5.44e-03

Pos logit contributions
 concert+0.115
 films+0.113
 movies+0.111
 crowds+0.110
 pian+0.109
Neg logit contributions
urt-0.116
lock-0.112
 contents-0.111
gy-0.109
ometer-0.108
Tokened
Token ID276
Feature activation-1.86e-03

Pos logit contributions
urt+0.009
 contents+0.008
lock+0.008
ometer+0.008
 Finally+0.008
Neg logit contributions
 concert-0.009
 films-0.009
 movies-0.009
 crowds-0.009
 pian-0.009
Token around
Token ID1088
Feature activation-1.19e-02

Pos logit contributions
 concert+0.004
 films+0.004
 pian+0.003
 crowds+0.003
 movies+0.003
Neg logit contributions
gy-0.003
urt-0.003
lock-0.003
 contents-0.003
 Finally-0.003
Token the
Token ID262
Feature activation-6.90e-03

Pos logit contributions
 films+0.026
 concert+0.026
 pian+0.025
 crowds+0.024
 pastor+0.024
Neg logit contributions
lock-0.014
gy-0.013
 shovel-0.012
urt-0.012
 Finally-0.012
Token corner
Token ID5228
Feature activation-6.38e-03

Pos logit contributions
 concert+0.014
 films+0.014
 movies+0.014
 crowds+0.014
 pian+0.014
Neg logit contributions
urt-0.010
lock-0.009
 contents-0.009
gy-0.009
ometer-0.009
Token.
Token ID13
Feature activation+6.97e-03

Pos logit contributions
urt+0.017
lock+0.016
gy+0.016
 contents+0.015
ometer+0.015
Neg logit contributions
 concert-0.026
 films-0.026
 pian-0.025
 movies-0.025
 crowds-0.025
Token
Token ID220
Feature activation+2.17e-02

Pos logit contributions
lock+0.011
 contents+0.011
urt+0.011
ucer+0.011
ometer+0.010
Neg logit contributions
 movies-0.012
 films-0.012
 concert-0.012
 crowds-0.012
 dolphins-0.011
Token
Token ID198
Feature activation-1.97e-03

Pos logit contributions
urt+0.038
 contents+0.037
lock+0.037
gy+0.035
ucer+0.035
Neg logit contributions
 films-0.034
 movies-0.034
 concert-0.033
 crowds-0.032
 pian-0.032
Token
Token ID198
Feature activation+2.18e-03

Pos logit contributions
 concert+0.004
 films+0.004
 movies+0.004
 crowds+0.004
 pian+0.004
Neg logit contributions
urt-0.003
lock-0.003
 contents-0.003
gy-0.003
ometer-0.003
TokenThe
Token ID464
Feature activation-2.55e-03

Pos logit contributions
urt+0.003
lock+0.003
ometer+0.003
 contents+0.003
gy+0.003
Neg logit contributions
 concert-0.004
 films-0.004
 movies-0.004
 pian-0.004
 crowds-0.004
Token cam
Token ID12172
Feature activation-6.75e-02

Pos logit contributions
 concert+0.005
 films+0.005
 pian+0.005
 crowds+0.005
 movies+0.005
Neg logit contributions
urt-0.004
 contents-0.004
lock-0.004
gy-0.004
ometer-0.004
Tokenels
Token ID1424
Feature activation-3.41e-03

Pos logit contributions
 movies+0.125
 pian+0.124
 concert+0.123
 crowds+0.119
 announcements+0.118
Neg logit contributions
gy-0.117
urt-0.112
olving-0.100
lock-0.097
gen-0.096
Token were
Token ID547
Feature activation+8.05e-04

Pos logit contributions
 concert+0.007
 films+0.007
 pian+0.006
 movies+0.006
 crowds+0.006
Neg logit contributions
urt-0.005
lock-0.005
gy-0.005
 contents-0.005
 Finally-0.005
Token so
Token ID523
Feature activation+8.47e-05

Pos logit contributions
urt+0.001
lock+0.001
 contents+0.001
gy+0.001
ometer+0.001
Neg logit contributions
 concert-0.002
 films-0.002
 movies-0.002
 crowds-0.002
 pian-0.002
Token happy
Token ID3772
Feature activation+1.64e-02

Pos logit contributions
urt+0.000
lock+0.000
 contents+0.000
ucer+0.000
gy+0.000
Neg logit contributions
 films-0.000
 movies-0.000
 concert-0.000
 crowds-0.000
 pian-0.000
Token to
Token ID284
Feature activation+3.64e-03

Pos logit contributions
urt+0.024
gy+0.022
lock+0.022
 contents+0.022
ometer+0.021
Neg logit contributions
 concert-0.033
 films-0.032
 pian-0.032
 crowds-0.031
 movies-0.031
Token a
Token ID257
Feature activation-9.17e-03

Pos logit contributions
 concert+0.038
 crowds+0.037
 pian+0.037
 films+0.037
 pastor+0.036
Neg logit contributions
urt-0.024
 contents-0.023
lock-0.023
 job-0.022
 Finally-0.022
Token show
Token ID905
Feature activation+2.59e-03

Pos logit contributions
 concert+0.019
 films+0.019
 crowds+0.019
 movies+0.018
 pian+0.018
Neg logit contributions
urt-0.012
lock-0.011
 contents-0.011
ometer-0.011
gy-0.011
Token for
Token ID329
Feature activation-1.16e-02

Pos logit contributions
urt+0.003
lock+0.003
 job+0.003
ometer+0.003
gy+0.003
Neg logit contributions
 concert-0.005
 films-0.005
 crowds-0.005
 announcements-0.005
 pian-0.005
Token us
Token ID514
Feature activation-1.78e-02

Pos logit contributions
 concert+0.020
 films+0.020
 movies+0.019
 crowds+0.019
 pian+0.019
Neg logit contributions
urt-0.019
lock-0.019
 contents-0.018
gy-0.018
ometer-0.018
Token.
Token ID13
Feature activation-1.59e-02

Pos logit contributions
 concert+0.035
 films+0.035
 crowds+0.034
 movies+0.034
 pian+0.034
Neg logit contributions
urt-0.025
lock-0.024
 contents-0.023
gy-0.023
ometer-0.023
Token Where
Token ID6350
Feature activation-6.75e-02

Pos logit contributions
 concert+0.031
 films+0.030
 movies+0.030
 crowds+0.030
 pian+0.030
Neg logit contributions
urt-0.023
lock-0.022
 contents-0.022
gy-0.022
ometer-0.021
Token is
Token ID318
Feature activation-4.28e-03

Pos logit contributions
 concert+0.122
 films+0.120
 movies+0.118
 crowds+0.117
 pian+0.116
Neg logit contributions
urt-0.108
lock-0.105
 contents-0.102
ometer-0.101
gy-0.101
Token your
Token ID534
Feature activation-1.86e-02

Pos logit contributions
 concert+0.008
 crowds+0.007
 films+0.007
 pian+0.007
 movies+0.007
Neg logit contributions
urt-0.007
lock-0.006
 contents-0.006
ometer-0.006
 job-0.006
Token menu
Token ID6859
Feature activation-9.64e-03

Pos logit contributions
 concert+0.038
 films+0.038
 movies+0.037
 crowds+0.037
 pian+0.037
Neg logit contributions
urt-0.025
lock-0.024
 contents-0.023
gy-0.023
ucer-0.023
Token?"
Token ID1701
Feature activation-7.75e-03

Pos logit contributions
 announcements+0.017
 pian+0.016
 films+0.016
La+0.016
 crowds+0.016
Neg logit contributions
lock-0.016
urt-0.016
 job-0.016
 contents-0.015
ometer-0.015
Token
Token ID198
Feature activation-7.27e-03

Pos logit contributions
 concert+0.017
 films+0.017
 pian+0.017
 crowds+0.017
 movies+0.016
Neg logit contributions
urt-0.009
 Finally-0.009
lock-0.009
ometer-0.008
 contents-0.008
Token"
Token ID1
Feature activation+1.27e-02

Pos logit contributions
urt+0.005
ometer+0.005
 contents+0.005
gy+0.005
 job+0.005
Neg logit contributions
La-0.003
aw-0.003
 concert-0.003
Leave-0.003
 pian-0.003
TokenWho
Token ID8241
Feature activation-5.20e-02

Pos logit contributions
urt+0.019
lock+0.018
 contents+0.018
gy+0.017
ometer+0.017
Neg logit contributions
 concert-0.025
 films-0.025
 movies-0.024
 pian-0.024
 crowds-0.024
Token is
Token ID318
Feature activation-8.67e-03

Pos logit contributions
 concert+0.099
 films+0.098
 crowds+0.097
 movies+0.096
 pian+0.095
Neg logit contributions
urt-0.077
lock-0.076
 contents-0.073
gy-0.072
ometer-0.071
Token he
Token ID339
Feature activation-3.46e-02

Pos logit contributions
 concert+0.016
 films+0.016
 crowds+0.016
 pian+0.016
 movies+0.016
Neg logit contributions
urt-0.013
lock-0.012
gy-0.012
ometer-0.012
 contents-0.012
Token?
Token ID30
Feature activation-2.53e-02

Pos logit contributions
 films+0.054
 pian+0.054
 concert+0.054
 crowds+0.052
 movies+0.052
Neg logit contributions
urt-0.062
lock-0.061
gy-0.060
 contents-0.059
 job-0.057
Token Where
Token ID6350
Feature activation-6.75e-02

Pos logit contributions
 concert+0.047
 films+0.046
 movies+0.046
 crowds+0.045
 pian+0.045
Neg logit contributions
urt-0.039
 contents-0.038
lock-0.037
gy-0.036
ucer-0.036
Token is
Token ID318
Feature activation-3.72e-03

Pos logit contributions
 concert+0.126
 films+0.125
 crowds+0.122
 movies+0.121
 pian+0.120
Neg logit contributions
urt-0.104
lock-0.102
 contents-0.098
ometer-0.097
gy-0.097
Token he
Token ID339
Feature activation-3.60e-02

Pos logit contributions
 concert+0.007
 films+0.007
 pian+0.007
 crowds+0.007
 movies+0.007
Neg logit contributions
urt-0.006
lock-0.005
 contents-0.005
gy-0.005
ometer-0.005
Token?"
Token ID1701
Feature activation-1.13e-02

Pos logit contributions
 films+0.058
 pian+0.057
 concert+0.057
 crowds+0.055
 movies+0.055
Neg logit contributions
urt-0.064
lock-0.063
gy-0.062
 contents-0.061
 job-0.059
Token Lily
Token ID20037
Feature activation-2.00e-02

Pos logit contributions
 concert+0.021
 films+0.021
 movies+0.020
 crowds+0.020
 pian+0.020
Neg logit contributions
urt-0.018
lock-0.017
 contents-0.017
gy-0.017
ometer-0.017
Token asked
Token ID1965
Feature activation+9.59e-03

Pos logit contributions
 films+0.039
 concert+0.038
 movies+0.038
 crowds+0.037
 pian+0.037
Neg logit contributions
urt-0.030
lock-0.028
gy-0.027
 contents-0.025
ometer-0.025
Token pick
Token ID2298
Feature activation-1.43e-02

Pos logit contributions
 concert+0.028
 movies+0.027
 films+0.026
 pian+0.025
 crowds+0.025
Neg logit contributions
urt-0.025
ucer-0.025
ometer-0.025
 contents-0.024
 Finally-0.023
Token it
Token ID340
Feature activation-1.57e-02

Pos logit contributions
 movies+0.015
 films+0.015
 concert+0.015
aw+0.014
 crowds+0.014
Neg logit contributions
lock-0.032
urt-0.032
gy-0.032
 contents-0.031
 hinges-0.031
Token up
Token ID510
Feature activation-2.69e-02

Pos logit contributions
 them+0.017
le+0.015
 movies+0.015
 films+0.014
 strangers+0.014
Neg logit contributions
ucer-0.034
bek-0.034
lock-0.034
 virtue-0.033
 Finally-0.033
Token.
Token ID13
Feature activation-8.32e-03

Pos logit contributions
 concert+0.038
 movies+0.037
 strangers+0.037
 films+0.036
 crowds+0.036
Neg logit contributions
urt-0.048
 contents-0.048
ucer-0.048
ometer-0.048
gy-0.047
Token She
Token ID1375
Feature activation-2.73e-03

Pos logit contributions
 movies+0.013
 concert+0.013
 crowds+0.013
 films+0.012
 pian+0.012
Neg logit contributions
urt-0.014
ucer-0.014
 contents-0.014
lock-0.014
 virtue-0.014
Token tw
Token ID665
Feature activation-6.74e-02

Pos logit contributions
 concert+0.005
 films+0.005
 movies+0.005
 pian+0.005
 crowds+0.005
Neg logit contributions
urt-0.004
lock-0.004
 contents-0.004
ometer-0.004
gy-0.004
Tokenirled
Token ID49376
Feature activation-9.63e-03

Pos logit contributions
 concert+0.096
 films+0.094
 movies+0.091
 pian+0.091
 crowds+0.090
Neg logit contributions
urt-0.131
lock-0.129
 contents-0.126
ometer-0.125
gy-0.123
Token it
Token ID340
Feature activation-7.72e-03

Pos logit contributions
 films+0.019
 concert+0.019
 pastor+0.018
 movies+0.018
 crowds+0.018
Neg logit contributions
urt-0.014
lock-0.013
 contents-0.013
 Finally-0.013
gy-0.013
Token around
Token ID1088
Feature activation-2.07e-02

Pos logit contributions
 concert+0.012
 films+0.011
 movies+0.011
 crowds+0.011
 pian+0.011
Neg logit contributions
urt-0.015
lock-0.014
 contents-0.014
gy-0.014
ometer-0.014
Token and
Token ID290
Feature activation-8.89e-03

Pos logit contributions
 films+0.035
 concert+0.035
 announcements+0.034
 crowds+0.034
 movies+0.034
Neg logit contributions
urt-0.035
gy-0.034
lock-0.034
 Finally-0.033
 contents-0.032
Token around
Token ID1088
Feature activation-2.03e-02

Pos logit contributions
 films+0.016
 concert+0.016
 movies+0.015
 crowds+0.015
 pian+0.015
Neg logit contributions
urt-0.014
lock-0.014
gy-0.013
 contents-0.013
 Finally-0.013
Token am
Token ID716
Feature activation-3.72e-02

Pos logit contributions
 concert+0.086
 films+0.085
 movies+0.083
 crowds+0.083
 pian+0.082
Neg logit contributions
urt-0.048
lock-0.047
 contents-0.046
gy-0.045
ometer-0.044
Token your
Token ID534
Feature activation-2.31e-02

Pos logit contributions
 concert+0.076
 crowds+0.075
 films+0.074
 movies+0.073
 pian+0.073
Neg logit contributions
urt-0.048
gy-0.046
lock-0.045
 Finally-0.044
ometer-0.044
Token family
Token ID1641
Feature activation-9.67e-03

Pos logit contributions
 concert+0.052
 films+0.051
 movies+0.051
 pian+0.050
 crowds+0.050
Neg logit contributions
urt-0.027
lock-0.026
 contents-0.025
gy-0.024
ucer-0.024
Token.
Token ID13
Feature activation-1.72e-02

Pos logit contributions
 concert+0.021
 films+0.020
 pian+0.020
 crowds+0.020
 movies+0.020
Neg logit contributions
urt-0.012
lock-0.012
gy-0.011
 contents-0.011
ometer-0.011
Token I
Token ID314
Feature activation-4.14e-02

Pos logit contributions
 concert+0.035
 films+0.034
 crowds+0.033
 movies+0.033
 pian+0.033
Neg logit contributions
lock-0.023
urt-0.023
ometer-0.023
 Finally-0.023
gy-0.022
Token do
Token ID466
Feature activation-6.74e-02

Pos logit contributions
 concert+0.090
 films+0.089
 movies+0.087
 crowds+0.087
 pian+0.087
Neg logit contributions
urt-0.051
lock-0.049
 contents-0.048
gy-0.047
ometer-0.046
Token not
Token ID407
Feature activation-1.08e-02

Pos logit contributions
 concert+0.153
 films+0.151
 movies+0.148
 crowds+0.148
 pian+0.147
Neg logit contributions
urt-0.076
lock-0.073
 contents-0.071
gy-0.069
ometer-0.068
Token care
Token ID1337
Feature activation-2.14e-02

Pos logit contributions
 films+0.023
 concert+0.023
 movies+0.023
 pian+0.023
 crowds+0.023
Neg logit contributions
urt-0.013
lock-0.012
gy-0.012
 contents-0.012
 job-0.011
Token about
Token ID546
Feature activation-2.13e-02

Pos logit contributions
 concert+0.034
 films+0.032
 crowds+0.032
 movies+0.031
 pian+0.031
Neg logit contributions
urt-0.040
lock-0.038
ometer-0.038
gy-0.037
 contents-0.037
Token the
Token ID262
Feature activation-1.40e-02

Pos logit contributions
 concert+0.036
 pian+0.034
 films+0.033
 announcements+0.033
 crowds+0.032
Neg logit contributions
urt-0.035
gy-0.034
 contents-0.033
lock-0.032
 job-0.031
Token honey
Token ID12498
Feature activation-2.31e-02

Pos logit contributions
 concert+0.020
 films+0.019
 pian+0.019
 movies+0.019
 crowds+0.019
Neg logit contributions
 contents-0.027
urt-0.027
lock-0.026
gy-0.025
ometer-0.025
Token him
Token ID683
Feature activation-8.75e-03

Pos logit contributions
 concert+0.001
 films+0.001
 pian+0.001
 dolphins+0.001
 crowds+0.001
Neg logit contributions
urt-0.001
gy-0.001
lock-0.001
 contents-0.000
 Finally-0.000
Token.
Token ID13
Feature activation+1.34e-03

Pos logit contributions
 concert+0.015
 films+0.015
 movies+0.015
 crowds+0.015
 pian+0.015
Neg logit contributions
urt-0.014
 contents-0.014
lock-0.014
ucer-0.013
ometer-0.013
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
 contents+0.002
urt+0.002
ucer+0.002
lock+0.002
gy+0.002
Neg logit contributions
 movies-0.002
 pian-0.002
 crowds-0.002
 films-0.002
 concert-0.002
Token
Token ID198
Feature activation-4.50e-03

Pos logit contributions
 concert+0.022
 films+0.022
 movies+0.021
 crowds+0.021
 pian+0.021
Neg logit contributions
urt-0.020
lock-0.019
 contents-0.019
gy-0.018
ometer-0.018
TokenHe
Token ID1544
Feature activation-8.90e-04

Pos logit contributions
Leave+0.007
 pian+0.007
 movies+0.007
 concert+0.007
aw+0.007
Neg logit contributions
ometer-0.008
urt-0.008
 contents-0.007
inky-0.007
gy-0.007
Token pe
Token ID613
Feature activation-6.73e-02

Pos logit contributions
 concert+0.002
 films+0.002
 pian+0.001
 musicians+0.001
 movies+0.001
Neg logit contributions
gy-0.001
 contents-0.001
 trunk-0.001
 Finally-0.001
lock-0.001
Tokeneks
Token ID2573
Feature activation-1.89e-02

Pos logit contributions
 concert+0.117
 crowds+0.116
 movies+0.114
 films+0.112
 pian+0.110
Neg logit contributions
gy-0.112
urt-0.110
lock-0.105
 contents-0.102
ucer-0.101
Token from
Token ID422
Feature activation-5.38e-03

Pos logit contributions
 concert+0.044
 pian+0.042
 films+0.042
 movies+0.040
 pastor+0.040
Neg logit contributions
lock-0.018
 contents-0.017
 hinges-0.017
 shovel-0.015
ucer-0.015
Token behind
Token ID2157
Feature activation-1.28e-03

Pos logit contributions
 concert+0.008
 films+0.007
 pian+0.007
 movies+0.007
 crowds+0.007
Neg logit contributions
urt-0.011
gy-0.010
lock-0.010
 contents-0.010
 Finally-0.010
Token the
Token ID262
Feature activation-7.51e-03

Pos logit contributions
 concert+0.003
 pian+0.002
 films+0.002
 crowds+0.002
 movies+0.002
Neg logit contributions
urt-0.002
gy-0.002
 contents-0.002
lock-0.002
 Finally-0.002
Token tree
Token ID5509
Feature activation+3.24e-04

Pos logit contributions
 concert+0.016
 films+0.016
 movies+0.016
 crowds+0.016
 pian+0.016
Neg logit contributions
urt-0.010
lock-0.009
 contents-0.009
gy-0.009
ometer-0.009
Token again
Token ID757
Feature activation-3.66e-02

Pos logit contributions
 films+0.026
 pian+0.025
 announcements+0.025
 crowds+0.024
 pastor+0.024
Neg logit contributions
lock-0.025
gy-0.024
urt-0.024
 contents-0.023
ometer-0.023
Token?"
Token ID1701
Feature activation-1.06e-02

Pos logit contributions
 films+0.069
 pian+0.068
 announcements+0.068
 crowds+0.067
 concert+0.065
Neg logit contributions
lock-0.053
gy-0.052
urt-0.051
 job-0.051
 Finally-0.048
Token
Token ID198
Feature activation-9.61e-03

Pos logit contributions
 concert+0.024
 pian+0.024
 films+0.024
 crowds+0.023
 movies+0.023
Neg logit contributions
urt-0.012
 Finally-0.012
lock-0.011
gy-0.011
ometer-0.011
Token
Token ID198
Feature activation-7.99e-03

Pos logit contributions
 concert+0.021
 films+0.021
 crowds+0.021
 pian+0.020
 movies+0.020
Neg logit contributions
urt-0.012
lock-0.011
 Finally-0.011
gy-0.011
ometer-0.011
TokenSpot
Token ID32565
Feature activation-4.95e-03

Pos logit contributions
 concert+0.014
 films+0.014
 movies+0.014
 crowds+0.014
 pian+0.013
Neg logit contributions
urt-0.013
lock-0.013
 contents-0.013
gy-0.012
ometer-0.012
Token n
Token ID299
Feature activation-6.72e-02

Pos logit contributions
 films+0.010
 pian+0.010
 concert+0.010
 crowds+0.009
 movies+0.009
Neg logit contributions
gy-0.007
urt-0.007
 Finally-0.007
ometer-0.007
lock-0.007
Tokenuzz
Token ID4715
Feature activation-2.46e-02

Pos logit contributions
 concert+0.136
 films+0.134
 movies+0.132
 crowds+0.131
 pian+0.130
Neg logit contributions
urt-0.094
lock-0.090
 contents-0.088
gy-0.086
ometer-0.085
Tokenled
Token ID992
Feature activation-9.88e-03

Pos logit contributions
 concert+0.046
 films+0.045
 pian+0.045
 movies+0.044
 oper+0.044
Neg logit contributions
urt-0.036
gy-0.035
ometer-0.034
ucer-0.033
sy-0.032
Token Tom
Token ID4186
Feature activation-5.20e-03

Pos logit contributions
 concert+0.021
 pian+0.020
 films+0.020
 crowds+0.019
 movies+0.019
Neg logit contributions
urt-0.014
gy-0.014
 Finally-0.013
lock-0.013
ometer-0.012
Token.
Token ID13
Feature activation-5.15e-03

Pos logit contributions
 concert+0.008
 films+0.008
 movies+0.008
 pian+0.008
 crowds+0.008
Neg logit contributions
urt-0.010
lock-0.009
gy-0.009
 contents-0.009
ometer-0.009
Token He
Token ID679
Feature activation+2.36e-03

Pos logit contributions
 concert+0.011
 films+0.011
 movies+0.011
 crowds+0.011
 pian+0.011
Neg logit contributions
urt-0.007
lock-0.006
 contents-0.006
ometer-0.006
gy-0.006
Token there
Token ID612
Feature activation+4.94e-03

Pos logit contributions
 concert+0.007
 films+0.007
 movies+0.007
 pian+0.007
 crowds+0.007
Neg logit contributions
urt-0.011
lock-0.011
 contents-0.011
ometer-0.011
ucer-0.011
Token was
Token ID373
Feature activation+1.65e-02

Pos logit contributions
urt+0.006
gy+0.005
lock+0.005
 contents+0.005
 Finally+0.005
Neg logit contributions
 concert-0.011
 films-0.011
 crowds-0.011
 pian-0.011
 announcements-0.011
Token a
Token ID257
Feature activation+1.62e-03

Pos logit contributions
urt+0.022
lock+0.021
ometer+0.020
 contents+0.020
gy+0.020
Neg logit contributions
 films-0.035
 concert-0.035
 movies-0.034
 pian-0.033
 crowds-0.033
Token little
Token ID1310
Feature activation+2.13e-02

Pos logit contributions
urt+0.002
lock+0.002
 contents+0.002
gy+0.002
ometer+0.002
Neg logit contributions
 concert-0.003
 films-0.003
 movies-0.003
 crowds-0.003
 pian-0.003
Token girl
Token ID2576
Feature activation+1.06e-02

Pos logit contributions
urt+0.031
lock+0.029
gy+0.028
ometer+0.028
 contents+0.028
Neg logit contributions
 concert-0.041
 films-0.041
 crowds-0.040
 movies-0.040
 pian-0.039
Token named
Token ID3706
Feature activation+3.99e-02

Pos logit contributions
urt+0.018
lock+0.017
gy+0.017
 contents+0.016
ometer+0.016
Neg logit contributions
 concert-0.019
 films-0.018
 crowds-0.018
 movies-0.018
 pian-0.017
Token L
Token ID406
Feature activation-1.41e-02

Pos logit contributions
urt+0.091
lock+0.088
ometer+0.087
 contents+0.086
ucer+0.086
Neg logit contributions
 concert-0.046
 movies-0.046
 pian-0.044
 films-0.044
 opera-0.044
Tokenizzy
Token ID40593
Feature activation-1.05e-02

Pos logit contributions
 films+0.025
 concert+0.025
 movies+0.025
 crowds+0.025
 pian+0.024
Neg logit contributions
urt-0.023
lock-0.022
 contents-0.021
ucer-0.021
gy-0.021
Token.
Token ID13
Feature activation-1.32e-03

Pos logit contributions
 concert+0.018
 films+0.018
 movies+0.018
 pian+0.017
 crowds+0.017
Neg logit contributions
urt-0.017
 contents-0.017
lock-0.017
ometer-0.017
ucer-0.016
Token She
Token ID1375
Feature activation+6.43e-03

Pos logit contributions
 movies+0.002
 pian+0.002
 crowds+0.002
 films+0.002
 concert+0.002
Neg logit contributions
urt-0.002
lock-0.002
 contents-0.002
ucer-0.002
gy-0.002
Token was
Token ID373
Feature activation-1.24e-02

Pos logit contributions
urt+0.008
lock+0.008
gy+0.008
 contents+0.008
ometer+0.007
Neg logit contributions
 concert-0.014
 films-0.014
 crowds-0.013
 movies-0.013
 pian-0.013
Token days
Token ID1528
Feature activation-1.66e-02

Pos logit contributions
 concert+0.018
 pian+0.018
 movies+0.017
La+0.017
 films+0.017
Neg logit contributions
 contents-0.024
lock-0.023
 job-0.023
 Finally-0.023
urt-0.022
Token,
Token ID11
Feature activation+1.38e-02

Pos logit contributions
 films+0.032
 concert+0.032
 crowds+0.032
 pian+0.031
 movies+0.031
Neg logit contributions
urt-0.024
lock-0.024
gy-0.023
 Finally-0.023
 contents-0.022
Token he
Token ID339
Feature activation-2.66e-03

Pos logit contributions
urt+0.029
inky+0.028
 contents+0.028
lock+0.028
ucer+0.028
Neg logit contributions
 movies-0.018
 concert-0.017
 films-0.017
 crowds-0.017
 pian-0.016
Token was
Token ID373
Feature activation-9.06e-03

Pos logit contributions
 films+0.006
 pian+0.006
 concert+0.006
 crowds+0.005
 movies+0.005
Neg logit contributions
urt-0.003
gy-0.003
lock-0.003
 contents-0.003
 Finally-0.003
Token very
Token ID845
Feature activation+1.24e-02

Pos logit contributions
 concert+0.018
 films+0.018
 pian+0.018
 movies+0.018
 crowds+0.018
Neg logit contributions
urt-0.012
lock-0.012
 contents-0.012
gy-0.012
 Finally-0.011
Token sad
Token ID6507
Feature activation+3.45e-02

Pos logit contributions
urt+0.022
lock+0.021
gy+0.021
ucer+0.021
 contents+0.021
Neg logit contributions
 concert-0.020
 films-0.020
 crowds-0.019
 movies-0.019
 pastor-0.019
Token and
Token ID290
Feature activation+1.06e-02

Pos logit contributions
urt+0.052
lock+0.048
ometer+0.048
gy+0.048
 contents+0.047
Neg logit contributions
 concert-0.067
 films-0.067
 pian-0.065
La-0.064
 movies-0.064
Token very
Token ID845
Feature activation+1.46e-02

Pos logit contributions
urt+0.018
lock+0.017
 contents+0.017
ometer+0.016
gy+0.016
Neg logit contributions
 concert-0.019
 films-0.018
 movies-0.018
 crowds-0.018
 pian-0.018
Token hungry
Token ID14720
Feature activation+8.01e-03

Pos logit contributions
urt+0.026
lock+0.025
ucer+0.025
 contents+0.025
gy+0.024
Neg logit contributions
 concert-0.023
 crowds-0.023
 movies-0.023
 films-0.023
 strangers-0.022
Token.
Token ID13
Feature activation+5.03e-03

Pos logit contributions
urt+0.010
lock+0.010
 contents+0.010
gy+0.009
ometer+0.009
Neg logit contributions
 concert-0.017
 films-0.017
 pian-0.017
 crowds-0.016
 movies-0.016
Token He
Token ID679
Feature activation+1.57e-02

Pos logit contributions
ucer+0.008
 contents+0.008
urt+0.008
lock+0.008
gy+0.008
Neg logit contributions
 movies-0.009
 films-0.009
 crowds-0.008
 concert-0.008
 pian-0.008
Token basket
Token ID7988
Feature activation+2.26e-02

Pos logit contributions
urt+0.001
gy+0.000
lock+0.000
ucer+0.000
ometer+0.000
Neg logit contributions
 movies-0.001
 films-0.001
 concert-0.001
 crowds-0.001
 strangers-0.001
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
urt+0.045
ometer+0.045
 contents+0.044
 virtue+0.044
ucer+0.043
Neg logit contributions
 movies-0.025
aw-0.024
 films-0.023
 dolphins-0.023
 concert-0.023
Token He
Token ID679
Feature activation+1.33e-02

Pos logit contributions
ucer+0.018
 contents+0.018
lock+0.018
urt+0.018
 virtue+0.017
Neg logit contributions
 movies-0.019
 films-0.019
 crowds-0.017
 concert-0.017
 dolphins-0.016
Token was
Token ID373
Feature activation-7.63e-03

Pos logit contributions
urt+0.020
lock+0.019
 contents+0.019
gy+0.018
ometer+0.018
Neg logit contributions
 concert-0.026
 films-0.025
 movies-0.025
 crowds-0.025
 pian-0.025
Token very
Token ID845
Feature activation+1.12e-02

Pos logit contributions
 films+0.013
 movies+0.013
 concert+0.013
 crowds+0.012
 pian+0.012
Neg logit contributions
urt-0.013
lock-0.013
ucer-0.012
ometer-0.012
 contents-0.012
Token proud
Token ID6613
Feature activation+3.49e-02

Pos logit contributions
gy+0.022
urt+0.022
lock+0.021
ucer+0.021
ometer+0.021
Neg logit contributions
 films-0.016
 movies-0.016
 dolphins-0.015
 pastor-0.015
 crowds-0.015
Token of
Token ID286
Feature activation-4.49e-03

Pos logit contributions
urt+0.056
lock+0.053
 contents+0.053
ometer+0.050
 Finally+0.050
Neg logit contributions
 concert-0.064
 films-0.061
 crowds-0.061
 announcements-0.060
 pian-0.060
Token his
Token ID465
Feature activation+2.03e-02

Pos logit contributions
 films+0.007
 concert+0.007
 movies+0.007
 crowds+0.007
 pian+0.007
Neg logit contributions
urt-0.008
lock-0.008
ometer-0.008
 contents-0.008
gy-0.008
Token work
Token ID670
Feature activation+2.33e-02

Pos logit contributions
urt+0.028
lock+0.027
ucer+0.026
gy+0.026
 Finally+0.025
Neg logit contributions
 films-0.043
 concert-0.042
 movies-0.042
 crowds-0.040
 dolphins-0.040
Token.
Token ID13
Feature activation+8.21e-03

Pos logit contributions
urt+0.037
lock+0.036
 contents+0.035
gy+0.035
ometer+0.034
Neg logit contributions
 concert-0.042
 films-0.042
 movies-0.041
 crowds-0.041
 pian-0.040
Token
Token ID198
Feature activation-8.83e-03

Pos logit contributions
 contents+0.014
lock+0.014
ucer+0.014
urt+0.014
gy+0.013
Neg logit contributions
 movies-0.013
 films-0.013
 crowds-0.012
 concert-0.012
 dolphins-0.012
Token it
Token ID340
Feature activation-1.14e-02

Pos logit contributions
 crowds+0.058
 concert+0.057
 films+0.055
 pastor+0.054
 announcements+0.053
Neg logit contributions
lock-0.026
 hammer-0.025
ometer-0.024
gy-0.023
 tool-0.023
Token was
Token ID373
Feature activation+2.67e-02

Pos logit contributions
 concert+0.022
 films+0.022
 movies+0.021
 crowds+0.021
 pian+0.021
Neg logit contributions
urt-0.017
lock-0.016
 contents-0.016
gy-0.016
ometer-0.016
Token or
Token ID393
Feature activation+2.11e-02

Pos logit contributions
urt+0.044
lock+0.042
 contents+0.041
gy+0.041
ometer+0.040
Neg logit contributions
 concert-0.048
 films-0.047
 movies-0.046
 pian-0.046
 crowds-0.046
Token where
Token ID810
Feature activation-1.08e-02

Pos logit contributions
urt+0.035
lock+0.035
 contents+0.034
gy+0.034
ometer+0.033
Neg logit contributions
 concert-0.036
 films-0.035
 pian-0.034
 movies-0.034
 crowds-0.033
Token it
Token ID340
Feature activation-4.64e-03

Pos logit contributions
 concert+0.020
 films+0.020
 movies+0.019
 pian+0.019
 crowds+0.019
Neg logit contributions
urt-0.017
lock-0.016
 contents-0.016
gy-0.016
ometer-0.016
Token came
Token ID1625
Feature activation+5.57e-02

Pos logit contributions
 concert+0.010
 films+0.010
 pian+0.009
 movies+0.009
 pastor+0.009
Neg logit contributions
gy-0.006
urt-0.006
lock-0.006
 contents-0.006
ometer-0.005
Token from
Token ID422
Feature activation+1.41e-02

Pos logit contributions
urt+0.091
lock+0.088
gy+0.087
 contents+0.086
ometer+0.084
Neg logit contributions
 concert-0.100
 films-0.097
 movies-0.095
 pian-0.095
 crowds-0.095
Token.
Token ID13
Feature activation+1.60e-02

Pos logit contributions
urt+0.019
gy+0.018
lock+0.018
 contents+0.018
ometer+0.017
Neg logit contributions
 concert-0.029
 films-0.029
 pian-0.029
 movies-0.028
 crowds-0.028
Token He
Token ID679
Feature activation+1.02e-02

Pos logit contributions
urt+0.025
ucer+0.025
lock+0.025
 contents+0.024
 virtue+0.023
Neg logit contributions
 movies-0.027
 films-0.026
 crowds-0.026
 concert-0.025
 pian-0.025
Token began
Token ID2540
Feature activation+3.01e-03

Pos logit contributions
gy+0.014
lock+0.013
urt+0.013
 contents+0.013
 Finally+0.012
Neg logit contributions
 films-0.022
 concert-0.021
 movies-0.021
 crowds-0.020
 pian-0.020
Token to
Token ID284
Feature activation+2.37e-03

Pos logit contributions
gy+0.003
urt+0.003
lock+0.002
 contents+0.002
 hammer+0.002
Neg logit contributions
 concert-0.007
aw-0.007
 films-0.007
 crowds-0.007
 movies-0.007
Token a
Token ID257
Feature activation-4.91e-03

Pos logit contributions
 films+0.001
 movies+0.001
 concert+0.001
 crowds+0.001
 dolphins+0.001
Neg logit contributions
urt-0.001
lock-0.001
 contents-0.001
ometer-0.001
ucer-0.001
Token big
Token ID1263
Feature activation-4.37e-03

Pos logit contributions
 concert+0.010
 films+0.010
 movies+0.009
 crowds+0.009
 pian+0.009
Neg logit contributions
urt-0.007
lock-0.007
 contents-0.007
gy-0.006
ometer-0.006
Token gust
Token ID35253
Feature activation+3.54e-03

Pos logit contributions
La+0.009
 films+0.008
 pian+0.008
 concert+0.008
 movies+0.008
Neg logit contributions
gy-0.006
urt-0.006
 job-0.005
lock-0.005
 hammer-0.005
Token of
Token ID286
Feature activation+1.59e-02

Pos logit contributions
lock+0.005
urt+0.005
gy+0.005
ucer+0.005
ometer+0.004
Neg logit contributions
 concert-0.007
 crowds-0.007
 films-0.007
 movies-0.007
 announcements-0.007
Token wind
Token ID2344
Feature activation-8.13e-03

Pos logit contributions
urt+0.026
 contents+0.026
lock+0.026
gy+0.026
 Finally+0.024
Neg logit contributions
 concert-0.027
 films-0.025
 pian-0.025
 movies-0.025
 crowds-0.025
Token came
Token ID1625
Feature activation+4.23e-02

Pos logit contributions
 films+0.016
 movies+0.016
 crowds+0.016
 concert+0.015
 pian+0.015
Neg logit contributions
gy-0.012
urt-0.011
lock-0.011
ucer-0.011
 contents-0.010
Token by
Token ID416
Feature activation+1.29e-02

Pos logit contributions
urt+0.089
lock+0.087
 contents+0.086
gy+0.084
ometer+0.084
Neg logit contributions
 concert-0.055
 films-0.054
 movies-0.053
 crowds-0.052
 pian-0.052
Token and
Token ID290
Feature activation+6.66e-03

Pos logit contributions
urt+0.026
lock+0.025
 contents+0.025
gy+0.025
ometer+0.025
Neg logit contributions
 concert-0.018
 films-0.017
 movies-0.017
 pian-0.017
 crowds-0.017
Token scattered
Token ID16830
Feature activation+2.72e-02

Pos logit contributions
urt+0.012
lock+0.011
 contents+0.011
gy+0.011
ometer+0.011
Neg logit contributions
 concert-0.011
 films-0.011
 movies-0.011
 crowds-0.011
 pian-0.011
Token all
Token ID477
Feature activation+1.23e-03

Pos logit contributions
urt+0.046
lock+0.045
 contents+0.044
gy+0.043
ometer+0.043
Neg logit contributions
 concert-0.047
 films-0.046
 movies-0.045
 crowds-0.045
 pian-0.045
Token the
Token ID262
Feature activation-2.88e-03

Pos logit contributions
urt+0.002
gy+0.002
 contents+0.002
lock+0.002
ucer+0.002
Neg logit contributions
 concert-0.002
 films-0.002
 crowds-0.002
 movies-0.002
 pian-0.002
Token had
Token ID550
Feature activation+2.22e-02

Pos logit contributions
urt+0.011
lock+0.010
 contents+0.010
gy+0.010
ometer+0.010
Neg logit contributions
 concert-0.013
 films-0.013
 movies-0.013
 crowds-0.013
 pian-0.013
Token poison
Token ID8764
Feature activation-4.69e-03

Pos logit contributions
urt+0.035
lock+0.033
ometer+0.032
ucer+0.032
gy+0.031
Neg logit contributions
 concert-0.040
 crowds-0.040
 movies-0.040
 films-0.039
 dolphins-0.039
Token inside
Token ID2641
Feature activation+1.48e-02

Pos logit contributions
 concert+0.008
 films+0.008
 movies+0.008
 crowds+0.007
 pian+0.007
Neg logit contributions
urt-0.008
lock-0.008
 contents-0.008
gy-0.008
ometer-0.008
Token!
Token ID0
Feature activation+2.68e-03

Pos logit contributions
urt+0.028
lock+0.027
ometer+0.026
 contents+0.026
gy+0.026
Neg logit contributions
 films-0.022
 concert-0.022
 movies-0.022
 crowds-0.021
 strangers-0.021
Token
Token ID198
Feature activation-1.56e-03

Pos logit contributions
urt+0.004
 contents+0.004
lock+0.004
ucer+0.003
gy+0.003
Neg logit contributions
 films-0.005
 concert-0.005
 movies-0.005
 crowds-0.005
 pian-0.005
Token
Token ID198
Feature activation+5.59e-04

Pos logit contributions
 concert+0.003
 films+0.003
 movies+0.003
 crowds+0.003
 pian+0.003
Neg logit contributions
urt-0.002
lock-0.002
 contents-0.002
gy-0.002
ometer-0.002
TokenThe
Token ID464
Feature activation+2.40e-03

Pos logit contributions
urt+0.001
 contents+0.001
lock+0.001
ometer+0.001
gy+0.001
Neg logit contributions
 concert-0.001
 films-0.001
 movies-0.001
 pian-0.001
 crowds-0.001
Token poison
Token ID8764
Feature activation-1.28e-02

Pos logit contributions
urt+0.004
lock+0.004
ucer+0.004
ometer+0.004
gy+0.004
Neg logit contributions
 concert-0.004
 films-0.004
 movies-0.004
 crowds-0.004
 dolphins-0.004
Token se
Token ID384
Feature activation-2.26e-02

Pos logit contributions
 concert+0.025
 films+0.024
 movies+0.024
 crowds+0.024
 pian+0.024
Neg logit contributions
urt-0.019
lock-0.018
 contents-0.018
gy-0.018
ometer-0.017
Tokenep
Token ID538
Feature activation+1.38e-02

Pos logit contributions
 concert+0.025
 films+0.024
 movies+0.023
 pian+0.023
aw+0.023
Neg logit contributions
urt-0.051
lock-0.051
 contents-0.050
ometer-0.049
ucer-0.048
Tokened
Token ID276
Feature activation+6.54e-03

Pos logit contributions
urt+0.023
lock+0.022
 contents+0.022
ometer+0.022
 Finally+0.022
Neg logit contributions
 concert-0.023
 films-0.022
 movies-0.022
 crowds-0.022
aw-0.022
Token new
Token ID649
Feature activation+2.10e-03

Pos logit contributions
 concert+0.028
 films+0.027
 crowds+0.027
 movies+0.027
 pian+0.026
Neg logit contributions
urt-0.016
gy-0.016
lock-0.015
 job-0.015
 Finally-0.015
Token here
Token ID994
Feature activation+4.92e-03

Pos logit contributions
urt+0.003
lock+0.002
gy+0.002
ometer+0.002
 job+0.002
Neg logit contributions
 concert-0.005
 films-0.004
 announcements-0.004
 movies-0.004
 crowds-0.004
Token."
Token ID526
Feature activation+1.92e-02

Pos logit contributions
urt+0.008
lock+0.008
 contents+0.007
gy+0.007
ometer+0.007
Neg logit contributions
 concert-0.009
 films-0.009
 movies-0.009
 crowds-0.009
 pian-0.008
Token
Token ID198
Feature activation+9.22e-03

Pos logit contributions
urt+0.026
lock+0.025
 contents+0.024
gy+0.024
ometer+0.024
Neg logit contributions
 concert-0.040
 films-0.039
 movies-0.038
 crowds-0.038
 pian-0.038
Token
Token ID198
Feature activation+5.71e-03

Pos logit contributions
urt+0.013
lock+0.013
 contents+0.012
gy+0.012
ometer+0.012
Neg logit contributions
 concert-0.018
 films-0.018
 movies-0.018
 crowds-0.018
 pian-0.018
TokenThe
Token ID464
Feature activation+5.81e-03

Pos logit contributions
urt+0.010
lock+0.009
 contents+0.009
gy+0.009
ometer+0.009
Neg logit contributions
 concert-0.010
 films-0.010
 movies-0.010
 crowds-0.009
 pian-0.009
Token old
Token ID1468
Feature activation+1.35e-04

Pos logit contributions
urt+0.009
lock+0.009
 contents+0.008
gy+0.008
ometer+0.008
Neg logit contributions
 concert-0.011
 films-0.011
 movies-0.010
 crowds-0.010
 pian-0.010
Token j
Token ID474
Feature activation-2.30e-02

Pos logit contributions
urt+0.000
lock+0.000
 contents+0.000
gy+0.000
ometer+0.000
Neg logit contributions
 films-0.000
 movies-0.000
 concert-0.000
 crowds-0.000
 pian-0.000
Tokenagu
Token ID11433
Feature activation-2.88e-02

Pos logit contributions
 films+0.031
 movies+0.029
 announcements+0.028
 pastor+0.027
 pian+0.026
Neg logit contributions
urt-0.048
lock-0.047
ucer-0.046
inky-0.045
 job-0.045
Tokenar
Token ID283
Feature activation-2.86e-02

Pos logit contributions
 concert+0.051
 films+0.051
 movies+0.050
 crowds+0.049
 pian+0.049
Neg logit contributions
urt-0.047
lock-0.046
 contents-0.045
gy-0.044
ometer-0.044
Token was
Token ID373
Feature activation-6.11e-03

Pos logit contributions
 films+0.060
 concert+0.058
 movies+0.057
 crowds+0.057
 pian+0.057
Neg logit contributions
urt-0.039
gy-0.036
lock-0.035
 contents-0.035
ometer-0.033
Token fits
Token ID11414
Feature activation-1.29e-02

Pos logit contributions
urt+0.022
lock+0.020
 contents+0.020
gy+0.020
ucer+0.019
Neg logit contributions
 concert-0.024
 films-0.024
 movies-0.024
 pian-0.023
 crowds-0.023
Token the
Token ID262
Feature activation-5.13e-03

Pos logit contributions
 concert+0.021
 films+0.021
 movies+0.021
 crowds+0.020
 pian+0.020
Neg logit contributions
urt-0.022
lock-0.022
 contents-0.021
gy-0.021
ometer-0.021
Token cup
Token ID6508
Feature activation+1.62e-02

Pos logit contributions
 concert+0.011
 movies+0.011
 films+0.011
 crowds+0.011
 pian+0.011
Neg logit contributions
urt-0.007
gy-0.006
lock-0.006
ometer-0.006
ucer-0.006
Tokenboard
Token ID3526
Feature activation+9.43e-03

Pos logit contributions
urt+0.025
gy+0.024
lock+0.023
 contents+0.023
ucer+0.023
Neg logit contributions
 concert-0.029
 movies-0.028
 films-0.028
 crowds-0.028
 pian-0.027
Token,"
Token ID553
Feature activation+4.50e-03

Pos logit contributions
urt+0.014
lock+0.013
 contents+0.013
gy+0.013
ometer+0.013
Neg logit contributions
 concert-0.018
 films-0.017
 movies-0.017
 crowds-0.017
 pian-0.017
Token he
Token ID339
Feature activation-1.13e-03

Pos logit contributions
urt+0.007
lock+0.006
 contents+0.006
 Finally+0.006
gy+0.006
Neg logit contributions
 concert-0.009
 films-0.009
 crowds-0.008
 pian-0.008
 movies-0.008
Token said
Token ID531
Feature activation+9.13e-03

Pos logit contributions
 concert+0.002
 films+0.002
 pian+0.002
 movies+0.002
 announcements+0.002
Neg logit contributions
ucer-0.002
urt-0.002
 contents-0.002
lock-0.002
gy-0.002
Token.
Token ID13
Feature activation+8.32e-03

Pos logit contributions
 contents+0.016
urt+0.016
ucer+0.016
lock+0.015
ometer+0.015
Neg logit contributions
 films-0.014
 concert-0.014
 movies-0.013
 pian-0.013
 crowds-0.013
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
urt+0.012
lock+0.011
 contents+0.011
gy+0.011
ometer+0.011
Neg logit contributions
 concert-0.017
 films-0.016
 movies-0.016
 crowds-0.016
 pian-0.016
Token
Token ID198
Feature activation+9.70e-03

Pos logit contributions
urt+0.017
lock+0.017
 contents+0.016
gy+0.016
ometer+0.016
Neg logit contributions
 concert-0.023
 films-0.022
 movies-0.022
 crowds-0.022
 pian-0.022
TokenMary
Token ID24119
Feature activation-1.11e-02

Pos logit contributions
urt+0.015
 contents+0.014
lock+0.014
gy+0.014
ometer+0.014
Neg logit contributions
 concert-0.018
 films-0.018
 movies-0.018
 pian-0.017
 crowds-0.017
Token cup
Token ID6508
Feature activation+6.53e-03

Pos logit contributions
 concert+0.011
 films+0.011
 movies+0.011
 crowds+0.011
 pian+0.011
Neg logit contributions
urt-0.008
lock-0.008
 Finally-0.007
gy-0.007
 contents-0.007
Token was
Token ID373
Feature activation-2.83e-03

Pos logit contributions
urt+0.010
lock+0.010
ometer+0.010
 contents+0.010
ucer+0.010
Neg logit contributions
 films-0.012
 concert-0.012
 pian-0.011
 crowds-0.011
 movies-0.011
Token old
Token ID1468
Feature activation+1.86e-02

Pos logit contributions
 concert+0.006
 films+0.006
 movies+0.006
 crowds+0.006
 pian+0.006
Neg logit contributions
urt-0.004
lock-0.004
 contents-0.004
gy-0.003
ometer-0.003
Token and
Token ID290
Feature activation-4.41e-03

Pos logit contributions
urt+0.030
ometer+0.029
 job+0.029
 contents+0.029
gy+0.029
Neg logit contributions
 concert-0.031
La-0.031
 announcements-0.031
 films-0.031
 movies-0.030
Token cracked
Token ID21368
Feature activation+2.95e-02

Pos logit contributions
 concert+0.008
 films+0.008
 movies+0.008
 crowds+0.008
 pian+0.008
Neg logit contributions
urt-0.007
lock-0.006
gy-0.006
 contents-0.006
 Finally-0.006
Token.
Token ID13
Feature activation-5.16e-03

Pos logit contributions
urt+0.054
lock+0.052
 contents+0.051
gy+0.050
ometer+0.050
Neg logit contributions
 concert-0.047
 films-0.046
 movies-0.046
 crowds-0.045
 pian-0.045
Token It
Token ID632
Feature activation-6.62e-03

Pos logit contributions
 concert+0.010
 films+0.010
 movies+0.010
 crowds+0.010
 pian+0.010
Neg logit contributions
urt-0.008
lock-0.007
 contents-0.007
gy-0.007
ometer-0.007
Token was
Token ID373
Feature activation-8.04e-03

Pos logit contributions
 concert+0.013
 films+0.013
 movies+0.012
 crowds+0.012
 pian+0.012
Neg logit contributions
urt-0.010
lock-0.009
 contents-0.009
gy-0.009
ometer-0.009
Token not
Token ID407
Feature activation+1.21e-02

Pos logit contributions
 concert+0.016
 films+0.016
 movies+0.016
 crowds+0.016
 pian+0.016
Neg logit contributions
urt-0.011
lock-0.011
 contents-0.010
gy-0.010
ometer-0.010
Token a
Token ID257
Feature activation+1.34e-02

Pos logit contributions
urt+0.019
lock+0.018
 Finally+0.018
 contents+0.017
ometer+0.017
Neg logit contributions
 films-0.023
 concert-0.022
 pian-0.022
 movies-0.022
 crowds-0.022
Token good
Token ID922
Feature activation+9.65e-03

Pos logit contributions
urt+0.018
lock+0.017
gy+0.016
 contents+0.016
 Finally+0.016
Neg logit contributions
 concert-0.029
 films-0.028
 movies-0.028
 pian-0.028
 crowds-0.027
Token said
Token ID531
Feature activation+7.02e-03

Pos logit contributions
 films+0.021
 concert+0.020
 movies+0.020
 crowds+0.020
 pian+0.020
Neg logit contributions
urt-0.014
gy-0.014
lock-0.014
 contents-0.013
ometer-0.013
Token he
Token ID339
Feature activation-1.61e-03

Pos logit contributions
urt+0.010
lock+0.010
ometer+0.010
 Finally+0.010
gy+0.009
Neg logit contributions
 concert-0.014
 films-0.014
 crowds-0.013
 movies-0.013
 pian-0.013
Token was
Token ID373
Feature activation+5.61e-04

Pos logit contributions
 concert+0.003
 movies+0.003
 films+0.003
 crowds+0.003
 pian+0.003
Neg logit contributions
urt-0.003
lock-0.003
ucer-0.002
 contents-0.002
ometer-0.002
Token ok
Token ID12876
Feature activation+6.91e-03

Pos logit contributions
urt+0.001
gy+0.001
lock+0.001
 contents+0.001
 Finally+0.001
Neg logit contributions
 concert-0.001
 films-0.001
 movies-0.001
 pian-0.001
 crowds-0.001
Token and
Token ID290
Feature activation+6.06e-03

Pos logit contributions
urt+0.011
gy+0.010
lock+0.010
ometer+0.010
 contents+0.010
Neg logit contributions
 concert-0.013
 films-0.013
 crowds-0.012
 movies-0.012
 pian-0.012
Token they
Token ID484
Feature activation+1.44e-02

Pos logit contributions
urt+0.010
lock+0.009
 contents+0.009
gy+0.009
ometer+0.009
Neg logit contributions
 concert-0.011
 films-0.011
 movies-0.011
 crowds-0.011
 pian-0.011
Token both
Token ID1111
Feature activation+8.07e-03

Pos logit contributions
urt+0.024
 contents+0.023
lock+0.023
ometer+0.022
ucer+0.021
Neg logit contributions
 concert-0.026
 movies-0.024
 crowds-0.024
 films-0.024
 pian-0.024
Token were
Token ID547
Feature activation+8.45e-03

Pos logit contributions
urt+0.013
lock+0.013
 contents+0.013
gy+0.012
ometer+0.012
Neg logit contributions
 concert-0.014
 films-0.014
 movies-0.014
 crowds-0.014
 pian-0.014
Token soon
Token ID2582
Feature activation+5.17e-03

Pos logit contributions
urt+0.011
gy+0.010
 contents+0.010
lock+0.010
ometer+0.010
Neg logit contributions
 concert-0.018
 films-0.018
 movies-0.017
 crowds-0.017
 pian-0.017
Token smiling
Token ID16755
Feature activation+2.58e-02

Pos logit contributions
urt+0.008
lock+0.008
 contents+0.008
gy+0.007
ucer+0.007
Neg logit contributions
 concert-0.009
 films-0.009
 pian-0.009
 movies-0.009
 crowds-0.009
Token and
Token ID290
Feature activation+8.11e-03

Pos logit contributions
urt+0.044
lock+0.042
 contents+0.041
gy+0.041
ometer+0.041
Neg logit contributions
 concert-0.044
 films-0.044
 movies-0.043
 pian-0.042
 crowds-0.042
Token.
Token ID13
Feature activation-1.69e-03

Pos logit contributions
urt+0.000
lock+0.000
 contents+0.000
ometer+0.000
gy+0.000
Neg logit contributions
 concert-0.000
 films-0.000
 movies-0.000
 crowds-0.000
 pian-0.000
Token Sally
Token ID25737
Feature activation-9.20e-03

Pos logit contributions
 crowds+0.003
aw+0.003
 movies+0.002
 films+0.002
 concert+0.002
Neg logit contributions
ucer-0.003
urt-0.003
 contents-0.003
lock-0.003
ometer-0.003
Token was
Token ID373
Feature activation-2.10e-02

Pos logit contributions
 films+0.020
 concert+0.019
 pian+0.019
 crowds+0.019
 movies+0.019
Neg logit contributions
gy-0.012
urt-0.012
lock-0.011
 Finally-0.010
 contents-0.010
Token not
Token ID407
Feature activation+1.02e-02

Pos logit contributions
 concert+0.048
 films+0.047
 movies+0.046
 crowds+0.046
 pian+0.046
Neg logit contributions
gy-0.023
urt-0.022
lock-0.022
 contents-0.021
 Finally-0.020
Token foolish
Token ID19538
Feature activation+1.53e-02

Pos logit contributions
urt+0.017
lock+0.017
gy+0.016
 contents+0.016
ometer+0.016
Neg logit contributions
 concert-0.018
 films-0.017
 movies-0.017
 crowds-0.017
 pian-0.017
Token like
Token ID588
Feature activation-8.11e-03

Pos logit contributions
urt+0.018
 job+0.018
lock+0.016
 contents+0.016
ometer+0.016
Neg logit contributions
 concert-0.033
La-0.032
 films-0.032
 movies-0.031
 pian-0.031
Token the
Token ID262
Feature activation+7.27e-03

Pos logit contributions
La+0.016
 pian+0.016
 concert+0.016
 films+0.015
 announcements+0.015
Neg logit contributions
urt-0.010
 job-0.010
 contents-0.009
 seed-0.009
 Mickey-0.009
Token other
Token ID584
Feature activation-7.71e-04

Pos logit contributions
urt+0.013
 contents+0.013
lock+0.013
gy+0.013
ometer+0.013
Neg logit contributions
 concert-0.011
 films-0.011
 pian-0.010
 movies-0.010
 crowds-0.010
Token people
Token ID661
Feature activation-1.06e-02

Pos logit contributions
 concert+0.001
 pian+0.001
La+0.001
 films+0.001
 movies+0.001
Neg logit contributions
 contents-0.001
lock-0.001
gy-0.001
urt-0.001
 job-0.001
Token in
Token ID287
Feature activation-1.15e-02

Pos logit contributions
 concert+0.021
 films+0.021
La+0.021
 pian+0.021
 movies+0.020
Neg logit contributions
lock-0.014
urt-0.014
gy-0.013
ometer-0.013
 job-0.012
Token the
Token ID262
Feature activation+5.01e-03

Pos logit contributions
 concert+0.021
 pian+0.021
 movies+0.021
aw+0.021
 pastor+0.020
Neg logit contributions
gy-0.016
 contents-0.016
 job-0.015
urt-0.015
lock-0.015
Token ice
Token ID4771
Feature activation-1.07e-02

Pos logit contributions
 concert+0.029
 films+0.028
 movies+0.028
 crowds+0.027
 pian+0.027
Neg logit contributions
urt-0.033
lock-0.032
 contents-0.031
gy-0.031
ometer-0.031
Token-
Token ID12
Feature activation+1.28e-03

Pos logit contributions
 concert+0.018
 films+0.018
 crowds+0.018
 movies+0.018
 pian+0.017
Neg logit contributions
urt-0.018
lock-0.018
 contents-0.017
gy-0.017
ometer-0.017
Tokencream
Token ID36277
Feature activation+2.00e-02

Pos logit contributions
 contents+0.002
urt+0.001
 hinges+0.001
 virtue+0.001
lock+0.001
Neg logit contributions
 movies-0.003
aw-0.003
 concert-0.002
 pastor-0.002
 films-0.002
Token truck
Token ID7779
Feature activation+1.06e-02

Pos logit contributions
urt+0.034
lock+0.033
 contents+0.033
gy+0.032
ometer+0.031
Neg logit contributions
 concert-0.034
 movies-0.033
 films-0.033
 crowds-0.032
 pian-0.032
Token,"
Token ID553
Feature activation+5.55e-03

Pos logit contributions
urt+0.019
 contents+0.018
lock+0.018
ucer+0.017
ometer+0.017
Neg logit contributions
 concert-0.017
 movies-0.017
 films-0.017
 crowds-0.016
 pian-0.016
Token mom
Token ID1995
Feature activation-3.38e-02

Pos logit contributions
urt+0.007
 Finally+0.007
lock+0.007
 contents+0.007
ometer+0.007
Neg logit contributions
 concert-0.012
 pian-0.011
 crowds-0.011
 films-0.011
 announcements-0.011
Token said
Token ID531
Feature activation-5.66e-03

Pos logit contributions
 concert+0.061
 films+0.059
 movies+0.059
 pian+0.058
 crowds+0.057
Neg logit contributions
urt-0.054
lock-0.052
 contents-0.052
gy-0.051
ometer-0.050
Token.
Token ID13
Feature activation-6.10e-03

Pos logit contributions
 concert+0.011
 films+0.011
 movies+0.010
 crowds+0.010
 pian+0.010
Neg logit contributions
urt-0.009
lock-0.008
 contents-0.008
gy-0.008
ometer-0.008
Token She
Token ID1375
Feature activation-8.42e-03

Pos logit contributions
 concert+0.014
 films+0.014
 pian+0.013
 movies+0.013
 crowds+0.013
Neg logit contributions
 Finally-0.007
urt-0.007
ometer-0.006
lock-0.006
 contents-0.006
Token took
Token ID1718
Feature activation-1.24e-02

Pos logit contributions
 concert+0.017
 films+0.017
 crowds+0.016
 movies+0.016
 pian+0.016
Neg logit contributions
urt-0.012
lock-0.012
 contents-0.011
gy-0.011
ometer-0.011
Token their
Token ID511
Feature activation+2.02e-03

Pos logit contributions
 concert+0.025
 pian+0.023
 crowds+0.023
 films+0.023
 movies+0.022
Neg logit contributions
urt-0.018
 contents-0.017
 Finally-0.017
lock-0.017
gy-0.016
Token't
Token ID470
Feature activation+3.56e-04

Pos logit contributions
 concert+0.059
 films+0.058
 crowds+0.058
 movies+0.058
 pian+0.057
Neg logit contributions
urt-0.024
lock-0.024
ometer-0.022
 contents-0.021
gy-0.021
Token you
Token ID345
Feature activation+4.72e-03

Pos logit contributions
urt+0.001
lock+0.001
gy+0.000
 job+0.000
 contents+0.000
Neg logit contributions
 films-0.001
 concert-0.001
 crowds-0.001
 movies-0.001
 pian-0.001
Token try
Token ID1949
Feature activation-1.81e-02

Pos logit contributions
urt+0.007
 contents+0.007
lock+0.007
ometer+0.007
gy+0.007
Neg logit contributions
 concert-0.009
 films-0.009
 movies-0.009
 pian-0.009
 crowds-0.008
Token it
Token ID340
Feature activation-1.84e-02

Pos logit contributions
 concert+0.033
 announcements+0.033
 pastor+0.032
 crowds+0.032
 films+0.032
Neg logit contributions
urt-0.027
gy-0.026
 hammer-0.025
 contents-0.024
ometer-0.024
Token on
Token ID319
Feature activation-8.40e-03

Pos logit contributions
 concert+0.028
 films+0.027
 crowds+0.027
 movies+0.027
 pian+0.026
Neg logit contributions
urt-0.035
lock-0.034
 contents-0.033
gy-0.033
ometer-0.033
Token?"
Token ID1701
Feature activation-8.09e-03

Pos logit contributions
 announcements+0.015
 crowds+0.015
 concert+0.015
 pian+0.015
 films+0.015
Neg logit contributions
gy-0.012
urt-0.012
 contents-0.012
 job-0.012
lock-0.011
Token Mom
Token ID11254
Feature activation-3.94e-02

Pos logit contributions
 concert+0.019
 films+0.019
 pian+0.018
 crowds+0.018
 movies+0.018
Neg logit contributions
 Finally-0.008
urt-0.008
lock-0.008
gy-0.008
ometer-0.007
Token said
Token ID531
Feature activation-6.32e-03

Pos logit contributions
 films+0.090
 concert+0.089
 crowds+0.088
 dolphins+0.087
 movies+0.086
Neg logit contributions
urt-0.048
lock-0.042
gy-0.040
ometer-0.040
 Finally-0.038
Token,
Token ID11
Feature activation-1.65e-02

Pos logit contributions
 concert+0.011
 films+0.011
 movies+0.011
 crowds+0.011
 pian+0.011
Neg logit contributions
urt-0.010
lock-0.010
 contents-0.010
gy-0.010
ometer-0.010
Token showing
Token ID4478
Feature activation+2.24e-03

Pos logit contributions
 concert+0.030
 films+0.030
 movies+0.029
 crowds+0.029
 pian+0.029
Neg logit contributions
urt-0.025
lock-0.025
gy-0.024
 Finally-0.023
 contents-0.022
Token her
Token ID607
Feature activation-4.46e-03

Pos logit contributions
urt+0.004
lock+0.004
 contents+0.004
gy+0.004
ometer+0.004
Neg logit contributions
 concert-0.004
 films-0.004
 movies-0.004
 crowds-0.004
 pian-0.004
Token the
Token ID262
Feature activation-4.90e-03

Pos logit contributions
urt+0.004
lock+0.004
gy+0.003
 contents+0.003
ometer+0.003
Neg logit contributions
 concert-0.004
 films-0.003
 pian-0.003
 crowds-0.003
 movies-0.003
Token park
Token ID3952
Feature activation+1.59e-02

Pos logit contributions
 movies+0.007
 concert+0.007
 films+0.007
 opera+0.007
 crowds+0.007
Neg logit contributions
urt-0.010
ucer-0.009
lock-0.009
gy-0.009
ometer-0.009
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
urt+0.022
lock+0.022
gy+0.021
 contents+0.020
ometer+0.020
Neg logit contributions
 films-0.032
 concert-0.032
 crowds-0.031
 pian-0.031
 announcements-0.030
Token But
Token ID887
Feature activation-3.74e-03

Pos logit contributions
ucer+0.027
urt+0.026
gy+0.026
 contents+0.025
lock+0.025
Neg logit contributions
 movies-0.024
 crowds-0.023
 concert-0.023
 films-0.023
 pian-0.022
Token her
Token ID607
Feature activation+7.39e-04

Pos logit contributions
 concert+0.006
 movies+0.006
 films+0.006
 pian+0.006
 crowds+0.006
Neg logit contributions
urt-0.007
lock-0.007
 contents-0.007
ometer-0.006
ucer-0.006
Token Mom
Token ID11254
Feature activation-1.67e-02

Pos logit contributions
urt+0.001
 contents+0.001
lock+0.001
ometer+0.001
gy+0.001
Neg logit contributions
 concert-0.002
 films-0.002
 pian-0.002
 crowds-0.002
 movies-0.002
Token said
Token ID531
Feature activation+1.04e-04

Pos logit contributions
 concert+0.029
 movies+0.027
 films+0.027
 pian+0.027
 crowds+0.026
Neg logit contributions
urt-0.027
 contents-0.027
lock-0.027
ucer-0.026
gy-0.026
Token no
Token ID645
Feature activation+5.04e-03

Pos logit contributions
lock+0.000
urt+0.000
ometer+0.000
 Finally+0.000
 contents+0.000
Neg logit contributions
 concert-0.000
 films-0.000
 crowds-0.000
 movies-0.000
 pian-0.000
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
urt+0.008
lock+0.008
 contents+0.007
gy+0.007
ometer+0.007
Neg logit contributions
 concert-0.009
 films-0.009
 movies-0.009
 crowds-0.009
 pian-0.009
Token She
Token ID1375
Feature activation-1.68e-03

Pos logit contributions
urt+0.020
lock+0.020
 contents+0.019
ucer+0.019
gy+0.019
Neg logit contributions
 films-0.022
 concert-0.021
 crowds-0.021
 movies-0.021
 pian-0.021
Token said
Token ID531
Feature activation-8.25e-04

Pos logit contributions
 concert+0.003
 films+0.003
 movies+0.003
 crowds+0.003
 pian+0.003
Neg logit contributions
urt-0.003
lock-0.002
 contents-0.002
gy-0.002
ometer-0.002
Token Tom
Token ID4186
Feature activation-4.04e-04

Pos logit contributions
urt+0.008
lock+0.007
 contents+0.007
gy+0.007
 Finally+0.007
Neg logit contributions
 concert-0.008
 films-0.008
 pian-0.008
 crowds-0.008
 movies-0.008
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
 films+0.001
 concert+0.001
 movies+0.001
 crowds+0.001
 pian+0.001
Neg logit contributions
urt-0.001
lock-0.001
 contents-0.001
ometer-0.001
ucer-0.001
Token
Token ID198
Feature activation-2.55e-02

Pos logit contributions
 concert+0.026
 films+0.025
 pian+0.025
 crowds+0.025
 movies+0.025
Neg logit contributions
urt-0.016
 Finally-0.015
lock-0.015
ometer-0.015
gy-0.015
Token
Token ID198
Feature activation-1.96e-02

Pos logit contributions
 concert+0.056
 films+0.055
 crowds+0.055
 pian+0.054
 movies+0.054
Neg logit contributions
urt-0.032
lock-0.030
 Finally-0.030
ometer-0.029
 contents-0.028
Token"
Token ID1
Feature activation+9.79e-03

Pos logit contributions
 films+0.041
 concert+0.041
 crowds+0.041
 pian+0.039
 movies+0.039
Neg logit contributions
urt-0.026
lock-0.025
 Finally-0.024
 contents-0.023
ometer-0.023
TokenWow
Token ID22017
Feature activation-9.52e-03

Pos logit contributions
urt+0.014
lock+0.014
 contents+0.013
 Finally+0.013
ucer+0.013
Neg logit contributions
 films-0.019
 concert-0.019
 crowds-0.018
 pian-0.018
 movies-0.018
Token,
Token ID11
Feature activation+5.44e-03

Pos logit contributions
 concert+0.017
 films+0.017
 movies+0.017
 crowds+0.016
 pian+0.016
Neg logit contributions
urt-0.015
lock-0.015
 contents-0.014
gy-0.014
ometer-0.014
Token that
Token ID326
Feature activation+2.12e-02

Pos logit contributions
urt+0.009
lock+0.008
 contents+0.008
ometer+0.008
gy+0.008
Neg logit contributions
 films-0.010
 concert-0.010
 crowds-0.010
 movies-0.009
 pian-0.009
Token is
Token ID318
Feature activation+1.53e-02

Pos logit contributions
urt+0.039
lock+0.038
ometer+0.037
gy+0.037
 contents+0.036
Neg logit contributions
 concert-0.033
 films-0.033
 movies-0.033
 crowds-0.032
 pian-0.032
Token a
Token ID257
Feature activation+1.21e-02

Pos logit contributions
urt+0.026
lock+0.025
ometer+0.024
gy+0.024
 contents+0.024
Neg logit contributions
 films-0.026
 movies-0.026
 concert-0.026
 crowds-0.025
 pian-0.025
Token pretty
Token ID2495
Feature activation+1.47e-02

Pos logit contributions
urt+0.017
lock+0.016
 contents+0.016
gy+0.016
ometer+0.015
Neg logit contributions
 concert-0.025
 films-0.024
 movies-0.024
 crowds-0.024
 pian-0.023
Token the
Token ID262
Feature activation-7.46e-03

Pos logit contributions
urt+0.003
gy+0.003
 job+0.003
lock+0.003
 contents+0.003
Neg logit contributions
aw-0.004
 pian-0.004
 announcements-0.004
 crowds-0.004
 films-0.004
Token movies
Token ID6918
Feature activation+1.09e-02

Pos logit contributions
 films+0.011
 concert+0.011
 crowds+0.011
 pian+0.011
 announcements+0.011
Neg logit contributions
urt-0.013
lock-0.013
 contents-0.013
ometer-0.013
gy-0.013
Token.
Token ID13
Feature activation-8.15e-03

Pos logit contributions
urt+0.015
lock+0.015
 contents+0.014
ometer+0.014
gy+0.014
Neg logit contributions
 films-0.021
 concert-0.021
 crowds-0.021
 pian-0.021
 movies-0.020
Token Ann
Token ID5506
Feature activation-1.98e-02

Pos logit contributions
 concert+0.015
 films+0.015
 movies+0.014
 crowds+0.014
 pian+0.014
Neg logit contributions
urt-0.013
lock-0.013
 contents-0.012
gy-0.012
ometer-0.012
Tokenabel
Token ID9608
Feature activation-4.85e-02

Pos logit contributions
 crowds+0.038
 films+0.037
 concert+0.037
 pian+0.035
 dolphins+0.035
Neg logit contributions
urt-0.030
gy-0.030
 contents-0.028
 Finally-0.028
sy-0.028
Tokenle
Token ID293
Feature activation-4.61e-02

Pos logit contributions
 crowds+0.074
 films+0.069
 pian+0.067
 movies+0.065
 concert+0.065
Neg logit contributions
urt-0.093
sy-0.092
lock-0.091
gy-0.089
 contents-0.088
Token was
Token ID373
Feature activation-1.32e-02

Pos logit contributions
 crowds+0.102
 films+0.102
 concert+0.101
 pian+0.099
 movies+0.099
Neg logit contributions
gy-0.052
urt-0.051
 contents-0.048
lock-0.047
 Finally-0.045
Token very
Token ID845
Feature activation+7.17e-03

Pos logit contributions
 concert+0.028
 films+0.027
 crowds+0.026
 movies+0.026
 pian+0.026
Neg logit contributions
urt-0.017
lock-0.017
gy-0.016
 contents-0.016
 Finally-0.016
Token excited
Token ID6568
Feature activation+1.81e-02

Pos logit contributions
urt+0.010
lock+0.010
 contents+0.010
ometer+0.009
gy+0.009
Neg logit contributions
 concert-0.014
 films-0.014
 movies-0.014
 crowds-0.014
 pian-0.014
Token and
Token ID290
Feature activation+2.43e-03

Pos logit contributions
urt+0.028
 contents+0.025
lock+0.025
 job+0.024
gy+0.024
Neg logit contributions
 films-0.036
La-0.035
 pian-0.035
 concert-0.035
 announcements-0.034
Token said
Token ID531
Feature activation+5.08e-03

Pos logit contributions
urt+0.004
 contents+0.004
ucer+0.004
lock+0.004
ometer+0.004
Neg logit contributions
 concert-0.004
 movies-0.004
 crowds-0.004
 films-0.004
 pian-0.004
Token see
Token ID766
Feature activation-2.76e-03

Pos logit contributions
 concert+0.016
 films+0.015
 movies+0.015
 pian+0.015
 crowds+0.015
Neg logit contributions
urt-0.012
ometer-0.011
 contents-0.011
lock-0.011
ucer-0.011
Token a
Token ID257
Feature activation-5.24e-03

Pos logit contributions
 concert+0.005
 announcements+0.005
 crowds+0.005
 films+0.005
 pian+0.005
Neg logit contributions
urt-0.004
 hammer-0.003
 contents-0.003
lock-0.003
gy-0.003
Token pony
Token ID26902
Feature activation+5.71e-03

Pos logit contributions
 films+0.010
 crowds+0.010
 announcements+0.010
 concert+0.010
 pian+0.010
Neg logit contributions
urt-0.007
ometer-0.006
lock-0.006
 contents-0.006
 job-0.006
Token.
Token ID13
Feature activation+6.58e-03

Pos logit contributions
urt+0.010
 contents+0.010
ometer+0.010
lock+0.010
ucer+0.009
Neg logit contributions
 concert-0.010
 films-0.009
 movies-0.009
 crowds-0.009
 pian-0.009
Token She
Token ID1375
Feature activation+1.65e-03

Pos logit contributions
urt+0.011
ucer+0.011
 contents+0.010
lock+0.010
gy+0.010
Neg logit contributions
 movies-0.011
 films-0.011
 crowds-0.011
 concert-0.011
 pian-0.010
Token kne
Token ID24813
Feature activation-4.71e-02

Pos logit contributions
urt+0.002
lock+0.002
gy+0.002
 contents+0.002
ometer+0.002
Neg logit contributions
 films-0.003
 concert-0.003
 crowds-0.003
 movies-0.003
 pian-0.003
Tokeneled
Token ID18449
Feature activation-9.51e-03

Pos logit contributions
 concert+0.070
 films+0.068
 movies+0.067
 crowds+0.066
 pian+0.066
Neg logit contributions
urt-0.092
lock-0.089
gy-0.089
 contents-0.087
ometer-0.086
Token down
Token ID866
Feature activation-8.82e-03

Pos logit contributions
 concert+0.019
 films+0.019
 movies+0.019
 pian+0.019
 crowds+0.018
Neg logit contributions
urt-0.013
lock-0.013
gy-0.013
 contents-0.012
ometer-0.012
Token and
Token ID290
Feature activation+1.94e-03

Pos logit contributions
 concert+0.013
 films+0.013
 movies+0.013
 crowds+0.013
 pian+0.012
Neg logit contributions
urt-0.017
lock-0.016
 contents-0.016
gy-0.016
ometer-0.016
Token looked
Token ID3114
Feature activation-1.59e-02

Pos logit contributions
urt+0.003
lock+0.003
gy+0.003
 Finally+0.003
ometer+0.003
Neg logit contributions
 films-0.004
 concert-0.004
 pian-0.003
 movies-0.003
 crowds-0.003
Token out
Token ID503
Feature activation-2.27e-02

Pos logit contributions
 concert+0.030
 films+0.028
 movies+0.028
 pian+0.028
La+0.028
Neg logit contributions
urt-0.025
 Finally-0.024
 tight-0.024
 contents-0.023
ometer-0.023
Token you
Token ID345
Feature activation-1.56e-02

Pos logit contributions
 concert+0.064
 films+0.063
 crowds+0.061
 pian+0.061
 movies+0.060
Neg logit contributions
urt-0.054
lock-0.053
 contents-0.051
gy-0.050
 Finally-0.050
Token want
Token ID765
Feature activation-2.66e-02

Pos logit contributions
 concert+0.035
 films+0.034
 pian+0.033
 movies+0.033
 crowds+0.033
Neg logit contributions
urt-0.018
lock-0.018
gy-0.017
 Finally-0.016
 contents-0.016
Token to
Token ID284
Feature activation-1.72e-02

Pos logit contributions
 concert+0.056
 films+0.056
 pian+0.055
 crowds+0.055
 announcements+0.054
Neg logit contributions
urt-0.031
lock-0.031
 contents-0.030
ometer-0.028
 Finally-0.028
Token play
Token ID711
Feature activation-2.24e-02

Pos logit contributions
 films+0.035
 concert+0.035
 pian+0.034
 crowds+0.034
 movies+0.034
Neg logit contributions
urt-0.023
lock-0.022
gy-0.022
 contents-0.021
ometer-0.020
Token with
Token ID351
Feature activation-1.79e-03

Pos logit contributions
 pastor+0.049
 announcements+0.049
 crowds+0.048
 strangers+0.046
 pian+0.046
Neg logit contributions
lock-0.024
gy-0.024
urt-0.024
 job-0.023
 contents-0.022
Token me
Token ID502
Feature activation-4.77e-02

Pos logit contributions
 concert+0.003
 crowds+0.002
 films+0.002
 pian+0.002
 announcements+0.002
Neg logit contributions
urt-0.003
gy-0.003
 contents-0.003
lock-0.003
inky-0.003
Token?"
Token ID1701
Feature activation-1.12e-02

Pos logit contributions
 announcements+0.081
 crowds+0.079
 concert+0.078
 films+0.078
 pian+0.077
Neg logit contributions
urt-0.081
lock-0.079
gy-0.079
ometer-0.076
ucer-0.073
Token
Token ID198
Feature activation-1.10e-02

Pos logit contributions
 concert+0.028
 films+0.028
 pian+0.027
 dolphins+0.027
 announcements+0.026
Neg logit contributions
 Finally-0.011
urt-0.010
lock-0.010
ometer-0.009
gy-0.008
Token
Token ID198
Feature activation-1.64e-02

Pos logit contributions
 concert+0.022
 films+0.022
 crowds+0.022
 movies+0.022
 pian+0.022
Neg logit contributions
urt-0.015
lock-0.015
 contents-0.014
ometer-0.014
gy-0.014
TokenTom
Token ID13787
Feature activation-3.40e-03

Pos logit contributions
 concert+0.023
 films+0.023
 movies+0.021
 crowds+0.021
 pian+0.021
Neg logit contributions
urt-0.034
lock-0.033
 contents-0.032
gy-0.031
ometer-0.031
Token nods
Token ID34833
Feature activation-1.29e-02

Pos logit contributions
 concert+0.006
 films+0.006
 pian+0.006
 announcements+0.006
 musicians+0.006
Neg logit contributions
urt-0.005
 Finally-0.005
 trunk-0.005
 tight-0.005
gy-0.005
Token.
Token ID13
Feature activation-6.49e-03

Pos logit contributions
urt+0.002
lock+0.002
gy+0.002
 contents+0.002
ometer+0.002
Neg logit contributions
 concert-0.002
 films-0.002
 movies-0.002
 pian-0.002
 crowds-0.002
Token Maybe
Token ID6674
Feature activation-1.80e-02

Pos logit contributions
 films+0.012
 concert+0.012
 movies+0.012
 pian+0.012
 crowds+0.012
Neg logit contributions
urt-0.010
 contents-0.010
lock-0.009
gy-0.009
ucer-0.009
Token we
Token ID356
Feature activation+5.09e-03

Pos logit contributions
 concert+0.033
 films+0.032
 crowds+0.031
 movies+0.031
 pian+0.031
Neg logit contributions
lock-0.029
urt-0.028
 contents-0.027
gy-0.026
ometer-0.026
Token should
Token ID815
Feature activation-2.36e-02

Pos logit contributions
urt+0.009
 contents+0.008
ucer+0.008
lock+0.008
ometer+0.008
Neg logit contributions
 concert-0.009
 films-0.008
 movies-0.008
 crowds-0.008
 pian-0.008
Token ask
Token ID1265
Feature activation+4.46e-03

Pos logit contributions
 concert+0.047
 movies+0.046
 films+0.045
 crowds+0.044
 pian+0.044
Neg logit contributions
urt-0.034
lock-0.033
 contents-0.032
ometer-0.032
ucer-0.032
Token someone
Token ID2130
Feature activation-4.88e-02

Pos logit contributions
urt+0.007
lock+0.007
 contents+0.007
gy+0.007
 Finally+0.007
Neg logit contributions
 concert-0.008
 films-0.007
 pian-0.007
 crowds-0.007
 movies-0.007
Token for
Token ID329
Feature activation-2.35e-02

Pos logit contributions
 concert+0.091
 films+0.087
 pian+0.087
 crowds+0.087
 movies+0.085
Neg logit contributions
urt-0.074
lock-0.073
ometer-0.070
 contents-0.069
gy-0.069
Token help
Token ID1037
Feature activation-1.30e-02

Pos logit contributions
 concert+0.049
 films+0.048
 movies+0.047
 crowds+0.047
 pian+0.047
Neg logit contributions
urt-0.032
lock-0.030
 contents-0.030
gy-0.029
ometer-0.029
Token."
Token ID526
Feature activation+7.41e-03

Pos logit contributions
 concert+0.027
 films+0.027
 pian+0.026
 crowds+0.026
 movies+0.026
Neg logit contributions
urt-0.017
lock-0.016
 contents-0.016
gy-0.016
ometer-0.015
Token
Token ID198
Feature activation-8.41e-03

Pos logit contributions
urt+0.011
lock+0.011
 contents+0.011
gy+0.010
ometer+0.010
Neg logit contributions
 concert-0.014
 films-0.014
 movies-0.014
 crowds-0.014
 pian-0.014
Token
Token ID198
Feature activation-7.24e-03

Pos logit contributions
 concert+0.017
 films+0.017
 crowds+0.016
 movies+0.016
 pian+0.016
Neg logit contributions
urt-0.012
lock-0.011
 contents-0.011
gy-0.011
ometer-0.011
Token
Token ID198
Feature activation-1.67e-02

Pos logit contributions
 movies+0.018
 crowds+0.017
aw+0.017
 films+0.017
 concert+0.017
Neg logit contributions
ucer-0.019
 contents-0.019
lock-0.018
sy-0.017
urt-0.017
Token
Token ID198
Feature activation-1.42e-02

Pos logit contributions
 concert+0.026
 movies+0.025
 films+0.025
La+0.024
 pian+0.024
Neg logit contributions
urt-0.029
 contents-0.029
ometer-0.028
gy-0.028
ucer-0.028
TokenOne
Token ID3198
Feature activation+5.60e-03

Pos logit contributions
aw+0.022
 movies+0.022
 concert+0.021
La+0.021
 pian+0.021
Neg logit contributions
ometer-0.024
ucer-0.024
 contents-0.024
urt-0.023
gy-0.023
Token day
Token ID1110
Feature activation-8.54e-03

Pos logit contributions
urt+0.007
lock+0.007
 contents+0.007
gy+0.007
ometer+0.006
Neg logit contributions
 concert-0.012
 films-0.012
 movies-0.012
 crowds-0.012
 pian-0.012
Token his
Token ID465
Feature activation-1.18e-02

Pos logit contributions
 concert+0.012
 movies+0.011
 films+0.011
 crowds+0.011
 pian+0.011
Neg logit contributions
urt-0.017
 contents-0.016
ometer-0.016
lock-0.016
ucer-0.016
Token mum
Token ID25682
Feature activation-4.98e-02

Pos logit contributions
 concert+0.026
 films+0.025
 movies+0.025
 pian+0.024
 crowds+0.024
Neg logit contributions
urt-0.015
lock-0.014
 contents-0.014
gy-0.014
ucer-0.013
Token took
Token ID1718
Feature activation-1.36e-02

Pos logit contributions
 films+0.094
 concert+0.092
 crowds+0.091
 movies+0.090
 pian+0.090
Neg logit contributions
urt-0.078
lock-0.075
gy-0.074
 contents-0.071
 Finally-0.069
Token him
Token ID683
Feature activation-8.40e-03

Pos logit contributions
 concert+0.015
 films+0.015
 movies+0.014
 crowds+0.014
 pian+0.014
Neg logit contributions
urt-0.031
lock-0.030
 contents-0.030
gy-0.030
ometer-0.029
Token to
Token ID284
Feature activation-8.96e-06

Pos logit contributions
 concert+0.015
 films+0.015
 movies+0.014
 crowds+0.014
 pian+0.014
Neg logit contributions
urt-0.014
lock-0.013
 contents-0.013
ometer-0.013
gy-0.013
Token the
Token ID262
Feature activation-5.60e-03

Feature not active on prev token;
no predictions were affected.
Token park
Token ID3952
Feature activation+6.77e-03

Pos logit contributions
 concert+0.009
 movies+0.009
 films+0.009
 crowds+0.009
 pian+0.009
Neg logit contributions
urt-0.010
lock-0.010
 contents-0.009
gy-0.009
ucer-0.009