This is a variable key for ConversationAlign’s lookup database.
Inspect how we constructed the database by linking here:
https://reilly-lab.github.io/ConversationAlign_LookupHowTo_Jun25.html
## [1] "word" "emo_anger"
## [3] "emo_anger_rescale" "emo_anxiety"
## [5] "emo_anxiety_rescale" "emo_arousal_b24"
## [7] "emo_arousal_b24_rescale" "emo_boredom"
## [9] "emo_boredom_rescale" "emo_confusion"
## [11] "emo_confusion_rescale" "emo_excitement"
## [13] "emo_excitement_rescale" "emo_guilt"
## [15] "emo_guilt_rescale" "emo_happiness"
## [17] "emo_happiness_rescale" "emo_intensity"
## [19] "emo_intensity_recale" "emo_sadness"
## [21] "emo_sadness_rescale" "emo_trust"
## [23] "emo_trust_rescale" "emo_valence_b24"
## [25] "emo_valence_b24_rescale" "lex_AoA"
## [27] "lex_AoA_rescale" "lex_freqlg10"
## [29] "lex_freqlg10_rescale" "lex_n_morphemes"
## [31] "lex_n_senses" "lex_n_senses_rescale"
## [33] "phon_n_lett" "phon_nsyll"
## [35] "sem_auditory" "sem_auditory_rescale"
## [37] "sem_cnc_b24" "sem_cnc_b24_rescale"
## [39] "sem_cnc_v2013" "sem_cnc_v2013_rescale"
## [41] "sem_diversity" "sem_diversity_rescale"
## [43] "sem_neighbors" "sem_neighbors_rescale"
## [45] "sem_visual" "sem_visual_rescale"
## [1] 156203 46
Description: All words and word fragment tokens in the lookup
database. every word is converted to lowercase.
Words with Complete
Coverage across N-Dimensions = 156203
Words with Partial Coverage
across N-Dimensions = 0
Description: raw embedding-based distance from each target word in
the database to the base word ‘anger’
Source: affectvec (Raji &
da Melo, 2020)
Possible Range/Scale : -1 to 1
Actual
Range/Scale: -0.3755, 1
Words with Complete Coverage across
N-Dimensions (N) = 76427
Missing Observations (N) = 76427
Description: rescaled embedding-based distance from each target word
in the database to the base word ‘anger’ Source: affectvec (Raji &
da Melo, 2020)
Possible Range/Scale : 0 to 9
Actual
Range/Scale: 0, 9
Words with Complete Coverage across N-Dimensions
(N) = 76427
Missing Observations (N) = 76427
Description: raw embedding-based distance to the base word ‘anxiety’
Source: affectvec (Raji & da Melo, 2020)
Possible
Range/Scale : -1 to 1
Actual Range/Scale: -0.3577, 1
Words
with Complete Coverage across N-Dimensions = 76427
Missing
Observations (N) = 76427
Description: rescaled embedding-based distance to the base word
‘anxiety’
Source: affectvec (Raji & da Melo, 2020), rescaled
using scales package
Possible Range/Scale : 0 to 9
Actual
Range/Scale: 0, 9
Words with Complete Coverage across N-Dimensions
= 76427
Missing Observations (N) = 76427
Description: Physiological arousal norms generated by LLM raw from
original article
Source: Brysbaert, M., Martínez, G., &
Reviriego, P. (2025)
Actual Range/Scale: NA, NA
Words with
Complete Coverage across N-Dimensions = 126392
Physiological arousal norms generated by LLM raw from original
article
Source: Brysbaert, M., Martínez, G., & Reviriego, P.
(2025) rescaled using scales package
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Words with Complete Coverage across
N-Dimensions = 126392
Missing Observations (N) = 126392
Description: raw embedding-based distance to the base word ‘bordeom’
Source: affectvec (Raji & da Melo, 2020)
Possible
Range/Scale: -1 to 1
Actual Range/Scale: -0.3686, 1
Words with
Complete Coverage across N-Dimensions = 76427
Missing Observations
(N) = 76427
Description: scaled embedding-based distance to the base word
‘boredom’
Source: affectvec (Raji & da Melo, 2020)
rescaled using scales package
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Words with Complete Coverage across
N-Dimensions = 76427
Missing Observations (N) = 76427
Description: scaled embedding-based distance to the base word
‘boredom’
Source: affectvec (Raji & da Melo, 2020)
Possible Range/Scale: -1 to 1
Actual Range/Scale: -0.2495, 1
Words with Complete Coverage across N-Dimensions = 76427
Missing
Observations (N) = 76427
Description: scaled embedding-based distance to the base word
‘boredom’
Source: affectvec (Raji & da Melo, 2020)
rescaled using scales package
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Words with Complete Coverage across
N-Dimensions = 76427
Missing Observations (N) = 76427
Description: scaled embedding-based distance to the base word
‘boredom’
Source: affectvec (Raji & da Melo, 2020)
Possible Range/Scale: -1 to 1
Actual Range/Scale: -0.4485, 1
Complete Observations (N) = 76427
Missing Observations (N) = 76427
Description: scaled embedding-based distance to the base word
‘boredom’
Source: affectvec (Raji & da Melo, 2020)
rescaled using scales package
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Words with Complete Coverage across
N-Dimensions = 76427
Missing Observations (N) = 76427
Description: scaled embedding-based distance to the base word ‘guilt’
Source: affectvec (Raji & da Melo, 2020)
Possible
Range/Scale: -1 to 1
Actual Range/Scale: -0.349, 1
Complete
Observations (N) = 76427
Missing Observations (N) = 76427
Description: scaled embedding-based distance to the base word ‘guilt’
Source: affectvec (Raji & da Melo, 2020) rescaled using
scales package
Possible Range/Scale: 0 to 9
Actual
Range/Scale: 0, 9
Complete Observations (N) = 76427
Missing
Observations (N) = 76427
Description: scaled embedding-based distance to the base word
‘happiness’
Source: affectvec (Raji & da Melo, 2020)
Possible Range/Scale: -1 to 1
Actual Range/Scale: -0.4146, 1
Complete Observations (N) = 76427
Missing Observations (N) = 76427
Description: scaled embedding-based distance to the base word
‘happiness’
Source: affectvec (Raji & da Melo, 2020)
rescaled 0-9 using scales package
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Complete Observations (N) = 76427
Missing Observations (N) = 76427
Description: Valence z-scored then absolute value from
Source:
Possible Range/Scale: -1 to 1
Actual Range/Scale: 0.0024843,
3.1479237
Complete Observations (N) = 126392
Missing
Observations (N) = 126392
Description: scaled embedding-based distance to the base word
‘intensity’
Source: affectvec (Raji & da Melo, 2020)
rescaled 0-9 using scales package
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Complete Observations (N) = 126392
Missing Observations (N) = 126392
Description: scaled embedding-based distance to the base word
‘sadness’
Source: affectvec (Raji & da Melo, 2020) raw value
Possible Range/Scale: -1 to 1
Actual Range/Scale: -0.3479, 1
Complete Observations (N) = 76427
Missing Observations (N) =
76427
Description: scaled embedding-based distance to the base word
‘boredom’
Source: affectvec (Raji & da Melo, 2020)
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Complete
Observations (N) = 76427
Missing Observations (N) = 76427
Description: scaled embedding-based distance to the base word
‘boredom’
Source: affectvec (Raji & da Melo, 2020)
Possible Range/Scale: -1 to 1
Actual Range/Scale: -0.3884, 1
Complete Observations (N) = 76427
Missing Observations (N) = 76427
Description: scaled embedding-based distance to the base word ‘trust’
Source: affectvec (Raji & da Melo, 2020) rescaled 0-9
using scales package
Possible Range/Scale: 0 to 9
Actual
Range/Scale: 0, 9
Complete Observations (N) = 76427
Missing
Observations (N) = 76427
Description: Valence (pleasantness) as rated by LLM raw score
reported in article.
Source: Martinez et al (2025)
Actual
Range/Scale: 0.97, 9
Complete Observations (N) = 126392
Missing Observations (N) = 126392
Description: Valence (pleasantness) as rated by LLM scaled to 0 to
9.
Source: Martinez et al (2025)
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Complete Observations (N) = 126392
Missing Observations (N) = 126392
Description: Human rated estimates of the age of acquisition at which
a word was acquired
Source: Kuperman et al (2012)
Actual
Range/Scale: 1.58, 25
Complete Observations (N) = 31104
Missing Observations (N) = 31104
Description: Age of acquisition estimate rescaled 0 to 9
Source:
Kuperman et al (2012) rescaled 0 to 9
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Complete Observations (N) = 31104
Missing Observations (N) = 31104
Description: Lexical frequency (log10) normalized to X-per-million
words of English
Source: Brysbaert and New (2009)
Actual
Range/Scale: 0.4771, 6.3293
Complete Observations (N) = 60384
Missing Observations (N) = 60384
Description: Lexical frequency (log10) normalized to X-per-million
words of English, rescaled 0-9
Source: Brysbaert and New (2009)
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Complete Observations (N) = 60384
Missing Observations (N) = 60384
Description: Number of morphemes for each word
Source:
Sánchez-Gutiérrez, C. H., Mailhot, H., Deacon, S. H., & Wilson, M.
A. (2018)
Description: Number of different word senses (an index of polysemy)
Source: WordNet https://wordnet.princeton.edu/ Miller (1995)
Actual
Range/Scale: 0, 75
Words with Complete Coverage across N-Dimensions
= 36408
Missing Observations (N) = 36408
Description: Number of different word senses (an index of polysemy)
rescaled 0-9
Source: WordNet https://wordnet.princeton.edu/ Miller (1995)
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Words
with Complete Coverage across N-Dimensions = 36408
Missing
Observations (N) = 36408
Description: Number of phonemes per word
Source: Balota et al as
indexed by the SCOPE database X
Possible Range/Scale: 1 to infinity
Actual Range/Scale: 1, 40
Words with Complete Coverage across
N-Dimensions = 156203
Missing Observations (N) = 156203
Description: Number of syllables per word
Source: ELP norms per
Balota et al (2007) as indexed by SCOPE norms
Possible Range/Scale:
1 to infinity
Actual Range/Scale: 0, 9
Words with Complete
Coverage across N-Dimensions = 31104
Missing Observations (N) =
31104
Description: Rated auditory salience for each word by real humans
Source: Lancaster Sensorimotor Norms (Lynott et al, 2020)
Actual Range/Scale: 0, 5
Complete Observations (N) = 39329
Missing Observations (N) = 39329
Description: Auditory salience of each word as rated by humans
rescaled 0 to 9
Source: Lancaster Sensorimotor Norms (Lynott et al,
2020)
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Words with Complete Coverage across N-Dimensions =39329
Missing Observations (N) = 39329
Description: Raw concreteness rating for each word as rated by an LLM
Source: Martinez et al (2025) BRM
Actual Range/Scale: 0, 9
Words with Complete Coverage across N-Dimensions = 126392
Missing Observations (N) = 126392
Description: Scaled concreteness rating for each word as rated by an
LLM from 0-9
Source: Martinez et al (2025) BRM recsaled using
Scales package
Possible Range/Scale: 0 to 9
Actual
Range/Scale: 0, 9
Words with Complete Coverage across N-Dimensions
= 126392
Missing Observations (N) = 126392
Description: Word concreteness as rated by real humans
Source:
Brysbaert et al 2013
Actual Range/Scale: 1.04, 5
Words with
Complete Coverage across N-Dimensions = 39576
Missing Observations
(N) = 39576
Description: Concreteness for each word as rated by humans rescaled
to 0 to 9
Source: Brysbaert, M., Warriner, A. B., & Kuperman,
V. (2014)
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0,
9
Words with Complete Coverage across N-Dimensions =39576
Missing Observations (N) = 39576
Description: Number of contexts a word appears in (as derived by
embeddings)
Source: Hoffman, P., Ralph, M. A. L., & Rogers, T.
T. (2013) as indexed in SCOPE database
Actual Range/Scale:
0.1574494, 2.4131099
Words with Complete Coverage across
N-Dimensions = 29613
Missing Observations (N) = 29613
Description: Number of contexts a word appears in (as derived by
embeddings) rescaled 0 to 9
Source: Hoffman, P., Ralph, M. A. L.,
& Rogers, T. T. (2013) as indexed in SCOPE database
Possible
Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Words with
Complete Coverage across N-Dimensions = 29613
Missing Observations
(N) = 29613
Description: Number of semantic neighbors within a threshold by
HIDEX
Source: Shaoul, C., & Westbury, C. (2010) as indexed
within the SCOPE database
Actual Range/Scale: 0, 9931
Words
with Complete Coverage across N-Dimensions = 45871
Missing
Observations (N) = 45871
Description: XX
Source: XX
Possible Range/Scale: 0 to 9
Actual Range/Scale: 0, 9
Words with Complete Coverage across
N-Dimensions = 76427
Missing Observations (N) = 76427
Description: Rated visual salience for each word by real humans
Source: Lancaster Sensorimotor Norms (Lynott et al, 2020)
Possible
Range/Scale: XX
Actual Range/Scale: 0, 9
Words with Complete
Coverage across N-Dimensions = 39329
Missing Observations (N) =
39329
Description: Visual salience as derived from the Lancaster Norms
(Lynott et al, 2020)
Source: Lancaster Sensorimotor Norms rescaled
using SCALES package (Lynott et al, 2020)
Possible Range/Scale: 0
to 9
Actual Range/Scale: 0, 9
Complete Observations (N) =
39329
Missing Observations (N) = 39329