<!DOCTYPE article
PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD with MathML3 v1.3 20210610//EN" "JATS-archivearticle1-3-mathml3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="1.3" xml:lang="en" article-type="research-article"><?properties open_access?><?properties manuscript?><processing-meta base-tagset="archiving" mathml-version="3.0" table-model="xhtml" tagset-family="jats"><restricted-by>pmc</restricted-by></processing-meta><front><journal-meta><journal-id journal-id-type="nlm-journal-id">1254142</journal-id><journal-id journal-id-type="pubmed-jr-id">6777</journal-id><journal-id journal-id-type="nlm-ta">Psychol Med</journal-id><journal-id journal-id-type="iso-abbrev">Psychol Med</journal-id><journal-title-group><journal-title>Psychological medicine</journal-title></journal-title-group><issn pub-type="ppub">0033-2917</issn><issn pub-type="epub">1469-8978</issn></journal-meta><article-meta><article-id pub-id-type="pmid">34154682</article-id><article-id pub-id-type="pmc">8692489</article-id><article-id pub-id-type="doi">10.1017/S0033291721002294</article-id><article-id pub-id-type="manuscript">NIHMS1729093</article-id><article-categories><subj-group subj-group-type="heading"><subject>Article</subject></subj-group></article-categories><title-group><article-title>World Trade Center responders in their own words: predicting PTSD symptom trajectories with AI-based language analyses of interviews</article-title></title-group><contrib-group><contrib contrib-type="author"><name><surname>Son</surname><given-names>Youngseo</given-names></name><contrib-id contrib-id-type="orcid">http://orcid.org/0000-0001-9370-0126</contrib-id><xref rid="A1" ref-type="aff">1</xref></contrib><contrib contrib-type="author"><name><surname>Clouston</surname><given-names>Sean A. P.</given-names></name><xref rid="A2" ref-type="aff">2</xref><xref rid="A3" ref-type="aff">3</xref></contrib><contrib contrib-type="author"><name><surname>Kotov</surname><given-names>Roman</given-names></name><xref rid="A4" ref-type="aff">4</xref></contrib><contrib contrib-type="author"><name><surname>Eichstaedt</surname><given-names>Johannes C.</given-names></name><xref rid="A5" ref-type="aff">5</xref></contrib><contrib contrib-type="author"><name><surname>Bromet</surname><given-names>Evelyn J.</given-names></name><xref rid="A4" ref-type="aff">4</xref></contrib><contrib contrib-type="author"><name><surname>Luft</surname><given-names>Benjamin J.</given-names></name><xref rid="A6" ref-type="aff">6</xref></contrib><contrib contrib-type="author"><name><surname>Schwartz</surname><given-names>H. Andrew</given-names></name><xref rid="A1" ref-type="aff">1</xref></contrib></contrib-group><aff id="A1"><label>1</label>Department of Computer Science, Stony Brook University, New York, USA</aff><aff id="A2"><label>2</label>Program in Public Health, Stony Brook University, New York, USA</aff><aff id="A3"><label>3</label>Department of Family, Population and Preventive Medicine, Stony Brook University, New York, USA</aff><aff id="A4"><label>4</label>Department of Psychiatry, Stony Brook University, New York, USA</aff><aff id="A5"><label>5</label>Department of Psychology &#x00026; Institute for Human-Centered A.I., Stanford University, Stanford, California, USA</aff><aff id="A6"><label>6</label>Department of Medicine, Stony Brook University, New York, USA</aff><author-notes><corresp id="CR1"><bold>Author for correspondence:</bold> Youngseo Son, <email>yson@cs.stonybrook.edu</email></corresp></author-notes><pub-date pub-type="nihms-submitted"><day>5</day><month>8</month><year>2021</year></pub-date><pub-date pub-type="ppub"><month>2</month><year>2023</year></pub-date><pub-date pub-type="epub"><day>22</day><month>6</month><year>2021</year></pub-date><pub-date pub-type="pmc-release"><day>22</day><month>12</month><year>2022</year></pub-date><volume>53</volume><issue>3</issue><fpage>918</fpage><lpage>926</lpage><permissions><license><ali:license_ref xmlns:ali="http://www.niso.org/schemas/ali/1.0/" specific-use="textmining" content-type="ccbylicense">https://creativecommons.org/licenses/by/4.0/</ali:license_ref><license-p>This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (<ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">http://creativecommons.org/licenses/by/4.0/</ext-link>), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.</license-p></license></permissions><abstract id="ABS1"><sec id="S1"><title>Background.</title><p id="P1">Oral histories from 9/11 responders to the World Trade Center (WTC) attacks provide rich narratives about distress and resilience. Artificial Intelligence (AI) models promise to detect psychopathology in natural language, but they have been evaluated primarily in non-clinical settings using social media. This study sought to test the ability of AI-based language assessments to predict PTSD symptom trajectories among responders.</p></sec><sec id="S2"><title>Methods.</title><p id="P2">Participants were 124 responders whose health was monitored at the Stony Brook WTC Health and Wellness Program who completed oral history interviews about their initial WTC experiences. PTSD symptom severity was measured longitudinally using the PTSD Checklist (PCL) for up to 7 years post-interview. AI-based indicators were computed for depression, anxiety, neuroticism, and extraversion along with dictionary-based measures of linguistic and interpersonal style. Linear regression and multilevel models estimated associations of AI indicators with concurrent and subsequent PTSD symptom severity (significance adjusted by false discovery rate).</p></sec><sec id="S3"><title>Results.</title><p id="P3">Cross-sectionally, greater depressive language (<italic toggle="yes">&#x003b2;</italic> = 0.32; <italic toggle="yes">p</italic> = 0.049) and first-person singular usage (<italic toggle="yes">&#x003b2;</italic> = 0.31; <italic toggle="yes">p</italic> = 0.049) were associated with increased symptom severity. Longitudinally, anxious language predicted future worsening in PCL scores (<italic toggle="yes">&#x003b2;</italic> = 0.30; <italic toggle="yes">p</italic> = 0.049), whereas first-person plural usage (<italic toggle="yes">&#x003b2;</italic> = &#x02212;0.36; <italic toggle="yes">p</italic> = 0.014) and longer words usage (<italic toggle="yes">&#x003b2;</italic> = &#x02212;0.35; <italic toggle="yes">p</italic> = 0.014) predicted improvement.</p></sec><sec id="S4"><title>Conclusions.</title><p id="P4">This is the first study to demonstrate the value of AI in understanding PTSD in a vulnerable population. Future studies should extend this application to other trauma exposures and to other demographic groups, especially under-represented minorities.</p></sec></abstract><kwd-group><kwd>9/11</kwd><kwd>depression</kwd><kwd>disaster responders</kwd><kwd>language-based assessments</kwd><kwd>oral history interviews</kwd><kwd>posttraumatic stress disorder</kwd><kwd>risk factors</kwd><kwd>trajectories</kwd><kwd>World Trade Center</kwd></kwd-group></article-meta></front><body><sec id="S5"><title>Introduction</title><p id="P5">The 9/11 attacks on the World Trade Center (WTC) left thousands of casualties and drastically affected the lives of hundreds of thousands of New Yorkers and others nearby (<xref rid="R3" ref-type="bibr">Bergen, 2019</xref>). Many affected were those dedicating their lives to the safety of others &#x02013; police, firefighters, emergency medical personnel, and other responders to the crisis. There has been a significant physical and mental burden of the events that day which has left many struggling with their health as they age (<xref rid="R16" ref-type="bibr">Durkin, 2018</xref>; <xref rid="R32" ref-type="bibr">Luft et al., 2012</xref>). Many responders suffer from PTSD which has been either worsening, staying the same, or gradually improving over time (<xref rid="R14" ref-type="bibr">Cukor et al., 2011</xref>; <xref rid="R38" ref-type="bibr">Neria et al., 2010</xref>).</p><p id="P6">Massive disasters, such as the WTC attacks, can affect a large number of people at the same time and usually occur within a relatively short period. Illuminating the risk and protective factors that reliably predict future reductions or increases in PTSD symptoms can lead to improved understanding, more accessible in-clinic guidance on patient&#x02019;s well-being, and more immediate care for those involved in catastrophic events. Previous work has made major headway in establishing longitudinal associations of exposure severity, demographic characteristics, and job duties with health trajectories of WTC responders (<xref rid="R10" ref-type="bibr">Bromet et al., 2016</xref>; <xref rid="R12" ref-type="bibr">Cone et al., 2015</xref>; <xref rid="R43" ref-type="bibr">Pietrzak et al., 2014</xref>). However, additional approaches to risk assessment are needed to more rapidly and thoroughly differentiate those at greatest risk in situations where structured approaches to data collection are not possible.</p><p id="P7">Recently, Artificial Intelligence (AI)-based techniques have begun to show promise for quickly and accurately assessing mental health from human behavioral data, such as language use patterns. For example, from social media language, researchers have predicted those more prone to post-partum depression (<xref rid="R15" ref-type="bibr">De Choudhury, Kiciman, Dredze, Coppersmith, &#x00026; Kumar, 2016</xref>), those more likely to receive a clinical diagnosis of depression (<xref rid="R18" ref-type="bibr">Eichstaedt et al., 2018</xref>) or those appearing at greatest risk of suicide (<xref rid="R34" ref-type="bibr">Matero et al., 2019</xref>; <xref rid="R59" ref-type="bibr">Zirikly, Resnik, Uzuner, &#x00026; Hollingshead, 2019</xref>). For PTSD in particular, although studies have yet to validate models in a clinical setting, past work has shown that AI-based language techniques can distinguish Twitter users that have publicly disclosed a diagnosis of the condition from random selections of users (e.g. <xref rid="R13" ref-type="bibr">Coppersmith, Dredze, &#x00026; Harman, 2014</xref>; <xref rid="R44" ref-type="bibr">Preotiuc-Pietro, Sap, Schwartz, &#x00026; Ungar, 2015</xref>; <xref rid="R46" ref-type="bibr">Reece et al. 2017</xref>).</p><p id="P8">AI-based language analyses are strong candidates to improve risk assessments in a clinical setting because they enable a much wider range of responses (like an interview) whereby a score can be objectively determined (e.g. like standardized assessment). Once an AI-based technique is created (i.e. it is &#x02018;pre-trained&#x02019;), it will always yield the same and robust score for a given input. While these language-based assessments were studied with social media texts for PTSD based on self-disclosures (<xref rid="R13" ref-type="bibr">Coppersmith et al., 2014</xref>; <xref rid="R44" ref-type="bibr">Preotiuc-Pietro et al., 2015</xref>), few works have investigated how effective these approaches are with the language outside social media for predicting PTSD severity evaluated in the clinical settings, especially in a longitudinal study context for PTSD future trajectories. In all such cases, modern machine learning techniques are used to automatically extract and quantify patterns of language from hundreds to thousands of words per individual, which are then used to automatically produce a mental health or risk score. As compared to traditional questionnaire-based assessments, such approaches seem to suffer from fewer self-report biases (<xref rid="R58" ref-type="bibr">Youyou, Kosinski, &#x00026; Stillwell, 2015</xref>) and generally leverage a larger amount of information per person (<xref rid="R28" ref-type="bibr">Kern et al., 2016</xref>). However, using such approaches in a clinical setting requires patients to share private information from social media pages, and requires that each participant has a substantial amount of data to share in the first place.</p><p id="P9">In this study, we present the first evaluation of AI-based mental health assessments from language (henceforth language-based assessments) to predict future PTSD symptom trajectories of patients monitored in a clinical setting. Rather than social media, we utilize transcripts of oral history interviews from responders to the 9/11 attacks. We first examine whether existing (&#x02018;pre-trained&#x02019;) predictive models (most of which were trained on social media) produce assessments associated with PTSD symptoms scores close to the time of interview. We then compare these language assessments to other information available within a mental health clinical cohort (e.g. age, gender, occupation) to evaluate the additional benefit of the AI-based assessments. Lastly, we seek to quantify the predictive power of language-based indicators, in part to assess their potential suitability for informing personalized therapeutic approaches.</p></sec><sec id="S6"><title>Methods</title><sec id="S7"><title>Participants</title><p id="P10">The sample was derived from Stony Brook University&#x02019;s WTC Health &#x00026; Wellness program, funded by the Centers for Disease Control and Prevention, that provides ongoing monitoring of WTC responders. A total of <italic toggle="yes">N</italic> = 124 responders underwent an oral history interview and agreed to allow researchers to merge data from the transcript of the oral history with information in their health monitoring records. <xref rid="R22" ref-type="bibr">Hammock et al. (2019)</xref> provide an extensive summary of data collection methods. Briefly, oral history participants were primarily recruited <italic toggle="yes">via</italic> word of mouth and by flyers posted in the Stony Brook WTC Wellness Program.</p><p id="P11">Each interview lasted approximately 1 h. It covered the responders&#x02019; memory of 9/11 attacks and disaster relief efforts, their work activities at the site, experiences and sensations over the days and weeks that followed, and how the WTC disaster ultimately impacted their lives since. Interviews were conducted by clinical staff with diverse healthcare backgrounds after a comprehensive orientation in conducting guided interviews and eliciting details relevant to the key topics to be covered. Responders were encouraged to discuss what was most important to them. Interviews were completed between 2010 and 2018.</p><p id="P12">In order to restrict our sample responders who were not new to the WTC Health Program, the analysis sample was restricted to participants who had at least one valid score on the PTSD Checklist (PCL; <xref rid="R5" ref-type="bibr">Blanchard, Jones-Alexander, Buckley, &#x00026; Forneris, 1996</xref>) within 2 years of their interview, and at least one pre-interview PCL yielding an analysis sample of <italic toggle="yes">N</italic> = 113 responders. The few newer health program enrollees who were excluded from this study were qualitatively different, having only had just begun care (and potential PTSD treatment) at interview time. Furthermore, to study longitudinal trajectories post-interview, we focused on the subset of individuals with at least three post-interview mental health assessments at least 2 years after the interview (<italic toggle="yes">N</italic> = 75). The demographic characteristics of the study samples are listed in <xref rid="T1" ref-type="table">Table 1</xref>. The demographic ratio of gender and police remained similar (&#x0003c;4% difference) after we limited the sample to responders who met the criteria for our language analysis; 92% of the subset group were male and 49% were police; their mean age at interview was 53.</p></sec><sec id="S8"><title>Ethics</title><p id="P13">This study was approved by the Stony Brook University Institutional Review Board. The participants provided written informed consent.</p></sec><sec id="S9"><title>Language-based assessments</title><p id="P14">We automatically derived nine variables assessing the responders&#x02019; language during the interviews: four AI-based assessments of psychological traits (expression of anxiety, depression, neuroticism, and extraversion), three lexicon-based assessments of language style (first-person singular pronouns, plural pronouns, and use of articles), and two meta variables describing counts of words and lengths of words. The process to get these variables consisted of three steps: text transcription, conversion of text to linguistic features, and application of AI-based models or <italic toggle="yes">lexica</italic>.</p><p id="P15">Audio of each interview was transcribed into text using <italic toggle="yes">TranscribeMe</italic>, a HIPAA-approved transcription service. Each time the responders spoke, transcribers labelled the time and the words mentioned. The text of each interview was converted into &#x02018;features&#x02019; &#x02013; quantitative values describing the content of the interview language &#x02013; and then input into: (a) four AI-based assessments of psychological traits, (b) three lexicon-based assessments of language style, and (c) two meta-variable extractions describing counts of words and lengths of words. All analyses, described below, were performed using the Differential Language Analysis ToolKit (DLATK) (<xref rid="R51" ref-type="bibr">Schwartz et al., 2017</xref>).</p></sec><sec id="S10"><title>Conversion into linguistic features</title><p id="P16">The models we used required up to three types of linguistic features: (1) relative frequencies of words and phrases, (2) binary indicators of words and phrases, and (3) topic prevalence scores. Words and phrases are sequences of 1&#x02013;3 words in a row. Their relative frequency was recorded by <italic toggle="yes">DLATK</italic> by counting each word or phrase mentioned and dividing by the total number of words or phrases mentioned by the responder. The binary indicator for words and phrases simply indicated whether each word or phrase shows up (1) or not (0). The tokenizer built into the <italic toggle="yes">DLATK</italic> package was used to extract words per interview.</p><p id="P17">Topics are weighted groups of semantically-related words, often derived through a statistical process called latent Dirichlet allocation (<xref rid="R6" ref-type="bibr">Blei, Ng, &#x00026; Jordan, 2003</xref>). Once derived, topics can be applied to textual data to scoring, ranging from 0 to 1, indicating how frequently each group of words was mentioned (<xref rid="R28" ref-type="bibr">Kern et al., 2016</xref>). We use a standard set of 2000 topics introduced by <xref rid="R48" ref-type="bibr">Schwartz et al. (2013a</xref>, <xref rid="R49" ref-type="bibr">2013b</xref>), which has frequently been applied in the psychological domain including most recently in <xref rid="R17" ref-type="bibr">Eichstaedt et al. (2020)</xref>. Once extracted, features were mapped to nine coarse-grained scores as described below and used for analyses herein.</p></sec><sec id="S11"><title>AI-based psychological traits (4)</title><p id="P18">The AI-based assessments input linguistic features such as words, phrases, and topics, and map them to psychological constructs (<xref rid="R28" ref-type="bibr">Kern et al., 2016</xref>; <xref rid="R52" ref-type="bibr">Schwartz &#x00026; Ungar, 2015</xref>). We focused on existing pre-trained models for constructs known to be related to our mental health outcomes: (1) neuroticism and (2) extraversion (<xref rid="R41" ref-type="bibr">Park et al., 2015</xref>; <xref rid="R49" ref-type="bibr">Schwartz et al., 2013b</xref>) &#x02013; the two factors of the five-factor model known to relate negatively to depression and anxiety-related mental health conditions (<xref rid="R19" ref-type="bibr">Farmer et al., 2002</xref>; <xref rid="R26" ref-type="bibr">Jorm et al., 2000</xref>; <xref rid="R27" ref-type="bibr">Jylh&#x000e4; &#x00026; Isomets&#x000e4;, 2006</xref>), as well as (3) degree of depression and (4) anxiousness (<xref rid="R50" ref-type="bibr">Schwartz et al., 2014</xref>) &#x02013; subfacets of emotional stability which correspond to negative high arousal language (anxiousness) and negative low arousal language (depressive). These models were trained on large and diverse populations (approximately sample sizes of <italic toggle="yes">N</italic> = 65 000 for neuroticism and extraversion and <italic toggle="yes">N</italic> = 29 000 for degrees of depression and anxiousness). They utilize the linguistic features of previously mentioned words and phrases as well as topics as input and output continuous scores for each of the four constructs. They have been validated against standard questionnaire-based measures as well as convergent factors and external criteria under a range of situations (<xref rid="R28" ref-type="bibr">Kern et al., 2016</xref>; <xref rid="R34" ref-type="bibr">Matero et al., 2019</xref>; <xref rid="R41" ref-type="bibr">Park et al., 2015</xref>; <xref rid="R50" ref-type="bibr">Schwartz et al., 2014</xref>). However, the predictive validity of these models has yet to be assessed in clinical interview settings. Importantly, to guard against overfitting, no adjustments were made to the models, and thus this can be considered an evaluation of the models exactly as they were presented in their respective papers (<xref rid="R41" ref-type="bibr">Park et al., 2015</xref>; <xref rid="R50" ref-type="bibr">Schwartz et al., 2014</xref>).</p></sec><sec id="S12"><title>Function word lexicon features (3)</title><p id="P19">We extracted word frequencies of terms in LIWC 2015 categories (<xref rid="R42" ref-type="bibr">Pennebaker, Boyd, Jordan, &#x00026; Blackburn, 2015</xref>) and calculated categories for an interview with each responder. Due to the relatively low sample size, we focused on the function word categories that were most prevalent and then selected those that had a literature-suggested association with mental health:</p><list list-type="bullet" id="L2"><list-item><p id="P20">First-person singular: depressed, low status, personal, emotional, informal. Previously correlated positively with neuroticism, depression, and anxiety (<xref rid="R2" ref-type="bibr">Baddeley &#x00026; Singer, 2008</xref>; <xref rid="R24" ref-type="bibr">Holtzman, 2017</xref>; <xref rid="R47" ref-type="bibr">Rude, Gortner, &#x00026; Pennebaker, 2004</xref>) and negatively with life satisfaction (<xref rid="R48" ref-type="bibr">Schwartz et al., 2013a</xref>).</p></list-item><list-item><p id="P21">First-person plural: high status, socially connected to group. Previously correlated negatively with depression and anxiety (<xref rid="R45" ref-type="bibr">Ramirez-Esparza, Chung, Kacewicz, &#x00026; Pennebaker, 2008</xref>) and positively correlated with life satisfaction (<xref rid="R48" ref-type="bibr">Schwartz et al., 2013a</xref>) along with the cognition and psychological well-being variables of our interest (<xref rid="R55" ref-type="bibr">Tausczik &#x00026; Pennebaker, 2010</xref>).</p></list-item><list-item><p id="P22">Articles: use of concrete nouns, interest in objects and things (<xref rid="R55" ref-type="bibr">Tausczik &#x00026; Pennebaker, 2010</xref>).</p></list-item></list></sec><sec id="S13"><title>Language meta features (2)</title><list list-type="bullet" id="L4"><list-item><p id="P23">Average word length is known to be associated with higher cognitive (<xref rid="R29" ref-type="bibr">Khawaja, Chen, &#x00026; Marcus, 2010</xref>), conceptual complexity (<xref rid="R31" ref-type="bibr">Lewis &#x00026; Frank, 2016</xref>), education, and social class (<xref rid="R23" ref-type="bibr">Hartley, Pennebaker, &#x00026; Fox, 2003</xref>; <xref rid="R55" ref-type="bibr">Tausczik &#x00026; Pennebaker, 2010</xref>). PTSD is known to impair cognitive processing and impose a cognitive burden (e.g. through intrusive memories and thought suppression) (<xref rid="R39" ref-type="bibr">Nixon, Nehmy, &#x00026; Seymour, 2007</xref>).</p></list-item><list-item><p id="P24">Word counts: We also recorded total word counts, the number by which all lexica above were normalized. Given the interviews were all an hour long, this is a proxy for the rate of speech from each participant.</p></list-item></list></sec><sec id="S14"><title>Mental health outcomes</title><p id="P25">The PTSD Symptom Checklist for DSM-IV PTSD (PCL) was used to assess PTSD severity in the past month (<xref rid="R10" ref-type="bibr">Bromet et al., 2016</xref>; <xref rid="R12" ref-type="bibr">Cone et al., 2015</xref>; <xref rid="R43" ref-type="bibr">Pietrzak et al., 2014</xref>). We chose the PCL closest to the interview date (all within 2 years) for concurrent analyses (average initial PCL score = 33.7; <sc>s.d.</sc> = 16.2). Following previous work which suggests that a fixed cutoff might not be optimally established for all cases (<xref rid="R1" ref-type="bibr">Andrykowski, Cordova, Studts, &#x00026; Miller, 1998</xref>; <xref rid="R7" ref-type="bibr">Bovin et al., 2016</xref>), we focused on continuous values. Post-interview PCL scores were used to create trajectories as described under <italic toggle="yes">trajectory prediction</italic> below.</p></sec><sec id="S15"><title>Statistical analysis</title><p id="P26">We used linear regression coefficient of the target explanatory variable (PCL score) as its correlation strength and multivariable adjustment for possible confounders (age, gender, occupation, and years after 9/11) to acquire the unique effects of language-based assessments. On average, the interviews were conducted 10.31 years (<sc>s.d.</sc> = 1.43) after the event. We controlled for days since 9/11 in the analyses. Since we explored many language assessments at once, we considered coefficients significant if their Benjamini&#x02013;Hochburg adjusted <italic toggle="yes">p</italic> values were &#x0003c;0.05.</p></sec><sec id="S16"><title>Concurrent evaluation</title><p id="P27">We processed the interviews of responders who had PTSD assessments three or more interviews after the closest dates to interviews and at least one assessment before the closest dates to interviews for the stable future trajectory modeling. For our cross-sectional correlation analysis linking language-based assessments with PTSD, we selected PCL scores of WTC responders as their cross-sectional PTSD symptom severity at the time of the interview (Interview PCL), and it is controlled for future PTSD trajectories as a baseline.</p></sec><sec id="S17"><title>Trajectory prediction</title><p id="P28">For modeling the trajectory of PCL scores of each responder, we fit an ordinary least squares regression model with an intercept to the post-interview PCL scores as a function of time <italic toggle="yes">t</italic>:
<disp-formula id="FD1">
<label>(1)</label>
<mml:math id="M1" display="block"><mml:mrow><mml:msub><mml:mtext>PCL</mml:mtext><mml:mrow><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>&#x003b2;</mml:mi><mml:mrow><mml:mn>0</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003b2;</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:msubsup><mml:mi>&#x003f5;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math>
</disp-formula>
where PCL scores were measured at (<italic toggle="yes">t</italic>) years after the interviews, then use the <italic toggle="yes">&#x003b2;</italic><sub>1<italic toggle="yes">i</italic></sub> coefficient as a future PCL score trajectory of a responder (<italic toggle="yes">i</italic>). Then, for the person-level prediction over <italic toggle="yes">&#x003b2;</italic><sub>1<italic toggle="yes">i</italic></sub> using the language-based assessments controlling the age, gender, occupation, and years between the interview and 9/11 of the responder <italic toggle="yes">i</italic> as following:
<disp-formula id="FD2">
<label>(2)</label>
<mml:math id="M2" display="block"><mml:mrow><mml:msub><mml:mi>&#x003b2;</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>&#x003b1;</mml:mi><mml:mn>0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003b1;</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003b1;</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>&#x02026;</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003b1;</mml:mi><mml:mn>6</mml:mn></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mn>6</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msubsup><mml:mi>&#x003f5;</mml:mi><mml:mi>i</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>i</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math>
</disp-formula>
where <italic toggle="yes">x</italic><sub>1</sub>: language-based assessments, <italic toggle="yes">x</italic><sub>2</sub>: baseline PCL, <italic toggle="yes">x</italic><sub>3&#x02026;6</sub>: age, gender, occupation, years after 9/11 (all valuables standardized). Using <xref rid="FD1" ref-type="disp-formula">equation (1)</xref> and <xref rid="FD2" ref-type="disp-formula">(2)</xref>, we use the following joint model:
<disp-formula id="FD3">
<label>(3)</label>
<mml:math id="M3" display="block"><mml:mrow><mml:msub><mml:mtext>PCL</mml:mtext><mml:mrow><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>&#x003b2;</mml:mi><mml:mrow><mml:mn>0</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>&#x003b1;</mml:mi><mml:mn>0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003b1;</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003b1;</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>&#x02026;</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x003b1;</mml:mi><mml:mn>5</mml:mn></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mn>5</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:msubsup><mml:mi>&#x003f5;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math>
</disp-formula>
and evaluate an effect size of each language-based assessment as its predictive power for future PCL trajectories of the responders (<xref rid="F1" ref-type="fig">Fig. 1</xref>).</p><p id="P29">For the longitudinal trajectories post-interview, we focused on the subset of individuals with at least three post-interview PCL assessments occurring at least 2 years following the interview (<italic toggle="yes">N</italic> = 75). Sample demographics are reported in <xref rid="T1" ref-type="table">Table 1</xref>. Counting the interview, these criteria allowed the trajectories to be derived from at least four data points per participant, with the last assessment occurring on average (mean) 5.5 years (<sc>s.d.</sc> = 1.3) after the interview. By using this trajectory-based approach, all assessments available were used and aligned with their dates of administration.</p></sec></sec><sec id="S18"><title>Results</title><p id="P30">Most responders were male (90%) and half (48%) were police (see <xref rid="T1" ref-type="table">Table 1</xref> for sample characteristics). Their median age at their interviews was 55 (53 for the longitudinal cohort). The median number of words across the interviews was 10 254.</p><sec id="S19"><title>Associations between language-based assessments and PTSD severity</title><p id="P31"><xref rid="T2" ref-type="table">Table 2</xref> shows the linear regression analyses linking language-based assessments and PCL scores among responders around their interview dates. Higher PCL scores were significantly associated with language-based assessments consistent with anxious, depressive, and neuroticism. High scores were also associated with greater use of first-person singular and more total count of words in their interviews (<italic toggle="yes">r</italic> &#x0003e; 0.22). Conversely, higher scores were also associated with less extraversion language patterns, first-person plurals, and articles. Results remained unchanged after adjusting for age, gender, occupation, and years after 9/11 despite some effects from covariates (&#x0003c;0.07).</p></sec><sec id="S20"><title>Trajectory analysis</title><p id="P32"><xref rid="T3" ref-type="table">Table 3</xref> shows that language-based assessments of the oral histories significantly predicted responders&#x02019; PCL trajectories during the follow-up period. First, we calculated linear regression coefficient effect sizes when we modeled PCL score trajectories with language features only (first column of <xref rid="T3" ref-type="table">Table 3</xref>). Then we add the control variables into the model (the second column). Although the general directions of correlations were the same both with and without controls, suppression effects of control variables increased the effect sizes for anxiety and first-person plural usage (<xref rid="F2" ref-type="fig">Fig. 2</xref>).</p></sec></sec><sec id="S21"><title>Discussion</title><p id="P33">The goal of the present study was to examine whether AI-based language assessments developed in non-clinical contexts were reliably (1) associated with PTSD on self-reported questionnaires, and (2) able to predict the extent to which one&#x02019;s symptoms would get better or worse (trajectory) within a long-term clinical setting. The study found support for the view that language-based assessments could be reliably used in a clinical setting when processing naturalistic interviews: specifically, we found that language-based features were indicative of current functioning (supporting aim 1) and that language-based features could predict future PTSD symptom trajectories (supporting aim 2). This study, for the first time, suggested that AI assessments of interviews from a clinical sample not focused specifically on the topic of mental health could be used to identify features indicative of a person&#x02019;s current and future mental well-being.</p><sec id="S22"><title>Implications</title><p id="P34">There are three major implications from this work. First, AI-based assessments of interviews were associated with the assessment of mental health scores concurrently, supporting the first aim. Specifically, depressed language was associated with greater PTSD symptom severity, as is self-focused language. This corroborates the clinical conceptualization of PTSD as involving self-focused rumination that maintains PTSD symptoms over time (<xref rid="R36" ref-type="bibr">Michael, Halligan, Clark, &#x00026; Ehlers, 2007</xref>).</p><p id="P35">Second, the use of more anxious language predicted increased PTSD symptoms in the future, even when adjusting for age, gender, occupation, and days since the 9/11 disaster. This suggests that while immediate PTSD severity is associated with low mood, a worsening of PTSD is determined by anxiety, rather than depression. These results may suggest that while immediate PTSD severity is reflected in affective experience, it may be the cognitive processes associated with anxiety (worry, rumination) that underlie future increases in PTSD symptoms. This dovetails with the accounts of PTSD that understand it to be maintained through rumination and worry (<xref rid="R36" ref-type="bibr">Michael et al., 2007</xref>).</p><p id="P36">Third, the use of more first-person plural pronouns (&#x02018;we&#x02019;, &#x02018;us&#x02019;, &#x02018;our&#x02019;) predicted decreased PTSD symptoms in the future when adjusting for the confound variables. This supports research showing that social support is an important affordance that can buffer against and help alleviate the psychopathological load of a traumatic life event. Previous findings have suggested that processes associated with chronic sympathetic arousal (which include the chronic activation of the HPA-axis in states of hypervigilance) may be &#x02018;buffered against&#x02019; by social interactions with kin and close others (e.g. <xref rid="R35" ref-type="bibr">McGowan, 2002</xref>).</p></sec><sec id="S23"><title>Depressive language and current PTSD severity</title><p id="P37">Depressive language (<italic toggle="yes">&#x003b2;</italic> = 0.32; <italic toggle="yes">p</italic> = 0.049) and high usage of first-person singulars (<italic toggle="yes">&#x003b2;</italic> = 0.31; <italic toggle="yes">p</italic> = 0.049) were most highly correlated with high PCL scores even after accounting for age, gender, days since 9/11, and responder occupation. These associations were consistent with findings from prior studies showing an association of PTSD symptoms with increased risk of depression (<xref rid="R9" ref-type="bibr">Breslau, Davis, Peterson, &#x00026; Schultz, 2000</xref>; <xref rid="R54" ref-type="bibr">Stander, Thomsen, &#x00026; Highfill-McRoy, 2014</xref>). Similarly, high usage of first-person singular in messages is negatively correlated with life satisfaction (<xref rid="R48" ref-type="bibr">Schwartz et al., 2013a</xref>). Anxious and neurotic language patterns had strong positive correlations with PCL scores, which align with a previous study that identified avoidance and hyperarousal symptoms as frequently reported symptoms (<xref rid="R10" ref-type="bibr">Bromet et al., 2016</xref>). For the associations between personality traits and PTSD symptoms, previous studies found that low extraversion and high neuroticism are associated with an increased risk of PTSD (<xref rid="R8" ref-type="bibr">Breslau, Davis, &#x00026; Andreski, 1995</xref>; <xref rid="R20" ref-type="bibr">Fauerbach, Lawrence, Schmidt, Munster, &#x00026; Costa, 2000</xref>), and we observed the same patterns of our language-based extraversion and neuroticism with PTSD severity.</p></sec><sec id="S24"><title>Predictors of PTSD symptom trajectories</title><p id="P38">We examined language-based assessments as a predictor of responders&#x02019; PCL trajectories after their interviews. Usage of first-person plurals and longer average word lengths were most highly correlated with improvement in PTSD in all cases, whether adjusting for baseline PCL-score and demographics or not. For other language-based assessments, coefficient effect sizes increased when we accounted for confounding due to the suppression effects mainly attributable to PCL scores, age at interview, and gender (see <xref rid="SD1" ref-type="supplementary-material">online Supplementary Table S1</xref>).</p><p id="P39">Furthermore, we analyzed potential mediation effects of marital status to address whether differences in the use of pronouns were merely reflecting marital status although previous work does not suggest such a relationship (Simmons, Gordon, &#x00026; Chambless, 2015). Our results showed that these two types of language-based assessments predicted beyond marital status as their correlations remained statistically significant after adjusting for both controls and marital status (see <xref rid="SD1" ref-type="supplementary-material">online Supplementary Tables S2</xref> and <xref rid="SD1" ref-type="supplementary-material">S3</xref>). This demonstrates that these linguistic markers capture an orientation toward the self and others over and above marital status.</p></sec><sec id="S25"><title>Social support</title><p id="P40">In line with an extensive literature in psychology, we observed the use of &#x02018;I&#x02019; <italic toggle="yes">v.</italic> &#x02018;we&#x02019; pronouns to mark classes of psychological processes that determined adjustment to and recovery from trauma. Previous work has related higher use of first-person singular pronouns (&#x02018;I&#x02019;-talk) self-focus (<xref rid="R11" ref-type="bibr">Carey et al., 2015</xref>); we found it correlated with high cross-sectional PTSD severity. On the other hand, we found high usage of first-person plural pronouns (&#x02018;we&#x02019;) to be associated with a decrease of PTSD symptoms in the future. Self-focused thinking has been identified as a transdiagnostic factor of PTSD and depressive symptoms marking an often maladaptive preoccupation with the self and negative experience (<xref rid="R4" ref-type="bibr">Birrer &#x00026; Michael, 2011</xref>; <xref rid="R25" ref-type="bibr">Ingram, 1990</xref>; <xref rid="R33" ref-type="bibr">Martin, 1985</xref>). The use of &#x02018;I&#x02019; pronouns, in turn, has previously been found to be a dependable marker of self-focus in natural language (<xref rid="R11" ref-type="bibr">Carey et al., 2015</xref>; <xref rid="R56" ref-type="bibr">Watkins &#x00026; Teasdale, 2001</xref>; <xref rid="R57" ref-type="bibr">Wegner &#x00026; Giuliano, 1980</xref>). Beyond mere self-focus, depression and negative affectivity have also been robustly associated with higher use of first person singular pronouns (<xref rid="R24" ref-type="bibr">Holtzman, 2017</xref>; <xref rid="R47" ref-type="bibr">Rude et al., 2004</xref>) and PTSD (<xref rid="R37" ref-type="bibr">Miragoli, Camisasca, &#x00026; Di Blasio, 2019</xref>); PTSD also with few &#x02018;we&#x02019; pronouns (<xref rid="R40" ref-type="bibr">Papini et al., 2015</xref>). Our study showed further evidence for these patterns: greater use of &#x02018;I&#x02019; pronouns positively correlated with severe cross-sectional PTSD symptoms, and high usage of &#x02018;we&#x02019; pronouns predicted decreasing PTSD symptoms in the future.</p></sec><sec id="S26"><title>Limitations</title><p id="P41">This was the first study to evaluate the relationship between automatic language-based assessments from interviews and PTSD symptoms of a trauma population, and there were several limitations. First, our sample covered a particular cohort of trauma survivors, those responding to the WTC disaster. WTC responders are predominantly male, and members of the monitoring population eligible to participate in this study were predominantly police officers. As such, this study relied on a sample that is similar to the rest of the WTC responder population (<xref rid="R10" ref-type="bibr">Bromet et al., 2016</xref>). Nevertheless, this may limit the generalizability of present findings to other occupations and demographic groups. Future research would need to investigate whether the results replicate to additional populations. Second, language-based assessment predicted future change in PTSD and suggested that cognitive and social risk processes may be involved, but mechanisms underpinning these predictive effects were not tested directly. Third, while our feature-based identification process was completed in a large database with ample capacity to train robust AI models, the present analysis had a relatively small sample size that could only be reliably used for application and was too small to retrain models for the current population. Future work in larger samples will be able to tailor AI-based assessments to specific populations and clinical questions substantially enhancing their predictive power.</p></sec><sec id="S27"><title>Potential use in clinical care</title><p id="P42">Clinical evaluation of PTSD symptoms in trauma-exposed patients is time-consuming and burdensome. Moreover, primary care providers often lack expertise to complete this assessment. Our results show that natural language can provide clinically useful information both for the detection of PTSD and the prediction of future symptom escalation. These methods can be applied to routine clinical interviews completed by staff without mental health expertise. Although oral history interviews used in this project were lengthy, previous research has shown that interactions as brief as 5 min (e.g. 200 words) can be sufficient to obtain reliable AI-based assessments (<xref rid="R28" ref-type="bibr">Kern et al., 2016</xref>). These assessments would not replace a psychiatric evaluation, but can be useful for screening in primary care and as an aid to psychiatrists, picking-up on diagnostic and prognostic features in language that may be missed clinically. Specific language-based risk factors could inform treatment selection, such as low social support, and may suggest group therapy or peer support interventions, whereas maladaptive cognitive styles suggest cognitive behavioral therapy.</p></sec></sec><sec id="S28"><title>Conclusion</title><p id="P43">We found automated AI-based assessments utilizing the language of WTC responders in their oral history interviews predicted their PTSD symptoms in both cross-sectional and longitudinal trajectory analyses. The patterns and the correlations from these studies should be examined cautiously, and may require independent confirmations from other WTC cohorts and across different types of exposures before general applications for PTSD treatments. Still, the patterns of language-based assessments consistent with previous findings in other settings and their strong statistical correlations provided unique insights and explanations beyond commonly known confounds or risk factors such as age, gender, occupation, marital status, or even questionnaire-based depression measures, suggesting support for clinicians toward more precise decisions. More generally, language-based assessments that capture individual digital phenotypes and distinctive linguistic markers from transcripts of interviews are veiy useful for investigating underlying causes of PTSD and may play a critical role as a supplement for enhancing personalized preventive care (<xref rid="R21" ref-type="bibr">Hamburg &#x00026; Collins, 2010</xref>) and more effective treatments for PTSD; they may even enable real-time screening or preventive measures with reduced costs and less therapist time for helping a large number of people exposed to large-scale traumatic events (e.g. natural disasters, WTC PTSD) similar to a previous online PTSD treatment (<xref rid="R30" ref-type="bibr">Lewis et al., 2017</xref>). Nevertheless, future studies with applying language-based assessment on larger samples will be required in order to more precisely validate their statistical significance and correlations, and even further studies into subphenotypes and more detailed categorizations of language-based assessments will lead to more diverse analysis with rich high-dimensional digital phenotypes.</p></sec><sec sec-type="supplementary-material" id="SM1"><title>Supplementary Material</title><supplementary-material id="SD1" position="float" content-type="local-data"><label>Supplementary Table S1</label><media xlink:href="NIHMS1729093-supplement-Supplementary_Table_S1.docx" id="d64e988" position="anchor"/></supplementary-material></sec></body><back><ack id="S29"><title>Acknowledgements.</title><p id="P44">The authors are extremely grateful to the WTC rescue and recovery workers, who gave of themselves so readily in response to the WTC attacks and agreed to participate in this ongoing research effort. We also thank the clinical staff of the World Trade Center Medical Monitoring and Treatment Programs for their dedication and the labor and community organizations for their continued support. Son and Schwartz were supported, in part, by NIH R01 AA028032-01 and, in part, by DARPA via grant #W911NF-20-1-0306 to Stony Brook University; the conclusions and opinions expressed are attributable only to the authors and should not be construed as those of DARPA or the U.S. Department of Defense. Clouston was supported, in part, by NIH R01 AG049953.</p></ack><fn-group><fn fn-type="COI-statement" id="FN1"><p id="P45"><bold>Conflict of interest.</bold> None.</p></fn><fn id="FN2"><p id="P46"><bold>Supplementary material.</bold> The supplementary material for this article can be found at <ext-link xlink:href="10.1017/S0033291721002294" ext-link-type="doi">https://doi.org/10.1017/S0033291721002294</ext-link>.</p></fn></fn-group><ref-list><title>References</title><ref id="R1"><mixed-citation publication-type="journal"><name><surname>Andrykowski</surname><given-names>MA</given-names></name>, <name><surname>Cordova</surname><given-names>MJ</given-names></name>, <name><surname>Studts</surname><given-names>JL</given-names></name>, &#x00026; <name><surname>Miller</surname><given-names>TW</given-names></name> (<year>1998</year>). <article-title>Posttraumatic stress disorder after treatment for breast cancer: Prevalence of diagnosis and use of the PTSD Checklist &#x02013; Civilian Version (PCL-C) as a screening instrument</article-title>. <source>Journal of Consulting and Clinical Psychology</source>, <volume>66</volume>(<issue>3</issue>), <fpage>586</fpage>.<pub-id pub-id-type="pmid">9642900</pub-id></mixed-citation></ref><ref id="R2"><mixed-citation publication-type="journal"><name><surname>Baddeley</surname><given-names>JL</given-names></name>, &#x00026; <name><surname>Singer</surname><given-names>JA</given-names></name> (<year>2008</year>). <article-title>Telling losses: Personality correlates and functions of bereavement narratives</article-title>. <source>Journal of Research in Personality</source>, <volume>42</volume> (<issue>2</issue>), <fpage>421</fpage>&#x02013;<lpage>438</lpage>.</mixed-citation></ref><ref id="R3"><mixed-citation publication-type="journal"><name><surname>Bergen</surname><given-names>PL</given-names></name> (<year>2019</year>). <source>September 11 attacks</source>. [Online; posted <day>10</day>-<month>September</month>-2019].</mixed-citation></ref><ref id="R4"><mixed-citation publication-type="journal"><name><surname>Birrer</surname><given-names>E</given-names></name>, &#x00026; <name><surname>Michael</surname><given-names>T</given-names></name> (<year>2011</year>). <article-title>Rumination in PTSD as well as in traumatized and non-traumatized depressed patients: A cross-sectional clinical study</article-title>. <source>Behavioural and Cognitive Psychotherapy</source>, <volume>39</volume>(<issue>4</issue>), <fpage>381</fpage>&#x02013;<lpage>397</lpage>.<pub-id pub-id-type="pmid">21457604</pub-id></mixed-citation></ref><ref id="R5"><mixed-citation publication-type="journal"><name><surname>Blanchard</surname><given-names>EB</given-names></name>, <name><surname>Jones-Alexander</surname><given-names>J</given-names></name>, <name><surname>Buckley</surname><given-names>TC</given-names></name>, &#x00026; <name><surname>Forneris</surname><given-names>CA</given-names></name> (<year>1996</year>). <article-title>Psychometric properties of the PTSD Checklist (PCL)</article-title>. <source>Behaviour Research and Therapy</source>, <volume>34</volume>(<issue>8</issue>), <fpage>669</fpage>&#x02013;<lpage>673</lpage>.<pub-id pub-id-type="pmid">8870294</pub-id></mixed-citation></ref><ref id="R6"><mixed-citation publication-type="journal"><name><surname>Blei</surname><given-names>DM</given-names></name>, <name><surname>Ng</surname><given-names>AY</given-names></name>, &#x00026; <name><surname>Jordan</surname><given-names>MI</given-names></name> (<year>2003</year>). <article-title>Latent Dirichlet allocation</article-title>. <source>Journal of Machine Learning Research</source>, <volume>3</volume>(<month>Jan</month>), <fpage>993</fpage>&#x02013;<lpage>1022</lpage>.</mixed-citation></ref><ref id="R7"><mixed-citation publication-type="journal"><name><surname>Bovin</surname><given-names>MJ</given-names></name>, <name><surname>Marx</surname><given-names>BP</given-names></name>, <name><surname>Weathers</surname><given-names>FW</given-names></name>, <name><surname>Gallagher</surname><given-names>MW</given-names></name>, <name><surname>Rodriguez</surname><given-names>P</given-names></name>, <name><surname>Schnurr</surname><given-names>PP</given-names></name>, &#x00026; <name><surname>Keane</surname><given-names>TM</given-names></name> (<year>2016</year>). <article-title>Psychometric properties of the PTSD checklist for diagnostic and statistical manual of mental disorders&#x02013;fifth edition (PCL-5) in veterans</article-title>. <source>Psychological Assessment</source>, <volume>28</volume>(<issue>11</issue>), <fpage>1379</fpage>.<pub-id pub-id-type="pmid">26653052</pub-id></mixed-citation></ref><ref id="R8"><mixed-citation publication-type="journal"><name><surname>Breslau</surname><given-names>N</given-names></name>, <name><surname>Davis</surname><given-names>GC</given-names></name>, &#x00026; <name><surname>Andreski</surname><given-names>P</given-names></name> (<year>1995</year>). <article-title>Risk factors for PTSD-related traumatic events: A prospective analysis</article-title>. <source>The American Journal of Psychiatry</source>, <volume>152</volume>(<issue>4</issue>), <fpage>529</fpage>&#x02013;<lpage>535</lpage>.<pub-id pub-id-type="pmid">7694900</pub-id></mixed-citation></ref><ref id="R9"><mixed-citation publication-type="journal"><name><surname>Breslau</surname><given-names>N</given-names></name>, <name><surname>Davis</surname><given-names>GC</given-names></name>, <name><surname>Peterson</surname><given-names>EL</given-names></name>, &#x00026; <name><surname>Schultz</surname><given-names>LR</given-names></name> (<year>2000</year>). <article-title>A second look at comorbidity in victims of trauma: The posttraumatic stress disorder &#x02013; major depression connection</article-title>. <source>Biological Psychiatry</source>, <volume>48</volume>(<issue>9</issue>), <fpage>902</fpage>&#x02013;<lpage>909</lpage>.<pub-id pub-id-type="pmid">11074228</pub-id></mixed-citation></ref><ref id="R10"><mixed-citation publication-type="journal"><name><surname>Bromet</surname><given-names>E</given-names></name>, <name><surname>Hobbs</surname><given-names>M</given-names></name>, <name><surname>Clouston</surname><given-names>S</given-names></name>, <name><surname>Gonzalez</surname><given-names>A</given-names></name>, <name><surname>Kotov</surname><given-names>R</given-names></name>, &#x00026; <name><surname>Luft</surname><given-names>B</given-names></name> (<year>2016</year>). <article-title>DSM-IV post-traumatic stress disorder among World Trade Center responders 11&#x02013;13 years after the disaster of 11 September 2001 (9/11)</article-title>. <source>Psychological Medicine</source>, <volume>46</volume>(<issue>4</issue>), <fpage>771</fpage>&#x02013;<lpage>783</lpage>.<pub-id pub-id-type="pmid">26603700</pub-id></mixed-citation></ref><ref id="R11"><mixed-citation publication-type="journal"><name><surname>Carey</surname><given-names>AL</given-names></name>, <name><surname>Brucks</surname><given-names>MS</given-names></name>, <name><surname>K&#x000fc;fner</surname><given-names>AC</given-names></name>, <name><surname>Holtzman</surname><given-names>NS</given-names></name>, <name><surname>Back</surname><given-names>MD</given-names></name>, <name><surname>Donnellan</surname><given-names>MB</given-names></name>, &#x02026; <name><surname>Mehl</surname><given-names>MR</given-names></name> (<year>2015</year>). <article-title>Narcissism and the use of personal pronouns revisited</article-title>. <source>Journal of Personality and Social Psychology</source>, <volume>109</volume>(<issue>3</issue>), <fpage>e1</fpage>.<pub-id pub-id-type="pmid">25822035</pub-id></mixed-citation></ref><ref id="R12"><mixed-citation publication-type="journal"><name><surname>Cone</surname><given-names>JE</given-names></name>, <name><surname>Li</surname><given-names>J</given-names></name>, <name><surname>Kornblith</surname><given-names>E</given-names></name>, <name><surname>Gocheva</surname><given-names>V</given-names></name>, <name><surname>Stellman</surname><given-names>SD</given-names></name>, <name><surname>Shaikh</surname><given-names>A</given-names></name>, &#x02026; <name><surname>Bowler</surname><given-names>RM</given-names></name> (<year>2015</year>). <article-title>Chronic probable PTSD in police responders in the world trade center health registry ten to eleven years after 9/11</article-title>. <source>American Journal of Industrial Medicine</source>, <volume>58</volume>(<issue>5</issue>), <fpage>483</fpage>&#x02013;<lpage>493</lpage>.<pub-id pub-id-type="pmid">25851164</pub-id></mixed-citation></ref><ref id="R13"><mixed-citation publication-type="journal"><name><surname>Coppersmith</surname><given-names>G</given-names></name>, <name><surname>Dredze</surname><given-names>M</given-names></name>, &#x00026; <name><surname>Harman</surname><given-names>C</given-names></name> (<year>2014</year>). <article-title>Quantifying mental health signals in Twitter</article-title>. In <source>Proceedings of the workshop on computational linguistics and clinical psychology: From linguistic signal to clinical reality</source>, pp. <fpage>51</fpage>&#x02013;<lpage>60</lpage>.</mixed-citation></ref><ref id="R14"><mixed-citation publication-type="journal"><name><surname>Cukor</surname><given-names>J</given-names></name>, <name><surname>Wyka</surname><given-names>K</given-names></name>, <name><surname>Mello</surname><given-names>B</given-names></name>, <name><surname>Olden</surname><given-names>M</given-names></name>, <name><surname>Jayasinghe</surname><given-names>N</given-names></name>, <name><surname>Roberts</surname><given-names>J</given-names></name>, &#x02026; <name><surname>Difede</surname><given-names>J</given-names></name> (<year>2011</year>). <article-title>The longitudinal course of PTSD among disaster workers deployed to the world trade center following the attacks of September 11th</article-title>. <source>Journal of Traumatic Stress</source>, <volume>24</volume>(<issue>5</issue>), <fpage>506</fpage>&#x02013;<lpage>514</lpage>.<pub-id pub-id-type="pmid">22095774</pub-id></mixed-citation></ref><ref id="R15"><mixed-citation publication-type="journal"><name><surname>De Choudhury</surname><given-names>M</given-names></name>, <name><surname>Kiciman</surname><given-names>E</given-names></name>, <name><surname>Dredze</surname><given-names>M</given-names></name>, <name><surname>Coppersmith</surname><given-names>G</given-names></name>, &#x00026; <name><surname>Kumar</surname><given-names>M</given-names></name> (<year>2016</year>). <article-title>Discovering shifts to suicidal ideation from mental health content in social media</article-title>. In <source>Proceedings of the 2016 CHI conference on human factors in computing systems</source>, pp. <fpage>2098</fpage>&#x02013;<lpage>2110</lpage>.</mixed-citation></ref><ref id="R16"><mixed-citation publication-type="journal"><name><surname>Durkin</surname><given-names>E</given-names></name> (<year>2018</year>). <source>September 11: nearly 10 000 people affected by &#x02018;cesspool of cancer&#x02019;</source>. [Online; posted <day>11</day>-<month>September</month>-2018].</mixed-citation></ref><ref id="R17"><mixed-citation publication-type="journal"><name><surname>Eichstaedt</surname><given-names>JC</given-names></name>, <name><surname>Kern</surname><given-names>ML</given-names></name>, <name><surname>Yaden</surname><given-names>DB</given-names></name>, <name><surname>Schwartz</surname><given-names>HA</given-names></name>, <name><surname>Giorgi</surname><given-names>S</given-names></name>, <name><surname>Park</surname><given-names>G</given-names></name>, &#x02026; <name><surname>Ungar</surname><given-names>LH</given-names></name> (<year>2020</year>). <article-title>Closed- and open-vocabulary approaches to text analysis: A review, quantitative comparison, and recommendations</article-title>. <source>Psychological Methods</source>. The preprint of this article is available at <comment><ext-link xlink:href="https://psy-arxiv.com/t52c6/" ext-link-type="uri">https://psy-arxiv.com/t52c6/</ext-link>.</comment> The DOI of this preprint is <pub-id pub-id-type="doi">10.31234/osf.io/t52c6</pub-id>.</mixed-citation></ref><ref id="R18"><mixed-citation publication-type="journal"><name><surname>Eichstaedt</surname><given-names>JC</given-names></name>, <name><surname>Smith</surname><given-names>RJ</given-names></name>, <name><surname>Merchant</surname><given-names>RM</given-names></name>, <name><surname>Ungar</surname><given-names>LH</given-names></name>, <name><surname>Crutchley</surname><given-names>P</given-names></name>, <name><surname>Preo&#x00163;iuc-Pietro</surname><given-names>D</given-names></name>, &#x02026; <name><surname>Schwartz</surname><given-names>HA</given-names></name> (<year>2018</year>). <article-title>Facebook language predicts depression in medical records</article-title>. <source>Proceedings of the National Academy of Sciences</source>, <volume>115</volume>(<issue>44</issue>), <fpage>11203</fpage>&#x02013;<lpage>11208</lpage>.</mixed-citation></ref><ref id="R19"><mixed-citation publication-type="journal"><name><surname>Farmer</surname><given-names>A</given-names></name>, <name><surname>Redman</surname><given-names>K</given-names></name>, <name><surname>Harris</surname><given-names>T</given-names></name>, <name><surname>Mahmood</surname><given-names>A</given-names></name>, <name><surname>Sadler</surname><given-names>S</given-names></name>, <name><surname>Pickering</surname><given-names>A</given-names></name>, &#x00026; <name><surname>McGuffin</surname><given-names>P</given-names></name> (<year>2002</year>). <article-title>Neuroticism, extraversion, life events and depression: The Cardiff Depression Study</article-title>. <source>The British Journal of Psychiatry</source>, <volume>181</volume>(<issue>2</issue>), <fpage>118</fpage>&#x02013;<lpage>122</lpage>.<pub-id pub-id-type="pmid">12151281</pub-id></mixed-citation></ref><ref id="R20"><mixed-citation publication-type="journal"><name><surname>Fauerbach</surname><given-names>JA</given-names></name>, <name><surname>Lawrence</surname><given-names>JW</given-names></name>, <name><surname>Schmidt</surname><given-names>CW</given-names><suffix>Jr</suffix></name>, <name><surname>Munster</surname><given-names>AM</given-names></name>, &#x00026; <name><surname>Costa</surname><given-names>PT</given-names><suffix>Jr.</suffix></name> (<year>2000</year>). <article-title>Personality predictors of injury-related posttraumatic stress disorder</article-title>. <source>The Journal of Nervous and Mental Disease</source>, <volume>188</volume>(<issue>8</issue>), <fpage>510</fpage>&#x02013;<lpage>517</lpage>.<pub-id pub-id-type="pmid">10972570</pub-id></mixed-citation></ref><ref id="R21"><mixed-citation publication-type="journal"><name><surname>Hamburg</surname><given-names>MA</given-names></name>, &#x00026; <name><surname>Collins</surname><given-names>FS</given-names></name> (<year>2010</year>). <article-title>The path to personalized medicine</article-title>. <source>New England Journal of Medicine</source>, <volume>363</volume>(<issue>4</issue>), <fpage>301</fpage>&#x02013;<lpage>304</lpage>.<pub-id pub-id-type="pmid">20551152</pub-id></mixed-citation></ref><ref id="R22"><mixed-citation publication-type="journal"><name><surname>Hammock</surname><given-names>AC</given-names></name>, <name><surname>Dreyer</surname><given-names>RE</given-names></name>, <name><surname>Riaz</surname><given-names>M</given-names></name>, <name><surname>Clouston</surname><given-names>SA</given-names></name>, <name><surname>McGlone</surname><given-names>A</given-names></name>, &#x00026; <name><surname>Luft</surname><given-names>B</given-names></name> (<year>2019</year>). <article-title>Trauma and relationship strain: Oral histories with World Trade Center disaster responders</article-title>. <source>Qualitative Health Research</source>, <volume>29</volume>(<issue>12</issue>), <fpage>1751</fpage>&#x02013;<lpage>1765</lpage>.<pub-id pub-id-type="pmid">30920915</pub-id></mixed-citation></ref><ref id="R23"><mixed-citation publication-type="journal"><name><surname>Hartley</surname><given-names>J</given-names></name>, <name><surname>Pennebaker</surname><given-names>JW</given-names></name>, &#x00026; <name><surname>Fox</surname><given-names>C</given-names></name> (<year>2003</year>). <article-title>Abstracts, introductions and discussions: How far do they differ in style?</article-title>. <source>Scientometrics</source>, <volume>57</volume>(<issue>3</issue>), <fpage>389</fpage>&#x02013;<lpage>398</lpage>.</mixed-citation></ref><ref id="R24"><mixed-citation publication-type="journal"><name><surname>Holtzman</surname><given-names>NS</given-names></name> (<year>2017</year>). <article-title>A meta-analysis of correlations between depression and first person singular pronoun use</article-title>. <source>Journal of Research in Personality</source>, <volume>68</volume>, <fpage>63</fpage>&#x02013;<lpage>68</lpage>.</mixed-citation></ref><ref id="R25"><mixed-citation publication-type="journal"><name><surname>Ingram</surname><given-names>RE</given-names></name> (<year>1990</year>). <article-title>Self-focused attention in clinical disorders: Review and a conceptual model</article-title>. <source>Psychological Bulletin</source>, <volume>107</volume>(<issue>2</issue>), <fpage>156</fpage>.<pub-id pub-id-type="pmid">2181521</pub-id></mixed-citation></ref><ref id="R26"><mixed-citation publication-type="journal"><name><surname>Jorm</surname><given-names>AF</given-names></name>, <name><surname>Christensen</surname><given-names>H</given-names></name>, <name><surname>Henderson</surname><given-names>AS</given-names></name>, <name><surname>Jacomb</surname><given-names>PA</given-names></name>, <name><surname>Korten</surname><given-names>AE</given-names></name>, &#x00026; <name><surname>Rodgers</surname><given-names>B</given-names></name> (<year>2000</year>). <article-title>Predicting anxiety and depression from personality: Is there a synergistic effect of neuroticism and extraversion?</article-title>. <source>Journal of Abnormal Psychology</source>, <volume>109</volume>(<issue>1</issue>), <fpage>145</fpage>.<pub-id pub-id-type="pmid">10740946</pub-id></mixed-citation></ref><ref id="R27"><mixed-citation publication-type="journal"><name><surname>Jylh&#x000e4;</surname><given-names>P</given-names></name>, &#x00026; <name><surname>Isomets&#x000e4;</surname><given-names>E</given-names></name> (<year>2006</year>). <article-title>The relationship of neuroticism and extraversion to symptoms of anxiety and depression in the general population</article-title>. <source>Depression and Anxiety</source>, <volume>23</volume>(<issue>5</issue>), <fpage>281</fpage>&#x02013;<lpage>289</lpage>.<pub-id pub-id-type="pmid">16688731</pub-id></mixed-citation></ref><ref id="R28"><mixed-citation publication-type="journal"><name><surname>Kern</surname><given-names>ML</given-names></name>, <name><surname>Park</surname><given-names>G</given-names></name>, <name><surname>Eichstaedt</surname><given-names>JC</given-names></name>, <name><surname>Schwartz</surname><given-names>HA</given-names></name>, <name><surname>Sap</surname><given-names>M</given-names></name>, <name><surname>Smith</surname><given-names>LK</given-names></name>, &#x00026; <name><surname>Ungar</surname><given-names>LH</given-names></name> (<year>2016</year>). <article-title>Gaining insights from social media language: Methodologies and challenges</article-title>. <source>Psychological Methods</source>, <volume>21</volume>(<issue>4</issue>), <fpage>507</fpage>.<pub-id pub-id-type="pmid">27505683</pub-id></mixed-citation></ref><ref id="R29"><mixed-citation publication-type="confproc"><name><surname>Khawaja</surname><given-names>MA</given-names></name>, <name><surname>Chen</surname><given-names>F</given-names></name>, &#x00026; <name><surname>Marcus</surname><given-names>N</given-names></name> (<year>2010</year>). <source>Using language complexity to measure cognitive load for adaptive interaction design</source>. In <conf-name>Proceedings of the 15th international conference on Intelligent user interfaces</conf-name>, pp. <fpage>333</fpage>&#x02013;<lpage>336</lpage>.</mixed-citation></ref><ref id="R30"><mixed-citation publication-type="journal"><name><surname>Lewis</surname><given-names>CE</given-names></name>, <name><surname>Farewell</surname><given-names>D</given-names></name>, <name><surname>Groves</surname><given-names>V</given-names></name>, <name><surname>Kitchiner</surname><given-names>NJ</given-names></name>, <name><surname>Roberts</surname><given-names>NP</given-names></name>, <name><surname>Vick</surname><given-names>T</given-names></name>, &#x00026; <name><surname>Bisson</surname><given-names>JI</given-names></name> (<year>2017</year>). <article-title>Internet-based guided self-help for posttraumatic stress disorder (PTSD): Randomized controlled trial</article-title>. <source>Depression and Anxiety</source>, <volume>34</volume> (<issue>6</issue>), <fpage>555</fpage>&#x02013;<lpage>565</lpage>.<pub-id pub-id-type="pmid">28557299</pub-id></mixed-citation></ref><ref id="R31"><mixed-citation publication-type="journal"><name><surname>Lewis</surname><given-names>ML</given-names></name>, &#x00026; <name><surname>Frank</surname><given-names>MC</given-names></name> (<year>2016</year>). <article-title>The length of words reflects their conceptual complexity</article-title>. <source>Cognition</source>, <volume>153</volume>, <fpage>182</fpage>&#x02013;<lpage>195</lpage>.<pub-id pub-id-type="pmid">27232162</pub-id></mixed-citation></ref><ref id="R32"><mixed-citation publication-type="journal"><name><surname>Luft</surname><given-names>B</given-names></name>, <name><surname>Schechter</surname><given-names>C</given-names></name>, <name><surname>Kotov</surname><given-names>R</given-names></name>, <name><surname>Broihier</surname><given-names>J</given-names></name>, <name><surname>Reissman</surname><given-names>D</given-names></name>, <name><surname>Guerrera</surname><given-names>K</given-names></name>, &#x02026; <name><surname>Bromet</surname><given-names>E</given-names></name> (<year>2012</year>). <article-title>Exposure, probable PTSD and lower respiratory illness among world trade center rescue, recovery and clean-up workers</article-title>. <source>Psychological Medicine</source>, <volume>42</volume>(<issue>5</issue>), <fpage>1069</fpage>&#x02013;<lpage>1079</lpage>.<pub-id pub-id-type="pmid">22459506</pub-id></mixed-citation></ref><ref id="R33"><mixed-citation publication-type="journal"><name><surname>Martin</surname><given-names>M</given-names></name> (<year>1985</year>). <article-title>Neuroticism as predisposition toward depression: A cognitive mechanism</article-title>. <source>Personality and Individual Differences</source>, <volume>6</volume>(<issue>3</issue>), <fpage>353</fpage>&#x02013;<lpage>365</lpage>.</mixed-citation></ref><ref id="R34"><mixed-citation publication-type="confproc"><name><surname>Matero</surname><given-names>M</given-names></name>, <name><surname>Idnani</surname><given-names>A</given-names></name>, <name><surname>Son</surname><given-names>Y</given-names></name>, <name><surname>Giorgi</surname><given-names>S</given-names></name>, <name><surname>Vu</surname><given-names>H</given-names></name>, <name><surname>Zamani</surname><given-names>M</given-names></name>, &#x02026; <name><surname>Schwartz</surname><given-names>HA</given-names></name> (<year>2019</year>). <source>Suicide risk assessment with multi-level dual-context language and bert</source>. In <conf-name>Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology</conf-name>, pp. <fpage>39</fpage>&#x02013;<lpage>44</lpage>.</mixed-citation></ref><ref id="R35"><mixed-citation publication-type="journal"><name><surname>McGowan</surname><given-names>S</given-names></name> (<year>2002</year>). <article-title>Mental representations in stressful situations: The calming and distressing effects of significant others</article-title>. <source>Journal of Experimental Social Psychology</source>, <volume>38</volume>(<issue>2</issue>), <fpage>152</fpage>&#x02013;<lpage>161</lpage>.</mixed-citation></ref><ref id="R36"><mixed-citation publication-type="journal"><name><surname>Michael</surname><given-names>T</given-names></name>, <name><surname>Halligan</surname><given-names>SL</given-names></name>, <name><surname>Clark</surname><given-names>DM</given-names></name>, &#x00026; <name><surname>Ehlers</surname><given-names>A</given-names></name> (<year>2007</year>). <article-title>Rumination in posttraumatic stress disorder</article-title>. <source>Depression and Anxiety</source>, <volume>24</volume>(<issue>5</issue>), <fpage>307</fpage>&#x02013;<lpage>317</lpage>.<pub-id pub-id-type="pmid">17041914</pub-id></mixed-citation></ref><ref id="R37"><mixed-citation publication-type="journal"><name><surname>Miragoli</surname><given-names>S</given-names></name>, <name><surname>Camisasca</surname><given-names>E</given-names></name>, &#x00026; <name><surname>Di Blasio</surname><given-names>P</given-names></name> (<year>2019</year>). <article-title>Investigating linguistic coherence relations in child sexual abuse: A comparison of PTSD and non-PTSD children</article-title>. <source>Heliyon</source>, <volume>5</volume>(<issue>2</issue>), <fpage>e01163</fpage>.<pub-id pub-id-type="pmid">30828653</pub-id></mixed-citation></ref><ref id="R38"><mixed-citation publication-type="journal"><name><surname>Neria</surname><given-names>Y</given-names></name>, <name><surname>Olfson</surname><given-names>M</given-names></name>, <name><surname>Gameroff</surname><given-names>MJ</given-names></name>, <name><surname>DiGrande</surname><given-names>L</given-names></name>, <name><surname>Wickramaratne</surname><given-names>P</given-names></name>, <name><surname>Gross</surname><given-names>R</given-names></name>, &#x02026; (<year>2010</year>). <article-title>Long-term course of probable PTSD after the 9/11 attacks: A study in urban primary care</article-title>. <source>Journal of Traumatic Stress</source>, <volume>23</volume>(<issue>4</issue>), <fpage>474</fpage>&#x02013;<lpage>482</lpage>.<pub-id pub-id-type="pmid">20690169</pub-id></mixed-citation></ref><ref id="R39"><mixed-citation publication-type="journal"><name><surname>Nixon</surname><given-names>RD</given-names></name>, <name><surname>Nehmy</surname><given-names>T</given-names></name>, &#x00026; <name><surname>Seymour</surname><given-names>M</given-names></name> (<year>2007</year>). <article-title>The effect of cognitive load and hyperarousal on negative intrusive memories</article-title>. <source>Behaviour Research and Therapy</source>, <volume>45</volume>(<issue>11</issue>), <fpage>2652</fpage>&#x02013;<lpage>2663</lpage>.<pub-id pub-id-type="pmid">17666185</pub-id></mixed-citation></ref><ref id="R40"><mixed-citation publication-type="journal"><name><surname>Papini</surname><given-names>S</given-names></name>, <name><surname>Yoon</surname><given-names>P</given-names></name>, <name><surname>Rubin</surname><given-names>M</given-names></name>, <name><surname>Lopez-Castro</surname><given-names>T</given-names></name>, &#x00026; <name><surname>Hien</surname><given-names>DA</given-names></name> (<year>2015</year>). <article-title>Linguistic characteristics in a non-trauma-related narrative task are associated with PTSD diagnosis and symptom severity</article-title>. <source>Psychological Trauma: Theory, Research, Practice, and Policy</source>, <volume>7</volume>(<issue>3</issue>), <fpage>295</fpage>.<pub-id pub-id-type="pmid">25961121</pub-id></mixed-citation></ref><ref id="R41"><mixed-citation publication-type="journal"><name><surname>Park</surname><given-names>G</given-names></name>, <name><surname>Schwartz</surname><given-names>HA</given-names></name>, <name><surname>Eichstaedt</surname><given-names>JC</given-names></name>, <name><surname>Kern</surname><given-names>ML</given-names></name>, <name><surname>Kosinski</surname><given-names>M</given-names></name>, <name><surname>Stillwell</surname><given-names>DJ</given-names></name>, &#x02026; <name><surname>Seligman</surname><given-names>ME</given-names></name> (<year>2015</year>). <article-title>Automatic personality assessment through social media language</article-title>. <source>Journal of Personality and Social Psychology</source>, <volume>108</volume>(<issue>6</issue>), <fpage>934</fpage>.<pub-id pub-id-type="pmid">25365036</pub-id></mixed-citation></ref><ref id="R42"><mixed-citation publication-type="journal"><name><surname>Pennebaker</surname><given-names>JW</given-names></name>, <name><surname>Boyd</surname><given-names>RL</given-names></name>, <name><surname>Jordan</surname><given-names>K</given-names></name>, &#x00026; <name><surname>Blackburn</surname><given-names>K</given-names></name> (<year>2015</year>). <article-title>The development and psychometric properties of LIWC 2015</article-title>. <source>Technical report</source>.</mixed-citation></ref><ref id="R43"><mixed-citation publication-type="journal"><name><surname>Pietrzak</surname><given-names>RH</given-names></name>, <name><surname>Feder</surname><given-names>A</given-names></name>, <name><surname>Singh</surname><given-names>R</given-names></name>, <name><surname>Schechter</surname><given-names>CB</given-names></name>, <name><surname>Bromet</surname><given-names>EJ</given-names></name>, <name><surname>Katz</surname><given-names>C</given-names></name>, &#x02026; (<year>2014</year>). <article-title>Trajectories of PTSD risk and resilience in World Trade Center responders: An 8-year prospective cohort study</article-title>. <source>Psychological Medicine</source>, <volume>44</volume>(<issue>1</issue>), <fpage>205</fpage>&#x02013;<lpage>219</lpage>.<pub-id pub-id-type="pmid">23551932</pub-id></mixed-citation></ref><ref id="R44"><mixed-citation publication-type="journal"><name><surname>Preotiuc-Pietro</surname><given-names>D</given-names></name>, <name><surname>Sap</surname><given-names>M</given-names></name>, <name><surname>Schwartz</surname><given-names>HA</given-names></name>, &#x00026; <name><surname>Ungar</surname><given-names>LH</given-names></name> (<year>2015</year>). <article-title>Mental illness detection at the world well-being project for the CLPsych 2015 Shared Task</article-title>. In <source>Proceedings of the second workshop on computational linguistics and clinical psychology</source>, pp. <fpage>40</fpage>&#x02013;<lpage>45</lpage>.</mixed-citation></ref><ref id="R45"><mixed-citation publication-type="journal"><name><surname>Ramirez-Esparza</surname><given-names>N</given-names></name>, <name><surname>Chung</surname><given-names>CK</given-names></name>, <name><surname>Kacewicz</surname><given-names>E</given-names></name>, &#x00026; <name><surname>Pennebaker</surname><given-names>JW</given-names></name> (<year>2008</year>). <article-title>The psychology of word use in depression forums in English and in Spanish: Texting two text analytic approaches</article-title>. In <source>ICWSM</source>.</mixed-citation></ref><ref id="R46"><mixed-citation publication-type="journal"><name><surname>Reece</surname><given-names>AG</given-names></name>, <name><surname>Reagan</surname><given-names>AJ</given-names></name>, <name><surname>Lix</surname><given-names>KL</given-names></name>, <name><surname>Dodds</surname><given-names>PS</given-names></name>, <name><surname>Danforth</surname><given-names>CM</given-names></name>, &#x00026; <name><surname>Langer</surname><given-names>EJ</given-names></name> (<year>2017</year>). <article-title>Forecasting the onset and course of mental illness with Twitter data</article-title>. <source>Scientific Reports</source>, <volume>7</volume>(<issue>1</issue>), <fpage>1</fpage>&#x02013;<lpage>11</lpage>.<pub-id pub-id-type="pmid">28127051</pub-id></mixed-citation></ref><ref id="R47"><mixed-citation publication-type="journal"><name><surname>Rude</surname><given-names>S</given-names></name>, <name><surname>Gortner</surname><given-names>E-M</given-names></name>, &#x00026; <name><surname>Pennebaker</surname><given-names>J</given-names></name> (<year>2004</year>). <article-title>Language use of depressed and depression-vulnerable college students</article-title>. <source>Cognition &#x00026; Emotion</source>, <volume>18</volume>(<issue>8</issue>), <fpage>1121</fpage>&#x02013;<lpage>1133</lpage>.</mixed-citation></ref><ref id="R48"><mixed-citation publication-type="confproc"><name><surname>Schwartz</surname><given-names>HA</given-names></name>, <name><surname>Eichstaedt</surname><given-names>JC</given-names></name>, <name><surname>Kern</surname><given-names>ML</given-names></name>, <name><surname>Dziurzynski</surname><given-names>L</given-names></name>, <name><surname>Lucas</surname><given-names>RE</given-names></name>, <name><surname>Agrawal</surname><given-names>M</given-names></name>, &#x02026; <name><surname>Ungar</surname><given-names>L</given-names></name> (<year>2013a</year>). <source>Characterizing geographic variation in well-being using tweets</source>. In <conf-name>Seventh International AAAI Conference on Weblogs and Social Media</conf-name>.</mixed-citation></ref><ref id="R49"><mixed-citation publication-type="journal"><name><surname>Schwartz</surname><given-names>HA</given-names></name>, <name><surname>Eichstaedt</surname><given-names>JC</given-names></name>, <name><surname>Kern</surname><given-names>ML</given-names></name>, <name><surname>Dziurzynski</surname><given-names>L</given-names></name>, <name><surname>Ramones</surname><given-names>SM</given-names></name>, <name><surname>Agrawal</surname><given-names>M</given-names></name>, &#x02026; (<year>2013b</year>). <article-title>Personality, gender, and age in the language of social media: The open-vocabulary approach</article-title>. <source>PLoS ONE</source>, <volume>8</volume>(<issue>9</issue>), <fpage>e73791</fpage>.<pub-id pub-id-type="pmid">24086296</pub-id></mixed-citation></ref><ref id="R50"><mixed-citation publication-type="journal"><name><surname>Schwartz</surname><given-names>HA</given-names></name>, <name><surname>Eichstaedt</surname><given-names>J</given-names></name>, <name><surname>Kern</surname><given-names>M</given-names></name>, <name><surname>Park</surname><given-names>G</given-names></name>, <name><surname>Sap</surname><given-names>M</given-names></name>, <name><surname>Stillwell</surname><given-names>D</given-names></name>, &#x02026; <name><surname>Ungar</surname><given-names>L</given-names></name> (<year>2014</year>). <article-title>Towards assessing changes in degree of depression through Facebook</article-title>. In <source>Proceedings of the workshop on computational linguistics and clinical psychology: from linguistic signal to clinical reality</source>, pp. <fpage>118</fpage>&#x02013;<lpage>125</lpage>.</mixed-citation></ref><ref id="R51"><mixed-citation publication-type="journal"><name><surname>Schwartz</surname><given-names>HA</given-names></name>, <name><surname>Giorgi</surname><given-names>S</given-names></name>, <name><surname>Sap</surname><given-names>M</given-names></name>, <name><surname>Crutchley</surname><given-names>P</given-names></name>, <name><surname>Ungar</surname><given-names>L</given-names></name>, &#x00026; <name><surname>Eichstaedt</surname><given-names>J</given-names></name> (<year>2017</year>). <article-title>Dlatk: Differential language analysis toolkit</article-title>. In <source>Proceedings of the 2017 conference on empirical methods in natural language processing: System demonstrations</source>, pp. <fpage>55</fpage>&#x02013;<lpage>60</lpage>.</mixed-citation></ref><ref id="R52"><mixed-citation publication-type="journal"><name><surname>Schwartz</surname><given-names>HA</given-names></name>, &#x00026; <name><surname>Ungar</surname><given-names>LH</given-names></name> (<year>2015</year>). <article-title>Data-driven content analysis of social media: A systematic overview of automated methods</article-title>. <source>The ANNALS of the American Academy of Political and Social Science</source>, <volume>659</volume>, <fpage>78</fpage>&#x02013;<lpage>94</lpage>.</mixed-citation></ref><ref id="R53"><mixed-citation publication-type="journal"><name><surname>Simmons</surname><given-names>RA</given-names></name>, <name><surname>Gordon</surname><given-names>PC</given-names></name>, &#x00026; <name><surname>Chambless</surname><given-names>DL</given-names></name> (<year>2005</year>). <article-title>Pronouns in marital interaction: What do &#x0201c;you&#x0201d; and &#x0201c;I&#x0201d; say about marital health?</article-title>. <source>Psychological Science</source>, <volume>16</volume>(<issue>12</issue>), <fpage>932</fpage>&#x02013;<lpage>936</lpage>.<pub-id pub-id-type="pmid">16313655</pub-id></mixed-citation></ref><ref id="R54"><mixed-citation publication-type="journal"><name><surname>Stander</surname><given-names>VA</given-names></name>, <name><surname>Thomsen</surname><given-names>CJ</given-names></name>, &#x00026; <name><surname>Highfill-McRoy</surname><given-names>RM</given-names></name> (<year>2014</year>). <article-title>Etiology of depression comorbidity in combat-related PTSD: A review of the literature</article-title>. <source>Clinical Psychology Review</source>, <volume>34</volume>(<issue>2</issue>), <fpage>87</fpage>&#x02013;<lpage>98</lpage>.<pub-id pub-id-type="pmid">24486520</pub-id></mixed-citation></ref><ref id="R55"><mixed-citation publication-type="journal"><name><surname>Tausczik</surname><given-names>YR</given-names></name>, &#x00026; <name><surname>Pennebaker</surname><given-names>JW</given-names></name> (<year>2010</year>). <article-title>The psychological meaning of words: Liwc and computerized text analysis methods</article-title>. <source>Journal of Language and Social Psychology</source>, <volume>29</volume>(<issue>1</issue>), <fpage>24</fpage>&#x02013;<lpage>54</lpage>.</mixed-citation></ref><ref id="R56"><mixed-citation publication-type="journal"><name><surname>Watkins</surname><given-names>ED</given-names></name>, &#x00026; <name><surname>Teasdale</surname><given-names>JD</given-names></name> (<year>2001</year>). <article-title>Rumination and overgeneral memory in depression: Effects of self-focus and analytic thinking</article-title>. <source>Journal of Abnormal Psychology</source>, <volume>110</volume>(<issue>2</issue>), <fpage>353</fpage>.<pub-id pub-id-type="pmid">11358029</pub-id></mixed-citation></ref><ref id="R57"><mixed-citation publication-type="journal"><name><surname>Wegner</surname><given-names>DM</given-names></name>, &#x00026; <name><surname>Giuliano</surname><given-names>T</given-names></name> (<year>1980</year>). <article-title>Arousal-induced attention to self</article-title>. <source>Journal of Personality and Social Psychology</source>, <volume>38</volume>(<issue>5</issue>), <fpage>719</fpage>.</mixed-citation></ref><ref id="R58"><mixed-citation publication-type="journal"><name><surname>Youyou</surname><given-names>W</given-names></name>, <name><surname>Kosinski</surname><given-names>M</given-names></name>, &#x00026; <name><surname>Stillwell</surname><given-names>D</given-names></name> (<year>2015</year>). <article-title>Computer-based personality judgments are more accurate than those made by humans</article-title>. <source>Proceedings of the National Academy of Sciences</source>, <volume>112</volume>(<issue>4</issue>), <fpage>1036</fpage>&#x02013;<lpage>1040</lpage>.</mixed-citation></ref><ref id="R59"><mixed-citation publication-type="confproc"><name><surname>Zirikly</surname><given-names>A</given-names></name>, <name><surname>Resnik</surname><given-names>P</given-names></name>, <name><surname>Uzuner</surname><given-names>O</given-names></name>, &#x00026; <name><surname>Hollingshead</surname><given-names>K</given-names></name> (<year>2019</year>). <source>CLPsych 2019 shared task: Predicting the degree of suicide risk in Reddit posts</source>. In <conf-name>Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology</conf-name>, pp. <fpage>24</fpage>&#x02013;<lpage>33</lpage>.</mixed-citation></ref></ref-list></back><floats-group><fig position="float" id="F1"><label>Fig. 1.</label><caption><p id="P47">Evaluation setup for trajectory prediction. According to <xref rid="FD3" ref-type="disp-formula">equation 3</xref>, we can then model the control-adjusted trajectory per user as <italic toggle="yes">B</italic><sub>1&#x02212;cntrl, <italic toggle="yes">i</italic></sub> = (<italic toggle="yes">&#x003b1;</italic><sub>0</sub> + <italic toggle="yes">&#x003b1;</italic><sub>1</sub><italic toggle="yes">x</italic><sub>1<italic toggle="yes">i</italic></sub> + <italic toggle="yes">&#x003b1;</italic><sub>2</sub><italic toggle="yes">x</italic><sub>2<italic toggle="yes">i</italic></sub> + &#x02026; + <italic toggle="yes">&#x003b1;</italic><sub>5</sub><italic toggle="yes">x</italic><sub>5<italic toggle="yes">i</italic></sub>). Then, we used the slope of the fitted line as the PCL trajectory of the corresponding subject. Our main outcome was correlations between this trajectory slope and the subject&#x02019;s language patterns. The figure illustrates our trajectory modeling; dots in the figure represent the PTSD scores at the health assessments after the oral history interview of a responder and the red line represents the PTSD future trajectory line which is correlated with his/her language assessment from the interview.</p></caption><graphic xlink:href="nihms-1729093-f0001" position="float"/></fig><fig position="float" id="F2"><label>Fig. 2.</label><caption><p id="P48">Average future PCL score trajectories of top (blue) and bottom (red) terciles of responders based on language-based assessments: word usages of first-person plurals (left), anxious language patterns (right), and average word lengths (bottom). All trajectories have been adjusted for interview (baseline) PCL scores, representing the residual after accounting for the expected trajectory at baseline. All differences are significant at <italic toggle="yes">p</italic> &#x0003c; 0.05 (see <xref rid="SD1" ref-type="supplementary-material">online Supplementary Table S1</xref> for further analysis).</p></caption><graphic xlink:href="nihms-1729093-f0002" position="float"/></fig><table-wrap position="float" id="T1" orientation="landscape"><label>Table 1.</label><caption><p id="P49">Data on subjects for health state correlation cross-sectional analysis and trajectory predictions</p></caption><table frame="hsides" rules="rows"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="top" rowspan="1" colspan="1"/><th align="center" valign="middle" rowspan="1" colspan="1">
<italic toggle="yes">N</italic>
</th><th align="center" valign="middle" rowspan="1" colspan="1">Female (%)</th><th align="center" valign="middle" rowspan="1" colspan="1">Police (%)</th><th align="center" valign="middle" rowspan="1" colspan="1">Mean age at the interview (<sc>s.d.</sc>)</th><th align="center" valign="middle" rowspan="1" colspan="1">Median number of words</th></tr></thead><tbody><tr><td align="left" valign="middle" rowspan="1" colspan="1">All participants</td><td align="right" valign="middle" rowspan="1" colspan="1">124</td><td align="right" valign="middle" rowspan="1" colspan="1">10</td><td align="center" valign="middle" rowspan="1" colspan="1">48</td><td align="center" valign="middle" rowspan="1" colspan="1">55.4 (9.8)</td><td align="right" valign="middle" rowspan="1" colspan="1">10 254</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">Meet inclusion criteria</td><td align="right" valign="middle" rowspan="1" colspan="1">75</td><td align="right" valign="middle" rowspan="1" colspan="1">8</td><td align="center" valign="middle" rowspan="1" colspan="1">49</td><td align="center" valign="middle" rowspan="1" colspan="1">53.4 (9.5)</td><td align="right" valign="middle" rowspan="1" colspan="1">9944</td></tr></tbody></table></table-wrap><table-wrap position="float" id="T2" orientation="landscape"><label>Table 2.</label><caption><p id="P50">Cross-sectional association between language-based assessments and PCL PTSD Score</p></caption><table frame="hsides" rules="rows"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th rowspan="2" align="left" valign="bottom" colspan="1">Interview language features</th><th colspan="2" align="center" valign="middle" rowspan="1">PTSD symptoms</th></tr><tr><th align="center" valign="middle" rowspan="1" colspan="1"><italic toggle="yes">r</italic> (direct correlation with symptom score)</th><th align="center" valign="middle" rowspan="1" colspan="1"><italic toggle="yes">&#x003b2;</italic> (adjusted for age, gender, occupation, days since 9-11)</th></tr></thead><tbody><tr><td align="left" valign="middle" rowspan="1" colspan="1">Psychological traits</td><td align="center" valign="middle" rowspan="1" colspan="1"/><td align="center" valign="middle" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Anxiety</td><td align="center" valign="middle" style="background-color:#A6D38C" rowspan="1" colspan="1">0.26 (0.03&#x02013;0.46)</td><td align="center" valign="middle" style="background-color:#BBDDAA" rowspan="1" colspan="1">0.20 (&#x02212;0.03 to 0.41)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Depression</td><td align="center" valign="middle" style="background-color:#8CC755" rowspan="1" colspan="1">0.38<xref rid="TFN2" ref-type="table-fn">*</xref> (0.16&#x02013;0.56)</td><td align="center" valign="middle" style="background-color:#99CC73" rowspan="1" colspan="1">0.32<xref rid="TFN2" ref-type="table-fn">*</xref> (0.10&#x02013;0.51)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Neuroticism</td><td align="center" valign="middle" style="background-color:#99CC73" rowspan="1" colspan="1">0.32<xref rid="TFN2" ref-type="table-fn">*</xref> (0.10&#x02013;0.51)</td><td align="center" valign="middle" style="background-color:#A6D38C" rowspan="1" colspan="1">0.26 (0.04&#x02013;0.46)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Extraversion</td><td align="center" valign="middle" style="background-color:#FCD9D9" rowspan="1" colspan="1">&#x02212;0.10 (&#x02212;0.32 to 0.13)</td><td align="center" valign="middle" style="background-color:#F9C1C3" rowspan="1" colspan="1">&#x02212;0.15 (&#x02212;0.37 to 0.08)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">Linguistic style</td><td align="center" valign="middle" rowspan="1" colspan="1"/><td align="center" valign="middle" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;First-person singular</td><td align="center" valign="middle" style="background-color:#9BCD78" rowspan="1" colspan="1">0.31<xref rid="TFN2" ref-type="table-fn">*</xref> (0.09&#x02013;0.50)</td><td align="center" valign="middle" style="background-color:#99CC73" rowspan="1" colspan="1">0.31<xref rid="TFN2" ref-type="table-fn">*</xref> (0.09&#x02013;0.51)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;First-person plural</td><td align="center" valign="middle" style="background-color:#FEEBEB" rowspan="1" colspan="1">&#x02212;0.05 (&#x02212;0.28 to 0.18)</td><td align="center" valign="middle" style="background-color:#FCDCDD" rowspan="1" colspan="1">&#x02212;0.10 (&#x02212;0.32 to 0.13)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Articles</td><td align="center" valign="middle" style="background-color:#FCD9D9" rowspan="1" colspan="1">&#x02212;0.09 (&#x02212;0.31 to 0.14)</td><td align="center" valign="middle" style="background-color:#FEF0F0" rowspan="1" colspan="1">&#x02212;0.06 (&#x02212;0.28 to 0.17)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;AVG word length</td><td align="center" valign="middle" style="background-color:#EDF6EA" rowspan="1" colspan="1">0.05 (&#x02212;0.18 to 0.27)</td><td align="center" valign="middle" style="background-color:#F4F9F1" rowspan="1" colspan="1">0.03 (&#x02212;0.20 to 0.25)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Word count</td><td align="center" valign="middle" style="background-color:#99CC73" rowspan="1" colspan="1">0.22 (&#x02212;0.01 to 0.43)</td><td align="center" valign="middle" style="background-color:#C5E1B7" rowspan="1" colspan="1">0.20 (&#x02212;0.02 to 0.41)</td></tr></tbody></table><table-wrap-foot><fn id="TFN1"><p id="P51">Associations are from ordinary least squares over standardized independent variable &#x02013; the language-based assessment and the standardized dependent variable &#x02013; PTSD Checklist scores (PCL scores). Without controls is equivalent to Pearson Product-Moment Correlation (<italic toggle="yes">N</italic> = 75). Square brackets indicate 95% confidence intervals. Controls included as covariates (right column) included age, gender, occupation, days between 9/11/01 and interview date.</p></fn><fn id="TFN2"><label>*</label><p id="P52">Indicates significant correlations (multi-test, Benjamini&#x02013;Hochburg adjusted <italic toggle="yes">p</italic> &#x0003c; 0.050). Each row is color-coded separately, from red (negative correlations) to green (positive correlations); greyed values indicate non-significant.</p></fn></table-wrap-foot></table-wrap><table-wrap position="float" id="T3" orientation="landscape"><label>Table 3.</label><caption><p id="P53">Predicting PCL trajectories of the responders using language-based assessments</p></caption><table frame="hsides" rules="rows"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th rowspan="2" align="left" valign="bottom" colspan="1">Interview language features</th><th colspan="2" align="center" valign="bottom" rowspan="1">PTSD symptoms future trajectories</th></tr><tr><th align="center" valign="bottom" rowspan="1" colspan="1"><italic toggle="yes">r</italic> (direct correlation with symptom slope)</th><th align="center" valign="bottom" rowspan="1" colspan="1"><italic toggle="yes">&#x003b2;</italic> (adjusted for age, gender, occupation,<break/> days since 9&#x02013;11, Interview PCL)</th></tr></thead><tbody><tr><td align="left" valign="middle" rowspan="1" colspan="1">Psychological traits</td><td align="center" valign="middle" rowspan="1" colspan="1"/><td align="center" valign="middle" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Anxiety</td><td align="center" valign="middle" style="background-color:#BBDDA8" rowspan="1" colspan="1">0.16 (&#x02212;0.07 to 0.37)</td><td align="center" valign="middle" style="background-color:#99CC70" rowspan="1" colspan="1">0.30* (0.08&#x02013;0.49)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Depression</td><td align="center" valign="middle" rowspan="1" colspan="1">&#x02212;0.00 (&#x02212;0.23 to 0.22)</td><td align="center" valign="middle" style="background-color:#C5E1B7" rowspan="1" colspan="1">0.16 (&#x02212;0.07 to 0.37)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Neuroticism</td><td align="center" valign="middle" rowspan="1" colspan="1">0.07 (0.29 to &#x02212;0.16)</td><td align="center" valign="middle" style="background-color:#B2D89A" rowspan="1" colspan="1">0.20 (&#x02212;0.03 to 0.40)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Extraversion</td><td align="center" valign="middle" style="background-color:#C1DFB3" rowspan="1" colspan="1">0.17 (&#x02212;0.06 to 0.38)</td><td align="center" valign="middle" style="background-color:#BFDFAD" rowspan="1" colspan="1">0.18 (&#x02212;0.05 to 0.39)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">Linguistic style</td><td align="center" valign="middle" rowspan="1" colspan="1"/><td align="center" valign="middle" rowspan="1" colspan="1"/></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;First-person singular</td><td align="center" valign="middle" rowspan="1" colspan="1">0.00 (&#x02212;0.23 to 0.23)</td><td align="center" valign="middle" style="background-color:#DAECD2" rowspan="1" colspan="1">0.13 (&#x02212;0.10 to 0.35)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;First-person plural</td><td align="center" valign="middle" style="background-color:#F27172" rowspan="1" colspan="1">&#x02212;0.36* (&#x02212;0.54 to &#x02212;0.14)</td><td align="center" valign="middle" style="background-color:#F26B6C" rowspan="1" colspan="1">&#x02212;0.36* (&#x02212;0.54 to &#x02212;0.15)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Articles</td><td align="center" valign="middle" style="background-color:#FBD0D0" rowspan="1" colspan="1">&#x02212;0.16 (&#x02212;0.37 to 0.07)</td><td align="center" valign="middle" style="background-color:#F69C9E" rowspan="1" colspan="1">&#x02212;0.23 (&#x02212;0.43 to 0.00)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;AVG word length</td><td align="center" valign="middle" style="background-color:#F27071" rowspan="1" colspan="1">&#x02212;0.36* (&#x02212;0.54 to &#x02212;0.14)</td><td align="center" valign="middle" style="background-color:#F27071" rowspan="1" colspan="1">&#x02212;0.35* (&#x02212;0.53 to &#x02212;0.13)</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">&#x02003;Word count</td><td align="center" valign="middle" rowspan="1" colspan="1">0.06 (&#x02212;0.17 to 0.28)</td><td align="center" valign="middle" style="background-color:#BBDDA8" rowspan="1" colspan="1">0.14 (&#x02212;0.09 to 0.36)</td></tr></tbody></table><table-wrap-foot><fn id="TFN3"><p id="P54">Associations are from ordinary least squares over standardized independent variable &#x02013; the language-based assessment and the standardized dependent variable &#x02013; PCL future trajectory. Without controls is equivalent to Pearson Product-Moment Correlation (<italic toggle="yes">N</italic> = 75) with controls: age, gender, occupation, days between 9/11/01 and interview date, and interview PCL score.</p></fn></table-wrap-foot></table-wrap></floats-group></article>