Detection of Temporal Shifts in Semantics Using Local Graph Clustering
Abstract
1. Introduction
1.1. Literature Review
1.2. Contributions
 An unsupervised algorithm based on local graph clustering to automatically characterize and detect shifts in term semantics:
 (a) Clusters are built incrementally, starting from the target term as the center of locality and adding contextual words that meet the user-defined thresholds for informativeness and the word embedding dimension;
 (b) Its time complexity is constant, where the constant is the user-defined description length for the target term, and hence the algorithm is scalable in the size of the corpus;
 (c) The resulting “soft clusters” allow clusters of different terms to overlap to varying degrees.
 A novel empirical analysis of the semantics of the term “Chinavirus”:
 (a) Along the time dimension, the term took on significantly more negative sentiment, albeit temporarily, soon after its use by the White House in March 2020;
 (b) Compared to the control term “Coronavirus”, the semantics of “Chinavirus” diverged significantly in March 2020.
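As a minimal illustration of the “soft cluster” idea in the first contribution, the overlap between the clusters of two terms can be quantified with a set-similarity score. The sketch below uses the Jaccard index, which is an assumption made here for illustration and not necessarily the measure used in this paper. The function name `jaccard_overlap` and the cluster contents are hypothetical toy values, not results.

```python
def jaccard_overlap(cluster_a, cluster_b):
    """Return |A ∩ B| / |A ∪ B| for two clusters given as iterables of words."""
    a, b = set(cluster_a), set(cluster_b)
    if not a and not b:
        # Convention chosen for this sketch: two empty clusters do not overlap.
        return 0.0
    return len(a & b) / len(a | b)


# Hypothetical toy clusters (illustrative only, not taken from the paper).
cluster_term_1 = {"virus", "outbreak", "blame", "travel"}
cluster_term_2 = {"virus", "outbreak", "vaccine", "symptoms", "hospital"}

overlap = jaccard_overlap(cluster_term_1, cluster_term_2)
print(f"Jaccard overlap: {overlap:.3f}")
```

A score of 0 means the two clusters are disjoint and a score of 1 means they coincide, so tracking this score per time period is one simple way to see whether two terms drift apart.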
2. Materials and Methods
2.1. Notation and Terminology
2.2. Local Graph Clustering Algorithm in [52]
2.3. Adapting the Local Graph Clustering Algorithm for Semantic Analysis
Algorithm 1 GenerateSamples [52] 
Input: Target term ${v}_{0}$, nonzero integers $T,B,\kappa $, target conductance $\mathsf{\Phi}\in [0,1]$.
Output: A set ${\mathbf{S}}_{\tau}$ from the volume-biased evolving set process (ESP), with the stopping time $\tau $ depending on the input parameters, $\tau =\tau (T,B,\mathsf{\Phi},\kappa )$.
Internal State: A set ${\mathbf{S}}_{\tau}$ from the volume-biased ESP with $\tau =\tau (T,B,\mathsf{\Phi},\kappa )$; the current location ${v}_{t}$ of the random walk; $\partial \left({\mathbf{S}}_{t}\right)$, $\mathrm{vol}\left({\mathbf{S}}_{t}\right)$, and $\mathrm{cost}({\mathbf{S}}_{0},\dots ,{\mathbf{S}}_{t})$ for the current set ${\mathbf{S}}_{t}$.
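The internal state of Algorithm 1 tracks the boundary $\partial(\mathbf{S}_t)$ and the volume $\mathrm{vol}(\mathbf{S}_t)$ of the current set, from which its conductance follows. The sketch below computes these quantities under assumptions that are not stated in the text above: the graph is simple, undirected, and unweighted, it is stored as a dictionary of neighbor sets, and conductance uses the common definition $|\partial(\mathbf{S})| / \min(\mathrm{vol}(\mathbf{S}), \mathrm{vol}(V \setminus \mathbf{S}))$. All function names and the toy graph are hypothetical.

```python
def volume(graph, nodes):
    """Sum of the degrees of the vertices in `nodes`."""
    return sum(len(graph[v]) for v in nodes)


def boundary_size(graph, nodes):
    """Number of edges with exactly one endpoint in `nodes`."""
    nodes = set(nodes)
    return sum(1 for v in nodes for u in graph[v] if u not in nodes)


def conductance(graph, nodes):
    """Boundary size divided by the smaller of the two side volumes."""
    nodes = set(nodes)
    complement = set(graph) - nodes
    denom = min(volume(graph, nodes), volume(graph, complement))
    if denom == 0:
        # Convention chosen for this sketch: a degenerate cut gets conductance 1.
        return 1.0
    return boundary_size(graph, nodes) / denom


# Hypothetical toy graph: two triangles joined by the single edge c-d.
graph = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b", "d"},
    "d": {"c", "e", "f"},
    "e": {"d", "f"},
    "f": {"d", "e"},
}

cluster = {"a", "b", "c"}
print(volume(graph, cluster), boundary_size(graph, cluster), conductance(graph, cluster))
```

A low conductance indicates that the set is well separated from the rest of the graph, which is why a target conductance appears among the inputs.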

Algorithm 2 EvoPar($v,k,\varphi ,\epsilon$) [52] 
Input: Target term ${v}_{0}$, target volume $k$, target conductance $\varphi \in [0,1]$, a constant $\epsilon\in (0,1)$.
Output: A set $\mathbf{S}$ of vertices.
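To make the set-growing mechanism behind these algorithms more concrete, the sketch below performs one transition of a plain evolving set process on a lazy random walk: a threshold $U$ is drawn uniformly and the next set is $\{y : p(y,\mathbf{S}) \ge U\}$, where $p(y,\mathbf{S})$ is the probability that the walk moves from $y$ into $\mathbf{S}$. This is a simplified illustration, not the volume-biased variant with the stopping rule used in [52]; the unweighted graph representation, the function names `prob_into_set` and `esp_step`, and the toy graph are all assumptions made here.

```python
import random


def prob_into_set(graph, y, current):
    """Lazy-walk probability p(y, S): stay put with 1/2, else move to a uniform neighbor."""
    neighbors = graph[y]
    if not neighbors:
        # An isolated vertex never moves.
        return 1.0 if y in current else 0.0
    stay = 0.5 if y in current else 0.0
    move = 0.5 * sum(1 for u in neighbors if u in current) / len(neighbors)
    return stay + move


def esp_step(graph, current, threshold=None, rng=random):
    """One step of the plain ESP. `threshold` must lie in (0, 1] if given."""
    current = set(current)
    # 1 - random() lies in (0, 1], which avoids a zero threshold.
    u = (1.0 - rng.random()) if threshold is None else threshold
    # Only vertices in S or adjacent to S can have p(y, S) > 0.
    candidates = set(current)
    for v in current:
        candidates.update(graph[v])
    return {y for y in candidates if prob_into_set(graph, y, current) >= u}


# Hypothetical toy graph: two triangles joined by the single edge c-d.
graph = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b", "d"},
    "d": {"c", "e", "f"},
    "e": {"d", "f"},
    "f": {"d", "e"},
}

grown = esp_step(graph, {"c"}, threshold=0.2)
print(sorted(grown))
```

Passing an explicit `threshold` makes a step reproducible for inspection. A small threshold lets the set grow into the neighborhood of the target vertex, while a large one can shrink it, possibly to the empty set.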

3. Datasets
4. Results
5. Discussion
References
“Chinavirus”  “Coronavirus”  

Chinavirus  Chinacovid  Coronavirus  Coronaviruses 
Chinaviruses  Chinesecovid  Covidvirus  Covidviruses 
Chineseviruses  Chinesevirus  Caronavirus  Caronaviruses 
Chinacorona  Wuhanviruses  Viruscorona  Viruscovid 
Wuhancovid  Wuhancorona  Coronaflu  Coronoviruses 
CCPVirus  CCPCoronavirus  Coronacovid  Covidcorona 
Wuhanchinavirus  Coronaoutbreak  Coronovirus  
Wuhanchinaviruses  
Chinawuhanvirus  
Chinesecoronavirus  
Chinesecoronaviruses  
ChineseCommunistPartyvirus  
ChineseCommunistPartyviruses 
Period ^{a}  Words ^{b}  Vocabulary ^{b}  Tweets ^{b}  AuthorIDs ^{b} 

Jan2H  123,874  10,985  10,462  8,881 
Feb1H  129,545  12,286  10,748  7,190 
Feb2H  363,394  20,113  17,639  10,927 
Mar1H  2,261,059  52,724  176,386  93,339 
Mar2H  2,745,589  65,138  215,908  99,069 
Apr1H  1,322,025  45,444  99,874  47,457 
Apr2H  866,074  37,137  65,139  31,404 
May1H  593,134  30,621  43,706  21,972 
May2H  432,832  25,620  32,425  17,494 
restaurant  livestock  divulge  hotpot  wheel  floor 
remainder  initiative  Chinese  gaslit  storm  fight 
designation  sacrifice  married  undone  funny  front 
inevitable  historical  forward  global  sadly  army 
sabotaging  possible  quicker  insult  guess  sight 
investment  together  strategy  faster  slump  drug 
supervisor  ingenious  suspend  spiral  unite  alive 
overreach  quarantine  cronies  apologizes  exile  scare 
morality  panicked  kissing  moronovirus  abject  stud 
mourner  sacrifice  laughts  accusation  goofy  dog 
antiviral  baselessly  urine  compassion  poked  hoax 
pollution  espionage  fucked  overwhelmed  clout  loser 
assassin  dripping  evils  prohibition  risking  alien 
exposure  inability  outrage  unbecoming  stroke  secrets 
derailed  enflames  destroy  denounce  namaste  nutjob 
tearful  dumbkirk  debunk  discharges  cooking  chased 
butthead  robbing  selfish  dangerous  diarrhea  cheat 
debunk  despair  huawei  heartattack  ill  protest 
scream  gorillas  thrive  concussion  mystery  alarmed 
repellant  distance  chaotic  marauding  follies  bedbug 
migraine  prosecute  purifier  exterminate  illicit  racism 
explodes  pangolin  lizard  reelection  fuck  kloots 
cancellation  robitussin  interfere  greenlit  bravely  nuk 
disruptions  humiliates  overcome  prophecy  disobey  ease 
animalistic  equanimity  pharmacy  hosepipe  unmask  sigh 
moisturized  heatstroke  decouple  cheapgas  service  fiery 
envelopment  propagate  guillotine  confront  dirty  gosh 
untouchable  retraction  crackhead  helpless  pumped  goofy 
overeacting  perpective  concerned  funeral  punitive  piss 
perpetuates  inadequate  scramble  ecstatic  midget  cost 
breathmints  planetizen  deadlier  colonial  faceass  fuel 
precipitate  unfriended  populism  educate 
hospitable  collapsed  crackpot  pumped  stranger  nuk 
celebration  blockhead  feckless  pissing  breach  ease 
overshadow  possessive  regarded  sodexo  nutcase  bane 
excellence  expensive  rampage  stabbed  reckless  dirty 
deprivation  disappear  bungling  unstable  ghetto  revive 
panoramic  apologize  evacuate  cringed  fucking  goof 
fucknugettes  strangled  punitive  squeak  cleanly  ruined 
martyrdom  virulence  prevent  flagging  midget  dirt 
determined  reputable  kickback  badass  faceass  sigh 
contender  downside  delusional  babble 
