Mapping Languages and Demographics with Georeferenced Corpora

Type: Preprint

Publication Date: 2020-01-01

Citations: 9

DOI: https://doi.org/10.48550/arxiv.2004.00809

View

Locations

  • arXiv (Cornell University) - View - PDF
  • UC Research Repository (University of Canterbury) - View - PDF
  • DataCite API - View

Similar Works

Action Title Year Authors
+ PDF Chat Pre-Trained Language Models Represent Some Geographic Populations Better Than Others 2024 Jonathan Dunn
Benjamin Adams
Harish Tayyar Madabushi
+ The importance of geographic and demographic data from census for locating and mapping vulnerable populations 2020 Maureen Jones
Ellie Aleda Moeller
John G. Meara
Sabrina Juran
+ PDF Chat The Twitter of Babel: Mapping World Languages through Microblogging Platforms 2013 Delia Mocanu
Andrea Baronchelli
Nicola Perra
Bruno Gonçalves
Qian Zhang
Alessandro Vespignani
+ Comparing Measures of Linguistic Diversity Across Social Media Language Data and Census Data at Subnational Geographic Areas 2023 Sidney Wong
Jonathan Dunn
Benjamin Adams
+ A study of the use of linked routinely collected administrative data at the local level to count and profile populations 2017 Gill Harper
+ PDF Chat Race, religion and the city: twitter word frequency patterns reveal dominant demographic dimensions in the United States 2016 Eszter Bokányi
Dániel Kondor
László Dobos
Tamás Sebők
József Stéger
István Csabai
Gábor Vattay
+ GeoLLM: Extracting Geospatial Knowledge from Large Language Models 2023 Rohin Manvi
Samar Khanna
Gengchen Mai
Marshall Burke
David B. Lobell
Stefano Ermon
+ Census-Independent Population Estimation using Representation Learning 2021 Isaac Neal
Sohan Seth
Gary R. Watmough
Mamadou S. Diallo
+ Census Data Resources 2013 Michael F. J. McDonnell
+ PDF Chat Demographic Inference and Representative Population Estimates from Multilingual Social Media Data 2019 Zijian Wang
Scott A. Hale
David Ifeoluwa Adelani
Przemyslaw A. Grabowicz
Timo Hartman
Fabian Flöck
David Jurgens
+ PDF Chat Census-independent population estimation using representation learning 2022 Isaac Neal
Sohan Seth
Gary R. Watmough
Mamadou S. Diallo
+ PDF Chat Confounds and Consequences in Geotagged Twitter Data 2015 Umashanthi Pavalanathan
Jacob Eisenstein
+ PDF Chat Geographically-Informed Language Identification 2024 Jonathan Dunn
Lane Edwards-Brown
+ Confounds and Consequences in Geotagged Twitter Data 2015 Umashanthi Pavalanathan
Jacob Eisenstein
+ Confounds and Consequences in Geotagged Twitter Data 2015 Umashanthi Pavalanathan
Jacob Eisenstein
+ PDF Chat Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale 2023 Marta R. Costa‐jussà
Pierre Andrews
Eric E. Smith
Prangthip Hansanti
Christophe Ropers
Elahe Kalbassi
Cynthia Gao
Daniel J. Licht
Carleigh Wood
+ PDF Chat GeoSEE: Regional Socio-Economic Estimation With a Large Language Model 2024 Sungwon Han
Dong‐Hyun Ahn
Seungeon Lee
Minhyuk Song
Sungwon Park
Sangyoon Park
Jihee Kim
Meeyoung Cha
+ Some Languages are More Equal than Others: Probing Deeper into the Linguistic Disparity in the NLP World 2022 Surangika Ranathunga
Nisansa de Silva
+ PDF Chat Large Language Models are Geographically Biased 2024 Rohin Manvi
Samar Khanna
Marshall Burke
David B. Lobell
Stefano Ermon
+ Fine-grained Population Mapping from Coarse Census Counts and Open Geodata 2022 Nando Metzger
John E. Vargas-Muñoz
Rodrigo Caye Daudt
Benjamin Kellenberger
Thao Ton-That Whelan
Ferda Ofli
Muhammad Ali Imran
Konrad Schindler
Devis Tuia