Cyberspace of Shujun LI


Shujun's Publications

e-Data and Data Analytics Services: Zenodo Figshare Dryad DataCite Wolfram Data Repository The GDELT Project: A Global Database of Society Global Open Data Index (Open Knowledge) Sunlight Foundation European Union Open Data Portal JRC (Joint Research Centre) Data Catalogue UK Data Service Police Foundation’s Public Safety Open Data Portal mldata (machine learning data set repository) MLcomp datasets UCI Machine Learning Repository Kaggle KNIME emotion icon 数据堂 emotion icon DataSift DataGenetics Informatica Corporation Splunk Inc. Tableau Software Social Media Analysis Toolkit (SMAT) (GitLab) Scrintal Trint
Personal Data Management Platforms: MyData Global Solid HAT (Hub-of-All-Things) (HATDeX - HAT Data Exchange Ltd, HAT Community Foundation (HCF), HAT Accelerator, Documentation for Developers) DataBox Project Aircloak openPDS/SafeAnswers: Personal Data with Privacy

False Information

Organizations, Tools and Data: Truth Decay @ RAND (Fighting Disinformation Online: A Database of Web Tools) NewsGuard misinformation datasets @ FakeNewsTracker Google Fact Check (Google Fact Check Tools API, Google Fact Check Explorer, Google Fact Check Markup Tool) SMAT: The Social Media Analysis Toolkit Content Authenticity Initiative (CAI) Media Manipulation Casebook Fact-checking organizations with FACTS-NFT 台灣事實查核中心 (Taiwan FactCheck Center) Lead Stories emotion icon EU DisinfoLab Full Fact First Draf MisinfoCon Credibility Coalition Fake News Challenge (FNC) (Stance Detection dataset for FNC-1) Poynter Institute (International Fact-Checking Network - IFCN, IFCN Code of Principles; #CoronaVirusFacts Alliance, CoronaVirusFacts/DatosCoronaVirus Alliance Database) Fairness & Accuracy In Reporting (FAIR) Coalition for Content Provenance and Authenticity (C2PA) BBC Disinformation Watch BBC Reality Check FactCheck @ Channel 4 News Fact check @ The Ferret The Reporters' Lab (Fact-Checking, The Duke Tech & Check Cooperative, ClaimReview) Truth or Fiction Check Your Fact FactsCan AFP Fact Check Africa Check Bad News game Go Viral! emotion icon Snopes PolitiFact Global Disinformation Index (GDI) Gossip Cop Fact Checker @ The Washington Post Hoax-Slayer emotion icon Factmata Arkose Labs (Fake Reviews, Fake Users) Lie Detectors Fakespot fake-resume-generator

Multimedia False Information: This Person Does Not Exist Which Face is Real? Virtual Humans FaceForensics Benchmark Partnership on AI's AI and Media Integrity Steering Committee (Deepfake Detection Challenge = DFDC) CoMoFoD - Image Database for Copy-Move Forgery Detection Copy-Move Forgery Database with Similar but Genuine Objects (COVERAGE) Truthmark GANDCTAnalysis
MKLab-ITI's image-verification-corpus
Other Research Related: WeVerify GATE (General Architecture for Text Engineering) PHEME project PAN (a series of scientific events and shared tasks on digital text forensics and stylometry) emotion icon CLEF2020 CheckThat! Lab (Enabling Automatic Identification and Verification of Claims in Social Media) CLEF2019 CheckThat! Lab CLEF2018 CheckThat! Lab ClaimBuster: Automated Live Fact-checking (ClaimPortal, ICWSM 2020 dataset) ClaimsKG Claim Extraction for Scientific Publications Too Many Claims to Fact-Check: Prioritizing Political Claims Based on Check-Worthiness (MAISoN'2020 @ CIKM'2020) emotion icon OSoMe (Observatory on Social Media) @ Network Science Institute (IUNI), Center for Complex Networks and Systems Research (CNetS), Indiana University (Tools and Datasets: HOAXY, Fakey, Botometer, BotSlayer, EchoDemo; Meme Trend, Meme Network, Meme Maps, Meme Movie) emotion icon Graph-based Fraud Detection Papers and Resources VoterFraud2020 (@ GitHub, @ Fighshare) FakeNewsNet Dichotomies of Disinformation The COVID-19 Infodemic: Can the Crowd Judge Recent Misinformation Objectively? (SIGIR 2020 + ECIR 2020 + CIKM 2020) ReCOVery: A Multimodal Repository for COVID-19 News Credibility Research (CIKM 2020) CHECKED: Chinese COVID-19 Fake News Dataset (2020) Factuality and Bias Prediction of News Media (ACL 2020 + EMNLP 2018) FakeHealth repository (ICWSM 2020) FiveThirtyEight's dataset of 3 million Russian troll tweets Raiders of the Lost Kek: 3.5 Years of Augmented 4chan Posts from the Politically Incorrect Board (ICWSM 2020) Learning from Fact-checkers (SIGIR 2019) The Rise of Guardians (SIGIR 2018) LIAR-PLUS fake news databse (FEVER 2018) LIAR fake news databse (ACL 2017) CREDBANK-data (ICWSM 2015)

Information Visualization

Tools: Transparency Vis


Tools: GetOldTweets-java (GetOldTweets3)
Data: COVID-19 @ Aminer (COVID-19 Open Datasets, dashboard)

Valid XHTML 1.0 Transitional


Germany (CET)