<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Text Mining | Stefano Blando</title><link>https://stefano-blando.github.io/en/tags/text-mining/</link><atom:link href="https://stefano-blando.github.io/en/tags/text-mining/index.xml" rel="self" type="application/rss+xml"/><description>Text Mining</description><generator>HugoBlox Kit (https://hugoblox.com)</generator><language>en-US</language><lastBuildDate>Sat, 10 Jan 2026 00:00:00 +0000</lastBuildDate><image><url>https://stefano-blando.github.io/media/icon_hu_8d0dee6c10a3c598.png</url><title>Text Mining</title><link>https://stefano-blando.github.io/en/tags/text-mining/</link></image><item><title>NLP &amp; Semantic Network Analysis</title><link>https://stefano-blando.github.io/en/projects/nlp-semantic-network-analysis/</link><pubDate>Sat, 10 Jan 2026 00:00:00 +0000</pubDate><guid>https://stefano-blando.github.io/en/projects/nlp-semantic-network-analysis/</guid><description>&lt;p&gt;This project is the technical implementation behind the publication &lt;strong&gt;“A Multi-Method Validation Framework for Large-Scale Multilingual Text Analytics”&lt;/strong&gt; (JADT 2026, in review). It operationalizes the full analytical workflow used in the paper, from data preparation to cross-method validation and result comparison.&lt;/p&gt;
&lt;p&gt;The pipeline combines &lt;strong&gt;R and Python&lt;/strong&gt; modules over a large multilingual review corpus, including: preprocessing and TF-IDF, &lt;strong&gt;LDA topic modeling&lt;/strong&gt;, &lt;strong&gt;LSA and Correspondence Analysis&lt;/strong&gt;, lexicon- and model-based sentiment analysis, clustering, and &lt;strong&gt;co-occurrence network analysis&lt;/strong&gt;. The repository also includes cross-platform validation scripts to compare method outputs and check structural stability across implementations.&lt;/p&gt;
&lt;p&gt;The central objective is methodological robustness: verifying which findings remain consistent when methods, model families, and language-specific components vary. In this sense, the project is not a generic NLP demo, but a reproducible research pipeline designed for quantitative validation of text-analytic conclusions.&lt;/p&gt;</description></item></channel></rss>