Journal article 34 views 6 downloads
A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features
Mathematics, Volume: 14, Issue: 2, Start page: 360
Swansea University Authors:
Hassan Eshkiki , Fabio Caraffini
-
PDF | Version of Record
© 2026 by the authors. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Download (15.79MB)
DOI (Published version): 10.3390/math14020360
Abstract
Detecting fake news is essential in natural language processing to verify news authenticity and prevent misinformation-driven social, political, and economic disruptions targeting specific groups. A major challenge in multimodal fake news detection is effectively integrating textual and visual modal...
| Published in: | Mathematics |
|---|---|
| ISSN: | 2227-7390 |
| Published: |
MDPI AG
2026
|
| Online Access: |
Check full text
|
| URI: | https://cronfa.swan.ac.uk/Record/cronfa71351 |
| first_indexed |
2026-01-30T13:40:18Z |
|---|---|
| last_indexed |
2026-01-31T05:34:56Z |
| id |
cronfa71351 |
| recordtype |
SURis |
| fullrecord |
<?xml version="1.0"?><rfc1807><datestamp>2026-01-30T13:41:24.1073519</datestamp><bib-version>v2</bib-version><id>71351</id><entry>2026-01-30</entry><title>A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features</title><swanseaauthors><author><sid>c9972b26a83de11ffe211070f26fe16b</sid><ORCID>0000-0001-7795-453X</ORCID><firstname>Hassan</firstname><surname>Eshkiki</surname><name>Hassan Eshkiki</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>d0b8d4e63d512d4d67a02a23dd20dfdb</sid><ORCID>0000-0001-9199-7368</ORCID><firstname>Fabio</firstname><surname>Caraffini</surname><name>Fabio Caraffini</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2026-01-30</date><deptcode>MACS</deptcode><abstract>Detecting fake news is essential in natural language processing to verify news authenticity and prevent misinformation-driven social, political, and economic disruptions targeting specific groups. A major challenge in multimodal fake news detection is effectively integrating textual and visual modalities, as semantic gaps and contextual variations between images and text complicate alignment, interpretation, and the detection of subtle or blatant inconsistencies. To enhance accuracy in fake news detection, this article introduces an ensemble-based framework that integrates textual and visual data using ViLBERT’s two-stream architecture, incorporates VADER sentiment analysis to detect emotional language, and uses Image–Text Contextual Similarity to identify mismatches between visual and textual elements. These features are processed through the Bi-GRU classifier, Transformer-XL, DistilBERT, and XLNet, combined via a stacked ensemble method with soft voting, culminating in a T5 metaclassifier that predicts the outcome for robustness. Results on the Fakeddit and Weibo benchmarking datasets show that our method outperforms state-of-the-art models, achieving up to 96% and 94% accuracy in fake news detection, respectively. This study highlights the necessity for advanced multimodal fake news detection systems to address the increasing complexity of misinformation and offers a promising solution.</abstract><type>Journal Article</type><journal>Mathematics</journal><volume>14</volume><journalNumber>2</journalNumber><paginationStart>360</paginationStart><paginationEnd/><publisher>MDPI AG</publisher><placeOfPublication/><isbnPrint/><isbnElectronic/><issnPrint/><issnElectronic>2227-7390</issnElectronic><keywords>fake news detection; NLP; sentiment analysis; transformers; deep learning</keywords><publishedDay>21</publishedDay><publishedMonth>1</publishedMonth><publishedYear>2026</publishedYear><publishedDate>2026-01-21</publishedDate><doi>10.3390/math14020360</doi><url/><notes/><college>COLLEGE NANME</college><department>Mathematics and Computer Science School</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>MACS</DepartmentCode><institution>Swansea University</institution><apcterm>Other</apcterm><funders/><projectreference/><lastEdited>2026-01-30T13:41:24.1073519</lastEdited><Created>2026-01-30T13:35:57.7577885</Created><path><level id="1">Faculty of Science and Engineering</level><level id="2">School of Mathematics and Computer Science - Computer Science</level></path><authors><author><firstname>Muhammad</firstname><surname>Abdullah</surname><orcid>0009-0000-9434-7977</orcid><order>1</order></author><author><firstname>Hongying</firstname><surname>Zan</surname><order>2</order></author><author><firstname>Arifa</firstname><surname>Javed</surname><orcid>0009-0002-6112-6398</orcid><order>3</order></author><author><firstname>Muhammad</firstname><surname>Sohail</surname><order>4</order></author><author><firstname>Orken</firstname><surname>Mamyrbayev</surname><orcid>0000-0001-8318-3794</orcid><order>5</order></author><author><firstname>Zhanibek</firstname><surname>Turysbek</surname><orcid>0009-0004-2311-6249</orcid><order>6</order></author><author><firstname>Hassan</firstname><surname>Eshkiki</surname><orcid>0000-0001-7795-453X</orcid><order>7</order></author><author><firstname>Fabio</firstname><surname>Caraffini</surname><orcid>0000-0001-9199-7368</orcid><order>8</order></author></authors><documents><document><filename>71351__36148__5c55af71409942a2b3e73bf2b8ff2a6c.pdf</filename><originalFilename>71351.VOR.pdf</originalFilename><uploaded>2026-01-30T13:39:07.4505517</uploaded><type>Output</type><contentLength>16553566</contentLength><contentType>application/pdf</contentType><version>Version of Record</version><cronfaStatus>true</cronfaStatus><documentNotes>© 2026 by the authors. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language><licence>https://creativecommons.org/licenses/by/4.0/</licence></document></documents><OutputDurs/></rfc1807> |
| spelling |
2026-01-30T13:41:24.1073519 v2 71351 2026-01-30 A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features c9972b26a83de11ffe211070f26fe16b 0000-0001-7795-453X Hassan Eshkiki Hassan Eshkiki true false d0b8d4e63d512d4d67a02a23dd20dfdb 0000-0001-9199-7368 Fabio Caraffini Fabio Caraffini true false 2026-01-30 MACS Detecting fake news is essential in natural language processing to verify news authenticity and prevent misinformation-driven social, political, and economic disruptions targeting specific groups. A major challenge in multimodal fake news detection is effectively integrating textual and visual modalities, as semantic gaps and contextual variations between images and text complicate alignment, interpretation, and the detection of subtle or blatant inconsistencies. To enhance accuracy in fake news detection, this article introduces an ensemble-based framework that integrates textual and visual data using ViLBERT’s two-stream architecture, incorporates VADER sentiment analysis to detect emotional language, and uses Image–Text Contextual Similarity to identify mismatches between visual and textual elements. These features are processed through the Bi-GRU classifier, Transformer-XL, DistilBERT, and XLNet, combined via a stacked ensemble method with soft voting, culminating in a T5 metaclassifier that predicts the outcome for robustness. Results on the Fakeddit and Weibo benchmarking datasets show that our method outperforms state-of-the-art models, achieving up to 96% and 94% accuracy in fake news detection, respectively. This study highlights the necessity for advanced multimodal fake news detection systems to address the increasing complexity of misinformation and offers a promising solution. Journal Article Mathematics 14 2 360 MDPI AG 2227-7390 fake news detection; NLP; sentiment analysis; transformers; deep learning 21 1 2026 2026-01-21 10.3390/math14020360 COLLEGE NANME Mathematics and Computer Science School COLLEGE CODE MACS Swansea University Other 2026-01-30T13:41:24.1073519 2026-01-30T13:35:57.7577885 Faculty of Science and Engineering School of Mathematics and Computer Science - Computer Science Muhammad Abdullah 0009-0000-9434-7977 1 Hongying Zan 2 Arifa Javed 0009-0002-6112-6398 3 Muhammad Sohail 4 Orken Mamyrbayev 0000-0001-8318-3794 5 Zhanibek Turysbek 0009-0004-2311-6249 6 Hassan Eshkiki 0000-0001-7795-453X 7 Fabio Caraffini 0000-0001-9199-7368 8 71351__36148__5c55af71409942a2b3e73bf2b8ff2a6c.pdf 71351.VOR.pdf 2026-01-30T13:39:07.4505517 Output 16553566 application/pdf Version of Record true © 2026 by the authors. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license. true eng https://creativecommons.org/licenses/by/4.0/ |
| title |
A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features |
| spellingShingle |
A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features Hassan Eshkiki Fabio Caraffini |
| title_short |
A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features |
| title_full |
A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features |
| title_fullStr |
A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features |
| title_full_unstemmed |
A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features |
| title_sort |
A Multimodal Ensemble-Based Framework for Detecting Fake News Using Visual and Textual Features |
| author_id_str_mv |
c9972b26a83de11ffe211070f26fe16b d0b8d4e63d512d4d67a02a23dd20dfdb |
| author_id_fullname_str_mv |
c9972b26a83de11ffe211070f26fe16b_***_Hassan Eshkiki d0b8d4e63d512d4d67a02a23dd20dfdb_***_Fabio Caraffini |
| author |
Hassan Eshkiki Fabio Caraffini |
| author2 |
Muhammad Abdullah Hongying Zan Arifa Javed Muhammad Sohail Orken Mamyrbayev Zhanibek Turysbek Hassan Eshkiki Fabio Caraffini |
| format |
Journal article |
| container_title |
Mathematics |
| container_volume |
14 |
| container_issue |
2 |
| container_start_page |
360 |
| publishDate |
2026 |
| institution |
Swansea University |
| issn |
2227-7390 |
| doi_str_mv |
10.3390/math14020360 |
| publisher |
MDPI AG |
| college_str |
Faculty of Science and Engineering |
| hierarchytype |
|
| hierarchy_top_id |
facultyofscienceandengineering |
| hierarchy_top_title |
Faculty of Science and Engineering |
| hierarchy_parent_id |
facultyofscienceandengineering |
| hierarchy_parent_title |
Faculty of Science and Engineering |
| department_str |
School of Mathematics and Computer Science - Computer Science{{{_:::_}}}Faculty of Science and Engineering{{{_:::_}}}School of Mathematics and Computer Science - Computer Science |
| document_store_str |
1 |
| active_str |
0 |
| description |
Detecting fake news is essential in natural language processing to verify news authenticity and prevent misinformation-driven social, political, and economic disruptions targeting specific groups. A major challenge in multimodal fake news detection is effectively integrating textual and visual modalities, as semantic gaps and contextual variations between images and text complicate alignment, interpretation, and the detection of subtle or blatant inconsistencies. To enhance accuracy in fake news detection, this article introduces an ensemble-based framework that integrates textual and visual data using ViLBERT’s two-stream architecture, incorporates VADER sentiment analysis to detect emotional language, and uses Image–Text Contextual Similarity to identify mismatches between visual and textual elements. These features are processed through the Bi-GRU classifier, Transformer-XL, DistilBERT, and XLNet, combined via a stacked ensemble method with soft voting, culminating in a T5 metaclassifier that predicts the outcome for robustness. Results on the Fakeddit and Weibo benchmarking datasets show that our method outperforms state-of-the-art models, achieving up to 96% and 94% accuracy in fake news detection, respectively. This study highlights the necessity for advanced multimodal fake news detection systems to address the increasing complexity of misinformation and offers a promising solution. |
| published_date |
2026-01-21T05:35:15Z |
| _version_ |
1856896516390649856 |
| score |
11.096068 |

