Journal article 235 views 62 downloads
FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit
Applied Corpus Linguistics, Volume: 4, Issue: 3, Start page: 100103
Swansea University Author:
Steve Morris
-
PDF | Version of Record
© 2024 The Author(s). This is an open access article under the CC BY license.
Download (5.11MB)
DOI (Published version): 10.1016/j.acorp.2024.100103
Abstract
Qualitative free-text responses (e.g. from questionnaires and surveys) pose a challenge to many companies and institutions which lack the expertise to analyse such data with ease. While a range of sophisticated tools for the analysis of text do exist, these are often expensive, difficult to use and/...
| Published in: | Applied Corpus Linguistics |
|---|---|
| ISSN: | 2666-7991 |
| Published: |
Elsevier BV
2024
|
| Online Access: |
Check full text
|
| URI: | https://cronfa.swan.ac.uk/Record/cronfa70086 |
| first_indexed |
2025-07-31T14:16:32Z |
|---|---|
| last_indexed |
2025-08-12T06:05:06Z |
| id |
cronfa70086 |
| recordtype |
SURis |
| fullrecord |
<?xml version="1.0"?><rfc1807><datestamp>2025-08-11T15:42:44.5738748</datestamp><bib-version>v2</bib-version><id>70086</id><entry>2025-07-31</entry><title>FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit</title><swanseaauthors><author><sid>c696b0282208e9edd88c35b049a8c8d8</sid><ORCID>0000-0003-3789-1233</ORCID><firstname>Steve</firstname><surname>Morris</surname><name>Steve Morris</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2025-07-31</date><abstract>Qualitative free-text responses (e.g. from questionnaires and surveys) pose a challenge to many companies and institutions which lack the expertise to analyse such data with ease. While a range of sophisticated tools for the analysis of text do exist, these are often expensive, difficult to use and/or inaccessible to non-expert users. These tools also lack support for the analysis of English and Welsh text, which can be a particular challenge in the bilingual context of Wales. This paper details the key functionalities of the first corpus-based ‘FreeTxt’ toolkit which has been designed to support the systematic analysis and visualisation of free-text data, as a direct response to these two key needs. This paper demonstrates how, by working in partnership, software engineers, natural language processing (NLP) experts and corpus linguists can collaborate with end-users and beneficiaries to provide effective solutions to real world problems. Through the development of FreeTxt (www.freetxt.app), we aimed to empower end-users to direct and lead their own analyses of both small-scale and more extensive datasets to maximise the reach and potential impact generated. The approaches reported here, and the bilingual toolkit developed, can be replicated and extended for use in other language contexts and across a range of public and professional sectors. FreeTxt is now available for the analysis of Welsh and/or English, for use by anyone in any sector in Wales and beyond.</abstract><type>Journal Article</type><journal>Applied Corpus Linguistics</journal><volume>4</volume><journalNumber>3</journalNumber><paginationStart>100103</paginationStart><paginationEnd/><publisher>Elsevier BV</publisher><placeOfPublication/><isbnPrint/><isbnElectronic/><issnPrint>2666-7991</issnPrint><issnElectronic/><keywords>Corpus tools; Qualitative analysis; Free-text responses; Questionnaires</keywords><publishedDay>1</publishedDay><publishedMonth>12</publishedMonth><publishedYear>2024</publishedYear><publishedDate>2024-12-01</publishedDate><doi>10.1016/j.acorp.2024.100103</doi><url/><notes/><college>COLLEGE NANME</college><CollegeCode>COLLEGE CODE</CollegeCode><institution>Swansea University</institution><apcterm>Another institution paid the OA fee</apcterm><funders>The FreeTxt project was funded by AHRC (Arts and Humanities Research Council) follow-on funding for impact and engagement (grant number AH/W004844/1). This work also feeds more broadly into the work carried out as part of the ESRC (Economic and Social Research Council) and AHRC funded Corpws Cenedlaethol Cymraeg Cyfoes (The National Corpus of Contemporary Welsh): A community driven approach to linguistic corpus construction project (grant number ES/M011348/1).</funders><projectreference/><lastEdited>2025-08-11T15:42:44.5738748</lastEdited><Created>2025-07-31T15:14:18.0433866</Created><path><level id="1">Faculty of Humanities and Social Sciences</level><level id="2">School of Culture and Communication - English Language, Tesol, Applied Linguistics</level></path><authors><author><firstname>Dawn</firstname><surname>Knight</surname><orcid>0000-0002-4745-6502</orcid><order>1</order></author><author><firstname>Nouran</firstname><surname>Khallaf</surname><order>2</order></author><author><firstname>Paul</firstname><surname>Rayson</surname><order>3</order></author><author><firstname>Mahmoud</firstname><surname>El-Haj</surname><order>4</order></author><author><firstname>Ignatius</firstname><surname>Ezeani</surname><order>5</order></author><author><firstname>Steve</firstname><surname>Morris</surname><orcid>0000-0003-3789-1233</orcid><order>6</order></author></authors><documents><document><filename>70086__34939__f82a24c8c3ea49aaad5c813e2a7643c7.pdf</filename><originalFilename>70086.VoR.pdf</originalFilename><uploaded>2025-08-11T15:40:40.9049104</uploaded><type>Output</type><contentLength>5362011</contentLength><contentType>application/pdf</contentType><version>Version of Record</version><cronfaStatus>true</cronfaStatus><documentNotes>© 2024 The Author(s). This is an open access article under the CC BY license.</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language><licence>http://creativecommons.org/licenses/by/4.0/</licence></document></documents><OutputDurs/></rfc1807> |
| spelling |
2025-08-11T15:42:44.5738748 v2 70086 2025-07-31 FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit c696b0282208e9edd88c35b049a8c8d8 0000-0003-3789-1233 Steve Morris Steve Morris true false 2025-07-31 Qualitative free-text responses (e.g. from questionnaires and surveys) pose a challenge to many companies and institutions which lack the expertise to analyse such data with ease. While a range of sophisticated tools for the analysis of text do exist, these are often expensive, difficult to use and/or inaccessible to non-expert users. These tools also lack support for the analysis of English and Welsh text, which can be a particular challenge in the bilingual context of Wales. This paper details the key functionalities of the first corpus-based ‘FreeTxt’ toolkit which has been designed to support the systematic analysis and visualisation of free-text data, as a direct response to these two key needs. This paper demonstrates how, by working in partnership, software engineers, natural language processing (NLP) experts and corpus linguists can collaborate with end-users and beneficiaries to provide effective solutions to real world problems. Through the development of FreeTxt (www.freetxt.app), we aimed to empower end-users to direct and lead their own analyses of both small-scale and more extensive datasets to maximise the reach and potential impact generated. The approaches reported here, and the bilingual toolkit developed, can be replicated and extended for use in other language contexts and across a range of public and professional sectors. FreeTxt is now available for the analysis of Welsh and/or English, for use by anyone in any sector in Wales and beyond. Journal Article Applied Corpus Linguistics 4 3 100103 Elsevier BV 2666-7991 Corpus tools; Qualitative analysis; Free-text responses; Questionnaires 1 12 2024 2024-12-01 10.1016/j.acorp.2024.100103 COLLEGE NANME COLLEGE CODE Swansea University Another institution paid the OA fee The FreeTxt project was funded by AHRC (Arts and Humanities Research Council) follow-on funding for impact and engagement (grant number AH/W004844/1). This work also feeds more broadly into the work carried out as part of the ESRC (Economic and Social Research Council) and AHRC funded Corpws Cenedlaethol Cymraeg Cyfoes (The National Corpus of Contemporary Welsh): A community driven approach to linguistic corpus construction project (grant number ES/M011348/1). 2025-08-11T15:42:44.5738748 2025-07-31T15:14:18.0433866 Faculty of Humanities and Social Sciences School of Culture and Communication - English Language, Tesol, Applied Linguistics Dawn Knight 0000-0002-4745-6502 1 Nouran Khallaf 2 Paul Rayson 3 Mahmoud El-Haj 4 Ignatius Ezeani 5 Steve Morris 0000-0003-3789-1233 6 70086__34939__f82a24c8c3ea49aaad5c813e2a7643c7.pdf 70086.VoR.pdf 2025-08-11T15:40:40.9049104 Output 5362011 application/pdf Version of Record true © 2024 The Author(s). This is an open access article under the CC BY license. true eng http://creativecommons.org/licenses/by/4.0/ |
| title |
FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit |
| spellingShingle |
FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit Steve Morris |
| title_short |
FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit |
| title_full |
FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit |
| title_fullStr |
FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit |
| title_full_unstemmed |
FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit |
| title_sort |
FreeTxt: A corpus-based bilingual free-text survey and questionnaire data analysis toolkit |
| author_id_str_mv |
c696b0282208e9edd88c35b049a8c8d8 |
| author_id_fullname_str_mv |
c696b0282208e9edd88c35b049a8c8d8_***_Steve Morris |
| author |
Steve Morris |
| author2 |
Dawn Knight Nouran Khallaf Paul Rayson Mahmoud El-Haj Ignatius Ezeani Steve Morris |
| format |
Journal article |
| container_title |
Applied Corpus Linguistics |
| container_volume |
4 |
| container_issue |
3 |
| container_start_page |
100103 |
| publishDate |
2024 |
| institution |
Swansea University |
| issn |
2666-7991 |
| doi_str_mv |
10.1016/j.acorp.2024.100103 |
| publisher |
Elsevier BV |
| college_str |
Faculty of Humanities and Social Sciences |
| hierarchytype |
|
| hierarchy_top_id |
facultyofhumanitiesandsocialsciences |
| hierarchy_top_title |
Faculty of Humanities and Social Sciences |
| hierarchy_parent_id |
facultyofhumanitiesandsocialsciences |
| hierarchy_parent_title |
Faculty of Humanities and Social Sciences |
| department_str |
School of Culture and Communication - English Language, Tesol, Applied Linguistics{{{_:::_}}}Faculty of Humanities and Social Sciences{{{_:::_}}}School of Culture and Communication - English Language, Tesol, Applied Linguistics |
| document_store_str |
1 |
| active_str |
0 |
| description |
Qualitative free-text responses (e.g. from questionnaires and surveys) pose a challenge to many companies and institutions which lack the expertise to analyse such data with ease. While a range of sophisticated tools for the analysis of text do exist, these are often expensive, difficult to use and/or inaccessible to non-expert users. These tools also lack support for the analysis of English and Welsh text, which can be a particular challenge in the bilingual context of Wales. This paper details the key functionalities of the first corpus-based ‘FreeTxt’ toolkit which has been designed to support the systematic analysis and visualisation of free-text data, as a direct response to these two key needs. This paper demonstrates how, by working in partnership, software engineers, natural language processing (NLP) experts and corpus linguists can collaborate with end-users and beneficiaries to provide effective solutions to real world problems. Through the development of FreeTxt (www.freetxt.app), we aimed to empower end-users to direct and lead their own analyses of both small-scale and more extensive datasets to maximise the reach and potential impact generated. The approaches reported here, and the bilingual toolkit developed, can be replicated and extended for use in other language contexts and across a range of public and professional sectors. FreeTxt is now available for the analysis of Welsh and/or English, for use by anyone in any sector in Wales and beyond. |
| published_date |
2024-12-01T05:25:42Z |
| _version_ |
1851641290640326656 |
| score |
11.089822 |

