<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.celtic-languages.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=AnSpideog</id>
	<title>Celtic Languages - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://www.celtic-languages.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=AnSpideog"/>
	<link rel="alternate" type="text/html" href="https://www.celtic-languages.org/Special:Contributions/AnSpideog"/>
	<updated>2026-04-30T03:07:56Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.36.1</generator>
	<entry>
		<id>https://www.celtic-languages.org/index.php?title=Category:Text_corpora&amp;diff=1313</id>
		<title>Category:Text corpora</title>
		<link rel="alternate" type="text/html" href="https://www.celtic-languages.org/index.php?title=Category:Text_corpora&amp;diff=1313"/>
		<updated>2024-07-07T09:16:58Z</updated>

		<summary type="html">&lt;p&gt;AnSpideog: /* Manx */ Updated discord username&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A [https://en.wikipedia.org/wiki/Text_corpus '''text corpus'''] is a large collection of (often as diverse as possible) texts in a given language. Text corpora are very useful for checking how often a given expression is used in a language (and which are rare or never occur), thus they might be a huge help to learners who want to check whether a certain construction is natural-sounding or not. They are also extremely useful when doing research about language grammar and how it changes over time. Many corpora contain also additional information about individual words (like their part-of-speech and other inflectional information when applicable, like grammatical case, gender, tense, mood…).&lt;br /&gt;
&lt;br /&gt;
Below is a list of useful publicly-available corpora for Celtic languages.&lt;br /&gt;
&lt;br /&gt;
== Brittonic ==&lt;br /&gt;
&lt;br /&gt;
=== Breton ===&lt;br /&gt;
&lt;br /&gt;
=== Cornish ===&lt;br /&gt;
&lt;br /&gt;
=== Welsh ===&lt;br /&gt;
&lt;br /&gt;
== Goidelic ==&lt;br /&gt;
&lt;br /&gt;
=== Irish ===&lt;br /&gt;
* [http://corpas.focloir.ie ''Nua-Chorpas na hÉireann''] (''New Corpus for Ireland''), also known as the ''Foclóir.ie corpus''&lt;br /&gt;
:* '''requires registration''' (free-of-charge)&lt;br /&gt;
:* all kinds of modern texts (fiction, poetry, official documents, newspaper articles, etc. from 20th and 21st century) written by both native and non-native speakers&lt;br /&gt;
:* allows filtering by text type (native/non-native, specific dialect)&lt;br /&gt;
:* part-of-speech tagged&lt;br /&gt;
:* uses [https://en.wikipedia.org/wiki/Sketch_Engine Sketch Engine]&lt;br /&gt;
:* alternative new interface at [https://focloir.sketchengine.eu focloir.sketchengine.eu]&lt;br /&gt;
:* See [[Irish/Using Nua-Chorpas na hÉireann]] for some additional tips&lt;br /&gt;
* [http://corpas.ria.ie ''Historical Irish Corpus''], also known as the ''RIA corpus''&lt;br /&gt;
:* publicly available&lt;br /&gt;
:* literary texts composed in Irish between 1600 and 1926&lt;br /&gt;
:* part-of-speech tagged but with limited search functionality&lt;br /&gt;
:* allows searching for words in original spelling and by their modern standardized forms&lt;br /&gt;
&lt;br /&gt;
=== Manx ===&lt;br /&gt;
* [https://corpus.gaelg.im/ Manx Corpus Search]&lt;br /&gt;
:* publicly available&lt;br /&gt;
:* over 600 texts between 1610 and present, accompanied by English translations&lt;br /&gt;
:* focus on pre-1908 native Manx literature with the aim to store '''everything''' written in Manx before 1908&lt;br /&gt;
:* open source, the search interface software hosted at [https://github.com/david-allison/manx-corpus-search https://github.com/david-allison/manx-corpus-search], corpus data at [https://github.com/david-allison/manx-search-data https://github.com/david-allison/manx-search-data]&lt;br /&gt;
:* if you need assistance, have a feature request, etc. you may contact the corpus’ maintainer on Github or on Celtic Languages Discord (Discord handle ''davida0813'')&lt;br /&gt;
&lt;br /&gt;
=== Scottish Gaelic ===&lt;br /&gt;
* [https://dasg.ac.uk/corpus/ ''Corpas na Gàidhlig'' DASG]&lt;br /&gt;
:* publicly available&lt;br /&gt;
:* contains literary texts in Scottish Gaelic, from 12th century to 21st century (but mostly modern, 18th–21st c. texts; might be extended in the future with transcriptions of spoken language)&lt;br /&gt;
:* '''not''' part-of-speech tagged&lt;br /&gt;
:* based on [https://cwb.sourceforge.io/ Corpus Workbench (CWB)] with modified [https://cwb.sourceforge.io/cqpweb.php CQPWeb] interface&lt;br /&gt;
:* allows filtering by time period, geographical origin, particular text&lt;br /&gt;
:* allows complex queries using the CQP-syntax (but limited due to the lack of POS-tagging)&lt;br /&gt;
:* See [[Gaelic/Using Corpas na Gàidhlig]] for some additional tips&lt;br /&gt;
&lt;br /&gt;
=== Classical Gaelic ===&lt;br /&gt;
* [https://app.sketchengine.eu/#dashboard?corpname=preloaded%2Ffiliochttcdv1 ''Irish Syllabic Poetry'' corpus]&lt;br /&gt;
:* corpus of Classical Gaelic bardic poetry&lt;br /&gt;
:* part-of-speech tagged (although very imperfect since based on tagging method for modern Irish)&lt;br /&gt;
:* uses non-normalized spelling (so finding a form might be difficult sometimes)&lt;br /&gt;
:* based on [https://bardic.celt.dias.ie/ Bardic Poetry Database]&lt;br /&gt;
:* uses [https://en.wikipedia.org/wiki/Sketch_Engine Sketch Engine]&lt;br /&gt;
* [http://corpas.ria.ie ''Historical Irish Corpus''] (corpus RIA) – mainly a Modern Irish corpus, but also useful for Early Modern texts and sometimes bardic poetry since it contains texts from 1600 and later, see under [[#Irish|Irish]]&lt;br /&gt;
&lt;br /&gt;
=== Old Irish ===&lt;br /&gt;
* [https://chronhib.maynoothuniversity.ie/chronhibWebsite/tables ''Corpus Palaeo-Hibernicum''] – created by the ChronHib project&lt;br /&gt;
:* 78 Old Irish texts: OIr. glosses, Annals of Ulster, poems of Blathmac, and some tales&lt;br /&gt;
:* fully translated and with full annotations of morphological forms&lt;br /&gt;
:* it’s structured around a spreadsheet-like table interface – searching for specific forms requires displaying the '''whole''' table though (so choose the biggest “Results per Page” value at the bottom, beware that it may make it pretty laggy)&lt;br /&gt;
:* the corpus text is in the ''Sentences'' table, the ''Lemmata'' table is basically a glossary of all the words in the corpus, with translation and often etymological info, the ''Morphology'' table contains the whole corpus text broken down into morphological units with POS-tags and comments&lt;br /&gt;
:* it offers expected normalized spellings of corpus forms&lt;br /&gt;
:* the data can be exported to CSV files&lt;/div&gt;</summary>
		<author><name>AnSpideog</name></author>
	</entry>
	<entry>
		<id>https://www.celtic-languages.org/index.php?title=Irish/Resources&amp;diff=1258</id>
		<title>Irish/Resources</title>
		<link rel="alternate" type="text/html" href="https://www.celtic-languages.org/index.php?title=Irish/Resources&amp;diff=1258"/>
		<updated>2024-03-07T09:07:44Z</updated>

		<summary type="html">&lt;p&gt;AnSpideog: /* Changed from broken discord links to google drive links */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Free resources on the internet=&lt;br /&gt;
&lt;br /&gt;
==Textbooks==&lt;br /&gt;
* [https://bhillers.pages.iu.edu/buntus.html Buntús na Gaeilge] (mostly Standard Irish with a preference for Ulster options, dialect differences are noted)&lt;br /&gt;
* [https://drive.google.com/file/d/14r6uOpc_RgNdyccwTle7-9y6NbGh4sqN/view?usp=sharing Teach Yourself Irish 1961 - Retypeset] (dialectal Munster Irish)&lt;br /&gt;
* [https://drive.google.com/file/d/1f3cOHk6-GClmUBkPpw4x85ZtlHoO0nQV/view?usp=sharing Teach Yourself Irish 1961 - Scanned] (dialectal Munster Irish)&lt;br /&gt;
* [https://drive.google.com/file/d/1mXWSqVhYtVCcZa3hG5CGhXz-DIzdyZ_e/view?usp=sharing Teach Yourself Irish 1961 - Audio] (dialectal Munster Irish)&lt;br /&gt;
The retypeset version of Teach Yourself Irish has the audio built in and more reliable search, but it also has typos and even stunning errors where the retypesetter has added incorrect footnotes.&lt;br /&gt;
&lt;br /&gt;
==Video series==&lt;br /&gt;
* Now You're Talking (Ulster Irish)&lt;br /&gt;
** [https://www.youtube.com/watch?v=7nz--LpYC30 Video series on Youtube]&lt;br /&gt;
** [https://eolas38.wixsite.com/ultach/copy-of-aonad-1 Textbook + Audio + Video]&lt;br /&gt;
* [http://www.ceimaraghaidh.ie/don-fhoghlaimeoir/ Céim Ar Aghaidh] (Ulster Irish, for advanced learners)&lt;br /&gt;
&lt;br /&gt;
==Podcasts==&lt;br /&gt;
* [http://culturlann.org/beginner%20podcast Beginner Podcast series from Cultúrlann Uí Chanáin] (Ulster Irish)&lt;br /&gt;
&lt;br /&gt;
==Spelling and Pronunciation==&lt;br /&gt;
* [https://www.youtube.com/watch?v=oIokUII7LX0 Introduction to the Irish spelling] (Video)&lt;br /&gt;
* [https://web.archive.org/web/20120321160459/http://www.ultach.dsl.pipex.com/ForLearners/UIPron.doc Ulster Irish, A Pronunciation Guide]&lt;br /&gt;
* [http://www.fuaimeanna.ie/en/ Fuaimeanna na Gaeilge] (Sounds of Irish with sample recordings in three dialects)&lt;br /&gt;
** A site to go along with the book ''Fuaimeanna na Gaeilge''. Contains recordings from native speakers of each main dialect area (specifically, Gaoth Dobhair in Donegal, An Ceathrú Rua in Connemara and Corca Dhuibhne in Kerry) pronouncing each phoneme in the language. The book contains lists of minimal pairs as well.&lt;br /&gt;
* [https://www.cogg.ie/wp-content/uploads/Understanding-Irish-Spelling-A-Handbook-for-Teachers-and-Learners-by-Dr.-Nancy-Stenson-and-Dr.-Tina-Hickey-1.pdf Understanding Irish Spelling: A Handbook for Teachers and Learners]&lt;br /&gt;
* [https://www.forasnagaeilge.ie/about/an-gum/focloir-poca/?lang=en Foclóir Póca - Description and Phonetic Guide] pdf and audio files containing an introduction to the standardised system of pronunciation sometimes known as &amp;quot;lárchanúint&amp;quot; (central dialect) devised by Dónall Ó Baoill&lt;br /&gt;
&lt;br /&gt;
==Dictionaries (with pronunciation)==&lt;br /&gt;
* [https://www.teanglann.ie/en Teanglann]&lt;br /&gt;
** An online version of de Bhaldraithe's English-Irish Dictionary (EID) and Ó Dónall's Foclór Gaeilge-Béarla (FGB), as well as An Foclóir Beag and a grammar tab and pronunciation tab. Very handy to look up words.&lt;br /&gt;
* [https://www.focloir.ie/ New English-Irish Dictionary (NEID)]&lt;br /&gt;
&lt;br /&gt;
==Grammar==&lt;br /&gt;
* Gramadach na Gaeilge (GnaG): An online grammar covering just about anything you can think of for Irish. Does contain some mistakes, but the most comprehensive free grammar you can find. (Sometimes a bit technical, but if you can deal with grammar terminology, very useful)&lt;br /&gt;
** [http://nualeargais.ie/gnag/gramadac.htm English translation of an older version]&lt;br /&gt;
** [http://braesicke.de/gramadac.htm German original version]&lt;br /&gt;
* [https://data.oireachtas.ie/ie/oireachtas/caighdeanOifigiul/2017/2017-08-03_an-caighdean-oifigiuil-2017_en.pdf An Caighdeán Oifigiúil] (the official standard; written in Irish)&lt;br /&gt;
* [http://www.reimnigh.com/ Réimnigh]: Conjugation tables for common verbs (standard and all three major dialects)&lt;br /&gt;
&lt;br /&gt;
==Word games==&lt;br /&gt;
* [https://www.foclach.com/ Foclach]: Wordle in Irish&lt;br /&gt;
* [https://www.litreach.ie/ Litreach]: Spell the words you hear&lt;br /&gt;
* [https://cadhan.com/beach/ga/index.html# Cíor Thuathail]: Find as many Irish words as you can&lt;br /&gt;
* [https://seafóid.com/ Seafóid!] An Irish adaption of the Waffle game&lt;br /&gt;
&lt;br /&gt;
==Media==&lt;br /&gt;
=== Raidió na Gaeltachta (RnaG) ===&lt;br /&gt;
[https://www.rte.ie/radio/rnag/ Raidió na Gaeltachta] is an Irish-language radio station. They broadcast a wide variety of programmes, including:&lt;br /&gt;
&lt;br /&gt;
* [https://www.rte.ie/radio/rnag/adhmhaidin/ Adhmhaidin]: A(n) (inter)national current affairs show, with speakers from '''all over Ireland'''&lt;br /&gt;
* [https://www.rte.ie/radio/rnag/an-saol-o-dheas/ An Saol Ó Dheas]: Current affairs from '''Munster''', with speakers from the dialectal areas included&lt;br /&gt;
* [https://www.rte.ie/radio/rnag/barrscealta/ Barrscéalta]: Current affairs in '''Donegal''', usually with presenters from Gaoth Dobhair&lt;br /&gt;
* [https://www.rte.ie/radio/rnag/iris-aniar/ Iris Aniar]: Current affairs in '''Connacht''' and '''Meath''', with speakers from the dialectal areas included&lt;br /&gt;
&lt;br /&gt;
=== TG4 ===&lt;br /&gt;
[https://www.tg4.ie TG4] is the Irish-language television station. Plays a variety of programmes from a variety of different topics. Most programmes only contain burnt-in English subs, but this is changing recently. Various recommended  programmes include: &lt;br /&gt;
&lt;br /&gt;
* [https://www.tg4.ie/en/player/categories/lifestyle-tv-shows/?series=Comhr%C3%A1&amp;amp;genre=Saolchlar Comhrá]: A regular conversation show between iconic host Máirtín Tom Sheáinín and a different guest where he talks with them about their life experiences&lt;br /&gt;
* [https://www.tg4.ie/en/player/categories/childrens-tv-shows/ Cúla4]: Kids programming on TG4. Good for children or those who want to practice with vocabulary/structures aimed at a child's ability. Might be boring for adults.&lt;br /&gt;
* [https://www.tg4.ie/en/player/categories/news-stories/?series=Nuacht%20TG4&amp;amp;genre=Cursai%20Reatha Nuacht TG4]: The daily news show. High quality presenters. Contains no subtitles.&lt;br /&gt;
* [https://www.tg4.ie/en/player/categories/drama-tv-shows/?series=Ros%20na%20R%C3%BAn%20(S25)&amp;amp;genre=Drama Ros na Rún]: An Irish-language soap opera, running since 1996 with over 1500 episodes. Later seasons have options for English subs or Irish subs. Earlier seasons and episodes can be found on YouTube.&lt;br /&gt;
* [https://www.tg4.ie/en/player/categories/news-stories/?series=7%20L%C3%A1&amp;amp;genre=Cursai%20Reatha Seacht Lá]: Politics and the economy, regional, national and international affairs. Live debate on the major issues of the day with a panel of analysts, experts and political pundits. No subtitles.&lt;br /&gt;
&lt;br /&gt;
=== Vifax ===&lt;br /&gt;
[https://vifax.maynoothuniversity.ie/ Vifax] is a website created and updated by Maynooth University, which takes clips from Nuacht TG4 and transcribes them. They also include questions at various levels (beginner, intermediate, advanced) to help guide listening and focus on listening ability. Very useful resource to practice listening ability.&lt;br /&gt;
&lt;br /&gt;
There is also a helpful blog post about [http://gaeilgechonamara.com/how-to-best-use-vifax-at-any-level/ how to use Vifax at varying levels].&lt;br /&gt;
&lt;br /&gt;
==Apps==&lt;br /&gt;
* [http://www.duolingo.com Duolingo]: It's all right when you have the right expectations. I found it useful for vocabulary and just practice in general, but I wouldn't recommend it as the only thing to use.&lt;br /&gt;
** The mobile apps don't provide to the tips page that each skill has in the web version. You can refer to [https://duome.eu/tips/en/ga duome.eu] to read them on mobile.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Blogs and other dialectal resources==&lt;br /&gt;
Most of these blogs focus on specific regional dialects.&lt;br /&gt;
&lt;br /&gt;
===Southern Scotland and East Ulster===&lt;br /&gt;
&lt;br /&gt;
* [http://dalriada.scot/ dalriada.scot]: &amp;quot;Dail Riada&amp;quot; (Central Argyll), Southern Scotland&lt;br /&gt;
* [https://gaidhligarainn.blogspot.com/ gaidhligarainn.blogspot]: Arran, Southern Scotland&lt;br /&gt;
* [http://rathlingaelic.blogspot.com/ rathlingaelic.blogspot]: Rathlin&lt;br /&gt;
* [https://gaedhilgreachraidh.blogspot.com/ gaedhilgreachraidh.blogspot]: Rathlin&lt;br /&gt;
* [http://cluaindaimh.blogspot.com/ cluaindaimh.blogspot]: Cluain Daimh (Clonduff), co. Down&lt;br /&gt;
* [http://irishofthefews.blogspot.com/ irishofthefews.blogspot]: the Fews, south co. Armagh&lt;br /&gt;
* [http://oirialla.blogspot.com/ oirialla.blogspot]: Oriel&lt;br /&gt;
* [https://archive.org/details/mionaintnamieag00dubhgoog/ Mion-ċaint na Miḋe agus Ulaḋ (Pádraig Ó Dubhthaigh)]: a book from 1905 with information on the extinct Meath dialect&lt;br /&gt;
* [https://gaeltachtnaspeirini.blogspot.com/ gaeltachtnaspeirini.blogspot]: the Sperrins, co. Tyrone&lt;br /&gt;
* [https://inishowenirish.blogspot.com/ inishowenirish.blogspot]: Inishowen, co. Donegal&lt;br /&gt;
&lt;br /&gt;
===West Ulster===&lt;br /&gt;
&lt;br /&gt;
* [https://gaedhilgthoraighe.blogspot.com/ gaedhilgthoraighe.blogspot]: Tory, co. Donegal&lt;br /&gt;
* [https://ramasbearoide.blogspot.com/ ramasbearoide.blogspot]: mostly north co. Donegal&lt;br /&gt;
* [http://www.rannnafeirste.com/ rannnafeirste.com]: Rann na Feirste, co. Donegal&lt;br /&gt;
&lt;br /&gt;
===Connaught===&lt;br /&gt;
&lt;br /&gt;
* [http://gaeilgeiorrais.ie/index.php/canuinti-local-dialects gaeilgeiorrais.ie]: some resources for Iorras, co. Mayo&lt;br /&gt;
* [https://www.irishlanguageincountymayo.com/ irishlanguageincountymayo.com]: co. Mayo&lt;br /&gt;
* [http://gaeilgechonamara.com/ gaeilgechonamara.com]: Conamara, co. Galway&lt;br /&gt;
* [https://gaeiglenalachan.wordpress.com/ gaeiglenalachan.wordpress]: Mionlach, co. Galway&lt;br /&gt;
&lt;br /&gt;
===West Munster===&lt;br /&gt;
&lt;br /&gt;
* [https://corkirish.wordpress.com/ corkirish.wordpress]: co. Cork&lt;br /&gt;
* [https://muiris.net/ muiris.net]: most Cléire (Cape Clear), co. Cork&lt;br /&gt;
&lt;br /&gt;
===East Munster===&lt;br /&gt;
&lt;br /&gt;
* [https://tonyofloinn.weebly.com/gaelainn-oirthear-chorcaiacute.html/ tonyofloinn.weebly]: East Cork&lt;br /&gt;
&lt;br /&gt;
===Leinster===&lt;br /&gt;
&lt;br /&gt;
* [http://kilkennyirish.blogspot.com/ kilkennyirish.blogspot]: co. Kilkenny&lt;br /&gt;
* [https://gaeiligchillechoinnigh.blogspot.com/ gaeiligchillechoinnigh.blogspot]: co. Kilkenny&lt;br /&gt;
* [http://dublingaelic.blogspot.com/ dublingaelic.blogspot]: Dublin (the historical dialects, not post-revival)&lt;br /&gt;
&lt;br /&gt;
===General===&lt;br /&gt;
&lt;br /&gt;
* [https://www3.smo.uhi.ac.uk/oduibhin/index.htm Gaelic Resources - Ciarán Ó Duibhín] mostly West Ulster, some East Ulster and Scottish stuff&lt;br /&gt;
&lt;br /&gt;
==Misc==&lt;br /&gt;
* [https://cdn.discordapp.com/attachments/262774160294412288/1009119722702385193/Bealoideas_Index.pdf Béaloideas index]&lt;br /&gt;
&lt;br /&gt;
[[Category:Irish]]&lt;/div&gt;</summary>
		<author><name>AnSpideog</name></author>
	</entry>
</feed>