summaryrefslogtreecommitdiff
path: root/index.html
diff options
context:
space:
mode:
Diffstat (limited to 'index.html')
-rw-r--r--index.html15
1 files changed, 10 insertions, 5 deletions
diff --git a/index.html b/index.html
index 306087f..8dff7cd 100644
--- a/index.html
+++ b/index.html
@@ -22,11 +22,6 @@
</p>
<ul>
<li>
- The Big List: <a href="/tmt/word-list.txt.xz">word-list.txt.xz (27MB compressed, 120MB uncompressed, 9,878,558 entries)</a>.¹<br>
- Every English Wikipedia article title &amp; entry in English Wiktionary; containing only ASCII a-z/A-Z/space, max 2 words.<br>
- Words labelled <i>offensive</i> on Wiktionary were filtered out (overly aggressively—some totally inoffensive words were removed in the process).
- </li>
- <li>
English definitions:
<a href="/wiktionary/en-definitions.txt.xz">en-definitions.txt.xz (23MB compressed, 127MB uncompressed, 1,629,482 entries)</a>
and<br>Translingual definitions:
@@ -68,6 +63,16 @@
<code>DEFINITION</code> is in the wikitext format.<br>
It’s possible that there are parsing errors, but I haven’t spotted any yet.
</li>
+ <li>
+ All English animal terms: <a href="/wiktionary/animalia.txt.xz">animalia.txt.xz (62KB compressed, 192KB uncompressed)</a>.¹<br>
+ This includes both nouns referring to animals (e.g. <i>dog</i>) and animal-related adjectives (e.g. <i>canine</i>).
+ There could definitely be errors due to bad parsing (but I have checked a number of entries at random and they seem good).
+ </li>
+ <li>
+ The Big List: <a href="/tmt/word-list.txt.xz">word-list.txt.xz (27MB compressed, 120MB uncompressed, 9,878,558 entries)</a>.¹<br>
+ Every English Wikipedia article title &amp; entry in English Wiktionary; containing only ASCII a-z/A-Z/space, max 2 words.<br>
+ Words labelled <i>offensive</i> on Wiktionary were filtered out (overly aggressively—some totally inoffensive words were removed in the process).
+ </li>
</ul>
<p>¹ Derived from <a href="https://dumps.wikimedia.org/enwiktionary/20250701/" target="_blank">enwiktionary-20250701</a> dump.</p>
</body>