For full functionality of Sketch Engine it is necessary to
enable JavaScript
Internet_Parsebank_4B
Internet_Parsebank_4B
defaults
Reset settings
English
česky
slovensky
简体中文
繁體中文
Gaeilge
slovenščina
hrvatski
العربية
español
français
українська
polski
Home
Search
Word list
Corpus info
My jobs
User guide
All words
All lemmas
Find x
Menu position
This action may take several minutes for large corpora, please wait.
Word list options
Subcorpus:
None (whole corpus)
Laakea ap
info
create new
Search attribute:
word
lemma
tag
morpho
deptype
head
doc.url
doc.wordcount
use n-grams
. Value of n: from
2
3
4
5
6
to
2
3
4
5
6
hide/nest sub-n-grams
Filter options:
Filter word list by:
Regular expression:
Minimum frequency:
Maximum frequency:
(0 = no maximum frequency)
Whitelist:
Blacklist:
format
Word list whitelists and blacklists must be plain text (.txt), encoded in UTF-8, with one item per line. The items must correspond to the selected attribute, so, eg, if 'lemma' is selected from the attribute menu, then the list should be a list of lemmas. We use exact matching, not regular-expression matching, for file input.
Include non-words
Output options:
Frequency figures:
Hit counts
Document counts
ARF
Output type:
Simple
Keywords
Reference (sub)corpus
Internet_Parsebank_4B
(whole corpus)
the rest of the corpus
Laakea ap
Prefer:
rare words
common words
Change output attribute(s)
---
word
lemma
tag
morpho
deptype
head
---
word
lemma
tag
morpho
deptype
head
---
word
lemma
tag
morpho
deptype
head
You can select one or more output attributes. Please note that this option can be time-consuming.