Université Paris 5 | Lexique - A Free Lexical Database | CNRS
A website made by Boris New & Christophe Pallier

  • Our website distributes many resources for the French language. These resources include:

    • A database of 130,000 words with phonemic representation, syllabified form, grammatical category, gender, number, frequency, lemma, number of phonemes, number of letters, uniqueness point, web frequency, etc. [Lexique]
    • A database of surface frequencies (letter, bigram, trigram, syllable, and phoneme) [Surface]
    • A database of first names with sex, language, and frequency (11,000 entries) [Prenoms]
    • A list of homographs
    • An anagram database [Anagrammes]
    • All the orthographic neighbours with frequencies [Voisins]
    • Lists of words and nonwords from the Frantext corpus (including proper nouns, etc.), with frequencies [FreqFrant]
    • A free corpus of 37 million words [Corpatext]
    • A set of GNU tools (gawk, bash, etc.) that beginners can easily install for text manipulation [Undows]
    • Almost all these resources are distributed under a GNU or GNU-like licence
    • A search engine to interrogate several databases simultaneously

    There is not yet a complete English version of this site. However, you can search Lexique online, download the resources, and ask your questions in the Forum.

    If you're interested in an English version, you can help us to do it by

    Best,

    The Lexique Team