The Collection of Distributionally Idiosyncratic Items (CoDII)
CoDII was funded by the Deutsche
Forschungsgemeinschaft (2002-2008) and the Landesstiftung
Baden-Württemberg (2007-2009).
Purpose and Design of CoDII
The Collection of Distributionally Idiosyncratic Items (CoDII) is a linguistic resource on lexical items which have highly idiosyncratic occurrence patterns.
Steps in the development of CoDII:
Collaborators
The following people have contributed to the design and the compilation of CoDII.
Subcollections of CoDII
-
CoDII-BW.de
(Collection of Distributionally Idiosyncratic Items: German Bound Words / Sammlung unikaler Wörter des Deutschen, SuWD)
CoDII-BW.de contains information on 446 German bound words.
-
CoDII-BW.en
(Collection of Distributionally Idiosyncratic Items: English Bound Words)
CoDII-BW.en contains information on 77 English bound words.
-
CoDII-NPI.de
(Collection of Distributionally Idiosyncratic Items: Negative Polarity Items in German)
CoDII-NPI.de contains information on 165 German Negative Polarity Items.
-
CoDII-NPI.ro
(Collection of Distributionally Idiosyncratic Items: Negative Polarity Items in Romanian)
CoDII-NPI.ro contains information on 58 Romanian Negative Polarity Items.
-
CoDII-PPI.de
(Collection of Distributionally Idiosyncratic Items: Positive Polarity Items in German)
CoDII-PPI.de contains information on 88 German Positive Polarity Items.
Publications
- Jan-Philipp Soehn, Mingya Liu, Beata Trawiński and
Gianina Iordachioaia (2010). Nicht
sonderlich oder doch sattsam bekannt? Positive und Negative
Polaritätselemente als lexikalische Einheiten mit
Distributionsidiosynkrasien. In Jarmo Korhonen, Wolfgang
Mieder, Elisabeth Piirainen and Rosa Piñel (Eds.),
EUROPHRAS 2008 Beiträge zur internationalen
Phraseologiekonferenz vom 13.-16.8.2008 in Helsinki,
pp. 273-281, Helsinki, Finland.
- Beata Trawinski, Manfred Sailer, Jan-Philipp Soehn, Lothar Lemnitzer and Frank Richter (2008).
Cranberry Expressions in English and in German. In
Proceedings of the LREC Workshop Towards a Shared Task for Multiword Expressions (MWE 2008),
pp. 35-38. European Language Resources Association (ELRA): Marrakech, Morocco.
- Beata Trawiński and Jan-Philipp Soehn (2008).
A Multilingual Database of Polarity Items. In
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08).
European Language Resources Association (ELRA): Marrakech, Morocco.
- Beata Trawiński, Jan-Philipp Soehn, Manfred Sailer and Frank Richter (2008).
A Multilingual Electronic Database of Distributionally Idiosyncratic Items.
In Elisenda Bernal and Janet DeCesaris (Eds.),
Proceedings of the XIII Euralex International Congress,
Series: Activitats, Volume 20, pp. 1445-1451. Universitat Pompeu Fabra: Barcelona, Spain.
- Manfred Sailer and Beata Trawinski (2006).
Die Sammlung unikaler Wörter des Deutschen. Aufbauprinzipien und erste Auswertungsergebnisse
[The Collection of German Bound Words. Design Principles and First Evaluation].
In Annelies Häcki Buhofer and Harald Burger (Eds.), Phraseology in Motion I. Methoden und Kritik. Akten der Internationalen Tagung zur Phraseologie (Basel, 2004),
Series: Phraseologie und Parämiologie, Volume 19, pp. 439-450. Hohengehren: Schneider Verlag.
- Manfred Sailer and Beata Trawinski
(2006). The Collection of Distributionally Idiosyncratic
Items: A Multilingual Resource for
Linguistic Research. In
Proceedings of the
5th International Conference on Language Resources and Evaluation, LREC 2006,
pp. 471-474. Genoa, Italy.
Presentations
- 18. - 21. September 2012
Frank Richter, Manfred Sailer, Beata
Trawiński: Ein Forschungsportal an der Grammatik-Lexikon-Schnittstelle. Auf der
Jahrestagung der Gesellschaft für Angewandte
Linguistik, Erlangen.
- August 13 - 16, 2008
Jan-Philipp Soehn, Mingya Liu, Beata Trawinski and Gianina Iordachioaia:
Positive und Negative Polaritätselemente als lexikalische Einheiten mit Distributionsidiosynkrasien.
Talk given at Europhras 2008,
Helsinki, Finland.
- Juli 15 - 19, 2008
Beata Trawinski, Jan-Philipp Soehn, Manfred Sailer and Frank Richter:
A Multilingual Electronic Database of Distributionally Idiosyncratic Items.
Poster presented at the XIII Euralex Internacional Congress,
Barcelona, Spain.
- May 1, 2008
Beata Trawinski, Manfred Sailer, Jan-Philipp Soehn, Lothar Lemnitzer and Frank Richter:
Cranberry Expressions in English and in German.
Talk given at the LREC
Workshop Towards a Shared
Task for Multiword Expressions (MWE 2008),
Marrakech, Morocco.
- May 28-30, 2008
Beata Trawinski and Jan-Philipp Soehn:
A Multilingual Database of Polarity Items.
Poster presented at the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco.
- December 15, 2007
Mingya Liu, Frank Richter, Jan- Philipp Soehn and Beata Trawinski:
Distributionsidiosynkrasien in der Logischen Form.
Project report presented at the SFB-Tag, Tübingen, German.
- April 11-13, 2007
Beata Trawinski, Jan-Philipp Soehn and Frank Richter:
Modeling Distributionally Idiosyncratic Items in XML.
Poster presented at GLDV Conference 2007 (Biannual Conference of the Society for Computational Linguistics and Language Technology), Tübingen, Germany.
- March 08-10, 2007
Frank Richter, Jan-Philipp Soehn and Beata Trawinski:
Spotting, Collecting and Documenting NPIs.
Talk given at the Workshop on Negation and Polarity, Tübingen, Germany.
- October 5-7, 2006
Manfred Sailer: Modeling the Lexis-Grammar Interface in a Competence-Based Framework: The Case of Bound Words. Presentation at Exploring the
Lexis-Grammar Interface (ELeGI 2006), Hannover, Germany.
- May 22-28, 2006
Manfred Sailer and Beata Trawinski:
The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research.
Talk given at the
5th International Conference on Language Resources and Evaluation, LREC 2006, Genoa, Italy.
- December 20, 2004
Beata Trawinski: The Collection of Distributionally Idiosyncratic Items.
Invited talk given at the Institute of Computer Science at the
Polish Academy of Sciences in Warsaw, Poland.
- August 26-29, 2004
Manfred Sailer and Beata Trawinski:
Die Sammlung unikaler Wörter des Deutschen.
Aufbauprinzipien und erste Auswertungsergebnisse [The Collection of German Bound Words.
Design Principles and First Evaluation].
Talk given at EUROPHRAS 2004: Europäische Gesellschaft für Phraseologie
at the University of Basel, Switzerland.