THE NEW/S/LEAK PROJECT
new/s/leak (NetWork of Searchable Leaks) is a research project producing software that lets users quickly and intuitively explore large amounts of textual data. The tool will support journalists working with datasets like the War Diaries or the Embassy cables (both distributed by WikiLeaks) or the data from the hacking of Hacking Team. The goal is to provide quick access to important entities (people, organizations, places) and their relationships, and to show how these change over time.
Based on the Network of the Day, our software will combine the latest research in language technology and information visualization to create a powerful tool for searching and exploring: it will be easy to use and understand, with a pleasant look-and-feel.
You can access publicly available demos of new/s/leak:
- Wikipedia Multilingual WW II (user: user, password: password)
- Transparenzportal Hamburg (user: user, password: password)
The Wikipedia demo instance is populated with a dataset of Wikipedia articles on the topic of World War II in four languages (English, German, Hungarian, Spanish). For more information see:
- Wiedemann G., Yimam S.M., and Biemann C. (2018): New/s/leak 2.0 – Multilingual Information Extraction and Visualization for Investigative Journalism. In: Proceedings of the 10th International Conference on Social Informatics (SocInfo 2018). St. Petersburg, Russia (pdf)