We expect that researchers will contribute corpora constructed with TalkBank programs and tools. It is the obligation of TalkBank and TalkBank users to ensure that these contributions are properly acknowledged and cited and that the data are correctly stored and distributed.
To contribute a new data set to the Discourse PsychosisBank:
For PsychosisBank contributions, please send an email describing your contribution to email@example.com, Brian MacWhinney (firstname.lastname@example.org), and Lena Palaniyappan (email@example.com).
Both audio files and transcriptions are welcome. If you have transcripts, please be aware that TalkBank uses CHAT files, which must pass CLAN's check program.
However, it is also possible to contribute transcripts in a different format. Please specify the format of your transcriptions in your contribution description, and we will work with you to find the best way to upload your data.
If you happen to have transcripts in CHAT format, please note that TalkBank uses a strict system for matching transcripts to media. This requires that each transcript align with only one media file and that the names of the transcript file and the media file be the same (ignoring the extensions). For example, the file 020456.cha must have a matching 020456.mp4 (or .wav or .mp3) media file. In addition, the @Media line in the *.cha file should use the name of the media which matches the name of the transcript. In general, please try to use short file names to make processing easier. Information already provided in folder names and the @ID lines does not need to be duplicated in file names.
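The matching rule above can be checked before you send anything. The following is a minimal sketch, not an official TalkBank or CLAN tool: the function names are hypothetical, and it assumes that the @Media line has the layout `@Media:<tab>name, audio`, so that the media name is the first comma-separated field. It does not replace CLAN's check program.

```python
from pathlib import Path

# Media extensions named in the text above.
MEDIA_EXTENSIONS = {".mp4", ".wav", ".mp3"}


def media_name_from_transcript(cha_path):
    """Return the first field of the @Media line, or None if the line is absent."""
    with open(cha_path, encoding="utf-8") as handle:
        for line in handle:
            if line.startswith("@Media:"):
                # Assumed layout: "@Media:<tab>name, audio"
                return line[len("@Media:"):].split(",")[0].strip()
    return None


def check_folder(folder):
    """Return a list of human-readable problems found in one folder."""
    folder = Path(folder)
    problems = []
    for cha in sorted(folder.glob("*.cha")):
        # A media file matches when it has the same name, ignoring the extension.
        media = [p for p in folder.iterdir()
                 if p.stem == cha.stem and p.suffix.lower() in MEDIA_EXTENSIONS]
        if not media:
            problems.append(f"{cha.name}: no matching media file")
        declared = media_name_from_transcript(cha)
        if declared != cha.stem:
            problems.append(
                f"{cha.name}: @Media names {declared!r}, expected {cha.stem!r}")
    return problems
```

Calling `check_folder("my_corpus")` on a folder that contains 020456.cha and 020456.wav, with an @Media line naming 020456, returns an empty list; any other result lists the files to fix.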
Please combine your audio files, transcripts and documentation files into a single .zip file and send that file as an encrypted email attachment to Brian MacWhinney (firstname.lastname@example.org).
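If you prefer to build the .zip file with a script, the sketch below shows one way to do it with Python's standard zipfile module. The function name `bundle` and the folder layout are hypothetical. Note that zipfile cannot write encrypted archives, so this sketch only builds the archive; encryption still has to be handled by your mail client or another tool.

```python
import zipfile
from pathlib import Path


def bundle(folder, zip_path):
    """Write every file under `folder` into one .zip, keeping relative paths."""
    folder = Path(folder)
    zip_path = Path(zip_path)
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for path in sorted(folder.rglob("*")):
            # Skip directories and the archive itself if it lives inside `folder`.
            if path.is_file() and path.resolve() != zip_path.resolve():
                archive.write(path, arcname=path.relative_to(folder).as_posix())
    return zip_path
```

For example, `bundle("my_corpus", "my_corpus.zip")` produces a single archive whose internal paths mirror the folder structure.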
Documentation should include information for a web page, such as this one. You can use this template to create that HTML page. You just need to replace the various XXX fields with the necessary information.
Our recommendations for media formats are given in section 5.1 of the CLAN manual. Audio should be in WAV format; we can then create MP3 files from the WAV files.
Because audio files are usually too large to send through email, you will need to transfer them through WeTransfer, following these steps: