FAQ on the CPE Project (for publishers)

How did you select the journals you would like to include in the Corpus?
We selected journals based on the "Journal Citation Report 2001," which is published by the Institute for Scientific Information. In principle, we chose the top 20% of the journals listed in each category, ranked by impact factor.

Would you like permission for 50,000 words from each title or a total of 50,000 words selected from all our journals?
If possible, we would like you to provide us with 50,000 words of text per title, not a total of 50,000 words across all titles.

We do not grant blanket permissions. Could you let us know which parts of our journal(s) you would like to include in the Corpus? Also, how are you planning to obtain the texts?
As we are an academic society specializing in linguistics, we do not subscribe to, or hold copies of, the title(s) from which we are requesting text. If it is convenient for you, we would very much appreciate it if you could send us samples of text in a format such as plain ASCII text or, alternatively, in XML, SGML, or HTML, with header information giving the journal title, issue number, publication year, and author name(s). As for the choice of material, any 50,000 words from the title (either from one issue or from several) is perfectly acceptable to us. If it is easy for you to do, we would be glad to receive, for example, the first three articles from each of any five issues (this should come to approximately 50,000 words). Of course, if it is too difficult for you to select and send us the text, we are willing to discuss alternative ways of accessing it.

We are concerned that you are asking for permission to use material from many journals, and we may not be able to give permission for all of them. Is it necessary for us to sign the form giving blanket permission for all the journals included?
If you cannot give us permission for some of the journals listed on our permission request, please delete them from the list. It is fine with us if you simply cross out the names of any journals for which you cannot give us permission, and then sign and return the agreement.

Who will be the owner of the Corpus?
PERC will be the IPR holder of the Corpus, though the IPRs of the text samples will remain with the original holders.

How will users access the Corpus, and how will you secure the texts against illegal access or illegal copying?
Access to the Corpus will require a valid user ID and password, and our program will not allow simultaneous access from different computers using the same user ID. The search program we are using does not allow users to copy whole texts.