

The Department of Linguistics' Amir Zeldes is the co-director of Coptic SCRIPTORIUM, a platform for interdisciplinary and computational research in Coptic language texts.
News Story

A New Life for Ancient Texts

July 14, 2015 - Assistant Professor of Computational Linguistics Amir Zeldes has always had an interest in the data of language, in particular the ability to test and verify how language works. Computational linguists specialize in the intersection of language and technology, specifically how computers process “natural” languages, meaning those spoken by human beings.

“Think about how a computer sees language,” Zeldes said. “When you’re saying things in English, or whatever language you’re speaking, how would you expect a computer to understand that?”

Expertise in this field is what led Zeldes to serve as the co-director of Coptic SCRIPTORIUM, an interdisciplinary, collaborative project that focuses on digitizing and sharing texts written in the ancient Egyptian language of Coptic. A direct descendant of the language once written in hieroglyphs, Coptic enjoyed a heyday between the second and tenth centuries before being replaced by Arabic. Many texts from the earliest periods of Christianity (including the Bible) were written in or translated into Coptic.

Today, the study of these texts is extremely valuable to religious scholars, historians, and linguists, among others. Access to such texts, however, has been relatively limited, something that Zeldes and his co-director, Caroline Schroeder (associate professor of religious and classical studies at the University of the Pacific), hope to change.

The two first met in 2012, when Schroeder attended Zeldes’ course during a summer school session at Tufts University. At the time, Schroeder was already working with Coptic. She and Zeldes soon realized that, as partners, they could vastly improve access to and study of Coptic texts.

“She had the expertise in Coptic studies and I had expertise in computational linguistics techniques that hadn’t yet been applied to that language,” Zeldes explained. “It just seemed like something that could really be done now.”

After receiving an initial grant from the National Endowment for the Humanities (NEH), Zeldes and Schroeder focused on accumulating and coding materials, as well as the process of segmenting words into their constituent parts and determining their parts of speech.

“Coptic has a rather challenging system of multiple segments. Like a lot of languages from the Middle East, the things you end up writing together with spaces between them are not exactly what you would call an individual word; you need to split that up even smaller, and you can have a computer program help you with that,” Zeldes explained.

With its initial funding, SCRIPTORIUM also built a Coptic part-of-speech tagger that automatically determines which category a word belongs to, such as noun, verb, or preposition, among many others.
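To make the two steps described above concrete, here is a minimal sketch of splitting a written group into smaller segments and then assigning each segment a category. It is an illustration only, not the project's actual tools: the lexicon, the simplified Latin transliterations, the tag names, and the functions `segment` and `tag` are all assumptions made for this example, and a real system would need a far larger lexicon and a way to resolve ambiguity.

```python
from functools import lru_cache

# Hypothetical toy lexicon: simplified Latin transliterations of a few
# Coptic morphemes, mapped to coarse illustrative tags (not the
# project's real tag set).
LEXICON = {
    "hm": "PREP",    # "in"
    "p": "ART",      # definite article
    "ei": "NOUN",    # "house"
    "a": "AUX",      # past-tense marker
    "f": "PRON",     # "he"
    "sotm": "VERB",  # "hear"
}


def segment(group):
    """Split a written group into lexicon entries, trying longer pieces first.

    Returns a list of segments, or None if no full split exists.
    """
    @lru_cache(maxsize=None)
    def solve(start):
        if start == len(group):
            return ()  # reached the end: nothing left to split
        # Try the longest candidate piece first, then back off.
        for end in range(len(group), start, -1):
            piece = group[start:end]
            if piece in LEXICON:
                rest = solve(end)
                if rest is not None:
                    return (piece,) + rest
        return None  # no lexicon entry fits at this position

    result = solve(0)
    return None if result is None else list(result)


def tag(group):
    """Pair each segment of a group with its tag via dictionary lookup."""
    segments = segment(group)
    if segments is None:
        return None
    return [(s, LEXICON[s]) for s in segments]


if __name__ == "__main__":
    for written_group in ("hmpei", "afsotm"):
        print(written_group, "->", tag(written_group))
```

Here `hmpei` ("in the house") is split into three segments and `afsotm` ("he heard") into another three. Dictionary lookup is the simplest possible tagger; it was chosen for this sketch only because it is easy to follow.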

In May 2015, the project received a second NEH grant, titled “KELLIA” (Koptische/Coptic Electronic Language and Literature International Alliance), that will provide funding for two years. The $192,500 grant is one of six nationally awarded by the NEH/DFG Bilateral Digital Humanities Program.

KELLIA will support improved international coordination of Coptic projects through a collaboration between Coptic SCRIPTORIUM and other partners, including Germany’s University of Göttingen and the University of Münster. Funds from the grant will support efforts in gathering, annotating, sharing, and editing Coptic texts.

On the technical side, one of the challenges is taking existing tools that were designed for English or other mainstream languages and making them work for Coptic.

“What I’d really like to be able to do is understand what makes Coptic difficult in specific ways, and then, by reusing tools that are already available, not make Coptic an exception, but make the rules work for it. As soon as you do that, there’s a flood of other tools that become usable for you,” said Zeldes.

Once published, the project’s website will offer various ways of viewing and interacting with its Coptic texts, including a normalized view with an option to view a translation. Users will also be able to see what a piece of text looked like in the original written manuscript: how it was laid out (columns and lines) as well as the various colors of ink. For those interested in linguistic analysis, there will also be a view that offers part-of-speech analysis.

For Zeldes, the importance of making these texts available goes beyond the technical and linguistic opportunities. The KELLIA project takes its name from an area in the Egyptian desert where monks lived alongside one another in cells, known as “kellia,” as opposed to living in isolation.

“You were supposed to stay in your cell as if you were on a mountain alone, but they did it in a community,” Zeldes explained. “When Christianity started out, there wasn’t the idea that people should live together for the purpose of worship; the idea of a monastery that we now take for granted developed in Egypt in this period. And if you’re interested in learning how that came about, then these texts are what you need.”

Related Information

For news and updates about Coptic SCRIPTORIUM, visit the project’s .
