Elise

About Elise

Elise recently got her BA in Linguistics from McGill, having written her undergraduate thesis about Mi'gmaq possession. She spent the summer working closely with the teachers at the Listuguj Education Centre, and learned a lot of the language! Now she's devoting her time to the various programs run through this blog.

Introducing the Comp/Field Workshop

Coming up in two and a half weeks, McGill will be hosting a computational field (linguistics) workshop.

On May 27th, we’ll have two talks and a wine & cheese reception: iLanguage Lab will be presenting about LingSync, and Alexis Palmer, a postdoc working for SEASIDE, will be giving a talk entitled “Computational Linguistics for Low-Resource Languages.”

May 28th, we’ll have two workshops and two talks: iLanguage will host a session entitled “Plugging in to LingSync” and Alexis Palmer will coordinate one focusing on what exactly one should do with field linguistic data. Our talks will be from Erin Olson (presenting on the Prosodylab’s Forced Aligner and its use in segmental analysis) and Robert Henderson (“Reclaiming SIL Bibles for Linguistic Research”).

More details available in our newly-minted workshop section!

L’nui’sultinej Conference 2013

The L’nui’sultinej Conference is the place to be for Mi’gmaq speakers and learners; as such, we are proud to announce that we are planning on going to Antigonish to talk about our projects and our classes!

As the organizers say on their site:

“L’nui’sultinej na tesipunqek mawita’mk wjit Mi’kmaq Kisna ta’n pasik wen ta’n ketu’ kinu’tmasit l’nui’sin kwlaman kisi klo’tesnu ksitunnu iapjiw. Ula mawita’mk wjit msit wen aqq ta’n pasik wen wlta’sualaten.

L’nui’sultinej is an annual conference which brings together Mi’kmaq people from across the Mi’kmaq Nation to look at education issues related to language preservation, enhancement and revitalization. It is a conference for parents youth, elders, community members, educators, administrators, counselors and researchers. We welcome all Mi’kmaq into the circle whatever their level of language proficiency.”

Mali-Beth, Carol, and I (Elise) will be attending the conference to talk about how linguists and speakers and learners can work together, inside and outside a language classroom. We hope to see you there!

Some ICLDC Presentations

This list quickly sums up a few of the presentations I really enjoyed at ICLDC, in no particular order. The days were very packed, so sadly I only got to see a fraction of the interesting talks that were happening from 9-5:30 every day (in 6 different conference rooms!), but hopefully this gives you a quick idea of the types of great conversations and work that is being done by linguists, language revitalizationists, and language conservationists around the world.

  • The Algonquian Online Interactive Linguistic Atlas (Marie-Odile Junker, Nicole Rosen, Hélène St-Onge, Arok Wolvengrey, Mimie Neacappo)
    • This website maps a lot of different dialects of various Algonquian languages, putting equivalent sentences (for instance, translations of “This is my mother.”) side-by-side on the map. You can choose to use the website in English, French, or without colonial languages altogether.
    • All their technology (using Python and MySQL) is open-source and non-proprietary, so this model could easily be adjusted to show language variation in other language families as well.
  • The documentary linguist as facilitator: The view from Trung (Dulong) (Ross Perlin, University of Bern)
    • As linguists we have to find ways of situating our own roles in communities studying languages that we may not speak.
    • One potential model we can draw from is the literature on being a facilitator, placing “a focus on process and group dynamics, impartiality or neutrality, the evoking of participation, trust and consensus-building, and resource aggregation.”
  • Sharing worlds of knowledge: Research protocols for communities (Andria Wilhelm, Universities of Victoria and of Alberta; Connie Cheecham, Northern Lights School District)
    • Copyright law and other legal measures are generally insufficient when it comes to protecting Indigenous communities, specifically with respect to intangible property like linguistic expertise.
    • It is important for researchers to collaboratively form concrete research protocols with their community/the community they are working with!
    • These protocols may address guidelines for principles of respect, ownership & profit, informed consent, access, fixation, and any other facets are relevant to your work.
  • Developing a regional Master-Apprentice training network in Australia (Gwendolyn Hyslop, Australian National University)
    • Last year, Leanne Hinton and others led workshops for representatives from 31 Indigenous language communities in order to instruct them in the best strategies for engaging in the Master-Apprentice Program.
    • They practiced (among other techniques) non-verbal communication, going through wordless books in the language, listening and repetition, immersion sets, talking about modern items/new vocabulary, games for counting, and puppet play.
    • The goal of these sessions was to “train the trainer” and form a network of MAP groups throughout Australia–it was more popular than anticipated, and they had to run 3 workshops instead of the planned 1!
    • People generally found that language pods, where 3-6 people engage in immersion together, felt more comfortable and natural than the usual MAP pair system of a single speaker and a single learner.
      • We should try these out, too! And it would be great to get Leanne Hinton out for a workshop, no?
  • Developing consistency by consensus: Avoiding fiat in language revitalization (Lance Twitchell, University of Alaska Southeast; James Crippen, University of British Columbia)
    • The Tlingit language has a lot of sounds, to put it lightly. Developing an orthography was a bit of a problem, and for a while there were two separate writing systems. Over time, speakers merged the best features of each into what is known as the ‘email’ orthography to some people, a process that happened gradually, by internal consensus rather than external decree.
    • This presentation said that standards should be violable; mistakes should be okay, since a language is owned by everybody who uses it; it is helpful to standardize aspects of the language until wide usage, not after.

LingSync database glossing conventions

Since we’re getting a lot of work done with the LingSync application* from the programming side of things, we decided it was high time we start using it the way it was intended to be used; as a database making it easier for us to share data and collaborate.

As we’ve started putting in data, we’ve been laying down some conventions for us to follow in the rest of the database. This blog seemed like a good place to discuss the conventions we’ve established, and an even better place to debate new conventions for areas we haven’t fully fleshed out yet (verbs…).

All of this information and more is stored also on the specific wiki page for LingSync glossing: http://wiki.migmaq.org/index.php?title=LingSync_Glosses

Our general guiding principles are as follow:

  • Gloss everything–no defaults!
    • This is mainly to make search easier and more intuitive. If we had, for instance, “animate” as the understood default person and only glossed inanimate morphology as such, it would be very difficult to get a datalist of all animate words. Having no default glossing means that all our glosses will be very explicit and therefore easy to search.
    • This will be painful at the start, but once we have enough data in there, LingSync will autogloss and make our lives much easier! Hang in there.
  • When in doubt, don’t parse it out!
    • Only separate morphemes if you and a collaborator are completely, 100% sure that they are separable. Make sure you pass your theory by someone else’s eyes first, too!
    • Feel free to use dots frequently in your glosses. It is safer, generally speaking, to group morphemes (and later split them up) than it is to be over-enthusiastic about splitting them up (and later having to go back and re-group).
  • In general, be faithful to the surface/pronounced form when drawing morpheme boundaries. (ie match the morpheme line to the utterance line as closely as possible)
    • Please use the Notes section to leave comments about phonology if you think there is a predictable process going on!
    • (One exception is the palatalization of ‘t’ at morpheme boundaries. Throughout LingSync we will assume that t -> j / _-i, so it is safe to have the utterance and morpheme lines different here.)

As far as specifics go, there is more information on the wiki page itself (WordPress hyperlinks seem broken, here’s the address again http://wiki.migmaq.org/index.php?title=LingSync_Glosses ).

Please use the comments to discuss…

  • Verbs! Since we are glossing with the maximal amount of information, including tense and mood (ie. present indicative), where should we put this information? So far we’ve been sticking it onto the end of the root using dots (ie. tli’ma-tis = tell.TA.PRES.IND-1SG>2SG). Any other suggestions for the placement of tense/aspect/mood?
  • Verbs part 2! How should we identify the difference between various evidentialities? And what about tense, aspect, mood? Right now we’re marking present indicative, imperative, future, evidential past, inferentialiOpen discussion in comments below!

*(The LingSync extension can be found here: https://chrome.google.com/webstore/detail/lingsync/ocmdknddgpmjngkhcbcofoogkommjfoj )

Mid-November Update

It’s been pretty quiet on the blog for the last little while, but that’s not because we’re working less on the project! We’ve been working a whole lot on a big aspect of our Mi’gmaq Research Partnership: specifically, trying to get money.

This past month, we’ve been working on our application for a grant. It’s allowed us to get a good look at the different strengths and goals of this project, and we’ve begun to get very excited about the coming work! Turns out, we’re doing good work, and there are a whole lot of interesting areas for our project to grow and explore.

  • LingSync application/Chrome extension. This is a really exciting application (at least, exciting for linguists) which will let us store, organise, and selectively share our language data (with encryption on all of it for the sake of our speakers’ privacy). Right now, we’re working with iLanguage on building on the existing code. On our to-do list is…
    • Making a Conversation! LingSync expects people to enter data as single lines, unconnected to each other. What we’d like to do is add another level, which lets users add conversations, dialogues, and discourse so that the separate lines of data are linked to each other just like speakers connect to each other in real life.
  • Mobile language-learning app. We’ve got (very rough) Android and Chrome extension prototypes for this already, and have a lot of ideas about where it’s going to go from here. The parts that we want to focus on the most are…
    •  its online/offline abilities, since a lot of language-learning apps work best online and have very limited offline capabilities
    • its tool for also letting students build their own lessons! Most language lessons are “read-only”, and students are expected to consume what is put on their app and be satisfied with that. But what we want to do is let students create material with speakers in their lives, and tailor their app to their own needs.
  • Master-Apprentice program. This is my favourite part of this project right now. It’s a wonderful opportunity for learners of Mi’gmaq to get some support and guidance, and partner one-on-one with fluent speakers. Again, get in touch with Vicky or look at an older blog post for more details about it!
One last thing we are looking forward to: two of our members (Carol and me, Elise) are lucky to say that they’re going to Hawai’i in late February for the 3rd International Conference on Language Documentation and Conservation! We’ll be presenting a poster version of the talk that we gave with Mary-Beth Wysote and Sarah Vicaire for the Algonquian Conference last month, based on the slides here.

Some website changes

There’ve been a couple small changes to this site–

1) anybody can now post a comment, but if they aren’t logged in then they have to do a CAPTCHA to be able to post. If you are logged in but the CAPTCHA is appearing anyway, please let me (Elise) know, and I can change the settings again to try to fix that.

2) only people with accounts can edit the wiki. To get an account, just email wiki@migmaq.org , and Gretchen will set you up with one!

3) we’re making a name for ourselves! Choose the name you like best, or suggest something new in the comments below.

ETA: The votes are in! This post declares us the Mi’gmaq Research Partnership, barring any strong objections.

What should we call ourselves?

CAN-8 Update

Hello, folks! Reporting from Listuguj here, where there has been a lot of work done towards setting up CAN-8 for real live users. We’ve been focussing on

  • making connections with (potential) users of the program
  • establishing a work-flow that works for people involved in the project (more on this below), and
  • recording and uploading content

We’ll start off talking about that first point, making connections. We’re hoping to create a CAN-8 that is geared towards the people using it. Part of this is listening the input of anybody who has an opinion to give. Vicky Metallic has been getting the word out to people outside the Education Complex that the CAN-8 project is happening, and that another tool for Mi’gmaq-learning will be added to the considerable resources that the community already has at its fingertips. We’ve all been talking to people, and in doing so, we’ve also been taking the advice of the people who have seen the program. We’ll use their input to select scenarios for dialogue-making, so that our content is relevant to the lives of the people who are using it.

Next up, the question of “how do the dialogues get made, anyway”? The fine details of this process are still being hammered out, but we have a general outline that works well for us! We write scripts in part-English, part-Mi’gmaq, then Joe Wilmot and his trusty team work with us to tweak the scripts so they flow like real Mi’gmaq. Once the scripts are speaker-worthy, Joe et al record them. The resulting sound files and Mi’gmaq scripts go into CAN-8, so far by my hands, but hopefully this will become a Montreal job soon. And later, we will write up helpful meta-linguistic notes which also go into CAN-8 for the users’ enjoyment.

There are two kinds of dialogue: implicit teaching ones, and daily life ones. These two categories do overlap quite a bit sometimes, but they aren’t one and the same!

For the implicit teaching dialogues:

  • Conor and I work on establishing corners of the grammar that might be tricky for learners to pick up directly from speakers–there is a lot that people can learn from the speakers around them, but we’d like to give people some scaffolding to base that knowledge on, and make speaker-knowledge more accessible to more people.
  • Once these corners are picked, we then use a theme to construct a little scene. One starts with a boy telling a girl, “I love you!” and she replies, “You love me, but I don’t love you.”–they talk to each other, and explain the predicament to others, and (spoilers!) he eventually wins her over. The aim of this dialogue is to demonstrate TA morphology in a (hopefully) fun and funny way!
  • Lastly, we give an overt, meta-linguistic description of the phenomenon in question. We also encourage the CAN-8 learner to go back to the dialogue they just heard with these meta-linguistic notes in mind, and will provide a CAN-8 space to experiment with the forms on their own.

The daily life dialogues are a little different. We don’t have many of them, but we’ll collect more as the summer goes on!

  • Two (or maybe three) speakers record a mostly-improvised chat about topics that are likely to come up. One chat that we have already uploaded features a phone call between two speakers, where one invites the other to go across the river for some shopping.
  • We then go over the recording with the speakers in question, and transcribe the exchange. This transcription gives CAN-8 users a chance to familiarize themselves with the orthography we’re using, as well as something to hang on to (as this speech tends to be a little faster than the scripted dialogue speech).
  • The dialogues are then uploaded! The advantage of these improvised chats is that they are less artificial than the scripts, and will give the CAN-8 user a window into natural speech that they are likely to hear (and eventually produce) with the speakers in their lives.

We’ve also uploaded the rhythmic profiles that we’ve talked about in a previous post, and Roger Metallic is working on Mi’gmaq-izing some of the Akwesasne verb conjugations and integrating them into the Listuguj CAN-8, too. There’s a lot going on, here!

And, as always, please chat with us in comments!

  • What do you think of our plan and strategy? What are the things that we’re missing?
  • Do you have any suggestions for material to cover?
  • Do you have suggestions for new methods to use to show linguistic patterns?
  • What have been your favourite bits about self-directed language-learning programs in the past? Should our CAN-8 program have them, too?

Canadian Indigenous Language and Literacy Development Institute

I had an interesting conversation today, with a professor of education from the University of Alberta, specializing in language and literacy learning (okay, it was my mother). She mentioned a program that the university runs, called the Canadian Indigenous Language and Literacy Development Institute. It is an annual program aiming to help  ”First Peoples speakers and educators in endangered language documentation, linguistics, language acquisition, second language teaching methodologies, curriculum development, and language-related research and policy-making.” Sounds good to me!

Has anybody had any experience with CILLDI? Does it sound like something we’d be interested in?

Akwesasne trip

Yesterday, we (the CAN-8 group) took a small yellow car on a road trip to Akwesasne to visit Karen Mitchell, who has been running a magnificent program on the Mohawk language at the Akwesasne Economic Development Agency. We learned so much from the trip, and were simultaneously inspired and a little bit daunted! This (long) post is about some of what we learned while we were there, and how we’re hoping to use it for building a Mi’gmaq CAN-8 program.

The Mohawk CAN-8 program has immense breadth and depth (it has been developing for over five years), and there is no way to accomplish that much in the course of this summer. However, there were certain strategies and tools that seemed very adaptable to what we’re doing with our Mi’gmaq CAN-8!

First, there was a section about Mohawk writing, and the possible Mohawk syllables. Since Mohawk (unlike English and French) makes heavy use of a distinction between what we would write as /a/, /ah/, and /aʔ/, the students were introduced to the contrast between them. They heard each syllable /ma/, /mah/, /maʔ/ and saw them written on the screen in Mohawk orthography. How we’re talking about this for Mi’gmaq

  • This section went through every possible Mohawk syllable, but we’re thinking that covering every possible syllable might not be necessary for us. The main advantage of this for Mi’gmaq might be in helping show the long/short/schwa vowel differences. We can record individual instances of the vowels, show the spelling (just enough to familiarize the student), and then record some minimal pair words to highlight how the difference shows up in speech.
  • A helpful and perhaps even crucial followup to that is to give learners sets of rhythm patterns/profiles: pairs and sets of combinations of each of the three degrees of vowel length in the language: regular, extra-short (= schwa), and extra-long.  That is, V, ‘, and V’. Hence we want to demonstrate rhythmic profiles like these—V-V, ‘-V, V-’, V-V’, V’-V, ‘-V’, V’-’ (etc.)—in the form of actual words that model them.  From there, as learners are presented with new words, they can compare them to the rhythmic profiles of each the model words they’ve already mastered, pick out the one best match, and so basically get the longs, shorts, and schwas of the whole new word automatically and accurately.  It is also necessary to include coda consonants: geminates and clusters, as these too have a distinct effect on the rhythmic profile of the word.

Second, each vocabulary word/phrase was presented in four ways (with a supporting element). 1) the student clicks on the English approximation of the word, 2) there is a picture indicating the meaning, and 3) the Mohawk word written out syllable by syllable and then as a unit, 4) the student hears a recording of the Mohawk word, syllable by syllable and then as a unit. As the recorded speaker says each syllable, the written Mohawk on the screen is highlighted to align with the recording (i.e. a less bouncy version of “follow the bouncing ball”). A supporting structure in the system is that the words/phrases were in Question/Answer format: The CAN-8 user, after clicking on the vocab word, would hear a questions such as “what animal is this?” or “how do you feel today?” and respond using the new phrase like a mini-dialogue. We like this because it takes vocabulary out of a “vocabulary/phrase list”-based approach and situates it in real interactive language use. How we’re talking about this for Mi’gmaq

  • This was a fantastic format that covered a lot of ground! For citation style, it’s hard to think of a better-rounded way to show it. We’re discussing the idea of also using video, and also talking about eliminating solo words altogether, and using mainly phrases in context, as a vocabulary-teaching tool.
  • We’ll also have semi-scripted dialogues, which is material for another post.

Third, they had a thematic dictionary. It was a very well-organized reference, and showed vocabulary in a nice, usable way. How we’re talking about this for Mi’gmaq

  • Obviously, building a dictionary is a huge project. We’re hoping to lay the groundwork for the whole thing, building a skeleton for the future and keeping track of the phrases that we are entering for the individual lessons. Actual input for the summer will prioritize thematic contexts where the language is currently likely to be used–school vocabulary and some topics relevant to speaking with older relatives (suggestions welcome). Later work will then branch out to topics where the language isn’t presently as common.
  • We already have a great resource for Mi’gmaq words! The Mi’kmaq Online Talking Dictionary has a substantial quantity of vocabulary, all of which is not only illustrated with sentence-level usages but also provides sound files of both. Maybe in the future, we can look at how this alphabetically-arranged resource might be cross-linked into a thematically-arranged presentation, and if possible filed in the CAN-8 system, too.

There was more, too, but this is already quite a long post! We’d like to hear what you think!

  • What themes should we work on?
  • Do these three features sound good to you? What are the problems with them?
  • What are your thoughts for useful dialogues for a CAN-8 user to hear? Again, there will be a post just for dialogues coming up later, but please start talking about it now, if you like.