Gear-News, November 2011

Finished prospectus for the dissertation and now seeking signatures. Learning the Python programming language. Teaching one literature class. Very busy with robotics work, husbandhood, and fatherhood.


Wednesday, September 7, 2011

Johanna Drucker

Humanist Computing at the End of the Individual Voice (TILTS 2011)


Tuesday, August 16, 2011

The Hidden Meaning of Pronouns


Scientific American has a sensationally titled interview with psychologist James Pennebaker on language use.

The entire interview is worth reading, but the following bit should interest anyone with a bent toward computational analysis of language:
Historians and biographers should jump on this new technology. The recent release of the Google Books Project should be required reading for everyone in the humanities. For the first time in the history of the world, there are methods by which to analyze tremendously large and complex written works by authors from all over the world going back centuries. We can begin to see how thinking, emotional expression, and social relations evolve as a function of world-wide events. The possibilities are breathtaking.

In my own work, we have analyzed the collected works of poets, playwrights, and novelists going back to the 1500s to see how their writing changed as they got older. We’ve compared the pronoun use of suicidal versus non-suicidal poets. Basically, poets who eventually commit suicide use I-words more than non-suicidal poets.

The analysis of language style can also serve as a psychological window into authors and their relationships. We have analyzed the poetry of Elizabeth Barrett and Robert Browning and compared it with the history of their marriage. Same thing with Ted Hughes and Sylvia Plath. Using a method we call Language Style Matching, we can isolate changes in the couples’ relationships.
h/t Kim Salazar

Friday, August 5, 2011

Three Questions to Ask about Hidden Markov Models



Continuing through Chapter 9 of Foundations of Statistical Natural Language Processing (1999) by Christopher D. Manning and Hinrich Schütze.

Manning and Schütze describe three fundamental questions to ask about Hidden Markov Models (HMMs):
  1. Given a certain HMM (knowing its state transition probabilities, symbol emission probabilities, and initial state probabilities), how do we efficiently compute how likely a certain observation is?
  2. Given an observation sequence and a certain HMM, how do we choose a state sequence that best explains the observations?
  3. Given an observation sequence and a space of possible HMMs found by varying the model parameters (e.g., state transition probabilities, symbol emission probabilities, and initial state probabilities), how do we find the HMM that best explains the observed data?
Future posts will address how to approach these questions.

* * * * *

In other news, I've intermittently been playing with Python and the Natural Language Processing Toolkit.

Monday, July 11, 2011

The Return of Pierre Menard


In the guise of Sean "Puffy" Combs. This is a rather old item from my favorite news source, The Onion:
NEW YORK—Noted rapper/producer Sean "Puffy" Combs released his hotly anticipated new single Tuesday, "Tha Kidd (Is Not My Son)," which samples Michael Jackson's 1983 smash "Billie Jean" in its entirety and adds nothing. "When I was in the studio mixing and recording, I decided 'Tha Kidd' would work best if I kept all the music and vocals from the original version and then didn't rap over it," Combs said. "So what I did is put in a tape with 'Billie Jean' on it, and then I hit record. The thing turned out great." Combs' current number-one hit, "Eye Of The Tiger," is dedicated to slain rapper Notorious B.I.G.

Friday, July 1, 2011

Inspiration



No kidding. Rajesh Rao's work on the Indus script is the direct inspiration for what I seek to do in my dissertation on Old English texts.

Monday, June 6, 2011

Python for Newbies


The Python folks have a page with some excellent links for non-programmers. I've listed the ones that most interest me. Notice the links for children at the end:
  • How to Think Like a Computer Scientist - 2nd edition Allen Downey's open source textbook has a Python version, written with Jeff Elkner. It's also available in book form. It was updated and current version is 2nd edition.

  • The Programming Historian From the "About This Book" page: "This book is a tutorial-style introduction to programming for practicing historians. We assume that you're starting out with no prior programming experience and only a basic understanding of computers. More experience, of course, won't hurt. Once you know how to program, you will find it relatively easy to learn new programming languages and techniques, and to apply what you know in unfamiliar situations."

  • Learning to Program An introduction to programming for those who have never programmed before, by Alan Gauld. It introduces several programming languages but has a strong emphasis on Python.

  • A Byte of Python, by Swaroop C.H., is also an introductory text for people with no previous programming experience.

  • One Day of IDLE Toying A very gentle introduction to the IDLE development environment that comes with Python. This tutorial by Danny Yoo has been translated into nine different languages.

  • Free Python video lectures are also available as a course titled Intro to programming with Python and Tkinter, Unix users can view the video using mplayer once you have downloaded the files. Windows users will need to have a DivX player, available from http://www.divx.com/divx/windows/. (One user reports success viewing the videos on OS X 10.4 using the VLC player -- http://www.videolan.org/).

  • A Non-Programmer's Tutorial for Python 3 on Wikibooks.

  • Learning Python (for the complete nOOb) by Derrick Wolters. A beginner's tutorial to learn how to program in Python.

  • Beginning Python for Bioinformatics by Patrick O'Brien. An introduction to Python aimed at biologists that introduces the PyCrust shell and Python's basic data types.

  • Two courses from the Pasteur Institute are aimed at biologists but are useful to anyone wanting to learn Python. Both tutorials are quite extensive, covering data types, object-oriented programming, files, and even design patterns.
  • Python Tutorial This tutorial is part of Python's documentation set and is updated with each new release. It's not written with non-programmers in mind, but skimming through it will give you an idea of the language's flavor and style.

  • Invent Your Own Computer Games with Python, 2nd Ed, by Al Sweigart is a free e-Book that teaches complete beginners how to program by making games.

  • LiveWires A set of Python lessons used during 1999, 2000, 2001 and 2002 children's summer camps in Britain by Richard Crook, Gareth McCaughan, Mark White, and Rhodri James. Aimed at children 12-15 years old.

  • Guido van Robot A teaching tool in which students write simple programs using a Python-like language to control a simulated robot. Field-tested at Yorktown High School, the project includes a lesson plan.

  • PythonTurtle A learning environment for Python suitable for beginners and children, inspired by Logo. Geared mainly towards children, but known to be successful with adults as well.

My favorite resource right now is a video series by Bucky Roberts.

Sunday, June 5, 2011

Why I'll Use Python to Set Up My Markov Program


I am not a computer programmer. The best I can do is some HTML tagging and a few macros in Microsoft Word and Excel.

However, to run the planned (or, to-be-planned) Markov simulation for my dissertation project, I will need to use some programming language. Python will be that language.

I chose Python for several reasons:
  • It's free and well documented.
  • It runs everywhere.
  • It's widely used.
  • It's well supported.
I am using two books as primary resources on programming in Python:
  1. Campbell, Jennifer, Paul Gries, Jason Montojo, and Greg Wilson. Practical Programming: An Introduction to Computer Science using Python. The Pragmatic Bookshelf, 2009. Print.
  2. Bird, Steven, Ewan Klein, and Edward Loper. Natural Language Processing with Python. Beijing: O'Reilly Media, 2009. Print.