View on GitHub

Creative Coding and Generative AI

Winter 2025 - Anastasia Salter

Tutorial: Reading Across Texts

This week, we’re going to start exploring the relationship of generative AI to text. To follow along during the demo, choose at least one text to analyze through distant reading, starting with my prompts and working towards developing and iterating your own questions. Depending on the level of access you have to the model you’ve chosen, you might find that you have trouble getting results with a complete text, particularly one of the longer books: keep iterating until you are happy with your results.

AI-Assisted Distant Read

Start by selecting a work from Project Gutenberg (anything other than Frankenstein, as I’m using that here as a sample), and make sure you download the “Plain Text UTF-8” version as a .txt file. For instance, the plain text version of Frankenstein is the file here: TXT. You’ll notice that this plain text version has some noise at the top of the file, and at the end – this is information and metadata added by Project Gutenberg. We could delete that ourselves, but we’re going to try out the model’s preprocessing and have it work with us throughout the entire process. So, download that plain text file for now and have it ready to attach when you’re in conversation with the system.

Here’s a guiding set of basic prompts to try - these are general, and it might require several iterations to get the output of each:

These basic steps will result in errors, but they can also provide some useful rapid visualizations and data. Here’s a few examples from my output - you’ll notice that the charts in some cases mention they are corrected because I had to ask for several iterations:

phrases Figure 1. Frequent bigrams and trigrams

word cloud Figure 2. Word cloud, after iterating stop words

character network Figure 3. Character network, weighting for significance