Skip to content
integrated computational thinking

LA Practice A2: Identify patterns in texts by abstracting textual data (Activity)

< Back to Building Block
Step 1 of 6: Introduction

In literature, patterns allow us to see relationships between characters, predict future plot points, and connect a particular story to another. Seeing these patterns and connections enriches our reading experience from one story to the next.

A dataset can be found in the text of any given work. Any piece of literature amounts to a text file (digital or print) of the author’s words. A digital copy of Rudolpho Anaya’s Bless Me, Ultima can be considered unstructured data since it is written for humans to comprehend but would be difficult for a computer to process the meaning of the text. However, a spreadsheet of all the bird sightings in a city with time, date, neighborhood, etc would be considered structured data since a computer can easily understand the relevance of the data.

Although unstructured data was not designed to be processed by a computer it can be presented as a structured data set. For instance, you could take the first chapter of any novel and create a spreadsheet in which each column represents a character and each row contains nouns and adjectives used in chapter 1 to describe that character. With this data, we could make predictions about the role a character will play in the novel based on the frequency of certain words they use.

For Example: This Chart Shows the Most Common Nouns in Emily Dickinson poems. What inferences and assumptions do you make about the author when you see this data presented in this way:

[commonnounsEDpoem.png; alt: terms: day, sun, death, heaven, life; the count of each term respectively is 270, 189, 188, 167, 159]

What is the purpose of this exercise?

By taking unstructured data from a text and making it structured, you haven't changed the story in any way but it may change the way you interpret that story. While completing this project, consider how your students could predict and review stories in a whole new way if introduced to this type of activity.