Assignment 1 Guidelines
Reading Like a Computer
Assignment 1: Text Analysis with a Computational Tool
Overview
In this assignment, you will select a text corpus and use a computational text analysis tool to explore it, generate insights, and communicate your findings to an audience.
Due Date: Friday, May 1, 2026
Submission Format: Web-based (on your course website)
Length: ~1,500 words + visualizations
Value: 15% of your final grade
Learning Objectives
By completing this assignment, you will:
- Apply a computational text analysis tool to a real-world corpus
- Generate and interpret visualizations and quantitative results
- Ground computational findings in humanistic analysis
- Communicate technical work to a non-specialist audience
- Critically evaluate the affordances and limitations of computational methods
Assignment Description
Part 1: Selection and Framing (300 words)
Choose a corpus of texts that interests you and articulate a research question or analytical goal. Your corpus could be:
Suggested Corpora:
- A collection of literary texts (e.g., short stories, poems, novels by a single author)
- News articles on a particular topic or from a specific publication
- Social media posts, tweets, or blog posts
- Historical documents or letters
- Song lyrics, movie scripts, or other cultural texts
- Your own dataset (with instructor approval)
Your Framing Should:
- Describe the corpus (what texts? how many? what time period?)
- Explain why you chose this corpus
- Articulate 2-3 research questions you want to explore
- Indicate what you hope to discover through computational analysis
- Explain why a computational approach is useful for your questions
Part 2: Computational Analysis (600 words)
Conduct your analysis using one or more of the tools we’ve explored in class:
Tool Options:
- Voyant Tools — Web-based visualization and analysis
- AntConc — Corpus analysis software
- R/RStudio — Statistical text analysis
- Python — With libraries like NLTK or spaCy
- Other tools (with instructor approval)
Your Analysis Should:
- Clearly explain which tool(s) you used and why
- Describe the analysis process you followed
- Present 3-5 key findings or patterns you discovered
- Include 3-5 visualizations (screenshots, graphs, word clouds, etc.)
- Explain what each visualization shows and what it means
Examples of Analysis Approaches:
- Frequency analysis: Which words appear most often? How do frequencies change over time?
- Comparative analysis: How do different texts or corpora differ in their word use?
- Pattern finding: What clusters or groupings emerge from the data?
- Sentiment analysis: What emotional tones or attitudes appear in the texts?
- Stylistic analysis: How do different authors or texts differ stylistically?
Part 3: Critical Interpretation (500 words)
Reflect on what your computational analysis reveals and its limitations.
Address:
- What do your results suggest about the corpus? What is interesting or surprising?
- How do these findings relate to your research questions?
- What humanistic knowledge or context helps interpret these results?
- What are the limitations of your analysis? (corpus bias, tool limitations, etc.)
- What biases or assumptions might be embedded in the tool you used?
- What questions remain unanswered? What would you explore next?
Tone: Thoughtful, critical, and honest about what you don’t know
Format and Submission
Publication
- Post your analysis on your course website
- Create a clear title and include the date
- Make sure all images and links work correctly
- Include a link from your main course site index
Visual Elements
- Include 3-5 visualizations from your analysis
- Caption each visualization clearly
- Reference visualizations in your written text
- Consider layout and readability
Structure
- Title and introductory paragraph
- Section 1: Corpus and Research Questions
- Section 2: Analysis and Findings
- Section 3: Critical Reflection
- Visualization gallery or integrated throughout
Citation
Cite any sources you reference (readings, tutorials, data sources, tools).
Assessment Rubric
See the Rubrics page for detailed evaluation criteria.
Key Areas Evaluated:
- Method selection and application (15%)
- Research question and design (15%)
- Analysis and interpretation (25%)
- Critical evaluation of limitations (15%)
- Visual communication (15%)
- Writing quality and organization (15%)
Resources and Support
Tools and Tutorials
- Voyant Tools Guide
- AntConc Tutorial
- R for Text Analysis
- Course materials and recorded tutorials
Text Sources
- Project Gutenberg — 70,000+ free ebooks
- JSTOR Daily — Recent articles
- Reddit Data — Historical Reddit data
- Kaggle Datasets — Diverse datasets
- Text Analysis News Archive — News texts
Getting Help
- Attend office hours to discuss corpus selection and analysis approach
- Bring questions about tool use to class sessions
- Review exemplary assignments from previous semesters
- Participate in peer discussions and feedback sessions
Common Questions
Q: Can I use a corpus I found online?
A: Yes, as long as it’s legally and ethically available. Avoid copyright-protected texts without permission.
Q: What if the tool doesn’t work as I expected?
A: That’s valuable information! Document what happened, explain what you learned, and discuss it in your critical section. Tool challenges are learning opportunities.
Q: Can I use a tool that’s not on the suggested list?
A: Yes, but get instructor approval first. Bring an example or link to class.
Q: How many visualizations do I need?
A: At least 3, no more than 5. Quality over quantity.
Q: Can I work with a partner?
A: No, assignments must be individual work. You may discuss ideas, but writing and analysis must be your own.
Timeline Suggestions
- Week of April 7: Select corpus; articulate research questions
- Week of April 14: Begin exploratory analysis; generate initial visualizations
- Week of April 21-28: Refine analysis; experiment with different approaches; gather final visualizations
- Week of April 28-May 1: Write critical reflection; revise and polish; post to website