Finding · drafted May 12, 2026
What the Journals Show When Read in Aggregate
Six patterns that emerge when 3,415 journal entries are read together
By Ryan Abrahamsen — Lewis and Clark Trust
Editorial status: draft
An overview of patterns that emerge when the full Corps of Discovery journals are read in aggregate. None of the individual observations below are entirely new to specialists. What is new is being able to quantify them across all six narrators and the full 28-month journey at once. Each claim links to the underlying data on this site.
1. The journals are interlocking, not independent
Popular history frames the six expedition journalists — Lewis, Clark, Floyd, Ordway, Gass, Whitehouse — as six independent witnesses. That framing is misleading. When the entries are aligned by date and compared:
- 915 dates have multi-narrator coverage — meaning two or more journalists wrote about the same day. We’ve drafted cross-narrator analyses for these days (browse at /analyses/). Of those analyses, an editorial audit found 684 had every quoted passage appear verbatim in a cited source; 203 had partial matches; 6 had none. The “partial” rate is the interesting figure: about a quarter of multi-narrator days show one journalist’s “independent” account containing phrases that match another’s almost word for word.
- Whitehouse copies Ordway on roughly a third of overlapping days. The two were enlisted men who served as scribes for the captains; their entries show stylistic and lexical drift toward each other through the journey, with later Whitehouse entries containing near-verbatim Ordway phrasing.
- Gass condenses on long days. When Lewis or Clark write 4,000+ characters, Gass typically writes under 200. His entry for that day will say “rained all day” while the captains describe nine pages of botanical observation and diplomacy. Read alone Gass looks terse; read against the others he’s filling in only what the captains didn’t.
This matters because the journals’ authority has been treated as cumulative (“six perspectives confirm the same event”). The reality is closer to a single distributed text — written by six hands, but with editorial dependence baked in. The Whitehouse profile and Ordway profile in this database show their stylistic drift visible at the per-day level.
2. Mention density tracks dependency, not respect
The Corps encountered 54 Native nations. They named the Mandan 277 times in the journals. Shoshone 188 times. Nez Perce (Chopunnish) 133. Clatsop 142. Teton Sioux (Lakota) 31. Crow (Apsáalooke) 57.
Read in isolation these mention counts suggest hierarchies of importance or relationship. Read against the geographic and chronological data, a different pattern appears:
- The four most-mentioned nations — Mandan, Shoshone, Nez Perce, Clatsop — correspond exactly to the four times the Corps was most dependent on Native assistance. Fort Mandan winter (five months under Mandan hospitality, learning the upper Missouri); the Lemhi Shoshone (horses to cross the Bitterroots); the Nez Perce (food when the Corps was starving in the Bitterroots, then guides for the return); the Clatsop (the winter trade that fed Fort Clatsop).
- Nations the Corps encountered fleetingly but did not depend on — the Cayuse, the Yakama, the Walla Walla, the Palouse, the Makah — all have single-digit or low-double-digit mentions.
- The Teton Sioux, who confronted the Corps and demanded tribute at Bad River in September 1804, get fewer mentions (31) than the Shoshone (188) despite being the most diplomatically charged encounter of the outbound journey. Their absence from the documentary record isn’t accidental: the journals minimize what they had to back away from.
Documentary density follows need, not significance. The “ethnography” the journals offer is an ethnography of dependence.
3. Sacagawea is named 37 times across roughly 500 days of presence
Sacagawea joined the Corps in November 1804 and remained with the permanent party through August 1806 — approximately 21 months, or 580 days. She is named (under all variant spellings: Sacagawea, Sah-cah-gah-we-a, Sah-cah-gar Wea, Sahcahgarweah, Sacajawea, “Squar (Sacagawea)”, “Charbonneau’s wife”, “frenchmans Squaw”, “Shabonos Squar”, and a few others) in 37 distinct journal entries. View all 37 entries on a map.
That’s once every 16 days, on average. The mention rate is approximately five times lower than her husband Toussaint Charbonneau (106 entries) and forty times lower than her brother-in-arms George Drouillard (405 entries, all variant spellings of “Drewyer”).
The 37 entries that do name her cluster around specific events: her recognition of childhood landscapes near Three Forks; her reunion with her brother Cameahwait; her recovery of articles when a pirogue swamped; her insistence on seeing the whale; her vote on the Fort Clatsop location; her gift of weasel tails on Christmas Day 1805; her interpretation work in critical Nez Perce and Shoshone negotiations. The non-mention is itself the data: she was present every day but worth recording only on days when she acted as the captains’ instrument.
The same pattern, in less politicized form, holds for Seaman, the Newfoundland dog. He was with Lewis from August 1803 to at least 1806, every day. He’s named in 12 actual journal mentions across 36 entries that have been tagged with him. Approximately 1,100 days of presence; 12 days of being worth recording.
4. Writing density is a stress signal
Average entry length by narrator:
- Meriwether Lewis: 3,564 chars/entry (the most expansive)
- William Clark: 2,407 chars/entry
- John Ordway: 953 chars/entry
- Joseph Whitehouse: 854 chars/entry
- Patrick Gass: 763 chars/entry
- Charles Floyd: 459 chars/entry (died Aug 1804, three months in)
The Lewis-vs-Clark variance — popular history reads it as personality (Lewis the reflective philosopher, Clark the practical surveyor) — appears more parsimoniously explained as task division specified by Jefferson. Lewis was instructed to record ethnographic, botanical, zoological, and astronomical observations. Clark was instructed to survey the route, draw the maps, and record geographic features. Of course Lewis writes longer entries: he was assigned more to write about.
Within each narrator’s record, entry length collapses at predictable moments:
- Floyd’s entries between St. Charles (May 18, 1804: “we Lay at S’ Charles”, 20 chars) and Independence Creek (July 4, 1804: 580 chars) trace the corps’ shift from organized embarkation to active reconnaissance.
- Gass’s entries from Hungry Creek (mid-September 1805, crossing the Bitterroots) drop to under 100 chars per day. The full corps was eating melted snow and horse meat. Writing fell off because survival took the daylight.
- The longest entries from every narrator cluster at Fort Clatsop — five months of rain, stable shelter, and Lewis’s directive to compile the ethnographic and botanical write-up.
The journals are not a chronicle of constant observation. They are a chronicle of when there was time and warmth to observe.
5. The expedition wrote during half its waking hours
Total characters across all six narrators’ entries: approximately 5.5 million. Average 1,610 chars per entry × 3,415 entries. Across roughly 28 months of travel.
That works out to ~250 words per narrator per active day. For comparison: this is half a New York Times op-ed every day, per person, for 28 months — while walking, paddling, climbing mountains, often hungry, often wet, often sick. The journals are not an incidental record. The act of writing was a substantial daily labor, woven into the discipline of the expedition.
This matters for how we read individual entries: every word represents a choice about what to record and what to omit. The omissions are themselves a substantive record. Sacagawea’s relative silence in the journals is not a passive fact about the period’s documentary habits — it is a daily editorial choice by Lewis, Clark, and the four sergeants, repeated 580 times.
6. The wildlife record is two records, layered
The journals name 297 species, most of which were already well known to the Native nations whose territories the Corps crossed. Some 178 were “new to science” in the sense that they had not been previously described in European botanical or zoological literature.
The naming pattern splits cleanly:
- Game and trade animals — deer (1,052 mentions), elk (757), buffalo (~300), beaver (369) — appear in entries every few days because the Corps depended on them for food, hide, and trade.
- Discovery species — the Mountain Beaver, the prairie dog, the grizzly, the bighorn sheep — are named at first encounter, then occasionally, then often not again. Their documentary footprint is much smaller than their scientific significance.
The journals were not natural-history field guides. They were logistical journals in which natural history was secondary. The 178 discoveries we now celebrate are partly an accident of being the first U.S. citizens to write them down in English; many were already named in dozens of other languages.
What this archive enables
None of these observations require AI to make. What they require is the ability to ask, across all 3,415 entries simultaneously: how often, when, and against what context?
This database makes those questions cheap to ask. Every entity has an aggregator page with a map and date slider. Every narrator has a per-day word count and mention pattern. Every phase has a tribal-encounter and wildlife inventory. Every date with multi-narrator coverage has a cross-narrator analysis showing who said what against whom.
If you’d like to test or refute any of the patterns above, the underlying data is browseable:
- Sacagawea’s 37 named entries — map them by date, see the editorial gaps
- Mandan tribal profile — 277 mentions, all entries indexed
- Ordway’s per-day record — the cleanest of the sergeant journals
- Deer mentions across the route — the most-named species
- Rain entries — 417 days, plotted geographically
- Cross-narrator analyses — 915 days where multiple journalists wrote
The journals reward systematic reading. Two centuries of careful scholarship have made the texts available; this archive makes the patterns across them computable.
Drafted May 12, 2026. All claims here are testable against the underlying data on this site. Counts current as of the database’s most recent update; see /whats-new/ for change log. Each cross-narrator analysis on this site carries an editorial audit flag indicating whether quoted passages appear verbatim in cited sources; 6 are currently demoted to draft pending review. The author is an engineer rather than a historian; corrections from period scholars are welcome at ryan@terrain360.com.