Researchers need structured, reproducible data
If you study social movements, political participation, protest behaviour, or public gatherings, you know the frustration: reliable attendance data barely exists.
Most historical protest records rely on:
- Newspaper estimates (often wrong, sometimes politically motivated)
- Organiser claims (usually optimistic)
- Police counts (usually conservative)
- A handful of well-documented academic case studies
None of these is systematic. All of them are subject to bias and interpretation. Cross-study comparison is nearly impossible.
Rally Tally offers researchers something new: structured, reproducible, non-political attendance data recorded directly by participants.
Why historians and researchers struggle with attendance data
Academic research on protest movements, political participation, and social change all depend on knowing how many people participated. But good attendance data is scarce because:
- Historical records are inconsistent. Older events have no systematic records at all; only newspaper coverage, which varied widely in rigor.
- Sources are biased. Organisers and police have opposing incentives. Neither produces neutral estimates.
- Methods are not documented. Even when a number survives, it is rarely accompanied by an explanation of how it was reached.
- Comparability is difficult. One event's estimate was made by photo analysis. Another's came from checkpoint counting. You cannot compare them directly.
The result is that protest databases built by researchers often come with large margins of uncertainty, and cross-national or cross-temporal comparisons are fraught.
What structured attendance data enables
When attendance data is:
- Documented (method is clear)
- Reproducible (you can verify it or replicate the method)
- Time-stamped (you know when it was recorded)
- Geographically-registered (you know where it took place)
- Non-political (collected by participants, not authorities or organisers)
Then researchers can:
- Build reliable databases of protest activity and turnout
- Compare events across time and geography with confidence
- Analyze trends (is protest activity growing? Changing character? Shifting locations?)
- Study participation (who shows up to what kinds of events?)
- Understand movement dynamics (which campaigns grow? Which fade?)
Rally Tally creates exactly this kind of data.
Rally Tally as a research tool
For researchers, Rally Tally offers several advantages:
Structured event records: Each event in Rally Tally includes date, time, location(s), route information (if applicable), and event category. This structure is ideal for database building.
Direct participation signals: Check-in counts are not estimates or guesses. They are counts of participants who voluntarily reported their presence. This is more reliable data than observational estimates.
Time series data: Participants check in throughout the event. This gives researchers a direct participation signal, not just a single observer estimate.
Reproducible across movements: Rally Tally uses the same structure regardless of whether the event is a labour protest, environmental rally, political march, or community gathering. This enables cross-movement comparison.
Anonymized and ethical: Check-in data is anonymous. You can analyze participation patterns without compromising participant privacy—a major advantage for studying sensitive events like political protests.
Non-censored historical record: Rally Tally creates an ongoing record independent of media coverage or government reports. Events that get minimal press coverage are still documented if participants check in.
Building protest databases with better foundations
If you are building a dataset of protest activity, including Rally Tally data alongside traditional sources (newspapers, organiser claims, police reports) dramatically improves data quality.
You can:
- Record all available sources for each event (organiser estimate, police estimate, news reports, check-in count)
- Compare them to identify patterns (authorities consistently lower by X%, organisers consistently higher by Y%)
- Use check-in data as calibration (if check-ins suggest Z, what does that tell us about the reliability of other sources?)
- Build confidence intervals rather than point estimates ("based on check-in data, turnout was likely in the range of X–Y")
This kind of transparency about data sources and uncertainty is far more valuable for research than false precision.
Understanding movement characteristics through check-in patterns
Rally Tally check-in data reveals things that single crowd estimates cannot:
Peak vs. sustained participation: Does participation spike at a particular time and then drop (rally-like) or remain steady throughout (march-like)? Different movement tactics show different patterns.
Geographic spread: For multi-location events or distributed actions, check-in data shows whether participation was concentrated or spread across locations.
Repeat participation: If you can link check-ins across events while maintaining anonymity, you can study whether participation is concentrated among core activists or distributed among broader publics.
Temporal trends: Comparing check-in data across successive events reveals whether a campaign is growing, holding steady, or declining.
Category shifts: Has a movement shifted from large rallies to smaller, more distributed actions? Check-in data would reveal that pattern over time.
Ethical research on sensitive populations
One advantage of Rally Tally for researchers studying sensitive movements is that the data is collected with privacy-first principles built in.
If you are researching:
- Political protest and dissent
- Sensitive social movements
- Activism in restrictive contexts
Rally Tally data can support your research without requiring you to compromise participant privacy or create personal identification trails. Participants contributed the data voluntarily, and anonymity was built into the system.
This is a significant advantage for research ethics compared to other data sources.
Comparative movement analysis
When different movements use Rally Tally, it enables comparative research:
- How do environmental movement events compare in scale to labour movement events?
- Are student-led protests growing or declining relative to other activism?
- Which geographic regions show more participation in public events?
- How do tactics (large rallies vs. distributed actions) affect participation?
These kinds of comparative questions are difficult to answer reliably with traditional crowd counting. Rally Tally structured data makes them addressable.
Limitations to be transparent about
As a researcher using Rally Tally data, be clear about its limitations:
- Not all participants check in. Check-in numbers represent a portion of physical attendance, not the total.
- Selection bias. People who check in may differ systematically from those who do not (tech-savvy, privacy-conscious, etc.).
- Tool adoption varies. Early events will have lower check-in rates as the tool gains adoption. Compare carefully across time.
- Geographic variation. Rally Tally may be more or less known in different regions, affecting representativeness.
These limitations are not flaws—all research has them. The important thing is to acknowledge them in your methods section and discuss them when interpreting results.
Accessing and analyzing Rally Tally data
If you are a researcher interested in using Rally Tally data:
- Explore rally-tally.com to see what event data is publicly available
- Contact Rally Tally about research access and data sharing agreements
- Combine Rally Tally data with your other sources (newspaper archives, police reports, organiser claims)
- Document your methods carefully so other researchers can replicate or extend your work
Contributing to better protest scholarship
As protest activity becomes increasingly decentralized and varied, traditional crowd counting becomes even more difficult. At the same time, having better participation data becomes even more important for understanding political change.
Rally Tally offers researchers a tool to contribute to better data infrastructure. By using and citing check-in data, scholars can:
- Build more reliable movement databases
- Enable better comparative research
- Create an open historical record
- Support future scholarship
Starting with one research question
If you are considering using Rally Tally data in your research:
- Identify a research question that would benefit from structured participation data (e.g., "Are environmental protests growing in size relative to labour protests?")
- Map out what events are relevant to your question
- Collect Rally Tally data alongside traditional sources
- Analyze the combined data to see what insights emerge
- Document your methods transparently for other researchers
A practical methods framework for researchers
If you want to use Rally Tally in a publishable study, define the data model before collecting events. A practical baseline schema includes:
- event identifier
- event date and time window
- location and route fields
- event category and movement type
- check-in total and check-in time distribution
- organiser estimate (if available)
- authority estimate (if available)
- media estimate(s)
Once this structure is fixed, your analysis becomes easier to replicate and audit.
Improving validity with triangulation
No single source captures full attendance. The strongest studies triangulate.
A useful sequence:
- Use Rally Tally check-ins as the direct participation signal.
- Add observer-based context (photos, route monitoring, field notes).
- Add organiser and authority estimates as comparative claims.
- Estimate plausible participation ranges instead of reporting one exact figure.
This approach keeps the advantages of check-in data while still acknowledging undercount risk and adoption effects.
Designing studies for reproducibility
For high-quality comparative research, predefine:
- inclusion criteria (which events enter the dataset)
- exclusion rules (which events are dropped and why)
- handling for missing fields
- treatment of multi-location events
- rule for separating physical and virtual participation
Then publish those rules in your appendix or methods note. Reproducibility improves when another team can apply your rules and reach similar conclusions.
Handling bias explicitly
Selection effects are expected in attendance datasets. Instead of hiding them, model them.
Common bias controls:
- include region and period fixed effects
- include proxy variables for digital access
- compare early-adoption and late-adoption periods separately
- run sensitivity analyses with conservative and liberal turnout assumptions
Even basic robustness checks significantly improve the credibility of findings based on participation signals.
Example research designs enabled by Rally Tally
1) Time-series movement analysis
Question: Is campaign participation increasing over a six-month period?
Data strategy: Use weekly check-in totals by event category, control for seasonality and day-of-week effects, then estimate trend slope.
2) Cross-city participation comparison
Question: Which cities sustain higher repeat mobilisation for the same cause?
Data strategy: Aggregate event-level check-ins by city and cause, normalize by event count, compare stability and volatility across cities.
3) Tactical format comparison
Question: Do static rallies or route marches produce higher sustained participation?
Data strategy: Categorize events by structure, compare check-in time distributions, and test differences in peak-to-total ratios.
Reporting standards for transparent scholarship
When publishing work that uses Rally Tally data, include:
- event selection rules
- period covered by the dataset
- total event count and missingness summary
- definition of participation variables
- limitations and uncertainty statement
This mirrors best practice in social science data reporting and makes downstream re-use much easier.
Frequently asked questions for research teams
Can Rally Tally check-ins stand alone as attendance truth?
No. They are a high-value participation signal, but best used in triangulation with other sources and method notes.
Is this data useful for cross-country studies?
Potentially, yes, but adoption differences across countries must be modeled carefully. Comparability requires explicit controls.
How should we treat events with very low check-ins?
Do not automatically discard them. Mark them as low-signal events, run sensitivity tests, and report the handling rule.
Can this support causal inference?
Only if your design supports it. Rally Tally can improve measurement quality, but research design still determines causal credibility.
What is the main value for historians?
A structured event archive that reduces dependence on inconsistent retrospective estimates.
External reading and references
- Rally Tally home: https://www.rally-tally.com
- Rally Tally guides: https://www.rally-tally.com/guides
- Rally Tally downloads: https://www.rally-tally.com/download
- Universal Declaration of Human Rights (Article 20, peaceful assembly): https://www.un.org/en/about-us/universal-declaration-of-human-rights
- Amnesty International overview of expression and protest rights: https://www.amnesty.org/en/what-we-do/freedom-of-expression/
- Background on crowd estimation methods: https://en.wikipedia.org/wiki/Herbert_Jacobs
Related guides for research context
- Methodology baseline: Crowd Counter
- Protest-specific participation data: Protest Attendance Tracker
- Geographic and route context: Protest Mapping Guide
- Broad attendance framing: Rally Attendance Tracker
- Event discovery source: All Events
- App download page: Download
Better protest scholarship starts with better data. Rally Tally is one tool that makes that possible.
For researchers studying social movements, political participation, and public gatherings, explore Rally Tally's research resources at rally-tally.com.