In May 2026 the SEC issued a proposal that would allow public companies to switch from quarterly (Form 10-Q) to semi-annual financial reporting (a new Form 10-S). From the date of publication, the public has 60 days to comment on the proposal.
This site was produced by Professor Tzachi Zach at The Ohio State University Fisher College of Business as a public service to track the comment letters as they arrive, classify their positions, and surface patterns in the docket. A number of methodological decisions had to be made along the way — how to classify stance, how to bucket commenters by entity type, how to handle hedged or conditional letters — those decisions are explained below.
Beyond the tracker itself, the project has two other goals. First, I hope it will encourage discussion among accounting academics about the proposal. The economic-analysis questions the proposal raises (short-termism, fraud risk, compliance burden, retail-investor protection) are central to what we study, and the comment period is a good moment to bring our expertise to bear. Second, it is quite interesting to test how well language-model classifiers handle regulatory text. With a few hundred letters and fast iteration, can we converge on classifier design choices that scale to future research? The site will be most relevant to accounting academics, auditors, preparers, financial analysts, IR professionals, and anyone who follows the SEC docket professionally.
This is not the first time the SEC has revisited reporting frequency. In 2018 the Commission issued a request for comment on the same question — earnings releases and the frequency of quarterly reporting (Release 33-10588, File No. S7-26-18). That one solicited public input without proposing a specific rule, unlike the current formal proposal. I built a companion tracker of the 2018 comment letters, available here.
Comments, suggestions, or corrections welcome — send feedback here or email zach.7@osu.edu.
59972 submitters across 10 template types · 59972 Oppose
The SEC posts campaign / form letters as a template text plus a submitter count, without docketing the individual submissions. These are tracked separately from the 8011 individually-classified letters above and are not summed with them: the submitter counts are anonymous and may overlap with named letters. Held out of the classification statistics and the regression. As reported by the SEC as of 2026-07-01.
| Type | Submitters | Stance | Rationale | Template text |
|---|---|---|---|---|
| Type A | 23786 | Oppose | — | “Dear Commissioners, Please keep quarterly reporting requirements in place. Thanks,” |
| Type B | 212 | Oppose | — | “Dear SEC Officials, As an investor, I oppose this rule change. Regards,” |
| Type C | 142 | Oppose | — | “I strongly oppose moving to semiannual reporting.” |
| Type D | 7 | Oppose | IA MF IP CMP | View full template on SEC.gov → |
| Type E | 43 | Oppose | IA IP CMP MF AL | View full template on SEC.gov → |
| Type F | 35105 | Oppose | IA FR IP | “Dear SEC, I oppose the SEC proposal because it prevents investors like me from accessing information about companies; lets companies hide behind closed doors; and allows fraud to fester. Sincerely,” |
| Type G | 73 | Oppose | IA IP FR MF ICc | View full template on SEC.gov → |
| Type H | 59 | Oppose | IA MF CMP FR SG IP | View full template on SEC.gov → |
| Type I | 365 | Oppose | FR IA EX CB | View full template on SEC.gov → |
| Type J | 180 | Oppose | IA IP MF US | View full template on SEC.gov → |
If the form-letter submitters are added to the docketed letters (hypothetical, not the headline count): 67983 total (8011 + 59972) — 67897 Oppose (7925 + 59972) = 99.9%, 52 Conditional = 0.1%, 34 Support = 0.1%. Shown for scale only; the two tallies are reported separately above because the form-letter submitters are anonymous and may overlap with named letters.
These letters were filed on the docket but state no Support / Oppose / Conditional position on reporting frequency. One is substantive — Better Markets' formal request to extend the comment period. The rest are brief acknowledgments ("Thanks", "Bueno", "I'm interested", "Sounds cool") that engage no argument. All are excluded from the stance totals, the charts, the rater statistics, and the regression, but appear in the All-letters table and the search so they remain findable.
| # | Date | Commenter | Type | What it is |
|---|---|---|---|---|
| #2926 | 2026-06-30 | Meryl Moran Individual | Individual | Empty submission — salutation only, no content or position. |
| #2918 | 2026-06-30 | Cristian mj Sanchez Individual | Individual | Empty submission — salutation only, no content or position. |
| #1380 | 2026-06-15 | MFA, AIMA, SIFMA Asset Management Trade associations (MFA, AIMA, SIFMA AM) | Trade association / advocacy organization | Three asset-management associations request a 60-day extension of the comment period, taking no substantive position on the proposal itself. |
| #4254 | 2026-06-11 | Chris Bartlett Individual | Individual | No position: the comment contains only the commenter’s name and city, with no stance on the proposal. |
| #1964 | 2026-06-07 | Karen Pedowitz Individual | Individual | Manager at a 'top firm'; comment is about clean air and water, not reporting frequency — states no position on the proposal (held out). |
| #1702 | 2026-06-07 | Tammy Munson Individual | Individual | Incoherent submission with no discernible position on the proposal. |
| #1585 | 2026-06-06 | Keith Individual | Individual | Off-topic remark about making money online with no position on the reporting rule. |
| #1584 | 2026-06-06 | Kasey Shue Individual | Individual | Garbled message about not seeing open comments with no discernible position on the proposal. |
| #1524 | 2026-06-06 | Theodore Manzanares Individual | Individual | A rambling general statement about people and loyalty that takes no clear position on the rule. |
| #1487 | 2026-06-06 | Nick Matthews Individual | Individual | Off-topic political commentary about progressive leadership that never addresses the reporting rule. |
| #1132 | 2026-06-05 | Justin Toney Individual | Individual | An individual muses that reporting frequency was not what enabled Enron's fraud and is unsure the proposed change matters — taking no clear position. |
| #739 | 2026-06-04 | Joseph Corcoran & Raymond Mosca, SIFMA Managing Director & Associate General Counsel; AVP, Asset Management Group | Trade association / advocacy organization | SIFMA (Joseph Corcoran, MD & Associate General Counsel) and SIFMA AMG (Raymond Mosca, AVP) — the leading broker-dealer/asset-manager trade association — request a 60-day extension of the comment period, citing the proposal's complexity (underwriter due diligence, investor-information timeliness, comparability), two overlapping offering-reform proposals, and the July 6 deadline's collision with the U.S. semiquincentennial. The second formal extension request on the docket (after Better Markets #701). Classified No position (procedural); entity Trade association / advocacy organization. |
| #701 | 2026-06-04 | Dennis M. Kelleher & Benjamin L. Schiffrin, Better Markets, Inc. Cofounder/President/CEO; Director of Securities Policy | Trade association / advocacy organization | Better Markets (Dennis Kelleher, Cofounder/President/CEO; Benjamin Schiffrin, Director of Securities Policy) — a non-profit financial-reform advocacy organization — requests a 60-day extension of the comment period (to 120 days total), arguing the 279-page, 58-RFC proposal needs more time and that the far shorter 2018 request got 90 days. It defers its substantive position to a later filing. Classified No position (procedural extension request); entity Trade association / advocacy organization. |
| #1008 | 2026-06-01 | Rodney Ross Individual | Individual | An individual submits a rambling appeal for equal access and inclusion in financial opportunity that states no clear position on the reporting-frequency proposal. |
| #695 | 2026-05-31 | Sara Brantley Individual | Individual | Two-word submission ('Sounds cool') stating no position. Held out as No position. |
| #657 | 2026-05-29 | Scott James Individual | Individual | Two-word submission ('I'm interested') stating no position on the proposal. Held out as No position. |
| #656 | 2026-05-29 | Shem Alhassan Individual | Individual | Two-word submission ('I'm interested') stating no position on the proposal. Held out as No position. |
| #496 | 2026-05-28 | Pedro Cejudo Individual | Individual | Single Spanish word 'Bueno' - does not substantively engage with the proposal. Held out as No position per the precedent set by #308 ('Thanks'). |
| #308 | 2026-05-22 | Anonymous Individual | Individual | Single word ('Thanks'); no position on the proposal. Held out of the public split (No position). |
| #293 | 2026-05-21 | Levonnie Hunter Individual | Individual | Non-substantive comment ('I am very excited') with no position on reporting frequency. Filed under the new 'No position' category (held out of the public split, like Off-topic). |
Each letter card in the table below carries an agreement badge (Unanimous / 2-of-3) and shows all three rater calls on hover.
I then ran the same 3 rubric prompts through ChatGPT-5.5 as an independent second ensemble. The question I wanted to answer: would the stance calls hold up under a different model family? Carlson & Burbano (SMJ 2026) recommend this kind of cross-model check where feasible.
GPT-Majority vs Claude-Majority: 136 / 142 = 96% raw, Cohen's κ = 0.86. Substantial cross-model agreement on the aggregated label.
Per-rubric agreement varies:
The Skeptic divergence reflects a rubric-conditioning effect. The same "default to Conditional unless unambiguous" instruction yielded 86 Conditional calls in GPT-5.5 versus 37 in Claude Opus 4.7. Same prompt, different operationalization across model families. Aggregate agreement on the majority vote holds; per-rubric agreement is more model-dependent.
6 letters fall outside the cross-model majority match: #2 Fardeen Irani, #3 Anonymous, #13 Skyler Mathis, #43 Steven A. Collazo, #80 Bayo Olabisi, #122 Tal Madison. All 6 push from Claude's Support or Oppose call into GPT's Conditional call. 4 of the 6 already had at least one Claude rater calling Conditional, so the cross-model disagreement concentrates on the hedge-boundary letters Claude's own ensemble was already split on.
I read all 6 of these letters by hand. On every one, Claude-Majority is the call I would have made; GPT-Majority is not. The uniform GPT failure mode: it treats any structural alternative or rhetorical hedge in the letter body as evidence of conditionality, even when the author's stated position is unambiguous. Six letters is a small validation set, but the direction is one-sided.
I ran those same comment letters through four open-weight models that anyone can download and run for free, and compared each model's majority stance call to the label shown on this site.
On the same letters where ChatGPT-5.5 reaches κ = 0.86, the three capable open models land in the same band (κ = 0.80 to 0.90), and a free 24-billion-parameter model edges the commercial system. Across the full docket of 1,898 letters the capable models reproduce the site's stance label on 93 to 98 percent. Kappa runs lower on the full docket because it is roughly 97 percent Oppose, so agreement by chance is high and the metric is pushed down even when raw agreement is near-perfect; the 142-letter sample above is less skewed. Qwen, kept as a deliberate floor, is the one that breaks down.
I validated these classifications against a human expert gold standard and a 14-model benchmark spanning Claude, GPT, Gemini, and four open-weight models. Stance is where the labels are most reliable; entity and rationale are harder, for people and machines both.
How reliable are these labels? →Substantially higher agreement than the stance ensemble (κ = 0.625). Majority headline distribution: Individual 7615 / Investment professional 105 / Accountant (CPA) 96 / Industry practitioner 55 / Legal practitioner 37 / Issuer-current 33 / Issuer-former 29 / Academic researcher 26 / Student 9 / Trade association / advocacy organization 6.
The same 3 rubric prompts ran through ChatGPT-5.5 as an independent second ensemble.
GPT-Majority vs Claude-Majority: 120 / 142 = 85%, Cohen's κ = 0.63. Moderate cross-model agreement on the aggregated label.
Per-rubric agreement:
The pattern is systematic. GPT-5.5 has a stronger "Individual" prior than Claude Opus 4.7 across all three rubrics. The biggest split is on writers who sign as "CFO, ACME Corp" or similar institutional role but write in a personal register engaging investor-protection concerns rather than issuer-specific concerns: GPT-Primary classifies as Individual; Claude-Primary classifies as Issuer-current. Both readings are defensible under the rubric. The rubric requires a "speaking-as" judgment, and the two model families weight surface role vs register differently.
Intra-model Fleiss κ is 0.87 for Claude and 0.78 for GPT. Within-model agreement holds for both ensembles; the divergence is across model families.
22 letters fall outside the cross-model majority match. 17 of 22 flow into GPT-Individual from a more specific Claude bucket. 11 of these 22 are already on the contested-letters list internal to Claude's own three-rater ensemble.
The same four open-weight models, scored against the entity bucket shown on this site:
Entity is the harder call for every model. The capable open models (κ = 0.51 to 0.55) sit just behind ChatGPT-5.5 (κ = 0.63) and well above the Qwen floor, the same ordering as on stance. Who is writing is a genuine judgment call, and the open and commercial models miss it in the same places.
I validated these classifications against a human expert gold standard and a 14-model benchmark spanning Claude, GPT, Gemini, and four open-weight models. Stance is where the labels are most reliable; entity and rationale are harder, for people and machines both.
How reliable are these labels? →Anti-proposal codes use a red shade and pro-proposal codes a green shade, so the directional balance reads at a glance. Every SEC-engaged code carries a verbatim quote from the proposing release. The quote shows how the SEC itself frames the argument.
Low-frequency codes have volatile κ (swinging to either extreme on a handful of calls): CC (contractual constraints) on 9 letters (κ 0.93); ICs (intl evidence — supportive) on 9 letters (κ 1.00); OP (OP) on 49 letters (κ 0.91). κ stabilizes as code frequency rises.
The same 3 rubric prompts ran through ChatGPT-5.5 as an independent second ensemble.
GPT-Majority vs Claude-Majority: mean per-code Cohen's κ = 0.44. Moderate cross-model agreement, lower than the stance ensemble (κ = 0.86) and the entity ensemble (κ = 0.63). The ranking is consistent with rationale being the most inferential and multi-label of the three ensembles. Set-level exact match (GPT majority code set identical to Claude majority code set): 37 of 142 letters, 26%. Mean Jaccard similarity between the two majority sets: 0.53.
Per-rubric mean κ across the 20 codes:
Surface-readable codes converge across model families: MF (κ = 0.79), CMP (κ = 0.76), FR (κ = 0.70), LE (κ = 0.70), NR (κ = 0.60). Inferential or umbrella codes diverge: EX (κ = 0.20), IP (κ = 0.32), OP (κ = 0.32), AU (κ = 0.39).
The rubric-conditioning pattern visible in the stance and entity ensembles shows up again. GPT-Inclusive fires 4.50 codes per letter; Claude-Inclusive fires 2.84. Same "code whenever plausibly invoked" instruction, very different operationalization. Claude's three rationale raters stay within a 25% spread of each other (2.25 to 2.84 codes per letter); GPT's three span nearly 2x (2.44 to 4.50). The 3-rater majority κ (0.477) is higher than two of the three matched-rubric κs, which shows that aggregation dampens cross-model variance just as it does within-model.
Full argument taxonomy with SEC quotes →
I validated these classifications against a human expert gold standard and a 14-model benchmark spanning Claude, GPT, Gemini, and four open-weight models. Stance is where the labels are most reliable; entity and rationale are harder, for people and machines both.
How reliable are these labels? →Letters grouped by who submitted them, color-coded by stance.
Letters grouped by word count, color-coded by stance.
For each entity type, how its letters distribute across word-count buckets.
Same predictors across all three models (7 entity dummies with Individual as reference, plus log(words+1)). The Logit and LPM share a binary outcome (Support=1 / Oppose=0, Conditional dropped). The ordinal logit uses the full 3-class outcome (Oppose < Conditional < Support). Each cell shows coefficient on top, SE in parentheses below, p-value in italics underneath.
| Variable | Logit Support vs Oppose |
Ordinal logit Oppose < Cond. < Support |
LPM (OLS) Support vs Oppose, HC1 |
|---|---|---|---|
| Constant | −9.59 *** (0.79) p = 0.000 | — (cutpoints below) | −0.021 *** (0.007) p = 0.004 |
| Accountant CPA | +0.60 (1.07) p = 0.571 | +1.48 *** (0.48) p = 0.002 | +0.005 (0.011) p = 0.668 |
| Issuer-current | +2.17 ** (0.88) p = 0.014 | +2.55 *** (0.48) p = 0.000 | +0.065 (0.050) p = 0.187 |
| Issuer-former | separated | +1.97 *** (0.65) p = 0.002 | separated |
| Investment prof. | +0.17 (1.07) p = 0.875 | +0.11 (0.75) p = 0.881 | +0.002 (0.009) p = 0.832 |
| Academic | +0.59 (1.07) p = 0.580 | +0.88 (0.67) p = 0.189 | +0.074 (0.059) p = 0.203 |
| Industry pract. | +0.86 (1.07) p = 0.421 | +0.82 (0.68) p = 0.228 | +0.011 (0.019) p = 0.576 |
| Legal pract. | +1.01 (1.16) p = 0.382 | +0.39 (1.06) p = 0.713 | +0.019 (0.025) p = 0.464 |
| Trade assoc. | +1.95 * (1.08) p = 0.071 | +1.47 (0.98) p = 0.134 | +0.308 (0.192) p = 0.108 |
| Student | +2.95 *** (1.11) p = 0.008 | +2.08 * (1.11) p = 0.061 | +0.102 (0.106) p = 0.334 |
| log(words+1) | +0.94 *** (0.17) p = 0.000 | +0.97 *** (0.11) p = 0.000 | +0.006 *** (0.002) p = 0.001 |
| N | 7959 | 8011 | 7959 |
| Fit | McFadden R² = 0.164 | McFadden R² = 0.179 | R² = 0.038 (adj. 0.037) |
| Log-likelihood | -183.45 | -437.59 | (OLS) |
Each letter can invoke zero or more argument families (multi-label). 22-code taxonomy anchored on the SEC's proposing release — see the argument taxonomy for code definitions and verbatim SEC quotes.
Same rationales, stacked by the stance of the letter that cited them. Hover any code pill on the y-axis for a short explanation.
Top 5 longest letters within each stance bucket. The substantive intellectual center of each side reads at a glance.
Each letter is classified three times for stance, entity, and rationales, each time by a different Claude rubric. The small pill next to each value shows whether the three raters agreed: Unanimous (all three matched), 2 of 3 (majority match), or Split (all three differed, rationales only). See the methodology sections above for the rubric details.
| # | Date | Commenter | Role | Stance | Words | Rationales |
|---|