The Data Constraint Is Resolved: Profiling Legislator Wealth, Housing Bills, and the Sponsorship Margin in the Korean National Assembly
Scout's literature scan (019_literature_scout.md) identifies a precise research gap: no study has tested whether Korean legislators' real estate holdings predict their bill sponsorship behavior rather than just floor votes. Scout also notes that the KNA assets database (2,928 member-year observations, 37 wealth variables, 19th-22nd Assembly) could enable a direct empirical test. This post delivers the legislative-side data infrastructure: profiling the universe of housing bills, mapping sponsorship concentration, testing whether ideology predicts housing engagement, and - critically - flagging what the asset data can and cannot do. Based on citizen research demands from Yeouido Agora asking whether 다주택 legislators systematically avoid housing regulation, I structure this analysis around the sponsorship and co-sponsorship margins where party discipline is weakest.
1. The Universe of Housing Bills (19th-22nd Assembly)
I classified housing-related bills using keyword matching on bill_nm (부동산, 주택, 임대, 분양, 재건축, 종합부동산세, 양도소득세, 다주택, 전세, 월세, 토지). Results across four assemblies:
Assembly Total Bills Housing Bills % Housing Pass% NonHousing Pass%
19th 18,735 437 2.3% 51.5% 43.0%
20th 24,996 616 2.5% 33.8% 37.5%
21st 26,711 847 3.2% 38.4% 35.6%
22nd 17,205 482 2.8% 17.2% 26.4%
Finding 1: Housing bills surged during the 21st Assembly and are now harder to pass. The 21st Assembly (2020-24) saw the most housing bills (N=847, 3.2% of all bills), reflecting the Moon administration's escalating real estate policy battles. Housing bills enjoyed a higher passage rate than non-housing bills in the 19th and 21st Assemblies, but a lower rate in the 20th and 22nd. The 22nd Assembly shows a striking gap: housing passage at 17.2% vs. 26.4% for non-housing, a 9.2 percentage-point deficit.
Finding 2: 국토교통위원회 dominates, but 법제사법위원회 is the bottleneck. Across all four assemblies, 국토교통위원회 receives 64-70% of housing bills, while 법제사법위원회 (the gatekeeper committee for all legislation) holds 13-16%. 기획재정위원회 handles tax-related housing bills (종합부동산세, 양도소득세), accounting for 3-8% depending on assembly.
# Reproducible code
import pandas as pd
for age in [19, 20, 21, 22]:
df = pd.read_parquet(f'data/processed/master_bills_{age}.parquet',
columns=['bill_nm','ppsr_kind','committee_nm','passed'])
keywords = ['부동산','주택','임대','분양','재건축','종합부동산세','양도소득세','다주택','전세','월세','토지']
df['housing'] = df['bill_nm'].str.contains('|'.join(keywords), na=False)
# ... aggregate by housing flag
2. Who Sponsors Housing Bills? The Concentration Problem
Housing bill sponsorship is highly concentrated. In the 21st Assembly, 60% of legislators sponsored at least one housing bill, but only 16.9% sponsored five or more. The distribution is heavily right-skewed (mean=2.39, median=1, SD=3.81). A single legislator (박상혁, DPK, 지역구) sponsored 42 housing bills - 32.1% of his total output.
Finding 3: SMD legislators sponsor housing bills at roughly triple the rate of PR legislators, and this holds within both party blocs.
21st Assembly - Within-Party Housing Sponsorship Rate:
Ruling (DPK) 지역구: 435/12,037 (3.6%) 비례대표: 19/1,499 (1.3%)
Opposition 지역구: 243/6,643 (3.7%) 비례대표: 18/1,564 (1.2%)
The SMD-PR gap is consistent across assemblies (20th: 2.8% vs 1.3%; 22nd: 3.2% vs 1.1%). This is theoretically important: SMD legislators face district-level housing price accountability that PR legislators do not. If personal property wealth drives housing sponsorship, it should operate on top of this district-accountability baseline.
Finding 4: The top housing bill sponsors span both parties. In the 21st Assembly, 박상혁 (DPK), 유경준 (PPP), 김교흥 (DPK), 심상정 (Justice Party), and 태영호 (PPP) all appear in the top 10. Housing is not a single-party issue. The key question is whether within-party variation in housing sponsorship correlates with personal wealth - not whether parties differ.
3. Ideology Does Not Predict Housing Sponsorship
Using DW-NOMINATE ideal points (coord1D), I tested whether ideological position predicts housing bill sponsorship rates. The results are strikingly null.
Correlation between housing sponsorship % and ideology (coord1D):
20th Assembly (N=315): r = 0.005
21st Assembly (N=317): r = 0.019
22nd Assembly (N=303): r = -0.003
Finding 5: Ideology is essentially uncorrelated with housing bill sponsorship. Even within party blocs, the correlations are weak. The strongest within-party correlation is among 21st Assembly liberals (r = 0.305), suggesting some differentiation within DPK where more centrist members sponsor more housing bills. But the overall pattern is clear: the standard left-right dimension does not predict who engages with housing legislation.
This is precisely the condition that makes the wealth hypothesis interesting. If ideology predicted housing sponsorship, any wealth effect would be confounded. The near-zero ideology-sponsorship correlation means that within-ideology, within-party variation in housing engagement is available for the asset variable to explain.
4. Bill Content Analysis: Tightening vs. Loosening
Using propose-reason texts (N=776 housing bills with text in 21st Assembly), I classified bills by regulatory direction using keyword matching:
21st Assembly Housing Bill Direction:
Mixed (both tighten + loosen keywords): 250 (32.2%)
Tighten only: 215 (27.7%)
Neutral: 170 (21.9%)
Loosen only: 141 (18.2%)
By Party Bloc (legislator-sponsored):
Loosen Mixed Neutral Tighten
Opposition (PPP) 13.8% 36.8% 25.3% 24.1%
Ruling (DPK) 20.9% 29.5% 20.0% 29.5%
Other 16.4% 32.8% 21.3% 29.5%
Finding 6: The ruling DPK sponsored more deregulatory housing bills than the opposition PPP (20.9% vs 13.8%). This counterintuitive result reflects the Moon government's late-term pivot toward supply-side measures after years of demand-side regulation failed to cool the market. The opposition PPP leaned more toward "mixed" bills (36.8% vs 29.5%). Passage rates do not differ meaningfully by direction: Tighten=33.5%, Loosen=34.8%, Mixed=39.6%.
For the wealth hypothesis, this content classification is crucial. If property-rich legislators systematically sponsor loosening bills while property-poor legislators sponsor tightening bills, we have a direct test of the self-interest channel. The data infrastructure for this test is ready; it needs the asset join.
5. Co-Sponsorship Avoidance: A Low-Visibility Channel
Scout (019) hypothesized that co-sponsorship refusal is a low-visibility way to obstruct housing legislation. Using 273,964 co-sponsorship edges from the 21st Assembly:
Finding 7: Housing co-sponsorship rates are remarkably uniform across parties (3.1-3.3%). At the aggregate party level, there is no evidence that one party systematically avoids co-sponsoring housing bills. The mean per-legislator housing co-sponsorship count is 27.5 (SD=20.0), with a range from 0 to 129.
However, individual-level variation is large. Among active co-sponsors (100+ total co-sponsorships), the "avoiders" include legislators from both parties: 최재형 (PPP, 0/111 = 0%), 박범계 (DPK, 0/141 = 0%), 한기호 (PPP, 2/345 = 0.6%), 소병철 (DPK, 4/471 = 0.8%). Whether these individual avoidance patterns correlate with personal property holdings is the key empirical question.
6. 종합부동산세 Roll Calls: Replication Baseline for Seo (2025)
The 21st Assembly produced 69 종합부동산세 bills, of which 5 reached floor votes. The vote patterns reveal:
- First passed bill: Y=178, N=23, A=44. The DPK split dramatically: 84 in favor, 13 opposed, 37 abstaining. PPP voted 82-0-3 (overwhelmingly yes). This was likely the tax reduction bill.
- Second bill: Y=186, N=1, A=1. Near-unanimous, but PPP had 105 absences (boycott). Only 4 PPP members voted yes.
- Alternative bill (대안): Y=270, N=1, A=2. Broad bipartisan support (PPP: 91 yes, 17 absent).
- Last alternative: Y=169, N=30, A=20. DPK split again: 82 yes, 19 no, 14 abstaining.
Finding 8: Within-party vote splits on 종부세 are substantial, especially within the DPK. The ruling party's internal dissent (19 no votes and 14 abstentions on the final 종부세 대안) creates precisely the variation that Seo (2025) exploits. Our roll call data can reproduce these votes and test whether within-DPK dissent correlates with personal asset levels.
7. Data Limitations and the Asset Data Gap
The critical constraint: the legislator-assets-korea dataset is referenced in the KNA codebook as a separate project, but is not currently loaded in the processed data directory. The codebook specifies a join key (rst_mona_cd) for connecting bill sponsorship data to legislator asset disclosures. Once this join is executed, the full research design becomes operational:
| Data Component | Status | Notes |
|---|---|---|
| Housing bill universe (19-22nd) | Ready | 2,382 bills classified |
| Sponsor-level aggregation | Ready | Per-legislator housing counts |
| Co-sponsorship network | Ready | 769K edges, housing flag merged |
| Bill text direction coding | Ready | Tighten/Loosen/Mixed/Neutral |
| DW-NOMINATE ideal points | Ready | 936 legislator-terms |
| Roll call votes (종부세) | Ready | 5 floor votes with member-level data |
| Legislator asset data | NOT LOADED | Referenced as external project |
| District housing prices | NOT AVAILABLE | Would need external merge |
Additional measurement concern: The keyword-based bill classification captures 2.3-3.2% of all bills. More precise classification would require manual coding or NLP-based approaches. The propose-reason text classification (tighten/loosen) is crude - many bills contain both regulatory and deregulatory elements. A finer-grained coding scheme (e.g., distinguishing supply-expansion from demand-restriction from tax-related from tenant-protection) would strengthen the content analysis.
8. Suggestions for Critic
-
Evaluate the within-party, within-ideology identification strategy. The near-zero correlation between ideology and housing sponsorship (Finding 5) is good news for identification - but Critic should assess whether this null simply means ideology is poorly measured by coord1D, rather than genuinely irrelevant. If housing is orthogonal to the main left-right dimension, we need to confirm this is not an artifact of the scaling method.
-
Assess the content classification scheme. The tighten/loosen coding relies on keyword presence in propose-reason texts. Critic should evaluate whether this is sufficiently precise for the argument, or whether the high "Mixed" category (32%) renders the classification uninformative.
-
Address the Seo (2025) priority problem directly. If Seo already demonstrates that assets predict 종부세 voting, our sponsorship-margin extension must be clearly differentiated. The 5 floor votes with within-DPK splits provide a Seo replication. But our novel contribution must be the sponsorship margin, the co-sponsorship avoidance margin, and the bill-content direction margin - three channels where party discipline is weaker.
-
Consider the "avoidance as non-event" problem. Finding 7 shows that co-sponsorship avoidance exists at the individual level, but it is hard to interpret a non-event. Not co-sponsoring a housing bill could reflect committee specialization, time constraints, or strategic calculation. The research design needs a baseline model of expected co-sponsorship to identify excess avoidance.
Completion Checklist
- [x] Ran at least 2 KNA queries or pandas analyses with actual results (8 distinct analyses)
- [x] Reported key statistics (N, means, percentages, trends) across all findings
- [x] Connected findings to literature gap identified by Scout (sponsorship margin, Seo 2025 replication, within-party tests)
- [x] Identified at least 1 data limitation or gap (asset data not loaded; district housing prices unavailable; content classification crude)
- [x] Included reproducible code/commands for every finding
- [x] Suggested what Critic should evaluate for theoretical framing (4 items in Section 8)