The Texas Tribune captured every impassioned debate, angry tirade, anecdote, joke and banality uttered into the microphones in the state House and Senate during the 82nd legislative session in a collection of online transcripts. The latest series of our data applications helps you identify when important debates occurred by visualizing the frequency of keywords.
Each day this week we’ll be releasing new topic pages to help readers delve into the debates: The Budget (today), Public and Higher Education (Tuesday), Health Care (Wednesday), Immigration (Thursday), Criminal Justice and Wild Card Issues (Friday).
Below is a brief explanation of how to use the applications.
This graph shows an overview of how frequently lawmakers discussed these topics. Each point denotes the total number of times the words were said in either chamber on that day. Simplifying the graph — clicking on the legend to remove topics from the visualization — makes it easier to identify trends. The higher the peak — i.e., the greater number of times a word was spoken — the more likely a lengthy debate about the topic took place that day.
On March 31, for example, both “budget” and “education” were said more than 100 times. A look at the House transcript from that day reveals representatives debated multiple bills related to balancing the budget by making drastic cuts to public education.
The visualizations on each of the topic pages are more useful for finding transcripts related to specific debates in the House and Senate. Separate graphs for each chamber show the frequency of multiple keywords related to the topic. Highlights from the session, with links to the transcripts and the Tribune’s coverage of the event, are also included on each page.
Visually, the graphs provide a sense of how often lawmakers discussed certain topics. The graph above shows lawmakers said the words “budget,” “education” and “health care” almost daily.
Take the word “education” — which appears in the transcripts more than the words “budget,” “health care” and “immigration” combined. The particularly high frequency could be attributed to a variety of factors. Lawmakers considered more than 600 bills related to public and higher education. And disagreements on how to reform education finance had lawmakers tied up in days-long debates, as they attempted (and often failed), to pass bills related to “education.”
In comparison, lawmakers discussed few bills about TWIA, the Texas Windstorm Insurance Association, and the word was rarely heard in either chamber. After Gov. Rick Perry called on lawmakers to reform TWIA’s claims process during the special session, the frequency of the word “TWIA” spikes dramatically, but only on the days lawmakers discussed that particular bill. Similarly, the frequency of the word “immigration” is low most days, but its usage increases on days when lawmakers debated a bill to ban so-called “sanctuary cities.”
One note of caution about making assertions based on the frequency of a word. The program designed by the Tribune to search for word frequency counts the number of times a string of letters appears in the text of each transcript. A search for the word “tax,” which is included in the budget visualization, will also count the appearance of words like “taxes” and “taxation.” But it will also count words unrelated to the topic, such as “taxidermy” and “taxi.”
Searches for longer, more specific text like “Rainy Day Fund” usually generate fewer, but more precise word counts. The best way to find important debates on a subject like taxation is to identify days where multiple related words like “revenue” and “fee” also have a high frequency.
Comparing the use of two or more words is also a useful method for finding debates incorporating both topics. For example, comparing “budget” and “education” can help pinpoint debates related to education finance reform. When the frequency of both words peaks on the same day it suggests lawmakers were debating how to finance education.
When making assumptions from the graph about the frequency of a word, remember to consider all the possible usages of a word. For example, two words in the budget visualization — “appropriation” and “finance” — are also the names of the House and Senate committees, respectively, that review the budget bill. Therefore, every mention of a House Appropriations Committee hearing will be counted in the search for “appropriation.”
Once you’re acquainted with how to use the graphs, click between the tabs to see how often words appeared in the Senate compared to the House. A cross-examination of graphs from the two chambers can reveal differences in how often lawmakers spoke about certain issues. Please note: The average frequency of most words is lower in the Senate than in the House because there are fewer lawmakers involved: 32 senators compared to 150 representatives.
// <![CDATA[Highcharts.theme = { colors: [‘blue’] };// prevent errors in default themevar highchartsOptions = Highcharts.getOptions(); var comp_all = new Highcharts.Chart({ chart: { renderTo: ‘comp_all’, defaultSeriesType: ‘line’, width: 636, marginRight: 130, marginBottom: 25 }, title: { text: ‘Days of Debate: Comparative Word Counts’, x: -20 //center }, subtitle: { text: ‘Source: 82nd Legislative Session Transcripts’, x: -20 }, xAxis: { categories: [‘Jan. 11’, ‘Jan. 12’, ‘Jan. 13’, ‘Jan. 18’, ‘Jan. 19’, ‘Jan. 20’, ‘Jan. 24’, ‘Jan. 25’, ‘Jan. 26’, ‘Jan. 27’, ‘Jan. 31’, ‘Feb. 1’, ‘Feb. 2’, ‘Feb. 3’, ‘Feb. 7’, ‘Feb. 8’, ‘Feb. 9’, ‘Feb. 14’, ‘Feb. 15’, ‘Feb. 16’, ‘Feb. 17’, ‘Feb. 21’, ‘Feb. 22’, ‘Feb. 23’, ‘Feb. 24’, ‘Feb. 28’, ‘March 1’, ‘March 2’, ‘March 3’, ‘March 7’, ‘March 8’, ‘March 9’, ‘March 10’, ‘March 14’, ‘March 15’, ‘March 16’, ‘March 17’, ‘March 21’, ‘March 22’, ‘March 23’, ‘March 24’, ‘March 28’, ‘March 29’, ‘March 30’, ‘March 31’, ‘April 1’, ‘April 4’, ‘April 5’, ‘April 6’, ‘April 7’, ‘April 8’, ‘April 11’, ‘April 12’, ‘April 13’, ‘April 14’, ‘April 15’, ‘April 18’, ‘April 19’, ‘April 20’, ‘April 21’, ‘April 26’, ‘April 27’, ‘April 28’, ‘April 29’, ‘May 2’, ‘May 3’, ‘May 4’, ‘May 5’, ‘May 6’, ‘May 7’, ‘May 9’, ‘May 10’, ‘May 11’, ‘May 12’, ‘May 13’, ‘May 16’, ‘May 17’, ‘May 18’, ‘May 19’, ‘May 20’, ‘May 21’, ‘May 23’, ‘May 24’, ‘May 25’, ‘May 26’, ‘May 27’, ‘May 28’, ‘May 29’, ‘May 30’, ‘May 31’, ‘June 1’, ‘June 2’, ‘June 3’, ‘June 6’, ‘June 7’, ‘June 8’, ‘June 9’, ‘June 10’, ‘June 13’, ‘June 14’, ‘June 15’, ‘June 16’, ‘June 20’, ‘June 21’, ‘June 22’, ‘June 24’, ‘June 27’, ‘June 28’, ‘June 29’], labels: { step: 20 }, }, plotOptions: { series: { marker: { enabled: false } } }, yAxis: { min: 0, title: { text: ‘Word Count’ }, plotLines: [{ value: 0, width: 1, color: ‘#808080’ }] }, tooltip: { formatter: function() { return ‘‘+ this.series.name +’
‘+ this.x +’: ‘+ this.y +’ occurences’; } }, legend: { layout: ‘vertical’, align: ‘right’, verticalAlign: ‘top’, x: 0, y: 100, borderWidth: 0 },series: [{ name: ‘Budget’, color: ‘ #B12125’, data: [ 20, 0, 0, 7, 2, 0, 32, 1, 11, 0, 1, 1, 2, 0, 0, 0, 1, 7, 0, 3, 5, 4, 2, 30, 4, 1, 0, 7, 14, 2, 2, 3, 2, 3, 3, 2, 7, 1, 5, 11, 2, 1, 0, 2, 139, 91, 31, 0, 12, 6, 3, 0, 10, 43, 8, 2, 2, 34, 20, 2, 4, 1, 29, 71, 3, 100, 22, 15, 20, 1, 4, 12, 7, 10, 15, 18, 20, 20, 17, 49, 2, 6, 21, 3, 2, 18, 77, 43, 3, 0, 0, 0, 8, 6, 0, 5, 91, 18, 0, 6, 5, 11, 4, 1, 2, 0, 24, 12, 3] }, { name: ‘Education’, color: ‘ #3E67B0’, data: [ 6, 1, 0, 2, 5, 2, 2, 5, 12, 6, 5, 9, 14, 3, 3, 4, 7, 16, 16, 19, 15, 2, 40, 11, 5, 26, 10, 13, 13, 7, 14, 7, 4, 6, 8, 19, 10, 13, 9, 28, 9, 8, 8, 17, 136, 86, 10, 7, 34, 16, 1, 5, 18, 24, 47, 10, 11, 22, 22, 16, 33, 14, 29, 52, 22, 89, 61, 25, 77, 1, 30, 28, 43, 57, 30, 34, 21, 14, 36, 63, 7, 76, 79, 12, 11, 47, 45, 56, 7, 2, 2, 6, 10, 4, 3, 3, 116, 29, 2, 9, 3, 13, 4, 0, 1, 0, 15, 5, 2] }, { name: ‘Health Care’, color: ‘ #742E66’, data: [ 0, 0, 0, 2, 2, 0, 0, 0, 4, 1, 0, 2, 4, 0, 1, 1, 5, 1, 5, 1, 6, 1, 12, 1, 1, 3, 5, 2, 24, 2, 3, 1, 1, 1, 1, 1, 0, 0, 3, 2, 3, 1, 1, 1, 8, 9, 6, 1, 0, 10, 0, 1, 6, 4, 16, 0, 3, 11, 20, 3, 24, 2, 6, 5, 7, 8, 12, 4, 3, 0, 6, 2, 0, 7, 0, 15, 8, 8, 40, 3, 2, 20, 20, 0, 1, 11, 13, 17, 0, 3, 1, 1, 4, 0, 1, 23, 28, 4, 2, 1, 14, 0, 2, 0, 0, 0, 7, 1, 0]}, { name: ‘Immigration’, color: ‘ #FCC10F’, data: [ 2, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 1, 2, 1, 0, 3, 1, 0, 0, 1, 0, 1, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 1, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 1, 6, 0, 9, 2, 0, 2, 4, 0, 40, 0, 0, 30, 1, 3, 3, 3, 2, 0, 2, 1, 0, 3, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 4, 0, 0, 95, 0, 0, 0, 0, 0, 0, 1, 0, 0] }, { name: ‘Criminal’, color: ‘#44ABDF’, data: [0, 0, 0, 0, 0, 0, 3, 0, 1, 0, 0, 1, 1, 0, 1, 0, 1, 7, 3, 0, 8, 1, 12, 3, 1, 1, 4, 4, 5, 5, 5, 0, 1, 3, 3, 8, 3, 3, 7, 8, 3, 1, 3, 12, 21, 7, 9, 10, 2, 22, 5, 7, 7, 18, 19, 2, 15, 18, 20, 23, 45, 3, 30, 13, 10, 48, 26, 7, 29, 1, 4, 15, 11, 44, 38, 8, 7, 20, 14, 19, 12, 48, 32, 1, 9, 14, 8, 8, 4, 0, 0, 0, 0, 0, 1, 1, 5, 0, 0, 51, 1, 0, 11, 5, 2, 0, 6, 13, 1]},{ name: ‘TWIA’, color: ‘#FF7F0E’, data: [ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 2, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 64, 0, 0, 0, 0, 1, 0, 0, 0, 0, 16, 0, 0, 0, 0, 1, 0, 3, 0, 0, 0, 1, 2, 0, 0, 0, 0, 1, 0, 72, 0, 0, 1, 28, 0, 2, 64, 0] }] });// ]]>
Texas Tribune donors or members may be quoted or mentioned in our stories, or may be the subject of them. For a complete list of contributors, click here.

