Big Data Offer New Strategy For Public Health Campaigns
Chicago health officials had a serious problem. The city had long been trying to attack breast cancer among minorities with a program offering uninsured women free mammograms at in the predominantly black South Side. But black women 鈥 who are far more likely than white women to die of breast cancer 鈥 weren鈥檛 getting screened.
Because traditional public health outreach didn鈥檛 seem to be working, the city鈥檚 Department of Public Health decided to do something new: It turned to a Chicago-based data mining company, Civis Analytics, for help.
Data mining, often employed by political teams and mass marketers, uses statistical analysis to find patterns within large data sets to project trends about individual behavior and demographics.

Civis, a private company with offices in Chicago and Washington, D.C., was formed by members of the data analytics team from President Barack Obama鈥檚 re-election campaign. Back then, as campaign staffers, they used their skills to identify Obama voters for a get-out-the-vote effort. Later, after the company was formed, Civis employees worked with Enroll America, a nonprofit group, to find people to sign up for health insurance under the Affordable Care Act.
When Civis teamed up with Chicago鈥檚 health department, it moved on to another health-related mission: to help the city refine its outreach for the breast cancer screening program by using its big-data tool box to identify uninsured women aged 40 and older living in the South Side.
This project represents a distinctive step in public health outreach, said Jonathan Weiner, professor and director of the Johns Hopkins Center for Population Health IT in Baltimore. But Chicago is not the only city investigating how population data can be used in health programs, he added, citing New York City, Baltimore and San Diego as other examples.
鈥淚t鈥檚 a growing trend that some of the techniques first developed for commercial applications are now spinning off for health applications,鈥 he said. So far, he said, 鈥渢hese techniques have not been as widely applied for social good and public health,鈥 but that appears to be changing.
As it happens, in this case, the company didn鈥檛 have to start from scratch.
鈥淭here鈥檚 pretty rich data鈥 out there, said Kate Jordan, an engagement manager at Civis who leads the company鈥檚 health practice. The U.S. Census Bureau tracks demographic factors such as gender, race, income, location and insurance status 鈥 data, Jordan said, that with the right analysis, 鈥渃an tell us a lot about the statistical mathematic pattern between various characteristics and whether or not a person might be uninsured.鈥
Civis鈥 side of the project, which it did as a pro bono initiative, began after the city reached out in July. It took about two months, Jordan said, enlisting between two and five Civis employees 鈥 analysts, statisticians and data experts 鈥 at any given time, though she and another senior analyst did the bulk of the work.
The Civis team sorted through material from the Census Bureau鈥檚 to uncover how characteristics such as gender, race and income correlate to insurance status. (The survey is an annual project that collects responses to questions varying from age to income to education from a small but representative group of people.)
Civis then used its own survey findings to build an algorithm to predict, with confidence, where, by neighborhood, the uninsured are most likely to cluster. It鈥檚 鈥渁 proprietary way of walking public data onto our own data,鈥 Jordan said.
鈥淣eighborhoods have a high rate of uninsurance or a low rate of uninsurance, which makes a lot of sense,鈥 she said, since the traits that correlate to whether people purchase insurance often also seem to influence where they live.
From that point, the team was also able to locate women who were likely to lack insurance and who fell into the appropriate age range. The city then used this information to mail fliers to about 5,000 women, according to Jay Bhatt, chief innovation officer and managing deputy commissioner at the health department.
Meanwhile, at Roseland Hospital, staffers were eager for assistance. The mailings 鈥済enerated quite a bit of buzz,鈥 said Nikisha Coleman, a Roseland spokeswoman. For instance, she said, the hospital usually provides fewer than 10 free mammograms per month. But in October, the month the city sent out the mailers, 31 women came in for the free service. She expects that, for months, they will continue to receive telephone inquiries and do more screenings.
鈥淭o send out that sort of mailing, it creates awareness,鈥 Coleman said. That鈥檚 important, said Rhonda Perdue, a Roseland staff member who does outreach for the hospital鈥檚 mammography program. 鈥淢ost of us who have insurance, who go to the doctor,鈥 are reminded about getting mammograms, but that鈥檚 not true for the uninsured, she said.
The city intends to 鈥渃ontinue its approach in leveraging open data and predictive analytics鈥 to address public health issues, a department spokeswoman said. Civis, for its part, 鈥渁bsolutely鈥 sees potential for doing more such public health projects, and not just in Chicago, according to Jordan.
But population modeling isn鈥檛 a silver bullet, said Weiner, the Johns Hopkins professor.
鈥淚dentifying the problem, looking at both community and individuals who are in need of services, that鈥檚 definitely part of what鈥檚 needed,鈥 Weiner said. 鈥淏ut having solutions once you identify the problem 鈥 that鈥檚 far more challenging, and the data alone will not solve the issue.鈥