The risk of influenza infection depends on biological characteristics, individual or collective behaviors and the environmental context. The Cohorts for Pandemic Influenza (CoPanFlu) France study was set up in 2009 after the identification of the novel swine-origin A/H1N1 pandemic influenza virus. This cohort of 601 households (1450 subjects), representative of the general population, aims to use an integrative approach to study the risk and characteristics of influenza infection as a complex combination of factors. It draws on questionnaire data about the sociodemographic, medical and behavioral characteristics of subjects and their indoor environment, on biological samples and on environmental databases.
Households were included between December 2009 and July 2010. The design of this study relies on systematic follow-up visits between influenza seasons and additional visits during influenza seasons, when an influenza-like illness is detected in a household via an active surveillance system. During systematic visits, a nurse collects individual and environmental data on questionnaires and obtains blood samples from all members of the household. When an influenza-like illness is detected, a nurse visits the household three times during the following 12 days, and collects questionnaire data on exposure and symptoms, as well as biological samples (including nasal swabs) from all subjects in the household. The end of the follow-up period is expected in fall 2012.
The large amount of data collected throughout the follow-up will permit a multidisciplinary study of influenza infections. Additional data is being collected and analyzed in this ongoing cohort. The longitudinal analysis of these households will permit integrative analyses of complex phenomena such as individual, collective and environmental risk factors of infection, routes of transmission, or determinants of the immune response to infection or vaccination.
Keywords: Influenza A virus, H1N1 subtype; Cohort study; Risk factors; France
The first human cases of influenza caused by a novel swine-origin A/H1N1 pandemic influenza virus variant (H1N1pdm) were reported in Mexico and the United States in April 2009. Given the rapid spread of this virus and the likelihood of a pandemic, which the World Health Organization (WHO) confirmed on June 11, 2009, the Cohorts for Pandemic Influenza (CoPanFlu) international consortium was initiated to study individual and collective determinants of H1N1pdm influenza infection across countries. It set up prospective cohorts of households, followed for 2 years, in 6 countries or regions of the world: metropolitan France, Mali, Bolivia, Laos, Reunion Island and Djibouti. The CoPanFlu-France cohort, set up in the general population of metropolitan France, is part of this consortium, and its protocol served as a blueprint for the other international cohorts.
Several studies have already reported risk factors for seasonal or pandemic influenza infection in households. These studies focused on individual characteristics of index patients and their household contacts [5-10] or on hygiene measures as predictors of secondary household infections [11,12]. In addition to household studies, risk factors for seasonal influenza infection have been studied in relation to characteristics of social contacts or the baseline serological status of the host. However, to our knowledge, no attempt has been made to study the risk of influenza infection as a complex combination of biological characteristics (including immunity), individual or collective behaviors and environmental context. This integrative approach, in which epidemiological data are comprehensively collected and analyzed, is currently being developed for non-communicable diseases and relies on methods derived from Genome-Wide Association Studies (GWAS) [15,16]. To achieve our objectives, we developed a multidisciplinary approach, with an original design involving the collection of data on subjects and their environment and of biological samples.
This cohort was designed to assess the relative risk of infection by the H1N1pdm virus. We first intended to include 1000 households (about 2100 subjects), which would have allowed us to detect covariates associated with a relative risk ≥ 1.4 with 80% power at a 5% significance level, assuming a cumulative incidence of 10% and an intra-household correlation of 0.3.
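The effect of intra-household correlation on the planned sample can be illustrated with a short calculation. The sketch below is an assumption on our part, not the authors' documented method: it applies the standard design effect for equal-sized clusters, 1 + (m - 1) × ICC, to the planned figures (1000 households, about 2100 subjects, correlation 0.3). The function names are hypothetical.

```python
def design_effect(mean_cluster_size: float, icc: float) -> float:
    """Design effect for equal-sized clusters: 1 + (m - 1) * icc."""
    return 1.0 + (mean_cluster_size - 1.0) * icc


def effective_sample_size(n_subjects: int, n_households: int, icc: float) -> float:
    """Number of independent subjects carrying the same information."""
    mean_size = n_subjects / n_households
    return n_subjects / design_effect(mean_size, icc)


# Planned cohort as stated in the text; the formula itself is an assumption.
planned_deff = design_effect(2100 / 1000, 0.3)
planned_n_eff = effective_sample_size(2100, 1000, 0.3)

print(f"design effect: {planned_deff:.2f}")
print(f"effective sample size: {planned_n_eff:.0f}")
```

Under these assumptions, about 2100 clustered subjects carry roughly the information of 1580 independent ones. The same functions can be applied to other cohort sizes.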
Households were sampled using a random telephone sampling procedure (Mitofsky–Waksberg design) within a geographically stratified sampling scheme, with the aim of including a sample of subjects as close as possible to the French general population [18,19]. Forty addresses were drawn from the national directory. These addresses defined the centers of 40 areas within which subjects were eligible. Each area was delimited by the smallest circle containing 130,000 household addresses in the public directory. The size of these areas varied (5 to 5000 km²) according to population density (see Figure 1). In each area, two lists of households were drawn:
· A “landline” list of 25 households: these households were those whose phone number immediately followed a landline number drawn in this area. Since landline numbers are geographically allocated, this method made it possible to reach households that had chosen not to be listed in the national directory.
· A “mobile phone” list of 7 households: these households were drawn directly from the national directory, in order to reach households without a landline.
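The two geometric and numeric rules described above (the smallest circle containing a fixed number of addresses, and the "next number" rule for landlines) can be sketched as follows. This is only an illustration under stated assumptions: the great-circle (haversine) distance, the coordinate format, the phone number format and all function names are hypothetical and do not come from the study.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, used by the haversine formula


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (latitude, longitude) points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def area_radius_km(center, addresses, target_count):
    """Radius of the smallest circle around `center` holding `target_count` addresses."""
    if target_count < 1 or target_count > len(addresses):
        raise ValueError("target_count must be between 1 and len(addresses)")
    distances = sorted(
        haversine_km(center[0], center[1], lat, lon) for lat, lon in addresses
    )
    return distances[target_count - 1]


def next_number(landline: str) -> str:
    """Phone number immediately following `landline`, keeping leading zeros."""
    return str(int(landline) + 1).zfill(len(landline))


# Toy data: three addresses on the equator; the study used 130,000 per area.
toy_center = (0.0, 0.0)
toy_addresses = [(0.0, 0.0), (0.0, 1.0), (0.0, 2.0)]
toy_radius = area_radius_km(toy_center, toy_addresses, 2)
toy_next = next_number("0140000009")  # hypothetical 10-digit landline
```

Sorting all distances is the simplest way to express the rule; with a full national directory, a selection algorithm or a spatial index would likely be preferable.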
Figure 1. Distribution of included households in relation to density of population (total: N = 601). White discs represent the 40 areas of the study. Overlapping areas are merged.
Addresses were iteratively drawn from the 40 lists of 130,000 households each. These households were phoned to present the study and, if they met the eligibility criteria, were sent a written description of the study. A household was considered “pre-included” when a referent member sent back a completed form to confirm agreement. Using this method, 1280 households were pre-included, i.e. agreed to be visited by a nurse for an inclusion visit involving all household members. We anticipated that 20% of pre-included households would eventually decline to participate in the cohort.
A household was defined as a person or group of people occupying the same domicile. All households were eligible to participate in the cohort, provided at least one member was over 18 years of age and French-speaking. A household member was defined as a person living at least half his/her time in the household. All household members were eligible, regardless of age. The inclusion of a household required the participation of all members: the refusal of one or more member(s) prevented the inclusion of other members.
Five thousand one hundred and two households were contacted by phone in order to reach our target of 1280 pre-included households (see Figure 2). The proportion of contacted eligible households that agreed to be pre-included varied from 17% to 34% across the 40 areas. The main reasons for non-participation were lack of time and the expected difficulty of collecting blood samples from children.
Figure 2. Flow diagram until the 2nd systematic follow-up visit.
Six hundred and seven households were visited by a nurse for an inclusion visit, six of which ultimately did not agree to participate (refusal of at least one member after receiving more detailed information on the study). Data were collected on the 601 remaining households (1450 subjects). According to population census data, these households had sociodemographic characteristics close to those of the general population (see Additional file 1: Figure S1 and Additional file 1: Tables S1–S6 for details).
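The recruitment figures reported above can be combined into a few crude ratios. The sketch below uses only counts stated in the text; the ratios are simple arithmetic and the labels are ours. In particular, the text does not say whether all 5102 contacted households were eligible, so the first ratio is a crude proportion rather than a formal response rate.

```python
# Counts taken from the text.
contacted = 5102
pre_included = 1280
visited = 607
included_households = 601
included_subjects = 1450
target_households = 1000


def pct(part: int, whole: int) -> float:
    """Percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


flow = {
    "pre-included / contacted (%)": pct(pre_included, contacted),
    "visited / pre-included (%)": pct(visited, pre_included),
    "included / visited (%)": pct(included_households, visited),
    "included / target (%)": pct(included_households, target_households),
}
subjects_per_household = round(included_subjects / included_households, 2)

for label, value in flow.items():
    print(f"{label}: {value}")
print(f"subjects per household: {subjects_per_household}")
```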
The main objective of this study was to identify individual and collective determinants of H1N1pdm infection; we therefore tried to collect comprehensive data about subjects and their environment, in addition to biological samples. Several household visits are carried out by nurses for this purpose (see Figure 3 for details).
· Inclusion visits. During the inclusion visit, nurses collected from all subjects detailed data regarding medical history, vaccination and preventive measures against influenza, smoking habits, socioeconomic status, risk perception and beliefs, frequency and characteristics of meetings with other people, and housing (personal room, house or apartment). As the households’ addresses were geocoded, we were able to obtain additional information from public databases regarding the immediate surroundings of households. An overview of data collected from questionnaires at entry in the cohort is shown in Figure 4. Blood samples were collected and centralized for serological analyses. For subjects over 10 years of age, a heparinized tube was also collected to study cellular immunity, as well as a blood sample dedicated to transcript analyses.
· Systematic yearly visits. After the inclusion visit, systematic follow-up visits are carried out between influenza seasons. During a systematic visit, a nurse collects or updates individual and environmental data on questionnaires, completes previously missing data, and obtains blood samples from all members of the household. Two waves of systematic follow-up visits have already occurred (summer-fall 2010 and 2011). A third wave is expected by the end of the second year of follow-up (summer 2012).
· Influenza-like illness (ILI) visits. During the influenza season (as defined by the French surveillance network), we use an active surveillance system to detect ILIs: every week, all households are called by an interactive voice response system (IVRS) and asked whether any subject has symptoms of ILI (fever ≥ 37.8°C associated with cough or sore throat, as defined by the CDC). A toll-free phone number is given to subjects so that they can report symptoms spontaneously between two weekly calls. When an ILI is reported, symptoms are validated by the study team and three “ILI visits” are then organized: nurses visit the household within 48 h after the onset of symptoms, then 3–6 days and 8–12 days after onset.
· During these visits, a detailed questionnaire collects data about the circumstances of possible exposure to influenza viruses and the chronology of symptoms (if any) in all subjects. Nasal swabs are collected from all subjects. A stool sample and a throat swab are also collected from subjects with ILI, as well as a blood sample from those over 10 years of age. Moreover, a self-swab procedure is sent to households in advance, so that virological samples can be collected when a visit by a nurse within the first 48 h is not possible. Nasal swabs are used to identify various respiratory viruses by PCR and by biochips allowing multiple diagnostic tests.
· This series of three visits can occur several times in the same household during an influenza season. There were 23 ILI alerts during the 2009–2010 season (as households were still being included) and 143 during the 2010–2011 season, each of which triggered up to three ILI visits.
· Vaccination visits. In order to update serological information, a blood sample was collected from subjects who had received an influenza vaccination, between 2 and 4 weeks after vaccination. There was one vaccination visit following the inclusion visits; 29 vaccination visits were conducted following the first wave of follow-up visits and 69 following the second wave.
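The ILI case definition and the timing rules of the additional visits described above can be sketched as follows. This is a minimal illustration, not the study's actual software: the function names are hypothetical, dates are handled at day granularity (so "within 48 h" is approximated as the onset day plus two days), and the validation step by the study team is not modeled.

```python
from datetime import date, timedelta

FEVER_THRESHOLD_C = 37.8  # threshold quoted in the text


def meets_ili_definition(temperature_c: float, cough: bool, sore_throat: bool) -> bool:
    """Fever at or above the threshold, together with cough or sore throat."""
    return temperature_c >= FEVER_THRESHOLD_C and (cough or sore_throat)


def ili_visit_windows(onset: date):
    """(earliest, latest) dates for the three ILI visits after symptom onset."""
    return [
        (onset, onset + timedelta(days=2)),                       # within 48 h
        (onset + timedelta(days=3), onset + timedelta(days=6)),   # 3-6 days
        (onset + timedelta(days=8), onset + timedelta(days=12)),  # 8-12 days
    ]


def vaccination_sample_window(vaccination_day: date):
    """(earliest, latest) dates for the blood sample, 2 to 4 weeks after vaccination."""
    return (vaccination_day + timedelta(weeks=2), vaccination_day + timedelta(weeks=4))


# Hypothetical example dates.
example_windows = ili_visit_windows(date(2011, 1, 10))
example_vaccination = vaccination_sample_window(date(2010, 11, 1))
```

Returning explicit date windows rather than single dates keeps the scheduling tolerance visible, which matters when a nurse cannot visit on the first possible day.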
Figure 3. Design of the study: systematic and additional visits of households by nurses.
The cohort was initially designed to include households before the 2009 pandemic season and to follow subjects during the two subsequent influenza seasons. We obtained funding in June 2009, the cohort protocol and questionnaires were finalized in July 2009, and the protocol obtained ethical approval on September 8, 2009. Households were pre-included between September 25 and December 17, 2009, and inclusion visits began on December 4, 2009, once the final administrative authorizations had been obtained. The inclusion period was extended until July 31, 2010, in order to reach a relevant sample size for the planned analyses, some of which were postponed until the following season. A total of 575 households (96%) were included after the first pandemic season (September 7 to December 27, 2009).
The protocol of the CoPanFlu-France study was approved by the research ethics committee “Comité de Protection des Personnes Ile-de-France 1” on September 8, 2009. Information was previously given by investigators to participants indirectly through written descriptions of the study and training of the nurses, and directly by e-mail and telephone for any question. Written informed consent was obtained for all subjects.
Figure 4. Main data collected on questionnaires at entry in the cohort, in addition to blood samples.
Many analyses have been recently completed or are currently being carried out. Based on inclusion data, we used a data-driven approach to identify factors associated with a high anti-H1N1pdm serological titer. We are conducting several analyses to identify risk factors associated with influenza infections (based on serological data for the first pandemic season, then on both serological and virological data for the following seasons). Nasal swabs are being analyzed to identify various respiratory viruses and the characteristics of infected subjects. Blood samples collected during ILI visits are used to study innate immunity against influenza and the related transcriptome. Determinants of vaccination against influenza have also been identified, and other studies are being conducted in the field of social science and risk perception. Several other analyses are expected soon from different collaboration partners in various biomedical fields.
Strengths of the study
The main strength of this cohort is the large amount of available and expected data and the variety of biological samples to be collected, which will allow many studies to be carried out in various biomedical fields. To our knowledge, this project is the first attempt to study so thoroughly the determinants of infections by respiratory viruses in a large sample of households randomly selected in the community. This approach is likely to provide new insights from the interaction of scattered data usually studied separately, especially with the help of data-driven methods such as those already under development in the field of non-communicable diseases [15,16]. Additional data are being collected and analyzed in this ongoing cohort, whose longitudinal analysis will permit integrative analyses of complex phenomena such as individual, collective and environmental risk factors of infection, routes of transmission, or determinants of the immune response to infection or vaccination.
We designed the CoPanFlu-France study to enable inference to the French general population, yet we cannot exclude a selection bias induced by the proportion of contacted households that refused to participate. However, a comparison between CoPanFlu subjects and population census data suggests that this bias was limited (see supplementary material part 2).
We wish we had been able to set up this project a few months earlier, in order to include households before the 2009 pandemic season and to follow subjects during it. Because of organizational constraints, the inclusion process was delayed and data regarding ILIs were collected retrospectively, sometimes up to 6 months after the epidemic. This inclusion timeline may therefore have induced recall or reporting biases for the 2009 season, and we were not able to collect enough pre-pandemic blood samples and nasal swabs during this first H1N1pdm season to investigate laboratory-confirmed infections. Another consequence of this delayed inclusion process is that we stopped inclusions when only 601 of the 1000 expected households had been included. This shortfall is the main reason why we decided to postpone the end of the study from 2011, as initially planned, to 2012, in order to collect data during an additional influenza season.
Pr Carrat reported not having shares or paid employment with pharmaceutical companies; received honoraria from Novartis, GlaxoSmithKline and Boiron and received travel support to attend scientific meetings from Novartis. Other authors have no competing interest to declare.
NL, FC and AF drafted the protocol of the study. All authors contributed to the conception and design of the study. NL, PF, MSe, RMD and FC participated in the acquisition of data. NL and FC carried out the statistical analyses. NL and FC drafted the manuscript and all authors revised it critically. All authors gave final approval of this version of the manuscript.
This study was supported by the Institut de Microbiologie et Maladies Infectieuses (IMMI-AVIESAN), the Institut de Santé Publique (ISP-AVIESAN), the French Ministry of Health and Assistance Publique Hôpitaux de Paris - PHRC 2010 #AOM10199, the French Ministry of Research and the Institut de Recherche en Santé Publique (IReSP – TGIR 2009).
World now at the start of 2009 influenza pandemic [http://www.who.int/mediacentre/news/statements/2009/h1n1_pandemic_phase6_20090611/en/index.html].
Koita OA, Sangare L, Poudiougou B, Aboubacar B, Samake Y, Coulibaly T, Pronyk P, Salez N, Kieffer A, Ninove L, Flahault A, de Lamballerie X: A seroepidemiological study of pandemic A/H1N1(2009) influenza in a rural population of Mali.
Dellagi K, Rollot O, Temmam S, Salez N, Guernier V, Pascalis H, Gérardin P, Fianu A, Lapidus N, Naty N, Tortosa P, Boussaïd K, Jaffar-Banjee M-C, Filleul L, Flahault A, Carrat F, Favier F, de Lamballerie X: Pandemic influenza due to pH1N1/2009 Virus: estimation of infection burden in Reunion Island through a prospective serosurvey, Austral Winter 2009.
Morgan OW, Parks S, Shim T, Blevins PA, Lucas PM, Sanchez R, Walea N, Loustalot F, Duffy MR, Shim MJ, Guerra S, Guerra F, Mills G, Verani J, Alsip B, Lindstrom S, Shu B, Emery S, Cohen AL, Menon M, Fry AM, Dawood F, Fonseca VP, Olsen SJ: Household transmission of pandemic (H1N1) 2009, San Antonio, Texas, USA, April-May 2009.
Cowling BJ, Chan K-H, Fang VJ, Cheng CKY, Fung ROP, Wai W, Sin J, Seto WH, Yung R, Chu DWS, Chiu BCF, Lee PWY, Chiu MC, Lee HC, Uyeki TM, Houck PM, Peiris JSM, Leung GM: Facemasks and hand hygiene to prevent influenza transmission in households: a cluster randomized trial.
Simmerman JM, Suntarattiwong P, Levy J, Jarman RG, Kaewchana S, Gibbons RV, Cowling BJ, Sanasuttipun W, Maloney SA, Uyeki TM, Kamimoto L, Chotipitayasunondh T: Findings from a household randomized controlled trial of hand washing and face masks to reduce influenza transmission in Bangkok, Thailand.
Coudeville L, Bailleux F, Riche B, Megas F, Andre P, Ecochard R: Relationship between haemagglutination-inhibiting antibody titres and clinical protection against influenza: development and application of a bayesian random-effects model.
J Am Stat Assoc 1978, 73:40-46.
Databases - The population census [http://www.insee.fr/en/bases-de-donnees/default.asp?page=recensements.htm].
Case Definitions for Infectious Conditions Under Public Health Surveillance [http://www.cdc.gov/osels/ph_surveillance/nndss/casedef/novel_influenzaA.htm].