This map shows the distribution of total serum samples from 4 collection times for each district, with the triangle representing the location of Children’s Hospital 1. The majority of serum samples were collected from districts on the West side of Children’s Hospital 1. The following section will explore the distribution of sample numbers for each district’s collection times.
Fig 2 shows the age structure of each district’s collection times. I divided continuous age variables into five groups: <0 & ≤3 years old,<3 & ≤6 years old,<6 & ≤9 years old, <9 & ≤12 years old,<12 & ≤15 years old to facilitate the visualization (instead of visualizing 15 age group).
In this part, I used a generalized linear model for districts to fit the seropositive by age with formula \(seropositive \sim age + age^{2}\).
Code
# data_sle %>% # ggplot(aes(x = age, y = pos)) +# geom_jitter(height = 0.05)+# facet_wrap(vars(qhchuan,# factor(col_time,levels = c("Dec 2022",# "Apr 2023",# "Aug 2023",# "Dec 2023"))),ncol=4)+# binomial_smooth(formula = y ~ x + I(x^2) )+# labs(x = "Collection date", y = "Seroprevalence")+# theme_bw()data_pt %>%ggplot(aes(x = age, y = pos)) +geom_jitter(height =0.05)+# geom_point(shape = "|",size = 2,# col = "#0808cf")+facet_wrap(~qhchuan,ncol=5)+binomial_smooth(formula = y ~ x +I(x^2))+labs(x ="Age", y ="Seroprevalence")+theme_bw()
Fig 3
Fig 3 shows the result of the fitting process: visualization with seroprevalence (solid line) with a 95% confidence interval (coloured area) and serum sample (point).
3.3 Seroprevalence in the same age group over time
# | fig-width: 13# | fig-height: 8# | out-width: "100%"# | fig-cap: "Fig 4"# ggplot(data_pt,# aes(x = col_date, y = pos)) +# geom_jitter(height = 0.05)+# facet_wrap(~age_gr,# ncol = 5)+# binomial_smooth(formula = y ~ x + I(x^2))+# labs(x = "Collection date", y = "Seroprevalence")+# theme_bw()+# theme(axis.text.x = element_text(vjust = 0.5,angle = 45))##Fig 4 gives information on the seroprevalence of the 15 age groups. The seroprevalence of the age group 1-5 years old increased, which reflect the outbreak's impact in seroprevalence. The outbreak began in June, and 95% of symptomatic cases were aged 0-6 years old; and seroprevalence result also increased in 1-6 years old. However, there is a contrast; our serological shows an increase in the 2-3 years old age group in the 2nd peak (September), while the leading symptomatic group contributing to this peak is children aged 0-2 years old.
I combined all samples for four collection times and fitted the GLM model with formula \(seropositive \sim age + age^{2}\) for each age group. Fig 4 show the seroprevalence over time in the same age group.
For our assumption, asymptomatic cases of children who went to school infected their siblings. As you can see, the high seroprevalence of children at school age at the beginning of 2023 may cause the infection in their siblings aged 0-6, who have low seroprevalence simultaneously.
Code
ggplot(data_pt,aes(x = col_date, y = pos)) +geom_jitter(height =0.05)+facet_wrap(~age_gr2,ncol =5)+binomial_smooth(formula = y ~ x +I(x^2))+labs(x ="Collection date", y ="Seroprevalence")+theme_bw()+theme(axis.text.x =element_text(vjust =0.5,angle =45))
Fig 4
3.4 Investigate location of 0-3 year-old serosample