SLEAC Classification

The SLEAC method classifies program coverage for a service delivery unit such as a health district. A SLEAC survey does not provide an estimate of overall coverage with a confidence interval for a single service delivery unit. Instead, a SLEAC survey identifies the category of coverage (e.g. low, moderate, or high) that best describes the coverage of the service delivery unit being assessed. The advantage of this approach is that relatively small sample sizes (e.g. n = 40) are required in order to make accurate and reliable classifications.

SLEAC uses the lot quality assurance sampling (LQAS) classification technique.

The SLEAC survey sample is designed to represent an entire service delivery unit such as a district.
A target sample size for SLEAC surveys is decided in advance of data-collection. This is usually about n = 40 SAM cases.
SLEAC surveys may classify coverage into three (or more) classes.

Analysis of data using the simplified LQAS classification technique involves examining the number of cases found in the survey sample (n) and the number of covered cases found:

If the number of covered cases found exceeds a threshold value (d) then coverage is classified as being satisfactory.
If the number of covered cases found does not exceed this threshold value (d) then coverage is classified as being unsatisfactory.

The threshold value (d) depends on the number of cases found (n) and the standard (p) against which coverage is being evaluated. A specific combination of n and d is called a sampling plan.

The following rule-of-thumb formula may be used to calculate a suitable threshold value (d) for any coverage proportion (p) and any sample size (n):

\[d ~ = ~ \left \lfloor n ~ \times ~ \frac{p}{100} \right \rfloor\]

For example, with a sample size (n) of 40 and a coverage proportion (p) of 70% an appropriate value for d would be:

\[d ~ = ~ \left \lfloor n ~ \times ~ \frac{p}{100} \right \rfloor ~ = ~ \left \lfloor 40 ~ \times ~ \frac{70}{100} \right \rfloor ~ = ~ \left \lfloor 40 ~ \times ~ 0.7 \right \rfloor ~ = ~ 28\]

It is unlikely that a SLEAC survey will return the target sample size (n) exactly. If a survey does not return the target sample size (n) exactly then the classification threshold value (d) should be recalculated using the achieved sample size. For example, if the target sample size is 40 but the achieved sample size is 43 and given a classification threshold of 70%, the appropriate classification threshold can be recalculated as follows:

\[d ~ = ~ \left \lfloor 43 ~ \times ~ \frac{70}{100} \right \rfloor ~ = ~ 30\]

Coverage is classified as described above. For example, out of the 43 SAM cases found only 34 are in the programme, the number of covered cases can be compared to the the decision threshold d calculated earlier. Since 34 > 30, then coverage can be classified as satisfactory.

The lqas_classify_cf() function in sleacr can perform this classification as follows:

lqas_classify_cf(cases_in = 34, cases_out = 43 - 34, threshold = 0.7)
#> [1] 1

Extending the classification method to yield finer classifications

The simplified LQAS classification technique provides binary or two-tier classifications.

The method may be extended to provide more granular classifications.

Three classes are sufficient for most SLEAC applications. A three-tier classification method is particularly useful for identifying very high coverage service delivery units and very low coverage service delivery units for inclusion in subsequent investigations.

Three-tier classifications require two sampling plans/decision rules. These are created using the rule-of-thumb formula presented earlier.

For three-tier classifications there are two coverage proportions:

p1: The upper limit of the low coverage tier or class

p2: The lower limit of the high coverage tier or class

The moderate coverage class runs from p1 to p2.

Two classification thresholds (d1 and d2) are used and are calculated as:

\[d_1 ~ = ~ \left \lfloor n ~ \times ~ \frac{p_1}{100} \right \rfloor\]

\[d_2 ~ = ~ \left \lfloor n ~ \times ~ \frac{p_2}{100} \right \rfloor\]

Classifications are made using the algorithm below.

Two-tier classification algorithm used in SLEAC

This three-class classification works well with small sample sizes (e.g. n = 40) provided that the difference between p1 and p2 is greater than or equal to 25 percentage points.

Here is an example of the calculations required for a sample size of 40 and classification thresholds of 30% and 70%:

\[d_1 ~ = ~ \left \lfloor n ~ \times ~ \frac{p_1}{100} \right \rfloor ~ = ~ \left \lfloor 40 ~ \times ~ \frac{30}{100} \right \rfloor ~ = ~ 12\]

\[d_2 ~ = ~ \left \lfloor n ~ \times ~ \frac{p_2}{100} \right \rfloor ~ = ~ \left \lfloor 40 ~ \times ~ \frac{70}{100} \right \rfloor ~ = ~ 28\]

Based on these decision rules, the following classifications are made:

Number of covered cases	Classification
1 to 12	Low (< 30%) coverage
13 to 28	Moderate (between 30% and 70%) coverage
29 to 40	High (>= 70%) coverage

If a survey does not return the target sample size (n) exactly then the classification thresholds (d1 and d2) should be calculated using the achieved sample size and classifications made using the algorithm illustrated above. For example, a survey classifying coverage in individual clinic catchment areas using a target sample size of forty (40) cases for each catchment area and the class boundaries p1 = 30% and p2 = 70% returned the following data:

Clinic catchment area	Sample size	d1	d2	Number of covered cases	Classification
Chawama	38	11	26	29	High
Matero	32	9	22	18	Moderate
Makeni	43	12	30	36	High
Chipata	35	10	24	15	Moderate
Ngombe	42	12	29	14	Moderate
Kalingalinga	37	11	25	10	Low
Mtendere	39	11	27	5	Low
Kanyama	42	12	29	23	Moderate
All	308	92	215	150	Moderate

In this example the target sample size was applied to each of the clinic catchment areas separately. This allows coverage classifications to be made for individual clinic catchment areas. This approach enables the identification of low coverage and high coverage service delivery units (clinics in this example) for subsequent investigations.

The lqas_classify_cf() function in sleacr can perform this classification as well. This can be done as follows:

samp_size <- c(38, 32, 43, 35, 42, 37, 39, 42)
in_cases <- c(29, 18, 36, 15, 14, 10, 5, 23)

lqas_classify_cf(
  cases_in = in_cases, cases_out = samp_size - in_cases, 
  threshold = c(0.3, 0.7), label = TRUE
)

which produces the following outputs:

#> [1] "High"     "Moderate" "High"     "Moderate" "Moderate" "Low"      "Low"     
#> [8] "High"

Mark Myatt

Extending the classification method to yield finer classifications