# 9. Credit-excluded households in a developing country Solutions

These are not model answers. They are provided to help students, including those doing the project outside a formal class, to check their progress while working through the questions using the Excel, R, or Google Sheets walk-throughs. There are also brief notes for the more interpretive questions. Students taking courses using Doing Economics should follow the guidance of their instructors.

## Part 9.1 Households that did not get a loan

• Solution figure 9.1 indicates the proportion of households that lived in each region and area type.
Region Proportion in the region living in small towns Proportion of regional households living in large towns Proportion of regional households living in rural areas Proportion of all households living in the region
Addis Ababa 0 1.00 0 0.06
Afar 0.15 0.10 0.76 0.03
Amhara 0.11 0.22 0.67 0.20
Benshagul Gumuz 0.10 0.00 0.90 0.02
Diredwa 0.00 0.47 0.53 0.04
Gambelia 0.08 0.12 0.80 0.02
Harari 0.00 0.27 0.73 0.03
Oromia 0.11 0.28 0.61 0.20
SNNP 0.09 0.18 0.73 0.23
Somalie 0.09 0.16 0.76 0.06
Tigray 0.07 0.37 0.56 0.12
Grand total 0.09 0.28 0.63 1.00

Solution figure 9.1 Proportion of sample living in towns vs rural areas, by ‘region’. (Note that numbers may not add up to 1 due to rounding.)

• 30.40% of households have a female household head.
• Solution figure 9.2 provides the summary table for the variables (‘SD’ refers to standard deviation).
Mean SD Min Max
Household size 4.58 2.40 1.00 16.00
Gender 0.30 0.46 0.00 1.00
Age 44.18 15.61 3.00 99.00
Young children 1.89 1.71 0.00 10.00
Working-age adults 2.58 1.52 0.00 10.00
Max education 7.53 7.28 0.00 30.00
Number of assets 14.90 17.23 0.00 203.00

Solution figure 9.2 Summary table for household demographics (all households).

• The average household size is 4.58 (or 5 people, rounding to the nearest person), with around 1–2 children and 2–3 working-age adults. On average, the most educated person in the household has had 7-8 years of schooling. Households have around 15 assets on average.
• Note that only households with information for all three variables are included.
Not rejected Rejected Blank or NA Sum
Applied for loan 1,363 201 1 1,565
Did not apply for loan 3,632 24 2 3,658
Blank or NA 37 2 0 39
Grand total 5,032 227 3 5,262

Solution figure 9.3 Loan applications and approvals.

• There are some responses that do not make sense. For example, 1,363 households applied for a loan and received the loan. Two households did not answer whether they applied for a loan but at the same time indicated that they were rejected a loan. In fact, 24 households indicated that they did not apply for a loan, yet they indicated that they were refused a loan. As the questions about loan application and loan rejection refer to the same 12-month period, these observations clearly make no sense, so we decided to exclude them from the data set analysed. This decision is contestable, since we exclude more than 10% of households that indicate that they were refused a loan. Thus, it is important to be transparent about this decision.

We shall also remove all observations that have missing data for any of these two questions. This leaves us with 1,363 + 3,632 + 201 = 5,196 observations (those highlighted in green in Solution figure 9.3).

Of these observations, 30.10% of households applied for a loan. 96.13% of these were successful.

• The 26.23% of households that applied for a loan and received one are clearly not credit excluded. However, they may still be credit constrained, as the mere fact that they received a loan says nothing about whether the terms were favourable or not (‘favourable’ being a criterion which may be somewhat difficult to pin down). The 69.90% of households that did not apply for a loan could be credit excluded, credit constrained, or neither as they may not have a need for a loan. It is here where additional information on the reasons for not applying will become important. The 3.87% of households that did apply but were refused a loan can be described as credit excluded.
• Create the ‘HH_status’ variable in your software.
• After creating this variable in your software, you should find that 588 (11.32%) households were discouraged borrowers (4,608 were not discouraged).
• 3,012 (57.97%) households were credit constrained (2,184 were not).
• Solution figures 9.4 and 9.5 provide frequency tables (with values expressed in proportions) showing the most important and the second most important reasons for not applying for a loan.

Across the two tables it is clear that there is a substantial portion of households that do not want to be in debt. Others indicate that they do not believe they can afford a loan (‘Fear not to be able to pay’ or ‘Believe would be refused’). These households would certainly be considered credit constrained, or even credit excluded.

Almost 20% of households also state that they do not have any need for a loan (‘Have adequate farm’).

Reason Proportion
Do Not Like To Be In Debt 0.19
Fear Not Be Able To Pay 0.17
Believe Would Be Refused 0.12
Do Not Know Any Lender 0.07
Too Expensive 0.05
Other (Specify) 0.04
Too Much Trouble 0.03

Solution figure 9.4 Most important reason for not applying for a loan.

Reason Proportion
Fear Not Be Able To Pay 0.28
Do Not Like To Be In Debt 0.24
Believe Would Be Refused 0.09
Too Expensive 0.07
Do Not Know Any Lender 0.06
Too Much Trouble 0.06
Other (Specify) 0.02

Solution figure 9.5 Second most important reason for not applying for a loan.

1. Solution figure 9.6 compares the loan purposes of ‘successful’ and ‘denied’ borrowers. Compared with all households, a greater proportion of households that got loans indicated the purpose for the loan as consumption, and a smaller proportion of households that got loans indicated the purpose as for investment.

Note: The entry ‘10’ is likely due to incorrect recording of responses, since ‘10’ is not a valid category in the original survey.

Purpose Successful Denied
10 0.00 0.02
Other (specify) 0.03 0.26
Purchase agricultural inputs for food crop 0.30 0.21
Purchase house/lease land 0.02 0.03
Purchase inputs for other crops 0.01 0.06
Purchase non-farm inputs 0.12 0.03
For consumption and personal expenses 0.20 0.00

Solution figure 9.6 Loan purpose for successful and denied (credit-excluded) borrowers.

### Note

Although most categories are the same, the two tables are not exactly comparable because ‘Consumption and personal expenses’ was a separate category from ‘Other’ for successful borrowers, but was considered as one of the responses in ‘Other (specify)’ for credit-excluded households. Looking at the variable loan_purpose_other in the ‘All households’ tab, we can see that reasons related to consumption and personal expenses (such as medical, school, transport expenses) apply to around 20 or so households.

• Solution figure 9.7 compares the averages of the specified household characteristics.
Household characteristic Successful Denied
Number of observations 1,363 201
Age of household head 43.40 41.20
Highest education in household 7.26 8.00
Number of assets 15.90 14.50
Household size 4.87 4.82
Number of young children 2.09 2.22
Number of working-age adults 2.75 2.76

Solution figure 9.7 Comparison of household characteristics (successful and denied borrowers).

• Explanations of how characteristics may affect a household’s ability to get a loan:

• Younger households are more likely to be credit excluded compared to older people because they are more likely to have less assets and unstable employment.
• Less educated households are less likely to get higher paid jobs and hence less likely to repay the loans. We can thus expect more loans given to more educated people or people with a higher-educated household head.
• Assets can serve as collateral. We can expect households with less collateral to receive fewer loans because the risks involved for the lenders are higher.
• Households with more members to share the repayment burden are more likely to be able to repay loans.
• Having a greater number of children improves the prospects of repayments as the children will eventually grow up and become productive members of the family. Governments may provide benefits to families with more children.
• Having a greater number of working-age adults means greater repayment capability.
• The data reveals patterns consistent with our expectations, except in the pattern for the relation between education and loans. The data shows that credit-excluded people tend to have better education. One possible explanation is that people with more educated family members may be more likely to apply for loans because they are more likely to be aware of how to apply for loans and be more confident of their abilities to get a loan and repay. However, it is also possible that education is also correlated with other variable(s) that are significantly correlated with the likelihood of receiving a loan.
• There are many possibilities for conditioning on the variables ‘rural’ or ‘region’. The example in Solution figure 9.8 shows means conditioned on household status (‘successful’ or ‘denied’) and the ‘rural’ variable.

Some patterns remain the same, though we can now see that in rural areas, successful borrowers have on average more education. In terms of numbers of children and working-age adults, the conditioning on the ‘rural’ variable changed the values. Now we can see that successful borrowers tend to have on average more children and more working-age adults.

However, we ought to interpret these differences carefully as some of the groups have very small numbers of observations.

Rural   Small town (urban)   Large town (urban)
Successful Denied Successful Denied Successful Denied
Number of observations 903 128 128 17 332 56
Age of household head 45.10 43.40 42.10 37.50 39.20 37.30
Highest education in household 5.00 4.55 9.59 13.12 12.51 14.3
Number of assets 13.60 12.10 15.70 16.70 22.10 19.20
Household size 5.35 5.47 4.12 4.06 3.84 3.57
Number of young children 2.49 2.84 1.56 1.41 1.20 1.05
Number of working-age adults 2.91 2.88 2.54 2.33 2.42 2.36

Solution figure 9.8 Comparison of household characteristics, conditioning on the ‘rural’ variable.

• The difference in means is shown in the table in Solution figure 9.9.
• The standard deviation (SD) for the difference in means, and the number of observations in both groups, are shown in the table below.
Successful borrowers Denied borrowers Difference in means
Household characteristic Mean SD N Mean SD N Difference in means CI lower CI upper
Age of household head 43.40 14.30 1,361 41.20 12.90 201 2.15 0.21 4.09
Highest education in household 7.26 6.74 1,361 8.00 7.90 201 −0.73 −1.89 0.42
Number of assets 15.90 19.00 1,361 14.50 16.80 201 1.42 −1.13 3.97
Household size 4.87 2.35 1,361 4.82 2.35 201 0.05 −0.30 0.40
Number of young children 2.09 1.68 1,361 2.22 1.80 201 −0.14 −0.40 0.13
Number of working-age adults 2.75 1.49 1,361 2.76 1.42 201 −0.003 −0.22 0.21

Solution figure 9.9 Calculating 95% confidence interval for difference in means between ‘successful’ and ‘denied’ borrowers.

• Solution figure 9.10 shows the differences in household characteristics.

Solution figure 9.10 Difference in means between ‘successful’ and ‘denied’ borrowers by household characteristics, with 95% confidence intervals.

• On average, successful borrowers have an older household head (2 years older) and 1–2 more assets than denied borrowers, though these differences are quite imprecisely estimated (as with the differences for all other characteristics. Therefore the sample differences we see could well come from a population where there is no difference between households which were denied a loan and those who were successful borrowers. Recall that 95% confidence intervals may not always contain the respective population means, so the above conclusions are never definite.
• See Solution figure 9.11.
Household characteristic Successful borrowers Denied borrowers Discouraged borrowers Constrained borrowers
Number of observations 1,363 201 588 3,012
Age 43.37 41.21 43.28 44.84
Highest education in household 7.26 8.00 6.50 7.14
Number of assets 15.88 14.46 10.16 13.54
Household size 4.87 4.82 4.65 4.44
Number of young children 2.09 2.22 2.03 1.81
Number of working-age adults 2.75 2.76 2.49 2.49

Solution figure 9.11 Comparison of household characteristics by borrower type.

• We can see that, in addition to what we analysed before, discouraged borrowers have less education and fewer assets. Constrained borrowers have smaller household sizes.
1. One example of selection bias is the study of the determinants of prosperity. To do this, we can compare developing with developed countries. However, only developing countries with adequate resources and willingness collect and submit their data. The set of developing countries for which we have data is thus not representative of the population of developing countries.

## Part 9.2 Households that got a loan

• No solution is provided.
• There are a total of 1,480 observations for these variables. There are six blanks for the start date (0.41%) and 535 (36.15%) for the end date. Hence for more than a third of observations we have no end date.
• No solution is provided.
• No solution is provided.
• No solution is provided.
• Percentage of the loans that were long term: 22.80% if using the measure of loan duration with negatives replaced by absolute values; and ignoring blanks (NAs).
• Numbers are rounded to the nearest whole number, and shown in Solution figure 9.12.
Mean SD Min Max
Loan amount (principal) 26,896 783,587 1 30,000,000
Total amount 29,223 827,144 20 31,260,000

Solution figure 9.12 Summary measures of loan amount (principal) and total amount.

• No solution is provided.
• 50.52% of the loans are zero interest. There is one loan for which the interest rate is 200%. It may not be evident on a column chart (histogram), but you can also identify the extreme value(s) with a scatterplot, as shown in Solution figure 9.13.

Solution figure 9.13 Loan amounts and interest rates.

• Solution figure 9.14 shows summary tables of measures of the loan amount and the interest rate. (Note: It is good practice to show the number of observations in a particular group.)

Short-term loans are, on average, for smaller amounts. A greater proportion of short-term loans offer very low interest rates. Smaller proportions of short-term loans offer high interest rates. The spread of interest rates and loan amounts is much larger for long-term loans than for short-term loans.

Loan amount N Mean SD Min Max 1st quartile 2nd quartile 3rd quartile
Long term 211 172,718 2,072,736 20 30,000,000 1,000 3,700 8,000
Short term 717 3,017 8,398 40 150,000 480 1,500 3,500
Interest rate N Mean SD Min Max 1st quartile 2nd quartile 3rd quartile
Long term 211 0.19 0.27 0.00 2.24 0.00 0.14 0.25
Short term 717 0.11 0.17 0.00 1.12 0.00 0.05 0.17

Solution figure 9.14 Comparison of distribution of long-term and short-term loans.

• Solution figure 9.15 shows the correlation between the interest rate and household characteristics. Here, we recoded gender as a dummy variable (1 = Female, 0 = Male).

The correlations are all fairly weak. We would expect that lenders would charge lower interest rates to households with more working-age adults and more members because these households are better able to repay, and the risks associated with lending to them are therefore lower. This is not what we observe in the data. The rest of the correlations have the expected sign (positive/negative).

Household characteristics Interest rate
Household size 0.11
Gender –0.02
Age 0.03
Young children 0.10
Max education −0.08
Number of assets –0.05

Solution figure 9.15 Correlations between interest rate and household characteristics.

• Solution figures 9.16 and 9.17 show the proportion of loans with sources of finance ‘borrowed_from’ and ‘borrowed_from_other’. (Numbers are quoted to 3 decimal places due to small values).

Compared to rural households, urban households are more likely to borrow from banks and employers. Rural households are more likely to borrow from microfinance institutions and governments.

Proportion of 'borrowed_from'
Source of finance Large town (urban) Rural Small town (urban) Total
Bank (commercial) 0.02 0.00 0.00 0.01
Employer 0.04 0.00 0.01 0.01
Grocery/Local Merchant 0.08 0.05 0.10 0.06
Microfinance Institution 0.19 0.28 0.27 0.26
Money Lender (Katapila) 0.00 0.05 0.02 0.04
Neighbour 0.11 0.12 0.07 0.11
NGO 0.01 0.05 0.05 0.04
Other (specify) 0.06 0.12 0.04 0.10
Relative 0.49 0.32 0.43 0.37
Religious Institution 0.00 0.02 0.00 0.02

Solution figure 9.16 Source of loan, by variable ‘rural’.

Proportion of borrowed_from_other
Source of finance Large town (urban) Rural Small town (urban) Total
Bank 0.00 0.01 0.00 0.01
Cooperatives 0.42 0.43 0.40 0.43
Equib 0.05 0.01 0.00 0.01
From government 0.05 0.30 0.00 0.26
From individuals 0.05 0.01 0.00 0.01
From private business 0.00 0.02 0.00 0.02
From relatives 0.05 0.00 0.20 0.01
From women association 0.00 0.01 0.00 0.01
From Youth Association 0.00 0.01 0.00 0.01
HAB project 0.00 0.02 0.00 0.01
Iddir 0.16 0.15 0.00 0.14
Micro and small enterprise 0.11 0.00 0.00 0.01
Micro finance 0.00 0.04 0.40 0.05
Mobile 0.05 0.00 0.00 0.01
NGO 0.05 0.00 0.00 0.01

Solution figure 9.17 Source of loan, by variable ‘rural’ (‘Other’ category only).

• Solution figures 9.18, 9.19, and 9.20 provide tables for each of the variables.

There are many possible comparisons to make, for example:

• Duration of loan: In large towns and rural areas, the average loan duration of banks is the longest. Across all types of areas, informal sources (such as relatives or neighbours) tend to lend for shorter durations on average.
• Loan amount: Banks lend larger amounts on average.
• Interest rate: The interest rate from informal sources is not necessarily higher (on average) than interest rates from formal sources such as banks. Lending from people that know the household (employer, relatives, neighbours) incurs lower average interest rates, possibly because the principal–agent problem is less severe.
Source of finance Large town (urban) Rural Small town (urban)
Bank (commercial) 1,814 619 0
Employer 602 290 0
Grocery/Local Merchant 166 259 176
Microfinance Institution 712 411 510
Money Lender (Katapila) 365 332 365
Neighbour 125 187 296
NGO 373 395 236
Other (specify) 274 372 806
Relative 237 217 393
Religious Institution 1,461 343 0
(blank) 1,096 289 151

Solution figure 9.18 Duration of loan (rounded to nearest day).

Source of finance Large town (urban) Rural Small town (urban)
Bank (commercial) 5,755,833 3,575 0
Employer 5,915 1,350 1,000
Grocery/Local Merchant 1,370 1,429 1,599
Microfinance Institution 14,525 3,852 7,543
Money Lender (Katapila) 350 1,360 5,150
Neighbour 602 837 686
NGO 5,532 1,4475 2,300
Other (specify) 3,912 1,842 3,829
Relative 7,872 1,576 6,802
Religious Institution 50,000 910 0

Solution figure 9.19 Loan amount (rounded to nearest whole number).

Source of finance Large town (urban) Rural Small town (urban)
Bank (commercial) 0.14 0.26 0.00
Employer 0.04 0.06 0.00
Grocery/Local Merchant 0.00 0.09 0.00
Microfinance Institution 0.16 0.18 0.15
Money Lender (Katapila) 1.00 0.43 0.09
Neighbour 0.00 0.12 0.00
NGO 0.06 0.12 0.05
Other (specify) 0.13 0.16 0.22
Relative 0.01 0.08 0.00
Religious Institution 0.00 0.20 0.00

Solution figure 9.20 Interest rate (rounded to two decimal places).

• The following example uses a dummy variable called ‘Gender numerical’ which is equal to 1 for ‘Female’ and 0 for ‘Male’. The averages of this variable across categories are equivalent to the proportions.

Households with female heads are relatively more likely to borrow from grocery/local merchants, neighbours, NGOs, and relatives.

Source of finance Large town (urban) Rural Small town (urban)
Bank (commercial) 0.00 0.50 0.00
Employer 0.31 0.00 0.00
Grocery/Local Merchant 0.39 0.19 0.50
Microfinance Institution 0.41 0.13 0.31
Money Lender (Katapila) 0.00 0.27 0.50
Neighbour 0.35 0.25 0.43
NGO 0.25 0.31 0.20
Other (specify) 0.39 0.19 0.25
Relative 0.40 0.21 0.29
Religious Institution 1.00 0.24 0.00
(blank) 1.00 0.43 1.00

Solution figure 9.21 Proportion of households with a female head, according to source of finance.

• Government policies such as tax reliefs and benefit, distance and/or access to various sources of financing, and stability of employment are all examples of variables that can be important.
1. One hypothesis is that lack of knowledge of loan availability affects access by households to lending. The government could fund education programmes aimed at improving the poor’s understanding of financial services. The government could identify two poor regions with similar characteristics that are distant from each other. One region would serve as the control group while the other as the treatment group. The government could then randomly select and educate individuals from the treatment region. The causal effects of the policy could be assessed by comparing outcomes in the two regions after the treatment. During the study period, the government should avoid implementing other policies in the regions, especially those that can affect the regions differently. The two regions should be chosen such that they would evolve in similar ways without the policy, and that treatments on one region cannot indirectly affect outcomes in the other region.