CENSOAGRO

Overview

The Census of Agriculture (Censo Agropecuário) is Brazil’s comprehensive survey of agricultural establishments and activities, conducted by IBGE (Instituto Brasileiro de Geografia e Estatística). This census collects detailed information about:

The census provides critical data for agricultural policy, market research, and understanding the structure of Brazilian agriculture across regional and temporal dimensions.

Data Coverage

Data is collected at multiple geographic levels: - Country level: aggregate national statistics - State level: disaggregated by Brazilian states - Municipality level: available for select datasets (currently "livestock_production")

Historical data spans from 1920 onwards, with different time series available for different datasets based on IBGE’s survey methodology evolution.


Available Datasets

1. agricultural_land_area

Provides comprehensive data on total agricultural land area and the number of agricultural properties.

2. agricultural_area_use

Details how agricultural properties use their land (crop farming, pasture, forests, etc.).

3. agricultural_employees_tractors

Captures information about the agricultural workforce and mechanization levels.

4. agricultural_producer_condition

Describes the tenure status of agricultural land (ownership, rental, partnership, etc.).

5. animal_production

Details the number of livestock animals farmed by species and type.

6. animal_products

Quantifies production volumes of animal-based products.

7. vegetable_production_area

Provides detailed crop production data including area planted and volume produced.

8. vegetable_production_temporary

Focuses specifically on temporary crops (annual crops that must be replanted each season).

9. vegetable_production_permanent

Focuses on permanent crops (perennial crops that produce for multiple years).

10. livestock_production

Specialized dataset on bovine cattle production and related establishments.


Function Parameters

1. dataset

Selects which dataset to download. See dataset descriptions above.

dataset = "agricultural_land_area"  # character string

2. raw_data

Controls whether to download the original data or the processed/cleaned version.

Default behavior: Raw data typically requires more cleaning and interpretation, while treated data is ready for immediate analysis.

raw_data = FALSE  # logical

3. geo_level

Specifies the geographic aggregation level.

geo_level = "state"  # character string

4. time_period

Defines which year(s) to download. Availability varies by dataset:

Dataset Available Years
agricultural_land_area 1920, 1940, 1950, 1960, 1970, 1975, 1980, 1985, 1995, 2006, 2017
agricultural_area_use 1970, 1975, 1980, 1985, 1995, 2006, 2017
agricultural_employees_tractors 1970, 1975, 1980, 1985, 1995, 2006, 2017
agricultural_producer_condition 1920, 1940, 1950, 1960, 1970, 1975, 1980, 1985, 1995, 2006, 2017
animal_production 1970, 1975, 1980, 1985, 1995, 2006, 2017
animal_products 1920, 1940, 1950, 1960, 1970, 1975, 1980, 1985, 1995, 2006, 2017
vegetable_production_area 1920, 1940, 1950, 1960, 1970, 1975, 1980, 1985, 1995, 2006, 2017
vegetable_production_temporary 1970, 1975, 1980, 1985, 1995, 2006, 2017
vegetable_production_permanent 1940, 1950, 1960, 1970, 1975, 1980, 1985, 1995, 2006, 2017
livestock_production 2017

You can request a single year or a range of years:

time_period = 2006           # single year
time_period = c(1995, 2006)  # multiple specific years
time_period = 1995:2006      # will select years within this range that are available

5. language

Output language for variable names and labels.

language = "eng"  # character string

Examples

# download treated land area data at the country level in 2017
data <- load_censoagro(
  dataset = "agricultural_land_area",
  raw_data = FALSE,
  geo_level = "country",
  time_period = 2017,
  language = "eng"
)

# download treated temporary crop data by state in 1995 in portuguese
data <- load_censoagro(
  dataset = "vegetable_production_temporary",
  raw_data = FALSE,
  geo_level = "state",
  time_period = 1995,
  language = "pt"
)

# download municipality-level cattle data (only available for livestock_production)
data <- load_censoagro(
  dataset = "livestock_production",
  raw_data = FALSE,
  geo_level = "municipality",
  time_period = 2017,
  language = "eng"
)

Data Notes

Raw vs. Treated Data

Data Organization

When using treated data, the output is typically in long format with one row per observation unit, containing: - Geographic identifiers (state, municipality if applicable) - Year of the census - Product/category names (crop type, animal species, etc.) - Quantitative measurements (area, quantity, count) - Number of establishments/properties

Important Considerations

  1. Time gaps: Census data is not collected every year. Years with no data simply won’t be available.
  2. Geographic changes: Brazil’s state boundaries have changed historically; use caution when comparing very old data
  3. Definition changes: IBGE’s classification of crops and agricultural activities has evolved. Variables may not be directly comparable across all decades.
  4. Municipality data: Currently only available for livestock_production in 2017
  5. Download size: Historical data requests with multiple years may be large; plan accordingly

Citing the Data

When using this data in research or publications, cite:

IBGE - Instituto Brasileiro de Geografia e Estatística. Censo Agropecuário. Available at: https://sidra.ibge.gov.br/pesquisa/censo-agropecuario