# Concentration Index

## Contents

## Definition

A **concentration index** is any mathematical function that converts a *distribution* of observed values into a *single number* expressing the prevalence (concentration) of certain observations amongst the total set. Concentration indexes find widespread use in (among others) studies of biological diversity, economic inequality, market competition, and portfolio management.

The reduction of Concentration Measurement to a single number is only one of the available tools. The Lorentz Curve is an example of a more complex approach to studying concentration.

## List of General Concentration Indexes

The very widespread use of concentration indexes means that the same essential functions have been invented many times separately, have been given different names and are used in slightly different manner (e.g. inverse relationships, different normalizations etc.). The following aims to be a complete list of the essentially distinct varieties of concentration indexes:

- Atkinson Index
- Berger-Parker Index
- Concentration Ratio
- Generalized Entropy Index
- Gini Index
- Theil Index (Alternate name for the Generalized Entropy Index)
- Hannah Kay Index
- Hoover Index
- Herfindahl-Hirschman Index
- Kolm Index
- Shannon Index (Special case of the Generalized Entropy Index)
- Renyi Index (Alternate name for the Generalized Entropy Index)

## Spatial and Geographic Concentration Indexes

When spatial distribution (e.g., geographic location) is an important consideration the above general indexes may have reduced applicability. In this case a specialized type of Geographic Concentration Index might more applicable. Several geographic concentration indexes have been proposed:

- Generalization of the Gini index
- Generalization of the Herfindahl index
- Multigroup Thiel index
- Ellison and Glaeser index
- Maurel and Sedillot index
- Continuous indexes that use distances between locations

## Discrete versus Continuous data sets

There are two possible starting points for input data:

- Observations are discrete, for example is the proportion of a population with total count N that belongs to category i. A number of indexes used in diversity studies (Abundance, Richness, Margalev's, Menhinick’s diversity indexes) are based on species counts.
- Observations are continuous, for example is a range of observations of some continuous quantity E. Such data will typically converted into weights