Sigmoid function

Mathematical function having a characteristic S-shaped curve or sigmoid curve


title: "Sigmoid function" type: doc version: 1 created: 2026-02-28 author: "Wikipedia contributors" status: active scope: public tags: ["sigmoid-functions", "artificial-neural-networks"] description: "Mathematical function having a characteristic S-shaped curve or sigmoid curve" topic_path: "general/sigmoid-functions" source: "https://en.wikipedia.org/wiki/Sigmoid_function" license: "CC BY-SA 4.0" wikipedia_page_id: 0 wikipedia_revision_id: 0

::summary Mathematical function having a characteristic S-shaped curve or sigmoid curve ::

::figure[src="https://upload.wikimedia.org/wikipedia/commons/8/88/Logistic-curve.svg" caption="The [[logistic curve"] ::

::figure[src="https://upload.wikimedia.org/wikipedia/commons/2/2f/Error_Function.svg" caption="Plot of the [[error function"] ::

A sigmoid function is any mathematical function whose graph has a characteristic S-shaped or sigmoid curve.

A common example of a sigmoid function is the logistic function, which is defined by the formula :\sigma(x) = \frac{1}{1 + e^{-x}} = \frac{e^x}{1 + e^x} = 1 - \sigma(-x).

Other sigmoid functions are given in the Examples section. In some fields, most notably in the context of artificial neural networks, the term "sigmoid function" is used as a synonym for "logistic function".

Special cases of the sigmoid function include the Gompertz curve (used in modeling systems that saturate at large values of x) and the ogee curve (used in the spillway of some dams). Sigmoid functions have domain of all real numbers, with return (response) value commonly monotonically increasing but could be decreasing. Sigmoid functions most often show a return value (y axis) in the range 0 to 1. Another commonly used range is from −1 to 1.

There is also the Heaviside step function, which instantaneously transitions between 0 and 1.

A wide variety of sigmoid functions including the logistic and hyperbolic tangent functions have been used as the activation function of artificial neurons. Sigmoid curves are also common in statistics as cumulative distribution functions (which go from 0 to 1), such as the integrals of the logistic density, the normal density, and Student's t probability density functions. The logistic sigmoid function is invertible, and its inverse is the logit function.

Definition

A sigmoid function is a bounded, differentiable, real function that is defined for all real input values and has a positive derivative at each point.

Properties

In general, a sigmoid function is monotonic, and has a first derivative which is bell shaped. Conversely, the integral of any continuous, non-negative, bell-shaped function (with one local maximum and no local minimum, unless degenerate) will be sigmoidal. Thus the cumulative distribution functions for many common probability distributions are sigmoidal. One such example is the error function, which is related to the cumulative distribution function of a normal distribution; another is the arctan function, which is related to the cumulative distribution function of a Cauchy distribution.

A sigmoid function is constrained by a pair of horizontal asymptotes as x \rightarrow \pm \infty.

A sigmoid function is convex for values less than a particular point, and it is concave for values greater than that point: in many of the examples here, that point is 0.

Examples

::figure[src="https://upload.wikimedia.org/wikipedia/commons/6/6f/Gjl-t(x).svg" caption="Some sigmoid functions compared. In the drawing all functions are normalized in such a way that their slope at the origin is 1."] ::

  • Logistic function f(x) = \frac{1}{1 + e^{-x}}
  • Hyperbolic tangent (shifted and scaled version of the logistic function, above) f(x) = \tanh x = \frac{e^x-e^{-x}}{e^x+e^{-x}}
  • Arctangent function f(x) = \arctan x
  • Gudermannian function f(x) = \operatorname{gd}(x) = \int_0^x \frac{dt}{\cosh t} = 2\arctan\left(\tanh\left(\frac{x}{2}\right)\right)
  • Error function f(x) = \operatorname{erf}(x) = \frac{2}{\sqrt{\pi}} \int_0^x e^{-t^2} , dt
  • Generalised logistic function f(x) = \left(1 + e^{-x} \right)^{-\alpha}, \quad \alpha 0
  • Smoothstep function f(x) = \begin{cases} {\displaystyle \left( \int_0^1 \left(1 - u^2\right)^N du \right)^{-1} \int_0^x \left( 1 - u^2 \right)^N \ du}, & |x| \le 1 \ \ \sgn(x) & |x| \ge 1 \ \end{cases} \quad N \in \mathbb{Z} \ge 1
  • Some algebraic functions, for example f(x) = \frac{x}{\sqrt{1+x^2}}
  • and in a more general form f(x) = \frac{x}{\left(1 + |x|^{k}\right)^{1/k}}
  • Up to shifts and scaling, many sigmoids are special cases of f(x) = \varphi(\varphi(x, \beta), \alpha) , where \varphi(x, \lambda) = \begin{cases} (1 - \lambda x)^{1/\lambda} & \lambda \ne 0 \e^{-x} & \lambda = 0 \ \end{cases} is the inverse of the negative Box–Cox transformation, and \alpha and \beta are shape parameters.
  • Smooth transition function normalized to (−1,1): f(x) = \begin{cases} {\displaystyle 2\frac{e^{\frac{1}{u}}}{e^{\frac{1}{u}}+e^{\frac{-1}{1+u}}} - 1}, u=\frac{x+1}{-2}, & |x| \ \sgn(x) & |x| \ge 1 \ \end{cases} AManWithNoPlan simplified below -- \begin{align}f(x) &= \begin{cases} {\displaystyle \frac{2}{1+e^{-2m\frac{x}{1-x^2}}} - 1}, & |x| \ \sgn(x) & |x| \ge 1 \ \end{cases} \ &= \begin{cases} {\displaystyle \tanh\left(m\frac{x}{1-x^2}\right)}, & |x| \ \sgn(x) & |x| \ge 1 \ \end{cases}\end{align} using the hyperbolic tangent mentioned above. Here, m is a free parameter encoding the slope at x=0, which must be greater than or equal to \sqrt{3} because any smaller value will result in a function with multiple inflection points, which is therefore not a true sigmoid. This function is unusual because it actually attains the limiting values of −1 and 1 within a finite range, meaning that its value is constant at −1 for all x \leq -1 and at 1 for all x \geq 1. Nonetheless, it is smooth (infinitely differentiable, C^\infty) everywhere, including at x = \pm 1.

Applications

::figure[src="https://upload.wikimedia.org/wikipedia/commons/4/46/Gohana_inverted_S-curve.png" caption="Inverted logistic S-curve to model the relation between wheat yield and soil salinity"] ::

Many natural processes, such as those of complex system learning curves, exhibit a progression from small beginnings that accelerates and approaches a climax over time. When a specific mathematical model is lacking, a sigmoid function is often used.

The van Genuchten–Gupta model is based on an inverted S-curve and applied to the response of crop yield to soil salinity.

Examples of the application of the logistic S-curve to the response of crop yield (wheat) to both the soil salinity and depth to water table in the soil are shown in modeling crop response in agriculture.

In artificial neural networks, sometimes non-smooth functions are used instead for efficiency; these are known as hard sigmoids.

In audio signal processing, sigmoid functions are used as waveshaper transfer functions to emulate the sound of analog circuitry clipping.

In Digital signal processing in general, sigmoid functions, due to their higher order of continuity, have much faster asymptotic rolloff in the frequency domain than a Heavyside step function, and therefore are useful to smoothen discontinuities before sampling to reduce aliasing. This is, for example, used to generate square waves in many kinds of Digital synthesizer.

In biochemistry and pharmacology, the Hill and Hill–Langmuir equations are sigmoid functions.

In computer graphics and real-time rendering, some of the sigmoid functions are used to blend colors or geometry between two values, smoothly and without visible seams or discontinuities.

Titration curves between strong acids and strong bases have a sigmoid shape due to the logarithmic nature of the pH scale.

The logistic function can be calculated efficiently by utilizing type III Unums.

An hierarchy of sigmoid growth models with increasing complexity (number of parameters) was built{{cite journal | author = Kleshtanova, Viktoria and Ivanov, Vassil V and Hodzhaoglu, Feyzim and Prieto, Jose Emilio and Tonchev, Vesselin | title = Heterogeneous Substrates Modify Non-Classical Nucleation Pathways: Reanalysis of Kinetic Data from the Electrodeposition of Mercury on Platinum Using Hierarchy of Sigmoid Growth Models | journal = Crystals | volume = 13 | number = 12 | page = 1690 | year = 2023 | publisher = MDPI | doi = 10.3390/cryst13121690 | doi-access = free | hdl = 10261/341589 | hdl-access = free | author = Markov, I. and Stoycheva, E. | title = Saturation Nucleus Density in the Electrodeposition of Metals onto Inert Electrodes II. Experimental | journal = Thin Solid Films | volume = 35 | number = 1 | pages = 21–35 | year = 1976 | publisher = Elsevier | doi = 10.1016/0040-6090(76)90237-6 | author = Ivanov, V.V. and Tielemann, C. and Avramova, K. and Reinsch, S. and Tonchev, V. | title = Modelling Crystallization: When the Normal Growth Velocity Depends on the Supersaturation | journal = Journal of Physics and Chemistry of Solids | volume = 181 | article-number = 111542 | year = 2023 | publisher = Elsevier | doi = 10.1016/j.jpcs.2023.111542 | doi-access = free | author = Fanfoni, M. and Tomellini, M. | title = The Johnson-Mehl-Avrami-Kolmogorov Model: A Brief Review | journal = Il Nuovo Cimento D | volume = 20 | pages = 1171–1182 | year = 1998 | publisher = Springer | doi = 10.1007/BF03185527 | author = Tjørve, E. and Tjørve, K.M.C. | title = A Unified Approach to the Richards-Model Family for Use in Growth Analyses: Why We Need Only Two Model Forms | journal = Journal of Theoretical Biology | volume = 267 | number = 3 | pages = 417–425 | year = 2010 | publisher = Elsevier | doi = 10.1016/j.jtbi.2010.09.008

References

References

  1. EpsilonDelta. (16 August 2022). "Smooth Transition Function in One Dimension | Smooth Transition Function Series Part 1".
  2. (2022). "Harnessing the Power of S-Curves". [[RMI (energy organization).
  3. (December 1993). "Entropic analysis of biological growth models". [[IEEE Transactions on Biomedical Engineering]].
  4. (1995). "From Natural to Artificial Neural Computation".
  5. (2015-12-28). "Some extensions in continuous methods for immunological correlates of protection". [[BMC Medical Research Methodology]].
  6. (November 2000). "Variational Gaussian process classifiers". [[IEEE Transactions on Neural Networks]].
  7. (2010). "Physical Audio Signal Processing". W3K Publishing.
  8. (2017-06-12). "Beating Floating Point at its Own Game: Posit Arithmetic".
  9. (9 July 2022). "grex --- Growth-curve Explorer".

::callout[type=info title="Wikipedia Source"] This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page. ::

sigmoid-functionsartificial-neural-networks