Puneet Varma (Editor)

Soliton (optics)

Updated on
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
Soliton (optics)

In optics, the term soliton is used to refer to any optical field that does not change during propagation because of a delicate balance between nonlinear and linear effects in the medium. There are two main kinds of solitons:


  • spatial solitons: the nonlinear effect can balance the diffraction. The electromagnetic field can change the refractive index of the medium while propagating, thus creating a structure similar to a graded-index fiber. If the field is also a propagating mode of the guide it has created, then it will remain confined and it will propagate without changing its shape
  • temporal solitons: if the electromagnetic field is already spatially confined, it is possible to send pulses that will not change their shape because the nonlinear effects will balance the dispersion. Those solitons were discovered first and they are often simply referred as "solitons" in optics.
  • Spatial solitons

    In order to understand how a spatial soliton can exist, we have to make some considerations about a simple convex lens. As shown in the picture on the right, an optical field approaches the lens and then it is focused. The effect of the lens is to introduce a non-uniform phase change that causes focusing. This phase change is a function of the space and can be represented with φ ( x ) , whose shape is approximately represented in the picture.

    The phase change can be expressed as the product of the phase constant and the width of the path the field has covered. We can write it as:

    φ ( x ) = k 0 n L ( x )

    where L ( x ) is the width of the lens, changing in each point with a shape that is the same of φ ( x ) because k 0 and n are constants. In other words, in order to get a focusing effect we just have to introduce a phase change of such a shape, but we are not obliged to change the width. If we leave the width L fixed in each point, but we change the value of the refractive index n ( x ) we will get exactly the same effect, but with a completely different approach.

    That's the way graded-index fibers work: the change in the refractive index introduces a focusing effect that can balance the natural diffraction of the field. If the two effects balance each other perfectly, then we have a confined field propagating within the fiber.

    Spatial solitons are based on the same principle: the Kerr effect introduces a Self-phase modulation that changes the refractive index according to the intensity:

    φ ( x ) = k 0 n ( x ) L = k 0 L [ n + n 2 I ( x ) ]

    if I ( x ) has a shape similar to the one shown in the figure, then we have created the phase behavior we wanted and the field will show a self-focusing effect. In other words, the field creates a fiber-like guiding structure while propagating. If the field creates a fiber and it is the mode of such a fiber at the same time, it means that the focusing nonlinear and diffractive linear effects are perfectly balanced and the field will propagate forever without changing its shape (as long as the medium does not change and if we can neglect losses, obviously). In order to have a self-focusing effect, we must have a positive n 2 , otherwise we will get the opposite effect and we will not notice any nonlinear behavior.

    The optical waveguide the soliton creates while propagating is not only a mathematical model, but it actually exists and can be used to guide other waves at different frequencies. This way it is possible to let light interact with light at different frequencies (this is impossible in linear media).


    An electric field is propagating in a medium showing optical Kerr effect, so the refractive index is given by:

    n ( I ) = n + n 2 I

    we remember that the relationship between irradiance and electric field is (in the complex representation):

    I = | E | 2 2 η

    where η = η 0 / n and η 0 is the impedance of free space, given by:

    η 0 = μ 0 ϵ 0 377 Ω

    The field is propagating in the z direction with a phase constant k 0 n . About now, we will ignore any dependence on the y axis, assuming that it is infinite in that direction. Then the field can be expressed as:

    E ( x , z , t ) = A m a ( x , z ) e i ( k 0 n z ω t )

    where A m is the maximum amplitude of the field and a ( x , z ) is a dimensionless normalized function (so that its maximum value is 1) that represents the shape of the electric field among the x axis. In general it depends on z because fields change their shape while propagating. Now we have to solve the Helmholtz equation:

    2 E + k 0 2 n 2 ( I ) E = 0

    where it was pointed out clearly that the refractive index (thus the phase constant) depends on intensity. If we replace the expression of the electric field in the equation, assuming that the envelope a ( x , z ) changes slowly while propagating, i.e.

    | 2 a ( x , z ) z 2 | | k 0 a ( x , z ) z |

    the equation becomes:

    2 a x 2 + i 2 k 0 n a z + k 0 2 [ n 2 ( I ) n 2 ] a = 0

    Let us introduce an approximation that is valid because the nonlinear effects are always much smaller than the linear ones:

    [ n 2 ( I ) n 2 ] = [ n ( I ) n ] [ n ( I ) + n ] = n 2 I ( 2 n + n 2 I ) 2 n n 2 I

    now we express the intensity in terms of the electric field:

    [ n 2 ( I ) n 2 ] 2 n n 2 | A m | 2 | a ( x , z ) | 2 2 η 0 / n = n 2 n 2 | A m | 2 | a ( x , z ) | 2 η 0

    the equation becomes:

    1 2 k 0 n 2 a x 2 + i a z + k 0 n n 2 | A m | 2 2 η 0 | a | 2 a = 0

    We will now assume n 2 > 0 so that the nonlinear effect will cause self focusing. In order to make this evident, we will write in the equation n 2 = | n 2 | Let us now define some parameters and replace them in the equation:

  • ξ = x X 0 , so we can express the dependence on the x axis with a dimensionless parameter; X 0 is a length, whose physical meaning will be clearer later.
  • L d = X 0 2 k 0 n , after the electric field has propagated across z for this length, the linear effects of diffraction can not be neglected anymore.
  • ζ = z L d , for studying the z-dependence with a dimensionless variable.
  • L n l = 2 η 0 k 0 n | n 2 | | A m | 2 , after the electric field has propagated across z for this length, the nonlinear effects can not be neglected anymore. This parameter depends upon the intensity of the electric field, that's typical for nonlinear parameters.
  • N 2 = L d L n l
  • The equation becomes:

    1 2 2 a ξ 2 + i a ζ + N 2 | a | 2 a = 0

    this is a common equation known as nonlinear Schrödinger equation. From this form, we can understand the physical meaning of the parameter N:

  • if N 1 , then we can neglect the nonlinear part of the equation. It means L d L n l , then the field will be affected by the linear effect (diffraction) much earlier than the nonlinear effect, it will just diffract without any nonlinear behavior.
  • if N 1 , then the nonlinear effect will be more evident than diffraction and, because of self phase modulation, the field will tend to focus.
  • if N 1 , then the two effects balance each other and we have to solve the equation.
  • For N = 1 the solution of the equation is simple and it is the fundamental soliton:

    a ( ξ , ζ ) = sech ( ξ ) e i ζ / 2

    where sech is the hyperbolic secant. It still depends on z, but only in phase, so the shape of the field will not change during propagation.

    For N = 2 it is still possible to express the solution in a closed form, but it has a more complicated form:

    a ( ξ , ζ ) = 4 [ cosh ( 3 ξ ) + 3 e 4 i ζ cosh ( ξ ) ] e i ζ / 2 cosh ( 4 ξ ) + 4 cosh ( 2 ξ ) + 3 cos ( 4 ζ )

    It does change its shape during propagation, but it is a periodic function of z with period ζ = π / 2 .

    For soliton solutions, N must be an integer and it is said to be the order or the soliton. For higher values of N, there are no closed form expressions, but the solitons exist and they are all periodic with different periods. Their shape can easily be expressed only immediately after generation:

    a ( ξ , ζ = 0 ) = N sech ( ξ )

    on the right there is the plot of the second order soliton: at the beginning it has a shape of a sech, then the maximum amplitude increases and then comes back to the sech shape. Since high intensity is necessary to generate solitons, if the field increases its intensity even further the medium could be damaged.

    The condition to be solved if we want to generate a fundamental soliton is obtained expressing N in terms of all the known parameters and then putting N = 1 :

    1 = N = L d L n l = X 0 2 k 0 2 n 2 | n 2 | | A m | 2 2 η 0

    that, in terms of maximum irradiance value becomes:

    I m a x = | A m | 2 2 η 0 / n = 1 X 0 2 k 0 2 n | n 2 |

    in most of the cases, the two variables that can be changed are the maximum intensity I max and the pulse width X 0 .

    Curiously, higher-order solitons can attain complicated shapes before returning exactly to their initial shape at the end of the soliton period. In the picture of various solitons, the spectrum (left) and time domain (right) are shown at varying distances of propagation (vertical axis) in an idealized nonlinear medium. This shows how a laser pulse might behave as it travels in a medium with the properties necessary to support fundamental solitons. In practice, in order to reach the very high peak intensity needed to achieve nonlinear effects, laser pulses may be coupled into optical fibers such as photonic-crystal fiber with highly confined propagating modes. Those fibers have more complicated dispersion and other characteristics which depart from the analytical soliton parameters.

    Generation of spatial solitons

    The first experiment on spatial optical solitons was reported in 1974 by Ashkin and Bjorkholm in a cell filled with sodium vapor. The field was then revisited in experiments at Limoges University in liquid carbon disulphide and expanded in the early '90s with the first observation of solitons in photorefractive crystals, glass, semiconductors and polymers. During the last decades numerous findings have been reported in various materials, for solitons of different dimensionality, shape, spiralling, colliding, fusing, splitting, in homogeneous media, periodic systems, and waveguides. Spatials solitons are also referred to as self-trapped optical beams and their formation is normally also accompanied by a self-written waveguide. In nematic liquid crystals, spatial solitons are also referred to as nematicons.

    Temporal solitons

    The main problem that limits transmission bit rate in optical fibres is group velocity dispersion. It is because generated impulses have a non-zero bandwidth and the medium they are propagating through has a refractive index that depends on frequency (or wavelength). This effect is represented by the group delay dispersion parameter D; using it, it is possible to calculate exactly how much the pulse will widen:

    Δ τ D L Δ λ

    where L is the length of the fibre and Δ λ is the bandwidth in terms of wavelength. The approach in modern communication systems is to balance such a dispersion with other fibers having D with different signs in different parts of the fibre: this way the pulses keep on broadening and shrinking while propagating. With temporal solitons it is possible to remove such a problem completely.

    Consider the picture on the right. On the left there is a standard Gaussian pulse, that's the envelope of the field oscillating at a defined frequency. We assume that the frequency remains perfectly constant during the pulse.

    Now we let this pulse propagate through a fibre with D > 0 , it will be affected by group velocity dispersion. For this sign of D, the dispersion is anomalous, so that the higher frequency components will propagate a little bit faster than the lower frequencies, thus arriving before at the end of the fiber. The overall signal we get is a wider chirped pulse, shown in the upper right of the picture.

    Now let us assume we have a medium that shows only nonlinear Kerr effect but its refractive index does not depend on frequency: such a medium does not exist, but it's worth considering it to understand the different effects.

    The phase of the field is given by:

    φ ( t ) = ω 0 t k z = ω 0 t k 0 z [ n + n 2 I ( t ) ]

    the frequency (according to its definition) is given by:

    ω ( t ) = φ ( t ) t = ω 0 k 0 z n 2 I ( t ) t

    this situation is represented in the picture on the left. At the beginning of the pulse the frequency is lower, at the end it's higher. After the propagation through our ideal medium, we will get a chirped pulse with no broadening because we have neglected dispersion.

    Coming back to the first picture, we see that the two effects introduce a change in frequency in two different opposite directions. It is possible to make a pulse so that the two effects will balance each other. Considering higher frequencies, linear dispersion will tend to let them propagate faster, while nonlinear Kerr effect will slow them down. The overall effect will be that the pulse does not change while propagating: such pulses are called temporal solitons.

    History of temporal solitons

    In 1973, Akira Hasegawa and Fred Tappert of AT&T Bell Labs were the first to suggest that solitons could exist in optical fibres, due to a balance between self-phase modulation and anomalous dispersion. Also in 1973 Robin Bullough made the first mathematical report of the existence of optical solitons. He also proposed the idea of a soliton-based transmission system to increase performance of optical telecommunications.

    Solitons in a fibre optic system are described by the Manakov equations.

    In 1987, P. Emplit, J.P. Hamaide, F. Reynaud, C. Froehly and A. Barthelemy, from the Universities of Brussels and Limoges, made the first experimental observation of the propagation of a dark soliton, in an optical fiber.

    In 1988, Linn Mollenauer and his team transmitted soliton pulses over 4,000 kilometres using a phenomenon called the Raman effect, named for the Indian scientist Sir C. V. Raman who first described it in the 1920s, to provide optical gain in the fibre.

    In 1991, a Bell Labs research team transmitted solitons error-free at 2.5 gigabits over more than 14,000 kilometres, using erbium optical fibre amplifiers (spliced-in segments of optical fibre containing the rare earth element erbium). Pump lasers, coupled to the optical amplifiers, activate the erbium, which energizes the light pulses.

    In 1998, Thierry Georges and his team at France Télécom R&D Centre, combining optical solitons of different wavelengths (wavelength division multiplexing), demonstrated a data transmission of 1 terabit per second (1,000,000,000,000 units of information per second).


    An electric field is propagating in a medium showing optical Kerr effect through a guiding structure (such as an optical fibre) that limits the power on the xy plane. If the field is propagating towards z with a phase constant β 0 , then it can be expressed in the following form:

    E ( r , t ) = A m a ( t , z ) f ( x , y ) e i ( β 0 z ω 0 t )

    where A m is the maximum amplitude of the field, a ( t , z ) is the envelope that shapes the impulse in the time domain; in general it depends on z because the impulse can change its shape while propagating; f ( x , y ) represents the shape of the field on the xy plane, and it does not change during propagation because we have assumed the field is guided. Both a and f are normalized dimensionless functions whose maximum value is 1, so that A m really represents the field amplitude.

    Since in the medium there is a dispersion we can not neglect, the relationship between the electric field and its polarization is given by a convolution integral. Anyway, using a representation in the Fourier domain, we can replace the convolution with a simple product, thus using standard relationships that are valid in simpler media. We Fourier-transform the electric field using the following definition:

    E ~ ( r , ω ω 0 ) = E ( r , t ) e i ( ω ω 0 ) t d t

    Using this definition, a derivative in the time domain corresponds to a product in the Fourier domain:

    t E i ( ω ω 0 ) E ~

    the complete expression of the field in the frequency domain is:

    E ~ ( r , ω ω 0 ) = A m a ~ ( ω , z ) f ( x , y ) e i β 0 z

    Now we can solve Helmholtz equation in the frequency domain:

    2 E ~ + n 2 ( ω ) k 0 2 E ~ = 0

    we decide to express the phase constant with the following notation:

    n ( ω ) k 0 = β ( ω ) = β 0 linear non dispersive + β l ( ω ) linear dispersive + β n l non linear = β 0 + Δ β ( ω )

    where we assume that Δ β (the sum of the linear dispersive component and the non linear part) is a small perturbation, i.e. | β 0 | | Δ β ( ω ) | . The phase constant can have any complicated behaviour, but we can represent it with a Taylor series centred on ω 0 :

    β ( ω ) β 0 + ( ω ω 0 ) β 1 + ( ω ω 0 ) 2 2 β 2 + β n l

    where, as known:

    β u = d u β ( ω ) d ω u | ω = ω 0

    we put the expression of the electric field in the equation and make some calculations. If we assume the slowly varying envelope approximation:

    | 2 a ~ z 2 | | β 0 a ~ z |

    we get:

    2 i β 0 a ~ z + [ β 2 ( ω ) β 0 2 ] a ~ = 0

    we are ignoring the behavior in the xy plane, because it is already known and given by f ( x , y ) . We make a small approximation, as we did for the spatial soliton:

    β 2 ( ω ) β 0 2 = [ β ( ω ) β 0 ] [ β ( ω ) + β 0 ] = [ β 0 + Δ β ( ω ) β 0 ] [ 2 β 0 + Δ β ( ω ) ] 2 β 0 Δ β ( ω )

    replacing this in the equation we get simply:

    i a ~ z + Δ β ( ω ) a ~ = 0 .

    Now we want to come back in the time domain. Expressing the products by derivatives we get the duality:

    Δ β ( ω ) i β 1 t β 2 2 2 t 2 + β n l

    we can write the non linear component in terms of the irradiance or amplitude of the field:

    β n l = k 0 n 2 I = k 0 n 2 | E | 2 2 η 0 / n = k 0 n 2 n | A m | 2 2 η 0 | a | 2

    for duality with the spatial soliton, we define:

    L n l = 2 η 0 k 0 n n 2 | A m | 2

    and this symbol has the same meaning of the previous case, even if the context is different. The equation becomes:

    i a z + i β 1 a t β 2 2 2 a t 2 + 1 L n l | a | 2 a = 0

    We know that the impulse is propagating along the z axis with a group velocity given by v g = 1 / β 1 , so we are not interested in it because we just want to know how the pulse changes its shape while propagating. We decide to study the impulse shape, i.e. the envelope function a(.) using a reference that is moving with the field at the same velocity. Thus we make the substitution

    T = t β 1 z

    and the equation becomes:

    i a z β 2 2 2 a T 2 + 1 L n l | a | 2 a = 0

    We now further assume that the medium where the field is propagating in shows anomalous dispersion, i.e. β 2 < 0 or in terms of the group delay dispersion parameter D = 2 π c λ 2 β 2 > 0 . We make this more evident replacing in the equation β 2 = | β 2 | . Let us define now the following parameters (the duality with the previous case is evident):

    L d = T 0 2 | β 2 | ; τ = T T 0 ; ζ = z L d ; N 2 = L d L n l

    replacing those in the equation we get:

    1 2 2 a τ 2 + i a ζ + N 2 | a | 2 a = 0

    that is exactly the same equation we have obtained in the previous case. The first order soliton is given by:

    a ( τ , ζ ) = sech ( τ ) e i ζ / 2

    the same considerations we have made are valid in this case. The condition N = 1 becomes a condition on the amplitude of the electric field:

    | A m | 2 = 2 η 0 | β 2 | T 0 2 n 2 k 0 n

    or, in terms of irradiance:

    I m a x = | A m | 2 2 η 0 / n = | β 2 | T 0 2 n 2 k 0

    or we can express it in terms of power if we introduce an effective area A e f f defined so that P = I A e f f :

    P = | β 2 | A e f f T 0 2 n 2 k 0

    Stability of solitons

    We have described what optical solitons are and, using mathematics, we have seen that, if we want to create them, we have to create a field with a particular shape (just sech for the first order) with a particular power related to the duration of the impulse. But what if we are a bit wrong in creating such impulses? Adding small perturbations to the equations and solving them numerically, it is possible to show that mono-dimensional solitons are stable. They are often referred as (1 + 1) D solitons, meaning that they are limited in one dimension (x or t, as we have seen) and propagate in another one (z).

    If we create such a soliton using slightly wrong power or shape, then it will adjust itself until it reaches the standard sech shape with the right power. Unfortunately this is achieved at the expense of some power loss, that can cause problems because it can generate another non-soliton field propagating together with the field we want. Mono-dimensional solitons are very stable: for example, if 0.5 < N < 1.5 we will generate a first order soliton anyway; if N is greater we'll generate a higher order soliton, but the focusing it does while propagating may cause high power peaks damaging the media.

    The only way to create a (1 + 1) D spatial soliton is to limit the field on the y axis using a dielectric slab, then limiting the field on x using the soliton.

    On the other hand, (2 + 1) D spatial solitons are unstable, so any small perturbation (due to noise, for example) can cause the soliton to diffract as a field in a linear medium or to collapse, thus damaging the material. It is possible to create stable (2 + 1) D spatial solitons using saturating nonlinear media, where the Kerr relationship n ( I ) = n + n 2 I is valid until it reaches a maximum value. Working close to this saturation level makes it possible to create a stable soliton in a three-dimensional space.

    If we consider the propagation of shorter (temporal) light pulses or over a longer distance, we need to consider higher-order corrections and therefore the pulse carrier envelope is governed by the higher-order nonlinear Schrödinger equation (HONSE) for which there are some specialized (analytical) soliton solutions.

    Effect of power losses

    As we have seen, in order to create a soliton it is necessary to have the right power when it is generated. If there are no losses in the medium, then we know that the soliton will keep on propagating forever without changing shape (1st order) or changing its shape periodically (higher orders). Unfortunately any medium introduces losses, so the actual behaviour of power will be in the form:

    P ( z ) = P 0 e α z

    this is a serious problem for temporal solitons propagating in fibers for several kilometers. Let us consider what happens for the temporal soliton, generalization to the spatial ones is immediate. We have proved that the relationship between power P 0 and impulse length T 0 is:

    P = | β 2 | A e f f T 0 2 n 2 k 0

    if the power changes, the only thing that can change in the second part of the relationship is T 0 . if we add losses to the power and solve the relationship in terms of T 0 we get:

    T ( z ) = T 0 e α 2 z

    the width of the impulse grows exponentially to balance the losses! this relationship is true as long as the soliton exists, i.e. until this perturbation is small, so it must be α z 1 otherwise we can not use the equations for solitons and we have to study standard linear dispersion. If we want to create a transmission system using optical fibres and solitons, we have to add optical amplifiers in order to limit the loss of power.

    Generation of soliton pulse

    Experiments have been carried out to analyse the effect of high frequency (20 MHz-1 GHz) external magnetic field induced nonlinear Kerr effect on Single mode optical fibre of considerable length (50-100m) to compensate group velocity dispersion (GVD) and subsequent evolution of soliton pulse ( peak energy, narrow, secant hyperbolic pulse). Generation of soliton pulse in fibre is an obvious conclusion as self phase modulation due to high energy of pulse offset GVD, whereas the evolution length is 2000 km. (the laser wavelength chosen greater than 1.3 micrometers). Moreover, peak soliton pulse is of period 1-3ps so that it is safely accommodated in the optical bandwidth. Once soliton pulse is generated it is least dispersed over thousands of kilometres length of fibre limiting the number of repeater stations.

    Dark solitons

    In the analysis of both types of solitons we have assumed particular conditions about the medium:

  • in spatial solitons, n 2 > 0 , that means the self-phase modulation causes self-focusing
  • in temporal solitons, β 2 < 0 or D > 0 , anomalous dispersion
  • Is it possible to obtain solitons if those conditions are not verified? if we assume n 2 < 0 or β 2 > 0 , we get the following differential equation (it has the same form in both cases, we will use only the notation of the temporal soliton):

    1 2 2 a τ 2 + i a ζ + N 2 | a | 2 a = 0.

    This equation has soliton-like solutions. For the first order (N=1):

    a ( τ , ζ ) = tanh ( τ ) e i ζ .  

    The plot of | a ( τ , ζ ) | 2 is shown in the picture on the right. For higher order solitons ( N > 1 ) we can use the following closed form expression:

    a ( τ , ζ = 0 ) = N tanh ( τ ) .  

    It is a soliton, in the sense that it propagates without changing its shape, but it is not made by a normal pulse; rather, it is a lack of energy in a continuous time beam. The intensity is constant, but for a short time during which it jumps to zero and back again, thus generating a "dark pulse"'. Those solitons can actually be generated introducing short dark pulses in much longer standard pulses. Dark solitons are more difficult to handle than standard solitons, but they have shown to be more stable and robust to losses.


    Soliton (optics) Wikipedia