A model-independent characterisation of strong gravitational lensing by observables

When light from a distant source object, like a galaxy or a supernova, travels towards us, it is deflected by massive objects that lie on its path. When the mass density of the deflecting object exceeds a certain threshold, multiple, highly distorted images of the source are observed. This strong gravitational lensing effect has so far been treated as a model-fitting problem. Using the observed multiple images as constraints yields a self-consistent model of the deflecting mass density and the source object. As several models meet the constraints equally well, we develop a lens characterisation that separates data-based information from model assumptions. The observed multiple images allow us to determine local properties of the deflecting mass distribution on any mass scale from one simple set of equations. Their solution is unique and free of model-dependent degeneracies. The reconstruction of source objects can be performed completely model-independently, enabling us to study galaxy evolution without a lens-model bias. Our approach reduces the lens and source description to its data-based evidence that all models agree upon, simplifies an automated treatment of large datasets, and allows for an extrapolation to a global description resembling model-based descriptions.


Introduction
Forty years ago, in 1979, two images of the quasar QSO 0957+561 were observed, [1], which marked the discovery of strong gravitational lenses. Since then, observations of multiple images have been routinely used to probe the deflecting mass-density distribution, in particular in order to infer its dark matter content and dark matter properties, see e.g. [2] and [3] for galaxy-scale lens surveys, or [4] and [5] for galaxy-cluster-scale lens surveys. Recent benchmarks with simulations of cluster-scale lenses show that a lot of different approaches accurately and precisely reconstruct the mass-density distribution in the vicinity of extended multiple images up to a few percent, [6]. For galaxy-scale lenses, an analogous project is currently being pursued, [7]. Furthermore, strong gravitational lensing of time varying objects has been used as a cosmological probe to determine the Hubble-Lemaître constant, H 0 , [8], [9], [10]. Attempts to infer the cosmic spatial curvature, the cosmic matter density parameter, and dark energy properties of the current cosmological concordance model and its potential extensions are also being pursued with galaxy-scale and cluster-scale lenses, [11], [12], [13]. Even after forty years of strong gravitational lensing studies, it still is a subject of high research interest because unprecedented observations, like supernova (SN) Refsdal, [14], or the recently discovered fast radio bursts (FRBs), [15], have continuously contributed to develop lens reconstruction methods further and thereby broaden the range of strong-lensing applications.
In our research, as being developed in [16], [17], [18], [19], [20], [21], and [22], we pursue the question which properties of a strong gravitational lens can be directly inferred from the observables that describe a multiple-image configuration. Contrary to most approaches, we do not aim for a global mass-density reconstruction of the entire lens that usually employs the observables as constraints in a model-fitting problem. For galaxy-scale lenses, such global lens reconstructions using parametric models or free-form arXiv:1906.05285v1 [astro-ph.CO] 12 Jun 2019 methods can be found in [23], [24], [25], [26], [27], or [28], for instance. Galaxy-cluster mass densities can be reconstructed with global lens reconstructions as, for instance, detailed in [29], [30], [31], [32]. References to further approaches and a comparison between the approaches can be found in [6]. Some of the approaches originally developed for galaxy-scale lenses can also be employed and extended to reconstruct cluster-scale lenses and vice versa, see [33] for an example. Due to the high symmetry of the lenses on galaxy scale, fully automatic characterisations are easier to implement than for cluster-scale lenses. The latter usually require at least a visual inspection of the results, or manual fine-tuning at intermediate stages. In addition, aiming at a global characterisation, the sparsely distributed multiple images are not sufficient to fully constrain the mass density within the cluster, [34]. As a consequence, different approaches employ different additional information and assumptions to reconstruct the mass-density distribution, which complicates the comparison of the results.
Instead of finding the global deflecting mass density that can cause the multiple images observed, we separate data-based information from model-based assumptions to find those lens properties that all approaches agree upon. We employ the most general equations of the standard gravitational lensing formalism to directly determine local lens properties from the observables. These local properties are unique and calculated in the same way for all lenses from symmetric galaxy-scale lenses to amorphous galaxy-cluster-scale lenses. Thus, the approach is highly efficient and robust to handle large datasets with a minimum amount of manual intervention.
The cosmic distances between us, the lens, and the source are usually determined according to the cosmological standard model and, as such, are not model-independent, see e.g. [35]. To make them independent of any assumption about the nature and the overall abundance of dark matter and dark energy in the universe, we set up data-based cosmic distances from the most recent Pantheon sample of SNe type Ia, [36]. This covers SNe up to redshift 2.3 such that data-based distances are available for a lot of observed lensing configurations. In this way, our approach is independent of a specific lens model and of a specific parametrisation of the cosmological background model, which is only assumed to be spatially homogeneous and isotropic and should fulfil the Einstein field equations. To set up the data-based cosmic distances, we employ the analytic orthonormal basis introduced in [37] for the sake of run-time efficiency and numerical stability. Other approaches to develop distance measures based on the standardisable candles are, for instance, [38], [39], [40], and [41].
In this work, we summarise the status quo of our approach, show a comparison to different global lens reconstruction ansatzes, based on [42] and [43], and detail its contribution to determine the invariance transformations of the most general equations of the gravitational lensing formalism, especially the time-delay equation which is used to infer H 0 . Based on these results, we outline future developments of lens reconstructions that could be enabled and tested by the recently discovered fast radio bursts, as further described in [44].
The next sections are organised as follows: In Section 2, we introduce the theory of the model-independent lens characterisation for the leading-order configurations of multiple images. We list the local lens characteristics that can be determined by the different observables in each multiple-image configuration. Section 3 shows examples for these different cases on galaxy and on galaxy-cluster scale and the results obtained by our approach. We also estimate to which precision H 0 can be determined from the time-delay equation employing the Pantheon-data-based angular diameter distances. After these proof-of-concept examples, we summarise the assets and applications of our approach in Section 4. Section 5 concludes the review with a diagrammatic summary of our method.

Materials and Methods
In this section, we introduce the theoretical concepts to determine local properties of a strong gravitational lens without assuming a specific lens model and without assuming a specific parametrisation of a spatially homogeneous and isotropic Friedmann background cosmology. Most cases of strong gravitational lensing Figure 1. Strong gravitational lenses can cause a deflection of light rays from a source S (grey circle) in the source plane P s into two images (yellow circles) in the lens plane P l . The upper image is observed at the angular position x 1 on the sky. The deflecting mass density (grey cloud) is assumed to consist of all masses along the line of sight (red circles) projected into P l , such that the light deflection is assumed to happen instantaneously at P l and not at the individual masses (as indicated by the grey dashed paths). Differences in the arrival times of the light rays from the images at the observer O (indicated by the yellow arrows) occur due to the different light paths. are well described by an effective theory of light deflection by a mass distribution in the static, thin-screen, weak-field limit in a spatially homogeneous and isotropic background cosmology, as summarised in [45], [46]. This means that the deflecting mass distribution is assumed to be located in one (or several) plane(s) orthogonal to the line of sight, the lens plane(s). The mass density in these lens planes can be described by a Newtonian gravitational potential on top of the cosmic background metric which is usually based on a Friedmann-Lemaître-Robertson-Walker metric (FLRW metric). Depending on the mass-density distribution, as investigated in [47], this Newtonian potential can deflect light rays emitted from a source, lying in a source plane, into multiple images. It is assumed that the distances on the sky between the angular positions of the source and the multiple images are small compared to the distances that the light rays travel along the line of sight. For the light deflection to be static, all motions of the observer, the source, and the lens with respect to each other are considered negligible during the observing time. If the source object has an intrinsic time-varying intensity, the deflected light rays arrive at different times at the observer because they cross environments of different mass densities and travel distances of varying length from the source to the observer. Figure 1 depicts an example of the strong gravitational lensing effect in which one background source is mapped to two images by a deflecting mass density in one lens plane.
In principle, there are other ways to describe the deflection of light rays in our universe which are detailed in [22]. For instance, the approach in which the background metric accounts for the entire deflecting mass-density distribution is an equivalent ansatz. Yet, determining the geodesics in such an inhomogeneous universe is more difficult than treating gravitational lensing in an effective theory based on lens planes in a homogeneous and isotropic background cosmology. Another option partitions the mass density along the line of sight into a main deflecting object and small-scale perturbations scattered along the light path in an FLRW background metric without projecting the masses to lens planes.
In order to avoid degeneracies within the description, we should choose the background metric, small-scale weakly deflecting masses, and deflecting masses that cause multiple images in such a way that the observables can directly constrain all parts of the chosen partition. We detail our choices of an FLRW background cosmology and a single-lens-plane Newtonian deflection potential in Sections 2.1 and 2.2.
Given these prerequisites, singularity theory has been established as the mathematical foundation to characterise the generic, non-linear mappings between multiple-image configurations and their sources, [48]. Originally developed by Vladimir Arnold to describe the set of singular points arising on manifolds that undergo (sudden) changes, e.g. projections to lower dimensions, singularity theory is the ideal tool to describe gravitational lensing in the lens-plane formalism introduced above. It thus provides a framework to solve the under-constrained inverse problem of reconstructing the gravitational lens and the source from an observed multiple-image configuration.

Observables in multiple images
As the (time-varying) brightness profile of multiple images caused by the strong gravitational lensing effect depends on the (time-varying) brightness profile of the source object and the deflecting mass distribution, one may expect a large variance in the appearances of multiple images and an equally high variance in the arrival time patterns of time-varying multiple images. Yet, the current measurement precision and resolution allow us to categorise the observations into three types of multiple images sharing the same observable features as summarised in Table 1. The number of multiple-image configurations is also limited and most of the occurring cases can be categorised into the three configurations shown in Table 2. Here, we focus on the observational properties of these configurations. In Section 2.2, we give an explanation for the limited amount of possible configurations based on singularity theory. Furthermore, general rules about the order in which light from these multiple-image configurations reaches the observer can also be derived from singularity theory, [48]. Any point source or object far below the resolution limit of the telescope will be mapped to multiple image points by the gravitational lensing effect with the angular position on the sky as the quantitative observable. Examples for such cases are multiple images of SNe, FRBs, or most quasars (QSOs), i.e. objects of short duration or with a time-varying intensity, see Table 1 (left). We denote images above the resolution limit of the data acquisition as unresolved images if these extended images do not show intrinsic features in the brightness profile above noise level, as shown in Table 1 (centre). The brightness profile of these images can be decomposed into multipoles. We obtain the centre of light of the brightness profiles of the images and the quadrupole around the centre of brightness as observables. Higher-order moments may be extractable, as we are currently investigating. From the quadrupole, we obtain the vectors of the semi-minor and semi-major axes with respect to the centre of light. The ellipticity of an unresolved image is calculated from the axis ratio between the semi-minor and the semi-major axis. The orientation of the image is defined as the direction of the semi-major axis in the coordinate system of the telescope observation. Decomposing the brightness profiles into other sets of basis functions that rely on specific profile models may lead to biases when inferring lens properties, as found e.g. in [49] and [50] for weakly-distorted galaxy brightness profiles. Therefore, any set of basis functions that is based on non-parametric, statistical properties of the brightness profiles is preferred, see e.g. [51] for a moment-based decomposition of weakly-distorted galaxy brightness profiles, or [52] and [53] for another suitable basis set. Table 2. Special multiple-image configurations of a common source. The figure shows an observed example, the schematic underneath the observables for unresolved images. Point-like and resolved images can also form these configurations. The bottom rows list the quantitative observables for each case. For arcs, we assume the centre of brightness of the lens is known. Image credits: NASA/ESA/HST. In the optimal case, the source object has a brightness profile with intrinsic features and the lensing effect is so strong that these features can be identified in all images like in Table 1 (right). As quantitative observables in each image, we obtain the vectors of the angular distances between the brightness features. Our approach requires at least two such non-parallel vectors per multiple image. This implies that any unresolved image can also be treated as a resolved image by means of the semi-major and semi-minor axes of the quadrupole. If the source is a galaxy, the brightness features can be star-forming regions. So far, three cases of resolved multiple images of background galaxies caused by a galaxy cluster as lens have been observed: Five multiple images in the galaxy cluster CL0024+17 as found by [54], a giant arc and its counter image in the galaxy cluster RCS2 032727-132623 as discovered by [55], and three images of a spiral galaxy in the galaxy cluster MACS J1149.5+2223 as first detailed in [56]. In the radio bands, some multiple images of QSOs show resolved features like core-jet structures, see e.g. [57] or [58]. Table 2 shows the most common multiple-image configurations, i.e. the relative positions and orientations of multiple images with respect to each other. The configurations are shown for unresolved images. Yet, they are equally observed for point-like and resolved images. As we detail in Section 2.2, these configurations are derived from a leading order approximation to the general non-linear lens mapping. In addition to these configurations, central images close to the lens centre can occur. They remain unobserved in most cases because they are demagnified by the lens and are usually outshone or occluded by a bright galaxy-scale lens or by the brightest cluster galaxy in the centre of a galaxy-cluster-scale lens.

Fold
The first case in Table 2 (left) is a fold configuration of two images that are mirror images of each other. This is often expressed as images A and B having opposite parity (handedness). The second case in Table 2 (centre) is a cusp configuration of three multiple images arranged in a parabolic shape. To leading order, this configuration can be decomposed into the superposition of two fold configurations formed by images A and B and A and C, as further detailed in Section 2.4. The last case in Table 2 (right) occurs for (almost) axisymmetric lenses if the position of the background source falls close to the point in the source plane to which the symmetry centre of the lens is back-projected. For a perfect alignment of the source position and the symmetry centre of the lens along the observer's line of sight, the source is mapped into an Einstein ring. Small deviations from this alignment or from a perfect axisymmetric lens lead to two giant arcs, a larger one and a smaller one. The latter is called the counter-image. These arcs surround the lens as shown in Table 2 (right). Sometimes, the larger arc can be split further into a cusp configuration, if the deviations from an axisymmetric mass-density distribution are so strong that the ellipticity of the deflecting mass profile becomes relevant.
Measuring the redshift of the images and the lens, the distances between the observer, the source, and the lens can be determined, employing the distance measures of the background metric. For a spatially flat FLRW metric, the angular diameter distances between two redshifts z 1 and z 2 with z 2 > z 1 are given as in which E(z) is the expansion function of the universe. For the standard cold-dark-matter model with a cosmological constant Λ (ΛCDM) based on a spatially flat FLRW metric, it reads The Ω i , i = r, m, Λ, denote the dimensionless contributions of radiation, matter, and the cosmological constant to the total energy density of the universe, normalised such that E(0) = 1. The term for the spatial curvature with Ω K is discarded, as [59] found that the curvature is compatible with zero. Alternatively, any data-based expansion function can be inserted, as, for instance set up for our approach in [21]. Section 2.5 details these data-based distance measures. The distances between the observer, the source, and the lens are only relevant in the time-delay equation, i.e. for cases involving images of time-varying intensity, as detailed in Section 2.4.

The standard gravitational lensing formalism
Based on the physical prerequisites of Section 2 and the detailed derivation in [22], we set up the standard gravitational lensing formalism for one lens plane as follows. First, the light propagation along the line of sight is divided into the light propagation through the flat FLRW metric and the light deflection at the lens plane. Relative to the unperturbed light ray that propagates along a null-geodesic of the background metric, let the light be subject to a time delay t s due to the deflection in the lens plane. Due to this deflection, the light also travels the additional time t g in the flat FLRW metric relative to the unperturbed ray. Minimising the total travel time with respect to an unperturbed light ray, t = t g + t s , we obtain for a light ray starting at the source redshift z s and being deflected by a two-dimensional gravitational potential ψ(x) at redshift z l , as shown in Figure 1. α(x) ∈ R 2 denotes the deflection angle caused by the gravitational potential in the lens plane. It is given by the difference of the angular position of a multiple image in the lens plane, x = (x 1 , x 2 ) ∈ R 2 , and the angular position of the source in the source plane, In addition, the deflection angle α(x) is given as the gradient of the projected deflection potential ψ(x) with respect to x, α(x) = ∇ψ(x) such that Equation (4) is often stated as For the sake of convenience, we abbreviate φ(x, ψ) is called the Fermat potential. From the viewpoint of singularity theory, φ(x, ψ) is a function of x parametrised by ψ(x). Its critical points, ∇φ(x, ψ) = 0, are the positions of the multiple images given by Equation (5). The second derivative, the Hessian matrix of φ(x, ψ), also called distortion matrix, is a two-dimensional mapping from vectors in the lens plane around x to vectors in the source plane around y. Hence, it describes the distortion of light bundles in the vicinity of x caused by the gravitational lensing effect. A(x) is often parametrised by the scaled, projected mass density called convergence κ(x) and the shear γ(x) = (γ 1 (x), γ 2 (x)), such that A(x) is decomposed into a magnifying trace part and a distorting symmetric part κ(x) is related to the deflection potential by the Poisson equation ∆ψ(x) = 2κ(x). γ(x) can be physically interpreted as the deflection caused by parts of the mass density surrounding x. The shear thus represents the non-local part of the deflection potential that is not included in the Poisson equation. It enters ψ(x) via the boundary conditions, as detailed in [22]. g(x) = (g 1 (x), g 2 (x)) is the reduced shear, which is the quantity constrained by observables, as detailed in Section 2.3. Given A(x) either as parametrised by Equation (7) or (8), it is possible to reconstruct vectors in the source plane in the vicinity of x up to an overall scale factor, as also detailed in Section 2.3. The set of degenerate critical points x 0 for which ∇φ(x 0 , ψ) = 0 and det(A(x 0 )) = 0 are called the critical curves. Using Equations (7) and (8), they are given by all x 0 fulfilling As shown in [48], the critical curves described by Equation (9) consist of only two kinds of critical points, so-called fold points, x f , and cusp points, x c . Irrespective of the specific form of ψ(x), x f and x c are determined by a Taylor expansion of the Fermat potential of order four around each critical point. Thus, we have two possibilities to determine local lens properties without assuming a specific form of ψ(x): First, we assume that A(x) is constant in the vicinity of the centre of light of a multiple image. This approximation is valid if the extensions of a multiple image are small compared to the scale on which the deflecting mass density changes. A comprehensive synopsis of the validity of these approximations for extended sources and finite beam sizes in gravitational lensing can be found in [60]. [61] derived the most general equations to constrain local lens properties from Equation (5). In [18], we developed this idea further by assuming a constant A(x) in the area spanned by the quadrupole of an unresolved multiple image or in the area spanned by the convex hull of all identifiable reference points of a resolved multiple image. Section 2.3 details the ansatz and shows the local lens properties that are determined in the vicinity of the multiple images. They are calculated using the quadrupoles of at least two unresolved images in a fold configuration. Alternatively, the ansatz leads to the same local lens properties for at least three identifiable reference points in at least two images in a fold configuration.
Second, we perform the Taylor expansion to fourth order around a critical point and employ the observables of the multiple images to infer local properties of the critical curves, as detailed in Section 2.4.
Consequently, local lens properties in the vicinity of the multiple images and approximations to the critical curves can be determined independently of the specific global form of ψ(x), i. e. without any prior assumptions about the decomposition of the deflecting gravitational potential.
2.3. Local lens properties from a Taylor expansion around the centre of light of the multiple images [57] already described the possibility to extract the local lens properties contained in A(x) by transforming multiple images onto each other. In [18] and [42], we realised this idea for unresolved and resolved multiple images that contain at least two non-parallel vectors, as indicated by the black arrows in the figures of Table 1. We assume to have m reference points within each multiple image i which are located at positions x iα , α = 1, ..., m. Using the quadrupole of an unresolved image as observables, the three reference points are the centre of light and the end points of the semi-major and semi-minor axis of the quadrupole. Furthermore, we assume that the multiple images have a very small extent compared to the scale on which the mass density changes. Sections 3.2 and 3.3 show the validity of this assumption in two practical applications on cluster and on galaxy scale. This implies that the entries in A(x) are approximately constant over the area spanned by the x iα in each multiple image. Since the vectors in all multiple images x iα − x iβ , α, β = 1, ..., m, originate from the same, corresponding vectors y α − y β in the source plane, we note that for each of these vectors holds. As the distortion matrix is assumed to be constant over the area spanned by the reference points, x i and x j represent an arbitrary point within this area. Reformulating the part on the right-hand side of Equation (10) yields a linear transformation T ij between vectors in image j to vectors in image i as Using Equation (11) to solve for the entries in A(x) given the reference points x iα is a total-least-squares parameter estimation problem. It differs from the standard least-squares parameter estimation because the vectors between the reference points on both sides of the equation are subject to measurement uncertainties.
To account for this, we introduce one latent variable x i per multiple image, the so-called anchor points in [18] and [42]. Solving the total-least-squares parameter estimation yields ratios of convergences, f ij , between pairs of multiple images i and j and the reduced shear g(x) at the positions of the multiple images It also yields the positions of the anchor points x i . Alternatively, expressed in derivatives of the Fermat potential, we can infer the analogous quantities In [18], we performed a comparison between the different parametrisations of A(x) by Equations (12) and (13) on simulated data. We found that both parametrisations have their advantages and drawbacks, so that it depends on the image configuration, which one yields more accurate, precise, and robust results. Furthermore, we showed that the larger the area of each multiple image from which the observables are extracted, the smaller the confidence bounds on the inferred local lens properties. On the other hand, the area should be so small that A(x) is still approximately constant for all x within the area. In any case, the approach needs at least three non-collinear reference points x iα , α ≥ 3, in each multiple image i and at least three images to return a unique solution for the local lens properties as given by Equations (12) or (13). The system of equations is under-determined, if T ij is diagonal, so that we cannot determine local lens properties from aligned multiple-image configurations created by an axisymmetric lens (see [18] for a detailed explanation). In addition, κ(x j ) = 1 should be avoided in the denominator of f ij in Equation (12), such that f ij does not become singular. Apart from these restrictions, it handles any configuration of at least three images including central maxima or counter images of cusp configurations. Figure 2 visualises the principle to obtain Equations (12) and (13). An implementation to solve for Equation (12) is publicly available 1 and details about it can be found in [42]. As mentioned in Section 2.2, Equation (10) shows that vectors in the source plane, y α − y β can be determined from the back-projection of the respective vectors in the image plane, x iα − x iβ , using A(x i ) as constrained by the local lens properties in Equations (12) or (13). Comparing Equations (12) and (13) with Equations (7) and (8), we find that y α − y β are only determined up to an overall scale factor because the observables do not constrain κ(x) but only ratios of convergences, f ij . An example source reconstruction using this method is performed in Section 3.2 and clearly shows that the reconstructed morphology of the source can give useful information about the relative shapes and sizes of the star-forming regions of a multiply-imaged galaxy. A more detailed analysis and comparison to lens-model-based source reconstructions is currently in preparation.
From the local lens properties given by Equations (12) and (13), we calculate the relative image parity information between the multiple images i and j, For resolved multiple images, the relative parity can be read off the telescope observation by visual inspection as a consistency check. In addition, J ij is also the magnification ratio between the two images i and j. Hence, it can be compared to the measured flux density ratios, if available, to indicate extinction due to dust, micro-lensing, or higher order lensing effects beyond convergence and shear.

The fold case
In order to uniquely solve Equation (11) for a fold configuration, we have to reduce the amount of variables in Equations (12) and (13). Therefore, we approximate f ij ≈ 1 in Equations (12) and (13), assuming that the convergence or the φ 11 (x) are equal for the areas covered by the two fold images. Thus, only the reduced shear can be determined for the two images at a fold.
The validity of the approximation f ij ≈ 1 was investigated in [18] for two analytic mass density models (the ones discussed in Appendix A, for choices of parameters, see [18]). We found that the decrease in accuracy of the approximation f ij ≈ 1 depends on the underlying mass density. Even for our simulated extreme case of a highly elliptical lens, at least one of the parameterisations in Equations (12) and (13) yielded local lens properties that were within the 2−σ confidence intervals around the true values.

Local lens properties from a Taylor expansion around a critical point
Next, we Taylor-expand the Fermat potential around a critical point x 0 , whose source is located at the so-called caustic point y 0 . Without loss of generality, we choose the coordinate systems in the lens and the source plane such that the critical and the caustic point are at the origin, x 0 = y 0 = 0. Inserting Equation (5) into the Fermat potential, the Taylor series to order four reads As x 0 is a critical point, Equation (9) must be fulfilled for the second-order derivatives in Equation (15).
Using Equation (15) as the local Fermat potential, the time-delay difference between light rays arriving from two multiple images i and j that are located close to x 0 is given by yields the lensing equation to determine the positions of multiple images in the vicinity of x 0 . From Equations (16) and (17), a system of equations is set up that contains y and derivatives of the Fermat potential at x 0 as unknowns. The observables, introduced in Section 2.1, are employed to determine the angular positions x of the multiple images with respect to x 0 , as detailed in the following sections.
Depending on the type of critical point, x f or x c , different further approximations to Equation (15) are necessary to reduce the number of unknowns, such that the system can be uniquely solved. Table 3 shows these approximations to Equation (17) to obtain the lensing equations for Sections 2.4.1 and 2.4.2. The approximations are inspired by axisymmetric and elliptical lens models (see Appendix A for two examples) and can be motivated by the fact that Newtonian gravity (as also employed for the description of the gravitational lens), is a central force. The goodness of the approximating assumptions will increase with decreasing distance of the multiple images to the centre of the potential of the deflecting galaxy or galaxy cluster and with increasing depth of this potential with respect to neighbouring gravitationally interacting objects.
Diagonalising A(x 0 ) introduces a special coordinate system, which we choose, as in [45], without loss of generality. The transformation into this coordinate frame simplifies the equations, but also complicates the combination of several multiple-image configurations in a mutual telescope observation because each multiple-image system has its own frame of reference. Table 3 summarises the specifications that we employ to embed the observables of Section 2.1 into the coordinate system given by Equation (18).
Similarly to Section 2.3, the observables allow to determine the distortion matrices at the image positions, A(x i ), i = A, B, C, as described in Sections 2.4.1 and 2.4.2. Here, x i is the centre of light of a Table 3. Synopsis of approximations to Equation (15) and embedding of the observables for fold and cusp critical points, x f and x c , in the coordinate system of Equation (18). The critical point is marked by a black dot in the figures. Degeneracies arise in the sign of slope of the critical curve at x f . Depending on the parity of image A, there are two possible configurations for a cusp.

Fold Cusp
(positive) (negative) coordinate system specifications coordinate system specifications multiple image. Again, these distortion matrices can be employed to reconstruct vectors in the source plane from corresponding vectors in the image plane close to x i up to an overall scale factor.
2.4.1. The fold case [45] derived that, to leading order, the fold configuration consists of two images that are symmetrically located around x f as shown in Table 3 (left figure). Employing φ T (x, ψ) in the coordinate system as introduced in Table 3, we derive the time delay difference between light rays arriving from the two images by Equation (16) and the lensing equation by Equation (17). The distortion matrix as the second derivative of φ T (x, ψ) with respect to x reads Hence, we make the approximation that linearly between x f and x i . We factor A(x i ) into an overall scale, φ 11 (x f ) and a reduced distortion matrix, analogous to Equation (8). Comparing Equation (19) to Equation (13), we arrive again at the approximation made in Section 2.3.1. In addition, the mirror symmetry implies that the images are of different parity, which can easily be verified by inserting x i2 into Equation (19). Due to the symmetry, x A2 = δ AB /2 and x B2 = −δ AB /2, in which the distance δ AB between x A and x B is an observable. Further observables are the quadrupole of each image around the centre of light, x i , parametrised by the axis ratio r i of the semi-minor to the semi-major axis and the orientation angle of the semi-major axis to δ AB , ϕ i , i = A, B. The validity of the symmetry assumption can thus be checked by comparing the observables of the two quadrupoles with each other. A measured flux ratio close to one between images A and B additionally corroborates the symmetry assumption, if dust extinction and micro-lensing are negligible effects (see also Section 2.3). If the symmetry assumption is applicable, the images are close enough to the critical curve to assume that the source properties are negligible. In the appendix of [17], we generally showed that the influence of the intrinsic quadrupole of the source decreases for decreasing distance to the critical point. Consequently, we assume that the observed quadrupole of the multiple images is caused by the gravitational lensing effect. We set it equal to the reduced distortion matrix in Equation (19).
Altogether, we can determine the following leading-order local lens properties from a fold configuration of resolved or unresolved multiple images with a measured time delay difference of τ AB in which the slope of the critical curve at x f , m f , is determined by the ratio of Equations (21) and (22). The sign of the slope cannot be determined. This requires a third image, as discussed in Section 2.4.2. In [17], we found that the approximation assuming ϕ i = 0 on the right-hand side of Equation (22) yields the most accurate results for this ratio of derivatives. We note that Equations (21), (22), and (23) do not contain any information about the angular diameter distances to the lens or the source plane. As derived in [20] and explained in [22], the former fact arises because the Fermat potential is an angular, projected potential, and the multiple-image shapes are the corresponding angular observables on the celestial sphere. The entire geometric information is contained in Γ (see Equation (6)) and therefore only relevant in Equation (20). Hence, the gravitational lensing geometry, and properties of the underlying cosmology, can only be constrained with measured time delay differences by this model-independent approach. Furthermore, Equations (21), (22), and (23) only constrain ratios of derivatives. This is a consequence of the fact, that the deflecting mass density is an inhomogeneity defined on top of a background mass density. Rescaling the background mass density and scaling the deflecting mass density accordingly is a choice of parametrisation and does not change the observables. As detailed in [22], Equation (20) breaks this degeneracy for fixed Γ, i.e. fixing the background cosmology and thereby the background mass density at x A and x B .
Linearising Equation (5) and applying it to fields of single weakly distorted galaxies, [62] already found that weak-gravitational-lensing observations only constrain the reduced shear g(x) and not γ(x) because the weak-lensing equations are also subject to the mass-sheet degeneracy (MSD) that is intrinsic in the lensing formalism, [63]. This finding and the MSD can now be understood with the results as stated above. Further details can be found in Section 3.6 and in [22].

The cusp case
Analogously to the derivations performed for Section 2.4.1, the leading-order local lens properties for the cusp configuration as embedded into the coordinate system shown in Table 3 (centre and right figure) can be determined. The distortion matrix which is set equal to the quadrupoles of the multiple images is given by The x 2 -axis of the special coordinate system of the fold is aligned with the relative distance between the two images, such that the observables directly correspond to the variables in the equations. Contrary to that, the coordinate system for the cusp configuration is constructed such that its x 1 -axis is the symmetry axis of the parabolic approximation of the critical curve around x c . Therefore, the quadrupoles of the multiple images have to be rotated into this coordinate system first, i.e. calculating the ϕ i with respect to the positive x 1 -axis from the relative anglesφ i as shown in Table 1. Appendix B shows the equation to obtain the rotation angle which transforms a configuration as shown in Table 1 into the coordinate system shown in Table 3. In this appendix, we also give the coordinates of the image closest to the cusp, x A . This determines the position of x c with respect to x A from the observables, analogous to x f being half the distance between image A and B in Section 2.4.1.
Without loss of generality, we denote the two images that are closest to x c by A and B, as shown in Table 3. Any combination of two images from A, B, C can be used to determine the following leading-order local lens properties. The most accurate results are obtained for images A and B closest to the expansion point of the Taylor approximation, as we showed in [17]. Summarising the calculations of [16], [17], and [18], we obtain as leading-order lens properties from the relative distances between the multiple images, the quadrupoles, and a time delay difference of τ AB between images A and B As in Section 2.4.1 and for the same reasons, we only obtain ratios of derivatives from the quadrupole observables and, as before, the time delay difference breaks this degeneracy. To leading order, the cusp configuration is a linear superposition of two fold configurations, as investigated in detail in [17] and [45]. This also becomes apparent when we compare Equations (21) and (26) 2 . We see that Equation (26) is the sum of the |φ 12 (x)| of images A and B, while, for Equation (21), images A and B are described as mirror images of each other, so that |φ 12 (x)| of one image, A or B, is sufficient to determineφ 122 (x 0 ). Treating images A and B as a fold configuration, the sign of the slope in Equation (21) can be determined from the position of image C, as the sign of the slope at x f between images A and B should be consistent with the slope of the parabola given by m c . The leading-order Taylor expansion of the cusp configuration requires a rotation into a coordinate system that is not directly aligned with an observable and it needs a fourth order 2 The expansion point is a different one in the two cases, yet, the underlying principle to determineφ 122 (x 0 ) from the off-diagonal entries of the quadrupole moment of images A and B is the same. derivative, i.e. an expansion to one order more than the fold configuration. Consequently, the local lens properties determined at x c are less accurate than the ones at x f , see [17] and [18] for details. The remaining degeneracy is shown in Equation (27). Consistently to Sections 2.3 and 2.4.1, only the relative parities between the three images are known. Hence, there are two possible pathways for the critical curve around the three images, as further detailed in [48]. The first with the minus-sign in Equation (27) encloses image A in the parabolic approximation to the critical curve (called a positive cusp, as shown in Table 3 (centre)). The second with the plus-sign in Equation (27) encloses images B and C, such that image A lies on the negative side of the x 1 -axis (called a negative cusp, as shown in Table 3 (right)).

The giant-arc case
"Golden lenses" are highly (axi-)symmetric galaxies that show multiple images in form of giant arcs (see Table 2 (right)) close to the (almost) circular critical curve, or even complete Einstein rings, see e.g. [64] or [65]. Giant arcs and Einstein rings as highly distorted and extended images provide non-local information in the sense that they trace the critical curves in a larger region compared to less distorted and therefore locally more constrained multiple images that are located at larger distances to the critical curves. As noted in Section 2.2, potential biases in gravitational lens modelling decrease for decreasing distance to the model-invariant critical curve, such that Einstein rings are model-independent probes of the total enclosed mass. For static sources, it is advantageous that the intrinsic source properties become negligible with decreasing distance to the critical curve as well. Multiple images of a time-varying source close to the critical curve embedded in a host galaxy that forms giant arcs or an Einstein ring are ideal configurations to constrain H 0 , see e.g. [9], or to calculate the total enclosed mass with Equation (16). Example applications are given in Section 3.6.
Symmetric multiple-image configurations with a small perturbation also constrain the properties of the perturbing mass. Using lens models, upper bounds on the temperature and the sub-galactic distribution of dark matter can be inferred, see e.g. [66], [67], [68], [69].
In [19], we investigated which properties of a perturber to an axisymmetric mass-density distribution could be determined from the observables shown in Table 2 (right) without a lens model. Due to the underlying axisymmetry, we used polar coordinates (r, ϕ) and perturbed the symmetric deflection potential, ψ a (r), around the Einstein radius r E with a small deflection potential ψ p (r, ϕ), ψ p (r, ϕ) |ψ a (r)|, such that the total deflection potential ψ(r, ϕ) reads ψ(r, ϕ) = ∞ ∑ n=0 (a n + p n (ϕ)) (r − r E ) n with the coefficients a n and p n (ϕ) determined as derivatives of the two potentials at r = r E a n = 1 n!
To leading order, the lens equation yields the model-independent local lens properties κ a (r E ) denotes the convergence of the axisymmetric part of the potential at r E . α p = (α p,r , α p,ϕ ) is the deflection angle caused by the perturber. r i − r j can be the difference between the observed radial positions of arcs A and B. Alternatively, the difference between the radial positions of the centre-close and centre-far isophote delineating one arc can also be inserted as r i − r j . Dividing Equation (31) for the radial differences of the delineating isophotes of arc A by the same equation for arc B, we arrive again at Equation (12). Since this procedure maps the two arcs onto each other, it is not surprising that the ratio of Equations (31) of arc A to B is a special case of Equation (12) for lenses with almost axisymmetric mass-density distributions. Equation (31) implies a degeneracy between the deflection angle of the perturber and the convergence of the axisymmetric lens. As further detailed in [19], this degeneracy can be interpreted as an exact source-position transformation, introduced in [70]. As explained in [22], this degeneracy originates from an a priori arbitrary split of the total deflection potential into a main lens and a perturber. Without additional assumptions or information on the mass-density profile of the axisymmetric lens at r E or on the deflection angle of the perturber, Equations (31) and (32) are under-constrained. By means of simulations, this was noted in [71]. Applying Equations (31) and (32) to their data, as detailed in [19], the observed degeneracies can be explained.
At best, different models based on a single smooth deflecting mass density or including perturbing inhomogeneities can be ranked with respect to each other in a Bayesian framework, see e.g. [26] or [72]. In this way, the most likely explanation for the perturbed, symmetric multiple-image configuration is determined from a certain class of models. In Section 3.4, we apply Equations (31) and (32) to an example, which is further detailed in [19]. Assuming a specific model for the perturber and the main lens and inserting them into the model-independent equations, we can compare the total mass of the perturber with existing estimates obtained by the Bayesian approach of [26].

Lensing distance ratios in a general Friedmann universe
SNe are standardisable candles which allow for a data-based distance measure that is independent of any specific parametrisation of the FLRW metric. Replacing the distances based on the FLRW metric (see Equation (1), using Equation (2) as E(z)) by data-based distance measures makes the angular diameter distances in Equation (3) independent of any parametrisation of the FLRW metric. This means that the data-based distances are agnostic about any partition of the total cosmic energy content into radiation, matter, or additional contributions (as done for Equation (2)). Thus, using these SNe-based distance measures, we free our lens-model-independent approach from a parametrisation of the FLRW metric as well. In particular, we avoid to set up particular assumptions about the cosmological constant or potential dark energy models.
In the standardisation of the SNe, an overall distance scale for an ensemble of observed SNe has to be fixed by complementary data. It can either be given by the absolute magnitude of a standard SNe (e.g. from measurements in our local cosmic neighbourhood, see [73]), or by H 0 (as determined in our local cosmic neighbourhood, [74], or from the cosmic microwave background, [59]). This implies that data sets of SNe determine E(z) and leave H 0 as a free parameter. The Pantheon sample, as detailed in [36], is the most recent compilation of 1048 unscaled SNe with redshifts z < 2.3. This redshift range covers a large fraction of the universe, in which gravitational lensing effects occur. Therefore, we determined angular diameter distances based on the Pantheon sample in a flat FLRW metric.
First, we expanded the scale-free, observable luminosity distancesD L (0, z) into a complete, orthonormal set of analytic basis functions b j , j = 0, ..., n b , the so-called Einstein-de-Sitter polynomial basis as introduced in [37 From the Pantheon sample of SNe, four coefficients for the first four basis functions can be significantly determined given the measurement precision and the covariances in the sample. Inserting these scale-free luminosity distances represented by the Einstein-de-Sitter basis into the equation and requiring that E(0) = 1, we obtained a data-based cosmic expansion function, E(z), as first noted by [75]. Subsequently, we inserted this E(z) into Equation (1) together with a value for H 0 , e. g. as constrained by [59] or [74] to obtain a distance measure.
Replacing the model-based angular diameter distances in Equation (3) by the data-based ones, we obtain the data-based lensing distance ratio, as detailed in [21]. Leaving H 0 as a free parameter, we can use time delay differences to constrain its value, if the difference of the Fermat potentials between the image positions is known. Alternatively, with a fixed value for H 0 , the time delay differences between two images constrain the difference between their Fermat potentials (see Equation (16)).
In Section 3.5, we compare the relative precision of the lensing distance ratio of angular diameter distances as obtained from the Pantheon sample with the lensing distance ratio based on the angular diameter distances of the Friedmann model as established by [59]. Alternatively, given the Fermat potential difference between two images, a measured time delay difference between these images infers a value for H 0 . Constraints on its precision are estimated in Section 3.6.

Results
In this section, we discuss example applications for our approach, theoretically outlined in Section 2, to observational data. In Section 3.2, we show the application of the approach of Section 2.3 for five resolved images of a background galaxy behind the galaxy cluster CL0024 observed with the Hubble Space Telescope (HST). Further details about this example can be found in [42]. Section 3.3 subsequently treats the application of our approach of Section 2.3 to four unresolved, extended images of a quasar background source behind the spiral galaxy B0128, as further detailed in [43]. For this application, very-long-baseline interferometry (VLBI) observations in the radio bands by [58], were employed. In Section 3.4, we discuss the degeneracies detailed in Section 2.4 that arise for the two giant arcs of a background galaxy caused by the early-type galaxy B1938+666. We review the precision up to which the lensing distance ratio in Equation (6) can be determined from the Pantheon sample of SNe type Ia, [36], in Section 3.5. The details to set up the angular diameter distance measure from the sample are found in [21]. Section 3.6 sets up theoretical limits on the precision to which H 0 can be determined by Equation (16). This section is based on the findings in [22] and supported by data from quasar time delay monitoring, see e.g. the paper series of [76], and FRB observations, discussed in [44].
In [17] and [18], we used simulation data based on simple lens models to test the accuracy of the approximations detailed in Sections 2.3 and 2.4. The relative image distances and the parameters employed for the lens models considered in these simulations were extreme cases compared to observations and lens model parameters as inferred by observations. Despite this, we obtained inaccuracies of about 10% and mostly less for the ratios of third order derivatives at folds and cusps. Most of the f ij and g(x i ), i, j = A, B, C, lay within the 1−σ confidence bounds around the true value. As a consequence, we expect our approach to yield more accurate results in average-case scenarios considered in this section, so that the local lens properties and pathways of the critical curves in the vicinity of multiple images can be constrained more tightly.

Data preprocessing
Before any observables of Tables 1 and 2 can be extracted, the observational data has to be preprocessed. Effects of the data acquisition, such as shot noise or the impact of the point spread function (PSF), have to be taken into account. In addition, astrophysical effects like stray light from or partial occlusion by neighbouring objects must be handled, see e.g. [77] or [78] for a detailed explanation of all effects.
In order to keep our approach model-independent, the data preprocessing should not rely on any model. Data processing methods that employ a model-based separation of the foreground deflecting galaxy from the multiple images for galaxy-scale lensing can be found in [79], [80], or [81]. Furthermore, we separate the extraction of the observables from the application of our approach to these data. In this way, the lensing analysis pipeline is kept as flexible as possible to use our approach for data of various ground-based and space-based instruments and the impact of different preprocessing methods on the local lens properties can be easily compared.
Since the PSF of the HST is of the order of one pixel and thus much smaller than the extensions of the multiple images, effects of the convolution with the PSF are negligible. For Section 3.2, the reference points for in the multiple images were identified manually directly from the FITS-file provided in the HST archive. The same applies to the observables in Section 3.4. The observables in the radio band, including the quadrupoles of the multiple images as required for the application in Section 3.3, were already measured by [58] using the National Radio Astronomy Observatory Astronomical Image Processing Software package AIPS.

Comparison to lens modelling approaches for the cluster-scale lens CL0024
CL0024+1654, abbreviated as CL0024, is a massive galaxy cluster at redshift z l = 0.39. It consists of two components merging along the line of sight (as detailed e.g. in [82], [83], [84]) and it magnifies a blue galaxy at redshift z s = 1.675 into five highly-distorted, well-resolved images, as first described by [85] (see images 1.1 to 1.5 in the yellow ellipses in Figure 3). In Figure 3, the yellow points in the ellipses mark the coordinates of the multiple images when they are treated as point-like constraints of global lens reconstructions. In the details at the sides, the six reference points that can be identified in all multiple images are marked by red circles. We extracted the reference points in all multiple images as observables from an HST ACS/WFC picture in the F475W band 3 . Using these reference points, we applied our approach of Section 2.3 to these multiple images to constrain the local lens properties in Equation (12). World-coordinate-system coordinates of all points can be found in [42]. [86] predicted ten additional multiple-image sets (MISs) from sources with photometrically determined mean redshifts z s ∈ [0.25, 4.16] (see blue points 2.1 to 11.4 in Figure 3, coordinates of these points can be found in [86]). From these additional ten MISs, we employed MIS 3, 4, 5, 8, and 10 as point-like multiple-image constraints to set up global mass-density reconstructions. We selected these MISs because they are scattered broadly in the region covered by the cluster on the sky so that we expected well-constrained global mass-density reconstructions. In addition, these MISs are located close to the resolved multiple images of set 1, so that a potential influence of constraints from neighbouring MISs on the local lens properties at MIS 1 should be observable.
As global lens reconstruction methods, we chose the parametric reconstruction method LENSTOOL, described in detail in [31] and references therein, and the free-form reconstruction approach GRALE, as detailed in [29]. Appendix C contains a brief characterisation of both algorithms in terms of their peculiarities that are relevant for our comparison. Both approaches used the same data and yielded maps of the convergence, κ(x), and the shear, γ(x), in the region covered by the multiple images. We calculated the Figure 3. Galaxy cluster CL0024 and its multiple-image sets (MISs): MIS 1 (yellow ellipses) with five images with six reference points each (red circles in image details) to apply our approach as described in Section 2.3 and with MISs 2 to 11 (blue points) as predicted by [86], but not spectroscopically confirmed. MIS 3, 4, 5, 8, and 10 were used to set up lens models that were compared to the model-independent approach. Image credits NASA/ESA/HST. Figure 3. Then, we compared these local lens properties obtained by the global methods with the local lens properties at the same position obtained by our model-independent approach detailed in Section 2.3.

local lens properties of Equation (12) from the κ(x)-and γ(x)-maps of the global reconstruction approaches at the positions of the multiple images of MIS 1 marked by the yellow points in
Implementing different constraints in the LENSTOOL and GRALE reconstructions and comparing the results among each other and with the results of our approach, we investigated 1. whether neighbouring MISs influence the reconstruction of local lens properties at the image positions of MIS 1 in LENSTOOL and GRALE and thus introduce correlations between MISs that are not accounted for in our approach, 2. whether the local lens properties at MIS 1 as obtained by our local approach and by the two global reconstructions coincide, or if image distortions beyond leading order play a significant role (apart from potential correlations between MISs mentioned under 1.), 3. whether the light-traces-mass (LTM) assumption used in LENSTOOL is corroborated and, if so, which mass-to-light ratio (MLR) is favoured for the cluster member galaxies, and 4. whether the small-scale fine-tuning of the mass density implemented in GRALE overfits the free-form lens reconstruction to the multiple-image constraints.
The local lens properties that we obtained showed broad confidence bounds for the multiple images close to the κ(x) = 1 isocontour, which was to be expected from Equation (12). The tightest confidence bounds were obtained using all six reference points which delineate the largest area in each multiple image in accordance with the statements made in Section 2.3. Relative sizes of the confidence bounds with respect to the quantities in Equation (12) ranged from 8% to 150% with an average relative size of 9.4%. The implementation of our approach took less than half a second to determine the quantities in Equation (12).
Assuring that all reconstructions are of similar quality, we found a high degree of coincidence in the local lens properties obtained by all three approaches. From this result, we concluded that 1. non-local influences of neighbouring MISs are negligible at the current precision of the quantities in Equation (12). Thus, the local lens properties are mainly derived from the local constraints of the respective multiple images. Hence, we can reconstruct the morphology of the source up to an overall scale as detailed in Section 2.3. The back-projections of all images to the source plane by their  (8) for each image, normalised such that the transformation between 1.1. and 1.2 has unit determinant. Comparing the back-projections for images 1.1 and 1.5, we see that highly-resolved, large images with high signal-to-noise ratio (SNR) yield higher resolutions in the source reconstruction than smaller images with a low SNR.
model-independently reconstructed A(x i ) and their pixel-wise averaged source are shown in Figure 4 (for comparison with model-based source reconstructions, see [85] and [87]; a physical analysis of the LENSTOOL-reconstructed source can be found in [88] and [89]). 2. the leading-order local lens properties of our model-independent approach mostly agree to local lens properties as obtained by LENSTOOL and GRALE within their 1−σ confidence bounds. 3. the LTM assumption is corroborated by the high degree of agreement of the local lens properties as obtained by LENSTOOL and the other two approaches that do not make any assumptions about the relation between dark and luminous matter. A comparison of the local lens properties of all approaches favours a constant MLR for the brightest cluster member galaxies over a non-constant MLR. This result was also found in [84] and [90]. A non-constant MLR that reproduces the fundamental plane (see [91] and references therein) yields a higher degree of agreement between the redshifts as estimated by LENSTOOL and the photometric redshifts of the additional MISs. This indicates that spectroscopic redshift observations are necessary to further constrain the redshifts and the MLR. 4. the small-scale fine-tuning of the mass density in GRALE does not significantly change the local lens properties, nor does it tighten the confidence bounds. As it does not significantly alter the pathways of the critical curves, either, it can be omitted for run-time optimisation.
Consequently, assuming that our results for CL0024 are representative, we demonstrated that a comparison of the local lens properties between the three conceptually different reconstruction ansatzes allows to corroborate or refute assumptions of global reconstruction methods. Furthermore, we showed that the source object can also be reconstructed by a back-projection of the multiple images with the local distortion matrix. Hence, a full reconstruction of the deflecting mass density is not necessary to study the morphology of individual background objects. The back-projections shown in Figure 4 are the most general leading-order source reconstructions directly based on the standard lensing formalism and the observables in the multiple images. In addition, as our source reconstruction does not assume a lens model, studying star formation and stellar populations in faint and distant, otherwise unobservable galaxies in the early universe (even at redshifts as high as 9-10) without a potential lens-model bias becomes possible. The source reconstruction is still subject to the local MSD of this MIS (see [20] for this local generalisation of the global MSD as defined by [63]). But the latter can be easily broken by rescaling the distortion matrices, as soon as the actual scaling is known, e.g. from time delay difference observations. Image credits [58], [93].

Transfer of the method to unresolved multiple images for the galaxy-scale lens B0128
In the second example which is discussed in detail in [43], we demonstrate that our approach can also be applied to galaxy-scale gravitational lenses with multiple-image configurations as observed in the radio bands. Figure 5 (left) shows the configuration of four images of a quasar at z s = 3.124 created by the lenticular or late-type galaxy B0128+437, B0128 for short. Most probably, its redshift is z l = 1.145, [92]. The observation was made in the Multi-Element Radio Linked Interferometer Network (MERLIN) 5GHz band, as detailed in [93]. The multiple images denoted by A, C, and D in Figure 5 are resolved into three clearly identifiable sub-components in VLBI observations in the 5GHz and 8.4GHz bands (see details on the left and right of the MERLIN observation), see [58]. Multiple image B is most likely scatter-broadened. The coordinates of all observational data are summarised in [43]. Analogously to the example in Section 3.2, we used the example to compare the local lens properties obtained by our approach according to Section 2.3 with the local lens properties obtained by the global free-form lens reconstruction approach PIXELENS, [24]. We investigated whether the lens reconstruction by previous global parametric lens reconstruction methods was oversimplified, because no global parametric lens-modelling approach could explain the sub-components in images A, C, and D to their sub-milli-arcsecond precision, while the configuration of the four images shown in Figure 5 (left, large figure) could be described by a singular isothermal elliptical (SIE) mass-density profile with external shear, as detailed in [58]. According to the simulations performed in [68], it was found that the anomalous flux ratio between images A and B is not likely to be caused by small-scale dark-matter clumps. Therefore, [68] concluded that the scatter-broadening in image B and an oversimplified lens reconstruction could cause the discrepancies between the fits of smooth symmetric models and the observations.
Using the sub-components of images A, C, and D as reference points, we calculated the local lens properties according to Equation (12). In addition, as the sub-components 1 and 3 in these three images were characterised by their quadrupole moment, we also calculated the local lens properties for both sub-components using the centre of light and the end points of the semi-minor and the semi-major axis as reference points (see Section 2.1). Figure 5 (right) sketches the transformations on image and sub-component scale to obtain the respective local lens properties. Due to the almost parallel alignment of the sub-components, the local lens properties determined from the sub-components as reference points Figure 6. Observational data of B1938+666: F160W band filter of HST NIC1 (left); H-band of the Keck NIRC2 camera with laser guide star adaptive optics system (centre); K'-band of the Keck NIRC2 camera with laser guide star adaptive optics system (right). North is up and East is to the left. Image credits [81].
were subject to large confidence bounds. Their relative sizes compared to the f -and g-values range from 8% to over 1000%. Comparing these confidence bounds to the ones obtained from the quadrupole moments, which mostly have relative sizes on the order of 20%, we found that the latter are smaller because the vectors spanned by the reference points are orthogonal to each other. Half of the local lens properties at the positions of the two sub-components agree within their confidence bounds, so that it remains undecided if the distortion matrix can be assumed constant over the area of all sub-components or not. Only a weak upper limit to the scale of potential higher order perturbations, like gradients, in the surface mass density at the position of the images A, C, and D may be inferred from this result. Yet, the result confirms the hypothesis of [68] that the global parametric lens reconstructions were indeed oversimplified. A comparison with our PIXELENS lens reconstruction and an additional symmetry analysis according to the method based on relative opening angles between the multiple images as detailed in [94] both further corroborate the hypothesis. All results concordantly hint at a mass-density distribution that deviates from the simple elliptical models usually assumed.

Degeneracies arising in the detection of perturbers for the galaxy-scale lens B1938+666
B1938+666, detected in the Jodrell Bank Very Large Array Astrometric Survey, [95], is an early-type gravitational lens at redshift z l = 0.88, [96]. The background source, a quasar embedded in its host galaxy at redshift z s = 2.06, [97], is assumed to consist of two components, [98]. Multifrequency Very Large Array (VLA), MERLIN and VLBI radio observations revealed that the first source component is mapped into four images and the second into two images, [99]. As observed in [100] by HST observations and later on confirmed by observations with the NIRC2 camera on the Keck II Telescope, using the Laser Guide Star adaptive optics system in [81], the source galaxy forms an Einstein ring visible in the near infrared band. Figure 6 shows the observational data in the optical and near infra-red.
Perturbations in the symmetric brightness distribution of the Einstein ring were used to infer properties of a potential perturber, like its location and its mass in [81] and [101]. As there is no evidence for luminous perturbers equal and above the mass estimate determined in [101] (see below), [81], the existence and the properties of a potential perturber were solely inferred from the observable brightness profiles of the giant arcs.
[81] and [101] used the Bayesian lens-reconstruction algorithm of [26] to infer the most likely properties of a perturber. Using the HST data, [101] arrived at m p = (1.9 ± 0.1) · 10 8 M (35) assuming that the perturber is located close to the critical curve in the lens plane and the projected distance between the symmetry centre of the main lens and the perturber in the lens plane is the total separation.
Hence, Equation (35) is a lower limit on the perturbing mass. As also confirmed in [81], the mass estimates derived from the H-and K'-band Keck data are consistent with the value in Equation (35). Furthermore, [81] followed up on these results with Keck observations and compared the precision to which the parameters of the most likely lens reconstruction can be determined to the precision obtainable when using the HST observations. For this highly symmetric case, the Keck observations were found to yield more precise parameter estimates. Nevertheless, due to the high symmetry of the main lens and the featureless, i.e. unresolved, giant arcs, degeneracies between the lens model parameters still remained large. This implies that significantly different models can fit the data equally well. To arrive at this result and test the robustness of the obtained properties of the perturber, [81] and [101] had to set up different models and probe the multi-dimensional parameter space of each model within given prior ranges.
Employing the approach outlined in Section 2.4.3 to estimate the mass of a potential perturber, we first fitted a circle to the Einstein ring of the HST data and obtained r E = 0.45 . This agrees well with r E = 0.465 , as determined in [100]. [101] use a lower value of 3.39 kpc, which corresponds to 0.444 at z l = 0.88, assuming a flat ΛCDM with Ω m = 0.25, Ω Λ = 0.75, H 0 = 73 km/s/Mpc.
Fitting two circles to the upper westward and lower eastward arc, denoted as image A and B, respectively (see Figure 6 (left)), r A = (0.50 ± 0.1) and r B = (0.41 ± 0.1) were obtained. Since the signal-to-noise ratio does not allow to resolve brightness features in the arcs like the ones observable in the multiple images in CL0024, Equation (32) cannot be applied to this case. Assuming that the perturber is a satellite galaxy with a singular isothermal sphere (SIS) as mass-density profile close to a SIS main lens, we inserted the α p (x) and κ a (r E ) for a SIS (see e.g. [45]) into Equation (31). Solving for the Einstein radius of the perturber, we obtained as a mass estimate for the perturber, if it is optimally aligned, i.e. located on the line connecting the centre of light of the giant arc and the symmetry centre of the lens in the lens plane. Comparing Equations (35) and (36), we find that both are of the same order of magnitude. Summarising the results for this example, as further detailed in [19], we showed that our simple Equations (31) and (32) to relate lens properties with observables allow us to infer lens properties that are in agreement with sophisticated lens reconstruction algorithms like [26], which implement more realistic classes of main lens and perturbing mass-density profiles. In addition, Equations (31) and (32) clearly show the most general transformations between the perturber and the main, axisymmetric lens that leave the observables of the giant-arc brightness profiles invariant. Similar equations could be analogously derived for main-lens mass profiles with other symmetries (e.g. ellipses). But, as also noted in [66], without additional non-lensing information on the properties of the perturber, like its location (on the sky and also along the line of sight) and its brightness profile, the properties of the potential perturbers remain highly degenerate.

Lensing distance ratios from the Pantheon sample of supernovae
As outlined in Section 2.5 and detailed in [21], we used the Pantheon sample of SNe type Ia of [36] tand H 0 from [102] o set up data-based angular diameter distances for the lensing distance ratio in Equation (6). We abbreviate the lensing distance ratio by To investigate the precision of the data-based D(z l , z s ) as established with the Pantheon sample, we implemented a Monte Carlo simulation of 1000 Pantheon-like data sets, each containing 1048 simulated, scale-free luminosity distances at the same redshifts as the Pantheon sample. Each of the 1048 data points was generated by drawing a random number from a Gauss distribution around the measured scale-free luminosity distance from the Pantheon sample with a standard deviation given by its measurement uncertainty. With these simulated data sets, the confidence bounds on D(z l , z s ) can be determined for any lens and source redshift in the range of the Pantheon sample (z ≤ 2.3), given a value for H 0 . As confidence bounds, we calculated the standard deviation around the mean D(z l , z s ) and the confidence bounds based on percentiles, given by the 68%, 95%, and 99% confidence levels. In principle, the precision of D(z l , z s ) also depends on H 0 and its uncertainty. Yet, due to the still ongoing debate over the tension in the value for H 0 , [59], [74], we only considered the relative precision of D(z l , z s ) and did not include the imprecision of H 0 in its uncertainty.
To compare the precision of the data-based D(z l , z s ) with the one obtained from the ΛCDM standard cosmological model as determined by [59], we developed a second Monte Carlo simulation. We drew 1000 different cosmological models from a Gaussian distribution around the cosmological parameters as determined by [102], Ω m = (0.3089 ± 0.0062), Ω Λ = (0.6911 ± 0.0062), with their measurement uncertainty as standard deviation 4 . For each of the 1000 cosmological models, we determined model-based luminosity distances at the redshifts of the Pantheon sample, given H 0 = 67.74 km/s/Mpc. With this catalogue of 1000 data sets containing 1048 data points each, we analogously calculated the relative precision of D(z l , z s ), leaving out the imprecision of H 0 .
As D(z l , z s ) is dependent on z l and z s , the best visualisation is obtained when fixing z l and plotting the relative imprecision of D(z l , z s ) for all z s , as shown in Figure 7. In addition to the comparison at z l = 0.5 established in [21], we also show the relative imprecisions for D(z l , z s ) at z l = 0.25 and z l = 1.0 as given by the confidence bounds of the simulated data sets.
By comparing the left-and the right-hand side of Figure 7, we find that there are one to two orders of magnitude difference in the imprecision for the data-based (left) and Planck-model-based lensing distance ratio (right). The precision to which the data-based lensing distance ratio is determined is constrained by the measurement uncertainties of the SNe and the choice of the set of basis functions. Among the analytic sets of basis functions that we tested in [21], the Einstein-de-Sitter basis with four basis functions was found to yield the tightest confidence bounds and the most accurate reconstruction of a flat ΛCDM model. Its inaccuracies, determined in a simulation of a flat ΛCDM model, are contained within the confidence bounds of the imprecisions to avoid biases. In addition, the number of basis functions was determined in a χ 2 -parameter estimation with a reduced χ 2 -value of 0.9 to avoid overfitting. Details about the quality test of all basis functions are given in [21]. With further SNe data or a higher measurement precision, the confidence bounds can be further reduced and the number of basis functions increased.
The confidence bounds of the lensing distance ratio in a Planck-model-based cosmology are tighter and negligible compared to other sources of uncertainty in the time-delay equation (see Section 3.6). Yet, the tighter confidence bounds come at the cost that the model may be currently incomplete or it may become biased for an increasing amount of future data. While the extensions of the ΛCDM model that can be included to amend the bias have a direct physical interpretation, e.g. an additional curvature term or any forms of dark energy, it is not clear which extension approach should be followed. Contrary to that, an extension of the Einstein-de-Sitter basis with four basis functions is straightforward. Assigning the basis functions a physical interpretation is the difficult part in this approach.
3.6. Usage of the time-delay equation to infer H 0 As shown in [22], H 0 is uniquely determined by a time-delay-difference measurement in a FLRW background cosmology if the total integrated deflecting mass along the two light paths is known. Yet, observations to tightly constrain the total mass are rare, especially if we include the small-scale perturbing masses not associated with the most massive deflecting object along a light path. In addition, 4 As we consider the late-time cosmology with z ∈ [0, 2.3], Ω r = 0 in Equation (2) to good approximation. parametrisations of the FLRW background cosmology are still subject to debates about potentially required extensions to the current concordance ΛCDM standard model (see Section 3.5). Therefore, current approaches to determine H 0 first solve the time-delay equation for the lensing distance ratio D(z l , z s ), Equation (37). For τ ij , the observed time-delay difference between light rays arriving from the multiple images i and j is inserted and ψ) is derived from a lens model. Since D(z l , z s ) contains the full information about the cosmological geometry of the multiple-image configuration, subsequently inferring H 0 from it conveniently allows to investigate different parametrisations of the FLRW background cosmology without repeating the lens reconstruction.
In practice, reconstructions of φ ij are subject to degeneracies. In our approach as outlined in Section 2, a local MSD at x i and x j remains to be broken, i.e. φ ij between the two light rays can be scaled by a factor and H 0 can be rescaled accordingly to keep the observed time delay difference invariant. As, by definition, H 0 is linked to the overall energy density of our universe today, this freedom of rescaling can be interpreted as redefining the split between the cosmological background energy density and the deflecting mass density of gravitational lensing on top of it. Other approaches usually split the total mass along the line of sight into a main deflecting mass and small-scale perturbers along the line of sight. In this case, the factor λ appearing in a MSD consists of two parts: one part that can be associated with the small-scale perturbers and neighbouring masses of the main deflector, which is usually called external convergence κ ext , see e.g. [72]. The second part of the factor is the remaining MSD far away from the lens, in the region where only the embedding cosmological background density remains. This explains the argument stated in [71] that the MSD cannot be solely caused by κ ext .
On galaxy-cluster scale, [103] systematically investigated the relative precision to which H 0 can be determined from the well-resolved system of multiple images of the host galaxy of SN Refsdal and a plethora of multiple-image configurations from different background sources at different redshifts alone and in combination with different non-lensing observables and assumptions. They found that a combination of a flexible free-form with a consistent parametric lens reconstruction is capable of alleviating all occurring degeneracies to obtain H 0 to a relative imprecision of less than 10%. The precision of the inferred H 0 -value greatly depends on the configuration of the multiple-image sets, the measured time-delay differences but even more on the additional assumptions being made.
Galaxies as lenses are usually described by comparably simple models, yet, the high symmetry and the sparsity of multiple-image configurations also require additional non-lensing observables, like the velocity dispersions along the line of sight, to be included in the model, see e.g. [104]. On both scales, potential biases due to the choice of a specific model (class) and degeneracies between the model parameters remain, see [22] and references therein. To further decrease the imprecision in H 0 , several multiple-image configurations with measured time delay differences can be jointly analysed, as done e.g. in [105], [106], or [107].
Having obtained a value for D(z l , z s ), the definition of model-based angular diameter distances, Equation (1) usually using Equation (2) (or E(z) for an extension of the ΛCDM model), is inserted to relate D(z l , z s ) to H 0 . To determine H 0 from the time-delay equation in approaches based on Bayesian parameter inference, see e.g. [10] and [35], parameters of the expansion function are either fixed, as in Section 3.5, or considered as nuisance parameters and are marginalised over. Marginalising implies that the imprecision of D(z l , z s ) for Planck-model-based distances, is larger than the ones shown in Figure 7. As found in [35], the currently lowest imprecision for H 0 inferred from a single galaxy-scale lens is 3-4% at 68% confidence level. On galaxy-cluster level, the lowest imprecision for H 0 is 4-5% at 68% confidence level obtained from the multiple images of SN Refsdal in MACS 1149, [10].
To estimate the precision to which H 0 can be theoretically determined with a single time-delay-difference measurement when using the Pantheon-data-based distances, we rearrange Equation (38), such that withΓ = H 0 Γ (see Equation (6)). Then, we determine the relative imprecision of H 0 as assuming that the relative imprecision inΓ is dominated by the imprecision in D(z l , z s ) and the measurement uncertainty in z l is negligible. From Figure 7, we read off the relative imprecision of the data-based lensing distance ratio of about 2% at 68% confidence level. Constraints on the second term for different time-varying sources in galaxy-cluster-scale lensing can be found in [44]. For a quasar, we found a relative imprecision in the time delay difference of 1.5%. In [108] for quasars multiply-imaged by galaxy-scale lenses, it is stated that 3% imprecision are obtained by the best algorithms. For FRBs, the second term is negligible with absolute measurement imprecisions on the order of milli-seconds compared to time delay differences of the order of days or longer. Yet, by far the largest source of uncertainties is the third term, see e.g. [103], [109], [110]. A systematic investigation for lens-modelling accuracy and precision on galaxy scale is currently being pursued by [7]. [107] give estimates for the accuracy and precision of models potentially achievable with HST in the 160W band of the Wide Field Camera 3 for exposure times of about 10 000 s and an astrometric precision of the order of sub-milliarcseconds in the angular positions, assuming the time-varying source is an FRB. Including the line-of-sight external convergence to the lens modelling, their optimum estimate of relative imprecisions for the difference in the Fermat potential is about 2-3%. Estimating an imprecision of 5% for the difference of the Fermat potential 5 for currently available data, we obtain ∆H 0 H 0 galaxy = (0.02) 2 + (0.030) 2 + (0.05) 2 ≈ 6.1% .
At the current measurement and lens-reconstruction precision, the imprecision of the Pantheon-data-based angular diameter distances is of the same order of magnitude as the measurement uncertainty of the time-delay-difference and smaller than the imprecision caused by the lens reconstruction. Hence, the cost for avoiding a parametrisation of E(z) in terms of specific Ω i is an increase in the imprecision in H 0 up to a factor of two for galaxy-scale lenses and by a factor of 1.4 for galaxy-cluster-scale lenses. With the currently increasing number of observations of multiple-image configurations with measured time-delay-differences this disadvantage can be compensated with a joint analysis of several configurations. Vice versa, solving Equation (38) for φ ij for a given H 0 , we found that the differences in the Fermat potential can be determined up to 2.7-3.7%, if H 0 can be measured with a relative imprecision of 1%. We showed in [44] by means of a simulation that even a single time-delay-difference observation can greatly increase the accuracy in the global lens reconstruction of a galaxy-cluster-scale lens, as it fixes the global MSD.

Discussion
So far, constraints from observables of multiple images on the total deflecting mass-density distribution only sparsely cover the area of the latter. Attempting to reconstruct the total deflecting mass-density distribution in the entire lensing region on the sky is thus an under-constrained optimisation problem. To solve it, additional assumptions are usually employed which capture the global shape and the overall smoothness of the deflecting mass-density distribution. One way to capture the shape uses physically motivated parametric models of the mass-density profile, like an Navarro-Frenk-White profile, or a superposition of parametric mass-density profiles. Another common way decomposes the deflecting mass density into a set of orthonormal basis functions and determines the weights of the leading basis functions by means of the observables. In both methodological approaches, a regularisation has to be introduced to avoid overfitting. The regularisation also sets the smoothness of the mass-density distribution. In Section 1, we gave examples of such approaches for galaxy-scale and galaxy-cluster-scale lenses. 5 The reconstruction of φ ij might include angular diameter distances between us, the source, and the lens, which correlates the imprecisions between φ ij and D(z l , z s ). For the following estimates, these correlations are ignored.
Contrary to these global ansatzes that treat the reconstruction of the deflecting mass-density distribution as a global model fit and intertwine data-based evidence and model assumptions, we develop an approach to locally characterise strong gravitational lenses solely by observables of the multiple images that they create. Our approach exploits the symmetries and correlations between the observables in the multiple images and thereby avoids introducing specific lens models. During its development, we were able to mathematically derive all degeneracies, like the mass-sheet degeneracy, treating the single-lens-plane gravitational-lensing formalism as partial differential equations. Subsequently, we found a physical interpretation for the respective invariance transformations that explains why they appear.
As our method does not reconstruct the global underlying mass density, it can be applied to any multiple-image configuration on any lens scale in the same way. So far, we have employed relative distances between multiple images, features in the brightness profiles of resolved multiple images, multipole moments of the brightness profiles up to the quadrupole of unresolved multiple images, and time delay differences, if the source has a time-varying component. From these observables, we calculate local lens properties that are not subject to any degeneracies, if time delay differences in a fixed background metric are used. For static sources, the local equivalent of a mass-sheet degeneracy remains to be broken. This is a consequence of setting up the standard gravitational lensing formalism as a theory of light deflection caused by inhomogeneities on top of a cosmic background metric and only observing angular quantities on the celestial sphere.
To leading order, our approach determines ratios of deflecting mass densities, the reduced shear, and the linear and parabolic approximations of the critical curves in the vicinity of the multiple images. We showed the respective equations in Section 2. From simulations, we found inaccuracies of the order of and less than 10% for extreme cases of large deflecting masses and highly elliptical density profiles. For realistic deflecting mass densities, we expect them to be lower and in the range of a few percent, which will be investigated in N-body simulated gravitational lenses. Imprecisions for some observational cases were found to be much larger, but the imprecision greatly depends on the type of multiple image and the positions of the multiple images with respect to the scaled mass density isocontour κ(x) = 1 or to the isocontour of φ 11 (x) = 0, depending on the parametrisation that is chosen.
The set of local lens properties is the maximum leading-order local information in which all lens models coincide and that is directly constrained by the observables in the multiple images in the standard single-lens-plane gravitational-lensing formalism. Formulated differently, our model-independent approach partitions the lensing region on the sky into areas where local lens properties are directly inferred from observables, and the remaining areas where global lens reconstructions are forced to make assumptions about the mass-density distribution because no constraints from multiple images are available. With the continuously increasing number of observed multiple-image configurations in galaxy-cluster-scale lenses, areas where global assumptions are necessary are shrinking. As a consequence, the confidence bounds on the local lens properties of our approach will tighten. Hence, our data-driven approach will become increasingly useful to determine the mass-density distribution of galaxy clusters. As multiple images of different sources rarely overlap, the local lens properties for newly discovered multiple-image sets can be easily added to the already known set of multiple images and their local lens characterisations. It is particularly advantageous for galaxy-cluster-scale lenses with their highly complex mass-density distribution to solicit properties of the deflecting mass distribution at certain positions that do not rely on global symmetry assumptions and that do not jointly utilise the information of several multiple-image configurations.
On galaxy-scale, the increasing amount of observed resolved multiple images of quasars in several radio bands reveals a more asymmetric mass-density distribution of their lenses than parametric, symmetric lens models can provide, so that the local lens properties determined by our approach are useful to sketch the asymmetries in the mass-density distribution. In addition, with increasing measurement precision to extract the shapes of the individual parts of the quasar, applying our approach on the scale of a quasar-image and on the scale of the individual parts of a quasar-image, small-scale lens properties may become detectable.
Beyond constraining local lens properties, comparing the model-independent approach to global lens reconstructions corroborates or refutes the global assumptions that are employed in the lens reconstructions. For instance, the hypothesis whether luminous matter traces dark matter can be checked. Alternatively, our local lens properties can be integrated as constraints in the global lens reconstructions, so that global lens reconstructions can be envisioned that extrapolate the local lens properties, i.e. the information that all models agree upon, into areas where no multiple images are observed. Compared to existing approaches that directly intertwine observational evidence with model assumptions, the extrapolating global lens reconstructions could separate the evidence from the assumptions. In this way, updating existing lens models with new observations becomes very efficient. In contrast, if some of the multiple images are not spectroscopically confirmed, the global non-extrapolating lens reconstruction has to be redone.
In Section 3, we provided proof-of-principle example applications. They showed that our method determines local lens properties for galaxy-scale and galaxy-cluster-scale lenses and in optical and radio bands alike. For the galaxy-cluster-scale example, CL0024, we found that our approach obtained local lens properties in agreement with the local lens properties obtained by two different global lens reconstructions. For the galaxy-scale example, B0128, this comparison is still outstanding at the time of writing, but we already found that our approach and a third global lens reconstruction algorithm consistently hinted at an oversimplified parametric lens model that was assumed in previous works.
On galaxy-cluster scale, we found that the model-independent approach is also of great use for the reconstruction of the source from the observables of the respective multiple images. Our approach first determines the local lens properties in the vicinity of the multiple images and then infers the source by a local back-projection of the multiple images. Hence, the reconstructed source does not depend on a lens model. This enables us to study the morphologies of faint and distant galaxies to high redshifts free of potential biases due to a lens model.
For galaxy-scale lenses that show giant arcs, we could establish a simple set of equations that clearly show the degeneracies between an axisymmetric main lens and a smaller perturbing mass density. These equations explain the degeneracies between the main lens and the perturber which were previously discovered in simulations. Furthermore, they show that most of the observed examples of perturbed, symmetric giant-arc image configurations do not allow us to simultaneously constrain the mass-density distribution in the main lens and properties of the perturber, like its position along the line of sight or its total mass. With Bayesian lens-modelling algorithms, the most likely composition of the deflecting mass-density distribution can be determined. Prior to an elaborate modelling and Monte-Carlo simulations, our set of model-independent equations is an efficient means to exclude classes of models and parameter sets that are incompatible with the observations. Strong gravitational lenses are also used to infer the Hubble-Lemaître constant, H 0 . As the ratio of angular diameter distances in the time-delay equation is dependent on the cosmological background model, calculating H 0 from this equation implies to either fix the other parameters of the cosmological model by complementary observations, or marginalise over them. Both ways are difficult to realise in particular due to the unknown form of the parameter attributed to a cosmological constant or dark energy. To avoid this problem, we set up a ratio of angular diameter distances that is based on the standardisable supernovae type Ia from the recent Pantheon sample. The ratio of angular diameter distances is thus determined up to an overall scale factor, H 0 , and all distances (up to this overall scale) are expressed in terms of a basis-function decomposition. Purely data-based distance measures cannot be established due to the sparsity of the supernovae. Yet, the flexible set of orthonormal, analytic basis functions captures all characteristics of the data set. In addition, it avoids overfitting as the number of basis functions is limited by the measurement uncertainties of the supernovae. Using these data-based angular diameter distances in the lensing time-delay equation, the imprecision on H 0 only grows about a factor of two for the current measurement uncertainty and imprecision in the lens reconstruction compared to using angular diameter distances based on the currently best-fit cosmological standard model. This imprecision can be compensated for by increasing the number of jointly evaluated lensing configurations. If multiply-imaged fast radio bursts are observed in the future, they can replace the commonly used quasar sources, which will also yield an increase in the precision of H 0 inferred from time-delay difference measurements.
On the whole, we have introduced a new approach to strong gravitational lensing with the aim to separate model-based assumptions from data-based evidence which is a complementary and useful tool to analyse large data sets in the most efficient and objective way. A detailed discussion of the results achievable with current observations can be found in Section 3. We plan to further extend the approach including higher-order features of the multiple images and additional non-lensing observables. Figure 8 summarises our approach (grey boxes) with its prerequisites (yellow boxes) as outlined in Section 2. It shows the underlying working principle and the resulting local lens properties that can be obtained from the single-lens-plane standard gravitational-lensing formalism without using a specific lens model. The remaining degeneracies and the way to break them are also listed in addition to those cases to which the algorithm cannot be applied. Figure 8. Summary of our approach as outlined in Section 2, its prerequisites (yellow boxes), working principles and resulting lens properties (grey boxes). Each result also has its remaining degeneracies and there are cases to which the approach cannot be applied.
Acknowledgments: I would like to thank everybody who contributed to this research project, especially my collaborators, colleagues, friends, and anonymous referees for stimulating discussions, their continuing support, and contributions to develop this new approach to gravitational lensing. I highly appreciate to be a part of the very friendly and supportive gravitational lensing community and the continuous funding by the DFG.

Conflicts of Interest:
The authors declare no conflict of interest.

Abbreviations
The following abbreviations are used in this manuscript: All other derivatives vanish. We insert these derivatives into Equation (15) and take into account that the image positions r close to r c = 0.81 are smaller than one and that the coordinate system can be chosen such that x 2 x 1 . As r n > r n+1 for r < 1, the approximation shown in Table 3 can be used for cusp configurations of multiple images created by an SIE mass-density distribution. common source is maximised in the source plane. Additionally, it employs a nullspace fitness measure, i.e. the reconstruction algorithm should not generate images in regions where no images have been observed. The third fitness measure that is implemented in GRALE, but not used in the analysis of CL0024, requires that no critical line intersects non-merging multiple images. A fourth one, also not used for CL0024, is the time-delay fitness measure that constrains the model-predicted time-delay differences by observed ones. In order to determine the optimum number of basis functions and their weights, a genetic algorithm samples the space of all potential solutions to this multi-objective optimisation problem on the current grid configuration. The refinement of the grid is subsequently performed until an acceptable reconstruction is obtained.
The number of parameters to be determined by the data is usually larger than for parametric approaches, as e.g. for LENSTOOL. Therefore, allowing for local fine-tuning, e.g. by adding masses in regions without multiple images, free-form methods are more prone to overfitting than to introducing biases. A good estimate of the quality of fit of such parameter-free models to the sparse amount of observations is the size of the set of all possible solutions allowed by the observables. The smaller this set and the tighter the fit, the more constraining the data. GRALE averages over several acceptable solutions generated by the genetic algorithm to arrive at the final mass-density reconstruction, such that the standard deviation from this average is a measure for the constraining power of the observables. Hence, to optimally constrain local lens properties with GRALE, the tightest confidence bounds are achieved when using constraints from a lot of reference points in the multiple images of the same MIS close to the positions of interest 6 . This is the opposite way to obtain the tightest confidence bounds compared to LENSTOOL.