Unveiling Women’s Needs and Expectations as Users of Bike Sharing Services: The H2020 DIAMOND Project

: Within the objectives of the H2020 DIAMOND project, the paper investigates women’s needs and expectations as users of the bike-sharing service managed by Syndicat Mixte Autolib et Velib Métropole in the territory of Paris Region-Petite Couronne (France). The paper presents a thematic literature review focused on gender inclusion in bike-sharing schemes. The proposed methodological approach is based on ( i ) Geographic Information Systems for the analysis of geolocated open datasets related to land, sociodemographic and mobility characteristics of the areas surrounding each docking stations. This was aimed at identifying a short list of suitable bike-sharing docking stations, which were further characterized through: ( ii ) structured proprietary data focused on travel demand; ( iii ) onsite observations focused on universal design indicators; ( iv ) survey questionnaires focused on women’s concerns, needs and expectations; and ( v ) social media data from Twitter focused on the opinion of the end-users. Results showed that women use the VELIB’s bike-sharing service much less than men (about 30% of the total number of users), since they are more concerned about the following issues: accessibility (e.g., availability of bikes at the docking stations, distance to the nearest station, type and quality of the cycle paths); safety and security (e.g., perception of danger and insecurity while cycling and using the current bicycle infrastructures); social constraints (e.g., perceptions and cultural stigmatization associated with cycling and bike-sharing); weather and topography (e.g., impact of weather and the urban terrain on cycling and bike-sharing). The ﬁnal aim of the H2020 DIAMOND project is to support the deﬁnition of guidelines and policies for the inclusion of women’s needs in the design of future bike-sharing services.


Introduction
Encouraging the shift towards sustainable mobility strategies based on public transport, shared micromobility and active modes of travel is one of the main challenges of European cities [1], since they are increasingly facing problems of traffic congestion, road safety, energy dependency and air pollution. In this context, advanced urban planning activities are shifting towards a focus on active modes of transport [2], among which is the development of strategies and design elements which enhance the accessibility, comfort and safety of the urban setting for cycling.
Women, in fact, experience and use transport systems differently than men, since they are more concerned with economic, accessibility and security issues [9]. In particular, statistical facts and figures showed the low use of bike-sharing schemes by women and the need to increase their participation [10][11][12], given that men cycle on average three times as often as women and travel more than four times as far. One of the barriers reported by women users of bike-sharing schemes are the unsafe driving conditions and the need of more traffic rules and speed limits on public roads, with many claiming they prefer cycling in zones with lower traffic [11,13].
In this framework, the H2020 DIAMOND research project (see: https://diamondproject.eu, accessed on 30 March 2021) aims at transforming data from various sources into actionable knowledge for ensuring the inclusion of women's needs and expectations in transport systems. The research follows a gender-sensitive approach that brings together urban and mobility experts, transport authorities, computer and data scientists, mobility economists and social scientists. The project focuses on four Use Cases: (i) Public Transport Infrastructures (Railways); (ii) (Emotion in) Autonomous Passenger Car; (iii) Vehicle (Bike) Sharing Fleet Management; (iv) Employment of Women in Rail Industry and Freight/CSR Protocols. Within the objectives of the third Use Case of the DIAMOND project, the paper proposes a data driven approach for investigating the level of accessibility, comfort and security for women of the bike-sharing service that is managed by Syndicat Mixte Autolib et Velib Métropole (VELIB) in the territory of the Paris Region-Petite Couronne (France).
The methodological approach that sets the current research work is based on the use of Geographic Information Systems (GIS) for the analysis of several geolocated structured open data focused on: (i) land characteristics (e.g., urban fabric of land use, points of interest); (ii) sociodemographic characteristics of the inhabitants (population density, gender, age and nationality of the inhabitants); (iii) mobility characteristics (e.g., transport services; cycling infrastructures). This was aimed at identifying and characterizing a short list of suitable bike-sharing docking stations, as characterized by high and low levels of accessibility for women. Results of GIS-based analysis were merged with VELIB's proprietary data related to the travel demand of the selected docking stations. Then, the selected docking stations were further investigated through onsite observations focused on universal design indicators, survey questionnaires and social media data focused on women'concerns, needs and expectations related to the bike-sharing services. The disaggregated data was used to understand and trace the mobility patterns of women as users of the bike-sharing services, to ask their opinion and to identify the factors important for them, in order to plan and design a fair, gender equitable and integrated bike-sharing schemes.
First, the paper proposes a thematic literature review focused on gender inclusion in bike-sharing schemes (Section 2), in order to provide a preliminary assessment of women's needs and barriers (i.e., Fairness Characteristics). Then, it presents the methodology which sets the current work (Section 3) and the results of the analyses (see Section 4) with reference to: Structured Open Data, Travel Demand Data, Onsite Observations, Users' Satisfaction Index Questionnaires and Social Media Data from Twitter. The paper concludes with final remarks about the achieved results and future work.

Literature Review
A bicycle-sharing system, or public bike share (PBS) scheme is a transport service in which bicycles are provided for a shared use on short term basis to individuals for a fee or free. Bicycle-sharing contributes towards achieving sustainable and inclusive transport services in urban centres. Women make more shorter and multipurpose or multistop trips and complex trips than men, due to the constructed normative gender roles in most societies [14,15]. The complex mobility pattern of women due to caring and parenting responsibilities makes the use of traditional public transportation time consuming and inconvenient [14][15][16]. Cycling or bicycle sharing services provide a better alternative to meet the complex urban mobility needs of women. Existing evidence suggest that traditionally, women cycle less than men [17]. However, recent evidence from the UK suggest a narrowing gender split of bike-share usage compared to general cycling [18,19].
The aim of the H2020 DIAMOND project is to investigate the mobility needs and challenges of women in relation to bike sharing services in order to produce guidelines for providing more inclusive infrastructure, improve planning and distribution of docking points, engender fair inclusion for women and address the social imbalance within the domain of transport. Therefore, the final goal of the research is to identify and reduce the barriers preventing women from using bike sharing services and, then, to increase the percentage of women using bike sharing services to meet their mobility needs.
Bicycle-sharing services is fraught with several barriers and limitations, which prevent women from using the services. Taking advantage of a preliminary work already presented by the authors [20], the review of extant literature focused on identifying pertinent issues and challenges bothering on women mobility experience as users of bike sharing service and barriers preventing women from using bike sharing services. Literature review was conducted through several academic database (e.g., Web of Science, Scopus, Google Scholar, ResearchGate, etc.) and organized in a tabular structure (see Table A1 and Figure A1). Overall, the process allows the identification of about eighteen Fairness Characteristics (FCs) influencing women participation in bike-sharing systems, which were validated through the execution of several focus groups and semistructured interviews. The FCs have been grouped into four Cluster of Fairness Characteristics (CFCs): 1.
CFC-Accessibility & Spontaneity (e.g., availability of bikes at the docking stations, distance to the nearest station, type and quality of the cycle paths, etc.); 2.
CFC-Safety & Security (e.g., perception of danger and insecurity while cycling and using the current bicycle infrastructures, etc.); 3.
CFC-Social Constraints (e.g., perceptions and cultural stigmatization associated with cycling and bike-sharing, etc.); 4. CFC-Weather & Topography (e.g., impact of weather and the urban terrain on cycling and bike-sharing, etc.).
The availability of bikes at the stations when needed, the distance to the nearest station and the type and quality of the cycle paths available are some of the issues users have to take into consideration when planning a trip involving bike-sharing services. CFC-Accessibility & Spontaneity is defined in terms of those characteristics of the service related to the ease with which all women groups can access and use the bike-sharing services for a trip. Significant differences are reported between women and men cyclist and bike-share users due to their respective normative gender roles [17,21]. The fairness characteristics for this CFC include the following: • According to TfL [22], there is poor awareness of local walking and cycling routes among low-income and minority groups. Similarly, Stredwick [21] found a low-level of awareness of practical cycling (such as cycling with children or carrying luggage or shopping on a bike). This observation is described in FC-Public awareness. Public awareness campaigns of bikes sharing schemes, promoting cycling/bike-share as a legitimate form of transport and offering lessons on practical cycling amongst women and minority groups could ramp-up women interest in cycling and using bike-sharing services. • Majority of the bike-sharing services rely on the use of credit/debit card, smart phones and require internet access to sign-up for membership and for rental. This requirement prevents potential users of bike-sharing services from using the services [23]. This is highlighted in FC-Sign-up and booking process. McNeil et al. [24] found that some individuals from minority and low-income background and the elderly are less likely to have smart phones, reliable internet access or credit/debit cards. • Accessibility in terms of FC-Membership cost, includes entry cost, rental charges above a stated threshold, the cost of other essential cycling accessories (such as helmet, clothing etc.) and a possible liability cost resulting from the theft or damages of the bike. Bikeplus [25] found the entry cost of bike-sharing services to be lower than the cost of owning a bike; however, entry costs and rental charges above a stated time limit of some schemes are observed to be higher [26], which come as a major barrier to the full adoption of the services, particularly, by commuters from minority and low-income groups [27]. • FC-Spontaneity of accessing bike/dock relates to the possibility of finding a bike at the station and finding a vacant docking point at the trip destination station to return a bike after the trip in a traditional dock-based bike-share systems. This relates to the reliability of the service and significantly influence trip makers decision on whether or not to include bike-sharing as an option in their daily travel plans [28]. This has been reported to be one of the major barriers to cycling. The inability to guarantee a bike at the station or an empty docking point to return a bike after the trip when needed has resulted in many users giving up on using the schemes or using them for important trips [18]. • FC-Proximity of docking station relates to how far a user has to walk to pick-up a bike or walk to the trip destination after returning a bike. Some users report travelling further from their trip destination to return a bike after their trip because the station at their destination did not have an empty dock to return the bike. The spatial distribution of docking stations is seen as a critical factor influencing bike-share usage; the proximity to members as well as to low-income and minority neighbourhoods promotes membership [26]. • Women are mostly encumbered due to their parenting and gender role; this significantly affects their mobility options [29]. FC-Travelling with children/carrying things, describes the lack of child seat and good-sized carry baskets on most bike-sharing services limits the use of such services for shopping trips and trips involving children by women. This raises gender and social justice concerns since women make more 'escort' trips with children, and more shopping trips, than men.
• FC-Insufficient infrastructure focuses on the lack of protective Infrastructure (segregated cycle infrastructure), discontinuity of the cycle infrastructure including cycle path and cycle facilities. This raises safety concerns particularly for users with dependants who considers the infrastructure unsafe for cycling with kids and identified as barriers to women in the use of bike-share services [21,30]. Similar to road network improvement, cycling infrastructure makes cycling and bike-share attractive to users and potential users [23,31].
The second CFC-Safety & Security relates to factors influencing the perception of danger and insecurity while cycling and using the current bicycle infrastructures. The fairness characteristics for this CFC include the following: • FC-Driver behaviour: While more research is required to understand why more women experience and report more incidents than men, cyclists believed most of the near misses and scary incidences can be blamed on factors such as speed, drivers passing too close, negligent opening of a vehicle door and aggressive driving on the road, which is preventable [32]. The attitude and behaviour of drivers towards cyclists is seen as a major deterrent to cycling and the use of bike-sharing services, particularly among women [23,33]. • FC-Separate infrastructure: while sharing the road space with motor vehicles seems problematic for females, it is believed that the slower cycling speed of women on the road [34] could be a plausible explanation as drivers become impatient with the slow riding speeds. Developing a safe and protective cycling network separated from vehicular traffic have a positive effect on women perception of safety and could get more women cycling and using bike-share [21,30]. Segregated infrastructure, alongside interventions targeted at road culture and driver behaviour, is suggested to have stronger influence on the rate of cycling of women [35,36]. • FC-Harassment: Women are susceptible to harassment, verbal abuse and attacks in the public space and are more likely to report sexiest harassment from other road users when cycling than men [21,37,38]. Howland et al. [17] found the fear of harassment by men and drivers is a significant barrier to cycling and the use of bike-sharing services by women. Street harassment is one of the barriers to cycling and the use of bike sharing. Consequently, continuous public education could help address this social menace. • FC-Safe environment and personal safety: The subjective safety (perceptions of insecurity) and the objective safety (measured risk level) have greater implication on the rate of cycling [36]. The likelihood that a rider was a woman is higher than a man if the cycling environment is friendly. Off-road infrastructure, on-road infrastructure without parked vehicles and residential streets are considered safer than mixed-traffic roads by cyclists [36]. The findings of our interview also suggest that the level of lighting and visibility at the stations, the presence/absence of emergency help buttons at the bikes stations and the characteristics of the lanes (width, location, lighting, etc.) have significant impact on cycling, which is consistent with the subjective and objective safety findings reported in Kumar et al. [36] and Ravensbergen et al. [38]. • FC-Confidence/experience: Less experienced cyclists, or those with little confidence on their own cycling abilities, see the interaction with vehicles on on-road cycling infrastructure more challenging and intimidating [34]. The fear women have of traffic results from the sense of inexperience and lack of self confidence in cycling [37]. These are barriers and possibly explain why women are more uncomfortable cycling in traffic [21]. Off-road and dedicated and enforced on-road infrastructure may encourage inexperienced cyclists and get more women cycling and using bike-sharing services. • FC-Traffic safety: Road safety is a gendered issue when it comes to cycling; women report twice as many incidents of 'frightening near misses' and more concerned about cycling on the road in traffic than men [32,36,39]. Cycling on the road with vehicular traffic is very intimidating for women [21] and disproportionately impacts a woman's decision to cycle [27].
There is sociocultural dimension to barriers of cycling, and this is more pronounced in women than men. This is presented in the third CFC-Social Constraints. The perceptions and cultural stigmatization associated with cycling and bike-sharing is fuelled by gender stereotyping in society and the perception that cycling is for the poor. The fairness characteristics for this CFC include the following: • FC-Subjective norm (peer influence): Important others have significant influence on behaviour and on the decision to cycle or otherwise; peers and coworker normative beliefs on cycling can influence women's participation in cycling to and from work [40]. • FC-Sociocultural constraint (negative perception): There exist culturally embedded perceptions about the symbolic value of cycling. Cycling is perceived as a transport mode reserved for people of low status, with evidence of poverty and the inability to afford a car [22,37]. The stigmatisation of cycling in most communities serves as a major barrier to the desire to cycle and use bike-sharing services. Campaigns to promote cycling as a legitimate form of transport for all income groups and encouraging all income groups to cycle could help overcome this myth, encourage cycling and the use of bike-sharing services [37]. Additionally, the appearance of women (the wearing of skirts, high heeled shoes, hair style and the likelihood of carrying a purse) constrains and limits the rate at which women cycle [41]. End-of-trip or workplace facilities and the design of bikes are suggested to address this barrier. • FC-Family responsibilities: The constructed normative roles of women including childcare prevent women from cycling and using bike-sharing services because of the complexity of cycling with children and for shopping [41]. Education on practical cycling and the possibility of cycling with children and for shopping could help address this barrier and encourage more women to cycle and use bike-sharing services.
The fourth and last CFC-Weather & Topography relates on the impact of weather and the urban terrain on cycling and bike-sharing. The fairness characteristics for this CFC include the following: • FC-Weather: The impact of weather on cycling is emphasised in literature. The demand for cycling and bike share is subject to seasonal variation and weather such as humidity, temperature, wind and rains [33,42]. • FC-Topography: The presence of hills along cycle routes have negative impact on cycling and a barrier to women urban cycling and bike share [33,43]. The development of electric assisted bikes has overcome this barrier and further makes long distance trips possible [25].

Enabling Data and Methodology
Within the scope of the H2020 DIAMOND project, the objective of the proposed analysis was to investigate the women's needs and expectations as users of bike-sharing services managed by VELIB in the territory of Paris Region-Petite Couronne (1358 docking stations in total). This was aimed at supporting the development of EU policies and guidelines for gender-equitable bike-sharing fleet management, focusing on the Clusters of Fairness Characteristics defined through the proposed thematic literature review (see In this framework, the methodology which sets the current work was based on a series of (geolocated) Structured Open Data, which were retrieved, sorted and filtered from open data repositories, national geoportals and census databases (see Section 4.1). In analogy with a previous work already presented by the authors [44], preliminary structured open data analysis was based on GIS (all GIS-based analyses presented in this paper have been performed by using the software QGIS v.3.16.1) in order to identify and characterize a short list of relevant docking stations, in which to perform further data collection activities. A series of thematic maps related to the localisation and density distribution of datasets were designed to assess the level of accessibility of the bike-sharing docking stations managed by VELIB, focusing on the following:

•
Territorial Data: density distribution of urban fabric on land use (including continuous urban fabric, discontinuous dense urban fabric and isolated structures) and points of interest (e.g., commercial activities, schools, facilities, public services, attractions, etc.); • Sociodemographic Data: density distribution of total population, female population, elderly population and foreigner population per census section; • Mobility Data: density distribution of public transport services (e.g., metro and commuter railway stations, bus stops, tram stops, etc.) and cycling infrastructure.
The proposed approach for structured open data collection allowed the identification of a short list of twenty heterogeneous and nonadjacent docking stations, characterised by positively and negatively relevant characteristics related to the objectives of the analysis. In order to further characterise the shortlisted stations, structured open data were merged with:

Structured Open Data
The proposed methodological approach is based on analysing and merging a series of geolocated structured datasets (see Table 1), in order to assess the level of accessibility for the women using the bike-sharing service managed by VELIB (1358 docking stations in total, see Figure 1), to identify and characterise a short list of twenty suitable stations (ten docking stations characterised by high levels of accessibility and ten stations characterised by low levels of accessibility). The indicators were analysed to design a multilayer map of Paris Region-Petite Couronne (France) and to estimate the spatial distribution of each dataset considering the localization of the VELIB's docking stations (see Table A2, Figures A2-A4). From a general point of view, the analysis was based on various attributes and characteristics of the urban area surrounding each docking station. To do so, raw data related to the urban scale were extracted about surrounding areas of each docking station, namely by considering the boundaries of census sections as catchment areas. A preliminary data validation phase aiming at checking any missing and/or not relevant values allowed us to filter 1297 valid docking stations distributed on 914 census sections, containing at least one docking station.
Data were postprocessed through: (i) density-based calculation on census section areas; (ii) normalisation of values (z values in a range between 0 and 1); and (iii) weighted formulas of normalised values to calculate the Territorial Data Index (TDI), Socio-demographic Data Index (SDDI), and Mobility Data Index (MDI). Quintile frequency distribution of results made possible the identification of the docking stations characterised by high levels of accessibility for the women (belonging to the highest quintile, ≥80th percentile) and the docking stations characterised by low levels of accessibility for the women (belonging to the lowest quintile, ≤20th percentile). A group of 76 docking stations belonging to three out of three highest quintiles among the TDI, SDDI and MDI was identified (distributed on 32 census sections). Then, a group of 80 docking stations belonging to three out of three lowest quintiles among the TDI, SDDI and MDI was identified (distributed on 47 census sections). The list was further shortened considering the localisation of each station in order to identify a group of twenty heterogeneous and nonadjacent docking stations characterised by positively and negatively relevant characteristics related to their level of accessibility.

Territorial Data Index
The calculation of the Territorial Data Index (see Figure 2 and Table 2) was based on the density distribution of the urban fabric (UF_cs) and Points of Interest (PoI_cs) on the census sections surrounding the VELIB's docking stations. Land-use dataset include the localization of continuous urban fabric, discontinuous dense urban fabric and isolated structures. Data analysis was aimed at estimating the level of urbanisation of the catchment areas since the level of accessibility of bike-sharing service for women greatly differs based on the urban or periurban characteristics of the surroundings. Points-of-interest datasets include a series of heterogeneous services and facilities (e.g., commercial activities, bars, supermarkets, playgrounds, sport facilities, nightclubs, university facilities, public services, tourist attractions, etc.). Data analysis was aimed at assessing the level of attractiveness of the catchment areas surrounding each docking station, considering the needs of different users' profile (e.g., commuters, students, tourists).
TDI was calculated through the weighted summation of normalised density distribution values of urban fabric and points of interest on catchment areas (see Equation (1)). The constant parameters KUF and KPoI were equally balanced (∑ constant parameters = 1). TDI was calculated through the weighted summation of normalized density distribution values of urban fabric and points of interest on catchment areas (see Equation (1)). The constant parameters KUF and KPoI were equally balanced (∑ constant parameters = 1).

Socio-demographic Data Index
The calculation of the Socio-demographic Data Index (see Figure 3 and Table 3) was based on the density distribution of the Total Population (TP_cs), Female Population (FeP_cs), Elderly Population (Elderly population dataset includes the spatial distribution of the inhabitants being over 64 years old.) (EP_ca), and Foreigner Population (FoP_ca) on the census section of the Paris Region-Petite Couronne (France). Data analysis was aimed at estimating the density distribution of the population and the age, gender and nationality characteristics of the inhabitants living in the catchment areas surrounding the docking stations managed by VELIB, as potential users of the bike-sharing service. The calculation of the SDDI relies on the density distribution of the population on the urban fabrics of the catchment areas surrounding the VELIB's docking stations, to balance the population density between urban and peri-urban areas.

Sociodemographic Data Index
The calculation of the Socio-demographic Data Index (see Figure 3 and Table 3) was based on the density distribution of the Total Population (TP_cs), Female Population (FeP_cs), Elderly Population (Elderly population dataset includes the spatial distribution of the inhabitants being over 64 years old.) (EP_ca), and Foreigner Population (FoP_ca) on the census section of the Paris Region-Petite Couronne (France). Data analysis was aimed at estimating the density distribution of the population and the age, gender and nationality characteristics of the inhabitants living in the catchment areas surrounding the docking stations managed by VELIB, as potential users of the bike-sharing service. The calculation of the SDDI relies on the density distribution of the population on the urban fabrics of the catchment areas surrounding the VELIB's docking stations, to balance the population density between urban and periurban areas.
SDDI was calculated through the weighted summation of normalised density distribution values of total population, female population, elderly population and foreigner population on urban fabric of catchment areas (see Equation (2)). The constant parameters KTP (corresponding to 0.3), KFeP (corresponding to 0.3), KEP (corresponding to 0.2) and KFoP (corresponding to 0.2) were weighted to accentuate the impact of the density distribution of the total and female populations on SDDI (∑ constant parameters = 1).

Mobility Data Index
The calculation of the Mobility Data Index (see Figure 4 and Table 4) was based on the density distribution of Public Transports (PT_cs) and cycling Road Infrastructure (RI_ca) on the catchment surrounding the docking stations managed by VELIB. Public transport datasets include the spatial distribution of metro and commuter railway stations, bus stops, tram stops, and taxi stations. Data analysis was aimed at estimating the level of connectivity of the bike-sharing docking stations with other transport services. Data analysis of cycle-way road infrastructures dataset was aimed at analyzing the level of accessibility of the docking stations.
MDI was calculated through the weighted summation of normalized density distribution values of public transports and cycling infrastructure on catchment areas (see Equation (3)). The constant parameters KPT (corresponding to 0.4) and KRI (corresponding to 0.3) were equally balanced (∑ constant parameters = 1).

Mobility Data Index
The calculation of the Mobility Data Index (see Figure 4 and Table 4) was based on the density distribution of Public Transports (PT_cs) and cycling Road Infrastructure (RI_ca) on the catchment surrounding the docking stations managed by VELIB. Public transport datasets include the spatial distribution of metro and commuter railway stations, bus stops, tram stops and taxi stations. Data analysis was aimed at estimating the level of connectivity of the bike-sharing docking stations with other transport services. Data analysis of cycleway road infrastructures dataset was aimed at analysing the level of accessibility of the docking stations.
MDI was calculated through the weighted summation of normalised density distribution values of public transports and cycling infrastructure on catchment areas (see Equation (3)). The constant parameters KPT (corresponding to 0.4) and KRI (corresponding to 0.3) were equally balanced (∑ constant parameters = 1).

Travel Demand Data
The list of bike-sharing docking stations shortlisted through the proposed GIS-based analysis (see Section 4.1) was further characterized through the analysis of Structured Proprietary Data (see Table 5), namely Travel Demand Data: number of bikes, started rentals (e.g., total number, e-bikes), ended rentals (e.g., total number, e-bikes), average distance and time duration per trip, and unique users (e.g., total number, female users).
The retrieved Travel Demand Data was post-processed to calculate the average monthly values for each of the selected docking station (see Table A3), which were analyzed to highlight the relation between demand patterns and level of accessibility for the women. The results are highlighted below and in Figure 5 and Figure 6: • The utilization/capacity ratio (i.e. relation between started/ended rentals and the capacity of the docking stations) is 93% higher in positively relevant docking stations compared to negatively relevant docking stations (see Figure A5); • Negatively relevant docking stations, while being less used, showed higher values in trip distance (+55%) and duration (+60%) in respect to the positively relevant stations. This could be caused by a location bias, since most of the negatively related stations are located in non-central areas; • The unique user ratio (i.e. the relation between the number of rentals and the number of unique users) is consistent between both positively and negatively relevant docking stations. However, the user ratio related to the docking stations located in the territory outside the City of Paris is lower (-14%) compared to the one of the overall docking stations (see Figure A6).

Travel Demand Data
The list of bike-sharing docking stations shortlisted through the proposed GIS-based analysis (see Section 4.1) was further characterised through the analysis of Structured Proprietary Data (see Table 5), namely Travel Demand Data: number of bikes, started rentals (e.g., total number, e-bikes), ended rentals (e.g., total number, e-bikes), average distance and time duration per trip and unique users (e.g., total number, female users).
The retrieved Travel Demand Data was postprocessed to calculate the average monthly values for each of the selected docking stations (see Table A3), which were analysed to highlight the relation between demand patterns and level of accessibility for the women. The results are highlighted below and in Figures 5 and 6: • The utilisation/capacity ratio (i.e., relation between started/ended rentals and the capacity of the docking stations) is 93% higher in positively relevant docking stations compared to negatively relevant docking stations (see Figure A5); • Negatively relevant docking stations, while being less used, showed higher values in trip distance (+55%) and duration (+60%) in respect to the positively relevant stations. This could be caused by a location bias, since most of the negatively related stations are located in noncentral areas; • The unique user ratio (i.e., the relation between the number of rentals and the number of unique users) is consistent between both positively and negatively relevant docking stations. However, the user ratio related to the docking stations located in the territory outside the City of Paris is lower (−14%) compared to the one of the overall docking stations (see Figure A6). • The female user ratio (i.e., the relation between female and overall users) is consistent between both positively and negatively relevant docking stations. However, the female user ratio related to the docking stations located in the territory outside the City of Paris is slightly lower (−5%) compared to the one of the overall docking stations (see Figure A7).

Onsite Observations
A list of selected bike-sharing docking stations shortlisted through the proposed GIS-based analysis (see Section 4.1) was further characterised through the execution of Onsite Observations focused on universal design indicators. In particular, an ad hoc developed checklist (see Table A4) was used for the evaluation of infrastructure design and surrounding context characteristics related to women's needs as users of bike-sharing services around the Fairness Characteristics identified through the above mentioned literature review (see Section 2). Data collection campaigns were executed from March 2020 to September 2020, by the staff of Genre et Ville (partner of the consortium of the H2020 DIAMOND project) at different time periods of the day (i.e., day and night). Onsite observations took the form of researchers visually observing the public and recording their finding on paper. In addition, some still photography of the bike sharing docking stations was carried out. Data analysis was focused on comparing data between the twenty heterogeneous and nonadjacent docking stations characterised by positively relevant characteristics (i.e., Highest Quintile) and negatively relevant characteristics (i.e., Lowest Quintile) related to their level of accessibility. The results are highlighted below and in Figure 7: • About 60% of the docking stations displayed some negative connotations in relation to spontaneity of accessing the bike service (the index is for reasons related to the maximum number of bikes observed at the docking stations or the minimum number of bikes recorded during the observation), the percentage of bikes recorded during the observation (considering the capacity of the docking station) and the number of other public modes of transport near the docking station. In addition, no bikes were reported to have a child seat for children; • The characteristics related to public awareness were equally distributed between positive and negative and it was simply measured as an assessment of the observed percentage of female users; • About 90% of observed docking stations displayed some negative features related to separate infrastructure like the presence of a cycle lane nearby the docking station (approx 200 meters around) and/or the lack of a separate cycle infrastructure; • Above 40% of docking stations had negative features related to safe environment and perceived personal safety related to features such as observation of safety of the cycle paths and lanes for cycling at night, and/or percentage of users wearing hi-visibility cycling gear, and/or percentage of worthy bikes among the fleet (e.g., well-functioning brakes, lights, etc.).

User Satisfaction Index Questionnaires
The outline of the Users' Satisfaction Index Questionnaires was designed to collect data on users' experience and perception on the bike-sharing service managed by VELIB by focusing on: (i) bike-sharing experience; (ii) the four CFCs identified through literature review (see Section 2) and (iii) general sociodemographic information. The questionnaire assesses participants knowledge, perceived challenges and opportunities of the bikesharing services as well as their expectation of bike-sharing schemes.
Data collection was carried out in the territory of Paris Region-Petite Couronne (France) from October 2020 to November 2020. The questionnaire was administered online and through intercept survey at the selected docking stations (see Section 4.1). In total 407 users completed the survey: 46 online and 361 in the intercept survey. The gender split of respondents was almost half for both sexes, 50.6% male compared to 49.4% for female. Respondents were aged between 18 to 74 years old, with the majority (69.3%) aged between 18 and 34 years old. Furthermore, 50.8% of respondents have at least a university degree. In addition, 99.5% of respondents are users of bike-sharing services, 47.2% uses bike sharing services one to five times in a week, 26.8% between 6 and 10 times a week and 26.0% more than 10 times a week. More than one third of respondents cycle to work (37.7%). Almost three-quarters did not live with any dependent (73.5%). As expected, more female than male respondents travel with dependants. Almost half of the respondents were in paid employment (49.9%); more than a quarter were students (26.8%) and a little over a half of respondents were in full-time employment (50.4%).
According to the CFCs and FCs identified through the proposed literature review (see Section 2), the UESI Questionnaire responses were analysed to identify possible differences in needs and perceptions by users of bicycle sharing services and possible challenges when disaggregated in respect to individual and demographic features. Linear regression (all statistics presented in this paper have been performed by using the software R v.4.0.5, and they have been conducted at the p < 0.01 level) was estimated using step-wise analysis in MASS R package on the prediction of satisfaction with bike-sharing services (see Table A5). The key findings of the analysis are presented below: • The booking and sign-up process is linked to user satisfaction. Easy sign-up process increases user satisfaction, perception of the reliability of the services and in other words the willingness to use the services; • The proximity of stations, in other words, how far a user must walk to pick up a bike or the distance a user has to walk after returning a bike is found to significantly affect user satisfaction. Increasing the density of docking stations or siting stations close to users makes the service more accessible and increases user satisfaction; • In general, 68.1% of respondents either disagree or are unsure about the possibility of using bike-sharing services with children. Furthermore, 70.1% of women disagree compared to 65.5% of men. The results further indicate that user satisfaction is linked to the perception about using the services with children. Users who believe trips involving children could be possible are likely to be dissatisfied with the inability to use the service with children. This suggests that supporting trips involving children could increase user level of satisfaction; • Perception of safety was linked to traffic speed, safety of the infrastructure and neighbourhood cycling environment. Lower traffic, safe infrastructure and safe cycling environment predicted increased perception of safety and consequently, user satisfaction and odds of using bike-sharing services. Similarly, visibility and adequate lighting at the docking stations predicted increased sense of security and user satisfaction; • The likelihood of cycling or using bike-share was found to depend on user level of understanding of the benefits of cycling, perception about cycling or bike-share and acceptance of cycling or bike-share as an acceptable form of transportation. The results indicate that the acceptance of cycling or bike-share as a legitimate mode of transport and knowledge about the benefits of cycling increases user satisfaction and use of bike-share; • Unsurprisingly, weather conditions had a significant impact on cycling and bikesharing; however, supply of cycling raincoats could help mitigate this and increase user satisfaction; • From the correlation index most questions were not strongly correlated; however, there were higher correlations (>0.45) around the CFC-Accessibility & Apontaneity and CFC-Safety & Security (see Figure A8).

Social Media Data
The aim of Social Media Data analysis was to complement the proposed characterisation of the selected docking stations by focusing on the opinion of a larger samples of users about the bike-sharing service managed by VELIB. Social Media Data were collected through the Twitter API from December 2019 to June 2020 by using Kalium [45], a tool that allows one to efficiently and flexibly manage the tracking of social network data in real time. In this way, we collected 44,262 tweets from 8414 users. Geospatial data analysis enabled researchers to focus on the tweets localised within the territory of Paris Region-Petite Couronne (France) and to exclude those with repetitive geographic information due to Twitter business accounts. By retrieving all messages including keyword "Velib", we were able to virtually access all conversations that take place in the social network related to this topic and get a picture of the concerns of the users with regards to it.
Although sociodemographic characteristics of Twitter users are not available, the gender of the users was inferred through the M3inference tool [46], that relies on deep learning models, trained over a large sample of users from different countries. This method was proven to achieve high accuracy for all major European countries and languages. Based on the user name, short bio and picture, the tool returns the estimated probability of a user to be man or woman. We assigned a gender to users having a probability above a threshold of 0.9 for one of the two genders. In this way we ensure that we only have reliable estimation and leave gender unassigned for the remaining users. In total, we identified 3729 men users and 916 women users, while 3769 users remained unclassified. This allowed us to focus on the differences between men's and women's discourse. We are aware that this method has the limitations of not dealing with nonbinary gender and of not further characterising women according to other demographic characteristics. Unfortunately, these limitations are embedded in the method, which allows us to study a much larger sample of users but with coarser-grained granularity. Still, we believe that this analysis can add value as it encompasses a large sample of users.
We compared the lexical differences between tweets posted by men and women using a novel method for visualising and explaining pairwise comparisons between texts [47]. In particular, we created a word shift graph based on the difference between the relative frequency of each hashtag in the tweets written by men and women. Results (see Figure 8, Tables A6 and A7) highlight the hashtags exhibiting the largest gender differences, with left bars and right bars indicating hashtags that are more frequent in tweets written by men and women, respectively. We observe a politicised conversation around VELIB for both genders, given the references to Anna Hidalgo, the current mayor of Paris, and the municipal elections that were held in 2020. However, we note a large preference of men for critical hashtags (e.g., #velibgate, #lahonte), while women used the hashtag of the mayor's campaign more often (e.g., #hidalgo2020). In addition, women make more references to the strike (e.g., #greve) and the use of shared bikes to go to the office (e.g., #velotaf).

Discussion
The paper is based on an extended GIS-based analysis of Structured Open Data for maximising the diversity of the bike-sharing docking stations managed by VELIB in the territory of Paris Region-Petite Couronne (France), which were subsequently further characterised through Travel Demand Data, Onsite Observations, UESI Questionnaires and Social Media Data from Twitter. This was aimed at ensuring that the observed cases are representative of the different situations/locations of any single docking station.
In particular, GIS-based analysis aimed to assess the level of accessibility for the women users of the bike-sharing service managed by VELIB through the investigation of: (i) the level of urbanisation and attractiveness of the areas surrounding each docking station in terms of urban/periurban contexts, available services and facilities; (ii) the sociodemographic characteristics of the population living in the areas surrounding each docking station; and (iii) the level of connectivity of the bike-sharing service with other public transport services and cycling infrastructure. This enabled to identify and characterise a short list of twenty heterogeneous and nonadjacent docking stations.
The analysis of Travel Demand Data allowed researchers to correlate the overall level of accessibility for women of the selected docking stations with a series of Structured Proprietary Data (e.g., number of started and ended rentals, trip distance and duration, number of unique users, number of female users, etc). Results confirmed that the utilization/capacity ratio of the service is influenced by the level of accessibility of the docking stations. Moreover, the female user ratio related to the docking stations located in the territory outside the City of Paris is slightly lower compared to the one of the overall docking stations.
Then, the selected docking stations were further investigated through Onsite Observations focused on universal design indicators. Results showed that the majority of the docking stations display some negative connotations in relation to spontaneity of accessing the bike service and separate cycling infrastructure. Moreover, half of docking stations had negative features related to safe environment and perceived personal safety.
The analysis of UESI Questionnaires was focused on women's concerns, needs and expectations related to the VELIB's bike-sharing services. Results showed that the overall user satisfaction is strongly influenced by the booking and sign-up process, the proximity of stations, the possibility to use the services with children, the lack of cycling infrastructure and adequate lighting at the docking stations.
The analysis of disaggregated Social Media Data collected from Twitter has been applied to further investigate the opinion of the end users about the bike-sharing service managed by VELIB's. Results showed a politicised conversation around VELIB for both genders, while women where found to make more references to the lack of reliability of the service and on the possibility to use of shared bikes for commuting.
The presented data collection campaign represents a valuable example of the potential of this methodological approach. Indeed, the research work was aimed at investigating the possibility to analyse digitally widespread data sources as a valuable support of the activity of decision-makers by unveiling hidden patterns and specific target-users' needs. The diversity of the data collected and used for this study helps to build a narrative around the diversity of influences on cycling behaviour for women. However, the results of the analysis could be potentially biased by the impact of the lockdown period due to the COVID-19 pandemic. The timing of the data collection was also timely as Paris has seen a large increase in cycling during the COVID-19 pandemic. Therefore, the survey is likely to have captured a diversity of new and established users of VELIB's service.

Conclusions and Future Work
The objective of the paper was to identify an appropriate sample of docking stations to be further investigated through travel demand data, onsite observations, survey questionnaires and social-media data collection, focusing on the women users' needs and expectations as users of bike-sharing services. Results showed that women experience and use this transport mode differently than men, since they are more concerned with accessibility, safety and security, social constraints, weather and topography issues.
Future work will focus on the application of data analytics techniques based on Analytic Hierarchy Process (AHP) and Machine Learning techniques (Factor Analysis and Bayesian Networks). This is aimed at defining a hierarchical model for the design of parameters influencing the inclusion of women, by unveiling hidden mobility patterns through a gender-based intersectional analysis. Within the objectives of the H2020 DIAMOND project, the collected disaggregated data will be used to support the definition of guidelines and policies for the inclusion of women's needs in the design of future bike-sharing transport services.
Author Contributions: Conceptualisation, All Authors; methodology, All Authors; software, All Authors; validation, All Authors; formal analysis, All Authors; investigation, All Authors; resources, All Authors; data curation, All Authors; writing-original draft preparation, A.G.; writing-review and editing, A.G.; visualisation, All Authors. All authors have read and agreed to the published version of the manuscript.
Funding: This research was funded by the EU H2020 program within the DIAMOND project (grant No. 824326).

Institutional Review Board Statement:
The study was conducted according to the guidelines of the Declaration of Helsinki and the General Data Protection Regulation (EU, 2016/679) and approved by the Ethics Committee of the H2020 DIAMOND project.

Informed Consent Statement: Informed consent was obtained from all subjects involved in the study.
Data Availability Statement: All information gathered were treated as anonymous and confidential and can not be reproduced outside the H2020 DIAMOND project and associated scientific reports and publications.

Acknowledgments:
The authors thank VELIB (Syndicat Mixte Autolib et Velib Métropole), partner of the H2020 DIAMOND project, for sharing proprietary data about the bike-sharing service and for their fruitful contribution to the research. The authors thank also Lucia Recio Naranjo and Francisco Santarremigia for managing and supervise the research activities of the H2020 DIAMOND project.

Conflicts of Interest:
The authors declare no conflict of interest. The funder had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix F. Results of Social Media Data Analysis from Twitter
Francesco Fabbri is a third year PhD student at Pompeu Fabra University, Barcelona, in the Web Science and Social Computing Group (WSSC). He is supported by the Vicente López scholarship, awarded by Eurecat, Research Center in Catalunya. His work focuses on analyzing and mitigating unintended and potentially harmful effects embedded in online social platforms (e.g., social networks, music streaming platforms or rental marketplaces). He obtained a master's degree in Data Science and a bachelor's degree in Statistics from Sapienza University, Rome.
Dr. David Laniado is Senior research scientist in Computational Social Science, in the Data Science and Big Data Analytics unit at Eurecat. His main research interest is the study of patterns of online conversation and discussion, controversies, collective dynamics and deliberation processes in peer production communities, gender differences and sociotechnical aspects of online interactions.
Dr. Pablo Aragón is a research scientist and adjunct professor at Universitat Pompeu Fabra. He is interested in computational social science and social computing through interdisciplinary and participatory approaches to characterize and facilitate participation, deliberation, and collaboration in online platforms, primarily civic technologies and social media. These interests have led him to participate in EU-funded research projects like DECODE, focused on free open-source infrastructures to increase data sovereignty of European citizens; or D-CENT, committed to transforming democratic decision-making through free open source digital technologies.