Quantitative Analysis of Farmers Perception of the Constraints to Sunﬂower Production: A Transverse Study Approach Using Hierarchical Logistic Model (HLM)

: Sunﬂower production is an excellent choice for both local and national agribusiness with signiﬁcant potential in smallholder farming systems due to low input costs, consistent yields, and a short planting window. However, farmers are faced with interrelated constraints in their sunﬂower production. This study was undertaken to examines the perceptions of the constraints to sunﬂower production as perceived by smallholder farmers in North West Province, South Africa. The research identiﬁes some factors related to sunﬂower production constraints. Then, a stratiﬁed random sampling technique was used to select 172 sunﬂower farmers and they were asked to rate their response on these attributes on a 5-points Likert scale. Leveraging principal component analysis (PCA), we agglomerated and condense information from the original datasets of the constraint attributes into three main components (innovation, farm ﬁnance, and crop management practice). Because of the hierarchical structure of the dataset with farmers nested within their local municipalities, we use Hierarchical Logistic Modeling techniques to identify the factors that determine farmer’s perceived interest in innovation, farm ﬁnance, and crop management practices. Innovation and farm ﬁnance emerge to be critical elements for sustainable sunﬂower production. The ﬁndings indicate that age, education, household size, farm size, cooperative membership, and gender are strongly correlated with farmers’ perceived interest in innovation and farm ﬁnance characteristics. This study recom-mends that to promote these elements among smallholder farmers, it is necessary for governments at the local and national level to invest in extension service and education, cooperative organizations, research, and development in disentangling the age, gender, and farm size inequalities existing in the district sunﬂower production. Contributions: Conceptualization, E.A.A., O.S.O. S.S.T.; methodology, E.A.A. and O.S.O.; software, E.A.A.; validation, E.A.A.; formal analysis, E.A.A.; investigation, E.A.A.; resources, E.A.A.; data curation, E.A.A.; writing—original draft preparation, E.A.A.; writing—review and editing, E.A.A., O.S.O.


Introduction
Agricultural sustainability faces hosts of constraints, and if food demands of the growing population are to be met, agriculture will need to undergo significant transformation. Thus, smallholder farmers would play a critical role in the drastic process of transforming the agricultural sector, because some households pose the capacity to participate in highvalue agricultural markets and use advanced production inputs. However, the majority are unable to do so due to varying constraints. On the one hand, agricultural operation is daunting for households with profit potential in the presence of climate change, limited credit alternatives, production and market shocks, poor asset holdings, and access to markets barriers. On the other hand, smallholder farmers can effectively adjust their livelihood strategies to these constraints, but it will demand supportive regulatory climates [1].
Smallholder farmers and agripreneurs represent a diverse image, but the innate and social-political terrain in which they are found in South Africa is somewhat common, at least to a certain degree, owing to the country's racially and economically polarized history and current challenges [2]. This situation is reflected in the several types of research and intervention that aimed to better understand the challenges and opportunities that household farmers face in terms of productivity and access to mainstream agricultural markets [3][4][5][6]. There is no doubting that research on smallholder farmers' participation in a number of agricultural commodities has been undertaken, thus cannot be unrelated to the fundamental role of sustainable production and market access for households with knitted income to crop and animal production.
In South Africa, several strategies, and interventions to boost productivity in the smallholder farming systems, especially in the sunflower sector have all registered limited impacts. For instance, the Agricultural Policy Action Plan (APAP) and South African Industrial Policy Action Plan (IPAP), both designed to inspire growth in the agricultural sector lacks a precise and effective approach towards the edible oil and oilseed industry, as APAP failed to treat oilseed crops as strategic commodities, while IPAP on the other end was limited to other labor-intensive sectors such as fruit, milling, and poultry [7]. Resulting in the unbreached gap between demand and supply of sunflower oil, oilcake, and the continual decline of the total area of production. Sunflower production accounts for nearly 60 percent of the local oilseed crop produce, making it a favored oilseed crop choice. However, national production is yet to meet the growing demand, despite being earmarked as a beneficial cash crop with great potential in local and national agribusiness operations, owing to its consistent yield and a short growing circle [8][9][10]. Sunflower is a cost-effective and desirable agricultural crop with outstanding characteristics, such as the augmentation of valuable market products, revenue generation, and poverty reduction. Unfortunately, the dearth of viable seeds available to farming households, as well as extreme weather events, have hampered its maximum potential across the food value chains [11]. The crop can be integrated into local cropping systems to improve soil health and boost biodiversity in crop rotation practices. The plant also has excellent adaption responses for growing in harsh environments, unlike other grain crops such as wheat, soybean, and maize.
In 1999, the South Africa sunflower industry experienced a production boom of over a million tons on 828,000 hectares of land with an average output of 1.4/ha similar to the three leading sunflower producing countries i.e., Ukraine, Argentina, and Russia [8,9]. Unfortunately, since the production boom, the total area of land dedicated for domestic sunflower production has followed a downward trend, leading to an average yield decrease per hectare with drastic reductions in the North West Province in contrast to the other provinces [9]. The impacts of this decline in production are expressed in the limited access to high-value markets, lack of market competition, low productivity, increased seed, oil, and oilcake import as well as farmers' lackadaisical attitudes towards the sunflower crop [7,8,12]. [8] links this decline to the lack of economic inclusion of black farmers (commonly referred to as smallholder farmers) in the sector, as well as in the seed trade business as some of the widespread impediments to farmers entering the sunflower industry, exacerbated by inefficient credit markets and budgetary limitation to purchase the necessary equipment, among other barriers. The constraints to South Africa sunflower production vary significantly but can be traced to the unpredictable rainfall patterns, lack of crop awareness, unreliable markets, insect-pest infestation on the crop, lack of farm machinery, disease attack, high input cost, harsh weather condition, stiff competition, price fluctuation, birds depredation, scarcity of improved seed varieties, as well as poor extension services [11]. Going further, [13] highlighted that one of the most significant challenges facing smallholder sunflower farmers' in South Africa is the absence of direct links between local producers and consumers.
These constraints pose extreme barriers to entrant smallholder farmers and undermine the production and market capacity of existing households, despite the wide recognition of the sector with regard to job and wealth creation. Given the abundance of arable land, rising domestic demand for healthy vegetable oil and oilcake, we are confident that there are significant economies of scope, among other opportunities for smallholder farmers that are involved in the sunflower production. Therefore, it is imperative to understand some of these barriers in order to foster a sustainable crop production system. Understanding the perceptions of smallholder farmers, particularly those in the sunflower sector, in terms of constraints faced is crucial to government, agricultural scholars, and private sector programs.
While acknowledging the inevitability of constraints in the smallholder farming sector, it is an essential facet of this article to present a robust approach to examine farmers' perceived interest in sunflower production constraints using empirical data obtained from North West Province, South Africa. We contribute to the growing body of literature on smallholder farming by evaluating the constraints to sunflower production, as perceived by smallholder producers on a variety of attributes. Using econometric techniques, we identify the socio-demographic factors of sunflower-producing households that explain their perceptions. Farmers were asked to rank their perceptions on a range of attributes using Likert scales from 0 to 5. On this scale, a score of 5 indicates that they strongly agree with the constraint attributes, while a score of 1 implies strongly disagrees with the set of statements.
Considering the large numbers of constraint statements gleaned, we invoked a dimension reducing technique to condense the set of attributes into smaller dataset alias principal components while preserving most of the significant information. To further comprehend what explicates variance in constraints to sunflower production, we construct and evaluate a Hierarchical Logistic Model in which variation in the score is dependent on both constraint statements and farmer demographic variables. Overall, the core objective of the study was to investigate the socio-economic attributes of smallholder sunflower farmers that explain their perceived interest in a set of sunflower production constraints in Ngaka Modiri Molema District, North West Province, South Africa.
The rest of the article is structured as follows. Section 2 presents the materials and methods, Section 3 presents summary statistics, while Sections 4 and 5 present the result, conclusion, and highlights the policy implications.

Study Area
The research was carried out in one of the five districts (Ngaka Modiri Molema) of North West Province, South Africa. According to estimates, North West Province is the second largest sunflower producer in the country, accounting for 32 percent of total domestic production [14]. The importance of smallholder farming and the high share of sunflower production in the province were the two factors that influenced the selection of the Ngaka Modiri Molema District Municipality (NMMDM). The district is made up of five Local Municipalities (LMs): Mahikeng (District capital), Ditsobotla, Ramotshere Moiloa, Ratlou, and Tswaing, with a total area of 28,206 km 2 . NMMDM has a population of 889,108 people, sharing a border with the Republic of Botswana to the north, Limpopo Province to the north-east, and Northern Cape Province to the south-west. Agriculture contributed 4.9 percent of the district's GDP in 2017, and the industry experienced its maximum annual growth rate of 25.5 percent in 2017. Maize, wheat, sunflower, vegetables, and fruits are the most popular crops grown in the district [15]. Sunflower production is a significant economic contributor in the district, but it is hampered by a number of constraints faced by household producers.

Data Collection
Data from respondents were gathered with the aid of semi-structured questionnaires. The questionnaire collects information on respondents' socioeconomic characteristics as well as their awareness and perceptions of the production constraints being faced. Two trained enumerators who are fluent in the native dialect (Setswana) aided data collection in periods from March to October 2020. This method minimized the difficulties associated with survey fatigue while also increasing the data's reliability. Furthermore, questionnaire was pre-tested to ensure that no questions and items were vague or culturally sensitive. The constraint attributes were measured on a 5-point Likert item, with scores of 5, 4, 3, 2, and 1 for strongly agree, agree, neutral, disagree, and strongly disagree, respectively.
Scaling, as [16] stated, is a method of assigning a numerical value to items based on some principle, thus by specifying scale items, every phenomenon may as well be "measured." Likert scales, Guttman scales, and Thurstone scales are only a few of the notable scaling techniques that have emerged as a result of the advancement of approaches to quantify perception. Likert items are also regarded to as aggregated (or summated) rating scales because they are part of a broader set of measurements, on the conjecture that some fundamental phenomenon can be quantified by pooling a person's rating of his or her opinions, perceptions, or behaviors linked to a set of individual statements [16]. Thus, it was deem fit for our study. Respondents were selected using a stratified random sampling technique, and only households engaged in sunflower production were involved in the survey. The sampling frame for sunflower producers was obtained from extension officers and local community chiefs. The survey featured 172 households from a list of 212 sunflower farmers, comprising 103 farmers from Mahikeng, 51 farmers from Ditsobotla, 10, 8, and 5 farmers from Ratlou, Tswaing, and Ramotshere Moiloa, respectively. The sample size was calculated using the formula developed by Krejcie and Morgan (1970) to obtain a district representative sample of 172 farmers.

Principal Component Analysis (PCA)
We highlighted 17 attributes, and farmers were asked to rank their responses on a 5-point Likert-type item. Analytically, it is cumbersome to effectively analyze all 17 attributes since some are interrelated, and we are interested in the most dominating and correlated ones. One strategy to get around this impediment is to randomly group constraints into a smaller number of cases. As a result, we hinge on and use data reduction techniques like principal component analysis (PCA) to condense the constraint statements into a few dominant numbers without compromising the broader image presented by these constraints. PCA is a well-known multivariate method for converting multiple correlated variables into linearly uncorrelated principal components [17]. The prime objective of PCA is to minimize the dimensionality of dataset with a large set of interrelated variables while preserving as much variance as possible [18]. This technique allows structure to be extracted from large dataset by extracting eigenvectors from the input distribution that are associated with the largest eigenvalues λ.
As [19] emphasize PCA attempts to reduce the dimensionality of a dataset with a large number of interrelated variables while also tackling the issues of multicollinearity. The reduction is accomplished by converting the original variables into a new collection of uncorrelated variables called principal components (PC), which are structured such that the first few preserve the majority of the variance contained in all of the original dataset [20]. The computational approach for PCA is referred to as eigen analysis, in which the sums of squares and cross products are used to find the eigenvalues λ and eigenvectors of a square symmetric matrix [21]. The number of relevant PC, based on the Kaiser index, is equivalent to the number of eigenvalues λ of the correlation matrix with values greater than one. The eigenvalue is the aggregate of a factor's squared loadings, and it shows the magnitude of variance that a factor accounts for.
The first principal component is usually matched with the direction of the eigenvector with the highest eigenvalues. The eigenvector corresponding to the second-highest eigenvalue follows a similar pattern, and so on. It's worth noting that the first component derived in a principal component analysis account for the highest amount of overall variance in the observed variables, implying that the first component would be associated with at least some or all the observed variables under normal circumstances. The extracted second component will have two key features: first, it will account for the highest number of variances in datasets that the first component did not account for, implying that it will be linked to certain variables that had low correlations with the first component. The percent-Sustainability 2021, 13, 13331 5 of 18 age of variation decreases with each new component such that the first few components are retained. Following [17], principal component analysis can be generically expressed as follows: Assuming that the new orthonormal basis is U, that each column of U is a onedimensional unit vector, and we are interested in retaining the system's K coordinates. Does not lose generalizability supposing it's the first K component. The new base U = [U K,ÛK ] is an orthonormal basis. Where, U K is a sub-matrix generated by the first K columns of U, the data matrix is written; where Z = U T K X, and Y =Û T K X PCA aims to find the orthonormal matrix U that retains most of the information,Û T K omitted and replaced with a matrix, irrespective of the data point. Y is approximated by a matrix with the same for all columns. Let's call this column b and is bias, then we can approximate Y ≈ b1 T with 1 T ∈ R 1×N is a row which contains all elements that are equal to one (1). Let assume we want to find U, meaning we have to find b to satisfy Solving the derivative equation using the objective function's b, which equals 0: where 1 T 1 = N and x = 1NX1 is the mean vector o f all column in X Note: the original data approximated by PCA is a method of finding an orthonormal matrix U with the best approximation assuming that the means vector x = 0. Subsequently, X X ≈ X = U K Z. PCA's optimal problem would become PCA uses the following algorithm: a. Determine the mean vector for all data Subtract the means vector from each of the data points: To generate the U K matrix with columns forming an orthogonal system, select K eigenvectors matching the K highest eigenvalues. The main components, or K vectors, form a subspace that is identical to the orthonormal data matrix. f.
Make an orthonormal data matrix projection to the subspace that is found. g.
The dimensions of the data points on the new space make up the new data.
The new data can be used to estimate the actual data in the following way: The first principal components of this conversion process account for most of the dataset's information, and the application was motivated by the desire to minimize the number of variables in the Likert items to a smaller set of different composite factors. The PCA scores obtained are continuous and contain a continuum of ratings. In context, respondents were asked to rank constraint attributes on discrete scales. As a result, it may be highly unlikely that two ratings vary by small delta (δ) effectively convey varying perceptions. For this reason and estimation brevity, we dichotomize the continuous score gleaned from the PCA into variables in the components that are of more and less interest to the farmers. The variables constructed are analyzed using Hierarchical Model to account for both district and socio-economic characteristics in the explanation of respondent's perceived interest. The use of the Hierarchical Logistic Model as an estimation mechanism was prompted by the structure of data with farmers nested within their district municipalities and the likelihood of heterogeneity at the household level.

Hierarchical Logistic Model (HLM)
The Multilevel Approach used to examine the PCA-derived constraints indices is described in this section. Following [22], we choose the multilevel logistics system since the response variable is binary. The response variable includes the aggregate score Yij linked to innovation, finance, and crop management attributes by farmer i of district j. The variable Yij = log(pij/1 − pij) where pij denotes the likelihood that the farmer i ranks positively her district j in line with the composite indicator obtained from the PCA. The perceived interest in innovation, farm finance, and crop management practice depends on the characteristics X of the farmer but also on the characteristics W in the district municipality a farmer belongs. Estimating a logistic or classical linear model given the data structure of the association between Y and X and W poses an econometrics barrier.
In this case, some of the underlying assumptions of OLS are violated. Firstly, the household's assessment of the interest Y or equivalently the error terms in the regression model are likely to be inter-dependent and correlated. Secondly, the equivalence of the variances of errors for all observations in a classical regression model is also invalidated because farmers from different districts municipalities have differing perceptions of the interest of variables and dissimilar assessments. Moreover, there is an increase in model misspecification problems and Type I error [23,24]. To circumvent these estimate barriers and account for heterogeneity at the household level, we leverage the data's hierarchical structure to construct a Hierarchical Model, precisely, the HLM. We begin by estimating a blank model to see if farmers from different district municipalities assess their district differently on average. The model can be express as follows: Full model : The intercept Y 00 in this model is a fixed effect that summarizes the mean rating score by farmers, while V 0j is the random effect from the district level. The variation percent of the outcome variable attributable to the district level attributes is p = σ 2 (V0j) σ 2 (Uij)+σ 2 (V0j) and the percentage of the variation in the score attributable to farmers characteristics is 1 − p. if p is not too low, then it is necessary to add first-level variable in the model, and If 1 − p is not very low it is necessary to incorporate first-level variables. Adding district-level variable is valuable to compensate for the variation. As a result, we estimate the following Random Intercept Model using level 2 predictors: W denotes the district level variable. Y 00 and Y 01 are the fixed effects, and ν 0j is the random effect. Then, we add farmers' attributes X to account for their effect in the variation of evaluation. The matching model is a Random Intercept Model with both levels 1 and 2 regressors. It can be express as follow: Household level 1 model : Farmers' level variables, which include socioeconomic data, are denoted by the X. The fixed effects are Y 00 , Y 01 , and Y 10 , whereas the random effects are ν 0j and ν 1j . Full Maximum Likelihood can be used to evaluate the models (FML). The random effects are summarized via their covariance and variances, whereas the fixed effects are directly estimated. It is possible to compute a log-likelihood test of the HLM vs. Linear Regression or Logistic Regression. Akaike's Information Criteria (AIC) as well as Bayesian Information Criteria (BIC), can be used to evaluate the overall model performance.

Summary Statistics
Descriptive Statistics Table 1 shows descriptive statistics of the respondent's socio-economic and farm characteristics. We found that about 79.1% of the respondents were male with an average age of 52.55 years, suggesting the existence of gender and age disparities in sunflower production in the study area. This report is similar to [12] findings that women and young people are underrepresented in the sunflower industry in North West Province, South Africa. In many Sub-Saharan African countries, gender and age inequalities cannot be separated from social and cultural imbalances, exclusion from credit markets, unequal resource distribution, and privileges, which are commonly reported by literature as obstacles in integrating into the agricultural sector. Table 1 further presents other household characteristics such as marital status, education, household size, transport and livestock ownership, market distance, access to credit, grant, extension services, market outlet, land size, hectare dedicated for sunflower production, information access, off-farm income, and cooperative membership. Almost 90% of the survey participants have some level of education, and most of them are full-time farmers. Education attainment is a crucial indicator of human capital, as it stimulates the development of skills that would equip a household to earn off-farm income through formal or informal employment, while still having time to engage in farming operations [25,26]. Notwithstanding, when agriculture is not the primary source of income for educated farmers, the variable may yield divergent results. While off-farm earnings is a measure of a household's reliance on off-farm jobs in its community and surrounding areas, which may impact household labor supply and income earnings [27]. About 77.9% of the households in the polled sample own transportation facilities. Accordingly, transport infrastructure is a measure of a household wealth index and is indispensable for attenuating production and market shocks [25,27,28]. Large landholdings could serve as equity during times of financial distress, while livestock owned, on the other hand, can be marketed to meet immediate working capital requirements of households.
At a glimpse, the location characteristics, as described by the distance to market outlets, do not indicate remoteness. The majority of farms (53.5%) in the study area are less than 30 km away from the commonly used market outlet (North West Kooperative [NWK]).
Approximately, 28% are members of agricultural cooperatives. Farmer's cooperatives are social platforms that promote collaborative engagements, thus play an important role in farming operations. About 58% of the sample households receive grants as input from the government. While grants may facilitate agricultural commercialization by easing farmers' liquidity and credit pressures, a longer duration of earning social grants may lead to detrimental implications, such as dependency attitude and expectations, which weakens self-reliance and deter farm operations [28,29]. Table 2 presents the frequency distribution of how respondents rated the Likert constraints statement and Figure 1 for graphical representation of responses to constraints statements. One glaring point to take note of is that most respondents rate their responses as "Agree" to most of the constraints statements.

Aggregation of Factors and Determination of the Dimension of Farmers Perceived Interest in Constraints to Sunflower Production
As stated earlier, Principal Component Analysis (PCA) was used to assess the constraints facing sunflower-producing households. It is essential to note that the score vectors provide the composition of the principal components for the samples (objects), while the loading vectors offer similar compositions for the variables when using PCA. Before PCA, the reliability of the Likert response option (statements) concerning constraint variables was tested using coefficient alpha (∝). The coefficient alpha for the 17 scale attributes employed to capture respondent's response on constraints is 0.87, which surpassed the threshold level of 0.7, indicating a high degree of reliability. Furthermore, before performing the PCA, we also use the Kaiser-Meyer-Olkin (KMO) sampling adequacy criterion and the Bartlett test of sphericity to examine whether our dataset of 17 variables and 172 surveyed respondents contains sufficient correlation for a justifiable PCA. The cumulative KMO index for our dataset is 0.84, with KMO values higher than the threshold of 0.70 for all individual variables except for unequal access to grants and farmland fencing, while Bartlett's sphericity test was highly significant as well (p-value 0.001). This indicates that our dataset meets the criteria for a justifiable PCA. Table 3 presents Kaiser-Meyer-Olkin measure of sampling adequacy The eigenvalues, that represent the actual influence of each factor on the total inertia of the data, are a major output of the PCA. Table 4 shows the eigenvalues and their inputs to the PCA-derived explained variation, while Figure 2 displays the associated scree plots. In effect, the eigenvalues scree plot is inspected to see if there is a "fracture" in the chart, with the residual components representing fewer variations. Following Kaiser's maxim, which suggests that only retain components with eigenvalues λ which exceed unity in the remainder of the estimation. Subsequently, in our estimation, four components with eigenvalues greater than unity were eventually retained. These four components in the PCA explained 55% of the variation in the original dataset, while the rest components explained relatively less variation.   Table 5 displays the factor loadings, which show the relationship between the components and the actual variables, and the factor loadings are also shown graphically in Figures 3 and 4 respectively. Perhaps more specifically, it should be noted that Figures 3  and 4 are two-dimensional graphs that represent only components 1 and 2, as much insight cannot be gained from a three-dimensional graph in our context. For the sake of simplicity, factor loadings greater than 0.3 are outlined and deemed reasonably correlated to the relevant components. Principal components are represented by variables with a high correlation (dominant factor loading). For each observation in the data, the scoring coefficients are exploited to predict the values of the four principal components. However, we only offer interpretation for three components, since some of the dominant variables in component four have already emerged in components two and three.  Considering the number of variables in our datasets, these proportions of the variance explained are widely regarded as permissible. While there is no straightforward criterion for an acceptable explained level of variance, it is plausible and applicable to take into account a solution that explains at least 50% of the overall variance in social sciences where data are not exact and therefore sensitive to random errors [28,30]. We also implemented factor rotation such as orthogonal varimax rotation to the factor loadings index to allow factor loading on the retained components whilst maintaining their autonomy. The objective is to aid interpretation and to optimize the variance of the squared loadings across variables when all factors are added together [31]. Table 5 displays the factor loadings, which show the relationship between the components and the actual variables, and the factor loadings are also shown graphically in Figures 3 and 4 respectively. Perhaps more specifically, it should be noted that Figures 3 and 4 are two-dimensional graphs that represent only components 1 and 2, as much insight cannot be gained from a three-dimensional graph in our context. For the sake of simplicity, factor loadings greater than 0.3 are outlined and deemed reasonably correlated to the relevant components. Principal components are represented by variables with a high correlation (dominant factor loading). For each observation in the data, the scoring coefficients are exploited to predict the values of the four principal components. However, we only offer interpretation for three components, since some of the dominant variables in component four have already emerged in components two and three.     Lack of production facilities (Q9) is the only variable in principal component one (PC1) which explains most of the variance in the constraint statements. This variable has a scoring coefficient of 0.31 exceeding the threshold which represents smallholder sunflower farmers who exhibit high levels of interest on quality production inputs. The variable is better categorized as a component capturing factor relating to innovation. Poor access to agricultural innovation (high-quality inputs) required for production and commercialization, such as farm machinery, automobiles, chemicals, seeds, and fertilizers are major constraints facing households, not only in the sunflower industry but across the smallholding sector. The second principal component (PC2) loads highly on variables Q13, Q14, Q16, and Q17 with scoring coefficients of 0.33, 0.45, 0.34, and 0.46, respectively, as shown in Figure 3. The constraints statement in component 2 represents natural disasters such as drought, high crop theft, unequal access to grants, and lack of fencing. This result signifies their dominance in the constraint statements. The second component is hard to label, but based on the variables with the highest correlation with this component; we can classify it as a factor capturing largely variables related to farm finance. Component 3 Lack of production facilities (Q9) is the only variable in principal component one (PC 1 ) which explains most of the variance in the constraint statements. This variable has a scoring coefficient of 0.31 exceeding the threshold which represents smallholder sunflower farmers who exhibit high levels of interest on quality production inputs. The variable is better categorized as a component capturing factor relating to innovation. Poor access to agricultural innovation (high-quality inputs) required for production and commercialization, such as farm machinery, automobiles, chemicals, seeds, and fertilizers are major constraints facing households, not only in the sunflower industry but across the smallholding sector. The second principal component (PC 2 ) loads highly on variables Q13, Q14, Q16, and Q17 with scoring coefficients of 0.33, 0.45, 0.34, and 0.46, respectively, as shown in Figure 3. The constraints statement in component 2 represents natural disasters such as drought, high crop theft, unequal access to grants, and lack of fencing. This result signifies their dominance in the constraint statements. The second component is hard to label, but based on the variables with the highest correlation with this component; we can classify it as a factor capturing largely variables related to farm finance. Component 3 loads highly on constraint variables such as poor road infrastructure (Q1), post-harvest loss (Q12), and Sclerotinia sclerotiorum Diseases (Q15), exercising dominance over the remaining original variables. This third component is defined as indicators that capture most of the respondents' crop management practices. Knowledge and application of optimum crop management strategies are critical in decreasing the effect of post-harvest loss and the prevalence of Sclerotinia sclerotiorum Diseases of sunflower crop.
The kernel density estimates of the scores are shown in Figure 5A-C, and the distribution of the new dimensions derived from PCA appears to be anchored on 0 with significant variations. As previously stated, the new scores are continuous, contrasting with the discrete form of the original datasets. A threshold point of 0 is a worthy contender to categorize the continuous score into variables that are of less interest to farmers i.e., low loadings in components (positive or negative) close to 0 and variables of higher interest to farmers i.e., with higher loadings (positive or negative) greater than 0 in components. We can estimate the binary values to perform a disaggregated analysis of the socioeconomic determinants of farmer's perceived interest using a Hierarchical Logistic Model. We categorize sets of attributes relating to innovation, finance, and management practices from several features based on farmers' perceived interest in sunflower production constraints. egorize the continuous score into variables that are of less interest to farmers i.e., low loadings in components (positive or negative) close to 0 and variables of higher interest to farmers i.e., with higher loadings (positive or negative) greater than 0 in components. We can estimate the binary values to perform a disaggregated analysis of the socioeconomic determinants of farmer's perceived interest using a Hierarchical Logistic Model. We categorize sets of attributes relating to innovation, finance, and management practices from several features based on farmers' perceived interest in sunflower production constraints.

Determinants of Smallholder Sunflower Farmers Perceived Interest in Innovation, Finance, and Crop Management Practice
We build and estimate a Hierarchical Logistic Model in which variance in the score is dependent on both district municipalities and household characteristics to examine the factors that underlie variation in farmers' perceived interest in the outcome variables. Proceeding with a blank model that is analogous to Equations (1)-(3), highlighted in Section 2.3.2. The result in Table 6, column a, d, and g indicate no significant difference from zero variation from district level variable regarding households perceptions to innovation, finance, and crop management practice dimensions. Cells (b), (e), and (h) feature district-

Determinants of Smallholder Sunflower Farmers Perceived Interest in Innovation, Finance, and Crop Management Practice
We build and estimate a Hierarchical Logistic Model in which variance in the score is dependent on both district municipalities and household characteristics to examine the factors that underlie variation in farmers' perceived interest in the outcome variables.
Proceeding with a blank model that is analogous to Equations (1)-(3), highlighted in Section 2.3.2. The result in Table 6, column a, d, and g indicate no significant difference from zero variation from district level variable regarding households perceptions to innovation, finance, and crop management practice dimensions. Cells (b), (e), and (h) feature districtlevel variable in the model, whereas columns (c), (f), and (i) assess the entire model, including district and household levels regressors. It can be notice that when more controls variables are included, the variance of the residual decreases. The AIC and BIC indicate that the full model is more suitable for the data. As a result, we will focus the interpretation on these models. The outcome variables are dummy variables with 1 corresponding to farmers having higher interest (positive score). Standard errors in parentheses, * p < 0.1, ** p < 0.05, *** p < 0.01.
The result of the determinants of innovation, finance, and crop management practice of households indicates that there are a number of variables that have significant explanatory power of farmer's perceived interest. We find that the district level variable has no statistically significant relationship in terms of innovation and farm finance characteristics. On the one hand, the age of household head, household size, and farm size are negatively associated with farmers perceived interest regarding innovation characteristics. The result indicates that since most of the respondents are middle-aged and some clustered within communal land tenure systems, acquiring innovative practices and obtaining production inputs from adjacent farm families will not be a barrier, triggering the negative relationship between farmers' perceived interest in innovation and age variable. This also implies age disparities in the district sunflower production, with farmers' interest declining as they get older in terms of their innovation constraints attributes. The result concurs with the findings of [32] studying the adoption of improved maize variety among farming households which revealed an inverse relationship between age and technological innovation. Large farming households pose the capacity to relax labor constraints involved during production, which might have resulted in the negative association with farmer's perceived interest regarding innovation. Generally, farmers with large farm sizes tend to have better access to technological progress, therefore, justify the lower rating and the negative relationship. This is consistent with [33] study, which reported a negative relationship between farmers with large landholdings and innovation. On the other hand, we find that the age of household heads is the only variable that is statistically significant and positively associated with farmers' perceived interest in the dimension of farm finance characteristics.
In terms of cooperative membership, the rating is higher and positively associated with farmers perceived interest relating to innovation characteristics but with no significance for the dimension of crop management practice. The role of cooperative membership and dispersion of innovations in agriculture has been studied by several researchers, and it is frequently cited by experts when addressing access, speed, and uptake of innovation. It has been argued, inter alia, that the presence of information and knowledge about innovations among cooperative members increase credence about innovative practices, leading to efficient access, application, and implementation. This validates the positive relationship for farmers belonging to agricultural cooperatives.
We also find that older male sunflower farmers with education, belonging to cooperatives tend to have higher interests (positively associated) in farm finance characteristics. This is not surprising, given that male farmers dominate the sunflower production in the district municipality and are more likely to be keen on attributes regarding farm finance. Since education is a predictor of literacy, it is tenable that the positive association is due to the farmers' awareness and grasp of financing sources. Surprisingly, none of the household level variables, besides the farming system were found to be statistically significant in determining smallholder sunflower farmers' perceived interest in crop management practices. However, the municipality level variable appears to be statistically significant and negatively associated with crop management characteristics.

Conclusions and Policy Recommendations
Sunflower production is an excellent choice for both local and national agribusiness with significant potential in smallholder farming systems due to low input costs, consistent yields, and a short planting window. However, farmers are faced with interrelated constraints to sustainable and profitable sunflower production. The study examined the socio-economic attributes of smallholder farmers that explain their perceived interest in a set of sunflower production constraints using data from Ngaka Modiri Molema District Municipality, North West Province, South Africa. The research used extant works of literature to design sets of constraints attributes relevant to sunflower production and respondents were asked to rank their responses on these statements.
The study uses principal component analysis (PCA) to agglomerate and condenses information from a large dataset of constraint statements into three main components. The three retain components explained most of the variance in the original dataset. Leveraging the hierarchical structure of the data with farmers nested within their respective local municipalities, we invoke the hierarchical logistic model (HLM) technique to identify the factors that explicate farmer's perceived interest in innovation, finance, and crop management practices. The findings indicate that farmer's perceived interest in innovation, finance, and crop management practices are strongly correlated with some of the socioeconomic attributes such as age, education, household size, farm size, farming system, cooperative membership, and gender.
Innovation and farm finance characteristics emerge to be critical elements for sustainable sunflower production. The findings of this study recommend that to promote these elements among smallholder farmers, it is necessary for governments at the district, provincial, and national levels to invest in extension services and education, cooperative organizations, research, and development in disentangling the age, gender, and farm size inequalities existing in the district sunflower production. This will help to boost youth and women participation in sunflower farming operations and stimulate equitable access to land, thus increase farmers' resilience. Going forward, the result of this study cannot be broadly generalizable as the sample was obtained from a single district due to financial and budgetary limitations, further research incorporating the other district municipalities is needed to comprehend farmers' perceptions of sunflower production constraints within the province.