1. Introduction
Ensuring stable grain production and improving agricultural efficiency remain central challenges for food security in many developing countries characterized by smallholder-dominated farming systems. Arable land constitutes the fundamental material basis of food production; however, long-standing land fragmentation, manifested in small, scattered, and irregular plots, has substantially constrained agricultural modernization and productivity growth [
1]. In China, the coexistence of a large population and limited arable land, combined with egalitarian land allocation practices adopted during the early stages of rural reform, has resulted in a highly fragmented land structure that continues to impede efficient grain production. Reducing land fragmentation and promoting moderate-scale farming have therefore become key policy objectives. The chinese government emphasizes not only the expansion of total operational farm size through land transfer, but also the consolidation of fragmented plots at the micro level. This shift reflects growing recognition that plot size, rather than farm size alone, constitutes the basic operational unit for agricultural production. Plot size directly affects the feasibility and efficiency of mechanized operations, the adoption of modern agricultural technologies, and the control of production costs [
2]. Consequently, understanding how average plot size influences grain production efficiency is critical for improving land-use efficiency and strengthening food security.
The inverse relationship between farm size and grain production efficiency is a classic starting point in this field. Since Sen’s (1966) pioneering study [
3], the finding that farm size and land productivity are negatively correlated in developing countries has been widely discussed [
4]. The classic explanation for this relationship primarily focuses on market failures: small farms tend to rely more on family labor, and the costs of supervising this labor are lower, thereby giving them a comparative advantage in labor-intensive production [
5]. However, subsequent research has introduced significant revisions to this conclusion. Helfand and Taylor (2021) [
6] point out that the choice of productivity indicators is crucial; if total factor productivity (TFP) is used instead of land productivity alone, the relationship between farm size and productivity evolves with the process of modernization, exhibiting a “U”-shaped or even positive relationship. Ferreira and Féres (2020) [
7] similarly found a U-shaped relationship between farm size and land use efficiency in the Brazilian Amazon region. These studies indicate that the relationship between farm size and production efficiency is not static but depends on the stage of development, technological conditions, and the choice of productivity indicators.
However, farm size alone is insufficient to fully capture the complex reality of farmers’ land allocation. Unlike farm size as an aggregate indicator, land fragmentation characterizes land allocation from multiple dimensions, including the number of plots, average plot size, spatial distribution of plots, and the distance between plots and residences, thereby providing a more precise understanding of micro-level land configuration. In recent years, this multidimensional perspective has attracted increasing attention in the international academic literature [
8,
9]. Regarding the impact of land fragmentation on agricultural production efficiency, two contrasting conclusions have emerged in the literature. A large body of research confirms that land fragmentation exerts a negative effect on agricultural production efficiency through multiple channels, including constraining the effective use of agricultural machinery, increasing fuel consumption and labor input, hindering the adoption of modern agricultural technologies, and raising overall production costs [
10,
11]. For example, Rahman and Rahman (2009) [
12] found that for every 1% increase in the degree of land fragmentation, rice yield decreased by 0.05% and technical efficiency declined by 0.03%. At the same time, some studies have also revealed potential positive effects of land fragmentation. On the one hand, fragmentation can help optimize crop planting patterns, allowing farmers to optimize crop matching across different plots, thereby increasing overall yields [
13]. On the other hand, land fragmentation can help disperse production risks and reduce overall losses caused by localized pest infestations, hailstorms, or floods [
14]. For example, A study by Eder (2025) [
15] of crop farms in Austria showed that although land fragmentation has no significant negative impact on technical efficiency, it leads to higher risk-adjusted returns. A study by Ayalew et al. (2019) [
16] in Rwanda similarly found that land fragmentation may have positive economic functions under specific circumstances.
In summary, the debate over the relationship between farm size and production efficiency, coupled with the coexistence of both positive and negative effects of land fragmentation, collectively points to a core issue: how can the causal logic between land resource allocation and production efficiency be revealed at a micro level? The arable land plot size is the key link between these two research trajectories. It serves as the fundamental unit of farm management size and is the core indicator of land fragmentation. Therefore, conducting an in-depth examination of the impact of plot size on grain production efficiency and thoroughly analyzing its underlying mechanisms not only deepens the debate over the “inverse relationship” but also refines research on the effects of “fragmentation”. Based on this, relevant scholars have analyzed the mechanisms through which plot size affects grain production efficiency from the perspectives of agricultural machinery use, adoption of agricultural technologies and social services, agricultural production inputs, and grain output. First, at the level of agricultural machinery utilization, farm machinery is characterized by indivisibility and technological rigidity, which impose objective thresholds and compatibility requirements with respect to cultivated plot size [
9]. Excessively small plots not only constrain the normal operating speed of machinery, increase turning and idling time, and reduce overall operational efficiency, but also intensify boundary effects, leading to higher crop damage rates and a greater proportion of land occupied by ridges and bunds. These factors ultimately result in increased fuel consumption and time costs for mechanized operations, thereby limiting both the diffusion and effectiveness of agricultural mechanization [
17]. As agricultural machinery constitutes a core carrier of agricultural modernization and a pivotal driver of improvements in grain production efficiency [
18], the average size of cultivated plots directly affects the operability, utilization efficiency, and economic returns of machinery, constraining the level of mechanization and, in turn, influencing grain production outcomes. Second, excessively small plot sizes are not conducive to the adoption of agricultural technologies or the development of agricultural socialized services. With respect to technology and service adoption, the application of modern agricultural technologies exhibits a significant dependence on plot size [
19]. When plots are too small, the fixed costs associated with technology adoption cannot be effectively amortized through economies of scale, reducing the rate of return per unit area and thereby discouraging farmers from adopting and applying advanced technologies [
20]. Moreover, fragmented and diminutive plots are incompatible with the standardized and large-scale production models required by modern agriculture, impeding the deepening of agricultural socialized services. Specifically, small plot sizes increase the difficulty of unified operations and machinery scheduling, elevate coordination and transaction costs, and induce service providers to raise operational fees to compensate for efficiency losses. Such “price discrimination” against small plots, in turn, weakens farmers’ willingness to purchase specialized services [
21]. Third, in terms of production costs, expanding plot size contributes to reductions in per-unit input of material resources and labor costs. Compared with larger plots, smaller plots entail repeated boundary setting, more frequent machinery transfers, and lower field management efficiency, leading to redundant input of production materials and wasted labor time, thereby raising per-unit production costs [
22]. Empirical evidence indicates that for every 1% increase in plot size, the utilization efficiency of chemical fertilizers and other agricultural inputs increases by approximately 0.2% [
23], while labor input per unit area decreases by about 0.73% [
24]. Fourth, with regard to grain output, excessively small plot sizes may undermine production performance. Smaller plots constrain the rational layout and efficient construction of public infrastructure, such as field roads and irrigation channels, and reduce the utilization efficiency of existing agricultural facilities, thereby imposing rigid constraints on grain yields per unit area [
25]. In addition, small plots raise production costs and depress operational returns, making it difficult to achieve contiguous and consolidated farming through land transfer arrangements. This may induce farmland abandonment by farmers [
26], resulting in declines in both sown area and total grain output. Research further shows that expanding plot size from less than 1.5 mu to 105–900 mu can increase grain output by 2% to 13% [
24].
In summary, the existing literature provides valuable multi-perspective insights into the relationship between plot size and grain production efficiency. However, it still has the following shortcomings: on the one hand, most studies concentrate on the impact of plot size on a single intermediate stage, such as mechanized operations and technology adoption, which is a lack of systematic examination within a unified analytical framework to determine whether these parallel mechanisms exist and their relative importance. In other words, the empirical identification of the underlying transmission mechanisms remains incomplete; on the other hand, the majority of existing studies focus on the effect of arable land plot size on average production efficiency, while overlooking potential structural heterogeneity at both the household and regional levels.
Against this background, drawing on micro-level data from 1362 farm households across provinces along the Yellow River Basin, this study investigates the efficiency-enhancing effect of cultivated plot size on grain production, its underlying mechanisms, and its heterogeneous impacts. The potential marginal contributions of this study are twofold. First, it develops a systematic analytical framework and simultaneously test multiple core mechanism paths, namely agricultural machinery use, adoption of agricultural technologies and socialized services, production cost control, and grain yield improvement, thereby comprehensively revealing the internal “black box” of how field size influences grain production efficiency. Second, it overcomes the limitations of traditional mean-reversion approaches by analyzing the heterogeneity of plot size effects across four dimensions: internal structure (differing efficiency levels), farm scale, crop type, and geographic region. This provides a more scientific basis for formulating differentiated, targeted policies on consolidating small plots into larger fields and promoting scale operations.
3. Data, Variables, and Methods
3.1. Data Sources
The data used in this paper is sourced from the special survey on “Ecological Conservation and High-Quality Development of Agriculture and Rural Areas in the Yellow River Basin,” conducted by the College of Economics and Management of Northwest A&F University in China from July to August 2023. Employing a multistage stratified random sampling method, the survey systematically collected microdata on grain production, land parcel characteristics, and factor inputs from 1362 farming households across 165 administrative villages in five provinces/autonomous regions: Gansu, Ningxia, Inner Mongolia, Henan, and Shanxi. This provides a robust data foundation for measuring key variables and identifying causal effects in this study. The selection of this dataset was based on the following considerations: First, the middle and upper reaches of the Yellow River Basin constitute a vital grain-producing region and ecological barrier for China. The efficient utilization of its arable land resources holds strategic significance for ensuring regional food security and promoting sustainable agricultural development. Second, this region exhibits strong representativeness in terms of topography, climate conditions, and agricultural development levels, coupled with distinct gradient variations. This makes it an excellent observational sample for studying the impact of average plot size on grain production efficiency.
3.2. Variables Selection and Description
3.2.1. Dependent Variables
To analyze the impact and mechanism of cultivated arable land plot size on farmers’ grain production efficiency and test the research hypothesis, this study categorizes dependent variables into two types: outcome dependent variables and intermediate dependent variables. The outcome dependent variable is grain production efficiency, focusing on the technical efficiency of producing the two primary grain crops, wheat and corn, in the sample region. Drawing on existing research [
40], the technical efficiency of grain production is measured through an indicator system constructed from input and output dimensions. Efficiency calculations employ an input-oriented non-radial SBM model under conditions of variable returns to scale.
denotes the grain production efficiency value; n represents the number of decision-making units; m and s denote the number of input and output variables, respectively; and denote the slack variables for inputs and outputs, respectively; x and y denote the input and output variables, respectively; X and Y denote the input and output matrices, respectively; λ denotes the weight vector.
The reason for using an input-oriented non-radial SBM (Slack-Based Measurement) model to measure efficiency is twofold. On the one hand, changes in the plot size primarily affect farmers’ input behaviors, such as the use of agricultural machinery, labor, and materials, which in turn impact grain production efficiency. An input-oriented approach can accurately identify the reduction in input redundancy achieved by expanding plot size for a given output. On the other hand, different input factors respond differently to changes in plot size, and the non-radial model allows for the reduction in each input factor by varying proportions, thereby avoiding the limitation of proportional adjustments found in radial models. This approach more accurately captures the structural economies of scale and factor substitution relationships resulting from plot consolidation.
Intermediate dependent variables include material inputs, agricultural machinery operations, labor inputs, adoption of agricultural technologies, and grain yield per mu. Given that agricultural socialized services often rely on farm machinery operations for implementation, this study uses the number of agricultural socialized services utilized by households to reflect their farm machinery usage. Changes in grain production efficiency primarily stem from both input and output aspects. Therefore, the analysis examines how the size of cultivated land parcels influences the aforementioned intermediate explained variables to explore the mechanism of action on grain production efficiency.
3.2.2. Explanatory Variables
The core explanatory variable is the average plot size of farmland managed by households, expressed as the ratio of total managed farmland area to the number of plots. Generally, larger average plot sizes indicate more contiguous farmland holdings and lower fragmentation. The selection of this variable is primarily based on the following considerations: first, the core of this study is to investigate the scale economy effects at the plot level. The average plot size directly reflects the degree of land consolidation in a single plot, which allows for more precise capture of key transmission paths such as improved mechanization efficiency and input savings. Indicators such as the number of plots and the distance between them primarily reflect the degree of fragmentation in farm operations. Second, the Chinese government’s policies, such as promoting the consolidation of small plots into larger ones (“small fields into big fields”), aim to expand the average size of individual plots. The average plot size has stronger practical relevance and guiding significance in the context of ongoing national agricultural strategies. Third, the number of plots is highly correlated with the total scale of farm operations, and including it directly could lead to multicollinearity issues. In contrast, average plot size allows for better identification of the net effects of plot consolidation while controlling for total scale.
3.2.3. Control Variables
Considering the practical realities and influencing factors of grain production, this study controls for the head of household’s personal characteristics, household management status, farmland quality, and regional location.
3.2.4. Instrumental Variables
Identifying the causal effects of plot size on each dependent variable requires selecting appropriate instrumental variables. Following the methodology of Zhang et al. [
40], this study employs “the average plot size of other farmers in the same village” as the instrumental variable for household average plot size. Theoretically, this instrument satisfies two fundamental conditions: first, correlation, meaning it is highly correlated with the dependent variable. Land allocation methods, topography, and whether concentrated land consolidation has been implemented are two key factors determining the average size of farmland plots managed by households [
41]. Within the same village, topography, land allocation methods, and the presence of concentrated land consolidation exhibit high homogeneity. Consequently, the average size of farmland plots managed by a particular household is clearly closely related to the average size of farmland plots managed by other households in the village. Second, exogeneity, meaning no direct relationship with the dependent variable. Grain production constitutes an internal household activity [
41]. Households allocate their grain production using their own arable land, labor, and agricultural machinery services. The average plot size of other households in the same village does not directly impact their grain production efficiency.
The definitions and related explanations of variables are shown in
Table 1.
3.3. Econometric Model
To estimate the impact of farmland parcel size on household grain production efficiency, this paper establishes the following model:
In the equation, epresents the explanatory variable, denoting the average area of cultivated land plots for the i-th farmer; the dependent variable simultaneously reflects the production efficiency of wheat and corn for the i-th farmer, as well as their material inputs, agricultural machinery operations, labor inputs, adoption of agricultural technology, and grain yield per mu; denote the set of control variables encompassing the individual characteristics of the household head, the operational status of the household, and the quality of cultivated land; , and are the parameters to be estimated; represents the random disturbance term.
To test research hypotheses H1 and H2, this paper first estimates Equation (1) using ordinary least squares (OLS). Subsequently, to identify causal effects, both two-stage least squares (2SLS) and propensity score matching (PSM) methods are employed to estimate Equation (1). By employing instrumental variables, the 2SLS method effectively addresses endogeneity biases arising from omitted variables, measurement errors, or mutual causation. The core principle of PSM involves matching samples based on observable covariates to mitigate self-selection bias in causal inference. Its fundamental logic involves finding highly similar matching counterparts for each individual in the treatment group from the control group based on covariate characteristics, thereby constructing a “counterfactual” control group to estimate the average treatment effect (ATT) for the treatment group. However, since the explanatory variable “average plot size” in this study is continuous, to adapt it for PSM, two treatment groups are constructed using the sample median and mean of the average plot size managed by households as thresholds to construct two treatment variables. Households with an average plot size above the median (or mean) were assigned to the treatment group (value 1), while those below the median (or mean) were assigned to the control group (value 0). Multiple matching methods, including nearest neighbor matching, kernel matching, and radius matching, were employed to test the robustness of the average plot size’s impact on household grain production efficiency.
However, the aforementioned estimation methods primarily examine the impact of on the conditional mean of . Their results reflect the effect of average plot size on grain production efficiency for farmers in the “average situation,” failing to reveal the differentiated impact of plot size on different types of farmers. To avoid “averages masking structural differences” and examine the differentiated impact of average plot size on heterogeneous farmers, this study introduces an instrumental variable quantile regression (IVQR) model to test research hypothesis H3.
6. Discussion
This study provides empirical evidence that expanding the average plot size can help improve grain production efficiency in smallholder-dominated agricultural systems. These findings contribute to the ongoing debate regarding land fragmentation and economies of scale in agriculture.
First, in terms of average effects, there is a significant positive correlation between average arable land plot size and the grain production efficiency of farmers in the Yellow River Basin. This finding is consistent with existing research on land fragmentation and agricultural performance. Expanding land plot size can significantly improve the grain production efficiency of farmers in the Yellow River Basin. This finding is consistent with existing research conclusions on land fragmentation and agricultural performance [
44,
45]. At the same time, this study further incorporates the farm size into the analytical framework and finds that, when considering plot size, the optimal grain production efficiency occurs when farmers operate around 81.135 mu of arable land. In contrast, the study by Ni et al. (2015) [
46], based on China’s National Rural Household Survey data, indicates that the maximum grain production efficiency is achieved when the farmland area managed by a household is around 130 mu. The difference in these findings can be attributed to the strict control of the key variable of plot size in this study. This result highlights that the concentration of land plots significantly moderates the relationship between farm size and grain production efficiency. When the plot size is larger and more concentrated, the positive impact of farm size on production efficiency can be more fully realized. However, when land fragmentation is high, even if the total farm size is expanded, its effect on improving production efficiency will be constrained. Therefore, when formulating relevant policies, attention should be paid to the appropriate expansion of farm size and effective management of land fragmentation, with a focus on increasing the size of individual plots to fully capitalize on the benefits of scale farming for improving grain production efficiency.
Second, the mechanism analysis indicates that the positive relationship between plot size and grain production efficiency may be realized through multiple pathways. This study finds that expanding plot size is associated with an increase in the adoption of agricultural machinery and technology, a reduction in material and labor inputs per unit area, and an improvement in grain yield per unit area. This multi-channel transmission framework expands upon existing research. The previous literature typically examines individual mechanisms in isolation, such as mechanization or input costs, whereas this study integrates them into a unified analytical framework, revealing the complete chain of effects linking “plot size-input structure-production efficiency”. Notably, the transmission effects of agricultural machinery use and technology adoption are particularly prominent. This finding is consistent with the conclusions of Balogun et al. (2018) [
47], indicating that the optimization of land structure is a fundamental prerequisite for transforming agricultural production methods. Therefore, policy formulation should comprehensively consider the synergistic effects of increased plot size across multiple pathways, including mechanized operations, technology adoption, cost control, and yield enhancement, to systematically advance land structure optimization.
Finally, the heterogeneity analysis reveals the conditional nature of scale advantages. This study finds that the positive association between land plot size and grain production efficiency is more pronounced among farmers with higher efficiency and smaller operational scales, exhibiting both “supporting the strong” and “favoring small farmers” characteristics. This finding indicates that farmers with different initial conditions benefit from land structure adjustments to varying degrees. For high-efficiency farmers, their management capabilities complement the expanded plot size, enabling them to more effectively integrate modern production factors. For small-scale farmers, expanding plot size helps break the economies of scale constraints caused by land fragmentation, allowing them to achieve marginal benefits through land consolidation, even with limited overall scale. This finding addresses concerns regarding whether land consolidation policies favor large-scale operators. The study confirms that there is also a positive correlation between moderate land consolidation and the improvement of grain production efficiency among small farmers. Therefore, policy development should avoid a “one-size-fits-all” approach and instead implement differentiated land consolidation strategies based on farmers’ production efficiency levels and operational scale characteristics, ensuring that farmers with different conditions can all benefit from the expansion of plot size.
7. Conclusions and Policy Implications
7.1. Main Conclusions
Based on survey data from grain-growing households in the Yellow River Basin, this study employs methods such as 2SLS, PSM, and IVQR to examine the average treatment effect of farmland plot size on household grain production efficiency. It also reveals the underlying transmission mechanisms and investigates the heterogeneity of this effect across different households, regions, and crops. The study yields the following three key findings: First, there is a significant positive correlation between expanding average land plot size and grain production efficiency in the Yellow River Basin. Second, in terms of mechanisms, the expansion of land plot size is associated with reductions in material and labor input costs, as well as increases in grain yield, with these pathways potentially working together to improve farmers’ production efficiency. Third, the heterogeneity analysis reveals that in the Yellow River Basin, the positive association between land plot size and grain production efficiency is more pronounced among higher-efficiency farmers and smallholders with less than 30 mu of operational land. Additionally, compared with farmers in the central region and corn growers, the association between land plot size and production efficiency is stronger in the western regions and among wheat-growing farmers.
7.2. Policy Implication
First, to enhance grain production capacity and safeguard national food security in the Yellow River Basin, it is essential to fully leverage the key role of farmers in grain production. This requires addressing the issues of excessively small plot sizes and severe land fragmentation. Expanding the average plot size and mitigating fragmentation should be regarded as a crucial measure to promote the effective linkage between smallholder farming and modern agricultural development, thereby facilitating the moderate consolidation and contiguous management of farmland operated by smallholders. Second, based on the actual conditions of the Yellow River Basin, efforts should focus on accelerating agricultural mechanization and technology adoption to increase farmers’ operating income and enhance grain production capacity, which requires addressing the fragmented nature of smallholder contracted land. Efforts should focus on promoting contiguous land use to swiftly transform the current situation of “multiple small plots” held by smallholder farmers. During the extension of second-round land contracts, integrating efforts with high-standard farmland construction and land consolidation, promoting land swaps and consolidation among farmers to merge small plots into larger ones, achieving “one plot per household”, and expanding the average size of cultivated land parcels represent a key reform approach. Third, in response to the heterogeneous needs of different types of farmers, regions, and crops within the Yellow River Basin, a differentiated and pilot-based approach should be adopted and advanced in an orderly manner. Actively explore and introduce legal and policy documents to resolve fragmented farmland and excessively small plots. At the same time, the average plot size within a region can be incorporated as a key indicator in the evaluation and acceptance of high-standard farmland construction, thereby promoting the continuous expansion of the average plot size.
7.3. Limitations and Prospects
First, limitations in the core variable measurement. Land fragmentation is a multidimensional concept, encompassing not only plot size but also plot number, spatial distribution, plot spacing, and other dimensions. This study primarily uses the average plot size as the core explanatory variable, focusing on the scale dimension of land fragmentation, and does not fully capture other dimensions of land fragmentation and their interactive effects. Therefore, the empirical results of this study should be interpreted as the impact of fragmentation at the plot size level on grain production efficiency, rather than the overall effect of land fragmentation as a comprehensive phenomenon. Future research could combine spatial plot data (e.g., GIS data and plot coordinates) to further investigate the differentiated impact of multidimensional land fragmentation on agricultural production efficiency.
Second, limitations in the data dimension. Due to constraints on data availability, this study uses cross-sectional data from a single year, making it difficult to capture the dynamic impact and long-term effects of changes in plot size on grain production efficiency. Grain production is characterized by cyclicality and volatility, and the efficiency gains from land consolidation may exhibit a lag. Observational data from a single period cannot fully account for the interference of factors such as annual climate and market fluctuations. Therefore, the conclusions of this study are primarily based on cross-sectional comparisons among farmers, reflecting differences between groups at the time of the survey rather than causal evolution over time. In the future, panel data could be collected to track changes in farmers’ circumstances before and after land consolidation, or quasi-natural experiments such as land consolidation initiatives, using the difference-in-differences (DID) method to provide more rigorous empirical evidence for causal relationships.
Third, limitations in sample scope. The study focuses on provinces along the Yellow River Basin, and the sample crops are limited to wheat and corn, which may restrict the generalizability of the findings. There are significant differences in resource endowments, topographical conditions, agricultural production methods, and levels of economic development among major grain-producing regions. For instance, large-scale mechanized farming dominates in the Northeast Plain, while rice cultivation in the Yangtze River Basin exhibits distinct patterns of fragmentation and operational modes compared to the Yellow River Basin. Therefore, the findings of this study mainly reflect empirical evidence under the specific crop and regional context of the Yellow River Basin, and caution is needed when extending them to the national level or to other crops. Future research could expand the research scope to include major grain-producing regions such as the Northeast Plain, the Yangtze River Basin, and the Sichuan Basin, and include a diverse range of crops such as rice, soybeans, and potatoes. This would help examine the regional heterogeneity and crop-specific variations in the effects of plot size, thereby further enhancing the external validity of the conclusions.