Movatterモバイル変換


[0]ホーム

URL:


Previous Article in Journal
KRID: A Large-Scale Nationwide Korean Road Infrastructure Dataset for Comprehensive Road Facility Recognition
 
 
Search for Articles:
Title / Keyword
Author / Affiliation / Email
Journal
Article Type
 
 
Section
Special Issue
Volume
Issue
Number
Page
 
Logical OperatorOperator
Search Text
Search Type
 
add_circle_outline
remove_circle_outline
 
 
Journals
Data
Volume 10
Issue 3
10.3390/data10030037
Font Type:
ArialGeorgiaVerdana
Font Size:
AaAaAa
Line Spacing:
Column Width:
Background:
Data Descriptor

Experimental Parametric Forecast of Solar Energy over Time: Sample Data Descriptor

1
CS-OGET—Center of Excellence of Studies in Oil and Gas Engineering and Technology, Faculty of Engineering, Eduardo Mondlane University, Mozambique Avenue Km 1.5, Maputo 257, Mozambique
2
CPE—Centre of Research in Energies, Faculty of Sciences, Eduardo Mondlane University, Main Campus No. 3453, Maputo 257, Mozambique
3
Department of Physics, Faculty of Sciences, Eduardo Mondlane University, Main Campus No. 3453, Maputo 257, Mozambique
4
Department of Mechanical Engineering, Instituto Superior Técnico, University of Lisbon, 1600-214 Lisbon, Portugal
5
Department of Mathematics and Informatics, Faculty of Science, Eduardo Mondlane University, Main Campus No. 3453, Maputo 257, Mozambique
*
Author to whom correspondence should be addressed.
Submission received: 10 November 2024 /Revised: 7 January 2025 /Accepted: 14 January 2025 /Published: 17 March 2025
(This article belongs to the TopicSmart Energy Systems, 2nd Edition)

Abstract

:
Variations in solar energy when it reaches the Earth impact the production of photovoltaic (PV) solar plants and, in turn, the dynamics of clean energy expansion. This incentivizes the objective of experimentally forecasting solar energy by parametric models, the results of which are then refined by machine learning methods (MLMs). To estimate solar energy, parametric models consider all atmospheric, climatic, geographic, and spatiotemporal factors that influence decreases in solar energy. In this study, data on ozone, evenly mixed gases, water vapor, aerosols, and solar radiation were gathered throughout the year in the mid-north area of Mozambique. The results show that the calculated solar energy was close to the theoretical solar energy under a clear sky. When paired with MLMs, the clear-sky index had a correlational order of 0.98, with most full-sun days having intermediate and clear-sky types. This suggests the potential of this area for PV use, with high correlation and regression coefficients in the range of 0.86 and 0.89 and a measurement error in the range of 0.25. We conclude that evenly mixed gases and the ozone layer have considerable influence on transmittance. However, the parametrically forecasted solar energy is close to the energy forecasted by the theoretical model. By adjusting the local characteristics, the model can be used in diverse contexts to increase PV plants’ electrical power output efficiency.
Dataset: https://github.com/Muco-1990/Parametric.git (accessed on 5 November 2024).
Dataset License: CC–BY Attribution 4.0 International.

    1. Summary

    It is estimated that just 70% of solar energy reaches the Earth’s surface [1,2], and that energy passes through all atmospheric routes unaltered [3]. It is believed that the atmosphere absorbs the remaining 28% and releases the remainder into space [1,3]. Sub-Saharan Africa has the highest rate of non-electrification of rural areas in the world. In response to this problem, PV technology has been increasingly adopted in this context compared to other technologies [4]. In some regions, the significant variability in solar energy and the fluctuations resulting from the ability of this resource to reach the Earth’s surface are still unknown; this affects the optimal performance of solar plants [3,5]. However, high-capital technologies, such as hydro, thermal, wind, and sea, are still being used in other parts of the world [6,7]. These technologies not only lead to the destruction of ecosystems and require the use of suitable heat transfer materials, but they also sometimes introduce atmospheric pollution, making PV solar energy a more practical option that can be used anywhere there is solar energy availability [1,3,7].
    The primary problem is the fluctuation in solar energy, which is largely brought on by a variety of reasons including human and industrial activity as well as natural processes that create aerosols, which are microscopic particles or vapors that eventually enter the Earth’s atmosphere [1,2]. About 30% of solar energy is absorbed and attenuated [1]. Uniformly mixed gases absorb 0.205%; water vapor absorbs 16–20% of radiation [2]; gases like carbon dioxide strengthen the ozone layer, which absorbs and attenuates about 3–7% of radiation [8]; greenhouse and climate change gases contribute to the attenuation and partial absorption of 2–3% [1,5]. Thus, solar energy is occasionally interrupted before its arrival on the Earth’s surface.
    Using MLM approaches and models for the spatial interpolation of measured solar radiation, recent research has demonstrated a smaller margin of error with medium error rates [9,10,11]. For instance, the Random Forest (RF) model and seven more models were used in an MLM for the spatial interpolation of solar radiation measurements [10,12]. Multiple regression and correlation analysis were used to provide solar radiation flows for applications in solar energy systems, and regression was used to forecast solar irradiance using image resources from the entire sky [13,14,15]. The results show that the models estimated the global flux of solar radiation within a narrow relative error range. The absolute average error was 10.31 W/m2, while the standard deviation was 1.5 W/m2 [16,17,18,19].
    Because of issues with solar energy fluctuation that impact the PV output power of solar plants, the goal was to use the MLM to parametrically model the forecast of solar energy in a short-scale measurement along the mid-north Mozambique Channel and predict the excess energy to deposit in the electrical network for other uses.
    This study offers a chronological analysis of solar energy, including all of the parameters contributing to its decrease from 2019 to 2021. This allows for a thorough analysis of the intentions to use solar energy for a variety of purposes, including engineering, architecture, and agriculture, with a focus on solar PV conversion. This makes it possible to predict the next fleet of electrical energy production using solar PV resources and effectively analyze the current fleet to eliminate any fluctuations in the incidence of solar energy beams on a solar power plant’s surface. This research is different from other previous studies on PV solar energy forecast modeling because it uses atmospheric, meteorological, environmental, spatiotemporal, and geographic origin variables as the input parameters. These variables correct and ensure that the solar energy forecast, which was analyzed using the MLM, is accurate. This analysis reduces the output errors by connecting the current and future performance with the interpolation of modeled energy. It also includes a parametric analysis of the components related to the reduction in solar energy, ensuring they are modeled in the same dimension. This, in turn, presents greater efficiency and accuracy in solar energy forecasting. In contrast to other studies of the same type, which include estimates of hourly, daily, monthly, and even annual averages (the latter of which are more often included in long-term forecasts), this study uses short measurement intervals of roughly one minute as the analysis sample.
    Solar energy samples along the mid-north Mozambique Channel were collected using both short-term and long-term measurement intervals, using Eppley pyranometers. The data were collected between 47 and 70 m above the ground. The data were then downscaled and used as a variable for sample analysis related to PV installations, where it was assumed that all samples, under specific conditions, aligned with the location’s geographic characteristics. This helped to estimate the PV power and size of the PV system equipment according to the estimated solar energy.
    Together with several additional parameters, the parametric model, computed using the MLM, considers the several features that prevent energy from penetrating the Earth’s surface as input variables. The quantity of energy withheld from the Earth’s surface is determined by the total of the experimental solar radiation and the due irradiations brought on by many factors, including the ozone layer, mixed uniform gases, and aerosols. Seasons can cause changes on the Earth’s surface. Among other effects considered, for example, rain greatly reduces the number of micro aerosols (Aitkens).
    The MLM enhances the performance of PV solar energy systems and energy estimation by reducing the estimated error. However, the same well-known MLM predicts the future behavior of the GHI sample and optimizes the clear-sky index and its composition in the regions, giving a short-scale measurement for of variability of solar energy availability [7,12,20].
    Controlling when solar energy is most or least prevalent has crucial uses in building, architecture, fishing, agriculture, and other fields that can benefit from the data sample. It is essential to comprehend the amount of solar energy available via estimation that accounts for all the factors that contribute to its decrease upon its arrival on the Earth. The information provided demonstrates the behavior of solar energy, its consideration in PV projects, and the substantial impact that energy fluctuations have on the lifespan of the system’s component devices, as well as their optimal use and essential elements. Along with optimizing PV systems and ensuring efficient energy consumption, these components can also assist society in using energy for disposal.
    The current study, however, presents information regarding solar energy composition in some remote locations, such as desert regions, high seas, animal reserves, and woodlands, which could represent a difficult problem. Apart from the network that would tackle the main electrical energy problem, this information would allow for the modeling of solar energy that is accessible in these areas and may be used for several regional applications [21,22]. The output of the MLM that reduces error presents an optimal estimate of solar energy levels and the optimal discharge of solar energy systems. The given parametric modeling method of solar energy takes into account all the parameters that lower solar radiation to the Earth’s surface and can be applied anywhere in the world by adopting local parameters like the ozone layer, aerosols, the concentration of uniformly mixed gases, dust, and water vapors, inflated by local climate, domestic, and industrial conditions.

    2. Data Description

    2.1. Data Collection and Processing

    A sample of GHI, temperature, aerosols, evenly mixed gases, and ozone across the mid-north region of Mozambique was collected between 2019 and 2021 from the National Fund of Energy (FUNAE) and the National Institute of Meteorology (INAM). Pyranometer sensors were used in combination with an Eppley pyrheliometer to monitor solar radiation. The aerosol optical mass, gas reflectance, and precipitation (measured using a photometric solar camera) were collected from the AERONET database in the provinces of Niassa and Sofala. The data were calibrated using accepted methods, and some areas of the data were incomplete and required additional interpolation. The data, collected over several years, were compressed at levels 1 through 2.0. The data included AOT (aerosol optical thickness), precipitation, and ozone levels. The AERONET database showed continuously high levels of ozone, precipitation, AOT, and others as shown inTable 1.

    2.2. Study Area

    Mid-north Mozambique is the research region, and the Ocua, Chiputo, Vanduzi, Choa, Lugela, Massangulo, and Nanhupo stations were set up there. It is located between the parallels 10°27′ and 26°52′ south latitude and the meridians 30°12′ and 40°51′ east longitude. The sample was prepared as follows to perform the computation utilizing data from the hours of 6:00 am to 6:00 pm, which is sunrise to sunset: because the sample collected includes a three-year interval of complete measurement in 2019, 2020, and 2021, starting in the sixth month of the year, the data in the FUNAE campaign only covered the months of June through December in 2019 and 2021.

    2.3. Specifications of Each Sample File

    The “CSV” files entitled “Chiputo_MZ06_PF”, “Vanduzi_1_MZ11_PF”, “Choa_MZ21_PF”, “Nanhupo_1_MZ24_PF”, “Nanhupo_2_MZ24_PF”, “Massangulo_1_MZ25_PF” and “Ocua_MZ03_PF” refer to the measured and estimated data samples and to parameters collected in Chiputo in the province of Tete, Vanduzi-1 in the province of Sofala, Choa-1 and Choa-2 in the province of Manica, Nanhupo-1 and Nanhupo-2 in the province of Nampula, Massangulo-1 and Massangulo-2 in the province of Niassa, and Ocua in the province of Cabo Delgado.
    In each file, column A under the title “Year” refers to the year of measurement in the state, column B under “month” refers to the month of measurement, column C under the title “Date” refers to the day data were recorded, column D under the title “date” refers to the accumulative days of measurement, the column under the title “GHI” refers to the measured global sounding data recorded at 10-min intervals, column F under the title “ID” refers to the identification assigned by the researcher to the measurement station, column G under the title “Station” refers to the name of the measurement station, column H under the title “Province” refers to the province of the location of the measurement station, column I under the title “Tower” refers to the name of the tower at the measurement site (owner of the tower), column J under the title “Code” refers to the code of the measurement site, column K under the title “Long (X)” refers to the longitude of the measurement site, column L under the title “Lat. (Y) refers to the latitude of the measurement site, column M under the title “altitude” refers to the altitude relative to sea level of the measurement location, column N under the title “WV_AOT (675_nm)” refers to the exact wavelength of the optical aerosol thickness (675 nm) in nanometers, column O under the title “WV_AOT (440_NM)” refers to the exact wave comparison of aerosol optical thickness (440 nm) in nanometers, column P under the title “AOT (675_nm)” refers to the optical spot of solar energy in the 675 nm wave composition, the Q column “AOT (440_nm)” refers to solar energy optics featured in the 440 nm wave composition, column R under the title “PW (cm)” refers to the measured precipitating water, column S “Pre (mbars)” refers to millibars, column T under the title “O3 (cm)” refers to the thickness of the ozone layer, column U under the title “N (cm)” refers to NO2 and other uniform mixed gases, and finally column V under the title “temperature_ (K)” refers to local temperature.

    3. Methods

    3.1. Validation and Data Curation

    The solar energy samples and atmospheric parameters were collected in situ, on a short scale, for the hours of sunlight (from 6:00 am to 6:00 pm). Potential measurement failures were reviewed at all locations, and the devices were associated with convertible networks and assessed for potential obstruction by animals, shadows, and others. These flaws were addressed by interpolation using the RF model. Then, the error of the dataset was determined (sample error, instrumental error, and scale error), and the sample was organized into files according to the installation reference, the year of measurement, day and time, and the value of solar energy and other parameters. Then, to avoid the high deviation of the sample due to several factors such as multiple reflection and interference, indicated by pyranometers displaying high values, the data were separated into acceptable classes and validated according their position in the spectrum of theoretical radiation. This comprehensively shows the presence of outliers that would considerably affect the estimate and consequently estimate failures. The application of the solar energy MLM was simplified. Testing using several training models points to lower measurement errors in the order of 0.24 and 0.31 using the Artificial Neural Network (ANN) model and the RF model.
    The Collected Source Sample was registered and validated with ID 52721 in the EPPI Reviewer 6 Platform version 6.15 (https://eppi.ioe.ac.uk/eppireviewer-web/intropage) on 23 January 2024. According to the connectivity diagram included in Explorer, roughly 768 of the gathered keywords were interconnected during testing.
    Several meta-analyses were presented by the system, such as those highlighting the transmittances, absorbances, attenuations, and types of solar energy on the Earth’s surface as a result of the parameters provided; analyzing the effects of the clear-sky index; calculating the temporal variations of a single measurement point; and calculating the spatiotemporal variable between two regions. Additionally, the data were subjected to statistical software analyses. Similarities were found in a few meta-analyses (ID: 54986-STATA), with a focus on recommendations for the calculation of parametric energy efficiency, as shown inSection 3.2.

    3.2. Forecast Parametric Model

    The parametric estimate of solar energy considers all related parameters, starting by determining the hemisphere declination that exhibits the behavior shown inFigure 1.
    The total incident radiation amount under the specified atmospheric conditions is calculated using the idealized parameterization approach, accounting for radiant energy. The transmittance of the atmosphere is determined using regression analysis, using an overall spectrally integrated transmittance for each atmospheric constituent. The relative optical air massmτ [2] is calculated consideringp as the pressure in mbars and reduced precipitable water w, with w’ the precipitable water [3,23,24]. The thickest vertical ozone layer, represented in centimeters byl, is the relative optical path length of ozoneU3, whereU3=lma.U1=wmτ is the relative optical path length adjusted for precipitable water pressure [25,26,27]. Assuming a multiplicative transmittance,τ=i=1i=jτi.
    To illustrate the transmittance via Rayleigh scatteringτr, the distinct spectrally integrated quantities of the direct irradiance owing to the different atmospheric elements, as given in Equation (1), are as follows [1,26,28]:
    τr=e0.0903ma0.841.0+ma+ma1.01
    The transmittance for direct irradiance owing to ozone absorption, orτ0, is given asτ0=1α0 [24,29,30]. The transmittance for uniformly mixed gas absorption under direct irradiance,τg, is expressed as given in Equation (2) [1,3].
    τg=e0.0127ma0.26
    The transmittance for direct radiation is caused by water vapor absorptionτw=1αw [2,31,32,33]. The transmittance for direct irradiance owing to aerosol attenuation is denoted byτa, considering the formula forδa=0.2758δa1+0.35δa2 is given in Equation (3) as [2,34],
    τa=eδa0.8731.0+δaδa0.7088ma0.91708
    The summarized comparison of transmittances shows transmittance due to ozone and uniformly mixed gases with the highest transmittances, as depicted inFigure 2.
    The direct normal irradianceGb that arrives without suffering substantial reductions, consideringGn the direct irradiance at mean sun-earth, taking the zenith angleθZ is given in Equation (4) as [7,35],
    Gb=Gncosθz
    The transmittance of direct radiation due to aerosol absorptionτaa, and considering the scattered diffuse Rayleigh irradiance is given in Equation (5) [1,36].
    τaa=11w01ma+ma1.061τa
    Diffuse irradiance scattered by aerosols, considering the multiple reflected radiationGda, withFc andw0 denoted roughly, is given in Equation (6) [2,24,37].
    Gda=Gsccosθzτ0τgτwτaa·0.79Fc1τas1ma+ma1.02
    Global radiationG is determined by adding the estimates ofGb and diffuse irradianceGd, and this is provided in Equation (7) [3,5,7]
    G=Gb+Gd
    Data were taken from the original, in situ measurements. The corresponding measurement errors were then found by applying models for forecasting solar energy data [14,38,39]. The research employed a number of MLMs, such as Gaussian Process Regression (GPR), the RF model, a Support Vector Machine (SVM), the ARIMA model, the ANN model, Regression Kriging (RK), Simple Linear Regression, Gradient Boosting Machines, and Long Short-Term Memory (LSTM) networks [30,40,41]. By contrasting data with theoretical clear–sky radiation, the GHI model was established [7,8,39,40,41,42]. The resulting different irradiances are compared inFigure 3, potentiating the model for parametrically forecasting the solar radiation because the estimated solar radiation is similar to the theoretical radiation [22,29,42,43].

    3.3. Clear-Sky Radiation and Its Regression Correlation

    By relating GHI and clear sky radiation, the clear-sky index (Kt) was obtained [3,5,7]. Using values ofKt for different classes of days (clear, intermediate, and cloudy), a connection was made as a function of the distance of the correlation coefficient or systematic connection of the clear-sky indexχijKt [7,14,18],
    χijKt=covKt,it,Kt,j(t)τ=1τKt,itKt,i¯2τ=1τKt,jtKt,j¯2
    Data 10 00037 g003
    Figure 3. Forecasted irradiance comparison (scale [1/365]).
    Figure 3. Forecasted irradiance comparison (scale [1/365]).
    Data 10 00037 g003
    Statistically,χijKt can be1, equal to zero, or1 in a space considered for a subspace station between two points x and y. In order to collect data from direct GHI measurements, a priori estimation was used to reduce losses for any errors, also known as uncertainty or the error margin (error). The number of classes (n) was established by classifying the values derived from theKt. Inferring the regression coefficient ofKt and its incrementsβijKt, withαl as the intercept constant andεt as the error term, as shown in Equation (9) described the characteristics of the regression inferential analysis ofKt and its increments for various classes and types of days in all years, between the previously indicated seasons [14,33],
    Kt,jt=αl+βijKtKt,it+εt
    The correlation between a temporal series and its associated values is measured by the autocorrelation function (ACF). The autocorrelation betweenKt andKt1 that cannot be explained by the delays between them is indicated by the Partial Autocorrelation Function (PAF) for a temporal seriesKt, as shown inFigure 4. This improves the used parametric forecast model by showing that there is a sudden variation and subsequently a homogeneous state for both functions in terms of forecasted energy.
    The spatial autocorrelation, as a function of the distance between the pair of pyranometers for acceptable and unacceptable days, modeled for the classes of clear, cloudy, intermediate and all types of days, decreases with the increase in distance between the pair of sensors. This result was also observed in Ahmad et al. (2018) [20] and Mucomole et al. (2024) [7], who evaluated the inference of 50 to 62 pyranometers over hundreds of kilometers, obtaining many correlation points. The rate of decrease is more pronounced on days that record sudden experimental GHI values of more than 1084.04 W/m2, which are not common in measurements. These indicate that solar radiation resulting from solar flares and other extreme solar activities may be to blame for these values. However, as detailed in the analysis, the solar energy decreases along the correlative curve. With the increase in distance, the spatial correlation structures of the three sky types, along the analysis section, also differ in their decay rates: for acceptable days, the approximate values ofKt correlation coefficient, are as follows: under cloudy sky conditions 0.2986, clear 0.3586, intermediate 0.3201, and all sky types 0.3844; on unacceptable days under cloudy sky conditions 0.3723, clear 0.3758, intermediate 0.3569, and all sky types 0.4899. The regressive inferred structures ofKt within the different sky types are consistent with the associated cloud patterns, presenting cloudy and clear-sky conditions with greater homogeneity equivalent to the existence or not of cloud layers, sometimes with localized shading factors in pyranometers, as shown inFigure 5.
    Both acceptable and unacceptable days have estimated energies near the GHI based on the theoretical model, according to the experimental modeling of the parametric estimation of solar energy classified in terms of day types. Correlative accessibility evaluations regarding heat maps using theKt heat correlation coefficient have an above-average potential, according to a summative analysis of the correlative-regressive study, as seen inFigure 5a. However, the influence of atmospheric, climatic, geographic, and spatiotemporal parameters related to human-industrial activity, reproductiveness of nature, and other factors that emit gases into the environment, such as aerosols, water vapor, gases that strengthen the ozone layer, and even climate change gases, that absorb, attenuate, and emit solar energy, contributes to significant increases in solar energy that can be quantified increasingly by reducing the observation interval, here quantified in about 1 to 10 min of solar energy measurement interval. The increases in solar energy, both for acceptable and unacceptable days, analyzable in terms of the correlation coefficient ofKt, exhibit a decorrelating structure analyzable in the structure of correlative-regression heat maps, as illustrated inFigure 5b.
    TheKt correlation of incrementsχijKt that characterizes the space–time metric of the region taken through the classes of similar spectra of acceptable and unacceptable days presents a decorrelation, as a function of distance; this means that energy accessibility occurs from the beginning of the study period to the end and decreases from cloudy to clear skies and then to intermediate skies. The comparison of the correlationχijKt with the model and several different models using a cloud speed of between 4 and 7 m/s shows a decay of the correlation corresponding to the intermediate skies, but after that, the correlations fall much faster, and there are greater energy transmission disturbances in intermediate conditions.

    3.4. Quality and Solar Energy Dataset Noise

    The sample of GHI and atmospheric parameters after undergoing error correction and interpolation presented high signal clarity and fewer disturbances; however, the variables excluded a priori measurements and during the statistical selection ofKt treatment in statistical classes. The energy estimated by the parametric model shown inSection 3.2 is very close to the theoretical solar energy in clear skies, enhancing the model’s ability to estimate solar energies.

    4. Handling and Applicability of the Forecasted Solar Energy Dataset

    The sample of GHI and atmospheric parameter data was measured on a short-scale using 1-, 5-, and 10-min measurement intervals with daily amplitude after the estimate, and consequently, the daily averages were determined. The data provide the real behavior and estimate of solar energy that reaches the Earth’s surface. They provide the energy transmitted and absorbed by the atmospheric parameters of solar energy estimation such as aerosols, uniformly mixed gases, water vapor, the ozone layer, and atmospheric dust, as well as their attenuation, in the provinces along the mid-north channel. These data are of potential use for forecasting atmospheric environmental pollution in various industries and estimates of emissions of gases harmful to the environment, among other uses. The data sample also provides the estimated solar energy for the direct, indirect, and global components given in parametric form, which are of crucial interest for PV use, among other purposes.
    The solar energy estimated by parametric models with summative analysis by the MLM is close to the theoretical energy, which makes this model appropriate for estimating solar energy adopting the parameters previously described. These data are potentially usable for optimal dimensioning, projection, and implementation of utilization projects as they provide the real scenario of solar energy availability, allowing systems to be adapted according to the available energy, eliminating any potential oscillation in the output of a solar plant, increasing the optimal performance of the solar energy system, and consequently leading to a long lifespan and durability of the PV system. The same data provided can be used for various solar energy estimation applications, including architecture, fishing, agriculture, etc., in addition to assisting society in energy recycling.
    In addition, theKt and its regressive, correlative, and decorrelative behavior, illustrating a high flow of solar energy throughout the analysis area, are tools for making decisions on the efficient implementation of solar projects that contribute to the mass use of clean energy within the scope of sustainable development objectives of access to clean energy for more rural societies without access to energy for basic needs, which still make up the majority of the population.
    This study introduces knowledge about the composition of solar energy in areas with difficult access, such as forests, animal reserves, high-margin, areas, and desert regions. This knowledge allows the modeling of the solar energy available in these areas and its applications. Parametric solar energy modeling considers all parameters that reduce solar radiation and can be applied globally. This combination of MLMs reduces errors, provides optimal solar energy levels, and optimizes solar energy systems.

    4.1. Findings and Conclusions

    It is possible to draw the following conclusions regarding the experimentally forecasted solar energy behavior that is appropriate for various locations and purposes, with a focus on electricity production as a mechanism for helping to achieve the sustainable development goals (SDGs) in the upcoming years:
    All parameters related to solar energy during its journey to the Earth’s surface are addressed to accurately estimate solar energy and forecast the PV production fleet without oscillation, obtaining energy signals of solar energy that are close to those of the theoretical model.
    Low albedo in parametric solar energy forecast modeling indicates low diffuse irradiance because of numerous reflections between the Earth’s surface and the atmosphere of a cloudless sky. At medium albedo, diffuse irradiance is modest, while at high albedo, it is growing. This has been observed at Chiputo, Choa-1, Lugela-1, Massangulo-2, Nanhupo-1, and Ocua, among other locations. In the diffuse irradiance of the cloudless sky, the divergence grows with albedo;
    Approximately 70% to 68%, of the statistical data produced by parametric forecast modeling pertain to intermediate sky days;
    The strong correlation between the clear-sky index and the transmittances and irradiances at each station validates the individual transmittances of the atmospheric constituents;
    The temporal history of solar energy increases near the end of the year and declines in the middle of the analysis region, whereas it peaks around midday and declines at dawn and twilight;
    The real solar energy values become more apparent when theKt is estimated and calculated using all the different parameters once the required adjustments have been made;
    The bulk of forecast and in situ measurements are lower on intermediate sky days, which are the most common during the analysis period.

    4.2. Limitations

    The inability to install radiometers on towers without the shadow effect, frequent outages in the conventional electrical current connected to the solar PV hybrid data storage base in the data logger, or bird obstructions are some of the limitations encountered during the data collection process. However, methods were used to eliminate sample influence errors. Lack of access to GHI data samples for raw sample comparison that are measured at short intervals (less than one minute and ten minutes) was another issue. Different days can be categorized using a binomial distribution comparable to the study that uses cloud speed, dynamics, and sun brightness hours. The modification of MLMs to improve error estimation and address solar energy estimation can also be carried out.

    5. Patents

    This research article is one of the first obligations in obtaining a Doctorate, the Doctoral Course in Energy Science and Technology, at Eduardo Mondlane University, which the main author is attending.

    Author Contributions

    Conceptualization, methodology, validation, formal analysis: F.V.M. and C.A.S.S.; investigation, resources, data curation, writing, preparation of the original draft, review and editing, acquisition of funding, visualization, and software: the main author F.V.M. and L.L.M.; supervision and project administration: F.V.M. and C.A.S.S.; advanced curation of data, writing, and supervision: F.V.M., C.A.S.S. and L.L.M. All authors have read and agreed to the published version of the manuscript.

    Funding

    This research was funded by CS–OGET, Faculty of Engineering, Eduardo Mondlane University, under funding number CS-OGET/2023 for doctoral research.

    Institutional Review Board Statement

    Not applicable.

    Informed Consent Statement

    Not applicable.

    Data Availability Statement

    The data that support the reported outcomes are accessible on the AERONET website athttps://aeronet.gsfc.nasa.gov/new_web/webtool_aod_v3.html (accessed on 11 March 2023), NOAA website athttps://www.noaa.gov/weather (accessed on 24 February 2023), and NASA POWER websitehttps://power.larc.nasa.gov/data-access-viewer/ (accessed on 25 January 2023). Additional data that support the conclusions of this research have not been released and can be obtained from INAM, FUNAE, UEM, or by contacting the corresponding author upon request.

    Acknowledgments

    We express our gratitude to the FUNAE entities for their assistance in providing us with sample data from the campaign conducted between 2012 and 2014. We would also like to thank INAM for supplying us with the sample of solar radiation data spanning from 1995 to 2023 and for granting us access to their facilities for training and experimental tests. Additionally, we extend our appreciation to the Department of Physics at Eduardo Mondlane University for generously making their facilities available for real-time testing and measurements of the latest solar energy behavior. Their provision of a laboratory for data processing greatly contributed to the compilation of this research. Lastly, we would like to acknowledge CS-OGET for their support, as it played an integral role in the culmination stage of this doctoral research.

    Conflicts of Interest

    The authors declare no conflict of interest.

    References

    1. Wenham, S.R.; Green, M.A.; Watt, M.E.; Corkish, R.; Sproul, A.Applied Photovoltaics, 3rd ed; Routledge: London, UK, 2011. [Google Scholar] [CrossRef]
    2. Iqbal, M.An Introduction to Solar Radiation; Academic Press: Toronto, ON, Canada; New York, NY, USA, 1983. [Google Scholar]
    3. Duf, J.A.; Beckman, W.A.Solar Engineering of Thermal Processes; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2013. [Google Scholar]
    4. IEA; IRENA; UNSD; World Bank; WHO.Tracking SDG 7: The Energy Progress Report 2023; World Bank: Washington, DC, USA, 2023; Available online:https://cdn.who.int/media/docs/default-source/air-pollution-documents/air-quality-and-health/sdg7-report2023-full-report_web.pdf?sfvrsn=669e8626_3&download=true (accessed on 16 December 2023).
    5. Mucomole, F.V.; Silva, C.A.S.; Magaia, L.L. Temporal Variability of Solar Energy Availability in the Conditions of the Southern Region of Mozambique.Am. J. Energy Nat. Resour.2023,2, 27–50. [Google Scholar] [CrossRef]
    6. Zhang, Y.; Chen, L. Estimation of Daily Average Shortwave Solar Radiation under Clear-Sky Conditions by the Spatial Downscaling and Temporal Extrapolation of Satellite Products in Mountainous Areas.Remote Sens.2022,14, 2710. [Google Scholar] [CrossRef]
    7. Mucomole, F.V.; Silva, C.A.S.; Magaia, L.L. Regressive and Spatio-Temporal Accessibility of Variability in Solar Energy on a Short Scale Measurement in the Southern and Mid Region of Mozambique.Energies2024,17, 2613. [Google Scholar] [CrossRef]
    8. Mucomole, F.V.; Silva, C.S.A.; Magaia, L.L. Quantifying the Variability of Solar Energy Fluctuations at High–Frequencies through Short-Scale Measurements in the East–Channel of Mozambique Conditions.Am. J. Energy Nat. Resour.2024,3, 21–40. [Google Scholar] [CrossRef]
    9. McCandless, T.; Jiménez, P.A. Examining the Potential of a Random Forest Derived Cloud Mask from GOES-R Satellites to Improve Solar Irradiance Forecasting.Energies2020,13, 1671. [Google Scholar] [CrossRef] [PubMed]
    10. Abuella, M.; Chowdhury, B. Random forest ensemble of support vector regression models for solar power forecasting. In Proceedings of the 2017 IEEE Power & Energy Society Innovative Smart Grid Technologies Conference (ISGT), Washington, DC, USA, 23–26 April 2017; pp. 1–5. [Google Scholar] [CrossRef]
    11. Inman, R.H.; Pedro, H.T.C.; Coimbra, C.F.M. Solar forecasting methods for renewable energy integration.Prog. Energy Combust. Sci.2013,39, 535–576. [Google Scholar] [CrossRef]
    12. Babar, B.; Luppino, L.T.; Boström, T.; Anfinsen, S.N. Random forest regression for improved mapping of solar irradiance at high latitudes.Sol. Energy2020,198, 81–92. [Google Scholar] [CrossRef]
    13. Saffaripour, M.H.; Mehrabian, M.A.; Bazargan, H. Predicting solar radiation fluxes for solar energy system applications.Int. J. Environ. Sci. Technol.2013,10, 761–768. [Google Scholar] [CrossRef]
    14. Abdul-Wahab, S.A.; Bakheit, C.S.; Al-Alawi, S.M. Principal component and multiple regression analysis in modelling of ground-level ozone and factors affecting its concentrations.Environ. Model. Softw.2005,20, 1263–1271. [Google Scholar] [CrossRef]
    15. Benghanem, M.; Joraid, A.A. A multiple correlation between different solar parameters in Medina, Saudi Arabia.Renew. Energy2007,32, 2424–2435. [Google Scholar] [CrossRef]
    16. Arsham, A.; Rosenberg, P.; Little, M. Effects of stopping criterion on the growth of trees in regression random forests.N. Engl. J. Stat. Data Sci.2022,1, 46–61. [Google Scholar] [CrossRef] [PubMed]
    17. Jogunuri, S.; Josh, F.T.; Stonier, A.A.; Peter, G.; Jayaraj, J.; Jaganathan, S.; Joseph, J.J.; Ganji, V. Random forest machine learning algorithm based seasonal multi-step ahead short-term solar photovoltaic power output forecasting.IET Renew. Power Gener.2024,11, 1–16. [Google Scholar] [CrossRef]
    18. Smith, P.F.; Ganesh, S.; Liu, P. A comparison of random forest regression and multiple linear regression for prediction in neuroscience.J. Neurosci. Methods2013,220, 85–91. [Google Scholar] [CrossRef]
    19. Klink, K. Trends and Interannual Variability of Wind Speed Distributions in Minnesota.J. Clim.2002,15, 3311–3317. [Google Scholar] [CrossRef]
    20. Ahmad, M.W.; Reynolds, J.; Rezgui, Y. Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression trees.J. Clean. Prod.2018,203, 810–821. [Google Scholar] [CrossRef]
    21. Cerralbo, P.; Grifoll, M.; Moré, J.; Bravo, M.; Afif, A.S.; Espino, M. Wind variability in a coastal area (Alfacs Bay, Ebro River delta).Adv. Sci. Res.2015,12, 11–21. [Google Scholar] [CrossRef]
    22. Sahin, G.; Isik, G.; van Sark, W.G. Predictive modeling of PV solar power plant efficiency considering weather conditions: A comparative analysis of artificial neural networks and multiple linear regression.Energy Rep.2023,10, 2837–2849. [Google Scholar] [CrossRef]
    23. Klein, S.A.; Cooper, P.I.; Freeman, T.L.; Beekman, D.M.; WBeckman, A.; Duffie, J.A. A method of simulation of solar processes and its application.Sol. Energy1975,17, 29–37. [Google Scholar] [CrossRef]
    24. Twidell, J.; Weir, T.Renewable Energy Resources, 3rd ed.; Routledge: London, UK, 2015. [Google Scholar] [CrossRef]
    25. Myhre, G.; Stordal, F.; Johnsrud, M.; Diner, D.J.; Geogdzhayev, I.V.; Haywood, J.M.; Holben, B.N.; Holzer-Popp, T.; Ignatov, A.; Kahn, R.A.; et al. Intercomparison of satellite retrieved aerosol optical depth over ocean during the period September 1997 to December 2000.Atmos. Chem. Phys.2005,5, 1697–1719. [Google Scholar] [CrossRef]
    26. Levy, R.C.; Munchak, L.A.; Mattoo, S.; Patadia, F.; Remer, L.A.; Holz, R.E. Towards a long-term global aerosol optical depth record: Applying a consistent aerosol retrieval algorithm to MODIS and VIIRS-observed reflectance.Atmos. Meas. Tech.2015,8, 4083–4110. [Google Scholar] [CrossRef]
    27. Penner, J.E.; Zhang, S.Y.; Chin, M.; Chuang, C.C.; Feichter, J.; Feng, Y.; Geogdzhayev, I.V.; Ginoux, P.; Herzog, M.; Higurashi, A.; et al. A Comparison of Model- and Satellite-Derived Aerosol Optical Depth and Reflectivity.J. Atmos. Sci.2002,59, 441–460. [Google Scholar] [CrossRef]
    28. Wu, Y.; de Graaf, M.; Menenti, M. The impact of aerosol vertical distribution on aerosol optical depth retrieval using CALIPSO and MODIS data: Case study over dust and smoke regions.J. Geophys. Res. Atmos.2017,122, 8801–8815. [Google Scholar] [CrossRef]
    29. Wu, J.; Luo, J.; Zhang, L.; Xia, L.; Zhao, D.; Tang, J. Improvement of aerosol optical depth retrieval using visibility data in China during the past 50 years.J. Geophys. Res. Atmos.2014,119, 13370–13387. [Google Scholar] [CrossRef]
    30. Arola, A.; Koskela, T. On the sources of bias in aerosol optical depth retrieval in the UV range.J. Geophys. Res. Atmos.2004,109, D08209. [Google Scholar] [CrossRef]
    31. Chylek, P.; Henderson, B.; Mishchenko, M. Aerosol radiative forcing and the accuracy of satellite aerosol optical depth retrieval.J. Geophys. Res. Atmos.2003,108, 4764. [Google Scholar] [CrossRef]
    32. Yan, X.; Luo, N.; Liang, C.; Zang, Z.; Zhao, W.; Shi, W. Simplified and Fast Atmospheric Radiative Transfer model for satellite-based aerosol optical depth retrieval.Atmos. Environ.2020,224, 117362. [Google Scholar] [CrossRef]
    33. Yagli, G.M.; Yang, D.; Srinivasan, D. Automatic hourly solar forecasting using machine learning models.Renew. Sustain. Energy Rev.2019,105, 487–498. [Google Scholar] [CrossRef]
    34. Kleissl, J. Current State of the Art in Solar Forecasting. 2010. Available online:https://escholarship.org/uc/item/4fx8983f (accessed on 21 September 2024).
    35. Mucomole, F.V.; Silva, C.A.S.; Magaia, L.L. A Systematic Review on the Accessibility of Spatial and Temporal Variability of Solar Energy Availability on a Short Scale Measurement.Am. J. Energy Nat. Resour.2024,3, 60–85. [Google Scholar] [CrossRef]
    36. Voyant, C.; Notton, G.; Duchaud, J.-L.; Gutiérrez, L.A.G.; Bright, J.M.; Yang, D. Benchmarks for solar radiation time series forecasting.Renew. Energy2022,191, 747–762. [Google Scholar] [CrossRef]
    37. Atique, S.; Noureen, S.; Roy, V.; Subburaj, V.; Bayne, S.; Macfie, J. Forecasting of total daily solar energy generation using ARIMA: A case study. In Proceedings of the 2019 IEEE 9th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA, 7–9 January 2019; pp. 114–119. [Google Scholar] [CrossRef]
    38. Chodakowska, E.; Nazarko, J.; Nazarko, Ł.; Rabayah, H.S.; Abendeh, R.M.; Alawneh, R. ARIMA Models in Solar Radiation Forecasting in Different Geographic Locations.Energies2023,16, 5029. [Google Scholar] [CrossRef]
    39. Stevović, I.; Mirjanić, D.; Stevović, S. Possibilities for wider investment in solar energy implementation.Energy2019,180, 495–510. [Google Scholar] [CrossRef]
    40. Aryaputera, A.W.; Yang, D.; Zhao, L.; Walsh, W.M. Very short-term irradiance forecasting at unobserved locations using spatio-temporal kriging.Sol. Energy2015,122, 1266–1278. [Google Scholar] [CrossRef]
    41. Persson, C.; Bacher, P.; Shiga, T.; Madsen, H. Multi-site solar power forecasting using gradient boosted regression trees.Sol. Energy2017,150, 423–436. [Google Scholar] [CrossRef]
    42. Anand, M.P.; Rajapakse, A.; Bagen, B. Stochastic Model for Generating Synthetic Hourly Global Horizontal Solar Radiation Data Sets Based on Auto Regression Characterization.Int. Energy J.2020,20, 181–200. [Google Scholar]
    43. Amjad, D.; Mirza, S.; Raza, D.; Sarwar, F.; Kausar, S. A Statistical Modeling for spatial-temporal variability analysis of solar energy with respect to the climate in the Punjab Region.Bahria Univ. Res. J. Earth Sci.2023,7, 10. [Google Scholar]
    Data 10 00037 g001
    Figure 1. Declination as a function of time equation.
    Figure 1. Declination as a function of time equation.
    Data 10 00037 g001
    Data 10 00037 g002
    Figure 2. Yearly comparison of different transmittances types.
    Figure 2. Yearly comparison of different transmittances types.
    Data 10 00037 g002
    Data 10 00037 g004
    Figure 4. ACF and PACF in Nanhupo-2, during the year 2021.
    Figure 4. ACF and PACF in Nanhupo-2, during the year 2021.
    Data 10 00037 g004
    Data 10 00037 g005
    Figure 5. Regression inference across stations in the Mozambique region for all sky types, for acceptable (A) and unacceptable (NA) days for (aKt coefficients and (b)Kt coefficients, in terms of the parametric forecasted solar energy assessment. The coloring with a high density ofKt coefficients indicate the regions with high solar energy fluxes and theirKt on the vertical scale in both (a,b). These regions diffuse through the interprovincial distance through solar energy flux transport to the areas at the end of the decorrelation, which is identified here as having a low density ofKt coefficients and theirKt, evaluated concerning the other regions of the region as having medium-low solar energy.
    Figure 5. Regression inference across stations in the Mozambique region for all sky types, for acceptable (A) and unacceptable (NA) days for (aKt coefficients and (b)Kt coefficients, in terms of the parametric forecasted solar energy assessment. The coloring with a high density ofKt coefficients indicate the regions with high solar energy fluxes and theirKt on the vertical scale in both (a,b). These regions diffuse through the interprovincial distance through solar energy flux transport to the areas at the end of the decorrelation, which is identified here as having a low density ofKt coefficients and theirKt, evaluated concerning the other regions of the region as having medium-low solar energy.
    Data 10 00037 g005
    Table 1. Solar radiation and atmospheric parameters.
    Table 1. Solar radiation and atmospheric parameters.
    StationGHI (W/m2)Longitude (°)Latitude (°)AOT
    (675 nm)
    AOT
    (440 nm)
    Precipitable Water (cm)Pressure (mbars)Ozone (cm)NO2 (cm)Temperature (K)
    Ocua351.1339.39−11.550.150.292.69958.612.620.0014272.99
    Chiputo399.6731.67−14.970.140.302.131016.352.630.0014299.89
    Vanduzi477.1935.04−19.730.180.343.471010.302.830.0018308.26
    Choa-1355.4933.24−17.790.140.302.131016.082.630.0014299.96
    Choa-2354.6633.24−17.790.140.302.131016.082.630.0014299.96
    Nanhupo-1352.6439.51−15.970.150.292.59958.432.610.0015272.99
    Nanhupo-2349.6539.51−15.970.150.292.59958.442.610.0015272.99
    Massangulo-1345.9135.44−13.910.150.302.59958.422.610.0015272.99
    Massangulo-2430.0935.44−13.910.150.302.59958.422.610.0015272.99
    Lugela367.9936.71−16.470.140.292.60958.352.610.0014272.99
    Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

    © 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

    Share and Cite

    MDPI and ACS Style

    Mucomole, F.V.; Silva, C.A.S.; Magaia, L.L. Experimental Parametric Forecast of Solar Energy over Time: Sample Data Descriptor.Data2025,10, 37. https://doi.org/10.3390/data10030037

    AMA Style

    Mucomole FV, Silva CAS, Magaia LL. Experimental Parametric Forecast of Solar Energy over Time: Sample Data Descriptor.Data. 2025; 10(3):37. https://doi.org/10.3390/data10030037

    Chicago/Turabian Style

    Mucomole, Fernando Venâncio, Carlos Augusto Santos Silva, and Lourenço Lázaro Magaia. 2025. "Experimental Parametric Forecast of Solar Energy over Time: Sample Data Descriptor"Data 10, no. 3: 37. https://doi.org/10.3390/data10030037

    APA Style

    Mucomole, F. V., Silva, C. A. S., & Magaia, L. L. (2025). Experimental Parametric Forecast of Solar Energy over Time: Sample Data Descriptor.Data,10(3), 37. https://doi.org/10.3390/data10030037

    Article Metrics

    No
    No

    Article Access Statistics

    For more information on the journal statistics, clickhere.
    Multiple requests from the same IP address are counted as one view.
    Data, EISSN 2306-5729, Published by MDPI
    RSSContent Alert

    Further Information

    Article Processing Charges Pay an Invoice Open Access Policy Contact MDPI Jobs at MDPI

    Guidelines

    For Authors For Reviewers For Editors For Librarians For Publishers For Societies For Conference Organizers

    MDPI Initiatives

    Sciforum MDPI Books Preprints.org Scilit SciProfiles Encyclopedia JAMS Proceedings Series

    Follow MDPI

    LinkedIn Facebook Twitter
    MDPI

    Subscribe to receive issue release notifications and newsletters from MDPI journals

    © 1996-2025 MDPI (Basel, Switzerland) unless otherwise stated
    Terms and Conditions Privacy Policy
    We use cookies on our website to ensure you get the best experience.
    Read more about our cookieshere.
    Accept
    Back to TopTop
    [8]ページ先頭

    ©2009-2025 Movatter.jp