Introduction
The performance of all solar-energy systems is dependent upon solar radiation, ambient temperature, humidity, and wind speed. These variables are neither completely random nor deterministic and can best be described as random functions of time. The analysis of solar-energy systems and simulation methods in energy efficiency is inconvenienced by the random behavior of the weather. Information concerning hourly and daily air temperature values is required for most practical applications of solar energy in active and passive systems. Photovoltaic and thermal solar systems, building design, and thermal simulation performance analysis all require air temperature values. However, in different geographical areas these data are not available and must be estimated through models that use daily maximum, daily minimum, or monthly average air temperature values obtained from published data.
Temperature is one of the main meteorological variables measured by meteorological service networks. Nevertheless, daily and hourly time series, required for the most sophisticated studies and simulations, are expensive. Moreover, they often contain missing data, correspond to short recording periods, and, worst of all, are available for only very few sites world wide. In the European Union (EU), this situation is worse for the north Mediterranean belt area, where meteorological networks have low-timescale recording stations that are often hundreds of kilometers apart.
There are several existing models to predict daily mean, daily maximum, and daily minimum air temperature values, and some of them are based on data from stations in North America, Italy, Germany, or Spain. The following models are among the most prominent. Cuomo et al. (1986) studied and analyzed air temperature on a daily basis in the Italian climate. Amato et al. (1989) discussed stochastic–dynamic models for both air temperature and solar irradiance daily time series in the Italian climate. Hernández et al. (1991) developed stochastic models for the prediction of daily minimum air temperatures. Macchiato et al. (1993) analyzed cold and hot air temperatures observed at 50 stations in southern Italy. Some of the existing models have also been developed for northern latitudes with high albedos and cold air masses. For instance, Heinemann et al. (1996) developed an algorithm for the synthesis of hourly ambient temperature time series that takes into account a monthly average daily temperature pattern. The above-mentioned works developed models for predicting air temperature values; however, it is hard to find studies that compare the performance of different temperature models in order to obtain the best models for different places. In the framework of the “JOULE III” project on Climatic Synthetic Time Series for the Mediterranean Belt (CLIMED), a Mediterranean dataset was assembled by the participating institutions with the purpose of simulating meteorological variables.
The aim of this paper is to survey a number of air temperature models and to validate a selected subset based on the Mediterranean dataset in order to select the one(s) most suited to predict hourly and daily air temperature values necessary for photovoltaic, thermal, and building-energy analysis. Model selection was made on the basis of model equation availability, necessary input variables, and ability to generate data from limited average values.
The models selected may be divided into the following groups:
hourly models developed from daily minimum and maximum air temperature values,
stochastic models that link hourly air temperature to monthly average air temperature values, and
daily global models that link daily mean temperature and daily minimum and maximum values.
After selection, the models were tested using datasets from 34 measuring stations in France, Greece, Italy, Portugal, and Spain. The datasets consist of hourly and daily mean air temperature and daily maximum and minimum measured temperature data. After the analysis of the existing models, a new daily stochastic model based on the European Mediterranean dataset was proposed. For testing the models, the measured and generated data were compared in some north Mediterranean locations. The performance of the selected models was assessed using statistical characteristics of measured and modeled data of stations considered to be typical of various climatic divisions of the north Mediterranean belt area. The most appropriate model for the different climatic zones was subsequently proposed.
The theoretical base of the selected models, the new proposed model, the performance of different models, and the recommended models for use in this area are given in the following sections.
Data collection
A large volume of data from various sites in the south of France, Greece, Italy, Portugal, and Spain was collected. Not all operating stations were used in this study because of redundant and poor data quality for some stations (Kambezidis and Adamopoulos 1997). The main criterion for selection of the stations was the completeness of the database and the period covered. Another issue in the selection procedure was to cover the various climatic zones encountered in each country as fully as possible.
After the selection process, 34 measuring stations with multiyear records (ranging from 4 to 15 yr) of hourly air temperature, located in the five countries mentioned above, were retained for the work. Figure 1 shows the stations' situation in each country and reflects the climatological characteristics, following Kambezidis and Adamopoulos (1997). It can be said that the Portugal stations, numbers 2 and 3, belong to the Atlantic maritime zone with full influence from weather coming from the open ocean, and station 1 belongs to the Atlantic semimaritime zone, combining features of Atlantic and continental climates. In Spain, station 3 belongs to the mountainous Pyrenean zone and the rest of the stations belong to continental climate with cold winters and sunny summers with high temperatures. Because of its latitude, station 5 (Seville) has a milder winter. Most stations in France belong to a transition zone from oceanic to Mediterranean, but Pau (French station 3) has an oceanic climate with influence from the Atlantic Ocean. The stations selected in Greece are situated in the Mediterranean maritime (milder winters and summers) and Mediterranean terrestrial zone. The Italian stations are situated in the mountainous (Apennines) zone. Stations 1 (Campochiaro) and 8 (Santa Fista) have more frequent rainfall throughout the year. Stations 2, 10, 11, and 9 are situated in the Mediterranean maritime zone, and stations 3, 7, and 12 are situated in the continental zone (with cold winters and warm summers).
Table 1 shows the available number of data and the geographical characteristics of the stations, such as latitude, longitude, and altitude above sea level. Latitude ranges between 35.34°N for Linoperamata (Greece) and 45.50°N for Mirano (Italy). Altitude ranges between 5 m for Athens (Greece) and 1326 m for Vinuesa (Spain). This factor greatly influences the performance of air temperature models.
An initial analysis of the data characteristics was made. From the data series, the daily mean air temperature values were standardized using Eq. (15) from section 3d, and the lag autocorrelation coefficients for lags between 1 and 7 were calculated. The results showed that the lag-1 autocorrelation coefficient for all stations differed substantially from 0 but that subsequent autocorrelation coefficients were all close to 0. Table 2 shows the station name and the lag-1 autocorrelation coefficient ρ obtained from the measured data. It can be seen that the obtained results are between 0.60 for Kefallonia (Greece) and 0.82 for Athens (Greece). The mountainous stations can be seen to obtain a smaller lag-1 autocorrelation coefficient value because of the high difference between air temperature values. Average values are between 0.72 and 0.75. Similar results have been obtained by Cuomo et al. (1986) and Klein (1987). The results indicate that air temperature changes slowly from day to day in the area.
Existing air temperature models
A set of models was selected as the most promising for further study. The criteria used for selecting models were (i) full availability of algorithms and numerical coefficients, (ii) use of input data that are either generally available or obtainable from available model cascades (from monthly to daily, from daily to hourly) (iii) that models are based on data for one or more Mediterranean sites, and (iv) the quality of the results reported by the original authors as well as those published in reviews. The models selected for the current work may be divided into four groups, as described below.
Models that link hourly air temperature T(y, m, d, t) and the daily maximum and minimum air temperature values, Tmin(y, m, d) and Tmax(y, m, d)
In this kind of model, the hourly air temperature values are obtained from the daily maximum and minimum values. The following models were selected.
Double cosine model (1995)
Erbs's model (1984)
Stochastic models that link hourly and monthly average temperature values
Hollands et al. (1989) studied the effect of neglecting the random component in hourly temperature data for various solar heat systems. The results indicate that, for some systems, the extra complexities of including the random component of the hourly ambient temperature are unwarranted. Boland (1997) showed that the stochastic air temperature component is critical for evaluating heating and cooling loads for passive solar applications. As a result, there are systems for which its inclusion is important, and for this reason the following model is studied.
Knight's model (1991)
From this method, the transformation that relates hourly temperature values to the normalized value and the corresponding χ values is obtained.
From the long-term series of hourly temperature data, the standardized values were calculated, and from Eq. (9) the corresponding χ values and the lag-1 autocorrelation coefficient were evaluated. Each hour, a new χ value is generated according to a first-order autoregressive model, following Eq. (8).
Models that link daily mean air temperature T(y, m, d) and daily maximum and minimum air temperature values, Tmin(y, m, d) and Tmax(y, m, d)
In this kind of model the daily mean air temperature is calculated from the daily maximum and minimum air temperature values.
Standard model
This approximation would be exact if the daily mean air temperature profile were smooth and symmetrical, which is not true. For instance, the increase in temperature in the morning is steeper than its decrease in the afternoon and at night (Aguiar 1997).
New proposed model
Within each group of existing models there is considerable disagreement between results, depending upon which air temperature model is used, as can be seen in Bilbao et al. (1997). The differences may be the result of various methods of calculating the data, location dependence of the data, or insufficient data. Models are also sometimes based on data from only one location, often at higher latitudes than those of the majority of the Mediterranean stations. For these reasons a new daily model, the CLIMED Temperature Model (CLIMEDTEM), was developed using the combined data of Table 1 except those for Athens (Greece), Porto (Portugal), and Seville (Spain). These data were used to test the new model. The residual daily series histograms were analyzed in this study, and from the results it could be seen that the histograms are closely fitted by a Gaussian function distribution.
Performance results and discussion
Models that link hourly air temperature T(y, m, d, t) with daily maximum and minimum air temperature and daily monthly mean air temperature values: The Knight, double cosine, and Erbs models
In this group, models that link hourly air temperature values with the daily maximum and minimum air temperature and monthly mean daily air temperature were tested. Table 3 shows the name of the model and the statistical estimators (absolute and relative rmse values) obtained for all selected stations.
Comparing the rmse absolute results, more scatter is observed in Knight's model (which can vary from 4.14° to 6.55°C depending on the station) than in the case of Erbs's model (which varies from 1.08° to 2.57°C). In comparing the rmse relative values, it can be said that Knight's model shows the highest values, ranging between 21.90% and 69.76%, and Erbs's model obtains the lowest values, between 6.12% and 23.87%. The lowest rmse is obtained for stations in both the Mediterranean and Atlantic maritime zones, for instance, Athens at 6.12% and Lisbon at 7.77%.
The highest rmse values for the Erbs model were obtained for higher-level stations belonging to mountain climatic zones, for instance, Jaca (14.63%) and Vinuesa (23.87%) in Spain and Campochiaro (22.29%) and Santa Fista (16.61%) in Italy. In Greece, the stations obtaining greatest rmse value were Megalopoli (12.44%) and Ptolemaida (10.64%), both of which belong to the terrestrial Mediterranean climatic zone. Millau, France, located in the Mediterranean transitional zone and the station with the highest latitude, obtained the highest rmse value. Perpignan obtained the best result in relation to Erbs's model, being as it is the nearest station to the sea and is located at a lower altitude than the other French stations.
In conclusion, it may be seen that Erbs's model obtained the best results. This is because it gives the lowest error values at all stations and better predicts the air temperature values at the stations that belong to maritime climatic zones, except in Italy where the model obtained the best results for stations belonging to the continental zone, as can be seen, for instance, in Carpeneto (9.31%) and Montanaso-Lombardo (9.20%).
Comparison of scatterplots with hourly estimated versus measured air temperatures are made at each station, and, in the interest of clarity, selected plots were chosen for discussion of the results. Plot selection was based on the minimum relative rmse value to pick the best model and on the maximum relative rmse value to pick the worst. The diagonal line represents the ideal match between the estimated and measured values. Figures 2a and 2b show examples of the estimated versus measured hourly air temperature values using Erbs model results. Each figure consists of two scatter graphs; Fig. 2a shows the best performing model and Fig. 2b the worst. In comparing the scatter graphs of the double cosine, Knight, and Erbs models, it appears that the Erbs model introduces better predictions than the other models because the parameter model is more dependent on the location.
Figures 3a and 3b show the comparison of cumulative frequency distribution curves, where long-term distribution was obtained with measured data. In comparing these results, it can be said that the Erbs, double cosine, and Knight models for Athens (Greece) and Valladolid (Spain), respectively, show a similar behavior. In both figures, the results from Knight's model differ from the long-term distribution for high and low temperature values. The Erbs model distribution is close to the long-term results, indicating good results.
From these results it can be said that Erbs's model gives the best overall results in predicting the hourly temperature, and the double cosine model performed better than Knight's model.
Daily air temperature models: Standard and CLIMEDTEM models
In this group, the standard and CLIMEDTEM models were tested. The standard model links daily mean air temperature values with the corresponding daily maximum and minimum air temperature values. The CLIMEDTEM model is a newly proposed stochastic model that links daily mean air temperature with monthly average temperature values. The difference between these models is the input data; the standard model needs maximum and minimum data, and CLIMEDTEM only needs monthly average temperature values.
Table 4 shows the statistical estimators of the standard and CLIMEDTEM models. In comparing the rmse relative values, it can be observed that the standard model obtains smaller relative error values. Aliveri and Sageika (Greece) obtain the best results and Vinuesa (Spain) the worst. The rmse absolute values vary between 0.49° and 1.31°C, depending on the station. Figures 4a and 4b show the estimated versus the measured values for the best and the worst stations, respectively. Figure 4a shows that, for the standard model, the scatter is small for Aliveri (Greece) in comparison with Fig. 4b, for Vinuesa (Spain), which has the worst result. It can also be observed that the simulated values are underestimated in Vinuesa (Spain).
Figure 5 shows the performance of the standard model and the comparison between estimated and measured values for Athens (Greece). It can be seen that estimated values are similar to measured ones and the model slightly underestimates the results.
The CLIMEDTEM model (Table 4) gives the lowest rmse values for the following stations: Linoperamata, Preveza, and Sageika in Greece; Sibari in Italy; and Seville in Spain. All these stations have a relatively low latitude and low altitude. In comparing the rmse absolute values, more scatter is observed from the CLIMEDTEM model (which varies from 4.65° to 1.17°C depending on the station) than on the standard model. Figures 6a and 6b show the estimated versus the measured values for the CLIMEDTEM model in Aliveri (Greece) and Vinuesa (Spain), respectively. The model overestimates the air temperature for Vinuesa.
Table 2 shows the lag-1 autocorrelation coefficient values for measured and CLIMEDTEM model estimated values ρest. The results are similar and agree with those obtained by Klein (1987), and from these results it can be said that the model performs well for the temperature values in the Mediterranean area. In comparing the standard and CLIMEDTEM models, it can be said that the standard model gives the best results because it is based on local temperature values of daily maximum and minimum temperature data, but, because the standard model needs long data series, it could be inconvenient to use in the Mediterranean area. The CLIMEDTEM model is an autoregressive model that only needs temperature data and variables on a big timescale that can be evaluated from a few climatological data values, as has been shown in previous sections. From the study and taking into account the results, it can be said that the CLIMEDTEM model gives good results for stations that have a relatively low altitude and latitude; CLIMEDTEM may be useful when monthly average temperature values are available, and the advantages are that it requires fewer input data than the other daily temperature model shown in the work.
In comparing Knight and CLIMEDTEM model results based on monthly temperature values, it can be said that these models may be useful when monthly average temperature data are available, which are values that can be obtained easily from different publications and meteorological atlases. In conclusion, the CLIMEDTEM model may be used in the Mediterranean belt area where monthly temperature values are available—for example, from temperature isoline maps—and the best results could be obtained in places with low altitude and latitude.
Conclusions
A dataset was assembled and used that is thought to be among the best available at this time in the north Mediterranean belt area. The data analysis performed has shown that the lag-1 autocorrelation coefficient is independent over time and its variations with location are negligible, at least for maritime stations. The different established models that calculate hourly and daily air temperature have been selected, run, and tested to decide which model is recommended, and a new model has also been proposed for the north Mediterranean area.
The models studied were classified into two groups. In the first group, selected models that calculate hourly air temperature from daily maximum and minimum air temperature values were tested. In the second group, one daily air temperature model was run and a new model was proposed.
The statistical estimators rmse (absolute and relative values), cumulative probability distributions, and scatterplots were used to indicate how closely the models agree with the data, and the variation of the rmse values with climatic zone has been studied. Among the first group of models, Erbs's model is recommended as the best for maritime climatic zones and for mountainous climatic zones in Italy, because it gives the best test results (small rmse values, together with the best scatterplot and cumulative frequency distribution) and provides good estimation for all data.
Among the models that calculate daily air temperature, the standard model is the most recommended, because it is the best in reproducing the statistical characteristics of data in all climatic zones.
It has been observed that a model's performance also depends on station altitude and climatic zone. In most cases, all models perform well at stations near sea level, although some previous studies show that altitude can be a dominant parameter (Macchiato et al. 1995). Stochastic models have also been tested with measured data from different stations in the Mediterranean area. The stochastic CLIMEDTEM model could be used for predicting daily air temperature at the southern maritime zones of the studied region and in low-latitude and-altitude cities, for instance, Seville (Spain). The advantages of the two stochastic models, Knight and CLIMEDTEM, are the limited necessary input data series in comparison with the analyzed models.
The study gives some new evidence that, for the Mediterranean area and by means of stochastic models, only limited temperature information will be needed as input to simulate data, and thus synthesized data might be obtained in many more locations.
The results of the paper can be used in different scientific areas such as solar climatology, renewable solar energy simulation and design, energy-efficiency studies, and solar-energy engineering applications, as well as in other scientific fields for which air temperature data, in different timescales, are required as input for important system simulations.
Acknowledgments
This investigation is a part of the JOULE III Project on Climatic Synthetic Time Series for the Mediterranean Belt (Contract JOR3-CT96-0042), known as CLIMED, for which the participating institutions were The Institute of Industrial Engineering (INETI) in Lisbon, Portugal, which was the coordinator, the National Observatory of Athens (NOA), Greece; the University of Valladolid (Spain); and, as a subcontractor to INETI, Energy Consulting of Aix en Provence, France. The authors thank the CLIMED Project Coordinator Dr. R. Aguiar for the project management. The authors gratefully acknowledge the financial support extended by the EU JOULE III Programme. The authors express their thanks to the national meteorological services in the countries involved for making the necessary data for this study available. The anonymous reviewers are also gratefully acknowledged for their useful comments and suggestions in improving the paper.
REFERENCES
Aguiar, R. 1996. Séries sintèticas de parâmetros meteorològicos (Synthetic series of meterological parameters). Ph.D. thesis. Lisbon University, Lisbon, Portugal, 525 pp.
Aguiar, R. . 1997. Climatic synthetic series for the Mediterranean belt. Instituto Nacional de Engenharia E Tecnologia Industrial. Final CLIMED Project Rep., 101 pp.
Amato, U., V. Cuomo, F. Fontana, and F. C. Serio. 1989. Statistical predictability and parametric models of daily ambient temperature and solar irradiance: An analysis in the Italian climate. J. Appl. Meteor. 28:711–721.
Bilbao, J., A. de Miguel, J. A. Medina, and J. J. López. 1997. Model performance tests. CLIMED Project Rep. to European Community DGXII, Applied Physics I Dept., Valladolid University, Spain, 183 pp.
Boland, J. 1997. The importance of the stochastic component of climatic variable in simulating the thermal behavior of domestic dwellings. Sol. Energy 60:359–370.
Cuomo, V., F. Fontana, and C. Serio. 1986. Behaviour of ambient temperature on daily basis in Italian climate. Rev. Phys. Appl. 21:211–218.
de Miguel, A., J. Bilbao, R. Aguiar, H. Kambezidis, and E. Negro. 2001. Diffuse solar irradiation model evaluation in the north Mediterranean belt area. Sol. Energy 70:143–153.
Erbs, D. G. 1984. Models and applications for weather statistics related to building heating and cooling loads. Ph.D. thesis, Mechanical Engineering Dept., University of Wisconsin—Madison, 336 pp.
Erbs, D. G., S. A. Klein, and W. A. Beckman. 1983. Estimation of degree-day and ambient temperature bin data from monthly-average temperatures. ASHRAE J. 25:60–65.
Heinemann, D., C. Langer, and J. Schumacher. 1996. Synthesis of hourly ambient temperature time series correlated with solar radiation. Proc. EuroSun'96 Conf., Freiburg, Germany, ISES-Europe, 1518–1523. [Available from Energy Department, Oldenburg University, D-26111 Oldenburg, Germany.].
Hernández, E., R. García, and M. T. Teso. 1991. Minimum temperature forecasting by stochastic techniques: An evidence of the heat island effect. Mausam 41:161–166.
Hollands, G. T., L. T. D'Andrea, and I. D. Morrison. 1989. Effect of random fluctuations in ambient air temperature on solar system performance. Sol. Energy 42:335–338.
Kambezidis, H. D. and A. D. Adamopoulos. 1997. Final data set. Third Progress Rep. of the CLIMED Project to the European Community. JOULE DG XII Programme, 21 pp. [Available from Institute of Environmental Research and Sustainable Development, National Observatory of Athens, Athens, Greece.].
Kambezidis, H. D., B. E. Psiloglou, and C. Gueymard. 1994. Measurements and models for total solar irradiance on inclined surface in Athens, Greece. Sol. Energy 53:177–185.
Klein, S. A. 1987. Scientific vs. correlation methods. Proc. ISES Solar World Congress 1987, Hamburg, Germany, ISES, 3109–3114.
Knight, K. M., S. A. Klein, and J. A. Duffie. 1991. A methodology for the synthesis of hourly weather data. Sol. Energy 46:109–120.
Macchiato, M., C. Serio, V. Lapenna, and L. La Rotonda. 1993. Parametric time series analysis of cold and hot spells in daily temperature: An application in southern Italy. J. Appl. Meteor. 32:1270–1281.
Macchiato, M., L. La Rotonda, V. Lapenna, and M. Ragosta. 1995. Time modelling and spatial clustering of daily ambient temperature: An application in southern Italy. Environmetrics 6:31–53.
APPENDIX
Nomenclature
Latin symbols
AT(y, m, d) daily thermal amplitude (°C)
AT(y, m) monthly mean thermal amplitude (°C)
F[h(m, d, t)] cumulative distribution of hourly air temperature
h(m, d, t) normalized hourly air temperature
Nm No. of hours in the month
Y(y, m, d) independent normal variable with 0 mean and (1 − ρ2)1/2 std dev
t hour of the day
T(y, m, d, t) hourly air temperature (°C)
T(y, m, d) daily mean air temperature (°C)
Tmax(y, m, d) daily max air temperature (°C)
Tmin(y, m, d) daily min air temperature (°C)
T(y, m, t) hourly monthly mean air temperature (°C)
T(y, m) daily monthly mean air temperature (°C)
T(m) monthly mean air temperature for a representative year (°C)
T(m, d, t) hourly air temperature of a representative year (°C)
Tyr yearly mean air temperature over the whole period (°C)
Ti,est estimated (daily or hourly) mean air temperature values (°C)
Ti,me measured (daily or hourly) mean air temperature values (°C)
X(y, m, d) standardized daily mean air temperature (°C)
y No. of the year
m No. of the month
d No. of the day
Greek symbols
ε normally distributed random variable with 0 mean and variance of (1 − ϕ1)2
ϕ1 lag-1 autocorrelation coefficient in stochastic model
χ normally distributed stochastic variable with 0 mean and variance of 1
ρ lag-1 autocorrelation coefficient of standardized daily mean air temperature values
μ(m, d) long-term daily mean air temperature value over the corresponding set of values for each allowable year (°C)
σ(m, d) daily temperature std dev with regard to long-term daily mean air temperature (°C)
σ(m) std dev of a month's daily mean air temperature with regard to the long-term mean value for that mouth (°C)
σyr std dev of the T(m) values with regard to the yearly mean daily temperature (°C)
Number of data and geographical parameters used at the meteorological stations in the north Mediterranean belt area (height is meters above sea level)
Lag-1 autocorrelation coefficient of the standardized measured and estimated daily air temperature values using CLIMEDTEM model
Rmse statistical estimator values for air temperature computed from the Knight, Erbs, and double cosine models
Rmse statistical estimator values for CLIMEDTEM and standard air temperature models