|
Predictive Modelling Methodology (continued)
by
Introduction METHODOLOGICAL ISSUESAlso, while the choice of variables has been limited and the source for these variables even more so, there seems to be an absence of culturally relevant variables. Very few, if any, predictive models incorporate variables derived from Native land use studies, ethnographic data or local informant interviews. While the utility of incorporating these kinds of data has yet to be demonstrated, logically, there is considerable value to incorporating them into predictive models of prehistoric activity locations (Dalla Bona and Larcombe 1992). A second concern relates to the quality of some primary data sources. For example, Altschul states that the primary source for his three variables is a USGS digital terrain model. Kvamme (1990:114) evaluated USGS digital terrain data and concluded: "although there is a general correspondence, in the (purchased USGS data) (1) many small ridges and hills are absent, (2) minor drainages are missing, (3) large features are greatly smoothed, and (4) there is a major error in the form of a 120 ft. high cliff face which would surely make a spectacular waterfall on the Colorado River (which flows down the central valley) if it really existed!" Archaeologists generally consider terraces, hills and small creeks to be extremely important for regional settlement analysis. It is difficult to perform an analytical study using criteria such as terraces and small hills, when the map from which these criteria are drawn does not represent them accurately. In addition to digital data of questionable quality is the issue of appropriate scale for modelling. For example, let us consider a predictive model developed for a large region at an effective scale of 1:50,000. However, much of the data may be derived from maps published at a scale of 1:125,000 and 1:250,000. The high level of data generalization on these maps relative to the 1:50,000 map forces a significant degradation of the quality of the final model. Additional potential difficulties can develop if some data variables are derived from large scale sources such as 1:15,000 aerial photographs. In this circumstance it may be necessary to reduce the precision and detail of variables derived from large scale sources (air photos) to match the scale of the 1:50,000 predictive model. The implications of variable scales of primary data are addressed more fully in a later chapter. Thirdly, those developing predictive models tend to present their results as statements of high/medium/low potential areas, or areas of favourability/non-favourability. However, the means by which these terms were defined is seldom clearly expressed. The reader is rarely informed of the means by which determination of categories of potential is made. The cutoff points between high and medium, and medium and low potential is rarely if ever discussed. Clearly, this is an issue that is of importance to cultural resource managers and archaeological researchers alike. The modelling approach developed in this research does not categorize potential, rather it presents a scale of potential where zones of high/medium/low can be determined more clearly and the rationale for that determination is openly presented for further discussion.
ADVANTAGES OF PREDICTIVE MODELLING
ON DEVELOPING A PREDICTIVE MODELAs stated in Kohler's definition (1988:33), a predictive model is comprised of a set of testable hypotheses. To arrive at testable hypotheses, a model must be explicit in the variables that are used, and the manner in which those variables are manipulated. This includes clearly identifying and outlining the variables included in the model, the manner in which these variables interact, and any weighting placed upon the variables. Ideally, a flowchart-like diagram outlining the various processes involved in developing the model should be available. Such a diagram would graphically illustrate what variables are used, and how they interact to produce the final result. One of the major stumbling blocks and criticisms of all predictive models is the subjective input of the researcher's own knowledge. All archaeologists acknowledge that this information is important and should not be ignored. However, to be really useful, it should be made clear what knowledge is being applied to the development of the model as well as how it is being used. The methodology employed here makes that explicit - indeed the researcher is forced to be explicit. There are a number of assumptions that one works under when developing a predictive model. The first involves the assumption that choices of activity locations made by prehistoric people were influenced by elements of the natural and physical environment. The researcher also assumes these environmental variables have survived, and can be represented by presently available data. These data may be in the form of maps, monographs or may still remain to be collected in the field. The third assumption asserts that correlations between archaeological sites and the natural/physical environment observed by modern researchers reflect land use choices made by prehistoric decision makers. That is, the correlation is assumed to be not due to chance, or reflecting the affect of another, presently undocumented, independent variable. These assumptions may be strengthened or confirmed by repeated testing or application of a model, but the true nature of prehistoric human action can never be fully known. As predictive models attempt to codify aspects of human behaviour, one cannot expect a model to be simplistic in its makeup, or to be developed in a single effort. The development period of a predictive model is not finite. Altschul calls this a "...dynamic modelling approach. Once anomalies...are identified, they become the subject of additional research. As patterns are found, many anomalies become predictable. Those sites whose locations remain anomalous grow in importance" (1990:228). Modelling should be seen and conducted as a dynamic process whereby data collected from any source, at any time, can be incorporated into the modelling process to increase its integrity, accuracy and scope. As such, predictive modelling may be seen as involving three stages: (1) primary stage predictive modelling involving data collection and organization; (2) secondary stage predictive modelling in which an initial model is developed and tested, and; (3) tertiary stage predictive modelling in which the model is subjected to an infinite number of applications and refinements. This process is summarized in Table 2.
Primary Stage Modelling: Organization and Data Collection"To have confidence in any models which emerge, we need to know why the behavior we predict patterns as it does" (Tainter 1983:7). It is important to note that the researcher must start somewhere and existing data and successful examples of other predictive models offer an acceptable base, subject to a careful evaluation of their relevance and completeness. The primary stage may be understood as the organizational stage of the modelling process. The researcher must make numerous decisions including:
a) the scale at which modelling will take place; Many issues may be predetermined and a function of the project proposal or terms of reference. This is particularly the case with predictive models developed for cultural resource management purposes. It is during the primary stage that an archaeological field survey may be conducted. Usually this involves inductive data collection from portions of the "research universe" that are unrepresented in the existing heritage resource inventory. While it may be that some archaeological information already exists in the form of a site database, it may be subject to a number of biases beyond the control of the researcher. Thus, the collection of new baseline archaeological data provides the researcher with a more complete and representative database with which to build the model. The field program should include as complete an areal survey as possible. The size of the survey area need not be exceedingly large but should represent the study area as a whole. It is also important that a range of environmental characteristics, that are deemed to be the independent variables, be known and mapped within the survey area. The intention of the survey is to understand the distribution, frequency, and component parts of all the sites in the survey area. With the completion of the initial round of data collection and archaeological reconnaissance, primary stage predictive modelling is complete.
Secondary Stage Modelling: Initial Model Development and TestingThe researcher may now develop an initial predictive model and test it using the area surveyed in the primary stage. While it may appear that this step is a 'self-fulfilling prophecy', one must be reminded that a variety of data were used to develop the initial predictive model - not solely the data derived from the primary stage survey. Based upon the hypotheses generated earlier, variables can be introduced or removed from the process, or the weighting of the variables can be adjusted until the model is able to predict the highest percentage of sites possible. A second field survey program in an area near the first is necessary to collect more baseline data and/or test the model. It is recognized however, that this may not always be feasible because of external limitations such as time and money. Once again, the strength of correlation between known site locations and the identified independent variables should be measured. This information should be incorporated into a new, 'second generation' predictive model. This model would then be applied to both the primary and secondary stage survey areas. The variables would be modified in such a way as to produce a model predicting the highest percentages of known archaeological sites. Once this has been achieved, tertiary stage modelling may begin.
Tertiary Stage Modelling: Application and RefinementFew, if any, models have achieved tertiary stage development because of the nature of the agencies employing them. Most cultural resource management agencies are limited by their role as resource managers and have neither the time nor resources for ongoing research and development. They are interested in identifying the location of resources in order to facilitate informed planning and management. Given the urgency of these goals, it is often an irresistible temptation to implement partially tested models as a part of routine resource management and planning. As a result, predictive models developed for such agencies often resemble a procedural 'cook book'. The variables identified in the prototype model become tranformed from untested indicators of site distribution into routine 'red flags' of site location. As the predictive model is being routinely used, it gains unwarranted credence and uncritical acceptance. Users may reason that if certain steps are followed, a scientifically valid result will follow. The three stage modelling process outlined here reduces the likelihood that such a 'cook book' approach will result. An additional point may be raised concerning the prospect that a predictive modelling approach will supplant conventional archaeological field work. There should never be a point where predictive models take the place of field work. In the context of an academic modelling exercise, the negative implications of a poor model are relatively minor. However, in the context of cultural resource management, the implications of applying a poorly tested model can be severe, and perhaps even disastrous. At the same time, it is not realistic to expect resource management agencies to do nothing until all possible sites have been field inspected. Clearly a reasonable compromise is possible and best serves all agencies involved. "I have no objection to the use of multivariate locational models for research and planning purposes, but they simply cannot provide sufficient evidence to warrant the granting of archaeological clearance without the benefit of field survey. Any such reliance on predictive models to 'write off' areas of low projected site density constitutes both an abuse of statistical methods and an abrogation of É management responsibilities" (Berry 1984:845-6). Once again, the development of predictive models is a dynamic process where models are rigorously tested over many years and in many different areas. The results of each year's testing must also be incorporated into the existing model. Information gained in future years of application are also incorporated into the model development process. Ideally, this process should never stop.
SUMMARY |