A comparison of Fourier methods for the description of wing shape in mosquitoes (Diptera: Culicidae)

A Comparison of Fourier Methods for the Description of Wing Shape in Mosquitoes (Diptera: Culicidae) ® F. James Rohlf; James W. Archie Systematic Zoology, Vol. 33, No. 3 (Sep., 1984), 302-317. Stable URL: http://links.jstor.org/sici?sici=0039-7989%28198409%2933%3A3%3C302%3AACOFMF%3E2.0.CO%3B2-T Systematic Zoology is currently published by Society of Systematic Biologists. Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/journals/ssbiol.html. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. JSTOR is an independent not-for-profit organization dedicated to creating and preserving a digital archive of scholarly journals. For more information regarding JSTOR, please contact support@jstor.org. http://www.j stor.org/ FriJul 30 16:15:31 2004 Syst. ZooL, 33(3):302-317/ 1984 A COMPARISON OF FOURIER METHODS FOR THE DESCRIPTION OF WING SHAPE IN MOSQUITOES (DIPTERA: CULICIDAE) F. James Rohlf1 and James W. Archie2 department of Ecology and Evolution, State University of New York, Stony Brook, New York 11794; and ^Department of Zoology, University of Hawaii, Honolulu, Hawaii 96822 Abstract.—Outlines of wings from 127 species of North American mosquitoes were digitized. Comparisons were made among several different methods of reducing the information in the resulting coordinates to a series of descriptors that could be used in multivariate analysis. Methods included Fourier analysis of both radii and tangent angle change functions. In addi- tion, the relatively new method of elliptic Fourier analysis was tried. Cluster and ordination analyses based on the various sets of descriptors summarized well the pattern of similarities and differences in wing shapes, but clusters of similar wings do not agree well with traditional taxonomic groupings. The use of elliptic Fourier descriptors appears to be especially promising for future work. [Fourier analysis; morphometries; mosquitoes; image analysis; feature extrac- tion.] The present study is concerned with methods for quantitative analysis of simi- larities and differences in wing shape among 127 species of mosquitoes from America north of Mexico (see Fig. 1 for examples of the shapes of wings found among these species). There are many methods for quantifying outline shapes (a feature extraction problem in image anal- ysis; Rohlf and Ferson, 1983) so that their variation can be analyzed using tech- niques of multivariate analysis. The con- ventional morphometric approach would be to define a series of characters—such as length to width ratios, and more complex ad hoc indices—to describe the ways in which wings differ in shape. This is less likely to be successful in cases like the present one where the diversity of shapes is not large and, hence, the morphometric trends are apt to be rather subtle. We are interested here in methods that allow one to capture the entire shape outline in a systematic manner and with a desired de- gree of precision. We have investigated the relative usefulness of five different feature extraction methods based on Fourier anal- yses of various functions of the coordi- nates of points around the outline of each wing. Kaesler and Waters' (1972) analysis was one of the first applications of Fourier de- scriptors to study morphological shapes in systematics. Since that time there have been many such applications (some of them are listed in Rohlf and Ferson, 1983). The present study represents an attempt to define and evaluate new types of taxo- nomic characters that can be recorded au- tomatically using optical scanners on dig- ital computers and, thus, make larger quantitative studies more feasible. Meth- ods based on Fourier coefficients are not the only ones that can be used. Another approach is to use moment invariants (Hu, 1962). In this method an image is treated as if it were a bivariate density function (image darkness treated as proportional to probability density) and then the x and y moments are computed. Functions of these moments are computed that are invariant to differences in image location, size, ro- tation, reflection, and contrast. One limi- tation of this approach is that one cannot easily reconstruct an image from the de- scriptors. Empirical evaluations of their usefulness in biological image analysis will be given by Ferson et al. (in prep.) and Rohlf and Ferson (in prep.). Rohlf and So- kal (1967) described another method in which the presence or absence of a visible feature at a series of randomly or system- 1984 FOURIER METHODS AND WING SHAPES 303 atically selected locations were used as characters. This method requires careful alignment of images and does not permit an accurate reconstruction of the image. While this method seems to "work," it is not very satisfying intellectually. There have been few numerical taxo- nomic studies of mosquitoes. Rohlf (1963) studied 48 species of Aedes based upon characters from both the adults and larvae, while Steward (1968) reported on a similar analysis using 82 characters for 42 Cana- dian species of this genus. Hendrickson and Sokal (1968) studied 29 species from the genera Psorophora and Aedes based on 158 adult morphological characters. Rohlf (1967) reported on a classification of 45 species in several genera using only pupal characters. The images of these same pu- pae were also studied in Rohlf and Sokal (1967). Rohlf (1977) analyzed data on 63 species of Aedes using 14 adult thoracic se- tal characters. Moss et al. (1979) studied 25 species of the genus Toxorhynchites from the Orient using 79 adult and 26 pupal characters. A follow-up study (which in- cluded a numerical cladistic analysis) by Simon et al. (1982) included 32 species of Toxorhynchites based on 100 characters from the immature stages. While general agree- ment has been found with previous clas- sifications, there have also been a number of important differences (Crovello, 1969; Nielsen, 1969). Much further work is clearly needed both at the higher and low- er taxonomic levels. MATERIALS AND METHODS Since this study is concerned with the possible usefulness of the wing outline to determine broad relationships within the family as a whole (rather than to distin- guish between similar species), the data were obtained from plates 1 to 127 in Car- penter and LaCasse (1955). These corre- spond to the 127 species of mosquitoes from America north of Mexico included in that volume. The use of these published illustrations made it possible for this study to be comprehensive without the compli- cations of mixing images from different authors or from microscope slides pre- AN3 T012 WY14 PS42 PS43 AE46 AE8 1 CX1 16 Fig. 1. Examples of mosquito wing shapes based on the RP data set. The central "dot" corresponds to the centroid of each wing and the "dot" to its left corresponds to the point at which the fifth longitu- dinal vein branches. Codes explained in Table 1. pared using different procedures. (Studies concerned with the problem of distin- guishing between pairs of similar species are underway using actual wings from many replicate specimens digitized auto- matically from TV images. These will be reported separately.) In various diagrams in this paper, the species are identified by a two-letter designation for each genus (see Table 1) followed by a number that cor- responds to the plate number in Carpenter and LaCasse (1955). The original data were obtained using a coordinate digitizer connected to an HP9830 desk-top computer that was con- trolled by a program written in BASIC. The digitizer is accurate to 0.01 inches with a 304 SYSTEMATIC ZOOLOGY VOL. 33 Table 1. List of genera with codes and species code numbers. Numbers Anopheles AN 1-11 Toxorhynchites TO 12 Wyeomyia WY 13-16 Uranotaenia UR 17-20 Culiseta CA 21-27 Orthopodomyia OR 28 Mansonia MA 29-31 Psorophora PS 32-43 Aedes AE 44-100 Culex CX 101-125 Deinocerites DE 126-127 field of 17 by 17 inches (effectively a 1,700 by 1,700 pixel grid). The outline of a right wing of each species was traced using a cursor and the x- and y-coordinates were read "continuously" by the computer (i.e., the digitizer furnished coordinates as rap- idly as the computer could request them). The wings were oriented more or less hor- izontally and the point at which the fifth longitudinal vein branches was taken as the origin (this landmark was used, since it is centrally located on most wings). These initial coordinates were converted to polar-coordinate form as the points were digitized. Angles were measured in a clockwise direction starting from the point at which the leading edge of the wing joins the thorax. In order to eliminate the effect of unequal sampling due to the variation in speed at which different parts of the wing were traced, coordinates were com- puted for the end points of 100 equally- spaced radii (trigonometric interpolation was used to give equal angles). Thus, the basic data set was obtained in radius-func- tion form (it consisted of 100 distances from the branching point of the fifth hor- izontal vein to the edge of the wing, every 3.6°) for each of 127 species of mosquitoes (see Fig. 2A). This was repeated five times for each wing (to reduce operator error in tracing) and the averaged lengths of the radii were transmitted to the campus' UNIVAC 11/82 computer and the depart- ment's MODCOMP IV/25 and IV/35 min- icomputers for further processing. These radii (see Fig. 2B) were analyzed by Fou- rier analysis. The coefficients for the first 15 harmonics were computed for each of the 127 wings and used as descriptors. Since the "zeroth" harmonic is just a func- tion of the size of the wing, all coefficients were scaled so that the zeroth harmonic was equal to 1.0. The resulting matrix of descriptors for harmonics 1 through 15 is referred to below as the "RP" (raw polar) data set. A second data set was constructed math- ematically from the first by translating the origin of the system of polar coordinates to the centroid (center of gravity) of each wing (see Fig. 2C). The location of the cen- troid was determined by a numerical in- tegration. Anstey and Delmet (1973) in- dicated that this choice of an origin was preferable to that of a morphological land- mark since its use increases the relative amount of information in the harmonics above the first. Actually, the use of the centroid simply ensures that the results are just a function of the wing shape and not of changes in the location of the landmark within a wing (this simple adjustment seems to have been overlooked by Roberts et al., 1983:379). This proved to be impor- tant since species were found that had very similar wing outlines but differed in the location of the branching point of the fifth horizontal vein. A new set of points were computed by interpolation so that the ra- dii would be equally-spaced (every 3.6° around the new origin). A matrix of Fourier coefficients for har- monics 0 through 15 was calculated from these data and is denoted below as the "CP" (centered polar) data set. One of the major advantages of these various types of Fourier coefficients as descriptors is that one can reconstruct the outlines of the wings both from the various data matrices and from cluster's centroids and points along principal component axes. This is useful as a check on the numerical results (and as an aid to interpretation). Figure 3A shows the separate contribution of har- monics 0 through 8 to the average mos- quito wing (the wing one obtains if one uses the average Fourier coefficients based 1984 FOURIER METHODS AND WING SHAPES 305 Fig. 2. Representations of the coordinate systems used for the data sets in the present study. (A) Polar coordinates relative to a landmark. (B) The radius from A shown as a function of the angle. (C) Polar coor- dinates relative to the centroid of the wing. (D) Angle of a tangent vector as a function of distance along the perimeter of the wing. The elliptic Fourier coefficients are based simply on the x- and y-coordinates of points along the wing outline and are not shown. on all 127 species), and Figure 3B provides the successive reconstructions of the wing outline based on harmonics 0 through 8 and 15. Since the zeroth harmonic is again just a function of the size of the wing, all coefficients were scaled by dividing them by the square root of the area of the wing. While it is not important when consid- ering wing outlines (which are simple convex images), the two methods de- scribed above are limited to images in which the radius functions are single-val- ued. These methods cannot be used for shapes that have many concave curves (such as maple leaves). Zahn and Roskies (1972) suggested a more general way to describe the shape of an object based on the cumulative change in angle of a vector tangent to the outline of the object as a function of distance around the periphery of the wing (see Fig. 2D). The wing is first scaled so that its perimeter is 2ir. Then the function 0(0 = d(t) - 6(0) - t (1) is computed for 100 equally-spaced values of t. In this function, t is the distance along the periphery of the wing from the start- ing point (ranges from 0 to 2w radians), 0(0) is the angle of a tangent at the starting point, and 6(t) is the angle of a tangent vector at a distance t from the starting point. Bookstein et al. (1982) referred to this as an intrinsic representation. The third, "TA" (tangent angle), data set con- sists of the coefficients for harmonics 0 through 15 based on a Fourier analysis of the function 0(f) described above. Figure 4 shows the contributions of the first eight harmonics to the description of the aver- 306 SYSTEMATIC ZOOLOGY VOL. 33 A B Fig. 3. Fourier harmonics for the average mos- quito wing based on polar coordinates relative to the centroid of the wing. (A) Contributions of the first eight harmonics. The relative sizes of the figures re- flect the magnitudes of their individual contribu- tions. (B) Reconstructions of the wing outline based on the cumulative contributions of the first eight har- monics and for the first 15 harmonics. age wing (defined as above) and its recon- struction using the sums of the contribu- tions of various harmonics. The Fourier analyses applied above can be thought of simply as multiple regres- sion analyses in which the best fitting val- ues (in a least squares sense) of the coef- ficients at and b, are found in the following trigonometric regression equation, A B 2 O CT3 3 O <=> Fig. 4. Fourier harmonics for the average mos- quito wing based on angle of a tangent vector as a function of distance along the perimeter of the wing. (A) Contributions of the first eight harmonics. Note that the contribution for the first harmonic does not result in a closed curve. (B) Reconstructions of the wing outline based on the cumulative contributions of the first eight harmonics and for the first 15 har- monics. The curves do not close due to the effects of the first harmonic. k y = 0o + 2 (0,cos[#] z=l + fe,sin[if]). (2) In this equation the angle t varies from 0 to 27r. The zeroth harmonic describes the contribution of a centered circle, the first harmonic an offset circle, the second a 1984 FOURIER METHODS AND WING SHAPES 307 "figure 8," the third a trefoil, the fourth a quatrefoil, etc. (see Fig. 3A). The contri- butions are less obvious for the TA data set (see Fig. 4A), since the function being fitted, 0(f), is not just a simple radius. The coefficients can be computed by a variety of computational algorithms. The algo- rithm given in Ralston (1965) was used for data sets RP, CP, and TA described above (this algorithm takes advantage of the fact that the data points were equally-spaced). Since there are relatively few data points, there is no great advantage in using the fast Fourier transformation type algo- rithms for any of the data sets we used. The fourth, fifth, and sixth data sets were constructed using the x- and y-coordinates (rectilinear) of each data point in the orig- inal basic data matrix. The 'TP" (elliptic Fourier) data matrix consists of coefficients of the elliptic Fourier series of the gener- alized chain-encoded outline of a figure as suggested by Kuhl and Giardina (1982). Their algorithm does not require the points to be spaced equally and can fit an arbi- trary closed contour—given enough har- monics. The coefficients of the nth har- monic in a Fourier series expansion for the x-projections of an outline are A = and 2n2ir2 ~ Atp •(cos[2rniVT] - cos[2mr£p - 1/T]) 2^ (3) 2«27T2 ~2 Atp ■(sin[2nirtp/T] - sin[2tt7rfp - 1/T]), (4) where k is the number of steps in the trace around the outline (indexed by p), and Axp is the displacement along the x-axis of the contour between steps p — 1 and p. Atp is the length of the linear segment between these steps, tp is the accumulated length of such segments, and T = tk is the total length of the contour as approximated by the trace polygon. The coefficients for the y-coordinates, C„ and D„, are found in the same way using the incremental changes in the y-direction. The resulting coeffi- cients are not dependent on the x or y dis- placement of the wing, and the original wings were aligned so as to have the same orientation and starting point for the trace of their outline. A related method called "dual-axis Fou- rier shape analysis," DAFSA, has been proposed by Moellering and Rayner (1981, 1982). As presented by them, this tech- nique requires that the outline must be a closed plane curve that is sampled at uni- form arc-length intervals. The x- and y-co- ordinates are treated as pairs of complex numbers (x + iy) expressed as a function of the arc-length distance of each point from an arbitrary reference point. This function is then fitted by Fourier analysis. Kincaid and Schneider (1983) seem to have used the same method. The "EPO" data matrix is the "EP" matrix with the terms for the zeroth harmonics deleted to remove the effects of these terms (the starting position for the digitization trace along the outline). Figure 5 shows the individual and the cumulative contri- butions of harmonics one through eight to the reconstruction of the average mosqui- to wing. The coefficients for the "EP" data set can be mathematically adjusted (i.e., normal- ized) to be invariant to size, rotation, and starting position of the outline trace. The use of the following matrix transforma- tion yields the "NEP" (normalized elliptic Fourier) data set. c„ dt _W cos \p sin \p E*lsin \p cos \p Cn D, cos n6 —sin n6\ sin n6 cos n6 r (5) where n is the harmonic order of the four coefficients {a, b, c, and d), and E* = {a*2 + c*2)05 (6) 308 SYSTEMATIC ZOOLOGY VOL. 33 is the magnitude of the semimajor axis of the best-fitting ellipse. The angle of rota- tion (between 0 and 2ir radians) of this el- lipse is i^ = arc tan(c*/fl*), (7) and the angle of rotation (from 0 to 7r ra- dians) of the starting point from the new starting point at the end of the ellipse (standardized arbitrarily to one end for a given set of images) is 0 = 0.5 arctan^xBi + QDJ/ [A,2 + Q2 - B> - Dffj. (8) The values of a* and c* are given by the matrix equation (?)-(£ £)(£::} After this transformation, three coeffi- cients are constant (ax = 1, bx = 0, cx = 0) and can be ignored. The last coefficient for the first harmonic, du gives the eccentric- ity of the ellipse. Each of the matrices of Fourier coeffi- cients were then used as input to several methods of multivariate analysis. The ef- fect of standardization of the characters (coefficients) was investigated by using both standardized and unstandardized data (the variances among species for each har- monic are shown in Fig. 6). The use of unstandardized data is of interest here since the original coordinates were scaled so that the wings all had the same area and the coefficients of the various harmonics were all in comparable units. An un- weighted pair group (UPGMA) cluster analysis based on a matrix of average taxo- nomic distances among the 127 species of mosquitoes was performed and cophenet- ic correlation coefficients were computed as measures of goodness of fit (for a gen- eral account, see Sneath and Sokal, 1973). Next a principal components analysis (PCA) was performed on the variance— covariance matrix for the characters and the 127 species were then projected onto A B Fig. 5. Elliptic Fourier harmonics for the average mosquito wing based on x- and y-coordinates of the wing. (A) Contributions of harmonics one through eight. (B) Reconstructions of the wing outline based on the cumulative contributions of the first 8 har- monics and for the first 20 harmonics. the resulting axes. The configuration of points obtained from the data sets were compared visually and by Gower's (1971) M2 criterion which measures the sum of the squared distances between corre- sponding points after one PCA matrix has been rotated to superimpose as well as possible on another matrix. Smaller M2 values correspond to better agreement be- tween two PCA solutions. RESULTS The effect of standardization was ex- amined first. The cophenetic correlation 1984 FOURIER METHODS AND WING SHAPES 309 0.0006 o. o o 04-; 0.0 002 v, B P71 FY! r 10 12 0.00 15- 0.0010 0.0005 0.0006 0.0004 0.0002 1 1 T"1 T"1 —1-1-1-1—1 2 4 6 8 10 12 14 1 Fig. 6. Plot of the variance among wings of the 127 species of mosquitoes for the harmonics used in this study. The variances for each coefficient have been summed to give a total contribution for each harmonic. Variances for the: (A) RP data set; (B) CP data set; (C) TA data set; (D) EF data set; and (E) NEF data set. coefficients from the UPGMA cluster anal- yses were similar whether or not stan- dardization of the variables was per- formed. The average correlation for unstandardized data was 0.705 versus 0.691 for standardized data. The average matrix correlation between the distance matrices and distances based upon three-dimen- sional principal component analyses (see Rohlf, 1972) was higher for unstandard- ized data than for standardized data (0.991 versus 0.909). The percentage of variance explained by the first three principal com- ponent axes was also higher for unstan- dardized data than it was for standardized data. Standardization increases the rela- 310 SYSTEMATIC ZOOLOGY VOL. 33 Table 2. Correlations among taxonomic distance matrices for data sets based upon unstandardized data. Data set3 1 RP 2 CP 3 TA 4 EF 5 EFO 2 CP 0.592 3 TA 0.180 0.310 4 EF 0.561 0.798 0.297 5 EFO 0.560 0.798 0.289 1.000 6 NEF 0.469 0.827 0.296 0.907 0.907 a Data set codes: RP, raw polar; CP, centroid translated polar; TA, tangent angle; EF, elliptic Fourier; EFO, EF with zeroth harmonic left out; and NEF, normalized elliptic Fourier. tive contribution of harmonics that do not explain much of the variance (e.g., the odd harmonics in Fig. 3). This increases the ef- fective dimensionality of the space so that the first three axes explain a smaller pro- portion of the now larger total variance. The relationships indicated by unstan- dardized data seem to correspond more to one's subjective visual impression of sim- ilarity. This can be seen by examining par- ticular pairs of species for which very dif- ferent relationships are indicated when the data are standardized. For example, based on unstandardized data from the RP ma- trix, species PS42 and AE81 are indicated to be very similar to each other but rather different from the other species. When the matrix was standardized, species PS42 is indicated to be very different from all oth- er species. If one superimposes the wing outlines for these two species one finds that there is a general overall agreement in the outlines. There are, however, some small differences (that would result in dif- ferent values for the higher order har- monics). Species AN3, AN11, and AE55 present another interesting example of the effect of standardization. Based on stan- dardized data AN 11 and AE55 are indicat- ed to be much more similar to each other than either is to AN3. Using unstandard- ized data one finds that AN3 and AE55 now are more similar than either is to AN11. These latter similarities agree closely with the visual impression one ob- tains by looking at the actual wing out- lines. The explanation for the unsatisfactory effects of standardization appear to be just that it gives equal weight to all harmonics and, thus, results in much greater empha- sis on differences among species in the higher order harmonics (which are much more sensitive to small irregularities in the outlines and to measurement errors). This is clearly seen in the results of principal components analyses where many more of the higher order harmonics had high loadings on the first few axes. Therefore, unstandardized data and variance-covari- ance matrices were used in all analyses re- ported below. The overall degree of similarity across the different data sets for the relationships among the 127 species is shown in Table 2. This table gives matrix correlations be- tween pairs of average taxonomic distance matrices from the various data sets. The matrices all are positively correlated, in- dicating some degree of general agree- ment. Data set TA has much lower corre- lations with the other data sets. Leaving out the zeroth elliptical harmonic appar- ently had little effect in this study since EF and EFO had a correlation of 1.000 (the two matrices are not, however, identical). The NEF distance matrix is highly corre- lated with those of the EF and EFO data sets. Data sets CP and EF indicate quite similar relationships among the 127 species since the correlation between their dis- tance matrices was 0.798. The similarity among these data sets is a measure of the consistency with which the original wings were manually aligned. Table 3 gives some comparisons among the results of UPGMA cluster analysis ap- plied to the various data sets. The TA data set resulted in the largest cophenetic cor- relation (due in part to the fact that one species was indicated to be isolated from the other species). Pairs of phenograms were compared by computing the CIC con- sensus index from a strict consensus tree (Rohlf, 1982). This very stringent criterion indicates that the phenograms are by no means identical for the different data sets. The largest value, 0.752, indicates, for ex- ample, that the EF and the EFO pheno- grams have 75.2% of their clusters repre- sented identically in both trees. The lowest values are for comparisons of data set TA 1984 FOURIER METHODS AND WING SHAPES 311 Table 3. Comparisons among UPGMA phenograms based on distance coefficients from unstandardized data. Coefficients are the CIC consensus coefficients based on strict consensus trees. Cophenetic correlation coefficients are also given for each data set. Larger values correspond to closer agreement. Data set _ Cophenetic Data set 1 RP 2 CP 3 TA 4 EF 5 EFO correlation 1 RP 0.636 2 CP 0.048 0.720 3 TA 0.088 0.008 0.786 4 EF 0.088 0.080 0.016 0.704 5 EFO 0.088 0.088 0.024 0.752 0.690 6 NEF 0.016 0.096 0.016 0.128 0.128 0.708 with the others. Figure 7 shows the phe- nograms from the CP and NEF data set (the others are not furnished in order to save space). While similar wings usually cluster together, the clusters so found do not cor- respond well to conventional taxonomic groupings. Table 4 summarizes the results of com- parisons of pairs of three-dimensional principal components solutions for the various data sets. The values given are Gower's (1971) M2 coefficient. Again, data sets EF and EFO agree closely and data set TA gives the least agreement with the oth- ers. The percentages of variance ex- plained, matrix correlations with the orig- inal distance matrices and between the principal component solutions are also furnished in Table 4. Figure 8 shows the results of the prin- cipal components analysis for the CP and the NEF data sets. In order to properly ap- preciate the relationship among the points, it is essential that the first and second axes are plotted to the same scale (this impor- tant point is often overlooked, even in re- cent papers). It is apparent from these fig- ures that, while there is little evidence of taxonomic structure in the data, similarly- shaped wings are plotted close together in the figures. There are also consistent changes in the shapes of the wings as one goes from left to right and from top to bottom across the figures. These trends can best be visualized by reconstructing wings corresponding to points exactly along the principal component axes. Figure 9 de- picts wings that would be plus or minus three standard deviations from the mean along axes I and II and zero along all other axes. This shows very clearly that axis I has narrower wings at the right and wider at the left. Axis II is harder to describe but it clearly contrasts wings that are wider near the base versus near the apex. This technique is potentially a very powerful tool since it allows one to visualize what an organism would look like if it were to be located in a particular region of a prin- cipal component (or even a canonical vari- ate) space, even if no observed points fell into that particular region. The ordination for the TA data set (not shown) is more similar to the results given by the other data sets than is suggested by the values given in Table 4. Most of the differences seemed to be due to the fact that species CXI 16 is indicated to be an extreme out- lier in the plot for the TA data set, but not for the other data sets. DISCUSSION Bookstein et al. (1982) pointed out a number of limitations on the biological in- terpretations one can give to the coeffi- cients of Fourier functions (but see also the rebuttal by Ehrlich et al., 1983). Book- stein et al. pointed out, for example, that the methods used for the RP and CP data sets are sensitive only to differences in shape not to differences in interpretation of homology between radii at different points along an outline (however, if one's goal is to measure shape per se, this can be considered an advantage). Another limitation is that a change in part of the shape (a 'Tocal change") may result in changes in the values of many of the coef- 1984 FOURIER METHODS AND WING SHAPES 313 Table 4. Comparisons of projections of the 127 species onto the first three principal component axes based upon unstandardized data. Gower's M2 coefficient is given for each pair of data sets compared. The percent- ages of variance explained and matrix correlations are also provided for each data set. Data set Data set 1 RP 2 CP 3 TA 4 EF 5 EFO explained correlation 1 RP 85.33 0.984 2 CP 0.692 88.25 0.990 3 TA 1.460 0.989 79.32 0.985 4 EF 0.706 0.469 0.968 92.04 0.998 5 EFO 0.706 0.469 0.968 0.000 92.15 0.996 6 NEF 0.878 0.466 0.975 0.288 0.288 88.60 0.993 ficients (not just in coefficients corre- sponding to the high frequency compo- nents as one might have hoped). This makes interpretation of the coefficients more difficult. However, these problems are not unique to Fourier methods. The coefficients of linear, quadratic, cubic, etc., polynomials fitted to some data need not be biologically interpretable if the under- lying biological process that generated the relationship under study corresponds to some other functional relationship (such as exponential or sinusoidal). This is an inherent problem in curve fitting when one fits functions that do not correspond to the actual underlying biological pro- cesses. Traditional morphometric ap- proaches have an analogous problem since the usual arbitrary suites of morphometric measurements need not correspond di- rectly to the underlying developmental, evolutionary factors either. Principal com- ponents and canonical variates analyses have the same type of problem in inter- pretation. The most important limitation of our approach is that we deal only with overall shape—not with changes in dis- tances between homologous points. Book- stein (1982) considered the latter to be of more fundamental importance in morpho- metries. Work in progress by one of us (FJR) uses information on the shape and position of each wing vein. The branching points of the veins and their points of in- tersection with the margin of the wing will provide homologous reference points. The fact that the individual Fourier coef- ficients will almost never be morphologi- cally meaningful might actually be con- sidered an advantage—since it may help an investigator to resist the temptation to create complex interpretations of the meaning of the individual coefficients (Rohlf and Ferson, 1983). We would not expect, for example, that a relatively high contribution of the second spherical har- monic in the average mosquito wing (see Fig. 3) would lead an investigator to con- clude that there was a growth field that had the form of a quatrefoil. The investi- gator is thereby forced to deal with overall shape differences by using suites of coef- ficients. After comparing several methods, the question as to which method is "best" nat- urally arises. All the methods are mathe- matically equivalent in that given enough harmonics they can encode the shape of the wing outlines exactly. For objects with more complex outlines there may not be a single "center" that enables the radius function to be single-valued. This elimi- nates the two polar methods from consid- eration as general methods (even though they may work well for many shapes such as those of insect wings). The method Fig. 7. UPGMA phenograms based on average taxonomic distance from unstandardized Fourier coeffi- cients. (A) CP data set. (B) NEF data set. 314 SYSTEMATIC ZOOLOGY VOL. 33 Fig. 8. Plots of the first two dimensions from principal components analyses of unstandardized Fourier coefficients. (A) CP data set. (B) NEF data set. 1984 FOURIER METHODS AND WING SHAPES 315 based on tangent angles has two serious problems. The first is that, as pointed out by Zahn and Roskies (1972), reconstruc- tions with relatively few harmonics do not usually result in figures with closed con- tours (see Fig. 4). This makes it more dif- ficult to interpret trends in reconstructed images (such as shown in Fig. 9 for the CP and NEF methods) since they may not look realistic. The second problem is that the estimation of the coefficients seems to be more sensitive to "noise" in the image than the other methods (as mentioned above species CXI 16 was considered to be an outlier only by this method). This leaves the methods based on elliptic Fourier coef- ficients as the most generally useful. The use of the normalized coefficients, NEF, is a convenience since it saves the investi- gator from having to worry about aligning the images in a standard fashion. The only disadvantage is that a small amount of ad- ditional computation is required (which may be a problem if the analyses are being performed on a small microcomputer without floating-point hardware). Our original (somewhat naive) hope was that an analysis of wing shape might yield classifications comparable to conventional taxonomic classification, but even in the absence of such congruence we feel that the present application is quite successful. The ordinations enabled us to summarize and display the diversity of shapes of wings found among the North American Culicidae—even though the wings are fairly similar, and some of the differences rather subtle and difficult to describe by conventional means. The differences in wing shape must have functional and ecological significance, but we did not find any obvious associations with published accounts indicating that a given species was a "strong" or "weak" flier, was found in exposed or sheltered habitats, etc. The detection of such asso- ciations may require additional quantita- tive information about the behavior and physiology of the various species of mos- quitoes. These results are in contrast to the study A Fig. 9. Reconstructed wing outlines for points along the principal component axes shown in Figure 8. (A) CP data set. (B) NEF data set. by Plowright and Stephen (1973) who found good agreement between their morphometric analysis of bee wings and the conventional classification. Their characters described features of the vena- tion rather than the outline shape of the wings as in the present study. Brown (1979) and Brown and Shipp (1977, 1978) also used wing vein lengths to examine phenetic relationships within two groups of Australian Diptera and found that the results varied in their consistency with re- sults from previous studies. The previous studies had used standard taxonomic char- acters including genitalic characters as well as geographic distribution for erecting species groups. Simon (1983) used a series of 49 characters involving wing vein length and was able to establish differ- ences among year classes (broods) and life cycle forms of periodical cicadas (Magici- cada) where no differences had previously been established. Whether differences in wing outline shape will provide useful taxonomic information when more di- verse wing shapes are analyzed remains to be seen. Minor wing shape differences may also be expected between closely related 316 SYSTEMATIC ZOOLOGY VOL. 33 species or populations if these differences are important for ecological / functional dif- ferences in flight characteristics. One of us (JWA) is examining both wing shape and vein length differences among popula- tions and species of Hawaiian picture- winged Drosophila where drastic wing shape differences between species and sexes are apparent. The ready availability of inexpensive coordinate digitizers attached to micro- computers (or to terminals) makes these methods practical for general use. The ac- tual process of digitizing outlines is tedi- ous, however. It takes several minutes to position an image, carefully trace its out- line, and then store the information in a computer. This process can be greatly speeded up for conveniently-sized high- contrast images (where one does not have to adjust a microscope and the background can clearly be distinguished from the ob- ject) by the use of a television camera at- tached to a computer-controlled digitizer. Such a setup in our laboratory (see Ferson et al., in prep.) enables large numbers of specimens to be processed since only a few seconds are required for each image. The principal problem is the fact that the pre- cision of the resulting measurements is somewhat limited. The scanner in our lab- oratory has a resolution of maximally 480 by 640 pixels compared to the 1,700 by 1,700 field on the digitizer used in the present study. Most video units presently available for microcomputers have much less resolution (undoubtedly only a tem- porary limitation). Charge-coupled device (CCD) cameras are also available that have a resolution of 1,700 by 2,200 (as much res- olution as a digitizer), but the processing of an image with 3.74 x 106 bytes requires much more computational effort. The de- termination of which device should be used will depend upon the complexity of the outline of the image and how many harmonics are needed to describe it satis- factorily. This will have to be determined in each study. ACKNOWLEDGMENTS This paper represents contribution number 489 from Graduate Studies in Ecology and Evolution, State University of New York at Stony Brook. This work was supported in part by a grant (BSR8202269) from the National Science Foundation. Some of the pre- liminary computations were performed while one of us (FJR) was on leave at the IBM Thomas J. Watson Research Center. Mr. Scott Ferson assisted this proj- ect in many ways. His help was greatly appreciated. Barbara Thomson assisted in the preparation of the final figures. REFERENCES Anstey, R. L., and D. A. Delmet. 1973. Fourier anal- yses of zooecial shapes in fossil tubular bryozoans. Geol. Soc. Am. Bull., 84:1753-1764. Bookstein, F. L. 1982. Foundations of morphomet- ries. Annu. Rev. Ecol. Syst., 13:451-470. Bookstein, F. L., R. E. Strauss, J. M. Humphries, B. Chernoff, R. L. Elder, and G. R. Smith. 1982. A comment on the uses of Fourier methods in sys- tematics. Syst. Zool., 31:85-92. Brown, K. R. 1979. Multivariate assessment of phe- netic relationships within the tribe Luciliini (Dip- tera: Calliphoridae). Aust. J. Zool., 27:465-477. Brown, K. R., and E. Shipp. 1977. Wing morpho- metries of Australian Luciliini (Diptera: Calliphor- idae). Aust. J. Zool., 25:765-777. Brown, K. R., and E. Shipp. 1978. Wing morpho- metry analysis of Australian Sarcophaginae (Dip- tera: Sarcophagidae). Syst. Entomol., 3:179-188. Carpenter, S. J., and W. J. LaCasse. 1955. Mosqui- toes of North America (North of Mexico). Univ. California Press, Los Angeles. Crovello, T. J. 1969. Numerical taxonomy: Its val- ue to mosquito taxonomy. Mosquito Syst., 1:63-67. Ehrlich, R., R. B. Pharr, Jr., and N. Healy-Williams. 1983. Comments on the validity of Fourier de- scriptors in systematics: A reply to Bookstein et al. Syst. Zool., 32:302-306. Gower, J. 1971. Statistical methods of comparing different multi-variate analyses of the same data. Pages 138-149 in Mathematics in the archaeologi- cal and historical sciences (F. R. Hodson, D. G. Kendall, and P. Tautu, eds.). Edinburgh Univ. Press, Edinburgh. Hendrickson, J. A., and R. R. Sokal. 1968. A nu- merical taxonomic study of the genus Psorophora (Diptera: Culicidae). Ann. Entomol. Soc. Am., 61: 385-392. Hu, M. K. 1962. Visual pattern recognition by mo- ment invariants. IRE Trans, on Information The- ory, 8:179-187. Kaesler, R. L., and J. A. Waters. 1972. Fourier anal- ysis of the ostracod margin. Bull. Geol. Soc. Am., 83:1169-1178. Kincaid, D. T., and R. B. Schneider. 1983. Quanti- fication of leaf shape with a microcomputer and Fourier transform. Can. J. Bot., 61:2333-2342. Kuhl, F. P., and C. R. Giardina. 1982. Elliptic Fou- rier features of a closed contour. Computer Graph- ics and Image Processing, 18:236-258. Moellering, H., and J. N. Rayner. 1982. The dual axis Fourier shape analysis of closed cartographic forms. Cartographic J., 19:53-59. Moellering, H., and J. N. Rayner. 1983. The har- 1984 FOURIER METHODS AND WING SHAPES 317 monic analysis of spatial shapes using dual axis Fourier shape analysis (DAFSA). Geograph. Anal- ysis, 13:64-77. Moss, W. W., W. A. Steffan, N. L. Evenhuis, and D. L. Manning. 1979. The genus Toxorhynchites (Diptera: Culicidae); Analysis of T. splendens and allies using techniques of numerical taxonomy. Mosquito Syst., 11:258-274. Nielsen, L. T. 1969. A critique on numerical tax- onomy. Mosquito Syst., 1:23-25. Plowright, R. C., and W. P. Stephen. 1973. A nu- merical taxonomic analysis of the evolutionary re- lationships of Bombus and Psithyrus (Apidae: Hy- menoptera). Can. Entomol., 105:733-743. Ralston, A. 1965. A first course in numerical anal- ysis. McGraw Hill, New York. Roberts, D. McL., A. Waren, and C. R. Curds. 1983. Morphometric analysis of outline shape applied to the peritrich genus Vorticella. Syst. Zool., 32:377- 388. Rohlf, F. J. 1963. Classification of Aedes by numer- ical taxonomic methods (Diptera: Culicidae). Ann. Entomol. Soc. Am., 56:798-804. Rohlf, F. J. 1967. Correlated characters in numeri- cal taxonomy. Syst. ZooL, 16:109-126. Rohlf, F. J. 1972. Empirical comparison of three ordination techniques in numerical taxonomy. Syst. Zool., 21:271-280. Rohlf, F. J. 1977. Classification of Aedes mosquitoes using statistical methods. Mosquito Syst., 9:372-388. Rohlf, F. J. 1982. Consensus indices for comparing classifications. Math. Biosci., 59:131-144. Rohlf, F. J., and S. Ferson. 1983. Image analysis. Pages 583-599 in Numerical taxonomy (J. Felsen- stein, ed.). NATO ASI Series G, Ecological Sciences No. 1. Springer-Verlag, New York. Rohlf, F. J., and R. R. Sokal. 1967. Taxonomic structure from randomly and systematically scanned biological images. Syst. ZooL, 16:246-260. Simon, C. M. 1983. Morphological differentiation in wing venation among broods of 13- and 17-year periodical cicadas. Evolution, 37:104-115. Simon, C. M., W. A. Steffan, W. W. Moss, and N. L. Evenhuis. 1982. The genus Toxorhynchites (Dip- tera: Culicidae); Numerical phylogenetic analysis of Tx. splendens and allies with phenetic compari- sons. Mosquito Syst., 14:221-261. Sneath, P. H. A., and R. R. Sokal. 1973. Numerical taxonomy. W. H. Freeman and Co., New York. Steward, C. C. 1968. Numerical classification of the Canadian species of the genus Aedes, Syst. Zool., 17:426-437. Zahn, C. T., and R. Z. Roskies. 1972. Fourier de- scriptors for plane closed curves. IEEE Trans, on Computers, 21:269-281. Received 7 February 1984; accepted 1 March 1984.
x

Log In

or reset password

Reset Password

Enter the email address you signed up with, and we'll send a reset password email to that address

Academia © 2012