1: \documentclass[10pt,english,preprint2]{aastex}
2: \usepackage[T1]{fontenc}
3: \usepackage[latin1]{inputenc}
4: \setcounter{tocdepth}{3}
5: \usepackage{graphicx}
6: \usepackage{amssymb}
7:
8: \makeatletter
9:
10: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%% LyX specific LaTeX commands.
11: %% Because html converters don't know tabularnewline
12: \providecommand{\tabularnewline}{\\}
13:
14: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%% User specified LaTeX commands.
15: \usepackage{graphicx}
16: \usepackage{times}
17: \usepackage{courier}
18: \setlength{\emergencystretch}{2em}
19: \usepackage{babel}
20: \makeatother
21: \begin{document}
22:
23: \title{Solar Magnetic Tracking. I. Software Comparison and Recommended Practices \small{\emph{(In press, \apj, 2007)}}}
24:
25:
26: \author{C.E. DeForest}
27:
28:
29: \affil{Southwest Research Institute, 1050 Walnut Street Suite 400, Boulder,
30: CO 80302}
31:
32:
33: \email{deforest@boulder.swri.edu, }
34:
35:
36: \author{H.J. Hagenaar}
37:
38:
39: \affil{Lockheed Martin Advanced Technology Center, Org. ADBS., Bldg. 252,
40: Palo Alto, CA 94304}
41:
42:
43: \email{hagenaar@lmsal.com}
44:
45:
46: \author{D.A. Lamb}
47:
48:
49: \affil{Dept. of Astrophysical and Planetary Science, University of Colorado,
50: Boulder 80309-0391 USA}
51:
52:
53: \email{derek@boulder.swri.edu}
54:
55:
56: \author{C.E. Parnell}
57:
58:
59: \affil{School of Mathematics and Statistics, St. Andrews University, St.
60: Andrews, Scotland KY16 9S}
61:
62:
63: \email{clare@mcs.st-and.ac.uk}
64:
65:
66: \and{ }
67:
68:
69: \author{B.T. Welsch}
70:
71:
72: \affil{University of California at Berkeley Space Sciences Laboratory, 7
73: Gauss Way, UCB, CA 94720-7450, USA}
74:
75:
76: \email{welsch@ssl.Berkeley.edu}
77:
78: \begin{abstract}
79: Feature tracking and recognition are increasingly common tools for
80: data analysis, but are typically implemented on an ad-hoc basis by
81: individual research groups, limiting the usefulness of derived results
82: when selection effects and algorithmic differences are not controlled.
83: Specific results that are affected include the solar magnetic turnover
84: time, the distributions of sizes, strengths, and lifetimes of magnetic
85: features, and the physics of both small scale flux emergence and the
86: small-scale dynamo. In this paper, we present the results of a detailed
87: comparison between four tracking codes applied to a single set of
88: data from SOHO/MDI, describe the interplay between desired tracking
89: behavior and parameterization tracking algorithms, and make recommendations
90: for feature selection and tracking practice in future work.
91:
92: \newpage
93: ~\newpage
94:
95: \end{abstract}
96:
97: \keywords{Sun: magnetic fields, Sun: photosphere, methods: data analysis}
98:
99: \newpage
100: \section{Introduction}
101:
102: The last decade has seen a sea change in the way that solar physics
103: is accomplished. Advances in detector technology have permitted missions
104: such as SOHO (e.g. \citealt{Scherrer1995}) and TRACE (\citealt{Handy1999}),
105: and ground-based observatories such as GONG (\citet{Leibacher1995}),
106: to produce far more data than can be analyzed directly by humans.
107: The planned SDO mission (\citealt{Schwer2002}) will produce data
108: a thousand times faster. Hence, automated data mining has become a
109: necessary tool of the analysis trade. Applied to image data, data
110: mining consists of algorithmic recognition of visual features in the
111: data. Applications such as feature and pattern recognition fall within
112: the field of \emph{computer vision}, which is the subject of active
113: research in the computer science community.
114:
115: Magnetic feature identification and tracking have proven useful for
116: extracting statistical parameters of the solar dynamo
117: (e.g. \citealt{Hagenaar1999,Schrijver1997}), allowing more
118: sophisticated analyses than have been possible by hand
119: (e.g. \citealt{HarveyKL1993}). Current applications of feature
120: tracking include characterization of bulk field behavior at the
121: photosphere, probing of the solar dynamo, identification of the
122: magnetic roots of solar atmospheric features, and constraint of MHD
123: models. Each of these applications is discussed below.
124:
125: In the last few years, each of our research groups has independently
126: developed four separate tracking codes adapted to studying slightly
127: different aspects of the solar magnetic field. CURV was the first
128: code developed to study magnetic features in the MDI quiet sun data
129: (\citealt{Hagenaar1999}); MCAT has been used to study interaction
130: between network flux elements (\citealt{Parnell2002}); SWAMIS
131: (\citealt{LambDeForest2003}) is intended to drive semi-empirical MHD
132: models of the quiet sun; and YAFTA (\citealt{Welsch2003}) was
133: developed to study active region dynamics.
134:
135: Our separate tracking codes are similar enough to be applied to
136: similar problems and to yield directly comparable results. However,
137: feature tracking is not a simple endeavor, and many subtle
138: characteristics of each code can strongly affect derived results.
139: This, together with the ad-hoc manner in which each tracking code
140: was developed, made it difficult to compare or duplicate results
141: between groups.
142:
143: In 2004 November we met at St. Andrews University to reconcile results
144: from all four sets of software. We applied each code to a sample data
145: set and compared results from the different algorithms to reconcile
146: the results across research groups. Furthermore, we identified how
147: algorithmic choices affect magnetic feature tracking results, and
148: developed a set of recommended practices to guide future development
149: of feature tracking and related software for the solar community.
150:
151: Developing a baseline of best recommended practices for feature
152: tracking and computer vision is an important goal for the solar
153: imaging community, because feature tracking is a fundamental component
154: of many types of data analysis. Applied to the solar magnetic field,
155: it has been used to characterize the statistical parameters of the
156: field by determining the distribution of feature sizes
157: and fluxes
158: (\citealt{HarveyKL1993,Hagenaar1999,Hagenaar2001,Parnell2002}) and the
159: average lifetime of individual features (\citealt{Hagenaar2003}).
160: Automated extraction of parameters such as clustering distributions
161: (\citealt{LambDeForest2003}) and event distributions
162: (\citealt{DeForestLamb2004}) are being used to derive more detailed
163: information about the solar dynamo. All of these applications are
164: dominated by the relationship between small scale event detections and
165: the noise floor of the instrument used for detection, generally a
166: line-of-sight / scalar magnetograph such as SOHO/MDI
167: (\citealt{Scherrer1995}) or GONG (\citealt{Leibacher1999}).
168:
169: Feature tracking is further useful for constraining the energy input into flux
170: systems in the solar corona. Much of the energy deposited into the
171: chromosphere and corona is thought to be transported by the Poynting
172: vector, as photospheric motions do work on the magnetic field by
173: pushing magnetic flux around the surface
174: (e.g. \citealt{Parker1988,Fossum2004}). Feature tracking allows
175: simple derivation of the motion field from time series of
176: images. \citet*{Welsch2004} used feature tracking to estimate the
177: quiet sun helicity flux into the corona, and \citet{DeForestLamb2004}
178: and Parnell (\citeyear{ParnellJupp2000,Parnell2002}) are using feature
179: tracking to identify the roots and nature of small scale heating
180: events such as bright points.
181:
182: A third important application of feature tracking is to drive boundary
183: conditions of semi-empirical MHD models of the solar atmosphere, such
184: as are anticipated for space weather prediction. Time-dependent MHD
185: modeling requires knowledge not just of the three-dimensional vector
186: field at the surface of the Sun, but also of the motion of individual
187: lines of magnetic flux; feature tracking derives the motion
188: information from time series measurements of the magnetic
189: field. Indeed, Peano's existence and completeness theorem (see, e.g.,
190: \citealt{Simmons}) implies that knowledge of the initial magnetic
191: topology in the force-free upper layers of the atmosphere, together
192: with the radial component of the field at the lower boundary, is
193: equivalent to knowledge of the full vector field everywhere on the
194: lower boundary. Provided that the initial topology may be estimated,
195: this equivalence makes feature tracking a powerful tool for modeling
196: energy input into the solar atmosphere even in the absence of full
197: vector field measurements, as the distribution of radial magnetic flux
198: on the $\tau=1$ surface at the photosphere approximates the
199: distribution at the $\beta=1$ surface in the upper chromosphere.
200:
201: In this article, the first in a series on results from tracking of
202: photospheric magnetic features, we discuss the state of the art and some
203: current applications of magnetic tracking software. In
204: \S\ref{sec:Discussion-of-Tracking} we outline the basic steps of a
205: feature tracking algorithm; in
206: \S\S\ref{sec:Tracking-Results:-A}-\ref{sec:Discussion} we present and
207: discuss the differences between the codes' results as applied to a
208: reference data set; and in \S\ref{sec:recommendations} we recommend
209: {}``best practices'' for future codes to follow for feature
210: tracking applications. Finally, \S\ref{sec:Conclusions} contains some
211: general conclusions and insights, and a glossary at the end contains
212: recommended vocabulary to describe specific aspects of magnetic
213: tracking.
214:
215:
216: \section{\label{sec:Discussion-of-Tracking}Discussion of Tracking Algorithms}
217:
218: Feature tracking can be divided into five separate operations: (i)
219: image preprocessing; (ii) discrimination/detection; (iii) feature
220: identification within a frame; (iv) feature association across frames;
221: and (v) event detection. In addition, some noise filtering is
222: accomplished by filtering the associated features to discard
223: short-lived or small features that have too high a likelihood of being
224: noise. Here, we discuss the important components of magnetic feature
225: tracking algorithms in general, and outline the differences between
226: each of the four principal codes that we compared.
227:
228:
229: \subsection{Preprocessing}
230:
231: In general, magnetograms arrive from an instrument with some level of
232: background noise and with position-dependent foreshortening due to the
233: curvature of the Sun. Reducing the noise floor and eliminating
234: perspective effects requires preprocessing images before applying
235: feature recognition. Temporal averaging, projection angle scaling, and
236: resampling to remove perspective and solar rotation effects are
237: commonly applied before most high-level analysis. In particular, 0.5-2
238: arcsecond scale magnetograms benefit from being averaged over 5-12
239: minutes to reduce background noise, and line-of-sight (Stokes V)
240: magnetograms of the quiet sun benefit from being divided by a cosine
241: factor to account for the difference between the magnetogram line of
242: sight and local vertical at the surface of the Sun, under the model
243: that weak field is close to vertical at the photosphere.
244:
245: Spaceborne magnetographs such as MDI and the anticipated SDO are also
246: susceptible to cosmic ray spikes, which must be removed either by
247: temporal filtering of tracked features or by preprocessing the images.
248:
249: Most magnetograms made with a filtergraph type instrument such as MDI
250: or GONG contain at least three sources of random noise at each pixel:
251: (i) photon statistics, which produce a familiar white noise spectrum;
252: (ii) P-mode contamination, which is due to the five-minute Doppler
253: oscillations leaking into the Zeeman signal; and (iii) granulation
254: noise, which is due to solar evolution between the different
255: filtergraph exposures that make up each magnetogram. The photon shot
256: noise is a uniform random variable with an independent sample at every
257: pixel and a presumed Gaussian distribution. The P-mode contamination
258: is a random variable with far fewer independent spatial samples per
259: image, because of the low spatial frequencies of the P-modes, and an
260: oscillating temporal component. Granulation-based noise has a spatial
261: scale of a few arc seconds and a coherence time of 5 minutes. MDI is
262: well tuned so that the three sources of noise are about equal in
263: individual images; but in spatially binned, temporally averaged, or
264: smoothed images, the granulation and P-modes dominate the noise
265: spectrum.
266:
267: Although data preprocessing is not part of the process of feature
268: identification and tracking, preprocessing effects can affect tracking
269: results and we recommend (in \S\ref{sec:recommendations}) specific
270: practices to reduce artifacts.
271:
272: \subsection{Discrimination}
273:
274: Any feature-recognition algorithm requires \emph{discrimination},
275: i.e. the separation of foreground features from background noise.
276: Every magnetogram sequence appears to contain many faint features
277: at or slightly below the level of the noise floor, so discrimination
278: is not trivial.
279:
280: The simplest discrimination scheme, direct thresholding, works well
281: only for strong magnetic features that are well separated from the
282: noise floor, such as flux concentrations in the magnetic network or
283: in active regions. Other types of magnetic feature, such as weak intranetwork
284: fields, suffer because keeping the threshold high enough to avoid
285: false-positive detections creates a large number of false-negative
286: non-detections of the weak magnetic features. The problem is the huge
287: number of individual detection operations (one per pixel per time
288: step), which makes false positives a significant problem. With a Gaussian
289: noise distribution, setting the threshold to three standard deviation
290: ($\sigma$) units yields a false positive rate of about $10^{-4}$,
291: so that a 300-frame data set with dimension 300x300 pixels would yield
292: around three thousand false positive detections from noise alone,
293: and perhaps 10-30 times that number of inconsistently detected weak
294: features (false negatives).
295:
296: Each of our codes used a different discrimination scheme, affecting
297: what types of feature could be detected. YAFTA, originally intended
298: for use with active region magnetograms well above the noise floor,
299: uses a simple threshold test to discriminate. The other three codes
300: have adopted two different schemes to work closer to the noise floor,
301: both of which add additional tests to the basic threshold test.
302:
303: SWAMIS and MCAT use hysteresis, used by \citet{LambDeForest2003} and
304: by \citet{Parnell2002}, in which two thresholds are applied: a high
305: threshold for isolated pixels, and a second, lower threshold for
306: pixels that are adjacent to already-selected pixels. Adjacency is
307: allowed in space and/or time. The hysteresis misses some very weak
308: features, but captures every feature that at some location and/or time
309: exceeds the large threshold. The proximity requirement reduces the
310: number of pixels that undergo the lower threshold test, and therefore
311: reduces the number of false-positive detections. Depending on
312: application, the higher threshold is chosen to be $3-6\,\sigma$, and
313: the lower threshold $1-3\,\sigma$, where $\sigma^{2}$ is the variance
314: (and $\sigma$ is the RMS variation) of the data.
315:
316: MCAT and SWAMIS differ slightly in the nature of the hysteresis. Both
317: codes use separate masks for positive and negative flux
318: concentrations, but MCAT applies the low threshold to any pixels in
319: the current frame that are adjacent to a detected feature in the next
320: or previous frame. MCAT makes one forward pass through the data,
321: comparing pixels in each frame to the higher threshold at most
322: locations and to the lower threshold in locations that were occupied
323: in the previous frame; and then one reverse pass that is identical
324: except that the low-threshold mask comes from the next, rather than
325: previous, frame.
326:
327: SWAMIS uses a ``contagion'' algorithm that treats the time axis as a
328: third spatial dimension: a pixel is subjected to the low threshold if
329: it is adjacent, either in time or in space, to any detected pixel.
330: Considering pixels as cubes in (x,y,t) space, each pixel is subjected
331: to the lower threshold if it shares at least one edge with a pixel
332: that has been marked occupied and that has the same sign. The
333: contagion algorithm is executed in a single pass through the data with
334: in-frame recursion to dilate the detected regions annd with
335: backtracking to re-test newly ``infected'' pixels in previous frames.
336:
337: CURV uses the curvature method used by \textbf{}\citet{Strous1996} and
338: by \citet{Hagenaar1999}, in which both the data
339: values and their second derivative are tested. To be considered part
340: of a local maximum/minimum by the curvature algorithm, the magnitude
341: of a pixel must exceed a value threshold and all surrounding pixels
342: must have a negative/positive second derivative in each of the
343: horizontal, vertical, and two diagonal directions. The second
344: derivative criterion adds four additional independent threshold tests
345: for each pixel, reducing the number of false detections at a given
346: threshold and allowing single threshold values comparable to the low
347: threshold values used in SWAMIS and MCAT.
348:
349: All three of MCAT, SWAMIS, and CURV impose minimum-size and lifetime
350: requirements on features at a later step in the processing, reducing the
351: effect of false positives in the detection step. YAFTA also imposes a
352: minimum lifetime requirement to reduce false positives from noise
353: fluctuation.
354:
355: \subsection{\label{sub:Feature-identification}Feature identification}
356:
357: Feature identification is the operation of connecting masked pixels
358: into distinct identifiable (and identified) structures in each frame.
359: In practice, this means forming a detected feature map, an image whose
360: pixels have integer numeric values that correspond to index numbers of
361: particular features. Each of our codes uses a variant of a clumping
362: dilation algorithm that identifies connected loci of pixels within a
363: masked region.
364:
365: MCAT clumps masked pixels directly into contiguous regions.
366: YAFTA and SWAMIS can switch between direct clumping and a gradient
367: based (``downhill'') method that dilates local maxima by expansion
368: down the gradient toward zero flux density. CURV also uses direct
369: clumping, but generates initial feature masks with a data-value curvature
370: method that restricts the features to isolated regions, yielding features
371: that are segmented more like those of the downhill method in YAFTA
372: and SWAMIS than like the other clumping codes. All three techniques
373: are illustrated in Figure \ref{fig:identification-types}.
374:
375: %
376: \begin{figure}[!tb]
377: \begin{center}\center{\includegraphics[%
378: width=3in,
379: height=1.14in]{f1a.eps}}\end{center}
380:
381:
382: \caption{\label{fig:identification-types}Effect of different feature-identification
383: schemes on the identified structure of a large flux concentration.
384: (A) Clumping identifies all connected above-threshold pixels into
385: a single feature. (B) Downhill methods identify one feature per local
386: maximum region. (C) Curvature methods identify the convex core around
387: each local maximum.}
388: \end{figure}
389:
390:
391: The tradeoff between the clumping and downhill methods is that the
392: downhill method is better at picking out the structure of individual
393: clusters of magnetic flux, while the clumping method is somewhat less
394: noise-susceptible. Fluctuations from either solar convection or
395: instrument noise can easily create small local maxima that are
396: identified as transient structures by the downhill method; simple
397: clumping eliminates these small transients, for better or for worse.
398:
399: Which method is appropriate depends on the specific scientific
400: application, as discussed further in \S
401: \ref{sec:recommendations}.
402:
403: \subsection{\label{sub:Feature-association}Feature association}
404:
405: Feature association is the fixing of a feature's identity across
406: different frames of an image sequence. Most features in adjacent
407: frames of an image are related by similarity of position and shape:
408: when a feature in frame $m+1$ is sufficiently similar to the feature
409: in frame $m$, then it is likely that the two features represent the
410: same physical object at the two different times. All of our existing
411: codes use variations of a dual-maximum-overlap criterion to identify
412: persistent features across frames (Figure \ref{fig:assoc}); this
413: technique associates two features B and C in adjacent frames only if
414: $B\cap C$ is larger (in a flux-weighted sense) than any other
415: intersection with either $B$ or $C$.
416:
417: %
418: \begin{figure}[!tb]
419: \center{\includegraphics[%
420: width=3in,
421: height=2.473in]{f2a.eps}}
422:
423:
424: \caption{\label{fig:assoc}A pathological association case. Features A and
425: B are in the previous frame, C and D in the current frame. A maximum-overlap
426: method associates B and C. The recommended associative algorithm (dual-maximum
427: overlap) associates B=C if and only if $B\cap C$ is the largest of
428: C's intersecting regions and also the largest of B's intersecting
429: regions. A and B merge to form C, at the same time that D calves via
430: fragmentation from B.}
431: \end{figure}
432:
433: All four of our codes follow variants of the largest-intersection
434: criterion, either following maximum flux overlap or maximum area
435: overlap. YAFTA uses arbitrary label choice on its first pass through
436: the data, and then ``label conflicts'' in a subsequent pass that
437: uses maximum overlap.
438:
439: \subsection{Filtering based on size/longevity}
440:
441: When working close to the noise floor, it is useful to reject small
442: features, because false positives are much more likely in small
443: clusters of pixels than in large ones. All of our codes reject
444: identified features that do not meet some minimum size
445: criterion. Criteria that are useful include: maximum size; average
446: size; lifetime; or total number of pixels across the life of the
447: feature. The filtering can be accomplished only after feature
448: identification (for per-frame size checks) or feature association (for
449: maximum size checks or longevity checks).
450:
451: Additional problems exist due to fluctuations in background noise that
452: may cause features that appear for only a single frame, or may cause weak
453: but persistent features to disappear for a frame (the \emph{Swiss
454: cheese problem}). Similarly, associated features may split and then
455: re-merge rapidly due to fluctuations in a single frame (the
456: \emph{oscillating twins} problem).
457:
458: CURV sidesteps association problems by requiring oversampling on the
459: time axis of the input data cube; this reduces the frame-to-frame
460: fluctuations of individual features.
461:
462: MCAT avoids both problems with a three-step process. (i)
463: completely-surrounded holes in the center of a feature are filled in
464: and the missing pixels are counted as part of the feature; (ii) Twin
465: features that merge for a single frame are forced to remain separate;
466: (iii) single features that split for a single frame are forced to
467: remain merged.
468:
469: SWAMIS overcomes both the oscillating twins and Swiss cheese problems
470: by re-associating short lived features with nearby larger features if
471: there is sufficient overlap between them.
472:
473:
474: \subsection{Classification of Origin and Demise}
475:
476: Identifying and locating individual features as they evolve is
477: properly described as \emph{feature tracking}, but identifying
478: structures and events that may include several features is more
479: properly described as a complete \emph{computer vision}
480: application. Not all of our feature tracking codes include provision
481: to detect and identify interactions of multiple flux concentrations,
482: such as pairwise emergence, but such detection is an important part of
483: characterizing magnetic evolution and hence is discussed here. In
484: particular, because magnetic features are not corks but rather
485: cross-sections of curvilinear manifolds (field lines that pass through
486: the photosphere), they are connected pairwise by the magnetic
487: field. Identifying the association between freshly emerged pairs thus
488: gives useful information about the overall field topology and how it
489: changes via reconnection of the overlying field before the death
490: (e.g. by submergence) of the individual features.
491:
492: The origin and demise of features is different than the origin and
493: demise of magnetic flux itself: in particular, features can fragment
494: or merge under the influence of the photospheric flow field (e.g.
495: \citealt{Schrijver1997}) without any flux emerging or submerging through
496: the photospheric surface. Fragmentation and merging can result in
497: apparent violation of the conservation of flux as magnetic flux sinks
498: below or rises above the detection threshold of the instrument being
499: used to detect it.
500:
501: Software to identify origin events recognizes flux concentrations
502: near each newly detected concentration, and classifies the origin
503: according to these nearby concentrations and the time derivative of
504: their contained flux. To avoid missing associated structure, an allowed
505: margin of error is required in the spatial or temporal offset between
506: two associated features, and also in the flux rate-of-change between
507: the features.
508:
509: Demise events are similar to origin events and may be recognized with
510: the same code, operating on tracked data in reverse time order. As
511: with birth events, demise events are not necessarily related to emergence
512: or submergence of magnetic flux.
513:
514:
515: \section{\label{sec:Tracking-Results:-A}Tracking Results: A Comparison Across
516: Codes}
517:
518: %
519: \begin{figure}[!tbh]
520: \begin{center}\center{\includegraphics[%
521: width=3in,height=5.9in]{f3a.eps}}\end{center}
522:
523:
524: \caption{\label{fig:context}The tracked field of view, in context. Note that
525: the edge of the grey circle in the MDI full-disk image is not the
526: limb of the Sun, it is a crop radius for the instrument, just outside
527: the limb.}
528: \end{figure}
529:
530:
531: %
532: \begin{figure}[!t]
533: \begin{center}\center{\includegraphics[%
534: width=3in,
535: height=2.285in]{f4a.eps}}\end{center}
536:
537:
538: \caption{\label{fig:dist-func}Distribution function of weak-field pixels
539: in the test dataset, a sequence of five-minute average high resolution
540: magnetograms from SOHO/MDI. A Gaussian fit to the low-valued pixels
541: (presumed to be noise) is shown. The measured standard deviation ($\sigma$)
542: of the images is 18.3 Gauss.}
543: \end{figure}
544:
545:
546:
547: \subsection{Description of the Dataset}
548:
549: We analyzed a sequence of 600 one-minute-cadence MDI high resolution
550: quiet-sun images from 2003 June 04, beginning at 05:43 UT. The images
551: were resampled into heliographic longitude/latitude coordinates (\emph{plate
552: caree} projection) using an orthographic model of the solar image,
553: as shown in Figure \ref{fig:context}. The reprojection used the ANA
554: language resampling tools by R. Shine (1999, personal communication). The tracked
555: images were 300x300 pixels and ran over the range $-16.3^{\circ}$
556: --- $-7.3^{\circ}$ in longitude and $-2.8^{\circ}$---$6.2^{\circ}$
557: in latitude. This scale slightly enlarged the images, to a pixel size
558: of 0.03 heliocentric degrees (about 0.48 observer arc seconds at disk
559: center). Images were derotated to the central time in the data sequence,
560: using the \citet{Snodgrass1983} synodic differential rotation curve
561: and rigid-body rotation at a latitude of $14^{\circ}$. These derotated,
562: plate caree images were averaged together in blocks of five minutes
563: each to reduce shot noise and P-mode interference. The noise level
564: was determined by fitting a Gaussian profile to the weak portion of
565: the pixel strength distribution curve (Figure \ref{fig:dist-func}).
566: The width ($\sigma$) of the best fit Gaussian profile was 18.3 Gauss,
567: which should be taken as the sum of all incoherent noise components
568: (principally shot noise, granulation, and P-mode leakage). Small frame-to-frame
569: offsets of the zero point (presumably due to variations in the instrument's
570: exposure time) were found by measuring the offset from zero for the
571: best-fit Gaussian, and removed by subtraction from each frame.
572:
573:
574: \subsection{Feature size distribution}
575:
576: The simplest comparison to make across codes is distribution of fluxes
577: of detected magnetic features. Figure \ref{fig:sizes} shows the results
578: of applying all four of our codes (with two different identification
579: techniques for SWAMIS) to the same data. The five different techniques
580: yield obviously different flux distributions for the network; here
581: we discuss the features in the plots and the differences between them.
582: The plots all have the same height scale and the same bin size, so
583: the histograms are directly comparable. All the codes exhibit high
584: and low threshold behaviors that are discussed below; but it should
585: be immediately apparent by inspection of Figure \ref{fig:sizes} that
586: the codes diverge at the small end of the flux spectrum, achieving
587: a moderately good agreement in slope only for flux concentrations
588: larger than about $2\times10^{18}$~Mx. All four methods produce a
589: slope of about $-0.35\pm0.05$ decades per $10^{18}$~Mx, corresponding
590: to an e-folding width of about $1.2\times10^{18}$~Mx in the distribution.
591:
592: %
593: \begin{figure*}[tb]
594: \center{\includegraphics[%
595: width=6in,
596: height=2.441in]{f5a.eps}}
597:
598:
599: \caption{\label{fig:sizes}Network feature flux distributions as derived by
600: the five algorithms we compared in \S3. See text for full discussion. }
601: \end{figure*}
602:
603: All of the codes display a weak-feature threshold effect (false
604: negatives) as small features that are close to the noise floor are
605: eliminated by the discriminators. YAFTA and MCAT show the strongest
606: threshold effects, because they rely on a combination of minimum
607: strength and minimum feature size in each frame to eliminate
608: false-positives from the discrimination step. The YAFTA threshold is
609: particularly abrupt because the initial detection discriminator uses
610: no hysteresis, so that all detected features must have a minimum
611: number of pixels with a minimum amount of flux per pixel. MCAT's
612: threshold is softer because the hysteresis feature of the
613: discriminator allows weaker pixels to be detected around a strong
614: core; the dearth of very weak features is due to the combination
615: lifetime-and-strength requirement, which removes many weak features
616: that are detected by the other codes. CURV shows a still softer turnover
617: and threshold because the CURV discriminator does not rely on a high
618: threshold value in any one pixel to trigger detection. The turnover
619: at about $1\times10^{18}$~Mx reflects the geometrical factor of 3 that
620: is applied to CURV measurements, together with the requirement for a
621: 9-pixel concave-down region. SWAMIS shows no obvious threshold at all
622: because its recursive temporal hysteresis admits many features that
623: have no strong pixels in a particular frame: provided that a feature
624: has a single strong pixel at any point in its lifetime, all of its
625: pixels are subjected to the weaker threshold.
626:
627: The disagreement between the different codes on the weak feature distribution
628: is telling: it is difficult to distinguish reliably the flux distribution
629: of magnetic features that are smaller than about $10^{18}$ Mx in
630: strength even with time-averaged and conditioned MDI data. The MDI
631: hi-res $1\ \sigma$ detection threshold in our time-averaged data
632: is about $2.2\times10^{16}$ Mx, corresponding to a single pixel with
633: an 18 Gauss signal (Figure \ref{fig:dist-func}); features with less
634: than $50\times$ this much flux are not reliably detected across methods.
635:
636: The greater weak-feature counts of CURV and YAFTA compared with SWAMIS
637: and MCAT do not necessarily correspond to greater sensitivity: our
638: data set was not controlled for false positives. When characterizing
639: a code for weak feature sensitivity, one should use noise injection
640: null techniques to identify false positive rates. Likewise, despite
641: the lack of obvious threshold the SWAMIS weak-feature distribution
642: curve should not be trusted below about $1\times10^{18}$ Mx because the
643: hysteresis requirement may reject many transient weak features that
644: never happen to achieve the flux density required to trip the high
645: threshold.
646:
647: In the moderate-strength feature range of $2-5\times10^{18}$ Mx,
648: all four of YAFTA, MCAT, SWAMIS/downhill, and SWAMIS/clump are in reasonably
649: good agreement, with the main difference being between the downhill-like
650: codes (CURV, YAFTA, and SWAMIS/downhill) and the clumping codes (MCAT,
651: SWAMIS/clump). The difference is due to the segmentation of large
652: features into several smaller ones, giving the downhill-like codes
653: slightly more small features and slightly fewer large ones.
654:
655: The different codes disagree substantially on slope of the flux
656: distribution curve in two different regions. Below about
657: $1\times10^{18}$~Mx , the codes diverge strongly in feature counts and all have
658: distribution features that might serve to indicate a transition to noise-dominated
659: numbers: the slopes change in all the codes, and codes with thresholds of various
660: sorts exhibit turnover behaviors due to those thresholds.
661:
662: In the small-feature range $1-1.5 \times 10^{18}$~Mx, each individual curve
663: has no clear indication that the data are becoming unreliable, but the different
664: detection schemes give divergent results. CURV and YAFTA find more small features than
665: MCAT or SWAMIS in this range, due to a combination of noise and
666: higher sensitivity. MCAT, which has the most stringent noise-elimination steps in
667: the detection code, detects signifficantly fewer features in this size range,
668: yielding a lower slope.
669:
670: In the window of $~1.5-7\times10^{18}$~Mx, all three codes agree on
671: the slope of $-0.28\pm0.03\ \textrm{decade}^{-1}$, or an e-folding width of
672: $1.55\pm0.15\ \times\ 10^{18}$~Mx. The downhill-like methods find the
673: steeper limit and the clumping methods find the shallower
674: limit. Features in this size range are strong enough to be detected by
675: all three discriminators but not so large that the differences between
676: the large-scale behavior of the three codes is important. We conclude
677: that features in this size range are easily detectible with MDI and
678: results that use this feature size range are robust against small
679: changes in detection technique.
680:
681: The large-feature performance of the codes varies slightly across
682: algorithm, though all four algorithms are in rough agreement below
683: $10^{19}$~Mx. At higher values the feature counts are too low to
684: provide good statistics, but general comments are possible. The CURV
685: discriminator tends to break up large features into multiple small
686: features, and very large features tend to have wider wings than the
687: Gaussian profile that is assumed by CURV, slightly lowering the number
688: of detections well above $10^{19}$~Mx. YAFTA and SWAMIS/downhill also
689: tend to break up very large concentrations of flux into multiple
690: features, but that effect is not as strongly apparent. SWAMIS/clump
691: and YAFTA agree quite well on the flux distribution from the YAFTA
692: threshold to several $\times10^{19}$Mx. In this size range, results appear
693: reproducible but care is needed when inferring physical values from
694: tracking results as the results appear dependent on the method used to
695: identify individual features.
696:
697:
698:
699: %
700: \begin{figure*}
701: \center{\includegraphics[%
702: width=6in,
703: height=2.441in]{f6a.eps}}
704:
705:
706: \caption{\label{fig:lifetime} Measured feature lifetime is strongly
707: dependent on tracking technique, as seen by comparing lifetime histograms
708: derived from each of the authors' separate tracking codes. See text
709: for full discussion.}
710: \end{figure*}
711:
712: \subsection{Feature Lifetimes}
713:
714: Feature lifetime is strongly affected by the feature-association step
715: of the codes and hence cross-code comparison is important to identify
716: how reproducible that step is. Feature lifetime is also important to
717: the solar physics: it is used as important measure of flux turnover
718: rate (e.g. \citet{Hagenaar2003}), although some physical effects other than
719: flux turnover can affect it. By comparing our codes we obtain a measure
720: of the reliability of feature lifetime measurements in the literature.
721:
722: Several potential effects can introduce errors into flux turnover
723: rates measured with feature tracking codes. In particular,
724: fragmentations and mergers of like-signed features cause end-of-life
725: events, while the associated magnetic flux survives. Similarly,
726: fluctuations in the total flux or the area of a small feature cause
727: many birth and death events. These effects tend to shorten the
728: measured lifespan of features, causing an apparent (but not real)
729: increase in the turnover rate of magnetic flux.
730:
731: Similarly, all of our codes observe many features that are born and/or
732: die in a way that does not apparently conserve magnetic flux; these
733: events may be due to asymmetries in the field strength of small
734: bipoles, or due to statistical fluctuation in a collection of very
735: small, unresolved concentrations of magnetic flux. If the latter is
736: true, then the individual unresolved concentrations that make up a
737: feature must have much shorter lifespans than the resolvable feature,
738: causing an apparent (but not real) decrease in the turnover rate of
739: magnetic flux.
740:
741: Figure \ref{fig:lifetime} shows a histogram plot of feature lifetime
742: from each of our codes. The codes agree on the slope (but not the
743: value) of the lifetime histogram for a narrow range of lifetimes
744: between 20-50 minutes. Roughly 75\% of features found by SWAMIS in
745: this range of lifetimes are in the $2-6\times 10^{18}$~Mx size range
746: in which the codes agree on feature counts, suggesting that this
747: region of slope agreement is similar to the region along the size
748: axis: features in this population are high enough above the noise
749: floor to be readily detectible but not so large nor long-lived that
750: geometrical effects fool the different tracking algorithms.
751:
752: The differences between the curves are entirely due to differences in
753: the algorithms of the codes, as we examined identical data. MCAT
754: requires longevity of more than two frames (10 minutes) in the
755: identification step; the small number of one-frame features are due to
756: fragmentations (features that fragment from an existing feature, then
757: disappear one frame later). SWAMIS and CURV use much weaker longevity
758: requirements, and therefore detect similar numbers of short-lived
759: features. YAFTA detects many more short-lived features than the other
760: codes, in part because of a lower detection threshold (no detection
761: hysteresis was used for this data set) but does not include any
762: features with less than a 4-frame (20 minute) lifetime.
763:
764: CURV, alone of all the codes, shows a minimum in the lifetime
765: histogram followed by a slight rise in the 150-200 minute range. The
766: population in the rise consists of nearly 100 structures, enough to be
767: statistically significant compared to just 9 features features found
768: by CURV in this data set with lifetimes between 100 and 150 minutes.
769: It is not clear whether this is an unusual statistical event or a
770: quirk of the CURV association scheme.
771:
772: The main conclusion to draw from this comparison is that feature
773: lifetimes are extremely difficult to measure with tracking codes; in
774: consequence, average magnetic lifetime results from magnetic tracking
775: of arcsecond-scale data should be considered weak. We will address
776: the nuances of lifetime measurement, and its relevance to physical
777: parameters such as magnetic turnover time and heating rate, in a later
778: article in this series.
779:
780: \section{\label{sec:Discussion}Discussion}
781:
782: Each of the techniques we considered has advantages for a particular
783: regime of fragment size and strength relative to the noise floor of
784: the instrument. Here, we discuss the tradeoffs of the different detection
785: schemes. The main differences between our codes lay in the discrimination
786: and feature-identification steps, which are discussed separately.
787:
788:
789: \subsection{Discrimination}
790:
791: The main problem faced by tracking discriminators is the huge number
792: of statistically independent samples across an image sequence dataset.
793: The simplest discriminator is a threshold trigger; while threshold
794: triggers are inadequate for many tasks when used alone, they form the
795: basis of every discrimination algorithm. The three main ways we
796: improved upon simple threshold triggering were curvature sensing
797: (CURV), hysteresis (MCAT, SWAMIS), and post-discrimination filtering
798: for feature size and longevity (all codes). Simple trigger
799: discrimination is useful mainly where the signal-to-noise ratio is
800: overwhelmingly large. One code in our study (YAFTA) was optimized for
801: strong field detections and used simple trigger discrimination, although
802: subsequent versions of YAFTA include the ability to use hysteresis.
803:
804: Curvature sensing as implemented in CURV has the advantage that, when
805: combined with a threshold trigger, it applies five statistically
806: independent threshold tests to each pixel, significantly reducing the
807: false-positive rate. CURV rejects features whose convex cores are
808: smaller than 9 pixels. Including the effects of smoothing in the
809: preprocessing steps, which leave granulation as the dominant source of
810: noise, there are about 12 statistically independent tests (of 45 total
811: conditions) required to detect a particular feature. By contrast, a
812: direct trigger yields only about three statistically independent tests
813: with the same size threshold. Curvature discrimination permits a
814: detection threshold much closer to the noise floor than would
815: otherwise be possible, which in turn should make curvature
816: discrimination rather sensitive to weak concentrations of magnetic
817: flux.
818:
819: The disadvantage of curvature discrimination is that it only finds the
820: convex core of a magnetic feature. This is addressed by
821: \citet{Hagenaar1999} via a simple scaling: they find that for a large
822: variety of near-Gaussian distributions the convex core is about 1/3 of
823: the total flux in the feature, and scale accordingly. This works well
824: for small features near the resolution limit of the observations, but
825: not as well for larger features, which are observed to have flatter
826: profiles than a Gaussian. Large concentrations of flux typically have
827: several local maxima, and the total flux may be over- or
828: underestimated by the assumption of a simple Gaussian shape, depending
829: on the actual morphology of the feature.
830:
831: Hysteresis is a simple way of reducing the false positive rate of
832: threshold-trigger discrimination. Pixels are compared against
833: different trigger thresholds depending on whether they are isolated or
834: adjacent to other detected pixels. Both MCAT and SWAMIS use a
835: recursive-hysteresis scheme that starts with a simple threshold
836: scheme, and then dilates detected pixels using a lower threshold. Such
837: schemes eliminate many false positives due to the background noise
838: floor, and detect the full extent and shape of large features. The
839: drawback is that weak features are only detected if they have at least
840: one 'seed' pixel that is stronger than the high threshold. SWAMIS
841: further allows dilation along the time axis, so that weak features are
842: detected if at some point in their lifetime they have a single strong
843: pixel; but even so, many transient weak features that are visible to
844: the eye go undetected for lack of a single strong pixel.
845:
846:
847: \subsection{Feature Identification}
848:
849: Here, we contrast the two principal dilation strategies of the codes:
850: downhill and clumping dilation from local maxima. The distinction
851: between these strategies is academic within CURV, as the curvature-based
852: discriminator provides well-separated loci around each local maximum:
853: the final detected loci are the same regardless of dilation method.
854: MCAT, SWAMIS, and YAFTA can dilate using clumping, and SWAMIS and
855: YAFTA can dilate using the downhill technique.
856:
857: Both the downhill and clumping techniques, together with hysteretic
858: thresholding, do better than curvature at identifying the size and
859: shape of mid-sized magnetic features in the several arc second size
860: range. In this size range the shape of individual features varies
861: considerably, though most features still have but one local maximum
862: in the MDI data that we tracked; under these conditions, both dilation
863: techniques do about as well as one another and both measure the flux
864: of individual features with more precision than CURV (which uses a
865: simple geometric factor to estimate the flux in the wings of the structure).
866:
867: %
868: \begin{figure}
869: \center{\includegraphics[%
870: width=3in,
871: height=1.14in]{f7a.eps}}
872:
873:
874: \caption{\label{fig:methods}Effect of feature identification technique: a
875: pathological case. Clumping (right) is less sensitive to noise in
876: weak features than is downhill dilation (left), but can lead to counterintuitive
877: results in active regions as large, irregular patches of flux are
878: identified as a single object.}
879: \end{figure}
880:
881:
882: Large scale structures that are more than about 15 arc seconds across
883: yield stronger differences between the downhill and clumping techniques,
884: as illustrated in Figure \ref{fig:methods}. The downhill technique
885: does a better job at tracking substructure of large, extended objects
886: such as plage and active region fields, but at the expense of more
887: noise susceptibility. Because small amounts of noise can produce transient
888: local maxima in a large extended feature, the downhill technique is
889: susceptible to the \emph{swiss cheese problem} in which a single large
890: clump of flux with no strong local maximum can be oscillate between
891: being detected as one or several separate features. If lifetime filtering
892: is being applied to the detected-feature list, then large holes may
893: appear in the detected feature, giving it the appearance of an irregular
894: block of Ementhaler cheese. Furthermore, downhill detection alone
895: tends to miss very large concentrations of flux, treating them as
896: a collection of smaller features: while this is desirable for tracking
897: the motion of the solar surface, it is not desirable when measuring
898: the statistics of strength or size of magnetic features.
899:
900: We discuss the tradeoff between these detection techniques, neither of
901: which is perfect, in \S\ref{sec:recommendations}, below.
902:
903:
904: \subsection{\label{sub:Cross-frame-feature-association}Cross-frame feature association}
905:
906: All of the codes we compared use essentially the same cross-frame
907: association strategy of finding the association map that maximizes
908: overlap between features in adjacent frames, as described in
909: \S\ref{sub:Feature-association}. In practice, most features in most
910: frames overlap with exactly one feature in the following and adjacent
911: frames, so variations in the type of overlap (e.g. number of pixels
912: vs. amount of flux) or of permissiveness of overlap (e.g. including
913: pixels nearby each feature as part of the feature itself, for purposes
914: of finding overlap) only affect the {}``edge cases'' in which multiple
915: magnetic features are interacting, or in which a single feature is
916: moving rapidly.
917:
918: Overlap-style algorithms such as described in
919: \S\ref{sub:Feature-association}\ are quite robust for associating
920: features with the properties $\forall
921: i,j:\left|\overrightarrow{x_{i}}-\overrightarrow{x_{j}}\right|\gtrsim
922: r_{i}$ and $\forall
923: i:\left|\Delta\overrightarrow{x_{i}}\right|\lesssim r_{i}/2$, where
924: $i$ and $j$ are indices across features, $\overrightarrow{x_{i}}$ is
925: the centroid location of feature $i$, $r_{i}$ is the typical radius of
926: feature $i$, and $\overrightarrow{\Delta x_{i}}$ is the displacement
927: vector of feature $i$ across frames. Fast moving features with
928: $\left|\Delta\overrightarrow{x_{i}}\right|\gtrsim r_{i}/2$ become
929: subject to the \emph{mistaken identity problem}, where they are
930: identified as a different feature in different frames. The mistaken
931: identity problem affects statistical feature-lifetime and feature
932: history results even if only a very few features are subject to it,
933: because a single fast-moving feature may register as a very large
934: number of separate magnetic features. The only reliable way to beat
935: the mistaken identity problem is to use high enough time resolution in
936: the data. Marginal data in which the fastest moving features have
937: $\Delta\overrightarrow{x_{i}}\thickapprox r_{i}$ may be improved by
938: interpolating interstitial frames, but wider separations cannot be
939: helped by that method. In practice, we found by visual inspection that
940: 12-minute final effective cadence with direct boxcar averaging was not
941: sufficient to avoid the mistaken identity problem for the fastest
942: moving features at the MDI {}``full disk'' resolution (1.4 Mm pixels
943: at Sun center), accounting for \textasciitilde{}0.5\% of features in a
944: given frame and perhaps 5\% of total features identified by SWAMIS,
945: but that 12 minute final effective cadence with anti-aliasing in the
946: time direction (time-weighted averaging of 1 minute cadence
947: magnetograms, with a 12 minute FWHM Gaussian weighting profile)
948: eliminates virtually all cases of mistaken identity.
949:
950: We considered, but did not implement, various methods to reduce the
951: mistaken identity problem in cases where high enough cadence data are
952: not available. Promising directions to try include linear
953: extrapolation of feature location from last associated location before
954: the overlap calculation, with or without backtracking on the time
955: axis; dilation with size checking; and simulated annealing of feature
956: association. We suspect that all such algorithms are likely to
957: include more faulty associations than does direct overlap.
958:
959:
960: \section{Recommendations on Accepted Practice\label{sec:recommendations}}
961:
962: To enable meaningful comparison and reproducibility of tracking results
963: across research groups, we recommend the following techniques as appropriate
964: for most applications of magnetic feature tracking.
965:
966: \subsection{Data preprocessing}
967:
968: While preprocessing of data is not technically a part of feature tracking,
969: preprocessing can affect the statistics of image tracking and therefore
970: warrants mention here. We discuss despiking, time averaging, and resampling
971: into a desired coordinate system.
972:
973:
974: \paragraph{Despiking}
975:
976: A brief note on space-based magnetograms is in order: \emph{SOHO}/MDI
977: is, and presumably SDO/HMI will be, susceptible to cosmic ray impacts.
978: A typical MDI {}``full-disk'' magnetogram has evidence of cosmic
979: ray impacts in $~10^{2}$ pixels, so 5-10 minute averages may have
980: as many as $~10^{3}$ bad pixels caused by cosmic rays. The cosmic
981: rays are not saturated in the images, and may have either negative-going
982: or positive-going direction. These cosmic rays can skew the size,
983: strength, and lifetime statistics of small, short-lived features if
984: not considered. We recommend either despiking sequences of space-based
985: magnetograms with a second time derivative technique such as ZSPIKE
986: (\citealt{DeForest2004b}), or imposing a lifetime threshold on detected
987: features to limit the effects of cosmic rays.
988:
989:
990: \paragraph{Time and spatial averaging}
991:
992: Time averaging of images is useful as a preparatory step to reduce
993: noise in the magnetograms and to smooth features for better association
994: across frames. There are several sources of noise in currently available
995: magnetograms, with different statistics for each source; we discuss
996: them briefly here, as the noise characteristics of averaged data sets
997: hold a complex relationship to the noise characteristics of individual
998: frames.
999:
1000: Most magnetographs are photon-limited, so that there is an approximately
1001: Gaussian distribution noise source of \emph{photon noise} associated
1002: with photon counting statistics in each pixel of each magnetogram.
1003: Each pixel contains an independent sample of this noise source. Magnetographs
1004: such as MDI that assemble multiple exposures are subject to \emph{shutter
1005: noise}, which results from very slightly different exposure times
1006: across each independent exposure used to produce the magnetogram:
1007: shutter noise is an approximately Gaussian distribution noise source
1008: that is added to the common mode of all pixels across each image.
1009: Finally, solar evolution (and, for ground-based telescopes, seeing
1010: effects) across the time of assembly of the magnetogram induces an
1011: additional noise source, \emph{evolution noise} that is dominated
1012: by the evolution of granules. Granulation, and the associated evolution
1013: noise, has about one independent sample every five minutes, per square
1014: megameter of solar surface area.
1015:
1016: Individual MDI magnetograms have about equal amounts of photon and
1017: evolution noise. Because the photon noise is independently sampled
1018: in each image, averages of more than about five minutes of magnetic
1019: data tend to be dominated by evolution noise, which is attenuated
1020: much more slowly by further averaging.
1021:
1022: Anti-aliased time averaging, using overlapping Gaussian or Hanning
1023: windows in the time domain, is preferable to simple boxcar averaging,
1024: which has frequency sidelobes that allow more noise to enter the data.
1025:
1026:
1027: \paragraph{Image resampling}
1028:
1029: Image sequences are typically resampled to remove the solar rotation
1030: and perspective \emph{a priori.} For the present work, we derotated
1031: and prepared time averages of MDI magnetograms using a simple interpolation
1032: scheme into plate caree coordinates. This scheme follows current common
1033: practice but is not recommended: it fails to preserve small-scale
1034: feature statistics in two important ways.
1035:
1036: First, the plate caree ({}``lon/lat'') map projection is \emph{non-authalic}:
1037: a feature of unit area on the surface of the Sun may have different
1038: areas in plate caree coordinates, depending on its latitude. To avoid
1039: skewing the statistics of flux content, authors should use an authalic
1040: (equal-area) projection to prepare the data before tracking. The area
1041: of a feature in the plate caree projection is scaled by a factor of
1042: secant(latitude). A simple way to compensate is to scale the vertical
1043: or horizontal scale by cos(latitude) at each point. Scaling the vertical
1044: axis by cos(latitude) yields the common \emph{sin-lat cylindrical
1045: projection}, so named because integrating the scale factor $y'_{map}~cos(lat)$
1046: yields $y_{map}~sin(lat)$. Scaling the horizontal axis yields the
1047: \emph{sinusoidal projection}. Other useful authalic choices include
1048: the Hammer/Aitoff elliptical projection used by the cosmology community
1049: and Lambert's azimuthal equal-area projection, which minimizes linear
1050: distortion near the origin. Many useful projections have been cataloged
1051: by \citet{Snyder1987}.
1052:
1053: Secondly, linear interpolation leaves much to be desired as a resampling
1054: method, skewing (among other things) the noise profile of individual
1055: pixels and potentially introducing large amounts of distortion into
1056: the statistics of small features. A statistically sound, photometrically
1057: accurate resampling method, relying on spatially variable sampling
1058: filters, has been described by \citet{DeForest2004}; that or similar
1059: techniques are recommended for preparing data for survey applications.
1060:
1061: For virtually every application of tracking, it is important to compensate
1062: by rigid rotation based on the differential rotation speed at a particular
1063: point in the field of view, and not by differentially rotating every
1064: pixel in the image independently. The former preserves the actual
1065: evolving spatial structures in question; the latter only preserves
1066: the plasma reference frame at the start of the observing run.
1067:
1068:
1069: \subsection{Feature discrimination \& identification}
1070:
1071: We recommend combining the three methods of feature detection. Standard
1072: codes should use a dual-discriminator scheme for detection: an initial
1073: convex-core discrimination as in CURV, followed by dilation
1074: to a low noise threshold. This combination takes best advantage of
1075: the extra discrimination afforded by the convex core technique, while
1076: eliminating some of the difficulties of identifying oddly-shaped and
1077: large features.
1078:
1079: Feature identification should use the downhill method to avoid pathologies
1080: of the clumping technique, particularly when used for motion tracking
1081: and to identify interacting magnetic features; but for applications
1082: where larger clusters of flux are important, we recommend keeping
1083: track of groups of touching or nearby features according to a clumping
1084: algorithm. Groups of mutually touching features are the same loci
1085: as would be identified by a direct clumping scheme, but tracking individual
1086: peaks within the group affords better localization of the magnetic
1087: flux that makes up the feature(s). This can be accomplished either
1088: by maintaining a table of mutually touching features or by using a
1089: dual-labeling scheme at the feature-identification step.
1090:
1091:
1092: \subsection{Feature association}
1093:
1094: For best general purpose utility, we recommend a flux-weighted maximum
1095: overlap method of association between frames, as is currently used
1096: by SWAMIS; for example, in cases of associative conflict such as Figure
1097: \ref{fig:assoc}, regions B and C would be associated as identical,
1098: region A would be classified as dying by merger into B/C, and region
1099: D would be classified as originating by fragmentation from B/C. For
1100: analyses that require feature identification, it is important to ensure
1101: that the cadence is sufficient to allow associated features in adjacent
1102: frames to overlap. While more sophisticated motion-correlation algorithms
1103: are in principle feasible, they add complexity and fallibility that
1104: is not necessary provided that the data have high enough cadence.
1105:
1106: It is notable that no local overlap algorithm agrees with a human
1107: observer in all cases, as human observers use more information than
1108: strict overlap -- including something like a predictor/corrector position
1109: algorithm. Maximum overlap works well in the case where the motion
1110: of all features is small compared to their width divided by the time
1111: step. If the time step is too long, small features can move more than
1112: their diameter in a single frame, leading to the mistaken identity
1113: problem, where a single visually identifiable feature frequently
1114: changes its identity in the tracked data. In such cases, one can (i)
1115: use faster frame rates, (ii) generate dilated feature masks for association,
1116: (iii) use linear location extrapolation to account for the large interframe
1117: motion, and/or (iv) use a minimum-distance criterion rather than maximum
1118: overlap.
1119:
1120:
1121: \subsection{Feature tabulation}
1122:
1123: When tabulating feature histories, we recommend that the following
1124: minimum information be kept for each feature, and for each frame for
1125: which a particular feature exists: area (A), total flux ($\Phi$),
1126: flux-weighted average location (\emph{x,y}), and flux-weighted quadrupole
1127: moments ($<\Phi^{2}dx^{2}>,<\Phi^{2}dy^{2}>,<\Phi^{2}dxdy>$), for
1128: a total of 7 numerical quantities per feature per frame. The quadrupole
1129: moments, in particular, summarize tersely and simply the shape of
1130: the feature, and the features detected by the downhill dilation method
1131: tend to be simple shapes that are readily described with the quadrupole
1132: moment set. Quantities may be kept in physical or image units (e.g.
1133: km or pixels). Quantities which we recommend avoiding are: pixel value
1134: maximum and variance, which depend on resolution and phase of the
1135: underlying feature relative to the pixel grid; and non-weighted average
1136: location, because it is more dependent on noise-dominated pixels at
1137: the feature's edge than is the flux-weighted average location.
1138:
1139:
1140: \subsection{Event identification}
1141:
1142: Several of the scientific applications of tracking require classifying
1143: the origin and demise of each feature based on visual heuristics for
1144: the underlying physics. Useful event classification requires characterizing
1145: the geometry and manner of change of nearby features. Event classification
1146: is a rich topic that is not fully discussed in this paper; however,
1147: we make some brief recommendations.
1148:
1149: We recommend classifying origin events into four categories: (i) \emph{isolated
1150: appearance}, in which a particular feature appears in the absence
1151: of interaction with surrounding detected features; (ii) \emph{balanced
1152: emergence}, in which a bipolar, approximately balanced pair of features
1153: appear together in nearly the same location at nearly the same time;
1154: (iii) \emph{unbalanced emergence}, in which a new feature appears
1155: next to a pre-existing, opposite sign feature in a nearly flux-conserving
1156: manner; and (iv) \emph{fragmentation} (or splitting), in which a single
1157: pre-existing feature breaks up into multiple smaller features in a
1158: nearly flux-conserving manner. Demise events should be classified
1159: in the exact same way as origin events, in a time reversed sense:
1160: (i) \emph{isolated disappearance}; (ii) \emph{balanced cancellation};
1161: (iii) \emph{unbalanced cancellation}; and (iv) \emph{merging}. For
1162: both origin and demise events, (i) is the only recognized case that
1163: apparently violates conservation of flux; (ii) corresponds to isolated
1164: passage through the photosphere of a magnetic loop; and (iv) represents
1165: reshuffling of existing flux. For completeness, event identification
1166: software should also maintain a \emph{complex} class for events which
1167: cannot be classified easily into the above four groups, including
1168: such events as isolated asymmetric emergence that violate conservation
1169: of magnetic flux.
1170:
1171: It is important to understand that this is a \emph{visual} classification
1172: scheme, to be more fully developed in future work. Interpretation
1173: of these visual events in terms of physical mechanisms is neither
1174: straightforward nor obvious. For example, appearance events may or
1175: may not correspond to new flux on the solar surface.
1176:
1177: Physical modeling of feature behavior requires some care. In particular,
1178: only some emergence events (bipolar emergence) appear to be due to flux
1179: tubes that emerge from below the surface of the Sun
1180: (\citealt{HarveyMartin1973,HarveyKL1993,Chae2001}). Such events should give
1181: rise to two oppositely signed magnetic features that grow together and
1182: separate in a divergent surface flow
1183: (\citealt{Hagenaar2001,Hagenaar2003,SimonTitleWeiss2001}), and the
1184: origin detection code in SWAMIS and in CURV was originally intended to
1185: identify such events. However, proportionally few magnetic
1186: features are observed to originate with this \emph{balanced emergence}
1187: mechanism. New small features can also form by \emph{fragmentation}
1188: of pre-existing large features into like-signed fragments; this
1189: process is also called \emph{calving} if the new feature is small
1190: compared to the surviving feature. Furthermore, many features simply
1191: \emph{appear}, without any surrounding flux at all or in ways that
1192: appear to violate flux conservation. The nature of these appearances
1193: -- whether \emph{coalescence} of existing weak flux or
1194: \emph{unbalanced emergence} with one large, weak-field pole and one
1195: small, strong-field pole, will be considered in detail in Paper II of
1196: this series.
1197:
1198: Results using our recommended classification scheme should be presented
1199: together with a notation describing what criteria are used to detect
1200: balanced changes in the flux of interacting features. Event classification
1201: results can be quite different, for example, if the changes in the flux
1202: of two interacting features are considered {}``approximately balanced''
1203: if they merely have opposite sign, or if they must agree within,
1204: say, 10\%.
1205:
1206:
1207: \section{\label{sec:Conclusions}Conclusions}
1208:
1209: We have compared four magnetic feature tracking codes by applying
1210: them to the same preprocessed set of magnetic data. Feature tracking
1211: code output is sensitive to a variety of decisions that are made during
1212: development, and this sensitivity is a reason why it has historically
1213: been difficult to reproduce results obtained by feature tracking:
1214: it is crucial to explain exactly what algorithm is being used. In
1215: particular, codes that were designed for one regime of study (e.g.
1216: very small intranetwork flux concentrations or very large, strong
1217: features) should not be applied to different regimes of detection
1218: without careful study, and all discrimination and association techniques
1219: need to be lain out exactly as performed.
1220:
1221: The difficulty of reproducing apparently simple results in feature
1222: tracking appears to stem both from the complicated, noisy nature of
1223: the magnetograph data and from the complexity of the underlying
1224: structures. The solar magnetic field is not divided into well
1225: separated, strongly magnetized features; rather, there is a continuum
1226: of feature sizes due to the clustering behavior of the field across
1227: scales, in keeping with the concept of \emph{magnetochemistry}
1228: outlined by \citet{Schrijver1997}. Bulk summary characteristics such
1229: as the lifetime of individual features or the size distribution of the
1230: features depend strongly both on the instrument being used to image
1231: the magnetic field and on threshold and related decisions made during
1232: code development.
1233:
1234: All of our codes agree reasonably well on important summary
1235: characteristics in a particular circumscribed range of scales and
1236: lifetimes, indicating that there is an underlying pattern to be
1237: measured; but the region of agreement (which we take to be the range
1238: of valid measurement using the tracking codes) is much smaller than
1239: might be surmised from cursory analysis of the output of any one
1240: algorithm. We conclude that particular care must be used when
1241: interpreting magnetic tracking results, which are often much weaker
1242: than might be surmised given the apparent clarity of solar magnetic
1243: features in magnetogram sequences.
1244:
1245: In particular, we find that the magnetic turnover time, perhaps the most
1246: accessible summary result to come out of magnetic tracking studies, is
1247: also perhaps the weakest result to come out of magnetic tracking studies.
1248: Average feature lifetimes are only weakly related to magnetic turnover time
1249: in the best of circumstances, and we have found that average lifetime
1250: measurements are strongly dependent on the code being used to perform
1251: the measurement.
1252:
1253: By comparing and contrasting the algorithms of our four separate
1254: codes, we have determined why the they produce different results for
1255: the flux distribution in quiet sun, and evaluated under what
1256: circumstances each technique performs best. Further, we have made
1257: recommendations about how to improve feature detection and
1258: reproducibility in feature tracking for future work. To aid that work,
1259: all four of our codes are being made available to the scientific
1260: community in source-code form via \emph{solarsoft. }
1261:
1262: Specific physical problems such as flux emergence and cancellation,
1263: diffusion of active region flux and plage formation, and feature
1264: lifetime, will be covered in more detail in future papers in this
1265: series.
1266:
1267:
1268: \bibliographystyle{plainnat}
1269: \clearpage\addcontentsline{toc}{chapter}{\bibname}
1270:
1271: \begin{thebibliography}{27}
1272: \providecommand{\natexlab}[1]{#1}
1273: \providecommand{\url}[1]{\texttt{#1}}
1274: \expandafter\ifx\csname urlstyle\endcsname\relax
1275: \providecommand{\doi}[1]{doi: #1}\else
1276: \providecommand{\doi}{doi: \begingroup \urlstyle{rm}\Url}\fi
1277:
1278: \bibitem[{Chae} et~al.(2001)]{Chae2001}
1279: {Chae}, J. et~al. 2001,
1280: \newblock \apj, 548, 497
1281:
1282: \bibitem[{DeForest}(2004)]{DeForest2004}
1283: {DeForest}, C.~E. 2004,
1284: \newblock \solphys, 219, 3
1285:
1286: \bibitem[{DeForest} and {Lamb}(2004)]{DeForestLamb2004}
1287: {DeForest}, C.~E. and {Lamb}, D.~A. 2004,
1288: \newblock BAAS, 204
1289:
1290: \bibitem[{DeForest} (2004)]{DeForest2004b}
1291: {DeForest}, C.~E. 2004,
1292: \newblock \apj, 617, L89
1293:
1294: \bibitem[{Fossum} and {Carlsson}(2004)]{Fossum2004}
1295: {Fossum}, A. and {Carlsson}, M. 2004,
1296: \newblock ESA SP-547, 125
1297:
1298: \bibitem[{Hagenaar}(2001)]{Hagenaar2001}
1299: {Hagenaar}, H.~J. 2001,
1300: \newblock \apj, 555, 448
1301:
1302: \bibitem[{Hagenaar} et~al.(1999)]{Hagenaar1999}
1303: {Hagenaar} H.~J. et~al. 1999,
1304: \newblock \apj, 511, 932
1305:
1306: \bibitem[{Hagenaar} et~al.(2003)]{Hagenaar2003}
1307: {Hagenaar}, H.~J. , {Schrijver}, C.~J., and {Title}, A.~M.
1308: \newblock \apj, 584, 1107
1309:
1310: \bibitem[{Handy} et~al. (1999)]{Handy1999}
1311: {Handy}, B.~N. et~al. 1999
1312: \newblock \solphys, 187, 229
1313:
1314: \bibitem[{Harvey}(1993)]{HarveyKL1993}
1315: {Harvey}, K.~L. 1993
1316: \newblock Ph.D.~Thesis, Univ. of Utrecht
1317:
1318: \bibitem[{Harvey} and {Martin}(1973)]{HarveyMartin1973}
1319: {Harvey}, K.~L., and {Martin}, S.~F. 1973,
1320: \newblock \solphys, 32, 389
1321:
1322: \bibitem[{Lamb} and {Deforest}(2003)]{LambDeForest2003}
1323: {Lamb}, D.~A. and {Deforest}, C.~E. 2003,
1324: \newblock AGU FMA, B530
1325:
1326: \bibitem[{Leibacher}(1995)]{Leibacher1995}
1327: {Leibacher}J.~W.~et~al. 1995,
1328: \newblock ASP Conf. Ser., 76, 381
1329:
1330: \bibitem[{Leibacher}(1999)]{Leibacher1999}
1331: {Leibacher}, J.~W. 1999,
1332: \newblock Adv. Sp. Res., 24, 173
1333:
1334: \bibitem[{Parker}(1988)]{Parker1988}
1335: {Parker}, E.~N. 1988,
1336: \newblock \apj, 330, 474
1337:
1338: \bibitem[{Parnell}(2002)]{Parnell2002}
1339: {Parnell}, C.~E. 2002,
1340: \newblock \mnras, 335, 389
1341:
1342: \bibitem[{Parnell} and {Jupp}(2000)]{ParnellJupp2000}
1343: {Parnell}, C.~E. and {Jupp}, P.~E 2000,
1344: \newblock \apj, 529, 554
1345:
1346: \bibitem[{Scherrer} et~al.(1995)]{Scherrer1995}
1347: {Scherrer}, P.~H. et~al. 1995,
1348: \newblock \solphys, 162, 129
1349:
1350: \bibitem[{Schrijver} et~al.(1997)]{Schrijver1997}
1351: {Schrijver} C.~J. et~al. 1997,
1352: \newblock \apj, 487, 424
1353:
1354: \bibitem[{Schwer} et~al.(2002)]{Schwer2002}
1355: {Schwer}, K.~ et~al. 2002,
1356: \newblock AGU FMA, C1
1357:
1358: \bibitem[{Simmons}(1972)]{Simmons}
1359: {Simmons}, G.~F. 1972,
1360: \newblock McGraw-Hill
1361:
1362: \bibitem[{Simon} et~al.(2001){Simon}, {Title}, and {Weiss}]{SimonTitleWeiss2001}
1363: {Simon}, G.~W., {Title}, A.~M., and {Weiss}, N.~O. 2001,
1364: \newblock \apj, 561, 427
1365:
1366: \bibitem[{Snodgrass}(1983)]{Snodgrass1983}
1367: {Snodgrass}H.~B. 1983,
1368: \newblock \apj, 270, 288
1369:
1370: \bibitem[Snyder(1987)]{Snyder1987}
1371: {Snyder}, J.~P. 1987,
1372: \newblock USGS Prof. Papers, 1395
1373:
1374: \bibitem[{Strous} et~al.(1996)]{Strous1996}
1375: {Strous}, L.~H. et~al. 1996,
1376: \newblock \aap, 306, 947
1377:
1378: \bibitem[{Welsch} and {Longcope}(2002)]{Welsch2002}
1379: {Welsch}, B.~T. and {Longcope},D.~W. 2002,
1380: \newblock ESA SP-505, 611
1381:
1382: \bibitem[{Welsch} and {Longcope}(2003)]{Welsch2003}
1383: -----------. 2003,
1384: \newblock \apj, 588, 620
1385:
1386: \bibitem[{Welsch} et~al.(2004){Welsch}, {Fisher}, {Abbett}, and
1387: {Regnier}]{Welsch2004}
1388: {Welsch}, B.~T. et~al. 2004,
1389: \newblock \apj, 610, 1148
1390:
1391: \end{thebibliography}
1392:
1393:
1394:
1395:
1396: \appendix{Glossary}
1397:
1398: Feature tracking and magnetic observations are mature enough to have
1399: developed a collection of commonly used terms, which unfortunately
1400: have drifted into slightly different usage in different locations.
1401: In an attempt to regularize terminology, we present a glossary of
1402: commonly used terms, with their recommended definitions. Also, because
1403: some terms are strictly observational and others imply a physical
1404: model, we have noted which are which.
1405:
1406:
1407: \paragraph{\emph{Object descriptions}}
1408:
1409: \begin{description}
1410: \item [bipole]- a pair of magnetic features of opposite sign and approximately
1411: equal flux content, that appear to be associated (as in bipolar \textbf{emergence}).
1412: When seen to emerge together, the poles of a bipole may be associated
1413: observationally.
1414: \item [ephemeral~region]- a resolved small bipole with particular properties
1415: as described by \citet{Hagenaar2001}.
1416: \item [feature]- a visually identifiable part of an image, such as a clump
1417: of magnetic flux or a blob in a magnetogram. The term {}``feature''
1418: is purely observational and is preferable to {}``flux concentration''
1419: or {}``ephemeral region'' when describing individual visual objects
1420: in an image. The specific definition of a feature is dependent on
1421: both the Sun itself and the characteristics of the observing telescope.
1422: \item [flux~concentration]- a localized cluster of magnetic flux, with
1423: or without resolved substructure. A flux concentration may consist
1424: of one or more magnetic features. While somewhat vague, the definition
1425: of a flux concentration is approximately independent of observing
1426: telescope: a flux concentration may appear as a single feature when
1427: seen with one instrument but as several features with another.
1428: \item [fragment]- a small piece of a larger magnetic structure, \emph{not}
1429: a generic small bit of magnetic flux. Usage: {}``this magnetic flux
1430: concentration is composed of many fragments'', or {}``unresolved
1431: fragments make up this magnetic feature''. {}``Fragment'' should
1432: not be used interchangeably with {}``feature'', as it implies that
1433: the subject is part of a larger whole, while {}``feature'' does
1434: not.
1435: \item [monopole]- a lone magnetic pole (thought to be physically impossible).
1436: \item [stenflo]- (after J. Stenflo) a tiny, strong concentration of order
1437: $10^{17}$ Mx of flux. Usage: {}``The asymmetric formation of flux
1438: concentrations in the network may be due to convergence of stenflos,
1439: though Stenflo himself may object to this terminology.''
1440: \item [unipole]- a single magnetic feature with no obvious associated feature
1441: of the opposite sign. The photospheric boundary provides a {}``hiding
1442: place'' for the opposing pole, so that unipoles are thought not to
1443: be monopoles. Oppose {}``bipole'', {}``monopole''.
1444: \end{description}
1445:
1446: \paragraph{\emph{Event} \emph{descriptions}}
1447:
1448: \begin{description}
1449: \item [appearance]- used specifically to describe the origin of a single
1450: unipolar feature where there were none before. Appearances appear
1451: to violate conservation of magnetic flux, but probably result from
1452: flux hiding under the noise floor of an instrument -- so the definition
1453: of {}``appearance'' depends on the instrument being used.
1454: \item [asymmetric~emergence]- emergence in which the two sides of the
1455: emerging magnetic loop of flux have quite different cross sections,
1456: perhaps reducing the field strength of the larger leg of the loop
1457: below the detection threshold of an instrument. This can be a physical
1458: description of one type of feature appearance; coalescence is another
1459: type. Note that {}``asymmetric emergence'' and {}``unbalanced emergence''
1460: are not synonyms.
1461: \item [balanced~emergence]- emergence in which the two final opposing-sign
1462: features have approximately the same magnitude; this is the type of
1463: emergence predicted by a simple model of magnetic flux tubes rising
1464: through the photosphere. Compare {}``emergence''; contrast {}``unbalanced
1465: emergence''.
1466: \item [balanced~cancellation]- cancellation in which the two initial opposing-sign
1467: features have approximately the same magnitude. Compare {}``cancellation'';
1468: contrast {}``unbalanced cancellation''. Balanced cancellation is
1469: the time reversal of balanced emergence.
1470: \item [calving]- a form of fragmentation in which one of the daughter features
1471: contains much more flux than the other, by analogy to the behavior
1472: of icebergs. Usage: {}``This movie shows small features calving off
1473: of the main flux concentration''. Oppose {}``splitting''; compare
1474: {}``fragmentation''.
1475: \item [cancellation]- the demise of a magnetic feature that collides (and
1476: cancels) with an opposing sign feature, in such a way that flux is
1477: approximately conserved. Compare {}``balanced cancellation'', {}``unbalanced
1478: cancellation''; contrast {}``disappearance''.
1479: \item [coalescence]- the collection of diffuse flux from below detection
1480: threshold to a small, denser feature that can be detected. This may
1481: be an example of unresolved merging. This is a physical description
1482: of one type of feature appearance; asymmetric emergence is another
1483: type. To avoid confusion, eschew {}``coalescence'' when describing
1484: observational results; use {}``merging'' or {}``appearance'' instead.
1485: \item [demise]- the end of a magnetic feature's existence.
1486: \item [disappearance]- the end of a single, unipolar magnetic feature that
1487: {}``fades away'' to nothing in the absence of nearby features (the
1488: time reversal of an {}``appearance'').
1489: \item [dispersal]- deprecated. This has been used to describe the opposite
1490: of coalescence, the breakup of strong flux concentrations into many
1491: fragments, and the diffusion of flux across the surface of the Sun.
1492: It is now too ambiguous to be used clearly in most cases.
1493: \item [emergence]- the origination of two balanced, opposing magnetic features
1494: nearby one another in such a way that flux is approximately conserved.
1495: This observational definition follows the common physical definition
1496: of a loop of flux emerging from below the surface. Compare {}``balanced
1497: emergence'', {}``unbalanced emergence''; contrast {}``appearance''.
1498: Emergence is the time reversal of {}``cancellation''.
1499: \item [fragmentation]- the breakup of a single magnetic feature into at
1500: least two like-sign features. (compare {}``splitting'', {}``calving'')
1501: \item [merging]- the joining of two magnetic features of similar sign into
1502: a single larger feature.
1503: \item [splitting]- the breakup of a single magnetic feature into at least
1504: two like-sign features, with the implication of rough flux balance
1505: between the two daughter features. (oppose {}``calving''; compare
1506: {}``fragmentation'').
1507: \item [unbalanced~emergence]- emergence in which the two final opposing-sign
1508: features have different magnitudes due to interaction with a nearby
1509: unipolar feature. Compare {}``emergence''; contrast {}``fragmentation'',
1510: {}``balanced emergence''. Unbalanced emergence is the time-reversal
1511: of unbalanced cancellation.
1512: \item [unbalanced~cancellation]- cancellation that is not complete because
1513: one of the canceling features contains more flux than the other. Compare
1514: {}``cancellation''; contrast {}``merging'', {}``balanced cancellation''.
1515: \end{description}
1516:
1517: \acknowledgements{Thanks to the SOHO/MDI team for kind use of their data, and to the
1518: University of St. Andrews for hosting the workshop which made this
1519: comparison possible. This work was funded by NASA's SOHO project,
1520: the SOHO/MDI effort, NASA's SEC-GI program, the Air Force Office of
1521: Scientific Research MURI program, and the PPARC Advanced Fellowship
1522: program. SOHO is a project of international collaboration between
1523: NASA and ESA.}
1524: \end{document}
1525: