# Search Strategies

Once we know what to look for and the dimensions of the haystack it may be hiding in (see Waterhole) it’s time to develop an arsenal of tools to find it.
SETI Net runs on the concept of Falsification. This is the idea that for a hypothesis to be promoted to a scientific theory it must be falsifiable. This section defines the use of falsification and details each of the tests used.

# Falsification

In the area of the philosophy of science Karl Popper is well known. His idea that for a hypothesis to be considered to be scientific it must be able to be proven false. A well-known example of this is the Black Swan:

“One notices a white swan. From this one can conclude:

At least one swan is white.

From this, one may wish to conjecture:

All swans are white”.

Notice that there is no practical way to prove this conjecture to be true unless you examined each and every swan on Earth. Not only on Earth but in the universe. But the conjecture is easy to prove wrong simply by observing one non-white swan. No need to look throughout the universe after finding that single swan.

So, when the SETI station collects a candidate signal it sets about looking for the non-whiteness of the signal – It attempts to falsify it. It does this through a series of detection gates or conditional tests.

These tests take the form:

Hypotheses:

There is a signal at frequency X that is from ET

The falsification is then:

The signal at frequency X is not ET because of condition Y

If the hypotheses cannot be falsified because of Y it is subjected to Y+1 and so on until you run out of Y’s. Then the hypotheses remains un-falsified. It never becomes true.

This seems counter intuitive and it is. After all the system is called the Search which implies success at some point, that the signal at X really is an ET, but look at it from a practicable point of view. If a signal is subjected to all the conditional tests available to this SETI station and it still passes the next logical step would be to refer it to a second station where it could be subjected to a different set of conditions. If it passes those as well it should be sent to a third and then fourth   until you run out of stations. This still only tells you that you have not found or invented a conditional test to falsify it.

After passing all tests known the hypothesis becomes a scientific theory. Wikipedia defines it this way:

A scientific theory is a well-substantiated explanation of some aspect of the natural world, based on a body of facts that have been repeatedly confirmed through observation and experiment. Such fact-supported theories are not "guesses" but reliable accounts of the real world.

If a later, newly devised test disproves the hypothesis it becomes falsified.

In this SETI station the hypothesis comes from the search software itself as it manipulates the hardware components of the station. For example: during a long search run the station detects a signal that fits some of the criteria of what an ET could be sending; It’s narrow band, not to strong and in the expected part of the spectrum. The station then switches to falsification mode in an attempt to find that this is NOT from ET. The station does not switch to a mode to prove that it IS from ET, in fact there is no mode like that to switch to. The station is applying the Popperian process of falsification to the hypothesis to show that it is NOT ET.

# Basic Detection

The Spectrum Analyzer is a stand along module written to run exclusively on Windows 64-bit operating system. It is also a HTTP server that can be directed by its client, the SETI Control panel. This allows the Spec Ana to run on a different machine or in a different location.

The basic  detection process is performed completely within the Spec Ana and will run without the SETI Control panel at all. It does the heavy lifting, bit shoving and signal line detection by itself. This is how that happens.

The Spec Ana software captures the output of the connected receiver and builds a buffer of ‘lines’. Each line captured is of a 20 KHz wide segment of the tuned frequency. That segment is digitized, put through a 32 K point FFT and presented on the face of the Spec Ana. The above figure shows one captured line. At this resolution, each line consists of 16 K bins of signal amplitude and each bin corresponds to 1.3 Hz of the original 20 KHz segment. That is one line.

Each of these lines of raw digitized receiver output is placed in a buffer as they come in. The buffer is 16 K wide and normally 20 K deep of integer values. The buffer makes up one ‘Frame’ of data. The number of Lines per Frame can be selected to fit the falsification scenario but the station can collect about 10 frames / second. A typical frame buffer 20,000 deep will take a little more than a half hour to fill.

A frame buffer can be quite large. A typical half hour run at one 20 Hz segment of the band will require a frame buffer of 20,000 * 15,000 * 4 (integers are 4 bytes each) or about 1.3 Gig bytes of main memory. This can only be managed with a Windows 64-bit operating system that allows manipulation of large arrays.

After the frame buffer is instantiated and cleared in memory the Spec Ana will go about filling it.

 Average Buffer
 Neighborhood Scanner
 Frame Buffer
 ‘Hit’
 ‘Noise’

The difference between Noise and a possible ‘Hit’ is described in the section ‘Chirp’.

# Setup Parameters

The following settings are available to the operator over the Frame Buffer

·         Lines Per Frame – The number of lines to collect for one Frame

·         Integration – The time to integrate each of incoming samples

·         Segment Size – The size, in bins, of the neighborhood to be scanned

·         Line Limit – The maximum signal acceptable as a candidate ‘Hit’

·         Threshold – The trigger level applied by the ‘hood scanner

# Integration

Incoming digitized data is integrated as it arrives. This process improves the signal to noise level of the signal but it is not a free lunch. I generally run the integration at 25 – 30 seconds which will pull very small signals from the mud but if I increase integration time to more than about 100 seconds the data in the Frame Buffer will be ‘flattened’ to the point of being unusable. This is because the Frame Buffer Search software depends on bin-to-bin differences in amplitude to make its decision and a buffer with little differences will not work. Too much integration defeats the process.

# Line Limiting

The fully operational system moves the receiver up band at a predetermined rate, scanning as it goes, and sooner or later it will run into a patch of junk signals that will be treated at possible ET and will cause the station to spend a lot of its time processing these false signals. Fortunately, this type of signal has one characteristic that makes it identifiable and ignorable – it is usually very loud.
The operator can set a maximum amplitude for each Bin that if exceeded will cause the signal to be ignored.

# Hit Threshold

The point at which a neighborhood limit test is determined to be a Hit is set by the operator. I like to use 20% but this is not hard and fast. More integration will allow a tighter for a lower value because the Frame Buffer will be flatter.

# Frame Buffer Fill

As integers come in from the FFT they are integrated and placed in each line of the buffer. Each line is also used to construct a WAV file and a JPG file. Twenty minutes of signal create about 50 Meg of WAV. This is a large file so to be useable you should be connected to a fast ISP. Twenty minutes would create a JPG too large to render so it is constantly overwritten every few minutes. The WAV and JPG files are saved locally and can be batch uploaded during off line operations.

# Frame Buffer Search

Filling the Frame Buffer will require, depending on the Lines Per Frame setting, up to 20 minutes. During this time, the Antenna will keep the target in line and the integration algorithm will pull any signal out of the noise that it can. At the end of filling the buffer the Frame Buffer search takes place.

When the buffer is completely filled each column of raw signal amplitude bins is averaged and placed in the Average Buffer. The neighborhood scanner then comes on and slides a window of Segment Size width across the buffer. For each target bin the scanner averages the bins to the left and right in the segment neighborhood. It then compares this value with the signal amplitude in the target bin.

If the result is greater than the threshold a ‘Hit’ is logged. The operator is signaled, by playing the Song of ET, the WAV and JPG files are finalized then tagged with the SETI Markup Language (SML) XML data. The embedded SML completely defines the state of the station when the Hit was found and allows the station to proceed without the operator being concerned with saving the file keeping notes. The ‘Hit’ is moved to the MySQL database on the main server immediately. The WAV and JPG files are move to a temporary local directly ready to be batch moved to the server when time permits.

The Spec Ana then goes on to collect more data about the Hit. Five more Frame Buffers are created, and the WAVs and JPGs are created and saved.  Therefor a complete Hit requires one hour twenty minutes to complete.

At this point the Spec Ana halts and waits for a restart from the operator or, more likely, from the controlling SETI software system.

# SETI Control Panel

The heavy lifting of basic detection is done completely in the Spectrum Analyzer but the SETI Control Panel is the thing that makes sense of it all.

This module is an HTTP client to the Spec Ana server so it can be located in a different computer.

It’s with this software module that the following extended detection logic takes place

·         Antenna Station Keeping – For long duration integration with large Frame Buffers the antenna must be kept trained on the target

·         Chirp – This makes use of the Doppler effect of the Earth’s rotation for detection

·         Birds – All systems have areas of the spectrum that is simply useless to scan because of self-generated spurs

·         Antenna Off Axis – One useful technique to falsify a ‘Hit’ is to move the antenna away from the signal and verify that It disappears. If it does not it is a local signal of some sort.

·         Receiver Image – When the receiver front end is being swamped by an out of band signal one of the symptoms is that when the frequency is moved up the detected signal moves up as well. Normally it moves down

·         Filter On/Off – The system allows the Band Pass Filter to be switched in and out. A signal that is outside the filter band pass should drop down by a very large amount when the filter is switched in.

# Antenna Station Keeping

The earth rotates to the East at a constant (thankfully) rate of 15 degrees per hour so that at the end of a 24-hour day (24 * 15 = 360) its back where it started. During that period, it also progresses in its path around the sun and this causes the stars, the targets in SETI Net, to appear to move as well. This sidereal day is just a bit shorted than a standard day by about 4 minutes. To keep the SETI net antenna pointed at a single target is the job of three modules in the system: The Mount, The Antenna and the Sky Map.

## Mount

The Mount is where the location on Earth is defined as well as the type of mount. Its used to set the Lat/Lon of the station, the nominal frequency in use so that the HPBW can be calculated from the Dish Diameter. It also defines the ‘No Fly’ zone of the mount. That is the mechanical stop that restricts that antenna movement to less than 360 Az so that it doesn’t wring its own neck. The Red and Green arrows show this to be almost due South around 180 degrees. It also defines the minimum and maximum El. The two red arrows show this to be about +6 degrees to 90 degrees.

## Antenna

The Antenna module manages the movement of the antenna itself. The Az and El dials on the right side are used to set the needed physical position of the antenna and drive the rotor motors to the correct location. It also controls the band Pass Filter in or out of the system.
When the Antenna is first stated the operator presses the At Home button (shown Green here) and the rotor motors are commanded to more to a known home location, in this case due south.

Once home the Antenna becomes ‘Ready’ and can be commanded from the Sky Map

## Sky Map

This module is responsible for commanding the Antenna to move to the target location in terms of DEC and Ra. The large white ‘spike’ on the image is the result of the ‘No Fly’ zone in the Mount. The sine wave looking figure is the horizon of the antenna. It makes a sine wavy because of the use of a Mercator projection of the sky. The white figure moves from right to left during the day.
The Sky Map is also responsible to keep the Antenna locked to the target. It recalculates the needed Az and El of the Antenna every second and sends a move command it the target is on the edge of the HPBW of the antenna.
On this image the antenna current position is shown on the left side by a circle (the HPBW). The current target is shown by a cross just below it. When they lock, the symbols overlap.

# Chirp

Chirping is the principal technique to sift local Earth bound signals, on land or orbiting above, from possible ET beacons. It makes use of the Earth’s rotation in this task.

When you think of chirp you think of a bird’s song. This chirp is a like that except that the chirp is always down at a calculated rate and is *very* slow on the order of 0.1 Hz / second.

## Doppler on Earth

There seems to be a common misunderstanding of the effect of Doppler shift. People seem to think that the approaching ambulance siren pitch gets higher and higher as it approaches you and then gets lower and lower as it moves away. This is not the case. Actually, the pitch *always* goes lower as it approaches you.

Think of it this way. You and not an ambulance siren expert so you don’t know what the pitch is supposed to be but a mile or so away one is approaching you. As long as it is not accelerating in its path toward you the pitch will remain the same. As it passes you the pitch will suddenly drop and remain at the lower pitch as it moves away from you.

If the drop in frequency is *very* sudden it will mean that the ambulance has just run over you.  The Astronomer John Dobson explained the effect thus:

“The reason the siren slides is because it doesn’t hit you.”

Normally, hopefully, this is not the case. If the ambulance is moving at an angle its radial velocity does not remain constant but instead varies as a function as a function of the angle between your line of sight and the siren by:

Where is the angle between the ambulances forward velocity and your line of sight.

Therefore, as the ambulance move toward you at an angle its siren will fall from higher than its actual pitch until it is at a right angle to you. At that point what you hear is its actual frequency. Then as it proceeds past you the pitch continues to fall until is out of hearing range.

It always falls.

## Earths Doppler

This same effect is put to use to sort signals that are local, on Earth or orbiting overhead, from possible signals from an ET beacon.

Consider a far distant target star. The SETI antenna is pointing due East but the target is below the horizon. Then the target rises and its beacon is detected. What frequency is that beacon being transmitted on. We are not exactly sure but we do know that because of the rotation of the Earth it appears to us to be higher than it actually is. The SETI antenna is in tracking mode so it keeps the target in its beam.

As the target rises higher and higher the frequency will fall just as the siren does on Earth. When the target is directly overhead, the beacon is at is actual frequency and as it moves further to the West it continues to fall until it final sets.

Back in the description of the Frame Buffer there is an image with a blue and a red lines. The blue line represents how the vertical columns of a local signal are filled. The maximum amplitude is saved in the single bin. The red lines, however are spread across many bins. When the Frame Buffer is completely filled and the search Spec Ana search algorithm comes on it will find the blue bin with a high degree of accuracy and it will miss the spread out red bins every time.

How to tell the difference?

One approach would be to know the Doppler shift of the Earth at each collection time and adjust the placement of the amplitude in the right bin. This would be a rather time consuming and calculation intensive operation. A better way is to adjust the received frequency in step with the Doppler.

That is how it’s done in the SETI Net software. During the search phase the Doppler shift is calculated from the Antennas Az/El through a set of Trig function’s and the known location of the station on Earth. The receiver is then adjusted down in frequency when it is necessary as the tracking system moves ahead.

The end result is that the ET beacon is contained one bin and the local signals are smeared over many and will be ignored.

For this technique to be effective the bin shift must move more than the size of the neighborhood scanned. The calculated shift of a target at an angel to direct East is on the order of 0.08 Hz/ sec so a 20,000-line Frame Buffer that fills at 10 lines/second will be bin shifted:

BinShift = (20,000/10)* -0.08
or 160 Bins

This is plenty to allow the smearing and concentration needed to take place.

Therefor the receiver will be started at a specific frequency and while collecting data will be shifted down in frequency. After a full 20,000 lines have been collected it will have moved down 160 bins or about 200 Hz at 1.3 Hz/bin.
At the end of a complete Scan the receiver is moved up to the next 20 KHz segment and the process repeated.

This technique is very effective. A single simulated signal, chirped at a rate in step with the calculated receiver chirp can be detected in a thicket of local signals.