Astr 328/428 Group Project #2

The Galaxy Two Point Correlation Function

Step 1: Simulated Expectations

There is a "chunk of the Virgo consortium universe" available for you here. The data come from a massive simulation of a cube of the universe measuring 140 Mpc on a side. Details of the simulation and the galaxy creation can be found at http://www.mpa-garching.mpg.de/Virgo/data_download.html The data give the x, y, and z coordinates in Mpc and star formation rate in solar masses per year of 8384 simulated galaxies.

We are going to define subsets of galaxies as "late types" (ie Sb/Sc spirals) and "early types" (ellipticals and S0's) based on their star formation rates. Let's say late types are things with SFR's > 1 Msun/yr, and early types are things w/ SFR's < 0.1 Msun/yr. (Does this definition make sense?)

Step 2: Observational Reality

Here now is a galaxy redshift catalog taken from the latest SDSS results (courtesy Prof Zehavi). This now gives you RA, Dec, redshift, and g-r color for SDSS galaxies (RA and Dec are both in degrees). This is a volume limited sample of L* galaxies, and I have trimmed the volume so that it is a rectangular volume (which will make life much easier for the analysis).

Go through the same steps as you did for the Virgo Simulation. In this case, let's divide our galaxies by color: make our "early types" galaxies with (g-r)>0.85, and our "late types" galaxies with (g-r)<0.65.

First we need to transform our dataset from observed spherical coordinates (RA, Dec, z) into physical cartesian coordinates (x,y,z). Turn redshift into distance using Hubble's constant (H0=72), and then convert spherical to cartesian. You can easily find spherical->cartesian conversions online, but just be careful of definitions: most definitions have an Dec-like angle measured from the z-axis, while Dec itself is actually measured from the celestial equator.

Once you've done this, you should be able to run your code on the SDSS data.

Remember, all throughout the process you should be thinking about two things:
And remember, your writeup should be ApJ-style, like last time.


Technical Note: Calculating the 2pt correlation function:

Remember the 2ptcf describes the probability of finding two galaxies seperated by a distance r over and beyond that expected from random distribution.

The simplest estimator for this is given by

1+xi(r) = DD(r)/RR(r)

where xi(r) is the 2ptcf, DD(r) is the number of pairs in the dataset with seperation r, and RR(r) is the number of pairs with seperation r that you'd expect just from random points. Note that this expression assumes equal total numbers of data points and random points.

This is easy to calculate, but very time-consuming (computing time ~ N2). I strongly recommend running your code using a compiled language (Fortran, C, etc),  not an interpreted one (Matlab, Supermongo).