CalCOFI Database 21Jun2006 Information (21 Jun 2006)

It has been quite a task to combine the 52-year archive into a single database.  To achieve this, a few compromises have been made: 

  • Line & Station numbering: this version uses “standard” Line.Sta as a basis for Sta_ID.  The Sta_ID is built from the nearest Standard CalCOFI station (± 1nm).  Three columns are listed for each station and line.  First is “reported” - the Line or Station recorded in the IEH file is the first column (labeled Line or Sta).  Second is “nearest standard station” according to the CalCOFI 1950 station pattern (labeled StndL or StndS).  Thirdly, the “calculated Line or Sta” ( CalcLine or CalcSta which is also used for Line or Sta if they are missing in the IEH) - line and station are recalculated from IEH latitude and longitude.  This third option is the most accurate provided the lat & lon were properly recorded.  Potential problems occur on near shore stations where the ± 1nm rule does not work since some stations are spaced 1nm apart - corrections will be made when identified.
  • Sta_ID is a text variable type: since I chose not to separate Gulf of California data, negative or “G-labeled” station designations are included. Using the algorithm for calculating line & station number generates negative station numbers for most Gulf stations.  Another method must have been used to generate these station numbers since many are reported differently in the IEH.  Most have a “G” designation - the original labeling is listed in the Orig_Sta_ID column.
  • Cruz_Num which usually identifies a cruise using YYMM may be listed according to YYMM of the station.  IEH data that do not have “CalCOFI YYMM” listed in the header get a Cruz_Num based on the station's year and month. Potential problem: using the Cruz_Numas a criteria may not include all the stations that were done on cruise YYMM. Obvious corrections will and have been made - the very first station was changed from 4902 to 4903 for example, since it was a single instance. 
  • Cruise-sorted - CIFT (CalCOFI IEH Formatting Tool) groups data according to cruise and ship.  This program was used to generate the importable, comma-delimited data for this database.  The Cast_Data and particularly the Cast_Cnt are sequentially number based on date and time, not specific cruise or ship. 
  • CalCOFI Data Report data included in this database was regenerated from the CIFTIEH files and some but not all of the data has been compared to the printed data reports for parity.  Data Report data includes derived data such as Potential Temperature and Dynamic Height which are not archived in the IEH format.

General comments & descriptions:

The Cast_ID format:

19 - 4902NS - HY059 - 2242 - 093037

[Century] - [YY][MM][ShipCode] - [CastType][Julian Day] - [CastTime]-[Line][Sta]

Century - for Y2K compliance 19 for 1900's; 20 for 2000's
YY - Decade and year (combine with Century for YYYY)
MM - month  (YYMM may designate cruise: see item 3 on previous page)
Ship Code - two-letter ship code

NM - New Horizon

PT - Paolina-T

RR - Roger Revelle

SB - Baird, S. F.

JD - David Starr Jordan

ST - Stranger

QU - Robert Gordon Sproul

YE - Yellowfin

NS - Scofield, N. B.

EB - Scripps, Ellen B.

CR - Crest

WE - Westwind

HO - Horizon

TC - Cromwell, T.

AX - Agassiz, A.

VA - Von Humboldt, A.

AR - Argo

EK - Ekvator

BD - Black Douglas

PN - Poseydon

ES - Scripps, E. W.

DP - David Phillip Dolphin

HS - Smith, H. M.

M4 - McArthur

OR - Orca

AS - Alaska

            From http://www.nodc.noaa.gov/OC5/shipname.txt

Cast type - two-letter cast designation

  • HY - hydro
  • MX - mixed (Bottle+CTD)
  • PR - prodo
  • 10 - 10 meter cast

Julian Day - with 1 being Jan 1 and 365 (or 366) being Dec 31st
Cast Time - bottle cast time (UTC from IEH); if cast time is missing “9999” or julian day
Line & Sta - nearest standard CalCOFI Line & Sta (LLLSSS)

The Depth_ID format:

Uses the Cast_ID prefix but adds three additional variables:

19-4902NS-HY059-2242-093037-0000A-3

                                                ^   ^  ^

                                        [Depth][Bottle]-[Rec_Ind]

Depth - is observed or interpolated depth in meters
Bottle - alpha character to distinguish multiple bottles at the same depth
Record Indicator -  

  • 3 - observed bottle data
  • 4 - “ghost” level -an interpolated level was added to correct a standard level calculation.
  • 5 - CTD data
  • 7 - interpolated standard level - all data interpolated

 

 

 

 

 

 

 

 

Stats

  website metrics

 

 

 

 

 

 

 

 

 

 

 

 

 

Design Notes www.calcofi.org