Soundex

From FamilySearch Wiki

(Difference between revisions)
(Badgley)
m (fixed many links)
(One intermediate revision by one user not shown)
Line 1: Line 1:
<p><i><a _fcknotitle="true" href="United States">United States</a> <img src="/learn/wiki/en/images/b/b1/Gotoarrow.png" _fck_mw_filename="Gotoarrow.png" alt="" /> <a href="United States Census">U.S. Census</a> <img src="/learn/wiki/en/images/b/b1/Gotoarrow.png" _fck_mw_filename="Gotoarrow.png" alt="" /> <a _fcknotitle="true" href="Soundex" class="selflink">Soundex</a></i>
+
[[United States|United States]] [[Image:Gotoarrow.png]] [[United States Census|U.S. Census]] [[Image:Gotoarrow.png]] [[Soundex]]  
</p><p>The Russell Soundex (a.k.a. American Soundex, and Miracode) and its usefulness to genealogists are explained, some online Soundex converters listed, and rules given for how to manually create a Soundex code. <img src="ef=" _fck_mw_filename="{{Soun1910cap}}" alt="" /><img src="ef=" _fck_mw_filename="{{Miracodecap}}" alt="" />
+
</p>
+
<h3> Definition and Value  </h3>
+
<p>Soundex is a phonetic index that groups together names that sound alike but are spelled differently, <i>for example</i>, <i><b>Stewart</b></i> and <i><b>Stuart</b></i>. This helps searchers find names that are spelled differently than originally expected, a relatively common genealogical research problem.
+
</p><p>The indexing system was developed by Robert C. Russell and Margaret K. Odell. It was patented in 1918<span class="fck_mw_ref" _fck_mw_customtag="true" _fck_mw_tagname="ref">Robert C. Russell, a method of phonetic indexing, patent no. 1,261,167 (1918), archive unknown; digital images, ''Google Patents'' (http://www.google.com/patents/about?id=EbdgAAAAEBAJ : accessed 6 May 2010).</span> (reissued 1923<span class="fck_mw_ref" _fck_mw_customtag="true" _fck_mw_tagname="ref">Robert C. Russell, a method of phonetic indexing, patent no. 1,261,167 (1918), reissue no. RE15,582 (1923), archive unknown; digital images, ''Google Patents'' (http://www.google.com/patents/about?id=ProkAAAAEBAJ : accessed 6 May 2010).</span>) and 1922.<span class="fck_mw_ref" _fck_mw_customtag="true" _fck_mw_tagname="ref">Robert C. Russell, a method of phonetic indexing, patent no. 1,435,663 (1922), archive unknown; digital images, ''Google Patents'' (http://www.google.com/patents/about?id=t6FkAAAAEBAJ : accessed 6 May 2010).</span> It is formally called the Russell Soundex, and a variation used on the censuses is called the American Soundex.<span class="fck_mw_ref" _fck_mw_customtag="true" _fck_mw_tagname="ref">Rick Parsons, ''[http://west-penwith.org.uk/misc/soundex.htm Soundex - the True Story],'' (http://west-penwith.org.uk/misc/soundex.htm amp;nbsp;: accessed 30 July 2008).</span>&#160; When a computer was used to generate a Soundex index card for a census it was called a Miracode, and the information listed was slightly different from handwritten Soundex cards for the same census.
+
</p><p>The most well-known genealogical use of Soundex is on parts of the 1880, 1900, 1910, 1920, and 1930 United States federal censuses. It is also used by the federal government for selected ship passenger arrival lists, certain Canadian border crossings, and some naturalization records. A few county governments have also used a version of Soundex for courthouse kinds of records. More recently, Ancestry.com and other Internet companies have featured a Soundex search for their huge online genealogical databases.
+
</p><p>Many non-genealogical search engine algorithms borrow heavily from concepts first introduced by Soundex.<span class="fck_mw_ref" _fck_mw_customtag="true" _fck_mw_tagname="ref">Ibid.</span>
+
</p>
+
<h3> Online Soundex Converters  </h3>
+
<p>The easiest way to obtain the Soundex code for a name is to use one of several online Soundex converter programs. Simply type a name, and at the click of a button, the converter will divulge the corresponding Soundex code. There may be subtle differences between programs:<br>
+
</p>
+
<ul><li><a href="http://bradandkathy.com/genealogy/yasc.html">Yet Another Soundex Converter</a>
+
</li><li><a href="http://www.creativyst.com/Doc/Articles/SoundEx1/SoundEx1.htm#SoundExConverter">SoundEx Converter Form</a>
+
</li><li><a href="http://www.eogn.com/soundex/">Eastman’s Online Genealogy Newsletter Soundex Calculator</a>
+
</li><li><a href="http://resources.rootsweb.ancestry.com/cgi-bin/soundexconverter">RootsWeb’s Soundex Converter</a>
+
</li></ul>
+
<h3> How Soundex Works  </h3>
+
<p>Soundex is based on the classification of letters of the alphabet (consonants) into six sound-alike key letter groups. For example, in many languages the <b>B</b> and <b>V</b> sounds are nearly interchangeable; as are <b>B</b> and <b>P</b>; and <b>V</b> and <b>F</b>. So the first phonetic group of key letter consonants is <b>b, f, p, v</b>. Vowels are fluid and disregarded, as are <b>H</b> and <b>W</b>. By giving the same value to key letter consonants that often sound alike, the index brings names together that would usually be pronounced alike with little regard to their actual spelling. Each sound-alike group of key letter consonants is assigned a number. Each family name is assigned a Soundex code that has the initial letter of the name followed by exactly three of the sound-alike key letter group numbers. For example, <i><b>Stewart</b></i>&#160;= S363 and <i><b>Stuart</b></i>&#160;= S363.
+
</p><p>Modern online search engines that use Soundex do it without displaying the Soundex codes—similar names spelled differently simply appear together on the search results list. But from time to time a researcher may need to understand Soundex codes in order to use one of the older Soundex indexes on microfilm.
+
</p>
+
<h4> Soundex Rules  </h4>
+
<p>Use these rules to manually create a Soundex code for an ancestor’s name.
+
</p>
+
<h5> Basic Rules<span class="fck_mw_ref" _fck_mw_customtag="true" _fck_mw_tagname="ref">Based on rules in ''[http://www.archives.gov/genealogy/census/soundex.html The Soundex Indexing System],'' ''The National Archives'' (http://www.archives.gov/genealogy/census/soundex.html : accessed 30 July 2008).</span>  </h5>
+
<ul><li>Every soundex code consists of a letter and three numbers, such as D432.
+
</li><li>The letter is always the first letter of the name. For example, <b>Clausen</b>&#160;= C425, and <b>Klausen</b>&#160;= K425.
+
</li><li>After the first letter, disregard vowels (<b>a, e, i, o, u,</b> and <b>y</b>) and ignore the consonants <b>h</b>, and <b>w</b>.
+
</li><li>Numbers are assigned to the remaining letters of the name according to the table of <i>Soundex Key Letter Codes</i> shown below.
+
</li><li>Zeroes are added at the end if necessary to produce a four-character code. Excess letters are disregarded if they would produce a code longer than four-characters. For example <b>Lee</b>&#160;= L000, and <b>Christopherson</b>&#160;= C623.
+
</li></ul>
+
<table align="center" width="225" cellspacing="1" cellpadding="1" class="plain FCK&#95;_ShowTableBorders">
+
<caption> <u><b>Soundex Key Letter Codes</b></u>
+
</caption>
+
<tr>
+
<td align="center" valign="middle"> <i><b>Number</b></i><br>
+
</td>
+
<td> &#160;&#160;&#160;
+
</td>
+
<td> <i><b>Represents the Letters<br></b></i>
+
</td></tr>
+
<tr>
+
<td align="right" valign="middle"> 1<br>
+
</td>
+
<td> &#160;
+
</td>
+
<td> b, f, p, v <br>
+
</td></tr>
+
<tr>
+
<td align="right" valign="middle"> 2<br>
+
</td>
+
<td>
+
</td>
+
<td> c, g, j, k, q, s, x, z<br>
+
</td></tr>
+
<tr>
+
<td align="right" valign="middle"> 3<br>
+
</td>
+
<td>
+
</td>
+
<td> d, t<br>
+
</td></tr>
+
<tr>
+
<td align="right" valign="middle"> 4<br>
+
</td>
+
<td>
+
</td>
+
<td> l<br>
+
</td></tr>
+
<tr>
+
<td align="right" valign="middle"> 5<br>
+
</td>
+
<td>
+
</td>
+
<td> m, n<br>
+
</td></tr>
+
<tr>
+
<td align="right" valign="middle"> 6<br>
+
</td>
+
<td>
+
</td>
+
<td> r<br>
+
</td></tr></table>
+
<h5> Additional Rules  </h5>
+
<ul><li><b>Double key letters</b> should be treated as one letter. For example, <b>Gutierrez</b>&#160;= G362.
+
</li><li><b>Side-by-side letters with the same code number</b> should be treated as one letter. For example, <b>Campbell</b>&#160;= C514, and <b>Jackson</b>&#160;= J250, and <b>Pfister</b>&#160;= P236.
+
</li><li><b>Vowel key letter seperators.</b> If a vowel (<b>a, e, i, o, u, y</b>) separates key letters that have the same code number, those key letters should be treated as two letters. For example, <b>Tomzak</b>&#160;= T522, and <b>Roses</b>&#160;= R220.
+
</li><li><b>H or W key letter seperators.</b> If an <b>h</b> or <b>w</b> separates key letters that have the same code number, those key letters should be treated as one letter. For example, <b>Ashcroft</b>&#160;= A261, and <b>Carwruth</b>&#160;= C630.
+
</li><li><b>Names with prefixes</b>, such as Van, Con, De, Di, La, or Le, are coded both with and without the prefix because the name might be listed under either code. Note, however, that Mc and Mac are not considered prefixes. For example, <b>Van Deusen</b>&#160;= V532 or D250.
+
</li></ul>
+
<h4> More Soundex Examples<span class="fck_mw_ref" _fck_mw_customtag="true" _fck_mw_tagname="ref">Anne Bruner Eales, and Robert M. Kvasnicka, ''[http://www.worldcat.org/oclc/44420788&amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;amp;referer=brief_results Guide to Genealogical Research in the National Archives, 3rd ed.]'' (Washington, D.C.: NARA, 2000), 22.</span>  </h4>
+
<table align="center" width="100%" cellspacing="1" cellpadding="1" class="plain FCK&#95;_ShowTableBorders">
+
  
<tr>
+
The Russell Soundex (a.k.a. American Soundex, and Miracode) and its usefulness to genealogists are explained, some online Soundex converters listed, and rules given for how to manually create a Soundex code. {{Soun1910cap}} {{Miracodecap}}  
<td> <i>Name</i>
+
</td>
+
<td> <i>Code</i>
+
</td>
+
<td width="8%"> &#160;&#160;&#160;&#160;&#160;
+
</td>
+
<td> <i>Name</i>
+
</td>
+
<td> <i>Code</i>
+
</td>
+
<td width="8%"> &#160;&#160;&#160;&#160;&#160;
+
</td>
+
<td> <i>Name</i>
+
</td>
+
<td> <i>Code</i>
+
</td>
+
<td width="8%"> &#160;&#160;&#160;&#160;&#160;
+
</td>
+
<td> <i>Name</i>
+
</td>
+
<td> <i>Code</i>
+
</td></tr>
+
<tr>
+
<td> Allricht
+
</td>
+
<td> A462
+
</td>
+
<td>
+
</td>
+
<td> Kavanagh
+
</td>
+
<td> K152
+
</td>
+
<td>
+
</td>
+
<td> McGee
+
</td>
+
<td> M200
+
</td>
+
<td>
+
</td>
+
<td> Schafer
+
</td>
+
<td> S160
+
</td></tr>
+
<tr>
+
<td> Eberhard
+
</td>
+
<td> E166
+
</td>
+
<td>
+
</td>
+
<td> Lind
+
</td>
+
<td> L530
+
</td>
+
<td>
+
</td>
+
<td> O'Brien
+
</td>
+
<td> O165
+
</td>
+
<td>
+
</td>
+
<td> Shaeffer
+
</td>
+
<td> S160
+
</td></tr>
+
<tr>
+
<td> Hanselmann
+
</td>
+
<td> H524
+
</td>
+
<td>
+
</td>
+
<td> Lukaschowsky
+
</td>
+
<td> L222
+
</td>
+
<td>
+
</td>
+
<td> Oppenheimer
+
</td>
+
<td> O155
+
</td>
+
<td>
+
</td>
+
<td> Zita
+
</td>
+
<td> Z300
+
</td></tr>
+
<tr>
+
<td> Heimbach
+
</td>
+
<td> H512
+
</td>
+
<td>
+
</td>
+
<td> McDonnell
+
</td>
+
<td> M235
+
</td>
+
<td>
+
</td>
+
<td> Riedemanas
+
</td>
+
<td> R355
+
</td>
+
<td>
+
</td>
+
<td> Zitzmeinn
+
</td>
+
<td> Z325
+
</td></tr></table>
+
<h4> Limitations and How to Overcome Them  </h4>
+
<p><b>Indexing rules were not always followed consistently.</b> The government indexers may have occasionally overlooked some of the fine points of the additional indexing rules.<span class="fck_mw_ref" _fck_mw_customtag="true" _fck_mw_tagname="ref">Parsons.</span> If you cannot find a name under the correct code, try looking under the code as if the additional rules were overlooked. For example, try looking for <b>Ashcroft</b> under both A226 and A261, or try looking for <b>Pfister</b> under both P236 and P123.
+
</p><p><b>Unrelated names may be grouped together.</b> Sometimes names that do not appear to be related show up together on a Soundex index. Ignore clearly unrelated names. For example, if you were looking for <b>Wilkins</b>, you may also find under the same Soundex code, W425, the name <b>Walakynowski</b>.
+
</p><p><b>Related names may not be grouped together.</b> Sometimes names that are obviously related do not come together in the same Soundex index group. For example, <b>Clausen</b> is under C425 and <b>Klausen</b> under K425. If you cannot find a name you seek in a Soundex index, there are 20 alternative ideas in the Wiki article <a _fcknotitle="true" href="Guessing a Name Variation">Guessing a Name Variation</a> to help find elusive names in indexes.
+
</p>
+
<h3> Census Soundex Cards Show Limited Data  </h3>
+
<p>One of the most well-known uses of Soundex indexes is for some of the federal censuses of the United States. More recently, these old microfilm indexes have been largely replaced by online search engines. If you ever have an occasion to use a census Soundex on microfilm, keep in mind that the Soundex card is only a summary. It does not show as much information as the original census schedule.
+
</p><p><i>Figure 1. Sample 1930 Soundex index cards. Original image from the <a href="http://1930census.archives.gov/searchStrategiesSoundex.html">NARA 1930 Census Microfilm Locator</a></i>. <img src="/learn/wiki/en/images/e/ea/SoundexCards1930.png" _fck_mw_filename="SoundexCards1930.png" alt="" />
+
</p><p><br><i>Figure 2. Half a page of the 1930 federal census of Bronx, New York, with more data showing than on Soundex cards.</i> <img src="/learn/wiki/en/images/f/fa/1930CensusBronx.png" _fck_mw_filename="1930CensusBronx.png" alt="" />
+
</p>
+
<h3> Related Content  </h3>
+
<ul><li>Rick Parsons, <i><a href="http://west-penwith.org.uk/misc/soundex.htm">Soundex - the True Story</a>,</i> (http://west-penwith.org.uk/misc/soundex.htm &#160;: accessed 30 July 2008).
+
</li><li><i><a href="http://www.archives.gov/genealogy/census/soundex.html">The Soundex Indexing System</a>,</i> <i>The National Archives</i> (http://www.archives.gov/genealogy/census/soundex.html &#160;: accessed 30 July 2008).
+
</li><li>Kimberly Powell, "<a href="http://genealogy.about.com/od/census/a/russell_index.htm">R. C. Russell Soundex Index - Full Text Description from the Original Patent</a>," <i>About.com: Genealogy </i>(http://genealogy.about.com/od/census/a/russell_index.htm &#160;: accessed 29 July 2008).
+
</li><li>Gary Mokotoff, "<a href="http://www.avotaynu.com/soundex.html">Soundexing and Genealogy</a>," <i>Avotaynu</i> (http://www.avotaynu.com/soundex.html &#160;: accessed 30 July 2008).
+
</li><li><a href="United States Census Indexes">United States Census Indexes</a> FamilySearch Wiki article.
+
</li><li><a href="Finding a Person in the 1930 Census (Even Without An Index)">Finding a Person in the 1930 Census (Even without and Index)</a> FamilySearch Wiki article.
+
</li></ul>
+
<h3> Sources  </h3>
+
<p><span class="fck_mw_template">{{reflist}}</span>
+
</p><p><span class="fck_mw_template">{{USCensus}}</span>
+
</p>
+
<h3> <br>  </h3>
+
<p><b>A wiki article describing this collection is found at:</b>
+
</p>
+
<ul><li><a _fcknotitle="true" href="New York, Northern Arrival Manifests (FamilySearch Historical Records)">New York, Northern Arrival Manifests (FamilySearch Historical Records)</a>
+
</li></ul>
+
  
 +
=== Definition and Value  ===
  
<a href="Category:United_States_Census">United_States_Census</a> <a href="Category:United_States_Emigration_and_Immigration">United_States_Emigration_and_Immigration</a> <a href="Category:United_States_Naturalization_and_Citizenship">United_States_Naturalization_and_Citizenship</a> <a href="Category:United_States_Land_and_Property">United_States_Land_and_Property</a>
+
Soundex is a phonetic index that groups together names that sound alike but are spelled differently, ''for example'', '''''Stewart''''' and '''''Stuart'''''. This helps searchers find names that are spelled differently than originally expected, a relatively common genealogical research problem.
 +
 
 +
The indexing system was developed by Robert C. Russell and Margaret K. Odell. It was patented in 1918 <ref> Robert C. Russell, a method of phonetic indexing, patent no. 1,261,167 (1918), archive unknown; digital images,Google Patents(http://www.google.com/patents/about?id=EbdgAAAAEBAJ  accessed 6 May 2010).</ref> (reissued 1923 <ref>Robert C. Russell, a method of phonetic indexing, patent no. 1,261,167 (1918), reissue no. RE15,582 (1923), archive unknown; digital images, Google Patents (http://www.google.com/patents/about?id=ProkAAAAEBAJ  accessed 6 May 2010).</ref>) and 1922.<ref>Robert C. Russell, a method of phonetic indexing, patent no. 1,435,663 (1922), archive unknown; digital images, Google Patents  (http://www.google.com/patents/about?id=t6FkAAAAEBAJ  accessed 6 May 2010).</ref> It is formally called the Russell Soundex, and a variation used on the censuses is called the American Soundex.<ref>Rick Parsons</ref>, a <ref>[http://west-penwith.org.uk/misc/soundex.htm  _ http://west-penwith.org.uk/misc/soundex.htm" Soundex - the True Story]</ref> <ref>(http://west-penwith.org.uk/misc/soundex.htm  accessed 30 July 2008).</ref> When a computer was used to generate a Soundex index card for a census it was called a Miracode, and the information listed was slightly different from handwritten Soundex cards for the same census.
 +
 
 +
The most well-known genealogical use of Soundex is on parts of the 1880, 1900, 1910, 1920, and 1930 United States federal censuses. It is also used by the federal government for selected ship passenger arrival lists, certain Canadian border crossings, and some naturalization records. A few county governments have also used a version of Soundex for courthouse kinds of records. More recently, Ancestry.com and other Internet companies have featured a Soundex search for their huge online genealogical databases.
 +
 
 +
Many non-genealogical search engine algorithms borrow heavily from concepts first introduced by Soundex.<ref>Ibid.</ref>
 +
 
 +
=== Online Soundex Converters  ===
 +
 
 +
The easiest way to obtain the Soundex code for a name is to use one of several online Soundex converter programs. Simply type a name, and at the click of a button, the converter will divulge the corresponding Soundex code. There may be subtle differences between programs:<br>
 +
 
 +
*[http://bradandkathy.com/genealogy/yasc.html et Another Soundex Converter]
 +
*[http://www.creativyst.com/Doc/Articles/SoundEx1/SoundEx1.htm#SoundExConverter SoundEx Converter Form]
 +
*[http://www.eogn.com/soundex/ Eastman’s Online Genealogy Newsletter Soundex Calculator]
 +
*[http://resources.rootsweb.ancestry.com/cgi-bin/soundexconverter RootsWeb’s Soundex Converter]
 +
 
 +
=== How Soundex Works  ===
 +
 
 +
Soundex is based on the classification of letters of the alphabet (consonants) into six sound-alike key letter groups. For example, in many languages the '''B''' and '''V''' sounds are nearly interchangeable; as are '''B''' and '''P'''; and '''V''' and '''F'''. So the first phonetic group of key letter consonants is '''b, f, p, v'''. Vowels are fluid and disregarded, as are '''H''' and '''W'''. By giving the same value to key letter consonants that often sound alike, the index brings names together that would usually be pronounced alike with little regard to their actual spelling. Each sound-alike group of key letter consonants is assigned a number. Each family name is assigned a Soundex code that has the initial letter of the name followed by exactly three of the sound-alike key letter group numbers. For example, '''''Stewart'''''&nbsp;= S363 and '''''Stuart'''''&nbsp;= S363.
 +
 
 +
Modern online search engines that use Soundex do it without displaying the Soundex codes—similar names spelled differently simply appear together on the search results list. But from time to time a researcher may need to understand Soundex codes in order to use one of the older Soundex indexes on microfilm.
 +
 
 +
==== Soundex Rules  ====
 +
 
 +
Use these rules to manually create a Soundex code for an ancestor’s name.
 +
 
 +
==== = Basic Rules  ====
 +
 
 +
Based on rules in [http://www.archives.gov/genealogy/census/soundex.html www.archives.gov/genealogy/census/soundex.html] The Soundex Indexing System in the The National Archives [http://www.archives.gov/genealogy/census/soundex.html www.archives.gov accessed 30 July 2008). =====
 +
 
 +
*Every soundex code consists of a letter and three numbers, such as D432.
 +
*The letter is always the first letter of the name. For example, '''Clausen'''&nbsp;= C425, and '''Klausen''' = K425.
 +
*After the first letter, disregard vowels ('''a, e, i, o, u,''' and '''y''') and ignore the consonants '''h''', and '''w'''.
 +
*Numbers are assigned to the remaining letters of the name according to the table of ''Soundex Key Letter Codes'' shown below.
 +
*Zeroes are added at the end if necessary to produce a four-character code. Excess letters are disregarded if they would produce a code longer than four-characters. For example '''Lee''' = L000, and '''Christopherson'''&nbsp;= C623.
 +
 
 +
{| width="225" cellspacing="1" cellpadding="1" align="center" class="plain FCK__ShowTableBorders"
 +
|+ <u>'''Soundex Key Letter Codes'''</u>
 +
|-
 +
| valign="middle" align="center" | '''''Number'''''<br>
 +
| &nbsp;&nbsp;&nbsp;
 +
| '''''Represents the Letters<br>'''''
 +
|-
 +
| valign="middle" align="right" | 1<br>
 +
| &nbsp;
 +
| b, f, p, v <br>
 +
|-
 +
| valign="middle" align="right" | 2<br>
 +
|
 +
| c, g, j, k, q, s, x, z<br>
 +
|-
 +
| valign="middle" align="right" | 3<br>
 +
|
 +
| d, t<br>
 +
|-
 +
| valign="middle" align="right" | 4<br>
 +
|
 +
| l<br>
 +
|-
 +
| valign="middle" align="right" | 5<br>
 +
|
 +
| m, n<br>
 +
|-
 +
| valign="middle" align="right" | 6<br>
 +
|
 +
| r<br>
 +
|}
 +
 
 +
===== Additional Rules  =====
 +
 
 +
*'''Double key letters''' should be treated as one letter. For example, '''Gutierrez'''&nbsp;= G362.
 +
*'''Side-by-side letters with the same code number''' should be treated as one letter. For example, '''Campbell'''&nbsp;= C514, and '''Jackson'''&nbsp;= J250, and '''Pfister'''&nbsp;= P236.
 +
*'''Vowel key letter seperators.''' If a vowel ('''a, e, i, o, u, y''') separates key letters that have the same code number, those key letters should be treated as two letters. For example, '''Tomzak'''&nbsp;= T522, and '''Roses'''&nbsp;= R220.
 +
*'''H or W key letter seperators.''' If an '''h''' or '''w''' separates key letters that have the same code number, those key letters should be treated as one letter. For example, '''Ashcroft'''&nbsp;= A261, and '''Carwruth'''&nbsp;= C630.
 +
*'''Names with prefixes''', such as Van, Con, De, Di, La, or Le, are coded both with and without the prefix because the name might be listed under either code. Note, however, that Mc and Mac are not considered prefixes. For example, '''Van Deusen'''&nbsp;= V532 or D250.
 +
 
 +
==== More Soundex Examples  ====
 +
 
 +
<ref>Anne Bruner Eales, and Robert M. Kvasnicka,  [http://www.worldcat.org/oclc/44420788]referer=brief_results]</ref> Guide to Genealogical Research in the National Archives, 3rd ed.(Washington, D.C.: NARA, 2000), 22.
 +
 
 +
{| width="100%" cellspacing="1" cellpadding="1" align="center" class="plain FCK__ShowTableBorders"
 +
|-
 +
| ''Name''
 +
| ''Code''
 +
| width="8%" | &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
 +
| ''Name''
 +
| ''Code''
 +
| width="8%" | &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
 +
| ''Name''
 +
| ''Code''
 +
| width="8%" | &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
 +
| ''Name''
 +
| ''Code''
 +
|-
 +
| Allricht
 +
| A462
 +
|
 +
| Kavanagh
 +
| K152
 +
|
 +
| McGee
 +
| M200
 +
|
 +
| Schafer
 +
| S160
 +
|-
 +
| Eberhard
 +
| E166
 +
|
 +
| Lind
 +
| L530
 +
|
 +
| O'Brien
 +
| O165
 +
|
 +
| Shaeffer
 +
| S160
 +
|-
 +
| Hanselmann
 +
| H524
 +
|
 +
| Lukaschowsky
 +
| L222
 +
|
 +
| Oppenheimer
 +
| O155
 +
|
 +
| Zita
 +
| Z300
 +
|-
 +
| Heimbach
 +
| H512
 +
|
 +
| McDonnell
 +
| M235
 +
|
 +
| Riedemanas
 +
| R355
 +
|
 +
| Zitzmeinn
 +
| Z325
 +
|}
 +
 
 +
==== Limitations and How to Overcome Them  ====
 +
 
 +
'''Indexing rules were not always followed consistently.''' The government indexers may have occasionally overlooked some of the fine points of the additional indexing rules.<ref>Parsons.</ref> If you cannot find a name under the correct code, try looking under the code as if the additional rules were overlooked. For example, try looking for '''Ashcroft''' under both A226 and A261, or try looking for '''Pfister''' under both P236 and P123.
 +
 
 +
'''Unrelated names may be grouped together.''' Sometimes names that do not appear to be related show up together on a Soundex index. Ignore clearly unrelated names. For example, if you were looking for '''Wilkins''', you may also find under the same Soundex code, W425, the name '''Walakynowski'''.
 +
 
 +
'''Related names may not be grouped together.''' Sometimes names that are obviously related do not come together in the same Soundex index group. For example, '''Clausen''' is under C425 and '''Klausen''' under K425. If you cannot find a name you seek in a Soundex index, there are 20 alternative ideas in the Wiki article &lt;a _fcknotitle="true" [[Guessing a Name Variation|Guessing a Name Variation]] to help find elusive names in indexes.
 +
 
 +
=== Census Soundex Cards Show Limited Data  ===
 +
 
 +
One of the most well-known uses of Soundex indexes is for some of the federal censuses of the United States. More recently, these old microfilm indexes have been largely replaced by online search engines. If you ever have an occasion to use a census Soundex on microfilm, keep in mind that the Soundex card is only a summary. It does not show as much information as the original census schedule.
 +
 
 +
''Figure 1. Sample 1930 Soundex index cards. Original image from the &lt;a [http://1930census.archives.gov/searchStrategiesSoundex.html NARA 1930 Census Microfilm Locator]''.
 +
 
 +
[[Image:SoundexCards1930.png|right|335px|SoundexCards1930.png]]
 +
 
 +
<br>''Figure 2. Half a page of the 1930 federal census of Bronx, New York, with more data showing than on Soundex cards.'' &lt;img src="/learn/wiki/en/images/f/fa/1930CensusBronx.png" _fck_mw_filename="1930CensusBronx.png" alt="" /&gt;
 +
 
 +
=== Research Wiki  ===
 +
 
 +
'''A wiki article describing this collection is found at:'''
 +
 
 +
*[[New York, Northern Arrival Manifests (FamilySearch Historical Records)|New York, Northern Arrival Manifests (FamilySearch Historical Records)]]
 +
 
 +
=== Related Content  ===
 +
 
 +
*Rick Parsons, ''<ref>[http://west-penwith.org.uk/misc/soundex.htm Soundex - the True Story ''</ref> (http://west-penwith.org.uk/misc/soundex.htm &nbsp;: accessed 30 July 2008). ''
 +
*''<ref>[http://www.archives.gov/genealogy/census/soundex.html The Soundex Indexing System] </ref>'' ''The National Archives'' (http://www.archives.gov/genealogy/census/soundex.html &nbsp;: accessed 30 July 2008).
 +
*Kimberly Powell, "<ref>[http://genealogy.about.com/od/census/a/russell_index.htm R. C. Russell Soundex Index - Full Text Description from the Original Patent]</ref> ''About.com: Genealogy ''(http://genealogy.about.com/od/census/a/russell_index.htm&nbsp;: accessed 29 July 2008).
 +
*Gary Mokotoff, "<ref>[http://www.avotaynu.com/soundex.html Soundexing and Genealogy]</ref>" ''Avotaynu'' (http://www.avotaynu.com/soundex.html &nbsp;: accessed 30 July 2008).
 +
*<ref>United States Census Indexes United States Census Indexes</ref> FamilySearch Wiki article.
 +
*<ref>[[Finding a Person in the 1930 Census (Even Without An Index)| Finding a Person in the 1930 Census (Even without and Index)]]</ref> FamilySearch Wiki article.
 +
 
 +
=== References  ===
 +
 
 +
<references />
 +
 
 +
<br>
 +
 
 +
[[Category:United_States_Census|United_States_Census]] [[Category:United_States_Emigration_and_Immigration|United_States_Emigration_and_Immigration]] [[Category:United_States_Naturalization_and_Citizenship|United_States_Naturalization_and_Citizenship]] [[Category:United_States_Land_and_Property|United_States Land_and_Property]]

Revision as of 04:18, 25 June 2013

United States Gotoarrow.png U.S. Census Gotoarrow.png Soundex

The Russell Soundex (a.k.a. American Soundex, and Miracode) and its usefulness to genealogists are explained, some online Soundex converters listed, and rules given for how to manually create a Soundex code. Soundex1910.jpg|thumb|right|1910 U.S. federal census Soundex family card. Miracode.jpg|thumb|right|1910 U.S. federal census Miracode for 4 households.

Contents

Definition and Value

Soundex is a phonetic index that groups together names that sound alike but are spelled differently, for example, Stewart and Stuart. This helps searchers find names that are spelled differently than originally expected, a relatively common genealogical research problem.

The indexing system was developed by Robert C. Russell and Margaret K. Odell. It was patented in 1918 [1] (reissued 1923 [2]) and 1922.[3] It is formally called the Russell Soundex, and a variation used on the censuses is called the American Soundex.[4], a [5] [6] When a computer was used to generate a Soundex index card for a census it was called a Miracode, and the information listed was slightly different from handwritten Soundex cards for the same census.

The most well-known genealogical use of Soundex is on parts of the 1880, 1900, 1910, 1920, and 1930 United States federal censuses. It is also used by the federal government for selected ship passenger arrival lists, certain Canadian border crossings, and some naturalization records. A few county governments have also used a version of Soundex for courthouse kinds of records. More recently, Ancestry.com and other Internet companies have featured a Soundex search for their huge online genealogical databases.

Many non-genealogical search engine algorithms borrow heavily from concepts first introduced by Soundex.[7]

Online Soundex Converters

The easiest way to obtain the Soundex code for a name is to use one of several online Soundex converter programs. Simply type a name, and at the click of a button, the converter will divulge the corresponding Soundex code. There may be subtle differences between programs:

How Soundex Works

Soundex is based on the classification of letters of the alphabet (consonants) into six sound-alike key letter groups. For example, in many languages the B and V sounds are nearly interchangeable; as are B and P; and V and F. So the first phonetic group of key letter consonants is b, f, p, v. Vowels are fluid and disregarded, as are H and W. By giving the same value to key letter consonants that often sound alike, the index brings names together that would usually be pronounced alike with little regard to their actual spelling. Each sound-alike group of key letter consonants is assigned a number. Each family name is assigned a Soundex code that has the initial letter of the name followed by exactly three of the sound-alike key letter group numbers. For example, Stewart = S363 and Stuart = S363.

Modern online search engines that use Soundex do it without displaying the Soundex codes—similar names spelled differently simply appear together on the search results list. But from time to time a researcher may need to understand Soundex codes in order to use one of the older Soundex indexes on microfilm.

Soundex Rules

Use these rules to manually create a Soundex code for an ancestor’s name.

= Basic Rules

Based on rules in www.archives.gov/genealogy/census/soundex.html The Soundex Indexing System in the The National Archives [http://www.archives.gov/genealogy/census/soundex.html www.archives.gov accessed 30 July 2008). =====

  • Every soundex code consists of a letter and three numbers, such as D432.
  • The letter is always the first letter of the name. For example, Clausen = C425, and Klausen = K425.
  • After the first letter, disregard vowels (a, e, i, o, u, and y) and ignore the consonants h, and w.
  • Numbers are assigned to the remaining letters of the name according to the table of Soundex Key Letter Codes shown below.
  • Zeroes are added at the end if necessary to produce a four-character code. Excess letters are disregarded if they would produce a code longer than four-characters. For example Lee = L000, and Christopherson = C623.
Soundex Key Letter Codes
Number
    Represents the Letters
1
  b, f, p, v
2
c, g, j, k, q, s, x, z
3
d, t
4
l
5
m, n
6
r
Additional Rules
  • Double key letters should be treated as one letter. For example, Gutierrez = G362.
  • Side-by-side letters with the same code number should be treated as one letter. For example, Campbell = C514, and Jackson = J250, and Pfister = P236.
  • Vowel key letter seperators. If a vowel (a, e, i, o, u, y) separates key letters that have the same code number, those key letters should be treated as two letters. For example, Tomzak = T522, and Roses = R220.
  • H or W key letter seperators. If an h or w separates key letters that have the same code number, those key letters should be treated as one letter. For example, Ashcroft = A261, and Carwruth = C630.
  • Names with prefixes, such as Van, Con, De, Di, La, or Le, are coded both with and without the prefix because the name might be listed under either code. Note, however, that Mc and Mac are not considered prefixes. For example, Van Deusen = V532 or D250.

More Soundex Examples

[8] Guide to Genealogical Research in the National Archives, 3rd ed.(Washington, D.C.: NARA, 2000), 22.

Name Code       Name Code       Name Code       Name Code
Allricht A462 Kavanagh K152 McGee M200 Schafer S160
Eberhard E166 Lind L530 O'Brien O165 Shaeffer S160
Hanselmann H524 Lukaschowsky L222 Oppenheimer O155 Zita Z300
Heimbach H512 McDonnell M235 Riedemanas R355 Zitzmeinn Z325

Limitations and How to Overcome Them

Indexing rules were not always followed consistently. The government indexers may have occasionally overlooked some of the fine points of the additional indexing rules.[9] If you cannot find a name under the correct code, try looking under the code as if the additional rules were overlooked. For example, try looking for Ashcroft under both A226 and A261, or try looking for Pfister under both P236 and P123.

Unrelated names may be grouped together. Sometimes names that do not appear to be related show up together on a Soundex index. Ignore clearly unrelated names. For example, if you were looking for Wilkins, you may also find under the same Soundex code, W425, the name Walakynowski.

Related names may not be grouped together. Sometimes names that are obviously related do not come together in the same Soundex index group. For example, Clausen is under C425 and Klausen under K425. If you cannot find a name you seek in a Soundex index, there are 20 alternative ideas in the Wiki article <a _fcknotitle="true" Guessing a Name Variation to help find elusive names in indexes.

Census Soundex Cards Show Limited Data

One of the most well-known uses of Soundex indexes is for some of the federal censuses of the United States. More recently, these old microfilm indexes have been largely replaced by online search engines. If you ever have an occasion to use a census Soundex on microfilm, keep in mind that the Soundex card is only a summary. It does not show as much information as the original census schedule.

Figure 1. Sample 1930 Soundex index cards. Original image from the <a NARA 1930 Census Microfilm Locator.

SoundexCards1930.png


Figure 2. Half a page of the 1930 federal census of Bronx, New York, with more data showing than on Soundex cards. <img src="/learn/wiki/en/images/f/fa/1930CensusBronx.png" _fck_mw_filename="1930CensusBronx.png" alt="" />

Research Wiki

A wiki article describing this collection is found at:

Related Content

References

  1. Robert C. Russell, a method of phonetic indexing, patent no. 1,261,167 (1918), archive unknown; digital images,Google Patents(http://www.google.com/patents/about?id=EbdgAAAAEBAJ accessed 6 May 2010).
  2. Robert C. Russell, a method of phonetic indexing, patent no. 1,261,167 (1918), reissue no. RE15,582 (1923), archive unknown; digital images, Google Patents (http://www.google.com/patents/about?id=ProkAAAAEBAJ accessed 6 May 2010).
  3. Robert C. Russell, a method of phonetic indexing, patent no. 1,435,663 (1922), archive unknown; digital images, Google Patents (http://www.google.com/patents/about?id=t6FkAAAAEBAJ accessed 6 May 2010).
  4. Rick Parsons
  5. _ http://west-penwith.org.uk/misc/soundex.htm" Soundex - the True Story
  6. (http://west-penwith.org.uk/misc/soundex.htm accessed 30 July 2008).
  7. Ibid.
  8. Anne Bruner Eales, and Robert M. Kvasnicka, [1]referer=brief_results]
  9. Parsons.
  10. [http://west-penwith.org.uk/misc/soundex.htm Soundex - the True Story
  11. The Soundex Indexing System
  12. R. C. Russell Soundex Index - Full Text Description from the Original Patent
  13. Soundexing and Genealogy
  14. United States Census Indexes United States Census Indexes
  15. Finding a Person in the 1930 Census (Even without and Index)