tsegall / fta

Metadata/data identification Java library. Identifies Semantic Type information (e.g. Gender, Age, Color, Country,...). Extensive country/language support. Extensible via user-defined plugins. Comprehensive Profiling support.
Apache License 2.0

Counts do not match for SemanticType - Street Address #25

Closed: fredxing closed this issue 1 year ago

fredxing commented 1 year ago

When training with more sample rows (>400?), the counts for 'Street Address' stop matching.

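The output below shows 14 samples unaccounted for: sampleCount is 500 but matchCount is 486, while nullCount, blankCount, outlierCardinality, and invalidCardinality are all 0.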

RowCount=500
{
  "fieldName" : "address",
  "totalCount" : -1,
  "sampleCount" : 500,
  "matchCount" : 486,
  "nullCount" : 0,
  "blankCount" : 0,
  "distinctCount" : 485,
  "regExp" : ".+",
  "confidence" : 1.0,
  "type" : "String",
  "isSemanticType" : true,
  "semanticType" : "STREET_ADDRESS_EN",
  "min" : "1 Beilfuss Point",
  "max" : "996 Westend Junction",
  "minLength" : 9,
  "maxLength" : 28,
  "topK" : [ "996 Westend Junction", "99 Truax Trail", "99 Pepper Wood Pass", "98969 Everett Park", "986 Grasskamp Place", "985 Truax Crossing", "98 Iowa Circle", "97656 Sherman Avenue", "97595 Dayton Lane", "97410 Esker Way" ],
  "bottomK" : [ "1 Beilfuss Point", "1 Bonner Place", "1 Briar Crest Point", "1 Columbus Place", "1 Corben Point", "1 Eastlawn Point", "1 Fisk Court", "1 Glendale Court", "1 Grover Road", "1 Northfield Way" ],
  "cardinality" : 485,
  "outlierCardinality" : 0,
  "invalidCardinality" : 0,
  "shapesCardinality" : 244,
  "leadingWhiteSpace" : false,
  "trailingWhiteSpace" : false,
  "multiline" : false,
  "keyConfidence" : 0.0,
  "uniqueness" : 0.9979381443298969,
  "detectionLocale" : "en-US",
  "ftaVersion" : "12.8.2",
  "structureSignature" : "5P7tWzPdbjVvyHhLklpTf00Zxl8=",
  "dataSignature" : "GLaZHNyv01dDZ72FtYNIY7Yhy8k="
}
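
The arithmetic behind the 14 missing counts can be sketched as below. This is only an illustration, assuming that every sample should be accounted for as a match, a null, or a blank when no outliers or invalid entries are reported. `CountCheck` and `unaccounted` are hypothetical names and are not part of FTA.

```java
public class CountCheck {
    /**
     * Number of samples not accounted for by matches, nulls, or blanks.
     * Assumption: with no outliers or invalid entries, this should be 0.
     */
    public static long unaccounted(long sampleCount, long matchCount,
                                   long nullCount, long blankCount) {
        return sampleCount - (matchCount + nullCount + blankCount);
    }

    public static void main(String[] args) {
        // Values copied from the reported output for the "address" field.
        long missing = unaccounted(500, 486, 0, 0);
        System.out.println("unaccounted=" + missing); // prints "unaccounted=14"
    }
}
```

A non-zero result means some trained samples were dropped from the tallies, which is the symptom reported here.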

Test code

String[] fieldnames = new String[] {"address"}; String[][] values = new String[][] { new String[] {"67 Corscot Lane"}, new String[] {"7021 Claremont Parkway"}, new String[] {"804 Debs Circle"}, new String[] {"04104 Carey Junction"}, new String[] {"420 Dennis Street"}, new String[] {"55 Di Loreto Way"}, new String[] {"0 Schlimgen Avenue"}, new String[] {"848 Buena Vista Circle"}, new String[] {"4 Springview Drive"}, new String[] {"484 Clarendon Point"}, new String[] {"715 Melvin Trail"}, new String[] {"1 Eastlawn Point"}, new String[] {"643 Northport Lane"}, new String[] {"319 Summit Hill"}, new String[] {"79924 Bobwhite Avenue"}, new String[] {"0604 Corben Center"}, new String[] {"7 Barnett Plaza"}, new String[] {"3960 Sunbrook Hill"}, new String[] {"04 Buhler Hill"}, new String[] {"8 Lakeland Plaza"}, new String[] {"310 American Ash Place"}, new String[] {"245 4th Court"}, new String[] {"34 Waywood Hill"}, new String[] {"59 Cody Terrace"}, new String[] {"203 Blaine Parkway"}, new String[] {"8 Ryan Lane"}, new String[] {"2963 Helena Way"}, new String[] {"3 Bellgrove Pass"}, new String[] {"6761 Moland Drive"}, new String[] {"9 Fremont Pass"}, new String[] {"00519 Lakeland Terrace"}, new String[] {"0 Kenwood Avenue"}, new String[] {"186 Esch Court"}, new String[] {"1970 Canary Junction"}, new String[] {"68 Surrey Plaza"}, new String[] {"15 Morning Center"}, new String[] {"1 Corben Point"}, new String[] {"696 Trailsway Center"}, new String[] {"5439 Lyons Hill"}, new String[] {"31388 Doe Crossing Trail"}, new String[] {"35052 Blue Bill Park Alley"}, new String[] {"502 Merchant Court"}, new String[] {"8 Cardinal Street"}, new String[] {"84 Briar Crest Avenue"}, new String[] {"719 Trailsway Plaza"}, new String[] {"63568 Melody Circle"}, new String[] {"19 Fisk Point"}, new String[] {"768 Nevada Alley"}, new String[] {"912 Continental Way"}, new String[] {"8075 Warrior Avenue"}, new String[] {"3 Huxley Drive"}, new String[] {"5 5th Crossing"}, new String[] {"62836 Fisk 
Drive"}, new String[] {"1 Victoria Park"}, new String[] {"912 2nd Pass"}, new String[] {"02 Barby Court"}, new String[] {"04 Westend Drive"}, new String[] {"41250 Hollow Ridge Drive"}, new String[] {"722 Lotheville Avenue"}, new String[] {"66 Manley Trail"}, new String[] {"15876 Orin Center"}, new String[] {"7 Sloan Point"}, new String[] {"42 Wayridge Way"}, new String[] {"03129 Moland Park"}, new String[] {"4481 Maple Wood Court"}, new String[] {"30127 Ruskin Park"}, new String[] {"69812 Weeping Birch Trail"}, new String[] {"879 Prentice Place"}, new String[] {"57 Carpenter Parkway"}, new String[] {"05422 Eagle Crest Plaza"}, new String[] {"8 Eliot Way"}, new String[] {"81 Dwight Junction"}, new String[] {"3 Mockingbird Hill"}, new String[] {"777 Steensland Terrace"}, new String[] {"2 Acker Way"}, new String[] {"8 Ronald Regan Trail"}, new String[] {"4 Shelley Junction"}, new String[] {"746 Boyd Parkway"}, new String[] {"62 Fremont Drive"}, new String[] {"9 Holy Cross Crossing"}, new String[] {"7693 Chive Plaza"}, new String[] {"467 Bonner Crossing"}, new String[] {"52013 Esker Plaza"}, new String[] {"05 Kropf Center"}, new String[] {"963 Oxford Avenue"}, new String[] {"2 Clarendon Hill"}, new String[] {"67392 Welch Street"}, new String[] {"3 Hooker Parkway"}, new String[] {"73169 Meadow Valley Alley"}, new String[] {"940 Old Shore Center"}, new String[] {"36726 Prairieview Point"}, new String[] {"75 Reinke Place"}, new String[] {"952 Mallory Center"}, new String[] {"3 Utah Trail"}, new String[] {"222 Larry Alley"}, new String[] {"0 Harper Terrace"}, new String[] {"01 Northland Place"}, new String[] {"2265 Nevada Terrace"}, new String[] {"01010 Longview Circle"}, new String[] {"187 Victoria Way"}, new String[] {"80 Elmside Pass"}, new String[] {"84 4th Park"}, new String[] {"0 Esch Circle"}, new String[] {"5 Dapin Plaza"}, new String[] {"047 Morning Court"}, new String[] {"89720 Schmedeman Park"}, new String[] {"97410 Esker Way"}, new String[] {"48 Del Sol 
Center"}, new String[] {"478 Burrows Junction"}, new String[] {"57821 Morningstar Alley"}, new String[] {"1 Fisk Court"}, new String[] {"6131 Red Cloud Crossing"}, new String[] {"5 Eastlawn Terrace"}, new String[] {"6 Merchant Terrace"}, new String[] {"32 Canary Avenue"}, new String[] {"8836 Pleasure Place"}, new String[] {"971 Express Park"}, new String[] {"14 Knutson Lane"}, new String[] {"55961 Summerview Point"}, new String[] {"7 Beilfuss Center"}, new String[] {"05527 American Ash Avenue"}, new String[] {"1 Ryan Lane"}, new String[] {"63498 Tennyson Street"}, new String[] {"692 Graceland Park"}, new String[] {"5816 Westridge Circle"}, new String[] {"86 Namekagon Street"}, new String[] {"3017 Fairview Junction"}, new String[] {"7938 Ramsey Junction"}, new String[] {"739 Knutson Hill"}, new String[] {"9599 Clemons Junction"}, new String[] {"79389 Rusk Crossing"}, new String[] {"176 Dawn Circle"}, new String[] {"4 Scoville Crossing"}, new String[] {"6 5th Point"}, new String[] {"43469 Fairview Road"}, new String[] {"779 Roxbury Place"}, new String[] {"57 Kedzie Crossing"}, new String[] {"70 Loeprich Hill"}, new String[] {"952 Westend Trail"}, new String[] {"377 Sutteridge Drive"}, new String[] {"1 Springs Place"}, new String[] {"26 Oxford Road"}, new String[] {"96 Tennessee Lane"}, new String[] {"582 Beilfuss Point"}, new String[] {"4 Victoria Alley"}, new String[] {"235 Burrows Park"}, new String[] {"59 Merrick Pass"}, new String[] {"728 Sherman Way"}, new String[] {"338 Buell Court"}, new String[] {"363 Vernon Avenue"}, new String[] {"42451 Pleasure Terrace"}, new String[] {"6323 Sutherland Street"}, new String[] {"45 Basil Road"}, new String[] {"52097 Talmadge Plaza"}, new String[] {"99 Pepper Wood Pass"}, new String[] {"40 Hudson Circle"}, new String[] {"940 Sycamore Park"}, new String[] {"8 Waywood Hill"}, new String[] {"0 5th Way"}, new String[] {"07 Everett Crossing"}, new String[] {"014 Dorton Pass"}, new String[] {"65 Grim Way"}, new String[] {"771 
Caliangt Terrace"}, new String[] {"0404 Moose Way"}, new String[] {"27295 2nd Court"}, new String[] {"643 Ramsey Point"}, new String[] {"307 Esch Point"}, new String[] {"02 Dorton Plaza"}, new String[] {"116 Hovde Trail"}, new String[] {"985 Truax Crossing"}, new String[] {"941 Sauthoff Junction"}, new String[] {"395 Harper Lane"}, new String[] {"54 Lillian Court"}, new String[] {"2 Manitowish Park"}, new String[] {"92 Donald Plaza"}, new String[] {"75 Annamark Pass"}, new String[] {"62160 Melvin Point"}, new String[] {"34 Trailsway Court"}, new String[] {"5700 Loomis Parkway"}, new String[] {"8 Starling Drive"}, new String[] {"55171 Fremont Plaza"}, new String[] {"9 Boyd Crossing"}, new String[] {"4153 Rieder Alley"}, new String[] {"091 Corscot Circle"}, new String[] {"07 Marcy Place"}, new String[] {"84 Fallview Junction"}, new String[] {"07 Dennis Way"}, new String[] {"530 Manley Trail"}, new String[] {"8438 Scofield Pass"}, new String[] {"21077 Aberg Alley"}, new String[] {"103 New Castle Place"}, new String[] {"1666 Sugar Court"}, new String[] {"54 Caliangt Hill"}, new String[] {"84 Commercial Hill"}, new String[] {"7296 8th Street"}, new String[] {"75002 Crescent Oaks Point"}, new String[] {"22272 Packers Center"}, new String[] {"951 Prairieview Avenue"}, new String[] {"5 Fair Oaks Parkway"}, new String[] {"62 Moulton Trail"}, new String[] {"73 Westridge Alley"}, new String[] {"6686 Welch Park"}, new String[] {"11624 Laurel Trail"}, new String[] {"293 Badeau Junction"}, new String[] {"39366 Mifflin Park"}, new String[] {"996 Westend Junction"}, new String[] {"7783 Blackbird Alley"}, new String[] {"566 Manufacturers Parkway"}, new String[] {"5931 Corscot Center"}, new String[] {"04848 Steensland Center"}, new String[] {"442 Lakewood Gardens Way"}, new String[] {"63 Arapahoe Junction"}, new String[] {"040 Schurz Hill"}, new String[] {"1847 Daystar Plaza"}, new String[] {"677 Eastlawn Road"}, new String[] {"1 Briar Crest Point"}, new String[] {"1359 Stang 
Terrace"}, new String[] {"68 Burning Wood Point"}, new String[] {"78241 Merrick Place"}, new String[] {"62933 Mayer Junction"}, new String[] {"73 Fair Oaks Trail"}, new String[] {"39 Shopko Way"}, new String[] {"8775 Anderson Street"}, new String[] {"28205 Butterfield Place"}, new String[] {"17028 Pine View Junction"}, new String[] {"97656 Sherman Avenue"}, new String[] {"84067 Prairieview Court"}, new String[] {"97595 Dayton Lane"}, new String[] {"49835 Carioca Court"}, new String[] {"4 Texas Lane"}, new String[] {"56 Debs Park"}, new String[] {"0 Florence Hill"}, new String[] {"9398 Mariners Cove Point"}, new String[] {"37 La Follette Alley"}, new String[] {"842 Lunder Center"}, new String[] {"8 Badeau Place"}, new String[] {"716 Eastlawn Park"}, new String[] {"7 Crescent Oaks Lane"}, new String[] {"3526 Sugar Circle"}, new String[] {"65538 Milwaukee Street"}, new String[] {"515 Warbler Place"}, new String[] {"0 Holy Cross Way"}, new String[] {"3 Welch Place"}, new String[] {"22 Swallow Way"}, new String[] {"35 Jenifer Trail"}, new String[] {"1 Waubesa Terrace"}, new String[] {"71411 Dapin Pass"}, new String[] {"72 Rusk Junction"}, new String[] {"1 Grover Road"}, new String[] {"7 Farmco Junction"}, new String[] {"56520 Springview Alley"}, new String[] {"38 Pearson Center"}, new String[] {"7006 Drewry Court"}, new String[] {"4916 Warner Place"}, new String[] {"4713 Vera Trail"}, new String[] {"093 Eggendart Road"}, new String[] {"20144 Rutledge Crossing"}, new String[] {"41 Park Meadow Drive"}, new String[] {"529 6th Pass"}, new String[] {"42 American Place"}, new String[] {"4 Division Pass"}, new String[] {"8 Arrowood Park"}, new String[] {"9 Ridge Oak Pass"}, new String[] {"07 5th Alley"}, new String[] {"85877 Clyde Gallagher Street"}, new String[] {"68 Thompson Pass"}, new String[] {"83726 Mockingbird Center"}, new String[] {"06667 Lunder Trail"}, new String[] {"92 Merry Plaza"}, new String[] {"552 Katie Point"}, new String[] {"89179 Harper Drive"}, new 
String[] {"6489 Starling Junction"}, new String[] {"3 Cherokee Road"}, new String[] {"21 Mandrake Way"}, new String[] {"3 Stang Way"}, new String[] {"51534 Lakeland Place"}, new String[] {"30531 Stang Parkway"}, new String[] {"550 Stuart Terrace"}, new String[] {"1 Glendale Court"}, new String[] {"4697 East Avenue"}, new String[] {"12 Chinook Pass"}, new String[] {"51 Alpine Pass"}, new String[] {"75776 Stephen Court"}, new String[] {"54846 Green Pass"}, new String[] {"62148 Southridge Park"}, new String[] {"082 Hanson Park"}, new String[] {"5331 Victoria Drive"}, new String[] {"117 Carey Avenue"}, new String[] {"99 Truax Trail"}, new String[] {"84027 Eggendart Point"}, new String[] {"7003 Orin Road"}, new String[] {"5704 Kensington Hill"}, new String[] {"70271 Washington Plaza"}, new String[] {"535 Bayside Plaza"}, new String[] {"5279 Wayridge Place"}, new String[] {"671 Eastwood Point"}, new String[] {"33492 Debs Park"}, new String[] {"595 Lyons Place"}, new String[] {"153 Sunnyside Street"}, new String[] {"20 Hallows Place"}, new String[] {"9 Hovde Drive"}, new String[] {"08 Dixon Lane"}, new String[] {"6 Eagle Crest Junction"}, new String[] {"54434 Mccormick Point"}, new String[] {"0 Union Pass"}, new String[] {"22 Algoma Hill"}, new String[] {"0023 Merchant Avenue"}, new String[] {"18947 Monterey Drive"}, new String[] {"97 Elmside Way"}, new String[] {"5 Logan Crossing"}, new String[] {"8 Calypso Point"}, new String[] {"5 Stoughton Street"}, new String[] {"1 Bonner Place"}, new String[] {"7904 Elgar Trail"}, new String[] {"68292 Claremont Lane"}, new String[] {"89 Golf View Junction"}, new String[] {"03 Onsgard Plaza"}, new String[] {"04 Spaight Trail"}, new String[] {"3 Westerfield Way"}, new String[] {"70 Bay Alley"}, new String[] {"66898 Hudson Drive"}, new String[] {"440 Blackbird Court"}, new String[] {"218 Meadow Ridge Lane"}, new String[] {"7572 Briar Crest Plaza"}, new String[] {"10 Redwing Junction"}, new String[] {"778 Westerfield Alley"}, new 
String[] {"148 Elka Center"}, new String[] {"65529 Scofield Point"}, new String[] {"95562 Warner Avenue"}, new String[] {"418 Tennessee Terrace"}, new String[] {"1 Northfield Way"}, new String[] {"40950 Moose Center"}, new String[] {"986 Grasskamp Place"}, new String[] {"5 Mesta Junction"}, new String[] {"7 Mitchell Crossing"}, new String[] {"472 Hallows Parkway"}, new String[] {"93 Lindbergh Drive"}, new String[] {"8 Myrtle Trail"}, new String[] {"2703 Nobel Plaza"}, new String[] {"8 Harper Alley"}, new String[] {"1 Columbus Place"}, new String[] {"520 Westridge Parkway"}, new String[] {"8633 Hanover Drive"}, new String[] {"002 Bunker Hill Park"}, new String[] {"756 Mayfield Point"}, new String[] {"0 Prentice Center"}, new String[] {"9 Walton Point"}, new String[] {"95898 Parkside Point"}, new String[] {"425 Killdeer Lane"}, new String[] {"863 Oak Valley Place"}, new String[] {"6873 Glacier Hill Junction"}, new String[] {"5 Manley Center"}, new String[] {"753 Manley Crossing"}, new String[] {"52 Crest Line Park"}, new String[] {"7534 Forster Road"}, new String[] {"2077 Melrose Terrace"}, new String[] {"04674 Service Way"}, new String[] {"59 Dayton Pass"}, new String[] {"9 Mccormick Trail"}, new String[] {"0 Judy Drive"}, new String[] {"1068 Thompson Road"}, new String[] {"661 Jana Place"}, new String[] {"19007 Butterfield Road"}, new String[] {"82 Forest Trail"}, new String[] {"2480 Oak Valley Place"}, new String[] {"0 Melrose Drive"}, new String[] {"30604 Graedel Drive"}, new String[] {"0 Mccormick Hill"}, new String[] {"0438 Fisk Place"}, new String[] {"782 Cascade Court"}, new String[] {"76302 Sloan Court"}, new String[] {"02 Dwight Drive"}, new String[] {"194 Arrowood Park"}, new String[] {"7 Hovde Avenue"}, new String[] {"2 6th Trail"}, new String[] {"794 Harbort Alley"}, new String[] {"25 Red Cloud Way"}, new String[] {"16208 Declaration Street"}, new String[] {"98 Iowa Circle"}, new String[] {"26543 Atwood Terrace"}, new String[] {"004 Merry Pass"}, new 
String[] {"951 Dwight Court"}, new String[] {"65 Havey Crossing"}, new String[] {"35 Loeprich Court"}, new String[] {"051 Redwing Avenue"}, new String[] {"765 Coolidge Terrace"}, new String[] {"6866 Clarendon Drive"}, new String[] {"693 4th Way"}, new String[] {"035 Mccormick Point"}, new String[] {"36 Meadow Ridge Terrace"}, new String[] {"49406 Tennyson Terrace"}, new String[] {"039 Welch Terrace"}, new String[] {"23 New Castle Alley"}, new String[] {"7182 Rockefeller Place"}, new String[] {"47 Walton Junction"}, new String[] {"4950 Grasskamp Circle"}, new String[] {"2 David Place"}, new String[] {"4861 Derek Junction"}, new String[] {"54 Caliangt Lane"}, new String[] {"89589 Armistice Junction"}, new String[] {"00902 Shoshone Circle"}, new String[] {"3884 Randy Lane"}, new String[] {"30 Grasskamp Circle"}, new String[] {"37 Valley Edge Park"}, new String[] {"28620 4th Center"}, new String[] {"58695 Sutherland Crossing"}, new String[] {"006 Mallard Point"}, new String[] {"19 Oxford Court"}, new String[] {"04 Dapin Way"}, new String[] {"8704 Lakeland Way"}, new String[] {"680 Evergreen Trail"}, new String[] {"13 Cherokee Street"}, new String[] {"22621 Ridge Oak Plaza"}, new String[] {"4115 Grover Lane"}, new String[] {"16943 Sutherland Street"}, new String[] {"44127 Oak Lane"}, new String[] {"7 Northwestern Trail"}, new String[] {"837 Mandrake Parkway"}, new String[] {"3220 Del Mar Avenue"}, new String[] {"17 Fieldstone Alley"}, new String[] {"8098 Elgar Parkway"}, new String[] {"036 Packers Crossing"}, new String[] {"3702 Dayton Plaza"}, new String[] {"42 Killdeer Parkway"}, new String[] {"3 Cherokee Road"}, new String[] {"35 Hanover Terrace"}, new String[] {"4 Basil Avenue"}, new String[] {"9 Coolidge Terrace"}, new String[] {"771 Manitowish Center"}, new String[] {"681 Golf View Park"}, new String[] {"77395 Gale Crossing"}, new String[] {"2 Bultman Plaza"}, new String[] {"879 Duke Court"}, new String[] {"9 Petterle Road"}, new String[] {"8 Bluestem Court"}, new 
String[] {"794 Bultman Center"}, new String[] {"89 Fuller Terrace"}, new String[] {"48309 Corscot Park"}, new String[] {"075 2nd Road"}, new String[] {"61273 Springview Point"}, new String[] {"33 Ronald Regan Point"}, new String[] {"6 Cottonwood Trail"}, new String[] {"86 Sachtjen Pass"}, new String[] {"72238 Amoth Trail"}, new String[] {"1712 Fairview Avenue"}, new String[] {"7 Lillian Center"}, new String[] {"4 Dwight Road"}, new String[] {"13108 Stuart Plaza"}, new String[] {"1 Beilfuss Point"}, new String[] {"9 Fairview Parkway"}, new String[] {"88 6th Way"}, new String[] {"7 Killdeer Parkway"}, new String[] {"212 Graedel Plaza"}, new String[] {"3 Waubesa Hill"}, new String[] {"6 Autumn Leaf Pass"}, new String[] {"06159 Dryden Avenue"}, new String[] {"01660 Ruskin Street"}, new String[] {"55499 Merrick Circle"}, new String[] {"032 Superior Road"}, new String[] {"6 Homewood Plaza"}, new String[] {"2294 Barby Circle"}, new String[] {"782 Dahle Plaza"}, new String[] {"606 Springs Junction"}, new String[] {"6 Old Shore Hill"}, new String[] {"246 Kipling Point"}, new String[] {"0 Bonner Junction"}, new String[] {"22 Morningstar Point"}, new String[] {"94159 Esker Place"}, new String[] {"7 Calypso Alley"}, new String[] {"528 Fuller Hill"}, new String[] {"1678 Becker Trail"}, new String[] {"9 Badeau Parkway"}, new String[] {"0 Fremont Street"}, new String[] {"76654 High Crossing Avenue"}, new String[] {"8192 Manitowish Avenue"}, new String[] {"72 Sunnyside Drive"}, new String[] {"3 Dawn Road"}, new String[] {"057 Forest Dale Crossing"}, new String[] {"91 Shoshone Terrace"}, new String[] {"55 Westridge Crossing"}, new String[] {"35592 Westport Street"}, new String[] {"6 Kropf Crossing"}, new String[] {"10323 Dottie Terrace"}, new String[] {"3 Mariners Cove Terrace"}, new String[] {"13631 Union Pass"}, new String[] {"15 Division Street"}, new String[] {"4146 Thackeray Junction"}, new String[] {"07762 Huxley Lane"}, new String[] {"7 Burning Wood Pass"}, new String[] {"8 
Manufacturers Center"}, new String[] {"0231 Prairie Rose Pass"}, new String[] {"98969 Everett Park"}, new String[] {"6559 Dawn Point"}, new String[] {"14 Kingsford Road"}, new String[] {"02 Luster Way"}, new String[] {"13 Sachtjen Hill"}, new String[] {"659 Shasta Crossing"}, new String[] {"288 Raven Way"}, new String[] {"6111 Dovetail Parkway"}, new String[] {"4584 Forest Run Hill"} };

// Build a context for a record with a single field ("address").
AnalyzerContext context = new AnalyzerContext(null, DateTimeParser.DateResolutionMode.Auto, "ftatest", fieldnames);
TextAnalyzer textAnalyzer = new TextAnalyzer(context);
// Use the JVM default locale (en-US in the output above).
textAnalyzer.setLocale(Locale.getDefault());
RecordAnalyzer analyzer = new RecordAnalyzer(textAnalyzer);

System.out.println("RowCount=" + values.length);
// Train on every row; each row is a one-element record.
for (String[] value : values) {
  analyzer.train(value);
}

// Print the analysis result for each field as JSON.
for (TextAnalysisResult result : analyzer.getResult().getStreamResults()) {
  String json = result.asJSON(true, 0);
  System.out.println(json);
}

tsegall commented 1 year ago

Resolved in 12.8.3.

tsegall commented 1 year ago

Please note that a number of other mismatched counts were flushed out by running a count check across the entire semantic-type reference set (see https://github.com/tsegall/semantic-types), which is approximately 120,000 fields. You may therefore want to bump to the latest version at your convenience. As usual, see the ChangeLog for details.