Basketball, P. (2000). For the P. Ball, H. F. Spirer, & L. Spirer (Eds.), Putting some Circumstances: Investigating Major Human Rights Violations Having fun with Advice Expertise and you can Studies Analysis. AAAS.
Belin, T. R., & Rubin, D. B. (1995). A technique to have calibrating incorrect-meets prices into the record linkage. Log of one’s American Analytical Association, 90(430), 694–707.
Bilenko, Yards., & Mooney, Roentgen. J. (2003). Transformative Duplicate Recognition Having fun with Learnable Sequence Resemblance Steps. Within the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automated Number Linkage Having fun with Seeded Nearby Neighbor and you can Support Vector Machine Category. In the KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A survey from indexing methods for scalable list linkage and you can deduplication. IEEE Transactions on Studies and you will Investigation Engineering, 24(9), 1537–1555.
Cohen, W., Raviku). A comparison away from string metrics to possess matching labels and you may details. Into the KDD working area into the research tidy up and you can target integration (Vol. step three, pp. 73–78).
Copas, J., & Hilton, F. (1990). Checklist linkage: Analytical models getting complimentary desktop records. Log of your Regal Statistical Community, Collection A beneficial, 153(3), 287–320.
Dai, An effective. Yards., & Storkey, Good. J. (2011). The latest categorized blogger-topic design for unsupervised entity resolution. Inside Fake sensory networks and servers studying–icann 2011 (pp. 241–249). Springer.
Fortini, Yards., Liseo, B., Nuccitelli, A., & Scanu, Meters. (2001). Towards Bayesian Checklist Linkage. Research during the Formal Analytics, 4(1), 185–198.
Gutman, Roentgen., Afendulis, C., & Zaslavsky, A beneficial. (2013). A good bayesian means of file connecting to research avoid- of-lifetime medical costs. Record of your Western Analytical Organization, 108(501), 34–47.
Hsu, W., Lee, Meters. L., Liu, B., & Ling, T. W. (2000). Exploration Exploration into the Diabetics Database: Findings and Conclusions. During the KDD ’00 (pp. 430–436). ACM.
A torn-mix Markov strings Monte Carlo process of brand new Dirichlet process combination design
Jewell, N. P., Spagat, Yards., & Jewell, B. L. (2013). MSE and you will Casualty Matters: Presumptions, Translation, and you can Pressures. Into the T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Counting Civilian Casualties: An overview of Recording and you will Quoting Nonmilitary Deaths in conflict. Oxford, UK: Oxford University Drive.
Larsen, Yards. D. (2002)ments on the Hierarchical Bayesian Number Linkage. In Proceedings of the mutual statistical meetings, area on the survey research tips (pp. 1995–2000). The newest American Analytical Association.
Steorts, R
Larsen, Yards. D. (2005). Enhances in Record Linkage Principle: Hierarchical Bayesian Listing Linkage Principle. Into the Process of combined statistical conferences, section for the survey research methods (pp. 3277–3284). The fresh Western Statistical Association.
Larsen, Meters. D., & Rubin, D. B. (2001). Iterative automated record linkage having fun with blend models. Record of one’s Western Mathematical Relationship, 96(453), 32–41 kissbrides.com veza.
Lum, K., Rate, Meters. Elizabeth., & Banks, D. (2013). Applications regarding Multiple Expertise Quote when you look at the Human Legal rights Browse. The latest American Statistician, 67(4), 191–200.
Marchant, Letter. G., C., Kaplan, A great., Rubinstein, B. We. P., & Elazar, D. N. (2019). D-blink: Distributed avoid-to-end bayesian organization quality.
McCallum, A great., & Wellner, B. (2004). Conditional Types of Title Suspicion which have Application so you can Noun Coreference. During the Enhances from inside the sensory recommendations running assistance (nips ’04) (pp. 905–912). MIT Push.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A site-Specific Product on the Deduplication out-of Inoculation Records Information in the Teens Immunization Registriesputers and you can Biomedical Browse, 33(2), 126–143.
Murphy, J., Brackbill, Roentgen. Yards., Thalji, L., Dolan, M., Pulliam, P., & Walker, D. J. (2007). Measuring and you will Improving Coverage around the globe Trade Cardio Wellness Registry. Statistics during the Drug, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic list linkage and you can deduplication immediately following indexing, blocking, and you can filtering. Diary regarding Confidentiality and you will Privacy, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. M., Axford, S. J., & James, A beneficial. P. (1959). Automatic linkage away from public information hosts can be used to extract” follow-up” analytics regarding group from data of techniques details. Science, 130(3381), 954–959.
Sadinle, Yards. (2014). Finding Duplicates inside a murder Registry Having fun with a beneficial Bayesian Partitioning Means. Annals from Applied Statistics, 8(4), 2404–2434.
Sariyar, Yards., Borg, A., & Pommerening, K. (2012). Effective Learning Tricks for brand new Deduplication regarding Digital Patient Analysis Using Group Woods. Record out of Biomedical Informatics, 45(5), 893–900.
C., Hall, R., & Fienberg, S. E. (2016). Good Bayesian Method of Graphical Record Linkage and you can Deduplication. Journal of your Western Statistical Connection, 111(516), 1660–1672.
Tancredi, A., & Liseo, B. (2011). An excellent hierarchical Bayesian way of checklist linkage and society proportions dilemmas. Annals out-of Used Statistics, 5(2B), 1553–1585.