{"id":52,"date":"2015-05-01T22:50:03","date_gmt":"2015-05-01T22:50:03","guid":{"rendered":"http:\/\/giwebb.com\/index.php\/2015\/05\/01\/sdm15-paper-award\/"},"modified":"2015-05-01T22:50:03","modified_gmt":"2015-05-01T22:50:03","slug":"sdm15-paper-award","status":"publish","type":"post","link":"http:\/\/giwebb.com\/index.php\/2015\/05\/01\/sdm15-paper-award\/","title":{"rendered":"SDM15 paper award"},"content":{"rendered":"<p><a href=\"http:\/\/i0.wp.com\/i.giwebb.com\/wp-content\/uploads\/2016\/06\/SDM-award.jpg\"><img decoding=\"async\" class=\"size-medium wp-image-270 alignright\" src=\"http:\/\/i0.wp.com\/i.giwebb.com\/wp-content\/uploads\/2016\/06\/SDM-award.jpg?resize=300%2C226\" alt=\"SDM-award\" \/><\/a>We are delighted to receive the <strong>SDM15 Best Research Paper Honorable Mention <\/strong>award.<\/p>\n<p>The Society for Industrial and Applied Math (SIAM) International Conference on Data Mining (SDM15) Awards Committee selected 4 papers for awards from nearly 400 submissions.<\/p>\n<p><a href=\"https:\/\/www.pathlms.com\/siam\/courses\/1249\/sections\/1329\/video_presentations\/11413\" target=\"_blank\">View the presentation here<\/a>.<\/p>\n<p>And here is a link to the paper and its bibliographic details:<\/p>\n<ul class=\"papercite_bibliography\">\n<p>                  <a href='http:\/\/epubs.siam.org\/doi\/pdf\/10.1137\/1.9781611974010.53' target=\"_blank\" class=\"no-link-icon\" title='View document on publisher site'><img src='http:\/\/i1.wp.com\/i.giwebb.com\/wp-content\/plugins\/papercite\/img\/external.png?resize=10%2C10' alt='[URL]' \/><\/a>    Petitjean, F., &amp; Webb, G. I. (2015). Scaling log-linear analysis to datasets with thousands of variables. <span style=\"font-style: italic\">Proceedings of the 2015 SIAM International Conference on Data Mining<\/span>, pp.  469-477. <br \/>   <a href=\"void(0)\" id=\"papercite_29\" class=\"papercite_toggle\">[Bibtex]<\/a>       <a href=\"void(0)\" id=\"papercite_abstract_29\" class=\"papercite_toggle\">[Abstract]<\/a>          <font color=\"grey\">&nbsp;&rarr; <a href=\"http:\/\/i.giwebb.com\/index.php\/research-programs\/scalable-graphical-modeling\/\" style=\"color: grey\">Related papers and software<\/a><\/font><\/p>\n<div class=\"papercite_bibtex\" id=\"papercite_29_block\">\n<pre><code class=\"tex bibtex\">@InProceedings{PetitjeanWebb15,\nTitle = {Scaling log-linear analysis to datasets with thousands of variables},\nAuthor = {F. Petitjean and G.I. Webb},\nBooktitle = {Proceedings of the 2015 {SIAM} International Conference on Data Mining},\nYear = {2015},\nPages = {469-477},\nAbstract = {Association discovery is a fundamental data mining task. The primary statistical approach to association discovery between variables is log-linear analysis. Classical approaches to log-linear analysis do not scale beyond about ten variables. We have recently shown that, if we ensure that the graph supporting the log-linear model is chordal, log-linear analysis can be applied to datasets with hundreds of variables without sacrificing the statistical soundness [21]. However, further scalability remained limited, because state-of-the-art techniques have to examine every edge at every step of the search. This paper makes the following contributions: 1) we prove that only a very small subset of edges has to be considered at each step of the search; 2) we demonstrate how to efficiently find this subset of edges and 3) we show how to efficiently keep track of the best edges to be subsequently added to the initial model. Our experiments, carried out on real datasets with up to 2000 variables, show that our contributions make it possible to gain about 4 orders of magnitude, making log-linear analysis of datasets with thousands of variables possible in seconds instead of days.},\nComment = {Best Research Paper Honorable Mention Award},\nKeywords = {Association Rule Discovery and statistically sound discovery and scalable graphical models and Learning from large datasets},\nRelated = {scalable-graphical-modeling},\nUrl = {http:\/\/epubs.siam.org\/doi\/pdf\/10.1137\/1.9781611974010.53}\n}<\/code><\/pre>\n<\/div>\n<div class=\"papercite_bibtex\" id=\"papercite_abstract_29_block\">\n<pre><code><b>ABSTRACT<\/b> Association discovery is a fundamental data mining task. The primary statistical approach to association discovery between variables is log-linear analysis. Classical approaches to log-linear analysis do not scale beyond about ten variables. We have recently shown that, if we ensure that the graph supporting the log-linear model is chordal, log-linear analysis can be applied to datasets with hundreds of variables without sacrificing the statistical soundness [21]. However, further scalability remained limited, because state-of-the-art techniques have to examine every edge at every step of the search. This paper makes the following contributions: 1) we prove that only a very small subset of edges has to be considered at each step of the search; 2) we demonstrate how to efficiently find this subset of edges and 3) we show how to efficiently keep track of the best edges to be subsequently added to the initial model. Our experiments, carried out on real datasets with up to 2000 variables, show that our contributions make it possible to gain about 4 orders of magnitude, making log-linear analysis of datasets with thousands of variables possible in seconds instead of days.<\/code><\/pre>\n<\/div>\n<\/ul>\n<p>The post <a rel=\"nofollow\" href=\"http:\/\/i.giwebb.com\/index.php\/2015\/05\/01\/sdm15-paper-award\/\">SDM15 paper award<\/a> appeared first on <a rel=\"nofollow\" href=\"http:\/\/i.giwebb.com\">Geoff Webb<\/a>.<\/p>\n<p>Source: i.giwebb.com<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We are delighted to receive the SDM15 Best Research Paper Honorable Mention award. The Society for Industrial and Applied Math (SIAM) International Conference on Data Mining (SDM15) Awards Committee selected 4 papers for awards from nearly 400 submissions. View the presentation here. And here is a link to the paper and its bibliographic details: Petitjean, <a href=\"http:\/\/giwebb.com\/index.php\/2015\/05\/01\/sdm15-paper-award\/\" class=\"more-link\">...continue reading<span class=\"screen-reader-text\"> \"SDM15 paper award\"<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"false","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"enabled":false},"version":2}},"categories":[1],"tags":[],"class_list":{"0":"post-52","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-uncategorized","7":"h-entry","8":"hentry","9":"h-as-article"},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p6IJkk-Q","_links":{"self":[{"href":"http:\/\/giwebb.com\/index.php\/wp-json\/wp\/v2\/posts\/52","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/giwebb.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/giwebb.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/giwebb.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/giwebb.com\/index.php\/wp-json\/wp\/v2\/comments?post=52"}],"version-history":[{"count":0,"href":"http:\/\/giwebb.com\/index.php\/wp-json\/wp\/v2\/posts\/52\/revisions"}],"wp:attachment":[{"href":"http:\/\/giwebb.com\/index.php\/wp-json\/wp\/v2\/media?parent=52"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/giwebb.com\/index.php\/wp-json\/wp\/v2\/categories?post=52"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/giwebb.com\/index.php\/wp-json\/wp\/v2\/tags?post=52"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}