ACM Silicon Valley Data Mining Camp

Starts: Sunday November 01, 2009 at 12:00pm
Ends: Sunday November 01, 2009 at 7:30pm
Event Type: Conference
Region: San Francisco Bay Area
Location: Hackers Dojo
S Whisman Rd
Mountain View, CA 94041 US
Price:
Website: http://www.sfbayacm.org/?p=894
Industry: computer software
Keywords: Acm, Data Mining, Cloud Computing, Machine Learning
Intended For: C**, VP's, consultants, software developers, product managers, those who want their business dollars to go farther with analytics
Organization: San Francisco Bay Area Association of Computing Machinery

See the ACM site for full details, and to share topics you are interested in for the un-conference. Optional $20 to join the SF Bay ACM for the year.

POSSIBLE TOPICS that may be proposed: • Introduction to data mining, how to get started, FAQ • Challenges in vertical market X (internet advertising, medical, green tech, finance, marketing, retail, …) • Discuss algorithm X (Support Vector Machines, TreeNet, NaïveBayes, Clustering, outlier detection, text mining) • Bring your challenges to brainstorm on current projects with experts • Netflix $1,000,000 data mining competition, presentation of collaborative filtering papers by Yehuda Koren from Greg Makowski • New developments in R and the Windows IDE – presented by David of REvolution Computing • Problem identification - best customers/prod

GOLD SPONSOR: REvolution Computing - Open source products and servinces for high performance analytics. KXEN - Knowledge Extraction Engines - The leading provider of automated data mining software and customer analytic solutions. LinkedIn - Over 50 million professionals use LinkedIn to exchange information, ideas and opportunities

APPROXIMATE SCHEDULE 12:00 Arrive, name tags, network, brainstorm discussion topics with others, eat 12:45 Main session starts, overview of the day 1:00 Panel of experts answering questions from the audience 1:30 Gold Sponsor & Dojo presentations 1:50 Audience members line up to suggest discussion topics to the room. If a minimum threshold of people are interested in the topic, then it gets a discussion slot We can have 6+ concurrent discussion slots per time slot Recommend for each discussion a primary facilitator and a note taker to report at the end 2:30 Time Slot 1 (many concurrent sessions) 3:30 Time Slot 2 ( “ “ “ ) 4:30 Time Slot 3 5:30 Time Slot 4 6:30 Report summary of sessions over food & drinks in the main area, networking 7:30 Camp organizers invite any help in picking up after the free unconference

Comments (16)

  • Mining healthcare data

    When
    Posted about 1 month ago
    Author
    Junling Hu, at Robert Bosch Research and Technology Center
    Quotes
  • I am proposing a health care topic too: Biomedical Data Mining: Dimensionality, Noise, Applications

    When
    Posted about 1 month ago
    Author
    Irene Gabashvili, Entrepreneur, Innovator, and Educator
    044de13
  • I am using the LinkedIn site for the RSVP details, and the ACM site for the topic discussions. I would invite you to post your healthcare topics to http://www.sfbayacm.org/?p=894. Please share some details and any links to related reading people might feel interested in. THANKS FOR YOUR INTEREST!!

    When
    Posted about 1 month ago
    Author
    Greg Makowski, Principal Consultant at Golden Data Mining
    29a2035
  • There are many areas in healthcare where data mining is critical. At present time, clinical data mining for decision making system, and machine learning on consumer health information are probably interesting to some people here. Healthcare search engine is also another exciting area.

    When
    Posted about 1 month ago
    Author
    AJ Chen, Advocate of semantic search engine, web intelligence, and digital health.
    2f4c832
  • I propose discussing algorithms X (Support Vector Machines, TreeNet, NaïveBayes, Clustering, outlier detection, text mining)

    When
    Posted about 1 month ago
    Author
    Pavani Vantimitta, Software Engineer at Clearwell Systems Inc.
    Quotes
  • FYI. I am going to present a paper "A Brief Guide to Legal Issues for Data-miners" to the Bay Area SAS User group. You can find out more details in basas.com. Date: Thursday, October 29th, 2009 Time: 1:30pm-4:30pm Registration starts at 1:00pm Locations: Northern California Kaiser Permanente Division of Research 2000 Broadway Oakland, CA 94612

    When
    Posted 29 days ago
    Author
    Aaron Lai, CFA, VP and Senior Quantitative Research Associate at Bank of America
    2b43e99
  • I'd like to hear discussions on dealing with highly imbalanced data (e.g., 0.1% density of successes in dataset. Also would be useful to discuss best practices and suitable models for datasets where most features are categorical rather than numerical.

    When
    Posted 27 days ago
    Author
    Scott Nicholson, Business Optimization and User Behavior Modeling
    2b8753e
  • I'm interesting in OSS tools that support data mining in production environments (live or near-real time). Otherwise I'm very interested in general overview and survey of the state-of-the-art.

    When
    Posted 27 days ago
    Author
    Alan Hawrylyshen, Director, Strategic Technology Applications at Ditech Networks
    Quotes
  • I would like to discuss how to data mine without disturbing the existing systems. Too often the set of data applications is such a fine set of spaghetti that you do not want to disturb.

    When
    Posted 26 days ago
    Author
    Hans van Rietschote, CEO and founder at Mercury Swan Consulting
    02fee45
  • Hello, I would encourage all suggestions to get posted on http://www.sfbayacm.org/?p=894

    When
    Posted 25 days ago
    Author
    Greg Makowski, Principal Consultant at Golden Data Mining
    29a2035
  • Hi Aaron, I would be very interested in learning more about your researching pertaining to the legal issues with Data-mining. Where can I find the paper to read more? Please let me know, sukantag@gmail.com SG

    When
    Posted 22 days ago
    Author
    Sukanta Ganguly, Experienced Entrepreneur
    2085b6f
  • AJ, I agree with you whole heartedly on the emphasis of data mining for health information. My emphasis is not in the clinical decision space but am aggressive about data mining for health care and health information for consumers directed towards Pervasive Health Care. MedgoLine is working on this and we do our best to stay stealth mode but nonetheless big from research point of view SG

    When
    Posted 22 days ago
    Author
    Sukanta Ganguly, Experienced Entrepreneur
    2085b6f
  • Dr. Irene, Very interested in this area too. Unfortunately will miss the session today but am hoping the presentation and discussions can come online. SG

    When
    Posted 22 days ago
    Author
    Sukanta Ganguly, Experienced Entrepreneur
    2085b6f
  • doesn't anyone have the link to the scribe notes (on etherpad) of the sparse-data talk? Thanks!

    When
    Posted 21 days ago
    Author
    Srivatsan Ramanujam, Software Engineer in Analytics, Salesforce.com
    3e7170b
  • does*

    When
    Posted 21 days ago
    Author
    Srivatsan Ramanujam, Software Engineer in Analytics, Salesforce.com
    3e7170b
  • It was an amazing event! There were more than 225 people in attendance. Thank you to everyone who attended! There is a blogs about the camp: General impression: http://www.zemanta.com/fruitblog/acm-data-mining-camp-silicon-valley-report/ Focus on R: http://bit.ly/3sM5kQ BioMedical: http://aurametrix.blogspot.com/2009/11/biomedical-data-mining-dimensionality.html There were many "tweets" about the camp People even put up a twitpics http://twitpic.com/nx57f http://tweetphoto.com/xbs1abje http://img682.yfrog.com/i/b07.jpg/ Here are some slides about the event! http://www.slideshare.net/clibou/datacamp

    When
    Posted 20 days ago
    Author
    Patricia Hoffman, PhD, at Aha Solutions!
    32fb162