Brendan Herger | Machine Learning Techniques for Class Imbalances & Adversaries
PyData DC 2016. Many areas of applied machine learning require models that are optimized for rare occurrences (i.e. class imbalance) and that must contend with users actively attempting to subvert the system (i.e. adversaries). This talk will guide the audience through multiple published techniques that specifically attempt to address these issues. The Data Innovation Lab at Capital One has explored more advanced modeling techniques for class imbalance and adversarial actors. Our use case has allowed us to survey the many related fields that deal with these issues and to attempt many of the suggested modeling techniques. Additionally, we have introduced a few novel variations of our own. This talk will provide an introduction to the problem space, a brief overview of the modeling frameworks we've chosen to work with, a brief overview of our approaches, a discussion of lessons learned, and our proposed future work. The approaches discussed will include ensemble models, deep learning, genetic algorithms, outlier detection via dimensionality reduction (PCA and neural network auto-encoders), time-decay weighting, and the Synthetic Minority Over-sampling Technique (SMOTE).
MIA: David Tse, Case studies in DNA and RNA assembly
Models, Inference and Algorithms (MIA), Broad Institute of MIT and Harvard, April 27, 2016. MIA Meeting: https://www.youtube.com/watch?v=XfJQ_XzsBJ8&index=70&t=93s&list=PLlMMtlgw6qNjROoMNTBQjAcdx53kV50cS David Tse, Stanford University and U.C. Berkeley. The Science of Information: Case Studies in DNA and RNA Assembly. In this talk, given as part of the 2016 Models, Inference & Algorithms series, David Tse from Stanford University and the University of California, Berkeley, describes and benchmarks Shannon, a new program for assembling RNA sequences from shotgun sequencing data, designed using the principles of information theory laid down nearly 70 years ago by mathematician and cryptographer Claude Shannon. For more information regarding the Broad Institute and Models, Inference and Algorithms, visit: http://www.broadinstitute.org/mia Copyright Broad Institute, 2016. All rights reserved.
Sparse grid
Sparse grids are numerical techniques to represent, integrate, or interpolate high-dimensional functions. They were originally developed by the Russian mathematician Sergey A.
Bio-Data, Bioinformatics and the Gears that Make Biological Machines Tick
The intersection of biology and computer science is showing tremendous potential. Literally thousands of data sources, great and small, are now publicly available. Come and enjoy a low-key introduction to all the 'omes' - genomes, transcriptomes, proteomes, methylomes, metabolomes, etc. We'll play with some of the more useful data sources, tools, and biological pathways. No biology training is necessary - the talk will be accessible to anyone. Programming skills are useful, but not necessary. If you have a laptop, please bring it. We will try to look at data in the class and preview a couple of tools. Cygwin on Windows machines, OS X, or Linux are useful here, but we can go over these at the time if needed. We will be designing a project for anyone who wants to go deeper and get more experience with bioinformatics. Ron Shigeta, PhD, has been doing bioinformatics for 12 years in biotech in the East Bay; before that, he made biological information as a protein crystallographer.
Lec-33 Dimensionality reduction Using PCA
Lecture Series on Neural Networks and Applications by Prof. S. Sengupta, Department of Electronics and Electrical Communication Engineering, IIT Kharagpur. For more details on NPTEL, visit http://nptel.iitm.ac.in
DEC 94: Telecollaborative Data Exploration
DEC-CRL 94: Tools to register and segment soft tissue from medical images and visualize it three-dimensionally. Numerous techniques have been developed for biomedical image segmentation, but most methods do not perform well on a broad variety of data. We present an interactive approach to the segmentation of medical images. This approach is based on interactive deformable contours and has broad applicability to soft tissue segmentation. We describe a system, implemented in AVS, that we have applied to the segmentation of nerve cells and a human embryo heart from serial tissue sections, and the segmentation of lymph nodes from CT data. This work was conducted in the Visualization Group of the Cambridge Research Lab of Digital Equipment Corporation (Cambridge, MA). In collaboration with Dr. Ingrid Carlbom, William Hsu and Dr. Demetri Terzopoulos. Thyroid data courtesy of Dr. Michael Doyle from UCSF. Neuron data courtesy of Dr. Kristen Harris from Children's Hospital, Boston. Special thanks to Maria Loughlin and Dr. Keith Waters for video production assistance. See also: http://campar.in.tum.de/Chair/KlinkerCRL http://ar.in.tum.de/Chair/PublicationDetail?pub=klinker1994TR94-6 http://ar.in.tum.de/Chair/PublicationDetail?pub=klinker1995cvrmed
2011 Frontiers of Engineering: Additive Manufacturing is Changing Surgery
National Academy of Engineering, 2011 U.S. Frontiers of Engineering Symposium, September 19-21, 2011, Google, Inc., Mountain View, California. Additive Manufacturing is Changing Surgery, September 19, 2011. Presented by Andrew Christensen. ABSTRACT: Google hosted 100 attendees of the 2011 National Academy of Engineering's U.S. Frontiers of Engineering symposium (FOE) at our Mountain View office and Dinah's Garden Hotel in Palo Alto. The symposium is an annual three-day meeting that brings together 100 of the nation's outstanding young engineers (ages 30-45) from industry, academia, and government to discuss pioneering technical work and leading-edge research in various engineering fields and industry sectors. About the speaker: Andrew Christensen is the President of Medical Modeling Inc. in Colorado.
A Comprehensive Review of Stability Analysis of Continuous Time Recurrent Neural Networks
Toward data driven ontologies for mental function
Event Date: August 20, 2018. Presenter: Russell A. Poldrack, Ph.D. The National Institutes of Health (NIH) Office of Behavioral and Social Science Research (OBSSR) hosts the 2017-2018 OBSSR Director’s Webinar Series. Abstract: Psychological science has long been focused on the discovery of novel behavioral phenomena and the mechanistic explanation of those phenomena, which has led to a lack of cumulative conceptual progress. Dr. Poldrack will argue that the development of ontologies is essential for progress, but that these need to be tied directly to empirical data. He will provide an example from the domain of self-regulation, where we have used data-driven ontology development to describe the psychological structure of this domain and characterize its predictive validity with respect to real-world outcomes. Biography: Russell A. Poldrack, Ph.D., Albert Ray Lang Professor of Psychology and Professor (by courtesy) of Computer Science, Stanford University. Russell A. Poldrack is the Albert Ray Lang Professor in the Department of Psychology and Professor (by courtesy) of Computer Science at Stanford University, and Director of the Stanford Center for Reproducible Neuroscience. His research uses neuroimaging to understand the brain systems underlying decision making and executive function. His lab is also engaged in the development of neuroinformatics tools to help improve the reproducibility and transparency of neuroscience, including the Openneuro.org and Neurovault.org data sharing projects and the Cognitive Atlas ontology.
Kara Miller, Nigel Jacob, Kent Larson: "Innovative Cities" | Talks at Google
In the third Thought Leaders at Google in Cambridge program, Kara Miller, host and executive editor of WGBH's Innovation Hub radio show, discusses innovative cities with Nigel Jacob (Boston Mayor's Office for New Urban Mechanics) and Kent Larson (director of the MIT Media Lab’s Changing Places group).
