Mark Derthick
Carnegie
Mellon University |
5819
Kentucky Avenue |
Human
Computer Interaction Institute |
Pittsburgh,
PA 15232 |
Pittsburgh,
PA 15213-3891 |
(412)
363-9859 |
(412)
268-8812 |
|
|
|
Research
Interests |
· Human-Information Interaction · Interactive Information Visualization · Knowledge Representation · Exploratory Data Analysis ·
Collection, Organization, and Summarization of
Multi-Media Data Streams |
||
|
|
|
||
Experience
|
|
|||
1995
– Present |
|
|||
|
Research on interactive visual environments for
exploratory data analysis including: · visual query languages · integrated interface environments for exploration of multi-media data, including relational databases, free text, and images · handling large data sets · visualization of the history of users’ interactions with these systems |
|||
|
|
|||
1988
– 1994 |
|
|||
|
Helped build the Cyc large-scale common-sense
knowledge base by: · implementing a smart spreadsheet application for intelligence data, featuring database integration and explanations generated by general-purpose translator from formal Cyc language to English. The spreadsheets performed equality reasoning, semantic contradiction detection, and automatic schema recognition. · writing many smart interfaces and tools to support Cyc theory formulation, retrieval, browsing, testing, debugging, and statistical learning. · formalizing the domains of commercial organizations, documents, and many others. |
|||
|
|
|||
Summer
1984 |
|
|||
|
Developed a knowledge representation system for
common sense reasoning, and utilized for mechanical design by prototype
modification. |
|||
|
|
|||
Summer
1982 |
|
|||
|
Developed a test system for microwave oscillators. |
|||
|
|
|||
Education
|
|
|||
1983
– 1988 |
|
|||
|
Thesis research, directed by Geoff Hinton and
David Touretzky, developed a connectionist computational model similar to the
Boltzmann Machine. Connectionist
models use a densely connected network of very simple processors, each
storing only one scalar value.
Connectionist models are sometimes likened to the brain’s network of
neurons. Networks were built by
compiling a high-level theory of a problem domain expressed in a formal
language using two-valued logic. We
used the language of KLONE, a pioneering knowledge representation and
reasoning system. KLONE reasons
sequentially using rules of deduction.
When given an incomplete theory it returns “unknown” to some
questions. Given an inconsistent
theory, it may refuse to answer any questions at all. In contrast, the connectionist network
degraded gracefully, returning a reasonable answer that minimized conflict. Minor in Montague Semantics supervised by Dana
Scott and Richmond Thomason. |
|||
|
|
|||
1979
– 1983 |
|
|||
|
Areas of concentration included communication
theory and digital system design.
Photo editor of the Washington Times and an officer in the
social fraternity Theta Xi. |
|||
|
Research
Statement |
The Research Statement has three parts: · A description of analog models and why they’re interesting · A chronological survey of my past and present research, and how it all relates to analog models · Strategic plan for expanding the scope and influence of my research |
||
|
Driving Research Questions
|
Analog models fascinate me. An analog model is much like a piece of the real world. In the world, the laws of everyday physics ensure that no two objects occupy the same space, and it is never ambiguous whether an object is in New York or California. Analog computer models also represent complete and consistent information that is constantly updated as the model responds to external influences. At any point the answer to any question about what is true in the model can simply be read off. Spreadsheets are analog models, as long as the inputs are well behaved so that all formulas can be evaluated. Images are generally complete and consistent, as well. A picture of a person generally is usually unambiguous with respect to gender and hair color, for instance. By contrast, AI knowledge representation systems are language-based and can represent the abstraction person or disjunctions like the New York/California case. As an example, an analog model can find the quickest rail route between two cities. The model is a network of beads for cities connected by strings whose length is proportional to the travel time. The fastest route can be found by grabbing the beads representing each city and pulling. Imagine hanging the network on a wall map so that each bead is on top of its city. Impediments like the the Rocky Mountains show up as regions of droopy string. Higher-level patterns like this are often called emergent representations. Humans, automated learning algorithms, or a combination can look for and characterize these representations. Analog models are the most effective starting point for discovering emergent patterns by humans, because their completeness allows them to be visualized as an image and people excel at image perception and interpretation. Knowledge representation languages are used to express theories formally. Usually the language is designed so that computers can reason about the theories efficiently. However, I think of information visualization as knowledge representation for humans. In both cases, finding the right representation is key. Thus knowledge, reasoning about knowledge, and presentation of knowledge all fit within the same framework. This provides a natural infrastructure for the nascent field of Human-Information Interaction, whose goal is to improve the way users find, interact with, and understand information in Internet and other information resources. Most of my research relates to
understanding the characteristics of analog models, their relationship to
more expressive theories, and the applications for which they are most
appropriate. In my current work, simple models are embedded in interfaces
with a Model View Controller architecture.
Further, in accordance with the Direct Manipulation principle that
input=output, the view is the controller as well as the visualization. I draw on my training as a Computer
Scientist to develop efficient algorithms so that the feedback appears
continuous. I draw on my experience
at MCC in knowledge representation to ensure that the computer's model, and
the view of it, is appropriate for the domain and task. These ideas almost constitute a formula for qualitatively changing the experience of using many current software applications: build an interface in Visage, and make it run fast even if you have to give up some functionality. Visage, developed in collaboration with the Maya Group and other members of the Sage group, embodies the MV=C architecture. It is a flexible data visualization and exploration environment. For instance, AI scheduling systems currently run for hours. With Steve Smith, I developed an interface with 100ms feedback for looking at schedule tradeoffs, which are actually more useful than a detailed schedule, at least for the initial stages of planning. With Phil Gibbons and Andrew Moore I have developed algorithms to filter data with a million records with 100ms feedback, by restricting the range of visualizations. For the Informedia project, I developed an interactive image retrieval algorithm that can search 15,000 images in 500ms, while their previous algorithm required 30 secondsAgain, the interface design led to changes in the search algorithm. These interfaces are among those pictured below. |
||
|
Research Chronology
|
|
||
CMU
|
My thesis work compiled a high-level symbolic domain
theory into a sub-symbolic analog model.
The questions it could answer were of the form “if these given
conditions hold, what else will follow?”
The conditions were expressed by forcing parts of the model into
particular states. After some time
for reaching equilibrium, the conditions that follow could be read off other
parts of the model. The model is
always in some state, so it will always give an unambiguous answer to any
question. If the high-level theory is
incomplete or inconsistent, the most likely state is one that minimizes
conflicts and makes free choices that maximize the robustness of the solution
to small changes in the theory. |
|||
MCC |
I built a system that used the Minimum Description
Length (MDL) principle to 1) automatically discover emergent representations
that are intelligible to people and 2) to
find analogical mappings between domains.
Knowledge of analogical mappings can help problem solving in
unfamiliar domains. For instance if you already understand water flow,
someone can explain heat flow to you by analogy. The analogy is helpful to
the degree that the explanation is shorter than an explanation from first
principles. This efficiency gain can be measured in bits. This turns it into an optimization
problem, in which all contributing factors are weighed at once, as with any
analog model. |
|||
CMU |
Currently I build interfaces based on visualizations that embody analog models, with a primary goal of supporting Exploratory Data Analysis. Models that capture the important aspects of the domain must be developed in tandem with efficient algorithms, so that the experience of interaction is direct. |
|||
Scheduling Visualization |
AI scheduling systems currently run for hours. With Steve Smith, I developed an interface with 100ms feedback for looking at high level schedule tradeoffs, which are actually more useful for understanding the constraints than a detailed schedule. The key is to show all possible sub-intervals of the schedule at once. Later intervals are farther to the right, and longer intervals are farther back. Then the set of requirements becomes a landscape of mountain ranges, where elevation shows the magnitude of resources required. The available resource capacity appears as a partially transparent inclined plane. Mountain peaks jutting above the capacity plane represent shortfalls, where requirements exceed capacity. The effect is similar to flying above a cloud layer, where it is very easy to identify mountain peaks that extend above the clouds. |
|||
TimeTravel Interface |
Exploratory Data
Analysis is an opportunistic process, where users try multiple avenues in
search of understanding. Visage
explicitly models these alternative pathways, and visualizes them as a
tree. This makes it easy to switch
back and forth, iteratively extending the most promising. New branches are created automatically
when operations are performed in a past situation. In addition, visualization of the exploration path serves as
documentation for how results were obtained.
Navigation in the tree is done by dragging a cursor. This is more natural than current
interfaces that use discrete undo/redo operations to restore previous
states. User operations are shown in
the tree as events. Selective undo
and redo are accomplished by selecting subsets of events and dragging them
out of the Time Travel interface (selective undo), or by dropping events on
any point in the tree (selective redo).
|
Visual Query Environment |
The Visual Query Environment (VQE) uses a navigation metaphor to capture the incremental process of Exploratory Data Analysis and to situate the analyst in the world of the data. It uses the concept of “aggregate” to bridge the gap between the expressive power of database query languages and the concreteness of data. The navigation path among aggregates resembles a graphical query language, but no “execute” step is needed to convert from specification to realization. Using efficient indexing, aggregate visualizations are updated continuously to maintain a sense of engagement with an analog model. Navigation structures can be created in VQE, or a subset of a user’s normal Visage
operations can be selected from the TimeTravel visualization and compiled
into a visual query. Any
visualizations that were involved in the user operations are copied into the
VQE window as well. The navigation
structure can be graphically edited, and data shown in the visualizations can
be added or removed by drag-and-drop.
Sometimes
abstract queries are desirable, because they can be reused on different
data. Dragging all data out of VQE
leaves a template for reproducing an analysis. A user can drop new data on the template, which will carry out
the navigation operations and populate the visualizations. This declarative approach is more powerful
than procedural macros, because navigation paths can be traversed either
forward or backward. For instance,
the original analysis may have begun with students and proceeded to the
classes they take. On reuse, the user
might drop classes on the query, which would look up the students. |
Personal Information |
An ongoing
project is capturing and building manipulable models of my personal
information stream. The diagram at
left shows the dataflow from my email, calendar, contacts, tasks, web
browsing history, phone calls, medical diary, Quicken, and Emacs and
Microsoft Office visits and saves into Visage. Additional data comes from an Informedia Experience on Demand
camera, microphone, and GPS receiver, and from a BodyMedia arm band that
monitors heart rate, skin galvanic response, and posture. I want to build visualizations that can
help me search and browse this collection for either particular sets of items
or for emergent patterns. Current
applications are tied to one or a few of these information sources, while
people’s goals and interests are largely independent of modality or source
application. As a starting point,
Chris Neuwirth, Jim Morris, and I intend to generalize email programs to
organize information by task rather than message. |
Images |
Current
interfaces that search for similar images return an ordered list of matches
and a match score for each.
Researchers need insight into why an image is chosen in order
to improve their retrieval algorithms.
I developed an algorithm in which credit for a match can be assigned
to individual pixels, and that runs in 500ms instead of the 30 seconds that
the Informedia search engine previously took. Each pixel’s contribution to the match is mapped to opacity, so
what you see is what matched. The
portion of the image to search, as well as the resulting color histogram, can
be interactively updated. What was
formerly a batch-oriented black box is now a manipulable analog model. |
Strategic Plan |
I have led a number of proposals with a large number of collaborators from HCII, other CMU departments, and other institutions. With each one I have learned a little more about what it takes to have a successful HCI-oriented collaboration. I believe the pending NIH proposal "Exploratory Analysis and Visualization Software" contains all the needed ingredients: · We have a large group of potential users who are unhappy with their current tools, and have written a letter of support promising access to their data and to be beta testers. These are public health analysts responsible for the National Survey of Child and Adolescent Well-Being (NSCAW), an ongoing nationally representative longitudinal study of children and families or other caregivers who have had contact with the child welfare system. The first wave included 5400 children. This gives us a clear target for user-centered design. Co-PI Kelly Kelleher, a physician with a Master of Public Health degree, understands the user community and the need for visualization to support Exploratory Data Analysis. · Locally, we can bring in similar users for more frequent study. Many statisticians associated with Pitt and UPMC analyze similar survey data. Co-PI Howard Seltman, a physician and CMU Statistics Professor who has long collaborated with Dr. Kelleher, runs a summer program for graduate students from Historically Black Colleges and Universities that we can draw on for less experienced analysts. · Jane Siegel and John Zimmerman complete the team with their expertise in user-centered design and evaluation. Through this project I believe I will break through my Computer Science biases and become a much more rounded HCI researcher. Sage, Visage, and Informedia have not yet embodied good HCI in this sense. In my statement of career goals three years ago I said clinical medicine was going to be my primary focus. With the NIH proposal, as well as possible projects with Highmark and the Pittsburgh Regional Healthcare Initiative, I am now poised to make this happen. With other HCII faculty interested in medical applications and socially responsible computing, I would like to make CMU a leader in this area. I am eager to share ideas with students as well as colleagues. I hope to reach a steady state of supporting two or three PhD students, and to teach a graduate visualization seminar often enough that every PhD student has the opportunity to take it, starting with Spring 2004. |
Grants |
1. NIH, “Improving Bayesian Phylogeny.” Joseph Kadane, Mark Derthick. 6/15/2003 – 6/14/2007. $730,642. Subcontract from the University of Wisconsin, Madison. 2. DoD/STTR, “An Electronic Workspace for the Commander.” Mark Derthick. Phase I (2002-2003, $29,700) & Phase II (2003-2004, $140,000). Subcontract from Maya Design. 3. Advanced Research and Development Activity (ARDA) AQUAINT Program “Question Answering from Errorful Multi-Media Data Streams.” 07/03/02-01/02/04. Howard Wactlar, Alex Hauptmann, Mark Derthick, John Lafferty, Steven Roth, Laurie Waisel. $1,115,833. 4. NSF Award #0121641 AWSFL008-DS3 ITR/IM: “Capturing, Coordinating and Remembering Human Experience.” Howard D. Wactlar, Takeo Kanade, Michael G. Christel, Alexander G. Hauptmann, Mark Derthick. 10/1/01 – 9/30/04. $2,000,000. 5. CMU Seed Grant, “Visual Discovery: Exploring the Universe at Interactive Speeds.” 1999. Robert Nichol, Mark Derthick, Andrew Connolly, Andrew Moore, Jeff Shneider, Larry Wasserman, Nathan Stone, Joel Welling, Mike Levine, Steven Roth. $96,000. 6. MCC Internal Grant, “Using the Minimum Description Length Principle to Learn Features and Analogical Mappings.” 1990. Mark Derthick. $110,000. |
|
|
Pending
Proposals |
8. NSF ITR: “Detecting Video Perspectives from Global Sources in support of Intelligence Analysis and Contextual Awareness.” Howard Wactlar, Mike Christel, Mark Derthick, Alex Hauptmann, Dorbin Ng. $3,951,018. |
|
|
Books |
1. Mark Derthick. Mundane Reasoning by Parallel Constraint Satisfaction. Research Notes in Artificial Intelligence. Pitman, London, 1990. Reprint of PhD thesis, also available as CMU Technical Report CMU-CS-88-182. |
Book
Chapters |
2. Mark Derthick. Finding a maximally plausible model of an inconsistent theory. In John A. Barnden and Jordan B. Pollack, editors, Advances in Connectionist and Neural Computation Theory, Volume 1: High-Level Connectionist Models, pages 241-258. Ablex, Norwood, NJ, 1991. |
Journal
Articles |
3. Mark Derthick and Stephen F. Smith. An Interactive 3D Visualization for Requirements Analysis. To appear in Journal of Scheduling. 4. Mark Derthick. Mundane Reasoning by settling on a plausible model. Artificial Intelligence. 46(1-2):107-157, November 1990. 5.
Mark Derthick and Steven F. Roth. Enhancing Data
Exploration with a Branching History of User Operations. Knowledge
Based Systems, 14(1-2):65-74, March 2001.. |
Journal
Articles (submitted) |
6.
Mark Derthick and Steven F. Roth. A Navigation Semantics for
Visual Queries. ACM Transactions on Information Systems.. |
Conference
Papers |
7. Mark Derthick, Michael G. Christel , Alexander G. Hauptmann, and Howard D. Wactlar. Constant Density Displays Using Diversity Sampling. To appear in Proceedings of the IEEE Information Visualization Conference (InfoVis'03). 8. Mark Derthick. Interfaces for Palmtop Image Search. In Proceedings of the Joint ACM/IEEE Conference on Digital Libraries, Portland, OR, July, 2002, pp. 340-341. ACM Press, 2002. 9. Mark Derthick and Steven F. Roth. Example-based generation of custom data analysis appliances. Proceedings of Intelligent User Interfaces (IUI '01), Santa Fe, NM, January, 2001, pp. 57-64. 10. Joel Welling and Mark Derthick. Visualization of Large Multi-Dimensional Datasets. in Proceedings of Virtual Observatories of the Future 2000. Pasadena, CA. June, 2000. Ed. R. J. Brunner, S. G. Djorgovski, and A. Szalay. 11. Mark Derthick and Steven F. Roth. Data exploration across temporal contexts. In Proceedings of Intelligent User Interfaces (IUI ‘00), pages 60-67, 2000. (Nominee for Best Paper.) 12. Mark Derthick, James Harrison, Andrew Moore, and Steven F. Roth. Efficient multi-object dynamic query histograms. In Proceedings of the IEEE Information Visualization Conference (InfoVis’99), pages 84-91, 1999. 13. Mark Derthick, John A. Kolojejchick, and Steven F. Roth. An interactive visualization environment for data exploration. In Proceedings of Knowledge Discovery in Databases (KDD’97), pages 2-9, 1997. 14. Mark Derthick, John A. Kolojejchick, and Steven F. Roth. An interactive visual query environment for exploring data. In Proceedings of the ACM Symposium on User Interface Software and Technology (UIST ‘97), pages 189-198, 1997. 15. Mark Derthick. A minimal encoding approach to feature discovery. In Proceedings of the Ninth National Conference on Artificial Intelligence, pages 565-571, 1991. Also published as MCC tech report ACT-CYC-212-91, June 1991. 16. Mark Derthick and Joe Tebelskis. ‘Ensemble’ Boltzmann units have collective computational properties like those of Hopfield and Tank neurons. In Proceedings of the IEEE Conference on Neural Information Processing Systems, pages 223-232. American Institute of Physics, 1988. 17. David S. Touretzky and Mark A. Derthick. Symbol processing in connectionist networks: Five properties and two architectures. In Proceedings of IEEE Spring COMPCON87. IEEE, February 1987. 18. Mark Derthick. A connectionist architecture for representing and reasoning about structured knowledge. In Proceedings of the Ninth Annual Conference of the Cognitive Science Society, pages 131-142.Lawrence Erlbaum, 1987. 19. Mark Derthick. Counterfactual reasoning with direct models. In Proceedings of the Sixth National Conference on Artificial Intelligence, pages 346-351, Seattle, Washington, July 1987. 20. Mark Derthick and David C. Plaut. Is distributed connectionism compatible with the physical symbol system hypothesis? In Proceedings of the 1986 Cognitive Science Conference, pages 639-644. Lawrence Erlbaum, 1986. |
|
|
Unrefereed
Conference Papers |
21. A. Hauptmann, R. Yan, Y. Qi, R. Jin, M. Christel, M. Derthick, M.-Y. Chen, R. Baron, W.-H. Lin, T.D. Ng. Video Classification and Retrieval with the Informedia Digital Video Library System. In Proceedings of The Eleventh Text Retrieval Conference (TREC 2002), Gaithersburg, MD, November 2002. 22. Mark Derthick. Introduction to mundane reasoning. In David Touretzky, Geoffrey Hinton, and Terrence Sejnowski, editors, Proceedings of the 1988 Connectionist Models Summer School, pages 291-300. Morgan Kaufmann, 1988. |
Technical
Reports |
23. Mark Derthick. The minimum description length principle applied to feature learning and analogical mapping. Technical Report ACT-AI-234-90, MCC, June 1990. 24. Mark Derthick. An epistemological level interface for CYC. Technical Report ACT-CYC-084-90, MCC, February 1990. 25. Mark Derthick. Variations on the Boltzmann Machine learning algorithm. Technical Report CMU-CS- 84-120, Carnegie-Mellon University, Pittsburgh PA, August 1984. |
|
|
Book
Reviews |
26. Mark Derthick. Book review of Connections and Symbols. Artificial Intelligence, 43(2):251-265, May 1990. |
Videos
|
27. Mark Derthick and Stephen F. Smith. An Interactive 3D Visualization for Requirements Analysis. Accompanies Paper 3. 28. Mark Derthick, Michael G. Christel , Alexander G. Hauptmann, and Howard D. Wactlar. Constant Density Displays Using Diversity Sampling. Accompanies Paper 7. 29. Mark Derthick, Kelleher, Howard Seltman, Jane Siegel, John Zimmerman. Exploratory Analysis and Visualization Software. Submitted with Proposal 7. 30. Mark Derthick. Interactive Visualization of Video Metadata.(Video Demo). In Proceedings of the Joint ACM/IEEE Conference on Digital Libraries, Roanoke, VA, 2001, page 453. ACM Press, 2001. 31. Mark Derthick and Steven F. Roth. Data exploration across temporal contexts. 2000. Accompanies Paper 11. 32. Mark Derthick, John A. Kolojejchick, and Steven F. Roth. An interactive visual query environment for exploring data. 1997. Accompanies Paper 14. |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Reviewer |
NSF,
IJCAI, FLAIRS, AAAI, IEEE Information Visualization program committee, CHI,
UIST, East-West International Conference on Human-Computer Interaction, ACM
Transactions on Graphics, IEEE Transactions on Visualization and Computer
Graphics, IEEE Internet Computing, Journal of Visual Languages, Graphics
Interface, IEEE Transactions on Information Systems |
|
|
Societies |
ACM,
AAAI, IEEE Computer Society, Tau Beta Pi, Sigma Xi, Eta Kappa Nu |
|
|
Consulting |
|
2000 |
Cycorp. Visualization of a large AI common-sense
knowledge base. |
|
|
Invited Presentations |
|
2000 |
Microsoft
Research. Interactive
Visualizations for Exploratory Data Analysis. |
|
Lotus
Research. Interactive
Visualizations for Exploratory Data Analysis. |
|
Andersen
Consulting (CSTaR). Interactive
Visualizations for Exploratory Data Analysis. |
|
USC
Information Sciences Institute. Interactive
Visualizations for Exploratory Data Analysis. |
1994 |
Salk
Institute. Learning hidden unit
representations with the Minimum Description Length principle. |
1992 |
AAAI
Spring Symposium. The minimum description
length principle applied to feature learning and analogical mapping. |
1991 |
Siemens
Corporate Research Center, Princeton, NJ.
The
minimum description length principle applied to feature learning and
analogical mapping. |
1989 |
Stanford
University. Interfaces for
Browsing and Editing the Cyc Knowledge Base. |
1988 |
IBM
T. J. Watson Research Center. Mundane Reasoning by
Parallel Constraint Satisfaction. AT&T
Bell Labs, Murray Hill, NJ. Mundane Reasoning by
Parallel Constraint Satisfaction. MCC. Mundane Reasoning by Parallel Constraint Satisfaction. SRI. Mundane Reasoning by Parallel Constraint Satisfaction. ICSI. Mundane Reasoning by Parallel Constraint
Satisfaction. High-Level
Connectionism Workshop, Las Cruces, NM.
Finding
a maximally plausible model of an inconsistent theory. |
1987 |
MIT.
Mundane
Reasoning by Parallel Constraint Satisfaction. |
1985 |
St.
Michaels Workshop on Connectionist Symbol Processing, St. Michaels, MD. Analysis of Learning in Boltzmann
Machines. |
July
2003 |
For hardcopies of papers,
send me email. http://www.cs.cmu.edu/~mad/cv03.htm
is a hyper-linked version of this document. |