04/29/2020 – Dylan Finch – IdeaHound: Improving Large-scale Collaborative Ideation with Crowd-powered Real-time Semantic Modeling

Word count: 550

Summary of the Reading

This paper aims to create a system that will improve large-scale collaboration by making collaboration easier. These researchers aim to accomplish this by giving idea creators access to a new and better form of semantic modeling that would allow them to more easily see all of the ideas that have been created so far on a project. They want this new system to be nearly-real time and self-sustainable (they want to limit external labor). 

The system has 3 main components: (1) it allows users to request to see the ideas of others from a global idea map, with some of the returned ideas will be very different from each other; (2) the user can request to see ideas similar to their own idea; and (3) the user can request to see a map of ideas, that shows how they relate to each other and how similar they are. 

An evaluation was conducted and found that on average most people thought that the system was helpful. 

Reflections and Connections

I find this idea to be very interesting because it seems like if it was done correctly, it would have many applications. I took an HCI class in undergrad which covered design (and ideation) and I always found ideation to be a hard process. The system proposed here could help with many parts of the ideation process. One of the hardest parts of ideation is keeping track of all the ideas. Sometimes, it can be overwhelming to try and organize all of the ideas that a group has come up with. When you get to a certain number of ideas, you just can’t cope with the sheer volume of information. It becomes extremely hard to organize the ideas in a coherent way and ideas you see one second can become lost in another. A system like this would make dealing with all these ideas much easier. Plus, with the weight of managing ideas lifted, the contributors would be free to come up with even more ideas.

I think that when it comes to ideaton, there really is no magic bullet solution. The fact is, ideation simply involves too much information. The human brain simply cannot deal with all of the information that comes with doing ideation. So, there is no perfect solution that will make ideation easy. This system does not eliminate all of the struggles of ideation. It is still hard to keep all the ideas you need in your head and it is still hard to deal with all the relevant information. But, this system does vastly improve on older, traditional methods, like sticky notes. This system makes it much easier to find ideas you are looking for and to manage all the ideas you’ve had. That is about as good as it gets when it comes to ideation.

Questions

  1. Could this system do anything else to help make ideation easier?
  2. What part of this system would you use most when you’re doing ideation?
  3. What makes it so hard to automate all or part of the ideation process? Can we ever achieve full automation of the whole process or even parts of it beyond what this system shows?

Read More

04/29/2020 – Dylan Finch – Accelerating Innovation Through Analogy Mining

Word count: 579

Summary of the Reading

This paper works to make it easier to find analogies in large, unstructured datasets that cover a variety of domains. The work has many real world applications, targeting real world examples of large, unstructured datasets, like data from the US patent office. The system works by collecting data about each entry in the dataset. This data includes the purpose and the mechanism that achieves the purpose. By collecting both of these features, it makes it easier to find analogies. 

This system was evaluated to see how well it helped with ideation by analogy. The evaluation used crowd workers to see how effective the new method is. Many workers were asked to come up with new product ideas. To help, these workers were shown example products. Some were shown products found to be similar using the system described in this paper. Others were shown products found to be similar by other means and others were shown random products. The evaluation showed that this system helped the workers to come up with better ideas than the other 2 methods.

Reflections and Connections

I think this problem is really one that spans generations. The paper brings up the troubles of trying to use a real world dataset like the one from the US patent office. In a case like this, I think that the data presents many different challenges, depending on when you look. For data from the past, there are probably inconsistencies in the formats of the data, the data may be in multiple different sources, and some of it may have been lost or changed over time. For data from the present, there is just so much of it. With so many more people and intellectual property more valuable than ever, the US patent office probably has more data about inventions than they can deal with. These represent two very different challenges that a dataset like the one from the US patent office face and they are a great reason why research like this is so sorely needed.

The idea of this paper also reminds me of an idea from last week’s papers: SOLVENT. Both papers try to make it easier for researchers to find analogies in data sources. In fact the existence of both of these papers I think helps to illustrate the need for technology like this. In fact, neither of these papers cite each other even though they are working on very similar research. Perhaps if there had been a widely available version of SOLVENT, they would have been able to find each other’s papers and build off of each other. 

I think that as these datasets get larger and larger, the need for easier ways to access things from them will become more and more important. The number of patents and papers is growing quicker than ever before and that means that it is easier than ever for valuable knowledge to be lost. We need to start implementing more ideas like this so that we don’t lose important knowledge. I hope that the existence of both of these papers helps to show others the real need for technology like this.

Questions

  1. Do you think this system is better or worse than SOLVENT?
  2. What is another real world, unstructured data source that a system like this might work well on?
  3. What are some applications for this system outside of ideation?

Read More

04/29/2020 – Nan LI – VisiBlends: A Flexible Workflow for Visual Blends

Summary

The author in this paper introduced an advanced graphic design technique which combines two objects or concept in a novel and meaningful way in conveying a message symbolically. To achieve this, the author presents a tool, VisiBlends, a flexible hybrid system that facilitates the generating of visual blends based on an iterative design process. The author first introduces and defines the problem of vidual blends and then decomposes the process of creating visual blends into sub-task. The baseline of this iterative design process is that let users brainstorm first regarding the concept and then find certain relevant types of images. Then the user annotates images for the convenience of the system automatically detects which images to blend. Finally, users evaluate each blend and decide whether or not iterate the process. To find out whether the system could support decentralized collaboration and co-located teams generate the visual blend, and whether this system would help novices create blends efficiently, the author conducted three user studies. The study results indicate that both decentralized groups and co-located groups can generate visual blends to express their messages efficiently. Further, the system VisiBlends indeed helps novices generate visual blends.

Reflection

I really like this paper, and I even want to try the system. Create something novel out of thin air is always hard. Therefore, people are continually looking for tools that can stimulate creativity and brainstorming, hoping that these tools can inspire us. The paper we discussed last week, which presents a tool to help find analogy from papers also trying to do the same thing. This really shows the essence of creative inspiration.

It really enjoins to see the study process, especially study 2, group collaboration on blends for messages. They discovered many constraints, but they also solved these constraints cleverly, such as focusing on the images of the other concept to increase the chance of finding a blend when the image of another concept is limited. I particularly like the example of women + CS. Workers were trying to avoid gender stereotypes, even though it’s tough to think about the creative way. Thus, the author concludes that it’s hard to meet all the constraints, and we have to decide where to compromise.

The human visual system inspires me of a way to create a database of visual patterns. Since human tends to recognize an object based on its 3D shape, silhouette, depth, color, and details, we could let a group of people identify a blurred shape, which contains certain features but does not clear enough to recognize the actual object. Then we can base on participants’ visual perspectives to perceive the user perspective of the metaphor of this shape.

For the third study, there is an interesting phenomenon pointed out by the author. Participants who saw VisiBlends first then removed VisiBlends have much worse performance than a participant who did not see VisiBlends at all. This reminds me of the participant in one of the previous papers said they are afraid to be spoiled by the automatic system leads to no active thinking. So I think this might be one of the cases.

Questions

  1. Do you think this system would help you generate a visual blend? Will you use the system to help with your design?
  2. It is mentioned in the paper that sometimes we need to compromise to achieve our goals, what do you think about this perspective? Can you think about the examples of this situation?
  3. What is the essential tool for you when you are trying to do the brainstorm?
  4. For the iterative design process described in the paper, which part is the most significant for you? Which part you think could be replaced by an automatic machine.

Read More

04/29/2020 – Myles Frantz – Visiblends: A flexible workflow for visual blends

Summary

Advanced graphics designs such as visual blending are a very difficult art to master. Let alone for a singular person to understand and able to create them. Utilizing different designs through various companies to ensure the widest market share and the greatest public opinion requires time, effort, and typically a team of experts to ensure quality. Aiming to ensure a lower barrier of entry for younger or smaller companies, this team created VisiBlends, a crowd sourced framework aiming to ease the burden on teams. This framework breaks down the task of visual blends into 6 different stages to ensure there is validation throughout the whole framework. Throughout the various studies the team had created to measure the results, there was a majority success stapling this usage as a potential tool to generate visual blend assistance through the whole process. 

Reflection

I think this problem domain is similar to the research problem domain in various aspects. Researchers look at the bleeding edge of technology and aim to solve new or old problems using new techniques. These techniques are usually accumulated through various studies, various other methods, or even a new method that uses existing technology to extend previous work. It is interesting to see how the process for creating a new research idea can also be applied throughout the 6 different steps within this teams framework, however with the lab mates, advisor, and committee filling in for the users being crowd sourced within the original framework. 

I do appreciate the machine learning algorithm learning to match the two different pictures (or ideas). This provides several starting places for uses while continuously learning the better matching locations. I believe this could be improved by allowing a human in the loop factor or an override however. Within this kind of work the art is continuously being improved upon in a real time scale. To keep up design artists may have a better idea of new ideas to be used and could be used as an early wave, better feeding the algorithm with new results. 

Questions

  • Visual blend problems are potentially a factor when marketing for a new item or product. Blending two (and potentially very different) objects helps to grab the attention of passersby who only give it a split second while either scrolling past the story or going past the ad. Have you had any experience in creating a visual blend for one of your products and did you use this technique to reach a specific demographic? 
  • One of the perceived problems throughout Visual Blends is the specific pairing of elements that can be related to each other. Notably within this study this was overcome by obtaining the consensus (throughout the Mechanical Turk Workers) the initial idea that came to their head with the idea. Do you think this could be alleviated by using another Machine Learning algorithm pairing with the stream of information from a social media website (accounting for the social consensus of a group)?  
  • This technique could potentially remove part of the need for specific marketing teams who specialize in visual blending. Do you think this program could become widespread or integrated with another bigger system already in use by another company? 

Read More

04/29/2020 – Myles Frantz – IdeaHound: Improving large-scale collaborative ideation with crowd-powered real-time semantic modeling

Summary

Creating solutions for everyday problems requires out of the box solutions. For common problems that people have already worked through this is a problem since many common solutions or work arounds have already been created. Crowd sourcing these solutions are bound to run into redundancy issues if there is no arbiter to ensure there is forward momentum. To alleviate the need for such an arbiter while ensuring there are fresh ideas generated throughout the system, the team created a public sharing platform for sharing ideas. Utilizing the crowd platform, notepads (symbolizing new ideas or small excerpts) can be retrieved from others on the same platform through a similarity ranking. 

Reflection

 I can greatly appreciate the effort the team has gone to solve this problem. From working within a team of technologically advanced peers, I have been able to collaborate and learn greatly from each of the members. Unfortunately for many teams (teams cross-located across the globe or teams working remotely similar to the Corona Virus working from home order) the kind of general idea generation is greatly limited to the technology being used. Whiteboard technology like Draw.io and Microsoft Whiteboard is great but takes extra time to be adapted and written out. A design like this hosted by a company’s platform could provide greater collaboration across teams for potential side projects or to create a better future direction for the company. 

I do appreciate how simplistic and relatively familiar the design that the team made of the platform. Utilizing a similar design of sticky notes from Windows (at least version 7) is a great way to lower the “ramp up” or time of education the user has to take to learn the system. Despite this the team reported the workers needed to use a lot of mental effort to accomplish their tasks that were given to them. It is interesting to see not only with a common design there is always the “writers block” or the mental stopping point that is common throughout many people.

Questions

  • Collaboration is a great way for ideas to be shared and improved. This is only increased when the participants have different experiences and backgrounds, helping to provide potentially new perspective onto the idea. What was your best idea collaboration you have had so far (can be from a professional environment or research environment)? 
  • Unfortunately as described earlier there are a multitude of problems and issues that may arise when teams of people work or interact remotely. These problems are only further exemplified through the recent stay at home orders across the globe due to the Corona Virus typically only permitting essential workers. What types of issues (technological or not) have you experienced working with someone across technology? 
  • Given how wide spread and with a potential spread of the Corona Virus this is predicted to last until at least August of 2020. With this it is likely people will still be hesitant to go directly back to work and even more so openly work together. Would you use this technology to help brainstorm with others in your field (or even in general)? 

Read More

04/22/2020 – Sukrit Venkatagiri – SOLVENT: A Mixed Initiative System for Finding Analogies between Research Papers

Paper: Joel Chan, Joseph Chee Chang, Tom Hope, Dafna Shahaf, and Aniket Kittur. 2018. SOLVENT: A Mixed Initiative System for Finding Analogies Between Research Papers. Proc. ACM Hum.-Comput. Interact. 2, CSCW: 31:1–31:21

Summary: In this paper, the authors attempt to help researchers by generating (mining)  analogies in other domains to help support interdisciplinary research. The paper proposes a unique annotation schema to extend prior work by Hope et al. and has four key facets: background, purpose, mechanism, and findings. The paper also has 3 interesting studies. First, it was collecting annotations from domain experts in research fields, and second, using the Solvent system to generate analogies with real-world usefulness. Finally, the authors scaled up Solvent through the use of crowdsourcing workflows. In each of the three studies, they used semantic vector representations for the annotations. The first study had a dataset focused on papers from CSCW and annotated by a member of the research team, while the second study involved working with an interdisciplinary team in bioengineering and mechanical engineering to determine whether Solvent could help identify analogies not easily found with citation tree search. Finally, in the third study, the authors leveraged crowd workers from UpWork and Amazon Mechanical Turk to generate annotations, and the authors found that workers had difficulties with the purpose and mechanism type annotations. On the whole, the Solvent system was found to help researchers and generate analogies effectively. 

Reflection: Overall, I think this paper is well-motivated, and the 3 studies that form the basis for the results are impressive. It was also interesting that there was significant agreement between crowd workers and researchers in terms of annotation percentage. This proves a useful finding more broadly in that novices may be able to contribute to science not necessarily by doing science (especially as science gets harder to do by “normal” people, and is done in larger and larger teams), but by finding analogies between different disciplines’ literatures.

For their second study, the authors trained a word2vec model on a curated dataset of over 3000 papers from 3 domains. This was also good because they did not limit their work to just one domain and strived to generalize their work/findings. However, they are still largely engineering disciplines, albeit CSCW has a somewhat social science component to it. I wonder how it would work in other disciplines such as between the pure sciences? That might be an interesting follow up study. 

I wonder how such a system might be deployed more broadly, as compared to a limited way that was done in this paper. I also wonder how long it would have taken crowd workers to go through the tasks and generate the findings in total.

Questions:

  1. What other domains do you think Solvent would be useful in? Would it easily generalize?
  2. Is majority vote an appropriate mechanism? What else could be used?
  3. What are the challenges to creating analogies?

Read More

04/22/2020 – Sukrit Venkatagiri – Opportunities for Automating Email Processing: A Need-Finding Study

Paper: Soya Park, Amy X. Zhang, Luke S. Murray, and David R. Karger. 2019. Opportunities for Automating Email Processing: A Need-Finding Study. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI ’19), 1–12

Summary: This paper is a need-finding study exploring the opportunities and challenges for automating email processing. The authors conducted a mixed-methods study to pinpoint users’ expectations and needs in terms of automating email handling, in addition to the informational and computational support required for it. This study was divided into three main parts: what types of automated emails users want, what types of information and computation is needed, and then a field deployment of a simple inbox scripting tool. They did so in two steps. First, they had a formative design workshop where 13 computer science students created email processing rules. Second, they had a survey where 77 people (as well as 35 people without a technical background) answered questions to better understand categories of email automation, and their needs. The results show that there is a need to strengthen richer data on email, better management features, use of internal and external context,, and affordances. Finally, the paper describes a platform for writing small scripts for users’ inboxes, called YouPS. They enlisted 12 email users and found that users wanted more automation in their email management, especially in terms of richer data models, and processing content automatically.

Reflection: I agree with the premise of the paper: the fact that we should and can help people better manage their email inboxes to reduce the amount of energy people spend making sense of it. I wonder why email itself has got so overwhelming in the first place, and how it has affected workplace productivity. 

I especially like the multi-pronged approach that the authors took in this paper, with a formative study, a survey, and building a system. I believe this multi-state approach is valuable and can provide multiple insights as well as opportunities for triangulating data. 

With respect to their findings, I think the need for richer data models and rules, as well as ways to leverage internal and external email contexts are very important. If we are able to understand, for example, the senders’ urgency level and the receivers’ commitments to that sender, then we could draft a rule prioritizing or deprioritizing said emails. I also think the use of email templates and autofill options are useful and Google does something but in a more intelligent way with Gmail’s autofill feature. 

However, I wonder how many users actually make use of intelligent filters, and/or would make use of any new tools that are introduced in the feature. It may be the case that only knowledge workers are bombarded with emails that require responses, while most other users simply receive spam (which, I think, is about 90-95% of emails that are sent in the entire world). It would also be interesting to see how this differs between people’s work and home emails. I myself maintain an email address for communications that I know will be spammy, such as insurance applications.

Questions:

  1. How do you manage your email? Do you use filters?
  2. Do you manage your different inboxes differently? How?
  3. What do you think of YouPS? Would you use it? Why or why not?

Read More

04/22/2020 – Yuhang Liu – Opportunities for Automating Email Processing: A Need-Finding Study

Summary:

This article discusses the e-mail management system. It is well known that e-mail has very important position in our lives, so the authors have developed a platform to help manage e-mail in order to make e-mail severs better. The authors have implemented methods that need study to learn: (i) what kind of automatic e-mails do users want, and (ii)what kinds of information and computation are needed to support that automation. The authors conducted three surveys: designing a workshop to identify categories of needs, conducting surveys to better understand these categories, and classifying existing email automation software to determine the needs that have been resolved. The authors ’results highlight the need to strengthen the following aspects in order to better automate the management of mail: richer data, more management attention, use of internal and external email contexts, complex processing (such as response summarization) and affordance senders. Finally, the authors developed a platform for writing small scripts on user inboxes. In their research, we found that most of the popular mail services are not enough to support automated management, which also supports us to develop new mail services.

Reflection:

First of all, from my personal experience, I agree with that we need a system that can help people manage email to reduce the energy people spend in this regard. Usually during my studies or work. If there is a new e-mail, I usually go to deal with the mail first, which is seriously affected the work efficiency. So a system to help manage mail is very necessary. And I also agree that the authors use three probes to study the needs of the mail management system. Among them, I think there are several requirements that are really urgent, such as richer data model for rules and the leveraging of internal and external email contexts. The richer data model helps to study the content and format of the email. For example, we have another article this week. Marking the structure of the article through people can help the machine to understand the article. Similarly, more email formats and templates Can help the machine understand the mail. And studying the internal and external emails can also better understand the content of the email, and at the same time understand the relationship between the sender and the user. These can be greatly improved. But I have doubts about the system mentioned in the article and the future development direction. Because researching the content of emails and users, especially such in-depth research, I think that it will intrude the privacy of users, and at the same time, I think that the accuracy of management cannot be guaranteed, so the wider application may bring more problems. As an example, I often find that some of the mails that I think are important are considered as spam and enter the spam box and make me miss many things. But I think the author proposes to make people customize in the text is a good solution, however for those who are not familiar with computer applications, whether such customization is really beneficial to their use is also a question and worth thinking about.

Question:

  1. What aspects of the current popular mail service system needs to be improved?
  2. If we want to build an automating e-mail processing system, do you think we need a brand new framework, or change slowly based on the existing service system?
  3. Will automating processing of e-mail cause other problems?

Read More

04/22/2020 – Yuhang Liu – SOLVENT: A Mixed Initiative System for Finding Analogies between Research Papers

Summary:

This paper mainly talks about a new paper search system, and I think that it can be thought as a thought initiation system rather than a paper search system. The article first proposes that scientific discovery is usually promoted by finding analogies in distant fields, but as more and more papers are published now, it is difficult to find papers with relevant ideas in a field, even though in those cross-field. Therefore, in order to achieve this aim, the authors introduced a hybrid system. In this system, crowdsourcing workers are mainly responsible for reading and understanding an article. People need to analyze an article from four aspects, Background, Purpose, Mechanism, Findings. The computer then analyzes the article based on these semantic frameworks, such as through TFIDF, or a combination of different architectures, and then finds similar usages from research papers. And through verification, found that these annotations are more effective, and can help experts.

Reflection:

First of all, I agree with the goal of the thesis, helping more researchers to obtain new ideas by analogy from outside the field, and then use these ideas and innovative ideas to promote the development of science and technology. I also think that analogies can help technology anyway. The article also cited quite a few examples to show the effectiveness of analogy and the breakthrough of research after the analogy. And in my reading in the past few weeks, I often feel that the article uses analogy. Since then, I have been thinking about how to help people more effectively through analogy and learn from other subject areas. Secondly, I think it is a very effective method to introduce people to assist in the completion. This is also the biggest gain in the course of my word. When a problem is encountered, the consideration is to use human power to solve it. And in the article, let the crowd source workers to annotate the article from four directions, I think it can greatly decompose an article, so that the computer can better understand this article. It is still difficult for a computer to directly understand an article and find out from it, but based on these architectures, finding the connection between articles is indeed a relatively simple task. But at the same time, I have some doubts about the effectiveness of the system proposed in the article. The article also spent a considerable amount of space to describe the limitations of this system. And my doubts are mainly focused on the third point, the usefulness in the real world. I think there are many aspects that will affect this practicality. For example, when the data increases, there will be more similar analogies, and the quality of these analogies is difficult to control. As we know, not all the lessons are useful, some ideas may bring Other problems, such as reduced efficiency, wasted resources, etc. The final point is that I think it takes a lot to get a good idea. We also need to control the quality of the work done by the workers and whether the algorithm can be so enlightening. In my opinion, a good innovation is usually electro-optical flint. Although this may be relatively easy to achieve on the basis of analogy, it still needs a good collaboration between human and machine to complete.

Question:

  1. Do you think finding analogy by analyze papers based on their framework is a useful way?
  2. Is there any other factors might influence this system, such as the increase of similar articles or different understanding between workers?
  3. Except the methods mentioned in the article, as computer scientists, what can we do to inspire people?

Read More

04/22/20 – Fanglan Chen – SOLVENT: A Mixed Initiative System for Finding Analogies Between Research Papers

Summary

Chan et al.’s paper “SOLVENT: A Mixed Initiative System for Finding Analogies Between Research Papers” explores the feasibility to leverage a mixed-initiative system to categorize research papers into their relational schemas by a collaborative Human-AI team, which can be utilized to identify analogical research papers potentially leading to innovative knowledge discoveries. The motivation of the researchers is the boom of research papers during recent decades, which makes searching for relevant papers in one domain or cross domains become more and more difficult. To facilitate the paper retrieval and interdisciplinary analogies search, the researchers develop a mixed-initiative system called SOLVENT in which humans mark the key aspects of research papers (their background, purpose, mechanism, and findings) with a computational machine learning model extracting semantic representations from these key aspects, which can facilitate the identifying analogies across different research domains.

Reflection

I think this paper conducted an innovative study on how the proposed system can actually support knowledge sharing and discovery in one domain and across different research communities. In the research explosion era, researchers would greatly benefit from using such a system for their own research and explore more interdisciplinary possibilities. That makes me think about why the system can achieve good performance via annotating the content of abstracts in the domains they conducted experiments. As we know, abstracts of the papers usually summarize the most important point of the research papers at a high-level. So it is intuitive and wise to utilize that part for annotating and further tasks. The researchers adopt the pre-trained word embedding models to generate semantic vector representations for each component, which performs pretty well in the tasks presented in the paper. I would imagine that the framework would probably work especially well for experimentation-driven domains, computer science, civil engineering, biology, etc., in which the research papers follow a specific writing structure. Can the proposed framework scale up to other less structured text materials, such as essays, novels, by extending it to full content instead of focusing on the abstracts? I think that would be an interesting future direction to explore.

In addition, one potential future work discussed in the paper is to extend the content-based approach with graph-based approaches like citation networks. I feel this is a novel idea and there is a lot of potential in this direction. Since the proposed system has the ability to find analogies across various research areas, I would be curious to see if it is possible to generate a knowledge graph based on the analogy pairs that can create something similar to a research road map, which indicates how the ideas from different papers in various research areas relate in a larger scope. I would imagine researchers would benefit from a systematized collection of research ideas. 

Discussion

I think the following questions are worthy of further discussion.

  • Would you use this system to support your own research? Why or why not?
  • Do you think that the annotation categories capture the majority of the research papers? Can you think about other categories the paper did not mention? 
  • What do you think of the researchers’ approach to annotating the abstracts? Would it be helpful to expand on this work to annotate the full content of the papers?
  • Do you think the domains involved in cross-domain research share the same purpose and mechanism? Can you think about some possible examples?

Read More