1. Introduction
Digital personal archives log life materials chronologically, necessitating the identification of novel user experiences in re-visiting them and generating new ideas [
1,
2,
3]. Interaction designs supporting reading one’s own life-logs seek to regenerate prior experiences by encouraging meaningful interaction with photographs [
1], videos [
2], and geo-locations [
4]. However, personal textual content such as blogs has long been under-studied in re-visitation [
3] and generation of new perspectives about self [
5]. The textual content takes time to read, especially when it grows massively large but minimally organized [
6,
7]. When bloggers need to re-visit and share stories from blogs, for example during themed conferences or meetings, the massive volume and textual nature of blogs makes it hard to understand core themes and capture an overview and connections.
People spend time crafting blogs, recording moments in life, and capturing essences of thoughts [
8]. In 2017, blog authors on the WordPress crafted over 80 million new blog posts each month (a live look at activity across WordPress.com (September 2018):
https://wordpress.com/activity/). Around 80% of the regular bloggers posted at least once per month, and over 90% of blogs had more than 500 words (Blogging Statistics and Trends: The 2017 Survey of 1000 Bloggers (September 2018);
https://www.orbitmedia.com/blog/blogging-statistics/). Bloggers write about experiences over time, generating a temporally-ordered flow of experience [
8,
9]. Many blog authors have a desire to reflect, but generally do not wish to re-read the entire collection [
6]. Given the massive volume of content in many blogs, retrieving and organizing information about experiences cost effort of memory and time [
10], while combination and reorganization of materials generates new ideas [
11]. To support novel blog interaction, natural language processing (NLP) and visualization (VIS) provide opportunities to reorganize large blog content. Large touch displays can present the representations of personal contents dynamically to support multimodal interaction with blogs.
NLP and text visualization (VIS) offer new design materials to build multimodal experiences with large personal texts. This research reports our experiences inviting three long-term bloggers to re-visit their large personal blogs with
BlogCloud, a system which implements blog
reorganization—breaking the temporal order of blogs and recombining experience segments by natural language processing (NLP) techniques. Reorganization depicts a new design opportunity to utilize text processing techniques to support multimodal interaction with personal blogs, in contrast to their original forms of long textual information on web pages. To obtain an early understanding of user’s reaction to the new form of personal content, we designed and implemented a blog re-visitation system,
BlogCloud. We report our experiences taking a walk-through of the interaction in use context and report users’ open-ended experience stories, which helped us collect meaningful feedback and identify new design opportunities with the reorganization [
12,
13]. We consider the blog re-visitation during themed public events such as workshops and exhibitions, to probe how the new re-visitation modality created by NLP and VIS could encourage re-discovery of one’s own themed experiences. Concluded from our experience with BlougCloud, we identify that the
reorganized topics—sets of related or similar content from personal blogs—offer new materials for reflection and new paths for re-visitation, enabling multimodal interaction with symbolic memories. This paper is organized as follow: we first review prior work on re-visitation and reflection on large personal content, with identification of key factors and theories in life-logging and re-visitation. Then, we introduce the opportunities to utilize natural language processing techniques to reorganize large personal blogs. Next, we introduce BlogCloud, a multimodal interactive system which operationalizes blog reorganization with natural language processing and visualization technologies. There after, we focus on our experiences with three long-term blog authors using BlogCloud during a themed public event, to describe new blog re-visitation manners with the reorganized content. Finally, our discussion and conclusion seek to suggest new design opportunities to enable interaction with the reorganized blog topics.
2. Related Work
Weblogs, or
blogs, are an open online medium that allow people to journal their own experiences. Studies have investigated people’s reasons for blogging, recognizing a wide spectrum of motivations (e.g., [
6,
8]). People blog to document life events, provide opinions, express emotions, articulate ideas, and form and maintain community forums [
8]. Other studies also suggested self-presentation [
14,
15] and experience-sharing [
16,
17] could encourage blogging. Bloggers record their personal experiences by documenting remarkable items and activities, along with correlated opinions and emotions [
8]. The life documentation and expression aspect of personal blogs makes them rich and valuable materials to reflect on the past and generate new ideas [
1]. However, prior studies investigating the “reading” side of blogs noticed that many blog explorers tend to quickly skim through most recent posts [
18,
19]. Some readers appear to be casual and passive; they rarely reflect on knowledge gleaned, nor do they care about the freshness of the content [
6]. These somewhat careless behaviors are reflected by posts appearing in reverse chronological order [
8,
20] that are updated over long periods of time [
6] and are comprised of large volume of text and photos [
21]. Following blogs is time-consuming, and it is hard to combine and reorganize related items in experiences [
6,
7,
11]. During public events such as conferences, meetings, workshops, and exhibits, when people re-live and share themed experiences, the overwhelming texts make blogs a less effective material to combine and reorganize to deliver novel re-visitation experiences [
11,
22,
23,
24].
We focus on understanding whether the NLP-reorganized personal blogs could bring in novel experiences re-visiting the broken-and-recombined experiential items [
11,
25]. This paper provides a seminal understanding of how bloggers interact with a multimodal system augmented by NLP and VIS. Interactive systems encouraging re-visitation and reflection have been widely studied over the past decade [
3,
26]. One thread of studies explored people’s daily practices in preserving reflective items, such as personal archives [
27], mementos [
28], and digital legacies [
29]. Technologies for gathering and managing digital personal items, referred to as life-logging systems, are recommended for reminiscence and reflection [
2,
9,
30,
31]. With increasingly more personal content collected by technologies [
9], another realm of research sought to understand people’s interaction with digital past [
1,
32,
33,
34,
35]. Fleck et al. identified re-visitation as a fundamental reflection activity that depends on the higher levels of reflection that require an explaining, connecting, and transforming experience [
3].
Systems for experiencing re-visitation need to deal with the challenges of ever-growing, minimally-organized content [
9,
26,
31]. Many prior studies describe re-visitation and reflection as common but complex psychological activities, in which people deliberate in-and-on actions and associate thoughts for new perspectives [
25,
26,
36]. Ernst von Glasersfeld defined reflection on past experiences as an activity “to step out of the stream of direct experience, to re-present a chunk of it, and to look at it as though it were direct experience” [
25]. People “segment and recombine the memorable parts at will” [
25], to re-present as if a direct experience, and compare it with other experiences that are similar. Through reflective activities with personal content, people re-understand experiences and uncover new life patterns [
26]. Other researchers have examined the roles that text, pictures, relationships, and randomness can have on the re-visitation experience [
1,
7,
37]. Sellen et al. summarized this area of work with their “5Rs” (
recollecting,
reminiscing,
retrieving,
reflecting, and
remembering) as potential design benefits of life-logging systems [
30]. They put forth the research challenge to explore connections within and between these areas, which inspired our direction of study. Areas of focus in this research lie in identifying ways to trigger reminiscence, and to identify the connections and relationships between blogged items [
3,
32,
34].
A promising way to address these issues with blog re-visitation is through natural language processing and visualization [
4,
35,
38]. Studies suggest that aggregated personal content creates value in blog reading experiences [
5,
11,
34,
38]. Recent studies showed that visualization could raise re-living of personal events [
4,
31,
39]. Technologies in natural language processing (NLP) and interactive information visualization make it possible to categorize and present similar textual items in an aggregated manner, and allow users to interactively suggest interests [
40,
41,
42]. Using text-mining and content indexing could facilitate blog navigation, as prior studies found that blog readers would search information about specific interests, and prefer an overview to a wide variety of content [
43,
44,
45]. Reordered and categorized content from one’s own blogs may create unique multimodal interaction experiences, as studies interestingly found blog categorization widgets were voluntarily used by participants to navigate their own blogs [
43,
46]. Considering the textual format and large volume of blogs could hinder generating new ideas, this study examined using NLP and VIS to reorganize and present the blogs and design opportunities to augment multimodal life-log re-visitation.
3. Blog Reorganization for Re-Visitation
This study explored the
reorganization of large personal blogs with text processing and visualization technologies, with the goal of understanding their roles in creating new modalities and interaction experiences from re-visitation. Natural language processing (NLP) and visualization (VIS) are promising approaches to break their original order and reorganize experiential items [
3]. However, little is known about the opportunities of re-visiting one’s own NLP-processed and VIS-presented long blogs, to engender new ideas about past experiences and open multimodal pathways for re-visitation. Our conceptual approach of
reorganization with NLP reduces the effort to identify and retrieve related experiences. Exploring the computer reorganized contents and topics offers a high level view of the experiences. NLP and VIS support re-visitation by reducing the effort of organizing and exploring the large blog volume. The derived topics may provide new perspectives from the massive data, which potentially increases the expressiveness of blogs and the enjoyment of re-visitation.
We examined two blog reorganization approaches to support re-visitation: NLP techniques break the original order of blogs; users can search keywords among blogs and re-visit blog sections that are similar to a particular experience (
Figure 1). NLP techniques are applied on personal blogs to raise awareness of similar experiences, which match a symbolic word or relate to a particular section. NLP makes it technologically possible to search large blog texts and recognize relevant content by computing document similarity. Common NLP technologies provide methods to filter less meaningful words (i.e., stop words), identify keywords, vectorize textual data, and compute similarity. The processed blogs can be indexed to enable searching. NLP techniques can also automatically identify and group the blog content that are similar to a re-visited blog section. Contents that are identified from searching or grouped by NLP-calculated similarity constitute a new content group, which breaks the original order and reorganized into a new topic. A visualized topic contains similar or related experiences, which could illustrate the context of a past experience and offer a new interaction modal [
31]. Interactive text visualizations can present inferential information of the topics, such as the number of similar segments and keywords of the topics, to inspire new perspectives and connect related experiences [
7,
11].
4. BlogCloud: Blog Re-Visitation Tool
To explore the
reorganization ability of natural language processing (NLP) and visualization (VIS) in supporting blog re-visitation, we designed and implemented
BlogCloud (
Figure 2). BlogCloud is an interactive blog exploration tool implemented on large displays, which allows the bloggers to search, read, and organize blog segments (the system is adapted from a prior system [
47,
48]). During public events such as a themed workshop, bloggers can walk up and use BlogCloud to search related keywords among their blog archives (
Figure 2). After searching, retrieved blogs will be presented on a tabletop, while a word cloud visualization on the vertical display shows topic words, blog numbers, and sentiment information to support reflection on topics from the blogs. Users can iterate searching keywords and reflecting on the visualization to re-explore the blog collection.
4.1. BlogCloud on Large Interactive Displays
We considered re-visiting one’s own personal blogs during public events, when bloggers interact with a walk-up-and-use tool [
49] to reflect on the NLP- and VIS- augmented blogs. To explore the multimodal design for this context, BlogCloud is implemented with large interactive displays: a multi-touch tabletop and a vertical display (
Figure 2). We chose large interactive displays as the platform since they feature content manipulation with natural touch interaction [
50]. Large displays also benefit reading activities by expanding visual and interaction space [
51]. As blogs may contain multiple entries of the same topic, presenting related content in one interaction space affords comparison and reflection. Large displays allow users to allocate spaces to facilitate organization and comparing details [
51]. With multi-display design, bloggers can shift between visiting particular blog entries and viewing high-level topics, with possible better perception to the NLP results.
4.2. System Implementation
Although there are other ways to reorganize large blog collections, we chose to use keyword searching and similarity analysis as two common methods to explore the design benefits of reorganization. BlogCloud reorganizes blogs (
Figure 1) and supports re-visitation by:
Using natural language processing techniques to chunk the temporal blog stream and identify potentially important keywords.
Allowing bloggers to denote symbolic memories and retrieve blogs by searching keywords.
Supporting generation of related topics connected to the re-visited blog sections, by visualizing keywords and presenting inferential information of topics.
BlogCloud incorporates natural language processing (NLP) techniques to break the temporal stream of blogs, which allows searching and similarity-based grouping. To instantiate the concept, we chose three generic but common NLP techniques in text processing: keyword extraction, term-frequency based vectors and cosine-similarity clustering. Although other high-level NLP and machine learning techniques could be used, extracting key information and grouping documents reflect core functions of many NLP and machine learning based approaches.
A paragraph is used as the basic reflection section (see
Figure 3) since paragraphs usually focus on a few interrelated experiential items that are lexically self-contained and map well for natural language processing. Text processing is running on a paragraph based collection. BlogCloud segments the words by spaces and punctuation. Stop-words (e.g., “a”, “an” and “the”) are eliminated from further processing. For each paragraph, nouns, verbs, adjectives and adverbs are extracted as keywords with coreNLP [
52]. The weight of each keyword in a paragraph is calculated using term frequency-inverse document frequency (TF-IDF) [
53], through which the importance of words are estimated. Segmentation and term weighting techniques enable the system to process words and prepare for blog reorganization. BlogCloud stores the weight of each keyword for each paragraph.
The text processing segments the blog stream, so that the users can search memories and read matched blog paragraphs (
Figure 4). Grouping similar paragraphs makes a topic that is related to the re-visited blog section. The system first converts the entire blog set into a document-term matrix
M.
is the TF-IDF weight [
53] of
in paragraph
. When one blog paragraph (
) is viewed on a card, BlogCloud creates an array of terms
, which only contains the keywords that appear in
. Then, the system creates a sub-matrix
from
M by selecting columns whose corresponding terms appear in
. The similarity comparison method used by BlogCloud is based on paragraph vectors and cosine similarity. For the re-visited paragraph (
) and a paragraph (
) that is being compared, the system decides similarity through two generic steps: (1) For VP, create a vector
where
to
are the weights of the
u unique keywords in
. The row
c in
is
’s vector
. Since
is extracted from
,
and
have the same size and each element represents the TF-IDF weights of the same word. (2) Calculate the cosine similarity [
41] between
and
; if the similarity score is above 0.5, the system determines CP is a paragraph that is similar to VP. All similar
s form a
paragraph group. For all paragraphs in this group, the system chooses the top
m words with highest TF-IDF weights to generate the word cloud for
.
m varies between 10 and 20, with larger paragraph groups containing more words. To indicate the relatedness between a pair of paragraph groups, BlogCloud tracks the number of paragraphs that co-appear in both paragraph groups. The NLP methods and parameters are decided from multiple trials on a test blog set.
The BlogCloud interface supports breaking the original time order of blogs through keyword searching and visualizing similar content, to present the
reorganization of blogs. Users can search words with BlogCloud to retrieve a list of matched paragraphs. To support re-visiting particular experiences, digital cards presenting the paragraphs can be created and manipulated on the multi-touch tabletop (
Figure 3). The cards can be moved, rotated and zoomed with multi-finger gestures. Zooming a card shows the actual blog paragraph content. Bloggers can search keywords in their memory to retrieve and read particular blog paragraphs. The search word is automatically highlighted on the paragraph card. When a paragraph card is placed on the tabletop, it triggers the NLP technique to identify other similar paragraphs to create a related topic (paragraph groups). A word cloud is visualized for each paragraph group created by the paragraph cards on the tabletop (see
Figure 5). A word cloud contains four components about a topic: highlighted words, feature words, number of similar documents, and average sentiment values (The word cloud visualization in
Figure 5). The highlighted words are the words being searched to retrieve the topic and presented with yellow borders. The size of the feature words in one word cloud is proportional to the weight of words in the paragraph group. A count of similar paragraphs in each paragraph group is presented in a circle. The connection between topics is represented by the number of paragraphs that co-appear between two paragraph groups. A strip with the number of associated blogs connects each cloud pair. The width of the strip is proportional to the number of blog paragraphs that exist in both paragraph groups. The summary of paragraph groups seeks to capture a high-level concept about different topics. The layout of word cloud and positioning of different topics are implemented by a force-directed graph. Sentiment scores, which reflect the feeling or tone of text and are calculated by Stanford NLP, range between 0 and 1, where 0 is entirely negative (red) and 1 is positive (green).
The tabletop interface contains a menu bar to use the searching function.
Search word allows users to type a word or phrase with a virtual keyboard, displaying matched paragraphs in the scrollable paragraph card list (
Figure 5, bottom). It allows bloggers to actively query symbolic words in their memory. With the menu bars on the sides of the tabletop interface, users can search for words, view results, and update the visualization (
Figure 5, bottom). Paragraph cards in the card list can be dragged onto the table to read the actual blog content.
Show cluster shows paragraph cards in each paragraph group.
Show all doc loads all paragraph cards in the card list in chronological order. This button is added in case the users do not have anything to search for or want look at blogs in the original order.
Refresh cloud button reloads the visualization.
5. Re-Visiting Personal Blogs
To evaluate how bloggers interact with BlogCloud and re-visit the NLP-reorganized content, we adopted the framework of “5Rs” identified by Sellen et al. [
30]:
recollecting,
reminiscing,
retrieving,
reflecting, and
remembering. We observed blog authors’ interaction with BlogCloud from each of the five re-visitation activities. Recollecting and reminiscing is when blog authors think back about experiences and re-live particular life events. Retrieving personal experiences depends on details in memory to find desired experiences for re-visitation. Reflecting on experiences requires examining patterns of past experiences and derive abstract representations. Remembering is a form of prospective memory, which is a memory for actions to be performed sometime in the future. In this case, remembering serves as cues to remember forgotten events. Re-visitation of large personal blogs needs to solve the high effort of organizing the content, the low expressiveness of unorganized text, and the enjoyment of reading large volume of data. We explored whether the NLP-reorganization could lead to new perspectives when exploring one’s own data, and support multimodal interaction experiences.
Grounded on the five re-visitation activities of life-logging systems, we sought to understand whether and how the NLP-based blog reorganization implemented by BlogCloud. BlogCloud is designed for bloggers to re-visit the computer reorganized life experiences with multimodal interaction. The system allows bloggers to search and read blogs, re-live specific life experiences and reminisce about emotional and sentimental traits. The searching function enables retrieving symbolic memories by exploring different words among blogs. While reading the actual blogs, the visualization depicts the topics of the re-visited sections to support reflection. When multiple topics from the blogs are read together, the inferential information of topic sizes and connections between different topics afford reflection from a new perspective. When exploring a topic, bloggers could possibly encounter content that fades in memory and trigger remembering the experience. To examine the opportunities of natural language processing in supporting multimodal interaction with one’s own large personal blogs, we report our experiences with long-distance hikers re-visiting their hiking blogs with BlogCloud.
6. Examining BlogCloud with “5 Rs”
In planning an examination of the NLP-based reorganization and how BlogCloud supports reading one’s own large personal blogs, we recognized great value in gaining perspectives both from frequent bloggers and from interface and interaction experts. We identified three blog authors who also had relevant research skills in areas including human–computer interaction, user experience, ethnography, tech design, industrial systems engineering, and management systems to re-visit their blogs with BlogCloud (
Figure 6). All of these blog authors have crafted a blog that describes a long-distance (over 1000 miles) hike that they undertook. Long-distance hiking was chosen as the blog theme because of its life significance and need for re-visitation: dedicated hikers prepare for months or even years for a long-distance endeavor, and they often blog, journal, or otherwise write about the hike’s significance in their lives (e.g., Trail Journals
http://www.trailjournals.com/, The Hiking Life
https://www.thehikinglife.com/ and [
54,
55]). During events when hikers are gathered, re-visiting and reflecting on such hiking experiences is worthwhile even years after the accomplishment, when memories may have faded and the enormity of the writings from the hike may appear even more daunting. Considering the challenges of measuring blog re-visitation through controlled lab studies (e.g., quantifying reflection and motivating sharing personal experiences) [
56] and given the early stage of this research, we chose to engage in highly interactive and largely qualitative observation sessions with a small number of experts who maintain large personal blogs and also have relevant research knowledge. Observing blog authors’ reflective activities and summarizing their feedback helped us obtain a descriptive understanding of the roles of NLP and VIS in multimodal interaction with blogs. The author sessions were conducted during an evening session at a hiking-themed workshop in which the authors were invited guests. During each session, blog authors were introduced to the system and encouraged to re-visit their extended hiking experiences, and share their hiking knowledge with others verbally so that the research team could understand their thought patterns. In each author session, we sought to observe how bloggers interact with the reorganized blogs presented by BlogCloud, with a focus on learning the following “5R” categories of re-visitation activities:
How do bloggers recollect and reminisce about both high level ideas and specific experiences?
How do bloggers interact with BlogCloud to retrieve symbolic memories from the reorganized experiences?
How do bloggers perceive the topics derived from the re-visited blog paragraphs and reflect on the reorganized content?
Whether bloggers’ re-visitation to the reorganized blogs triggers remembering lost memories?
In the author session, each blog author spent 20–40 min using BlogCloud to explore his or her own hiking blogs. During the workshop, other attendees occasionally joined the conversation and engaged with the system. However, the blog author remained the primary user of the system throughout the session. A graduate student served as the exhibitor to introduce the system and explain operations. The qualitative data were collected through the conversation with blog authors while they were interacting with the system, and emails afterwards discussed feedback to the system.
Blog author 1 (A1) is a male HCI researcher and a dedicated hiker who keeps a personal hiking website with more than 100 hiking-related blog posts since 2012. In A1’s online blog set, there are 97 blog posts with 3999 paragraphs and 230,159 words in total about a 1000-mile hike that he undertook. Blog author 2 (A2) is a female HCI researcher, consultant, university instructor, and ethnographer who maintains a personal website that contains her hiking blog posts. During one 2500-mile hike, A2 recorded her hiking experiences and posted on her personal website. This blog set has 34 posts, which contain 467 paragraphs and 23,663 words. Blog author 3 (A3) is a male university professor and a section hiker (a hiker who breaks a long trail into sections). Starting in 2007, A3 started hiking sections of a 2000-mile trail, and wrote one post for each section he hiked. Cumulatively, he posted 213 blogs with 1904 paragraphs and 1,536,054 words.
7. Findings
Blog authors acknowledged that it was a unique and valuable experience using BlogCloud to explore their own blog collection. All three blog authors noted they did not carefully organize the blogs and tweak the writing, and all had an unfulfilled wish to clean up and better organize their blogs when they have time (even though all authors completed their hikes several years prior to our session). They commented that the system helped them re-visit and organize thoughts about their blogs, which implies potential extended utility of the blog reorganization.
7.1. Recollecting and Reminiscing
Recollecting and reminiscing are interactions with life-logs, which lead to “re-living” specific experiences and reminiscing about the past experiences. During the interaction with BlogCloud, authors reminisced about particular experiences that connect to their memories and recollect experience topics from a non-temporal dimension.
7.1.1. Reminiscing about Particular Experiences
Through word searching, matched experiences are grouped by BlogCloud, avoiding skimming through the most-recent posts. During the author session, blog authors reminisced about particular experiences, meanwhile connecting them to the symbolic words that represent symbolic memories. When A1 searched “climb” (see
Figure 7), he explored the first few results and picked a paragraph about a gradually narrowing cliff path. A1 reminisced about the experience on the cliff path by telling a story:
“It was a path on the cliff. The first time when I passed it, it was this wide”, A1 opened his arms to show how wide it was.
“But the second time when I was there it was barely my feet’s width. I have to pass it with my body clinging to the cliff and be careful.” The particular experience of passing a terrifying cliff path reorganized this kind of
climb experience. A2 noted that reflecting on the word cloud surfaced a memory about her experiences:
“the connections I got to explore and the way that the semantic connections surfaced specific memories… it was a really cool experience to see words connected that acted as triggers to surface memories that I otherwise would not necessarily have been thinking about.” Reading the visualization raised the reminiscence about particular experiences, which suggests that re-visiting a general experience topic turns into reminiscing about a particular one [
25]. However, without the NLP reorganization, the experience might be neglected in the long blog stream, with fewer chances to encounter a particular experience and reconstruct that type of experience.
7.1.2. Crossing the Time Dimension
We notice the broken-and-recombined experiential items bring forth a new perspectives to perceive blogs by recollecting multiple experiences across multiple days and merging them into a topic or story. A1 re-visited multiple paragraphs from different days after each search, trying to interpret whether the content matched his internal conceptualization of the tough nature of his experiences. A2 searched for the word “picnic”, and after viewing the visualization (
Figure 8) and several paragraph cards, she reminisced about two specific experiences on two different days:
“[Interacting with the system reminded me of] a very specific memory about … a nearby store that had non-backpack-friendly snacks available for purchase” and another time
“… at a picnic table in time for the luxury of a place to sit-down while eating a less exciting rehydrated meal.” BlogCloud helped A2 bring two experiences together, explaining how hiking food needs to be prepared for hikers. A3 noted he wanted to view content about different topics, commenting
“It (BlogCloud) allows me to compare different things. This tool can help me organize my thought, like if I am thinking about writing a book about section hiking”. The keywords in the visualization were abstracted high-level descriptions of a general idea, but the actual content comes from content at different time. Our blog authors appreciated viewing experiences across time, enabling recollection of similar items that are sparsely distributed.
7.2. Retrieving
BlogCloud uses user-defined keywords to retrieve and reorganize the blog content. In our observation of BlogCloud, we noticed the importance of the symbolic words in bloggers’ memory. Based on blog authors’ choice of words to describe memorable experiences, we identify the patterns of keywords usage when retrieving symbolic memories.
7.2.1. Active Experience Seeking
During the author session, blog authors were seeking familiar experiences with BlogCloud. Although BlogCloud allows blog viewing with the show-all button (a non-reorganized presentation), blog authors rarely selected paragraphs from their original order, using the search tool to search their own blogs, in contrast with the skimming and random reading observed in prior studies [
6,
18]. All three blog authors spontaneously identified multiple interesting words to interact with BlogCloud. All three authors could explain why they wanted to search for these words to retrieve interesting memories, which direct the interaction with BlogCloud. Rather than passively or randomly re-visiting the blog paragraphs, blog authors used BlogCloud in an active manner: they actively asked questions to the system to seek the reorganized content related to their memories.
7.2.2. Using Alternative Search Words
Blog authors used search words that contrasted common semantics. The NLP used in BlogCloud introduced bias interpreting author semantics in the context of their experiences. A1 wrote about a “bus shed” in his blogs, an uncommon term for observers. He noted “it is my own geographical neologism…I was using it to mean something different as a parallel to a ‘water shed’. It is not so much a single landmark as a generic kind of feature, but boundaries which are easy to walk across, but hard to return by public transport”. Similarly, A2 used “picnic” in her blog, but in contrast to “picnic” in common sense. Her use of picnic referred to “non-backpack-friendly snacks” and “rehydrated food”. Identifying unique semantics of words which contrast with ordinary meaning but importantly represent experiences is a challenge for natural language processing technique of systems which support re-visitation of large personal texts.
The authors attempted to identify alternatives to imprecise search words. When realizing their words were semantically different from ones found in blog paragraphs, blog authors sought alternative words to describe their experiences. After realizing results for the searched term “tear” was not related to sadness, A1 came up with another search word “cry” as an alternative. However, again, he noticed the word “cry” was mostly used to describe animal sounds. A3 searched the phrase “New Jersey” but noticed results only matching “new”, so he searched “Jersey”. These issues imply that blog authors had particular topics to explore that were not well retrieved, so they had to refine and adjust their search words or acknowledge failure for a topic. Text processing techniques should reflect alternative word meaning, and interactive visualization should guide bloggers for clarification when necessary.
7.3. Reflecting
BlogCloud derives topics from the re-visited paragraphs and conducts reorganization of related contents. It presents similar contents as topics in the visualization, to provide abstract representations of blogs and support reflection. During the author session, the topics identified from the entire blog collections led to alternative ways to re-visit hiking experiences.
7.3.1. Making Sense of Topics
BlogCloud offers a different view of experiences to trigger reflection. BlogCloud reorganize experiences by similarity. Information about experience topics can be extracted to inspire new perspectives. In the author sessions, blog authors undertook sensemaking activities with their own blogs. A1 found sensemaking important for reflection on personal content. He commented,
“although you have written the blog, it does not mean you fully grasp the big themes that run through it … hence the need for sense-making tools.” A2 searched several words and reflected on the overview of multiple experiences. She commented
“the connections were the interesting part, much more than just reflecting on some experience alone.”. The visualization of paragraph groups was used by A3 to make sense of symbols as a way to reflect on his own blog from a different perspective. A3 used the visualization to compare recurring experiences; he searched “difficult”, “rocky”, “steep”, and “hot”, creating four corresponding paragraph groups. After reading the connecting strips (
Figure 9), A3 commented,
“it makes sense to me since ‘steep’ is more related to ‘rocky’ and ‘difficult’ than ‘hot’”. When seeing “bike” had more documents than “scooter”, he commented
“[the visualization] reflects that I used bike more than
scooter for transportation”. When speculating that “cold” was mentioned more than “hot” in his blogs, he paused for a while and said,
“‘cold’ is bigger than ‘hot’, hmm… maybe that is because it took me more time hiking the colder north part, than the warmer south.”. Making sense of the visualized blog reorganization leads to new perspectives and deeper reflections about the experiences.
7.3.2. Connecting Related Topics
Blog authors associated related symbolic memories during reflection. Besides perceiving experience overviews and making sense of experience aspects, blog authors were also interested in delving into the connections between different experience topics. Seeing connections between word clouds of different topics helped A2 interpret related experiential items, noting: “it would be good if I could see the words between two groups, like on the table, or in the visualization, so I know how the words relate to each group, and know what things fall between [the two word clouds].” The connection from one topic to another could also be a flow of recalling consecutive experiences. When reading blog paragraphs about “bramble”, A1 remembered on that day there was one “bus-shed”, even though the paragraph was not directly describing it, and he explained his unique definition of “bus shed”. Connecting groups of reorganized experiential items sheds light on design improvements: besides showing abstracted information, the visualization should reflect how different experience topics are connected.
7.4. Remembering
Remembering in Sellen et al.’s definition emphasizes suggesting prospective events [
30]. However, blogs store experiences along life and contain large volume of content, therefore memory might fail even with the blogs created by oneself.
Recovering Lost Memories
Experience documentation has been recognized as the most prevalent purpose for blogging [
8,
57]. However, considering the large number of blog posts and time passed since creation, experiential items might be lost in memory. During multiple rounds of searching and reading the visualization, blog authors have chances to explore content they did not remember well. We notice all three blog authors found words in the visualization or paragraphs on cards that they did not immediately recognize. However, blog authors did not skip these clues; they would collect more information to recover their lost memory. For example, when the visualization showed “dead” and “porpoise” as keywords in one paragraph group, one observer asked the author whether seeing a “dead porpoise” made A1 sad? A1 could not remember where he wrote about a porpoise, but a search revealed that he once thought the cracked wood looked like the “snout of a dead porpoise”. As another example, A3 searched “Jersey”, and several people’s names were displayed. Through reading the paragraphs related to the names, he recalled hikers he met in New Jersey. In addition, efforts to recover failed memories were also reflected through their suggestions for system improvement. A1 realized he could not find consecutive paragraphs with BlogCloud, so he recommended to add a “before” and “after” button to help transition gently from searching to reading. Similarly, A2 suggested an additional layer to show the whole blog entry. To recover lost memories, blog exploration tools should not only present the blog sections, but also give enough contextual information to help memory recovery.
9. Conclusions and Future Directions
In this paper, the term
reorganization describes a pathway to incorporate natural language processing and visualization in support of re-visitation of large personal blog repositories (see
Figure 1). While AI and other automated data-processing approaches are increasingly incorporated to support multi-model interaction, little attention has been drawn to use automation for personal content re-visitation. This work presents our experience observing three long-term bloggers re-visiting personal blogs with computer-organized content, seeking to provide an initial understandinh of how automation supports re-visitation. The author sessions investigated how the re-organized blogs lead to new ways to interact with blogs, toward understanding one’s own past experiences. From the experience with BlogCloud, we identify that
reorganized topics offer reflection materials and build new re-visitation paths, and they allow bloggers to interact with symbolic memories. The ever-growing personal content is becoming a valuable repository for people to re-visit and reflect on previous experiences, but following this is the challenge of designing interactive systems for identifying values and creating meanings from large personal datasets [
1,
9]. Natural language processing as a material for design could be used to bridge the gap between the massive personal content and human’s need to re-visit and reflect on meaningful experiences [
59]. From the findings of this work, NLP and machine learning techniques can be incorporated in future multi-modality design. Designers should consider intelligent systems to help bloggers identify topics from one’s own experience, re-visit experience from anti-time-line paths, and interact with memory symbols with the re-visitation tools and systems.
Certainly this work suggests much-needed follow-up efforts. The small participant number and brief author session was useful in delving deeply into usability issues, but questions remain about generalizability of findings. Similar to other exploratory studies on emerging issues and opportunities in designs for reflection [
1,
4], we focused on qualitative understanding of how bloggers interact with a multimodal design which presents the NLP reorganized personal blogs to support re-visitation of past experiences. While we have only a small number of participants, we selected three blog authors whom we found have valuable knowledge about large personal blogs as well as usability and engineering. Short but focused sessions with them helped to probe whether the
reorganization leads to creative re-visiting activities, and what user experience would be expected. This approach can better help capture the opportunities and issues with NLP-based re-visitation than other controlled lab scenarios (e.g., a lab study that recruits participants to read irrelevant blogs). Through the sessions with the dedicated bloggers, we investigated the re-visitation interaction and issues that emerged from the reorganized blog content. With the initial findings from this work, follow-up research needs to consider how to support diverse reading activities with different reader types and reading interests, for example, new readers read other’s blogs, bloggers reflect on blogs of different themes, or users read blog collections created by multiple bloggers. In addition, BlogCloud is implemented on large multi-touch displays, which are not universally available to all bloggers. In our study, large displays offer benefits for managing large blog representations and supporting group discussion; however, whether such experience could be generated when interacting with personal devices calls for future investigation. Future work will consider adapt the system to personal devices such as computers or tablets.
Moving forward, we foresee new possibilities for this work in three directions. First, we will seek better interactive designs that encourage people to express their meaningful symbolic words, and collect user’s interests implicitly, by using computational approaches to recognize promising reflective topics. Second, alternative visual and interactive designs that support reorganization will be explored to enable more ways of topic interaction. Re-visitation with other devices such as phones, tablets, or personal computers calls for better design to support interaction with large personal texts. How the new re-visitation paths brought by NLP and VIS could be seamlessly integrated to the existing chronological-ordered blogs requires investigation. It is important to support meaning back-and-forth navigation between high-level topics and blog details. Third, the collaborative understanding of personal blog systems, at collocated events such as conferences and group talks, include possibilities to reflect on and exchange experience topics through experience sharing and collocated re-visitation. New data mining and visualization techniques need to be designed and applied in collaborative tools to support collocated reflection, leading to identification of connections between experience topics of different people.