<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Data-Analysis</title><link>https://jwheel.org/tags/data-analysis/</link><description>Homepage of Justin Wheeler, an Open Source contributor and Free Software advocate from Georgia, USA.</description><generator>Hugo -- gohugo.io</generator><language>en-us</language><managingEditor>Justin Wheeler</managingEditor><lastBuildDate>Tue, 15 Nov 2022 00:00:00 +0000</lastBuildDate><atom:link href="https://jwheel.org/rss/tags/data-analysis/index.xml" rel="self" type="application/rss+xml"/><item><title>Scrub gently: On data scrubbing in a community survey.</title><link>https://jwheel.org/blog/2022/11/scrub-gently-community-survey/</link><pubDate>Tue, 15 Nov 2022 00:00:00 +0000</pubDate><guid>https://jwheel.org/blog/2022/11/scrub-gently-community-survey/</guid><description><![CDATA[<p>Recently, my team with the <a href="https://chaoss.community/">CHAOSS Project</a> had a data concern emerge when I was working on a project to run a community survey. This community had never run a survey before, and it was the first notable event where the project made an explicit, structured ask for feedback from the community. As a result, this first experience was also a calibration event, so we could guide this kind of work in future years.</p>

<h2 id="survey-says-what">Survey says: What?&nbsp;<a class="hanchor" href="#survey-says-what" aria-label="Anchor link for: Survey says: What?">🔗</a></h2>
<p>At some point, after we opened the survey, a question emerged about how to handle an unruly response. In the ongoing responses, our data manager noticed one response that was objectively harmful. The person was strongly against the D.E.I. initiative that organized the survey. The response was written in a hostile tone, made insulting and derogatory comments about groups of people, and was entirely opposed to the project spending any time and resources on diversity, equity, and inclusion. The question asked to our group was whether we would include this response in the published data, or whether we would omit it.</p>
<p>There were two perspectives. Some elected to remove this response from the final report and any published data. Others felt it was important to wait and see if this response would become a pattern as we ran the survey. I found myself in the second group that felt it was important to wait and see first. I want to unpack this rationale, both for future me and perhaps someone else reading.</p>

<h2 id="on-discarding-the-survey-response">On discarding the survey response&nbsp;<a class="hanchor" href="#on-discarding-the-survey-response" aria-label="Anchor link for: On discarding the survey response">🔗</a></h2>
<p>There were good points about removing the harmful response.</p>
<p>Firstly, the response used harmful language and was likely triggering. This particular response included angry rhetoric that was reflective, to a degree, of the social and political &ldquo;climate&rdquo; of our world today. Including the response in our final reporting could also be giving it a platform, which would arguably be a harmful act. It would validate that input as acceptable input. Our group was not in disagreement that the response was harmful and not behavior the community should tolerate.</p>
<p>Second, the response did not provide actionable insight or useful asks to the project and community. It was written in an aggressive, angry tone towards the reader and did not offer workable suggestions other than ending and divesting from all D.E.I. work immediately. Given this was not an acceptable option, there wasn&rsquo;t much there for us to learn or understand about CHAOSS from this individual response. So, why include or save this response?</p>
<p>There is an option to ignore feedback by intentionally discarding it, but what if the individual feedback represents a larger trend?</p>

<h2 id="what-is-community-culture">What is community culture?&nbsp;<a class="hanchor" href="#what-is-community-culture" aria-label="Anchor link for: What is community culture?">🔗</a></h2>
<p>It is important to be aware of threats to community culture. What is community culture? My improvised definition is any organizational culture oriented towards the care, well-being, and thriving of others (including the self) within a single, shared community environment. Regardless of other values and goals in a project, the shared culture of the project can either lean towards a collective, communal-oriented approach or an independent, individual-oriented approach. The communal approach that prioritizes the well-being of all instead of a privileged view could also be considered as community culture. Many traditional &ldquo;Open&rdquo; projects skew toward a strong community culture.</p>

<h2 id="on-monitoring-survey-responses-for-a-pattern">On monitoring survey responses for a pattern&nbsp;<a class="hanchor" href="#on-monitoring-survey-responses-for-a-pattern" aria-label="Anchor link for: On monitoring survey responses for a pattern">🔗</a></h2>
<p>Coming back to the survey response, what if omitting the data leaves holes in the story of your community? If there is not just one, but several of these kinds of responses, what comments does that make about the community culture? Is there already a strong community culture, or is there resistance and challenges to building a more cooperative, caring environment? There is real work to do at both ends of the spectrum, but what that work might look like depends on which side you are on.</p>
<p>I posit that omitting the &ldquo;unhappy&rdquo; or harmful responses can create a dangerous blind spot to toxicity within a community culture. When it comes to direct, interpersonal interactions with others (e.g. meetings, emails, chats, etc.), stewards of the community culture need to take direct action against visible challenges and threats to the community culture. If someone starts swearing out at someone in a meeting, that is a hard-to-miss action. It is visible, and anyone could observe it or even record it.</p>
<p>In anonymous surveys, you might find a more subtle layer of the community culture than what is shown by the actions of a small few. There can be greater trust that someone&rsquo;s comments will not be tied back to their identity, so some responders may feel emboldened with their words and true opinions.</p>

<h2 id="dont-discard-a-blind-spot">Don&rsquo;t discard a blind spot.&nbsp;<a class="hanchor" href="#dont-discard-a-blind-spot" aria-label="Anchor link for: Don&rsquo;t discard a blind spot.">🔗</a></h2>
<p>The point of this is that especially in larger communities, it is worth noting negative and harmful responses and not totally ignoring them. Communities that organize in more decentralized ways will always have supporters, users, and contributors from both the core and the periphery. The core project membership may not interact or engage often with the periphery often, so there can be a blind spot to parts of the project that identify with the community but are a few degrees removed from the inner ring of the project community.</p>
<p>Noting whether something is indicative of a larger pattern is important. If your community has a ton of jerks, you need to know that your community is full of jerks so that you don&rsquo;t waste time persuading people otherwise, when the lived experience is very different.</p>
<p>In the original conversation with the CHAOSS Project team, this data scrubbing question emerged in the process of running the survey instead of <em>after</em> the data collection concluded. The survey later closed and our data manager confirmed that the flagged response from earlier was the only one of its kind. As a group, we then felt more confident in discarding that one outlier as an anomaly since the survey was open to the general public.</p>
<hr>
<p><em>Feature photo by <a href="https://unsplash.com/@jeshoots?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText">JESHOOTS.COM</a> on <a href="https://unsplash.com/s/photos/cleaning?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText">Unsplash</a>. Modified by Justin Wheeler.</em></p>]]></description></item><item><title>CHAOSS DEI Review: Midyear reflection</title><link>https://jwheel.org/blog/2022/10/chaoss-dei-review-reflection/</link><pubDate>Tue, 25 Oct 2022 00:00:00 +0000</pubDate><guid>https://jwheel.org/blog/2022/10/chaoss-dei-review-reflection/</guid><description><![CDATA[<p>Since February 2021, the CHAOSS Project is conducting a funded, long-term review of its governance, practices, and processes in a diversity, equity, and inclusion (D.E.I.) &ldquo;audit.&rdquo; I originally joined as an internal community liaison and initially helped to identify a team of D.E.I. practitioners external to the CHAOSS Project to support this work. Thanks to the support of the Ford Foundation, we are slowly approaching the two-year anniversary of when this work began.</p>
<p>My brief readout is a guided reflection using questions shared by Matt Germonprez. This reflects my review of our work as a team to date and also shares some of my hopeful outlooks for what our amazing team can accomplish together. This readout will cover <strong>(1)</strong> our accomplishments as a team, <strong>(2)</strong> what was expected and surprising, and <strong>(3)</strong> what we could change in the next year.</p>

<h2 id="chaoss-accomplishments--learnings">CHAOSS accomplishments &amp; learnings&nbsp;<a class="hanchor" href="#chaoss-accomplishments--learnings" aria-label="Anchor link for: CHAOSS accomplishments &amp; learnings">🔗</a></h2>
<p>Three achievements and aspirations stand out over the past year:</p>
<ol>
<li>Established process management and a team workflow.</li>
<li>Created a small but active Community of Practice (CoP).</li>
<li>Sharing our results with CHAOSS and the Open ecosystem.</li>
</ol>

<h3 id="processes--workflow">Processes &amp; workflow&nbsp;<a class="hanchor" href="#processes--workflow" aria-label="Anchor link for: Processes &amp; workflow">🔗</a></h3>
<p>
<figure>
  <img src="/blog/2022/10/jonny-gios-4AT3mZMuFuI-unsplash.jpg" alt="A metalworker is working at an anvil. A red-hot iron rod is on the anvil, and a person uses a hammer to shape and mold the hot iron into a hooked shape." loading="lazy">
  <figcaption>We had to forge our own practices that worked best for our group. Photo by Jonny Gios (<a href="https://unsplash.com/@supergios?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/@supergios?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>) on Unsplash (<a href="https://unsplash.com/s/photos/forge?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/s/photos/forge?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>).</figcaption>
</figure>
</p>
<p>For direct participants of the team, the Ford Foundation funding did not come with strict requirements or success metrics. As we assembled our team, we were given the discretion of how to conduct a D.E.I. review for the project and determine the best course of doing that. This allowed for creative freedom to figure out what would work best for CHAOSS. Additionally, I could not identify a straightforward way to discover other Open communities and projects doing our kind of work. Since there were also not many other known successful models to follow, we combined our shared experiences across multiple Open communities to build our team, identify main areas of focus, and engage the community around our efforts.</p>
<p>This is an achievement because we collectively created an active group that makes incremental, positive changes to CHAOSS. This is a model we could share with other projects so that others can learn from our experiences.</p>

<h3 id="community-of-practice">Community of Practice&nbsp;<a class="hanchor" href="#community-of-practice" aria-label="Anchor link for: Community of Practice">🔗</a></h3>
<p>Our team is a small but engaged group of D.E.I. practitioners. We share a connection through our ongoing review of the CHAOSS Project, but we also give and take from our own personal experiences outside of CHAOSS. Our group regularly meets and discusses complex, difficult issues that are both (a) not easy to discuss openly and (b) applicable to many communities beyond only CHAOSS. Our team meetings are a safe space that promotes honest and constructive discussion centered on diversity, equity, and inclusion. In addition to our recommendations and direct efforts with CHAOSS, I often reflect on our conversations as a team when working with other Open communities. An example of this is how we built a list of questions to get a &ldquo;pulse&rdquo; from the community on their feelings about CHAOSS.</p>

<h3 id="sharing-results-with-chaoss-and-beyond">Sharing results with CHAOSS and beyond&nbsp;<a class="hanchor" href="#sharing-results-with-chaoss-and-beyond" aria-label="Anchor link for: Sharing results with CHAOSS and beyond">🔗</a></h3>
<p>This is aspirational and not yet fully realized. Our team has collected a solid portfolio of stories and experiences that other communities would stand to benefit learning from. I consider this a current achievement because while our work does specifically look at CHAOSS, we also often reflect from a general perspective and how a topic of interest might look in other communities. When the time comes to package our findings, I believe we are setting ourselves up for easier messaging and outreach opportunities in the future.</p>

<h2 id="according-to-expectations">According to expectations&nbsp;<a class="hanchor" href="#according-to-expectations" aria-label="Anchor link for: According to expectations">🔗</a></h2>
<p>While I have worked in Open Source D.E.I. communities since 2015, I have never conducted an applied research review for community D.E.I. before. I did not come into this with strong immediate expectations because it would inevitably reflect the backgrounds and strengths of the team we would assemble. However, I did have specific hopes or things I hoped would be realized by this work.</p>

<h3 id="as-expected">As expected&nbsp;<a class="hanchor" href="#as-expected" aria-label="Anchor link for: As expected">🔗</a></h3>
<ul>
<li><strong>Data-driven approach</strong>: We began this work without a strong representation of the state of CHAOSS. What do contributors think about the project? While data is not a universal panacea, we gravitated to a community survey early on because we needed to understand the community experience better first before making serious suggestions.</li>
<li><strong>Time zones are hard</strong>: Our team was spread out across North America, Africa, LATAM, and Europe. Additionally, the work with CHAOSS was also a part-time venture for most of us, in addition to primary employment. Calendars and schedules are hard to get right. Since our team&rsquo;s organization was ad-hoc, momentum would occasionally slow for some periods.</li>
<li><strong>We have an amazing team!</strong> I expected great things once we identified our roster. We have also had more amazing people join us over time and add new passion and insight to our focus as a group.</li>
</ul>

<h3 id="surprises">Surprises&nbsp;<a class="hanchor" href="#surprises" aria-label="Anchor link for: Surprises">🔗</a></h3>
<ul>
<li><strong>Documenting our impact is not always intuitive</strong>: While we have done internal storytelling work within the CHAOSS Project, we do not have a good record of our achievements to date. Our linear progression does not lend itself easily to self-reflection and recalibration. Although much of our focus is on the CHAOSS community survey and CHAOSS Africa, we also facilitated several other notable achievements in the project in the last year. See the following examples:
<ul>
<li>Supporting the establishment of a Code of Conduct Committee.</li>
<li>Community office hours for newcomers.</li>
<li>Improved, peer-to-peer onboarding experience in CHAOSS.</li>
<li>Increased efforts in CHAOSS mentored projects (e.g. Outreachy and GSoC).</li>
<li>Recommending changes to the project and community, like broader localization to Chinese &amp; Spanish and establishing a D.E.I. council.</li>
</ul>
</li>
<li><strong>Losing and regaining steam on the survey</strong>: Although the community pulse survey was one of the earliest tasks identified in our work, launching a first survey proved to take a lot of resources from the team. We briefly stalled out on the survey effort while focused on other areas (like listed above). While our team was able to achieve many smaller victories for CHAOSS with low-hanging fruits, it took a sustained focus and slowdown on new topics to achieve larger contributions like the community pulse survey.</li>
</ul>

<h2 id="changes-for-the-chaoss-team-next-year">Changes for the CHAOSS team next year&nbsp;<a class="hanchor" href="#changes-for-the-chaoss-team-next-year" aria-label="Anchor link for: Changes for the CHAOSS team next year">🔗</a></h2>
<p>Looking ahead to 2023, I hope to strengthen our efforts as a team in these areas:</p>
<ol>
<li>Packaging our work</li>
<li>Dissemination of our work</li>
</ol>
<p>
<figure>
  <img src="/blog/2022/10/christophe-rollando-uOi-nHgMR5o-unsplash.jpg" alt="Large, gold-colored balloons spell out 2023. Several other silver-colored objects surround the gold letters, like star-shaped balloons, tree ornaments, and card-stock stars." loading="lazy">
  <figcaption>Photo by Christophe Rollando (<a href="https://unsplash.com/@chrisrolls?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/@chrisrolls?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>) on Unsplash (<a href="https://unsplash.com/s/photos/2023?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/s/photos/2023?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>).</figcaption>
</figure>
</p>

<h3 id="packaging">Packaging&nbsp;<a class="hanchor" href="#packaging" aria-label="Anchor link for: Packaging">🔗</a></h3>
<p>Our work stream was linearly ordered and we took a forward-looking approach. Now is a good time to look back and reflect on our results to date. What are our key findings and observations? What suggestions will we make to CHAOSS? How could other communities learn from our experience running this review? One task for us as a team is to identify key messages and themes so that dissemination into broader domains is possible.</p>

<h3 id="dissemination">Dissemination&nbsp;<a class="hanchor" href="#dissemination" aria-label="Anchor link for: Dissemination">🔗</a></h3>
<p>Once we package our work, notes, and reflections, we should take an active approach to disseminating and sharing our work. This includes both the CHAOSS Project and a more general audience. For the CHAOSS Project, this could be a written report, presentations to the CHAOSS board, speaking at <a href="/tags/chaosscon/">CHAOSScon</a>, and outreach to the multiple Working Groups. For a general audience, this could include speaking at industry conferences, sharing our work with other Communities of Practice, social media, or other ways of promoting our deliverables.</p>]]></description></item><item><title>4 metrics to measure sustainable open source investments.</title><link>https://jwheel.org/blog/2021/12/4-metrics-open-source-investments/</link><pubDate>Fri, 31 Dec 2021 00:00:00 +0000</pubDate><guid>https://jwheel.org/blog/2021/12/4-metrics-open-source-investments/</guid><description><![CDATA[<p>How do we understand value when we talk about sustainability? What does investing in open source mean? The meaning is different for many people because of an implicit understanding of what open source means.</p>
<p>This post is a reflection on the past year in my work with the <a href="https://www.unicefinnovationfund.org/">UNICEF Venture Fund</a>. We integrated new open source tools to capture metrics and data about open source repositories connected to UNICEF portfolio companies and created a shortlist of key metrics that map to business sustainability metrics. Now, we are better positioned to look back on past, current, and upcoming portfolio companies and mentor support programs.</p>
<p>As we move into 2022, this post covers my current thinking on these points:</p>
<ol>
<li>Defining investments.</li>
<li>How do these investments impact sustainability?</li>
<li>CHAOSS metrics as an open source tool for an investment lens on sustainability.</li>
<li>What next?</li>
</ol>

<h2 id="defining-investments">Defining investments.&nbsp;<a class="hanchor" href="#defining-investments" aria-label="Anchor link for: Defining investments.">🔗</a></h2>
<p>When we talk about investing in open source, what do we mean? What are the known inputs? What are the expected outputs? &ldquo;Investments&rdquo; and &ldquo;investing&rdquo; are broad terms. Investments typically mean sizeable financial injections of support and growth, but can also include non-financial investments too. Investments can also take the form of both time and energy (i.e. electricity and digital infrastructure).</p>
<p>The UNICEF Venture Fund provides equity-free funding for start-up companies building open source solutions of interest to UNICEF. All the start-up companies are registered companies in <a href="https://www.unicef.org/where-we-work">UNICEF program countries</a>. As part of the Venture Fund&rsquo;s location in the <a href="https://www.unicef.org/innovation/">Office of Innovation</a>, it is also a vehicle for UNICEF to explore frontier technology areas through the investments. When a start-up company is receiving investment from UNICEF, the company receives both funding and tailored mentorship about business and open technology.</p>
<p>A question I want to know is, <strong>what is the impact of the received funding plus guided mentorship</strong>? How does this approach enable the companies to be successful after graduating? What discoveries or knowledge could be shared with others to assist the development of their own open programs?</p>
<p>To summarize, <strong>an investment can be financial or non-financial</strong>. Financial investments include direct funding, grants, venture capital, fellowships, or any other exchange of capital. Non-financial investments include time spent in coaching sessions, personalized content for companies, and shared digital infrastructure. Neither list is exhaustive.</p>

<h2 id="how-do-these-investments-impact-sustainability">How do these investments impact sustainability?&nbsp;<a class="hanchor" href="#how-do-these-investments-impact-sustainability" aria-label="Anchor link for: How do these investments impact sustainability?">🔗</a></h2>
<p>
<figure>
  <img src="https://cauldron.io/static/img/cauldron-logo-white.png" alt="Logo for Bitergia&rsquo;s Cauldron hosted analytics platform. A key part of metrics for UNICEF Venture Fund investments." loading="lazy">
  <figcaption>Bitergia Cauldron.io (<a href="https://cauldron.io" class="bare">https://cauldron.io</a>)</figcaption>
</figure>
</p>
<p>Data makes introspection easier. Bitergia&rsquo;s <a href="https://cauldron.io/">Cauldron.io</a> was a champion tool for kickstarting an open source metrics strategy for the UNICEF Venture Fund. Its introduction as a tool opened up a wider span of data to look at. There are new opportunities to ask questions and explore growth, scale, and sustainability.</p>
<p>In order to come to a conclusion on sustainability impact, we need streamlined data to test a thesis. The Venture Fund team improved internal processes to how metrics are collected from portfolio companies. The team is unifying behind fewer tools and methods to ensure we see the same data and have the same view of the data points we measure. This also provides a fresh opportunity to review how we measure open source impact across portfolio companies. Many have dashboards on Cauldron.io, but data needs a storyteller for it to make meaning. So, the next step is to ask questions with this new data and frame a thesis to measure and test the sustainability of Venture Fund investments into open source.</p>
<p>Many have traveled before me on the same trail of thought. I started first with the <a href="https://chaoss.community/">Community Health Analytics Open Source Software (CHAOSS) project</a> and its metrics releases. This served as the initial point of brainstorming to frame questions and different scenarios of risk, evolution, DEI, and value.</p>

<h2 id="chaoss-metrics-as-an-open-source-tool-for-an-investment-lens-on-sustainability">CHAOSS metrics as an open source tool for an investment lens on sustainability.&nbsp;<a class="hanchor" href="#chaoss-metrics-as-an-open-source-tool-for-an-investment-lens-on-sustainability" aria-label="Anchor link for: CHAOSS metrics as an open source tool for an investment lens on sustainability.">🔗</a></h2>
<p>I reviewed the <a href="https://chaoss.community/wp-content/uploads/2021/10/English-Release-2021-10-21.pdf">latest release</a> of CHAOSS metrics and narrowed down four metrics I want to measure in the next year. I also shared thoughts on why collect this data and how to do it. This blog post is no more than me wondering out loud, to help me frame an analytical approach for this metrics strategy.</p>
<p>The four metrics are detailed below:</p>
<ol>
<li>Contribution Attribution</li>
<li>Contributors</li>
<li>Collaboration Platform Activity</li>
<li>Labor Investment</li>
</ol>
<p>
<figure>
  <img src="/blog/2021/12/glenn-carstens-peters-RLw-UC03Gwc-unsplash.jpg" alt="A hand holds a pen and is writing on a sheet of notebook paper. They appear to be making a list." loading="lazy">
  <figcaption>Take note of your dependencies and contributors.
<em>Photo by Glenn Carstens-Peters (<a href="https://unsplash.com/@glenncarstenspeters?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/@glenncarstenspeters?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>) on Unsplash (<a href="https://unsplash.com/s/photos/lists?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/s/photos/lists?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>)</em>.</figcaption>
</figure>
</p>

<h3 id="contribution-attribution"><a href="https://chaoss.community/metric-contribution-attribution/">Contribution Attribution</a>&nbsp;<a class="hanchor" href="#contribution-attribution" aria-label="Anchor link for: Contribution Attribution">🔗</a></h3>
<blockquote>
<p>Question: Who has contributed to an open source project and what attribution information about people and organizations is assigned for contributions?</p>
<p><a href="https://chaoss.community/metric-contribution-attribution/">chaoss.community/metric-contribution-attribution/</a></p>
</blockquote>
<p>This metric is insightful because it is targeted deeply into team and project culture. This metric is a good representation of how much the project leans into an open source model of building their project. This work ethos and intention to forge on an open source path is difficult to understand at times. If a team takes care to attribute their software dependencies and other contributors to their code (if any), this is a good sign that the team accepts collaboration as a value and encourages working with others.</p>
<p>I would measure this across two <a href="https://chaoss.community/metric-types-of-contributions/">types of contributions</a>: attributions for software dependencies including those with permissive licenses, and for any other direct contributors to the code and how they are recognized for their participation. This could be filtered in a red-yellow-green light approach:</p>
<ol>
<li><strong>Red</strong>: No attributions are made, or all attributions are inadequate.</li>
<li><strong>Yellow</strong>: One of two attributions are made, or one attribution type is inadequately attributed.</li>
<li><strong>Green</strong>: All dependencies and used works are correctly attributed.</li>
</ol>
<p>
<figure>
  <img src="/blog/2021/12/alex-hudson-m3I92SgM3Mk-unsplash.jpg" alt="View looking down at a small farmer&rsquo;s market, where a woman sits behind several different cases of vegetables. A man hands payment to the woman for unseen goods. This is connected how knowing your customers can also be like knowing your community." loading="lazy">
  <figcaption>Spend more time getting to know who participates and why.
<em>Photo by Alex Hudson (<a href="https://unsplash.com/@aliffhassan91?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/@aliffhassan91?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>) on Unsplash (<a href="https://unsplash.com/s/photos/bazaar?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/s/photos/bazaar?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>)</em>.</figcaption>
</figure>
</p>

<h3 id="contributors"><a href="https://chaoss.community/metric-contributors/">Contributors</a>&nbsp;<a class="hanchor" href="#contributors" aria-label="Anchor link for: Contributors">🔗</a></h3>
<blockquote>
<p>Question: Who are the contributors to a project?</p>
<p><a href="https://chaoss.community/metric-contributors/">chaoss.community/metric-contributors/</a></p>
</blockquote>
<p>This metric explores a more human dimension of the people and participants to an open source project. The metric defines contributors and contributions broadly, as &ldquo;anyone who contributes to the project in any way.&rdquo; Understanding the people participating in a community, their motivations, goals, and why they choose to be in that community is important to understand sustainability. Otherwise, you may lose out on good opportunities to attract contributions from people who are already engaged, and new engagements may be difficult because of a mismatch of expectations.</p>
<p>This metric is more a means than it is an end; that is, it provides opportunities to ask more questions than provide detailed answers. Nevertheless, it does provide some guidance towards understanding contributors in a project, and it can lead to some concrete actions based on gathered insights. For example, this metric will enable deeper looks in areas of diversity, equity, and inclusion.</p>
<p>Since I work with start-up companies with small, lean development teams, I look to understand the motivations of the developers working on their projects and where the motivations may align with another open source solution. This enables the two communities to leverage their combined brainstorming for meeting complimentary goals around development and innovation.</p>
<p>To collect this data, I would have the team define what <a href="https://chaoss.community/metric-types-of-contributions/">areas of contribution</a> they seek for their open source solutions and then map those desired contributions to a specific project area or different team members. This enables a form of consistent accountability for checking expectations with reality and understanding team capacity. Each area could be a key-value pair, where the value is the project area, team lead, or delegated team member for the type of contribution solicited.</p>
<p>
<figure>
  <img src="/blog/2021/12/kai-dahms-5paXZX8lWk-unsplash.jpg" alt="The dashboard of an older plane is shown, with several different meters, switches, and control knobs. In many ways, the places where we collaborate on our projects can also be as complicated, and we can miss out on some useful features if we are not looking in the right place." loading="lazy">
  <figcaption>There are many ways to collaborate, but the question is, are you counting the right ways?
<em>Photo by Kai Dahms (<a href="https://unsplash.com/@dilucidus?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/@dilucidus?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>) on Unsplash (<a href="https://unsplash.com/s/photos/measure?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/s/photos/measure?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>)</em>.</figcaption>
</figure>
</p>

<h3 id="collaboration-platform-activity"><a href="https://chaoss.community/metric-collaboration-platform-activity/">Collaboration Platform Activity</a>&nbsp;<a class="hanchor" href="#collaboration-platform-activity" aria-label="Anchor link for: Collaboration Platform Activity">🔗</a></h3>
<blockquote>
<p>Question: What is the count of activities across digital collaboration platforms (e.g., GitHub, GitLab, Slack, email) used by a project?</p>
<p><a href="https://chaoss.community/metric-collaboration-platform-activity/">chaoss.community/metric-collaboration-platform-activity/</a></p>
</blockquote>
<p>Collaboration platform activity is one effective proxy metric for community engagement if measured accurately. The metric does not define collaboration as much as it provides a data structure to measure it. It abstracts collaboration into key data points like timestamp, sender, whether the platform has threaded or non-threaded discussions, data collection date, and platform message identifier. To a degree, collaboration can be abstracted out in this way: a person takes any given action at a given time in a given way, and this action is measured as project-related activity on the collaboration platform.</p>
<p>There are a few possible approaches to collecting this data from UNICEF Venture Fund companies. Each approach does not cancel out another, but each approach could be combined with the others:</p>
<ol>
<li><strong>Measure common git activity like commits, issues, pull/merge requests</strong>. We already measure this data, but use it only in connection to validating Venture Fund workplans for each team with UNICEF portfolio manager(s).</li>
<li><strong>Count communications like comments, reviews, public messages, and other outreach</strong>. Communications strategies and tools are typically inferred from common git activity. Measuring for engagement and stratifying those metrics into a smaller group could allow for deeper insights to the evolution of early-stage open source communities.</li>
<li><strong>Make community hubs first-class citizens in the data curation process to infer about informal engagement</strong>. Both open source projects and UNICEF Venture Fund portfolio companies use a variety of tools to communicate, especially in view of COVID-19 and its seismic impact on how we work. Platforms like Discord, Telegram, Mattermost, Slack, Rocket.chat, Matrix, and others are focal points where projects collaborate, ask questions, and support others. Bringing this data stream into the mix offers deeper insights into how teams engage and build community around their work, and also guidance on when to push for contribution opportunities at the right time.</li>
</ol>
<p>The satisfaction of these three options in their totality is not enough. To leverage the fullest impact, these metrics must tie into each other, and need to be connected back to a narrative. Why is this data being collected and what actions are influenced by the knowledge of this data? The data collection enables the evaluation of sustainability and understanding the birth, growth, and evolution of an open source technology product. Influenced actions can include moving more human resources (i.e. contractors or staff) to support a project, adopting a new open source best practice, and/or engaging new customers, talent, or other leads based on participation in the community.</p>
<p>Measuring collaboration platform activity is not black and white. Many new questions would likely come forward as part of measuring this activity. Yet this is the point—it lays the foundation for the next layer to the data collection, analysis, and reporting process around sustainability.</p>
<p>
<figure>
  <img src="/blog/2021/12/jon-tyson-kR4K8nJ9JRc-unsplash.jpg" alt="A man is facing forward with his back to the camera. He wears a heavy coat and a construction hard hat. The background is blurred and unclear. In this way, we can think of labor investment from a human-centered approach first." loading="lazy">
  <figcaption>What is the impact of an investment on fair and equitable labor?
<em>Photo by Jon Tyson (<a href="https://unsplash.com/@jontyson?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/@jontyson?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>) on Unsplash (<a href="https://unsplash.com/s/photos/worker?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText" class="bare">https://unsplash.com/s/photos/worker?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText</a>)</em>.</figcaption>
</figure>
</p>

<h3 id="labor-investment"><a href="https://chaoss.community/metric-labor-investment/">Labor Investment</a>&nbsp;<a class="hanchor" href="#labor-investment" aria-label="Anchor link for: Labor Investment">🔗</a></h3>
<blockquote>
<p>Question: What was the cost of an organization for its employees to create the counted contributions (e.g., commits, issues, and pull requests)?</p>
<p><a href="https://chaoss.community/metric-labor-investment/">chaoss.community/metric-labor-investment/</a></p>
</blockquote>
<p>This metric is perhaps the most ambitious of the group. How do you measure labor investment into an open source project? Or literally, the number of person-hours that go into software design, development, co-creation, and community management? It feels like a gargantuan effort, but there may be better ways to measure this in connection to other data the UNICEF Venture Fund is already connected about the businesses.</p>
<p>Measuring labor investment impacts two narratives: the rate of development on the open source work, and the impact of UNICEF investment into a company backing an open source work.</p>
<p>Firstly, understanding the rate of development on an open source work is easier to infer by understanding who is allocated on a project and how much of their time they dedicate to it. If a team of three contributors spares a few hours a week, it will mean something different compared to a team of five engineers spread across different disciplines working full-time. Mapping the labor investment for open source projects supported by UNICEF would enable better planning by understanding the typical labor investment in open source workplan tasks as piloted by other Venture Fund portfolio companies.</p>
<p>Secondly, this gives us a new way of talking about the impact of UNICEF Venture Fund investments as an investment not only in software products but also in labor. It gives us insight into the investment of labor in software engineering talent among portfolio companies. How does this measurement change over time of the investment? Do projects receive more or less investment of labor during the 12 month period we work with them? This could also be used as a proxy metric for the impact of our unique mentorship and coaching opportunities.</p>

<h2 id="what-next">What next?&nbsp;<a class="hanchor" href="#what-next" aria-label="Anchor link for: What next?">🔗</a></h2>
<p>Knowing is half the journey. Even if the knowledge is not yet firmly rooted. The analysis and introspection are from me as an individual working among the UNICEF Venture Fund and do not represent the views and beliefs of UNICEF or the UN in any capacity. My intent is that by sharing this analysis in the open, it allows for a space where conversation can spark where it could not before. It also invites others to share ideas, feedback, and constructive criticism of an emerging metrics strategy for investments made into the open source ecosystem.</p>
<p>Next, more layers can be added and internal and external validation can help to keep this moving forward. An implementation plan would be the next step to follow this post. The implementation plan considers the process of how start-up companies move through the Venture Fund from start to finish. Who interacts with the companies and when? At what point is a company ready to begin building in a new metric or count in their monthly metrics? Do they understand the implications and assessments of these metrics? At what points in the process is data already being collected? Could these new data requests be added to existing requests? And so on.</p>
<p>I hope to formalize some of this new reporting and metrics strategy in upcoming cohorts in 2022, as part of a renewed effort into communicating how our open source investments tie into sustainable impact towards the U.N. Sustainable Development Goals.</p>
<p>This post will serve as a milestone marker on the metrics strategy discussion in the coming one to two months. See you in 2022.</p>
<hr>
<p><em>Featured photo by <a href="https://unsplash.com/@edwardhowellphotography?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText">Edward Howell</a> on <a href="https://unsplash.com/s/photos/sustainable?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText">Unsplash</a>. Modified by Justin Wheeler. CC BY-SA 4.0</em>.</p>]]></description></item><item><title>How five Queen songs went mainstream in totally different ways</title><link>https://jwheel.org/blog/2018/10/five-queen-songs-mainstream/</link><pubDate>Tue, 16 Oct 2018 00:00:00 +0000</pubDate><guid>https://jwheel.org/blog/2018/10/five-queen-songs-mainstream/</guid><description><![CDATA[<p><a href="http://blog.musicbrainz.org/2018/10/16/five-queen-songs-mainstream/"><em>Originally published on the MusicBrainz blog.</em></a></p>
<p><a href="http://blog.musicbrainz.org/2018/10/16/five-queen-songs-mainstream/"></a></p>
<hr>
<p>Making graphs is easy. Making intuitive, easy-to-understand graphs? It&rsquo;s harder than most people think. At the <a href="https://www.rit.edu/">Rochester Institute of Technology</a>, the ISTE-260 (Designing the User Experience) course teaches the language of design to IT students. For an introductory exercise in the class, students are tasked to visualize any set of data they desire. Students <a href="https://www.linkedin.com/in/davidkim18/">David Kim</a>, <a href="https://www.linkedin.com/in/jathan-a/">Jathan Anandham</a>, <a href="https://www.linkedin.com/in/jwwheel/">Justin Wheeler</a>, and <a href="https://www.linkedin.com/in/scott-tinker-216962129/">Scott Tinker</a> used the MusicBrainz database to look at how five different Queen songs went mainstream in different ways.</p>

<h2 id="five-factors-of-queen">Five factors of Queen&nbsp;<a class="hanchor" href="#five-factors-of-queen" aria-label="Anchor link for: Five factors of Queen">🔗</a></h2>
<p>Our mini data science experiment decided to look at five unique data points available to us via <a href="https://musicbrainz.org/doc/Work">MusicBrainz Works</a>:</p>
<ul>
<li>Number of recorded covers</li>
<li>Number of artists who covered a song</li>
<li>Release year</li>
<li>Year of last recorded cover</li>
<li>Time elapsed between release year and year of last recorded cover</li>
</ul>
<p>Originally, we looked at songs from different artists, but decided to look at five recordings from the same artist. With Queen being a notoriously famous band, there were several data points to work with in terms of how often a song was covered.</p>
<p>
<figure>
  <img src="/blog/2018/10/Queen-data-visualization.png" alt="How five Queen songs went mainstream in totally different ways" loading="lazy">
  <figcaption><strong><em>Studying five Queen songs</em></strong>: Another One Bites the Dust, Bohemian Rhapsody, Don’t Stop Me Now, Fat Bottomed Girls, We Will Rock You</figcaption>
</figure>
</p>

<h2 id="making-sense-of-the-data">Making sense of the data&nbsp;<a class="hanchor" href="#making-sense-of-the-data" aria-label="Anchor link for: Making sense of the data">🔗</a></h2>
<p>A few explanations are necessary for some of the data, especially the difference in number of covers and number of artists. <em>Don&rsquo;t Stop Me Now</em>, <em>Fat Bottomed Girls</em>, and <em>We Will Rock You</em> had the same number of recorded covers as number of artists who have covered the song. Why were <em>Another One Bites the Dust</em> and <em>Bohemian Rhapsody</em> different?</p>
<p>As it turns out, <em>Another One Bites the Dust</em> had more covers than the number of artists who have covered the song. This happens because some artists have covered the song twice (e.g. once on a studio release and another on a live recording release). On the other hand, Bohemian Rhapsody had more artists covering it than number of covers because some recordings featured multiple artists on the same cover (e.g. the 1992 live performance with Elton John and Axl Rose).</p>
<p>The data opens many interesting questions. Why have some songs persisted longer than others (in terms of recent covers)? Have these songs impacted culture and society in different ways? How have they permeated culture? Is there geographical bias in the data?</p>
<p>This exercise was an exploratory assignment, but we had fun visualizing it and ended up learning an interesting pattern in music data.</p>

<h2 id="check-out-the-presentation-and-paper">Check out the presentation and paper&nbsp;<a class="hanchor" href="#check-out-the-presentation-and-paper" aria-label="Anchor link for: Check out the presentation and paper">🔗</a></h2>
<p>If you&rsquo;re interested for the full details, the <a href="https://docs.google.com/presentation/d/1sMgwgo5dxi2n0j1elnlfZgk3OU5QnwsZfcrNwQYmmCQ/edit?usp=sharing">slides</a> and a <a href="https://docs.google.com/document/d/1sLXBK2uUCpBazBUe_EiRdHOGlKUcBMwP7CG1XX9-Aiw/edit?usp=sharing">short paper</a> about the presentation are available online. They provide deeper context for the research and the visualization details based on different design concepts.</p>
<p>You can see what else <a href="https://www.linkedin.com/in/davidkim18/">David Kim</a>, <a href="https://www.linkedin.com/in/jathan-a/">Jathan Anandham</a>, <a href="https://www.linkedin.com/in/jwwheel/">Justin Wheeler</a>, and <a href="https://www.linkedin.com/in/scott-tinker-216962129/">Scott Tinker</a> are up to on LinkedIn. Thanks for tuning in to this adventure into music data analysis, powered by MusicBrainz!</p>
<hr>
<p><em>Photo by <a href="https://unsplash.com/photos/QrqeusbpFMM?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText">Matthias Wagner</a> on <a href="https://unsplash.com/search/photos/microphone?utm_source=unsplash&amp;utm_medium=referral&amp;utm_content=creditCopyText">Unsplash</a>.</em></p>]]></description></item></channel></rss>