Browse Source

add new spreadsheet with correct header

julien colomb 3 years ago
parent
commit
c42f6d82ab
4 changed files with 128 additions and 123 deletions
  1. 118 118
      01_data/Use_cases.tsv
  2. BIN
      01_data/Use_cases.xlsx
  3. BIN
      01_data/Use_cases_old.xlsx
  4. 10 5
      02_code/summary_usecase.R

+ 118 - 118
01_data/Use_cases.tsv

@@ -1,118 +1,118 @@
-	IMPORTANT NOTES: Use cases should have the following form: As a [role] i want to [goal] so that [benefit]	This is a draft. For a more thorough documentation please refer to https://docs.google.com/document/d/1sq78eCFYgmWcMFYcbNonA2KrkUO1qdGr7d49tHuRcRM/edit?usp=sharing 			
-	Roles should be end users (researcher, student etc.), not intermediaries (software developer, repository manager etc.)				
-					
-ID	Use case	Actor	Cluster	Contributed by	Source
-21	As a researcher, I want to use autocomplete, so I don't get hung up on spelling mistakes. 	Researcher	Convenience	Brigitte Mathiak/GESIS	Observation study
-26	As a researcher I want to be notified when data that matches my interests becomes available, so that I get the newest data as soon as it becomes available (for example, by using RSS syndication like opensearch).	Researcher	Convenience		RDA IG Data Discovery
-41	As a researcher, I want to have a thumbnail preview of a dataset, so that I can quickly assess the relevance of data.	Researcher	Convenience		RDA IG Data Discovery
-46	As a researcher, I want to have 2D-visualisations of molecules, so that I can assess the relevance for my research purpose.	Researcher	Convenience		RDA IG Data Discovery
-48	As a researcher I want to export the list of results returned from a query, so that I can save, sort, share, examine the list later.	Researcher	Convenience		RDA IG Data Discovery
-62	As a researcher, I want to have a Save Search link/button to access searches that were saved before to build research data collections or to check for updates since last search.	Researcher	Convenience		RDA IG Data Discovery
-14	As a researcher, I want to find data (people and papers) outside of my field that can help me addressing research questions in my field, so that I can use these data from other disciplines to better tackle the research questions.	Researcher	Cross-domain	Jonathan Jeschke/IGB	Implementation Network
-16	As a researcher, I want to discover non-domain data, especially when it links in some way to data/topics I am already interested in.	Researcher	Cross-domain	Brigitte Mathiak/GESIS	Workshop
-17	As a researcher I want to find datasets that are related to or originated from certain research domains I'm interested in to filter out records from irrelevant domains.	Researcher	Cross-domain	Heinrich Widmann/DKRZ	Implementation Network
-20	As a researcher, I want to discover in one interface, regardless of the documentation standard used for the data.	Researcher	Cross-domain	Brigitte Mathiak/GESIS	Workshop
-36	As a researcher, I want to have a semantic layer that maps on my discipline terms, so that I can interpret the dataset even if it comes from another discipline.	Researcher	Cross-domain	Alessia Bardi/OpenAIRE	Workshop
-52	As a researcher, I want to have a tool that suggests me datasets based on topics (automatically identified) so that I can find them regardless where they are deposited.	Researcher	Cross-domain	Alessia Bardi/OpenAIRE	Workshop
-7	As a researcher, I want to find papers that use datasets, so I can find datasets that are used in the community.	Researcher	Data Citation	Brigitte Mathiak/GESIS	Observation study
-8	As a researcher, I want to find documents of any kind (slides, newspaper articles, statista) that use datasets, to get an idea of what exists.	Researcher	Data Citation	Brigitte Mathiak/GESIS	Observation study
-11	As a researcher, I want to find datasets based on data citations, so I can re-use them.	Researcher	Data Citation	Brigitte Mathiak/GESIS	Observation study
-38	As a researcher, I want to know known faults and errors in the dataset, so I can discuss them in the paper. 	Researcher	Data Citation	Brigitte Mathiak/GESIS	Observation study
-42	As a researcher, I want to be able to sort a search result by popularity/the number of downloads and accesses, so that I can quickly find the most re-used data.	Researcher	Data Citation		RDA IG Data Discovery
-44	As a researcher, I want to sort search results by citation, so that I can assess if the data is widely accepted by my research community.	Researcher	Data Citation		RDA IG Data Discovery
-51	As a researcher, I want to know what datasets have been cited along with the datasets I am currently using, so I can be sure I am not missing something relevant. 	Researcher	Data Citation	Brigitte Mathiak/GESIS	Implementation Network
-53	As a researcher, I want to find highly-cited datasets in my field. 	Researcher	Data Citation	Brigitte Mathiak/GESIS	Observation study
-56	As a researcher, I want to see in which publications the datasets I’m considering to work with have been used. 	Researcher	Data Citation	Nataliia Sokolovska/HIIG	Implementation Network
-57	As a researcher, I want to know which is the first publication citing a dataset/software, so that I can get to the primary publication that describes the process where the dataset/software was used/produced/analysed.	Researcher	Data Citation	Paolo Manghi/OpenAIRE	Workshop
-58	As a researcher, I want those IDs to be actually forever, so I can cite them in good faith. 	Researcher	Data Citation	Brigitte Mathiak/GESIS	Workshop
-64	As a researcher, I want to know the citation index of a dataset/number of downloads/in which papers it was published, so that I can assess its relevance. 	Researcher	Data Citation		RDA IG Data Discovery
-66	As a researcher I want to to use metadata to make my research known to people working in developing countries, so that they can make practical use of my research and so that my research is put into use.	Researcher	Discoverability		RDA IG Data Discovery
-28	As a researcher I want to have free text descriptions of experimental procedures, endpoints and analysis, so that for discovery I don't have to rely on the creator and the finder/user happening to use the same keywords/vocabulary.	Researcher	Documentation		RDA IG Data Discovery
-34	As a researcher, I want to know additional web resources on a dataset, like a project website, so that I can use the services offered there. 	Researcher	Documentation	Brigitte Mathiak/GESIS	Observation study
-37	As a researcher, I want to have instructions on how to interpret the data, so that I can re-use the data in a proper way, even across domains.	Researcher	Documentation	Alessia Bardi/OpenAIRE	Workshop
-39	As a researcher, I want human-readable documentation, so I can look at it.	Researcher	Documentation	Brigitte Mathiak/GESIS	Workshop
-60	As a researcher, I want to have a single-entry point for reference data so that I can select the appropriate reference dataset for my research.	Researcher	Documentation		RDA IG Data Discovery
-65	As a researcher, I want to know how the results of the search are obtained (where in the data, or metadata the search is done), so that I can fully understand the sample of datasets I have obtained, possible biases, omissions etc.	Researcher	Documentation		RDA IG Data Discovery
-45	As a researcher, I want datasets linked to a (journal) publication, so that I can examine methods and results in detail. 	Researcher	Documentation/ Metadata for Discovery		RDA IG Data Discovery
-59	As a researcher, I want to make an informed decision about the quality and usability of the found datasets  (trust in data provider, context and background of the data production and producers).	Researcher	Documentation/ Metadata for Quality assessment	Heinrich Widmann/DKRZ	Workshop
-35	As a researcher, I want a filter for time and geo coordinates for combining data.	Researcher	Linking	Brigitte Mathiak/GESIS	Workshop
-49	As a researcher, I want to find datasets that are similar to those that I used before, so that I can expand and compare my studies.	Researcher	Linking	Isabella Peters/ZBW	Implementation Network
-50	As a researcher, I want to have an overview of datasets that can be linked to the dataset I am currently using, so I can analyze them together. 	Researcher	Linking	Brigitte Mathiak/GESIS	Implementation Network
-47	As a researcher I want to find more data that correlates with the geolocations of my tortoise populations, so that I canput my research into perspective and identify possible collaborators.	Researcher	Linking/ Metadata for Discovery		RDA IG Data Discovery
-40	As a researcher, I want machine-readable documentation, so I can run algorithms on it.	Researcher	Machine discoverability	Brigitte Mathiak/GESIS	Workshop
-19	As a researcher, I want a mulitlingual search, so I don't miss anything.	Researcher	Metadata for Discovery	Brigitte Mathiak/GESIS	Workshop
-24	As a researcher, I want to know about the accessability of the data, so that I can skip unavailable datasets.	Researcher	Metadata for Discovery		RDA IG Data Discovery
-30	As a researcher, I want to filter search results by various fields, e.g. licences, amount of data points, format, to better be able to decide what search results are relevant or not.	Researcher	Metadata for Discovery	Isabella Peters/ZBW	Implementation Network
-43	As a researcher, I want to filter a search result by date, so that I can quickly see most recently published records.	Researcher	Metadata for Discovery		RDA IG Data Discovery
-54	As a researcher, I want to see data that have an experiment context/methodology similar to mine, so that I can do a comparison study.	Researcher	Metadata for Discovery		RDA IG Data Discovery
-61	As a researcher, I want search capabilities based on identifiers so that I can cross-reference results of runs against similar databases.	Researcher	Metadata for Discovery		RDA IG Data Discovery
-63	As a researcher, I want to know how/by which instrument/beamline etc. the data was produced, so that I can assess the relevance for my research.	Researcher	Metadata for Discovery		RDA IG Data Discovery
-22	As a researcher, I want to find data with a specific experimental design, because my research question is only answerable in this way, or I am interested in the technical details of the design.	Researcher	Metadata for Discovery	Brigitte Mathiak/GESIS	Observation study
-32	As a researcher. I want to know the provenance and use licences of datasets in order to re-use (process, analyse, visualize etc.) foreign data resources by giving credit to, cite and share your results with the original data producer.	Researcher	Metadata for Discovery		Implementation Network
-33	As a researcher, I want to know the organization that made the data, so I have estimate quality and trustworthiness. 	Researcher	Metadata for Quality assessment	Brigitte Mathiak/GESIS	Observation study
-55	As a researcher, I want to have information on the quality and relevance of the dataset, I am currently looking at, so I can get idea if it has the quality I am looking for. 	Researcher	Metadata for Quality assessment	Brigitte Mathiak/GESIS	Implementation Network
-23	As a researcher, I want to see what data is available right now so that I can make a forecast.	Researcher	Not a use case		RDA IG Data Discovery
-25	As a researcher, I want to track publications and citations of my colleagues, so that I can see what is already being done in my field of research.	Researcher	Not a use case		RDA IG Data Discovery
-27	As a researcher I want to to be able to discover what is out there, so that I can avoid duplication and maximise efficiency and access.	Researcher	Not a use case		RDA IG Data Discovery
-9	As a researcher, I want to get an overview of datasets for my field of interest so that I can determine which datasets already exist that I can reuse.	Researcher	Overview	Peter Kraker/OKMaps	Implementation Network
-10	As a researcher, I want to get an overview of datasets for my field of interest so that I can determine which TYPES OF datasets already exist that I can reuse so that I can decide whether to dive in deeper.	Researcher	Overview	Girija Goyal/ReFigure	Implementation Network
-15	As a researcher, I want to get an overview of available data for a given research question or hypothesis in my discipline.	Researcher	Overview	Jonathan Jeschke/IGB	Implementation Network
-18	As a researcher, I want to be supported in exploration of datasets so that I can find datasets I did not know they existed.	Researcher	Overview	Alessia Bardi/OpenAIRE	Workshop
-31	As a researcher, I want to receive information on available datasets in a structured and comprehensible way, so that it does not take me much time to get an overview.	Researcher	Overview	Tina Heger	Implementation Network
-12	As a researcher, I want to know the people who are generating datasets in my field to have another way of searching and also create community.	Researcher	Person	Girija Goyal/ReFigure	Implementation Network
-13	As a researcher, I want to look for people who have published on a specific dataset, so that I can contact them. 	Researcher	Person	Brigitte Mathiak/GESIS	Observation study
-29	As a researcher I want to to identify data within existing genomic datasets, so that I can answer specific research questions through new analysis.	Researcher	Search for specific data within the dataset		RDA IG Data Discovery
-68	As a genom biologist I want get information about the human genome	Researcher - discipline specific	Overview	Heinrich/EUDAT	EUDAT-Prace Summerschool 2019
-69	As a astronomer/climate researcher I'm intersted in artificial satellites - type (weather, communications, navigation, reconnaissance, astronomy, ...), - owner (private, public,..) - cost, energy source, age,...	Researcher - discipline specific	Search for specific data within the dataset	Heinrich/EUDAT	EUDAT-Prace Summerschool 2019
-70	As a climate researcher I'm looking for the 'historical run' of the CORDEX relies project. I'm especially interested in surface temperature	Researcher - discipline specific	Search for specific data within the dataset	Heinrich/EUDAT	DKRZ researcher
-71	As a astronomer/climate researcher I'm intersted in artificial satellites -  type (weather, communications, navigation, reconnaissance, astronomy, ...),  - owner (private, public,..) - cost, energy source, age,... - evaluate/analyse their performance	Researcher - discipline specific	Search for specific data within the dataset	Heinrich/EUDAT	DKRZ researcher
-119	As a developer, I want to make our repositories as standards-compliant as possible, so that they can be as visible as possible and we can make a lot of money.	Software/service developer	Discoverability		Workshop
-118	As a data-processing service, I want to be able to collect information about research datasets from across databases in the world, enrich this data . with information from other databases, so that I can offer value added services to satisfy my users. 	Software/service developer	Machine discoverability	Petr Knoth/CORE	Workshop
-86	As a student, I want to receive information on available datasets in a structured and comprehensible way, so that it does not take me much time to get an overview.	Student	Convenience	Tina Heger	Implementation Network
-89	As a student working on a research project (e.g. BSc or MSc thesis), I want to know known faults and errors in the dataset, so I can discuss them in my thesis. 	Student	Data Citation	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Observation study
-93	As a student working on a research project (e.g. BSc or MSc thesis), I want to know which datasets have been cited along with the datasets I am currently using, so I can be sure I am not missing something relevant. 	Student	Data Citation	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Implementation Network
-95	As a student working on a research project (e.g. BSc or MSc thesis), I want to find highly-cited datasets related to the topic of my thesis. 	Student	Data Citation	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Observation study
-96	As a student working on a research project (e.g. BSc or MSc thesis), I want to know which is the first publication citing a dataset/software, so that I can get to the primary publication that describes the process where the dataset/software was used/produced/analysed.	Student	Data Citation	Paolo Manghi/OpenAIRE, Jonathan Jeschke/IGB	Workshop
-90	As a student, I want human-readable documentation, so I can look at it.	Student	Documentation	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Workshop
-98	As a student, I want to have instructions on how to interpret the data, so that I can re-use the data in a proper way.	Student	Documentation	Alessia Bardi/OpenAIRE, Jonathan Jeschke/IGB	Workshop
-88	As a student working on a research project (e.g. BSc or MSc thesis), I want a filter for time and geo coordinates for combining data.	Student	Linking	Brigitte Mathiak/GESIS	Workshop
-92	As a student working on a research project (e.g. BSc or MSc thesis), I want to have an overview of datasets that can be linked to datasets I already found, so I can analyze them together. 	Student	Linking	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Implementation Network
-91	As a student, I want machine-readable documentation, so I can run algorithms on it.	Student	Machine discoverability	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Workshop
-87	As a student, I want to filter search results by various fields, e.g. licences, amount of data points, format, to be better able to decide what search results are relevant or not.	Student	Metadata for Discovery	Isabella Peters/ZBW, Jonathan Jeschke/IGB	Implementation Network
-97	As a student working on a research project (e.g. BSc or MSc thesis), I want to make an informed decision about the quality and usability of the found datasets (trust in data provider, context and background of the data production and producers).	Student	Metadata for Quality assessment	Heinrich Widmann/DKRZ, Jonathan Jeschke/IGB	Workshop
-99	As a student, I want to know the organization that made the data, so I can estimate data quality and trustworthiness. 	Student	Metadata for Quality assessment	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Observation study
-101	As a student, I want to know whether datasets have undergone peer review or whether they belong to papers that have been peer-reviewed, so that I can be sure that the data has high scientific quality.	Student	Metadata for Quality assessment	Isabella Peters/ZBW	Implementation Network
-83	As an undergraduate student, I want to be able to find data around discrete research questions to do analysis and findings for short-term research projects.	Student	Overview	Girija Goyal/ReFigure	Implementation Network
-84	As a student working on a research project (e.g. BSc or MSc thesis), I want to be able to find data related to a given research question or hypothesis, so that I can collect and analyze these data.	Student	Overview	Jonathan Jeschke/IGB	Implementation Network
-94	As a student working on a research project (e.g. BSc or MSc thesis), I want to have a tool that suggests me datasets based on topics (automatically identified) so that I can find them regardless where they are deposited.	Student	Overview	Alessia Bardi/OpenAIRE, Jonathan Jeschke/IGB	Workshop
-85	As a student interested in a given research topic, I want to find researchers working on this topic (e.g. to contact them as potential supervisors).	Student	Person	Jonathan Jeschke/IGB	Implementation Network
-100	As a student, I want to find other students and/or researchers working on a similar research question as I do, so that I can contact them and exchange thoughts, e.g. about issues related to the quality and analysis of datasets.	Student	Person	Jonathan Jeschke/IGB	Implementation Network
-77	As a ecological citizen I'm planning to install solar panels on my roof           	Citizen	Not a use case	Heinrich/EUDAT	DKRZ researcher
-75	As a citizen I want to know more about  plastic in the oceans to get information about estimate plastic degradation, prominence of plastic pollution in the environment,... 	Citizen	Overview	Heinrich/EUDAT	DKRZ researcher
-76	As a citizen I want to get an overview of datasets about Green roofs in order to evaluate their performance, support energy decision makers	Citizen	Overview	Heinrich/EUDAT	DKRZ researcher
-79	As a astronomer/climate researcher I'm intersted in artificial satellites -  type (weather, communications, navigation, reconnaissance, astronomy, ...),  - owner (private, public,..) - cost, energy source, age,... - evaluate/analyse their performance	Citizen	Overview	Heinrich/EUDAT	DKRZ researcher
-80	As a craftwoman I'm interested in alloys to use the best material for my installations  - physical properties, production costs, - availability, market price, ... - support engineers designs, mining decision makers, 	Citizen	Overview	Heinrich/EUDAT	DKRZ researcher
-78	As a citizen or a biodiversity researcher I want to get an overview over extincted species. - Today, after/before Columbus, or before/after Industrial Era, ... - understand anthropogenic impacts on species 	Citizen	Search for specific data within the dataset	Heinrich/EUDAT	DKRZ researcher
-113	As a reseacher, I want to spend as little time as possible creating extra metadata, especially if the information was entered somewhere else (for example author information already available in the publication linked to the data).	Data producer	Convenience	Julien Colomb	Implementation Network
-109	As a researcher, I want to know whether my shared datasets have been reused so that I know what impact I have.	Data producer	Discoverability	Isabella Peters/ZBW	Implementation Network
-111	As a researcher, I want to have metrics on the reuse of my datasets automatically sent to my CV (via Orcid?) so that I can show the importance of my work for the community (and get funding).	Data producer	Discoverability	Julien Colomb	Implementation Network
-112	As a researcher, I want to see my top 5 highly cited datasets, so I can put this information into my grant application.	Data producer	Discoverability	RDA IG Data Discovery	
-114	As a researcher, I may want to preregister which data I plan to collect, so that others know what I'm working on and double work is avoided.	Data producer	Discoverability	Jonathan Jeschke/IGB	Implementation Network
-115	As a researcher, I want to easily connect my data to important research questions and/or hypotheses in my discipline, so that others interested in these questions or hypotheses will discover my data und possibly reuse them.	Data producer	Discoverability	Jonathan Jeschke/IGB	Implementation Network
-108	As a researcher, I want to know where I can put my data, insights and annotations for maximum discoverability.	Data producer	Discoverability	Girija Goyal/ReFigure	Implementation Network
-110	As a researcher, I want to demonstrate the value of sharing data, to inspire my colleagues to share their data.	Data producer	Not a use case	Brigitte Mathiak/GESIS	Implementation Network
-104	As a funder, I want to identify datasets created in projects I am funding and ensure that all projects I am funding are following relevant data sharing policies.	Funder	Metadata for Discovery	Petr Knoth/CORE	Implementation Network
-73	As an investor I want to invest in wind farms and evaluate  their performance, support energy 	Investor - discipline specific	Overview	Heinrich/EUDAT	DKRZ researcher
-					
-67					
-72					
-74					
-81					
-82					
-102					
-103					
-105					
-106					
-107					
-116					
-117					
+	IMPORTANT NOTES: Use cases should have the following form: As a [role] i want to [goal] so that [benefit]	This is a draft. For a more thorough documentation please refer to https://docs.google.com/document/d/1sq78eCFYgmWcMFYcbNonA2KrkUO1qdGr7d49tHuRcRM/edit?usp=sharing 				
+	Roles should be end users (researcher, student etc.), not intermediaries (software developer, repository manager etc.)					
+						
+ID	Use case	Actor	Cluster	Contributed by	Source	Closely_related_to
+21	As a researcher, I want to use autocomplete, so I don't get hung up on spelling mistakes. 	Researcher	Convenience	Brigitte Mathiak/GESIS	Observation study	
+26	As a researcher I want to be notified when data that matches my interests becomes available, so that I get the newest data as soon as it becomes available (for example, by using RSS syndication like opensearch).	Researcher	Convenience		RDA IG Data Discovery	
+41	As a researcher, I want to have a thumbnail preview of a dataset, so that I can quickly assess the relevance of data.	Researcher	Convenience		RDA IG Data Discovery	
+46	As a researcher, I want to have 2D-visualisations of molecules, so that I can assess the relevance for my research purpose.	Researcher	Convenience		RDA IG Data Discovery	
+48	As a researcher I want to export the list of results returned from a query, so that I can save, sort, share, examine the list later.	Researcher	Convenience		RDA IG Data Discovery	
+62	As a researcher, I want to have a Save Search link/button to access searches that were saved before to build research data collections or to check for updates since last search.	Researcher	Convenience		RDA IG Data Discovery	
+14	As a researcher, I want to find data (people and papers) outside of my field that can help me addressing research questions in my field, so that I can use these data from other disciplines to better tackle the research questions.	Researcher	Cross-domain	Jonathan Jeschke/IGB	Implementation Network	
+16	As a researcher, I want to discover non-domain data, especially when it links in some way to data/topics I am already interested in.	Researcher	Cross-domain	Brigitte Mathiak/GESIS	Workshop	
+17	As a researcher I want to find datasets that are related to or originated from certain research domains I'm interested in to filter out records from irrelevant domains.	Researcher	Cross-domain	Heinrich Widmann/DKRZ	Implementation Network	
+20	As a researcher, I want to discover in one interface, regardless of the documentation standard used for the data.	Researcher	Cross-domain	Brigitte Mathiak/GESIS	Workshop	
+36	As a researcher, I want to have a semantic layer that maps on my discipline terms, so that I can interpret the dataset even if it comes from another discipline.	Researcher	Cross-domain	Alessia Bardi/OpenAIRE	Workshop	
+52	As a researcher, I want to have a tool that suggests me datasets based on topics (automatically identified) so that I can find them regardless where they are deposited.	Researcher	Cross-domain	Alessia Bardi/OpenAIRE	Workshop	
+7	As a researcher, I want to find papers that use datasets, so I can find datasets that are used in the community.	Researcher	Data Citation	Brigitte Mathiak/GESIS	Observation study	
+8	As a researcher, I want to find documents of any kind (slides, newspaper articles, statista) that use datasets, to get an idea of what exists.	Researcher	Data Citation	Brigitte Mathiak/GESIS	Observation study	
+11	As a researcher, I want to find datasets based on data citations, so I can re-use them.	Researcher	Data Citation	Brigitte Mathiak/GESIS	Observation study	
+38	As a researcher, I want to know known faults and errors in the dataset, so I can discuss them in the paper. 	Researcher	Data Citation	Brigitte Mathiak/GESIS	Observation study	89
+42	As a researcher, I want to be able to sort a search result by popularity/the number of downloads and accesses, so that I can quickly find the most re-used data.	Researcher	Data Citation		RDA IG Data Discovery	
+44	As a researcher, I want to sort search results by citation, so that I can assess if the data is widely accepted by my research community.	Researcher	Data Citation		RDA IG Data Discovery	
+51	As a researcher, I want to know what datasets have been cited along with the datasets I am currently using, so I can be sure I am not missing something relevant. 	Researcher	Data Citation	Brigitte Mathiak/GESIS	Implementation Network	
+53	As a researcher, I want to find highly-cited datasets in my field. 	Researcher	Data Citation	Brigitte Mathiak/GESIS	Observation study	
+56	As a researcher, I want to see in which publications the datasets I’m considering to work with have been used. 	Researcher	Data Citation	Nataliia Sokolovska/HIIG	Implementation Network	
+57	As a researcher, I want to know which is the first publication citing a dataset/software, so that I can get to the primary publication that describes the process where the dataset/software was used/produced/analysed.	Researcher	Data Citation	Paolo Manghi/OpenAIRE	Workshop	96
+58	As a researcher, I want those IDs to be actually forever, so I can cite them in good faith. 	Researcher	Data Citation	Brigitte Mathiak/GESIS	Workshop	
+64	As a researcher, I want to know the citation index of a dataset/number of downloads/in which papers it was published, so that I can assess its relevance. 	Researcher	Data Citation		RDA IG Data Discovery	
+66	As a researcher I want to to use metadata to make my research known to people working in developing countries, so that they can make practical use of my research and so that my research is put into use.	Researcher	Discoverability		RDA IG Data Discovery	
+28	As a researcher I want to have free text descriptions of experimental procedures, endpoints and analysis, so that for discovery I don't have to rely on the creator and the finder/user happening to use the same keywords/vocabulary.	Researcher	Documentation		RDA IG Data Discovery	
+34	As a researcher, I want to know additional web resources on a dataset, like a project website, so that I can use the services offered there. 	Researcher	Documentation	Brigitte Mathiak/GESIS	Observation study	
+37	As a researcher, I want to have instructions on how to interpret the data, so that I can re-use the data in a proper way, even across domains.	Researcher	Documentation	Alessia Bardi/OpenAIRE	Workshop	
+39	As a researcher, I want human-readable documentation, so I can look at it.	Researcher	Documentation	Brigitte Mathiak/GESIS	Workshop	90
+60	As a researcher, I want to have a single-entry point for reference data so that I can select the appropriate reference dataset for my research.	Researcher	Documentation		RDA IG Data Discovery	
+65	As a researcher, I want to know how the results of the search are obtained (where in the data, or metadata the search is done), so that I can fully understand the sample of datasets I have obtained, possible biases, omissions etc.	Researcher	Documentation		RDA IG Data Discovery	
+45	As a researcher, I want datasets linked to a (journal) publication, so that I can examine methods and results in detail. 	Researcher	Documentation/ Metadata for Discovery		RDA IG Data Discovery	
+59	As a researcher, I want to make an informed decision about the quality and usability of the found datasets  (trust in data provider, context and background of the data production and producers).	Researcher	Documentation/ Metadata for Quality assessment	Heinrich Widmann/DKRZ	Workshop	
+35	As a researcher, I want a filter for time and geo coordinates for combining data.	Researcher	Linking	Brigitte Mathiak/GESIS	Workshop	
+49	As a researcher, I want to find datasets that are similar to those that I used before, so that I can expand and compare my studies.	Researcher	Linking	Isabella Peters/ZBW	Implementation Network	
+50	As a researcher, I want to have an overview of datasets that can be linked to the dataset I am currently using, so I can analyze them together. 	Researcher	Linking	Brigitte Mathiak/GESIS	Implementation Network	
+47	As a researcher I want to find more data that correlates with the geolocations of my tortoise populations, so that I canput my research into perspective and identify possible collaborators.	Researcher	Linking/ Metadata for Discovery		RDA IG Data Discovery	
+40	As a researcher, I want machine-readable documentation, so I can run algorithms on it.	Researcher	Machine discoverability	Brigitte Mathiak/GESIS	Workshop	91
+19	As a researcher, I want a mulitlingual search, so I don't miss anything.	Researcher	Metadata for Discovery	Brigitte Mathiak/GESIS	Workshop	
+24	As a researcher, I want to know about the accessability of the data, so that I can skip unavailable datasets.	Researcher	Metadata for Discovery		RDA IG Data Discovery	
+30	As a researcher, I want to filter search results by various fields, e.g. licences, amount of data points, format, to better be able to decide what search results are relevant or not.	Researcher	Metadata for Discovery	Isabella Peters/ZBW	Implementation Network	87
+43	As a researcher, I want to filter a search result by date, so that I can quickly see most recently published records.	Researcher	Metadata for Discovery		RDA IG Data Discovery	
+54	As a researcher, I want to see data that have an experiment context/methodology similar to mine, so that I can do a comparison study.	Researcher	Metadata for Discovery		RDA IG Data Discovery	
+61	As a researcher, I want search capabilities based on identifiers so that I can cross-reference results of runs against similar databases.	Researcher	Metadata for Discovery		RDA IG Data Discovery	
+63	As a researcher, I want to know how/by which instrument/beamline etc. the data was produced, so that I can assess the relevance for my research.	Researcher	Metadata for Discovery		RDA IG Data Discovery	
+22	As a researcher, I want to find data with a specific experimental design, because my research question is only answerable in this way, or I am interested in the technical details of the design.	Researcher	Metadata for Discovery	Brigitte Mathiak/GESIS	Observation study	
+32	As a researcher. I want to know the provenance and use licences of datasets in order to re-use (process, analyse, visualize etc.) foreign data resources by giving credit to, cite and share your results with the original data producer.	Researcher	Metadata for Discovery		Implementation Network	
+33	As a researcher, I want to know the organization that made the data, so I have estimate quality and trustworthiness. 	Researcher	Metadata for Quality assessment	Brigitte Mathiak/GESIS	Observation study	
+55	As a researcher, I want to have information on the quality and relevance of the dataset, I am currently looking at, so I can get idea if it has the quality I am looking for. 	Researcher	Metadata for Quality assessment	Brigitte Mathiak/GESIS	Implementation Network	
+23	As a researcher, I want to see what data is available right now so that I can make a forecast.	Researcher	Not a use case		RDA IG Data Discovery	
+25	As a researcher, I want to track publications and citations of my colleagues, so that I can see what is already being done in my field of research.	Researcher	Not a use case		RDA IG Data Discovery	
+27	As a researcher I want to to be able to discover what is out there, so that I can avoid duplication and maximise efficiency and access.	Researcher	Not a use case		RDA IG Data Discovery	
+9	As a researcher, I want to get an overview of datasets for my field of interest so that I can determine which datasets already exist that I can reuse.	Researcher	Overview	Peter Kraker/OKMaps	Implementation Network	
+10	As a researcher, I want to get an overview of datasets for my field of interest so that I can determine which TYPES OF datasets already exist that I can reuse so that I can decide whether to dive in deeper.	Researcher	Overview	Girija Goyal/ReFigure	Implementation Network	
+15	As a researcher, I want to get an overview of available data for a given research question or hypothesis in my discipline.	Researcher	Overview	Jonathan Jeschke/IGB	Implementation Network	
+18	As a researcher, I want to be supported in exploration of datasets so that I can find datasets I did not know they existed.	Researcher	Overview	Alessia Bardi/OpenAIRE	Workshop	
+31	As a researcher, I want to receive information on available datasets in a structured and comprehensible way, so that it does not take me much time to get an overview.	Researcher	Overview	Tina Heger	Implementation Network	
+12	As a researcher, I want to know the people who are generating datasets in my field to have another way of searching and also create community.	Researcher	Person	Girija Goyal/ReFigure	Implementation Network	
+13	As a researcher, I want to look for people who have published on a specific dataset, so that I can contact them. 	Researcher	Person	Brigitte Mathiak/GESIS	Observation study	
+29	As a researcher I want to to identify data within existing genomic datasets, so that I can answer specific research questions through new analysis.	Researcher	Search for specific data within the dataset		RDA IG Data Discovery	
+68	As a genom biologist I want get information about the human genome	Researcher - discipline specific	Overview	Heinrich/EUDAT	EUDAT-Prace Summerschool 2019	
+69	As a astronomer/climate researcher I'm intersted in artificial satellites - type (weather, communications, navigation, reconnaissance, astronomy, ...), - owner (private, public,..) - cost, energy source, age,...	Researcher - discipline specific	Search for specific data within the dataset	Heinrich/EUDAT	EUDAT-Prace Summerschool 2019	
+70	As a climate researcher I'm looking for the 'historical run' of the CORDEX relies project. I'm especially interested in surface temperature	Researcher - discipline specific	Search for specific data within the dataset	Heinrich/EUDAT	DKRZ researcher	
+71	As a astronomer/climate researcher I'm intersted in artificial satellites -  type (weather, communications, navigation, reconnaissance, astronomy, ...),  - owner (private, public,..) - cost, energy source, age,... - evaluate/analyse their performance	Researcher - discipline specific	Search for specific data within the dataset	Heinrich/EUDAT	DKRZ researcher	
+119	As a developer, I want to make our repositories as standards-compliant as possible, so that they can be as visible as possible and we can make a lot of money.	Software/service developer	Discoverability		Workshop	
+118	As a data-processing service, I want to be able to collect information about research datasets from across databases in the world, enrich this data . with information from other databases, so that I can offer value added services to satisfy my users. 	Software/service developer	Machine discoverability	Petr Knoth/CORE	Workshop	
+86	As a student, I want to receive information on available datasets in a structured and comprehensible way, so that it does not take me much time to get an overview.	Student	Convenience	Tina Heger	Implementation Network	
+89	As a student working on a research project (e.g. BSc or MSc thesis), I want to know known faults and errors in the dataset, so I can discuss them in my thesis. 	Student	Data Citation	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Observation study	
+93	As a student working on a research project (e.g. BSc or MSc thesis), I want to know which datasets have been cited along with the datasets I am currently using, so I can be sure I am not missing something relevant. 	Student	Data Citation	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Implementation Network	
+95	As a student working on a research project (e.g. BSc or MSc thesis), I want to find highly-cited datasets related to the topic of my thesis. 	Student	Data Citation	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Observation study	
+96	As a student working on a research project (e.g. BSc or MSc thesis), I want to know which is the first publication citing a dataset/software, so that I can get to the primary publication that describes the process where the dataset/software was used/produced/analysed.	Student	Data Citation	Paolo Manghi/OpenAIRE, Jonathan Jeschke/IGB	Workshop	
+90	As a student, I want human-readable documentation, so I can look at it.	Student	Documentation	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Workshop	
+98	As a student, I want to have instructions on how to interpret the data, so that I can re-use the data in a proper way.	Student	Documentation	Alessia Bardi/OpenAIRE, Jonathan Jeschke/IGB	Workshop	
+88	As a student working on a research project (e.g. BSc or MSc thesis), I want a filter for time and geo coordinates for combining data.	Student	Linking	Brigitte Mathiak/GESIS	Workshop	35
+92	As a student working on a research project (e.g. BSc or MSc thesis), I want to have an overview of datasets that can be linked to datasets I already found, so I can analyze them together. 	Student	Linking	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Implementation Network	
+91	As a student, I want machine-readable documentation, so I can run algorithms on it.	Student	Machine discoverability	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Workshop	
+87	As a student, I want to filter search results by various fields, e.g. licences, amount of data points, format, to be better able to decide what search results are relevant or not.	Student	Metadata for Discovery	Isabella Peters/ZBW, Jonathan Jeschke/IGB	Implementation Network	
+97	As a student working on a research project (e.g. BSc or MSc thesis), I want to make an informed decision about the quality and usability of the found datasets (trust in data provider, context and background of the data production and producers).	Student	Metadata for Quality assessment	Heinrich Widmann/DKRZ, Jonathan Jeschke/IGB	Workshop	
+99	As a student, I want to know the organization that made the data, so I can estimate data quality and trustworthiness. 	Student	Metadata for Quality assessment	Brigitte Mathiak/GESIS, Jonathan Jeschke/IGB	Observation study	
+101	As a student, I want to know whether datasets have undergone peer review or whether they belong to papers that have been peer-reviewed, so that I can be sure that the data has high scientific quality.	Student	Metadata for Quality assessment	Isabella Peters/ZBW	Implementation Network	
+83	As an undergraduate student, I want to be able to find data around discrete research questions to do analysis and findings for short-term research projects.	Student	Overview	Girija Goyal/ReFigure	Implementation Network	
+84	As a student working on a research project (e.g. BSc or MSc thesis), I want to be able to find data related to a given research question or hypothesis, so that I can collect and analyze these data.	Student	Overview	Jonathan Jeschke/IGB	Implementation Network	
+94	As a student working on a research project (e.g. BSc or MSc thesis), I want to have a tool that suggests me datasets based on topics (automatically identified) so that I can find them regardless where they are deposited.	Student	Overview	Alessia Bardi/OpenAIRE, Jonathan Jeschke/IGB	Workshop	
+85	As a student interested in a given research topic, I want to find researchers working on this topic (e.g. to contact them as potential supervisors).	Student	Person	Jonathan Jeschke/IGB	Implementation Network	
+100	As a student, I want to find other students and/or researchers working on a similar research question as I do, so that I can contact them and exchange thoughts, e.g. about issues related to the quality and analysis of datasets.	Student	Person	Jonathan Jeschke/IGB	Implementation Network	
+77	As a ecological citizen I'm planning to install solar panels on my roof           	Citizen	Not a use case	Heinrich/EUDAT	DKRZ researcher	
+75	As a citizen I want to know more about  plastic in the oceans to get information about estimate plastic degradation, prominence of plastic pollution in the environment,... 	Citizen	Overview	Heinrich/EUDAT	DKRZ researcher	
+76	As a citizen I want to get an overview of datasets about Green roofs in order to evaluate their performance, support energy decision makers	Citizen	Overview	Heinrich/EUDAT	DKRZ researcher	
+79	As a astronomer/climate researcher I'm intersted in artificial satellites -  type (weather, communications, navigation, reconnaissance, astronomy, ...),  - owner (private, public,..) - cost, energy source, age,... - evaluate/analyse their performance	Citizen	Overview	Heinrich/EUDAT	DKRZ researcher	
+80	As a craftwoman I'm interested in alloys to use the best material for my installations  - physical properties, production costs, - availability, market price, ... - support engineers designs, mining decision makers, 	Citizen	Overview	Heinrich/EUDAT	DKRZ researcher	
+78	As a citizen or a biodiversity researcher I want to get an overview over extincted species. - Today, after/before Columbus, or before/after Industrial Era, ... - understand anthropogenic impacts on species 	Citizen	Search for specific data within the dataset	Heinrich/EUDAT	DKRZ researcher	
+113	As a reseacher, I want to spend as little time as possible creating extra metadata, especially if the information was entered somewhere else (for example author information already available in the publication linked to the data).	Data producer	Convenience	Julien Colomb	Implementation Network	
+109	As a researcher, I want to know whether my shared datasets have been reused so that I know what impact I have.	Data producer	Discoverability	Isabella Peters/ZBW	Implementation Network	
+111	As a researcher, I want to have metrics on the reuse of my datasets automatically sent to my CV (via Orcid?) so that I can show the importance of my work for the community (and get funding).	Data producer	Discoverability	Julien Colomb	Implementation Network	
+112	As a researcher, I want to see my top 5 highly cited datasets, so I can put this information into my grant application.	Data producer	Discoverability	RDA IG Data Discovery		
+114	As a researcher, I may want to preregister which data I plan to collect, so that others know what I'm working on and double work is avoided.	Data producer	Discoverability	Jonathan Jeschke/IGB	Implementation Network	
+115	As a researcher, I want to easily connect my data to important research questions and/or hypotheses in my discipline, so that others interested in these questions or hypotheses will discover my data und possibly reuse them.	Data producer	Discoverability	Jonathan Jeschke/IGB	Implementation Network	
+108	As a researcher, I want to know where I can put my data, insights and annotations for maximum discoverability.	Data producer	Discoverability	Girija Goyal/ReFigure	Implementation Network	
+110	As a researcher, I want to demonstrate the value of sharing data, to inspire my colleagues to share their data.	Data producer	Not a use case	Brigitte Mathiak/GESIS	Implementation Network	
+104	As a funder, I want to identify datasets created in projects I am funding and ensure that all projects I am funding are following relevant data sharing policies.	Funder	Metadata for Discovery	Petr Knoth/CORE	Implementation Network	
+73	As an investor I want to invest in wind farms and evaluate  their performance, support energy 	Investor - discipline specific	Overview	Heinrich/EUDAT	DKRZ researcher	
+						
+67						
+72						
+74						
+81						
+82						
+102						
+103						
+105						
+106						
+107						
+116						
+117						

BIN
01_data/Use_cases.xlsx


BIN
01_data/Use_cases_old.xlsx


+ 10 - 5
02_code/summary_usecase.R

@@ -1,14 +1,19 @@
-library(VennDiagram)
+#library(VennDiagram)
+library(readr)
 library(dplyr)
 library(ggplot2)
 library(forcats)
 
 # load data, only 101 rows.
 
-usecases <- read_delim("data/[DRAFT] Stocktaking GO FAIR Discovery IN - Use cases, infrastructure - Use cases.tsv",
+usecases <- read_delim("01_data/Use_cases.tsv",
 "\t", escape_double = FALSE, trim_ws = TRUE,
 skip = 3)[1:101,]
 
+expanded =left_join(usecases,usecases, by = c("Closely_related_to"= "ID"))
+
+write.csv(expanded , file = "doubles.csv")
+
 ## changing cluster, duplicating entry with multiple cluster
 usecases2=usecases
 usecases3=usecases [0,]
@@ -119,19 +124,19 @@ F1=usecases2 %>%
                   , ")")
   ) +
   ylab("Amount of use cases")+
-  xlab("Cluster")
+  xlab("Cluster")+theme_minimal(base_size = 17)
 
 
 F2=df%>%
   mutate(cluster = factor(cluster, levels= ord_clust))%>%
   mutate_if(is.numeric,coalesce,0) %>%
            ggplot( aes(cluster,`number of answers`)) + 
-           geom_point(aes(colour = priority))+
+           geom_point(aes(colour = priority), size = 3)+
            coord_flip()+
   labs(
              title= paste0("Prioritisation score")
            ) +
-   expand_limits(y = 0)
+   expand_limits(y = 0) +theme_minimal(base_size = 17)
          
 Ftot=gridExtra::grid.arrange(F1, F2+ theme (axis.title.y=element_blank(),
                                        axis.text.y=element_blank()),