Movatterモバイル変換


[0]ホーム

URL:


Kristin Briney, profile picture
Uploaded byKristin Briney
PPTX, PDF1,002 views

Data Management 101 (2015)

The document discusses the importance of data management to prevent data loss and ensure easy access, analysis, and documentation of data. It emphasizes best practices such as the 3-2-1 backup rule, detailed documentation for reproducibility, effective file organization and naming conventions, and data security measures. Additionally, it highlights the need for future file usability through appropriate file formats and technology updates.

Embed presentation

Downloaded 22 times
Do You Still Have Your Data?• What if your hard drive crashes?• What if you are accused of fraud?• What if your collaborator abruptly quits?• What if the building burns down?• What if you need to use your old data?• What if your backup fails?• What if your computer gets stolen?• What if…
Data Management 10125 September 2015Kristin Briney, PhD
Why Data Management?• Don’t lose data
Why Data Management?• Don’t lose data• Find data more easily– Especially if you need older data
Why Data Management?• Don’t lose data• Find data more easily– Especially if you need older data• Easier to analyze organized, documented data
Why Data Management?• Don’t lose data• Find data more easily– Especially if you need older data• Easier to analyze organized, documented data• Avoid accusations of fraud & misconduct
Why Data Management?• Don’t lose data• Find data more easily– Especially if you need older data• Easier to analyze organized, documented data• Avoid accusations of fraud & misconduct• Get credit for your data
Why Data Management?• Don’t lose data• Find data more easily– Especially if you need older data• Easier to analyze organized, documented data• Avoid accusations of fraud & misconduct• Get credit for your data• Don’t drown in irrelevant data
Data Management Basics• Introduction to a few topics in datamanagement– Storage and backups– Documentation– File organization and naming– Future file usability
For each minute of planning atbeginning of a project, you will save10 minutes of headache later
grover_net, http://www.flickr.com/photos/9246159@N06/599820538/ (CC BY-ND)STORAGE AND BACKUPS
http://www.theonion.com/article/heroic-computer-dies-to-save-world-from-masters-th-1963
Follow the 3-2-1 Rule3 copies of your dataIn 2 different locationsOn more than 1 type of storage hardware321
Storage• How?– Computer– External hard drive– Shared drives/servers– Tape backup– Cloud storage*– CDs/DVDs– USB flash driveErica Wheelan, https://www.flickr.com/photos/reinventedwheel/5985479866 (CC BY)
*Cloud Storage• Read the Terms of Service!• Eg. Google Drive– “When you upload or otherwise submit content to our Services,you give Google (and those we work with) a worldwide licenseto use, host, store, reproduce, modify, create derivative works(such as those resulting from translations, adaptations or otherchanges we make so that your content works better with ourServices), communicate, publish, publicly perform, publiclydisplay and distribute such content. The rights you grant in thislicense are for the limited purpose of operating, promoting, andimproving our Services, and to develop new ones”
Backups• How?– Any backup is better than none– Automatic backup is better than manual– Your work is only as safe as your backup plan
Backups• How?– Check your backups• Backups only as good as ability to recover data• Test your backups periodically– Preferably a fixed schedule– 1 or 2 times a year may be enough– Bigger/more complex backups should be checked more often• Test your backup whenever you change things
Example• I keep my data– On my computer– Backed up manually on shared drive• I set a weekly reminder to do this– Backed up automatically via SpiderOak cloudstorage
DOCUMENTATIONBrady, https://www.flickr.com/photos/freddyfromutah/4424199420 (CC BY)
http://retractionwatch.com/2015/07/23/data-mismatch-and-authors-illness-pluck-finch-study-from-literature/
What would someone unfamiliarwith your data need in order to find,evaluate, understand, and reusethem?
Documentation• Why?– Data without notes are unusable– Because you won’t remember everything– For others who may need to use your files
Documentation• How?– Take good notes• Capture as much detail as possible• Your coworkers should be able to understand
Documentation• How?– Keep methods• Protocols• Code• Survey• Codebook• Data dictionary• Anything that lets someone reproduce your results
Documentation• How?– README.txt• For digital information, address the questions– “What the heck am I looking at?”– “Where do I find X?”• Use for project description in main folder• Use to document conventions• Use where ever you need extra clarity
Example• Project-wide README.txt– Basic project information• Title• Contributors• Grant info• etc.– Contact information for at least one person– All locations where data live, including backups
FILE ORGANIZATION & NAMINGDan Zen, http://www.flickr.com/photos/danzen/5551831155/ (CC BY)
https://twitter.com/CMBuddle/status/638800933598679040
https://twitter.com/CMBuddle/status/638802547365556224
https://twitter.com/CMBuddle/status/638808820874133504
File Organization• Why?– Easier to find and use data– Tell, at a glance, what is done and what you haveyet to do– Can still find and use files in the future
File Organization• How?– Pick a system• Maybe work out a system with your coworkers– Get in the habit
File Organization• How?– Any system is better than none– Make your system logical for your data• 80/20 Rule– Possibilities• By project• By analysis type• By date• …
Example• Thesis– By chapter• By file type (draft, figure, table, etc.)• Data– By researcher• By analysis type– By date
http://retractionwatch.com/2014/01/07/doing-the-right-thing-authors-retract-brain-paper-with-systematic-human-error-in-coding/
File Naming Conventions• Why?– Make it easier to find files– Avoid duplicates– Make it easier to wrap up a project because youknow which files belong to it
File Naming Conventions• How?– Pick what is most important for your name• Date• Site• Analysis• Sample• Short description
File Naming Conventions• How?– Files should be named consistently– Files names should be descriptive but short (<25characters)– Use underscores instead of spaces– Avoid these characters: “ /  : * ? ‘ < > [ ] & $– Use the dating convention: YYYY-MM-DD– Document your system!
Example• YYYYMMDD_site_sampleNum– 20140422_PikeLake_03– 20140424_EastLake_12• Analysis-sample-concentration– UVVis-stilbene-10mM– IR-benzene-pure
DATA SECURITYhttps://www.flickr.com/photos/bilal-kamoon/6958578902/ (CC BY)
https://chronicle.com/article/UNC-Chapel-Hill-Researcher/124821/
Know Your Data Security Plan• HIPAA, FERPA, FISMA, IRB, etc.• If you have sensitive data, know the plan– Who has access?– What are the procedures?– Who’s responsible?• Ask for help!
FUTURE FILE USABILITYIan, http://www.flickr.com/photos/ian-s/2152798588/ (CC BY-NC-ND)
http://retractionwatch.com/2013/07/19/jci-paper-retracted-for-duplicated-panels-after-authors-cant-provide-original-data/
Data Retention• 3 years required by government• Better to do 5-10 years
lukasbenc, https://www.flickr.com/photos/lukasbenc/3493808772 (CC BY-NC-SA)
Future File Usability• What?– Can you read your files from 10 years ago?– Data needs to be• Accessible• Interpretable• Readable
Future File Usability:Interpretable• How?– Back up written notes• People always forget this one• Difficult to interpret data without notes• Options– Digitally scan (recommended with digital data)– Photocopies
Future File Usability:Readable• How?– Convert file formats• Can you open digital files from 10 years ago?• Use open, non-proprietary formats that are in wide use– .docx  .txt– .xlsx  .csv– .jpg  .tif• Save a copy in the old format, just in case• Preserve software if no open file format
Future File Usability:Accessible• How?– Move to new media• Hardware dies and becomes obsolete– Floppy disks!• Expect average lifetime to be 3-5 years• Keep up with technology
WHERE TO GO FROM HERE
easylocum, https://www.flickr.com/photos/easylocum/2921542814 (CC BY)
Chris Hoving, https://www.flickr.com/photos/pcrucifer/2433274595 (CC BY-ND)
http://www.flickr.com/photos/72775875@N06/7729764370/ (CC BY-NC-SA)
Resources• Data Services– http://uwm.edu/libraries/dataservices/• http://uwm.edu/libraries/dataservices/#videos• Data Management Guide– http://guides.library.uwm.edu/data• Data Services Librarian– briney@uwm.edu
Thank You!• This presentation available under a CreativeCommons Attribution (CC-BY) license• Some content courtesy of Dorothea Salo– http://www.graduateschool.uwm.edu/research/researcher-central/proposal-development/data-plan/boot-camp/ (CC BY)

Recommended

PPTX
NCURA Webinar on Open Data
PPTX
Organizing Your Research Data
PPTX
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
PPTX
Practical Data Management - ACRL DCIG Webinar
PPTX
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
PPTX
Breaking the Data Management Barrier
PPTX
Data Management 101
PPTX
Ala cspace aspace rep services demo 2015
PPTX
Data Management Crash Course
PDF
Data Storage & Preservation
PDF
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
PDF
Data Publishing Models by Sünje Dallmeier-Tiessen
PDF
Preventing data loss
 
PPTX
The liaison librarian: connecting with the qualitative research lifecycle
PPTX
Linked Data: A short(-ish) introduction
PDF
Persistent Identifier Services and their Metadata by John Kunze
PDF
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
ZIP
Linked Open Data in Libraries, Archives & Museums
PPT
Internet tips lewis 2013
PPTX
BEng Product Design 1st years session 1 Oct 2021
PPTX
It's 2015. Do You Know Where Your Data Are?
PPT
Web crawlingchapter
PPTX
Research Data Management in the Humanities and Social Sciences
PDF
Ldl2012
PPTX
The Chemist's Toolkit 10 9 09
PPTX
Linked Data at ISAW: How and Why
PDF
Organizing for Customer Data Management
PDF
Master Data Governance Best Practices
PDF
Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...

More Related Content

PPTX
NCURA Webinar on Open Data
PPTX
Organizing Your Research Data
PPTX
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
PPTX
Practical Data Management - ACRL DCIG Webinar
PPTX
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
PPTX
Breaking the Data Management Barrier
PPTX
Data Management 101
NCURA Webinar on Open Data
Organizing Your Research Data
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
Practical Data Management - ACRL DCIG Webinar
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
Breaking the Data Management Barrier
Data Management 101

What's hot

PPTX
Ala cspace aspace rep services demo 2015
PPTX
Data Management Crash Course
PDF
Data Storage & Preservation
PDF
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
PDF
Data Publishing Models by Sünje Dallmeier-Tiessen
PDF
Preventing data loss
 
PPTX
The liaison librarian: connecting with the qualitative research lifecycle
PPTX
Linked Data: A short(-ish) introduction
PDF
Persistent Identifier Services and their Metadata by John Kunze
PDF
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
ZIP
Linked Open Data in Libraries, Archives & Museums
PPT
Internet tips lewis 2013
PPTX
BEng Product Design 1st years session 1 Oct 2021
PPTX
It's 2015. Do You Know Where Your Data Are?
PPT
Web crawlingchapter
PPTX
Research Data Management in the Humanities and Social Sciences
PDF
Ldl2012
PPTX
The Chemist's Toolkit 10 9 09
PPTX
Linked Data at ISAW: How and Why
Ala cspace aspace rep services demo 2015
Data Management Crash Course
Data Storage & Preservation
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
Data Publishing Models by Sünje Dallmeier-Tiessen
Preventing data loss
 
The liaison librarian: connecting with the qualitative research lifecycle
Linked Data: A short(-ish) introduction
Persistent Identifier Services and their Metadata by John Kunze
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
Linked Open Data in Libraries, Archives & Museums
Internet tips lewis 2013
BEng Product Design 1st years session 1 Oct 2021
It's 2015. Do You Know Where Your Data Are?
Web crawlingchapter
Research Data Management in the Humanities and Social Sciences
Ldl2012
The Chemist's Toolkit 10 9 09
Linked Data at ISAW: How and Why

Viewers also liked

PDF
Organizing for Customer Data Management
PDF
Master Data Governance Best Practices
PDF
Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...
PDF
Ebook - The Guide to Master Data Management
PPT
Gartner: Master Data Management Functionality
PPTX
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes
Organizing for Customer Data Management
Master Data Governance Best Practices
Master Data Management (MDM) 101 & Oracle Trading Community Architecture (TCA...
Ebook - The Guide to Master Data Management
Gartner: Master Data Management Functionality
Webinar: Enterprise Data Management in the Era of MongoDB and Data Lakes

Similar to Data Management 101 (2015)

PPTX
Data Management 101
PPTX
Data management for TA's
PPTX
Research Data Management Fundamentals for MSU Engineering Students
PDF
Practical Best Practices for Data Management
PPTX
Good Practice in Research Data Management
PPTX
Responsible Conduct of Research: Data Management
PPTX
Research Data Curation _ Grad Humanities Class
PPTX
Introduction to Data Management
PPTX
Data managementbasics issr_20130301
PPTX
Managing Your Research Data
PPTX
records-and-information-management-month (1).pptx
PDF
Research Data Management and Sharing for the Social Sciences and Humanities
PPT
Data Management for Graduate Students
PDF
Planning for Research Data Managment
PDF
Planning for Research Data Management
PPTX
Managing your research data
PPT
What is-rdm
PDF
Data Management Lab: Session 2 slides
 
PPTX
CSU-ACADIS_dataManagement101-20120217
PPTX
Introduction to data management
Data Management 101
Data management for TA's
Research Data Management Fundamentals for MSU Engineering Students
Practical Best Practices for Data Management
Good Practice in Research Data Management
Responsible Conduct of Research: Data Management
Research Data Curation _ Grad Humanities Class
Introduction to Data Management
Data managementbasics issr_20130301
Managing Your Research Data
records-and-information-management-month (1).pptx
Research Data Management and Sharing for the Social Sciences and Humanities
Data Management for Graduate Students
Planning for Research Data Managment
Planning for Research Data Management
Managing your research data
What is-rdm
Data Management Lab: Session 2 slides
 
CSU-ACADIS_dataManagement101-20120217
Introduction to data management

More from Kristin Briney

PPTX
Internet Privacy
PDF
Leveling Up Data Management
PPTX
Twitter For Academics
PPTX
TEDxUWMilwaukee: Rethinking Research Data
PPTX
Measuring Research Impact
PPTX
Retaining Your Old Research Data
PPTX
Documenting Your Research Data
PPTX
Storing Your Research Data
PPTX
Research Data & Digital Preservation - CUWL Conference 2014
PPTX
Electronic Laboratory Notebooks
PDF
Data Management Tips Handout
PDF
Data Management Plan Checklist
PPTX
Creating a Data Management Plan
PPTX
Data Services
PPTX
Electronic Lab Notebooks
PPTX
Lab Notebooks: A Librarian's Primer
Internet Privacy
Leveling Up Data Management
Twitter For Academics
TEDxUWMilwaukee: Rethinking Research Data
Measuring Research Impact
Retaining Your Old Research Data
Documenting Your Research Data
Storing Your Research Data
Research Data & Digital Preservation - CUWL Conference 2014
Electronic Laboratory Notebooks
Data Management Tips Handout
Data Management Plan Checklist
Creating a Data Management Plan
Data Services
Electronic Lab Notebooks
Lab Notebooks: A Librarian's Primer

Recently uploaded

PDF
Past Memories and a New World: Photographs of Stoke Newington from the 70s, 8...
PPTX
Anatomy of the eyeball An overviews.pptx
PDF
Agentic AI and AI Agents 20251121.pdf - by Ms. Oceana Wong
PDF
Rigor, ethics, wellbeing and resilience in the biomedical doctoral journey
 
PPTX
Introduction to Beauty Care and Wellness Services.pptx-day fcs 3rd quarter tl...
PDF
The invasion of Alexander of Macedonia in India
PDF
1. Doing Academic Research: Problems and Issues, 2. Academic Research Writing...
PDF
Deep Research and Analysis - by Ms. Oceana Wong
PDF
AI Workflows and Workflow Rhetoric - by Ms. Oceana Wong
PPTX
Time Series Analysis - Method of Simple Moving Average 3 Year and 4 Year Movi...
PDF
Conferencia de Abertura_Virgilio Almeida.pdf
PDF
Risk Management and Regulatory Compliance - by Ms. Oceana Wong
PPTX
LYMPHATIC SYSTEM.pptx it includes lymph, lymph nodes, bone marrow, spleen
PDF
45 ĐỀ LUYỆN THI IOE LỚP 8 THEO CHƯƠNG TRÌNH MỚI - NĂM HỌC 2024-2025 (CÓ LINK ...
PPTX
Chapter 3. Pharmaceutical Aids (pharmaceutics)
PDF
Digital Electronics – Registers and Their Applications
PDF
ASRB NET 2025 Paper GENETICS AND PLANT BREEDING ARS, SMS & STODiscussion | Co...
PDF
Unit 2: Functions of Management (POSDC.)
PPTX
Time Series Analysis - Least Square Method Fitting a Linear Trend Equation
PPTX
ATTENTION - PART 1.pptx cognitive processes -For B.Sc I Sem By Mrs.Shilpa Hot...
Past Memories and a New World: Photographs of Stoke Newington from the 70s, 8...
Anatomy of the eyeball An overviews.pptx
Agentic AI and AI Agents 20251121.pdf - by Ms. Oceana Wong
Rigor, ethics, wellbeing and resilience in the biomedical doctoral journey
 
Introduction to Beauty Care and Wellness Services.pptx-day fcs 3rd quarter tl...
The invasion of Alexander of Macedonia in India
1. Doing Academic Research: Problems and Issues, 2. Academic Research Writing...
Deep Research and Analysis - by Ms. Oceana Wong
AI Workflows and Workflow Rhetoric - by Ms. Oceana Wong
Time Series Analysis - Method of Simple Moving Average 3 Year and 4 Year Movi...
Conferencia de Abertura_Virgilio Almeida.pdf
Risk Management and Regulatory Compliance - by Ms. Oceana Wong
LYMPHATIC SYSTEM.pptx it includes lymph, lymph nodes, bone marrow, spleen
45 ĐỀ LUYỆN THI IOE LỚP 8 THEO CHƯƠNG TRÌNH MỚI - NĂM HỌC 2024-2025 (CÓ LINK ...
Chapter 3. Pharmaceutical Aids (pharmaceutics)
Digital Electronics – Registers and Their Applications
ASRB NET 2025 Paper GENETICS AND PLANT BREEDING ARS, SMS & STODiscussion | Co...
Unit 2: Functions of Management (POSDC.)
Time Series Analysis - Least Square Method Fitting a Linear Trend Equation
ATTENTION - PART 1.pptx cognitive processes -For B.Sc I Sem By Mrs.Shilpa Hot...

Data Management 101 (2015)


[8]ページ先頭

©2009-2025 Movatter.jp