What We’ll Cover
- Productivity: Navigation and output tips
- Internals: How R Notebooks really work
- Ecosystem: Publishing, sharing, and version control
Basic familiarity with R Notebooks is presumed.
Notebook Recap
data:image/s3,"s3://crabby-images/49d5c/49d5cb4d18f45cd7262719bc21ff1bba12a8e8b0" alt="notebook demo"
Notebook Recap
- Notebooks are R Markdown documents + new interaction model
- Code chunks are executed individually
- Code output appears beneath the chunk and is saved with the document
- Combines iterative approach with reproducible result
Navigation: Outline
data:image/s3,"s3://crabby-images/825b0/825b02df68a6c7f536b3095b90fdbce206d62049" alt="document outline"
Navigation: Jump To
data:image/s3,"s3://crabby-images/b349b/b349b437c42885e12989250778e3120b9a6c95b3" alt="jump to chunk"
Navigation: By Chunk
data:image/s3,"s3://crabby-images/ec504/ec5046c19e9cf4eaa1203bd188c7f371fa6718eb" alt="prev/next chunk"
Navigation: Running
data:image/s3,"s3://crabby-images/def79/def795c712ef64814a14eb512144f291df4d7907" alt="nav-running"
Output Management
data:image/s3,"s3://crabby-images/616ee/616ee339b3c162b718f12c344445a33af1520a15" alt="output-management"
Reproducibility
- Notebooks can be less reproducible than traditional R Markdown documents.
- Chunks can be run in any order.
- Chunks can access the global environment.
Simulating a Knit
data:image/s3,"s3://crabby-images/31225/31225d16396bd6138a9ba5202f34ec32c3e7debb" alt="restart and run"
Consistency
data:image/s3,"s3://crabby-images/4b16b/4b16b1fa8ff6aeaf622e30cb6ec5051fec178b6a" alt="run chunks above"
How R Notebooks Really Work
Publishing and Collaborating
Publishing
Two choices:
- Publish the notebook file directly (HTML)
- Render to a separate output format
Publishing: Notebook HTML
- Self-contained
- Compatible with any hosting service
- No viewer required
- Hydrates to full notebook (more on this later)
Publishing: RStudio Connect
- One-click publish inside your org
- Fine-grained access control
- Execute on the server
- Schedule executions
Collaborating: Plain Files
To: analyst@contoso.com
From: customer@northwind.com
I need some help with this analysis. Could you
take a look at what I have so far?
Thanks,
Charles
[ Attachment: foo.nb.html ]
Collaborating: Execution in Reverse
Collaborating: Opening Code in RStudio
data:image/s3,"s3://crabby-images/85632/85632ef3790b8415ef27b8ec9c289b09452333cf" alt="download R markdown"
Collaborating: Opening Notebook in RStudio
data:image/s3,"s3://crabby-images/324d1/324d13ecacb2495b8851b790c5627546cdb31068" alt="save as notebook"
Collaborating: Open Inside RStudio
data:image/s3,"s3://crabby-images/3be53/3be5364c1c91a4ffd30bca02cad761f95616af90" alt="pre open"
Collaborating: Open Notebook
data:image/s3,"s3://crabby-images/963ab/963abb26996bb4af497ddd5005f96f54f8615f6d" alt="hydrated"
Collaborating: Version Control #1
- Add
*.nb.html
to your .gitignore
or similar
- Check in only the
.Rmd
file
- All diffs are plain text
- Encourages reproducibility & independent verification of results
Collaborating: Version Control #2
- Check in the
.nb.html
file and the .Rmd
file
- Diffs are noisier
- RStudio loads outputs from
.nb.html
if newer
- Outputs and inputs are versioned together
- No need to re-execute lengthy or fragile computations
Versioned Output
When a notebook is opened:
- The local cache modified time is compared to the
.nb.html
- If
.nb.html
is older, it is ignored
- If
.nb.html
is newer, it replaces the local cache
- No merging or conflict management is performed!
The End
data:image/s3,"s3://crabby-images/9c9d4/9c9d4e35b327afc65af7a5104c13b049fa3385fd" alt="lego cat"