Newer
Older
<h2 id="top_">SU2025 GBSC/IDNE 720/790-VTR JC- Data Science Club</h2>
<p> </p>
<p><strong>Please take the time to read this syllabus in its entirety. All of the information here will be useful as you work through the course.</strong></p>
<p> </p>
<p>The Data Science Journal Club course originated from a collaboration between the UAB IT Research Computing (RC) group, the Graduate Biomedical Sciences (GBSC) department and the Neuroengineering (IDNE) department. The course exists to foster knowledge transfer between Research Computing and students interested in growing their data science, programming and high-performance computing skill set. Future years will see increasing demand for these skills in every employment sector. Our goal is to enable students to be more competitive in their future careers by facilitating self-directed growth in the data science space. We accept students at all skill levels of data science, assuming familiarity with computers and graphical plots.</p>
While the course is mostly self-directed and hands-off, we don't want students to get lost. We want you to be successful and come away from this course feeling confident in applying the skills you have learned. If you have concerns, get stuck, need ideas or even a
<a href="https://en.wikipedia.org/wiki/Rubber_duck_debugging">rubber duck</a>
, please feel free to contact us.
</p>
<ul>
<li>
<p>
William Warriner (he/him),
<a href="mailto:wwarr@uab.edu" target="_blank" rel="noopener">wwarr@uab.edu</a>
, Instructor of Record
</p>
</li>
<li>
<p>
Matthew Defenderfer,
<a href="mailto:mdefende@uab.edu" target="_blank" rel="noopener">mdefende@uab.edu</a>
</p>
</li>
<li>
<p>
Prema Soundararajan,
<a href="mailto:prema@uab.edu" target="_blank" rel="noopener">prema@uab.edu</a>
</p>
</li>
</ul>
Please visit the following link to see our Office Hours dates, times, and Zoom links:
<a class="inline_disabled" href="https://docs.rc.uab.edu/#contact-us" target="_blank" rel="noopener">https://docs.rc.uab.edu/#contact-us</a>
<h3 id="oae">Office of Access and Engagement:</h3>
<p>The University of Alabama at Birmingham considers all its students, faculty, and staff to be a strength and critical to its educational mission. In this class, we will strive to be a community where we can learn from the many perspectives and worldviews which may differ from our own. We are all expected to contribute to creating a respectful, welcoming, and safe environment that fosters a sense of belonging through open and honest dialogue. To this end, we should always conduct discussions in a way that honors, respects, and extends dignity to all class members.</p>
<p>The Office of Access & Engagement works to promote success for everyone in the UAB community. We work to address challenges faced by our students, faculty and staff to ensure everyone has access to available programs and resources they need to promote success and retention, and to foster an accessible and welcoming culture.</p>
<p>Since its founding, UAB has played a pivotal role in shaping the future in Birmingham and beyond by making a positive difference in as many lives as possible. We do this by serving all people — a commitment that continues today through our vision, mission and shared values.</p>
<p>The Office of Access & Engagement strives to make sure every member of the UAB community has access to available programs and resources they need, recognizing that some students, faculty and staff may face more barriers and have a greater need for additional support than others due to their circumstances.</p>
<a class="inline_disabled" href="https://www.uab.edu/access-engagement/students" target="_blank" rel="noopener">The Office of Access and Engagement Website</a>
<p>
Note: This OAE section has replaced the previous section on DEI (Diversity, Equity, and Inclusion) to comply with
<a href="https://alison.legislature.state.al.us/files/pdf/SearchableInstruments/2024RS/SB129-enr.pdf" target="_blank" rel="noopener">AL-SB129-2024</a>
.
</p>
<p> </p>
<p>Our intent with the course has three parts. The first is for students to learn new techniques and technologies to assist with or facilitate data science. The second is for students to grapple with the application of those techniques and technologies. The third is to discuss with us how that went so we can check understanding and provide mentoring and guidance. We also love seeing all the cool stuff our students learn, and frequently find ourselves learning new perspectives, details, gotchas and technologies in the process.</p>
<li>Learn new data science related stuff</li>
<li>Grapple with that new stuff</li>
<li>Discuss that new stuff with us</li>
<p>
All students are required to give three demonstrations
<strong></strong>
</p>
<li>during any office hours, one per visit...</li>
<li>...in a student-led informal discussion style.</li>
<li>Students may collaborate on projects...</li>
<li>...but each student must show their original work.</li>
<li>Each demonstration must be data science related or somehow facilitate data science.</li>
<p>We realize the last bullet point is very broad, so we account for student knowledge, skill and experience when you give your demonstrations. We have no expectations of student starting skill level or experience. With that in mind, examples of past topics are included in the following list. If you have an idea but are unsure if it would be sufficient material, please feel free to contact us.</p>
<li>Using machine learning or deep learning to create a predictive model.</li>
<li>Creating your very first Anaconda environment to share with collaborators.</li>
<li>Learning to use version control (git and github) with your existing code.</li>
<li>Extracting insights from existing results by applying new visualization methods.</li>
<li>Migrating a data analysis workflow from Excel to Python and pandas.</li>
<li>Scaling a data analysis workflow from a lab workstation to Cheaha.</li>
<li>Exploring a new programming language.</li>
<li>Writing your very first script in any programming language.</li>
<li>Identifying and extracting salient features from complex data for further processing.</li>
<li>Understand what makes data tidy and why it is important.</li>
<p>As you prepare for your demonstrations, please be thinking about the context of your work. An incomplete list of questions to guide your thoughts about context.</p>
<li>What are some drawbacks or limitations?</li>
<li>How does it compare to alternatives?</li>
<li>How does it work?</li>
<li>Why does it work?</li>
<li>Where can we go next?</li>
<li>Why will others want to use my work?</li>
<li>How can I transfer techniques to other projects?</li>
</ul>
<p>Be prepared for us to challenge you with new ways of thinking!</p>
We are less interested in what your results are, and more interested in how you obtained those results and how they fit into the context of data science in general and your field in particular.
<p>If you are not able to make the office hours sessions, please contact us as soon as possible to make alternative arrangements.</p>
The dates in the course summary below are the last day of final exams, and therefore the due dates for the demonstrations. Please plan ahead to finish all three of them before that date. We cannot accept work after the due dates below without prior, individual discussion.
<p>The course is treated as a pass/fail course. The grade reflected on your transcript may depend on the section you are in. If the section you are in uses a letter grade, then a pass is assigned "A" and fail assigned "F".</p>
<p>Grading is based on a syllabus attestation, due on the second Wednesday of classes, and on demonstrations. Three demonstrations are required, and each has its own assignment within Canvas.</p>
<p>
See
<a>Assignments</a>
, or the bottom of this Syllabus, for details.
</p>
<p>Due dates progress throughout the semester:</p>
<li>Syllabus Affirmation is due by the 2nd Friday.</li>
<li>Demonstration #1 is due by the 6th Friday.</li>
<li>Demonstration #2 is due by the 10th Friday.</li>
<li>Demonstration #3 is due by the last Friday.</li>
Because office hours are open to potentially hundreds of researchers outside the scope of this course,
<strong>only one session can be presented per student per office hours visit.</strong>
<p>If for any reason you are not able to adhere to the above requirements, please contact us as soon as you know so we can make plans to accommodate you. Remember, we want you to be successful!</p>
<h3 id="scheduling">Scheduling, Attendance, & Office Hours:</h3>
<p>Our course may differ from the typical university course experience. We do not host regularly scheduled course meetings. Instead, we offer twice-weekly office hours. The office hours serve two purposes for this course.</p>
<li>Each student must make three office hours visits for the three demonstrations.</li>
<li>Students are encouraged to attend office hours if they have questions that are best answered in a virtual meeting. Past examples include:</li>
<li style="list-style-type: none">
<ul style="list-style-type: disc">
<li>Is my topic suitable for this course?</li>
<li>Where can I find more resources on topics?</li>
<li>Can you preview my demonstration material?</li>
<li>Am I on the right track?</li>
</ul>
</li>
Office Hours are held Mondays and Thursdays from 10 AM to noon via Zoom. Meeting links are available at our documentation (
<a href="https://docs.rc.uab.edu/#contact-us" target="_blank" rel="noopener">https://docs.rc.uab.edu/#contact-us</a>
For students who are just entering the data science space, please consider trying some of Kaggle's micro-courses on data analysis, visualization and machine learning:
<a class="inline_disabled" href="https://www.kaggle.com/learn" target="_blank" rel="noopener">https://www.kaggle.com/learn</a>
For students who want to develop skills to facilitate data science, see the Software Carpentry's list of lessons:
<a class="inline_disabled" href="https://software-carpentry.org/lessons/" target="_blank" rel="noopener">https://software-carpentry.org/lessons/</a>
. These lessons include the Unix shell, Git version control and programming skills in Python and R.
For students who want to develop data literacy skills in the context of specific fields, see the Data Carpentry's list of lessons:
<a class="inline_disabled" href="https://datacarpentry.org/lessons/" target="_blank" rel="noopener">https://datacarpentry.org/lessons/</a>
. One of the included fields is genomics, which may be of particular interest to many UAB students.
For students interested in learning more about any of the above with a focus on scaling with high-performance computing, we maintain a YouTube channel with some Data Science Club Material to get you started using Cheaha:
<a class="external" href="https://www.youtube.com/channel/UCZoOS2e699Ge0DND1oy1BJQ" target="_blank" rel="noopener">
<span>https://www.youtube.com/channel/UCZoOS2e699Ge0DND1oy1BJQ</span>
</a>
.
</p>
<p>
Here is a list of free data science resources and starting points:
<a class="inline_disabled" title="Link" href="https://www.datapen.io/" target="_blank" rel="noopener">https://www.datapen.io/</a>
. The following link narrows those resources down to just courses
<a class="inline_disabled" title="Link" href="https://www.datapen.io/resources/free-course-resources" target="_blank" rel="noopener">https://www.datapen.io/resources/free-course-resources</a>
<p>Feel free to seek out and use other sources! If you do so, please inform us of those sources so we can curate our list and better serve future students.</p>
<h3 id="resources">Usage of Artificial Intelligence (AI):</h3>
<p>We encourage you to responsibly explore usage of AI. As always, beware of AI nonsense or "hallucinations". As a budding technologist, we want you to be aware that "hallucintaion" is a marketing term. We prefer to call it what is: nonsense. Nevertheless, the term of art is "hallucination" so, to avoid confusion, we will continue using that term.</p>
<p>
You should be aware that hallucinations are a
<a href="https://arxiv.org/abs/2409.05746">mathematical certainty</a>
and undetectable without independent verification. In other words, you need to already understand the AI's output (and its relationship to reality) well enough to be able to spot hallucinations. Alternatively, you can have an expert check the output.Otherwise, you cannot hope to spot hallucinations reliably. And, now that you are aware of these facts, you should be aware it would be
<a href="https://en.wikipedia.org/wiki/Intellectual_honesty">intellectually dishonest</a>
to pretend AI output is valid or truthful when you haven't verified. **Always verify AI output!**
</p>
<p>
You may use AI to assist with the development of any demonstration assignments, subject to the
<a href="https://www.uab.edu/ai/guidelines-principles">UAB AI guidelines</a>
, and our additional, course-specific guidelines below.
</p>
<ul>
<li>You may not use AI to do content development work for you, it may only be used assist.</li>
<li>You may not use AI to present work, it may only assist you in a manner, except as consistent with accomodations.</li>
<li>You may not use AI to assist with quizzes, tests, or any assessments.</li>
<li>If you use AI to assist you in this course, then you must report your usage of AI as part of relevant demonstrations. This applies both as demonstration content and assistance when developing content. (We'd like to learn more about how AI can be used, too!)</li>
<li>You are responsible for the correctness and accuracy of your work, regardless of AI use.</li>
<li>Remember, the purpose of the course is for you to learn something and show off what you learned. Don't replace your mind with AI. This is a chance for you to shine, so don't let the AI outshine you!</li>
</ul>
<h4>I'm completely new to data science, what are some good starting points?</h4>
<ul>
To get started with general data science tools and techniques, try Kaggle Learn:
<a href="https://www.kaggle.com/learn" target="_blank" rel="noopener">https://www.kaggle.com/learn</a>
</li>
<li>
Check out the Data Carpentries Lessons:
<a href="https://datacarpentry.org/lessons/" target="_blank" rel="noopener">https://datacarpentry.org/lessons/</a>
</li>
<li>
For starting bioinformatics, also try Rosalind.info:
<a href="https://rosalind.info/problems/locations/" target="_blank" rel="noopener">https://rosalind.info/problems/locations/</a>
<h4>What tools are available to make it easier to manage my projects?</h4>
<ul>
<a href="https://swcarpentry.github.io/git-novice/" target="_blank" rel="noopener">https://swcarpentry.github.io/git-novice/</a>
<h4>Am I allowed to use my own research projects for this course?</h4>
<ul>
<li>Yes, and we encourage applying new tools and techniques! Be mindful that we can't accept any work for research protected by IRB or using PHI/HIPAA data. Be sure to get your supervisor's approval first.</li>
<ul>
<li>
<p>
Research Computing Official Site:
<a class="external" href="https://www.uab.edu/it/home/research-computing" target="_blank" rel="noopener">https://www.uab.edu/it/home/research-computing</a>
.
</p>
</li>
<li>
<p>
Cheaha Documentation:
<a class="external" href="https://docs.rc.uab.edu" target="_blank" rel="noopener">https://docs.rc.uab.edu</a>
.
</p>
</li>
<li>
<p>
Cheaha Web Portal:
<a class="external" href="https://rc.uab.edu" target="_blank" rel="noopener">https://rc.uab.edu</a>
.
</p>
</li>
</ul>
If you need software installed, or something just doesn't work like you expect, please email us at
<a href="mailto:support@listserv.uab.edu" target="_blank" rel="noopener">support@listserv.uab.edu</a>
. When asking for help, consider including all relevant details about your goal, what you tried, what you expected to happen, what actually happened, steps needed to reproduce the issue, and any error messages. If the error messages are longer than about 10 lines, copy-paste them into a text document and attach that to the email instead. Screenshots are also helpful. See our documentation on support (
<a href="https://docs.rc.uab.edu/help/support" target="_blank" rel="noopener">https://docs.rc.uab.edu/help/support</a>
In accordance with Title IX, the University of Alabama at Birmingham does not discriminate on the basis of gender in any of its programs or services. The University is committed to providing an environment free from discrimination based on gender and expects individuals who live, work, teach, and study within this community to contribute positively to the environment and to refrain from behaviors that threaten the freedom or respect that every member of our community deserves. For more information about Title IX, policy, reporting, protections, resources and supports, please visit the
<a class="inline_disabled" href="http://www.uab.edu/titleix" target="_blank" rel="noopener">UAB Title IX webpage</a>
for UAB’s Title IX Sex Discrimination, Sexual Harassment, and Sexual Violence Policy; UAB’s Equal Opportunity and Discriminatory Harassment Policy; and the Duty to Report and Non-Retaliation Policy.
<h3 id="dss">Disability Support Services (DSS) Statement:</h3>
<p>
UAB is committed to providing an accessible learning experience for all students. If you are a student with a disability that qualifies under the Americans with Disabilities Act (ADA) and/or Section 504 of the Rehabilitation Act, and you require accommodations, please contact Disability Support Services for information on accommodations, registration, and procedures. Requests for reasonable accommodations involve an interactive process and consist of a collaborative effort among the student, DSS, faculty and staff. If you are registered with Disability Support Services, please contact me to discuss accommodations that may be necessary in this course. If you have a disability but have not contacted Disability Support Services, please call (205) 934-4205 or visit the
<a class="inline_disabled" href="http://www.uab.edu/dss" target="_blank" rel="noopener">DSS website</a>