Cyberinfrastructure Community-wide Mentorship Network
Configuring a high-performance cluster, with virtual machines, to simulate Hadoop multi-node system for Data Science experiences
Submission information
Submission Number:
134
Submission ID:
237
Submission UUID:
56e75a52-dd01-49dd-bb82-8616ea97d9f3
Submission URI:
/form/project
Created:
Thu, 01/13/2022 - 11:07
Completed:
Thu, 01/13/2022 - 11:07
Changed:
Wed, 07/06/2022 - 15:09
Remote IP address:
192.112.102.251
Submitted by:
Gerald Kruse
Language:
English
Is draft:
No
Webform:
Project
Project Title
Configuring a high-performance cluster, with virtual machines, to simulate Hadoop multi-node system for Data Science experiences
Program
CAREERS
Project Image
{Empty}
Tags
cluster-management (495), hadoop (12), software-installation (211), unix-environment (60)
Status
Halted
Project Leader
Gerald Kruse
Email
kruse@juniata.edu
Mobile Phone
814-644-9206
Work Phone
814-641-3595
Project Personnel
Mentor(s)
{Empty}
Student-facilitator(s)
{Empty}
Mentee(s)
{Empty}
Project Information
Project Description
Our Data Science high-performance cluster, a Cloudseek 1000 from PSSCLabs, was delivered in January 2020.
Unfortunately, COVID-19 disrupted our efforts to configure it for our Data Science courses (https://www.juniata.edu/academics/departments/data-science/curriculum.php). At Juniata, we offer a major (our "Program of Emphasis"), a minor (our "Secondary Emphasis"), and an online graduate degree in Data Science. We have been able to get by so far, but with a Big Data course about to be offered, we need to configure this system. We would like funding for one of our students to work on this project. We have the name of a possible technical mentor, or at least someone who should be consulted.
Getting this cluster operational has been a challenge, and we would greatly appreciate any assistance.
Project Information Subsection
Project Deliverables
{Empty}
Student Research Computing Facilitator Profile
{Empty}
Mentee Research Computing Profile
{Empty}
Student Facilitator Programming Skill Level
{Empty}
Mentee Programming Skill Level
{Empty}
Project Institution
{Empty}
Project Address
{Empty}
Anchor Institution
CR-Penn State
Preferred Start Date
{Empty}
Start as soon as possible.
No
Project Urgency
Already behind; start date is flexible
Expected Project Duration (in months)
{Empty}
Launch Presentation
{Empty}
Launch Presentation Date
{Empty}
Wrap Presentation
{Empty}
Wrap Presentation Date
{Empty}
Project Milestones
{Empty}
Github Contributions
{Empty}
Planned Portal Contributions (if any)
{Empty}
Planned Publications (if any)
{Empty}
What will the student learn?
{Empty}
What will the mentee learn?
{Empty}
What will the Cyberteam program learn from this project?
{Empty}
HPC resources needed to complete this project?
{Empty}
Notes
{Empty}
Final Report
What is the impact on the development of the principal discipline(s) of the project?
{Empty}
What is the impact on other disciplines?
{Empty}
Is there an impact on physical resources that form infrastructure?
{Empty}
Is there an impact on the development of human resources for research computing?
{Empty}
Is there an impact on institutional resources that form infrastructure?
{Empty}
Is there an impact on information resources that form infrastructure?
{Empty}
Is there an impact on technology transfer?
{Empty}
Is there an impact on society beyond science and technology?
{Empty}
Lessons Learned
{Empty}
Overall results
{Empty}