Skip to main content

Sharpen your teeth in Data Analytics (and maybe create a portfolio while you’re at it)


 

As a novice data analyst myself, I truly understand the mind of another beginner rearing to go out in the world and trying to tame a wild data set and analysing the heck out of it. Understandable, it is then, the urge to pick up the same old Kaggle data set that is begging not to be analysed for the umpteenth time, all whilst arriving at the same insights as the 24,637 people that came before you. One must not be utterly taken aback then, when they see the face of their interviewer (seeing the same analysis thirteenth time that day) turn red in anger without application of any sort of conditional formatting.

So, one might naturally enquire into the remedy to such an ailment. I would then point you towards the less traversed yet endlessly fascinating direction of obscure publicly available datasets. From an exhaustive data set of all known passengers of the RMS Titanic to the largest reference data set of the human genome, not only do they make for remarkably interesting candidates for analytics projects, but they also set you apart in the eyes of interviewers. 

The variety of data out there is so diverse, every budding data enthusiast is bound to discover something that piques their analytical interest.


So here are some of my favourites that you can check out:
  1. Data.gov.in: Do your parents boast that they used to shop for a whole week’s worth of groceries all for 15 rupees in 2009? Bust out the last 15 years of Consumer Price index (CPI) data from the Government of India’s official data repository to prove them wrong, once and for all. Developed by the National Informatics Centre (NIC) under the aegis of Ministry of Electronics and Information Technology (aka MEITy), Data.gov.in has data from more than 6,00,000 resources including crime, judiciary, urban and sports. At this data heaven, everyone is sure to find something that would make a worthy addition to their portfolio. Not only this, but you can have access to a wide selection of their APIs as well. Bonus points for no sign ups necessary.

  2. Awesome public datasets (GitHub): From Swiss apartment models to the biggest crowdsourced database of American gut biome, ‘Awesome public datasets’ is a source of admittedly more global, yet no less amusing datasets which one can explore in search for their next project. These datasets were painstakingly collected and tidied from blogs and user responses. Most of them are absolutely free and part of the open source movement. Again, no obligation to sign up!

  3. Sindresorhus’s Awesome Collection (GitHub): This list, my friends! Is the GitHub equivalent to Sir Ravindra Jadeja because of the all-rounder variety of resources it holds. Not only is it home to learning resources ranging from fintech to Generative AI, it also holds free books, public datasets and much more. This list is a one stop shop to learn anything and everything! Do, however, make sure to have blinders on while you visit this page, otherwise you’re guaranteed to be distracted along the way (speaking from personal experience).

  4. Figshare: This one is for all the academically minded folks out there. Figshare has an endless trove of datasets from close to 25 categories ranging from economics to earth sciences. Be it China’s Covid-19 case data from January 2020 or the species of native plants in any state of the US, if you can think of it, this data repository probably has it. With a clean UX, it successfully distinguishes itself from the typical academic website, making it easier for a newbie to find his way around. The good news? You can download up to 20 GB of this data for FREE! (thank me later)

  5. Google Trends: Did you know that mentions of the term “big data” peaked in October 2018? Wanna know why? Then this is my homework for you to find out through Google’s own repository of all things trends and keywords. Alright! I must admit this one isn’t very “obscure” but deserves a mention, nonetheless. A pioneer in “nowcasting”, google trends is the back bone for all sorts of projects to get real time updates, the OECD’s weekly GDP tracker being a good example. Being the good Samaritans they are, they have an extremely helpful section right upfront to teach newbies how to make the most of this data as well.

  6. World Bank Open data: Wonder how the GDP of the nations of the world has changed over the past 30 years? No biggie, the World Bank has you covered! With it’s ‘World Bank Open Data’ initiative, it has made a true wealth of financial and fiscal data available to the masses. This data is available to download in CSV, XML and Excel formats along with access to their own data bank and thematic tables for easy understanding. All at one click of a button!

  7. OECD Data Explorer: In their own words, The Organisation for Economic Cooperation and Development (OECD) (phew) is an international organisation working towards making better policies for better lives. But they’re not all talk, they’ve made available data ranging from Tobacco consumption, Variation in Body weight between nationalities and Wildfires, to one and all. This excellent selection of data can help you analyse everything from levels of alcoholism between states in India to the variation of occurrence of obesity within the country. Truly a great way to spend one’s Saturday, don’t you think? (just kidding)

  8. UCI Machine learning repository: Focused towards machine learning enthusiasts, this is an excellent repository of more than 600 datasets for all the newbies trying to get themselves familiar with ML. This will ensure that you go from being an ML clueless to an ML connoisseur in no time!

So, I hope that equipped with these sources, you will make your portfolio stand out like a kangaroo in a penguin enclosure. Do always remember the wise words of Franklin D. Roosevelt, “The only thing we have to fear is fear itself, and maybe not backing up important data” (don’t quote me on that though).

Till we meet again, Data comrades!
 
About Author
Author Photo
Vasudev Pandey
I am a budding data scientist and mechatronics engineer with a passion for history and finance. I write about anything and everything I find interesting.

Check our offerings below!


Success Stories



See all the Success Stories - here
Testimonials - here


You can check out other TakeOff Talent offerings that have helped 7,000+ people land jobs.

Offerings
๐Ÿ“„ CV Review
๐Ÿ“˜ 200 most-asked SQL interview questions with detailed solutions
๐Ÿ“˜ 200 most-asked Python interview questions with detailed solutions
๐Ÿ“Š SQL Crash Course
✍️ CV Writing for freshers
✍️ CV Writing
๐Ÿ› ️ Portfolio Project
๐Ÿ—ฃ️ English Speaking Practice (Live 1:1)
๐ŸŽฏ Job Search Mentorship Package

  In case of any questions around services above, write to us at vibhanshu@takeofftalent.com

  Connect with our founder on Linkedin - https://www.linkedin.com/in/vibvibgyor/



Video Gallery



Check more videos here>>.

Other popular job openings

Concentrix is hiring for a fresher entry level Data Analyst (BI Analyst) role in India

Position: Data Analyst (BI Analyst) Company: Concentrix Location: Gurugram, Haryana, India Job type: Full-time Job mode: On-site Job requisition id: R1621569 Years of experience: 0–3 years (Entry level applicants encouraged) Company Description: Concentrix is a global leader in technology-driven services and digital transformation. It specializes in designing, building, and operating customer engagement and business process solutions for leading global brands. The company operates across multiple sectors including technology, financial services, healthcare, automotive, telecommunications, retail, and consumer goods. With more than 100,000 employees worldwide and a presence in over 40 countries, Concentrix continues to expand its influence through both organic growth and strategic acquisitions. The recent acquisition of Webhelp has further strengthened its global delivery network, expanding capabilities in AI, analytics, and intelligent experience products. Co...

Turing is hiring for a fresher entry level Data Scientist/Analyst role in India

Position: Data Scientist / Analyst Company: Turing Enterprises, Inc. Location: India (Remote opportunity working with US clients) Job Type: Contractor (Full-time options available based on commitment) Job Mode: Remote Job Requisition ID: Not specified Years of Experience: 0-3 years Company Description Turing is a global AI powerhouse driving the progress and deployment of intelligent systems at scale. The company collaborates with some of the world’s leading AI labs to develop advanced model capabilities in reasoning, coding, agentic behavior, multimodality, multilingual processing, and frontier STEM knowledge. It focuses on bridging cutting-edge AI research with practical, real-world solutions that address mission-critical challenges for enterprises. Turing’s clients include top organizations across industries, ranging from tech giants to Fortune 500 companies. The organization emphasizes innovation, ethics, and inclusivity, offering its team exposure to ...

Collins Aerospace is hiring for a fresher entry level Associate Data Analyst (Business Intelligence) role in India

Position: Associate Data Analyst (Business Intelligence) Company: Collins Aerospace (a Raytheon Technologies Company) Location: Bengaluru, Karnataka, India Job type: Full-time Job mode: Onsite Job requisition id: 01795950 Years of experience: 0–3 years Company Description Collins Aerospace, part of Raytheon Technologies (RTX), is one of the world’s foremost innovators in aerospace and defense systems. The organization is dedicated to developing advanced technological solutions that redefine aviation, enhance connectivity, and ensure safety and reliability across air and space missions. It focuses on creating integrated and intelligent systems that empower both commercial and military aircraft operations globally. The company operates with a clear vision to reshape the future of flight through engineering excellence, advanced analytics, and cutting-edge digital transformation. Its Digital Technology division drives a global effort to modernize systems, opti...

EXL is hiring for a fresher entry level Data Analyst role in India

Position: Data Analyst - Application Development - Data Visualization Company: EXL Location: Gurugram, Haryana, India Job type: Full-time Job mode: Onsite Job requisition id: Not disclosed Years of experience: 0-3 years Company Description EXL is a global leader in analytics, digital transformation, and business process management, empowering organizations to make smarter decisions using data-driven strategies. The company blends advanced analytics, automation, AI, cloud technologies, and industry-specific expertise to help businesses reimagine operations and achieve sustainable growth. With a workforce of over 47,000 professionals across more than 30 countries, EXL partners with clients in industries such as insurance, healthcare, banking, logistics, and retail. The company’s culture emphasizes continuous innovation, collaboration, and client-centricity, where every solution is tailored to suit unique business challenges. EXL believes that meaningful tran...

UnitedHealth Group is hiring for a fresher entry level Data Analyst role in India

Position: Data Analyst Company: UnitedHealth Group (Optum) Location: Noida, Uttar Pradesh, India Job Type: Full-time Job Mode: Onsite Job Requisition ID: 2324896 Years of Experience: 0–3 years Company Description UnitedHealth Group (UHG) is a global leader in health care innovation and management, dedicated to improving how the health system works for everyone. It operates through two main entities, Optum and UnitedHealthcare, both focused on integrating advanced data solutions, technology, and care delivery. The company’s mission revolves around simplifying health care delivery, improving outcomes, and creating affordable and accessible services for millions of individuals. UHG combines analytical expertise, medical knowledge, and technology to identify opportunities for improvement in patient care and health management. Employees at Optum, the data and technology arm of UHG, are empowered to make a measurable difference through impactful projects that af...

Tesco India is hiring for a fresher entry level Data Scientist role in India

Position: Data Scientist Company: Tesco India Location: Bengaluru, Karnataka, India Job type: Full-time Job mode: Hybrid Job requisition id: Not specified Years of experience: 0-3 years Company Description Tesco is one of the world’s leading retail companies with a long-standing presence in the global market. The company is focused on using data-driven insights to improve the customer experience and optimize business operations. Tesco India serves as a strategic technology and innovation hub supporting global functions such as analytics, supply chain optimization, digital transformation, and data science. Through advanced analytics and machine learning, Tesco aims to improve store performance, pricing accuracy, demand forecasting, and logistics efficiency. The organization emphasizes collaboration, continuous learning, and cross-domain exposure, ensuring its teams are always aligned with the latest trends in AI and data-driven decision-making. Tesco Ind...

Ford Motor Company is hiring for a fresher entry level AI Data Scientist role in India

Position: AI Data Scientist Company: Ford Motor Company Location: Chennai, Tamil Nadu, India Job type: Full-time Job mode: Hybrid Job requisition id: 54097 Years of experience: 0–3 years Company Description Ford Motor Company is a global leader in mobility, technology, and innovation, operating in more than 100 countries with a mission to shape the future of transportation. The company integrates advanced data, analytics, and artificial intelligence to design next-generation vehicles and digital experiences. Ford’s technology and enterprise teams work together to build safe, efficient, and intelligent mobility systems, driving digital transformation in every area of the business. The organization fosters a culture of learning, collaboration, and integrity, empowering individuals to innovate and contribute to solving complex global challenges. With a commitment to sustainability, Ford invests in electric mobility, AI-driven automation, and cybersecurity, en...

NTT DATA is hiring for a fresher entry level Data & Business Insights Associate role in India

Position: Data & Business Insights Associate (as per the JD, roles has a DS component as well) Company: NTT DATA Location: Chennai, Tamil Nadu, India Job Type: Full-time Job Mode: Onsite Job Requisition ID: 344192 Years of Experience: 0-3 years Company Description NTT DATA is a global technology and business services organization valued at over $30 billion. The company works with 75% of the Fortune Global 100, offering a blend of consulting, digital transformation, and managed services. It operates in more than 50 countries, combining expertise from local teams and global resources. NTT DATA is known for its strong partner network that includes major technology companies and innovative startups. Its focus areas include artificial intelligence, data science, digital infrastructure, and business consulting. The company invests heavily in research and development, allocating approximately $3.6 billion annually to innovation and sustainability initiativ...

Wood Mackenzie is hiring for a fresher entry level Associate Data Analyst role in India

Position: Associate  Data Analyst Company: Wood Mackenzie Location: Gurugram, India Job Type: Full-time Job Mode: Hybrid Job Requisition ID: JR2197 Years of Experience: 0–3 years Company Description Wood Mackenzie is a globally recognized organization providing data and analytics solutions for the renewables, energy, and natural resources sectors. Established over 50 years ago, the company has become a trusted source of actionable intelligence for industries that are shaping the transition toward sustainability. With a team of more than 2,400 professionals working across 30 international offices, Wood Mackenzie combines human expertise with advanced technology to deliver high-impact insights. The company enables businesses, governments, and organizations to make informed decisions through comprehensive data coverage, real-time analytics, consultancy, and research-driven insights. Their expertise spans the entire value chain of energy, from production t...

Hiring for a Python Full Stack Engineer (AI-Focused) for one of our clients at TakeOff Talent

Company: A leading product startup in the AI Security space (one of our clients) Position: Python Full Stack Engineer (AI-Focused)  Experience: 1+ years Location: Bengaluru, India (Onsite) Job ID: TOT2025103 Company Overview We are a seed-stage startup headquartered in San Francisco and Bangalore, building the world’s most advanced security tooling for GenAI systems. Our mission is to secure AI agents, chatbots, assistants, and copilots by proactively surfacing existing vulnerabilities before attackers can exploit them. Our flagship product is the industry’s first fully autonomous red-teaming agent for GenAI applications. It continuously stress-tests apps and models, identifies weaknesses, and guides teams to strengthen defenses. We open-source a significant part of our research and believe in shipping fast, experimenting rapidly, and leading innovation in AI security and safety. Backed by top venture capital investors, we are focused on building foundational systems in one of th...