[RFC]: build a developer dashboard for tracking repository build status

stdlib-js / google-summer-of-code

Google Summer of Code resources.

23 stars 5 forks source link

Full name

Naveen Kumar

University status

Yes

University name

Siksha O Anusandhan

University program

Bachelor of Technology in Computer Science and Information Technology

Expected graduation

2025

Short biography

I am a third-year B.Tech. Undergraduate at Siksha O Anusandhan, Bhubaneswar, pursuing Computer Science and Information Technology as my major. I also have a certificate in the Applied Machine Learning program from my college.

I have a strong passion for web application development, and thus I have been actively involved in such projects for the past two years. I have a good command of JS and web development, particularly in the MERN stack. I have participated in a symposium organized at my college, where I submitted a research-based project called Predicting Hospital Acquired Infection (HAI). For this, my paper was in the top 20. Further, I was among the 106 students selected from all over India for contributing to C4GT, an open source program. I developed the npm package in JS, utilizing the Bhasini API for this project. I completed the project within the given timeline. I have also done freelance work on Upwork, where I built an Android mobile application named YourRadio, a social media platform for my client using React Native and Node.js.

My interest in computer science, particularly web development, took root during my intermediate days in school. Since then, I have been working hard to grow this interest into a full-fledged experience. I see this opportunity as a stepping stone in my professional growth.

Timezone

Asia/Kolkata UTC +0530

Contact details

email: navstr10@gmail.com , GitHub: https://github.com/naveen1m , Gitter: @naveen1m:gitter.im

Platform

Mac

Editor

VSCode. This is simple and lightweight, supports almost all programming languages. It has also numerous extension which enhance its functionality even further.

Programming experience

I have more than two years of experience in programming. My first project was Image-to-Ascii-Art in Java, and after this I have built many projects in the field of web development. Projects include frontend using HTML, CSS, and JS, as well as backend using Node.js, Express, and MongoDB, and a few are full-stack; some of them are blog app, which are deployed as well; mern-auth; and one mobile app, yourradio, using React Native. I also have a few projects where I have written code for reactjs and API code in FastAPI to connect ML models; these are HAI, where I also developed ML models to predict HAI, and Namami-gange-guide. During the C4GT open-source programme, I created a npm package in JS, utilizing the Bhashini API, and wrote documents in JSDoc for this.

JavaScript experience

I have been working on JavaScript since 1st year of college. I learnt it from youtube and built some fundamental projects and intermediate level projects on it. view all vanilla-js projects here, It includes digital-clock, calculator, food-app(using api) and more. I love this language because I can use it in frontend and backend on the web as well as in mobile app development. It has also many libraries and large community

My most favorite features includes:

Functional and Object Oriented Programming
Promise and async/await for asynchronous programming

My Least favorite features includes:

loose typing
callback hell

Node.js experience

I learnt node js before React js and used it with library Express js. Most of the time I used mongodb as a database, but recently I used postgreSQL as a database using PG library to develop dashboard-demo. I have written server code for a blog app, Mern-auth and for my freelance project YourRadio.

C/Fortran experience

I read C programming during my 6th semester. Thereafter I solved a few dsa problems on it. This course of basic algorithm implementation strengthened my core concepts in C.

Interest in stdlib

Stdlib is a project which is pushing the boundaries of what JavaScript can do and above that it is a community driven initiative. Who won’t be excited to be a part of such projects? Moreover, It's not just about writing code, it's about innovation, it's about finding creative solutions to real world problems. This is exactly what drives me. The opportunity to work under Athen Reigns also fills me with excitement. The way Athen Reigns has taken stdlib from a personal project to a position where it has over 2000 functionalities in less than a decade inspires and motivates me. It will be an honor to contribute to this project under him.

Also I have been looking for an Open source organization whose requirements align with my skill and interest, thanks to GSOC my search came to an end here.

Version control

Yes

Contributions to stdlib

refactor: update /blas/ext/base/dapx/l to follow current project conventions #1954 [ merged ]
refactor: @stdlib/blas/ext/base/ssorthp to follow current project conventions #1770 [ open ] I updated 50 files to match the current project conventions, and its under review now. I will start working on it after submission deadline and will solve more such issues.

Goals

Project goals: Develop a Node.js backend to query a PostgreSQL database, and construct a frontend interface using React.js along with other technologies, incorporating the following features:

Features:

Repository list view: List of all repositories under the stdlib project, with pagination or lazy loading to handle the large number of repositories efficiently.
Filtering and Search: Filtering and search capabilities to quickly find specific repositories by name, description, build status or other metadata.
Visual build Status indicator: Visual indicators to represent the latest build status of each repository at a glance.
Build history and trends: Display historical build data for each repository, allowing developers to view and analyze past build failures and trends over time.
Access resources and build artifacts: Easy access to repository resources and build artifacts for seamless navigation and utilization.

Proposed Features:

Interactive Data Visualization: Incorporate interactive charts or graphs to drill down into specific data points or time periods. Generate reports and analytics on build statuses, failure rates, and other relevant metrics across the stdlib ecosystem. [ as said in issue to extend ]
Alert and Notifications: Configurable notification preferences based on repository, severity. Alert in email or website (toast) about critical build failures or issues.

Technology Stack

Frontend React + vite (javascript)	vite(react) is known for its incredibly fast development server. It only rebuilds the parts of the application that have changed, resulting in faster reload during development time.
react-router-dom	It provides routing capabilities for React applications, allowing to create single-page applications with multiple views and navigation without full page reloads.
Tailwind CSS	Tailwind CSS is a utility-first CSS framework which provides a set of pre-designed utility classes. This helps in building UI faster.
React virtualized	This is used for displaying large lists of data in tables with headers and scrolls efficiently.
axios	It provides a simple and consistent interface for making HTTP requests from Node.js, offering features like automatic JSON data transformation and request/response interceptors.

Backend Express.js	Express.js is a popular, lightweight and flexible web application framework for building server-side applications in Node.js.
pg	node-postgres, or pg, is a nonblocking PostgreSQL client for Node.js. Essentially, node-postgres is a collection of Node.js modules for interfacing with a PostgreSQL database.

Backend
Express.js

Express.js is a popular, lightweight and flexible web application framework for building server-side applications in Node.js.

node-postgres, or pg, is a nonblocking PostgreSQL client for Node.js. Essentially, node-postgres is a collection of Node.js modules for interfacing with a PostgreSQL database.

const query = ` SELECT r.name, t.tag, n_p.version, n_p.node_version, n_p.published_at, n_p.tarball_size, n_p.license, n_r_v_d_c.count AS downloads, w_r.status, w_r.run_number, w_r.run_attempt, r.owner, EXTRACT( EPOCH FROM ( w_j.started_at - w_j.completed_at )) AS duration FROM stdlib_github.repository r FULL JOIN stdlib_github.tag t ON r.id = t.repository_id FULL JOIN stdlib_github.npm_publish n_p ON r.id = n_p.repository_id FULL JOIN stdlib_github.npm_rolling_version_download_count n_r_v_d_c ON r.id = n_r_v_d_c.repository_id FULL JOIN stdlib_github.workflow_run w_r ON r.id = w_r.repository_id FULL JOIN stdlib_github.workflow_job w_j ON r.id = w_j.repository_id ORDER BY w_r.status, r.id LIMIT $1 OFFSET $2 `;

app.get( '/api/v1/repository-data', async ( req, res ) => { const { pagesize, page } = req.query; const offset = ( page - 1 ) * pagesize; const { rows } = await client.query( query, [ pagesize, offset ]); // Send the query result as JSON response res.json( rows ); });

useEffect(()=>{ const fetchData = async () =>{ const response = await axios.get( API_URL, { params: { pageSize, page } }); const newData = response.data; setData( prevData => prevData.concat( newData )); } fetchData(); },[page])

import { AutoSizer, InfiniteLoader, Grid } from 'react-virtualized'; const cellRenderer = ({ columnIndex, key, rowIndex, style }) => { const repository = repositories[ rowIndex ]; let content = ''; switch ( columnIndex ) { case 0: content = repository.name; break; case 1: content = repository.tag; break; // Add cases for other columns default: content = ''; } return ( <div key={ key } style={ style }> { content } </div> ); };

return ( <InfiniteLoader isRowLoaded={() => !isLoading || repositories.length >= PAGE_SIZE * page } loadMoreRows={ loadMoreRows } rowCount={ repositories.length + 1 } > {({ onRowsRendered, registerChild }) => ( <AutoSizer> <Grid ref={ registerChild } onSectionRendered={ onRowsRendered } cellRenderer={ cellRenderer } columnCount={ 11 } columnWidth={ 30 } height={ 300 } rowCount={ repositories.length } rowHeight={ 50 } width={ width } /> </AutoSizer> )} </InfiniteLoader> );

const sortQuery = `SELECT r.name, t.tag, n_p.version, n_p.node_version, n_p.published_at, n_p.tarball_size, n_p.license, n_r_v_d_c.count AS downloads, w_r.status, w_r.run_number, w_r.run_attempt, r.owner, EXTRACT(EPOCH FROM (w_j.started_at - w_j.completed_at)) AS duration FROM stdlib_github.repository r FULL JOIN stdlib_github.tag t ON r.id = t.repository_id FULL JOIN stdlib_github.npm_publish n_p ON r.id = n_p.repository_id FULL JOIN stdlib_github.npm_rolling_version_download_count n_r_v_d_c ON r.id = n_r_v_d_c.repository_id FULL JOIN stdlib_github.workflow_run w_r ON r.id = w_r.repository_id FULL JOIN stdlib_github.workflow_job w_j ON r.id = w_r.repository_id ORDER BY -- TEXT datatype CASE $1 WHEN 'name' THEN r.name WHEN 'tag' THEN t.tag WHEN 'version' THEN n_p.version WHEN 'node_version' THEN n_p.node_version WHEN 'license' THEN n_p.license WHEN 'status' THEN w_r.status WHEN 'owner' THEN r.owner END $2, -- timestamp datatype CASE $1 WHEN 'published_at' THEN n_p.published_at END $2, -- double precision datatype CASE $1 WHEN 'downloads' THEN n_r_v_d_c.count END $2, -- integer datatype CASE $1 WHEN 'run_number' THEN w_r.run_number WHEN 'run_attempt' THEN w_r.run_attempt END $2, -- bigint datatype CASE $1 WHEN 'tarball_size' THEN n_p.tarball_size END $2, -- numeric datatype CASE $1 WHEN 'duration' THEN EXTRACT(EPOCH FROM (w_j.started_at - w_j.completed_at)) END $2, w_r.status -- Default sorting column `

const searchQuery = `SELECT r.name, t.tag, n_p.version, n_p.node_version, n_p.published_at, n_p.tarball_size, n_p.license, n_r_v_d_c.count AS downloads, w_r.status, w_r.run_number, w_r.run_attempt, r.owner, EXTRACT(EPOCH FROM ( w_j.started_at - w_j.completed_at )) AS duration FROM stdlib_github.repository r FULL JOIN stdlib_github.tag t ON r.id = t.repository_id FULL JOIN stdlib_github.npm_publish n_p ON r.id = n_p.repository_id FULL JOIN stdlib_github.npm_rolling_version_download_count n_r_v_d_c ON r.id = n_r_v_d_c.repository_id FULL JOIN stdlib_github.workflow_run w_r ON r.id = w_r.repository_id FULL JOIN stdlib_github.workflow_job w_j ON r.id = w_j.repository_id WHERE r.name ILIKE '%${search_term}%' OR t.tag ILIKE '%${search_term}%' OR w_r.status ILIKE '%${search_term}%' `

@naveen1m Thanks for sharing a draft of your proposal. A few comments:

For sorting, you provide a Postgres query for returning sorted results. I am somewhat leery about this, as it seems to me that sorting could be done server-side. Our hosted server is somewhat compute-constrained, so, if possible, I'd prefer if we can offload as much as possible to the client.

While 4K rows is expensive from a UI perspective (i.e., many DOM nodes), 4K rows of JSON is not. So, I think it should be fine to simply ship a single JSON blob to the client, but only display a subset of rows at a time to minimize DOM nodes.

My personal preference is using fastify for the backend server, rather than expressjs, as that is what we are already familiar with.

You mention the metrics page in weeks 7-9. Would you be able to comment a bit more here? What are you thinking you'd show in this page? How would navigation from the table to drill down work?

How would search work? In particular, what types of queries what search support? Is it just filtering the list of packages? Or would we be able to support more complex queries (e.g., "packages whose builds have failed over the last week")?

@kgryte thanks for review! To address above mentioned issue I think the the possible soultion could be :

sorting:If the current size of json file will be not large, saving it on client side can boost respnse time. We can perform sorting operation like done by npm-statusboard keeping data in json format. Then sorting can be performed in nlogn time but will have to compare it.
TansStack Table use dynamic rendering means it only renders the data that is currently in view. It increses the performance.
yes, We canuse fastify it will be better then, I can learn and implement it in very less time since I already have knoledge of node.js and experience working with server-side code . I will completely learn it before coding period starts.
We can navigate to analytics page on clicking project name, I am thinking to display build status, downloads, PR merged in graphical view over a period of time.
I am thinking to use indexing method and other strategy to perform searching, The question about complex searching (e.g., "packages whose builds have failed over the last week") can be performed but will have to see if it is fast or not.

stdlib-js / google-summer-of-code