AmauryVanEspen commented 5 hours ago

Hi @KostasSliazas is the html rendered file compliant with the XML format ? Thank you Amaury

KostasSliazas commented 3 hours ago

hello,

It just saves to html file and it's not generic format probably. They are different formats comaring xml to html. Have a great days

When comparing XML specification and HTML5, it’s important to note that they serve different purposes and are not directly comparable as "better" or "worse." Here's an outline to clarify their roles and differences:

1. XML Specification

Purpose:
- XML (eXtensible Markup Language) is a markup language designed to store and transport data in a platform-independent, human-readable format. It is highly generic and doesn’t define any semantics, leaving that to the application.
Features:
- Customizable tags and structures.
- Strictly follows rules for well-formedness (e.g., every tag must be properly nested and closed).
- Can be validated against a schema (DTD or XSD) to ensure correctness.
- Used in scenarios where structured data exchange is required (e.g., web services, config files, etc.).
Strengths:
- Universal data interchange format.
- Clear separation of content and presentation.
- Extensible, allowing users to define their own structure.
Weaknesses:
- Verbose syntax, leading to large file sizes.
- Requires more processing power and memory for parsing compared to lightweight alternatives (like JSON).

2. HTML5

Purpose:
- HTML5 is a markup language used for structuring and presenting content on the web. It includes semantics for elements, multimedia integration, APIs, and interactivity features.
Features:
- Built-in semantics: Tags like
  ,
  , and
  for better content structuring.
- Supports multimedia:
- Interactive elements: Built-in support for APIs like
- Backward compatible with older HTML versions.
Strengths:
- Optimized for web use, supporting modern browsers and devices.
- Simplifies development with predefined elements and APIs.
- Rich ecosystem for interactivity and multimedia.
Weaknesses:
- Limited to web content; not designed for generic data interchange.
- Less strict than XML, which can lead to inconsistent document structures.

Key Comparisons AspectXML SpecificationHTML5 Primary Use Data storage and interchange Web content presentation Structure Fully customizable Predefined structure Strictness Very strict (well-formed rules) Lenient (error-tolerant) Extensibility Highly extensible Fixed set of tags and APIs Context Platform-agnostic Web-specific Complexity Verbose, complex Simplified for developers

When to Use Each

-

XML is ideal for:

Data exchange between systems (e.g., SOAP, configuration files).
- Scenarios requiring strict data validation.
- Custom markup requirements.
HTML5 is ideal for:
Building web pages and apps.
- Embedding multimedia and creating interactive UIs.
- Using web-specific APIs and semantics.

KostasSliazas commented 3 hours ago

I have uploaded once pdf format to one webpage(don't remember name(some cv maker online)) and it could extract values(maybe not all) from .pdf when html was saved in *.pdf format in browser. (Probably by titles (semantic)). For example.

<section>
<h2 class="left">Address(es)</h2>
 <h3 class="righ">Address Address, Address Address (Address)</h3>
</section>

So maybe they took first (h2) and added values from (h3). I'm not sure how pdf is structured. I just know it can be saved (printed) to pdf file. And php probably have tools for making pdf and work with these files. So server (probably php) could extract values somehow.

AmauryVanEspen commented 3 hours ago

Nice, do you believe that we can build a XML format from the values ?

Le mer. 20 nov. 2024 à 12:37, Kostas @.***> a écrit :

I have uploaded once pdf format to one webpage(don't remember name(some cv maker online)) and it could extract values(maybe not all) from .pdf when html was saved in *.pdf format in browser. (Probably by titles (semantic)). For example.

Address(es)

Address Address, Address Address (Address)

— Reply to this email directly, view it on GitHub https://github.com/KostasSliazas/Europass-Maker-Offline/issues/1#issuecomment-2488348705, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABCHPAPUNWT42RODXPYRNWD2BRYA3AVCNFSM6AAAAABSEEWTZ2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDIOBYGM2DQNZQGU . You are receiving this because you authored the thread.Message ID: @.***>

KostasSliazas commented 2 hours ago

I have tested html source with online tools html > xml and it coverts probably best from tables. So maybe changing html structure to tables is most generic way. They read best from tables as I understand for converting. So it could be done by modifiyng structure of source code. So when saving changing something with js or by coverting to table source and adjusting js to it. Or changing js to change structure on saving file. In short it need to change html structure or maybe loop with js from elements adding some dataset attributes and then saving in loop all values. Maybe the best way to add attributes in source like 'Name', 'Address' and etc. and then 'injecting' in xml source and saving. So adding some attributes maybe would be the most accurate way. So by according by AI. It just need to be changed structure for example: `To convert HTML to XML, you need to ensure that the content is well-formed according to XML rules. Here's what you should follow:

Use Proper Tag Nesting: Tags must be properly opened and closed.
No Unquoted Attribute Values: All attribute values must be enclosed in double or single quotes.
No Special Characters: Special characters (e.g., &, <, >) must be escaped.
No Implicit Tag Closures: All tags must explicitly close (e.g., <br /> instead of <br>).

Here’s how your provided HTML can be transformed into valid XML:

Email Address(es)

e-pastas@gmail.com

` It just need to escape characters. What could be done with JS as well. So in short it could be done with JS. It just need structure how it must look and in loop with JS it could be done by making proper xml tags.

KostasSliazas commented 1 hour ago

cv structure.txt Asked AI for code and it made this: If you paste in console you can get xml right to it.

function generateXML() {
  // Function to create XML content from DOM element recursively
  function elementToXML(element) {
    const tagName = element.tagName.toLowerCase();
    let xmlContent = "";

    // Process only if the element has children or text content
    if (element.children.length === 0 && element.textContent.trim()) {
      xmlContent = `<${tagName}>${element.textContent.trim()}</${tagName}>`;
    } else if (element.children.length > 0) {
      xmlContent = `<${tagName}>`;
      Array.from(element.children).forEach(child => {
        xmlContent += elementToXML(child); // Recursively process child elements
      });
      xmlContent += `</${tagName}>`;
    }

    return xmlContent;
  }

  // Select the main container to start processing
  const mainContainer = document.querySelector("#main");
  if (!mainContainer) {
    console.error("Main container not found.");
    return;
  }

  // Generate XML from the main container
  let xml = `<?xml version="1.0" encoding="UTF-8"?>\n<document>`;
  Array.from(mainContainer.children).forEach(child => {
    xml += elementToXML(child);
  });
  xml += `</document>`;

  // Create a downloadable XML file
  const blob = new Blob([xml], { type: "application/xml" });
  const link = document.createElement("a");
  link.href = URL.createObjectURL(blob);
  link.download = "document.xml";
  document.body.appendChild(link);
  link.click();
  document.body.removeChild(link);

 // console.log("XML Generated:\n", xml); // Log XML for debugging
}

// Trigger the function (you can bind this to a button click)
generateXML();

And structure would look like this for example: `

Europass CV

Profession Field

IT, Telecommunications, Programming, Other

Personal Information

Name / Surname

Europass Maker

Phone Number(s)

0000000000000000000

Email Address(es)

e-pastas@gmail.com

Address(es)

Address Address, Address Address (Address)

Nationality

Date of Birth

Year Month Day

Gender

Work Experience

Dates

Date - Date

Profession or Position

CV Template Creation

Main Activities and Responsibilities

CV Template Creation

Employer Name and Address

CV Maker, Europe, Lithuania

Employer's Field of Activity or Industry Sector

Education

Dates

Date - Date

Qualification

Key Subjects / Professional Skills

General Subjects - Computer Science, Foreign Language (English) Professional Subjects - Computer Hardware Setup, Maintenance, Servicing

Name and Type of Educational Provider

School Address Address, Address (Country)

Qualification Level According to National or International Classification

ISCED7

Technical Skills and Competences

CV Template Creation

Computer Skills and Competences

Operating Systems: Unix/Linux, Windows, Mac OS, Android.

Social Skills and Competences

Good communication skills; Ability to work in a team and independently

Personal Skills and Competences

Native Language(s)

Lithuanian

Other Language(s)

Language, language

Understanding

Speaking

Writing

Listening

Reading

Spoken Interaction

Spoken Production

Language

Double Click

Other Language

Double Click

`

KostasSliazas commented 1 hour ago

Yes, if you are asking about tag names it could be done by: email_address>example@example.com</email_address it's said and it's valid . According AI > tags cannot have spaces, but you can use hyphens or underscores to separate words.

AmauryVanEspen commented 35 minutes ago

do you believe it could be compliant with the JSON Resume Schema ? https://jsonresume.org/schema

KostasSliazas commented 8 minutes ago

we just need to get values by id: to make it JSON?: for example: `function extractDataToJson() { // Extracting data from the HTML page const cvData = { basics: { name: document.querySelector("#vardas") ? document.querySelector("#vardas").textContent.trim() : "John Doe", label: document.querySelector("#pozicija") ? document.querySelector("#pozicija").textContent.trim() : "Programmer", image: document.querySelector("img#profile-picture") ? document.querySelector("img#profile-picture").src : "", email: document.querySelector("#email a") ? document.querySelector("#email a").href.replace('mailto:', '').trim() : "", phone: document.querySelector("#number a") ? document.querySelector("#number a").href.replace('tel:', '').trim() : "", url: document.querySelector("#website a") ? document.querySelector("#website a").href : "", summary: document.querySelector("#summary") ? document.querySelector("#summary").textContent.trim() : "No summary available", location: { address: document.querySelector("#address") ? document.querySelector("#address").textContent.trim() : "", postalCode: document.querySelector("#postalCode") ? document.querySelector("#postalCode").textContent.trim() : "", city: document.querySelector("#city") ? document.querySelector("#city").textContent.trim() : "", countryCode: document.querySelector("#countryCode") ? document.querySelector("#countryCode").textContent.trim() : "", region: document.querySelector("#region") ? document.querySelector("#region").textContent.trim() : "" }, profiles: [{ network: "Twitter", username: "europassmaker", // Replace with actual username if available in HTML url: "https://twitter.com/europassmaker" // Replace with actual URL if available }] }, work: [{ name: document.querySelector("#companyName") ? document.querySelector("#companyName").textContent.trim() : "Company Name", position: document.querySelector("#jobTitle") ? document.querySelector("#jobTitle").textContent.trim() : "Position", url: document.querySelector("#companyWebsite a") ? document.querySelector("#companyWebsite a").href : "", startDate: document.querySelector("#workStartDate") ? document.querySelector("#workStartDate").textContent.trim() : "", endDate: document.querySelector("#workEndDate") ? document.querySelector("#workEndDate").textContent.trim() : "", summary: document.querySelector("#workSummary") ? document.querySelector("#workSummary").textContent.trim() : "", highlights: [ document.querySelector("#workHighlights") ? document.querySelector("#workHighlights").textContent.trim() : "No highlights available" ] }], volunteer: [{ organization: "Volunteer Organization", position: "Volunteer Developer", url: "https://nonprofit.com/", startDate: "2019-01-01", endDate: "2020-01-01", summary: "Contributed to open-source projects.", highlights: [ "Developed open-source software" ] }], education: [{ institution: "University of Lithuania", url: "https://university.com/", area: "Computer Science", studyType: "Bachelor", startDate: "2015-09-01", endDate: "2019-06-01", score: "4.0", courses: [ "CS101 - Introduction to Programming", "CS102 - Data Structures" ] }], awards: [{ title: "Best Developer", date: "2021-06-01", awarder: "Company XYZ", summary: "Awarded for excellence in development." }], certificates: [{ name: "Certified Web Developer", date: "2022-11-07", issuer: "Certification Body", url: "https://certificate.com" }], publications: [{ name: "Creating Perfect CV Templates", publisher: "Tech Journal", releaseDate: "2022-05-01", url: "https://publication.com", summary: "A detailed guide on designing CV templates." }], skills: [{ name: "Web Development", level: "Advanced", keywords: [ "HTML", "CSS", "JavaScript", "PHP" ] }], languages: [{ language: "English", fluency: "Native speaker" }], interests: [{ name: "Technology", keywords: [ "AI", "Machine Learning" ] }], references: [{ name: "Jane Doe", reference: "John is a skilled developer who contributed significantly to our projects." }], projects: [{ name: "CV Maker Project", startDate: "2019-01-01", endDate: "2021-01-01", description: "A project to create customizable CV templates.", highlights: [ "Developed user-friendly templates", "Integrated with Europass standards" ], url: "https://cvproject.com/" }] };

// Return the populated JSON return cvData; }

// Example usage: const jsonData = extractDataToJson(); console.log(JSON.stringify(jsonData, null, 2)); `

KostasSliazas / Europass-Maker-Offline

is this compliant with XML specification ? #1

When comparing XML specification and HTML5, it’s important to note that they serve different purposes and are not directly comparable as "better" or "worse." Here's an outline to clarify their roles and differences:

Address(es)

Address Address, Address Address (Address)

Email Address(es)

e-pastas@gmail.com

Europass CV

Profession Field

IT, Telecommunications, Programming, Other

Personal Information

Name / Surname

Europass Maker

Phone Number(s)

0000000000000000000

Email Address(es)

e-pastas@gmail.com

Address(es)

Address Address, Address Address (Address)

Nationality

Nationality

Date of Birth

Year Month Day

Gender

Gender

Work Experience

Dates

Date - Date

Profession or Position

CV Template Creation

Main Activities and Responsibilities

CV Template Creation

Employer Name and Address

CV Maker, Europe, Lithuania

Employer's Field of Activity or Industry Sector

Employer's Field of Activity or Industry Sector

Education

Dates

Date - Date

Qualification

Qualification

Key Subjects / Professional Skills

General Subjects - Computer Science, Foreign Language (English) Professional Subjects - Computer Hardware Setup, Maintenance, Servicing

Name and Type of Educational Provider

School Address Address, Address (Country)

Qualification Level According to National or International Classification

ISCED7

Technical Skills and Competences

CV Template Creation

Computer Skills and Competences

Operating Systems: Unix/Linux, Windows, Mac OS, Android.

Social Skills and Competences

Good communication skills; Ability to work in a team and independently

Personal Skills and Competences

Native Language(s)

Lithuanian

Other Language(s)

Language, language

Understanding

Speaking

Writing

Language

Other Language