Hong Kong MPF Statement Extraction to Excel

Convert Hong Kong MPF statements, pay-records, and ABS PDFs to Excel for payroll reconciliation, eMPF cleanup, and audit support.

Published
Updated
Reading Time
13 min
Topics:
Financial DocumentsPayrollHong KongMPFExcelaudit support

Hong Kong MPF statement extraction to Excel means turning remittance statements, monthly pay-records, and annual benefit statements into structured data that can be reconciled against payroll and retained for audit support. A useful output is not just a PDF converted into text. It keeps one row per employee and contribution period, with relevant income, employer mandatory contribution, employee mandatory contribution, voluntary contributions, payment date, trustee or eMPF source, source file, and page reference.

That structure matters because MPF records sit between payroll, trustee administration, and statutory record keeping. The payroll register may show the deduction taken from an employee and the employer's MPF cost for the month. The trustee or eMPF record shows what was actually remitted. The extracted spreadsheet is the bridge that lets a bookkeeper, payroll bureau, or CPA compare those two sets of numbers without reading every PDF line by line.

The document set is specific. A remittance statement is the employer-facing contribution record. A monthly pay-record is the employee-facing confirmation issued after contributions are remitted. An annual benefit statement is a member statement showing account balances and annual movement, not a month-end payroll remittance file. Treating all three as a generic "MPF statement" is how spreadsheets become hard to reconcile.

The factual baseline should come from the regulator, not from a converter tool. The MPFA says a remittance statement shows each employee's relevant income and the contributions made by both the employer and employees, and that employers should give each employee a monthly pay-record within seven working days after remitting contributions, according to its MPFA guidance on remittance statements and monthly pay-records.

For extraction, that means the first question is not "can this PDF become Excel?" It is "does the Excel file preserve the MPF fields needed to prove what was calculated, paid, and reviewed?" The answer should be visible in the column design before anyone starts processing the archive.

Treat remittance statements, pay-records, and ABS files as separate data sources

An MPF remittance statement is the main employer reconciliation document. It should give the contribution period, the employee's relevant income, the employer mandatory contribution, the employee mandatory contribution, any voluntary contribution amounts, and the total remitted to the trustee or eMPF. When the extraction target is an employer file, this is usually the document that needs one employee-period row in Excel.

A monthly pay-record has a different audience. It is issued to the employee after the employer remits contributions, so MPF monthly pay-record extraction is useful when an employer needs to answer employee queries, confirm historical contribution treatment, or compare employee-facing records with the employer remittance file. The same contribution fields matter, but the source is not the same control document as the employer's remittance statement.

An annual benefit statement, or ABS, should not be forced into the monthly remittance schema. MPF annual benefit statement extraction is about balances and account movement: opening balance, closing balance, employer contributions received during the year, investment gains or losses, and fees where the statement shows them. It can help with employee support and benefit-record review, but it is not the best source for a month-end payroll tie-out.

For that reason, the spreadsheet needs a document type column. A clean MPF contribution record PDF to spreadsheet workflow might use one tab for remittance statements and pay-records, then a separate tab for ABS records. If everything must sit in one workbook, the row type should make the source obvious. Otherwise, a balance statement can be mistaken for a contribution record, or an employee pay-record can be treated as if it were the employer's submitted remittance file.

The practical rule is simple enough for the user but strict enough for audit work: extract the fields each document actually supports, and do not invent missing data to make the rows look uniform.

Build an Excel schema that ties MPF records to the payroll register

For remittance statements and monthly pay-records, the most useful structure is one row per employee per contribution period. That row should carry enough detail to compare the trustee or eMPF record with payroll, without making the reviewer reopen the PDF for every check.

The core columns are:

  • Employee name
  • Employee identifier, where available
  • Masked HKID or internal payroll number, if used in the source file
  • Contribution period
  • Relevant income
  • Employer mandatory contribution
  • Employee mandatory contribution
  • Employer voluntary contribution
  • Employee voluntary contribution
  • Total contribution
  • Payment or remittance date
  • Trustee or eMPF source
  • MPF scheme name
  • Source file
  • Source page
  • Extraction notes

The amount columns should be numeric Excel values, not pasted text. Payroll teams need to sum, filter, run pivot tables, and add variance columns. If the extracted employer mandatory contribution is stored as text, the file may look correct while failing the first reconciliation formula.

The basic tie-out is by contribution period and employee. Payroll's MPF deduction line should agree to the extracted employee mandatory contribution. Payroll's employer MPF cost should agree to the extracted employer mandatory contribution. Voluntary contributions should be separated before comparing statutory amounts, because mixing voluntary and mandatory figures makes a correct remittance look wrong.

This is where source references become part of the control, not just a convenience. A reviewer should be able to trace a variance row back to the source file and page reference. Invoice Data Extraction outputs structured Excel, CSV, or JSON and includes source file and page references for extracted rows, which is useful when the spreadsheet becomes the working file for the payroll reconciliation process.

If ABS files are included in the same project, put them in a separate tab or mark them with a different row type. Balance data belongs in the workbook, but it should not be summed into monthly remittance totals.

Separate eMPF exports from older trustee PDF formats

The eMPF Platform changes the forward-looking workflow, but it does not make every historical MPF file look the same. Current employer administration is moving toward more standardized contribution submission, Excel-template upload, downloadable contribution records, and standardized remittance statement workflows. Older records still reflect the trustee that produced them.

That distinction matters when converting an eMPF statement PDF to Excel alongside older BCT, HSBC, Hang Seng, Sun Life, AIA, Manulife, Principal, or other trustee files. Historical PDFs may use different column labels, employee identifiers, contribution-period wording, scheme names, payment references, and total sections. Some files may be native PDFs. Others may be scanned copies from an archive or email attachment.

The extraction schema should stay consistent even when the source layout changes. For example, an HSBC employer statement and an eMPF contribution record can both map into the same employee-period columns: relevant income, employer mandatory contribution, employee mandatory contribution, voluntary contributions, payment date, trustee or eMPF source, source file, and source page. The source column tells the reviewer where the record came from; the field structure lets the reviewer compare periods across trustee changes.

This is especially important for employers that switched schemes before or during the eMPF transition. A payroll bureau might receive three years of BCT statements, two years of HSBC statements, then current eMPF exports. The goal is not to preserve each trustee's original layout in Excel. The goal is to normalize the contribution facts while keeping enough source detail to trace every row.

Mixed language and scan quality deserve a sample test before bulk processing. Hong Kong finance records may combine English labels with Traditional Chinese labels, and older PDFs may not extract cleanly on the first pass. The same practical issue appears in Hong Kong bilingual invoice extraction: the output works best when the field list is explicit and the sample includes the formats that will appear in the full batch.

Invoice Data Extraction supports PDFs and image files such as JPG and PNG, including scanned documents and major scripts such as East Asian scripts. That does not remove the need for review; it makes the review more targeted because the first sample can show whether each trustee layout maps into the intended columns.

Reconciliation checks that MPF extraction should make easier

The first check is the monthly tie-out. For each contribution period, the employee MPF deductions in payroll should agree to the employee mandatory contributions extracted from the remittance statement. The employer MPF expense in payroll should agree to the employer mandatory contributions. Differences should be visible by employee, not only as a net total.

Relevant income is the next control point. MPFA's current monthly relevant income levels for monthly paid employees are HKD 7,100 and HKD 30,000, with employer and employee mandatory contributions generally calculated at 5 percent of relevant income within the applicable limits. For live compliance decisions, verify the thresholds against MPFA, because contribution levels can be reviewed over time. For reconciliation, the extracted file should make the income base visible enough to explain why an employee contribution is zero, capped, or lower than expected.

Zero-income employees are a common source of false differences. If an employee has no relevant income for a contribution period, the employer still needs the person represented in the remittance statement, with zero relevant income or the treatment instructed by the trustee. If the extraction skips zero lines because they look empty, the headcount comparison between payroll and the remittance statement will fail.

New joiners need a separate check. Employee contributions do not always start in the first wage period because of the contribution holiday, while employer contributions begin from the start of employment. A spreadsheet that carries employment start date, contribution period, and extraction notes can help explain why the employer and employee columns do not mirror each other for a new employee.

Terminated employees create the opposite problem. The final contribution period, cessation date, payment date, and trustee processing date may not line up neatly with the payroll month. A good MPF extraction file keeps those fields visible so the variance can be classified as a timing issue, a missing final contribution, or a source-document mismatch.

Voluntary contributions should never be blended into mandatory contribution columns. Keep employer voluntary and employee voluntary amounts separate, then compare statutory payroll amounts against mandatory fields only. That one design choice prevents many avoidable reconciliation disputes.

Design the file for MPFA retention and Hong Kong audit support

MPF audit data extraction in Hong Kong is not just about producing a tidy workbook. The file has to preserve employee-level detail, show how totals tie back to payroll, and point back to the original evidence. A CPA reviewing payroll support should not have to choose between trusting a spreadsheet and reopening hundreds of trustee PDFs.

Employers should design the archive around the retention requirement for remittance statement information. MPF employer records include relevant income, employer and employee mandatory contributions, and employer and employee voluntary contributions where applicable, and the retention period for remittance statement information is at least seven years from the date the statement is issued. The extracted file should therefore keep the same contribution facts in a format that can be searched, filtered, and traced.

A practical folder structure helps. Keep the source PDFs or scans alongside the extracted workbook. Name folders by employer entity, contribution year, contribution period, and trustee or eMPF source. If a company changed trustees, keep the source naming explicit rather than merging everything into a single undifferentiated "MPF" folder.

The workbook itself should make the audit trail visible. Useful columns include source file, source page, trustee or eMPF source, scheme name, payment date, contribution period, employee identifier, relevant income, mandatory contribution amounts, voluntary contribution amounts, total contribution, and extraction notes. If a reviewer questions a variance, those fields show where to look first.

Hong Kong statutory audit documentation is broader than MPF records. It may include payroll reports, employment records, invoices, bank payments, board approvals, and tax support depending on the engagement. MPF extraction belongs in that wider file as payroll supporting evidence, and readers preparing a broader audit pack can use the separate guide to Hong Kong audit documentation requirements for context beyond MPF.

The defensible endpoint is not a spreadsheet that replaces the source documents. It is a spreadsheet that makes the source documents usable.

Run a representative MPF extraction before processing the whole archive

Start with a sample that reflects the real archive. For an MPF remittance statement PDF to Excel project, that usually means one current eMPF record, one or two historical trustee statements, a monthly pay-record, and an ABS if annual benefit statements are in scope. If the files include scans or older trustee layouts, include those in the first test rather than saving them for the full batch.

The prompt should describe the MPF documents and the output structure, not just ask for extraction. Name the document types. Ask for one row per employee per contribution period for remittance statements and pay-records. Specify the columns: employee name, employee identifier, contribution period, relevant income, employer mandatory contribution, employee mandatory contribution, employer voluntary contribution, employee voluntary contribution, total contribution, payment date, trustee or eMPF source, source file, source page, and notes. Tell the extraction to separate mandatory and voluntary contributions and to flag uncertain fields rather than silently guessing.

Invoice Data Extraction is built for this kind of financial document extraction to Excel: upload PDFs or image files, describe the required fields in a natural-language prompt, and download Excel, CSV, or JSON. For recurring MPF work, the prompt can be saved and reused so each monthly or client batch follows the same structure.

Review the sample before scaling up. Compare extracted totals to the source totals, then spot-check employees with zero relevant income, new joiners, terminated employees, and voluntary contributions. If the archive spans several trustees, check at least one row from each trustee layout. If the sample output misses a field or mixes mandatory and voluntary amounts, adjust the prompt and rerun the sample.

Excel is usually the working format for reconciliation because payroll teams can filter, sum, and add variance columns. CSV is useful when the file needs to be imported into another system or handled at larger scale. JSON fits API or automation workflows where the MPF data becomes one step in a wider payroll or compliance process.

For teams extracting MPF records as part of a larger payroll project, the broader method is covered in the guide to extract payroll data from PDF to Excel. For the MPF file itself, the output should end as an employee-level contribution workbook that ties to payroll and keeps every row traceable to its source file and page.

Extract invoice data to Excel with natural language prompts

Upload your invoices, describe what you need in plain language, and download clean, structured spreadsheets. No templates, no complex configuration.

Exceptional accuracy on financial documents
1–8 seconds per page with parallel processing
50 free pages every month — no subscription
Any document layout, language, or scan quality
Native Excel types — numbers, dates, currencies
Files encrypted and auto-deleted within 24 hours
Continue Reading