feat(seo): add robots.txt by prudentbird · Pull Request #118 · Ajared/retailintelligence

prudentbird · 2026-06-05T09:47:27Z

Summary

Adds a generated robots.txt (src/app/robots.ts) so crawlers get explicit guidance.

Allows all public pages.
Disallows /api/, /admin, and /user (gated/authenticated areas) to save crawl budget.
Points to https://retailytics.ajared.ng/sitemap.xml (added in the next PR, feat(seo): add sitemap.xml #108).

Auth pages (/login, /register, /forgot-password) are intentionally left crawlable so their noindex,follow meta is honored.

Done when (issue #107)

robots.txt is served, allows public pages, blocks API and gated/authenticated routes, and points to the sitemap.

Validation

npm run typecheck ✅
npm run build ✅ — /robots.txt route generated.

Notes

Stacked on #112 (targets feat/seo-112-footer-legal-pages).

Closes #107

Every public page now emits Open Graph and Twitter card tags (title, description, url, image) so links render branded previews on LinkedIn, Slack, WhatsApp and X. - Add a branded 1200x630 social card generated with next/og, served site-wide via app/opengraph-image and reused for app/twitter-image. - Add buildMetadata() helper that produces matching title/description/ canonical plus OG and Twitter card objects for each page. - Route home and auth pages through the helper; add OG/Twitter defaults to the root layout for inheritance. Refs #106

Footer links now resolve to real pages or anchors instead of placeholders that fell back to the homepage, and the previously missing contact and legal pages exist. - Footer: grouped Product/Company/Legal navigation with a real contact email, on-page anchors (#how-it-works, #faq), the parent company site, and links to the new pages. - Add /contact (indexable) with a real, monitored address. - Add /privacy and /terms minimal legal pages (noindex,follow). - Add SitePage shell for consistent standalone-page layout. Refs #112

Serve a robots.txt that allows public pages, blocks the API and gated admin/user areas, and points crawlers to the sitemap. Refs #107

claude · 2026-06-05T09:47:43Z

Claude finished @prudentbird's task —— View job

PR Review: feat(seo): add robots.txt

Reviewing this PR for code quality, best practices, potential bugs, performance considerations, security concerns, and test coverage.

Todo List:

Read repository guidelines (CLAUDE.md) - No CLAUDE.md found
Analyze the robots.ts implementation
Check code quality and Next.js best practices
Review security considerations
Assess performance implications
Evaluate test coverage needs
Provide comprehensive feedback

🟢 Overall Assessment

This is a well-implemented robots.txt solution that follows Next.js App Router conventions correctly. The code is clean, follows TypeScript best practices, and addresses the SEO requirements effectively.

✅ Strengths

Code Quality & Best Practices:

Proper use of Next.js 16+ MetadataRoute.Robots type from next/types
Clean, readable TypeScript implementation
Follows Next.js App Router file convention (robots.ts in app directory)
Good separation of concerns by importing site config from ~/lib/site
Consistent with existing patterns (similar to opengraph-image.tsx, twitter-image.tsx)

SEO Implementation:

✅ Allows all public pages with allow: '/'
✅ Properly blocks sensitive areas: /api/, /admin, /user
✅ Strategic decision to leave auth pages crawlable for noindex,follow meta handling
✅ Includes sitemap reference and host declaration
✅ Dynamic URL generation using site configuration

🟡 Minor Considerations

Security & Best Practices:

The implementation is secure - no sensitive information exposed
Dynamic URL generation from siteConfig.url is appropriate
Disallow rules properly protect API routes and admin areas

Performance:

Minimal performance impact - static route generation at build time
Could benefit from additional file-level cache headers, but Next.js handles this automatically for metadata routes

Code Robustness:

frontend/src/lib/site.ts:6 - The URL trimming logic (replace(/\/$/, '')) is good for consistency
Consider if trailing slashes in disallow rules might be needed: /api/ vs /api (current implementation is correct for blocking subdirectories)

🔍 Testing Considerations

Current State: No dedicated tests for robots.txt generation
Recommendation: While not critical for this simple implementation, consider:

// Potential test structure (if testing is added later)
describe('robots.txt', () => {
  it('should block sensitive routes', () => {
    const robots = robotsFunction();
    expect(robots.rules.disallow).toContain('/api/');
  });
});

However, since this is a simple metadata route with no complex logic, the validation mentioned in the PR description (npm run build ✅) is sufficient.

🎯 Recommendations

Documentation: The PR description is excellent and covers the implementation well
Future Enhancement: When sitemap.xml is implemented (feat(seo): add sitemap.xml #108), verify the robots.txt correctly points to it
Monitoring: Consider adding robots.txt to your SEO monitoring once deployed

📝 Code-Specific Feedback

frontend/src/app/robots.ts:6-10 - The rules object structure is optimal for this use case. The single userAgent rule with array disallow is more maintainable than multiple rules.

frontend/src/app/robots.ts:11-12 - Good use of the site configuration for dynamic URLs. This ensures consistency across environments.

✨ Final Verdict

Approve - This PR is ready for merge. The implementation is solid, follows Next.js best practices, and effectively addresses the SEO requirements. The strategic decisions around crawlability are well-reasoned and documented.

The code quality is high, security considerations are properly addressed, and the performance impact is minimal. No blocking issues identified.

prudentbird added 3 commits June 5, 2026 10:41

feat(seo): add robots.txt

9061d94

Serve a robots.txt that allows public pages, blocks the API and gated admin/user areas, and points crawlers to the sitemap. Refs #107

prudentbird changed the base branch from feat/seo-112-footer-legal-pages to dev June 5, 2026 19:48

prudentbird merged commit 74ce05e into dev Jun 5, 2026
1 check passed

prudentbird deleted the feat/seo-107-robots-txt branch June 5, 2026 19:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(seo): add robots.txt#118

feat(seo): add robots.txt#118
prudentbird merged 3 commits into
devfrom
feat/seo-107-robots-txt

prudentbird commented Jun 5, 2026

Uh oh!

claude Bot commented Jun 5, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

prudentbird commented Jun 5, 2026

Summary

Done when (issue #107)

Validation

Notes

Uh oh!

claude Bot commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review: feat(seo): add robots.txt

Todo List:

🟢 Overall Assessment

✅ Strengths

🟡 Minor Considerations

🔍 Testing Considerations

🎯 Recommendations

📝 Code-Specific Feedback

✨ Final Verdict

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

claude Bot commented Jun 5, 2026 •

edited

Loading