The Synkro Pipeline
Synkro uses a multi-stage pipeline to generate high-quality training data:Stages
1. Document Ingestion
Your policy is parsed into raw text. Supported formats:- Plain text
- PDF files
- DOCX files
- Markdown
- URLs
2. Logic Extraction
The policy is analyzed and structured rules are extracted into a Logic Map:- Each rule has an ID (R001, R002, etc.)
- Conditions and actions identified
- Dependencies between rules mapped
- Categories assigned
3. Scenario Generation
Diverse test scenarios are created based on the Logic Map:| Type | Description | % |
|---|---|---|
| Positive | User meets all criteria | 35% |
| Negative | User violates one criterion | 30% |
| Edge Case | Boundary conditions | 25% |
| Irrelevant | Outside policy scope | 10% |
4. Response Generation
For each scenario, a policy-compliant assistant response is generated that:- Correctly applies the relevant rules
- Cites specific policy sections
- Handles edge cases appropriately
5. Grading
Each response is evaluated for:- Policy compliance
- Accurate citations
- Sound reasoning
- Appropriate tone